Query         gi|254780479|ref|YP_003064892.1| A/G-specific adenine glycosylase [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 356
No_of_seqs    257 out of 2961
Neff          6.9 
Searched_HMMs 39220
Date          Sun May 29 17:49:10 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i 254780479.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PRK10880 adenine DNA glycosyla 100.0       0       0  860.8  28.1  341    5-355     3-349 (350)
  2 TIGR01084 mutY A/G-specific ad 100.0       0       0  805.4  15.1  275    7-281     1-284 (297)
  3 COG1194 MutY A/G-specific DNA  100.0       0       0  764.3  23.3  330    5-350     8-340 (342)
  4 PRK13910 DNA glycosylase MutY; 100.0       0       0  684.3  20.9  285   47-354     1-286 (290)
  5 KOG2457 consensus              100.0       0       0  634.0  10.2  338    6-345    91-500 (555)
  6 PRK10702 endonuclease III; Pro 100.0       0       0  436.3  16.0  202   12-220     6-208 (211)
  7 COG0177 Nth Predicted EndoIII- 100.0 1.6E-39 4.1E-44  298.9  14.6  181   37-220    27-208 (211)
  8 TIGR01083 nth endonuclease III 100.0 7.9E-38   2E-42  287.0  10.8  187    3-206     2-192 (192)
  9 smart00478 ENDO3c endonuclease 100.0 2.8E-37 7.3E-42  283.0  11.1  148   48-197     1-149 (149)
 10 cd00056 ENDO3c endonuclease II 100.0 6.7E-34 1.7E-38  259.3  11.2  154   40-195     1-158 (158)
 11 KOG1921 consensus               99.9   6E-25 1.5E-29  196.3   8.4  181   39-220    78-262 (286)
 12 pfam00730 HhH-GPD HhH-GPD supe  99.9 8.8E-23 2.2E-27  181.0   9.8  122   44-165     1-130 (144)
 13 cd03431 DNA_Glycosylase_C DNA   99.8   2E-20   5E-25  164.4  10.7  112  239-350     3-118 (118)
 14 PRK13913 3-methyladenine DNA g  99.6 2.5E-15 6.4E-20  128.5  10.0  141   37-181    28-180 (218)
 15 COG2231 Uncharacterized protei  99.6 1.3E-14 3.4E-19  123.3  12.6  178   38-219    29-214 (215)
 16 PRK08999 hypothetical protein;  99.6 4.7E-15 1.2E-19  126.5   9.4  118  237-354     4-131 (312)
 17 PRK10776 nucleoside triphospha  99.6 1.4E-14 3.5E-19  123.2   9.5  113  238-350     4-126 (129)
 18 cd03425 MutT_pyrophosphohydrol  99.5 5.2E-14 1.3E-18  119.2   9.5  111  240-350     3-123 (124)
 19 PRK10546 pyrimidine (deoxy)nuc  99.5 4.1E-13   1E-17  112.9   9.5  110  240-350     6-125 (135)
 20 TIGR00588 ogg 8-oxoguanine DNA  99.1 8.4E-11 2.1E-15   96.6   4.8  123   37-161   146-318 (379)
 21 KOG2875 consensus               99.1 1.2E-10 3.1E-15   95.5   5.5  115   37-155   115-256 (323)
 22 COG0122 AlkA 3-methyladenine D  99.1 3.4E-09 8.8E-14   85.2  11.8  164   11-181    78-261 (285)
 23 TIGR00586 mutt mutator mutT pr  98.8 4.9E-09 1.3E-13   84.1   4.3  116  236-351     3-149 (150)
 24 cd03427 MTH1 MutT homolog-1 (M  98.7 1.5E-07 3.8E-12   73.7   8.3  108  242-350     4-124 (137)
 25 PRK10308 3-methyl-adenine DNA   98.6 3.9E-07 9.8E-12   70.8   9.6  136   32-180   106-262 (283)
 26 cd04691 Nudix_Hydrolase_32 Mem  98.6 2.6E-07 6.7E-12   72.0   7.5   93  241-335     3-106 (117)
 27 cd04696 Nudix_Hydrolase_37 Mem  98.5 8.3E-07 2.1E-11   68.5   8.6  103  241-348     5-125 (125)
 28 cd04699 Nudix_Hydrolase_39 Mem  98.5 8.9E-07 2.3E-11   68.2   8.2  102  240-341     3-118 (129)
 29 cd04667 Nudix_Hydrolase_10 Mem  98.4 2.2E-06 5.5E-11   65.5   8.7  101  242-347     3-111 (112)
 30 cd04681 Nudix_Hydrolase_22 Mem  98.3 5.4E-06 1.4E-10   62.7   9.3  105  243-349     6-128 (130)
 31 cd04683 Nudix_Hydrolase_24 Mem  98.1 1.9E-05 4.8E-10   58.9   7.9  100  241-340     2-117 (120)
 32 cd04664 Nudix_Hydrolase_7 Memb  98.1 6.2E-05 1.6E-09   55.2  10.4  102  245-348    10-129 (129)
 33 cd04673 Nudix_Hydrolase_15 Mem  98.1 1.8E-05 4.7E-10   59.0   7.3  102  242-344     3-121 (122)
 34 cd04511 Nudix_Hydrolase_4 Memb  98.0 2.5E-05 6.5E-10   58.0   7.8  104  242-347    16-130 (130)
 35 cd04690 Nudix_Hydrolase_31 Mem  98.0 1.8E-05 4.6E-10   59.0   6.7   96  241-340     3-113 (118)
 36 cd04676 Nudix_Hydrolase_17 Mem  98.0   3E-05 7.6E-10   57.5   7.8  105  240-348     4-128 (129)
 37 cd04682 Nudix_Hydrolase_23 Mem  98.0 2.7E-05 6.8E-10   57.8   7.3   97  243-339     5-116 (122)
 38 cd04677 Nudix_Hydrolase_18 Mem  98.0 5.5E-05 1.4E-09   55.6   8.4  105  239-348     8-132 (132)
 39 cd04684 Nudix_Hydrolase_25 Con  97.9 0.00013 3.3E-09   53.0   9.8  104  243-347     4-127 (128)
 40 cd04679 Nudix_Hydrolase_20 Mem  97.9   8E-05   2E-09   54.5   8.3  106  240-347     4-124 (125)
 41 cd03430 GDPMH GDP-mannose glyc  97.9 0.00017 4.3E-09   52.2   9.1   97  240-339    14-134 (144)
 42 cd04678 Nudix_Hydrolase_19 Mem  97.8 0.00014 3.5E-09   52.8   8.6  108  240-348     4-129 (129)
 43 cd02883 Nudix_Hydrolase Nudix   97.8 0.00025 6.3E-09   51.0   9.7  104  242-347     4-122 (123)
 44 cd04670 Nudix_Hydrolase_12 Mem  97.8  0.0001 2.6E-09   53.8   7.5  107  239-347     3-123 (127)
 45 cd03674 Nudix_Hydrolase_1 Memb  97.8 0.00017 4.4E-09   52.1   8.1  104  241-349     6-135 (138)
 46 cd04695 Nudix_Hydrolase_36 Mem  97.8 0.00033 8.4E-09   50.1   9.3  107  241-350     5-127 (131)
 47 cd04672 Nudix_Hydrolase_14 Mem  97.7 8.8E-05 2.3E-09   54.2   5.9  102  242-348     6-123 (123)
 48 cd04669 Nudix_Hydrolase_11 Mem  97.7 0.00016   4E-09   52.5   6.9   97  241-339     3-116 (121)
 49 cd04700 DR1025_like DR1025 fro  97.7 0.00023 5.9E-09   51.2   7.8   97  241-338    16-126 (142)
 50 cd04671 Nudix_Hydrolase_13 Mem  97.6 0.00023   6E-09   51.2   7.2  106  239-350     1-120 (123)
 51 pfam00293 NUDIX NUDIX domain.   97.6 0.00033 8.5E-09   50.1   7.4  107  242-348     6-128 (133)
 52 cd03673 Ap6A_hydrolase Diadeno  97.6  0.0016   4E-08   45.4  10.5   99  248-350    14-129 (131)
 53 cd04697 Nudix_Hydrolase_38 Mem  97.5 0.00055 1.4E-08   48.6   7.9   97  240-337     3-112 (126)
 54 cd04680 Nudix_Hydrolase_21 Mem  97.5  0.0005 1.3E-08   48.9   7.6  104  241-349     3-119 (120)
 55 cd04687 Nudix_Hydrolase_28 Mem  97.5 0.00076 1.9E-08   47.6   8.5   96  241-339     4-123 (128)
 56 cd04688 Nudix_Hydrolase_29 Mem  97.5  0.0011 2.8E-08   46.5   9.1   95  242-341     5-123 (126)
 57 cd04689 Nudix_Hydrolase_30 Mem  97.5   0.002   5E-08   44.7  10.4  103  242-350     4-125 (125)
 58 pfam00633 HHH Helix-hairpin-he  97.5 2.9E-05 7.5E-10   57.5   1.0   30  108-137     1-30  (30)
 59 cd04693 Nudix_Hydrolase_34 Mem  97.4  0.0006 1.5E-08   48.3   6.7   94  243-337     5-113 (127)
 60 KOG1918 consensus               97.3  0.0011 2.7E-08   46.5   7.4  148   37-194    73-242 (254)
 61 cd04694 Nudix_Hydrolase_35 Mem  97.2  0.0012 3.1E-08   46.2   6.1   44  240-283     3-47  (143)
 62 PRK09438 ntpA dATP pyrophospho  97.1  0.0022 5.7E-08   44.3   6.8  106  242-352    12-145 (148)
 63 cd03675 Nudix_Hydrolase_2 Cont  97.1   0.006 1.5E-07   41.2   8.8  105  242-348     3-125 (134)
 64 cd03429 NADH_pyrophosphatase N  96.8  0.0086 2.2E-07   40.2   7.6   92  243-336     5-106 (131)
 65 cd03671 Ap4A_hydrolase_plant_l  96.7  0.0096 2.4E-07   39.8   7.2   43  240-285     5-47  (147)
 66 cd03428 Ap4A_hydrolase_human_l  96.6   0.022 5.7E-07   37.2   8.7   96  249-350    16-128 (130)
 67 cd03426 CoAse Coenzyme A pyrop  96.5   0.012 2.9E-07   39.3   6.6   87  249-335    15-116 (157)
 68 PRK03759 isopentenyl-diphospha  96.3   0.048 1.2E-06   34.9   8.7  101  235-337    32-152 (184)
 69 cd02885 IPP_Isomerase Isopente  96.2   0.062 1.6E-06   34.1   9.0  101  236-337    29-148 (165)
 70 PRK01229 N-glycosylase/DNA lya  96.1  0.0075 1.9E-07   40.6   4.1  115   40-162    34-163 (208)
 71 cd04692 Nudix_Hydrolase_33 Mem  96.1   0.063 1.6E-06   34.1   8.8   99  238-337     3-127 (144)
 72 TIGR03252 uncharacterized HhH-  96.0   0.033 8.4E-07   36.0   7.0  103   35-137    14-134 (177)
 73 PRK00714 dinucleoside polyphos  95.9   0.026 6.6E-07   36.8   6.1  107  239-349     9-150 (156)
 74 smart00525 FES FES domain. iro  95.7  0.0032 8.2E-08   43.2   0.8   23  198-220     1-23  (26)
 75 cd04686 Nudix_Hydrolase_27 Mem  95.5    0.11 2.9E-06   32.3   7.9   89  243-336     5-118 (131)
 76 PRK10707 hypothetical protein;  95.2    0.16   4E-06   31.3   7.8   99  237-335    29-144 (190)
 77 cd04685 Nudix_Hydrolase_26 Mem  95.1    0.13 3.3E-06   31.9   7.3   96  243-338     5-124 (133)
 78 PRK05379 bifunctional nicotina  95.0    0.11 2.9E-06   32.3   6.7  109  239-349   203-335 (340)
 79 COG1051 ADP-ribose pyrophospha  95.0    0.37 9.3E-06   28.7   9.2  107  241-348    12-135 (145)
 80 cd04674 Nudix_Hydrolase_16 Mem  94.7    0.11 2.9E-06   32.3   6.1   44  244-288     9-52  (118)
 81 cd04665 Nudix_Hydrolase_8 Memb  94.5    0.18 4.5E-06   30.9   6.7   48  241-293     2-49  (118)
 82 pfam05559 DUF763 Protein of un  94.5   0.041   1E-06   35.4   3.4   34  112-145   263-299 (319)
 83 cd03672 Dcp2p mRNA decapping e  94.2    0.24 6.2E-06   30.0   6.9   89  243-336     6-110 (145)
 84 pfam10576 EndIII_4Fe-2S Iron-s  93.4   0.025 6.3E-07   36.9   0.5   21  199-219     1-21  (26)
 85 PRK00241 nudC NADH pyrophospha  93.0    0.81 2.1E-05   26.3   7.8  115  223-349   126-252 (257)
 86 cd04661 MRP_L46 Mitochondrial   93.0    0.39 9.9E-06   28.5   6.2   82  264-345    25-128 (132)
 87 COG1059 Thermostable 8-oxoguan  92.4    0.22 5.7E-06   30.2   4.3   90   72-164    66-167 (210)
 88 COG2816 NPY1 NTP pyrophosphohy  92.3     1.1 2.8E-05   25.4   7.9  152  170-339    82-253 (279)
 89 PRK13901 ruvA Holliday junctio  91.6   0.078   2E-06   33.4   1.3   76  117-195    71-155 (196)
 90 COG1555 ComEA DNA uptake prote  91.0   0.093 2.4E-06   32.9   1.1   24  115-138    94-117 (149)
 91 TIGR02705 nudix_YtkD nucleosid  90.9    0.36 9.1E-06   28.8   4.1   78  240-322    26-110 (158)
 92 cd03424 ADPRase_NUDT5 ADP-ribo  90.5     1.1 2.7E-05   25.4   6.2   93  243-337     7-115 (137)
 93 KOG3069 consensus               90.3    0.77   2E-05   26.4   5.3   88  249-336    57-162 (246)
 94 TIGR00575 dnlj DNA ligase, NAD  90.2    0.46 1.2E-05   28.0   4.1   22  116-137   562-583 (706)
 95 PRK00024 radC DNA repair prote  90.0    0.11 2.8E-06   32.4   0.8   70   62-138    14-86  (224)
 96 COG1415 Uncharacterized conser  88.6    0.21 5.4E-06   30.4   1.4   35  113-147   273-310 (373)
 97 TIGR01259 comE comEA protein;   88.6    0.15 3.8E-06   31.5   0.6   48   75-130    67-114 (124)
 98 COG1443 Idi Isopentenyldiphosp  87.1     1.4 3.5E-05   24.6   4.9   99  237-337    33-153 (185)
 99 TIGR00084 ruvA Holliday juncti  86.3    0.23 5.8E-06   30.1   0.5   23  115-137   115-137 (217)
100 TIGR00426 TIGR00426 competence  85.1    0.64 1.6E-05   27.0   2.3   50   74-132    11-62  (70)
101 TIGR00615 recR recombination p  84.9     0.3 7.6E-06   29.3   0.5   21  115-135     9-29  (205)
102 PRK13766 Hef nuclease; Provisi  84.7     1.6   4E-05   24.3   4.1   30  238-267   467-500 (764)
103 PRK11762 nudE ADP-ribose dipho  84.3     3.6 9.2E-05   21.7   6.7   92  244-337    53-159 (185)
104 PRK13266 Thf1-like protein; Re  84.0     3.2 8.1E-05   22.1   5.4  127    6-160     5-145 (224)
105 TIGR00084 ruvA Holliday juncti  83.8    0.48 1.2E-05   27.9   1.2   59  117-178    82-143 (217)
106 TIGR00593 pola DNA polymerase   83.5    0.46 1.2E-05   28.0   1.0   45  101-147   165-220 (1005)
107 PRK00116 ruvA Holliday junctio  82.1    0.65 1.7E-05   26.9   1.3   50   83-136    75-126 (198)
108 pfam11731 Cdd1 Pathogenicity l  81.7    0.61 1.5E-05   27.1   1.0   42  116-157    10-51  (92)
109 PRK00254 ski2-like helicase; P  81.6     1.7 4.3E-05   24.0   3.3   20    8-27      8-29  (717)
110 PRK08097 ligB NAD-dependent DN  81.5    0.68 1.7E-05   26.8   1.2   20   66-86     95-114 (563)
111 TIGR03060 PS_II_psb29 photosys  81.3     3.7 9.5E-05   21.6   5.0  126    6-160     5-144 (214)
112 KOG2841 consensus               81.1     2.7 6.9E-05   22.6   4.2   21  330-350   225-247 (254)
113 TIGR01259 comE comEA protein;   81.0    0.85 2.2E-05   26.1   1.6   24  114-137    68-91  (124)
114 PRK00116 ruvA Holliday junctio  80.2    0.58 1.5E-05   27.3   0.5   58  117-177    72-131 (198)
115 COG0632 RuvA Holliday junction  80.0    0.41   1E-05   28.3  -0.3   44  117-161    72-117 (201)
116 PRK07456 consensus              79.9       1 2.6E-05   25.5   1.7   34  102-136   179-223 (975)
117 PRK00076 recR recombination pr  79.8    0.77   2E-05   26.4   1.0   24  113-136     2-29  (197)
118 PRK06887 consensus              79.4     1.1 2.8E-05   25.3   1.7   35  101-136   161-206 (954)
119 TIGR01816 sdhA_forward succina  79.0    0.43 1.1E-05   28.2  -0.5   18   62-79    271-289 (615)
120 PRK08609 hypothetical protein;  79.0     3.1 7.8E-05   22.2   3.9   24  112-135    82-105 (570)
121 PRK05929 consensus              78.3     1.2 3.1E-05   25.1   1.6   33  103-136   160-203 (870)
122 PRK07625 consensus              78.1     1.3 3.3E-05   24.8   1.8   35  101-136   160-205 (922)
123 PRK01172 ski2-like helicase; P  77.7     1.7 4.2E-05   24.1   2.2   45    8-52      8-61  (674)
124 PRK08076 consensus              77.6     1.4 3.5E-05   24.6   1.8   34  102-136   162-206 (877)
125 smart00278 HhH1 Helix-hairpin-  77.3    0.54 1.4E-05   27.5  -0.4   23  118-140     1-23  (26)
126 PRK07956 ligA NAD-dependent DN  77.1     2.4 6.1E-05   22.9   2.9   13  342-354   506-518 (668)
127 PRK13844 recombination protein  77.0       1 2.6E-05   25.5   1.0   24  112-135     5-32  (200)
128 PRK07556 consensus              76.9     1.3 3.3E-05   24.8   1.5   33  103-136   169-212 (977)
129 PRK12911 bifunctional preprote  76.8    0.71 1.8E-05   26.7   0.1   30  118-157   958-987 (1405)
130 cd00141 NT_POLXc Nucleotidyltr  76.8     2.2 5.6E-05   23.2   2.6   26  111-136    78-103 (307)
131 PRK08835 consensus              76.6     1.5 3.7E-05   24.4   1.7   51  101-161   161-222 (931)
132 COG0353 RecR Recombinational D  76.4    0.91 2.3E-05   25.9   0.6   20  116-135    10-29  (198)
133 PRK07997 consensus              76.3     1.5 3.9E-05   24.3   1.7   35  101-136   161-206 (928)
134 PRK07898 consensus              76.3     1.5 3.9E-05   24.3   1.7   35  101-136   175-220 (902)
135 PRK08786 consensus              76.3     1.4 3.7E-05   24.5   1.6   34  102-136   157-201 (927)
136 PRK08928 consensus              75.4     1.7 4.3E-05   24.0   1.7   51  101-161   159-220 (861)
137 PRK05797 consensus              75.0     1.6   4E-05   24.2   1.5   34  102-136   162-206 (869)
138 PRK08434 consensus              74.7     1.7 4.4E-05   23.9   1.7   49  103-161   159-218 (887)
139 TIGR02789 nickel_nikB nickel A  74.6    0.54 1.4E-05   27.5  -1.0   38  107-150   245-282 (315)
140 TIGR02236 recomb_radA DNA repa  74.4    0.54 1.4E-05   27.5  -1.0   57   68-128    21-77  (333)
141 PRK13901 ruvA Holliday junctio  74.3     1.2   3E-05   25.1   0.7   98   39-140    25-129 (196)
142 PRK07300 consensus              74.2     1.9 4.7E-05   23.7   1.7   33  103-136   171-214 (880)
143 smart00483 POLXc DNA polymeras  73.9     5.5 0.00014   20.4   4.1   25  111-135    82-106 (334)
144 PRK02362 ski2-like helicase; P  71.5     3.1 7.8E-05   22.2   2.3   18   10-27     10-29  (736)
145 PRK05755 DNA polymerase I; Pro  71.2     2.5 6.4E-05   22.8   1.8   35  101-136   159-204 (889)
146 PRK07945 hypothetical protein;  70.9     0.9 2.3E-05   25.9  -0.5   61   94-162    23-85  (335)
147 KOG2534 consensus               70.8     3.7 9.4E-05   21.6   2.6   67   94-162    34-107 (353)
148 COG1796 POL4 DNA polymerase IV  70.6       2 5.1E-05   23.5   1.2   21  252-272   201-221 (326)
149 pfam02371 Transposase_20 Trans  70.4     1.6   4E-05   24.2   0.6   41  118-162     2-42  (87)
150 COG2003 RadC DNA repair protei  70.0     2.3 5.9E-05   23.1   1.4   62   71-137    21-85  (224)
151 TIGR01129 secD protein-export   69.2     1.7 4.3E-05   24.0   0.6   12  137-148   118-129 (522)
152 COG3547 Transposase and inacti  68.0     5.4 0.00014   20.5   2.9   37   97-135   169-205 (303)
153 pfam01601 Corona_S2 Coronaviru  66.5       3 7.7E-05   22.2   1.4   75   77-160   149-239 (609)
154 pfam03686 UPF0146 Uncharacteri  66.3     3.5   9E-05   21.7   1.7   92  100-199     3-95  (127)
155 TIGR02150 IPP_isom_1 isopenten  66.0       5 0.00013   20.7   2.4   32  236-268    29-61  (190)
156 COG0494 MutT NTP pyrophosphohy  65.7     5.9 0.00015   20.2   2.8   28  250-280    24-51  (161)
157 KOG3084 consensus               65.6      11 0.00028   18.3   4.2   93  242-336   191-297 (345)
158 TIGR03374 ABALDH 1-pyrroline d  64.5     9.9 0.00025   18.6   3.7   76   81-156    49-136 (472)
159 cd01701 Pol_zeta Pol_zeta, a m  62.0      13 0.00032   17.8   4.0   88   68-159   164-265 (405)
160 PRK04148 hypothetical protein;  61.9     8.5 0.00022   19.1   3.0   92  100-199     3-102 (135)
161 pfam01367 5_3_exonuc 5'-3' exo  61.4     3.2 8.2E-05   22.0   0.8   34  115-157    15-48  (100)
162 TIGR02757 TIGR02757 conserved   61.0     3.7 9.3E-05   21.6   1.0   46  141-187   210-260 (269)
163 KOG3690 consensus               60.3      14 0.00035   17.6   6.3   97   93-205    43-148 (646)
164 KOG3911 consensus               59.4     4.8 0.00012   20.8   1.3   29   63-92     29-58  (378)
165 COG4649 Uncharacterized protei  59.2      14 0.00036   17.5   3.7   74  121-194   124-213 (221)
166 PRK09482 xni exonuclease IX; P  58.4     3.3 8.3E-05   22.0   0.4   12  124-135   188-199 (256)
167 COG1255 Uncharacterized protei  58.1      11 0.00028   18.3   3.0   99  102-212     5-104 (129)
168 smart00475 53EXOc 5'-3' exonuc  57.9     5.5 0.00014   20.4   1.5   33  103-136   161-204 (259)
169 COG5071 RPN5 26S proteasome re  56.9     9.4 0.00024   18.8   2.5   23   58-80      7-29  (439)
170 PRK13482 DNA integrity scannin  56.4      16  0.0004   17.2   4.4   23  137-159   268-294 (352)
171 COG1491 Predicted RNA-binding   54.4     5.9 0.00015   20.2   1.1   84   76-163    74-171 (202)
172 pfam09674 DUF2400 Protein of u  50.8      11 0.00027   18.4   2.0   19  143-162   174-192 (230)
173 pfam05597 Phasin Poly(hydroxya  50.6      19 0.00049   16.6   3.3   82   74-161    11-106 (132)
174 KOG4368 consensus               50.0      20  0.0005   16.5   3.4   39   66-104   196-236 (757)
175 pfam04919 DUF655 Protein of un  49.5     7.6 0.00019   19.4   1.1   54  107-164   103-158 (181)
176 PRK04301 radA DNA repair and r  49.4     7.5 0.00019   19.5   1.0   40   66-107    26-65  (318)
177 pfam02889 Sec63 Sec63 Brl doma  49.1      20 0.00052   16.4   3.7   48   85-136   119-166 (309)
178 TIGR03029 EpsG chain length de  47.8      13 0.00033   17.8   2.0   41  123-163   112-156 (274)
179 PRK05812 secD preprotein trans  47.7     7.1 0.00018   19.6   0.7   20  145-164   144-165 (513)
180 COG4277 Predicted DNA-binding   47.5     6.3 0.00016   20.0   0.4   45   76-122    83-153 (404)
181 pfam11264 ThylakoidFormat Thyl  47.1      22 0.00055   16.2   4.8  119   13-160     7-141 (216)
182 TIGR02153 gatD_arch glutamyl-t  47.1     7.6 0.00019   19.4   0.8   91   56-168    87-191 (413)
183 pfam05997 Nop52 Nucleolar prot  47.0      16 0.00042   17.1   2.4   50   52-108    16-70  (212)
184 COG1561 Uncharacterized stress  47.0      13 0.00032   17.9   1.9   35   94-128    84-118 (290)
185 KOG4548 consensus               46.5      22 0.00057   16.1   3.8   90  249-340   138-250 (263)
186 pfam04669 DUF579 Protein of un  46.4      22 0.00057   16.1   4.8   98   60-163    49-176 (187)
187 pfam04904 NCD1 NAB conserved r  45.1      23 0.00059   16.0   3.8   56   53-108    17-76  (82)
188 KOG0135 consensus               44.8      24  0.0006   16.0   6.2   99   61-160   477-585 (661)
189 COG0272 Lig NAD-dependent DNA   44.5      17 0.00043   17.0   2.2   16   71-86     82-97  (667)
190 TIGR01954 nusA_Cterm_rpt trans  44.5      21 0.00055   16.2   2.7   38   66-105    12-51  (52)
191 CHL00137 rps13 ribosomal prote  44.2      10 0.00026   18.6   1.0   28  111-138     7-37  (122)
192 pfam10343 DUF2419 Protein of u  44.0      21 0.00055   16.2   2.7   41   74-114    69-115 (282)
193 COG0258 Exo 5'-3' exonuclease   43.8      15 0.00038   17.4   1.8   34  102-135   171-215 (310)
194 PTZ00221 cyclophilin; Provisio  43.7      13 0.00033   17.8   1.5   58   69-129    20-98  (249)
195 PRK05179 rpsM 30S ribosomal pr  43.0      11 0.00029   18.2   1.2   27  112-138     8-37  (122)
196 cd04666 Nudix_Hydrolase_9 Memb  42.2      26 0.00065   15.7   8.8   83  250-336    15-114 (122)
197 pfam05516 consensus             42.1      22 0.00055   16.2   2.5   55   59-124    47-102 (136)
198 PRK13023 bifunctional preprote  42.1      11 0.00028   18.3   0.9   20  145-164   156-177 (840)
199 PRK04053 rps13p 30S ribosomal   42.0      12  0.0003   18.0   1.1   47   97-150     5-51  (149)
200 TIGR02111 PQQ_syn_pqqC coenzym  42.0      26 0.00065   15.7   2.8   76   36-128    20-101 (239)
201 COG1031 Uncharacterized Fe-S o  41.0      13 0.00034   17.7   1.2   30  111-147   253-283 (560)
202 pfam05166 YcgL YcgL domain. Th  41.0      16  0.0004   17.2   1.6   21   72-92     48-68  (74)
203 PRK13024 bifunctional preprote  40.9      11 0.00029   18.2   0.9   31  118-158   309-339 (741)
204 PRK02406 DNA polymerase IV; Va  40.7      27 0.00069   15.5   4.7  132   66-201   118-277 (355)
205 TIGR01448 recD_rel helicase, R  40.7      10 0.00026   18.6   0.6   17  119-135   195-212 (769)
206 pfam09735 Nckap1 Membrane-asso  40.3      27  0.0007   15.5   3.6   17    8-24    368-384 (1118)
207 cd01700 Pol_V Pol V was discov  40.2      27  0.0007   15.5   2.9   69   70-141   112-198 (344)
208 PRK03137 1-pyrroline-5-carboxy  40.0      28 0.00071   15.5   4.9   68   83-150    86-164 (514)
209 PRK01151 rps17E 30S ribosomal   39.8      15 0.00039   17.2   1.4   29   94-122     4-32  (58)
210 cd03586 Pol_IV_kappa Pol_IV_ka  39.2      28 0.00072   15.4   3.3   82   66-151   110-205 (337)
211 cd00008 53EXOc 5'-3' exonuclea  38.9      15 0.00038   17.4   1.2   43  107-159   162-215 (240)
212 PTZ00134 40S ribosomal protein  38.8     9.9 0.00025   18.6   0.3   30  109-138    18-50  (154)
213 KOG3835 consensus               38.1      30 0.00075   15.3   4.2   81   53-133    19-119 (495)
214 TIGR03631 bact_S13 30S ribosom  37.9      12 0.00031   17.9   0.7   27  112-138     6-35  (113)
215 pfam00113 Enolase_C Enolase, C  37.8     6.5 0.00017   19.9  -0.8   29  106-140    66-96  (296)
216 pfam11372 DUF3173 Protein of u  37.6      20 0.00051   16.5   1.7   25   78-106     5-30  (59)
217 pfam08887 GAD-like GAD-like do  37.4      25 0.00064   15.8   2.2   13   60-72      3-15  (109)
218 cd00080 HhH2_motif Helix-hairp  37.1     8.2 0.00021   19.2  -0.3   19  118-136    22-40  (75)
219 PRK11820 hypothetical protein;  37.0      28 0.00071   15.4   2.4   34   94-127    84-117 (288)
220 TIGR03629 arch_S13P archaeal r  36.3     8.9 0.00023   18.9  -0.3   30  109-138     9-41  (144)
221 cd00424 Pol_Y Y-family of DNA   35.9      18 0.00047   16.7   1.3   89   67-159   111-214 (341)
222 COG0529 CysC Adenylylsulfate k  35.5      22 0.00055   16.2   1.6   42  120-161    28-73  (197)
223 PRK11241 gabD succinate-semial  35.5      32 0.00082   15.0   3.3   56   82-137    60-126 (482)
224 pfam03755 YicC_N YicC-like fam  35.3      30 0.00078   15.2   2.3   34   94-127    83-116 (159)
225 LOAD_Hrd consensus              35.1      11 0.00027   18.4  -0.0   21  112-132    41-61  (77)
226 COG1084 Predicted GTPase [Gene  34.9      33 0.00084   14.9   2.8   84   46-144    47-139 (346)
227 KOG2379 consensus               34.9      27 0.00069   15.5   2.0   36   97-135    38-74  (501)
228 KOG2705 consensus               34.8      20 0.00051   16.5   1.3   28  107-139   193-220 (471)
229 pfam00416 Ribosomal_S13 Riboso  34.7      12 0.00031   18.0   0.2   27  112-138     6-35  (106)
230 pfam02467 Whib Transcription f  34.3      12 0.00031   18.0   0.1   19  203-221    29-47  (66)
231 PRK10729 nudF ADP-ribose pyrop  34.3      34 0.00086   14.9   7.1   76  241-316    51-139 (202)
232 PRK02515 psbU photosystem II c  33.7      34 0.00088   14.8   5.0   80   99-189    51-133 (144)
233 cd07092 ALDH_ABALDH-YdcW Esche  33.6      31  0.0008   15.1   2.2   22   85-106    34-55  (450)
234 cd07093 ALDH_F8_HMSADH Human a  33.5      32 0.00081   15.1   2.2   24   84-107    33-56  (455)
235 smart00341 HRDC Helicase and R  33.5      13 0.00033   17.8   0.2   21  112-132    41-61  (81)
236 cd07090 ALDH_F9_TMBADH NAD+-de  33.0      34 0.00088   14.8   2.3   57   82-138    31-98  (457)
237 TIGR01862 N2-ase-Ialpha nitrog  32.2     5.5 0.00014   20.4  -1.9  114   82-202   231-362 (510)
238 pfam00570 HRDC HRDC domain. Th  31.8      13 0.00034   17.7   0.0   21  112-132    38-58  (68)
239 smart00279 HhH2 Helix-hairpin-  31.7      15 0.00038   17.3   0.3   19  118-136    16-34  (36)
240 PRK09407 gabD2 succinic semial  31.6      37 0.00095   14.6   2.5   55   83-137    66-131 (523)
241 PTZ00081 enolase (2-phospho-D-  31.6     8.9 0.00023   18.9  -0.9   12  147-158   132-143 (442)
242 KOG0648 consensus               31.5      37 0.00095   14.6   2.6   47  237-283   114-161 (295)
243 PHA00619 CRISPR-associated Cas  31.5      25 0.00064   15.8   1.4   12  205-216   196-207 (210)
244 KOG2171 consensus               31.5      37 0.00095   14.5   5.2   16  206-221   584-599 (1075)
245 PRK03352 DNA polymerase IV; Va  31.3      32 0.00082   15.0   1.9   86   69-159   118-217 (345)
246 TIGR00372 cas4 CRISPR-associat  30.5      18 0.00045   16.8   0.5   16  202-217   191-206 (206)
247 pfam11798 IMS_HHH IMS family H  30.5      14 0.00036   17.6  -0.0   16  120-135    14-29  (33)
248 KOG4233 consensus               30.4      19 0.00049   16.6   0.6   47  169-218    42-89  (90)
249 pfam10273 WGG Pre-rRNA-process  30.4      39 0.00099   14.4   2.8   12    5-16     33-44  (82)
250 PRK13698 plasmid-partitioning   30.4      39 0.00099   14.4   5.1   50  122-180   102-157 (323)
251 TIGR00658 orni_carb_tr ornithi  30.2      33 0.00085   14.9   1.9   50  140-194    85-142 (341)
252 PTZ00205 DNA polymerase kappa;  30.1      29 0.00073   15.3   1.5   19  120-138   311-329 (571)
253 TIGR02545 ATP_syn_fliI flagell  29.5      26 0.00067   15.6   1.2   15  145-161   331-345 (439)
254 PRK13473 gamma-aminobutyraldeh  29.5      40   0.001   14.3   4.1   56   83-138    52-119 (475)
255 KOG1338 consensus               29.3      40   0.001   14.3   4.5   93   47-144    96-196 (466)
256 cd03312 CIMS_N_terminal_like C  28.8      11 0.00029   18.2  -0.7   15  101-115   108-122 (360)
257 TIGR02535 hyp_Hser_kinase prop  28.6     3.4 8.6E-05   21.9  -3.5   23  178-200   200-223 (431)
258 cd01703 Pol_iota Pol iota is m  28.5      23 0.00059   16.0   0.8   53   85-138   163-230 (394)
259 cd07108 ALDH_MGR_2402 Magnetos  28.2      39 0.00099   14.4   1.9   53   83-135    32-96  (457)
260 TIGR02061 aprA adenylylsulfate  27.9      33 0.00083   14.9   1.5   24   62-87    562-585 (651)
261 PTZ00154 40S ribosomal protein  27.7      34 0.00086   14.8   1.5   29   94-122     5-33  (130)
262 KOG2966 consensus               27.5      22 0.00056   16.2   0.5   11  190-200   149-159 (325)
263 TIGR02418 acolac_catab acetola  27.4      17 0.00043   17.0  -0.1   30  123-152    75-104 (553)
264 cd07146 ALDH_PhpJ Streptomyces  27.3      38 0.00098   14.5   1.8   26   83-108    31-56  (451)
265 COG3600 GepA Uncharacterized p  27.2      28 0.00071   15.4   1.0   70   87-159    31-108 (154)
266 COG1468 CRISPR-associated prot  26.8      23 0.00058   16.0   0.5   17  201-217   172-188 (190)
267 cd07151 ALDH_HBenzADH NADP+-de  26.4      45  0.0012   14.0   3.3   56   83-138    45-111 (465)
268 pfam03118 RNA_pol_A_CTD Bacter  26.2      21 0.00054   16.3   0.3   47   87-135    11-57  (62)
269 PRK12308 bifunctional arginino  26.1      44  0.0011   14.0   1.9   23  110-132   202-224 (614)
270 COG1383 RPS17A Ribosomal prote  25.9      43  0.0011   14.1   1.8   29   94-122     5-33  (74)
271 cd07102 ALDH_EDX86601 Uncharac  25.9      46  0.0012   13.9   4.9   25   83-107    31-55  (452)
272 pfam06754 PhnG Phosphonate met  25.8      44  0.0011   14.0   1.8   20   72-91     11-30  (147)
273 PRK13181 hisH imidazole glycer  25.8      15 0.00038   17.4  -0.6   11  121-131    41-51  (199)
274 PRK09457 astD succinylglutamic  25.8      46  0.0012   13.9   2.1   23   84-106    39-61  (475)
275 KOG3771 consensus               25.7      47  0.0012   13.9   4.1   83    1-99     63-145 (460)
276 cd07098 ALDH_F15-22 Aldehyde d  25.5      47  0.0012   13.8   2.2   26   83-108    31-56  (465)
277 TIGR02923 AhaC ATP synthase A1  25.3      46  0.0012   13.9   1.8   19   40-58     47-65  (353)
278 pfam02961 BAF Barrier to autoi  25.2      16 0.00041   17.1  -0.5   74  114-218    15-89  (89)
279 cd04663 Nudix_Hydrolase_6 Memb  25.1      48  0.0012   13.8   4.1   36  250-290    14-49  (126)
280 pfam06424 PRP1_N PRP1 splicing  25.0      16 0.00042   17.1  -0.5   16  115-130   115-130 (132)
281 cd02027 APSK Adenosine 5'-phos  24.8      47  0.0012   13.8   1.8   32  121-152     5-39  (149)
282 cd03740 SOCS_SOCS6 SOCS (suppr  24.7      45  0.0011   14.0   1.7   22  328-349    17-38  (41)
283 cd07116 ALDH_ACDHII-AcoD Ralst  24.6      49  0.0012   13.7   2.0   27   83-109    51-77  (479)
284 cd02037 MRP-like MRP (Multiple  24.3      40   0.001   14.3   1.4   27  124-150     9-38  (169)
285 TIGR02291 rimK_rel_E_lig alpha  24.2      33 0.00084   14.9   0.9   13  259-271   119-131 (320)
286 TIGR02684 dnstrm_HI1420 probab  24.1      50  0.0013   13.7   2.6   30   73-105    24-55  (91)
287 COG3100 Uncharacterized protei  24.0      43  0.0011   14.1   1.5   20   73-92     61-80  (103)
288 COG0099 RpsM Ribosomal protein  24.0      37 0.00093   14.6   1.1   27  112-138     8-37  (121)
289 cd07147 ALDH_F21_RNP123 Aldehy  23.9      50  0.0013   13.6   2.0   21   85-105    36-56  (452)
290 TIGR00608 radc DNA repair prot  23.9      49  0.0013   13.7   1.8   42   97-138    34-83  (223)
291 PRK00558 uvrC excinuclease ABC  23.8      50  0.0013   13.6   3.6   23  187-209   376-398 (609)
292 pfam00286 Flexi_CP Viral coat   23.8      50  0.0013   13.6   3.9   78   80-175    11-90  (140)
293 pfam01583 APS_kinase Adenylyls  23.8      46  0.0012   13.9   1.6   32  121-152     8-42  (157)
294 COG0634 Hpt Hypoxanthine-guani  23.5      51  0.0013   13.6   2.7  106   95-216    18-134 (178)
295 cd07559 ALDH_ACDHII_AcoD-like   23.4      51  0.0013   13.6   2.1   58   81-138    49-118 (480)
296 KOG1362 consensus               23.4      15 0.00038   17.4  -1.0   16   40-55    213-228 (577)
297 TIGR01258 pgm_1 phosphoglycera  23.3      21 0.00053   16.3  -0.2   11   53-63    156-166 (248)
298 cd03313 enolase Enolase: Enola  23.1      12 0.00031   17.9  -1.4   49   86-137    54-109 (408)
299 TIGR02903 spore_lon_C ATP-depe  22.9      37 0.00095   14.6   1.0   30  124-153   185-224 (616)
300 cd07106 ALDH_AldA-AAD23400 Str  22.9      52  0.0013   13.5   2.0   24   84-107    33-56  (446)
301 COG1251 NirB NAD(P)H-nitrite r  22.9      52  0.0013   13.5   2.7   46  108-157   310-355 (793)
302 TIGR01203 HGPRTase hypoxanthin  22.8      52  0.0013   13.5   2.2   61   95-155     9-77  (183)
303 cd07591 BAR_Rvs161p The Bin/Am  22.7      53  0.0013   13.5   3.2   75    1-84     44-124 (224)
304 KOG0142 consensus               22.4      23 0.00059   16.0  -0.1   33  235-268    50-83  (225)
305 cd02011 TPP_PK Thiamine pyroph  22.1      48  0.0012   13.8   1.4   12  203-214    89-100 (227)
306 pfam05405 Mt_ATP-synt_B Mitoch  21.9      55  0.0014   13.4   2.7  121   68-194    35-162 (163)
307 TIGR03371 cellulose_yhjQ cellu  21.9      55  0.0014   13.4   1.8   39  125-163    12-53  (246)
308 KOG4579 consensus               21.8      53  0.0014   13.4   1.6   16   62-77     70-85  (177)
309 PRK00077 eno phosphopyruvate h  21.8      16 0.00041   17.2  -1.1   27  110-136    80-113 (427)
310 TIGR03650 violacein_E violacei  21.6      36 0.00093   14.6   0.7   15   38-52     53-67  (184)
311 TIGR01577 oligosac_amyl oligos  21.6      55  0.0014   13.3   2.0   40   97-136   402-444 (679)
312 cd07131 ALDH_AldH-CAJ73105 Unc  21.5      56  0.0014   13.3   4.3   56   83-138    50-116 (478)
313 COG1168 MalY Bifunctional PLP-  21.5      56  0.0014   13.3   2.6  109   21-158    14-124 (388)
314 pfam10391 DNA_pol_lambd_f Fing  21.4      44  0.0011   14.0   1.1   21  118-138     2-22  (52)
315 TIGR03018 pepcterm_TyrKin exop  21.3      41   0.001   14.2   1.0   41  123-163    44-89  (207)
316 pfam00506 Flu_NP Influenza vir  21.3      56  0.0014   13.3   7.1  105   92-202   156-274 (506)
317 COG4892 Predicted heme/steroid  21.2      30 0.00075   15.3   0.2   35   95-129    41-75  (81)
318 PRK04195 replication factor C   21.2      57  0.0014   13.3   3.1   27  122-148    47-73  (403)
319 pfam08440 Poty_PP Potyviridae   21.0      38 0.00097   14.5   0.7   14  132-145     1-14  (274)
320 COG2355 Zn-dependent dipeptida  20.9      57  0.0015   13.2   2.2   90  113-218   107-200 (313)
321 PRK13153 consensus              20.8      42  0.0011   14.2   0.9   11  121-131    41-51  (203)
322 cd02035 ArsA ArsA ATPase funct  20.8      58  0.0015   13.2   1.7   25  125-149     9-36  (217)
323 COG5133 Uncharacterized conser  20.7      43  0.0011   14.1   1.0   94  116-217    10-122 (181)
324 COG1313 PflX Uncharacterized F  20.6      52  0.0013   13.6   1.3   45  169-213    38-84  (335)
325 PRK09847 gamma-glutamyl-gamma-  20.6      58  0.0015   13.2   2.3   52   86-137    75-138 (494)
326 TIGR01944 rnfB electron transp  20.5      58  0.0015   13.2   2.7   54  107-161    41-96  (213)
327 pfam02764 Diphtheria_T Diphthe  20.1      42  0.0011   14.2   0.8  100   85-199    73-173 (180)
328 TIGR01837 PHA_granule_1 poly(h  20.1      59  0.0015   13.1   1.7   23  139-162    75-97  (121)
329 KOG2573 consensus               20.0      36 0.00091   14.7   0.4   31   57-96    126-157 (498)
330 cd07086 ALDH_F7_AASADH-like NA  20.0      60  0.0015   13.1   4.9   53   83-135    48-111 (478)

No 1  
>PRK10880 adenine DNA glycosylase; Provisional
Probab=100.00  E-value=0  Score=860.79  Aligned_cols=341  Identities=32%  Similarity=0.584  Sum_probs=310.5

Q ss_pred             HHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHH
Q ss_conf             68999999999996289-98875485544456787046766446731161203326589987748997898729989999
Q gi|254780479|r    5 EHIIQSKILDWYDTNHR-VLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEIL   83 (356)
Q Consensus         5 ~~~~~~~ll~w~~~~~R-~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl   83 (356)
                      ...|+++||+||+.|+| |||||++        +|||+|||||||||||||+||+|||++||++|||+++||+|++||||
T Consensus         3 ~~~f~~~ll~Wy~~~~r~~lPWr~~--------~~PY~vwvSEiMLQQTqv~tV~~yy~rf~~~fP~~~~LA~A~~~~vl   74 (350)
T PRK10880          3 ASQFSAQVLDWYDKYGRKTLPWQID--------KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVL   74 (350)
T ss_pred             HHHHHHHHHHHHHHCCCCCCCCCCC--------CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHH
T ss_conf             7999999999998739988998999--------98226999999983487789999999999988399999779999999


Q ss_pred             HHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCC
Q ss_conf             99740463458899987767543005786873145688888777877667766434963131243146665765197545
Q gi|254780479|r   84 SAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKP  163 (356)
Q Consensus        84 ~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~  163 (356)
                      ++|+||||||||||||++|++|+++|+|.||+|+++|++|||||+||||||+|||||++.++|||||+||++|+|+++.+
T Consensus        75 ~~W~GLGYY~RArnLh~aA~~i~~~~~G~~P~~~~~L~~LPGIG~yTA~AI~siaf~~~~~ivDgNV~RVlsR~~~i~~~  154 (350)
T PRK10880         75 HLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGW  154 (350)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCC
T ss_conf             98640693899999999999999975898982599986266887279999999976995357664403288887503368


Q ss_pred             C--CHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCEEEE
Q ss_conf             7--35689998741025785226789999999986320412356431100000134310110012344322222211232
Q gi|254780479|r  164 A--PLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGA  241 (356)
Q Consensus       164 ~--~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~r~~~  241 (356)
                      .  +..++.+|.+++.++|.++++|||||||||||+||||++|+|..|||+..|.||++|.++.||+|++|++++.+...
T Consensus       155 ~~~~~~~~~l~~~a~~~~p~~~~~~~nQAlMDLGA~vCtp~~P~C~~CPl~~~C~A~~~~~~~~~P~kk~kk~~p~~~~~  234 (350)
T PRK10880        155 PGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMVCTRSKPKCELCPLQNGCIAAANHSWALYPGKKPKQTLPERTGY  234 (350)
T ss_pred             CCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCHHHCCCCCCCCCCCEEEEE
T ss_conf             87559999999999973884327799999999734102799998788988220646427985655886788988779999


Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCC-
Q ss_conf             0003413760577645720132047442111134672158886301010000213735899831599999999980776-
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQI-  320 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~-  320 (356)
                       +++++++|++||+||+++|+|+||||||+++..+....+.... .........++.|+|+||||+|+|.++.+.+... 
T Consensus       235 -~~ii~~~~~ill~kR~~~giw~gLw~fP~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~H~FSHf~L~I~p~~~~~~~~~  312 (350)
T PRK10880        235 -FLLLQHEDEVWLAQRPPSGLWGGLYCFPQFADEEELRQWLAQR-GINADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFT  312 (350)
T ss_pred             -EEEEEECCEEEEEECCCCCHHHHCCCCCCCCCHHHHHHHHHHC-CCCCCCCEECCCEEEEEECEEEEEEEEEEEECCCC
T ss_conf             -9999509989998379777345445488888677899999982-89810001147625841060767389999814500


Q ss_pred             --CCCCCCEEECHHHHHHCCCCHHHHHHHHHCCCCCC
Q ss_conf             --76678133227888747998789999984482379
Q gi|254780479|r  321 --VIIPDSTWHDAQNLANAALPTVMKKALSAGGIKVP  355 (356)
Q Consensus       321 --~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~~~p  355 (356)
                        ...+++.|++++++.+++||+|++|||+.|....|
T Consensus       313 ~~~~~~~~~W~~~~~~~~~GLPaPvkKiL~~L~~~~~  349 (350)
T PRK10880        313 GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP  349 (350)
T ss_pred             CCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHCCCCC
T ss_conf             1355667856444530204787899999997140489


No 2  
>TIGR01084 mutY A/G-specific adenine glycosylase; InterPro: IPR005760    The DNA repair enzyme MutY plays an important role in the prevention of DNA mutations resulting from the presence of the oxidatively damaged lesion 7,8-dihydro-8-oxo-2'-deoxyguanosine (8-OxoG). 8-OxoG can mispair with 2'-deoxycytidine 5'-triphosphate or with 2'-deoxyadenosine triphosphate during DNA replication, forming C*8-oxoG and A*8-oxoG mispairs. Human MutY is responsible for recognition and removal of the inappropriately inserted adenine in an A=8-oxoG mispair. If unrepaired, the A=8-oxoG mispairs can result in deleterious C:G to A:T transversions.    Multiple forms of mammalian MutY are formed by alternative splicing and locate to the nucleus or mitochondrion, where they have been shown to interact with several other proteins involved in the repair of DNA damage . The HhH-GPD domain within the protein binds the phosphate backbone of the substrate.   This family represents bacterial MutY and a limited number of murine homologs. In rat, the Escherichia coli MutY DNA glycosylase homolog (MYH) is induced in response to mitochondrial DNA damage . ; GO: 0019104 DNA N-glycosylase activity, 0006284 base-excision repair, 0005622 intracellular.
Probab=100.00  E-value=0  Score=805.38  Aligned_cols=275  Identities=39%  Similarity=0.756  Sum_probs=259.0

Q ss_pred             HHHHHHHHHHHHCCC-CCCCCCCCC----CCCCCCCCHHHHHHHHHHHHHCCHHHCCH-HHHHHHHHCCCHHHHHCCCHH
Q ss_conf             999999999996289-988754855----44456787046766446731161203326-589987748997898729989
Q gi|254780479|r    7 IIQSKILDWYDTNHR-VLPWRTSPK----TEKSSLPSPYKVWISEIMLQQTTVKTVEP-YFKKFMQKWPTIFCLSSAKDE   80 (356)
Q Consensus         7 ~~~~~ll~w~~~~~R-~lpwr~~~~----~~~~~~~~py~v~vseimlqqT~v~~v~~-~~~~~~~~~P~~~~la~a~~~   80 (356)
                      .|++.||.||+++|| +||||.+..    .+..+++|||+|||||||||||||+|||| ||+|||++||||++||+|++|
T Consensus         1 ~f~~~ll~Wy~~~gRk~LPWr~~~~~~cdeslkhi~~pY~VW~SEvMLQQTqV~tV~prYf~rFle~FPTv~~LA~A~~d   80 (297)
T TIGR01084         1 QFREDLLSWYDKEGRKTLPWRQNKNQRCDESLKHIDDPYRVWVSEVMLQQTQVATVIPRYFERFLERFPTVQALANAPQD   80 (297)
T ss_pred             CHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEECCCCCEEEECCCCCCHHHHHHCCCHHHHHCCCHH
T ss_conf             93788999998717867887478755424577522354514400211001100112671004766427885787477965


Q ss_pred             HHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCC
Q ss_conf             99999740463458899987767543005786873145688888777877667766434963131243146665765197
Q gi|254780479|r   81 EILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDI  160 (356)
Q Consensus        81 ~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~  160 (356)
                      |||++|+||||||||||||+||++|+++|||+||.|+++|.+|||||+||||||+|||||+|.|+|||||+|||||+|++
T Consensus        81 eVL~lW~GLGYYaRARNL~kAA~~v~~~fGG~fP~d~~~~~~L~GVG~yTAgAils~a~~~~~p~~DGNV~RVLsR~fA~  160 (297)
T TIGR01084        81 EVLKLWEGLGYYARARNLHKAAQEVVEEFGGEFPQDLEDLKALPGVGRYTAGAILSFAYNKPVPILDGNVKRVLSRLFAV  160 (297)
T ss_pred             HHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH
T ss_conf             79998625786788899999999999871881772379785178976217999999872687620154078899999862


Q ss_pred             CCCC--CHHHHHHHHHHHCCCC-CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCC
Q ss_conf             5457--3568999874102578-522678999999998632041235643110000013431011001234432222221
Q gi|254780479|r  161 IKPA--PLYHKTIKNYARKITS-TSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPM  237 (356)
Q Consensus       161 ~~~~--~~~~k~l~~~~~~~~~-~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~  237 (356)
                      ++++  +..++.||.+++.++| ..++.+||||||||||+||++++|.|+.|||+++|.||+.|..+.||+|++|+..+.
T Consensus       161 ~~~~~~k~~e~~l~~~~~~llpe~~~~~~~nqalmDlGA~iC~rk~P~C~~CPl~~~C~A~~~~~~~~yP~kk~~~~~~~  240 (297)
T TIGR01084       161 EGWPGKKKVENRLWELAESLLPEKADPEAFNQALMDLGALICTRKKPKCDLCPLQDFCLAYKQGTPEEYPVKKKKKAPPE  240 (297)
T ss_pred             CCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHCCCCCCCCCHHH
T ss_conf             48988734889999999985886568658888998623610378478545487066555654277111678773100012


Q ss_pred             EEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHH
Q ss_conf             12320003413760577645720132047442111134672158
Q gi|254780479|r  238 RTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGN  281 (356)
Q Consensus       238 r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~  281 (356)
                      +...+.++.+.+++++|.||+++|||+|||+||.++..++....
T Consensus       241 ~~~~~l~l~~~d~~~~~~~r~~~~l~gGLy~fP~~~~~kG~~~~  284 (297)
T TIGR01084       241 RTTYFLVLLNEDGEVLLEQRPEKGLWGGLYCFPQFELEKGIKEE  284 (297)
T ss_pred             HHEEEEEEEECCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHH
T ss_conf             01012234414870576663766410010036525532013578


No 3  
>COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair]
Probab=100.00  E-value=0  Score=764.26  Aligned_cols=330  Identities=42%  Similarity=0.729  Sum_probs=299.8

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHH
Q ss_conf             68999999999996289988754855444567870467664467311612033265899877489978987299899999
Q gi|254780479|r    5 EHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILS   84 (356)
Q Consensus         5 ~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~   84 (356)
                      ...|++.||.||++|+|+||||++        .+||+|||||||||||||++|+|||.+||++|||+++||+|++|||++
T Consensus         8 ~~~~~~~ll~Wy~~~~R~LPWR~~--------~~PY~VwvSEiMLQQT~v~~Vi~yy~~fl~rfPti~~LA~A~~~evl~   79 (342)
T COG1194           8 IEKFQEALLDWYDKNGRDLPWRET--------KDPYRVWVSEIMLQQTQVATVIPYYERFLERFPTIKALAAAPEDEVLK   79 (342)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCCCC--------CCCCEEHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHH
T ss_conf             578888999999974874887789--------986220268887600607455456999998689989986688889999


Q ss_pred             HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCC
Q ss_conf             97404634588999877675430057868731456888887778776677664349631312431466657651975457
Q gi|254780479|r   85 AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA  164 (356)
Q Consensus        85 ~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~  164 (356)
                      +|+||||||||||||++|++|+++|+|.||+|.++|.+|||||+|||+||+|||||++.++|||||+||++|+|+++.+.
T Consensus        80 ~W~gLGYysRArnL~~~A~~v~~~~~G~~P~~~~~l~~LpGiG~yTa~Ail~~a~~~~~~~lDgNV~RVl~R~f~i~~~~  159 (342)
T COG1194          80 AWEGLGYYSRARNLHKAAQEVVERHGGEFPDDEEELAALPGVGPYTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDI  159 (342)
T ss_pred             HHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEECCHHEEEHHHHCCCCCC
T ss_conf             98716737899999999999999819979999999986789738899999998718987533054221126653143655


Q ss_pred             C--HHHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCEEEE
Q ss_conf             3--568999874102578522-6789999999986320412356431100000134310110012344322222211232
Q gi|254780479|r  165 P--LYHKTIKNYARKITSTSR-PGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGA  241 (356)
Q Consensus       165 ~--~~~k~l~~~~~~~~~~~~-~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~r~~~  241 (356)
                      .  ..++.+|.+++.++.+.+ +++||||||||||+||||++|+|..||++..|.+++.|.++.||+|++|++++ +...
T Consensus       160 ~~~~~~~~~~~~~~~ll~p~~~~~~fnqammdlGA~ICt~~~P~C~~CPl~~~c~a~~~g~~~~~P~k~~k~~~~-~~~~  238 (342)
T COG1194         160 GKPKTKKELWELAEQLLTPDRRPGDFNQAMMDLGATICTAKKPKCSLCPLRDNCAAYRNGTPEKYPVKKPKKKLP-RRFA  238 (342)
T ss_pred             CCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCC-HHEE
T ss_conf             654105899999997448988767999999986367615899987739466889999819964478766554551-0033


Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCCC
Q ss_conf             00034137605776457201320474421111346721588863010100002137358998315999999999807767
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQIV  321 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~~  321 (356)
                      ++++++.+|.+++.||+++|+|+|||+||.++++..............    ..+++++|+||||+|+++ +...... .
T Consensus       239 ~~~~~~~~~~~~l~kr~~~gl~~gl~~fP~~e~~~~~~~~~~~~~~~~----~~~~~~~H~fth~~l~i~-~~a~~~~-~  312 (342)
T COG1194         239 AFLILNRDGEVLLEKRPEKGLLGGLWCFPQFEDEADLLDWLAADGLAA----EPLGAFRHTFTHFRLTIE-LRASASL-V  312 (342)
T ss_pred             EEEEECCCCCHHHHHCCCCCCEECCCCCCCCCCCCHHHHHHHHCCCCC----CCCCCEEEEEEEEEEEEE-EEEECCC-C
T ss_conf             577870685316662765672211120564343320466765413210----124312434668999999-9863246-7


Q ss_pred             CCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             66781332278887479987899999844
Q gi|254780479|r  322 IIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       322 ~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      .. +..|++.++++.+++|++++|+++.+
T Consensus       313 ~~-~~~w~~~~~~~~~~l~~p~~k~l~~~  340 (342)
T COG1194         313 LS-DGRWYNLSDLESIGLPAPVKKLLQQL  340 (342)
T ss_pred             CC-CCEECCCCCCCCCCCCHHHHHHHHHH
T ss_conf             88-73451634443135527999999873


No 4  
>PRK13910 DNA glycosylase MutY; Provisional
Probab=100.00  E-value=0  Score=684.27  Aligned_cols=285  Identities=30%  Similarity=0.503  Sum_probs=248.6

Q ss_pred             HHHHHCCHHHCCH-HHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH
Q ss_conf             6731161203326-589987748997898729989999997404634588999877675430057868731456888887
Q gi|254780479|r   47 IMLQQTTVKTVEP-YFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPG  125 (356)
Q Consensus        47 imlqqT~v~~v~~-~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpG  125 (356)
                      ||||||||+||+| ||++||++|||+++||+|++||||++|+||||||||||||++|++|+++|+|+||+|+++|++|||
T Consensus         1 iMLQQTqv~tvip~y~~~f~~~fP~~~~la~a~~~~vl~~W~GLGYY~RArnl~~~a~~i~~~~~g~~P~~~~~L~~LPG   80 (290)
T PRK13910          1 VMSQQTQINTVVERFYSPFLEAFPTLKDLANAQLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLKLPG   80 (290)
T ss_pred             CCCCCCCCCCCCHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCC
T ss_conf             96885742100578999999988399999778999999998746848999999999999999838989852999975889


Q ss_pred             HHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC
Q ss_conf             77877667766434963131243146665765197545735689998741025785226789999999986320412356
Q gi|254780479|r  126 IGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPL  205 (356)
Q Consensus       126 iG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~  205 (356)
                      ||+|||+||+|||||++.++|||||+||++|+|+++.+..  .+.+|..++.++|.++++|||||||||||+||+|+ |+
T Consensus        81 IG~yTA~AI~siaf~~~~~~vDgNv~RVl~R~~~~~~~~~--~k~~~~~~~~~~~~~~~~~~nqalMdlGa~iC~pk-P~  157 (290)
T PRK13910         81 IGAYTANAILCFGFREKSACVDANIKRVLLRLFGLDPNIH--AKDLQIKANDFLNLNESFNHNQALIDLGALICSPK-PK  157 (290)
T ss_pred             CCCHHHHHHHHHHCCCCCCCCCCCHHEEEEHHHCCCCCCC--HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC-CC
T ss_conf             9826999999986476332000452014102324789946--69999999982595422689999998633343799-99


Q ss_pred             CCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH
Q ss_conf             43110000013431011001234432222221123200034137605776457201320474421111346721588863
Q gi|254780479|r  206 CPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH  285 (356)
Q Consensus       206 C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~  285 (356)
                      |..||+++.|.|+  +.+..+|.+++++..+.. ..+++ +..+++++|+|+ ++++|+||||||..+.+.+        
T Consensus       158 C~~CPl~~~C~a~--~~~~~~~~kkkkk~~~~~-~~~~~-i~~~~~ilL~k~-~~~~~~Glw~fP~~ke~~~--------  224 (290)
T PRK13910        158 CAICPFNPYCLGK--NHPEKHTLKKKQEIIQEE-RYLGV-VIQNNQIALEKI-EQKLYLGMHHFPNLKENLE--------  224 (290)
T ss_pred             CCCCCCHHHHHCC--CCCCCCCCCCCCCCCCCE-EEEEE-EEECCEEEEEEC-CCCCCCCCCCCCCCCCCCC--------
T ss_conf             8779795552315--876446888887677620-69999-998999999817-8866677665999875556--------


Q ss_pred             CCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCCCCCCCCEEECHHHHHHCCCCHHHHHHHHHCCCCC
Q ss_conf             010100002137358998315999999999807767667813322788874799878999998448237
Q gi|254780479|r  286 SAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQIVIIPDSTWHDAQNLANAALPTVMKKALSAGGIKV  354 (356)
Q Consensus       286 ~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~~~  354 (356)
                           .....++.++|+||||++++++|.+.+++.  ....+|++++|++++|||++++|||+.+..++
T Consensus       225 -----~~~~~l~~ikH~fTH~kl~i~v~~~~~k~~--~~~~~W~~l~el~~l~lps~~kKIL~~Lk~~~  286 (290)
T PRK13910        225 -----FKLPFLGAIKHSHTKFKLNLNLYLAAIKDL--KNPIRFYSLKDLETLPISSMTLKILNFLKQKN  286 (290)
T ss_pred             -----CCHHHHHHCEEEEEEEEEEEEEEEEHHHCC--CCCCCEEEHHHHHHCCCCCHHHHHHHHHHHHC
T ss_conf             -----540233000001565779999998541058--98774316777520889828999999997745


No 5  
>KOG2457 consensus
Probab=100.00  E-value=0  Score=633.95  Aligned_cols=338  Identities=35%  Similarity=0.587  Sum_probs=275.5

Q ss_pred             HHHHHHHHHHHHHCCCCCCCCCCCCCCC-CCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCH-HHHH
Q ss_conf             8999999999996289988754855444-5678704676644673116120332658998774899789872998-9999
Q gi|254780479|r    6 HIIQSKILDWYDTNHRVLPWRTSPKTEK-SSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKD-EEIL   83 (356)
Q Consensus         6 ~~~~~~ll~w~~~~~R~lpwr~~~~~~~-~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~-~~vl   83 (356)
                      +.|+..||+|||.|+||||||+.+...+ +-.|..|.|||||||||||||.||+.||.+||++|||+.+||.|+. +||+
T Consensus        91 ~~fR~sLl~wYD~~KRdLPWR~r~sEde~DwerRaYeVwVSEiMLQQTrV~TV~~YYt~WMqkwPTl~dla~Asl~~eVn  170 (555)
T KOG2457          91 QKFRMSLLDWYDVNKRDLPWRNRRSEDEKDWERRAYEVWVSEIMLQQTRVQTVMKYYTRWMQKWPTLYDLAQASLEKEVN  170 (555)
T ss_pred             HHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH
T ss_conf             99999898876300223864468752135688889999999999989999999999999998375088888878988999


Q ss_pred             HHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH-HHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC
Q ss_conf             997404634588999877675430057868731456888-8877787766776643496313124314666576519754
Q gi|254780479|r   84 SAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKK-LPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK  162 (356)
Q Consensus        84 ~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~-LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~  162 (356)
                      .+|+|||||+|||+|+++|+.+++.++|.||.+.++|++ +||||+||||||+|||||++..+|||||.||+||..+|..
T Consensus       171 ~lWaGlGyY~R~rrL~ega~~vv~~~~ge~Prta~~l~kgvpGVG~YTAGAiaSIAf~q~tGiVDGNVirvlsRalAIhs  250 (555)
T KOG2457         171 ELWAGLGYYRRARRLLEGAKMVVAGTEGEFPRTASSLMKGVPGVGQYTAGAIASIAFNQVTGIVDGNVIRVLSRALAIHS  250 (555)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC
T ss_conf             99841018988899999999999757887887389998518887742310455423047643204615777677676137


Q ss_pred             CCC--HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCC----------------
Q ss_conf             573--56899987410257852267899999999863204123564311000001343101100----------------
Q gi|254780479|r  163 PAP--LYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSH----------------  224 (356)
Q Consensus       163 ~~~--~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~----------------  224 (356)
                      +.+  +....+|.+|+.++++.+|||||||+|||||++|||.+|.|+.||+++.|.||+...++                
T Consensus       251 DcSkgk~~q~~wkLA~qLVDP~RPGDFNQalMELGAt~CTpq~P~CS~CPvss~CrA~q~~Eqn~~i~~g~t~~~dve~~  330 (555)
T KOG2457         251 DCSKGKFFQSSWKLAAQLVDPSRPGDFNQALMELGATLCTPQKPSCSSCPVSSQCRAFQLSEQNRTISVGDTYPTDVEKA  330 (555)
T ss_pred             CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCEECCCCCCCCCCCCCHHHHHHHHHHHHCCEEECCCCCCCCHHHC
T ss_conf             74300688999999998259889883779999836712157998767787288998876764234543488688527552


Q ss_pred             --------------------------CCCCCCCCCCCCCEEEEEEEEEEC-------CCEEEEEECCCCHHHHCCCCCCC
Q ss_conf             --------------------------123443222222112320003413-------76057764572013204744211
Q gi|254780479|r  225 --------------------------LLGINTIKKKRPMRTGAVFIAITN-------DNRILLRKRTNTRLLEGMDELPG  271 (356)
Q Consensus       225 --------------------------~~P~kk~KKkk~~r~~~~~vii~~-------~~kiLL~KRp~~gll~GLwEFP~  271 (356)
                                                .||++..+ ..+... ...++++.       .+.||+.+||..|+++|||+||+
T Consensus       331 ~P~~~~C~vCv~~ips~e~~Qs~gv~~~p~~p~~-~~~reE-~~~v~~~e~~dp~t~~~~ilv~~rp~~gllagLw~fpt  408 (555)
T KOG2457         331 KPRHDFCCVCVLEIPSLERNQSGGVFVLPKRPEQ-LAGREE-FPSVILNEEADPATRRNAILVYLRPAFGLLAGLWKFPT  408 (555)
T ss_pred             CCCCCCEEEEECCCCCCHHHCCCCEEEECCCHHH-CCCCCC-CCEEEEECCCCCHHHCCEEEEEECCCHHHHHHHHHCCC
T ss_conf             8887823686267897133146747971268554-476211-56035542578243101057785442147877643572


Q ss_pred             CCCCCC-CCHHHHHHCC------------CCCCCCEECCCEEEEEEEEEEEEEEEEEE--CCCC--CCCCCCEEECHHHH
Q ss_conf             113467-2158886301------------01000021373589983159999999998--0776--76678133227888
Q gi|254780479|r  272 SAWSST-KDGNIDTHSA------------PFTANWILCNTITHTFTHFTLTLFVWKTI--VPQI--VIIPDSTWHDAQNL  334 (356)
Q Consensus       272 ~e~~~~-~~~~~~~~~~------------~~~~~~~~l~~ikH~fTH~~L~i~v~~~~--~~~~--~~~~~~~Wv~~~el  334 (356)
                      +.+.+. +........-            .......+.|.++|+|||+.+.-+||..-  .+..  ..+.+.+|++..++
T Consensus       409 i~~~e~se~~~~~a~~q~~v~~w~~~~~~t~~~~~~~~G~~~htfshi~~ts~V~~~a~~~~~~vt~~p~~~~wi~q~~l  488 (555)
T KOG2457         409 IVSRELSEFVHIFAHIQRKVYVWLLVVQLTGGTEDLFKGQAKHTFSHICVTSDVLSTAGLTSAVVTVPPFRLQWIKQLSL  488 (555)
T ss_pred             EECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH
T ss_conf             11057522778999999999999623400123133216502200335556888988751578750269733202345446


Q ss_pred             HHCC-CCHHHHH
Q ss_conf             7479-9878999
Q gi|254780479|r  335 ANAA-LPTVMKK  345 (356)
Q Consensus       335 ~~~~-LPs~~kK  345 (356)
                      +..+ ....+++
T Consensus       489 ~h~~~~e~~lk~  500 (555)
T KOG2457         489 DHMVEKEQILKC  500 (555)
T ss_pred             HHHHHHHHHHHH
T ss_conf             899999999888


No 6  
>PRK10702 endonuclease III; Provisional
Probab=100.00  E-value=0  Score=436.32  Aligned_cols=202  Identities=23%  Similarity=0.310  Sum_probs=181.6

Q ss_pred             HHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCH
Q ss_conf             99999962899887548554445678704676644673116120332658998774899789872998999999740463
Q gi|254780479|r   12 ILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGY   91 (356)
Q Consensus        12 ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGy   91 (356)
                      ..++++.=.+.+|+..+    +.++.|||+|||||||+|||++++|.++|++|+++|||+++||+|+++||.++|+|+||
T Consensus         6 ~~~i~~~l~~~~p~~~~----~l~~~~P~~vLVs~ILsqqTtd~~v~~~~~~L~~~~~t~e~la~a~~~el~~~i~~~G~   81 (211)
T PRK10702          6 RLEILTRLRDNNPHPTT----ELNFSSPFELLIAVLLSAQATDVSVNKATAKLYPVANTPAAMLELGVEGVKTYIKTIGL   81 (211)
T ss_pred             HHHHHHHHHHHCCCCCC----CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHHHH
T ss_conf             99999999987869999----85889858999999997418589999999999997799999870999999999998635


Q ss_pred             H-HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHH
Q ss_conf             4-588999877675430057868731456888887778776677664349631312431466657651975457356899
Q gi|254780479|r   92 Y-TRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKT  170 (356)
Q Consensus        92 y-~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~  170 (356)
                      | +||+|||++|++|+++|+|.+|++.++|++|||||+|||+||+|+|||+|+++|||||.||++|++.....   ..+.
T Consensus        82 y~~KA~~L~~~a~~i~~~~~G~vP~~~~~L~~LpGIG~kTA~aIl~~a~~~~~~~VDtnV~RV~~Rlg~~~~~---~~~~  158 (211)
T PRK10702         82 YNSKAENVIKTCRILLEKHNGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQFAPGK---NVEQ  158 (211)
T ss_pred             HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCC---CHHH
T ss_conf             9999999999999999990998766699999876635889999999984998652573599999997657789---9999


Q ss_pred             HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHC
Q ss_conf             98741025785226789999999986320412356431100000134310
Q gi|254780479|r  171 IKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSE  220 (356)
Q Consensus       171 l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~  220 (356)
                      ++..+..++|.+.+.++||+|||||++||+|++|+|..|||++.|.++++
T Consensus       159 ~~~~l~~~~p~~~~~~~~~~li~~G~~iC~~~~P~C~~Cpl~~~C~~~~K  208 (211)
T PRK10702        159 VEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIEDLCEYKEK  208 (211)
T ss_pred             HHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC
T ss_conf             99999982890237999999999950150699993999989144999666


No 7  
>COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
Probab=100.00  E-value=1.6e-39  Score=298.89  Aligned_cols=181  Identities=27%  Similarity=0.466  Sum_probs=166.8

Q ss_pred             CCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHCCCCCCCC
Q ss_conf             870467664467311612033265899877489978987299899999974046345-8899987767543005786873
Q gi|254780479|r   37 PSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYT-RARNLKKCADIIVKKYEGNFPH  115 (356)
Q Consensus        37 ~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~~~g~~P~  115 (356)
                      .|||.++|+-|+.|||+=+.|-+-+.+.+++|||+++|++|++++|-.+-...|+|+ +|+|++++|++|+++|+|.+|.
T Consensus        27 ~~pf~lLva~iLSaqttD~~vn~at~~Lf~~~~t~e~l~~a~~~~l~~~I~~iGlyr~KAk~I~~~~~~l~e~~~g~vP~  106 (211)
T COG0177          27 KDPFELLVAVILSAQTTDEVVNKATPALFKRYPTPEDLLNADEEELEELIKSIGLYRNKAKNIKELARILLEKFGGEVPD  106 (211)
T ss_pred             CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCC
T ss_conf             88389999999944674488999999999975999999749999999999863871899999999999999974999981


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
Q ss_conf             14568888877787766776643496313124314666576519754573568999874102578522678999999998
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG  195 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlG  195 (356)
                      +.++|++|||||++||+.|+++|||.|+++||+||.||..|+......   +-.+++.-...++|.+.+.++|.+|+.||
T Consensus       107 ~~~eL~~LPGVGrKTAnvVL~~a~g~p~i~VDTHV~Rvs~R~gl~~~~---~p~~ve~~L~~~iP~~~~~~~h~~lI~~G  183 (211)
T COG0177         107 TREELLSLPGVGRKTANVVLSFAFGIPAIAVDTHVHRVSNRLGLVPGK---TPEEVEEALMKLIPKELWTDLHHWLILHG  183 (211)
T ss_pred             HHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCCCC---CHHHHHHHHHHHCCHHHHHHHHHHHHHHH
T ss_conf             599997489966577898998655998652124299999984778899---99999999999789788999999999960


Q ss_pred             HHHCCCCCCCCCCCCCCCCHHHHHC
Q ss_conf             6320412356431100000134310
Q gi|254780479|r  196 ALICTSNKPLCPLCPIQKNCLTFSE  220 (356)
Q Consensus       196 a~iC~p~~P~C~~Cpl~~~C~~~~~  220 (356)
                      ..||+|++|+|+.||+++.|.++..
T Consensus       184 R~iC~ar~P~C~~C~l~~~C~~~~~  208 (211)
T COG0177         184 RYICKARKPRCEECPLADLCPSAGK  208 (211)
T ss_pred             HHHCCCCCCCCCCCCCHHHCCHHCC
T ss_conf             5311689998675646554811100


No 8  
>TIGR01083 nth endonuclease III; InterPro: IPR005759    The spectrum of DNA damage caused by reactive oxygen species includes a wide variety of modifications of purine and pyrimidine bases. Among these modified bases, 7,8-dihydro-8-oxoguanine (8-oxoG) is an important mutagenic lesion. Base excision repair is a critical mechanism for preventing mutations by removing the oxidative lesion from the DNA. Escherichia coli Nth protein (endonuclease III) has an 8-oxoG DNA glycosylase/AP lyase activity which removes 8-oxoG preferentially from 8-oxoG/G mispairs. Human hNTH1 protein, a homolog of E. coli Nth protein, has similar DNA glycosylase/AP lyase activity that removes 8-oxoG from 8-oxoG/G mispairs . ; GO: 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0006284 base-excision repair, 0005622 intracellular.
Probab=100.00  E-value=7.9e-38  Score=286.98  Aligned_cols=187  Identities=26%  Similarity=0.424  Sum_probs=163.5

Q ss_pred             CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC---HHHCCHHHHHHHHHCCCHHHHHCCCH
Q ss_conf             716899999999999628998875485544456787046766446731161---20332658998774899789872998
Q gi|254780479|r    3 QPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTT---VKTVEPYFKKFMQKWPTIFCLSSAKD   79 (356)
Q Consensus         3 ~~~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~---v~~v~~~~~~~~~~~P~~~~la~a~~   79 (356)
                      ++++.+-++|.+-|           +....+..+.|||.++||-|+.+|++   |+.|.|   +++++|||+++||.|++
T Consensus         2 ~~~~~il~~L~~~y-----------P~p~tEL~~~~PFeLLVAtiLSAQ~TD~~VNkaT~---~LF~~Y~tp~~~a~a~~   67 (192)
T TIGR01083         2 QKAQEILERLRKLY-----------PHPTTELDYKNPFELLVATILSAQATDKSVNKATK---KLFEVYPTPQALAAAGL   67 (192)
T ss_pred             CHHHHHHHHHHHHC-----------CCCEEEEEECCCHHHHHHHHHHHHHCCHHHHHCCH---HHHHCCCCHHHHHCCCH
T ss_conf             40789999999738-----------99725433307078999999986531326763167---86512778689960893


Q ss_pred             HHHHHHHHCCCHHH-HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHH
Q ss_conf             99999974046345-88999877675430057868731456888887778776677664349631312431466657651
Q gi|254780479|r   80 EEILSAWAGLGYYT-RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYF  158 (356)
Q Consensus        80 ~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~  158 (356)
                      |||..+-.--|||+ .|+||.++|++|+++|+|++|+|.++|++||||||.||+-|++.|||.|+++||+||.||..|+ 
T Consensus        68 eel~~~Ik~iGlYr~KAk~I~~~~~~LvE~y~GeVP~~~~eL~~LPGVGRKTANVVL~~aFg~P~iAVDTHv~Rv~~Rl-  146 (192)
T TIGR01083        68 EELEEYIKSIGLYRNKAKNIIALCRKLVERYGGEVPEDREELVKLPGVGRKTANVVLNVAFGIPAIAVDTHVFRVSNRL-  146 (192)
T ss_pred             HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHCCCEEEECCCHHHHHHHH-
T ss_conf             1347764225864568999999999999981898775537661789987114562433442687057414346554331-


Q ss_pred             CCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC
Q ss_conf             975457356899987410257852267899999999863204123564
Q gi|254780479|r  159 DIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLC  206 (356)
Q Consensus       159 ~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C  206 (356)
                      |+....+.  ..+++-..+++|.+.+.+++-+|+-||..||++|+|.|
T Consensus       147 gl~~~~dp--~~vE~~L~~l~P~~~w~~~hh~lIlHGRy~CkAr~P~C  192 (192)
T TIGR01083       147 GLSKGKDP--DKVEEELLKLIPKEFWTKLHHWLILHGRYTCKARKPRC  192 (192)
T ss_pred             CCCCCCCH--HHHHHHHHHHCCCCCHHHCCHHHHHHCCCCCCCCCCCC
T ss_conf             35777898--99999998744850012223675431111116888889


No 9  
>smart00478 ENDO3c endonuclease III. includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Probab=100.00  E-value=2.8e-37  Score=283.04  Aligned_cols=148  Identities=29%  Similarity=0.428  Sum_probs=139.9

Q ss_pred             HHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH
Q ss_conf             731161203326589987748997898729989999997404634-5889998776754300578687314568888877
Q gi|254780479|r   48 MLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYY-TRARNLKKCADIIVKKYEGNFPHKVEILKKLPGI  126 (356)
Q Consensus        48 mlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGi  126 (356)
                      |.|||++++|.++|++|+++|||+++||+|+++||..+|+|+||| +||++|+++|++|+++|+|.+|++.++|++||||
T Consensus         1 LSqqt~~~~v~~~~~~l~~~~pt~~~l~~a~~~~l~~~i~~~g~~~~ka~~i~~~a~~i~~~~~~~~p~~~~~L~~lpGV   80 (149)
T smart00478        1 LSQQTSDEAVNKATERLFEKFPTPEDLAAADEEELEELIRPLGFYRRKAKYLIELARILVEEYGGEVPDDREELLKLPGV   80 (149)
T ss_pred             CCCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCC
T ss_conf             99865289999999999998859999986899999999998688999999999999999986655588559998758986


Q ss_pred             HHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH
Q ss_conf             78776677664349631312431466657651975457356899987410257852267899999999863
Q gi|254780479|r  127 GDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGAL  197 (356)
Q Consensus       127 G~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~  197 (356)
                      |+|||++|++++||++.++||+||.||++|++.++...  ..+.++..++.++|.+.+++||++|||+|++
T Consensus        81 G~~tA~~vl~~~~~~~~~~vD~~v~Rv~~R~~~~~~~~--~~~~~~~~l~~~~p~~~~~~~~~~l~~~G~~  149 (149)
T smart00478       81 GRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKS--TPEEVEKLLEKLLPKEDWRELNLLLIDFGRT  149 (149)
T ss_pred             CHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCC--CHHHHHHHHHHHCCCCCHHHHHHHHHHCCCC
T ss_conf             59999999999879983513413999999984788889--8999999999878934399999999981899


No 10 
>cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Probab=100.00  E-value=6.7e-34  Score=259.28  Aligned_cols=154  Identities=33%  Similarity=0.482  Sum_probs=146.3

Q ss_pred             HHHHHHHHHHHHCCHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCC---CC
Q ss_conf             46766446731161203326589987748-997898729989999997404634588999877675430057868---73
Q gi|254780479|r   40 YKVWISEIMLQQTTVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNF---PH  115 (356)
Q Consensus        40 y~v~vseimlqqT~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~---P~  115 (356)
                      |.+||++||+|||++++|...|.+|.++| ||+++||+++++++..+|.|+||++||++|+++|+.|.+.++|..   |+
T Consensus         1 f~~Li~~Il~qq~s~~~a~~~~~~l~~~~~pt~~~l~~~~~~~l~~~~~~~gy~~Ka~~i~~~a~~i~~~~~~~~~~~~~   80 (158)
T cd00056           1 FEVLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYLKELARAIVEGFGGLVLDDPD   80 (158)
T ss_pred             CHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHH
T ss_conf             99999999981452999999999999854998999980999999999733568999999999888889860895789888


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
Q ss_conf             14568888877787766776643496313124314666576519754573568999874102578522678999999998
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG  195 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlG  195 (356)
                      +.++|++|||||+|||++|++++|+.++++||+||.||++|++++..+  ...++++..++.++|....+++|++||++|
T Consensus        81 ~~~~L~~l~GIG~~TA~~vl~~~~~~~~~~vD~~v~R~~~rl~~~~~~--~~~~~~~~~~~~~~p~~~~~~~~~~L~~~g  158 (158)
T cd00056          81 AREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLKRLGLIPKK--KTPEELEELLEELLPKPYWGEANQALMDLG  158 (158)
T ss_pred             HHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCC--CCHHHHHHHHHHHCCCCHHHHHHHHHHHCC
T ss_conf             999987589828999999999987998362259999999995787789--999999999998589201999999998676


No 11 
>KOG1921 consensus
Probab=99.92  E-value=6e-25  Score=196.25  Aligned_cols=181  Identities=24%  Similarity=0.380  Sum_probs=154.1

Q ss_pred             HHHHHHHHHHHHHCCHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHHH-HHHHHHHHHHHHHCCCCCCCCH
Q ss_conf             046766446731161203326589987748-99789872998999999740463458-8999877675430057868731
Q gi|254780479|r   39 PYKVWISEIMLQQTTVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYTR-ARNLKKCADIIVKKYEGNFPHK  116 (356)
Q Consensus        39 py~v~vseimlqqT~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~R-ar~l~~~a~~i~~~~~g~~P~~  116 (356)
                      -|.|+|+-.+.-||.=.+.-.--.+..+.- =|++++.++++..+-.+-...|||+| |.+|+++|+|+.++|+|.+|.+
T Consensus        78 RfqvLv~lmLSSQTKDevt~~Am~rL~~~~gLT~e~v~~~de~~l~~LI~~VgFy~rKA~ylkkta~IL~d~f~gDIP~~  157 (286)
T KOG1921          78 RFQVLVGLMLSSQTKDEVTAAAMLRLKEYGGLTLEAVLKIDEPTLNELIYPVGFYTRKAKYLKKTAKILQDKFDGDIPDT  157 (286)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHH
T ss_conf             59999999970100788899999999985597899986167576876500012157888999999999998707997555


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHCCCEEEE-CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
Q ss_conf             45688888777877667766434963131-24314666576519754573568999874102578522678999999998
Q gi|254780479|r  117 VEILKKLPGIGDYTASAIVAIAFNHFAVV-VDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG  195 (356)
Q Consensus       117 ~~~l~~LpGiG~yta~ai~s~a~~~~~~~-vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlG  195 (356)
                      .++|++||||||..|.-+|++|+|.-+.+ ||+||.|+-.|+..++... +.-.+-+.-.+.++|...+...|--|.-||
T Consensus       158 v~dLlsLPGVGPKMa~L~m~~AWn~i~GI~VDtHVHRi~nrlgWv~~kt-kspE~TR~aLq~wLPk~lW~eIN~lLVGFG  236 (286)
T KOG1921         158 VEDLLSLPGVGPKMAHLTMQVAWNKIVGICVDTHVHRICNRLGWVDTKT-KSPEQTRVALQQWLPKSLWVEINHLLVGFG  236 (286)
T ss_pred             HHHHHCCCCCCHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHCCCCCCC-CCHHHHHHHHHHHCCHHHHHHHHCEEECCC
T ss_conf             9988558997659999999998566405786328999888756335666-887899999998676877756201055145


Q ss_pred             HHHCCCCCCCCCCCCC-CCCHHHHHC
Q ss_conf             6320412356431100-000134310
Q gi|254780479|r  196 ALICTSNKPLCPLCPI-QKNCLTFSE  220 (356)
Q Consensus       196 a~iC~p~~P~C~~Cpl-~~~C~~~~~  220 (356)
                      .+||+|+.|+|+.|-+ +..|.|...
T Consensus       237 Q~iC~p~~prC~~C~~~~~~Cpss~~  262 (286)
T KOG1921         237 QTICTPRRPRCGLCLLSRDLCPSSFK  262 (286)
T ss_pred             CEEEECCCCCCCCCCCCCCCCCHHHH
T ss_conf             21453279986332357566842444


No 12 
>pfam00730 HhH-GPD HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family.
Probab=99.89  E-value=8.8e-23  Score=180.98  Aligned_cols=122  Identities=34%  Similarity=0.512  Sum_probs=113.8

Q ss_pred             HHHHHHHHCCHHHCCHHHHHHHHH--CCCHHHHHCCCHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHCCCCCCCCHHHH-
Q ss_conf             644673116120332658998774--8997898729989999997404634-588999877675430057868731456-
Q gi|254780479|r   44 ISEIMLQQTTVKTVEPYFKKFMQK--WPTIFCLSSAKDEEILSAWAGLGYY-TRARNLKKCADIIVKKYEGNFPHKVEI-  119 (356)
Q Consensus        44 vseimlqqT~v~~v~~~~~~~~~~--~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~~g~~P~~~~~-  119 (356)
                      |+.|+.|||+++++...|.+|+++  |||++++++++++++..+|.++||| +||++|+.+|+.+.++|+|..|.+.++ 
T Consensus         1 V~~IlsQq~s~~~a~~i~~rl~~~~~~pt~~~l~~~~~~~l~~~i~~~G~~~~Ka~~I~~~a~~~~~~~~~~~~~~~~~~   80 (144)
T pfam00730         1 VSAILSQQTSDKAANKITKRLFERYGFPTPEDLAEADEEELRELIKGLGFYRRKAKYIKELARILVEGYLGLVPLDLEEL   80 (144)
T ss_pred             CEEEECCCCHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH
T ss_conf             95542012349999999999999828989999985999999999870897699999999999888986289788611569


Q ss_pred             --HHHHHHHHHHHHHHHHHHHCCCE--EEECCCCHHHHHHHHHCCCCCCC
Q ss_conf             --88888777877667766434963--13124314666576519754573
Q gi|254780479|r  120 --LKKLPGIGDYTASAIVAIAFNHF--AVVVDTNIERIISRYFDIIKPAP  165 (356)
Q Consensus       120 --l~~LpGiG~yta~ai~s~a~~~~--~~~vD~Nv~RVl~R~~~~~~~~~  165 (356)
                        |++|||||++||..++.++|+.+  .+++|+||.|++.|++.++...+
T Consensus        81 ~~L~~l~GIG~~ta~~~l~~~~~~~d~~~~~D~~v~r~~~rl~~~~~~~~  130 (144)
T pfam00730        81 EALLALPGVGRWTAEAVLLFALGRPDVFPAVDTHVRRVAKRLGLIKEKPT  130 (144)
T ss_pred             HHHHCCCCCCHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHHCCCCCCCC
T ss_conf             99860889769999999999869998732644999999999779989999


No 13 
>cd03431 DNA_Glycosylase_C DNA glycosylase (MutY in bacteria and hMYH in humans) is responsible for repairing misread  A*oxoG residues to C*G by removing the inappropriately paired adenine base from the DNA backbone. It belongs to the Nudix hydrolase superfamily and is important for the repair of various genotoxic lesions. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity. They are also recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA glycosylase does not seem to contain this signature motif. DNA glycosylase consists of 2 domains: the N-terminal domain contains the catalytic properties of the enzyme and the C-terminal domain affects substrate (oxoG) binding and enzymatic turnover. The C-terminal domain is highly similar to MutT, based on secondary structure and topology, despite low sequence identity. MutT sanitizes the nucleotide precursor pool by hydrolyzing oxo-dGTP to 
Probab=99.84  E-value=2e-20  Score=164.41  Aligned_cols=112  Identities=34%  Similarity=0.526  Sum_probs=88.3

Q ss_pred             EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHH---HCCCCCCCCEECCCEEEEEEEEEEEEEEEEE
Q ss_conf             2320003413760577645720132047442111134672158886---3010100002137358998315999999999
Q gi|254780479|r  239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDT---HSAPFTANWILCNTITHTFTHFTLTLFVWKT  315 (356)
Q Consensus       239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~---~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~  315 (356)
                      ...++++++.+|+|||+|||++|+|+||||||++++++........   ...........+++|+|+||||++++++|.+
T Consensus         3 ~~~~~~ii~~~~~iLl~kRp~~Gll~gLwefP~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~H~fSH~~l~~~~~~~   82 (118)
T cd03431           3 RGIAVVVIRNDGRVLLEKRPEKGLLAGLWEFPSVEWEEEADGEEALLSALKKALRLSLEPLGTVKHTFTHFRLTLHVYLA   82 (118)
T ss_pred             EEEEEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEE
T ss_conf             99999999779999999889998456530078665565646279998766655301221044041342417999999999


Q ss_pred             ECCCC-CCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             80776-766781332278887479987899999844
Q gi|254780479|r  316 IVPQI-VIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       316 ~~~~~-~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      .+... ....++.|++.+++++++||++++|+++.+
T Consensus        83 ~~~~~~~~~~~~~Wv~~~~l~~~~~p~~~~Kil~~l  118 (118)
T cd03431          83 RLEGDLLAPDEGRWVPLEELDEYALPTVMRKILELL  118 (118)
T ss_pred             EECCCCCCCCCCEEEEHHHCCCCCCCHHHHHHHHHC
T ss_conf             963798789888997678854068987999999759


No 14 
>PRK13913 3-methyladenine DNA glycosylase; Provisional
Probab=99.63  E-value=2.5e-15  Score=128.47  Aligned_cols=141  Identities=16%  Similarity=0.205  Sum_probs=116.9

Q ss_pred             CCHHHHHHHHHHHHHCCHHHCCHHHHHHHHH-------CCCHHHHHCCCHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHC
Q ss_conf             8704676644673116120332658998774-------89978987299899999974046345-889998776754300
Q gi|254780479|r   37 PSPYKVWISEIMLQQTTVKTVEPYFKKFMQK-------WPTIFCLSSAKDEEILSAWAGLGYYT-RARNLKKCADIIVKK  108 (356)
Q Consensus        37 ~~py~v~vseimlqqT~v~~v~~~~~~~~~~-------~P~~~~la~a~~~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~  108 (356)
                      ..|+.|.|.-|+-|+|.=..|..-..+.-+.       --++..+++.+++++-.+-..-|||+ +|++|+..|+.++.+
T Consensus        28 ~~~FEvivGAILtQNT~W~nVekAl~nLk~a~lL~~~~~~~l~~i~~l~~e~La~lIrPaGFy~~KA~rLk~l~~~~~~d  107 (218)
T PRK13913         28 ALKFEALLGAVLTQNTKFEAVLKSLENLKNAFILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDLSKNILKD  107 (218)
T ss_pred             CCCCEEEEEEEEHHCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
T ss_conf             99765534411004187889999999999766778644159999971899999999504015899999999999999987


Q ss_pred             CCC----CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC
Q ss_conf             578----6873145688888777877667766434963131243146665765197545735689998741025785
Q gi|254780479|r  109 YEG----NFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITST  181 (356)
Q Consensus       109 ~~g----~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~  181 (356)
                      |++    .-+.++++|++++|||+-||.+|+..||++|+.+||...+|+++|+ |+... +  -..++.+.+.-++.
T Consensus       108 ~~~~~~~~~~~~Re~LL~lkGIG~ETADsILlYa~~~p~FVVDaYT~Ri~~rl-G~~~~-~--Ydelq~~fe~~l~e  180 (218)
T PRK13913        108 FQSFENFKQEVTREWLLDQKGIGKESADAILCYVCAKEVMVVDKYSYLFLKKL-GIEIE-D--YDELQHFFEKGVQE  180 (218)
T ss_pred             HHHHHHCCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEECHHHHHHHHHHC-CCCCC-C--HHHHHHHHHHCCHH
T ss_conf             52575145365899997489866333999999974998451118899999981-99857-9--99999999962277


No 15 
>COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]
Probab=99.62  E-value=1.3e-14  Score=123.33  Aligned_cols=178  Identities=23%  Similarity=0.361  Sum_probs=140.1

Q ss_pred             CHHHHHHHHHHHHHCCHHHCCHHHHHHHHH-CCCHHHHHCCCHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHCCC---CC
Q ss_conf             704676644673116120332658998774-89978987299899999974046345-88999877675430057---86
Q gi|254780479|r   38 SPYKVWISEIMLQQTTVKTVEPYFKKFMQK-WPTIFCLSSAKDEEILSAWAGLGYYT-RARNLKKCADIIVKKYE---GN  112 (356)
Q Consensus        38 ~py~v~vseimlqqT~v~~v~~~~~~~~~~-~P~~~~la~a~~~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~~~---g~  112 (356)
                      +-+.|+++-|+-|.|.=+.|..--+..-+. --+++++..-+++++..+-.+-|||+ .|.+|....+.++..|.   +.
T Consensus        29 ~~~EiiigAILtQNT~WknvekAlenLk~~~~~~l~~I~~~~~~~L~elIrpsGFYnqKa~rLk~l~k~l~~~~~~~~~~  108 (215)
T COG2231          29 NKDEIIIGAILTQNTSWKNVEKALENLKNEGILNLKKILKLDEEELAELIRPSGFYNQKAKRLKALSKNLAKFFINLESF  108 (215)
T ss_pred             CCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC
T ss_conf             72068998898524529999999999988156799998458999999987042408999999999999999986423111


Q ss_pred             CCC-HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCC--HHHHHH
Q ss_conf             873-14568888877787766776643496313124314666576519754573568999874102578522--678999
Q gi|254780479|r  113 FPH-KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSR--PGDFVQ  189 (356)
Q Consensus       113 ~P~-~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~--~~~~nQ  189 (356)
                      -+. ..++|+++.|||.-||.||+.+||++|+.+||.-.+|+++|++++... +  ..++..+.+.-+|.+-  .-.|.-
T Consensus       109 ~~~~~R~~LL~iKGIG~ETaDsILlYa~~rp~FVvD~YtrR~l~rlg~i~~k-~--ydeik~~fe~~l~~~~~lyqe~HA  185 (215)
T COG2231         109 KSEVLREELLSIKGIGKETADSILLYALDRPVFVVDKYTRRLLSRLGGIEEK-K--YDEIKELFEENLPENLRLYQEFHA  185 (215)
T ss_pred             CHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC-C--HHHHHHHHHHCCHHHHHHHHHHHH
T ss_conf             5188999987268866223999999980486446329999999994551025-4--999999998535067999999999


Q ss_pred             HHHHHHHHHCCCCCCCCCCCCCCCCHHHHH
Q ss_conf             999998632041235643110000013431
Q gi|254780479|r  190 AMMDLGALICTSNKPLCPLCPIQKNCLTFS  219 (356)
Q Consensus       190 AlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~  219 (356)
                      -+..+|-.-|.-+ |.|+.||+...|..+.
T Consensus       186 lIv~~~K~f~~k~-~~~~~cpL~~~~~~~~  214 (215)
T COG2231         186 LIVEHAKHFCKKK-PLCEKCPLKEKCKKYR  214 (215)
T ss_pred             HHHHHHHHHCCCC-CCCCCCHHHHHHHHCC
T ss_conf             9999999981588-6778765899875226


No 16 
>PRK08999 hypothetical protein; Provisional
Probab=99.61  E-value=4.7e-15  Score=126.55  Aligned_cols=118  Identities=18%  Similarity=0.140  Sum_probs=97.8

Q ss_pred             CEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHH------HHCCCCCCCCEECCCEEEEEEEEEEEE
Q ss_conf             11232000341376057764572013204744211113467215888------630101000021373589983159999
Q gi|254780479|r  237 MRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNID------THSAPFTANWILCNTITHTFTHFTLTL  310 (356)
Q Consensus       237 ~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~------~~~~~~~~~~~~l~~ikH~fTH~~L~i  310 (356)
                      .....+.++++.+|+||+.||++++.|+|+||||+++.+.++.....      ++.............+.|.|.|+.+.+
T Consensus         4 ~i~V~~gvi~~~~~~vLi~~R~~~~~~~g~WEFPGGKvE~~Et~~~aL~REl~EElgI~v~~~~~~~~~~h~Y~~~~v~L   83 (312)
T PRK08999          4 RVHVAAAVIRDADGRILLARRPEGKHQGGLWEFPGGKVEPGETVEQALARELREELGIEVTAARPLITVPHDYPDKRVRL   83 (312)
T ss_pred             EEEEEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCEEEEE
T ss_conf             79999999985899699997899998788464898867999899999999988870927632138899999649849999


Q ss_pred             EEEEEEC----CCCCCCCCCEEECHHHHHHCCCCHHHHHHHHHCCCCC
Q ss_conf             9999980----7767667813322788874799878999998448237
Q gi|254780479|r  311 FVWKTIV----PQIVIIPDSTWHDAQNLANAALPTVMKKALSAGGIKV  354 (356)
Q Consensus       311 ~v~~~~~----~~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~~~  354 (356)
                      ++|.+..    +...+.+..+|++++++.+++||.+++.||.++.++.
T Consensus        84 ~~~~v~~~~G~~~~~EgQ~~~Wv~~~~l~~~~fp~AN~~Ii~~l~LP~  131 (312)
T PRK08999         84 DVREVTAWQGEPHGREGQPLAWVAPDELAVYPFPPANQPIVRALRLPD  131 (312)
T ss_pred             EEEEEEEECCCCCCCCCCEEEEECHHHCCCCCCCCCCHHHHHHHCCCC
T ss_conf             999987307700465687566776533545899999869998503586


No 17 
>PRK10776 nucleoside triphosphate pyrophosphohydrolase; Provisional
Probab=99.58  E-value=1.4e-14  Score=123.24  Aligned_cols=113  Identities=15%  Similarity=0.139  Sum_probs=91.7

Q ss_pred             EEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH------CCCCCCCCEECCCEEEEEEEEEEEEE
Q ss_conf             123200034137605776457201320474421111346721588863------01010000213735899831599999
Q gi|254780479|r  238 RTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH------SAPFTANWILCNTITHTFTHFTLTLF  311 (356)
Q Consensus       238 r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~------~~~~~~~~~~l~~ikH~fTH~~L~i~  311 (356)
                      ....++|+++.+|++|+.||+.++.++|+||||+++.+.++.......      ..........+..+.|.|+|+.+.++
T Consensus         4 i~V~~aii~~~~~~~Li~~R~~~~~~~g~WEFPGGKvE~gEt~~~Al~REl~EElgi~v~~~~~~~~~~~~y~~~~i~l~   83 (129)
T PRK10776          4 LQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEAGETPEQALIRELQEEVGITPQQATLFEKLEYEFPDRHITLW   83 (129)
T ss_pred             EEEEEEEEEECCCEEEEEECCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHCEEEEECCEEEEEEEECCCEEEEEE
T ss_conf             89999999948998999996999988985919928527997989999999877517167605299999998998099999


Q ss_pred             EEEEECC----CCCCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             9999807----76766781332278887479987899999844
Q gi|254780479|r  312 VWKTIVP----QIVIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       312 v~~~~~~----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      +|.+..-    .....++.+|++.++|.++++|.+.++||+.|
T Consensus        84 ~f~~~~~~G~~~~~e~~~~~Wv~~~eL~~~~~~~ad~~ii~~L  126 (129)
T PRK10776         84 FWLVESWEGEPWGKEGQPGRWVSQVALNADDFPPANEPIIAKL  126 (129)
T ss_pred             EEEEEECCCEECCCCCCCCEEEEHHHCCCCCCCCCCHHHHHHH
T ss_conf             9999971699975657675997788836599786849999998


No 18 
>cd03425 MutT_pyrophosphohydrolase The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi). This enzyme requires two divalent cations for activity; one coordinates the phosphoryl groups of the NTP/dNTP substrate, and the other coordinates to the enzyme. It also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as metal binding and catalytic site. MutT pyrophosphohydrolase is important in preventing errors in DNA replication by hydrolyzing mutagenic nucleotides such as 8-oxo-dGTP (a product of oxidative damage), which can mispair with template adenine during DNA replication, to guanine nucleotides.
Probab=99.54  E-value=5.2e-14  Score=119.16  Aligned_cols=111  Identities=21%  Similarity=0.255  Sum_probs=89.7

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHH------HHCCCCCCCCEECCCEEEEEEEEEEEEEEE
Q ss_conf             32000341376057764572013204744211113467215888------630101000021373589983159999999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNID------THSAPFTANWILCNTITHTFTHFTLTLFVW  313 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~------~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~  313 (356)
                      ..+++|++++|++||.||++++.++|+|+||++..+.+++....      ++..........++.+.|.++|+++.+.+|
T Consensus         3 Vv~avI~~~~g~vLl~~R~~~~~~~g~W~~PGG~ve~gE~~~~a~~RE~~EEtGl~v~~~~~~~~~~~~~~~~~~~l~~f   82 (124)
T cd03425           3 VVAAIIIDDDGRILIAQRPAGKHLGGLWEFPGGKVEPGETPEQALVRELREELGIEVEVGELLATVEHDYPDKRVTLHVF   82 (124)
T ss_pred             EEEEEEEECCCEEEEEECCCCCCCCCEEECCEEEECCCCCHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCEEEEEEEE
T ss_conf             99999997999899998689999999497987679699999999999999986970467469999989889918999999


Q ss_pred             EEECC----CCCCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             99807----76766781332278887479987899999844
Q gi|254780479|r  314 KTIVP----QIVIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       314 ~~~~~----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      .+...    .....++++|+++++|.+++||.++++||++|
T Consensus        83 ~~~~~~g~~~~~E~~~~~Wv~~~el~~l~~~~a~~~il~~L  123 (124)
T cd03425          83 LVELWSGEPQLLEHQELRWVPPEELDDLDFPPADVPIVAAL  123 (124)
T ss_pred             EEEECCCCCCCCCCCCCEEEEHHHCCCCCCCCCHHHHHHHH
T ss_conf             98970780833645365980299936499982618999974


No 19 
>PRK10546 pyrimidine (deoxy)nucleoside triphosphate pyrophosphohydrolase; Provisional
Probab=99.47  E-value=4.1e-13  Score=112.90  Aligned_cols=110  Identities=15%  Similarity=0.079  Sum_probs=84.6

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCC------CCCCCCEECCCEEEEEEEEEEEEEEE
Q ss_conf             320003413760577645720132047442111134672158886301------01000021373589983159999999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSA------PFTANWILCNTITHTFTHFTLTLFVW  313 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~------~~~~~~~~l~~ikH~fTH~~L~i~v~  313 (356)
                      ..+.+ +.++|++|+.||+..+.++|+||||+++.+.+++.......+      ........+.++.|.++|+.+++++|
T Consensus         6 Vvaai-i~~~~rvLi~~R~~~~~~~g~WEFPGGKvE~gEs~~~Al~REl~EElgi~v~~~~~l~~~~~~~~~~~i~l~~~   84 (135)
T PRK10546          6 VVAAI-IERDGKILLAQRPAHSDQAGLWEFAGGKVEPGESQPQALIRELREELGIEATVGEYIASHQREVSGRIIHLHAW   84 (135)
T ss_pred             EEEEE-EEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCEEEEEEEE
T ss_conf             99999-99599999998899999998592983216898798999988878750958984568999988779989999999


Q ss_pred             EEECC----CCCCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             99807----76766781332278887479987899999844
Q gi|254780479|r  314 KTIVP----QIVIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       314 ~~~~~----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      .+..-    .....++.+|++.+++.+++||.+...||+++
T Consensus        85 ~~~~~~g~~~~~e~~~~~Wv~~~el~~~~~~pAd~~il~~l  125 (135)
T PRK10546         85 HVPDFHGTLQAHEHQALVWCTPEEALRYPLAPADIPLLEAF  125 (135)
T ss_pred             EEEEECCEEEECCCCCCEEEEHHHHCCCCCCCCCHHHHHHH
T ss_conf             99988881821456756992389935499885859999999


No 20 
>TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg); InterPro: IPR004577 All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. The enzyme incises DNA at 8-oxoG residues, and excises 7,8-dihydro-8-oxoguanine from damaged DNA. It has beta-lyase activity that nicks DNA 3' to the lesion.; GO: 0008534 oxidized purine base lesion DNA N-glycosylase activity, 0006281 DNA repair, 0005634 nucleus.
Probab=99.11  E-value=8.4e-11  Score=96.60  Aligned_cols=123  Identities=25%  Similarity=0.480  Sum_probs=93.2

Q ss_pred             CCHHHHHHHHHHHHHCCHHHCCHHHHHH---------------HHHCCCHHHHHC---CCHHHHHHH------------H
Q ss_conf             8704676644673116120332658998---------------774899789872---998999999------------7
Q gi|254780479|r   37 PSPYKVWISEIMLQQTTVKTVEPYFKKF---------------MQKWPTIFCLSS---AKDEEILSA------------W   86 (356)
Q Consensus        37 ~~py~v~vseimlqqT~v~~v~~~~~~~---------------~~~~P~~~~la~---a~~~~vl~~------------w   86 (356)
                      +||++.++|=|-.----|++...--++.               ..-||+.+.||.   .+.||+-..            -
T Consensus       146 kdP~EcliSfIcSsNnni~RiTrm~e~lc~~fG~~~~~~dgvtyH~FP~~~~LtgvaeGsledl~~~E~nlPsdfsfn~L  225 (379)
T TIGR00588       146 KDPFECLISFICSSNNNIARITRMVERLCQAFGPRLITLDGVTYHGFPSLHALTGVAEGSLEDLPEAEANLPSDFSFNHL  225 (379)
T ss_pred             CCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH
T ss_conf             78715667788750465212543389999862203212246211479875665212100166799987515643462657


Q ss_pred             --HCCCHHHHHHHHHHHHHHHHHCCCCC-C----------------CCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE-C
Q ss_conf             --40463458899987767543005786-8----------------73145688888777877667766434963131-2
Q gi|254780479|r   87 --AGLGYYTRARNLKKCADIIVKKYEGN-F----------------PHKVEILKKLPGIGDYTASAIVAIAFNHFAVV-V  146 (356)
Q Consensus        87 --~gLGyy~Rar~l~~~a~~i~~~~~g~-~----------------P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~-v  146 (356)
                        =||||  |||.+.++|+.|+++-++. +                -+-.+.|+.|||||++.|.+|+..|++++.++ |
T Consensus       226 R~lG~GY--RA~Yi~~tar~l~ee~~~~nitsdta~LQ~ic~~~~Yedar~~L~~l~GVG~KVADCicLmgl~k~~avPV  303 (379)
T TIGR00588       226 RKLGLGY--RARYIRETARALLEEQGGRNITSDTAWLQQICKDADYEDAREALLELPGVGPKVADCICLMGLDKPQAVPV  303 (379)
T ss_pred             HHCCCCC--CCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCEEE
T ss_conf             5248875--40689999999884126642002357999860668867899997216999704888888652278971011


Q ss_pred             CCCHHHHHHHHHCCC
Q ss_conf             431466657651975
Q gi|254780479|r  147 DTNIERIISRYFDII  161 (356)
Q Consensus       147 D~Nv~RVl~R~~~~~  161 (356)
                      |.+|.||..|-|.+.
T Consensus       304 DVh~~~Ia~rdy~~s  318 (379)
T TIGR00588       304 DVHVRRIAKRDYQWS  318 (379)
T ss_pred             HHHHHHHHHHCCCCC
T ss_conf             156888864403531


No 21 
>KOG2875 consensus
Probab=99.10  E-value=1.2e-10  Score=95.46  Aligned_cols=115  Identities=25%  Similarity=0.461  Sum_probs=93.7

Q ss_pred             CCHHHHHHHHHHHHHCCHHHCCHHHHHHHH---------------HCCCHHHHHC-CCHHHHHHHHHCCCHHHHHHHHHH
Q ss_conf             870467664467311612033265899877---------------4899789872-998999999740463458899987
Q gi|254780479|r   37 PSPYKVWISEIMLQQTTVKTVEPYFKKFMQ---------------KWPTIFCLSS-AKDEEILSAWAGLGYYTRARNLKK  100 (356)
Q Consensus        37 ~~py~v~vseimlqqT~v~~v~~~~~~~~~---------------~~P~~~~la~-a~~~~vl~~w~gLGyy~Rar~l~~  100 (356)
                      +||+..+.|=|-.+.--+++...--++|..               .|||++.|+. +.|.++.++  |+||  |||.+-.
T Consensus       115 QdP~E~lfSFiCSSNNNIaRIT~Mve~fc~~fG~~i~~~dg~~~h~FPsl~~L~g~~~Ea~LR~~--gfGY--RAkYI~~  190 (323)
T KOG2875         115 QDPIECLFSFICSSNNNIARITGMVERFCQAFGPRIIQLDGVDYHGFPSLQALAGPEVEAELRKL--GFGY--RAKYISA  190 (323)
T ss_pred             CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHCCHHHHHHHHHC--CCCH--HHHHHHH
T ss_conf             17197889887257875999999999999861750375557120368547776474769999981--7644--6899999


Q ss_pred             HHHHHHHCCCC----------CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEE-ECCCCHHHHHH
Q ss_conf             76754300578----------687314568888877787766776643496313-12431466657
Q gi|254780479|r  101 CADIIVKKYEG----------NFPHKVEILKKLPGIGDYTASAIVAIAFNHFAV-VVDTNIERIIS  155 (356)
Q Consensus       101 ~a~~i~~~~~g----------~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~-~vD~Nv~RVl~  155 (356)
                      +|+.|+++++|          .+-+-.+.|..|||||+..|.+|+.+++|.... +||.||.|+..
T Consensus       191 ta~~l~~~~g~~~wLqslr~~~yeear~~L~~lpGVG~KVADCI~Lm~l~~~~~VPVDvHi~ria~  256 (323)
T KOG2875         191 TARALQEKQGGLAWLQSLRKSSYEEAREALCSLPGVGPKVADCICLMSLDKLSAVPVDVHIWRIAQ  256 (323)
T ss_pred             HHHHHHHHCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHH
T ss_conf             999999723500599988545289999998528887614756223120588765562245888762


No 22 
>COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=99.07  E-value=3.4e-09  Score=85.24  Aligned_cols=164  Identities=20%  Similarity=0.309  Sum_probs=117.1

Q ss_pred             HHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHH----------CCCHHHHHCCCHH
Q ss_conf             999999962899887548554445678704676644673116120332658998774----------8997898729989
Q gi|254780479|r   11 KILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQK----------WPTIFCLSSAKDE   80 (356)
Q Consensus        11 ~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~----------~P~~~~la~a~~~   80 (356)
                      .|-.+++..+. +|.+..++..-....|||..+||-|.-||--++.-...+.++.+.          |||++.|+.++++
T Consensus        78 ~l~~i~~~~~~-~~~~~~~g~~~~~~~d~fe~lv~aI~~QqvS~~~A~~i~~rl~~~~g~~~~~~~~fptpe~l~~~~~~  156 (285)
T COG0122          78 DLAPIIDALGP-LPLLRAPGLRLPLAPDPFEALVRAILSQQVSVAAAAKIWARLVSLYGNALEIYHSFPTPEQLAAADEE  156 (285)
T ss_pred             CHHHHHHHCCC-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCHH
T ss_conf             59999986465-65445556556666678999999999765059999999999999818766656679899999847999


Q ss_pred             HHHHHHHCCCHH-HHHHHHHHHHHHHHHCC---CCCCC--C--HHHHHHHHHHHHHHHHHHHHHHHCCCE--EEECCCCH
Q ss_conf             999997404634-58899987767543005---78687--3--145688888777877667766434963--13124314
Q gi|254780479|r   81 EILSAWAGLGYY-TRARNLKKCADIIVKKY---EGNFP--H--KVEILKKLPGIGDYTASAIVAIAFNHF--AVVVDTNI  150 (356)
Q Consensus        81 ~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~---~g~~P--~--~~~~l~~LpGiG~yta~ai~s~a~~~~--~~~vD~Nv  150 (356)
                      ++.    ++|-- +||+.|..+|+.+.+..   .---+  .  -.+.|.+|+|||+.||..++.++++.+  .|+.|.-+
T Consensus       157 ~l~----~~g~s~~Ka~yi~~~A~~~~~g~~~~~~l~~~~~e~a~e~L~~i~GIG~WTAe~~llf~lgr~dvfP~~D~~l  232 (285)
T COG0122         157 ALR----RCGLSGRKAEYIISLARAAAEGELDLSELKPLSDEEAIEELTALKGIGPWTAEMFLLFGLGRPDVFPADDLGL  232 (285)
T ss_pred             HHH----HHCCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHH
T ss_conf             998----8378577899999999999859965676625889999999873788679999999998168898677155999


Q ss_pred             HHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC
Q ss_conf             6665765197545735689998741025785
Q gi|254780479|r  151 ERIISRYFDIIKPAPLYHKTIKNYARKITST  181 (356)
Q Consensus       151 ~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~  181 (356)
                      +|-+.++++.  +....++.+..+++.+-|-
T Consensus       233 r~~~~~~~~~--~~~~~~~~~~~~~e~w~p~  261 (285)
T COG0122         233 RRAIKKLYRL--PTRPTEKEVRELAERWGPY  261 (285)
T ss_pred             HHHHHHHHCC--CCCCHHHHHHHHHHHCCCH
T ss_conf             9999997257--8785088999998505678


No 23 
>TIGR00586 mutt mutator mutT protein; InterPro: IPR003561   MutT is a small bacterial protein (~12-15kDa) involved in the GO system  responsible for removing an oxidatively damaged form of guanine (8-hydroxy- guanine or 7,8-dihydro-8-oxoguanine) from DNA and the nucleotide pool. 8-oxo-dGTP is inserted opposite dA and dC residues of template DNA with near equal efficiency, leading to A.T to G.C transversions. MutT specifically degrades 8-oxo-dGTP to the monophosphate, with the concomitant release of pyrophosphate. A short conserved N-terminal region of mutT (designated the MutT domain) is also found in a variety of other prokaryotic, viral, and eukaryotic proteins , , , . Recently, the generic name 'NUDIX hydrolases' (NUcleoside DIphosphate linked to some other moiety X) has been coined for this domain family .    The solution structure of the MutT enzyme has been solved by NMR . The structure is globular and compact, and contains a five-stranded mixed beta-sheet. The parallel portion of the sheet is sandwiched between two alpha-helices, forming an alpha+beta fold. ; GO: 0008413 8-oxo-78-dihydroguanine triphosphatase activity, 0006281 DNA repair.
Probab=98.80  E-value=4.9e-09  Score=84.14  Aligned_cols=116  Identities=15%  Similarity=0.175  Sum_probs=91.2

Q ss_pred             CCEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCC------CCCCCEECCCEEEEEEEEEEE
Q ss_conf             21123200034137605776457201320474421111346721588863010------100002137358998315999
Q gi|254780479|r  236 PMRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAP------FTANWILCNTITHTFTHFTLT  309 (356)
Q Consensus       236 ~~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~------~~~~~~~l~~ikH~fTH~~L~  309 (356)
                      +.....+.||+|.+|+|+|.||+.++-+++.||||+++.+.++.......++.      .........++.|.|-..+|+
T Consensus         3 K~~QiavgiIrNe~gei~itrR~~g~~~a~~lEFPGGK~e~GETpeqa~vREl~EEigI~~~~~~l~e~l~~eyP~~~l~   82 (150)
T TIGR00586         3 KKLQIAVGIIRNENGEIFITRRAAGAHFANKLEFPGGKIEMGETPEQAVVRELQEEIGITPQHFELFEKLEYEYPQKRLT   82 (150)
T ss_pred             CEEEEEEEEEECCCCCEEEEECCCCCCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHEEEEEECCHHEEEECCCCCCEEEH
T ss_conf             60678888998188807887444562100024798981388989789776542220012320000000101306740000


Q ss_pred             EEEEEEECCC------------------------CCCCCCCEEECHHHHHHCC-CCHHHHHHHHHCC
Q ss_conf             9999998077------------------------6766781332278887479-9878999998448
Q gi|254780479|r  310 LFVWKTIVPQ------------------------IVIIPDSTWHDAQNLANAA-LPTVMKKALSAGG  351 (356)
Q Consensus       310 i~v~~~~~~~------------------------~~~~~~~~Wv~~~el~~~~-LPs~~kKIL~alg  351 (356)
                      +.+|.+..-.                        ....+...|+...+|+... +|..+.+++.+|-
T Consensus        83 l~~~L~~~~~gep~~~~raada~~a~klefpggk~e~g~~~~w~~~~~L~~~~gfppan~~~i~~L~  149 (150)
T TIGR00586        83 LYVYLVQEWSGEPFGKERAADAHMANKLEFPGGKIEMGEEGEWVAITDLDEERGFPPANAELIQKLR  149 (150)
T ss_pred             HHHEEHHHHCCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCEEEEEECCCCCCCCCCCHHHHHHHC
T ss_conf             2310014333881022455556665431278860221798517898414556788840268999961


No 24 
>cd03427 MTH1 MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily. MTH1, the mammalian counterpart of MutT, hydrolyzes oxidized purine nucleoside triphosphates, such as 8-oxo-dGTP and 2-hydroxy-ATP, to monophosphates, thereby preventing the incorporation of such oxygen radicals during replication. This is an important step in the repair mechanism in genomic and mitochondrial DNA.  Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity, and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. MTH1 is predominantly localized in the cytoplasm and mitochondria. Structurally, this enzyme adopts a similar fold to MutT despite low sequence similarity outside the conserved nudix motif. The most distinctive structural difference between MutT and MTH1 is the presence of a beta-hairpin, which is absent in MutT. This results in a m
Probab=98.66  E-value=1.5e-07  Score=73.74  Aligned_cols=108  Identities=15%  Similarity=0.055  Sum_probs=73.0

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCC------CEECCCEEEEE--EEEEEEEEEE
Q ss_conf             000341376057764572013204744211113467215888630101000------02137358998--3159999999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTAN------WILCNTITHTF--THFTLTLFVW  313 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~------~~~l~~ikH~f--TH~~L~i~v~  313 (356)
                      .++++.++|+|||.+|+.+ -..|+|+||+++.+.++.......++...+.      ....+.+.+.+  .++.+.+++|
T Consensus         4 tl~ii~~~~kiLl~~R~~~-~~~G~W~~PGGkvE~gEt~~~a~~REl~EEtGl~v~~~~~~g~~~~~~~~~~~~~~~~~f   82 (137)
T cd03427           4 TLCFIKDPDKVLLLNRKKG-PGWGGWNGPGGKVEPGETPEECAIRELKEETGLTIDNLKLVGIIKFPFPGEEERYGVFVF   82 (137)
T ss_pred             EEEEEEECCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCEEEEEEEEEEEEEEEECCCCEEEEEEEE
T ss_conf             9999998999999996999-999959795565679979999999999898692887138999999992799989999999


Q ss_pred             EEECCCC-----CCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             9980776-----766781332278887479987899999844
Q gi|254780479|r  314 KTIVPQI-----VIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       314 ~~~~~~~-----~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      .+.....     ....+.+|++.+++.+++++...+.+|..+
T Consensus        83 ~~~~~~G~~~~~~~e~~l~Wv~~~el~~l~~~p~d~~~l~~l  124 (137)
T cd03427          83 LATEFEGEPLKESEEGILDWFDIDDLPLLPMWPGDREWLPLM  124 (137)
T ss_pred             EEEEEEEEECCCCCCCEEEEEEHHHCCCCCCCCCHHHHHHHH
T ss_conf             998352476798833368998498977589982368999999


No 25 
>PRK10308 3-methyl-adenine DNA glycosylase II; Provisional
Probab=98.63  E-value=3.9e-07  Score=70.79  Aligned_cols=136  Identities=21%  Similarity=0.294  Sum_probs=97.4

Q ss_pred             CCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHH-------------HCCCHHHHHCCCHHHHHHHHHCCCHH-HHHHH
Q ss_conf             44567870467664467311612033265899877-------------48997898729989999997404634-58899
Q gi|254780479|r   32 EKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQ-------------KWPTIFCLSSAKDEEILSAWAGLGYY-TRARN   97 (356)
Q Consensus        32 ~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~-------------~~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~   97 (356)
                      .-+..-|+|+..|=-|+=||-.|+.....-.|+++             .|||++.||.++.+++    .+||.- +|++.
T Consensus       106 RvPg~~d~fE~~vrAIlGQQvSv~aA~tl~~Rlv~~~G~~~~~~~~~~~FPtp~~la~~~~~~L----~~lg~p~~ra~t  181 (283)
T PRK10308        106 RLPGSVDAFEQGVRAILGQLVSVAMAAKLTAKVAQLYGERLDDFPDYVCFPTPQRLAAADPQAL----KALGMPLKRAEA  181 (283)
T ss_pred             CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEECCCHHHHHCCCHHHH----HHCCCCHHHHHH
T ss_conf             7888798899999999714021999999999999994895788988534799899853897566----455896689999


Q ss_pred             HHHHHHHHHHC-CCCCCCCH----HHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH--HHHHHHHHCCCCCCCHHHHH
Q ss_conf             98776754300-57868731----4568888877787766776643496313124314--66657651975457356899
Q gi|254780479|r   98 LKKCADIIVKK-YEGNFPHK----VEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI--ERIISRYFDIIKPAPLYHKT  170 (356)
Q Consensus        98 l~~~a~~i~~~-~~g~~P~~----~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv--~RVl~R~~~~~~~~~~~~k~  170 (356)
                      |...|+.+.+. ..-.-+.|    .+.|+.|||||+-||.-|+--+++.|..-..|-+  +|   ++-+.      ..++
T Consensus       182 l~~lA~a~~~g~l~l~~~~d~~~~~~~L~~l~GIGpWTa~Yv~mR~lg~pD~fp~~Dl~l~~---~l~~~------~~~~  252 (283)
T PRK10308        182 LIHLANAALEGTLPLTAPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---RFPGM------TPAQ  252 (283)
T ss_pred             HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHH---HHCCC------CHHH
T ss_conf             99999999669866777899999999987367978899999999837899878760299997---61359------9999


Q ss_pred             HHHHHHCCCC
Q ss_conf             9874102578
Q gi|254780479|r  171 IKNYARKITS  180 (356)
Q Consensus       171 l~~~~~~~~~  180 (356)
                      +...++.+-|
T Consensus       253 ~~~~a~~W~P  262 (283)
T PRK10308        253 IRRYAERWKP  262 (283)
T ss_pred             HHHHHHCCCC
T ss_conf             9999753588


No 26 
>cd04691 Nudix_Hydrolase_32 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.57  E-value=2.6e-07  Score=71.95  Aligned_cols=93  Identities=14%  Similarity=0.246  Sum_probs=61.9

Q ss_pred             EEEEEEECCCEEEEEECC-CCHHHHCCCCCCCCCCCCCCCHHHHHHCCCC------CCCCEECCCEEEEEEEEEEEEEEE
Q ss_conf             200034137605776457-2013204744211113467215888630101------000021373589983159999999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRT-NTRLLEGMDELPGSAWSSTKDGNIDTHSAPF------TANWILCNTITHTFTHFTLTLFVW  313 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp-~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~------~~~~~~l~~ikH~fTH~~L~i~v~  313 (356)
                      .+.+++ ++|++||.||+ .++..+|+||||+++.+.++.......++..      ......+.++.|..++..+ +++|
T Consensus         3 Vagii~-~~~kvLl~rR~~~~~~~~G~WefPGGkvE~gEt~~~Al~REl~EElgI~v~~~~~~~~~~~~~~~~~~-l~~~   80 (117)
T cd04691           3 VVGVLF-SDDKVLLERRSLTKNADPGKLNIPGGHIEAGESQEEALLREVQEELGVDPLSYTYLCSLYHPTSELQL-LHYY   80 (117)
T ss_pred             EEEEEE-ECCEEEEEECCCCCCCCCCEEECCEECCCCCCCHHHHHHHHHHHHHCEEEECCEEEEEEEECCCCEEE-EEEE
T ss_conf             999999-89999999889999999996989745177988999999999889819156224589999855999699-9999


Q ss_pred             EEEC----CCCCCCCCCEEECHHHHH
Q ss_conf             9980----776766781332278887
Q gi|254780479|r  314 KTIV----PQIVIIPDSTWHDAQNLA  335 (356)
Q Consensus       314 ~~~~----~~~~~~~~~~Wv~~~el~  335 (356)
                      .+..    +.....++.+|++.+||-
T Consensus        81 ~~~~~~G~~~~~E~~e~~Wv~~~el~  106 (117)
T cd04691          81 VVTFWQGEIPAQEAAEVHWMTANDIV  106 (117)
T ss_pred             EEEEECCEECCCCCCCCEECCHHHCC
T ss_conf             99998176643347705771699953


No 27 
>cd04696 Nudix_Hydrolase_37 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.50  E-value=8.3e-07  Score=68.47  Aligned_cols=103  Identities=17%  Similarity=0.185  Sum_probs=66.4

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCEEEEE-------EEEE
Q ss_conf             200034137605776457201320474421111346721588863010100002------137358998-------3159
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTITHTF-------THFT  307 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~ikH~f-------TH~~  307 (356)
                      ..++|++.+|++||.||+.   +.|+|+||++..+.++........+...+...      .++...+.+       +|+ 
T Consensus         5 V~avI~~~~g~vLl~r~~~---~~g~W~lPGG~ve~gEs~~~aa~RE~~EEtGl~v~~~~~~~~~~~~~~~~~~~~~h~-   80 (125)
T cd04696           5 VGALIYAPDGRILLVRTTK---WRGLWGVPGGKVEWGETLEEALKREFREETGLKLRDIKFAMVQEAIFSEEFHKPAHF-   80 (125)
T ss_pred             EEEEEEECCCEEEEEEECC---CCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECCCCCCCCCCCEE-
T ss_conf             9999994999899999369---999598986588699599999999999975957653589988822126767777659-


Q ss_pred             EEEEEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHHHHHH
Q ss_conf             999999998077-----67667813322788874799878999998
Q gi|254780479|r  308 LTLFVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMKKALS  348 (356)
Q Consensus       308 L~i~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~kKIL~  348 (356)
                       -+..|.+....     .....+++|++++++.+++|+..+|.+|+
T Consensus        81 -~~~~~~~~~~~~~~~~~~E~~e~~Wv~~~el~~~~l~~~~r~~ie  125 (125)
T cd04696          81 -VLFDFFARTDGTEVTPNEEIVEWEWVTPEEALDYPLNSFTRLLLE  125 (125)
T ss_pred             -EEEEEEEEEECCCCCCCCCCCEEEEECHHHHHCCCCCHHHHHHHC
T ss_conf             -999999998089277783542999943889565988868876519


No 28 
>cd04699 Nudix_Hydrolase_39 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.48  E-value=8.9e-07  Score=68.22  Aligned_cols=102  Identities=15%  Similarity=0.147  Sum_probs=65.0

Q ss_pred             EEEEEEEECCCEEEEEECCC-CHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCE--EEEEEEEEEEE
Q ss_conf             32000341376057764572-01320474421111346721588863010100002------13735--89983159999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTN-TRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTI--THTFTHFTLTL  310 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~-~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~i--kH~fTH~~L~i  310 (356)
                      ....+|++.+|++||.||.. +..++|+|+||++..+.++.......++...+...      .....  .+...+++..+
T Consensus         3 ~V~~~I~~~~g~vLl~kR~~~~~~~~G~W~~PGG~ie~gEt~~~aa~REl~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~   82 (129)
T cd04699           3 AVAALIVKDVGRILILKRSKDERTAPGKWELPGGKVEEGETFEEALKREVYEETGLTVTPFLRYPSTVTHEDSGVYNVIY   82 (129)
T ss_pred             EEEEEEEECCCEEEEEEECCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCEEEEE
T ss_conf             99999998999899998089999999818898050659989999999999998795778768876999972799508999


Q ss_pred             EEEEEECCCC-----CCCCCCEEECHHHHHHCCCCH
Q ss_conf             9999980776-----766781332278887479987
Q gi|254780479|r  311 FVWKTIVPQI-----VIIPDSTWHDAQNLANAALPT  341 (356)
Q Consensus       311 ~v~~~~~~~~-----~~~~~~~Wv~~~el~~~~LPs  341 (356)
                      .+|.+.....     ....+++|+++++|.+++...
T Consensus        83 ~~f~~~~~~~~~~~~~E~~e~~Wv~~eeL~~L~~d~  118 (129)
T cd04699          83 LVFVCEALSGAVKLSDEHEEYAWVTLEELAILKADI  118 (129)
T ss_pred             EEEEEEECCCCCCCCCCCEEEEEEEHHHHHHCCCCC
T ss_conf             999986248868788652066998489975487975


No 29 
>cd04667 Nudix_Hydrolase_10 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.41  E-value=2.2e-06  Score=65.54  Aligned_cols=101  Identities=14%  Similarity=0.146  Sum_probs=65.3

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEE--EEEEEEEEEEEECCC
Q ss_conf             000341376057764572013204744211113467215888630101000021373589983--159999999998077
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFT--HFTLTLFVWKTIVPQ  319 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fT--H~~L~i~v~~~~~~~  319 (356)
                      +.||.+++|++||.||+     +|+|+||++..+.++....................+.+.++  +.+-..++|.+.+..
T Consensus         3 A~vi~~~~g~vLLvrr~-----~g~W~lPGG~ve~GEt~~~Aa~RE~~EEtGl~~~~~~~~~~~~~~~~~~~~f~~~~~~   77 (112)
T cd04667           3 ATVICRRGGRVLLVRKS-----GSRWALPGGKIEPGETPLQAARRELQEETGLQGLDLLYLFHVDGGSTRHHVFVASVPP   77 (112)
T ss_pred             EEEEEECCCEEEEEEEC-----CCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECCCEEEEEEEEEECC
T ss_conf             89999709999999918-----9938898748889989999999999998693765469999990898899999999758


Q ss_pred             ------CCCCCCCEEECHHHHHHCCCCHHHHHHH
Q ss_conf             ------6766781332278887479987899999
Q gi|254780479|r  320 ------IVIIPDSTWHDAQNLANAALPTVMKKAL  347 (356)
Q Consensus       320 ------~~~~~~~~Wv~~~el~~~~LPs~~kKIL  347 (356)
                            .....+.+|++.+++.+++++...+.|+
T Consensus        78 ~~~~~~~~E~~e~rW~~~del~el~~~~~~R~iv  111 (112)
T cd04667          78 SAQPKPSNEIADCRWLSLDALGDLNASAATRLIV  111 (112)
T ss_pred             CCCCCCCCCCEEEEEEEHHHCCCCCCCHHHHHHC
T ss_conf             8641899742147998699963688994672101


No 30 
>cd04681 Nudix_Hydrolase_22 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.33  E-value=5.4e-06  Score=62.72  Aligned_cols=105  Identities=14%  Similarity=0.241  Sum_probs=65.2

Q ss_pred             EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEEE----EEEEEEEE
Q ss_conf             003413760577645720132047442111134672158886301010000------21373589983----15999999
Q gi|254780479|r  243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTFT----HFTLTLFV  312 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~fT----H~~L~i~v  312 (356)
                      .+|++.+|++||.||.+. -..|+|+||++..+.++.......++...+..      ..+..+.+.++    +++.-..+
T Consensus         6 ~vI~~~~g~vLl~kR~~~-~~~g~W~lPGG~ve~gEt~~~aa~REl~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~~~~   84 (130)
T cd04681           6 VLILNEDGELLVVRRARE-PGKGTLDLPGGFVDPGESAEEALIREIREETGLKVTELSYLFSLPNTYPYGGMEYDTLDLF   84 (130)
T ss_pred             EEEEECCCEEEEEEECCC-CCCCCEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCEEEEEEEEE
T ss_conf             999968999999996899-8899092988206599999999989999986954688689999764554698238999999


Q ss_pred             EEEECCCC------CCCCCCEEECHHHHH--HCCCCHHHHHHHHH
Q ss_conf             99980776------766781332278887--47998789999984
Q gi|254780479|r  313 WKTIVPQI------VIIPDSTWHDAQNLA--NAALPTVMKKALSA  349 (356)
Q Consensus       313 ~~~~~~~~------~~~~~~~Wv~~~el~--~~~LPs~~kKIL~a  349 (356)
                      |.+.....      ....+.+|++++||.  ++++|+ +++.|..
T Consensus        85 ~~~~~~~~~~~~~~~E~~e~~W~~~~el~~~~~afp~-~~~~i~~  128 (130)
T cd04681          85 FVCQVDDKPIVKAPDDVAELKWVVPQDIELENFAFPS-IRQAVER  128 (130)
T ss_pred             EEEEECCCCCCCCCCCEEEEEEECHHHCCHHHCCCHH-HHHHHHH
T ss_conf             9999879874799982516899868789710259866-9999998


No 31 
>cd04683 Nudix_Hydrolase_24 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.10  E-value=1.9e-05  Score=58.93  Aligned_cols=100  Identities=16%  Similarity=0.233  Sum_probs=61.6

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--C--EE---EEEEE--EEEEEE
Q ss_conf             200034137605776457201320474421111346721588863010100002137--3--58---99831--599999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--T--IT---HTFTH--FTLTLF  311 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~--ik---H~fTH--~~L~i~  311 (356)
                      .++++++.++++||.||.+.+...|+|+||++..+.++........+..++....+.  .  ..   |..+.  ...-..
T Consensus         2 av~vvl~~~~~vLL~~R~~~~~~~G~W~~PGG~ve~GEt~~~aa~REl~EEtGl~v~~~~l~~~~~~~~~~~~~~~~~~~   81 (120)
T cd04683           2 AVYVLLRRDDEVLLQRRANTGYMDGQWALPAGHLEKGEDAVTAAVREAREEIGVTLDPEDLRLAHTMHRRTEDIESRIGL   81 (120)
T ss_pred             EEEEEEEECCEEEEEECCCCCCCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEECEEEEEEEEECCCCCCCEEEEE
T ss_conf             89999993999999977899999997989758877999999999999888769388734089999883268998799999


Q ss_pred             EEEEECC-------CCCCCCCCEEECHHHHHHCCCC
Q ss_conf             9999807-------7676678133227888747998
Q gi|254780479|r  312 VWKTIVP-------QIVIIPDSTWHDAQNLANAALP  340 (356)
Q Consensus       312 v~~~~~~-------~~~~~~~~~Wv~~~el~~~~LP  340 (356)
                      +|.+...       .+....+..|+++++|-...+|
T Consensus        82 ~f~~~~~~g~p~~~~p~e~~~~~Wf~~deLP~~~~~  117 (120)
T cd04683          82 FFTVRRWSGEPRNCEPDKCAELRWFPLDALPDDTVD  117 (120)
T ss_pred             EEEEEEECCEEECCCCCCEEEEEEECHHHCCCCCCC
T ss_conf             999997478986189981458999998899886577


No 32 
>cd04664 Nudix_Hydrolase_7 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=98.09  E-value=6.2e-05  Score=55.24  Aligned_cols=102  Identities=13%  Similarity=0.083  Sum_probs=65.2

Q ss_pred             EEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCC--------EEE--EE--EEEEEEEEE
Q ss_conf             341376057764572013204744211113467215888630101000021373--------589--98--315999999
Q gi|254780479|r  245 AITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNT--------ITH--TF--THFTLTLFV  312 (356)
Q Consensus       245 ii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~--------ikH--~f--TH~~L~i~v  312 (356)
                      .+++++++||.||+++  +.|+|+||++..+.++........+...+.......        ...  .+  .+....+++
T Consensus        10 ~~~~~~~vLll~Rs~~--~~G~W~~p~G~ve~gE~~~~aA~REi~EETGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   87 (129)
T cd04664          10 RLTGEGRVLLLRRSDK--YAGFWQSVTGGIEDGESPAEAARREVAEETGLDPERLTLLDRGASIAFVEFTDNGRVWTEHP   87 (129)
T ss_pred             EECCCCEEEEEECCCC--CCCCCCCCEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEEECCCCEEEEEEE
T ss_conf             9889998999987999--98939198732289979999999999998889830179986145899999658985899999


Q ss_pred             EEEECCCC------CCCCCCEEECHHHHHHCCCCHHHHHHHH
Q ss_conf             99980776------7667813322788874799878999998
Q gi|254780479|r  313 WKTIVPQI------VIIPDSTWHDAQNLANAALPTVMKKALS  348 (356)
Q Consensus       313 ~~~~~~~~------~~~~~~~Wv~~~el~~~~LPs~~kKIL~  348 (356)
                      |.+.++..      ..+.+++|++.++..++...-..|++|.
T Consensus        88 f~~~v~~~~~v~l~~EH~~~~W~~~~ea~~~l~~~~~r~a~~  129 (129)
T cd04664          88 FAFHLPSDAVVTLDWEHDAFEWVPPEEAAALLLWESNRRAWA  129 (129)
T ss_pred             EEEECCCCCCCCCCCCCCCEEECCHHHHHHHCCCHHHHHHHC
T ss_conf             999867898776874447218836999997659933668759


No 33 
>cd04673 Nudix_Hydrolase_15 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.06  E-value=1.8e-05  Score=58.98  Aligned_cols=102  Identities=17%  Similarity=0.129  Sum_probs=61.6

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEE------EEEEEE
Q ss_conf             0003413760577645720132047442111134672158886301010000------2137358998------315999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTF------THFTLT  309 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~f------TH~~L~  309 (356)
                      +..++.++|++||.||... -..|+|+||++..+.+++.......+...+..      ..++.+.+..      .+++..
T Consensus         3 v~~ii~~~~kvLL~~R~~~-~~~g~W~lPGG~ve~gEs~~~a~~REv~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~~~   81 (122)
T cd04673           3 VGAVVFRGGRVLLVRRANP-PDAGLWSFPGGKVELGETLEQAALRELLEETGLEAEVGRLLTVVDVIERDAAGRVEFHYV   81 (122)
T ss_pred             EEEEEEECCEEEEEEECCC-CCCCEEECCEEEEECCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCCCEEEEEE
T ss_conf             9999998999999996999-999979998968578999999999999998897647606999991020289996589999


Q ss_pred             EEEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHH
Q ss_conf             9999998077-----6766781332278887479987899
Q gi|254780479|r  310 LFVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMK  344 (356)
Q Consensus       310 i~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~k  344 (356)
                      +.+|.+....     .....+.+|++.++|.+++|+-..|
T Consensus        82 ~~~f~~~~~~g~~~~~~E~~e~~W~~~del~~l~l~~~~R  121 (122)
T cd04673          82 LIDFLCRYLGGEPVAGDDALDARWVPLDELAALSLTESTR  121 (122)
T ss_pred             EEEEEEEECCCEEECCCCEEEEEEEEHHHCCCCCCCCCCC
T ss_conf             9999999739988419883888998699914798993436


No 34 
>cd04511 Nudix_Hydrolase_4 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate specifici
Probab=98.05  E-value=2.5e-05  Score=57.98  Aligned_cols=104  Identities=13%  Similarity=0.176  Sum_probs=64.9

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE--CCCEEEEEE--EEEEEEEEEEEEC
Q ss_conf             000341376057764572013204744211113467215888630101000021--373589983--1599999999980
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL--CNTITHTFT--HFTLTLFVWKTIV  317 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~--l~~ikH~fT--H~~L~i~v~~~~~  317 (356)
                      +.+++.++|+|||.||... --.|+|++|++..+.++.......++..++....  ...+..+|+  |......+|.+..
T Consensus        16 v~~ii~~~~kILLvrR~~~-P~~g~W~lPGG~vE~gEt~eea~~REv~EEtGl~v~~~~~~~v~~~p~~~~v~~~f~~~~   94 (130)
T cd04511          16 VGCVPEWEGKVLLCRRAIE-PRHGFWTLPAGFMENGETTEQGALRETWEEAGARVEIDGLYAVYSVPHISQVYMFYRARL   94 (130)
T ss_pred             EEEEEEECCEEEEEECCCC-CCCCCEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCCEEEEEEEEEE
T ss_conf             9999986999999940689-999958689748658989999999899874097899626999945787767999999999


Q ss_pred             CC-----CCCCCCCEEECHHHHH--HCCCCHHHHHHH
Q ss_conf             77-----6766781332278887--479987899999
Q gi|254780479|r  318 PQ-----IVIIPDSTWHDAQNLA--NAALPTVMKKAL  347 (356)
Q Consensus       318 ~~-----~~~~~~~~Wv~~~el~--~~~LPs~~kKIL  347 (356)
                      ..     .....+.+|++.++|.  ++++|+. |+.|
T Consensus        95 ~~g~~~~~~E~~e~~wf~~delP~~~lAf~s~-~~al  130 (130)
T cd04511          95 LDLDFAPGPESLEVRLFTEEEIPWDELAFPTV-RRTL  130 (130)
T ss_pred             ECCCCCCCCCEEEEEEECHHHCCHHHCCHHHH-HHHC
T ss_conf             12767999520167788788997065451655-6319


No 35 
>cd04690 Nudix_Hydrolase_31 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.03  E-value=1.8e-05  Score=59.02  Aligned_cols=96  Identities=17%  Similarity=0.150  Sum_probs=62.8

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE--------CCCEE---EEEEEEEEE
Q ss_conf             2000341376057764572013204744211113467215888630101000021--------37358---998315999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL--------CNTIT---HTFTHFTLT  309 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~--------l~~ik---H~fTH~~L~  309 (356)
                      .+.++++.+|++||.||..+    |.|+||++..+.++........+...+....        .+.+.   |....+.+.
T Consensus         3 ~~a~v~~~~grvLl~rr~~~----~~W~lPGG~vE~gEt~~~a~~REl~EE~Gl~v~~~~l~~~~~~~~~~~~~~~~~~~   78 (118)
T cd04690           3 AAALILVRDGRVLLVRKRGT----DVFYLPGGKIEAGETPLQALIRELSEELGLDLDPDSLEYLGTFRAPAANEPGVDVR   78 (118)
T ss_pred             EEEEEEECCCEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEECCEEEEEEEEEECCCCCCCEEEE
T ss_conf             99999989998999997799----94989963765997999999999989759498602079989997216679998999


Q ss_pred             EEEEEEECCCC----CCCCCCEEECHHHHHHCCCC
Q ss_conf             99999980776----76678133227888747998
Q gi|254780479|r  310 LFVWKTIVPQI----VIIPDSTWHDAQNLANAALP  340 (356)
Q Consensus       310 i~v~~~~~~~~----~~~~~~~Wv~~~el~~~~LP  340 (356)
                      +++|.+.....    ....+.+|++++++.+++|.
T Consensus        79 ~~~~~~~~~g~~~~~~E~~e~~Wv~~~el~~~~la  113 (118)
T cd04690          79 ATVYVAELTGEPVPAAEIEEIRWVDYDDPADDRLA  113 (118)
T ss_pred             EEEEEEEEECCCCCCCCCCCEEEECHHHCCCCCCC
T ss_conf             99999998188457863100299579897689989


No 36 
>cd04676 Nudix_Hydrolase_17 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.02  E-value=3e-05  Score=57.51  Aligned_cols=105  Identities=17%  Similarity=0.202  Sum_probs=65.0

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--C-------EEEEEEE-----
Q ss_conf             3200034137605776457201320474421111346721588863010100002137--3-------5899831-----
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--T-------ITHTFTH-----  305 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~-------ikH~fTH-----  305 (356)
                      ....++++.+|++||.||...    |.|.||++..+.++........+...+......  .       ..|.++.     
T Consensus         4 gv~~vv~~~~g~vLL~~r~~~----g~W~lPGG~ve~GE~~~~aa~RE~~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~   79 (129)
T cd04676           4 GVTAVVRDDEGRVLLIRRSDN----GLWALPGGAVEPGESPADTAVREVREETGLDVEVTGLVGIYTGPVHVVTYPNGDV   79 (129)
T ss_pred             EEEEEEEECCCEEEEEEECCC----CEEECCCCEECCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCEECCCCCE
T ss_conf             799999979980999998699----9698992574199898998889999976957787279999726442162799986


Q ss_pred             EEEEEEEEEEECCC------CCCCCCCEEECHHHHHHCCCCHHHHHHHH
Q ss_conf             59999999998077------67667813322788874799878999998
Q gi|254780479|r  306 FTLTLFVWKTIVPQ------IVIIPDSTWHDAQNLANAALPTVMKKALS  348 (356)
Q Consensus       306 ~~L~i~v~~~~~~~------~~~~~~~~Wv~~~el~~~~LPs~~kKIL~  348 (356)
                      .+....+|.+....      .....+.+|++++++..++++-.+|..++
T Consensus        80 ~~~~~~~f~~~~~~~~~~~~~~E~~e~~Wf~~delp~l~~~p~~R~~i~  128 (129)
T cd04676          80 RQYLDITFRCRVVGGELRVGDDESLDVAWFDPDGLPPLLMHPSMRLRID  128 (129)
T ss_pred             EEEEEEEEEEEEECCCCCCCCCCEEEEEEECHHHCCCCCCCCCCHHHCC
T ss_conf             9999999998872884559997178889989989969999945043337


No 37 
>cd04682 Nudix_Hydrolase_23 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=98.01  E-value=2.7e-05  Score=57.83  Aligned_cols=97  Identities=11%  Similarity=0.017  Sum_probs=58.5

Q ss_pred             EEEEECCCEEEEEECCCC-HH-HHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC----CEEEEEEE--EEEEEEEEE
Q ss_conf             003413760577645720-13-20474421111346721588863010100002137----35899831--599999999
Q gi|254780479|r  243 FIAITNDNRILLRKRTNT-RL-LEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN----TITHTFTH--FTLTLFVWK  314 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~-gl-l~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~----~ikH~fTH--~~L~i~v~~  314 (356)
                      ..++..+|++||.+|++. ++ |.|+|+||++..+.++........+..++....+.    ...+.|++  .....++|.
T Consensus         5 ~al~~~~g~vLl~~R~~~~~~~~pg~W~~pGG~~e~~E~~~~~~~RE~~EE~gl~~~~~~~~~~~~~~~~~~~~~~~~F~   84 (122)
T cd04682           5 LALLIGDGRLLLQLRDDKPGIPYPGHWDLPGGHREGGETPLECVLRELLEEIGLTLPESRIPWFRVYPSASPPGTEHVFV   84 (122)
T ss_pred             EEEEECCCEEEEEECCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHEEEEEEECCCCCCCEEEEEE
T ss_conf             99996799799998999999998993879966667998999999999999888856700020278521578994799999


Q ss_pred             EECC-------CCCCCCCCEEECHHHHHHCCC
Q ss_conf             9807-------767667813322788874799
Q gi|254780479|r  315 TIVP-------QIVIIPDSTWHDAQNLANAAL  339 (356)
Q Consensus       315 ~~~~-------~~~~~~~~~Wv~~~el~~~~L  339 (356)
                      +.+.       ...+.++++|++.+|+-+.+.
T Consensus        85 ~~l~~~~~~~~~~~Egq~~~W~~~eEll~~~~  116 (122)
T cd04682          85 VPLTAREDAILFGDEGQALRLMTVEEFLAHED  116 (122)
T ss_pred             EEEECCCCCCCCCCCCCEEEEEEHHHHHHCCC
T ss_conf             84005631535687330768745999964836


No 38 
>cd04677 Nudix_Hydrolase_18 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.98  E-value=5.5e-05  Score=55.61  Aligned_cols=105  Identities=18%  Similarity=0.181  Sum_probs=61.9

Q ss_pred             EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--CEEEEEEEE----------
Q ss_conf             23200034137605776457201320474421111346721588863010100002137--358998315----------
Q gi|254780479|r  239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--TITHTFTHF----------  306 (356)
Q Consensus       239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~ikH~fTH~----------  306 (356)
                      .....++++.+|++||.||...    |.|.||++..+.++.......++...+......  .+...|++.          
T Consensus         8 ~g~~~vv~~~~g~vLL~~r~~~----g~W~lPGG~ve~gEt~~~aa~RE~~EEtGi~~~~~~~~~~~~~~~~~~~~~~~~   83 (132)
T cd04677           8 VGAGVILLNEQGEVLLQKRSDT----GDWGLPGGAMELGESLEETARRELKEETGLEVEELELLGVYSGKEFYVKPNGDD   83 (132)
T ss_pred             EEEEEEEEECCCEEEEEEECCC----CCEECCEEEECCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCEEEECCCCCC
T ss_conf             5799999989998999998899----949898427669979999999999998695535247999977864554489984


Q ss_pred             --EEEEEEEEEECC------CCCCCCCCEEECHHHHHHCCCCHHHHHHHH
Q ss_conf             --999999999807------767667813322788874799878999998
Q gi|254780479|r  307 --TLTLFVWKTIVP------QIVIIPDSTWHDAQNLANAALPTVMKKALS  348 (356)
Q Consensus       307 --~L~i~v~~~~~~------~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~  348 (356)
                        .....+|.+...      ......+.+|++++++.++-+|. ++++|+
T Consensus        84 ~~~~~~~~~~~~~~~~~~~~~~~E~~e~~Wf~~delp~l~~p~-~r~~ie  132 (132)
T cd04677          84 EQYIVTLYYVTKVFGGKLVPDGDETLELKFFSLDELPELINPQ-HKEMLQ  132 (132)
T ss_pred             EEEEEEEEEEEEECCCCCCCCCCCEEEEEEECHHHCCCCCCHH-HHHHCC
T ss_conf             7999999999862587567999701589998299983448922-575609


No 39 
>cd04684 Nudix_Hydrolase_25 Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function. In general, members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability
Probab=97.94  E-value=0.00013  Score=52.99  Aligned_cols=104  Identities=14%  Similarity=0.169  Sum_probs=63.4

Q ss_pred             EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEEE------EEEEEE
Q ss_conf             003413760577645720132047442111134672158886301010000------21373589983------159999
Q gi|254780479|r  243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTFT------HFTLTL  310 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~fT------H~~L~i  310 (356)
                      +.++.++|++||.||... -..|+|.||++..+.++........+...+..      ..++.+.+.++      +++.-.
T Consensus         4 ~avi~~~~~vLL~rr~~~-~~~g~W~lPGG~ve~gEt~~~aa~REv~EETGl~v~~~~~l~~~~~~~~~~~~~~~~~~v~   82 (128)
T cd04684           4 YAVIPRDGKLLLIQKNGG-PYEGRWDLPGGGIEPGESPEEALHREVLEETGLTVEIGRRLGSASRYFYSPDGDYDAHHLC   82 (128)
T ss_pred             EEEEEECCEEEEEEECCC-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEEEECCCCCEEEEEEE
T ss_conf             999993999999997789-9898298985607799899999999999872937886338899999997899977679999


Q ss_pred             EEEEEECCCCC--------CCCCCEEECHHHHHHCCCCHHHHHHH
Q ss_conf             99999807767--------66781332278887479987899999
Q gi|254780479|r  311 FVWKTIVPQIV--------IIPDSTWHDAQNLANAALPTVMKKAL  347 (356)
Q Consensus       311 ~v~~~~~~~~~--------~~~~~~Wv~~~el~~~~LPs~~kKIL  347 (356)
                      .+|.+......        ...+..|++.+++.+..++-.+++.+
T Consensus        83 ~~f~~~~~~g~~~~~~~~ee~~e~~W~~~del~e~~~~P~~~~~v  127 (128)
T cd04684          83 VFYDARVVGGALPVQEPGEDSHGAAWLPLDEAIERLLSPLVLWAV  127 (128)
T ss_pred             EEEEEEECCCCCCCCCCCCCEEEEEEECHHHHHCCCCCHHHHHHH
T ss_conf             999999868940568999830599995889942312794787863


No 40 
>cd04679 Nudix_Hydrolase_20 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.91  E-value=8e-05  Score=54.46  Aligned_cols=106  Identities=15%  Similarity=0.116  Sum_probs=64.0

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEEEE--EEEEEE
Q ss_conf             320003413760577645720132047442111134672158886301010000------213735899831--599999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTFTH--FTLTLF  311 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~fTH--~~L~i~  311 (356)
                      ....++++.+|++||.||... -..|+|+||++..+.++........+...+..      ..++.+.|..+.  ++....
T Consensus         4 ~v~avv~~~~g~vLL~~R~~~-~~~g~W~lPGG~vE~gEt~~~aa~REv~EEtGl~v~~~~l~~~~~~~~~~~~~~~~~~   82 (125)
T cd04679           4 GCGAAILRDDGKLLLVKRLRA-PEAGHWGIPGGKVDWMEAVEDAVVREIEEETGLSIHSTRLLCVVDHIIEEPPQHWVAP   82 (125)
T ss_pred             EEEEEEEECCCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCEEEEE
T ss_conf             699999977998999993899-9999793988305899899999999999987978585379999856435899659999


Q ss_pred             EEEEECC-------CCCCCCCCEEECHHHHHHCCCCHHHHHHH
Q ss_conf             9999807-------76766781332278887479987899999
Q gi|254780479|r  312 VWKTIVP-------QIVIIPDSTWHDAQNLANAALPTVMKKAL  347 (356)
Q Consensus       312 v~~~~~~-------~~~~~~~~~Wv~~~el~~~~LPs~~kKIL  347 (356)
                      +|.+...       .+....+..|+++++|-. +++..++.+|
T Consensus        83 ~f~~~~~~g~~~~~~~~e~~e~~Wf~~d~LP~-~l~~~~r~~i  124 (125)
T cd04679          83 VYLAENFSGEPRLMEPDKLLELGWFALDALPQ-PLTRATRDAV  124 (125)
T ss_pred             EEEEEECCCEEECCCCCEEEEEEEECHHHCCC-CCCHHHHHHH
T ss_conf             99999738878228998146887898889888-6598899974


No 41 
>cd03430 GDPMH GDP-mannose glycosyl hydrolase (AKA GDP-mannose mannosyl hydrolase (GDPMH)) is a member of the Nudix hydrolase superfamily. This class of enzymes is unique from other members of the superfamily in two aspects. First, it contains a modified Nudix signature sequence. The slight changes to the conserved sequence motif, GX5EX7REUXEEXGU, where U = I, L or V), are believed to contribute to the removal of all magnesium binding sites but one, retaining only the metal site that coordinates the pyrophosphate of the substrate. Secondly, it is not a pyrophosphatase that substitutes at a phosphorus; instead, it hydrolyzes nucleotide sugars such as GDP-mannose to GDP and mannose, cleaving the phosphoglycosyl bond by substituting at a carbon position. GDP-mannose provides mannosyl components for cell wall synthesis and is required for the synthesis of other glycosyl donors (such as GDP-fucose and colitose) for the cell wall. The importance of GDP-sugar hydrolase activities is thus close
Probab=97.85  E-value=0.00017  Score=52.23  Aligned_cols=97  Identities=16%  Similarity=0.191  Sum_probs=61.2

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC--------CCCEECCCEEEEEE-------
Q ss_conf             320003413760577645720132047442111134672158886301010--------00021373589983-------
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT--------ANWILCNTITHTFT-------  304 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~--------~~~~~l~~ikH~fT-------  304 (356)
                      ..-++|.+.+|++||.||... -++|.|.+|++..+.++.......+....        .....++.+.|.|+       
T Consensus        14 ~V~~vI~n~~g~iLL~kR~~~-p~~g~W~~PGG~i~~gEt~~~a~~Re~~eE~Gl~i~~~~~~~~Gv~eh~y~d~~~~~~   92 (144)
T cd03430          14 SIDLIVENEDGQYLLGKRTNR-PAQGYWFVPGGRIRKNETLTEAFERIAKDELGLEFLISDAELLGVFEHFYDDNFFGDD   92 (144)
T ss_pred             EEEEEEECCCCCEEEEECCCC-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCEEEEEEEEECCCCCCCCC
T ss_conf             999999979996999981699-8899498983447799799999999999974971640344897758998267777898


Q ss_pred             ---EEEEEEEEEEEECCCC------CCCCCCEEECHHHHHHCCC
Q ss_conf             ---1599999999980776------7667813322788874799
Q gi|254780479|r  305 ---HFTLTLFVWKTIVPQI------VIIPDSTWHDAQNLANAAL  339 (356)
Q Consensus       305 ---H~~L~i~v~~~~~~~~------~~~~~~~Wv~~~el~~~~L  339 (356)
                         |+-.  ..|.+.+...      ...++++|++.+++-+.+.
T Consensus        93 ~~~H~V~--l~y~~~~~~~e~~~~ddeh~e~~Wf~~del~~~~~  134 (144)
T cd03430          93 FSTHYVV--LGYVLKLSSNELLLPDEQHSEYQWLTSDELLADDD  134 (144)
T ss_pred             CCEEEEE--EEEEEEECCCCCCCCHHHCCEEEEECHHHHHCCCC
T ss_conf             5309999--99999955897559945700669967999521776


No 42 
>cd04678 Nudix_Hydrolase_19 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.85  E-value=0.00014  Score=52.83  Aligned_cols=108  Identities=15%  Similarity=0.120  Sum_probs=63.7

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--CEEEEEE------EEEEEEE
Q ss_conf             3200034137605776457201320474421111346721588863010100002137--3589983------1599999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--TITHTFT------HFTLTLF  311 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~ikH~fT------H~~L~i~  311 (356)
                      ....++++.+|++||.||... .-.|+|.||++..+.+++.......+...+....+.  .+.+.+.      +++....
T Consensus         4 gV~~iv~~~~g~vLL~rR~~~-~~~g~W~lPgG~ve~GEs~~~Aa~REv~EEtGl~v~~~~~~~~~~~~~~~~~~h~i~~   82 (129)
T cd04678           4 GVGVFVLNPKGKVLLGKRKGS-HGAGTWALPGGHLEFGESFEECAAREVLEETGLHIENVQFLTVTNDVFEEEGKHYVTI   82 (129)
T ss_pred             EEEEEEEECCCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCEEEEE
T ss_conf             799999989995999994789-9999498981040589889999999999987985252499999887757789469999


Q ss_pred             EEEEECCC---------CCCCCCCEEECHHHHHHC-CCCHHHHHHHH
Q ss_conf             99998077---------676678133227888747-99878999998
Q gi|254780479|r  312 VWKTIVPQ---------IVIIPDSTWHDAQNLANA-ALPTVMKKALS  348 (356)
Q Consensus       312 v~~~~~~~---------~~~~~~~~Wv~~~el~~~-~LPs~~kKIL~  348 (356)
                      +|.+....         +....+.+|++.++|-++ |+-.+.+.+|+
T Consensus        83 ~f~~~~~~g~~~~~~~ep~e~~~~~Wf~~deLP~l~pl~~p~~~~~~  129 (129)
T cd04678          83 FVKAEVDDGEAEPNKMEPEKCEGWEWFDWEELPSVDPLFLPLKNLFE  129 (129)
T ss_pred             EEEEEEECCCCCCCCCCCHHCCEEEEECHHHCCCCCCCCHHHHHHHC
T ss_conf             99999837823647799324013089998998698634178899649


No 43 
>cd02883 Nudix_Hydrolase Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+ for their activity. Members of this family are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolase include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance and "house-cleaning" enzy
Probab=97.83  E-value=0.00025  Score=51.04  Aligned_cols=104  Identities=18%  Similarity=0.226  Sum_probs=65.7

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEEC--CCEEEEEEE------EEEEEEEE
Q ss_conf             0003413760577645720132047442111134672158886301010000213--735899831------59999999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILC--NTITHTFTH------FTLTLFVW  313 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l--~~ikH~fTH------~~L~i~v~  313 (356)
                      ..++++.++++||.||...  ..|.|.||++..+.++.......++...+....+  ..+...|++      ......+|
T Consensus         4 ~~ii~~~~~~vLl~~r~~~--~~~~w~~PgG~ve~gE~~~~aa~RE~~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~~f   81 (123)
T cd02883           4 GAVILDEDGRVLLVRRADS--PGGLWELPGGGVEPGETLEEAAIREVREETGLDVDVLRLLGVYEVESPDEGEHAVVFVF   81 (123)
T ss_pred             EEEEEECCCEEEEEEECCC--CCCEEECCCEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCCEEEEEEE
T ss_conf             9999979999999994289--98959998143159989899756678898594567758999999877789966999999


Q ss_pred             EEECCCC-------CCCCCCEEECHHHHHHCCCCHHHHHHH
Q ss_conf             9980776-------766781332278887479987899999
Q gi|254780479|r  314 KTIVPQI-------VIIPDSTWHDAQNLANAALPTVMKKAL  347 (356)
Q Consensus       314 ~~~~~~~-------~~~~~~~Wv~~~el~~~~LPs~~kKIL  347 (356)
                      .+.....       ....+..|++.+++..++++..++..+
T Consensus        82 ~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~~  122 (123)
T cd02883          82 LARLVGGEPTLLPPDEISEVRWVTLDELPALALSPALRPAL  122 (123)
T ss_pred             EEEECCCCCCCCCCCCEEEEEEEEHHHHHCCCCCHHHHHHH
T ss_conf             99995895212898668999998499913499983675965


No 44 
>cd04670 Nudix_Hydrolase_12 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.82  E-value=0.0001  Score=53.78  Aligned_cols=107  Identities=13%  Similarity=0.186  Sum_probs=61.0

Q ss_pred             EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-----CEEEEEEE-E-EEEEE
Q ss_conf             23200034137605776457201320474421111346721588863010100002137-----35899831-5-99999
Q gi|254780479|r  239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-----TITHTFTH-F-TLTLF  311 (356)
Q Consensus       239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-----~ikH~fTH-~-~L~i~  311 (356)
                      .....++++.+|++||.|+...  ..|.|+||++..+.++.......++...+......     .+.|.... + ...+.
T Consensus         3 igv~avi~n~~~~vLlv~~~~~--~~g~W~lPgG~ve~GEt~~eaa~REv~EEtGl~v~~~~~l~~~~~~~~~~~~~~~~   80 (127)
T cd04670           3 VGVGGLVLNEKNEVLVVQERNK--TPNGWKLPGGLVDPGEDIFDGAVREVLEETGIDTEFVSVVGFRHAHPGAFGKSDLY   80 (127)
T ss_pred             EEEEEEEEECCCEEEEEEECCC--CCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECCCCCCCCEEEE
T ss_conf             7899999979999999997789--99958998203159979999999999998794568975898996079888971799


Q ss_pred             E-EEEEC-C-----CCCCCCCCEEECHHHHHHCCCCHHHHHHH
Q ss_conf             9-99980-7-----76766781332278887479987899999
Q gi|254780479|r  312 V-WKTIV-P-----QIVIIPDSTWHDAQNLANAALPTVMKKAL  347 (356)
Q Consensus       312 v-~~~~~-~-----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL  347 (356)
                      + +.... .     +.....+.+|++++|+-+.|+.+.+.|+|
T Consensus        81 f~~~~~~~~~~i~~~~~Ei~e~~W~~~~El~~~p~~~~~~r~~  123 (127)
T cd04670          81 FICRLKPLSFDINFDTSEIAAAKWMPLEEYISQPITSEVNRLI  123 (127)
T ss_pred             EEEEEECCCCCCCCCCCCEEEEEEECHHHHHCCCCCHHHHHHH
T ss_conf             9999981888778898505787883699987088704666656


No 45 
>cd03674 Nudix_Hydrolase_1 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamil
Probab=97.77  E-value=0.00017  Score=52.15  Aligned_cols=104  Identities=16%  Similarity=0.222  Sum_probs=63.9

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCE----------EEEE-------
Q ss_conf             20003413760577645720132047442111134672158886301010000213735----------8998-------
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTI----------THTF-------  303 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~i----------kH~f-------  303 (356)
                      .++|+.+.+|++||.+|..    .|.|.+|++..+.+++.......+...+.......+          .|.+       
T Consensus         6 ~~~Vv~~~~g~vLLv~~~~----~g~W~lPGG~ve~gEt~~~Aa~REv~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~~   81 (138)
T cd03674           6 SAFVVNPDRGKVLLTHHRK----LGSWLQPGGHIDPDESLLEAALRELREETGIELLGLRPLSVLVDLDVHPIDGHPKRG   81 (138)
T ss_pred             EEEEEECCCCEEEEEEECC----CCEEECCCCEECCCCCHHHHHHHHHHHHHCCCCCCEEECCCCCEEEEEEECCCCCCC
T ss_conf             9999978999899998659----992989965867988999999999999879873222560434057898766856788


Q ss_pred             --EEEEEEEEEEEEECC-------CCCCCCCCEEECHHHHHHCCCCHHHHHHHHH
Q ss_conf             --315999999999807-------7676678133227888747998789999984
Q gi|254780479|r  304 --THFTLTLFVWKTIVP-------QIVIIPDSTWHDAQNLANAALPTVMKKALSA  349 (356)
Q Consensus       304 --TH~~L~i~v~~~~~~-------~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~a  349 (356)
                        .|+++.+. |.+...       ......+.+|++++++..+.++--++++|+.
T Consensus        82 ~~~~~h~~v~-f~~~~~~~~~~~~~~~E~~~~~W~~~~el~~l~~~~~~rr~i~~  135 (138)
T cd03674          82 VPGHLHLDLR-FLAVAPADDVAPPKSDESDAVRWFPLDELASLELPEDVRRLVEK  135 (138)
T ss_pred             CCCEEEEEEE-EEEEECCCCCCCCCCCCCCEEEEEEHHHCCCCCCCHHHHHHHHH
T ss_conf             8876999999-99996288866689973210299859993025899769999999


No 46 
>cd04695 Nudix_Hydrolase_36 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.75  E-value=0.00033  Score=50.13  Aligned_cols=107  Identities=12%  Similarity=0.121  Sum_probs=65.1

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE------CCCEEEEEEEEE--E-EEE
Q ss_conf             2000341376057764572013204744211113467215888630101000021------373589983159--9-999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL------CNTITHTFTHFT--L-TLF  311 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~------l~~ikH~fTH~~--L-~i~  311 (356)
                      .++..++.+.+|||-||..  -++|.|+||++..+.+++.......+........      ...+.+.+++-+  + .+.
T Consensus         5 v~~~~~~~~~~~LlLkR~~--~~~g~W~~p~G~iE~gEs~~eaA~REl~EETGl~~~~l~~~~~~~~~y~~~~~~i~~~~   82 (131)
T cd04695           5 VLLRSLDKETKVLLLKRVK--TLGGFWCHVAGGVEAGETAWQAALRELKEETGISLPELYNADYLEQFYEANDNRILMAP   82 (131)
T ss_pred             EEEEEECCCCEEEEEECCC--CCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCEEEEEEECCCCEEEEEE
T ss_conf             9999989988899997689--99995869823346999999999998889778774347712426889984897699999


Q ss_pred             EEEEECCC------CCCCCCCEEECHHHHHHCC-CCHHHHHHHHHC
Q ss_conf             99998077------6766781332278887479-987899999844
Q gi|254780479|r  312 VWKTIVPQ------IVIIPDSTWHDAQNLANAA-LPTVMKKALSAG  350 (356)
Q Consensus       312 v~~~~~~~------~~~~~~~~Wv~~~el~~~~-LPs~~kKIL~al  350 (356)
                      +|.+.++.      .....+++|++.++..++. +|. .|.+|+.+
T Consensus        83 ~f~~~~~~~~~v~ls~Eh~~y~W~~~eea~~ll~~~~-~r~~l~~~  127 (131)
T cd04695          83 VFVGFVPPHQEVVLNHEHTEYRWCSFAEALELAPFPG-QRALYDHV  127 (131)
T ss_pred             EEEEEECCCCCCCCCCCCCCCEECCHHHHHHHCCCCC-HHHHHHHH
T ss_conf             9999938999782562032089718999996769967-89999999


No 47 
>cd04672 Nudix_Hydrolase_14 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.72  E-value=8.8e-05  Score=54.18  Aligned_cols=102  Identities=11%  Similarity=0.108  Sum_probs=59.2

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCEEEEEEE-----EEEEE
Q ss_conf             00034137605776457201320474421111346721588863010100002------13735899831-----59999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTITHTFTH-----FTLTL  310 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~ikH~fTH-----~~L~i  310 (356)
                      ..+++ ++|++||.||+..    |+|.||++..+.++........+...+...      ..+...+..++     ++...
T Consensus         6 ~~vv~-~dg~vLL~rr~~~----g~W~lPGG~ve~gEt~~eaa~REv~EETGl~v~~~~l~~v~~~~~~~~~~~~~~~~~   80 (123)
T cd04672           6 RAAIF-KDGKILLVREKSD----GLWSLPGGWADVGLSPAENVVKEVKEETGLDVKVRKLAAVDDRNKHHPPPQPYQVYK   80 (123)
T ss_pred             EEEEE-ECCEEEEEEECCC----CEEECCCEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCCCEEEEE
T ss_conf             99999-8999999998899----939898466379989999999999997790999858999983543279995249999


Q ss_pred             EEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHHHHHH
Q ss_conf             999998077-----67667813322788874799878999998
Q gi|254780479|r  311 FVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMKKALS  348 (356)
Q Consensus       311 ~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~kKIL~  348 (356)
                      .+|.+....     .....+..|+++++|..+++..-.+++|+
T Consensus        81 ~~f~~~~~~g~~~~~~E~~e~~Wf~~deLP~L~~~~~t~~~i~  123 (123)
T cd04672          81 LFFLCEILGGEFKPNIETSEVGFFALDDLPPLSEKRNTEEQIE  123 (123)
T ss_pred             EEEEEEECCCCCCCCCHHHEEEEECHHHCCCCCCCCCCHHHHC
T ss_conf             9999998789388894564367988889987875698979919


No 48 
>cd04669 Nudix_Hydrolase_11 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.70  E-value=0.00016  Score=52.45  Aligned_cols=97  Identities=14%  Similarity=0.140  Sum_probs=53.8

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE--ECCCEEEEEEEEE-----EEEEEE
Q ss_conf             200034137605776457201320474421111346721588863010100002--1373589983159-----999999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI--LCNTITHTFTHFT-----LTLFVW  313 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~--~l~~ikH~fTH~~-----L~i~v~  313 (356)
                      .+.++++.+|++||.||...|  .++|.||++..+.++........+.......  ..+.+-.++.+..     ..+.++
T Consensus         3 a~~ii~~~~~~vLl~~r~~~~--~~~w~~PGG~ve~gEt~~~a~~RE~~EE~g~~v~~~~~~~~~~~~~~~~~~F~~~i~   80 (121)
T cd04669           3 ASIVIINDQGEILLIRRIKPG--KTYYVFPGGGIEEGETPEEAAKREALEELGLDVRVEEIFLIVNQNGRTEHYFLARVI   80 (121)
T ss_pred             EEEEEEECCCEEEEEEECCCC--CCEEECCCCEECCCCCHHHHHHHHHHHHCCCEEEEEEEEEEEEECCCEEEEEEEEEE
T ss_conf             899999779999999984899--986989960605999999999999887609889965599999747980699999998


Q ss_pred             EEECC--CCC----CC-C---CCEEECHHHHHHCCC
Q ss_conf             99807--767----66-7---813322788874799
Q gi|254780479|r  314 KTIVP--QIV----II-P---DSTWHDAQNLANAAL  339 (356)
Q Consensus       314 ~~~~~--~~~----~~-~---~~~Wv~~~el~~~~L  339 (356)
                      .....  ...    .. .   +..|+++++|..++|
T Consensus        81 ~g~~~~~~~~e~~~~~~~~~~e~~WvpldeL~~l~l  116 (121)
T cd04669          81 SGKLGLGVGEEFERQSDDNQYHPVWVDLDQLETIPL  116 (121)
T ss_pred             CCEECCCCCCCCCCCCCCCEEEEEEEEHHHHCCCCC
T ss_conf             147815878423577888728999977999272898


No 49 
>cd04700 DR1025_like DR1025 from Deinococcus radiodurans, a member of the Nudix hydrolase superfamily, show nucleoside triphosphatase and dinucleoside polyphosphate pyrophosphatase activities. Like other enzymes belonging to this superfamily, it requires a divalent cation, in this case Mg2+, for its activity. It also contains a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. In general, substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is us
Probab=97.70  E-value=0.00023  Score=51.20  Aligned_cols=97  Identities=15%  Similarity=0.152  Sum_probs=60.3

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCC------CEECCCEEEEEEEEEEEE-EEE
Q ss_conf             2000341376057764572013204744211113467215888630101000------021373589983159999-999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTAN------WILCNTITHTFTHFTLTL-FVW  313 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~------~~~l~~ikH~fTH~~L~i-~v~  313 (356)
                      ...||++.+|++||.||. .+-..|+|+||++..+.++........+...+.      ...++.+...+.+-...+ ++|
T Consensus        16 agavV~n~~g~vLLvrr~-~~p~~g~W~lPgG~vE~GEt~~eAa~REv~EETGl~v~~~~~l~~~~~~~~~g~~~~~~~f   94 (142)
T cd04700          16 AGAVILNERNDVLLVQEK-GGPKKGLWHIPSGAVEDGEFPQDAAVREACEETGLRVRPVKFLGTYLGRFDDGVLVLRHVW   94 (142)
T ss_pred             EEEEEEECCCEEEEEEEC-CCCCCCEEECCEECCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCEEEEEEE
T ss_conf             999999699989999976-8998998979833387996999999999999669788888898999468079988999999


Q ss_pred             EEECCCC-------CCCCCCEEECHHHHHHCC
Q ss_conf             9980776-------766781332278887479
Q gi|254780479|r  314 KTIVPQI-------VIIPDSTWHDAQNLANAA  338 (356)
Q Consensus       314 ~~~~~~~-------~~~~~~~Wv~~~el~~~~  338 (356)
                      .++....       ....+.+|++.+|+..+.
T Consensus        95 l~~~~~~~~~p~~~dE~~e~~Wf~~~El~~L~  126 (142)
T cd04700          95 LAEPEGQTLAPKFTDEIAEASFFSREDVAQLY  126 (142)
T ss_pred             EEEECCCCCCCCCCCCEEEEEEECHHHHHHHH
T ss_conf             99974883256999876788998599955545


No 50 
>cd04671 Nudix_Hydrolase_13 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.65  E-value=0.00023  Score=51.19  Aligned_cols=106  Identities=14%  Similarity=0.127  Sum_probs=60.1

Q ss_pred             EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-----CEEEEEEEEEEEEEEE
Q ss_conf             23200034137605776457201320474421111346721588863010100002137-----3589983159999999
Q gi|254780479|r  239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-----TITHTFTHFTLTLFVW  313 (356)
Q Consensus       239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-----~ikH~fTH~~L~i~v~  313 (356)
                      |..+.++++.+|+|||.||... -..|+|+||++..+.++.......++...+......     .+....  ......+|
T Consensus         1 ~vV~avi~~~~g~vLLv~r~~~-~~~g~W~lPGG~vE~GEt~eeaa~REv~EETGl~v~~~~l~~v~~~~--~~~~~~~f   77 (123)
T cd04671           1 YIVAAVILNNQGEVLLIQEAKR-SCRGKWYLPAGRMEPGETIEEAVKREVKEETGLDCEPTTLLSVEEQG--GSWFRFVF   77 (123)
T ss_pred             CEEEEEEEECCCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECC--CEEEEEEE
T ss_conf             9899999988998999996589-99995979801804998999999999999749888977799999269--80999999


Q ss_pred             EEECCC-----C----CCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             998077-----6----766781332278887479987899999844
Q gi|254780479|r  314 KTIVPQ-----I----VIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       314 ~~~~~~-----~----~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      .++...     .    ....+.+|+++++|   |.|-..+-+++.+
T Consensus        78 ~~~~~~G~l~~~~~~~~E~~ea~Wf~~deL---P~pl~~~~~~~~~  120 (123)
T cd04671          78 TGNITGGDLKTEKEADSESLQARWYSNKDL---PLPLRAHDILSLI  120 (123)
T ss_pred             EEEEECCEECCCCCCCCCEEEEEEECHHHC---CCCCCCCHHHHHH
T ss_conf             999927651168999987799899888998---6876404345463


No 51 
>pfam00293 NUDIX NUDIX domain.
Probab=97.60  E-value=0.00033  Score=50.12  Aligned_cols=107  Identities=14%  Similarity=0.164  Sum_probs=64.6

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE------CCCEEEEEEEEEE--EEEEE
Q ss_conf             000341376057764572013204744211113467215888630101000021------3735899831599--99999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL------CNTITHTFTHFTL--TLFVW  313 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~------l~~ikH~fTH~~L--~i~v~  313 (356)
                      ..++++.++++||.||...+..+|.|+||++..+.++........+...+....      ...+.+...+...  ..++|
T Consensus         6 ~vvi~~~~~~vll~~r~~~~~~~~~w~~PgG~ve~gE~~~~aa~RE~~EE~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (133)
T pfam00293         6 GVVLLNEDGEVLLVRRSRTPGTPGLWELPGGKVEPGESPEEAAVRELEEETGLRVLLLLLLGVLEYPAPGRDLREHVHVF   85 (133)
T ss_pred             EEEEEECCCEEEEEEEECCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEECCCCCCCCEEEEEE
T ss_conf             99999489989999927689999849998455727988999999998897595115789999998649999982999999


Q ss_pred             EEECCCC--------CCCCCCEEECHHHHHHCCCCHHHHHHHH
Q ss_conf             9980776--------7667813322788874799878999998
Q gi|254780479|r  314 KTIVPQI--------VIIPDSTWHDAQNLANAALPTVMKKALS  348 (356)
Q Consensus       314 ~~~~~~~--------~~~~~~~Wv~~~el~~~~LPs~~kKIL~  348 (356)
                      .+.....        ....+..|++.+++.+.....-++.++.
T Consensus        86 ~~~~~~~~~~~~~~~~E~~~~~w~~~~e~~~~~~~~~~~~~~~  128 (133)
T pfam00293        86 LAELLGGEPEPQPDPGEVSEVRWVPLEELLELLLAGEIREVIA  128 (133)
T ss_pred             EEEEECCCCCCCCCCCEEEEEEEEEHHHHHHHHHCCCHHHHHH
T ss_conf             9998168644668978389999999999998878897389999


No 52 
>cd03673 Ap6A_hydrolase Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily. Ap6A hydrolase specifically hydrolyzes diadenosine polyphosphates, but not ATP or diadenosine triphosphate, and it generates ATP as the product. Ap6A, the most preferred substrate, hydrolyzes to produce two ATP molecules, which is a novel hydrolysis mode for Ap6A. These results indicate that Ap6A  hydrolase is a diadenosine polyphosphate hydrolase. It requires the presence of a divalent cation, such as Mn2+, Mg2+, Zn2+, and Co2+, for activity. Members of the Nudix superfamily are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site.
Probab=97.56  E-value=0.0016  Score=45.37  Aligned_cols=99  Identities=13%  Similarity=0.022  Sum_probs=60.7

Q ss_pred             CCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC------CCCEECCCEEEEEEEE----EEEEEEEEEEC
Q ss_conf             3760577645720132047442111134672158886301010------0002137358998315----99999999980
Q gi|254780479|r  248 NDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT------ANWILCNTITHTFTHF----TLTLFVWKTIV  317 (356)
Q Consensus       248 ~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~------~~~~~l~~ikH~fTH~----~L~i~v~~~~~  317 (356)
                      .+.+|||.+|+.+    |.|.||++..+.+++.......+...      .....++.+.|.|.+-    .-.++.|.+..
T Consensus        14 ~~~~vLLv~r~~~----~~W~lPgG~ve~gEs~~eaa~RE~~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~   89 (131)
T cd03673          14 GGIEVLLIHRPRG----DDWSLPKGKLEPGETPPEAAVREVEEETGIRAEVGDPLGTIRYWFSSSGKRVHKTVHWWLMRA   89 (131)
T ss_pred             CCEEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEECCEEEEEEECCCCEEEEEEEEEEEEE
T ss_conf             9679999995899----979998245279979999999999998795389842711668997349960788999999999


Q ss_pred             CC-------CCCCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             77-------6766781332278887479987899999844
Q gi|254780479|r  318 PQ-------IVIIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       318 ~~-------~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      ..       .....+.+|++.+++.++....-.++||+++
T Consensus        90 ~~~~~~~~~~~e~~~~~W~~~~ea~~~l~~~~~r~il~~a  129 (131)
T cd03673          90 LGGEFTPQPDEEVDEVRWLPPDEARDRLSYPNDRELLRAA  129 (131)
T ss_pred             ECCCCCCCCCCCCCEEEEEEHHHHHHHCCCHHHHHHHHHH
T ss_conf             6795320788654179992199997765998799999997


No 53 
>cd04697 Nudix_Hydrolase_38 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.54  E-value=0.00055  Score=48.58  Aligned_cols=97  Identities=8%  Similarity=0.002  Sum_probs=56.0

Q ss_pred             EEEEEEEECCCEEEEEECCCC-HHHHCCCCCCC-CCCCCCCCHHHH------HHCCCCCCCCEECCCEEEEEEEEEEEEE
Q ss_conf             320003413760577645720-13204744211-113467215888------6301010000213735899831599999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNT-RLLEGMDELPG-SAWSSTKDGNID------THSAPFTANWILCNTITHTFTHFTLTLF  311 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~-~e~~~~~~~~~~------~~~~~~~~~~~~l~~ikH~fTH~~L~i~  311 (356)
                      ..+ ++.|.+|++||+||+.. ..|.|+|+... +-...+++....      ++..........+..+.+.....+....
T Consensus         3 ~~v-~i~n~~G~iLlqkRs~~K~~~Pg~wd~s~~Ghv~~GE~~~~aa~REl~EElGI~~~~l~~~~~~~~~~~~~~~~~~   81 (126)
T cd04697           3 TYI-FVFNSEGKLCVHKRTLTKDWCPGYWDIAFGGVVQAGESYLQNAQRELEEELGIDGVQLTPLGLFYYDTDGNRVWGK   81 (126)
T ss_pred             EEE-EEECCCCCEEEEEECCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCEEEE
T ss_conf             999-9992999799998579998799613457687677998999999999999868986675675899987898848999


Q ss_pred             EEEEECCC-----CCCCCCCEEECHHHHHHC
Q ss_conf             99998077-----676678133227888747
Q gi|254780479|r  312 VWKTIVPQ-----IVIIPDSTWHDAQNLANA  337 (356)
Q Consensus       312 v~~~~~~~-----~~~~~~~~Wv~~~el~~~  337 (356)
                      +|.+..+.     +...++.+|++++||.+.
T Consensus        82 vf~~~~d~~l~~~~eEV~~~~w~~~~El~~~  112 (126)
T cd04697          82 VFSCVYDGPLKLQEEEVEEITWLSINEILQF  112 (126)
T ss_pred             EEEEEECCCCCCCHHHHHEEEECCHHHHHHH
T ss_conf             9999999999599677311798039999989


No 54 
>cd04680 Nudix_Hydrolase_21 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.53  E-value=0.0005  Score=48.85  Aligned_cols=104  Identities=13%  Similarity=0.110  Sum_probs=62.0

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-------CEEEEEEEEEEEEEEE
Q ss_conf             200034137605776457201320474421111346721588863010100002137-------3589983159999999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-------TITHTFTHFTLTLFVW  313 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-------~ikH~fTH~~L~i~v~  313 (356)
                      ...++++.+|++||.||..    .|.|+||++..+.+++.......+...+......       .+.|........+.+|
T Consensus         3 v~~vv~~~~g~vLL~rr~~----~~~W~lPGG~ve~gEs~~~aa~RE~~EEtGl~v~~~~~l~~~~~~~~~~~~~~~~~f   78 (120)
T cd04680           3 ARAVVTDADGRVLLVRHTY----GPGWYLPGGGLERGETFAEAARRELLEELGIRLAVVAELLGVYYHSASGSWDHVIVF   78 (120)
T ss_pred             EEEEEECCCCEEEEEEECC----CCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEECCCCCCCEEEEEE
T ss_conf             9999992999099998679----991999954716998999999999999869755515479999825899975399999


Q ss_pred             EEECCC------CCCCCCCEEECHHHHHHCCCCHHHHHHHHH
Q ss_conf             998077------676678133227888747998789999984
Q gi|254780479|r  314 KTIVPQ------IVIIPDSTWHDAQNLANAALPTVMKKALSA  349 (356)
Q Consensus       314 ~~~~~~------~~~~~~~~Wv~~~el~~~~LPs~~kKIL~a  349 (356)
                      .+....      .....+..|+++++|.. +++-.++++|..
T Consensus        79 ~~~~~~~~~~~~~~E~~e~~Wf~~d~LPe-~l~p~~~~~i~~  119 (120)
T cd04680          79 RARADTQPVIRPSHEISEARFFPPDALPE-PTTPATRRRIAE  119 (120)
T ss_pred             EEEECCCCCCCCCHHHHCEEEECHHHCCC-CCCHHHHHHHHC
T ss_conf             99824888679983674113988889888-789779999753


No 55 
>cd04687 Nudix_Hydrolase_28 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.53  E-value=0.00076  Score=47.59  Aligned_cols=96  Identities=19%  Similarity=0.178  Sum_probs=56.3

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCEEE--------EEEEE
Q ss_conf             200034137605776457201320474421111346721588863010100002------1373589--------98315
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTITH--------TFTHF  306 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~ikH--------~fTH~  306 (356)
                      ..++|+ ++|++||.||.+.+  +.+|.||++..+.++........+..++...      .+..+.+        ...++
T Consensus         4 a~AvI~-~~~~vLL~r~~~~~--~~~w~lPGG~ve~GEt~~~Al~RE~~EEtGl~v~v~~l~~v~e~~~~~~~~~~~~~~   80 (128)
T cd04687           4 AKAVII-KNDKILLIKHHDDG--GVWYILPGGGQEPGETLEDAAHRECKEEIGIDVEIGPLLFVREYIGHNPTSELPGHF   80 (128)
T ss_pred             EEEEEE-ECCEEEEEEECCCC--CCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEECCEEEEEEECCCCCCCCCCCCE
T ss_conf             999999-89999999983699--816989886676997999999998878639678856089999701325566788854


Q ss_pred             EEEEEEEEEECCCCC-----CCC-----CCEEECHHHHHHCCC
Q ss_conf             999999999807767-----667-----813322788874799
Q gi|254780479|r  307 TLTLFVWKTIVPQIV-----IIP-----DSTWHDAQNLANAAL  339 (356)
Q Consensus       307 ~L~i~v~~~~~~~~~-----~~~-----~~~Wv~~~el~~~~L  339 (356)
                      +.-..+|.+......     ...     +.+|+++++|.++++
T Consensus        81 h~v~~~f~~~~~~g~~~~~~~~~d~~~~~~~Wv~ldeL~~~~l  123 (128)
T cd04687          81 HQVELMFECKIKSGTPAKTPSKPDPNQIGVEWLKLKELGDIPL  123 (128)
T ss_pred             EEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEECHHHHCCCCC
T ss_conf             9999999999868954667778876657899804999566988


No 56 
>cd04688 Nudix_Hydrolase_29 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.51  E-value=0.0011  Score=46.51  Aligned_cols=95  Identities=15%  Similarity=0.116  Sum_probs=55.6

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCC------CCCCEECCCEEEEEEE----EEEEEE
Q ss_conf             000341376057764572013204744211113467215888630101------0000213735899831----599999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPF------TANWILCNTITHTFTH----FTLTLF  311 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~------~~~~~~l~~ikH~fTH----~~L~i~  311 (356)
                      .++++ ++|++||.+++..    ++|.||++..+.++........+..      ......+..+.+.|++    ++--..
T Consensus         5 ~aii~-~~~~vLl~~~~~~----~~w~lPGG~ve~gEt~~eA~~RE~~EEtGl~v~~~~l~~v~e~~f~~~~~~~h~i~~   79 (126)
T cd04688           5 AAIII-HNGKLLVQKNPDE----TFYRPPGGGIEFGESSEEALIREFKEELGLKIEITRLLGVVENIFTYNGKPGHEIEF   79 (126)
T ss_pred             EEEEE-ECCEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEEEECCCCCCEEEEE
T ss_conf             99999-8999999997799----989998534369989999998888887596788246999997688729975479999


Q ss_pred             EEEEECCCCC-------------CCCCCEEECHHHHHHCCC-CH
Q ss_conf             9999807767-------------667813322788874799-87
Q gi|254780479|r  312 VWKTIVPQIV-------------IIPDSTWHDAQNLANAAL-PT  341 (356)
Q Consensus       312 v~~~~~~~~~-------------~~~~~~Wv~~~el~~~~L-Ps  341 (356)
                      .|.+......             ...+.+|++++++.++++ |.
T Consensus        80 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~~Wv~ldeL~~l~l~P~  123 (126)
T cd04688          80 YYLVTLLDESLYQQDIEILEEEGEKIVFRWIPIDELKEIKLYPE  123 (126)
T ss_pred             EEEEEECCCCCCCCCCCCCCCCCCEEEEEECCHHHHCCCCCCCC
T ss_conf             99999978855778764327798799999847999276887970


No 57 
>cd04689 Nudix_Hydrolase_30 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U=I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate sp
Probab=97.51  E-value=0.002  Score=44.69  Aligned_cols=103  Identities=10%  Similarity=0.091  Sum_probs=62.4

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH------CCCCCCCCEECCCEEEEEEEE----EEEEE
Q ss_conf             00034137605776457201320474421111346721588863------010100002137358998315----99999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH------SAPFTANWILCNTITHTFTHF----TLTLF  311 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~------~~~~~~~~~~l~~ikH~fTH~----~L~i~  311 (356)
                      +..++.++|++||.||...+    .|.||++..+.++.......      ..........++.+.|.|+..    +--..
T Consensus         4 a~aii~~~~~vLl~~~~~~~----~~~lPGG~ve~gE~~~~a~~RE~~EE~G~~v~~~~~l~~~e~~~~~~~~~~H~i~~   79 (125)
T cd04689           4 ARAIVRAGNKVLLARVIGQP----HYFLPGGHVEPGETAENALRRELQEELGVAVSDGRFLGAIENQWHEKGVRTHEINH   79 (125)
T ss_pred             EEEEEEECCEEEEEEECCCC----CEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEECEEEEEEEEEEECCCCCEEEEEE
T ss_conf             99999989999999988999----58999798569989999999999998799789465999997467759965189999


Q ss_pred             EEEEECCC------CC---CCCCCEEECHHHHHHCCCCHHHHHHHHHC
Q ss_conf             99998077------67---66781332278887479987899999844
Q gi|254780479|r  312 VWKTIVPQ------IV---IIPDSTWHDAQNLANAALPTVMKKALSAG  350 (356)
Q Consensus       312 v~~~~~~~------~~---~~~~~~Wv~~~el~~~~LPs~~kKIL~al  350 (356)
                      +|.+..+.      ..   ..-+..|+++++++-  .|++.+-+|..|
T Consensus        80 ~f~~~~~~~~~~~~~~~d~d~~~~~WvpL~eL~l--yP~~l~d~i~~~  125 (125)
T cd04689          80 IFAVESSWLASDGPPQADEDHLSFSWVPVSDLSL--YPALLRDLVKGL  125 (125)
T ss_pred             EEEEECCCCCCCCCCCCCCCEEEEEEECHHHHHH--CCHHHHHHHHCC
T ss_conf             9999525545568988888667899971899534--877588885349


No 58 
>pfam00633 HHH Helix-hairpin-helix motif. The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain.
Probab=97.50  E-value=2.9e-05  Score=57.54  Aligned_cols=30  Identities=43%  Similarity=0.605  Sum_probs=27.5

Q ss_pred             CCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             057868731456888887778776677664
Q gi|254780479|r  108 KYEGNFPHKVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus       108 ~~~g~~P~~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      .++|..|.+.++|++|||||+|||.+|+++
T Consensus         1 ~~~~~~~as~eeL~~lpGVG~~tA~~I~~~   30 (30)
T pfam00633         1 SLEGLIPASREELLALPGVGPKTAEAILSY   30 (30)
T ss_pred             CCCCCCCCCHHHHHHCCCCCHHHHHHHHCC
T ss_conf             964435235999972889776889988539


No 59 
>cd04693 Nudix_Hydrolase_34 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.41  E-value=0.0006  Score=48.29  Aligned_cols=94  Identities=12%  Similarity=0.098  Sum_probs=47.4

Q ss_pred             EEEEECCCEEEEEEC-CCCHHHHCCCCCCCCCC-CCCCCHHHHHHCCCCCCCCEEC--C---CEEEEEEEE--EEEEEEE
Q ss_conf             003413760577645-72013204744211113-4672158886301010000213--7---358998315--9999999
Q gi|254780479|r  243 FIAITNDNRILLRKR-TNTRLLEGMDELPGSAW-SSTKDGNIDTHSAPFTANWILC--N---TITHTFTHF--TLTLFVW  313 (356)
Q Consensus       243 ~vii~~~~kiLL~KR-p~~gll~GLwEFP~~e~-~~~~~~~~~~~~~~~~~~~~~l--~---~ikH~fTH~--~L~i~v~  313 (356)
                      .++.|.+|++||+|| +.+.++.|+|++|.+.. ..++.... ............+  .   .+.+.+...  .-.+.++
T Consensus         5 v~v~n~~g~iLlqkRs~~K~~~Pg~W~~~~gg~v~~gE~~ea-a~RE~~EElGi~l~~~~~~~~~~~~~~~~~~~~~~~~   83 (127)
T cd04693           5 VCIFNSKGELLLQKRSPNKDGWPGMWDLSVGGHVQAGETSTA-AEREVKEELGLELDFSELRPLFRYFFEAEGFDDYYLF   83 (127)
T ss_pred             EEEECCCCCEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHH-HHHHHHHHHCCCCCCCCCEEEEEEECCCCCEEEEEEE
T ss_conf             999939998999966899889997276788755128985999-9999999717701766647889999269925899999


Q ss_pred             EEEC------CCCCCCCCCEEECHHHHHHC
Q ss_conf             9980------77676678133227888747
Q gi|254780479|r  314 KTIV------PQIVIIPDSTWHDAQNLANA  337 (356)
Q Consensus       314 ~~~~------~~~~~~~~~~Wv~~~el~~~  337 (356)
                      ....      .+.....+++|++.++|.+.
T Consensus        84 ~~~~~~~~~~~~~~EV~d~~Wvs~eEl~~~  113 (127)
T cd04693          84 YADVEIGKLILQKEEVDEVKFVSKDEIDGL  113 (127)
T ss_pred             EEEEECCCCCCCHHHEEEEEEECHHHHHHH
T ss_conf             970378987289378246899679999999


No 60 
>KOG1918 consensus
Probab=97.34  E-value=0.0011  Score=46.53  Aligned_cols=148  Identities=21%  Similarity=0.373  Sum_probs=111.5

Q ss_pred             CCHHHHHHHHHHHHHCCHHHCCHHHHHHH------HHCCCHHHHHCCCHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHCC
Q ss_conf             87046766446731161203326589987------748997898729989999997404634-58899987767543005
Q gi|254780479|r   37 PSPYKVWISEIMLQQTTVKTVEPYFKKFM------QKWPTIFCLSSAKDEEILSAWAGLGYY-TRARNLKKCADIIVKKY  109 (356)
Q Consensus        37 ~~py~v~vseimlqqT~v~~v~~~~~~~~------~~~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~  109 (356)
                      +.||.-+++-|..||---+..-..|.||+      ..||+++..-.++-+++.+.    |+- +.|-.||..|..-.+.|
T Consensus        73 q~Pf~~LiraIlsQQLs~kAansI~~Rfvsl~~g~~~~~~pe~i~~~~~~~lrkc----G~S~rK~~yLh~lA~~~~ng~  148 (254)
T KOG1918          73 QTPFERLIRAILSQQLSGKAANSIYNRFVSLCGGAEKFPTPEFIDPLDCEELRKC----GFSKRKASYLHSLAEAYTNGY  148 (254)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCHHHHHHH----CCCHHHHHHHHHHHHHHHCCC
T ss_conf             4709999999999999899999999999998479767788321276789999985----733246799999999986477


Q ss_pred             CCCCCCHH-------HHH----HHHHHHHHHHHHHHHHHHCCCE--EEECCCCHHHHHHHHHCCCC-CCCHHHHHHHHHH
Q ss_conf             78687314-------568----8888777877667766434963--13124314666576519754-5735689998741
Q gi|254780479|r  110 EGNFPHKV-------EIL----KKLPGIGDYTASAIVAIAFNHF--AVVVDTNIERIISRYFDIIK-PAPLYHKTIKNYA  175 (356)
Q Consensus       110 ~g~~P~~~-------~~l----~~LpGiG~yta~ai~s~a~~~~--~~~vD~Nv~RVl~R~~~~~~-~~~~~~k~l~~~~  175 (356)
                         +|.+.       ++|    -...|||+.|+--.+-|+.+.+  .|+=|--|++=+.-++|.+. |.   .+++.++.
T Consensus       149 ---I~s~~~i~~mseEeL~~~LT~VKGIg~Wtv~MflIfsL~R~DVmp~dDlgir~g~k~l~gl~~~p~---~~evekl~  222 (254)
T KOG1918         149 ---IPSKSGIEKMSEEELIERLTNVKGIGRWTVEMFLIFSLHRPDVMPADDLGIRNGVKKLLGLKPLPL---PKEVEKLC  222 (254)
T ss_pred             ---CCCHHHHHHCCHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCC---HHHHHHHH
T ss_conf             ---776677761579999999986047551124432110147776668503668877999808999886---59999986


Q ss_pred             HCCCCCCCH-HHHHHHHHHH
Q ss_conf             025785226-7899999999
Q gi|254780479|r  176 RKITSTSRP-GDFVQAMMDL  194 (356)
Q Consensus       176 ~~~~~~~~~-~~~nQAlMdl  194 (356)
                      +..-|..-. .=|+.-+.|+
T Consensus       223 e~~kpyRtvaawYlWki~~~  242 (254)
T KOG1918         223 EKCKPYRTVAAWYLWKIIKL  242 (254)
T ss_pred             HHCCCHHHHHHHHHHHHHHH
T ss_conf             20340477888898732343


No 61 
>cd04694 Nudix_Hydrolase_35 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=97.17  E-value=0.0012  Score=46.17  Aligned_cols=44  Identities=18%  Similarity=0.311  Sum_probs=35.2

Q ss_pred             EEEEEEEECCCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCHHHH
Q ss_conf             320003413760577645720-13204744211113467215888
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNT-RLLEGMDELPGSAWSSTKDGNID  283 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~~~~  283 (356)
                      ....++.+.+|++||.||+.. +.+.|+|+||++..+.++.....
T Consensus         3 gVa~ii~d~dgrvLL~rRs~~~~~~Pg~W~~PGG~ve~GEt~~ea   47 (143)
T cd04694           3 GVAVLLQSSDQKLLLTRRASSLRIFPNVWVPPGGHVELGENLLEA   47 (143)
T ss_pred             EEEEEEECCCCEEEEEEECCCCCCCCCCEECCCCCCCCCCCHHHH
T ss_conf             599999989992999985898889998171887745899898999


No 62 
>PRK09438 ntpA dATP pyrophosphohydrolase; Provisional
Probab=97.09  E-value=0.0022  Score=44.31  Aligned_cols=106  Identities=14%  Similarity=0.106  Sum_probs=64.6

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-------------------CEEEE
Q ss_conf             00034137605776457201320474421111346721588863010100002137-------------------35899
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-------------------TITHT  302 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-------------------~ikH~  302 (356)
                      ..||.+.++++||.||...   .|+|++|++..+.++........+..++......                   ..+|.
T Consensus        12 lvvi~~~~~~vLl~~R~~~---~~~W~~~~G~~e~gE~~~~aa~RE~~EEtGi~~~~~~~~L~D~~~~~~yeI~~~wr~R   88 (148)
T PRK09438         12 LVVIYTPDLKVLMLQRADD---PDFWQSVTGSLEEGETPAQTAAREVKEETGIDVAAEQLTLIDCQRTVEYEIFPHWRHR   88 (148)
T ss_pred             EEEEECCCCCEEEEEECCC---CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEECCCCCEEEEECCHHHCCC
T ss_conf             9999859998999997489---9967098312689979899999999998686225671364266520232114422220


Q ss_pred             E----EEEEEEEEEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHHHHHHHCCC
Q ss_conf             8----3159999999998077-----676678133227888747998789999984482
Q gi|254780479|r  303 F----THFTLTLFVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMKKALSAGGI  352 (356)
Q Consensus       303 f----TH~~L~i~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~  352 (356)
                      |    +|  -.-++|.+.++.     ..++.+++|++.++..+.-++-.++-.|..|.+
T Consensus        89 Y~pgv~~--n~EhvF~l~l~~~~~V~LsEH~~~~Wl~~~eA~~l~~s~sN~~Ai~~l~~  145 (148)
T PRK09438         89 YAPGVTH--NTEHWFCLALPHERPVVLTEHLAYQWLPAEEAAALTKSPSNAEAILQLVI  145 (148)
T ss_pred             CCCCCCC--CEEEEEEEECCCCCCCCCCHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHH
T ss_conf             4888524--16799999858899721201432452479999998669331999999986


No 63 
>cd03675 Nudix_Hydrolase_2 Contains a crystal structure of the Nudix hydrolase from Nitrosomonas europaea, which has an unknown function. In general, members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability,
Probab=97.06  E-value=0.006  Score=41.25  Aligned_cols=105  Identities=13%  Similarity=0.106  Sum_probs=59.7

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCC------CCEECCCEEEEEEEE--EEEEEEE
Q ss_conf             00034137605776457201320474421111346721588863010100------002137358998315--9999999
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTA------NWILCNTITHTFTHF--TLTLFVW  313 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~------~~~~l~~ikH~fTH~--~L~i~v~  313 (356)
                      +..++..+|++||.++...|  ++.|.+|++..+.++........+...+      ....++.+...+...  ......|
T Consensus         3 Vaavv~~~gr~LLV~~~~~~--~~~~~~PgG~~e~gE~~~~a~~RE~~EEtG~~v~~~~~~g~~~~~~~~~~~~ylr~~F   80 (134)
T cd03675           3 VAAVVERDGRFLLVEEETDG--GLVFNQPAGHLEPGESLIEAAVRETLEETGWHVEPTALLGIYQWTAPDSDTTYLRFAF   80 (134)
T ss_pred             EEEEEEECCEEEEEEEECCC--CCEEECCCEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCCEEEEEEE
T ss_conf             99999979999999996599--9729898227889989999999999997577677617999998755899961899999


Q ss_pred             EEECCCC-------CCCCCCEEECHHHHHHCC--CCHH-HHHHHH
Q ss_conf             9980776-------766781332278887479--9878-999998
Q gi|254780479|r  314 KTIVPQI-------VIIPDSTWHDAQNLANAA--LPTV-MKKALS  348 (356)
Q Consensus       314 ~~~~~~~-------~~~~~~~Wv~~~el~~~~--LPs~-~kKIL~  348 (356)
                      .++....       ....+..|++++||....  +-+| +.|.|.
T Consensus        81 ~~~~~~~~~~~~~D~eI~~a~W~tldEi~~~~~~lRsPlV~~~i~  125 (134)
T cd03675          81 AAELLEHLPDQPLDSGIVRAHWLTLEEILALAARLRSPLVLRCIE  125 (134)
T ss_pred             EEEECCCCCCCCCCCCCEEEEECCHHHHHHHHHCCCCHHHHHHHH
T ss_conf             999778998889998602417278999865254116884999999


No 64 
>cd03429 NADH_pyrophosphatase NADH pyrophosphatase, a member of the Nudix hydrolase superfamily, catalyzes the cleavage of NADH into reduced nicotinamide mononucleotide (NMNH) and AMP. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity. Members of this family are also recognized by the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. A block of 8 conserved amino acids downstream of the nudix motif is thought to give NADH pyrophosphatase its specificity for NADH. NADH pyrophosphatase forms a dimer.
Probab=96.78  E-value=0.0086  Score=40.17  Aligned_cols=92  Identities=13%  Similarity=0.114  Sum_probs=60.2

Q ss_pred             EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEE----EEEEEEEEEECC
Q ss_conf             0034137605776457201320474421111346721588863010100002137358998315----999999999807
Q gi|254780479|r  243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHF----TLTLFVWKTIVP  318 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~----~L~i~v~~~~~~  318 (356)
                      .++++.++++||.++....  .|+|.+|.+-.+.++........+...+.....+.+++..|..    .--|..|.+...
T Consensus         5 ~~V~~~~~~iLL~r~~~~~--~g~~~~~aG~ve~GEs~e~a~~REv~EE~gv~v~~i~y~~sqpwpfp~~Lm~~f~a~~~   82 (131)
T cd03429           5 VLVIDGGDRILLARQPRFP--PGMYSLLAGFVEPGESLEEAVRREVKEEVGIRVKNIRYVGSQPWPFPSSLMLGFTAEAD   82 (131)
T ss_pred             EEEEECCCEEEEECCCCCC--CCCEECCEEEECCCCCHHHCEEEEEEEEECCEECCEEEEEECCCCCCCEEEEEEEEEEC
T ss_conf             9999489889997056799--99594662664399997991189899871770221699887587998868999999976


Q ss_pred             C------CCCCCCCEEECHHHHHH
Q ss_conf             7------67667813322788874
Q gi|254780479|r  319 Q------IVIIPDSTWHDAQNLAN  336 (356)
Q Consensus       319 ~------~~~~~~~~Wv~~~el~~  336 (356)
                      .      ..+..+.+|++.+|+..
T Consensus        83 ~~e~~~~~~Ei~earWfs~del~~  106 (131)
T cd03429          83 SGEIVVDDDELEDARWFSRDEVRA  106 (131)
T ss_pred             CCCCCCCHHHEEEEEEECHHHHHH
T ss_conf             997669931621677676999866


No 65 
>cd03671 Ap4A_hydrolase_plant_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Members of this family are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one group (represented by this subfamily) and fungi/animals/archaea enzymes fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for the inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU where U is Ile, Leu, or Val), Ap4A hydrolase is structurally 
Probab=96.67  E-value=0.0096  Score=39.84  Aligned_cols=43  Identities=14%  Similarity=0.294  Sum_probs=32.5

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH
Q ss_conf             3200034137605776457201320474421111346721588863
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH  285 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~  285 (356)
                      ....++.|.+|++|+.||...   .|.|.||.+..+.++.......
T Consensus         5 ~Vg~vi~n~~~~vl~~~R~~~---~~~W~~P~G~i~~gE~~~~aa~   47 (147)
T cd03671           5 NVGVVLFNEDGKVFVGRRIDT---PGAWQFPQGGIDEGEDPEQAAL   47 (147)
T ss_pred             EEEEEEEECCCEEEEEEECCC---CCCCCCCEEEECCCCCHHHHHH
T ss_conf             789999928994999996899---8888798002479999999999


No 66 
>cd03428 Ap4A_hydrolase_human_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Ap4A hydrolases are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one subfamily and fungi/animals/archaea enzymes, represented by this subfamily, fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val) that functions as a metal binding and 
Probab=96.59  E-value=0.022  Score=37.25  Aligned_cols=96  Identities=11%  Similarity=0.067  Sum_probs=55.0

Q ss_pred             CCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCC------CCEECCCEEEEEEE----EEEEEEEEEEECC
Q ss_conf             7605776457201320474421111346721588863010100------00213735899831----5999999999807
Q gi|254780479|r  249 DNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTA------NWILCNTITHTFTH----FTLTLFVWKTIVP  318 (356)
Q Consensus       249 ~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~------~~~~l~~ikH~fTH----~~L~i~v~~~~~~  318 (356)
                      +-++||.+|..     |.|.||.+..+.++.............      .......+.+.|.+    ..-.+..|.+.++
T Consensus        16 ~~~vLLi~~~~-----g~W~~P~G~ie~gEs~~~aA~REv~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~v~~fl~~~~   90 (130)
T cd03428          16 EIEYLLLQASY-----GHWDFPKGHVEPGEDDLEAALRETEEETGITAEQLFIVLGFKETLNYQVRGKLKTVTYFLAELR   90 (130)
T ss_pred             CEEEEEEECCC-----CCEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCEEEEEECCCCCEEEEEEEEEEEEE
T ss_conf             48999999189-----9689990504899599999999999987987078785174789998168978999999999981


Q ss_pred             CC------CCCCCCEEECHHHHHHC-CCCHHHHHHHHHC
Q ss_conf             76------76678133227888747-9987899999844
Q gi|254780479|r  319 QI------VIIPDSTWHDAQNLANA-ALPTVMKKALSAG  350 (356)
Q Consensus       319 ~~------~~~~~~~Wv~~~el~~~-~LPs~~kKIL~al  350 (356)
                      ..      ....+++|++.++..++ ..+. .+.||+..
T Consensus        91 ~~~~~~~~~E~~d~~W~~~~eA~~~l~~~~-~r~il~~a  128 (130)
T cd03428          91 PDVEVKLSEEHQDYRWLPYEEALKLLTYED-LKAVLDKA  128 (130)
T ss_pred             CCCCCCCCCCEEEEEEECHHHHHHHCCCHH-HHHHHHHH
T ss_conf             786789982156789983999998839973-99999996


No 67 
>cd03426 CoAse Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes. The need of CoAses mainly arises under conditions of oxidative stress. CoAse has a conserved Nudix fold and requires a single divalent cation for catalysis. In addition to a signature Nudix motif G[X5]E[X7]REUXEEXGU, where U is  Ile, Leu, or Val, CoAse contains an additional motif upstream called the NuCoA motif (LLTXT(SA)X3RX3GX3FPGG) which is postulated to be involved in CoA recognition. CoA plays a central role in lipid metabolism. It is involved in the initial steps of fatty acid sythesis in the cytosol, in the oxidation of fatty acids and the citric acid cycle in the mitochondria, and in the oxidation of long-chain fatty acids in peroxisomes. CoA has the important role of activating fatty acids for further modification into key biological signalling molecules.
Probab=96.48  E-value=0.012  Score=39.28  Aligned_cols=87  Identities=16%  Similarity=0.143  Sum_probs=56.1

Q ss_pred             CCEEEEEECCCC-HHHHCCCCCCCCCCCCC-CCHH------HHHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCC
Q ss_conf             760577645720-13204744211113467-2158------886301010000213735899831599999999980776
Q gi|254780479|r  249 DNRILLRKRTNT-RLLEGMDELPGSAWSST-KDGN------IDTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQI  320 (356)
Q Consensus       249 ~~kiLL~KRp~~-gll~GLwEFP~~e~~~~-~~~~------~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~  320 (356)
                      +-++||.||+.. +-++|-+.||++..+.+ ++..      ..++..........++.+...+|...+.+++|.+.++..
T Consensus        15 ~~~ill~kRs~~l~~h~GqisfPGG~~e~~D~~~~~tAlREt~EEiGl~~~~~~~lg~L~~~~t~s~~~V~PfV~~~~~~   94 (157)
T cd03426          15 ELRVLLTKRASHLRSHPGQVAFPGGKVDPGDEDPVATALREAEEEIGLPPDSVEVLGRLPPYYTRSGFVVTPVVGLVPPP   94 (157)
T ss_pred             CEEEEEEEECCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCEEEEEEEEECCC
T ss_conf             67999998267667789848379960276653389999999999868991133898634871325984899999994798


Q ss_pred             C-------CCCCCEEECHHHHH
Q ss_conf             7-------66781332278887
Q gi|254780479|r  321 V-------IIPDSTWHDAQNLA  335 (356)
Q Consensus       321 ~-------~~~~~~Wv~~~el~  335 (356)
                      .       +.+...|++++++-
T Consensus        95 ~~~~~~~~EV~~vf~vPL~~ll  116 (157)
T cd03426          95 LPLVLNPDEVAEVFEVPLSFLL  116 (157)
T ss_pred             CCCCCCHHHHHHHHHCCHHHHH
T ss_conf             7657984897888140599973


No 68 
>PRK03759 isopentenyl-diphosphate delta-isomerase; Provisional
Probab=96.25  E-value=0.048  Score=34.94  Aligned_cols=101  Identities=12%  Similarity=0.089  Sum_probs=52.1

Q ss_pred             CCCEEEEEEEEEECCCEEEEEECCCCH-HHHCCCCCCCCCCC-CCCCH------HHHHHCCCCCCCC-EECCCEEEEEE-
Q ss_conf             221123200034137605776457201-32047442111134-67215------8886301010000-21373589983-
Q gi|254780479|r  235 RPMRTGAVFIAITNDNRILLRKRTNTR-LLEGMDELPGSAWS-STKDG------NIDTHSAPFTANW-ILCNTITHTFT-  304 (356)
Q Consensus       235 k~~r~~~~~vii~~~~kiLL~KRp~~g-ll~GLwEFP~~e~~-~~~~~------~~~~~~~~~~~~~-~~l~~ikH~fT-  304 (356)
                      ...|.+.++ +.+.+|++||+||...+ .+.|+|..-..... .++..      ...++........ ..++.+.+.-. 
T Consensus        32 ~lHRafsv~-l~~~~g~lllq~Rs~~K~~~pg~w~~s~~gH~~~gE~~~~aa~R~l~EElGi~~~~l~~~~~~f~Y~a~~  110 (184)
T PRK03759         32 PLHLAFSCY-LFDADGRLLVTRRALSKKTWPGVWTNSCCGHPQPGESLEDAIIRRCREELGVEITDLTLVLPDFRYRATD  110 (184)
T ss_pred             CEEEEEEEE-EECCCCCEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEECCC
T ss_conf             538899999-9959996999971589737999400354467899989999999999998698834440550038997238


Q ss_pred             -----EEEEEEEEEEEECC-----CCCCCCCCEEECHHHHHHC
Q ss_conf             -----15999999999807-----7676678133227888747
Q gi|254780479|r  305 -----HFTLTLFVWKTIVP-----QIVIIPDSTWHDAQNLANA  337 (356)
Q Consensus       305 -----H~~L~i~v~~~~~~-----~~~~~~~~~Wv~~~el~~~  337 (356)
                           -+. ..+||.+..+     ++.+..+.+|++.++|...
T Consensus       111 ~~g~~E~E-~~~Vf~~~~d~~~~pnp~EV~e~~Wv~~~eL~~~  152 (184)
T PRK03759        111 PNGIVENE-VCPVFAARATSALQPNPDEVMDYQWVDPADLLRA  152 (184)
T ss_pred             CCCCCEEE-EEEEEEEEECCCCCCCHHHHEEEEEECHHHHHHH
T ss_conf             99862257-9999999848989899689134689869999999


No 69 
>cd02885 IPP_Isomerase Isopentenyl diphosphate (IPP) isomerase, a member of the Nudix hydrolase superfamily, is a key enzyme in the isoprenoid biosynthetic pathway. Isoprenoids comprise a large family of natural products including sterols, carotenoids, dolichols and prenylated proteins. These compounds are synthesized from two precursors: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). IPP isomerase catalyzes the interconversion of IPP and DMAPP by a stereoselective antarafacial transposition of hydrogen. The enzyme requires one Mn2+ or Mg2+ ion in its active site to fold into an active conformation and also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. The metal binding site is present within the active site and plays structural and catalytical roles. IPP isomerase is well represented in several bacteria, archaebacteria and eukaryotes, including fungi, mamm
Probab=96.17  E-value=0.062  Score=34.10  Aligned_cols=101  Identities=11%  Similarity=0.024  Sum_probs=50.8

Q ss_pred             CCEEEEEEEEEECCCEEEEEECCCC-HHHHCCCCCCCCC-CCCCCCHHH------HHHCCCCCCCC-EECCCEEE--EEE
Q ss_conf             2112320003413760577645720-1320474421111-346721588------86301010000-21373589--983
Q gi|254780479|r  236 PMRTGAVFIAITNDNRILLRKRTNT-RLLEGMDELPGSA-WSSTKDGNI------DTHSAPFTANW-ILCNTITH--TFT  304 (356)
Q Consensus       236 ~~r~~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~~e-~~~~~~~~~------~~~~~~~~~~~-~~l~~ikH--~fT  304 (356)
                      ..+...++ +.+.+|++||+||+.. .++.|+|..-.+. ...++....      .++........ ..+..+.+  .+.
T Consensus        29 ~Hrav~v~-v~n~~g~lllqkRs~~K~~~Pg~wd~s~~Gh~~~GE~~~~aa~Rel~EElGi~~~~~~~~~~~~~y~~~~~  107 (165)
T cd02885          29 LHRAFSVF-LFNSKGRLLLQRRALSKYTFPGLWTNTCCSHPLPGEGVKDAAQRRLREELGITGDLLELVLPRFRYRAPDD  107 (165)
T ss_pred             EEEEEEEE-EECCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEECCCC
T ss_conf             48999999-99089979999857988689970735543467699758999999888861998543204666899932248


Q ss_pred             EE---EEEEEEEEEECC-----CCCCCCCCEEECHHHHHHC
Q ss_conf             15---999999999807-----7676678133227888747
Q gi|254780479|r  305 HF---TLTLFVWKTIVP-----QIVIIPDSTWHDAQNLANA  337 (356)
Q Consensus       305 H~---~L~i~v~~~~~~-----~~~~~~~~~Wv~~~el~~~  337 (356)
                      ..   +-..++|.+..+     ++....+++|++.++|...
T Consensus       108 ~~~~e~E~~~vy~~~~d~~~~~~~~EV~~~~w~~~~el~~~  148 (165)
T cd02885         108 GGLVEHEIDHVFFARADVTLIPNPDEVSEYRWVSLEDLKEL  148 (165)
T ss_pred             CCCCCEEEEEEEEEEECCCCCCCHHHHEEEEEECHHHHHHH
T ss_conf             98742379999999658987699579238899789999999


No 70 
>PRK01229 N-glycosylase/DNA lyase; Provisional
Probab=96.13  E-value=0.0075  Score=40.59  Aligned_cols=115  Identities=21%  Similarity=0.213  Sum_probs=76.5

Q ss_pred             HHHHHHHHHH----HHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCC--CHH-HHHHHHHHHHHHH------H
Q ss_conf             4676644673----1161203326589987748997898729989999997404--634-5889998776754------3
Q gi|254780479|r   40 YKVWISEIML----QQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGL--GYY-TRARNLKKCADII------V  106 (356)
Q Consensus        40 y~v~vseiml----qqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gL--Gyy-~Rar~l~~~a~~i------~  106 (356)
                      +.=|-||...    -||.+...+..- +-+.     +.+...+++++...-...  -|| .||+++..+-+++      .
T Consensus        34 e~~iF~EL~FCILTanssA~~~~ka~-~~l~-----~g~~~~~~eel~~~l~~~g~RF~n~rAkyIv~aR~~~~~lk~i~  107 (208)
T PRK01229         34 EEDLFSELSFCILTANSSAEGGIKAQ-KEIG-----DGFLYLSEEELREKLKEVGHRFPNLRAEYIVEARKLIGKLKEII  107 (208)
T ss_pred             HHHHHHHHHHHHHCCCCHHHHHHHHH-HHHH-----HHCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH
T ss_conf             88999999999938874077899999-9987-----40025999999999998496784226899999999988999999


Q ss_pred             HCCCCCCCCHHHHHHH-HHHHHHHHHHHHH-HHHCCCEEEECCCCHHHHHHHHHCCCC
Q ss_conf             0057868731456888-8877787766776-643496313124314666576519754
Q gi|254780479|r  107 KKYEGNFPHKVEILKK-LPGIGDYTASAIV-AIAFNHFAVVVDTNIERIISRYFDIIK  162 (356)
Q Consensus       107 ~~~~g~~P~~~~~l~~-LpGiG~yta~ai~-s~a~~~~~~~vD~Nv~RVl~R~~~~~~  162 (356)
                      +.. ....+..+-|.+ ..|+|-..|+-.+ .++| ...+++|-||.|.+.++.-++.
T Consensus       108 ~~~-~~~~e~Re~Lv~nIKG~G~KEASHFLRNiG~-~dlAIlDrHILr~l~~~g~i~~  163 (208)
T PRK01229        108 KAD-KDQFEAREFLVKNIKGIGYKEASHFLRNVGF-EDLAILDRHILRFLKRYGLIKE  163 (208)
T ss_pred             HCC-CCHHHHHHHHHHHCCCCCHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHCCCCCC
T ss_conf             706-8778999999985647667999999996585-2357769999999998289765


No 71 
>cd04692 Nudix_Hydrolase_33 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=96.12  E-value=0.063  Score=34.10  Aligned_cols=99  Identities=15%  Similarity=0.145  Sum_probs=54.0

Q ss_pred             EEEEEEEEEEC---CCEEEEEECCCCH-HHHCCCCCCC-CCCCCCCCHH------HHHHCCCCC--CCCEECCCEEEEEE
Q ss_conf             12320003413---7605776457201-3204744211-1134672158------886301010--00021373589983
Q gi|254780479|r  238 RTGAVFIAITN---DNRILLRKRTNTR-LLEGMDELPG-SAWSSTKDGN------IDTHSAPFT--ANWILCNTITHTFT  304 (356)
Q Consensus       238 r~~~~~vii~~---~~kiLL~KRp~~g-ll~GLwEFP~-~e~~~~~~~~------~~~~~~~~~--~~~~~l~~ikH~fT  304 (356)
                      +...++++ ++   ++++||+||...+ .+.|+|+... +-...++...      +.++.....  .....++.+.+.+.
T Consensus         3 ra~h~~v~-~~~~~~g~illqkRs~~K~~~Pg~wd~s~~Ghv~~gEt~~~aa~REl~EElGl~~~~~~l~~lg~~~~~~~   81 (144)
T cd04692           3 RTFHCWII-TKDEGKGYVLLQKRSANKKTYPGLWDISSAGHILAGETPLEDGIRELEEELGLDVSADDLIPLGTFKIEYD   81 (144)
T ss_pred             EEEEEEEE-ECCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHCEEEEEEEEEEE
T ss_conf             38999999-66799528999978899878998311665750457999899999899998688767688188218999974


Q ss_pred             EEE-----EEEEEEEEECC--------CCCCCCCCEEECHHHHHHC
Q ss_conf             159-----99999999807--------7676678133227888747
Q gi|254780479|r  305 HFT-----LTLFVWKTIVP--------QIVIIPDSTWHDAQNLANA  337 (356)
Q Consensus       305 H~~-----L~i~v~~~~~~--------~~~~~~~~~Wv~~~el~~~  337 (356)
                      +..     -..++|.+..+        +.....+.+|++.+++..+
T Consensus        82 ~~~~~~d~e~~~vy~~~~~~~~~~~~~~~~EV~~~~wv~~~el~~~  127 (144)
T cd04692          82 HIGKLIDREFHHVYLYELKVPLEEFTLQKEEVAGVVLIPLDEFAEL  127 (144)
T ss_pred             CCCCCEEEEEEEEEEEEECCCCCCCCCCHHHHHEEEEECHHHHHHH
T ss_conf             4998172089999999968984567899788105999879999999


No 72 
>TIGR03252 uncharacterized HhH-GPD family protein. This model describes a small, well-conserved bacterial protein family. Its sequence largely consists of a domain, HhH-GPD, found in a variety of related base excision DNA repair enzymes (see pfam00730).
Probab=96.03  E-value=0.033  Score=36.05  Aligned_cols=103  Identities=17%  Similarity=0.312  Sum_probs=70.4

Q ss_pred             CCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHC--CCHHHHHCCCHHHHHHHHHC---CCHH--HHHHHHHHHHHHHHH
Q ss_conf             6787046766446731161203326589987748--99789872998999999740---4634--588999877675430
Q gi|254780479|r   35 SLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKW--PTIFCLSSAKDEEILSAWAG---LGYY--TRARNLKKCADIIVK  107 (356)
Q Consensus        35 ~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~--P~~~~la~a~~~~vl~~w~g---LGyy--~Rar~l~~~a~~i~~  107 (356)
                      -.+||...+|-=.+=||-.++..-.==.+..+|+  -+...+|.++.|+...+.+.   +-=|  |=|+++|+.|++|++
T Consensus        14 L~~~p~AlLiGMLLDQQvPmE~AF~GP~~l~~RlG~lD~~~IA~~Dpd~f~~l~~e~PAiHRfP~SMA~Riq~l~~~iv~   93 (177)
T TIGR03252        14 LSSDPFALLTGMLLDQQVPMERAFAGPHKIARRMGSLDAEDIAKYDPQAFVALFSERPAVHRFPGSMAKRVQALAQYVVD   93 (177)
T ss_pred             HCCCCHHHHHHHHHHCCCHHHHHHCCHHHHHHHHCCCCHHHHHHCCHHHHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHH
T ss_conf             71392899999998233429998618599998837999899981799999999767961421728899999999999999


Q ss_pred             CCCCC--------CCCH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             05786--------8731---456888887778776677664
Q gi|254780479|r  108 KYEGN--------FPHK---VEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus       108 ~~~g~--------~P~~---~~~l~~LpGiG~yta~ai~s~  137 (356)
                      +|+|.        .|+-   +..|..|||.|.--|.-.++.
T Consensus        94 ~YdGda~~vW~~g~~dg~ell~Rl~~LPGFG~qKA~IFlAL  134 (177)
T TIGR03252        94 TYDGDATAVWTEGDPDGKELLRRLKALPGFGKQKAKIFLAL  134 (177)
T ss_pred             HHCCCHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHH
T ss_conf             80894988740679989999999986799619999999999


No 73 
>PRK00714 dinucleoside polyphosphate hydrolase; Reviewed
Probab=95.94  E-value=0.026  Score=36.81  Aligned_cols=107  Identities=7%  Similarity=0.096  Sum_probs=58.2

Q ss_pred             EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCC----EEEEE-----
Q ss_conf             2320003413760577645720132047442111134672158886301010000------21373----58998-----
Q gi|254780479|r  239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNT----ITHTF-----  303 (356)
Q Consensus       239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~----ikH~f-----  303 (356)
                      -....+++|.+|++|+.||..    .|.|.||.+..+.+++.......+..++..      ..++.    +.+-|     
T Consensus         9 ~nVg~vi~n~~g~vl~~~R~~----~~~Wq~PqGgid~gE~~~~aa~REl~EE~Gi~~~~v~ii~~~~~wl~Yd~P~~~~   84 (156)
T PRK00714          9 PNVGIILLNRQGQVFWGRRIG----QHSWQFPQGGIDPGETPEQAMYRELYEEVGLRPEDVEILAETRDWLRYDLPKRLV   84 (156)
T ss_pred             CCEEEEEEECCCCEEEEEECC----CCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCEEEECCHHHH
T ss_conf             763999990999299999889----9976799450079989999999999998589820004576258838985817764


Q ss_pred             -----EEEEEEEEEE---E-------EECC--CCCCCCCCEEECHHHHHHCCCC---HHHHHHHHH
Q ss_conf             -----3159999999---9-------9807--7676678133227888747998---789999984
Q gi|254780479|r  304 -----THFTLTLFVW---K-------TIVP--QIVIIPDSTWHDAQNLANAALP---TVMKKALSA  349 (356)
Q Consensus       304 -----TH~~L~i~v~---~-------~~~~--~~~~~~~~~Wv~~~el~~~~LP---s~~kKIL~a  349 (356)
                           .+|.=..+.|   .       ++++  ...+.++.+|++.+++-+.-+|   .+.+++|+.
T Consensus        85 ~~~~~~~y~GQ~QkWfl~rf~g~d~~Inl~~~~~pEF~~wkW~~~~~~~~~vv~FKr~vY~~vl~e  150 (156)
T PRK00714         85 RRDWKPVCIGQKQKWFLLRLTGDDSEINLNTSHHPEFDAWRWVSYWYPLRQVVPFKRDVYRRVLKE  150 (156)
T ss_pred             HHHCCCCCCCCEEEEEEEEEECCCCCEECCCCCCCCEEEEEECCHHHHHHHCCCCCHHHHHHHHHH
T ss_conf             243058702636899999981798517668999986233475888997776333189999999999


No 74 
>smart00525 FES FES domain. iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3)
Probab=95.74  E-value=0.0032  Score=43.16  Aligned_cols=23  Identities=39%  Similarity=1.039  Sum_probs=21.2

Q ss_pred             HCCCCCCCCCCCCCCCCHHHHHC
Q ss_conf             20412356431100000134310
Q gi|254780479|r  198 ICTSNKPLCPLCPIQKNCLTFSE  220 (356)
Q Consensus       198 iC~p~~P~C~~Cpl~~~C~~~~~  220 (356)
                      ||+|++|+|+.||++++|.++..
T Consensus         1 iC~~~kP~C~~Cpl~~~C~~~~~   23 (26)
T smart00525        1 ICTARKPRCDECPLKDLCPAYXX   23 (26)
T ss_pred             CCCCCCCCCCCCCCHHHCCHHHC
T ss_conf             96566887676821210603331


No 75 
>cd04686 Nudix_Hydrolase_27 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=95.46  E-value=0.11  Score=32.27  Aligned_cols=89  Identities=15%  Similarity=0.135  Sum_probs=51.9

Q ss_pred             EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC-------CCCEECCCEEE-------EEEEEEE
Q ss_conf             003413760577645720132047442111134672158886301010-------00021373589-------9831599
Q gi|254780479|r  243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT-------ANWILCNTITH-------TFTHFTL  308 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~-------~~~~~l~~ikH-------~fTH~~L  308 (356)
                      .|++ ++++|||.+++..    |-|.||++..+.+++.......+...       .....++.+..       -+..+++
T Consensus         5 aIi~-~~~kiLL~~~~~~----~~~~lPGG~ve~gE~~~~al~RE~~EETG~~~i~~~~~~G~~~E~r~~~~~~~~~~~m   79 (131)
T cd04686           5 AIIL-QGDKILLLYTKRY----GDYKFPGGGVEKGEDHIEGLIRELQEETGATNIRVIEKFGTYTERRPWRKPDADIFHM   79 (131)
T ss_pred             EEEE-ECCEEEEEEECCC----CCEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCCEEEEEEEEEEECCCCCEEEE
T ss_conf             9999-8999999997699----9676787737899898999999999874961013131145599984100588877989


Q ss_pred             EEEEEEEECCCCCC---CC--------CCEEECHHHHHH
Q ss_conf             99999998077676---67--------813322788874
Q gi|254780479|r  309 TLFVWKTIVPQIVI---IP--------DSTWHDAQNLAN  336 (356)
Q Consensus       309 ~i~v~~~~~~~~~~---~~--------~~~Wv~~~el~~  336 (356)
                      ..++|.|++.....   ..        .-.|+++++.-.
T Consensus        80 ~s~~Y~C~v~~~~g~~~le~~E~~~G~~pvWv~i~eAi~  118 (131)
T cd04686          80 ISYYYLCEVDAELGAQQLEDYEAELGMKPIWINIHEAIE  118 (131)
T ss_pred             EEEEEEEEECHHCCCCCCCCHHHHCCCCEEECCHHHHHH
T ss_conf             878999985122189877603442697217678999999


No 76 
>PRK10707 hypothetical protein; Provisional
Probab=95.16  E-value=0.16  Score=31.25  Aligned_cols=99  Identities=16%  Similarity=0.110  Sum_probs=56.8

Q ss_pred             CEEEEEEE--EEECCCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCH-------HHHHHCCCCCCCCEECCCEEEEEEEE
Q ss_conf             11232000--3413760577645720-13204744211113467215-------88863010100002137358998315
Q gi|254780479|r  237 MRTGAVFI--AITNDNRILLRKRTNT-RLLEGMDELPGSAWSSTKDG-------NIDTHSAPFTANWILCNTITHTFTHF  306 (356)
Q Consensus       237 ~r~~~~~v--ii~~~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~-------~~~~~~~~~~~~~~~l~~ikH~fTH~  306 (356)
                      .|.-.++|  +.+.+..+||.+|+.. .-.+|...||++..+..+..       +..++..........++.+....|..
T Consensus        29 ~r~AAVLvpl~~~~~~~iLlt~Rs~~L~~H~GqIsFPGGk~e~~D~~~~~TALREt~EEiGl~~~~v~vlG~L~~~~t~s  108 (190)
T PRK10707         29 QRQAAVLIPIVRRPQPGLLLTQRAIHLRKHAGQVAFPGGAVDDTDASLIAAALREAEEEVAIPPSAVEVIGVLPPVDSVT  108 (190)
T ss_pred             CCCEEEEEEEEECCCCEEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCEECCC
T ss_conf             86537999999889958999975653567999884899676888888899999988898689943258986347356678


Q ss_pred             EEEEEEEEEECCCCC-------CCCCCEEECHHHHH
Q ss_conf             999999999807767-------66781332278887
Q gi|254780479|r  307 TLTLFVWKTIVPQIV-------IIPDSTWHDAQNLA  335 (356)
Q Consensus       307 ~L~i~v~~~~~~~~~-------~~~~~~Wv~~~el~  335 (356)
                      ...+.++.+.++...       +.++..|++++++-
T Consensus       109 g~~V~P~Vg~i~~~~~~~~~~~EV~~vf~vPL~~ll  144 (190)
T PRK10707        109 GYQVTPVVGIIPPNLPYRASEDEVAAVFEMPLAEAL  144 (190)
T ss_pred             CCEEEEEEEEECCCCCCCCCHHHHHHHEECCHHHHH
T ss_conf             988999999977887778984884723554199962


No 77 
>cd04685 Nudix_Hydrolase_26 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=95.12  E-value=0.13  Score=31.86  Aligned_cols=96  Identities=15%  Similarity=0.110  Sum_probs=54.5

Q ss_pred             EEEEECCCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEEC---C----CEEEEEE----EEEEEE
Q ss_conf             003413760577645720-132047442111134672158886301010000213---7----3589983----159999
Q gi|254780479|r  243 FIAITNDNRILLRKRTNT-RLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILC---N----TITHTFT----HFTLTL  310 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l---~----~ikH~fT----H~~L~i  310 (356)
                      .++++.+|++||.+|... .--.+.|.+|++..+.++........+.........   +    .....|+    +++-.-
T Consensus         5 vvl~d~~grvLL~r~~~~~~~~~~~W~~pgG~~~~gE~~~~a~~RE~~EE~G~~~~~~~~~v~~r~~~f~~~g~~~~~~E   84 (133)
T cd04685           5 VVLLDPDDRVLLLRGDDPDSPGPDWWFTPGGGVEPGESPEQAARRELREETGITVADLGPPVWRRDAAFTFLGVDGRQEE   84 (133)
T ss_pred             EEEECCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEECCCCCCEEEEEEEEECCCCCCCCEE
T ss_conf             99986999999997447899999918599874889869899999999987597800244637999678703787100068


Q ss_pred             EEEEEECCC------------CCCCCCCEEECHHHHHHCC
Q ss_conf             999998077------------6766781332278887479
Q gi|254780479|r  311 FVWKTIVPQ------------IVIIPDSTWHDAQNLANAA  338 (356)
Q Consensus       311 ~v~~~~~~~------------~~~~~~~~Wv~~~el~~~~  338 (356)
                      .+|.+....            .....+.+|.+.+||...+
T Consensus        85 ~fflvr~~~~e~~~~~~t~~E~~~i~e~RWws~~EL~~t~  124 (133)
T cd04685          85 RFFLARTPRTEPSPAGWTALERRSILGWRWWTRAELAATP  124 (133)
T ss_pred             EEEEEEECCCEECCCCCCHHHHHHCCCCCCCCHHHHHHCC
T ss_conf             9999994686035766766664010255258899993279


No 78 
>PRK05379 bifunctional nicotinamide mononucleotide adenylyltransferase/ADP-ribose pyrophosphatase; Provisional
Probab=95.00  E-value=0.11  Score=32.29  Aligned_cols=109  Identities=13%  Similarity=0.065  Sum_probs=58.2

Q ss_pred             EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE------CCCE--EEEEEEE----
Q ss_conf             232000341376057764572013204744211113467215888630101000021------3735--8998315----
Q gi|254780479|r  239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL------CNTI--THTFTHF----  306 (356)
Q Consensus       239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~------l~~i--kH~fTH~----  306 (356)
                      ...-.|++ .+|++||.+|... -..|+|.||++-.+.++........+..++....      .+.+  .++|+|-    
T Consensus       203 vT~DaVVv-~~ghVLLIrR~~~-P~kG~WALPGGfv~~~Etl~~aa~REL~EETgl~~~~~~l~~~l~~~~vfd~P~Rdp  280 (340)
T PRK05379        203 VTVDAVVV-QSGHVLLVRRRAE-PGKGLWALPGGFVGQDETLLDACLRELREETGLKLPEPVLRGSIRDQQVFDHPGRSL  280 (340)
T ss_pred             EEEEEEEE-ECCEEEEEEECCC-CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCCCCCCCCC
T ss_conf             65316999-8898999941899-988887589886999999899999988641268876203220012432478999999


Q ss_pred             --EEEEEEEEEECCCC--------CCCCCCEEECHHHHHHCCCCH--HHHHHHHH
Q ss_conf             --99999999980776--------766781332278887479987--89999984
Q gi|254780479|r  307 --TLTLFVWKTIVPQI--------VIIPDSTWHDAQNLANAALPT--VMKKALSA  349 (356)
Q Consensus       307 --~L~i~v~~~~~~~~--------~~~~~~~Wv~~~el~~~~LPs--~~kKIL~a  349 (356)
                        ..-.+.|.+.++..        ....+.+|++++++...+-.-  =+..||+.
T Consensus       281 RGrvvt~a~~~~l~~~~lp~v~agDDA~~A~W~pl~el~~~~~~~feDH~~Il~~  335 (340)
T PRK05379        281 RGRTITHAFLFEFPAGELPAVKGGDDADKARWVPLAELAAMRDRMFEDHFQIITH  335 (340)
T ss_pred             CCCEEEEEEEEEECCCCCCCCCCCCCHHHCEEEEHHHHHHCHHHHHCCHHHHHHH
T ss_conf             8778999999972677667878998556577653999654777564409999999


No 79 
>COG1051 ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]
Probab=94.97  E-value=0.37  Score=28.70  Aligned_cols=107  Identities=17%  Similarity=0.204  Sum_probs=55.8

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEE--EEE-------EEEEEEEE
Q ss_conf             200034137605776457201320474421111346721588863010100002137358--998-------31599999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTIT--HTF-------THFTLTLF  311 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ik--H~f-------TH~~L~i~  311 (356)
                      .+.+++..++++||.||... =..|.|.||++..+.++........+..++.......+.  -+|       ..+.+.+.
T Consensus        12 ~v~~~i~~~~~iLLvrR~~~-p~~g~WalPGG~ve~GEt~eeaa~REl~EETgL~~~~~~~~~v~~~~~rd~r~~~v~~~   90 (145)
T COG1051          12 AVGALIVRNGRILLVRRANE-PGAGYWALPGGFVEIGETLEEAARRELKEETGLRVRVLELLAVFDDPGRDPRGHHVSFL   90 (145)
T ss_pred             EEEEEEECCCEEEEEEECCC-CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCEEEEEEE
T ss_conf             57899942885999986688-88986818866278998889999999999878865430688885488878852699999


Q ss_pred             EEEEECCC-CC-----CCCCCEEECHHHHHHCCCC--HHHHHHHH
Q ss_conf             99998077-67-----6678133227888747998--78999998
Q gi|254780479|r  312 VWKTIVPQ-IV-----IIPDSTWHDAQNLANAALP--TVMKKALS  348 (356)
Q Consensus       312 v~~~~~~~-~~-----~~~~~~Wv~~~el~~~~LP--s~~kKIL~  348 (356)
                      ++.+.... ..     ....-.|++.+++....+|  ..+..+..
T Consensus        91 ~~~~~~~g~~~~~~~~d~~~~~~~~~~~l~~~~~~~~~~~~~l~~  135 (145)
T COG1051          91 FFAAEPEGELLAGDGDDAAEVGWFPLDELPELPLPLAFTLADLRR  135 (145)
T ss_pred             EEEECCCCCCCCCCHHHHHHCCEECHHHCCCCCCCCCCCHHHHHH
T ss_conf             999714786666772647627526454554544431022999999


No 80 
>cd04674 Nudix_Hydrolase_16 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=94.74  E-value=0.11  Score=32.31  Aligned_cols=44  Identities=16%  Similarity=0.086  Sum_probs=30.6

Q ss_pred             EEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCC
Q ss_conf             034137605776457201320474421111346721588863010
Q gi|254780479|r  244 IAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAP  288 (356)
Q Consensus       244 vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~  288 (356)
                      .++..++++||.||... --.|+|.||++..+.++........+.
T Consensus         9 ~~~~~~d~vLl~rR~~~-P~~G~w~lPgGf~e~gEt~~~aa~RE~   52 (118)
T cd04674           9 ALLPVDDGLLVIRRGIE-PGRGKLALPGGFIELGETWQDAVAREL   52 (118)
T ss_pred             EEEEECCEEEEEEECCC-CCCCCCCCCEEECCCCCCHHHHHHHHH
T ss_conf             99997999999985999-888858487537549999999999999


No 81 
>cd04665 Nudix_Hydrolase_8 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=94.51  E-value=0.18  Score=30.92  Aligned_cols=48  Identities=15%  Similarity=0.197  Sum_probs=33.9

Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC
Q ss_conf             20003413760577645720132047442111134672158886301010000
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW  293 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~  293 (356)
                      .+.++-..++++||.+++..|     |+||++..+.++.......++...+..
T Consensus         2 sV~vi~~~~~~~Ll~~~~~rg-----w~~PGG~~e~gEt~~~aa~RE~~EEtG   49 (118)
T cd04665           2 SVLVICFYDDGLLLVRHKDRG-----WEFPGGHVEPGETIEEAARREVWEETG   49 (118)
T ss_pred             EEEEEEEECCEEEEEEECCCC-----EECCCCCCCCCCCHHHHHHHHHHHHHC
T ss_conf             599999789999999928997-----879830417999999999998877329


No 82 
>pfam05559 DUF763 Protein of unknown function (DUF763). This family consists of several uncharacterized bacterial and archaeal proteins of unknown function.
Probab=94.47  E-value=0.041  Score=35.39  Aligned_cols=34  Identities=26%  Similarity=0.345  Sum_probs=23.1

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHHHHHH---CCCEEEE
Q ss_conf             687314568888877787766776643---4963131
Q gi|254780479|r  112 NFPHKVEILKKLPGIGDYTASAIVAIA---FNHFAVV  145 (356)
Q Consensus       112 ~~P~~~~~l~~LpGiG~yta~ai~s~a---~~~~~~~  145 (356)
                      .=|+|+++|+.+||||+.|..|++-+|   ||.|.--
T Consensus       263 ~~p~dfeeLLl~~GvGp~TlRALaLvaElIyg~p~s~  299 (319)
T pfam05559       263 RNPEDFEELLLLKGVGPSTLRALALVAEVIYGTPPSF  299 (319)
T ss_pred             CCCCCHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCC
T ss_conf             6801699997147988899999999999980899886


No 83 
>cd03672 Dcp2p mRNA decapping enzyme 2 (Dcp2p), the catalytic subunit, and Dcp1p are the two components of the decapping enzyme complex. Decapping is a key step in both general and nonsense-mediated 5'-3' mRNA-decay pathways. Dcp2p contains an all-alpha helical N-terminal domain and a C-terminal domain which has the Nudix fold. While decapping is not dependent on the N-terminus of Dcp2p, it does affect its efficiency. Dcp1p binds the N-terminal domain of Dcp2p stimulating the decapping activity of Dcp2p. Decapping permits the degradation of the transcript and is a site of numerous control inputs. It is responsible for nonsense-mediated decay as well as AU-rich element (ARE)-mediated decay. In addition, it may also play a role in the levels of mRNA. Enzymes belonging to the Nudix superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V).
Probab=94.20  E-value=0.24  Score=29.96  Aligned_cols=89  Identities=17%  Similarity=0.208  Sum_probs=49.3

Q ss_pred             EEEEECC-CEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECC-CEEEEEEEEEEEEEEEE
Q ss_conf             0034137-605776457201320474421111346721588863010100002------137-35899831599999999
Q gi|254780479|r  243 FIAITND-NRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCN-TITHTFTHFTLTLFVWK  314 (356)
Q Consensus       243 ~vii~~~-~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~-~ikH~fTH~~L~i~v~~  314 (356)
                      +++++.+ +++||.|+..    ++-|.||.+..+.++........+..++..-      ... -|.+.+--..+.+.+..
T Consensus         6 aIiln~~~~kvLLVk~~~----~~~W~fPkGki~~~E~~~~~AiREv~EEtG~Dv~~~i~~~~~Ie~~~~~q~~rlyiv~   81 (145)
T cd03672           6 AIILNEDLDKVLLVKGWK----SKSWSFPKGKINKDEDDHDCAIREVYEETGFDISKYIDKDDYIELIIRGQNVKLYIVP   81 (145)
T ss_pred             EEEEECCCCEEEEEEECC----CCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHCCCCEEEEEEECCEEEEEEEEC
T ss_conf             999918999899999438----9958599641689979899998887773275754215846069999789089999982


Q ss_pred             EECC--------CCCCCCCCEEECHHHHHH
Q ss_conf             9807--------767667813322788874
Q gi|254780479|r  315 TIVP--------QIVIIPDSTWHDAQNLAN  336 (356)
Q Consensus       315 ~~~~--------~~~~~~~~~Wv~~~el~~  336 (356)
                       .++        ...+.+...|++.++|..
T Consensus        82 -gVp~dt~F~p~TRkEI~~I~W~~i~~Lp~  110 (145)
T cd03672          82 -GVPEDTPFEPKTRKEISKIEWFDIKDLPT  110 (145)
T ss_pred             -CCCCCCCCCCCCCCCEEEEEEEEHHHHHH
T ss_conf             -88888751788723276578889888444


No 84 
>pfam10576 EndIII_4Fe-2S Iron-sulfur binding domain of endonuclease III. Escherichia coli endonuclease III (EC 4.2.99.18) is a DNA repair enzyme that acts both as a DNA N-glycosylase, removing oxidized pyrimidines from DNA, and as an apurinic/apyrimidinic (AP) endonuclease, introducing a single-strand nick at the site from which the damaged base was removed. Endonuclease III is an iron-sulfur protein that binds a single 4Fe-4S cluster. The 4Fe-4S cluster does not seem to be important for catalytic activity, but is probably involved in the proper positioning of the enzyme along the DNA strand. The 4Fe-4S cluster is bound by four cysteines which are all located in a 17 amino acid region at the C-terminal end of endonuclease III. A similar region is also present in the central section of mutY and in the C-terminus of ORF-10 and of the Micro-coccus UV endonuclease.
Probab=93.38  E-value=0.025  Score=36.95  Aligned_cols=21  Identities=38%  Similarity=0.999  Sum_probs=19.0

Q ss_pred             CCCCCCCCCCCCCCCCHHHHH
Q ss_conf             041235643110000013431
Q gi|254780479|r  199 CTSNKPLCPLCPIQKNCLTFS  219 (356)
Q Consensus       199 C~p~~P~C~~Cpl~~~C~~~~  219 (356)
                      |+|++|+|+.|||++.|....
T Consensus         1 Ct~rkP~C~~Cpl~~~C~~~~   21 (26)
T pfam10576         1 CTARKPKCEECPLADLCXXXX   21 (26)
T ss_pred             CCCCCCCCCCCCHHHHHHHCC
T ss_conf             987898766586898774101


No 85 
>PRK00241 nudC NADH pyrophosphatase; Reviewed
Probab=92.99  E-value=0.81  Score=26.28  Aligned_cols=115  Identities=11%  Similarity=0.053  Sum_probs=69.4

Q ss_pred             CCCCCCCCCCCCCCCEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEE
Q ss_conf             00123443222222112320003413760577645720132047442111134672158886301010000213735899
Q gi|254780479|r  223 SHLLGINTIKKKRPMRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHT  302 (356)
Q Consensus       223 ~~~~P~kk~KKkk~~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~  302 (356)
                      ...||...|-       ..  +++.+++++||.+.+.  +=.|+|...-+-.+.++..+.....+..++....++.+++.
T Consensus       126 ~~~yPri~Pa-------VI--vlV~~gd~iLLar~~~--~~~~~yS~lAGFvE~GEslE~aV~REV~EE~Gi~V~~i~Y~  194 (257)
T PRK00241        126 ERYYPRIAPC-------II--VAVRRGDEILLARHPR--HRNGVYTVLAGFVEVGESLEQCVAREVMEEVGIKVKNLRYV  194 (257)
T ss_pred             CCCCCCCCCE-------EE--EEEEECCEEEECCCCC--CCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCEECCEEEE
T ss_conf             6316887976-------99--9997299887614788--99985230100365785899975351023467287540175


Q ss_pred             EEE-----EEEEEEEEEEEC------CCCCCCCCCEEECHHHHHHCCCC-HHHHHHHHH
Q ss_conf             831-----599999999980------77676678133227888747998-789999984
Q gi|254780479|r  303 FTH-----FTLTLFVWKTIV------PQIVIIPDSTWHDAQNLANAALP-TVMKKALSA  349 (356)
Q Consensus       303 fTH-----~~L~i~v~~~~~------~~~~~~~~~~Wv~~~el~~~~LP-s~~kKIL~a  349 (356)
                      -|.     ..|-+- |.+..      .+..+..+.+|++.+++..+|-+ |.-+++|++
T Consensus       195 ~SQpWPFP~sLMlG-f~A~~~~~ei~~d~~Ei~~A~Wf~~delp~lPp~~SIAr~LIe~  252 (257)
T PRK00241        195 GSQPWPFPRSLMLG-FHADYDSGEIVFDPKEIADAQWFTYDELPLLPPSGTIARRLIED  252 (257)
T ss_pred             EECCCCCCHHHHEE-EEEEECCCCCCCCHHHEEEEEEECHHHHHHCCCCCCHHHHHHHH
T ss_conf             41588897411210-69995688632583330224745689842069976599999999


No 86 
>cd04661 MRP_L46 Mitochondrial ribosomal protein L46 (MRP L46) is a component of the large subunit (39S) of the mammalian mitochondrial ribosome and a member of the Nudix hydrolase superfamily. MRPs are thought to be involved in the maintenance of the mitochondrial DNA. In general, members of the Nudix superfamily require a divalent cation, such as Mg2+ or Mn2+, for activity and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. MRP L46 appears to contain a modified nudix motif.
Probab=92.99  E-value=0.39  Score=28.52  Aligned_cols=82  Identities=10%  Similarity=0.019  Sum_probs=39.9

Q ss_pred             HCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCC--EE-EEEEEEE--------------EEEEEEEEECC-----CCC
Q ss_conf             04744211113467215888630101000021373--58-9983159--------------99999999807-----767
Q gi|254780479|r  264 EGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNT--IT-HTFTHFT--------------LTLFVWKTIVP-----QIV  321 (356)
Q Consensus       264 ~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~--ik-H~fTH~~--------------L~i~v~~~~~~-----~~~  321 (356)
                      .+.|.||.+.+++++......+..........+..  +- +-+.|++              -.+.+|.+.+-     ...
T Consensus        25 ~~~W~fP~~~~~~~EtLr~~Aer~L~~~~G~~~~~~~vgnaP~g~~k~k~p~~~~~~~~~G~K~Ff~ka~~~~G~~~~~~  104 (132)
T cd04661          25 QNHWILPQGKREEGETLRQTAERTLKELCGNNLKAKFYGNAPVGFYKYKYPKAVRNEGIVGAKVFFFKARYMSGQFELSQ  104 (132)
T ss_pred             CCEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCCEEEEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCC
T ss_conf             88802884246678548999999999860777367896268758898756852244466634799999999558646567


Q ss_pred             CCCCCEEECHHHHHHCCCCHHHHH
Q ss_conf             667813322788874799878999
Q gi|254780479|r  322 IIPDSTWHDAQNLANAALPTVMKK  345 (356)
Q Consensus       322 ~~~~~~Wv~~~el~~~~LPs~~kK  345 (356)
                      ...++.|++.+||.++--|.....
T Consensus       105 ~~~df~Wltk~EL~~~l~~~y~~~  128 (132)
T cd04661         105 NQVDFKWLAKEELQKYLNPPYLQS  128 (132)
T ss_pred             CCCEEEECCHHHHHHHCCHHHHHH
T ss_conf             752038708999987659999988


No 87 
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=92.42  E-value=0.22  Score=30.20  Aligned_cols=90  Identities=23%  Similarity=0.303  Sum_probs=61.6

Q ss_pred             HHHHCCCHHHHHHHHHCC--CHHH-HHHHHHHHHHHH------HHCCCCCCCC-HHHHHH-HHHHHHHHHHHHHH-HHHC
Q ss_conf             898729989999997404--6345-889998776754------3005786873-145688-88877787766776-6434
Q gi|254780479|r   72 FCLSSAKDEEILSAWAGL--GYYT-RARNLKKCADII------VKKYEGNFPH-KVEILK-KLPGIGDYTASAIV-AIAF  139 (356)
Q Consensus        72 ~~la~a~~~~vl~~w~gL--Gyy~-Rar~l~~~a~~i------~~~~~g~~P~-~~~~l~-~LpGiG~yta~ai~-s~a~  139 (356)
                      +.+-..++||+-..-.-.  -||+ ||+.+-.+-+.+      ++  ....+. ..+-|. ...|||=.-|+-.+ .++|
T Consensus        66 ~gfly~~~eEL~e~Lk~~g~Rf~n~raeyIVeaR~~~~~lk~~v~--~~~~~~vaRE~Lv~nikGiGyKEASHFLRNVG~  143 (210)
T COG1059          66 DGFLYLSEEELREKLKEVGYRFYNVRAEYIVEAREKFDDLKIIVK--ADENEKVARELLVENIKGIGYKEASHFLRNVGF  143 (210)
T ss_pred             CCCCCCCHHHHHHHHHHHCCHHCCCCHHHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHCCH
T ss_conf             331028899999999981601224245999999998877899882--674267899999997015537999999995471


Q ss_pred             CCEEEECCCCHHHHHHHHHCCCCCC
Q ss_conf             9631312431466657651975457
Q gi|254780479|r  140 NHFAVVVDTNIERIISRYFDIIKPA  164 (356)
Q Consensus       140 ~~~~~~vD~Nv~RVl~R~~~~~~~~  164 (356)
                       ...+++|-|+.|-+.|++.++..+
T Consensus       144 -~D~AIlDrHIlr~l~r~g~i~e~~  167 (210)
T COG1059         144 -EDLAILDRHILRWLVRYGLIDENP  167 (210)
T ss_pred             -HHHHHHHHHHHHHHHHHCCCCCCC
T ss_conf             -689999999999999836523474


No 88 
>COG2816 NPY1 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair]
Probab=92.26  E-value=1.1  Score=25.38  Aligned_cols=152  Identities=15%  Similarity=0.145  Sum_probs=87.0

Q ss_pred             HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCC---------CCCCHHHHHCCCCCCCCCCCCCCCCCCEEE
Q ss_conf             99874102578522678999999998632041235643110---------000013431011001234432222221123
Q gi|254780479|r  170 TIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCP---------IQKNCLTFSEGKSHLLGINTIKKKRPMRTG  240 (356)
Q Consensus       170 ~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cp---------l~~~C~~~~~~~~~~~P~kk~KKkk~~r~~  240 (356)
                      .++.+... ++....+.++||.+=+   .=..+..-|..|-         .+..|..  .| -..||+..|-       .
T Consensus        82 ~lR~l~~~-~~~~~~~~~~~a~~l~---~w~~~~RFCg~CG~~~~~~~~g~~~~C~~--cg-~~~fPR~dP~-------v  147 (279)
T COG2816          82 DLRSLLTE-LDEGLFGLAARAVQLL---EWYRSHRFCGRCGTKTYPREGGWARVCPK--CG-HEHFPRIDPC-------V  147 (279)
T ss_pred             EHHHHHCC-CCHHHHHHHHHHHHHH---HHHHHCCCCCCCCCCCCCCCCCEEEECCC--CC-CCCCCCCCCE-------E
T ss_conf             18877364-9878989999999999---88863767777898074356732455798--88-7007988976-------9


Q ss_pred             EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEE-----EEEEEEEEEE
Q ss_conf             20003413760577645720132047442111134672158886301010000213735899831-----5999999999
Q gi|254780479|r  241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTH-----FTLTLFVWKT  315 (356)
Q Consensus       241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH-----~~L~i~v~~~  315 (356)
                      .+  ++.+++++||-+++..  +.|||..-.+-++.++..+.....+..++....++.+.+.-|-     ..|-|-++-.
T Consensus       148 Iv--~v~~~~~ilLa~~~~h--~~g~yS~LAGFVE~GETlE~AV~REv~EE~Gi~V~~vrY~~SQPWPfP~SLMigf~ae  223 (279)
T COG2816         148 IV--AVIRGDEILLARHPRH--FPGMYSLLAGFVEPGETLEQAVAREVFEEVGIKVKNVRYVGSQPWPFPHSLMLGFMAE  223 (279)
T ss_pred             EE--EEECCCCEEECCCCCC--CCCCEEEEEECCCCCCCHHHHHHHHHHHHHCEEEEEEEEEECCCCCCCHHHHHHHEEE
T ss_conf             99--9961772000689887--7853000133045785499999999888647177656687326899953465240255


Q ss_pred             EC-----CCCCCCCCCEEECHHH-HHHCCC
Q ss_conf             80-----7767667813322788-874799
Q gi|254780479|r  316 IV-----PQIVIIPDSTWHDAQN-LANAAL  339 (356)
Q Consensus       316 ~~-----~~~~~~~~~~Wv~~~e-l~~~~L  339 (356)
                      ..     .+..+..+.+|++.+| +..++=
T Consensus       224 y~sgeI~~d~~Eleda~WFs~~evl~~L~~  253 (279)
T COG2816         224 YDSGEITPDEGELEDARWFSRDEVLPALPP  253 (279)
T ss_pred             ECCCCCCCCCCHHHHCCCCCHHHHHHHCCC
T ss_conf             426664678221334113477777762688


No 89 
>PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional
Probab=91.64  E-value=0.078  Score=33.44  Aligned_cols=76  Identities=24%  Similarity=0.363  Sum_probs=39.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHH-HHCCCE-EEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCC-------CHHHH
Q ss_conf             45688888777877667766-434963-1312431466657651975457356899987410257852-------26789
Q gi|254780479|r  117 VEILKKLPGIGDYTASAIVA-IAFNHF-AVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTS-------RPGDF  187 (356)
Q Consensus       117 ~~~l~~LpGiG~yta~ai~s-~a~~~~-~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~-------~~~~~  187 (356)
                      ++.|++..||||.||=+|+| +.-++- .++..+++.+ |+++=|+-.  +.+++.+-++.+++....       ...+.
T Consensus        71 F~~LisVsGIGpk~Al~iLs~~~~~~l~~aI~~~D~~~-L~~vpGIG~--KtA~rIi~ELk~Kl~~~~~~~~~~~~~~e~  147 (196)
T PRK13901         71 FEELIGVDGIGPRAALRVLSGIKYNEFRDAIDREDIEL-ISKVKGIGN--KMAGKIFLKLRGKLVKNDELESSLFKFKEL  147 (196)
T ss_pred             HHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHCCHHH-HHHCCCCCH--HHHHHHHHHHHHHHCCCCCCCCCCCCHHHH
T ss_conf             99987658826899999975799999999999289999-831999589--999999999976531566556553448999


Q ss_pred             HHHHHHHH
Q ss_conf             99999998
Q gi|254780479|r  188 VQAMMDLG  195 (356)
Q Consensus       188 nQAlMdlG  195 (356)
                      -+||+-||
T Consensus       148 ~~AL~~LG  155 (196)
T PRK13901        148 EQSIVNMG  155 (196)
T ss_pred             HHHHHHCC
T ss_conf             99999849


No 90 
>COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]
Probab=91.02  E-value=0.093  Score=32.90  Aligned_cols=24  Identities=42%  Similarity=0.584  Sum_probs=20.8

Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             314568888877787766776643
Q gi|254780479|r  115 HKVEILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       115 ~~~~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .|.++|..|||||++.|.||...-
T Consensus        94 As~eeL~~lpgIG~~kA~aIi~yR  117 (149)
T COG1555          94 ASAEELQALPGIGPKKAQAIIDYR  117 (149)
T ss_pred             CCHHHHHHCCCCCHHHHHHHHHHH
T ss_conf             089999886798999999999999


No 91 
>TIGR02705 nudix_YtkD nucleoside triphosphatase YtkD; InterPro: IPR014078   The proteins in this entry belong to the nudix family and share some sequence identity with Escherichia coli MutT but appear not to be functionally interchangeable with it. The functional assignment of the proteins in this family is contentious. Reference  challenges the findings of reference , both in interpretation and in enzyme assay results..
Probab=90.94  E-value=0.36  Score=28.76  Aligned_cols=78  Identities=12%  Similarity=0.124  Sum_probs=51.7

Q ss_pred             EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCE------EEEEEE-EEEEEEE
Q ss_conf             320003413760577645720132047442111134672158886301010000213735------899831-5999999
Q gi|254780479|r  240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTI------THTFTH-FTLTLFV  312 (356)
Q Consensus       240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~i------kH~fTH-~~L~i~v  312 (356)
                      ..|+||=+.+|+.||+.=+.-|+     |||+++.+.++....+..++..++.+...+.+      .=.-.| -.+-=+|
T Consensus        26 ~HVlvip~~~~qWLlT~H~~RGl-----EFPGGKvE~GEt~~eAA~REv~EETGA~v~~l~Y~gQY~Ve~~~~~~f~K~v  100 (158)
T TIGR02705        26 NHVLVIPRYKDQWLLTEHKRRGL-----EFPGGKVEEGETSKEAAIREVYEETGAIVKELHYIGQYEVEGEDDTDFVKDV  100 (158)
T ss_pred             CCEEEEEEECCEEEECCCCCCCC-----CCCCCEECCCCCHHHHHHEECCCCCCEEEEEEEEEEEEEEECCCCCEEEEEE
T ss_conf             95689755689334403587870-----4788800799884676510010035807546888876787589996657776


Q ss_pred             EEEECCCCCC
Q ss_conf             9998077676
Q gi|254780479|r  313 WKTIVPQIVI  322 (356)
Q Consensus       313 ~~~~~~~~~~  322 (356)
                      |.+.+.....
T Consensus       101 Yfa~V~~le~  110 (158)
T TIGR02705       101 YFAEVSALES  110 (158)
T ss_pred             EEEEEEEEEE
T ss_conf             6889987324


No 92 
>cd03424 ADPRase_NUDT5 ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate. Like other members of the Nudix hydrolase superfamily, it requires a divalent cation, such as Mg2+, for its activity. It also contains a highly conserved 23-residue Nudix motif (GX5EX7REUXEEXGU, where U = I, L or V) which functions as a metal binding site/catalytic site. In addition to the Nudix motif, there are additional conserved amino acid residues, distal from the signature sequence, that correlate with substrate specificity. In humans, there are four distinct ADPRase activities, three putative cytosolic enzymes (ADPRase-I, -II, and -Mn) and a single mitochondrial enzyme (ADPRase-m). Human ADPRase-II is also referred to as NUDT5. It lacks the N-terminal target sequence unique to mitochondrial ADPRase. The different cytosolic types are distinguished by their specificities for substrate and specific requirem
Probab=90.49  E-value=1.1  Score=25.42  Aligned_cols=93  Identities=8%  Similarity=0.012  Sum_probs=52.2

Q ss_pred             EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCC------CCCCCEECCCEEE--EEEEEEEEEEEEE
Q ss_conf             0034137605776457201320474421111346721588863010------1000021373589--9831599999999
Q gi|254780479|r  243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAP------FTANWILCNTITH--TFTHFTLTLFVWK  314 (356)
Q Consensus       243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~------~~~~~~~l~~ikH--~fTH~~L~i~v~~  314 (356)
                      .+.+..+++++|.+.=--++=.-+||||.+..+.+++......++.      ....+..++++.-  .+|.-  .+++|.
T Consensus         7 il~~~~~~~i~lv~QyR~~~~~~~~E~PaG~id~gE~~~~aA~REL~EETG~~~~~~~~l~~~~~~pg~~~~--~~~~~~   84 (137)
T cd03424           7 VLPYDDDGKVVLVRQYRPPVGGWLLELPAGLIDPGEDPEEAARRELEEETGYEAGDLEKLGSFYPSPGFSDE--RIHLFL   84 (137)
T ss_pred             EEEECCCCEEEEEEEEECCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCC--EEEEEE
T ss_conf             999869998999981442899838981554688997999999999741206513867893362588823371--799999


Q ss_pred             EECCCCC--------CCCCCEEECHHHHHHC
Q ss_conf             9807767--------6678133227888747
Q gi|254780479|r  315 TIVPQIV--------IIPDSTWHDAQNLANA  337 (356)
Q Consensus       315 ~~~~~~~--------~~~~~~Wv~~~el~~~  337 (356)
                      +......        ..-+..|++.+++.+.
T Consensus        85 a~~~~~~~~~~~d~~E~i~v~~~~~~e~~~~  115 (137)
T cd03424          85 AEDLSPGEEGLLDEGEDIEVVLVPLDEALEL  115 (137)
T ss_pred             EEECCCCCCCCCCCCCCEEEEEECHHHHHHH
T ss_conf             9944045567888775268999669999999


No 93 
>KOG3069 consensus
Probab=90.31  E-value=0.77  Score=26.42  Aligned_cols=88  Identities=15%  Similarity=0.152  Sum_probs=49.3

Q ss_pred             CCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCHHH-------HHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCC
Q ss_conf             760577645720-1320474421111346721588-------86301010000213735899831599999999980776
Q gi|254780479|r  249 DNRILLRKRTNT-RLLEGMDELPGSAWSSTKDGNI-------DTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQI  320 (356)
Q Consensus       249 ~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~~~-------~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~  320 (356)
                      +-++||+||... .-+.|--.||++..+.......       .++..........++...--++...+.+..+.+.+...
T Consensus        57 ~l~vLltkRSr~LrshsGev~fPGG~~d~~D~s~~~tAlREt~EEIGl~~~~~~~~g~l~~~~~r~~~~v~p~v~~l~~~  136 (246)
T KOG3069          57 ELSVLLTKRSRTLRSHSGEVCFPGGRRDPHDKSDIQTALRETEEEIGLDPELVDVLGALPPFVLRSGWSVFPVVGFLSDK  136 (246)
T ss_pred             CEEEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCEEECCCCCCCEEEEEEECC
T ss_conf             56899983356333568715578876783445048889887898739998882564215440452676313068997314


Q ss_pred             ----------CCCCCCEEECHHHHHH
Q ss_conf             ----------7667813322788874
Q gi|254780479|r  321 ----------VIIPDSTWHDAQNLAN  336 (356)
Q Consensus       321 ----------~~~~~~~Wv~~~el~~  336 (356)
                                .+.....||+++++-.
T Consensus       137 ~~l~~~~ln~gEv~~~F~VPL~~ll~  162 (246)
T KOG3069         137 KILPSLRLNSGEVESAFWVPLTDLLL  162 (246)
T ss_pred             CCCCCCCCCCHHEEEEEEEEHHHHHH
T ss_conf             33421358802402332201899732


No 94 
>TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679   DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC).   This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair.
Probab=90.20  E-value=0.46  Score=28.00  Aligned_cols=22  Identities=27%  Similarity=0.374  Sum_probs=16.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             1456888887778776677664
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      +..+|++++|||+-+|.+|..+
T Consensus       562 ~~s~L~~~~g~G~~vA~~~~~~  583 (706)
T TIGR00575       562 DRSELLSVEGVGPKVAESIVNF  583 (706)
T ss_pred             CCHHHHHCCCCHHHHHHHHHHH
T ss_conf             6055641014027899999998


No 95 
>PRK00024 radC DNA repair protein RadC; Reviewed
Probab=90.00  E-value=0.11  Score=32.40  Aligned_cols=70  Identities=21%  Similarity=0.364  Sum_probs=49.2

Q ss_pred             HHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             99877489978987299899999974046345889998776754300578---687314568888877787766776643
Q gi|254780479|r   62 KKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        62 ~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g---~~P~~~~~l~~LpGiG~yta~ai~s~a  138 (356)
                      +||++.  -+.+|+   ..|+|.+-  |||..+.++-...|+.+.++|||   -+=.+.++|.+.+|||+-.|..|.++.
T Consensus        14 ER~l~~--G~~~Ls---D~ELLall--L~~g~~~~d~~~lA~~ll~~~g~l~~l~~a~~~eL~~i~GiG~~kA~~l~a~~   86 (224)
T PRK00024         14 ERLLRR--GAAALS---DAELLAIL--LRTGTKGKSVLDLARELLERFGSLRGLLDASLEELQEIKGIGPAKAAQLKAAL   86 (224)
T ss_pred             HHHHHH--CCCCCC---HHHHHHHH--HHCCCCCCCHHHHHHHHHHHCCCHHHHHHCCHHHHHCCCCCCHHHHHHHHHHH
T ss_conf             999860--932077---79999999--84699999989999999998599999987088988447898899999999999


No 96 
>COG1415 Uncharacterized conserved protein [Function unknown]
Probab=88.58  E-value=0.21  Score=30.37  Aligned_cols=35  Identities=29%  Similarity=0.413  Sum_probs=25.1

Q ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHH---HCCCEEEECC
Q ss_conf             8731456888887778776677664---3496313124
Q gi|254780479|r  113 FPHKVEILKKLPGIGDYTASAIVAI---AFNHFAVVVD  147 (356)
Q Consensus       113 ~P~~~~~l~~LpGiG~yta~ai~s~---a~~~~~~~vD  147 (356)
                      =|.|+++|+..||||+.|..|.+-+   .||.|.---|
T Consensus       273 ~p~Df~elLl~~GiGpstvRALalVAEvIyg~~~s~rD  310 (373)
T COG1415         273 NPDDFEELLLVPGIGPSTVRALALVAEVIYGEPPSFRD  310 (373)
T ss_pred             CCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCC
T ss_conf             83249998740687889999999999998089988677


No 97 
>TIGR01259 comE comEA protein; InterPro: IPR004787   The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable .   ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) ..
Probab=88.58  E-value=0.15  Score=31.45  Aligned_cols=48  Identities=33%  Similarity=0.569  Sum_probs=19.7

Q ss_pred             HCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH
Q ss_conf             72998999999740463458899987767543005786873145688888777877
Q gi|254780479|r   75 SSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYT  130 (356)
Q Consensus        75 a~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yt  130 (356)
                      =.|+++| |+...|+| -+||.-+-.     -.+-+|+| .+.|||.+.-|||+.|
T Consensus        67 N~As~~E-L~~l~GiG-P~kA~aIi~-----YRe~nG~F-~SvddL~kVsGIG~k~  114 (124)
T TIGR01259        67 NKASLEE-LQALPGIG-PAKAKAIIE-----YREENGAF-KSVDDLTKVSGIGEKS  114 (124)
T ss_pred             HHHHHHH-HHHCCCCC-CHHHHHHHH-----HHHHCCCC-CCHHHHHCCCCCCHHH
T ss_conf             0678999-86369998-133799999-----99856997-7755500357885466


No 98 
>COG1443 Idi Isopentenyldiphosphate isomerase [Lipid metabolism]
Probab=87.15  E-value=1.4  Score=24.62  Aligned_cols=99  Identities=13%  Similarity=0.127  Sum_probs=49.5

Q ss_pred             CEEEEEEEEEECCCEEEEEECCCC-HHHHCCCCCCCC----CCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEE---
Q ss_conf             112320003413760577645720-132047442111----134672158886301010000213735899831599---
Q gi|254780479|r  237 MRTGAVFIAITNDNRILLRKRTNT-RLLEGMDELPGS----AWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTL---  308 (356)
Q Consensus       237 ~r~~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~~----e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L---  308 (356)
                      .+.+.+| +.+.+|++||+||... ..|.|.|.=-..    ..+..+..........+...... ...-+.+++|.-   
T Consensus        33 HrAFS~~-lFne~g~LLltrRA~~K~twP~vWTNSvCsHP~~~es~~~A~~rRl~~ELGie~~~-~d~~~il~rf~YrA~  110 (185)
T COG1443          33 HRAFSSF-LFNERGQLLLTRRALSKKTWPGVWTNSVCSHPLPGESNEDAARRRLAYELGIEPDQ-YDKLEILPRFRYRAA  110 (185)
T ss_pred             HHHHHEE-EECCCCCEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC-CCCCCCCCCEEEECC
T ss_conf             7431046-86788866310144532557663436542798688741889999889983998754-474021465178635


Q ss_pred             --------EEE-EEEEECC-----CCCCCCCCEEECHHHHHHC
Q ss_conf             --------999-9999807-----7676678133227888747
Q gi|254780479|r  309 --------TLF-VWKTIVP-----QIVIIPDSTWHDAQNLANA  337 (356)
Q Consensus       309 --------~i~-v~~~~~~-----~~~~~~~~~Wv~~~el~~~  337 (356)
                              +|. |+.+.+.     ++.+..+++|++.+++...
T Consensus       111 ~~~~~~E~Eic~V~~~~~~~~~~~npdEV~~~~wv~~e~l~~~  153 (185)
T COG1443         111 DPDGIVENEICPVLAARLDSALDPNPDEVMDYRWVSPEDLKEM  153 (185)
T ss_pred             CCCCCCEEEEEEEEEEEECCCCCCCHHHHHHEECCCHHHHHHH
T ss_conf             8987530024138888613778799688624112689999875


No 99 
>TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085   In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination.
Probab=86.31  E-value=0.23  Score=30.12  Aligned_cols=23  Identities=39%  Similarity=0.729  Sum_probs=15.8

Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             31456888887778776677664
Q gi|254780479|r  115 HKVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus       115 ~~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      +++..|.++||||..||.+++|.
T Consensus       115 ~~~~~L~k~pGvGKK~A~~l~~l  137 (217)
T TIGR00084       115 EEVKALVKIPGVGKKTAERLLAL  137 (217)
T ss_pred             HHHHHHHCCCCCCHHHHHHHHHH
T ss_conf             10444204588573789999877


No 100
>TIGR00426 TIGR00426 competence protein ComEA helix-hairpin-helix repeat region; InterPro: IPR004509   This domain is found in competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This domain represents a region of two tandem copies of a helix-hairpin-helix domain, each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to this domain..
Probab=85.11  E-value=0.64  Score=27.00  Aligned_cols=50  Identities=40%  Similarity=0.599  Sum_probs=36.6

Q ss_pred             HHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHH--HCCCCCCCCHHHHHHHHHHHHHHHHH
Q ss_conf             872998999999740463458899987767543--00578687314568888877787766
Q gi|254780479|r   74 LSSAKDEEILSAWAGLGYYTRARNLKKCADIIV--KKYEGNFPHKVEILKKLPGIGDYTAS  132 (356)
Q Consensus        74 la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~--~~~~g~~P~~~~~l~~LpGiG~yta~  132 (356)
                      .=.|+.+|+-+...|.|        .+=|+-|+  .+-.|.| .+.|+|.+.||||..|--
T Consensus        11 INtAtaeElq~~~~GvG--------~kKAeAIv~YREe~G~F-~t~Edl~~V~GiG~~~~E   62 (70)
T TIGR00426        11 INTATAEELQKALSGVG--------AKKAEAIVAYREEYGRF-KTVEDLKKVSGIGEKLLE   62 (70)
T ss_pred             CCHHCHHHHHHHHCCCC--------HHHHHHHHHHHHCCCCC-CCHHHHHHCCCCCHHHHH
T ss_conf             01104788887642887--------23789998875327795-762223214787624555


No 101
>TIGR00615 recR recombination protein RecR; InterPro: IPR000093 The bacterial protein recR seems to play a role in a recombinational process of DNA repair . It may act with recF and recO. RecR is a protein of about 200 amino acid residues. This protein contains a putative C4-type zinc finger in the N-terminal section.; GO: 0006281 DNA repair, 0006310 DNA recombination.
Probab=84.86  E-value=0.3  Score=29.34  Aligned_cols=21  Identities=48%  Similarity=0.733  Sum_probs=16.7

Q ss_pred             CHHHHHHHHHHHHHHHHHHHH
Q ss_conf             314568888877787766776
Q gi|254780479|r  115 HKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus       115 ~~~~~l~~LpGiG~yta~ai~  135 (356)
                      +=.|.|.+|||||+.||.=++
T Consensus         9 ~Lie~L~kLPgiG~KsA~RlA   29 (205)
T TIGR00615         9 KLIESLKKLPGIGPKSAQRLA   29 (205)
T ss_pred             HHHHHHHHCCCCCHHHHHHHH
T ss_conf             999986407898714789999


No 102
>PRK13766 Hef nuclease; Provisional
Probab=84.74  E-value=1.6  Score=24.27  Aligned_cols=30  Identities=20%  Similarity=0.352  Sum_probs=16.0

Q ss_pred             EEEEEEEEEEC----CCEEEEEECCCCHHHHCCC
Q ss_conf             12320003413----7605776457201320474
Q gi|254780479|r  238 RTGAVFIAITN----DNRILLRKRTNTRLLEGMD  267 (356)
Q Consensus       238 r~~~~~vii~~----~~kiLL~KRp~~gll~GLw  267 (356)
                      +.+.+++++.+    ++.++..+|.++.+...|-
T Consensus       467 r~G~v~vLi~~gt~de~~~~~~~~kek~m~~~l~  500 (764)
T PRK13766        467 RGGRVVVLIAKGTRDEAYYWSSRRKEKKMKEILK  500 (764)
T ss_pred             CCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHH
T ss_conf             8974999995686267788851120689999999


No 103
>PRK11762 nudE ADP-ribose diphosphatase NudE; Provisional
Probab=84.35  E-value=3.6  Score=21.69  Aligned_cols=92  Identities=8%  Similarity=0.024  Sum_probs=51.0

Q ss_pred             EEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHC------CCCCCCCEECCCEEE--EEEEEEEEEEEEEE
Q ss_conf             0341376057764572013204744211113467215888630------101000021373589--98315999999999
Q gi|254780479|r  244 IAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHS------APFTANWILCNTITH--TFTHFTLTLFVWKT  315 (356)
Q Consensus       244 vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~------~~~~~~~~~l~~ikH--~fTH~~L~i~v~~~  315 (356)
                      +-+..+++++|.+.=-.++-.=.||||.+..+.+++.......      ......+..++.+--  .||.-  .+++|.+
T Consensus        53 vpi~~~~~vvLVrQyR~~~~~~~lEiPaG~id~gE~p~~aA~REL~EETGy~a~~~~~l~~~~~spg~~~e--~~~~flA  130 (185)
T PRK11762         53 VPILDDDTLLLIREYAAGTESYELGFPKGLIDPGETPEEAANRELKEEVGFGARQLTFLKELSLAPSYFSS--KMNIVLA  130 (185)
T ss_pred             EEECCCCEEEEEECCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCC--EEEEEEE
T ss_conf             99759997999993368888879982430589996989999999889769774768997599837731153--2999998


Q ss_pred             EC--CCCCCC-----CCCEEECHHHHHHC
Q ss_conf             80--776766-----78133227888747
Q gi|254780479|r  316 IV--PQIVII-----PDSTWHDAQNLANA  337 (356)
Q Consensus       316 ~~--~~~~~~-----~~~~Wv~~~el~~~  337 (356)
                      .-  +.....     -+..+++++++.+.
T Consensus       131 ~~l~~~~~~~De~E~iev~~~~~~e~~~m  159 (185)
T PRK11762        131 QDLYPERLEGDEPEPLEVVRWPLADLDEL  159 (185)
T ss_pred             CCCCCCCCCCCCCCCEEEEEECHHHHHHH
T ss_conf             58605667999984149999769999999


No 104
>PRK13266 Thf1-like protein; Reviewed
Probab=83.98  E-value=3.2  Score=22.07  Aligned_cols=127  Identities=17%  Similarity=0.244  Sum_probs=79.3

Q ss_pred             HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------C-------HHHCCHHHHHHHHHCCCHH
Q ss_conf             89999999999962899887548554445678704676644673116------1-------2033265899877489978
Q gi|254780479|r    6 HIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------T-------VKTVEPYFKKFMQKWPTIF   72 (356)
Q Consensus         6 ~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~-------v~~v~~~~~~~~~~~P~~~   72 (356)
                      ..+++.=-..|..+.|..|             .-|+-.|-|.|-++-      .       .--+..-|.+||+-||--+
T Consensus         5 ~TVsDsKr~F~~~~p~pI~-------------~lYrrvvdELLVElHLL~~n~~F~yD~lFAlGlvt~Fd~fm~GY~Pee   71 (224)
T PRK13266          5 RTVSDSKRAFHAAFPRVIP-------------SLYRRVVDELLVELHLLSVQSDFKYDPLFALGLVTVFDRFMQGYRPEE   71 (224)
T ss_pred             CCHHHHHHHHHHHCCCCCC-------------HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHH
T ss_conf             1148879999985898882-------------899999999999999987414664273778449999999976799825


Q ss_pred             HHHCCCHHHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHH
Q ss_conf             987299899999-9740463458899987767543005786873145688888777877667766434963131243146
Q gi|254780479|r   73 CLSSAKDEEILS-AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIE  151 (356)
Q Consensus        73 ~la~a~~~~vl~-~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~  151 (356)
                      +..     .+.. +.+.+|+  -+..+++-|+.+.+...|.=+.+..++++-.|-|.-...+++.-.-        +|=.
T Consensus        72 ~~~-----~IF~Alc~s~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~~~~~~~~~l~~~~~~~~--------~~~~  136 (224)
T PRK13266         72 HKD-----ALFEALCQAVGF--DPEQLREDAEQLLELAKGKSLDEILSWLTQKGGGANELLATLQAIA--------NNSK  136 (224)
T ss_pred             HHH-----HHHHHHHHHCCC--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHH--------CCCC
T ss_conf             699-----999999984489--9999999999999998758999999999735455506889999874--------2787


Q ss_pred             HHHHHHHCC
Q ss_conf             665765197
Q gi|254780479|r  152 RIISRYFDI  160 (356)
Q Consensus       152 RVl~R~~~~  160 (356)
                      =--||+++|
T Consensus       137 f~YSRl~AI  145 (224)
T PRK13266        137 FKYSRLFAI  145 (224)
T ss_pred             CHHHHHHHH
T ss_conf             328999999


No 105
>TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085   In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination.
Probab=83.80  E-value=0.48  Score=27.87  Aligned_cols=59  Identities=17%  Similarity=0.272  Sum_probs=41.2

Q ss_pred             HHHHHHHHHHHHHHHHHHH-HHHCCCEEEECC-CCHHHHHHHHHCCCCCCCHHHHHH-HHHHHCC
Q ss_conf             4568888877787766776-643496313124-314666576519754573568999-8741025
Q gi|254780479|r  117 VEILKKLPGIGDYTASAIV-AIAFNHFAVVVD-TNIERIISRYFDIIKPAPLYHKTI-KNYARKI  178 (356)
Q Consensus       117 ~~~l~~LpGiG~yta~ai~-s~a~~~~~~~vD-~Nv~RVl~R~~~~~~~~~~~~k~l-~~~~~~~  178 (356)
                      +.+|+.+-|||+.+|=||+ ++..+.-.-+|+ .||.+ |+.+-|+-.  +.+++.+ =++..++
T Consensus        82 F~~Li~~nGvGpk~ALaiL~~~~~~~~~~ai~~~~~~~-L~k~pGvGK--K~A~~l~~leL~gk~  143 (217)
T TIGR00084        82 FKELIKVNGVGPKLALAILSNMSPEEFVQAIETEEVKA-LVKIPGVGK--KTAERLLALELKGKL  143 (217)
T ss_pred             HHHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHCCCCCCH--HHHHHHHHHHHHHHH
T ss_conf             99985148802899999866788758988886410444-204588573--789999877754544


No 106
>TIGR00593 pola DNA polymerase I; InterPro: IPR002298   DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication.
Probab=83.52  E-value=0.46  Score=27.99  Aligned_cols=45  Identities=31%  Similarity=0.561  Sum_probs=34.6

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECC
Q ss_conf             76754300578687314568888-----------877787766776643496313124
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVD  147 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD  147 (356)
                      ++..|+++| |..|.-.-++.+|           ||||++||...+ -.||-=.-+.+
T Consensus       165 t~e~V~eKy-Gv~P~Q~~D~~AL~GD~SDNiPGV~GIGeKTA~kLL-~~fgsLe~iy~  220 (1005)
T TIGR00593       165 TPEYVVEKY-GVTPSQLVDLKALVGDSSDNIPGVKGIGEKTAAKLL-QEFGSLENIYE  220 (1005)
T ss_pred             CHHHHHHHC-CCCHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH-HHHHHHHHHHH
T ss_conf             778987541-867457887620468888785998973765689999-87210899998


No 107
>PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed
Probab=82.05  E-value=0.65  Score=26.92  Aligned_cols=50  Identities=26%  Similarity=0.371  Sum_probs=24.9

Q ss_pred             HHHHHCCCHHHHHHHHHH--HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             999740463458899987--767543005786873145688888777877667766
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKK--CADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVA  136 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~--~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s  136 (356)
                      |.-.+|.|. .-|-++..  ....+.+-   ..=.|.+.|.++||||+.||.-|..
T Consensus        75 Li~V~GIGp-K~AL~ILs~~~~~~l~~a---I~~~D~~~L~~vpGIG~KtA~rIi~  126 (198)
T PRK00116         75 LISVSGVGP-KLALAILSGLSPEELAQA---IANGDIKALTKVPGVGKKTAERIVL  126 (198)
T ss_pred             HHCCCCCCH-HHHHHHHCCCCHHHHHHH---HHHCCHHHHCCCCCCCHHHHHHHHH
T ss_conf             856688578-999988702999999999---9858999970688978899999999


No 108
>pfam11731 Cdd1 Pathogenicity locus. Cdd1 is expressed as part of the pathogenicity locus operon in several different orders of bacteria. Many members of the family are annotated as being putative mitomycin resistance proteins but this could not be confirmed.
Probab=81.70  E-value=0.61  Score=27.14  Aligned_cols=42  Identities=24%  Similarity=0.324  Sum_probs=33.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH
Q ss_conf             145688888777877667766434963131243146665765
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY  157 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~  157 (356)
                      ...+|..|||||+-||.-...+++..+.-....|-.-...|+
T Consensus        10 ~l~~L~~lPnIG~a~a~DL~~LGi~~~~~L~g~dp~elY~~l   51 (92)
T pfam11731        10 ALKELTDLPNIGKATAKDLRLLGINSPAQLAGRDPLELYERL   51 (92)
T ss_pred             HHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHCCCHHHHHHHH
T ss_conf             999874189746999999999189989999179999999999


No 109
>PRK00254 ski2-like helicase; Provisional
Probab=81.55  E-value=1.7  Score=23.99  Aligned_cols=20  Identities=10%  Similarity=0.205  Sum_probs=11.7

Q ss_pred             HHHHHHHHHHHCC-CC-CCCCC
Q ss_conf             9999999999628-99-88754
Q gi|254780479|r    8 IQSKILDWYDTNH-RV-LPWRT   27 (356)
Q Consensus         8 ~~~~ll~w~~~~~-R~-lpwr~   27 (356)
                      +.+++.++|+..| +. +||+.
T Consensus         8 ~~~~~~~~~~~~gI~~l~p~Q~   29 (717)
T PRK00254          8 VDERIKEILKERGIEELYPPQA   29 (717)
T ss_pred             CCHHHHHHHHHCCCCCCCHHHH
T ss_conf             9989999999769872689999


No 110
>PRK08097 ligB NAD-dependent DNA ligase LigB; Reviewed
Probab=81.55  E-value=0.68  Score=26.78  Aligned_cols=20  Identities=15%  Similarity=0.325  Sum_probs=9.2

Q ss_pred             HHCCCHHHHHCCCHHHHHHHH
Q ss_conf             748997898729989999997
Q gi|254780479|r   66 QKWPTIFCLSSAKDEEILSAW   86 (356)
Q Consensus        66 ~~~P~~~~la~a~~~~vl~~w   86 (356)
                      ...|- -||.++-.++-+..|
T Consensus        95 H~~pM-lSL~n~~~~~el~~~  114 (563)
T PRK08097         95 HPVAH-TGVRKLADKQALARW  114 (563)
T ss_pred             CCCCC-CCHHHCCCHHHHHHH
T ss_conf             89876-471111898999999


No 111
>TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29. Psp29, originally designated sll1414 in Synechocystis 6803, is found universally in Cyanobacteria and in Arabidopsis. It was isolated and partially sequenced from purified photosystem II (PS II) in Synechocystis. While its function is unknown, mutant studies show an impairment in photosystem II biogenesis and/or stability, rather than in PS II core function.
Probab=81.35  E-value=3.7  Score=21.59  Aligned_cols=126  Identities=15%  Similarity=0.247  Sum_probs=77.0

Q ss_pred             HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------C-------HHHCCHHHHHHHHHCCCHH
Q ss_conf             89999999999962899887548554445678704676644673116------1-------2033265899877489978
Q gi|254780479|r    6 HIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------T-------VKTVEPYFKKFMQKWPTIF   72 (356)
Q Consensus         6 ~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~-------v~~v~~~~~~~~~~~P~~~   72 (356)
                      ..+++.=-..|..+.|..|             .-|+-.|-|.|-++-      .       .--+..-|.+||+-||--+
T Consensus         5 ~TVsDsKr~F~~~~p~pI~-------------~lYrrvvdELLVElHLl~~n~~F~yD~lfAlGlvt~Fd~fm~GY~Pee   71 (214)
T TIGR03060         5 RTVSDSKRAFHAAFPRVIP-------------PLYRRVVDELLVELHLLSHQSDFKYDPLFALGLVTVFDRFMEGYRPEE   71 (214)
T ss_pred             CCHHHHHHHHHHHCCCCCC-------------HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHH
T ss_conf             1148879999985898881-------------899999999999999988503654373678459999999976799825


Q ss_pred             HHHCCCHHHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHH
Q ss_conf             987299899999-9740463458899987767543005786873145688888777877667766434963131243146
Q gi|254780479|r   73 CLSSAKDEEILS-AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIE  151 (356)
Q Consensus        73 ~la~a~~~~vl~-~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~  151 (356)
                      +..     .+.. +.+.+|.  -+..+++-|+.+.+...|.=+.+..++++=.|-|.-...++.+++         +|-.
T Consensus        72 ~~~-----~IF~Alc~s~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~~~~~~~~~~~~l~~~a---------~n~k  135 (214)
T TIGR03060        72 HLD-----ALFDALCNSNGF--DPEQLREDAKQLLEQAKGKGLDEILSWLTQANLSNGGGDTLQGIA---------GRHK  135 (214)
T ss_pred             HHH-----HHHHHHHHHCCC--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCHHHHHHHHH---------CCCC
T ss_conf             699-----999999873489--999999999999999974899999999850475541167999874---------1787


Q ss_pred             HHHHHHHCC
Q ss_conf             665765197
Q gi|254780479|r  152 RIISRYFDI  160 (356)
Q Consensus       152 RVl~R~~~~  160 (356)
                      =--||+++|
T Consensus       136 f~YSRl~AI  144 (214)
T TIGR03060       136 FKYSRLFAI  144 (214)
T ss_pred             CCHHHHHHH
T ss_conf             408999999


No 112
>KOG2841 consensus
Probab=81.07  E-value=2.7  Score=22.59  Aligned_cols=21  Identities=14%  Similarity=0.058  Sum_probs=8.5

Q ss_pred             CHHHHHHCC--CCHHHHHHHHHC
Q ss_conf             278887479--987899999844
Q gi|254780479|r  330 DAQNLANAA--LPTVMKKALSAG  350 (356)
Q Consensus       330 ~~~el~~~~--LPs~~kKIL~al  350 (356)
                      +..+++..+  =|+-.+++.+.+
T Consensus       225 S~~ele~~~G~G~~kak~l~~~l  247 (254)
T KOG2841         225 SEGELEQCPGLGPAKAKRLHKFL  247 (254)
T ss_pred             CHHHHHHCCCCCHHHHHHHHHHH
T ss_conf             77679867573789999999998


No 113
>TIGR01259 comE comEA protein; InterPro: IPR004787   The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable .   ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) ..
Probab=80.99  E-value=0.85  Score=26.13  Aligned_cols=24  Identities=42%  Similarity=0.609  Sum_probs=21.6

Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             731456888887778776677664
Q gi|254780479|r  114 PHKVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus       114 P~~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      -.+.+||..||||||.-|.||.++
T Consensus        68 ~As~~EL~~l~GiGP~kA~aIi~Y   91 (124)
T TIGR01259        68 KASLEELQALPGIGPAKAKAIIEY   91 (124)
T ss_pred             HHHHHHHHHCCCCCCHHHHHHHHH
T ss_conf             678999863699981337999999


No 114
>PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed
Probab=80.17  E-value=0.58  Score=27.27  Aligned_cols=58  Identities=16%  Similarity=0.225  Sum_probs=37.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH-HCCC-EEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHC
Q ss_conf             456888887778776677664-3496-313124314666576519754573568999874102
Q gi|254780479|r  117 VEILKKLPGIGDYTASAIVAI-AFNH-FAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARK  177 (356)
Q Consensus       117 ~~~l~~LpGiG~yta~ai~s~-a~~~-~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~  177 (356)
                      ++.|++..||||.||=+|+|. ..++ ..++..+|+.. |+++-|+-.  +.+++.+-++..+
T Consensus        72 F~~Li~V~GIGpK~AL~ILs~~~~~~l~~aI~~~D~~~-L~~vpGIG~--KtA~rIi~ELk~K  131 (198)
T PRK00116         72 FRLLISVSGVGPKLALAILSGLSPEELAQAIANGDIKA-LTKVPGVGK--KTAERIVLELKDK  131 (198)
T ss_pred             HHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHCCHHH-HCCCCCCCH--HHHHHHHHHHHHH
T ss_conf             99985668857899998870299999999998589999-706889788--9999999999988


No 115
>COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair]
Probab=80.02  E-value=0.41  Score=28.34  Aligned_cols=44  Identities=23%  Similarity=0.283  Sum_probs=25.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH-HCCCEEEE-CCCCHHHHHHHHHCCC
Q ss_conf             456888887778776677664-34963131-2431466657651975
Q gi|254780479|r  117 VEILKKLPGIGDYTASAIVAI-AFNHFAVV-VDTNIERIISRYFDII  161 (356)
Q Consensus       117 ~~~l~~LpGiG~yta~ai~s~-a~~~~~~~-vD~Nv~RVl~R~~~~~  161 (356)
                      +..|++..||||.||=||+|- ..++=+-+ -.+|+.. |+|+=|+-
T Consensus        72 F~~LisVnGIGpK~ALaiLs~~~~~~l~~aI~~~d~~~-L~k~PGIG  117 (201)
T COG0632          72 FRLLISVNGIGPKLALAILSNLDPEELAQAIANEDVKA-LSKIPGIG  117 (201)
T ss_pred             HHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHCCHHH-HHCCCCCC
T ss_conf             99987118805899999984899999999998328676-44189877


No 116
>PRK07456 consensus
Probab=79.89  E-value=1  Score=25.52  Aligned_cols=34  Identities=32%  Similarity=0.403  Sum_probs=28.0

Q ss_pred             HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             6754300578687314568888-----------8777877667766
Q gi|254780479|r  102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ...|.++| |.-|+.+-++++|           ||||+.||...+.
T Consensus       179 ~~~V~ek~-GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~  223 (975)
T PRK07456        179 EAGVKEKL-GVAPEQVVDLKALTGDSSDNIPGVKGVGPKTAINLLK  223 (975)
T ss_pred             HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             99999986-9598998999997288656889999855999999999


No 117
>PRK00076 recR recombination protein RecR; Reviewed
Probab=79.82  E-value=0.77  Score=26.43  Aligned_cols=24  Identities=42%  Similarity=0.790  Sum_probs=16.7

Q ss_pred             CCCHH----HHHHHHHHHHHHHHHHHHH
Q ss_conf             87314----5688888777877667766
Q gi|254780479|r  113 FPHKV----EILKKLPGIGDYTASAIVA  136 (356)
Q Consensus       113 ~P~~~----~~l~~LpGiG~yta~ai~s  136 (356)
                      +|...    +.|.+|||||+.||--++-
T Consensus         2 ~p~~i~~LI~~l~kLPGIG~KsA~Rla~   29 (197)
T PRK00076          2 YPPPIEKLIEALRKLPGIGPKSAQRLAF   29 (197)
T ss_pred             CCHHHHHHHHHHHHCCCCCHHHHHHHHH
T ss_conf             8789999999998789998899999999


No 118
>PRK06887 consensus
Probab=79.39  E-value=1.1  Score=25.33  Aligned_cols=35  Identities=26%  Similarity=0.398  Sum_probs=28.4

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             76754300578687314568888-----------8777877667766
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ....|.++| |.-|+.+-++++|           ||||+.||...+.
T Consensus       161 d~~~V~ek~-GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~  206 (954)
T PRK06887        161 DREGVIEKY-GIPPELIIDYLALMGDSSDNIPGVAGVGEKTALGLLQ  206 (954)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCEEHHHHHHHHH
T ss_conf             799998987-9798999999997188524778878541699999999


No 119
>TIGR01816 sdhA_forward succinate dehydrogenase, flavoprotein subunit; InterPro: IPR011281   Succinate dehydrogenase and fumarate reductase are homologous enzymes reversible in principle but favoured under different circumstances. This entry represents a narrowly defined clade of the succinate dehydrogenase flavoprotein subunit as found in mitochondria, in Rickettsia, in Escherichia coli and other proteobacteria, and in a few other lineages. However, excluded are all known fumarate reductases. It also excludes putative succinate dehydrogenases that appear to diverged before the split between E. coli succinate dehydrogenase and fumarate reductase.; GO: 0016627 oxidoreductase activity acting on the CH-CH group of donors, 0050660 FAD binding, 0006099 tricarboxylic acid cycle.
Probab=79.01  E-value=0.43  Score=28.21  Aligned_cols=18  Identities=33%  Similarity=0.571  Sum_probs=15.4

Q ss_pred             HHHHHHC-CCHHHHHCCCH
Q ss_conf             9987748-99789872998
Q gi|254780479|r   62 KKFMQKW-PTIFCLSSAKD   79 (356)
Q Consensus        62 ~~~~~~~-P~~~~la~a~~   79 (356)
                      ||||+|| ||++|||.=|.
T Consensus       271 ERFMERYAP~~KDLAsRDv  289 (615)
T TIGR01816       271 ERFMERYAPTAKDLASRDV  289 (615)
T ss_pred             CCCHHCCCCCCCCCCCCCH
T ss_conf             5102001675100368636


No 120
>PRK08609 hypothetical protein; Provisional
Probab=78.96  E-value=3.1  Score=22.19  Aligned_cols=24  Identities=29%  Similarity=0.511  Sum_probs=13.4

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHHH
Q ss_conf             687314568888877787766776
Q gi|254780479|r  112 NFPHKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus       112 ~~P~~~~~l~~LpGiG~yta~ai~  135 (356)
                      ++|...-+|+.+|||||.+|..+-
T Consensus        82 ~~P~gl~eLl~IpGlGPKka~~L~  105 (570)
T PRK08609         82 EVPEGLLPLLKLPGLGGKKIAKLY  105 (570)
T ss_pred             HCCHHHHHHHCCCCCCHHHHHHHH
T ss_conf             487779999778987789999999


No 121
>PRK05929 consensus
Probab=78.30  E-value=1.2  Score=25.07  Aligned_cols=33  Identities=24%  Similarity=0.458  Sum_probs=27.3

Q ss_pred             HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             754300578687314568888-----------8777877667766
Q gi|254780479|r  103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ..|.++| |.-|+..-++++|           ||||+.||...+.
T Consensus       160 ~~V~ek~-Gv~P~qi~D~laL~GDsSDNIPGVpGIG~KTA~kLL~  203 (870)
T PRK05929        160 NEVIEQY-GVPPGQIADYLALVGDSSDNIPGVSGCGPKKAAALLK  203 (870)
T ss_pred             HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             9999997-9698999999997287646899999760999999998


No 122
>PRK07625 consensus
Probab=78.11  E-value=1.3  Score=24.81  Aligned_cols=35  Identities=23%  Similarity=0.241  Sum_probs=28.0

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             76754300578687314568888-----------8777877667766
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ....|.++| |.-|+.+-++++|           ||||+.||...+.
T Consensus       160 d~~~V~ek~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~  205 (922)
T PRK07625        160 DRDGVLAKF-GVPPERIVDYLSLIGDTVDNVPGVEKCGPKTAVKWLT  205 (922)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             899999987-9798999999997197635789999856798999999


No 123
>PRK01172 ski2-like helicase; Provisional
Probab=77.73  E-value=1.7  Score=24.08  Aligned_cols=45  Identities=11%  Similarity=0.059  Sum_probs=27.5

Q ss_pred             HHHHHHHHHHHCCCC-CCCCCCCCCCCCCCC--------CHHHHHHHHHHHHHC
Q ss_conf             999999999962899-887548554445678--------704676644673116
Q gi|254780479|r    8 IQSKILDWYDTNHRV-LPWRTSPKTEKSSLP--------SPYKVWISEIMLQQT   52 (356)
Q Consensus         8 ~~~~ll~w~~~~~R~-lpwr~~~~~~~~~~~--------~py~v~vseimlqqT   52 (356)
                      +.+.+++||+.+|=+ .||+.-.-..-...+        .-=+.+|+|++..++
T Consensus         8 ~~~~~~~~~~~~g~~l~p~Q~ea~~~~~~gkNllvsaPTgsGKTlvAe~ai~~~   61 (674)
T PRK01172          8 YDDEFLNLFTGNDFELYDHQRMAIEQLRKGENVIVSVPTAAGKTLIAYSAIYET   61 (674)
T ss_pred             CCHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHH
T ss_conf             997999999967998898999999999779959997899986999999999999


No 124
>PRK08076 consensus
Probab=77.60  E-value=1.4  Score=24.63  Aligned_cols=34  Identities=32%  Similarity=0.641  Sum_probs=27.3

Q ss_pred             HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             6754300578687314568888-----------8777877667766
Q gi|254780479|r  102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ...|.++| |.-|+..-++++|           ||||+.||...+.
T Consensus       162 ~~~v~ek~-Gv~P~q~~D~~aL~GDssDNIPGVpGiG~KtA~~ll~  206 (877)
T PRK08076        162 PEALFEKY-GLTPKQIIDMKGLMGDSSDNIPGVPGVGEKTAIKLLK  206 (877)
T ss_pred             HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             99999997-9698999999997187646799999863799999999


No 125
>smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1.
Probab=77.29  E-value=0.54  Score=27.48  Aligned_cols=23  Identities=43%  Similarity=0.573  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHCC
Q ss_conf             56888887778776677664349
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVAIAFN  140 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s~a~~  140 (356)
                      ++|+++||||+.||.+|+.....
T Consensus         1 ~~L~~v~GIG~k~A~~ll~~~~~   23 (26)
T smart00278        1 EELLKVPGIGPKTAEKILEAXXX   23 (26)
T ss_pred             CCCCCCCCCCCHHHHHHHHHHHC
T ss_conf             92101799881159999997620


No 126
>PRK07956 ligA NAD-dependent DNA ligase LigA; Validated
Probab=77.07  E-value=2.4  Score=22.93  Aligned_cols=13  Identities=31%  Similarity=0.319  Sum_probs=7.6

Q ss_pred             HHHHHHHHCCCCC
Q ss_conf             8999998448237
Q gi|254780479|r  342 VMKKALSAGGIKV  354 (356)
Q Consensus       342 ~~kKIL~alg~~~  354 (356)
                      +..++|-|||+++
T Consensus       506 ~l~r~L~ALGI~~  518 (668)
T PRK07956        506 PLARFLYALGIRH  518 (668)
T ss_pred             CHHHHHHHCCCCC
T ss_conf             5888998627864


No 127
>PRK13844 recombination protein RecR; Provisional
Probab=77.04  E-value=1  Score=25.53  Aligned_cols=24  Identities=38%  Similarity=0.641  Sum_probs=16.7

Q ss_pred             CCCCHH----HHHHHHHHHHHHHHHHHH
Q ss_conf             687314----568888877787766776
Q gi|254780479|r  112 NFPHKV----EILKKLPGIGDYTASAIV  135 (356)
Q Consensus       112 ~~P~~~----~~l~~LpGiG~yta~ai~  135 (356)
                      .+|...    +.|.+|||||+.||-=++
T Consensus         5 ~~~~~i~~LI~~l~kLPGIG~KsA~Rla   32 (200)
T PRK13844          5 IFSPKISAVIESLRKLPTIGKKSSQRLA   32 (200)
T ss_pred             CCCHHHHHHHHHHHHCCCCCHHHHHHHH
T ss_conf             5889999999998168998788999999


No 128
>PRK07556 consensus
Probab=76.87  E-value=1.3  Score=24.81  Aligned_cols=33  Identities=36%  Similarity=0.596  Sum_probs=26.8

Q ss_pred             HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             754300578687314568888-----------8777877667766
Q gi|254780479|r  103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ..|.++| |.-|+.+-++++|           ||||+.||...+.
T Consensus       169 ~~V~eK~-GV~P~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~  212 (977)
T PRK07556        169 PEVIEKF-GVPPEKVVDVQALAGDSVDNVPGIPGIGIKTAAQLIN  212 (977)
T ss_pred             HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             9999986-9698999999997187646789899854899999999


No 129
>PRK12911 bifunctional preprotein translocase subunit SecD/SecF; Reviewed
Probab=76.84  E-value=0.71  Score=26.67  Aligned_cols=30  Identities=27%  Similarity=0.517  Sum_probs=17.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH
Q ss_conf             5688888777877667766434963131243146665765
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY  157 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~  157 (356)
                      +.-++||||    ||-|++|+     ++||+||. ++.|.
T Consensus       958 nATLTLPGI----AGIVLTIG-----MAVDANVL-IFERI  987 (1405)
T PRK12911        958 DAPLTLSGL----AGIVLAMG-----MAVDANVL-VFERI  987 (1405)
T ss_pred             CCCCCCCHH----HHHHHHHH-----HHHHHHHH-HHHHH
T ss_conf             876455008----89997640-----01304479-99999


No 130
>cd00141 NT_POLXc Nucleotidyltransferase (NT) domain of family X DNA Polymerases. X family polymerases fill in short gaps during DNA repair. They are relatively inaccurate enzymes and play roles in base excision repair, in non-homologous end joining (NHEJ) which acts mainly to repair damage due to ionizing radiation, and in V(D)J recombination. This family includes eukaryotic Pol beta, Pol lambda, Pol mu, and terminal deoxyribonucleotidyl transferase (TdT). Pol beta and Pol lambda are primarily DNA template-dependent polymerases. TdT is a DNA template-independent polymerase. Pol mu has both template dependent and template independent activities. This subgroup belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. These three carboxylate residues are fairly well conserved in this
Probab=76.84  E-value=2.2  Score=23.20  Aligned_cols=26  Identities=27%  Similarity=0.542  Sum_probs=21.4

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             86873145688888777877667766
Q gi|254780479|r  111 GNFPHKVEILKKLPGIGDYTASAIVA  136 (356)
Q Consensus       111 g~~P~~~~~l~~LpGiG~yta~ai~s  136 (356)
                      ...|....+|+++|||||.||..+-.
T Consensus        78 ~~~p~~l~~l~~I~GiGpk~a~~l~~  103 (307)
T cd00141          78 EDVPPGLLLLLRVPGVGPKTARKLYE  103 (307)
T ss_pred             CCCCHHHHHHHCCCCCCHHHHHHHHH
T ss_conf             65637899996478878899999998


No 131
>PRK08835 consensus
Probab=76.60  E-value=1.5  Score=24.45  Aligned_cols=51  Identities=27%  Similarity=0.400  Sum_probs=35.2

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCC
Q ss_conf             76754300578687314568888-----------87778776677664349631312431466657651975
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDII  161 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~  161 (356)
                      ....|.++| |.-|+.+-++++|           ||||+.||...+.= |        |++.-++..+-.+.
T Consensus       161 d~~~V~ek~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~e-y--------GsLE~Iy~nld~Ik  222 (931)
T PRK08835        161 DREGVVEKF-GIPPELIIDYLALMGDKVDNIPGVPGVGDKTATALLQG-I--------GGLDALYDNLDDIA  222 (931)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH-H--------CCHHHHHHHHHHCC
T ss_conf             899999987-96989989999971886468899997458889999998-4--------86999997698614


No 132
>COG0353 RecR Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair]
Probab=76.45  E-value=0.91  Score=25.91  Aligned_cols=20  Identities=40%  Similarity=0.747  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q ss_conf             14568888877787766776
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~  135 (356)
                      =.+.|.+|||||+.||.-++
T Consensus        10 LI~~l~kLPGvG~KsA~R~A   29 (198)
T COG0353          10 LIDALKKLPGVGPKSAQRLA   29 (198)
T ss_pred             HHHHHHHCCCCCHHHHHHHH
T ss_conf             99999768998832799999


No 133
>PRK07997 consensus
Probab=76.34  E-value=1.5  Score=24.33  Aligned_cols=35  Identities=34%  Similarity=0.568  Sum_probs=28.3

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             76754300578687314568888-----------8777877667766
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ....|.++| |.-|+.+-++++|           ||||+.||...+.
T Consensus       161 ~~~~V~ek~-GV~P~qv~D~laL~GDsSDNIPGVpGIG~KTA~kLL~  206 (928)
T PRK07997        161 GPEEVVNKY-GVPPELIIDFLALMGDSSDNIPGVPGVGEKTAQALLQ  206 (928)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             799999987-9798988999997288656889999754899999999


No 134
>PRK07898 consensus
Probab=76.33  E-value=1.5  Score=24.32  Aligned_cols=35  Identities=31%  Similarity=0.557  Sum_probs=28.0

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             76754300578687314568888-----------8777877667766
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ....|.++| |.-|+..-++++|           ||||+.||...+.
T Consensus       175 d~~~V~ek~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~  220 (902)
T PRK07898        175 TPEAVEEKY-GLTPAQYPDFAALRGDPSDNLPGIPGVGEKTAAKWIA  220 (902)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             899999986-9798999999998188645799999844788999999


No 135
>PRK08786 consensus
Probab=76.31  E-value=1.4  Score=24.52  Aligned_cols=34  Identities=26%  Similarity=0.348  Sum_probs=27.7

Q ss_pred             HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             6754300578687314568888-----------8777877667766
Q gi|254780479|r  102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ...|.++| |.-|+.+-++++|           ||||+.||...+.
T Consensus       157 ~~~V~eK~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~  201 (927)
T PRK08786        157 DAAVIAKF-GVRPDQIVDLLALMGDTVDNVPGVEKCGPKTAAKWLA  201 (927)
T ss_pred             HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             89999996-9698999999998087646789999856899999999


No 136
>PRK08928 consensus
Probab=75.42  E-value=1.7  Score=24.05  Aligned_cols=51  Identities=29%  Similarity=0.442  Sum_probs=35.1

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCC
Q ss_conf             76754300578687314568888-----------87778776677664349631312431466657651975
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDII  161 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~  161 (356)
                      ....|.++| |.-|+.+-++++|           ||||+.||...+.- |        |++.-++..+-.+.
T Consensus       159 ~~~~V~ek~-GV~P~qiiD~laL~GDsSDNIPGVpGIG~KTA~kLL~e-y--------GsLE~Iy~n~d~Ik  220 (861)
T PRK08928        159 TEDDVVEKF-GVTSDKLLDVMALTGDRSDNIPGVPSIGPKTAAKLITQ-F--------GSLENILNSLDQIS  220 (861)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH-C--------CCHHHHHHHHHHCC
T ss_conf             899999997-97989999999980876468899988562899999996-7--------97999999886113


No 137
>PRK05797 consensus
Probab=74.97  E-value=1.6  Score=24.23  Aligned_cols=34  Identities=35%  Similarity=0.598  Sum_probs=26.5

Q ss_pred             HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             6754300578687314568888-----------8777877667766
Q gi|254780479|r  102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ...|.++| |.-|+..-++++|           ||||+.||...+.
T Consensus       162 ~~~v~ek~-Gv~P~q~~D~~aL~GD~sDNIPGVpGIG~KTA~kLL~  206 (869)
T PRK05797        162 KNKMVEEY-GVTPTQFIDVKGLMGDKSDNIPGVPGIGEKTAFKLIK  206 (869)
T ss_pred             HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             99989997-7899999999998088645789999878189999998


No 138
>PRK08434 consensus
Probab=74.74  E-value=1.7  Score=23.93  Aligned_cols=49  Identities=24%  Similarity=0.351  Sum_probs=33.5

Q ss_pred             HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCC
Q ss_conf             754300578687314568888-----------87778776677664349631312431466657651975
Q gi|254780479|r  103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDII  161 (356)
Q Consensus       103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~  161 (356)
                      ..|.++| |.-|+.+-++++|           ||||+.||...+.- |        |++.-++..+-.++
T Consensus       159 ~~V~eK~-GV~P~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~e-y--------GsLE~I~~n~d~Ik  218 (887)
T PRK08434        159 AACLEKY-GVKPKQIRDFLALCGDSADNIPGVKGIGAKGAKKLLDE-F--------GSLEGIYENLELVR  218 (887)
T ss_pred             HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH-C--------CCHHHHHHHHHHCC
T ss_conf             9999997-96989999999981886457899998568999999998-2--------88899999887124


No 139
>TIGR02789 nickel_nikB nickel ABC transporter, permease subunit NikB; InterPro: IPR014156   ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs.   ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain .    The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , .   The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , .   Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1).    This family consists of the NikB family of nickel ABC transporter permeases. Operons that contain this protein also contain a homologous permease subunit NikC. Nickel is used in cells as part of urease or certain hydrogenases or superoxide dismutases..
Probab=74.59  E-value=0.54  Score=27.51  Aligned_cols=38  Identities=29%  Similarity=0.433  Sum_probs=25.5

Q ss_pred             HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH
Q ss_conf             00578687314568888877787766776643496313124314
Q gi|254780479|r  107 KKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI  150 (356)
Q Consensus       107 ~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv  150 (356)
                      +=-+|.+  =.|..=++||||||.-+||.    |...|++-.-|
T Consensus       245 ~Ll~GT~--viE~iFswPG~GRy~i~Aif----NRDyPviQcy~  282 (315)
T TIGR02789       245 ELLGGTL--VIENIFSWPGVGRYAISAIF----NRDYPVIQCYT  282 (315)
T ss_pred             HHHHCCE--EEEEECCCCCHHHHHHHHHH----HCCCHHHHHHH
T ss_conf             7840711--45410126661045743356----23763899999


No 140
>TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938   This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination.
Probab=74.42  E-value=0.54  Score=27.51  Aligned_cols=57  Identities=21%  Similarity=0.374  Sum_probs=34.9

Q ss_pred             CCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH
Q ss_conf             8997898729989999997404634588999877675430057868731456888887778
Q gi|254780479|r   68 WPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGD  128 (356)
Q Consensus        68 ~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~  128 (356)
                      |-|++++|.|+..|+-.. +|.| =++|..+..+||...+=  |.|-.-.+-|..=.-||+
T Consensus        21 y~t~~~iA~A~~~EL~~~-~gI~-E~~A~kiI~AAR~a~~~--~~F~~a~~vl~rR~~v~k   77 (333)
T TIGR02236        21 YDTLEAIAVASPKELSEI-AGIG-EGTAAKIIQAARKAADL--GGFETADDVLERRKSVGK   77 (333)
T ss_pred             HHHHHHHHCCCHHHHHHH-CCCC-HHHHHHHHHHHHHHHCC--CCCHHHHHHHHHHHCCCE
T ss_conf             788999844585795320-3787-77899999999998467--672148999998720750


No 141
>PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional
Probab=74.25  E-value=1.2  Score=25.10  Aligned_cols=98  Identities=15%  Similarity=0.177  Sum_probs=48.4

Q ss_pred             HHHHHHHHHHHHHCCHHHCCHHH--HHHHHHCCCHHHHHCCCHHHH---HHHHHCCCHHHHHHHHHH--HHHHHHHCCCC
Q ss_conf             04676644673116120332658--998774899789872998999---999740463458899987--76754300578
Q gi|254780479|r   39 PYKVWISEIMLQQTTVKTVEPYF--KKFMQKWPTIFCLSSAKDEEI---LSAWAGLGYYTRARNLKK--CADIIVKKYEG  111 (356)
Q Consensus        39 py~v~vseimlqqT~v~~v~~~~--~~~~~~~P~~~~la~a~~~~v---l~~w~gLGyy~Rar~l~~--~a~~i~~~~~g  111 (356)
                      -|.|.||.-.+.+-+...-+..|  .-+=+--.++--+....|.++   |.-.+|.|.- -|-.+..  .+..+.+-   
T Consensus        25 GY~V~vs~~~~~~l~~~~~v~l~t~~~vrEd~~~LyGF~~~~Er~~F~~LisVsGIGpk-~Al~iLs~~~~~~l~~a---  100 (196)
T PRK13901         25 EFELLVSSFCLAELRLLEDVEILTYLHTREDELKLFGFLNSSEREVFEELIGVDGIGPR-AALRVLSGIKYNEFRDA---  100 (196)
T ss_pred             EEEEEECHHHHHHCCCCCEEEEEEEEEEEECCCEEECCCCHHHHHHHHHHHCCCCCCHH-HHHHHHCCCCHHHHHHH---
T ss_conf             99999678999765899808999999995167713365988999999998765882689-99999757999999999---


Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHHHHHHCC
Q ss_conf             68731456888887778776677664349
Q gi|254780479|r  112 NFPHKVEILKKLPGIGDYTASAIVAIAFN  140 (356)
Q Consensus       112 ~~P~~~~~l~~LpGiG~yta~ai~s~a~~  140 (356)
                      ..-.|.+.|.++||||+.||.-|..=--|
T Consensus       101 I~~~D~~~L~~vpGIG~KtA~rIi~ELk~  129 (196)
T PRK13901        101 IDREDIELISKVKGIGNKMAGKIFLKLRG  129 (196)
T ss_pred             HHHCCHHHHHHCCCCCHHHHHHHHHHHHH
T ss_conf             99289999831999589999999999976


No 142
>PRK07300 consensus
Probab=74.15  E-value=1.9  Score=23.73  Aligned_cols=33  Identities=27%  Similarity=0.387  Sum_probs=26.0

Q ss_pred             HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             754300578687314568888-----------8777877667766
Q gi|254780479|r  103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ..|.++| |.-|+..-++++|           ||||+.||...+.
T Consensus       171 ~~v~ek~-GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~  214 (880)
T PRK07300        171 AYLMEKM-GLTPNQFIDLKALMGDKSDNIPGVTKIGEKTGLKLLH  214 (880)
T ss_pred             HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             9999997-8898999999998088624677889853699999999


No 143
>smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases
Probab=73.90  E-value=5.5  Score=20.40  Aligned_cols=25  Identities=16%  Similarity=0.290  Sum_probs=20.1

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHH
Q ss_conf             8687314568888877787766776
Q gi|254780479|r  111 GNFPHKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus       111 g~~P~~~~~l~~LpGiG~yta~ai~  135 (356)
                      ...|....+|+++|||||.||...-
T Consensus        82 ~~~~~~l~el~~I~GvGpk~a~~l~  106 (334)
T smart00483       82 DEVYKSLKLFTNVFGVGPKTAAKWY  106 (334)
T ss_pred             CCCCHHHHHHHCCCCCCHHHHHHHH
T ss_conf             8651689998538887789999999


No 144
>PRK02362 ski2-like helicase; Provisional
Probab=71.51  E-value=3.1  Score=22.20  Aligned_cols=18  Identities=17%  Similarity=0.477  Sum_probs=10.2

Q ss_pred             HHHHHHHHHCC-CC-CCCCC
Q ss_conf             99999999628-99-88754
Q gi|254780479|r   10 SKILDWYDTNH-RV-LPWRT   27 (356)
Q Consensus        10 ~~ll~w~~~~~-R~-lpwr~   27 (356)
                      +.++++|+..| +. +||+.
T Consensus        10 ~~~~~~~~~~gI~~Lyp~Q~   29 (736)
T PRK02362         10 DGVIDFYEGSGIEELYPPQA   29 (736)
T ss_pred             HHHHHHHHHCCCCCCCHHHH
T ss_conf             89999999769975789999


No 145
>PRK05755 DNA polymerase I; Provisional
Probab=71.16  E-value=2.5  Score=22.82  Aligned_cols=35  Identities=29%  Similarity=0.570  Sum_probs=26.7

Q ss_pred             HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             76754300578687314568888-----------8777877667766
Q gi|254780479|r  101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ....|.++| |.-|...-++++|           ||||+.||...+.
T Consensus       159 ~~~~v~ek~-Gv~P~q~~D~~aL~GD~sDNIPGVpGIG~KtA~kLl~  204 (889)
T PRK05755        159 DPEGVIEKY-GVTPEQIIDFLALMGDSSDNIPGVPGVGEKTAAKLLK  204 (889)
T ss_pred             CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             899999997-9698999999998088645789988626999999998


No 146
>PRK07945 hypothetical protein; Provisional
Probab=70.88  E-value=0.9  Score=25.93  Aligned_cols=61  Identities=20%  Similarity=0.286  Sum_probs=34.5

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC
Q ss_conf             8899987767543005786873--14568888877787766776643496313124314666576519754
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPH--KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK  162 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~--~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~  162 (356)
                      |.+-...+|..|..--.+.+-.  ....|.+|||||+.||..|.-        +++|-|---|.++-.-..
T Consensus        23 rv~a~r~aa~~~~~~~~~~~~~~~~~~~~~~l~gig~~ta~vi~~--------a~~g~~p~~l~~l~~~~~   85 (335)
T PRK07945         23 RVMAFRNAADVVEALDEAERARHGRAGSWQSLPGIGPKTAKVIAQ--------AWAGRVPDYLAELRAAAE   85 (335)
T ss_pred             HHHHHHHHHHHHHHCCHHHHHHHHHCCCCEECCCCCHHHHHHHHH--------HHCCCCCHHHHHHHHHCC
T ss_conf             899999999999837999999987449811278878058999999--------965898289999997446


No 147
>KOG2534 consensus
Probab=70.85  E-value=3.7  Score=21.63  Aligned_cols=67  Identities=24%  Similarity=0.286  Sum_probs=40.5

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCC-------HHHHHHHHHCCCC
Q ss_conf             88999877675430057868731456888887778776677664349631312431-------4666576519754
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTN-------IERIISRYFDIIK  162 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~N-------v~RVl~R~~~~~~  162 (356)
                      ||+---+||..+ +.+.-.+ .+.+++.+|||||+..|.-|--|.=.-...-++.-       +.-.|.++||+-.
T Consensus        34 r~~~y~~Aasvl-k~~p~~I-~S~~ea~~lP~iG~kia~ki~EiletG~l~ele~v~~de~~~~lklFtnifGvG~  107 (353)
T KOG2534          34 RARAYRRAASVL-KSLPFPI-TSGEEAEKLPGIGPKIAEKIQEILETGVLRELEAVRNDERSQSLKLFTNIFGVGL  107 (353)
T ss_pred             HHHHHHHHHHHH-HHCCCCC-CCHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHCCCH
T ss_conf             999999999999-8499976-6578855799977779999999997087366787754506789999998725577


No 148
>COG1796 POL4 DNA polymerase IV (family X) [DNA replication, recombination, and repair]
Probab=70.63  E-value=2  Score=23.47  Aligned_cols=21  Identities=29%  Similarity=0.474  Sum_probs=11.9

Q ss_pred             EEEEECCCCHHHHCCCCCCCC
Q ss_conf             577645720132047442111
Q gi|254780479|r  252 ILLRKRTNTRLLEGMDELPGS  272 (356)
Q Consensus       252 iLL~KRp~~gll~GLwEFP~~  272 (356)
                      +++.--.+..++.-|.+||..
T Consensus       201 ~~~s~~~~~~v~~~~~~~~~~  221 (326)
T COG1796         201 ILISTSHPESVLEELLEMPNV  221 (326)
T ss_pred             EEECCCCCHHHHHHHHCCCCC
T ss_conf             675069947899998667876


No 149
>pfam02371 Transposase_20 Transposase IS116/IS110/IS902 family. Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS116, IS110 and IS902. This region is often found with pfam01548. The exact function of this region is uncertain. This family contains a HHH motif suggesting a DNA-binding function.
Probab=70.40  E-value=1.6  Score=24.24  Aligned_cols=41  Identities=29%  Similarity=0.387  Sum_probs=25.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC
Q ss_conf             568888877787766776643496313124314666576519754
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK  162 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~  162 (356)
                      +-|.++||||+-||..|++..-+...   --| .+-+.-+.|+.-
T Consensus         2 ~~L~sipGiG~~~a~~l~aeigd~~r---F~~-~~~~~s~~Gl~P   42 (87)
T pfam02371         2 ELLLSIPGIGPITAAALLAEIGDISR---FKS-ARQLAAYAGLAP   42 (87)
T ss_pred             CHHHCCCCCCHHHHHHHHHHHCCHHH---CCC-HHHHHHHCCCCC
T ss_conf             23426999529999999999298532---789-999999839999


No 150
>COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair]
Probab=69.96  E-value=2.3  Score=23.06  Aligned_cols=62  Identities=27%  Similarity=0.244  Sum_probs=41.8

Q ss_pred             HHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             78987299899999974046345889998776754300578---68731456888887778776677664
Q gi|254780479|r   71 IFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus        71 ~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g---~~P~~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      .++|+++..=+++     |+.=.+..+-...|+.+.++||+   -+..+.++|..+||||..-|.-+.++
T Consensus        21 ~~~Lsd~ELLail-----LrtG~~~~~~~~la~~lL~~fg~L~~l~~a~~~el~~v~GiG~aka~~l~a~   85 (224)
T COG2003          21 AEALSDAELLAIL-----LRTGTKGESVLDLAKELLQEFGSLAELLKASVEELSSVKGIGLAKAIQIKAA   85 (224)
T ss_pred             HHHCCHHHHHHHH-----HHCCCCCCCHHHHHHHHHHHCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHH
T ss_conf             4534668999999-----9628999878999999999732588887379999951788338899999999


No 151
>TIGR01129 secD protein-export membrane protein SecD; InterPro: IPR005791    Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome.     The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB  is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . Together with SecY and SecG, SecE forms a multimeric channel through which preproteins are translocated, using both proton motive forces and ATP-driven secretion. The latter is mediated by SecA. The structure of the Escherichia coli SecYEG assembly revealed a sandwich of two membranes interacting through the extensive cytoplasmic domains . Each membrane is composed of dimers of SecYEG. The monomeric complex contains 15 transmembrane helices.     This entry describes the SecD family of transport proteins. Members of this family are highly variable in length immediately after the well-conserved motif LGLGLXGG at the amino-terminal end of this model. Archaeal homologs are not included in the seed. SecD from Mycobacterium tuberculosis has a long Pro-rich insert. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0009276 1-2nm peptidoglycan-based cell wall.
Probab=69.15  E-value=1.7  Score=24.01  Aligned_cols=12  Identities=8%  Similarity=-0.011  Sum_probs=5.8

Q ss_pred             HHCCCEEEECCC
Q ss_conf             434963131243
Q gi|254780479|r  137 IAFNHFAVVVDT  148 (356)
Q Consensus       137 ~a~~~~~~~vD~  148 (356)
                      ++..+|+.-.+|
T Consensus       118 lGv~EP~v~~~G  129 (522)
T TIGR01129       118 LGVAEPVVQRQG  129 (522)
T ss_pred             HCCCCCEEEEEC
T ss_conf             167776488724


No 152
>COG3547 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=67.97  E-value=5.4  Score=20.48  Aligned_cols=37  Identities=27%  Similarity=0.367  Sum_probs=26.4

Q ss_pred             HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH
Q ss_conf             998776754300578687314568888877787766776
Q gi|254780479|r   97 NLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus        97 ~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~  135 (356)
                      .+....+.+........+.+.  |.++||||+-+|.+|.
T Consensus       169 ~i~~l~~~~~~~~~~~~~~~~--l~~ipgig~~~a~~i~  205 (303)
T COG3547         169 EIAALDREIEARLGIGVVTDR--LASIPGIGELTAAAIA  205 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH--HHHCCCCCHHHHHHHH
T ss_conf             898888999998500034568--8858672077899998


No 153
>pfam01601 Corona_S2 Coronavirus S2 glycoprotein. The coronavirus spike glycoprotein forms the characteristic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 pfam01600 and S2.
Probab=66.50  E-value=3  Score=22.23  Aligned_cols=75  Identities=27%  Similarity=0.419  Sum_probs=42.0

Q ss_pred             CCHHHHHH---HHHCCCH----HHHHHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHHH------HHHHHHHHHHHHCCCE
Q ss_conf             99899999---9740463----45889998776754300-578687314568888877------7877667766434963
Q gi|254780479|r   77 AKDEEILS---AWAGLGY----YTRARNLKKCADIIVKK-YEGNFPHKVEILKKLPGI------GDYTASAIVAIAFNHF  142 (356)
Q Consensus        77 a~~~~vl~---~w~gLGy----y~Rar~l~~~a~~i~~~-~~g~~P~~~~~l~~LpGi------G~yta~ai~s~a~~~~  142 (356)
                      +-.||+|=   .-.|||+    |.++-.=...+-.++.+ |+|        ++-||||      +-|||+++.+++|+--
T Consensus       149 S~IEDLLF~KV~t~glgtvd~dYk~Ct~g~~i~DL~CaQ~yNG--------i~VLPpv~da~~~amYT~sL~g~ma~ggi  220 (609)
T pfam01601       149 SAIEDLLFNKVVTSGLGTVDEDYKKCTGGGSIADLICAQYYNG--------IMVLPPVVDAEMMAMYTASLVGAMALGGI  220 (609)
T ss_pred             HHHHHHHHCEEEECCCCCCHHHHHHCCCCCCHHHHHHHHHCCC--------EEECCCCCCHHHHHHHHHHHHHHHHHCCC
T ss_conf             3788763231696589853545875569963577500533175--------58548878878998899999998761451


Q ss_pred             EE--ECCCCHHHHHHHHHCC
Q ss_conf             13--1243146665765197
Q gi|254780479|r  143 AV--VVDTNIERIISRYFDI  160 (356)
Q Consensus       143 ~~--~vD~Nv~RVl~R~~~~  160 (356)
                      .+  ++ -=-.-|-+|+-++
T Consensus       221 taaaaI-PFa~~vQ~RlN~l  239 (609)
T pfam01601       221 TAAAAI-PFATQVQARLNYV  239 (609)
T ss_pred             HHHEEC-CHHHHHHHHHHHH
T ss_conf             031017-4699899886334


No 154
>pfam03686 UPF0146 Uncharacterized protein family (UPF0146). The function of this family of proteins is unknown.
Probab=66.30  E-value=3.5  Score=21.75  Aligned_cols=92  Identities=17%  Similarity=0.164  Sum_probs=59.0

Q ss_pred             HHHHHHHHC-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCC
Q ss_conf             776754300-5786873145688888777877667766434963131243146665765197545735689998741025
Q gi|254780479|r  100 KCADIIVKK-YEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKI  178 (356)
Q Consensus       100 ~~a~~i~~~-~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~  178 (356)
                      +.|..|.++ +.|++=+        =|||.++.-|-..--.|-...++|-|-..|-.-+-.+..+.-.-.-.+.+-|+-+
T Consensus         3 ~~a~~ia~~~~~gkiVE--------VGIG~~~~vA~~L~~~g~dv~~tDi~~~av~~gl~~v~DDif~P~~~lY~~A~lI   74 (127)
T pfam03686         3 EVAEYIARENARGKVVE--------VGIGFFLDVAKRLAERGFDVLATDINEKAVPEGLRFVVDDITNPNISIYEGADLI   74 (127)
T ss_pred             HHHHHHHHHCCCCCEEE--------EECCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEECCCCCCCHHHHCCCCEE
T ss_conf             39999874187885899--------9136888999999985996899977765766788878725889898895587889


Q ss_pred             CCCCCHHHHHHHHHHHHHHHC
Q ss_conf             785226789999999986320
Q gi|254780479|r  179 TSTSRPGDFVQAMMDLGALIC  199 (356)
Q Consensus       179 ~~~~~~~~~nQAlMdlGa~iC  199 (356)
                      -.-.-|.+...++++++..|=
T Consensus        75 YSIRPp~El~~~i~~lA~~v~   95 (127)
T pfam03686        75 YSIRPPPELQSAILDVAKAVG   95 (127)
T ss_pred             EECCCCHHHHHHHHHHHHHCC
T ss_conf             981898788899999999819


No 155
>TIGR02150 IPP_isom_1 isopentenyl-diphosphate delta-isomerase, type 1; InterPro: IPR011876    This entry represents type 1 of two non-homologous families of the enzyme isopentenyl-diphosphate delta-isomerase (IPP isomerase). IPP is an essential building block for many compounds, including enzyme cofactors, sterols, and prenyl groups. This inzyme interconverts isopentenyl diphosphate and dimethylallyl diphosphate.; GO: 0004452 isopentenyl-diphosphate delta-isomerase activity, 0008299 isoprenoid biosynthetic process.
Probab=66.01  E-value=5  Score=20.70  Aligned_cols=32  Identities=25%  Similarity=0.471  Sum_probs=24.8

Q ss_pred             CCEEEEEEEEEECCCEEEEEECCCCH-HHHCCCC
Q ss_conf             21123200034137605776457201-3204744
Q gi|254780479|r  236 PMRTGAVFIAITNDNRILLRKRTNTR-LLEGMDE  268 (356)
Q Consensus       236 ~~r~~~~~vii~~~~kiLL~KRp~~g-ll~GLwE  268 (356)
                      ..+-+-+|| .+.++++||+||...+ .|-|+|.
T Consensus        29 LHrAFS~fl-fn~~~~LL~qrRa~~K~twP~vWT   61 (190)
T TIGR02150        29 LHRAFSVFL-FNDQGRLLLQRRAASKITWPGVWT   61 (190)
T ss_pred             EEEEEEEEE-ECCCCCEEEEEECCCCCCCCCCEE
T ss_conf             231367677-718983688884011884378400


No 156
>COG0494 MutT NTP pyrophosphohydrolases including oxidative damage repair enzymes [DNA replication, recombination, and repair / General function prediction only]
Probab=65.69  E-value=5.9  Score=20.17  Aligned_cols=28  Identities=29%  Similarity=0.488  Sum_probs=24.2

Q ss_pred             CEEEEEECCCCHHHHCCCCCCCCCCCCCCCH
Q ss_conf             6057764572013204744211113467215
Q gi|254780479|r  250 NRILLRKRTNTRLLEGMDELPGSAWSSTKDG  280 (356)
Q Consensus       250 ~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~  280 (356)
                      +++|+.+|...+   |.|+||++..+.++..
T Consensus        24 ~~vl~~~~~~~~---~~~~~PgG~ve~~e~~   51 (161)
T COG0494          24 GEVLLAQRRDDG---GLWELPGGKVEPGEEL   51 (161)
T ss_pred             CEEEEEEEECCC---CEEEECCCCCCCCCCC
T ss_conf             799999984799---9599088855899987


No 157
>KOG3084 consensus
Probab=65.56  E-value=11  Score=18.27  Aligned_cols=93  Identities=15%  Similarity=0.194  Sum_probs=52.0

Q ss_pred             EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEEEEE----------
Q ss_conf             0003413760577645720132047442111134672158886301010000213735899831599999----------
Q gi|254780479|r  242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTLTLF----------  311 (356)
Q Consensus       242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~----------  311 (356)
                      +.++++.+|+.+|-.| ..+.=.|||.-+-+-.+.++..+.....+..++.....+.|...-| ..|=+.          
T Consensus       191 Im~li~~d~~~~LL~R-~~r~~~gl~t~lAGFlEpGES~eeav~REtwEEtGi~V~~I~~~as-QPWP~~p~SLMIgc~a  268 (345)
T KOG3084         191 IMLLIDHDGKHALLGR-QKRYPPGLWTCLAGFLEPGESIEEAVRRETWEETGIEVEVISYVAS-QPWPLMPQSLMIGCLA  268 (345)
T ss_pred             EEEEECCCCCEEEEEC-CCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEC-CCCCCCCHHHHHHHHH
T ss_conf             9999768887741001-4679996010000027755029999999889974954135766504-8989996689999999


Q ss_pred             ---E-EEEECCCCCCCCCCEEECHHHHHH
Q ss_conf             ---9-999807767667813322788874
Q gi|254780479|r  312 ---V-WKTIVPQIVIIPDSTWHDAQNLAN  336 (356)
Q Consensus       312 ---v-~~~~~~~~~~~~~~~Wv~~~el~~  336 (356)
                         . -.+.+++..+..+.+|++..++.+
T Consensus       269 la~~~~~I~vd~dlEleDaqwF~r~ev~~  297 (345)
T KOG3084         269 LAKLNGKISVDKDLELEDAQWFDREEVKS  297 (345)
T ss_pred             HHHHCCCCCCCCCHHHHHCCCCCHHHHHH
T ss_conf             98617851237534544234424999999


No 158
>TIGR03374 ABALDH 1-pyrroline dehydrogenase. Members of this protein family are 1-pyrroline dehydrogenase (1.5.1.35), also called gamma-aminobutyraldehyde dehydrogenase. This enzyme can follow putrescine transaminase (EC 2.6.1.82) for a two-step conversion of putrescine to gamma-aminobutyric acid (GABA). The member from Escherichia coli is characterized as a homotetramer that binds one NADH per momomer. This enzyme belongs to the medium-chain aldehyde dehydrogenases, and is quite similar in sequence to the betaine aldehyde dehydrogenase (EC 1.2.1.8) family.
Probab=64.53  E-value=9.9  Score=18.59  Aligned_cols=76  Identities=16%  Similarity=0.161  Sum_probs=47.1

Q ss_pred             HHHHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCH-HHHHHHHHHHHHHHHHHHHHHHCCCEEEECCC
Q ss_conf             999997404634588999877675430057----------86-8731-45688888777877667766434963131243
Q gi|254780479|r   81 EILSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHK-VEILKKLPGIGDYTASAIVAIAFNHFAVVVDT  148 (356)
Q Consensus        81 ~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~-~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~  148 (356)
                      +-...|+-+..+.|++-|+++|..+.++.+          |+ +... .+++..+-.+=+|.|++......-.+...++|
T Consensus        49 ~A~~~W~~~~~~eR~~il~~~a~~l~~~~~~la~l~~~e~GK~~~~a~~~ev~~~~~~~~~~a~~~~~~~~~~~~~~~~g  128 (472)
T TIGR03374        49 AAFAEWGQTTPKARAECLLKLADVIEENAQVFAELESRNCGKPLHSVFNDEIPAIVDVFRFFAGAARCLSGLAAGEYLEG  128 (472)
T ss_pred             HHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
T ss_conf             99998712999999999999999999789999999999719469999999999999999999988987047757777788


Q ss_pred             CHHHHHHH
Q ss_conf             14666576
Q gi|254780479|r  149 NIERIISR  156 (356)
Q Consensus       149 Nv~RVl~R  156 (356)
                      +..++..+
T Consensus       129 ~~~~~~r~  136 (472)
T TIGR03374       129 HTSMIRRD  136 (472)
T ss_pred             CCEEEEEE
T ss_conf             63069971


No 159
>cd01701 Pol_zeta Pol_zeta, a member of the Y-family of DNA polymerases. Y-family polymerases can transverse normal replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at higher error rate. The Y-family has no 3'-5' exonuclease activity. Pol zeta has also been named Rev1; this subfamily is mainly present in eukaryotes.
Probab=62.05  E-value=13  Score=17.83  Aligned_cols=88  Identities=23%  Similarity=0.334  Sum_probs=39.2

Q ss_pred             CCCHHHHHCCCHHHHHHHH-----HCCCHHHHHHHHHHHHHHHHHCCCCCC---CCHHHHH------HHHHHHHHHHHHH
Q ss_conf             8997898729989999997-----404634588999877675430057868---7314568------8888777877667
Q gi|254780479|r   68 WPTIFCLSSAKDEEILSAW-----AGLGYYTRARNLKKCADIIVKKYEGNF---PHKVEIL------KKLPGIGDYTASA  133 (356)
Q Consensus        68 ~P~~~~la~a~~~~vl~~w-----~gLGyy~Rar~l~~~a~~i~~~~~g~~---P~~~~~l------~~LpGiG~yta~a  133 (356)
                      +-+++.+|..=.++|...-     -|.|   .-+-|=|.|--.. +-+|.+   |++..++      ..|||||+-|+.-
T Consensus       164 ~~~~~~~a~~Ir~~I~~~tglt~SiGIa---~nK~lAKiAs~~~-KP~G~~~i~~~~~~~~L~~lpv~~i~GIG~~t~~k  239 (405)
T cd01701         164 YELPEELAETIRNEIKETTGCSASVGIG---PNILLARLATRKA-KPNGQYYLSPEKVEEFLDTLPVKDLPGVGSSTKEK  239 (405)
T ss_pred             CCCHHHHHHHHHHHHHHHHCCCEEECCC---CCHHHHHHHHHHC-CCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHH
T ss_conf             6779999999999999987997564437---6499999988736-99873236878999999855211158717899999


Q ss_pred             HHHHHCCCEEEECCCCHHHHHHHHHC
Q ss_conf             76643496313124314666576519
Q gi|254780479|r  134 IVAIAFNHFAVVVDTNIERIISRYFD  159 (356)
Q Consensus       134 i~s~a~~~~~~~vD~Nv~RVl~R~~~  159 (356)
                      ........-.-.......-.|.+.||
T Consensus       240 L~~~gi~t~~dl~~~~~~~~L~~~fG  265 (405)
T cd01701         240 LKSLGIDTCGDLQLISTKEKLQKVFG  265 (405)
T ss_pred             HHHCCCCCHHHHHCCCHHHHHHHHHH
T ss_conf             99849916899862787999999977


No 160
>PRK04148 hypothetical protein; Provisional
Probab=61.86  E-value=8.5  Score=19.07  Aligned_cols=92  Identities=21%  Similarity=0.234  Sum_probs=59.1

Q ss_pred             HHHHHHHHCCC----CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHH-HHHCCCC---CCCHHHHHH
Q ss_conf             77675430057----868731456888887778776677664349631312431466657-6519754---573568999
Q gi|254780479|r  100 KCADIIVKKYE----GNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIIS-RYFDIIK---PAPLYHKTI  171 (356)
Q Consensus       100 ~~a~~i~~~~~----g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~-R~~~~~~---~~~~~~k~l  171 (356)
                      ..|..|.++|.    |++=+        =|||.|+--|-..--.|-.+.++|-|-..|.. +-.|+..   +.-.-.-.+
T Consensus         3 ~iaeyI~~~y~~~~~~kIvE--------vGIGf~~~vA~~L~e~g~dv~~~Din~~aV~~a~~~Gl~~v~DDif~P~l~i   74 (135)
T PRK04148          3 TIAEYIARNYPHLKNGKIAE--------LGIGFYFKVAKKLKESGFDVIVIDINKKAVEKAKKLGLNAFVDDIFNPNLEI   74 (135)
T ss_pred             HHHHHHHHHCCCCCCCEEEE--------EECCCCHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCEEECCCCCCCHHH
T ss_conf             69999998542112873899--------9236667899999874998899957655432212328763751588998888


Q ss_pred             HHHHHCCCCCCCHHHHHHHHHHHHHHHC
Q ss_conf             8741025785226789999999986320
Q gi|254780479|r  172 KNYARKITSTSRPGDFVQAMMDLGALIC  199 (356)
Q Consensus       172 ~~~~~~~~~~~~~~~~nQAlMdlGa~iC  199 (356)
                      .+-|.-+-.-.-|.+.-.++++++-.|=
T Consensus        75 Y~~a~lIYSIRPp~Elq~~il~lakkv~  102 (135)
T PRK04148         75 YKNAKLIYSIRPPRDLQPFILELAKKLN  102 (135)
T ss_pred             HCCCCEEEEECCCHHHHHHHHHHHHHCC
T ss_conf             5178879981898788899999999759


No 161
>pfam01367 5_3_exonuc 5'-3' exonuclease, C-terminal SAM fold.
Probab=61.40  E-value=3.2  Score=22.05  Aligned_cols=34  Identities=24%  Similarity=0.422  Sum_probs=21.7

Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH
Q ss_conf             3145688888777877667766434963131243146665765
Q gi|254780479|r  115 HKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY  157 (356)
Q Consensus       115 ~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~  157 (356)
                      ++.+.+--.||||+.||..++.--         |++.-++...
T Consensus        15 D~sDnIPGv~GiG~KtA~~Ll~~~---------gsle~i~~n~   48 (100)
T pfam01367        15 DSSDNIPGVPGIGEKTAAKLLKEY---------GSLENIYENL   48 (100)
T ss_pred             CCCCCCCCCCCCCCHHHHHHHHHC---------CCHHHHHHHH
T ss_conf             764588999998816899999981---------9899999819


No 162
>TIGR02757 TIGR02757 conserved hypothetical protein TIGR02757; InterPro: IPR014127   Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation..
Probab=60.99  E-value=3.7  Score=21.65  Aligned_cols=46  Identities=26%  Similarity=0.418  Sum_probs=23.7

Q ss_pred             CEEEECCCCHHHHHHHHHCCCCCCCHHHHH---HHHHHHCCCCCCCH--HHH
Q ss_conf             631312431466657651975457356899---98741025785226--789
Q gi|254780479|r  141 HFAVVVDTNIERIISRYFDIIKPAPLYHKT---IKNYARKITSTSRP--GDF  187 (356)
Q Consensus       141 ~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~---l~~~~~~~~~~~~~--~~~  187 (356)
                      +=..++||||.||-|+.+|+..-....-|.   +-+...++.| ++|  +||
T Consensus       210 ~Li~PLDTH~~~~~s~~Lkl~~rk~~dlK~A~~iT~~L~~~~p-~DP~kYDF  260 (269)
T TIGR02757       210 DLILPLDTHVFRIASKKLKLLKRKSYDLKAAIEITEALKKLNP-EDPIKYDF  260 (269)
T ss_pred             CCCCCCHHHHHHHHHHHHCHHHHHHCCHHHHHHHHHHHHHHCC-CCCCCCCH
T ss_conf             1403335899999877623256652366899999999876178-85952441


No 163
>KOG3690 consensus
Probab=60.34  E-value=14  Score=17.63  Aligned_cols=97  Identities=15%  Similarity=0.169  Sum_probs=51.6

Q ss_pred             HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE----CCCCHHHHHHHHHCCCCCCCHHH
Q ss_conf             58899987767543005786873145688888777877667766434963131----24314666576519754573568
Q gi|254780479|r   93 TRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVV----VDTNIERIISRYFDIIKPAPLYH  168 (356)
Q Consensus        93 ~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~----vD~Nv~RVl~R~~~~~~~~~~~~  168 (356)
                      .+++....-+...--.|.-+.+.+  .++++--++..++.-+.+.|-.....-    -|.-|+|.+-++           
T Consensus        43 ~q~~~v~~q~~la~~~y~~~~~~~--~~~a~~ea~~~~~~~~r~~a~q~k~~~~~~l~D~~~rRql~~~-----------  109 (646)
T KOG3690          43 AQARVVYRQEALAGWNYNTNGTEE--NLLALLEAEDKLVEFVRSLATQAKKFDVAELQDERLRRQLQKV-----------  109 (646)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHHHH-----------
T ss_conf             999999999999988875379849--9999998889999999999877754248640488899999988-----------


Q ss_pred             HHHHHHHHCCCCCCCHHHHHHHHHHH-----HHHHCCCCCCC
Q ss_conf             99987410257852267899999999-----86320412356
Q gi|254780479|r  169 KTIKNYARKITSTSRPGDFVQAMMDL-----GALICTSNKPL  205 (356)
Q Consensus       169 k~l~~~~~~~~~~~~~~~~nQAlMdl-----Ga~iC~p~~P~  205 (356)
                         .++--.-++..+--++|+++-++     +++||.+.+|.
T Consensus       110 ---s~lg~saL~~~d~~~~~~ll~~m~~~y~~~~vC~y~~p~  148 (646)
T KOG3690         110 ---SQLGMSALSPDDLKRYNQLLSSMSTNYNTATVCAYDQPS  148 (646)
T ss_pred             ---HHHCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCC
T ss_conf             ---663320289789999999999998524776212888999


No 164
>KOG3911 consensus
Probab=59.38  E-value=4.8  Score=20.84  Aligned_cols=29  Identities=21%  Similarity=0.572  Sum_probs=19.4

Q ss_pred             HHHHHCCCHHHHHC-CCHHHHHHHHHCCCHH
Q ss_conf             98774899789872-9989999997404634
Q gi|254780479|r   63 KFMQKWPTIFCLSS-AKDEEILSAWAGLGYY   92 (356)
Q Consensus        63 ~~~~~~P~~~~la~-a~~~~vl~~w~gLGyy   92 (356)
                      +-+.+|=+.++-.+ =+.+|.|++|.|| ||
T Consensus        29 r~Lrkyi~ak~~k~~F~~~dflklWKGL-fY   58 (378)
T KOG3911          29 RKLRKYISAKTQKEGFDQDDFLKLWKGL-FY   58 (378)
T ss_pred             HHHHHHHHHHHHCCCCCHHHHHHHHHHH-HH
T ss_conf             9999999987500478789999998766-88


No 165
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=59.20  E-value=14  Score=17.54  Aligned_cols=74  Identities=19%  Similarity=0.137  Sum_probs=49.8

Q ss_pred             HHHHHHHHHHHHHHHHHH------CC-----CEEEECCCCHHHHHHHH-HCCC----CCCCHHHHHHHHHHHCCCCCCCH
Q ss_conf             888877787766776643------49-----63131243146665765-1975----45735689998741025785226
Q gi|254780479|r  121 KKLPGIGDYTASAIVAIA------FN-----HFAVVVDTNIERIISRY-FDII----KPAPLYHKTIKNYARKITSTSRP  184 (356)
Q Consensus       121 ~~LpGiG~yta~ai~s~a------~~-----~~~~~vD~Nv~RVl~R~-~~~~----~~~~~~~k~l~~~~~~~~~~~~~  184 (356)
                      -..|-||+-+|.-=.++.      |+     .--...|||-.|+.+|= +|+.    ++-...++.+.++++.-..+.+.
T Consensus       124 t~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEALglAa~kagd~a~A~~~F~qia~Da~aprni  203 (221)
T COG4649         124 TSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREALGLAAYKAGDFAKAKSWFVQIANDAQAPRNI  203 (221)
T ss_pred             CCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHH
T ss_conf             78760136799998789874356488899874130489970178899987688873224677999999997014696779


Q ss_pred             HHHHHHHHHH
Q ss_conf             7899999999
Q gi|254780479|r  185 GDFVQAMMDL  194 (356)
Q Consensus       185 ~~~nQAlMdl  194 (356)
                      +.=.|.|||+
T Consensus       204 rqRAq~mldl  213 (221)
T COG4649         204 RQRAQIMLDL  213 (221)
T ss_pred             HHHHHHHHHH
T ss_conf             9999999999


No 166
>PRK09482 xni exonuclease IX; Provisional
Probab=58.37  E-value=3.3  Score=22.01  Aligned_cols=12  Identities=33%  Similarity=0.675  Sum_probs=5.7

Q ss_pred             HHHHHHHHHHHH
Q ss_conf             877787766776
Q gi|254780479|r  124 PGIGDYTASAIV  135 (356)
Q Consensus       124 pGiG~yta~ai~  135 (356)
                      ||||+.||..++
T Consensus       188 ~GIG~KtA~~LL  199 (256)
T PRK09482        188 AGIGPKSAAELL  199 (256)
T ss_pred             CCCCHHHHHHHH
T ss_conf             985888999999


No 167
>COG1255 Uncharacterized protein conserved in archaea [Function unknown]
Probab=58.09  E-value=11  Score=18.29  Aligned_cols=99  Identities=17%  Similarity=0.143  Sum_probs=58.9

Q ss_pred             HHHHH-HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCC
Q ss_conf             67543-00578687314568888877787766776643496313124314666576519754573568999874102578
Q gi|254780479|r  102 ADIIV-KKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITS  180 (356)
Q Consensus       102 a~~i~-~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~  180 (356)
                      |..|. +...|++=+        =|||.|+.-|=..--.|-...++|-|-.++=.-+-....+...-.-.+..-++.+-.
T Consensus         5 a~~iAre~~~gkVvE--------VGiG~~~~VA~~L~e~g~dv~atDI~~~~a~~g~~~v~DDitnP~~~iY~~A~lIYS   76 (129)
T COG1255           5 AEYIARENARGKVVE--------VGIGFFLDVAKRLAERGFDVLATDINEKTAPEGLRFVVDDITNPNISIYEGADLIYS   76 (129)
T ss_pred             HHHHHHHHCCCCEEE--------ECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCHHHHHCCCCEEE
T ss_conf             999998705883799--------814117999999997488689985230137655338872477964777627001266


Q ss_pred             CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC
Q ss_conf             52267899999999863204123564311000
Q gi|254780479|r  181 TSRPGDFVQAMMDLGALICTSNKPLCPLCPIQ  212 (356)
Q Consensus       181 ~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~  212 (356)
                      -.-|.+..-++||+...|-    -.|-.-||-
T Consensus        77 iRpppEl~~~ildva~aVg----a~l~I~pL~  104 (129)
T COG1255          77 IRPPPELQSAILDVAKAVG----APLYIKPLT  104 (129)
T ss_pred             CCCCHHHHHHHHHHHHHHC----CCEEEEECC
T ss_conf             2798899899999998609----978997168


No 168
>smart00475 53EXOc 5'-3' exonuclease.
Probab=57.92  E-value=5.5  Score=20.39  Aligned_cols=33  Identities=36%  Similarity=0.685  Sum_probs=21.6

Q ss_pred             HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH
Q ss_conf             754300578687314568888-----------8777877667766
Q gi|254780479|r  103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA  136 (356)
Q Consensus       103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s  136 (356)
                      ..+.++| |.-|+.+-+++.|           ||||+.||..++.
T Consensus       161 ~~v~e~~-gv~P~q~~d~~aL~GD~sDnIpGV~GIG~KtA~kLL~  204 (259)
T smart00475      161 ENVIEKY-GLTPEQIIDYKALMGDSSDNIPGVPGIGEKTAAKLLK  204 (259)
T ss_pred             HHHHHHH-CCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH
T ss_conf             9999884-8998998637640464535899999847899999999


No 169
>COG5071 RPN5 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=56.91  E-value=9.4  Score=18.78  Aligned_cols=23  Identities=17%  Similarity=0.202  Sum_probs=16.7

Q ss_pred             CHHHHHHHHHCCCHHHHHCCCHH
Q ss_conf             26589987748997898729989
Q gi|254780479|r   58 EPYFKKFMQKWPTIFCLSSAKDE   80 (356)
Q Consensus        58 ~~~~~~~~~~~P~~~~la~a~~~   80 (356)
                      ..|-+.|-+-||.+.+||.-+.+
T Consensus         7 ~dys~~~~e~~~~~~~l~~~d~~   29 (439)
T COG5071           7 VDYSEKFAELQKSLNNLNTIDID   29 (439)
T ss_pred             CCHHHHHHHHHHHHCCHHHCCHH
T ss_conf             56899999973421303223415


No 170
>PRK13482 DNA integrity scanning protein DisA; Provisional
Probab=56.39  E-value=16  Score=17.19  Aligned_cols=23  Identities=17%  Similarity=0.288  Sum_probs=10.5

Q ss_pred             HHCCCEEEECCCCH----HHHHHHHHC
Q ss_conf             43496313124314----666576519
Q gi|254780479|r  137 IAFNHFAVVVDTNI----ERIISRYFD  159 (356)
Q Consensus       137 ~a~~~~~~~vD~Nv----~RVl~R~~~  159 (356)
                      ++|+.....+|..|    .|+|+|+-.
T Consensus       268 LGy~~~~~~lD~~v~pRGYR~L~kipr  294 (352)
T PRK13482        268 LGYPGGSEALDTAVSPRGYRILAKIPR  294 (352)
T ss_pred             HCCCCCCCCCCCCCCCCHHHHHCCCCC
T ss_conf             589987653346778725888605999


No 171
>COG1491 Predicted RNA-binding protein [Translation, ribosomal structure and biogenesis]
Probab=54.37  E-value=5.9  Score=20.21  Aligned_cols=84  Identities=29%  Similarity=0.318  Sum_probs=44.2

Q ss_pred             CCCHHHHHHHHHCCCHH---HHHH-HHHHHHHHHHHCCCCC--------CCCH--HHHHHHHHHHHHHHHHHHHHHHCCC
Q ss_conf             29989999997404634---5889-9987767543005786--------8731--4568888877787766776643496
Q gi|254780479|r   76 SAKDEEILSAWAGLGYY---TRAR-NLKKCADIIVKKYEGN--------FPHK--VEILKKLPGIGDYTASAIVAIAFNH  141 (356)
Q Consensus        76 ~a~~~~vl~~w~gLGyy---~Rar-~l~~~a~~i~~~~~g~--------~P~~--~~~l~~LpGiG~yta~ai~s~a~~~  141 (356)
                      ....+.|.+.=+++.|-   .-|| +|-..-..|+.+-.++        =|-+  ...|..|||||..|.-+|+-=--.+
T Consensus        74 ~g~rdki~~v~rri~Y~DLT~~Ar~eLpyvve~iV~~~E~rFV~FFN~A~PIt~RLH~LELLpGiGkK~m~~IleERkkk  153 (202)
T COG1491          74 PGKRDKIDKVRRRIRYEDLTDTARSELPYVVEKIVKENEDRFVKFFNEAEPITLRLHQLELLPGIGKKTMWAILEERKKK  153 (202)
T ss_pred             CCCCCHHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCC
T ss_conf             88753079998622285456777755189999999861567888733567633878888753120499999999987428


Q ss_pred             EEEECCCCHHHHHHHHHCCCCC
Q ss_conf             3131243146665765197545
Q gi|254780479|r  142 FAVVVDTNIERIISRYFDIIKP  163 (356)
Q Consensus       142 ~~~~vD~Nv~RVl~R~~~~~~~  163 (356)
                      |.    ....-|-.|+-++..|
T Consensus       154 pF----eSFeDi~~Rv~~~~~p  171 (202)
T COG1491         154 PF----ESFEDIKERVKGLHDP  171 (202)
T ss_pred             CC----CCHHHHHHHHCCCCCH
T ss_conf             88----6689999880567778


No 172
>pfam09674 DUF2400 Protein of unknown function (DUF2400). Members of this uncharacterized protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation.
Probab=50.77  E-value=11  Score=18.40  Aligned_cols=19  Identities=21%  Similarity=0.508  Sum_probs=10.3

Q ss_pred             EEECCCCHHHHHHHHHCCCC
Q ss_conf             13124314666576519754
Q gi|254780479|r  143 AVVVDTNIERIISRYFDIIK  162 (356)
Q Consensus       143 ~~~vD~Nv~RVl~R~~~~~~  162 (356)
                      ..++|+||.||--++ |+-.
T Consensus       174 ~iPLDtH~~rvar~L-gL~~  192 (230)
T pfam09674       174 IIPLDTHTHRVARKL-GLLK  192 (230)
T ss_pred             EEECCHHHHHHHHHH-CCCC
T ss_conf             320207089999992-8822


No 173
>pfam05597 Phasin Poly(hydroxyalcanoate) granule associated protein (phasin). Polyhydroxyalkanoates (PHAs) are storage polyesters synthesized by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.
Probab=50.60  E-value=19  Score=16.56  Aligned_cols=82  Identities=20%  Similarity=0.387  Sum_probs=39.0

Q ss_pred             HHCCCHHHHHHHH-HCCCHHHHHHH--------HHHHHHHHHHCCCCCCCCHHHHHH-----HHHHHHHHHHHHHHHHHC
Q ss_conf             8729989999997-40463458899--------987767543005786873145688-----888777877667766434
Q gi|254780479|r   74 LSSAKDEEILSAW-AGLGYYTRARN--------LKKCADIIVKKYEGNFPHKVEILK-----KLPGIGDYTASAIVAIAF  139 (356)
Q Consensus        74 la~a~~~~vl~~w-~gLGyy~Rar~--------l~~~a~~i~~~~~g~~P~~~~~l~-----~LpGiG~yta~ai~s~a~  139 (356)
                      ++.-=.+-.-+.| +|||=|+|+..        |-+-...+-.+-...+-+..+++.     ..-.|+.-+     .-.+
T Consensus        11 ~~~~v~~~arqIWLAGLGA~aka~eEg~k~Fe~LVk~Ge~~e~~~~~~~ee~~~~~~~~~~~~~~~v~~~a-----~~~w   85 (132)
T pfam05597        11 LSGEVKDSARQIWLAGLGAFAKAQEEGGKVFEALVKEGEELEKRTRKVAEEQVEAVRESVKSRVSEVKDKA-----EGQW   85 (132)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHH
T ss_conf             87889999999999874599999999999999999978999999999999999999998888677799888-----7658


Q ss_pred             CCEEEECCCCHHHHHHHHHCCC
Q ss_conf             9631312431466657651975
Q gi|254780479|r  140 NHFAVVVDTNIERIISRYFDII  161 (356)
Q Consensus       140 ~~~~~~vD~Nv~RVl~R~~~~~  161 (356)
                      ++-.-++|.-|.++|.|+ |+.
T Consensus        86 dklE~~fe~rV~~aL~rL-Gip  106 (132)
T pfam05597        86 DKLEQAFDERVAKALNRL-GVP  106 (132)
T ss_pred             HHHHHHHHHHHHHHHHHC-CCC
T ss_conf             889999999999999856-999


No 174
>KOG4368 consensus
Probab=49.97  E-value=20  Score=16.50  Aligned_cols=39  Identities=18%  Similarity=0.215  Sum_probs=25.2

Q ss_pred             HHCCCHHHHHCCCHHHHHHHHHCCCHHH--HHHHHHHHHHH
Q ss_conf             7489978987299899999974046345--88999877675
Q gi|254780479|r   66 QKWPTIFCLSSAKDEEILSAWAGLGYYT--RARNLKKCADI  104 (356)
Q Consensus        66 ~~~P~~~~la~a~~~~vl~~w~gLGyy~--Rar~l~~~a~~  104 (356)
                      ..|-.++.+.+--+..+|.+|+|-||+.  -...|...|.+
T Consensus       196 ~~~~~~~e~~~~~~~~ll~~we~~~yf~ds~~~ql~~~~~~  236 (757)
T KOG4368         196 TSFLAVEEDKQQKIARLLQLWEKNGYFDDSIIQQLQSPALG  236 (757)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
T ss_conf             76641677777799999998760573138999987414654


No 175
>pfam04919 DUF655 Protein of unknown function, DUF655. This family includes several uncharacterized archaeal proteins.
Probab=49.49  E-value=7.6  Score=19.40  Aligned_cols=54  Identities=26%  Similarity=0.273  Sum_probs=30.0

Q ss_pred             HCCCCCCCCH--HHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCC
Q ss_conf             0057868731--456888887778776677664349631312431466657651975457
Q gi|254780479|r  107 KKYEGNFPHK--VEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA  164 (356)
Q Consensus       107 ~~~~g~~P~~--~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~  164 (356)
                      +=||-.-|-+  .-.|.-|||||..++-.|+----..|--    ...-+=.|+-++..|.
T Consensus       103 ~FfN~A~pit~rlH~leLLPGIGkK~~~~ileeR~~~~Fe----SFedi~~Rv~~l~dp~  158 (181)
T pfam04919       103 KFFNEAEPITTRLHQLELLPGIGKKMMWAILEERKKKPFE----SFEDIKERVKGLHDPV  158 (181)
T ss_pred             HHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCC----CHHHHHHHHCCCCCHH
T ss_conf             7651368744878888753340389999999996559988----9999998835789999


No 176
>PRK04301 radA DNA repair and recombination protein RadA; Validated
Probab=49.39  E-value=7.5  Score=19.47  Aligned_cols=40  Identities=18%  Similarity=0.257  Sum_probs=30.2

Q ss_pred             HHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
Q ss_conf             748997898729989999997404634588999877675430
Q gi|254780479|r   66 QKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVK  107 (356)
Q Consensus        66 ~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~  107 (356)
                      .-|-|++++|.++..|+.+. .|+. +..|..+.++|+.++.
T Consensus        26 aG~~tv~~l~~~~~~~L~~~-~gis-~~~a~ki~~~a~~~~~   65 (318)
T PRK04301         26 AGYDTVEAIAVASPKELSEI-AGIS-ESTAAKIIEAAREALD   65 (318)
T ss_pred             CCCCCHHHHHCCCHHHHHHH-HCCC-HHHHHHHHHHHHHHCC
T ss_conf             69954999874899999985-0999-9999999999998536


No 177
>pfam02889 Sec63 Sec63 Brl domain. This domain (also known as the Brl domain) is required for assembly of functional endoplasmic reticulum translocons.
Probab=49.12  E-value=20  Score=16.41  Aligned_cols=48  Identities=17%  Similarity=0.182  Sum_probs=37.8

Q ss_pred             HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9740463458899987767543005786873145688888777877667766
Q gi|254780479|r   85 AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVA  136 (356)
Q Consensus        85 ~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s  136 (356)
                      ....-|+.+-|.+....+|.|+..   .+ .+..-|+.||++|+..+.++..
T Consensus       119 i~~~~~~~~~~~~~l~l~q~i~q~---~w-~~~~pL~Qlp~~~~~~i~~l~~  166 (309)
T pfam02889       119 IALSKGWLSPALNALELSKMIEQR---LW-PSDSPLRQLPHIPPEVIKKLEK  166 (309)
T ss_pred             HHHHCCCHHHHHHHHHHHHHHHHC---CC-CCCCHHHCCCCCCHHHHHHHHH
T ss_conf             998689799999999999999833---59-9998545179999999999997


No 178
>TIGR03029 EpsG chain length determinant protein tyrosine kinase EpsG. The proteins in this family are homologs of the EpsG protein found in Methylobacillus strain 12S and are generally found in operons with other Eps homologs. The protein is believed to function as the protein tyrosine kinase component of the chain length regulator (along with the transmembrane component EpsF).
Probab=47.82  E-value=13  Score=17.77  Aligned_cols=41  Identities=24%  Similarity=0.388  Sum_probs=33.9

Q ss_pred             HHHHHHHHHHHHHHHHC---CCEEEECCCCHHH-HHHHHHCCCCC
Q ss_conf             88777877667766434---9631312431466-65765197545
Q gi|254780479|r  123 LPGIGDYTASAIVAIAF---NHFAVVVDTNIER-IISRYFDIIKP  163 (356)
Q Consensus       123 LpGiG~yta~ai~s~a~---~~~~~~vD~Nv~R-Vl~R~~~~~~~  163 (356)
                      -||=|..|-+|=++++|   |+++..||+|.+| .+.++|+++..
T Consensus       112 ~pGEGKS~vAaNLA~~~Aq~G~RvLLVDaDLRrP~lh~~f~l~~~  156 (274)
T TIGR03029       112 KSGEGCSYIAANLAIVFSQLGEKTLLIDANLRDPVQHRNFKLSEQ  156 (274)
T ss_pred             CCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHCCCCC
T ss_conf             999989999999999999679919999588888447797599997


No 179
>PRK05812 secD preprotein translocase subunit SecD; Reviewed
Probab=47.72  E-value=7.1  Score=19.61  Aligned_cols=20  Identities=20%  Similarity=0.421  Sum_probs=9.7

Q ss_pred             ECCCCHHHHHHHH--HCCCCCC
Q ss_conf             1243146665765--1975457
Q gi|254780479|r  145 VVDTNIERIISRY--FDIIKPA  164 (356)
Q Consensus       145 ~vD~Nv~RVl~R~--~~~~~~~  164 (356)
                      +++-++.=+=.|+  +|+.+|.
T Consensus       144 av~qaieiIR~RVd~lGv~EP~  165 (513)
T PRK05812        144 ALEQAITIIRNRVNELGVAEPV  165 (513)
T ss_pred             HHHHHHHHHHHHHHHHCCCCCH
T ss_conf             9999999999987754777674


No 180
>COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only]
Probab=47.53  E-value=6.3  Score=19.98  Aligned_cols=45  Identities=27%  Similarity=0.414  Sum_probs=23.0

Q ss_pred             CCCHHHHHHHHHCCCHHHH-------------------HHHHHHHHHHHHHC--CCCC-----CCCHHHHHHH
Q ss_conf             2998999999740463458-------------------89998776754300--5786-----8731456888
Q gi|254780479|r   76 SAKDEEILSAWAGLGYYTR-------------------ARNLKKCADIIVKK--YEGN-----FPHKVEILKK  122 (356)
Q Consensus        76 ~a~~~~vl~~w~gLGyy~R-------------------ar~l~~~a~~i~~~--~~g~-----~P~~~~~l~~  122 (356)
                      .-+.+||..+  -|-||+|                   ...|.+.|+++.-+  |+|-     +|..-.+|++
T Consensus        83 ~ftp~Eiv~l--tlnfYrRnYIeGLFLSSGvi~~~DyTmE~mi~var~LRle~~f~GYIHlK~IPgas~~li~  153 (404)
T COG4277          83 RFTPEEIVDL--TLNFYRRNYIEGLFLSSGVIKNPDYTMEEMIEVARILRLEHKFRGYIHLKIIPGASPDLIK  153 (404)
T ss_pred             CCCHHHHHHH--HHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHH
T ss_conf             0589999999--9989887424330002463368614799999999988320455757998756999989999


No 181
>pfam11264 ThylakoidFormat Thylakoid formation protein. THF1 is localized to the outer plastid membrane and the stroma. THF1 has a role in sugar signalling. THF1 is also thought to have a role in chloroplast and leaf development. THF1 has been shown to play a crucial role in vesicle-mediated thylakoid membrane biogenesis.
Probab=47.11  E-value=22  Score=16.20  Aligned_cols=119  Identities=18%  Similarity=0.235  Sum_probs=64.8

Q ss_pred             HHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------C-------HHHCCHHHHHHHHHCCCHHHHHCCCH
Q ss_conf             9999962899887548554445678704676644673116------1-------20332658998774899789872998
Q gi|254780479|r   13 LDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------T-------VKTVEPYFKKFMQKWPTIFCLSSAKD   79 (356)
Q Consensus        13 l~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~-------v~~v~~~~~~~~~~~P~~~~la~a~~   79 (356)
                      -..|..+.|..|             .-|+-.|-|.|-++-      .       .--|..-|.+||+-||.-++..    
T Consensus         7 r~F~~~~p~pI~-------------~lYrrvvdELLVElHLL~~n~~F~yD~lFAlGlvt~Fd~fm~GY~Pee~~~----   69 (216)
T pfam11264         7 RAFHAAYPRPIP-------------SLYRRVVDELLVELHLLSHQSDFKYDPLFALGLVTVFDRFMEGYRPEEHKD----   69 (216)
T ss_pred             HHHHHHCCCCCC-------------HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHH----
T ss_conf             999985898882-------------899999999999999988603654273788459999999976799856799----


Q ss_pred             HHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH-HHHHH-HHHHHHHHHCCCEEEECCCCHHHHHHH
Q ss_conf             99999-9740463458899987767543005786873145688888-77787-766776643496313124314666576
Q gi|254780479|r   80 EEILS-AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLP-GIGDY-TASAIVAIAFNHFAVVVDTNIERIISR  156 (356)
Q Consensus        80 ~~vl~-~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~Lp-GiG~y-ta~ai~s~a~~~~~~~vD~Nv~RVl~R  156 (356)
                       .+.. +.+.+|+  -+..+++-|+.+.+...|.=+.+..++++-. |=|+- .++.+.+++-+         -.=--||
T Consensus        70 -~IF~Alc~s~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~~~~~~~~~~l~~~l~~~a~~---------~~f~YSR  137 (216)
T pfam11264        70 -AIFNALCSALGF--DPEQLRKDAEQLEEAAKSHSLEEIVSWLSQKGGGAPNPLKDILQRIAGN---------SKFHYSR  137 (216)
T ss_pred             -HHHHHHHHHCCC--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCC---------CCCHHHH
T ss_conf             -999999985489--9999999999999998759999999999736667531689999987516---------8741899


Q ss_pred             HHCC
Q ss_conf             5197
Q gi|254780479|r  157 YFDI  160 (356)
Q Consensus       157 ~~~~  160 (356)
                      +|+|
T Consensus       138 l~AI  141 (216)
T pfam11264       138 LFAI  141 (216)
T ss_pred             HHHH
T ss_conf             9999


No 182
>TIGR02153 gatD_arch glutamyl-tRNA(Gln) amidotransferase, subunit D; InterPro: IPR011878    This peptide is found only in the Archaea. It is part of a heterodimer, with GatD (IPR004414 from INTERPRO), that acts as an amidotransferase on misacylated Glu-tRNA(Gln) to produce Gln-tRNA(Gln). The analogous amidotransferase found in bacteria is the GatABC system, although GatABC homologs in the Archaea appear to act instead on Asp-tRNA(Asn) .; GO: 0006450 regulation of translational fidelity.
Probab=47.10  E-value=7.6  Score=19.40  Aligned_cols=91  Identities=22%  Similarity=0.358  Sum_probs=52.1

Q ss_pred             HCCHHH--HHHHHHCCCHHHHHCCCHHHHHH---------HHHCCCHHHH-HHHHHHHHHHHHHCCCCCCCCHHHHHHHH
Q ss_conf             332658--99877489978987299899999---------9740463458-89998776754300578687314568888
Q gi|254780479|r   56 TVEPYF--KKFMQKWPTIFCLSSAKDEEILS---------AWAGLGYYTR-ARNLKKCADIIVKKYEGNFPHKVEILKKL  123 (356)
Q Consensus        56 ~v~~~~--~~~~~~~P~~~~la~a~~~~vl~---------~w~gLGyy~R-ar~l~~~a~~i~~~~~g~~P~~~~~l~~L  123 (356)
                      .|-|+|  +..+..-|-+.+.|+=+-+.|+.         .|.-++  .+ |+-|.+.|.=||--||-.       =+  
T Consensus        87 AV~P~ftAeel~~~~PEL~~iAni~a~~~~niLSENMkP~~W~~IA--~~V~~al~~G~~GVVvaHGTD-------TM--  155 (413)
T TIGR02153        87 AVKPAFTAEELARAVPELLEIANIKARAVFNILSENMKPEYWIKIA--EAVAKALEEGADGVVVAHGTD-------TM--  155 (413)
T ss_pred             CCCCCCCHHHHHHHCCCHHEECCCCEEEEEHHCCCCCCHHHHHHHH--HHHHHHHCCCCCEEEEEECCC-------HH--
T ss_conf             6004539899986174002012431056000107788858999999--999998408995389941587-------46--


Q ss_pred             HHHHHHHHHHHHHHHCCC--EEEECCCCHHHHHHHHHCCCCCCCHHH
Q ss_conf             877787766776643496--313124314666576519754573568
Q gi|254780479|r  124 PGIGDYTASAIVAIAFNH--FAVVVDTNIERIISRYFDIIKPAPLYH  168 (356)
Q Consensus       124 pGiG~yta~ai~s~a~~~--~~~~vD~Nv~RVl~R~~~~~~~~~~~~  168 (356)
                          .|||+|. ||+|..  |.|+|=.=-.|      --|.|++.+.
T Consensus       156 ----~YTAaAL-SFm~~~Gl~~PvVlVGAQR------SSDRPSSDAa  191 (413)
T TIGR02153       156 ----AYTAAAL-SFMFKTGLPVPVVLVGAQR------SSDRPSSDAA  191 (413)
T ss_pred             ----HHHHHHH-HHHHHCCCCCCEEEECCCC------CCCCCCCHHH
T ss_conf             ----8999999-9999716999889985743------5888731379


No 183
>pfam05997 Nop52 Nucleolar protein,Nop52. Nop52 believed to be involved in the generation of 28S rRNA.
Probab=46.97  E-value=16  Score=17.07  Aligned_cols=50  Identities=24%  Similarity=0.474  Sum_probs=27.5

Q ss_pred             CCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHH-----HHHHHHHHHHHHHHC
Q ss_conf             612033265899877489978987299899999974046345-----889998776754300
Q gi|254780479|r   52 TTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYT-----RARNLKKCADIIVKK  108 (356)
Q Consensus        52 T~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~-----Rar~l~~~a~~i~~~  108 (356)
                      ||-+++. -..+|+..-.+     ..++.|.+++|.|| ||.     +...=++.|..|.+-
T Consensus        16 ~Rd~al~-~L~~~L~~~~~-----~~~~~d~~KlWKGL-fy~mW~sDkpl~Q~~La~~la~L   70 (212)
T pfam05997        16 TRDKALR-SLRKYLSKRSN-----LLDELDLLKLWKGL-FYCMWMSDKPLVQQELADKIAQL   70 (212)
T ss_pred             HHHHHHH-HHHHHHHHCCC-----CCCHHHHHHHHHHH-HHHHHHCCCHHHHHHHHHHHHHH
T ss_conf             7999999-99999987678-----89999999999999-99999728869999999999999


No 184
>COG1561 Uncharacterized stress-induced protein [Function unknown]
Probab=46.97  E-value=13  Score=17.86  Aligned_cols=35  Identities=14%  Similarity=0.210  Sum_probs=29.8

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH
Q ss_conf             88999877675430057868731456888887778
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGD  128 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~  128 (356)
                      -++.+..+++.|...|+-..|-+.++|+++|||=.
T Consensus        84 ll~~~~~~~~~i~~~~~~~~~~~~~~lL~~~gVl~  118 (290)
T COG1561          84 LLKQLVTLLKELAAQFDLAAELSLDDLLRLPGVLA  118 (290)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCEEE
T ss_conf             99999999999998708667899889856766030


No 185
>KOG4548 consensus
Probab=46.45  E-value=22  Score=16.13  Aligned_cols=90  Identities=14%  Similarity=0.052  Sum_probs=40.6

Q ss_pred             CCEEEEEECCCCHHHHCCCCCCCCCC-CCCCCHHHHHHCCCCCCCCEE-----CC--CEE---EEEEEE-------EEEE
Q ss_conf             76057764572013204744211113-467215888630101000021-----37--358---998315-------9999
Q gi|254780479|r  249 DNRILLRKRTNTRLLEGMDELPGSAW-SSTKDGNIDTHSAPFTANWIL-----CN--TIT---HTFTHF-------TLTL  310 (356)
Q Consensus       249 ~~kiLL~KRp~~gll~GLwEFP~~e~-~~~~~~~~~~~~~~~~~~~~~-----l~--~ik---H~fTH~-------~L~i  310 (356)
                      +.-+||.+|+=+ = .+.|.||...+ ++++......+..........     .+  .+.   |.+..-       .-.+
T Consensus       138 r~LyLLV~~k~g-~-~s~w~fP~~~~s~~~~~lr~~ae~~Lk~~~ge~~~t~fvgnaP~g~~~~q~pr~~~~e~~~~sk~  215 (263)
T KOG4548         138 RKLYLLVKRKFG-K-SSVWIFPNRQFSSSEKTLRGHAERDLKVLSGENKSTWFVGNAPFGHTPLQSPREMTTEEPVSSKV  215 (263)
T ss_pred             CEEEEEEEECCC-C-CCEEECCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHEECCCCCCCCCCCCCCCCCCCCCCCCEE
T ss_conf             017887763267-6-55205778654776522888999999987233023310205745656666841234212445315


Q ss_pred             EEEEEEC-----CCCCCCCCCEEECHHHHHHCCCC
Q ss_conf             9999980-----77676678133227888747998
Q gi|254780479|r  311 FVWKTIV-----PQIVIIPDSTWHDAQNLANAALP  340 (356)
Q Consensus       311 ~v~~~~~-----~~~~~~~~~~Wv~~~el~~~~LP  340 (356)
                      .++.+.+     .+..+..++.|++.+++.++--+
T Consensus       216 ff~k~~lv~~~~~kn~n~edfvWvTkdel~e~l~~  250 (263)
T KOG4548         216 FFFKASLVANSNQKNQNKEDFVWVTKDELGEKLPK  250 (263)
T ss_pred             EEEEEEECCCCCHHCCCCCCEEEECHHHHHHHCCH
T ss_conf             77653201564122045341488655887535456


No 186
>pfam04669 DUF579 Protein of unknown function (DUF579). This is a eukaryotic family of uncharacterized proteins.
Probab=46.38  E-value=22  Score=16.12  Aligned_cols=98  Identities=22%  Similarity=0.338  Sum_probs=53.8

Q ss_pred             HHHHHHHHCCCHH------HHHCCCHHHHHHHHHC--------CCHHHHHH--------HHHHHH--HHHHHCCCCCCCC
Q ss_conf             5899877489978------9872998999999740--------46345889--------998776--7543005786873
Q gi|254780479|r   60 YFKKFMQKWPTIF------CLSSAKDEEILSAWAG--------LGYYTRAR--------NLKKCA--DIIVKKYEGNFPH  115 (356)
Q Consensus        60 ~~~~~~~~~P~~~------~la~a~~~~vl~~w~g--------LGyy~Rar--------~l~~~a--~~i~~~~~g~~P~  115 (356)
                      +|..|-++||+++      +-...+.++++..+.+        |. +|.++        -+++..  .++++.+.|.+|+
T Consensus        49 ~~~~~~~~~P~l~~y~v~Y~t~~~~a~~L~~~~~~~~c~~~~~~~-~s~c~Lal~~lP~~vy~~~WDvi~VD~P~G~~~~  127 (187)
T pfam04669        49 IYEKFRKDFPELEAYHVVYRTKVSDADELKSAAKDPECRPVQNLL-FSDCKLALNDLPNEVYKEKWDRIFVDAPEGYVED  127 (187)
T ss_pred             HHHHHHHHCCCCEEEEEEEECHHCCHHHHHHHHCCCCCCCCCCCC-CCCCCHHHCCCCHHHHCCCCCEEEEECCCCCCCC
T ss_conf             999887409875378988750230489998752277676665545-5756231125988995570668998089998989


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH----HCCCEEEE--CCCCHHHHHHHHHCCCCC
Q ss_conf             1456888887778776677664----34963131--243146665765197545
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIVAI----AFNHFAVV--VDTNIERIISRYFDIIKP  163 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~s~----a~~~~~~~--vD~Nv~RVl~R~~~~~~~  163 (356)
                      ...-.-++     ||||.++=-    ++...+.+  ||--|+|.+|+-|--++.
T Consensus       128 ~PGRM~aI-----ytAg~laR~r~~~~~t~~VfVHdv~R~vE~~~s~eFLC~e~  176 (187)
T pfam04669       128 APGRMGAI-----YTAGTLARSRASGEYTTTVFVHDVDRRVEDVFSKEFLCIEI  176 (187)
T ss_pred             CCCHHHHH-----HHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHCCHHH
T ss_conf             98504889-----99999975337899875089836884899998898743477


No 187
>pfam04904 NCD1 NAB conserved region 1 (NCD1). Nab1 and Nab2 are co-repressors that specifically interact with and repress transcription mediated by the three members of the NGFI-A (Egr-1, Krox24, zif/268) family of transcription factors. This region consists of the N-terminal NAB conserved region 1, which interacts with the EGR1 inhibitory domain (R1). It may also mediate multimerisation.
Probab=45.12  E-value=23  Score=15.99  Aligned_cols=56  Identities=23%  Similarity=0.319  Sum_probs=42.8

Q ss_pred             CHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHH---HHHHHHHHHHHHHHC
Q ss_conf             1203326589987748-9978987299899999974046345---889998776754300
Q gi|254780479|r   53 TVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYT---RARNLKKCADIIVKK  108 (356)
Q Consensus        53 ~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~---Rar~l~~~a~~i~~~  108 (356)
                      |-+..+.||.-|++.= -+++.|.+|.+||.+...+=.|--+   -.|+|+|+-+.-+.+
T Consensus        17 qrAnLL~Yyd~fi~~GGDDvqQlc~a~eeEFleim~lVGM~sKPLHVrRlqKaL~ew~~~   76 (82)
T pfam04904        17 QRANLLQYYDTFIAQGGDDVQQLCEAGEEEFLEIMALVGMASKPLHVRRLQKALQEWVTN   76 (82)
T ss_pred             HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC
T ss_conf             980079999999997676799999973899999999966577652899999999999638


No 188
>KOG0135 consensus
Probab=44.79  E-value=24  Score=15.96  Aligned_cols=99  Identities=15%  Similarity=0.173  Sum_probs=37.7

Q ss_pred             HHHHHHHCCCHHHHHCC--CHHHHHHHHHCC-CHHH-HHHHHHHHHHHHHHCCCCCCCCHHHHHHHH---HH---HHHHH
Q ss_conf             89987748997898729--989999997404-6345-889998776754300578687314568888---87---77877
Q gi|254780479|r   61 FKKFMQKWPTIFCLSSA--KDEEILSAWAGL-GYYT-RARNLKKCADIIVKKYEGNFPHKVEILKKL---PG---IGDYT  130 (356)
Q Consensus        61 ~~~~~~~~P~~~~la~a--~~~~vl~~w~gL-Gyy~-Rar~l~~~a~~i~~~~~g~~P~~~~~l~~L---pG---iG~yt  130 (356)
                      +..|+..+|+.-+..-+  ...+.+..|+=+ -|.. |.+.|.++.........+.. .+++..-.+   -+   -=.||
T Consensus       477 ~~~~ln~~~s~~~~~~~s~~~~d~l~~~~f~~~~~~l~~~lLl~t~a~~~~~l~~g~-~~fea~n~~q~h~~~~La~a~~  555 (661)
T KOG0135         477 FVGFLNEYPSSPDPVLASQLVSDTLRSSQFQLCYFWLRCHLLLETVASRERQLQSGL-SSFEARNKCQVHQLRELALAFT  555 (661)
T ss_pred             HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHHHHHH
T ss_conf             777776431467743222222350107999889999999999999999999984255-2089999998850058899999


Q ss_pred             HHHHHHHHCCCEEEECCCCHHHHHHHHHCC
Q ss_conf             667766434963131243146665765197
Q gi|254780479|r  131 ASAIVAIAFNHFAVVVDTNIERIISRYFDI  160 (356)
Q Consensus       131 a~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~  160 (356)
                      --+|++--.+.-.-++|+-|+-||.++.+.
T Consensus       556 E~~il~~~l~h~~k~~~~s~k~vL~l~~dl  585 (661)
T KOG0135         556 EHTILQRFLDHVHKLPDPSVKAVLGLVRDL  585 (661)
T ss_pred             HHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
T ss_conf             999999999998628983089999998879


No 189
>COG0272 Lig NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair]
Probab=44.52  E-value=17  Score=16.97  Aligned_cols=16  Identities=44%  Similarity=0.586  Sum_probs=8.5

Q ss_pred             HHHHHCCCHHHHHHHH
Q ss_conf             7898729989999997
Q gi|254780479|r   71 IFCLSSAKDEEILSAW   86 (356)
Q Consensus        71 ~~~la~a~~~~vl~~w   86 (356)
                      .-||.+|=-++=+..|
T Consensus        82 MlSL~naf~~~el~~f   97 (667)
T COG0272          82 MLSLDNAFDEEELRAF   97 (667)
T ss_pred             CCCHHHCCCHHHHHHH
T ss_conf             5434422798999999


No 190
>TIGR01954 nusA_Cterm_rpt transcription termination factor NusA, C-terminal duplication; InterPro: IPR010214    NusA, or N utilisation substance protein A, is a bacterial transcription termination factor. It binds to RNA polymerase alpha subunit and promotes termination at certain RNA hairpin structures. It is named for the interaction in Escherichia coli of phage lambda antitermination protein N with the N-utilisation substance, consisting of NusA, NusB, NusE (ribosomal protein S10), and NusG . This entry represents an acidic 50-residue region found in two copies toward the C terminus of most proteobacterial NusA proteins, spaced about 26 residues apart. Analogous C-terminal extensions in some other bacterial lineages lack apparent homology but appear similarly acidic  ; GO: 0030528 transcription regulator activity.
Probab=44.51  E-value=21  Score=16.24  Aligned_cols=38  Identities=26%  Similarity=0.263  Sum_probs=30.0

Q ss_pred             HHCCCHHHHHCCCHHHHHH--HHHCCCHHHHHHHHHHHHHHH
Q ss_conf             7489978987299899999--974046345889998776754
Q gi|254780479|r   66 QKWPTIFCLSSAKDEEILS--AWAGLGYYTRARNLKKCADII  105 (356)
Q Consensus        66 ~~~P~~~~la~a~~~~vl~--~w~gLGyy~Rar~l~~~a~~i  105 (356)
                      +-|-|+++||..+.+|++.  . +||= ==+|.-|+..||..
T Consensus        12 eGf~t~EdlAy~~~~EL~~fGI-eG~~-Ee~A~~L~~~Ar~~   51 (52)
T TIGR01954        12 EGFTTVEDLAYVPVDELLSFGI-EGLD-EETAKELINRARNA   51 (52)
T ss_pred             HCCCHHHHHHCCCHHHHHHHCC-CCCC-HHHHHHHHHHHHHH
T ss_conf             3673178875045566886358-9988-78899999998752


No 191
>CHL00137 rps13 ribosomal protein S13; Validated
Probab=44.23  E-value=10  Score=18.57  Aligned_cols=28  Identities=29%  Similarity=0.328  Sum_probs=21.2

Q ss_pred             CCCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             8687314---568888877787766776643
Q gi|254780479|r  111 GNFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       111 g~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      -.+|.+.   -.|..+.|||+.+|.+|+.-+
T Consensus         7 ~~i~~~K~v~~aLt~I~GIG~~~A~~Ic~~l   37 (122)
T CHL00137          7 VDLPRNKRIEIALTYIYGIGLTSAKKILEKA   37 (122)
T ss_pred             CCCCCCCEEEEHHHHHCCCCHHHHHHHHHHC
T ss_conf             6799997731311100061899999999984


No 192
>pfam10343 DUF2419 Protein of unknown function (DUF2419). This is a family of conserved proteins found from plants to humans. The function is not known. A few members are annotated as being cobyrinic acid a,c-diamide synthetase but this could not be confirmed.
Probab=44.02  E-value=21  Score=16.25  Aligned_cols=41  Identities=20%  Similarity=0.402  Sum_probs=27.2

Q ss_pred             HHCCCHHHHHHHHHCC-----C-HHHHHHHHHHHHHHHHHCCCCCCC
Q ss_conf             8729989999997404-----6-345889998776754300578687
Q gi|254780479|r   74 LSSAKDEEILSAWAGL-----G-YYTRARNLKKCADIIVKKYEGNFP  114 (356)
Q Consensus        74 la~a~~~~vl~~w~gL-----G-yy~Rar~l~~~a~~i~~~~~g~~P  114 (356)
                      +++-+.+++-..-.|=     - .-.|.+.|+++.+.++++|+|.|-
T Consensus        69 ~~~~t~~~l~~iF~~~~~~e~Pll~ER~~~L~EvG~vL~~~f~Gs~~  115 (282)
T pfam10343        69 WAKCTLEELRHIFRSATDEEIPLLEERLRCLREAGRVLLEKFDGSFV  115 (282)
T ss_pred             HHHCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHH
T ss_conf             98679999999834799886878899999999999999987298799


No 193
>COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair]
Probab=43.84  E-value=15  Score=17.38  Aligned_cols=34  Identities=32%  Similarity=0.476  Sum_probs=19.2

Q ss_pred             HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHH
Q ss_conf             6754300578687314568888-----------877787766776
Q gi|254780479|r  102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIV  135 (356)
Q Consensus       102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~  135 (356)
                      ...+.+.|.|.-|...-++..|           ||||+.||-.++
T Consensus       171 ~~~~~e~~~g~~p~qliD~~~L~Gd~sDnipGV~GIG~ktA~~Ll  215 (310)
T COG0258         171 LEEVEEKFKGLTPEQLIDLKALVGDSSDNIPGVKGIGPKTALKLL  215 (310)
T ss_pred             HHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH
T ss_conf             999999856999899889999829653578999983899999999


No 194
>PTZ00221 cyclophilin; Provisional
Probab=43.75  E-value=13  Score=17.79  Aligned_cols=58  Identities=22%  Similarity=0.465  Sum_probs=40.9

Q ss_pred             CCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHH-----------------HHHHHHHCCCCCCCCHHHHHHHH----HHHH
Q ss_conf             99789872998999999740463458899987-----------------76754300578687314568888----8777
Q gi|254780479|r   69 PTIFCLSSAKDEEILSAWAGLGYYTRARNLKK-----------------CADIIVKKYEGNFPHKVEILKKL----PGIG  127 (356)
Q Consensus        69 P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~-----------------~a~~i~~~~~g~~P~~~~~l~~L----pGiG  127 (356)
                      |+.++.-+--|+.-||.|+.   |.|+.++.+                 +-++|.+-|.-..|++.+...+|    -|+|
T Consensus        20 ~~~~~~y~~~e~rrlkewen---yq~~hr~~ee~~~~RvFfDIsIGg~~~GRIVfELf~DvvPkTaENFRaLCTGEkG~~   96 (249)
T PTZ00221         20 PSREQVYNDIEERRLKEWEN---YQKSHRMKEEQNSCRAFLDISIGDVLAGRLVFELFEDVVPETVENFRALITGSCGID   96 (249)
T ss_pred             CCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHCCCEEEEEEEECCEECCEEEEEECCCCCHHHHHHHHHHHCCCCCCC
T ss_conf             63888875799988888887---777765567523742899987799876359999758987276888898853788778


Q ss_pred             HH
Q ss_conf             87
Q gi|254780479|r  128 DY  129 (356)
Q Consensus       128 ~y  129 (356)
                      ..
T Consensus        97 ~~   98 (249)
T PTZ00221         97 TN   98 (249)
T ss_pred             CC
T ss_conf             88


No 195
>PRK05179 rpsM 30S ribosomal protein S13; Validated
Probab=43.00  E-value=11  Score=18.15  Aligned_cols=27  Identities=33%  Similarity=0.443  Sum_probs=20.4

Q ss_pred             CCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             687314---568888877787766776643
Q gi|254780479|r  112 NFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+|.+.   -.|..+-|||+.+|.+|+.-+
T Consensus         8 dl~~~K~v~~ALt~I~GIG~~~A~~Ic~~l   37 (122)
T PRK05179          8 DIPRNKRVVIALTYIYGIGRTRAKEILAAA   37 (122)
T ss_pred             CCCCCCCHHHHHHHHCCCCHHHHHHHHHHC
T ss_conf             489997868477300275899999999982


No 196
>cd04666 Nudix_Hydrolase_9 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=42.25  E-value=26  Score=15.70  Aligned_cols=83  Identities=11%  Similarity=0.035  Sum_probs=45.0

Q ss_pred             CEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC-------EECCCEEEEEEE----EEEEEEEEEEECC
Q ss_conf             60577645720132047442111134672158886301010000-------213735899831----5999999999807
Q gi|254780479|r  250 NRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW-------ILCNTITHTFTH----FTLTLFVWKTIVP  318 (356)
Q Consensus       250 ~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~-------~~l~~ikH~fTH----~~L~i~v~~~~~~  318 (356)
                      -+|||......    |-|.||-+.++.++........+...+-.       ..++.+.|...+    ..-.+.+|...+.
T Consensus        15 ~eVLLItsr~~----~rWilPKG~~e~~es~~~aA~REa~EEAGv~g~i~~~~lG~~~y~k~~~~~~~~~~V~~f~~~V~   90 (122)
T cd04666          15 VEVLLVTSRRT----GRWIVPKGGPEKDESPAEAAAREAWEEAGVRGKIGKRPLGRFEYRKRSKNRPPRCEVAVFPLEVT   90 (122)
T ss_pred             EEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEECCCCCEEEEEEECCCCCCCCEEEEEEEEEEE
T ss_conf             89999992699----94998875367998999998888887429488886758889999982799853199999999998


Q ss_pred             CC----CCC--CCCEEECHHHHHH
Q ss_conf             76----766--7813322788874
Q gi|254780479|r  319 QI----VII--PDSTWHDAQNLAN  336 (356)
Q Consensus       319 ~~----~~~--~~~~Wv~~~el~~  336 (356)
                      ..    +..  -.-+|+++++-..
T Consensus        91 ~~~~~wpE~~~R~R~W~s~~eA~~  114 (122)
T cd04666          91 EELDEWPEMHQRKRKWFSPEEAAL  114 (122)
T ss_pred             EECCCCCCCCCEEEEECCHHHHHH
T ss_conf             511259975867999938999998


No 197
>pfam05516 consensus
Probab=42.14  E-value=22  Score=16.21  Aligned_cols=55  Identities=15%  Similarity=0.320  Sum_probs=33.1

Q ss_pred             HHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHH
Q ss_conf             658998774899789872998999999740463458899987767543005786873-145688888
Q gi|254780479|r   59 PYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPH-KVEILKKLP  124 (356)
Q Consensus        59 ~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~-~~~~l~~Lp  124 (356)
                      ..|+.|.+.||++..++.-+++++ |.=+|       +   .-=+..++.|.+++++ ++..|+.+-
T Consensus        47 eIy~~f~~~FPd~~~v~~ide~el-KS~~~-------K---ekWR~F~~~fe~~veDyNfGTllR~d  102 (136)
T pfam05516        47 EIYEHFREEFPEFKPVAVIDEDEL-KSKEG-------K---EKWREFCNKFEKKVEDYNFGTLLRLD  102 (136)
T ss_pred             HHHHHHHHHCCCCCCCCCCCHHHH-CCCHH-------H---HHHHHHHHHHHCCCCCCCCEEEEECC
T ss_conf             999999987868520130699997-08331-------3---99999999985626666640346337


No 198
>PRK13023 bifunctional preprotein translocase subunit SecD/SecF; Reviewed
Probab=42.10  E-value=11  Score=18.34  Aligned_cols=20  Identities=20%  Similarity=0.263  Sum_probs=8.9

Q ss_pred             ECCCCHHHHHHHH--HCCCCCC
Q ss_conf             1243146665765--1975457
Q gi|254780479|r  145 VVDTNIERIISRY--FDIIKPA  164 (356)
Q Consensus       145 ~vD~Nv~RVl~R~--~~~~~~~  164 (356)
                      ++|-++.-+=.|+  +|+.+|.
T Consensus       156 Ai~qaieiiR~RId~lGv~EP~  177 (840)
T PRK13023        156 ALARSIEIVRRRVEELGYGNPL  177 (840)
T ss_pred             HHHHHHHHHHHHHHHHCCCCCE
T ss_conf             9998999999866652688860


No 199
>PRK04053 rps13p 30S ribosomal protein S13P; Reviewed
Probab=42.01  E-value=12  Score=18.03  Aligned_cols=47  Identities=26%  Similarity=0.298  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH
Q ss_conf             998776754300578687314568888877787766776643496313124314
Q gi|254780479|r   97 NLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI  150 (356)
Q Consensus        97 ~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv  150 (356)
                      ++..++++.-.+..|.-|- .-.|..+-|||+.+|.+|+.-+      -+|.|.
T Consensus         5 ~~~~mvRI~gvdIp~~K~i-~~ALt~IyGIG~~~A~~Ic~~l------gId~~~   51 (149)
T PRK04053          5 EFRYIVRIANTDLDGTKPV-EYALTGIKGIGRRTARAIARKL------GLDPHA   51 (149)
T ss_pred             HHHHHHHHCCCCCCCCCEE-EEECCCCCCCCHHHHHHHHHHH------CCCCCC
T ss_conf             8667586508678999686-4441111484899999999991------899877


No 200
>TIGR02111 PQQ_syn_pqqC coenzyme PQQ biosynthesis protein C; InterPro: IPR011845    This entry describes the coenzyme PQQ (pyrrolo-quinoline-quinone) biosynthesis protein PqqC. Pyrroloquinoline quinone (PQQ) is the prosthetic group of several bacterial enzymes, including methanol dehydrogenase of methylotrophs and the glucose dehydrogenase of a number of bacteria . PQQC has been found to be required in the synthesis of PQQ but its function is unclear. This entry does not include related proteins likely to be functionally distinct from PqqC, such as homologs found in the Chlamydias.; GO: 0018189 pyrroloquinoline quinone biosynthetic process.
Probab=41.98  E-value=26  Score=15.70  Aligned_cols=76  Identities=20%  Similarity=0.332  Sum_probs=54.2

Q ss_pred             CCCHHHHHHHHHHHHHCCHHH-CCH--HHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCC
Q ss_conf             787046766446731161203-326--58998774899789872998999999740463458899987767543005786
Q gi|254780479|r   36 LPSPYKVWISEIMLQQTTVKT-VEP--YFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGN  112 (356)
Q Consensus        36 ~~~py~v~vseimlqqT~v~~-v~~--~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~  112 (356)
                      ..-||++++=+=-+-++||.. |++  ||+.- -=.-|-.-||+-+..+|-+.|.       -|         +.+++|.
T Consensus        20 ~~HPFH~~lh~G~~~~~Qv~AWv~NRyyYQ~~-IP~KDAA~Lar~~D~~~Rr~W~-------qR---------i~DhDG~   82 (239)
T TIGR02111        20 DKHPFHALLHEGKLTREQVQAWVLNRYYYQVN-IPLKDAAILARCPDPELRRIWR-------QR---------ILDHDGE   82 (239)
T ss_pred             CCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCC-CCHHHHHHHCCCCCHHHHHHHH-------HH---------HHCCCCC
T ss_conf             56743577516998477899999877887325-8456799860699888899998-------74---------4204867


Q ss_pred             CCCH--HHHHHHHH-HHHH
Q ss_conf             8731--45688888-7778
Q gi|254780479|r  113 FPHK--VEILKKLP-GIGD  128 (356)
Q Consensus       113 ~P~~--~~~l~~Lp-GiG~  128 (356)
                      .|++  .+.++.|- +||=
T Consensus        83 ~~~dGGi~rWLrLaeAvGL  101 (239)
T TIGR02111        83 VEEDGGIERWLRLAEAVGL  101 (239)
T ss_pred             CCCCCHHHHHHHHHHCCCC
T ss_conf             8887428999987631688


No 201
>COG1031 Uncharacterized Fe-S oxidoreductase [Energy production and conversion]
Probab=40.96  E-value=13  Score=17.73  Aligned_cols=30  Identities=33%  Similarity=0.409  Sum_probs=12.4

Q ss_pred             CCCCC-HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECC
Q ss_conf             86873-14568888877787766776643496313124
Q gi|254780479|r  111 GNFPH-KVEILKKLPGIGDYTASAIVAIAFNHFAVVVD  147 (356)
Q Consensus       111 g~~P~-~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD  147 (356)
                      |.+|. +.++|.+|=       .+|...|=+..+.-+|
T Consensus       253 ~e~P~PnPealekL~-------~Gir~~AP~l~tLHiD  283 (560)
T COG1031         253 GEVPRPNPEALEKLF-------RGIRNVAPNLKTLHID  283 (560)
T ss_pred             CCCCCCCHHHHHHHH-------HHHHHHCCCCEEEEEC
T ss_conf             879999989999999-------9998618987266545


No 202
>pfam05166 YcgL YcgL domain. This family of proteins formerly called DUF709 includes the E. coli gene ycgL. Homologues of YcgL are found in gammaproteobacteria. The structure of this protein shows a novel alpha/beta/alpha sandwich structure.
Probab=40.95  E-value=16  Score=17.21  Aligned_cols=21  Identities=33%  Similarity=0.472  Sum_probs=17.1

Q ss_pred             HHHHCCCHHHHHHHHHCCCHH
Q ss_conf             898729989999997404634
Q gi|254780479|r   72 FCLSSAKDEEILSAWAGLGYY   92 (356)
Q Consensus        72 ~~la~a~~~~vl~~w~gLGyy   92 (356)
                      +.||.|+.++|+..-+--|||
T Consensus        48 r~LA~~d~~~Vl~~l~~~Gfy   68 (74)
T pfam05166        48 KKLARADVEKVLEALEEQGFY   68 (74)
T ss_pred             CHHHCCCHHHHHHHHHHCCEE
T ss_conf             132139999999999868978


No 203
>PRK13024 bifunctional preprotein translocase subunit SecD/SecF; Reviewed
Probab=40.92  E-value=11  Score=18.19  Aligned_cols=31  Identities=32%  Similarity=0.477  Sum_probs=23.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHH
Q ss_conf             56888887778776677664349631312431466657651
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYF  158 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~  158 (356)
                      ..-++||||    ||-|++|+     ++||+||.= +.|+-
T Consensus       309 ~~tLTLpGI----AGiiLtiG-----maVDaNVli-fERik  339 (741)
T PRK13024        309 GAVLTLPGI----AGLVLGIG-----MAVDANVII-YERIK  339 (741)
T ss_pred             HHHCCHHHH----HHHHHHHH-----HHHCCCEEH-HHHHH
T ss_conf             422269999----99999998-----763211100-29899


No 204
>PRK02406 DNA polymerase IV; Validated
Probab=40.74  E-value=27  Score=15.54  Aligned_cols=132  Identities=18%  Similarity=0.152  Sum_probs=61.2

Q ss_pred             HHCCCHHHHHCCCHHHHHHHHHCCCHH-----HHHHHHHHHHHHHHHCCCCC---CCCHHHHHH------HHHHHHHHHH
Q ss_conf             748997898729989999997404634-----58899987767543005786---873145688------8887778776
Q gi|254780479|r   66 QKWPTIFCLSSAKDEEILSAWAGLGYY-----TRARNLKKCADIIVKKYEGN---FPHKVEILK------KLPGIGDYTA  131 (356)
Q Consensus        66 ~~~P~~~~la~a~~~~vl~~w~gLGyy-----~Rar~l~~~a~~i~~~~~g~---~P~~~~~l~------~LpGiG~yta  131 (356)
                      ..|++...+|..=.++|.+.   +|..     +.-+-|=|.|--.. +-+|.   -|++..+.+      .+||||+-|+
T Consensus       118 ~~~~~~~~~A~~ir~~I~~~---~gl~~SiGIa~nk~lAK~As~~~-KP~G~~vv~~~~~~~~L~~lpv~~l~GIG~~~~  193 (355)
T PRK02406        118 LCIGSATLIAQEIRRDIWNE---LGLTASAGVAPNKFLAKIASDWN-KPNGQFVITPEEVPAFVATLPVEKIPGVGKVTA  193 (355)
T ss_pred             CCCCCHHHHHHHHHHHHHHH---HCCCEEEECCCCHHHHHHHHHHC-CCCCCEEECHHHHHHHHHHCCCCCCCCCCHHHH
T ss_conf             01789999999999999998---79966870256799999988623-987663656789999996188740588588999


Q ss_pred             HHHHHHHCCCEEEECCCCHHH-----------HHHHHHCCCCCCCHHHHHHHHHHHCCC---CCCCHHHHHHHHHHHHHH
Q ss_conf             677664349631312431466-----------657651975457356899987410257---852267899999999863
Q gi|254780479|r  132 SAIVAIAFNHFAVVVDTNIER-----------IISRYFDIIKPAPLYHKTIKNYARKIT---STSRPGDFVQAMMDLGAL  197 (356)
Q Consensus       132 ~ai~s~a~~~~~~~vD~Nv~R-----------Vl~R~~~~~~~~~~~~k~l~~~~~~~~---~~~~~~~~nQAlMdlGa~  197 (356)
                      .-...++...-.-..+.....           +.....|++...-..+..-+.+...-+   +.....+.-++|.+|...
T Consensus       194 ~~L~~~GI~ti~DL~~~~~~~L~~~fG~~g~~l~~~a~Gid~~~V~~~~~~KSI~~~~Tf~~d~~~~~~i~~~l~~l~~~  273 (355)
T PRK02406        194 EKLHALGIRTCADLQKWDLATLLRRFGKFGRRLYERARGIDERPVKPDRERKSVGVERTYAKDLYDLEDCEAELERLYPE  273 (355)
T ss_pred             HHHHHCCCCCHHHHHHCCHHHHHHHHCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHH
T ss_conf             99998298179997609999999997989999999965998743033667610555678799999999999999999999


Q ss_pred             HCCC
Q ss_conf             2041
Q gi|254780479|r  198 ICTS  201 (356)
Q Consensus       198 iC~p  201 (356)
                      ||..
T Consensus       274 v~~r  277 (355)
T PRK02406        274 LEAR  277 (355)
T ss_pred             HHHH
T ss_conf             9999


No 205
>TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345   These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. .
Probab=40.68  E-value=10  Score=18.57  Aligned_cols=17  Identities=35%  Similarity=0.540  Sum_probs=7.7

Q ss_pred             HHHH-HHHHHHHHHHHHH
Q ss_conf             6888-8877787766776
Q gi|254780479|r  119 ILKK-LPGIGDYTASAIV  135 (356)
Q Consensus       119 ~l~~-LpGiG~yta~ai~  135 (356)
                      .|.. ..|||=-||.+++
T Consensus       195 ~L~~d~~GiGF~~AD~lA  212 (769)
T TIGR01448       195 LLAEDVKGIGFKTADQLA  212 (769)
T ss_pred             HHHHHHCCCCHHHHHHHH
T ss_conf             136751374056899999


No 206
>pfam09735 Nckap1 Membrane-associated apoptosis protein. Expression of this protein was found to be markedly reduced in patients with Alzheimer's disease. It is involved in the regulation of actin polymerisation in the brain as part of a WAVE2 signalling complex.
Probab=40.31  E-value=27  Score=15.49  Aligned_cols=17  Identities=12%  Similarity=0.288  Sum_probs=8.2

Q ss_pred             HHHHHHHHHHHCCCCCC
Q ss_conf             99999999996289988
Q gi|254780479|r    8 IQSKILDWYDTNHRVLP   24 (356)
Q Consensus         8 ~~~~ll~w~~~~~R~lp   24 (356)
                      +.+.=+-||-.|.-..|
T Consensus       368 ~ardEV~Wl~RH~~~~p  384 (1118)
T pfam09735       368 LARDEVLWLLRHAGNVP  384 (1118)
T ss_pred             HHHHHHHHHHHCCCCCC
T ss_conf             87778999884156787


No 207
>cd01700 Pol_V Pol V was discovered in Escherichia coli as Umuc and UmuD proteins induced by UV. This branch of DNA polymerases is mostly found in bacteria. Pol V enables DNA replication to bypass covalently linked cys-sin T-T photo-dimers and 6-4 T-T or T-C photoproducts, which would otherwise stall the DNA replication fork.
Probab=40.24  E-value=27  Score=15.49  Aligned_cols=69  Identities=20%  Similarity=0.249  Sum_probs=36.6

Q ss_pred             CHHHHHCCCHHHHHHHHH-----CCCHHHHHHHHHHHHHHHHHC---C-CCCC---CCHHHH------HHHHHHHHHHHH
Q ss_conf             978987299899999974-----046345889998776754300---5-7868---731456------888887778776
Q gi|254780479|r   70 TIFCLSSAKDEEILSAWA-----GLGYYTRARNLKKCADIIVKK---Y-EGNF---PHKVEI------LKKLPGIGDYTA  131 (356)
Q Consensus        70 ~~~~la~a~~~~vl~~w~-----gLGyy~Rar~l~~~a~~i~~~---~-~g~~---P~~~~~------l~~LpGiG~yta  131 (356)
                      ++.++|..=.++|.+.-.     |.|+   -+-|=|.|--+.+.   + +|-+   |++...      +.++||||+-|+
T Consensus       112 ~~~~~a~~ir~~I~~~~Glt~SiGIa~---nk~lAKiAs~~~Kp~~~~~~~~~~~~~~~~~~~l~~lpv~~i~GIG~~t~  188 (344)
T cd01700         112 DLEDLGREIRATVLQWTGIPVGVGIGP---TKTLAKLANHAAKKWPAYGGVVVLDSEERRRKLLKRTPVGDVWGIGRRLS  188 (344)
T ss_pred             CHHHHHHHHHHHHHHHHCCCEEEEECC---CHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHCCHHHCCCCCHHHH
T ss_conf             999999999999999879976983343---17899997887377888997144189899998755287444278369999


Q ss_pred             HHHHHHHCCC
Q ss_conf             6776643496
Q gi|254780479|r  132 SAIVAIAFNH  141 (356)
Q Consensus       132 ~ai~s~a~~~  141 (356)
                      .....+...-
T Consensus       189 ~kL~~~GI~T  198 (344)
T cd01700         189 KRLAAMGIHT  198 (344)
T ss_pred             HHHHHCCCCC
T ss_conf             9999869978


No 208
>PRK03137 1-pyrroline-5-carboxylate dehydrogenase; Provisional
Probab=39.97  E-value=28  Score=15.46  Aligned_cols=68  Identities=15%  Similarity=0.131  Sum_probs=43.0

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHC-----------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH
Q ss_conf             99974046345889998776754300-----------578687314568888877787766776643496313124314
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKK-----------YEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI  150 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~-----------~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv  150 (356)
                      .+.|+.+....|++-|+++|..|.++           -|--+.+...|....-..-+|.|+....+..+.+....+|..
T Consensus        86 f~~W~~~~~~eRa~~L~k~ad~l~~~~~ela~l~~~E~GK~~~ea~~Ev~~ai~~~~~~a~~~~~~~~~~~~~~~~g~~  164 (514)
T PRK03137         86 FETWRKWSPEDRARILFRAAAILRRRKHEFSAILVKEAGKPWDEADADTAEAIDFLEYYARQMLKLADGKPVLSRPGEH  164 (514)
T ss_pred             HHHHCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCE
T ss_conf             9988049999999999999999997999999999997592889999999999999999999999852884557889850


No 209
>PRK01151 rps17E 30S ribosomal protein S17e; Validated
Probab=39.75  E-value=15  Score=17.24  Aligned_cols=29  Identities=31%  Similarity=0.488  Sum_probs=24.7

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHH
Q ss_conf             88999877675430057868731456888
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKK  122 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~  122 (356)
                      |-+.+.++|+.|++.|.+.|-.|++.=..
T Consensus         4 r~k~IKr~a~~lieky~~~ft~DF~~NK~   32 (58)
T PRK01151          4 RPTFIKRTAEELLEKYPDKFTDDFEHNKK   32 (58)
T ss_pred             CCHHHHHHHHHHHHHHHHHHCCCHHHHHH
T ss_conf             62799999999999832052478888889


No 210
>cd03586 Pol_IV_kappa Pol_IV_kappa, a member of the Y-family of DNA polymerases. Pol_Y's can transverse replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at the cost of an elevated error rate. The Y-family has no 3'-5' exonuclease activity. In addition to possessing a topology akin to a right hand, with "thumb", "fingers" and "palm" motifs, like polymerases from the A-, B-, C- and X-families, the Y-family has a unique "little finger" motif. Expression of Y-family polymerases is often induced by DNA damage. These polymerases are phylogenetically unrelated to classical DNA polymerases. Originally called the DinB family, they belong to the recently described Y-family of DNA polymerases. Pol IV is mostly found in bacteria and archaea. Although the structure of Pol IV is similar to that of Pol eta, it shows markedly differenct efficiencies and fidelities in bypassing various DNA lesions. All Pol IV-like polymerases studied to date are able to b
Probab=39.23  E-value=28  Score=15.38  Aligned_cols=82  Identities=21%  Similarity=0.253  Sum_probs=39.7

Q ss_pred             HHCCCHHHHHCCCHHHHHHHHH-----CCCHHHHHHHHHHHHHHHHHCCCCCC---CCHHHHHH------HHHHHHHHHH
Q ss_conf             7489978987299899999974-----04634588999877675430057868---73145688------8887778776
Q gi|254780479|r   66 QKWPTIFCLSSAKDEEILSAWA-----GLGYYTRARNLKKCADIIVKKYEGNF---PHKVEILK------KLPGIGDYTA  131 (356)
Q Consensus        66 ~~~P~~~~la~a~~~~vl~~w~-----gLGyy~Rar~l~~~a~~i~~~~~g~~---P~~~~~l~------~LpGiG~yta  131 (356)
                      ..|-+..++|..=.++|.+.-.     |+|   .-+-|=|.|--+.+. +|.+   |++..+.+      .+||||+.|+
T Consensus       110 ~~~~~~~~~a~~ir~~I~~~tgl~~SiGIa---~nk~lAKiAs~~~KP-~G~~~i~~~~~~~~L~~lpv~~l~GIG~~~~  185 (337)
T cd03586         110 RLFGSATEIAKEIRARIREETGLTCSVGIA---PNKFLAKIASDLNKP-NGLTVIPPEKVWAFLAPLPVRKIWGIGKVTA  185 (337)
T ss_pred             CCCCCHHHHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHHHHCC-CCCEEECHHHHHHHHCCCCCHHCCCCCHHHH
T ss_conf             028999999999999999987997489995---739999999876100-7967608899999861276302078468999


Q ss_pred             HHHHHHHCCCEEEECCCCHH
Q ss_conf             67766434963131243146
Q gi|254780479|r  132 SAIVAIAFNHFAVVVDTNIE  151 (356)
Q Consensus       132 ~ai~s~a~~~~~~~vD~Nv~  151 (356)
                      .-....+...-.-..+-+..
T Consensus       186 ~~L~~~gI~t~~dl~~~~~~  205 (337)
T cd03586         186 EKLNRLGIRTIGDLAAADLE  205 (337)
T ss_pred             HHHHHCCCCCHHHHHCCCHH
T ss_conf             99998089689998649999


No 211
>cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity.
Probab=38.90  E-value=15  Score=17.37  Aligned_cols=43  Identities=33%  Similarity=0.504  Sum_probs=22.6

Q ss_pred             HCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHC
Q ss_conf             00578687314568888-----------877787766776643496313124314666576519
Q gi|254780479|r  107 KKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFD  159 (356)
Q Consensus       107 ~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~  159 (356)
                      ++| |.-|+.+-+++.|           ||||+.||..++.--         |++.-++..+-.
T Consensus       162 ~~~-gv~p~q~~d~~aL~GD~sDnIpGV~GiG~KtA~kLl~~~---------gsle~i~~~~~~  215 (240)
T cd00008         162 EKY-GVTPAQIIDYKALMGDSSDNIPGVPGIGEKTAAKLLKEY---------GSLEGILENLDK  215 (240)
T ss_pred             HHC-CCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH---------CCHHHHHHHHHH
T ss_conf             960-989999987699728764688999985789999999980---------999999982985


No 212
>PTZ00134 40S ribosomal protein S18; Provisional
Probab=38.80  E-value=9.9  Score=18.61  Aligned_cols=30  Identities=23%  Similarity=0.278  Sum_probs=21.7

Q ss_pred             CCCCCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             578687314---568888877787766776643
Q gi|254780479|r  109 YEGNFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       109 ~~g~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .|-.+|.+.   -.|..+.|||+.||.+|+.-+
T Consensus        18 ~gtdi~g~K~v~~aLt~I~GIG~~~A~~Ic~~~   50 (154)
T PTZ00134         18 LNTNVDGREKVTIALTAIKGIGRRFATVVCKQA   50 (154)
T ss_pred             CCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHC
T ss_conf             387589995889985322064899999999980


No 213
>KOG3835 consensus
Probab=38.06  E-value=30  Score=15.26  Aligned_cols=81  Identities=22%  Similarity=0.235  Sum_probs=54.3

Q ss_pred             CHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHH---HHHHHHHHHHHHHHCCC---C-------------C
Q ss_conf             1203326589987748-9978987299899999974046345---88999877675430057---8-------------6
Q gi|254780479|r   53 TVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYT---RARNLKKCADIIVKKYE---G-------------N  112 (356)
Q Consensus        53 ~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~---Rar~l~~~a~~i~~~~~---g-------------~  112 (356)
                      |-+-.+.||+-||++= -||+.|.+|.|+|.|....=.|--.   -.|+|+++-+.-+.+-+   -             +
T Consensus        19 qkANLlsYyd~FIqQGGDDvqQlceagEeEFLEIMaLVGMa~KPLHVRRlQkALrew~tnpglFnqpl~~ipvssip~yk   98 (495)
T KOG3835          19 QKANLLSYYDVFIQQGGDDVQQLCEAGEEEFLEIMALVGMAPKPLHVRRLQKALREWSTNPGLFNQPLQQIPVSSIPNYK   98 (495)
T ss_pred             HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCC
T ss_conf             87317999999998645689999875579999999986147764269999999998712950411852116999876556


Q ss_pred             CCCHHHHHHHHHHHHHHHHHH
Q ss_conf             873145688888777877667
Q gi|254780479|r  113 FPHKVEILKKLPGIGDYTASA  133 (356)
Q Consensus       113 ~P~~~~~l~~LpGiG~yta~a  133 (356)
                      .+++---.+-||||---|..-
T Consensus        99 ~~et~pt~lgl~~~s~~~s~~  119 (495)
T KOG3835          99 PAETDPTALGLPGISAATSPK  119 (495)
T ss_pred             CCCCCCCCCCCCCCHHCCCCC
T ss_conf             788884201475300014899


No 214
>TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type.
Probab=37.88  E-value=12  Score=17.94  Aligned_cols=27  Identities=30%  Similarity=0.396  Sum_probs=20.0

Q ss_pred             CCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             687314---568888877787766776643
Q gi|254780479|r  112 NFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+|.+.   -.|..+-|||+.+|.+|+.-+
T Consensus         6 di~~~K~v~~ALt~I~GIG~~~A~~Ic~~l   35 (113)
T TIGR03631         6 DIPNNKRVEIALTYIYGIGRTRARKILEKA   35 (113)
T ss_pred             CCCCCCCHHHHHHCEECCCHHHHHHHHHHC
T ss_conf             499996746065200275899999999992


No 215
>pfam00113 Enolase_C Enolase, C-terminal TIM barrel domain.
Probab=37.83  E-value=6.5  Score=19.90  Aligned_cols=29  Identities=17%  Similarity=0.140  Sum_probs=13.0

Q ss_pred             HHCCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHCC
Q ss_conf             3005786873--1456888887778776677664349
Q gi|254780479|r  106 VKKYEGNFPH--KVEILKKLPGIGDYTASAIVAIAFN  140 (356)
Q Consensus       106 ~~~~~g~~P~--~~~~l~~LpGiG~yta~ai~s~a~~  140 (356)
                      +.+-||-.|.  +.++-+.      +...||-.-+|.
T Consensus        66 vgDEGGfaP~~~~~e~aL~------li~eAi~~ag~~   96 (296)
T pfam00113        66 VGDEGGFAPNIQSNKEALD------LIVEAIEKAGYT   96 (296)
T ss_pred             CCCCCCCCCCCCCHHHHHH------HHHHHHHHCCCC
T ss_conf             5778787999988199999------999999974898


No 216
>pfam11372 DUF3173 Protein of unknown function (DUF3173). This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=37.56  E-value=20  Score=16.48  Aligned_cols=25  Identities=20%  Similarity=0.513  Sum_probs=10.9

Q ss_pred             CHHHHHHHHHCCCHH-HHHHHHHHHHHHHH
Q ss_conf             989999997404634-58899987767543
Q gi|254780479|r   78 KDEEILSAWAGLGYY-TRARNLKKCADIIV  106 (356)
Q Consensus        78 ~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~  106 (356)
                      +-+|+++    |||- +-|+++.+-|+.+.
T Consensus         5 t~~dLi~----lGf~~~~A~~IIrqAK~~l   30 (59)
T pfam11372         5 TKKDLIK----LGFKPHTARDIIRQAKELL   30 (59)
T ss_pred             CHHHHHH----HCCCHHHHHHHHHHHHHHH
T ss_conf             7999999----5798879999999999999


No 217
>pfam08887 GAD-like GAD-like domain. This domain is functionally uncharacterized, but it appears to be distantly related to the GAD domain pfam02938.
Probab=37.39  E-value=25  Score=15.77  Aligned_cols=13  Identities=8%  Similarity=0.376  Sum_probs=8.0

Q ss_pred             HHHHHHHHCCCHH
Q ss_conf             5899877489978
Q gi|254780479|r   60 YFKKFMQKWPTIF   72 (356)
Q Consensus        60 ~~~~~~~~~P~~~   72 (356)
                      .|+.|+++|....
T Consensus         3 ~~e~fl~~fG~~~   15 (109)
T pfam08887         3 TLEEFLESFGPPI   15 (109)
T ss_pred             HHHHHHHHCCCCC
T ss_conf             7899999728966


No 218
>cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination.
Probab=37.12  E-value=8.2  Score=19.17  Aligned_cols=19  Identities=32%  Similarity=0.552  Sum_probs=14.4

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q ss_conf             5688888777877667766
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVA  136 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s  136 (356)
                      |.+-.+||||+.||..++.
T Consensus        22 DnipGV~GIG~ktA~~ll~   40 (75)
T cd00080          22 DNIPGVPGIGPKTALKLLK   40 (75)
T ss_pred             CCCCCCCCCCHHHHHHHHH
T ss_conf             5887758637999999999


No 219
>PRK11820 hypothetical protein; Provisional
Probab=37.04  E-value=28  Score=15.43  Aligned_cols=34  Identities=15%  Similarity=0.158  Sum_probs=27.4

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH
Q ss_conf             8899987767543005786873145688888777
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIG  127 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG  127 (356)
                      .+..+++.-+.|.+.++...|-+..+|+++|||=
T Consensus        84 ~~~~y~~~l~~i~~~~~~~~~~~l~~ll~~p~v~  117 (288)
T PRK11820         84 LAKQYLKALEEVKEELPEAGEISLDDLLRWPGVL  117 (288)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCCC
T ss_conf             9999999999999874998999999996386522


No 220
>TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13.
Probab=36.34  E-value=8.9  Score=18.92  Aligned_cols=30  Identities=23%  Similarity=0.224  Sum_probs=22.0

Q ss_pred             CCCCCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             578687314---568888877787766776643
Q gi|254780479|r  109 YEGNFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       109 ~~g~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .|-.+|.+.   -.|..+.|||+.+|.+|+.-+
T Consensus         9 ~g~di~~~K~v~~aLt~I~GIG~~~A~~Ic~~~   41 (144)
T TIGR03629         9 ADTDLDGNKPVEYALTGIKGIGRRFARAIARKL   41 (144)
T ss_pred             ECCCCCCCCEEEEEEEEECCCCHHHHHHHHHHH
T ss_conf             075489996898872212372899999999990


No 221
>cd00424 Pol_Y Y-family of DNA polymerases. Pol_Y's can transverse replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at the cost of an elevated error rate. The Y-family has no 3'-5' exonuclease activity. In addition to possessing a topology akin to a right hand, with "thumb", "fingers" and "palm" motifs, like polymerases from the A-, B-, C- and X-families, the Y-family has a unique "little finger" motif. Expression of Y-family polymerases is often induced by DNA damage. These polymerases are phylogenetically unrelated to classical DNA polymerases.
Probab=35.93  E-value=18  Score=16.70  Aligned_cols=89  Identities=22%  Similarity=0.247  Sum_probs=41.6

Q ss_pred             HCCCHHHHHCCCHHHHHHHHHCCCHHHH---HH--HHHHHHHHHHHCCCCC---CCCHHH-------HHHHHHHHHHHHH
Q ss_conf             4899789872998999999740463458---89--9987767543005786---873145-------6888887778776
Q gi|254780479|r   67 KWPTIFCLSSAKDEEILSAWAGLGYYTR---AR--NLKKCADIIVKKYEGN---FPHKVE-------ILKKLPGIGDYTA  131 (356)
Q Consensus        67 ~~P~~~~la~a~~~~vl~~w~gLGyy~R---ar--~l~~~a~~i~~~~~g~---~P~~~~-------~l~~LpGiG~yta  131 (356)
                      .|.+...+|..=.++|..   -+|.-..   |.  -|=|+|--+. +-+|.   .|++..       .+..|||||+-|+
T Consensus       111 ~~~~~~~~a~~ir~~I~~---~~gl~~SiGIa~nk~lAKlAs~~~-Kp~g~~~~~~~~~~~~~l~~lpv~~l~GIG~~~~  186 (341)
T cd00424         111 LLEDGLDLALEIKQIIAE---ETGITASIGIAANKLLAKLAAKLA-KPNGLTILFPESLPEEFLIKLPLSDIPGIGSVTA  186 (341)
T ss_pred             CCCCHHHHHHHHHHHHHH---HHCCCEEEEECCCHHHHHHHHHHC-CCCCEEEECCHHHHHHHHCCCCHHHHCCCCHHHH
T ss_conf             078999999999999999---849966996465199999999744-9998698385777999741388788748788999


Q ss_pred             HHHHHHHCCCEEEECCCCHHHHHHHHHC
Q ss_conf             6776643496313124314666576519
Q gi|254780479|r  132 SAIVAIAFNHFAVVVDTNIERIISRYFD  159 (356)
Q Consensus       132 ~ai~s~a~~~~~~~vD~Nv~RVl~R~~~  159 (356)
                      .-..........-..+.+...+...+++
T Consensus       187 ~~L~~~gi~ti~dl~~~~~~~l~~~~~~  214 (341)
T cd00424         187 SRLEALGIIRKCDLLALQPDQLLASLGG  214 (341)
T ss_pred             HHHHHCCCCCHHHHHHCCHHHHHHHHCC
T ss_conf             9999849935999871899999999768


No 222
>COG0529 CysC Adenylylsulfate kinase and related kinases [Inorganic ion transport and metabolism]
Probab=35.50  E-value=22  Score=16.20  Aligned_cols=42  Identities=19%  Similarity=0.214  Sum_probs=29.3

Q ss_pred             HHHHHHHHHHHHHHHH---HHHCCCEEEECCCCHHH-HHHHHHCCC
Q ss_conf             8888877787766776---64349631312431466-657651975
Q gi|254780479|r  120 LKKLPGIGDYTASAIV---AIAFNHFAVVVDTNIER-IISRYFDII  161 (356)
Q Consensus       120 l~~LpGiG~yta~ai~---s~a~~~~~~~vD~Nv~R-Vl~R~~~~~  161 (356)
                      +.-|+|-|..|-+-.+   .++-|..+-++||-..| -|+|=+|.+
T Consensus        28 ~TGLSGsGKSTiA~ale~~L~~~G~~~y~LDGDnvR~gL~~dLgFs   73 (197)
T COG0529          28 FTGLSGSGKSTIANALEEKLFAKGYHVYLLDGDNVRHGLNRDLGFS   73 (197)
T ss_pred             EECCCCCCHHHHHHHHHHHHHHCCCEEEEECCHHHHHCCCCCCCCC
T ss_conf             6468888787999999999997597589855746765005788978


No 223
>PRK11241 gabD succinate-semialdehyde dehydrogenase I; Provisional
Probab=35.50  E-value=32  Score=14.99  Aligned_cols=56  Identities=18%  Similarity=0.275  Sum_probs=33.2

Q ss_pred             HHHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCC-CCCHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9999740463458899987767543005----------786-8731456888887778776677664
Q gi|254780479|r   82 ILSAWAGLGYYTRARNLKKCADIIVKKY----------EGN-FPHKVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus        82 vl~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~-~P~~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      -.+.|+.+..+.|++-|++.|..+.++-          .|+ +-+...|+...--.-+|.|.....+
T Consensus        60 A~~~W~~~~~~~R~~iL~k~a~~l~~~~eela~l~~~E~GK~~~~A~~ev~~~~~~~~~~a~~a~~~  126 (482)
T PRK11241         60 ALPAWRALTAKERANILRRWFNLMMEHQDDLARLMTLEQGKPLAEAKGEISYAASFIEWFAEEGKRI  126 (482)
T ss_pred             HHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             9899814999999999999999999789999999999839972455348999999999999999984


No 224
>pfam03755 YicC_N YicC-like family, N-terminal region. Family of bacterial proteins. Although poorly characterized, the members of this protein family have been demonstrated to play a role in stationary phase survival. These proteins are not essential during stationary phase.
Probab=35.26  E-value=30  Score=15.17  Aligned_cols=34  Identities=15%  Similarity=0.186  Sum_probs=26.6

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH
Q ss_conf             8899987767543005786873145688888777
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIG  127 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG  127 (356)
                      .++.+....+.|.+.++-..|-+..+++++|||=
T Consensus        83 ~~~~~~~~~~~l~~~~~~~~~~~~~~ll~~~~vi  116 (159)
T pfam03755        83 LLKAYLAALKELAEELGLAAPISLDDLLRLPGVL  116 (159)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCCE
T ss_conf             9999999999999972998999999996598700


No 225
>LOAD_Hrd consensus
Probab=35.13  E-value=11  Score=18.39  Aligned_cols=21  Identities=24%  Similarity=0.447  Sum_probs=17.7

Q ss_pred             CCCCHHHHHHHHHHHHHHHHH
Q ss_conf             687314568888877787766
Q gi|254780479|r  112 NFPHKVEILKKLPGIGDYTAS  132 (356)
Q Consensus       112 ~~P~~~~~l~~LpGiG~yta~  132 (356)
                      ..|.|.++|..++|||+....
T Consensus        41 ~~P~t~~eL~~I~Gig~~k~~   61 (77)
T LOAD_Hrd         41 LLPTTVSELLAIDGVGEAKVE   61 (77)
T ss_pred             HCCCCHHHHCCCCCCCHHHHH
T ss_conf             789999998289996999999


No 226
>COG1084 Predicted GTPase [General function prediction only]
Probab=34.89  E-value=33  Score=14.92  Aligned_cols=84  Identities=15%  Similarity=0.224  Sum_probs=49.7

Q ss_pred             HHHHHHCCHHHCCHHHHHHHHHCCCHHHHH---------CCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCH
Q ss_conf             467311612033265899877489978987---------29989999997404634588999877675430057868731
Q gi|254780479|r   46 EIMLQQTTVKTVEPYFKKFMQKWPTIFCLS---------SAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHK  116 (356)
Q Consensus        46 eimlqqT~v~~v~~~~~~~~~~~P~~~~la---------~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~  116 (356)
                      |+-.-+|.-+.|..++.+++++||++..|-         -.+.+++-+..+.+   ++|..+   .+.|.++|-+.+.  
T Consensus        47 e~~rv~t~~~i~~d~l~~iv~~~P~id~LhpFY~eLidvl~d~d~~k~sLs~v---~~A~~~---i~~l~~eYi~~lk--  118 (346)
T COG1084          47 EIRRVKTASNIVRDRLDKIVERFPSLDDLHPFYRELIDVLVDIDHLKISLSAV---SWASKI---IEKLAREYIRLLK--  118 (346)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHHHHHHH---HHHHHH---HHHHHHHHHHHHH--
T ss_conf             99999999999999999999848884326858999999994789999999997---799999---9999999999874--


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHCCCEEE
Q ss_conf             4568888877787766776643496313
Q gi|254780479|r  117 VEILKKLPGIGDYTASAIVAIAFNHFAV  144 (356)
Q Consensus       117 ~~~l~~LpGiG~yta~ai~s~a~~~~~~  144 (356)
                             .+=.+++|+.++--|||.-+.
T Consensus       119 -------~a~~~~~~~~lrR~a~GR~aS  139 (346)
T COG1084         119 -------AAKDPKEANQLRRQAFGRVAS  139 (346)
T ss_pred             -------CCCCHHHHHHHHHHHHHHHHH
T ss_conf             -------279866899999999878999


No 227
>KOG2379 consensus
Probab=34.87  E-value=27  Score=15.53  Aligned_cols=36  Identities=22%  Similarity=0.283  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHHH
Q ss_conf             9987767543005786873-14568888877787766776
Q gi|254780479|r   97 NLKKCADIIVKKYEGNFPH-KVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus        97 ~l~~~a~~i~~~~~g~~P~-~~~~l~~LpGiG~yta~ai~  135 (356)
                      -++++.+.+++.   ++|- +..++-++-|+|.+-..-++
T Consensus        38 s~~ka~r~l~~~---~~~l~tl~dl~~ikG~G~~~~~k~~   74 (501)
T KOG2379          38 SLKKALRNLCKA---KDPLMTLPDLSSIKGFGKKWIVKLM   74 (501)
T ss_pred             CHHHHHHHHHCC---CCCCCCHHHCCCCCCCCCHHHHHHH
T ss_conf             678899875316---6752211210010576503688887


No 228
>KOG2705 consensus
Probab=34.82  E-value=20  Score=16.48  Aligned_cols=28  Identities=25%  Similarity=0.489  Sum_probs=22.1

Q ss_pred             HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC
Q ss_conf             005786873145688888777877667766434
Q gi|254780479|r  107 KKYEGNFPHKVEILKKLPGIGDYTASAIVAIAF  139 (356)
Q Consensus       107 ~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~  139 (356)
                      -+.+|+.|++.     ||||||..||+.--+.+
T Consensus       193 ~d~~G~~p~s~-----~p~i~Rf~aG~fYlv~~  220 (471)
T KOG2705         193 LDSDGQPPKSA-----LPSIGRFLAGCFYLVLH  220 (471)
T ss_pred             ECCCCCCCCCC-----CHHHHHHHCCCHHHHHH
T ss_conf             34899997321-----32467672241888888


No 229
>pfam00416 Ribosomal_S13 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes.
Probab=34.68  E-value=12  Score=17.96  Aligned_cols=27  Identities=30%  Similarity=0.341  Sum_probs=20.1

Q ss_pred             CCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             687314---568888877787766776643
Q gi|254780479|r  112 NFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+|.+.   -.|..+-|||+++|.+|+.-+
T Consensus         6 di~~~K~v~~ALt~I~GIG~~~A~~Ic~~l   35 (106)
T pfam00416         6 NLDGNKKIEIALTYIKGIGRRKANQILKKA   35 (106)
T ss_pred             CCCCCCEEEEEECCCCCCCHHHHHHHHHHC
T ss_conf             799996864441121052899999999991


No 230
>pfam02467 Whib Transcription factor WhiB. WhiB is a putative transcription factor in Actinobacteria, required for differentiation and sporulation.
Probab=34.31  E-value=12  Score=18.00  Aligned_cols=19  Identities=26%  Similarity=0.759  Sum_probs=15.1

Q ss_pred             CCCCCCCCCCCCHHHHHCC
Q ss_conf             3564311000001343101
Q gi|254780479|r  203 KPLCPLCPIQKNCLTFSEG  221 (356)
Q Consensus       203 ~P~C~~Cpl~~~C~~~~~~  221 (356)
                      +..|..|||...|..|...
T Consensus        29 k~iC~~CpV~~~Cl~~Al~   47 (66)
T pfam02467        29 KRICMRCPVLQECAAHALE   47 (66)
T ss_pred             HHHHCCCCCHHHHHHHHHH
T ss_conf             9998089629999999997


No 231
>PRK10729 nudF ADP-ribose pyrophosphatase NudF; Provisional
Probab=34.28  E-value=34  Score=14.86  Aligned_cols=76  Identities=4%  Similarity=0.014  Sum_probs=39.4

Q ss_pred             EEEEE-EE-CCCEEEEEECCCCHHHHC-----CCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEE------E
Q ss_conf             20003-41-376057764572013204-----74421111346721588863010100002137358998315------9
Q gi|254780479|r  241 AVFIA-IT-NDNRILLRKRTNTRLLEG-----MDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHF------T  307 (356)
Q Consensus       241 ~~~vi-i~-~~~kiLL~KRp~~gll~G-----LwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~------~  307 (356)
                      .+.|+ ++ ..++++|.+.=--+.+..     |||+|-+..+.+++.......+..++..-..+.+.+..+-+      +
T Consensus        51 av~vL~yDp~~d~vvLveQfR~~~~~~~~~~wllEipAG~id~gE~p~~~A~REl~EEtG~~~~~l~~l~~~~~SPG~s~  130 (202)
T PRK10729         51 AAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIEEGESVEDVARREAIEEAGLIVGRTKPVLSYLASPGGTS  130 (202)
T ss_pred             EEEEEEEECCCCEEEEEEECCCHHCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCC
T ss_conf             79999980778969999823512203799963999402033899798999999988875958052899778816897533


Q ss_pred             EEEEEEEEE
Q ss_conf             999999998
Q gi|254780479|r  308 LTLFVWKTI  316 (356)
Q Consensus       308 L~i~v~~~~  316 (356)
                      =.+++|.+.
T Consensus       131 E~i~lf~a~  139 (202)
T PRK10729        131 ERSSIMVGE  139 (202)
T ss_pred             EEEEEEEEE
T ss_conf             159999998


No 232
>PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional
Probab=33.75  E-value=34  Score=14.80  Aligned_cols=80  Identities=11%  Similarity=0.094  Sum_probs=46.8

Q ss_pred             HHHHHHHHHCCCCCCC---CHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHH
Q ss_conf             8776754300578687---3145688888777877667766434963131243146665765197545735689998741
Q gi|254780479|r   99 KKCADIIVKKYEGNFP---HKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYA  175 (356)
Q Consensus        99 ~~~a~~i~~~~~g~~P---~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~  175 (356)
                      -.+-..+..+||+++-   .+...+.++||.+|.-|+-|.+-+   |.--    |    .-++.+..-+...++.++...
T Consensus        51 n~aD~kl~t~~g~KIDlNNa~vr~f~q~pGmYPtlA~kIv~~~---PY~s----V----eDvl~ipgLse~qK~~l~k~~  119 (144)
T PRK02515         51 NAADAKLATERGEKIDLNNSSVRAFRQFPGMYPTLAGKIVKNA---PYDS----V----EDVLNLPGLSERQKELLEANL  119 (144)
T ss_pred             CHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHCC---CCCC----H----HHHHHCCCCCHHHHHHHHHHH
T ss_conf             5688888887156402562749999868884679999998479---9787----9----999717899999999999865


Q ss_pred             HCCCCCCCHHHHHH
Q ss_conf             02578522678999
Q gi|254780479|r  176 RKITSTSRPGDFVQ  189 (356)
Q Consensus       176 ~~~~~~~~~~~~nQ  189 (356)
                      +.++-..-.-.||+
T Consensus       120 ~~Ftvt~p~~~~~e  133 (144)
T PRK02515        120 ENFTVTEPEIALNE  133 (144)
T ss_pred             CCCEECCCCHHHHC
T ss_conf             36634687288863


No 233
>cd07092 ALDH_ABALDH-YdcW Escherichia coli NAD+-dependent gamma-aminobutyraldehyde dehydrogenase YdcW-like. NAD+-dependent, tetrameric, gamma-aminobutyraldehyde dehydrogenase (ABALDH), YdcW of Escherichia coli K12, catalyzes the oxidation of gamma-aminobutyraldehyde to gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl medium-chain aldehydes, but with a lower catalytic efficiency.
Probab=33.64  E-value=31  Score=15.09  Aligned_cols=22  Identities=32%  Similarity=0.397  Sum_probs=9.4

Q ss_pred             HHHCCCHHHHHHHHHHHHHHHH
Q ss_conf             9740463458899987767543
Q gi|254780479|r   85 AWAGLGYYTRARNLKKCADIIV  106 (356)
Q Consensus        85 ~w~gLGyy~Rar~l~~~a~~i~  106 (356)
                      .|+.+....|++-|+++|..|.
T Consensus        34 ~W~~~~~~eR~~~L~k~a~~l~   55 (450)
T cd07092          34 SWRRTTPAERSKALLKLADAIE   55 (450)
T ss_pred             HHCCCCHHHHHHHHHHHHHHHH
T ss_conf             9744999999999999999999


No 234
>cd07093 ALDH_F8_HMSADH Human aldehyde dehydrogenase family 8 member A1-like. In humans, the  aldehyde dehydrogenase family 8 member A1 (ALDH8A1) protein functions to convert 9-cis-retinal to 9-cis-retinoic acid and has a preference for NAD+. Also included in this CD is the 2-hydroxymuconic semialdehyde dehydrogenase (HMSADH) which catalyzes the conversion of 2-hydroxymuconic semialdehyde to 4-oxalocrotonate, a step in the meta cleavage pathway of aromatic hydrocarbons in bacteria. Such HMSADHs seen here are: XylG of the TOL plasmid pWW0 of Pseudomonas putida, TomC  of Burkholderia cepacia G4, and AphC of Comamonas testosterone.
Probab=33.51  E-value=32  Score=15.06  Aligned_cols=24  Identities=38%  Similarity=0.576  Sum_probs=13.3

Q ss_pred             HHHHCCCHHHHHHHHHHHHHHHHH
Q ss_conf             997404634588999877675430
Q gi|254780479|r   84 SAWAGLGYYTRARNLKKCADIIVK  107 (356)
Q Consensus        84 ~~w~gLGyy~Rar~l~~~a~~i~~  107 (356)
                      +.|+.+..+.|++-|.++|..+.+
T Consensus        33 ~~W~~~~~~eR~~~L~~~a~~l~~   56 (455)
T cd07093          33 PGWSRMSPAERARILHKVADLIEA   56 (455)
T ss_pred             HHHHCCCHHHHHHHHHHHHHHHHH
T ss_conf             988029999999999999999997


No 235
>smart00341 HRDC Helicase and RNase D C-terminal. Hypothetical role in nucleic acid binding. Mutations in the HRDC domain cause human disease.
Probab=33.45  E-value=13  Score=17.80  Aligned_cols=21  Identities=29%  Similarity=0.523  Sum_probs=17.0

Q ss_pred             CCCCHHHHHHHHHHHHHHHHH
Q ss_conf             687314568888877787766
Q gi|254780479|r  112 NFPHKVEILKKLPGIGDYTAS  132 (356)
Q Consensus       112 ~~P~~~~~l~~LpGiG~yta~  132 (356)
                      ..|.|.++|.+++|+|++...
T Consensus        41 ~~P~~~~~L~~i~g~~~~~~~   61 (81)
T smart00341       41 ALPTNVSELLAIDGVGEEKAR   61 (81)
T ss_pred             HCCCCHHHHHCCCCCCHHHHH
T ss_conf             887999998468999999999


No 236
>cd07090 ALDH_F9_TMBADH NAD+-dependent 4-trimethylaminobutyraldehyde dehydrogenase, ALDH family 9A1. NAD+-dependent, 4-trimethylaminobutyraldehyde dehydrogenase (TMABADH, EC=1.2.1.47), also known as aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in humans, is a cytosolic tetramer which catalyzes the oxidation of gamma-aminobutyraldehyde involved in 4-aminobutyric acid (GABA) biosynthesis  and also oxidizes betaine aldehyde (gamma-trimethylaminobutyraldehyde) which is involved in carnitine biosynthesis.
Probab=32.96  E-value=34  Score=14.80  Aligned_cols=57  Identities=18%  Similarity=0.285  Sum_probs=32.7

Q ss_pred             HHHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCC-CCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9999740463458899987767543005----------786-87314568888877787766776643
Q gi|254780479|r   82 ILSAWAGLGYYTRARNLKKCADIIVKKY----------EGN-FPHKVEILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        82 vl~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~-~P~~~~~l~~LpGiG~yta~ai~s~a  138 (356)
                      ..+.|+-+....|++-|++.|..+.++-          .|+ +-+...|+...-..-+|.|+....+.
T Consensus        31 A~~~W~~~~~~eR~~~l~k~a~~l~~~~~~la~~~~~E~GK~~~ea~~Ev~~a~~~~~~~a~~~~~~~   98 (457)
T cd07090          31 AQKEWSATSGMERGRILRKAADLLRERNDEIARLETIDNGKPIEEARVDIDSSADCLEYYAGLAPTLS   98 (457)
T ss_pred             HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf             99998229999999999999999997899999999997190899999999999999999999999824


No 237
>TIGR01862 N2-ase-Ialpha nitrogenase component I, alpha chain; InterPro: IPR010143   This entry represents all three varieties (Fe-Fe, Mo-Fe and V-Fe) of the component I alpha chain of nitrogenase.   Nitrogenase (1.18.6.1 from EC)  is the enzyme system responsible for biological nitrogen fixation. Nitrogenase is an oligomeric complex which consists of two components: component 2 is an homodimer of an iron-sulphur protein, while component 1 which contains the active site for the reduction of nitrogen to ammonia exists in three different forms: the molybdenum-iron containing protein (MoFe) is a hetero-tetramer consisting of two pairs of alpha (nifD) and beta (nifK) subunits; the vanadium-iron containing protein (VFe) is a hexamer of two pairs each of alpha (vnfD), beta (vnfK), and delta (vnfG) subunits; the third form seems to only contain iron and is a hexamer composed of alpha (anfD), beta (anfK), and delta (anfG) subunits. The alpha and beta chains of the three types of component 1 are evolutionary related and they are also related to proteins nifE and nifN, which are most probably involved in the iron-molybdenum cofactor biosynthesis .; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation.
Probab=32.18  E-value=5.5  Score=20.41  Aligned_cols=114  Identities=20%  Similarity=0.288  Sum_probs=69.8

Q ss_pred             HHHHHHCCCHHHHHHHHHHH--------------HHHHHHCCCCCCCCHHHHHHHHHHHHH-HHHHHHHHHHCCCEEEEC
Q ss_conf             99997404634588999877--------------675430057868731456888887778-776677664349631312
Q gi|254780479|r   82 ILSAWAGLGYYTRARNLKKC--------------ADIIVKKYEGNFPHKVEILKKLPGIGD-YTASAIVAIAFNHFAVVV  146 (356)
Q Consensus        82 vl~~w~gLGyy~Rar~l~~~--------------a~~i~~~~~g~~P~~~~~l~~LpGiG~-yta~ai~s~a~~~~~~~v  146 (356)
                      |+..|.|=|||..-|++|++              |+.+.++||  +|     +++.-++|. |++..++.||--=-.+-+
T Consensus       231 V~a~fTGdg~yd~~~~mh~A~LNlV~C~RS~nYIa~~le~rYG--IP-----~~~~dffG~~~~~e~LR~I~~~Fg~~eI  303 (510)
T TIGR01862       231 VVATFTGDGTYDEIRLMHKAKLNLVECARSANYIANELEERYG--IP-----WLKIDFFGFEYIAESLRAIAAFFGDSEI  303 (510)
T ss_pred             EEEECCCCCCHHHHHCCCCCCCCCEEECCHHHHHHHHHHHCCC--CC-----EEEECCCCCHHHHHHHHHHHHHCCCCCH
T ss_conf             9775134422266312662110111101405799998553078--75-----3775465703789989999964588542


Q ss_pred             CCCHHHHHHHHHC-CCCCCCHHHHHHHHHHHCCCC-CCCHHHHHH-HHHHHHHHHCCCC
Q ss_conf             4314666576519-754573568999874102578-522678999-9999986320412
Q gi|254780479|r  147 DTNIERIISRYFD-IIKPAPLYHKTIKNYARKITS-TSRPGDFVQ-AMMDLGALICTSN  202 (356)
Q Consensus       147 D~Nv~RVl~R~~~-~~~~~~~~~k~l~~~~~~~~~-~~~~~~~nQ-AlMdlGa~iC~p~  202 (356)
                      .--.++|++-... +.......+..|+-.---|-. ..++.++.. +.=|||-.|-...
T Consensus       304 ~~r~EevIae~~a~w~p~ld~yKerL~GK~v~ly~GG~r~wH~~~~~~~~lG~~VV~~~  362 (510)
T TIGR01862       304 EKRAEEVIAEEIAKWKPELDYYKERLQGKRVCLYVGGSRLWHWIKSLEEDLGVEVVAVG  362 (510)
T ss_pred             HHHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCEEEEEE
T ss_conf             26888877874421041245411311498899975888001111257776394599975


No 238
>pfam00570 HRDC HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease. It is interesting to note that the RecQ helicase in Deinococcus radiodurans has three tandem HRDC domains.
Probab=31.83  E-value=13  Score=17.74  Aligned_cols=21  Identities=29%  Similarity=0.662  Sum_probs=17.9

Q ss_pred             CCCCHHHHHHHHHHHHHHHHH
Q ss_conf             687314568888877787766
Q gi|254780479|r  112 NFPHKVEILKKLPGIGDYTAS  132 (356)
Q Consensus       112 ~~P~~~~~l~~LpGiG~yta~  132 (356)
                      ..|.|.++|.+++|+|+..+.
T Consensus        38 ~~P~s~~~L~~i~g~~~~~~~   58 (68)
T pfam00570        38 KLPRTLEELLRIPGVGPRKLE   58 (68)
T ss_pred             HCCCCHHHHHCCCCCCHHHHH
T ss_conf             784999998089999999999


No 239
>smart00279 HhH2 Helix-hairpin-helix class 2 (Pol1 family) motifs.
Probab=31.70  E-value=15  Score=17.35  Aligned_cols=19  Identities=26%  Similarity=0.405  Sum_probs=13.9

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q ss_conf             5688888777877667766
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVA  136 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s  136 (356)
                      +.+-.+||||+.||.-++.
T Consensus        16 DnipGV~GIG~ktA~~ll~   34 (36)
T smart00279       16 DNIPGVKGIGPKTALKLLR   34 (36)
T ss_pred             CCCCCCCCCCHHHHHHHHH
T ss_conf             6889999747899999998


No 240
>PRK09407 gabD2 succinic semialdehyde dehydrogenase; Reviewed
Probab=31.64  E-value=37  Score=14.57  Aligned_cols=55  Identities=20%  Similarity=0.268  Sum_probs=33.7

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9997404634588999877675430057----------86-8731456888887778776677664
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHKVEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~~~~l~~LpGiG~yta~ai~s~  137 (356)
                      .+.|+.+....|++-|++.|..|.++-+          |+ +-+...|.......-+|.|+....+
T Consensus        66 ~~~W~~~~~~eR~~iL~k~ad~l~~~~eela~l~~~E~GK~~~~a~~Ev~~~~~~~~~~a~~a~~~  131 (523)
T PRK09407         66 QRDWAARPVRERAAVFLRFHDLVLENREELMDLIQLETGKARAHAFEEVLDVAINARYYARVAEKL  131 (523)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             898821999999999999999999779999999999729179999999999999999999999997


No 241
>PTZ00081 enolase (2-phospho-D-glycerate hydrolase); Provisional
Probab=31.58  E-value=8.9  Score=18.95  Aligned_cols=12  Identities=8%  Similarity=0.180  Sum_probs=4.8

Q ss_pred             CCCHHHHHHHHH
Q ss_conf             431466657651
Q gi|254780479|r  147 DTNIERIISRYF  158 (356)
Q Consensus       147 D~Nv~RVl~R~~  158 (356)
                      .--..|-++.+.
T Consensus       132 ~~pLy~yl~~l~  143 (442)
T PTZ00081        132 GVPLYRYLAQLA  143 (442)
T ss_pred             CCCHHHHHHHHH
T ss_conf             993999988761


No 242
>KOG0648 consensus
Probab=31.50  E-value=37  Score=14.55  Aligned_cols=47  Identities=15%  Similarity=0.160  Sum_probs=32.6

Q ss_pred             CEEEEEEEEEECCCEEEEEECC-CCHHHHCCCCCCCCCCCCCCCHHHH
Q ss_conf             1123200034137605776457-2013204744211113467215888
Q gi|254780479|r  237 MRTGAVFIAITNDNRILLRKRT-NTRLLEGMDELPGSAWSSTKDGNID  283 (356)
Q Consensus       237 ~r~~~~~vii~~~~kiLL~KRp-~~gll~GLwEFP~~e~~~~~~~~~~  283 (356)
                      .+....+.++|.++++|+.+.. .--...|.|-||++.+...++.+.-
T Consensus       114 h~vgvg~~V~n~~~eVlVv~e~d~~~~~~~~wK~ptG~v~~~e~i~~g  161 (295)
T KOG0648         114 HRVGVGAFVLNKKKEVLVVQEKDGAVKIRGGWKLPTGRVEEGEDIWHG  161 (295)
T ss_pred             HHEEEEEEEECCCCEEEEEEECCCCEEECCCCCCCCEEECCCCCCHHH
T ss_conf             340001367368863678972255323126510561375366540134


No 243
>PHA00619 CRISPR-associated Cas4-like protein
Probab=31.50  E-value=25  Score=15.77  Aligned_cols=12  Identities=33%  Similarity=0.863  Sum_probs=10.9

Q ss_pred             CCCCCCCCCCHH
Q ss_conf             643110000013
Q gi|254780479|r  205 LCPLCPIQKNCL  216 (356)
Q Consensus       205 ~C~~Cpl~~~C~  216 (356)
                      .|..||....|-
T Consensus       196 ~C~~CP~~~~Cw  207 (210)
T PHA00619        196 ECIECPFYNVCW  207 (210)
T ss_pred             CEEECCCEEEEE
T ss_conf             102477101664


No 244
>KOG2171 consensus
Probab=31.49  E-value=37  Score=14.55  Aligned_cols=16  Identities=6%  Similarity=0.189  Sum_probs=6.5

Q ss_pred             CCCCCCCCCHHHHHCC
Q ss_conf             4311000001343101
Q gi|254780479|r  206 CPLCPIQKNCLTFSEG  221 (356)
Q Consensus       206 C~~Cpl~~~C~~~~~~  221 (356)
                      |..|-....|..+...
T Consensus       584 y~~~~warmc~ilg~~  599 (1075)
T KOG2171         584 YMIAFWARMCRILGDD  599 (1075)
T ss_pred             HHHHHHHHHHHHHCHH
T ss_conf             8899999999996521


No 245
>PRK03352 DNA polymerase IV; Validated
Probab=31.27  E-value=32  Score=15.00  Aligned_cols=86  Identities=19%  Similarity=0.166  Sum_probs=44.5

Q ss_pred             CCHHHHHCCCHHHHHHHHHCCCHH-----HHHHHHHHHHHHHHHCCCCCC---CCHHHHH------HHHHHHHHHHHHHH
Q ss_conf             997898729989999997404634-----588999877675430057868---7314568------88887778776677
Q gi|254780479|r   69 PTIFCLSSAKDEEILSAWAGLGYY-----TRARNLKKCADIIVKKYEGNF---PHKVEIL------KKLPGIGDYTASAI  134 (356)
Q Consensus        69 P~~~~la~a~~~~vl~~w~gLGyy-----~Rar~l~~~a~~i~~~~~g~~---P~~~~~l------~~LpGiG~yta~ai  134 (356)
                      -+...+|..=.++|.+ |  +|..     +.-+-|=|.|--+.+ -+|.+   |++..++      .++||||+-|+.-.
T Consensus       118 ~~~~~~a~~ir~~I~~-~--tgl~~SiGIa~nk~LAKiAs~~~K-P~G~~~i~~~~~~~~l~~lpv~~i~GIG~~~~~kL  193 (345)
T PRK03352        118 DDPEEVAEEIRAVVLE-E--TGLSCSVGISDNKQRAKIATGFAK-PAGVFRLTDANWMAVMGDRPVDALWGVGPKTAKKL  193 (345)
T ss_pred             CCHHHHHHHHHHHHHH-H--HCCCCEECCHHHHHHHHHHHHHHH-CCCEEECCHHHHHHHHHCCCCHHCCCCCHHHHHHH
T ss_conf             9999999999999999-8--789824304079999999987610-48747448889999985278211278289999999


Q ss_pred             HHHHCCCEEEECCCCHHHHHHHHHC
Q ss_conf             6643496313124314666576519
Q gi|254780479|r  135 VAIAFNHFAVVVDTNIERIISRYFD  159 (356)
Q Consensus       135 ~s~a~~~~~~~vD~Nv~RVl~R~~~  159 (356)
                      .+++...-.=..+.+... |.+.||
T Consensus       194 ~~~GI~Ti~DL~~~~~~~-L~~~fG  217 (345)
T PRK03352        194 AALGITTVADLAATDPAV-LTATFG  217 (345)
T ss_pred             HHCCCCCHHHHHCCCHHH-HHHHHC
T ss_conf             975997199986699999-999978


No 246
>TIGR00372 cas4 CRISPR-associated protein Cas4; InterPro: IPR013343   This entry consists of conserved proteins found in many prokaryotic genomes whose genes are associated with CRISPRs (Clustered, Regularly Interspaced Short Palidromic Repeats). The function of these proteins has not been experimentally determined, but computational analysis suggests that they be nucleases like RecB (IPR004586 from INTERPRO), functioning as part of a hypothetical DNA repair system , . .
Probab=30.52  E-value=18  Score=16.82  Aligned_cols=16  Identities=31%  Similarity=0.930  Sum_probs=12.8

Q ss_pred             CCCCCCCCCCCCCHHH
Q ss_conf             2356431100000134
Q gi|254780479|r  202 NKPLCPLCPIQKNCLT  217 (356)
Q Consensus       202 ~~P~C~~Cpl~~~C~~  217 (356)
                      ++++|..|+.+..|..
T Consensus       191 ~~~kC~~C~y~~~C~~  206 (206)
T TIGR00372       191 KSRKCKFCPYREICLP  206 (206)
T ss_pred             CCCCCCCCCCCCCCCC
T ss_conf             8775777555212589


No 247
>pfam11798 IMS_HHH IMS family HHH motif. These proteins are involved in UV protection, eg.
Probab=30.48  E-value=14  Score=17.55  Aligned_cols=16  Identities=44%  Similarity=0.769  Sum_probs=13.2

Q ss_pred             HHHHHHHHHHHHHHHH
Q ss_conf             8888877787766776
Q gi|254780479|r  120 LKKLPGIGDYTASAIV  135 (356)
Q Consensus       120 l~~LpGiG~yta~ai~  135 (356)
                      +..+||||+.|+...-
T Consensus        14 i~~i~GIG~~~~~~L~   29 (33)
T pfam11798        14 ISKIPGIGRKTAEKLE   29 (33)
T ss_pred             CCCCCCCCHHHHHHHH
T ss_conf             2216886667899999


No 248
>KOG4233 consensus
Probab=30.42  E-value=19  Score=16.59  Aligned_cols=47  Identities=17%  Similarity=0.224  Sum_probs=30.0

Q ss_pred             HHHHHHHHCCCCCCCHHHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCHHHH
Q ss_conf             9998741025785226789999999-9863204123564311000001343
Q gi|254780479|r  169 KTIKNYARKITSTSRPGDFVQAMMD-LGALICTSNKPLCPLCPIQKNCLTF  218 (356)
Q Consensus       169 k~l~~~~~~~~~~~~~~~~nQAlMd-lGa~iC~p~~P~C~~Cpl~~~C~~~  218 (356)
                      |..-.+.+.++-+++-..|-..|.| .||+-|.++  +|..| |++.|..|
T Consensus        42 kAYvllGQfLllkKdE~lF~~Wlk~~~gat~~~a~--~~~~C-L~eWc~~F   89 (90)
T KOG4233          42 KAYVLLGQFLLLKKDEDLFQEWLKETCGATAKQAQ--DCFNC-LNEWCDCF   89 (90)
T ss_pred             HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHH--HHHHH-HHHHHHHH
T ss_conf             78999878897226699999999998284488899--99988-99999975


No 249
>pfam10273 WGG Pre-rRNA-processing protein TSR2. This entry represents the central conserved section of a family of proteins described as pre-rRNA-processing protein TSR2. The region has a distinctive WGG motif but the function is unknown.
Probab=30.39  E-value=39  Score=14.42  Aligned_cols=12  Identities=17%  Similarity=0.448  Sum_probs=4.3

Q ss_pred             HHHHHHHHHHHH
Q ss_conf             689999999999
Q gi|254780479|r    5 EHIIQSKILDWY   16 (356)
Q Consensus         5 ~~~~~~~ll~w~   16 (356)
                      +..+...++.||
T Consensus        33 ~~~l~~~i~~~f   44 (82)
T pfam10273        33 ADWLAGAIVDLF   44 (82)
T ss_pred             HHHHHHHHHHHH
T ss_conf             999999999998


No 250
>PRK13698 plasmid-partitioning protein; Provisional
Probab=30.38  E-value=39  Score=14.42  Aligned_cols=50  Identities=8%  Similarity=0.060  Sum_probs=25.5

Q ss_pred             HHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC-----C-CCHHHHHHHHHHHCCCC
Q ss_conf             88877787766776643496313124314666576519754-----5-73568999874102578
Q gi|254780479|r  122 KLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK-----P-APLYHKTIKNYARKITS  180 (356)
Q Consensus       122 ~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~-----~-~~~~~k~l~~~~~~~~~  180 (356)
                      ..|-||+-..+-         .-++||--+|-.+.+.+-+-     + ....-+.+.++++..-|
T Consensus       102 Q~PAigRr~~g~---------iEVlDGSRRR~a~il~~~Dy~VlV~el~Deq~~~Lsdv~N~~rp  157 (323)
T PRK13698        102 QTPAFGRRVSGV---------IEIADGSRRRKAAILTESDYRVLVGELDDEQMAALSRLGNDYRP  157 (323)
T ss_pred             CCCCCEEECCCC---------EEEECCHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHCCCC
T ss_conf             476210341796---------89803527778776068844899730887999999998750177


No 251
>TIGR00658 orni_carb_tr ornithine carbamoyltransferase; InterPro: IPR002292 The ornithine carbamoyltransferases are a family of enzymes that catalyse the production of citrulline from carbamoyl-phosphate and ornithine . This reaction is part of the arginine biosynthetic pathway, and in some organisms the reaction is also part of the arginine deaminase pathway . In most prokaryotes, and some lower eukaryotes, the enzyme is found in the cytoplasm, but in higher eukaryotes, such as mammals, the enzyme is found in the mitochondrial matrix and functions as part of the urea cycle. In most organisms analysed to date, the enzyme has been shown to consist of a trimer of identical or nonidentical subunits . Extensive similarity has been found within the ornithine carbamoyltransferase family, some of the conserved areas being important in catalysis .   The C-terminal region of these enzymes shows a degree of similarity to the aspartate carbamoyltransferases, which are also known to bind carbamoyl- phosphate . This region contains a highly conserved Cys residue (in a His-Cys-Lys-Pro motif) implicated in ornithine binding . ; GO: 0004585 ornithine carbamoyltransferase activity, 0006520 amino acid metabolic process, 0009348 ornithine carbamoyltransferase complex.
Probab=30.23  E-value=33  Score=14.88  Aligned_cols=50  Identities=32%  Similarity=0.458  Sum_probs=27.0

Q ss_pred             CCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHC--------CCCCCCHHHHHHHHHHH
Q ss_conf             96313124314666576519754573568999874102--------57852267899999999
Q gi|254780479|r  140 NHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARK--------ITSTSRPGDFVQAMMDL  194 (356)
Q Consensus       140 ~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~--------~~~~~~~~~~nQAlMdl  194 (356)
                      |...++=||  -|||||+...-.-=...+..++.+|+.        |++..||   .|+|=||
T Consensus        85 G~gEsi~DT--ARVLsRyVD~I~~R~~~h~~v~~lA~yasVPVINgLtD~~HP---cQ~LADl  142 (341)
T TIGR00658        85 GRGESIKDT--ARVLSRYVDGIMARVYKHEDVEELAKYASVPVINGLTDLFHP---CQALADL  142 (341)
T ss_pred             CCCCCCCCH--HHHHCCEEEEEEEECCCHHHHHHHHHHCCCCEECCCCCCCCH---HHHHHHH
T ss_conf             368870001--102031565677732553789999854898546055685523---7999998


No 252
>PTZ00205 DNA polymerase kappa; Provisional
Probab=30.05  E-value=29  Score=15.34  Aligned_cols=19  Identities=26%  Similarity=0.688  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q ss_conf             8888877787766776643
Q gi|254780479|r  120 LKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       120 l~~LpGiG~yta~ai~s~a  138 (356)
                      +.++||||+-|+.-..+++
T Consensus       311 vRKIpGIGkVte~~L~alG  329 (571)
T PTZ00205        311 LRSVPGVGKVTEALLKGLG  329 (571)
T ss_pred             CCCCCCCCHHHHHHHHHCC
T ss_conf             7668975688899998759


No 253
>TIGR02545 ATP_syn_fliI flagellar protein export ATPase FliI; InterPro: IPR013379    FliI proteins are involved in bacterial flagellum systems, acting to drive protein export for flagellar biosynthesis. The most closely related proteins are the YscN proteins of bacterial type III secretion systems.; GO: 0016887 ATPase activity, 0001539 ciliary or flagellar motility, 0009296 flagellum biogenesis, 0009288 flagellin-based flagellum.
Probab=29.46  E-value=26  Score=15.64  Aligned_cols=15  Identities=27%  Similarity=0.468  Sum_probs=6.6

Q ss_pred             ECCCCHHHHHHHHHCCC
Q ss_conf             12431466657651975
Q gi|254780479|r  145 VVDTNIERIISRYFDII  161 (356)
Q Consensus       145 ~vD~Nv~RVl~R~~~~~  161 (356)
                      +|||||  ||+|-++-.
T Consensus       331 iLDGHI--vLsR~lA~~  345 (439)
T TIGR02545       331 ILDGHI--VLSRKLAER  345 (439)
T ss_pred             HCCCEE--EECHHHHHC
T ss_conf             102207--732267645


No 254
>PRK13473 gamma-aminobutyraldehyde dehydrogenase; Provisional
Probab=29.46  E-value=40  Score=14.32  Aligned_cols=56  Identities=20%  Similarity=0.277  Sum_probs=32.5

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCHH-HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9997404634588999877675430057----------86-87314-568888877787766776643
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHKV-EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~~-~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+.|+-+.+..|+.-|.++|..+.++.+          |+ +.... .|+...-..=+|.|+....+.
T Consensus        52 ~~~W~~~~~~~R~~~L~k~a~~l~~~~~~la~~~~~e~GKp~~~a~~~ev~~~i~~~~~~a~~~~~~~  119 (475)
T PRK13473         52 FPEWSQTTPKERAEALLKLADAIEENADEFARLESLNCGKPLHLVLNDEIPAIVDVFRFFAGAARCLE  119 (475)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC
T ss_conf             99882599999999999999999978999999999984987999999999999999999998888534


No 255
>KOG1338 consensus
Probab=29.29  E-value=40  Score=14.30  Aligned_cols=93  Identities=13%  Similarity=0.294  Sum_probs=58.6

Q ss_pred             HHHHHCCHHHCCHHHHHHHHHCCCHHHHHCC---CHHHHHHHHHCCCHHH---HHHHHHHHHHHHHHCCCCCCCCHHHHH
Q ss_conf             6731161203326589987748997898729---9899999974046345---889998776754300578687314568
Q gi|254780479|r   47 IMLQQTTVKTVEPYFKKFMQKWPTIFCLSSA---KDEEILSAWAGLGYYT---RARNLKKCADIIVKKYEGNFPHKVEIL  120 (356)
Q Consensus        47 imlqqT~v~~v~~~~~~~~~~~P~~~~la~a---~~~~vl~~w~gLGyy~---Rar~l~~~a~~i~~~~~g~~P~~~~~l  120 (356)
                      +||.--++.+- ..|...+++||++..+-..   +++|.--+-.|++--.   -+.++++--.-++..+++.+|.-+.  
T Consensus        96 vll~E~~~pq~-SrWrPYfs~wp~p~rm~spifWdEnEl~~Ll~stvlee~~Kd~aeI~~~~i~~i~pf~~~~p~vfs--  172 (466)
T KOG1338          96 VLLREKKMPQK-SRWRPYFSRWPQPARMHSPIFWDENELSMLLCSTVLEETVKDKAEIEKDFIFVIQPFKQHCPIVFS--  172 (466)
T ss_pred             HHHHHHHCCCC-CCCCCHHHHCCCHHHCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHC--
T ss_conf             99987505320-234407773798665278755780577777523452266768999999999999999876854103--


Q ss_pred             HHHHHHHH--HHHHHHHHHHCCCEEE
Q ss_conf             88887778--7766776643496313
Q gi|254780479|r  121 KKLPGIGD--YTASAIVAIAFNHFAV  144 (356)
Q Consensus       121 ~~LpGiG~--yta~ai~s~a~~~~~~  144 (356)
                        .|-.-.  |.++-+++.+|+.+.-
T Consensus       173 --~~slEdF~y~~Al~laysfdve~~  196 (466)
T KOG1338         173 --RPSLEDFMYAYALGLAYSFDVEFL  196 (466)
T ss_pred             --CCCHHHHHHHHHHHHHHHEEEEHH
T ss_conf             --537888999999998883210020


No 256
>cd03312 CIMS_N_terminal_like CIMS - Cobalamine-independent methonine synthase, or MetE, N-terminal domain_like. Many members have been characterized as 5-methyltetrahydropteroyltriglutamate-homocysteine methyltransferases, EC:2.1.1.14, mostly from bacteria and plants. This enzyme catalyses the last step in the production of methionine by transferring a methyl group from 5-methyltetrahydrofolate to L-homocysteine without using an intermediate methyl carrier. The active enzyme has a dual (beta-alpha)8-barrel structure, and this model covers the N-terminal barrel, and a few single-barrel sequences most similar to the N-terminal barrel. It is assumed that the homologous N-terminal barrel has evolved from the C-terminus via gene duplication and has subsequently lost binding sites, and it seems as if the two barrels forming the active enzyme may sometimes reside on different polypeptides. The C-terminal domain incorporates the Zinc ion, which binds and activates homocysteine. Side chains fro
Probab=28.78  E-value=11  Score=18.17  Aligned_cols=15  Identities=13%  Similarity=0.155  Sum_probs=5.3

Q ss_pred             HHHHHHHCCCCCCCC
Q ss_conf             767543005786873
Q gi|254780479|r  101 CADIIVKKYEGNFPH  115 (356)
Q Consensus       101 ~a~~i~~~~~g~~P~  115 (356)
                      |-|....+|.=.+|+
T Consensus       108 MtKWFdTNYHYiVPE  122 (360)
T cd03312         108 MTKWFDTNYHYIVPE  122 (360)
T ss_pred             HHCCCCCCCCEECCC
T ss_conf             651036776247011


No 257
>TIGR02535 hyp_Hser_kinase proposed homoserine kinase; InterPro: IPR013371    The genes encoding proteins in this entry are largely adjacent to genes involved in the biosynthesis of threonine (aspartate kinase, homoserine dehydrogenase and threonine synthase) in genomes which are lacking any other known homoserine kinase, and in which the presence of a homoserine kinase would indicate a complete pathway for the biosynthesis of threonine. These proteins are homologous to the archaeal form of 2,3-bisphosphoglycerate-independent phosphoglycerate mutase and form part of a superfamily of metalloenzymes including phosphopentomutases, alkaline phosphatases and sulphatases. The proposal that these proteins encode a kinase is based on analogy to phosphomutases which are intramolecular phosphotransferases. A mutase active site could evolve to bring together homoserine and a phosphate donor such as phosphoenolpyruvate, resulting in kinase activity.; GO: 0004619 phosphoglycerate mutase activity.
Probab=28.59  E-value=3.4  Score=21.89  Aligned_cols=23  Identities=9%  Similarity=0.196  Sum_probs=12.0

Q ss_pred             CCCCC-CHHHHHHHHHHHHHHHCC
Q ss_conf             57852-267899999999863204
Q gi|254780479|r  178 ITSTS-RPGDFVQAMMDLGALICT  200 (356)
Q Consensus       178 ~~~~~-~~~~~nQAlMdlGa~iC~  200 (356)
                      ++|.. .-.+|=+-||.-.+.|=+
T Consensus       200 lvPeg~~~~~~l~~Li~~S~~iL~  223 (431)
T TIGR02535       200 LVPEGGEEAELLKDLILKSAEILK  223 (431)
T ss_pred             CCCCCCCCHHHHHHHHHHHHHHHC
T ss_conf             577888734899999999999860


No 258
>cd01703 Pol_iota Pol iota is member of the DNA polymerase Y-family, and has also been called Rad30 homolog B. Unlike classic DNA polymerases,Y-family polymerases are induced by DNA damage. They can transverse normal replication-blocking DNA lesions. Unlike Pol eta, Pol iota is unable to replicate through a cis-syn T-T dimer. In human Pol iota, the base-pairing mode in the active site at the replicative end mat bee Hoogsteen instead of Watson-Click. Human Pol iota can incorporate the correct nucleotide opposite a purine much more efficiently than opposite a pyrimidine. Pol iota prefers to insert Guanosine instead of Adenosine opposite Thymidine.
Probab=28.51  E-value=23  Score=15.98  Aligned_cols=53  Identities=26%  Similarity=0.322  Sum_probs=28.5

Q ss_pred             HHHCCCHH-----HHHHHHHHHHHHHHHCCCC---CCCCHHHH-------HHHHHHHHHHHHHHHHHHH
Q ss_conf             97404634-----5889998776754300578---68731456-------8888877787766776643
Q gi|254780479|r   85 AWAGLGYY-----TRARNLKKCADIIVKKYEG---NFPHKVEI-------LKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        85 ~w~gLGyy-----~Rar~l~~~a~~i~~~~~g---~~P~~~~~-------l~~LpGiG~yta~ai~s~a  138 (356)
                      .++-|||-     +.-+-|=|.|--..+ -+|   .+|.+..+       +.++||||+.|+.-+.+..
T Consensus       163 I~~~lGlt~S~GIa~NKlLAKlas~~~K-Pngqt~l~p~~~~~~l~~~~~i~ki~GIG~kt~~~L~~~g  230 (394)
T cd01703         163 IENELGLTCCAGIASNKLLAKLVGSVHK-PNQQTTLLPESADHLIMDLGSLRKIPGIGYKTAKKLEDHG  230 (394)
T ss_pred             HHHHHCCCEEEEECCCHHHHHHHHHCCC-CCCEEEECCCCHHHHHCCCCCHHHHCCCCHHHHHHHHHCC
T ss_conf             9998799767766340328987753258-8980597771187784366880552781799999999838


No 259
>cd07108 ALDH_MGR_2402 Magnetospirillum NAD(P)+-dependent aldehyde dehydrogenase MSR-1-like. NAD(P)+-dependent aldehyde dehydrogenase of Magnetospirillum gryphiswaldense MSR-1 (MGR_2402) , and other similar sequences, are present in this CD.
Probab=28.17  E-value=39  Score=14.41  Aligned_cols=53  Identities=21%  Similarity=0.256  Sum_probs=31.8

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CC-CHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9997404634588999877675430057----------86-87-314568888877787766776
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FP-HKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P-~~~~~l~~LpGiG~yta~ai~  135 (356)
                      -+.|+.+..+.|++-|.++|..|.++.+          |+ +. +...|+...-..=+|+|+...
T Consensus        32 ~~~W~~~~~~~R~~il~~~a~~l~~~~~~la~~~~~E~GK~l~~~a~~Ev~~~~~~~~~~a~~a~   96 (457)
T cd07108          32 FPEWAATPARERGKLLARIADALEARSEELARLLALETGNALRTQARPEAAVLADLFRYFGGLAG   96 (457)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             89870299999999999999999978999999999870841188999999999999999997788


No 260
>TIGR02061 aprA adenylylsulfate reductase, alpha subunit; InterPro: IPR011803    During dissimilatory sulphate reduction or sulphur oxidation, adenylylsulphate (APS) reductase catalyzes reversibly the two-electron reduction of APS to sulphite and AMP. Found in several bacterial lineages and in Archaeoglobales, APS reductase is a heterodimer composed of an alpha subunit containing a noncovalently bound FAD, and a beta subunit containing two [4Fe-4S] clusters. This entry describes the alpha subunit of APS reductase, which shares a common evolutionary origin with fumarate reductase/succinate dehydrogenase flavoproteins..
Probab=27.94  E-value=33  Score=14.95  Aligned_cols=24  Identities=21%  Similarity=0.403  Sum_probs=15.0

Q ss_pred             HHHHHHCCCHHHHHCCCHHHHHHHHH
Q ss_conf             99877489978987299899999974
Q gi|254780479|r   62 KKFMQKWPTIFCLSSAKDEEILSAWA   87 (356)
Q Consensus        62 ~~~~~~~P~~~~la~a~~~~vl~~w~   87 (356)
                      ..||+  =|.+.||+-++-|+|++||
T Consensus       562 la~le--EDleKLAArdlHELlRaWE  585 (651)
T TIGR02061       562 LALLE--EDLEKLAARDLHELLRAWE  585 (651)
T ss_pred             HHHHH--HHHHHHHHHHHHHHHHHHH
T ss_conf             87764--1177887877889998888


No 261
>PTZ00154 40S ribosomal protein S17; Provisional
Probab=27.70  E-value=34  Score=14.84  Aligned_cols=29  Identities=34%  Similarity=0.518  Sum_probs=24.0

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHH
Q ss_conf             88999877675430057868731456888
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKK  122 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~  122 (356)
                      |-+.++++|+.|++.|-..|-.|++.=..
T Consensus         5 RtktvKraAr~iiEkYy~~lt~DF~~NK~   33 (130)
T PTZ00154          5 RTKTVKRAARQIVEKYYAKLTLDFQINKK   33 (130)
T ss_pred             CHHHHHHHHHHHHHHHHHHHCCCHHHHHH
T ss_conf             00599999999999803141454777479


No 262
>KOG2966 consensus
Probab=27.48  E-value=22  Score=16.17  Aligned_cols=11  Identities=36%  Similarity=0.863  Sum_probs=5.7

Q ss_pred             HHHHHHHHHCC
Q ss_conf             99999863204
Q gi|254780479|r  190 AMMDLGALICT  200 (356)
Q Consensus       190 AlMdlGa~iC~  200 (356)
                      -+.++|++||.
T Consensus       149 ~v~vl~~~V~l  159 (325)
T KOG2966         149 NVIVLGDLVCL  159 (325)
T ss_pred             EEEHHHHHHHH
T ss_conf             11215217876


No 263
>TIGR02418 acolac_catab acetolactate synthase, catabolic; InterPro: IPR012782    Acetolactate synthase is a thiamin pyrophosphate-dependent enzyme that combines two molecules of pyruvate to yield 2-acetolactate with the release of CO2. It exists in two distinct forms that have different properties, though they are clearly related , . The biosynthetic form, found in plants, fungi and bacteria, is involved in the biosynthesis of branched-chain amino acids and contains non-catalytic FAD. The catabolic form is found only in some bacteria, participates in butanediol fermentation, and does not contain FAD.   This entry represents the catabolic form of acetolactate synthase, generally found in species that posses acetolactate decarboxylase, which is often encoded in the same operon. The enzyme is thought to be a homotetramer, though this has not conclusively been shown. Each monomer consists of three distinct domains, designated, starting from the N-terminus, as alpha, beta and gamma . The core of the alpha and gamma domains is a six-stranded parallel beta sheet surrounded by alpha helices, while the the beta domain forms a double Rossman fold. The thiamin pyrophosphate cofactor has an unusual conformation that has not been seen in other enzymes using this cofactor, including the biosynthetic form of acetolactate synthase. The active site is located in a funnel at the dimer interface, bordered by residues from all three domains.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding.
Probab=27.40  E-value=17  Score=16.97  Aligned_cols=30  Identities=17%  Similarity=0.305  Sum_probs=28.3

Q ss_pred             HHHHHHHHHHHHHHHHCCCEEEECCCCHHH
Q ss_conf             887778776677664349631312431466
Q gi|254780479|r  123 LPGIGDYTASAIVAIAFNHFAVVVDTNIER  152 (356)
Q Consensus       123 LpGiG~yta~ai~s~a~~~~~~~vD~Nv~R  152 (356)
                      =||||.=+.|-.-+=+=|-|+.++=|+|+|
T Consensus        75 GPG~sNL~TGl~TAn~EGDPVvAiGG~VkR  104 (553)
T TIGR02418        75 GPGVSNLVTGLLTANSEGDPVVAIGGQVKR  104 (553)
T ss_pred             CCCHHHHHHHHHEECCCCCHHHHCCCCCCH
T ss_conf             888144674331015677635411677632


No 264
>cd07146 ALDH_PhpJ Streptomyces putative phosphonoformaldehyde dehydrogenase PhpJ-like. Putative phosphonoformaldehyde dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog reportedly involved in the biosynthesis of phosphinothricin tripeptides in Streptomyces viridochromogenes DSM 40736, and similar sequences are included in this CD.
Probab=27.34  E-value=38  Score=14.46  Aligned_cols=26  Identities=27%  Similarity=0.323  Sum_probs=18.5

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHC
Q ss_conf             99974046345889998776754300
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKK  108 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~  108 (356)
                      .+.|+.++...|++-|+++|.++.++
T Consensus        31 ~~a~~~~~~~eR~~~l~~~a~~l~~~   56 (451)
T cd07146          31 ASYRSTLTRYQRSAILNKAAALLEAR   56 (451)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHH
T ss_conf             45532099999999999999999978


No 265
>COG3600 GepA Uncharacterized phage-associated protein [Function unknown]
Probab=27.25  E-value=28  Score=15.44  Aligned_cols=70  Identities=21%  Similarity=0.248  Sum_probs=41.0

Q ss_pred             HCCCHHHHHHHHHHHHHHHHH------CCCCCCCCHHHHHHHHHHHHHH--HHHHHHHHHCCCEEEECCCCHHHHHHHHH
Q ss_conf             404634588999877675430------0578687314568888877787--76677664349631312431466657651
Q gi|254780479|r   87 AGLGYYTRARNLKKCADIIVK------KYEGNFPHKVEILKKLPGIGDY--TASAIVAIAFNHFAVVVDTNIERIISRYF  158 (356)
Q Consensus        87 ~gLGyy~Rar~l~~~a~~i~~------~~~g~~P~~~~~l~~LpGiG~y--ta~ai~s~a~~~~~~~vD~Nv~RVl~R~~  158 (356)
                      +-|=||.-+..|+..-+-+++      +||-.+|.-+.+..+   =|+|  |-.+...--++-....+|++|.++|+|+.
T Consensus        31 qKLlYyA~~~~L~~~~~pL~~~~ieAW~~GPVip~~Yn~~K~---~Gs~~I~~r~~~~~l~~~~~~~iD~~~s~~L~~Vw  107 (154)
T COG3600          31 QKLLYYAHGWFLAVTGRPLFDEKIEAWKHGPVIPSLYNAFKQ---YGSNSIDERLPVRGLSNGNALPIDADVSAILARVW  107 (154)
T ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCHHHHHHHHH---CCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHH
T ss_conf             999999999999980885632378887618873999999997---08877871122117776204753214999999999


Q ss_pred             C
Q ss_conf             9
Q gi|254780479|r  159 D  159 (356)
Q Consensus       159 ~  159 (356)
                      .
T Consensus       108 ~  108 (154)
T COG3600         108 D  108 (154)
T ss_pred             H
T ss_conf             9


No 266
>COG1468 CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]
Probab=26.81  E-value=23  Score=16.03  Aligned_cols=17  Identities=24%  Similarity=0.682  Sum_probs=14.8

Q ss_pred             CCCCCCCCCCCCCCHHH
Q ss_conf             12356431100000134
Q gi|254780479|r  201 SNKPLCPLCPIQKNCLT  217 (356)
Q Consensus       201 p~~P~C~~Cpl~~~C~~  217 (356)
                      ++++.|..|+++..|..
T Consensus       172 ~~~~~C~~C~y~~iC~~  188 (190)
T COG1468         172 KKKKKCKKCAYREICFP  188 (190)
T ss_pred             CCCCCCCCCCCCEECCC
T ss_conf             99886999986334267


No 267
>cd07151 ALDH_HBenzADH NADP+-dependent p-hydroxybenzaldehyde dehydrogenase-like. NADP+-dependent, p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH) which catalyzes oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic acid and other related sequences are included in this CD.
Probab=26.44  E-value=45  Score=13.96  Aligned_cols=56  Identities=20%  Similarity=0.129  Sum_probs=35.9

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCC-CCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             999740463458899987767543005----------786-87314568888877787766776643
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKKY----------EGN-FPHKVEILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~-~P~~~~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+.|+-+..+.|++-|+++|..|.++.          .|+ +.+...|+.....+-+|.|+......
T Consensus        45 ~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~~e~Gk~~~~a~~ev~~~~~~~~~~a~~~~~~~  111 (465)
T cd07151          45 QKEWAATLPQERAEILEKAAQILEERRDEIVEWLIRESGSTRIKANIEWGAAMAITREAATFPLRME  111 (465)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf             8987329999999999999999997899999999997299889999999999999999987899831


No 268
>pfam03118 RNA_pol_A_CTD Bacterial RNA polymerase, alpha chain C terminal domain. The alpha subunit of RNA polymerase consists of two independently folded domains, referred to as amino-terminal and carboxyl terminal domains. The amino terminal domain is involved in the interaction with the other subunits of the RNA polymerase. The carboxyl-terminal domain interacts with the DNA and activators. The amino acid sequence of the alpha subunit is conserved in prokaryotic and chloroplast RNA polymerases. There are three regions of particularly strong conservation, two in the amino-terminal and one in the carboxyl- terminal.
Probab=26.22  E-value=21  Score=16.29  Aligned_cols=47  Identities=23%  Similarity=0.159  Sum_probs=33.7

Q ss_pred             HCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH
Q ss_conf             4046345889998776754300578687314568888877787766776
Q gi|254780479|r   87 AGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus        87 ~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~  135 (356)
                      +-|+.-.|+.|-.+.+.+--  .+.-+-.+.++|+++|++|+.+-.-|-
T Consensus        11 ~~L~LS~R~~N~Lk~~~I~t--v~dL~~~s~~dLl~i~N~G~kSl~EI~   57 (62)
T pfam03118        11 EELELSVRSYNCLKRAGINT--VGDLLSKSEEDLLKIKNFGKKSLEEIK   57 (62)
T ss_pred             HHHCCCHHHHHHHHHCCCCC--HHHHHHCCHHHHHHCCCCCHHHHHHHH
T ss_conf             98168689999998949967--999985899999748898685799999


No 269
>PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional
Probab=26.08  E-value=44  Score=14.03  Aligned_cols=23  Identities=22%  Similarity=0.225  Sum_probs=9.8

Q ss_pred             CCCCCCHHHHHHHHHHHHHHHHH
Q ss_conf             78687314568888877787766
Q gi|254780479|r  110 EGNFPHKVEILKKLPGIGDYTAS  132 (356)
Q Consensus       110 ~g~~P~~~~~l~~LpGiG~yta~  132 (356)
                      |..||-|.+..-++=|...-+.+
T Consensus       202 GT~~pIDR~~tA~~LGF~~~~~N  224 (614)
T PRK12308        202 GTAYPIDREALAHNLGFRRATRN  224 (614)
T ss_pred             CCCCCCCHHHHHHHCCCCCCCCC
T ss_conf             89898799999997599986678


No 270
>COG1383 RPS17A Ribosomal protein S17E [Translation, ribosomal structure and biogenesis]
Probab=25.92  E-value=43  Score=14.13  Aligned_cols=29  Identities=28%  Similarity=0.503  Sum_probs=24.4

Q ss_pred             HHHHHHHHHHHHHHCCCCCCCCHHHHHHH
Q ss_conf             88999877675430057868731456888
Q gi|254780479|r   94 RARNLKKCADIIVKKYEGNFPHKVEILKK  122 (356)
Q Consensus        94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~  122 (356)
                      |-++.+.+|+.|.+.|...|-.|++.=..
T Consensus         5 R~~~vKR~a~el~ekY~~~ft~dFe~NK~   33 (74)
T COG1383           5 RPKFVKRTARELIEKYPDKFTDDFETNKR   33 (74)
T ss_pred             CHHHHHHHHHHHHHHHHHHHCCCHHHHHH
T ss_conf             25799999999999857775232788899


No 271
>cd07102 ALDH_EDX86601 Uncharacterized aldehyde dehydrogenase of Synechococcus sp. PCC 7335 (EDX86601). Uncharacterized aldehyde dehydrogenase of Synechococcus sp. PCC 7335 (locus EDX86601) and other similar sequences, are present in this CD.
Probab=25.90  E-value=46  Score=13.89  Aligned_cols=25  Identities=8%  Similarity=0.329  Sum_probs=14.9

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHH
Q ss_conf             9997404634588999877675430
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVK  107 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~  107 (356)
                      .+.|+-+..+.|++-|++.|..+.+
T Consensus        31 ~~~W~~~~~~eR~~~L~~~a~~l~~   55 (452)
T cd07102          31 QKGWRAVPLEERKAIVTRAVELLAA   55 (452)
T ss_pred             HHHHCCCCHHHHHHHHHHHHHHHHH
T ss_conf             9988029999999999999999996


No 272
>pfam06754 PhnG Phosphonate metabolism protein PhnG. This family consists of several bacterial phosphonate metabolism protein PhnG sequences. In Escherichia coli, the phn operon encodes proteins responsible for the uptake and breakdown of phosphonates. The exact function of PhnG is unknown, however it is thought likely that along with six other proteins PhnG makes up the the C-P (carbon-phosphorus) lyase.
Probab=25.85  E-value=44  Score=14.02  Aligned_cols=20  Identities=35%  Similarity=0.615  Sum_probs=17.6

Q ss_pred             HHHHCCCHHHHHHHHHCCCH
Q ss_conf             89872998999999740463
Q gi|254780479|r   72 FCLSSAKDEEILSAWAGLGY   91 (356)
Q Consensus        72 ~~la~a~~~~vl~~w~gLGy   91 (356)
                      .-||.|+.+++..+|++|+.
T Consensus        11 s~LA~a~~~~L~~~~~~~~~   30 (147)
T pfam06754        11 AVLARAPAAELAALWAALGL   30 (147)
T ss_pred             HHHCCCCHHHHHHHHHHCCC
T ss_conf             99814999999999985588


No 273
>PRK13181 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional
Probab=25.84  E-value=15  Score=17.37  Aligned_cols=11  Identities=36%  Similarity=0.836  Sum_probs=5.9

Q ss_pred             HHHHHHHHHHH
Q ss_conf             88887778776
Q gi|254780479|r  121 KKLPGIGDYTA  131 (356)
Q Consensus       121 ~~LpGiG~yta  131 (356)
                      +-|||||.++.
T Consensus        41 lIlPGvG~~~~   51 (199)
T PRK13181         41 VILPGVGAFGQ   51 (199)
T ss_pred             EECCCCCCCCH
T ss_conf             04579865330


No 274
>PRK09457 astD succinylglutamic semialdehyde dehydrogenase; Reviewed
Probab=25.79  E-value=46  Score=13.88  Aligned_cols=23  Identities=26%  Similarity=0.546  Sum_probs=10.9

Q ss_pred             HHHHCCCHHHHHHHHHHHHHHHH
Q ss_conf             99740463458899987767543
Q gi|254780479|r   84 SAWAGLGYYTRARNLKKCADIIV  106 (356)
Q Consensus        84 ~~w~gLGyy~Rar~l~~~a~~i~  106 (356)
                      +.|+.+....|++-|++.|..+.
T Consensus        39 ~~W~~~~~~eR~~~L~~~a~~l~   61 (475)
T PRK09457         39 PAWARLSFEARQAIVERFAALLE   61 (475)
T ss_pred             HHHCCCCHHHHHHHHHHHHHHHH
T ss_conf             99815999999999999999999


No 275
>KOG3771 consensus
Probab=25.68  E-value=47  Score=13.87  Aligned_cols=83  Identities=14%  Similarity=0.221  Sum_probs=42.8

Q ss_pred             CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHH
Q ss_conf             96716899999999999628998875485544456787046766446731161203326589987748997898729989
Q gi|254780479|r    1 MPQPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDE   80 (356)
Q Consensus         1 ~~~~~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~   80 (356)
                      |+.-...|++.|++-|+-   +.+=|...    ..+-+-=.++..|.|  |+-+++|+-=-.+|+..||+++....-...
T Consensus        63 m~~a~~~l~e~l~eiy~p---~~~g~~~l----~~v~~~~d~l~~d~~--~~l~d~vl~pl~~~~~~fpdik~~i~KR~~  133 (460)
T KOG3771          63 MLAASKKLAESLQEIYEP---DWPGRDYL----QAVADNDDLLWKDLD--QKLVDQVLLPLDTYLGQFPDIKKAIAKRGR  133 (460)
T ss_pred             HHHHHHHHHHHHHHHCCC---CCCHHHHH----HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHCHHHHHHHHHHCC
T ss_conf             999999999999985476---32308999----999987777899998--888876321598763103458999986140


Q ss_pred             HHHHHHHCCCHHHHHHHHH
Q ss_conf             9999974046345889998
Q gi|254780479|r   81 EILSAWAGLGYYTRARNLK   99 (356)
Q Consensus        81 ~vl~~w~gLGyy~Rar~l~   99 (356)
                      .++       =|.|+||=.
T Consensus       134 Kl~-------DyD~~r~~~  145 (460)
T KOG3771         134 KLV-------DYDSARHSF  145 (460)
T ss_pred             HHH-------HHHHHHHHH
T ss_conf             221-------268788899


No 276
>cd07098 ALDH_F15-22 Aldehyde dehydrogenase family 15A1 and 22A1-like. Aldehyde dehydrogenase family members ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1 (Arabidopsis thaliana, EC=1.2.1.3), and similar sequences, are in this CD. Significant improvement of stress tolerance in tobacco plants was observed by overexpressing the ALDH22A1 gene from maize (Zea mays) and was accompanied by a reduction of malondialdehyde  derived from cellular lipid peroxidation.
Probab=25.52  E-value=47  Score=13.85  Aligned_cols=26  Identities=19%  Similarity=0.410  Sum_probs=17.4

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHC
Q ss_conf             99974046345889998776754300
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKK  108 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~  108 (356)
                      .+.|+-+..+.|+.-|+++|..+.++
T Consensus        31 ~~~W~~~~~~~R~~~L~~~a~~l~~~   56 (465)
T cd07098          31 QREWAKTSFAERRKVLRSLLKYILEN   56 (465)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHH
T ss_conf             99983299999999999999999978


No 277
>TIGR02923 AhaC ATP synthase A1, C subunit; InterPro: IPR014272   A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition to V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex, consisting of seven subunits, contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex, consisting of at least two subunits, forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species.   This entry represents subunit C of the A1 component.   More information about this protein can be found at Protein of the Month: ATP Synthases ..
Probab=25.31  E-value=46  Score=13.92  Aligned_cols=19  Identities=32%  Similarity=0.305  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHHCCHHHCC
Q ss_conf             4676644673116120332
Q gi|254780479|r   40 YKVWISEIMLQQTTVKTVE   58 (356)
Q Consensus        40 y~v~vseimlqqT~v~~v~   58 (356)
                      |.-.+||+.-|-+-|.-|.
T Consensus        47 Y~~~~d~l~~~~~g~~~~E   65 (353)
T TIGR02923        47 YKKELDELGSKYEGVDLVE   65 (353)
T ss_pred             HHHHHHHHHCCCCCCHHHH
T ss_conf             3789988621267844899


No 278
>pfam02961 BAF Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration.
Probab=25.19  E-value=16  Score=17.15  Aligned_cols=74  Identities=19%  Similarity=0.288  Sum_probs=41.2

Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHH
Q ss_conf             73145688888777877667766434963131243146665765197545735689998741025785226789999999
Q gi|254780479|r  114 PHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMD  193 (356)
Q Consensus       114 P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMd  193 (356)
                      |-.-.....|||||+--+.-.-.-+|++                            .---+-+.|+-..+-..|-.-|+|
T Consensus        15 PMG~K~V~~l~GIG~~lg~~L~~kGfdK----------------------------AyvvLGqfLvl~kde~~F~~Wlk~   66 (89)
T pfam02961        15 PMGDKPVTALAGIGEVLGGRLEDKGFDK----------------------------AYVVLGQFLLLKKDEELFKEWLKD   66 (89)
T ss_pred             CCCCCCCCCCCCCCHHHHHHHHHCCCCH----------------------------HHHHHHHHHHHCCCHHHHHHHHHH
T ss_conf             7778874505786289988999848539----------------------------999999999856649999999999


Q ss_pred             H-HHHHCCCCCCCCCCCCCCCCHHHH
Q ss_conf             9-863204123564311000001343
Q gi|254780479|r  194 L-GALICTSNKPLCPLCPIQKNCLTF  218 (356)
Q Consensus       194 l-Ga~iC~p~~P~C~~Cpl~~~C~~~  218 (356)
                      . ||.-...  -+|-.| |++.|.+|
T Consensus        67 ~~gAn~kQa--~dcy~c-LkeWCd~F   89 (89)
T pfam02961        67 TCGANAKQA--RDCYNC-LKEWCSCF   89 (89)
T ss_pred             HCCCCHHHH--HHHHHH-HHHHHHCC
T ss_conf             808888899--889999-99998629


No 279
>cd04663 Nudix_Hydrolase_6 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belong to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V) which functions as metal binding and catalytic site. Substrates of nudix hydrolase include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate specificity are 
Probab=25.09  E-value=48  Score=13.79  Aligned_cols=36  Identities=6%  Similarity=-0.032  Sum_probs=24.6

Q ss_pred             CEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC
Q ss_conf             60577645720132047442111134672158886301010
Q gi|254780479|r  250 NRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT  290 (356)
Q Consensus       250 ~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~  290 (356)
                      .++|+.+-|.    +| |++|.+..+.++........+...
T Consensus        14 ~elL~f~hp~----~g-~q~p~G~ve~gE~~~~aa~RE~~E   49 (126)
T cd04663          14 LELLVFEHPL----AG-FQIVKGTVEPGETPEAAALRELQE   49 (126)
T ss_pred             EEEEEEECCC----CC-EECCCCCCCCCCCHHHHHHHHHHH
T ss_conf             6899981788----85-476576758998989999999888


No 280
>pfam06424 PRP1_N PRP1 splicing factor, N-terminal. This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.
Probab=25.02  E-value=16  Score=17.08  Aligned_cols=16  Identities=31%  Similarity=0.549  Sum_probs=12.3

Q ss_pred             CHHHHHHHHHHHHHHH
Q ss_conf             3145688888777877
Q gi|254780479|r  115 HKVEILKKLPGIGDYT  130 (356)
Q Consensus       115 ~~~~~l~~LpGiG~yt  130 (356)
                      -+.++|.+||-||+||
T Consensus       115 vt~eeW~~IPE~GD~t  130 (132)
T pfam06424       115 VTEDEWANIPEAGDYT  130 (132)
T ss_pred             CCHHHHHCCCCCCCCC
T ss_conf             8899997188532225


No 281
>cd02027 APSK Adenosine 5'-phosphosulfate kinase (APSK) catalyzes the phosphorylation of adenosine 5'-phosphosulfate to form 3'-phosphoadenosine 5'-phosphosulfate (PAPS). The end-product PAPS is a biologically "activated" sulfate form important for the assimilation of inorganic sulfate.
Probab=24.79  E-value=47  Score=13.83  Aligned_cols=32  Identities=22%  Similarity=0.225  Sum_probs=24.8

Q ss_pred             HHHHHHHHHHHHHHHH---HHCCCEEEECCCCHHH
Q ss_conf             8888777877667766---4349631312431466
Q gi|254780479|r  121 KKLPGIGDYTASAIVA---IAFNHFAVVVDTNIER  152 (356)
Q Consensus       121 ~~LpGiG~yta~ai~s---~a~~~~~~~vD~Nv~R  152 (356)
                      .-|||-|..|-+-.+.   ...+.++..+||...|
T Consensus         5 tGLsgsGKTTlA~~l~~~L~~~~~~~~~lDGD~iR   39 (149)
T cd02027           5 TGLSGSGKSTIARALEEKLFQRGRPVYVLDGDNVR   39 (149)
T ss_pred             ECCCCCCHHHHHHHHHHHHHHCCCCEEEECCHHHH
T ss_conf             79999999999999999999869975997748899


No 282
>cd03740 SOCS_SOCS6 SOCS (suppressors of cytokine signaling) box of SOCS6-like proteins. Together with CIS1, the CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
Probab=24.67  E-value=45  Score=14.00  Aligned_cols=22  Identities=23%  Similarity=0.298  Sum_probs=17.2

Q ss_pred             EECHHHHHHCCCCHHHHHHHHH
Q ss_conf             3227888747998789999984
Q gi|254780479|r  328 WHDAQNLANAALPTVMKKALSA  349 (356)
Q Consensus       328 Wv~~~el~~~~LPs~~kKIL~a  349 (356)
                      -+..+.|..+|||+.++..|+.
T Consensus        17 ~vr~d~I~~LPLP~~l~~Yl~e   38 (41)
T cd03740          17 YTRIDLIQKLPLPNKMKGYLLE   38 (41)
T ss_pred             HHHHHHHHHCCCCHHHHHHHHC
T ss_conf             7323001107996889998752


No 283
>cd07116 ALDH_ACDHII-AcoD Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II-like. Included in this CD is the NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus H16 involved in the catabolism of acetoin and ethanol, and similar proteins, such as, the dimeric dihydrolipoamide dehydrogenase of the acetoin dehydrogenase enzyme system of Klebsiella pneumonia. Also included are sequences similar to the NAD+-dependent chloroacetaldehyde dehydrogenases (AldA and AldB) of Xanthobacter autotrophicus GJ10 which are involved in the degradation of 1,2-dichloroethane. These proteins apparently require RpoN factors for expression.
Probab=24.58  E-value=49  Score=13.73  Aligned_cols=27  Identities=30%  Similarity=0.335  Sum_probs=20.6

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCC
Q ss_conf             999740463458899987767543005
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKKY  109 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~~  109 (356)
                      .+.|+-+..+.|++-|+++|..+.++.
T Consensus        51 ~~~W~~~~~~eR~~~l~~~a~~l~~~~   77 (479)
T cd07116          51 KEAWGKTSVAERANILNKIADRMEANL   77 (479)
T ss_pred             HHHHCCCCHHHHHHHHHHHHHHHHHCH
T ss_conf             998804999999999999999999789


No 284
>cd02037 MRP-like MRP (Multiple Resistance and pH adaptation) is a homologue of the Fer4_NifH superfamily. Like the other members of the superfamily, MRP contains a ATP-binding domain at the N-termini. It is found in bacteria as a membrane-spanning protein and functions as a Na+/H+ antiporter.
Probab=24.26  E-value=40  Score=14.33  Aligned_cols=27  Identities=26%  Similarity=0.402  Sum_probs=23.0

Q ss_pred             HHHHHHHHHHHHHHHC---CCEEEECCCCH
Q ss_conf             8777877667766434---96313124314
Q gi|254780479|r  124 PGIGDYTASAIVAIAF---NHFAVVVDTNI  150 (356)
Q Consensus       124 pGiG~yta~ai~s~a~---~~~~~~vD~Nv  150 (356)
                      .|||..|-++-++.++   |..+.++|..+
T Consensus         9 ggvgkst~~~~la~~l~~~g~~v~~~d~di   38 (169)
T cd02037           9 GGVGKSTVAVNLALALAKLGYKVGLLDADI   38 (169)
T ss_pred             CCCCHHHHHHHHHHHHHHCCCCEEEEEEEC
T ss_conf             988199999999999998799789997137


No 285
>TIGR02291 rimK_rel_E_lig alpha-L-glutamate ligase homolog; InterPro: IPR011758   Members of this protein family contain a region of homology to the RimK family of alpha-L-glutamate ligases (IPR004666 from INTERPRO), various members of which modify the Glu-Glu C-terminus of ribosomal protein S6, or tetrahydromethanopterin, or a form of coenzyme F420 derivative. Members of this family are found so far in various Vibrio and Pseudomonas species and some other gamma and beta Proteobacteria. The function is unknown..
Probab=24.16  E-value=33  Score=14.92  Aligned_cols=13  Identities=31%  Similarity=0.570  Sum_probs=8.5

Q ss_pred             CCHHHHCCCCCCC
Q ss_conf             2013204744211
Q gi|254780479|r  259 NTRLLEGMDELPG  271 (356)
Q Consensus       259 ~~gll~GLwEFP~  271 (356)
                      =+.+|+|||++=+
T Consensus       119 vSniL~GLySLGG  131 (320)
T TIGR02291       119 VSNILAGLYSLGG  131 (320)
T ss_pred             HHHHHHHHHHCCC
T ss_conf             8888877765079


No 286
>TIGR02684 dnstrm_HI1420 probable addiction module antidote protein; InterPro: IPR014057   Members of this strictly bacterial protein family are small, at roughly 100 amino acids. The gene is almost invariably the downstream member of a gene pair. It is a predicted DNA-binding protein from a clade within the helix-turn-helix family IPR001387 from INTERPRO. These gene pairs, when found on the bacterial chromosome, are located often with prophage regions, but also both in integrated plasmid regions and in housekeeping gene regions. Analysis suggests that the gene pair may serve as an addiction module..
Probab=24.11  E-value=50  Score=13.67  Aligned_cols=30  Identities=30%  Similarity=0.376  Sum_probs=23.6

Q ss_pred             HHHCC--CHHHHHHHHHCCCHHHHHHHHHHHHHHH
Q ss_conf             98729--9899999974046345889998776754
Q gi|254780479|r   73 CLSSA--KDEEILSAWAGLGYYTRARNLKKCADII  105 (356)
Q Consensus        73 ~la~a--~~~~vl~~w~gLGyy~Rar~l~~~a~~i  105 (356)
                      .|...  |-..|..+   |||=.|||.|-+.|+..
T Consensus        24 a~e~gGdDp~~~~~A---LG~iArArGMtqlA~~t   55 (91)
T TIGR02684        24 ALEDGGDDPALIAAA---LGVIARARGMTQLAKKT   55 (91)
T ss_pred             HHHCCCCCHHHHHHH---HHHHHHHCCHHHHHHHH
T ss_conf             974369998899999---88998651657899982


No 287
>COG3100 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.04  E-value=43  Score=14.12  Aligned_cols=20  Identities=30%  Similarity=0.415  Sum_probs=8.2

Q ss_pred             HHHCCCHHHHHHHHHCCCHH
Q ss_conf             98729989999997404634
Q gi|254780479|r   73 CLSSAKDEEILSAWAGLGYY   92 (356)
Q Consensus        73 ~la~a~~~~vl~~w~gLGyy   92 (356)
                      .||+|+.++|+...+--|||
T Consensus        61 ~L~n~dv~kV~~~i~~QGfy   80 (103)
T COG3100          61 KLVNADVEKVKQAIEEQGFY   80 (103)
T ss_pred             HHHHHHHHHHHHHHHHCCEE
T ss_conf             88785499999999865546


No 288
>COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis]
Probab=23.99  E-value=37  Score=14.60  Aligned_cols=27  Identities=30%  Similarity=0.347  Sum_probs=20.4

Q ss_pred             CCCCHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             687314---568888877787766776643
Q gi|254780479|r  112 NFPHKV---EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+|.+.   -.|..+.|||.-+|.+|+.-+
T Consensus         8 dip~~K~v~iALt~IyGIG~~~a~~I~~~~   37 (121)
T COG0099           8 DIPGNKRVVIALTYIYGIGRRRAKEICKKA   37 (121)
T ss_pred             CCCCCCEEEEHHHHHCCCCHHHHHHHHHHC
T ss_conf             799982576504630353699999999991


No 289
>cd07147 ALDH_F21_RNP123 Aldehyde dehydrogenase family 21A1-like. Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was first described in the moss Tortula ruralis and is believed to play an important role in the detoxification of aldehydes generated in response to desiccation- and salinity-stress, and ALDH21A1 expression represents a unique stress tolerance mechanism. So far, of plants, only the bryophyte sequence has been observed, but similar protein sequences from bacteria and archaea are also present in this CD.
Probab=23.88  E-value=50  Score=13.64  Aligned_cols=21  Identities=24%  Similarity=0.238  Sum_probs=8.6

Q ss_pred             HHHCCCHHHHHHHHHHHHHHH
Q ss_conf             974046345889998776754
Q gi|254780479|r   85 AWAGLGYYTRARNLKKCADII  105 (356)
Q Consensus        85 ~w~gLGyy~Rar~l~~~a~~i  105 (356)
                      .|+.+....|++-|+++|..+
T Consensus        36 ~W~~~~~~eR~~~l~~~a~~l   56 (452)
T cd07147          36 PMRALPAHRRAAILLHCVARL   56 (452)
T ss_pred             HHHCCCHHHHHHHHHHHHHHH
T ss_conf             982599999999999999999


No 290
>TIGR00608 radc DNA repair protein RadC; InterPro: IPR001405 RadC is a DNA repair protein found in many bacteria. They share a region of similarity at the C-terminus that contains two perfectly conserved histidine residues.; GO: 0006281 DNA repair.
Probab=23.86  E-value=49  Score=13.69  Aligned_cols=42  Identities=24%  Similarity=0.320  Sum_probs=28.5

Q ss_pred             HHHHHHHHHHHCCC--------CCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             99877675430057--------8687314568888877787766776643
Q gi|254780479|r   97 NLKKCADIIVKKYE--------GNFPHKVEILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        97 ~l~~~a~~i~~~~~--------g~~P~~~~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .-...|+.+.+.+|        +-.-.++++|.+.||||.=-|--|.+++
T Consensus        34 dVl~Lsk~lL~~fG~~~~d~l~~L~~~~~~~L~~v~GiG~ak~~qlka~~   83 (223)
T TIGR00608        34 DVLSLSKELLKQFGQEELDSLDLLLSAEPEELSSVPGIGLAKAIQLKAIV   83 (223)
T ss_pred             CHHHHHHHHHHHHCHHCCCCHHHHHCCCHHHHHCCCCCHHHHHHHHHHHH
T ss_conf             62678999998707000130014321585785046885088999999999


No 291
>PRK00558 uvrC excinuclease ABC subunit C; Validated
Probab=23.76  E-value=50  Score=13.62  Aligned_cols=23  Identities=17%  Similarity=0.206  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHCCCCCCCCCCC
Q ss_conf             99999999863204123564311
Q gi|254780479|r  187 FVQAMMDLGALICTSNKPLCPLC  209 (356)
Q Consensus       187 ~nQAlMdlGa~iC~p~~P~C~~C  209 (356)
                      ...+|=+|....=.++.|.=-+|
T Consensus       376 ~~~~l~~L~~~L~L~~~p~rIE~  398 (609)
T PRK00558        376 QTAALEELAEALGLPTLPKRIEC  398 (609)
T ss_pred             HHHHHHHHHHHHCCCCCCCEEEE
T ss_conf             99999999998388998878999


No 292
>pfam00286 Flexi_CP Viral coat protein. Family includes coat proteins from Potexviruses and carlaviruses.
Probab=23.75  E-value=50  Score=13.62  Aligned_cols=78  Identities=23%  Similarity=0.269  Sum_probs=38.6

Q ss_pred             HHHHHHHHCCCHHH--HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH
Q ss_conf             99999974046345--8899987767543005786873145688888777877667766434963131243146665765
Q gi|254780479|r   80 EEILSAWAGLGYYT--RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY  157 (356)
Q Consensus        80 ~~vl~~w~gLGyy~--Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~  157 (356)
                      ..|...|+|||--.  -+.-+...|....+.-.+.+-.=...-...+|+-+..+.  .++       +-+.-..|-++|+
T Consensus        11 ~~I~~~~~~lGvp~~~v~~~~~~la~~C~d~gSS~~~~~~G~~~~~~g~~~~~~a--~~v-------i~~~~TLRqfC~~   81 (140)
T pfam00286        11 VAIAALWQGLGVPAEHVAKVAIDLARACADVGSSRSTDLTGTSEWPPGITRADLA--LAV-------VKAHATLRQFCRY   81 (140)
T ss_pred             HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHH--HHH-------HHCCCCHHHHHHH
T ss_conf             9999999983998889999999999997077887545527867889996489999--999-------9807989999999


Q ss_pred             HCCCCCCCHHHHHHHHHH
Q ss_conf             197545735689998741
Q gi|254780479|r  158 FDIIKPAPLYHKTIKNYA  175 (356)
Q Consensus       158 ~~~~~~~~~~~k~l~~~~  175 (356)
                      |+         |.+|..+
T Consensus        82 yA---------kvvWN~m   90 (140)
T pfam00286        82 YA---------KVVWNYM   90 (140)
T ss_pred             HH---------HHHHHHH
T ss_conf             83---------9999988


No 293
>pfam01583 APS_kinase Adenylylsulphate kinase. Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif.
Probab=23.75  E-value=46  Score=13.94  Aligned_cols=32  Identities=25%  Similarity=0.242  Sum_probs=25.1

Q ss_pred             HHHHHHHHHHHHHHHH---HHCCCEEEECCCCHHH
Q ss_conf             8888777877667766---4349631312431466
Q gi|254780479|r  121 KKLPGIGDYTASAIVA---IAFNHFAVVVDTNIER  152 (356)
Q Consensus       121 ~~LpGiG~yta~ai~s---~a~~~~~~~vD~Nv~R  152 (356)
                      .-|||-|..|-+.-+.   ...+.++..+||...|
T Consensus         8 TGLsGsGKTTlA~~l~~~L~~~~~~~~~LDGD~~R   42 (157)
T pfam01583         8 TGLSGSGKSTIANALERKLFAQGISVYVLDGDNVR   42 (157)
T ss_pred             ECCCCCCHHHHHHHHHHHHHHCCCCEEEECCHHHH
T ss_conf             89899999999999999999759977997688775


No 294
>COG0634 Hpt Hypoxanthine-guanine phosphoribosyltransferase [Nucleotide transport and metabolism]
Probab=23.46  E-value=51  Score=13.58  Aligned_cols=106  Identities=21%  Similarity=0.181  Sum_probs=58.9

Q ss_pred             HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCC--HHHHH--
Q ss_conf             89998776754300578687314568888877787766776643496313124314666576519754573--56899--
Q gi|254780479|r   95 ARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAP--LYHKT--  170 (356)
Q Consensus        95 ar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~--~~~k~--  170 (356)
                      +..+.++|++|.++|+|+.|--   +--|.|-=+.+|.-+..|.+....--++      .|++.+-...+.  ...+.  
T Consensus        18 ~~ri~ela~~I~~~y~g~~~~v---v~iLkGs~~F~~dL~~~i~~~~e~dFm~------vSSYg~~t~ssg~v~i~kDld   88 (178)
T COG0634          18 KARIKELAAQITEDYGGKDPLV---VGVLKGSFPFMADLIRAIDFPLEVDFMH------VSSYGGGTSSSGEVKILKDLD   88 (178)
T ss_pred             HHHHHHHHHHHHHHHCCCCEEE---EEECCCCHHHHHHHHHHCCCCCEEEEEE------EECCCCCCCCCCCEEEECCCC
T ss_conf             9999999999977627996199---9974664530999998559984167899------740478763477168853566


Q ss_pred             --HH----HHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCHH
Q ss_conf             --98----741025785226789999999986320412356-43110000013
Q gi|254780479|r  171 --IK----NYARKITSTSRPGDFVQAMMDLGALICTSNKPL-CPLCPIQKNCL  216 (356)
Q Consensus       171 --l~----~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~-C~~Cpl~~~C~  216 (356)
                        +.    .+.+.+++.   |.--+.++|+    |+-++|+ |..|-|-+.+.
T Consensus        89 ~di~grdVLiVeDIiDs---G~TLs~i~~~----l~~r~a~sv~i~tLldK~~  134 (178)
T COG0634          89 EDIKGRDVLIVEDIIDS---GLTLSKVRDL----LKERGAKSVRIATLLDKPE  134 (178)
T ss_pred             CCCCCCEEEEEECCCCC---CHHHHHHHHH----HHHCCCCEEEEEEEEECCC
T ss_conf             67789849998521036---7559999999----9756997689999860765


No 295
>cd07559 ALDH_ACDHII_AcoD-like Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II and Staphylococcus aureus AldA1 (SACOL0154)-like. Included in this CD is the NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus H16 involved in the catabolism of acetoin and ethanol, and similar proteins, such as, the dimeric dihydrolipoamide dehydrogenase of the acetoin dehydrogenase enzyme system of Klebsiella pneumonia. Also included are sequences similar to the NAD+-dependent chloroacetaldehyde dehydrogenases (AldA and AldB) of Xanthobacter autotrophicus GJ10 which are involved in the degradation of 1,2-dichloroethane, as well as, the uncharacterized aldehyde dehydrogenase from Staphylococcus aureus (AldA1, locus SACOL0154) and other similar sequences.
Probab=23.39  E-value=51  Score=13.57  Aligned_cols=58  Identities=24%  Similarity=0.204  Sum_probs=33.9

Q ss_pred             HHHHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCCCC-CHH-HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             99999740463458899987767543005----------78687-314-568888877787766776643
Q gi|254780479|r   81 EILSAWAGLGYYTRARNLKKCADIIVKKY----------EGNFP-HKV-EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        81 ~vl~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~~P-~~~-~~l~~LpGiG~yta~ai~s~a  138 (356)
                      +-.+.|+-++.+.|++-|+++|..+.++-          .|+-. +.. .|+..--..-+|.|+......
T Consensus        49 ~A~~~W~~~~~~eR~~~L~~~a~~l~~~~~ela~~~~~E~GK~~~~a~~~ev~~a~~~~~~~a~~~~~~~  118 (480)
T cd07559          49 EAFKTWGKTSVAERANILNKIADRIEENLELLAVAETLDNGKPIRETLAADIPLAIDHFRYFAGVIRAQE  118 (480)
T ss_pred             HHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf             9999980499999999999999999968999999999970923888888799999999999999998705


No 296
>KOG1362 consensus
Probab=23.35  E-value=15  Score=17.36  Aligned_cols=16  Identities=13%  Similarity=-0.045  Sum_probs=9.6

Q ss_pred             HHHHHHHHHHHHCCHH
Q ss_conf             4676644673116120
Q gi|254780479|r   40 YKVWISEIMLQQTTVK   55 (356)
Q Consensus        40 y~v~vseimlqqT~v~   55 (356)
                      ..-+.-++++|+|...
T Consensus       213 ~~~~~~~~~~~~~~l~  228 (577)
T KOG1362         213 TSSLFVAVGNQLTLLD  228 (577)
T ss_pred             CHHHHHHHHHHHHHHH
T ss_conf             2179999976999999


No 297
>TIGR01258 pgm_1 phosphoglycerate mutase 1 family; InterPro: IPR005952    Most members of this family are phosphoglycerate mutase (5.4.2.1 from EC). This enzyme interconverts 2-phosphoglycerate and 3-phosphoglycerate.  2-phospho-D-glycerate + 2,3-diphosphoglycerate = 3-phospho-D-glycerate + 2,3-diphosphoglycerate. The enzyme is transiently phosphorylated on an active site histidine by 2,3-diphosphoglyerate, which is both substrate and product. Some members of this family have are phosphoglycerate mutase as a minor activity and act primarily as a bisphoglycerate mutase, interconverting 2,3-diphosphoglycerate and 1,3-diphosphoglycerate (5.4.2.4 from EC). ; GO: 0016868 intramolecular transferase activity phosphotransferases, 0006096 glycolysis.
Probab=23.32  E-value=21  Score=16.35  Aligned_cols=11  Identities=45%  Similarity=0.794  Sum_probs=4.2

Q ss_pred             CHHHCCHHHHH
Q ss_conf             12033265899
Q gi|254780479|r   53 TVKTVEPYFKK   63 (356)
Q Consensus        53 ~v~~v~~~~~~   63 (356)
                      +|++|+|||+.
T Consensus       156 T~~RvlPYW~d  166 (248)
T TIGR01258       156 TVARVLPYWND  166 (248)
T ss_pred             HHHHHCCCHHH
T ss_conf             99650653111


No 298
>cd03313 enolase Enolase: Enolases are homodimeric enzymes that catalyse the reversible dehydration of 2-phospho-D-glycerate to phosphoenolpyruvate as part of the glycolytic and gluconeogenesis pathways. The reaction is facilitated by the presence of metal ions.
Probab=23.09  E-value=12  Score=17.94  Aligned_cols=49  Identities=24%  Similarity=0.348  Sum_probs=18.3

Q ss_pred             HHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHH----HHHHHHH---HHHHHHHHHHH
Q ss_conf             7404634588999877675430057868731456----8888877---78776677664
Q gi|254780479|r   86 WAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEI----LKKLPGI---GDYTASAIVAI  137 (356)
Q Consensus        86 w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~----l~~LpGi---G~yta~ai~s~  137 (356)
                      +.|.|-..-..|+..   .|....-|.=|.|..+    |+.|.|-   .+-=|+||++.
T Consensus        54 y~GkgV~~Av~~vn~---~I~~~L~G~~~~dQ~~iD~~LielDgT~nKs~lGaNailav  109 (408)
T cd03313          54 YLGKGVLKAVKNVNE---IIAPALIGMDVTDQRAIDKLLIELDGTPNKSKLGANAILGV  109 (408)
T ss_pred             CCCCCHHHHHHHHHH---HHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH
T ss_conf             478159999999999---88999769986768999999997208866344651799999


No 299
>TIGR02903 spore_lon_C ATP-dependent protease, Lon family; InterPro: IPR014252   This entry shows some relation to the widely distributed ATP-dependent protease La, also called Lon or LonA (IPR004815 from INTERPRO), but is more closely related to LonB (IPR014251 from INTERPRO), a LonA paralog found only in endospore-forming bacteria. Proteins in this entry are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ). They are restricted to a subset of endospore-forming species, and probably participate in the program of endospore formation. We propose the designation LonC..
Probab=22.95  E-value=37  Score=14.56  Aligned_cols=30  Identities=33%  Similarity=0.367  Sum_probs=25.1

Q ss_pred             HHHHHHHHHHHHH----------HHCCCEEEECCCCHHHH
Q ss_conf             8777877667766----------43496313124314666
Q gi|254780479|r  124 PGIGDYTASAIVA----------IAFNHFAVVVDTNIERI  153 (356)
Q Consensus       124 pGiG~yta~ai~s----------~a~~~~~~~vD~Nv~RV  153 (356)
                      ||||.-||+=++.          |+-|-+..=|||-..|=
T Consensus       185 PGVGKTTaARl~LEe~K~~~~tPF~~DA~FvEVDGtTLRW  224 (616)
T TIGR02903       185 PGVGKTTAARLALEEAKKLKNTPFAEDAPFVEVDGTTLRW  224 (616)
T ss_pred             CCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCC
T ss_conf             3884789999987621368744761137857515762667


No 300
>cd07106 ALDH_AldA-AAD23400 Streptomyces aureofaciens putative aldehyde dehydrogenase AldA (AAD23400)-like. Putative aldehyde dehydrogenase, AldA, from Streptomyces aureofaciens (locus AAD23400) and other similar sequences are present in this CD.
Probab=22.87  E-value=52  Score=13.51  Aligned_cols=24  Identities=25%  Similarity=0.315  Sum_probs=12.6

Q ss_pred             HHHHCCCHHHHHHHHHHHHHHHHH
Q ss_conf             997404634588999877675430
Q gi|254780479|r   84 SAWAGLGYYTRARNLKKCADIIVK  107 (356)
Q Consensus        84 ~~w~gLGyy~Rar~l~~~a~~i~~  107 (356)
                      +.|+-+....|++-|++.|..|.+
T Consensus        33 ~~W~~~~~~~R~~~L~~~a~~l~~   56 (446)
T cd07106          33 PGWSATPLEERRAALLAIADAIEA   56 (446)
T ss_pred             HHHCCCCHHHHHHHHHHHHHHHHH
T ss_conf             987239999999999999999997


No 301
>COG1251 NirB NAD(P)H-nitrite reductase [Energy production and conversion]
Probab=22.86  E-value=52  Score=13.50  Aligned_cols=46  Identities=17%  Similarity=0.267  Sum_probs=26.4

Q ss_pred             CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH
Q ss_conf             05786873145688888777877667766434963131243146665765
Q gi|254780479|r  108 KYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY  157 (356)
Q Consensus       108 ~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~  157 (356)
                      .|.|..+   ..-++++||--+.+|-+. =.-|.+..+.+-..+.|..|+
T Consensus       310 ~y~gsv~---stkLKv~Gvdl~S~GD~~-e~~~~~~iv~~D~~~~iYKrl  355 (793)
T COG1251         310 AYEGSVT---STKLKVSGVDVFSAGDFQ-ETEGAESIVFRDEQRGIYKKL  355 (793)
T ss_pred             CCCCCCC---HHHHCCCCCCEEECCCHH-HCCCCCEEEEECCCCCCEEEE
T ss_conf             5545542---454315566333011132-248994489865411300489


No 302
>TIGR01203 HGPRTase hypoxanthine phosphoribosyltransferase; InterPro: IPR005904    Phosphoribosyltransferases (PRT) are enzymes that catalyze the synthesis of beta-n-5'-monophosphates from phosphoribosylpyrophosphate (PRPP) and an enzyme specific amine. A number of PRT's are involved in the biosynthesis of purine, pyrimidine, and pyridine nucleotides, or in the salvage of purines and pyrimidines. Purine nucleotides are synthesized both via the de novo pathway and via the salvage pathway and are vital for cell functions and cell proliferation through DNA and RNA syntheses and ATP energy supply.    This entry presents hypoxanthine phosphoribosyltransferase (2.4.2.8 from EC), which belongs to phosphoribosyltransferase family and is involved in purine salvage.; GO: 0004422 hypoxanthine phosphoribosyltransferase activity, 0006166 purine ribonucleoside salvage, 0005737 cytoplasm.
Probab=22.84  E-value=52  Score=13.50  Aligned_cols=61  Identities=26%  Similarity=0.273  Sum_probs=42.8

Q ss_pred             HHHHHHHHHHHHHCCCCCCCCHHHHHHH---HHHHHHHHHHHHHHHH-----CCCEEEECCCCHHHHHH
Q ss_conf             8999877675430057868731456888---8877787766776643-----49631312431466657
Q gi|254780479|r   95 ARNLKKCADIIVKKYEGNFPHKVEILKK---LPGIGDYTASAIVAIA-----FNHFAVVVDTNIERIIS  155 (356)
Q Consensus        95 ar~l~~~a~~i~~~~~g~~P~~~~~l~~---LpGiG~yta~ai~s~a-----~~~~~~~vD~Nv~RVl~  155 (356)
                      ..+..++|++|.++|.|.-|.+-++|..   |.|==.-.|.-|..|.     -+...+.+--.-.+|-|
T Consensus         9 ~~r~~eLa~~I~~~Y~~~~~tt~~~l~~~gvLKGs~~F~aDL~~~i~~~~~~S~RS~~~~~~dFm~vSS   77 (183)
T TIGR01203         9 KARIAELAKQITEDYQGKKPTTDEELVLVGVLKGSFPFFADLIRAIDKLNKYSDRSVVPVQVDFMAVSS   77 (183)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEEE
T ss_conf             999999999999873567899897389999725624459999999887422068730114888898875


No 303
>cd07591 BAR_Rvs161p The Bin/Amphiphysin/Rvs (BAR) domain of Saccharomyces cerevisiae Reduced viability upon starvation protein 161 and similar proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. This subfamily is composed of fungal proteins with similarity to Saccharomyces cerevisiae Reduced viability upon starvation protein 161 (Rvs161p) and Schizosaccharomyces pombe Hob3 (homolog of Bin3). S. cerevisiae Rvs161p plays a role in regulating cell polarity, actin cytoskeleton polarization, vesicle trafficking, endocytosis, bud formation, and the mating response. It forms a heterodimer with another BAR domain protein Rvs167p. Rvs161p and Rvs167p share common functions but are not interchangeable. Their BAR domains cannot be replaced with each other and the overexpression of one cannot suppress the mutant phenotypes of the other. S. pombe Hob3 is important in regulating filamentous actin localization an
Probab=22.69  E-value=53  Score=13.48  Aligned_cols=75  Identities=16%  Similarity=0.176  Sum_probs=42.2

Q ss_pred             CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------CHHHCCHHHHHHHHHCCCHHHH
Q ss_conf             9671689999999999962899887548554445678704676644673116------1203326589987748997898
Q gi|254780479|r    1 MPQPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------TVKTVEPYFKKFMQKWPTIFCL   74 (356)
Q Consensus         1 ~~~~~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~v~~v~~~~~~~~~~~P~~~~l   74 (356)
                      |.+-...|.+.|..+|..++.+-++-.         ..-|..-|.|+--+..      --.+|+.=-.+|...||.+...
T Consensus        44 m~~~~~~lae~l~~~Y~~~~~~~~~~~---------~~~y~~~v~el~~~l~~el~~~~~~~vl~Pl~~~~s~f~~i~k~  114 (224)
T cd07591          44 LTSSQARIAETISSFYGDAGDKDGAML---------SQEYKQAVEELDAETVKELDGPYRQTVLDPIGRFNSYFPEINEA  114 (224)
T ss_pred             HHHHHHHHHHHHHHHHCCCCCCCHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             999999999999997353579644568---------89999999999999899998888999988999999985889999


Q ss_pred             HCCCHHHHHH
Q ss_conf             7299899999
Q gi|254780479|r   75 SSAKDEEILS   84 (356)
Q Consensus        75 a~a~~~~vl~   84 (356)
                      ..-.....+-
T Consensus       115 I~KR~~KllD  124 (224)
T cd07591         115 IKKRNHKLLD  124 (224)
T ss_pred             HHHHHHHHHH
T ss_conf             9997716877


No 304
>KOG0142 consensus
Probab=22.45  E-value=23  Score=15.99  Aligned_cols=33  Identities=24%  Similarity=0.495  Sum_probs=24.4

Q ss_pred             CCCEEEEEEEEEECCCEEEEEECCCCHH-HHCCCC
Q ss_conf             2211232000341376057764572013-204744
Q gi|254780479|r  235 RPMRTGAVFIAITNDNRILLRKRTNTRL-LEGMDE  268 (356)
Q Consensus       235 k~~r~~~~~vii~~~~kiLL~KRp~~gl-l~GLwE  268 (356)
                      .-.|.+.+|+ .+.+|+.||+||+..++ |.|+|.
T Consensus        50 lLHRaFSVFl-Fns~~~lLlQqRS~~KitFP~~~T   83 (225)
T KOG0142          50 LLHRAFSVFL-FNSKNELLLQQRSDEKITFPGLWT   83 (225)
T ss_pred             HHHHEEEEEE-ECCCCHHHHHHHCCCCCCCCCHHH
T ss_conf             4432046898-658514776540354411650555


No 305
>cd02011 TPP_PK Thiamine pyrophosphate (TPP) family, Phosphoketolase (PK) subfamily, TPP-binding module; PK catalyzes the conversion of D-xylulose 5-phosphate and phosphate to acetyl phosphate, D-glyceraldehyde-3-phosphate and H2O. This enzyme requires divalent magnesium ions and TPP for activity.
Probab=22.15  E-value=48  Score=13.76  Aligned_cols=12  Identities=8%  Similarity=0.113  Sum_probs=4.6

Q ss_pred             CCCCCCCCCCCC
Q ss_conf             356431100000
Q gi|254780479|r  203 KPLCPLCPIQKN  214 (356)
Q Consensus       203 ~P~C~~Cpl~~~  214 (356)
                      .=.|+..|+...
T Consensus        89 DGEaETGPlAts  100 (227)
T cd02011          89 DGEAETGPLATS  100 (227)
T ss_pred             CCCCCCCCCHHH
T ss_conf             863235841222


No 306
>pfam05405 Mt_ATP-synt_B Mitochondrial ATP synthase B chain precursor (ATP-synt_B). The Fo sector of the ATP synthase is a membrane bound complex which mediates proton transport. It is composed of nine different polypeptide subunits (a, b, c, d, e, f, g F6, A6L).
Probab=21.86  E-value=55  Score=13.37  Aligned_cols=121  Identities=15%  Similarity=0.119  Sum_probs=76.4

Q ss_pred             CCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECC
Q ss_conf             89978987299899999974046345889998776754300578687314568888877787766776643496313124
Q gi|254780479|r   68 WPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVD  147 (356)
Q Consensus        68 ~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD  147 (356)
                      =|++.+.+.+..+++...+.      -+|+.|..|-.-..++....=.-.+.-..|..+-+-++...+-.++-.....|-
T Consensus        35 G~~i~~~lD~~~e~i~~~l~------~~r~~~~~a~~~~ie~~k~~~~~~~~~~~l~~~~ke~~al~~e~~~~~~~~~v~  108 (163)
T pfam05405        35 GPSIKEWLDKRIEKIQDELN------QSRNLHEKALKERIEYVKKLQSVVEETKVLFEVSKETVALEAEAFERELQAALA  108 (163)
T ss_pred             HHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             08899999998999999999------888878999999999999999999989999998246799777999999999999


Q ss_pred             CCHHHHHHHHHCCCCCCCH-HHHH-----HHHHHHCCCCCC-CHHHHHHHHHHH
Q ss_conf             3146665765197545735-6899-----987410257852-267899999999
Q gi|254780479|r  148 TNIERIISRYFDIIKPAPL-YHKT-----IKNYARKITSTS-RPGDFVQAMMDL  194 (356)
Q Consensus       148 ~Nv~RVl~R~~~~~~~~~~-~~k~-----l~~~~~~~~~~~-~~~~~nQAlMdl  194 (356)
                      .+|+|+|--+...+..... .++.     +......+.|+. .---..|++-||
T Consensus       109 ~evk~~Ld~~v~~e~~~r~~~Q~~~v~~v~~~V~~~i~~~~~~k~~l~~~I~~l  162 (163)
T pfam05405       109 REIKSKLDTLVRKESSVRQREQDHLVNWVISSVLKELSPPKFQKKSLQESIADL  162 (163)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
T ss_conf             999999999999998999999999999999999987079167999999998860


No 307
>TIGR03371 cellulose_yhjQ cellulose synthase operon protein YhjQ. Members of this family are the YhjQ protein, found immediately upsteam of bacterial cellulose synthase (bcs) genes in a broad range of bacteria, including both copies of the bcs locus in Klebsiella pneumoniae. In several species it is seen clearly as part of the bcs operon. It is identified as a probable component of the bacterial cellulose metabolic process not only by gene location, but also by partial phylogenetic profiling, or Haft-Selengut algorithm (PMID:16930487), based on a bacterial cellulose biosynthesis genome property profile. Cellulose plays an important role in biofilm formation and structural integrity in some bacteria. Mutants in yhjQ in Escherichia coli, show altered morphology an growth, but the function of YhjQ has not yet been determined.
Probab=21.85  E-value=55  Score=13.37  Aligned_cols=39  Identities=21%  Similarity=0.310  Sum_probs=24.6

Q ss_pred             HHHHHHHHHHHHHHC---CCEEEECCCCHHHHHHHHHCCCCC
Q ss_conf             777877667766434---963131243146665765197545
Q gi|254780479|r  125 GIGDYTASAIVAIAF---NHFAVVVDTNIERIISRYFDIIKP  163 (356)
Q Consensus       125 GiG~yta~ai~s~a~---~~~~~~vD~Nv~RVl~R~~~~~~~  163 (356)
                      |||..|.++=++.++   |.++.+||..-..-++..||.+..
T Consensus        12 GVGKTT~avNLA~~La~~G~rVLlIDlDpQ~~l~~~~g~~~~   53 (246)
T TIGR03371        12 GVGRTTLTAALASALKLLGEPVLAIDLDPQNLLRLHFGMDWS   53 (246)
T ss_pred             CCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCC
T ss_conf             854999999999999968997899975999850322488875


No 308
>KOG4579 consensus
Probab=21.85  E-value=53  Score=13.45  Aligned_cols=16  Identities=50%  Similarity=0.615  Sum_probs=10.3

Q ss_pred             HHHHHHCCCHHHHHCC
Q ss_conf             9987748997898729
Q gi|254780479|r   62 KKFMQKWPTIFCLSSA   77 (356)
Q Consensus        62 ~~~~~~~P~~~~la~a   77 (356)
                      ++|..+|||...|--|
T Consensus        70 ~kft~kf~t~t~lNl~   85 (177)
T KOG4579          70 KKFTIKFPTATTLNLA   85 (177)
T ss_pred             HHHHHCCCHHHHHHCC
T ss_conf             9884054046555221


No 309
>PRK00077 eno phosphopyruvate hydratase; Provisional
Probab=21.83  E-value=16  Score=17.15  Aligned_cols=27  Identities=26%  Similarity=0.199  Sum_probs=10.1

Q ss_pred             CCCCCCHHHH----HHHHHHH---HHHHHHHHHH
Q ss_conf             7868731456----8888877---7877667766
Q gi|254780479|r  110 EGNFPHKVEI----LKKLPGI---GDYTASAIVA  136 (356)
Q Consensus       110 ~g~~P~~~~~----l~~LpGi---G~yta~ai~s  136 (356)
                      -|.=|.|..+    |+.|.|-   .+.-|+||++
T Consensus        80 ~G~~~~dQ~~iD~~LielDgT~nKs~lGaNaila  113 (427)
T PRK00077         80 IGLDAFDQAGIDKAMIELDGTENKSKLGANAILA  113 (427)
T ss_pred             CCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH
T ss_conf             7997462899999999732985412244267899


No 310
>TIGR03650 violacein_E violacein biosynthesis enzyme VioE. This enzyme catalyzes the third step in violacein biosynthesis from a pair of Trp residues, as in Chromobacterium violaceum, but the first step that distinguishes that pathway from staurosporine (an indolocarbazole antibiotic) biosynthesis.
Probab=21.60  E-value=36  Score=14.62  Aligned_cols=15  Identities=33%  Similarity=0.740  Sum_probs=9.5

Q ss_pred             CHHHHHHHHHHHHHC
Q ss_conf             704676644673116
Q gi|254780479|r   38 SPYKVWISEIMLQQT   52 (356)
Q Consensus        38 ~py~v~vseimlqqT   52 (356)
                      .-|+.|.||||--.|
T Consensus        53 ~G~rLWMSEIg~a~~   67 (184)
T TIGR03650        53 TGYRLWMSEIGNAAS   67 (184)
T ss_pred             CCHHHHHHHHCCCCC
T ss_conf             211226766466433


No 311
>TIGR01577 oligosac_amyl oligosaccharide amylase; InterPro: IPR006465   This entry represents glycoamylase enzymes (also known as glucan 1,4-alpha-glucosidase), which catalyse the hydrolysis of terminal 1,4-linked alpha-D-glucose residues successively from non-reducing ends of the chains with the release of beta-D-glucose. This enzyme has been characterised in  Thermoactinomyces vulgaris, where, unlike previously characterised glucoamylases, this enzyme hydrolyzes maltooligosaccharides (maltotetraose, maltose) more efficiently than starch, indicating this enzyme belongs to a class of glucoamylase-type enzymes with oligosaccharide-metabolizing activity , .  More information about this protein can be found at Protein of the Month: alpha-Amylase ..
Probab=21.58  E-value=55  Score=13.33  Aligned_cols=40  Identities=23%  Similarity=0.349  Sum_probs=30.5

Q ss_pred             HHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             998776754300578---6873145688888777877667766
Q gi|254780479|r   97 NLKKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVA  136 (356)
Q Consensus        97 ~l~~~a~~i~~~~~g---~~P~~~~~l~~LpGiG~yta~ai~s  136 (356)
                      .+.+||+.++.--..   -+-...|-+..-=||=.||++||.|
T Consensus       402 sv~kAA~yL~l~~~~~~Gl~~pc~DLWEer~GvftYt~~a~y~  444 (679)
T TIGR01577       402 SVEKAAEYLILVIKSELGLFEPCFDLWEERYGVFTYTAAAVYA  444 (679)
T ss_pred             HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH
T ss_conf             9999999999873367887677530001003212223468888


No 312
>cd07131 ALDH_AldH-CAJ73105 Uncharacterized Candidatus kuenenia aldehyde dehydrogenase AldH (CAJ73105)-like. Uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105) and similar sequences with similarity to alpha-aminoadipic semialdehyde dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31), Arabidopsis ALDH7B4, and Streptomyces clavuligerus delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH) are included in this CD.
Probab=21.51  E-value=56  Score=13.32  Aligned_cols=56  Identities=14%  Similarity=0.206  Sum_probs=35.0

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHH-----------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9997404634588999877675430-----------0578687314568888877787766776643
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVK-----------KYEGNFPHKVEILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~-----------~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a  138 (356)
                      .+.|+-+..+.|++-|+++|..+.+           +-|--+.+...|+...-..-+|.|+....+.
T Consensus        50 ~~~W~~~~~~eR~~~L~~~a~~l~~~~~~la~~~~~E~GKp~~~a~~ev~~~~~~~~~~a~~~~~~~  116 (478)
T cd07131          50 FPEWRKVPAPRRAEYLFRAAELLKKRKEELARLVTREMGKPLAEGRGDVQEAIDMAQYAAGEGRRLF  116 (478)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf             9988259999999999999999997799999999997393899999999999999999999999851


No 313
>COG1168 MalY Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]
Probab=21.47  E-value=56  Score=13.32  Aligned_cols=109  Identities=19%  Similarity=0.257  Sum_probs=74.4

Q ss_pred             CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHC-CCHH-HHHHHH
Q ss_conf             99887548554445678704676644673116120332658998774899789872998999999740-4634-588999
Q gi|254780479|r   21 RVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAG-LGYY-TRARNL   98 (356)
Q Consensus        21 R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~g-LGyy-~Rar~l   98 (356)
                      ..+-|+..++     ..|--..|||+.                   -||++..+.+|=.+.|-.   | .||- .-....
T Consensus        14 ~s~KWd~~~g-----~~DvlPmWVADM-------------------Df~~pp~i~~Al~~rvdh---GvfGY~~~~~~~~   66 (388)
T COG1168          14 NSLKWDAHPG-----NEDVLPMWVADM-------------------DFPTPPEIIEALRERVDH---GVFGYPYGSDELY   66 (388)
T ss_pred             CCEECCCCCC-----CCCCCEEEEECC-------------------CCCCCHHHHHHHHHHHHC---CCCCCCCCCHHHH
T ss_conf             7500255689-----888413432004-------------------678887999999999865---8778878877899


Q ss_pred             HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHH
Q ss_conf             877675430057868731456888887778776677664349631312431466657651
Q gi|254780479|r   99 KKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYF  158 (356)
Q Consensus        99 ~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~  158 (356)
                      .+.+....++|+=.+..  +.+.--|||=+=.+-+|-.+.---...+|.+=|.-=|.+..
T Consensus        67 ~ai~~w~~~r~~~~i~~--e~i~~~p~VVpgi~~~I~~~T~~gd~Vvi~tPvY~PF~~~i  124 (388)
T COG1168          67 AAIAHWFKQRHQWEIKP--EWIVFVPGVVPGISLAIRALTKPGDGVVIQTPVYPPFYNAI  124 (388)
T ss_pred             HHHHHHHHHHCCCCCCC--CEEEECCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHH
T ss_conf             99999999713777885--52898375127699999983769982696488861899987


No 314
>pfam10391 DNA_pol_lambd_f Fingers domain of DNA polymerase lambda. DNA polymerases catalyse the addition of dNMPs onto the 3-prime ends of DNA chains. There is a general polymerase fold consisting of three subdomains that have been likened to the fingers, palm, and thumb of a right hand. DNA_pol_lambd_f is the central three-helical region of DNA polymerase lambda referred to as the F and G helices of the fingers domain. Contacts with DNA involve this conserved helix-hairpin-helix motif in the fingers region which interacts with the primer strand. This motif is common to several DNA binding proteins and confers a sequence-independent interaction with the DNA backbone.
Probab=21.41  E-value=44  Score=14.04  Aligned_cols=21  Identities=19%  Similarity=0.281  Sum_probs=15.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             568888877787766776643
Q gi|254780479|r  118 EILKKLPGIGDYTASAIVAIA  138 (356)
Q Consensus       118 ~~l~~LpGiG~yta~ai~s~a  138 (356)
                      ..+.+.-||||.||.-..+-+
T Consensus         2 ~~f~~I~GvGp~~A~~~~~~G   22 (52)
T pfam10391         2 KLFTNIWGVGPKTARKWYRQG   22 (52)
T ss_pred             HHHHHCCCCCHHHHHHHHHHC
T ss_conf             036635440699999999947


No 315
>TIGR03018 pepcterm_TyrKin exopolysaccharide/PEPCTERM locus tyrosine autokinase. Members of this protein family are related to a known protein-tyrosine autokinase and to numerous homologs from exopolysaccharide biosynthesis region proteins, many of which are designated as chain length determinants. Most members of this family contain a short region, immediately C-terminal to the region modeled here, with an abundance of Tyr residues. These C-terminal tyrosine residues are likely to be autophosphorylation sites. Some members of this family are fusion proteins.
Probab=21.31  E-value=41  Score=14.25  Aligned_cols=41  Identities=22%  Similarity=0.453  Sum_probs=29.3

Q ss_pred             HHHHHHHHHHHHHHHHC----CCEEEECCCCHHH-HHHHHHCCCCC
Q ss_conf             88777877667766434----9631312431466-65765197545
Q gi|254780479|r  123 LPGIGDYTASAIVAIAF----NHFAVVVDTNIER-IISRYFDIIKP  163 (356)
Q Consensus       123 LpGiG~yta~ai~s~a~----~~~~~~vD~Nv~R-Vl~R~~~~~~~  163 (356)
                      .||-|..|-++=++.++    ++.+..||++.+| -+++++++...
T Consensus        44 ~pgeGKTtva~nLA~~lA~~~~~~VLLVDaDlr~p~l~~~l~~~~~   89 (207)
T TIGR03018        44 LPGEGKSFTAINLAISLAQEYDKTVLLIDADLRRPSLHRTLGLEAE   89 (207)
T ss_pred             CCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHCCCCCC
T ss_conf             9999889999999999997249859999535789971001388999


No 316
>pfam00506 Flu_NP Influenza virus nucleoprotein.
Probab=21.26  E-value=56  Score=13.29  Aligned_cols=105  Identities=20%  Similarity=0.315  Sum_probs=59.6

Q ss_pred             HHHHHHHHHH---HHHHHHCCCCCCCCHHHH-HHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHH------HHHHHHC-C
Q ss_conf             4588999877---675430057868731456-888887778776677664349631312431466------6576519-7
Q gi|254780479|r   92 YTRARNLKKC---ADIIVKKYEGNFPHKVEI-LKKLPGIGDYTASAIVAIAFNHFAVVVDTNIER------IISRYFD-I  160 (356)
Q Consensus        92 y~Rar~l~~~---a~~i~~~~~g~~P~~~~~-l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~R------Vl~R~~~-~  160 (356)
                      |.|.|+|...   ++.+.---|..+|.-... -.+..|||.-.|.||.-|.-+    +-|-|..|      -+.|... +
T Consensus       156 yQR~ral~R~g~dp~l~sl~qGstlprRsga~g~a~kG~Gtlvaeair~IkRg----i~dr~~lR~~~t~~ayEr~~~~l  231 (506)
T pfam00506       156 YQRTRALVRTGMDPRLCSLMQGSTLPRRSGAAGAAIKGVGTLVAEAIRMIKRG----INDRNFLRGRRTRIAYERMCNIL  231 (506)
T ss_pred             HHHHHHHHHCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHC----CCCCCEECCCCCCCHHHHHHHHH
T ss_conf             99889998708987886553268777533332432111667799999998742----22100001253576899999986


Q ss_pred             CCCCCHHHHHHHHHHHCCCCCCCHHHHH-HHHHHH--HHHHCCCC
Q ss_conf             5457356899987410257852267899-999999--86320412
Q gi|254780479|r  161 IKPAPLYHKTIKNYARKITSTSRPGDFV-QAMMDL--GALICTSN  202 (356)
Q Consensus       161 ~~~~~~~~k~l~~~~~~~~~~~~~~~~n-QAlMdl--Ga~iC~p~  202 (356)
                      +.  +..-.....+++......+|++-- ++|.-|  .++|-+|+
T Consensus       232 k~--K~~~~~qrAl~dqV~~srNpg~A~iEdL~~lArs~lilrps  274 (506)
T pfam00506       232 KG--KFQTAAQRALMDQVRESRNPGNAEIEDLIFLARSALILRPS  274 (506)
T ss_pred             HC--CCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCC
T ss_conf             42--26757899999999853798732278999999877740510


No 317
>COG4892 Predicted heme/steroid binding protein [General function prediction only]
Probab=21.20  E-value=30  Score=15.26  Aligned_cols=35  Identities=26%  Similarity=0.501  Sum_probs=23.7

Q ss_pred             HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH
Q ss_conf             89998776754300578687314568888877787
Q gi|254780479|r   95 ARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDY  129 (356)
Q Consensus        95 ar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~y  129 (356)
                      ---||.+.+-+.++++..-|.-.+-|..+|-||.-
T Consensus        41 HqglhsaG~DLs~~~~~~aphg~eil~~~PvVG~L   75 (81)
T COG4892          41 HQGLHSAGKDLSSEFNSCAPHGMEILTSLPVVGAL   75 (81)
T ss_pred             CCCCCCCCHHHHHHHHHCCCCHHHHHHCCCHHHEE
T ss_conf             10100024137788763187606788627412101


No 318
>PRK04195 replication factor C large subunit; Provisional
Probab=21.16  E-value=57  Score=13.27  Aligned_cols=27  Identities=15%  Similarity=0.331  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHHHHHHHHCCCEEEECCC
Q ss_conf             888777877667766434963131243
Q gi|254780479|r  122 KLPGIGDYTASAIVAIAFNHFAVVVDT  148 (356)
Q Consensus       122 ~LpGiG~yta~ai~s~a~~~~~~~vD~  148 (356)
                      --||||.-|+.-+++=.+|-.+.=+++
T Consensus        47 GPpGvGKTT~a~~lAk~~g~~viElNA   73 (403)
T PRK04195         47 GPPGVGKTSLAHALANDYGWEVIELNA   73 (403)
T ss_pred             CCCCCCHHHHHHHHHHHHCCCEEEECC
T ss_conf             939987999999999984998599771


No 319
>pfam08440 Poty_PP Potyviridae polyprotein. This domain is found in polyproteins of the viral Potyviridae taxon.
Probab=21.00  E-value=38  Score=14.47  Aligned_cols=14  Identities=14%  Similarity=0.280  Sum_probs=8.2

Q ss_pred             HHHHHHHCCCEEEE
Q ss_conf             67766434963131
Q gi|254780479|r  132 SAIVAIAFNHFAVV  145 (356)
Q Consensus       132 ~ai~s~a~~~~~~~  145 (356)
                      +|.+|||||.|++.
T Consensus         1 AAflcFaYGLPvmt   14 (274)
T pfam08440         1 AAFLCFAYGLPVMT   14 (274)
T ss_pred             CCHHHHHCCCCCEE
T ss_conf             92787651998633


No 320
>COG2355 Zn-dependent dipeptidase, microsomal dipeptidase homolog [Amino acid transport and metabolism]
Probab=20.86  E-value=57  Score=13.23  Aligned_cols=90  Identities=14%  Similarity=0.169  Sum_probs=45.9

Q ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCC---CCCCCHHHHHH
Q ss_conf             873145688888777877667766434963131243146665765197545735689998741025---78522678999
Q gi|254780479|r  113 FPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKI---TSTSRPGDFVQ  189 (356)
Q Consensus       113 ~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~---~~~~~~~~~nQ  189 (356)
                      |=.+++.|..|--.|-    ..+.+.+|.+-..=||=..|   +-.|+   ++.++..+ ..++++   ++-.+.++  +
T Consensus       107 ~~~dl~~L~~~~~~Gv----R~lgltwn~~N~~g~g~~~~---~~~GL---s~~Gk~lV-~~~N~LgIiiDlSH~s~--k  173 (313)
T COG2355         107 LGDDLDKLELFHALGV----RSLGLTWNRDNLFGDGCYER---TGGGL---TPFGKELV-REMNELGIIIDLSHLSD--K  173 (313)
T ss_pred             CCCCHHHHHHHHHHCC----EEEEEEECCCCCCCCCCCCC---CCCCC---CHHHHHHH-HHHHHCCCEEEECCCCC--C
T ss_conf             0053899999998176----47876623677656756778---89987---87899999-99986597787223587--2


Q ss_pred             HHHHHHHHHCCCCCCCCC-CCCCCCCHHHH
Q ss_conf             999998632041235643-11000001343
Q gi|254780479|r  190 AMMDLGALICTSNKPLCP-LCPIQKNCLTF  218 (356)
Q Consensus       190 AlMdlGa~iC~p~~P~C~-~Cpl~~~C~~~  218 (356)
                      .++|--++   ++.|-|. .|-.+..|..-
T Consensus       174 t~~Dvl~~---s~~PviaSHSN~~al~~h~  200 (313)
T COG2355         174 TFWDVLDL---SKAPVVASHSNARALVDHP  200 (313)
T ss_pred             CHHHHHHC---CCCCEEEECCCCHHCCCCC
T ss_conf             29999854---5886687437724105888


No 321
>PRK13153 consensus
Probab=20.81  E-value=42  Score=14.20  Aligned_cols=11  Identities=36%  Similarity=0.821  Sum_probs=5.4

Q ss_pred             HHHHHHHHHHH
Q ss_conf             88887778776
Q gi|254780479|r  121 KKLPGIGDYTA  131 (356)
Q Consensus       121 ~~LpGiG~yta  131 (356)
                      +=|||||.+..
T Consensus        41 lIlPGVGsf~~   51 (203)
T PRK13153         41 LILPGVGAFGD   51 (203)
T ss_pred             EEECCCCCHHH
T ss_conf             89878883488


No 322
>cd02035 ArsA ArsA ATPase functionas as an efflux pump located on the inner membrane of the cell. This ATP-driven oxyanion pump catalyzes the extrusion of arsenite, antimonite and arsenate. Maintenance of a low intracellular concentration of oxyanion produces resistance to the toxic agents. The pump is composed of two subunits, the catalytic ArsA subunit and the membrane subunit ArsB, which are encoded by arsA and arsB genes respectively. Arsenic efflux in bacteria is catalyzed by either ArsB alone or by ArsAB complex. The ATP-coupled pump, however, is more efficient. ArsA is composed of two homologous halves, A1 and A2, connected by a short linker sequence.
Probab=20.77  E-value=58  Score=13.22  Aligned_cols=25  Identities=28%  Similarity=0.484  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHHHC---CCEEEECCCC
Q ss_conf             777877667766434---9631312431
Q gi|254780479|r  125 GIGDYTASAIVAIAF---NHFAVVVDTN  149 (356)
Q Consensus       125 GiG~yta~ai~s~a~---~~~~~~vD~N  149 (356)
                      |||..|.+|-++.++   |+++.+||+.
T Consensus         9 GVGKTTvAaalA~~lA~~G~rvLlvs~D   36 (217)
T cd02035           9 GVGKTTIAAATAVRLAEEGKKVLLVSTD   36 (217)
T ss_pred             CCCHHHHHHHHHHHHHHCCCEEEEEECC
T ss_conf             6619999999999999689949999589


No 323
>COG5133 Uncharacterized conserved protein [Function unknown]
Probab=20.72  E-value=43  Score=14.11  Aligned_cols=94  Identities=12%  Similarity=0.262  Sum_probs=52.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCCCEEEE--CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHH---
Q ss_conf             145688888777877667766434963131--243146665765197545735689998741025785226789999---
Q gi|254780479|r  116 KVEILKKLPGIGDYTASAIVAIAFNHFAVV--VDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQA---  190 (356)
Q Consensus       116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~--vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQA---  190 (356)
                      +..++-.||-.-..+-.-.++-......|+  -||++.-        +++.......+.++...+-++++|-.+.|-   
T Consensus        10 di~e~NqlP~r~e~~~dlll~ss~e~~~piese~~~~~~--------ee~d~id~qeiydLia~I~DPEHPltL~QLsVV   81 (181)
T COG5133          10 DIKEMNQLPFRAESEEDLLLGSSLEERHPIESENGVLQN--------EEPDLIDQQEIYDLIADIRDPEHPLTLEQLSVV   81 (181)
T ss_pred             CHHHHHCCCCCCCCHHHHHCCCCHHHCCCCCCCCCCCCC--------CCCCCCCHHHHHHHHHHCCCCCCCCCHHHCCEE
T ss_conf             356651387401212444314425441755334651126--------788876889999999860798887407660135


Q ss_pred             --------------HHHHHHHHCCCCCCCCCCCCCCCCHHH
Q ss_conf             --------------999986320412356431100000134
Q gi|254780479|r  191 --------------MMDLGALICTSNKPLCPLCPIQKNCLT  217 (356)
Q Consensus       191 --------------lMdlGa~iC~p~~P~C~~Cpl~~~C~~  217 (356)
                                    .|-.--.+-||.-|.|+.|.+-..|.-
T Consensus        82 ~lEdi~v~D~~~~n~i~~v~v~itPTIpHCSmatlIGLcIr  122 (181)
T COG5133          82 SLEDISVPDGIAPNVIRCVKVVITPTIPHCSMATLIGLCIR  122 (181)
T ss_pred             EECCCCCCCCCCCCEEEEEEEEECCCCCCHHHHHHHHHHEE
T ss_conf             20255346788877047899996358872068877643203


No 324
>COG1313 PflX Uncharacterized Fe-S protein PflX, homolog of pyruvate formate lyase activating proteins [General function prediction only]
Probab=20.58  E-value=52  Score=13.55  Aligned_cols=45  Identities=18%  Similarity=0.279  Sum_probs=24.2

Q ss_pred             HHHHHHHHCCCCCCCHHHHHHH--HHHHHHHHCCCCCCCCCCCCCCC
Q ss_conf             9998741025785226789999--99998632041235643110000
Q gi|254780479|r  169 KTIKNYARKITSTSRPGDFVQA--MMDLGALICTSNKPLCPLCPIQK  213 (356)
Q Consensus       169 k~l~~~~~~~~~~~~~~~~nQA--lMdlGa~iC~p~~P~C~~Cpl~~  213 (356)
                      ..+|.+-+.-++....+|...+  +||+-..|-...--.|..||-+.
T Consensus        38 ~elw~~H~~~~~~~~~~d~~d~~~lldlK~eia~~~Le~C~lCerrC   84 (335)
T COG1313          38 EELWRIHERKLRKLASGDIKDSPSLLDLKVEIAYRILEKCTLCERRC   84 (335)
T ss_pred             HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
T ss_conf             99999987640354466601206788889999998851367463004


No 325
>PRK09847 gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase; Provisional
Probab=20.57  E-value=58  Score=13.19  Aligned_cols=52  Identities=23%  Similarity=0.253  Sum_probs=30.0

Q ss_pred             HHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCH-HHHHHHHHHHHHHHHHHHHHH
Q ss_conf             7404634588999877675430057----------86-8731-456888887778776677664
Q gi|254780479|r   86 WAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHK-VEILKKLPGIGDYTASAIVAI  137 (356)
Q Consensus        86 w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~-~~~l~~LpGiG~yta~ai~s~  137 (356)
                      |+.+....|++-|++.|..|.++-+          |+ +-.. ..++...-..=+|.|+..-.+
T Consensus        75 W~~~~~~eR~~iL~~~a~~l~~~~~ela~l~~~E~GK~~~~a~~~ev~~~~~~~~~~a~~~~~~  138 (494)
T PRK09847         75 WSLSSPAKRKAVLNKLADLMEAHAEELALLETLDTGKPIRHSLRDDIPGAARAIRWYAEAIDKV  138 (494)
T ss_pred             CHHCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             0109999999999999999997999999999997290679999999999999999999988873


No 326
>TIGR01944 rnfB electron transport complex, RnfABCDGE type, B subunit; InterPro: IPR010207   The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation. A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized. This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase, but is distinct. This entry describes the B subunit.; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding, 0006118 electron transport.
Probab=20.50  E-value=58  Score=13.18  Aligned_cols=54  Identities=15%  Similarity=0.132  Sum_probs=19.9

Q ss_pred             HCCCCCCCCHHHHHHHHHHHHHHHHHHHHH--HHCCCEEEECCCCHHHHHHHHHCCC
Q ss_conf             005786873145688888777877667766--4349631312431466657651975
Q gi|254780479|r  107 KKYEGNFPHKVEILKKLPGIGDYTASAIVA--IAFNHFAVVVDTNIERIISRYFDII  161 (356)
Q Consensus       107 ~~~~g~~P~~~~~l~~LpGiG~yta~ai~s--~a~~~~~~~vD~Nv~RVl~R~~~~~  161 (356)
                      ++-+--+|.|.=.==--||=.|| |-||..  =+-=-+.++==-+|.|=|+-+++++
T Consensus        41 e~id~lLPQTQCGqCGypGC~PY-AeAia~NGea~In~C~PGG~a~~~~lA~~L~v~   96 (213)
T TIGR01944        41 ERIDALLPQTQCGQCGYPGCRPY-AEAIAENGEALINKCPPGGEAVILKLAELLGVE   96 (213)
T ss_pred             HHHHHHCCCCCCCCCCCCCCHHH-HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCC
T ss_conf             99986477554646788874158-999986078778788887078999999983889


No 327
>pfam02764 Diphtheria_T Diphtheria toxin, T domain. Central domain of diphtheria toxin is the translocation (T) domain. pH induced conformational change in this domain triggers insertion into the endosomal membrane and facilitates the transfer of the catalytic domain into the cytoplasm.
Probab=20.07  E-value=42  Score=14.21  Aligned_cols=100  Identities=23%  Similarity=0.374  Sum_probs=55.1

Q ss_pred             HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCC
Q ss_conf             97404634588999877675430057868731456888887778776677664349631312431466657651975457
Q gi|254780479|r   85 AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA  164 (356)
Q Consensus        85 ~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~  164 (356)
                      ..+|-.|-.-|-|   .|+.|-.+-...+-++-..|--|||||.     +|.||    ..+|.-|...+.+.-.++..- 
T Consensus        73 vfaganyaawavn---vaqvidsetadnlekttaalsilpgigs-----vmgia----dgavhhnteeivaqsialssl-  139 (180)
T pfam02764        73 VFAGANYAAWAVN---VAQVIDSETADNLEKTTAALSILPGIGS-----VMGIA----DGAVHHNTEEIVAQSIALSSL-  139 (180)
T ss_pred             CCCCCCHHHHHHH---HHHHHCCHHHHHHHHHHHHHHHCCCCCC-----HHEEC----CCCCCCCHHHHHHHHHHHHHH-
T ss_conf             3345513898876---9999730523248877788874557521-----20003----430004789999999999999-


Q ss_pred             CHHHHHHHHHHHCCCCCC-CHHHHHHHHHHHHHHHC
Q ss_conf             356899987410257852-26789999999986320
Q gi|254780479|r  165 PLYHKTIKNYARKITSTS-RPGDFVQAMMDLGALIC  199 (356)
Q Consensus       165 ~~~~k~l~~~~~~~~~~~-~~~~~nQAlMdlGa~iC  199 (356)
                       .. .+---+..++++.. -..+|.++++.|=..|-
T Consensus       140 -mv-aqaiplvgelvdigfaaynfvesiinlfqvvh  173 (180)
T pfam02764       140 -MV-AQAIPLVGELVDIGFAAYNFVESIINLFQVVH  173 (180)
T ss_pred             -HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             -99-86510556787631799999999999999986


No 328
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein; InterPro: IPR008769   Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon..
Probab=20.05  E-value=59  Score=13.12  Aligned_cols=23  Identities=17%  Similarity=0.455  Sum_probs=17.2

Q ss_pred             CCCEEEECCCCHHHHHHHHHCCCC
Q ss_conf             496313124314666576519754
Q gi|254780479|r  139 FNHFAVVVDTNIERIISRYFDIIK  162 (356)
Q Consensus       139 ~~~~~~~vD~Nv~RVl~R~~~~~~  162 (356)
                      +++=.-++|.-|.++|+|| +|..
T Consensus        75 w~~l~~~FD~R~~~~L~rL-~IPs   97 (121)
T TIGR01837        75 WDKLEKAFDERVEQALNRL-NIPS   97 (121)
T ss_pred             HHHHHHHHHHHHHHHHHHC-CCCC
T ss_conf             9889999999999999857-8998


No 329
>KOG2573 consensus
Probab=20.02  E-value=36  Score=14.67  Aligned_cols=31  Identities=35%  Similarity=0.517  Sum_probs=18.8

Q ss_pred             CCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCH-HHHHH
Q ss_conf             32658998774899789872998999999740463-45889
Q gi|254780479|r   57 VEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGY-YTRAR   96 (356)
Q Consensus        57 v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGy-y~Rar   96 (356)
                      |--.|.++|+-++.-         +.-++-=|||| ||||+
T Consensus       126 vR~hf~kl~K~L~~~---------d~~kaqLGLghsYSRaK  157 (498)
T KOG2573         126 VRKHFDKLMKGLDPG---------DLEKAQLGLGHSYSRAK  157 (498)
T ss_pred             HHHHHHHHHCCCCCC---------CHHHHHHCCCCHHHHHH
T ss_conf             999999997469975---------27888750100111545


No 330
>cd07086 ALDH_F7_AASADH-like NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase and related proteins. ALDH subfamily which includes the NAD+-dependent, alpha-aminoadipic semialdehyde dehydrogenase (AASADH, EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1, ALDH7B or delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH), and other similar sequences, such as the uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105).
Probab=20.01  E-value=60  Score=13.11  Aligned_cols=53  Identities=19%  Similarity=0.344  Sum_probs=33.8

Q ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------C-CCCCHHHHHHHHHHHHHHHHHHHH
Q ss_conf             9997404634588999877675430057----------8-687314568888877787766776
Q gi|254780479|r   83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------G-NFPHKVEILKKLPGIGDYTASAIV  135 (356)
Q Consensus        83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g-~~P~~~~~l~~LpGiG~yta~ai~  135 (356)
                      ...|+.+....||+-|.++|+.+.++.+          | -+.+...|+......-+|.|+...
T Consensus        48 ~~~W~~~~~~eRa~iL~~~a~~l~~~~~ela~l~~~e~GK~~~~a~~Ev~~~~~~~~~~a~~~~  111 (478)
T cd07086          48 FKEWRKVPAPRRGEIVRQIGEALRKKKEALGRLVSLEMGKILPEGLGEVQEMIDICDYAVGLSR  111 (478)
T ss_pred             HHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             9988039999999999999999997899999999998392899999999999999999998899


Done!