Query         026921
Match_columns 231
No_of_seqs    43 out of 45
Neff          2.4 
Searched_HMMs 46136
Date          Fri Mar 29 02:27:51 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/026921.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/026921hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF01985 CRS1_YhbY:  CRS1 / Yhb  99.1 3.7E-11 8.1E-16   89.4   3.5   69    2-70     16-84  (84)
  2 TIGR00253 RNA_bind_YhbY putati  97.3 0.00045 9.8E-09   53.9   5.1   70    3-72     17-86  (95)
  3 COG1534 Predicted RNA-binding   97.0 0.00098 2.1E-08   53.0   4.4   75    3-78     18-92  (97)
  4 PRK10343 RNA-binding protein Y  96.2   0.006 1.3E-07   48.0   4.0   70    3-72     19-88  (97)
  5 KOG1990 Poly(A)-specific exori  95.9  0.0036 7.7E-08   61.0   1.4   76    7-82    403-483 (564)
  6 PF04591 DUF596:  Protein of un  64.8     4.4 9.6E-05   31.0   1.7   24    6-29      6-34  (70)
  7 TIGR01444 fkbM_fam methyltrans  61.0     9.2  0.0002   28.7   2.8   35   27-61    108-142 (143)
  8 PF12747 DdrB:  DdrB-like prote  58.2     2.6 5.7E-05   35.3  -0.6   50   41-97     56-105 (123)
  9 PF01188 MR_MLE:  Mandelate rac  52.9      21 0.00045   25.0   3.3   35   18-52      2-37  (67)
 10 PF13304 AAA_21:  AAA domain; P  48.3      11 0.00025   27.9   1.5   39  117-155   244-287 (303)
 11 PF01755 Glyco_transf_25:  Glyc  46.6      21 0.00044   28.6   2.8   41  108-149    65-108 (200)
 12 PF01695 IstB_IS21:  IstB-like   45.4      29 0.00064   28.6   3.6   47  122-169   103-149 (178)
 13 PLN00410 U5 snRNP protein, DIM  40.6      72  0.0016   26.5   5.2   51   18-73     45-97  (142)
 14 cd03319 L-Ala-DL-Glu_epimerase  36.9      41  0.0009   29.5   3.4   37   16-53    166-203 (316)
 15 PRK11169 leucine-responsive tr  36.3      39 0.00084   27.4   3.0   68    5-73     49-123 (164)
 16 cd03005 PDI_a_ERp46 PDIa famil  34.7 1.3E+02  0.0028   20.9   5.1   50   19-72     39-92  (102)
 17 COG1484 DnaC DNA replication p  34.1      40 0.00086   29.6   2.9   60  110-170   137-209 (254)
 18 PRK13695 putative NTPase; Prov  33.1      58  0.0013   25.8   3.5   51  117-170    86-136 (174)
 19 PRK02714 O-succinylbenzoate sy  32.9      61  0.0013   29.1   3.9   42   15-56    150-192 (320)
 20 cd03318 MLE Muconate Lactonizi  32.4      55  0.0012   29.4   3.6   40   14-53    174-214 (365)
 21 cd03315 MLE_like Muconate lact  32.4      72  0.0016   27.3   4.1   39   16-54    117-156 (265)
 22 PF11830 DUF3350:  Domain of un  32.1      52  0.0011   24.5   2.8   24  108-131    22-45  (56)
 23 PF00626 Gelsolin:  Gelsolin re  31.7      55  0.0012   22.2   2.7   17   56-73     20-36  (76)
 24 PF14795 Leucyl-specific:  Leuc  30.7      21 0.00045   26.6   0.5   25   38-70     31-55  (56)
 25 TIGR01927 menC_gamma/gm+ o-suc  30.6      78  0.0017   28.3   4.2   38   15-52    141-179 (307)
 26 cd03329 MR_like_4 Mandelate ra  30.0      65  0.0014   29.2   3.6   38   16-53    177-215 (368)
 27 cd00308 enolase_like Enolase-s  29.4      75  0.0016   26.6   3.6   38   15-52     81-119 (229)
 28 cd03325 D-galactonate_dehydrat  29.2      67  0.0015   29.0   3.6   39   15-53    163-202 (352)
 29 PRK15072 bifunctional D-altron  28.4      67  0.0015   29.8   3.5   38   16-53    194-232 (404)
 30 cd05797 Ribosomal_L10 Ribosoma  27.1      79  0.0017   25.3   3.3   38   16-53      7-45  (157)
 31 smart00344 HTH_ASNC helix_turn  26.4      91   0.002   22.8   3.3   54    5-61     38-94  (108)
 32 cd03316 MR_like Mandelate race  25.4 1.1E+02  0.0024   27.0   4.2   39   15-53    176-215 (357)
 33 KOG1990 Poly(A)-specific exori  25.1      27 0.00059   34.6   0.4   76    6-81    194-272 (564)
 34 PF07728 AAA_5:  AAA domain (dy  25.0      27 0.00059   26.2   0.3   24  124-147    62-85  (139)
 35 PRK14017 galactonate dehydrata  25.0      90   0.002   28.5   3.7   38   16-53    165-203 (382)
 36 KOG0469 Elongation factor 2 [T  24.4      92   0.002   32.6   3.9   50   23-72    481-535 (842)
 37 PF09026 CENP-B_dimeris:  Centr  24.3      31 0.00067   28.2   0.5   10  216-225    60-69  (101)
 38 PTZ00051 thioredoxin; Provisio  23.5 2.6E+02  0.0057   19.4   5.2   51   18-72     40-90  (98)
 39 PF09248 DUF1965:  Domain of un  23.1      66  0.0014   24.8   2.0   24   10-33     32-55  (74)
 40 cd03326 MR_like_1 Mandelate ra  22.9 1.3E+02  0.0028   28.1   4.3   40   16-55    193-233 (385)
 41 cd03320 OSBS o-Succinylbenzoat  22.8 1.4E+02  0.0029   25.8   4.1   38   16-53    115-153 (263)
 42 PRK10645 divalent-cation toler  21.6      81  0.0018   25.3   2.4   42   30-73     13-56  (112)
 43 COG1062 AdhC Zn-dependent alco  21.6 2.4E+02  0.0052   27.6   5.8  104   16-161   242-355 (366)
 44 PF03091 CutA1:  CutA1 divalent  21.5      97  0.0021   24.3   2.7   34   39-72     10-45  (102)
 45 cd03327 MR_like_2 Mandelate ra  21.3 1.1E+02  0.0024   27.5   3.5   38   16-53    159-197 (341)
 46 cd06532 Glyco_transf_25 Glycos  21.2      67  0.0014   24.8   1.8   27  115-141    69-98  (128)
 47 PRK05105 O-succinylbenzoate sy  21.0 1.2E+02  0.0025   27.4   3.5   37   15-52    145-182 (322)
 48 PF12153 CAP18_C:  LPS binding   20.8      35 0.00075   22.4   0.1   14   41-54     13-26  (28)
 49 PF13401 AAA_22:  AAA domain; P  20.8 1.1E+02  0.0023   22.3   2.7   25  114-138    72-98  (131)
 50 PF07039 DUF1325:  SGF29 tudor-  20.7      77  0.0017   25.8   2.1   27  157-183    85-112 (130)
 51 PRK06512 thiamine-phosphate py  20.7 1.5E+02  0.0031   25.8   3.9   39   16-54     27-67  (221)
 52 cd02998 PDI_a_ERp38 PDIa famil  20.1 2.4E+02  0.0052   19.4   4.3   52   17-71     39-94  (105)

No 1  
>PF01985 CRS1_YhbY:  CRS1 / YhbY (CRM) domain;  InterPro: IPR001890 The CRM domain is an ~100-amino acid RNA-binding domain. The name chloroplast RNA splicing and ribosome maturation (CRM) has been suggested to reflect the functions established for the four characterised members of the family: Zea mays (Maize) CRS1 (Q9FYT6 from SWISSPROT), CAF1 (Q84N49 from SWISSPROT) and CAF2 (Q84N48 from SWISSPROT) proteins and the Escherichia coli protein YhbY (P0AGK4 from SWISSPROT). The CRM domain is found in eubacteria, archaea, and plants. The CRM domain is represented as a stand-alone protein in archaea and bacteria, and in single- and multi-domain proteins in plants. It has been suggested that prokaryotic CRM proteins existed as ribosome-associated proteins prior to the divergence of archaea and bacteria, and that they were co-opted in the plant lineage as RNA binding modules by incorporation into diverse protein contexts. Plant CRM domains are predicted to reside not only in the chloroplast, but also in the mitochondrion and the nucleo/cytoplasmic compartment. The diversity of the CRM domain family in plants suggests a diverse set of RNA targets [, ]. The CRM domain is a compact alpha/beta domain consisting of a four-stranded beta sheet and three alpha helices with an alpha-beta-alpha-beta-alpha-beta-beta topology. The beta sheet face is basic, consistent with a role in RNA binding. Proximal to the basic beta sheet face is another moiety that could contribute to nucleic acid recognition. Connecting strand beta1 and helix alpha2 is a loop with a six amino acid motif, GxxG flanked by large aliphatic residues, within which one 'x' is typically a basic residue [].   Escherichia coli YhbY is associated with pre-50S ribosomal subunits, which implies a function in ribosome assembly. GFP fused to a single-domain CRM protein from maize localises to the nucleolus, suggesting that an analogous activity may have been retained in plants []. A CRM domain containing protein in plant chloroplasts has been shown to function in group I and II intron splicing []. In vitro experiments with an isolated maize CRM domain have shown it to have RNA binding activity. These and other results suggest that the CRM domain evolved in the context of ribosome function prior to the divergence of Archaea and Bacteria, that this function has been maintained in extant prokaryotes, and that the domain was recruited to serve as an RNA binding module during the evolution of plant genomes []. YhbY has a fold similar to that of the C-terminal domain of translation initiation factor 3 (IF3C), which binds to 16S rRNA in the 30S ribosome [].; GO: 0003723 RNA binding; PDB: 1RQ8_A 1JO0_B 1LN4_A.
Probab=99.13  E-value=3.7e-11  Score=89.36  Aligned_cols=69  Identities=25%  Similarity=0.341  Sum_probs=61.4

Q ss_pred             chhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEe
Q 026921            2 LFFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWR   70 (231)
Q Consensus         2 l~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWR   70 (231)
                      |=|+..++|||++-++++.+++||+.+|||||.|.+-.+.|.+.+...|.+..+|.++...+.++++||
T Consensus        16 l~p~v~IGk~Glt~~vi~~i~~~l~~~eLvKVk~~~~~~~~~~~~~~~l~~~t~~~~V~~iG~~~vlyR   84 (84)
T PF01985_consen   16 LKPVVQIGKNGLTDGVIEEIDDALEKHELVKVKVLGNCREDRKEIAEQLAEKTGAEVVQVIGRTIVLYR   84 (84)
T ss_dssp             C--SEEE-TTSS-HHHHHHHHHHHHHHSEEEEEETT--HHHHHHHHHHHHHHHTEEEEEEETTEEEEEE
T ss_pred             CCCeEEECCCCCCHHHHHHHHHHHHhCCeeEEEEccCCHHHHHHHHHHHHHHhCCEEEEEECCEEEEEC
Confidence            347788999999999999999999999999999999999999999999999999999999999999998


No 2  
>TIGR00253 RNA_bind_YhbY putative RNA-binding protein, YhbY family. A combination of crystal structure, molecular modeling, and bioinformatic data together suggest that members of this family, including YhbY of E. coli, are RNA binding proteins.
Probab=97.30  E-value=0.00045  Score=53.87  Aligned_cols=70  Identities=17%  Similarity=0.256  Sum_probs=65.1

Q ss_pred             hhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921            3 FFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK   72 (231)
Q Consensus         3 ~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk   72 (231)
                      =|+...||||+--+++..+.+|++.+|||||---.-.++|.+.+...|-.-..|.++..-+..+++||..
T Consensus        17 ~p~v~IGK~Glt~~vi~ei~~aL~~hELIKVkvl~~~~~~~~e~a~~i~~~~~a~~Vq~iG~~~vlYR~~   86 (95)
T TIGR00253        17 KPVVLVGKNGLTEGVIKEIEQALEHRELIKVKVATEDREDKTLIAEALVKETGACNVQVIGKTIVLYRPT   86 (95)
T ss_pred             CCeEEECCCCCCHHHHHHHHHHHHhCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEEEEccEEEEEecC
Confidence            4677889999999999999999999999999877777889999999999999999999999999999985


No 3  
>COG1534 Predicted RNA-binding protein containing KH domain, possibly ribosomal protein [Translation, ribosomal structure and biogenesis]
Probab=97.02  E-value=0.00098  Score=52.97  Aligned_cols=75  Identities=19%  Similarity=0.259  Sum_probs=67.8

Q ss_pred             hhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecCCCCCCC
Q 026921            3 FFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGKDWKSMY   78 (231)
Q Consensus         3 ~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk~~k~~~   78 (231)
                      -|+...+|||+=-++++.+.+|.+.+||||+.--.=...|-+.|-.+|-+-+.|-++..-+..+++|| .+|....
T Consensus        18 ~piv~IGk~Glte~vi~Ei~~aL~~reLIKVkvl~~~~edr~eia~~l~~~~~a~lVqviG~~~vlyr-~~~e~~~   92 (97)
T COG1534          18 KPIVQIGKNGLTEGVIKEIDRALEARELIKVKVLQNAREDKKEIAEALAEETGAELVQVIGKTLVLYR-ESKEKRK   92 (97)
T ss_pred             CceEEecCCccCHHHHHHHHHHHHhCCcEEEEeeccchhhHHHHHHHHHHHhCCEEeeeeeeEEEEEe-cCccccc
Confidence            46778899999999999999999999999998777777899999999999999999999999999999 6665533


No 4  
>PRK10343 RNA-binding protein YhbY; Provisional
Probab=96.24  E-value=0.006  Score=48.01  Aligned_cols=70  Identities=11%  Similarity=0.153  Sum_probs=65.2

Q ss_pred             hhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921            3 FFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK   72 (231)
Q Consensus         3 ~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk   72 (231)
                      =|+...||||+=-+++..+.+|++.+|||||--..-.+.|-+.+...|-+-..|-++.-=+..+++||=.
T Consensus        19 ~Pvv~IGk~Glt~~vi~ei~~aL~~hELIKvkv~~~~~~~~~e~~~~i~~~~~ae~Vq~IG~~~vlYR~~   88 (97)
T PRK10343         19 KPVVLLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETKTLIVEAIVRETGACNVQVIGKTLVLYRPT   88 (97)
T ss_pred             CCeEEECCCCCCHHHHHHHHHHHHHCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEeeeCcEEEEEecC
Confidence            4677889999999999999999999999999988888889999999999999999999999999999965


No 5  
>KOG1990 consensus Poly(A)-specific exoribonuclease PARN [Replication, recombination and repair]
Probab=95.86  E-value=0.0036  Score=60.96  Aligned_cols=76  Identities=25%  Similarity=0.465  Sum_probs=68.1

Q ss_pred             hhccCceEeehhHHHHHhhccCceEEEcCCCCCc-hhHHHhhhhhhcccCeeEEeec-C---ceEEEEecCCCCCCCCCC
Q 026921            7 IAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHA-SDYKKLGAKLKELVPCVLLSFD-D---EQILMWRGKDWKSMYPEP   81 (231)
Q Consensus         7 kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~-sDyKKIGaKLrdLVpCvllsF~-~---eqIlmWRGk~~k~~~~~~   81 (231)
                      -+++.|+|..++.+++-+|+..|+|++.|...+. +.-+..+..+..+...++++.+ +   .+|+.+||++|..+..-.
T Consensus       403 ~~~rrg~f~g~i~n~~l~wk~~e~~k~i~~~~~~~~~~~~~a~~le~esg~~~v~~~~~~~~~ai~~yr~k~y~~p~~l~  482 (564)
T KOG1990|consen  403 LSGRRGVFDGVIENMHLHWKSRELVKVICKEKNLPSQVKQYASALERESGGILVSIDKNPKGYAIIAYRGKNYDRPTSLR  482 (564)
T ss_pred             ccCCcccccceeecchhhhhhcccceeeeccccccHHHHHHHHHHHHHhCCceeeeccCCchhhHHHhhhhhccCCcccC
Confidence            3678999999999999999999999999999999 9999999999999999999993 3   349999999999887533


Q ss_pred             C
Q 026921           82 P   82 (231)
Q Consensus        82 ~   82 (231)
                      |
T Consensus       483 P  483 (564)
T KOG1990|consen  483 P  483 (564)
T ss_pred             c
Confidence            3


No 6  
>PF04591 DUF596:  Protein of unknown function, DUF596;  InterPro: IPR007670 This family contains several uncharacterised proteins.; PDB: 2O5H_B.
Probab=64.83  E-value=4.4  Score=30.97  Aligned_cols=24  Identities=29%  Similarity=0.418  Sum_probs=18.6

Q ss_pred             hhhccCceEee-----hhHHHHHhhccCc
Q 026921            6 HIAAKNGVYLT-----LVRDVRNAFEGSS   29 (231)
Q Consensus         6 ~kL~kNGvY~~-----lV~~VrdAF~~~e   29 (231)
                      .||+|||.|+.     +|++.|.+|-.++
T Consensus         6 lKLa~~g~fl~Gt~eEqve~fr~~fP~sd   34 (70)
T PF04591_consen    6 LKLARKGEFLEGTTEEQVEMFRKSFPASD   34 (70)
T ss_dssp             EEEEETTEEE---HHHHHHHHHHH--SSS
T ss_pred             EEeccCCEEecCCHHHHHHHHHHHCCCCh
Confidence            59999999985     7999999998754


No 7  
>TIGR01444 fkbM_fam methyltransferase, FkbM family. Members of this family are characterized by two well-conserved short regions separated by a variable in both sequence and length. The first of the two regions is found in a large number of proteins outside this subfamily, a number of which have been characterized as methyltransferases. One member of the present family, FkbM, was shown to be required for a specific methylation in the biosynthesis of the immunosuppressant FK506 in Streptomyces strain MA6548.
Probab=61.04  E-value=9.2  Score=28.70  Aligned_cols=35  Identities=17%  Similarity=0.275  Sum_probs=30.2

Q ss_pred             cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEee
Q 026921           27 GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSF   61 (231)
Q Consensus        27 ~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF   61 (231)
                      .-.++||||+|..-.=.+.+...|+.-+|.|++-+
T Consensus       108 ~i~~lKiDiEG~E~~vL~g~~~~l~~~~~~i~~E~  142 (143)
T TIGR01444       108 KVDLLKIDVEGAELEVLRGAKETLLRKRPGIVLEV  142 (143)
T ss_pred             CCCEEEEeCCCchHHHHhChHHHHHHhCCeEEEEe
Confidence            45799999999999999999888888899888754


No 8  
>PF12747 DdrB:  DdrB-like protein;  InterPro: IPR024305 This family includes the Deinococcus DdrB protein, a ssDNA binding protein induced by ionizing radiation which has a role in genome reconstitution []. The structure of DdrB is known []. This family also includes some possibly distantly related cyanobacterial proteins.; PDB: 3KDV_D.
Probab=58.24  E-value=2.6  Score=35.28  Aligned_cols=50  Identities=20%  Similarity=0.353  Sum_probs=31.4

Q ss_pred             hhHHHhhhhhhcccCeeEEeecCceEEEEecCCCCCCCCCCCCCCCcccccccCCCC
Q 026921           41 SDYKKLGAKLKELVPCVLLSFDDEQILMWRGKDWKSMYPEPPSFSNPVDLDIAGDAD   97 (231)
Q Consensus        41 sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk~~k~~~~~~~~~~ip~~~~~~~~~~   97 (231)
                      =|...|||+..-      -+=++++.++|||+.||++-..+-. -.|...+-.-+.+
T Consensus        56 FDW~lIGArV~~------t~~eG~~~V~~~G~~ykRR~l~~~d-klpaaIkySrgak  105 (123)
T PF12747_consen   56 FDWALIGARVIS------TDGEGEEVVQYRGHAYKRRELEAVD-KLPAAIKYSRGAK  105 (123)
T ss_dssp             B-GGGGTEE-EE------EESSSEEEEEETTEEEEEEEETTTE-E---SSS-SSSSE
T ss_pred             cChhhccceecc------cCCCceEEEEEcCeeeeeecchhhc-cccceeEEccccc
Confidence            467889998543      4558999999999999999866433 3444444444433


No 9  
>PF01188 MR_MLE:  Mandelate racemase / muconate lactonizing enzyme, C-terminal domain;  InterPro: IPR013342 Mandelate racemase 5.1.2.2 from EC (MR) and muconate lactonising enzyme 5.5.1.1 from EC (MLE) are two bacterial enzymes involved in aromatic acid catabolism. They catalyse mechanistically distinct reactions yet they are related at the level of their primary, quaternary (homooctamer) and tertiary structures [, ]. A number of other proteins also seem to be evolutionary related to these two enzymes. These include, various plasmid-encoded chloromuconate cycloisomerases 5.5.1.7 from EC, Escherichia coli protein rspA [], E. coli bifunctional DGOA protein, E. coli hypothetical proteins ycjG, yfaW and yidU and a hypothetical protein from Streptomyces ambofaciens []. This entry represents the C-terminal region of these proteins.; PDB: 3QLD_B 3CYJ_C 2QDD_B 3FVD_A 3H7V_A 2OZT_A 3NXL_A 1JDF_D 1JCT_B 1EC9_C ....
Probab=52.88  E-value=21  Score=24.95  Aligned_cols=35  Identities=20%  Similarity=0.281  Sum_probs=30.0

Q ss_pred             hHHHHHhhccCceEEEcCC-CCCchhHHHhhhhhhc
Q 026921           18 VRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKLKE   52 (231)
Q Consensus        18 V~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKLrd   52 (231)
                      |+.||+++...--+++||- +++..+..+++..|++
T Consensus         2 i~avr~~~g~~~~l~vDan~~~~~~~a~~~~~~l~~   37 (67)
T PF01188_consen    2 IRAVREAVGPDIDLMVDANQAWTLEEAIRLARALED   37 (67)
T ss_dssp             HHHHHHHHSTTSEEEEE-TTBBSHHHHHHHHHHHGG
T ss_pred             HHHHHHhhCCCCeEEEECCCCCCHHHHHHHHHHcCh
Confidence            6889999988888999995 5788899999999998


No 10 
>PF13304 AAA_21:  AAA domain; PDB: 3QKS_B 1US8_B 1F2U_B 1F2T_B 3QKT_A 1II8_B 3QKR_B 3QKU_A.
Probab=48.26  E-value=11  Score=27.87  Aligned_cols=39  Identities=26%  Similarity=0.364  Sum_probs=27.2

Q ss_pred             HHHHHHHHHhhc---ccceeccC--CCCchhHHHhhhhhhhhhh
Q 026921          117 MISLWKRAIEST---KALVLDEI--NLGPDDLLKKVEEFEGISQ  155 (231)
Q Consensus       117 M~~LWk~AieSs---~Al~Lde~--~L~PD~LL~~VEef~~~Sq  155 (231)
                      ++.|...++...   ..+++||.  +|-|...-+-++-|..+++
T Consensus       244 ~l~l~~~l~~~~~~~~illiDEpE~~LHp~~q~~l~~~l~~~~~  287 (303)
T PF13304_consen  244 LLSLLSLLLSAKKNGSILLIDEPENHLHPSWQRKLIELLKELSK  287 (303)
T ss_dssp             HHHHHHHHHTTTTT-SEEEEESSSTTSSHHHHHHHHHHHHHTGG
T ss_pred             HHHHHHHHhCcCCCCeEEEecCCcCCCCHHHHHHHHHHHHhhCc
Confidence            334444444444   77999998  8999988888877765554


No 11 
>PF01755 Glyco_transf_25:  Glycosyltransferase family 25 (LPS biosynthesis protein);  InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=46.58  E-value=21  Score=28.64  Aligned_cols=41  Identities=32%  Similarity=0.552  Sum_probs=27.0

Q ss_pred             ccccccchhHHHHHHHHHhhc--ccceeccC-CCCchhHHHhhhh
Q 026921          108 QGTIRSSPKMISLWKRAIEST--KALVLDEI-NLGPDDLLKKVEE  149 (231)
Q Consensus       108 ~~~~~~sPkM~~LWk~AieSs--~Al~Lde~-~L~PD~LL~~VEe  149 (231)
                      ++.+--+--=+++|+++++++  -||+|+|- -+.++ +.+.+++
T Consensus        65 ~gEiGC~lSH~~~w~~~v~~~~~~~lIlEDDv~~~~~-f~~~l~~  108 (200)
T PF01755_consen   65 PGEIGCALSHIKAWQRIVDSGLEYALILEDDVIFDPD-FKEFLEE  108 (200)
T ss_pred             cceEeehhhHHHHHHHHHHcCCCeEEEEecccccccc-HHHHHHH
Confidence            333444455689999999999  78888765 55554 4444443


No 12 
>PF01695 IstB_IS21:  IstB-like ATP binding protein;  InterPro: IPR002611 Proteins in this entry contain an ATP/GTP binding P-loop motif. They are found associated with IS21 family insertion sequences []. Functionally they have not been characterised, but they may be involved in transposition [].; GO: 0005524 ATP binding; PDB: 3EC2_A 3ECC_A 2W58_A 2QGZ_A.
Probab=45.36  E-value=29  Score=28.64  Aligned_cols=47  Identities=21%  Similarity=0.315  Sum_probs=32.9

Q ss_pred             HHHHhhcccceeccCCCCchhHHHhhhhhhhhhhhhhcccCeEEeecC
Q 026921          122 KRAIESTKALVLDEINLGPDDLLKKVEEFEGISQAAEHSYPALVLSRE  169 (231)
Q Consensus       122 k~AieSs~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~pAl~~s~~  169 (231)
                      .+.+.....|+|||+|..+-.=-..--=|+.+.+-.++. |-++.||-
T Consensus       103 ~~~l~~~dlLilDDlG~~~~~~~~~~~l~~ii~~R~~~~-~tIiTSN~  149 (178)
T PF01695_consen  103 LKRLKRVDLLILDDLGYEPLSEWEAELLFEIIDERYERK-PTIITSNL  149 (178)
T ss_dssp             HHHHHTSSCEEEETCTSS---HHHHHCTHHHHHHHHHT--EEEEEESS
T ss_pred             cCccccccEecccccceeeecccccccchhhhhHhhccc-CeEeeCCC
Confidence            345667899999999988744333444588999999986 88888874


No 13 
>PLN00410 U5 snRNP protein, DIM1 family; Provisional
Probab=40.56  E-value=72  Score=26.50  Aligned_cols=51  Identities=16%  Similarity=0.359  Sum_probs=33.6

Q ss_pred             hHHHHHhhcc-CceEEEcCCCCCchhHHHhhhhhhcccCe-eEEeecCceEEEEecCC
Q 026921           18 VRDVRNAFEG-SSLVKVNCKGMHASDYKKLGAKLKELVPC-VLLSFDDEQILMWRGKD   73 (231)
Q Consensus        18 V~~VrdAF~~-~elVrIDC~gl~~sDyKKIGaKLrdLVpC-vllsF~~eqIlmWRGk~   73 (231)
                      ++.+-+.+.. ..+++||.+..+     .+..+++=-.|| +++-|.+.++.+|||..
T Consensus        45 l~~la~~~~~~~~~~kVDVDe~~-----dla~~y~I~~~~t~~~ffk~g~~~vd~~tG   97 (142)
T PLN00410         45 LASVAETIKNFAVIYLVDITEVP-----DFNTMYELYDPCTVMFFFRNKHIMIDLGTG   97 (142)
T ss_pred             HHHHHHHcCCceEEEEEECCCCH-----HHHHHcCccCCCcEEEEEECCeEEEEEecc
Confidence            4455555554 667888888543     455555444344 44488999999999965


No 14 
>cd03319 L-Ala-DL-Glu_epimerase L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides. The genomic context and the substrate specificity of characterized members of this family from E.coli and B.subtilis indicates a possible role in the metabolism of the murein peptide of peptidoglycan, of which L-Ala-D-Glu is a component. L-Ala-D/L-Glu epimerase is a member of the enolase-superfamily, which is characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=36.90  E-value=41  Score=29.47  Aligned_cols=37  Identities=24%  Similarity=0.381  Sum_probs=30.4

Q ss_pred             ehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdL   53 (231)
                      ..|+.||+++. .--+++|| .+.+..+..+++..|+++
T Consensus       166 ~~v~~lr~~~g-~~~l~vD~n~~~~~~~A~~~~~~l~~~  203 (316)
T cd03319         166 ERIRAIREAAP-DARLRVDANQGWTPEEAVELLRELAEL  203 (316)
T ss_pred             HHHHHHHHhCC-CCeEEEeCCCCcCHHHHHHHHHHHHhc
Confidence            45788899988 66789998 458888999999999885


No 15 
>PRK11169 leucine-responsive transcriptional regulator; Provisional
Probab=36.33  E-value=39  Score=27.44  Aligned_cols=68  Identities=15%  Similarity=0.152  Sum_probs=45.8

Q ss_pred             hhhhccCce---EeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeE----EeecCceEEEEecCC
Q 026921            5 LHIAAKNGV---YLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVL----LSFDDEQILMWRGKD   73 (231)
Q Consensus         5 l~kL~kNGv---Y~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvl----lsF~~eqIlmWRGk~   73 (231)
                      +-||.++||   |.-+|+--+--|...-+|.|.|.+-.+..+.+++..|+. .|.|.    ++=+..-++.=+.++
T Consensus        49 i~rL~~~GvI~~~~~~v~p~~lg~~~~a~v~i~~~~~~~~~~~~~~~~l~~-~p~V~~~~~vtG~~d~~l~v~~~~  123 (164)
T PRK11169         49 VRRLERQGFIQGYTALLNPHYLDASLLVFVEITLNRGAPDVFEQFNAAVQK-LEEIQECHLVSGDFDYLLKTRVPD  123 (164)
T ss_pred             HHHHHHCCCeEEEEEEECHHHhCCCEEEEEEEEEcCCChHHHHHHHHHHhc-CcceeeeeeecCCCCEEEEEEECC
Confidence            568899997   667776666566556679999987777778899988887 45544    332444444434443


No 16 
>cd03005 PDI_a_ERp46 PDIa family, endoplasmic reticulum protein 46 (ERp46) subfamily; ERp46 is an ER-resident protein containing three redox active TRX domains. Yeast complementation studies show that ERp46 can substitute for protein disulfide isomerase (PDI) function in vivo. It has been detected in many tissues, however, transcript and protein levels do not correlate in all tissues, suggesting regulation at a posttranscriptional level. An identical protein, named endoPDI, has been identified as an endothelial PDI that is highly expressed in the endothelium of tumors and hypoxic lesions. It has a protective effect on cells exposed to hypoxia.
Probab=34.68  E-value=1.3e+02  Score=20.91  Aligned_cols=50  Identities=20%  Similarity=0.380  Sum_probs=31.5

Q ss_pred             HHHHHhhc----cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921           19 RDVRNAFE----GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK   72 (231)
Q Consensus        19 ~~VrdAF~----~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk   72 (231)
                      +.+.+.+.    .-.+++|||..-. .-+++.|.   .-+|.+++--++..+...+|.
T Consensus        39 ~~~~~~~~~~~~~~~~~~vd~~~~~-~~~~~~~v---~~~Pt~~~~~~g~~~~~~~G~   92 (102)
T cd03005          39 EQLAKKFNNENPSVKIAKVDCTQHR-ELCSEFQV---RGYPTLLLFKDGEKVDKYKGT   92 (102)
T ss_pred             HHHHHHHhccCCcEEEEEEECCCCh-hhHhhcCC---CcCCEEEEEeCCCeeeEeeCC
Confidence            34444444    3678999998643 44566554   348987777666656566664


No 17 
>COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair]
Probab=34.08  E-value=40  Score=29.61  Aligned_cols=60  Identities=22%  Similarity=0.260  Sum_probs=49.8

Q ss_pred             ccccchhHHHHHHHHHh-------------hcccceeccCCCCchhHHHhhhhhhhhhhhhhcccCeEEeecCC
Q 026921          110 TIRSSPKMISLWKRAIE-------------STKALVLDEINLGPDDLLKKVEEFEGISQAAEHSYPALVLSRED  170 (231)
Q Consensus       110 ~~~~sPkM~~LWk~Aie-------------Ss~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~pAl~~s~~~  170 (231)
                      +..+.|.+++-++.|..             ....|+|||+|--|-.=-.+.+=|..|+|=.+-..+ +++||-.
T Consensus       137 ~f~~~~el~~~Lk~~~~~~~~~~~l~~~l~~~dlLIiDDlG~~~~~~~~~~~~~q~I~~r~~~~~~-~~tsN~~  209 (254)
T COG1484         137 LFITAPDLLSKLKAAFDEGRLEEKLLRELKKVDLLIIDDIGYEPFSQEEADLLFQLISRRYESRSL-IITSNLS  209 (254)
T ss_pred             EEEEHHHHHHHHHHHHhcCchHHHHHHHhhcCCEEEEecccCccCCHHHHHHHHHHHHHHHhhccc-eeecCCC
Confidence            56677888888877655             345799999999998888889999999999999999 8888743


No 18 
>PRK13695 putative NTPase; Provisional
Probab=33.11  E-value=58  Score=25.79  Aligned_cols=51  Identities=20%  Similarity=0.351  Sum_probs=36.3

Q ss_pred             HHHHHHHHHhhcccceeccCCCCchhHHHhhhhhhhhhhhhhcccCeEEeecCC
Q 026921          117 MISLWKRAIESTKALVLDEINLGPDDLLKKVEEFEGISQAAEHSYPALVLSRED  170 (231)
Q Consensus       117 M~~LWk~AieSs~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~pAl~~s~~~  170 (231)
                      +..+=+++++....++|||+  ||.+.+. -.-++.+.++.+.--|-+++++..
T Consensus        86 ~~~l~~~~l~~~~~lllDE~--~~~e~~~-~~~~~~l~~~~~~~~~~i~v~h~~  136 (174)
T PRK13695         86 GIPALERALEEADVIIIDEI--GKMELKS-PKFVKAVEEVLDSEKPVIATLHRR  136 (174)
T ss_pred             HHHHHHhccCCCCEEEEECC--Ccchhhh-HHHHHHHHHHHhCCCeEEEEECch
Confidence            44555677888999999997  5655555 334677888887667777777754


No 19 
>PRK02714 O-succinylbenzoate synthase; Provisional
Probab=32.89  E-value=61  Score=29.06  Aligned_cols=42  Identities=7%  Similarity=0.177  Sum_probs=35.3

Q ss_pred             eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcccCe
Q 026921           15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKELVPC   56 (231)
Q Consensus        15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdLVpC   56 (231)
                      +..|+.||++|...--+|+|+ ++.+..+..+++..|++|.+.
T Consensus       150 ~~~v~air~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~l~~~  192 (320)
T PRK02714        150 LKIFEQLLERLPAGAKLRLDANGGLSLEEAKRWLQLCDRRLSG  192 (320)
T ss_pred             HHHHHHHHHhcCCCCEEEEECCCCCCHHHHHHHHHHHhhccCC
Confidence            356889999998888899999 569999999999999986443


No 20 
>cd03318 MLE Muconate Lactonizing Enzyme (MLE), an homooctameric enzyme, catalyses the conversion of cis,cis-muconate (CCM) to muconolactone (ML) in the catechol branch of the beta-ketoadipate pathway. This pathway is used in soil microbes to breakdown lignin-derived aromatics, catechol and protocatechuate, to citric acid cycle intermediates. Some bacterial species are also capable of dehalogenating chloroaromatic compounds by the action of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs are members of the enolase superfamily characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and that is stabilized by coordination to the essential Mg2+ ion.
Probab=32.40  E-value=55  Score=29.37  Aligned_cols=40  Identities=18%  Similarity=0.186  Sum_probs=33.1

Q ss_pred             EeehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921           14 YLTLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL   53 (231)
Q Consensus        14 Y~~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL   53 (231)
                      .+..|+.||++|...--++|||-+ ++..+..+++.+|.++
T Consensus       174 d~~~v~avr~~~g~~~~l~iDaN~~~~~~~A~~~~~~l~~~  214 (365)
T cd03318         174 DLAHVEAIAKALGDRASVRVDVNQAWDESTAIRALPRLEAA  214 (365)
T ss_pred             HHHHHHHHHHHcCCCcEEEEECCCCCCHHHHHHHHHHHHhc
Confidence            345678899999877778999954 9999999999999885


No 21 
>cd03315 MLE_like Muconate lactonizing enzyme (MLE) like subgroup of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and residues that can function as general acid/base catalysts, a Lys-X-Lys motif and another conserved lysine. Despite these conserved residues, the members of the MLE subgroup, like muconate lactonizing enzyme, o-succinylbenzoate synthase (OSBS) and N-acylamino acid racemase (NAAAR), catalyze different reactions.
Probab=32.39  E-value=72  Score=27.27  Aligned_cols=39  Identities=15%  Similarity=0.204  Sum_probs=31.7

Q ss_pred             ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhccc
Q 026921           16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKELV   54 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdLV   54 (231)
                      ..|+.||+++...-.+++||.+ .+..+..+++..|.++-
T Consensus       117 ~~v~~vr~~~g~~~~l~vDan~~~~~~~a~~~~~~l~~~~  156 (265)
T cd03315         117 AVVAALREAVGDDAELRVDANRGWTPKQAIRALRALEDLG  156 (265)
T ss_pred             HHHHHHHHhcCCCCEEEEeCCCCcCHHHHHHHHHHHHhcC
Confidence            4578889998777788999854 88899999999998853


No 22 
>PF11830 DUF3350:  Domain of unknown function (DUF3350);  InterPro: IPR021785  This domain is functionally uncharacterised. This domain is found in eukaryotes. This presumed domain is typically between 50 to 64 amino acids in length. 
Probab=32.14  E-value=52  Score=24.48  Aligned_cols=24  Identities=25%  Similarity=0.517  Sum_probs=19.1

Q ss_pred             ccccccchhHHHHHHHHHhhcccc
Q 026921          108 QGTIRSSPKMISLWKRAIESTKAL  131 (231)
Q Consensus       108 ~~~~~~sPkM~~LWk~AieSs~Al  131 (231)
                      +..-+++-..-.||+.||..-..|
T Consensus        22 ~~~krt~eelR~LWrkAI~QqIlL   45 (56)
T PF11830_consen   22 EKKKRTREELRELWRKAIHQQILL   45 (56)
T ss_pred             cccccCHHHHHHHHHHHHHHHHHH
Confidence            556677888999999999876554


No 23 
>PF00626 Gelsolin:  Gelsolin repeat;  InterPro: IPR007123 Gelsolin is a cytoplasmic, calcium-regulated, actin-modulating protein that binds to the barbed ends of actin filaments, preventing monomer exchange (end-blocking or capping) []. It can promote nucleation (the assembly of monomers into filaments), as well as sever existing filaments. In addition, this protein binds with high affinity to fibronectin. Plasma gelsolin and cytoplasmic gelsolin are derived from a single gene by alternate initiation sites and differential splicing. Sequence comparisons indicate an evolutionary relationship between gelsolin, villin, fragmin and severin []. Six large repeating segments occur in gelsolin and villin, and 3 similar segments in severin and fragmin. While the multiple repeats have yet to be related to any known function of the actin-severing proteins, the superfamily appears to have evolved from an ancestral sequence of 120 to 130 amino acid residues [].; PDB: 3FG6_F 1RGI_G 2FGH_A 1D0N_B 3EGD_B 2NUP_B 2NUT_B 3EGX_B 1JHW_A 1J72_A ....
Probab=31.65  E-value=55  Score=22.22  Aligned_cols=17  Identities=35%  Similarity=0.903  Sum_probs=12.3

Q ss_pred             eeEEeecCceEEEEecCC
Q 026921           56 CVLLSFDDEQILMWRGKD   73 (231)
Q Consensus        56 CvllsF~~eqIlmWRGk~   73 (231)
                      |.+|.- +..|.+|.|+.
T Consensus        20 ~yIld~-~~~i~vW~G~~   36 (76)
T PF00626_consen   20 CYILDC-GYEIFVWVGKK   36 (76)
T ss_dssp             EEEEEE-SSEEEEEEHTT
T ss_pred             EEEEEe-CCCcEEEEecc
Confidence            444444 44999999998


No 24 
>PF14795 Leucyl-specific:  Leucine-tRNA synthetase-specific domain; PDB: 1OBC_A 2BTE_A 2V0G_A 2BYT_A 1OBH_A 1H3N_A 2V0C_A.
Probab=30.70  E-value=21  Score=26.56  Aligned_cols=25  Identities=28%  Similarity=0.729  Sum_probs=18.7

Q ss_pred             CCchhHHHhhhhhhcccCeeEEeecCceEEEEe
Q 026921           38 MHASDYKKLGAKLKELVPCVLLSFDDEQILMWR   70 (231)
Q Consensus        38 l~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWR   70 (231)
                      |...|.||+||.||.        -++.++-+|+
T Consensus        31 Ls~ee~~KmGAELR~--------hedG~~h~WK   55 (56)
T PF14795_consen   31 LSLEEVKKMGAELRP--------HEDGTLHFWK   55 (56)
T ss_dssp             EEHHHHHHTT-EEEE---------TTSSEEEEE
T ss_pred             ccHHHHHhhchhccc--------CCCCeEeecC
Confidence            677899999999986        4667777774


No 25 
>TIGR01927 menC_gamma/gm+ o-succinylbenzoic acid (OSB) synthetase. This model describes the enzyme o-succinylbenzoic acid synthetase (menC) that is involved in one of the steps of the menaquinone biosynthesis pathway. It takes SHCHC and makes it into 2-succinylbenzoate. Included in this model are gamma proteobacteria and archaea. Many of the com-names of the proteins identified by the model are identified as O-succinylbenzoyl-CoA synthase in error.
Probab=30.62  E-value=78  Score=28.29  Aligned_cols=38  Identities=13%  Similarity=0.211  Sum_probs=32.2

Q ss_pred             eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhc
Q 026921           15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKE   52 (231)
Q Consensus        15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrd   52 (231)
                      +..|+.||+++...--+|+|| ++.+..+..+++..|.+
T Consensus       141 ~~~v~~vr~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~  179 (307)
T TIGR01927       141 GMLVNLLLEALPDKAELRLDANGGLSPDEAQQFLKALDP  179 (307)
T ss_pred             HHHHHHHHHHcCCCCeEEEeCCCCCCHHHHHHHHHhccc
Confidence            356888999997767899999 66999999999999986


No 26 
>cd03329 MR_like_4 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 4. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues,  a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=29.99  E-value=65  Score=29.19  Aligned_cols=38  Identities=24%  Similarity=0.205  Sum_probs=31.4

Q ss_pred             ehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdL   53 (231)
                      ..|+.||+++...--+++|| ++.+..+..++...|.++
T Consensus       177 ~~i~~vR~~~G~~~~l~vDan~~~~~~~A~~~~~~l~~~  215 (368)
T cd03329         177 KACLAVREAVGPDMRLMHDGAHWYSRADALRLGRALEEL  215 (368)
T ss_pred             HHHHHHHHHhCCCCeEEEECCCCcCHHHHHHHHHHhhhc
Confidence            34678899998877889999 568999999999888874


No 27 
>cd00308 enolase_like Enolase-superfamily, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Enolase superfamily contains different enzymes, like enolases, glutarate-, fucanate- and galactonate dehydratases, o-succinylbenzoate synthase, N-acylamino acid racemase, L-alanine-DL-glutamate epimerase, mandelate racemase, muconate lactonizing enzyme and 3-methylaspartase.
Probab=29.37  E-value=75  Score=26.57  Aligned_cols=38  Identities=26%  Similarity=0.385  Sum_probs=34.0

Q ss_pred             eehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhc
Q 026921           15 LTLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKE   52 (231)
Q Consensus        15 ~~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrd   52 (231)
                      +..|+.||+++...--+++||.+ ++..+..++...|.+
T Consensus        81 ~~~i~~lr~~~g~~~~l~lDaN~~~~~~~a~~~~~~l~~  119 (229)
T cd00308          81 IERVRAVREAFGPDARLAVDANGAWTPKEAIRLIRALEK  119 (229)
T ss_pred             HHHHHHHHHHhCCCCeEEEECCCCCCHHHHHHHHHHhhh
Confidence            57789999999888889999987 888999999999987


No 28 
>cd03325 D-galactonate_dehydratase D-galactonate dehydratase catalyses the dehydration of galactonate to 2-keto-3-deoxygalactnate (KDGal), as part of the D-galactonate nonphosphorolytic catabolic Entner-Doudoroff pathway. D-galactonate dehydratase belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=29.23  E-value=67  Score=28.99  Aligned_cols=39  Identities=21%  Similarity=0.242  Sum_probs=33.4

Q ss_pred             eehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921           15 LTLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL   53 (231)
Q Consensus        15 ~~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL   53 (231)
                      ...|+.||++|...--+++||-+ .+..+..+++..|.++
T Consensus       163 ~~~i~avr~~~g~~~~l~vDaN~~~~~~~A~~~~~~l~~~  202 (352)
T cd03325         163 VERVAALREAVGPDIDIGVDFHGRVSKPMAKDLAKELEPY  202 (352)
T ss_pred             HHHHHHHHHhhCCCCEEEEECCCCCCHHHHHHHHHhcccc
Confidence            34688999999888789999955 9999999999999884


No 29 
>PRK15072 bifunctional D-altronate/D-mannonate dehydratase; Provisional
Probab=28.35  E-value=67  Score=29.77  Aligned_cols=38  Identities=21%  Similarity=0.275  Sum_probs=33.5

Q ss_pred             ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL   53 (231)
                      ..|+.||++|...--+++||-+ .+..+..+++..|.++
T Consensus       194 ~~v~avre~~G~~~~l~vDaN~~w~~~~A~~~~~~l~~~  232 (404)
T PRK15072        194 KLFEAVRNKFGFDLHLLHDVHHRLTPIEAARLGKSLEPY  232 (404)
T ss_pred             HHHHHHHhhhCCCceEEEECCCCCCHHHHHHHHHhcccc
Confidence            4799999999887779999965 9999999999999884


No 30 
>cd05797 Ribosomal_L10 Ribosomal protein L10 family, L10 subfamily; composed of bacterial 50S ribosomal protein and eukaryotic mitochondrial 39S ribosomal protein, L10. L10 occupies the L7/L12 stalk of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-in
Probab=27.08  E-value=79  Score=25.33  Aligned_cols=38  Identities=16%  Similarity=0.441  Sum_probs=33.9

Q ss_pred             ehhHHHHHhhccCc-eEEEcCCCCCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSS-LVKVNCKGMHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~e-lVrIDC~gl~~sDyKKIGaKLrdL   53 (231)
                      .+|+.+++.|+.+. ++-++++|++.++...+-.+||.-
T Consensus         7 ~~v~~l~~~l~~~~~v~v~~~~gl~~~~~~~lR~~lr~~   45 (157)
T cd05797           7 EIVAELKEKLKEAKSVVVADYRGLTVAQLTELRKELREA   45 (157)
T ss_pred             HHHHHHHHHHHhCCEEEEEecCCCcHHHHHHHHHHHHHc
Confidence            36889999999987 678899999999999999999973


No 31 
>smart00344 HTH_ASNC helix_turn_helix ASNC type. AsnC: an autogenously regulated activator of asparagine synthetase A transcription in Escherichia coli
Probab=26.35  E-value=91  Score=22.84  Aligned_cols=54  Identities=17%  Similarity=0.349  Sum_probs=35.4

Q ss_pred             hhhhccCce---EeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEee
Q 026921            5 LHIAAKNGV---YLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSF   61 (231)
Q Consensus         5 l~kL~kNGv---Y~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF   61 (231)
                      |.+|.++|+   |...+..=+-.+...-+|.|.|..  +....+++..|+. .|+|.--+
T Consensus        38 l~~L~~~g~i~~~~~~~~~~~~g~~~~~~v~i~~~~--~~~~~~v~~~l~~-~p~v~~~~   94 (108)
T smart00344       38 VKRLEEEGVIKGYTAVINPKKLGLSVTAFVGVDLES--PDKLEEFLEKLEK-LPEVVEVY   94 (108)
T ss_pred             HHHHHHCCCeeceEEEeCHHHcCCCEEEEEEEEECC--hhHHHHHHHHHhC-CcceEEee
Confidence            457778885   333333222233457888999986  5678999999998 57665433


No 32 
>cd03316 MR_like Mandelate racemase (MR)-like subfamily of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues,  a Lys-X-Lys motif and a conserved histidine-aspartate dyad. Members of the MR subgroup are mandelate racemase, D-glucarate/L-idarate dehydratase (GlucD),  D-altronate/D-mannonate dehydratase , D-galactonate dehydratase (GalD) , D-gluconate dehydratase (GlcD), and L-rhamnonate dehydratase (RhamD).
Probab=25.39  E-value=1.1e+02  Score=27.04  Aligned_cols=39  Identities=26%  Similarity=0.274  Sum_probs=31.7

Q ss_pred             eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcc
Q 026921           15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKEL   53 (231)
Q Consensus        15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdL   53 (231)
                      +..|+.||+++..+--+++|+ ++++.++..++...|+++
T Consensus       176 ~~~v~~ir~~~g~~~~l~vDaN~~~~~~~a~~~~~~l~~~  215 (357)
T cd03316         176 LARVRAVREAVGPDVDLMVDANGRWDLAEAIRLARALEEY  215 (357)
T ss_pred             HHHHHHHHHhhCCCCEEEEECCCCCCHHHHHHHHHHhCcc
Confidence            345788899998777889998 578888888998888875


No 33 
>KOG1990 consensus Poly(A)-specific exoribonuclease PARN [Replication, recombination and repair]
Probab=25.12  E-value=27  Score=34.62  Aligned_cols=76  Identities=18%  Similarity=0.218  Sum_probs=61.1

Q ss_pred             hhhccCceEeehhHHHHHhhccCceEEEcCC-CCCchhHHHhhhhh--hcccCeeEEeecCceEEEEecCCCCCCCCCC
Q 026921            6 HIAAKNGVYLTLVRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKL--KELVPCVLLSFDDEQILMWRGKDWKSMYPEP   81 (231)
Q Consensus         6 ~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKL--rdLVpCvllsF~~eqIlmWRGk~~k~~~~~~   81 (231)
                      .-++.++.-..++..+++.|..++..++-|+ |....-.+-.+.-|  +-+-+-+||+=.+.-++|+||+++-+..|.+
T Consensus       194 ~~~~~~~~~~~~~~~~~~~~~k~~~~k~~~~rg~~~~~~~~~a~~l~~~~~tg~~lv~hN~~~dv~y~~~~Fl~~lp~~  272 (564)
T KOG1990|consen  194 FALGRSRKLQGLAVAMVSFWEKHEFAKILIKRGVLETRKERMADELQELLLTGKVLVLHNKLLDVMYRYKNFLSPLPST  272 (564)
T ss_pred             ceehhccccccchhHHHHHHHHHHHHHHHHHhcchhhhccchHHHHHHHHhcCCeEEeeccceeeeeehhhcccccchh
Confidence            3467788888999999999999999998886 44444444455566  8899999999999999999999998855543


No 34 
>PF07728 AAA_5:  AAA domain (dynein-related subfamily);  InterPro: IPR011704 The ATPases Associated to a variety of cellular Activities (AAA) are a family distinguished by a highly conserved module of 230 amino acids []. The highly conserved nature of this module across taxa suggests that it has a key cellular role. Members of the family are involved in diverse cellular functions including gene expression, peroxisome assembly and vesicle mediated transport. Although the role of this ATPase AAA domain is not, as yet, clear, the AAA+ superfamily of proteins to which the AAA ATPases belong has a chaperone-like function in the assembly, operation or disassembly of proteins []. This ATPase domain includes some proteins not detected by the IPR003959 from INTERPRO model.; GO: 0005524 ATP binding, 0016887 ATPase activity; PDB: 3NBX_X 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 3VKH_A 3VKG_A.
Probab=25.01  E-value=27  Score=26.20  Aligned_cols=24  Identities=42%  Similarity=0.649  Sum_probs=17.4

Q ss_pred             HHhhcccceeccCCCCchhHHHhh
Q 026921          124 AIESTKALVLDEINLGPDDLLKKV  147 (231)
Q Consensus       124 AieSs~Al~Lde~~L~PD~LL~~V  147 (231)
                      |++....++|||+++.+.+++..+
T Consensus        62 a~~~~~il~lDEin~a~~~v~~~L   85 (139)
T PF07728_consen   62 AMRKGGILVLDEINRAPPEVLESL   85 (139)
T ss_dssp             THHEEEEEEESSCGG--HHHHHTT
T ss_pred             cccceeEEEECCcccCCHHHHHHH
Confidence            444889999999999888876654


No 35 
>PRK14017 galactonate dehydratase; Provisional
Probab=25.01  E-value=90  Score=28.50  Aligned_cols=38  Identities=26%  Similarity=0.281  Sum_probs=32.6

Q ss_pred             ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL   53 (231)
                      ..|+.||++|...--+++||.+ .+..+..+++..|+++
T Consensus       165 ~~i~avr~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~~  203 (382)
T PRK14017        165 ARVAAVREAVGPEIGIGVDFHGRVHKPMAKVLAKELEPY  203 (382)
T ss_pred             HHHHHHHHHhCCCCeEEEECCCCCCHHHHHHHHHhhccc
Confidence            4577899999887789999965 8999999999999884


No 36 
>KOG0469 consensus Elongation factor 2 [Translation, ribosomal structure and biogenesis]
Probab=24.37  E-value=92  Score=32.64  Aligned_cols=50  Identities=30%  Similarity=0.383  Sum_probs=45.7

Q ss_pred             HhhccCceEEEcCCCCCchhHHHhhhhhhccc---CeeEEee--cCceEEEEecC
Q 026921           23 NAFEGSSLVKVNCKGMHASDYKKLGAKLKELV---PCVLLSF--DDEQILMWRGK   72 (231)
Q Consensus        23 dAF~~~elVrIDC~gl~~sDyKKIGaKLrdLV---pCvllsF--~~eqIlmWRGk   72 (231)
                      +-|..+++||+.-.--++.|.-|+-+-|++|.   ||++..+  .+|+||-=-|.
T Consensus       481 MKFSVSPVV~VAVe~Knp~DLpKLvEGLkrLakSDP~v~~~~~esGehiiAgaGe  535 (842)
T KOG0469|consen  481 MKFSVSPVVRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGE  535 (842)
T ss_pred             EEeeccceEEEEEecCChhhhHHHHHHHHHHhccCCeEEEEeccCCceEEeccch
Confidence            56999999999999999999999999999986   9999998  68999988874


No 37 
>PF09026 CENP-B_dimeris:  Centromere protein B dimerisation domain;  InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box. CENP-B may organise arrays of centromere satellite DNA into a higher order structure, which then directs centromere formation and kinetochore assembly in mammalian chromosomes. The CENP-B dimerisation domain is composed of two alpha-helices, which are folded into an antiparallel configuration. Dimerisation of CENP-B is mediated by this domain, in which monomers dimerise to form a symmetrical, antiparallel, four-helix bundle structure with a large hydrophobic patch in which 23 residues of one monomer form van der Waals contacts with the other monomer. This CENP-B dimer configuration may be suitable for capturing two distant CENP-B boxes during centromeric heterochromatin formation []. ; GO: 0003677 DNA binding, 0003682 chromatin binding, 0006355 regulation of transcription, DNA-dependent, 0000775 chromosome, centromeric region, 0005634 nucleus; PDB: 1UFI_A.
Probab=24.26  E-value=31  Score=28.25  Aligned_cols=10  Identities=40%  Similarity=0.697  Sum_probs=4.6

Q ss_pred             CCCccHHHHH
Q 026921          216 LGSLPVDHIA  225 (231)
Q Consensus       216 lGsLPvD~i~  225 (231)
                      |+|+|||.=+
T Consensus        60 ltSf~id~~v   69 (101)
T PF09026_consen   60 LTSFPIDDKV   69 (101)
T ss_dssp             HCTS---HHH
T ss_pred             hhccchhHhh
Confidence            6888887643


No 38 
>PTZ00051 thioredoxin; Provisional
Probab=23.49  E-value=2.6e+02  Score=19.42  Aligned_cols=51  Identities=12%  Similarity=0.142  Sum_probs=31.1

Q ss_pred             hHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921           18 VRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK   72 (231)
Q Consensus        18 V~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk   72 (231)
                      +..+...|..--+++|||... +.-.++.|.  + =+|++++.-++..+-.+.|.
T Consensus        40 l~~l~~~~~~~~~~~vd~~~~-~~~~~~~~v--~-~~Pt~~~~~~g~~~~~~~G~   90 (98)
T PTZ00051         40 YEECSKEYTKMVFVKVDVDEL-SEVAEKENI--T-SMPTFKVFKNGSVVDTLLGA   90 (98)
T ss_pred             HHHHHHHcCCcEEEEEECcch-HHHHHHCCC--c-eeeEEEEEeCCeEEEEEeCC
Confidence            344444555677899999742 233444444  2 48997766566666567774


No 39 
>PF09248 DUF1965:  Domain of unknown function (DUF1965);  InterPro: IPR015328 Members of this family of fungal domains adopt a structure that consists of an alpha/beta motif. Their exact function has not, as yet, been determined []. ; PDB: 1N9E_A 1RKY_A 1W7C_A 3PGB_A.
Probab=23.09  E-value=66  Score=24.82  Aligned_cols=24  Identities=21%  Similarity=0.387  Sum_probs=18.7

Q ss_pred             cCceEeehhHHHHHhhccCceEEE
Q 026921           10 KNGVYLTLVRDVRNAFEGSSLVKV   33 (231)
Q Consensus        10 kNGvY~~lV~~VrdAF~~~elVrI   33 (231)
                      =||.|-..+...|+||..-+++|+
T Consensus        32 Yn~~fy~tteeFr~A~~~~~f~k~   55 (74)
T PF09248_consen   32 YNGQFYPTTEEFREAWWSGDFKKL   55 (74)
T ss_dssp             ETTEEESSHHHHHHHHCSTT----
T ss_pred             ECCEEcccHHHHHHHHhCCCceec
Confidence            489999999999999999999886


No 40 
>cd03326 MR_like_1 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 1. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues,  a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=22.86  E-value=1.3e+02  Score=28.09  Aligned_cols=40  Identities=5%  Similarity=0.022  Sum_probs=30.8

Q ss_pred             ehhHHHHHhhccCceEEEcCC-CCCchhHHHhhhhhhcccC
Q 026921           16 TLVRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKLKELVP   55 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKLrdLVp   55 (231)
                      ..|+.||+++...--+++||- +.+.++-.+++..|+++-+
T Consensus       193 ~~v~avRe~~G~~~~l~vDaN~~w~~~~A~~~~~~l~~~~~  233 (385)
T cd03326         193 RRIEAALDVLGDGARLAVDANGRFDLETAIAYAKALAPYGL  233 (385)
T ss_pred             HHHHHHHHhcCCCCeEEEECCCCCCHHHHHHHHHHhhCcCC
Confidence            457778888877777888884 4888888888888888543


No 41 
>cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=22.75  E-value=1.4e+02  Score=25.80  Aligned_cols=38  Identities=13%  Similarity=0.217  Sum_probs=30.5

Q ss_pred             ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL   53 (231)
                      ..|+.||+++...--+++|+-+ ++..+..+++..|.++
T Consensus       115 ~~v~~vr~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~~  153 (263)
T cd03320         115 ARLRALREALPADAKLRLDANGGWSLEEALAFLEALAAG  153 (263)
T ss_pred             HHHHHHHHHcCCCCeEEEeCCCCCCHHHHHHHHHhhccc
Confidence            4578888888877778898855 8888888888888774


No 42 
>PRK10645 divalent-cation tolerance protein CutA; Provisional
Probab=21.58  E-value=81  Score=25.34  Aligned_cols=42  Identities=26%  Similarity=0.462  Sum_probs=33.5

Q ss_pred             eEEEcCCCCCchhHHHhhhhh--hcccCeeEEeecCceEEEEecCC
Q 026921           30 LVKVNCKGMHASDYKKLGAKL--KELVPCVLLSFDDEQILMWRGKD   73 (231)
Q Consensus        30 lVrIDC~gl~~sDyKKIGaKL--rdLVpCvllsF~~eqIlmWRGk~   73 (231)
                      +|-+.|.  +...-++|+..|  +.|+.||=|.-.-+-+-.|.|+-
T Consensus        13 lV~tT~p--~~e~A~~ia~~Lve~rLaACvni~p~i~S~Y~W~G~i   56 (112)
T PRK10645         13 VVLCTAP--DEATAQDLAAKVLAEKLAACVTLLPGATSLYYWEGKL   56 (112)
T ss_pred             EEEEeCC--CHHHHHHHHHHHHHCCeeEEEecCCCeeEEEEECCEE
Confidence            4555554  566789999998  88999998877788999999975


No 43 
>COG1062 AdhC Zn-dependent alcohol dehydrogenases, class III [Energy production and conversion]
Probab=21.55  E-value=2.4e+02  Score=27.59  Aligned_cols=104  Identities=19%  Similarity=0.294  Sum_probs=68.8

Q ss_pred             ehhHHHHHhhc-cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEee--cCceE-----EEEecCCCCCCCCC--CCCCC
Q 026921           16 TLVRDVRNAFE-GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSF--DDEQI-----LMWRGKDWKSMYPE--PPSFS   85 (231)
Q Consensus        16 ~lV~~VrdAF~-~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF--~~eqI-----lmWRGk~~k~~~~~--~~~~~   85 (231)
                      .+|+.+++... +...+ |||.|-..-=---+.+.-| .--||++-+  -+..|     -+-.||.||.++--  .|..+
T Consensus       242 ~vv~~i~~~T~gG~d~~-~e~~G~~~~~~~al~~~~~-~G~~v~iGv~~~~~~i~~~~~~lv~gr~~~Gs~~G~~~p~~d  319 (366)
T COG1062         242 DVVEAIVELTDGGADYA-FECVGNVEVMRQALEATHR-GGTSVIIGVAGAGQEISTRPFQLVTGRVWKGSAFGGARPRSD  319 (366)
T ss_pred             hHHHHHHHhcCCCCCEE-EEccCCHHHHHHHHHHHhc-CCeEEEEecCCCCceeecChHHeeccceEEEEeecCCccccc
Confidence            57777787777 44444 8888866532233444444 566777776  22222     24456777766633  13333


Q ss_pred             CcccccccCCCCCCCCCCCCCCccccccchhHHHHHHHHHhhcccceeccCCCCchhHHHhhhhhhhhhhhhhccc
Q 026921           86 NPVDLDIAGDADGSGTPSDDPSQGTIRSSPKMISLWKRAIESTKALVLDEINLGPDDLLKKVEEFEGISQAAEHSY  161 (231)
Q Consensus        86 ip~~~~~~~~~~~s~~s~~e~~~~~~~~sPkM~~LWk~AieSs~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~  161 (231)
                      |                            ||+..||+.            =.|..|.|++..-.++.|.||.++..
T Consensus       320 i----------------------------P~lv~~y~~------------Gkl~~d~lvt~~~~Le~INeaf~~m~  355 (366)
T COG1062         320 I----------------------------PRLVDLYMA------------GKLPLDRLVTHTIPLEDINEAFDLMH  355 (366)
T ss_pred             h----------------------------hHHHHHHHc------------CCCchhHHhhccccHHHHHHHHHHHh
Confidence            3                            999999974            24888999999999999999998754


No 44 
>PF03091 CutA1:  CutA1 divalent ion tolerance protein;  InterPro: IPR004323  The CutA family of proteins which exhibit ion tolerance are found in a large variety of species []. In E.Coli, two operons on the cutA locus contain genes that encode three proteins, CutA1, CutA2 and CutA3. CutA1 proteins are found in the cytoplasm while CutA2 (50kDa) and CutA3 (24kDa) are located in the inner membrane. Although the role of E. Coli CutA1 is not clear, studies on E. coli cutA locus describe some mutations that lead to an increase in copper sensitivity, thus suggesting a role in ion tolerance []. To date, the structure of CutA proteins from several species have been solved [, ]. The crystal structures of the E.Coli and rat CutA1 proteins show both these proteins to be trimeric in the crystal as well as in solution[].Trimerisation seems to supported by the formation of beta sheets between the subunit. This trimeric structure suggests the protein may be involved in signal transduction due to architectural similarities with PII signal transducer proteins []. Recent studies propose that mammalian CutA1 in the neuronal cell membrane acts as an anchor for acetylcholinesterase (AChE)1 [].; GO: 0010038 response to metal ion; PDB: 1O5J_A 1KR4_A 1VHF_A 3GSD_H 1OSC_A 3OPK_A 4E98_C 1V9B_F 1UKU_A 2E66_C ....
Probab=21.49  E-value=97  Score=24.30  Aligned_cols=34  Identities=24%  Similarity=0.473  Sum_probs=26.4

Q ss_pred             CchhHHHhhhhh--hcccCeeEEeecCceEEEEecC
Q 026921           39 HASDYKKLGAKL--KELVPCVLLSFDDEQILMWRGK   72 (231)
Q Consensus        39 ~~sDyKKIGaKL--rdLVpCvllsF~~eqIlmWRGk   72 (231)
                      +...-++|+..|  +.|+.|+=+.=.=+-+.+|.|+
T Consensus        10 ~~e~A~~ia~~lve~rLaACvni~~~i~S~Y~W~g~   45 (102)
T PF03091_consen   10 DREEAERIARALVEERLAACVNIIPPITSIYRWEGK   45 (102)
T ss_dssp             SHHHHHHHHHHHHHTTS-SEEEEEEEEEEEEEETTE
T ss_pred             CHHHHHHHHHHHHhCCeEEEEEecCCceeEEEECCc
Confidence            346778888887  7899999776577788899997


No 45 
>cd03327 MR_like_2 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 2. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues,  a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=21.34  E-value=1.1e+02  Score=27.47  Aligned_cols=38  Identities=21%  Similarity=0.287  Sum_probs=31.2

Q ss_pred             ehhHHHHHhhccCceEEEcCC-CCCchhHHHhhhhhhcc
Q 026921           16 TLVRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKLKEL   53 (231)
Q Consensus        16 ~lV~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKLrdL   53 (231)
                      ..|+.||+++...--+++||- +.+..+..+++..|.++
T Consensus       159 ~~v~avr~~~g~~~~l~vDan~~~~~~~A~~~~~~l~~~  197 (341)
T cd03327         159 ELVRAIREAVGYDVDLMLDCYMSWNLNYAIKMARALEKY  197 (341)
T ss_pred             HHHHHHHHHhCCCCcEEEECCCCCCHHHHHHHHHHhhhc
Confidence            357889999987777899984 48889999999998884


No 46 
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4)  to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=21.22  E-value=67  Score=24.82  Aligned_cols=27  Identities=37%  Similarity=0.698  Sum_probs=19.9

Q ss_pred             hhHHHHHHHHHhhc--ccceecc-CCCCch
Q 026921          115 PKMISLWKRAIEST--KALVLDE-INLGPD  141 (231)
Q Consensus       115 PkM~~LWk~AieSs--~Al~Lde-~~L~PD  141 (231)
                      =.-+++|+++++++  -||+|+| +.+.|+
T Consensus        69 lSH~~~w~~~~~~~~~~alIlEDDv~~~~~   98 (128)
T cd06532          69 LSHYKLWQKIVESNLEYALILEDDAILDPD   98 (128)
T ss_pred             HHHHHHHHHHHHcCCCeEEEEccCcEECCC
Confidence            45689999999988  4666654 467666


No 47 
>PRK05105 O-succinylbenzoate synthase; Provisional
Probab=20.99  E-value=1.2e+02  Score=27.40  Aligned_cols=37  Identities=11%  Similarity=0.140  Sum_probs=29.4

Q ss_pred             eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhc
Q 026921           15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKE   52 (231)
Q Consensus        15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrd   52 (231)
                      +..|+.||+++ .+--+|+|| ++.+..+..+.+..|.+
T Consensus       145 ~~~i~~vr~~~-~~~~l~vDaN~~w~~~~A~~~~~~l~~  182 (322)
T PRK05105        145 GMLVNLLLEAI-PDLKLRLDANRGWTLEKAQQFAKYVPP  182 (322)
T ss_pred             HHHHHHHHHhC-CCCeEEEECCCCCCHHHHHHHHHHhhh
Confidence            35577889988 677889999 55888888888888876


No 48 
>PF12153 CAP18_C:  LPS binding domain of CAP18 (C terminal);  InterPro: IPR022746 This entry represents the cathlecidin antimicrobial C-terminal peptides. The C terminus is cleaved from the cathlecidin precursor, and is approximately 30 amino acids in length with a helical structure. This entry is found in association with PF00666 from PFAM. The C terminus peptides possess antimicrobial activity by virtue of their binding to bacterial lipopolysaccharides [][].; GO: 0042742 defense response to bacterium; PDB: 1LYP_A 2LMF_A 2FBU_H 2FBS_N 2K6O_A 2FCG_F.
Probab=20.77  E-value=35  Score=22.35  Aligned_cols=14  Identities=36%  Similarity=0.722  Sum_probs=10.6

Q ss_pred             hhHHHhhhhhhccc
Q 026921           41 SDYKKLGAKLKELV   54 (231)
Q Consensus        41 sDyKKIGaKLrdLV   54 (231)
                      ...||||.|.+|+.
T Consensus        13 eklkkIGQkIKDff   26 (28)
T PF12153_consen   13 EKLKKIGQKIKDFF   26 (28)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHh
Confidence            35788888888864


No 49 
>PF13401 AAA_22:  AAA domain; PDB: 2QBY_B 1FNN_B 1W5T_A 1W5S_B.
Probab=20.77  E-value=1.1e+02  Score=22.29  Aligned_cols=25  Identities=32%  Similarity=0.444  Sum_probs=19.6

Q ss_pred             chhHHHHHHHHHhhcc--cceeccCCC
Q 026921          114 SPKMISLWKRAIESTK--ALVLDEINL  138 (231)
Q Consensus       114 sPkM~~LWk~AieSs~--Al~Lde~~L  138 (231)
                      ...+++++.++++...  .|++||++.
T Consensus        72 ~~~l~~~~~~~l~~~~~~~lviDe~~~   98 (131)
T PF13401_consen   72 SDELRSLLIDALDRRRVVLLVIDEADH   98 (131)
T ss_dssp             HHHHHHHHHHHHHHCTEEEEEEETTHH
T ss_pred             HHHHHHHHHHHHHhcCCeEEEEeChHh
Confidence            4667788888888776  599999954


No 50 
>PF07039 DUF1325:  SGF29 tudor-like domain;  InterPro: IPR010750  SAGA-associated factor 29 is involved in transcriptional regulation, probably through association with histone acetyltransferase (HAT) complexes like the TFTC-HAT or STAGA complexes. It also may be involved in MYC-mediated oncogenic transformation. It is a component of the ATAC complex, which is a complex with histone acetyltransferase activity on histones H3 and H4 [].   This entry represents a domain found in yeast and human SAGA-associated factor 29 proteins that is related to the tudor domain. ; PDB: 3MP6_A 3MP1_A 3MP8_A 3MET_B 3ME9_A 3MEU_B 3MEA_A 3MEV_B 3LX7_A 3MEW_A.
Probab=20.74  E-value=77  Score=25.81  Aligned_cols=27  Identities=30%  Similarity=0.423  Sum_probs=18.3

Q ss_pred             hhcccCeEEeecCCCCCCcc-ccccCCC
Q 026921          157 AEHSYPALVLSREDGASSSM-AEYEDGS  183 (231)
Q Consensus       157 ~EHS~pAl~~s~~~~~~~~~-ae~~dd~  183 (231)
                      |---|||.|++......+.+ ..|+||.
T Consensus        85 TT~FY~A~V~~~p~~~~~~y~l~Fedd~  112 (130)
T PF07039_consen   85 TTCFYPATVVSPPKKKSGEYKLKFEDDE  112 (130)
T ss_dssp             SSEEEEEEEEEE-SSTTS-EEEEECTTT
T ss_pred             CceEEEEEEEeCCCCCCCcEEEEEeCCC
Confidence            45569999999976555555 7777743


No 51 
>PRK06512 thiamine-phosphate pyrophosphorylase; Provisional
Probab=20.72  E-value=1.5e+02  Score=25.81  Aligned_cols=39  Identities=23%  Similarity=0.413  Sum_probs=34.9

Q ss_pred             ehhHHHHHhhccC--ceEEEcCCCCCchhHHHhhhhhhccc
Q 026921           16 TLVRDVRNAFEGS--SLVKVNCKGMHASDYKKLGAKLKELV   54 (231)
Q Consensus        16 ~lV~~VrdAF~~~--elVrIDC~gl~~sDyKKIGaKLrdLV   54 (231)
                      .+...++.|...-  ..|.+-.++++..++.++..+|++++
T Consensus        27 ~~~~~l~~al~~G~v~~vQlR~K~l~~~~~~~~a~~l~~l~   67 (221)
T PRK06512         27 ELAKLLRAALQGGDVASVILPQYGLDEATFQKQAEKLVPVI   67 (221)
T ss_pred             cHHHHHHHHHcCCCccEEEEeCCCCCHHHHHHHHHHHHHHH
Confidence            5677888998887  59999999999999999999999985


No 52 
>cd02998 PDI_a_ERp38 PDIa family, endoplasmic reticulum protein 38 (ERp38) subfamily; composed of proteins similar to the P5-like protein first isolated from alfalfa, which contains two redox active TRX (a) domains at the N-terminus, like human P5, and a C-terminal domain with homology to the C-terminal domain of ERp29, unlike human P5. The cDNA clone of this protein (named G1) was isolated from an alfalfa cDNA library by screening with human protein disulfide isomerase (PDI) cDNA. The G1 protein is constitutively expressed in all major organs of the plant and its expression is induced by treatment with tunicamycin, indicating that it may be a glucose-regulated protein. The G1 homolog in the eukaryotic social amoeba Dictyostelium discoideum is also described as a P5-like protein, which is located in the endoplasmic reticulum (ER) despite the absence of an ER-retrieval signal. G1 homologs from Aspergillus niger and Neurospora crassa have also been characterized, and are named TIGA and ER
Probab=20.07  E-value=2.4e+02  Score=19.40  Aligned_cols=52  Identities=15%  Similarity=0.204  Sum_probs=31.3

Q ss_pred             hhHHHHHhhc---cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecC-ceEEEEec
Q 026921           17 LVRDVRNAFE---GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDD-EQILMWRG   71 (231)
Q Consensus        17 lV~~VrdAF~---~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~-eqIlmWRG   71 (231)
                      .+..+.+.+.   .-.+++|||..-.+.=+++.|..   -+|++++--.+ .+...+.|
T Consensus        39 ~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~~~~i~---~~P~~~~~~~~~~~~~~~~g   94 (105)
T cd02998          39 EYEKLAAVFANEDDVVIAKVDADEANKDLAKKYGVS---GFPTLKFFPKGSTEPVKYEG   94 (105)
T ss_pred             HHHHHHHHhCCCCCEEEEEEECCCcchhhHHhCCCC---CcCEEEEEeCCCCCccccCC
Confidence            3455555665   24678999987444556666642   37887765444 34555555


Done!