Query 026921
Match_columns 231
No_of_seqs 43 out of 45
Neff 2.4
Searched_HMMs 46136
Date Fri Mar 29 02:27:51 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/026921.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/026921hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01985 CRS1_YhbY: CRS1 / Yhb 99.1 3.7E-11 8.1E-16 89.4 3.5 69 2-70 16-84 (84)
2 TIGR00253 RNA_bind_YhbY putati 97.3 0.00045 9.8E-09 53.9 5.1 70 3-72 17-86 (95)
3 COG1534 Predicted RNA-binding 97.0 0.00098 2.1E-08 53.0 4.4 75 3-78 18-92 (97)
4 PRK10343 RNA-binding protein Y 96.2 0.006 1.3E-07 48.0 4.0 70 3-72 19-88 (97)
5 KOG1990 Poly(A)-specific exori 95.9 0.0036 7.7E-08 61.0 1.4 76 7-82 403-483 (564)
6 PF04591 DUF596: Protein of un 64.8 4.4 9.6E-05 31.0 1.7 24 6-29 6-34 (70)
7 TIGR01444 fkbM_fam methyltrans 61.0 9.2 0.0002 28.7 2.8 35 27-61 108-142 (143)
8 PF12747 DdrB: DdrB-like prote 58.2 2.6 5.7E-05 35.3 -0.6 50 41-97 56-105 (123)
9 PF01188 MR_MLE: Mandelate rac 52.9 21 0.00045 25.0 3.3 35 18-52 2-37 (67)
10 PF13304 AAA_21: AAA domain; P 48.3 11 0.00025 27.9 1.5 39 117-155 244-287 (303)
11 PF01755 Glyco_transf_25: Glyc 46.6 21 0.00044 28.6 2.8 41 108-149 65-108 (200)
12 PF01695 IstB_IS21: IstB-like 45.4 29 0.00064 28.6 3.6 47 122-169 103-149 (178)
13 PLN00410 U5 snRNP protein, DIM 40.6 72 0.0016 26.5 5.2 51 18-73 45-97 (142)
14 cd03319 L-Ala-DL-Glu_epimerase 36.9 41 0.0009 29.5 3.4 37 16-53 166-203 (316)
15 PRK11169 leucine-responsive tr 36.3 39 0.00084 27.4 3.0 68 5-73 49-123 (164)
16 cd03005 PDI_a_ERp46 PDIa famil 34.7 1.3E+02 0.0028 20.9 5.1 50 19-72 39-92 (102)
17 COG1484 DnaC DNA replication p 34.1 40 0.00086 29.6 2.9 60 110-170 137-209 (254)
18 PRK13695 putative NTPase; Prov 33.1 58 0.0013 25.8 3.5 51 117-170 86-136 (174)
19 PRK02714 O-succinylbenzoate sy 32.9 61 0.0013 29.1 3.9 42 15-56 150-192 (320)
20 cd03318 MLE Muconate Lactonizi 32.4 55 0.0012 29.4 3.6 40 14-53 174-214 (365)
21 cd03315 MLE_like Muconate lact 32.4 72 0.0016 27.3 4.1 39 16-54 117-156 (265)
22 PF11830 DUF3350: Domain of un 32.1 52 0.0011 24.5 2.8 24 108-131 22-45 (56)
23 PF00626 Gelsolin: Gelsolin re 31.7 55 0.0012 22.2 2.7 17 56-73 20-36 (76)
24 PF14795 Leucyl-specific: Leuc 30.7 21 0.00045 26.6 0.5 25 38-70 31-55 (56)
25 TIGR01927 menC_gamma/gm+ o-suc 30.6 78 0.0017 28.3 4.2 38 15-52 141-179 (307)
26 cd03329 MR_like_4 Mandelate ra 30.0 65 0.0014 29.2 3.6 38 16-53 177-215 (368)
27 cd00308 enolase_like Enolase-s 29.4 75 0.0016 26.6 3.6 38 15-52 81-119 (229)
28 cd03325 D-galactonate_dehydrat 29.2 67 0.0015 29.0 3.6 39 15-53 163-202 (352)
29 PRK15072 bifunctional D-altron 28.4 67 0.0015 29.8 3.5 38 16-53 194-232 (404)
30 cd05797 Ribosomal_L10 Ribosoma 27.1 79 0.0017 25.3 3.3 38 16-53 7-45 (157)
31 smart00344 HTH_ASNC helix_turn 26.4 91 0.002 22.8 3.3 54 5-61 38-94 (108)
32 cd03316 MR_like Mandelate race 25.4 1.1E+02 0.0024 27.0 4.2 39 15-53 176-215 (357)
33 KOG1990 Poly(A)-specific exori 25.1 27 0.00059 34.6 0.4 76 6-81 194-272 (564)
34 PF07728 AAA_5: AAA domain (dy 25.0 27 0.00059 26.2 0.3 24 124-147 62-85 (139)
35 PRK14017 galactonate dehydrata 25.0 90 0.002 28.5 3.7 38 16-53 165-203 (382)
36 KOG0469 Elongation factor 2 [T 24.4 92 0.002 32.6 3.9 50 23-72 481-535 (842)
37 PF09026 CENP-B_dimeris: Centr 24.3 31 0.00067 28.2 0.5 10 216-225 60-69 (101)
38 PTZ00051 thioredoxin; Provisio 23.5 2.6E+02 0.0057 19.4 5.2 51 18-72 40-90 (98)
39 PF09248 DUF1965: Domain of un 23.1 66 0.0014 24.8 2.0 24 10-33 32-55 (74)
40 cd03326 MR_like_1 Mandelate ra 22.9 1.3E+02 0.0028 28.1 4.3 40 16-55 193-233 (385)
41 cd03320 OSBS o-Succinylbenzoat 22.8 1.4E+02 0.0029 25.8 4.1 38 16-53 115-153 (263)
42 PRK10645 divalent-cation toler 21.6 81 0.0018 25.3 2.4 42 30-73 13-56 (112)
43 COG1062 AdhC Zn-dependent alco 21.6 2.4E+02 0.0052 27.6 5.8 104 16-161 242-355 (366)
44 PF03091 CutA1: CutA1 divalent 21.5 97 0.0021 24.3 2.7 34 39-72 10-45 (102)
45 cd03327 MR_like_2 Mandelate ra 21.3 1.1E+02 0.0024 27.5 3.5 38 16-53 159-197 (341)
46 cd06532 Glyco_transf_25 Glycos 21.2 67 0.0014 24.8 1.8 27 115-141 69-98 (128)
47 PRK05105 O-succinylbenzoate sy 21.0 1.2E+02 0.0025 27.4 3.5 37 15-52 145-182 (322)
48 PF12153 CAP18_C: LPS binding 20.8 35 0.00075 22.4 0.1 14 41-54 13-26 (28)
49 PF13401 AAA_22: AAA domain; P 20.8 1.1E+02 0.0023 22.3 2.7 25 114-138 72-98 (131)
50 PF07039 DUF1325: SGF29 tudor- 20.7 77 0.0017 25.8 2.1 27 157-183 85-112 (130)
51 PRK06512 thiamine-phosphate py 20.7 1.5E+02 0.0031 25.8 3.9 39 16-54 27-67 (221)
52 cd02998 PDI_a_ERp38 PDIa famil 20.1 2.4E+02 0.0052 19.4 4.3 52 17-71 39-94 (105)
No 1
>PF01985 CRS1_YhbY: CRS1 / YhbY (CRM) domain; InterPro: IPR001890 The CRM domain is an ~100-amino acid RNA-binding domain. The name chloroplast RNA splicing and ribosome maturation (CRM) has been suggested to reflect the functions established for the four characterised members of the family: Zea mays (Maize) CRS1 (Q9FYT6 from SWISSPROT), CAF1 (Q84N49 from SWISSPROT) and CAF2 (Q84N48 from SWISSPROT) proteins and the Escherichia coli protein YhbY (P0AGK4 from SWISSPROT). The CRM domain is found in eubacteria, archaea, and plants. The CRM domain is represented as a stand-alone protein in archaea and bacteria, and in single- and multi-domain proteins in plants. It has been suggested that prokaryotic CRM proteins existed as ribosome-associated proteins prior to the divergence of archaea and bacteria, and that they were co-opted in the plant lineage as RNA binding modules by incorporation into diverse protein contexts. Plant CRM domains are predicted to reside not only in the chloroplast, but also in the mitochondrion and the nucleo/cytoplasmic compartment. The diversity of the CRM domain family in plants suggests a diverse set of RNA targets [, ]. The CRM domain is a compact alpha/beta domain consisting of a four-stranded beta sheet and three alpha helices with an alpha-beta-alpha-beta-alpha-beta-beta topology. The beta sheet face is basic, consistent with a role in RNA binding. Proximal to the basic beta sheet face is another moiety that could contribute to nucleic acid recognition. Connecting strand beta1 and helix alpha2 is a loop with a six amino acid motif, GxxG flanked by large aliphatic residues, within which one 'x' is typically a basic residue []. Escherichia coli YhbY is associated with pre-50S ribosomal subunits, which implies a function in ribosome assembly. GFP fused to a single-domain CRM protein from maize localises to the nucleolus, suggesting that an analogous activity may have been retained in plants []. A CRM domain containing protein in plant chloroplasts has been shown to function in group I and II intron splicing []. In vitro experiments with an isolated maize CRM domain have shown it to have RNA binding activity. These and other results suggest that the CRM domain evolved in the context of ribosome function prior to the divergence of Archaea and Bacteria, that this function has been maintained in extant prokaryotes, and that the domain was recruited to serve as an RNA binding module during the evolution of plant genomes []. YhbY has a fold similar to that of the C-terminal domain of translation initiation factor 3 (IF3C), which binds to 16S rRNA in the 30S ribosome [].; GO: 0003723 RNA binding; PDB: 1RQ8_A 1JO0_B 1LN4_A.
Probab=99.13 E-value=3.7e-11 Score=89.36 Aligned_cols=69 Identities=25% Similarity=0.341 Sum_probs=61.4
Q ss_pred chhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEe
Q 026921 2 LFFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWR 70 (231)
Q Consensus 2 l~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWR 70 (231)
|=|+..++|||++-++++.+++||+.+|||||.|.+-.+.|.+.+...|.+..+|.++...+.++++||
T Consensus 16 l~p~v~IGk~Glt~~vi~~i~~~l~~~eLvKVk~~~~~~~~~~~~~~~l~~~t~~~~V~~iG~~~vlyR 84 (84)
T PF01985_consen 16 LKPVVQIGKNGLTDGVIEEIDDALEKHELVKVKVLGNCREDRKEIAEQLAEKTGAEVVQVIGRTIVLYR 84 (84)
T ss_dssp C--SEEE-TTSS-HHHHHHHHHHHHHHSEEEEEETT--HHHHHHHHHHHHHHHTEEEEEEETTEEEEEE
T ss_pred CCCeEEECCCCCCHHHHHHHHHHHHhCCeeEEEEccCCHHHHHHHHHHHHHHhCCEEEEEECCEEEEEC
Confidence 347788999999999999999999999999999999999999999999999999999999999999998
No 2
>TIGR00253 RNA_bind_YhbY putative RNA-binding protein, YhbY family. A combination of crystal structure, molecular modeling, and bioinformatic data together suggest that members of this family, including YhbY of E. coli, are RNA binding proteins.
Probab=97.30 E-value=0.00045 Score=53.87 Aligned_cols=70 Identities=17% Similarity=0.256 Sum_probs=65.1
Q ss_pred hhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921 3 FFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK 72 (231)
Q Consensus 3 ~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk 72 (231)
=|+...||||+--+++..+.+|++.+|||||---.-.++|.+.+...|-.-..|.++..-+..+++||..
T Consensus 17 ~p~v~IGK~Glt~~vi~ei~~aL~~hELIKVkvl~~~~~~~~e~a~~i~~~~~a~~Vq~iG~~~vlYR~~ 86 (95)
T TIGR00253 17 KPVVLVGKNGLTEGVIKEIEQALEHRELIKVKVATEDREDKTLIAEALVKETGACNVQVIGKTIVLYRPT 86 (95)
T ss_pred CCeEEECCCCCCHHHHHHHHHHHHhCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEEEEccEEEEEecC
Confidence 4677889999999999999999999999999877777889999999999999999999999999999985
No 3
>COG1534 Predicted RNA-binding protein containing KH domain, possibly ribosomal protein [Translation, ribosomal structure and biogenesis]
Probab=97.02 E-value=0.00098 Score=52.97 Aligned_cols=75 Identities=19% Similarity=0.259 Sum_probs=67.8
Q ss_pred hhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecCCCCCCC
Q 026921 3 FFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGKDWKSMY 78 (231)
Q Consensus 3 ~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk~~k~~~ 78 (231)
-|+...+|||+=-++++.+.+|.+.+||||+.--.=...|-+.|-.+|-+-+.|-++..-+..+++|| .+|....
T Consensus 18 ~piv~IGk~Glte~vi~Ei~~aL~~reLIKVkvl~~~~edr~eia~~l~~~~~a~lVqviG~~~vlyr-~~~e~~~ 92 (97)
T COG1534 18 KPIVQIGKNGLTEGVIKEIDRALEARELIKVKVLQNAREDKKEIAEALAEETGAELVQVIGKTLVLYR-ESKEKRK 92 (97)
T ss_pred CceEEecCCccCHHHHHHHHHHHHhCCcEEEEeeccchhhHHHHHHHHHHHhCCEEeeeeeeEEEEEe-cCccccc
Confidence 46778899999999999999999999999998777777899999999999999999999999999999 6665533
No 4
>PRK10343 RNA-binding protein YhbY; Provisional
Probab=96.24 E-value=0.006 Score=48.01 Aligned_cols=70 Identities=11% Similarity=0.153 Sum_probs=65.2
Q ss_pred hhhhhhccCceEeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921 3 FFLHIAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK 72 (231)
Q Consensus 3 ~~l~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk 72 (231)
=|+...||||+=-+++..+.+|++.+|||||--..-.+.|-+.+...|-+-..|-++.-=+..+++||=.
T Consensus 19 ~Pvv~IGk~Glt~~vi~ei~~aL~~hELIKvkv~~~~~~~~~e~~~~i~~~~~ae~Vq~IG~~~vlYR~~ 88 (97)
T PRK10343 19 KPVVLLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETKTLIVEAIVRETGACNVQVIGKTLVLYRPT 88 (97)
T ss_pred CCeEEECCCCCCHHHHHHHHHHHHHCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEeeeCcEEEEEecC
Confidence 4677889999999999999999999999999988888889999999999999999999999999999965
No 5
>KOG1990 consensus Poly(A)-specific exoribonuclease PARN [Replication, recombination and repair]
Probab=95.86 E-value=0.0036 Score=60.96 Aligned_cols=76 Identities=25% Similarity=0.465 Sum_probs=68.1
Q ss_pred hhccCceEeehhHHHHHhhccCceEEEcCCCCCc-hhHHHhhhhhhcccCeeEEeec-C---ceEEEEecCCCCCCCCCC
Q 026921 7 IAAKNGVYLTLVRDVRNAFEGSSLVKVNCKGMHA-SDYKKLGAKLKELVPCVLLSFD-D---EQILMWRGKDWKSMYPEP 81 (231)
Q Consensus 7 kL~kNGvY~~lV~~VrdAF~~~elVrIDC~gl~~-sDyKKIGaKLrdLVpCvllsF~-~---eqIlmWRGk~~k~~~~~~ 81 (231)
-+++.|+|..++.+++-+|+..|+|++.|...+. +.-+..+..+..+...++++.+ + .+|+.+||++|..+..-.
T Consensus 403 ~~~rrg~f~g~i~n~~l~wk~~e~~k~i~~~~~~~~~~~~~a~~le~esg~~~v~~~~~~~~~ai~~yr~k~y~~p~~l~ 482 (564)
T KOG1990|consen 403 LSGRRGVFDGVIENMHLHWKSRELVKVICKEKNLPSQVKQYASALERESGGILVSIDKNPKGYAIIAYRGKNYDRPTSLR 482 (564)
T ss_pred ccCCcccccceeecchhhhhhcccceeeeccccccHHHHHHHHHHHHHhCCceeeeccCCchhhHHHhhhhhccCCcccC
Confidence 3678999999999999999999999999999999 9999999999999999999993 3 349999999999887533
Q ss_pred C
Q 026921 82 P 82 (231)
Q Consensus 82 ~ 82 (231)
|
T Consensus 483 P 483 (564)
T KOG1990|consen 483 P 483 (564)
T ss_pred c
Confidence 3
No 6
>PF04591 DUF596: Protein of unknown function, DUF596; InterPro: IPR007670 This family contains several uncharacterised proteins.; PDB: 2O5H_B.
Probab=64.83 E-value=4.4 Score=30.97 Aligned_cols=24 Identities=29% Similarity=0.418 Sum_probs=18.6
Q ss_pred hhhccCceEee-----hhHHHHHhhccCc
Q 026921 6 HIAAKNGVYLT-----LVRDVRNAFEGSS 29 (231)
Q Consensus 6 ~kL~kNGvY~~-----lV~~VrdAF~~~e 29 (231)
.||+|||.|+. +|++.|.+|-.++
T Consensus 6 lKLa~~g~fl~Gt~eEqve~fr~~fP~sd 34 (70)
T PF04591_consen 6 LKLARKGEFLEGTTEEQVEMFRKSFPASD 34 (70)
T ss_dssp EEEEETTEEE---HHHHHHHHHHH--SSS
T ss_pred EEeccCCEEecCCHHHHHHHHHHHCCCCh
Confidence 59999999985 7999999998754
No 7
>TIGR01444 fkbM_fam methyltransferase, FkbM family. Members of this family are characterized by two well-conserved short regions separated by a variable in both sequence and length. The first of the two regions is found in a large number of proteins outside this subfamily, a number of which have been characterized as methyltransferases. One member of the present family, FkbM, was shown to be required for a specific methylation in the biosynthesis of the immunosuppressant FK506 in Streptomyces strain MA6548.
Probab=61.04 E-value=9.2 Score=28.70 Aligned_cols=35 Identities=17% Similarity=0.275 Sum_probs=30.2
Q ss_pred cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEee
Q 026921 27 GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSF 61 (231)
Q Consensus 27 ~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF 61 (231)
.-.++||||+|..-.=.+.+...|+.-+|.|++-+
T Consensus 108 ~i~~lKiDiEG~E~~vL~g~~~~l~~~~~~i~~E~ 142 (143)
T TIGR01444 108 KVDLLKIDVEGAELEVLRGAKETLLRKRPGIVLEV 142 (143)
T ss_pred CCCEEEEeCCCchHHHHhChHHHHHHhCCeEEEEe
Confidence 45799999999999999999888888899888754
No 8
>PF12747 DdrB: DdrB-like protein; InterPro: IPR024305 This family includes the Deinococcus DdrB protein, a ssDNA binding protein induced by ionizing radiation which has a role in genome reconstitution []. The structure of DdrB is known []. This family also includes some possibly distantly related cyanobacterial proteins.; PDB: 3KDV_D.
Probab=58.24 E-value=2.6 Score=35.28 Aligned_cols=50 Identities=20% Similarity=0.353 Sum_probs=31.4
Q ss_pred hhHHHhhhhhhcccCeeEEeecCceEEEEecCCCCCCCCCCCCCCCcccccccCCCC
Q 026921 41 SDYKKLGAKLKELVPCVLLSFDDEQILMWRGKDWKSMYPEPPSFSNPVDLDIAGDAD 97 (231)
Q Consensus 41 sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk~~k~~~~~~~~~~ip~~~~~~~~~~ 97 (231)
=|...|||+..- -+=++++.++|||+.||++-..+-. -.|...+-.-+.+
T Consensus 56 FDW~lIGArV~~------t~~eG~~~V~~~G~~ykRR~l~~~d-klpaaIkySrgak 105 (123)
T PF12747_consen 56 FDWALIGARVIS------TDGEGEEVVQYRGHAYKRRELEAVD-KLPAAIKYSRGAK 105 (123)
T ss_dssp B-GGGGTEE-EE------EESSSEEEEEETTEEEEEEEETTTE-E---SSS-SSSSE
T ss_pred cChhhccceecc------cCCCceEEEEEcCeeeeeecchhhc-cccceeEEccccc
Confidence 467889998543 4558999999999999999866433 3444444444433
No 9
>PF01188 MR_MLE: Mandelate racemase / muconate lactonizing enzyme, C-terminal domain; InterPro: IPR013342 Mandelate racemase 5.1.2.2 from EC (MR) and muconate lactonising enzyme 5.5.1.1 from EC (MLE) are two bacterial enzymes involved in aromatic acid catabolism. They catalyse mechanistically distinct reactions yet they are related at the level of their primary, quaternary (homooctamer) and tertiary structures [, ]. A number of other proteins also seem to be evolutionary related to these two enzymes. These include, various plasmid-encoded chloromuconate cycloisomerases 5.5.1.7 from EC, Escherichia coli protein rspA [], E. coli bifunctional DGOA protein, E. coli hypothetical proteins ycjG, yfaW and yidU and a hypothetical protein from Streptomyces ambofaciens []. This entry represents the C-terminal region of these proteins.; PDB: 3QLD_B 3CYJ_C 2QDD_B 3FVD_A 3H7V_A 2OZT_A 3NXL_A 1JDF_D 1JCT_B 1EC9_C ....
Probab=52.88 E-value=21 Score=24.95 Aligned_cols=35 Identities=20% Similarity=0.281 Sum_probs=30.0
Q ss_pred hHHHHHhhccCceEEEcCC-CCCchhHHHhhhhhhc
Q 026921 18 VRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKLKE 52 (231)
Q Consensus 18 V~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKLrd 52 (231)
|+.||+++...--+++||- +++..+..+++..|++
T Consensus 2 i~avr~~~g~~~~l~vDan~~~~~~~a~~~~~~l~~ 37 (67)
T PF01188_consen 2 IRAVREAVGPDIDLMVDANQAWTLEEAIRLARALED 37 (67)
T ss_dssp HHHHHHHHSTTSEEEEE-TTBBSHHHHHHHHHHHGG
T ss_pred HHHHHHhhCCCCeEEEECCCCCCHHHHHHHHHHcCh
Confidence 6889999988888999995 5788899999999998
No 10
>PF13304 AAA_21: AAA domain; PDB: 3QKS_B 1US8_B 1F2U_B 1F2T_B 3QKT_A 1II8_B 3QKR_B 3QKU_A.
Probab=48.26 E-value=11 Score=27.87 Aligned_cols=39 Identities=26% Similarity=0.364 Sum_probs=27.2
Q ss_pred HHHHHHHHHhhc---ccceeccC--CCCchhHHHhhhhhhhhhh
Q 026921 117 MISLWKRAIEST---KALVLDEI--NLGPDDLLKKVEEFEGISQ 155 (231)
Q Consensus 117 M~~LWk~AieSs---~Al~Lde~--~L~PD~LL~~VEef~~~Sq 155 (231)
++.|...++... ..+++||. +|-|...-+-++-|..+++
T Consensus 244 ~l~l~~~l~~~~~~~~illiDEpE~~LHp~~q~~l~~~l~~~~~ 287 (303)
T PF13304_consen 244 LLSLLSLLLSAKKNGSILLIDEPENHLHPSWQRKLIELLKELSK 287 (303)
T ss_dssp HHHHHHHHHTTTTT-SEEEEESSSTTSSHHHHHHHHHHHHHTGG
T ss_pred HHHHHHHHhCcCCCCeEEEecCCcCCCCHHHHHHHHHHHHhhCc
Confidence 334444444444 77999998 8999988888877765554
No 11
>PF01755 Glyco_transf_25: Glycosyltransferase family 25 (LPS biosynthesis protein); InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=46.58 E-value=21 Score=28.64 Aligned_cols=41 Identities=32% Similarity=0.552 Sum_probs=27.0
Q ss_pred ccccccchhHHHHHHHHHhhc--ccceeccC-CCCchhHHHhhhh
Q 026921 108 QGTIRSSPKMISLWKRAIEST--KALVLDEI-NLGPDDLLKKVEE 149 (231)
Q Consensus 108 ~~~~~~sPkM~~LWk~AieSs--~Al~Lde~-~L~PD~LL~~VEe 149 (231)
++.+--+--=+++|+++++++ -||+|+|- -+.++ +.+.+++
T Consensus 65 ~gEiGC~lSH~~~w~~~v~~~~~~~lIlEDDv~~~~~-f~~~l~~ 108 (200)
T PF01755_consen 65 PGEIGCALSHIKAWQRIVDSGLEYALILEDDVIFDPD-FKEFLEE 108 (200)
T ss_pred cceEeehhhHHHHHHHHHHcCCCeEEEEecccccccc-HHHHHHH
Confidence 333444455689999999999 78888765 55554 4444443
No 12
>PF01695 IstB_IS21: IstB-like ATP binding protein; InterPro: IPR002611 Proteins in this entry contain an ATP/GTP binding P-loop motif. They are found associated with IS21 family insertion sequences []. Functionally they have not been characterised, but they may be involved in transposition [].; GO: 0005524 ATP binding; PDB: 3EC2_A 3ECC_A 2W58_A 2QGZ_A.
Probab=45.36 E-value=29 Score=28.64 Aligned_cols=47 Identities=21% Similarity=0.315 Sum_probs=32.9
Q ss_pred HHHHhhcccceeccCCCCchhHHHhhhhhhhhhhhhhcccCeEEeecC
Q 026921 122 KRAIESTKALVLDEINLGPDDLLKKVEEFEGISQAAEHSYPALVLSRE 169 (231)
Q Consensus 122 k~AieSs~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~pAl~~s~~ 169 (231)
.+.+.....|+|||+|..+-.=-..--=|+.+.+-.++. |-++.||-
T Consensus 103 ~~~l~~~dlLilDDlG~~~~~~~~~~~l~~ii~~R~~~~-~tIiTSN~ 149 (178)
T PF01695_consen 103 LKRLKRVDLLILDDLGYEPLSEWEAELLFEIIDERYERK-PTIITSNL 149 (178)
T ss_dssp HHHHHTSSCEEEETCTSS---HHHHHCTHHHHHHHHHT--EEEEEESS
T ss_pred cCccccccEecccccceeeecccccccchhhhhHhhccc-CeEeeCCC
Confidence 345667899999999988744333444588999999986 88888874
No 13
>PLN00410 U5 snRNP protein, DIM1 family; Provisional
Probab=40.56 E-value=72 Score=26.50 Aligned_cols=51 Identities=16% Similarity=0.359 Sum_probs=33.6
Q ss_pred hHHHHHhhcc-CceEEEcCCCCCchhHHHhhhhhhcccCe-eEEeecCceEEEEecCC
Q 026921 18 VRDVRNAFEG-SSLVKVNCKGMHASDYKKLGAKLKELVPC-VLLSFDDEQILMWRGKD 73 (231)
Q Consensus 18 V~~VrdAF~~-~elVrIDC~gl~~sDyKKIGaKLrdLVpC-vllsF~~eqIlmWRGk~ 73 (231)
++.+-+.+.. ..+++||.+..+ .+..+++=-.|| +++-|.+.++.+|||..
T Consensus 45 l~~la~~~~~~~~~~kVDVDe~~-----dla~~y~I~~~~t~~~ffk~g~~~vd~~tG 97 (142)
T PLN00410 45 LASVAETIKNFAVIYLVDITEVP-----DFNTMYELYDPCTVMFFFRNKHIMIDLGTG 97 (142)
T ss_pred HHHHHHHcCCceEEEEEECCCCH-----HHHHHcCccCCCcEEEEEECCeEEEEEecc
Confidence 4455555554 667888888543 455555444344 44488999999999965
No 14
>cd03319 L-Ala-DL-Glu_epimerase L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides. The genomic context and the substrate specificity of characterized members of this family from E.coli and B.subtilis indicates a possible role in the metabolism of the murein peptide of peptidoglycan, of which L-Ala-D-Glu is a component. L-Ala-D/L-Glu epimerase is a member of the enolase-superfamily, which is characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=36.90 E-value=41 Score=29.47 Aligned_cols=37 Identities=24% Similarity=0.381 Sum_probs=30.4
Q ss_pred ehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdL 53 (231)
..|+.||+++. .--+++|| .+.+..+..+++..|+++
T Consensus 166 ~~v~~lr~~~g-~~~l~vD~n~~~~~~~A~~~~~~l~~~ 203 (316)
T cd03319 166 ERIRAIREAAP-DARLRVDANQGWTPEEAVELLRELAEL 203 (316)
T ss_pred HHHHHHHHhCC-CCeEEEeCCCCcCHHHHHHHHHHHHhc
Confidence 45788899988 66789998 458888999999999885
No 15
>PRK11169 leucine-responsive transcriptional regulator; Provisional
Probab=36.33 E-value=39 Score=27.44 Aligned_cols=68 Identities=15% Similarity=0.152 Sum_probs=45.8
Q ss_pred hhhhccCce---EeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeE----EeecCceEEEEecCC
Q 026921 5 LHIAAKNGV---YLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVL----LSFDDEQILMWRGKD 73 (231)
Q Consensus 5 l~kL~kNGv---Y~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvl----lsF~~eqIlmWRGk~ 73 (231)
+-||.++|| |.-+|+--+--|...-+|.|.|.+-.+..+.+++..|+. .|.|. ++=+..-++.=+.++
T Consensus 49 i~rL~~~GvI~~~~~~v~p~~lg~~~~a~v~i~~~~~~~~~~~~~~~~l~~-~p~V~~~~~vtG~~d~~l~v~~~~ 123 (164)
T PRK11169 49 VRRLERQGFIQGYTALLNPHYLDASLLVFVEITLNRGAPDVFEQFNAAVQK-LEEIQECHLVSGDFDYLLKTRVPD 123 (164)
T ss_pred HHHHHHCCCeEEEEEEECHHHhCCCEEEEEEEEEcCCChHHHHHHHHHHhc-CcceeeeeeecCCCCEEEEEEECC
Confidence 568899997 667776666566556679999987777778899988887 45544 332444444434443
No 16
>cd03005 PDI_a_ERp46 PDIa family, endoplasmic reticulum protein 46 (ERp46) subfamily; ERp46 is an ER-resident protein containing three redox active TRX domains. Yeast complementation studies show that ERp46 can substitute for protein disulfide isomerase (PDI) function in vivo. It has been detected in many tissues, however, transcript and protein levels do not correlate in all tissues, suggesting regulation at a posttranscriptional level. An identical protein, named endoPDI, has been identified as an endothelial PDI that is highly expressed in the endothelium of tumors and hypoxic lesions. It has a protective effect on cells exposed to hypoxia.
Probab=34.68 E-value=1.3e+02 Score=20.91 Aligned_cols=50 Identities=20% Similarity=0.380 Sum_probs=31.5
Q ss_pred HHHHHhhc----cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921 19 RDVRNAFE----GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK 72 (231)
Q Consensus 19 ~~VrdAF~----~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk 72 (231)
+.+.+.+. .-.+++|||..-. .-+++.|. .-+|.+++--++..+...+|.
T Consensus 39 ~~~~~~~~~~~~~~~~~~vd~~~~~-~~~~~~~v---~~~Pt~~~~~~g~~~~~~~G~ 92 (102)
T cd03005 39 EQLAKKFNNENPSVKIAKVDCTQHR-ELCSEFQV---RGYPTLLLFKDGEKVDKYKGT 92 (102)
T ss_pred HHHHHHHhccCCcEEEEEEECCCCh-hhHhhcCC---CcCCEEEEEeCCCeeeEeeCC
Confidence 34444444 3678999998643 44566554 348987777666656566664
No 17
>COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair]
Probab=34.08 E-value=40 Score=29.61 Aligned_cols=60 Identities=22% Similarity=0.260 Sum_probs=49.8
Q ss_pred ccccchhHHHHHHHHHh-------------hcccceeccCCCCchhHHHhhhhhhhhhhhhhcccCeEEeecCC
Q 026921 110 TIRSSPKMISLWKRAIE-------------STKALVLDEINLGPDDLLKKVEEFEGISQAAEHSYPALVLSRED 170 (231)
Q Consensus 110 ~~~~sPkM~~LWk~Aie-------------Ss~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~pAl~~s~~~ 170 (231)
+..+.|.+++-++.|.. ....|+|||+|--|-.=-.+.+=|..|+|=.+-..+ +++||-.
T Consensus 137 ~f~~~~el~~~Lk~~~~~~~~~~~l~~~l~~~dlLIiDDlG~~~~~~~~~~~~~q~I~~r~~~~~~-~~tsN~~ 209 (254)
T COG1484 137 LFITAPDLLSKLKAAFDEGRLEEKLLRELKKVDLLIIDDIGYEPFSQEEADLLFQLISRRYESRSL-IITSNLS 209 (254)
T ss_pred EEEEHHHHHHHHHHHHhcCchHHHHHHHhhcCCEEEEecccCccCCHHHHHHHHHHHHHHHhhccc-eeecCCC
Confidence 56677888888877655 345799999999998888889999999999999999 8888743
No 18
>PRK13695 putative NTPase; Provisional
Probab=33.11 E-value=58 Score=25.79 Aligned_cols=51 Identities=20% Similarity=0.351 Sum_probs=36.3
Q ss_pred HHHHHHHHHhhcccceeccCCCCchhHHHhhhhhhhhhhhhhcccCeEEeecCC
Q 026921 117 MISLWKRAIESTKALVLDEINLGPDDLLKKVEEFEGISQAAEHSYPALVLSRED 170 (231)
Q Consensus 117 M~~LWk~AieSs~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~pAl~~s~~~ 170 (231)
+..+=+++++....++|||+ ||.+.+. -.-++.+.++.+.--|-+++++..
T Consensus 86 ~~~l~~~~l~~~~~lllDE~--~~~e~~~-~~~~~~l~~~~~~~~~~i~v~h~~ 136 (174)
T PRK13695 86 GIPALERALEEADVIIIDEI--GKMELKS-PKFVKAVEEVLDSEKPVIATLHRR 136 (174)
T ss_pred HHHHHHhccCCCCEEEEECC--Ccchhhh-HHHHHHHHHHHhCCCeEEEEECch
Confidence 44555677888999999997 5655555 334677888887667777777754
No 19
>PRK02714 O-succinylbenzoate synthase; Provisional
Probab=32.89 E-value=61 Score=29.06 Aligned_cols=42 Identities=7% Similarity=0.177 Sum_probs=35.3
Q ss_pred eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcccCe
Q 026921 15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKELVPC 56 (231)
Q Consensus 15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdLVpC 56 (231)
+..|+.||++|...--+|+|+ ++.+..+..+++..|++|.+.
T Consensus 150 ~~~v~air~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~l~~~ 192 (320)
T PRK02714 150 LKIFEQLLERLPAGAKLRLDANGGLSLEEAKRWLQLCDRRLSG 192 (320)
T ss_pred HHHHHHHHHhcCCCCEEEEECCCCCCHHHHHHHHHHHhhccCC
Confidence 356889999998888899999 569999999999999986443
No 20
>cd03318 MLE Muconate Lactonizing Enzyme (MLE), an homooctameric enzyme, catalyses the conversion of cis,cis-muconate (CCM) to muconolactone (ML) in the catechol branch of the beta-ketoadipate pathway. This pathway is used in soil microbes to breakdown lignin-derived aromatics, catechol and protocatechuate, to citric acid cycle intermediates. Some bacterial species are also capable of dehalogenating chloroaromatic compounds by the action of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs are members of the enolase superfamily characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and that is stabilized by coordination to the essential Mg2+ ion.
Probab=32.40 E-value=55 Score=29.37 Aligned_cols=40 Identities=18% Similarity=0.186 Sum_probs=33.1
Q ss_pred EeehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921 14 YLTLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL 53 (231)
Q Consensus 14 Y~~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL 53 (231)
.+..|+.||++|...--++|||-+ ++..+..+++.+|.++
T Consensus 174 d~~~v~avr~~~g~~~~l~iDaN~~~~~~~A~~~~~~l~~~ 214 (365)
T cd03318 174 DLAHVEAIAKALGDRASVRVDVNQAWDESTAIRALPRLEAA 214 (365)
T ss_pred HHHHHHHHHHHcCCCcEEEEECCCCCCHHHHHHHHHHHHhc
Confidence 345678899999877778999954 9999999999999885
No 21
>cd03315 MLE_like Muconate lactonizing enzyme (MLE) like subgroup of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and residues that can function as general acid/base catalysts, a Lys-X-Lys motif and another conserved lysine. Despite these conserved residues, the members of the MLE subgroup, like muconate lactonizing enzyme, o-succinylbenzoate synthase (OSBS) and N-acylamino acid racemase (NAAAR), catalyze different reactions.
Probab=32.39 E-value=72 Score=27.27 Aligned_cols=39 Identities=15% Similarity=0.204 Sum_probs=31.7
Q ss_pred ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhccc
Q 026921 16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKELV 54 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdLV 54 (231)
..|+.||+++...-.+++||.+ .+..+..+++..|.++-
T Consensus 117 ~~v~~vr~~~g~~~~l~vDan~~~~~~~a~~~~~~l~~~~ 156 (265)
T cd03315 117 AVVAALREAVGDDAELRVDANRGWTPKQAIRALRALEDLG 156 (265)
T ss_pred HHHHHHHHhcCCCCEEEEeCCCCcCHHHHHHHHHHHHhcC
Confidence 4578889998777788999854 88899999999998853
No 22
>PF11830 DUF3350: Domain of unknown function (DUF3350); InterPro: IPR021785 This domain is functionally uncharacterised. This domain is found in eukaryotes. This presumed domain is typically between 50 to 64 amino acids in length.
Probab=32.14 E-value=52 Score=24.48 Aligned_cols=24 Identities=25% Similarity=0.517 Sum_probs=19.1
Q ss_pred ccccccchhHHHHHHHHHhhcccc
Q 026921 108 QGTIRSSPKMISLWKRAIESTKAL 131 (231)
Q Consensus 108 ~~~~~~sPkM~~LWk~AieSs~Al 131 (231)
+..-+++-..-.||+.||..-..|
T Consensus 22 ~~~krt~eelR~LWrkAI~QqIlL 45 (56)
T PF11830_consen 22 EKKKRTREELRELWRKAIHQQILL 45 (56)
T ss_pred cccccCHHHHHHHHHHHHHHHHHH
Confidence 556677888999999999876554
No 23
>PF00626 Gelsolin: Gelsolin repeat; InterPro: IPR007123 Gelsolin is a cytoplasmic, calcium-regulated, actin-modulating protein that binds to the barbed ends of actin filaments, preventing monomer exchange (end-blocking or capping) []. It can promote nucleation (the assembly of monomers into filaments), as well as sever existing filaments. In addition, this protein binds with high affinity to fibronectin. Plasma gelsolin and cytoplasmic gelsolin are derived from a single gene by alternate initiation sites and differential splicing. Sequence comparisons indicate an evolutionary relationship between gelsolin, villin, fragmin and severin []. Six large repeating segments occur in gelsolin and villin, and 3 similar segments in severin and fragmin. While the multiple repeats have yet to be related to any known function of the actin-severing proteins, the superfamily appears to have evolved from an ancestral sequence of 120 to 130 amino acid residues [].; PDB: 3FG6_F 1RGI_G 2FGH_A 1D0N_B 3EGD_B 2NUP_B 2NUT_B 3EGX_B 1JHW_A 1J72_A ....
Probab=31.65 E-value=55 Score=22.22 Aligned_cols=17 Identities=35% Similarity=0.903 Sum_probs=12.3
Q ss_pred eeEEeecCceEEEEecCC
Q 026921 56 CVLLSFDDEQILMWRGKD 73 (231)
Q Consensus 56 CvllsF~~eqIlmWRGk~ 73 (231)
|.+|.- +..|.+|.|+.
T Consensus 20 ~yIld~-~~~i~vW~G~~ 36 (76)
T PF00626_consen 20 CYILDC-GYEIFVWVGKK 36 (76)
T ss_dssp EEEEEE-SSEEEEEEHTT
T ss_pred EEEEEe-CCCcEEEEecc
Confidence 444444 44999999998
No 24
>PF14795 Leucyl-specific: Leucine-tRNA synthetase-specific domain; PDB: 1OBC_A 2BTE_A 2V0G_A 2BYT_A 1OBH_A 1H3N_A 2V0C_A.
Probab=30.70 E-value=21 Score=26.56 Aligned_cols=25 Identities=28% Similarity=0.729 Sum_probs=18.7
Q ss_pred CCchhHHHhhhhhhcccCeeEEeecCceEEEEe
Q 026921 38 MHASDYKKLGAKLKELVPCVLLSFDDEQILMWR 70 (231)
Q Consensus 38 l~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWR 70 (231)
|...|.||+||.||. -++.++-+|+
T Consensus 31 Ls~ee~~KmGAELR~--------hedG~~h~WK 55 (56)
T PF14795_consen 31 LSLEEVKKMGAELRP--------HEDGTLHFWK 55 (56)
T ss_dssp EEHHHHHHTT-EEEE---------TTSSEEEEE
T ss_pred ccHHHHHhhchhccc--------CCCCeEeecC
Confidence 677899999999986 4667777774
No 25
>TIGR01927 menC_gamma/gm+ o-succinylbenzoic acid (OSB) synthetase. This model describes the enzyme o-succinylbenzoic acid synthetase (menC) that is involved in one of the steps of the menaquinone biosynthesis pathway. It takes SHCHC and makes it into 2-succinylbenzoate. Included in this model are gamma proteobacteria and archaea. Many of the com-names of the proteins identified by the model are identified as O-succinylbenzoyl-CoA synthase in error.
Probab=30.62 E-value=78 Score=28.29 Aligned_cols=38 Identities=13% Similarity=0.211 Sum_probs=32.2
Q ss_pred eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhc
Q 026921 15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKE 52 (231)
Q Consensus 15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrd 52 (231)
+..|+.||+++...--+|+|| ++.+..+..+++..|.+
T Consensus 141 ~~~v~~vr~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~ 179 (307)
T TIGR01927 141 GMLVNLLLEALPDKAELRLDANGGLSPDEAQQFLKALDP 179 (307)
T ss_pred HHHHHHHHHHcCCCCeEEEeCCCCCCHHHHHHHHHhccc
Confidence 356888999997767899999 66999999999999986
No 26
>cd03329 MR_like_4 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 4. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=29.99 E-value=65 Score=29.19 Aligned_cols=38 Identities=24% Similarity=0.205 Sum_probs=31.4
Q ss_pred ehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdL 53 (231)
..|+.||+++...--+++|| ++.+..+..++...|.++
T Consensus 177 ~~i~~vR~~~G~~~~l~vDan~~~~~~~A~~~~~~l~~~ 215 (368)
T cd03329 177 KACLAVREAVGPDMRLMHDGAHWYSRADALRLGRALEEL 215 (368)
T ss_pred HHHHHHHHHhCCCCeEEEECCCCcCHHHHHHHHHHhhhc
Confidence 34678899998877889999 568999999999888874
No 27
>cd00308 enolase_like Enolase-superfamily, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Enolase superfamily contains different enzymes, like enolases, glutarate-, fucanate- and galactonate dehydratases, o-succinylbenzoate synthase, N-acylamino acid racemase, L-alanine-DL-glutamate epimerase, mandelate racemase, muconate lactonizing enzyme and 3-methylaspartase.
Probab=29.37 E-value=75 Score=26.57 Aligned_cols=38 Identities=26% Similarity=0.385 Sum_probs=34.0
Q ss_pred eehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhc
Q 026921 15 LTLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKE 52 (231)
Q Consensus 15 ~~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrd 52 (231)
+..|+.||+++...--+++||.+ ++..+..++...|.+
T Consensus 81 ~~~i~~lr~~~g~~~~l~lDaN~~~~~~~a~~~~~~l~~ 119 (229)
T cd00308 81 IERVRAVREAFGPDARLAVDANGAWTPKEAIRLIRALEK 119 (229)
T ss_pred HHHHHHHHHHhCCCCeEEEECCCCCCHHHHHHHHHHhhh
Confidence 57789999999888889999987 888999999999987
No 28
>cd03325 D-galactonate_dehydratase D-galactonate dehydratase catalyses the dehydration of galactonate to 2-keto-3-deoxygalactnate (KDGal), as part of the D-galactonate nonphosphorolytic catabolic Entner-Doudoroff pathway. D-galactonate dehydratase belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=29.23 E-value=67 Score=28.99 Aligned_cols=39 Identities=21% Similarity=0.242 Sum_probs=33.4
Q ss_pred eehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921 15 LTLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL 53 (231)
Q Consensus 15 ~~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL 53 (231)
...|+.||++|...--+++||-+ .+..+..+++..|.++
T Consensus 163 ~~~i~avr~~~g~~~~l~vDaN~~~~~~~A~~~~~~l~~~ 202 (352)
T cd03325 163 VERVAALREAVGPDIDIGVDFHGRVSKPMAKDLAKELEPY 202 (352)
T ss_pred HHHHHHHHHhhCCCCEEEEECCCCCCHHHHHHHHHhcccc
Confidence 34688999999888789999955 9999999999999884
No 29
>PRK15072 bifunctional D-altronate/D-mannonate dehydratase; Provisional
Probab=28.35 E-value=67 Score=29.77 Aligned_cols=38 Identities=21% Similarity=0.275 Sum_probs=33.5
Q ss_pred ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL 53 (231)
..|+.||++|...--+++||-+ .+..+..+++..|.++
T Consensus 194 ~~v~avre~~G~~~~l~vDaN~~w~~~~A~~~~~~l~~~ 232 (404)
T PRK15072 194 KLFEAVRNKFGFDLHLLHDVHHRLTPIEAARLGKSLEPY 232 (404)
T ss_pred HHHHHHHhhhCCCceEEEECCCCCCHHHHHHHHHhcccc
Confidence 4799999999887779999965 9999999999999884
No 30
>cd05797 Ribosomal_L10 Ribosomal protein L10 family, L10 subfamily; composed of bacterial 50S ribosomal protein and eukaryotic mitochondrial 39S ribosomal protein, L10. L10 occupies the L7/L12 stalk of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-in
Probab=27.08 E-value=79 Score=25.33 Aligned_cols=38 Identities=16% Similarity=0.441 Sum_probs=33.9
Q ss_pred ehhHHHHHhhccCc-eEEEcCCCCCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSS-LVKVNCKGMHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~e-lVrIDC~gl~~sDyKKIGaKLrdL 53 (231)
.+|+.+++.|+.+. ++-++++|++.++...+-.+||.-
T Consensus 7 ~~v~~l~~~l~~~~~v~v~~~~gl~~~~~~~lR~~lr~~ 45 (157)
T cd05797 7 EIVAELKEKLKEAKSVVVADYRGLTVAQLTELRKELREA 45 (157)
T ss_pred HHHHHHHHHHHhCCEEEEEecCCCcHHHHHHHHHHHHHc
Confidence 36889999999987 678899999999999999999973
No 31
>smart00344 HTH_ASNC helix_turn_helix ASNC type. AsnC: an autogenously regulated activator of asparagine synthetase A transcription in Escherichia coli
Probab=26.35 E-value=91 Score=22.84 Aligned_cols=54 Identities=17% Similarity=0.349 Sum_probs=35.4
Q ss_pred hhhhccCce---EeehhHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEee
Q 026921 5 LHIAAKNGV---YLTLVRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSF 61 (231)
Q Consensus 5 l~kL~kNGv---Y~~lV~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF 61 (231)
|.+|.++|+ |...+..=+-.+...-+|.|.|.. +....+++..|+. .|+|.--+
T Consensus 38 l~~L~~~g~i~~~~~~~~~~~~g~~~~~~v~i~~~~--~~~~~~v~~~l~~-~p~v~~~~ 94 (108)
T smart00344 38 VKRLEEEGVIKGYTAVINPKKLGLSVTAFVGVDLES--PDKLEEFLEKLEK-LPEVVEVY 94 (108)
T ss_pred HHHHHHCCCeeceEEEeCHHHcCCCEEEEEEEEECC--hhHHHHHHHHHhC-CcceEEee
Confidence 457778885 333333222233457888999986 5678999999998 57665433
No 32
>cd03316 MR_like Mandelate racemase (MR)-like subfamily of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. Members of the MR subgroup are mandelate racemase, D-glucarate/L-idarate dehydratase (GlucD), D-altronate/D-mannonate dehydratase , D-galactonate dehydratase (GalD) , D-gluconate dehydratase (GlcD), and L-rhamnonate dehydratase (RhamD).
Probab=25.39 E-value=1.1e+02 Score=27.04 Aligned_cols=39 Identities=26% Similarity=0.274 Sum_probs=31.7
Q ss_pred eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhcc
Q 026921 15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKEL 53 (231)
Q Consensus 15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrdL 53 (231)
+..|+.||+++..+--+++|+ ++++.++..++...|+++
T Consensus 176 ~~~v~~ir~~~g~~~~l~vDaN~~~~~~~a~~~~~~l~~~ 215 (357)
T cd03316 176 LARVRAVREAVGPDVDLMVDANGRWDLAEAIRLARALEEY 215 (357)
T ss_pred HHHHHHHHHhhCCCCEEEEECCCCCCHHHHHHHHHHhCcc
Confidence 345788899998777889998 578888888998888875
No 33
>KOG1990 consensus Poly(A)-specific exoribonuclease PARN [Replication, recombination and repair]
Probab=25.12 E-value=27 Score=34.62 Aligned_cols=76 Identities=18% Similarity=0.218 Sum_probs=61.1
Q ss_pred hhhccCceEeehhHHHHHhhccCceEEEcCC-CCCchhHHHhhhhh--hcccCeeEEeecCceEEEEecCCCCCCCCCC
Q 026921 6 HIAAKNGVYLTLVRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKL--KELVPCVLLSFDDEQILMWRGKDWKSMYPEP 81 (231)
Q Consensus 6 ~kL~kNGvY~~lV~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKL--rdLVpCvllsF~~eqIlmWRGk~~k~~~~~~ 81 (231)
.-++.++.-..++..+++.|..++..++-|+ |....-.+-.+.-| +-+-+-+||+=.+.-++|+||+++-+..|.+
T Consensus 194 ~~~~~~~~~~~~~~~~~~~~~k~~~~k~~~~rg~~~~~~~~~a~~l~~~~~tg~~lv~hN~~~dv~y~~~~Fl~~lp~~ 272 (564)
T KOG1990|consen 194 FALGRSRKLQGLAVAMVSFWEKHEFAKILIKRGVLETRKERMADELQELLLTGKVLVLHNKLLDVMYRYKNFLSPLPST 272 (564)
T ss_pred ceehhccccccchhHHHHHHHHHHHHHHHHHhcchhhhccchHHHHHHHHhcCCeEEeeccceeeeeehhhcccccchh
Confidence 3467788888999999999999999998886 44444444455566 8899999999999999999999998855543
No 34
>PF07728 AAA_5: AAA domain (dynein-related subfamily); InterPro: IPR011704 The ATPases Associated to a variety of cellular Activities (AAA) are a family distinguished by a highly conserved module of 230 amino acids []. The highly conserved nature of this module across taxa suggests that it has a key cellular role. Members of the family are involved in diverse cellular functions including gene expression, peroxisome assembly and vesicle mediated transport. Although the role of this ATPase AAA domain is not, as yet, clear, the AAA+ superfamily of proteins to which the AAA ATPases belong has a chaperone-like function in the assembly, operation or disassembly of proteins []. This ATPase domain includes some proteins not detected by the IPR003959 from INTERPRO model.; GO: 0005524 ATP binding, 0016887 ATPase activity; PDB: 3NBX_X 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 3VKH_A 3VKG_A.
Probab=25.01 E-value=27 Score=26.20 Aligned_cols=24 Identities=42% Similarity=0.649 Sum_probs=17.4
Q ss_pred HHhhcccceeccCCCCchhHHHhh
Q 026921 124 AIESTKALVLDEINLGPDDLLKKV 147 (231)
Q Consensus 124 AieSs~Al~Lde~~L~PD~LL~~V 147 (231)
|++....++|||+++.+.+++..+
T Consensus 62 a~~~~~il~lDEin~a~~~v~~~L 85 (139)
T PF07728_consen 62 AMRKGGILVLDEINRAPPEVLESL 85 (139)
T ss_dssp THHEEEEEEESSCGG--HHHHHTT
T ss_pred cccceeEEEECCcccCCHHHHHHH
Confidence 444889999999999888876654
No 35
>PRK14017 galactonate dehydratase; Provisional
Probab=25.01 E-value=90 Score=28.50 Aligned_cols=38 Identities=26% Similarity=0.281 Sum_probs=32.6
Q ss_pred ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL 53 (231)
..|+.||++|...--+++||.+ .+..+..+++..|+++
T Consensus 165 ~~i~avr~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~~ 203 (382)
T PRK14017 165 ARVAAVREAVGPEIGIGVDFHGRVHKPMAKVLAKELEPY 203 (382)
T ss_pred HHHHHHHHHhCCCCeEEEECCCCCCHHHHHHHHHhhccc
Confidence 4577899999887789999965 8999999999999884
No 36
>KOG0469 consensus Elongation factor 2 [Translation, ribosomal structure and biogenesis]
Probab=24.37 E-value=92 Score=32.64 Aligned_cols=50 Identities=30% Similarity=0.383 Sum_probs=45.7
Q ss_pred HhhccCceEEEcCCCCCchhHHHhhhhhhccc---CeeEEee--cCceEEEEecC
Q 026921 23 NAFEGSSLVKVNCKGMHASDYKKLGAKLKELV---PCVLLSF--DDEQILMWRGK 72 (231)
Q Consensus 23 dAF~~~elVrIDC~gl~~sDyKKIGaKLrdLV---pCvllsF--~~eqIlmWRGk 72 (231)
+-|..+++||+.-.--++.|.-|+-+-|++|. ||++..+ .+|+||-=-|.
T Consensus 481 MKFSVSPVV~VAVe~Knp~DLpKLvEGLkrLakSDP~v~~~~~esGehiiAgaGe 535 (842)
T KOG0469|consen 481 MKFSVSPVVRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGE 535 (842)
T ss_pred EEeeccceEEEEEecCChhhhHHHHHHHHHHhccCCeEEEEeccCCceEEeccch
Confidence 56999999999999999999999999999986 9999998 68999988874
No 37
>PF09026 CENP-B_dimeris: Centromere protein B dimerisation domain; InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box. CENP-B may organise arrays of centromere satellite DNA into a higher order structure, which then directs centromere formation and kinetochore assembly in mammalian chromosomes. The CENP-B dimerisation domain is composed of two alpha-helices, which are folded into an antiparallel configuration. Dimerisation of CENP-B is mediated by this domain, in which monomers dimerise to form a symmetrical, antiparallel, four-helix bundle structure with a large hydrophobic patch in which 23 residues of one monomer form van der Waals contacts with the other monomer. This CENP-B dimer configuration may be suitable for capturing two distant CENP-B boxes during centromeric heterochromatin formation []. ; GO: 0003677 DNA binding, 0003682 chromatin binding, 0006355 regulation of transcription, DNA-dependent, 0000775 chromosome, centromeric region, 0005634 nucleus; PDB: 1UFI_A.
Probab=24.26 E-value=31 Score=28.25 Aligned_cols=10 Identities=40% Similarity=0.697 Sum_probs=4.6
Q ss_pred CCCccHHHHH
Q 026921 216 LGSLPVDHIA 225 (231)
Q Consensus 216 lGsLPvD~i~ 225 (231)
|+|+|||.=+
T Consensus 60 ltSf~id~~v 69 (101)
T PF09026_consen 60 LTSFPIDDKV 69 (101)
T ss_dssp HCTS---HHH
T ss_pred hhccchhHhh
Confidence 6888887643
No 38
>PTZ00051 thioredoxin; Provisional
Probab=23.49 E-value=2.6e+02 Score=19.42 Aligned_cols=51 Identities=12% Similarity=0.142 Sum_probs=31.1
Q ss_pred hHHHHHhhccCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecCceEEEEecC
Q 026921 18 VRDVRNAFEGSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDDEQILMWRGK 72 (231)
Q Consensus 18 V~~VrdAF~~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~eqIlmWRGk 72 (231)
+..+...|..--+++|||... +.-.++.|. + =+|++++.-++..+-.+.|.
T Consensus 40 l~~l~~~~~~~~~~~vd~~~~-~~~~~~~~v--~-~~Pt~~~~~~g~~~~~~~G~ 90 (98)
T PTZ00051 40 YEECSKEYTKMVFVKVDVDEL-SEVAEKENI--T-SMPTFKVFKNGSVVDTLLGA 90 (98)
T ss_pred HHHHHHHcCCcEEEEEECcch-HHHHHHCCC--c-eeeEEEEEeCCeEEEEEeCC
Confidence 344444555677899999742 233444444 2 48997766566666567774
No 39
>PF09248 DUF1965: Domain of unknown function (DUF1965); InterPro: IPR015328 Members of this family of fungal domains adopt a structure that consists of an alpha/beta motif. Their exact function has not, as yet, been determined []. ; PDB: 1N9E_A 1RKY_A 1W7C_A 3PGB_A.
Probab=23.09 E-value=66 Score=24.82 Aligned_cols=24 Identities=21% Similarity=0.387 Sum_probs=18.7
Q ss_pred cCceEeehhHHHHHhhccCceEEE
Q 026921 10 KNGVYLTLVRDVRNAFEGSSLVKV 33 (231)
Q Consensus 10 kNGvY~~lV~~VrdAF~~~elVrI 33 (231)
=||.|-..+...|+||..-+++|+
T Consensus 32 Yn~~fy~tteeFr~A~~~~~f~k~ 55 (74)
T PF09248_consen 32 YNGQFYPTTEEFREAWWSGDFKKL 55 (74)
T ss_dssp ETTEEESSHHHHHHHHCSTT----
T ss_pred ECCEEcccHHHHHHHHhCCCceec
Confidence 489999999999999999999886
No 40
>cd03326 MR_like_1 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 1. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=22.86 E-value=1.3e+02 Score=28.09 Aligned_cols=40 Identities=5% Similarity=0.022 Sum_probs=30.8
Q ss_pred ehhHHHHHhhccCceEEEcCC-CCCchhHHHhhhhhhcccC
Q 026921 16 TLVRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKLKELVP 55 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKLrdLVp 55 (231)
..|+.||+++...--+++||- +.+.++-.+++..|+++-+
T Consensus 193 ~~v~avRe~~G~~~~l~vDaN~~w~~~~A~~~~~~l~~~~~ 233 (385)
T cd03326 193 RRIEAALDVLGDGARLAVDANGRFDLETAIAYAKALAPYGL 233 (385)
T ss_pred HHHHHHHHhcCCCCeEEEECCCCCCHHHHHHHHHHhhCcCC
Confidence 457778888877777888884 4888888888888888543
No 41
>cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=22.75 E-value=1.4e+02 Score=25.80 Aligned_cols=38 Identities=13% Similarity=0.217 Sum_probs=30.5
Q ss_pred ehhHHHHHhhccCceEEEcCCC-CCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSSLVKVNCKG-MHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC~g-l~~sDyKKIGaKLrdL 53 (231)
..|+.||+++...--+++|+-+ ++..+..+++..|.++
T Consensus 115 ~~v~~vr~~~g~~~~l~vDaN~~w~~~~A~~~~~~l~~~ 153 (263)
T cd03320 115 ARLRALREALPADAKLRLDANGGWSLEEALAFLEALAAG 153 (263)
T ss_pred HHHHHHHHHcCCCCeEEEeCCCCCCHHHHHHHHHhhccc
Confidence 4578888888877778898855 8888888888888774
No 42
>PRK10645 divalent-cation tolerance protein CutA; Provisional
Probab=21.58 E-value=81 Score=25.34 Aligned_cols=42 Identities=26% Similarity=0.462 Sum_probs=33.5
Q ss_pred eEEEcCCCCCchhHHHhhhhh--hcccCeeEEeecCceEEEEecCC
Q 026921 30 LVKVNCKGMHASDYKKLGAKL--KELVPCVLLSFDDEQILMWRGKD 73 (231)
Q Consensus 30 lVrIDC~gl~~sDyKKIGaKL--rdLVpCvllsF~~eqIlmWRGk~ 73 (231)
+|-+.|. +...-++|+..| +.|+.||=|.-.-+-+-.|.|+-
T Consensus 13 lV~tT~p--~~e~A~~ia~~Lve~rLaACvni~p~i~S~Y~W~G~i 56 (112)
T PRK10645 13 VVLCTAP--DEATAQDLAAKVLAEKLAACVTLLPGATSLYYWEGKL 56 (112)
T ss_pred EEEEeCC--CHHHHHHHHHHHHHCCeeEEEecCCCeeEEEEECCEE
Confidence 4555554 566789999998 88999998877788999999975
No 43
>COG1062 AdhC Zn-dependent alcohol dehydrogenases, class III [Energy production and conversion]
Probab=21.55 E-value=2.4e+02 Score=27.59 Aligned_cols=104 Identities=19% Similarity=0.294 Sum_probs=68.8
Q ss_pred ehhHHHHHhhc-cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEee--cCceE-----EEEecCCCCCCCCC--CCCCC
Q 026921 16 TLVRDVRNAFE-GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSF--DDEQI-----LMWRGKDWKSMYPE--PPSFS 85 (231)
Q Consensus 16 ~lV~~VrdAF~-~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF--~~eqI-----lmWRGk~~k~~~~~--~~~~~ 85 (231)
.+|+.+++... +...+ |||.|-..-=---+.+.-| .--||++-+ -+..| -+-.||.||.++-- .|..+
T Consensus 242 ~vv~~i~~~T~gG~d~~-~e~~G~~~~~~~al~~~~~-~G~~v~iGv~~~~~~i~~~~~~lv~gr~~~Gs~~G~~~p~~d 319 (366)
T COG1062 242 DVVEAIVELTDGGADYA-FECVGNVEVMRQALEATHR-GGTSVIIGVAGAGQEISTRPFQLVTGRVWKGSAFGGARPRSD 319 (366)
T ss_pred hHHHHHHHhcCCCCCEE-EEccCCHHHHHHHHHHHhc-CCeEEEEecCCCCceeecChHHeeccceEEEEeecCCccccc
Confidence 57777787777 44444 8888866532233444444 566777776 22222 24456777766633 13333
Q ss_pred CcccccccCCCCCCCCCCCCCCccccccchhHHHHHHHHHhhcccceeccCCCCchhHHHhhhhhhhhhhhhhccc
Q 026921 86 NPVDLDIAGDADGSGTPSDDPSQGTIRSSPKMISLWKRAIESTKALVLDEINLGPDDLLKKVEEFEGISQAAEHSY 161 (231)
Q Consensus 86 ip~~~~~~~~~~~s~~s~~e~~~~~~~~sPkM~~LWk~AieSs~Al~Lde~~L~PD~LL~~VEef~~~SqA~EHS~ 161 (231)
| ||+..||+. =.|..|.|++..-.++.|.||.++..
T Consensus 320 i----------------------------P~lv~~y~~------------Gkl~~d~lvt~~~~Le~INeaf~~m~ 355 (366)
T COG1062 320 I----------------------------PRLVDLYMA------------GKLPLDRLVTHTIPLEDINEAFDLMH 355 (366)
T ss_pred h----------------------------hHHHHHHHc------------CCCchhHHhhccccHHHHHHHHHHHh
Confidence 3 999999974 24888999999999999999998754
No 44
>PF03091 CutA1: CutA1 divalent ion tolerance protein; InterPro: IPR004323 The CutA family of proteins which exhibit ion tolerance are found in a large variety of species []. In E.Coli, two operons on the cutA locus contain genes that encode three proteins, CutA1, CutA2 and CutA3. CutA1 proteins are found in the cytoplasm while CutA2 (50kDa) and CutA3 (24kDa) are located in the inner membrane. Although the role of E. Coli CutA1 is not clear, studies on E. coli cutA locus describe some mutations that lead to an increase in copper sensitivity, thus suggesting a role in ion tolerance []. To date, the structure of CutA proteins from several species have been solved [, ]. The crystal structures of the E.Coli and rat CutA1 proteins show both these proteins to be trimeric in the crystal as well as in solution[].Trimerisation seems to supported by the formation of beta sheets between the subunit. This trimeric structure suggests the protein may be involved in signal transduction due to architectural similarities with PII signal transducer proteins []. Recent studies propose that mammalian CutA1 in the neuronal cell membrane acts as an anchor for acetylcholinesterase (AChE)1 [].; GO: 0010038 response to metal ion; PDB: 1O5J_A 1KR4_A 1VHF_A 3GSD_H 1OSC_A 3OPK_A 4E98_C 1V9B_F 1UKU_A 2E66_C ....
Probab=21.49 E-value=97 Score=24.30 Aligned_cols=34 Identities=24% Similarity=0.473 Sum_probs=26.4
Q ss_pred CchhHHHhhhhh--hcccCeeEEeecCceEEEEecC
Q 026921 39 HASDYKKLGAKL--KELVPCVLLSFDDEQILMWRGK 72 (231)
Q Consensus 39 ~~sDyKKIGaKL--rdLVpCvllsF~~eqIlmWRGk 72 (231)
+...-++|+..| +.|+.|+=+.=.=+-+.+|.|+
T Consensus 10 ~~e~A~~ia~~lve~rLaACvni~~~i~S~Y~W~g~ 45 (102)
T PF03091_consen 10 DREEAERIARALVEERLAACVNIIPPITSIYRWEGK 45 (102)
T ss_dssp SHHHHHHHHHHHHHTTS-SEEEEEEEEEEEEEETTE
T ss_pred CHHHHHHHHHHHHhCCeEEEEEecCCceeEEEECCc
Confidence 346778888887 7899999776577788899997
No 45
>cd03327 MR_like_2 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 2. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=21.34 E-value=1.1e+02 Score=27.47 Aligned_cols=38 Identities=21% Similarity=0.287 Sum_probs=31.2
Q ss_pred ehhHHHHHhhccCceEEEcCC-CCCchhHHHhhhhhhcc
Q 026921 16 TLVRDVRNAFEGSSLVKVNCK-GMHASDYKKLGAKLKEL 53 (231)
Q Consensus 16 ~lV~~VrdAF~~~elVrIDC~-gl~~sDyKKIGaKLrdL 53 (231)
..|+.||+++...--+++||- +.+..+..+++..|.++
T Consensus 159 ~~v~avr~~~g~~~~l~vDan~~~~~~~A~~~~~~l~~~ 197 (341)
T cd03327 159 ELVRAIREAVGYDVDLMLDCYMSWNLNYAIKMARALEKY 197 (341)
T ss_pred HHHHHHHHHhCCCCcEEEECCCCCCHHHHHHHHHHhhhc
Confidence 357889999987777899984 48889999999998884
No 46
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4) to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=21.22 E-value=67 Score=24.82 Aligned_cols=27 Identities=37% Similarity=0.698 Sum_probs=19.9
Q ss_pred hhHHHHHHHHHhhc--ccceecc-CCCCch
Q 026921 115 PKMISLWKRAIEST--KALVLDE-INLGPD 141 (231)
Q Consensus 115 PkM~~LWk~AieSs--~Al~Lde-~~L~PD 141 (231)
=.-+++|+++++++ -||+|+| +.+.|+
T Consensus 69 lSH~~~w~~~~~~~~~~alIlEDDv~~~~~ 98 (128)
T cd06532 69 LSHYKLWQKIVESNLEYALILEDDAILDPD 98 (128)
T ss_pred HHHHHHHHHHHHcCCCeEEEEccCcEECCC
Confidence 45689999999988 4666654 467666
No 47
>PRK05105 O-succinylbenzoate synthase; Provisional
Probab=20.99 E-value=1.2e+02 Score=27.40 Aligned_cols=37 Identities=11% Similarity=0.140 Sum_probs=29.4
Q ss_pred eehhHHHHHhhccCceEEEcC-CCCCchhHHHhhhhhhc
Q 026921 15 LTLVRDVRNAFEGSSLVKVNC-KGMHASDYKKLGAKLKE 52 (231)
Q Consensus 15 ~~lV~~VrdAF~~~elVrIDC-~gl~~sDyKKIGaKLrd 52 (231)
+..|+.||+++ .+--+|+|| ++.+..+..+.+..|.+
T Consensus 145 ~~~i~~vr~~~-~~~~l~vDaN~~w~~~~A~~~~~~l~~ 182 (322)
T PRK05105 145 GMLVNLLLEAI-PDLKLRLDANRGWTLEKAQQFAKYVPP 182 (322)
T ss_pred HHHHHHHHHhC-CCCeEEEECCCCCCHHHHHHHHHHhhh
Confidence 35577889988 677889999 55888888888888876
No 48
>PF12153 CAP18_C: LPS binding domain of CAP18 (C terminal); InterPro: IPR022746 This entry represents the cathlecidin antimicrobial C-terminal peptides. The C terminus is cleaved from the cathlecidin precursor, and is approximately 30 amino acids in length with a helical structure. This entry is found in association with PF00666 from PFAM. The C terminus peptides possess antimicrobial activity by virtue of their binding to bacterial lipopolysaccharides [][].; GO: 0042742 defense response to bacterium; PDB: 1LYP_A 2LMF_A 2FBU_H 2FBS_N 2K6O_A 2FCG_F.
Probab=20.77 E-value=35 Score=22.35 Aligned_cols=14 Identities=36% Similarity=0.722 Sum_probs=10.6
Q ss_pred hhHHHhhhhhhccc
Q 026921 41 SDYKKLGAKLKELV 54 (231)
Q Consensus 41 sDyKKIGaKLrdLV 54 (231)
...||||.|.+|+.
T Consensus 13 eklkkIGQkIKDff 26 (28)
T PF12153_consen 13 EKLKKIGQKIKDFF 26 (28)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHh
Confidence 35788888888864
No 49
>PF13401 AAA_22: AAA domain; PDB: 2QBY_B 1FNN_B 1W5T_A 1W5S_B.
Probab=20.77 E-value=1.1e+02 Score=22.29 Aligned_cols=25 Identities=32% Similarity=0.444 Sum_probs=19.6
Q ss_pred chhHHHHHHHHHhhcc--cceeccCCC
Q 026921 114 SPKMISLWKRAIESTK--ALVLDEINL 138 (231)
Q Consensus 114 sPkM~~LWk~AieSs~--Al~Lde~~L 138 (231)
...+++++.++++... .|++||++.
T Consensus 72 ~~~l~~~~~~~l~~~~~~~lviDe~~~ 98 (131)
T PF13401_consen 72 SDELRSLLIDALDRRRVVLLVIDEADH 98 (131)
T ss_dssp HHHHHHHHHHHHHHCTEEEEEEETTHH
T ss_pred HHHHHHHHHHHHHhcCCeEEEEeChHh
Confidence 4667788888888776 599999954
No 50
>PF07039 DUF1325: SGF29 tudor-like domain; InterPro: IPR010750 SAGA-associated factor 29 is involved in transcriptional regulation, probably through association with histone acetyltransferase (HAT) complexes like the TFTC-HAT or STAGA complexes. It also may be involved in MYC-mediated oncogenic transformation. It is a component of the ATAC complex, which is a complex with histone acetyltransferase activity on histones H3 and H4 []. This entry represents a domain found in yeast and human SAGA-associated factor 29 proteins that is related to the tudor domain. ; PDB: 3MP6_A 3MP1_A 3MP8_A 3MET_B 3ME9_A 3MEU_B 3MEA_A 3MEV_B 3LX7_A 3MEW_A.
Probab=20.74 E-value=77 Score=25.81 Aligned_cols=27 Identities=30% Similarity=0.423 Sum_probs=18.3
Q ss_pred hhcccCeEEeecCCCCCCcc-ccccCCC
Q 026921 157 AEHSYPALVLSREDGASSSM-AEYEDGS 183 (231)
Q Consensus 157 ~EHS~pAl~~s~~~~~~~~~-ae~~dd~ 183 (231)
|---|||.|++......+.+ ..|+||.
T Consensus 85 TT~FY~A~V~~~p~~~~~~y~l~Fedd~ 112 (130)
T PF07039_consen 85 TTCFYPATVVSPPKKKSGEYKLKFEDDE 112 (130)
T ss_dssp SSEEEEEEEEEE-SSTTS-EEEEECTTT
T ss_pred CceEEEEEEEeCCCCCCCcEEEEEeCCC
Confidence 45569999999976555555 7777743
No 51
>PRK06512 thiamine-phosphate pyrophosphorylase; Provisional
Probab=20.72 E-value=1.5e+02 Score=25.81 Aligned_cols=39 Identities=23% Similarity=0.413 Sum_probs=34.9
Q ss_pred ehhHHHHHhhccC--ceEEEcCCCCCchhHHHhhhhhhccc
Q 026921 16 TLVRDVRNAFEGS--SLVKVNCKGMHASDYKKLGAKLKELV 54 (231)
Q Consensus 16 ~lV~~VrdAF~~~--elVrIDC~gl~~sDyKKIGaKLrdLV 54 (231)
.+...++.|...- ..|.+-.++++..++.++..+|++++
T Consensus 27 ~~~~~l~~al~~G~v~~vQlR~K~l~~~~~~~~a~~l~~l~ 67 (221)
T PRK06512 27 ELAKLLRAALQGGDVASVILPQYGLDEATFQKQAEKLVPVI 67 (221)
T ss_pred cHHHHHHHHHcCCCccEEEEeCCCCCHHHHHHHHHHHHHHH
Confidence 5677888998887 59999999999999999999999985
No 52
>cd02998 PDI_a_ERp38 PDIa family, endoplasmic reticulum protein 38 (ERp38) subfamily; composed of proteins similar to the P5-like protein first isolated from alfalfa, which contains two redox active TRX (a) domains at the N-terminus, like human P5, and a C-terminal domain with homology to the C-terminal domain of ERp29, unlike human P5. The cDNA clone of this protein (named G1) was isolated from an alfalfa cDNA library by screening with human protein disulfide isomerase (PDI) cDNA. The G1 protein is constitutively expressed in all major organs of the plant and its expression is induced by treatment with tunicamycin, indicating that it may be a glucose-regulated protein. The G1 homolog in the eukaryotic social amoeba Dictyostelium discoideum is also described as a P5-like protein, which is located in the endoplasmic reticulum (ER) despite the absence of an ER-retrieval signal. G1 homologs from Aspergillus niger and Neurospora crassa have also been characterized, and are named TIGA and ER
Probab=20.07 E-value=2.4e+02 Score=19.40 Aligned_cols=52 Identities=15% Similarity=0.204 Sum_probs=31.3
Q ss_pred hhHHHHHhhc---cCceEEEcCCCCCchhHHHhhhhhhcccCeeEEeecC-ceEEEEec
Q 026921 17 LVRDVRNAFE---GSSLVKVNCKGMHASDYKKLGAKLKELVPCVLLSFDD-EQILMWRG 71 (231)
Q Consensus 17 lV~~VrdAF~---~~elVrIDC~gl~~sDyKKIGaKLrdLVpCvllsF~~-eqIlmWRG 71 (231)
.+..+.+.+. .-.+++|||..-.+.=+++.|.. -+|++++--.+ .+...+.|
T Consensus 39 ~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~~~~i~---~~P~~~~~~~~~~~~~~~~g 94 (105)
T cd02998 39 EYEKLAAVFANEDDVVIAKVDADEANKDLAKKYGVS---GFPTLKFFPKGSTEPVKYEG 94 (105)
T ss_pred HHHHHHHHhCCCCCEEEEEEECCCcchhhHHhCCCC---CcCEEEEEeCCCCCccccCC
Confidence 3455555665 24678999987444556666642 37887765444 34555555
Done!