Query 029269
Match_columns 196
No_of_seqs 140 out of 288
Neff 4.0
Searched_HMMs 46136
Date Fri Mar 29 10:11:42 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029269.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029269hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1696 60s ribosomal protein 100.0 1.3E-83 2.9E-88 535.7 16.2 193 1-193 1-193 (193)
2 PTZ00097 60S ribosomal protein 100.0 3.2E-79 6.9E-84 508.9 18.7 174 3-176 1-174 (175)
3 cd01417 Ribosomal_L19e_E Ribos 100.0 6.8E-78 1.5E-82 497.0 17.4 164 4-167 1-164 (164)
4 PTZ00436 60S ribosomal protein 100.0 4.1E-76 9E-81 524.9 20.1 170 1-170 1-170 (357)
5 PF01280 Ribosomal_L19e: Ribos 100.0 1.1E-73 2.3E-78 466.0 14.2 148 2-149 1-148 (148)
6 PRK08570 rpl19e 50S ribosomal 100.0 1.2E-72 2.7E-77 460.4 15.7 147 1-147 1-147 (150)
7 cd00481 Ribosomal_L19e Ribosom 100.0 1.8E-72 3.9E-77 457.3 15.0 145 4-148 1-145 (145)
8 cd01418 Ribosomal_L19e_A Ribos 100.0 8.1E-71 1.8E-75 447.5 15.5 142 4-145 1-142 (145)
9 COG2147 RPL19A Ribosomal prote 100.0 3.6E-68 7.7E-73 432.7 14.9 149 1-149 1-149 (150)
10 TIGR01764 excise DNA binding d 68.0 3.4 7.3E-05 25.6 1.4 30 23-53 1-30 (49)
11 PF12728 HTH_17: Helix-turn-he 67.1 3.7 8E-05 26.7 1.5 30 23-53 1-30 (51)
12 PRK00247 putative inner membra 64.8 70 0.0015 30.9 10.1 75 92-173 291-369 (429)
13 PF13453 zf-TFIIB: Transcripti 57.3 4.7 0.0001 25.9 0.6 15 20-34 26-40 (41)
14 PF13880 Acetyltransf_13: ESCO 57.1 3.4 7.3E-05 30.4 -0.1 14 16-29 5-18 (70)
15 TIGR02075 pyrH_bact uridylate 42.7 26 0.00057 29.8 3.1 48 5-52 77-125 (233)
16 PRK05920 aromatic acid decarbo 36.6 40 0.00087 29.0 3.3 47 9-58 121-167 (204)
17 PF03428 RP-C: Replication pro 35.0 2.8E+02 0.0061 23.5 9.4 55 90-144 71-138 (177)
18 PF09851 SHOCT: Short C-termin 34.7 47 0.001 20.4 2.5 20 104-123 6-25 (31)
19 PF01479 S4: S4 domain; Inter 34.3 29 0.00064 22.1 1.6 27 31-57 9-35 (48)
20 smart00345 HTH_GNTR helix_turn 32.7 34 0.00074 21.7 1.8 24 33-56 33-56 (60)
21 TIGR01610 phage_O_Nterm phage 32.2 54 0.0012 24.4 3.0 38 16-54 40-81 (95)
22 PF12802 MarR_2: MarR family; 31.9 33 0.00071 22.5 1.6 52 7-60 7-61 (62)
23 PRK06029 3-octaprenyl-4-hydrox 31.5 58 0.0013 27.6 3.4 46 9-57 106-151 (185)
24 cd04254 AAK_UMPK-PyrH-Ec UMP k 31.2 48 0.001 28.1 2.9 44 9-52 81-124 (231)
25 PHA00616 hypothetical protein 27.5 35 0.00076 23.1 1.1 20 129-148 8-27 (44)
26 PF08535 KorB: KorB domain; I 27.4 53 0.0012 24.1 2.2 39 6-51 5-43 (93)
27 COG3592 Uncharacterized conser 27.1 30 0.00066 25.9 0.9 19 11-30 35-53 (74)
28 PRK11548 outer membrane biogen 25.7 49 0.0011 25.6 1.8 26 22-47 36-61 (113)
29 PRK00083 frr ribosome recyclin 23.9 4.1E+02 0.0089 22.5 7.2 84 22-125 69-158 (185)
30 PF15319 RHINO: RAD9, RAD1, HU 22.5 33 0.00072 30.7 0.4 34 83-120 201-234 (236)
31 PF06353 DUF1062: Protein of u 22.4 64 0.0014 26.6 2.0 23 34-56 113-135 (142)
32 smart00099 btg1 tob/btg1 famil 22.1 32 0.0007 27.3 0.2 10 21-30 93-102 (108)
33 PF00392 GntR: Bacterial regul 22.0 71 0.0015 21.6 1.9 27 32-58 36-62 (64)
34 PF00096 zf-C2H2: Zinc finger, 22.0 69 0.0015 17.2 1.5 16 129-144 7-22 (23)
35 TIGR00421 ubiX_pad polyprenyl 22.0 1.1E+02 0.0024 25.6 3.4 47 9-58 103-149 (181)
36 cd04762 HTH_MerR-trunc Helix-T 21.7 76 0.0017 19.1 1.8 17 36-52 12-28 (49)
37 smart00420 HTH_DEOR helix_turn 21.3 79 0.0017 19.4 1.9 25 32-56 26-50 (53)
38 TIGR02988 YaaA_near_RecF S4 do 21.3 69 0.0015 21.5 1.7 20 36-55 22-41 (59)
39 KOG1596 Fibrillarin and relate 20.9 61 0.0013 30.0 1.7 71 6-92 137-219 (317)
40 PF14468 DUF4427: Protein of u 20.4 71 0.0015 26.4 1.8 33 12-53 30-62 (132)
41 cd01145 TroA_c Periplasmic bin 20.2 52 0.0011 27.3 1.0 19 19-37 107-125 (203)
No 1
>KOG1696 consensus 60s ribosomal protein L19 [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=1.3e-83 Score=535.65 Aligned_cols=193 Identities=72% Similarity=1.083 Sum_probs=190.7
Q ss_pred CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269 1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK 80 (196)
Q Consensus 1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk 80 (196)
|++|++|||||||||+||+.+||+||||+++|++|||||+|++||+||+||.+|+++|||+||+++.+++++|||+|+|+
T Consensus 1 Ms~lrlqKRLAssVl~cGKkKvWlDpNE~~eI~~ansRq~irkLikdg~iI~Kp~~vhsr~r~rk~~~akrkgrH~G~GK 80 (193)
T KOG1696|consen 1 MSNLRLQKRLAASVLKCGKKKVWLDPNEISEISGANSRQNIRKLIKDGLIIRKPVTVHSRSRCRKRLEAKRKGRHMGYGK 80 (193)
T ss_pred CchHHHHHHHHHHHHHhcccceeeCccHHHHhcccchHHHHHHHHhCCeEeecchhhhHHHHHHHHHHHHHhccccCccc
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHH
Q 029269 81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFE 160 (196)
Q Consensus 81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQae 160 (196)
|+||+|||||+|++||++||+||++|++|||+|+||+|+||.||+++|||+|+|+.+||||||+.++|++++|+|+||||
T Consensus 81 RkGTanArmP~k~~Wmrr~RvlRrlL~kyR~skKIdkh~YH~lY~k~KGnvFKnK~~LmE~I~K~KAe~~r~K~LadQae 160 (193)
T KOG1696|consen 81 RKGTANARMPSKVLWMRRMRVLRRLLKKYRDSKKIDKHMYHDLYLKVKGNVFKNKRVLMEHIHKSKAEKAREKLLADQAE 160 (193)
T ss_pred ccccccccCchhHHHHHHHHHHHHHHHHhhhcccchHHHHHHHHHHHhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhchHHHHHHHHHHHHHHHHHHHHHhhhhhh
Q 029269 161 AKRAKNKASRERKIARREERLAQVRALLAYTIE 193 (196)
Q Consensus 161 A~r~k~k~~r~rr~~~~~~k~~~~~~~~~~~~~ 193 (196)
|+|.++++++++++|++++++++++.++..++|
T Consensus 161 Arr~k~k~ar~rreer~~~k~~~~~~~~~k~ee 193 (193)
T KOG1696|consen 161 ARRLKNKAARKRREERLAAKPQELIKTLSKEEE 193 (193)
T ss_pred HHHhhhHHHhhhHHHHHhhchhhhhcccccccC
Confidence 999999999999999999999999999987754
No 2
>PTZ00097 60S ribosomal protein L19; Provisional
Probab=100.00 E-value=3.2e-79 Score=508.95 Aligned_cols=174 Identities=72% Similarity=1.159 Sum_probs=171.2
Q ss_pred cchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCcc
Q 029269 3 SLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRK 82 (196)
Q Consensus 3 ~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRk 82 (196)
+|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+
T Consensus 1 ~l~~QKRLAA~vL~cG~~rVWiDP~~~~eI~~A~tR~dIR~LIkdG~I~~kp~kg~SR~R~r~~~~~k~kGR~~G~G~RK 80 (175)
T PTZ00097 1 NLRLQKRLAASVLKCGKNRVWLDPNEASEISLANSRFSIRKLIKDGLIIRKPVAVHSRARARRFHEAKRKGRHTGIGKRR 80 (175)
T ss_pred CchHHHHHHHHHHCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCCCCCChHHHHHHHHHHHhCCCCCCCCCc
Confidence 58999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHHHH
Q 029269 83 GTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFEAK 162 (196)
Q Consensus 83 Gt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQaeA~ 162 (196)
||+|||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++||++.++|+++++.|+||+||+
T Consensus 81 Gtk~AR~p~K~~W~~riR~lRr~Lk~~R~~~kIdk~~Yr~lY~kaKGn~Fknk~~L~~~I~~~kae~~r~k~l~~q~~a~ 160 (175)
T PTZ00097 81 GTREARMPTKVLWMRRQRVLRRLLRKYRAAKKIDRHMYHEFYLKSKGNQFKNKRVLIEAIHKTKNEKVKEKKIQDQLEAR 160 (175)
T ss_pred CcccccCcHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhchHHHHHHHHHH
Q 029269 163 RAKNKASRERKIAR 176 (196)
Q Consensus 163 r~k~k~~r~rr~~~ 176 (196)
+.+++++++++.++
T Consensus 161 r~k~~~~~~~r~~~ 174 (175)
T PTZ00097 161 RAKAKALRNKRKAK 174 (175)
T ss_pred HHHHHHHHHHHhhc
Confidence 99999999987653
No 3
>cd01417 Ribosomal_L19e_E Ribosomal protein L19e, eukaryotic. L19e is found in the large ribosomal subunit of eukaryotes and archaea. L19e is distinct from the ribosomal subunit L19, which is found in prokaryotes. It consists of two small globular domains connected by an extended segment. It is located toward the surface of the large subunit, with one exposed end involved in forming the intersubunit bridge with the small subunit. The other exposed end is involved in forming the translocon binding site, along with L22, L23, L24, L29, and L31e subunits.
Probab=100.00 E-value=6.8e-78 Score=496.96 Aligned_cols=164 Identities=78% Similarity=1.218 Sum_probs=162.0
Q ss_pred chhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCccc
Q 029269 4 LKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRKG 83 (196)
Q Consensus 4 l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRkG 83 (196)
|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+|
T Consensus 1 l~~QKRLAA~vL~cG~~rVW~DP~~~~eI~~A~tR~dIR~LIkdG~I~~kp~kg~SR~R~r~~~~~k~kGR~~G~G~RkG 80 (164)
T cd01417 1 LRLQKRLAASVLKCGKRKVWLDPNEISEISNANSRQSIRKLIKDGLIIKKPVKVHSRSRARKRHEAKRKGRHMGYGKRKG 80 (164)
T ss_pred CcHHHHHHHHHHCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCCCcCCHHHHHHHHHHHHhCCCCCCCCCcC
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHHHHH
Q 029269 84 TREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFEAKR 163 (196)
Q Consensus 84 t~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQaeA~r 163 (196)
|+|||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++||++.++|+++++.|+||+||++
T Consensus 81 t~~AR~p~K~~W~~riR~lRr~Lk~~R~~~kIdk~~Yr~lY~kaKGn~Fknk~~L~~~I~~~kae~~r~k~l~~q~~a~r 160 (164)
T cd01417 81 TANARMPSKVLWMRRQRVLRRLLKKYRESKKIDKHLYHELYLKAKGNVFKNKRVLMEHIHKAKAEKAREKELADQAEARR 160 (164)
T ss_pred cccccCcHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hchH
Q 029269 164 AKNK 167 (196)
Q Consensus 164 ~k~k 167 (196)
++++
T Consensus 161 ~~~~ 164 (164)
T cd01417 161 AKNK 164 (164)
T ss_pred hccC
Confidence 8863
No 4
>PTZ00436 60S ribosomal protein L19-like protein; Provisional
Probab=100.00 E-value=4.1e-76 Score=524.89 Aligned_cols=170 Identities=68% Similarity=1.106 Sum_probs=167.9
Q ss_pred CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269 1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK 80 (196)
Q Consensus 1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk 80 (196)
|++|++||||||+||+||++|||||||+++||++||||+|||+||+||+|+++|++||||||+|++++++++|||+|+||
T Consensus 1 M~dLklQKRLAAsVL~cGk~RVWiDPnel~eIa~AiTReDIRkLIkdGlIikKp~KGhSRgRaRkr~eaKrKGRhrG~Gs 80 (357)
T PTZ00436 1 MVSLKLQARLAADILRCGRHRVWLDPNEASEISNANSRKSVRKLIKDGLIIRKPVKVHSRSRWRHMKEAKSMGRHEGAGR 80 (357)
T ss_pred CcchHHHHHHHHHHhCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCcccCChHHHHHHHHHHHhCcCCCCCC
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHH
Q 029269 81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFE 160 (196)
Q Consensus 81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQae 160 (196)
|+||+|||||+|++||+|||+||++|++|||+|+||+|+||.||+++|||+|+|++||++|||+.++|+.+++.|.||++
T Consensus 81 RKGTk~AR~P~K~~WIrRIRaLRRlLKklRd~gKIDkh~YR~LYrKAKGn~FKNK~~L~e~I~k~KaE~~R~K~L~dQ~e 160 (357)
T PTZ00436 81 REGTREARMPSKELWMRRLRILRRLLRKYREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLA 160 (357)
T ss_pred CcCcccccCcHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCccCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhchHHHH
Q 029269 161 AKRAKNKASR 170 (196)
Q Consensus 161 A~r~k~k~~r 170 (196)
|+|.++++.+
T Consensus 161 ArR~k~~~~r 170 (357)
T PTZ00436 161 AKRLKDEQHR 170 (357)
T ss_pred HHHHHhhhhh
Confidence 9999987654
No 5
>PF01280 Ribosomal_L19e: Ribosomal protein L19e; InterPro: IPR000196 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents structural domain of the ribosomal protein L19 from eukaryotes, as well as L19e from archaea []. L19/L19e is absent in bacteria. L19/L19e is part of the large ribosomal subunit, whose structure has been determined in a number of eukaryotic and archaeal species []. L19/L19e is a multi-helical protein consisting of two different 3-helical domains connected by a long, partly helical linker.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3IZR_T 3O58_S 3O5H_S 3IZS_T 2WWA_J 1S1I_P 2WW9_J 2ZKR_7 4A1A_O 4A1C_O ....
Probab=100.00 E-value=1.1e-73 Score=465.97 Aligned_cols=148 Identities=68% Similarity=1.097 Sum_probs=130.9
Q ss_pred CcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCc
Q 029269 2 VSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKR 81 (196)
Q Consensus 2 ~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkR 81 (196)
|||++||||||+||+||++|||||||+++||++||||+|||+||+||+|+++|++|||||||+++++++++|||+|||+|
T Consensus 1 m~l~~QKRLAa~vL~~G~~rVw~DP~~~~eI~~A~tR~~IR~LIk~G~I~~k~~k~~Sr~R~r~~~~~r~kGr~~G~G~R 80 (148)
T PF01280_consen 1 MDLKLQKRLAASVLGCGKNRVWIDPNELEEIANAITREDIRKLIKDGLIIKKPVKGHSRGRARKRKEARRKGRHRGPGKR 80 (148)
T ss_dssp -STHHHHHHHHHHHTS-GGGEEE-STTHHHHHH--SHHHHHHHHHTTSEEE---S--STHHHHHHHHHHHCTTS-SSTTS
T ss_pred CccHHHHHHHHHHHCCCCCcEEeCHHHHHHHHhhhhHHHHHHHHHCCCeEeCCCCCCchHHHHHHHHHHhhccccccccc
Confidence 59999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHH
Q 029269 82 KGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEK 149 (196)
Q Consensus 82 kGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~ 149 (196)
+||+|||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++|||+.++|+
T Consensus 81 kGt~~AR~~~K~~W~~riR~lRr~Lk~~r~~~kID~~~Yr~lY~kaKGn~Fkn~~~L~~~i~~~k~e~ 148 (148)
T PF01280_consen 81 KGTKNARMPEKELWMRRIRALRRLLKRLRDSGKIDRHMYRSLYRKAKGNVFKNKRHLMEHIHKLKAEK 148 (148)
T ss_dssp -S-HHHHS-HHHHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHHTTS-SSHHHHHHHHHHHHHCH
T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCcccCHHHHHHHHHHhccCC
Confidence 99999999999999999999999999999999999999999999999999999999999999999875
No 6
>PRK08570 rpl19e 50S ribosomal protein L19e; Reviewed
Probab=100.00 E-value=1.2e-72 Score=460.35 Aligned_cols=147 Identities=43% Similarity=0.742 Sum_probs=145.4
Q ss_pred CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269 1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK 80 (196)
Q Consensus 1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk 80 (196)
|++|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+||||
T Consensus 1 M~~l~~qkRLAA~iL~~G~~rVw~DP~~~~eI~~A~tR~dIR~LI~~G~I~~kp~kg~Sr~R~r~~~~~r~kGr~~G~G~ 80 (150)
T PRK08570 1 MMDLSAQKRLAADILGVGVSRVWIDPEALEDVAEAITREDIRELIKEGVIKAKPKKGISRGRARERHEKRKKGRRRGPGS 80 (150)
T ss_pred CcchHHHHHHHHHHHCCCccceeeCHHHHHHHHHHhhHHHHHHHHHCCCeeecCccCCChHHHHHHHHHHHhCCCCCCCC
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhh
Q 029269 81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKA 147 (196)
Q Consensus 81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~ 147 (196)
|+||+|||+|+|+.||.+||+||++|++|||+|+||+|+||.||++||||+|+|++||++||+....
T Consensus 81 RkGt~~AR~p~K~~W~~riR~lRr~Lk~lR~~~kId~~~Yr~lY~kaKGn~Fkn~~~L~~~i~~~~~ 147 (150)
T PRK08570 81 RKGKKGARTPKKERWINRIRALRRYLRELRDEGKIDRKTYRKLYRKAKGGEFRSVSHLKTYIEEHGL 147 (150)
T ss_pred CCCcccCCCCHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCccCCHHHHHHHHHHhcc
Confidence 9999999999999999999999999999999999999999999999999999999999999998764
No 7
>cd00481 Ribosomal_L19e Ribosomal protein L19e. L19e is found in the large ribosomal subunit of eukaryotes and archaea. L19e is distinct from the ribosomal subunit L19, which is found in prokaryotes. It consists of two small globular domains connected by an extended segment. It is located toward the surface of the large subunit, with one exposed end involved in forming the intersubunit bridge with the small subunit. The other exposed end is involved in forming the translocon binding site, along with L22, L23, L24, L29, and L31e subunits.
Probab=100.00 E-value=1.8e-72 Score=457.27 Aligned_cols=145 Identities=64% Similarity=1.101 Sum_probs=142.9
Q ss_pred chhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCccc
Q 029269 4 LKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRKG 83 (196)
Q Consensus 4 l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRkG 83 (196)
|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+|
T Consensus 1 l~~QkRLAA~vL~~G~~rVW~DP~~~~eI~~A~tR~dIR~LIkdG~I~~kp~kg~Sr~R~r~~~~~r~kGr~~G~G~RkG 80 (145)
T cd00481 1 LRLQKRLAADILKCGKNRVWIDPNELEEIANANTREDIRKLIKDGLIIKKPKKGHSRGRARKRHEARRKGRHRGPGSRKG 80 (145)
T ss_pred CcHHHHHHHHHHCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCCCCCChHHHHHHHHHHHhCcCCCCCCccC
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhH
Q 029269 84 TREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAE 148 (196)
Q Consensus 84 t~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e 148 (196)
|++||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++||++.++|
T Consensus 81 t~~AR~p~K~~W~~riR~lRr~Lk~~R~~~kIdk~~Yr~lY~kaKG~~Fknk~~L~~~i~~~~~~ 145 (145)
T cd00481 81 TKGARMPSKELWIRRIRALRRLLKKLRDSGKIDKHTYRELYLKAKGNVFKNKRHLKEYIHKAKAE 145 (145)
T ss_pred ccccCCcHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCCHHHHHHHHHHhccC
Confidence 99999999999999999999999999999999999999999999999999999999999998764
No 8
>cd01418 Ribosomal_L19e_A Ribosomal protein L19e, archaeal. L19e is found in the large ribosomal subunit of eukaryotes and archaea. L19e is distinct from the ribosomal subunit L19, which is found in prokaryotes. It consists of two small globular domains connected by an extended segment. It is located toward the surface of the large subunit, with one exposed end involved in forming the intersubunit bridge with the small subunit. The other exposed end is involved in forming the translocon binding site, along with L22, L23, L24, L29, and L31e subunits.
Probab=100.00 E-value=8.1e-71 Score=447.48 Aligned_cols=142 Identities=39% Similarity=0.722 Sum_probs=140.5
Q ss_pred chhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCccc
Q 029269 4 LKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRKG 83 (196)
Q Consensus 4 l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRkG 83 (196)
|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+|
T Consensus 1 l~~QkRLAA~iL~~G~~rVw~DP~~~~eI~~A~tR~dIR~LI~~G~I~~kp~kg~Sr~R~r~~~~~r~kGr~~G~G~RkG 80 (145)
T cd01418 1 LSSQRRLAADILGVGINRVWIDPERLEEVAEAITRDDIRALIKEGVIKAKPKKGISRGRLKERHEKRKKGRRRGPGSRKG 80 (145)
T ss_pred CcHHHHHHHHHHCCCCCeeeeChHHHHHHHHhhhHHHHHHHHHCCCeeecCCCCCCHHHHHHHHHHHHhCcCCCCCCCCC
Confidence 67999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHh
Q 029269 84 TREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKS 145 (196)
Q Consensus 84 t~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~ 145 (196)
|+|||+|+|+.||.+||+||++|++|||+|+||+|+||.||++||||+|+|++||++||..+
T Consensus 81 t~~AR~p~K~~W~~riR~lRr~Lk~~R~~~kId~~~Yr~lY~kaKGn~Fkn~~~L~~~I~~~ 142 (145)
T cd01418 81 KKGARTPKKERWIKTIRALRRYLKELRDKGKIDKKTYRKLYRKAKGGSFRSLSHLKSYLKQH 142 (145)
T ss_pred ccccCCCHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCCHHHHHHHHHHh
Confidence 99999999999999999999999999999999999999999999999999999999999876
No 9
>COG2147 RPL19A Ribosomal protein L19E [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=3.6e-68 Score=432.68 Aligned_cols=149 Identities=49% Similarity=0.808 Sum_probs=146.7
Q ss_pred CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269 1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK 80 (196)
Q Consensus 1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk 80 (196)
|+||++|+||||+||+||++|||||||++++|++|+||+|||.||+||+|+++|++|+|+||+++++++++||||+||||
T Consensus 1 M~nl~~qkRLAA~il~vG~~Rvwidp~~~eei~~A~TR~dIr~LIk~g~I~~k~~kg~SrgR~rkr~~qkkkgr~rG~Gs 80 (150)
T COG2147 1 MSNLRTQKRLAADILGVGENRVWIDPNEIEEIASAITREDIRALIKDGVIKAKPKKGISRGRARKRHAQKKKGRRRGPGS 80 (150)
T ss_pred CchHHHHHHHHHHHHccCcceeeeChHHHHHHHHhhhHHHHHHHHHCCCeeeccccccchHHHHHHHHHHhcccCCCCCc
Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHH
Q 029269 81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEK 149 (196)
Q Consensus 81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~ 149 (196)
|+||++||||+|+.||.+||+||++|++|||+|+||+|+||.||+++|||.|++++||.+||.+++.++
T Consensus 81 RKG~k~AR~p~K~~Wi~~IRalR~~Lr~lrd~gkIdk~~YR~lY~~aKGg~fk~~~~L~~~i~~~~~~k 149 (150)
T COG2147 81 RKGTKGARMPSKERWIKRIRALRRELRKLRDDGKIDKHTYRKLYRMAKGGAFKSKSHLKSYIEEAKLLK 149 (150)
T ss_pred cccccccCCCHHHHHHHHHHHHHHHHHHHHHcCCcCHHHHHHHHHHHcCCccccHHHHHHHHHHhcccc
Confidence 999999999999999999999999999999999999999999999999999999999999999987653
No 10
>TIGR01764 excise DNA binding domain, excisionase family. An excisionase, or Xis protein, is a small protein that binds and promotes excisive recombination; it is not enzymatically active. This model represents a number of putative excisionases and related proteins from temperate phage, plasmids, and transposons, as well as DNA binding domains of other proteins, such as a DNA modification methylase. This model identifies mostly small proteins and N-terminal regions of large proteins, but some proteins appear to have two copies. This domain appears similar, in both sequence and predicted secondary structure (PSIPRED) to the MerR family of transcriptional regulators (pfam00376).
Probab=68.04 E-value=3.4 Score=25.63 Aligned_cols=30 Identities=20% Similarity=0.297 Sum_probs=24.7
Q ss_pred ecCccchhHHHhhhhhHHHHhhhhcCceeec
Q 029269 23 WLDPNEVNEISMANSRQNIRKLVKDGFIIRK 53 (196)
Q Consensus 23 WlDPne~~eIa~A~sR~dIRkLIkdG~I~~k 53 (196)
||++.|+.++-. +|+..|+.|+++|.|-.-
T Consensus 1 ~lt~~e~a~~lg-is~~ti~~~~~~g~i~~~ 30 (49)
T TIGR01764 1 YLTVEEAAEYLG-VSKDTVYRLIHEGELPAY 30 (49)
T ss_pred CCCHHHHHHHHC-CCHHHHHHHHHcCCCCeE
Confidence 677788888775 789999999999987653
No 11
>PF12728 HTH_17: Helix-turn-helix domain
Probab=67.13 E-value=3.7 Score=26.69 Aligned_cols=30 Identities=23% Similarity=0.297 Sum_probs=25.2
Q ss_pred ecCccchhHHHhhhhhHHHHhhhhcCceeec
Q 029269 23 WLDPNEVNEISMANSRQNIRKLVKDGFIIRK 53 (196)
Q Consensus 23 WlDPne~~eIa~A~sR~dIRkLIkdG~I~~k 53 (196)
||++.|+-++-. +|+..|+.+++.|.|..-
T Consensus 1 ~lt~~e~a~~l~-is~~tv~~~~~~g~i~~~ 30 (51)
T PF12728_consen 1 YLTVKEAAELLG-ISRSTVYRWIRQGKIPPF 30 (51)
T ss_pred CCCHHHHHHHHC-cCHHHHHHHHHcCCCCeE
Confidence 688889988888 799999999999977433
No 12
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=64.75 E-value=70 Score=30.85 Aligned_cols=75 Identities=16% Similarity=0.279 Sum_probs=40.2
Q ss_pred hHHHHHHHHHHHHHHHHHHhcC---CCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHH-HHHHHhchH
Q 029269 92 KVLWMRRMRVLRRLLRKYREAK---KIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQ-FEAKRAKNK 167 (196)
Q Consensus 92 K~~Wm~riR~lRr~Lr~~Re~~---kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQ-aeA~r~k~k 167 (196)
+..|..+.+..+..++..|... .|.+..+..|..- +...=.+.=...+++++++|.++.+ -||.|.-++
T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~-------~~~~~~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~ 363 (429)
T PRK00247 291 RAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAE-------NAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINR 363 (429)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccccCCcccHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666665555554 7888877777432 2222233334445666666666543 344444444
Q ss_pred HHHHHH
Q 029269 168 ASRERK 173 (196)
Q Consensus 168 ~~r~rr 173 (196)
++++.+
T Consensus 364 ~~~~~~ 369 (429)
T PRK00247 364 EARQER 369 (429)
T ss_pred HhhhhH
Confidence 333333
No 13
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=57.28 E-value=4.7 Score=25.92 Aligned_cols=15 Identities=27% Similarity=0.809 Sum_probs=12.4
Q ss_pred CceecCccchhHHHh
Q 029269 20 GKVWLDPNEVNEISM 34 (196)
Q Consensus 20 ~rVWlDPne~~eIa~ 34 (196)
.-||||++|++.|.+
T Consensus 26 ~G~W~d~~el~~~~e 40 (41)
T PF13453_consen 26 GGIWFDAGELEKLLE 40 (41)
T ss_pred CeEEccHHHHHHHHh
Confidence 469999999998753
No 14
>PF13880 Acetyltransf_13: ESCO1/2 acetyl-transferase
Probab=57.06 E-value=3.4 Score=30.40 Aligned_cols=14 Identities=29% Similarity=1.037 Sum_probs=12.1
Q ss_pred ccCCCceecCccch
Q 029269 16 KCGRGKVWLDPNEV 29 (196)
Q Consensus 16 ~cGk~rVWlDPne~ 29 (196)
-||++|||..|..-
T Consensus 5 ~~GI~RIWV~~~~R 18 (70)
T PF13880_consen 5 VCGISRIWVSPSHR 18 (70)
T ss_pred EEEeEEEEeChhhh
Confidence 59999999999863
No 15
>TIGR02075 pyrH_bact uridylate kinase. This protein, also called UMP kinase, converts UMP to UDP by adding a phosphate from ATP. It is the first step in pyrimidine biosynthesis. GTP is an allosteric activator. In a large fraction of all bacterial genomes, the gene tends to be located immediately downstream of elongation factor Ts and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function and found in the archaea and in spirochetes, is described by a separate model, TIGR02076.
Probab=42.73 E-value=26 Score=29.80 Aligned_cols=48 Identities=15% Similarity=0.126 Sum_probs=38.0
Q ss_pred hhhHHH-HHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceee
Q 029269 5 KLQKRL-SASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIR 52 (196)
Q Consensus 5 ~~QKRL-AA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~ 52 (196)
.++-+| +..+...|.+-++++|.....+..-.+.+.+..|++.|.|..
T Consensus 77 ~l~~~l~~~~L~~~Gi~a~~l~~~~~~~~~~~~~~~~i~~ll~~g~VpV 125 (233)
T TIGR02075 77 VINGLALRDALEKLGVKTRVLSAISMPQICESYIRRKAIKHLEKGKVVI 125 (233)
T ss_pred HHHHHHHHHHHHhCCCCcEEeccccCCCCccccCHHHHHHHHHCCCEEE
Confidence 355665 777778999999999998765555567899999999999854
No 16
>PRK05920 aromatic acid decarboxylase; Validated
Probab=36.63 E-value=40 Score=29.04 Aligned_cols=47 Identities=30% Similarity=0.259 Sum_probs=36.5
Q ss_pred HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCC
Q 029269 9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIH 58 (196)
Q Consensus 9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~ 58 (196)
+.|...|+.|.. |.|=|.+... ++++++++..|-..|.++.-|..|.
T Consensus 121 ~~a~~~L~~~~p-vvi~P~~m~~--~~~~~~nl~~L~~~G~~ii~P~~g~ 167 (204)
T PRK05920 121 RAADVVLKERRK-LILVPRETPL--SLIHLENMLKLAEAGAIILPAIPAF 167 (204)
T ss_pred HHHHHHHhcCCC-EEEEeCCCCC--CHHHHHHHHHHHHCCCEEeCCcccc
Confidence 555577887774 5555765444 8889999999999999999999874
No 17
>PF03428 RP-C: Replication protein C N-terminal domain; InterPro: IPR005090 Proteins in this group have homology with the RepC protein of Agrobacterium Ri and Ti plasmids []. They may be involved in plasmid replication and stabilisation functions.
Probab=34.97 E-value=2.8e+02 Score=23.48 Aligned_cols=55 Identities=25% Similarity=0.329 Sum_probs=39.8
Q ss_pred chhHHHHHHH-----HHHHHHHHHHHhcCCC---ChhhhHHHHhhhcCCc----cc-cHHHHHHHHHH
Q 029269 90 PTKVLWMRRM-----RVLRRLLRKYREAKKI---DRHMYHDMYMKVKGNV----FK-NKRVLMESIHK 144 (196)
Q Consensus 90 p~K~~Wm~ri-----R~lRr~Lr~~Re~~kI---D~~~Yr~lY~kaKGn~----Fk-nk~~L~e~I~k 144 (196)
|+...=..++ +.+|+.|..|-|.|.| |+..++-+..+-.+|. |= |.+-|..-..+
T Consensus 71 pSN~~La~r~~G~s~~tlrR~l~~LveaGLI~rrDS~NgkRy~~R~~~G~I~~A~GfdLsPL~~R~~E 138 (177)
T PF03428_consen 71 PSNAQLAERLNGMSERTLRRHLARLVEAGLIVRRDSPNGKRYARRDRGGRIVEAFGFDLSPLIARAEE 138 (177)
T ss_pred cCHHHHHHHHcCCCHHHHHHHHHHHHHCCCeeeccCCCCCccCccCCCCCEEeEeCcCHHHHHHHHHH
Confidence 4444445556 8999999999999999 5567888777766654 43 67777766555
No 18
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=34.70 E-value=47 Score=20.37 Aligned_cols=20 Identities=15% Similarity=0.298 Sum_probs=17.1
Q ss_pred HHHHHHHhcCCCChhhhHHH
Q 029269 104 RLLRKYREAKKIDRHMYHDM 123 (196)
Q Consensus 104 r~Lr~~Re~~kID~~~Yr~l 123 (196)
..|+.+.+.|.|+...|...
T Consensus 6 ~~L~~l~~~G~IseeEy~~~ 25 (31)
T PF09851_consen 6 EKLKELYDKGEISEEEYEQK 25 (31)
T ss_pred HHHHHHHHcCCCCHHHHHHH
Confidence 45888999999999999764
No 19
>PF01479 S4: S4 domain; InterPro: IPR002942 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yeast protein containing a pseudouridine synthetase and a deaminase domain, bacterial tyrosyl-tRNA synthetases, and a number of uncharacterised, small proteins that may be involved in translation regulation []. The S4 domain probably mediates binding to RNA.; GO: 0003723 RNA binding; PDB: 3BBU_A 1DM9_B 2K6P_A 3U5G_E 3U5C_E 3IZB_D 2XZM_D 2XZN_D 3O30_E 3O2Z_E ....
Probab=34.31 E-value=29 Score=22.12 Aligned_cols=27 Identities=26% Similarity=0.477 Sum_probs=20.7
Q ss_pred HHHhhhhhHHHHhhhhcCceeecCCCC
Q 029269 31 EISMANSRQNIRKLVKDGFIIRKPTKI 57 (196)
Q Consensus 31 eIa~A~sR~dIRkLIkdG~I~~kp~~~ 57 (196)
+...+.||.++++||+.|.|..--...
T Consensus 9 ~~~~~~sr~~a~~~I~~g~V~VNg~~v 35 (48)
T PF01479_consen 9 RLGLASSRSEARRLIKQGRVKVNGKVV 35 (48)
T ss_dssp HTTSSSSHHHHHHHHHTTTEEETTEEE
T ss_pred HcCCcCCHHHHHHhcCCCEEEECCEEE
Confidence 445577999999999999987654433
No 20
>smart00345 HTH_GNTR helix_turn_helix gluconate operon transcriptional repressor.
Probab=32.69 E-value=34 Score=21.71 Aligned_cols=24 Identities=25% Similarity=0.559 Sum_probs=17.6
Q ss_pred HhhhhhHHHHhhhhcCceeecCCC
Q 029269 33 SMANSRQNIRKLVKDGFIIRKPTK 56 (196)
Q Consensus 33 a~A~sR~dIRkLIkdG~I~~kp~~ 56 (196)
+.+.-+..+..|.++|+|...|-.
T Consensus 33 s~~tv~~~l~~L~~~g~i~~~~~~ 56 (60)
T smart00345 33 SRTTVREALSRLEAEGLVQRRPGS 56 (60)
T ss_pred CHHHHHHHHHHHHHCCCEEEecCC
Confidence 334557778899999999876644
No 21
>TIGR01610 phage_O_Nterm phage replication protein O, N-terminal domain. This model represents the N-terminal region of the phage lambda replication protein O and homologous regions of other phage proteins.
Probab=32.22 E-value=54 Score=24.41 Aligned_cols=38 Identities=18% Similarity=0.305 Sum_probs=25.4
Q ss_pred ccCCCceecCccchhHHHhhhhhHH----HHhhhhcCceeecC
Q 029269 16 KCGRGKVWLDPNEVNEISMANSRQN----IRKLVKDGFIIRKP 54 (196)
Q Consensus 16 ~cGk~rVWlDPne~~eIa~A~sR~d----IRkLIkdG~I~~kp 54 (196)
|.++..+++.+.|+.++... ||+. |..|.+.|+|....
T Consensus 40 G~~~~~~~is~~eLa~~~g~-sr~tVsr~L~~Le~~GlI~r~~ 81 (95)
T TIGR01610 40 GWNKKQDRVTATVIAELTGL-SRTHVSDAIKSLARRRIIFRQG 81 (95)
T ss_pred CccccCCccCHHHHHHHHCc-CHHHHHHHHHHHHHCCCeeeec
Confidence 34456677777766666544 4444 55799999998654
No 22
>PF12802 MarR_2: MarR family; PDB: 3ECO_B 2QWW_B 3KP6_B 3KP4_B 3KP2_A 3KP5_A 3KP3_B 3KP7_A 3NQO_B 3K0L_B ....
Probab=31.87 E-value=33 Score=22.47 Aligned_cols=52 Identities=21% Similarity=0.242 Sum_probs=30.5
Q ss_pred hHHHHHHHhccCCCceecCccchhHH---HhhhhhHHHHhhhhcCceeecCCCCCCh
Q 029269 7 QKRLSASVLKCGRGKVWLDPNEVNEI---SMANSRQNIRKLVKDGFIIRKPTKIHSR 60 (196)
Q Consensus 7 QKRLAA~vL~cGk~rVWlDPne~~eI---a~A~sR~dIRkLIkdG~I~~kp~~~~SR 60 (196)
|-++-..|..+|.. ++-+.++.+. +-...-.-|..|++.|+|.+.+.....|
T Consensus 7 q~~vL~~l~~~~~~--~~t~~~la~~l~~~~~~vs~~v~~L~~~Glv~r~~~~~D~R 61 (62)
T PF12802_consen 7 QFRVLMALARHPGE--ELTQSELAERLGISKSTVSRIVKRLEKKGLVERERDPGDRR 61 (62)
T ss_dssp HHHHHHHHHHSTTS--GEEHHHHHHHHTS-HHHHHHHHHHHHHTTSEEEEE-SSSTT
T ss_pred HHHHHHHHHHCCCC--CcCHHHHHHHHCcCHHHHHHHHHHHHHCCCEEEeCCCCCCC
Confidence 44445555555554 3333333332 2233345688999999999998877665
No 23
>PRK06029 3-octaprenyl-4-hydroxybenzoate carboxy-lyase; Provisional
Probab=31.52 E-value=58 Score=27.57 Aligned_cols=46 Identities=26% Similarity=0.255 Sum_probs=35.8
Q ss_pred HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCC
Q 029269 9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKI 57 (196)
Q Consensus 9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~ 57 (196)
+.|...|.+++ .|.|=|. +--.+.++.+++..|-+.|.++..|..|
T Consensus 106 ~~a~~~L~~~~-pvii~P~--~M~~~p~~~~Nl~~L~~~G~~vi~P~~g 151 (185)
T PRK06029 106 RAADVMLKERR-RLVLCVR--ETPLHLGHLRNMTKLAEMGAIIMPPVPA 151 (185)
T ss_pred HHHHHHHhcCC-CEEEEec--cccCCHHHHHHHHHHHHCcCEEECCCcc
Confidence 44556778765 4555574 4566889999999999999999999876
No 24
>cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be
Probab=31.15 E-value=48 Score=28.15 Aligned_cols=44 Identities=9% Similarity=0.130 Sum_probs=35.9
Q ss_pred HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceee
Q 029269 9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIR 52 (196)
Q Consensus 9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~ 52 (196)
-|++.+...|.+-+||+|-+.+.+.....-+.+..+++.|.|..
T Consensus 81 ll~~~L~~~Gv~a~~l~~~~~~~~~~~~~~~~l~~~l~~g~ipV 124 (231)
T cd04254 81 ALQDALESLGVKTRVMSAIPMQGVAEPYIRRRAIRHLEKGRVVI 124 (231)
T ss_pred HHHHHHHHcCCCeEEEcHHHhhhhhcccCHHHHHHHHHCCCEEE
Confidence 35667788999999999998876655566799999999998744
No 25
>PHA00616 hypothetical protein
Probab=27.51 E-value=35 Score=23.12 Aligned_cols=20 Identities=20% Similarity=0.571 Sum_probs=16.6
Q ss_pred CCccccHHHHHHHHHHhhhH
Q 029269 129 GNVFKNKRVLMESIHKSKAE 148 (196)
Q Consensus 129 Gn~Fknk~~L~e~I~k~k~e 148 (196)
|..|.++++|..|+.+.-.+
T Consensus 8 G~~F~~~s~l~~H~r~~hg~ 27 (44)
T PHA00616 8 GGIFRKKKEVIEHLLSVHKQ 27 (44)
T ss_pred hHHHhhHHHHHHHHHHhcCC
Confidence 78999999999999765443
No 26
>PF08535 KorB: KorB domain; InterPro: IPR013741 This entry contains several KorB transcriptional repressor proteins. The korB gene is a major regulatory element in the replication and maintenance of broad host-range plasmid RK2. It negatively controls the replication gene trfA, the host-lethal determinants kilA and kilB, and the korA-korB operon []. This domain includes the DNA-binding HTH motif []. ; PDB: 1R71_C.
Probab=27.39 E-value=53 Score=24.11 Aligned_cols=39 Identities=23% Similarity=0.600 Sum_probs=23.7
Q ss_pred hhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCcee
Q 029269 6 LQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFII 51 (196)
Q Consensus 6 ~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~ 51 (196)
+|.-+|..| |+++-|+ .+-++-+.-=++|+.|+.+|.|.
T Consensus 5 tq~eIA~~l---Gks~s~V----s~~l~Ll~lP~~i~~~v~~g~~~ 43 (93)
T PF08535_consen 5 TQEEIAKRL---GKSRSWV----SNHLALLDLPEEIKELVRSGRIS 43 (93)
T ss_dssp -HHHHHHHT---T--HHHH----HHHHGGGS--HHHHHHHHTTS--
T ss_pred CHHHHHHHH---CCCHHHH----HHHHHHHcCCHHHHHHHHcCCCc
Confidence 467777766 9998887 33444555667899999999774
No 27
>COG3592 Uncharacterized conserved protein [Function unknown]
Probab=27.11 E-value=30 Score=25.93 Aligned_cols=19 Identities=32% Similarity=0.789 Sum_probs=14.6
Q ss_pred HHHHhccCCCceecCccchh
Q 029269 11 SASVLKCGRGKVWLDPNEVN 30 (196)
Q Consensus 11 AA~vL~cGk~rVWlDPne~~ 30 (196)
++.|+++|..= ||+|+..+
T Consensus 35 n~~vF~~~rkP-WI~Pd~~~ 53 (74)
T COG3592 35 NPKVFNLGRKP-WIMPDAVD 53 (74)
T ss_pred CHhhcccCCCC-ccCCCCCC
Confidence 56788888765 99998754
No 28
>PRK11548 outer membrane biogenesis protein BamE; Provisional
Probab=25.69 E-value=49 Score=25.58 Aligned_cols=26 Identities=23% Similarity=0.440 Sum_probs=22.9
Q ss_pred eecCccchhHHHhhhhhHHHHhhhhc
Q 029269 22 VWLDPNEVNEISMANSRQNIRKLVKD 47 (196)
Q Consensus 22 VWlDPne~~eIa~A~sR~dIRkLIkd 47 (196)
-++||+.+++|.--.|+++|+.|+=.
T Consensus 36 ~~~~~~~l~~l~~GmTk~qV~~lLGt 61 (113)
T PRK11548 36 NYLTPNDVAKIHVGMTQQQVAYTLGT 61 (113)
T ss_pred ccCCHHHHHHhcCCCCHHHHHHHcCC
Confidence 37899999999999999999998743
No 29
>PRK00083 frr ribosome recycling factor; Reviewed
Probab=23.86 E-value=4.1e+02 Score=22.50 Aligned_cols=84 Identities=23% Similarity=0.300 Sum_probs=55.1
Q ss_pred eecCccchhHHHhhhhhHHHH-hhhhcCceeecCCCCCChhHH-HHHHHHHhcCcCCCCCCcccccccCCchhHHHHHHH
Q 029269 22 VWLDPNEVNEISMANSRQNIR-KLVKDGFIIRKPTKIHSRSRA-RRMKEAKRKGRHSGYGKRKGTREARLPTKVLWMRRM 99 (196)
Q Consensus 22 VWlDPne~~eIa~A~sR~dIR-kLIkdG~I~~kp~~~~SR~R~-r~~~~~k~kGR~~G~GkRkGt~~AR~p~K~~Wm~ri 99 (196)
+| ||+-+..|..|+.-.++- .=+.||-.++-|..--+.-|- ---+.++..+ +.+-..|
T Consensus 69 p~-D~~~i~~I~kAI~~s~lgl~P~~dg~~Iri~iP~lT~E~R~elvK~~k~~~-------------------E~aKv~i 128 (185)
T PRK00083 69 PW-DKSMLKAIEKAIRASDLGLNPSNDGTVIRLPIPPLTEERRKELVKQVKKEA-------------------EEAKVAI 128 (185)
T ss_pred eC-CHhHHHHHHHHHHHCCCCCCcccCCCEEEecCCCCCHHHHHHHHHHHHHHH-------------------HHHHHHH
Confidence 45 999999999999876653 224588888888766665443 2233333322 2344678
Q ss_pred HHHHHH----HHHHHhcCCCChhhhHHHHh
Q 029269 100 RVLRRL----LRKYREAKKIDRHMYHDMYM 125 (196)
Q Consensus 100 R~lRr~----Lr~~Re~~kID~~~Yr~lY~ 125 (196)
|.+|+- |+.+...|.|.....+.+..
T Consensus 129 RniRr~~~~~iKk~~k~~~iseD~~k~~e~ 158 (185)
T PRK00083 129 RNIRRDANDKLKKLEKDKEISEDELKRAED 158 (185)
T ss_pred HHHHHHHHHHHHHHhhcCCCChHHHHHHHH
Confidence 888886 66666667788887776654
No 30
>PF15319 RHINO: RAD9, RAD1, HUS1-interacting nuclear orphan protein
Probab=22.54 E-value=33 Score=30.75 Aligned_cols=34 Identities=41% Similarity=0.641 Sum_probs=26.0
Q ss_pred cccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhh
Q 029269 83 GTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMY 120 (196)
Q Consensus 83 Gt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Y 120 (196)
.|-.--..-|+.|-+| +.||..|||.|++++.+|
T Consensus 201 DTPE~~YGiKvTWRRR----~hL~~yLrerGkL~~sq~ 234 (236)
T PF15319_consen 201 DTPEEKYGIKVTWRRR----RHLLAYLRERGKLSRSQF 234 (236)
T ss_pred cCCcccccceeeeecc----HHHHHHHHHhCccchhhc
Confidence 3444455669999666 678999999999998765
No 31
>PF06353 DUF1062: Protein of unknown function (DUF1062); InterPro: IPR009412 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=22.39 E-value=64 Score=26.57 Aligned_cols=23 Identities=30% Similarity=0.673 Sum_probs=20.7
Q ss_pred hhhhhHHHHhhhhcCceeecCCC
Q 029269 34 MANSRQNIRKLVKDGFIIRKPTK 56 (196)
Q Consensus 34 ~A~sR~dIRkLIkdG~I~~kp~~ 56 (196)
.-+||.+|..|+++|.|...|.+
T Consensus 113 L~lSrs~l~~l~~~G~I~~~~~~ 135 (142)
T PF06353_consen 113 LGLSRSRLKRLIEQGLIRSDPDK 135 (142)
T ss_pred hCcCHHHHHHHHHCCCEEecCcc
Confidence 45799999999999999998876
No 32
>smart00099 btg1 tob/btg1 family. The tob/btg1 is a family of proteins that inhibit cell proliferation.
Probab=22.13 E-value=32 Score=27.28 Aligned_cols=10 Identities=50% Similarity=1.165 Sum_probs=8.2
Q ss_pred ceecCccchh
Q 029269 21 KVWLDPNEVN 30 (196)
Q Consensus 21 rVWlDPne~~ 30 (196)
.||+||.|++
T Consensus 93 tiwvDP~eVs 102 (108)
T smart00099 93 TLWVDPFEVS 102 (108)
T ss_pred EEEECCCEEE
Confidence 5899999864
No 33
>PF00392 GntR: Bacterial regulatory proteins, gntR family; InterPro: IPR000524 Many bacterial transcription regulation proteins bind DNA through a helix-turn-helix (HTH) motif, which can be classified into subfamilies on the basis of sequence similarities. The HTH GntR family has many members distributed among diverse bacterial groups that regulate various biological processes. It was named GntR after the Bacillus subtilis repressor of the gluconate operon []. Family members include GntR, HutC, KorA, NtaR, FadR, ExuR, FarR, DgoR and PhnF. The crystal structure of the FadR protein has been determined []. In general, these proteins contain a DNA-binding HTH domain at the N terminus, and an effector-binding or oligomerisation domain at the C terminus (IPR011711 from INTERPRO). The DNA-binding domain is well conserved in structure for the whole of the GntR family, consisting of a 3-helical bundle core with a small beta-sheet (wing); the GntR winged helix structure is similar to that found in several other transcriptional regulator families. The regions outside the DNA-binding domain are more variable and are consequently used to define GntR subfamilies []. This entry represents the N-terminal DNA-binding domain of the GntR family.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1HW1_B 1H9T_A 1HW2_A 1H9G_A 1E2X_A 3IHU_A 3C7J_A 2RA5_A 3BY6_C 3IC7_A ....
Probab=22.00 E-value=71 Score=21.61 Aligned_cols=27 Identities=30% Similarity=0.570 Sum_probs=20.9
Q ss_pred HHhhhhhHHHHhhhhcCceeecCCCCC
Q 029269 32 ISMANSRQNIRKLVKDGFIIRKPTKIH 58 (196)
Q Consensus 32 Ia~A~sR~dIRkLIkdG~I~~kp~~~~ 58 (196)
|+-..-|+-+..|..+|+|...|-.|.
T Consensus 36 vsr~tvr~al~~L~~~g~i~~~~~~G~ 62 (64)
T PF00392_consen 36 VSRTTVREALRRLEAEGLIERRPGRGT 62 (64)
T ss_dssp S-HHHHHHHHHHHHHTTSEEEETTTEE
T ss_pred cCCcHHHHHHHHHHHCCcEEEECCceE
Confidence 445566788889999999999987763
No 34
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=21.97 E-value=69 Score=17.25 Aligned_cols=16 Identities=25% Similarity=0.497 Sum_probs=13.2
Q ss_pred CCccccHHHHHHHHHH
Q 029269 129 GNVFKNKRVLMESIHK 144 (196)
Q Consensus 129 Gn~Fknk~~L~e~I~k 144 (196)
|-.|.+...|..||..
T Consensus 7 ~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 7 GKSFSSKSNLKRHMRR 22 (23)
T ss_dssp TEEESSHHHHHHHHHH
T ss_pred CCccCCHHHHHHHHhH
Confidence 5679999999999863
No 35
>TIGR00421 ubiX_pad polyprenyl P-hydroxybenzoate and phenylacrylic acid decarboxylases. In E.coli, the protein UbiX (3-octaprenyl-4-hydroxybenzoate carboxy-lyase) has been shown to be involved in the third step of ubiquinone biosynthesis. It catalyzes the reaction [3-octaprenyl-4-hydroxybenzoate = 2-octaprenylphenol + CO2]. The knockout of the homologous protein in yeast confers sensitivity to phenylacrylic acid. Members are not restricted to ubiquinone-synthesizing species. This family represents a distinct clade within the flavoprotein family of Pfam model pfam02441.
Probab=21.96 E-value=1.1e+02 Score=25.57 Aligned_cols=47 Identities=30% Similarity=0.295 Sum_probs=35.3
Q ss_pred HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCC
Q 029269 9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIH 58 (196)
Q Consensus 9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~ 58 (196)
+.|...|+++. .|.|=|.+. -.++.+.+++..|-+.|.++.-|..|.
T Consensus 103 ~~a~~~L~~~~-pv~i~P~~m--~~~~~~~~Nl~~L~~~G~~ii~P~~g~ 149 (181)
T TIGR00421 103 RAADVCLKERR-KLVLVPRET--PLNSIHLENMLRLSRMGAIILPPMPAF 149 (181)
T ss_pred HHHHHHHhcCC-CEEEEeCCC--cCCHHHHHHHHHHHHCCCEEECCCCcc
Confidence 44555788776 455557544 458888999999999999999998763
No 36
>cd04762 HTH_MerR-trunc Helix-Turn-Helix DNA binding domain of truncated MerR-like proteins. Proteins in this family mostly have a truncated helix-turn-helix (HTH) MerR-like domain. They lack a portion of the C-terminal region, called Wing 2 and the long dimerization helix that is typically present in MerR-like proteins. These truncated domains are found in response regulator receiver (REC) domain proteins (i.e., CheY), cytosine-C5 specific DNA methylases, IS607 transposase-like proteins, and RacA, a bacterial protein that anchors chromosomes to cell poles.
Probab=21.68 E-value=76 Score=19.07 Aligned_cols=17 Identities=29% Similarity=0.432 Sum_probs=14.4
Q ss_pred hhhHHHHhhhhcCceee
Q 029269 36 NSRQNIRKLVKDGFIIR 52 (196)
Q Consensus 36 ~sR~dIRkLIkdG~I~~ 52 (196)
+|+..|+.++++|.|..
T Consensus 12 vs~~tl~~~~~~g~~~~ 28 (49)
T cd04762 12 VSPSTLRRWVKEGKLKA 28 (49)
T ss_pred cCHHHHHHHHHcCCCCc
Confidence 68899999999998753
No 37
>smart00420 HTH_DEOR helix_turn_helix, Deoxyribose operon repressor.
Probab=21.30 E-value=79 Score=19.41 Aligned_cols=25 Identities=24% Similarity=0.411 Sum_probs=18.0
Q ss_pred HHhhhhhHHHHhhhhcCceeecCCC
Q 029269 32 ISMANSRQNIRKLVKDGFIIRKPTK 56 (196)
Q Consensus 32 Ia~A~sR~dIRkLIkdG~I~~kp~~ 56 (196)
++.+.-+..|..|.+.|+|...+..
T Consensus 26 ~s~~tv~~~l~~L~~~g~i~~~~~~ 50 (53)
T smart00420 26 VSEMTIRRDLNKLEEQGLLTRVHGG 50 (53)
T ss_pred CCHHHHHHHHHHHHHCCCEEEeecC
Confidence 3444557788899999999876543
No 38
>TIGR02988 YaaA_near_RecF S4 domain protein YaaA. This small protein has a single S4 domain (pfam01479), as do bacterial ribosomal protein S4, some pseudouridine synthases, tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein family are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members of this family that are not near recF are found instead near dnaA and/or dnaN, the usual neighbors of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.
Probab=21.26 E-value=69 Score=21.55 Aligned_cols=20 Identities=5% Similarity=0.152 Sum_probs=17.2
Q ss_pred hhhHHHHhhhhcCceeecCC
Q 029269 36 NSRQNIRKLVKDGFIIRKPT 55 (196)
Q Consensus 36 ~sR~dIRkLIkdG~I~~kp~ 55 (196)
.||..+++||++|.|..--.
T Consensus 22 ~SR~~~k~li~~G~V~VNg~ 41 (59)
T TIGR02988 22 DSGGQAKWFLQENEVLVNGE 41 (59)
T ss_pred cCHHHHHHHHHcCCEEECCE
Confidence 49999999999999988443
No 39
>KOG1596 consensus Fibrillarin and related nucleolar RNA-binding proteins [RNA processing and modification]
Probab=20.86 E-value=61 Score=30.01 Aligned_cols=71 Identities=23% Similarity=0.316 Sum_probs=40.7
Q ss_pred hhHHHHHHHhccCCCceecCccc------------hhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcC
Q 029269 6 LQKRLSASVLKCGRGKVWLDPNE------------VNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKG 73 (196)
Q Consensus 6 ~QKRLAA~vL~cGk~rVWlDPne------------~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kG 73 (196)
.|-.|||.||+ |..-+||.|-. ++.|++-.- .+|++.+-.. .|-.||--.+.++|+--
T Consensus 137 frSKLAA~I~g-GvdnihikpGsKVLYLGAasGttVSHvSDiVG--------peG~VYAVEf-s~rsGRdL~nmAkkRtN 206 (317)
T KOG1596|consen 137 FRSKLAAGILG-GVDNIHIKPGSKVLYLGAASGTTVSHVSDIVG--------PEGCVYAVEF-SHRSGRDLINMAKKRTN 206 (317)
T ss_pred HHHHHHHHhhc-CccceeecCCceEEEeeccCCceeehhhcccC--------CCceEEEEEe-cccchHHHHHHhhccCC
Confidence 46789999986 99999999953 233333222 4566665443 23345655555554310
Q ss_pred cCCCCCCcccccccCCchh
Q 029269 74 RHSGYGKRKGTREARLPTK 92 (196)
Q Consensus 74 R~~G~GkRkGt~~AR~p~K 92 (196)
== -=--.||+|.|
T Consensus 207 ii------PIiEDArhP~K 219 (317)
T KOG1596|consen 207 II------PIIEDARHPAK 219 (317)
T ss_pred ce------eeeccCCCchh
Confidence 00 00135788877
No 40
>PF14468 DUF4427: Protein of unknown function (DUF4427)
Probab=20.37 E-value=71 Score=26.39 Aligned_cols=33 Identities=21% Similarity=0.399 Sum_probs=25.3
Q ss_pred HHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeec
Q 029269 12 ASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRK 53 (196)
Q Consensus 12 A~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~k 53 (196)
.-++.-|...||| |+ |+.+=+|.||++|.|+.-
T Consensus 30 ~~~~e~G~~wvWi-~D--------N~~~~vRALl~~grV~v~ 62 (132)
T PF14468_consen 30 DYDREFGNAWVWI-HD--------NQSEVVRALLQAGRVKVN 62 (132)
T ss_pred cchhhcCceEEEE-ec--------CcCHHHHHHHHcCceeec
Confidence 3456779999999 33 456678999999998764
No 41
>cd01145 TroA_c Periplasmic binding protein TroA_c. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains.
Probab=20.18 E-value=52 Score=27.32 Aligned_cols=19 Identities=37% Similarity=0.704 Sum_probs=15.8
Q ss_pred CCceecCccchhHHHhhhh
Q 029269 19 RGKVWLDPNEVNEISMANS 37 (196)
Q Consensus 19 k~rVWlDPne~~eIa~A~s 37 (196)
---||+||+....++.++.
T Consensus 107 dPH~Wldp~~~~~~a~~I~ 125 (203)
T cd01145 107 NPHVWLDPNNAPALAKALA 125 (203)
T ss_pred CcCeecCHHHHHHHHHHHH
Confidence 4468999999998888876
Done!