Query         029269
Match_columns 196
No_of_seqs    140 out of 288
Neff          4.0 
Searched_HMMs 46136
Date          Fri Mar 29 10:11:42 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029269.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029269hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1696 60s ribosomal protein  100.0 1.3E-83 2.9E-88  535.7  16.2  193    1-193     1-193 (193)
  2 PTZ00097 60S ribosomal protein 100.0 3.2E-79 6.9E-84  508.9  18.7  174    3-176     1-174 (175)
  3 cd01417 Ribosomal_L19e_E Ribos 100.0 6.8E-78 1.5E-82  497.0  17.4  164    4-167     1-164 (164)
  4 PTZ00436 60S ribosomal protein 100.0 4.1E-76   9E-81  524.9  20.1  170    1-170     1-170 (357)
  5 PF01280 Ribosomal_L19e:  Ribos 100.0 1.1E-73 2.3E-78  466.0  14.2  148    2-149     1-148 (148)
  6 PRK08570 rpl19e 50S ribosomal  100.0 1.2E-72 2.7E-77  460.4  15.7  147    1-147     1-147 (150)
  7 cd00481 Ribosomal_L19e Ribosom 100.0 1.8E-72 3.9E-77  457.3  15.0  145    4-148     1-145 (145)
  8 cd01418 Ribosomal_L19e_A Ribos 100.0 8.1E-71 1.8E-75  447.5  15.5  142    4-145     1-142 (145)
  9 COG2147 RPL19A Ribosomal prote 100.0 3.6E-68 7.7E-73  432.7  14.9  149    1-149     1-149 (150)
 10 TIGR01764 excise DNA binding d  68.0     3.4 7.3E-05   25.6   1.4   30   23-53      1-30  (49)
 11 PF12728 HTH_17:  Helix-turn-he  67.1     3.7   8E-05   26.7   1.5   30   23-53      1-30  (51)
 12 PRK00247 putative inner membra  64.8      70  0.0015   30.9  10.1   75   92-173   291-369 (429)
 13 PF13453 zf-TFIIB:  Transcripti  57.3     4.7  0.0001   25.9   0.6   15   20-34     26-40  (41)
 14 PF13880 Acetyltransf_13:  ESCO  57.1     3.4 7.3E-05   30.4  -0.1   14   16-29      5-18  (70)
 15 TIGR02075 pyrH_bact uridylate   42.7      26 0.00057   29.8   3.1   48    5-52     77-125 (233)
 16 PRK05920 aromatic acid decarbo  36.6      40 0.00087   29.0   3.3   47    9-58    121-167 (204)
 17 PF03428 RP-C:  Replication pro  35.0 2.8E+02  0.0061   23.5   9.4   55   90-144    71-138 (177)
 18 PF09851 SHOCT:  Short C-termin  34.7      47   0.001   20.4   2.5   20  104-123     6-25  (31)
 19 PF01479 S4:  S4 domain;  Inter  34.3      29 0.00064   22.1   1.6   27   31-57      9-35  (48)
 20 smart00345 HTH_GNTR helix_turn  32.7      34 0.00074   21.7   1.8   24   33-56     33-56  (60)
 21 TIGR01610 phage_O_Nterm phage   32.2      54  0.0012   24.4   3.0   38   16-54     40-81  (95)
 22 PF12802 MarR_2:  MarR family;   31.9      33 0.00071   22.5   1.6   52    7-60      7-61  (62)
 23 PRK06029 3-octaprenyl-4-hydrox  31.5      58  0.0013   27.6   3.4   46    9-57    106-151 (185)
 24 cd04254 AAK_UMPK-PyrH-Ec UMP k  31.2      48   0.001   28.1   2.9   44    9-52     81-124 (231)
 25 PHA00616 hypothetical protein   27.5      35 0.00076   23.1   1.1   20  129-148     8-27  (44)
 26 PF08535 KorB:  KorB domain;  I  27.4      53  0.0012   24.1   2.2   39    6-51      5-43  (93)
 27 COG3592 Uncharacterized conser  27.1      30 0.00066   25.9   0.9   19   11-30     35-53  (74)
 28 PRK11548 outer membrane biogen  25.7      49  0.0011   25.6   1.8   26   22-47     36-61  (113)
 29 PRK00083 frr ribosome recyclin  23.9 4.1E+02  0.0089   22.5   7.2   84   22-125    69-158 (185)
 30 PF15319 RHINO:  RAD9, RAD1, HU  22.5      33 0.00072   30.7   0.4   34   83-120   201-234 (236)
 31 PF06353 DUF1062:  Protein of u  22.4      64  0.0014   26.6   2.0   23   34-56    113-135 (142)
 32 smart00099 btg1 tob/btg1 famil  22.1      32  0.0007   27.3   0.2   10   21-30     93-102 (108)
 33 PF00392 GntR:  Bacterial regul  22.0      71  0.0015   21.6   1.9   27   32-58     36-62  (64)
 34 PF00096 zf-C2H2:  Zinc finger,  22.0      69  0.0015   17.2   1.5   16  129-144     7-22  (23)
 35 TIGR00421 ubiX_pad polyprenyl   22.0 1.1E+02  0.0024   25.6   3.4   47    9-58    103-149 (181)
 36 cd04762 HTH_MerR-trunc Helix-T  21.7      76  0.0017   19.1   1.8   17   36-52     12-28  (49)
 37 smart00420 HTH_DEOR helix_turn  21.3      79  0.0017   19.4   1.9   25   32-56     26-50  (53)
 38 TIGR02988 YaaA_near_RecF S4 do  21.3      69  0.0015   21.5   1.7   20   36-55     22-41  (59)
 39 KOG1596 Fibrillarin and relate  20.9      61  0.0013   30.0   1.7   71    6-92    137-219 (317)
 40 PF14468 DUF4427:  Protein of u  20.4      71  0.0015   26.4   1.8   33   12-53     30-62  (132)
 41 cd01145 TroA_c Periplasmic bin  20.2      52  0.0011   27.3   1.0   19   19-37    107-125 (203)

No 1  
>KOG1696 consensus 60s ribosomal protein L19 [Translation, ribosomal structure and biogenesis]
Probab=100.00  E-value=1.3e-83  Score=535.65  Aligned_cols=193  Identities=72%  Similarity=1.083  Sum_probs=190.7

Q ss_pred             CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269            1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK   80 (196)
Q Consensus         1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk   80 (196)
                      |++|++|||||||||+||+.+||+||||+++|++|||||+|++||+||+||.+|+++|||+||+++.+++++|||+|+|+
T Consensus         1 Ms~lrlqKRLAssVl~cGKkKvWlDpNE~~eI~~ansRq~irkLikdg~iI~Kp~~vhsr~r~rk~~~akrkgrH~G~GK   80 (193)
T KOG1696|consen    1 MSNLRLQKRLAASVLKCGKKKVWLDPNEISEISGANSRQNIRKLIKDGLIIRKPVTVHSRSRCRKRLEAKRKGRHMGYGK   80 (193)
T ss_pred             CchHHHHHHHHHHHHHhcccceeeCccHHHHhcccchHHHHHHHHhCCeEeecchhhhHHHHHHHHHHHHHhccccCccc
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHH
Q 029269           81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFE  160 (196)
Q Consensus        81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQae  160 (196)
                      |+||+|||||+|++||++||+||++|++|||+|+||+|+||.||+++|||+|+|+.+||||||+.++|++++|+|+||||
T Consensus        81 RkGTanArmP~k~~Wmrr~RvlRrlL~kyR~skKIdkh~YH~lY~k~KGnvFKnK~~LmE~I~K~KAe~~r~K~LadQae  160 (193)
T KOG1696|consen   81 RKGTANARMPSKVLWMRRMRVLRRLLKKYRDSKKIDKHMYHDLYLKVKGNVFKNKRVLMEHIHKSKAEKAREKLLADQAE  160 (193)
T ss_pred             ccccccccCchhHHHHHHHHHHHHHHHHhhhcccchHHHHHHHHHHHhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhchHHHHHHHHHHHHHHHHHHHHHhhhhhh
Q 029269          161 AKRAKNKASRERKIARREERLAQVRALLAYTIE  193 (196)
Q Consensus       161 A~r~k~k~~r~rr~~~~~~k~~~~~~~~~~~~~  193 (196)
                      |+|.++++++++++|++++++++++.++..++|
T Consensus       161 Arr~k~k~ar~rreer~~~k~~~~~~~~~k~ee  193 (193)
T KOG1696|consen  161 ARRLKNKAARKRREERLAAKPQELIKTLSKEEE  193 (193)
T ss_pred             HHHhhhHHHhhhHHHHHhhchhhhhcccccccC
Confidence            999999999999999999999999999987754


No 2  
>PTZ00097 60S ribosomal protein L19; Provisional
Probab=100.00  E-value=3.2e-79  Score=508.95  Aligned_cols=174  Identities=72%  Similarity=1.159  Sum_probs=171.2

Q ss_pred             cchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCcc
Q 029269            3 SLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRK   82 (196)
Q Consensus         3 ~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRk   82 (196)
                      +|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+
T Consensus         1 ~l~~QKRLAA~vL~cG~~rVWiDP~~~~eI~~A~tR~dIR~LIkdG~I~~kp~kg~SR~R~r~~~~~k~kGR~~G~G~RK   80 (175)
T PTZ00097          1 NLRLQKRLAASVLKCGKNRVWLDPNEASEISLANSRFSIRKLIKDGLIIRKPVAVHSRARARRFHEAKRKGRHTGIGKRR   80 (175)
T ss_pred             CchHHHHHHHHHHCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCCCCCChHHHHHHHHHHHhCCCCCCCCCc
Confidence            58999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHHHH
Q 029269           83 GTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFEAK  162 (196)
Q Consensus        83 Gt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQaeA~  162 (196)
                      ||+|||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++||++.++|+++++.|+||+||+
T Consensus        81 Gtk~AR~p~K~~W~~riR~lRr~Lk~~R~~~kIdk~~Yr~lY~kaKGn~Fknk~~L~~~I~~~kae~~r~k~l~~q~~a~  160 (175)
T PTZ00097         81 GTREARMPTKVLWMRRQRVLRRLLRKYRAAKKIDRHMYHEFYLKSKGNQFKNKRVLIEAIHKTKNEKVKEKKIQDQLEAR  160 (175)
T ss_pred             CcccccCcHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HhchHHHHHHHHHH
Q 029269          163 RAKNKASRERKIAR  176 (196)
Q Consensus       163 r~k~k~~r~rr~~~  176 (196)
                      +.+++++++++.++
T Consensus       161 r~k~~~~~~~r~~~  174 (175)
T PTZ00097        161 RAKAKALRNKRKAK  174 (175)
T ss_pred             HHHHHHHHHHHhhc
Confidence            99999999987653


No 3  
>cd01417 Ribosomal_L19e_E Ribosomal protein L19e, eukaryotic.  L19e is found in the large ribosomal subunit of eukaryotes and archaea. L19e is distinct from the ribosomal subunit L19, which is found in prokaryotes. It consists of two small globular domains connected by an extended segment. It is located toward the surface of the large subunit, with one exposed end involved in forming the intersubunit bridge with the small subunit.  The other exposed end is involved in forming the translocon binding site, along with L22, L23, L24, L29, and L31e subunits.
Probab=100.00  E-value=6.8e-78  Score=496.96  Aligned_cols=164  Identities=78%  Similarity=1.218  Sum_probs=162.0

Q ss_pred             chhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCccc
Q 029269            4 LKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRKG   83 (196)
Q Consensus         4 l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRkG   83 (196)
                      |++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+|
T Consensus         1 l~~QKRLAA~vL~cG~~rVW~DP~~~~eI~~A~tR~dIR~LIkdG~I~~kp~kg~SR~R~r~~~~~k~kGR~~G~G~RkG   80 (164)
T cd01417           1 LRLQKRLAASVLKCGKRKVWLDPNEISEISNANSRQSIRKLIKDGLIIKKPVKVHSRSRARKRHEAKRKGRHMGYGKRKG   80 (164)
T ss_pred             CcHHHHHHHHHHCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCCCcCCHHHHHHHHHHHHhCCCCCCCCCcC
Confidence            68999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHHHHH
Q 029269           84 TREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFEAKR  163 (196)
Q Consensus        84 t~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQaeA~r  163 (196)
                      |+|||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++||++.++|+++++.|+||+||++
T Consensus        81 t~~AR~p~K~~W~~riR~lRr~Lk~~R~~~kIdk~~Yr~lY~kaKGn~Fknk~~L~~~I~~~kae~~r~k~l~~q~~a~r  160 (164)
T cd01417          81 TANARMPSKVLWMRRQRVLRRLLKKYRESKKIDKHLYHELYLKAKGNVFKNKRVLMEHIHKAKAEKAREKELADQAEARR  160 (164)
T ss_pred             cccccCcHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hchH
Q 029269          164 AKNK  167 (196)
Q Consensus       164 ~k~k  167 (196)
                      ++++
T Consensus       161 ~~~~  164 (164)
T cd01417         161 AKNK  164 (164)
T ss_pred             hccC
Confidence            8863


No 4  
>PTZ00436 60S ribosomal protein L19-like protein; Provisional
Probab=100.00  E-value=4.1e-76  Score=524.89  Aligned_cols=170  Identities=68%  Similarity=1.106  Sum_probs=167.9

Q ss_pred             CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269            1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK   80 (196)
Q Consensus         1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk   80 (196)
                      |++|++||||||+||+||++|||||||+++||++||||+|||+||+||+|+++|++||||||+|++++++++|||+|+||
T Consensus         1 M~dLklQKRLAAsVL~cGk~RVWiDPnel~eIa~AiTReDIRkLIkdGlIikKp~KGhSRgRaRkr~eaKrKGRhrG~Gs   80 (357)
T PTZ00436          1 MVSLKLQARLAADILRCGRHRVWLDPNEASEISNANSRKSVRKLIKDGLIIRKPVKVHSRSRWRHMKEAKSMGRHEGAGR   80 (357)
T ss_pred             CcchHHHHHHHHHHhCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCcccCChHHHHHHHHHHHhCcCCCCCC
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHHHH
Q 029269           81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFE  160 (196)
Q Consensus        81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQae  160 (196)
                      |+||+|||||+|++||+|||+||++|++|||+|+||+|+||.||+++|||+|+|++||++|||+.++|+.+++.|.||++
T Consensus        81 RKGTk~AR~P~K~~WIrRIRaLRRlLKklRd~gKIDkh~YR~LYrKAKGn~FKNK~~L~e~I~k~KaE~~R~K~L~dQ~e  160 (357)
T PTZ00436         81 REGTREARMPSKELWMRRLRILRRLLRKYREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLA  160 (357)
T ss_pred             CcCcccccCcHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCccCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhchHHHH
Q 029269          161 AKRAKNKASR  170 (196)
Q Consensus       161 A~r~k~k~~r  170 (196)
                      |+|.++++.+
T Consensus       161 ArR~k~~~~r  170 (357)
T PTZ00436        161 AKRLKDEQHR  170 (357)
T ss_pred             HHHHHhhhhh
Confidence            9999987654


No 5  
>PF01280 Ribosomal_L19e:  Ribosomal protein L19e;  InterPro: IPR000196 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents structural domain of the ribosomal protein L19 from eukaryotes, as well as L19e from archaea []. L19/L19e is absent in bacteria. L19/L19e is part of the large ribosomal subunit, whose structure has been determined in a number of eukaryotic and archaeal species []. L19/L19e is a multi-helical protein consisting of two different 3-helical domains connected by a long, partly helical linker.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3IZR_T 3O58_S 3O5H_S 3IZS_T 2WWA_J 1S1I_P 2WW9_J 2ZKR_7 4A1A_O 4A1C_O ....
Probab=100.00  E-value=1.1e-73  Score=465.97  Aligned_cols=148  Identities=68%  Similarity=1.097  Sum_probs=130.9

Q ss_pred             CcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCc
Q 029269            2 VSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKR   81 (196)
Q Consensus         2 ~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkR   81 (196)
                      |||++||||||+||+||++|||||||+++||++||||+|||+||+||+|+++|++|||||||+++++++++|||+|||+|
T Consensus         1 m~l~~QKRLAa~vL~~G~~rVw~DP~~~~eI~~A~tR~~IR~LIk~G~I~~k~~k~~Sr~R~r~~~~~r~kGr~~G~G~R   80 (148)
T PF01280_consen    1 MDLKLQKRLAASVLGCGKNRVWIDPNELEEIANAITREDIRKLIKDGLIIKKPVKGHSRGRARKRKEARRKGRHRGPGKR   80 (148)
T ss_dssp             -STHHHHHHHHHHHTS-GGGEEE-STTHHHHHH--SHHHHHHHHHTTSEEE---S--STHHHHHHHHHHHCTTS-SSTTS
T ss_pred             CccHHHHHHHHHHHCCCCCcEEeCHHHHHHHHhhhhHHHHHHHHHCCCeEeCCCCCCchHHHHHHHHHHhhccccccccc
Confidence            59999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHH
Q 029269           82 KGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEK  149 (196)
Q Consensus        82 kGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~  149 (196)
                      +||+|||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++|||+.++|+
T Consensus        81 kGt~~AR~~~K~~W~~riR~lRr~Lk~~r~~~kID~~~Yr~lY~kaKGn~Fkn~~~L~~~i~~~k~e~  148 (148)
T PF01280_consen   81 KGTKNARMPEKELWMRRIRALRRLLKRLRDSGKIDRHMYRSLYRKAKGNVFKNKRHLMEHIHKLKAEK  148 (148)
T ss_dssp             -S-HHHHS-HHHHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHHTTS-SSHHHHHHHHHHHHHCH
T ss_pred             cccccccccHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCcccCHHHHHHHHHHhccCC
Confidence            99999999999999999999999999999999999999999999999999999999999999999875


No 6  
>PRK08570 rpl19e 50S ribosomal protein L19e; Reviewed
Probab=100.00  E-value=1.2e-72  Score=460.35  Aligned_cols=147  Identities=43%  Similarity=0.742  Sum_probs=145.4

Q ss_pred             CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269            1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK   80 (196)
Q Consensus         1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk   80 (196)
                      |++|++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+||||
T Consensus         1 M~~l~~qkRLAA~iL~~G~~rVw~DP~~~~eI~~A~tR~dIR~LI~~G~I~~kp~kg~Sr~R~r~~~~~r~kGr~~G~G~   80 (150)
T PRK08570          1 MMDLSAQKRLAADILGVGVSRVWIDPEALEDVAEAITREDIRELIKEGVIKAKPKKGISRGRARERHEKRKKGRRRGPGS   80 (150)
T ss_pred             CcchHHHHHHHHHHHCCCccceeeCHHHHHHHHHHhhHHHHHHHHHCCCeeecCccCCChHHHHHHHHHHHhCCCCCCCC
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhh
Q 029269           81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKA  147 (196)
Q Consensus        81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~  147 (196)
                      |+||+|||+|+|+.||.+||+||++|++|||+|+||+|+||.||++||||+|+|++||++||+....
T Consensus        81 RkGt~~AR~p~K~~W~~riR~lRr~Lk~lR~~~kId~~~Yr~lY~kaKGn~Fkn~~~L~~~i~~~~~  147 (150)
T PRK08570         81 RKGKKGARTPKKERWINRIRALRRYLRELRDEGKIDRKTYRKLYRKAKGGEFRSVSHLKTYIEEHGL  147 (150)
T ss_pred             CCCcccCCCCHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCccCCHHHHHHHHHHhcc
Confidence            9999999999999999999999999999999999999999999999999999999999999998764


No 7  
>cd00481 Ribosomal_L19e Ribosomal protein L19e.  L19e is found in the large ribosomal subunit of eukaryotes and archaea. L19e is distinct from the ribosomal subunit L19, which is found in prokaryotes. It consists of two small globular domains connected by an extended segment. It is located toward the surface of the large subunit, with one exposed end involved in forming the intersubunit bridge with the small subunit.  The other exposed end is involved in forming the translocon binding site, along with L22, L23, L24, L29, and L31e subunits.
Probab=100.00  E-value=1.8e-72  Score=457.27  Aligned_cols=145  Identities=64%  Similarity=1.101  Sum_probs=142.9

Q ss_pred             chhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCccc
Q 029269            4 LKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRKG   83 (196)
Q Consensus         4 l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRkG   83 (196)
                      |++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+|
T Consensus         1 l~~QkRLAA~vL~~G~~rVW~DP~~~~eI~~A~tR~dIR~LIkdG~I~~kp~kg~Sr~R~r~~~~~r~kGr~~G~G~RkG   80 (145)
T cd00481           1 LRLQKRLAADILKCGKNRVWIDPNELEEIANANTREDIRKLIKDGLIIKKPKKGHSRGRARKRHEARRKGRHRGPGSRKG   80 (145)
T ss_pred             CcHHHHHHHHHHCCCCCceeeCHHHHHHHHHhhhHHHHHHHHHCCCeeecCCCCCChHHHHHHHHHHHhCcCCCCCCccC
Confidence            68999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhH
Q 029269           84 TREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAE  148 (196)
Q Consensus        84 t~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e  148 (196)
                      |++||+|+|+.||++||+||++|++|||+|+||+|+||.||++||||+|+|++||++||++.++|
T Consensus        81 t~~AR~p~K~~W~~riR~lRr~Lk~~R~~~kIdk~~Yr~lY~kaKG~~Fknk~~L~~~i~~~~~~  145 (145)
T cd00481          81 TKGARMPSKELWIRRIRALRRLLKKLRDSGKIDKHTYRELYLKAKGNVFKNKRHLKEYIHKAKAE  145 (145)
T ss_pred             ccccCCcHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCCHHHHHHHHHHhccC
Confidence            99999999999999999999999999999999999999999999999999999999999998764


No 8  
>cd01418 Ribosomal_L19e_A Ribosomal protein L19e, archaeal.  L19e is found in the large ribosomal subunit of eukaryotes and archaea. L19e is distinct from the ribosomal subunit L19, which is found in prokaryotes. It consists of two small globular domains connected by an extended segment. It is located toward the surface of the large subunit, with one exposed end involved in forming the intersubunit bridge with the small subunit.  The other exposed end is involved in forming the translocon binding site, along with L22, L23, L24, L29, and L31e subunits.
Probab=100.00  E-value=8.1e-71  Score=447.48  Aligned_cols=142  Identities=39%  Similarity=0.722  Sum_probs=140.5

Q ss_pred             chhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCCccc
Q 029269            4 LKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGKRKG   83 (196)
Q Consensus         4 l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~GkRkG   83 (196)
                      |++||||||+||+||+++||||||+++||++||||+|||+||+||+|+++|++||||||++++++++++|||+|||||+|
T Consensus         1 l~~QkRLAA~iL~~G~~rVw~DP~~~~eI~~A~tR~dIR~LI~~G~I~~kp~kg~Sr~R~r~~~~~r~kGr~~G~G~RkG   80 (145)
T cd01418           1 LSSQRRLAADILGVGINRVWIDPERLEEVAEAITRDDIRALIKEGVIKAKPKKGISRGRLKERHEKRKKGRRRGPGSRKG   80 (145)
T ss_pred             CcHHHHHHHHHHCCCCCeeeeChHHHHHHHHhhhHHHHHHHHHCCCeeecCCCCCCHHHHHHHHHHHHhCcCCCCCCCCC
Confidence            67999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHh
Q 029269           84 TREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKS  145 (196)
Q Consensus        84 t~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~  145 (196)
                      |+|||+|+|+.||.+||+||++|++|||+|+||+|+||.||++||||+|+|++||++||..+
T Consensus        81 t~~AR~p~K~~W~~riR~lRr~Lk~~R~~~kId~~~Yr~lY~kaKGn~Fkn~~~L~~~I~~~  142 (145)
T cd01418          81 KKGARTPKKERWIKTIRALRRYLKELRDKGKIDKKTYRKLYRKAKGGSFRSLSHLKSYLKQH  142 (145)
T ss_pred             ccccCCCHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHhcCCCcCCHHHHHHHHHHh
Confidence            99999999999999999999999999999999999999999999999999999999999876


No 9  
>COG2147 RPL19A Ribosomal protein L19E [Translation, ribosomal structure and biogenesis]
Probab=100.00  E-value=3.6e-68  Score=432.68  Aligned_cols=149  Identities=49%  Similarity=0.808  Sum_probs=146.7

Q ss_pred             CCcchhhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcCcCCCCCC
Q 029269            1 MVSLKLQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKGRHSGYGK   80 (196)
Q Consensus         1 M~~l~~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kGR~~G~Gk   80 (196)
                      |+||++|+||||+||+||++|||||||++++|++|+||+|||.||+||+|+++|++|+|+||+++++++++||||+||||
T Consensus         1 M~nl~~qkRLAA~il~vG~~Rvwidp~~~eei~~A~TR~dIr~LIk~g~I~~k~~kg~SrgR~rkr~~qkkkgr~rG~Gs   80 (150)
T COG2147           1 MSNLRTQKRLAADILGVGENRVWIDPNEIEEIASAITREDIRALIKDGVIKAKPKKGISRGRARKRHAQKKKGRRRGPGS   80 (150)
T ss_pred             CchHHHHHHHHHHHHccCcceeeeChHHHHHHHHhhhHHHHHHHHHCCCeeeccccccchHHHHHHHHHHhcccCCCCCc
Confidence            88999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHH
Q 029269           81 RKGTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEK  149 (196)
Q Consensus        81 RkGt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~  149 (196)
                      |+||++||||+|+.||.+||+||++|++|||+|+||+|+||.||+++|||.|++++||.+||.+++.++
T Consensus        81 RKG~k~AR~p~K~~Wi~~IRalR~~Lr~lrd~gkIdk~~YR~lY~~aKGg~fk~~~~L~~~i~~~~~~k  149 (150)
T COG2147          81 RKGTKGARMPSKERWIKRIRALRRELRKLRDDGKIDKHTYRKLYRMAKGGAFKSKSHLKSYIEEAKLLK  149 (150)
T ss_pred             cccccccCCCHHHHHHHHHHHHHHHHHHHHHcCCcCHHHHHHHHHHHcCCccccHHHHHHHHHHhcccc
Confidence            999999999999999999999999999999999999999999999999999999999999999987653


No 10 
>TIGR01764 excise DNA binding domain, excisionase family. An excisionase, or Xis protein, is a small protein that binds and promotes excisive recombination; it is not enzymatically active. This model represents a number of putative excisionases and related proteins from temperate phage, plasmids, and transposons, as well as DNA binding domains of other proteins, such as a DNA modification methylase. This model identifies mostly small proteins and N-terminal regions of large proteins, but some proteins appear to have two copies. This domain appears similar, in both sequence and predicted secondary structure (PSIPRED) to the MerR family of transcriptional regulators (pfam00376).
Probab=68.04  E-value=3.4  Score=25.63  Aligned_cols=30  Identities=20%  Similarity=0.297  Sum_probs=24.7

Q ss_pred             ecCccchhHHHhhhhhHHHHhhhhcCceeec
Q 029269           23 WLDPNEVNEISMANSRQNIRKLVKDGFIIRK   53 (196)
Q Consensus        23 WlDPne~~eIa~A~sR~dIRkLIkdG~I~~k   53 (196)
                      ||++.|+.++-. +|+..|+.|+++|.|-.-
T Consensus         1 ~lt~~e~a~~lg-is~~ti~~~~~~g~i~~~   30 (49)
T TIGR01764         1 YLTVEEAAEYLG-VSKDTVYRLIHEGELPAY   30 (49)
T ss_pred             CCCHHHHHHHHC-CCHHHHHHHHHcCCCCeE
Confidence            677788888775 789999999999987653


No 11 
>PF12728 HTH_17:  Helix-turn-helix domain
Probab=67.13  E-value=3.7  Score=26.69  Aligned_cols=30  Identities=23%  Similarity=0.297  Sum_probs=25.2

Q ss_pred             ecCccchhHHHhhhhhHHHHhhhhcCceeec
Q 029269           23 WLDPNEVNEISMANSRQNIRKLVKDGFIIRK   53 (196)
Q Consensus        23 WlDPne~~eIa~A~sR~dIRkLIkdG~I~~k   53 (196)
                      ||++.|+-++-. +|+..|+.+++.|.|..-
T Consensus         1 ~lt~~e~a~~l~-is~~tv~~~~~~g~i~~~   30 (51)
T PF12728_consen    1 YLTVKEAAELLG-ISRSTVYRWIRQGKIPPF   30 (51)
T ss_pred             CCCHHHHHHHHC-cCHHHHHHHHHcCCCCeE
Confidence            688889988888 799999999999977433


No 12 
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=64.75  E-value=70  Score=30.85  Aligned_cols=75  Identities=16%  Similarity=0.279  Sum_probs=40.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHhcC---CCChhhhHHHHhhhcCCccccHHHHHHHHHHhhhHHHHhhhhhHH-HHHHHhchH
Q 029269           92 KVLWMRRMRVLRRLLRKYREAK---KIDRHMYHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQ-FEAKRAKNK  167 (196)
Q Consensus        92 K~~Wm~riR~lRr~Lr~~Re~~---kID~~~Yr~lY~kaKGn~Fknk~~L~e~I~k~k~e~~r~k~L~dQ-aeA~r~k~k  167 (196)
                      +..|..+.+..+..++..|...   .|.+..+..|..-       +...=.+.=...+++++++|.++.+ -||.|.-++
T Consensus       291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~-------~~~~~~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~  363 (429)
T PRK00247        291 RAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAE-------NAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINR  363 (429)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccccccCCcccHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4556666666666665555554   7888877777432       2222233334445666666666543 344444444


Q ss_pred             HHHHHH
Q 029269          168 ASRERK  173 (196)
Q Consensus       168 ~~r~rr  173 (196)
                      ++++.+
T Consensus       364 ~~~~~~  369 (429)
T PRK00247        364 EARQER  369 (429)
T ss_pred             HhhhhH
Confidence            333333


No 13 
>PF13453 zf-TFIIB:  Transcription factor zinc-finger
Probab=57.28  E-value=4.7  Score=25.92  Aligned_cols=15  Identities=27%  Similarity=0.809  Sum_probs=12.4

Q ss_pred             CceecCccchhHHHh
Q 029269           20 GKVWLDPNEVNEISM   34 (196)
Q Consensus        20 ~rVWlDPne~~eIa~   34 (196)
                      .-||||++|++.|.+
T Consensus        26 ~G~W~d~~el~~~~e   40 (41)
T PF13453_consen   26 GGIWFDAGELEKLLE   40 (41)
T ss_pred             CeEEccHHHHHHHHh
Confidence            469999999998753


No 14 
>PF13880 Acetyltransf_13:  ESCO1/2 acetyl-transferase
Probab=57.06  E-value=3.4  Score=30.40  Aligned_cols=14  Identities=29%  Similarity=1.037  Sum_probs=12.1

Q ss_pred             ccCCCceecCccch
Q 029269           16 KCGRGKVWLDPNEV   29 (196)
Q Consensus        16 ~cGk~rVWlDPne~   29 (196)
                      -||++|||..|..-
T Consensus         5 ~~GI~RIWV~~~~R   18 (70)
T PF13880_consen    5 VCGISRIWVSPSHR   18 (70)
T ss_pred             EEEeEEEEeChhhh
Confidence            59999999999863


No 15 
>TIGR02075 pyrH_bact uridylate kinase. This protein, also called UMP kinase, converts UMP to UDP by adding a phosphate from ATP. It is the first step in pyrimidine biosynthesis. GTP is an allosteric activator. In a large fraction of all bacterial genomes, the gene tends to be located immediately downstream of elongation factor Ts and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function and found in the archaea and in spirochetes, is described by a separate model, TIGR02076.
Probab=42.73  E-value=26  Score=29.80  Aligned_cols=48  Identities=15%  Similarity=0.126  Sum_probs=38.0

Q ss_pred             hhhHHH-HHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceee
Q 029269            5 KLQKRL-SASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIR   52 (196)
Q Consensus         5 ~~QKRL-AA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~   52 (196)
                      .++-+| +..+...|.+-++++|.....+..-.+.+.+..|++.|.|..
T Consensus        77 ~l~~~l~~~~L~~~Gi~a~~l~~~~~~~~~~~~~~~~i~~ll~~g~VpV  125 (233)
T TIGR02075        77 VINGLALRDALEKLGVKTRVLSAISMPQICESYIRRKAIKHLEKGKVVI  125 (233)
T ss_pred             HHHHHHHHHHHHhCCCCcEEeccccCCCCccccCHHHHHHHHHCCCEEE
Confidence            355665 777778999999999998765555567899999999999854


No 16 
>PRK05920 aromatic acid decarboxylase; Validated
Probab=36.63  E-value=40  Score=29.04  Aligned_cols=47  Identities=30%  Similarity=0.259  Sum_probs=36.5

Q ss_pred             HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCC
Q 029269            9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIH   58 (196)
Q Consensus         9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~   58 (196)
                      +.|...|+.|.. |.|=|.+...  ++++++++..|-..|.++.-|..|.
T Consensus       121 ~~a~~~L~~~~p-vvi~P~~m~~--~~~~~~nl~~L~~~G~~ii~P~~g~  167 (204)
T PRK05920        121 RAADVVLKERRK-LILVPRETPL--SLIHLENMLKLAEAGAIILPAIPAF  167 (204)
T ss_pred             HHHHHHHhcCCC-EEEEeCCCCC--CHHHHHHHHHHHHCCCEEeCCcccc
Confidence            555577887774 5555765444  8889999999999999999999874


No 17 
>PF03428 RP-C:  Replication protein C N-terminal domain;  InterPro: IPR005090 Proteins in this group have homology with the RepC protein of Agrobacterium Ri and Ti plasmids []. They may be involved in plasmid replication and stabilisation functions.
Probab=34.97  E-value=2.8e+02  Score=23.48  Aligned_cols=55  Identities=25%  Similarity=0.329  Sum_probs=39.8

Q ss_pred             chhHHHHHHH-----HHHHHHHHHHHhcCCC---ChhhhHHHHhhhcCCc----cc-cHHHHHHHHHH
Q 029269           90 PTKVLWMRRM-----RVLRRLLRKYREAKKI---DRHMYHDMYMKVKGNV----FK-NKRVLMESIHK  144 (196)
Q Consensus        90 p~K~~Wm~ri-----R~lRr~Lr~~Re~~kI---D~~~Yr~lY~kaKGn~----Fk-nk~~L~e~I~k  144 (196)
                      |+...=..++     +.+|+.|..|-|.|.|   |+..++-+..+-.+|.    |= |.+-|..-..+
T Consensus        71 pSN~~La~r~~G~s~~tlrR~l~~LveaGLI~rrDS~NgkRy~~R~~~G~I~~A~GfdLsPL~~R~~E  138 (177)
T PF03428_consen   71 PSNAQLAERLNGMSERTLRRHLARLVEAGLIVRRDSPNGKRYARRDRGGRIVEAFGFDLSPLIARAEE  138 (177)
T ss_pred             cCHHHHHHHHcCCCHHHHHHHHHHHHHCCCeeeccCCCCCccCccCCCCCEEeEeCcCHHHHHHHHHH
Confidence            4444445556     8999999999999999   5567888777766654    43 67777766555


No 18 
>PF09851 SHOCT:  Short C-terminal domain;  InterPro: IPR018649  This family of hypothetical prokaryotic proteins has no known function. 
Probab=34.70  E-value=47  Score=20.37  Aligned_cols=20  Identities=15%  Similarity=0.298  Sum_probs=17.1

Q ss_pred             HHHHHHHhcCCCChhhhHHH
Q 029269          104 RLLRKYREAKKIDRHMYHDM  123 (196)
Q Consensus       104 r~Lr~~Re~~kID~~~Yr~l  123 (196)
                      ..|+.+.+.|.|+...|...
T Consensus         6 ~~L~~l~~~G~IseeEy~~~   25 (31)
T PF09851_consen    6 EKLKELYDKGEISEEEYEQK   25 (31)
T ss_pred             HHHHHHHHcCCCCHHHHHHH
Confidence            45888999999999999764


No 19 
>PF01479 S4:  S4 domain;  InterPro: IPR002942 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yeast protein containing a pseudouridine synthetase and a deaminase domain, bacterial tyrosyl-tRNA synthetases, and a number of uncharacterised, small proteins that may be involved in translation regulation []. The S4 domain probably mediates binding to RNA.; GO: 0003723 RNA binding; PDB: 3BBU_A 1DM9_B 2K6P_A 3U5G_E 3U5C_E 3IZB_D 2XZM_D 2XZN_D 3O30_E 3O2Z_E ....
Probab=34.31  E-value=29  Score=22.12  Aligned_cols=27  Identities=26%  Similarity=0.477  Sum_probs=20.7

Q ss_pred             HHHhhhhhHHHHhhhhcCceeecCCCC
Q 029269           31 EISMANSRQNIRKLVKDGFIIRKPTKI   57 (196)
Q Consensus        31 eIa~A~sR~dIRkLIkdG~I~~kp~~~   57 (196)
                      +...+.||.++++||+.|.|..--...
T Consensus         9 ~~~~~~sr~~a~~~I~~g~V~VNg~~v   35 (48)
T PF01479_consen    9 RLGLASSRSEARRLIKQGRVKVNGKVV   35 (48)
T ss_dssp             HTTSSSSHHHHHHHHHTTTEEETTEEE
T ss_pred             HcCCcCCHHHHHHhcCCCEEEECCEEE
Confidence            445577999999999999987654433


No 20 
>smart00345 HTH_GNTR helix_turn_helix gluconate operon transcriptional repressor.
Probab=32.69  E-value=34  Score=21.71  Aligned_cols=24  Identities=25%  Similarity=0.559  Sum_probs=17.6

Q ss_pred             HhhhhhHHHHhhhhcCceeecCCC
Q 029269           33 SMANSRQNIRKLVKDGFIIRKPTK   56 (196)
Q Consensus        33 a~A~sR~dIRkLIkdG~I~~kp~~   56 (196)
                      +.+.-+..+..|.++|+|...|-.
T Consensus        33 s~~tv~~~l~~L~~~g~i~~~~~~   56 (60)
T smart00345       33 SRTTVREALSRLEAEGLVQRRPGS   56 (60)
T ss_pred             CHHHHHHHHHHHHHCCCEEEecCC
Confidence            334557778899999999876644


No 21 
>TIGR01610 phage_O_Nterm phage replication protein O, N-terminal domain. This model represents the N-terminal region of the phage lambda replication protein O and homologous regions of other phage proteins.
Probab=32.22  E-value=54  Score=24.41  Aligned_cols=38  Identities=18%  Similarity=0.305  Sum_probs=25.4

Q ss_pred             ccCCCceecCccchhHHHhhhhhHH----HHhhhhcCceeecC
Q 029269           16 KCGRGKVWLDPNEVNEISMANSRQN----IRKLVKDGFIIRKP   54 (196)
Q Consensus        16 ~cGk~rVWlDPne~~eIa~A~sR~d----IRkLIkdG~I~~kp   54 (196)
                      |.++..+++.+.|+.++... ||+.    |..|.+.|+|....
T Consensus        40 G~~~~~~~is~~eLa~~~g~-sr~tVsr~L~~Le~~GlI~r~~   81 (95)
T TIGR01610        40 GWNKKQDRVTATVIAELTGL-SRTHVSDAIKSLARRRIIFRQG   81 (95)
T ss_pred             CccccCCccCHHHHHHHHCc-CHHHHHHHHHHHHHCCCeeeec
Confidence            34456677777766666544 4444    55799999998654


No 22 
>PF12802 MarR_2:  MarR family; PDB: 3ECO_B 2QWW_B 3KP6_B 3KP4_B 3KP2_A 3KP5_A 3KP3_B 3KP7_A 3NQO_B 3K0L_B ....
Probab=31.87  E-value=33  Score=22.47  Aligned_cols=52  Identities=21%  Similarity=0.242  Sum_probs=30.5

Q ss_pred             hHHHHHHHhccCCCceecCccchhHH---HhhhhhHHHHhhhhcCceeecCCCCCCh
Q 029269            7 QKRLSASVLKCGRGKVWLDPNEVNEI---SMANSRQNIRKLVKDGFIIRKPTKIHSR   60 (196)
Q Consensus         7 QKRLAA~vL~cGk~rVWlDPne~~eI---a~A~sR~dIRkLIkdG~I~~kp~~~~SR   60 (196)
                      |-++-..|..+|..  ++-+.++.+.   +-...-.-|..|++.|+|.+.+.....|
T Consensus         7 q~~vL~~l~~~~~~--~~t~~~la~~l~~~~~~vs~~v~~L~~~Glv~r~~~~~D~R   61 (62)
T PF12802_consen    7 QFRVLMALARHPGE--ELTQSELAERLGISKSTVSRIVKRLEKKGLVERERDPGDRR   61 (62)
T ss_dssp             HHHHHHHHHHSTTS--GEEHHHHHHHHTS-HHHHHHHHHHHHHTTSEEEEE-SSSTT
T ss_pred             HHHHHHHHHHCCCC--CcCHHHHHHHHCcCHHHHHHHHHHHHHCCCEEEeCCCCCCC
Confidence            44445555555554  3333333332   2233345688999999999998877665


No 23 
>PRK06029 3-octaprenyl-4-hydroxybenzoate carboxy-lyase; Provisional
Probab=31.52  E-value=58  Score=27.57  Aligned_cols=46  Identities=26%  Similarity=0.255  Sum_probs=35.8

Q ss_pred             HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCC
Q 029269            9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKI   57 (196)
Q Consensus         9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~   57 (196)
                      +.|...|.+++ .|.|=|.  +--.+.++.+++..|-+.|.++..|..|
T Consensus       106 ~~a~~~L~~~~-pvii~P~--~M~~~p~~~~Nl~~L~~~G~~vi~P~~g  151 (185)
T PRK06029        106 RAADVMLKERR-RLVLCVR--ETPLHLGHLRNMTKLAEMGAIIMPPVPA  151 (185)
T ss_pred             HHHHHHHhcCC-CEEEEec--cccCCHHHHHHHHHHHHCcCEEECCCcc
Confidence            44556778765 4555574  4566889999999999999999999876


No 24 
>cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be
Probab=31.15  E-value=48  Score=28.15  Aligned_cols=44  Identities=9%  Similarity=0.130  Sum_probs=35.9

Q ss_pred             HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceee
Q 029269            9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIR   52 (196)
Q Consensus         9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~   52 (196)
                      -|++.+...|.+-+||+|-+.+.+.....-+.+..+++.|.|..
T Consensus        81 ll~~~L~~~Gv~a~~l~~~~~~~~~~~~~~~~l~~~l~~g~ipV  124 (231)
T cd04254          81 ALQDALESLGVKTRVMSAIPMQGVAEPYIRRRAIRHLEKGRVVI  124 (231)
T ss_pred             HHHHHHHHcCCCeEEEcHHHhhhhhcccCHHHHHHHHHCCCEEE
Confidence            35667788999999999998876655566799999999998744


No 25 
>PHA00616 hypothetical protein
Probab=27.51  E-value=35  Score=23.12  Aligned_cols=20  Identities=20%  Similarity=0.571  Sum_probs=16.6

Q ss_pred             CCccccHHHHHHHHHHhhhH
Q 029269          129 GNVFKNKRVLMESIHKSKAE  148 (196)
Q Consensus       129 Gn~Fknk~~L~e~I~k~k~e  148 (196)
                      |..|.++++|..|+.+.-.+
T Consensus         8 G~~F~~~s~l~~H~r~~hg~   27 (44)
T PHA00616          8 GGIFRKKKEVIEHLLSVHKQ   27 (44)
T ss_pred             hHHHhhHHHHHHHHHHhcCC
Confidence            78999999999999765443


No 26 
>PF08535 KorB:  KorB domain;  InterPro: IPR013741 This entry contains several KorB transcriptional repressor proteins. The korB gene is a major regulatory element in the replication and maintenance of broad host-range plasmid RK2. It negatively controls the replication gene trfA, the host-lethal determinants kilA and kilB, and the korA-korB operon []. This domain includes the DNA-binding HTH motif []. ; PDB: 1R71_C.
Probab=27.39  E-value=53  Score=24.11  Aligned_cols=39  Identities=23%  Similarity=0.600  Sum_probs=23.7

Q ss_pred             hhHHHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCcee
Q 029269            6 LQKRLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFII   51 (196)
Q Consensus         6 ~QKRLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~   51 (196)
                      +|.-+|..|   |+++-|+    .+-++-+.-=++|+.|+.+|.|.
T Consensus         5 tq~eIA~~l---Gks~s~V----s~~l~Ll~lP~~i~~~v~~g~~~   43 (93)
T PF08535_consen    5 TQEEIAKRL---GKSRSWV----SNHLALLDLPEEIKELVRSGRIS   43 (93)
T ss_dssp             -HHHHHHHT---T--HHHH----HHHHGGGS--HHHHHHHHTTS--
T ss_pred             CHHHHHHHH---CCCHHHH----HHHHHHHcCCHHHHHHHHcCCCc
Confidence            467777766   9998887    33444555667899999999774


No 27 
>COG3592 Uncharacterized conserved protein [Function unknown]
Probab=27.11  E-value=30  Score=25.93  Aligned_cols=19  Identities=32%  Similarity=0.789  Sum_probs=14.6

Q ss_pred             HHHHhccCCCceecCccchh
Q 029269           11 SASVLKCGRGKVWLDPNEVN   30 (196)
Q Consensus        11 AA~vL~cGk~rVWlDPne~~   30 (196)
                      ++.|+++|..= ||+|+..+
T Consensus        35 n~~vF~~~rkP-WI~Pd~~~   53 (74)
T COG3592          35 NPKVFNLGRKP-WIMPDAVD   53 (74)
T ss_pred             CHhhcccCCCC-ccCCCCCC
Confidence            56788888765 99998754


No 28 
>PRK11548 outer membrane biogenesis protein BamE; Provisional
Probab=25.69  E-value=49  Score=25.58  Aligned_cols=26  Identities=23%  Similarity=0.440  Sum_probs=22.9

Q ss_pred             eecCccchhHHHhhhhhHHHHhhhhc
Q 029269           22 VWLDPNEVNEISMANSRQNIRKLVKD   47 (196)
Q Consensus        22 VWlDPne~~eIa~A~sR~dIRkLIkd   47 (196)
                      -++||+.+++|.--.|+++|+.|+=.
T Consensus        36 ~~~~~~~l~~l~~GmTk~qV~~lLGt   61 (113)
T PRK11548         36 NYLTPNDVAKIHVGMTQQQVAYTLGT   61 (113)
T ss_pred             ccCCHHHHHHhcCCCCHHHHHHHcCC
Confidence            37899999999999999999998743


No 29 
>PRK00083 frr ribosome recycling factor; Reviewed
Probab=23.86  E-value=4.1e+02  Score=22.50  Aligned_cols=84  Identities=23%  Similarity=0.300  Sum_probs=55.1

Q ss_pred             eecCccchhHHHhhhhhHHHH-hhhhcCceeecCCCCCChhHH-HHHHHHHhcCcCCCCCCcccccccCCchhHHHHHHH
Q 029269           22 VWLDPNEVNEISMANSRQNIR-KLVKDGFIIRKPTKIHSRSRA-RRMKEAKRKGRHSGYGKRKGTREARLPTKVLWMRRM   99 (196)
Q Consensus        22 VWlDPne~~eIa~A~sR~dIR-kLIkdG~I~~kp~~~~SR~R~-r~~~~~k~kGR~~G~GkRkGt~~AR~p~K~~Wm~ri   99 (196)
                      +| ||+-+..|..|+.-.++- .=+.||-.++-|..--+.-|- ---+.++..+                   +.+-..|
T Consensus        69 p~-D~~~i~~I~kAI~~s~lgl~P~~dg~~Iri~iP~lT~E~R~elvK~~k~~~-------------------E~aKv~i  128 (185)
T PRK00083         69 PW-DKSMLKAIEKAIRASDLGLNPSNDGTVIRLPIPPLTEERRKELVKQVKKEA-------------------EEAKVAI  128 (185)
T ss_pred             eC-CHhHHHHHHHHHHHCCCCCCcccCCCEEEecCCCCCHHHHHHHHHHHHHHH-------------------HHHHHHH
Confidence            45 999999999999876653 224588888888766665443 2233333322                   2344678


Q ss_pred             HHHHHH----HHHHHhcCCCChhhhHHHHh
Q 029269          100 RVLRRL----LRKYREAKKIDRHMYHDMYM  125 (196)
Q Consensus       100 R~lRr~----Lr~~Re~~kID~~~Yr~lY~  125 (196)
                      |.+|+-    |+.+...|.|.....+.+..
T Consensus       129 RniRr~~~~~iKk~~k~~~iseD~~k~~e~  158 (185)
T PRK00083        129 RNIRRDANDKLKKLEKDKEISEDELKRAED  158 (185)
T ss_pred             HHHHHHHHHHHHHHhhcCCCChHHHHHHHH
Confidence            888886    66666667788887776654


No 30 
>PF15319 RHINO:  RAD9, RAD1, HUS1-interacting nuclear orphan protein
Probab=22.54  E-value=33  Score=30.75  Aligned_cols=34  Identities=41%  Similarity=0.641  Sum_probs=26.0

Q ss_pred             cccccCCchhHHHHHHHHHHHHHHHHHHhcCCCChhhh
Q 029269           83 GTREARLPTKVLWMRRMRVLRRLLRKYREAKKIDRHMY  120 (196)
Q Consensus        83 Gt~~AR~p~K~~Wm~riR~lRr~Lr~~Re~~kID~~~Y  120 (196)
                      .|-.--..-|+.|-+|    +.||..|||.|++++.+|
T Consensus       201 DTPE~~YGiKvTWRRR----~hL~~yLrerGkL~~sq~  234 (236)
T PF15319_consen  201 DTPEEKYGIKVTWRRR----RHLLAYLRERGKLSRSQF  234 (236)
T ss_pred             cCCcccccceeeeecc----HHHHHHHHHhCccchhhc
Confidence            3444455669999666    678999999999998765


No 31 
>PF06353 DUF1062:  Protein of unknown function (DUF1062);  InterPro: IPR009412 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=22.39  E-value=64  Score=26.57  Aligned_cols=23  Identities=30%  Similarity=0.673  Sum_probs=20.7

Q ss_pred             hhhhhHHHHhhhhcCceeecCCC
Q 029269           34 MANSRQNIRKLVKDGFIIRKPTK   56 (196)
Q Consensus        34 ~A~sR~dIRkLIkdG~I~~kp~~   56 (196)
                      .-+||.+|..|+++|.|...|.+
T Consensus       113 L~lSrs~l~~l~~~G~I~~~~~~  135 (142)
T PF06353_consen  113 LGLSRSRLKRLIEQGLIRSDPDK  135 (142)
T ss_pred             hCcCHHHHHHHHHCCCEEecCcc
Confidence            45799999999999999998876


No 32 
>smart00099 btg1 tob/btg1 family. The tob/btg1 is a family of proteins that inhibit cell proliferation.
Probab=22.13  E-value=32  Score=27.28  Aligned_cols=10  Identities=50%  Similarity=1.165  Sum_probs=8.2

Q ss_pred             ceecCccchh
Q 029269           21 KVWLDPNEVN   30 (196)
Q Consensus        21 rVWlDPne~~   30 (196)
                      .||+||.|++
T Consensus        93 tiwvDP~eVs  102 (108)
T smart00099       93 TLWVDPFEVS  102 (108)
T ss_pred             EEEECCCEEE
Confidence            5899999864


No 33 
>PF00392 GntR:  Bacterial regulatory proteins, gntR family;  InterPro: IPR000524 Many bacterial transcription regulation proteins bind DNA through a helix-turn-helix (HTH) motif, which can be classified into subfamilies on the basis of sequence similarities. The HTH GntR family has many members distributed among diverse bacterial groups that regulate various biological processes. It was named GntR after the Bacillus subtilis repressor of the gluconate operon []. Family members include GntR, HutC, KorA, NtaR, FadR, ExuR, FarR, DgoR and PhnF. The crystal structure of the FadR protein has been determined []. In general, these proteins contain a DNA-binding HTH domain at the N terminus, and an effector-binding or oligomerisation domain at the C terminus (IPR011711 from INTERPRO). The DNA-binding domain is well conserved in structure for the whole of the GntR family, consisting of a 3-helical bundle core with a small beta-sheet (wing); the GntR winged helix structure is similar to that found in several other transcriptional regulator families. The regions outside the DNA-binding domain are more variable and are consequently used to define GntR subfamilies []. This entry represents the N-terminal DNA-binding domain of the GntR family.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1HW1_B 1H9T_A 1HW2_A 1H9G_A 1E2X_A 3IHU_A 3C7J_A 2RA5_A 3BY6_C 3IC7_A ....
Probab=22.00  E-value=71  Score=21.61  Aligned_cols=27  Identities=30%  Similarity=0.570  Sum_probs=20.9

Q ss_pred             HHhhhhhHHHHhhhhcCceeecCCCCC
Q 029269           32 ISMANSRQNIRKLVKDGFIIRKPTKIH   58 (196)
Q Consensus        32 Ia~A~sR~dIRkLIkdG~I~~kp~~~~   58 (196)
                      |+-..-|+-+..|..+|+|...|-.|.
T Consensus        36 vsr~tvr~al~~L~~~g~i~~~~~~G~   62 (64)
T PF00392_consen   36 VSRTTVREALRRLEAEGLIERRPGRGT   62 (64)
T ss_dssp             S-HHHHHHHHHHHHHTTSEEEETTTEE
T ss_pred             cCCcHHHHHHHHHHHCCcEEEECCceE
Confidence            445566788889999999999987763


No 34 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=21.97  E-value=69  Score=17.25  Aligned_cols=16  Identities=25%  Similarity=0.497  Sum_probs=13.2

Q ss_pred             CCccccHHHHHHHHHH
Q 029269          129 GNVFKNKRVLMESIHK  144 (196)
Q Consensus       129 Gn~Fknk~~L~e~I~k  144 (196)
                      |-.|.+...|..||..
T Consensus         7 ~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    7 GKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             TEEESSHHHHHHHHHH
T ss_pred             CCccCCHHHHHHHHhH
Confidence            5679999999999863


No 35 
>TIGR00421 ubiX_pad polyprenyl P-hydroxybenzoate and phenylacrylic acid decarboxylases. In E.coli, the protein UbiX (3-octaprenyl-4-hydroxybenzoate carboxy-lyase) has been shown to be involved in the third step of ubiquinone biosynthesis. It catalyzes the reaction [3-octaprenyl-4-hydroxybenzoate = 2-octaprenylphenol + CO2]. The knockout of the homologous protein in yeast confers sensitivity to phenylacrylic acid. Members are not restricted to ubiquinone-synthesizing species. This family represents a distinct clade within the flavoprotein family of Pfam model pfam02441.
Probab=21.96  E-value=1.1e+02  Score=25.57  Aligned_cols=47  Identities=30%  Similarity=0.295  Sum_probs=35.3

Q ss_pred             HHHHHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeecCCCCC
Q 029269            9 RLSASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRKPTKIH   58 (196)
Q Consensus         9 RLAA~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~   58 (196)
                      +.|...|+++. .|.|=|.+.  -.++.+.+++..|-+.|.++.-|..|.
T Consensus       103 ~~a~~~L~~~~-pv~i~P~~m--~~~~~~~~Nl~~L~~~G~~ii~P~~g~  149 (181)
T TIGR00421       103 RAADVCLKERR-KLVLVPRET--PLNSIHLENMLRLSRMGAIILPPMPAF  149 (181)
T ss_pred             HHHHHHHhcCC-CEEEEeCCC--cCCHHHHHHHHHHHHCCCEEECCCCcc
Confidence            44555788776 455557544  458888999999999999999998763


No 36 
>cd04762 HTH_MerR-trunc Helix-Turn-Helix DNA binding domain of truncated MerR-like proteins. Proteins in this family mostly have a truncated helix-turn-helix (HTH) MerR-like domain. They lack a portion of the C-terminal region, called Wing 2 and the long dimerization helix that is typically present in MerR-like proteins. These truncated domains are found in response regulator receiver (REC) domain proteins (i.e., CheY), cytosine-C5 specific DNA methylases, IS607 transposase-like proteins, and RacA, a bacterial protein that anchors chromosomes to cell poles.
Probab=21.68  E-value=76  Score=19.07  Aligned_cols=17  Identities=29%  Similarity=0.432  Sum_probs=14.4

Q ss_pred             hhhHHHHhhhhcCceee
Q 029269           36 NSRQNIRKLVKDGFIIR   52 (196)
Q Consensus        36 ~sR~dIRkLIkdG~I~~   52 (196)
                      +|+..|+.++++|.|..
T Consensus        12 vs~~tl~~~~~~g~~~~   28 (49)
T cd04762          12 VSPSTLRRWVKEGKLKA   28 (49)
T ss_pred             cCHHHHHHHHHcCCCCc
Confidence            68899999999998753


No 37 
>smart00420 HTH_DEOR helix_turn_helix, Deoxyribose operon repressor.
Probab=21.30  E-value=79  Score=19.41  Aligned_cols=25  Identities=24%  Similarity=0.411  Sum_probs=18.0

Q ss_pred             HHhhhhhHHHHhhhhcCceeecCCC
Q 029269           32 ISMANSRQNIRKLVKDGFIIRKPTK   56 (196)
Q Consensus        32 Ia~A~sR~dIRkLIkdG~I~~kp~~   56 (196)
                      ++.+.-+..|..|.+.|+|...+..
T Consensus        26 ~s~~tv~~~l~~L~~~g~i~~~~~~   50 (53)
T smart00420       26 VSEMTIRRDLNKLEEQGLLTRVHGG   50 (53)
T ss_pred             CCHHHHHHHHHHHHHCCCEEEeecC
Confidence            3444557788899999999876543


No 38 
>TIGR02988 YaaA_near_RecF S4 domain protein YaaA. This small protein has a single S4 domain (pfam01479), as do bacterial ribosomal protein S4, some pseudouridine synthases, tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein family are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members of this family that are not near recF are found instead near dnaA and/or dnaN, the usual neighbors of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.
Probab=21.26  E-value=69  Score=21.55  Aligned_cols=20  Identities=5%  Similarity=0.152  Sum_probs=17.2

Q ss_pred             hhhHHHHhhhhcCceeecCC
Q 029269           36 NSRQNIRKLVKDGFIIRKPT   55 (196)
Q Consensus        36 ~sR~dIRkLIkdG~I~~kp~   55 (196)
                      .||..+++||++|.|..--.
T Consensus        22 ~SR~~~k~li~~G~V~VNg~   41 (59)
T TIGR02988        22 DSGGQAKWFLQENEVLVNGE   41 (59)
T ss_pred             cCHHHHHHHHHcCCEEECCE
Confidence            49999999999999988443


No 39 
>KOG1596 consensus Fibrillarin and related nucleolar RNA-binding proteins [RNA processing and modification]
Probab=20.86  E-value=61  Score=30.01  Aligned_cols=71  Identities=23%  Similarity=0.316  Sum_probs=40.7

Q ss_pred             hhHHHHHHHhccCCCceecCccc------------hhHHHhhhhhHHHHhhhhcCceeecCCCCCChhHHHHHHHHHhcC
Q 029269            6 LQKRLSASVLKCGRGKVWLDPNE------------VNEISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARRMKEAKRKG   73 (196)
Q Consensus         6 ~QKRLAA~vL~cGk~rVWlDPne------------~~eIa~A~sR~dIRkLIkdG~I~~kp~~~~SR~R~r~~~~~k~kG   73 (196)
                      .|-.|||.||+ |..-+||.|-.            ++.|++-.-        .+|++.+-.. .|-.||--.+.++|+--
T Consensus       137 frSKLAA~I~g-GvdnihikpGsKVLYLGAasGttVSHvSDiVG--------peG~VYAVEf-s~rsGRdL~nmAkkRtN  206 (317)
T KOG1596|consen  137 FRSKLAAGILG-GVDNIHIKPGSKVLYLGAASGTTVSHVSDIVG--------PEGCVYAVEF-SHRSGRDLINMAKKRTN  206 (317)
T ss_pred             HHHHHHHHhhc-CccceeecCCceEEEeeccCCceeehhhcccC--------CCceEEEEEe-cccchHHHHHHhhccCC
Confidence            46789999986 99999999953            233333222        4566665443 23345655555554310


Q ss_pred             cCCCCCCcccccccCCchh
Q 029269           74 RHSGYGKRKGTREARLPTK   92 (196)
Q Consensus        74 R~~G~GkRkGt~~AR~p~K   92 (196)
                      ==      -=--.||+|.|
T Consensus       207 ii------PIiEDArhP~K  219 (317)
T KOG1596|consen  207 II------PIIEDARHPAK  219 (317)
T ss_pred             ce------eeeccCCCchh
Confidence            00      00135788877


No 40 
>PF14468 DUF4427:  Protein of unknown function (DUF4427)
Probab=20.37  E-value=71  Score=26.39  Aligned_cols=33  Identities=21%  Similarity=0.399  Sum_probs=25.3

Q ss_pred             HHHhccCCCceecCccchhHHHhhhhhHHHHhhhhcCceeec
Q 029269           12 ASVLKCGRGKVWLDPNEVNEISMANSRQNIRKLVKDGFIIRK   53 (196)
Q Consensus        12 A~vL~cGk~rVWlDPne~~eIa~A~sR~dIRkLIkdG~I~~k   53 (196)
                      .-++.-|...||| |+        |+.+=+|.||++|.|+.-
T Consensus        30 ~~~~e~G~~wvWi-~D--------N~~~~vRALl~~grV~v~   62 (132)
T PF14468_consen   30 DYDREFGNAWVWI-HD--------NQSEVVRALLQAGRVKVN   62 (132)
T ss_pred             cchhhcCceEEEE-ec--------CcCHHHHHHHHcCceeec
Confidence            3456779999999 33        456678999999998764


No 41 
>cd01145 TroA_c Periplasmic binding protein TroA_c.  These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea.  They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism.  A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains.
Probab=20.18  E-value=52  Score=27.32  Aligned_cols=19  Identities=37%  Similarity=0.704  Sum_probs=15.8

Q ss_pred             CCceecCccchhHHHhhhh
Q 029269           19 RGKVWLDPNEVNEISMANS   37 (196)
Q Consensus        19 k~rVWlDPne~~eIa~A~s   37 (196)
                      ---||+||+....++.++.
T Consensus       107 dPH~Wldp~~~~~~a~~I~  125 (203)
T cd01145         107 NPHVWLDPNNAPALAKALA  125 (203)
T ss_pred             CcCeecCHHHHHHHHHHHH
Confidence            4468999999998888876


Done!