Query         035398
Match_columns 53
No_of_seqs    31 out of 33
Neff          3.0 
Searched_HMMs 46136
Date          Fri Mar 29 02:43:29 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/035398.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/035398hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF05680 ATP-synt_E:  ATP synth  99.7 6.3E-17 1.4E-21   99.1   3.4   39   11-49      6-44  (86)
  2 KOG4326 Mitochondrial F1F0-ATP  98.5 2.6E-07 5.6E-12   57.7   4.3   32   11-42      8-39  (81)
  3 COG5454 Predicted secreted pro  82.2    0.95 2.1E-05   28.8   1.6   18   20-37     59-76  (89)
  4 KOG3300 NADH:ubiquinone oxidor  72.5      11 0.00024   25.9   4.6   32    9-40     34-66  (146)
  5 PF06783 UPF0239:  Uncharacteri  68.6     8.3 0.00018   24.2   3.2   29    3-36      7-35  (85)
  6 PF03648 Glyco_hydro_67N:  Glyc  67.7     2.7 5.9E-05   25.7   0.9   10   25-34    113-122 (122)
  7 TIGR03725 bact_YeaZ universal   64.9     4.5 9.7E-05   26.5   1.5   37    4-40     63-100 (202)
  8 PF12617 LdpA_C:  Iron-Sulfur b  54.7      10 0.00022   26.5   1.9   37    3-40    100-151 (183)
  9 COG1214 Inactive homolog of me  50.9      13 0.00029   25.2   2.0   30    3-32     66-96  (220)
 10 PF10766 DUF2592:  Protein of u  49.2      19 0.00042   20.1   2.1   21   13-33      8-28  (41)
 11 PHA02047 phage lambda Rz1-like  44.4      72  0.0016   20.7   4.6   25   13-37      4-29  (101)
 12 COG3777 Uncharacterized conser  43.2     7.4 0.00016   28.9  -0.1   24    2-25    156-181 (273)
 13 COG3855 Fbp Uncharacterized pr  40.0      20 0.00043   29.3   1.8   18    6-23    233-251 (648)
 14 PF08636 Pkr1:  ER protein Pkr1  39.5      36 0.00077   20.7   2.5   19    3-24     13-31  (75)
 15 COG3105 Uncharacterized protei  35.2      81  0.0018   21.5   3.9   32   15-46      7-38  (138)
 16 PF01214 CK_II_beta:  Casein ki  34.7      34 0.00074   23.2   2.0   15   25-39     69-83  (184)
 17 PRK01617 hypothetical protein;  33.3      21 0.00046   23.6   0.9   14   15-28     33-46  (154)
 18 PRK04233 hypothetical protein;  33.3      21 0.00045   23.2   0.8   13   16-28     38-50  (129)
 19 PF00814 Peptidase_M22:  Glycop  32.9      21 0.00047   24.6   0.8   39    4-42     59-98  (268)
 20 PRK01752 hypothetical protein;  32.7      22 0.00048   23.7   0.8   14   15-28     36-49  (156)
 21 PRK01842 hypothetical protein;  32.6      22 0.00048   23.8   0.8   13   16-28     53-65  (149)
 22 PRK00183 hypothetical protein;  31.1      24 0.00053   23.6   0.9   14   15-28     33-46  (157)
 23 PF10335 DUF294_C:  Putative nu  29.7      35 0.00076   21.4   1.4   31    9-39     41-71  (145)
 24 smart00735 ZM ZASP-like motif.  29.6      27 0.00059   16.8   0.7   11    3-13      9-19  (26)
 25 PF06305 DUF1049:  Protein of u  29.4      99  0.0022   16.6   3.3   22   21-42     26-51  (68)
 26 PF10947 DUF2628:  Protein of u  28.2      63  0.0014   19.1   2.2   20   20-39     40-59  (108)
 27 TIGR01947 rnfG electron transp  28.2 1.5E+02  0.0033   19.5   4.3   34   13-46      3-36  (186)
 28 KOG2467 Glycine/serine hydroxy  26.3      36 0.00078   27.1   1.1   16    7-24    112-127 (477)
 29 COG3776 Predicted membrane pro  25.9      33 0.00072   21.9   0.8   22   18-43     67-88  (91)
 30 COG2959 HemX Uncharacterized e  24.8 2.7E+02  0.0058   21.8   5.5   30   10-39     31-62  (391)
 31 PF07330 DUF1467:  Protein of u  24.8      69  0.0015   19.6   2.0   36    4-39     36-78  (85)
 32 TIGR03368 cellulose_yhjU cellu  24.6      42 0.00092   26.9   1.3   14    7-21    235-248 (518)
 33 TIGR00329 gcp_kae1 metallohydr  24.5      62  0.0013   22.6   1.9   35    3-37     77-112 (305)
 34 PF03824 NicO:  High-affinity n  24.5      47   0.001   22.7   1.3   14   20-33      2-15  (282)
 35 PRK12705 hypothetical protein;  24.2 2.1E+02  0.0046   22.5   4.9   30   11-40      4-33  (508)
 36 PRK01908 electron transport co  24.1 2.1E+02  0.0046   19.4   4.5   34   13-46      7-40  (205)
 37 PRK11677 hypothetical protein;  23.8      97  0.0021   20.3   2.7   19   21-39      8-26  (134)
 38 PF11658 DUF3260:  Protein of u  23.6      46 0.00099   26.7   1.3   14    7-21    237-250 (518)
 39 PTZ00396 Casein kinase II subu  22.9      71  0.0015   23.0   2.0   15   26-40     91-105 (251)
 40 PRK06531 yajC preprotein trans  22.7      95  0.0021   19.9   2.4    9   21-29      9-17  (113)
 41 KOG4085 Uncharacterized conser  22.6      48   0.001   23.3   1.1   15   20-34    127-141 (175)
 42 PF07047 OPA3:  Optic atrophy 3  22.1 1.5E+02  0.0032   18.8   3.2   26   23-48     87-112 (134)
 43 KOG4236 Serine/threonine prote  22.1      18  0.0004   30.4  -1.2   14   25-38    577-590 (888)
 44 PF01386 Ribosomal_L25p:  Ribos  21.8      26 0.00057   20.6  -0.3   22   10-32     10-31  (88)
 45 PRK06770 hypothetical protein;  21.8      62  0.0013   22.7   1.5   15   20-34     14-28  (180)
 46 TIGR03723 bact_gcp putative gl  21.7      66  0.0014   22.8   1.7   36    3-38     78-114 (314)
 47 PRK12821 aspartyl/glutamyl-tRN  21.3   1E+02  0.0023   24.5   2.8   30    2-38    127-156 (477)
 48 PF02838 Glyco_hydro_20b:  Glyc  20.5      67  0.0015   18.7   1.3   16   24-39     94-109 (124)
 49 PRK10019 nickel/cobalt efflux   20.4      63  0.0014   23.4   1.4   16   19-34     13-28  (279)

No 1  
>PF05680 ATP-synt_E:  ATP synthase E chain;  InterPro: IPR008386 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit E found in the F0 complex of F-ATPases. Mitochondrial F-ATPases can associate together to form dimeric or oligomeric complexes, such interactions involving the physical association of membrane-embedded F0 complexes. In yeast, the F0 complex E subunit appears to play an important role in supporting F-ATPase dimerisation. This subunit is anchored to the inner mitochondrial membrane via its N-terminal region, which is involved in stabilising subunits G and K of the F0 complex. The C-terminal region of subunit E is hydrophilic, protruding into the intermembrane space where it can also help stabilise the F-ATPase dimer complex []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0000276 mitochondrial proton-transporting ATP synthase complex, coupling factor F(o)
Probab=99.66  E-value=6.3e-17  Score=99.11  Aligned_cols=39  Identities=41%  Similarity=0.565  Sum_probs=33.2

Q ss_pred             hhHHHHHHHHhhhhhhhhhhhhhHHHhHHHHhHHHHHHH
Q 035398           11 TSTLALVARVSAFSFGLVYGNIKLKYLKSKAKSLKKAEA   49 (53)
Q Consensus        11 tStlanV~R~SAl~~Gv~YG~~h~~~Lka~~~~~k~~~e   49 (53)
                      .|+++||+|||||++||+||++|+++|++++++++.+++
T Consensus         6 VSp~inv~RySaL~~Gv~YG~~~~~~L~~~~~~~~~~~e   44 (86)
T PF05680_consen    6 VSPLINVLRYSALGLGVVYGAYHQRYLKAKAKKEAAERE   44 (86)
T ss_pred             cCcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence            577899999999999999999999999976666655544


No 2  
>KOG4326 consensus Mitochondrial F1F0-ATP synthase, subunit e [Energy production and conversion]
Probab=98.47  E-value=2.6e-07  Score=57.66  Aligned_cols=32  Identities=34%  Similarity=0.572  Sum_probs=28.5

Q ss_pred             hhHHHHHHHHhhhhhhhhhhhhhHHHhHHHHh
Q 035398           11 TSTLALVARVSAFSFGLVYGNIKLKYLKSKAK   42 (53)
Q Consensus        11 tStlanV~R~SAl~~Gv~YG~~h~~~Lka~~~   42 (53)
                      .|-|+...|||||++||.||+.|++.|+..+.
T Consensus         8 vSPLIkfGRysaL~lGvaYGa~r~~~l~~~~e   39 (81)
T KOG4326|consen    8 VSPLIKFGRYSALSLGVAYGAFRLRQLREYHE   39 (81)
T ss_pred             ecHHHHhhHHHHHHHHHHHhHHHHHHHhHHHH
Confidence            56789999999999999999999999986544


No 3  
>COG5454 Predicted secreted protein [Function unknown]
Probab=82.16  E-value=0.95  Score=28.82  Aligned_cols=18  Identities=39%  Similarity=0.519  Sum_probs=15.4

Q ss_pred             HhhhhhhhhhhhhhHHHh
Q 035398           20 VSAFSFGLVYGNIKLKYL   37 (53)
Q Consensus        20 ~SAl~~Gv~YG~~h~~~L   37 (53)
                      .||+.+|++||.+|-.-+
T Consensus        59 ~savi~gl~y~lt~~~g~   76 (89)
T COG5454          59 ASAVIFGLFYGLTHFLGY   76 (89)
T ss_pred             HHHHHHHHHHHHHHHHcC
Confidence            599999999999987644


No 4  
>KOG3300 consensus NADH:ubiquinone oxidoreductase, B16.6 subunit/cell death-regulatory protein [Energy production and conversion; Cell cycle control, cell division, chromosome partitioning]
Probab=72.55  E-value=11  Score=25.85  Aligned_cols=32  Identities=34%  Similarity=0.392  Sum_probs=21.9

Q ss_pred             CChhHHHHHHHHhhhhhhhhh-hhhhHHHhHHH
Q 035398            9 SGTSTLALVARVSAFSFGLVY-GNIKLKYLKSK   40 (53)
Q Consensus         9 sgtStlanV~R~SAl~~Gv~Y-G~~h~~~Lka~   40 (53)
                      ||-++++-|.=+.|.|.+.+| |+-+++.||..
T Consensus        34 Sg~t~~aa~~gatayG~~~~~~~~kk~rr~kiE   66 (146)
T KOG3300|consen   34 SGMTMFAAVSGATAYGMYQVGQGNKKRRRLKIE   66 (146)
T ss_pred             CcchhhhHHHHHHHHHHHHHHhchhHHHHHHHH
Confidence            677777777777777777665 66667776653


No 5  
>PF06783 UPF0239:  Uncharacterised protein family (UPF0239);  InterPro: IPR009621 This is a group of transmembrane proteins of unknown function.; GO: 0016021 integral to membrane
Probab=68.55  E-value=8.3  Score=24.19  Aligned_cols=29  Identities=21%  Similarity=0.181  Sum_probs=20.7

Q ss_pred             CCCCCCCChhHHHHHHHHhhhhhhhhhhhhhHHH
Q 035398            3 PPPGPYSGTSTLALVARVSAFSFGLVYGNIKLKY   36 (53)
Q Consensus         3 ~~~g~ysgtStlanV~R~SAl~~Gv~YG~~h~~~   36 (53)
                      +.|--|--.+-+-||+||     |||.|.+-|-.
T Consensus         7 ~sp~ei~Eet~~e~llRY-----GLf~GAIFQli   35 (85)
T PF06783_consen    7 FSPPEIPEETFFENLLRY-----GLFVGAIFQLI   35 (85)
T ss_pred             CCCccCCcchHHHHHHHH-----HHHHHHHHHHH
Confidence            445555556677899997     88888887753


No 6  
>PF03648 Glyco_hydro_67N:  Glycosyl hydrolase family 67 N-terminus;  InterPro: IPR005154 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This represents a family of alpha-glucuronidases (GH67 from CAZY). Deletion mutants have indicated that the central region is responsible for the catalytic activity. Within this central domain, the invariant Glu and Asp (residues 391 and 364 respectively from Bacillus stearothermophilus (Geobacillus stearothermophilus)) are thought to from the the catalytic centre [].; GO: 0046559 alpha-glucuronidase activity, 0045493 xylan catabolic process; PDB: 1GQL_A 1GQI_B 1GQJ_B 1GQK_A 1H41_A 1MQR_A 1K9F_A 1K9E_A 1L8N_A 1K9D_A ....
Probab=67.69  E-value=2.7  Score=25.70  Aligned_cols=10  Identities=40%  Similarity=0.826  Sum_probs=8.3

Q ss_pred             hhhhhhhhhH
Q 035398           25 FGLVYGNIKL   34 (53)
Q Consensus        25 ~Gv~YG~~h~   34 (53)
                      -|++||.+|+
T Consensus       113 ~G~LYGvF~l  122 (122)
T PF03648_consen  113 RGVLYGVFHL  122 (122)
T ss_dssp             HHHHHHHHHH
T ss_pred             cEEEEEEeeC
Confidence            4899999985


No 7  
>TIGR03725 bact_YeaZ universal bacterial protein YeaZ. This family describes a protein family, YeaZ, that appears to be universal in bacteria, but whose function is unknown. This family is related to the gcp (glycoprotease) protein family, also universal in bacteria and unknown in function. In Gram-positive lineages, members of these two related families often belong to the same operon, along with the ribosomal-protein-alanine acetyltransferase gene. Members of this family may occur as fusions with gcp or the ribosomal protein N-acetyltransferase rimI, and is frequently encoded next to rimI.
Probab=64.88  E-value=4.5  Score=26.54  Aligned_cols=37  Identities=24%  Similarity=0.420  Sum_probs=31.6

Q ss_pred             CCCCCCChhHHHHHHHHhhhhhhh-hhhhhhHHHhHHH
Q 035398            4 PPGPYSGTSTLALVARVSAFSFGL-VYGNIKLKYLKSK   40 (53)
Q Consensus         4 ~~g~ysgtStlanV~R~SAl~~Gv-~YG~~h~~~Lka~   40 (53)
                      .||.|.|.=.++-++|--|++.++ ++|.-++..|-.+
T Consensus        63 GPGSfTGlRig~~~akgla~~~~~p~~~vssL~~lA~~  100 (202)
T TIGR03725        63 GPGSFTGLRIGLATAKGLALALGIPLVGVSSLEALAAQ  100 (202)
T ss_pred             CCChHHhHHHHHHHHHHHHHHhCCCEEecCHHHHHHhh
Confidence            599999999999999999999997 7888887777543


No 8  
>PF12617 LdpA_C:  Iron-Sulfur binding protein C terminal;  InterPro: IPR021039  This entry represents the C-terminal region of the iron-sulphur protein LdpA (Light dependent period), which is found in phototropic organisms. LdpA was originally identified in cyanobacteria where it is involved in light-dependent modulation of the circadian clock. The presence of iron-sulphur clusters on LdpA suggests that it may modulate the circadian clock as an indirect function of light intensity by sensing changes in cellular physiology []. 
Probab=54.67  E-value=10  Score=26.46  Aligned_cols=37  Identities=41%  Similarity=0.632  Sum_probs=26.9

Q ss_pred             CCCCCC---CCh--hHHHHHHHH----------hhhhhhhhhhhhhHHHhHHH
Q 035398            3 PPPGPY---SGT--STLALVARV----------SAFSFGLVYGNIKLKYLKSK   40 (53)
Q Consensus         3 ~~~g~y---sgt--StlanV~R~----------SAl~~Gv~YG~~h~~~Lka~   40 (53)
                      -|||++   .||  .| +.-+|-          +.+..||-||.|=|+.|+--
T Consensus       100 ~~PG~vQLAGGTN~~T-v~~Lk~~gl~~~~~~~~~~iaGVAyGsyARkll~p~  151 (183)
T PF12617_consen  100 KPPGPVQLAGGTNAHT-VEKLKQMGLLQPPPSSSAFIAGVAYGSYARKLLQPI  151 (183)
T ss_pred             CCCCceEecCccchhH-HHHHHHccCcCCCccccCCCceeeehHHHHHHHHHH
Confidence            377865   344  34 455566          77999999999999998853


No 9  
>COG1214 Inactive homolog of metal-dependent proteases, putative molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=50.89  E-value=13  Score=25.21  Aligned_cols=30  Identities=20%  Similarity=0.355  Sum_probs=25.0

Q ss_pred             CCCCCCCChhHHHHHHHHhhhhhhh-hhhhh
Q 035398            3 PPPGPYSGTSTLALVARVSAFSFGL-VYGNI   32 (53)
Q Consensus         3 ~~~g~ysgtStlanV~R~SAl~~Gv-~YG~~   32 (53)
                      -.||.|.|.=.++=+++--|+++|+ ++|.-
T Consensus        66 ~GPGSFTGlRIG~~~AkgLA~~l~iplvgvs   96 (220)
T COG1214          66 KGPGSFTGLRIGVAFAKGLALALNIPLVGVS   96 (220)
T ss_pred             cCCCcccchhhHHHHHHHHHHHcCCCEEEeC
Confidence            4699999999999999998888887 56653


No 10 
>PF10766 DUF2592:  Protein of unknown function (DUF2592);  InterPro: IPR019702  This entry represents proteins with unknown function, and appear to be restricted to Enterobacteriaceae. Some members are annotated as ybhY. 
Probab=49.19  E-value=19  Score=20.10  Aligned_cols=21  Identities=33%  Similarity=0.548  Sum_probs=16.5

Q ss_pred             HHHHHHHHhhhhhhhhhhhhh
Q 035398           13 TLALVARVSAFSFGLVYGNIK   33 (53)
Q Consensus        13 tlanV~R~SAl~~Gv~YG~~h   33 (53)
                      +++.|-=.-|+.+|++||.=.
T Consensus         8 a~iMVPVvma~ilglIyGlGe   28 (41)
T PF10766_consen    8 AVIMVPVVMALILGLIYGLGE   28 (41)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            356677788999999999643


No 11 
>PHA02047 phage lambda Rz1-like protein
Probab=44.44  E-value=72  Score=20.70  Aligned_cols=25  Identities=20%  Similarity=0.441  Sum_probs=17.9

Q ss_pred             HHHHHHHHhhhhhhhhhhhhh-HHHh
Q 035398           13 TLALVARVSAFSFGLVYGNIK-LKYL   37 (53)
Q Consensus        13 tlanV~R~SAl~~Gv~YG~~h-~~~L   37 (53)
                      |++.++=.-++++|-.||+.. .+.|
T Consensus         4 t~~~~~~~v~~~~g~~y~~~~~~r~~   29 (101)
T PHA02047          4 TIVAILVLVVVALGASYGFVQSYRAL   29 (101)
T ss_pred             hHHHHHHHHHHHhhhhHHHHHHHHHH
Confidence            445556667788899999988 5554


No 12 
>COG3777 Uncharacterized conserved protein [Function unknown]
Probab=43.18  E-value=7.4  Score=28.88  Aligned_cols=24  Identities=42%  Similarity=0.553  Sum_probs=18.2

Q ss_pred             CCCCCCC--CChhHHHHHHHHhhhhh
Q 035398            2 APPPGPY--SGTSTLALVARVSAFSF   25 (53)
Q Consensus         2 ~~~~g~y--sgtStlanV~R~SAl~~   25 (53)
                      +.|+|+.  -=|+|=+++.||||+.|
T Consensus       156 ~~p~~~w~~~~tptpvllfrYsaltf  181 (273)
T COG3777         156 AEPNGKWLKNFTPTPVLLFRYSALTF  181 (273)
T ss_pred             CCCCCchhhcCCCCchheeehhhhcc
Confidence            3466666  45677789999999987


No 13 
>COG3855 Fbp Uncharacterized protein conserved in bacteria [Carbohydrate transport and metabolism]
Probab=40.03  E-value=20  Score=29.34  Aligned_cols=18  Identities=39%  Similarity=0.501  Sum_probs=15.3

Q ss_pred             CCCCChhH-HHHHHHHhhh
Q 035398            6 GPYSGTST-LALVARVSAF   23 (53)
Q Consensus         6 g~ysgtSt-lanV~R~SAl   23 (53)
                      |.|||++. +|||.|.+|=
T Consensus       233 gA~sGs~vc~aNvIRIcaR  251 (648)
T COG3855         233 GAASGSKVCMANVIRICAR  251 (648)
T ss_pred             ecccCChHHHHHHHHHHHh
Confidence            78999874 7899999984


No 14 
>PF08636 Pkr1:  ER protein Pkr1;  InterPro: IPR013945  Pkr1 has been identified as an ER protein of unknown function. 
Probab=39.51  E-value=36  Score=20.67  Aligned_cols=19  Identities=37%  Similarity=0.441  Sum_probs=14.9

Q ss_pred             CCCCCCCChhHHHHHHHHhhhh
Q 035398            3 PPPGPYSGTSTLALVARVSAFS   24 (53)
Q Consensus         3 ~~~g~ysgtStlanV~R~SAl~   24 (53)
                      ++||+   |+++....+.|...
T Consensus        13 ftPG~---tp~li~a~n~sF~~   31 (75)
T PF08636_consen   13 FTPGT---TPTLIIATNVSFAA   31 (75)
T ss_pred             cCCCC---ChHHHHHHHHHHHH
Confidence            78998   68888888887644


No 15 
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.25  E-value=81  Score=21.45  Aligned_cols=32  Identities=16%  Similarity=0.261  Sum_probs=23.2

Q ss_pred             HHHHHHhhhhhhhhhhhhhHHHhHHHHhHHHH
Q 035398           15 ALVARVSAFSFGLVYGNIKLKYLKSKAKSLKK   46 (53)
Q Consensus        15 anV~R~SAl~~Gv~YG~~h~~~Lka~~~~~k~   46 (53)
                      .-++=..+|++||+-|++-.|-.+-+.+.+++
T Consensus         7 ~W~~a~igLvvGi~IG~li~Rlt~~~~k~q~~   38 (138)
T COG3105           7 TWEYALIGLVVGIIIGALIARLTNRKLKQQQK   38 (138)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHcchhhhhHHH
Confidence            44556678999999999888777666555433


No 16 
>PF01214 CK_II_beta:  Casein kinase II regulatory subunit;  InterPro: IPR000704 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Casein kinase, a ubiquitous, well-conserved protein kinase involved in cell metabolism and differentiation, is characterised by its preference for Ser or Thr in acidic stretches of amino acids. The enzyme is a tetramer of 2 alpha- and 2 beta-subunits [, ]. However, some species (e.g., mammals) possess 2 related forms of the alpha-subunit (alpha and alpha'), while others (e.g., fungi) possess 2 related beta-subunits (beta and beta') []. The alpha-subunit is the catalytic unit and contains regions characteristic of serine/threonine protein kinases. The beta-subunit is believed to be regulatory, possessing an N-terminal auto-phosphorylation site, an internal acidic domain, and a potential metal-binding motif []. The beta subunit is a highly conserved protein of about 25kDa that contains, in its central section, a cysteine-rich motif, CX(n)C, that could be involved in binding a metal such as zinc []. The mammalian beta-subunit gene promoter shares common features with those of other mammalian protein kinases and is closely related to the promoter of the regulatory subunit of cAMP-dependent protein kinase [].; GO: 0019887 protein kinase regulator activity, 0005956 protein kinase CK2 complex; PDB: 2R6M_B 1RQF_K 1DS5_G 1QF8_B 3EED_A 4DGL_A 1JWH_D.
Probab=34.72  E-value=34  Score=23.20  Aligned_cols=15  Identities=27%  Similarity=0.645  Sum_probs=12.6

Q ss_pred             hhhhhhhhhHHHhHH
Q 035398           25 FGLVYGNIKLKYLKS   39 (53)
Q Consensus        25 ~Gv~YG~~h~~~Lka   39 (53)
                      +-.+||.+|+|.|-.
T Consensus        69 a~~LYGLIHaRyI~T   83 (184)
T PF01214_consen   69 AEMLYGLIHARYILT   83 (184)
T ss_dssp             HHHHHHHHHHHHTTS
T ss_pred             HHHHHhhhHHHHhhc
Confidence            457899999999875


No 17 
>PRK01617 hypothetical protein; Provisional
Probab=33.35  E-value=21  Score=23.61  Aligned_cols=14  Identities=21%  Similarity=0.226  Sum_probs=11.3

Q ss_pred             HHHHHHhhhhhhhh
Q 035398           15 ALVARVSAFSFGLV   28 (53)
Q Consensus        15 anV~R~SAl~~Gv~   28 (53)
                      ..=.|||||..|.+
T Consensus        33 LMRSRYsAy~~~~~   46 (154)
T PRK01617         33 LMRSRYCAFVMKDA   46 (154)
T ss_pred             HHHHHHHHHHhccc
Confidence            45589999999875


No 18 
>PRK04233 hypothetical protein; Provisional
Probab=33.33  E-value=21  Score=23.19  Aligned_cols=13  Identities=31%  Similarity=0.192  Sum_probs=10.9

Q ss_pred             HHHHHhhhhhhhh
Q 035398           16 LVARVSAFSFGLV   28 (53)
Q Consensus        16 nV~R~SAl~~Gv~   28 (53)
                      .=.|||||++|-+
T Consensus        38 MRSRYsAfv~~~~   50 (129)
T PRK04233         38 MRARYSAYVRRDA   50 (129)
T ss_pred             HHHHHHHHHhCcc
Confidence            4579999999986


No 19 
>PF00814 Peptidase_M22:  Glycoprotease family;  InterPro: IPR000905 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase (3.4.24.57 from EC) from Pasteurella haemolytica (Mannheimia haemolytica). O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues []. Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 [, ]. Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion [, ]. Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis []. The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2A6A_A 2GEL_G 2GEM_B 1OKJ_B 3ENO_A 3EN9_B 2VWB_B 3ENH_B 2IVO_D 2IVP_A ....
Probab=32.90  E-value=21  Score=24.58  Aligned_cols=39  Identities=21%  Similarity=0.306  Sum_probs=30.7

Q ss_pred             CCCCCCChhHHHHHHHHhhhhhhh-hhhhhhHHHhHHHHh
Q 035398            4 PPGPYSGTSTLALVARVSAFSFGL-VYGNIKLKYLKSKAK   42 (53)
Q Consensus         4 ~~g~ysgtStlanV~R~SAl~~Gv-~YG~~h~~~Lka~~~   42 (53)
                      .||.|.|--.++.+++--|+.+++ +||.-|+..+-....
T Consensus        59 GPGsftgLrvG~~~Ak~La~~~~~Pli~v~~l~a~a~~~~   98 (268)
T PF00814_consen   59 GPGSFTGLRVGLSFAKGLALALNIPLIGVSHLEAHALSAR   98 (268)
T ss_dssp             ESS-HHHHHHHHHHHHHHHHHTT--EEEEEHHHHHHHHHH
T ss_pred             CCCcccccHHHHHHHHHHHHHhCCCeEeeccHHHHHHhHh
Confidence            489999999999999999999998 788888877765443


No 20 
>PRK01752 hypothetical protein; Provisional
Probab=32.66  E-value=22  Score=23.74  Aligned_cols=14  Identities=14%  Similarity=0.309  Sum_probs=11.6

Q ss_pred             HHHHHHhhhhhhhh
Q 035398           15 ALVARVSAFSFGLV   28 (53)
Q Consensus        15 anV~R~SAl~~Gv~   28 (53)
                      ..=.|||||..|.+
T Consensus        36 LMRSRYSAy~~~~~   49 (156)
T PRK01752         36 LMRSRYAAYVLKNI   49 (156)
T ss_pred             HHHHHHHHHHhccc
Confidence            35689999999976


No 21 
>PRK01842 hypothetical protein; Provisional
Probab=32.64  E-value=22  Score=23.77  Aligned_cols=13  Identities=31%  Similarity=0.554  Sum_probs=11.0

Q ss_pred             HHHHHhhhhhhhh
Q 035398           16 LVARVSAFSFGLV   28 (53)
Q Consensus        16 nV~R~SAl~~Gv~   28 (53)
                      .=.|||||.+|-+
T Consensus        53 MRSRYSAy~l~~~   65 (149)
T PRK01842         53 MRSRYSAYVLGAT   65 (149)
T ss_pred             HHHHHHHHHhccc
Confidence            4579999999975


No 22 
>PRK00183 hypothetical protein; Provisional
Probab=31.06  E-value=24  Score=23.60  Aligned_cols=14  Identities=43%  Similarity=0.691  Sum_probs=11.2

Q ss_pred             HHHHHHhhhhhhhh
Q 035398           15 ALVARVSAFSFGLV   28 (53)
Q Consensus        15 anV~R~SAl~~Gv~   28 (53)
                      ..=.|||||.+|.+
T Consensus        33 LMRSRYsAf~~~~~   46 (157)
T PRK00183         33 LMRSRYSAYVLGLV   46 (157)
T ss_pred             HHHHHHHHHHhccc
Confidence            34579999999975


No 23 
>PF10335 DUF294_C:  Putative nucleotidyltransferase substrate binding domain;  InterPro: IPR018821  This entry is found associated with presumed nucleotidyltransferase domains and seems to be distantly related to other helical substrate binding domains. 
Probab=29.67  E-value=35  Score=21.44  Aligned_cols=31  Identities=16%  Similarity=0.124  Sum_probs=23.7

Q ss_pred             CChhHHHHHHHHhhhhhhhhhhhhhHHHhHH
Q 035398            9 SGTSTLALVARVSAFSFGLVYGNIKLKYLKS   39 (53)
Q Consensus         9 sgtStlanV~R~SAl~~Gv~YG~~h~~~Lka   39 (53)
                      .|..-+|+.+|+-||-.|+.=.+...|.-..
T Consensus        41 ~gl~piv~~~R~lAL~~gi~~~~T~eRl~~L   71 (145)
T PF10335_consen   41 GGLFPIVNAARLLALRHGISETSTLERLEAL   71 (145)
T ss_pred             hhHHHHHHHHHHHHHHcCCCCCCHHHHHHHH
Confidence            3667789999999999998777666665443


No 24 
>smart00735 ZM ZASP-like motif. Short motif (26 amino acids) present in an alpha-actinin-binding protein, ZASP, and similar molecules.
Probab=29.58  E-value=27  Score=16.83  Aligned_cols=11  Identities=36%  Similarity=0.616  Sum_probs=7.8

Q ss_pred             CCCCCCCChhH
Q 035398            3 PPPGPYSGTST   13 (53)
Q Consensus         3 ~~~g~ysgtSt   13 (53)
                      -|||+||-.+.
T Consensus         9 ~P~glys~~n~   19 (26)
T smart00735        9 SPIGLYSSENI   19 (26)
T ss_pred             CCCCCCCcccH
Confidence            48999976554


No 25 
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=29.41  E-value=99  Score=16.56  Aligned_cols=22  Identities=18%  Similarity=0.223  Sum_probs=9.5

Q ss_pred             hhhhhhhh----hhhhhHHHhHHHHh
Q 035398           21 SAFSFGLV----YGNIKLKYLKSKAK   42 (53)
Q Consensus        21 SAl~~Gv~----YG~~h~~~Lka~~~   42 (53)
                      .+|++|++    ++......++.+.+
T Consensus        26 ~~f~~G~llg~l~~~~~~~~~r~~~~   51 (68)
T PF06305_consen   26 IAFLLGALLGWLLSLPSRLRLRRRIR   51 (68)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34444444    44444444443333


No 26 
>PF10947 DUF2628:  Protein of unknown function (DUF2628)    ;  InterPro: IPR024399 Some members in this family of proteins have been annotated as YigF. Their function is currently unknown.
Probab=28.24  E-value=63  Score=19.12  Aligned_cols=20  Identities=25%  Similarity=0.270  Sum_probs=16.6

Q ss_pred             HhhhhhhhhhhhhhHHHhHH
Q 035398           20 VSAFSFGLVYGNIKLKYLKS   39 (53)
Q Consensus        20 ~SAl~~Gv~YG~~h~~~Lka   39 (53)
                      |.||.+|.+|-.+++-.+.+
T Consensus        40 w~Af~f~~~w~l~r~mw~~~   59 (108)
T PF10947_consen   40 WWAFFFGPLWLLYRKMWLYA   59 (108)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            67999999999998876554


No 27 
>TIGR01947 rnfG electron transport complex, RnfABCDGE type, G subunit. The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation. A closely related complex in E. coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized. This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase, but is distinct. This model describes the A subunit.
Probab=28.23  E-value=1.5e+02  Score=19.47  Aligned_cols=34  Identities=32%  Similarity=0.177  Sum_probs=25.2

Q ss_pred             HHHHHHHHhhhhhhhhhhhhhHHHhHHHHhHHHH
Q 035398           13 TLALVARVSAFSFGLVYGNIKLKYLKSKAKSLKK   46 (53)
Q Consensus        13 tlanV~R~SAl~~Gv~YG~~h~~~Lka~~~~~k~   46 (53)
                      .|+.+.=.|++.++++|...+-++-+.+.+..++
T Consensus         3 ~L~~i~~v~~~~La~v~~~T~~~I~~~~~~~~~~   36 (186)
T TIGR01947         3 ILGLFAAVSAGLLALVNQFTKEVIAEAEAKQQLE   36 (186)
T ss_pred             hHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence            3556667789999999999998887765554443


No 28 
>KOG2467 consensus Glycine/serine hydroxymethyltransferase [Amino acid transport and metabolism]
Probab=26.29  E-value=36  Score=27.09  Aligned_cols=16  Identities=38%  Similarity=0.561  Sum_probs=13.6

Q ss_pred             CCCChhHHHHHHHHhhhh
Q 035398            7 PYSGTSTLALVARVSAFS   24 (53)
Q Consensus         7 ~ysgtStlanV~R~SAl~   24 (53)
                      ||||+++  |.+=|.|+.
T Consensus       112 p~SGSPA--NfavYtall  127 (477)
T KOG2467|consen  112 PYSGSPA--NFAVYTALL  127 (477)
T ss_pred             cCCCCch--hhHHHhhhc
Confidence            8999997  888888874


No 29 
>COG3776 Predicted membrane protein [Function unknown]
Probab=25.92  E-value=33  Score=21.89  Aligned_cols=22  Identities=23%  Similarity=0.375  Sum_probs=16.1

Q ss_pred             HHHhhhhhhhhhhhhhHHHhHHHHhH
Q 035398           18 ARVSAFSFGLVYGNIKLKYLKSKAKS   43 (53)
Q Consensus        18 ~R~SAl~~Gv~YG~~h~~~Lka~~~~   43 (53)
                      =|++-|.|||    .|+..++++.+.
T Consensus        67 ek~~IllFGV----f~Ll~~~~k~~~   88 (91)
T COG3776          67 EKLQILLFGV----FELLAWQKKFKM   88 (91)
T ss_pred             HHHHHHHHHH----HHHHHHHHhccc
Confidence            4777777766    589999987654


No 30 
>COG2959 HemX Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=24.85  E-value=2.7e+02  Score=21.81  Aligned_cols=30  Identities=23%  Similarity=0.184  Sum_probs=14.5

Q ss_pred             ChhHHHHHHHHhhh--hhhhhhhhhhHHHhHH
Q 035398           10 GTSTLALVARVSAF--SFGLVYGNIKLKYLKS   39 (53)
Q Consensus        10 gtStlanV~R~SAl--~~Gv~YG~~h~~~Lka   39 (53)
                      ++.-++.++=+-||  |.|++|+-.-|..++.
T Consensus        31 ~g~~l~~~all~aLgLGagg~~f~QqQ~~~~~   62 (391)
T COG2959          31 AGLLLSLAALLLALGLGAGGYYFGQQQNVLQT   62 (391)
T ss_pred             chhHHHHHHHHHHHHhchhHHHHHHHHHHHHH
Confidence            33444455555554  4555555544444443


No 31 
>PF07330 DUF1467:  Protein of unknown function (DUF1467);  InterPro: IPR009935 This family consists of several bacterial proteins of around 90 residues in length. The function of this family is unknown.
Probab=24.78  E-value=69  Score=19.59  Aligned_cols=36  Identities=19%  Similarity=0.337  Sum_probs=26.1

Q ss_pred             CCCCCCChhHHHHHHH-------HhhhhhhhhhhhhhHHHhHH
Q 035398            4 PPGPYSGTSTLALVAR-------VSAFSFGLVYGNIKLKYLKS   39 (53)
Q Consensus         4 ~~g~ysgtStlanV~R-------~SAl~~Gv~YG~~h~~~Lka   39 (53)
                      .||.=+|.+.-.+..|       .|++.++++|.++.-..+.-
T Consensus        36 v~Gt~~sAP~~~~l~rk~~~TTiiaavi~~~~~~~~~~g~i~~   78 (85)
T PF07330_consen   36 VPGTDPSAPANPRLKRKALITTIIAAVIFAIIYLIIVSGWISL   78 (85)
T ss_pred             CCCCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHhccccH
Confidence            4677777776666665       58899999999887655543


No 32 
>TIGR03368 cellulose_yhjU cellulose synthase operon protein YhjU. This protein was identified by the partial phylogenetic profiling algorithm (PubMed:16930487) as part of the system for cellulose biosynthesis in bacteria, and in fact is found in cellulose biosynthesis gene regions. The protein was designated YhjU in Salmonella enteritidis, where disruption of its gene disrupts cellulose biosynthesis and biofilm formation (PubMed:11929533).
Probab=24.57  E-value=42  Score=26.89  Aligned_cols=14  Identities=36%  Similarity=0.558  Sum_probs=12.8

Q ss_pred             CCCChhHHHHHHHHh
Q 035398            7 PYSGTSTLALVARVS   21 (53)
Q Consensus         7 ~ysgtStlanV~R~S   21 (53)
                      .|||..+ .+++|+|
T Consensus       235 SYSGPAa-IRlLRAS  248 (518)
T TIGR03368       235 SYSGPAA-IRLLRAS  248 (518)
T ss_pred             ccchHHH-HHHHHhc
Confidence            7999988 7999998


No 33 
>TIGR00329 gcp_kae1 metallohydrolase, glycoprotease/Kae1 family. This subfamily includes the well-studied secreted O-sialoglycoprotein endopeptidase (glycoprotease, EC 3.4.24.57) of Pasteurella haemolytica, a pathogen. A member from Riemerella anatipestifer, associated with cohemolysin activity, likewise is exported without benefit of a classical signal peptide and shows glycoprotease activity on the test substrate glycophorin. However, archaeal members of this subfamily show unrelated activities as demonstrated in Pyrococcus abyssi: DNA binding, iron binding, apurinic endonuclease activity, genomic association with a kinase domain, and no glycoprotease activity. This family thus pulls together a set of proteins as a homology group that appears to be near-universal in life, yet heterogeneous in assayed function between bacteria and archaea.
Probab=24.49  E-value=62  Score=22.63  Aligned_cols=35  Identities=26%  Similarity=0.117  Sum_probs=27.1

Q ss_pred             CCCCCCCChhHHHHHHHHhhhhhhh-hhhhhhHHHh
Q 035398            3 PPPGPYSGTSTLALVARVSAFSFGL-VYGNIKLKYL   37 (53)
Q Consensus         3 ~~~g~ysgtStlanV~R~SAl~~Gv-~YG~~h~~~L   37 (53)
                      -.||.|.|-..++.++|--|+.+++ ++|+-|+..+
T Consensus        77 ~GPG~~tglrvg~~~Ak~la~~~~~p~~~v~hl~~h  112 (305)
T TIGR00329        77 QGPGLGGSLRVGATFARSLALSLDKPLIGVNHLLGH  112 (305)
T ss_pred             cCCCchhhHHHHHHHHHHHHHHhCCCEeecccHHHH
Confidence            3699999999999999999998886 4555555443


No 34 
>PF03824 NicO:  High-affinity nickel-transport protein;  InterPro: IPR011541 High affinity nickel transporters are involved in the incorporation of nickel into H2-uptake hydrogenase [, ] and urease [] enzymes and are essential for the expression of catalytically active hydrogenase and urease. Ion uptake is dependent on proton motive force. HoxN in Ralstonia eutropha (Alcaligenes eutrophus) is thought to be an integral membrane protein with seven transmembrane helices []. The family also includes a cobalt transporter. ; GO: 0046872 metal ion binding, 0030001 metal ion transport, 0055085 transmembrane transport, 0016021 integral to membrane
Probab=24.46  E-value=47  Score=22.65  Aligned_cols=14  Identities=21%  Similarity=0.567  Sum_probs=9.8

Q ss_pred             Hhhhhhhhhhhhhh
Q 035398           20 VSAFSFGLVYGNIK   33 (53)
Q Consensus        20 ~SAl~~Gv~YG~~h   33 (53)
                      ++.+.++++||..|
T Consensus         2 l~ll~laf~~G~~H   15 (282)
T PF03824_consen    2 LSLLLLAFLYGLLH   15 (282)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45667777777775


No 35 
>PRK12705 hypothetical protein; Provisional
Probab=24.16  E-value=2.1e+02  Score=22.49  Aligned_cols=30  Identities=13%  Similarity=0.155  Sum_probs=24.3

Q ss_pred             hhHHHHHHHHhhhhhhhhhhhhhHHHhHHH
Q 035398           11 TSTLALVARVSAFSFGLVYGNIKLKYLKSK   40 (53)
Q Consensus        11 tStlanV~R~SAl~~Gv~YG~~h~~~Lka~   40 (53)
                      |.-++.++=.+.|.+|++|-+++.+.+..+
T Consensus         4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   33 (508)
T PRK12705          4 SILLVILLLLIGLLLGVLVVLLKKRQRLAK   33 (508)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455778888999999999999888876655


No 36 
>PRK01908 electron transport complex protein RnfG; Validated
Probab=24.06  E-value=2.1e+02  Score=19.39  Aligned_cols=34  Identities=26%  Similarity=0.077  Sum_probs=25.6

Q ss_pred             HHHHHHHHhhhhhhhhhhhhhHHHhHHHHhHHHH
Q 035398           13 TLALVARVSAFSFGLVYGNIKLKYLKSKAKSLKK   46 (53)
Q Consensus        13 tlanV~R~SAl~~Gv~YG~~h~~~Lka~~~~~k~   46 (53)
                      .|..+.=.|++.++++|...+-++-+++++..++
T Consensus         7 ~L~~i~~i~~~lLa~v~~~T~~~I~~~~~~~~~~   40 (205)
T PRK01908          7 TLALFAALCTGLTAVVNQLTKDTIAEQAALQQKA   40 (205)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence            4566777888999999999998887765554443


No 37 
>PRK11677 hypothetical protein; Provisional
Probab=23.80  E-value=97  Score=20.32  Aligned_cols=19  Identities=11%  Similarity=0.482  Sum_probs=13.2

Q ss_pred             hhhhhhhhhhhhhHHHhHH
Q 035398           21 SAFSFGLVYGNIKLKYLKS   39 (53)
Q Consensus        21 SAl~~Gv~YG~~h~~~Lka   39 (53)
                      .+|++|++=|++=.|.+..
T Consensus         8 i~livG~iiG~~~~R~~~~   26 (134)
T PRK11677          8 IGLVVGIIIGAVAMRFGNR   26 (134)
T ss_pred             HHHHHHHHHHHHHHhhccc
Confidence            5778888888776665443


No 38 
>PF11658 DUF3260:  Protein of unknown function (DUF3260);  InterPro: IPR017744 This protein was identified by the partial phylogenetic profiling algorithm [] as part of the system for cellulose biosynthesis in bacteria, and in fact is found in cellulose biosynthesis gene regions. The protein was designated YhjU in Salmonella enteritidis, where disruption of its gene disrupts cellulose biosynthesis and biofilm formation [].
Probab=23.60  E-value=46  Score=26.65  Aligned_cols=14  Identities=36%  Similarity=0.558  Sum_probs=12.7

Q ss_pred             CCCChhHHHHHHHHh
Q 035398            7 PYSGTSTLALVARVS   21 (53)
Q Consensus         7 ~ysgtStlanV~R~S   21 (53)
                      .|||..+ ++++|+|
T Consensus       237 SYSGPAa-IRLLRAS  250 (518)
T PF11658_consen  237 SYSGPAA-IRLLRAS  250 (518)
T ss_pred             ccchHHH-HHHHHhc
Confidence            6999988 7999998


No 39 
>PTZ00396 Casein kinase II subunit beta; Provisional
Probab=22.89  E-value=71  Score=23.03  Aligned_cols=15  Identities=20%  Similarity=0.547  Sum_probs=12.2

Q ss_pred             hhhhhhhhHHHhHHH
Q 035398           26 GLVYGNIKLKYLKSK   40 (53)
Q Consensus        26 Gv~YG~~h~~~Lka~   40 (53)
                      -.+||.||.|+|...
T Consensus        91 ~~LYGLIHARyI~T~  105 (251)
T PTZ00396         91 SDLYGLIHARFITTP  105 (251)
T ss_pred             HHHHHHHhHhHhcCH
Confidence            378999999998653


No 40 
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=22.75  E-value=95  Score=19.86  Aligned_cols=9  Identities=22%  Similarity=0.623  Sum_probs=6.1

Q ss_pred             hhhhhhhhh
Q 035398           21 SAFSFGLVY   29 (53)
Q Consensus        21 SAl~~Gv~Y   29 (53)
                      -++.++++|
T Consensus         9 ~vv~~~i~y   17 (113)
T PRK06531          9 FVVMLGLIF   17 (113)
T ss_pred             HHHHHHHHH
Confidence            456677777


No 41 
>KOG4085 consensus Uncharacterized conserved protein [Function unknown]
Probab=22.61  E-value=48  Score=23.28  Aligned_cols=15  Identities=33%  Similarity=0.616  Sum_probs=11.7

Q ss_pred             HhhhhhhhhhhhhhH
Q 035398           20 VSAFSFGLVYGNIKL   34 (53)
Q Consensus        20 ~SAl~~Gv~YG~~h~   34 (53)
                      -.+|+-||+||.+-+
T Consensus       127 g~iFatGV~YG~mAL  141 (175)
T KOG4085|consen  127 GIIFATGVLYGLMAL  141 (175)
T ss_pred             chhhhhHHHHHHHHh
Confidence            457888999998754


No 42 
>PF07047 OPA3:  Optic atrophy 3 protein (OPA3);  InterPro: IPR010754 OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity []. This family consists of several optic atrophy 3 (OPA3) proteins and related proteins from other eukaryotic species, the function is unknown.
Probab=22.12  E-value=1.5e+02  Score=18.78  Aligned_cols=26  Identities=19%  Similarity=0.051  Sum_probs=17.4

Q ss_pred             hhhhhhhhhhhHHHhHHHHhHHHHHH
Q 035398           23 FSFGLVYGNIKLKYLKSKAKSLKKAE   48 (53)
Q Consensus        23 l~~Gv~YG~~h~~~Lka~~~~~k~~~   48 (53)
                      .+.|+++.-+.++.-+.+.+++..++
T Consensus        87 Va~~li~~E~~Rs~~ke~~Ke~~~~~  112 (134)
T PF07047_consen   87 VAAGLIIYEYWRSARKEAKKEEELQE  112 (134)
T ss_pred             HHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence            45677777888887776665555444


No 43 
>KOG4236 consensus Serine/threonine protein kinase PKC mu/PKD and related proteins [Signal transduction mechanisms]
Probab=22.06  E-value=18  Score=30.35  Aligned_cols=14  Identities=43%  Similarity=0.629  Sum_probs=11.8

Q ss_pred             hhhhhhhhhHHHhH
Q 035398           25 FGLVYGNIKLKYLK   38 (53)
Q Consensus        25 ~Gv~YG~~h~~~Lk   38 (53)
                      ||+|||-+|+++=+
T Consensus       577 FG~VYgg~hRktGr  590 (888)
T KOG4236|consen  577 FGTVYGGKHRKTGR  590 (888)
T ss_pred             ceeeecceecccCc
Confidence            79999999998743


No 44 
>PF01386 Ribosomal_L25p:  Ribosomal L25p family;  InterPro: IPR020055 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ].  This entry models the short-form of the ribosomal L25 protein. The long-form has homology to the general stress protein Ctc of Bacillus subtilis, a mesophile, and ribosomal protein TL5 of Thermus thermophilus, a thermophile. Ribosomal protein L25 of Escherichia coli and Haemophilus influenzae appear to be orthologous but consist only of the N-terminal half of Ctc and TL5. Both short (L25-like) and full-length (CTC-like) members of this family bind the E-loop of bacterial 5S rRNA.; GO: 0003735 structural constituent of ribosome, 0008097 5S rRNA binding, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2HGQ_Y 2HGJ_Y 2Y19_Z 2WRL_Z 3KIT_Z 2WH2_Z 2WDN_Z 3V25_Z 3HUZ_Z 3KIY_Z ....
Probab=21.80  E-value=26  Score=20.58  Aligned_cols=22  Identities=27%  Similarity=0.261  Sum_probs=18.4

Q ss_pred             ChhHHHHHHHHhhhhhhhhhhhh
Q 035398           10 GTSTLALVARVSAFSFGLVYGNI   32 (53)
Q Consensus        10 gtStlanV~R~SAl~~Gv~YG~~   32 (53)
                      |++. ++-+|.+-+.=||+||-=
T Consensus        10 gk~~-~r~LR~~G~iPaviYG~~   31 (88)
T PF01386_consen   10 GKSA-ARRLRREGKIPAVIYGKG   31 (88)
T ss_dssp             SSSH-HHHHHHTTEEEEEEEESS
T ss_pred             CCHH-HHHHHHcCCceEEEECCC
Confidence            4555 799999999999999954


No 45 
>PRK06770 hypothetical protein; Provisional
Probab=21.75  E-value=62  Score=22.65  Aligned_cols=15  Identities=20%  Similarity=0.612  Sum_probs=11.3

Q ss_pred             HhhhhhhhhhhhhhH
Q 035398           20 VSAFSFGLVYGNIKL   34 (53)
Q Consensus        20 ~SAl~~Gv~YG~~h~   34 (53)
                      .-.|++|++||.+..
T Consensus        14 m~vl~~g~t~G~l~y   28 (180)
T PRK06770         14 MAVLGIGVTYGMLYY   28 (180)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            346789999997765


No 46 
>TIGR03723 bact_gcp putative glycoprotease GCP. This model represents bacterial members of a protein family that is widely distributed. In a few pathogenic species, the protein is exported in a way that may represent an exceptional secondary function. This model plus companion (archaeal) model TIGR03722 together span the prokaryotic member sequences of TIGR00329, a protein family that appears universal in life, and whose broad function is unknown. A member of TIGR03722 has been characterized as a DNA-binding protein with apurinic endopeptidase activity. In contrast, the rare characterized members of the present family show O-sialoglycoprotein endopeptidase (EC. 3.4.24.57) activity after export. These include glycoprotease (gcp) from Pasteurella haemolytica A1 and a cohemolysin from Riemerella anatipestifer (GB|AAG39646.1). The member from Staphylococcus aureus is essential and is related to cell wall dynamics and the modulation of autolysis, but members are also found in the Mycoplasmas
Probab=21.66  E-value=66  Score=22.75  Aligned_cols=36  Identities=17%  Similarity=0.099  Sum_probs=29.1

Q ss_pred             CCCCCCCChhHHHHHHHHhhhhhhh-hhhhhhHHHhH
Q 035398            3 PPPGPYSGTSTLALVARVSAFSFGL-VYGNIKLKYLK   38 (53)
Q Consensus         3 ~~~g~ysgtStlanV~R~SAl~~Gv-~YG~~h~~~Lk   38 (53)
                      --||.|.|--.++.++|--|+..++ ++|.-|+..+-
T Consensus        78 ~GPGsftglrig~~~Ak~la~~~~~p~~~v~h~~aha  114 (314)
T TIGR03723        78 AGPGLIGALLVGVSFAKALALALNKPLIGVNHLEGHL  114 (314)
T ss_pred             cCCChHHhHHHHHHHHHHHHHHhCCCEEecccHHHHH
Confidence            3699999999999999999998886 67776666553


No 47 
>PRK12821 aspartyl/glutamyl-tRNA amidotransferase subunit C-like protein; Provisional
Probab=21.30  E-value=1e+02  Score=24.54  Aligned_cols=30  Identities=20%  Similarity=0.258  Sum_probs=22.3

Q ss_pred             CCCCCCCCChhHHHHHHHHhhhhhhhhhhhhhHHHhH
Q 035398            2 APPPGPYSGTSTLALVARVSAFSFGLVYGNIKLKYLK   38 (53)
Q Consensus         2 ~~~~g~ysgtStlanV~R~SAl~~Gv~YG~~h~~~Lk   38 (53)
                      -+|||+|-=-=|+      ||+..|++||... +.|.
T Consensus       127 if~~G~fFpGFTL------sA~L~GlIyGl~r-~~l~  156 (477)
T PRK12821        127 IFSGGVFNYGYVL------GAILTGMIAGILR-EVLI  156 (477)
T ss_pred             hcCCCCccHHHHH------HHHHHHHHHHHHH-HHHH
Confidence            4578888766664      7999999999875 3443


No 48 
>PF02838 Glyco_hydro_20b:  Glycosyl hydrolase family 20, domain 2;  InterPro: IPR015882 Glycoside hydrolase family 20 GH20 from CAZY comprises enzymes with several known activities; beta-hexosaminidase (3.2.1.52 from EC); lacto-N-biosidase (3.2.1.140 from EC). Carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base in this family of enzymes. In the brain and other tissues, beta-hexosaminidase A degrades GM2 gangliosides; specifically, the enzyme hydrolyses terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides. There are 3 forms of beta-hexosaminidase: hexosaminidase A is a trimer, with one alpha, one beta-A and one beta-B chain; hexosaminidase B is a tetramer of two beta-A and two beta-B chains; and hexosaminidase S is a homodimer of alpha chains. The two beta chains are derived from the cleavage of a precursor. Mutations in the beta-chain lead to Sandhoff disease, a lysosomal storage disorder characterised by accumulation of GM2 ganglioside []. This entry represents the alpha and beta subunit of beta-N-acetylhexosaminidase. It contains a similar fold but lacks the catalytic centre.; GO: 0004563 beta-N-acetylhexosaminidase activity, 0005975 carbohydrate metabolic process; PDB: 3SUT_A 3SUS_A 3GH7_A 3SUR_A 3SUW_A 3SUV_A 3SUU_A 3GH5_A 3GH4_A 2J4G_B ....
Probab=20.46  E-value=67  Score=18.74  Aligned_cols=16  Identities=38%  Similarity=0.524  Sum_probs=11.7

Q ss_pred             hhhhhhhhhhHHHhHH
Q 035398           24 SFGLVYGNIKLKYLKS   39 (53)
Q Consensus        24 ~~Gv~YG~~h~~~Lka   39 (53)
                      ..|+|||..-++-|-.
T Consensus        94 ~~G~~yg~qTL~Qll~  109 (124)
T PF02838_consen   94 PAGLFYGLQTLRQLLR  109 (124)
T ss_dssp             HHHHHHHHHHHHHHSB
T ss_pred             chHHHHHHHHHHHHhh
Confidence            4699999987766543


No 49 
>PRK10019 nickel/cobalt efflux protein RcnA; Provisional
Probab=20.38  E-value=63  Score=23.42  Aligned_cols=16  Identities=6%  Similarity=0.017  Sum_probs=12.4

Q ss_pred             HHhhhhhhhhhhhhhH
Q 035398           19 RVSAFSFGLVYGNIKL   34 (53)
Q Consensus        19 R~SAl~~Gv~YG~~h~   34 (53)
                      =|+-++++++||+.|=
T Consensus        13 ~~~l~~~~f~yG~~HA   28 (279)
T PRK10019         13 AWFFIPSAILLGALHG   28 (279)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3667788999999873


Done!