Query         008936
Match_columns 548
No_of_seqs    46 out of 48
Neff          2.8 
Searched_HMMs 46136
Date          Thu Mar 28 18:24:37 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/008936.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/008936hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PRK11637 AmiB activator; Provi  78.1      39 0.00084   35.8  12.7   78   54-131    45-135 (428)
  2 PF10186 Atg14:  UV radiation r  69.7      61  0.0013   31.4  10.9   90   57-147    28-125 (302)
  3 PLN03144 Carbon catabolite rep  69.6     4.7  0.0001   45.6   3.8   35  349-383   171-205 (606)
  4 PF14988 DUF4515:  Domain of un  68.8      23 0.00049   35.0   7.8   76   59-134    81-174 (206)
  5 PF12128 DUF3584:  Protein of u  64.5 1.2E+02  0.0026   36.8  14.1  106   57-163   789-894 (1201)
  6 PF04012 PspA_IM30:  PspA/IM30   64.2      37 0.00081   32.5   8.2  117   44-191    19-139 (221)
  7 PRK09039 hypothetical protein;  60.0      68  0.0015   33.8   9.8  122   58-191    48-171 (343)
  8 COG1704 LemA Uncharacterized c  56.7      23 0.00051   35.1   5.5   47  133-191    91-138 (185)
  9 TIGR03185 DNA_S_dndD DNA sulfu  55.7      57  0.0012   36.6   8.9   59   51-109   393-453 (650)
 10 PRK10698 phage shock protein P  55.6      87  0.0019   31.1   9.3   76   45-125    21-101 (222)
 11 TIGR01005 eps_transp_fam exopo  54.4 1.1E+02  0.0024   34.7  10.9  109   62-170   200-329 (754)
 12 KOG0977 Nuclear envelope prote  52.1      79  0.0017   35.9   9.3   40   62-101    41-80  (546)
 13 PRK09239 chorismate mutase; Pr  49.3   2E+02  0.0042   25.8   9.9   94   88-191     7-100 (104)
 14 PRK14153 heat shock protein Gr  47.9   2E+02  0.0042   28.7  10.2   65  139-211    82-150 (194)
 15 TIGR02977 phageshock_pspA phag  45.8 1.9E+02  0.0042   28.3   9.9   54   44-106    20-74  (219)
 16 COG2900 SlyX Uncharacterized p  43.8      48   0.001   28.8   4.7   37   86-125     3-41  (72)
 17 PF15294 Leu_zip:  Leucine zipp  42.1 2.6E+02  0.0056   29.5  10.5   41   58-98    134-174 (278)
 18 KOG0978 E3 ubiquitin ligase in  41.4 2.3E+02  0.0049   33.4  10.9  109   80-191   499-621 (698)
 19 KOG0250 DNA repair protein RAD  40.7 3.3E+02  0.0072   33.6  12.4  139   49-191   734-891 (1074)
 20 COG1579 Zn-ribbon protein, pos  40.3 1.6E+02  0.0035   30.2   8.6   92   61-161    36-128 (239)
 21 PRK14148 heat shock protein Gr  39.9 3.1E+02  0.0068   27.3  10.3   64  139-210    89-156 (195)
 22 PF11464 Rbsn:  Rabenosyn Rab b  39.0      69  0.0015   25.2   4.5   40   63-104     3-42  (42)
 23 KOG3091 Nuclear pore complex,   39.0      75  0.0016   35.9   6.5   80   47-128   346-446 (508)
 24 PRK14154 heat shock protein Gr  37.7 3.4E+02  0.0075   27.4  10.3  152   19-211    11-171 (208)
 25 COG4942 Membrane-bound metallo  37.7 5.7E+02   0.012   28.6  12.7   93   58-158    40-132 (420)
 26 PRK14140 heat shock protein Gr  37.0 4.1E+02  0.0089   26.4  10.6   65  139-211    86-154 (191)
 27 PF10186 Atg14:  UV radiation r  36.6 3.7E+02   0.008   26.1  10.2   51   57-107    57-107 (302)
 28 PF01895 PhoU:  PhoU domain;  I  36.6 1.2E+02  0.0027   23.5   5.8   50   52-101    24-73  (88)
 29 KOG1222 Kinesin associated pro  36.5 1.4E+02  0.0031   34.4   8.2  128   23-178   231-359 (791)
 30 PRK10361 DNA recombination pro  36.2 3.6E+02  0.0078   30.4  11.1   57   59-116    56-112 (475)
 31 TIGR03185 DNA_S_dndD DNA sulfu  35.9 1.3E+02  0.0027   33.9   7.8   23  128-150   274-296 (650)
 32 PLN02939 transferase, transfer  35.4 2.5E+02  0.0055   34.2  10.4  100   50-153   129-280 (977)
 33 TIGR01076 sortase_fam LPXTG-si  34.6      39 0.00085   30.5   3.0   43  322-365    42-85  (136)
 34 PRK14154 heat shock protein Gr  33.0 2.2E+02  0.0048   28.7   8.2   26   47-72     50-75  (208)
 35 PRK14162 heat shock protein Gr  32.9 4.7E+02    0.01   26.1  10.3   64  140-211    89-156 (194)
 36 PF04156 IncA:  IncA protein;    32.7 2.4E+02  0.0052   26.3   8.0   27  162-188   156-182 (191)
 37 PRK02119 hypothetical protein;  32.6      98  0.0021   26.2   4.9   37   84-123     2-40  (73)
 38 PRK02793 phi X174 lysis protei  32.0   1E+02  0.0022   25.9   4.9   33   88-123     5-39  (72)
 39 TIGR01795 CM_mono_cladeE monof  31.5 3.6E+02  0.0077   23.7   8.3   84   91-187     3-89  (94)
 40 TIGR02169 SMC_prok_A chromosom  31.2 4.1E+02  0.0089   30.8  10.9   18  398-415   605-622 (1164)
 41 PRK11020 hypothetical protein;  30.5      77  0.0017   29.7   4.2   40   67-109    16-60  (118)
 42 PRK14139 heat shock protein Gr  30.1 4.1E+02  0.0088   26.3   9.3   64  140-211    82-147 (185)
 43 cd05827 Sortase_C_3 Sortase C   29.4      53  0.0011   29.3   2.9   44  321-365    44-88  (131)
 44 COG1196 Smc Chromosome segrega  29.4   4E+02  0.0088   32.3  10.9   61   59-119   372-432 (1163)
 45 PF11932 DUF3450:  Protein of u  29.2 5.3E+02   0.011   25.6  10.1   84   68-152    40-127 (251)
 46 PF14048 MBD_C:  C-terminal dom  29.2      40 0.00087   30.1   2.1   20   56-75     76-95  (96)
 47 PF01025 GrpE:  GrpE;  InterPro  28.5 2.3E+02  0.0051   25.9   7.0   53  137-191    58-114 (165)
 48 PF14389 Lzipper-MIP1:  Leucine  27.9      83  0.0018   27.3   3.8   36   43-78     48-83  (88)
 49 PRK14146 heat shock protein Gr  27.6 6.4E+02   0.014   25.5  10.4   64  140-211   104-171 (215)
 50 PRK06342 transcription elongat  27.6      80  0.0017   30.2   4.0   42   51-97     32-77  (160)
 51 PF00038 Filament:  Intermediat  27.4 3.8E+02  0.0082   26.8   8.8   50   58-107   211-260 (312)
 52 PRK00295 hypothetical protein;  27.2 1.4E+02  0.0031   24.9   4.9   31   89-122     3-35  (68)
 53 PF07989 Microtub_assoc:  Micro  26.8 1.5E+02  0.0032   25.3   5.0   63   58-144     2-64  (75)
 54 PF04849 HAP1_N:  HAP1 N-termin  26.6 1.4E+02   0.003   31.8   5.8   48   59-106   230-277 (306)
 55 PRK14149 heat shock protein Gr  26.3 3.8E+02  0.0082   26.7   8.4   64  139-211    85-152 (191)
 56 PF04012 PspA_IM30:  PspA/IM30   25.7 1.2E+02  0.0026   29.1   4.9   48   51-98     86-133 (221)
 57 PF09726 Macoilin:  Transmembra  25.6 1.8E+02  0.0039   33.8   7.0   67   54-127   543-611 (697)
 58 PRK11281 hypothetical protein;  25.6 1.9E+02  0.0041   35.6   7.3   61   49-110    66-147 (1113)
 59 PRK14151 heat shock protein Gr  25.5 6.3E+02   0.014   24.7   9.6   64  140-211    70-138 (176)
 60 PRK11281 hypothetical protein;  25.4 3.4E+02  0.0075   33.4   9.4   71   88-158   357-459 (1113)
 61 COG1579 Zn-ribbon protein, pos  25.2 3.5E+02  0.0077   27.9   8.2   47   56-102    59-114 (239)
 62 PRK04406 hypothetical protein;  24.7 1.7E+02  0.0037   25.0   5.0   32   88-122     8-41  (75)
 63 PRK14160 heat shock protein Gr  23.6 6.2E+02   0.013   25.6   9.4   63  139-211   110-174 (211)
 64 PLN02524 S-adenosylmethionine   23.4 1.6E+02  0.0035   31.9   5.7   87  324-448   228-315 (355)
 65 KOG3083 Prohibitin [Posttransl  22.9      77  0.0017   33.0   3.1   41  103-143   141-199 (271)
 66 PLN02542 fructose-1,6-bisphosp  22.9   1E+02  0.0022   33.9   4.2   80  319-416   234-314 (412)
 67 PF08838 DUF1811:  Protein of u  22.4      87  0.0019   28.7   3.0   33   60-94     11-43  (102)
 68 PRK14141 heat shock protein Gr  22.1 7.8E+02   0.017   24.9   9.8   64  139-210    80-153 (209)
 69 PF07926 TPR_MLP1_2:  TPR/MLP1/  22.0   5E+02   0.011   23.6   7.8   67   62-128    58-124 (132)
 70 KOG3990 Uncharacterized conser  21.9 2.7E+02  0.0059   29.6   6.7   29   46-74    215-243 (305)
 71 PF13600 DUF4140:  N-terminal d  21.5 1.3E+02  0.0027   25.8   3.7   36   61-96     68-103 (104)
 72 KOG3620 Uncharacterized conser  21.4 1.7E+02  0.0037   36.7   5.8   75  413-489    40-131 (1626)
 73 cd05829 Sortase_E Sortase E (S  21.3 1.9E+02  0.0042   26.5   5.1   51  316-366    42-96  (144)
 74 TIGR02168 SMC_prok_B chromosom  21.3 1.1E+03   0.023   27.4  11.8    8  310-317  1071-1078(1179)
 75 PRK09293 fructose-1,6-bisphosp  21.3 1.1E+02  0.0024   32.3   3.9   76  319-413   145-221 (327)
 76 PRK14143 heat shock protein Gr  21.2 8.9E+02   0.019   24.9  10.1   64  139-210   116-184 (238)
 77 PRK02224 chromosome segregatio  21.0 8.3E+02   0.018   28.2  10.9   35  475-509   767-801 (880)
 78 PF13874 Nup54:  Nucleoporin co  20.8 4.3E+02  0.0092   24.4   7.2   79   52-140    33-113 (141)
 79 PF06034 DUF919:  Nucleopolyhed  20.7 1.9E+02  0.0042   24.4   4.4   43   58-100     6-54  (62)
 80 PRK04325 hypothetical protein;  20.7 2.2E+02  0.0048   24.1   4.9   32   89-123     7-40  (74)
 81 PF04201 TPD52:  Tumour protein  20.6 2.1E+02  0.0045   28.2   5.3   51   44-108    24-79  (162)
 82 PF11932 DUF3450:  Protein of u  20.6   5E+02   0.011   25.8   8.1   38  135-180   125-162 (251)
 83 KOG2896 UV radiation resistanc  20.6 3.5E+02  0.0075   29.9   7.4   44   92-135   142-192 (377)
 84 PF00038 Filament:  Intermediat  20.2 3.6E+02  0.0079   27.0   7.1   43   62-104    53-95  (312)
 85 PF04011 LemA:  LemA family;  I  20.1 5.2E+02   0.011   24.5   7.8   48  132-191    90-138 (186)

No 1  
>PRK11637 AmiB activator; Provisional
Probab=78.13  E-value=39  Score=35.81  Aligned_cols=78  Identities=15%  Similarity=0.159  Sum_probs=47.1

Q ss_pred             HHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHH-------------HHHHHHHHHHhhh
Q 008936           54 LYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEA-------------ITSALNELARRKG  120 (548)
Q Consensus        54 L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdl-------------VdAASKaLSyRQd  120 (548)
                      +-.+.++.+++|.++.+.|.+..-+..++.++...|+++|+.+.-.++..|+++             |+.+-+.|..+++
T Consensus        45 ~~~~l~~l~~qi~~~~~~i~~~~~~~~~~~~~l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~  124 (428)
T PRK11637         45 NRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQER  124 (428)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334555666666666666666666666677777777777766666665555443             2333445555556


Q ss_pred             hHhhhhhhhhh
Q 008936          121 VLEENLKLAHD  131 (548)
Q Consensus       121 iiEENiRLtya  131 (548)
                      .+.+=+|-.|.
T Consensus       125 ~l~~rlra~Y~  135 (428)
T PRK11637        125 LLAAQLDAAFR  135 (428)
T ss_pred             HHHHHHHHHHH
Confidence            66666666665


No 2  
>PF10186 Atg14:  UV radiation resistance protein and autophagy-related subunit 14;  InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 [].  The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=69.75  E-value=61  Score=31.43  Aligned_cols=90  Identities=18%  Similarity=0.146  Sum_probs=49.8

Q ss_pred             HhhccHHHHHHHHHHHHHHhh--------hhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhh
Q 008936           57 RARMQKEEIHSLRQQIAVACL--------KELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKL  128 (548)
Q Consensus        57 k~r~~eeEI~~LRk~Ladasv--------KE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRL  128 (548)
                      +...-.++...|+.++.+.--        .-.++..|...++.|++.+|....++++.+-+.=-+.-..|+.+-...-.|
T Consensus        28 ~l~~~~~~~~~l~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~~~~l  107 (302)
T PF10186_consen   28 ELQQLKEENEELRRRIEEILESDSNGQLLEIQQLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQRRSRL  107 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344445555666666666433        334556677778888888888887777665444333333344443334444


Q ss_pred             hhhhhhhhhhhHHHHHHhH
Q 008936          129 AHDLKVAEDERYFFMSSML  147 (548)
Q Consensus       129 tyalq~aeQERstFvSSLL  147 (548)
                      . +.+...+++..++..+.
T Consensus       108 ~-~~~~~~~~~~~~~~~~~  125 (302)
T PF10186_consen  108 S-ASQDLVESRQEQLEELQ  125 (302)
T ss_pred             H-HHHHHHHHHHHHHHHHH
Confidence            4 34444455555554443


No 3  
>PLN03144 Carbon catabolite repressor protein 4 homolog; Provisional
Probab=69.57  E-value=4.7  Score=45.57  Aligned_cols=35  Identities=31%  Similarity=0.568  Sum_probs=29.0

Q ss_pred             ccccccCCCCCcceeccccccceeEEEeEecCCCC
Q 008936          349 TRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQG  383 (548)
Q Consensus       349 s~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~  383 (548)
                      ....|+=.....|+.|+|||+-.|-+||+|+|...
T Consensus       171 ~~~w~~vg~~~~y~p~~~d~g~~lk~~~~~~~~~~  205 (606)
T PLN03144        171 GETWIEVGRSKTYTPTADDVGHVLKFECVVVDAET  205 (606)
T ss_pred             CCceEEeCCCccccCChhhCCceEEEEEEEccccc
Confidence            34445555788999999999999999999998774


No 4  
>PF14988 DUF4515:  Domain of unknown function (DUF4515)
Probab=68.76  E-value=23  Score=35.01  Aligned_cols=76  Identities=26%  Similarity=0.328  Sum_probs=52.4

Q ss_pred             hccHHHHHHHHHHHHHH----hh----hhhhhhcchhhhHHhHhHHHHHH---------hHHhhHHHHHHHHHHH-Hhhh
Q 008936           59 RMQKEEIHSLRQQIAVA----CL----KELQLQNEKYTLERKVSELRMAI---------DEKQNEAITSALNELA-RRKG  120 (548)
Q Consensus        59 r~~eeEI~~LRk~Lada----sv----KE~QllnEKyvLErRiA~mRmAf---------DqqQqdlVdAASKaLS-yRQd  120 (548)
                      ..+|.||..|++.++..    +.    =+.|.+.||.-||+-..+++++-         .++=|-|.-||-++|. +=++
T Consensus        81 ~~qe~eI~~Le~e~~~~~~e~~~~l~~~~~qfl~EK~~LEke~~e~~i~~l~e~a~~el~~k~~ale~~A~~~l~e~~~~  160 (206)
T PF14988_consen   81 EQQEREIQTLEEELEKMRAEHAEKLQEAESQFLQEKARLEKEASELKILQLGERAHKELKKKAQALELAAKKSLDEFTRS  160 (206)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            46788998888887652    22    25789999999999887777651         2223334444555443 5678


Q ss_pred             hHhhhhhhhhhhhh
Q 008936          121 VLEENLKLAHDLKV  134 (548)
Q Consensus       121 iiEENiRLtyalq~  134 (548)
                      |-.||.+|.-+|.-
T Consensus       161 i~~EN~~L~k~L~~  174 (206)
T PF14988_consen  161 IKRENQQLRKELLQ  174 (206)
T ss_pred             HHHHHHHHHHHHHH
Confidence            99999999877654


No 5  
>PF12128 DUF3584:  Protein of unknown function (DUF3584);  InterPro: IPR021979  This family consist of uncharacterised bacterial proteins. 
Probab=64.53  E-value=1.2e+02  Score=36.75  Aligned_cols=106  Identities=23%  Similarity=0.260  Sum_probs=68.2

Q ss_pred             HhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhh
Q 008936           57 RARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAE  136 (548)
Q Consensus        57 k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~ae  136 (548)
                      +++.++.++..-++-+...=.+.-.+..+|-.|+.+++.++-++.+.++++-.. .+++..+..=+|+.+.-.-+.+..-
T Consensus       789 ~ie~~r~~V~eY~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~le~~~~~~~~~~~~~  867 (1201)
T PF12128_consen  789 RIEERRAEVIEYEDWLQEEWDKVDELREEKPELEEQLRDLEQELQELEQELNQL-QKEVKQRRKELEEELKALEEQLEQL  867 (1201)
T ss_pred             HHHHhHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334444444444444444444555677788888888888888888777776543 4455555566666666555555544


Q ss_pred             hhhHHHHHHhHHHhhhcCCCCCcccch
Q 008936          137 DERYFFMSSMLGLLADYGLWPHVTNAS  163 (548)
Q Consensus       137 QERstFvSSLLpLLaEY~LqP~V~DAq  163 (548)
                      ++.-.-+..++.-|+++.+.|+..+|+
T Consensus       868 ~~~l~~l~~~~~~l~~~~~~~~~~~~~  894 (1201)
T PF12128_consen  868 EEQLRRLRDLLEKLAELSEPPNAEDAE  894 (1201)
T ss_pred             HHHHHHHHHHHhhhhhcCCCCCchhhh
Confidence            555555566666678888888888888


No 6  
>PF04012 PspA_IM30:  PspA/IM30 family;  InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=64.24  E-value=37  Score=32.53  Aligned_cols=117  Identities=18%  Similarity=0.239  Sum_probs=72.8

Q ss_pred             ccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHH----HHHHhHHhhHHHHHHHHHHHHhh
Q 008936           44 ISFQDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSEL----RMAIDEKQNEAITSALNELARRK  119 (548)
Q Consensus        44 ~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~m----RmAfDqqQqdlVdAASKaLSyRQ  119 (548)
                      ..+.||+ .-|=.-+|+-+++|..+|+.+|.+...+.++-.+...+++.++.+    +.|...-..||.-.|   |..++
T Consensus        19 d~~EDP~-~~l~q~ird~e~~l~~a~~~~a~~~a~~~~le~~~~~~~~~~~~~~~~A~~Al~~g~edLAr~a---l~~k~   94 (221)
T PF04012_consen   19 DKAEDPE-KMLEQAIRDMEEQLRKARQALARVMANQKRLERKLDEAEEEAEKWEKQAELALAAGREDLAREA---LQRKA   94 (221)
T ss_pred             HhhcCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHH---HHHHH
Confidence            3467888 445567899999999999999998877777666666666655543    456655555554332   22222


Q ss_pred             hhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhcCCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936          120 GVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       120 diiEENiRLtyalq~aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q  191 (548)
                      +.-+              +.    ..         |+..+..+...|..+|.-++.|..||..+..|.....
T Consensus        95 ~~e~--------------~~----~~---------l~~~~~~~~~~~~~l~~~l~~l~~kl~e~k~k~~~l~  139 (221)
T PF04012_consen   95 DLEE--------------QA----ER---------LEQQLDQAEAQVEKLKEQLEELEAKLEELKSKREELK  139 (221)
T ss_pred             HHHH--------------HH----HH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            1111              00    01         1223345666777888888888888877777655444


No 7  
>PRK09039 hypothetical protein; Validated
Probab=60.04  E-value=68  Score=33.75  Aligned_cols=122  Identities=14%  Similarity=0.064  Sum_probs=65.4

Q ss_pred             hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhh-hhhhh
Q 008936           58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHD-LKVAE  136 (548)
Q Consensus        58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtya-lq~ae  136 (548)
                      +...++|+..|..+|++.+.--.==.+....|+.+|+.||--|+.-+.            +.+-+|.-+--.++ ...++
T Consensus        48 i~~~~~eL~~L~~qIa~L~e~L~le~~~~~~l~~~l~~l~~~l~~a~~------------~r~~Le~~~~~~~~~~~~~~  115 (343)
T PRK09039         48 ISGKDSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEA------------ERSRLQALLAELAGAGAAAE  115 (343)
T ss_pred             HhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH------------HHHHHHHHHhhhhhhcchHH
Confidence            456788999999999885432222234456677777777776662111            11122221110111 00011


Q ss_pred             hhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936          137 DERYFFMSSMLGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       137 QERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q  191 (548)
                      ..-...-.-|=-+.++| .-+|.|.-.++=+..+|.=-..|+..|...+.+.++.+
T Consensus       116 ~~~~~l~~~L~~~k~~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~  171 (343)
T PRK09039        116 GRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQ  171 (343)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            11111123344455666 47777777777777777766667777766666655555


No 8  
>COG1704 LemA Uncharacterized conserved protein [Function unknown]
Probab=56.66  E-value=23  Score=35.10  Aligned_cols=47  Identities=21%  Similarity=0.250  Sum_probs=40.1

Q ss_pred             hhhhhhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936          133 KVAEDERYFFMSSMLGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       133 q~aeQERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q  191 (548)
                      |.||.|.+-+.+.|+=+-.-| +|.-.            ..|..||++|..+|.++.-++
T Consensus        91 ~~aq~~Ls~~L~rl~a~~E~YPdLKAn------------~~f~~Lq~ql~~tEn~Ia~aR  138 (185)
T COG1704          91 QEAQAELSSALGRLFAVAEAYPDLKAN------------ENFLELQSQLEGTENRIAVAR  138 (185)
T ss_pred             HHHHHHHHHHHHHHHHHHHhCcchhhh------------hHHHHHHHHHHhHHHHHHHHH
Confidence            788899999999999988889 76543            469999999999999987666


No 9  
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=55.69  E-value=57  Score=36.55  Aligned_cols=59  Identities=20%  Similarity=0.181  Sum_probs=46.6

Q ss_pred             hhhHHHHhhccHHHHHHHHHHHHHHhhhh--hhhhcchhhhHHhHhHHHHHHhHHhhHHHH
Q 008936           51 AMELYSRARMQKEEIHSLRQQIAVACLKE--LQLQNEKYTLERKVSELRMAIDEKQNEAIT  109 (548)
Q Consensus        51 ~~~L~~k~r~~eeEI~~LRk~LadasvKE--~QllnEKyvLErRiA~mRmAfDqqQqdlVd  109 (548)
                      ...|..+.++.|+||..|.++|+.+.-+|  .++..++..++++++.++-.+...++.+-.
T Consensus       393 ~~~~~~~~~~~e~el~~l~~~l~~~~~~e~i~~l~e~l~~l~~~l~~~~~~~~~~~~~~~~  453 (650)
T TIGR03185       393 KSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLET  453 (650)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555667788999999999999875533  356677888999999999999998887753


No 10 
>PRK10698 phage shock protein PspA; Provisional
Probab=55.60  E-value=87  Score=31.09  Aligned_cols=76  Identities=16%  Similarity=0.181  Sum_probs=46.5

Q ss_pred             cccchh-hhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHH----HHHHhHHhhHHHHHHHHHHHHhh
Q 008936           45 SFQDRE-AMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSEL----RMAIDEKQNEAITSALNELARRK  119 (548)
Q Consensus        45 ~~~d~~-~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~m----RmAfDqqQqdlVdAASKaLSyRQ  119 (548)
                      ...||+ ..++  -+++-++.+..+|+-+|.+-..+.++-.+...++.++++.    ++|...-..||   |-+||..++
T Consensus        21 kaEDP~k~l~q--~i~em~~~l~~~r~alA~~~A~~k~~er~~~~~~~~~~~~e~kA~~Al~~G~EdL---Ar~AL~~K~   95 (222)
T PRK10698         21 KAEDPQKLVRL--MIQEMEDTLVEVRSTSARALAEKKQLTRRIEQAEAQQVEWQEKAELALRKEKEDL---ARAALIEKQ   95 (222)
T ss_pred             hhcCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH---HHHHHHHHH
Confidence            456998 5565  4788899999999999887666544444444444433332    44554444444   456677776


Q ss_pred             hhHhhh
Q 008936          120 GVLEEN  125 (548)
Q Consensus       120 diiEEN  125 (548)
                      ..-+.-
T Consensus        96 ~~~~~~  101 (222)
T PRK10698         96 KLTDLI  101 (222)
T ss_pred             HHHHHH
Confidence            665443


No 11 
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=54.39  E-value=1.1e+02  Score=34.66  Aligned_cols=109  Identities=13%  Similarity=0.113  Sum_probs=57.4

Q ss_pred             HHHHHHHHHHHHHHhhhhhhhhcc--------hhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhh--h-----
Q 008936           62 KEEIHSLRQQIAVACLKELQLQNE--------KYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEEN--L-----  126 (548)
Q Consensus        62 eeEI~~LRk~LadasvKE~QllnE--------KyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEEN--i-----  126 (548)
                      ++++..||++|.++-.+=.+...+        .-+.+.+++++.-..-.-|.++++|.++.-..|..+-.-+  .     
T Consensus       200 ~~ql~~l~~~l~~aE~~l~~fk~~~~l~~~~~~~~~~~~L~~l~~ql~~a~~~~~~a~a~~~~l~~~l~~~~~~~~~~~~  279 (754)
T TIGR01005       200 APEIADLSKQSRDAEAEVAAYRAQSDLLMGNNATLATQQLAELNTELSRARANRAAAEGTADSVKKALQNGGSLDVLPEV  279 (754)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCcccCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhh
Confidence            455667777776664443333222        1134588888887777777777776666555444332111  0     


Q ss_pred             -----hhhhhhhhhhhhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHH
Q 008936          127 -----KLAHDLKVAEDERYFFMSSMLGLLADY-GLWPHVTNASAISNTVK  170 (548)
Q Consensus       127 -----RLtyalq~aeQERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvK  170 (548)
                           .+.=.++...++....-+-+--|...| .-+|.|.+++.=+..++
T Consensus       280 ~~~~~~~~~~i~~L~~~l~~l~~~~~~l~~~y~~~hP~v~~l~~qi~~l~  329 (754)
T TIGR01005       280 LSSQLKLEDLIQRLRERQAELRATIADLSTTMLANHPRVVAAKSSLADLD  329 (754)
T ss_pred             hcCcccccHHHHHHHHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHH
Confidence                 000112222333333344444566778 77888877554444443


No 12 
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=52.15  E-value=79  Score=35.95  Aligned_cols=40  Identities=28%  Similarity=0.356  Sum_probs=36.4

Q ss_pred             HHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHh
Q 008936           62 KEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAID  101 (548)
Q Consensus        62 eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfD  101 (548)
                      ++||.+|=.|||+|.=|=-+|=.|.+.||-.|-.+|-.|-
T Consensus        41 K~El~~LNDRLA~YIekVR~LEaqN~~L~~di~~lr~~~~   80 (546)
T KOG0977|consen   41 KKELQELNDRLAVYIEKVRFLEAQNRKLEHDINLLRGVVG   80 (546)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence            4789999999999999999999999999999998887664


No 13 
>PRK09239 chorismate mutase; Provisional
Probab=49.26  E-value=2e+02  Score=25.78  Aligned_cols=94  Identities=15%  Similarity=0.142  Sum_probs=58.1

Q ss_pred             hhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhcCCCCCcccchhhhh
Q 008936           88 TLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISN  167 (548)
Q Consensus        88 vLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvs  167 (548)
                      -+...++++|-.+|+-=..||+.-+|-+++=++|.+-...-...+-..++|... +..+--+.++++|.|.+        
T Consensus         7 ~~~~~L~~lR~~ID~ID~eIv~LLa~R~~l~~~Ia~~K~~~~~~i~dp~RE~~v-l~~~~~~a~~~gl~p~~--------   77 (104)
T PRK09239          7 RAPAELAALRQSIDNIDAALIHMLAERFKCTQAVGVLKAEHGLPPADPAREAYQ-IERLRQLAKDANLDPDF--------   77 (104)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH-HHHHHHHHHHCCCCHHH--------
Confidence            344557777777777666777654444444444433333222222333444433 35566777889999974        


Q ss_pred             hHHHHHHHHHHHHHhhhhhhcccc
Q 008936          168 TVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       168 nvKvLf~hLq~kL~~~e~klke~q  191 (548)
                       ++.+|+.+-+.....++++...+
T Consensus        78 -~~~i~~~ii~esir~q~~i~~~~  100 (104)
T PRK09239         78 -AEKFLNFIIKEVIRHHERIAAEH  100 (104)
T ss_pred             -HHHHHHHHHHHHHHHHHHHHhcc
Confidence             47889999888888888776655


No 14 
>PRK14153 heat shock protein GrpE; Provisional
Probab=47.91  E-value=2e+02  Score=28.73  Aligned_cols=65  Identities=14%  Similarity=0.193  Sum_probs=43.9

Q ss_pred             hHHHHHHhHHHhhhc--CCC--CCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          139 RYFFMSSMLGLLADY--GLW--PHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       139 RstFvSSLLpLLaEY--~Lq--P~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      ...|+..|||.+-.+  -|.  +.-.+..+|+.+|+..+++|..-|.  ...|+++. +.|     ..++|+.|+..
T Consensus        82 ~~~~~~~LLpv~DnLerAl~~~~~~~~~~~l~~Gvemi~k~~~~vL~--k~Gv~~I~-~~G-----~~FDP~~HEAv  150 (194)
T PRK14153         82 LEQVLLDLLEVTDNFERALESARTAEDMNSIVEGIEMVSKQFFSILE--KYGLERIE-CEG-----EEFDPHRHEAM  150 (194)
T ss_pred             HHHHHHHHhhHHhHHHHHHhcccccchHHHHHHHHHHHHHHHHHHHH--HCCCeeeC-CCC-----CCCChhHhcee
Confidence            346889999998877  232  2234568899999999999998885  55556655 123     23667666543


No 15 
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=45.80  E-value=1.9e+02  Score=28.25  Aligned_cols=54  Identities=22%  Similarity=0.271  Sum_probs=35.8

Q ss_pred             ccccchh-hhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhH
Q 008936           44 ISFQDRE-AMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNE  106 (548)
Q Consensus        44 ~~~~d~~-~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqd  106 (548)
                      ....||+ ..+.  -+|+-+++|...|+-||.+-..+       ..|||+++.++...+..++.
T Consensus        20 dk~EDP~~~l~q--~irem~~~l~~ar~~lA~~~a~~-------k~~e~~~~~~~~~~~~~~~~   74 (219)
T TIGR02977        20 DKAEDPEKMIRL--IIQEMEDTLVEVRTTSARTIADK-------KELERRVSRLEAQVADWQEK   74 (219)
T ss_pred             HhccCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHH
Confidence            3457898 5555  46888889999999998876654       45566665555444444433


No 16 
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.81  E-value=48  Score=28.76  Aligned_cols=37  Identities=27%  Similarity=0.443  Sum_probs=29.4

Q ss_pred             hhhhHHhHhH--HHHHHhHHhhHHHHHHHHHHHHhhhhHhhh
Q 008936           86 KYTLERKVSE--LRMAIDEKQNEAITSALNELARRKGVLEEN  125 (548)
Q Consensus        86 KyvLErRiA~--mRmAfDqqQqdlVdAASKaLSyRQdiiEEN  125 (548)
                      ...||+||.+  +|+||   |.+.|+.-|-+|..-|-+|+.+
T Consensus         3 ~~~lE~Ri~eLE~r~Af---QE~tieeLn~~laEq~~~i~k~   41 (72)
T COG2900           3 DMELEARIIELEIRLAF---QEQTIEELNDALAEQQLVIDKL   41 (72)
T ss_pred             hhhHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH
Confidence            3468999986  58898   7778999888888888777754


No 17 
>PF15294 Leu_zip:  Leucine zipper
Probab=42.11  E-value=2.6e+02  Score=29.53  Aligned_cols=41  Identities=22%  Similarity=0.232  Sum_probs=36.1

Q ss_pred             hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHH
Q 008936           58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRM   98 (548)
Q Consensus        58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRm   98 (548)
                      ++.-.+|...||.||...--.=...+.||..|++.+.+||.
T Consensus       134 i~rLq~EN~kLk~rl~~le~~at~~l~Ek~kl~~~L~~lq~  174 (278)
T PF15294_consen  134 IDRLQEENEKLKERLKSLEKQATSALDEKSKLEAQLKELQD  174 (278)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35556788999999999888888899999999999999998


No 18 
>KOG0978 consensus E3 ubiquitin ligase involved in syntaxin degradation [Posttranslational modification, protein turnover, chaperones]
Probab=41.41  E-value=2.3e+02  Score=33.43  Aligned_cols=109  Identities=17%  Similarity=0.159  Sum_probs=76.2

Q ss_pred             hhhhcchhhhHHhHhHHHHHHhHHhhHH-------------HHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHh
Q 008936           80 LQLQNEKYTLERKVSELRMAIDEKQNEA-------------ITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSM  146 (548)
Q Consensus        80 ~QllnEKyvLErRiA~mRmAfDqqQqdl-------------VdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSL  146 (548)
                      --|++||++|+.-|+.|--+.|-.++.+             +..-+|.|.-++..+|.+=|.+=++.---...-.++.. 
T Consensus       499 k~L~~ek~~l~~~i~~l~~~~~~~~~~i~~leeq~~~lt~~~~~l~~el~~~~~~le~~kk~~~e~~~~~~~Lq~~~ek-  577 (698)
T KOG0978|consen  499 KLLREEKSKLEEQILTLKASVDKLELKIGKLEEQERGLTSNESKLIKELTTLTQSLEMLKKKAQEAKQSLEDLQIELEK-  577 (698)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence            3578899999999999887777666543             33456777778888887766654433222222222222 


Q ss_pred             HHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936          147 LGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       147 LpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q  191 (548)
                        .=+.+ .+++.+.|+.+=+...+--.+||++++..+..||....
T Consensus       578 --~~~~le~i~~~~~e~~~ele~~~~k~~rleEE~e~L~~kle~~k  621 (698)
T KOG0978|consen  578 --SEAKLEQIQEQYAELELELEIEKFKRKRLEEELERLKRKLERLK  621 (698)
T ss_pred             --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence              22344 68888888888888888889999999999999988777


No 19 
>KOG0250 consensus DNA repair protein RAD18 (SMC family protein) [Replication, recombination and repair]
Probab=40.71  E-value=3.3e+02  Score=33.58  Aligned_cols=139  Identities=17%  Similarity=0.138  Sum_probs=90.9

Q ss_pred             hhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhH------
Q 008936           49 REAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVL------  122 (548)
Q Consensus        49 ~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdii------  122 (548)
                      +.-.+|+...+..++||......+-+..-+.-+|..|+.-++.-.+.+|.++- ++++..|++--+|..|++-+      
T Consensus       734 ~~~~~l~~ei~~~~~eIe~~~~~~e~l~~e~e~~~~e~~e~~~~~~~~~~~l~-~e~~~l~~l~~el~~r~dk~~s~e~~  812 (1074)
T KOG0250|consen  734 SKLEDLAREIKKKEKEIEEKEAPLEKLKEELEHIELEAQELEEYYAAGREKLQ-GEISKLDALKEELKLREDKLRSAEDE  812 (1074)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhHHHHHHHHHHHHHhhhhhh
Confidence            44567788888899999999888888889999999999999999999999995 55566778878888887755      


Q ss_pred             ----hhhhh-hhhhhhhhhhhhHHHHHHhHHHh-------hhcCCCC-CcccchhhhhhHHHHHHHHHHHHHhhhhhhcc
Q 008936          123 ----EENLK-LAHDLKVAEDERYFFMSSMLGLL-------ADYGLWP-HVTNASAISNTVKHLYDQLQSQIRTSYDRIRD  189 (548)
Q Consensus       123 ----EENiR-Ltyalq~aeQERstFvSSLLpLL-------aEY~LqP-~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke  189 (548)
                          |++++ -.-++.-+|-|   .+..=.|+.       +++-=-| -+.+-+--|.-+|.--+.|+.+++-.++.+.+
T Consensus       813 ~~HyE~~~K~~l~~l~~~E~~---~~~~e~~~~e~~~ka~~~cp~~~~ei~~~~~~~~eik~ei~rlk~~i~~~ee~~~~  889 (1074)
T KOG0250|consen  813 KRHYEDKLKSRLEELKQKEVE---KVNLEEPRAEEDQKARTECPEEGIEIEALGKTVAEIKREIKRLKRQIQMCEESLGE  889 (1074)
T ss_pred             hhhHHHHHHHhhHHHHHHHHH---HHhhhcchhhhCchhhhhCccccchhhcccchHHHHHHHHHHHHHHHHHHHHhcch
Confidence                44555 12222222222   222222222       2221000 11111113566777788888888888877776


Q ss_pred             cc
Q 008936          190 LT  191 (548)
Q Consensus       190 ~q  191 (548)
                      ++
T Consensus       890 ~~  891 (1074)
T KOG0250|consen  890 LE  891 (1074)
T ss_pred             HH
Confidence            66


No 20 
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=40.32  E-value=1.6e+02  Score=30.25  Aligned_cols=92  Identities=17%  Similarity=0.297  Sum_probs=64.4

Q ss_pred             cHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhH
Q 008936           61 QKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERY  140 (548)
Q Consensus        61 ~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERs  140 (548)
                      .+.|+..+++.+.+.=+.=--+-+++.-+|-.|..+|=-.+--+..+ ++++++=-|+.        |..+++.|++.+-
T Consensus        36 ~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl-~~v~~~~e~~a--------L~~E~~~ak~r~~  106 (239)
T COG1579          36 AKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL-SAVKDERELRA--------LNIEIQIAKERIN  106 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hccccHHHHHH--------HHHHHHHHHHHHH
Confidence            34566777777777766666677888888888888875555555555 77766655553        6777888888888


Q ss_pred             HHHHHhHHHhhhc-CCCCCccc
Q 008936          141 FFMSSMLGLLADY-GLWPHVTN  161 (548)
Q Consensus       141 tFvSSLLpLLaEY-~LqP~V~D  161 (548)
                      ..-+.|.+|+-++ .|+=-..+
T Consensus       107 ~le~el~~l~~~~~~l~~~i~~  128 (239)
T COG1579         107 SLEDELAELMEEIEKLEKEIED  128 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            8888888888777 44443333


No 21 
>PRK14148 heat shock protein GrpE; Provisional
Probab=39.88  E-value=3.1e+02  Score=27.30  Aligned_cols=64  Identities=14%  Similarity=0.193  Sum_probs=43.3

Q ss_pred             hHHHHHHhHHHhhhc----CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCC
Q 008936          139 RYFFMSSMLGLLADY----GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGV  210 (548)
Q Consensus       139 RstFvSSLLpLLaEY----~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~  210 (548)
                      ..-|+..|||.+--+    +.-+.-.++.+++.+|+-.+++|..=|.  ..-|+++. +.|     ..++|+.|+-
T Consensus        89 ~~~~~~~LLpV~DnlerAl~~~~~~~~~~~l~~Gv~mi~k~l~~vL~--k~Gv~~I~-~~G-----~~FDP~~HEA  156 (195)
T PRK14148         89 IEKFAKELLPVIDSIEQALKHEVKLEEAIAMKEGIELTAKMLVDILK--KNGVEELD-PKG-----EKFDPNLHEA  156 (195)
T ss_pred             HHHHHHHHhhHHhHHHHHHhccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEeC-CCC-----CCCChhHhhe
Confidence            346889999998877    2234334567899999999999988885  44555554 112     2366776754


No 22 
>PF11464 Rbsn:  Rabenosyn Rab binding domain;  InterPro: IPR021565  Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=39.05  E-value=69  Score=25.19  Aligned_cols=40  Identities=38%  Similarity=0.426  Sum_probs=34.1

Q ss_pred             HHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHh
Q 008936           63 EEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQ  104 (548)
Q Consensus        63 eEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQ  104 (548)
                      |.|..+|..|-+|  |.+.=..|-.+||+-+.+|+-.|++||
T Consensus         3 eQi~~I~~~I~qA--k~~~r~dEV~~L~~NL~EL~~e~~~qq   42 (42)
T PF11464_consen    3 EQINIIESYIKQA--KAARRFDEVATLEENLRELQDEIDEQQ   42 (42)
T ss_dssp             HHHHHHHHHHHHH--HHTT-HHHHHHHHHHHHHHHHHHHHHC
T ss_pred             HHHHHHHHHHHHH--HHhcCcHHHHHHHHHHHHHHHHHHhcC
Confidence            5678888888876  677788999999999999999999987


No 23 
>KOG3091 consensus Nuclear pore complex, p54 component (sc Nup57) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=39.03  E-value=75  Score=35.86  Aligned_cols=80  Identities=26%  Similarity=0.283  Sum_probs=54.9

Q ss_pred             cchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHh---------------------HHHHHHhHHhh
Q 008936           47 QDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVS---------------------ELRMAIDEKQN  105 (548)
Q Consensus        47 ~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA---------------------~mRmAfDqqQq  105 (548)
                      ||-+++.---|++.-.|++..|.||=||+++|=+++.+---.|++||=                     +||.-+|--++
T Consensus       346 Q~q~~~~~r~ri~~i~e~v~eLqk~~ad~~~KI~~~k~r~~~Ls~RiLRv~ikqeilr~~G~~L~~~EE~Lr~Kldtll~  425 (508)
T KOG3091|consen  346 QDQEVKQHRIRINAIGERVTELQKHHADAVAKIEEAKNRHVELSHRILRVMIKQEILRKRGYALTPDEEELRAKLDTLLA  425 (508)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCccHHHHHHHHHHHHH
Confidence            455555444566777788888999999999999999998889999982                     34444444333


Q ss_pred             HHHHHHHHHHHHhhhhHhhhhhh
Q 008936          106 EAITSALNELARRKGVLEENLKL  128 (548)
Q Consensus       106 dlVdAASKaLSyRQdiiEENiRL  128 (548)
                      .+=+=  .+|.-|=+.|-|=+|.
T Consensus       426 ~ln~P--nq~k~Rl~~L~e~~r~  446 (508)
T KOG3091|consen  426 QLNAP--NQLKARLDELYEILRM  446 (508)
T ss_pred             HhcCh--HHHHHHHHHHHHHHHh
Confidence            33222  5677777777666654


No 24 
>PRK14154 heat shock protein GrpE; Provisional
Probab=37.72  E-value=3.4e+02  Score=27.39  Aligned_cols=152  Identities=13%  Similarity=0.149  Sum_probs=82.2

Q ss_pred             CChhhhhhccccc-ccccccccCCCC--ccccchhhhhH-HHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHh
Q 008936           19 KNSDFVNRHKIET-HLAPTKQKEDNF--ISFQDREAMEL-YSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVS   94 (548)
Q Consensus        19 ~~s~~l~r~~~~~-~l~~~~~~~~~~--~~~~d~~~~~L-~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA   94 (548)
                      |-|++-..++.++ .-+..-+.++.-  ..-|.|+-..+ +..+.+.+++|..|+++++++-        ++        
T Consensus        11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~el~~le~e~~elk--------d~--------   74 (208)
T PRK14154         11 PESETKAKNKWEKVMEAEEEQEEGGGDGSQEMEPHREGLEFPSREKLEGQLTRMERKVDEYK--------TQ--------   74 (208)
T ss_pred             chhhhhhhhHHHHHHHHHHHHhhcCCCcccccCcccccccCcchhhHHHHHHHHHHHHHHHH--------HH--------
Confidence            5666666666665 222223333322  33467776666 4455556677777777776542        11        


Q ss_pred             HHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhc--CCC--C-CcccchhhhhhH
Q 008936           95 ELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADY--GLW--P-HVTNASAISNTV  169 (548)
Q Consensus        95 ~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSLLpLLaEY--~Lq--P-~V~DAqSIvsnv  169 (548)
                      +||+.=|-            =.||+..-.|=-      ++-..-..-|+..|||.+-.+  -|.  + .-.+..+|+..|
T Consensus        75 ~lRl~ADf------------eNyRKR~~kE~e------~~~~~a~e~~~~~LLpVlDnLeRAL~~~~~~~~~~~~l~eGv  136 (208)
T PRK14154         75 YLRAQAEM------------DNLRKRIEREKA------DIIKFGSKQLITDLLPVADSLIHGLESPASEDPQVKSMRDGM  136 (208)
T ss_pred             HHHHHHHH------------HHHHHHHHHHHH------HHHHHHHHHHHHHHhhHHhHHHHHHhcccccchhHHHHHHHH
Confidence            23332211            123333322210      111122346888999988777  232  1 223468899999


Q ss_pred             HHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          170 KHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       170 KvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      +-.+++|..-|.  ...|+++.-.+|     ..++|+.|+-.
T Consensus       137 emi~k~l~~vL~--k~GVe~I~~~~G-----~~FDP~~HEAv  171 (208)
T PRK14154        137 SLTLDLLHNTLA--KHGVQVINPNPG-----DPFDPALHEAM  171 (208)
T ss_pred             HHHHHHHHHHHH--HCCCEEecCCCC-----CCCChhHhhee
Confidence            999999988885  555666652234     24666666543


No 25 
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=37.68  E-value=5.7e+02  Score=28.56  Aligned_cols=93  Identities=24%  Similarity=0.282  Sum_probs=52.0

Q ss_pred             hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhh
Q 008936           58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAED  137 (548)
Q Consensus        58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQ  137 (548)
                      .+.-..||..+.+.|++..=+=.+|..+--.||+.|+.++       ..|+..+..-=.-+++|-+=|=+| -+|++.+.
T Consensus        40 l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~i~~~~-------~ql~~s~~~l~~~~~~I~~~~~~l-~~l~~q~r  111 (420)
T COG4942          40 LKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLE-------AQLIETADDLKKLRKQIADLNARL-NALEVQER  111 (420)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHhHHHHHHhhHHHHHHHH-HHHHHHHH
Confidence            3344455556666665555444445445445555554443       334433332222233333333333 36777778


Q ss_pred             hhHHHHHHhHHHhhhcCCCCC
Q 008936          138 ERYFFMSSMLGLLADYGLWPH  158 (548)
Q Consensus       138 ERstFvSSLLpLLaEY~LqP~  158 (548)
                      ||.-+++-+|--+--.+..||
T Consensus       112 ~qr~~La~~L~A~~r~g~~p~  132 (420)
T COG4942         112 EQRRRLAEQLAALQRSGRNPP  132 (420)
T ss_pred             HHHHHHHHHHHHHHhccCCCC
Confidence            888899988888877788885


No 26 
>PRK14140 heat shock protein GrpE; Provisional
Probab=37.01  E-value=4.1e+02  Score=26.42  Aligned_cols=65  Identities=15%  Similarity=0.220  Sum_probs=41.8

Q ss_pred             hHHHHHHhHHHhhhc--CC--CCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          139 RYFFMSSMLGLLADY--GL--WPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       139 RstFvSSLLpLLaEY--~L--qP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      ..-|+..|||.+--+  -|  -|.-.+..+|+.+|+-.+++|..=|.  .-.++.+. ..|     ..++|+.|+.-
T Consensus        86 ~~~~~~~LLpvlDnLerAl~~~~~~~~~~~i~~Gv~mi~k~l~~~L~--k~GV~~i~-~~G-----e~FDP~~HEAv  154 (191)
T PRK14140         86 AQSLASDLLPALDNFERALQIEADDEQTKSLLKGVEMVHRQLLEALK--KEGVEVIE-AVG-----EQFDPNLHQAV  154 (191)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccCccchHHHHHHHHHHHHHHHHHHHH--HCCCEeeC-CCC-----CCCChHHhccc
Confidence            456899999998877  22  24445567888999988888877774  33333332 122     24677777643


No 27 
>PF10186 Atg14:  UV radiation resistance protein and autophagy-related subunit 14;  InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 [].  The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=36.63  E-value=3.7e+02  Score=26.14  Aligned_cols=51  Identities=18%  Similarity=0.287  Sum_probs=21.6

Q ss_pred             HhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHH
Q 008936           57 RARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEA  107 (548)
Q Consensus        57 k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdl  107 (548)
                      .++..+.++..++.+++...-+=.++..+-..+-+|++++|-+.....+.+
T Consensus        57 ~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~~~~l  107 (302)
T PF10186_consen   57 EIQQLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQRRSRL  107 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334444444444444443332223333333344444444444444444433


No 28 
>PF01895 PhoU:  PhoU domain;  InterPro: IPR008170 This family contains phosphate regulatory proteins including PhoU. PhoU proteins are known to play a role in the regulation of phosphate uptake. The PhoU domain is composed of a three helix bundle []. The PhoU protein contains two copies of this domain. The domain binds to an iron cluster via its conserved E/DXXXD motif. Deletion of PhoU activates constitutive expression of the phosphate ABC transporter and allows phosphate transport, but causes a growth defect; suggesting that the protein has some secondary function []. ; PDB: 2I0M_A 1T72_B 1T8B_A 1SUM_B 1VCT_A 2BKN_A 2BKP_A 2BKO_A.
Probab=36.59  E-value=1.2e+02  Score=23.51  Aligned_cols=50  Identities=36%  Similarity=0.302  Sum_probs=36.0

Q ss_pred             hhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHh
Q 008936           52 MELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAID  101 (548)
Q Consensus        52 ~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfD  101 (548)
                      .++..++...|++|..+...+.++.++..+-.+....++.=+.+|+++.+
T Consensus        24 ~~~a~~i~~~e~~id~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~   73 (88)
T PF01895_consen   24 SELAQEIIQLEEEIDELYREIRRQILKILKNQNPLEELRELVGLLRIARD   73 (88)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCGHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHhHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHH
Confidence            46668888999999999999999988877543433335555666666544


No 29 
>KOG1222 consensus Kinesin associated protein KAP [Intracellular trafficking, secretion, and vesicular transport]
Probab=36.49  E-value=1.4e+02  Score=34.36  Aligned_cols=128  Identities=22%  Similarity=0.209  Sum_probs=83.9

Q ss_pred             hhhhccccc-ccccccccCCCCccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHh
Q 008936           23 FVNRHKIET-HLAPTKQKEDNFISFQDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAID  101 (548)
Q Consensus        23 ~l~r~~~~~-~l~~~~~~~~~~~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfD  101 (548)
                      .|+|+.+-- -|..++-.-|     .+|++..|       +|||..|-|+|--+..|..|||             |+||.
T Consensus       231 ElkRye~w~~El~k~krs~d-----e~p~netL-------k~e~dr~~kklk~~~~KQeqLL-------------rva~y  285 (791)
T KOG1222|consen  231 ELKRYEFWIAELKKTKRSTD-----EKPKNETL-------KEEIDRLNKKLKTAIRKQEQLL-------------RVAVY  285 (791)
T ss_pred             HHHHHHHHHHHHhhhhcccc-----cCcchhhH-------HHHHHHHHHHHHHHHHHHHHHH-------------HHHHH
Confidence            567776543 3433332222     68888777       8999999999999999999985             44544


Q ss_pred             HHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhcCCCCCcccchhhhhhHHHHHHHHHH
Q 008936          102 EKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQS  178 (548)
Q Consensus       102 qqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvsnvKvLf~hLq~  178 (548)
                      --=+ |..-.+-+|.-|...|-.  -|--||+..--+..+.|++.|-=|+=|+=---|.-+--||.-+-.||+-=..
T Consensus       286 lLlN-lAed~~~ElKMrrkniV~--mLVKaLdr~n~~Ll~lv~~FLkKLSIf~eNK~~M~~~~iveKL~klfp~~h~  359 (791)
T KOG1222|consen  286 LLLN-LAEDISVELKMRRKNIVA--MLVKALDRSNSSLLTLVIKFLKKLSIFDENKIVMEQNGIVEKLLKLFPIQHP  359 (791)
T ss_pred             HHHH-HhhhhhHHHHHHHHhHHH--HHHHHHcccchHHHHHHHHHHHHhhhhccchHHHHhccHHHHHHHhcCCCCH
Confidence            3222 223334444443322211  3556677777788899999999898887777777777888777777654333


No 30 
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=36.20  E-value=3.6e+02  Score=30.38  Aligned_cols=57  Identities=11%  Similarity=0.154  Sum_probs=33.1

Q ss_pred             hccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHH
Q 008936           59 RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELA  116 (548)
Q Consensus        59 r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLS  116 (548)
                      .+.+++...++.+|...--....+..+...|+.|+...|-+..+|++. ...|.+.|+
T Consensus        56 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~le~~~~~~~ek~~~-l~~~~~~L~  112 (475)
T PRK10361         56 EHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDKIRQ-MINSEQRLS  112 (475)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH
Confidence            344445555555555555555555666667778888888877665543 444444443


No 31 
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=35.93  E-value=1.3e+02  Score=33.93  Aligned_cols=23  Identities=17%  Similarity=0.108  Sum_probs=15.1

Q ss_pred             hhhhhhhhhhhhHHHHHHhHHHh
Q 008936          128 LAHDLKVAEDERYFFMSSMLGLL  150 (548)
Q Consensus       128 Ltyalq~aeQERstFvSSLLpLL  150 (548)
                      +--.++.++++..-|.+..+||+
T Consensus       274 le~e~~e~~~~l~~l~~~~~p~~  296 (650)
T TIGR03185       274 IEAARKANRAQLRELAADPLPLL  296 (650)
T ss_pred             HHHHHHHHHHHHHHHhcccCCHh
Confidence            34455556677777887777774


No 32 
>PLN02939 transferase, transferring glycosyl groups
Probab=35.41  E-value=2.5e+02  Score=34.18  Aligned_cols=100  Identities=25%  Similarity=0.372  Sum_probs=61.6

Q ss_pred             hhhhHHHHhhccHHHHHHHHHHHHHHhhhhh-hhhcchhhhHHhHh--HHHHHHhHHhhH--------------------
Q 008936           50 EAMELYSRARMQKEEIHSLRQQIAVACLKEL-QLQNEKYTLERKVS--ELRMAIDEKQNE--------------------  106 (548)
Q Consensus        50 ~~~~L~~k~r~~eeEI~~LRk~LadasvKE~-QllnEKyvLErRiA--~mRmAfDqqQqd--------------------  106 (548)
                      .-.+|..-++.-|.-|+.|-+ ...-.+.++ .||+||..|.++|.  +||+|=-..+-.                    
T Consensus       129 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  207 (977)
T PLN02939        129 QLEDLVGMIQNAEKNILLLNQ-ARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKLR  207 (977)
T ss_pred             cHHHHHHHHHHHHhhhHhHHH-HHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhhhhhhhccccchhhHHHHHHHh
Confidence            345666667777777766543 233334444 48999999988865  666664211110                    


Q ss_pred             ------------HHHHHHHHHHHhhhhHhhhhhhhhhhhh-----------------hhhhhHHHHHHhHHHhhhc
Q 008936          107 ------------AITSALNELARRKGVLEENLKLAHDLKV-----------------AEDERYFFMSSMLGLLADY  153 (548)
Q Consensus       107 ------------lVdAASKaLSyRQdiiEENiRLtyalq~-----------------aeQERstFvSSLLpLLaEY  153 (548)
                                  -|.+-+++|.   .+-|||+-|--+++.                 .+.||+..=+||-.|=+.+
T Consensus       208 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  280 (977)
T PLN02939        208 NELLIRGATEGLCVHSLSKELD---VLKEENMLLKDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKF  280 (977)
T ss_pred             hhhhccccccccccccHHHHHH---HHHHHhHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence                        1112355554   245899999988876                 4567777777777765555


No 33 
>TIGR01076 sortase_fam LPXTG-site transpeptidase (sortase) family protein. of an LPXTG motif to the cell wall. It also includes a protein required for correct assembly of an LPXTG-containing fimbrial protein, a set of homologous proteins from Streptococcus pneumoniae, in which LPXTG proteins are common. However, related proteins are found in Bacillus subtilis and Methanobacterium thermoautotrophicum, in which LPXTG-mediated cell wall attachment is not known.
Probab=34.56  E-value=39  Score=30.50  Aligned_cols=43  Identities=23%  Similarity=0.286  Sum_probs=30.7

Q ss_pred             cceeeeccccCCccceeeEEEEeccCCccccccCC-CCCcceecc
Q 008936          322 EKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGA-TNPEYVVTA  365 (548)
Q Consensus       322 ~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA-~~p~Y~vTA  365 (548)
                      +-..-.||... ..-.+|..+..|+.|..-++... ...+|.|++
T Consensus        42 gN~vIaGH~~~-~~~~~F~~L~~l~~GD~i~v~~~~~~~~Y~V~~   85 (136)
T TIGR01076        42 TRIVITGHRGL-PTATMFTNLDKLKKGDMLYLHVGNEVLTYQVTS   85 (136)
T ss_pred             CeEEEEecCCC-CCCCccCCHHHCCCCCEEEEEECCcEEEEEEEE
Confidence            45666778754 33569999999999999887743 345677654


No 34 
>PRK14154 heat shock protein GrpE; Provisional
Probab=32.98  E-value=2.2e+02  Score=28.69  Aligned_cols=26  Identities=8%  Similarity=0.072  Sum_probs=17.2

Q ss_pred             cchhhhhHHHHhhccHHHHHHHHHHH
Q 008936           47 QDREAMELYSRARMQKEEIHSLRQQI   72 (548)
Q Consensus        47 ~d~~~~~L~~k~r~~eeEI~~LRk~L   72 (548)
                      .-|+-.+|-.++.+-++++..|+.++
T Consensus        50 ~~~~~~~l~~el~~le~e~~elkd~~   75 (208)
T PRK14154         50 EFPSREKLEGQLTRMERKVDEYKTQY   75 (208)
T ss_pred             cCcchhhHHHHHHHHHHHHHHHHHHH
Confidence            44556666666677777777776665


No 35 
>PRK14162 heat shock protein GrpE; Provisional
Probab=32.86  E-value=4.7e+02  Score=26.08  Aligned_cols=64  Identities=14%  Similarity=0.215  Sum_probs=42.5

Q ss_pred             HHHHHHhHHHhhhcC--CC--CCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          140 YFFMSSMLGLLADYG--LW--PHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       140 stFvSSLLpLLaEY~--Lq--P~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      .-|+..|||.+--+.  |.  |.-.+..+|+.+|+-.+++|..=|.  ..-|+++. ..|     ..++|+.|+--
T Consensus        89 ~~~~~~LLpV~DnLerAl~~~~~~~~~~~l~~Gvemi~k~l~~vL~--~~GV~~I~-~~G-----~~FDP~~HEAv  156 (194)
T PRK14162         89 QSLAKDVLPAMDNLERALAVKADDEAAKQLKKGVQMTLDHLVKALK--DHGVTEIK-ADG-----EKFDPTLHQAV  156 (194)
T ss_pred             HHHHHHHhhHHhHHHHHHhccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEeC-CCC-----CCCChhHhhhh
Confidence            458899999988773  22  3323457899999999999988875  44555554 223     23667666543


No 36 
>PF04156 IncA:  IncA protein;  InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=32.72  E-value=2.4e+02  Score=26.34  Aligned_cols=27  Identities=11%  Similarity=0.282  Sum_probs=11.3

Q ss_pred             chhhhhhHHHHHHHHHHHHHhhhhhhc
Q 008936          162 ASAISNTVKHLYDQLQSQIRTSYDRIR  188 (548)
Q Consensus       162 AqSIvsnvKvLf~hLq~kL~~~e~klk  188 (548)
                      ...-+...+.-+++++..+....+++.
T Consensus       156 ~~~~~~~~~~~~~~~~~~~~~l~~~~~  182 (191)
T PF04156_consen  156 SREEVQELRSQLERLQENLQQLEEKIQ  182 (191)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            333444444444444444444444333


No 37 
>PRK02119 hypothetical protein; Provisional
Probab=32.61  E-value=98  Score=26.17  Aligned_cols=37  Identities=22%  Similarity=0.382  Sum_probs=27.7

Q ss_pred             cchhhhHHhHhHH--HHHHhHHhhHHHHHHHHHHHHhhhhHh
Q 008936           84 NEKYTLERKVSEL--RMAIDEKQNEAITSALNELARRKGVLE  123 (548)
Q Consensus        84 nEKyvLErRiA~m--RmAfDqqQqdlVdAASKaLSyRQdiiE  123 (548)
                      +|-..+|.||.+|  |+||   |.+.|+.=++++..=|..|.
T Consensus         2 ~~~~~~e~Ri~~LE~rla~---QE~tie~LN~~v~~Qq~~id   40 (73)
T PRK02119          2 QIQQNLENRIAELEMKIAF---QENLLEELNQALIEQQFVID   40 (73)
T ss_pred             cchHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH
Confidence            3455688888764  7777   88999999999887776553


No 38 
>PRK02793 phi X174 lysis protein; Provisional
Probab=32.04  E-value=1e+02  Score=25.92  Aligned_cols=33  Identities=24%  Similarity=0.332  Sum_probs=25.2

Q ss_pred             hhHHhHhHH--HHHHhHHhhHHHHHHHHHHHHhhhhHh
Q 008936           88 TLERKVSEL--RMAIDEKQNEAITSALNELARRKGVLE  123 (548)
Q Consensus        88 vLErRiA~m--RmAfDqqQqdlVdAASKaLSyRQdiiE  123 (548)
                      .+|.||.+|  |+||   |++.|+.=++++..=|..|.
T Consensus         5 ~~e~Ri~~LE~~laf---Qe~tIe~Ln~~v~~Qq~~I~   39 (72)
T PRK02793          5 SLEARLAELESRLAF---QEITIEELNVTVTAHEMEMA   39 (72)
T ss_pred             hHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH
Confidence            377777754  7777   88999999999988775553


No 39 
>TIGR01795 CM_mono_cladeE monofunctional chorismate mutase, alpha proteobacterial type. The alpha proteobacterial members are trusted because the pathways of CM are evident and there is only one plausible CM in the genome. In S. coelicolor, however, there is another aparrent monofunctional CM.
Probab=31.50  E-value=3.6e+02  Score=23.70  Aligned_cols=84  Identities=23%  Similarity=0.286  Sum_probs=53.7

Q ss_pred             HhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhh--hhhhhhhhhhHHH-HHHhHHHhhhcCCCCCcccchhhhh
Q 008936           91 RKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLA--HDLKVAEDERYFF-MSSMLGLLADYGLWPHVTNASAISN  167 (548)
Q Consensus        91 rRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLt--yalq~aeQERstF-vSSLLpLLaEY~LqP~V~DAqSIvs  167 (548)
                      ++++++|-.+|+-=..||+-    |+.|..+..+--++-  ..+-+-+.+|-.- +..+.-+..++||.|.+        
T Consensus         3 ~~L~~lR~~ID~ID~qLv~L----L~~R~~~~~~ia~~K~~~~~~v~dp~Re~~vl~~~~~~a~~~gl~p~~--------   70 (94)
T TIGR01795         3 AELKALRQSIDNIDAAVIHM----LAERFKCTSQVGVLKANAGLAPADPAREDYQIARLRRLAIDAGLDPEF--------   70 (94)
T ss_pred             hhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHH--------
Confidence            35677787777766666654    555555555544333  2344444555433 35678888999999985        


Q ss_pred             hHHHHHHHHHHHHHhhhhhh
Q 008936          168 TVKHLYDQLQSQIRTSYDRI  187 (548)
Q Consensus       168 nvKvLf~hLq~kL~~~e~kl  187 (548)
                       +..||+.+.+.....++++
T Consensus        71 -~e~i~~~i~~esir~q~~~   89 (94)
T TIGR01795        71 -AEKFLNFIVTEVIKHHERI   89 (94)
T ss_pred             -HHHHHHHHHHHHHHHHHHH
Confidence             3677888888777666553


No 40 
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=31.18  E-value=4.1e+02  Score=30.79  Aligned_cols=18  Identities=6%  Similarity=0.075  Sum_probs=14.0

Q ss_pred             ccccCccchHHHHHHhhc
Q 008936          398 KIKCDLGMQSEIDAYISR  415 (548)
Q Consensus       398 kI~cdpeMq~~I~~~l~~  415 (548)
                      -|.||++....|+..+..
T Consensus       605 ~i~~~~~~~~~~~~~lg~  622 (1164)
T TIGR02169       605 LVEFDPKYEPAFKYVFGD  622 (1164)
T ss_pred             HccCcHHHHHHHHHHCCC
Confidence            388999988888877654


No 41 
>PRK11020 hypothetical protein; Provisional
Probab=30.54  E-value=77  Score=29.74  Aligned_cols=40  Identities=28%  Similarity=0.439  Sum_probs=27.5

Q ss_pred             HHHHHHHHHhhh-----hhhhhcchhhhHHhHhHHHHHHhHHhhHHHH
Q 008936           67 SLRQQIAVACLK-----ELQLQNEKYTLERKVSELRMAIDEKQNEAIT  109 (548)
Q Consensus        67 ~LRk~LadasvK-----E~QllnEKyvLErRiA~mRmAfDqqQqdlVd  109 (548)
                      .+|.+||.+...     =+|+..|+..|+++|+.|+=   +|.++|-.
T Consensus        16 ~~~~Klaaa~~rgd~~~i~qf~~E~~~l~k~I~~lk~---~~~~~lsk   60 (118)
T PRK11020         16 AIRHKLAAASLRGDAEKYAQFEKEKATLEAEIARLKE---VQSQKLSK   60 (118)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH
Confidence            345566655443     37899999999999999983   44444433


No 42 
>PRK14139 heat shock protein GrpE; Provisional
Probab=30.05  E-value=4.1e+02  Score=26.34  Aligned_cols=64  Identities=19%  Similarity=0.231  Sum_probs=41.4

Q ss_pred             HHHHHHhHHHhhhc--CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          140 YFFMSSMLGLLADY--GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       140 stFvSSLLpLLaEY--~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      .-|+..|||.+-.+  -|.=.-.+..+|+.+|+-.+++|..-|.  ...|+++. ..|     ..++|+.|+--
T Consensus        82 ~~~~~~LLpv~DnLerAl~~~~~~~~~l~~Gv~mi~k~l~~vL~--k~Gv~~I~-~~G-----~~FDP~~HEAv  147 (185)
T PRK14139         82 ESFAESLLPVKDSLEAALADESGDLEKLREGVELTLKQLTSAFE--KGRVVEIN-PVG-----EKFDPHQHQAI  147 (185)
T ss_pred             HHHHHHHhhHHhHHHHHHhcccchHHHHHHHHHHHHHHHHHHHH--HCCCceeC-CCC-----CCCChHHhhee
Confidence            45888899988777  2221113567899999999999888774  44555555 223     24667666543


No 43 
>cd05827 Sortase_C_3 Sortase C (SrtC) or subfamily-3 sortases are membrane cysteine transpeptidases found in Gram-positive bacteria that anchor surface proteins to peptidoglycans of the bacterial cell wall envelope. This involves a transpeptidation reaction in which the surface protein substrate is cleaved at a conserved cell wall sorting signal and covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. Sortase C cleaves surface proteins at the LPXTG motif between Thr and Gly and catalyzes the formation of an amide bond between the carboxyl group of Thr and the amino group of cell-wall cross bridges. It is a narrow-range enzyme required for anchoring a few substrates. It plays a role in Streptococcus pneumoniae pathogenesis and is required in the assembly of pili on the surface of Corynebacterium diphtheriae. Sortase C is 
Probab=29.40  E-value=53  Score=29.28  Aligned_cols=44  Identities=20%  Similarity=0.256  Sum_probs=29.8

Q ss_pred             CcceeeeccccCCccceeeEEEEeccCCccccccCC-CCCcceecc
Q 008936          321 GEKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGA-TNPEYVVTA  365 (548)
Q Consensus       321 G~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA-~~p~Y~vTA  365 (548)
                      |+-..-.||... ....+|.++..|+.|..-+|.-. ....|.|++
T Consensus        44 ~~N~viaGH~~~-~~~~~F~~L~~l~~gd~i~l~t~~~~~~Y~V~~   88 (131)
T cd05827          44 GTHSVITGHRGL-PNAKLFTDLDKLKKGDKFYIHVLGETLAYQVDQ   88 (131)
T ss_pred             CCEEEEEeCCCC-CCCcccCCHHHCCCCCEEEEEECCeEEEEEEEE
Confidence            345777888731 23459999999999998777733 334565554


No 44 
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=29.40  E-value=4e+02  Score=32.32  Aligned_cols=61  Identities=15%  Similarity=0.130  Sum_probs=44.2

Q ss_pred             hccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhh
Q 008936           59 RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRK  119 (548)
Q Consensus        59 r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQ  119 (548)
                      .+.++.+..+|..+++.--..+++.++...|++.+.+++...+.....+-+.-.+-..-+.
T Consensus       372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~  432 (1163)
T COG1196         372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEA  432 (1163)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4456666788888888888888889999999999988888777776666554444333333


No 45 
>PF11932 DUF3450:  Protein of unknown function (DUF3450);  InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=29.22  E-value=5.3e+02  Score=25.59  Aligned_cols=84  Identities=23%  Similarity=0.233  Sum_probs=40.1

Q ss_pred             HHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhh----hhhhhhhhhhhhhhhHHHH
Q 008936           68 LRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEE----NLKLAHDLKVAEDERYFFM  143 (548)
Q Consensus        68 LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEE----NiRLtyalq~aeQERstFv  143 (548)
                      ..+++.+..-...+++.|...|++.+.-|+. |-++.+..|++.-+.+..-+.=|++    .-.|.=-+...-++...||
T Consensus        40 sQ~~id~~~~e~~~L~~e~~~l~~e~e~L~~-~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p~m~~m~~~L~~~v  118 (251)
T PF11932_consen   40 SQKRIDQWDDEKQELLAEYRQLEREIENLEV-YNEQLERQVASQEQELASLEQQIEQIEETRQELVPLMEQMIDELEQFV  118 (251)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3344444444555566666666666666664 4555555555555444433222221    1112222333344444555


Q ss_pred             HHhHHHhhh
Q 008936          144 SSMLGLLAD  152 (548)
Q Consensus       144 SSLLpLLaE  152 (548)
                      ..-+|++.+
T Consensus       119 ~~d~Pf~~~  127 (251)
T PF11932_consen  119 ELDLPFLLE  127 (251)
T ss_pred             hcCCCCChH
Confidence            555555544


No 46 
>PF14048 MBD_C:  C-terminal domain of methyl-CpG binding protein 2 and 3; PDB: 2L2L_B.
Probab=29.22  E-value=40  Score=30.07  Aligned_cols=20  Identities=25%  Similarity=0.381  Sum_probs=17.6

Q ss_pred             HHhhccHHHHHHHHHHHHHH
Q 008936           56 SRARMQKEEIHSLRQQIAVA   75 (548)
Q Consensus        56 ~k~r~~eeEI~~LRk~Lada   75 (548)
                      .-+|.||+.....|+|||+|
T Consensus        76 eDIr~QE~rVk~aR~RLaeA   95 (96)
T PF14048_consen   76 EDIRRQERRVKKARKRLAEA   95 (96)
T ss_dssp             HHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            34689999999999999987


No 47 
>PF01025 GrpE:  GrpE;  InterPro: IPR000740  Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes the grpE protein. Dimeric GrpE is the co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold []. DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle.  The X-ray crystal structure of GrpE in complex with the ATPase domain of DnaK revealed that GrpE is an asymmetric homodimer, bent in a manner that favours extensive contacts with only one DnaKATPase monomer []. GrpE does not actively compete for the atomic positions occupied by the nucleotide. GrpE and ADP mutually reduce one another's affinity for DnaK 200-fold, and ATP instantly dissociates GrpE from DnaK.; GO: 0000774 adenyl-nucleotide exchange factor activity, 0042803 protein homodimerization activity, 0051087 chaperone binding, 0006457 protein folding; PDB: 3A6M_A 4ANI_A 1DKG_B.
Probab=28.47  E-value=2.3e+02  Score=25.88  Aligned_cols=53  Identities=9%  Similarity=0.187  Sum_probs=33.3

Q ss_pred             hhhHHHHHHhHHHhhhc----CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936          137 DERYFFMSSMLGLLADY----GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       137 QERstFvSSLLpLLaEY----~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q  191 (548)
                      ....-|+..|||++.-+    .--+.-.+..++..+++.++++|..-|.  .-.+.++.
T Consensus        58 ~~~~~~~~~ll~v~D~l~~a~~~~~~~~~~~~~~~g~~~~~~~l~~~L~--~~Gv~~i~  114 (165)
T PF01025_consen   58 YALEKFLKDLLPVLDNLERALEAAKSNEEEESLLEGLEMILKQLEDILE--KNGVEEIE  114 (165)
T ss_dssp             CCHHHHHHHHHHHHHHHHHHHCC-SHHCTCHHHHHHHHHHHHHHHHHHH--TTTEEEE-
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhccchHHHHHHHHHHHHHHHHHHHH--HCCCEecC
Confidence            34556788899988766    2222334567888888888888876663  33444444


No 48 
>PF14389 Lzipper-MIP1:  Leucine-zipper of ternary complex factor MIP1
Probab=27.88  E-value=83  Score=27.31  Aligned_cols=36  Identities=19%  Similarity=0.177  Sum_probs=26.8

Q ss_pred             CccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhh
Q 008936           43 FISFQDREAMELYSRARMQKEEIHSLRQQIAVACLK   78 (548)
Q Consensus        43 ~~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvK   78 (548)
                      .....++.+++|+.-+-.-|+||..|-+++-+.-.+
T Consensus        48 ~~~~lp~~~keLL~EIA~lE~eV~~LE~~v~~L~~~   83 (88)
T PF14389_consen   48 SPSSLPKKAKELLEEIALLEAEVAKLEQKVLSLYRQ   83 (88)
T ss_pred             ccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345788999999888888888888777776654433


No 49 
>PRK14146 heat shock protein GrpE; Provisional
Probab=27.64  E-value=6.4e+02  Score=25.46  Aligned_cols=64  Identities=11%  Similarity=0.103  Sum_probs=42.5

Q ss_pred             HHHHHHhHHHhhhcC--C--CCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          140 YFFMSSMLGLLADYG--L--WPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       140 stFvSSLLpLLaEY~--L--qP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      ..|+..|||.|--+.  |  -+.-.+..+|+.+|+-.+++|..=|.  ..-|+++. +-|     ..++|+.|+-.
T Consensus       104 e~~~~~lLpv~DnlerAl~~~~~~~~~~~l~~Gv~mi~k~l~~~L~--k~Gv~~i~-~~G-----~~FDP~~HeAv  171 (215)
T PRK14146        104 KSLVSGFLNPIDNLERVGATQNQSEELKPFVEGVKMILKEFYSVLE--KSNVIRFD-PKG-----EPFDPMSMEAL  171 (215)
T ss_pred             HHHHHHHhhHHhHHHHHHhcccccchhhHHHHHHHHHHHHHHHHHH--HCcCeeeC-CCC-----CCCChhHhcee
Confidence            468999999998873  2  22234568899999999999988874  34444443 112     23667777543


No 50 
>PRK06342 transcription elongation factor regulatory protein; Validated
Probab=27.62  E-value=80  Score=30.17  Aligned_cols=42  Identities=12%  Similarity=0.073  Sum_probs=24.5

Q ss_pred             hhhHHHHhhccHHHHHHHHHHHHHHh----hhhhhhhcchhhhHHhHhHHH
Q 008936           51 AMELYSRARMQKEEIHSLRQQIAVAC----LKELQLQNEKYTLERKVSELR   97 (548)
Q Consensus        51 ~~~L~~k~r~~eeEI~~LRk~Ladas----vKE~QllnEKyvLErRiA~mR   97 (548)
                      |.+.|.+.   ++|+..|++.|++|.    +.|  =..+...+|+||+||.
T Consensus        32 T~~G~~~L---~~El~~L~~~i~~Ar~~GDlsE--ak~~~~~~e~rI~~L~   77 (160)
T PRK06342         32 TEAGLKAL---EDQLAQARAAYEAAQAIEDVNE--RRRQMARPLRDLRYLA   77 (160)
T ss_pred             CHHHHHHH---HHHHHHHHHHHHHHHHCCChhH--HHHHHHHHHHHHHHHH
Confidence            55555554   678889988877773    334  1223344555555553


No 51 
>PF00038 Filament:  Intermediate filament protein;  InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups:  Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C.   All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=27.41  E-value=3.8e+02  Score=26.83  Aligned_cols=50  Identities=32%  Similarity=0.419  Sum_probs=41.0

Q ss_pred             hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHH
Q 008936           58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEA  107 (548)
Q Consensus        58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdl  107 (548)
                      +....+|+..+|+.+.....+=..|.+.+..||++|++|.--++.+.++.
T Consensus       211 ~~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~le~~~~~~~~~~  260 (312)
T PF00038_consen  211 LESAKEELKELRRQIQSLQAELESLRAKNASLERQLRELEQRLDEEREEY  260 (312)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             cchhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHHHHHHHHHHHHH
Confidence            35567888899999998888888899999999999999987777665543


No 52 
>PRK00295 hypothetical protein; Provisional
Probab=27.23  E-value=1.4e+02  Score=24.86  Aligned_cols=31  Identities=32%  Similarity=0.499  Sum_probs=21.5

Q ss_pred             hHHhHhH--HHHHHhHHhhHHHHHHHHHHHHhhhhH
Q 008936           89 LERKVSE--LRMAIDEKQNEAITSALNELARRKGVL  122 (548)
Q Consensus        89 LErRiA~--mRmAfDqqQqdlVdAASKaLSyRQdii  122 (548)
                      +|.||.+  +|+||   |++.|+.=++++..-|..|
T Consensus         3 ~e~Ri~~LE~kla~---qE~tie~Ln~~v~~Qq~~I   35 (68)
T PRK00295          3 LEERVTELESRQAF---QDDTIQALNDVLVEQQRVI   35 (68)
T ss_pred             HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Confidence            5566664  36666   7788888888887777444


No 53 
>PF07989 Microtub_assoc:  Microtubule associated;  InterPro: IPR012943 Proteins with this domain associate with the spindle body during cell division [].
Probab=26.84  E-value=1.5e+02  Score=25.35  Aligned_cols=63  Identities=21%  Similarity=0.186  Sum_probs=32.9

Q ss_pred             hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhh
Q 008936           58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAED  137 (548)
Q Consensus        58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQ  137 (548)
                      .|+||+.|..|+|-  +.++|            =||-.|.     ++..     ...=..-++++.|||-|--.+...+.
T Consensus         2 lrEqe~~i~~L~KE--NF~LK------------LrI~fLe-----e~l~-----~~~~~~~~~~~keNieLKve~~~L~~   57 (75)
T PF07989_consen    2 LREQEEQIDKLKKE--NFNLK------------LRIYFLE-----ERLQ-----KLGPESIEELLKENIELKVEVESLKR   57 (75)
T ss_pred             HHHHHHHHHHHHHh--hhhHH------------HHHHHHH-----HHHH-----hcccccHHHHHHHHHHHHHHHHHHHH
Confidence            57888888888873  23333            2333322     2222     11123345667777777666665555


Q ss_pred             hhHHHHH
Q 008936          138 ERYFFMS  144 (548)
Q Consensus       138 ERstFvS  144 (548)
                      |-.-.-.
T Consensus        58 el~~~~~   64 (75)
T PF07989_consen   58 ELQEKKK   64 (75)
T ss_pred             HHHHHHH
Confidence            5444333


No 54 
>PF04849 HAP1_N:  HAP1 N-terminal conserved region;  InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=26.58  E-value=1.4e+02  Score=31.83  Aligned_cols=48  Identities=31%  Similarity=0.275  Sum_probs=38.9

Q ss_pred             hccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhH
Q 008936           59 RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNE  106 (548)
Q Consensus        59 r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqd  106 (548)
                      ..|.|||.+|..+|+|.--|.-|+-.|+..|=..++..+-+=.+-|-+
T Consensus       230 ~rQQEEIt~LlsqivdlQ~r~k~~~~EnEeL~q~L~~ske~Q~~L~aE  277 (306)
T PF04849_consen  230 RRQQEEITSLLSQIVDLQQRCKQLAAENEELQQHLQASKESQRQLQAE  277 (306)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence            567899999999999999999999999999988887776554333333


No 55 
>PRK14149 heat shock protein GrpE; Provisional
Probab=26.27  E-value=3.8e+02  Score=26.74  Aligned_cols=64  Identities=13%  Similarity=0.122  Sum_probs=43.5

Q ss_pred             hHHHHHHhHHHhhhcC----CCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          139 RYFFMSSMLGLLADYG----LWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       139 RstFvSSLLpLLaEY~----LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      ..-|+..|||.|-.+.    .-+...+..+|+.+|+-.+++|..-|.  ..-|+++.  |   .|  .++|+.|+.-
T Consensus        85 ~~~~~~~LLpVlDnLerAl~~~~~~~~~~~l~~Gv~mi~k~l~~vL~--k~GV~~I~--~---~G--~FDP~~HEAv  152 (191)
T PRK14149         85 YEKIALDLLPVIDALLGALKSAAEVDKESALTKGLELTMEKLHEVLA--RHGIEGIE--C---LE--EFDPNFHNAI  152 (191)
T ss_pred             HHHHHHHHhhHHhHHHHHHhccccccchHHHHHHHHHHHHHHHHHHH--HCCCEEeC--C---CC--CCChHHhhee
Confidence            3468999999998872    223345667899999999999988775  33444443  2   23  3888777643


No 56 
>PF04012 PspA_IM30:  PspA/IM30 family;  InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=25.67  E-value=1.2e+02  Score=29.07  Aligned_cols=48  Identities=29%  Similarity=0.279  Sum_probs=34.9

Q ss_pred             hhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHH
Q 008936           51 AMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRM   98 (548)
Q Consensus        51 ~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRm   98 (548)
                      ..+...+....++++..|...+..+.-...+|.+....||.||.+|+-
T Consensus        86 Ar~al~~k~~~e~~~~~l~~~~~~~~~~~~~l~~~l~~l~~kl~e~k~  133 (221)
T PF04012_consen   86 AREALQRKADLEEQAERLEQQLDQAEAQVEKLKEQLEELEAKLEELKS  133 (221)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344455667777777778888888777777777777777777777763


No 57 
>PF09726 Macoilin:  Transmembrane protein;  InterPro: IPR019130  This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=25.59  E-value=1.8e+02  Score=33.83  Aligned_cols=67  Identities=33%  Similarity=0.419  Sum_probs=46.8

Q ss_pred             HHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHH--hHHhhHHHHHHHHHHHHhhhhHhhhhh
Q 008936           54 LYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAI--DEKQNEAITSALNELARRKGVLEENLK  127 (548)
Q Consensus        54 L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAf--DqqQqdlVdAASKaLSyRQdiiEENiR  127 (548)
                      +=.|.++.|.||.+||.-|.   .||-|    +..||..+.+||..-  +++-.+..=+|+-||--+-.+||+|+.
T Consensus       543 ~r~r~~~lE~E~~~lr~elk---~kee~----~~~~e~~~~~lr~~~~e~~~~~e~L~~aL~amqdk~~~LE~sLs  611 (697)
T PF09726_consen  543 CRQRRRQLESELKKLRRELK---QKEEQ----IRELESELQELRKYEKESEKDTEVLMSALSAMQDKNQHLENSLS  611 (697)
T ss_pred             HHHHHHHHHHHHHHHHHHHH---HHHHH----HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhh
Confidence            55677888999999987553   56654    467899899999852  244456666777777766667776654


No 58 
>PRK11281 hypothetical protein; Provisional
Probab=25.58  E-value=1.9e+02  Score=35.57  Aligned_cols=61  Identities=26%  Similarity=0.317  Sum_probs=44.9

Q ss_pred             hhhhhHHHHhhccHHHHHHHHHHHHHHhh------------hh---------hhhhcchhhhHHhHhHHHHHHhHHhhHH
Q 008936           49 REAMELYSRARMQKEEIHSLRQQIAVACL------------KE---------LQLQNEKYTLERKVSELRMAIDEKQNEA  107 (548)
Q Consensus        49 ~~~~~L~~k~r~~eeEI~~LRk~Ladasv------------KE---------~QllnEKyvLErRiA~mRmAfDqqQqdl  107 (548)
                      ..+.++|.++..++++...|+++|++|--            |+         ..-++. ..||.++++..-.-.+.|.+|
T Consensus        66 ~~tL~~L~qi~~~~~~~~~L~k~l~~Ap~~l~~a~~~Le~Lk~~~~~~~~~~~~~~Sl-~qLEq~L~q~~~~Lq~~Q~~L  144 (1113)
T PRK11281         66 EQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSL-RQLESRLAQTLDQLQNAQNDL  144 (1113)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhhccccccccccccccCH-HHHHHHHHHHHHHHHHHHHHH
Confidence            35678899999999999999999888531            21         111122 349999999988888888888


Q ss_pred             HHH
Q 008936          108 ITS  110 (548)
Q Consensus       108 VdA  110 (548)
                      -++
T Consensus       145 a~~  147 (1113)
T PRK11281        145 AEY  147 (1113)
T ss_pred             HHH
Confidence            776


No 59 
>PRK14151 heat shock protein GrpE; Provisional
Probab=25.52  E-value=6.3e+02  Score=24.69  Aligned_cols=64  Identities=11%  Similarity=0.179  Sum_probs=42.1

Q ss_pred             HHHHHHhHHHhhhcC--CC--CCcc-cchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          140 YFFMSSMLGLLADYG--LW--PHVT-NASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       140 stFvSSLLpLLaEY~--Lq--P~V~-DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      .-|+..|||.+-.+.  |.  ++-. ...+|+.+|+-.+++|..-|.  ..-|+++. ..|     ..++|+.|+--
T Consensus        70 ~~~~~~LLpv~DnlerAl~~~~~~~~~~~~~~~Gv~mi~k~l~~~L~--k~Gv~~i~-~~G-----~~FDP~~HEAv  138 (176)
T PRK14151         70 EKFAGDLLPVVDSLERGLELSSADDEAIKPMREGVELTLKMFQDTLK--RYQLEAVD-PHG-----EPFNPEHHQAM  138 (176)
T ss_pred             HHHHHHHhhHHhHHHHHHhcccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEeC-CCC-----CCCCHHHhhcc
Confidence            468899999988772  22  2222 247899999999999988774  44556555 223     24667666543


No 60 
>PRK11281 hypothetical protein; Provisional
Probab=25.45  E-value=3.4e+02  Score=33.44  Aligned_cols=71  Identities=21%  Similarity=0.390  Sum_probs=51.1

Q ss_pred             hhHHhHhHHHHHH---hHHhhHH------H----------------HHHHHHHHHhhhhHhhhhh-------hhhhhhhh
Q 008936           88 TLERKVSELRMAI---DEKQNEA------I----------------TSALNELARRKGVLEENLK-------LAHDLKVA  135 (548)
Q Consensus        88 vLErRiA~mRmAf---DqqQqdl------V----------------dAASKaLSyRQdiiEENiR-------Ltyalq~a  135 (548)
                      .|.++||++|+.-   .||+..|      +                ++..+-|.-|+++|++=++       +.=.||.+
T Consensus       357 ~l~~~iAdlrl~~f~~~q~~~~l~~~~~y~~~l~~~~~~~~~~~~~~~l~~ll~~r~~LL~~l~~~~~~~l~~~~~l~~~  436 (1113)
T PRK11281        357 GLADRIADLRLEQFEINQQRDALFQPDAYIDKLEAGHKSEVTDEVRDALLQLLDERRELLDQLNKQLNNQLNLAINLQLN  436 (1113)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHhcChhhHHHHHHhcccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3889999999763   2222111      1                2344678889999988333       45577888


Q ss_pred             hhhhHHHHHHhHHHhhhcCCCCC
Q 008936          136 EDERYFFMSSMLGLLADYGLWPH  158 (548)
Q Consensus       136 eQERstFvSSLLpLLaEY~LqP~  158 (548)
                      +|+-..-+.+|=-.|.|.-+|=+
T Consensus       437 q~Ql~~~~~~l~~~L~~~lfWv~  459 (1113)
T PRK11281        437 QQQLLSVSDSLQSTLTQQIFWVN  459 (1113)
T ss_pred             HHHHHHHHHHHHHHHHHhhhccC
Confidence            99998889999999999988844


No 61 
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=25.20  E-value=3.5e+02  Score=27.88  Aligned_cols=47  Identities=21%  Similarity=0.235  Sum_probs=30.3

Q ss_pred             HHhhccHHHHHHHHHHHHHHhhhh---------hhhhcchhhhHHhHhHHHHHHhH
Q 008936           56 SRARMQKEEIHSLRQQIAVACLKE---------LQLQNEKYTLERKVSELRMAIDE  102 (548)
Q Consensus        56 ~k~r~~eeEI~~LRk~LadasvKE---------~QllnEKyvLErRiA~mRmAfDq  102 (548)
                      ..++..+.||.-+|+|++++..|=         ..|-.|.+++++|++.|+=...+
T Consensus        59 ~qv~~~e~ei~~~r~r~~~~e~kl~~v~~~~e~~aL~~E~~~ak~r~~~le~el~~  114 (239)
T COG1579          59 NQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAE  114 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345667788888888887775542         13555667777777666655443


No 62 
>PRK04406 hypothetical protein; Provisional
Probab=24.73  E-value=1.7e+02  Score=24.98  Aligned_cols=32  Identities=22%  Similarity=0.489  Sum_probs=24.0

Q ss_pred             hhHHhHhHH--HHHHhHHhhHHHHHHHHHHHHhhhhH
Q 008936           88 TLERKVSEL--RMAIDEKQNEAITSALNELARRKGVL  122 (548)
Q Consensus        88 vLErRiA~m--RmAfDqqQqdlVdAASKaLSyRQdii  122 (548)
                      .||.||.+|  |+||   |++.|+.=++++..-|..|
T Consensus         8 ~le~Ri~~LE~~lAf---QE~tIe~LN~~v~~Qq~~I   41 (75)
T PRK04406          8 QLEERINDLECQLAF---QEQTIEELNDALSQQQLLI   41 (75)
T ss_pred             HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Confidence            467777754  6776   7889999888888777444


No 63 
>PRK14160 heat shock protein GrpE; Provisional
Probab=23.64  E-value=6.2e+02  Score=25.64  Aligned_cols=63  Identities=10%  Similarity=0.176  Sum_probs=42.4

Q ss_pred             hHHHHHHhHHHhhhc--CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936          139 RYFFMSSMLGLLADY--GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP  211 (548)
Q Consensus       139 RstFvSSLLpLLaEY--~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p  211 (548)
                      ..-|+..|||.+--+  -|.- ..+..+|+.+|+-.+++|..=|.  ..-|+++.  |   .|  .++|+.|+.-
T Consensus       110 ~e~~~~~LLpVlDnLerAl~~-~~~~~~l~~Gv~mi~kql~~vL~--k~GVe~I~--~---~G--~FDP~~HEAv  174 (211)
T PRK14160        110 CEDVLKELLPVLDNLERAAAV-EGSVEDLKKGIEMTVKQFKTSLE--KLGVEEIS--T---EG--EFDPNLHNAV  174 (211)
T ss_pred             HHHHHHHHhhHHhHHHHHHhc-ccchhHHHHHHHHHHHHHHHHHH--HCCCEEeC--C---CC--CCChHHhcee
Confidence            356899999998877  2321 24557899999999999888774  33455543  2   23  4788777643


No 64 
>PLN02524 S-adenosylmethionine decarboxylase
Probab=23.37  E-value=1.6e+02  Score=31.92  Aligned_cols=87  Identities=23%  Similarity=0.269  Sum_probs=50.7

Q ss_pred             eeeeccccCCccceeeEEEEeccCCccccccCCCCCcceeccccccceeEEEeEecCCCCcccceeeeeecCCCccccCc
Q 008936          324 LLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFANDQNKIKCDL  403 (548)
Q Consensus       324 L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~RkGelVk~~ANd~~kI~cdp  403 (548)
                      -.=||||.||                   |+|..=-+--||..|==.+.+.|+.=-|.                   +..
T Consensus       228 F~PCGYSmN~-------------------i~g~~y~TIHVTPE~~~SYaSFEtn~~~~-------------------~~~  269 (355)
T PLN02524        228 FDPCGYSMNG-------------------IEGDAISTIHVTPEDGFSYASFEAMGYDP-------------------GDL  269 (355)
T ss_pred             cCCCcccccc-------------------cCCCceEEEEECCCCCCeEEEEEeccCCc-------------------ccc
Confidence            4569999996                   44554445557777777788887753111                   112


Q ss_pred             cchHHHHHHhhc-cceEEEEeeeccccccccceEEEEeecceEEEe
Q 008936          404 GMQSEIDAYISR-GHATFSVLMLMDSSENWEQATLILRRSIYRIKI  448 (548)
Q Consensus       404 eMq~~I~~~l~~-G~AsF~V~ll~~~~d~wEpATLvirR~gY~IK~  448 (548)
                      .+.+.|++-+.. .=..|.|.+-.+............+-.||.-+-
T Consensus       270 ~~~~lv~rVl~~F~P~~fsvt~~~~~~~~~~~~~~~~~~~gY~~~~  315 (355)
T PLN02524        270 DLSQLVERVLACFKPKEFSVAVHANVGGEAGSWGCSLDPDGYSCKG  315 (355)
T ss_pred             CHHHHHHHHHhhcCCceEEEEEEecCccchhccccccCcCCceecC
Confidence            366666666654 334577777654333333334445557876543


No 65 
>KOG3083 consensus Prohibitin [Posttranslational modification, protein turnover, chaperones]
Probab=22.92  E-value=77  Score=33.03  Aligned_cols=41  Identities=29%  Similarity=0.524  Sum_probs=28.5

Q ss_pred             HhhHHHHH-HHHHHHHh---hhhHhhhhhhhh------------hhhhhhhh--hHHHH
Q 008936          103 KQNEAITS-ALNELARR---KGVLEENLKLAH------------DLKVAEDE--RYFFM  143 (548)
Q Consensus       103 qQqdlVdA-ASKaLSyR---QdiiEENiRLty------------alq~aeQE--RstFv  143 (548)
                      -|.++|++ .+++|..|   -+++-+++-+||            +-|+||||  |.-||
T Consensus       141 TqRe~vS~~v~~~lt~rA~~Fgl~LddvsiThltfGkEFt~AvE~KQVAQQEAErarFv  199 (271)
T KOG3083|consen  141 TQRELVSRQVSNDLTERAATFGLILDDVSITHLTFGKEFTEAVEAKQVAQQEAERARFV  199 (271)
T ss_pred             hhhHHHHHHHHHHHHHHHHhhCeeechhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence            36677776 67777777   456677777765            56888885  67775


No 66 
>PLN02542 fructose-1,6-bisphosphatase
Probab=22.90  E-value=1e+02  Score=33.92  Aligned_cols=80  Identities=24%  Similarity=0.424  Sum_probs=56.7

Q ss_pred             CCCcceeeeccccCCccceeeEEEEeccCCccccccCCCCCcceeccccccceeEEEeEecCCCCcccceeeeee-cCCC
Q 008936          319 TPGEKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFA-NDQN  397 (548)
Q Consensus       319 ~pG~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~RkGelVk~~A-Nd~~  397 (548)
                      -||+++.|.||-+=|.+.-.   |=.+.+|...|.=....=+|++|.+++-       +|  ++      =++|+ |++|
T Consensus       234 qpG~~qvAAGY~lYGpsT~L---Vlt~G~GV~~FtLDp~~geFvLt~~~i~-------IP--~~------g~iySiN~~N  295 (412)
T PLN02542        234 QPGSNLLAAGYCMYSSSVIF---VLTIGTGVFSFTLDPMYGEFVLTQENIQ-------IP--KA------GKIYSFNEGN  295 (412)
T ss_pred             CcchhhhEEEEEEEccceEE---EEEECCCEEEEEEcCCCCeEEEeCCCee-------eC--CC------CcEeeeCccc
Confidence            39999999999998865332   2245777766664555557888877651       22  22      26788 7788


Q ss_pred             ccccCccchHHHHHHhhcc
Q 008936          398 KIKCDLGMQSEIDAYISRG  416 (548)
Q Consensus       398 kI~cdpeMq~~I~~~l~~G  416 (548)
                      .=.+|+.|+.-|+.....|
T Consensus       296 ~~~W~~~~~~yi~~~~~~~  314 (412)
T PLN02542        296 YQLWDDKLKKYIDDLKDPG  314 (412)
T ss_pred             ccccCHHHHHHHHHHhhCC
Confidence            8889999999999887543


No 67 
>PF08838 DUF1811:  Protein of unknown function (DUF1811);  InterPro: IPR014938 This entry consists uncharacterised bacterial proteins. Some of the proteins are annotated as being transcriptional regulators (see Q4MQL7 from SWISSPROT, Q65MA2 from SWISSPROT). The structure of one of the proteins has revealed a beta-barrel like structure with helix-turn-helix like motif. ; PDB: 2YXY_A 1SF9_A.
Probab=22.36  E-value=87  Score=28.72  Aligned_cols=33  Identities=30%  Similarity=0.402  Sum_probs=24.9

Q ss_pred             ccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHh
Q 008936           60 MQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVS   94 (548)
Q Consensus        60 ~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA   94 (548)
                      |-++||..|+.+-..|  ...-+.||--|||||++
T Consensus        11 EL~~Ei~~L~ekarKA--Eq~G~~nE~aV~erK~~   43 (102)
T PF08838_consen   11 ELRQEIARLKEKARKA--EQLGIVNEYAVYERKII   43 (102)
T ss_dssp             HHHHHHHHHHHHHHHH--HHCT-HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH--HHcCCccHHHHHHHHHH
Confidence            4467888888886544  45568899999999985


No 68 
>PRK14141 heat shock protein GrpE; Provisional
Probab=22.14  E-value=7.8e+02  Score=24.90  Aligned_cols=64  Identities=17%  Similarity=0.200  Sum_probs=41.1

Q ss_pred             hHHHHHHhHHHhhhcC--CC--CC------cccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCC
Q 008936          139 RYFFMSSMLGLLADYG--LW--PH------VTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRH  208 (548)
Q Consensus       139 RstFvSSLLpLLaEY~--Lq--P~------V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~  208 (548)
                      ..-|+..|||.+--+.  |.  +.      -.+..+|+.+|+-.+++|..=|.  ...|+++. +.|     ..++|..|
T Consensus        80 ~~~~~~dLLpViDnLerAl~~~~~~~~~~~~~~~~~l~eGv~mi~k~l~~vLe--k~GV~~I~-~~G-----e~FDP~~H  151 (209)
T PRK14141         80 IAGFARDMLSVSDNLRRALDAIPAEARAAADAGLKALIEGVEMTERAMLNALE--RHGVKKLD-PEG-----QKFDPNFH  151 (209)
T ss_pred             HHHHHHHHhhhHhHHHHHHhccccccccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEEC-CCC-----CCCChHHh
Confidence            3468899999987662  21  21      12357899999999999988775  44556555 223     24666666


Q ss_pred             CC
Q 008936          209 GV  210 (548)
Q Consensus       209 q~  210 (548)
                      +.
T Consensus       152 EA  153 (209)
T PRK14141        152 QA  153 (209)
T ss_pred             ce
Confidence            53


No 69 
>PF07926 TPR_MLP1_2:  TPR/MLP1/MLP2-like protein;  InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=22.03  E-value=5e+02  Score=23.57  Aligned_cols=67  Identities=16%  Similarity=0.168  Sum_probs=44.6

Q ss_pred             HHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhh
Q 008936           62 KEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKL  128 (548)
Q Consensus        62 eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRL  128 (548)
                      =++|..||+.+..+-..=.++..+.......+...+-.+..+...|-..-+.+=..+.|+-+.|=-|
T Consensus        58 ~~~L~~lr~e~~~~~~~~~~l~~~~~~a~~~l~~~e~sw~~qk~~le~e~~~~~~r~~dL~~QN~lL  124 (132)
T PF07926_consen   58 IKELQQLREELQELQQEINELKAEAESAKAELEESEASWEEQKEQLEKELSELEQRIEDLNEQNKLL  124 (132)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3456677777777666666677777777777777777777776666666666666666666666433


No 70 
>KOG3990 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.88  E-value=2.7e+02  Score=29.55  Aligned_cols=29  Identities=28%  Similarity=0.408  Sum_probs=25.7

Q ss_pred             ccchhhhhHHHHhhccHHHHHHHHHHHHH
Q 008936           46 FQDREAMELYSRARMQKEEIHSLRQQIAV   74 (548)
Q Consensus        46 ~~d~~~~~L~~k~r~~eeEI~~LRk~Lad   74 (548)
                      |-|+.--+-.++..--+|||..|++.|+.
T Consensus       215 ~~D~~~~dh~V~i~~lkeeia~Lkk~L~q  243 (305)
T KOG3990|consen  215 FGDRDPGDHMVKIQKLKEEIARLKKLLHQ  243 (305)
T ss_pred             CCCCCCcchHHHHHHHHHHHHHHHHHHhh
Confidence            66888888889999999999999999985


No 71 
>PF13600 DUF4140:  N-terminal domain of unknown function (DUF4140)
Probab=21.50  E-value=1.3e+02  Score=25.81  Aligned_cols=36  Identities=22%  Similarity=0.273  Sum_probs=26.4

Q ss_pred             cHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHH
Q 008936           61 QKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSEL   96 (548)
Q Consensus        61 ~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~m   96 (548)
                      ..+++..|+++|.++--+-..+.+++.+++.+++.|
T Consensus        68 ~~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~L  103 (104)
T PF13600_consen   68 DSPELKELEEELEALEDELAALQDEIQALEAQIAFL  103 (104)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            455777777777777777777778888888887755


No 72 
>KOG3620 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.40  E-value=1.7e+02  Score=36.72  Aligned_cols=75  Identities=21%  Similarity=0.316  Sum_probs=56.1

Q ss_pred             hhccceEEEEeeeccccccccceEEEEeecc----------------eEEEec-CCcceeeeccCCcceeEecCCCcceE
Q 008936          413 ISRGHATFSVLMLMDSSENWEQATLILRRSI----------------YRIKID-STEAIIEERFPKEVSIKVPCGLSTQF  475 (548)
Q Consensus       413 l~~G~AsF~V~ll~~~~d~wEpATLvirR~g----------------Y~IK~~-~~~~~I~EKfS~~~~IkIP~G~~~qf  475 (548)
                      .--|..+|+|-.+-+-.-.-+  |++|--..                |+||.= |=.+..+..|||.+.|-=|+.++-|.
T Consensus        40 ~P~GnTsf~VVFLpR~~G~~~--t~~~~~ts~Gvfty~vqG~g~~nPYRl~Pf~G~r~PmNss~sp~I~i~NPh~~plqi  117 (1626)
T KOG3620|consen   40 PPQGNTSFNVVFLPRQLGAIA--TDLLIHTSFGVFTYAVQGEGSENPYRLKPFVGIRAPMNSSLSPEIHIYNPHERPLQI  117 (1626)
T ss_pred             CCCCcceEEEEEehhhhccce--eeEEEeeccceEEEEEeccCCCCccccccccceecccccccCCceEecCCccCceEE
Confidence            346899999998865444444  55554433                777765 44488999999999999999999999


Q ss_pred             EEEecCCCeeeecc
Q 008936          476 VLTFSDGSSYPFST  489 (548)
Q Consensus       476 vi~~sdG~e~~L~~  489 (548)
                      +=|.+.|.+-.|++
T Consensus       118 ~EiysSGG~~hlEL  131 (1626)
T KOG3620|consen  118 LEIYSSGGEFHLEL  131 (1626)
T ss_pred             EEEeccCccEEEec
Confidence            98877777766664


No 73 
>cd05829 Sortase_E Sortase E (SrtE) is a membrane transpeptidase found in gram-positive bacteria that cleaves surface proteins at a cell sorting motif and catalyzes a transpeptidation reaction in which the surface protein substrate is covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. The function of Sortase E is unknown. In two different sortase families, the N-terminus either functions as both a signal peptide for secretion and a stop-transfer signal for membrane anchoring, or it contains a signal peptide only and the C-terminus serves as a membrane anchor. Most gram-positive bacteria contain more than one sortase and it is thought that the different sortases anchor different surface protein classes. The sortase domain is a modified beta-barrel flanked by two (SrtA) or three (SrtB) short alpha-helices.
Probab=21.29  E-value=1.9e+02  Score=26.52  Aligned_cols=51  Identities=18%  Similarity=0.232  Sum_probs=35.8

Q ss_pred             cccCCC--cceeeeccccCCccceeeEEEEeccCCccccccCC--CCCcceeccc
Q 008936          316 GEATPG--EKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGA--TNPEYVVTAD  366 (548)
Q Consensus       316 Gda~pG--~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA--~~p~Y~vTAD  366 (548)
                      +.+.||  +...-.||....-....|-.+..|+-|..-+|..+  ....|.||.-
T Consensus        42 ~~~~pG~~Gn~viaGH~~~~g~~~~F~~L~~l~~GD~I~v~~~~g~~~~Y~V~~~   96 (144)
T cd05829          42 GGPKPGEKGTAVLAGHVDSRGGPAVFFRLGDLRKGDKVEVTRADGQTATFRVDRV   96 (144)
T ss_pred             CCCCCCCCCCEEEEEecCCCCCChhhcchhcCCCCCEEEEEECCCCEEEEEEeEE
Confidence            344444  35556788754224678999999999999999884  3478888753


No 74 
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=21.28  E-value=1.1e+03  Score=27.37  Aligned_cols=8  Identities=25%  Similarity=0.688  Sum_probs=4.0

Q ss_pred             ccceeecc
Q 008936          310 EGFQIIGE  317 (548)
Q Consensus       310 eglqI~Gd  317 (548)
                      .|+.|.-+
T Consensus      1071 ~~~~~~~~ 1078 (1179)
T TIGR02168      1071 AGIEIFAQ 1078 (1179)
T ss_pred             cCceEEEe
Confidence            45555544


No 75 
>PRK09293 fructose-1,6-bisphosphatase; Provisional
Probab=21.28  E-value=1.1e+02  Score=32.31  Aligned_cols=76  Identities=25%  Similarity=0.547  Sum_probs=55.3

Q ss_pred             CCCcceeeeccccCCccceeeEEEEeccCCccccccCCCCCcceeccccccceeEEEeEecCCCCcccceeeeee-cCCC
Q 008936          319 TPGEKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFA-NDQN  397 (548)
Q Consensus       319 ~pG~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~RkGelVk~~A-Nd~~  397 (548)
                      -||+++.|.||-+=|.+.-+   |-.+.+|...|.=....-+|++|-+++-       +|  +++      ++|+ |.+|
T Consensus       145 q~G~~~vaAgy~~YG~~t~l---v~t~g~gv~~ftld~~~g~f~l~~~~i~-------ip--~~~------~~ys~n~~n  206 (327)
T PRK09293        145 QPGNNQVAAGYVLYGPSTML---VLTTGDGVHGFTLDPSLGEFVLTHENIR-------IP--EDG------KEYAINEGN  206 (327)
T ss_pred             CCchhheeEEEEEEcCceEE---EEEeCCCEEEEEEeCCCCeEEEecCCce-------eC--CCC------CEEccCchh
Confidence            39999999999998876544   3344677666665555667888876653       22  222      6899 8888


Q ss_pred             ccccCccchHHHHHHh
Q 008936          398 KIKCDLGMQSEIDAYI  413 (548)
Q Consensus       398 kI~cdpeMq~~I~~~l  413 (548)
                      -=.++|.++.-| .++
T Consensus       207 ~~~w~~~~~~yi-~~~  221 (327)
T PRK09293        207 QRHWEPGVKKYI-ELL  221 (327)
T ss_pred             hhhcCHHHHHHH-HHh
Confidence            889999999999 554


No 76 
>PRK14143 heat shock protein GrpE; Provisional
Probab=21.21  E-value=8.9e+02  Score=24.90  Aligned_cols=64  Identities=20%  Similarity=0.328  Sum_probs=41.5

Q ss_pred             hHHHHHHhHHHhhhc-----CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCC
Q 008936          139 RYFFMSSMLGLLADY-----GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGV  210 (548)
Q Consensus       139 RstFvSSLLpLLaEY-----~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~  210 (548)
                      ..-|+..|||.+--+     .+.|.-.++.+|+.+++-++++|..-|.  ..-|+++. ..|     ..++|+.|.-
T Consensus       116 ~~~~~~~lLpV~DnLerAl~~~~~~~~~~~~l~~Gve~i~k~l~~~L~--k~GV~~i~-~~G-----~~FDP~~HEA  184 (238)
T PRK14143        116 KCNTLSEILPVVDNFERARQQLKPEGEEAQALHRSYQGLYKQLVDVLK--RLGVSPMR-VVG-----QEFDPNLHEA  184 (238)
T ss_pred             HHHHHHHHHHHHhHHHHHHhcccccchhHHHHHHHHHHHHHHHHHHHH--HCCCeeeC-CCC-----CCCChHHhhe
Confidence            456899999998776     2335445568899999999998877664  33444444 122     2356666643


No 77 
>PRK02224 chromosome segregation protein; Provisional
Probab=20.96  E-value=8.3e+02  Score=28.19  Aligned_cols=35  Identities=14%  Similarity=0.137  Sum_probs=20.2

Q ss_pred             EEEEecCCCeeeeccCCCCCchhhHHHHHHHHHHH
Q 008936          475 FVLTFSDGSSYPFSTYNVRMRDTLVLTMRMLQGKV  509 (548)
Q Consensus       475 fvi~~sdG~e~~L~~~~~~~RD~iVLtlR~F~~~~  509 (548)
                      ..++..+|.......-..+.|..+.|.||+-.-++
T Consensus       767 i~~~~~~g~~~~~~~lS~G~~~~~~lalr~a~~~~  801 (880)
T PRK02224        767 LTVYQKDGEPLEPEQLSGGERALFNLSLRCAIYRL  801 (880)
T ss_pred             eeeeCCCCCccChhhcCccHHHHHHHHHHHHHHHH
Confidence            33455566655555555666766666666655443


No 78 
>PF13874 Nup54:  Nucleoporin complex subunit 54; PDB: 3T97_B.
Probab=20.82  E-value=4.3e+02  Score=24.41  Aligned_cols=79  Identities=24%  Similarity=0.242  Sum_probs=30.2

Q ss_pred             hhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhH--HhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhh
Q 008936           52 MELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLE--RKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLA  129 (548)
Q Consensus        52 ~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLE--rRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLt  129 (548)
                      .+|-.|++.|++++.+++.+|.+..-+=.+|  .|+.++  .||+++|--.-+.++-++.-++|     ++|+.   +..
T Consensus        33 ~dL~~R~~~Q~~~~~~~~~~l~~i~~~l~~L--~~~~~~~~~rl~~~r~r~~~L~hR~l~v~~~-----~eilr---~~g  102 (141)
T PF13874_consen   33 EDLKKRVEAQEEEIAQHRERLKEINDKLEEL--QKHDLETSARLEEARRRHQELSHRLLRVLRK-----QEILR---NRG  102 (141)
T ss_dssp             -------------HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHH---H--
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHH---HcC
Confidence            4678899999999999999998776553344  444443  56666665555555555444433     22322   234


Q ss_pred             hhhhhhhhhhH
Q 008936          130 HDLKVAEDERY  140 (548)
Q Consensus       130 yalq~aeQERs  140 (548)
                      |+|+..|.+..
T Consensus       103 ~~l~~eEe~L~  113 (141)
T PF13874_consen  103 YALSPEEEELR  113 (141)
T ss_dssp             -----------
T ss_pred             CCCCHHHHHHH
Confidence            56655555433


No 79 
>PF06034 DUF919:  Nucleopolyhedrovirus protein of unknown function (DUF919);  InterPro: IPR009265 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf29. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several short baculovirus proteins of unknown function.
Probab=20.73  E-value=1.9e+02  Score=24.37  Aligned_cols=43  Identities=26%  Similarity=0.387  Sum_probs=29.6

Q ss_pred             hhccHHHHHHHHHHHHHHhhhhhhh------hcchhhhHHhHhHHHHHH
Q 008936           58 ARMQKEEIHSLRQQIAVACLKELQL------QNEKYTLERKVSELRMAI  100 (548)
Q Consensus        58 ~r~~eeEI~~LRk~LadasvKE~Ql------lnEKyvLErRiA~mRmAf  100 (548)
                      ++.|=.+|.+.+++|.-.+--+-.|      -+|-+.+++|+.+||+-|
T Consensus         6 L~~QLd~I~~~K~~l~ik~~H~Ekl~kitK~p~El~~i~~kl~~~R~~F   54 (62)
T PF06034_consen    6 LTQQLDEINQMKRQLTIKSQHWEKLKKITKNPKELQEIEKKLQELRQNF   54 (62)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHH
Confidence            3556677777777776655443333      245577999999999977


No 80 
>PRK04325 hypothetical protein; Provisional
Probab=20.67  E-value=2.2e+02  Score=24.08  Aligned_cols=32  Identities=25%  Similarity=0.522  Sum_probs=22.8

Q ss_pred             hHHhHhH--HHHHHhHHhhHHHHHHHHHHHHhhhhHh
Q 008936           89 LERKVSE--LRMAIDEKQNEAITSALNELARRKGVLE  123 (548)
Q Consensus        89 LErRiA~--mRmAfDqqQqdlVdAASKaLSyRQdiiE  123 (548)
                      +|.||.+  +|+||   |++.|+.=++++..-|..|.
T Consensus         7 ~e~Ri~~LE~klAf---QE~tIe~LN~vv~~Qq~~I~   40 (74)
T PRK04325          7 MEDRITELEIQLAF---QEDLIDGLNATVARQQQTLD   40 (74)
T ss_pred             HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH
Confidence            4555554  46676   78899999998887775553


No 81 
>PF04201 TPD52:  Tumour protein D52 family;  InterPro: IPR007327 The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family []. These proteins have an N-terminal coiled-coil that allows members to form homo- and heterodimers with each other [].
Probab=20.62  E-value=2.1e+02  Score=28.18  Aligned_cols=51  Identities=33%  Similarity=0.421  Sum_probs=35.1

Q ss_pred             ccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHH-----HhHHhhHHH
Q 008936           44 ISFQDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMA-----IDEKQNEAI  108 (548)
Q Consensus        44 ~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmA-----fDqqQqdlV  108 (548)
                      .-|.+.|-.+|-.-...-||||..||+=|+.   |           ||+.++||=.     +.+-+|++-
T Consensus        24 ~~LsEeE~eeLr~EL~KvEeEI~TLrqvL~a---K-----------er~~~eLKrkLGit~l~elkqnls   79 (162)
T PF04201_consen   24 EGLSEEEREELRSELAKVEEEIQTLRQVLAA---K-----------ERHCAELKRKLGITPLSELKQNLS   79 (162)
T ss_pred             ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---H-----------HHhHHHHHHHHCCchHHHHHHHHH
Confidence            4567777777877777789999999998874   3           5666666532     444555553


No 82 
>PF11932 DUF3450:  Protein of unknown function (DUF3450);  InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=20.58  E-value=5e+02  Score=25.76  Aligned_cols=38  Identities=13%  Similarity=0.189  Sum_probs=25.5

Q ss_pred             hhhhhHHHHHHhHHHhhhcCCCCCcccchhhhhhHHHHHHHHHHHH
Q 008936          135 AEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQSQI  180 (548)
Q Consensus       135 aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvsnvKvLf~hLq~kL  180 (548)
                      -.+||..-+-.|=-+|+..++-+.        .-+..++.-++-+.
T Consensus       125 ~~~eR~~Rl~~L~~~l~~~dv~~~--------ek~r~vlea~~~E~  162 (251)
T PF11932_consen  125 LLEERQERLARLRAMLDDADVSLA--------EKFRRVLEAYQIEM  162 (251)
T ss_pred             ChHHHHHHHHHHHHhhhccCCCHH--------HHHHHHHHHHHHHH
Confidence            556888888888888888877544        33445555555544


No 83 
>KOG2896 consensus UV radiation resistance associated protein [General function prediction only]
Probab=20.57  E-value=3.5e+02  Score=29.87  Aligned_cols=44  Identities=18%  Similarity=0.248  Sum_probs=30.1

Q ss_pred             hHhHHHHHHhHHhhHH-------HHHHHHHHHHhhhhHhhhhhhhhhhhhh
Q 008936           92 KVSELRMAIDEKQNEA-------ITSALNELARRKGVLEENLKLAHDLKVA  135 (548)
Q Consensus        92 RiA~mRmAfDqqQqdl-------VdAASKaLSyRQdiiEENiRLtyalq~a  135 (548)
                      ++..||.+|+-+|-.+       +..+----.||+-.||-++-.|+.+.--
T Consensus       142 kL~~kr~q~~~~q~~l~k~~k~l~e~~~~l~a~re~fL~~~~~~~~~irq~  192 (377)
T KOG2896|consen  142 KLEDKRQQFNASQVKLQKQLKSLIELRNELVAKRELFLEQRIQDTFKIRQD  192 (377)
T ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhHhHHHhhhhhhhcc
Confidence            4566888888776543       4444455578888888888887776543


No 84 
>PF00038 Filament:  Intermediate filament protein;  InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups:  Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C.   All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=20.23  E-value=3.6e+02  Score=26.96  Aligned_cols=43  Identities=28%  Similarity=0.392  Sum_probs=34.7

Q ss_pred             HHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHh
Q 008936           62 KEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQ  104 (548)
Q Consensus        62 eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQ  104 (548)
                      +.||..||+.|.+.+..=+++.-|.-.|..-+.++|.-|....
T Consensus        53 e~el~~lr~~id~~~~eka~l~~e~~~l~~e~~~~r~k~e~e~   95 (312)
T PF00038_consen   53 EEELRELRRQIDDLSKEKARLELEIDNLKEELEDLRRKYEEEL   95 (312)
T ss_dssp             HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hhHHHHhHHhhhhHHHHhhHHhhhhhhHHHHHHHHHHHHHHHH
Confidence            6788888888888888878888888888888888887777653


No 85 
>PF04011 LemA:  LemA family;  InterPro: IPR007156 The members of this family are related to the LemA protein P71452 from SWISSPROT. The exact molecular function of this protein is uncertain. It is predicted to be a transmembrane protein with an extracellular N terminus [].; PDB: 2ETD_A.
Probab=20.05  E-value=5.2e+02  Score=24.46  Aligned_cols=48  Identities=25%  Similarity=0.361  Sum_probs=31.6

Q ss_pred             hhhhhhhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936          132 LKVAEDERYFFMSSMLGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT  191 (548)
Q Consensus       132 lq~aeQERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q  191 (548)
                      ++.++.+-+.-++.|+=....| .|+..            ..|++|+.+|..+|++|....
T Consensus        90 ~~~~~~~l~~al~~l~~~~e~yP~Lka~------------~~~~~l~~~l~~~E~~I~~aR  138 (186)
T PF04011_consen   90 FQQAEAELSQALSRLLAVVENYPELKAD------------ENFQQLMAQLEETENRIAAAR  138 (186)
T ss_dssp             HHHHHHHHHHHHHHHHHHHTT-HHHHH-------------HHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCccchh------------HHHHHHHHHHHHHHHHHHHHH
Confidence            3456666666677776555555 33221            248999999999999987665


Done!