Query         gi|254781141|ref|YP_003065554.1| hypothetical protein CLIBASIA_05225 [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 196
No_of_seqs    124 out of 155
Neff          6.0 
Searched_HMMs 39220
Date          Mon May 30 06:50:58 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i 254781141.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd07321 Extradiol_Dioxygenase_  74.9     4.8 0.00012   20.7   3.9   42   92-133    11-53  (77)
  2 pfam07746 LigA Aromatic-ring-o  74.1     6.1 0.00015   20.0   4.3   42   94-135     9-50  (88)
  3 cd07925 LigA_like_1 The A subu  72.4     6.8 0.00017   19.6   4.3   43   93-135    23-65  (106)
  4 cd07921 PCA_45_Doxase_A_like S  70.6     7.9  0.0002   19.2   4.2   43   93-135    23-65  (106)
  5 PRK13378 protocatechuate 4,5-d  66.9      10 0.00026   18.4   4.3   44   92-135    33-76  (117)
  6 cd07924 PCA_45_Doxase_A The A   64.5      11 0.00029   18.1   4.3   43   93-135    31-73  (121)
  7 PRK13377 protocatechuate 4,5-d  63.2      12 0.00031   18.0   4.3   43   93-135    34-76  (129)
  8 pfam08855 DUF1825 Domain of un  63.0     9.2 0.00023   18.8   3.3   44   82-126    52-95  (108)
  9 cd07923 Gallate_dioxygenase_C   61.8      13 0.00033   17.8   4.2   44   93-136    15-58  (94)
 10 pfam08506 Cse1 Cse1. This doma  60.7      13 0.00034   17.7   6.3   23   25-47    200-222 (370)
 11 PRK06800 fliH flagellar assemb  58.9      14 0.00037   17.5   8.4  101   33-133    24-137 (229)
 12 KOG1991 consensus               58.7      14 0.00037   17.4   4.3  116    4-121   334-500 (1010)
 13 COG0587 DnaE DNA polymerase II  58.3      15 0.00038   17.4   4.5   54   71-125   683-736 (1139)
 14 pfam01244 Peptidase_M19 Membra  57.6      15 0.00038   17.3   4.5   28  105-132   287-314 (316)
 15 TIGR00084 ruvA Holliday juncti  56.8      11 0.00028   18.3   2.8   28   98-125   167-194 (217)
 16 COG3184 Uncharacterized protei  53.9      17 0.00044   16.9   5.9   97   22-125    21-118 (183)
 17 PRK13379 protocatechuate 4,5-d  52.5      18 0.00046   16.8   4.2   44   93-136    35-78  (119)
 18 TIGR02389 RNA_pol_rpoA2 DNA-di  50.8      19 0.00048   16.7   3.2   45   92-142   288-332 (397)
 19 pfam04695 Pex14_N Peroxisomal   49.9      16  0.0004   17.2   2.7   39   85-123     5-43  (79)
 20 cd07922 CarBa CarBa is the A s  47.9      21 0.00054   16.3   3.3   35   93-127    14-48  (81)
 21 pfam09440 eIF3_N eIF3 subunit   47.5      22 0.00055   16.3   5.7   46   79-127    81-130 (133)
 22 pfam02787 CPSase_L_D3 Carbamoy  46.6      19 0.00049   16.6   2.8   25  103-127    60-84  (122)
 23 cd06549 GH18_trifunctional GH1  42.0      26 0.00067   15.7   3.4   25   93-117    83-107 (298)
 24 pfam10302 DUF2407 Putative mem  40.8      17 0.00043   17.0   1.7   18  106-123   119-138 (252)
 25 cd02874 GH18_CFLE_spore_hydrol  40.6      27  0.0007   15.6   3.3   25   93-117    82-106 (313)
 26 KOG0455 consensus               40.2     7.4 0.00019   19.4  -0.2   48   86-133   180-231 (364)
 27 pfam06382 DUF1074 Protein of u  39.8      28 0.00072   15.5   4.3   61   91-161    75-140 (187)
 28 COG2355 Zn-dependent dipeptida  39.7      28 0.00072   15.5   4.6   43   91-137   270-312 (313)
 29 TIGR02302 aProt_lowcomp conser  38.9      29 0.00074   15.4   7.2  113   72-193   506-622 (899)
 30 PRK13367 protocatechuate 4,5-d  38.9      29 0.00074   15.4   4.0   55   84-143   323-378 (418)
 31 pfam03359 GKAP Guanylate-kinas  38.2      30 0.00076   15.3   6.2   70  114-183   233-312 (342)
 32 pfam07499 RuvA_C RuvA, C-termi  38.0      29 0.00073   15.5   2.5   25  102-126     2-26  (47)
 33 pfam01793 Glyco_transf_15 Glyc  35.7      26 0.00067   15.7   2.1   29   89-117    73-101 (274)
 34 PRK13901 ruvA Holliday junctio  35.4      25 0.00065   15.8   1.9   24  102-125   143-166 (196)
 35 pfam04877 Hairpins HrpZ. HrpZ   35.0      33 0.00085   15.0   4.4   11   11-21    166-176 (277)
 36 TIGR01991 HscA Fe-S protein as  34.8      30 0.00076   15.3   2.2   10  131-140   523-532 (628)
 37 pfam11867 DUF3387 Domain of un  34.5      34 0.00087   15.0   2.9  136   48-190   170-313 (333)
 38 KOG2629 consensus               34.4      25 0.00063   15.9   1.8   95   14-116   103-198 (300)
 39 PTZ00217 flap endonuclease-1;   33.0      36 0.00092   14.8   3.2   29   92-125   299-328 (394)
 40 PHA00542 putative Cro-like pro  31.6      38 0.00097   14.6   2.8   39   89-127     4-44  (82)
 41 COG5656 SXM1 Importin, protein  31.6      28 0.00072   15.5   1.7   38    9-46    337-379 (970)
 42 pfam10929 DUF2811 Protein of u  31.2      39 0.00098   14.6   5.0   47   75-124     6-52  (57)
 43 cd02876 GH18_SI-CLP Stabilin-1  31.0      39 0.00099   14.6   3.3   28   93-120    87-114 (318)
 44 KOG4472 consensus               30.8      33 0.00085   15.0   1.9   30   89-118   151-180 (399)
 45 COG5020 KTR1 Mannosyltransfera  30.8      33 0.00085   15.0   1.9   30   89-118   151-180 (399)
 46 pfam00816 Histone_HNS H-NS his  30.6      39   0.001   14.5   3.7   21  100-120    23-43  (91)
 47 KOG1992 consensus               30.2      24 0.00061   16.0   1.1  113   10-126   331-456 (960)
 48 pfam11198 DUF2857 Protein of u  30.1      40   0.001   14.5   2.3   30  159-188   116-145 (181)
 49 pfam08069 Ribosomal_S13_N Ribo  28.5      43  0.0011   14.3   3.0   33   89-123    21-53  (60)
 50 pfam01371 Trp_repressor Trp re  28.4      43  0.0011   14.3   3.4   56   82-145     2-59  (88)
 51 COG3722 MtlR Transcriptional r  28.1      43  0.0011   14.3   4.0   35   88-122    80-116 (174)
 52 PRK05672 dnaE2 error-prone DNA  27.8      44  0.0011   14.2   4.7   60   66-125   666-725 (1050)
 53 TIGR03253 oxalate_frc formyl-C  27.4      20 0.00052   16.5   0.3   32   91-122   292-325 (415)
 54 KOG0139 consensus               27.2      45  0.0011   14.2   7.2   85   35-119   291-378 (398)
 55 PHA02512 minor structural prot  26.9      45  0.0012   14.1   9.3   48   73-123   147-198 (844)
 56 PRK05398 formyl-coenzyme A tra  26.5      22 0.00056   16.2   0.4   33   91-123   293-327 (416)
 57 pfam06401 Alpha-2-MRAP_C Alpha  25.4      48  0.0012   13.9   3.5   29   91-124   106-134 (205)
 58 cd06548 GH18_chitinase The GH1  25.0      49  0.0013   13.9   3.4   27   94-120   105-131 (322)
 59 pfam01471 PG_binding_1 Putativ  24.9      49  0.0013   13.9   1.9   36  102-137     4-39  (57)
 60 PRK03525 crotonobetainyl-CoA:c  24.2      20  0.0005   16.6  -0.2   11  177-187   391-401 (405)
 61 cd02875 GH18_chitobiase Chitob  24.1      51  0.0013   13.8   3.4   28   93-120    91-118 (358)
 62 pfam04587 ADP_PFK_GK ADP-speci  23.6      49  0.0012   13.9   1.7   33   92-124   259-291 (444)
 63 pfam02375 JmjN jmjN domain.     23.4      53  0.0013   13.7   2.0   20   92-114     9-28  (34)
 64 pfam09388 SpoOE-like Spo0E lik  23.3      53  0.0013   13.7   3.0   25  100-124     7-31  (45)
 65 cd06545 GH18_3CO4_chitinase Th  23.1      53  0.0014   13.7   3.4   28   93-120    78-105 (253)
 66 PRK13795 hypothetical protein;  22.9      54  0.0014   13.6   3.0   37   80-121   423-459 (630)
 67 pfam07862 Nif11 Nitrogen fixat  22.7      51  0.0013   13.8   1.7   23   94-116    15-42  (49)
 68 TIGR02990 ectoine_eutA ectoine  22.6      50  0.0013   13.8   1.6   24  113-136   154-178 (239)
 69 TIGR01245 trpD anthranilate ph  22.1      56  0.0014   13.5   4.4   40   80-124    12-53  (331)
 70 KOG0031 consensus               22.0      42  0.0011   14.3   1.1   43   85-127   102-146 (171)
 71 cd01938 ADPGK_ADPPFK ADP-depen  21.9      56  0.0014   13.5   2.0   34   92-125   253-286 (445)
 72 TIGR01420 pilT_fam twitching m  21.7      14 0.00036   17.5  -1.4   14   29-42    158-171 (350)
 73 PRK07668 hypothetical protein;  21.6      57  0.0015   13.5   3.1   34   97-130     4-37  (248)
 74 pfam10932 DUF2783 Protein of u  21.5      57  0.0015   13.4   3.4   40   89-134     3-42  (60)
 75 TIGR01926 peroxid_rel uncharac  21.5      55  0.0014   13.5   1.6   21  109-129   141-161 (179)
 76 KOG1546 consensus               20.4      46  0.0012   14.1   1.1   37  103-139    87-131 (362)
 77 pfam00531 Death Death domain.   20.2      61  0.0016   13.3   3.4   26  100-125     9-34  (83)
 78 TIGR01369 CPSaseII_lrg carbamo  20.2      61  0.0016   13.3   4.7   46   76-130   468-518 (1089)

No 1  
>cd07321 Extradiol_Dioxygenase_3A_like Subunit A of Class III extradiol dioxygenases. Extradiol dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings.  There are two major groups of dioxygenases according to the cleavage site of the aromatic ring. Intradiol enzymes cleave the aromatic ring between two hydroxyl groups, whereas extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Extradiol dioxygenases can be divided into three classes. Class I and II enzymes are evolutionary related and show sequence similarity, with the two domain class II enzymes evolving from the class I enzyme through gene duplication. Class III enzymes are different in sequence and structure and usually have two subunits, designated A and B, which form a tetramer composed of two copies of each subunit. This model represents subunit A of c
Probab=74.92  E-value=4.8  Score=20.68  Aligned_cols=42  Identities=19%  Similarity=0.173  Sum_probs=34.3

Q ss_pred             CC-CCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHH
Q ss_conf             65-8989999999999999997599989997205999999999
Q gi|254781141|r   92 QT-KDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAY  133 (196)
Q Consensus        92 e~-~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~  133 (196)
                      ++ .||+.++.|..+-..|...+|.|++|...|.+--...+++
T Consensus        11 ~l~~~p~~rerF~~D~~a~l~~~~Lt~ee~~al~~~D~~~L~~   53 (77)
T cd07321          11 QLLVKPEVKERFKADPEAVLAEYGLTPEEKAALLARDVGALYV   53 (77)
T ss_pred             HHCCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHCCCHHHHHH
T ss_conf             9837999999998299999987699999999988568999999


No 2  
>pfam07746 LigA Aromatic-ring-opening dioxygenase LigAB, LigA subunit. This is a family of aromatic ring opening dioxygenases which catalyse the ring-opening reaction of protocatechuate and related compounds.
Probab=74.13  E-value=6.1  Score=19.98  Aligned_cols=42  Identities=14%  Similarity=0.165  Sum_probs=36.4

Q ss_pred             CCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHH
Q ss_conf             898999999999999999759998999720599999999999
Q gi|254781141|r   94 KDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYA  135 (196)
Q Consensus        94 ~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~A  135 (196)
                      .+|+-++.|..+-..|...+|.|++|...|.+-..+.+++-.
T Consensus         9 ~~~~~R~~F~~D~eay~~~~~Lt~eqr~av~~rD~~~L~~~G   50 (88)
T pfam07746         9 IKAENRERFLADEEAYLDEYGLTEEQKAAVRARDWNGLIALG   50 (88)
T ss_pred             CCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
T ss_conf             879999999839999987279999999999826699999945


No 3  
>cd07925 LigA_like_1 The A subunit of Uncharacterized proteins with similarity to Protocatechuate 4,5-dioxygenase (LigAB). The proteins of unknown function in this subfamily are similar to the A subunit of the Protocatechuate (PCA) 4,5-dioxygenase (LigAB). LigAB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Dioxygenases play key roles in the degradation of aromatic compounds. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds.
Probab=72.44  E-value=6.8  Score=19.63  Aligned_cols=43  Identities=19%  Similarity=0.270  Sum_probs=36.7

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHH
Q ss_conf             5898999999999999999759998999720599999999999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYA  135 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~A  135 (196)
                      +..|+-++.|+.+=..|...+|.|+++...|.|-..+.+|+-.
T Consensus        23 l~~~~nRe~F~ade~Ay~d~~~Lt~eqk~aV~~RD~~~ll~~G   65 (106)
T cd07925          23 FNDAANREAFLADEEAYCEKFGLTPEQKQAVRNRDVLRMLEAG   65 (106)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
T ss_conf             7779889999829999998769999999999973699999906


No 4  
>cd07921 PCA_45_Doxase_A_like Subunit A of the Class III Extradiol dioxygenase, Protocatechuate 4,5-dioxygenase, and similar enzymes. This subfamily includes the A subunit of protocatechuate (PCA) 4,5-dioxygenase (LigAB) and two subfamilies of unknown function. The A subunit is the smaller, non-catalytic subunit of LigAB. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds. PCA 4,5-dioxygenase is one of the aromatic ring opening dioxygenases which play key roles in the degradation of aromatic compounds. As members of the Class III extradiol dioxygenase family, the enzymes use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. LigAB-like class III enzymes are usually composed of two subunits, designated A and B, which form a tetramer composed of two copies of each subunit.
Probab=70.64  E-value=7.9  Score=19.22  Aligned_cols=43  Identities=19%  Similarity=0.227  Sum_probs=36.2

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHH
Q ss_conf             5898999999999999999759998999720599999999999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYA  135 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~A  135 (196)
                      +..|+-++.|+.+=..|...+|.|+++...|.|-..+.+|+..
T Consensus        23 l~~~~nRe~F~ade~Ay~d~~~Lt~eqk~aV~~RD~~~ll~~G   65 (106)
T cd07921          23 LNKAENREAFKADEEAYCDKFGLTEEQKQAVLDRDWLRLLELG   65 (106)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
T ss_conf             7779889999819999998769999999999973699999906


No 5  
>PRK13378 protocatechuate 4,5-dioxygenase subunit alpha; Provisional
Probab=66.93  E-value=10  Score=18.42  Aligned_cols=44  Identities=20%  Similarity=0.269  Sum_probs=36.9

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHH
Q ss_conf             65898999999999999999759998999720599999999999
Q gi|254781141|r   92 QTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYA  135 (196)
Q Consensus        92 e~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~A  135 (196)
                      .+..|+-++.|..+=..|...+|.|+++-..|.|-..+.+|+-.
T Consensus        33 Sl~~~~nRe~F~ade~AY~d~~~Lteeqr~aV~~RD~~~ll~lG   76 (117)
T PRK13378         33 SFNDAANRAAFLADEAAYCRKYGLNEEQKEAIRNRDVLQLLAAG   76 (117)
T ss_pred             HHCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
T ss_conf             96679999999829999998769999999999973799999946


No 6  
>cd07924 PCA_45_Doxase_A The A subunit of Protocatechuate 4,5-dioxygenase (LigAB) is the smaller, non-catalytic subunit. The A subunit is the non-catalytic subunit of Protocatechuate (PCA) 4,5-dioxygenase (LigAB), which is composed of A and B subunits that form a tetramer. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds. PCA 4,5-dioxygenase is one of the aromatic ring opening dioxygenases which  play key roles in the degradation of aromatic compounds. As a member of the Class III extradiol dioxygenase family, LigAB uses a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon.
Probab=64.54  E-value=11  Score=18.12  Aligned_cols=43  Identities=9%  Similarity=0.021  Sum_probs=35.9

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHH
Q ss_conf             5898999999999999999759998999720599999999999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYA  135 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~A  135 (196)
                      +..|+-++.|..+=..|...+|.|+++-..|.|-..+.+|+-.
T Consensus        31 L~~~~nRe~F~ade~Ay~d~~~Lteeqk~aV~~RD~~~li~lG   73 (121)
T cd07924          31 LMKAENRERFKADERAYLDKWPMTEEQKQAVLARDYNRMLALG   73 (121)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
T ss_conf             7679889999839999998769999999999974799999926


No 7  
>PRK13377 protocatechuate 4,5-dioxygenase subunit alpha; Provisional
Probab=63.16  E-value=12  Score=17.96  Aligned_cols=43  Identities=9%  Similarity=-0.011  Sum_probs=35.7

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHH
Q ss_conf             5898999999999999999759998999720599999999999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYA  135 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~A  135 (196)
                      +..|+-++.|+.+=..|...+|.|+++-..|.|-..+.+|+-.
T Consensus        34 L~~~~nRe~F~aDe~AY~d~~~Lteeqk~aV~~RD~~~li~lG   76 (129)
T PRK13377         34 LMKAENRARFKADERAYLDEWPMTEEQKQAVLARDLNRCIALG   76 (129)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
T ss_conf             7679899999819999998769999999999973799999926


No 8  
>pfam08855 DUF1825 Domain of unknown function (DUF1825). This uncharacterized family of proteins are principally found in cyanobacteria.
Probab=63.00  E-value=9.2  Score=18.78  Aligned_cols=44  Identities=14%  Similarity=0.216  Sum_probs=33.4

Q ss_pred             HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCH
Q ss_conf             999999958365898999999999999999759998999720599
Q gi|254781141|r   82 ENQQLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDH  126 (196)
Q Consensus        82 e~~~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~  126 (196)
                      -++.+..++ .++|+--|...+..|.+.+.-.||++++|+.+.|.
T Consensus        52 KQki~y~Rl-sLSdDpeA~~mke~i~~sa~~mG~~~~dm~~~F~~   95 (108)
T pfam08855        52 RQRIFYKRL-ELSDDFEAKMMKEQLKTQLNQFGFTPTNLLEMFDQ   95 (108)
T ss_pred             HHHHHHHHH-HHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH
T ss_conf             999999997-65589889999999999999859982159999999


No 9  
>cd07923 Gallate_dioxygenase_C The C-terminal domain of Gallate Dioxygenase, which catalyzes the oxidization and subsequent ring-opening of gallate. Gallate Dioxygenase catalyzes the oxidization and subsequent ring-opening of gallate, an intermediate in the degradation of the aromatic compound, syringate. The reaction product of gallate dioxygenase is 4-oxalomesaconate. The amino acid sequence of the N-terminal and C-terminal regions of gallate dioxygenase exhibits homology with the sequence of the PCA 4,5-dioxygenase B (catalytic) and A subunits, respectively. This model represents the C-terminal domain, which is similar to the A subunit of PCA 4,5-dioxygenase (or LigAB). The enzyme is estimated to be a homodimer according to the Escherichia coli enzyme. Since enzymes in this subfamily have fused A and B subunits, the dimer interface may resemble the tetramer interface of classical LigAB enzymes. This enzyme belongs to the class III extradiol dioxygenase family, composed of enzymes whi
Probab=61.79  E-value=13  Score=17.80  Aligned_cols=44  Identities=23%  Similarity=0.363  Sum_probs=36.6

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHH
Q ss_conf             58989999999999999997599989997205999999999999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQ  136 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am  136 (196)
                      +..|+-++.|..+=..|...+|.|+|+-..|-+-..+.+|+..-
T Consensus        15 l~~~~nRe~F~ade~Ay~d~~~Lt~eqk~av~~RDw~~mi~~Gg   58 (94)
T cd07923          15 LIEPAHRERFLEDPEALFDEAGLTEEERTLIRNRDWIGMIRYGV   58 (94)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCC
T ss_conf             77788899997099999987699999999999616999998376


No 10 
>pfam08506 Cse1 Cse1. This domain is present in Cse1 nuclear export receptor proteins. Cse1 mediates the nuclear export of importin alpha. This domain contains HEAT repeats.
Probab=60.69  E-value=13  Score=17.67  Aligned_cols=23  Identities=22%  Similarity=0.173  Sum_probs=18.7

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHHH
Q ss_conf             99918999943899999999999
Q gi|254781141|r   25 DDSSPTLEKTYPETYHRIRYLRQ   47 (196)
Q Consensus        25 ~~p~~~L~~~DP~~Y~~~k~~~~   47 (196)
                      -+.|++|++.||++|+|..-+--
T Consensus       200 Re~DeElFED~P~EyIRrDlEGS  222 (370)
T pfam08506       200 RESDEELFEDDPIEYIRRDLEGS  222 (370)
T ss_pred             CHHHHHHHCCCHHHHHHHHCCCC
T ss_conf             67779886679899998733347


No 11 
>PRK06800 fliH flagellar assembly protein H; Validated
Probab=58.91  E-value=14  Score=17.47  Aligned_cols=101  Identities=10%  Similarity=0.045  Sum_probs=48.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHCCCCCCH---------HHHHH
Q ss_conf             9438999999999999999999999999999999999999999--999999999999958365898---------99999
Q gi|254781141|r   33 KTYPETYHRIRYLRQKALNIINHFVETGRNPDNIAKELDSEIL--EKKLIAENQQLEHLFPQTKDP---------AQREN  101 (196)
Q Consensus        33 ~~DP~~Y~~~k~~~~~~~~~~~~~~~e~q~~~q~q~~~q~~~~--q~~l~~e~~~L~e~iPe~~D~---------~k~~~  101 (196)
                      =-|||+-.....+++...+.+...+.....-..+-..+|+..-  .+.|..+...+.+.+-||--.         +.-..
T Consensus        24 y~~~i~~~~ee~e~~~d~~eL~~~Q~~L~~e~~~l~~eqQ~l~~e~~~l~aek~~~~~~~~e~~~~~e~~r~~fekekqq  103 (229)
T PRK06800         24 FPKPIEVEEEEEEIQVDHEELLAQQKSLHKELNQLRQEQQKLERERQQLLADREQFQMHVQEQMKEIEAARQQFQKEQQE  103 (229)
T ss_pred             CCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             36702354312620226999999999999999999999999999999999889999999999999999999999999999


Q ss_pred             HHHHHHHHH--HHCCCCHHHHHHHCCHHHHHHHH
Q ss_conf             999999999--97599989997205999999999
Q gi|254781141|r  102 FFKNLFRIG--NILGFQKEEMVDITDHRLLSLAY  133 (196)
Q Consensus       102 ~~~~i~~~~--~~~Gfs~eEi~~v~D~R~i~vl~  133 (196)
                      .+..+.+++  .+....+.=|++..|.|.+.||-
T Consensus       104 ~~~~~t~~LwdeSi~LAEkIV~QaiDtRql~vlP  137 (229)
T PRK06800        104 TAYEWTELLWDQSFQLAEKIVNQAVDTRLLDVLP  137 (229)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             9999999999878889999999864377887889


No 12 
>KOG1991 consensus
Probab=58.70  E-value=14  Score=17.45  Aligned_cols=116  Identities=15%  Similarity=0.137  Sum_probs=59.8

Q ss_pred             HHHHHHHHHHHHHHHHHHC-----CCCCCCHHHHHHHHHHHHHHHHHHH-------------------HH-HHHHHHHHH
Q ss_conf             9999999999999997320-----6899918999943899999999999-------------------99-999999999
Q gi|254781141|r    4 QAKILAQSAEALAQLINHH-----IPDDSSPTLEKTYPETYHRIRYLRQ-------------------KA-LNIINHFVE   58 (196)
Q Consensus         4 ~a~~~aq~~~~l~q~~~~~-----~p~~p~~~L~~~DP~~Y~~~k~~~~-------------------~~-~~~~~~~~~   58 (196)
                      .+..+..+.+-+.-++..-     -++.-+++|+++||.+|.|.+...-                   .+ ++.++.+.+
T Consensus       334 ~~~twkll~PHl~~ii~~vIFPlmc~~d~deelwe~DP~EYiR~~~Di~ed~~sp~~Aa~~~l~~~~~KR~ke~l~k~l~  413 (1010)
T KOG1991         334 HASTWKLLKPHLQVIIQDVIFPLMCFNDEDEELWEEDPYEYIRKKFDIFEDGYSPDTAALDFLTTLVSKRGKETLPKILS  413 (1010)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCHHHHHHHCCCHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHH
T ss_conf             89899874259999998742331078813187774197999985373104567974789999999998403222266999


Q ss_pred             HHHHHHHHHHHH---------HHHH------HHH------HHHHHHH--HHHHHCCCCCCHHHHHHHHHHHHHHHH---H
Q ss_conf             999999999999---------9999------999------9999999--999958365898999999999999999---7
Q gi|254781141|r   59 TGRNPDNIAKEL---------DSEI------LEK------KLIAENQ--QLEHLFPQTKDPAQRENFFKNLFRIGN---I  112 (196)
Q Consensus        59 e~q~~~q~q~~~---------q~~~------~q~------~l~~e~~--~L~e~iPe~~D~~k~~~~~~~i~~~~~---~  112 (196)
                      =.-.+-..-.+.         ..-+      ...      -.+.+.+  .....+|+|++|---  +++..+.++.   +
T Consensus       414 F~~~Il~~~~~~~~~~~~~rqkdGAL~~vgsl~~~L~K~s~~~~~mE~flv~hVfP~f~s~~g~--Lrarac~vl~~~~~  491 (1010)
T KOG1991         414 FIVDILTRYKEASPPNKNPRQKDGALRMVGSLASILLKKSPYKSQMEYFLVNHVFPEFQSPYGY--LRARACWVLSQFSS  491 (1010)
T ss_pred             HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHH--HHHHHHHHHHHHHH
T ss_conf             9999997511369976573665218999999999980687067899999998731753283267--89999999998774


Q ss_pred             CCCCHHHHH
Q ss_conf             599989997
Q gi|254781141|r  113 LGFQKEEMV  121 (196)
Q Consensus       113 ~Gfs~eEi~  121 (196)
                      .-|+++-+-
T Consensus       492 ~df~d~~~l  500 (1010)
T KOG1991         492 IDFKDPNNL  500 (1010)
T ss_pred             CCCCCHHHH
T ss_conf             368874789


No 13 
>COG0587 DnaE DNA polymerase III, alpha subunit [DNA replication, recombination, and repair]
Probab=58.25  E-value=15  Score=17.40  Aligned_cols=54  Identities=19%  Similarity=0.163  Sum_probs=42.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             9999999999999999995836589899999999999999975999899972059
Q gi|254781141|r   71 DSEILEKKLIAENQQLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus        71 q~~~~q~~l~~e~~~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                      -.+...-|--.|...|-.+|- +++++.-.+++..+++-+...||+.+-...+.|
T Consensus       683 IAq~~agfSlgeAD~LRRAMg-KK~~~~m~~~r~~F~~Ga~~~G~~~~~a~~ifd  736 (1139)
T COG0587         683 IAQVLAGFSLGEADLLRRAMG-KKKAEEMEKQREKFIEGAVKNGYDKEFAEKIFD  736 (1139)
T ss_pred             HHHHHCCCCCHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH
T ss_conf             999971899426789999871-477899999999999789876999899999999


No 14 
>pfam01244 Peptidase_M19 Membrane dipeptidase (Peptidase family M19).
Probab=57.61  E-value=15  Score=17.33  Aligned_cols=28  Identities=21%  Similarity=0.163  Sum_probs=11.5

Q ss_pred             HHHHHHHHCCCCHHHHHHHCCHHHHHHH
Q ss_conf             9999999759998999720599999999
Q gi|254781141|r  105 NLFRIGNILGFQKEEMVDITDHRLLSLA  132 (196)
Q Consensus       105 ~i~~~~~~~Gfs~eEi~~v~D~R~i~vl  132 (196)
                      +|.+.+...|||++||..|.--..+.++
T Consensus       287 ~l~~~L~~rG~se~~i~ki~g~N~lRvl  314 (316)
T pfam01244       287 NLTAELLRRGYSEAEIEKILGGNWLRVL  314 (316)
T ss_pred             HHHHHHHHCCCCHHHHHHHHHHHHHHHH
T ss_conf             9999999869899999999856399984


No 15 
>TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085   In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination.
Probab=56.80  E-value=11  Score=18.28  Aligned_cols=28  Identities=18%  Similarity=0.325  Sum_probs=22.4

Q ss_pred             HHHHHHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             9999999999999975999899972059
Q gi|254781141|r   98 QRENFFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus        98 k~~~~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                      .....+.++...+.++||.+.||+.+..
T Consensus       167 ~~~~~y~E~~~aL~sLGY~~~Ei~~~l~  194 (217)
T TIGR00084       167 AEAARYDELFEALVSLGYKPQEIQKALK  194 (217)
T ss_pred             HHHHHHHHHHHHHHHCCCCHHHHHHHHH
T ss_conf             1021188999999972988889999998


No 16 
>COG3184 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.91  E-value=17  Score=16.93  Aligned_cols=97  Identities=13%  Similarity=0.026  Sum_probs=50.1

Q ss_pred             CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH
Q ss_conf             068999189999438999999999999999999999999-9999999999999999999999999999583658989999
Q gi|254781141|r   22 HIPDDSSPTLEKTYPETYHRIRYLRQKALNIINHFVETG-RNPDNIAKELDSEILEKKLIAENQQLEHLFPQTKDPAQRE  100 (196)
Q Consensus        22 ~~p~~p~~~L~~~DP~~Y~~~k~~~~~~~~~~~~~~~e~-q~~~q~q~~~q~~~~q~~l~~e~~~L~e~iPe~~D~~k~~  100 (196)
                      -.-.+|...-.-+|-+.-.++.-----.-+....+.-+. +.+.++--+..-......++.-.++.+..-|+|+|=    
T Consensus        21 af~~p~~~q~v~~~~~~aa~~ai~~~~~~~~fd~i~~q~a~~l~~~liqa~p~~~~~i~~r~~ekal~lA~~~~dl----   96 (183)
T COG3184          21 AFATPPSDQDVAEDLDTAARQAIAAKNATELFDAILPQYAQMLAAELIQAKPDQQALILERYDEKALKLAPRRADL----   96 (183)
T ss_pred             HCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHH----
T ss_conf             0358851388999899999999998188711003207899999988764495789999999999999865008775----


Q ss_pred             HHHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             9999999999975999899972059
Q gi|254781141|r  101 NFFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus       101 ~~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                       ++..+.-|+  --||.+||+.++.
T Consensus        97 -e~e~~~~Ya--~~FT~eEl~ai~a  118 (183)
T COG3184          97 -EREAALLYA--KIFTEEELNAITA  118 (183)
T ss_pred             -HHHHHHHHH--HHCCHHHHHHHHH
T ss_conf             -199999999--8647999999999


No 17 
>PRK13379 protocatechuate 4,5-dioxygenase subunit alpha; Provisional
Probab=52.54  E-value=18  Score=16.79  Aligned_cols=44  Identities=16%  Similarity=0.191  Sum_probs=35.9

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHH
Q ss_conf             58989999999999999997599989997205999999999999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQ  136 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am  136 (196)
                      +..|+-++.|+.+=..|...+|.|++|-..|-+-..+.+++..-
T Consensus        35 l~~~~~Re~F~aDe~Ay~d~~~LTeEqk~aV~~RDw~~li~~Gg   78 (119)
T PRK13379         35 MIRAPWRDRFLQDAEALMQEADLTEQEKELIRARDWLGLVQYGA   78 (119)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCC
T ss_conf             67788899998399999977599999999999615999998476


No 18 
>TIGR02389 RNA_pol_rpoA2 DNA-directed RNA polymerase, subunit A''; InterPro: IPR012757   DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel.   RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise:  RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors.  RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs.  Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits.    This family consists of the archaeal A'' subunit of the DNA-directed RNA polymerase. The example from Methanococcus jannaschii contains an intein.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription.
Probab=50.77  E-value=19  Score=16.69  Aligned_cols=45  Identities=18%  Similarity=0.025  Sum_probs=38.0

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHH
Q ss_conf             658989999999999999997599989997205999999999999999997
Q gi|254781141|r   92 QTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQIGLQSQ  142 (196)
Q Consensus        92 e~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am~y~~~~  142 (196)
                      =+.-++-|+....++..++.++|.++      .|=|++++.-|.|-|+-..
T Consensus       288 VLGIEAAR~AII~E~~~tL~EQGL~d------VDiRHlmLvAD~MT~~G~V  332 (397)
T TIGR02389       288 VLGIEAARNAIIEEIKRTLEEQGLDD------VDIRHLMLVADLMTWDGEV  332 (397)
T ss_pred             HCCHHHHHHHHHHHHHHHHHHCCCCH------HHHHHHHHHHHHCCCCCEE
T ss_conf             30668999999999999986428860------2489898886510788548


No 19 
>pfam04695 Pex14_N Peroxisomal membrane anchor protein (Pex14p) conserved region. Family of peroxisomal membrane anchor proteins which bind the PTS1 (peroxisomal targeting signal) receptor and are required for the import of PTS1-containing proteins into peroxisomes. Loss of functional Pex14p results in defects in both the PTS1 and PTS2-dependent import pathways. Deletion analysis of this conserved region implicates it in selective peroxisome degradation. In the majority of members this region is situated at the N-terminus of the protein.
Probab=49.88  E-value=16  Score=17.20  Aligned_cols=39  Identities=18%  Similarity=0.269  Sum_probs=32.6

Q ss_pred             HHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHH
Q ss_conf             999958365898999999999999999759998999720
Q gi|254781141|r   85 QLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDI  123 (196)
Q Consensus        85 ~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v  123 (196)
                      .+..+++=+.||.++..=...=+.|+.+-|.+++||...
T Consensus         5 li~~Av~FL~~p~V~~sp~~~k~~FL~sKgLt~~eI~~a   43 (79)
T pfam04695         5 LVESAVKFLQDPSVRDSPLEKKIEFLESKGLTEEEIDEA   43 (79)
T ss_pred             HHHHHHHHHCCHHHHCCCHHHHHHHHHHCCCCHHHHHHH
T ss_conf             999999995680020085899999999759999999999


No 20 
>cd07922 CarBa CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and   subsequent ring-opening of 2-aminophenyl-2,3-diol. CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. 2-aminophenol 1,6-dioxygenase is a key enzyme in the carbazole degradation pathway isolated from bacterial strains with carbazole degradation ability. The enzyme is a heterotetramer composed of two A and two B subunits. CarB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Although the enzyme was originally isolated as a meta-cleavage enzyme for 2'-aminobiphenyl-2,3-diol involved in carbazole degradation, the enzyme has also shown high specificity for 2,3-dihydroxybiphenyl.
Probab=47.95  E-value=21  Score=16.32  Aligned_cols=35  Identities=20%  Similarity=0.236  Sum_probs=30.5

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHH
Q ss_conf             58989999999999999997599989997205999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHR  127 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R  127 (196)
                      ++||..-+.+..+-......||.|++|...+.+.-
T Consensus        14 ~r~p~~~e~~~~dp~av~e~y~Lt~~E~~AL~eg~   48 (81)
T cd07922          14 FKDPGLIERFQDDPSAVFEEYGLTPAERAALREGT   48 (81)
T ss_pred             HCCHHHHHHHHHCHHHHHHHCCCCHHHHHHHHCCC
T ss_conf             36988899987498998775289989999986388


No 21 
>pfam09440 eIF3_N eIF3 subunit 6 N terminal domain. This is the N terminal domain of subunit 6 translation initiation factor eIF3.
Probab=47.47  E-value=22  Score=16.27  Aligned_cols=46  Identities=15%  Similarity=0.170  Sum_probs=30.4

Q ss_pred             HHHHHHHHHHHCCCCCCHHHHHHHHH---HHHHHH-HHCCCCHHHHHHHCCHH
Q ss_conf             99999999995836589899999999---999999-97599989997205999
Q gi|254781141|r   79 LIAENQQLEHLFPQTKDPAQRENFFK---NLFRIG-NILGFQKEEMVDITDHR  127 (196)
Q Consensus        79 l~~e~~~L~e~iPe~~D~~k~~~~~~---~i~~~~-~~~Gfs~eEi~~v~D~R  127 (196)
                      ++++...+.+.+   .||++...+++   ....|+ ..+||+.+-|+++|+.-
T Consensus        81 l~~~~~~i~~~l---~d~~v~~~lrsdk~~nl~~L~~~~~it~e~i~aLY~~a  130 (133)
T pfam09440        81 LQEEVAPIVKFL---EDPEVIKNLRSDKKANLEFLEEEHGITSEMVNALYKFA  130 (133)
T ss_pred             HHHHHHHHHHHH---CCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
T ss_conf             999889999996---59999999882489999999986199999999999877


No 22 
>pfam02787 CPSase_L_D3 Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain.
Probab=46.63  E-value=19  Score=16.60  Aligned_cols=25  Identities=16%  Similarity=0.312  Sum_probs=19.6

Q ss_pred             HHHHHHHHHHCCCCHHHHHHHCCHH
Q ss_conf             9999999997599989997205999
Q gi|254781141|r  103 FKNLFRIGNILGFQKEEMVDITDHR  127 (196)
Q Consensus       103 ~~~i~~~~~~~Gfs~eEi~~v~D~R  127 (196)
                      -.++...++.+||||..|+.+...-
T Consensus        60 ~~~~l~~aK~~GFSD~~IA~l~~~~   84 (122)
T pfam02787        60 SDDLLLKAKKLGFSDAQIAKLLGVT   84 (122)
T ss_pred             CHHHHHHHHHHCCCHHHHHHHHCCC
T ss_conf             9999999999499989999883986


No 23 
>cd06549 GH18_trifunctional GH18 domain of an uncharacterized family of bacterial proteins, which share a common three-domain architecture: an N-terminal glycosyl hydrolase family 18 (GH18) domain, a glycosyl transferase family 2 domain, and a C-terminal polysaccharide deacetylase domain.
Probab=42.01  E-value=26  Score=15.73  Aligned_cols=25  Identities=20%  Similarity=0.204  Sum_probs=22.3

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCH
Q ss_conf             5898999999999999999759998
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQK  117 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~  117 (196)
                      ++||+.++++..+|..++..+||.-
T Consensus        83 L~d~~~R~~~i~~i~~~~~~~~~dG  107 (298)
T cd06549          83 LADPSARAKFIANIAAYLERNQADG  107 (298)
T ss_pred             HCCHHHHHHHHHHHHHHHHHCCCCE
T ss_conf             3799999999999999999829983


No 24 
>pfam10302 DUF2407 Putative membrane protein (DUF2407). This is a family of proteins found in fungi. The function is not known. There is a characteristic GFDRL sequence motif.
Probab=40.77  E-value=17  Score=17.00  Aligned_cols=18  Identities=17%  Similarity=0.008  Sum_probs=13.6

Q ss_pred             HHHH--HHHCCCCHHHHHHH
Q ss_conf             9999--99759998999720
Q gi|254781141|r  106 LFRI--GNILGFQKEEMVDI  123 (196)
Q Consensus       106 i~~~--~~~~Gfs~eEi~~v  123 (196)
                      .+-|  +.+.|||++||+.+
T Consensus       119 ~rGFDRLlsaGFs~~EV~~L  138 (252)
T pfam10302       119 PRGFDRLLSAGFSPAEVSLL  138 (252)
T ss_pred             CCCHHHHHHCCCCHHHHHHH
T ss_conf             74487898758999999999


No 25 
>cd02874 GH18_CFLE_spore_hydrolase Cortical fragment-lytic enzyme (CFLE) is a peptidoglycan hydrolase involved in  bacterial endospore germination.  CFLE is expressed as an inactive preprotein (called SleB) in the forespore compartment of sporulating cells.  SleB translocates across the forespore inner membrane and is deposited as a mature enzyme in the cortex layer of the spore.  As part of a sensory mechanism capable of initiating germination, CFLE degrades a spore-specific peptidoglycan constituent called muramic-acid delta-lactam that comprises the outer cortex.  CFLE has a C-terminal glycosyl hydrolase family 18 (GH18) catalytic domain as well as two N-terminal LysM peptidoglycan-binding domains.  In addition to SleB, this family includes YaaH, YdhD, and YvbX from Bacillus subtilis.
Probab=40.57  E-value=27  Score=15.58  Aligned_cols=25  Identities=16%  Similarity=0.539  Sum_probs=20.6

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCH
Q ss_conf             5898999999999999999759998
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQK  117 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~  117 (196)
                      .+||+++++|..+|.+++..+||.-
T Consensus        82 ls~~~~R~~fi~siv~~~~~~~fDG  106 (313)
T cd02874          82 LSNPEARQRLINNILALAKKYGYDG  106 (313)
T ss_pred             HCCHHHHHHHHHHHHHHHHHCCCCC
T ss_conf             6799999999999999999819985


No 26 
>KOG0455 consensus
Probab=40.22  E-value=7.4  Score=19.39  Aligned_cols=48  Identities=15%  Similarity=0.324  Sum_probs=39.0

Q ss_pred             HHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCH----HHHHHHCCHHHHHHHH
Q ss_conf             99958365898999999999999999759998----9997205999999999
Q gi|254781141|r   86 LEHLFPQTKDPAQRENFFKNLFRIGNILGFQK----EEMVDITDHRLLSLAY  133 (196)
Q Consensus        86 L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~----eEi~~v~D~R~i~vl~  133 (196)
                      |--.+-||+|-+....-.+++...++.+|||+    ++++++-=+|-+.+|-
T Consensus       180 LsYifne~s~gk~~~~sfsdvVk~AKklGYTEPDPRDDLnGmDVARKvtIl~  231 (364)
T KOG0455         180 LSYIFNELSDGKPGTLSFSDVVKAAKKLGYTEPDPRDDLNGMDVARKVTILA  231 (364)
T ss_pred             HHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH
T ss_conf             7888777605998756599999999972899999642346511344432311


No 27 
>pfam06382 DUF1074 Protein of unknown function (DUF1074). This family consists of several proteins which appear to be specific to Drosophila melanogaster. The function of this family is unknown.
Probab=39.84  E-value=28  Score=15.51  Aligned_cols=61  Identities=13%  Similarity=0.179  Sum_probs=37.1

Q ss_pred             CCCCCHHHHHH----HHHHHHHHHHH-CCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC
Q ss_conf             36589899999----99999999997-5999899972059999999999999999974300112320369843478
Q gi|254781141|r   91 PQTKDPAQREN----FFKNLFRIGNI-LGFQKEEMVDITDHRLLSLAYYAQIGLQSQKLSEDAYHKIRHKPFVNIT  161 (196)
Q Consensus        91 Pe~~D~~k~~~----~~~~i~~~~~~-~Gfs~eEi~~v~D~R~i~vl~~Am~y~~~~~~k~~a~kk~~~~p~~~~~  161 (196)
                      |-++-.+++.-    .-+=|++|-+. +|+|++||          |.+-|+.|.+|-.+.+.+-+.++++.++-++
T Consensus        75 p~yK~qKv~~vTnNgYLNflrefkKk~~gLsPqdm----------v~~aAkaW~~LS~aeke~fk~kkp~tv~~k~  140 (187)
T pfam06382        75 PIYKRQKVARVTNNGYLNFMTEYKKRFYGLSPQDM----------VHYAAKQWTQLSMAEKEAFKSKKPSTITLKS  140 (187)
T ss_pred             CHHHHHHEEECCCCCHHHHHHHHHHHHCCCCHHHH----------HHHHHHHHHHHHHHHHHHHCCCCCCEEEECC
T ss_conf             33432010001356316999999988639988999----------9999999887658889886147998478638


No 28 
>COG2355 Zn-dependent dipeptidase, microsomal dipeptidase homolog [Amino acid transport and metabolism]
Probab=39.66  E-value=28  Score=15.49  Aligned_cols=43  Identities=19%  Similarity=0.083  Sum_probs=27.4

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHH
Q ss_conf             36589899999999999999975999899972059999999999999
Q gi|254781141|r   91 PQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQI  137 (196)
Q Consensus        91 Pe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am~  137 (196)
                      +.|.|+.    ...++...+..-||+++||..+.--..+.|+...|.
T Consensus       270 ~gled~~----~l~~l~~~L~~~G~~e~~i~~i~~~N~lRV~~~~~~  312 (313)
T COG2355         270 DGLEDVG----KLPNLTAALIERGYSEEEIEKIAGENWLRVLKEVWG  312 (313)
T ss_pred             HHHCCHH----HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHC
T ss_conf             1012756----779999999976999999999998769999999736


No 29 
>TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302; InterPro: IPR012683    Members of this family are bacterial proteins from the alphaproteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted to be outside the membrane. Bradyrhizobium japonicum contains two tandem copies that are, together, homologous to the single members found in other species. .
Probab=38.95  E-value=29  Score=15.42  Aligned_cols=113  Identities=12%  Similarity=0.043  Sum_probs=61.9

Q ss_pred             HHHHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             999999999999999995-8365898999999999999999759998999720599999999999999999743001123
Q gi|254781141|r   72 SEILEKKLIAENQQLEHL-FPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQIGLQSQKLSEDAYH  150 (196)
Q Consensus        72 ~~~~q~~l~~e~~~L~e~-iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am~y~~~~~~k~~a~k  150 (196)
                      ..++.+.+..=.++-+-+ -=+.++-+++=--..+-..-++..|=|+|||..++|-     ||+||.- -+..-   +..
T Consensus       506 DdaLRd~ad~lW~~AlgIEdGdlS~AerrLR~AQ~AL~dALergASdeEI~~Ltd~-----LR~Amq~-ymr~l---Aqq  576 (899)
T TIGR02302       506 DDALRDVADYLWEVALGIEDGDLSAAERRLRDAQDALKDALERGASDEEIKKLTDE-----LRKAMQE-YMREL---AQQ  576 (899)
T ss_pred             CCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH-----HHHHHHH-HHHHH---HHH
T ss_conf             60488998777664441036764179999999999999997438888899999999-----9999999-99999---999


Q ss_pred             HCCCCCCC-CCCCCC--CCCCCCHHHHHHHHHHHHHHCCHHHHHHH
Q ss_conf             20369843-478877--87632148999999999761999999750
Q gi|254781141|r  151 KIRHKPFV-NITTNK--AKNHRRITSQEKAIQKLYQTGSLYDSLEI  193 (196)
Q Consensus       151 k~~~~p~~-~~~~~~--~~~~~~~~~~~ka~~rL~ksG~v~DA~al  193 (196)
                      ..+++-.. .+--+.  .=....-.+==..+.+|..||+-+.|--|
T Consensus       577 ~~~NP~~~a~p~dpN~~vlrqqDL~~M~dqiEnLaRsG~rdqA~Ql  622 (899)
T TIGR02302       577 LRKNPAALARPEDPNAKVLRQQDLQKMMDQIENLARSGDRDQAKQL  622 (899)
T ss_pred             HHHCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH
T ss_conf             8605597718888735664688899999999975067887899999


No 30 
>PRK13367 protocatechuate 4,5-dioxygenase; Provisional
Probab=38.87  E-value=29  Score=15.41  Aligned_cols=55  Identities=24%  Similarity=0.238  Sum_probs=42.9

Q ss_pred             HHHHHHC-CCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHH
Q ss_conf             9999958-36589899999999999999975999899972059999999999999999974
Q gi|254781141|r   84 QQLEHLF-PQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQIGLQSQK  143 (196)
Q Consensus        84 ~~L~e~i-Pe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am~y~~~~~  143 (196)
                      .-|...| |+|+     +.|..+-..+....|.|+||-.-|..-.-+-+++....|.=+.+
T Consensus       323 ~flh~li~p~~R-----~rf~~d~ea~~~e~~Lt~ee~~li~~rdw~gli~yGv~ff~leK  378 (418)
T PRK13367        323 RFLHRLIEPAWR-----ERFLADPEALFDEAGLSEEERDLIRRRDWRGLIHYGVIFFLLEK  378 (418)
T ss_pred             HHHHHHCCHHHH-----HHHHHCHHHHHHHCCCCHHHHHHHHCCCHHHHHHHCHHHHHHHH
T ss_conf             999996483779-----99862989898763999899999860548899983326999998


No 31 
>pfam03359 GKAP Guanylate-kinase-associated protein (GKAP) protein.
Probab=38.21  E-value=30  Score=15.34  Aligned_cols=70  Identities=14%  Similarity=0.133  Sum_probs=39.0

Q ss_pred             CCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH---------HHCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHH
Q ss_conf             999899972059999999999999999974300112---------32036984347887787-63214899999999976
Q gi|254781141|r  114 GFQKEEMVDITDHRLLSLAYYAQIGLQSQKLSEDAY---------HKIRHKPFVNITTNKAK-NHRRITSQEKAIQKLYQ  183 (196)
Q Consensus       114 Gfs~eEi~~v~D~R~i~vl~~Am~y~~~~~~k~~a~---------kk~~~~p~~~~~~~~~~-~~~~~~~~~ka~~rL~k  183 (196)
                      --|-+++.+.-|-=.+.|=+=-|+|++|..-+...=         ++..+.+..++...+.+ .+...+.|..+++||..
T Consensus       233 ~~t~~DL~GFWDm~~lqiedv~~kF~~l~~lk~n~W~~~~~p~~~kk~~k~~~~~~~~~r~~s~ds~dk~RqeARkRL~A  312 (342)
T pfam03359       233 RTTSQDLAGFWDMLQLQIEDVDKKFDELQQLKANDWQKKVPPPVPKKPAKGPIVKPAISREKSLDSSDRQRQEARSRLAA  312 (342)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH
T ss_conf             88646778788989854998999999999998769863788999988889999998865344554157789999999999


No 32 
>pfam07499 RuvA_C RuvA, C-terminal domain. Homologous recombination is a crucial process in all living organisms. In bacteria, this process the RuvA, RuvB, and RuvC proteins are involved. More specifically the proteins process the Holliday junction DNA. RuvA is comprised of three distinct domains. The domain represents the C-terminal domain and plays a significant role in the ATP-dependent branch migration of the hetero-duplex through direct contact with RuvB. Within the Holliday junction, the C-terminal domain makes no interaction with DNA.
Probab=38.01  E-value=29  Score=15.47  Aligned_cols=25  Identities=12%  Similarity=0.163  Sum_probs=19.8

Q ss_pred             HHHHHHHHHHHCCCCHHHHHHHCCH
Q ss_conf             9999999999759998999720599
Q gi|254781141|r  102 FFKNLFRIGNILGFQKEEMVDITDH  126 (196)
Q Consensus       102 ~~~~i~~~~~~~Gfs~eEi~~v~D~  126 (196)
                      ...+....+.++||+..|+..+...
T Consensus         2 ~~~da~~AL~~LGy~~~ea~~av~~   26 (47)
T pfam07499         2 ALEEAVSALLALGYKEKEAEKAVAK   26 (47)
T ss_pred             HHHHHHHHHHHCCCCHHHHHHHHHH
T ss_conf             5889999999939989999999999


No 33 
>pfam01793 Glyco_transf_15 Glycolipid 2-alpha-mannosyltransferase. This is a family of alpha-1,2 mannosyl-transferases involved in N-linked and O-linked glycosylation of proteins. Some of the enzymes in this family have been shown to be involved in O- and N-linked glycan modifications in the Golgi.
Probab=35.74  E-value=26  Score=15.73  Aligned_cols=29  Identities=17%  Similarity=0.185  Sum_probs=22.5

Q ss_pred             HCCCCCCHHHHHHHHHHHHHHHHHCCCCH
Q ss_conf             58365898999999999999999759998
Q gi|254781141|r   89 LFPQTKDPAQRENFFKNLFRIGNILGFQK  117 (196)
Q Consensus        89 ~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~  117 (196)
                      .+|+|-|..++.+.+..+.+-+..||.|.
T Consensus        73 ~~P~~id~~k~~~~~~~~~~~~i~yg~s~  101 (274)
T pfam01793        73 SYPEWIDQDKAAESREKMGEKGIIYGDSE  101 (274)
T ss_pred             CCCCCCCHHHHHHHHHHHHHCCCCCCCCH
T ss_conf             79998598999999999887098765425


No 34 
>PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional
Probab=35.36  E-value=25  Score=15.80  Aligned_cols=24  Identities=21%  Similarity=0.217  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             999999999975999899972059
Q gi|254781141|r  102 FFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus       102 ~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                      ...++...+.++||+..++..+.+
T Consensus       143 ~~~e~~~AL~~LGy~~~~a~~al~  166 (196)
T PRK13901        143 KFKELEQSIVNMGFDRKLVNSAIK  166 (196)
T ss_pred             CHHHHHHHHHHCCCCHHHHHHHHH
T ss_conf             489999999984999899999999


No 35 
>pfam04877 Hairpins HrpZ. HrpZ from the plant pathogen Pseudomonas syringae binds to lipid bilayers and forms a cation-conducting pore in vivo. This pore-forming activity may allow nutrient release or delivery of virulence factors during bacterial colonisation of host plants. The family of hairpinN proteins, Harpin, has been merged into this family. HrpN is a virulence determinant which elicits lesion formation in Arabidopsis and tobacco and triggers systemic resistance in Arabidopsis.
Probab=35.01  E-value=33  Score=15.01  Aligned_cols=11  Identities=18%  Similarity=0.413  Sum_probs=4.4

Q ss_pred             HHHHHHHHHHH
Q ss_conf             99999999732
Q gi|254781141|r   11 SAEALAQLINH   21 (196)
Q Consensus        11 ~~~~l~q~~~~   21 (196)
                      ++..|+||+.+
T Consensus       166 ml~eIaQFMDd  176 (277)
T pfam04877       166 MLKEIAQFMDD  176 (277)
T ss_pred             HHHHHHHHHHC
T ss_conf             99999998613


No 36 
>TIGR01991 HscA Fe-S protein assembly chaperone HscA; InterPro: IPR010236   Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems.   The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly .   The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets.   In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen .   This entry represents the HscA chaperone protein from the SUF system. HscA (or Hsc66) is a specialised bacterial Hsp70-class molecular chaperone that participates in the assembly of iron-sulphur cluster proteins. HscA resembles DnaK, but belongs to a separate clade. HscA interacts with IscU, which is believed to serve as a template for Fe-S cluster formation. The HscA-IscU interaction is facilitated by the J-type co-chaperone protein HscB (or Hsc20), which binds to both HscA and IscU, bringing them into contact with each other. HscA recognises a conserved LPPVK sequence motif at positions 99-103 of IscU .; GO: 0005524 ATP binding, 0016887 ATPase activity, 0051082 unfolded protein binding, 0016226 iron-sulfur cluster assembly.
Probab=34.83  E-value=30  Score=15.34  Aligned_cols=10  Identities=0%  Similarity=-0.300  Sum_probs=4.1

Q ss_pred             HHHHHHHHHH
Q ss_conf             9999999999
Q gi|254781141|r  131 LAYYAQIGLQ  140 (196)
Q Consensus       131 vl~~Am~y~~  140 (196)
                      ||+|+|-|-.
T Consensus       523 MlkDs~s~a~  532 (628)
T TIGR01991       523 MLKDSLSYAE  532 (628)
T ss_pred             HHHHHHHHHH
T ss_conf             9998876578


No 37 
>pfam11867 DUF3387 Domain of unknown function (DUF3387). This domain is functionally uncharacterized. This domain is found in bacteria and archaea. This presumed domain is typically between 255 to 340 amino acids in length. This domain is found associated with pfam04851, pfam04313.
Probab=34.54  E-value=34  Score=14.96  Aligned_cols=136  Identities=14%  Similarity=0.137  Sum_probs=56.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHH
Q ss_conf             99999999999999999999999999999999999999999-------58365898999999999999999759998999
Q gi|254781141|r   48 KALNIINHFVETGRNPDNIAKELDSEILEKKLIAENQQLEH-------LFPQTKDPAQRENFFKNLFRIGNILGFQKEEM  120 (196)
Q Consensus        48 ~~~~~~~~~~~e~q~~~q~q~~~q~~~~q~~l~~e~~~L~e-------~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi  120 (196)
                      =+.+-++.+....-........-+...+.+.|+.-..+-..       .|-|+-+   -.++..+-..-|..+|.|++|+
T Consensus       170 laiE~L~kLL~~~Ik~~~~~N~v~~~~FsE~Le~~i~~Y~nr~it~aevieELi~---lakel~~e~~rg~~lGLs~eEl  246 (333)
T pfam11867       170 LALELLKKLLNDEIKSRSKTNLVQSKKFSERLEEAINKYHNRAITTAEVIEELIK---LAKEIREADKRGEELGLSEEEI  246 (333)
T ss_pred             HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH---HHHHHHHHHHHHHHCCCCHHHH
T ss_conf             7999999999999999970068989999999999999998077749999999999---9999998886235259998999


Q ss_pred             HHHCCHHHHHH-HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHH
Q ss_conf             72059999999-99999999997430011232036984347887787632148999999999761999999
Q gi|254781141|r  121 VDITDHRLLSL-AYYAQIGLQSQKLSEDAYHKIRHKPFVNITTNKAKNHRRITSQEKAIQKLYQTGSLYDS  190 (196)
Q Consensus       121 ~~v~D~R~i~v-l~~Am~y~~~~~~k~~a~kk~~~~p~~~~~~~~~~~~~~~~~~~ka~~rL~ksG~v~DA  190 (196)
                      + .||.=.--- ....|--+.+..-...-+..++....+   -=..+.+.++.-|..-+.=|+|-|=.-|-
T Consensus       247 A-FYDaLa~~~sa~~~~g~e~L~~iA~el~~~vr~~~~i---DW~~re~vrAklR~~vkrlL~k~gYPPd~  313 (333)
T pfam11867       247 A-FYDALADNESAVREMGDDKLKKIAKELVEKLRKNVTV---DWQKRESVRAKLRVLVKRILRKYGYPPDK  313 (333)
T ss_pred             H-HHHHHHCCHHHHHHCCCHHHHHHHHHHHHHHHHCCCC---CCCCCHHHHHHHHHHHHHHHHHCCCCCHH
T ss_conf             9-9999964416888758289999999999999865884---76300748999999999999985989226


No 38 
>KOG2629 consensus
Probab=34.41  E-value=25  Score=15.88  Aligned_cols=95  Identities=13%  Similarity=0.111  Sum_probs=44.6

Q ss_pred             HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
Q ss_conf             99999732068999189999438999999999999999999999999999999999999999999999999999958365
Q gi|254781141|r   14 ALAQLINHHIPDDSSPTLEKTYPETYHRIRYLRQKALNIINHFVETGRNPDNIAKELDSEILEKKLIAENQQLEHLFPQT   93 (196)
Q Consensus        14 ~l~q~~~~~~p~~p~~~L~~~DP~~Y~~~k~~~~~~~~~~~~~~~e~q~~~q~q~~~q~~~~q~~l~~e~~~L~e~iPe~   93 (196)
                      .+..++.+-+-.++.+.|...++.    .-..+......++++..+..++.+....+|. .+-.-|......|.   -.-
T Consensus       103 ~~K~YV~P~~l~~~~~k~e~~k~~----Ld~~~~~~~~~~~~l~~~va~v~q~~~~qq~-Els~~L~~l~~~~~---~~s  174 (300)
T KOG2629         103 FVKSYVLPRFLGESKDKLEADKRQ----LDDQFDKAAKSLNALMDEVAQVSQLLATQQS-ELSRALASLKNTLV---QLS  174 (300)
T ss_pred             HHHHHHHHHHHCCCCHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH---HHH
T ss_conf             999988887607651167777888----8899999999999999999999999999999-99999999997777---753


Q ss_pred             CCHHHHHHHHHHHHHHHHH-CCCC
Q ss_conf             8989999999999999997-5999
Q gi|254781141|r   94 KDPAQRENFFKNLFRIGNI-LGFQ  116 (196)
Q Consensus        94 ~D~~k~~~~~~~i~~~~~~-~Gfs  116 (196)
                      +|-++.+.+.+.|-.-+.+ .-|+
T Consensus       175 ~~~~k~esei~~Ik~lvln~~~f~  198 (300)
T KOG2629         175 RNIEKLESEINTIKQLVLNMSNFA  198 (300)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCCC
T ss_conf             019888878888889985343357


No 39 
>PTZ00217 flap endonuclease-1; Provisional
Probab=33.02  E-value=36  Score=14.80  Aligned_cols=29  Identities=14%  Similarity=0.124  Sum_probs=21.8

Q ss_pred             CCCCHHHHHHHHHHHHHHHH-HCCCCHHHHHHHCC
Q ss_conf             65898999999999999999-75999899972059
Q gi|254781141|r   92 QTKDPAQRENFFKNLFRIGN-ILGFQKEEMVDITD  125 (196)
Q Consensus        92 e~~D~~k~~~~~~~i~~~~~-~~Gfs~eEi~~v~D  125 (196)
                      +|+.|     -...|++|+. +.|||++-|.+..+
T Consensus       299 ~w~~p-----d~e~l~~fL~~e~~Fse~Rv~~~~~  328 (394)
T PTZ00217        299 KWSEP-----DIEGLKKFLVEEKNFNEERVEKGIE  328 (394)
T ss_pred             CCCCC-----CHHHHHHHHHHCCCCCHHHHHHHHH
T ss_conf             08999-----9899999976466979899999999


No 40 
>PHA00542 putative Cro-like protein
Probab=31.59  E-value=38  Score=14.65  Aligned_cols=39  Identities=26%  Similarity=0.273  Sum_probs=28.5

Q ss_pred             HCCCCCC--HHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHH
Q ss_conf             5836589--89999999999999997599989997205999
Q gi|254781141|r   89 LFPQTKD--PAQRENFFKNLFRIGNILGFQKEEMVDITDHR  127 (196)
Q Consensus        89 ~iPe~~D--~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R  127 (196)
                      .+|+..+  |...++--.+|+..+...|||..+|..+++-+
T Consensus         4 ~~~~~~~~~~~~y~k~P~E~V~aLi~~G~tQ~qI~~~tgv~   44 (82)
T PHA00542          4 AMPATTPTIPAAYTQRPDELVCALIRAGWSQEQIADATDVS   44 (82)
T ss_pred             CCCCCCCCCHHHHHCCHHHHHHHHHHCCCCHHHHHHHHCCC
T ss_conf             66788997437653298999999998225699999860877


No 41 
>COG5656 SXM1 Importin, protein involved in nuclear import [Posttranslational modification, protein turnover, chaperones]
Probab=31.57  E-value=28  Score=15.52  Aligned_cols=38  Identities=18%  Similarity=0.152  Sum_probs=24.0

Q ss_pred             HHHHHHHHHHHHH-----CCCCCCCHHHHHHHHHHHHHHHHHH
Q ss_conf             9999999999732-----0689991899994389999999999
Q gi|254781141|r    9 AQSAEALAQLINH-----HIPDDSSPTLEKTYPETYHRIRYLR   46 (196)
Q Consensus         9 aq~~~~l~q~~~~-----~~p~~p~~~L~~~DP~~Y~~~k~~~   46 (196)
                      +-+.+.+.=++.+     .-+++-.++|++.||.+|.+....+
T Consensus       337 ~l~ePhlq~ii~~vIfPllc~see~eElfEnDp~eyirry~df  379 (970)
T COG5656         337 RLMEPHLQYIISGVIFPLLCLSEEEEELFENDPDEYIRRYYDF  379 (970)
T ss_pred             HHHCCHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHHCCH
T ss_conf             6634279999873024220787305877724989999974211


No 42 
>pfam10929 DUF2811 Protein of unknown function (DUF2811). This is a bacterial family of uncharacterized proteins.
Probab=31.15  E-value=39  Score=14.60  Aligned_cols=47  Identities=11%  Similarity=0.016  Sum_probs=37.5

Q ss_pred             HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHC
Q ss_conf             99999999999999583658989999999999999997599989997205
Q gi|254781141|r   75 LEKKLIAENQQLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDIT  124 (196)
Q Consensus        75 ~q~~l~~e~~~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~  124 (196)
                      .-+.|.+.....++..|.|---   .-+.+.|..|+...|.++-.+..+|
T Consensus         6 iPe~L~~am~~fie~hP~WDQy---Rl~~aALa~FL~QnG~~~r~vtr~Y   52 (57)
T pfam10929         6 IPEALQQAMKDFIESHPNWDQY---RLIQAALAGFLLQNGCQDRAVTRLY   52 (57)
T ss_pred             CCHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHHHCCCCCHHHHHHH
T ss_conf             6299999999999839994099---9999999999998488622889998


No 43 
>cd02876 GH18_SI-CLP Stabilin-1 interacting chitinase-like protein (SI-CLP) is a eukaryotic chitinase-like protein of unknown function that interacts with the endocytic/sorting transmembrane receptor stabilin-1 and is secreted from the lysosome.  SI-CLP has a glycosyl hydrolase family 18 (GH18) domain but lacks a chitin-binding domain. The catalytic amino acids of the GH18 domain are not conserved in SI-CLP, similar to the chitolectins YKL-39, YKL-40, and YM1/2.  Human SI-CLP is sorted to late endosomes and secretory lysosomes in alternatively activated macrophages.
Probab=30.99  E-value=39  Score=14.58  Aligned_cols=28  Identities=21%  Similarity=0.247  Sum_probs=23.3

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHH
Q ss_conf             5898999999999999999759998999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEM  120 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi  120 (196)
                      .+||+.++++..+|+++.+.+||.-=.|
T Consensus        87 ls~~~~R~~~i~~iv~~~~~~gfDGidi  114 (318)
T cd02876          87 LNDEQEREKLIKLLVTTAKKNHFDGIVL  114 (318)
T ss_pred             HCCHHHHHHHHHHHHHHHHHHCCCCEEE
T ss_conf             6399999999999999999819985888


No 44 
>KOG4472 consensus
Probab=30.79  E-value=33  Score=15.03  Aligned_cols=30  Identities=17%  Similarity=0.251  Sum_probs=25.6

Q ss_pred             HCCCCCCHHHHHHHHHHHHHHHHHCCCCHH
Q ss_conf             583658989999999999999997599989
Q gi|254781141|r   89 LFPQTKDPAQRENFFKNLFRIGNILGFQKE  118 (196)
Q Consensus        89 ~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~e  118 (196)
                      -+|+|-|..++.....++.+-+.-||.|+-
T Consensus       151 ~~P~~Id~~~~~e~~~~~~~~~i~Yg~s~S  180 (399)
T KOG4472         151 NFPEWIDEDKAAESLDDMADEGILYGGSES  180 (399)
T ss_pred             CCCCCCCHHHHHHHHHHHHHCCCCCCCCHH
T ss_conf             796101569999999987643954368589


No 45 
>COG5020 KTR1 Mannosyltransferase [Carbohydrate transport and metabolism]
Probab=30.79  E-value=33  Score=15.03  Aligned_cols=30  Identities=17%  Similarity=0.251  Sum_probs=25.6

Q ss_pred             HCCCCCCHHHHHHHHHHHHHHHHHCCCCHH
Q ss_conf             583658989999999999999997599989
Q gi|254781141|r   89 LFPQTKDPAQRENFFKNLFRIGNILGFQKE  118 (196)
Q Consensus        89 ~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~e  118 (196)
                      -+|+|-|..++.....++.+-+.-||.|+-
T Consensus       151 ~~P~~Id~~~~~e~~~~~~~~~i~Yg~s~S  180 (399)
T COG5020         151 NFPEWIDEDKAAESLDDMADEGILYGGSES  180 (399)
T ss_pred             CCCCCCCHHHHHHHHHHHHHCCCCCCCCHH
T ss_conf             796101569999999987643954368589


No 46 
>pfam00816 Histone_HNS H-NS histone family.
Probab=30.59  E-value=39  Score=14.54  Aligned_cols=21  Identities=14%  Similarity=0.267  Sum_probs=17.2

Q ss_pred             HHHHHHHHHHHHHCCCCHHHH
Q ss_conf             999999999999759998999
Q gi|254781141|r  100 ENFFKNLFRIGNILGFQKEEM  120 (196)
Q Consensus       100 ~~~~~~i~~~~~~~Gfs~eEi  120 (196)
                      ..-...|...+..+|||.+||
T Consensus        23 ~~a~~~i~~~~~~~glt~~el   43 (91)
T pfam00816        23 AAALAEIRELAAEYGLTLEEL   43 (91)
T ss_pred             HHHHHHHHHHHHHHCCCHHHH
T ss_conf             999999999999909999998


No 47 
>KOG1992 consensus
Probab=30.19  E-value=24  Score=16.00  Aligned_cols=113  Identities=18%  Similarity=0.150  Sum_probs=53.4

Q ss_pred             HHHHHHHHHHHH-CCC----CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_conf             999999999732-068----999189999438999999999999999999999999999999999999999999999999
Q gi|254781141|r   10 QSAEALAQLINH-HIP----DDSSPTLEKTYPETYHRIRYLRQKALNIINHFVETGRNPDNIAKELDSEILEKKLIAENQ   84 (196)
Q Consensus        10 q~~~~l~q~~~~-~~p----~~p~~~L~~~DP~~Y~~~k~~~~~~~~~~~~~~~e~q~~~q~q~~~q~~~~q~~l~~e~~   84 (196)
                      +.-+.++++.+. -.|    .+-+++|+|.||++|+|..-+=-+---+++....-...+...-    +...-..+..+.+
T Consensus       331 ~~~~vl~~i~e~VvlpN~~lR~eDeElFED~pleYiRRDlEGsDvdTRRR~a~dlvrgL~~~f----e~~vt~v~~~~v~  406 (960)
T KOG1992         331 EGENVLAQICEKVVLPNLILREEDEELFEDNPLEYIRRDLEGSDVDTRRRAAIDLVRGLCKNF----EGQVTGVFSSEVQ  406 (960)
T ss_pred             CCHHHHHHHHHHHCCCCCCCCHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH----CCHHHHHHHHHHH
T ss_conf             354899999986532510010536877356889999875346775325779999999999870----2244778999999


Q ss_pred             HHHHH---C--CCCCCHHHHHHHHHHHHHHHHH--CCCCH-HHHHHHCCH
Q ss_conf             99995---8--3658989999999999999997--59998-999720599
Q gi|254781141|r   85 QLEHL---F--PQTKDPAQRENFFKNLFRIGNI--LGFQK-EEMVDITDH  126 (196)
Q Consensus        85 ~L~e~---i--Pe~~D~~k~~~~~~~i~~~~~~--~Gfs~-eEi~~v~D~  126 (196)
                      .|+..   -  -+|++...+=-+...+.--|.+  -|.|. .++.++.|.
T Consensus       407 ~~l~~y~~nPS~nWk~kd~aiyL~talaik~~t~~~Gvtstn~lvdv~~F  456 (960)
T KOG1992         407 RLLDQYSKNPSGNWKKKDRAIYLVTALAIKGQTAKHGVTSTNELVDVVDF  456 (960)
T ss_pred             HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCHHHCCEEECCCCCCHHHH
T ss_conf             89887323887453301014335678886032210451212323458899


No 48 
>pfam11198 DUF2857 Protein of unknown function (DUF2857). This is a bacterial family of uncharacterized proteins.
Probab=30.14  E-value=40  Score=14.49  Aligned_cols=30  Identities=13%  Similarity=0.017  Sum_probs=15.5

Q ss_pred             CCCCCCCCCCCCHHHHHHHHHHHHHHCCHH
Q ss_conf             478877876321489999999997619999
Q gi|254781141|r  159 NITTNKAKNHRRITSQEKAIQKLYQTGSLY  188 (196)
Q Consensus       159 ~~~~~~~~~~~~~~~~~ka~~rL~ksG~v~  188 (196)
                      .+|-+..+.+......=..++.+.+.+.+.
T Consensus       116 ~~GR~~~~~ee~~~~lW~~W~~~~~~~~~~  145 (181)
T pfam11198       116 RKGRPPALDEEQEAALWRAWQQLRSKPDVE  145 (181)
T ss_pred             CCCCCCCCCHHHHHHHHHHHHHHHHHCCCC
T ss_conf             999999998688999999999989864888


No 49 
>pfam08069 Ribosomal_S13_N Ribosomal S13/S15 N-terminal domain. This domain is found at the N-terminus of ribosomal S13 and S15 proteins. This domain is also identified as NUC021.
Probab=28.50  E-value=43  Score=14.31  Aligned_cols=33  Identities=12%  Similarity=0.123  Sum_probs=23.4

Q ss_pred             HCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHH
Q ss_conf             58365898999999999999999759998999720
Q gi|254781141|r   89 LFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDI  123 (196)
Q Consensus        89 ~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v  123 (196)
                      .-|+|.+- ..+.....|.++++ .|+|+-+|+.+
T Consensus        21 ~~P~W~~~-~~~eve~~I~klak-kG~~pSqIG~i   53 (60)
T pfam08069        21 TPPEWLKY-SPEEVEELIVKLAK-KGLTPSQIGVI   53 (60)
T ss_pred             CCCCCCCC-CHHHHHHHHHHHHH-CCCCHHHHHHH
T ss_conf             89865568-99999999999998-79988897356


No 50 
>pfam01371 Trp_repressor Trp repressor protein. This protein binds to tryptophan and represses transcription of the Trp operon.
Probab=28.37  E-value=43  Score=14.29  Aligned_cols=56  Identities=11%  Similarity=0.061  Sum_probs=41.9

Q ss_pred             HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHH--HHHHHHHHHHHHHHH
Q ss_conf             9999999583658989999999999999997599989997205999999--999999999997430
Q gi|254781141|r   82 ENQQLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLS--LAYYAQIGLQSQKLS  145 (196)
Q Consensus        82 e~~~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~--vl~~Am~y~~~~~~k  145 (196)
                      +-+.|.++|=...+++-...|..+|        +|+.|+.++..-=.|.  +|+..+-|+++....
T Consensus         2 ~~d~l~~ail~l~~~ee~~~f~~dl--------~T~~E~~ala~R~~va~~LL~~~~syreI~~~~   59 (88)
T pfam01371         2 EWDALFEALLKLENQEECYAFLTDL--------LTPDEREALAQRLRIAKELLRGELSQREIAQEL   59 (88)
T ss_pred             HHHHHHHHHHHCCCHHHHHHHHHHH--------CCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH
T ss_conf             5999999999749999999999983--------999999999999999999997899899999985


No 51 
>COG3722 MtlR Transcriptional regulator [Transcription]
Probab=28.15  E-value=43  Score=14.27  Aligned_cols=35  Identities=20%  Similarity=0.317  Sum_probs=21.6

Q ss_pred             HHCCC--CCCHHHHHHHHHHHHHHHHHCCCCHHHHHH
Q ss_conf             95836--589899999999999999975999899972
Q gi|254781141|r   88 HLFPQ--TKDPAQRENFFKNLFRIGNILGFQKEEMVD  122 (196)
Q Consensus        88 e~iPe--~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~  122 (196)
                      -.||.  |.|-+---+.+..+-+-+.+|||+|++|-+
T Consensus        80 Gvi~~~~y~Diehl~~lre~Ln~d~~ey~FtDd~ild  116 (174)
T COG3722          80 GVIDRWEYEDIEHLMALREELNHDGEEYGFTDDEILD  116 (174)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH
T ss_conf             5661888862899999999965665214766589886


No 52 
>PRK05672 dnaE2 error-prone DNA polymerase; Validated
Probab=27.82  E-value=44  Score=14.23  Aligned_cols=60  Identities=13%  Similarity=-0.059  Sum_probs=38.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             999999999999999999999995836589899999999999999975999899972059
Q gi|254781141|r   66 IAKELDSEILEKKLIAENQQLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus        66 ~q~~~q~~~~q~~l~~e~~~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                      +|-.+-.+.+.-+--.+...|..+|=-+++++.-.+++..+++-+...||+++....|.|
T Consensus       666 EQvM~ia~~~aGfslgeAD~lRramgsKK~~~~m~~~~~~f~~g~~~~g~~~~~a~~i~~  725 (1050)
T PRK05672        666 EQVMQIAIDAAGFTPGEADQLRRAMAAWRRVGRLERLRERLVDGMLARGYTGEFAERIFE  725 (1050)
T ss_pred             HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHH
T ss_conf             999999999749998999999999875437222889999998679875999999999999


No 53 
>TIGR03253 oxalate_frc formyl-CoA transferase. This enzyme, formyl-CoA transferase, transfers coenzyme A from formyl-CoA to oxalate. It forms a pathway, together with oxalyl-CoA decarboxylase, for oxalate degradation; decarboxylation by the latter gene regenerates formyl-CoA. The two enzymes typically are encoded by a two-gene operon.
Probab=27.38  E-value=20  Score=16.47  Aligned_cols=32  Identities=19%  Similarity=0.295  Sum_probs=11.7

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHH--CCCCHHHHHH
Q ss_conf             3658989999999999999997--5999899972
Q gi|254781141|r   91 PQTKDPAQRENFFKNLFRIGNI--LGFQKEEMVD  122 (196)
Q Consensus        91 Pe~~D~~k~~~~~~~i~~~~~~--~Gfs~eEi~~  122 (196)
                      |.|.++..+.....+|......  ..+|-+|+..
T Consensus       292 ~rf~~~~~R~~~~~el~~~i~~~~~~~t~~e~~~  325 (415)
T TIGR03253       292 PAYATPEARQPKLNDIFAFIETYTATKDKFEVTE  325 (415)
T ss_pred             HHHCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHH
T ss_conf             0015768888989999999999984203999999


No 54 
>KOG0139 consensus
Probab=27.22  E-value=45  Score=14.16  Aligned_cols=85  Identities=8%  Similarity=-0.029  Sum_probs=56.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHCCCCCCHHHHHHHHHHHHHHHH
Q ss_conf             389999999999999999999999999999999999999999999999999---99958365898999999999999999
Q gi|254781141|r   35 YPETYHRIRYLRQKALNIINHFVETGRNPDNIAKELDSEILEKKLIAENQQ---LEHLFPQTKDPAQRENFFKNLFRIGN  111 (196)
Q Consensus        35 DP~~Y~~~k~~~~~~~~~~~~~~~e~q~~~q~q~~~q~~~~q~~l~~e~~~---L~e~iPe~~D~~k~~~~~~~i~~~~~  111 (196)
                      --|.|.++|..|-.+.-++|-++.+...+..+.++..--.+..-.-.++-.   ....|--+.-++++...-....+++-
T Consensus       291 ~tI~Y~q~R~~FGk~l~d~Q~iQhqiA~~~teiEaaRlL~ynaAr~k~~G~pf~keAAMAKl~ase~A~~~t~qCiq~lG  370 (398)
T KOG0139         291 ETIPYAQERLQFGKRLLDFQGLQHQIADMATEIEAARLLVYNAARMKDQGLPFVKEAAMAKLYASEVATKTTHQCIQWLG  370 (398)
T ss_pred             HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_conf             63078999988450466678899999999999999999999999998759964789999987421565778999999970


Q ss_pred             HCCCCHHH
Q ss_conf             75999899
Q gi|254781141|r  112 ILGFQKEE  119 (196)
Q Consensus       112 ~~Gfs~eE  119 (196)
                      ..||+.|-
T Consensus       371 G~Gyt~d~  378 (398)
T KOG0139         371 GVGYTKDF  378 (398)
T ss_pred             CCCCCCCC
T ss_conf             34723346


No 55 
>PHA02512 minor structural protein; Provisional
Probab=26.93  E-value=45  Score=14.13  Aligned_cols=48  Identities=15%  Similarity=0.174  Sum_probs=29.9

Q ss_pred             HHHHHHHHHHHHHHHHHCCCCCCHHHHH----HHHHHHHHHHHHCCCCHHHHHHH
Q ss_conf             9999999999999999583658989999----99999999999759998999720
Q gi|254781141|r   73 EILEKKLIAENQQLEHLFPQTKDPAQRE----NFFKNLFRIGNILGFQKEEMVDI  123 (196)
Q Consensus        73 ~~~q~~l~~e~~~L~e~iPe~~D~~k~~----~~~~~i~~~~~~~Gfs~eEi~~v  123 (196)
                      -+++..|....+.   +---|.|+.-..    .-...|.+|+..-|+|+|+|..-
T Consensus       147 gqfqatl~~N~k~---Aa~~ygDn~ay~~~~~q~fqqieeyg~ahGws~EQI~ak  198 (844)
T PHA02512        147 GQFQATLANNGKL---AAAAYGDNAAYVLANKQTFQQIEDYGAAHGWSHEQIQAK  198 (844)
T ss_pred             HHHHHHHHCCHHH---HHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
T ss_conf             1268888522666---788737819999989999999998614228988999999


No 56 
>PRK05398 formyl-coenzyme A transferase; Provisional
Probab=26.54  E-value=22  Score=16.22  Aligned_cols=33  Identities=27%  Similarity=0.277  Sum_probs=15.0

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHH--CCCCHHHHHHH
Q ss_conf             3658989999999999999997--59998999720
Q gi|254781141|r   91 PQTKDPAQRENFFKNLFRIGNI--LGFQKEEMVDI  123 (196)
Q Consensus        91 Pe~~D~~k~~~~~~~i~~~~~~--~Gfs~eEi~~v  123 (196)
                      |.|.++..+.....+|......  ...|-+|+-.+
T Consensus       293 ~rf~t~~~R~~~~~el~~~i~~~~~~~t~~e~~~~  327 (416)
T PRK05398        293 PAYATPEARQPHIFDIFAEIEKWTATKTKFEAVDI  327 (416)
T ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH
T ss_conf             42158899999799999999999825999999987


No 57 
>pfam06401 Alpha-2-MRAP_C Alpha-2-macroglobulin RAP, C-terminal domain. The alpha-2-macroglobulin receptor-associated protein (RAP) is a intracellular glycoprotein that binds to the 2-macroglobulin receptor and other members of the low density lipoprotein receptor family. The protein inhibits binding of all currently known ligands of these receptors. Two different studies have provided conflicted domain boundaries.
Probab=25.35  E-value=48  Score=13.94  Aligned_cols=29  Identities=14%  Similarity=0.437  Sum_probs=22.8

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHC
Q ss_conf             3658989999999999999997599989997205
Q gi|254781141|r   91 PQTKDPAQRENFFKNLFRIGNILGFQKEEMVDIT  124 (196)
Q Consensus        91 Pe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~  124 (196)
                      -||.+|.+     .+|=..|.+-+||++|+.++-
T Consensus       106 ~eF~ePrV-----q~LW~~A~~~nFs~~ELeSlk  134 (205)
T pfam06401       106 SEFNEPRV-----QGLWKLAQNSNFTEKELESLK  134 (205)
T ss_pred             CCCCCHHH-----HHHHHHHHHCCCCHHHHHHHH
T ss_conf             46456689-----999999987699989999999


No 58 
>cd06548 GH18_chitinase The GH18 (glycosyl hydrolases, family 18) type II chitinases hydrolyze chitin, an abundant polymer of N-acetylglucosamine and have been identified in bacteria, fungi, insects, plants, viruses, and protozoan parasites.  The structure of this domain is an eight-stranded alpha/beta barrel with a pronounced active-site cleft at the C-terminal end of the beta-barrel.
Probab=25.03  E-value=49  Score=13.90  Aligned_cols=27  Identities=19%  Similarity=0.279  Sum_probs=24.3

Q ss_pred             CCHHHHHHHHHHHHHHHHHCCCCHHHH
Q ss_conf             898999999999999999759998999
Q gi|254781141|r   94 KDPAQRENFFKNLFRIGNILGFQKEEM  120 (196)
Q Consensus        94 ~D~~k~~~~~~~i~~~~~~~Gfs~eEi  120 (196)
                      +|++.++.|..++++|...+||.-=+|
T Consensus       105 ~~~~~R~~Fi~s~i~f~~~~~fDGiDi  131 (322)
T cd06548         105 ATEASRAKFADSAVDFIRKYGFDGIDI  131 (322)
T ss_pred             CCHHHHHHHHHHHHHHHHHCCCCCEEE
T ss_conf             599999999999999999729981455


No 59 
>pfam01471 PG_binding_1 Putative peptidoglycan binding domain. This domain is composed of three alpha helices. This domain is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. This family is found N-terminal to the catalytic domain of matrixins. The domain is found to bind peptidoglycan experimentally.
Probab=24.90  E-value=49  Score=13.89  Aligned_cols=36  Identities=17%  Similarity=-0.004  Sum_probs=26.0

Q ss_pred             HHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHH
Q ss_conf             999999999975999899972059999999999999
Q gi|254781141|r  102 FFKNLFRIGNILGFQKEEMVDITDHRLLSLAYYAQI  137 (196)
Q Consensus       102 ~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~Am~  137 (196)
                      ....|-.++..+||....+++++|......+..--.
T Consensus         4 ~V~~lQ~~L~~~Gy~~~~~dg~~~~~t~~Av~~fQ~   39 (57)
T pfam01471         4 DVKELQRYLKRLGYYPGPVDGVFGPSTEAAVKAFQR   39 (57)
T ss_pred             HHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHH
T ss_conf             999999999997799999999688899999999999


No 60 
>PRK03525 crotonobetainyl-CoA:carnitine CoA-transferase; Provisional
Probab=24.21  E-value=20  Score=16.56  Aligned_cols=11  Identities=36%  Similarity=0.428  Sum_probs=5.5

Q ss_pred             HHHHHHHHCCH
Q ss_conf             99999761999
Q gi|254781141|r  177 AIQKLYQTGSL  187 (196)
Q Consensus       177 a~~rL~ksG~v  187 (196)
                      .+.+|+.+|=|
T Consensus       391 eI~~L~~~GVI  401 (405)
T PRK03525        391 DIQELVSKGLA  401 (405)
T ss_pred             HHHHHHHCCCE
T ss_conf             99999988296


No 61 
>cd02875 GH18_chitobiase Chitobiase (also known as di-N-acetylchitobiase) is a lysosomal glycosidase that hydrolyzes the reducing-end N-acetylglucosamine from the chitobiose core of oligosaccharides during the ordered degradation of asparagine-linked glycoproteins in eukaryotes. Chitobiase can only do so if the asparagine that joins the oligosaccharide to protein is previously removed by a glycosylasparaginase. Chitobiase is therefore the final step in the lysosomal degradation of the protein/carbohydrate linkage component of asparagine-linked glycoproteins. The catalytic domain of chitobiase is an eight-stranded alpha/beta barrel fold similar to that of other family 18 glycosyl hydrolases such as hevamine and chitotriosidase.
Probab=24.06  E-value=51  Score=13.78  Aligned_cols=28  Identities=7%  Similarity=0.196  Sum_probs=21.1

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHH
Q ss_conf             5898999999999999999759998999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEM  120 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi  120 (196)
                      ..||+.|..|..+.++++..+||.-=+|
T Consensus        91 v~np~~R~~FI~s~V~flrk~~FDGlDl  118 (358)
T cd02875          91 ISNPTYRTQWIQQKVELAKSQFMDGINI  118 (358)
T ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCCEEE
T ss_conf             3898999999999999999819987556


No 62 
>pfam04587 ADP_PFK_GK ADP-specific Phosphofructokinase/Glucokinase conserved region. In archaea a novel type of glycolytic pathway exists that is deviant from the classical Embden-Meyerhof pathway. This pathway utilizes two novel proteins: an ADP-dependent Glucokinase and an ADP-dependent Phosphofructokinase. This conserved region is present at the C-terminal of both these proteins. Interestingly this family contains sequences from higher eukaryotes..
Probab=23.55  E-value=49  Score=13.90  Aligned_cols=33  Identities=15%  Similarity=0.424  Sum_probs=28.7

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHC
Q ss_conf             658989999999999999997599989997205
Q gi|254781141|r   92 QTKDPAQRENFFKNLFRIGNILGFQKEEMVDIT  124 (196)
Q Consensus        92 e~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~  124 (196)
                      -+.|++.+......|..+..+.|..++||.++.
T Consensus       259 S~~d~~l~~~i~~~ilp~vDSlGmNEqELa~l~  291 (444)
T pfam04587       259 SIQDRELRKEIVHNVLPMVDSVGMNEQELANLL  291 (444)
T ss_pred             CCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHH
T ss_conf             426099999999971420563777989999999


No 63 
>pfam02375 JmjN jmjN domain.
Probab=23.42  E-value=53  Score=13.70  Aligned_cols=20  Identities=25%  Similarity=0.579  Sum_probs=8.6

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHCC
Q ss_conf             65898999999999999999759
Q gi|254781141|r   92 QTKDPAQRENFFKNLFRIGNILG  114 (196)
Q Consensus        92 e~~D~~k~~~~~~~i~~~~~~~G  114 (196)
                      ||+||   -.....|+.+|..+|
T Consensus         9 EF~Dp---~~Yi~~i~~~g~~~G   28 (34)
T pfam02375         9 EFKDP---IKYIEKIRPLGEKYG   28 (34)
T ss_pred             HHHCH---HHHHHHHHHHHHHCC
T ss_conf             99689---999999999898859


No 64 
>pfam09388 SpoOE-like Spo0E like sporulation regulatory protein. Spore formation is an extreme response to starvation and can also be a component of disease transmission. Sporulation is controlled by an expanded two-component system where starvation signals result in sensor kinase activation and phosphorylation of the master sporulation response regulator Spo0A. Phosphatases such as Spo0E dephosphorylate Spo0A thereby inhibiting sporulation. This is a family of Spo0E-like phosphatases. The structure of a Bacillus anthracis member of this family has revealed an anti-parallel alpha-helical structure.
Probab=23.30  E-value=53  Score=13.68  Aligned_cols=25  Identities=20%  Similarity=0.469  Sum_probs=19.4

Q ss_pred             HHHHHHHHHHHHHCCCCHHHHHHHC
Q ss_conf             9999999999997599989997205
Q gi|254781141|r  100 ENFFKNLFRIGNILGFQKEEMVDIT  124 (196)
Q Consensus       100 ~~~~~~i~~~~~~~Gfs~eEi~~v~  124 (196)
                      +..+..|.+.+..+||++.++-..+
T Consensus         7 e~kR~el~~l~~~~gl~~~~vi~~S   31 (45)
T pfam09388         7 EKKRKELIELAEKYGLTDEKVIELS   31 (45)
T ss_pred             HHHHHHHHHHHHHCCCCCHHHHHHH
T ss_conf             9999999999998099948999998


No 65 
>cd06545 GH18_3CO4_chitinase The Bacteroides thetaiotaomicron protein represented by pdb structure 3CO4 is an uncharacterized bacterial member of the family 18 glycosyl hydrolases with homologs found in Flavobacterium, Stigmatella, and Pseudomonas.
Probab=23.08  E-value=53  Score=13.65  Aligned_cols=28  Identities=14%  Similarity=0.294  Sum_probs=23.6

Q ss_pred             CCCHHHHHHHHHHHHHHHHHCCCCHHHH
Q ss_conf             5898999999999999999759998999
Q gi|254781141|r   93 TKDPAQRENFFKNLFRIGNILGFQKEEM  120 (196)
Q Consensus        93 ~~D~~k~~~~~~~i~~~~~~~Gfs~eEi  120 (196)
                      .+||+.+.+|..++.++...+||.-=+|
T Consensus        78 ~~~~~~R~~Fi~~~~~~~~~~~~DGvDi  105 (253)
T cd06545          78 LNDPAKRKALVDKIINYVVSYNLDGIDV  105 (253)
T ss_pred             HCCHHHHHHHHHHHHHHHHHCCCCCEEE
T ss_conf             4599999999999999999809972898


No 66 
>PRK13795 hypothetical protein; Provisional
Probab=22.95  E-value=54  Score=13.64  Aligned_cols=37  Identities=24%  Similarity=0.354  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHH
Q ss_conf             999999999583658989999999999999997599989997
Q gi|254781141|r   80 IAENQQLEHLFPQTKDPAQRENFFKNLFRIGNILGFQKEEMV  121 (196)
Q Consensus        80 ~~e~~~L~e~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~  121 (196)
                      ..+...+.+.+||.-+     .+..-|..|+...|++++.|.
T Consensus       423 ~~e~~~~k~~~Pe~~~-----kw~~~L~~~a~~~g~~~e~v~  459 (630)
T PRK13795        423 LAELERLKELHPELYE-----KWNAFLERWAKKMGLSEEWVK  459 (630)
T ss_pred             HHHHHHHHHHCHHHHH-----HHHHHHHHHHHHCCCCHHHHH
T ss_conf             5689999866879999-----999999999998299989974


No 67 
>pfam07862 Nif11 Nitrogen fixation protein of unknown function. This domain is found in the Cyanobacteria, and may be involved in nitrogen fixation, but no role has been assigned.
Probab=22.70  E-value=51  Score=13.78  Aligned_cols=23  Identities=26%  Similarity=0.559  Sum_probs=16.5

Q ss_pred             CCHHHHHHHH-----HHHHHHHHHCCCC
Q ss_conf             8989999999-----9999999975999
Q gi|254781141|r   94 KDPAQRENFF-----KNLFRIGNILGFQ  116 (196)
Q Consensus        94 ~D~~k~~~~~-----~~i~~~~~~~Gfs  116 (196)
                      .||..+.++.     .+++.++...||+
T Consensus        15 ~d~~l~~~l~~~~~~~~vi~lA~~~Gf~   42 (49)
T pfam07862        15 SDPSLREQLKACSDAEEVVALAKEAGFE   42 (49)
T ss_pred             CCHHHHHHHHHCCCHHHHHHHHHHCCCC
T ss_conf             3999999998759999999999995998


No 68 
>TIGR02990 ectoine_eutA ectoine utilization protein EutA; InterPro: IPR014332   Members of this protein family are EutA, a predicted arylmalonate decarboxylase found in a conserved ectoine utilization operon of species that include include Mesorhizobium loti, Silicibacter pomeroyi and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine. It is missing from two other species with the other ectoine transport and utilization genes: Pseudomonas putida and Agrobacterium tumefaciens. ,.
Probab=22.64  E-value=50  Score=13.84  Aligned_cols=24  Identities=33%  Similarity=0.511  Sum_probs=21.5

Q ss_pred             CCCCHH-HHHHHCCHHHHHHHHHHH
Q ss_conf             599989-997205999999999999
Q gi|254781141|r  113 LGFQKE-EMVDITDHRLLSLAYYAQ  136 (196)
Q Consensus       113 ~Gfs~e-Ei~~v~D~R~i~vl~~Am  136 (196)
                      +||+|| ||+.|+....|.+++.+.
T Consensus       154 lGl~DDREMARIs~~~iv~~A~~~~  178 (239)
T TIGR02990       154 LGLTDDREMARISPDAIVELAKEAA  178 (239)
T ss_pred             CCCCCCHHHHCCCHHHHHHHHHHHC
T ss_conf             5754452432389889999999850


No 69 
>TIGR01245 trpD anthranilate phosphoribosyltransferase; InterPro: IPR005940    In many widely different species, including Escherichia coli, Thermotoga maritima, and Archaeoglobus fulgidus, this enzyme is a C-terminal part of a multifunctional protein together with glutamine amidotransferase; the fusion protein is designated anthranilate synthase component II (4.1.3.27 from EC).; GO: 0004048 anthranilate phosphoribosyltransferase activity, 0000162 tryptophan biosynthetic process.
Probab=22.05  E-value=56  Score=13.52  Aligned_cols=40  Identities=18%  Similarity=0.155  Sum_probs=30.1

Q ss_pred             HHHHHHHHHHC--CCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHC
Q ss_conf             99999999958--3658989999999999999997599989997205
Q gi|254781141|r   80 IAENQQLEHLF--PQTKDPAQRENFFKNLFRIGNILGFQKEEMVDIT  124 (196)
Q Consensus        80 ~~e~~~L~e~i--Pe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~  124 (196)
                      .+|...|.+.|  ++..|-     ....|.--++--|.|++||.+.+
T Consensus        12 ~~ea~~lm~~~~~g~~~~~-----~~aA~L~al~~KGet~~Ei~GfA   53 (331)
T TIGR01245        12 EDEAEQLMKEIMSGEASPA-----QIAAILAALRIKGETPEEIAGFA   53 (331)
T ss_pred             HHHHHHHHHHHHCCCCCHH-----HHHHHHHHHHHCCCCHHHHHHHH
T ss_conf             8999999999847898888-----89999999975188999999999


No 70 
>KOG0031 consensus
Probab=22.05  E-value=42  Score=14.34  Aligned_cols=43  Identities=16%  Similarity=0.167  Sum_probs=24.4

Q ss_pred             HHHHHCCCCCCHHHHHHHHHHHHHHHHHCC--CCHHHHHHHCCHH
Q ss_conf             999958365898999999999999999759--9989997205999
Q gi|254781141|r   85 QLEHLFPQTKDPAQRENFFKNLFRIGNILG--FQKEEMVDITDHR  127 (196)
Q Consensus        85 ~L~e~iPe~~D~~k~~~~~~~i~~~~~~~G--fs~eEi~~v~D~R  127 (196)
                      .++.++-.|-+..++.--...+++.+.+.|  |+++||..++..-
T Consensus       102 ~I~~AF~~FD~~~~G~I~~d~lre~Ltt~gDr~~~eEV~~m~r~~  146 (171)
T KOG0031         102 VILNAFKTFDDEGSGKIDEDYLRELLTTMGDRFTDEEVDEMYREA  146 (171)
T ss_pred             HHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHC
T ss_conf             999999841855788657899999998734438999999999858


No 71 
>cd01938 ADPGK_ADPPFK ADP-dependent glucokinase (ADPGK) and phosphofructokinase (ADPPFK). ADPGK and ADPPFK are proteins that rely on ADP rather than ATP to donate a phosphoryl group.  They are found in certain hyperthermophilic archaea and in higher eukaryotes.  A functional ADPGK has been characterized in mouse and is assumed to be desirable during ischemia/hypoxia.  ADPGK and ADPPFK contain a large and a small domain with the binding site located in a groove between the domains. Partial domain closing is seen when ADP is bound, and further domain closing is observed when glucose is also bound.  The oligomerization state apparently varies depending on the species, with some existing as monomers, some as dimers, and some as tetramers.
Probab=21.95  E-value=56  Score=13.51  Aligned_cols=34  Identities=21%  Similarity=0.413  Sum_probs=30.1

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             6589899999999999999975999899972059
Q gi|254781141|r   92 QTKDPAQRENFFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus        92 e~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                      -+.|++.+..+...|.-+..+.|..++||.+++.
T Consensus       253 S~~d~~l~~~i~~~ilp~vdSlGmNEqELa~l~~  286 (445)
T cd01938         253 STVDEELREEILHEVVPYVDSLGLNEQELANLLQ  286 (445)
T ss_pred             CCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH
T ss_conf             4576999999998725324657788899999999


No 72 
>TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321   These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport.
Probab=21.68  E-value=14  Score=17.52  Aligned_cols=14  Identities=14%  Similarity=0.124  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHHHHH
Q ss_conf             89999438999999
Q gi|254781141|r   29 PTLEKTYPETYHRI   42 (196)
Q Consensus        29 ~~L~~~DP~~Y~~~   42 (196)
                      .-|=-||||||+-.
T Consensus       158 HIiTIEDPIEyvh~  171 (350)
T TIGR01420       158 HIITIEDPIEYVHK  171 (350)
T ss_pred             CCEEEECCEEEEEC
T ss_conf             82563177314104


No 73 
>PRK07668 hypothetical protein; Validated
Probab=21.60  E-value=57  Score=13.46  Aligned_cols=34  Identities=12%  Similarity=0.167  Sum_probs=27.7

Q ss_pred             HHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHH
Q ss_conf             9999999999999997599989997205999999
Q gi|254781141|r   97 AQRENFFKNLFRIGNILGFQKEEMVDITDHRLLS  130 (196)
Q Consensus        97 ~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~  130 (196)
                      +.++++-.+++-|+...|-+++||+++-|---+.
T Consensus         4 KE~~~FL~~~r~yL~~~Gkke~dI~~~~eele~H   37 (248)
T PRK07668          4 KEGRKFLDDTRVYLIAKGIKEEDIESFLEDAELH   37 (248)
T ss_pred             HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
T ss_conf             4789999999999999189788999999999999


No 74 
>pfam10932 DUF2783 Protein of unknown function (DUF2783). This is a bacterial family of uncharacterized protein.
Probab=21.50  E-value=57  Score=13.45  Aligned_cols=40  Identities=25%  Similarity=0.457  Sum_probs=29.1

Q ss_pred             HCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHH
Q ss_conf             5836589899999999999999975999899972059999999999
Q gi|254781141|r   89 LFPQTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLSLAYY  134 (196)
Q Consensus        89 ~iPe~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~vl~~  134 (196)
                      .-|.+.||   ..|+..+.+.  ..|.|+++ +...++|.|++|-.
T Consensus         3 t~pnl~~p---D~fY~~Li~a--H~gLs~~q-S~~lNArLiLlLaN   42 (60)
T pfam10932         3 TTPNLADP---DDFYEALIEA--HRGLSDEQ-SAALNARLILLLAN   42 (60)
T ss_pred             CCCCCCCC---HHHHHHHHHH--HCCCCHHH-HHHHHHHHHHHHHH
T ss_conf             88899980---2999999998--62899999-98872899999998


No 75 
>TIGR01926 peroxid_rel uncharacterized peroxidase-related enzyme; InterPro: IPR010195   Members of this family are conserved hypothetical proteins of around 200 amino acids in length. Many of them contain an akylhydroperoxidase (AhpD) domain. .
Probab=21.46  E-value=55  Score=13.54  Aligned_cols=21  Identities=24%  Similarity=0.366  Sum_probs=12.4

Q ss_pred             HHHHCCCCHHHHHHHCCHHHH
Q ss_conf             999759998999720599999
Q gi|254781141|r  109 IGNILGFQKEEMVDITDHRLL  129 (196)
Q Consensus       109 ~~~~~Gfs~eEi~~v~D~R~i  129 (196)
                      -+...||||.+|-.+...-+|
T Consensus       141 ~Lr~~GfsD~~IlD~~~~~Af  161 (179)
T TIGR01926       141 ALRAAGFSDEEILDLIHVVAF  161 (179)
T ss_pred             HHHHCCCCHHHHHHHHHHHHH
T ss_conf             887548995899999999999


No 76 
>KOG1546 consensus
Probab=20.39  E-value=46  Score=14.07  Aligned_cols=37  Identities=16%  Similarity=0.150  Sum_probs=25.1

Q ss_pred             HHHHHHHHHH-CCCCHHHHHHHCCHHHHH-------HHHHHHHHH
Q ss_conf             9999999997-599989997205999999-------999999999
Q gi|254781141|r  103 FKNLFRIGNI-LGFQKEEMVDITDHRLLS-------LAYYAQIGL  139 (196)
Q Consensus       103 ~~~i~~~~~~-~Gfs~eEi~~v~D~R~i~-------vl~~Am~y~  139 (196)
                      -..|+..+.+ |||+++.|--++|--.=.       =+++||.|-
T Consensus        87 v~~M~~~Lv~rfGFs~ddI~~LtDt~~s~~~~PT~~Nir~Al~wL  131 (362)
T KOG1546          87 VHRMRKLLVERFGFSEDDILMLTDTDESPVRIPTGKNIRRALRWL  131 (362)
T ss_pred             HHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCCHHHHHHHHHHH
T ss_conf             999999999861898334688744777642367277899999999


No 77 
>pfam00531 Death Death domain.
Probab=20.20  E-value=61  Score=13.27  Aligned_cols=26  Identities=23%  Similarity=0.331  Sum_probs=20.6

Q ss_pred             HHHHHHHHHHHHHCCCCHHHHHHHCC
Q ss_conf             99999999999975999899972059
Q gi|254781141|r  100 ENFFKNLFRIGNILGFQKEEMVDITD  125 (196)
Q Consensus       100 ~~~~~~i~~~~~~~Gfs~eEi~~v~D  125 (196)
                      .....+...+|..+|||+.+|..+..
T Consensus         9 ~~~g~~W~~la~~Lgl~~~~i~~i~~   34 (83)
T pfam00531         9 ELLGKDWRELARKLGLSESEIDEIEQ   34 (83)
T ss_pred             HCCHHHHHHHHHHHCCCHHHHHHHHH
T ss_conf             32404199999995998999999998


No 78 
>TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275   Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC).   Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain.   Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains .    This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process.
Probab=20.16  E-value=61  Score=13.27  Aligned_cols=46  Identities=17%  Similarity=0.257  Sum_probs=30.2

Q ss_pred             HHHHHHHHHHHHH--HCC---CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHH
Q ss_conf             9999999999999--583---658989999999999999997599989997205999999
Q gi|254781141|r   76 EKKLIAENQQLEH--LFP---QTKDPAQRENFFKNLFRIGNILGFQKEEMVDITDHRLLS  130 (196)
Q Consensus        76 q~~l~~e~~~L~e--~iP---e~~D~~k~~~~~~~i~~~~~~~Gfs~eEi~~v~D~R~i~  130 (196)
                      ...+.-+.+....  .++   ||.         .++..-++.+||||..|+.+..-..-.
T Consensus       468 ~~iV~~e~~l~~~~n~~~~I~~~~---------~e~L~~aK~lGFSD~~iA~~~~~~~~~  518 (1089)
T TIGR01369       468 KNIVELEKELEELKNKLTGIEELD---------KELLRKAKKLGFSDAQIARLVNVTEAE  518 (1089)
T ss_pred             HHHHHHHHHHHHHHCCCCCCCCCC---------HHHHHHHHHCCCCHHHHHHHHCCCHHH
T ss_conf             999999999875301244533489---------889999864188878999986897444


Done!