Query         gi|254780800|ref|YP_003065213.1| hypothetical protein CLIBASIA_03455 [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 192
No_of_seqs    115 out of 360
Neff          3.3 
Searched_HMMs 39220
Date          Sun May 29 19:03:20 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i 254780800.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG3820 Uncharacterized protei 100.0       0       0  627.3   7.8  189    1-191     1-190 (230)
  2 pfam06242 DUF1013 Protein of u 100.0       0       0  505.1  11.2  140   35-174     1-140 (141)
  3 TIGR00938 thrB_alt homoserine   71.0     2.3   6E-05   24.1   1.6  108   19-135     5-130 (312)
  4 smart00347 HTH_MARR helix_turn  60.5     9.8 0.00025   20.4   3.1   47  107-153    14-63  (101)
  5 pfam01047 MarR MarR family. Th  56.2      13 0.00032   19.7   3.1   34  107-140     7-40  (59)
  6 PRK09954 hypothetical protein;  44.8      28 0.00072   17.6   3.4   12   57-68    170-181 (362)
  7 pfam02787 CPSase_L_D3 Carbamoy  44.2      15 0.00038   19.3   1.9   24  117-140    71-94  (122)
  8 smart00760 Bac_DnaA_C Bacteria  43.4      15 0.00038   19.3   1.8   22  107-128    35-56  (60)
  9 COG4977 Transcriptional regula  42.5      18 0.00046   18.7   2.1   18  118-135   237-254 (328)
 10 TIGR01369 CPSaseII_lrg carbamo  41.6      18 0.00046   18.7   2.0   38  103-141   489-532 (1089)
 11 cd05131 RasGAP_IQGAP2 IQGAP2 i  40.3      22 0.00056   18.2   2.3   48  102-159   142-197 (339)
 12 PRK12520 RNA polymerase sigma   37.6      31  0.0008   17.3   2.7   33  108-140   138-170 (191)
 13 pfam11762 Arabinose_Iso_C L-ar  37.3      16  0.0004   19.1   1.1   23   16-38     91-113 (115)
 14 KOG0237 consensus               37.0      15 0.00038   19.2   1.0   44   15-58     46-102 (788)
 15 pfam03376 Adeno_E3B Adenovirus  36.2      16  0.0004   19.1   1.0   18  111-128    72-89  (90)
 16 cd00015 ALBUMIN Albumin domain  35.5      29 0.00073   17.5   2.2   28  107-134   138-165 (185)
 17 pfam08299 Bac_DnaA_C domain.    35.0      24 0.00061   18.0   1.8   29  107-135    35-64  (70)
 18 PRK03573 transcriptional regul  34.8      43  0.0011   16.4   3.1   33  108-140    37-69  (144)
 19 cd05133 RasGAP_IQGAP1 IQGAP1 i  33.4      34 0.00086   17.1   2.3   47  103-159   143-197 (360)
 20 PRK10681 DNA-binding transcrip  31.1      37 0.00095   16.8   2.2   20   14-33     15-34  (252)
 21 TIGR02739 TraF type-F conjugat  30.7      31 0.00078   17.3   1.7   23  150-172   240-262 (270)
 22 pfam05917 DUF874 Helicobacter   30.6      25 0.00063   17.9   1.2   14   98-111    89-102 (417)
 23 PRK03975 tfx putative transcri  30.2      38 0.00096   16.8   2.1   55  117-171    19-81  (139)
 24 cd06171 Sigma70_r4 Sigma70, re  29.7      52  0.0013   15.9   2.8   34  107-140    16-49  (55)
 25 cd01388 SOX-TCF_HMG-box SOX-TC  29.1      31 0.00079   17.3   1.5   20  111-130    20-39  (72)
 26 pfam08281 Sigma70_r4_2 Sigma-7  29.0      54  0.0014   15.8   2.8   32  107-138    16-47  (54)
 27 pfam11199 DUF2891 Protein of u  28.9      27 0.00069   17.7   1.2   45   97-141    95-140 (324)
 28 TIGR02814 pfaD_fam PfaD family  27.9      22 0.00057   18.2   0.6   18   63-80    361-378 (449)
 29 COG1476 Predicted transcriptio  26.1      64  0.0016   15.4   3.0   31  112-142     9-39  (68)
 30 PRK13501 transcriptional activ  26.1      45  0.0011   16.3   1.9   40   96-137   172-212 (290)
 31 KOG4414 consensus               26.1      29 0.00074   17.5   0.9   30    5-39      2-31  (197)
 32 KOG2530 consensus               25.8      25 0.00065   17.8   0.6   22   12-33    258-279 (483)
 33 pfam04545 Sigma70_r4 Sigma-70,  25.2      53  0.0014   15.9   2.1   30  111-140    14-43  (50)
 34 PRK13500 transcriptional activ  24.7      52  0.0013   15.9   2.0   30  107-136   211-241 (312)
 35 pfam04645 DUF603 Protein of un  24.5      45  0.0012   16.3   1.7   57  116-175    18-74  (181)
 36 PRK05231 homoserine kinase; Pr  24.4      34 0.00086   17.1   1.0  125   19-158     5-144 (319)
 37 TIGR01362 KDO8P_synth 3-deoxy-  23.8      39   0.001   16.7   1.3   18  158-175   103-120 (279)
 38 KOG0289 consensus               22.8      38 0.00098   16.7   1.0   17   50-66     35-51  (506)
 39 KOG3832 consensus               22.7      34 0.00086   17.1   0.7   26    5-30     31-58  (319)
 40 smart00342 HTH_ARAC helix_turn  22.7      51  0.0013   16.0   1.7   22   12-33     42-63  (84)
 41 PRK03902 manganese transport t  22.4      75  0.0019   15.0   2.8   56  101-156     6-62  (142)
 42 TIGR02983 SigE-fam_strep RNA p  22.4      46  0.0012   16.3   1.4   27  111-137   123-149 (165)
 43 TIGR02643 T_phosphoryl thymidi  22.4      31  0.0008   17.3   0.5   25   17-48     12-36  (440)
 44 cd00090 HTH_ARSR Arsenical Res  22.3      76  0.0019   14.9   3.2   30  107-137    11-40  (78)
 45 PRK10371 DNA-binding transcrip  22.3      63  0.0016   15.4   2.1   71  107-181   196-291 (302)
 46 PRK12544 RNA polymerase sigma   22.2      76  0.0019   14.9   2.8   36  106-141   154-189 (207)
 47 pfam00685 Sulfotransfer_1 Sulf  22.0      36 0.00091   16.9   0.8   12    7-18      9-20  (254)
 48 COG4989 Predicted oxidoreducta  21.8      39 0.00099   16.7   0.9   22  106-132   245-266 (298)
 49 pfam07750 GcrA GcrA cell cycle  21.7      36 0.00091   16.9   0.7   13   19-31     17-29  (162)
 50 PRK02251 putative septation in  21.3      66  0.0017   15.3   2.0   33   90-124     7-57  (84)
 51 pfam00325 Crp Bacterial regula  21.3      48  0.0012   16.2   1.3   16   21-36      3-18  (32)
 52 smart00103 ALBUMIN serum album  20.8      75  0.0019   14.9   2.2   29  107-135   145-173 (187)
 53 pfam03339 Pox_L3_FP4 Poxvirus   20.7      34 0.00086   17.1   0.4   42   96-137   204-247 (319)
 54 pfam00242 DNA_pol_viral_N DNA   20.5      73  0.0019   15.1   2.1   42   55-100    84-132 (379)
 55 PRK11512 DNA-binding transcrip  20.3      83  0.0021   14.7   3.3   46  108-153    45-93  (144)
 56 PRK05294 carB carbamoyl phosph  20.2      71  0.0018   15.1   2.0   25  116-140   490-514 (1063)
 57 cd06405 PB1_Mekk2_3 The PB1 do  20.0      60  0.0015   15.5   1.6   22  154-175    46-67  (79)

No 1  
>COG3820 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=100.00  E-value=0  Score=627.32  Aligned_cols=189  Identities=67%  Similarity=1.033  Sum_probs=176.9

Q ss_pred             CCCCCCCCHHHHHHHHHHCCCCHHHHHHHHCCCHHHEEEECCHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEE-CC
Q ss_conf             98868861345430242043479999967087120511112414638961147401278489999997616487704-03
Q gi|254780800|r    1 MCQRPLMPKASAVWLIDNTSLSFDQIAEFCGLHLLEVMAIADGESLQGIKGFNLISSGQLSAEEIALGEKDKNYKLK-IS   79 (192)
Q Consensus         1 M~~~pLMPkATAvWLvdNT~LtF~QIa~FC~lH~LEVq~IADGeva~gi~g~dPi~~gqLt~eEI~~ce~d~~~~L~-l~   79 (192)
                      |+++|||||||||||||||+|||+|||+|||||+|||+||||||||+||+|+|||.+||||||||+|||+||+++|+ +.
T Consensus         1 ~~~~~lmpkatavwlvdNTsLsF~QIA~FCglHplEvk~iADGE~aq~IkGldPI~~GQLtreEi~rae~dP~~~lk~l~   80 (230)
T COG3820           1 MATQLLMPKATAVWLVDNTSLSFDQIADFCGLHPLEVKGIADGEVAQGIKGLDPIANGQLTREEIARAEKDPNYRLKILS   80 (230)
T ss_pred             CCCCCCCCCCEEEEEEECCCCCHHHHHHHHCCCCCEEEEECCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHCC
T ss_conf             97444566643799871663259999988476801133102614541455788413575169999865249324444305


Q ss_pred             CCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHCCC
Q ss_conf             66433442243556534203311571799999984245584243466438758999996288877012683773322564
Q gi|254780800|r   80 KPKSYILESTKKKKRYTPVSKRQDRPNAILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNRTHWNTANLVPMDPVTLGLC  159 (192)
Q Consensus        80 ~~~~~~~~~~~Kg~kYTPlSkRqDkPdAI~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni~P~DPV~LGLC  159 (192)
                      .+++.++++++|||||||+||||||||||+||++|||||+|+||+|||||||.||++||+|||||++||+|+|||+||||
T Consensus        81 ~p~~~~~~~kkk~prYTPvsrRQDrP~AI~Wl~rnhPelkDaQIskliGTTK~TIeqiR~RtHWNsani~p~DPVtLGlc  160 (230)
T COG3820          81 SPKVELPESKKKGPRYTPVSRRQDRPDAILWLLRNHPELKDAQISKLIGTTKSTIEQIRTRTHWNSANITPMDPVTLGLC  160 (230)
T ss_pred             CCCCCCCCCCCCCCCCCCCHHCCCCCCCEEHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCEEEEEH
T ss_conf             75111661100388767602204785211047424864017899999732188899997631233146761796114200


Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
Q ss_conf             89999999999730121245545554205012
Q gi|254780800|r  160 AQIDLDREIEKCSKNTEAKYNKAVDDISTPFS  191 (192)
Q Consensus       160 sQ~dLd~av~KA~~~~~~~~~~~~~~~s~p~~  191 (192)
                      ||+|||..|++|++...  .....+....|++
T Consensus       161 sQidLd~~Veras~~r~--~~~e~~~TL~pAs  190 (230)
T COG3820         161 SQIDLDKEVERASRGRP--TAKELGPTLLPAS  190 (230)
T ss_pred             HHHHHHHHHHHHCCCCC--CCCCCCCCCCCHH
T ss_conf             01005889987447999--9788887426535


No 2  
>pfam06242 DUF1013 Protein of unknown function (DUF1013). Family of uncharacterized proteins found in Proteobacteria.
Probab=100.00  E-value=0  Score=505.06  Aligned_cols=140  Identities=63%  Similarity=0.984  Sum_probs=137.8

Q ss_pred             HHEEEECCHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHH
Q ss_conf             05111124146389611474012784899999976164877040366433442243556534203311571799999984
Q gi|254780800|r   35 LEVMAIADGESLQGIKGFNLISSGQLSAEEIALGEKDKNYKLKISKPKSYILESTKKKKRYTPVSKRQDRPNAILWLIRN  114 (192)
Q Consensus        35 LEVq~IADGeva~gi~g~dPi~~gqLt~eEI~~ce~d~~~~L~l~~~~~~~~~~~~Kg~kYTPlSkRqDkPdAI~WLikn  114 (192)
                      ||||||||||||+||+|||||++||||+|||+|||+||+++|+|.+++++.+..++||+|||||||||||||||+|||||
T Consensus         1 LEVqgIADGeVa~gI~G~dPI~~gqLT~eEI~rce~d~~~~L~i~~~~~~~~~~k~kg~rYtPlSkRqDrPdAI~WLikn   80 (141)
T pfam06242         1 LEVQGIADGEVAQGIKGLDPIANGQLTREEIERCEKDPNYRLKLSKSKVIVPRRKRKGPRYTPVSKRQDRPDAIAWLLRN   80 (141)
T ss_pred             CCEEEECCHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCHHHHHHHHC
T ss_conf             93014536677668867797003783899999884287668332456442551001456767526514671999999971


Q ss_pred             CCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHH
Q ss_conf             245584243466438758999996288877012683773322564899999999997301
Q gi|254780800|r  115 YPRLKDAQISHLVGTTGSTIEQIRNRTHWNTANLVPMDPVTLGLCAQIDLDREIEKCSKN  174 (192)
Q Consensus       115 hPeL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni~P~DPV~LGLCsQ~dLd~av~KA~~~  174 (192)
                      ||||+|+||+|||||||+||++||||||||++||+|+|||+||||||+|||++|+||.++
T Consensus        81 hPeL~D~QI~kLiGTTK~TI~~IR~rthWN~~Ni~p~dPV~LGLCsQ~dLd~~v~ka~~~  140 (141)
T pfam06242        81 HPELSDAQISKLIGTTKTTIEAIRDRSHWNSANIKPRDPVTLGLCSQIDLDAAVEKAAKR  140 (141)
T ss_pred             CCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHHCC
T ss_conf             670248788887477578899986357444457786998235665799999999998725


No 3  
>TIGR00938 thrB_alt homoserine kinase; InterPro: IPR005280    Homoserine kinase is required in the biosynthesis of threonine from aspartate.The member of this family from Pseudomonas aeruginosa was shown by direct assay and complementation to act specifically as a homoserine kinase. .
Probab=71.00  E-value=2.3  Score=24.15  Aligned_cols=108  Identities=19%  Similarity=0.249  Sum_probs=61.5

Q ss_pred             CCCCHHHHHHHHCCC-----HHHEEEECCHHHHCCCCCCCCCCCC-----CCCHHHHHH-HHHCCCCCEE----CCCCCC
Q ss_conf             434799999670871-----2051111241463896114740127-----848999999-7616487704----036643
Q gi|254780800|r   19 TSLSFDQIAEFCGLH-----LLEVMAIADGESLQGIKGFNLISSG-----QLSAEEIAL-GEKDKNYKLK----ISKPKS   83 (192)
Q Consensus        19 T~LtF~QIa~FC~lH-----~LEVq~IADGeva~gi~g~dPi~~g-----qLt~eEI~~-ce~d~~~~L~----l~~~~~   83 (192)
                      |+.||+|+..|-.+-     .|.+||||.|     |.--|=..+-     -||-=|-.. .|.=|.- |.    |+...+
T Consensus         5 T~vsD~e~~~FL~GYPDlG~ll~lkGIA~G-----v~NSNYl~tT~~GrYiLTlyE~~~~~EeLPfF-L~L~~hLA~~g~   78 (312)
T TIGR00938         5 TSVSDEEMEAFLEGYPDLGELLSLKGIAEG-----VENSNYLLTTDKGRYILTLYEKRVKAEELPFF-LELLTHLAERGL   78 (312)
T ss_pred             ECCCHHHHHHHHCCCCCCCCHHHCCCCCCC-----CCEECCEEEECCCCEEEEEEEECCCCCCCCHH-HHHHHHHHHCCC
T ss_conf             024189999985337755531000132236-----21003156605873688765201686577108-899999986389


Q ss_pred             CCC--CCCCCCCCCCCHHHHCCCHHHHHHHHHHCC-CCCHHHHHHHHCCCHHHHH
Q ss_conf             344--224355653420331157179999998424-5584243466438758999
Q gi|254780800|r   84 YIL--ESTKKKKRYTPVSKRQDRPNAILWLIRNYP-RLKDAQISHLVGTTGSTIE  135 (192)
Q Consensus        84 ~~~--~~~~Kg~kYTPlSkRqDkPdAI~WLiknhP-eL~D~QI~KLiGTTK~TI~  135 (192)
                      +++  -+++-|..   +|.=.-||.||-=.++--. ..+-.+=|+-||--+--+.
T Consensus        79 p~a~p~~~~~G~~---~~~L~GkPAaLv~fL~G~s~~~P~~~~c~~~G~~lA~~h  130 (312)
T TIGR00938        79 PVAKPVKSRDGRA---LSSLAGKPAALVEFLKGLSVAKPTAEHCREVGEVLAWMH  130 (312)
T ss_pred             CCCCCCCCCCCCH---HHHCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH
T ss_conf             9887500025430---222067715666312688778880775367647887898


No 4  
>smart00347 HTH_MARR helix_turn_helix multiple antibiotic resistance protein.
Probab=60.54  E-value=9.8  Score=20.36  Aligned_cols=47  Identities=17%  Similarity=0.235  Sum_probs=34.2

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCC---CCCCCCCCCCCCC
Q ss_conf             9999998424558424346643875899999628---8877012683773
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNR---THWNTANLVPMDP  153 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdR---tHWN~~Ni~P~DP  153 (192)
                      .|++.|.++|.++-++|++..|.+++||..+=++   .-|=.-.-.|.|-
T Consensus        14 ~vL~~l~~~~~~t~~~la~~l~~~~~~vs~~v~~L~~~glv~r~~~~~D~   63 (101)
T smart00347       14 LVLRILYEEGPLSVSELAKRLGVSPSTVTRVLDRLEKKGLIRRLPSPEDR   63 (101)
T ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHCCCCEEEEECCCCC
T ss_conf             99999997699799999999896887999999999458938982189998


No 5  
>pfam01047 MarR MarR family. The Mar proteins are involved in the multiple antibiotic resistance, a non-specific resistance system. The expression of the mar operon is controlled by a repressor, MarR. A large number of compounds induce transcription of the mar operon. This is thought to be due to the compound binding to MarR, and the resulting complex stops MarR binding to the DNA. With the MarR repression lost, transcription of the operon proceeds. The structure of MarR is known and shows MarR as a dimer with each subunit containing a winged-helix DNA binding motif.
Probab=56.19  E-value=13  Score=19.66  Aligned_cols=34  Identities=21%  Similarity=0.356  Sum_probs=30.0

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             9999998424558424346643875899999628
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      .+++.|..||.++-++|++.++.+++||..+=+|
T Consensus         7 ~vL~~l~~~~~~s~~~la~~~~~~~~~vs~~i~~   40 (59)
T pfam01047         7 HILRILYEHGPLTVSELAEKLGVDRSTVTRVLDR   40 (59)
T ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHH
T ss_conf             9999999469929999999988586549999999


No 6  
>PRK09954 hypothetical protein; Provisional
Probab=44.84  E-value=28  Score=17.55  Aligned_cols=12  Identities=17%  Similarity=0.080  Sum_probs=6.3

Q ss_pred             CCCCCHHHHHHH
Q ss_conf             278489999997
Q gi|254780800|r   57 SGQLSAEEIALG   68 (192)
Q Consensus        57 ~gqLt~eEI~~c   68 (192)
                      ..++|-+.|+..
T Consensus       170 ~~~itp~~l~~~  181 (362)
T PRK09954        170 LQQLTPQLLNGS  181 (362)
T ss_pred             HHHCCHHHHHHH
T ss_conf             433699999999


No 7  
>pfam02787 CPSase_L_D3 Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain.
Probab=44.20  E-value=15  Score=19.26  Aligned_cols=24  Identities=33%  Similarity=0.619  Sum_probs=21.7

Q ss_pred             CCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             558424346643875899999628
Q gi|254780800|r  117 RLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       117 eL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      -.||.||++|.|.+-..|...|..
T Consensus        71 GFSD~~IA~l~~~~e~~Ir~~R~~   94 (122)
T pfam02787        71 GFSDAQIAKLLGVTEEEVRKLRKE   94 (122)
T ss_pred             CCCHHHHHHHHCCCHHHHHHHHHH
T ss_conf             999899998839869999999998


No 8  
>smart00760 Bac_DnaA_C Bacterial dnaA protein helix-turn-helix domain. Could be involved in DNA-binding.
Probab=43.37  E-value=15  Score=19.28  Aligned_cols=22  Identities=23%  Similarity=0.463  Sum_probs=15.4

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHC
Q ss_conf             9999998424558424346643
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVG  128 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiG  128 (192)
                      ...||++.+..++-.+|++..|
T Consensus        35 iamyL~r~~~~~s~~~IG~~fg   56 (60)
T smart00760       35 IAMYLARELTDLSLPEIGKIFG   56 (60)
T ss_pred             HHHHHHHHHHCCCHHHHHHHHC
T ss_conf             9999999986799999999977


No 9  
>COG4977 Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription]
Probab=42.54  E-value=18  Score=18.71  Aligned_cols=18  Identities=17%  Similarity=0.298  Sum_probs=8.2

Q ss_pred             CCHHHHHHHHCCCHHHHH
Q ss_conf             584243466438758999
Q gi|254780800|r  118 LKDAQISHLVGTTGSTIE  135 (192)
Q Consensus       118 L~D~QI~KLiGTTK~TI~  135 (192)
                      |+-.++++.+|-.--+.+
T Consensus       237 lsl~~LA~~~~~S~R~le  254 (328)
T COG4977         237 LSLEELADRAGLSRRQLE  254 (328)
T ss_pred             CCHHHHHHHHCCCHHHHH
T ss_conf             689999999687887899


No 10 
>TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275   Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC).   Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain.   Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains .    This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process.
Probab=41.61  E-value=18  Score=18.74  Aligned_cols=38  Identities=24%  Similarity=0.404  Sum_probs=28.4

Q ss_pred             CCHHHHHHHHHHCCCCCHHHHHHHHCCCHHH------HHHHHCCC
Q ss_conf             5717999999842455842434664387589------99996288
Q gi|254780800|r  103 DRPNAILWLIRNYPRLKDAQISHLVGTTGST------IEQIRNRT  141 (192)
Q Consensus       103 DkPdAI~WLiknhPeL~D~QI~KLiGTTK~T------I~sIRdRt  141 (192)
                      |=|...+=-.|.+ --||.||++++|+|-.+      |.++|...
T Consensus       489 ~~~~e~L~~aK~l-GFSD~~iA~~~~~~~~~~~~~~~vR~~R~~~  532 (1089)
T TIGR01369       489 ELDKELLRKAKKL-GFSDAQIARLVNVTEAESIEELEVRKLRKEL  532 (1089)
T ss_pred             CCCHHHHHHHHHC-CCCHHHHHHHHCCCHHHCCCHHHHHHHHHHC
T ss_conf             4898899998641-8887899998689744420068999999846


No 11 
>cd05131 RasGAP_IQGAP2 IQGAP2 is a member of the IQGAP family that contains a calponin-homology (CH) domain which binds F-actin, IQGAP-specific repeat, a single WW domain, four IQ motifs which mediate interactions with calmodulin, and a Ras-GTPase-activating protein (GAP)-related domain that binds Rho family GTPases. IQGAP2 and IQGAP3 play important roles in the regulation of the cytoskeleton for axon outgrowth in hippocampal neurons and are thought to stay in a common regulatory pathway. The results of RNA interference studies indicated that IQGAP3 partially compensates functions of IQGAP2, but has lesser ability than IQGAP2 to promote axon outgrowth in hippocampal neuron. Morevover, IQGAP2 is required for the cadherin-mediated cell-to-cell adhesion in Xenopus laevis embryos.
Probab=40.33  E-value=22  Score=18.20  Aligned_cols=48  Identities=19%  Similarity=0.364  Sum_probs=32.8

Q ss_pred             CCCHHHHHH--------HHHHCCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHCCC
Q ss_conf             157179999--------9984245584243466438758999996288877012683773322564
Q gi|254780800|r  102 QDRPNAILW--------LIRNYPRLKDAQISHLVGTTGSTIEQIRNRTHWNTANLVPMDPVTLGLC  159 (192)
Q Consensus       102 qDkPdAI~W--------LiknhPeL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni~P~DPV~LGLC  159 (192)
                      -.=|.+|-|        +.+.||+-++.+|+++||.-      +=-| |-|-+-+.|..   .|++
T Consensus       142 ~~iPygiR~ick~i~~~l~~kFP~a~~~~il~vvG~~------~~~r-yinPaIvsPe~---f~ii  197 (339)
T cd05131         142 DLIPYGMRYIAKVLKNSLHEKFPDATEDELLKIVGNL------LYYR-YMNPAIVAPDG---FDII  197 (339)
T ss_pred             HHCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH------HHHH-HHHHHHCCCHH---CCCC
T ss_conf             7489999999999999999778999778899999999------9999-73016148411---2830


No 12 
>PRK12520 RNA polymerase sigma factor; Provisional
Probab=37.63  E-value=31  Score=17.27  Aligned_cols=33  Identities=15%  Similarity=0.312  Sum_probs=28.9

Q ss_pred             HHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             999998424558424346643875899999628
Q gi|254780800|r  108 ILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       108 I~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      -.++++.+-+|+-.+|+..+|.+-+||.+.--|
T Consensus       138 ~v~~Lr~~~~ls~~EIA~~l~~s~~tVk~~l~R  170 (191)
T PRK12520        138 RVFMMREWLELETEEICKELQITATNLWVMLYR  170 (191)
T ss_pred             HHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHH
T ss_conf             999999993899999999989499999999999


No 13 
>pfam11762 Arabinose_Iso_C L-arabinose isomerase C-terminal domain. This is a family of L-arabinose isomerases, AraA, EC:5.3.1.4. These enzymes catalyse the reaction: L-arabinose <= L-ribulose. This reaction is the first step in the pathway of L-arabinose utilisation as a carbon source after entering the cell L-arabinose is converted into L-ribulose by the L-arabinose isomerases enzyme. This is a C-terminal non catalytic domain.
Probab=37.35  E-value=16  Score=19.14  Aligned_cols=23  Identities=13%  Similarity=0.388  Sum_probs=18.7

Q ss_pred             HHHCCCCHHHHHHHHCCCHHHEE
Q ss_conf             42043479999967087120511
Q gi|254780800|r   16 IDNTSLSFDQIAEFCGLHLLEVM   38 (192)
Q Consensus        16 vdNT~LtF~QIa~FC~lH~LEVq   38 (192)
                      |=.+++|.+|+.+||.|.-+|.-
T Consensus        91 v~S~~~~~e~~~d~A~~~giE~v  113 (115)
T pfam11762        91 VFSQAVTTEQLEDFAEMAGIELV  113 (115)
T ss_pred             EEECCCCHHHHHHHHHHHCCEEE
T ss_conf             88836899999999998798898


No 14 
>KOG0237 consensus
Probab=37.04  E-value=15  Score=19.24  Aligned_cols=44  Identities=30%  Similarity=0.464  Sum_probs=35.4

Q ss_pred             HHHHCCCC---HHHHHHHHCCCHHH----------EEEECCHHHHCCCCCCCCCCCC
Q ss_conf             24204347---99999670871205----------1111241463896114740127
Q gi|254780800|r   15 LIDNTSLS---FDQIAEFCGLHLLE----------VMAIADGESLQGIKGFNLISSG   58 (192)
Q Consensus        15 LvdNT~Lt---F~QIa~FC~lH~LE----------Vq~IADGeva~gi~g~dPi~~g   58 (192)
                      -|+|+.++   |+++++||.-|-.+          |-||+|--...||.-|-|-.-+
T Consensus        46 ~~~~~dI~~~d~~ala~f~~e~~I~lVvvGPE~PL~~Gl~~~l~~~gi~~FGPs~~a  102 (788)
T KOG0237          46 KVPNLDISVADFEALASFCKEHNINLVVVGPELPLVAGLADVLRSAGIPCFGPSKQA  102 (788)
T ss_pred             CCCCCCCCHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEECCHHHH
T ss_conf             375565576559999999987462189978732654235666640585101743777


No 15 
>pfam03376 Adeno_E3B Adenovirus E3B protein.
Probab=36.24  E-value=16  Score=19.08  Aligned_cols=18  Identities=17%  Similarity=0.567  Sum_probs=15.9

Q ss_pred             HHHHCCCCCHHHHHHHHC
Q ss_conf             998424558424346643
Q gi|254780800|r  111 LIRNYPRLKDAQISHLVG  128 (192)
Q Consensus       111 LiknhPeL~D~QI~KLiG  128 (192)
                      -.|+|||-.|.||++|.+
T Consensus        72 Y~rHhPqYrn~~iA~LL~   89 (90)
T pfam03376        72 YLRHHPQYRNQNVARLLR   89 (90)
T ss_pred             HHHCCCCCCCHHHHHHHC
T ss_conf             886281004868999864


No 16 
>cd00015 ALBUMIN Albumin domain, contains five or six internal disulphide bonds; albuminoid superfamily includes alpha-fetoprotein which binds various cations, fatty acids and bilirubin; vitamin D-binding protein which binds to vitamin D, its metabolites, and fatty acids; alpha-albumin which binds water, cations (such as Ca2+, Na+ and K+), fatty acids, hormones, bilirubin and drugs; and afamin of which little is known; these belong to a multigene family with highly conserved intron/exon organization and encoded protein structures; evolutionary comparisons strongly support vitamin D-binding protein as the original gene in this group with subsequent local duplications generating the remaining genes in the cluster
Probab=35.50  E-value=29  Score=17.51  Aligned_cols=28  Identities=14%  Similarity=0.140  Sum_probs=22.0

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHH
Q ss_conf             9999998424558424346643875899
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTI  134 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI  134 (192)
                      -++.+.|.||++++.+|-++...-...+
T Consensus       138 flye~~rrhP~~~~~~ll~~a~~y~~~l  165 (185)
T cd00015         138 FLYEYSRRHPELSVPELLRLAKEYEELL  165 (185)
T ss_pred             HHHHHHHHCCCCCHHHHHHHHHHHHHHH
T ss_conf             9999997288888999999999999999


No 17 
>pfam08299 Bac_DnaA_C domain.
Probab=35.02  E-value=24  Score=17.99  Aligned_cols=29  Identities=21%  Similarity=0.429  Sum_probs=22.0

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHC-CCHHHHH
Q ss_conf             9999998424558424346643-8758999
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVG-TTGSTIE  135 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiG-TTK~TI~  135 (192)
                      ...||++.+-.++-.+|++..| -.-.||-
T Consensus        35 iamyL~r~~t~~s~~~IG~~fg~RdHsTVi   64 (70)
T pfam08299        35 IAMYLLRELTNLSLPEIGEIFGGRDHTTVL   64 (70)
T ss_pred             HHHHHHHHHHCCCHHHHHHHHCCCCCHHHH
T ss_conf             999999998679999999993899737999


No 18 
>PRK03573 transcriptional regulator SlyA; Provisional
Probab=34.77  E-value=43  Score=16.41  Aligned_cols=33  Identities=9%  Similarity=0.188  Sum_probs=26.3

Q ss_pred             HHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             999998424558424346643875899999628
Q gi|254780800|r  108 ILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       108 I~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      +.+|..+.|.++-.++++.+|..++|+..+=||
T Consensus        37 L~~L~~~~~g~tq~eLa~~l~v~~~t~tr~ld~   69 (144)
T PRK03573         37 LHNIHQLPPDQSQIQLAKAIGIEQPSLVRTLDQ   69 (144)
T ss_pred             HHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHH
T ss_conf             999997398989999999979878369999999


No 19 
>cd05133 RasGAP_IQGAP1 IQGAP1 is a homodimeric protein that is widely expressed among vertebrate cell types from early embryogenesis. Mammalian IQGAP1 protein is the best characterized member of the IQGAP family, and contains several protein-interacting domains. Human IQGAP1 is most similar to mouse Iqgap1 (94% identity) and has 62% identity to human IQGAP2. IQGAP1 binds and cross-links actin filaments in vitro and has been implicated in Ca2+/calmodulin signaling, E-cadherin-dependent cell adhesion, cell motility, and invasion. Yeast IQGAP homologues have a role in the recruitment of actin filaments, are components of the spindle pole body, and are required for actomyosin ring assembly and cytokinesis. Furthermore, IQGAP1 over-expression has also been detected in gastric and colorectal carcinomas and gastric cancer cell lines.
Probab=33.45  E-value=34  Score=17.09  Aligned_cols=47  Identities=15%  Similarity=0.315  Sum_probs=31.7

Q ss_pred             CCHHHHHHHHH--------HCCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHCCC
Q ss_conf             57179999998--------4245584243466438758999996288877012683773322564
Q gi|254780800|r  103 DRPNAILWLIR--------NYPRLKDAQISHLVGTTGSTIEQIRNRTHWNTANLVPMDPVTLGLC  159 (192)
Q Consensus       103 DkPdAI~WLik--------nhPeL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni~P~DPV~LGLC  159 (192)
                      .=|.+|-|++|        .||+-++.+|+++||.-=       =.-|-|-+-..|..   .|++
T Consensus       143 ~iPygiR~ick~l~~~l~~kFP~a~~~~il~vvG~f~-------~~ryinPaIvsPe~---f~iv  197 (360)
T cd05133         143 KIPYGMRFIAKVLKDSLHEKFPDAGEDELLKIVGNLL-------YYRYMNPAIVAPDA---FDII  197 (360)
T ss_pred             HCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH-------HHHHCCHHCCCCHH---CCCC
T ss_conf             5888999999999999997689997788999999999-------99860004068411---2811


No 20 
>PRK10681 DNA-binding transcriptional repressor DeoR; Provisional
Probab=31.09  E-value=37  Score=16.81  Aligned_cols=20  Identities=20%  Similarity=0.277  Sum_probs=11.7

Q ss_pred             HHHHHCCCCHHHHHHHHCCC
Q ss_conf             02420434799999670871
Q gi|254780800|r   14 WLIDNTSLSFDQIAEFCGLH   33 (192)
Q Consensus        14 WLvdNT~LtF~QIa~FC~lH   33 (192)
                      ||-.+-.++..++|+.++.-
T Consensus        15 ~L~~~g~v~v~eLa~~~~VS   34 (252)
T PRK10681         15 ALKRSDKLHLKDAAALLGVS   34 (252)
T ss_pred             HHHHCCCEEHHHHHHHHCCC
T ss_conf             99986979999999887979


No 21 
>TIGR02739 TraF type-F conjugative transfer system pilin assembly protein TraF; InterPro: IPR014110   This entry includes TraF; a protein that is part of a large group of proteins involved in conjugative transfer of plasmid DNA, specifically in the F-type system. TraF has been predicted to contain a thioredoxin fold and has been shown to be localized to the periplasm . Unlike the related protein TrbB (IPR014109 from INTERPRO), TraF does not contain a conserved pair of cysteines and has been shown not to function as a thiol disulphide isomerase by complementation of an Escherichia coli DsbA defect . The protein is believed to be involved in pilin assembly . Even more closely related than TrbB is a clade of genes (IPR014111 from INTERPRO), which do contain the CXXC motif, but it is unclear whether these are involved in type-F conjugation systems per se..
Probab=30.75  E-value=31  Score=17.35  Aligned_cols=23  Identities=26%  Similarity=0.430  Sum_probs=16.9

Q ss_pred             CCCCHHHCCCCHHHHHHHHHHHH
Q ss_conf             37733225648999999999973
Q gi|254780800|r  150 PMDPVTLGLCAQIDLDREIEKCS  172 (192)
Q Consensus       150 P~DPV~LGLCsQ~dLd~av~KA~  172 (192)
                      -..||.-||.||.||++-+-.+.
T Consensus       240 ~~~PlAYGf~Sqd~L~~ri~~V~  262 (270)
T TIGR02739       240 KMSPLAYGFISQDELKERILNVL  262 (270)
T ss_pred             CEEEEEHHHCCHHHHHHHHHHHC
T ss_conf             22010010204889999999853


No 22 
>pfam05917 DUF874 Helicobacter pylori protein of unknown function (DUF874). This family consists of several hypothetical proteins specific to Helicobacter pylori. The function of this family is unknown.
Probab=30.61  E-value=25  Score=17.90  Aligned_cols=14  Identities=36%  Similarity=0.681  Sum_probs=6.2

Q ss_pred             HHHHCCCHHHHHHH
Q ss_conf             03311571799999
Q gi|254780800|r   98 VSKRQDRPNAILWL  111 (192)
Q Consensus        98 lSkRqDkPdAI~WL  111 (192)
                      -||...+|+-|.|+
T Consensus        89 kskq~~~~NqinW~  102 (417)
T pfam05917        89 KSKQAEHENQINWF  102 (417)
T ss_pred             HHHHCCCCCCHHHH
T ss_conf             45420475511077


No 23 
>PRK03975 tfx putative transcriptional regulator; Provisional
Probab=30.23  E-value=38  Score=16.79  Aligned_cols=55  Identities=24%  Similarity=0.265  Sum_probs=35.9

Q ss_pred             CCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCC--------CCCCCHHHCCCCHHHHHHHHHHH
Q ss_conf             55842434664387589999962888770126--------83773322564899999999997
Q gi|254780800|r  117 RLKDAQISHLVGTTGSTIEQIRNRTHWNTANL--------VPMDPVTLGLCAQIDLDREIEKC  171 (192)
Q Consensus       117 eL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni--------~P~DPV~LGLCsQ~dLd~av~KA  171 (192)
                      -++-.+|++++||++-.|.+|-.|.-=|+..-        .-.-||..-+=.-.||.++-...
T Consensus        19 G~tQ~eIA~~lgTSraNvs~iEk~A~eNIekAreTl~~~~~l~Apv~i~i~~GtDl~~iP~~v   81 (139)
T PRK03975         19 GLTQQEIADILGTSRANISIIEKRARENIEKARNTLAFWNTLNAPVQVTIPAGTDLFEIPKII   81 (139)
T ss_pred             CCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHH
T ss_conf             897999999977328899999999999999999999999966898899967999288828999


No 24 
>cd06171 Sigma70_r4 Sigma70, region (SR) 4 refers to the most C-terminal of four conserved domains found in Escherichia coli (Ec) sigma70, the main housekeeping sigma, and related sigma-factors (SFs). A SF is a dissociable subunit of RNA polymerase, it directs bacterial or plastid core RNA polymerase to specific promoter elements located upstream of transcription initiation points. The SR4 of Ec sigma70 and other essential primary SFs contact promoter sequences located 35 base-pairs upstream of the initiation point, recognizing a 6-base-pair -35 consensus TTGACA.  Sigma70 related SFs also include SFs which are dispensable for bacterial cell growth for example Ec sigmaS, SFs which activate regulons in response to a specific signal for example heat-shock Ec sigmaH, and a group of SFs which includes the extracytoplasmic function (ECF) SFs and is typified by Ec sigmaE which contains SR2 and -4 only. ECF SFs direct the transcription of genes that regulate various responses including periplas
Probab=29.73  E-value=52  Score=15.92  Aligned_cols=34  Identities=26%  Similarity=0.309  Sum_probs=28.5

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             9999998424558424346643875899999628
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      .....++++-+++-.+|+...|.+-+||.+.-.|
T Consensus        16 r~vl~l~y~~~~s~~EIa~~lgis~~tVk~~l~r   49 (55)
T cd06171          16 REVILLRFGEGLSYEEIAEILGISRSTVRQRLHR   49 (55)
T ss_pred             HHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHH
T ss_conf             8989999980999999999989599999999999


No 25 
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=29.08  E-value=31  Score=17.32  Aligned_cols=20  Identities=25%  Similarity=0.642  Sum_probs=17.2

Q ss_pred             HHHHCCCCCHHHHHHHHCCC
Q ss_conf             99842455842434664387
Q gi|254780800|r  111 LIRNYPRLKDAQISHLVGTT  130 (192)
Q Consensus       111 LiknhPeL~D~QI~KLiGTT  130 (192)
                      +...+|+++.+.|+|++|..
T Consensus        20 ~~~~~P~~~~~eiSk~lg~~   39 (72)
T cd01388          20 VLQEYPLKENRAISKILGDR   39 (72)
T ss_pred             HHHHCCCCCHHHHHHHHHHH
T ss_conf             99888899889999999999


No 26 
>pfam08281 Sigma70_r4_2 Sigma-70, region 4. Region 4 of sigma-70 like sigma-factors are involved in binding to the -35 promoter element via a helix-turn-helix motif.
Probab=29.00  E-value=54  Score=15.83  Aligned_cols=32  Identities=25%  Similarity=0.386  Sum_probs=24.7

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHH
Q ss_conf             99999984245584243466438758999996
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTIEQIR  138 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI~sIR  138 (192)
                      ..+.+++++-+++-.+|++..|.+-+||.+--
T Consensus        16 r~~~~l~~~~~~s~~eIA~~lg~s~~tVk~~l   47 (54)
T pfam08281        16 REVFLLRYLEGLSYAEIAELLGISEGTVKSRL   47 (54)
T ss_pred             HHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHH
T ss_conf             98968799878599999999894999999999


No 27 
>pfam11199 DUF2891 Protein of unknown function (DUF2891). This is a bacterial family of uncharacterized proteins.
Probab=28.90  E-value=27  Score=17.65  Aligned_cols=45  Identities=18%  Similarity=0.296  Sum_probs=34.6

Q ss_pred             CHHHHCCCHHHHHHHHHHCCCCCHHH-HHHHHCCCHHHHHHHHCCC
Q ss_conf             20331157179999998424558424-3466438758999996288
Q gi|254780800|r   97 PVSKRQDRPNAILWLIRNYPRLKDAQ-ISHLVGTTGSTIEQIRNRT  141 (192)
Q Consensus        97 PlSkRqDkPdAI~WLiknhPeL~D~Q-I~KLiGTTK~TI~sIRdRt  141 (192)
                      |-++--+||.+.+||++-+-||.-.. -..+-++.++-.+.|.+|.
T Consensus        95 ~~~~~FERpYGwAWlL~L~~EL~~~~~a~~w~~~l~PL~~~i~~r~  140 (324)
T pfam11199        95 PGNASFERPYGWAWLLKLAAELRRWDQAARWAEALEPLADLIVERL  140 (324)
T ss_pred             CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             7668884865599999999999972888899986208999999999


No 28 
>TIGR02814 pfaD_fam PfaD family protein; InterPro: IPR014179   The protein PfaD is part of a four-gene locus, similar to polyketide biosynthesis systems, which is responsible for omega-3 polyunsaturated fatty acid biosynthesis in several high pressure and/or cold-adapted bacteria. Several other members of the entry are found in loci presumed to act in polyketide biosyntheses per se..
Probab=27.85  E-value=22  Score=18.15  Aligned_cols=18  Identities=28%  Similarity=0.353  Sum_probs=15.4

Q ss_pred             HHHHHHHHCCCCCEECCC
Q ss_conf             999997616487704036
Q gi|254780800|r   63 EEIALGEKDKNYKLKISK   80 (192)
Q Consensus        63 eEI~~ce~d~~~~L~l~~   80 (192)
                      +||+|.|.||++|+-|.+
T Consensus       361 a~i~rAe~dPKhkMALvF  378 (449)
T TIGR02814       361 AEIERAERDPKHKMALVF  378 (449)
T ss_pred             HHHHHHHHCCCHHHHHHH
T ss_conf             799998627760057898


No 29 
>COG1476 Predicted transcriptional regulators [Transcription]
Probab=26.08  E-value=64  Score=15.39  Aligned_cols=31  Identities=26%  Similarity=0.354  Sum_probs=26.7

Q ss_pred             HHHCCCCCHHHHHHHHCCCHHHHHHHHCCCC
Q ss_conf             9842455842434664387589999962888
Q gi|254780800|r  112 IRNYPRLKDAQISHLVGTTGSTIEQIRNRTH  142 (192)
Q Consensus       112 iknhPeL~D~QI~KLiGTTK~TI~sIRdRtH  142 (192)
                      +|+.--++-.+.++.+|-++-||.+|-+..|
T Consensus         9 ~R~~~~ltQ~elA~~vgVsRQTi~~iEkgky   39 (68)
T COG1476           9 LRAELGLTQEELAKLVGVSRQTIIAIEKGKY   39 (68)
T ss_pred             HHHHHCCCHHHHHHHCCCCHHHHHHHHCCCC
T ss_conf             9998285899999991957999999991798


No 30 
>PRK13501 transcriptional activator RhaR; Provisional
Probab=26.06  E-value=45  Score=16.33  Aligned_cols=40  Identities=15%  Similarity=0.150  Sum_probs=25.2

Q ss_pred             CCHHHHCCCHHHHHHHHHHCC-CCCHHHHHHHHCCCHHHHHHH
Q ss_conf             420331157179999998424-558424346643875899999
Q gi|254780800|r   96 TPVSKRQDRPNAILWLIRNYP-RLKDAQISHLVGTTGSTIEQI  137 (192)
Q Consensus        96 TPlSkRqDkPdAI~WLiknhP-eL~D~QI~KLiGTTK~TI~sI  137 (192)
                      .|-+.|-|+  .+.|+-.|+. .++-+++++..|.++..+..+
T Consensus       172 ~~~~~~~~~--i~~~i~~~~~~~~~l~~lA~~~~lS~~~lsrl  212 (290)
T PRK13501        172 LPDGEQLDL--IMSALQQSLGAYFDMADFCHKNQLVERSLKQL  212 (290)
T ss_pred             CCCHHHHHH--HHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHH
T ss_conf             884689999--99999997468999999999969899999999


No 31 
>KOG4414 consensus
Probab=26.06  E-value=29  Score=17.49  Aligned_cols=30  Identities=40%  Similarity=0.625  Sum_probs=22.3

Q ss_pred             CCCCHHHHHHHHHHCCCCHHHHHHHHCCCHHHEEE
Q ss_conf             88613454302420434799999670871205111
Q gi|254780800|r    5 PLMPKASAVWLIDNTSLSFDQIAEFCGLHLLEVMA   39 (192)
Q Consensus         5 pLMPkATAvWLvdNT~LtF~QIa~FC~lH~LEVq~   39 (192)
                      ||-|.|-|.     .+.||+.|+|-|.--+||+-|
T Consensus         2 ~la~m~Eal-----aA~SFdKi~D~Cdn~~Lea~~   31 (197)
T KOG4414           2 DLAPMAEAL-----AAFSFDKIADICDNLELEAAG   31 (197)
T ss_pred             CCHHHHHHH-----HHHHHHHHHHHHHHHHHCCCC
T ss_conf             842888988-----864288999886135540557


No 32 
>KOG2530 consensus
Probab=25.80  E-value=25  Score=17.82  Aligned_cols=22  Identities=36%  Similarity=0.635  Sum_probs=14.6

Q ss_pred             HHHHHHHCCCCHHHHHHHHCCC
Q ss_conf             4302420434799999670871
Q gi|254780800|r   12 AVWLIDNTSLSFDQIAEFCGLH   33 (192)
Q Consensus        12 AvWLvdNT~LtF~QIa~FC~lH   33 (192)
                      --|++=|++|+|-|..+||.|-
T Consensus       258 ~~~~~lN~als~~qLs~~~~l~  279 (483)
T KOG2530         258 NKWLKLNKALSLSQLSQECSLY  279 (483)
T ss_pred             HHHHHHHHHHHHHHHHHHCCEE
T ss_conf             7788888777788775300234


No 33 
>pfam04545 Sigma70_r4 Sigma-70, region 4. Region 4 of sigma-70 like sigma-factors are involved in binding to the -35 promoter element via a helix-turn-helix motif. Due to the way Pfam works, the threshold has been set artificially high to prevent overlaps with other helix-turn-helix families. Therefore there are many false negatives.
Probab=25.20  E-value=53  Score=15.89  Aligned_cols=30  Identities=23%  Similarity=0.329  Sum_probs=25.1

Q ss_pred             HHHHCCCCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             998424558424346643875899999628
Q gi|254780800|r  111 LIRNYPRLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       111 LiknhPeL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      .++++-+++-.+|++..|.+..+|.++-.|
T Consensus        14 ~l~y~~~~t~~EIA~~lgis~~~V~~~~~r   43 (50)
T pfam04545        14 VLRFGEGLTLEEIGERLGISRERVRQIEKR   43 (50)
T ss_pred             HHHCCCCCHHHHHHHHHCCCHHHHHHHHHH
T ss_conf             987068824999999989799999999999


No 34 
>PRK13500 transcriptional activator RhaR; Provisional
Probab=24.70  E-value=52  Score=15.94  Aligned_cols=30  Identities=10%  Similarity=0.075  Sum_probs=19.2

Q ss_pred             HHHHHHHHCCC-CCHHHHHHHHCCCHHHHHH
Q ss_conf             99999984245-5842434664387589999
Q gi|254780800|r  107 AILWLIRNYPR-LKDAQISHLVGTTGSTIEQ  136 (192)
Q Consensus       107 AI~WLiknhPe-L~D~QI~KLiGTTK~TI~s  136 (192)
                      .|.||-.|+-| ++-.++|...|-...+...
T Consensus       211 ll~~l~~~~~ep~~l~~lA~~~~lS~r~L~R  241 (312)
T PRK13500        211 LITRLAASLKSPFALDKFCDEASCSERVLRQ  241 (312)
T ss_pred             HHHHHHHHHCCCCCHHHHHHHHCCCHHHHHH
T ss_conf             9999998744999899999897889999999


No 35 
>pfam04645 DUF603 Protein of unknown function, DUF603. This family includes several uncharacterized proteins from Borrelia species.
Probab=24.51  E-value=45  Score=16.30  Aligned_cols=57  Identities=12%  Similarity=0.264  Sum_probs=39.2

Q ss_pred             CCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHH
Q ss_conf             455842434664387589999962888770126837733225648999999999973012
Q gi|254780800|r  116 PRLKDAQISHLVGTTGSTIEQIRNRTHWNTANLVPMDPVTLGLCAQIDLDREIEKCSKNT  175 (192)
Q Consensus       116 PeL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni~P~DPV~LGLCsQ~dLd~av~KA~~~~  175 (192)
                      .-|+|++|++-.|....-|-..|++  |-+.+-.+..---+- .++..|+..+..+-++.
T Consensus        18 G~L~D~eIA~~LGVsr~nV~kmRqK--wes~~dsv~edsrvT-isEdtf~~ll~~t~~s~   74 (181)
T pfam04645        18 GSLSDAEIAKELGVSRVNVWRMRQK--WESGEDSVNEDSRVT-ISEDTFEHLLIQTFQSE   74 (181)
T ss_pred             CCCCHHHHHHHHCCHHHHHHHHHHH--HHHCCCCCCCCCCEE-ECHHHHHHHHHHHHHHH
T ss_conf             7875799999978309999999999--981278855567300-03724999999999999


No 36 
>PRK05231 homoserine kinase; Provisional
Probab=24.44  E-value=34  Score=17.09  Aligned_cols=125  Identities=21%  Similarity=0.232  Sum_probs=64.3

Q ss_pred             CCCCHHHHHHHHCC----CHHHEEEECCHHHHCCCCCCCCCCCC--------CCCHHHHHHHHHCCCCCEECCCCCCCCC
Q ss_conf             43479999967087----12051111241463896114740127--------8489999997616487704036643344
Q gi|254780800|r   19 TSLSFDQIAEFCGL----HLLEVMAIADGESLQGIKGFNLISSG--------QLSAEEIALGEKDKNYKLKISKPKSYIL   86 (192)
Q Consensus        19 T~LtF~QIa~FC~l----H~LEVq~IADGeva~gi~g~dPi~~g--------qLt~eEI~~ce~d~~~~L~l~~~~~~~~   86 (192)
                      |.||.++|+.|+.-    -..+++||+.|=.-....=.-  ..|        ..+.+|+...-   +.-..|.....+.|
T Consensus         5 T~ls~~el~~~l~~Y~lG~l~~~~gI~~GieNTNY~v~T--~~g~yILTifEr~~~~~lpff~---~Lm~~L~~~gipcP   79 (319)
T PRK05231          5 TPVSDDELAAFLAPYDLGELLSLKGIAEGIENSNFFLTT--TQGEYVLTLFERLTAEDLPFFL---ELMQHLAARGLPVP   79 (319)
T ss_pred             CCCCHHHHHHHHHHCCCCCCCEECCCCCCCCCCEEEEEE--CCCCEEEEEEECCCHHHCCHHH---HHHHHHHHCCCCCC
T ss_conf             148999999999966999750210646663474289994--5971899996128836552999---99999987899788


Q ss_pred             --CCCCCCCCCCCHHHHCCCHHHHH-HHHHHCCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHCC
Q ss_conf             --22435565342033115717999-9998424558424346643875899999628887701268377332256
Q gi|254780800|r   87 --ESTKKKKRYTPVSKRQDRPNAIL-WLIRNYPRLKDAQISHLVGTTGSTIEQIRNRTHWNTANLVPMDPVTLGL  158 (192)
Q Consensus        87 --~~~~Kg~kYTPlSkRqDkPdAI~-WLiknhPeL~D~QI~KLiGTTK~TI~sIRdRtHWN~~Ni~P~DPV~LGL  158 (192)
                        -..+.|.....+   .+||-+|. ||=-.+++..+..-|+-||..=--       -|=-..+.....|-.+|+
T Consensus        80 ~Pi~~~~G~~i~~l---~gKpa~L~~fL~G~~~~~~~~~~c~~iG~~LA~-------lH~a~~~f~~~r~n~~~~  144 (319)
T PRK05231         80 APLARRDGELLGTL---AGKPAAIVTFLEGKHPRAPTAAHCAEVGEMLAR-------MHLAGRDFPLERPNLRSL  144 (319)
T ss_pred             CEEECCCCCCHHHH---CCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHH-------HHHCCCCCCCCCCCCCCH
T ss_conf             36788998755785---690489998258988799999999999999999-------665216798668888986


No 37 
>TIGR01362 KDO8P_synth 3-deoxy-8-phosphooctulonate synthase; InterPro: IPR006269   These sequences describe 2-dehydro-3-deoxyphosphooctonate aldolase. Alternate names include 3-deoxy-d-manno-octulosonic acid 8-phosphate and KDO-8 phosphate synthetase. It catalyzes the aldol condensation of phosphoenolpyruvate with D-arabinose 5-phosphate.   phosphoenolpyruvate + D-arabinose 5-phosphate + H_2O = 2-dehydro-3-deoxy-D-octonate 8-phosphate + phosphate    In Gram-negative bacteria, this is the first step in the biosynthesis of 3-deoxy-D-manno-octulosonate, part of the oligosaccharide core of lipopolysaccharide. ; GO: 0008676 3-deoxy-8-phosphooctulonate synthase activity, 0008152 metabolic process, 0005737 cytoplasm.
Probab=23.81  E-value=39  Score=16.67  Aligned_cols=18  Identities=39%  Similarity=0.416  Sum_probs=14.7

Q ss_pred             CCCHHHHHHHHHHHHHHH
Q ss_conf             648999999999973012
Q gi|254780800|r  158 LCAQIDLDREIEKCSKNT  175 (192)
Q Consensus       158 LCsQ~dLd~av~KA~~~~  175 (192)
                      ||.|.||=.|+.+.++=.
T Consensus       103 LCRQTDLl~AaA~TG~iv  120 (279)
T TIGR01362       103 LCRQTDLLVAAAKTGKIV  120 (279)
T ss_pred             HHHHHHHHHHHHHCCCEE
T ss_conf             211468999998239868


No 38 
>KOG0289 consensus
Probab=22.82  E-value=38  Score=16.75  Aligned_cols=17  Identities=35%  Similarity=0.475  Sum_probs=14.8

Q ss_pred             CCCCCCCCCCCCHHHHH
Q ss_conf             11474012784899999
Q gi|254780800|r   50 KGFNLISSGQLSAEEIA   66 (192)
Q Consensus        50 ~g~dPi~~gqLt~eEI~   66 (192)
                      -|.|||.++.|+.|||-
T Consensus        35 ~G~DPIt~~pLs~eelV   51 (506)
T KOG0289          35 TGKDPITNEPLSIEELV   51 (506)
T ss_pred             CCCCCCCCCCCCHHHEE
T ss_conf             08998779967889905


No 39 
>KOG3832 consensus
Probab=22.70  E-value=34  Score=17.08  Aligned_cols=26  Identities=38%  Similarity=0.604  Sum_probs=21.0

Q ss_pred             CCCCHH--HHHHHHHHCCCCHHHHHHHH
Q ss_conf             886134--54302420434799999670
Q gi|254780800|r    5 PLMPKA--SAVWLIDNTSLSFDQIAEFC   30 (192)
Q Consensus         5 pLMPkA--TAvWLvdNT~LtF~QIa~FC   30 (192)
                      .-||||  ||-||+--|-|+|--.-.|.
T Consensus        31 lalpkafatagwllsi~ll~fl~fmsfm   58 (319)
T KOG3832          31 LALPKAFATAGWLLSITLLTFLAFMSFM   58 (319)
T ss_pred             CCCHHHHHHCCHHHHHHHHHHHHHHHHH
T ss_conf             2250767532099999999999999999


No 40 
>smart00342 HTH_ARAC helix_turn_helix, arabinose operon control protein.
Probab=22.67  E-value=51  Score=15.99  Aligned_cols=22  Identities=36%  Similarity=0.415  Sum_probs=13.3

Q ss_pred             HHHHHHHCCCCHHHHHHHHCCC
Q ss_conf             4302420434799999670871
Q gi|254780800|r   12 AVWLIDNTSLSFDQIAEFCGLH   33 (192)
Q Consensus        12 AvWLvdNT~LtF~QIa~FC~lH   33 (192)
                      |..++.++..+..+||..||..
T Consensus        42 a~~~L~~~~~~i~~ia~~~Gy~   63 (84)
T smart00342       42 ARRLLRDTDLSVTEIALRVGFS   63 (84)
T ss_pred             HHHHHHCCCCHHHHHHHHCCCC
T ss_conf             9999985763499998881999


No 41 
>PRK03902 manganese transport transcriptional regulator; Provisional
Probab=22.43  E-value=75  Score=14.96  Aligned_cols=56  Identities=13%  Similarity=0.084  Sum_probs=37.2

Q ss_pred             HCCCHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHH-HHHCCCCCCCCCCCCCCCHHH
Q ss_conf             11571799999984245584243466438758999-996288877012683773322
Q gi|254780800|r  101 RQDRPNAILWLIRNYPRLKDAQISHLVGTTGSTIE-QIRNRTHWNTANLVPMDPVTL  156 (192)
Q Consensus       101 RqDkPdAI~WLiknhPeL~D~QI~KLiGTTK~TI~-sIRdRtHWN~~Ni~P~DPV~L  156 (192)
                      ..|==-+|+.|.......+-+.|++..|.+++||. .|+.=.-=+.-+-.|-..|.|
T Consensus         6 ~EdYLe~Iy~L~~~~~~vr~~dIA~~L~Vs~~SVs~mikkL~~~Glv~~~~y~~i~L   62 (142)
T PRK03902          6 MEDYIEQIYLLIEEKGYARVSDIAEALSVHPSSVTKMVQKLDKDEYLIYEKYRGLVL   62 (142)
T ss_pred             HHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHCCCCEECCCCCEEE
T ss_conf             999999999998258971299999997899533999999999879924558987568


No 42 
>TIGR02983 SigE-fam_strep RNA polymerase sigma-70 factor, sigma-E family; InterPro: IPR014325   The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes.   With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , .    This group of similar sigma-70 factors includes the sigE factor from Streptomyces coelicolor . The protein appears to include a paralogous expansion in the Streptomycetes lineage, while related Actinomycetales have at most two representatives..
Probab=22.41  E-value=46  Score=16.26  Aligned_cols=27  Identities=22%  Similarity=0.534  Sum_probs=23.8

Q ss_pred             HHHHCCCCCHHHHHHHHCCCHHHHHHH
Q ss_conf             998424558424346643875899999
Q gi|254780800|r  111 LIRNYPRLKDAQISHLVGTTGSTIEQI  137 (192)
Q Consensus       111 LiknhPeL~D~QI~KLiGTTK~TI~sI  137 (192)
                      .++||-.||+.||+.-.|.--.||.|=
T Consensus       123 VLRY~eDlSe~~~A~~LG~SvGTVKS~  149 (165)
T TIGR02983       123 VLRYYEDLSEAEVAEVLGISVGTVKSR  149 (165)
T ss_pred             EEECCCCCCHHHHHHHHCCCCHHHHHH
T ss_conf             320457898689998819993228998


No 43 
>TIGR02643 T_phosphoryl thymidine phosphorylase; InterPro: IPR013465    Thymidine phosphorylase (alternate name: pyrimidine phosphorylase) 2.4.2.4 from EC is involved, in E. coli and other Proteobacteria, in (deoxy)nucleotide degradation. It is often encoded in an operon together with a deoxyribose-phosphate aldolase, a phosphopentomutase and a purine nucleoside phosphorylase. In many other lineages, the corresponding enzyme is designated pyrimidine-nucleoside phosphorylase (2.4.2.2 from EC); the naming convention here follows standard literature practice.; GO: 0009032 thymidine phosphorylase activity.
Probab=22.37  E-value=31  Score=17.27  Aligned_cols=25  Identities=40%  Similarity=0.591  Sum_probs=20.6

Q ss_pred             HHCCCCHHHHHHHHCCCHHHEEEECCHHHHCC
Q ss_conf             20434799999670871205111124146389
Q gi|254780800|r   17 DNTSLSFDQIAEFCGLHLLEVMAIADGESLQG   48 (192)
Q Consensus        17 dNT~LtF~QIa~FC~lH~LEVq~IADGeva~g   48 (192)
                      |+.+|+-..|+.|       |+||.||.|+-|
T Consensus        12 D~~~Ls~~eI~~F-------i~G~~~~~vseG   36 (440)
T TIGR02643        12 DGLSLSDAEIAAF-------INGVTDGSVSEG   36 (440)
T ss_pred             CCCCCCHHHHHHH-------HHHHCCCCCCHH
T ss_conf             7888898899999-------852037884468


No 44 
>cd00090 HTH_ARSR Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors. ARSR subfamily of helix-turn-helix bacterial transcription regulatory proteins (winged helix topology). Includes several proteins that appear to dissociate from DNA in the presence of metal ions.
Probab=22.29  E-value=76  Score=14.95  Aligned_cols=30  Identities=27%  Similarity=0.400  Sum_probs=26.3

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHH
Q ss_conf             9999998424558424346643875899999
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTIEQI  137 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI~sI  137 (192)
                      .|+.++...| ++-+||++.+|.+.+||..-
T Consensus        11 ~Il~~L~~~~-~~~~eia~~l~is~~~vs~h   40 (78)
T cd00090          11 RILRLLLEGP-LTVSELAERLGLSQSTVSRH   40 (78)
T ss_pred             HHHHHHHHCC-CCHHHHHHHHCCCHHHHHHH
T ss_conf             9999998489-06999987778487899999


No 45 
>PRK10371 DNA-binding transcriptional regulator MelR; Provisional
Probab=22.26  E-value=63  Score=15.44  Aligned_cols=71  Identities=17%  Similarity=0.225  Sum_probs=41.9

Q ss_pred             HHHHHHHHCC-CCCHHHHHHHHCCCHHHHHHH----------------H--------CCCCCCCCCCCCCCCHHHCCCCH
Q ss_conf             9999998424-558424346643875899999----------------6--------28887701268377332256489
Q gi|254780800|r  107 AILWLIRNYP-RLKDAQISHLVGTTGSTIEQI----------------R--------NRTHWNTANLVPMDPVTLGLCAQ  161 (192)
Q Consensus       107 AI~WLiknhP-eL~D~QI~KLiGTTK~TI~sI----------------R--------dRtHWN~~Ni~P~DPV~LGLCsQ  161 (192)
                      -+.|+-.||- +|+-.+||+.+|-.+.....+                |        -.|..++..|.    ...|+.+.
T Consensus       196 il~YI~~hY~e~IsL~diA~~~~lS~~y~sr~FKk~tG~T~~~YL~~~Rl~~A~~LL~~Td~sIteIA----~~~GF~n~  271 (302)
T PRK10371        196 MLGFIAENYDQALTINDVAEHVKLNANYAMGIFQRVMQLTMKQYITAMRINHVRALLSDTDKSILDIA----LTAGFRSS  271 (302)
T ss_pred             HHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHH----HHHCCCCH
T ss_conf             99999998289999999998979599999999999989099999999999999999874899899999----99589987


Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q ss_conf             99999999973012124554
Q gi|254780800|r  162 IDLDREIEKCSKNTEAKYNK  181 (192)
Q Consensus       162 ~dLd~av~KA~~~~~~~~~~  181 (192)
                      .-+.+.-.|.-+-.+.+-.+
T Consensus       272 s~F~r~FKk~~G~TP~eYRK  291 (302)
T PRK10371        272 SRFYSTFGKYVGMSPQQYRK  291 (302)
T ss_pred             HHHHHHHHHHHCCCHHHHHH
T ss_conf             99999999887909899998


No 46 
>PRK12544 RNA polymerase sigma factor; Provisional
Probab=22.21  E-value=76  Score=14.94  Aligned_cols=36  Identities=22%  Similarity=0.355  Sum_probs=30.9

Q ss_pred             HHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCCC
Q ss_conf             799999984245584243466438758999996288
Q gi|254780800|r  106 NAILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNRT  141 (192)
Q Consensus       106 dAI~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdRt  141 (192)
                      ....++++++-+++-.+|+...|.|-+||.+.-.|.
T Consensus       154 ~R~vl~Lr~~eglS~~EIAe~Lgis~~tVks~L~RA  189 (207)
T PRK12544        154 YARVFMMREFIELETPEICHNEDLTVSNLNVMLYRS  189 (207)
T ss_pred             HHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHH
T ss_conf             999999999819999999999797999999999999


No 47 
>pfam00685 Sulfotransfer_1 Sulfotransferase domain.
Probab=21.97  E-value=36  Score=16.92  Aligned_cols=12  Identities=33%  Similarity=0.803  Sum_probs=7.0

Q ss_pred             CCHHHHHHHHHH
Q ss_conf             613454302420
Q gi|254780800|r    7 MPKASAVWLIDN   18 (192)
Q Consensus         7 MPkATAvWLvdN   18 (192)
                      +||.---||-.-
T Consensus         9 yPKSGnTWlr~i   20 (254)
T pfam00685         9 YPKSGTTWLQEI   20 (254)
T ss_pred             CCCHHHHHHHHH
T ss_conf             998679999999


No 48 
>COG4989 Predicted oxidoreductase [General function prediction only]
Probab=21.85  E-value=39  Score=16.71  Aligned_cols=22  Identities=41%  Similarity=0.759  Sum_probs=17.2

Q ss_pred             HHHHHHHHHCCCCCHHHHHHHHCCCHH
Q ss_conf             799999984245584243466438758
Q gi|254780800|r  106 NAILWLIRNYPRLKDAQISHLVGTTGS  132 (192)
Q Consensus       106 dAI~WLiknhPeL~D~QI~KLiGTTK~  132 (192)
                      =||+||+|+ |    +++.-.|||+|.
T Consensus       245 VaiAWllR~-P----a~~~PiiGt~~~  266 (298)
T COG4989         245 VAIAWLLRH-P----AKPQPIIGTGNL  266 (298)
T ss_pred             HHHHHHHHC-C----CCCCCEECCCCH
T ss_conf             999999967-6----756541057988


No 49 
>pfam07750 GcrA GcrA cell cycle regulator. GcrA is a master cell cycle regulator that, together with CtrA (see pfam00072 and pfam00486), is involved in controlling cell cycle progression and asymmetric polar morphogenesis. During this process, there are temporal and spatial variations in the concentrations of GcrA and CtrA. The variation in concentration produces time and space dependent transcriptional regulation of modular functions that implement cell-cycle processes. More specifically, GcrA acts as an activator of components of the replisome and the segregation machinery.
Probab=21.69  E-value=36  Score=16.93  Aligned_cols=13  Identities=46%  Similarity=0.419  Sum_probs=7.5

Q ss_pred             CCCCHHHHHHHHC
Q ss_conf             4347999996708
Q gi|254780800|r   19 TSLSFDQIAEFCG   31 (192)
Q Consensus        19 T~LtF~QIa~FC~   31 (192)
                      -.||--|||.--|
T Consensus        17 eGlSaSqIA~~LG   29 (162)
T pfam07750        17 EGLSASQIAAQLG   29 (162)
T ss_pred             CCCCHHHHHHHHC
T ss_conf             5365999999976


No 50 
>PRK02251 putative septation inhibitor protein; Reviewed
Probab=21.27  E-value=66  Score=15.32  Aligned_cols=33  Identities=27%  Similarity=0.711  Sum_probs=20.0

Q ss_pred             CCCCCCCCHHHHCCCH------------------HHHHHHHHHCCCCCHHHHH
Q ss_conf             3556534203311571------------------7999999842455842434
Q gi|254780800|r   90 KKKKRYTPVSKRQDRP------------------NAILWLIRNYPRLKDAQIS  124 (192)
Q Consensus        90 ~Kg~kYTPlSkRqDkP------------------dAI~WLiknhPeL~D~QI~  124 (192)
                      +|...|||-..++...                  -+++|+|-+|  |+.+|+.
T Consensus         7 RKK~~~t~~~~~~~~~~~~~~P~W~~p~m~~lm~iGL~Wivv~Y--is~~~~P   57 (84)
T PRK02251          7 RKKQDYTPPPSREAKKGTKVNPRWFAPVMVALMIIGLIWLVVYY--VTSGSLP   57 (84)
T ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH--HCCCCCC
T ss_conf             57877889964445666789984089999999999899984531--1368778


No 51 
>pfam00325 Crp Bacterial regulatory proteins, crp family.
Probab=21.27  E-value=48  Score=16.17  Aligned_cols=16  Identities=31%  Similarity=0.465  Sum_probs=8.6

Q ss_pred             CCHHHHHHHHCCCHHH
Q ss_conf             4799999670871205
Q gi|254780800|r   21 LSFDQIAEFCGLHLLE   36 (192)
Q Consensus        21 LtF~QIa~FC~lH~LE   36 (192)
                      +|-..||+||||-..-
T Consensus         3 msrqdIadylGlt~ET   18 (32)
T pfam00325         3 MSRQEIADYLGLTRET   18 (32)
T ss_pred             CCHHHHHHHHCCHHHH
T ss_conf             6588999984725999


No 52 
>smart00103 ALBUMIN serum albumin.
Probab=20.79  E-value=75  Score=14.95  Aligned_cols=29  Identities=21%  Similarity=0.283  Sum_probs=22.4

Q ss_pred             HHHHHHHHCCCCCHHHHHHHHCCCHHHHH
Q ss_conf             99999984245584243466438758999
Q gi|254780800|r  107 AILWLIRNYPRLKDAQISHLVGTTGSTIE  135 (192)
Q Consensus       107 AI~WLiknhPeL~D~QI~KLiGTTK~TI~  135 (192)
                      -++=+.|+||++++.||-++...-...++
T Consensus       145 fl~e~~rrhP~~~~~~ll~~a~~y~~~l~  173 (187)
T smart00103      145 FLYELSRRHPELSEPQLLRVAKEYEALLE  173 (187)
T ss_pred             HHHHHHHHCCCCCHHHHHHHHHHHHHHHH
T ss_conf             99999961888989999999999999999


No 53 
>pfam03339 Pox_L3_FP4 Poxvirus L3/FP4 protein.
Probab=20.69  E-value=34  Score=17.09  Aligned_cols=42  Identities=19%  Similarity=0.210  Sum_probs=32.4

Q ss_pred             CCHHHHCCCHH--HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHH
Q ss_conf             42033115717--9999998424558424346643875899999
Q gi|254780800|r   96 TPVSKRQDRPN--AILWLIRNYPRLKDAQISHLVGTTGSTIEQI  137 (192)
Q Consensus        96 TPlSkRqDkPd--AI~WLiknhPeL~D~QI~KLiGTTK~TI~sI  137 (192)
                      -++++|.-.+-  =+-|++|||-+|-+-|-..+--+-|+||..-
T Consensus       204 ~~la~r~~e~~nYFfEW~I~NHl~lLsk~~~difkikkk~~t~~  247 (319)
T pfam03339       204 DTLATRNLESTNYFFEWFIRNHFELLSRQCLDIFKIKKKYITTP  247 (319)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHEEEEHHCCCC
T ss_conf             27776320226899999998798774103422522331002566


No 54 
>pfam00242 DNA_pol_viral_N DNA polymerase (viral) N-terminal domain.
Probab=20.48  E-value=73  Score=15.05  Aligned_cols=42  Identities=21%  Similarity=0.405  Sum_probs=30.2

Q ss_pred             CCCCCCCHHHHHHHHH-------CCCCCEECCCCCCCCCCCCCCCCCCCCHHH
Q ss_conf             0127848999999761-------648770403664334422435565342033
Q gi|254780800|r   55 ISSGQLSAEEIALGEK-------DKNYKLKISKPKSYILESTKKKKRYTPVSK  100 (192)
Q Consensus        55 i~~gqLt~eEI~~ce~-------d~~~~L~l~~~~~~~~~~~~Kg~kYTPlSk  100 (192)
                      +..--|..+-|.+|+.       |++++||++.+.--.|+    --+|-||.|
T Consensus        84 fp~iHl~~diin~c~q~~gpltvnekRrlKli~PArF~Pk----stkYfPl~k  132 (379)
T pfam00242        84 FPDIHLHEDIINKCEQFVGPLTVNEKRRLKLIMPARFYPK----STKYFPLDK  132 (379)
T ss_pred             CCCCHHHHHHHHHHHHHCCCCCCCHHCCEEEECCCCCCCC----CCCCCCCCC
T ss_conf             7420002889987675247654120004024133220567----530244013


No 55 
>PRK11512 DNA-binding transcriptional repressor MarR; Provisional
Probab=20.27  E-value=83  Score=14.69  Aligned_cols=46  Identities=15%  Similarity=0.279  Sum_probs=33.2

Q ss_pred             HHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHCC---CCCCCCCCCCCCC
Q ss_conf             999998424558424346643875899999628---8877012683773
Q gi|254780800|r  108 ILWLIRNYPRLKDAQISHLVGTTGSTIEQIRNR---THWNTANLVPMDP  153 (192)
Q Consensus       108 I~WLiknhPeL~D~QI~KLiGTTK~TI~sIRdR---tHWN~~Ni~P~DP  153 (192)
                      ++--+..++.++-+++++++|.++.||..+=+|   .-|-.---.|.|=
T Consensus        45 vL~~L~~~~~~t~~eLa~~l~id~~tvt~~ldrLe~~GlI~R~~~p~Dr   93 (144)
T PRK11512         45 VLCSIRCAACITPVELKKVLSVDLGALTRMLDRLVCKGWVERLPNPNDK   93 (144)
T ss_pred             HHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEECCCCCCC
T ss_conf             9999998699799999999788887899999999978796635798776


No 56 
>PRK05294 carB carbamoyl phosphate synthase large subunit; Reviewed
Probab=20.16  E-value=71  Score=15.11  Aligned_cols=25  Identities=28%  Similarity=0.551  Sum_probs=21.1

Q ss_pred             CCCCHHHHHHHHCCCHHHHHHHHCC
Q ss_conf             4558424346643875899999628
Q gi|254780800|r  116 PRLKDAQISHLVGTTGSTIEQIRNR  140 (192)
Q Consensus       116 PeL~D~QI~KLiGTTK~TI~sIRdR  140 (192)
                      =-.+|.||++|.|++-..|...|..
T Consensus       490 ~GFSD~~IA~l~~~~~~~Vr~~R~~  514 (1063)
T PRK05294        490 LGFSDARIAKLLGVTEDEVRKLRKE  514 (1063)
T ss_pred             HCCCHHHHHHHHCCCHHHHHHHHHH
T ss_conf             0998599998849999999999998


No 57 
>cd06405 PB1_Mekk2_3 The PB1 domain is present in the two mitogen-activated protein kinase kinases MEKK2 and MEKK3 which are two members of the signaling kinase cascade involved in angiogenesis and early cardiovascular development. The PB1 domain of MEKK2 (and/or MEKK3) interacts with the PB1 domain of another member of the kinase cascade Map2k5.  A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster.  Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, f
Probab=20.04  E-value=60  Score=15.54  Aligned_cols=22  Identities=36%  Similarity=0.439  Sum_probs=17.2

Q ss_pred             HHHCCCCHHHHHHHHHHHHHHH
Q ss_conf             3225648999999999973012
Q gi|254780800|r  154 VTLGLCAQIDLDREIEKCSKNT  175 (192)
Q Consensus       154 V~LGLCsQ~dLd~av~KA~~~~  175 (192)
                      +..-|-+|.|||+||+--.++.
T Consensus        46 L~IPl~~Q~DLDkAiellD~s~   67 (79)
T cd06405          46 LLIPLKNQEDLDRAIELLDRSP   67 (79)
T ss_pred             EEEECCCHHHHHHHHHHHCCCC
T ss_conf             7886256777899999870592


Done!