Query         030521
Match_columns 176
No_of_seqs    113 out of 765
Neff          7.5 
Searched_HMMs 46136
Date          Fri Mar 29 14:59:15 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/030521.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/030521hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1724 SCF ubiquitin ligase,  100.0 6.6E-49 1.4E-53  300.5  16.5  155   12-167     2-162 (162)
  2 COG5201 SKP1 SCF ubiquitin lig 100.0   8E-46 1.7E-50  269.5  15.0  150   15-167     2-157 (158)
  3 PF01466 Skp1:  Skp1 family, di 100.0 5.7E-30 1.2E-34  174.8   8.2   78   89-167     1-78  (78)
  4 smart00512 Skp1 Found in Skp1  100.0 2.2E-28 4.7E-33  175.4  11.2  102   15-117     2-104 (104)
  5 PF03931 Skp1_POZ:  Skp1 family  99.8   4E-20 8.8E-25  120.8   6.8   62   15-78      1-62  (62)
  6 KOG3473 RNA polymerase II tran  99.8 2.8E-18   6E-23  119.5   7.9  103    6-117     8-112 (112)
  7 PF00651 BTB:  BTB/POZ domain;   98.8 1.1E-07 2.5E-12   67.1   9.8   99   13-130     9-109 (111)
  8 PHA02713 hypothetical protein;  98.7 1.6E-07 3.5E-12   85.0  10.8  106   14-141    25-132 (557)
  9 smart00225 BTB Broad-Complex,   98.5 5.7E-07 1.2E-11   60.0   6.8   85   21-125     5-90  (90)
 10 PHA02790 Kelch-like protein; P  98.4 1.1E-06 2.3E-11   78.3   7.5   99   17-136    23-124 (480)
 11 PHA03098 kelch-like protein; P  98.3 3.8E-06 8.2E-11   75.2  10.5   98   15-136    10-109 (534)
 12 KOG4441 Proteins containing BT  98.3   4E-06 8.8E-11   76.2   8.8   96   16-132    38-134 (571)
 13 KOG4682 Uncharacterized conser  96.5   0.008 1.7E-07   52.4   6.3  108   21-151    75-186 (488)
 14 KOG4350 Uncharacterized conser  96.3   0.037   8E-07   48.6   9.2  120   13-141    43-181 (620)
 15 KOG2716 Polymerase delta-inter  95.5    0.17 3.6E-06   41.2   9.4  103   13-135     3-108 (230)
 16 KOG3433 Protein involved in me  94.0    0.13 2.7E-06   40.4   5.0   31  116-151   170-200 (203)
 17 KOG0783 Uncharacterized conser  93.4    0.22 4.7E-06   47.4   6.4  112   17-143   713-826 (1267)
 18 COG5124 Protein predicted to b  91.9    0.17 3.7E-06   39.5   3.1   31  116-151   174-204 (209)
 19 PF11822 DUF3342:  Domain of un  90.3    0.54 1.2E-05   40.0   4.9   90   24-133    13-105 (317)
 20 PF02214 BTB_2:  BTB/POZ domain  89.3    0.29 6.2E-06   33.6   2.1   83   23-124     6-94  (94)
 21 PF03962 Mnd1:  Mnd1 family;  I  84.7     1.4 3.1E-05   34.6   4.0   44  103-151   142-187 (188)
 22 KOG2075 Topoisomerase TOP1-int  73.9      18 0.00039   32.7   7.6  108   25-132   129-250 (521)
 23 PF03656 Pam16:  Pam16;  InterP  73.5     1.5 3.3E-05   32.4   0.8   39  131-172    51-89  (127)
 24 KOG0511 Ankyrin repeat protein  67.8      16 0.00034   32.3   5.8  108   22-133   298-431 (516)
 25 PF05409 Peptidase_C30:  Corona  63.2      23 0.00049   29.8   5.7   98   54-155   161-265 (293)
 26 KOG1665 AFH1-interacting prote  60.0      25 0.00054   28.9   5.3   93   13-126     7-105 (302)
 27 PF07928 Vps54:  Vps54-like pro  57.5     3.4 7.4E-05   30.8   0.0   47   23-77      1-47  (135)
 28 PRK05365 malonic semialdehyde   53.1     9.4  0.0002   29.6   1.9   35  107-151   130-164 (195)
 29 KOG3442 Uncharacterized conser  52.2      12 0.00027   27.6   2.2   36  133-171    54-89  (132)
 30 PRK13696 hypothetical protein;  51.6      42 0.00091   21.7   4.3   26  133-161    36-61  (62)
 31 cd02148 Nitroreductase_5 Nitro  50.5      10 0.00023   29.0   1.7   33  108-150   124-156 (185)
 32 KOG4591 Uncharacterized conser  49.2      33 0.00071   27.8   4.3   57   56-130   108-164 (280)
 33 PF11978 MVP_shoulder:  Shoulde  46.0      18 0.00038   26.5   2.1   27  109-135    37-63  (118)
 34 PF12556 CobS_N:  Cobaltochelat  44.6      13 0.00029   21.3   1.1   12  140-151    14-25  (36)
 35 PF10892 DUF2688:  Protein of u  44.5      27  0.0006   22.1   2.6   25  137-161    30-56  (60)
 36 KOG3840 Uncharaterized conserv  39.8      46   0.001   28.6   4.0   67    8-75     89-161 (438)
 37 PF10075 PCI_Csn8:  COP9 signal  39.3      71  0.0015   23.4   4.7   48  117-168    76-127 (143)
 38 PRK14981 DNA-directed RNA poly  38.9 1.2E+02  0.0025   21.7   5.6   45  116-161    61-106 (112)
 39 PF15063 TC1:  Thyroid cancer p  38.7      33 0.00072   23.1   2.4   31  127-157    29-59  (79)
 40 smart00385 CYCLIN domain prese  38.2      97  0.0021   19.1   5.1   41  117-160    37-79  (83)
 41 PF11460 DUF3007:  Protein of u  37.2      34 0.00074   24.4   2.5   24  136-162    79-102 (104)
 42 KOG4350 Uncharacterized conser  36.6      34 0.00073   30.6   2.8   33  101-133   114-146 (620)
 43 PF13921 Myb_DNA-bind_6:  Myb-l  35.4      64  0.0014   19.7   3.3   34  125-158    20-60  (60)
 44 PF11197 DUF2835:  Protein of u  35.2      46   0.001   21.8   2.7   25   11-35     17-41  (68)
 45 PRK11053 dihydropteridine redu  35.1      30 0.00065   27.2   2.2   36  105-150   147-183 (217)
 46 PF06851 DUF1247:  Protein of u  33.3 1.5E+02  0.0032   22.4   5.4   72   72-147    61-134 (148)
 47 cd00068 GGL G protein gamma su  33.0      59  0.0013   20.3   2.8   20   62-81     22-41  (57)
 48 TIGR02698 CopY_TcrY copper tra  32.6 1.5E+02  0.0032   21.6   5.4   38  124-161    76-113 (130)
 49 PF10415 FumaraseC_C:  Fumarase  32.3 1.1E+02  0.0024   18.9   4.0   23  133-158    24-46  (55)
 50 cd00167 SANT 'SWI3, ADA2, N-Co  31.9      51  0.0011   18.1   2.3   20  126-145    23-42  (45)
 51 PF11116 DUF2624:  Protein of u  31.5 1.2E+02  0.0025   20.9   4.3   27  130-158    10-39  (85)
 52 PF08671 SinI:  Anti-repressor   31.4      38 0.00083   18.6   1.5   13  133-145    15-27  (30)
 53 cd05029 S-100A6 S-100A6: S-100  30.5 1.1E+02  0.0025   20.6   4.2   45  117-161     9-59  (88)
 54 PF02519 Auxin_inducible:  Auxi  30.3 1.9E+02  0.0042   20.2   6.1   62   10-73     34-99  (100)
 55 smart00717 SANT SANT  SWI3, AD  30.1      57  0.0012   18.2   2.3   20  126-145    25-44  (49)
 56 TIGR02476 BluB cob(II)yrinic a  30.1      42 0.00091   26.1   2.2   38  104-151   133-170 (205)
 57 PF13798 PCYCGC:  Protein of un  27.6      89  0.0019   24.0   3.5   23  122-144   119-141 (158)
 58 PRK09296 cysteine desufuration  27.6   1E+02  0.0022   23.0   3.8   28  125-152    82-117 (138)
 59 PF11198 DUF2857:  Protein of u  26.9 2.4E+02  0.0052   21.8   6.0   63  101-165    59-136 (180)
 60 cd02144 iodotyrosine_dehalogen  26.9      53  0.0012   24.8   2.3   38  105-152   129-167 (193)
 61 PF00382 TFIIB:  Transcription   26.7 1.6E+02  0.0035   18.6   4.3   30  118-147    36-67  (71)
 62 TIGR01999 iscU FeS cluster ass  25.8      79  0.0017   22.8   2.9   38  108-145    50-88  (124)
 63 TIGR03147 cyt_nit_nrfF cytochr  25.8      71  0.0015   23.6   2.6   19  126-144    64-83  (126)
 64 COG1926 Predicted phosphoribos  25.6 1.2E+02  0.0025   24.7   3.9   63   53-116   125-215 (220)
 65 TIGR03419 NifU_clost FeS clust  25.6      88  0.0019   22.5   3.1   37  108-144    46-83  (121)
 66 PRK11325 scaffold protein; Pro  25.5      79  0.0017   23.0   2.9   38  108-145    52-90  (127)
 67 PF03750 DUF310:  Protein of un  25.3   1E+02  0.0022   22.1   3.4   34  115-149     8-43  (119)
 68 PHA03098 kelch-like protein; P  25.3 2.4E+02  0.0053   25.1   6.5   98   62-168    74-182 (534)
 69 PRK15019 CsdA-binding activato  24.6 1.1E+02  0.0025   23.0   3.6   37  112-152    83-127 (147)
 70 COG4957 Predicted transcriptio  24.5      67  0.0014   24.2   2.2   18  134-151    98-115 (148)
 71 TIGR03391 FeS_syn_CsdE cystein  24.1 1.2E+02  0.0026   22.6   3.6   36  113-152    79-122 (138)
 72 PF00249 Myb_DNA-binding:  Myb-  24.1 1.6E+02  0.0034   17.1   4.9   39  101-144     5-44  (48)
 73 cd02140 Nitroreductase_4 Nitro  23.9      62  0.0013   24.8   2.2   39  104-150   126-164 (192)
 74 PF01592 NifU_N:  NifU-like N t  23.7      99  0.0021   22.3   3.1   21  122-142    65-85  (126)
 75 PF07076 DUF1344:  Protein of u  23.7      54  0.0012   21.1   1.4   32    2-33      2-33  (61)
 76 PF07638 Sigma70_ECF:  ECF sigm  23.6 3.3E+02  0.0072   20.7   7.1   46  101-148   119-165 (185)
 77 PF00690 Cation_ATPase_N:  Cati  23.5 1.2E+02  0.0026   19.1   3.2   30  134-163     3-34  (69)
 78 PF07671 DUF1601:  Protein of u  23.5      84  0.0018   18.1   2.1   30  114-143     7-36  (37)
 79 PF11338 DUF3140:  Protein of u  23.1      84  0.0018   21.9   2.4   22   58-79     54-75  (92)
 80 KOG0027 Calmodulin and related  22.8 3.1E+02  0.0067   20.0   6.1   60  102-161    43-113 (151)
 81 PF03918 CcmH:  Cytochrome C bi  21.8      78  0.0017   23.9   2.2   12  133-144    72-83  (148)
 82 COG4049 Uncharacterized protei  21.7      67  0.0014   20.5   1.5   14   15-28      5-18  (65)
 83 PF03883 DUF328:  Protein of un  21.6      88  0.0019   25.3   2.7   28  125-152    31-58  (237)
 84 PF00134 Cyclin_N:  Cyclin, N-t  21.3 1.5E+02  0.0033   20.5   3.6   42  119-160    74-116 (127)
 85 smart00224 GGL G protein gamma  21.2 1.1E+02  0.0025   19.4   2.6   21   62-82     22-42  (63)
 86 PF04751 DUF615:  Protein of un  21.0   3E+02  0.0064   21.0   5.3   57   88-145    97-153 (157)
 87 PLN03219 uncharacterized prote  20.6 3.3E+02  0.0072   19.5   6.3   59   12-72     39-104 (108)
 88 PF02657 SufE:  Fe-S metabolism  20.5 1.8E+02  0.0039   21.1   3.9   35  115-153    67-108 (125)
 89 smart00875 BACK BTB And C-term  20.3 2.6E+02  0.0055   18.1   5.7   62  106-168     2-73  (101)

No 1  
>KOG1724 consensus SCF ubiquitin ligase, Skp1 component [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=6.6e-49  Score=300.50  Aligned_cols=155  Identities=45%  Similarity=0.695  Sum_probs=142.0

Q ss_pred             CCCCcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCC--C----CC
Q 030521           12 QATKKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKK--R----SP   85 (176)
Q Consensus        12 ~~~~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~--~----~~   85 (176)
                      |+.++|+|+|+||++|.|+.++|++|.+|++++...+...++.+||||+|+|.||++|++||+||+.++++  +    ..
T Consensus         2 ~~~~~ikL~SsDG~~f~ve~~~a~~s~~i~~~~~~~~~~~~~~~IPl~nV~~~iL~kVIewC~~Hk~d~~~~~~~~~~~~   81 (162)
T KOG1724|consen    2 MSKKKIKLESSDGEIFEVEEEVARQSQTISAHMIEDGCADENDPIPLPNVTSKILKKVIEWCKKHKDDDPANPEDKELPE   81 (162)
T ss_pred             CCCCeEEEEccCCceeehhHHHHHHhHHHHHHHHHcCCCccCCccccCccCHHHHHHHHHHHHHcccccccccccccccc
Confidence            67899999999999999999999999999999987776543369999999999999999999999998654  1    12


Q ss_pred             ccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccc
Q 030521           86 KQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWA  165 (176)
Q Consensus        86 ~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~  165 (176)
                      ...+++||++|++. ++.+||+|+.|||||+|++|+++||+.||++++||||+|||++|||++|+|+||++++++++.|+
T Consensus        82 ~~~i~~WD~~Flk~-d~~tLfdli~AAnyLdi~gLl~~~ck~va~mikgktpeEir~~f~I~~d~t~eE~~~~~~e~~~~  160 (162)
T KOG1724|consen   82 ETDIPEWDAEFLKV-DQGTLFDLILAANYLDIKGLLDLTCKTVANMIKGKTPEEIREIFNIENDETPEEEEAIRKENEWA  160 (162)
T ss_pred             cCCccHHHHHHHhc-CHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHccCCHHHHHHHcCCCCCCChhHHHHHhhccccc
Confidence            23599999999995 99999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cC
Q 030521          166 FE  167 (176)
Q Consensus       166 ~~  167 (176)
                      ++
T Consensus       161 ~~  162 (162)
T KOG1724|consen  161 FE  162 (162)
T ss_pred             cC
Confidence            85


No 2  
>COG5201 SKP1 SCF ubiquitin ligase, SKP1 component [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=8e-46  Score=269.48  Aligned_cols=150  Identities=35%  Similarity=0.531  Sum_probs=137.6

Q ss_pred             CcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCC-CCC-----ccc
Q 030521           15 KKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKK-RSP-----KQE   88 (176)
Q Consensus        15 ~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~-~~~-----~~~   88 (176)
                      ++|+|.|.||++|+|+..+|..|-+|++++.+.+..  +-|||+|+|+|.+|.+|++||+||+....+ .++     ..+
T Consensus         2 s~i~l~s~dge~F~vd~~iAerSiLikN~l~d~~~~--n~p~p~pnVrSsvl~kv~ew~ehh~~s~sede~d~~~rks~p   79 (158)
T COG5201           2 SMIELESIDGEIFRVDENIAERSILIKNMLCDSTAC--NYPIPAPNVRSSVLMKVQEWMEHHTSSLSEDENDLEIRKSKP   79 (158)
T ss_pred             CceEEEecCCcEEEehHHHHHHHHHHHHHhcccccc--CCCCcccchhHHHHHHHHHHHHhccccCCCccChHhhhccCC
Confidence            579999999999999999999999999998876553  358999999999999999999999986544 222     236


Q ss_pred             cchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccC
Q 030521           89 VKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFE  167 (176)
Q Consensus        89 ~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~  167 (176)
                      .+.||+.|+.. |+++|+++++|||||+|++|+++||+.||++|+||||+|||++|||++||||||+++|||||.|+.+
T Consensus        80 ~D~wdr~Fm~v-DqemL~eI~laaNYL~ikpLLd~gCKivaemirgkSpeeir~tfni~ndfTpEEe~~irkEnEWaed  157 (158)
T COG5201          80 SDFWDRFFMEV-DQEMLLEICLAANYLEIKPLLDLGCKIVAEMIRGKSPEEIRETFNIENDFTPEEERRIRKENEWAED  157 (158)
T ss_pred             ccHHHHHHHHh-hHHHHHHHHHhhccccchHHHHHHHHHHHHHHccCCHHHHHHHhCCCCCCCHHHHHHHHHhcccccc
Confidence            88999999999 9999999999999999999999999999999999999999999999999999999999999999975


No 3  
>PF01466 Skp1:  Skp1 family, dimerisation domain;  InterPro: IPR016072 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a dimerisation domain found at the C-terminal of SKP1 proteins [], as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. This domain is multi-helical in structure, and consists of an interlocked herterodimer in F-box proteins.; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 2P1O_A 3OGL_G 3OGM_A 3C6O_A 2P1N_A 2P1Q_A 3OGK_I 3C6N_A 3C6P_A 2P1P_A ....
Probab=99.96  E-value=5.7e-30  Score=174.83  Aligned_cols=78  Identities=55%  Similarity=0.860  Sum_probs=68.3

Q ss_pred             cchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccC
Q 030521           89 VKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFE  167 (176)
Q Consensus        89 ~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~  167 (176)
                      +++||++|++. +.+.|++|+.||+||+|++|+++||++||.+++|||++|||++|||++|+|+||++++|+||+|+|+
T Consensus         1 l~~wD~~F~~~-~~~~L~~l~~AA~yL~I~~L~~~~~~~iA~~i~gks~eeir~~fgi~~d~t~eee~~i~~e~~w~~~   78 (78)
T PF01466_consen    1 LPEWDQEFLDV-DNDELFDLLNAANYLDIKGLLDLCCKYIANMIKGKSPEEIRKYFGIENDLTPEEEEEIRKENEWAFE   78 (78)
T ss_dssp             HHHHHHHHT-S--HHHHHHHHHHHHHHT-HHHHHHHHHHHHHHHTTS-HHHHHHHHT---TSSHHHHHHHHHHCCCTB-
T ss_pred             CCHHHHHHHHc-CHHHHHHHHHHHHHHcchHHHHHHHHHHHHHhcCCCHHHHHHHcCCCCCCCHHHHHHHHHHcchhcC
Confidence            47899999966 9999999999999999999999999999999999999999999999999999999999999999986


No 4  
>smart00512 Skp1 Found in Skp1 protein family. Family of Skp1 (kinetochore protein required for cell cycle progression) and elongin C (subunit of RNA polymerase II transcription factor SIII) homologues.
Probab=99.96  E-value=2.2e-28  Score=175.38  Aligned_cols=102  Identities=37%  Similarity=0.618  Sum_probs=89.7

Q ss_pred             CcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCC-CCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhh
Q 030521           15 KKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDAS-EDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFN   93 (176)
Q Consensus        15 ~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~-~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd   93 (176)
                      .+|+|+|+||++|.|++.+|++|++|++++++.+.++ ...+||||+|+|.+|++|++||+||+.++.+......+++||
T Consensus         2 ~~v~L~S~Dg~~f~v~~~~a~~S~~i~~~l~~~~~~~~~~~~Ipl~~v~~~~L~~Vi~yc~~h~~~~~~~~~~~~~~~wD   81 (104)
T smart00512        2 KYIKLISSDGEVFEVEREVARQSKTIKAMIEDLGVDDENNNPIPLPNVTSKILSKVIEYCEHHVDDPPSVADKDDIPTWD   81 (104)
T ss_pred             CeEEEEeCCCCEEEecHHHHHHHHHHHHHHHccCcccCCCCCccCCCcCHHHHHHHHHHHHHcccCCCCccccccccHHH
Confidence            5799999999999999999999999999999866543 235999999999999999999999998875422224689999


Q ss_pred             hhhccCCChHHHHHHHHhhhccch
Q 030521           94 EEFGKSKSNDELKEMILVANYLNI  117 (176)
Q Consensus        94 ~eFl~~~~~~~l~~Ll~AA~~L~I  117 (176)
                      .+|++. +++.|++|+.|||||+|
T Consensus        82 ~~F~~~-d~~~l~dLl~AAnyL~I  104 (104)
T smart00512       82 AEFLKI-DQETLFELILAANYLDI  104 (104)
T ss_pred             HHHHcC-CHHHHHHHHHHHHhhCC
Confidence            999996 99999999999999997


No 5  
>PF03931 Skp1_POZ:  Skp1 family, tetramerisation domain;  InterPro: IPR016073 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a POZ domain with a core structure consisting of beta(2)/alpha(2)/beta(2)/alpha(2) in two layers, alpha/beta. This domain is found at the N-terminal of SKP1 proteins [] as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. ; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 1LM8_C 2XAI_E 1VCB_E 3ZRC_K 3ZRF_E 3DCG_B 2C9W_C 1LQB_B 2IZV_C 1HV2_A ....
Probab=99.81  E-value=4e-20  Score=120.81  Aligned_cols=62  Identities=34%  Similarity=0.600  Sum_probs=55.4

Q ss_pred             CcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhh
Q 030521           15 KKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVE   78 (176)
Q Consensus        15 ~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~   78 (176)
                      .+|+|+|+||+.|.|++++|++|++|++|+++.+..+.  +||||+|++++|++|++||+||+.
T Consensus         1 ~~v~L~SsDg~~f~V~~~~a~~S~~i~~ml~~~~~~~~--~Ipl~~v~~~~L~kViewc~~H~~   62 (62)
T PF03931_consen    1 MYVKLVSSDGQEFEVSREAAKQSKTIKNMLEDLGDEDE--PIPLPNVSSRILKKVIEWCEHHKN   62 (62)
T ss_dssp             -EEEEEETTSEEEEEEHHHHTTSHHHHHHHHCTCCCGT--EEEETTS-HHHHHHHHHHHHHHHH
T ss_pred             CEEEEEcCCCCEEEeeHHHHHHhHHHHHHHhhhccccc--ccccCccCHHHHHHHHHHHHhcCC
Confidence            36999999999999999999999999999998766543  799999999999999999999974


No 6  
>KOG3473 consensus RNA polymerase II transcription elongation factor Elongin/SIII, subunit elongin C [Transcription]
Probab=99.76  E-value=2.8e-18  Score=119.52  Aligned_cols=103  Identities=18%  Similarity=0.343  Sum_probs=89.0

Q ss_pred             ccccCcCCCCcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCC--CCCcccccCCCChHHHHHHHHHHHHhhhccCCC
Q 030521            6 SSSCSAQATKKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDA--SEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKR   83 (176)
Q Consensus         6 ~~~~~~~~~~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~--~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~   83 (176)
                      -++|..|.+.+++|+|+||++|.|.+.+|..|++|+.|+.+.+..  .+++.+.++.++|.+|+||++|+.+...+..  
T Consensus         8 ~~g~egp~~~yVkLvS~Ddhefiikre~AmtSgTiraml~gpg~~se~~~n~v~f~di~shiLeKvc~Yl~Yk~rY~~--   85 (112)
T KOG3473|consen    8 YGGCEGPDSMYVKLVSSDDHEFIIKREHAMTSGTIRAMLSGPGVFSEAEKNEVYFRDIPSHILEKVCEYLAYKVRYTN--   85 (112)
T ss_pred             cCCccCcchhheEeecCCCcEEEEeehhhhhhhHHHHHHcCCccccccccceEEeccchHHHHHHHHHHhhheeeecc--
Confidence            368999999999999999999999999999999999999976543  2457899999999999999999998776643  


Q ss_pred             CCccccchhhhhhccCCChHHHHHHHHhhhccch
Q 030521           84 SPKQEVKTFNEEFGKSKSNDELKEMILVANYLNI  117 (176)
Q Consensus        84 ~~~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I  117 (176)
                       ....+++||     + ..+..++|+.|||||++
T Consensus        86 -~s~eiPeF~-----I-ppemaleLL~aAn~Lec  112 (112)
T KOG3473|consen   86 -SSTEIPEFD-----I-PPEMALELLMAANYLEC  112 (112)
T ss_pred             -ccccCCCCC-----C-CHHHHHHHHHHhhhhcC
Confidence             235688875     5 89999999999999974


No 7  
>PF00651 BTB:  BTB/POZ domain;  InterPro: IPR013069 The BTB (for BR-C, ttk and bab) [] or POZ (for Pox virus and Zinc finger) [] domain is present near the N terminus of a fraction of zinc finger (IPR007087 from INTERPRO) proteins and in proteins that contain the IPR006652 from INTERPRO motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation []. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule []. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT [, , ]. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.; GO: 0005515 protein binding; PDB: 3M5B_A 1R28_B 3LBZ_A 3E4U_F 3BIM_B 1R2B_A 1R29_A 2VPK_A 2YY9_B 3GA1_A ....
Probab=98.75  E-value=1.1e-07  Score=67.15  Aligned_cols=99  Identities=16%  Similarity=0.334  Sum_probs=77.2

Q ss_pred             CCCcEEEEeCCCCEEEecHHHH-HHhHHHHHHHhcCC-CCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccc
Q 030521           13 ATKKISLKTADGHLFEVEEPVA-MEFEIVKSFFDEND-DASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVK   90 (176)
Q Consensus        13 ~~~~i~L~s~DG~~f~V~~~~a-~~S~~l~~ll~~~~-~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~   90 (176)
                      ....++|+..||+.|.|++.++ .+|+++++++...+ .+.....|++++++...++.+++||+.....         ++
T Consensus         9 ~~~D~~i~v~d~~~~~vhk~iL~~~S~~F~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~l~~~Y~~~~~---------~~   79 (111)
T PF00651_consen    9 EFSDVTIRVGDGKTFYVHKNILAARSPYFRNLFEGSKFKESTVPEISLPDVSPEAFEAFLEYMYTGEIE---------IN   79 (111)
T ss_dssp             TS--EEEEETTTEEEEE-HHHHHHHBHHHHHHHTTTTSTTSSEEEEEETTSCHHHHHHHHHHHHHSEEE---------EE
T ss_pred             CCCCEEEEECCCEEEeechhhhhccchhhhhcccccccccccccccccccccccccccccccccCCccc---------CC
Confidence            3466899999999999999996 56999999999773 2222246888999999999999999643211         11


Q ss_pred             hhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHH
Q 030521           91 TFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVAN  130 (176)
Q Consensus        91 ~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~  130 (176)
                                +.+.+.+++..|++|+++.|.+.|+.++..
T Consensus        80 ----------~~~~~~~ll~lA~~~~~~~L~~~~~~~l~~  109 (111)
T PF00651_consen   80 ----------SDENVEELLELADKLQIPELKKACEKFLQE  109 (111)
T ss_dssp             -----------TTTHHHHHHHHHHTTBHHHHHHHHHHHHH
T ss_pred             ----------HHHHHHHHHHHHHHhCcHHHHHHHHHHHHh
Confidence                      256689999999999999999999999864


No 8  
>PHA02713 hypothetical protein; Provisional
Probab=98.68  E-value=1.6e-07  Score=85.00  Aligned_cols=106  Identities=13%  Similarity=0.193  Sum_probs=85.4

Q ss_pred             CCcEEEEeCCCCEEEecHHHHH-HhHHHHHHHhcCCCCC-CCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccch
Q 030521           14 TKKISLKTADGHLFEVEEPVAM-EFEIVKSFFDENDDAS-EDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKT   91 (176)
Q Consensus        14 ~~~i~L~s~DG~~f~V~~~~a~-~S~~l~~ll~~~~~~~-~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~   91 (176)
                      -..++|...+|+.|.+++.++. .|++++.|+...-.++ ....|.|++++..+|+.|++|++...              
T Consensus        25 l~DV~L~v~~~~~f~~Hr~vLaa~S~YF~amF~~~~~e~~~~~~v~l~~v~~~~~~~ll~y~Yt~~--------------   90 (557)
T PHA02713         25 LCDVIITIGDGEEIKAHKTILAAGSKYFRTLFTTPMIIRDLVTRVNLQMFDKDAVKNIVQYLYNRH--------------   90 (557)
T ss_pred             CCCEEEEeCCCCEEeehHHHHhhcCHHHHHHhcCCchhhccCceEEeccCCHHHHHHHHHHhcCCC--------------
Confidence            3468888878999999999964 7999999997643222 23578899999999999999999621              


Q ss_pred             hhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHH
Q 030521           92 FNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVR  141 (176)
Q Consensus        92 wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir  141 (176)
                             . +.+.+.+|+.||++|+|+.|.+.||.++...+.-.+-=+|.
T Consensus        91 -------i-~~~nv~~ll~aA~~lqi~~l~~~C~~~l~~~l~~~NCl~i~  132 (557)
T PHA02713         91 -------I-SSMNVIDVLKCADYLLIDDLVTDCESYIKDYTNHDTCIYMY  132 (557)
T ss_pred             -------C-CHHHHHHHHHHHHHHCHHHHHHHHHHHHHhhCCccchHHHH
Confidence                   1 45679999999999999999999999998877665544444


No 9  
>smart00225 BTB Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
Probab=98.48  E-value=5.7e-07  Score=60.02  Aligned_cols=85  Identities=19%  Similarity=0.341  Sum_probs=67.9

Q ss_pred             eCCCCEEEecHHHHHH-hHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhccC
Q 030521           21 TADGHLFEVEEPVAME-FEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGKS   99 (176)
Q Consensus        21 s~DG~~f~V~~~~a~~-S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~~   99 (176)
                      ..+|+.|.+++.++.. |++++.++...........|++++++..+++.+++|++...-                   ..
T Consensus         5 ~v~~~~~~~h~~iL~~~s~~f~~~~~~~~~~~~~~~i~l~~~~~~~f~~~l~~ly~~~~-------------------~~   65 (90)
T smart00225        5 VVGGKKFKAHKAVLAACSPYFKALFSGDFKESKKSEIYLDDVSPEDFRALLEFLYTGKL-------------------DL   65 (90)
T ss_pred             EECCEEEehHHHHHhhcCHHHHHHHcCCCccCCCCEEEecCCCHHHHHHHHHeecCcee-------------------ec
Confidence            4567999999999764 899999998654322345788999999999999999996431                   12


Q ss_pred             CChHHHHHHHHhhhccchhhHHHHHH
Q 030521          100 KSNDELKEMILVANYLNIKDMLDYLT  125 (176)
Q Consensus       100 ~~~~~l~~Ll~AA~~L~I~~L~~l~~  125 (176)
                       +...+.+++.+|++++++.|.+.|+
T Consensus        66 -~~~~~~~l~~~a~~~~~~~l~~~c~   90 (90)
T smart00225       66 -PEENVEELLELADYLQIPGLVELCE   90 (90)
T ss_pred             -CHHHHHHHHHHHHHHCcHHHHhhhC
Confidence             3447899999999999999998874


No 10 
>PHA02790 Kelch-like protein; Provisional
Probab=98.37  E-value=1.1e-06  Score=78.29  Aligned_cols=99  Identities=6%  Similarity=0.054  Sum_probs=74.5

Q ss_pred             EEEEeCCCCEEEecHHH-HHHhHHHHHHHhcCCCCCCCccccc--CCCChHHHHHHHHHHHHhhhccCCCCCccccchhh
Q 030521           17 ISLKTADGHLFEVEEPV-AMEFEIVKSFFDENDDASEDTVVPL--PNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFN   93 (176)
Q Consensus        17 i~L~s~DG~~f~V~~~~-a~~S~~l~~ll~~~~~~~~~~~Ipl--~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd   93 (176)
                      -.+.---|++|.+++.+ |..|++++.|+..+-.+.+ ..|.+  .+|+..+|+.|++|++.-+                
T Consensus        23 ~~~~~~~~~~~~~HR~VLAa~S~YFraMF~~~~~Es~-~~v~~~~~~v~~~~l~~lldy~YTg~----------------   85 (480)
T PHA02790         23 KTIIEAIGGNIIVNSTILKKLSPYFRTHLRQKYTKNK-DPVTRVCLDLDIHSLTSIVIYSYTGK----------------   85 (480)
T ss_pred             ceEEEEcCcEEeeehhhhhhcCHHHHHHhcCCccccc-cceEEEecCcCHHHHHHHHHhheeee----------------
Confidence            34445568899999999 4669999999986433322 23443  3799999999999997433                


Q ss_pred             hhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCC
Q 030521           94 EEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKS  136 (176)
Q Consensus        94 ~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks  136 (176)
                         +.. +.+.+.+|+.||++|+|+.+++.||+++...|.-.+
T Consensus        86 ---l~i-t~~nV~~ll~aA~~Lqi~~v~~~C~~fL~~~l~~~N  124 (480)
T PHA02790         86 ---VYI-DSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEY  124 (480)
T ss_pred             ---EEE-ecccHHHHHHHHHHhChHHHHHHHHHHHHhhCCcch
Confidence               233 667799999999999999999999998877655433


No 11 
>PHA03098 kelch-like protein; Provisional
Probab=98.34  E-value=3.8e-06  Score=75.18  Aligned_cols=98  Identities=16%  Similarity=0.264  Sum_probs=78.1

Q ss_pred             CcEEEEe-CCCCEEEecHHHHH-HhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchh
Q 030521           15 KKISLKT-ADGHLFEVEEPVAM-EFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTF   92 (176)
Q Consensus        15 ~~i~L~s-~DG~~f~V~~~~a~-~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~w   92 (176)
                      +.++|.- .+|+.|.+++.++. .|++++.|+.+.-.   ...|.|+. +..+|+.|++|++...               
T Consensus        10 ~Dv~l~~~~~~~~~~~Hk~vLaa~S~yF~~mf~~~~~---~~~i~l~~-~~~~~~~~l~y~Ytg~---------------   70 (534)
T PHA03098         10 CDESIIIVNGGGIIKVHKIILSSSSEYFKKMFKNNFK---ENEINLNI-DYDSFNEVIKYIYTGK---------------   70 (534)
T ss_pred             CCEEEEEEcCCEEEEeHHHHHHhhhHHHHHHHhCCCC---CceEEecC-CHHHHHHHHHHhcCCc---------------
Confidence            4466664 68999999999965 69999999986433   24788887 9999999999999643               


Q ss_pred             hhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCC
Q 030521           93 NEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKS  136 (176)
Q Consensus        93 d~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks  136 (176)
                          +.. +.+.+.+|+.||++|+++.|.+.|++++...+.-.+
T Consensus        71 ----~~i-~~~~~~~ll~~A~~l~~~~l~~~C~~~l~~~l~~~n  109 (534)
T PHA03098         71 ----INI-TSNNVKDILSIANYLIIDFLINLCINYIIKIIDDNN  109 (534)
T ss_pred             ----eEE-cHHHHHHHHHHHHHhCcHHHHHHHHHHHHHhCCHhH
Confidence                123 566799999999999999999999999887665444


No 12 
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=98.26  E-value=4e-06  Score=76.24  Aligned_cols=96  Identities=18%  Similarity=0.307  Sum_probs=79.1

Q ss_pred             cEEEEeCCCCEEEecHHHH-HHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhh
Q 030521           16 KISLKTADGHLFEVEEPVA-MEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNE   94 (176)
Q Consensus        16 ~i~L~s~DG~~f~V~~~~a-~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~   94 (176)
                      .+.|.-.+ +.|..++.++ ..|++++.|+...-.+.....|.+.+|+..+|+.+++|++...-                
T Consensus        38 Dv~L~v~~-~~~~aHR~VLAa~S~YFraMFt~~l~e~~~~~i~l~~v~~~~l~~ll~y~Yt~~i----------------  100 (571)
T KOG4441|consen   38 DVTLLVGD-REFPAHRVVLAACSPYFRAMFTSGLKESKQKEINLEGVDPETLELLLDYAYTGKL----------------  100 (571)
T ss_pred             eEEEEECC-eeechHHHHHHhccHHHHHHhcCCcccccceEEEEecCCHHHHHHHHHHhhcceE----------------
Confidence            35555555 9999999995 56999999999654443456899999999999999999997652                


Q ss_pred             hhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHH
Q 030521           95 EFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRI  132 (176)
Q Consensus        95 eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i  132 (176)
                         .. +.+.+-+|+.||.+|+|+.+++.||.++...+
T Consensus       101 ---~i-~~~nVq~ll~aA~~lQi~~v~~~C~~fL~~~l  134 (571)
T KOG4441|consen  101 ---EI-SEDNVQELLEAASLLQIPEVVDACCEFLESQL  134 (571)
T ss_pred             ---Ee-chHhHHHHHHHHHHhhhHHHHHHHHHHHHhcC
Confidence               34 78889999999999999999999999887644


No 13 
>KOG4682 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=96.46  E-value=0.008  Score=52.37  Aligned_cols=108  Identities=15%  Similarity=0.224  Sum_probs=80.0

Q ss_pred             eCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCc----ccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhh
Q 030521           21 TADGHLFEVEEPVAMEFEIVKSFFDENDDASEDT----VVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEF   96 (176)
Q Consensus        21 s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~----~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eF   96 (176)
                      ..=|.++++++--+.||+++..|+.+.-.+....    .||=|+|+...|.-++-=+++..                   
T Consensus        75 ~alg~eWrlHk~yL~QS~yf~smf~Gtw~es~~~iIqleI~Dp~Id~~al~~a~gsLY~dE-------------------  135 (488)
T KOG4682|consen   75 EALGFEWRLHKPYLFQSEYFKSMFSGTWKESSMNIIQLEIPDPNIDVVALQVAFGSLYRDE-------------------  135 (488)
T ss_pred             hhccceeeeeeeeeeccHHHHHHhccccChhhCceEEEEcCCCcccHHHHHHHHhhhhhhh-------------------
Confidence            3458889999988999999999988764433222    34456899999988877665422                   


Q ss_pred             ccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521           97 GKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT  151 (176)
Q Consensus        97 l~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t  151 (176)
                      +.+ +.+.+..++.||.+|..++|++-|...+...++-||   +.+++....-+.
T Consensus       136 veI-~l~dv~gvlAaA~~lqldgl~qrC~evMie~lspkt---a~~yYea~ckYg  186 (488)
T KOG4682|consen  136 VEI-KLSDVVGVLAAACLLQLDGLIQRCGEVMIETLSPKT---ACGYYEAACKYG  186 (488)
T ss_pred             eec-cHHHHHHHHHHHHHHHHhhHHHHHHHHHHHhcChhh---hhHhhhhhhhhh
Confidence            335 889999999999999999999999998877665555   455665554444


No 14 
>KOG4350 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=96.26  E-value=0.037  Score=48.58  Aligned_cols=120  Identities=10%  Similarity=0.198  Sum_probs=87.3

Q ss_pred             CCCcEEEEeCCCCEEEecHHH-HHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCC--CCC----
Q 030521           13 ATKKISLKTADGHLFEVEEPV-AMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKK--RSP----   85 (176)
Q Consensus        13 ~~~~i~L~s~DG~~f~V~~~~-a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~--~~~----   85 (176)
                      ....++++-.| +.|...+-+ |..|.+++.|+-++-.+.....|||+.-++..++.+++|++.-+-+-.-  +..    
T Consensus        43 ~y~DVtfvve~-~rfpAHRvILAaRs~yFRAlLYgGm~Es~q~~ipLq~t~~eAF~~lLrYiYtg~~~l~~~~ed~lld~  121 (620)
T KOG4350|consen   43 DYSDVTFVVED-TRFPAHRVILAARSSYFRALLYGGMQESHQQLIPLQETNSEAFRALLRYIYTGKIDLAGVEEDILLDY  121 (620)
T ss_pred             cccceEEEEec-cccchhhhhHHHHHHHHHHHHhhhhhhhhhcccccccccHHHHHHHHHHHhhcceecccchHHHHHHH
Confidence            34557888777 889999988 4679999999986655445578999978899999999999876554221  000    


Q ss_pred             ------------ccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHH
Q 030521           86 ------------KQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVR  141 (176)
Q Consensus        86 ------------~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir  141 (176)
                                  ...++++-++.   +..+.+|-++.||.+.+...|.++||.++     .++..++.
T Consensus       122 LslAh~Ygf~~Le~aiSeYl~~i---L~~~NvCmifdaA~ly~l~~Lt~~C~mfm-----DrnA~~lL  181 (620)
T KOG4350|consen  122 LSLAHRYGFIQLETAISEYLKEI---LKNENVCMIFDAAYLYQLTDLTDYCMMFM-----DRNADQLL  181 (620)
T ss_pred             HHHHHhcCcHHHHHHHHHHHHHH---HcccceeeeeeHHHHhcchHHHHHHHHHH-----hcCHHhhh
Confidence                        01233333332   45677899999999999999999999887     55666654


No 15 
>KOG2716 consensus Polymerase delta-interacting protein PDIP1 and related proteins, contain BTB/POZ domain [Inorganic ion transport and metabolism]
Probab=95.47  E-value=0.17  Score=41.23  Aligned_cols=103  Identities=17%  Similarity=0.285  Sum_probs=75.4

Q ss_pred             CCCcEEEEeCCCCEEEecHHHHH-HhHHHHHHHhcCCCC--CCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCcccc
Q 030521           13 ATKKISLKTADGHLFEVEEPVAM-EFEIVKSFFDENDDA--SEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEV   89 (176)
Q Consensus        13 ~~~~i~L~s~DG~~f~V~~~~a~-~S~~l~~ll~~~~~~--~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~   89 (176)
                      ++..++| ---|.+|.-+...+. ..+.++.|++..-.-  +....|=+ .=|..=+..|+.||+.-...         +
T Consensus         3 ~~~~vkL-nvGG~~F~Tsk~TLtk~dg~fk~m~e~~i~~~~d~s~~IFI-DRSpKHF~~ILNfmRdGdv~---------L   71 (230)
T KOG2716|consen    3 MSETVKL-NVGGTIFKTSKSTLTKFDGFFKTMLETDIPVEKDESGCIFI-DRSPKHFDTILNFMRDGDVD---------L   71 (230)
T ss_pred             ccceEEE-ecCCeEEEeehhhhhhhhhHHHHHhhcCCccccCCcCcEEe-cCChhHHHHHHHhhhccccc---------C
Confidence            3444553 345889999998875 599999999875321  11223434 36778899999999842221         2


Q ss_pred             chhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCC
Q 030521           90 KTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNK  135 (176)
Q Consensus        90 ~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gk  135 (176)
                      |        . +.++|-+|+.-|.|..+++|+++|...++..+++.
T Consensus        72 P--------e-~~kel~El~~EA~fYlL~~Lv~~C~~~i~~~~~~~  108 (230)
T KOG2716|consen   72 P--------E-SEKELKELLREAEFYLLDGLVELCQSAIARLIRGY  108 (230)
T ss_pred             c--------c-chHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccCc
Confidence            2        3 67889999999999999999999999999987774


No 16 
>KOG3433 consensus Protein involved in meiotic recombination/predicted coiled-coil protein [Cell cycle control, cell division, chromosome partitioning; General function prediction only]
Probab=93.96  E-value=0.13  Score=40.37  Aligned_cols=31  Identities=39%  Similarity=0.583  Sum_probs=28.0

Q ss_pred             chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521          116 NIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT  151 (176)
Q Consensus       116 ~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t  151 (176)
                      ||--|.++||+..     |.-+.+||+.||||.||.
T Consensus       170 nI~il~dy~~rkf-----~~e~nqi~~~fgIPed~d  200 (203)
T KOG3433|consen  170 NIFILIDYLYRKF-----GLEPNQIRKEFGIPEDFD  200 (203)
T ss_pred             hHHHHHHHHHHhc-----CCCHHHHhHhcCCCcccc
Confidence            7888999998887     999999999999999974


No 17 
>KOG0783 consensus Uncharacterized conserved protein, contains ankyrin and BTB/POZ domains [Function unknown]
Probab=93.45  E-value=0.22  Score=47.42  Aligned_cols=112  Identities=18%  Similarity=0.263  Sum_probs=77.0

Q ss_pred             EEEEeCCCCEEEecHHHH-HHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhh
Q 030521           17 ISLKTADGHLFEVEEPVA-MEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEE   95 (176)
Q Consensus        17 i~L~s~DG~~f~V~~~~a-~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~e   95 (176)
                      ..++.+||++|..+...+ ..+.++..|+.-.-.+.......+-.++++.|+-|+.|++.--.               .+
T Consensus       713 ~~i~~KDGkvl~aHkc~L~aRlEYF~smf~~~w~E~sS~t~~~~p~~~e~m~ivLdylYs~d~---------------~~  777 (1267)
T KOG0783|consen  713 TVIKLKDGKVLKAHKCFLSARLEYFSSMFQFVWMESSSITVNLSPLTVEHMSIVLDYLYSDDK---------------VE  777 (1267)
T ss_pred             EEEEecCCcCcccceeEeeeHHHHHHHHHHHHHhhhccceeecCcchHHHHHHHHHHHHccch---------------HH
Confidence            445566999999988875 35777877776432221111222222779999999999983221               23


Q ss_pred             hccC-CChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHH
Q 030521           96 FGKS-KSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKF  143 (176)
Q Consensus        96 Fl~~-~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~  143 (176)
                      |++. .+.+-+|+++..|+-|=|..|-+.|-+.+-+++.=|+..++.++
T Consensus       778 ~~k~~~~~dF~~~il~iaDqlli~~Lk~Ice~~ll~kl~lk~~~~llef  826 (1267)
T KOG0783|consen  778 LFKDLKESDFMFEILSIADQLLILELKSICEQSLLRKLNLKTLPTLLEF  826 (1267)
T ss_pred             HHhccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhcccchHHHHHH
Confidence            4443 25677899999999999999999998888888887776555443


No 18 
>COG5124 Protein predicted to be involved in meiotic recombination [Cell division and chromosome partitioning / General function prediction only]
Probab=91.93  E-value=0.17  Score=39.51  Aligned_cols=31  Identities=39%  Similarity=0.592  Sum_probs=26.6

Q ss_pred             chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521          116 NIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT  151 (176)
Q Consensus       116 ~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t  151 (176)
                      +|.-|.++.|+..     +.-+++||+.||||.||.
T Consensus       174 nI~ilidy~c~kf-----~~~~~qir~~fgIPedld  204 (209)
T COG5124         174 NIEILIDYLCKKF-----FLKPEQIRKEFGIPEDLD  204 (209)
T ss_pred             hHHHHHHHHHHHc-----CCCHHHHHHhcCCCcchh
Confidence            6777888888776     889999999999999874


No 19 
>PF11822 DUF3342:  Domain of unknown function (DUF3342);  InterPro: IPR021777  This family of proteins are functionally uncharacterised. This family is found in bacteria. This presumed domain is typically between 170 to 303 amino acids in length. The N-terminal half of this family is a BTB-like domain. 
Probab=90.31  E-value=0.54  Score=39.97  Aligned_cols=90  Identities=6%  Similarity=0.256  Sum_probs=59.7

Q ss_pred             CCEEEecHHHH-HHhHHHHHHHhcC-CCCCCCcccccC-CCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhccCC
Q 030521           24 GHLFEVEEPVA-MEFEIVKSFFDEN-DDASEDTVVPLP-NVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGKSK  100 (176)
Q Consensus        24 G~~f~V~~~~a-~~S~~l~~ll~~~-~~~~~~~~Ipl~-~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~~~  100 (176)
                      .+.|..++..+ ...++++..+... +......+|++. .=+-.|++=+++|++.    ..|                .+
T Consensus        13 ~rdF~C~~~lL~~~M~YF~~~l~~~~~~~~~~~~idisVhCDv~iF~WLm~yv~~----~~p----------------~l   72 (317)
T PF11822_consen   13 KRDFTCPRDLLVSEMRYFAEYLSRYINDSQRWEEIDISVHCDVHIFEWLMRYVKG----EPP----------------SL   72 (317)
T ss_pred             ceeeeccHHHHHHhhHHHHHHHhhcccccCcCCCcceEEecChhHHHHHHHHhhc----CCC----------------cC
Confidence            46799999996 4599999998541 111112245443 1344556666666654    211                23


Q ss_pred             ChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc
Q 030521          101 SNDELKEMILVANYLNIKDMLDYLTETVANRIK  133 (176)
Q Consensus       101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~  133 (176)
                      +...+..++.-|+||+|++|++.|-.|+...+.
T Consensus        73 ~~~NvvsIliSS~FL~M~~Lve~cl~y~~~~~~  105 (317)
T PF11822_consen   73 TPSNVVSILISSEFLQMESLVEECLQYCHDHMS  105 (317)
T ss_pred             CcCcEEEeEehhhhhccHHHHHHHHHHHHHhHH
Confidence            677788999999999999999999988855443


No 20 
>PF02214 BTB_2:  BTB/POZ domain;  InterPro: IPR003131 Potassium channels are the most diverse group of the ion channel family [, ]. They are important in shaping the action potential, and in neuronal excitability and plasticity []. The potassium channel family is composed of several functionally distinct isoforms, which can be broadly separated into 2 groups []: the practically non-inactivating 'delayed' group and the rapidly inactivating 'transient' group. These are all highly similar proteins, with only small amino acid changes causing the diversity of the voltage-dependent gating mechanism, channel conductance and toxin binding properties. Each type of K+ channel is activated by different signals and conditions depending on their type of regulation: some open in response to depolarisation of the plasma membrane; others in response to hyperpolarisation or an increase in intracellular calcium concentration; some can be regulated by binding of a transmitter, together with intracellular kinases; while others are regulated by GTP-binding proteins or other second messengers []. In eukaryotic cells, K+ channels are involved in neural signalling and generation of the cardiac rhythm, act as effectors in signal transduction pathways involving G protein-coupled receptors (GPCRs) and may have a role in target cell lysis by cytotoxic T-lymphocytes []. In prokaryotic cells, they play a role in the maintenance of ionic homeostasis [].  All K+ channels discovered so far possess a core of alpha subunits, each comprising either one or two copies of a highly conserved pore loop domain (P-domain). The P-domain contains the sequence (T/SxxTxGxG), which has been termed the K+ selectivity sequence. In families that contain one P-domain, four subunits assemble to form a selective pathway for K+ across the membrane. However, it remains unclear how the 2 P-domain subunits assemble to form a selective pore. The functional diversity of these families can arise through homo- or hetero-associations of alpha subunits or association with auxiliary cytoplasmic beta subunits. K+ channel subunits containing one pore domain can be assigned into one of two superfamilies: those that possess six transmembrane (TM) domains and those that possess only two TM domains. The six TM domain superfamily can be further subdivided into conserved gene families: the voltage-gated (Kv) channels; the KCNQ channels (originally known as KvLQT channels); the EAG-like K+ channels; and three types of calcium (Ca)-activated K+ channels (BK, IK and SK) []. The 2TM domain family comprises inward-rectifying K+ channels. In addition, there are K+ channel alpha-subunits that possess two P-domains. These are usually highly regulated K+ selective leak channels. The Kv family can be divided into several subfamilies on the basis of sequence similarity and function. Four of these subfamilies, Kv1 (Shaker), Kv2 (Shab), Kv3 (Shaw) and Kv4 (Shal), consist of pore-forming alpha subunits that associate with different types of beta subunit. Each alpha subunit comprises six hydrophobic TM domains with a P-domain between the fifth and sixth, which partially resides in the membrane. The fourth TM domain has positively charged residues at every third residue and acts as a voltage sensor, which triggers the conformational change that opens the channel pore in response to a displacement in membrane potential []. More recently, 4 new electrically-silent alpha subunits have been cloned: Kv5 (KCNF), Kv6 (KCNG), Kv8 and Kv9 (KCNS). These subunits do not themselves possess any functional activity, but appear to form heteromeric channels with Kv2 subunits, and thus modulate Shab channel activity []. When highly expressed, they inhibit channel activity, but at lower levels show more specific modulatory actions. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated potassium channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels []. This domain is found in a subset of a larger group of proteins that contain the BTB/POZ domain.; GO: 0005249 voltage-gated potassium channel activity, 0006813 potassium ion transport, 0008076 voltage-gated potassium channel complex, 0016020 membrane; PDB: 1NN7_A 3KVT_A 1EXB_E 1QDV_A 1DSX_E 1QDW_F 3LUT_B 3LNM_B 2A79_B 3DRY_C ....
Probab=89.27  E-value=0.29  Score=33.57  Aligned_cols=83  Identities=14%  Similarity=0.310  Sum_probs=53.9

Q ss_pred             CCCEEEecHHHHHH--hHHHHHHHhcC---CCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhc
Q 030521           23 DGHLFEVEEPVAME--FEIVKSFFDEN---DDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFG   97 (176)
Q Consensus        23 DG~~f~V~~~~a~~--S~~l~~ll~~~---~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl   97 (176)
                      -|+.|.+++..+..  ...+..++...   ........+=+ +=++..++.|+.|++.. . ..        +       
T Consensus         6 GG~~f~~~~~tL~~~~~s~l~~~~~~~~~~~~~~~~~~~fi-DRdp~~F~~IL~ylr~~-~-~l--------~-------   67 (94)
T PF02214_consen    6 GGTIFETSRSTLTRYPDSLLARLFSGERSDDYDDDDGEYFI-DRDPELFEYILNYLRTG-G-KL--------P-------   67 (94)
T ss_dssp             TTEEEEEEHHHHHTSTTSTTTSHHHTGHGGGEETTTTEEEE-SS-HHHHHHHHHHHHHT-S-SB----------------
T ss_pred             CCEEEEEcHHHHhhCCCChhhhHHhhccccccCCccceEEe-ccChhhhhHHHHHHhhc-C-cc--------C-------
Confidence            48999999999863  45667777643   11112235544 47999999999999985 1 11        0       


Q ss_pred             cCCChHHHHHHHHhhhccchhhH-HHHH
Q 030521           98 KSKSNDELKEMILVANYLNIKDM-LDYL  124 (176)
Q Consensus        98 ~~~~~~~l~~Ll~AA~~L~I~~L-~~l~  124 (176)
                       .-+...+..|...|.|.+|+.| ++.|
T Consensus        68 -~~~~~~~~~l~~Ea~fy~l~~l~i~~c   94 (94)
T PF02214_consen   68 -IPDEICLEELLEEAEFYGLDELFIEDC   94 (94)
T ss_dssp             ---TTS-HHHHHHHHHHHT-HHHHBHHC
T ss_pred             -CCCchhHHHHHHHHHHcCCCccccCCC
Confidence             1024457789999999999999 6654


No 21 
>PF03962 Mnd1:  Mnd1 family;  InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=84.74  E-value=1.4  Score=34.63  Aligned_cols=44  Identities=36%  Similarity=0.474  Sum_probs=33.5

Q ss_pred             HHHHHHHHhhhcc--chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521          103 DELKEMILVANYL--NIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT  151 (176)
Q Consensus       103 ~~l~~Ll~AA~~L--~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t  151 (176)
                      ..+..+..|||..  ||..|..+|+...     |.+.++|++.||||+||-
T Consensus       142 ~~~~~~~~~anrwTDNI~~l~~~~~~k~-----~~~~~~i~k~f~Ip~d~d  187 (188)
T PF03962_consen  142 EEIKIAKEAANRWTDNIFSLKSYLKKKF-----GMDEEDIRKEFGIPEDFD  187 (188)
T ss_pred             HHHHHHHHHHHHHHhhHHHHHHHHHHhc-----CCCHHHHHHHcCCccccC
Confidence            3445556677765  6777777777653     999999999999999973


No 22 
>KOG2075 consensus Topoisomerase TOP1-interacting protein BTBD1 [Function unknown]
Probab=73.94  E-value=18  Score=32.65  Aligned_cols=108  Identities=11%  Similarity=0.169  Sum_probs=64.8

Q ss_pred             CEEEecHHHHHH-hHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCC--------CCcc---ccchh
Q 030521           25 HLFEVEEPVAME-FEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKR--------SPKQ---EVKTF   92 (176)
Q Consensus        25 ~~f~V~~~~a~~-S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~--------~~~~---~~~~w   92 (176)
                      +.+..++-+... |.++..|+.+.-.+.....|.+|.|...++...+.|++--...-.++        ..+-   .+..=
T Consensus       129 q~~paHk~vla~gS~VFdaMf~g~~a~~~s~ei~lpdvepaaFl~~L~flYsdev~~~~dtvi~tl~~AkKY~VpaLer~  208 (521)
T KOG2075|consen  129 QRIPAHKLVLADGSDVFDAMFYGGLAEDASLEIRLPDVEPAAFLAFLRFLYSDEVKLAADTVITTLYAAKKYLVPALERQ  208 (521)
T ss_pred             cccchhhhhhhcchHHHHHHhccCcccccCceeecCCcChhHhHHHHHHHhcchhhhhHHHHHHHHHHHHHhhhHHHHHH
Confidence            455566666544 88889998765443334689999999999999998887521110000        0000   11111


Q ss_pred             hhhhccC--CChHHHHHHHHhhhccchhhHHHHHHHHHHHHH
Q 030521           93 NEEFGKS--KSNDELKEMILVANYLNIKDMLDYLTETVANRI  132 (176)
Q Consensus        93 d~eFl~~--~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i  132 (176)
                      --+|++.  .....++.|...|..++=+.|+..|...|....
T Consensus       209 CVkflr~~l~~~naf~~L~q~A~lf~ep~Li~~c~e~id~~~  250 (521)
T KOG2075|consen  209 CVKFLRKNLMADNAFLELFQRAKLFDEPSLISICLEVIDKSF  250 (521)
T ss_pred             HHHHHHHhcCChHHHHHHHHHHHhhcCHHHHHHHHHHhhhHH
Confidence            1224442  255666666666888888888888877775433


No 23 
>PF03656 Pam16:  Pam16;  InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=73.46  E-value=1.5  Score=32.43  Aligned_cols=39  Identities=23%  Similarity=0.531  Sum_probs=22.9

Q ss_pred             HHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccCCCCCC
Q 030521          131 RIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFEGVDPD  172 (176)
Q Consensus       131 ~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~~~~~~  172 (176)
                      ..+|+|.+|-+++|||+...++|+   |.+.|.-+|+.-+|.
T Consensus        51 ~~~~Mtl~EA~~ILnv~~~~~~ee---I~k~y~~Lf~~Nd~~   89 (127)
T PF03656_consen   51 NSKGMTLDEARQILNVKEELSREE---IQKRYKHLFKANDPS   89 (127)
T ss_dssp             ------HHHHHHHHT--G--SHHH---HHHHHHHHHHHT-CC
T ss_pred             hcCCCCHHHHHHHcCCCCccCHHH---HHHHHHHHHhccCCC
Confidence            346899999999999998777654   777888888876664


No 24 
>KOG0511 consensus Ankyrin repeat protein [General function prediction only]
Probab=67.82  E-value=16  Score=32.27  Aligned_cols=108  Identities=10%  Similarity=0.036  Sum_probs=75.0

Q ss_pred             CCCCEEEecHHHHHHhHHHHHHHhcCCCC-CCCccc---ccCCCChHHHHHHHHHHHHhhhccCCC--------------
Q 030521           22 ADGHLFEVEEPVAMEFEIVKSFFDENDDA-SEDTVV---PLPNVSAAPLSYIIEFCRAHVEFRKKR--------------   83 (176)
Q Consensus        22 ~DG~~f~V~~~~a~~S~~l~~ll~~~~~~-~~~~~I---pl~~v~s~~L~~Iie~c~~h~~~~~~~--------------   83 (176)
                      .......++...+....++..|+.+.-.+ ..+..+   .||+..+.+.+.+++|++.|+.+-++.              
T Consensus       298 ~~~~RyP~hla~i~R~eyfk~mf~g~f~e~s~n~~~p~lslp~~~~~vveI~lr~lY~d~tdi~~~~A~dvll~ad~lal  377 (516)
T KOG0511|consen  298 PEEDRYPAHLARILRVEYFKSMFVGDFIESSVNDTRPGLSLPSLADVVVEIDLRNLYCDQTDIIFDVASDVLLFADKLAL  377 (516)
T ss_pred             cccccccHHHHHHHHHHHHHHHhccchhhhcCCccccccccchHHHHHHHHHHHHhhcccccchHHHHhhHHHHhhHhhh
Confidence            33344777777777888888888754222 112233   478889999999999999999876541              


Q ss_pred             --CC------ccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc
Q 030521           84 --SP------KQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIK  133 (176)
Q Consensus        84 --~~------~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~  133 (176)
                        ..      ...+..|. +|+   +.-.+++++.-+--+....|=..+..++|+.+.
T Consensus       378 ~~dr~Lkt~as~~itq~~-e~i---d~y~V~dIl~~~wd~~~~rlEqfa~~~~a~hl~  431 (516)
T KOG0511|consen  378 ADDRLLKTAASAEITQWL-ELI---DMYGVLDILEYCWDLVACRLEQFAETHEARHLL  431 (516)
T ss_pred             hhhhhhhhhhhHHHHHHH-HHH---HhhhHHHHHHHHHHhhhHHHHHHHHHHHHHHHH
Confidence              00      11345552 332   334578889999999999999999999988765


No 25 
>PF05409 Peptidase_C30:  Coronavirus  endopeptidase C30;  InterPro: IPR008740 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases correspond to MEROPS peptidase family C30 (clan PA(C)). These peptidases are related to serine endopeptidases of family S1 and are restricted to RNA viruses, where they are involved in viral polyprotein processing during replication [, , ].  This entry represents the CoV M-pro which comprises three domains. Domains I and II are six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain, a globular cluster of five helices, has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad. In contrast to serine proteinases and other cysteine proteinases, which have a catalytic triad, there is no third catalytic residue present [, , , ]. ; GO: 0019082 viral protein processing; PDB: 2VJ1_B 2K7X_A 2Q6D_B 2Q6F_A 1P9U_E 1LVO_B 2AMP_B 1P9S_A 2ZU2_A 3D23_A ....
Probab=63.24  E-value=23  Score=29.77  Aligned_cols=98  Identities=13%  Similarity=0.172  Sum_probs=57.0

Q ss_pred             cccccCCCChHHHHHHHHHHHHhhhccCC---C---CCccccchhhhhh-ccCCChHHHHHHHHhhhccchhhHHHHHHH
Q 030521           54 TVVPLPNVSAAPLSYIIEFCRAHVEFRKK---R---SPKQEVKTFNEEF-GKSKSNDELKEMILVANYLNIKDMLDYLTE  126 (176)
Q Consensus        54 ~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~---~---~~~~~~~~wd~eF-l~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~  126 (176)
                      +...++....-.=.-|+.|++-|--...+   .   -..+...+|-..- +..+..+..++.+.|..-..+..|+...-+
T Consensus       161 ~~~Q~~~~~~~~T~NVvA~LYAalln~~~w~l~~~~~svedFN~WA~~N~ft~~~~~~~~~~Laa~TgVsV~rlLaaI~~  240 (293)
T PF05409_consen  161 EVAQLEGADQYITDNVVAWLYAALLNGCPWWLSSQRISVEDFNEWAMNNGFTSFSSDDAFDMLAAKTGVSVERLLAAIQR  240 (293)
T ss_dssp             SSSTTTTS-SB-HHHHHHHHHHHHHTT--TTSSSS---HHHHHHHHHHTTB--SSHHHHHHHHHHHHT--HHHHHHHHHH
T ss_pred             hhhhhcCccceeehhHHHHHHHHHHcCCCeeeccceecHHHHHHHHHhCCCcccchhhHHHHHHHHhCCcHHHHHHHHHH
Confidence            34455555666677899999987655432   1   1223556664431 223377788999999999999999987665


Q ss_pred             HHHHHHcCCCHHHHHHHcCCCCCCCHHHH
Q 030521          127 TVANRIKNKSVEYVRKFFGIENDFTPEEE  155 (176)
Q Consensus       127 ~iA~~i~gks~eeir~~f~i~~d~t~EEe  155 (176)
                       +..-..|++   |.-...+++||||++-
T Consensus       241 -L~~g~~G~~---IlG~~~ledE~Tp~~V  265 (293)
T PF05409_consen  241 -LHKGFGGRT---ILGSCSLEDEFTPEMV  265 (293)
T ss_dssp             -HHHTTTTT----BTTBSS--TSS-HHHH
T ss_pred             -HhcCCCCCc---ccccccccccCCHHHH
Confidence             666556665   4444557788999764


No 26 
>KOG1665 consensus AFH1-interacting protein FIP2, contains BTB/POZ domain and pentapeptide repeats [General function prediction only]
Probab=60.02  E-value=25  Score=28.89  Aligned_cols=93  Identities=16%  Similarity=0.226  Sum_probs=61.2

Q ss_pred             CCCcEEEEeCCCCEEE--ecHHHHHH-hHHHHHHHhcCCCC---CCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCc
Q 030521           13 ATKKISLKTADGHLFE--VEEPVAME-FEIVKSFFDENDDA---SEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPK   86 (176)
Q Consensus        13 ~~~~i~L~s~DG~~f~--V~~~~a~~-S~~l~~ll~~~~~~---~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~   86 (176)
                      .+++++|- -.|+.|.  ++.-+.+. -..+..|+.+.+..   ++...+-+ .=+..-++-|+.|+.+-.-        
T Consensus         7 ~~~~vrln-igGk~f~TTidTlv~rEPDSMLa~MF~~~g~~~~~d~kGa~lI-DRsp~yFepIlNyLr~Gq~--------   76 (302)
T KOG1665|consen    7 LSSMVRLN-IGGKKFCTTIDTLVIREPDSMLAAMFSGRGAMCQEDKKGAVLI-DRSPKYFEPILNYLRDGQI--------   76 (302)
T ss_pred             hhhhheee-cCCeEEEEeehhhcccCchHHHHHHHccCCCccccccCceEEE-ccCchhhHHHHHHHhcCce--------
Confidence            45556654 3577776  44444444 56677788766532   12234433 3678889999999986331        


Q ss_pred             cccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHH
Q 030521           87 QEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTE  126 (176)
Q Consensus        87 ~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~  126 (176)
                                ... +...+++++.+|.|++|-+|.+..-.
T Consensus        77 ----------~~~-s~i~~lgvLeeArff~i~sL~~hle~  105 (302)
T KOG1665|consen   77 ----------PSL-SDIDCLGVLEEARFFQILSLKDHLED  105 (302)
T ss_pred             ----------eec-CCccHHHHHHHhhHHhhHhHHhHHhh
Confidence                      122 66778999999999999999987643


No 27 
>PF07928 Vps54:  Vps54-like protein;  InterPro: IPR012501 This family contains various proteins that are homologues of the yeast Vps54 protein, such as the rat homologue (Q9JMK8 from SWISSPROT), the human homologue (Q86YF7 from SWISSPROT), and the mouse homologue (Q8R3X1 from SWISSPROT). In yeast, Vps54 associates with Vps52 and Vps53 proteins to form a trimolecular complex that is involved in protein transport between Golgi, endosomal, and vacuolar compartments []. All Vps54 homologues contain a coiled coil region (not found in the region featured in this family) and multiple dileucine motifs []. ; GO: 0042147 retrograde transport, endosome to Golgi; PDB: 3N1E_B 3N1B_B.
Probab=57.52  E-value=3.4  Score=30.78  Aligned_cols=47  Identities=15%  Similarity=0.314  Sum_probs=0.0

Q ss_pred             CCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhh
Q 030521           23 DGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHV   77 (176)
Q Consensus        23 DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~   77 (176)
                      ||+.|.|+..++...+.|.+.+.-        ...+|.+..+++.++++|++...
T Consensus         1 d~e~f~vv~s~l~ll~~l~~Y~~~--------~~~~P~~a~di~~~l~elLk~fN   47 (135)
T PF07928_consen    1 DNEKFVVVGSALLLLKMLSDYLQL--------ASNFPSLAPDILSRLLELLKLFN   47 (135)
T ss_dssp             -------------------------------------------------------
T ss_pred             CCCceecHHHHHHHHHHHHHHHHH--------HHHCchhHHHHHHHHHHHHHHHH
Confidence            677777777777777766665443        12355677788888888776543


No 28 
>PRK05365 malonic semialdehyde reductase; Provisional
Probab=53.12  E-value=9.4  Score=29.58  Aligned_cols=35  Identities=17%  Similarity=0.163  Sum_probs=27.3

Q ss_pred             HHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521          107 EMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT  151 (176)
Q Consensus       107 ~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t  151 (176)
                      .|+.||..||+..          ..+.|-..+.+++.|||++++.
T Consensus       130 ~l~LaA~~~Glgs----------~~~~g~~~~~v~~~l~ip~~~~  164 (195)
T PRK05365        130 YLILAARALGLDA----------GPMSGFDAAAVDAEFFAGTTWK  164 (195)
T ss_pred             HHHHHHHHcCCcc----------CCccccCHHHHHHHhCCCCCee
Confidence            3888888888876          2445778899999999987653


No 29 
>KOG3442 consensus Uncharacterized conserved protein [Function unknown]
Probab=52.17  E-value=12  Score=27.55  Aligned_cols=36  Identities=19%  Similarity=0.352  Sum_probs=28.1

Q ss_pred             cCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccCCCCC
Q 030521          133 KNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFEGVDP  171 (176)
Q Consensus       133 ~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~~~~~  171 (176)
                      .|.|.+|-+++|||.++++.|   +|.+.|.-+|+.-|+
T Consensus        54 ~~iTlqEa~qILnV~~~ln~e---ei~k~yehLFevNdk   89 (132)
T KOG3442|consen   54 GKITLQEAQQILNVKEPLNRE---EIEKRYEHLFEVNDK   89 (132)
T ss_pred             ccccHHHHhhHhCCCCCCCHH---HHHHHHHHHHhccCc
Confidence            457899999999999877765   466788888886443


No 30 
>PRK13696 hypothetical protein; Provisional
Probab=51.56  E-value=42  Score=21.70  Aligned_cols=26  Identities=38%  Similarity=0.602  Sum_probs=20.3

Q ss_pred             cCCCHHHHHHHcCCCCCCCHHHHHHHHHh
Q 030521          133 KNKSVEYVRKFFGIENDFTPEEEEAARKE  161 (176)
Q Consensus       133 ~gks~eeir~~f~i~~d~t~EEe~~i~~e  161 (176)
                      +..+...+.++||+   +++||.++++++
T Consensus        36 ~~~~~~~l~~~~Gi---l~dee~~e~~~~   61 (62)
T PRK13696         36 KKGNLDKLMKAFGI---LSEEEAEELKKE   61 (62)
T ss_pred             hhccHHHHHHHHCC---CCHHHHHHHHhh
Confidence            34568899999999   568888888765


No 31 
>cd02148 Nitroreductase_5 Nitroreductase-like family 5.  A subfamily of the nitroreductase family containing uncharacterized proteins that are similar to nitroreductase. Nitroreductase catalyzes the reduction of nitroaromatic compounds such as nitrotoluenes, nitrofurans and nitroimidazoles. This process requires NAD(P)H as electron donor in an obligatory two-electron transfer and uses FMN as cofactor.  The enzyme is typically a homodimer. Members of this family are also called NADH dehydrogenase, oxygen-insensitive NAD(P)H nitrogenase or dihydropteridine reductase.
Probab=50.48  E-value=10  Score=29.02  Aligned_cols=33  Identities=18%  Similarity=0.219  Sum_probs=26.1

Q ss_pred             HHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCC
Q 030521          108 MILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDF  150 (176)
Q Consensus       108 Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~  150 (176)
                      |+.||.-||+..          .+|.|-..+++++.|||++++
T Consensus       124 l~LaA~~lGlgs----------~~~~g~d~~~~~~~L~lp~~~  156 (185)
T cd02148         124 FILAARALGLDA----------GPMSGFDAAAVDAEFFADTRW  156 (185)
T ss_pred             HHHHHHHcCCCc----------CCCcccCHHHHHHHhCCCCCe
Confidence            778888887765          345677889999999998776


No 32 
>KOG4591 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=49.17  E-value=33  Score=27.76  Aligned_cols=57  Identities=12%  Similarity=0.098  Sum_probs=33.6

Q ss_pred             cccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHH
Q 030521           56 VPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVAN  130 (176)
Q Consensus        56 Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~  130 (176)
                      .-+..++-.++-..++|.+...-+                 |+. +...|.+|+.-||.++++.|.+.|-+-+..
T Consensus       108 ~~~dDad~Ea~~t~iRWIYTDEid-----------------fk~-dD~~L~el~e~An~FqLe~Lke~C~k~l~a  164 (280)
T KOG4591|consen  108 LDLDDADFEAFHTAIRWIYTDEID-----------------FKE-DDEFLLELCELANRFQLELLKERCEKGLGA  164 (280)
T ss_pred             hcccccCHHHHHHhheeeeccccc-----------------ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            334566667777777776532211                 233 556666777777777777776666655533


No 33 
>PF11978 MVP_shoulder:  Shoulder domain;  InterPro: IPR021870  This domain is found in the Major Vault Protein and has been called the shoulder domain []. This family includes two bacterial proteins A6FXE2 from SWISSPROT and A1ZGE7 from SWISSPROT. This suggests that some bacteria may possess vault particles. ; PDB: 2ZUO_G 2QZV_B 2ZV5_c 2ZV4_Y.
Probab=46.02  E-value=18  Score=26.47  Aligned_cols=27  Identities=19%  Similarity=0.314  Sum_probs=24.6

Q ss_pred             HHhhhccchhhHHHHHHHHHHHHHcCC
Q 030521          109 ILVANYLNIKDMLDYLTETVANRIKNK  135 (176)
Q Consensus       109 l~AA~~L~I~~L~~l~~~~iA~~i~gk  135 (176)
                      -.|+..+.++..+-.+|+.||++|+|.
T Consensus        37 ~~~~k~F~VpDFVGd~Ck~iaSRIR~a   63 (118)
T PF11978_consen   37 EDAAKLFSVPDFVGDACKAIASRIRGA   63 (118)
T ss_dssp             HHHHHTTSSTTHHHHHHHHHHHHHHHH
T ss_pred             hHHHHhcCCcchHHHHHHHHHHHHHHH
Confidence            378999999999999999999999974


No 34 
>PF12556 CobS_N:  Cobaltochelatase CobS subunit N terminal 
Probab=44.59  E-value=13  Score=21.32  Aligned_cols=12  Identities=50%  Similarity=1.027  Sum_probs=10.4

Q ss_pred             HHHHcCCCCCCC
Q 030521          140 VRKFFGIENDFT  151 (176)
Q Consensus       140 ir~~f~i~~d~t  151 (176)
                      +|+.|||..|++
T Consensus        14 vre~FGiDsDm~   25 (36)
T PF12556_consen   14 VREVFGIDSDMK   25 (36)
T ss_pred             HHHhcCCCcCCe
Confidence            689999999975


No 35 
>PF10892 DUF2688:  Protein of unknown function (DUF2688);  InterPro: IPR024392 Members of this protein family are annotated as KleB, and may play a role in the regulation of transcription in plasmids.
Probab=44.47  E-value=27  Score=22.11  Aligned_cols=25  Identities=32%  Similarity=0.478  Sum_probs=18.7

Q ss_pred             HHHHHHHcC-CCCC-CCHHHHHHHHHh
Q 030521          137 VEYVRKFFG-IEND-FTPEEEEAARKE  161 (176)
Q Consensus       137 ~eeir~~f~-i~~d-~t~EEe~~i~~e  161 (176)
                      .+++|+.|| |=.+ +||||..+|.+-
T Consensus        30 ad~lr~klG~IC~~CitpEE~~~I~e~   56 (60)
T PF10892_consen   30 ADDLRVKLGGICGDCITPEEDREILEA   56 (60)
T ss_pred             hHHHHHHHcchhhccCCHHHHHHHHHH
Confidence            577887776 5555 899999998653


No 36 
>KOG3840 consensus Uncharaterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=39.76  E-value=46  Score=28.60  Aligned_cols=67  Identities=16%  Similarity=0.326  Sum_probs=41.4

Q ss_pred             ccCcCCCCcEEEEeCCCCEEEecHHHHHHhH--HHHHHHhcCCC---CCCCccccc-CCCChHHHHHHHHHHHH
Q 030521            8 SCSAQATKKISLKTADGHLFEVEEPVAMEFE--IVKSFFDENDD---ASEDTVVPL-PNVSAAPLSYIIEFCRA   75 (176)
Q Consensus         8 ~~~~~~~~~i~L~s~DG~~f~V~~~~a~~S~--~l~~ll~~~~~---~~~~~~Ipl-~~v~s~~L~~Iie~c~~   75 (176)
                      +|.+.+...+++ --||..|.|+..++.-++  .+-+|+..+..   .++.....+ .++++.+++.|++|...
T Consensus        89 G~~pg~~~~~t~-lvd~~rf~v~q~llt~~p~Tmlg~mf~~g~~f~~pNErgEyeVAdGi~s~vFRAILdYYks  161 (438)
T KOG3840|consen   89 GCSPGEGDKVCL-LVDQTRFLVSQRLLTSKPDTMLGRMFSMGADLVSPNERDEFEVADGMTSSCFRAILDYYQS  161 (438)
T ss_pred             CCCCCCCcceEE-EeeeEEEEeeeeeecCCcchhhhhhhcccccccCCCcCCceehhcchhHHHHHHHHHHHhc
Confidence            444444554444 468999999998875533  34445543211   112334544 47999999999998664


No 37 
>PF10075 PCI_Csn8:  COP9 signalosome, subunit CSN8;  InterPro: IPR019280 The photomorphogenic 9 (COP9) signalosome or CSN complex is composed of eight subunits: Cops1/GPS1, Cops2, Cops3, Cops4, Cops5, Cop6, Cops7 (Cops7A or Cops7B) and Cops8. In the complex, Cops8, which is the smallest subunit, probably interacts directly with Cops3, Cops4 and Cops7 (Cops7A or Cops7B). This signalosome is homologous to the lid subcomplex of the 26S proteasome and regulates the ubiquitin-proteasome pathway. It functions as a structural scaffold for subunit-subunit interactions within the complex and is a key regulator of photomorphogenic development [].; PDB: 1RZ4_A.
Probab=39.30  E-value=71  Score=23.38  Aligned_cols=48  Identities=19%  Similarity=0.369  Sum_probs=30.9

Q ss_pred             hhhHHHHHH----HHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccCC
Q 030521          117 IKDMLDYLT----ETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFEG  168 (176)
Q Consensus       117 I~~L~~l~~----~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~~  168 (176)
                      |.+|.+..-    ..|+.-.+.-+...+.+++|++    ++|..++-+...|-.++
T Consensus        76 v~~~~~~iR~~i~~~i~~aY~sIs~~~la~~Lg~~----~~el~~~~~~~gW~~d~  127 (143)
T PF10075_consen   76 VPGFEDTIRERIAHLISKAYSSISLSDLAEMLGLS----EEELEKFIKSRGWTVDG  127 (143)
T ss_dssp             STTHHHHHHHHHHHHHHHH-SEE-HHHHHHHTTS-----HHHHHHHHHHHT-EE--
T ss_pred             HHHHHHHHHHHHHHHHHHHHhHcCHHHHHHHhCCC----HHHHHHHHHHcCCEECC
Confidence            444555544    4445555667899999999985    77888888888999874


No 38 
>PRK14981 DNA-directed RNA polymerase subunit F; Provisional
Probab=38.90  E-value=1.2e+02  Score=21.72  Aligned_cols=45  Identities=18%  Similarity=0.284  Sum_probs=31.9

Q ss_pred             chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCC-CCCHHHHHHHHHh
Q 030521          116 NIKDMLDYLTETVANRIKNKSVEYVRKFFGIEN-DFTPEEEEAARKE  161 (176)
Q Consensus       116 ~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~-d~t~EEe~~i~~e  161 (176)
                      ++..|-+.....|++. .=.|++|+|.+|.-.. -+++|+.++|-..
T Consensus        61 ~~~~l~e~~a~~I~nL-~P~~~dElrai~~~~~~~~~~e~l~~ILd~  106 (112)
T PRK14981         61 ELEKMKEKTAVKIADI-LPETRDELRAIFAKERYTLSPEELDEILDI  106 (112)
T ss_pred             HccCCCHHHHHHHHhc-CCCCHHHHHHHHHHhccCCCHHHHHHHHHH
Confidence            4444566666666664 5688999999987653 3889998888653


No 39 
>PF15063 TC1:  Thyroid cancer protein 1
Probab=38.71  E-value=33  Score=23.10  Aligned_cols=31  Identities=23%  Similarity=0.353  Sum_probs=17.3

Q ss_pred             HHHHHHcCCCHHHHHHHcCCCCCCCHHHHHH
Q 030521          127 TVANRIKNKSVEYVRKFFGIENDFTPEEEEA  157 (176)
Q Consensus       127 ~iA~~i~gks~eeir~~f~i~~d~t~EEe~~  157 (176)
                      .+|+++++-+..+|+.+|.-..|--.||-.+
T Consensus        29 asaNIFe~vn~~qlqrLF~~sGD~kAEeRA~   59 (79)
T PF15063_consen   29 ASANIFENVNLDQLQRLFQKSGDKKAEERAR   59 (79)
T ss_pred             hhhhhhhccCHHHHHHHHHHccchhHHHHHH
Confidence            4455555666666666666555555554443


No 40 
>smart00385 CYCLIN domain present in cyclins, TFIIB and Retinoblastoma. A helical domain present in cyclins and TFIIB (twice) and Retinoblastoma (once). A protein recognition domain functioning in cell-cycle and transcription control.
Probab=38.23  E-value=97  Score=19.11  Aligned_cols=41  Identities=17%  Similarity=0.159  Sum_probs=26.6

Q ss_pred             hhhHHHHHHHHHHHHHcCC--CHHHHHHHcCCCCCCCHHHHHHHHH
Q 030521          117 IKDMLDYLTETVANRIKNK--SVEYVRKFFGIENDFTPEEEEAARK  160 (176)
Q Consensus       117 I~~L~~l~~~~iA~~i~gk--s~eeir~~f~i~~d~t~EEe~~i~~  160 (176)
                      -+.++-.+|-++|.++.+.  +..++..+.|.   +|.++..+..+
T Consensus        37 ~~~~ia~a~l~lA~k~~~~~~~~~~~~~~~~~---~~~~~i~~~~~   79 (83)
T smart00385       37 SPSLIAAAALYLAAKTEEIPPWTKELVHYTGY---FTEEEILRMEK   79 (83)
T ss_pred             CHHHHHHHHHHHHHHHhcCCCCchhHhHhhCC---CCHHHHHHHHH
Confidence            3455666777888888774  45677666665   67766655543


No 41 
>PF11460 DUF3007:  Protein of unknown function (DUF3007);  InterPro: IPR021562  This is a family of uncharacterised proteins found in bacteria and eukaryotes. 
Probab=37.18  E-value=34  Score=24.41  Aligned_cols=24  Identities=33%  Similarity=0.441  Sum_probs=17.0

Q ss_pred             CHHHHHHHcCCCCCCCHHHHHHHHHhc
Q 030521          136 SVEYVRKFFGIENDFTPEEEEAARKEY  162 (176)
Q Consensus       136 s~eeir~~f~i~~d~t~EEe~~i~~e~  162 (176)
                      +.++|.+.|.   .+||||.+++.+|-
T Consensus        79 ~~~~lqkRle---~l~~eE~~~L~~ei  102 (104)
T PF11460_consen   79 TNEELQKRLE---ELSPEELEALQAEI  102 (104)
T ss_pred             hHHHHHHHHH---hCCHHHHHHHHHHh
Confidence            3456777765   58899988887763


No 42 
>KOG4350 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=36.62  E-value=34  Score=30.62  Aligned_cols=33  Identities=15%  Similarity=0.221  Sum_probs=15.7

Q ss_pred             ChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc
Q 030521          101 SNDELKEMILVANYLNIKDMLDYLTETVANRIK  133 (176)
Q Consensus       101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~  133 (176)
                      +.+.+++.+.-|+..+.+.|...++.++-..++
T Consensus       114 ~ed~lld~LslAh~Ygf~~Le~aiSeYl~~iL~  146 (620)
T KOG4350|consen  114 EEDILLDYLSLAHRYGFIQLETAISEYLKEILK  146 (620)
T ss_pred             hHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHc
Confidence            344444555555555555554444444444433


No 43 
>PF13921 Myb_DNA-bind_6:  Myb-like DNA-binding domain; PDB: 1A5J_A 1MBH_A 1GV5_A 1H89_C 1IDY_A 1MBK_A 1IDZ_A 1H88_C 1GVD_A 1MBG_A ....
Probab=35.39  E-value=64  Score=19.69  Aligned_cols=34  Identities=18%  Similarity=0.290  Sum_probs=22.0

Q ss_pred             HHHHHHHHcCCCHHHHHHHcCC------CC-CCCHHHHHHH
Q 030521          125 TETVANRIKNKSVEYVRKFFGI------EN-DFTPEEEEAA  158 (176)
Q Consensus       125 ~~~iA~~i~gks~eeir~~f~i------~~-d~t~EEe~~i  158 (176)
                      =..||..+..+|+.+++..|.-      .. .+|+||.+++
T Consensus        20 W~~Ia~~l~~Rt~~~~~~r~~~~l~~~~~~~~wt~eEd~~L   60 (60)
T PF13921_consen   20 WKKIAEHLGNRTPKQCRNRWRNHLRPKISRGPWTKEEDQRL   60 (60)
T ss_dssp             HHHHHHHSTTS-HHHHHHHHHHTTSTTSTSSSSSHHHHHHH
T ss_pred             HHHHHHHHCcCCHHHHHHHHHHHCcccccCCCcCHHHHhcC
Confidence            3577888834999999876432      22 4788887654


No 44 
>PF11197 DUF2835:  Protein of unknown function (DUF2835);  InterPro: IPR021363  This is a bacterial family of uncharacterised proteins. One member of this family (A4VM42 from SWISSPROT) is annotated as the A subunit of Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV). 
Probab=35.21  E-value=46  Score=21.83  Aligned_cols=25  Identities=16%  Similarity=0.244  Sum_probs=21.2

Q ss_pred             cCCCCcEEEEeCCCCEEEecHHHHH
Q 030521           11 AQATKKISLKTADGHLFEVEEPVAM   35 (176)
Q Consensus        11 ~~~~~~i~L~s~DG~~f~V~~~~a~   35 (176)
                      -+..+.|.+++.||..+.+|....+
T Consensus        17 ~G~a~~V~v~s~~Gr~v~~Pa~~lR   41 (68)
T PF11197_consen   17 QGAASKVVVRSDDGRRVQFPARHLR   41 (68)
T ss_pred             cccccEEEEEecCCcEEEEeHHHCc
Confidence            3567889999999999999998643


No 45 
>PRK11053 dihydropteridine reductase; Provisional
Probab=35.11  E-value=30  Score=27.16  Aligned_cols=36  Identities=22%  Similarity=0.337  Sum_probs=25.2

Q ss_pred             HHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCC-CC
Q 030521          105 LKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIEN-DF  150 (176)
Q Consensus       105 l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~-d~  150 (176)
                      +..|+.||..||+.+-          .|.|-..+.+++.|||+. ++
T Consensus       147 ~~~lmLaA~~~Glgs~----------~i~g~~~~~v~~~l~ip~~~~  183 (217)
T PRK11053        147 LGNLLLGAAALGIDAT----------PIEGFDAAILDAEFGLREKGL  183 (217)
T ss_pred             HHHHHHHHHHcCCCCC----------CcCCcCHHHHHHHhCCCCcCc
Confidence            3466666666666543          445678899999999984 45


No 46 
>PF06851 DUF1247:  Protein of unknown function (DUF1247);  InterPro: IPR009657 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf2; it is a family of uncharacterised viral proteins.
Probab=33.27  E-value=1.5e+02  Score=22.43  Aligned_cols=72  Identities=17%  Similarity=0.249  Sum_probs=41.1

Q ss_pred             HHHHhhhccCCCCCccccchhhhhhccC-CChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcC-CCHHHHHHHcCCC
Q 030521           72 FCRAHVEFRKKRSPKQEVKTFNEEFGKS-KSNDELKEMILVANYLNIKDMLDYLTETVANRIKN-KSVEYVRKFFGIE  147 (176)
Q Consensus        72 ~c~~h~~~~~~~~~~~~~~~wd~eFl~~-~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~g-ks~eeir~~f~i~  147 (176)
                      ||.+|+.+--...+.....++ -+|++. +..-.+.+|.  =.||.++. ....++.+-..+.| .|..++-..+|-+
T Consensus        61 rC~FHkkyif~~~~~~~~deY-v~fLNSeMGiIS~VElY--Y~yLs~~~-wk~~a~~~l~dLTgF~s~~~LL~~Yny~  134 (148)
T PF06851_consen   61 RCDFHKKYIFTKDRKTHYDEY-VEFLNSEMGIISFVELY--YTYLSVDF-WKITAKRVLRDLTGFESLKDLLTHYNYE  134 (148)
T ss_pred             hhhHhHhhhccCCcccCHHHH-HHHHccccchhhhHHHH--HHHHcccc-cHHHHHHHHHHhhCcccHHHHHHHcCCC
Confidence            788887753322233334444 466664 1223333332  24555544 66677777777777 6788888887765


No 47 
>cd00068 GGL G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors
Probab=32.96  E-value=59  Score=20.32  Aligned_cols=20  Identities=10%  Similarity=0.363  Sum_probs=15.5

Q ss_pred             ChHHHHHHHHHHHHhhhccC
Q 030521           62 SAAPLSYIIEFCRAHVEFRK   81 (176)
Q Consensus        62 ~s~~L~~Iie~c~~h~~~~~   81 (176)
                      .|.....++.||+.|....+
T Consensus        22 vS~a~~~l~~y~e~~~~~Dp   41 (57)
T cd00068          22 VSKAAAELLKYCEQNAENDP   41 (57)
T ss_pred             HHHHHHHHHHHHHhcCCCCC
Confidence            36778889999999976543


No 48 
>TIGR02698 CopY_TcrY copper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
Probab=32.63  E-value=1.5e+02  Score=21.60  Aligned_cols=38  Identities=18%  Similarity=0.108  Sum_probs=23.6

Q ss_pred             HHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHh
Q 030521          124 LTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKE  161 (176)
Q Consensus       124 ~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e  161 (176)
                      ..+.+.+.+-|.+...+-..|--.+.+++||.+++++-
T Consensus        76 ~~~~~~~~~f~gs~~~ll~~l~~~~~ls~eele~L~~l  113 (130)
T TIGR02698        76 AAQELFSRICSRKVGAVIADLIEESPLSQTDIEKLEKL  113 (130)
T ss_pred             HHHHHHHHHHCCCHHHHHHHHHhcCCCCHHHHHHHHHH
Confidence            34455555555565555444433467999999999873


No 49 
>PF10415 FumaraseC_C:  Fumarase C C-terminus;  InterPro: IPR018951  Fumarase C catalyses the stereo-specific interconversion of fumarate to L-malate as part of the Krebs cycle. The full-length protein forms a tetramer with visible globular shape. FumaraseC_C is the C-terminal 65 residues referred to as domain 3. The core of the molecule consists of a bundle of 20 alpha-helices from the five-helix bundle of domain 2. The projections from the core of the tetramer are generated from domains 1 and 3 of each subunit []. This entry does not appear to be part of either the active site or the activation site but is helical in structure forming a little bundle. ; GO: 0016829 lyase activity, 0006099 tricarboxylic acid cycle; PDB: 3RRP_A 3OCE_D 3OCF_D 3E04_B 3GTD_A 3R6V_F 3R6Q_F 1J3U_B 1FUR_A 1YFE_A ....
Probab=32.26  E-value=1.1e+02  Score=18.93  Aligned_cols=23  Identities=30%  Similarity=0.268  Sum_probs=16.0

Q ss_pred             cCCCHHHHHHHcCCCCCCCHHHHHHH
Q 030521          133 KNKSVEYVRKFFGIENDFTPEEEEAA  158 (176)
Q Consensus       133 ~gks~eeir~~f~i~~d~t~EEe~~i  158 (176)
                      .|+|..|+-.-.|+   +|++|.+++
T Consensus        24 ~g~svre~v~~~g~---lt~ee~d~l   46 (55)
T PF10415_consen   24 EGRSVREVVLEEGL---LTEEELDEL   46 (55)
T ss_dssp             HT--HHHHHHHTTS---S-HHHHHHH
T ss_pred             cCCCHHHHHHHcCC---CCHHHHHHH
Confidence            69999999877775   788887775


No 50 
>cd00167 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Probab=31.86  E-value=51  Score=18.11  Aligned_cols=20  Identities=15%  Similarity=0.338  Sum_probs=16.0

Q ss_pred             HHHHHHHcCCCHHHHHHHcC
Q 030521          126 ETVANRIKNKSVEYVRKFFG  145 (176)
Q Consensus       126 ~~iA~~i~gks~eeir~~f~  145 (176)
                      ..||..+.++|..+++..+.
T Consensus        23 ~~Ia~~~~~rs~~~~~~~~~   42 (45)
T cd00167          23 EKIAKELPGRTPKQCRERWR   42 (45)
T ss_pred             HHHHhHcCCCCHHHHHHHHH
Confidence            56777888899999988763


No 51 
>PF11116 DUF2624:  Protein of unknown function (DUF2624);  InterPro: IPR020277 This entry contains proteins with no known function.
Probab=31.47  E-value=1.2e+02  Score=20.92  Aligned_cols=27  Identities=19%  Similarity=0.431  Sum_probs=13.6

Q ss_pred             HHHcCCCHHHHHHH---cCCCCCCCHHHHHHH
Q 030521          130 NRIKNKSVEYVRKF---FGIENDFTPEEEEAA  158 (176)
Q Consensus       130 ~~i~gks~eeir~~---f~i~~d~t~EEe~~i  158 (176)
                      .+++..|++|+.++   |||+  +|+++.+.|
T Consensus        10 ~Kln~iT~~eLlkyskqy~i~--it~~QA~~I   39 (85)
T PF11116_consen   10 QKLNNITAKELLKYSKQYNIS--ITKKQAEQI   39 (85)
T ss_pred             HHHhcCCHHHHHHHHHHhCCC--CCHHHHHHH
Confidence            34455555555554   5553  555544444


No 52 
>PF08671 SinI:  Anti-repressor SinI;  InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=31.38  E-value=38  Score=18.58  Aligned_cols=13  Identities=31%  Similarity=0.437  Sum_probs=9.3

Q ss_pred             cCCCHHHHHHHcC
Q 030521          133 KNKSVEYVRKFFG  145 (176)
Q Consensus       133 ~gks~eeir~~f~  145 (176)
                      .|-|.+|+|+|+.
T Consensus        15 ~Gls~eeir~FL~   27 (30)
T PF08671_consen   15 SGLSKEEIREFLE   27 (30)
T ss_dssp             TT--HHHHHHHHH
T ss_pred             cCCCHHHHHHHHH
Confidence            6899999999874


No 53 
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=30.52  E-value=1.1e+02  Score=20.58  Aligned_cols=45  Identities=13%  Similarity=0.137  Sum_probs=27.9

Q ss_pred             hhhHHHHHHHHHHHHH-cC-CCHHHHHHHcC----CCCCCCHHHHHHHHHh
Q 030521          117 IKDMLDYLTETVANRI-KN-KSVEYVRKFFG----IENDFTPEEEEAARKE  161 (176)
Q Consensus       117 I~~L~~l~~~~iA~~i-~g-ks~eeir~~f~----i~~d~t~EEe~~i~~e  161 (176)
                      |..|+.+..+|-++-- .| -+.+|+|+++.    +.+.+|++|..++.++
T Consensus         9 ~~~~i~~F~~y~~~~~~~g~Is~~EL~~~l~~~~~lg~k~t~~ev~~m~~~   59 (88)
T cd05029           9 IGLLVAIFHKYSGREGDKNTLSKKELKELIQKELTIGSKLQDAEIAKLMED   59 (88)
T ss_pred             HHHHHHHHHHHHccCCCCCEECHHHHHHHHHHHHhcCCCCCHHHHHHHHHH
Confidence            4455666666665321 23 37888888883    4455788887776554


No 54 
>PF02519 Auxin_inducible:  Auxin responsive protein;  InterPro: IPR003676 This family consists of the protein products of a gene cluster that encodes a group of auxin-regulated RNAs (small auxin up RNAs, SAURs) []. Proteins from this ARG7 auxin responsive genes family have no identified functional role [].
Probab=30.32  E-value=1.9e+02  Score=20.20  Aligned_cols=62  Identities=8%  Similarity=0.198  Sum_probs=41.0

Q ss_pred             CcCCCCcEEEEeCC-CCEEEecHHHHHHhHHHHHHHhcCCC---CCCCcccccCCCChHHHHHHHHHH
Q 030521           10 SAQATKKISLKTAD-GHLFEVEEPVAMEFEIVKSFFDENDD---ASEDTVVPLPNVSAAPLSYIIEFC   73 (176)
Q Consensus        10 ~~~~~~~i~L~s~D-G~~f~V~~~~a~~S~~l~~ll~~~~~---~~~~~~Ipl~~v~s~~L~~Iie~c   73 (176)
                      .+.+...+.+...+ .+.|.|+...+. .++++.+|+....   -.++.+|.|| -+...++.|+..+
T Consensus        34 ~~vp~G~~~VyVG~~~~Rfvvp~~~L~-hp~f~~LL~~aeeEfG~~~~G~l~iP-C~~~~Fe~~l~~l   99 (100)
T PF02519_consen   34 SDVPKGHFAVYVGEERRRFVVPVSYLN-HPLFQELLEQAEEEFGFDQDGPLTIP-CDVVLFEHLLWLL   99 (100)
T ss_pred             CCCCCCeEEEEeCccceEEEechHHcC-chhHHHHHHHHhhhcCcCCCCcEEee-CCHHHHHHHHHHh
Confidence            33445555555555 999999999754 4667777764321   1134578887 7888888887654


No 55 
>smart00717 SANT SANT  SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
Probab=30.14  E-value=57  Score=18.18  Aligned_cols=20  Identities=20%  Similarity=0.433  Sum_probs=16.0

Q ss_pred             HHHHHHHcCCCHHHHHHHcC
Q 030521          126 ETVANRIKNKSVEYVRKFFG  145 (176)
Q Consensus       126 ~~iA~~i~gks~eeir~~f~  145 (176)
                      ..||..+.++|+.+++..+.
T Consensus        25 ~~Ia~~~~~rt~~~~~~~~~   44 (49)
T smart00717       25 EKIAKELPGRTAEQCRERWN   44 (49)
T ss_pred             HHHHHHcCCCCHHHHHHHHH
Confidence            56777788999999988764


No 56 
>TIGR02476 BluB cob(II)yrinic acid a,c-diamide reductase. The BluB protein is related to the nitroreductase family (pfam0881), and is found to substitute for the FldA cobalt reductase in aerobic species (by phylogenetic profiling). BluB is also found to be preceded by B12 promoter elements elements. The gene in R. capsulatus has been shown to be essential for cobalamin biosynthesis.
Probab=30.05  E-value=42  Score=26.11  Aligned_cols=38  Identities=13%  Similarity=0.247  Sum_probs=27.3

Q ss_pred             HHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521          104 ELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT  151 (176)
Q Consensus       104 ~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t  151 (176)
                      .+..|+.||..||+-.-          ++.+-..+.+++.|||+.+++
T Consensus       133 A~~nl~LaA~~~GlGs~----------~~~~~~~~~v~~~L~lp~~~~  170 (205)
T TIGR02476       133 AIQNLWLAARAEGLGVG----------WVSILDPDAVRRLLGVPEGWR  170 (205)
T ss_pred             HHHHHHHHHHHCCCcce----------eecccChHHHHHHhCcCCCce
Confidence            44578888888888752          223445789999999998754


No 57 
>PF13798 PCYCGC:  Protein of unknown function with PCYCGC motif
Probab=27.63  E-value=89  Score=24.02  Aligned_cols=23  Identities=22%  Similarity=0.313  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHHcCCCHHHHHHHc
Q 030521          122 DYLTETVANRIKNKSVEYVRKFF  144 (176)
Q Consensus       122 ~l~~~~iA~~i~gks~eeir~~f  144 (176)
                      +.+...+...=+|||+.|||++.
T Consensus       119 ~ia~~a~~~~~~Gks~~eIR~~I  141 (158)
T PF13798_consen  119 DIAVQAVQMYQEGKSPKEIRQYI  141 (158)
T ss_pred             HHHHHHHHHHHcCCCHHHHHHHH
Confidence            44455555566899999999863


No 58 
>PRK09296 cysteine desufuration protein SufE; Provisional
Probab=27.62  E-value=1e+02  Score=23.01  Aligned_cols=28  Identities=14%  Similarity=0.258  Sum_probs=19.6

Q ss_pred             HHHHHHHHcCCCHHHHHHH--------cCCCCCCCH
Q 030521          125 TETVANRIKNKSVEYVRKF--------FGIENDFTP  152 (176)
Q Consensus       125 ~~~iA~~i~gks~eeir~~--------f~i~~d~t~  152 (176)
                      +..+...+.|+|++||.++        +|+..-+||
T Consensus        82 ~alL~~~~~g~tp~eIl~~d~~~~~~~lGL~~~LSp  117 (138)
T PRK09296         82 IAVVFILYQQMTPQDIVNFDVRPWFEKLALTQHLTP  117 (138)
T ss_pred             HHHHHHHHcCCCHHHHHhCChHHHHHHcCcccccCc
Confidence            4445556689999998752        577766776


No 59 
>PF11198 DUF2857:  Protein of unknown function (DUF2857);  InterPro: IPR021364  This is a bacterial family of uncharacterised proteins. 
Probab=26.94  E-value=2.4e+02  Score=21.82  Aligned_cols=63  Identities=25%  Similarity=0.303  Sum_probs=35.7

Q ss_pred             ChHHHHHHHHhhh-----ccchhhHHHH-HHHHHHHHHcCCCHHHH---HHHcCCCCC------CCHHHHHHHHHhcccc
Q 030521          101 SNDELKEMILVAN-----YLNIKDMLDY-LTETVANRIKNKSVEYV---RKFFGIEND------FTPEEEEAARKEYEWA  165 (176)
Q Consensus       101 ~~~~l~~Ll~AA~-----~L~I~~L~~l-~~~~iA~~i~gks~eei---r~~f~i~~d------~t~EEe~~i~~e~~w~  165 (176)
                      |.+.|-.++.-|.     .--|+.++.+ ++..+-..+-|.|+.|+   |.++||+..      .|+|++.+|  =+.|-
T Consensus        59 n~~~l~~~L~~~~~~~~~~~~idr~L~lGAS~~mm~~~FGls~~ev~~rR~llgi~~~~GR~~~~~ee~~~~i--W~~W~  136 (180)
T PF11198_consen   59 NHDVLWRLLEQARREQQEQQLIDRALRLGASIEMMQRLFGLSSAEVAARRRLLGIPVRKGRPPALSEEEEAAI--WRRWQ  136 (180)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHCCCHHHHHHHHHHhCCCCCCCCCCCcCHHHHHHH--HHHHH
Confidence            4444444444443     2334444444 34555666678887776   777888864      355555555  45565


No 60 
>cd02144 iodotyrosine_dehalogenase Iodotyrosine dehalogenase catalyzes the removal of iodine from the 3, 5 positions of L-tyosine in thyroid, liver and kidney,  using NADPH as electron donor. This enzyme is a homolog of the nitroreductase family. These enzymes are usually homodimers.
Probab=26.91  E-value=53  Score=24.81  Aligned_cols=38  Identities=5%  Similarity=0.141  Sum_probs=26.3

Q ss_pred             HHHHHHhhhccchhhHHHHHHHHHHHHHcCCCH-HHHHHHcCCCCCCCH
Q 030521          105 LKEMILVANYLNIKDMLDYLTETVANRIKNKSV-EYVRKFFGIENDFTP  152 (176)
Q Consensus       105 l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~-eeir~~f~i~~d~t~  152 (176)
                      +..|+.||.-||+...          .+.+... +.+++.|||++++.+
T Consensus       129 ~~~l~LaA~~~Glgs~----------~~~~~~~~~~v~~~l~ip~~~~~  167 (193)
T cd02144         129 CGLLLAALQNAGLGTV----------TTTPLNMGPFLRRLLGRPANEKL  167 (193)
T ss_pred             HHHHHHHHHHCCCcee----------ecCCcchhHHHHHHhCCCCCCce
Confidence            3457888888888762          2234344 899999999986543


No 61 
>PF00382 TFIIB:  Transcription factor TFIIB repeat;  InterPro: IPR013150 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. In eukaryotes, transcription initiation of all protein encoding genes involves the polymerase II system. This sytem is modulated by both general and specific transcription factors. The general factors (which include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIG and TFIIH) operate through common promoter elements, such as the TATA box. Transcription factor IIB (TFIIB) is of central importance in transcription of class II genes. It associates with TFIID-TFIIA bound to DNA (the DA complex) to form a ternary TFIID-IIA-IBB (DAB) complex, which is recognised by RNA polymerase II [, ]. TFIIB comprises ~315-340 residues and contains an imperfect C-terminal repeat of a 75-residue domain that may contribute to the symmetry of the folded protein. The basal archaeal transcription machinery resembles that of the eukaryotic polymerase II system and includes a homologue of TFIIB []. This entry represents a cyclin-like domain which is found repeated in the C-terminal region of a variety of eukaryotic TFIIB's and their archaeal counterparts. These domains individually form the typical cyclin fold, and in the transcription complex they straddle the C-terminal region of the TATA-binding protein - an interaction essential for the formation of the transcription initiation complex [, ].; GO: 0003743 translation initiation factor activity, 0006413 translational initiation; PDB: 2PHG_A 1C9B_Q 1TFB_A 1VOL_A 3K7A_M 1AIS_B 1D3U_B.
Probab=26.71  E-value=1.6e+02  Score=18.61  Aligned_cols=30  Identities=10%  Similarity=0.216  Sum_probs=23.4

Q ss_pred             hhHHHHHHHHHHHHHcC--CCHHHHHHHcCCC
Q 030521          118 KDMLDYLTETVANRIKN--KSVEYVRKFFGIE  147 (176)
Q Consensus       118 ~~L~~l~~~~iA~~i~g--ks~eeir~~f~i~  147 (176)
                      +.-+-.+|-|+|.+..|  +|..||.+.+||.
T Consensus        36 ~~~iaAA~iY~acr~~~~~~t~~eIa~~~~Vs   67 (71)
T PF00382_consen   36 PESIAAACIYLACRLNGVPRTLKEIAEAAGVS   67 (71)
T ss_dssp             HHHHHHHHHHHHHHHTTSSSSHHHHHHHCTSS
T ss_pred             HHHHHHHHHHHHHHHcCCCcCHHHHHHHhCCC
Confidence            34455577888888877  6899999999985


No 62 
>TIGR01999 iscU FeS cluster assembly scaffold IscU. This model represents IscU, a homolog of the N-terminal region of NifU, an Fe-S cluster assembly protein found mostly in nitrogen-fixing bacteria. IscU is considered part of the IscSUA-hscAB-fdx system of Fe-S assembly, whereas NifU is found in nitrogenase-containing (nitrogen-fixing) species. A NifU-type protein is also found in Helicobacter and Campylobacter. IscU and NifU are considered scaffold proteins on which Fe-S clusters are assembled before transfer to apoproteins. This model excludes true NifU proteins as in Klebsiella pneumoniae and Anabaena sp. as well as archaeal homologs. It includes largely proteobacterial and eukaryotic forms.
Probab=25.81  E-value=79  Score=22.82  Aligned_cols=38  Identities=21%  Similarity=0.100  Sum_probs=24.9

Q ss_pred             HHHhhhccchhhHHHHHH-HHHHHHHcCCCHHHHHHHcC
Q 030521          108 MILVANYLNIKDMLDYLT-ETVANRIKNKSVEYVRKFFG  145 (176)
Q Consensus       108 Ll~AA~~L~I~~L~~l~~-~~iA~~i~gks~eeir~~f~  145 (176)
                      .+.-+.|-.-=.-.-.+| ..++++++|||.+|.+.+-.
T Consensus        50 ~I~d~~f~~~GC~~s~Asas~~~e~i~Gktl~ea~~i~~   88 (124)
T TIGR01999        50 IIEDAKFKTFGCGSAIASSSLATELIKGKSLEEALKIKN   88 (124)
T ss_pred             eEEEEEEEecCcHHHHHHHHHHHHHHcCCCHHHHHhccH
Confidence            444455555544555544 56678999999999886643


No 63 
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=25.75  E-value=71  Score=23.60  Aligned_cols=19  Identities=21%  Similarity=0.324  Sum_probs=14.0

Q ss_pred             HHHHHHH-cCCCHHHHHHHc
Q 030521          126 ETVANRI-KNKSVEYVRKFF  144 (176)
Q Consensus       126 ~~iA~~i-~gks~eeir~~f  144 (176)
                      +.|..++ .|+|.+||+.+|
T Consensus        64 ~~Vr~~i~~G~Sd~eI~~~~   83 (126)
T TIGR03147        64 HEVYSMVNEGKSNQQIIDFM   83 (126)
T ss_pred             HHHHHHHHcCCCHHHHHHHH
Confidence            3455544 699999999885


No 64 
>COG1926 Predicted phosphoribosyltransferases [General function prediction only]
Probab=25.62  E-value=1.2e+02  Score=24.65  Aligned_cols=63  Identities=22%  Similarity=0.287  Sum_probs=42.9

Q ss_pred             CcccccC-C-CChHHHHHHHHHHHHhhhccCC----C----------------------CCccccchhhhhhccCCChHH
Q 030521           53 DTVVPLP-N-VSAAPLSYIIEFCRAHVEFRKK----R----------------------SPKQEVKTFNEEFGKSKSNDE  104 (176)
Q Consensus        53 ~~~Ipl~-~-v~s~~L~~Iie~c~~h~~~~~~----~----------------------~~~~~~~~wd~eFl~~~~~~~  104 (176)
                      ...|-+. + -++.++..-+++|+++......    .                      .+...+..|..+|-+. +.++
T Consensus       125 ~~VIlVDDGiATGatm~aAi~~~r~~~~~~IviAVPV~p~~a~~~l~s~~D~vvc~~~P~~F~AVg~~Y~dF~q~-sdeE  203 (220)
T COG1926         125 RTVILVDDGIATGATMKAAVRALRAKGPKEIVIAVPVAPEDAAAELESEADEVVCLYMPAPFEAVGEFYRDFRQV-SDEE  203 (220)
T ss_pred             CEEEEEeCCcchhHHHHHHHHHHHhcCCceEEEEcccCCHHHHHHHHhhcCeEEEEcCCccHHHHHHHHHHHhhc-CHHH
Confidence            3455554 3 4888899999999887765321    0                      1123467788888777 8888


Q ss_pred             HHHHHHhhhccc
Q 030521          105 LKEMILVANYLN  116 (176)
Q Consensus       105 l~~Ll~AA~~L~  116 (176)
                      +..||..|+--+
T Consensus       204 V~~lL~~a~~~~  215 (220)
T COG1926         204 VRALLRRANAPN  215 (220)
T ss_pred             HHHHHHhccCCC
Confidence            999988887543


No 65 
>TIGR03419 NifU_clost FeS cluster assembly scaffold protein NifU, Clostridium type. NifU and NifS form a pair of iron-sulfur (FeS) cluster biosynthesis proteins much simpler than the ISC and SUF systems. Members of this protein family are a distinct group of NifU-like proteins, found always to a NifS-like protein and restricted to species that lack a SUF system. Typically, NIF systems service a smaller number of FeS-containing proteins than do ISC or SUF. Members of this particular branch typically are found, almost half the time, near the mnmA gene, involved in the carboxymethylaminomethyl modification of U34 in some tRNAs (see GenProp0704). While other NifU proteins are associated with nitrogen fixation, this family is not.
Probab=25.60  E-value=88  Score=22.51  Aligned_cols=37  Identities=14%  Similarity=0.103  Sum_probs=24.7

Q ss_pred             HHHhhhccchhhHHHHHH-HHHHHHHcCCCHHHHHHHc
Q 030521          108 MILVANYLNIKDMLDYLT-ETVANRIKNKSVEYVRKFF  144 (176)
Q Consensus       108 Ll~AA~~L~I~~L~~l~~-~~iA~~i~gks~eeir~~f  144 (176)
                      .+.-+.|-.-=..+-.+| ..++.++.|||.+|..++.
T Consensus        46 ~I~d~~f~~~GC~is~Asas~~~e~i~Gk~l~ea~~i~   83 (121)
T TIGR03419        46 IIKDVKFKTFGCGAAIASSSMATEMIKGKTLEEAWELT   83 (121)
T ss_pred             EEEEEEEEEeccHHHHHHHHHHHHHHcCCCHHHHHHhh
Confidence            444555555555555544 4567799999999987664


No 66 
>PRK11325 scaffold protein; Provisional
Probab=25.54  E-value=79  Score=23.00  Aligned_cols=38  Identities=13%  Similarity=0.065  Sum_probs=25.0

Q ss_pred             HHHhhhccchhhHHHHHH-HHHHHHHcCCCHHHHHHHcC
Q 030521          108 MILVANYLNIKDMLDYLT-ETVANRIKNKSVEYVRKFFG  145 (176)
Q Consensus       108 Ll~AA~~L~I~~L~~l~~-~~iA~~i~gks~eeir~~f~  145 (176)
                      .+.-+.|-.-=.-.-.+| ..++.+++|||++|.+.+-+
T Consensus        52 ~I~d~~f~~~GC~is~Asas~~~e~~~Gktl~ea~~i~~   90 (127)
T PRK11325         52 IIEDAKFKTYGCGSAIASSSLVTEWVKGKTLDEALAIKN   90 (127)
T ss_pred             eEEEEEEEeeCCHHHHHHHHHHHHHHcCCCHHHHHhcCH
Confidence            444555555555555544 45577999999999886633


No 67 
>PF03750 DUF310:  Protein of unknown function (DUF310);  InterPro: IPR010149 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny.  This entry represents the C-terminal domain of a minor family of CRISPR-associated proteins. These proteins are found adjacent to a characteristic short, palindromic repeat cluster termed CRISPR, a probable mobile DNA element.
Probab=25.30  E-value=1e+02  Score=22.14  Aligned_cols=34  Identities=26%  Similarity=0.490  Sum_probs=21.4

Q ss_pred             cchhhHHHHHHHHHHHHHc--CCCHHHHHHHcCCCCC
Q 030521          115 LNIKDMLDYLTETVANRIK--NKSVEYVRKFFGIEND  149 (176)
Q Consensus       115 L~I~~L~~l~~~~iA~~i~--gks~eeir~~f~i~~d  149 (176)
                      ++++.|.+.+.+. |..++  +.|+.+||++|+--..
T Consensus         8 ~~~~~~~~~Ae~~-~k~l~~~~lttsQlRkf~~~v~~   43 (119)
T PF03750_consen    8 LDIELLVDYAEKI-AKELKKNKLTTSQLRKFYDEVKR   43 (119)
T ss_pred             cCHHHHHHHHHHH-HHHHHhCCCCHHHHHHHHHHHHH
Confidence            4455555555443 34444  4799999999985444


No 68 
>PHA03098 kelch-like protein; Provisional
Probab=25.26  E-value=2.4e+02  Score=25.10  Aligned_cols=98  Identities=8%  Similarity=0.071  Sum_probs=62.0

Q ss_pred             ChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhcc-CCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc-------
Q 030521           62 SAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGK-SKSNDELKEMILVANYLNIKDMLDYLTETVANRIK-------  133 (176)
Q Consensus        62 ~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~-~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~-------  133 (176)
                      +...+..++.=+.+..-.        .+...-.+|+. .++.+..++++..|...+.+.|.+.|-++|+..+.       
T Consensus        74 ~~~~~~~ll~~A~~l~~~--------~l~~~C~~~l~~~l~~~nc~~~~~~a~~~~~~~L~~~~~~~i~~nf~~v~~~~~  145 (534)
T PHA03098         74 TSNNVKDILSIANYLIID--------FLINLCINYIIKIIDDNNCIDIYRFSFFYGCKKLYSAAYNYIRNNIELIYNDPD  145 (534)
T ss_pred             cHHHHHHHHHHHHHhCcH--------HHHHHHHHHHHHhCCHhHHHHHHHHHHHcCcHHHHHHHHHHHHHHHHHHhcCch
Confidence            344455555544443321        23333445544 35888999999999999999999999999987553       


Q ss_pred             --CCCHHHHHHHcCCCCCCCH-HHHHHHHHhcccccCC
Q 030521          134 --NKSVEYVRKFFGIENDFTP-EEEEAARKEYEWAFEG  168 (176)
Q Consensus       134 --gks~eeir~~f~i~~d~t~-EEe~~i~~e~~w~~~~  168 (176)
                        ..+.+++.+++.= +++.- .|+.-.+---.|+..+
T Consensus       146 f~~l~~~~l~~ll~~-~~L~v~~E~~v~~av~~W~~~~  182 (534)
T PHA03098        146 FIYLSKNELIKILSD-DKLNVSSEDVVLEIIIKWLTSK  182 (534)
T ss_pred             hhcCCHHHHHHHhcC-CCcCcCCHHHHHHHHHHHHhcC
Confidence              2467888888663 23542 3333344456777654


No 69 
>PRK15019 CsdA-binding activator; Provisional
Probab=24.62  E-value=1.1e+02  Score=23.02  Aligned_cols=37  Identities=8%  Similarity=0.255  Sum_probs=23.8

Q ss_pred             hhccchhhHHHHHHHHHHHHHcCCCHHHHHHH--------cCCCCCCCH
Q 030521          112 ANYLNIKDMLDYLTETVANRIKNKSVEYVRKF--------FGIENDFTP  152 (176)
Q Consensus       112 A~~L~I~~L~~l~~~~iA~~i~gks~eeir~~--------f~i~~d~t~  152 (176)
                      ++-.=+++|+-+    +...+.|+|++||.++        +|+..-+||
T Consensus        83 SDA~IvkGl~al----L~~~~~g~tp~eIl~~d~~~~~~~lGL~~~LSp  127 (147)
T PRK15019         83 SEGRIVRGLLAV----LLTAVEGKTAAELQAQSPLALFDELGLRAQLSA  127 (147)
T ss_pred             CccHHHHHHHHH----HHHHHcCCCHHHHHhcCHHHHHHHCCchhhcCc
Confidence            444445555544    4455589999999762        577766776


No 70 
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=24.49  E-value=67  Score=24.16  Aligned_cols=18  Identities=17%  Similarity=0.361  Sum_probs=16.8

Q ss_pred             CCCHHHHHHHcCCCCCCC
Q 030521          134 NKSVEYVRKFFGIENDFT  151 (176)
Q Consensus       134 gks~eeir~~f~i~~d~t  151 (176)
                      |+||+|-|.-+|++.|+.
T Consensus        98 gmTPd~YR~KW~LP~dYP  115 (148)
T COG4957          98 GLTPDEYRAKWGLPPDYP  115 (148)
T ss_pred             CCCHHHHHHhcCCCCCCC
Confidence            899999999999999974


No 71 
>TIGR03391 FeS_syn_CsdE cysteine desulfurase, sulfur acceptor subunit CsdE. Members of this protein family are CsdE, formerly called YgdK. This protein, found as a paralog to SufE in Escherichia coli, Yersinia pestis, Photorhabdus luminescens, and related species, works together and physically interacts with CsdA (a paralog of SufS). CsdA has cysteine desulfurase activity that is enhanced by this protein (CsdE), in which Cys-61 (numbered as in E. coli) is a sulfur acceptor site. This gene pair, although involved in FeS cluster biosynthesis, is not found next to other such genes as are its paralogs from the Suf or Isc systems.
Probab=24.14  E-value=1.2e+02  Score=22.58  Aligned_cols=36  Identities=11%  Similarity=0.253  Sum_probs=22.4

Q ss_pred             hccchhhHHHHHHHHHHHHHcCCCHHHHHHH--------cCCCCCCCH
Q 030521          113 NYLNIKDMLDYLTETVANRIKNKSVEYVRKF--------FGIENDFTP  152 (176)
Q Consensus       113 ~~L~I~~L~~l~~~~iA~~i~gks~eeir~~--------f~i~~d~t~  152 (176)
                      +-+=+++|+-    .+...+.|+|++||.++        +|+..-+||
T Consensus        79 Da~IvkGl~a----lL~~~~~g~tp~eI~~~d~~~~~~~lGL~~~LSp  122 (138)
T TIGR03391        79 EGRIVRGLLA----VLLTAVEGKTPEQLLAQDPLALFDELGLRAQLSA  122 (138)
T ss_pred             ccHHHHHHHH----HHHHHHcCCCHHHHHHCCHHHHHHHcCchhccCc
Confidence            3333445544    44455589999999744        566666665


No 72 
>PF00249 Myb_DNA-binding:  Myb-like DNA-binding domain;  InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins. These belong to the SANT domain family that specifically recognise the sequence YAAC(G/T)G [, ]. In myb, one of the most conserved regions consisting of three tandem repeats has been shown to be involved in DNA-binding [].; PDB: 1X41_A 2XAF_B 2XAG_B 2XAH_B 2UXN_B 2Y48_B 2XAQ_B 2X0L_B 2IW5_B 2XAJ_B ....
Probab=24.12  E-value=1.6e+02  Score=17.12  Aligned_cols=39  Identities=10%  Similarity=0.226  Sum_probs=25.8

Q ss_pred             ChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc-CCCHHHHHHHc
Q 030521          101 SNDELKEMILVANYLNIKDMLDYLTETVANRIK-NKSVEYVRKFF  144 (176)
Q Consensus       101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~-gks~eeir~~f  144 (176)
                      +.++---|+.|..-++-.     -=..||..+. |+|+.++|..|
T Consensus         5 t~eE~~~l~~~v~~~g~~-----~W~~Ia~~~~~~Rt~~qc~~~~   44 (48)
T PF00249_consen    5 TEEEDEKLLEAVKKYGKD-----NWKKIAKRMPGGRTAKQCRSRY   44 (48)
T ss_dssp             -HHHHHHHHHHHHHSTTT-----HHHHHHHHHSSSSTHHHHHHHH
T ss_pred             CHHHHHHHHHHHHHhCCc-----HHHHHHHHcCCCCCHHHHHHHH
Confidence            334444555555555555     3467888888 99999999875


No 73 
>cd02140 Nitroreductase_4 Nitroreductase-like family 4.  A subfamily of the nitroreductase family containing uncharacterized proteins that are similar to nitroreductase. Nitroreductase catalyzes the reduction of nitroaromatic compounds such as nitrotoluenes, nitrofurans and nitroimidazoles. This process requires NAD(P)H as electron donor in an obligatory two-electron transfer and uses FMN as cofactor.  The enzyme is typically a homodimer. Members of this family are also called NADH dehydrogenase, oxygen-insensitive NAD(P)H nitrogenase or dihydropteridine reductase.
Probab=23.86  E-value=62  Score=24.80  Aligned_cols=39  Identities=15%  Similarity=0.015  Sum_probs=26.4

Q ss_pred             HHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCC
Q 030521          104 ELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDF  150 (176)
Q Consensus       104 ~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~  150 (176)
                      .+..|+.||.-+|+-.-+..      .  .....+.+|++|||++++
T Consensus       126 a~~nl~LaA~~~GlGs~~~~------~--~~~~~~~v~~~l~ip~~~  164 (192)
T cd02140         126 AQIAVWTALAAEGIGANLQH------Y--NPLIDEEVAKKWNIPSNW  164 (192)
T ss_pred             HHHHHHHHHHhCCCcccHHH------h--CccCCHHHHHhcCCCccc
Confidence            34688889999988763211      0  012358999999999775


No 74 
>PF01592 NifU_N:  NifU-like N terminal domain;  InterPro: IPR002871 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] []. FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins [, ]. It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly []. The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA []. SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA [], acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins []. Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen []. This entry represents the N-terminal of NifU and homologous proteins. NifU contains two domains: an N-terminal and a C-terminal domain (IPR001075 from INTERPRO) []. These domains exist either together or on different polypeptides, both domains being found in organisms that do not fix nitrogen (e.g. yeast), so they have a broader significance in the cell than nitrogen fixation. ; GO: 0005506 iron ion binding, 0051536 iron-sulfur cluster binding, 0016226 iron-sulfur cluster assembly; PDB: 3LVL_A 4EB5_C 4EB7_C 1WFZ_A 2Z7E_C 2AZH_A 1XJS_A 1Q48_A 1R9P_A 2KQK_A ....
Probab=23.74  E-value=99  Score=22.29  Aligned_cols=21  Identities=24%  Similarity=0.145  Sum_probs=15.8

Q ss_pred             HHHHHHHHHHHcCCCHHHHHH
Q 030521          122 DYLTETVANRIKNKSVEYVRK  142 (176)
Q Consensus       122 ~l~~~~iA~~i~gks~eeir~  142 (176)
                      -.++..++.+++|||.+|+.+
T Consensus        65 ~Asas~~~~~i~gk~l~ea~~   85 (126)
T PF01592_consen   65 IASASMMCELIKGKTLEEALK   85 (126)
T ss_dssp             HHHHHHHHHHHTTSBHHHHHC
T ss_pred             HHHHHHHHHHHcCCCHHHHHH
Confidence            345567788999999888853


No 75 
>PF07076 DUF1344:  Protein of unknown function (DUF1344);  InterPro: IPR009780 This family consists of several short, hypothetical bacterial proteins of around 80 residues in length. Members of this family are found in Rhizobium, Agrobacterium and Brucella species. The function of this family is unknown.
Probab=23.73  E-value=54  Score=21.13  Aligned_cols=32  Identities=13%  Similarity=0.329  Sum_probs=22.9

Q ss_pred             ccccccccCcCCCCcEEEEeCCCCEEEecHHH
Q 030521            2 AEESSSSCSAQATKKISLKTADGHLFEVEEPV   33 (176)
Q Consensus         2 ~~~~~~~~~~~~~~~i~L~s~DG~~f~V~~~~   33 (176)
                      |.++-..=........+|.-.||+.|.++...
T Consensus         2 A~~veG~I~~id~~~~titLdDGksy~lp~ef   33 (61)
T PF07076_consen    2 AADVEGTIKSIDPETMTITLDDGKSYKLPEEF   33 (61)
T ss_pred             CccceEEEEEEcCCceEEEecCCCEEECCCcc
Confidence            33444444455667788999999999999875


No 76 
>PF07638 Sigma70_ECF:  ECF sigma factor
Probab=23.58  E-value=3.3e+02  Score=20.65  Aligned_cols=46  Identities=24%  Similarity=0.161  Sum_probs=33.5

Q ss_pred             ChHHHHHHHHhhhccchhhHHHHHHHHHHH-HHcCCCHHHHHHHcCCCC
Q 030521          101 SNDELKEMILVANYLNIKDMLDYLTETVAN-RIKNKSVEYVRKFFGIEN  148 (176)
Q Consensus       101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~-~i~gks~eeir~~f~i~~  148 (176)
                      +.+.+.+|-.+-+-|.-  |-.-.++.|.- .+.|.|.+||-+.+||+.
T Consensus       119 ~~~~~~~l~e~l~~L~~--l~~~~~~~v~l~~~~Gls~~EIA~~lgiS~  165 (185)
T PF07638_consen  119 SPEELLELEEALERLLA--LDPRQRRVVELRFFEGLSVEEIAERLGISE  165 (185)
T ss_pred             CHHHHHHHHHHHHHHHc--cCHHHHHHHHHHHHCCCCHHHHHHHHCcCH
Confidence            56677777666654433  55666777765 468999999999999964


No 77 
>PF00690 Cation_ATPase_N:  Cation transporter/ATPase, N-terminus;  InterPro: IPR004014 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2.  This entry represents the conserved N-terminal region found in several classes of cation-transporting P-type ATPases, including those that transport H+ (3.6.3.6 from EC), Na+ (3.6.3.7 from EC), Ca2+ (3.6.3.8 from EC), Na+/K+ (3.6.3.9 from EC), and H+/K+ (3.6.3.10 from EC). In the H+/K+- and Na+/K+-exchange P-ATPases, this domain is found in the catalytic alpha chain. In gastric H+/K+-ATPases, this domain undergoes reversible sequential phosphorylation inducing conformational changes that may be important for regulating the function of these ATPases [, ]. More information about this protein can be found at Protein of the Month: ATP Synthases [].; PDB: 3KDP_C 3N2F_A 3B8E_A 3N23_A 2XZB_A 1MHS_B 3A3Y_A 2ZXE_A 3B8C_A 3B9B_A ....
Probab=23.54  E-value=1.2e+02  Score=19.09  Aligned_cols=30  Identities=37%  Similarity=0.403  Sum_probs=21.8

Q ss_pred             CCCHHHHHHHcC--CCCCCCHHHHHHHHHhcc
Q 030521          134 NKSVEYVRKFFG--IENDFTPEEEEAARKEYE  163 (176)
Q Consensus       134 gks~eeir~~f~--i~~d~t~EEe~~i~~e~~  163 (176)
                      .++.+++.+.|+  ..+.+|++|-++.++.|.
T Consensus         3 ~~~~~~v~~~l~t~~~~GLs~~ev~~r~~~~G   34 (69)
T PF00690_consen    3 QLSVEEVLKRLNTSSSQGLSSEEVEERRKKYG   34 (69)
T ss_dssp             TSSHHHHHHHHTTBTSSBBTHHHHHHHHHHHS
T ss_pred             CCCHHHHHHHHCcCCCCCCCHHHHHHHHHhcc
Confidence            357888888887  334588888777777654


No 78 
>PF07671 DUF1601:  Protein of unknown function (DUF1601);  InterPro: IPR011632 This repeat is found in a small number of proteins and is apparently limited to Coxiella burnetii.
Probab=23.47  E-value=84  Score=18.08  Aligned_cols=30  Identities=17%  Similarity=0.237  Sum_probs=22.7

Q ss_pred             ccchhhHHHHHHHHHHHHHcCCCHHHHHHH
Q 030521          114 YLNIKDMLDYLTETVANRIKNKSVEYVRKF  143 (176)
Q Consensus       114 ~L~I~~L~~l~~~~iA~~i~gks~eeir~~  143 (176)
                      ||.-++|-+.....|+..+..-++.+|-.+
T Consensus         7 ~l~~q~L~~~L~~aV~~~a~~Fn~QeiaNt   36 (37)
T PF07671_consen    7 YLEEQRLSDRLLDAVRRNAEQFNPQEIANT   36 (37)
T ss_pred             hhhhccchHHHHHHHHHHHHHcCHHHHhhc
Confidence            566777888888888887777888777654


No 79 
>PF11338 DUF3140:  Protein of unknown function (DUF3140);  InterPro: IPR021487  Some members in this family of proteins are annotated as DNA binding proteins. No function is currently known. 
Probab=23.08  E-value=84  Score=21.93  Aligned_cols=22  Identities=9%  Similarity=0.288  Sum_probs=17.5

Q ss_pred             cCCCChHHHHHHHHHHHHhhhc
Q 030521           58 LPNVSAAPLSYIIEFCRAHVEF   79 (176)
Q Consensus        58 l~~v~s~~L~~Iie~c~~h~~~   79 (176)
                      |..-.-..|++|+.||..|...
T Consensus        54 ltddD~~hMrkVV~yv~rhlaq   75 (92)
T PF11338_consen   54 LTDDDYEHMRKVVGYVKRHLAQ   75 (92)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhc
Confidence            3334556799999999999987


No 80 
>KOG0027 consensus Calmodulin and related proteins (EF-Hand superfamily) [Signal transduction mechanisms]
Probab=22.83  E-value=3.1e+02  Score=19.99  Aligned_cols=60  Identities=17%  Similarity=0.350  Sum_probs=32.5

Q ss_pred             hHHHHHHHHhhh-----ccchhhHHHHHHHHHHHHHcC-CCHHHHHHHcCCCC-C----CCHHHHHHHHHh
Q 030521          102 NDELKEMILVAN-----YLNIKDMLDYLTETVANRIKN-KSVEYVRKFFGIEN-D----FTPEEEEAARKE  161 (176)
Q Consensus       102 ~~~l~~Ll~AA~-----~L~I~~L~~l~~~~iA~~i~g-ks~eeir~~f~i~~-d----~t~EEe~~i~~e  161 (176)
                      ...+-+++...+     .++...++.+........-.. .+.+++|+.|.+-+ |    +|.+|..++...
T Consensus        43 ~~el~~~~~~~D~dg~g~I~~~eF~~l~~~~~~~~~~~~~~~~el~eaF~~fD~d~~G~Is~~el~~~l~~  113 (151)
T KOG0027|consen   43 EEELRDLIKEIDLDGDGTIDFEEFLDLMEKLGEEKTDEEASSEELKEAFRVFDKDGDGFISASELKKVLTS  113 (151)
T ss_pred             HHHHHHHHHHhCCCCCCeEcHHHHHHHHHhhhcccccccccHHHHHHHHHHHccCCCCcCcHHHHHHHHHH
Confidence            444555555554     466666666666555433222 25556666665433 1    566666666543


No 81 
>PF03918 CcmH:  Cytochrome C biogenesis protein;  InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=21.80  E-value=78  Score=23.88  Aligned_cols=12  Identities=33%  Similarity=0.509  Sum_probs=8.5

Q ss_pred             cCCCHHHHHHHc
Q 030521          133 KNKSVEYVRKFF  144 (176)
Q Consensus       133 ~gks~eeir~~f  144 (176)
                      .|+|.+||+++|
T Consensus        72 ~G~s~~eI~~~~   83 (148)
T PF03918_consen   72 EGKSDEEIIDYF   83 (148)
T ss_dssp             HT--HHHHHHHH
T ss_pred             cCCCHHHHHHHH
Confidence            699999999874


No 82 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=21.67  E-value=67  Score=20.46  Aligned_cols=14  Identities=36%  Similarity=0.570  Sum_probs=11.4

Q ss_pred             CcEEEEeCCCCEEE
Q 030521           15 KKISLKTADGHLFE   28 (176)
Q Consensus        15 ~~i~L~s~DG~~f~   28 (176)
                      +-+++.+.||++|-
T Consensus         5 KA~Kv~~RDGE~~l   18 (65)
T COG4049           5 KAIKVRDRDGEEFL   18 (65)
T ss_pred             eeeEeeccCCceee
Confidence            35789999999885


No 83 
>PF03883 DUF328:  Protein of unknown function (DUF328);  InterPro: IPR005583  The members of this family are functionally uncharacterised. They are about 250 amino acids in length.
Probab=21.57  E-value=88  Score=25.34  Aligned_cols=28  Identities=25%  Similarity=0.383  Sum_probs=15.1

Q ss_pred             HHHHHHHHcCCCHHHHHHHcCCCCCCCH
Q 030521          125 TETVANRIKNKSVEYVRKFFGIENDFTP  152 (176)
Q Consensus       125 ~~~iA~~i~gks~eeir~~f~i~~d~t~  152 (176)
                      .+.|+..++..|.+|+++.|+|...+..
T Consensus        31 ~~~l~~~L~~~s~~el~~l~~is~~la~   58 (237)
T PF03883_consen   31 TEELLEALKSLSEEELKKLMKISDKLAE   58 (237)
T ss_pred             HHHHHHHHHhcCHHHHHHHHCCCHHHHH
Confidence            4444555555666666666666544433


No 84 
>PF00134 Cyclin_N:  Cyclin, N-terminal domain;  InterPro: IPR006671 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. Cyclins contain two domains of similar all-alpha fold, of which this entry is associated with the N-terminal domain.; PDB: 2W2H_B 3RGF_B 1KXU_A 1JKW_A 1URC_D 2WMB_D 1H26_D 1GY3_B 2UUE_D 1E9H_D ....
Probab=21.34  E-value=1.5e+02  Score=20.48  Aligned_cols=42  Identities=10%  Similarity=0.245  Sum_probs=28.6

Q ss_pred             hHHHHHHHHHHHHHcCCCHHHHHHHcCCCC-CCCHHHHHHHHH
Q 030521          119 DMLDYLTETVANRIKNKSVEYVRKFFGIEN-DFTPEEEEAARK  160 (176)
Q Consensus       119 ~L~~l~~~~iA~~i~gks~eeir~~f~i~~-d~t~EEe~~i~~  160 (176)
                      .|+-++|-.+|.++.+..+-.+..+..+.+ .+|.++..+.+.
T Consensus        74 ~li~~~cl~lA~K~~e~~~~~~~~~~~~~~~~~~~~~i~~~E~  116 (127)
T PF00134_consen   74 QLIALACLFLASKMEEDNPPSISDLIRISDNTFTKKDILEMER  116 (127)
T ss_dssp             HHHHHHHHHHHHHHHTSS--HHHHHHHHTTTSSHHHHHHHHHH
T ss_pred             hhhhhhHHHHhhhhhccccchHHHHHHHHcCCCCHHHHHHHHH
Confidence            477788999999998876555666655554 488877666554


No 85 
>smart00224 GGL G protein gamma subunit-like motifs.
Probab=21.22  E-value=1.1e+02  Score=19.40  Aligned_cols=21  Identities=14%  Similarity=0.343  Sum_probs=16.2

Q ss_pred             ChHHHHHHHHHHHHhhhccCC
Q 030521           62 SAAPLSYIIEFCRAHVEFRKK   82 (176)
Q Consensus        62 ~s~~L~~Iie~c~~h~~~~~~   82 (176)
                      .|.....++.||+.|....+.
T Consensus        22 vS~a~~~li~y~e~~~~~DP~   42 (63)
T smart00224       22 VSKAAEELLAYCEQHAEEDPL   42 (63)
T ss_pred             HHHHHHHHHHHHHcCCCCCCC
Confidence            367788899999998776543


No 86 
>PF04751 DUF615:  Protein of unknown function (DUF615);  InterPro: IPR006839 The proteins in this entry are functionally uncharacterised. The entry contains the Escherichia coli (strain K12) protein YjgA (P0A8X0 from SWISSPROT), which has been shown to comigrate with the mature 50S ribosome subunit. Therefore it either represents a novel ribosome-associated protein or it is associated with a different oligomeric complex that comigrates with ribosomal particles [].; PDB: 2P0T_A.
Probab=20.97  E-value=3e+02  Score=20.97  Aligned_cols=57  Identities=12%  Similarity=0.225  Sum_probs=44.2

Q ss_pred             ccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcC
Q 030521           88 EVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFG  145 (176)
Q Consensus        88 ~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~  145 (176)
                      .+..|-..++.. +...+-+++..-.-.+.+.|..++-..-...-.+|++.--|++|.
T Consensus        97 ~lE~wRdrLi~~-~~~al~e~~~~~p~~D~Q~LRqLiR~a~ke~~~~k~~ka~R~Lfr  153 (157)
T PF04751_consen   97 RLERWRDRLIAD-DDSALTEFLAEYPDADRQQLRQLIRNARKEREANKPPKAYRELFR  153 (157)
T ss_dssp             HHHHHHHHHHHS--HHHHHHHHHHSTTS-HHHHHHHHHHHHHHHHTT---HHHHHHHH
T ss_pred             HHHHHHHHHHhC-ChHHHHHHHHHCChhhHHHHHHHHHHHHHHHHcCCCchhHHHHHH
Confidence            467888888888 888888888888888999999999888888888999888888774


No 87 
>PLN03219 uncharacterized protein; Provisional
Probab=20.60  E-value=3.3e+02  Score=19.55  Aligned_cols=59  Identities=8%  Similarity=0.126  Sum_probs=38.1

Q ss_pred             CCCCcEEEEeCC---CCEEEecHHHHHHhHHHHHHHhcC----CCCCCCcccccCCCChHHHHHHHHH
Q 030521           12 QATKKISLKTAD---GHLFEVEEPVAMEFEIVKSFFDEN----DDASEDTVVPLPNVSAAPLSYIIEF   72 (176)
Q Consensus        12 ~~~~~i~L~s~D---G~~f~V~~~~a~~S~~l~~ll~~~----~~~~~~~~Ipl~~v~s~~L~~Iie~   72 (176)
                      .+...+.+...+   ++.|.|+...+. -+.++.||+..    |.+....+|.|| =....++.|+..
T Consensus        39 vpkGh~aVYVG~~~E~kRFvVPi~yL~-hP~F~~LL~~AeEEfGf~~~~G~L~IP-Cd~~~F~~ll~~  104 (108)
T PLN03219         39 VPKGHVAVYVGEQMEKKRFVVPISYLN-HPLFREFLNRAEEECGFHHSMGGLTIP-CREESFLHLITS  104 (108)
T ss_pred             CCCCeEEEEECCCCCceEEEEEHHHcC-ChHHHHHHHHHHHHhCCCCCCCCEEEe-CCHHHHHHHHHh
Confidence            555556665554   699999998754 46667776532    332112477777 777888887764


No 88 
>PF02657 SufE:  Fe-S metabolism associated domain;  InterPro: IPR003808 This entry represents the core domain of SufE and related proteins. This domain of SufE shows strong structural similarity to IscU, and the sulfur-acceptor site in SufE coincides with the location of the cysteine residues mediating Fe-S cluster assembly in IscU. Thus, a conserved core structure is implicated in mediating the interactions of both SufE and IscU with the mutually homologous cysteine desulfurase enzymes present in their respective operons [].; PDB: 1MZG_B 1WLO_A 3G0M_A 1NI7_A.
Probab=20.47  E-value=1.8e+02  Score=21.11  Aligned_cols=35  Identities=23%  Similarity=0.475  Sum_probs=23.6

Q ss_pred             cchhhHHHHHHHHHHHHHcCCCHHHHHHH-------cCCCCCCCHH
Q 030521          115 LNIKDMLDYLTETVANRIKNKSVEYVRKF-------FGIENDFTPE  153 (176)
Q Consensus       115 L~I~~L~~l~~~~iA~~i~gks~eeir~~-------f~i~~d~t~E  153 (176)
                      +=+++|+-+.+.    .++|+|++||..+       +|+..-+||-
T Consensus        67 ~ivkGl~all~~----~~~g~t~~eI~~~~~~fl~~lgl~~~Ls~s  108 (125)
T PF02657_consen   67 RIVKGLLALLLE----VLNGQTPEEILAFDPDFLEQLGLSQHLSPS  108 (125)
T ss_dssp             HHHHHHHHHHHH----HTTT-BHHHHHHS-THHHHHHTSCCCSTHH
T ss_pred             HHHHHHHHHHHH----HHcCCCHHHHHhCCHHHHHHcCcccccCch
Confidence            335555555544    4589999999765       7888888874


No 89 
>smart00875 BACK BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.
Probab=20.33  E-value=2.6e+02  Score=18.10  Aligned_cols=62  Identities=16%  Similarity=0.288  Sum_probs=37.1

Q ss_pred             HHHHHhhhccchhhHHHHHHHHHHHH---------HcCCCHHHHHHHcCCCCCCC-HHHHHHHHHhcccccCC
Q 030521          106 KEMILVANYLNIKDMLDYLTETVANR---------IKNKSVEYVRKFFGIENDFT-PEEEEAARKEYEWAFEG  168 (176)
Q Consensus       106 ~~Ll~AA~~L~I~~L~~l~~~~iA~~---------i~gks~eeir~~f~i~~d~t-~EEe~~i~~e~~w~~~~  168 (176)
                      +++...|...+.+.|.+.|.++++..         +...+.+++..+++= ++++ ..|..-.+---.|+...
T Consensus         2 ~~i~~~a~~~~~~~L~~~~~~~i~~nf~~~~~~~~f~~L~~~~l~~iL~~-d~l~v~~E~~v~~av~~W~~~~   73 (101)
T smart00875        2 LGIRRFAELYGLEELLEKALRFILKNFLEVAQSEEFLELSLEQLLSLLSS-DDLNVPSEEEVFEAVLRWVKHD   73 (101)
T ss_pred             HhHHHHHHHhChHHHHHHHHHHHHHHHHHHhcCcHHhcCCHHHHHHHhCc-ccCCCCCHHHHHHHHHHHHHCC
Confidence            34566677788888888888887663         234567777777654 2343 22333333345566543


Done!