Query 030521
Match_columns 176
No_of_seqs 113 out of 765
Neff 7.5
Searched_HMMs 46136
Date Fri Mar 29 14:59:15 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/030521.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/030521hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1724 SCF ubiquitin ligase, 100.0 6.6E-49 1.4E-53 300.5 16.5 155 12-167 2-162 (162)
2 COG5201 SKP1 SCF ubiquitin lig 100.0 8E-46 1.7E-50 269.5 15.0 150 15-167 2-157 (158)
3 PF01466 Skp1: Skp1 family, di 100.0 5.7E-30 1.2E-34 174.8 8.2 78 89-167 1-78 (78)
4 smart00512 Skp1 Found in Skp1 100.0 2.2E-28 4.7E-33 175.4 11.2 102 15-117 2-104 (104)
5 PF03931 Skp1_POZ: Skp1 family 99.8 4E-20 8.8E-25 120.8 6.8 62 15-78 1-62 (62)
6 KOG3473 RNA polymerase II tran 99.8 2.8E-18 6E-23 119.5 7.9 103 6-117 8-112 (112)
7 PF00651 BTB: BTB/POZ domain; 98.8 1.1E-07 2.5E-12 67.1 9.8 99 13-130 9-109 (111)
8 PHA02713 hypothetical protein; 98.7 1.6E-07 3.5E-12 85.0 10.8 106 14-141 25-132 (557)
9 smart00225 BTB Broad-Complex, 98.5 5.7E-07 1.2E-11 60.0 6.8 85 21-125 5-90 (90)
10 PHA02790 Kelch-like protein; P 98.4 1.1E-06 2.3E-11 78.3 7.5 99 17-136 23-124 (480)
11 PHA03098 kelch-like protein; P 98.3 3.8E-06 8.2E-11 75.2 10.5 98 15-136 10-109 (534)
12 KOG4441 Proteins containing BT 98.3 4E-06 8.8E-11 76.2 8.8 96 16-132 38-134 (571)
13 KOG4682 Uncharacterized conser 96.5 0.008 1.7E-07 52.4 6.3 108 21-151 75-186 (488)
14 KOG4350 Uncharacterized conser 96.3 0.037 8E-07 48.6 9.2 120 13-141 43-181 (620)
15 KOG2716 Polymerase delta-inter 95.5 0.17 3.6E-06 41.2 9.4 103 13-135 3-108 (230)
16 KOG3433 Protein involved in me 94.0 0.13 2.7E-06 40.4 5.0 31 116-151 170-200 (203)
17 KOG0783 Uncharacterized conser 93.4 0.22 4.7E-06 47.4 6.4 112 17-143 713-826 (1267)
18 COG5124 Protein predicted to b 91.9 0.17 3.7E-06 39.5 3.1 31 116-151 174-204 (209)
19 PF11822 DUF3342: Domain of un 90.3 0.54 1.2E-05 40.0 4.9 90 24-133 13-105 (317)
20 PF02214 BTB_2: BTB/POZ domain 89.3 0.29 6.2E-06 33.6 2.1 83 23-124 6-94 (94)
21 PF03962 Mnd1: Mnd1 family; I 84.7 1.4 3.1E-05 34.6 4.0 44 103-151 142-187 (188)
22 KOG2075 Topoisomerase TOP1-int 73.9 18 0.00039 32.7 7.6 108 25-132 129-250 (521)
23 PF03656 Pam16: Pam16; InterP 73.5 1.5 3.3E-05 32.4 0.8 39 131-172 51-89 (127)
24 KOG0511 Ankyrin repeat protein 67.8 16 0.00034 32.3 5.8 108 22-133 298-431 (516)
25 PF05409 Peptidase_C30: Corona 63.2 23 0.00049 29.8 5.7 98 54-155 161-265 (293)
26 KOG1665 AFH1-interacting prote 60.0 25 0.00054 28.9 5.3 93 13-126 7-105 (302)
27 PF07928 Vps54: Vps54-like pro 57.5 3.4 7.4E-05 30.8 0.0 47 23-77 1-47 (135)
28 PRK05365 malonic semialdehyde 53.1 9.4 0.0002 29.6 1.9 35 107-151 130-164 (195)
29 KOG3442 Uncharacterized conser 52.2 12 0.00027 27.6 2.2 36 133-171 54-89 (132)
30 PRK13696 hypothetical protein; 51.6 42 0.00091 21.7 4.3 26 133-161 36-61 (62)
31 cd02148 Nitroreductase_5 Nitro 50.5 10 0.00023 29.0 1.7 33 108-150 124-156 (185)
32 KOG4591 Uncharacterized conser 49.2 33 0.00071 27.8 4.3 57 56-130 108-164 (280)
33 PF11978 MVP_shoulder: Shoulde 46.0 18 0.00038 26.5 2.1 27 109-135 37-63 (118)
34 PF12556 CobS_N: Cobaltochelat 44.6 13 0.00029 21.3 1.1 12 140-151 14-25 (36)
35 PF10892 DUF2688: Protein of u 44.5 27 0.0006 22.1 2.6 25 137-161 30-56 (60)
36 KOG3840 Uncharaterized conserv 39.8 46 0.001 28.6 4.0 67 8-75 89-161 (438)
37 PF10075 PCI_Csn8: COP9 signal 39.3 71 0.0015 23.4 4.7 48 117-168 76-127 (143)
38 PRK14981 DNA-directed RNA poly 38.9 1.2E+02 0.0025 21.7 5.6 45 116-161 61-106 (112)
39 PF15063 TC1: Thyroid cancer p 38.7 33 0.00072 23.1 2.4 31 127-157 29-59 (79)
40 smart00385 CYCLIN domain prese 38.2 97 0.0021 19.1 5.1 41 117-160 37-79 (83)
41 PF11460 DUF3007: Protein of u 37.2 34 0.00074 24.4 2.5 24 136-162 79-102 (104)
42 KOG4350 Uncharacterized conser 36.6 34 0.00073 30.6 2.8 33 101-133 114-146 (620)
43 PF13921 Myb_DNA-bind_6: Myb-l 35.4 64 0.0014 19.7 3.3 34 125-158 20-60 (60)
44 PF11197 DUF2835: Protein of u 35.2 46 0.001 21.8 2.7 25 11-35 17-41 (68)
45 PRK11053 dihydropteridine redu 35.1 30 0.00065 27.2 2.2 36 105-150 147-183 (217)
46 PF06851 DUF1247: Protein of u 33.3 1.5E+02 0.0032 22.4 5.4 72 72-147 61-134 (148)
47 cd00068 GGL G protein gamma su 33.0 59 0.0013 20.3 2.8 20 62-81 22-41 (57)
48 TIGR02698 CopY_TcrY copper tra 32.6 1.5E+02 0.0032 21.6 5.4 38 124-161 76-113 (130)
49 PF10415 FumaraseC_C: Fumarase 32.3 1.1E+02 0.0024 18.9 4.0 23 133-158 24-46 (55)
50 cd00167 SANT 'SWI3, ADA2, N-Co 31.9 51 0.0011 18.1 2.3 20 126-145 23-42 (45)
51 PF11116 DUF2624: Protein of u 31.5 1.2E+02 0.0025 20.9 4.3 27 130-158 10-39 (85)
52 PF08671 SinI: Anti-repressor 31.4 38 0.00083 18.6 1.5 13 133-145 15-27 (30)
53 cd05029 S-100A6 S-100A6: S-100 30.5 1.1E+02 0.0025 20.6 4.2 45 117-161 9-59 (88)
54 PF02519 Auxin_inducible: Auxi 30.3 1.9E+02 0.0042 20.2 6.1 62 10-73 34-99 (100)
55 smart00717 SANT SANT SWI3, AD 30.1 57 0.0012 18.2 2.3 20 126-145 25-44 (49)
56 TIGR02476 BluB cob(II)yrinic a 30.1 42 0.00091 26.1 2.2 38 104-151 133-170 (205)
57 PF13798 PCYCGC: Protein of un 27.6 89 0.0019 24.0 3.5 23 122-144 119-141 (158)
58 PRK09296 cysteine desufuration 27.6 1E+02 0.0022 23.0 3.8 28 125-152 82-117 (138)
59 PF11198 DUF2857: Protein of u 26.9 2.4E+02 0.0052 21.8 6.0 63 101-165 59-136 (180)
60 cd02144 iodotyrosine_dehalogen 26.9 53 0.0012 24.8 2.3 38 105-152 129-167 (193)
61 PF00382 TFIIB: Transcription 26.7 1.6E+02 0.0035 18.6 4.3 30 118-147 36-67 (71)
62 TIGR01999 iscU FeS cluster ass 25.8 79 0.0017 22.8 2.9 38 108-145 50-88 (124)
63 TIGR03147 cyt_nit_nrfF cytochr 25.8 71 0.0015 23.6 2.6 19 126-144 64-83 (126)
64 COG1926 Predicted phosphoribos 25.6 1.2E+02 0.0025 24.7 3.9 63 53-116 125-215 (220)
65 TIGR03419 NifU_clost FeS clust 25.6 88 0.0019 22.5 3.1 37 108-144 46-83 (121)
66 PRK11325 scaffold protein; Pro 25.5 79 0.0017 23.0 2.9 38 108-145 52-90 (127)
67 PF03750 DUF310: Protein of un 25.3 1E+02 0.0022 22.1 3.4 34 115-149 8-43 (119)
68 PHA03098 kelch-like protein; P 25.3 2.4E+02 0.0053 25.1 6.5 98 62-168 74-182 (534)
69 PRK15019 CsdA-binding activato 24.6 1.1E+02 0.0025 23.0 3.6 37 112-152 83-127 (147)
70 COG4957 Predicted transcriptio 24.5 67 0.0014 24.2 2.2 18 134-151 98-115 (148)
71 TIGR03391 FeS_syn_CsdE cystein 24.1 1.2E+02 0.0026 22.6 3.6 36 113-152 79-122 (138)
72 PF00249 Myb_DNA-binding: Myb- 24.1 1.6E+02 0.0034 17.1 4.9 39 101-144 5-44 (48)
73 cd02140 Nitroreductase_4 Nitro 23.9 62 0.0013 24.8 2.2 39 104-150 126-164 (192)
74 PF01592 NifU_N: NifU-like N t 23.7 99 0.0021 22.3 3.1 21 122-142 65-85 (126)
75 PF07076 DUF1344: Protein of u 23.7 54 0.0012 21.1 1.4 32 2-33 2-33 (61)
76 PF07638 Sigma70_ECF: ECF sigm 23.6 3.3E+02 0.0072 20.7 7.1 46 101-148 119-165 (185)
77 PF00690 Cation_ATPase_N: Cati 23.5 1.2E+02 0.0026 19.1 3.2 30 134-163 3-34 (69)
78 PF07671 DUF1601: Protein of u 23.5 84 0.0018 18.1 2.1 30 114-143 7-36 (37)
79 PF11338 DUF3140: Protein of u 23.1 84 0.0018 21.9 2.4 22 58-79 54-75 (92)
80 KOG0027 Calmodulin and related 22.8 3.1E+02 0.0067 20.0 6.1 60 102-161 43-113 (151)
81 PF03918 CcmH: Cytochrome C bi 21.8 78 0.0017 23.9 2.2 12 133-144 72-83 (148)
82 COG4049 Uncharacterized protei 21.7 67 0.0014 20.5 1.5 14 15-28 5-18 (65)
83 PF03883 DUF328: Protein of un 21.6 88 0.0019 25.3 2.7 28 125-152 31-58 (237)
84 PF00134 Cyclin_N: Cyclin, N-t 21.3 1.5E+02 0.0033 20.5 3.6 42 119-160 74-116 (127)
85 smart00224 GGL G protein gamma 21.2 1.1E+02 0.0025 19.4 2.6 21 62-82 22-42 (63)
86 PF04751 DUF615: Protein of un 21.0 3E+02 0.0064 21.0 5.3 57 88-145 97-153 (157)
87 PLN03219 uncharacterized prote 20.6 3.3E+02 0.0072 19.5 6.3 59 12-72 39-104 (108)
88 PF02657 SufE: Fe-S metabolism 20.5 1.8E+02 0.0039 21.1 3.9 35 115-153 67-108 (125)
89 smart00875 BACK BTB And C-term 20.3 2.6E+02 0.0055 18.1 5.7 62 106-168 2-73 (101)
No 1
>KOG1724 consensus SCF ubiquitin ligase, Skp1 component [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=6.6e-49 Score=300.50 Aligned_cols=155 Identities=45% Similarity=0.695 Sum_probs=142.0
Q ss_pred CCCCcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCC--C----CC
Q 030521 12 QATKKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKK--R----SP 85 (176)
Q Consensus 12 ~~~~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~--~----~~ 85 (176)
|+.++|+|+|+||++|.|+.++|++|.+|++++...+...++.+||||+|+|.||++|++||+||+.++++ + ..
T Consensus 2 ~~~~~ikL~SsDG~~f~ve~~~a~~s~~i~~~~~~~~~~~~~~~IPl~nV~~~iL~kVIewC~~Hk~d~~~~~~~~~~~~ 81 (162)
T KOG1724|consen 2 MSKKKIKLESSDGEIFEVEEEVARQSQTISAHMIEDGCADENDPIPLPNVTSKILKKVIEWCKKHKDDDPANPEDKELPE 81 (162)
T ss_pred CCCCeEEEEccCCceeehhHHHHHHhHHHHHHHHHcCCCccCCccccCccCHHHHHHHHHHHHHcccccccccccccccc
Confidence 67899999999999999999999999999999987776543369999999999999999999999998654 1 12
Q ss_pred ccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccc
Q 030521 86 KQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWA 165 (176)
Q Consensus 86 ~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~ 165 (176)
...+++||++|++. ++.+||+|+.|||||+|++|+++||+.||++++||||+|||++|||++|+|+||++++++++.|+
T Consensus 82 ~~~i~~WD~~Flk~-d~~tLfdli~AAnyLdi~gLl~~~ck~va~mikgktpeEir~~f~I~~d~t~eE~~~~~~e~~~~ 160 (162)
T KOG1724|consen 82 ETDIPEWDAEFLKV-DQGTLFDLILAANYLDIKGLLDLTCKTVANMIKGKTPEEIREIFNIENDETPEEEEAIRKENEWA 160 (162)
T ss_pred cCCccHHHHHHHhc-CHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHccCCHHHHHHHcCCCCCCChhHHHHHhhccccc
Confidence 23599999999995 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cC
Q 030521 166 FE 167 (176)
Q Consensus 166 ~~ 167 (176)
++
T Consensus 161 ~~ 162 (162)
T KOG1724|consen 161 FE 162 (162)
T ss_pred cC
Confidence 85
No 2
>COG5201 SKP1 SCF ubiquitin ligase, SKP1 component [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=8e-46 Score=269.48 Aligned_cols=150 Identities=35% Similarity=0.531 Sum_probs=137.6
Q ss_pred CcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCC-CCC-----ccc
Q 030521 15 KKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKK-RSP-----KQE 88 (176)
Q Consensus 15 ~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~-~~~-----~~~ 88 (176)
++|+|.|.||++|+|+..+|..|-+|++++.+.+.. +-|||+|+|+|.+|.+|++||+||+....+ .++ ..+
T Consensus 2 s~i~l~s~dge~F~vd~~iAerSiLikN~l~d~~~~--n~p~p~pnVrSsvl~kv~ew~ehh~~s~sede~d~~~rks~p 79 (158)
T COG5201 2 SMIELESIDGEIFRVDENIAERSILIKNMLCDSTAC--NYPIPAPNVRSSVLMKVQEWMEHHTSSLSEDENDLEIRKSKP 79 (158)
T ss_pred CceEEEecCCcEEEehHHHHHHHHHHHHHhcccccc--CCCCcccchhHHHHHHHHHHHHhccccCCCccChHhhhccCC
Confidence 579999999999999999999999999998876553 358999999999999999999999986544 222 236
Q ss_pred cchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccC
Q 030521 89 VKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFE 167 (176)
Q Consensus 89 ~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~ 167 (176)
.+.||+.|+.. |+++|+++++|||||+|++|+++||+.||++|+||||+|||++|||++||||||+++|||||.|+.+
T Consensus 80 ~D~wdr~Fm~v-DqemL~eI~laaNYL~ikpLLd~gCKivaemirgkSpeeir~tfni~ndfTpEEe~~irkEnEWaed 157 (158)
T COG5201 80 SDFWDRFFMEV-DQEMLLEICLAANYLEIKPLLDLGCKIVAEMIRGKSPEEIRETFNIENDFTPEEERRIRKENEWAED 157 (158)
T ss_pred ccHHHHHHHHh-hHHHHHHHHHhhccccchHHHHHHHHHHHHHHccCCHHHHHHHhCCCCCCCHHHHHHHHHhcccccc
Confidence 88999999999 9999999999999999999999999999999999999999999999999999999999999999975
No 3
>PF01466 Skp1: Skp1 family, dimerisation domain; InterPro: IPR016072 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a dimerisation domain found at the C-terminal of SKP1 proteins [], as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. This domain is multi-helical in structure, and consists of an interlocked herterodimer in F-box proteins.; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 2P1O_A 3OGL_G 3OGM_A 3C6O_A 2P1N_A 2P1Q_A 3OGK_I 3C6N_A 3C6P_A 2P1P_A ....
Probab=99.96 E-value=5.7e-30 Score=174.83 Aligned_cols=78 Identities=55% Similarity=0.860 Sum_probs=68.3
Q ss_pred cchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccC
Q 030521 89 VKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFE 167 (176)
Q Consensus 89 ~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~ 167 (176)
+++||++|++. +.+.|++|+.||+||+|++|+++||++||.+++|||++|||++|||++|+|+||++++|+||+|+|+
T Consensus 1 l~~wD~~F~~~-~~~~L~~l~~AA~yL~I~~L~~~~~~~iA~~i~gks~eeir~~fgi~~d~t~eee~~i~~e~~w~~~ 78 (78)
T PF01466_consen 1 LPEWDQEFLDV-DNDELFDLLNAANYLDIKGLLDLCCKYIANMIKGKSPEEIRKYFGIENDLTPEEEEEIRKENEWAFE 78 (78)
T ss_dssp HHHHHHHHT-S--HHHHHHHHHHHHHHT-HHHHHHHHHHHHHHHTTS-HHHHHHHHT---TSSHHHHHHHHHHCCCTB-
T ss_pred CCHHHHHHHHc-CHHHHHHHHHHHHHHcchHHHHHHHHHHHHHhcCCCHHHHHHHcCCCCCCCHHHHHHHHHHcchhcC
Confidence 47899999966 9999999999999999999999999999999999999999999999999999999999999999986
No 4
>smart00512 Skp1 Found in Skp1 protein family. Family of Skp1 (kinetochore protein required for cell cycle progression) and elongin C (subunit of RNA polymerase II transcription factor SIII) homologues.
Probab=99.96 E-value=2.2e-28 Score=175.38 Aligned_cols=102 Identities=37% Similarity=0.618 Sum_probs=89.7
Q ss_pred CcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCC-CCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhh
Q 030521 15 KKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDAS-EDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFN 93 (176)
Q Consensus 15 ~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~-~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd 93 (176)
.+|+|+|+||++|.|++.+|++|++|++++++.+.++ ...+||||+|+|.+|++|++||+||+.++.+......+++||
T Consensus 2 ~~v~L~S~Dg~~f~v~~~~a~~S~~i~~~l~~~~~~~~~~~~Ipl~~v~~~~L~~Vi~yc~~h~~~~~~~~~~~~~~~wD 81 (104)
T smart00512 2 KYIKLISSDGEVFEVEREVARQSKTIKAMIEDLGVDDENNNPIPLPNVTSKILSKVIEYCEHHVDDPPSVADKDDIPTWD 81 (104)
T ss_pred CeEEEEeCCCCEEEecHHHHHHHHHHHHHHHccCcccCCCCCccCCCcCHHHHHHHHHHHHHcccCCCCccccccccHHH
Confidence 5799999999999999999999999999999866543 235999999999999999999999998875422224689999
Q ss_pred hhhccCCChHHHHHHHHhhhccch
Q 030521 94 EEFGKSKSNDELKEMILVANYLNI 117 (176)
Q Consensus 94 ~eFl~~~~~~~l~~Ll~AA~~L~I 117 (176)
.+|++. +++.|++|+.|||||+|
T Consensus 82 ~~F~~~-d~~~l~dLl~AAnyL~I 104 (104)
T smart00512 82 AEFLKI-DQETLFELILAANYLDI 104 (104)
T ss_pred HHHHcC-CHHHHHHHHHHHHhhCC
Confidence 999996 99999999999999997
No 5
>PF03931 Skp1_POZ: Skp1 family, tetramerisation domain; InterPro: IPR016073 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a POZ domain with a core structure consisting of beta(2)/alpha(2)/beta(2)/alpha(2) in two layers, alpha/beta. This domain is found at the N-terminal of SKP1 proteins [] as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. ; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 1LM8_C 2XAI_E 1VCB_E 3ZRC_K 3ZRF_E 3DCG_B 2C9W_C 1LQB_B 2IZV_C 1HV2_A ....
Probab=99.81 E-value=4e-20 Score=120.81 Aligned_cols=62 Identities=34% Similarity=0.600 Sum_probs=55.4
Q ss_pred CcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhh
Q 030521 15 KKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVE 78 (176)
Q Consensus 15 ~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~ 78 (176)
.+|+|+|+||+.|.|++++|++|++|++|+++.+..+. +||||+|++++|++|++||+||+.
T Consensus 1 ~~v~L~SsDg~~f~V~~~~a~~S~~i~~ml~~~~~~~~--~Ipl~~v~~~~L~kViewc~~H~~ 62 (62)
T PF03931_consen 1 MYVKLVSSDGQEFEVSREAAKQSKTIKNMLEDLGDEDE--PIPLPNVSSRILKKVIEWCEHHKN 62 (62)
T ss_dssp -EEEEEETTSEEEEEEHHHHTTSHHHHHHHHCTCCCGT--EEEETTS-HHHHHHHHHHHHHHHH
T ss_pred CEEEEEcCCCCEEEeeHHHHHHhHHHHHHHhhhccccc--ccccCccCHHHHHHHHHHHHhcCC
Confidence 36999999999999999999999999999998766543 799999999999999999999974
No 6
>KOG3473 consensus RNA polymerase II transcription elongation factor Elongin/SIII, subunit elongin C [Transcription]
Probab=99.76 E-value=2.8e-18 Score=119.52 Aligned_cols=103 Identities=18% Similarity=0.343 Sum_probs=89.0
Q ss_pred ccccCcCCCCcEEEEeCCCCEEEecHHHHHHhHHHHHHHhcCCCC--CCCcccccCCCChHHHHHHHHHHHHhhhccCCC
Q 030521 6 SSSCSAQATKKISLKTADGHLFEVEEPVAMEFEIVKSFFDENDDA--SEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKR 83 (176)
Q Consensus 6 ~~~~~~~~~~~i~L~s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~--~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~ 83 (176)
-++|..|.+.+++|+|+||++|.|.+.+|..|++|+.|+.+.+.. .+++.+.++.++|.+|+||++|+.+...+..
T Consensus 8 ~~g~egp~~~yVkLvS~Ddhefiikre~AmtSgTiraml~gpg~~se~~~n~v~f~di~shiLeKvc~Yl~Yk~rY~~-- 85 (112)
T KOG3473|consen 8 YGGCEGPDSMYVKLVSSDDHEFIIKREHAMTSGTIRAMLSGPGVFSEAEKNEVYFRDIPSHILEKVCEYLAYKVRYTN-- 85 (112)
T ss_pred cCCccCcchhheEeecCCCcEEEEeehhhhhhhHHHHHHcCCccccccccceEEeccchHHHHHHHHHHhhheeeecc--
Confidence 368999999999999999999999999999999999999976543 2457899999999999999999998776643
Q ss_pred CCccccchhhhhhccCCChHHHHHHHHhhhccch
Q 030521 84 SPKQEVKTFNEEFGKSKSNDELKEMILVANYLNI 117 (176)
Q Consensus 84 ~~~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I 117 (176)
....+++|| + ..+..++|+.|||||++
T Consensus 86 -~s~eiPeF~-----I-ppemaleLL~aAn~Lec 112 (112)
T KOG3473|consen 86 -SSTEIPEFD-----I-PPEMALELLMAANYLEC 112 (112)
T ss_pred -ccccCCCCC-----C-CHHHHHHHHHHhhhhcC
Confidence 235688875 5 89999999999999974
No 7
>PF00651 BTB: BTB/POZ domain; InterPro: IPR013069 The BTB (for BR-C, ttk and bab) [] or POZ (for Pox virus and Zinc finger) [] domain is present near the N terminus of a fraction of zinc finger (IPR007087 from INTERPRO) proteins and in proteins that contain the IPR006652 from INTERPRO motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation []. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule []. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT [, , ]. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.; GO: 0005515 protein binding; PDB: 3M5B_A 1R28_B 3LBZ_A 3E4U_F 3BIM_B 1R2B_A 1R29_A 2VPK_A 2YY9_B 3GA1_A ....
Probab=98.75 E-value=1.1e-07 Score=67.15 Aligned_cols=99 Identities=16% Similarity=0.334 Sum_probs=77.2
Q ss_pred CCCcEEEEeCCCCEEEecHHHH-HHhHHHHHHHhcCC-CCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccc
Q 030521 13 ATKKISLKTADGHLFEVEEPVA-MEFEIVKSFFDEND-DASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVK 90 (176)
Q Consensus 13 ~~~~i~L~s~DG~~f~V~~~~a-~~S~~l~~ll~~~~-~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~ 90 (176)
....++|+..||+.|.|++.++ .+|+++++++...+ .+.....|++++++...++.+++||+..... ++
T Consensus 9 ~~~D~~i~v~d~~~~~vhk~iL~~~S~~F~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~l~~~Y~~~~~---------~~ 79 (111)
T PF00651_consen 9 EFSDVTIRVGDGKTFYVHKNILAARSPYFRNLFEGSKFKESTVPEISLPDVSPEAFEAFLEYMYTGEIE---------IN 79 (111)
T ss_dssp TS--EEEEETTTEEEEE-HHHHHHHBHHHHHHHTTTTSTTSSEEEEEETTSCHHHHHHHHHHHHHSEEE---------EE
T ss_pred CCCCEEEEECCCEEEeechhhhhccchhhhhcccccccccccccccccccccccccccccccccCCccc---------CC
Confidence 3466899999999999999996 56999999999773 2222246888999999999999999643211 11
Q ss_pred hhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHH
Q 030521 91 TFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVAN 130 (176)
Q Consensus 91 ~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~ 130 (176)
+.+.+.+++..|++|+++.|.+.|+.++..
T Consensus 80 ----------~~~~~~~ll~lA~~~~~~~L~~~~~~~l~~ 109 (111)
T PF00651_consen 80 ----------SDENVEELLELADKLQIPELKKACEKFLQE 109 (111)
T ss_dssp -----------TTTHHHHHHHHHHTTBHHHHHHHHHHHHH
T ss_pred ----------HHHHHHHHHHHHHHhCcHHHHHHHHHHHHh
Confidence 256689999999999999999999999864
No 8
>PHA02713 hypothetical protein; Provisional
Probab=98.68 E-value=1.6e-07 Score=85.00 Aligned_cols=106 Identities=13% Similarity=0.193 Sum_probs=85.4
Q ss_pred CCcEEEEeCCCCEEEecHHHHH-HhHHHHHHHhcCCCCC-CCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccch
Q 030521 14 TKKISLKTADGHLFEVEEPVAM-EFEIVKSFFDENDDAS-EDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKT 91 (176)
Q Consensus 14 ~~~i~L~s~DG~~f~V~~~~a~-~S~~l~~ll~~~~~~~-~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~ 91 (176)
-..++|...+|+.|.+++.++. .|++++.|+...-.++ ....|.|++++..+|+.|++|++...
T Consensus 25 l~DV~L~v~~~~~f~~Hr~vLaa~S~YF~amF~~~~~e~~~~~~v~l~~v~~~~~~~ll~y~Yt~~-------------- 90 (557)
T PHA02713 25 LCDVIITIGDGEEIKAHKTILAAGSKYFRTLFTTPMIIRDLVTRVNLQMFDKDAVKNIVQYLYNRH-------------- 90 (557)
T ss_pred CCCEEEEeCCCCEEeehHHHHhhcCHHHHHHhcCCchhhccCceEEeccCCHHHHHHHHHHhcCCC--------------
Confidence 3468888878999999999964 7999999997643222 23578899999999999999999621
Q ss_pred hhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHH
Q 030521 92 FNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVR 141 (176)
Q Consensus 92 wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir 141 (176)
. +.+.+.+|+.||++|+|+.|.+.||.++...+.-.+-=+|.
T Consensus 91 -------i-~~~nv~~ll~aA~~lqi~~l~~~C~~~l~~~l~~~NCl~i~ 132 (557)
T PHA02713 91 -------I-SSMNVIDVLKCADYLLIDDLVTDCESYIKDYTNHDTCIYMY 132 (557)
T ss_pred -------C-CHHHHHHHHHHHHHHCHHHHHHHHHHHHHhhCCccchHHHH
Confidence 1 45679999999999999999999999998877665544444
No 9
>smart00225 BTB Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
Probab=98.48 E-value=5.7e-07 Score=60.02 Aligned_cols=85 Identities=19% Similarity=0.341 Sum_probs=67.9
Q ss_pred eCCCCEEEecHHHHHH-hHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhccC
Q 030521 21 TADGHLFEVEEPVAME-FEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGKS 99 (176)
Q Consensus 21 s~DG~~f~V~~~~a~~-S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~~ 99 (176)
..+|+.|.+++.++.. |++++.++...........|++++++..+++.+++|++...- ..
T Consensus 5 ~v~~~~~~~h~~iL~~~s~~f~~~~~~~~~~~~~~~i~l~~~~~~~f~~~l~~ly~~~~-------------------~~ 65 (90)
T smart00225 5 VVGGKKFKAHKAVLAACSPYFKALFSGDFKESKKSEIYLDDVSPEDFRALLEFLYTGKL-------------------DL 65 (90)
T ss_pred EECCEEEehHHHHHhhcCHHHHHHHcCCCccCCCCEEEecCCCHHHHHHHHHeecCcee-------------------ec
Confidence 4567999999999764 899999998654322345788999999999999999996431 12
Q ss_pred CChHHHHHHHHhhhccchhhHHHHHH
Q 030521 100 KSNDELKEMILVANYLNIKDMLDYLT 125 (176)
Q Consensus 100 ~~~~~l~~Ll~AA~~L~I~~L~~l~~ 125 (176)
+...+.+++.+|++++++.|.+.|+
T Consensus 66 -~~~~~~~l~~~a~~~~~~~l~~~c~ 90 (90)
T smart00225 66 -PEENVEELLELADYLQIPGLVELCE 90 (90)
T ss_pred -CHHHHHHHHHHHHHHCcHHHHhhhC
Confidence 3447899999999999999998874
No 10
>PHA02790 Kelch-like protein; Provisional
Probab=98.37 E-value=1.1e-06 Score=78.29 Aligned_cols=99 Identities=6% Similarity=0.054 Sum_probs=74.5
Q ss_pred EEEEeCCCCEEEecHHH-HHHhHHHHHHHhcCCCCCCCccccc--CCCChHHHHHHHHHHHHhhhccCCCCCccccchhh
Q 030521 17 ISLKTADGHLFEVEEPV-AMEFEIVKSFFDENDDASEDTVVPL--PNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFN 93 (176)
Q Consensus 17 i~L~s~DG~~f~V~~~~-a~~S~~l~~ll~~~~~~~~~~~Ipl--~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd 93 (176)
-.+.---|++|.+++.+ |..|++++.|+..+-.+.+ ..|.+ .+|+..+|+.|++|++.-+
T Consensus 23 ~~~~~~~~~~~~~HR~VLAa~S~YFraMF~~~~~Es~-~~v~~~~~~v~~~~l~~lldy~YTg~---------------- 85 (480)
T PHA02790 23 KTIIEAIGGNIIVNSTILKKLSPYFRTHLRQKYTKNK-DPVTRVCLDLDIHSLTSIVIYSYTGK---------------- 85 (480)
T ss_pred ceEEEEcCcEEeeehhhhhhcCHHHHHHhcCCccccc-cceEEEecCcCHHHHHHHHHhheeee----------------
Confidence 34445568899999999 4669999999986433322 23443 3799999999999997433
Q ss_pred hhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCC
Q 030521 94 EEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKS 136 (176)
Q Consensus 94 ~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks 136 (176)
+.. +.+.+.+|+.||++|+|+.+++.||+++...|.-.+
T Consensus 86 ---l~i-t~~nV~~ll~aA~~Lqi~~v~~~C~~fL~~~l~~~N 124 (480)
T PHA02790 86 ---VYI-DSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEY 124 (480)
T ss_pred ---EEE-ecccHHHHHHHHHHhChHHHHHHHHHHHHhhCCcch
Confidence 233 667799999999999999999999998877655433
No 11
>PHA03098 kelch-like protein; Provisional
Probab=98.34 E-value=3.8e-06 Score=75.18 Aligned_cols=98 Identities=16% Similarity=0.264 Sum_probs=78.1
Q ss_pred CcEEEEe-CCCCEEEecHHHHH-HhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchh
Q 030521 15 KKISLKT-ADGHLFEVEEPVAM-EFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTF 92 (176)
Q Consensus 15 ~~i~L~s-~DG~~f~V~~~~a~-~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~w 92 (176)
+.++|.- .+|+.|.+++.++. .|++++.|+.+.-. ...|.|+. +..+|+.|++|++...
T Consensus 10 ~Dv~l~~~~~~~~~~~Hk~vLaa~S~yF~~mf~~~~~---~~~i~l~~-~~~~~~~~l~y~Ytg~--------------- 70 (534)
T PHA03098 10 CDESIIIVNGGGIIKVHKIILSSSSEYFKKMFKNNFK---ENEINLNI-DYDSFNEVIKYIYTGK--------------- 70 (534)
T ss_pred CCEEEEEEcCCEEEEeHHHHHHhhhHHHHHHHhCCCC---CceEEecC-CHHHHHHHHHHhcCCc---------------
Confidence 4466664 68999999999965 69999999986433 24788887 9999999999999643
Q ss_pred hhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCC
Q 030521 93 NEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKS 136 (176)
Q Consensus 93 d~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks 136 (176)
+.. +.+.+.+|+.||++|+++.|.+.|++++...+.-.+
T Consensus 71 ----~~i-~~~~~~~ll~~A~~l~~~~l~~~C~~~l~~~l~~~n 109 (534)
T PHA03098 71 ----INI-TSNNVKDILSIANYLIIDFLINLCINYIIKIIDDNN 109 (534)
T ss_pred ----eEE-cHHHHHHHHHHHHHhCcHHHHHHHHHHHHHhCCHhH
Confidence 123 566799999999999999999999999887665444
No 12
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=98.26 E-value=4e-06 Score=76.24 Aligned_cols=96 Identities=18% Similarity=0.307 Sum_probs=79.1
Q ss_pred cEEEEeCCCCEEEecHHHH-HHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhh
Q 030521 16 KISLKTADGHLFEVEEPVA-MEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNE 94 (176)
Q Consensus 16 ~i~L~s~DG~~f~V~~~~a-~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~ 94 (176)
.+.|.-.+ +.|..++.++ ..|++++.|+...-.+.....|.+.+|+..+|+.+++|++...-
T Consensus 38 Dv~L~v~~-~~~~aHR~VLAa~S~YFraMFt~~l~e~~~~~i~l~~v~~~~l~~ll~y~Yt~~i---------------- 100 (571)
T KOG4441|consen 38 DVTLLVGD-REFPAHRVVLAACSPYFRAMFTSGLKESKQKEINLEGVDPETLELLLDYAYTGKL---------------- 100 (571)
T ss_pred eEEEEECC-eeechHHHHHHhccHHHHHHhcCCcccccceEEEEecCCHHHHHHHHHHhhcceE----------------
Confidence 35555555 9999999995 56999999999654443456899999999999999999997652
Q ss_pred hhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHH
Q 030521 95 EFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRI 132 (176)
Q Consensus 95 eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i 132 (176)
.. +.+.+-+|+.||.+|+|+.+++.||.++...+
T Consensus 101 ---~i-~~~nVq~ll~aA~~lQi~~v~~~C~~fL~~~l 134 (571)
T KOG4441|consen 101 ---EI-SEDNVQELLEAASLLQIPEVVDACCEFLESQL 134 (571)
T ss_pred ---Ee-chHhHHHHHHHHHHhhhHHHHHHHHHHHHhcC
Confidence 34 78889999999999999999999999887644
No 13
>KOG4682 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=96.46 E-value=0.008 Score=52.37 Aligned_cols=108 Identities=15% Similarity=0.224 Sum_probs=80.0
Q ss_pred eCCCCEEEecHHHHHHhHHHHHHHhcCCCCCCCc----ccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhh
Q 030521 21 TADGHLFEVEEPVAMEFEIVKSFFDENDDASEDT----VVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEF 96 (176)
Q Consensus 21 s~DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~----~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eF 96 (176)
..=|.++++++--+.||+++..|+.+.-.+.... .||=|+|+...|.-++-=+++..
T Consensus 75 ~alg~eWrlHk~yL~QS~yf~smf~Gtw~es~~~iIqleI~Dp~Id~~al~~a~gsLY~dE------------------- 135 (488)
T KOG4682|consen 75 EALGFEWRLHKPYLFQSEYFKSMFSGTWKESSMNIIQLEIPDPNIDVVALQVAFGSLYRDE------------------- 135 (488)
T ss_pred hhccceeeeeeeeeeccHHHHHHhccccChhhCceEEEEcCCCcccHHHHHHHHhhhhhhh-------------------
Confidence 3458889999988999999999988764433222 34456899999988877665422
Q ss_pred ccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521 97 GKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 97 l~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t 151 (176)
+.+ +.+.+..++.||.+|..++|++-|...+...++-|| +.+++....-+.
T Consensus 136 veI-~l~dv~gvlAaA~~lqldgl~qrC~evMie~lspkt---a~~yYea~ckYg 186 (488)
T KOG4682|consen 136 VEI-KLSDVVGVLAAACLLQLDGLIQRCGEVMIETLSPKT---ACGYYEAACKYG 186 (488)
T ss_pred eec-cHHHHHHHHHHHHHHHHhhHHHHHHHHHHHhcChhh---hhHhhhhhhhhh
Confidence 335 889999999999999999999999998877665555 455665554444
No 14
>KOG4350 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=96.26 E-value=0.037 Score=48.58 Aligned_cols=120 Identities=10% Similarity=0.198 Sum_probs=87.3
Q ss_pred CCCcEEEEeCCCCEEEecHHH-HHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCC--CCC----
Q 030521 13 ATKKISLKTADGHLFEVEEPV-AMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKK--RSP---- 85 (176)
Q Consensus 13 ~~~~i~L~s~DG~~f~V~~~~-a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~--~~~---- 85 (176)
....++++-.| +.|...+-+ |..|.+++.|+-++-.+.....|||+.-++..++.+++|++.-+-+-.- +..
T Consensus 43 ~y~DVtfvve~-~rfpAHRvILAaRs~yFRAlLYgGm~Es~q~~ipLq~t~~eAF~~lLrYiYtg~~~l~~~~ed~lld~ 121 (620)
T KOG4350|consen 43 DYSDVTFVVED-TRFPAHRVILAARSSYFRALLYGGMQESHQQLIPLQETNSEAFRALLRYIYTGKIDLAGVEEDILLDY 121 (620)
T ss_pred cccceEEEEec-cccchhhhhHHHHHHHHHHHHhhhhhhhhhcccccccccHHHHHHHHHHHhhcceecccchHHHHHHH
Confidence 34557888777 889999988 4679999999986655445578999978899999999999876554221 000
Q ss_pred ------------ccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHH
Q 030521 86 ------------KQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVR 141 (176)
Q Consensus 86 ------------~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir 141 (176)
...++++-++. +..+.+|-++.||.+.+...|.++||.++ .++..++.
T Consensus 122 LslAh~Ygf~~Le~aiSeYl~~i---L~~~NvCmifdaA~ly~l~~Lt~~C~mfm-----DrnA~~lL 181 (620)
T KOG4350|consen 122 LSLAHRYGFIQLETAISEYLKEI---LKNENVCMIFDAAYLYQLTDLTDYCMMFM-----DRNADQLL 181 (620)
T ss_pred HHHHHhcCcHHHHHHHHHHHHHH---HcccceeeeeeHHHHhcchHHHHHHHHHH-----hcCHHhhh
Confidence 01233333332 45677899999999999999999999887 55666654
No 15
>KOG2716 consensus Polymerase delta-interacting protein PDIP1 and related proteins, contain BTB/POZ domain [Inorganic ion transport and metabolism]
Probab=95.47 E-value=0.17 Score=41.23 Aligned_cols=103 Identities=17% Similarity=0.285 Sum_probs=75.4
Q ss_pred CCCcEEEEeCCCCEEEecHHHHH-HhHHHHHHHhcCCCC--CCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCcccc
Q 030521 13 ATKKISLKTADGHLFEVEEPVAM-EFEIVKSFFDENDDA--SEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEV 89 (176)
Q Consensus 13 ~~~~i~L~s~DG~~f~V~~~~a~-~S~~l~~ll~~~~~~--~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~ 89 (176)
++..++| ---|.+|.-+...+. ..+.++.|++..-.- +....|=+ .=|..=+..|+.||+.-... +
T Consensus 3 ~~~~vkL-nvGG~~F~Tsk~TLtk~dg~fk~m~e~~i~~~~d~s~~IFI-DRSpKHF~~ILNfmRdGdv~---------L 71 (230)
T KOG2716|consen 3 MSETVKL-NVGGTIFKTSKSTLTKFDGFFKTMLETDIPVEKDESGCIFI-DRSPKHFDTILNFMRDGDVD---------L 71 (230)
T ss_pred ccceEEE-ecCCeEEEeehhhhhhhhhHHHHHhhcCCccccCCcCcEEe-cCChhHHHHHHHhhhccccc---------C
Confidence 3444553 345889999998875 599999999875321 11223434 36778899999999842221 2
Q ss_pred chhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCC
Q 030521 90 KTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNK 135 (176)
Q Consensus 90 ~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gk 135 (176)
| . +.++|-+|+.-|.|..+++|+++|...++..+++.
T Consensus 72 P--------e-~~kel~El~~EA~fYlL~~Lv~~C~~~i~~~~~~~ 108 (230)
T KOG2716|consen 72 P--------E-SEKELKELLREAEFYLLDGLVELCQSAIARLIRGY 108 (230)
T ss_pred c--------c-chHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccCc
Confidence 2 3 67889999999999999999999999999987774
No 16
>KOG3433 consensus Protein involved in meiotic recombination/predicted coiled-coil protein [Cell cycle control, cell division, chromosome partitioning; General function prediction only]
Probab=93.96 E-value=0.13 Score=40.37 Aligned_cols=31 Identities=39% Similarity=0.583 Sum_probs=28.0
Q ss_pred chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521 116 NIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 116 ~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t 151 (176)
||--|.++||+.. |.-+.+||+.||||.||.
T Consensus 170 nI~il~dy~~rkf-----~~e~nqi~~~fgIPed~d 200 (203)
T KOG3433|consen 170 NIFILIDYLYRKF-----GLEPNQIRKEFGIPEDFD 200 (203)
T ss_pred hHHHHHHHHHHhc-----CCCHHHHhHhcCCCcccc
Confidence 7888999998887 999999999999999974
No 17
>KOG0783 consensus Uncharacterized conserved protein, contains ankyrin and BTB/POZ domains [Function unknown]
Probab=93.45 E-value=0.22 Score=47.42 Aligned_cols=112 Identities=18% Similarity=0.263 Sum_probs=77.0
Q ss_pred EEEEeCCCCEEEecHHHH-HHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhh
Q 030521 17 ISLKTADGHLFEVEEPVA-MEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEE 95 (176)
Q Consensus 17 i~L~s~DG~~f~V~~~~a-~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~e 95 (176)
..++.+||++|..+...+ ..+.++..|+.-.-.+.......+-.++++.|+-|+.|++.--. .+
T Consensus 713 ~~i~~KDGkvl~aHkc~L~aRlEYF~smf~~~w~E~sS~t~~~~p~~~e~m~ivLdylYs~d~---------------~~ 777 (1267)
T KOG0783|consen 713 TVIKLKDGKVLKAHKCFLSARLEYFSSMFQFVWMESSSITVNLSPLTVEHMSIVLDYLYSDDK---------------VE 777 (1267)
T ss_pred EEEEecCCcCcccceeEeeeHHHHHHHHHHHHHhhhccceeecCcchHHHHHHHHHHHHccch---------------HH
Confidence 445566999999988875 35777877776432221111222222779999999999983221 23
Q ss_pred hccC-CChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHH
Q 030521 96 FGKS-KSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKF 143 (176)
Q Consensus 96 Fl~~-~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~ 143 (176)
|++. .+.+-+|+++..|+-|=|..|-+.|-+.+-+++.=|+..++.++
T Consensus 778 ~~k~~~~~dF~~~il~iaDqlli~~Lk~Ice~~ll~kl~lk~~~~llef 826 (1267)
T KOG0783|consen 778 LFKDLKESDFMFEILSIADQLLILELKSICEQSLLRKLNLKTLPTLLEF 826 (1267)
T ss_pred HHhccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhcccchHHHHHH
Confidence 4443 25677899999999999999999998888888887776555443
No 18
>COG5124 Protein predicted to be involved in meiotic recombination [Cell division and chromosome partitioning / General function prediction only]
Probab=91.93 E-value=0.17 Score=39.51 Aligned_cols=31 Identities=39% Similarity=0.592 Sum_probs=26.6
Q ss_pred chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521 116 NIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 116 ~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t 151 (176)
+|.-|.++.|+.. +.-+++||+.||||.||.
T Consensus 174 nI~ilidy~c~kf-----~~~~~qir~~fgIPedld 204 (209)
T COG5124 174 NIEILIDYLCKKF-----FLKPEQIRKEFGIPEDLD 204 (209)
T ss_pred hHHHHHHHHHHHc-----CCCHHHHHHhcCCCcchh
Confidence 6777888888776 889999999999999874
No 19
>PF11822 DUF3342: Domain of unknown function (DUF3342); InterPro: IPR021777 This family of proteins are functionally uncharacterised. This family is found in bacteria. This presumed domain is typically between 170 to 303 amino acids in length. The N-terminal half of this family is a BTB-like domain.
Probab=90.31 E-value=0.54 Score=39.97 Aligned_cols=90 Identities=6% Similarity=0.256 Sum_probs=59.7
Q ss_pred CCEEEecHHHH-HHhHHHHHHHhcC-CCCCCCcccccC-CCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhccCC
Q 030521 24 GHLFEVEEPVA-MEFEIVKSFFDEN-DDASEDTVVPLP-NVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGKSK 100 (176)
Q Consensus 24 G~~f~V~~~~a-~~S~~l~~ll~~~-~~~~~~~~Ipl~-~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~~~ 100 (176)
.+.|..++..+ ...++++..+... +......+|++. .=+-.|++=+++|++. ..| .+
T Consensus 13 ~rdF~C~~~lL~~~M~YF~~~l~~~~~~~~~~~~idisVhCDv~iF~WLm~yv~~----~~p----------------~l 72 (317)
T PF11822_consen 13 KRDFTCPRDLLVSEMRYFAEYLSRYINDSQRWEEIDISVHCDVHIFEWLMRYVKG----EPP----------------SL 72 (317)
T ss_pred ceeeeccHHHHHHhhHHHHHHHhhcccccCcCCCcceEEecChhHHHHHHHHhhc----CCC----------------cC
Confidence 46799999996 4599999998541 111112245443 1344556666666654 211 23
Q ss_pred ChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc
Q 030521 101 SNDELKEMILVANYLNIKDMLDYLTETVANRIK 133 (176)
Q Consensus 101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~ 133 (176)
+...+..++.-|+||+|++|++.|-.|+...+.
T Consensus 73 ~~~NvvsIliSS~FL~M~~Lve~cl~y~~~~~~ 105 (317)
T PF11822_consen 73 TPSNVVSILISSEFLQMESLVEECLQYCHDHMS 105 (317)
T ss_pred CcCcEEEeEehhhhhccHHHHHHHHHHHHHhHH
Confidence 677788999999999999999999988855443
No 20
>PF02214 BTB_2: BTB/POZ domain; InterPro: IPR003131 Potassium channels are the most diverse group of the ion channel family [, ]. They are important in shaping the action potential, and in neuronal excitability and plasticity []. The potassium channel family is composed of several functionally distinct isoforms, which can be broadly separated into 2 groups []: the practically non-inactivating 'delayed' group and the rapidly inactivating 'transient' group. These are all highly similar proteins, with only small amino acid changes causing the diversity of the voltage-dependent gating mechanism, channel conductance and toxin binding properties. Each type of K+ channel is activated by different signals and conditions depending on their type of regulation: some open in response to depolarisation of the plasma membrane; others in response to hyperpolarisation or an increase in intracellular calcium concentration; some can be regulated by binding of a transmitter, together with intracellular kinases; while others are regulated by GTP-binding proteins or other second messengers []. In eukaryotic cells, K+ channels are involved in neural signalling and generation of the cardiac rhythm, act as effectors in signal transduction pathways involving G protein-coupled receptors (GPCRs) and may have a role in target cell lysis by cytotoxic T-lymphocytes []. In prokaryotic cells, they play a role in the maintenance of ionic homeostasis []. All K+ channels discovered so far possess a core of alpha subunits, each comprising either one or two copies of a highly conserved pore loop domain (P-domain). The P-domain contains the sequence (T/SxxTxGxG), which has been termed the K+ selectivity sequence. In families that contain one P-domain, four subunits assemble to form a selective pathway for K+ across the membrane. However, it remains unclear how the 2 P-domain subunits assemble to form a selective pore. The functional diversity of these families can arise through homo- or hetero-associations of alpha subunits or association with auxiliary cytoplasmic beta subunits. K+ channel subunits containing one pore domain can be assigned into one of two superfamilies: those that possess six transmembrane (TM) domains and those that possess only two TM domains. The six TM domain superfamily can be further subdivided into conserved gene families: the voltage-gated (Kv) channels; the KCNQ channels (originally known as KvLQT channels); the EAG-like K+ channels; and three types of calcium (Ca)-activated K+ channels (BK, IK and SK) []. The 2TM domain family comprises inward-rectifying K+ channels. In addition, there are K+ channel alpha-subunits that possess two P-domains. These are usually highly regulated K+ selective leak channels. The Kv family can be divided into several subfamilies on the basis of sequence similarity and function. Four of these subfamilies, Kv1 (Shaker), Kv2 (Shab), Kv3 (Shaw) and Kv4 (Shal), consist of pore-forming alpha subunits that associate with different types of beta subunit. Each alpha subunit comprises six hydrophobic TM domains with a P-domain between the fifth and sixth, which partially resides in the membrane. The fourth TM domain has positively charged residues at every third residue and acts as a voltage sensor, which triggers the conformational change that opens the channel pore in response to a displacement in membrane potential []. More recently, 4 new electrically-silent alpha subunits have been cloned: Kv5 (KCNF), Kv6 (KCNG), Kv8 and Kv9 (KCNS). These subunits do not themselves possess any functional activity, but appear to form heteromeric channels with Kv2 subunits, and thus modulate Shab channel activity []. When highly expressed, they inhibit channel activity, but at lower levels show more specific modulatory actions. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated potassium channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels []. This domain is found in a subset of a larger group of proteins that contain the BTB/POZ domain.; GO: 0005249 voltage-gated potassium channel activity, 0006813 potassium ion transport, 0008076 voltage-gated potassium channel complex, 0016020 membrane; PDB: 1NN7_A 3KVT_A 1EXB_E 1QDV_A 1DSX_E 1QDW_F 3LUT_B 3LNM_B 2A79_B 3DRY_C ....
Probab=89.27 E-value=0.29 Score=33.57 Aligned_cols=83 Identities=14% Similarity=0.310 Sum_probs=53.9
Q ss_pred CCCEEEecHHHHHH--hHHHHHHHhcC---CCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhc
Q 030521 23 DGHLFEVEEPVAME--FEIVKSFFDEN---DDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFG 97 (176)
Q Consensus 23 DG~~f~V~~~~a~~--S~~l~~ll~~~---~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl 97 (176)
-|+.|.+++..+.. ...+..++... ........+=+ +=++..++.|+.|++.. . .. +
T Consensus 6 GG~~f~~~~~tL~~~~~s~l~~~~~~~~~~~~~~~~~~~fi-DRdp~~F~~IL~ylr~~-~-~l--------~------- 67 (94)
T PF02214_consen 6 GGTIFETSRSTLTRYPDSLLARLFSGERSDDYDDDDGEYFI-DRDPELFEYILNYLRTG-G-KL--------P------- 67 (94)
T ss_dssp TTEEEEEEHHHHHTSTTSTTTSHHHTGHGGGEETTTTEEEE-SS-HHHHHHHHHHHHHT-S-SB----------------
T ss_pred CCEEEEEcHHHHhhCCCChhhhHHhhccccccCCccceEEe-ccChhhhhHHHHHHhhc-C-cc--------C-------
Confidence 48999999999863 45667777643 11112235544 47999999999999985 1 11 0
Q ss_pred cCCChHHHHHHHHhhhccchhhH-HHHH
Q 030521 98 KSKSNDELKEMILVANYLNIKDM-LDYL 124 (176)
Q Consensus 98 ~~~~~~~l~~Ll~AA~~L~I~~L-~~l~ 124 (176)
.-+...+..|...|.|.+|+.| ++.|
T Consensus 68 -~~~~~~~~~l~~Ea~fy~l~~l~i~~c 94 (94)
T PF02214_consen 68 -IPDEICLEELLEEAEFYGLDELFIEDC 94 (94)
T ss_dssp ---TTS-HHHHHHHHHHHT-HHHHBHHC
T ss_pred -CCCchhHHHHHHHHHHcCCCccccCCC
Confidence 1024457789999999999999 6654
No 21
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=84.74 E-value=1.4 Score=34.63 Aligned_cols=44 Identities=36% Similarity=0.474 Sum_probs=33.5
Q ss_pred HHHHHHHHhhhcc--chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521 103 DELKEMILVANYL--NIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 103 ~~l~~Ll~AA~~L--~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t 151 (176)
..+..+..|||.. ||..|..+|+... |.+.++|++.||||+||-
T Consensus 142 ~~~~~~~~~anrwTDNI~~l~~~~~~k~-----~~~~~~i~k~f~Ip~d~d 187 (188)
T PF03962_consen 142 EEIKIAKEAANRWTDNIFSLKSYLKKKF-----GMDEEDIRKEFGIPEDFD 187 (188)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHhc-----CCCHHHHHHHcCCccccC
Confidence 3445556677765 6777777777653 999999999999999973
No 22
>KOG2075 consensus Topoisomerase TOP1-interacting protein BTBD1 [Function unknown]
Probab=73.94 E-value=18 Score=32.65 Aligned_cols=108 Identities=11% Similarity=0.169 Sum_probs=64.8
Q ss_pred CEEEecHHHHHH-hHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhhhccCCC--------CCcc---ccchh
Q 030521 25 HLFEVEEPVAME-FEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKR--------SPKQ---EVKTF 92 (176)
Q Consensus 25 ~~f~V~~~~a~~-S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~--------~~~~---~~~~w 92 (176)
+.+..++-+... |.++..|+.+.-.+.....|.+|.|...++...+.|++--...-.++ ..+- .+..=
T Consensus 129 q~~paHk~vla~gS~VFdaMf~g~~a~~~s~ei~lpdvepaaFl~~L~flYsdev~~~~dtvi~tl~~AkKY~VpaLer~ 208 (521)
T KOG2075|consen 129 QRIPAHKLVLADGSDVFDAMFYGGLAEDASLEIRLPDVEPAAFLAFLRFLYSDEVKLAADTVITTLYAAKKYLVPALERQ 208 (521)
T ss_pred cccchhhhhhhcchHHHHHHhccCcccccCceeecCCcChhHhHHHHHHHhcchhhhhHHHHHHHHHHHHHhhhHHHHHH
Confidence 455566666544 88889998765443334689999999999999998887521110000 0000 11111
Q ss_pred hhhhccC--CChHHHHHHHHhhhccchhhHHHHHHHHHHHHH
Q 030521 93 NEEFGKS--KSNDELKEMILVANYLNIKDMLDYLTETVANRI 132 (176)
Q Consensus 93 d~eFl~~--~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i 132 (176)
--+|++. .....++.|...|..++=+.|+..|...|....
T Consensus 209 CVkflr~~l~~~naf~~L~q~A~lf~ep~Li~~c~e~id~~~ 250 (521)
T KOG2075|consen 209 CVKFLRKNLMADNAFLELFQRAKLFDEPSLISICLEVIDKSF 250 (521)
T ss_pred HHHHHHHhcCChHHHHHHHHHHHhhcCHHHHHHHHHHhhhHH
Confidence 1224442 255666666666888888888888877775433
No 23
>PF03656 Pam16: Pam16; InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=73.46 E-value=1.5 Score=32.43 Aligned_cols=39 Identities=23% Similarity=0.531 Sum_probs=22.9
Q ss_pred HHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccCCCCCC
Q 030521 131 RIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFEGVDPD 172 (176)
Q Consensus 131 ~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~~~~~~ 172 (176)
..+|+|.+|-+++|||+...++|+ |.+.|.-+|+.-+|.
T Consensus 51 ~~~~Mtl~EA~~ILnv~~~~~~ee---I~k~y~~Lf~~Nd~~ 89 (127)
T PF03656_consen 51 NSKGMTLDEARQILNVKEELSREE---IQKRYKHLFKANDPS 89 (127)
T ss_dssp ------HHHHHHHHT--G--SHHH---HHHHHHHHHHHT-CC
T ss_pred hcCCCCHHHHHHHcCCCCccCHHH---HHHHHHHHHhccCCC
Confidence 346899999999999998777654 777888888876664
No 24
>KOG0511 consensus Ankyrin repeat protein [General function prediction only]
Probab=67.82 E-value=16 Score=32.27 Aligned_cols=108 Identities=10% Similarity=0.036 Sum_probs=75.0
Q ss_pred CCCCEEEecHHHHHHhHHHHHHHhcCCCC-CCCccc---ccCCCChHHHHHHHHHHHHhhhccCCC--------------
Q 030521 22 ADGHLFEVEEPVAMEFEIVKSFFDENDDA-SEDTVV---PLPNVSAAPLSYIIEFCRAHVEFRKKR-------------- 83 (176)
Q Consensus 22 ~DG~~f~V~~~~a~~S~~l~~ll~~~~~~-~~~~~I---pl~~v~s~~L~~Iie~c~~h~~~~~~~-------------- 83 (176)
.......++...+....++..|+.+.-.+ ..+..+ .||+..+.+.+.+++|++.|+.+-++.
T Consensus 298 ~~~~RyP~hla~i~R~eyfk~mf~g~f~e~s~n~~~p~lslp~~~~~vveI~lr~lY~d~tdi~~~~A~dvll~ad~lal 377 (516)
T KOG0511|consen 298 PEEDRYPAHLARILRVEYFKSMFVGDFIESSVNDTRPGLSLPSLADVVVEIDLRNLYCDQTDIIFDVASDVLLFADKLAL 377 (516)
T ss_pred cccccccHHHHHHHHHHHHHHHhccchhhhcCCccccccccchHHHHHHHHHHHHhhcccccchHHHHhhHHHHhhHhhh
Confidence 33344777777777888888888754222 112233 478889999999999999999876541
Q ss_pred --CC------ccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc
Q 030521 84 --SP------KQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIK 133 (176)
Q Consensus 84 --~~------~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~ 133 (176)
.. ...+..|. +|+ +.-.+++++.-+--+....|=..+..++|+.+.
T Consensus 378 ~~dr~Lkt~as~~itq~~-e~i---d~y~V~dIl~~~wd~~~~rlEqfa~~~~a~hl~ 431 (516)
T KOG0511|consen 378 ADDRLLKTAASAEITQWL-ELI---DMYGVLDILEYCWDLVACRLEQFAETHEARHLL 431 (516)
T ss_pred hhhhhhhhhhhHHHHHHH-HHH---HhhhHHHHHHHHHHhhhHHHHHHHHHHHHHHHH
Confidence 00 11345552 332 334578889999999999999999999988765
No 25
>PF05409 Peptidase_C30: Coronavirus endopeptidase C30; InterPro: IPR008740 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases correspond to MEROPS peptidase family C30 (clan PA(C)). These peptidases are related to serine endopeptidases of family S1 and are restricted to RNA viruses, where they are involved in viral polyprotein processing during replication [, , ]. This entry represents the CoV M-pro which comprises three domains. Domains I and II are six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain, a globular cluster of five helices, has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad. In contrast to serine proteinases and other cysteine proteinases, which have a catalytic triad, there is no third catalytic residue present [, , , ]. ; GO: 0019082 viral protein processing; PDB: 2VJ1_B 2K7X_A 2Q6D_B 2Q6F_A 1P9U_E 1LVO_B 2AMP_B 1P9S_A 2ZU2_A 3D23_A ....
Probab=63.24 E-value=23 Score=29.77 Aligned_cols=98 Identities=13% Similarity=0.172 Sum_probs=57.0
Q ss_pred cccccCCCChHHHHHHHHHHHHhhhccCC---C---CCccccchhhhhh-ccCCChHHHHHHHHhhhccchhhHHHHHHH
Q 030521 54 TVVPLPNVSAAPLSYIIEFCRAHVEFRKK---R---SPKQEVKTFNEEF-GKSKSNDELKEMILVANYLNIKDMLDYLTE 126 (176)
Q Consensus 54 ~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~---~---~~~~~~~~wd~eF-l~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~ 126 (176)
+...++....-.=.-|+.|++-|--...+ . -..+...+|-..- +..+..+..++.+.|..-..+..|+...-+
T Consensus 161 ~~~Q~~~~~~~~T~NVvA~LYAalln~~~w~l~~~~~svedFN~WA~~N~ft~~~~~~~~~~Laa~TgVsV~rlLaaI~~ 240 (293)
T PF05409_consen 161 EVAQLEGADQYITDNVVAWLYAALLNGCPWWLSSQRISVEDFNEWAMNNGFTSFSSDDAFDMLAAKTGVSVERLLAAIQR 240 (293)
T ss_dssp SSSTTTTS-SB-HHHHHHHHHHHHHTT--TTSSSS---HHHHHHHHHHTTB--SSHHHHHHHHHHHHT--HHHHHHHHHH
T ss_pred hhhhhcCccceeehhHHHHHHHHHHcCCCeeeccceecHHHHHHHHHhCCCcccchhhHHHHHHHHhCCcHHHHHHHHHH
Confidence 34455555666677899999987655432 1 1223556664431 223377788999999999999999987665
Q ss_pred HHHHHHcCCCHHHHHHHcCCCCCCCHHHH
Q 030521 127 TVANRIKNKSVEYVRKFFGIENDFTPEEE 155 (176)
Q Consensus 127 ~iA~~i~gks~eeir~~f~i~~d~t~EEe 155 (176)
+..-..|++ |.-...+++||||++-
T Consensus 241 -L~~g~~G~~---IlG~~~ledE~Tp~~V 265 (293)
T PF05409_consen 241 -LHKGFGGRT---ILGSCSLEDEFTPEMV 265 (293)
T ss_dssp -HHHTTTTT----BTTBSS--TSS-HHHH
T ss_pred -HhcCCCCCc---ccccccccccCCHHHH
Confidence 666556665 4444557788999764
No 26
>KOG1665 consensus AFH1-interacting protein FIP2, contains BTB/POZ domain and pentapeptide repeats [General function prediction only]
Probab=60.02 E-value=25 Score=28.89 Aligned_cols=93 Identities=16% Similarity=0.226 Sum_probs=61.2
Q ss_pred CCCcEEEEeCCCCEEE--ecHHHHHH-hHHHHHHHhcCCCC---CCCcccccCCCChHHHHHHHHHHHHhhhccCCCCCc
Q 030521 13 ATKKISLKTADGHLFE--VEEPVAME-FEIVKSFFDENDDA---SEDTVVPLPNVSAAPLSYIIEFCRAHVEFRKKRSPK 86 (176)
Q Consensus 13 ~~~~i~L~s~DG~~f~--V~~~~a~~-S~~l~~ll~~~~~~---~~~~~Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~ 86 (176)
.+++++|- -.|+.|. ++.-+.+. -..+..|+.+.+.. ++...+-+ .=+..-++-|+.|+.+-.-
T Consensus 7 ~~~~vrln-igGk~f~TTidTlv~rEPDSMLa~MF~~~g~~~~~d~kGa~lI-DRsp~yFepIlNyLr~Gq~-------- 76 (302)
T KOG1665|consen 7 LSSMVRLN-IGGKKFCTTIDTLVIREPDSMLAAMFSGRGAMCQEDKKGAVLI-DRSPKYFEPILNYLRDGQI-------- 76 (302)
T ss_pred hhhhheee-cCCeEEEEeehhhcccCchHHHHHHHccCCCccccccCceEEE-ccCchhhHHHHHHHhcCce--------
Confidence 45556654 3577776 44444444 56677788766532 12234433 3678889999999986331
Q ss_pred cccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHH
Q 030521 87 QEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTE 126 (176)
Q Consensus 87 ~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~ 126 (176)
... +...+++++.+|.|++|-+|.+..-.
T Consensus 77 ----------~~~-s~i~~lgvLeeArff~i~sL~~hle~ 105 (302)
T KOG1665|consen 77 ----------PSL-SDIDCLGVLEEARFFQILSLKDHLED 105 (302)
T ss_pred ----------eec-CCccHHHHHHHhhHHhhHhHHhHHhh
Confidence 122 66778999999999999999987643
No 27
>PF07928 Vps54: Vps54-like protein; InterPro: IPR012501 This family contains various proteins that are homologues of the yeast Vps54 protein, such as the rat homologue (Q9JMK8 from SWISSPROT), the human homologue (Q86YF7 from SWISSPROT), and the mouse homologue (Q8R3X1 from SWISSPROT). In yeast, Vps54 associates with Vps52 and Vps53 proteins to form a trimolecular complex that is involved in protein transport between Golgi, endosomal, and vacuolar compartments []. All Vps54 homologues contain a coiled coil region (not found in the region featured in this family) and multiple dileucine motifs []. ; GO: 0042147 retrograde transport, endosome to Golgi; PDB: 3N1E_B 3N1B_B.
Probab=57.52 E-value=3.4 Score=30.78 Aligned_cols=47 Identities=15% Similarity=0.314 Sum_probs=0.0
Q ss_pred CCCEEEecHHHHHHhHHHHHHHhcCCCCCCCcccccCCCChHHHHHHHHHHHHhh
Q 030521 23 DGHLFEVEEPVAMEFEIVKSFFDENDDASEDTVVPLPNVSAAPLSYIIEFCRAHV 77 (176)
Q Consensus 23 DG~~f~V~~~~a~~S~~l~~ll~~~~~~~~~~~Ipl~~v~s~~L~~Iie~c~~h~ 77 (176)
||+.|.|+..++...+.|.+.+.- ...+|.+..+++.++++|++...
T Consensus 1 d~e~f~vv~s~l~ll~~l~~Y~~~--------~~~~P~~a~di~~~l~elLk~fN 47 (135)
T PF07928_consen 1 DNEKFVVVGSALLLLKMLSDYLQL--------ASNFPSLAPDILSRLLELLKLFN 47 (135)
T ss_dssp -------------------------------------------------------
T ss_pred CCCceecHHHHHHHHHHHHHHHHH--------HHHCchhHHHHHHHHHHHHHHHH
Confidence 677777777777777766665443 12355677788888888776543
No 28
>PRK05365 malonic semialdehyde reductase; Provisional
Probab=53.12 E-value=9.4 Score=29.58 Aligned_cols=35 Identities=17% Similarity=0.163 Sum_probs=27.3
Q ss_pred HHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521 107 EMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 107 ~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t 151 (176)
.|+.||..||+.. ..+.|-..+.+++.|||++++.
T Consensus 130 ~l~LaA~~~Glgs----------~~~~g~~~~~v~~~l~ip~~~~ 164 (195)
T PRK05365 130 YLILAARALGLDA----------GPMSGFDAAAVDAEFFAGTTWK 164 (195)
T ss_pred HHHHHHHHcCCcc----------CCccccCHHHHHHHhCCCCCee
Confidence 3888888888876 2445778899999999987653
No 29
>KOG3442 consensus Uncharacterized conserved protein [Function unknown]
Probab=52.17 E-value=12 Score=27.55 Aligned_cols=36 Identities=19% Similarity=0.352 Sum_probs=28.1
Q ss_pred cCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccCCCCC
Q 030521 133 KNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFEGVDP 171 (176)
Q Consensus 133 ~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~~~~~ 171 (176)
.|.|.+|-+++|||.++++.| +|.+.|.-+|+.-|+
T Consensus 54 ~~iTlqEa~qILnV~~~ln~e---ei~k~yehLFevNdk 89 (132)
T KOG3442|consen 54 GKITLQEAQQILNVKEPLNRE---EIEKRYEHLFEVNDK 89 (132)
T ss_pred ccccHHHHhhHhCCCCCCCHH---HHHHHHHHHHhccCc
Confidence 457899999999999877765 466788888886443
No 30
>PRK13696 hypothetical protein; Provisional
Probab=51.56 E-value=42 Score=21.70 Aligned_cols=26 Identities=38% Similarity=0.602 Sum_probs=20.3
Q ss_pred cCCCHHHHHHHcCCCCCCCHHHHHHHHHh
Q 030521 133 KNKSVEYVRKFFGIENDFTPEEEEAARKE 161 (176)
Q Consensus 133 ~gks~eeir~~f~i~~d~t~EEe~~i~~e 161 (176)
+..+...+.++||+ +++||.++++++
T Consensus 36 ~~~~~~~l~~~~Gi---l~dee~~e~~~~ 61 (62)
T PRK13696 36 KKGNLDKLMKAFGI---LSEEEAEELKKE 61 (62)
T ss_pred hhccHHHHHHHHCC---CCHHHHHHHHhh
Confidence 34568899999999 568888888765
No 31
>cd02148 Nitroreductase_5 Nitroreductase-like family 5. A subfamily of the nitroreductase family containing uncharacterized proteins that are similar to nitroreductase. Nitroreductase catalyzes the reduction of nitroaromatic compounds such as nitrotoluenes, nitrofurans and nitroimidazoles. This process requires NAD(P)H as electron donor in an obligatory two-electron transfer and uses FMN as cofactor. The enzyme is typically a homodimer. Members of this family are also called NADH dehydrogenase, oxygen-insensitive NAD(P)H nitrogenase or dihydropteridine reductase.
Probab=50.48 E-value=10 Score=29.02 Aligned_cols=33 Identities=18% Similarity=0.219 Sum_probs=26.1
Q ss_pred HHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCC
Q 030521 108 MILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDF 150 (176)
Q Consensus 108 Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~ 150 (176)
|+.||.-||+.. .+|.|-..+++++.|||++++
T Consensus 124 l~LaA~~lGlgs----------~~~~g~d~~~~~~~L~lp~~~ 156 (185)
T cd02148 124 FILAARALGLDA----------GPMSGFDAAAVDAEFFADTRW 156 (185)
T ss_pred HHHHHHHcCCCc----------CCCcccCHHHHHHHhCCCCCe
Confidence 778888887765 345677889999999998776
No 32
>KOG4591 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=49.17 E-value=33 Score=27.76 Aligned_cols=57 Identities=12% Similarity=0.098 Sum_probs=33.6
Q ss_pred cccCCCChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHH
Q 030521 56 VPLPNVSAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVAN 130 (176)
Q Consensus 56 Ipl~~v~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~ 130 (176)
.-+..++-.++-..++|.+...-+ |+. +...|.+|+.-||.++++.|.+.|-+-+..
T Consensus 108 ~~~dDad~Ea~~t~iRWIYTDEid-----------------fk~-dD~~L~el~e~An~FqLe~Lke~C~k~l~a 164 (280)
T KOG4591|consen 108 LDLDDADFEAFHTAIRWIYTDEID-----------------FKE-DDEFLLELCELANRFQLELLKERCEKGLGA 164 (280)
T ss_pred hcccccCHHHHHHhheeeeccccc-----------------ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 334566667777777776532211 233 556666777777777777776666655533
No 33
>PF11978 MVP_shoulder: Shoulder domain; InterPro: IPR021870 This domain is found in the Major Vault Protein and has been called the shoulder domain []. This family includes two bacterial proteins A6FXE2 from SWISSPROT and A1ZGE7 from SWISSPROT. This suggests that some bacteria may possess vault particles. ; PDB: 2ZUO_G 2QZV_B 2ZV5_c 2ZV4_Y.
Probab=46.02 E-value=18 Score=26.47 Aligned_cols=27 Identities=19% Similarity=0.314 Sum_probs=24.6
Q ss_pred HHhhhccchhhHHHHHHHHHHHHHcCC
Q 030521 109 ILVANYLNIKDMLDYLTETVANRIKNK 135 (176)
Q Consensus 109 l~AA~~L~I~~L~~l~~~~iA~~i~gk 135 (176)
-.|+..+.++..+-.+|+.||++|+|.
T Consensus 37 ~~~~k~F~VpDFVGd~Ck~iaSRIR~a 63 (118)
T PF11978_consen 37 EDAAKLFSVPDFVGDACKAIASRIRGA 63 (118)
T ss_dssp HHHHHTTSSTTHHHHHHHHHHHHHHHH
T ss_pred hHHHHhcCCcchHHHHHHHHHHHHHHH
Confidence 378999999999999999999999974
No 34
>PF12556 CobS_N: Cobaltochelatase CobS subunit N terminal
Probab=44.59 E-value=13 Score=21.32 Aligned_cols=12 Identities=50% Similarity=1.027 Sum_probs=10.4
Q ss_pred HHHHcCCCCCCC
Q 030521 140 VRKFFGIENDFT 151 (176)
Q Consensus 140 ir~~f~i~~d~t 151 (176)
+|+.|||..|++
T Consensus 14 vre~FGiDsDm~ 25 (36)
T PF12556_consen 14 VREVFGIDSDMK 25 (36)
T ss_pred HHHhcCCCcCCe
Confidence 689999999975
No 35
>PF10892 DUF2688: Protein of unknown function (DUF2688); InterPro: IPR024392 Members of this protein family are annotated as KleB, and may play a role in the regulation of transcription in plasmids.
Probab=44.47 E-value=27 Score=22.11 Aligned_cols=25 Identities=32% Similarity=0.478 Sum_probs=18.7
Q ss_pred HHHHHHHcC-CCCC-CCHHHHHHHHHh
Q 030521 137 VEYVRKFFG-IEND-FTPEEEEAARKE 161 (176)
Q Consensus 137 ~eeir~~f~-i~~d-~t~EEe~~i~~e 161 (176)
.+++|+.|| |=.+ +||||..+|.+-
T Consensus 30 ad~lr~klG~IC~~CitpEE~~~I~e~ 56 (60)
T PF10892_consen 30 ADDLRVKLGGICGDCITPEEDREILEA 56 (60)
T ss_pred hHHHHHHHcchhhccCCHHHHHHHHHH
Confidence 577887776 5555 899999998653
No 36
>KOG3840 consensus Uncharaterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=39.76 E-value=46 Score=28.60 Aligned_cols=67 Identities=16% Similarity=0.326 Sum_probs=41.4
Q ss_pred ccCcCCCCcEEEEeCCCCEEEecHHHHHHhH--HHHHHHhcCCC---CCCCccccc-CCCChHHHHHHHHHHHH
Q 030521 8 SCSAQATKKISLKTADGHLFEVEEPVAMEFE--IVKSFFDENDD---ASEDTVVPL-PNVSAAPLSYIIEFCRA 75 (176)
Q Consensus 8 ~~~~~~~~~i~L~s~DG~~f~V~~~~a~~S~--~l~~ll~~~~~---~~~~~~Ipl-~~v~s~~L~~Iie~c~~ 75 (176)
+|.+.+...+++ --||..|.|+..++.-++ .+-+|+..+.. .++.....+ .++++.+++.|++|...
T Consensus 89 G~~pg~~~~~t~-lvd~~rf~v~q~llt~~p~Tmlg~mf~~g~~f~~pNErgEyeVAdGi~s~vFRAILdYYks 161 (438)
T KOG3840|consen 89 GCSPGEGDKVCL-LVDQTRFLVSQRLLTSKPDTMLGRMFSMGADLVSPNERDEFEVADGMTSSCFRAILDYYQS 161 (438)
T ss_pred CCCCCCCcceEE-EeeeEEEEeeeeeecCCcchhhhhhhcccccccCCCcCCceehhcchhHHHHHHHHHHHhc
Confidence 444444554444 468999999998875533 34445543211 112334544 47999999999998664
No 37
>PF10075 PCI_Csn8: COP9 signalosome, subunit CSN8; InterPro: IPR019280 The photomorphogenic 9 (COP9) signalosome or CSN complex is composed of eight subunits: Cops1/GPS1, Cops2, Cops3, Cops4, Cops5, Cop6, Cops7 (Cops7A or Cops7B) and Cops8. In the complex, Cops8, which is the smallest subunit, probably interacts directly with Cops3, Cops4 and Cops7 (Cops7A or Cops7B). This signalosome is homologous to the lid subcomplex of the 26S proteasome and regulates the ubiquitin-proteasome pathway. It functions as a structural scaffold for subunit-subunit interactions within the complex and is a key regulator of photomorphogenic development [].; PDB: 1RZ4_A.
Probab=39.30 E-value=71 Score=23.38 Aligned_cols=48 Identities=19% Similarity=0.369 Sum_probs=30.9
Q ss_pred hhhHHHHHH----HHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHhcccccCC
Q 030521 117 IKDMLDYLT----ETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKEYEWAFEG 168 (176)
Q Consensus 117 I~~L~~l~~----~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e~~w~~~~ 168 (176)
|.+|.+..- ..|+.-.+.-+...+.+++|++ ++|..++-+...|-.++
T Consensus 76 v~~~~~~iR~~i~~~i~~aY~sIs~~~la~~Lg~~----~~el~~~~~~~gW~~d~ 127 (143)
T PF10075_consen 76 VPGFEDTIRERIAHLISKAYSSISLSDLAEMLGLS----EEELEKFIKSRGWTVDG 127 (143)
T ss_dssp STTHHHHHHHHHHHHHHHH-SEE-HHHHHHHTTS-----HHHHHHHHHHHT-EE--
T ss_pred HHHHHHHHHHHHHHHHHHHHhHcCHHHHHHHhCCC----HHHHHHHHHHcCCEECC
Confidence 444555544 4445555667899999999985 77888888888999874
No 38
>PRK14981 DNA-directed RNA polymerase subunit F; Provisional
Probab=38.90 E-value=1.2e+02 Score=21.72 Aligned_cols=45 Identities=18% Similarity=0.284 Sum_probs=31.9
Q ss_pred chhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCC-CCCHHHHHHHHHh
Q 030521 116 NIKDMLDYLTETVANRIKNKSVEYVRKFFGIEN-DFTPEEEEAARKE 161 (176)
Q Consensus 116 ~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~-d~t~EEe~~i~~e 161 (176)
++..|-+.....|++. .=.|++|+|.+|.-.. -+++|+.++|-..
T Consensus 61 ~~~~l~e~~a~~I~nL-~P~~~dElrai~~~~~~~~~~e~l~~ILd~ 106 (112)
T PRK14981 61 ELEKMKEKTAVKIADI-LPETRDELRAIFAKERYTLSPEELDEILDI 106 (112)
T ss_pred HccCCCHHHHHHHHhc-CCCCHHHHHHHHHHhccCCCHHHHHHHHHH
Confidence 4444566666666664 5688999999987653 3889998888653
No 39
>PF15063 TC1: Thyroid cancer protein 1
Probab=38.71 E-value=33 Score=23.10 Aligned_cols=31 Identities=23% Similarity=0.353 Sum_probs=17.3
Q ss_pred HHHHHHcCCCHHHHHHHcCCCCCCCHHHHHH
Q 030521 127 TVANRIKNKSVEYVRKFFGIENDFTPEEEEA 157 (176)
Q Consensus 127 ~iA~~i~gks~eeir~~f~i~~d~t~EEe~~ 157 (176)
.+|+++++-+..+|+.+|.-..|--.||-.+
T Consensus 29 asaNIFe~vn~~qlqrLF~~sGD~kAEeRA~ 59 (79)
T PF15063_consen 29 ASANIFENVNLDQLQRLFQKSGDKKAEERAR 59 (79)
T ss_pred hhhhhhhccCHHHHHHHHHHccchhHHHHHH
Confidence 4455555666666666666555555554443
No 40
>smart00385 CYCLIN domain present in cyclins, TFIIB and Retinoblastoma. A helical domain present in cyclins and TFIIB (twice) and Retinoblastoma (once). A protein recognition domain functioning in cell-cycle and transcription control.
Probab=38.23 E-value=97 Score=19.11 Aligned_cols=41 Identities=17% Similarity=0.159 Sum_probs=26.6
Q ss_pred hhhHHHHHHHHHHHHHcCC--CHHHHHHHcCCCCCCCHHHHHHHHH
Q 030521 117 IKDMLDYLTETVANRIKNK--SVEYVRKFFGIENDFTPEEEEAARK 160 (176)
Q Consensus 117 I~~L~~l~~~~iA~~i~gk--s~eeir~~f~i~~d~t~EEe~~i~~ 160 (176)
-+.++-.+|-++|.++.+. +..++..+.|. +|.++..+..+
T Consensus 37 ~~~~ia~a~l~lA~k~~~~~~~~~~~~~~~~~---~~~~~i~~~~~ 79 (83)
T smart00385 37 SPSLIAAAALYLAAKTEEIPPWTKELVHYTGY---FTEEEILRMEK 79 (83)
T ss_pred CHHHHHHHHHHHHHHHhcCCCCchhHhHhhCC---CCHHHHHHHHH
Confidence 3455666777888888774 45677666665 67766655543
No 41
>PF11460 DUF3007: Protein of unknown function (DUF3007); InterPro: IPR021562 This is a family of uncharacterised proteins found in bacteria and eukaryotes.
Probab=37.18 E-value=34 Score=24.41 Aligned_cols=24 Identities=33% Similarity=0.441 Sum_probs=17.0
Q ss_pred CHHHHHHHcCCCCCCCHHHHHHHHHhc
Q 030521 136 SVEYVRKFFGIENDFTPEEEEAARKEY 162 (176)
Q Consensus 136 s~eeir~~f~i~~d~t~EEe~~i~~e~ 162 (176)
+.++|.+.|. .+||||.+++.+|-
T Consensus 79 ~~~~lqkRle---~l~~eE~~~L~~ei 102 (104)
T PF11460_consen 79 TNEELQKRLE---ELSPEELEALQAEI 102 (104)
T ss_pred hHHHHHHHHH---hCCHHHHHHHHHHh
Confidence 3456777765 58899988887763
No 42
>KOG4350 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=36.62 E-value=34 Score=30.62 Aligned_cols=33 Identities=15% Similarity=0.221 Sum_probs=15.7
Q ss_pred ChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc
Q 030521 101 SNDELKEMILVANYLNIKDMLDYLTETVANRIK 133 (176)
Q Consensus 101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~ 133 (176)
+.+.+++.+.-|+..+.+.|...++.++-..++
T Consensus 114 ~ed~lld~LslAh~Ygf~~Le~aiSeYl~~iL~ 146 (620)
T KOG4350|consen 114 EEDILLDYLSLAHRYGFIQLETAISEYLKEILK 146 (620)
T ss_pred hHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHc
Confidence 344444555555555555554444444444433
No 43
>PF13921 Myb_DNA-bind_6: Myb-like DNA-binding domain; PDB: 1A5J_A 1MBH_A 1GV5_A 1H89_C 1IDY_A 1MBK_A 1IDZ_A 1H88_C 1GVD_A 1MBG_A ....
Probab=35.39 E-value=64 Score=19.69 Aligned_cols=34 Identities=18% Similarity=0.290 Sum_probs=22.0
Q ss_pred HHHHHHHHcCCCHHHHHHHcCC------CC-CCCHHHHHHH
Q 030521 125 TETVANRIKNKSVEYVRKFFGI------EN-DFTPEEEEAA 158 (176)
Q Consensus 125 ~~~iA~~i~gks~eeir~~f~i------~~-d~t~EEe~~i 158 (176)
=..||..+..+|+.+++..|.- .. .+|+||.+++
T Consensus 20 W~~Ia~~l~~Rt~~~~~~r~~~~l~~~~~~~~wt~eEd~~L 60 (60)
T PF13921_consen 20 WKKIAEHLGNRTPKQCRNRWRNHLRPKISRGPWTKEEDQRL 60 (60)
T ss_dssp HHHHHHHSTTS-HHHHHHHHHHTTSTTSTSSSSSHHHHHHH
T ss_pred HHHHHHHHCcCCHHHHHHHHHHHCcccccCCCcCHHHHhcC
Confidence 3577888834999999876432 22 4788887654
No 44
>PF11197 DUF2835: Protein of unknown function (DUF2835); InterPro: IPR021363 This is a bacterial family of uncharacterised proteins. One member of this family (A4VM42 from SWISSPROT) is annotated as the A subunit of Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV).
Probab=35.21 E-value=46 Score=21.83 Aligned_cols=25 Identities=16% Similarity=0.244 Sum_probs=21.2
Q ss_pred cCCCCcEEEEeCCCCEEEecHHHHH
Q 030521 11 AQATKKISLKTADGHLFEVEEPVAM 35 (176)
Q Consensus 11 ~~~~~~i~L~s~DG~~f~V~~~~a~ 35 (176)
-+..+.|.+++.||..+.+|....+
T Consensus 17 ~G~a~~V~v~s~~Gr~v~~Pa~~lR 41 (68)
T PF11197_consen 17 QGAASKVVVRSDDGRRVQFPARHLR 41 (68)
T ss_pred cccccEEEEEecCCcEEEEeHHHCc
Confidence 3567889999999999999998643
No 45
>PRK11053 dihydropteridine reductase; Provisional
Probab=35.11 E-value=30 Score=27.16 Aligned_cols=36 Identities=22% Similarity=0.337 Sum_probs=25.2
Q ss_pred HHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCC-CC
Q 030521 105 LKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIEN-DF 150 (176)
Q Consensus 105 l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~-d~ 150 (176)
+..|+.||..||+.+- .|.|-..+.+++.|||+. ++
T Consensus 147 ~~~lmLaA~~~Glgs~----------~i~g~~~~~v~~~l~ip~~~~ 183 (217)
T PRK11053 147 LGNLLLGAAALGIDAT----------PIEGFDAAILDAEFGLREKGL 183 (217)
T ss_pred HHHHHHHHHHcCCCCC----------CcCCcCHHHHHHHhCCCCcCc
Confidence 3466666666666543 445678899999999984 45
No 46
>PF06851 DUF1247: Protein of unknown function (DUF1247); InterPro: IPR009657 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf2; it is a family of uncharacterised viral proteins.
Probab=33.27 E-value=1.5e+02 Score=22.43 Aligned_cols=72 Identities=17% Similarity=0.249 Sum_probs=41.1
Q ss_pred HHHHhhhccCCCCCccccchhhhhhccC-CChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcC-CCHHHHHHHcCCC
Q 030521 72 FCRAHVEFRKKRSPKQEVKTFNEEFGKS-KSNDELKEMILVANYLNIKDMLDYLTETVANRIKN-KSVEYVRKFFGIE 147 (176)
Q Consensus 72 ~c~~h~~~~~~~~~~~~~~~wd~eFl~~-~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~g-ks~eeir~~f~i~ 147 (176)
||.+|+.+--...+.....++ -+|++. +..-.+.+|. =.||.++. ....++.+-..+.| .|..++-..+|-+
T Consensus 61 rC~FHkkyif~~~~~~~~deY-v~fLNSeMGiIS~VElY--Y~yLs~~~-wk~~a~~~l~dLTgF~s~~~LL~~Yny~ 134 (148)
T PF06851_consen 61 RCDFHKKYIFTKDRKTHYDEY-VEFLNSEMGIISFVELY--YTYLSVDF-WKITAKRVLRDLTGFESLKDLLTHYNYE 134 (148)
T ss_pred hhhHhHhhhccCCcccCHHHH-HHHHccccchhhhHHHH--HHHHcccc-cHHHHHHHHHHhhCcccHHHHHHHcCCC
Confidence 788887753322233334444 466664 1223333332 24555544 66677777777777 6788888887765
No 47
>cd00068 GGL G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors
Probab=32.96 E-value=59 Score=20.32 Aligned_cols=20 Identities=10% Similarity=0.363 Sum_probs=15.5
Q ss_pred ChHHHHHHHHHHHHhhhccC
Q 030521 62 SAAPLSYIIEFCRAHVEFRK 81 (176)
Q Consensus 62 ~s~~L~~Iie~c~~h~~~~~ 81 (176)
.|.....++.||+.|....+
T Consensus 22 vS~a~~~l~~y~e~~~~~Dp 41 (57)
T cd00068 22 VSKAAAELLKYCEQNAENDP 41 (57)
T ss_pred HHHHHHHHHHHHHhcCCCCC
Confidence 36778889999999976543
No 48
>TIGR02698 CopY_TcrY copper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
Probab=32.63 E-value=1.5e+02 Score=21.60 Aligned_cols=38 Identities=18% Similarity=0.108 Sum_probs=23.6
Q ss_pred HHHHHHHHHcCCCHHHHHHHcCCCCCCCHHHHHHHHHh
Q 030521 124 LTETVANRIKNKSVEYVRKFFGIENDFTPEEEEAARKE 161 (176)
Q Consensus 124 ~~~~iA~~i~gks~eeir~~f~i~~d~t~EEe~~i~~e 161 (176)
..+.+.+.+-|.+...+-..|--.+.+++||.+++++-
T Consensus 76 ~~~~~~~~~f~gs~~~ll~~l~~~~~ls~eele~L~~l 113 (130)
T TIGR02698 76 AAQELFSRICSRKVGAVIADLIEESPLSQTDIEKLEKL 113 (130)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHhcCCCCHHHHHHHHHH
Confidence 34455555555565555444433467999999999873
No 49
>PF10415 FumaraseC_C: Fumarase C C-terminus; InterPro: IPR018951 Fumarase C catalyses the stereo-specific interconversion of fumarate to L-malate as part of the Krebs cycle. The full-length protein forms a tetramer with visible globular shape. FumaraseC_C is the C-terminal 65 residues referred to as domain 3. The core of the molecule consists of a bundle of 20 alpha-helices from the five-helix bundle of domain 2. The projections from the core of the tetramer are generated from domains 1 and 3 of each subunit []. This entry does not appear to be part of either the active site or the activation site but is helical in structure forming a little bundle. ; GO: 0016829 lyase activity, 0006099 tricarboxylic acid cycle; PDB: 3RRP_A 3OCE_D 3OCF_D 3E04_B 3GTD_A 3R6V_F 3R6Q_F 1J3U_B 1FUR_A 1YFE_A ....
Probab=32.26 E-value=1.1e+02 Score=18.93 Aligned_cols=23 Identities=30% Similarity=0.268 Sum_probs=16.0
Q ss_pred cCCCHHHHHHHcCCCCCCCHHHHHHH
Q 030521 133 KNKSVEYVRKFFGIENDFTPEEEEAA 158 (176)
Q Consensus 133 ~gks~eeir~~f~i~~d~t~EEe~~i 158 (176)
.|+|..|+-.-.|+ +|++|.+++
T Consensus 24 ~g~svre~v~~~g~---lt~ee~d~l 46 (55)
T PF10415_consen 24 EGRSVREVVLEEGL---LTEEELDEL 46 (55)
T ss_dssp HT--HHHHHHHTTS---S-HHHHHHH
T ss_pred cCCCHHHHHHHcCC---CCHHHHHHH
Confidence 69999999877775 788887775
No 50
>cd00167 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Probab=31.86 E-value=51 Score=18.11 Aligned_cols=20 Identities=15% Similarity=0.338 Sum_probs=16.0
Q ss_pred HHHHHHHcCCCHHHHHHHcC
Q 030521 126 ETVANRIKNKSVEYVRKFFG 145 (176)
Q Consensus 126 ~~iA~~i~gks~eeir~~f~ 145 (176)
..||..+.++|..+++..+.
T Consensus 23 ~~Ia~~~~~rs~~~~~~~~~ 42 (45)
T cd00167 23 EKIAKELPGRTPKQCRERWR 42 (45)
T ss_pred HHHHhHcCCCCHHHHHHHHH
Confidence 56777888899999988763
No 51
>PF11116 DUF2624: Protein of unknown function (DUF2624); InterPro: IPR020277 This entry contains proteins with no known function.
Probab=31.47 E-value=1.2e+02 Score=20.92 Aligned_cols=27 Identities=19% Similarity=0.431 Sum_probs=13.6
Q ss_pred HHHcCCCHHHHHHH---cCCCCCCCHHHHHHH
Q 030521 130 NRIKNKSVEYVRKF---FGIENDFTPEEEEAA 158 (176)
Q Consensus 130 ~~i~gks~eeir~~---f~i~~d~t~EEe~~i 158 (176)
.+++..|++|+.++ |||+ +|+++.+.|
T Consensus 10 ~Kln~iT~~eLlkyskqy~i~--it~~QA~~I 39 (85)
T PF11116_consen 10 QKLNNITAKELLKYSKQYNIS--ITKKQAEQI 39 (85)
T ss_pred HHHhcCCHHHHHHHHHHhCCC--CCHHHHHHH
Confidence 34455555555554 5553 555544444
No 52
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=31.38 E-value=38 Score=18.58 Aligned_cols=13 Identities=31% Similarity=0.437 Sum_probs=9.3
Q ss_pred cCCCHHHHHHHcC
Q 030521 133 KNKSVEYVRKFFG 145 (176)
Q Consensus 133 ~gks~eeir~~f~ 145 (176)
.|-|.+|+|+|+.
T Consensus 15 ~Gls~eeir~FL~ 27 (30)
T PF08671_consen 15 SGLSKEEIREFLE 27 (30)
T ss_dssp TT--HHHHHHHHH
T ss_pred cCCCHHHHHHHHH
Confidence 6899999999874
No 53
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=30.52 E-value=1.1e+02 Score=20.58 Aligned_cols=45 Identities=13% Similarity=0.137 Sum_probs=27.9
Q ss_pred hhhHHHHHHHHHHHHH-cC-CCHHHHHHHcC----CCCCCCHHHHHHHHHh
Q 030521 117 IKDMLDYLTETVANRI-KN-KSVEYVRKFFG----IENDFTPEEEEAARKE 161 (176)
Q Consensus 117 I~~L~~l~~~~iA~~i-~g-ks~eeir~~f~----i~~d~t~EEe~~i~~e 161 (176)
|..|+.+..+|-++-- .| -+.+|+|+++. +.+.+|++|..++.++
T Consensus 9 ~~~~i~~F~~y~~~~~~~g~Is~~EL~~~l~~~~~lg~k~t~~ev~~m~~~ 59 (88)
T cd05029 9 IGLLVAIFHKYSGREGDKNTLSKKELKELIQKELTIGSKLQDAEIAKLMED 59 (88)
T ss_pred HHHHHHHHHHHHccCCCCCEECHHHHHHHHHHHHhcCCCCCHHHHHHHHHH
Confidence 4455666666665321 23 37888888883 4455788887776554
No 54
>PF02519 Auxin_inducible: Auxin responsive protein; InterPro: IPR003676 This family consists of the protein products of a gene cluster that encodes a group of auxin-regulated RNAs (small auxin up RNAs, SAURs) []. Proteins from this ARG7 auxin responsive genes family have no identified functional role [].
Probab=30.32 E-value=1.9e+02 Score=20.20 Aligned_cols=62 Identities=8% Similarity=0.198 Sum_probs=41.0
Q ss_pred CcCCCCcEEEEeCC-CCEEEecHHHHHHhHHHHHHHhcCCC---CCCCcccccCCCChHHHHHHHHHH
Q 030521 10 SAQATKKISLKTAD-GHLFEVEEPVAMEFEIVKSFFDENDD---ASEDTVVPLPNVSAAPLSYIIEFC 73 (176)
Q Consensus 10 ~~~~~~~i~L~s~D-G~~f~V~~~~a~~S~~l~~ll~~~~~---~~~~~~Ipl~~v~s~~L~~Iie~c 73 (176)
.+.+...+.+...+ .+.|.|+...+. .++++.+|+.... -.++.+|.|| -+...++.|+..+
T Consensus 34 ~~vp~G~~~VyVG~~~~Rfvvp~~~L~-hp~f~~LL~~aeeEfG~~~~G~l~iP-C~~~~Fe~~l~~l 99 (100)
T PF02519_consen 34 SDVPKGHFAVYVGEERRRFVVPVSYLN-HPLFQELLEQAEEEFGFDQDGPLTIP-CDVVLFEHLLWLL 99 (100)
T ss_pred CCCCCCeEEEEeCccceEEEechHHcC-chhHHHHHHHHhhhcCcCCCCcEEee-CCHHHHHHHHHHh
Confidence 33445555555555 999999999754 4667777764321 1134578887 7888888887654
No 55
>smart00717 SANT SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
Probab=30.14 E-value=57 Score=18.18 Aligned_cols=20 Identities=20% Similarity=0.433 Sum_probs=16.0
Q ss_pred HHHHHHHcCCCHHHHHHHcC
Q 030521 126 ETVANRIKNKSVEYVRKFFG 145 (176)
Q Consensus 126 ~~iA~~i~gks~eeir~~f~ 145 (176)
..||..+.++|+.+++..+.
T Consensus 25 ~~Ia~~~~~rt~~~~~~~~~ 44 (49)
T smart00717 25 EKIAKELPGRTAEQCRERWN 44 (49)
T ss_pred HHHHHHcCCCCHHHHHHHHH
Confidence 56777788999999988764
No 56
>TIGR02476 BluB cob(II)yrinic acid a,c-diamide reductase. The BluB protein is related to the nitroreductase family (pfam0881), and is found to substitute for the FldA cobalt reductase in aerobic species (by phylogenetic profiling). BluB is also found to be preceded by B12 promoter elements elements. The gene in R. capsulatus has been shown to be essential for cobalamin biosynthesis.
Probab=30.05 E-value=42 Score=26.11 Aligned_cols=38 Identities=13% Similarity=0.247 Sum_probs=27.3
Q ss_pred HHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCCC
Q 030521 104 ELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 104 ~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~t 151 (176)
.+..|+.||..||+-.- ++.+-..+.+++.|||+.+++
T Consensus 133 A~~nl~LaA~~~GlGs~----------~~~~~~~~~v~~~L~lp~~~~ 170 (205)
T TIGR02476 133 AIQNLWLAARAEGLGVG----------WVSILDPDAVRRLLGVPEGWR 170 (205)
T ss_pred HHHHHHHHHHHCCCcce----------eecccChHHHHHHhCcCCCce
Confidence 44578888888888752 223445789999999998754
No 57
>PF13798 PCYCGC: Protein of unknown function with PCYCGC motif
Probab=27.63 E-value=89 Score=24.02 Aligned_cols=23 Identities=22% Similarity=0.313 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHcCCCHHHHHHHc
Q 030521 122 DYLTETVANRIKNKSVEYVRKFF 144 (176)
Q Consensus 122 ~l~~~~iA~~i~gks~eeir~~f 144 (176)
+.+...+...=+|||+.|||++.
T Consensus 119 ~ia~~a~~~~~~Gks~~eIR~~I 141 (158)
T PF13798_consen 119 DIAVQAVQMYQEGKSPKEIRQYI 141 (158)
T ss_pred HHHHHHHHHHHcCCCHHHHHHHH
Confidence 44455555566899999999863
No 58
>PRK09296 cysteine desufuration protein SufE; Provisional
Probab=27.62 E-value=1e+02 Score=23.01 Aligned_cols=28 Identities=14% Similarity=0.258 Sum_probs=19.6
Q ss_pred HHHHHHHHcCCCHHHHHHH--------cCCCCCCCH
Q 030521 125 TETVANRIKNKSVEYVRKF--------FGIENDFTP 152 (176)
Q Consensus 125 ~~~iA~~i~gks~eeir~~--------f~i~~d~t~ 152 (176)
+..+...+.|+|++||.++ +|+..-+||
T Consensus 82 ~alL~~~~~g~tp~eIl~~d~~~~~~~lGL~~~LSp 117 (138)
T PRK09296 82 IAVVFILYQQMTPQDIVNFDVRPWFEKLALTQHLTP 117 (138)
T ss_pred HHHHHHHHcCCCHHHHHhCChHHHHHHcCcccccCc
Confidence 4445556689999998752 577766776
No 59
>PF11198 DUF2857: Protein of unknown function (DUF2857); InterPro: IPR021364 This is a bacterial family of uncharacterised proteins.
Probab=26.94 E-value=2.4e+02 Score=21.82 Aligned_cols=63 Identities=25% Similarity=0.303 Sum_probs=35.7
Q ss_pred ChHHHHHHHHhhh-----ccchhhHHHH-HHHHHHHHHcCCCHHHH---HHHcCCCCC------CCHHHHHHHHHhcccc
Q 030521 101 SNDELKEMILVAN-----YLNIKDMLDY-LTETVANRIKNKSVEYV---RKFFGIEND------FTPEEEEAARKEYEWA 165 (176)
Q Consensus 101 ~~~~l~~Ll~AA~-----~L~I~~L~~l-~~~~iA~~i~gks~eei---r~~f~i~~d------~t~EEe~~i~~e~~w~ 165 (176)
|.+.|-.++.-|. .--|+.++.+ ++..+-..+-|.|+.|+ |.++||+.. .|+|++.+| =+.|-
T Consensus 59 n~~~l~~~L~~~~~~~~~~~~idr~L~lGAS~~mm~~~FGls~~ev~~rR~llgi~~~~GR~~~~~ee~~~~i--W~~W~ 136 (180)
T PF11198_consen 59 NHDVLWRLLEQARREQQEQQLIDRALRLGASIEMMQRLFGLSSAEVAARRRLLGIPVRKGRPPALSEEEEAAI--WRRWQ 136 (180)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHCCCHHHHHHHHHHhCCCCCCCCCCCcCHHHHHHH--HHHHH
Confidence 4444444444443 2334444444 34555666678887776 777888864 355555555 45565
No 60
>cd02144 iodotyrosine_dehalogenase Iodotyrosine dehalogenase catalyzes the removal of iodine from the 3, 5 positions of L-tyosine in thyroid, liver and kidney, using NADPH as electron donor. This enzyme is a homolog of the nitroreductase family. These enzymes are usually homodimers.
Probab=26.91 E-value=53 Score=24.81 Aligned_cols=38 Identities=5% Similarity=0.141 Sum_probs=26.3
Q ss_pred HHHHHHhhhccchhhHHHHHHHHHHHHHcCCCH-HHHHHHcCCCCCCCH
Q 030521 105 LKEMILVANYLNIKDMLDYLTETVANRIKNKSV-EYVRKFFGIENDFTP 152 (176)
Q Consensus 105 l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~-eeir~~f~i~~d~t~ 152 (176)
+..|+.||.-||+... .+.+... +.+++.|||++++.+
T Consensus 129 ~~~l~LaA~~~Glgs~----------~~~~~~~~~~v~~~l~ip~~~~~ 167 (193)
T cd02144 129 CGLLLAALQNAGLGTV----------TTTPLNMGPFLRRLLGRPANEKL 167 (193)
T ss_pred HHHHHHHHHHCCCcee----------ecCCcchhHHHHHHhCCCCCCce
Confidence 3457888888888762 2234344 899999999986543
No 61
>PF00382 TFIIB: Transcription factor TFIIB repeat; InterPro: IPR013150 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. In eukaryotes, transcription initiation of all protein encoding genes involves the polymerase II system. This sytem is modulated by both general and specific transcription factors. The general factors (which include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIG and TFIIH) operate through common promoter elements, such as the TATA box. Transcription factor IIB (TFIIB) is of central importance in transcription of class II genes. It associates with TFIID-TFIIA bound to DNA (the DA complex) to form a ternary TFIID-IIA-IBB (DAB) complex, which is recognised by RNA polymerase II [, ]. TFIIB comprises ~315-340 residues and contains an imperfect C-terminal repeat of a 75-residue domain that may contribute to the symmetry of the folded protein. The basal archaeal transcription machinery resembles that of the eukaryotic polymerase II system and includes a homologue of TFIIB []. This entry represents a cyclin-like domain which is found repeated in the C-terminal region of a variety of eukaryotic TFIIB's and their archaeal counterparts. These domains individually form the typical cyclin fold, and in the transcription complex they straddle the C-terminal region of the TATA-binding protein - an interaction essential for the formation of the transcription initiation complex [, ].; GO: 0003743 translation initiation factor activity, 0006413 translational initiation; PDB: 2PHG_A 1C9B_Q 1TFB_A 1VOL_A 3K7A_M 1AIS_B 1D3U_B.
Probab=26.71 E-value=1.6e+02 Score=18.61 Aligned_cols=30 Identities=10% Similarity=0.216 Sum_probs=23.4
Q ss_pred hhHHHHHHHHHHHHHcC--CCHHHHHHHcCCC
Q 030521 118 KDMLDYLTETVANRIKN--KSVEYVRKFFGIE 147 (176)
Q Consensus 118 ~~L~~l~~~~iA~~i~g--ks~eeir~~f~i~ 147 (176)
+.-+-.+|-|+|.+..| +|..||.+.+||.
T Consensus 36 ~~~iaAA~iY~acr~~~~~~t~~eIa~~~~Vs 67 (71)
T PF00382_consen 36 PESIAAACIYLACRLNGVPRTLKEIAEAAGVS 67 (71)
T ss_dssp HHHHHHHHHHHHHHHTTSSSSHHHHHHHCTSS
T ss_pred HHHHHHHHHHHHHHHcCCCcCHHHHHHHhCCC
Confidence 34455577888888877 6899999999985
No 62
>TIGR01999 iscU FeS cluster assembly scaffold IscU. This model represents IscU, a homolog of the N-terminal region of NifU, an Fe-S cluster assembly protein found mostly in nitrogen-fixing bacteria. IscU is considered part of the IscSUA-hscAB-fdx system of Fe-S assembly, whereas NifU is found in nitrogenase-containing (nitrogen-fixing) species. A NifU-type protein is also found in Helicobacter and Campylobacter. IscU and NifU are considered scaffold proteins on which Fe-S clusters are assembled before transfer to apoproteins. This model excludes true NifU proteins as in Klebsiella pneumoniae and Anabaena sp. as well as archaeal homologs. It includes largely proteobacterial and eukaryotic forms.
Probab=25.81 E-value=79 Score=22.82 Aligned_cols=38 Identities=21% Similarity=0.100 Sum_probs=24.9
Q ss_pred HHHhhhccchhhHHHHHH-HHHHHHHcCCCHHHHHHHcC
Q 030521 108 MILVANYLNIKDMLDYLT-ETVANRIKNKSVEYVRKFFG 145 (176)
Q Consensus 108 Ll~AA~~L~I~~L~~l~~-~~iA~~i~gks~eeir~~f~ 145 (176)
.+.-+.|-.-=.-.-.+| ..++++++|||.+|.+.+-.
T Consensus 50 ~I~d~~f~~~GC~~s~Asas~~~e~i~Gktl~ea~~i~~ 88 (124)
T TIGR01999 50 IIEDAKFKTFGCGSAIASSSLATELIKGKSLEEALKIKN 88 (124)
T ss_pred eEEEEEEEecCcHHHHHHHHHHHHHHcCCCHHHHHhccH
Confidence 444455555544555544 56678999999999886643
No 63
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=25.75 E-value=71 Score=23.60 Aligned_cols=19 Identities=21% Similarity=0.324 Sum_probs=14.0
Q ss_pred HHHHHHH-cCCCHHHHHHHc
Q 030521 126 ETVANRI-KNKSVEYVRKFF 144 (176)
Q Consensus 126 ~~iA~~i-~gks~eeir~~f 144 (176)
+.|..++ .|+|.+||+.+|
T Consensus 64 ~~Vr~~i~~G~Sd~eI~~~~ 83 (126)
T TIGR03147 64 HEVYSMVNEGKSNQQIIDFM 83 (126)
T ss_pred HHHHHHHHcCCCHHHHHHHH
Confidence 3455544 699999999885
No 64
>COG1926 Predicted phosphoribosyltransferases [General function prediction only]
Probab=25.62 E-value=1.2e+02 Score=24.65 Aligned_cols=63 Identities=22% Similarity=0.287 Sum_probs=42.9
Q ss_pred CcccccC-C-CChHHHHHHHHHHHHhhhccCC----C----------------------CCccccchhhhhhccCCChHH
Q 030521 53 DTVVPLP-N-VSAAPLSYIIEFCRAHVEFRKK----R----------------------SPKQEVKTFNEEFGKSKSNDE 104 (176)
Q Consensus 53 ~~~Ipl~-~-v~s~~L~~Iie~c~~h~~~~~~----~----------------------~~~~~~~~wd~eFl~~~~~~~ 104 (176)
...|-+. + -++.++..-+++|+++...... . .+...+..|..+|-+. +.++
T Consensus 125 ~~VIlVDDGiATGatm~aAi~~~r~~~~~~IviAVPV~p~~a~~~l~s~~D~vvc~~~P~~F~AVg~~Y~dF~q~-sdeE 203 (220)
T COG1926 125 RTVILVDDGIATGATMKAAVRALRAKGPKEIVIAVPVAPEDAAAELESEADEVVCLYMPAPFEAVGEFYRDFRQV-SDEE 203 (220)
T ss_pred CEEEEEeCCcchhHHHHHHHHHHHhcCCceEEEEcccCCHHHHHHHHhhcCeEEEEcCCccHHHHHHHHHHHhhc-CHHH
Confidence 3455554 3 4888899999999887765321 0 1123467788888777 8888
Q ss_pred HHHHHHhhhccc
Q 030521 105 LKEMILVANYLN 116 (176)
Q Consensus 105 l~~Ll~AA~~L~ 116 (176)
+..||..|+--+
T Consensus 204 V~~lL~~a~~~~ 215 (220)
T COG1926 204 VRALLRRANAPN 215 (220)
T ss_pred HHHHHHhccCCC
Confidence 999988887543
No 65
>TIGR03419 NifU_clost FeS cluster assembly scaffold protein NifU, Clostridium type. NifU and NifS form a pair of iron-sulfur (FeS) cluster biosynthesis proteins much simpler than the ISC and SUF systems. Members of this protein family are a distinct group of NifU-like proteins, found always to a NifS-like protein and restricted to species that lack a SUF system. Typically, NIF systems service a smaller number of FeS-containing proteins than do ISC or SUF. Members of this particular branch typically are found, almost half the time, near the mnmA gene, involved in the carboxymethylaminomethyl modification of U34 in some tRNAs (see GenProp0704). While other NifU proteins are associated with nitrogen fixation, this family is not.
Probab=25.60 E-value=88 Score=22.51 Aligned_cols=37 Identities=14% Similarity=0.103 Sum_probs=24.7
Q ss_pred HHHhhhccchhhHHHHHH-HHHHHHHcCCCHHHHHHHc
Q 030521 108 MILVANYLNIKDMLDYLT-ETVANRIKNKSVEYVRKFF 144 (176)
Q Consensus 108 Ll~AA~~L~I~~L~~l~~-~~iA~~i~gks~eeir~~f 144 (176)
.+.-+.|-.-=..+-.+| ..++.++.|||.+|..++.
T Consensus 46 ~I~d~~f~~~GC~is~Asas~~~e~i~Gk~l~ea~~i~ 83 (121)
T TIGR03419 46 IIKDVKFKTFGCGAAIASSSMATEMIKGKTLEEAWELT 83 (121)
T ss_pred EEEEEEEEEeccHHHHHHHHHHHHHHcCCCHHHHHHhh
Confidence 444555555555555544 4567799999999987664
No 66
>PRK11325 scaffold protein; Provisional
Probab=25.54 E-value=79 Score=23.00 Aligned_cols=38 Identities=13% Similarity=0.065 Sum_probs=25.0
Q ss_pred HHHhhhccchhhHHHHHH-HHHHHHHcCCCHHHHHHHcC
Q 030521 108 MILVANYLNIKDMLDYLT-ETVANRIKNKSVEYVRKFFG 145 (176)
Q Consensus 108 Ll~AA~~L~I~~L~~l~~-~~iA~~i~gks~eeir~~f~ 145 (176)
.+.-+.|-.-=.-.-.+| ..++.+++|||++|.+.+-+
T Consensus 52 ~I~d~~f~~~GC~is~Asas~~~e~~~Gktl~ea~~i~~ 90 (127)
T PRK11325 52 IIEDAKFKTYGCGSAIASSSLVTEWVKGKTLDEALAIKN 90 (127)
T ss_pred eEEEEEEEeeCCHHHHHHHHHHHHHHcCCCHHHHHhcCH
Confidence 444555555555555544 45577999999999886633
No 67
>PF03750 DUF310: Protein of unknown function (DUF310); InterPro: IPR010149 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents the C-terminal domain of a minor family of CRISPR-associated proteins. These proteins are found adjacent to a characteristic short, palindromic repeat cluster termed CRISPR, a probable mobile DNA element.
Probab=25.30 E-value=1e+02 Score=22.14 Aligned_cols=34 Identities=26% Similarity=0.490 Sum_probs=21.4
Q ss_pred cchhhHHHHHHHHHHHHHc--CCCHHHHHHHcCCCCC
Q 030521 115 LNIKDMLDYLTETVANRIK--NKSVEYVRKFFGIEND 149 (176)
Q Consensus 115 L~I~~L~~l~~~~iA~~i~--gks~eeir~~f~i~~d 149 (176)
++++.|.+.+.+. |..++ +.|+.+||++|+--..
T Consensus 8 ~~~~~~~~~Ae~~-~k~l~~~~lttsQlRkf~~~v~~ 43 (119)
T PF03750_consen 8 LDIELLVDYAEKI-AKELKKNKLTTSQLRKFYDEVKR 43 (119)
T ss_pred cCHHHHHHHHHHH-HHHHHhCCCCHHHHHHHHHHHHH
Confidence 4455555555443 34444 4799999999985444
No 68
>PHA03098 kelch-like protein; Provisional
Probab=25.26 E-value=2.4e+02 Score=25.10 Aligned_cols=98 Identities=8% Similarity=0.071 Sum_probs=62.0
Q ss_pred ChHHHHHHHHHHHHhhhccCCCCCccccchhhhhhcc-CCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc-------
Q 030521 62 SAAPLSYIIEFCRAHVEFRKKRSPKQEVKTFNEEFGK-SKSNDELKEMILVANYLNIKDMLDYLTETVANRIK------- 133 (176)
Q Consensus 62 ~s~~L~~Iie~c~~h~~~~~~~~~~~~~~~wd~eFl~-~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~------- 133 (176)
+...+..++.=+.+..-. .+...-.+|+. .++.+..++++..|...+.+.|.+.|-++|+..+.
T Consensus 74 ~~~~~~~ll~~A~~l~~~--------~l~~~C~~~l~~~l~~~nc~~~~~~a~~~~~~~L~~~~~~~i~~nf~~v~~~~~ 145 (534)
T PHA03098 74 TSNNVKDILSIANYLIID--------FLINLCINYIIKIIDDNNCIDIYRFSFFYGCKKLYSAAYNYIRNNIELIYNDPD 145 (534)
T ss_pred cHHHHHHHHHHHHHhCcH--------HHHHHHHHHHHHhCCHhHHHHHHHHHHHcCcHHHHHHHHHHHHHHHHHHhcCch
Confidence 344455555544443321 23333445544 35888999999999999999999999999987553
Q ss_pred --CCCHHHHHHHcCCCCCCCH-HHHHHHHHhcccccCC
Q 030521 134 --NKSVEYVRKFFGIENDFTP-EEEEAARKEYEWAFEG 168 (176)
Q Consensus 134 --gks~eeir~~f~i~~d~t~-EEe~~i~~e~~w~~~~ 168 (176)
..+.+++.+++.= +++.- .|+.-.+---.|+..+
T Consensus 146 f~~l~~~~l~~ll~~-~~L~v~~E~~v~~av~~W~~~~ 182 (534)
T PHA03098 146 FIYLSKNELIKILSD-DKLNVSSEDVVLEIIIKWLTSK 182 (534)
T ss_pred hhcCCHHHHHHHhcC-CCcCcCCHHHHHHHHHHHHhcC
Confidence 2467888888663 23542 3333344456777654
No 69
>PRK15019 CsdA-binding activator; Provisional
Probab=24.62 E-value=1.1e+02 Score=23.02 Aligned_cols=37 Identities=8% Similarity=0.255 Sum_probs=23.8
Q ss_pred hhccchhhHHHHHHHHHHHHHcCCCHHHHHHH--------cCCCCCCCH
Q 030521 112 ANYLNIKDMLDYLTETVANRIKNKSVEYVRKF--------FGIENDFTP 152 (176)
Q Consensus 112 A~~L~I~~L~~l~~~~iA~~i~gks~eeir~~--------f~i~~d~t~ 152 (176)
++-.=+++|+-+ +...+.|+|++||.++ +|+..-+||
T Consensus 83 SDA~IvkGl~al----L~~~~~g~tp~eIl~~d~~~~~~~lGL~~~LSp 127 (147)
T PRK15019 83 SEGRIVRGLLAV----LLTAVEGKTAAELQAQSPLALFDELGLRAQLSA 127 (147)
T ss_pred CccHHHHHHHHH----HHHHHcCCCHHHHHhcCHHHHHHHCCchhhcCc
Confidence 444445555544 4455589999999762 577766776
No 70
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=24.49 E-value=67 Score=24.16 Aligned_cols=18 Identities=17% Similarity=0.361 Sum_probs=16.8
Q ss_pred CCCHHHHHHHcCCCCCCC
Q 030521 134 NKSVEYVRKFFGIENDFT 151 (176)
Q Consensus 134 gks~eeir~~f~i~~d~t 151 (176)
|+||+|-|.-+|++.|+.
T Consensus 98 gmTPd~YR~KW~LP~dYP 115 (148)
T COG4957 98 GLTPDEYRAKWGLPPDYP 115 (148)
T ss_pred CCCHHHHHHhcCCCCCCC
Confidence 899999999999999974
No 71
>TIGR03391 FeS_syn_CsdE cysteine desulfurase, sulfur acceptor subunit CsdE. Members of this protein family are CsdE, formerly called YgdK. This protein, found as a paralog to SufE in Escherichia coli, Yersinia pestis, Photorhabdus luminescens, and related species, works together and physically interacts with CsdA (a paralog of SufS). CsdA has cysteine desulfurase activity that is enhanced by this protein (CsdE), in which Cys-61 (numbered as in E. coli) is a sulfur acceptor site. This gene pair, although involved in FeS cluster biosynthesis, is not found next to other such genes as are its paralogs from the Suf or Isc systems.
Probab=24.14 E-value=1.2e+02 Score=22.58 Aligned_cols=36 Identities=11% Similarity=0.253 Sum_probs=22.4
Q ss_pred hccchhhHHHHHHHHHHHHHcCCCHHHHHHH--------cCCCCCCCH
Q 030521 113 NYLNIKDMLDYLTETVANRIKNKSVEYVRKF--------FGIENDFTP 152 (176)
Q Consensus 113 ~~L~I~~L~~l~~~~iA~~i~gks~eeir~~--------f~i~~d~t~ 152 (176)
+-+=+++|+- .+...+.|+|++||.++ +|+..-+||
T Consensus 79 Da~IvkGl~a----lL~~~~~g~tp~eI~~~d~~~~~~~lGL~~~LSp 122 (138)
T TIGR03391 79 EGRIVRGLLA----VLLTAVEGKTPEQLLAQDPLALFDELGLRAQLSA 122 (138)
T ss_pred ccHHHHHHHH----HHHHHHcCCCHHHHHHCCHHHHHHHcCchhccCc
Confidence 3333445544 44455589999999744 566666665
No 72
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins. These belong to the SANT domain family that specifically recognise the sequence YAAC(G/T)G [, ]. In myb, one of the most conserved regions consisting of three tandem repeats has been shown to be involved in DNA-binding [].; PDB: 1X41_A 2XAF_B 2XAG_B 2XAH_B 2UXN_B 2Y48_B 2XAQ_B 2X0L_B 2IW5_B 2XAJ_B ....
Probab=24.12 E-value=1.6e+02 Score=17.12 Aligned_cols=39 Identities=10% Similarity=0.226 Sum_probs=25.8
Q ss_pred ChHHHHHHHHhhhccchhhHHHHHHHHHHHHHc-CCCHHHHHHHc
Q 030521 101 SNDELKEMILVANYLNIKDMLDYLTETVANRIK-NKSVEYVRKFF 144 (176)
Q Consensus 101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~-gks~eeir~~f 144 (176)
+.++---|+.|..-++-. -=..||..+. |+|+.++|..|
T Consensus 5 t~eE~~~l~~~v~~~g~~-----~W~~Ia~~~~~~Rt~~qc~~~~ 44 (48)
T PF00249_consen 5 TEEEDEKLLEAVKKYGKD-----NWKKIAKRMPGGRTAKQCRSRY 44 (48)
T ss_dssp -HHHHHHHHHHHHHSTTT-----HHHHHHHHHSSSSTHHHHHHHH
T ss_pred CHHHHHHHHHHHHHhCCc-----HHHHHHHHcCCCCCHHHHHHHH
Confidence 334444555555555555 3467888888 99999999875
No 73
>cd02140 Nitroreductase_4 Nitroreductase-like family 4. A subfamily of the nitroreductase family containing uncharacterized proteins that are similar to nitroreductase. Nitroreductase catalyzes the reduction of nitroaromatic compounds such as nitrotoluenes, nitrofurans and nitroimidazoles. This process requires NAD(P)H as electron donor in an obligatory two-electron transfer and uses FMN as cofactor. The enzyme is typically a homodimer. Members of this family are also called NADH dehydrogenase, oxygen-insensitive NAD(P)H nitrogenase or dihydropteridine reductase.
Probab=23.86 E-value=62 Score=24.80 Aligned_cols=39 Identities=15% Similarity=0.015 Sum_probs=26.4
Q ss_pred HHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcCCCCCC
Q 030521 104 ELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFGIENDF 150 (176)
Q Consensus 104 ~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~i~~d~ 150 (176)
.+..|+.||.-+|+-.-+.. . .....+.+|++|||++++
T Consensus 126 a~~nl~LaA~~~GlGs~~~~------~--~~~~~~~v~~~l~ip~~~ 164 (192)
T cd02140 126 AQIAVWTALAAEGIGANLQH------Y--NPLIDEEVAKKWNIPSNW 164 (192)
T ss_pred HHHHHHHHHHhCCCcccHHH------h--CccCCHHHHHhcCCCccc
Confidence 34688889999988763211 0 012358999999999775
No 74
>PF01592 NifU_N: NifU-like N terminal domain; InterPro: IPR002871 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] []. FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins [, ]. It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly []. The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA []. SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA [], acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins []. Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen []. This entry represents the N-terminal of NifU and homologous proteins. NifU contains two domains: an N-terminal and a C-terminal domain (IPR001075 from INTERPRO) []. These domains exist either together or on different polypeptides, both domains being found in organisms that do not fix nitrogen (e.g. yeast), so they have a broader significance in the cell than nitrogen fixation. ; GO: 0005506 iron ion binding, 0051536 iron-sulfur cluster binding, 0016226 iron-sulfur cluster assembly; PDB: 3LVL_A 4EB5_C 4EB7_C 1WFZ_A 2Z7E_C 2AZH_A 1XJS_A 1Q48_A 1R9P_A 2KQK_A ....
Probab=23.74 E-value=99 Score=22.29 Aligned_cols=21 Identities=24% Similarity=0.145 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHcCCCHHHHHH
Q 030521 122 DYLTETVANRIKNKSVEYVRK 142 (176)
Q Consensus 122 ~l~~~~iA~~i~gks~eeir~ 142 (176)
-.++..++.+++|||.+|+.+
T Consensus 65 ~Asas~~~~~i~gk~l~ea~~ 85 (126)
T PF01592_consen 65 IASASMMCELIKGKTLEEALK 85 (126)
T ss_dssp HHHHHHHHHHHTTSBHHHHHC
T ss_pred HHHHHHHHHHHcCCCHHHHHH
Confidence 345567788999999888853
No 75
>PF07076 DUF1344: Protein of unknown function (DUF1344); InterPro: IPR009780 This family consists of several short, hypothetical bacterial proteins of around 80 residues in length. Members of this family are found in Rhizobium, Agrobacterium and Brucella species. The function of this family is unknown.
Probab=23.73 E-value=54 Score=21.13 Aligned_cols=32 Identities=13% Similarity=0.329 Sum_probs=22.9
Q ss_pred ccccccccCcCCCCcEEEEeCCCCEEEecHHH
Q 030521 2 AEESSSSCSAQATKKISLKTADGHLFEVEEPV 33 (176)
Q Consensus 2 ~~~~~~~~~~~~~~~i~L~s~DG~~f~V~~~~ 33 (176)
|.++-..=........+|.-.||+.|.++...
T Consensus 2 A~~veG~I~~id~~~~titLdDGksy~lp~ef 33 (61)
T PF07076_consen 2 AADVEGTIKSIDPETMTITLDDGKSYKLPEEF 33 (61)
T ss_pred CccceEEEEEEcCCceEEEecCCCEEECCCcc
Confidence 33444444455667788999999999999875
No 76
>PF07638 Sigma70_ECF: ECF sigma factor
Probab=23.58 E-value=3.3e+02 Score=20.65 Aligned_cols=46 Identities=24% Similarity=0.161 Sum_probs=33.5
Q ss_pred ChHHHHHHHHhhhccchhhHHHHHHHHHHH-HHcCCCHHHHHHHcCCCC
Q 030521 101 SNDELKEMILVANYLNIKDMLDYLTETVAN-RIKNKSVEYVRKFFGIEN 148 (176)
Q Consensus 101 ~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~-~i~gks~eeir~~f~i~~ 148 (176)
+.+.+.+|-.+-+-|.- |-.-.++.|.- .+.|.|.+||-+.+||+.
T Consensus 119 ~~~~~~~l~e~l~~L~~--l~~~~~~~v~l~~~~Gls~~EIA~~lgiS~ 165 (185)
T PF07638_consen 119 SPEELLELEEALERLLA--LDPRQRRVVELRFFEGLSVEEIAERLGISE 165 (185)
T ss_pred CHHHHHHHHHHHHHHHc--cCHHHHHHHHHHHHCCCCHHHHHHHHCcCH
Confidence 56677777666654433 55666777765 468999999999999964
No 77
>PF00690 Cation_ATPase_N: Cation transporter/ATPase, N-terminus; InterPro: IPR004014 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the conserved N-terminal region found in several classes of cation-transporting P-type ATPases, including those that transport H+ (3.6.3.6 from EC), Na+ (3.6.3.7 from EC), Ca2+ (3.6.3.8 from EC), Na+/K+ (3.6.3.9 from EC), and H+/K+ (3.6.3.10 from EC). In the H+/K+- and Na+/K+-exchange P-ATPases, this domain is found in the catalytic alpha chain. In gastric H+/K+-ATPases, this domain undergoes reversible sequential phosphorylation inducing conformational changes that may be important for regulating the function of these ATPases [, ]. More information about this protein can be found at Protein of the Month: ATP Synthases [].; PDB: 3KDP_C 3N2F_A 3B8E_A 3N23_A 2XZB_A 1MHS_B 3A3Y_A 2ZXE_A 3B8C_A 3B9B_A ....
Probab=23.54 E-value=1.2e+02 Score=19.09 Aligned_cols=30 Identities=37% Similarity=0.403 Sum_probs=21.8
Q ss_pred CCCHHHHHHHcC--CCCCCCHHHHHHHHHhcc
Q 030521 134 NKSVEYVRKFFG--IENDFTPEEEEAARKEYE 163 (176)
Q Consensus 134 gks~eeir~~f~--i~~d~t~EEe~~i~~e~~ 163 (176)
.++.+++.+.|+ ..+.+|++|-++.++.|.
T Consensus 3 ~~~~~~v~~~l~t~~~~GLs~~ev~~r~~~~G 34 (69)
T PF00690_consen 3 QLSVEEVLKRLNTSSSQGLSSEEVEERRKKYG 34 (69)
T ss_dssp TSSHHHHHHHHTTBTSSBBTHHHHHHHHHHHS
T ss_pred CCCHHHHHHHHCcCCCCCCCHHHHHHHHHhcc
Confidence 357888888887 334588888777777654
No 78
>PF07671 DUF1601: Protein of unknown function (DUF1601); InterPro: IPR011632 This repeat is found in a small number of proteins and is apparently limited to Coxiella burnetii.
Probab=23.47 E-value=84 Score=18.08 Aligned_cols=30 Identities=17% Similarity=0.237 Sum_probs=22.7
Q ss_pred ccchhhHHHHHHHHHHHHHcCCCHHHHHHH
Q 030521 114 YLNIKDMLDYLTETVANRIKNKSVEYVRKF 143 (176)
Q Consensus 114 ~L~I~~L~~l~~~~iA~~i~gks~eeir~~ 143 (176)
||.-++|-+.....|+..+..-++.+|-.+
T Consensus 7 ~l~~q~L~~~L~~aV~~~a~~Fn~QeiaNt 36 (37)
T PF07671_consen 7 YLEEQRLSDRLLDAVRRNAEQFNPQEIANT 36 (37)
T ss_pred hhhhccchHHHHHHHHHHHHHcCHHHHhhc
Confidence 566777888888888887777888777654
No 79
>PF11338 DUF3140: Protein of unknown function (DUF3140); InterPro: IPR021487 Some members in this family of proteins are annotated as DNA binding proteins. No function is currently known.
Probab=23.08 E-value=84 Score=21.93 Aligned_cols=22 Identities=9% Similarity=0.288 Sum_probs=17.5
Q ss_pred cCCCChHHHHHHHHHHHHhhhc
Q 030521 58 LPNVSAAPLSYIIEFCRAHVEF 79 (176)
Q Consensus 58 l~~v~s~~L~~Iie~c~~h~~~ 79 (176)
|..-.-..|++|+.||..|...
T Consensus 54 ltddD~~hMrkVV~yv~rhlaq 75 (92)
T PF11338_consen 54 LTDDDYEHMRKVVGYVKRHLAQ 75 (92)
T ss_pred CCHHHHHHHHHHHHHHHHHHhc
Confidence 3334556799999999999987
No 80
>KOG0027 consensus Calmodulin and related proteins (EF-Hand superfamily) [Signal transduction mechanisms]
Probab=22.83 E-value=3.1e+02 Score=19.99 Aligned_cols=60 Identities=17% Similarity=0.350 Sum_probs=32.5
Q ss_pred hHHHHHHHHhhh-----ccchhhHHHHHHHHHHHHHcC-CCHHHHHHHcCCCC-C----CCHHHHHHHHHh
Q 030521 102 NDELKEMILVAN-----YLNIKDMLDYLTETVANRIKN-KSVEYVRKFFGIEN-D----FTPEEEEAARKE 161 (176)
Q Consensus 102 ~~~l~~Ll~AA~-----~L~I~~L~~l~~~~iA~~i~g-ks~eeir~~f~i~~-d----~t~EEe~~i~~e 161 (176)
...+-+++...+ .++...++.+........-.. .+.+++|+.|.+-+ | +|.+|..++...
T Consensus 43 ~~el~~~~~~~D~dg~g~I~~~eF~~l~~~~~~~~~~~~~~~~el~eaF~~fD~d~~G~Is~~el~~~l~~ 113 (151)
T KOG0027|consen 43 EEELRDLIKEIDLDGDGTIDFEEFLDLMEKLGEEKTDEEASSEELKEAFRVFDKDGDGFISASELKKVLTS 113 (151)
T ss_pred HHHHHHHHHHhCCCCCCeEcHHHHHHHHHhhhcccccccccHHHHHHHHHHHccCCCCcCcHHHHHHHHHH
Confidence 444555555554 466666666666555433222 25556666665433 1 566666666543
No 81
>PF03918 CcmH: Cytochrome C biogenesis protein; InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=21.80 E-value=78 Score=23.88 Aligned_cols=12 Identities=33% Similarity=0.509 Sum_probs=8.5
Q ss_pred cCCCHHHHHHHc
Q 030521 133 KNKSVEYVRKFF 144 (176)
Q Consensus 133 ~gks~eeir~~f 144 (176)
.|+|.+||+++|
T Consensus 72 ~G~s~~eI~~~~ 83 (148)
T PF03918_consen 72 EGKSDEEIIDYF 83 (148)
T ss_dssp HT--HHHHHHHH
T ss_pred cCCCHHHHHHHH
Confidence 699999999874
No 82
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=21.67 E-value=67 Score=20.46 Aligned_cols=14 Identities=36% Similarity=0.570 Sum_probs=11.4
Q ss_pred CcEEEEeCCCCEEE
Q 030521 15 KKISLKTADGHLFE 28 (176)
Q Consensus 15 ~~i~L~s~DG~~f~ 28 (176)
+-+++.+.||++|-
T Consensus 5 KA~Kv~~RDGE~~l 18 (65)
T COG4049 5 KAIKVRDRDGEEFL 18 (65)
T ss_pred eeeEeeccCCceee
Confidence 35789999999885
No 83
>PF03883 DUF328: Protein of unknown function (DUF328); InterPro: IPR005583 The members of this family are functionally uncharacterised. They are about 250 amino acids in length.
Probab=21.57 E-value=88 Score=25.34 Aligned_cols=28 Identities=25% Similarity=0.383 Sum_probs=15.1
Q ss_pred HHHHHHHHcCCCHHHHHHHcCCCCCCCH
Q 030521 125 TETVANRIKNKSVEYVRKFFGIENDFTP 152 (176)
Q Consensus 125 ~~~iA~~i~gks~eeir~~f~i~~d~t~ 152 (176)
.+.|+..++..|.+|+++.|+|...+..
T Consensus 31 ~~~l~~~L~~~s~~el~~l~~is~~la~ 58 (237)
T PF03883_consen 31 TEELLEALKSLSEEELKKLMKISDKLAE 58 (237)
T ss_pred HHHHHHHHHhcCHHHHHHHHCCCHHHHH
Confidence 4444555555666666666666544433
No 84
>PF00134 Cyclin_N: Cyclin, N-terminal domain; InterPro: IPR006671 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. Cyclins contain two domains of similar all-alpha fold, of which this entry is associated with the N-terminal domain.; PDB: 2W2H_B 3RGF_B 1KXU_A 1JKW_A 1URC_D 2WMB_D 1H26_D 1GY3_B 2UUE_D 1E9H_D ....
Probab=21.34 E-value=1.5e+02 Score=20.48 Aligned_cols=42 Identities=10% Similarity=0.245 Sum_probs=28.6
Q ss_pred hHHHHHHHHHHHHHcCCCHHHHHHHcCCCC-CCCHHHHHHHHH
Q 030521 119 DMLDYLTETVANRIKNKSVEYVRKFFGIEN-DFTPEEEEAARK 160 (176)
Q Consensus 119 ~L~~l~~~~iA~~i~gks~eeir~~f~i~~-d~t~EEe~~i~~ 160 (176)
.|+-++|-.+|.++.+..+-.+..+..+.+ .+|.++..+.+.
T Consensus 74 ~li~~~cl~lA~K~~e~~~~~~~~~~~~~~~~~~~~~i~~~E~ 116 (127)
T PF00134_consen 74 QLIALACLFLASKMEEDNPPSISDLIRISDNTFTKKDILEMER 116 (127)
T ss_dssp HHHHHHHHHHHHHHHTSS--HHHHHHHHTTTSSHHHHHHHHHH
T ss_pred hhhhhhHHHHhhhhhccccchHHHHHHHHcCCCCHHHHHHHHH
Confidence 477788999999998876555666655554 488877666554
No 85
>smart00224 GGL G protein gamma subunit-like motifs.
Probab=21.22 E-value=1.1e+02 Score=19.40 Aligned_cols=21 Identities=14% Similarity=0.343 Sum_probs=16.2
Q ss_pred ChHHHHHHHHHHHHhhhccCC
Q 030521 62 SAAPLSYIIEFCRAHVEFRKK 82 (176)
Q Consensus 62 ~s~~L~~Iie~c~~h~~~~~~ 82 (176)
.|.....++.||+.|....+.
T Consensus 22 vS~a~~~li~y~e~~~~~DP~ 42 (63)
T smart00224 22 VSKAAEELLAYCEQHAEEDPL 42 (63)
T ss_pred HHHHHHHHHHHHHcCCCCCCC
Confidence 367788899999998776543
No 86
>PF04751 DUF615: Protein of unknown function (DUF615); InterPro: IPR006839 The proteins in this entry are functionally uncharacterised. The entry contains the Escherichia coli (strain K12) protein YjgA (P0A8X0 from SWISSPROT), which has been shown to comigrate with the mature 50S ribosome subunit. Therefore it either represents a novel ribosome-associated protein or it is associated with a different oligomeric complex that comigrates with ribosomal particles [].; PDB: 2P0T_A.
Probab=20.97 E-value=3e+02 Score=20.97 Aligned_cols=57 Identities=12% Similarity=0.225 Sum_probs=44.2
Q ss_pred ccchhhhhhccCCChHHHHHHHHhhhccchhhHHHHHHHHHHHHHcCCCHHHHHHHcC
Q 030521 88 EVKTFNEEFGKSKSNDELKEMILVANYLNIKDMLDYLTETVANRIKNKSVEYVRKFFG 145 (176)
Q Consensus 88 ~~~~wd~eFl~~~~~~~l~~Ll~AA~~L~I~~L~~l~~~~iA~~i~gks~eeir~~f~ 145 (176)
.+..|-..++.. +...+-+++..-.-.+.+.|..++-..-...-.+|++.--|++|.
T Consensus 97 ~lE~wRdrLi~~-~~~al~e~~~~~p~~D~Q~LRqLiR~a~ke~~~~k~~ka~R~Lfr 153 (157)
T PF04751_consen 97 RLERWRDRLIAD-DDSALTEFLAEYPDADRQQLRQLIRNARKEREANKPPKAYRELFR 153 (157)
T ss_dssp HHHHHHHHHHHS--HHHHHHHHHHSTTS-HHHHHHHHHHHHHHHHTT---HHHHHHHH
T ss_pred HHHHHHHHHHhC-ChHHHHHHHHHCChhhHHHHHHHHHHHHHHHHcCCCchhHHHHHH
Confidence 467888888888 888888888888888999999999888888888999888888774
No 87
>PLN03219 uncharacterized protein; Provisional
Probab=20.60 E-value=3.3e+02 Score=19.55 Aligned_cols=59 Identities=8% Similarity=0.126 Sum_probs=38.1
Q ss_pred CCCCcEEEEeCC---CCEEEecHHHHHHhHHHHHHHhcC----CCCCCCcccccCCCChHHHHHHHHH
Q 030521 12 QATKKISLKTAD---GHLFEVEEPVAMEFEIVKSFFDEN----DDASEDTVVPLPNVSAAPLSYIIEF 72 (176)
Q Consensus 12 ~~~~~i~L~s~D---G~~f~V~~~~a~~S~~l~~ll~~~----~~~~~~~~Ipl~~v~s~~L~~Iie~ 72 (176)
.+...+.+...+ ++.|.|+...+. -+.++.||+.. |.+....+|.|| =....++.|+..
T Consensus 39 vpkGh~aVYVG~~~E~kRFvVPi~yL~-hP~F~~LL~~AeEEfGf~~~~G~L~IP-Cd~~~F~~ll~~ 104 (108)
T PLN03219 39 VPKGHVAVYVGEQMEKKRFVVPISYLN-HPLFREFLNRAEEECGFHHSMGGLTIP-CREESFLHLITS 104 (108)
T ss_pred CCCCeEEEEECCCCCceEEEEEHHHcC-ChHHHHHHHHHHHHhCCCCCCCCEEEe-CCHHHHHHHHHh
Confidence 555556665554 699999998754 46667776532 332112477777 777888887764
No 88
>PF02657 SufE: Fe-S metabolism associated domain; InterPro: IPR003808 This entry represents the core domain of SufE and related proteins. This domain of SufE shows strong structural similarity to IscU, and the sulfur-acceptor site in SufE coincides with the location of the cysteine residues mediating Fe-S cluster assembly in IscU. Thus, a conserved core structure is implicated in mediating the interactions of both SufE and IscU with the mutually homologous cysteine desulfurase enzymes present in their respective operons [].; PDB: 1MZG_B 1WLO_A 3G0M_A 1NI7_A.
Probab=20.47 E-value=1.8e+02 Score=21.11 Aligned_cols=35 Identities=23% Similarity=0.475 Sum_probs=23.6
Q ss_pred cchhhHHHHHHHHHHHHHcCCCHHHHHHH-------cCCCCCCCHH
Q 030521 115 LNIKDMLDYLTETVANRIKNKSVEYVRKF-------FGIENDFTPE 153 (176)
Q Consensus 115 L~I~~L~~l~~~~iA~~i~gks~eeir~~-------f~i~~d~t~E 153 (176)
+=+++|+-+.+. .++|+|++||..+ +|+..-+||-
T Consensus 67 ~ivkGl~all~~----~~~g~t~~eI~~~~~~fl~~lgl~~~Ls~s 108 (125)
T PF02657_consen 67 RIVKGLLALLLE----VLNGQTPEEILAFDPDFLEQLGLSQHLSPS 108 (125)
T ss_dssp HHHHHHHHHHHH----HTTT-BHHHHHHS-THHHHHHTSCCCSTHH
T ss_pred HHHHHHHHHHHH----HHcCCCHHHHHhCCHHHHHHcCcccccCch
Confidence 335555555544 4589999999765 7888888874
No 89
>smart00875 BACK BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.
Probab=20.33 E-value=2.6e+02 Score=18.10 Aligned_cols=62 Identities=16% Similarity=0.288 Sum_probs=37.1
Q ss_pred HHHHHhhhccchhhHHHHHHHHHHHH---------HcCCCHHHHHHHcCCCCCCC-HHHHHHHHHhcccccCC
Q 030521 106 KEMILVANYLNIKDMLDYLTETVANR---------IKNKSVEYVRKFFGIENDFT-PEEEEAARKEYEWAFEG 168 (176)
Q Consensus 106 ~~Ll~AA~~L~I~~L~~l~~~~iA~~---------i~gks~eeir~~f~i~~d~t-~EEe~~i~~e~~w~~~~ 168 (176)
+++...|...+.+.|.+.|.++++.. +...+.+++..+++= ++++ ..|..-.+---.|+...
T Consensus 2 ~~i~~~a~~~~~~~L~~~~~~~i~~nf~~~~~~~~f~~L~~~~l~~iL~~-d~l~v~~E~~v~~av~~W~~~~ 73 (101)
T smart00875 2 LGIRRFAELYGLEELLEKALRFILKNFLEVAQSEEFLELSLEQLLSLLSS-DDLNVPSEEEVFEAVLRWVKHD 73 (101)
T ss_pred HhHHHHHHHhChHHHHHHHHHHHHHHHHHHhcCcHHhcCCHHHHHHHhCc-ccCCCCCHHHHHHHHHHHHHCC
Confidence 34566677788888888888887663 234567777777654 2343 22333333345566543
Done!