Query psy15066
Match_columns 62
No_of_seqs 159 out of 1507
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 19:34:25 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15066.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15066hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1577|consensus 99.7 1.3E-17 2.8E-22 106.1 6.6 56 1-56 234-289 (300)
2 COG0656 ARA1 Aldo/keto reducta 99.7 3.1E-17 6.7E-22 103.7 7.0 54 2-55 214-267 (280)
3 PRK11565 dkgA 2,5-diketo-D-glu 99.6 2.7E-15 5.9E-20 93.9 7.1 57 1-57 211-267 (275)
4 PRK11172 dkgB 2,5-diketo-D-glu 99.6 9.6E-15 2.1E-19 91.0 7.1 56 1-56 201-256 (267)
5 KOG1575|consensus 99.5 1.2E-13 2.7E-18 89.2 6.4 55 1-55 272-328 (336)
6 PRK10625 tas putative aldo-ket 99.4 3E-13 6.6E-18 86.7 6.5 56 1-56 286-343 (346)
7 PRK10376 putative oxidoreducta 99.4 6.3E-13 1.4E-17 83.7 6.4 52 1-52 235-288 (290)
8 COG0667 Tas Predicted oxidored 99.4 5.8E-13 1.3E-17 85.2 6.3 53 1-53 256-310 (316)
9 TIGR01293 Kv_beta voltage-depe 99.4 6.7E-13 1.5E-17 84.3 6.0 50 1-50 263-316 (317)
10 PRK09912 L-glyceraldehyde 3-ph 99.4 1E-12 2.2E-17 84.5 6.5 57 1-57 279-338 (346)
11 PF00248 Aldo_ket_red: Aldo/ke 99.3 1.1E-11 2.3E-16 76.9 5.9 51 1-51 230-282 (283)
12 cd06660 Aldo_ket_red Aldo-keto 99.2 5.5E-11 1.2E-15 73.8 6.1 50 1-50 234-285 (285)
13 PLN02587 L-galactose dehydroge 99.2 5E-11 1.1E-15 75.7 6.0 53 1-53 243-301 (314)
14 PRK14863 bifunctional regulato 99.0 7.3E-10 1.6E-14 70.2 4.1 50 1-50 229-280 (292)
15 COG4989 Predicted oxidoreducta 98.7 3.7E-08 8E-13 62.4 5.7 53 1-53 240-294 (298)
16 COG1453 Predicted oxidoreducta 98.0 1.2E-05 2.5E-10 53.2 4.1 50 2-51 230-284 (391)
17 KOG1576|consensus 92.7 0.36 7.7E-06 31.6 4.7 41 1-41 268-310 (342)
18 PF11116 DUF2624: Protein of u 79.2 6.1 0.00013 21.3 3.9 39 22-60 14-52 (85)
19 TIGR02530 flg_new flagellar op 72.2 7.2 0.00016 21.5 3.0 26 25-50 18-43 (96)
20 PF08145 BOP1NT: BOP1NT (NUC16 65.9 18 0.00038 23.4 4.2 19 36-54 49-67 (260)
21 PRK10945 gene expression modul 65.1 9 0.00019 20.1 2.4 40 8-52 9-48 (72)
22 PF08270 PRD_Mga: M protein tr 63.5 9.5 0.00021 23.4 2.7 45 6-51 35-94 (217)
23 PRK06552 keto-hydroxyglutarate 59.6 18 0.0004 22.3 3.4 50 7-56 101-158 (213)
24 PF04746 DUF575: Protein of un 58.9 6 0.00013 21.8 1.1 30 20-49 64-93 (101)
25 PF07862 Nif11: Nitrogen fixat 56.6 16 0.00034 17.0 2.3 23 22-44 26-48 (49)
26 PRK14981 DNA-directed RNA poly 56.2 21 0.00045 19.9 3.0 33 21-53 77-109 (112)
27 PRK00819 RNA 2'-phosphotransfe 55.1 41 0.00089 20.3 4.4 50 7-56 12-67 (179)
28 PF01885 PTS_2-RNA: RNA 2'-pho 54.7 27 0.00058 21.1 3.5 50 7-56 11-66 (186)
29 TIGR01457 HAD-SF-IIA-hyp2 HAD- 50.5 36 0.00079 21.0 3.8 40 5-44 23-66 (249)
30 PF11338 DUF3140: Protein of u 50.4 34 0.00074 18.7 3.2 50 2-52 9-67 (92)
31 PF03874 RNA_pol_Rpb4: RNA pol 50.3 20 0.00043 19.6 2.4 28 23-50 85-112 (117)
32 PRK07114 keto-hydroxyglutarate 50.3 32 0.00069 21.5 3.4 49 7-55 104-160 (222)
33 TIGR03798 ocin_TIGR03798 bacte 48.7 34 0.00074 16.9 3.1 26 22-47 24-49 (64)
34 PHA02681 ORF089 virion membran 48.6 21 0.00046 19.3 2.1 30 23-52 36-65 (92)
35 PRK10391 oriC-binding nucleoid 48.2 17 0.00037 19.0 1.7 33 19-51 10-43 (71)
36 PRK06015 keto-hydroxyglutarate 48.1 37 0.00079 20.9 3.4 49 7-55 89-146 (201)
37 COG1460 Uncharacterized protei 47.7 33 0.00071 19.5 2.9 33 21-53 78-110 (114)
38 TIGR00135 gatC glutamyl-tRNA(G 47.4 42 0.00091 17.7 3.3 24 25-48 3-26 (93)
39 KOG0650|consensus 47.3 15 0.00033 26.6 1.9 29 34-62 177-206 (733)
40 PF13833 EF-hand_8: EF-hand do 45.4 32 0.00069 15.8 2.4 28 23-50 5-33 (54)
41 PRK00034 gatC aspartyl/glutamy 45.3 47 0.001 17.5 3.3 26 23-48 3-28 (95)
42 PF01081 Aldolase: KDPG and KH 45.0 27 0.00058 21.4 2.5 50 7-56 93-151 (196)
43 TIGR01182 eda Entner-Doudoroff 44.6 46 0.00099 20.5 3.5 28 7-34 93-120 (204)
44 PF07308 DUF1456: Protein of u 43.8 46 0.001 17.0 3.5 28 24-51 15-42 (68)
45 PF14407 Frankia_peptide: Ribo 43.2 24 0.00052 17.9 1.8 18 34-51 27-44 (61)
46 PF10281 Ish1: Putative stress 42.6 34 0.00074 15.1 2.5 26 8-33 10-35 (38)
47 COG5201 SKP1 SCF ubiquitin lig 42.4 21 0.00045 21.1 1.6 41 10-51 109-152 (158)
48 PTZ00315 2'-phosphotransferase 39.6 84 0.0018 22.6 4.5 51 7-57 384-440 (582)
49 COG1026 Predicted Zn-dependent 39.4 46 0.001 25.4 3.3 47 5-51 438-492 (978)
50 cd07321 Extradiol_Dioxygenase_ 39.3 34 0.00075 17.7 2.1 28 24-51 18-47 (77)
51 TIGR02744 TrbI_Ftype type-F co 39.2 65 0.0014 18.2 3.3 44 6-49 20-65 (112)
52 PF05321 HHA: Haemolysin expre 39.2 10 0.00022 19.0 0.0 32 21-52 5-36 (57)
53 PF12162 STAT1_TAZ2bind: STAT1 39.0 33 0.00072 14.0 1.6 15 37-51 8-22 (23)
54 TIGR01458 HAD-SF-IIA-hyp3 HAD- 37.8 86 0.0019 19.5 4.0 39 6-44 28-70 (257)
55 TIGR02311 HpaI 2,4-dihydroxyhe 37.4 64 0.0014 20.2 3.4 31 7-37 76-108 (249)
56 PRK05439 pantothenate kinase; 36.8 58 0.0013 21.4 3.2 30 24-53 14-43 (311)
57 PF13986 DUF4224: Domain of un 36.3 11 0.00024 17.8 -0.1 12 39-50 3-14 (47)
58 PF12551 PHBC_N: Poly-beta-hyd 36.1 36 0.00079 16.2 1.7 11 1-11 24-34 (46)
59 PHA02902 putative IMV membrane 35.1 27 0.00059 18.0 1.2 26 26-51 38-66 (70)
60 COG4536 CorB Putative Mg2+ and 35.0 34 0.00075 23.5 2.0 33 15-50 344-376 (423)
61 PF04358 DsrC: DsrC like prote 35.0 30 0.00064 19.3 1.5 32 20-51 20-51 (109)
62 COG4710 Predicted DNA-binding 35.0 18 0.00039 19.1 0.6 25 29-53 2-26 (80)
63 PRK02290 3-dehydroquinate synt 34.7 6.2 0.00013 26.4 -1.5 52 5-57 128-185 (344)
64 PF03457 HA: Helicase associat 33.6 35 0.00076 16.7 1.5 47 5-51 13-66 (68)
65 COG0313 Predicted methyltransf 33.4 36 0.00078 22.2 1.8 28 2-29 93-122 (275)
66 PF12651 RHH_3: Ribbon-helix-h 33.2 20 0.00043 16.6 0.5 22 34-55 4-25 (44)
67 PHA03188 UL14 tegument protein 33.1 70 0.0015 19.9 2.9 41 5-45 147-189 (199)
68 TIGR03342 dsrC_tusE_dsvC sulfu 32.8 58 0.0013 18.2 2.4 33 20-52 19-51 (108)
69 PF06819 Arc_PepC: Archaeal Pe 32.5 48 0.001 18.7 2.0 18 39-56 87-104 (110)
70 PF09336 Vps4_C: Vps4 C termin 32.3 71 0.0015 15.8 2.5 32 22-53 29-60 (62)
71 PF10802 DUF2540: Protein of u 32.2 66 0.0014 17.0 2.4 45 4-52 9-60 (75)
72 PF01466 Skp1: Skp1 family, di 31.3 80 0.0017 16.1 3.4 31 19-50 42-72 (78)
73 PF00352 TBP: Transcription fa 31.1 42 0.00091 17.4 1.6 20 14-33 57-76 (86)
74 PRK12821 aspartyl/glutamyl-tRN 30.9 83 0.0018 22.1 3.3 26 23-48 389-414 (477)
75 PRK11508 sulfur transfer prote 30.9 64 0.0014 18.1 2.4 33 20-52 20-52 (109)
76 cd07922 CarBa CarBa is the A s 30.8 62 0.0013 17.2 2.2 25 26-50 21-47 (81)
77 PRK05718 keto-hydroxyglutarate 30.4 1E+02 0.0022 19.0 3.4 28 6-33 99-126 (212)
78 PF10260 SAYSvFN: Uncharacteri 29.8 52 0.0011 17.1 1.8 17 15-31 53-69 (71)
79 PHA01748 hypothetical protein 29.8 25 0.00055 17.4 0.6 20 34-53 4-23 (60)
80 PF11762 Arabinose_Iso_C: L-ar 29.8 1E+02 0.0023 17.5 3.1 35 4-38 75-111 (115)
81 COG1465 Predicted alternative 29.6 9.4 0.0002 25.4 -1.2 52 5-57 159-217 (376)
82 PRK13619 psbV cytochrome c-550 29.5 45 0.00098 20.0 1.7 26 25-50 113-143 (160)
83 PF12824 MRP-L20: Mitochondria 29.1 52 0.0011 19.7 1.9 19 36-54 83-101 (164)
84 KOG0237|consensus 28.8 56 0.0012 24.0 2.3 51 5-56 14-69 (788)
85 PF09202 Rio2_N: Rio2, N-termi 28.4 56 0.0012 17.2 1.8 22 38-59 3-24 (82)
86 PRK14700 recombination factor 27.8 66 0.0014 21.2 2.4 35 13-47 5-47 (300)
87 COG2250 Uncharacterized conser 27.5 1.3E+02 0.0027 17.1 3.9 47 4-51 40-86 (132)
88 TIGR00096 probable S-adenosylm 26.7 27 0.00059 22.5 0.5 26 2-27 88-115 (276)
89 PF10937 DUF2638: Protein of u 26.6 55 0.0012 18.4 1.7 13 37-49 97-109 (112)
90 PF01113 DapB_N: Dihydrodipico 26.6 61 0.0013 17.9 1.9 15 39-53 101-115 (124)
91 COG2256 MGS1 ATPase related to 26.2 1.1E+02 0.0023 21.4 3.2 38 11-48 127-172 (436)
92 KOG2749|consensus 26.1 52 0.0011 22.6 1.7 39 2-40 118-170 (415)
93 cd07921 PCA_45_Doxase_A_like S 26.0 55 0.0012 18.3 1.6 30 21-50 25-56 (106)
94 PF10078 DUF2316: Uncharacteri 25.9 96 0.0021 16.8 2.4 29 24-52 11-48 (89)
95 PF07801 DUF1647: Protein of u 25.2 1.1E+02 0.0025 17.9 2.9 35 16-50 61-110 (142)
96 smart00657 RPOL4c DNA-directed 24.1 1.4E+02 0.003 16.5 3.1 28 22-49 82-109 (118)
97 COG0099 RpsM Ribosomal protein 24.1 69 0.0015 18.4 1.7 15 38-52 47-61 (121)
98 PF07128 DUF1380: Protein of u 24.0 1.5E+02 0.0032 17.5 3.1 26 23-48 27-52 (139)
99 PRK12302 bssR biofilm formatio 23.8 1.2E+02 0.0026 17.2 2.6 28 23-50 37-75 (127)
100 PF01402 RHH_1: Ribbon-helix-h 23.7 23 0.00049 15.4 -0.2 18 36-53 3-20 (39)
101 PRK01002 nickel responsive reg 23.3 31 0.00068 19.9 0.3 23 32-54 4-26 (141)
102 PRK10444 UMP phosphatase; Prov 23.1 2E+02 0.0043 17.9 3.9 39 6-44 24-66 (248)
103 PF02844 GARS_N: Phosphoribosy 22.9 18 0.00039 19.9 -0.8 46 5-55 12-62 (100)
104 PF14794 DUF4479: Domain of un 22.9 68 0.0015 16.6 1.5 18 35-52 44-61 (73)
105 PF07939 DUF1685: Protein of u 22.8 64 0.0014 16.5 1.3 11 39-49 1-11 (64)
106 PF08127 Propeptide_C1: Peptid 22.8 60 0.0013 14.9 1.1 17 39-55 1-17 (41)
107 PRK10128 2-keto-3-deoxy-L-rham 22.6 1.8E+02 0.004 18.6 3.6 32 7-38 82-115 (267)
108 PF08920 SF3b1: Splicing facto 22.5 83 0.0018 18.5 1.9 17 32-48 80-96 (144)
109 COG3195 Uncharacterized protei 22.5 44 0.00095 20.4 0.8 18 38-55 100-117 (176)
110 PRK13717 conjugal transfer pro 22.5 1.7E+02 0.0037 17.0 3.9 43 6-48 33-77 (128)
111 PF11256 DUF3055: Protein of u 22.4 84 0.0018 16.8 1.8 38 12-51 41-78 (81)
112 COG1056 NadR Nicotinamide mono 21.8 42 0.0009 20.3 0.6 25 7-31 22-48 (172)
113 PF02401 LYTB: LytB protein; 21.8 81 0.0018 20.4 1.9 31 2-32 243-274 (281)
114 PF11017 DUF2855: Protein of u 21.7 1.1E+02 0.0024 20.3 2.5 46 3-49 147-197 (314)
115 PF14658 EF-hand_9: EF-hand do 21.6 1.3E+02 0.0029 15.3 3.0 28 24-51 16-44 (66)
116 PF05922 Inhibitor_I9: Peptida 21.6 27 0.0006 17.3 -0.2 20 32-51 49-68 (82)
117 KOG0658|consensus 21.5 1.7E+02 0.0037 20.0 3.4 37 14-51 221-257 (364)
118 PF13101 DUF3945: Protein of u 21.3 43 0.00092 16.4 0.5 22 28-49 24-45 (59)
119 TIGR02698 CopY_TcrY copper tra 21.2 82 0.0018 17.7 1.7 17 37-53 100-116 (130)
120 PRK00630 nickel responsive reg 20.6 41 0.0009 19.7 0.4 30 24-54 3-32 (148)
121 PF04316 FlgM: Anti-sigma-28 f 20.5 1.2E+02 0.0026 14.5 2.0 28 18-45 23-50 (57)
122 cd07925 LigA_like_1 The A subu 20.3 94 0.002 17.4 1.8 32 19-50 23-56 (106)
123 COG0289 DapB Dihydrodipicolina 20.3 2.1E+02 0.0046 18.6 3.5 17 39-55 103-119 (266)
124 PF14823 Sirohm_synth_C: Siroh 20.2 84 0.0018 16.1 1.5 19 38-56 49-67 (70)
125 KOG1461|consensus 20.2 2.2E+02 0.0048 21.0 3.9 39 4-42 58-98 (673)
126 cd04517 TLF TBP-like factors ( 20.2 1E+02 0.0022 18.4 2.1 19 15-33 148-166 (174)
127 PF03965 Penicillinase_R: Peni 20.2 84 0.0018 17.0 1.6 14 38-51 100-113 (115)
128 cd07923 Gallate_dioxygenase_C 20.1 1.2E+02 0.0026 16.6 2.1 30 21-50 17-48 (94)
129 PF13344 Hydrolase_6: Haloacid 20.0 1E+02 0.0022 16.4 1.9 41 4-44 19-63 (101)
130 PF14338 Mrr_N: Mrr N-terminal 20.0 1.5E+02 0.0033 15.4 2.9 29 20-49 17-45 (92)
No 1
>KOG1577|consensus
Probab=99.73 E-value=1.3e-17 Score=106.12 Aligned_cols=56 Identities=39% Similarity=0.615 Sum_probs=53.6
Q ss_pred CCHHHHHHHHHHhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCcc
Q psy15066 1 MFMTSFILRPQVQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 1 ~s~aqval~W~l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~ 56 (62)
.|+|||+|+|.+++|+++||+|++++||++|+++++|+|++|||+.|++++.++|+
T Consensus 234 kt~aQIlLrw~~q~g~~vipKS~~~~Ri~eN~~vfdf~Lt~ed~~~i~~~~~~~r~ 289 (300)
T KOG1577|consen 234 KTPAQILLRWALQRGVSVIPKSSNPERIKENFKVFDFELTEEDMKKLDSLNSNERY 289 (300)
T ss_pred CCHHHHHHHHHHhCCcEEEeccCCHHHHHHHHhhccccCCHHHHHHHhhcccccee
Confidence 48999999999999999999999999999999999999999999999999988874
No 2
>COG0656 ARA1 Aldo/keto reductases, related to diketogulonate reductase [General function prediction only]
Probab=99.71 E-value=3.1e-17 Score=103.74 Aligned_cols=54 Identities=44% Similarity=0.628 Sum_probs=52.0
Q ss_pred CHHHHHHHHHHhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCc
Q psy15066 2 FMTSFILRPQVQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQR 55 (62)
Q Consensus 2 s~aqval~W~l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r 55 (62)
|++||+|+|++++|+++||++++++|+++|+.+++|+||+|||++|++++...+
T Consensus 214 t~AQv~L~W~i~~gv~~Ipks~~~~ri~eN~~~~~f~Ls~ed~~~i~~l~~~~~ 267 (280)
T COG0656 214 TPAQVALRWHIQRGVIVIPKSTTPERIRENLAAFDFELSEEDMAAIDALDRGYG 267 (280)
T ss_pred CHHHHHHHHHHhCCcEEecCCCCHHHHHHHHhhhcCCCCHHHHHHHHhhccccC
Confidence 899999999999999999999999999999999999999999999999998763
No 3
>PRK11565 dkgA 2,5-diketo-D-gluconate reductase A; Provisional
Probab=99.61 E-value=2.7e-15 Score=93.95 Aligned_cols=57 Identities=37% Similarity=0.588 Sum_probs=53.6
Q ss_pred CCHHHHHHHHHHhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCccc
Q psy15066 1 MFMTSFILRPQVQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRSC 57 (62)
Q Consensus 1 ~s~aqval~W~l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~~ 57 (62)
+|++|+||+|+++++.++|||+++++|+++|+++.++.|+++++++|+.+..+.|+.
T Consensus 211 ~s~aq~aL~w~l~~~~~~I~g~~~~~~i~~n~~a~~~~Ls~~~~~~i~~~~~~~~~~ 267 (275)
T PRK11565 211 KTPAQIVIRWHLDSGLVVIPKSVTPSRIAENFDVFDFRLDKDELGEIAKLDQGKRLG 267 (275)
T ss_pred CCHHHHHHHHHHcCCCEeeCCCCCHHHHHHHHhccCCCcCHHHHHHHHhhcccCCcC
Confidence 589999999999999999999999999999999999999999999999998877763
No 4
>PRK11172 dkgB 2,5-diketo-D-gluconate reductase B; Provisional
Probab=99.57 E-value=9.6e-15 Score=91.04 Aligned_cols=56 Identities=29% Similarity=0.351 Sum_probs=52.4
Q ss_pred CCHHHHHHHHHHhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCcc
Q psy15066 1 MFMTSFILRPQVQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 1 ~s~aqval~W~l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~ 56 (62)
+|++|++|+|++++++++|||+++++|+++|+++.+++|+++++++|+++.++.|.
T Consensus 201 ~s~aqval~w~l~~~~~~i~g~~~~~~l~~n~~~~~~~L~~~~~~~i~~~~~~~~~ 256 (267)
T PRK11172 201 ATPAQVILAWAMQLGYSVIPSSTKRENLASNLLAQDLQLDAEDMAAIAALDRNGRL 256 (267)
T ss_pred CCHHHHHHHHHHhCCCEeecCCCCHHHHHHHHhhcCCCcCHHHHHHHhhhccCCcc
Confidence 58999999999999999999999999999999999999999999999999877553
No 5
>KOG1575|consensus
Probab=99.48 E-value=1.2e-13 Score=89.17 Aligned_cols=55 Identities=18% Similarity=0.396 Sum_probs=51.2
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCc
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQR 55 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r 55 (62)
+|++|+||+|+++++ ++||||+++.+|+++|+.++.++|++|+++.|++......
T Consensus 272 ~T~~qlALawv~~~~~v~~pIpG~s~ve~l~eni~Al~~~Lt~e~~~~l~~~~~~~~ 328 (336)
T KOG1575|consen 272 CTVPQLALAWVLSNGKVSSPIPGASKIEQLKENIGALSVKLTPEEIKELEEIIDKIL 328 (336)
T ss_pred CCHHHHHHHHHHHhCCCEEecCCCCcHHHHHHHHhhhhccCCHHHHHHHHHhhcccc
Confidence 589999999999996 9999999999999999999999999999999999876644
No 6
>PRK10625 tas putative aldo-keto reductase; Provisional
Probab=99.44 E-value=3e-13 Score=86.74 Aligned_cols=56 Identities=14% Similarity=0.186 Sum_probs=50.6
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCcc
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~ 56 (62)
+|++|+||+|+++++ +++|||+++++|+++|+++.+++|+++++++|+++...+++
T Consensus 286 ~t~aqval~w~l~~~~v~~~I~G~~~~~~l~en~~a~~~~L~~~~~~~l~~~~~~~~~ 343 (346)
T PRK10625 286 LDPAQMALAFVRRQPFVASTLLGATTMEQLKTNIESLHLTLSEEVLAEIEAVHQVYTY 343 (346)
T ss_pred CCHHHHHHHHHHhCCCCeeEEeCCCCHHHHHHHHhhccCCCCHHHHHHHHHHHhhcCC
Confidence 489999999999997 46899999999999999999999999999999998765443
No 7
>PRK10376 putative oxidoreductase; Provisional
Probab=99.42 E-value=6.3e-13 Score=83.69 Aligned_cols=52 Identities=17% Similarity=0.303 Sum_probs=48.1
Q ss_pred CCHHHHHHHHHHhC--CCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 1 MFMTSFILRPQVQS--GTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 1 ~s~aqval~W~l~~--~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
+|++|++|+|++++ ++++|+|+++++|+++|+++.++.|++++++.|+++.+
T Consensus 235 ~t~aq~al~w~l~~~~~~~~i~G~~~~~~l~en~~a~~~~L~~e~~~~l~~~~~ 288 (290)
T PRK10376 235 ATPMQVALAWLLQRSPNILLIPGTSSVAHLRENLAAAELVLSEEVLAELDGIAR 288 (290)
T ss_pred CCHHHHHHHHHHhCCCCeeEeeCCCCHHHHHHHHhhccCCCCHHHHHHHHHHHh
Confidence 48999999999987 48899999999999999999999999999999998764
No 8
>COG0667 Tas Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) [Energy production and conversion]
Probab=99.42 E-value=5.8e-13 Score=85.23 Aligned_cols=53 Identities=17% Similarity=0.305 Sum_probs=49.0
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
+||+|+||+|+++++ ++||+|+++++|+++|+++.+++|++++++.|+.....
T Consensus 256 ~t~aq~ALawvl~~~~v~~~I~Ga~~~~qL~en~~A~~~~L~~~~~~~l~~~~~~ 310 (316)
T COG0667 256 ATPAQVALAWVLAQPGVTSPIVGASKAEQLEENLAALDIKLSEEELAALDEISAE 310 (316)
T ss_pred CCHHHHHHHHHHhCCCCceEeecCCCHHHHHHHHHHhcCCCCHHHHHHHHHHhhh
Confidence 589999999999995 89999999999999999999999999999999977543
No 9
>TIGR01293 Kv_beta voltage-dependent potassium channel beta subunit, animal. Plant beta subunits and their closely related bacterial homologs (in Deinococcus radiudurans, Xylella fastidiosa, etc.) appear more closely related to each other than to animal forms. However, the bacterial species lack convincing counterparts the Kv alpha subunit and the Kv beta homolog may serve as an enzyme. Cutoffs are set for this model such that yeast and plant forms and bacterial close homologs score between trusted and noise cutoffs.
Probab=99.40 E-value=6.7e-13 Score=84.34 Aligned_cols=50 Identities=12% Similarity=0.255 Sum_probs=46.1
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCC--CCCHHHHHHHHhh
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDF--TLEQSDVDKIDAL 50 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~--~L~~e~~~~l~~~ 50 (62)
+|++|+||+|+++++ +++|||+++++|+++|++++++ +|++|++++|+++
T Consensus 263 ~s~aqlal~w~l~~~~v~~~i~G~~~~~ql~en~~a~~~~~~Ls~e~~~~l~~~ 316 (317)
T TIGR01293 263 CTLPQLAIAWCLRNEGVSSVLLGASSAEQLMENLGSLQVLPKLSSSIIHEIDSI 316 (317)
T ss_pred cCHHHHHHHHHhcCCCCeEEEeCCCCHHHHHHHHHHhhccCCCCHHHHHHHHhh
Confidence 589999999999985 5899999999999999999987 9999999999874
No 10
>PRK09912 L-glyceraldehyde 3-phosphate reductase; Provisional
Probab=99.40 E-value=1e-12 Score=84.53 Aligned_cols=57 Identities=14% Similarity=0.201 Sum_probs=50.6
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhcc-CCCCCHHHHHHHHhhhcCCccc
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVF-DFTLEQSDVDKIDALDKKQRSC 57 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~-~~~L~~e~~~~l~~~~~~~r~~ 57 (62)
+|++|+||+|+++++ +++|||+++++|+++|+++. +++|++|++++|+++......+
T Consensus 279 ~t~aq~AL~w~l~~~~v~~~i~G~~~~~ql~en~~a~~~~~L~~e~~~~l~~~~~~~~~~ 338 (346)
T PRK09912 279 QSMAQMALSWLLKDERVTSVLIGASRAEQLEENVQALNNLTFSTEELAQIDQHIADGELN 338 (346)
T ss_pred CCHHHHHHHHHHhCCCCeEEEeCCCCHHHHHHHHhhhcCCCCCHHHHHHHHHhhCcccee
Confidence 588999999999996 78999999999999999998 5999999999999987554443
No 11
>PF00248 Aldo_ket_red: Aldo/keto reductase family; InterPro: IPR023210 The aldo-keto reductase family includes a number of related monomeric NADPH-dependent oxidoreductases, such as aldehyde reductase, aldose reductase, prostaglandin F synthase, xylose reductase, rho crystallin, and many others []. All possess a similar structure, with a beta-alpha-beta fold characteristic of nucleotide binding proteins []. The fold comprises a parallel beta-8/alpha-8-barrel, which contains a novel NADP-binding motif. The binding site is located in a large, deep, elliptical pocket in the C-terminal end of the beta sheet, the substrate being bound in an extended conformation. The hydrophobic nature of the pocket favours aromatic and apolar substrates over highly polar ones []. Binding of the NADPH coenzyme causes a massive conformational change, reorienting a loop, effectively locking the coenzyme in place. This binding is more similar to FAD- than to NAD(P)-binding oxidoreductases []. Some proteins of this entry contain a K+ ion channel beta chain regulatory domain; these are reported to have oxidoreductase activity []. This entry represents the NADP-dependent oxidoreductase domain found in these proteins.; PDB: 1C9W_A 4F40_B 1VBJ_A 1XGD_A 1X97_A 2ACS_A 1EF3_A 2ACU_A 1PWM_A 2NVD_A ....
Probab=99.28 E-value=1.1e-11 Score=76.92 Aligned_cols=51 Identities=20% Similarity=0.300 Sum_probs=44.3
Q ss_pred CCHHHHHHHHHHhC--CCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 1 MFMTSFILRPQVQS--GTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 1 ~s~aqval~W~l~~--~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
+|++|+||+|++++ +.++|+|+++++|+++|+++.+++|+++++++|+++.
T Consensus 230 ~s~~q~al~~~l~~~~~~~~i~g~~~~~~l~en~~a~~~~L~~~~~~~i~~~~ 282 (283)
T PF00248_consen 230 VSPAQLALRWVLSHPGVASVIVGASSPEHLEENLAALDFPLTEEELAEIDQIL 282 (283)
T ss_dssp SSHHHHHHHHHHTSHTTEEEEEB-SSHHHHHHHHGGSSSG--HHHHHHHHTTH
T ss_pred cccchhhhhhhhhccccccccCCCCCHHHHHHHHHHhCCCCCHHHHHHHHhhh
Confidence 48999999999965 5999999999999999999999999999999999863
No 12
>cd06660 Aldo_ket_red Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
Probab=99.20 E-value=5.5e-11 Score=73.81 Aligned_cols=50 Identities=32% Similarity=0.460 Sum_probs=46.6
Q ss_pred CCHHHHHHHHHHhC--CCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhh
Q psy15066 1 MFMTSFILRPQVQS--GTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 1 ~s~aqval~W~l~~--~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~ 50 (62)
+|++|++++|.+++ +.++|+|+++++|+++|+++..++|++++++.|+++
T Consensus 234 ~s~~q~al~~~l~~p~~~~~i~g~~~~~~l~~n~~~~~~~L~~~~~~~l~~~ 285 (285)
T cd06660 234 VTPAQVALRWLLQQPGVTSVIPGASSPERLEENLAALDFELSDEDLAALDAL 285 (285)
T ss_pred CCHHHHHHHHHhcCCCCeEEEeCCCCHHHHHHHHhhccCCCCHHHHHHHhhC
Confidence 47899999999999 599999999999999999999999999999999863
No 13
>PLN02587 L-galactose dehydrogenase
Probab=99.20 E-value=5e-11 Score=75.66 Aligned_cols=53 Identities=11% Similarity=0.231 Sum_probs=46.9
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccC----CCCCHHHHHHHHhhhcC
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFD----FTLEQSDVDKIDALDKK 53 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~----~~L~~e~~~~l~~~~~~ 53 (62)
+|++|+||+|+++++ .++|+|+++++|+++|+++.+ ++|+++++++|+++...
T Consensus 243 ~s~aq~al~~~l~~~~v~~~i~G~~~~~~l~~nl~a~~~~~~~~l~~~~~~~l~~~~~~ 301 (314)
T PLN02587 243 KNISKLALQYSLSNKDISTTLVGMNSVQQVEENVAAATELETSGIDEELLSEVEAILAP 301 (314)
T ss_pred CCHHHHHHHHHHhCCCCeeEEecCCCHHHHHHHHHHHhhcccCCCCHHHHHHHHHhhcc
Confidence 589999999999997 579999999999999999865 47999999999987643
No 14
>PRK14863 bifunctional regulator KidO; Provisional
Probab=98.97 E-value=7.3e-10 Score=70.16 Aligned_cols=50 Identities=8% Similarity=-0.057 Sum_probs=44.6
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhh
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~ 50 (62)
+|++|+||+|+++++ +++|+|+++++|+++|+++.+..++++.+++|.--
T Consensus 229 ~s~aqlalaw~l~~p~v~~~I~G~~~~~ql~~n~~a~~~~~~~~~~~~l~~~ 280 (292)
T PRK14863 229 SDPLQAALGFALSRPEGSAVLVGVNSAAELSAVVAAASSPPPDLDWDDMAID 280 (292)
T ss_pred CCHHHHHHHHHHhCCCCCeEEEecCCHHHHHHHHHHHhcCCCccchhhccCC
Confidence 489999999999985 78899999999999999999989999888887643
No 15
>COG4989 Predicted oxidoreductase [General function prediction only]
Probab=98.73 E-value=3.7e-08 Score=62.41 Aligned_cols=53 Identities=17% Similarity=0.194 Sum_probs=48.3
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
+|..+|+++|++.++ ..||.|+.+++|+++.+++.+++|+.+++-+|.....+
T Consensus 240 ~s~~~VaiAWllR~Pa~~~PiiGt~~~eRi~~a~~Al~~~LtRqqWf~Iy~Aa~G 294 (298)
T COG4989 240 VSITAVAIAWLLRHPAKPQPIIGTGNLERIRAAIKALSLTLTRQQWFEIYTAAIG 294 (298)
T ss_pred ccHHHHHHHHHHhCcCcccceecCCCHHHHHHHHHHhhccccHHHHHHHHHHhcc
Confidence 477899999999996 89999999999999999999999999999999876543
No 16
>COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]
Probab=97.97 E-value=1.2e-05 Score=53.18 Aligned_cols=50 Identities=16% Similarity=0.320 Sum_probs=43.7
Q ss_pred CHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccC-C--CCCHHHHHHHHhhh
Q psy15066 2 FMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFD-F--TLEQSDVDKIDALD 51 (62)
Q Consensus 2 s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~-~--~L~~e~~~~l~~~~ 51 (62)
||+..+++|+++++ .+++.|.++++|++||++..+ + +|+++++..|.++.
T Consensus 230 sP~~wa~R~~~shp~V~~vlsGm~~~~~l~enLk~~~~~~p~lte~e~~il~~v~ 284 (391)
T COG1453 230 SPAEWALRYLLSHPEVTTVLSGMNTPEQLEENLKIASELEPSLTEEELQILEKVE 284 (391)
T ss_pred CcHHHHHHHHhcCCCeEEEecCCCCHHHHHHHHHHHhhcCCccCHHHHHHHHHHH
Confidence 68999999999996 899999999999999998653 3 49999998887765
No 17
>KOG1576|consensus
Probab=92.69 E-value=0.36 Score=31.63 Aligned_cols=41 Identities=5% Similarity=-0.040 Sum_probs=33.8
Q ss_pred CCHHHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCH
Q psy15066 1 MFMTSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQ 41 (62)
Q Consensus 1 ~s~aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~ 41 (62)
+..+.+|+.|.++.+ ..+++|..+.++++.|+.+..-.|+.
T Consensus 268 v~l~kLA~~Yam~~~~~~~~lvGm~s~~~l~~nLdan~~~ls~ 310 (342)
T KOG1576|consen 268 VELGKLAMYYAMSLPGVSTVLVGMSSRQLLRINLDANFDRLSS 310 (342)
T ss_pred ccHHHHHHHHHHccCCcceEEecCchHHHHHHHHHhhhccccc
Confidence 356789999999974 88999999999999999975445655
No 18
>PF11116 DUF2624: Protein of unknown function (DUF2624); InterPro: IPR020277 This entry contains proteins with no known function.
Probab=79.23 E-value=6.1 Score=21.31 Aligned_cols=39 Identities=8% Similarity=0.248 Sum_probs=31.7
Q ss_pred CCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCccccCC
Q psy15066 22 SNTVKRITENINVFDFTLEQSDVDKIDALDKKQRSCVMA 60 (62)
Q Consensus 22 ~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~~~~~ 60 (62)
+-+++.+....+-.++++++++.+.|-.+-++..+..|+
T Consensus 14 ~iT~~eLlkyskqy~i~it~~QA~~I~~~lr~k~inIfn 52 (85)
T PF11116_consen 14 NITAKELLKYSKQYNISITKKQAEQIANILRGKNINIFN 52 (85)
T ss_pred cCCHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCCCCC
Confidence 345777888888889999999999999988887776654
No 19
>TIGR02530 flg_new flagellar operon protein. Members of this family are found in a subset of bacterial flagellar operons, generally between genes designated flgD and flgE, in species as diverse as Bacillus halodurans and various other Firmicutes, Geobacter sulfurreducens, and Bdellovibrio bacteriovorus. The specific molecular function is unknown.
Probab=72.24 E-value=7.2 Score=21.51 Aligned_cols=26 Identities=15% Similarity=0.187 Sum_probs=21.6
Q ss_pred HHHHHHhHhccCCCCCHHHHHHHHhh
Q psy15066 25 VKRITENINVFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 25 ~~~l~~n~~~~~~~L~~e~~~~l~~~ 50 (62)
..|..+-++.-++.|++++|+.|++-
T Consensus 18 SkHA~~RL~~R~I~l~~~~~~~i~~a 43 (96)
T TIGR02530 18 SKHALERMRERNISINPDDWKKLLEA 43 (96)
T ss_pred cHHHHHHHHHcCCCCCHHHHHHHHHH
Confidence 46777778888899999999999764
No 20
>PF08145 BOP1NT: BOP1NT (NUC169) domain; InterPro: IPR012953 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This N-terminal domain is found in BOP1-like WD40 proteins. Bop1 is a nucleolar protein involved in rRNA processing, thereby controlling the cell cycle []. It is required for the maturation of the 25S and 5.8S ribosomal RNAs. It may serve as an essential factor in ribosome formation that coordinates processing of the spacer regions in pre-rRNA. The Pes1-Bop1 complex has several components: BOP1, GRWD1, PES1, ORC6L, and RPL3 and is involved in ribosome biogenesis and altered chromosome segregation. The overexpression of BOP1 increases the percentage of multipolar spindles in human cells. Deregulation of the BOP1 pathway may contribute to colorectal tumourigenesis in humans []. Elevated levels of Bop1 induces Bop1/WDR12 and Bop1/Pes1 subcomplexes and the assembly and integrity of the PeBoW complex is highly sensitive to changes in Bop1 protein levels []. Nop7p-Erb1p-Ytm1p, found in yeast, is potentially the homologous complex of Pes1-Bop1-WDR12 as it is involved in the control of ribosome biogenesis and S phase entry. The integrity of the PeBoW complex is required for ribosome biogenesis and cell proliferation in mammalian cells []. In Giardia, the species specific cytoskeleton protein, beta-giardin, interacts with Bop1 []. ; GO: 0006364 rRNA processing, 0005634 nucleus
Probab=65.91 E-value=18 Score=23.37 Aligned_cols=19 Identities=16% Similarity=0.414 Sum_probs=16.2
Q ss_pred CCCCCHHHHHHHHhhhcCC
Q psy15066 36 DFTLEQSDVDKIDALDKKQ 54 (62)
Q Consensus 36 ~~~L~~e~~~~l~~~~~~~ 54 (62)
++.||++|++.|.++-.+.
T Consensus 49 ~v~Lt~eel~lI~ri~~g~ 67 (260)
T PF08145_consen 49 EVVLTDEELELIRRIQKGE 67 (260)
T ss_pred eeeeCHHHHHHHHHHHcCC
Confidence 4789999999999997663
No 21
>PRK10945 gene expression modulator; Provisional
Probab=65.07 E-value=9 Score=20.05 Aligned_cols=40 Identities=8% Similarity=0.177 Sum_probs=30.7
Q ss_pred HHHHHhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 8 LRPQVQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 8 l~W~l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
.-|+++ +-+.++.+-++.-+.....+|+++|+..+.+...
T Consensus 9 ~dyL~~-----fRrcss~eTLEkvie~~~~~L~~~E~~~f~~AaD 48 (72)
T PRK10945 9 TDYLMR-----LRRCQTIDTLERVIEKNKYELSDDELAVFYSAAD 48 (72)
T ss_pred HHHHHH-----HHhcCcHHHHHHHHHHhhccCCHHHHHHHHHHHH
Confidence 346655 4466788899988888888999999988877543
No 22
>PF08270 PRD_Mga: M protein trans-acting positive regulator (MGA) PRD domain; InterPro: IPR013236 Mga is a DNA-binding protein that activates the expression of several important virulence genes in group A streptococcus in response to changing environmental conditions []. This region corresponds to the PRD like region.
Probab=63.47 E-value=9.5 Score=23.36 Aligned_cols=45 Identities=20% Similarity=0.313 Sum_probs=33.6
Q ss_pred HHHHHHHhCC-CeEecCCCCHHHHHHhH--------------hccCCCCCHHHHHHHHhhh
Q psy15066 6 FILRPQVQSG-TIPIPKSNTVKRITENI--------------NVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 6 val~W~l~~~-~~~I~g~~~~~~l~~n~--------------~~~~~~L~~e~~~~l~~~~ 51 (62)
+++.|.. +| .+.+|.+..-+++++-+ ..++++|++++++.|-...
T Consensus 35 ~~l~wkR-~~~~v~ip~~~~~~~lk~l~~~~~~~~~~~~~~~~~~~i~l~~~dldylFl~Y 94 (217)
T PF08270_consen 35 LALNWKR-HGYFVDIPQSKIFDQLKELFIYDEEMEDLKELFEPKLGIELNEEDLDYLFLIY 94 (217)
T ss_pred HHHheee-cCcccCCCCchHHHHHHHHhcCcHHHHHHHHHHHHHhCCCCCHHHHHHHHHHH
Confidence 6788888 55 67789888877777653 2457889999998886554
No 23
>PRK06552 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=59.63 E-value=18 Score=22.26 Aligned_cols=50 Identities=16% Similarity=0.050 Sum_probs=34.8
Q ss_pred HHHHHHhCCCeEecCCCCHHHHHHhHhcc-C---C----CCCHHHHHHHHhhhcCCcc
Q psy15066 7 ILRPQVQSGTIPIPKSNTVKRITENINVF-D---F----TLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 7 al~W~l~~~~~~I~g~~~~~~l~~n~~~~-~---~----~L~~e~~~~l~~~~~~~r~ 56 (62)
.+.++.++++..|||+.+++.+.+-.+.. + + .+..+.++.+.....+.++
T Consensus 101 v~~~~~~~~i~~iPG~~T~~E~~~A~~~Gad~vklFPa~~~G~~~ik~l~~~~p~ip~ 158 (213)
T PRK06552 101 TAKICNLYQIPYLPGCMTVTEIVTALEAGSEIVKLFPGSTLGPSFIKAIKGPLPQVNV 158 (213)
T ss_pred HHHHHHHcCCCEECCcCCHHHHHHHHHcCCCEEEECCcccCCHHHHHHHhhhCCCCEE
Confidence 57788889999999999999998866421 1 1 2455666666665554443
No 24
>PF04746 DUF575: Protein of unknown function (DUF575); InterPro: IPR006835 This represents a conserved region found in a number of Chlamydophila pneumoniae proteins.
Probab=58.86 E-value=6 Score=21.81 Aligned_cols=30 Identities=20% Similarity=0.444 Sum_probs=23.6
Q ss_pred cCCCCHHHHHHhHhccCCCCCHHHHHHHHh
Q psy15066 20 PKSNTVKRITENINVFDFTLEQSDVDKIDA 49 (62)
Q Consensus 20 ~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~ 49 (62)
.|-...+++++.++.....+++||..++..
T Consensus 64 ~G~dhisrve~~Lk~~R~~i~~ed~~KVHG 93 (101)
T PF04746_consen 64 RGRDHISRVEEYLKSLRVTIEPEDLGKVHG 93 (101)
T ss_pred hCCCchHHHHHHHHHhcCCCCccccccccC
Confidence 455667888999998888999998776543
No 25
>PF07862 Nif11: Nitrogen fixation protein of unknown function; InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned [].
Probab=56.59 E-value=16 Score=17.03 Aligned_cols=23 Identities=0% Similarity=0.147 Sum_probs=16.7
Q ss_pred CCCHHHHHHhHhccCCCCCHHHH
Q psy15066 22 SNTVKRITENINVFDFTLEQSDV 44 (62)
Q Consensus 22 ~~~~~~l~~n~~~~~~~L~~e~~ 44 (62)
..+++.+..-.+..+|.+|.+|+
T Consensus 26 ~~~~~e~~~lA~~~Gy~ft~~el 48 (49)
T PF07862_consen 26 CQNPEEVVALAREAGYDFTEEEL 48 (49)
T ss_pred cCCHHHHHHHHHHcCCCCCHHHh
Confidence 34677777777777788887775
No 26
>PRK14981 DNA-directed RNA polymerase subunit F; Provisional
Probab=56.23 E-value=21 Score=19.86 Aligned_cols=33 Identities=21% Similarity=0.238 Sum_probs=23.9
Q ss_pred CCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 21 KSNTVKRITENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 21 g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
.+.+.+.++.-+...+..+++|+++.|-..-..
T Consensus 77 ~P~~~dElrai~~~~~~~~~~e~l~~ILd~l~k 109 (112)
T PRK14981 77 LPETRDELRAIFAKERYTLSPEELDEILDIVKK 109 (112)
T ss_pred CCCCHHHHHHHHHHhccCCCHHHHHHHHHHHHH
Confidence 345677777777777788888888888665443
No 27
>PRK00819 RNA 2'-phosphotransferase; Reviewed
Probab=55.13 E-value=41 Score=20.33 Aligned_cols=50 Identities=12% Similarity=0.048 Sum_probs=34.3
Q ss_pred HHHHHHhCC------CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCcc
Q psy15066 7 ILRPQVQSG------TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 7 al~W~l~~~------~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~ 56 (62)
.|+|+|.|+ ..=-=|-...+.+-+.++.....++.+++.++-.-+...|+
T Consensus 12 ~LS~lLRH~p~~~GL~ld~~G~v~v~~Ll~~~~~~~~~~t~~~l~~vV~~d~K~Rf 67 (179)
T PRK00819 12 FLSGVLRHKPEAIGLTLDEEGWVDIDALIEALAKAYKWVTRELLEAVVESDDKGRF 67 (179)
T ss_pred HHHHHHCCCHHHcCCccCCCCCEEHHHHHHHHHHccCCCCHHHHHHHHHcCCCcce
Confidence 588999875 11223556677777766543345899999999887777665
No 28
>PF01885 PTS_2-RNA: RNA 2'-phosphotransferase, Tpt1 / KptA family; InterPro: IPR002745 The final step of tRNA splicing in Saccharomyces cerevisiae (Baker's yeast) requires 2'-phosphotransferase (Tpt1) to transfer the 2'-phosphate from ligated tRNA to NAD, producing mature tRNA and ADP ribose-1' '-2' '-cyclic phosphate. Yeast and Mus musculus (Mouse) Tpt1 protein and bacterial KptA protein can catalyze the conversion of the generated intermediate to both product and the original substrate, these enzymes likely use the same reaction mechanism. Step 1 of this reaction is strikingly similar to the ADP-ribosylation of proteins catalyzed by a number of bacterial toxins. KptA, a functional Tpt1 protein homologue from Escherichia coli is strikingly similar to yeast Tpt1 in its kinetic parameters, although E. coli is not known to have a 2'-phosphorylated RNA substrate [,].; GO: 0016772 transferase activity, transferring phosphorus-containing groups, 0006388 tRNA splicing, via endonucleolytic cleavage and ligation; PDB: 1WFX_A.
Probab=54.71 E-value=27 Score=21.12 Aligned_cols=50 Identities=16% Similarity=0.197 Sum_probs=28.4
Q ss_pred HHHHHHhCC------CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCcc
Q psy15066 7 ILRPQVQSG------TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 7 al~W~l~~~------~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~ 56 (62)
.|+|+|.|+ ..--=|-...+.+-+..+.....++.+++.++-.-+...||
T Consensus 11 ~Ls~lLRH~p~~~gL~~d~~G~v~v~dLL~~~~~~~~~~t~~~i~~vV~~~~K~Rf 66 (186)
T PF01885_consen 11 ALSYLLRHGPEKEGLVMDPDGWVSVDDLLRALRFKGLWVTEEDIREVVETDDKQRF 66 (186)
T ss_dssp HHHHHHHT-TGGGT----TT--EEHHHHHHHHHHT-TT--HHHHHHHHHH-SS--E
T ss_pred HHHHHhCCChhhcCCccCCCCCEeHHHHHHHHHHcCCCCCHHHHHHHHhhCCCCCe
Confidence 588999985 11123455677777766655556899999999887777775
No 29
>TIGR01457 HAD-SF-IIA-hyp2 HAD-superfamily subfamily IIA hydrolase, TIGR01457. This hypothetical equivalog is a member of the Class IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. The sequences modelled by this equivalog are all gram positive (low-GC) bacteria. Sequences found in this model are annotated variously as related to NagD or 4-nitrophenyl phosphatase, and this hypothetical equivalog, of all of those within the Class IIA subfamily, is most closely related to the E. coli NagD enzyme and the PGP_euk equivalog (TIGR01452). However, there is presently no evidence that this hypothetical equivalog has the same function of either those.
Probab=50.46 E-value=36 Score=21.01 Aligned_cols=40 Identities=13% Similarity=0.201 Sum_probs=28.9
Q ss_pred HHHHHHHHhCC--CeEecC--CCCHHHHHHhHhccCCCCCHHHH
Q psy15066 5 SFILRPQVQSG--TIPIPK--SNTVKRITENINVFDFTLEQSDV 44 (62)
Q Consensus 5 qval~W~l~~~--~~~I~g--~~~~~~l~~n~~~~~~~L~~e~~ 44 (62)
.-+|+.+.++| +..+.+ +++.+.+.+-++...++.+++++
T Consensus 23 ~~~l~~l~~~g~~~~~~Tnn~~r~~~~~~~~l~~~g~~~~~~~i 66 (249)
T TIGR01457 23 ETFVHELQKRDIPYLFVTNNSTRTPESVAEMLASFDIPATLETV 66 (249)
T ss_pred HHHHHHHHHCCCeEEEEeCCCCCCHHHHHHHHHHcCCCCChhhE
Confidence 34667777777 445554 57889999988888888877665
No 30
>PF11338 DUF3140: Protein of unknown function (DUF3140); InterPro: IPR021487 Some members in this family of proteins are annotated as DNA binding proteins. No function is currently known.
Probab=50.44 E-value=34 Score=18.70 Aligned_cols=50 Identities=10% Similarity=-0.024 Sum_probs=35.2
Q ss_pred CHHHHHHHHHHhCC-------C--eEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 2 FMTSFILRPQVQSG-------T--IPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 2 s~aqval~W~l~~~-------~--~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
|+.++ =.|+.... . .=-+|..+-.+|.+.++--.-.|+++|++.+.++..
T Consensus 9 t~~EL-~~WL~t~~S~~~g~~~~~~es~Gh~sGRrIv~IL~K~k~dltddD~~hMrkVV~ 67 (92)
T PF11338_consen 9 TPAEL-EDWLRTDESKSVGEKKDGGESVGHESGRRIVEILRKRKTDLTDDDYEHMRKVVG 67 (92)
T ss_pred CHHHH-HHHHcCccccccccCCCCCcccCcchhhHHHHHHhcCcccCCHHHHHHHHHHHH
Confidence 44443 36776642 1 223567778899999887777899999999988763
No 31
>PF03874 RNA_pol_Rpb4: RNA polymerase Rpb4; InterPro: IPR005574 The eukaryotic RNA polymerase subunits RPB4 and RPB7 form a heterodimer that reversibly associates with the RNA polymerase II core. Archaeal cells contain a single RNAP made up of about 12 subunits, displaying considerable homology to the eukaryotic RNAPII subunits. The RPB4 and RPB7 homologs are called subunits F and E, respectively, and have been shown to form a stable heterodimer. While the RPB7 homologue is reasonably well conserved, the similarity between the eukaryotic RPB4 and the archaeal F subunit is barely detectable [].; GO: 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3AYH_A 3H3V_E 4A3C_D 3PO3_D 3HOX_D 2R92_D 3HOU_D 1Y77_D 2R7Z_D 3QT1_D ....
Probab=50.34 E-value=20 Score=19.56 Aligned_cols=28 Identities=18% Similarity=0.304 Sum_probs=13.6
Q ss_pred CCHHHHHHhHhccCCCCCHHHHHHHHhh
Q psy15066 23 NTVKRITENINVFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 23 ~~~~~l~~n~~~~~~~L~~e~~~~l~~~ 50 (62)
++...+..-+...+-.+++++++.|-.+
T Consensus 85 ~~~~El~~ii~~~~~r~~ee~l~~iL~~ 112 (117)
T PF03874_consen 85 TTAVELRAIIESLESRFSEEDLEEILDL 112 (117)
T ss_dssp SSHHHHHHHSTTGTTTSTHHHHHHHHHH
T ss_pred CCHHHHHHHHHHhccCCCHHHHHHHHHH
Confidence 3444444444444444555555555433
No 32
>PRK07114 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=50.28 E-value=32 Score=21.48 Aligned_cols=49 Identities=10% Similarity=-0.018 Sum_probs=33.1
Q ss_pred HHHHHHhCCCeEecCCCCHHHHHHhHhcc-C-CC------CCHHHHHHHHhhhcCCc
Q psy15066 7 ILRPQVQSGTIPIPKSNTVKRITENINVF-D-FT------LEQSDVDKIDALDKKQR 55 (62)
Q Consensus 7 al~W~l~~~~~~I~g~~~~~~l~~n~~~~-~-~~------L~~e~~~~l~~~~~~~r 55 (62)
.++++.++++..|||..++..+.+-++.. + ++ +...-++.|.....+-+
T Consensus 104 v~~~~~~~~i~~iPG~~TpsEi~~A~~~Ga~~vKlFPA~~~G~~~ikal~~p~p~i~ 160 (222)
T PRK07114 104 IAKVCNRRKVPYSPGCGSLSEIGYAEELGCEIVKLFPGSVYGPGFVKAIKGPMPWTK 160 (222)
T ss_pred HHHHHHHcCCCEeCCCCCHHHHHHHHHCCCCEEEECcccccCHHHHHHHhccCCCCe
Confidence 57888899999999999999998876532 1 12 33444555555544444
No 33
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=48.72 E-value=34 Score=16.87 Aligned_cols=26 Identities=8% Similarity=0.095 Sum_probs=21.3
Q ss_pred CCCHHHHHHhHhccCCCCCHHHHHHH
Q psy15066 22 SNTVKRITENINVFDFTLEQSDVDKI 47 (62)
Q Consensus 22 ~~~~~~l~~n~~~~~~~L~~e~~~~l 47 (62)
+.+++.+-...+...|.+|.+|+...
T Consensus 24 ~~~~e~~~~lA~~~Gf~ft~~el~~~ 49 (64)
T TIGR03798 24 AEDPEDRVAIAKEAGFEFTGEDLKEA 49 (64)
T ss_pred cCCHHHHHHHHHHcCCCCCHHHHHHH
Confidence 46688888888888999999999753
No 34
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=48.59 E-value=21 Score=19.31 Aligned_cols=30 Identities=13% Similarity=0.194 Sum_probs=24.9
Q ss_pred CCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 23 NTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 23 ~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
++.+++.++-..|+=+|++++++.+.++..
T Consensus 36 r~~D~L~~~ds~F~D~lTpDQVrAlHRlvT 65 (92)
T PHA02681 36 RNKDLLPPGASSFEDKMTDDQVRAFHALVT 65 (92)
T ss_pred cccccCCCCCchhhccCCHHHHHHHHHHHh
Confidence 677777777778888999999999988863
No 35
>PRK10391 oriC-binding nucleoid-associated protein; Provisional
Probab=48.16 E-value=17 Score=18.95 Aligned_cols=33 Identities=9% Similarity=0.264 Sum_probs=25.1
Q ss_pred ecCCCCHHHHHHhHhccCCCCCH-HHHHHHHhhh
Q psy15066 19 IPKSNTVKRITENINVFDFTLEQ-SDVDKIDALD 51 (62)
Q Consensus 19 I~g~~~~~~l~~n~~~~~~~L~~-e~~~~l~~~~ 51 (62)
.-+.++.+-++.-+.....+|++ +|+..+.+..
T Consensus 10 fRkcss~eTLEkv~e~~~y~L~~~~e~~~f~~Aa 43 (71)
T PRK10391 10 FRKISSLESLEKLFDHLNYTLTDDQEIINMYRAA 43 (71)
T ss_pred HHhcCcHHHHHHHHHHhhcccCCHHHHHHHHHHH
Confidence 34567888888888888888997 7887776654
No 36
>PRK06015 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=48.07 E-value=37 Score=20.91 Aligned_cols=49 Identities=8% Similarity=-0.022 Sum_probs=33.0
Q ss_pred HHHHHHhCCCeEecCCCCHHHHHHhHhcc-C-CC------C-CHHHHHHHHhhhcCCc
Q psy15066 7 ILRPQVQSGTIPIPKSNTVKRITENINVF-D-FT------L-EQSDVDKIDALDKKQR 55 (62)
Q Consensus 7 al~W~l~~~~~~I~g~~~~~~l~~n~~~~-~-~~------L-~~e~~~~l~~~~~~~r 55 (62)
.++++.++++..+||..++..+..-.+.. + ++ + -..-++.|.....+-+
T Consensus 89 vi~~a~~~~i~~iPG~~TptEi~~A~~~Ga~~vK~FPa~~~GG~~yikal~~plp~~~ 146 (201)
T PRK06015 89 LLAAANDSDVPLLPGAATPSEVMALREEGYTVLKFFPAEQAGGAAFLKALSSPLAGTF 146 (201)
T ss_pred HHHHHHHcCCCEeCCCCCHHHHHHHHHCCCCEEEECCchhhCCHHHHHHHHhhCCCCc
Confidence 56788889999999999999998876532 1 11 2 2445566655554444
No 37
>COG1460 Uncharacterized protein conserved in archaea [Function unknown]
Probab=47.69 E-value=33 Score=19.50 Aligned_cols=33 Identities=18% Similarity=0.184 Sum_probs=26.4
Q ss_pred CCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 21 KSNTVKRITENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 21 g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
.+++++.++..+.-....+++|+++.|-.+...
T Consensus 78 ~P~t~~ElRsIla~e~~~~s~E~l~~Ildiv~K 110 (114)
T COG1460 78 MPRTPDELRSILAKERVMLSDEELDKILDIVDK 110 (114)
T ss_pred CCCCHHHHHHHHHHccCCCCHHHHHHHHHHHHH
Confidence 456788888888888888999999998776544
No 38
>TIGR00135 gatC glutamyl-tRNA(Gln) and/or aspartyl-tRNA(Asn) amidotransferase, C subunit. This model has been revised to remove the candidate sequence from Methanococcus jannaschii, now part of a related model.
Probab=47.45 E-value=42 Score=17.68 Aligned_cols=24 Identities=4% Similarity=0.180 Sum_probs=17.1
Q ss_pred HHHHHHhHhccCCCCCHHHHHHHH
Q psy15066 25 VKRITENINVFDFTLEQSDVDKID 48 (62)
Q Consensus 25 ~~~l~~n~~~~~~~L~~e~~~~l~ 48 (62)
.+.++.-.+...+.|++++.+.+.
T Consensus 3 ~~~v~~lA~La~L~l~eee~~~~~ 26 (93)
T TIGR00135 3 DEEVKHLAKLARLELSEEEAESFA 26 (93)
T ss_pred HHHHHHHHHHhCCCCCHHHHHHHH
Confidence 456666666778889999976543
No 39
>KOG0650|consensus
Probab=47.34 E-value=15 Score=26.62 Aligned_cols=29 Identities=21% Similarity=0.305 Sum_probs=21.1
Q ss_pred ccCCCCCHHHHHHHHhhhcCCcc-ccCCCC
Q psy15066 34 VFDFTLEQSDVDKIDALDKKQRS-CVMAPY 62 (62)
Q Consensus 34 ~~~~~L~~e~~~~l~~~~~~~r~-~~~~~~ 62 (62)
..++.||+||++.|.++.++.-. .+|+||
T Consensus 177 G~dv~LTkEev~lI~ri~~g~~pd~d~dpY 206 (733)
T KOG0650|consen 177 GKDVNLTKEEVKLIRRIARGEIPDPDFDPY 206 (733)
T ss_pred CceeeecHHHHHHHHHHHcccCCCCCCCcc
Confidence 45788999999999999876332 245554
No 40
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=45.45 E-value=32 Score=15.77 Aligned_cols=28 Identities=14% Similarity=0.239 Sum_probs=20.8
Q ss_pred CCHHHHHHhHhccCCC-CCHHHHHHHHhh
Q psy15066 23 NTVKRITENINVFDFT-LEQSDVDKIDAL 50 (62)
Q Consensus 23 ~~~~~l~~n~~~~~~~-L~~e~~~~l~~~ 50 (62)
-+.+.+..-+...... ++++++..|-..
T Consensus 5 i~~~~~~~~l~~~g~~~~s~~e~~~l~~~ 33 (54)
T PF13833_consen 5 ITREEFRRALSKLGIKDLSEEEVDRLFRE 33 (54)
T ss_dssp EEHHHHHHHHHHTTSSSSCHHHHHHHHHH
T ss_pred ECHHHHHHHHHHhCCCCCCHHHHHHHHHh
Confidence 3567777777666778 999999887554
No 41
>PRK00034 gatC aspartyl/glutamyl-tRNA amidotransferase subunit C; Reviewed
Probab=45.29 E-value=47 Score=17.47 Aligned_cols=26 Identities=12% Similarity=0.235 Sum_probs=19.1
Q ss_pred CCHHHHHHhHhccCCCCCHHHHHHHH
Q psy15066 23 NTVKRITENINVFDFTLEQSDVDKID 48 (62)
Q Consensus 23 ~~~~~l~~n~~~~~~~L~~e~~~~l~ 48 (62)
-+.+.++.-.+...+.|++++.+.+.
T Consensus 3 i~~e~i~~la~La~l~l~~ee~~~~~ 28 (95)
T PRK00034 3 ITREEVKHLAKLARLELSEEELEKFA 28 (95)
T ss_pred CCHHHHHHHHHHhCCCCCHHHHHHHH
Confidence 35667777777778899999876543
No 42
>PF01081 Aldolase: KDPG and KHG aldolase; InterPro: IPR000887 4-Hydroxy-2-oxoglutarate aldolase (4.1.3.16 from EC) (KHG-aldolase) catalyzes the interconversion of 4-hydroxy-2-oxoglutarate into pyruvate and glyoxylate. Phospho-2-dehydro-3-deoxygluconate aldolase (4.1.2.14 from EC) (KDPG-aldolase) catalyzes the interconversion of 6-phospho-2-dehydro-3-deoxy-D-gluconate into pyruvate and glyceraldehyde 3-phosphate. These two enzymes are structurally and functionally related []. They are both homotrimeric proteins of approximately 220 amino-acid residues. They are class I aldolases whose catalytic mechanism involves the formation of a Schiff-base intermediate between the substrate and the epsilon-amino group of a lysine residue. In both enzymes, an arginine is required for catalytic activity.; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3VCR_A 1FQ0_A 1EUN_A 1EUA_B 1FWR_A 2C0A_B 1WBH_A 1WAU_A 2YW3_B 2YW4_A ....
Probab=45.04 E-value=27 Score=21.39 Aligned_cols=50 Identities=16% Similarity=0.119 Sum_probs=34.1
Q ss_pred HHHHHHhCCCeEecCCCCHHHHHHhHhcc----C-C---CCC-HHHHHHHHhhhcCCcc
Q psy15066 7 ILRPQVQSGTIPIPKSNTVKRITENINVF----D-F---TLE-QSDVDKIDALDKKQRS 56 (62)
Q Consensus 7 al~W~l~~~~~~I~g~~~~~~l~~n~~~~----~-~---~L~-~e~~~~l~~~~~~~r~ 56 (62)
.+.++.++++..+||..++..+..-.+.. + | .+. ..-++.|.....+-++
T Consensus 93 v~~~~~~~~i~~iPG~~TptEi~~A~~~G~~~vK~FPA~~~GG~~~ik~l~~p~p~~~~ 151 (196)
T PF01081_consen 93 VIEYAREYGIPYIPGVMTPTEIMQALEAGADIVKLFPAGALGGPSYIKALRGPFPDLPF 151 (196)
T ss_dssp HHHHHHHHTSEEEEEESSHHHHHHHHHTT-SEEEETTTTTTTHHHHHHHHHTTTTT-EE
T ss_pred HHHHHHHcCCcccCCcCCHHHHHHHHHCCCCEEEEecchhcCcHHHHHHHhccCCCCeE
Confidence 56788889999999999999998866431 1 1 234 5666666665555443
No 43
>TIGR01182 eda Entner-Doudoroff aldolase. 2-deydro-3-deoxyphosphogluconate aldolase (EC 4.1.2.14) is an enzyme of the Entner-Doudoroff pathway. This aldolase has another function, 4-hydroxy-2-oxoglutarate aldolase (EC 4.1.3.16) shown experimentally in Escherichia coli and Pseudomonas putida
Probab=44.55 E-value=46 Score=20.51 Aligned_cols=28 Identities=18% Similarity=0.230 Sum_probs=23.7
Q ss_pred HHHHHHhCCCeEecCCCCHHHHHHhHhc
Q psy15066 7 ILRPQVQSGTIPIPKSNTVKRITENINV 34 (62)
Q Consensus 7 al~W~l~~~~~~I~g~~~~~~l~~n~~~ 34 (62)
.+.++.++++..+||..++..+..-++.
T Consensus 93 v~~~~~~~~i~~iPG~~TptEi~~A~~~ 120 (204)
T TIGR01182 93 LAKHAQDHGIPIIPGVATPSEIMLALEL 120 (204)
T ss_pred HHHHHHHcCCcEECCCCCHHHHHHHHHC
Confidence 4677888899999999999999887653
No 44
>PF07308 DUF1456: Protein of unknown function (DUF1456); InterPro: IPR009921 This domain occurs in several hypothetical bacterial proteins of around 150 residues in length. The function of this domain is unknown.
Probab=43.78 E-value=46 Score=16.96 Aligned_cols=28 Identities=7% Similarity=0.130 Sum_probs=22.6
Q ss_pred CHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 24 TVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 24 ~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
+.+.|.+.++..++++|++++..+-+-.
T Consensus 15 ~d~~m~~if~l~~~~vs~~el~a~lrke 42 (68)
T PF07308_consen 15 KDDDMIEIFALAGFEVSKAELSAWLRKE 42 (68)
T ss_pred ChHHHHHHHHHcCCccCHHHHHHHHCCC
Confidence 4567888888889999999998876554
No 45
>PF14407 Frankia_peptide: Ribosomally synthesized peptide prototyped by Frankia Franean1_4349.
Probab=43.19 E-value=24 Score=17.88 Aligned_cols=18 Identities=17% Similarity=0.501 Sum_probs=14.4
Q ss_pred ccCCCCCHHHHHHHHhhh
Q psy15066 34 VFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 34 ~~~~~L~~e~~~~l~~~~ 51 (62)
+.++.|++|+++.|+++.
T Consensus 27 ~~~Y~Lt~eeL~al~~~~ 44 (61)
T PF14407_consen 27 CKEYDLTPEELQALERLD 44 (61)
T ss_pred HhhCCCCHHHHHHHHHHH
Confidence 345889999999998664
No 46
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=42.63 E-value=34 Score=15.11 Aligned_cols=26 Identities=8% Similarity=-0.114 Sum_probs=16.4
Q ss_pred HHHHHhCCCeEecCCCCHHHHHHhHh
Q psy15066 8 LRPQVQSGTIPIPKSNTVKRITENIN 33 (62)
Q Consensus 8 l~W~l~~~~~~I~g~~~~~~l~~n~~ 33 (62)
-.|+..+|+.+=.++.+.+++.+.++
T Consensus 10 ~~wL~~~gi~~~~~~~~rd~Ll~~~k 35 (38)
T PF10281_consen 10 KSWLKSHGIPVPKSAKTRDELLKLAK 35 (38)
T ss_pred HHHHHHcCCCCCCCCCCHHHHHHHHH
Confidence 46777777665555546777666544
No 47
>COG5201 SKP1 SCF ubiquitin ligase, SKP1 component [Posttranslational modification, protein turnover, chaperones]
Probab=42.44 E-value=21 Score=21.07 Aligned_cols=41 Identities=15% Similarity=0.275 Sum_probs=29.5
Q ss_pred HHHhCC---CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 10 PQVQSG---TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 10 W~l~~~---~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
+++.-| ++-..++.+++.+.+.+... -.+|+|++.+|++-+
T Consensus 109 pLLd~gCKivaemirgkSpeeir~tfni~-ndfTpEEe~~irkEn 152 (158)
T COG5201 109 PLLDLGCKIVAEMIRGKSPEEIRETFNIE-NDFTPEEERRIRKEN 152 (158)
T ss_pred HHHHHHHHHHHHHHccCCHHHHHHHhCCC-CCCCHHHHHHHHHhc
Confidence 445544 44457889999999877543 359999999998754
No 48
>PTZ00315 2'-phosphotransferase; Provisional
Probab=39.55 E-value=84 Score=22.64 Aligned_cols=51 Identities=16% Similarity=0.143 Sum_probs=37.1
Q ss_pred HHHHHHhCC----CeEe--cCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCccc
Q psy15066 7 ILRPQVQSG----TIPI--PKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRSC 57 (62)
Q Consensus 7 al~W~l~~~----~~~I--~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~~ 57 (62)
.|+|+|.|+ -..+ =|-...+.+.+..+.....++.++++++=+-+...||.
T Consensus 384 ~LS~lLRH~pe~~GL~ld~~Gwv~vd~LL~~~~~~~~~~t~e~i~~VV~~ndK~RF~ 440 (582)
T PTZ00315 384 FMSRLLRHKADQWRVPITSNGYVLLDDILRQPPMRNDPVSVQDVARVVRDSDKQRFK 440 (582)
T ss_pred HHHHHHcCCHhhcCCCcCCCCCEEHHHHHHHHHhcCCCCCHHHHHHHHHcCCCCceE
Confidence 589999986 1222 56677788877666544568999999998888777763
No 49
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=39.36 E-value=46 Score=25.42 Aligned_cols=47 Identities=15% Similarity=0.203 Sum_probs=34.0
Q ss_pred HHHHHHHHhCC----CeEecCCCCHHHHHH----hHhccCCCCCHHHHHHHHhhh
Q psy15066 5 SFILRPQVQSG----TIPIPKSNTVKRITE----NINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 5 qval~W~l~~~----~~~I~g~~~~~~l~~----n~~~~~~~L~~e~~~~l~~~~ 51 (62)
.+.-.|++.++ ++++|.+.-.+++++ -++...-.|++|+.++|.+-.
T Consensus 438 ~LI~ky~l~N~h~~~v~~~Ps~~~~~~~ekee~e~L~~~~~~l~de~~~ki~~~~ 492 (978)
T COG1026 438 KLIRKYFLDNPHYVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDS 492 (978)
T ss_pred HHHHHHhhcCCccEEEEEecChHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHH
Confidence 46678999986 566677666666655 345566789999999887643
No 50
>cd07321 Extradiol_Dioxygenase_3A_like Subunit A of Class III extradiol dioxygenases. Extradiol dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. There are two major groups of dioxygenases according to the cleavage site of the aromatic ring. Intradiol enzymes cleave the aromatic ring between two hydroxyl groups, whereas extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Extradiol dioxygenases can be divided into three classes. Class I and II enzymes are evolutionary related and show sequence similarity, with the two domain class II enzymes evolving from the class I enzyme through gene duplication. Class III enzymes are different in sequence and structure and usually have two subunits, designated A and B, which form a tetramer composed of two copies of each subunit. This model represents subunit A of c
Probab=39.31 E-value=34 Score=17.73 Aligned_cols=28 Identities=14% Similarity=0.247 Sum_probs=19.2
Q ss_pred CHHHHHHhHhcc--CCCCCHHHHHHHHhhh
Q psy15066 24 TVKRITENINVF--DFTLEQSDVDKIDALD 51 (62)
Q Consensus 24 ~~~~l~~n~~~~--~~~L~~e~~~~l~~~~ 51 (62)
+.++.+++-++. .+.||+||.+.|.+-+
T Consensus 18 ~re~f~~dp~a~~~~~~Lt~eE~~al~~rD 47 (77)
T cd07321 18 VKERFKADPEAVLAEYGLTPEEKAALLARD 47 (77)
T ss_pred HHHHHHhCHHHHHHHcCCCHHHHHHHHcCC
Confidence 445556655443 6789999999887643
No 51
>TIGR02744 TrbI_Ftype type-F conjugative transfer system protein TrbI. This protein is an essential component of the F-type conjugative transfer sytem for plasmid DNA transfer and has been shown to be localized to the periplasm.
Probab=39.25 E-value=65 Score=18.20 Aligned_cols=44 Identities=7% Similarity=0.050 Sum_probs=28.5
Q ss_pred HHHHHHHhCCCeEecCCCCHHHHHHhHh--ccCCCCCHHHHHHHHh
Q psy15066 6 FILRPQVQSGTIPIPKSNTVKRITENIN--VFDFTLEQSDVDKIDA 49 (62)
Q Consensus 6 val~W~l~~~~~~I~g~~~~~~l~~n~~--~~~~~L~~e~~~~l~~ 49 (62)
.++.|.+.+--.|.+-+.+.++.-+.+. +.+-+||+++.+.+..
T Consensus 20 ~~~s~~v~~~~~P~iV~fdmk~tld~F~~q~~~~~lte~q~~~~~~ 65 (112)
T TIGR02744 20 ALVSYGIVRLNSPVTVAFDMKQTLDAFFDSASQKKLSEAQQKALLG 65 (112)
T ss_pred HHHHHHhhhcCCCeEEEEecHHHHHHHHHHHhhcCCCHHHHHHHHH
Confidence 4577888775555555556666666543 3467899999876643
No 52
>PF05321 HHA: Haemolysin expression modulating protein; InterPro: IPR007985 This family consists of haemolysin expression modulating protein (Hha) from Escherichia coli and its enterobacterial homologues, such as YmoA from Yersinia enterocolitica, and RmoA encoded on the R100 plasmid. These proteins act as modulators of bacterial gene expression. Members of the Hha/YmoA/RmoA family act in conjunction with members of the H-NS family, participating in the thermoregulation of different virulence factors and in plasmid transfer []. Hha, along with the chromatin-associated protein H-NS, is involved in the regulation of expression of the toxin alpha-haemolysin in response to osmolarity and temperature []. YmoA modulates the expression of various virulence factors, such as Yop proteins and YadA adhesin, in response to temperature. RmoA is a plasmid R100 modulator involved in plasmid transfer []. The HHA family of proteins display striking similarity to the oligomerization domain of the H-NS proteins.; PDB: 1JW2_A 2K5S_A 2JQT_A.
Probab=39.17 E-value=10 Score=18.98 Aligned_cols=32 Identities=6% Similarity=0.305 Sum_probs=21.7
Q ss_pred CCCCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 21 KSNTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 21 g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
+.++.+-++.-+.....+|+.+|+..+.+...
T Consensus 5 rcss~dtLEkv~e~~~~~L~~~e~~~f~~AaD 36 (57)
T PF05321_consen 5 RCSSLDTLEKVIEHNRYKLTDDELEAFNSAAD 36 (57)
T ss_dssp TSS-HHHHHHHHHHHHHHS-HHHHHHHHHHHH
T ss_pred ccCCHhhHHHHHHHhhCcCCHHHHHHHHHHHH
Confidence 34566777777766667799999988877653
No 53
>PF12162 STAT1_TAZ2bind: STAT1 TAZ2 binding domain; InterPro: IPR022752 This entry represents the C-terminal domain of STAT1, which selectively binds the TAZ2 domain of CRB (CREB-binding protein) []. This group of eukaryotic proteins is approximately 20 amino acids in length, and is found in association with PF02865 from PFAM, PF00017 from PFAM, PF01017 from PFAM, PF02864 from PFAM. By binding to CRB, it becomes a transcriptional activator and can initiate transcription of certain genes. ; GO: 0003700 sequence-specific DNA binding transcription factor activity; PDB: 2KA6_B.
Probab=39.03 E-value=33 Score=14.00 Aligned_cols=15 Identities=13% Similarity=0.430 Sum_probs=9.9
Q ss_pred CCCCHHHHHHHHhhh
Q psy15066 37 FTLEQSDVDKIDALD 51 (62)
Q Consensus 37 ~~L~~e~~~~l~~~~ 51 (62)
.++++||.+++.++.
T Consensus 8 mPMSPddy~~l~~~V 22 (23)
T PF12162_consen 8 MPMSPDDYDELERMV 22 (23)
T ss_dssp --S-HHHHHHHHHHH
T ss_pred cCCCHHHHHHHHHhh
Confidence 468999999987653
No 54
>TIGR01458 HAD-SF-IIA-hyp3 HAD-superfamily subfamily IIA hydrolase, TIGR01458. This hypothetical equivalog is a member of the IIA subfamily (TIGR01460) of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. One sequence (GP|10716807) has been annotated as a "phospholysine phosphohistidine inorganic pyrophosphatase," probably in reference to studies on similarly described (but unsequenced) enzymes from bovine and rat tissues. However, the supporting information for this annotation has never been published.
Probab=37.84 E-value=86 Score=19.48 Aligned_cols=39 Identities=8% Similarity=0.198 Sum_probs=26.4
Q ss_pred HHHHHHHhCC--CeEecC--CCCHHHHHHhHhccCCCCCHHHH
Q psy15066 6 FILRPQVQSG--TIPIPK--SNTVKRITENINVFDFTLEQSDV 44 (62)
Q Consensus 6 val~W~l~~~--~~~I~g--~~~~~~l~~n~~~~~~~L~~e~~ 44 (62)
=+++.+.++| +..+.+ +++++++.+-++..++.++++++
T Consensus 28 ~al~~l~~~G~~~~~~Tn~~~~~~~~~~~~l~~~g~~~~~~~i 70 (257)
T TIGR01458 28 EAVKRLRGASVKVRFVTNTTKESKQDLLERLQRLGFDISEDEV 70 (257)
T ss_pred HHHHHHHHCCCeEEEEECCCCCCHHHHHHHHHHcCCCCCHHHe
Confidence 3567777777 444454 34556788888888888887764
No 55
>TIGR02311 HpaI 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase. This model represents the aldolase which performs the final step unique to the 4-hydroxyphenylacetic acid catabolism pathway in which 2,4-dihydroxyhept-2-ene-1,7-dioic acid is split into pyruvate and succinate-semialdehyde. The gene for enzyme is generally found adjacent to other genes for this pathway organized into an operon.
Probab=37.39 E-value=64 Score=20.22 Aligned_cols=31 Identities=13% Similarity=0.378 Sum_probs=24.3
Q ss_pred HHHHHHhCC--CeEecCCCCHHHHHHhHhccCC
Q psy15066 7 ILRPQVQSG--TIPIPKSNTVKRITENINVFDF 37 (62)
Q Consensus 7 al~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~ 37 (62)
.+..++..| .+++|...+++++++-.++..+
T Consensus 76 ~i~~~Ld~Ga~gIivP~v~s~e~a~~~v~~~~y 108 (249)
T TIGR02311 76 LIKQLLDIGAQTLLVPMIETAEQAEAAVAATRY 108 (249)
T ss_pred HHHHHhCCCCCEEEecCcCCHHHHHHHHHHcCC
Confidence 456777776 6888999999999988777654
No 56
>PRK05439 pantothenate kinase; Provisional
Probab=36.85 E-value=58 Score=21.37 Aligned_cols=30 Identities=10% Similarity=0.259 Sum_probs=22.0
Q ss_pred CHHHHHHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 24 TVKRITENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 24 ~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
+.+++..-.+.....|+++|++.|.+++..
T Consensus 14 ~r~~w~~l~~~~~~~l~~~~~~~l~~~~~~ 43 (311)
T PRK05439 14 SREQWAALRDSTPLTLTEEELERLRGLNDP 43 (311)
T ss_pred CHHHHHHHHhcCCCCCCHHHHHHhhcCCCC
Confidence 355666655667788999999999887643
No 57
>PF13986 DUF4224: Domain of unknown function (DUF4224)
Probab=36.33 E-value=11 Score=17.83 Aligned_cols=12 Identities=8% Similarity=0.415 Sum_probs=7.8
Q ss_pred CCHHHHHHHHhh
Q psy15066 39 LEQSDVDKIDAL 50 (62)
Q Consensus 39 L~~e~~~~l~~~ 50 (62)
||+||+.+|...
T Consensus 3 LT~~El~elTG~ 14 (47)
T PF13986_consen 3 LTDEELQELTGY 14 (47)
T ss_pred CCHHHHHHHHCC
Confidence 667777666554
No 58
>PF12551 PHBC_N: Poly-beta-hydroxybutyrate polymerase N terminal; InterPro: IPR022211 This domain family is found in bacteria and eukaryotes, and is approximately 50 amino acids in length. The family is found in association with PF07167 from PFAM, PF00561 from PFAM. There is a single completely conserved residue W that may be functionally important. PHBC is the third enzyme of the poly-beta-hydroxybutyrate biosynthetic pathway.
Probab=36.08 E-value=36 Score=16.18 Aligned_cols=11 Identities=9% Similarity=-0.022 Sum_probs=8.4
Q ss_pred CCHHHHHHHHH
Q psy15066 1 MFMTSFILRPQ 11 (62)
Q Consensus 1 ~s~aqval~W~ 11 (62)
+||+.++++|.
T Consensus 24 lSPaal~lA~~ 34 (46)
T PF12551_consen 24 LSPAALALAYL 34 (46)
T ss_pred cCHHHHHHHHH
Confidence 57888888874
No 59
>PHA02902 putative IMV membrane protein; Provisional
Probab=35.12 E-value=27 Score=17.98 Aligned_cols=26 Identities=15% Similarity=0.376 Sum_probs=18.0
Q ss_pred HHHHHhH---hccCCCCCHHHHHHHHhhh
Q psy15066 26 KRITENI---NVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 26 ~~l~~n~---~~~~~~L~~e~~~~l~~~~ 51 (62)
+++.+.+ ..|+=+|++++++++.++.
T Consensus 38 ~~~~~~l~~d~~F~D~lTpDQirAlHrlv 66 (70)
T PHA02902 38 ERFGDTLEDDPLFKDSLTPDQIKALHRLV 66 (70)
T ss_pred ccccccCCCCchhhccCCHHHHHHHHHHH
Confidence 3454433 4556689999999998764
No 60
>COG4536 CorB Putative Mg2+ and Co2+ transporter CorB [Inorganic ion transport and metabolism]
Probab=35.03 E-value=34 Score=23.52 Aligned_cols=33 Identities=27% Similarity=0.519 Sum_probs=26.4
Q ss_pred CCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhh
Q psy15066 15 GTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 15 ~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~ 50 (62)
|...|=|+.+...+. ++++|+|++||-..++.+
T Consensus 344 gs~iidGs~~iRdlN---r~l~W~Lp~e~a~TinGL 376 (423)
T COG4536 344 GSFIIDGSANVRDLN---RALDWNLPDEDARTINGL 376 (423)
T ss_pred CcEEEeCCCcHHHHH---HhcCCCCCCCcchhHHHH
Confidence 578888998887665 788999999987766654
No 61
>PF04358 DsrC: DsrC like protein; InterPro: IPR007453 DsrC (P45573 from SWISSPROT) has been observed to co-purify with Desulphovibrio vulgaris dissimilatory sulphite reductase []. However, DsrC appears to be only loosely associated to the sulphite reductase, which suggests that it may not be an integral part of the dissimilatory sulphite reductase. Many proteins in this entry are found in organisms such as Escherichia coli and Haemophilus influenzae which do not contain dissimilatory sulphite reductases but can synthesise assimilatory sirohaem sulphite and nitrite reductases. It is speculated that DsrC may be involved in the assembly, folding or stabilisation of sirohaem proteins []. The strictly conserved cysteine in the C terminus suggests that DsrC may have a catalytic function in the metabolism of sulphur compounds []. Also included in this entry is TusE, a partner to TusBCD in a sulphur relay system for 2-thiouridine biosynthesis, a tRNA base modification process. Many proteins in this entry are annotated as the third (gamma) subunit of dissimilatory sulphite reductase ; PDB: 2V4J_F 2A5W_C 1SAU_A 1JI8_A 1YX3_A.
Probab=35.00 E-value=30 Score=19.31 Aligned_cols=32 Identities=13% Similarity=0.041 Sum_probs=20.0
Q ss_pred cCCCCHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 20 PKSNTVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 20 ~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
+.-.+.+-.+.-.+..+++||++.++-|+-+-
T Consensus 20 ~~dW~eevA~~lA~~egI~Ltd~HW~vI~flR 51 (109)
T PF04358_consen 20 PEDWNEEVAEALAKEEGIELTDEHWEVIRFLR 51 (109)
T ss_dssp GGG--HHHHHHHHHCTT-S--HHHHHHHHHHH
T ss_pred hHhCCHHHHHHHHHHcCCCCCHHHHHHHHHHH
Confidence 44456666777777788999999998887654
No 62
>COG4710 Predicted DNA-binding protein with an HTH domain [General function prediction only]
Probab=34.96 E-value=18 Score=19.08 Aligned_cols=25 Identities=20% Similarity=0.332 Sum_probs=19.5
Q ss_pred HHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 29 TENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 29 ~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
+.|.....|.|++|--++|+.+..+
T Consensus 2 ~~~~k~tsiRlp~E~~eRL~~Ls~~ 26 (80)
T COG4710 2 KRNVKTTSIRLPLELKERLDNLSKN 26 (80)
T ss_pred CccccceeeeCCHHHHHHHHHHHHh
Confidence 3456677888999999999988754
No 63
>PRK02290 3-dehydroquinate synthase; Provisional
Probab=34.74 E-value=6.2 Score=26.37 Aligned_cols=52 Identities=13% Similarity=0.326 Sum_probs=28.8
Q ss_pred HHHHHHHHhCC-CeEecCCCCHHHHHHhHhcc-----CCCCCHHHHHHHHhhhcCCccc
Q psy15066 5 SFILRPQVQSG-TIPIPKSNTVKRITENINVF-----DFTLEQSDVDKIDALDKKQRSC 57 (62)
Q Consensus 5 qval~W~l~~~-~~~I~g~~~~~~l~~n~~~~-----~~~L~~e~~~~l~~~~~~~r~~ 57 (62)
++++. +|.+| --++..+.++..+++-.... .++|....+.++..++.++|.|
T Consensus 128 ~~a~~-~LE~G~dGVvl~~~d~~ei~~~~~~~~~~~~~l~L~~a~Vt~V~~vG~GdRVC 185 (344)
T PRK02290 128 KLALE-ILEKGVDGVLLDPDDPNEIKAIVALIEEAREKLELVPATVTRVEPVGMGDRVC 185 (344)
T ss_pred HHHHH-HhccCCCeEEECCCCHHHHHHHHHHHhccCCcceeEEEEEEEEEEcCCccEEE
Confidence 34444 45555 45566666666666654432 3445555555555566666665
No 64
>PF03457 HA: Helicase associated domain; InterPro: IPR005114 This short domain is found in multiple copies in bacterial helicase proteins. The domain is predicted to contain 3 alpha helices. The function of this domain may be to bind nucleic acid.; PDB: 2KTA_A.
Probab=33.64 E-value=35 Score=16.71 Aligned_cols=47 Identities=15% Similarity=0.311 Sum_probs=23.0
Q ss_pred HHHHHHHHhCCCeEecCCC-----CHHHHHHhHhc-c-CCCCCHHHHHHHHhhh
Q psy15066 5 SFILRPQVQSGTIPIPKSN-----TVKRITENINV-F-DFTLEQSDVDKIDALD 51 (62)
Q Consensus 5 qval~W~l~~~~~~I~g~~-----~~~~l~~n~~~-~-~~~L~~e~~~~l~~~~ 51 (62)
+-+.+|.-.+|..-+|... ...+.-.+.+. . .-+|++|.++.|++++
T Consensus 13 ~~l~~y~~~~G~~~vp~~~~~~~~~Lg~Wl~~qR~~~r~g~L~~er~~~L~~lg 66 (68)
T PF03457_consen 13 EALKAYKEEHGHLNVPRDYVTDGFPLGQWLNNQRRKYRKGKLTPERIERLDALG 66 (68)
T ss_dssp HHHHHHHHHHS--S-SS-----SSHHHHHHHHHHHHHHHT---HHHHHHHHHHT
T ss_pred HHHHHHHHHHCCCCCCcccCcCCCcHHHHHHHHHHHHHcCCCCHHHHHHHHcCC
Confidence 3456777777755566543 12233223221 1 2359999999999886
No 65
>COG0313 Predicted methyltransferases [General function prediction only]
Probab=33.38 E-value=36 Score=22.17 Aligned_cols=28 Identities=14% Similarity=0.267 Sum_probs=21.8
Q ss_pred CHHHHHHHHHHhCC--CeEecCCCCHHHHH
Q psy15066 2 FMTSFILRPQVQSG--TIPIPKSNTVKRIT 29 (62)
Q Consensus 2 s~aqval~W~l~~~--~~~I~g~~~~~~l~ 29 (62)
.|+-...+++..+| +.||||++..-..-
T Consensus 93 DPG~~LV~~a~~~gi~V~~lPG~sA~~tAL 122 (275)
T COG0313 93 DPGYELVRAAREAGIRVVPLPGPSALITAL 122 (275)
T ss_pred CccHHHHHHHHHcCCcEEecCCccHHHHHH
Confidence 35667788888887 89999998875444
No 66
>PF12651 RHH_3: Ribbon-helix-helix domain
Probab=33.24 E-value=20 Score=16.60 Aligned_cols=22 Identities=23% Similarity=0.484 Sum_probs=16.6
Q ss_pred ccCCCCCHHHHHHHHhhhcCCc
Q psy15066 34 VFDFTLEQSDVDKIDALDKKQR 55 (62)
Q Consensus 34 ~~~~~L~~e~~~~l~~~~~~~r 55 (62)
.+.+.|+++.++.|+.+....+
T Consensus 4 r~t~~l~~el~~~L~~ls~~t~ 25 (44)
T PF12651_consen 4 RFTFSLDKELYEKLKELSEETG 25 (44)
T ss_pred EEEEecCHHHHHHHHHHHHHHC
Confidence 4567789999998888876544
No 67
>PHA03188 UL14 tegument protein; Provisional
Probab=33.08 E-value=70 Score=19.85 Aligned_cols=41 Identities=10% Similarity=0.214 Sum_probs=28.2
Q ss_pred HHHHHHHHhCCCeEec--CCCCHHHHHHhHhccCCCCCHHHHH
Q psy15066 5 SFILRPQVQSGTIPIP--KSNTVKRITENINVFDFTLEQSDVD 45 (62)
Q Consensus 5 qval~W~l~~~~~~I~--g~~~~~~l~~n~~~~~~~L~~e~~~ 45 (62)
.++..|+|.....+|| .+++--.+++.+-+..+.-+..+|+
T Consensus 147 ALLt~WmL~~aP~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (199)
T PHA03188 147 ALLTQWMLTTHPSPIPHAPPSKNLLLEDCCTCSTINTDKNDMK 189 (199)
T ss_pred HHHHHHHHhcCCCCCCCCCCchhhHHHHHccccccccchhhcc
Confidence 5788999998755565 4455566777776666666666654
No 68
>TIGR03342 dsrC_tusE_dsvC sulfur relay protein, TusE/DsrC/DsvC family. Members of this protein family may be described as TusE, a partner to TusBCD in a sulfur relay system for 2-thiouridine biosynthesis, a tRNA base modification process. Other members are DsrC, a functionally similar protein in species where the sulfur relay system exists primarily for sulfur metabolism rather than tRNA base modification. Some members of this family are known explicitly as the gamma subunit of sulfite reductases.
Probab=32.79 E-value=58 Score=18.19 Aligned_cols=33 Identities=9% Similarity=-0.015 Sum_probs=22.8
Q ss_pred cCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 20 PKSNTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 20 ~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
+.-.+.+-.+.-.+..+++||++.++-|+-+-.
T Consensus 19 ~~dW~e~vA~~lA~~egieLT~~Hw~vI~~lR~ 51 (108)
T TIGR03342 19 LDDWSEDVAEALAEEEGIELTEAHWEVINFLRD 51 (108)
T ss_pred hHHCCHHHHHHHHHHcCCCCCHHHHHHHHHHHH
Confidence 344555656666666788999999987776543
No 69
>PF06819 Arc_PepC: Archaeal Peptidase A24 C-terminal Domain; InterPro: IPR009639 This region is of unknown function found at the C terminus of some archael proteins that have multiple transmembrane domains and are predicted to be aspartic peptidases belonging to the MEROPS peptidase subfamily A24A (type 4 prepilin peptidase 1.
Probab=32.52 E-value=48 Score=18.72 Aligned_cols=18 Identities=22% Similarity=0.481 Sum_probs=15.3
Q ss_pred CCHHHHHHHHhhhcCCcc
Q psy15066 39 LEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 39 L~~e~~~~l~~~~~~~r~ 56 (62)
|++||++.|.++....++
T Consensus 87 Ls~E~IE~Lk~Lv~eGKi 104 (110)
T PF06819_consen 87 LSKEDIEKLKKLVEEGKI 104 (110)
T ss_pred CCHHHHHHHHHHHHcCCC
Confidence 799999999999876554
No 70
>PF09336 Vps4_C: Vps4 C terminal oligomerisation domain; InterPro: IPR015415 This domain is found at the C-terminal of ATPase proteins involved in vacuolar sorting. It forms an alpha helix structure and is required for oligomerisation []. ; PDB: 1XWI_A 3EIH_C 2QPA_C 3EIE_A 2RKO_A 2QP9_X 3MHV_C 3CF3_C 3CF1_A 3CF2_A ....
Probab=32.26 E-value=71 Score=15.82 Aligned_cols=32 Identities=19% Similarity=0.291 Sum_probs=24.6
Q ss_pred CCCHHHHHHhHhccCCCCCHHHHHHHHhhhcC
Q psy15066 22 SNTVKRITENINVFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 22 ~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~ 53 (62)
..+.+++++-++...-+.+++|+++.++..+.
T Consensus 29 ~it~~DF~~Al~~~kpSVs~~dl~~ye~w~~~ 60 (62)
T PF09336_consen 29 PITMEDFEEALKKVKPSVSQEDLKKYEEWTKE 60 (62)
T ss_dssp HBCHHHHHHHHHTCGGSS-HHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHcCCCCCHHHHHHHHHHHHH
Confidence 45678888888888888999999988876544
No 71
>PF10802 DUF2540: Protein of unknown function (DUF2540); InterPro: IPR024254 This family of proteins with unknown function appears to be restricted to Methanococcus.; PDB: 2EFV_A.
Probab=32.24 E-value=66 Score=16.96 Aligned_cols=45 Identities=18% Similarity=0.158 Sum_probs=27.8
Q ss_pred HHHHHHHHHhC--CCeEecCCCCHHHHHHhHhcc-----CCCCCHHHHHHHHhhhc
Q psy15066 4 TSFILRPQVQS--GTIPIPKSNTVKRITENINVF-----DFTLEQSDVDKIDALDK 52 (62)
Q Consensus 4 aqval~W~l~~--~~~~I~g~~~~~~l~~n~~~~-----~~~L~~e~~~~l~~~~~ 52 (62)
..-.++|.+++ +.-.| .++-+.+-+.+- .+.|+++|-+-+++.+.
T Consensus 9 d~r~lRY~LHklen~~~I----dpeiL~ka~e~~K~~k~titLs~~E~~IieKyGK 60 (75)
T PF10802_consen 9 DARDLRYYLHKLENLEEI----DPEILKKAIECKKKYKRTITLSDEEEKIIEKYGK 60 (75)
T ss_dssp -HHHHHHHHHHGGG---------HHHHHHHHHS---EEEEEE--HHHHHHHHHHGG
T ss_pred CHHHHHHHHHHHccCCCC----CHHHHHHHHhcccccceeeEecHHHHHHHHHHhH
Confidence 34578999997 35444 488888877643 35799999999988874
No 72
>PF01466 Skp1: Skp1 family, dimerisation domain; InterPro: IPR016072 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a dimerisation domain found at the C-terminal of SKP1 proteins [], as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. This domain is multi-helical in structure, and consists of an interlocked herterodimer in F-box proteins.; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 2P1O_A 3OGL_G 3OGM_A 3C6O_A 2P1N_A 2P1Q_A 3OGK_I 3C6N_A 3C6P_A 2P1P_A ....
Probab=31.27 E-value=80 Score=16.08 Aligned_cols=31 Identities=13% Similarity=0.229 Sum_probs=21.1
Q ss_pred ecCCCCHHHHHHhHhccCCCCCHHHHHHHHhh
Q psy15066 19 IPKSNTVKRITENINVFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 19 I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~ 50 (62)
..+..++++|++-+...+ .+++++.+++.+-
T Consensus 42 ~i~gks~eeir~~fgi~~-d~t~eee~~i~~e 72 (78)
T PF01466_consen 42 MIKGKSPEEIRKYFGIEN-DLTPEEEEEIRKE 72 (78)
T ss_dssp HHTTS-HHHHHHHHT----TSSHHHHHHHHHH
T ss_pred HhcCCCHHHHHHHcCCCC-CCCHHHHHHHHHH
Confidence 445688999999887654 5888888887654
No 73
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=31.14 E-value=42 Score=17.45 Aligned_cols=20 Identities=25% Similarity=0.373 Sum_probs=16.3
Q ss_pred CCCeEecCCCCHHHHHHhHh
Q psy15066 14 SGTIPIPKSNTVKRITENIN 33 (62)
Q Consensus 14 ~~~~~I~g~~~~~~l~~n~~ 33 (62)
.|-++|.|+++++++++-++
T Consensus 57 sGki~itGaks~~~~~~a~~ 76 (86)
T PF00352_consen 57 SGKIVITGAKSEEEAKKAIE 76 (86)
T ss_dssp TSEEEEEEESSHHHHHHHHH
T ss_pred CCEEEEEecCCHHHHHHHHH
Confidence 46788999999999987654
No 74
>PRK12821 aspartyl/glutamyl-tRNA amidotransferase subunit C-like protein; Provisional
Probab=30.93 E-value=83 Score=22.15 Aligned_cols=26 Identities=12% Similarity=0.335 Sum_probs=19.5
Q ss_pred CCHHHHHHhHhccCCCCCHHHHHHHH
Q psy15066 23 NTVKRITENINVFDFTLEQSDVDKID 48 (62)
Q Consensus 23 ~~~~~l~~n~~~~~~~L~~e~~~~l~ 48 (62)
-+.+.++.-.+...+++++||.+++.
T Consensus 389 ItkEeVkKLAkLARLeLSEEElEkl~ 414 (477)
T PRK12821 389 LNKDELKKLARLVMFDLDDAELEKLQ 414 (477)
T ss_pred CCHHHHHHHHHHhCCCCCHHHHHHHH
Confidence 45677777777778899999887654
No 75
>PRK11508 sulfur transfer protein TusE; Provisional
Probab=30.87 E-value=64 Score=18.06 Aligned_cols=33 Identities=3% Similarity=-0.093 Sum_probs=22.2
Q ss_pred cCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhc
Q psy15066 20 PKSNTVKRITENINVFDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 20 ~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~ 52 (62)
+.-.+.+-...-.+..+++||++.++-|+-+-.
T Consensus 20 ~~dW~e~vA~~lA~~egieLT~~HW~VI~~lR~ 52 (109)
T PRK11508 20 SSQWSEPLAVVIAENEGISLSPEHWEVVRFVRD 52 (109)
T ss_pred hHHCCHHHHHHHHHHhCCCCCHHHHHHHHHHHH
Confidence 334455555555666678999999988776543
No 76
>cd07922 CarBa CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. 2-aminophenol 1,6-dioxygenase is a key enzyme in the carbazole degradation pathway isolated from bacterial strains with carbazole degradation ability. The enzyme is a heterotetramer composed of two A and two B subunits. CarB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Although the enzyme was originally isolated as a meta-cleavage enzyme for 2'-aminobiphenyl-2,3-diol involved in carbazole degradation, the enzyme has also shown high specificity for 2,3-dihydroxybiphenyl.
Probab=30.81 E-value=62 Score=17.17 Aligned_cols=25 Identities=8% Similarity=0.267 Sum_probs=17.0
Q ss_pred HHHHHhHhcc--CCCCCHHHHHHHHhh
Q psy15066 26 KRITENINVF--DFTLEQSDVDKIDAL 50 (62)
Q Consensus 26 ~~l~~n~~~~--~~~L~~e~~~~l~~~ 50 (62)
++.+++-+++ .+.||+||.+.|.+-
T Consensus 21 erF~~DPea~~~~~gLt~eE~~aL~~~ 47 (81)
T cd07922 21 ERFQDDPSAVFEEYGLTPAERAALREG 47 (81)
T ss_pred HHHHHCHHHHHHHcCCCHHHHHHHHcc
Confidence 4455554443 567999999988764
No 77
>PRK05718 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=30.37 E-value=1e+02 Score=19.01 Aligned_cols=28 Identities=14% Similarity=0.170 Sum_probs=22.6
Q ss_pred HHHHHHHhCCCeEecCCCCHHHHHHhHh
Q psy15066 6 FILRPQVQSGTIPIPKSNTVKRITENIN 33 (62)
Q Consensus 6 val~W~l~~~~~~I~g~~~~~~l~~n~~ 33 (62)
-.++++.+.++..+||..++..+.+-.+
T Consensus 99 ~vi~~a~~~~i~~iPG~~TptEi~~a~~ 126 (212)
T PRK05718 99 PLLKAAQEGPIPLIPGVSTPSELMLGME 126 (212)
T ss_pred HHHHHHHHcCCCEeCCCCCHHHHHHHHH
Confidence 3567788889999999999988876553
No 78
>PF10260 SAYSvFN: Uncharacterized conserved domain (SAYSvFN); InterPro: IPR019387 This domain of approximately 75 residues contains a highly conserved SATSv/iFN motif. The function is unknown but the domain is conserved from plants to humans.
Probab=29.79 E-value=52 Score=17.08 Aligned_cols=17 Identities=24% Similarity=0.374 Sum_probs=10.6
Q ss_pred CCeEecCCCCHHHHHHh
Q psy15066 15 GTIPIPKSNTVKRITEN 31 (62)
Q Consensus 15 ~~~~I~g~~~~~~l~~n 31 (62)
|.-.|+|+-++||++.-
T Consensus 53 ~~~~i~Gtl~aE~~ere 69 (71)
T PF10260_consen 53 GCERIPGTLTAEQFERE 69 (71)
T ss_pred CccccCCCCCHHHHHHh
Confidence 45566777777766643
No 79
>PHA01748 hypothetical protein
Probab=29.78 E-value=25 Score=17.36 Aligned_cols=20 Identities=25% Similarity=0.438 Sum_probs=14.7
Q ss_pred ccCCCCCHHHHHHHHhhhcC
Q psy15066 34 VFDFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 34 ~~~~~L~~e~~~~l~~~~~~ 53 (62)
...+.|+++.+++|+.+...
T Consensus 4 ~iSvrLp~el~~eld~~a~~ 23 (60)
T PHA01748 4 VITFKIEEDLLELLDRYAIK 23 (60)
T ss_pred EEEEECCHHHHHHHHHHHHH
Confidence 34677888888888887543
No 80
>PF11762 Arabinose_Iso_C: L-arabinose isomerase C-terminal domain; InterPro: IPR024664 The Escherichia coli araBAD operon consists of three genes encoding three enzymes that convert L-arabinose to D-xylulose-5 phosphate. L-arabinose isomerase (AraA) 5.3.1.4 from EC catalyses the conversion of L-arabinose to L-ribulose as the first step in the pathway of L-arabinose utilization as a carbon source []. This entry represents a C-terminal non-catalytic domain in L-arabinose isomerase.; PDB: 4F2D_A 2AJT_C 2HXG_C.
Probab=29.75 E-value=1e+02 Score=17.52 Aligned_cols=35 Identities=6% Similarity=-0.001 Sum_probs=23.1
Q ss_pred HHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCC
Q psy15066 4 TSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFT 38 (62)
Q Consensus 4 aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~ 38 (62)
..-+-+|+.+-| ..++--+-+.+++++-.+-.+++
T Consensus 75 ~~~a~aWi~aGGaHHtv~s~~~t~e~~~d~A~~~giE 111 (115)
T PF11762_consen 75 KTGAEAWILAGGAHHTVFSTAVTAEQLEDFAEMAGIE 111 (115)
T ss_dssp HHHHHHHHHTT--SEEEEESS--HHHHHHHHHHTT-E
T ss_pred HHHHHHHHHcCCCccEEEEccCCHHHHHHHHHHhCCE
Confidence 344678999987 67788888888888877665543
No 81
>COG1465 Predicted alternative 3-dehydroquinate synthase [Amino acid transport and metabolism]
Probab=29.65 E-value=9.4 Score=25.45 Aligned_cols=52 Identities=21% Similarity=0.345 Sum_probs=30.0
Q ss_pred HHHHHHHHhCC-CeEecCCCCHHHHHHhHhc------cCCCCCHHHHHHHHhhhcCCccc
Q psy15066 5 SFILRPQVQSG-TIPIPKSNTVKRITENINV------FDFTLEQSDVDKIDALDKKQRSC 57 (62)
Q Consensus 5 qval~W~l~~~-~~~I~g~~~~~~l~~n~~~------~~~~L~~e~~~~l~~~~~~~r~~ 57 (62)
.+|+. .|.+| -.++..+.+++.+++..+. ..++|....+.++.-++.+.|.|
T Consensus 159 ~vA~e-TLE~GaDgVll~~~d~~eIk~~~~~~~e~~~E~~eL~~a~vt~ieplG~gDRVC 217 (376)
T COG1465 159 RVALE-TLEKGADGVLLDSDDPEEIKKTAEVVEEAESERLELVTATVTEIEPLGSGDRVC 217 (376)
T ss_pred HHHHH-HHhccCceEEeCCCCHHHHHHHHHHHHHhccceeEEEEEEEEEEeecCCCceEE
Confidence 34444 45555 4566666677777665432 33445555555666666777766
No 82
>PRK13619 psbV cytochrome c-550; Provisional
Probab=29.47 E-value=45 Score=20.04 Aligned_cols=26 Identities=12% Similarity=0.273 Sum_probs=17.9
Q ss_pred HHHHHHhHhccCC-----CCCHHHHHHHHhh
Q psy15066 25 VKRITENINVFDF-----TLEQSDVDKIDAL 50 (62)
Q Consensus 25 ~~~l~~n~~~~~~-----~L~~e~~~~l~~~ 50 (62)
...+..++..+++ .|++||+..+...
T Consensus 113 ~a~~hpsi~~~di~P~mr~LtdedL~~iAg~ 143 (160)
T PRK13619 113 YSELHPNVSRPDIFPELRNFTEDDLYDVAGY 143 (160)
T ss_pred hhhhcccccccccccccCCCCHHHHHHHHHH
Confidence 3444455666665 6999999988764
No 83
>PF12824 MRP-L20: Mitochondrial ribosomal protein subunit L20; InterPro: IPR024388 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents the essential mitochondrial ribosomal protein L20 family from fungi [].
Probab=29.14 E-value=52 Score=19.66 Aligned_cols=19 Identities=21% Similarity=0.485 Sum_probs=15.5
Q ss_pred CCCCCHHHHHHHHhhhcCC
Q psy15066 36 DFTLEQSDVDKIDALDKKQ 54 (62)
Q Consensus 36 ~~~L~~e~~~~l~~~~~~~ 54 (62)
+..|++||+++|..+-..+
T Consensus 83 ~y~Lt~e~i~Eir~LR~~D 101 (164)
T PF12824_consen 83 KYHLTPEDIQEIRRLRAED 101 (164)
T ss_pred cccCCHHHHHHHHHHHHcC
Confidence 4679999999999886553
No 84
>KOG0237|consensus
Probab=28.76 E-value=56 Score=24.03 Aligned_cols=51 Identities=10% Similarity=0.173 Sum_probs=36.1
Q ss_pred HHHHHHHHhCC-----CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCcc
Q psy15066 5 SFILRPQVQSG-----TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 5 qval~W~l~~~-----~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r~ 56 (62)
+=+|+|-|.+. +.+.||..-... .+--+..++.++..|.+++.++..++.+
T Consensus 14 EHal~wkL~qSp~v~~v~vaPGn~G~a~-~~~~~~~~~dI~~~d~~ala~f~~e~~I 69 (788)
T KOG0237|consen 14 EHALAWKLKQSPKVKKVYVAPGNGGTAS-GDASKVPNLDISVADFEALASFCKEHNI 69 (788)
T ss_pred HhHHHHHhhcCCccceEEEccCCCCccc-CccccCcccccChhhHHHHHHHHHHcce
Confidence 34789998873 777887655444 2323445677889999999998877665
No 85
>PF09202 Rio2_N: Rio2, N-terminal; InterPro: IPR015285 This N-terminal domain is found in RIO2 kinases, and is structurally homologous to the winged helix (wHTH) domain. It adopts a structure consisting of four alpha helices followed by two beta strands and a fifth alpha helix. The domain confers DNA binding properties to the protein, as per other winged helix domains []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1TQP_A 1ZAR_A 1TQI_A 1ZAO_A 1TQM_A.
Probab=28.36 E-value=56 Score=17.20 Aligned_cols=22 Identities=14% Similarity=0.200 Sum_probs=14.2
Q ss_pred CCCHHHHHHHHhhhcCCccccC
Q psy15066 38 TLEQSDVDKIDALDKKQRSCVM 59 (62)
Q Consensus 38 ~L~~e~~~~l~~~~~~~r~~~~ 59 (62)
.|+++|+.-|..+..+.|-..|
T Consensus 3 ~L~~~d~rvL~aiE~gmk~hE~ 24 (82)
T PF09202_consen 3 YLSKEDFRVLRAIEMGMKNHEW 24 (82)
T ss_dssp T--HHHHHHHHHHHTTTTT-SS
T ss_pred cCCHHHHHHHHHHHHcccCCcc
Confidence 4788899888888877664444
No 86
>PRK14700 recombination factor protein RarA; Provisional
Probab=27.78 E-value=66 Score=21.19 Aligned_cols=35 Identities=17% Similarity=0.297 Sum_probs=19.9
Q ss_pred hCCCeEecCCCCH-------HHHHHhHhccCC-CCCHHHHHHH
Q psy15066 13 QSGTIPIPKSNTV-------KRITENINVFDF-TLEQSDVDKI 47 (62)
Q Consensus 13 ~~~~~~I~g~~~~-------~~l~~n~~~~~~-~L~~e~~~~l 47 (62)
..|.+.+.|+++. .-+.+-.+++.+ +|+.+|+..+
T Consensus 5 E~G~i~LIGATTENP~f~vn~ALlSR~~v~~l~~L~~~di~~i 47 (300)
T PRK14700 5 ESGKIILIGATTENPTYYLNDALVSRLFILRLKRLSLVATQKL 47 (300)
T ss_pred cCCcEEEEeecCCCccceecHhhhhhhheeeecCCCHHHHHHH
Confidence 3466666666654 233334455555 5777777655
No 87
>COG2250 Uncharacterized conserved protein related to C-terminal domain of eukaryotic chaperone, SACSIN [Function unknown]
Probab=27.48 E-value=1.3e+02 Score=17.11 Aligned_cols=47 Identities=9% Similarity=0.162 Sum_probs=34.5
Q ss_pred HHHHHHHHHhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 4 TSFILRPQVQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 4 aqval~W~l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
++++|..++.....-.|++-+..++-.++... ++..++.++.+..+.
T Consensus 40 vel~lKalL~~~~~~~p~tH~l~~Ll~~l~~~-~~~~e~~~~~~~~Le 86 (132)
T COG2250 40 VELALKALLIRLGGEPPKTHSLRELLRELSRE-LEVPEEILECARELE 86 (132)
T ss_pred HHHHHHHHHHHhcCCCCCcCCHHHHHHHHHHh-ccCcHHHHHHHHHHH
Confidence 46777777776333389999999999888654 778888887775544
No 88
>TIGR00096 probable S-adenosylmethionine-dependent methyltransferase, YraL family. No member of this family is characterized, but Pfam model pfam00590 (tetrapyrrole methylase) demonstrates homology between this family and its other members, which include several methylases for the tetrapyrrole class of compound, as well as the enzyme diphthine synthase.
Probab=26.75 E-value=27 Score=22.48 Aligned_cols=26 Identities=8% Similarity=0.258 Sum_probs=18.8
Q ss_pred CHHHHHHHHHHhCC--CeEecCCCCHHH
Q psy15066 2 FMTSFILRPQVQSG--TIPIPKSNTVKR 27 (62)
Q Consensus 2 s~aqval~W~l~~~--~~~I~g~~~~~~ 27 (62)
.|+...++.+..+| +.++||++-.-.
T Consensus 88 DPG~~LV~~~~~~~i~v~~ipG~sA~~~ 115 (276)
T TIGR00096 88 DPGHLLVACREKANIIVVPLPGAAALTA 115 (276)
T ss_pred CccHHHHHHHHHCCCeEEcCChHHHHHH
Confidence 46777788888887 777888766543
No 89
>PF10937 DUF2638: Protein of unknown function (DUF2638); InterPro: IPR020373 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents a component of the mitochondrial small ribosomal subunit. Mature mitochondrial ribosomes consist of a small (37S) and a large (54S) subunit. The 37S subunit contains at least 33 different proteins and 1 molecule of RNA (15S). The 54S subunit contains at least 45 different proteins and 1 molecule of RNA (21S). This entry is represented by a mitochondrial ribosomal protein of the small subunit, which has similarity to human mitochondrial ribosomal protein MRP-S36 [, , ].
Probab=26.62 E-value=55 Score=18.36 Aligned_cols=13 Identities=8% Similarity=0.440 Sum_probs=10.4
Q ss_pred CCCCHHHHHHHHh
Q psy15066 37 FTLEQSDVDKIDA 49 (62)
Q Consensus 37 ~~L~~e~~~~l~~ 49 (62)
..|+++||+.|++
T Consensus 97 ~p~se~EiE~Ins 109 (112)
T PF10937_consen 97 KPISEEEIEAINS 109 (112)
T ss_pred CCCCHHHHHHHHc
Confidence 3689999998874
No 90
>PF01113 DapB_N: Dihydrodipicolinate reductase, N-terminus; InterPro: IPR000846 Dihydrodipicolinate reductase catalyzes the second step in the biosynthesis of diaminopimelic acid and lysine, the NAD or NADP-dependent reduction of 2,3-dihydrodipicolinate into 2,3,4,5-tetrahydrodipicolinate [, , ]. In Escherichia coli and Mycobacterium tuberculosis, dihydrodipicolinate reductase has equal specificity for NADH and NADPH, however in Thermotoga maritima there it has a greater affinity for NADPH []. In addition, the enzyme is inhibited by high concentrations of its substrate, which consequently acts as a feedback control on the lysine biosynthesis pathway. In T. maritima, the enzyme also lacks N-terminal and C-terminal loops which are present in enzyme of the former two organisms. This entry represents the N-terminal domain of dihydrodipicolinate reductase which binds the dinucleotide NAD(P)H.; GO: 0008839 dihydrodipicolinate reductase activity, 0009089 lysine biosynthetic process via diaminopimelate, 0055114 oxidation-reduction process; PDB: 3QY9_D 1VM6_C 1ARZ_A 1DIH_A 1DRW_A 1DRV_A 1DRU_A 2DAP_A 1DAP_B 3DAP_A ....
Probab=26.60 E-value=61 Score=17.87 Aligned_cols=15 Identities=27% Similarity=0.641 Sum_probs=11.5
Q ss_pred CCHHHHHHHHhhhcC
Q psy15066 39 LEQSDVDKIDALDKK 53 (62)
Q Consensus 39 L~~e~~~~l~~~~~~ 53 (62)
+++++++.|+++...
T Consensus 101 ~~~~~~~~l~~~a~~ 115 (124)
T PF01113_consen 101 FSDEQIDELEELAKK 115 (124)
T ss_dssp SHHHHHHHHHHHTTT
T ss_pred CCHHHHHHHHHHhcc
Confidence 578888888887665
No 91
>COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair]
Probab=26.20 E-value=1.1e+02 Score=21.37 Aligned_cols=38 Identities=26% Similarity=0.434 Sum_probs=27.1
Q ss_pred HHhCCCeEecCCCCH-------HHHHHhHhccCC-CCCHHHHHHHH
Q psy15066 11 QVQSGTIPIPKSNTV-------KRITENINVFDF-TLEQSDVDKID 48 (62)
Q Consensus 11 ~l~~~~~~I~g~~~~-------~~l~~n~~~~~~-~L~~e~~~~l~ 48 (62)
.+.+|.++++|+++. .-+.+-.+.+.+ +|+.+|+..+-
T Consensus 127 ~vE~G~iilIGATTENPsF~ln~ALlSR~~vf~lk~L~~~di~~~l 172 (436)
T COG2256 127 HVENGTIILIGATTENPSFELNPALLSRARVFELKPLSSEDIKKLL 172 (436)
T ss_pred hhcCCeEEEEeccCCCCCeeecHHHhhhhheeeeecCCHHHHHHHH
Confidence 456788888888875 345555567776 69999987664
No 92
>KOG2749|consensus
Probab=26.15 E-value=52 Score=22.61 Aligned_cols=39 Identities=28% Similarity=0.450 Sum_probs=27.9
Q ss_pred CHHHHHHHHHHhC-------------CCeEecCCCCHHHHHHhHhcc-CCCCC
Q psy15066 2 FMTSFILRPQVQS-------------GTIPIPKSNTVKRITENINVF-DFTLE 40 (62)
Q Consensus 2 s~aqval~W~l~~-------------~~~~I~g~~~~~~l~~n~~~~-~~~L~ 40 (62)
|..-++++|.+.+ |+..|||+-..-+++.-+++. .|.|+
T Consensus 118 Tl~r~L~nyavk~gr~Plfv~LDvgQ~sitiPGsiaA~~i~~~~D~~eGf~l~ 170 (415)
T KOG2749|consen 118 TLCRILLNYAVKQGRRPLFVELDVGQGSITIPGSIAAIPIEMPLDVIEGFSLT 170 (415)
T ss_pred HHHHHHHHHHHHcCCcceEEEcCCCCCceecccchhheecccccchhhCcccC
Confidence 4667888898764 368899999998888776643 34443
No 93
>cd07921 PCA_45_Doxase_A_like Subunit A of the Class III Extradiol dioxygenase, Protocatechuate 4,5-dioxygenase, and similar enzymes. This subfamily includes the A subunit of protocatechuate (PCA) 4,5-dioxygenase (LigAB) and two subfamilies of unknown function. The A subunit is the smaller, non-catalytic subunit of LigAB. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds. PCA 4,5-dioxygenase is one of the aromatic ring opening dioxygenases which play key roles in the degradation of aromatic compounds. As members of the Class III extradiol dioxygenase family, the enzymes use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. LigAB-like class III enzymes are usually composed of two subunits, designated A and B, which form a tetramer composed of two copies of each subunit.
Probab=25.98 E-value=55 Score=18.33 Aligned_cols=30 Identities=10% Similarity=0.219 Sum_probs=22.1
Q ss_pred CCCCHHHHHHhHhcc--CCCCCHHHHHHHHhh
Q psy15066 21 KSNTVKRITENINVF--DFTLEQSDVDKIDAL 50 (62)
Q Consensus 21 g~~~~~~l~~n~~~~--~~~L~~e~~~~l~~~ 50 (62)
.+.+.++.+++-++. .+.||+|+.+.|.+-
T Consensus 25 ~a~~Re~F~aD~eAy~~~~gLTeEe~~AV~~r 56 (106)
T cd07921 25 KAENREAFKADEEAYCDKFGLTEEQKQAVLDR 56 (106)
T ss_pred CHHHHHHHHhCHHHHHHHcCCCHHHHHHHHhC
Confidence 556667777776553 688999999988764
No 94
>PF10078 DUF2316: Uncharacterized protein conserved in bacteria (DUF2316); InterPro: IPR018757 Members of this family of hypothetical bacterial proteins have no known function.
Probab=25.92 E-value=96 Score=16.79 Aligned_cols=29 Identities=14% Similarity=0.296 Sum_probs=17.2
Q ss_pred CHHHHHHhHhcc---------CCCCCHHHHHHHHhhhc
Q psy15066 24 TVKRITENINVF---------DFTLEQSDVDKIDALDK 52 (62)
Q Consensus 24 ~~~~l~~n~~~~---------~~~L~~e~~~~l~~~~~ 52 (62)
+.+.+++|++.. ++.+|.++++.+-.+..
T Consensus 11 T~~ELq~nf~~~~ls~~~ia~dL~~s~~~le~vL~l~~ 48 (89)
T PF10078_consen 11 TRQELQANFELSGLSLEQIAADLGTSPEHLEQVLNLKQ 48 (89)
T ss_pred HHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHcCCC
Confidence 345566776533 44566667777666653
No 95
>PF07801 DUF1647: Protein of unknown function (DUF1647); InterPro: IPR012444 This entry consists of hypothetical proteins of unknown function.
Probab=25.22 E-value=1.1e+02 Score=17.86 Aligned_cols=35 Identities=11% Similarity=0.360 Sum_probs=25.4
Q ss_pred CeEecCCCCHHHHHHhHh---------------ccCCCCCHHHHHHHHhh
Q psy15066 16 TIPIPKSNTVKRITENIN---------------VFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 16 ~~~I~g~~~~~~l~~n~~---------------~~~~~L~~e~~~~l~~~ 50 (62)
-++++.+.+..|+.+-.+ ..++.|++++++.|.+.
T Consensus 61 ~vvfVSa~S~~h~~~~~~~i~si~~~~P~~k~ilY~LgL~~~~i~~L~~~ 110 (142)
T PF07801_consen 61 DVVFVSATSDNHFNESMKSISSIRKFYPNHKIILYDLGLSEEQIKKLKKN 110 (142)
T ss_pred ccEEEEEecchHHHHHHHHHHHHHHHCCCCcEEEEeCCCCHHHHHHHHhc
Confidence 556777777777766432 34678999999999874
No 96
>smart00657 RPOL4c DNA-directed RNA-polymerase II subunit.
Probab=24.12 E-value=1.4e+02 Score=16.51 Aligned_cols=28 Identities=7% Similarity=0.191 Sum_probs=12.8
Q ss_pred CCCHHHHHHhHhccCCCCCHHHHHHHHh
Q psy15066 22 SNTVKRITENINVFDFTLEQSDVDKIDA 49 (62)
Q Consensus 22 ~~~~~~l~~n~~~~~~~L~~e~~~~l~~ 49 (62)
+++.+.+..-+...+=.+++|+++.|-.
T Consensus 82 P~s~~E~~~lI~sl~~r~~ee~l~~iL~ 109 (118)
T smart00657 82 PETAEEAQLLIPSLEERIDEEELEELLD 109 (118)
T ss_pred CCCHHHHHHHhhhhhccCCHHHHHHHHH
Confidence 3444444444444433355555554443
No 97
>COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis]
Probab=24.06 E-value=69 Score=18.40 Aligned_cols=15 Identities=7% Similarity=0.348 Sum_probs=12.7
Q ss_pred CCCHHHHHHHHhhhc
Q psy15066 38 TLEQSDVDKIDALDK 52 (62)
Q Consensus 38 ~L~~e~~~~l~~~~~ 52 (62)
.|++||++.|.....
T Consensus 47 eLteeei~~ir~~i~ 61 (121)
T COG0099 47 ELTEEEIERLRDAIQ 61 (121)
T ss_pred cCCHHHHHHHHHHHH
Confidence 599999999987765
No 98
>PF07128 DUF1380: Protein of unknown function (DUF1380); InterPro: IPR009811 This family consists of several hypothetical bacterial proteins of around 140 residues in length. Members of this family seem to be specific to Enterobacteria. The function of this family is unknown.
Probab=24.02 E-value=1.5e+02 Score=17.48 Aligned_cols=26 Identities=8% Similarity=0.159 Sum_probs=19.0
Q ss_pred CCHHHHHHhHhccCCCCCHHHHHHHH
Q psy15066 23 NTVKRITENINVFDFTLEQSDVDKID 48 (62)
Q Consensus 23 ~~~~~l~~n~~~~~~~L~~e~~~~l~ 48 (62)
.+++-+..-.+.-++.+|++|...+-
T Consensus 27 WT~eDV~~~a~gme~~lTd~E~~aVL 52 (139)
T PF07128_consen 27 WTREDVRALADGMEYNLTDDEARAVL 52 (139)
T ss_pred ecHHHHHHHHhcCCCCCCHHHHHHHH
Confidence 45677776666668899999976553
No 99
>PRK12302 bssR biofilm formation regulatory protein BssR; Reviewed
Probab=23.82 E-value=1.2e+02 Score=17.22 Aligned_cols=28 Identities=11% Similarity=0.265 Sum_probs=20.6
Q ss_pred CCHHHHHHhHh-----------ccCCCCCHHHHHHHHhh
Q psy15066 23 NTVKRITENIN-----------VFDFTLEQSDVDKIDAL 50 (62)
Q Consensus 23 ~~~~~l~~n~~-----------~~~~~L~~e~~~~l~~~ 50 (62)
+..+|+++|+= ..+..+++|++..+...
T Consensus 37 SEsdhLRdN~FeL~re~~~~s~~l~~h~d~eE~~aL~~A 75 (127)
T PRK12302 37 SESDHLRDNFFELNRELHDKSLRLNLHLDQEEWSALRHA 75 (127)
T ss_pred chhhHHHHHHHHHHHHHHHhhHHHhccCCHHHHHHHHHH
Confidence 56789999872 23557899999888654
No 100
>PF01402 RHH_1: Ribbon-helix-helix protein, copG family; InterPro: IPR002145 CopG, also known as RepA, is responsible for the regulation of plasmid copy number. It binds to the repAB promoter and controls synthesis of the plasmid replication initiator protein RepB. Many bacterial transcription regulation proteins bind DNA through a 'helix-turn-helix' motif, nevertheless CopG displays a fully defined HTH-motif structure that is involved not in DNA-binding, but in the maintenance of the intrinsic dimeric functional structure and cooperativity [, ].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2BJ3_B 2BJ8_A 2BJ1_A 2BJ9_A 2BJ7_B 1EA4_L 2CPG_C 1B01_B 2BA3_A 2K9I_B ....
Probab=23.68 E-value=23 Score=15.37 Aligned_cols=18 Identities=22% Similarity=0.396 Sum_probs=13.7
Q ss_pred CCCCCHHHHHHHHhhhcC
Q psy15066 36 DFTLEQSDVDKIDALDKK 53 (62)
Q Consensus 36 ~~~L~~e~~~~l~~~~~~ 53 (62)
.+.|+++..+.|+.+...
T Consensus 3 ti~l~~~~~~~l~~~a~~ 20 (39)
T PF01402_consen 3 TIRLPDELYERLDELAKE 20 (39)
T ss_dssp EEEEEHHHHHHHHHHHHH
T ss_pred EEEeCHHHHHHHHHHHHH
Confidence 456889999999887643
No 101
>PRK01002 nickel responsive regulator; Provisional
Probab=23.30 E-value=31 Score=19.93 Aligned_cols=23 Identities=9% Similarity=0.212 Sum_probs=17.3
Q ss_pred HhccCCCCCHHHHHHHHhhhcCC
Q psy15066 32 INVFDFTLEQSDVDKIDALDKKQ 54 (62)
Q Consensus 32 ~~~~~~~L~~e~~~~l~~~~~~~ 54 (62)
+..+.++|+++.+++||++....
T Consensus 4 ~~risislp~~ll~~lD~~~~~~ 26 (141)
T PRK01002 4 MMRISISLPDKLLGEFDEIIEER 26 (141)
T ss_pred cEEEEEEeCHHHHHHHHHHHHHc
Confidence 34567789999999999886543
No 102
>PRK10444 UMP phosphatase; Provisional
Probab=23.13 E-value=2e+02 Score=17.90 Aligned_cols=39 Identities=8% Similarity=0.112 Sum_probs=25.3
Q ss_pred HHHHHHHhCC--CeEecCCCC--HHHHHHhHhccCCCCCHHHH
Q psy15066 6 FILRPQVQSG--TIPIPKSNT--VKRITENINVFDFTLEQSDV 44 (62)
Q Consensus 6 val~W~l~~~--~~~I~g~~~--~~~l~~n~~~~~~~L~~e~~ 44 (62)
-+++.+.++| ...+.+.+. .+++.+-++...+.++++++
T Consensus 24 ~~l~~L~~~g~~~~~~Tn~~~~~~~~~~~~l~~~G~~~~~~~i 66 (248)
T PRK10444 24 EFLHRILDKGLPLVLLTNYPSQTGQDLANRFATAGVDVPDSVF 66 (248)
T ss_pred HHHHHHHHCCCeEEEEeCCCCCCHHHHHHHHHHcCCCCCHhhE
Confidence 3567777776 445554443 56677777777787777764
No 103
>PF02844 GARS_N: Phosphoribosylglycinamide synthetase, N domain; InterPro: IPR020562 Phosphoribosylglycinamide synthetase (6.3.4.13 from EC) (GARS) (phosphoribosylamine glycine ligase) [] catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by phosphoribosylglycinamide synthetase is the ATP-dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide: ATP + 5-phosphoribosylamine + glycine = ADP + Pi + 5'-phosphoribosylglycinamide In bacteria, GARS is a monofunctional enzyme (encoded by the purD gene). In yeast, GARS is part of a bifunctional enzyme (encoded by the ADE5/7 gene) in conjunction with phosphoribosylformylglycinamidine cyclo-ligase (AIRS) (IPR000728 from INTERPRO). In higher eukaryotes, GARS is part of a trifunctional enzyme in conjunction with AIRS (IPR000728 from INTERPRO) and with phosphoribosylglycinamide formyltransferase (GART) (), forming GARS-AIRS-GART. This entry represents the N-domain, which is related to the N-terminal domain of biotin carboxylase/carbamoyl phosphate synthetase (IPR005481 from INTERPRO).; GO: 0004637 phosphoribosylamine-glycine ligase activity, 0009113 purine base biosynthetic process; PDB: 3MJF_A 2XD4_A 2XCL_A 2IP4_A 2YW2_B 2YYA_A 3LP8_A 1VKZ_A 2YS6_A 2YRX_A ....
Probab=22.87 E-value=18 Score=19.91 Aligned_cols=46 Identities=9% Similarity=0.242 Sum_probs=24.1
Q ss_pred HHHHHHHHhCC-----CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhhcCCc
Q psy15066 5 SFILRPQVQSG-----TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALDKKQR 55 (62)
Q Consensus 5 qval~W~l~~~-----~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~r 55 (62)
+=||+|-+.+. +.+.||---..++.+| +.++..+.+.|-++....+
T Consensus 12 EHAia~~l~~s~~v~~v~~aPGN~G~~~~~~~-----~~~~~~d~~~l~~~a~~~~ 62 (100)
T PF02844_consen 12 EHAIAWKLSQSPSVEEVYVAPGNPGTAELGKN-----VPIDITDPEELADFAKENK 62 (100)
T ss_dssp HHHHHHHHTTCTTEEEEEEEE--TTGGGTSEE-----E-S-TT-HHHHHHHHHHTT
T ss_pred HHHHHHHHhcCCCCCEEEEeCCCHHHHhhcee-----cCCCCCCHHHHHHHHHHcC
Confidence 34899999873 6777884433333333 2346666666666554433
No 104
>PF14794 DUF4479: Domain of unknown function (DUF4479); PDB: 3BU2_C.
Probab=22.85 E-value=68 Score=16.56 Aligned_cols=18 Identities=17% Similarity=0.145 Sum_probs=10.1
Q ss_pred cCCCCCHHHHHHHHhhhc
Q psy15066 35 FDFTLEQSDVDKIDALDK 52 (62)
Q Consensus 35 ~~~~L~~e~~~~l~~~~~ 52 (62)
..+.||+++++.|+++-.
T Consensus 44 G~V~Lt~eqv~~LN~~l~ 61 (73)
T PF14794_consen 44 GQVFLTEEQVAKLNQALQ 61 (73)
T ss_dssp ------HHHHHHHHHHHH
T ss_pred cCEEcCHHHHHHHHHHHH
Confidence 356799999999987653
No 105
>PF07939 DUF1685: Protein of unknown function (DUF1685); InterPro: IPR012881 The members of this family are hypothetical eukaryotic proteins of unknown function. The region in question is approximately 100 amino acid residues long.
Probab=22.84 E-value=64 Score=16.50 Aligned_cols=11 Identities=27% Similarity=0.652 Sum_probs=8.7
Q ss_pred CCHHHHHHHHh
Q psy15066 39 LEQSDVDKIDA 49 (62)
Q Consensus 39 L~~e~~~~l~~ 49 (62)
||++|+++|..
T Consensus 1 lTd~dldELkG 11 (64)
T PF07939_consen 1 LTDDDLDELKG 11 (64)
T ss_pred CcHhHHHHHhh
Confidence 57888888874
No 106
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=22.81 E-value=60 Score=14.87 Aligned_cols=17 Identities=18% Similarity=0.278 Sum_probs=11.0
Q ss_pred CCHHHHHHHHhhhcCCc
Q psy15066 39 LEQSDVDKIDALDKKQR 55 (62)
Q Consensus 39 L~~e~~~~l~~~~~~~r 55 (62)
|++|-++.|++....|+
T Consensus 1 Lsde~I~~IN~~~~tWk 17 (41)
T PF08127_consen 1 LSDEFIDYINSKNTTWK 17 (41)
T ss_dssp S-HHHHHHHHHCT-SEE
T ss_pred CCHHHHHHHHcCCCccc
Confidence 67888888888754443
No 107
>PRK10128 2-keto-3-deoxy-L-rhamnonate aldolase; Provisional
Probab=22.62 E-value=1.8e+02 Score=18.59 Aligned_cols=32 Identities=16% Similarity=0.408 Sum_probs=25.0
Q ss_pred HHHHHHhCC--CeEecCCCCHHHHHHhHhccCCC
Q psy15066 7 ILRPQVQSG--TIPIPKSNTVKRITENINVFDFT 38 (62)
Q Consensus 7 al~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~ 38 (62)
.+..+|+.| .+++|-..+.++.++-+++.++.
T Consensus 82 ~i~r~LD~GA~GIivP~V~saeeA~~~V~a~rYp 115 (267)
T PRK10128 82 LIKQVLDIGAQTLLIPMVDTAEQARQVVSATRYP 115 (267)
T ss_pred HHHHHhCCCCCeeEecCcCCHHHHHHHHHhcCCC
Confidence 456777877 68888899999999888877653
No 108
>PF08920 SF3b1: Splicing factor 3B subunit 1; InterPro: IPR015016 This group of proteins consists of several eukaryotic splicing factor 3B subunit 1 proteins, which associate with p14 through a C terminus beta-strand that interacts with beta-3 of the p14 RNA recognition motif (RRM) beta-sheet, which is in turn connected to an alpha-helix by a loop that makes extensive contacts with both the shorter C-terminal helix and RRM of p14. This subunit is required for 'A' splicing complex assembly (formed by the stable binding of U2 snRNP to the branchpoint sequence in pre-mRNA) and 'E' splicing complex assembly []. ; PDB: 2FHO_A 3LQV_P 2PEH_D 2F9J_P 2F9D_Q.
Probab=22.52 E-value=83 Score=18.47 Aligned_cols=17 Identities=18% Similarity=0.247 Sum_probs=11.1
Q ss_pred HhccCCCCCHHHHHHHH
Q psy15066 32 INVFDFTLEQSDVDKID 48 (62)
Q Consensus 32 ~~~~~~~L~~e~~~~l~ 48 (62)
+..-+-.|++||++.|.
T Consensus 80 id~RNrpLTDEELD~mL 96 (144)
T PF08920_consen 80 IDERNRPLTDEELDAML 96 (144)
T ss_dssp HHHCTS-S-HHHHHHTS
T ss_pred hhhccCcCCHHHHHHhC
Confidence 33456679999999886
No 109
>COG3195 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.50 E-value=44 Score=20.38 Aligned_cols=18 Identities=11% Similarity=0.222 Sum_probs=14.9
Q ss_pred CCCHHHHHHHHhhhcCCc
Q psy15066 38 TLEQSDVDKIDALDKKQR 55 (62)
Q Consensus 38 ~L~~e~~~~l~~~~~~~r 55 (62)
.|+++|.+++..++..++
T Consensus 100 ~Ls~~E~a~f~~LN~aY~ 117 (176)
T COG3195 100 RLSPEEFARFTELNAAYV 117 (176)
T ss_pred cCCHHHHHHHHHHHHHHH
Confidence 699999999998886544
No 110
>PRK13717 conjugal transfer protein TrbI; Provisional
Probab=22.46 E-value=1.7e+02 Score=16.96 Aligned_cols=43 Identities=7% Similarity=0.120 Sum_probs=27.3
Q ss_pred HHHHHHHhCCCeEecCCCCHHHHHHhH--hccCCCCCHHHHHHHH
Q psy15066 6 FILRPQVQSGTIPIPKSNTVKRITENI--NVFDFTLEQSDVDKID 48 (62)
Q Consensus 6 val~W~l~~~~~~I~g~~~~~~l~~n~--~~~~~~L~~e~~~~l~ 48 (62)
.+++|++.+-..|-+-+.+.+...+.+ ++...+|++++.+.+.
T Consensus 33 a~~s~~~~~~~~P~iV~FDmK~Tld~F~~Q~~~~~lte~q~e~lt 77 (128)
T PRK13717 33 AAVSYGIVRLNAPVTAAFNMKQTVDAFFDSASQKQLSEAQSKALS 77 (128)
T ss_pred HHHHHHHhhcCCCeEEEEehHHHHHHHHHHHhccCCCHHHHHHHH
Confidence 356677666434444455666665554 2346789999998775
No 111
>PF11256 DUF3055: Protein of unknown function (DUF3055); InterPro: IPR021415 This family of proteins with unknown function appear to be restricted to Firmicutes.
Probab=22.42 E-value=84 Score=16.78 Aligned_cols=38 Identities=13% Similarity=0.252 Sum_probs=21.8
Q ss_pred HhCCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 12 VQSGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 12 l~~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
+|.|-..|.|+-..+.-.---.+ |.|+++|-++|.++-
T Consensus 41 mQtgr~ailg~dDlee~gyl~~~--f~l~~eea~eL~~fl 78 (81)
T PF11256_consen 41 MQTGRFAILGPDDLEEPGYLEHA--FGLSEEEAEELREFL 78 (81)
T ss_pred ecCCceEEEChhhcccccHHHHH--hCCCHHHHHHHHHHH
Confidence 44566667776544322222233 347999998887653
No 112
>COG1056 NadR Nicotinamide mononucleotide adenylyltransferase [Coenzyme metabolism]
Probab=21.80 E-value=42 Score=20.32 Aligned_cols=25 Identities=12% Similarity=0.130 Sum_probs=18.2
Q ss_pred HHHHHHhCC--CeEecCCCCHHHHHHh
Q psy15066 7 ILRPQVQSG--TIPIPKSNTVKRITEN 31 (62)
Q Consensus 7 al~W~l~~~--~~~I~g~~~~~~l~~n 31 (62)
.+.|++.+- .+++.||.+..|...|
T Consensus 22 vi~~al~~vDeliI~iGSa~~~~t~~n 48 (172)
T COG1056 22 VIKRALSKVDELIIVIGSAQESHTLKN 48 (172)
T ss_pred HHHHHHHhCCEEEEEEccCcccccccC
Confidence 577888874 8999999776655444
No 113
>PF02401 LYTB: LytB protein; InterPro: IPR003451 Terpenes are among the largest groups of natural products and include compounds such as vitamins, cholesterol and carotenoids. The biosynthesis of all terpenoids begins with one or both of the two C5 precursors of the pathway: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In animals, fungi, and certain bacteria, the synthesis of IPP and DMAPP occurs via the well-known mevalonate pathway, however, a second, nonmevalonate terpenoid pathway has been identified in many eubacteria, algae and the chloroplasts of higher plants []. LytB(IspH) catalyses the conversion of 1-hydroy-2-methyl-2-(E)-butenyl 4-diphosphate into IPP and DMAPP in this second pathway The enzyme appears to be responsible for a branch-step in the nonmevalonate pathway, in that IPP and DMAPP are produced in parallel from a single precursor although the exact mechanism of this is not currently fully understood []. Escherichia coli LytB protein had been found to regulate the activity of RelA (guanosine 3',5'-bispyrophosphate synthetase I), which in turn controls the level of a regulatory metabolite. It is involved in penicillin tolerance and the stringent response [].; GO: 0019288 isopentenyl diphosphate biosynthetic process, mevalonate-independent pathway, 0055114 oxidation-reduction process; PDB: 3DNF_B 3SZL_B 3KE8_B 3KEF_B 3SZU_A 3KEL_A 3F7T_B 3KE9_B 3KEM_B 3T0G_A ....
Probab=21.77 E-value=81 Score=20.42 Aligned_cols=31 Identities=10% Similarity=-0.144 Sum_probs=19.7
Q ss_pred CHHHHHHHHHHhCC-CeEecCCCCHHHHHHhH
Q psy15066 2 FMTSFILRPQVQSG-TIPIPKSNTVKRITENI 32 (62)
Q Consensus 2 s~aqval~W~l~~~-~~~I~g~~~~~~l~~n~ 32 (62)
++.++-..|+...+ +.+..|+++|+.+-+.+
T Consensus 243 ~~~el~~~~l~~~~~VGItaGASTP~~ii~eV 274 (281)
T PF02401_consen 243 TADELDPEWLKGVKKVGITAGASTPDWIIEEV 274 (281)
T ss_dssp SGGG--HHHHTT-SEEEEEE-TTS-HHHHHHH
T ss_pred CccccCHhHhCCCCEEEEEccCCCCHHHHHHH
Confidence 34566678888775 88889999999887654
No 114
>PF11017 DUF2855: Protein of unknown function (DUF2855); InterPro: IPR021276 This family of proteins has no known function.
Probab=21.72 E-value=1.1e+02 Score=20.28 Aligned_cols=46 Identities=24% Similarity=0.403 Sum_probs=29.6
Q ss_pred HHHHHHHHHHh--CC---CeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHh
Q psy15066 3 MTSFILRPQVQ--SG---TIPIPKSNTVKRITENINVFDFTLEQSDVDKIDA 49 (62)
Q Consensus 3 ~aqval~W~l~--~~---~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~ 49 (62)
-..++++|+|+ .+ ++-+..+.|.. ..+.+.+.+--++=+++..|+.
T Consensus 147 KTA~glA~~L~~~~~~~~~vglTS~~N~~-Fve~lg~Yd~V~~Yd~i~~l~~ 197 (314)
T PF11017_consen 147 KTAIGLAYCLKKQRGPPKVVGLTSARNVA-FVESLGCYDEVLTYDDIDSLDA 197 (314)
T ss_pred HHHHHHHHHhhccCCCceEEEEecCcchh-hhhccCCceEEeehhhhhhccC
Confidence 45688999999 33 44444455554 4455566666677777777754
No 115
>PF14658 EF-hand_9: EF-hand domain
Probab=21.65 E-value=1.3e+02 Score=15.30 Aligned_cols=28 Identities=11% Similarity=0.081 Sum_probs=22.4
Q ss_pred CHHHHHHhHhccCC-CCCHHHHHHHHhhh
Q psy15066 24 TVKRITENINVFDF-TLEQSDVDKIDALD 51 (62)
Q Consensus 24 ~~~~l~~n~~~~~~-~L~~e~~~~l~~~~ 51 (62)
..+++...+++... ..++.+++.|.+..
T Consensus 16 ~v~~l~~~Lra~~~~~p~e~~Lq~l~~el 44 (66)
T PF14658_consen 16 PVSDLITYLRAVTGRSPEESELQDLINEL 44 (66)
T ss_pred eHHHHHHHHHHHcCCCCcHHHHHHHHHHh
Confidence 38899999999887 78888888876543
No 116
>PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Limited proteolysis of most large protein precursors is carried out in vivo by the subtilisin-like pro-protein convertases. Many important biological processes such as peptide hormone synthesis, viral protein processing and receptor maturation involve proteolytic processing by these enzymes []. The subtilisin-serine protease (SRSP) family hormone and pro-protein convertases (furin, PC1/3, PC2, PC4, PACE4, PC5/6, and PC7/7/LPC) act within the secretory pathway to cleave polypeptide precursors at specific basic sites, generating their biologically active forms. Serum proteins, pro-hormones, receptors, zymogens, viral surface glycoproteins, bacterial toxins, amongst others, are activated by this route []. The SRSPs share the same domain structure, including a signal peptide, the pro-peptide, the catalytic domain, the P/middle or homo B domain, and the C terminus. Proteinase propeptide inhibitors (sometimes refered to as activation peptides) are responsible for the modulation of folding and activity of the pro-enzyme or zymogen. The pro-segment docks into the enzyme moiety shielding the substrate binding site, thereby promoting inhibition of the enzyme. Several such propeptides share a similar topology [], despite often low sequence identities []. The propeptide region has an open-sandwich antiparallel-alpha/antiparallel-beta fold, with two alpha-helices and four beta-strands with a (beta/alpha/beta)x2 topology. This group of sequences contain the propeptide domain at the N terminus of peptidases belonging to MEROPS family S8A, subtilisins. A number of the members of this group of sequences belong to MEROPS inhibitor family I9, clan I-. The propeptide is removed by proteolytic cleavage; removal activating the enzyme.; GO: 0004252 serine-type endopeptidase activity, 0042802 identical protein binding, 0043086 negative regulation of catalytic activity; PDB: 3CNQ_P 1SPB_P 3CO0_P 1ITP_A 1V5I_B 1SCJ_B 3P5B_P 2XTJ_P 2W2M_P 2P4E_P ....
Probab=21.64 E-value=27 Score=17.28 Aligned_cols=20 Identities=20% Similarity=0.368 Sum_probs=14.8
Q ss_pred HhccCCCCCHHHHHHHHhhh
Q psy15066 32 INVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 32 ~~~~~~~L~~e~~~~l~~~~ 51 (62)
++++..+|++++++.|.+..
T Consensus 49 ~~Gfs~~l~~~~i~~L~~~p 68 (82)
T PF05922_consen 49 FNGFSAKLSEEEIEKLRKDP 68 (82)
T ss_dssp SSEEEEEE-HHHHHHHHTST
T ss_pred EEEEEEEeCHHHHHHHHcCC
Confidence 56667789999999998653
No 117
>KOG0658|consensus
Probab=21.49 E-value=1.7e+02 Score=19.95 Aligned_cols=37 Identities=22% Similarity=0.370 Sum_probs=29.6
Q ss_pred CCCeEecCCCCHHHHHHhHhccCCCCCHHHHHHHHhhh
Q psy15066 14 SGTIPIPKSNTVKRITENINVFDFTLEQSDVDKIDALD 51 (62)
Q Consensus 14 ~~~~~I~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~~~ 51 (62)
+|-...||-+..+|+++.++++. .-++||+..++--.
T Consensus 221 ~g~plFpG~s~~dQL~eIik~lG-~Pt~e~I~~mn~~y 257 (364)
T KOG0658|consen 221 KGQPLFPGDSSVDQLVEIIKVLG-TPTREDIKSMNPNY 257 (364)
T ss_pred cCCcccCCCCHHHHHHHHHHHhC-CCCHHHHhhcCccc
Confidence 46677899999999999999876 45788888776543
No 118
>PF13101 DUF3945: Protein of unknown function (DUF3945)
Probab=21.27 E-value=43 Score=16.42 Aligned_cols=22 Identities=9% Similarity=0.096 Sum_probs=15.4
Q ss_pred HHHhHhccCCCCCHHHHHHHHh
Q psy15066 28 ITENINVFDFTLEQSDVDKIDA 49 (62)
Q Consensus 28 l~~n~~~~~~~L~~e~~~~l~~ 49 (62)
+.-+-...+.+|++++.+.|.+
T Consensus 24 i~ip~~i~g~~Ls~~q~~~L~~ 45 (59)
T PF13101_consen 24 IRIPDKIKGVELSPEQKEDLRE 45 (59)
T ss_pred ccccceecCccCCHHHHHHHHC
Confidence 3333445578899999998864
No 119
>TIGR02698 CopY_TcrY copper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
Probab=21.23 E-value=82 Score=17.74 Aligned_cols=17 Identities=29% Similarity=0.507 Sum_probs=13.8
Q ss_pred CCCCHHHHHHHHhhhcC
Q psy15066 37 FTLEQSDVDKIDALDKK 53 (62)
Q Consensus 37 ~~L~~e~~~~l~~~~~~ 53 (62)
=.|+++|+++|.++-..
T Consensus 100 ~~ls~eele~L~~li~~ 116 (130)
T TIGR02698 100 SPLSQTDIEKLEKLLSE 116 (130)
T ss_pred CCCCHHHHHHHHHHHHh
Confidence 36999999999887654
No 120
>PRK00630 nickel responsive regulator; Provisional
Probab=20.55 E-value=41 Score=19.75 Aligned_cols=30 Identities=27% Similarity=0.286 Sum_probs=21.1
Q ss_pred CHHHHHHhHhccCCCCCHHHHHHHHhhhcCC
Q psy15066 24 TVKRITENINVFDFTLEQSDVDKIDALDKKQ 54 (62)
Q Consensus 24 ~~~~l~~n~~~~~~~L~~e~~~~l~~~~~~~ 54 (62)
+++++. .+..++++|+++-++++|++....
T Consensus 3 ~~~~m~-~~~risVslp~~Ll~~lD~~~~~~ 32 (148)
T PRK00630 3 TPNKDD-SIIRFSVSLQQNLLDELDNRIIKN 32 (148)
T ss_pred hhhhcc-CeEEEEEEcCHHHHHHHHHHHHHc
Confidence 345554 345678889999999999885443
No 121
>PF04316 FlgM: Anti-sigma-28 factor, FlgM; InterPro: IPR007412 FlgM binds and inhibits the activity of the transcription factor sigma 28. Inhibition of sigma 28 prevents the expression of genes from flagellar transcriptional class 3, which include genes for the filament and chemotaxis. Correctly assembled basal body-hook structures export FlgM, relieving inhibition of sigma 28 and allowing expression of class 3 genes. NMR studies show that free FlgM is mostly unfolded, which may facilitate its export. The C-terminal half of FlgM adopts a tertiary structure when it binds to sigma 28. All mutations in FlgM that prevent sigma 28 inhibition affect the C-terminal domain and is the region thought to constitute the binding domain. A minimal binding domain has been identified between Glu 64 and Arg 88 in Salmonella typhimurium (P26477 from SWISSPROT).The N-terminal portion remains unstructured and may be necessary for recognition by the export machinery [].; GO: 0045892 negative regulation of transcription, DNA-dependent, 0019861 flagellum; PDB: 1RP3_B 1SC5_B.
Probab=20.46 E-value=1.2e+02 Score=14.49 Aligned_cols=28 Identities=14% Similarity=0.293 Sum_probs=15.5
Q ss_pred EecCCCCHHHHHHhHhccCCCCCHHHHH
Q psy15066 18 PIPKSNTVKRITENINVFDFTLEQSDVD 45 (62)
Q Consensus 18 ~I~g~~~~~~l~~n~~~~~~~L~~e~~~ 45 (62)
+-..+.+++.|++-+....+.++.++++
T Consensus 23 ~~vr~~kV~~ik~~I~~G~Y~vd~~~iA 50 (57)
T PF04316_consen 23 PDVRAEKVAEIKAAIASGTYKVDAEKIA 50 (57)
T ss_dssp HHCSHHHHHHHHHHHHTT-----HHHHH
T ss_pred chhhHHHHHHHHHHHHcCCCCCCHHHHH
Confidence 3344556777787787778888877764
No 122
>cd07925 LigA_like_1 The A subunit of Uncharacterized proteins with similarity to Protocatechuate 4,5-dioxygenase (LigAB). The proteins of unknown function in this subfamily are similar to the A subunit of the Protocatechuate (PCA) 4,5-dioxygenase (LigAB). LigAB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Dioxygenases play key roles in the degradation of aromatic compounds. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds.
Probab=20.30 E-value=94 Score=17.43 Aligned_cols=32 Identities=6% Similarity=0.170 Sum_probs=23.4
Q ss_pred ecCCCCHHHHHHhHhcc--CCCCCHHHHHHHHhh
Q psy15066 19 IPKSNTVKRITENINVF--DFTLEQSDVDKIDAL 50 (62)
Q Consensus 19 I~g~~~~~~l~~n~~~~--~~~L~~e~~~~l~~~ 50 (62)
+..+.+.++.+++-++. ++.||+|+-+.|..-
T Consensus 23 l~~~~nRe~F~aD~~Ay~~~~~Lteeqr~av~~R 56 (106)
T cd07925 23 FNDAANREAFLADEEAYCEKFGLTPEQKQAVRNR 56 (106)
T ss_pred HCCHHHHHHHHhCHHHHHHHcCCCHHHHHHHHHh
Confidence 34566677777776653 789999999888754
No 123
>COG0289 DapB Dihydrodipicolinate reductase [Amino acid transport and metabolism]
Probab=20.25 E-value=2.1e+02 Score=18.61 Aligned_cols=17 Identities=12% Similarity=0.304 Sum_probs=12.7
Q ss_pred CCHHHHHHHHhhhcCCc
Q psy15066 39 LEQSDVDKIDALDKKQR 55 (62)
Q Consensus 39 L~~e~~~~l~~~~~~~r 55 (62)
+++++++.|+++.+..+
T Consensus 103 f~~e~~~~l~~~a~~v~ 119 (266)
T COG0289 103 FTEEQLEKLREAAEKVP 119 (266)
T ss_pred CCHHHHHHHHHHHhhCC
Confidence 78888888888776643
No 124
>PF14823 Sirohm_synth_C: Sirohaem biosynthesis protein C-terminal; PDB: 1KYQ_B.
Probab=20.23 E-value=84 Score=16.08 Aligned_cols=19 Identities=11% Similarity=0.251 Sum_probs=13.1
Q ss_pred CCCHHHHHHHHhhhcCCcc
Q psy15066 38 TLEQSDVDKIDALDKKQRS 56 (62)
Q Consensus 38 ~L~~e~~~~l~~~~~~~r~ 56 (62)
+|++++++.|-......++
T Consensus 49 ~l~~~d~~~Ll~~y~~~~~ 67 (70)
T PF14823_consen 49 ELDDEDMENLLDFYWSNKV 67 (70)
T ss_dssp GB-HHHHHHHHHHHHTTTT
T ss_pred HCCHHHHHHHHHHhhccCC
Confidence 5899999988776655444
No 125
>KOG1461|consensus
Probab=20.20 E-value=2.2e+02 Score=21.02 Aligned_cols=39 Identities=21% Similarity=0.199 Sum_probs=31.1
Q ss_pred HHHHHHHHHhCC--CeEecCCCCHHHHHHhHhccCCCCCHH
Q psy15066 4 TSFILRPQVQSG--TIPIPKSNTVKRITENINVFDFTLEQS 42 (62)
Q Consensus 4 aqval~W~l~~~--~~~I~g~~~~~~l~~n~~~~~~~L~~e 42 (62)
-.-+|.|+...| -+.+.++.+..|+.+.++..+|.+...
T Consensus 58 IdYtL~~L~~agV~eVfvfc~~~~~qi~e~i~~sew~~~~~ 98 (673)
T KOG1461|consen 58 IDYTLEWLERAGVEEVFVFCSAHAAQIIEYIEKSEWYLPMS 98 (673)
T ss_pred HHHHHHHHHhcCceEEEEEecccHHHHHHHHhhcccccccc
Confidence 456799999998 577888999999999998877655443
No 126
>cd04517 TLF TBP-like factors (TLF; also called TLP, TRF, TRP), which are found in most metazoans. TLFs and TBPs have well-conserved core domains; however, they only share about 60% similarity. TLFs, like TBPs, interact with TFIIA and TFIIB, which are part of the basal transcription machinery. Yet, in contrast to TBPs, TLFs seem not to interact with the TATA-box and even have a negative effect on the transcription of TATA-containing promoters. Recent results indicate that TLFs are involved in the transcription via TATA-less promoters.
Probab=20.19 E-value=1e+02 Score=18.40 Aligned_cols=19 Identities=11% Similarity=0.510 Sum_probs=16.1
Q ss_pred CCeEecCCCCHHHHHHhHh
Q psy15066 15 GTIPIPKSNTVKRITENIN 33 (62)
Q Consensus 15 ~~~~I~g~~~~~~l~~n~~ 33 (62)
|-++|.|+++++++++-++
T Consensus 148 GkivitGaks~~~~~~a~~ 166 (174)
T cd04517 148 GSVTVTGARSMEDVREAVE 166 (174)
T ss_pred CEEEEEecCCHHHHHHHHH
Confidence 6788999999999987664
No 127
>PF03965 Penicillinase_R: Penicillinase repressor; InterPro: IPR005650 Proteins in this entry are transcriptional regulators found in a variety of bacteria and a small number of archaea. Many are BlaI/MecI proteins which regulate resistance to penicillins (beta-lactams), though at least one protein (Q47839 from SWISSPROT) appears to be involved in the regulation of copper homeostasis []. BlaI regulators repress the expression of penicillin-degrading enzymes (penicillinases) until the cell encounters the antiobiotic, at which point repression ceases and penicillinase expression occurs, allowing cell growth []. MecI regulators repress the expression of MecA, a cell-wall biosynthetic enzyme not inhibited by penicillins at clinically achievable concentrations, until the presence of the antibiotic is detected []. At this point repression ends and MecA expression occurs which, together with the switching off of the penicillin-sensitive enzymes, allows the cell to grow despite the presence of antibiotic.; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 2G9W_A 2K4B_A 1XSD_A 1SD4_A 1SD7_A 1SD6_A 2P7C_B 1P6R_A 1OKR_B 2D45_B ....
Probab=20.15 E-value=84 Score=17.04 Aligned_cols=14 Identities=14% Similarity=0.354 Sum_probs=10.1
Q ss_pred CCCHHHHHHHHhhh
Q psy15066 38 TLEQSDVDKIDALD 51 (62)
Q Consensus 38 ~L~~e~~~~l~~~~ 51 (62)
+|+++|+++|.++-
T Consensus 100 ~ls~~el~~L~~li 113 (115)
T PF03965_consen 100 ELSPEELEELRKLI 113 (115)
T ss_dssp -S-HHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHH
Confidence 79999999998753
No 128
>cd07923 Gallate_dioxygenase_C The C-terminal domain of Gallate Dioxygenase, which catalyzes the oxidization and subsequent ring-opening of gallate. Gallate Dioxygenase catalyzes the oxidization and subsequent ring-opening of gallate, an intermediate in the degradation of the aromatic compound, syringate. The reaction product of gallate dioxygenase is 4-oxalomesaconate. The amino acid sequence of the N-terminal and C-terminal regions of gallate dioxygenase exhibits homology with the sequence of the PCA 4,5-dioxygenase B (catalytic) and A subunits, respectively. This model represents the C-terminal domain, which is similar to the A subunit of PCA 4,5-dioxygenase (or LigAB). The enzyme is estimated to be a homodimer according to the Escherichia coli enzyme. Since enzymes in this subfamily have fused A and B subunits, the dimer interface may resemble the tetramer interface of classical LigAB enzymes. This enzyme belongs to the class III extradiol dioxygenase family, composed of enzymes whi
Probab=20.09 E-value=1.2e+02 Score=16.63 Aligned_cols=30 Identities=13% Similarity=0.176 Sum_probs=21.6
Q ss_pred CCCCHHHHHHhHhcc--CCCCCHHHHHHHHhh
Q psy15066 21 KSNTVKRITENINVF--DFTLEQSDVDKIDAL 50 (62)
Q Consensus 21 g~~~~~~l~~n~~~~--~~~L~~e~~~~l~~~ 50 (62)
.+.+.++.+++-++. ++.||+|+.+.|...
T Consensus 17 ~a~~RerF~~D~ea~~~e~gLt~Ee~~av~~r 48 (94)
T cd07923 17 EPAHRERFLEDPEALFDEAGLTEEERTLIRNR 48 (94)
T ss_pred CHHHHHHHHhCHHHHHHHcCCCHHHHHHHHcc
Confidence 455566777766553 678999999988764
No 129
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=20.02 E-value=1e+02 Score=16.39 Aligned_cols=41 Identities=7% Similarity=0.242 Sum_probs=24.4
Q ss_pred HHHHHHHHHhCC--CeEecC--CCCHHHHHHhHhccCCCCCHHHH
Q psy15066 4 TSFILRPQVQSG--TIPIPK--SNTVKRITENINVFDFTLEQSDV 44 (62)
Q Consensus 4 aqval~W~l~~~--~~~I~g--~~~~~~l~~n~~~~~~~L~~e~~ 44 (62)
+.=++.++..+| +..+.- +++++++.+-++...+..+++++
T Consensus 19 a~e~l~~L~~~g~~~~~lTNns~~s~~~~~~~L~~~Gi~~~~~~i 63 (101)
T PF13344_consen 19 AVEALDALRERGKPVVFLTNNSSRSREEYAKKLKKLGIPVDEDEI 63 (101)
T ss_dssp HHHHHHHHHHTTSEEEEEES-SSS-HHHHHHHHHHTTTT--GGGE
T ss_pred HHHHHHHHHHcCCCEEEEeCCCCCCHHHHHHHHHhcCcCCCcCEE
Confidence 344677787776 444433 34557777777888888777653
No 130
>PF14338 Mrr_N: Mrr N-terminal domain
Probab=20.01 E-value=1.5e+02 Score=15.38 Aligned_cols=29 Identities=21% Similarity=0.187 Sum_probs=20.7
Q ss_pred cCCCCHHHHHHhHhccCCCCCHHHHHHHHh
Q psy15066 20 PKSNTVKRITENINVFDFTLEQSDVDKIDA 49 (62)
Q Consensus 20 ~g~~~~~~l~~n~~~~~~~L~~e~~~~l~~ 49 (62)
-|+.+...+.+.+. ..+.|++||....-.
T Consensus 17 g~~~~~~ei~~~v~-~~~~ls~e~~~~~~~ 45 (92)
T PF14338_consen 17 GGSASRKEIYERVA-ERFGLSDEERNERLP 45 (92)
T ss_pred CCCcCHHHHHHHHH-HHhCCCHHHHHHHcc
Confidence 35667777877774 457899999887543
Done!