Query psy959
Match_columns 109
No_of_seqs 162 out of 1022
Neff 8.8
Searched_HMMs 46136
Date Fri Aug 16 23:33:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy959.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/959hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.9 4.1E-26 8.8E-31 175.6 9.1 104 3-108 1092-1196(1289)
2 KOG1214|consensus 99.9 4.7E-25 1E-29 169.8 9.6 103 3-109 1049-1154(1289)
3 KOG1215|consensus 99.7 5.4E-16 1.2E-20 123.5 9.8 98 3-103 504-603 (877)
4 PF00058 Ldl_recept_b: Low-den 99.6 2.3E-14 4.9E-19 74.5 6.8 41 33-77 1-42 (42)
5 KOG1215|consensus 99.4 1.1E-12 2.4E-17 104.7 10.0 100 5-108 463-564 (877)
6 smart00135 LY Low-density lipo 99.3 1.2E-11 2.5E-16 63.7 6.1 42 61-102 2-43 (43)
7 smart00135 LY Low-density lipo 99.2 2E-10 4.4E-15 58.9 6.3 40 16-59 4-43 (43)
8 PLN02919 haloacid dehalogenase 99.0 1.4E-08 3.1E-13 82.9 12.2 87 15-105 562-661 (1057)
9 PF00058 Ldl_recept_b: Low-den 98.9 3.3E-09 7.1E-14 54.9 5.4 30 80-109 1-31 (42)
10 PF08450 SGL: SMP-30/Gluconola 98.9 4E-08 8.8E-13 67.9 12.2 88 13-105 126-220 (246)
11 PF08450 SGL: SMP-30/Gluconola 98.9 5.2E-08 1.1E-12 67.3 11.0 78 19-99 84-165 (246)
12 PF03088 Str_synth: Strictosid 98.8 5.3E-08 1.1E-12 58.2 8.5 76 24-100 1-89 (89)
13 PLN02919 haloacid dehalogenase 98.7 5E-07 1.1E-11 74.0 12.3 78 20-101 623-716 (1057)
14 COG3386 Gluconolactonase [Carb 98.4 5.3E-06 1.2E-10 59.8 10.6 91 10-105 152-250 (307)
15 COG3386 Gluconolactonase [Carb 98.3 7.8E-06 1.7E-10 59.0 9.2 77 20-98 110-193 (307)
16 TIGR02604 Piru_Ver_Nterm putat 98.0 0.00029 6.3E-09 51.8 11.7 93 5-106 54-180 (367)
17 TIGR02604 Piru_Ver_Nterm putat 97.9 9.2E-05 2E-09 54.4 8.8 79 12-93 107-208 (367)
18 PF01436 NHL: NHL repeat; Int 97.9 4.1E-05 8.8E-10 35.9 4.6 27 67-94 1-27 (28)
19 PF01436 NHL: NHL repeat; Int 97.9 4.5E-05 9.7E-10 35.8 4.4 27 20-51 1-27 (28)
20 PF06977 SdiA-regulated: SdiA- 97.8 0.00017 3.7E-09 50.6 8.2 63 20-87 170-240 (248)
21 PF10282 Lactonase: Lactonase, 97.8 0.0004 8.7E-09 50.5 10.2 74 22-99 246-323 (345)
22 KOG4659|consensus 97.7 0.0002 4.2E-09 59.6 7.7 75 19-102 473-565 (1899)
23 KOG1520|consensus 97.6 0.00012 2.6E-09 53.9 5.0 81 20-102 160-253 (376)
24 PRK11028 6-phosphogluconolacto 97.5 0.0079 1.7E-07 43.1 12.7 77 18-98 123-205 (330)
25 PF10282 Lactonase: Lactonase, 97.4 0.0065 1.4E-07 44.2 12.2 75 20-98 143-222 (345)
26 KOG1520|consensus 97.3 0.00071 1.5E-08 49.9 6.2 63 20-88 114-180 (376)
27 KOG4499|consensus 97.3 0.005 1.1E-07 43.3 9.5 85 9-98 146-241 (310)
28 PF03022 MRJP: Major royal jel 97.3 0.0025 5.5E-08 45.6 8.4 64 21-89 186-256 (287)
29 COG3391 Uncharacterized conser 97.3 0.0064 1.4E-07 45.0 10.8 79 22-103 117-195 (381)
30 PF07995 GSDH: Glucose / Sorbo 97.3 0.0027 5.8E-08 46.2 8.5 75 20-100 1-92 (331)
31 PF01731 Arylesterase: Arylest 97.2 0.0015 3.2E-08 38.8 5.5 42 8-53 41-82 (86)
32 TIGR03606 non_repeat_PQQ dehyd 97.2 0.019 4.2E-07 43.7 12.5 82 13-99 22-125 (454)
33 PRK11028 6-phosphogluconolacto 97.2 0.0068 1.5E-07 43.4 9.8 75 21-99 80-157 (330)
34 PF07995 GSDH: Glucose / Sorbo 97.2 0.0016 3.5E-08 47.3 6.6 69 22-91 115-204 (331)
35 PF03088 Str_synth: Strictosid 97.2 0.0014 3.1E-08 39.1 5.1 47 7-57 43-89 (89)
36 PF01731 Arylesterase: Arylest 97.1 0.0028 6E-08 37.6 6.1 43 54-97 41-83 (86)
37 KOG4659|consensus 96.9 0.007 1.5E-07 50.9 8.6 91 2-99 337-439 (1899)
38 TIGR03606 non_repeat_PQQ dehyd 96.9 0.012 2.6E-07 44.8 9.4 75 13-89 133-250 (454)
39 PF06977 SdiA-regulated: SdiA- 96.7 0.063 1.4E-06 37.8 11.1 80 23-105 120-208 (248)
40 COG3391 Uncharacterized conser 96.4 0.074 1.6E-06 39.4 10.2 73 21-98 74-148 (381)
41 COG2706 3-carboxymuconate cycl 96.2 0.12 2.5E-06 38.0 10.0 77 19-99 189-275 (346)
42 PF03022 MRJP: Major royal jel 95.9 0.28 6.1E-06 35.1 11.0 81 22-108 129-229 (287)
43 COG2133 Glucose/sorbosone dehy 95.8 0.034 7.5E-07 41.7 6.1 79 12-92 163-263 (399)
44 PRK04792 tolB translocation pr 95.8 0.59 1.3E-05 35.4 12.8 98 6-106 203-302 (448)
45 TIGR03866 PQQ_ABC_repeats PQQ- 95.6 0.43 9.3E-06 32.7 12.0 76 22-102 208-283 (300)
46 TIGR03866 PQQ_ABC_repeats PQQ- 95.4 0.54 1.2E-05 32.2 13.2 78 17-99 27-104 (300)
47 COG2706 3-carboxymuconate cycl 95.1 0.57 1.2E-05 34.5 10.1 73 24-100 247-323 (346)
48 PRK02889 tolB translocation pr 95.0 1.1 2.3E-05 33.7 12.7 98 5-105 180-279 (427)
49 PRK00178 tolB translocation pr 94.9 1.1 2.4E-05 33.3 13.3 99 5-106 183-283 (430)
50 PRK04922 tolB translocation pr 94.9 1.2 2.5E-05 33.5 11.8 97 6-105 189-287 (433)
51 PRK01742 tolB translocation pr 94.7 1.3 2.9E-05 33.1 13.4 98 6-106 189-288 (429)
52 TIGR03032 conserved hypothetic 94.6 0.24 5.1E-06 36.2 7.1 63 14-86 196-259 (335)
53 COG3204 Uncharacterized protei 94.5 0.27 5.9E-06 35.6 7.2 62 21-87 233-302 (316)
54 PRK03629 tolB translocation pr 94.5 1.5 3.3E-05 32.9 13.5 99 5-106 183-283 (429)
55 COG4257 Vgb Streptogramin lyas 94.3 0.81 1.8E-05 33.2 9.1 74 17-97 58-131 (353)
56 TIGR02658 TTQ_MADH_Hv methylam 94.2 1.4 2.9E-05 32.7 10.5 76 22-98 246-330 (352)
57 PRK05137 tolB translocation pr 94.1 1.9 4.1E-05 32.3 11.8 78 25-106 338-420 (435)
58 PF13449 Phytase-like: Esteras 94.0 0.78 1.7E-05 33.3 8.8 84 20-105 19-127 (326)
59 PF13449 Phytase-like: Esteras 93.9 0.3 6.5E-06 35.4 6.6 65 22-87 86-166 (326)
60 PF05787 DUF839: Bacterial pro 93.6 0.89 1.9E-05 35.4 8.9 69 18-87 347-454 (524)
61 COG4946 Uncharacterized protei 93.5 2.9 6.3E-05 32.5 11.8 95 4-103 385-482 (668)
62 PRK04792 tolB translocation pr 93.4 2.6 5.7E-05 31.9 12.7 81 23-106 308-390 (448)
63 PF02333 Phytase: Phytase; In 93.0 3 6.5E-05 31.3 11.7 77 21-102 208-294 (381)
64 PRK04043 tolB translocation pr 92.9 3.1 6.8E-05 31.4 13.9 98 5-106 173-273 (419)
65 PF02333 Phytase: Phytase; In 92.6 3.5 7.5E-05 31.0 11.4 77 18-98 153-237 (381)
66 PRK05137 tolB translocation pr 92.4 3.6 7.8E-05 30.8 13.2 98 6-106 187-286 (435)
67 PRK04922 tolB translocation pr 92.4 3.6 7.8E-05 30.9 12.8 80 24-106 295-376 (433)
68 COG3823 Glutamine cyclotransfe 92.4 2.2 4.9E-05 29.8 8.5 69 28-100 181-261 (262)
69 PRK04043 tolB translocation pr 92.3 3.9 8.4E-05 30.9 12.4 77 26-105 238-316 (419)
70 TIGR02276 beta_rpt_yvtn 40-res 92.1 0.77 1.7E-05 22.4 6.3 42 30-76 1-42 (42)
71 KOG4499|consensus 92.0 1.8 3.8E-05 30.9 7.8 47 11-62 195-249 (310)
72 COG3204 Uncharacterized protei 92.0 3.1 6.7E-05 30.3 9.2 82 22-106 182-271 (316)
73 TIGR02800 propeller_TolB tol-p 91.8 3.9 8.5E-05 30.0 11.6 81 21-105 322-404 (417)
74 PRK01029 tolB translocation pr 91.4 5 0.00011 30.3 13.5 81 24-107 330-412 (428)
75 TIGR02800 propeller_TolB tol-p 91.4 4.5 9.7E-05 29.7 12.9 98 5-105 174-273 (417)
76 PF06739 SBBP: Beta-propeller 91.1 0.22 4.8E-06 24.7 2.0 20 21-41 13-32 (38)
77 TIGR02658 TTQ_MADH_Hv methylam 90.8 5.3 0.00012 29.6 12.8 80 20-100 46-138 (352)
78 PF00930 DPPIV_N: Dipeptidyl p 90.5 5.3 0.00012 29.1 9.8 84 20-105 234-323 (353)
79 PRK02889 tolB translocation pr 90.0 6.6 0.00014 29.4 12.7 80 24-106 287-368 (427)
80 PF14583 Pectate_lyase22: Olig 89.7 4.4 9.5E-05 30.5 8.5 72 32-107 48-119 (386)
81 KOG4649|consensus 89.6 2.1 4.5E-05 31.0 6.5 30 21-54 31-60 (354)
82 PRK03629 tolB translocation pr 89.6 7.2 0.00016 29.3 12.5 79 24-105 246-326 (429)
83 PRK00178 tolB translocation pr 89.1 7.6 0.00016 28.9 13.1 79 25-106 291-371 (430)
84 COG0823 TolB Periplasmic compo 88.1 9.9 0.00021 28.9 9.9 77 27-106 244-322 (425)
85 TIGR03118 PEPCTERM_chp_1 conse 84.7 14 0.0003 27.3 9.4 75 31-105 197-287 (336)
86 COG4257 Vgb Streptogramin lyas 84.6 13 0.00029 27.1 8.8 79 21-105 233-312 (353)
87 COG2133 Glucose/sorbosone dehy 83.1 18 0.00039 27.4 9.2 75 22-100 315-399 (399)
88 PF02239 Cytochrom_D1: Cytochr 82.7 17 0.00038 26.9 11.8 71 21-98 36-108 (369)
89 PF05694 SBP56: 56kDa selenium 82.1 16 0.00034 28.2 8.2 61 23-87 314-393 (461)
90 PRK02888 nitrous-oxide reducta 82.0 25 0.00054 28.3 9.6 32 67-98 320-351 (635)
91 PRK01742 tolB translocation pr 81.5 20 0.00043 26.9 12.6 77 25-104 252-330 (429)
92 PF14759 Reductase_C: Reductas 80.4 8.6 0.00019 22.2 5.2 28 36-67 2-29 (85)
93 PF05096 Glu_cyclase_2: Glutam 80.0 19 0.00042 25.7 8.9 68 28-99 180-262 (264)
94 PF00930 DPPIV_N: Dipeptidyl p 79.0 7.7 0.00017 28.3 5.7 56 25-81 285-341 (353)
95 TIGR03032 conserved hypothetic 76.2 11 0.00024 27.8 5.7 38 65-104 200-238 (335)
96 PF05345 He_PIG: Putative Ig d 74.9 4 8.6E-05 21.3 2.4 24 66-89 9-32 (49)
97 TIGR03118 PEPCTERM_chp_1 conse 73.2 35 0.00076 25.2 8.1 81 3-87 224-322 (336)
98 COG4247 Phy 3-phytase (myo-ino 69.9 40 0.00087 24.5 7.3 67 19-88 151-225 (364)
99 PRK01029 tolB translocation pr 67.9 50 0.0011 24.9 12.8 80 24-106 284-367 (428)
100 COG3211 PhoX Predicted phospha 65.8 69 0.0015 25.7 8.2 68 20-87 416-519 (616)
101 COG4946 Uncharacterized protei 57.5 61 0.0013 25.6 6.5 53 45-102 204-260 (668)
102 PF05787 DUF839: Bacterial pro 56.3 35 0.00076 26.8 5.3 23 66-88 348-370 (524)
103 KOG0315|consensus 56.1 74 0.0016 23.0 6.4 54 24-82 128-182 (311)
104 TIGR03075 PQQ_enz_alc_DH PQQ-d 55.2 84 0.0018 24.6 7.2 16 26-41 274-289 (527)
105 PF14251 DUF4346: Domain of un 54.8 51 0.0011 20.7 5.0 65 22-100 8-72 (119)
106 COG3823 Glutamine cyclotransfe 54.4 45 0.00098 23.5 5.0 34 20-57 228-261 (262)
107 cd01248 PH_PLC Phospholipase C 52.2 46 0.001 20.2 4.5 22 68-89 21-42 (115)
108 KOG0285|consensus 52.0 82 0.0018 24.0 6.3 73 19-97 150-222 (460)
109 PF01011 PQQ: PQQ enzyme repea 51.6 19 0.00041 17.3 2.2 17 25-41 12-28 (38)
110 KOG4649|consensus 51.2 34 0.00075 24.9 4.1 29 69-97 32-60 (354)
111 PF02239 Cytochrom_D1: Cytochr 50.4 1E+02 0.0022 22.9 12.9 77 20-100 77-160 (369)
112 PF09910 DUF2139: Uncharacteri 47.5 89 0.0019 23.1 5.8 78 26-106 40-147 (339)
113 PF06433 Me-amine-dh_H: Methyl 46.1 1.2E+02 0.0027 22.6 6.9 53 29-86 192-256 (342)
114 TIGR03075 PQQ_enz_alc_DH PQQ-d 44.3 1.5E+02 0.0033 23.2 8.1 70 25-106 238-319 (527)
115 KOG3567|consensus 43.1 75 0.0016 24.8 5.1 91 6-100 394-498 (501)
116 KOG0273|consensus 42.7 1.7E+02 0.0036 23.1 7.4 75 21-99 317-392 (524)
117 KOG0650|consensus 41.9 98 0.0021 25.1 5.6 76 6-88 539-637 (733)
118 PF12814 Mcp5_PH: Meiotic cell 41.0 27 0.00058 21.7 2.2 21 21-41 30-50 (123)
119 COG4247 Phy 3-phytase (myo-ino 39.6 1.5E+02 0.0033 21.7 6.2 78 7-91 188-271 (364)
120 PF06433 Me-amine-dh_H: Methyl 36.3 1.8E+02 0.004 21.7 9.7 81 26-106 243-339 (342)
121 smart00564 PQQ beta-propeller 35.6 46 0.00099 14.8 2.1 13 26-38 19-31 (33)
122 cd00200 WD40 WD40 domain, foun 35.1 1.3E+02 0.0027 19.5 10.8 70 22-98 137-207 (289)
123 KOG0291|consensus 35.1 2.7E+02 0.0059 23.4 8.1 75 25-104 440-514 (893)
124 KOG2586|consensus 33.8 54 0.0012 22.7 2.8 32 1-41 59-92 (228)
125 PF15390 DUF4613: Domain of un 33.1 2.7E+02 0.0059 22.7 7.0 60 13-76 331-393 (671)
126 PHA02885 putative interleukin 31.2 1.3E+02 0.0029 18.6 4.1 34 25-58 53-86 (135)
127 COG0259 PdxH Pyridoxamine-phos 29.3 96 0.0021 21.5 3.5 10 32-41 73-82 (214)
128 smart00155 PLDc Phospholipase 28.1 57 0.0012 14.4 1.6 20 80-99 7-26 (28)
129 PF05131 Pep3_Vps18: Pep3/Vps1 25.8 1.9E+02 0.0041 18.6 5.8 47 5-57 57-105 (147)
130 PF08310 LGFP: LGFP repeat; I 25.2 1.1E+02 0.0025 15.8 3.1 17 72-88 16-32 (54)
131 PF15232 DUF4585: Domain of un 24.4 1.5E+02 0.0033 17.0 4.4 39 23-64 7-47 (75)
132 PF14956 DUF4505: Domain of un 24.4 59 0.0013 22.0 1.7 18 69-86 115-132 (183)
133 COG4993 Gcd Glucose dehydrogen 24.4 3E+02 0.0065 22.7 5.7 60 26-90 429-490 (773)
134 PF10042 DUF2278: Uncharacteri 24.0 87 0.0019 21.6 2.6 66 12-83 29-101 (206)
135 PF12216 m04gp34like: Immune e 23.9 82 0.0018 22.7 2.5 21 21-41 135-155 (272)
136 PF00622 SPRY: SPRY domain; I 23.4 1.3E+02 0.0027 17.7 3.1 20 22-41 68-87 (124)
137 COG3211 PhoX Predicted phospha 23.3 2.4E+02 0.0052 22.8 5.0 67 20-87 499-573 (616)
138 KOG0263|consensus 23.1 4.3E+02 0.0094 21.9 6.5 60 15-80 530-590 (707)
139 KOG0649|consensus 21.8 3.2E+02 0.007 19.8 6.7 38 20-62 114-152 (325)
140 PF08470 NTNH_C: Nontoxic nonh 21.6 2.2E+02 0.0049 18.7 3.9 42 46-87 99-140 (165)
141 PF09000 Cytotoxic: Cytotoxic; 20.4 2E+02 0.0044 17.0 4.8 30 73-102 34-63 (85)
142 TIGR03074 PQQ_membr_DH membran 20.3 5.1E+02 0.011 21.6 7.5 14 26-39 417-430 (764)
143 KOG0266|consensus 20.3 4E+02 0.0086 20.3 11.3 83 12-101 238-321 (456)
144 PF14280 DUF4365: Domain of un 20.0 2.3E+02 0.0049 17.4 4.1 25 68-92 82-107 (138)
No 1
>KOG1214|consensus
Probab=99.93 E-value=4.1e-26 Score=175.58 Aligned_cols=104 Identities=27% Similarity=0.435 Sum_probs=99.9
Q ss_pred eeecCCCCCccEEEe-cCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCe
Q psy959 3 STDKDLLKTHRSLLL-QAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEM 81 (109)
Q Consensus 3 ~~~~~l~G~~~~~li-~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~ 81 (109)
+..|.|||+.||+|+ ++|-+|++|++|+..|+|||+||.++ +++|++.+|||+++++|+.+++..|+||++|+.++.
T Consensus 1092 IevA~LdG~~rkvLf~tdLVNPR~iv~D~~rgnLYwtDWnRe--nPkIets~mDG~NrRilin~DigLPNGLtfdpfs~~ 1169 (1289)
T KOG1214|consen 1092 IEVALLDGSERKVLFYTDLVNPRAIVVDPIRGNLYWTDWNRE--NPKIETSSMDGENRRILINTDIGLPNGLTFDPFSKL 1169 (1289)
T ss_pred hheeecCCceeeEEEeecccCcceEEeecccCceeecccccc--CCcceeeccCCccceEEeecccCCCCCceeCcccce
Confidence 567899999999999 89999999999999999999999986 899999999999999999999999999999999999
Q ss_pred EEEEeCCCCeEEEEeCCCCCeEEEEeC
Q psy959 82 LYWTDAVREKIERSDLAGNNREALYLN 108 (109)
Q Consensus 82 lYw~D~~~~~I~~~~~dG~~~~~l~~~ 108 (109)
|.|+|.+.++++....+|..|++++++
T Consensus 1170 LCWvDAGt~rleC~~p~g~gRR~i~~~ 1196 (1289)
T KOG1214|consen 1170 LCWVDAGTKRLECTLPDGTGRRVIQNN 1196 (1289)
T ss_pred eeEEecCCcceeEecCCCCcchhhhhc
Confidence 999999999999999999999998864
No 2
>KOG1214|consensus
Probab=99.92 E-value=4.7e-25 Score=169.80 Aligned_cols=103 Identities=27% Similarity=0.356 Sum_probs=97.3
Q ss_pred eeecCCCCCccEEEe-cCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCe
Q psy959 3 STDKDLLKTHRSLLL-QAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEM 81 (109)
Q Consensus 3 ~~~~~l~G~~~~~li-~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~ 81 (109)
+.-+.|+|...++++ ++|.+|.|||||+..++|||+|.. ..+|++|.+||+.|++|+.+++.+|++|++|+.++.
T Consensus 1049 I~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~Rn~ywtDS~----lD~IevA~LdG~~rkvLf~tdLVNPR~iv~D~~rgn 1124 (1289)
T KOG1214|consen 1049 ISRASLEGAEPETIVNSGLISPEGIAVDHIRRNMYWTDSV----LDKIEVALLDGSERKVLFYTDLVNPRAIVVDPIRGN 1124 (1289)
T ss_pred cccccccCCCCceeecccCCCccceeeeeccceeeeeccc----cchhheeecCCceeeEEEeecccCcceEEeecccCc
Confidence 456789999999999 789999999999999999999998 799999999999999999999999999999999999
Q ss_pred EEEEeCC--CCeEEEEeCCCCCeEEEEeCC
Q psy959 82 LYWTDAV--REKIERSDLAGNNREALYLNK 109 (109)
Q Consensus 82 lYw~D~~--~~~I~~~~~dG~~~~~l~~~~ 109 (109)
|||+||. ..+|++++|||+||++|++++
T Consensus 1125 LYwtDWnRenPkIets~mDG~NrRilin~D 1154 (1289)
T KOG1214|consen 1125 LYWTDWNRENPKIETSSMDGENRRILINTD 1154 (1289)
T ss_pred eeeccccccCCcceeeccCCccceEEeecc
Confidence 9999996 679999999999999999863
No 3
>KOG1215|consensus
Probab=99.66 E-value=5.4e-16 Score=123.52 Aligned_cols=98 Identities=30% Similarity=0.595 Sum_probs=90.8
Q ss_pred eeecCCCCCccEEEe-cCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCe
Q psy959 3 STDKDLLKTHRSLLL-QAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEM 81 (109)
Q Consensus 3 ~~~~~l~G~~~~~li-~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~ 81 (109)
+..++++|+.|++++ ..+..|+++++||..|.+||+|++. .++|.|+.|||+.+.+++..++.+|+||++|...++
T Consensus 504 i~v~~~~g~~~~vl~~~~l~~~r~~~v~p~~g~~~wtd~~~---~~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~ 580 (877)
T KOG1215|consen 504 IEVADLDGSSRKVLVSKDLDLPRSIAVDPEKGLMFWTDWGQ---PPRIERASLDGSERAVLVTNGILWPNGLTIDYETDR 580 (877)
T ss_pred eEEEEccCCceeEEEecCCCCccceeeccccCeeEEecCCC---CchhhhhcCCCCCceEEEeCCccCCCcceEEeecce
Confidence 456678999999999 5679999999999999999999985 459999999999999999988999999999999999
Q ss_pred EEEEeCCCC-eEEEEeCCCCCeE
Q psy959 82 LYWTDAVRE-KIERSDLAGNNRE 103 (109)
Q Consensus 82 lYw~D~~~~-~I~~~~~dG~~~~ 103 (109)
+||+|.... .|++++++|.+|+
T Consensus 581 ~yw~d~~~~~~i~~~~~~g~~r~ 603 (877)
T KOG1215|consen 581 LYWADAKLDYTIESANMDGQNRR 603 (877)
T ss_pred eEEEcccCCcceeeeecCCCceE
Confidence 999999887 8999999999998
No 4
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=99.56 E-value=2.3e-14 Score=74.46 Aligned_cols=41 Identities=37% Similarity=0.855 Sum_probs=38.5
Q ss_pred CeEEEEecCCCCCCC-eEEEEEcCCCCcEEEEeCCCCCceeEEEec
Q psy959 33 RTLYYTDWGRFGTAG-KILRTTMAGSLKKVIIEKDLSQPSGLAIDF 77 (109)
Q Consensus 33 g~lywsd~~~~~~~~-~I~~~~~dG~~~~~l~~~~~~~p~glaiD~ 77 (109)
++|||+|+. .. .|++++|||+++++++..++..|.|||||+
T Consensus 1 ~~iYWtD~~----~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWS----QDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETT----TTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECC----CCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 689999998 45 999999999999999999999999999996
No 5
>KOG1215|consensus
Probab=99.43 E-value=1.1e-12 Score=104.71 Aligned_cols=100 Identities=27% Similarity=0.391 Sum_probs=89.7
Q ss_pred ecCCCCCccEEEe-cCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEE
Q psy959 5 DKDLLKTHRSLLL-QAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLY 83 (109)
Q Consensus 5 ~~~l~G~~~~~li-~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lY 83 (109)
.+..+|....-++ .++..|.++|+|+..+++||+|.. ...|.++.++|+.+.+++...+..|+.+++|+..+.+|
T Consensus 463 ~~~~~~~~~~~~~~~g~~~~~~lavD~~~~~~y~tDe~----~~~i~v~~~~g~~~~vl~~~~l~~~r~~~v~p~~g~~~ 538 (877)
T KOG1215|consen 463 RASQDGSSECELCGDGLCIPEGLAVDWIGDNIYWTDEG----NCLIEVADLDGSSRKVLVSKDLDLPRSIAVDPEKGLMF 538 (877)
T ss_pred eeccCCCccceEeccCccccCcEEEEeccCCceecccC----CceeEEEEccCCceeEEEecCCCCccceeeccccCeeE
Confidence 4556677555555 789999999999999999999998 69999999999999999998889999999999999999
Q ss_pred EEeCC-CCeEEEEeCCCCCeEEEEeC
Q psy959 84 WTDAV-REKIERSDLAGNNREALYLN 108 (109)
Q Consensus 84 w~D~~-~~~I~~~~~dG~~~~~l~~~ 108 (109)
|+|++ ..+|+++.+||+.+.+++..
T Consensus 539 wtd~~~~~~i~ra~~dg~~~~~l~~~ 564 (877)
T KOG1215|consen 539 WTDWGQPPRIERASLDGSERAVLVTN 564 (877)
T ss_pred EecCCCCchhhhhcCCCCCceEEEeC
Confidence 99998 56899999999999999865
No 6
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=99.31 E-value=1.2e-11 Score=63.69 Aligned_cols=42 Identities=40% Similarity=0.756 Sum_probs=38.1
Q ss_pred EEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCCCCe
Q psy959 61 VIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAGNNR 102 (109)
Q Consensus 61 ~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG~~~ 102 (109)
+++...+..|+|||+|+.+++|||+|.....|++++++|+++
T Consensus 2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~g~~~ 43 (43)
T smart00135 2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43 (43)
T ss_pred EEEECCCCCcCEEEEeecCCEEEEEeCCCCEEEEEeCCCCCC
Confidence 355668999999999999999999999999999999999864
No 7
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=99.16 E-value=2e-10 Score=58.93 Aligned_cols=40 Identities=23% Similarity=0.370 Sum_probs=36.1
Q ss_pred EecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCc
Q psy959 16 LLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLK 59 (109)
Q Consensus 16 li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~ 59 (109)
+..++..|.++|+||..++|||+|.. ...|++++++|+.+
T Consensus 4 ~~~~~~~~~~la~d~~~~~lYw~D~~----~~~I~~~~~~g~~~ 43 (43)
T smart00135 4 LSEGLGHPNGLAVDWIEGRLYWTDWG----LDVIEVANLDGTNR 43 (43)
T ss_pred EECCCCCcCEEEEeecCCEEEEEeCC----CCEEEEEeCCCCCC
Confidence 34789999999999999999999998 69999999999863
No 8
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.96 E-value=1.4e-08 Score=82.90 Aligned_cols=87 Identities=20% Similarity=0.313 Sum_probs=71.7
Q ss_pred EEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeC-------------CCCCceeEEEecCCCe
Q psy959 15 LLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEK-------------DLSQPSGLAIDFDDEM 81 (109)
Q Consensus 15 ~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~-------------~~~~p~glaiD~~~~~ 81 (109)
++.+.+..|.++|+|+.+|.||++|.. +.+|.+.+.+|.....+... .+..|.||++|+.++.
T Consensus 562 ~~~s~l~~P~gvavd~~~g~lyVaDs~----n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~ 637 (1057)
T PLN02919 562 LLTSPLKFPGKLAIDLLNNRLFISDSN----HNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNL 637 (1057)
T ss_pred cccccCCCCceEEEECCCCeEEEEECC----CCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCE
Confidence 444778999999999989999999998 79999999998765444321 2568999999999999
Q ss_pred EEEEeCCCCeEEEEeCCCCCeEEE
Q psy959 82 LYWTDAVREKIERSDLAGNNREAL 105 (109)
Q Consensus 82 lYw~D~~~~~I~~~~~dG~~~~~l 105 (109)
||++|...++|.+.++.+...+++
T Consensus 638 LYVaDt~n~~Ir~id~~~~~V~tl 661 (1057)
T PLN02919 638 LYVADTENHALREIDFVNETVRTL 661 (1057)
T ss_pred EEEEeCCCceEEEEecCCCEEEEE
Confidence 999999999999998876655544
No 9
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.95 E-value=3.3e-09 Score=54.89 Aligned_cols=30 Identities=37% Similarity=0.750 Sum_probs=27.7
Q ss_pred CeEEEEeCCCC-eEEEEeCCCCCeEEEEeCC
Q psy959 80 EMLYWTDAVRE-KIERSDLAGNNREALYLNK 109 (109)
Q Consensus 80 ~~lYw~D~~~~-~I~~~~~dG~~~~~l~~~~ 109 (109)
++|||+|.... +|+++++||++++++++++
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~ 31 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVISDD 31 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEESS
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEECC
Confidence 58999999999 9999999999999999764
No 10
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.93 E-value=4e-08 Score=67.85 Aligned_cols=88 Identities=23% Similarity=0.220 Sum_probs=69.7
Q ss_pred cEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC-----cEEEEeCCC--CCceeEEEecCCCeEEEE
Q psy959 13 RSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL-----KKVIIEKDL--SQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 13 ~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~-----~~~l~~~~~--~~p~glaiD~~~~~lYw~ 85 (109)
.+.+.+++..|+||+++|..+.||++|.. ..+|.+..++... ++++..... ..|.||++|. .++||.+
T Consensus 126 ~~~~~~~~~~pNGi~~s~dg~~lyv~ds~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~-~G~l~va 200 (246)
T PF08450_consen 126 VTVVADGLGFPNGIAFSPDGKTLYVADSF----NGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDS-DGNLWVA 200 (246)
T ss_dssp EEEEEEEESSEEEEEEETTSSEEEEEETT----TTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBT-TS-EEEE
T ss_pred EEEEecCcccccceEECCcchheeecccc----cceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcC-CCCEEEE
Confidence 44556789999999999988899999998 6899999996332 455554332 3599999996 6789999
Q ss_pred eCCCCeEEEEeCCCCCeEEE
Q psy959 86 DAVREKIERSDLAGNNREAL 105 (109)
Q Consensus 86 D~~~~~I~~~~~dG~~~~~l 105 (109)
++..++|.+.+.+|+-...+
T Consensus 201 ~~~~~~I~~~~p~G~~~~~i 220 (246)
T PF08450_consen 201 DWGGGRIVVFDPDGKLLREI 220 (246)
T ss_dssp EETTTEEEEEETTSCEEEEE
T ss_pred EcCCCEEEEECCCccEEEEE
Confidence 99999999999999866554
No 11
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.88 E-value=5.2e-08 Score=67.29 Aligned_cols=78 Identities=28% Similarity=0.390 Sum_probs=61.4
Q ss_pred CCCCcceEEeeCCCCeEEEEecCCCCC--C--CeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEE
Q psy959 19 AIPGRRHFMPPIRLRTLYYTDWGRFGT--A--GKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIER 94 (109)
Q Consensus 19 ~l~~p~~iavd~~~g~lywsd~~~~~~--~--~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~ 94 (109)
.+..|+.+++|+ .|.+|+++...... . +.|++...+|+. ..+ ...+..|+||++++..+.||++|...++|.+
T Consensus 84 ~~~~~ND~~vd~-~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~~-~~~-~~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~ 160 (246)
T PF08450_consen 84 PFNRPNDVAVDP-DGNLYVTDSGGGGASGIDPGSVYRIDPDGKV-TVV-ADGLGFPNGIAFSPDGKTLYVADSFNGRIWR 160 (246)
T ss_dssp CTEEEEEEEE-T-TS-EEEEEECCBCTTCGGSEEEEEEETTSEE-EEE-EEEESSEEEEEEETTSSEEEEEETTTTEEEE
T ss_pred ccCCCceEEEcC-CCCEEEEecCCCccccccccceEEECCCCeE-EEE-ecCcccccceEECCcchheeecccccceeEE
Confidence 467889999999 79999999764221 1 579999999443 333 3458899999999999999999999999999
Q ss_pred EeCCC
Q psy959 95 SDLAG 99 (109)
Q Consensus 95 ~~~dG 99 (109)
++++.
T Consensus 161 ~~~~~ 165 (246)
T PF08450_consen 161 FDLDA 165 (246)
T ss_dssp EEEET
T ss_pred Eeccc
Confidence 99974
No 12
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=98.83 E-value=5.3e-08 Score=58.16 Aligned_cols=76 Identities=24% Similarity=0.336 Sum_probs=58.6
Q ss_pred ceEEeeCCCCeEEEEecCCC-------------CCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRF-------------GTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVRE 90 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~-------------~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~ 90 (109)
.+++|++.+|.+||||...+ ...+++.+.++.....+++.. ++..|+|+++.+....|+.++....
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~-~L~fpNGVals~d~~~vlv~Et~~~ 79 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLD-GLYFPNGVALSPDESFVLVAETGRY 79 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEE-EESSEEEEEE-TTSSEEEEEEGGGT
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehh-CCCccCeEEEcCCCCEEEEEeccCc
Confidence 37899997799999997532 246899999988877777765 6999999999999999999999999
Q ss_pred eEEEEeCCCC
Q psy959 91 KIERSDLAGN 100 (109)
Q Consensus 91 ~I~~~~~dG~ 100 (109)
+|.+.-+.|.
T Consensus 80 Ri~rywl~Gp 89 (89)
T PF03088_consen 80 RILRYWLKGP 89 (89)
T ss_dssp EEEEEESSST
T ss_pred eEEEEEEeCC
Confidence 9999988873
No 13
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.67 E-value=5e-07 Score=74.03 Aligned_cols=78 Identities=12% Similarity=0.127 Sum_probs=63.7
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeC----------------CCCCceeEEEecCCCeEE
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEK----------------DLSQPSGLAIDFDDEMLY 83 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~----------------~~~~p~glaiD~~~~~lY 83 (109)
+..|.||++|+..+.||++|.. +.+|.+.++.+...+++... .+..|.++++|+.++.||
T Consensus 623 f~~P~GIavd~~gn~LYVaDt~----n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~Ly 698 (1057)
T PLN02919 623 FNRPQGLAYNAKKNLLYVADTE----NHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVY 698 (1057)
T ss_pred cCCCcEEEEeCCCCEEEEEeCC----CceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEE
Confidence 5679999999977889999987 68999998876665555321 156899999999999999
Q ss_pred EEeCCCCeEEEEeCCCCC
Q psy959 84 WTDAVREKIERSDLAGNN 101 (109)
Q Consensus 84 w~D~~~~~I~~~~~dG~~ 101 (109)
++|.+.++|...+..+..
T Consensus 699 Vad~~~~~I~v~d~~~g~ 716 (1057)
T PLN02919 699 IAMAGQHQIWEYNISDGV 716 (1057)
T ss_pred EEECCCCeEEEEECCCCe
Confidence 999999999888875443
No 14
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.42 E-value=5.3e-06 Score=59.80 Aligned_cols=91 Identities=21% Similarity=0.241 Sum_probs=64.2
Q ss_pred CCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC---CCC--cE--EEEeCCCCCceeEEEecCCCeE
Q psy959 10 KTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA---GSL--KK--VIIEKDLSQPSGLAIDFDDEML 82 (109)
Q Consensus 10 G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d---G~~--~~--~l~~~~~~~p~glaiD~~~~~l 82 (109)
|+-.+.+...+..|+|||..|....||++|.. ..+|.+..+| |.. ++ +.....-..|-|+++| ..+.|
T Consensus 152 g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~----~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vD-adG~l 226 (307)
T COG3386 152 GGVVRLLDDDLTIPNGLAFSPDGKTLYVADTP----ANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVD-ADGNL 226 (307)
T ss_pred CCEEEeecCcEEecCceEECCCCCEEEEEeCC----CCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEe-CCCCE
Confidence 33343334559999999999988899999998 6999999987 322 22 2232334689999999 34444
Q ss_pred E-EEeCCCCeEEEEeCCCCCeEEE
Q psy959 83 Y-WTDAVREKIERSDLAGNNREAL 105 (109)
Q Consensus 83 Y-w~D~~~~~I~~~~~dG~~~~~l 105 (109)
| ++-++..+|.+.+.+|.....+
T Consensus 227 w~~a~~~g~~v~~~~pdG~l~~~i 250 (307)
T COG3386 227 WVAAVWGGGRVVRFNPDGKLLGEI 250 (307)
T ss_pred EEecccCCceEEEECCCCcEEEEE
Confidence 4 3333445999999998876654
No 15
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.30 E-value=7.8e-06 Score=58.96 Aligned_cols=77 Identities=25% Similarity=0.440 Sum_probs=64.1
Q ss_pred CCCcceEEeeCCCCeEEEEecC-----CC--CCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeE
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWG-----RF--GTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKI 92 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~-----~~--~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I 92 (109)
+..|+.+.+|| .|.+|+++.. .. ...+.++|.+.+|...+. +...+..|+||+++|.++.||++|+...+|
T Consensus 110 ~~r~ND~~v~p-dG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~~l-~~~~~~~~NGla~SpDg~tly~aDT~~~~i 187 (307)
T COG3386 110 LNRPNDGVVDP-DGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRL-LDDDLTIPNGLAFSPDGKTLYVADTPANRI 187 (307)
T ss_pred cCCCCceeEcC-CCCEEEeCCCccccCccccCCcceEEEEcCCCCEEEe-ecCcEEecCceEECCCCCEEEEEeCCCCeE
Confidence 57899999999 7999999987 22 235689998886555544 445589999999999999999999999999
Q ss_pred EEEeCC
Q psy959 93 ERSDLA 98 (109)
Q Consensus 93 ~~~~~d 98 (109)
.++.++
T Consensus 188 ~r~~~d 193 (307)
T COG3386 188 HRYDLD 193 (307)
T ss_pred EEEecC
Confidence 999998
No 16
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=97.95 E-value=0.00029 Score=51.75 Aligned_cols=93 Identities=15% Similarity=0.128 Sum_probs=64.3
Q ss_pred ecCCCCCc-c-EEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEE-EcCCC-----CcEEEEeCC-------CCC
Q psy959 5 DKDLLKTH-R-SLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRT-TMAGS-----LKKVIIEKD-------LSQ 69 (109)
Q Consensus 5 ~~~l~G~~-~-~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~-~~dG~-----~~~~l~~~~-------~~~ 69 (109)
..+-||+. . +++.+++..|.||++++ .| ||.++. ..|.+. +.+|. .+++|+..- .+.
T Consensus 54 d~dgdG~~d~~~vfa~~l~~p~Gi~~~~-~G-lyV~~~------~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~ 125 (367)
T TIGR02604 54 DADGDGKYDKSNVFAEELSMVTGLAVAV-GG-VYVATP------PDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHS 125 (367)
T ss_pred cCCCCCCcceeEEeecCCCCccceeEec-CC-EEEeCC------CeEEEEeCCCCCCCCCCccEEEEEccCCCCCccccc
Confidence 34446663 2 44458899999999987 67 999863 568777 34442 445555421 244
Q ss_pred ceeEEEecCCCeEEEEeCC-------------------CCeEEEEeCCCCCeEEEE
Q psy959 70 PSGLAIDFDDEMLYWTDAV-------------------REKIERSDLAGNNREALY 106 (109)
Q Consensus 70 p~glaiD~~~~~lYw~D~~-------------------~~~I~~~~~dG~~~~~l~ 106 (109)
++++++++ .++||+++.. .+.|.+++.||+..+++.
T Consensus 126 ~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a 180 (367)
T TIGR02604 126 LNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVA 180 (367)
T ss_pred ccCceECC-CCCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEe
Confidence 88999997 5699998872 157999999998776653
No 17
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=97.94 E-value=9.2e-05 Score=54.36 Aligned_cols=79 Identities=18% Similarity=0.267 Sum_probs=58.4
Q ss_pred ccEEEecCCC--------CcceEEeeCCCCeEEEEecCCC---------------CCCCeEEEEEcCCCCcEEEEeCCCC
Q psy959 12 HRSLLLQAIP--------GRRHFMPPIRLRTLYYTDWGRF---------------GTAGKILRTTMAGSLKKVIIEKDLS 68 (109)
Q Consensus 12 ~~~~li~~l~--------~p~~iavd~~~g~lywsd~~~~---------------~~~~~I~~~~~dG~~~~~l~~~~~~ 68 (109)
.+++|++++. .|.+++++| .|+||+++.... ...+.|.|++.||+..+++ ..++.
T Consensus 107 ~~~~l~~~~~~~~~~~~~~~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~-a~G~r 184 (367)
T TIGR02604 107 EREVLLSGFGGQINNHHHSLNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVV-AHGFQ 184 (367)
T ss_pred ccEEEEEccCCCCCcccccccCceECC-CCCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEE-ecCcC
Confidence 4566766542 277899999 899999887310 0126799999999887765 46799
Q ss_pred CceeEEEecCCCeEEEEeCCCCeEE
Q psy959 69 QPSGLAIDFDDEMLYWTDAVREKIE 93 (109)
Q Consensus 69 ~p~glaiD~~~~~lYw~D~~~~~I~ 93 (109)
.|.||++|+ .+.||.+|.......
T Consensus 185 np~Gl~~d~-~G~l~~tdn~~~~~~ 208 (367)
T TIGR02604 185 NPYGHSVDS-WGDVFFCDNDDPPLC 208 (367)
T ss_pred CCccceECC-CCCEEEEccCCCcee
Confidence 999999997 678899998544333
No 18
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=97.93 E-value=4.1e-05 Score=35.95 Aligned_cols=27 Identities=26% Similarity=0.577 Sum_probs=24.2
Q ss_pred CCCceeEEEecCCCeEEEEeCCCCeEEE
Q psy959 67 LSQPSGLAIDFDDEMLYWTDAVREKIER 94 (109)
Q Consensus 67 ~~~p~glaiD~~~~~lYw~D~~~~~I~~ 94 (109)
+..|.|+++| .++.||.+|.+.++|..
T Consensus 1 f~~P~gvav~-~~g~i~VaD~~n~rV~v 27 (28)
T PF01436_consen 1 FNYPHGVAVD-SDGNIYVADSGNHRVQV 27 (28)
T ss_dssp BSSEEEEEEE-TTSEEEEEECCCTEEEE
T ss_pred CcCCcEEEEe-CCCCEEEEECCCCEEEE
Confidence 3589999999 89999999999999875
No 19
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=97.89 E-value=4.5e-05 Score=35.80 Aligned_cols=27 Identities=11% Similarity=0.047 Sum_probs=23.2
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEE
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILR 51 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~ 51 (109)
|..|.|||+|+ +|.||.+|.+ +.+|.+
T Consensus 1 f~~P~gvav~~-~g~i~VaD~~----n~rV~v 27 (28)
T PF01436_consen 1 FNYPHGVAVDS-DGNIYVADSG----NHRVQV 27 (28)
T ss_dssp BSSEEEEEEET-TSEEEEEECC----CTEEEE
T ss_pred CcCCcEEEEeC-CCCEEEEECC----CCEEEE
Confidence 46899999995 9999999987 688765
No 20
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=97.82 E-value=0.00017 Score=50.59 Aligned_cols=63 Identities=21% Similarity=0.224 Sum_probs=42.0
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeC--------CCCCceeEEEecCCCeEEEEeC
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEK--------DLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~--------~~~~p~glaiD~~~~~lYw~D~ 87 (109)
+..|.++++||.+|+||.-... .+.+...+.+|.....+.-. .+.+|.|||+|+ +++||.+..
T Consensus 170 ~~d~S~l~~~p~t~~lliLS~e----s~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~-~G~LYIvsE 240 (248)
T PF06977_consen 170 VRDLSGLSYDPRTGHLLILSDE----SRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDP-DGNLYIVSE 240 (248)
T ss_dssp SS---EEEEETTTTEEEEEETT----TTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-T-T--EEEEET
T ss_pred eccccceEEcCCCCeEEEEECC----CCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECC-CCCEEEEcC
Confidence 5679999999999999999776 68999999999876555332 357899999996 679999876
No 21
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.80 E-value=0.0004 Score=50.51 Aligned_cols=74 Identities=15% Similarity=0.163 Sum_probs=57.7
Q ss_pred CcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC-cEEE---EeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeC
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL-KKVI---IEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDL 97 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~-~~~l---~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~ 97 (109)
.|.+|++.|..++||.++.+ ...|....+|+.. .... +......|+++++++..++||.+....+.|...++
T Consensus 246 ~~~~i~ispdg~~lyvsnr~----~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~ 321 (345)
T PF10282_consen 246 APAEIAISPDGRFLYVSNRG----SNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDI 321 (345)
T ss_dssp SEEEEEE-TTSSEEEEEECT----TTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEE
T ss_pred CceeEEEecCCCEEEEEecc----CCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEE
Confidence 68899999988899999988 6889999986442 2222 22345679999999999999999999999998887
Q ss_pred CC
Q psy959 98 AG 99 (109)
Q Consensus 98 dG 99 (109)
|.
T Consensus 322 d~ 323 (345)
T PF10282_consen 322 DP 323 (345)
T ss_dssp ET
T ss_pred eC
Confidence 64
No 22
>KOG4659|consensus
Probab=97.70 E-value=0.0002 Score=59.58 Aligned_cols=75 Identities=21% Similarity=0.232 Sum_probs=55.9
Q ss_pred CCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEe------------------CCCCCceeEEEecCCC
Q psy959 19 AIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIE------------------KDLSQPSGLAIDFDDE 80 (109)
Q Consensus 19 ~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~------------------~~~~~p~glaiD~~~~ 80 (109)
.|..|+|||+|. .|+|||+|. ..|...+.+|--...+-+ -.+.||..|||||..+
T Consensus 473 ~L~~PkGIa~dk-~g~lYfaD~------t~IR~iD~~giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdn 545 (1899)
T KOG4659|consen 473 QLIFPKGIAFDK-MGNLYFADG------TRIRVIDTTGIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMDN 545 (1899)
T ss_pred eeccCCceeEcc-CCcEEEecc------cEEEEeccCceEEEeccCCCCccCccccccccchhheeeecccceeecCCCC
Confidence 378999999998 999999995 467776666644333221 1367999999999999
Q ss_pred eEEEEeCCCCeEEEEeCCCCCe
Q psy959 81 MLYWTDAVREKIERSDLAGNNR 102 (109)
Q Consensus 81 ~lYw~D~~~~~I~~~~~dG~~~ 102 (109)
.||..|. +.|.++.-++.-+
T Consensus 546 sl~Vld~--nvvlrit~~~rV~ 565 (1899)
T KOG4659|consen 546 SLLVLDT--NVVLRITVVHRVR 565 (1899)
T ss_pred eEEEeec--ceEEEEccCccEE
Confidence 9999996 5566666665433
No 23
>KOG1520|consensus
Probab=97.62 E-value=0.00012 Score=53.88 Aligned_cols=81 Identities=21% Similarity=0.326 Sum_probs=61.7
Q ss_pred CCCcceEEeeCCCCeEEEEecCCC------------C-CCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEe
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRF------------G-TAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTD 86 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~------------~-~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D 86 (109)
+....++.||+ +|.+||||.... + ..+++.+.+..-+..++|. +++.-|+|+++.+....+-.|+
T Consensus 160 ~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~~tK~~~VLl-d~L~F~NGlaLS~d~sfvl~~E 237 (376)
T KOG1520|consen 160 FKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDPSTKVTKVLL-DGLYFPNGLALSPDGSFVLVAE 237 (376)
T ss_pred eeecCceeEcC-CCeEEEeccccccchhheEEeeecCCCccceEEecCcccchhhhh-hcccccccccCCCCCCEEEEEe
Confidence 34567899999 999999997642 1 2344444444333444444 4799999999999999999999
Q ss_pred CCCCeEEEEeCCCCCe
Q psy959 87 AVREKIERSDLAGNNR 102 (109)
Q Consensus 87 ~~~~~I~~~~~dG~~~ 102 (109)
....+|.+.-+.|.+.
T Consensus 238 t~~~ri~rywi~g~k~ 253 (376)
T KOG1520|consen 238 TTTARIKRYWIKGPKA 253 (376)
T ss_pred eccceeeeeEecCCcc
Confidence 9999999999998876
No 24
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.46 E-value=0.0079 Score=43.07 Aligned_cols=77 Identities=12% Similarity=0.138 Sum_probs=59.0
Q ss_pred cCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCc------EEEEeCCCCCceeEEEecCCCeEEEEeCCCCe
Q psy959 18 QAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLK------KVIIEKDLSQPSGLAIDFDDEMLYWTDAVREK 91 (109)
Q Consensus 18 ~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~------~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~ 91 (109)
.....|.+++++|..+++|.++.+ ...|...+++.... ..+....-..|+++++++..+++|.++...+.
T Consensus 123 ~~~~~~~~~~~~p~g~~l~v~~~~----~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~ 198 (330)
T PRK11028 123 EGLEGCHSANIDPDNRTLWVPCLK----EDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSS 198 (330)
T ss_pred cCCCcccEeEeCCCCCEEEEeeCC----CCEEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCE
Confidence 345678999999988899999987 68899998864221 11111224579999999999999999998888
Q ss_pred EEEEeCC
Q psy959 92 IERSDLA 98 (109)
Q Consensus 92 I~~~~~d 98 (109)
|...+++
T Consensus 199 v~v~~~~ 205 (330)
T PRK11028 199 VDVWQLK 205 (330)
T ss_pred EEEEEEe
Confidence 8888876
No 25
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.44 E-value=0.0065 Score=44.21 Aligned_cols=75 Identities=19% Similarity=0.187 Sum_probs=59.0
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC--c---EEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEE
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL--K---KVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIER 94 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~--~---~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~ 94 (109)
-.+|+++.++|..+++|.+|.+ ..+|....++... - ..+....-.-|+.|++++..+++|.+......|..
T Consensus 143 ~~h~H~v~~~pdg~~v~v~dlG----~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v 218 (345)
T PF10282_consen 143 GPHPHQVVFSPDGRFVYVPDLG----ADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSV 218 (345)
T ss_dssp STCEEEEEE-TTSSEEEEEETT----TTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEE
T ss_pred cccceeEEECCCCCEEEEEecC----CCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEE
Confidence 4678899999988999999998 6899999997554 1 22222445679999999999999999999999998
Q ss_pred EeCC
Q psy959 95 SDLA 98 (109)
Q Consensus 95 ~~~d 98 (109)
.+++
T Consensus 219 ~~~~ 222 (345)
T PF10282_consen 219 FDYD 222 (345)
T ss_dssp EEEE
T ss_pred Eeec
Confidence 8888
No 26
>KOG1520|consensus
Probab=97.35 E-value=0.00071 Score=49.87 Aligned_cols=63 Identities=19% Similarity=0.215 Sum_probs=52.2
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeC----CCCCceeEEEecCCCeEEEEeCC
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEK----DLSQPSGLAIDFDDEMLYWTDAV 88 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~----~~~~p~glaiD~~~~~lYw~D~~ 88 (109)
...|-||+.+..+|.||.+|.. -.+...+..|...+.+... .+...+++.||+ ++.+||+|+.
T Consensus 114 CGRPLGl~f~~~ggdL~VaDAY-----lGL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSS 180 (376)
T KOG1520|consen 114 CGRPLGIRFDKKGGDLYVADAY-----LGLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSS 180 (376)
T ss_pred cCCcceEEeccCCCeEEEEecc-----eeeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccc
Confidence 4789999999999999999986 6788888888776665543 245678899998 9999999985
No 27
>KOG4499|consensus
Probab=97.30 E-value=0.005 Score=43.26 Aligned_cols=85 Identities=18% Similarity=0.195 Sum_probs=65.4
Q ss_pred CCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC-----CCCcEEEEeC------CCCCceeEEEec
Q psy959 9 LKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA-----GSLKKVIIEK------DLSQPSGLAIDF 77 (109)
Q Consensus 9 ~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d-----G~~~~~l~~~------~~~~p~glaiD~ 77 (109)
.|.+.+.+...+.-|+||+-|.....+|++|.. +-.|...+.| -++|++|+.- .-..|-|++||-
T Consensus 146 ~~h~v~~i~~~v~IsNgl~Wd~d~K~fY~iDsl----n~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~ 221 (310)
T KOG4499|consen 146 AGHQVELIWNCVGISNGLAWDSDAKKFYYIDSL----NYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDT 221 (310)
T ss_pred cCCCceeeehhccCCccccccccCcEEEEEccC----ceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEcc
Confidence 345556666778889999999999999999987 5777555533 2568887751 244699999995
Q ss_pred CCCeEEEEeCCCCeEEEEeCC
Q psy959 78 DDEMLYWTDAVREKIERSDLA 98 (109)
Q Consensus 78 ~~~~lYw~D~~~~~I~~~~~d 98 (109)
.++||.+-+..++|.+++..
T Consensus 222 -eG~L~Va~~ng~~V~~~dp~ 241 (310)
T KOG4499|consen 222 -EGNLYVATFNGGTVQKVDPT 241 (310)
T ss_pred -CCcEEEEEecCcEEEEECCC
Confidence 88999999988888888765
No 28
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=97.29 E-value=0.0025 Score=45.55 Aligned_cols=64 Identities=22% Similarity=0.195 Sum_probs=47.7
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCC----CCcEEEEe-CC-CCCceeEEEec-CCCeEEEEeCCC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAG----SLKKVIIE-KD-LSQPSGLAIDF-DDEMLYWTDAVR 89 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG----~~~~~l~~-~~-~~~p~glaiD~-~~~~lYw~D~~~ 89 (109)
....|+++|+ +|.||+++.. ...|.+.+.++ .+.+++.. .. +.+|.+++++. ..+.||......
T Consensus 186 ~~s~g~~~D~-~G~ly~~~~~----~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snrl 256 (287)
T PF03022_consen 186 SQSDGMAIDP-NGNLYFTDVE----QNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNRL 256 (287)
T ss_dssp -SECEEEEET-TTEEEEEECC----CTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S-
T ss_pred CCCceEEECC-CCcEEEecCC----CCeEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEEECcc
Confidence 4557899999 9999999998 69999999998 34555555 33 89999999995 357888887643
No 29
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=97.29 E-value=0.0064 Score=44.99 Aligned_cols=79 Identities=19% Similarity=0.218 Sum_probs=59.4
Q ss_pred CcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCCCC
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAGNN 101 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG~~ 101 (109)
.|.++++++..+++|.++... .+..|.+.+-.......-.. .-..|.++++|+...++|-++.....|...+..+..
T Consensus 117 ~P~~~~~~~~~~~vYV~n~~~--~~~~vsvid~~t~~~~~~~~-vG~~P~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~ 193 (381)
T COG3391 117 GPVGLAVDPDGKYVYVANAGN--GNNTVSVIDAATNKVTATIP-VGNTPTGVAVDPDGNKVYVTNSDDNTVSVIDTSGNS 193 (381)
T ss_pred CCceEEECCCCCEEEEEeccc--CCceEEEEeCCCCeEEEEEe-cCCCcceEEECCCCCeEEEEecCCCeEEEEeCCCcc
Confidence 899999999999999999851 15788777765333222222 123689999999999999999888888888876654
Q ss_pred eE
Q psy959 102 RE 103 (109)
Q Consensus 102 ~~ 103 (109)
..
T Consensus 194 v~ 195 (381)
T COG3391 194 VV 195 (381)
T ss_pred ee
Confidence 43
No 30
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=97.27 E-value=0.0027 Score=46.19 Aligned_cols=75 Identities=20% Similarity=0.205 Sum_probs=53.8
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeC------CCCCceeEEEecC---CCeEEEEeCCC-
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEK------DLSQPSGLAIDFD---DEMLYWTDAVR- 89 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~------~~~~p~glaiD~~---~~~lYw~D~~~- 89 (109)
|+.|++|++.| .|.||+++.. ++|.+...+|.....+... ...-+.|||+|+. ++.||.+-...
T Consensus 1 L~~P~~~a~~p-dG~l~v~e~~-----G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t~~~ 74 (331)
T PF07995_consen 1 LNNPRSMAFLP-DGRLLVAERS-----GRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYTNAD 74 (331)
T ss_dssp ESSEEEEEEET-TSCEEEEETT-----TEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEEEE-
T ss_pred CCCceEEEEeC-CCcEEEEeCC-----ceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEEccc
Confidence 57899999999 6999999873 8999999888874443331 2345788999993 67888776632
Q ss_pred -------CeEEEEeCCCC
Q psy959 90 -------EKIERSDLAGN 100 (109)
Q Consensus 90 -------~~I~~~~~dG~ 100 (109)
.+|.+..++..
T Consensus 75 ~~~~~~~~~v~r~~~~~~ 92 (331)
T PF07995_consen 75 EDGGDNDNRVVRFTLSDG 92 (331)
T ss_dssp TSSSSEEEEEEEEEEETT
T ss_pred CCCCCcceeeEEEeccCC
Confidence 47888877654
No 31
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=97.22 E-value=0.0015 Score=38.75 Aligned_cols=42 Identities=10% Similarity=-0.088 Sum_probs=35.5
Q ss_pred CCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEE
Q psy959 8 LLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTT 53 (109)
Q Consensus 8 l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~ 53 (109)
-+|+..+++.+++..|.||+++|...+||.++.. ...|.+..
T Consensus 41 yd~~~~~~va~g~~~aNGI~~s~~~k~lyVa~~~----~~~I~vy~ 82 (86)
T PF01731_consen 41 YDGKEVKVVASGFSFANGIAISPDKKYLYVASSL----AHSIHVYK 82 (86)
T ss_pred EeCCEeEEeeccCCCCceEEEcCCCCEEEEEecc----CCeEEEEE
Confidence 4677778888999999999999999999999987 46776654
No 32
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=97.19 E-value=0.019 Score=43.68 Aligned_cols=82 Identities=23% Similarity=0.271 Sum_probs=57.9
Q ss_pred cEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEE------e-CCCCCceeEEEecC------C
Q psy959 13 RSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVII------E-KDLSQPSGLAIDFD------D 79 (109)
Q Consensus 13 ~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~------~-~~~~~p~glaiD~~------~ 79 (109)
.+++.++|..|++|++.| .|.||.+... .++|.+...++.....+. . ....-+.|||+||. +
T Consensus 22 ~~~va~GL~~Pw~maflP-DG~llVtER~----~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n 96 (454)
T TIGR03606 22 KKVLLSGLNKPWALLWGP-DNQLWVTERA----TGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGN 96 (454)
T ss_pred EEEEECCCCCceEEEEcC-CCeEEEEEec----CCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCC
Confidence 345568999999999999 7899999874 478988876544333221 1 12345789999855 4
Q ss_pred CeEEEEeC---------CCCeEEEEeCCC
Q psy959 80 EMLYWTDA---------VREKIERSDLAG 99 (109)
Q Consensus 80 ~~lYw~D~---------~~~~I~~~~~dG 99 (109)
+.||++-. ...+|.+..++.
T Consensus 97 ~~lYvsyt~~~~~~~~~~~~~I~R~~l~~ 125 (454)
T TIGR03606 97 PYVYISYTYKNGDKELPNHTKIVRYTYDK 125 (454)
T ss_pred cEEEEEEeccCCCCCccCCcEEEEEEecC
Confidence 67888842 246899988863
No 33
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.19 E-value=0.0068 Score=43.41 Aligned_cols=75 Identities=17% Similarity=0.160 Sum_probs=57.2
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC--CCCcEEEEe-CCCCCceeEEEecCCCeEEEEeCCCCeEEEEeC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA--GSLKKVIIE-KDLSQPSGLAIDFDDEMLYWTDAVREKIERSDL 97 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d--G~~~~~l~~-~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~ 97 (109)
..|.+++++|..+.+|.+... ...|...+++ |...+.+.. .....|.++++++.++++|.++.+.++|...++
T Consensus 80 ~~p~~i~~~~~g~~l~v~~~~----~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~ 155 (330)
T PRK11028 80 GSPTHISTDHQGRFLFSASYN----ANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTL 155 (330)
T ss_pred CCceEEEECCCCCEEEEEEcC----CCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEE
Confidence 578999999988889988765 4777777664 433222221 234679999999999999999999999998888
Q ss_pred CC
Q psy959 98 AG 99 (109)
Q Consensus 98 dG 99 (109)
+.
T Consensus 156 ~~ 157 (330)
T PRK11028 156 SD 157 (330)
T ss_pred CC
Confidence 64
No 34
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=97.19 E-value=0.0016 Score=47.31 Aligned_cols=69 Identities=26% Similarity=0.218 Sum_probs=49.6
Q ss_pred CcceEEeeCCCCeEEEEecCCC---------CCCCeEEEEEcCCCC------------cEEEEeCCCCCceeEEEecCCC
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRF---------GTAGKILRTTMAGSL------------KKVIIEKDLSQPSGLAIDFDDE 80 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~---------~~~~~I~~~~~dG~~------------~~~l~~~~~~~p~glaiD~~~~ 80 (109)
...+|+++| .|+||++-.... ...++|.|.+.||+. ...++..++.+|.+|++|+.++
T Consensus 115 ~g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG~~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~~tg 193 (331)
T PF07995_consen 115 NGGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRIDPDGSIPADNPFVGDDGADSEIYAYGLRNPFGLAFDPNTG 193 (331)
T ss_dssp -EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEETTSSB-TTSTTTTSTTSTTTEEEE--SEEEEEEEETTTT
T ss_pred CCccccCCC-CCcEEEEeCCCCCcccccccccccceEEEecccCcCCCCCccccCCCceEEEEEeCCCccccEEEECCCC
Confidence 345699999 889999953321 136899999999982 2345667899999999999999
Q ss_pred eEEEEeCCCCe
Q psy959 81 MLYWTDAVREK 91 (109)
Q Consensus 81 ~lYw~D~~~~~ 91 (109)
.||.+|.+...
T Consensus 194 ~l~~~d~G~~~ 204 (331)
T PF07995_consen 194 RLWAADNGPDG 204 (331)
T ss_dssp EEEEEEE-SSS
T ss_pred cEEEEccCCCC
Confidence 99999987543
No 35
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=97.17 E-value=0.0014 Score=39.09 Aligned_cols=47 Identities=17% Similarity=0.124 Sum_probs=39.3
Q ss_pred CCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCC
Q psy959 7 DLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGS 57 (109)
Q Consensus 7 ~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~ 57 (109)
|+..+.-++|+++|..|+|||+.+....+++++.. ..+|.|..+.|.
T Consensus 43 dp~t~~~~vl~~~L~fpNGVals~d~~~vlv~Et~----~~Ri~rywl~Gp 89 (89)
T PF03088_consen 43 DPSTKETTVLLDGLYFPNGVALSPDESFVLVAETG----RYRILRYWLKGP 89 (89)
T ss_dssp ETTTTEEEEEEEEESSEEEEEE-TTSSEEEEEEGG----GTEEEEEESSST
T ss_pred ECCCCeEEEehhCCCccCeEEEcCCCCEEEEEecc----CceEEEEEEeCC
Confidence 44445567888999999999999988899999998 699999999873
No 36
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=97.13 E-value=0.0028 Score=37.59 Aligned_cols=43 Identities=26% Similarity=0.350 Sum_probs=33.8
Q ss_pred cCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeC
Q psy959 54 MAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDL 97 (109)
Q Consensus 54 ~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~ 97 (109)
.||+..++ +..++..|+||++++.++.||.++...+.|.....
T Consensus 41 yd~~~~~~-va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86)
T PF01731_consen 41 YDGKEVKV-VASGFSFANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86)
T ss_pred EeCCEeEE-eeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEe
Confidence 34444444 34578999999999999999999999888877654
No 37
>KOG4659|consensus
Probab=96.94 E-value=0.007 Score=50.89 Aligned_cols=91 Identities=19% Similarity=0.147 Sum_probs=69.9
Q ss_pred eeeecCCCCCccEEEecC---------CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEe--CCCCCc
Q psy959 2 VSTDKDLLKTHRSLLLQA---------IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIE--KDLSQP 70 (109)
Q Consensus 2 ~~~~~~l~G~~~~~li~~---------l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~--~~~~~p 70 (109)
+++-+-.||-.|.+=+.+ |-.|.++|.-| .|.||.-|. .-|.|...||.-+.++-- .+...-
T Consensus 337 vitt~mgdG~qR~veC~~C~G~a~~~~L~aPvala~a~-DGSl~VGDf------NyIRRI~~dg~v~tIl~L~~t~~sh~ 409 (1899)
T KOG4659|consen 337 VITTAMGDGHQRDVECPKCEGKADSISLFAPVALAYAP-DGSLIVGDF------NYIRRISQDGQVSTILTLGLTDTSHS 409 (1899)
T ss_pred eEEEeccCcccccccCCCCCCccccceeeceeeEEEcC-CCcEEEccc------hheeeecCCCceEEEEEecCCCccce
Confidence 355566677776655522 55799999998 999999885 589999999988776643 233445
Q ss_pred eeEEEecCCCeEEEEeCCCCeEEEEe-CCC
Q psy959 71 SGLAIDFDDEMLYWTDAVREKIERSD-LAG 99 (109)
Q Consensus 71 ~glaiD~~~~~lYw~D~~~~~I~~~~-~dG 99 (109)
.-||+||..+.||.+|...++|.+++ +.+
T Consensus 410 Yy~AvsPvdgtlyvSdp~s~qv~rv~sl~~ 439 (1899)
T KOG4659|consen 410 YYIAVSPVDGTLYVSDPLSKQVWRVSSLEP 439 (1899)
T ss_pred eEEEecCcCceEEecCCCcceEEEeccCCc
Confidence 57999999999999999999998885 554
No 38
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=96.93 E-value=0.012 Score=44.78 Aligned_cols=75 Identities=27% Similarity=0.317 Sum_probs=55.1
Q ss_pred cEEEecCCC-----CcceEEeeCCCCeEEEEecCCC----------------------------CCCCeEEEEEcCCCC-
Q psy959 13 RSLLLQAIP-----GRRHFMPPIRLRTLYYTDWGRF----------------------------GTAGKILRTTMAGSL- 58 (109)
Q Consensus 13 ~~~li~~l~-----~p~~iavd~~~g~lywsd~~~~----------------------------~~~~~I~~~~~dG~~- 58 (109)
.++|++.+. .-..|+++| .|+||++-.... ...++|.|.+.||+.
T Consensus 133 ~~~Il~~lP~~~~H~GgrI~FgP-DG~LYVs~GD~g~~~~~n~~~~~~aQ~~~~~~~~~~~d~~~~~GkILRin~DGsiP 211 (454)
T TIGR03606 133 PVDLLAGLPAGNDHNGGRLVFGP-DGKIYYTIGEQGRNQGANFFLPNQAQHTPTQQELNGKDYHAYMGKVLRLNLDGSIP 211 (454)
T ss_pred ceEEEecCCCCCCcCCceEEECC-CCcEEEEECCCCCCCcccccCcchhccccccccccccCcccCceEEEEEcCCCCCC
Confidence 455665442 234689999 789999743320 013589999999973
Q ss_pred ---------cEEEEeCCCCCceeEEEecCCCeEEEEeCCC
Q psy959 59 ---------KKVIIEKDLSQPSGLAIDFDDEMLYWTDAVR 89 (109)
Q Consensus 59 ---------~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~ 89 (109)
+..|...++.+|.||++|+ ++.||.+|.+.
T Consensus 212 ~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp 250 (454)
T TIGR03606 212 KDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGP 250 (454)
T ss_pred CCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCC
Confidence 3457777899999999999 79999999864
No 39
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=96.72 E-value=0.063 Score=37.77 Aligned_cols=80 Identities=14% Similarity=0.033 Sum_probs=52.3
Q ss_pred cceEEeeCCCCeEEEEecCCCCCCCeEEEEEc--CCCCcEEEEe-------CCCCCceeEEEecCCCeEEEEeCCCCeEE
Q psy959 23 RRHFMPPIRLRTLYYTDWGRFGTAGKILRTTM--AGSLKKVIIE-------KDLSQPSGLAIDFDDEMLYWTDAVREKIE 93 (109)
Q Consensus 23 p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~--dG~~~~~l~~-------~~~~~p~glaiD~~~~~lYw~D~~~~~I~ 93 (109)
..|||.|+.++.+|...-.. ...|+.... .+....+... ..+..|.+|++|+.+++||.......+|-
T Consensus 120 ~EGla~D~~~~~L~v~kE~~---P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~es~~l~ 196 (248)
T PF06977_consen 120 FEGLAYDPKTNRLFVAKERK---PKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDESRLLL 196 (248)
T ss_dssp -EEEEEETTTTEEEEEEESS---SEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEETTTTEEE
T ss_pred eEEEEEcCCCCEEEEEeCCC---ChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeEEEEECCCCeEE
Confidence 57999999999999886542 356777776 2223222222 13567999999999999999999999999
Q ss_pred EEeCCCCCeEEE
Q psy959 94 RSDLAGNNREAL 105 (109)
Q Consensus 94 ~~~~dG~~~~~l 105 (109)
..+.+|.-...+
T Consensus 197 ~~d~~G~~~~~~ 208 (248)
T PF06977_consen 197 ELDRQGRVVSSL 208 (248)
T ss_dssp EE-TT--EEEEE
T ss_pred EECCCCCEEEEE
Confidence 999888755443
No 40
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=96.39 E-value=0.074 Score=39.39 Aligned_cols=73 Identities=19% Similarity=0.108 Sum_probs=55.7
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLA 98 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~d 98 (109)
..|.++++.+...++|.++.. ...|.+.+.+..........+ ..|.++++|+..+.+|.++.+ ...|...+-.
T Consensus 74 ~~p~~i~v~~~~~~vyv~~~~----~~~v~vid~~~~~~~~~~~vG-~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~ 148 (381)
T COG3391 74 VYPAGVAVNPAGNKVYVTTGD----SNTVSVIDTATNTVLGSIPVG-LGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAA 148 (381)
T ss_pred ccccceeeCCCCCeEEEecCC----CCeEEEEcCcccceeeEeeec-cCCceEEECCCCCEEEEEecccCCceEEEEeCC
Confidence 668899999988899999976 578888885544433333222 289999999999999999994 5677666654
No 41
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.18 E-value=0.12 Score=37.99 Aligned_cols=77 Identities=12% Similarity=0.078 Sum_probs=57.8
Q ss_pred CCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC--cEEEEe--------CCCCCceeEEEecCCCeEEEEeCC
Q psy959 19 AIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL--KKVIIE--------KDLSQPSGLAIDFDDEMLYWTDAV 88 (109)
Q Consensus 19 ~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~--~~~l~~--------~~~~~p~glaiD~~~~~lYw~D~~ 88 (109)
.-..||.|+.+|.....|...-- +++|.+...++.. .+.+.. .+-.+...|.|.+..+.||-++.+
T Consensus 189 ~G~GPRHi~FHpn~k~aY~v~EL----~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg 264 (346)
T COG2706 189 PGAGPRHIVFHPNGKYAYLVNEL----NSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRG 264 (346)
T ss_pred CCCCcceEEEcCCCcEEEEEecc----CCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCC
Confidence 34689999999977788887755 6999999998742 222221 123456678999999999999999
Q ss_pred CCeEEEEeCCC
Q psy959 89 REKIERSDLAG 99 (109)
Q Consensus 89 ~~~I~~~~~dG 99 (109)
...|..+..|-
T Consensus 265 ~dsI~~f~V~~ 275 (346)
T COG2706 265 HDSIAVFSVDP 275 (346)
T ss_pred CCeEEEEEEcC
Confidence 98888877663
No 42
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=95.93 E-value=0.28 Score=35.12 Aligned_cols=81 Identities=31% Similarity=0.316 Sum_probs=54.7
Q ss_pred CcceEEeeC---CCCeEEEEecCCCCCCCeEEEEEc----CCCC---------cEEEEeCCCCCceeEEEecCCCeEEEE
Q psy959 22 GRRHFMPPI---RLRTLYYTDWGRFGTAGKILRTTM----AGSL---------KKVIIEKDLSQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 22 ~p~~iavd~---~~g~lywsd~~~~~~~~~I~~~~~----dG~~---------~~~l~~~~~~~p~glaiD~~~~~lYw~ 85 (109)
...|+++.| ..+.|||.... ..++++... |.+. .+.+-.. ..+..|+++|. ++.||++
T Consensus 129 g~~gial~~~~~d~r~LYf~~ls----s~~ly~v~T~~L~~~~~~~~~~~~~~v~~lG~k-~~~s~g~~~D~-~G~ly~~ 202 (287)
T PF03022_consen 129 GIFGIALSPISPDGRWLYFHPLS----SRKLYRVPTSVLRDPSLSDAQALASQVQDLGDK-GSQSDGMAIDP-NGNLYFT 202 (287)
T ss_dssp SEEEEEE-TTSTTS-EEEEEETT-----SEEEEEEHHHHCSTT--HHH-HHHT-EEEEE----SECEEEEET-TTEEEEE
T ss_pred CccccccCCCCCCccEEEEEeCC----CCcEEEEEHHHhhCccccccccccccceecccc-CCCCceEEECC-CCcEEEe
Confidence 456778866 34689999876 567888763 3332 1233221 24678999997 9999999
Q ss_pred eCCCCeEEEEeCCC----CCeEEEEeC
Q psy959 86 DAVREKIERSDLAG----NNREALYLN 108 (109)
Q Consensus 86 D~~~~~I~~~~~dG----~~~~~l~~~ 108 (109)
+.....|.+.+.++ .+.++|+.+
T Consensus 203 ~~~~~aI~~w~~~~~~~~~~~~~l~~d 229 (287)
T PF03022_consen 203 DVEQNAIGCWDPDGPYTPENFEILAQD 229 (287)
T ss_dssp ECCCTEEEEEETTTSB-GCCEEEEEE-
T ss_pred cCCCCeEEEEeCCCCcCccchheeEEc
Confidence 99999999999999 566666653
No 43
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=95.83 E-value=0.034 Score=41.65 Aligned_cols=79 Identities=22% Similarity=0.172 Sum_probs=52.9
Q ss_pred ccEEEecCCC-----CcceEEeeCCCCeEEEEecCC-------C--CCCCeEEEE--------EcCCCCcEEEEeCCCCC
Q psy959 12 HRSLLLQAIP-----GRRHFMPPIRLRTLYYTDWGR-------F--GTAGKILRT--------TMAGSLKKVIIEKDLSQ 69 (109)
Q Consensus 12 ~~~~li~~l~-----~p~~iavd~~~g~lywsd~~~-------~--~~~~~I~~~--------~~dG~~~~~l~~~~~~~ 69 (109)
..++++..+. .-..|+++| .|+||.+-... . +..++|.+. +.++.+ ..+++.++.+
T Consensus 163 ~~~~i~~~lP~~~~H~g~~l~f~p-DG~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~-~~i~s~G~RN 240 (399)
T COG2133 163 EPKVIFRGIPKGGHHFGGRLVFGP-DGKLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPN-SEIWSYGHRN 240 (399)
T ss_pred cccEEeecCCCCCCcCcccEEECC-CCcEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCC-cceEEeccCC
Confidence 3466666553 234699999 66999986443 1 123433343 334444 3455678999
Q ss_pred ceeEEEecCCCeEEEEeCCCCeE
Q psy959 70 PSGLAIDFDDEMLYWTDAVREKI 92 (109)
Q Consensus 70 p~glaiD~~~~~lYw~D~~~~~I 92 (109)
|.||+.|+.++.||.+|.+...+
T Consensus 241 ~qGl~w~P~tg~Lw~~e~g~d~~ 263 (399)
T COG2133 241 PQGLAWHPVTGALWTTEHGPDAL 263 (399)
T ss_pred ccceeecCCCCcEEEEecCCCcc
Confidence 99999999999999999876433
No 44
>PRK04792 tolB translocation protein TolB; Provisional
Probab=95.79 E-value=0.59 Score=35.38 Aligned_cols=98 Identities=12% Similarity=0.047 Sum_probs=63.1
Q ss_pred cCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEE
Q psy959 6 KDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 6 ~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~ 85 (109)
++.+|...+.+...-....+.+..|...+|+|+.... ....|+..++++...+.+.... ......+..|.+++|+++
T Consensus 203 ~d~dG~~~~~l~~~~~~~~~p~wSPDG~~La~~s~~~--g~~~L~~~dl~tg~~~~lt~~~-g~~~~~~wSPDG~~La~~ 279 (448)
T PRK04792 203 ADYDGYNEQMLLRSPEPLMSPAWSPDGRKLAYVSFEN--RKAEIFVQDIYTQVREKVTSFP-GINGAPRFSPDGKKLALV 279 (448)
T ss_pred EeCCCCCceEeecCCCcccCceECCCCCEEEEEEecC--CCcEEEEEECCCCCeEEecCCC-CCcCCeeECCCCCEEEEE
Confidence 4667776666554333334567788777888875432 1578999999877665554321 122356788888888886
Q ss_pred eC--CCCeEEEEeCCCCCeEEEE
Q psy959 86 DA--VREKIERSDLAGNNREALY 106 (109)
Q Consensus 86 D~--~~~~I~~~~~dG~~~~~l~ 106 (109)
.. +...|+..++++...+.+.
T Consensus 280 ~~~~g~~~Iy~~dl~tg~~~~lt 302 (448)
T PRK04792 280 LSKDGQPEIYVVDIATKALTRIT 302 (448)
T ss_pred EeCCCCeEEEEEECCCCCeEECc
Confidence 43 3457888888877655443
No 45
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=95.63 E-value=0.43 Score=32.70 Aligned_cols=76 Identities=13% Similarity=-0.038 Sum_probs=53.7
Q ss_pred CcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCCCC
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAGNN 101 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG~~ 101 (109)
.|.+++++|....+|.+... ..+|...+++......... .-..|.++++++.+.+||-+....+.|...++++..
T Consensus 208 ~~~~i~~s~dg~~~~~~~~~----~~~i~v~d~~~~~~~~~~~-~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~~ 282 (300)
T TIGR03866 208 QPVGIKLTKDGKTAFVALGP----ANRVAVVDAKTYEVLDYLL-VGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAALK 282 (300)
T ss_pred CccceEECCCCCEEEEEcCC----CCeEEEEECCCCcEEEEEE-eCCCcceEEECCCCCEEEEEcCCCCeEEEEECCCCc
Confidence 35678899877777887654 4578877775333222222 234688999999999999887767789999988765
Q ss_pred e
Q psy959 102 R 102 (109)
Q Consensus 102 ~ 102 (109)
.
T Consensus 283 ~ 283 (300)
T TIGR03866 283 V 283 (300)
T ss_pred E
Confidence 4
No 46
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=95.39 E-value=0.54 Score=32.22 Aligned_cols=78 Identities=13% Similarity=0.089 Sum_probs=53.2
Q ss_pred ecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEe
Q psy959 17 LQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSD 96 (109)
Q Consensus 17 i~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~ 96 (109)
++.-..|.+++++|..+.+|.+... ...|...+.++......+. ....|..+++++..+.+|.+....+.|...+
T Consensus 27 ~~~~~~~~~l~~~~dg~~l~~~~~~----~~~v~~~d~~~~~~~~~~~-~~~~~~~~~~~~~g~~l~~~~~~~~~l~~~d 101 (300)
T TIGR03866 27 FPVGQRPRGITLSKDGKLLYVCASD----SDTIQVIDLATGEVIGTLP-SGPDPELFALHPNGKILYIANEDDNLVTVID 101 (300)
T ss_pred EECCCCCCceEECCCCCEEEEEECC----CCeEEEEECCCCcEEEecc-CCCCccEEEECCCCCEEEEEcCCCCeEEEEE
Confidence 3434568899999976778888755 5788888876433222222 2345778899988888888766556777777
Q ss_pred CCC
Q psy959 97 LAG 99 (109)
Q Consensus 97 ~dG 99 (109)
+..
T Consensus 102 ~~~ 104 (300)
T TIGR03866 102 IET 104 (300)
T ss_pred CCC
Confidence 654
No 47
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=95.11 E-value=0.57 Score=34.49 Aligned_cols=73 Identities=14% Similarity=0.105 Sum_probs=55.3
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCC-CcEEEEe---CCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGS-LKKVIIE---KDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAG 99 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~-~~~~l~~---~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG 99 (109)
.+|.+.+..++||.+|.+ ...|.....|.. ....++. .....|+++.+++..+.|+-+....+.|.....|.
T Consensus 247 aaIhis~dGrFLYasNRg----~dsI~~f~V~~~~g~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~ 322 (346)
T COG2706 247 AAIHISPDGRFLYASNRG----HDSIAVFSVDPDGGKLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFERDK 322 (346)
T ss_pred eEEEECCCCCEEEEecCC----CCeEEEEEEcCCCCEEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEEEcC
Confidence 358889989999999988 467777776533 2233332 34556999999999999999998888888887775
Q ss_pred C
Q psy959 100 N 100 (109)
Q Consensus 100 ~ 100 (109)
.
T Consensus 323 ~ 323 (346)
T COG2706 323 E 323 (346)
T ss_pred C
Confidence 4
No 48
>PRK02889 tolB translocation protein TolB; Provisional
Probab=95.02 E-value=1.1 Score=33.65 Aligned_cols=98 Identities=13% Similarity=0.005 Sum_probs=61.1
Q ss_pred ecCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEE
Q psy959 5 DKDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYW 84 (109)
Q Consensus 5 ~~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw 84 (109)
.+|.+|...+.+...-..-.+.+..|....|+++.... ..+.|+..++++..++.+.... ......+..|..++|++
T Consensus 180 ~~D~dG~~~~~l~~~~~~v~~p~wSPDG~~la~~s~~~--~~~~I~~~dl~~g~~~~l~~~~-g~~~~~~~SPDG~~la~ 256 (427)
T PRK02889 180 ISDADGQNAQSALSSPEPIISPAWSPDGTKLAYVSFES--KKPVVYVHDLATGRRRVVANFK-GSNSAPAWSPDGRTLAV 256 (427)
T ss_pred EECCCCCCceEeccCCCCcccceEcCCCCEEEEEEccC--CCcEEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEE
Confidence 45667776666653323334567778677887765432 2478999999866665554321 22345677788888877
Q ss_pred EeC--CCCeEEEEeCCCCCeEEE
Q psy959 85 TDA--VREKIERSDLAGNNREAL 105 (109)
Q Consensus 85 ~D~--~~~~I~~~~~dG~~~~~l 105 (109)
+-. +...|+.++.+|...+.+
T Consensus 257 ~~~~~g~~~Iy~~d~~~~~~~~l 279 (427)
T PRK02889 257 ALSRDGNSQIYTVNADGSGLRRL 279 (427)
T ss_pred EEccCCCceEEEEECCCCCcEEC
Confidence 532 456788888877665544
No 49
>PRK00178 tolB translocation protein TolB; Provisional
Probab=94.93 E-value=1.1 Score=33.34 Aligned_cols=99 Identities=14% Similarity=0.052 Sum_probs=61.7
Q ss_pred ecCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEE
Q psy959 5 DKDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYW 84 (109)
Q Consensus 5 ~~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw 84 (109)
.++.+|...+.+...-......+..|....|+|+.... ..+.|++.++++...+.+.... ......+..|..++|++
T Consensus 183 ~~d~~g~~~~~l~~~~~~~~~p~wSpDG~~la~~s~~~--~~~~l~~~~l~~g~~~~l~~~~-g~~~~~~~SpDG~~la~ 259 (430)
T PRK00178 183 RSDYDGARAVTLLQSREPILSPRWSPDGKRIAYVSFEQ--KRPRIFVQNLDTGRREQITNFE-GLNGAPAWSPDGSKLAF 259 (430)
T ss_pred EECCCCCCceEEecCCCceeeeeECCCCCEEEEEEcCC--CCCEEEEEECCCCCEEEccCCC-CCcCCeEECCCCCEEEE
Confidence 34666766655554333335567778677887765432 2578999999876666554321 12234677777888887
Q ss_pred EeC--CCCeEEEEeCCCCCeEEEE
Q psy959 85 TDA--VREKIERSDLAGNNREALY 106 (109)
Q Consensus 85 ~D~--~~~~I~~~~~dG~~~~~l~ 106 (109)
+-. +...|+..++++...+.+.
T Consensus 260 ~~~~~g~~~Iy~~d~~~~~~~~lt 283 (430)
T PRK00178 260 VLSKDGNPEIYVMDLASRQLSRVT 283 (430)
T ss_pred EEccCCCceEEEEECCCCCeEEcc
Confidence 643 3457888888877655443
No 50
>PRK04922 tolB translocation protein TolB; Provisional
Probab=94.90 E-value=1.2 Score=33.49 Aligned_cols=97 Identities=14% Similarity=0.052 Sum_probs=59.9
Q ss_pred cCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEE
Q psy959 6 KDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 6 ~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~ 85 (109)
.|.+|...+.+...-..-.+.+..|....|+|+.... ..+.|++.++++..++.+.... ......++.+.+++|+++
T Consensus 189 ~D~~g~~~~~lt~~~~~v~~p~wSpDg~~la~~s~~~--~~~~l~~~dl~~g~~~~l~~~~-g~~~~~~~SpDG~~l~~~ 265 (433)
T PRK04922 189 ADSDGYNPQTILRSAEPILSPAWSPDGKKLAYVSFER--GRSAIYVQDLATGQRELVASFR-GINGAPSFSPDGRRLALT 265 (433)
T ss_pred ECCCCCCceEeecCCCccccccCCCCCCEEEEEecCC--CCcEEEEEECCCCCEEEeccCC-CCccCceECCCCCEEEEE
Confidence 3455555555543222234556777677888876432 2578999999876666554321 122356788888888776
Q ss_pred eC--CCCeEEEEeCCCCCeEEE
Q psy959 86 DA--VREKIERSDLAGNNREAL 105 (109)
Q Consensus 86 D~--~~~~I~~~~~dG~~~~~l 105 (109)
-. +...|...++++...+.+
T Consensus 266 ~s~~g~~~Iy~~d~~~g~~~~l 287 (433)
T PRK04922 266 LSRDGNPEIYVMDLGSRQLTRL 287 (433)
T ss_pred EeCCCCceEEEEECCCCCeEEC
Confidence 43 345788888887665544
No 51
>PRK01742 tolB translocation protein TolB; Provisional
Probab=94.69 E-value=1.3 Score=33.14 Aligned_cols=98 Identities=11% Similarity=0.044 Sum_probs=62.3
Q ss_pred cCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEE
Q psy959 6 KDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 6 ~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~ 85 (109)
+|.+|..++.+...-....+.+..|....|.|+.... ..+.|+..++++..++.+.... ......+..|.+++|+++
T Consensus 189 ~d~dg~~~~~lt~~~~~v~~p~wSPDG~~la~~s~~~--~~~~i~i~dl~tg~~~~l~~~~-g~~~~~~wSPDG~~La~~ 265 (429)
T PRK01742 189 ADYDGFNQFIVNRSSQPLMSPAWSPDGSKLAYVSFEN--KKSQLVVHDLRSGARKVVASFR-GHNGAPAFSPDGSRLAFA 265 (429)
T ss_pred ECCCCCCceEeccCCCccccceEcCCCCEEEEEEecC--CCcEEEEEeCCCCceEEEecCC-CccCceeECCCCCEEEEE
Confidence 4667777665544333446678888777887775432 1468999998766655554321 122356788888888887
Q ss_pred eC--CCCeEEEEeCCCCCeEEEE
Q psy959 86 DA--VREKIERSDLAGNNREALY 106 (109)
Q Consensus 86 D~--~~~~I~~~~~dG~~~~~l~ 106 (109)
-. +...|+..+.++...+.+.
T Consensus 266 ~~~~g~~~Iy~~d~~~~~~~~lt 288 (429)
T PRK01742 266 SSKDGVLNIYVMGANGGTPSQLT 288 (429)
T ss_pred EecCCcEEEEEEECCCCCeEeec
Confidence 43 3456888888876655543
No 52
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=94.63 E-value=0.24 Score=36.25 Aligned_cols=63 Identities=13% Similarity=0.108 Sum_probs=45.4
Q ss_pred EEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC-CCCcEEEEeCCCCCceeEEEecCCCeEEEEe
Q psy959 14 SLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA-GSLKKVIIEKDLSQPSGLAIDFDDEMLYWTD 86 (109)
Q Consensus 14 ~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d-G~~~~~l~~~~~~~p~glaiD~~~~~lYw~D 86 (109)
++++++|..|.+--.+ .|+||++|++ .+.+.+.+.+ |....+..- -..|+||+.. ++.+|..-
T Consensus 196 evl~~GLsmPhSPRWh--dgrLwvldsg----tGev~~vD~~~G~~e~Va~v--pG~~rGL~f~--G~llvVgm 259 (335)
T TIGR03032 196 EVVASGLSMPHSPRWY--QGKLWLLNSG----RGELGYVDPQAGKFQPVAFL--PGFTRGLAFA--GDFAFVGL 259 (335)
T ss_pred CEEEcCccCCcCCcEe--CCeEEEEECC----CCEEEEEcCCCCcEEEEEEC--CCCCccccee--CCEEEEEe
Confidence 5666788888776544 7999999999 6999999997 655444322 2488998876 55555543
No 53
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=94.52 E-value=0.27 Score=35.56 Aligned_cols=62 Identities=19% Similarity=0.279 Sum_probs=47.4
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEe--------CCCCCceeEEEecCCCeEEEEeC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIE--------KDLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~--------~~~~~p~glaiD~~~~~lYw~D~ 87 (109)
....|+..|+.++.|+.-... ...+...+.+|..+..+.- .++.+|.|+|.| ..+.||.+..
T Consensus 233 ~DvSgl~~~~~~~~LLVLS~E----Sr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamD-d~g~lYIvSE 302 (316)
T COG3204 233 LDVSGLEFNAITNSLLVLSDE----SRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMD-DDGNLYIVSE 302 (316)
T ss_pred eccccceecCCCCcEEEEecC----CceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEEC-CCCCEEEEec
Confidence 455678889989988877655 5888999999987655542 346789999999 6778888865
No 54
>PRK03629 tolB translocation protein TolB; Provisional
Probab=94.47 E-value=1.5 Score=32.90 Aligned_cols=99 Identities=11% Similarity=-0.000 Sum_probs=64.2
Q ss_pred ecCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEE
Q psy959 5 DKDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYW 84 (109)
Q Consensus 5 ~~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw 84 (109)
.+|.+|.+++.+...-....+.+..|....|.|+.... ....|+..++++...+.+.... ......+..|.+++|++
T Consensus 183 ~~d~dg~~~~~lt~~~~~~~~p~wSPDG~~la~~s~~~--g~~~i~i~dl~~G~~~~l~~~~-~~~~~~~~SPDG~~La~ 259 (429)
T PRK03629 183 VSDYDGYNQFVVHRSPQPLMSPAWSPDGSKLAYVTFES--GRSALVIQTLANGAVRQVASFP-RHNGAPAFSPDGSKLAF 259 (429)
T ss_pred EEcCCCCCCEEeecCCCceeeeEEcCCCCEEEEEEecC--CCcEEEEEECCCCCeEEccCCC-CCcCCeEECCCCCEEEE
Confidence 35667777777654333345677888666777664321 1578999999876665554321 12335688888999998
Q ss_pred EeC--CCCeEEEEeCCCCCeEEEE
Q psy959 85 TDA--VREKIERSDLAGNNREALY 106 (109)
Q Consensus 85 ~D~--~~~~I~~~~~dG~~~~~l~ 106 (109)
+.. +...|..+++++...+.+.
T Consensus 260 ~~~~~g~~~I~~~d~~tg~~~~lt 283 (429)
T PRK03629 260 ALSKTGSLNLYVMDLASGQIRQVT 283 (429)
T ss_pred EEcCCCCcEEEEEECCCCCEEEcc
Confidence 744 3457888888876655554
No 55
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=94.31 E-value=0.81 Score=33.17 Aligned_cols=74 Identities=19% Similarity=0.156 Sum_probs=53.2
Q ss_pred ecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEe
Q psy959 17 LQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSD 96 (109)
Q Consensus 17 i~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~ 96 (109)
+.+-..|+.++.++ .|.++|++.+ .+.|-+.+......+..--..-..|+++.+++.. -.+.+|.+. .|.+.+
T Consensus 58 vp~G~ap~dvapap-dG~VWft~qg----~gaiGhLdP~tGev~~ypLg~Ga~Phgiv~gpdg-~~Witd~~~-aI~R~d 130 (353)
T COG4257 58 VPNGSAPFDVAPAP-DGAVWFTAQG----TGAIGHLDPATGEVETYPLGSGASPHGIVVGPDG-SAWITDTGL-AIGRLD 130 (353)
T ss_pred cCCCCCccccccCC-CCceEEecCc----cccceecCCCCCceEEEecCCCCCCceEEECCCC-CeeEecCcc-eeEEec
Confidence 34557889999999 9999999988 6888887764333333333456799999999654 567888876 555554
Q ss_pred C
Q psy959 97 L 97 (109)
Q Consensus 97 ~ 97 (109)
-
T Consensus 131 p 131 (353)
T COG4257 131 P 131 (353)
T ss_pred C
Confidence 3
No 56
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=94.22 E-value=1.4 Score=32.73 Aligned_cols=76 Identities=11% Similarity=-0.014 Sum_probs=51.0
Q ss_pred Ccce---EEeeCCCCeEEEEecCC-----CCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCC-eEEEEeCCCCeE
Q psy959 22 GRRH---FMPPIRLRTLYYTDWGR-----FGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDE-MLYWTDAVREKI 92 (109)
Q Consensus 22 ~p~~---iavd~~~g~lywsd~~~-----~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~-~lYw~D~~~~~I 92 (109)
.|.+ +++++..+++|....+. ..+...|.+.+.....+.--+. --..|.+|++.+..+ +||-+....+.|
T Consensus 246 rP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~-vG~~~~~iavS~Dgkp~lyvtn~~s~~V 324 (352)
T TIGR02658 246 RPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIE-LGHEIDSINVSQDAKPLLYALSTGDKTL 324 (352)
T ss_pred CCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEe-CCCceeeEEECCCCCeEEEEeCCCCCcE
Confidence 3555 99999999999954221 1123577777764333222222 245899999999999 999999877777
Q ss_pred EEEeCC
Q psy959 93 ERSDLA 98 (109)
Q Consensus 93 ~~~~~d 98 (109)
...+..
T Consensus 325 sViD~~ 330 (352)
T TIGR02658 325 YIFDAE 330 (352)
T ss_pred EEEECc
Confidence 777654
No 57
>PRK05137 tolB translocation protein TolB; Provisional
Probab=94.05 E-value=1.9 Score=32.33 Aligned_cols=78 Identities=15% Similarity=0.100 Sum_probs=53.5
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--C---CeEEEEeCCC
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--R---EKIERSDLAG 99 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~---~~I~~~~~dG 99 (109)
..+..|..+.|+++.... ....|...+++|...+.+... ......+..+.++.||++-.. . ..+..++++|
T Consensus 338 ~~~~SpdG~~ia~~~~~~--~~~~i~~~d~~~~~~~~lt~~--~~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g 413 (435)
T PRK05137 338 TPVWSPRGDLIAFTKQGG--GQFSIGVMKPDGSGERILTSG--FLVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTG 413 (435)
T ss_pred CeEECCCCCEEEEEEcCC--CceEEEEEECCCCceEeccCC--CCCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCC
Confidence 355677677887776432 135788888887776555432 235567888888898887542 2 4799999999
Q ss_pred CCeEEEE
Q psy959 100 NNREALY 106 (109)
Q Consensus 100 ~~~~~l~ 106 (109)
...+.+.
T Consensus 414 ~~~~~l~ 420 (435)
T PRK05137 414 RNEREVP 420 (435)
T ss_pred CceEEcc
Confidence 8877653
No 58
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=93.96 E-value=0.78 Score=33.28 Aligned_cols=84 Identities=17% Similarity=0.218 Sum_probs=54.1
Q ss_pred CCCcceEEeeCCCCeEEEE--ecCCCCCCCeEEEEEcCCC---C-cEE------EEeCC---C----CCceeEEEecCCC
Q psy959 20 IPGRRHFMPPIRLRTLYYT--DWGRFGTAGKILRTTMAGS---L-KKV------IIEKD---L----SQPSGLAIDFDDE 80 (109)
Q Consensus 20 l~~p~~iavd~~~g~lyws--d~~~~~~~~~I~~~~~dG~---~-~~~------l~~~~---~----~~p~glaiD~~~~ 80 (109)
+..-.||+.++..+ .||+ |.+.+....+++...++.. . ... +...+ + .-+.||++ +.++
T Consensus 19 ~GGlSgl~~~~~~~-~~~avSD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~G~~~~~~~~D~Egi~~-~~~g 96 (326)
T PF13449_consen 19 FGGLSGLDYDPDDG-RFYAVSDRGPNKGPPRFYTFRIDYDQGGIGGVTILDMIPLRDPDGQPFPKNGLDPEGIAV-PPDG 96 (326)
T ss_pred cCcEeeEEEeCCCC-EEEEEECCCCCCCCCcEEEEEeeccCCCccceEeccceeccCCCCCcCCcCCCChhHeEE-ecCC
Confidence 45557899997344 4444 4442223466888877651 1 111 11111 1 15679999 7899
Q ss_pred eEEEEeCCC------CeEEEEeCCCCCeEEE
Q psy959 81 MLYWTDAVR------EKIERSDLAGNNREAL 105 (109)
Q Consensus 81 ~lYw~D~~~------~~I~~~~~dG~~~~~l 105 (109)
.+||++.+. ++|.+++.+|...+.+
T Consensus 97 ~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~ 127 (326)
T PF13449_consen 97 SFWISSEGGRTGGIPPRIRRFDLDGRVIRRF 127 (326)
T ss_pred CEEEEeCCccCCCCCCEEEEECCCCcccceE
Confidence 999999999 9999999999876554
No 59
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=93.93 E-value=0.3 Score=35.43 Aligned_cols=65 Identities=15% Similarity=0.180 Sum_probs=46.8
Q ss_pred CcceEEeeCCCCeEEEEecCC--CCCCCeEEEEEcCCCCcEEE-EeCCC-------------CCceeEEEecCCCeEEEE
Q psy959 22 GRRHFMPPIRLRTLYYTDWGR--FGTAGKILRTTMAGSLKKVI-IEKDL-------------SQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~--~~~~~~I~~~~~dG~~~~~l-~~~~~-------------~~p~glaiD~~~~~lYw~ 85 (109)
.|.||++ +..|.+||++-+. .+..+.|.+.+.+|...+.+ +...+ .-..+||+.+..+.||-+
T Consensus 86 D~Egi~~-~~~g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~ 164 (326)
T PF13449_consen 86 DPEGIAV-PPDGSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAA 164 (326)
T ss_pred ChhHeEE-ecCCCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEE
Confidence 6679999 5699999999771 00129999999999886665 22211 125689999998878877
Q ss_pred eC
Q psy959 86 DA 87 (109)
Q Consensus 86 D~ 87 (109)
-.
T Consensus 165 ~E 166 (326)
T PF13449_consen 165 ME 166 (326)
T ss_pred EC
Confidence 54
No 60
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=93.56 E-value=0.89 Score=35.41 Aligned_cols=69 Identities=16% Similarity=0.202 Sum_probs=47.7
Q ss_pred cCCCCcceEEeeCCCCeEEEEecCCC---------------CCCCeEEEEEcCCC-------CcEEEEe-----------
Q psy959 18 QAIPGRRHFMPPIRLRTLYYTDWGRF---------------GTAGKILRTTMAGS-------LKKVIIE----------- 64 (109)
Q Consensus 18 ~~l~~p~~iavd~~~g~lywsd~~~~---------------~~~~~I~~~~~dG~-------~~~~l~~----------- 64 (109)
+.+..|.+|+++|.++.+|++-.... ...+.|.+...++. .-..++.
T Consensus 347 T~f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~ 426 (524)
T PF05787_consen 347 TPFDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNG 426 (524)
T ss_pred ccccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccccc
Confidence 34789999999999999999975532 12468988887755 1122222
Q ss_pred ------CCCCCceeEEEecCCCeEEEEeC
Q psy959 65 ------KDLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 65 ------~~~~~p~glaiD~~~~~lYw~D~ 87 (109)
..+..|-+|++|+. ++|+.+.-
T Consensus 427 ~~~~~~~~f~sPDNL~~d~~-G~LwI~eD 454 (524)
T PF05787_consen 427 SNKCDDNGFASPDNLAFDPD-GNLWIQED 454 (524)
T ss_pred cCcccCCCcCCCCceEECCC-CCEEEEeC
Confidence 12678999999986 45665544
No 61
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=93.46 E-value=2.9 Score=32.52 Aligned_cols=95 Identities=5% Similarity=-0.047 Sum_probs=71.0
Q ss_pred eecCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEE
Q psy959 4 TDKDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLY 83 (109)
Q Consensus 4 ~~~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lY 83 (109)
..-+.+|...+.+..++.+..++.+++...++-.++. ...|...++|..+.+.+-...-....++++++..+.+=
T Consensus 385 ~iyd~~~~e~kr~e~~lg~I~av~vs~dGK~~vvaNd-----r~el~vididngnv~~idkS~~~lItdf~~~~nsr~iA 459 (668)
T COG4946 385 GIYDKDGGEVKRIEKDLGNIEAVKVSPDGKKVVVAND-----RFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSRWIA 459 (668)
T ss_pred EEEecCCceEEEeeCCccceEEEEEcCCCcEEEEEcC-----ceEEEEEEecCCCeeEecccccceeEEEEEcCCceeEE
Confidence 4556777888888889999999999995555776664 47899999998888877666666778888888777655
Q ss_pred EEeCC---CCeEEEEeCCCCCeE
Q psy959 84 WTDAV---REKIERSDLAGNNRE 103 (109)
Q Consensus 84 w~D~~---~~~I~~~~~dG~~~~ 103 (109)
.+=.. ...|...+++|...-
T Consensus 460 YafP~gy~tq~Iklydm~~~Kiy 482 (668)
T COG4946 460 YAFPEGYYTQSIKLYDMDGGKIY 482 (668)
T ss_pred EecCcceeeeeEEEEecCCCeEE
Confidence 44332 467888888885543
No 62
>PRK04792 tolB translocation protein TolB; Provisional
Probab=93.43 E-value=2.6 Score=31.91 Aligned_cols=81 Identities=17% Similarity=0.111 Sum_probs=53.3
Q ss_pred cceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCCCC
Q psy959 23 RRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLAGN 100 (109)
Q Consensus 23 p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~dG~ 100 (109)
....+..|....|+++.... ....|++.++++...+.+.... ....+.++.+..+.||.+... ...|..+++++.
T Consensus 308 ~~~p~wSpDG~~I~f~s~~~--g~~~Iy~~dl~~g~~~~Lt~~g-~~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g 384 (448)
T PRK04792 308 DTEPSWHPDGKSLIFTSERG--GKPQIYRVNLASGKVSRLTFEG-EQNLGGSITPDGRSMIMVNRTNGKFNIARQDLETG 384 (448)
T ss_pred ccceEECCCCCEEEEEECCC--CCceEEEEECCCCCEEEEecCC-CCCcCeeECCCCCEEEEEEecCCceEEEEEECCCC
Confidence 34556677667787764321 2578999999766555443222 233456788889999888654 347888999887
Q ss_pred CeEEEE
Q psy959 101 NREALY 106 (109)
Q Consensus 101 ~~~~l~ 106 (109)
..+.+.
T Consensus 385 ~~~~lt 390 (448)
T PRK04792 385 AMQVLT 390 (448)
T ss_pred CeEEcc
Confidence 766554
No 63
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=92.98 E-value=3 Score=31.32 Aligned_cols=77 Identities=16% Similarity=0.165 Sum_probs=51.0
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC--C-CCcEEEEeC---C-CCCceeEEEecC---CCeEEEEeCCCC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA--G-SLKKVIIEK---D-LSQPSGLAIDFD---DEMLYWTDAVRE 90 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d--G-~~~~~l~~~---~-~~~p~glaiD~~---~~~lYw~D~~~~ 90 (109)
..|.|+++|..+|.||..+-. .-|.+...+ + ..++.+... . ..-..||+|=.. .+.|..++.+..
T Consensus 208 sQ~EGCVVDDe~g~LYvgEE~-----~GIW~y~Aep~~~~~~~~v~~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~~ 282 (381)
T PF02333_consen 208 SQPEGCVVDDETGRLYVGEED-----VGIWRYDAEPEGGNDRTLVASADGDGLVADVEGLALYYGSDGKGYLIVSSQGDN 282 (381)
T ss_dssp S-EEEEEEETTTTEEEEEETT-----TEEEEEESSCCC-S--EEEEEBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGGT
T ss_pred CcceEEEEecccCCEEEecCc-----cEEEEEecCCCCCCcceeeecccccccccCccceEEEecCCCCeEEEEEcCCCC
Confidence 578999999999999999864 789998886 3 334433221 1 346889998432 356777787777
Q ss_pred eEEEEeCCCCCe
Q psy959 91 KIERSDLAGNNR 102 (109)
Q Consensus 91 ~I~~~~~dG~~~ 102 (109)
..-..+..|.+.
T Consensus 283 sf~Vy~r~~~~~ 294 (381)
T PF02333_consen 283 SFAVYDREGPNA 294 (381)
T ss_dssp EEEEEESSTT--
T ss_pred eEEEEecCCCCc
Confidence 777777777543
No 64
>PRK04043 tolB translocation protein TolB; Provisional
Probab=92.92 E-value=3.1 Score=31.37 Aligned_cols=98 Identities=14% Similarity=0.082 Sum_probs=61.2
Q ss_pred ecCCCCCccEEEecCCCCcceEEeeCCCC-eEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEE
Q psy959 5 DKDLLKTHRSLLLQAIPGRRHFMPPIRLR-TLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLY 83 (109)
Q Consensus 5 ~~~l~G~~~~~li~~l~~p~~iavd~~~g-~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lY 83 (109)
.+|.||.+.+++..+ ..-..-...|..+ .+|++.... ..+.|++.++.+..++.|.... .....-.+.|..++|.
T Consensus 173 ~~d~dg~~~~~~~~~-~~~~~p~wSpDG~~~i~y~s~~~--~~~~Iyv~dl~tg~~~~lt~~~-g~~~~~~~SPDG~~la 248 (419)
T PRK04043 173 LADYTLTYQKVIVKG-GLNIFPKWANKEQTAFYYTSYGE--RKPTLYKYNLYTGKKEKIASSQ-GMLVVSDVSKDGSKLL 248 (419)
T ss_pred EECCCCCceeEEccC-CCeEeEEECCCCCcEEEEEEccC--CCCEEEEEECCCCcEEEEecCC-CcEEeeEECCCCCEEE
Confidence 467788887777654 2112334455344 377765431 1579999999877777665421 1122345677788887
Q ss_pred EEeC--CCCeEEEEeCCCCCeEEEE
Q psy959 84 WTDA--VREKIERSDLAGNNREALY 106 (109)
Q Consensus 84 w~D~--~~~~I~~~~~dG~~~~~l~ 106 (109)
++.. +...|+.++++|...+.|.
T Consensus 249 ~~~~~~g~~~Iy~~dl~~g~~~~LT 273 (419)
T PRK04043 249 LTMAPKGQPDIYLYDTNTKTLTQIT 273 (419)
T ss_pred EEEccCCCcEEEEEECCCCcEEEcc
Confidence 7755 3568999999887766554
No 65
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=92.56 E-value=3.5 Score=30.99 Aligned_cols=77 Identities=21% Similarity=0.223 Sum_probs=47.1
Q ss_pred cCCCCcceEEe--eCCCCeEEEEecCCCCCCCeEEEEEc--CCCC--cEEEEe--CCCCCceeEEEecCCCeEEEEeCCC
Q psy959 18 QAIPGRRHFMP--PIRLRTLYYTDWGRFGTAGKILRTTM--AGSL--KKVIIE--KDLSQPSGLAIDFDDEMLYWTDAVR 89 (109)
Q Consensus 18 ~~l~~p~~iav--d~~~g~lywsd~~~~~~~~~I~~~~~--dG~~--~~~l~~--~~~~~p~glaiD~~~~~lYw~D~~~ 89 (109)
..+..|+|+++ ++.+|.+|..-... .+.+....+ ++.. .-.++. .--.+|.|+++|...+.||..+...
T Consensus 153 ~~~~e~yGlcly~~~~~g~~ya~v~~k---~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~~ 229 (381)
T PF02333_consen 153 TDLSEPYGLCLYRSPSTGALYAFVNGK---DGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEEDV 229 (381)
T ss_dssp -SSSSEEEEEEEE-TTT--EEEEEEET---TSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETTT
T ss_pred cccccceeeEEeecCCCCcEEEEEecC---CceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCcc
Confidence 34566888887 46678777665443 355655555 3332 222332 1235899999999999999999864
Q ss_pred CeEEEEeCC
Q psy959 90 EKIERSDLA 98 (109)
Q Consensus 90 ~~I~~~~~d 98 (109)
-|++...+
T Consensus 230 -GIW~y~Ae 237 (381)
T PF02333_consen 230 -GIWRYDAE 237 (381)
T ss_dssp -EEEEEESS
T ss_pred -EEEEEecC
Confidence 67777776
No 66
>PRK05137 tolB translocation protein TolB; Provisional
Probab=92.43 E-value=3.6 Score=30.85 Aligned_cols=98 Identities=8% Similarity=-0.023 Sum_probs=60.3
Q ss_pred cCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEE
Q psy959 6 KDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWT 85 (109)
Q Consensus 6 ~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~ 85 (109)
+|.+|...+.+...-......+..|....|+++.... ....|+..++++..++.+... -......++.|..++|+++
T Consensus 187 ~d~dg~~~~~lt~~~~~v~~p~wSpDG~~lay~s~~~--g~~~i~~~dl~~g~~~~l~~~-~g~~~~~~~SPDG~~la~~ 263 (435)
T PRK05137 187 MDQDGANVRYLTDGSSLVLTPRFSPNRQEITYMSYAN--GRPRVYLLDLETGQRELVGNF-PGMTFAPRFSPDGRKVVMS 263 (435)
T ss_pred ECCCCCCcEEEecCCCCeEeeEECCCCCEEEEEEecC--CCCEEEEEECCCCcEEEeecC-CCcccCcEECCCCCEEEEE
Confidence 4556666555543333344566677666777765432 157899999987666555432 1223456777888888776
Q ss_pred eC--CCCeEEEEeCCCCCeEEEE
Q psy959 86 DA--VREKIERSDLAGNNREALY 106 (109)
Q Consensus 86 D~--~~~~I~~~~~dG~~~~~l~ 106 (109)
-. +...|+.+++++...+.|.
T Consensus 264 ~~~~g~~~Iy~~d~~~~~~~~Lt 286 (435)
T PRK05137 264 LSQGGNTDIYTMDLRSGTTTRLT 286 (435)
T ss_pred EecCCCceEEEEECCCCceEEcc
Confidence 43 3467888888877665553
No 67
>PRK04922 tolB translocation protein TolB; Provisional
Probab=92.42 E-value=3.6 Score=30.85 Aligned_cols=80 Identities=13% Similarity=0.112 Sum_probs=52.3
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCCCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLAGNN 101 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~dG~~ 101 (109)
...+..|....|+++.... ....|+..++++...+.+.... ......++.+..+.|+++... ...|...++++..
T Consensus 295 ~~~~~spDG~~l~f~sd~~--g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~ 371 (433)
T PRK04922 295 TEPTWAPDGKSIYFTSDRG--GRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGS 371 (433)
T ss_pred cceEECCCCCEEEEEECCC--CCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEECCCCceeEEEEECCCCC
Confidence 4567777666777664221 1467999998776655554322 334467888889999987653 3478899988776
Q ss_pred eEEEE
Q psy959 102 REALY 106 (109)
Q Consensus 102 ~~~l~ 106 (109)
.+.+.
T Consensus 372 ~~~Lt 376 (433)
T PRK04922 372 VRTLT 376 (433)
T ss_pred eEECC
Confidence 65543
No 68
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=92.38 E-value=2.2 Score=29.78 Aligned_cols=69 Identities=16% Similarity=0.086 Sum_probs=48.1
Q ss_pred eeCCCCeEEEEecCCCCCCCeEEEEEcCCCC-cEEEEe-----------CCCCCceeEEEecCCCeEEEEeCCCCeEEEE
Q psy959 28 PPIRLRTLYYTDWGRFGTAGKILRTTMAGSL-KKVIIE-----------KDLSQPSGLAIDFDDEMLYWTDAVREKIERS 95 (109)
Q Consensus 28 vd~~~g~lywsd~~~~~~~~~I~~~~~dG~~-~~~l~~-----------~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~ 95 (109)
+.+..|.+|=--|. ..+|.|...+... ...+-- .....++|||.|+..+++|.+-..-..+.-+
T Consensus 181 LE~VdG~lyANVw~----t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~wp~lfEV 256 (262)
T COG3823 181 LEWVDGELYANVWQ----TTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLWPLLFEV 256 (262)
T ss_pred eeeeccEEEEeeee----ecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCcCceeEEE
Confidence 45666777755554 5778887776333 333221 1244799999999999999998888888887
Q ss_pred eCCCC
Q psy959 96 DLAGN 100 (109)
Q Consensus 96 ~~dG~ 100 (109)
.+++.
T Consensus 257 k~~~a 261 (262)
T COG3823 257 KLDEA 261 (262)
T ss_pred EecCC
Confidence 77764
No 69
>PRK04043 tolB translocation protein TolB; Provisional
Probab=92.26 E-value=3.9 Score=30.86 Aligned_cols=77 Identities=12% Similarity=-0.091 Sum_probs=51.6
Q ss_pred EEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC--CCCeEEEEeCCCCCeE
Q psy959 26 FMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA--VREKIERSDLAGNNRE 103 (109)
Q Consensus 26 iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~--~~~~I~~~~~dG~~~~ 103 (109)
.+..|....|.++-... ....|+..++++...+.|..... .-..-...|..++||++.. +...|+.++++|...+
T Consensus 238 ~~~SPDG~~la~~~~~~--g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~ 314 (419)
T PRK04043 238 SDVSKDGSKLLLTMAPK--GQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVE 314 (419)
T ss_pred eEECCCCCEEEEEEccC--CCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCCEEEEEECCCCCceEEEEECCCCCeE
Confidence 34566556787775432 26899999998877666654321 1122357788889998875 3568999999987764
Q ss_pred EE
Q psy959 104 AL 105 (109)
Q Consensus 104 ~l 105 (109)
.+
T Consensus 315 rl 316 (419)
T PRK04043 315 QV 316 (419)
T ss_pred eC
Confidence 44
No 70
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=92.11 E-value=0.77 Score=22.41 Aligned_cols=42 Identities=21% Similarity=0.175 Sum_probs=26.4
Q ss_pred CCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEe
Q psy959 30 IRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAID 76 (109)
Q Consensus 30 ~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD 76 (109)
|..++||.++.. ...|...+........-+.. ...|.+|+++
T Consensus 1 pd~~~lyv~~~~----~~~v~~id~~~~~~~~~i~v-g~~P~~i~~~ 42 (42)
T TIGR02276 1 PDGTKLYVTNSG----SNTVSVIDTATNKVIATIPV-GGYPFGVAVS 42 (42)
T ss_pred CCCCEEEEEeCC----CCEEEEEECCCCeEEEEEEC-CCCCceEEeC
Confidence 346789999987 57888877632222111222 4679988874
No 71
>KOG4499|consensus
Probab=92.03 E-value=1.8 Score=30.85 Aligned_cols=47 Identities=13% Similarity=0.237 Sum_probs=36.0
Q ss_pred CccEEEec--C-----CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC-CCCcEEE
Q psy959 11 THRSLLLQ--A-----IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA-GSLKKVI 62 (109)
Q Consensus 11 ~~~~~li~--~-----l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d-G~~~~~l 62 (109)
++|++++. + -..|.|++||- .|+||.+-+. .+++++.+.. |+.-..+
T Consensus 195 snr~~i~dlrk~~~~e~~~PDGm~ID~-eG~L~Va~~n----g~~V~~~dp~tGK~L~ei 249 (310)
T KOG4499|consen 195 SNRKVIFDLRKSQPFESLEPDGMTIDT-EGNLYVATFN----GGTVQKVDPTTGKILLEI 249 (310)
T ss_pred cCcceeEEeccCCCcCCCCCCcceEcc-CCcEEEEEec----CcEEEEECCCCCcEEEEE
Confidence 46778872 2 35689999998 9999999998 7899998874 6654444
No 72
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.02 E-value=3.1 Score=30.29 Aligned_cols=82 Identities=20% Similarity=0.104 Sum_probs=58.6
Q ss_pred CcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEe----CC----CCCceeEEEecCCCeEEEEeCCCCeEE
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIE----KD----LSQPSGLAIDFDDEMLYWTDAVREKIE 93 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~----~~----~~~p~glaiD~~~~~lYw~D~~~~~I~ 93 (109)
.-.|+|-||.++.+|++--.. .-.|+....+-+.-..=.. .. +.-..|+..|+.++.|+........+.
T Consensus 182 GfEGlA~d~~~~~l~~aKEr~---P~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSgl~~~~~~~~LLVLS~ESr~l~ 258 (316)
T COG3204 182 GFEGLAWDPVDHRLFVAKERN---PIGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSGLEFNAITNSLLVLSDESRRLL 258 (316)
T ss_pred CceeeecCCCCceEEEEEccC---CcEEEEEecCCcccccccccCcccccceEeeccccceecCCCCcEEEEecCCceEE
Confidence 346899999999999887553 5778777754422111010 01 345779999999999999998888999
Q ss_pred EEeCCCCCeEEEE
Q psy959 94 RSDLAGNNREALY 106 (109)
Q Consensus 94 ~~~~dG~~~~~l~ 106 (109)
-++.+|.-+..+.
T Consensus 259 Evd~~G~~~~~ls 271 (316)
T COG3204 259 EVDLSGEVIELLS 271 (316)
T ss_pred EEecCCCeeeeEE
Confidence 9999998655543
No 73
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=91.85 E-value=3.9 Score=30.01 Aligned_cols=81 Identities=10% Similarity=0.042 Sum_probs=54.3
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLA 98 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~d 98 (109)
......++.|...+|+++.... ...+|...++++...+.+.... .....++.+..+.|+++... ...++....+
T Consensus 322 ~~~~~~~~spdg~~i~~~~~~~--~~~~i~~~d~~~~~~~~l~~~~--~~~~p~~spdg~~l~~~~~~~~~~~l~~~~~~ 397 (417)
T TIGR02800 322 GYNASPSWSPDGDLIAFVHREG--GGFNIAVMDLDGGGERVLTDTG--LDESPSFAPNGRMILYATTRGGRGVLGLVSTD 397 (417)
T ss_pred CCccCeEECCCCCEEEEEEccC--CceEEEEEeCCCCCeEEccCCC--CCCCceECCCCCEEEEEEeCCCcEEEEEEECC
Confidence 3345667888777888887542 1357888888875555544321 23345677778899998774 3578888899
Q ss_pred CCCeEEE
Q psy959 99 GNNREAL 105 (109)
Q Consensus 99 G~~~~~l 105 (109)
|...+.+
T Consensus 398 g~~~~~~ 404 (417)
T TIGR02800 398 GRFRARL 404 (417)
T ss_pred CceeeEC
Confidence 9876655
No 74
>PRK01029 tolB translocation protein TolB; Provisional
Probab=91.38 E-value=5 Score=30.29 Aligned_cols=81 Identities=6% Similarity=-0.014 Sum_probs=56.3
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC--CCCeEEEEeCCCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA--VREKIERSDLAGNN 101 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~--~~~~I~~~~~dG~~ 101 (109)
...+..|...+|+++.... ....|.+.++++...+.+... .......+..+..+.||++.. +...|..++++|..
T Consensus 330 ~~p~wSPDG~~Laf~~~~~--g~~~I~v~dl~~g~~~~Lt~~-~~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~ 406 (428)
T PRK01029 330 SCPAWSPDGKKIAFCSVIK--GVRQICVYDLATGRDYQLTTS-PENKESPSWAIDSLHLVYSAGNSNESELYLISLITKK 406 (428)
T ss_pred cceeECCCCCEEEEEEcCC--CCcEEEEEECCCCCeEEccCC-CCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 4566777667788775432 146899999987776666543 234456677778888887654 35689999999987
Q ss_pred eEEEEe
Q psy959 102 REALYL 107 (109)
Q Consensus 102 ~~~l~~ 107 (109)
.+.+..
T Consensus 407 ~~~Lt~ 412 (428)
T PRK01029 407 TRKIVI 412 (428)
T ss_pred EEEeec
Confidence 777654
No 75
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=91.35 E-value=4.5 Score=29.71 Aligned_cols=98 Identities=13% Similarity=0.009 Sum_probs=58.9
Q ss_pred ecCCCCCccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEE
Q psy959 5 DKDLLKTHRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYW 84 (109)
Q Consensus 5 ~~~l~G~~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw 84 (109)
..+.+|...+.+...-....+.+..|...+|+|+.... ....|+..++++...+.+... -......+..+.++.||+
T Consensus 174 ~~d~~g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~~--~~~~i~v~d~~~g~~~~~~~~-~~~~~~~~~spDg~~l~~ 250 (417)
T TIGR02800 174 VADYDGANPQTITRSREPILSPAWSPDGQKLAYVSFES--GKPEIYVQDLATGQREKVASF-PGMNGAPAFSPDGSKLAV 250 (417)
T ss_pred EEcCCCCCCEEeecCCCceecccCCCCCCEEEEEEcCC--CCcEEEEEECCCCCEEEeecC-CCCccceEECCCCCEEEE
Confidence 34556666656553222233456777777888886542 146899888876544444322 123345677777888887
Q ss_pred EeC--CCCeEEEEeCCCCCeEEE
Q psy959 85 TDA--VREKIERSDLAGNNREAL 105 (109)
Q Consensus 85 ~D~--~~~~I~~~~~dG~~~~~l 105 (109)
+.. +...|...++++...+.+
T Consensus 251 ~~~~~~~~~i~~~d~~~~~~~~l 273 (417)
T TIGR02800 251 SLSKDGNPDIYVMDLDGKQLTRL 273 (417)
T ss_pred EECCCCCccEEEEECCCCCEEEC
Confidence 644 345788888876654443
No 76
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=91.14 E-value=0.22 Score=24.75 Aligned_cols=20 Identities=10% Similarity=0.016 Sum_probs=16.7
Q ss_pred CCcceEEeeCCCCeEEEEecC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWG 41 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~ 41 (109)
..|.+||+|+ .|++|.+-..
T Consensus 13 ~~~~~IavD~-~GNiYv~G~T 32 (38)
T PF06739_consen 13 DYGNGIAVDS-NGNIYVTGYT 32 (38)
T ss_pred eeEEEEEECC-CCCEEEEEee
Confidence 4689999998 8999988653
No 77
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=90.81 E-value=5.3 Score=29.64 Aligned_cols=80 Identities=10% Similarity=-0.138 Sum_probs=54.9
Q ss_pred CCCcceEEeeCCCCeEEEEecCC----CC-CCCeEEEEEcC-CCCcEEEEeC------CCCCceeEEEecCCCeEEEEeC
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGR----FG-TAGKILRTTMA-GSLKKVIIEK------DLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~----~~-~~~~I~~~~~d-G~~~~~l~~~------~~~~p~glaiD~~~~~lYw~D~ 87 (109)
-..|+++ +.|..+.||.+.... .| ....|...+.. ++....|... ....|..+++.+.+++||.++.
T Consensus 46 G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~ 124 (352)
T TIGR02658 46 GFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQF 124 (352)
T ss_pred cCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecC
Confidence 3689996 899889999998720 00 15778887764 4444444321 1345669999999999999987
Q ss_pred C-CCeEEEEeCCCC
Q psy959 88 V-REKIERSDLAGN 100 (109)
Q Consensus 88 ~-~~~I~~~~~dG~ 100 (109)
. ...+..+++...
T Consensus 125 ~p~~~V~VvD~~~~ 138 (352)
T TIGR02658 125 SPSPAVGVVDLEGK 138 (352)
T ss_pred CCCCEEEEEECCCC
Confidence 6 667777766543
No 78
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=90.48 E-value=5.3 Score=29.11 Aligned_cols=84 Identities=14% Similarity=0.111 Sum_probs=54.1
Q ss_pred CCCcceEEee-CCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC----CCeEEE
Q psy959 20 IPGRRHFMPP-IRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV----REKIER 94 (109)
Q Consensus 20 l~~p~~iavd-~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~----~~~I~~ 94 (109)
+.....+..- +....++|.-... +...|+..+.+|...+.|......--.-+.+|..++.||++-.. ...+++
T Consensus 234 v~~~~~~~~~~~~~~~~l~~s~~~--G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~ 311 (353)
T PF00930_consen 234 VDVYDPPHFLGPDGNEFLWISERD--GYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDEDNNRIYFTANGDNPGERHLYR 311 (353)
T ss_dssp SSSSSEEEE-TTTSSEEEEEEETT--SSEEEEEEETTSSEEEESS-SSS-EEEEEEEECTSSEEEEEESSGGTTSBEEEE
T ss_pred eeeecccccccCCCCEEEEEEEcC--CCcEEEEEcccccceeccccCceeecccceEcCCCCEEEEEecCCCCCceEEEE
Confidence 4344444432 3344444443321 37899999999988665544333322347899999999999875 569999
Q ss_pred EeCC-CCCeEEE
Q psy959 95 SDLA-GNNREAL 105 (109)
Q Consensus 95 ~~~d-G~~~~~l 105 (109)
++++ |...+.|
T Consensus 312 v~~~~~~~~~~L 323 (353)
T PF00930_consen 312 VSLDSGGEPKCL 323 (353)
T ss_dssp EETTETTEEEES
T ss_pred EEeCCCCCeEec
Confidence 9999 7776655
No 79
>PRK02889 tolB translocation protein TolB; Provisional
Probab=90.01 E-value=6.6 Score=29.45 Aligned_cols=80 Identities=24% Similarity=0.224 Sum_probs=50.1
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCCCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLAGNN 101 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~dG~~ 101 (109)
...+..|....|+++... +....|+..++++...+.+.... ......++.+..+.|+++... ...|...++++..
T Consensus 287 ~~~~wSpDG~~l~f~s~~--~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~ 363 (427)
T PRK02889 287 TEPFFSPDGRSIYFTSDR--GGAPQIYRMPASGGAAQRVTFTG-SYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQ 363 (427)
T ss_pred cCeEEcCCCCEEEEEecC--CCCcEEEEEECCCCceEEEecCC-CCcCceEECCCCCEEEEEEccCCcEEEEEEECCCCC
Confidence 345667756677765321 12578999988776655444322 223346788888898887653 2478888888766
Q ss_pred eEEEE
Q psy959 102 REALY 106 (109)
Q Consensus 102 ~~~l~ 106 (109)
.+.+.
T Consensus 364 ~~~lt 368 (427)
T PRK02889 364 VTALT 368 (427)
T ss_pred eEEcc
Confidence 65554
No 80
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=89.67 E-value=4.4 Score=30.51 Aligned_cols=72 Identities=17% Similarity=0.236 Sum_probs=43.0
Q ss_pred CCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCCCCeEEEEe
Q psy959 32 LRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAGNNREALYL 107 (109)
Q Consensus 32 ~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG~~~~~l~~ 107 (109)
+..||.++.. ....++..+++....+.|.........|..+.+.++.+||+... ..+.+++++....++|+.
T Consensus 48 ~kllF~s~~d---g~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~-~~l~~vdL~T~e~~~vy~ 119 (386)
T PF14583_consen 48 RKLLFASDFD---GNRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG-RSLRRVDLDTLEERVVYE 119 (386)
T ss_dssp -EEEEEE-TT---SS-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT-TEEEEEETTT--EEEEEE
T ss_pred CEEEEEeccC---CCcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC-CeEEEEECCcCcEEEEEE
Confidence 3466666644 26888999998877777766433344467777889999887643 588999999888777764
No 81
>KOG4649|consensus
Probab=89.63 E-value=2.1 Score=30.96 Aligned_cols=30 Identities=7% Similarity=-0.083 Sum_probs=23.2
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEc
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTM 54 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~ 54 (109)
..-.-+|||+.+|+|||-..- ..+|+...+
T Consensus 31 Hs~~~~avd~~sG~~~We~il----g~RiE~sa~ 60 (354)
T KOG4649|consen 31 HSGIVIAVDPQSGNLIWEAIL----GVRIECSAI 60 (354)
T ss_pred CCceEEEecCCCCcEEeehhh----CceeeeeeE
Confidence 344568999999999998765 578877665
No 82
>PRK03629 tolB translocation protein TolB; Provisional
Probab=89.63 E-value=7.2 Score=29.33 Aligned_cols=79 Identities=10% Similarity=0.032 Sum_probs=53.0
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC--CCCeEEEEeCCCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA--VREKIERSDLAGNN 101 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~--~~~~I~~~~~dG~~ 101 (109)
.+.+..|...+|+|+.... ....|+..++++...+.+... -......+..+..++|+++-. +...|+.++++|..
T Consensus 246 ~~~~~SPDG~~La~~~~~~--g~~~I~~~d~~tg~~~~lt~~-~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~ 322 (429)
T PRK03629 246 GAPAFSPDGSKLAFALSKT--GSLNLYVMDLASGQIRQVTDG-RSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGA 322 (429)
T ss_pred CCeEECCCCCEEEEEEcCC--CCcEEEEEECCCCCEEEccCC-CCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCC
Confidence 3567888777899875431 146799999987666555432 234456678888888766543 34689999988876
Q ss_pred eEEE
Q psy959 102 REAL 105 (109)
Q Consensus 102 ~~~l 105 (109)
.+.+
T Consensus 323 ~~~l 326 (429)
T PRK03629 323 PQRI 326 (429)
T ss_pred eEEe
Confidence 5554
No 83
>PRK00178 tolB translocation protein TolB; Provisional
Probab=89.14 E-value=7.6 Score=28.90 Aligned_cols=79 Identities=15% Similarity=0.097 Sum_probs=50.6
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCCCCCe
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLAGNNR 102 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~dG~~~ 102 (109)
..+..|..+.||++.... ....|++.++++...+.+.... ......++.+..+.|+++... ...|...++++...
T Consensus 291 ~~~~spDg~~i~f~s~~~--g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~ 367 (430)
T PRK00178 291 EPFWGKDGRTLYFTSDRG--GKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSV 367 (430)
T ss_pred CeEECCCCCEEEEEECCC--CCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEEEccCCceEEEEEECCCCCE
Confidence 455666666787764321 2578999998766655443322 223345678888999988753 34688888887766
Q ss_pred EEEE
Q psy959 103 EALY 106 (109)
Q Consensus 103 ~~l~ 106 (109)
+.+.
T Consensus 368 ~~lt 371 (430)
T PRK00178 368 RILT 371 (430)
T ss_pred EEcc
Confidence 6554
No 84
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=88.08 E-value=9.9 Score=28.92 Aligned_cols=77 Identities=14% Similarity=0.037 Sum_probs=49.9
Q ss_pred EeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC--CCCeEEEEeCCCCCeEE
Q psy959 27 MPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA--VREKIERSDLAGNNREA 104 (109)
Q Consensus 27 avd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~--~~~~I~~~~~dG~~~~~ 104 (109)
+.-|...+|.++-... ....|+..+++|.....|....-..- .=.+-|..+.||++.. +...|.+++.+|+..+.
T Consensus 244 ~fspDG~~l~f~~~rd--g~~~iy~~dl~~~~~~~Lt~~~gi~~-~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~r 320 (425)
T COG0823 244 AFSPDGSKLAFSSSRD--GSPDIYLMDLDGKNLPRLTNGFGINT-SPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTR 320 (425)
T ss_pred cCCCCCCEEEEEECCC--CCccEEEEcCCCCcceecccCCcccc-CccCCCCCCEEEEEeCCCCCcceEEECCCCCceeE
Confidence 3444456676665443 37999999999988665543211111 1244566788877755 46799999999998655
Q ss_pred EE
Q psy959 105 LY 106 (109)
Q Consensus 105 l~ 106 (109)
+.
T Consensus 321 iT 322 (425)
T COG0823 321 LT 322 (425)
T ss_pred ee
Confidence 43
No 85
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=84.72 E-value=14 Score=27.29 Aligned_cols=75 Identities=25% Similarity=0.201 Sum_probs=56.5
Q ss_pred CCCeEEEEecCCCC---------CCCeEEEEEcCCCCcEEEEeC-CCCCceeEEEecC-----CCeEEEEeCCCCeEEEE
Q psy959 31 RLRTLYYTDWGRFG---------TAGKILRTTMAGSLKKVIIEK-DLSQPSGLAIDFD-----DEMLYWTDAVREKIERS 95 (109)
Q Consensus 31 ~~g~lywsd~~~~~---------~~~~I~~~~~dG~~~~~l~~~-~~~~p~glaiD~~-----~~~lYw~D~~~~~I~~~ 95 (109)
..++||++-..++. ....|.+.+++|...+.+.+. .+..|+|||+-|. .+.|.....+.++|...
T Consensus 197 ig~~lyVtYA~qd~~~~d~v~G~G~G~VdvFd~~G~l~~r~as~g~LNaPWG~a~APa~FG~~sg~lLVGNFGDG~InaF 276 (336)
T TIGR03118 197 LGGTLYVTYAQQDADRNDEVAGAGLGYVNVFTLNGQLLRRVASSGRLNAPWGLAIAPESFGSLSGALLVGNFGDGTINAY 276 (336)
T ss_pred ECCeEEEEEEecCCcccccccCCCcceEEEEcCCCcEEEEeccCCcccCCceeeeChhhhCCCCCCeEEeecCCceeEEe
Confidence 47899999644311 246899999999998877653 5899999999654 57888888899999999
Q ss_pred eCC-CCCeEEE
Q psy959 96 DLA-GNNREAL 105 (109)
Q Consensus 96 ~~d-G~~~~~l 105 (109)
+.. |...-.|
T Consensus 277 D~~sG~~~g~L 287 (336)
T TIGR03118 277 DPQSGAQLGQL 287 (336)
T ss_pred cCCCCceeeee
Confidence 875 6544333
No 86
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=84.59 E-value=13 Score=27.11 Aligned_cols=79 Identities=18% Similarity=0.286 Sum_probs=54.2
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC-cEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL-KKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAG 99 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~-~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG 99 (109)
..-+.|..|| -|.+.-|+++ .+..++.+..-+. ++-=+...-.+|..+-|| ..++++..|...+.|-+++-.-
T Consensus 233 ~gsRriwsdp-ig~~wittwg----~g~l~rfdPs~~sW~eypLPgs~arpys~rVD-~~grVW~sea~agai~rfdpet 306 (353)
T COG4257 233 AGSRRIWSDP-IGRAWITTWG----TGSLHRFDPSVTSWIEYPLPGSKARPYSMRVD-RHGRVWLSEADAGAIGRFDPET 306 (353)
T ss_pred ccccccccCc-cCcEEEeccC----CceeeEeCcccccceeeeCCCCCCCcceeeec-cCCcEEeeccccCceeecCccc
Confidence 3446688898 7889889988 6888888764333 332233345688899999 4567777798888888777654
Q ss_pred CCeEEE
Q psy959 100 NNREAL 105 (109)
Q Consensus 100 ~~~~~l 105 (109)
..-.++
T Consensus 307 a~ftv~ 312 (353)
T COG4257 307 ARFTVL 312 (353)
T ss_pred ceEEEe
Confidence 444443
No 87
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=83.15 E-value=18 Score=27.45 Aligned_cols=75 Identities=23% Similarity=0.255 Sum_probs=48.0
Q ss_pred CcceEEeeCC------CCeEEEEecCCCCCCCeEEEEEcCCCCc---EEEEeCC-CCCceeEEEecCCCeEEEEeCCCCe
Q psy959 22 GRRHFMPPIR------LRTLYYTDWGRFGTAGKILRTTMAGSLK---KVIIEKD-LSQPSGLAIDFDDEMLYWTDAVREK 91 (109)
Q Consensus 22 ~p~~iavd~~------~g~lywsd~~~~~~~~~I~~~~~dG~~~---~~l~~~~-~~~p~glaiD~~~~~lYw~D~~~~~ 91 (109)
.|-|+++-.- .+.+|....+ ...+.+...+|..+ +.++..+ -.+|.++++.+..-.|.-+|.+.++
T Consensus 315 ApsGmaFy~G~~fP~~r~~lfV~~hg----sw~~~~~~~~g~~~~~~~~fl~~d~~gR~~dV~v~~DGallv~~D~~~g~ 390 (399)
T COG2133 315 APSGMAFYTGDLFPAYRGDLFVGAHG----SWPVLRLRPDGNYKVVLTGFLSGDLGGRPRDVAVAPDGALLVLTDQGDGR 390 (399)
T ss_pred ccceeEEecCCcCccccCcEEEEeec----ceeEEEeccCCCcceEEEEEEecCCCCcccceEECCCCeEEEeecCCCCe
Confidence 4566665421 2567777665 34677788888843 3333322 2699999998665555555555789
Q ss_pred EEEEeCCCC
Q psy959 92 IERSDLAGN 100 (109)
Q Consensus 92 I~~~~~dG~ 100 (109)
|.++...|.
T Consensus 391 i~Rv~~~~~ 399 (399)
T COG2133 391 ILRVSYAGT 399 (399)
T ss_pred EEEecCCCC
Confidence 999988763
No 88
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=82.67 E-value=17 Score=26.94 Aligned_cols=71 Identities=18% Similarity=0.217 Sum_probs=46.7
Q ss_pred CCcc-eEEeeCCCCeEEEEecCCCCCCCeEEEEEcC-CCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCC
Q psy959 21 PGRR-HFMPPIRLRTLYYTDWGRFGTAGKILRTTMA-GSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLA 98 (109)
Q Consensus 21 ~~p~-~iavd~~~g~lywsd~~~~~~~~~I~~~~~d-G~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~d 98 (109)
..|. +++..+..+++|.++. .+.|.+.++. +.....+- ....|.|+++.+..+.+|-+....+.+...+..
T Consensus 36 ~~~h~~~~~s~Dgr~~yv~~r-----dg~vsviD~~~~~~v~~i~--~G~~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~ 108 (369)
T PF02239_consen 36 GAPHAGLKFSPDGRYLYVANR-----DGTVSVIDLATGKVVATIK--VGGNPRGIAVSPDGKYVYVANYEPGTVSVIDAE 108 (369)
T ss_dssp TTEEEEEE-TT-SSEEEEEET-----TSEEEEEETTSSSEEEEEE---SSEEEEEEE--TTTEEEEEEEETTEEEEEETT
T ss_pred CCceeEEEecCCCCEEEEEcC-----CCeEEEEECCcccEEEEEe--cCCCcceEEEcCCCCEEEEEecCCCceeEeccc
Confidence 4443 4666776778999864 4788888885 44333442 245799999999999999998877777666543
No 89
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=82.09 E-value=16 Score=28.19 Aligned_cols=61 Identities=18% Similarity=0.343 Sum_probs=33.9
Q ss_pred cceEEeeCCCCeEEEEecCCCCCCCeEEEEEc-CCCCcEEEEe---CC-C--------------CCceeEEEecCCCeEE
Q psy959 23 RRHFMPPIRLRTLYYTDWGRFGTAGKILRTTM-AGSLKKVIIE---KD-L--------------SQPSGLAIDFDDEMLY 83 (109)
Q Consensus 23 p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~-dG~~~~~l~~---~~-~--------------~~p~glaiD~~~~~lY 83 (109)
|..|.+-...++||++.|. .+.|...+. |..+.+..-+ .+ + .-|+=+.+....+|||
T Consensus 314 itDI~iSlDDrfLYvs~W~----~GdvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlY 389 (461)
T PF05694_consen 314 ITDILISLDDRFLYVSNWL----HGDVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLY 389 (461)
T ss_dssp ---EEE-TTS-EEEEEETT----TTEEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEE
T ss_pred eEeEEEccCCCEEEEEccc----CCcEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEE
Confidence 4556666668999999998 688888888 5555443332 11 1 1356677888899999
Q ss_pred EEeC
Q psy959 84 WTDA 87 (109)
Q Consensus 84 w~D~ 87 (109)
+|.+
T Consensus 390 vTnS 393 (461)
T PF05694_consen 390 VTNS 393 (461)
T ss_dssp EE--
T ss_pred EEee
Confidence 9987
No 90
>PRK02888 nitrous-oxide reductase; Validated
Probab=82.04 E-value=25 Score=28.33 Aligned_cols=32 Identities=13% Similarity=0.211 Sum_probs=27.0
Q ss_pred CCCceeEEEecCCCeEEEEeCCCCeEEEEeCC
Q psy959 67 LSQPSGLAIDFDDEMLYWTDAVREKIERSDLA 98 (109)
Q Consensus 67 ~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~d 98 (109)
-.+|+|+++.|..+++|.+......+...++.
T Consensus 320 GKsPHGV~vSPDGkylyVanklS~tVSVIDv~ 351 (635)
T PRK02888 320 PKNPHGVNTSPDGKYFIANGKLSPTVTVIDVR 351 (635)
T ss_pred CCCccceEECCCCCEEEEeCCCCCcEEEEECh
Confidence 45899999999999999999887777776654
No 91
>PRK01742 tolB translocation protein TolB; Provisional
Probab=81.53 E-value=20 Score=26.87 Aligned_cols=77 Identities=9% Similarity=0.015 Sum_probs=50.8
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC--CCCeEEEEeCCCCCe
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA--VREKIERSDLAGNNR 102 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~--~~~~I~~~~~dG~~~ 102 (109)
+++..|....|+++-... ....|+..++++...+.+.. .-......+..+..++|+++.. +...|+.++.+|...
T Consensus 252 ~~~wSPDG~~La~~~~~~--g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~ 328 (429)
T PRK01742 252 APAFSPDGSRLAFASSKD--GVLNIYVMGANGGTPSQLTS-GAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGA 328 (429)
T ss_pred ceeECCCCCEEEEEEecC--CcEEEEEEECCCCCeEeecc-CCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCe
Confidence 567778666788864321 14578888888776655543 2233556788888888887643 456888888877655
Q ss_pred EE
Q psy959 103 EA 104 (109)
Q Consensus 103 ~~ 104 (109)
+.
T Consensus 329 ~~ 330 (429)
T PRK01742 329 SL 330 (429)
T ss_pred EE
Confidence 43
No 92
>PF14759 Reductase_C: Reductase C-terminal; PDB: 3FG2_P 3LXD_A 2YVG_A 2GR1_A 2GQW_A 2GR3_A 2YVF_A 1F3P_A 2GR0_A 2GR2_A ....
Probab=80.36 E-value=8.6 Score=22.24 Aligned_cols=28 Identities=18% Similarity=0.359 Sum_probs=20.0
Q ss_pred EEEecCCCCCCCeEEEEEcCCCCcEEEEeCCC
Q psy959 36 YYTDWGRFGTAGKILRTTMAGSLKKVIIEKDL 67 (109)
Q Consensus 36 ywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~ 67 (109)
||||.. ..+|+.+..-+..-++++..+.
T Consensus 2 FWSdQ~----~~~iq~~G~~~~~~~~v~rg~~ 29 (85)
T PF14759_consen 2 FWSDQY----GVRIQIAGLPGGADEVVVRGDP 29 (85)
T ss_dssp EEEEET----TEEEEEEE-STTSSEEEEEEET
T ss_pred eecccC----CCeEEEEECCCCCCEEEEEccC
Confidence 899998 6899999986665556655443
No 93
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=79.98 E-value=19 Score=25.73 Aligned_cols=68 Identities=19% Similarity=0.221 Sum_probs=46.3
Q ss_pred eeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEe-C--------------CCCCceeEEEecCCCeEEEEeCCCCeE
Q psy959 28 PPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIE-K--------------DLSQPSGLAIDFDDEMLYWTDAVREKI 92 (109)
Q Consensus 28 vd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~-~--------------~~~~p~glaiD~~~~~lYw~D~~~~~I 92 (109)
+.+.+|.||=--|. ...|.+.+.......-.+. . ...-.+|||.|+.++++|.|-..-.++
T Consensus 180 LE~i~G~IyANVW~----td~I~~Idp~tG~V~~~iDls~L~~~~~~~~~~~~~~dVLNGIAyd~~~~~l~vTGK~Wp~l 255 (264)
T PF05096_consen 180 LEYINGKIYANVWQ----TDRIVRIDPETGKVVGWIDLSGLRPEVGRDKSRQPDDDVLNGIAYDPETDRLFVTGKLWPKL 255 (264)
T ss_dssp EEEETTEEEEEETT----SSEEEEEETTT-BEEEEEE-HHHHHHHTSTTST--TTS-EEEEEEETTTTEEEEEETT-SEE
T ss_pred EEEEcCEEEEEeCC----CCeEEEEeCCCCeEEEEEEhhHhhhcccccccccccCCeeEeEeEeCCCCEEEEEeCCCCce
Confidence 34557888755565 6899999986444433332 1 123578999999999999998888888
Q ss_pred EEEeCCC
Q psy959 93 ERSDLAG 99 (109)
Q Consensus 93 ~~~~~dG 99 (109)
..+.+..
T Consensus 256 yeV~l~e 262 (264)
T PF05096_consen 256 YEVKLVE 262 (264)
T ss_dssp EEEEEEE
T ss_pred EEEEEEe
Confidence 7776543
No 94
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=79.03 E-value=7.7 Score=28.28 Aligned_cols=56 Identities=14% Similarity=0.131 Sum_probs=38.5
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcC-CCCcEEEEeCCCCCceeEEEecCCCe
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMA-GSLKKVIIEKDLSQPSGLAIDFDDEM 81 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~d-G~~~~~l~~~~~~~p~glaiD~~~~~ 81 (109)
-+++|+.++.|||+..........+++++++ |...+.|....... ...++.+..+.
T Consensus 285 i~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~LT~~~~~~-~~~~~Spdg~y 341 (353)
T PF00930_consen 285 ILGWDEDNNRIYFTANGDNPGERHLYRVSLDSGGEPKCLTCEDGDH-YSASFSPDGKY 341 (353)
T ss_dssp EEEEECTSSEEEEEESSGGTTSBEEEEEETTETTEEEESSTTSSTT-EEEEE-TTSSE
T ss_pred cceEcCCCCEEEEEecCCCCCceEEEEEEeCCCCCeEeccCCCCCc-eEEEECCCCCE
Confidence 3678999999999987644457899999999 77766664332222 36677665543
No 95
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=76.18 E-value=11 Score=27.80 Aligned_cols=38 Identities=18% Similarity=0.198 Sum_probs=30.5
Q ss_pred CCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCC-CCCeEE
Q psy959 65 KDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLA-GNNREA 104 (109)
Q Consensus 65 ~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~d-G~~~~~ 104 (109)
+++..|++--. ..++||.+|++.+.+..++.+ |....+
T Consensus 200 ~GLsmPhSPRW--hdgrLwvldsgtGev~~vD~~~G~~e~V 238 (335)
T TIGR03032 200 SGLSMPHSPRW--YQGKLWLLNSGRGELGYVDPQAGKFQPV 238 (335)
T ss_pred cCccCCcCCcE--eCCeEEEEECCCCEEEEEcCCCCcEEEE
Confidence 56778887664 489999999999999999998 665444
No 96
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=74.93 E-value=4 Score=21.33 Aligned_cols=24 Identities=25% Similarity=0.236 Sum_probs=20.3
Q ss_pred CCCCceeEEEecCCCeEEEEeCCC
Q psy959 66 DLSQPSGLAIDFDDEMLYWTDAVR 89 (109)
Q Consensus 66 ~~~~p~glaiD~~~~~lYw~D~~~ 89 (109)
.-..|.||.+|+.++.|.|+-...
T Consensus 9 ~~~LP~gLs~d~~tG~isGtp~~~ 32 (49)
T PF05345_consen 9 GGGLPSGLSLDPSTGTISGTPTSS 32 (49)
T ss_pred CCCCCCcEEEeCCCCEEEeecCCC
Confidence 456899999999999999996543
No 97
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=73.18 E-value=35 Score=25.23 Aligned_cols=81 Identities=14% Similarity=0.116 Sum_probs=53.8
Q ss_pred eeecCCCCCccEEEec--CCCCcceEEeeCC-----CCeEEEEecCCCCCCCeEEEEEcC-CCCcEEEEeCC-----CCC
Q psy959 3 STDKDLLKTHRSLLLQ--AIPGRRHFMPPIR-----LRTLYYTDWGRFGTAGKILRTTMA-GSLKKVIIEKD-----LSQ 69 (109)
Q Consensus 3 ~~~~~l~G~~~~~li~--~l~~p~~iavd~~-----~g~lywsd~~~~~~~~~I~~~~~d-G~~~~~l~~~~-----~~~ 69 (109)
..+.+++|...+.+.+ .|+.|.|||+-|. .|.|..-+.+ ..+|...+.. |...-.|.... +.=
T Consensus 224 VdvFd~~G~l~~r~as~g~LNaPWG~a~APa~FG~~sg~lLVGNFG----DG~InaFD~~sG~~~g~L~~~~G~pi~i~G 299 (336)
T TIGR03118 224 VNVFTLNGQLLRRVASSGRLNAPWGLAIAPESFGSLSGALLVGNFG----DGTINAYDPQSGAQLGQLLDPDNHPVKVDG 299 (336)
T ss_pred EEEEcCCCcEEEEeccCCcccCCceeeeChhhhCCCCCCeEEeecC----CceeEEecCCCCceeeeecCCCCCeEEecC
Confidence 4567889997766663 5899999999663 4777788887 6899999875 66555554321 112
Q ss_pred ceeEEEec-----CCCeEEEEeC
Q psy959 70 PSGLAIDF-----DDEMLYWTDA 87 (109)
Q Consensus 70 p~glaiD~-----~~~~lYw~D~ 87 (109)
-++|+.-. ....||++-.
T Consensus 300 LWgL~fGng~~~~~~ntLyFaAG 322 (336)
T TIGR03118 300 LWSLTFGNGVSGGSANYLYFTAG 322 (336)
T ss_pred eEEeeeCCCcCCCCcceEEEEeC
Confidence 34455433 2357888854
No 98
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=69.88 E-value=40 Score=24.53 Aligned_cols=67 Identities=13% Similarity=0.150 Sum_probs=42.5
Q ss_pred CCCCcceEEe--eCCCCeEEEEecCCCCCCCeEEEEEc-CCCC---cEEEEeC--CCCCceeEEEecCCCeEEEEeCC
Q psy959 19 AIPGRRHFMP--PIRLRTLYYTDWGRFGTAGKILRTTM-AGSL---KKVIIEK--DLSQPSGLAIDFDDEMLYWTDAV 88 (109)
Q Consensus 19 ~l~~p~~iav--d~~~g~lywsd~~~~~~~~~I~~~~~-dG~~---~~~l~~~--~~~~p~glaiD~~~~~lYw~D~~ 88 (109)
++..|.|+++ ++.+|-+|..-..+ .+.|....+ |+.+ +.-++.+ --.+-.|+..|-..+.||.++..
T Consensus 151 ~~s~~YGl~lyrs~ktgd~yvfV~~~---qG~~~Qy~l~d~gnGkv~~k~vR~fk~~tQTEG~VaDdEtG~LYIaeEd 225 (364)
T COG4247 151 SSSSAYGLALYRSPKTGDYYVFVNRR---QGDIAQYKLIDQGNGKVGTKLVRQFKIPTQTEGMVADDETGFLYIAEED 225 (364)
T ss_pred CcccceeeEEEecCCcCcEEEEEecC---CCceeEEEEEecCCceEcceeeEeeecCCcccceeeccccceEEEeecc
Confidence 4677888887 56667665554443 466666655 4333 2222221 12367899999999999999864
No 99
>PRK01029 tolB translocation protein TolB; Provisional
Probab=67.87 E-value=50 Score=24.92 Aligned_cols=80 Identities=14% Similarity=0.048 Sum_probs=50.7
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCC--CcEEEEeCCCCCceeEEEecCCCeEEEEeCC--CCeEEEEeCCC
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGS--LKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV--REKIERSDLAG 99 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~--~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~--~~~I~~~~~dG 99 (109)
...+..|....|+|+.... ....|++.++++. ..+.+. .........+..|..++|+++... ...|...++++
T Consensus 284 ~~p~wSPDG~~Laf~s~~~--g~~~ly~~~~~~~g~~~~~lt-~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~ 360 (428)
T PRK01029 284 GNPSFSPDGTRLVFVSNKD--GRPRIYIMQIDPEGQSPRLLT-KKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLAT 360 (428)
T ss_pred CCeEECCCCCEEEEEECCC--CCceEEEEECcccccceEEec-cCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCC
Confidence 4557777566677764321 1468898888643 233332 222334456788888888887553 45788899888
Q ss_pred CCeEEEE
Q psy959 100 NNREALY 106 (109)
Q Consensus 100 ~~~~~l~ 106 (109)
...+.+.
T Consensus 361 g~~~~Lt 367 (428)
T PRK01029 361 GRDYQLT 367 (428)
T ss_pred CCeEEcc
Confidence 7766654
No 100
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=65.78 E-value=69 Score=25.71 Aligned_cols=68 Identities=19% Similarity=0.134 Sum_probs=44.0
Q ss_pred CCCcceEEeeCCCCeEEEEecCCC------------CCCCeEEEEEcCCC---C----cEEEEe----------------
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRF------------GTAGKILRTTMAGS---L----KKVIIE---------------- 64 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~------------~~~~~I~~~~~dG~---~----~~~l~~---------------- 64 (109)
+..|..|++.|.+|.+|++..... ...+.|.|....+. . -..++.
T Consensus 416 mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~~ 495 (616)
T COG3211 416 MDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANIN 495 (616)
T ss_pred ccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCcc
Confidence 679999999999999999975531 12456777665332 1 122222
Q ss_pred -CCCCCceeEEEecCCCeEEEEeC
Q psy959 65 -KDLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 65 -~~~~~p~glaiD~~~~~lYw~D~ 87 (109)
..+..|-+|++|+..+..--+|.
T Consensus 496 ~~~f~~PDnl~fD~~GrLWi~TDg 519 (616)
T COG3211 496 ANWFNSPDNLAFDPWGRLWIQTDG 519 (616)
T ss_pred cccccCCCceEECCCCCEEEEecC
Confidence 11445999999987766444554
No 101
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=57.49 E-value=61 Score=25.62 Aligned_cols=53 Identities=21% Similarity=0.364 Sum_probs=35.1
Q ss_pred CCCeEEEEEcCCCCcEEEEe--CCCCCceeEEEecCCCeEEEEeC--CCCeEEEEeCCCCCe
Q psy959 45 TAGKILRTTMAGSLKKVIIE--KDLSQPSGLAIDFDDEMLYWTDA--VREKIERSDLAGNNR 102 (109)
Q Consensus 45 ~~~~I~~~~~dG~~~~~l~~--~~~~~p~glaiD~~~~~lYw~D~--~~~~I~~~~~dG~~~ 102 (109)
..++|....-.|...+-++. .++.+|. .| .+++|+..- +.+.|+++++||+..
T Consensus 204 trGklWis~d~g~tFeK~vdl~~~vS~Pm--IV---~~RvYFlsD~eG~GnlYSvdldGkDl 260 (668)
T COG4946 204 TRGKLWISSDGGKTFEKFVDLDGNVSSPM--IV---GERVYFLSDHEGVGNLYSVDLDGKDL 260 (668)
T ss_pred ccceEEEEecCCcceeeeeecCCCcCCce--EE---cceEEEEecccCccceEEeccCCchh
Confidence 35666666655645555554 3455664 33 678887654 578999999999763
No 102
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=56.31 E-value=35 Score=26.78 Aligned_cols=23 Identities=22% Similarity=0.543 Sum_probs=20.4
Q ss_pred CCCCceeEEEecCCCeEEEEeCC
Q psy959 66 DLSQPSGLAIDFDDEMLYWTDAV 88 (109)
Q Consensus 66 ~~~~p~glaiD~~~~~lYw~D~~ 88 (109)
.+.+|.++++++.++.+|++-..
T Consensus 348 ~f~RpEgi~~~p~~g~vY~a~T~ 370 (524)
T PF05787_consen 348 PFDRPEGITVNPDDGEVYFALTN 370 (524)
T ss_pred cccCccCeeEeCCCCEEEEEEec
Confidence 47899999999999999999754
No 103
>KOG0315|consensus
Probab=56.11 E-value=74 Score=23.02 Aligned_cols=54 Identities=19% Similarity=0.146 Sum_probs=40.4
Q ss_pred ceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCC-CcEEEEeCCCCCceeEEEecCCCeE
Q psy959 24 RHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGS-LKKVIIEKDLSQPSGLAIDFDDEML 82 (109)
Q Consensus 24 ~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~-~~~~l~~~~~~~p~glaiD~~~~~l 82 (109)
..+.++|..+.|+..|.. +.|..-++-.. -.+.++.+.......|+|++....|
T Consensus 128 n~vvlhpnQteLis~dqs-----g~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml 182 (311)
T KOG0315|consen 128 NTVVLHPNQTELISGDQS-----GNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSML 182 (311)
T ss_pred ceEEecCCcceEEeecCC-----CcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEE
Confidence 578999999999999975 78888887433 3445555666778889998876554
No 104
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=55.17 E-value=84 Score=24.58 Aligned_cols=16 Identities=0% Similarity=-0.193 Sum_probs=9.8
Q ss_pred EEeeCCCCeEEEEecC
Q psy959 26 FMPPIRLRTLYYTDWG 41 (109)
Q Consensus 26 iavd~~~g~lywsd~~ 41 (109)
+|+|..+|.+-|.-..
T Consensus 274 vAld~~TG~~~W~~Q~ 289 (527)
T TIGR03075 274 VARDPDTGKIKWHYQT 289 (527)
T ss_pred EEEccccCCEEEeeeC
Confidence 5666666666666433
No 105
>PF14251 DUF4346: Domain of unknown function (DUF4346)
Probab=54.78 E-value=51 Score=20.70 Aligned_cols=65 Identities=17% Similarity=0.132 Sum_probs=38.1
Q ss_pred CcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCCC
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAGN 100 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG~ 100 (109)
+-+-|++||..=+|...|.. .+.|..-. . .+.-.-.|+|+||.++.+.=+.....+-...-+.|.
T Consensus 8 S~R~i~LDp~GYfiI~~d~~----~~~i~a~h---------~-~n~I~~~Gla~Dpetge~i~~~g~~~r~~~~~~~Gr 72 (119)
T PF14251_consen 8 SQRFIDLDPAGYFIIYVDRE----AGEICAEH---------Y-TNDIDDKGLAVDPETGEVIPCRGKVKRTPSIVFKGR 72 (119)
T ss_pred hcCccccCCCccEEEEEeCC----CCeeeHhh---------c-cCccCcccceeCCCCCCEEEEecCCCCceeEEEecC
Confidence 34668899944455566654 23332111 0 112234489999999999888876555555555554
No 106
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=54.45 E-value=45 Score=23.49 Aligned_cols=34 Identities=3% Similarity=-0.185 Sum_probs=24.4
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCC
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGS 57 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~ 57 (109)
.+-++|||.||..+++|.|-.. -+.+..+.+++.
T Consensus 228 ~nvlNGIA~~~~~~r~~iTGK~----wp~lfEVk~~~a 261 (262)
T COG3823 228 DNVLNGIAHDPQQDRFLITGKL----WPLLFEVKLDEA 261 (262)
T ss_pred cccccceeecCcCCeEEEecCc----CceeEEEEecCC
Confidence 4568899999988899988654 466666655543
No 107
>cd01248 PH_PLC Phospholipase C (PLC) pleckstrin homology (PH) domain. Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Probab=52.20 E-value=46 Score=20.15 Aligned_cols=22 Identities=23% Similarity=0.451 Sum_probs=17.6
Q ss_pred CCceeEEEecCCCeEEEEeCCC
Q psy959 68 SQPSGLAIDFDDEMLYWTDAVR 89 (109)
Q Consensus 68 ~~p~glaiD~~~~~lYw~D~~~ 89 (109)
..+.-+.+|+.+..|||.+...
T Consensus 21 ~~~~~f~ld~~~~~l~W~~~~~ 42 (115)
T cd01248 21 ERRRLFRLDEKGFFLYWKDEGK 42 (115)
T ss_pred eeeEEEEEcCCCcEEEEeCCCC
Confidence 3456788999999999998753
No 108
>KOG0285|consensus
Probab=52.00 E-value=82 Score=23.96 Aligned_cols=73 Identities=14% Similarity=0.113 Sum_probs=45.0
Q ss_pred CCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeC
Q psy959 19 AIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDL 97 (109)
Q Consensus 19 ~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~ 97 (109)
.+.-.++++|||- ..-|-+-.. ...|..-++....-+.-+.......+|+++....-.||-+-. ...|.+-++
T Consensus 150 HlgWVr~vavdP~-n~wf~tgs~----DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~ge-dk~VKCwDL 222 (460)
T KOG0285|consen 150 HLGWVRSVAVDPG-NEWFATGSA----DRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAGE-DKQVKCWDL 222 (460)
T ss_pred ccceEEEEeeCCC-ceeEEecCC----CceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEecC-CCeeEEEec
Confidence 4667889999995 333444443 578888888644333333445778999999866655554432 334544444
No 109
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=51.63 E-value=19 Score=17.27 Aligned_cols=17 Identities=0% Similarity=-0.058 Sum_probs=12.8
Q ss_pred eEEeeCCCCeEEEEecC
Q psy959 25 HFMPPIRLRTLYYTDWG 41 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~ 41 (109)
=.|+|..+|.+.|.-..
T Consensus 12 l~AlD~~TG~~~W~~~~ 28 (38)
T PF01011_consen 12 LYALDAKTGKVLWKFQT 28 (38)
T ss_dssp EEEEETTTTSEEEEEES
T ss_pred EEEEECCCCCEEEeeeC
Confidence 36888888888887543
No 110
>KOG4649|consensus
Probab=51.18 E-value=34 Score=24.90 Aligned_cols=29 Identities=24% Similarity=0.336 Sum_probs=21.0
Q ss_pred CceeEEEecCCCeEEEEeCCCCeEEEEeC
Q psy959 69 QPSGLAIDFDDEMLYWTDAVREKIERSDL 97 (109)
Q Consensus 69 ~p~glaiD~~~~~lYw~D~~~~~I~~~~~ 97 (109)
.-.-+|||+.++.|||-..-..+||...+
T Consensus 32 s~~~~avd~~sG~~~We~ilg~RiE~sa~ 60 (354)
T KOG4649|consen 32 SGIVIAVDPQSGNLIWEAILGVRIECSAI 60 (354)
T ss_pred CceEEEecCCCCcEEeehhhCceeeeeeE
Confidence 33446888888888888777777776654
No 111
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=50.41 E-value=1e+02 Score=22.88 Aligned_cols=77 Identities=10% Similarity=-0.009 Sum_probs=44.3
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCC-CCcEEEEeCC------CCCceeEEEecCCCeEEEEeCCCCeE
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAG-SLKKVIIEKD------LSQPSGLAIDFDDEMLYWTDAVREKI 92 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG-~~~~~l~~~~------~~~p~glaiD~~~~~lYw~D~~~~~I 92 (109)
-..|+++++.+..+++|.++.. .+.+...+... ...+.+-... -.++.+|.-.+.+...+++-...++|
T Consensus 77 G~~~~~i~~s~DG~~~~v~n~~----~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I 152 (369)
T PF02239_consen 77 GGNPRGIAVSPDGKYVYVANYE----PGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEI 152 (369)
T ss_dssp SSEEEEEEE--TTTEEEEEEEE----TTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEE
T ss_pred CCCcceEEEcCCCCEEEEEecC----CCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeE
Confidence 4679999999988899999887 47777766532 2233332211 12344555555555455555567788
Q ss_pred EEEeCCCC
Q psy959 93 ERSDLAGN 100 (109)
Q Consensus 93 ~~~~~dG~ 100 (109)
..+++...
T Consensus 153 ~vVdy~d~ 160 (369)
T PF02239_consen 153 WVVDYSDP 160 (369)
T ss_dssp EEEETTTS
T ss_pred EEEEeccc
Confidence 88876543
No 112
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=47.50 E-value=89 Score=23.12 Aligned_cols=78 Identities=13% Similarity=0.125 Sum_probs=46.0
Q ss_pred EEeeCCCCeEEEEecCCCCCCCeEEEEE--------------------cCCCCcEEEEeCCC-------CCceeEEEecC
Q psy959 26 FMPPIRLRTLYYTDWGRFGTAGKILRTT--------------------MAGSLKKVIIEKDL-------SQPSGLAIDFD 78 (109)
Q Consensus 26 iavd~~~g~lywsd~~~~~~~~~I~~~~--------------------~dG~~~~~l~~~~~-------~~p~glaiD~~ 78 (109)
=||++..+.|||--|-. .+.+++-. .+-...+.|..+.. .....|.-||.
T Consensus 40 NAV~~vDd~IyFGGWVH---APa~y~gk~~g~~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdIlYdP~ 116 (339)
T PF09910_consen 40 NAVEWVDDFIYFGGWVH---APAVYEGKGDGRATIDFRNKYSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDILYDPY 116 (339)
T ss_pred eeeeeecceEEEeeeec---CCceeeeccCCceEEEEeeccceEEEEEcCCCeEEEEEecccCCccccccchhheeeCCC
Confidence 58888899999887654 23333333 32223344444333 24556788888
Q ss_pred CCeEEEEeCCC---CeEEEEeCCCCCeEEEE
Q psy959 79 DEMLYWTDAVR---EKIERSDLAGNNREALY 106 (109)
Q Consensus 79 ~~~lYw~D~~~---~~I~~~~~dG~~~~~l~ 106 (109)
+++||.+-... --|++.+..+...+.|.
T Consensus 117 ~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~ 147 (339)
T PF09910_consen 117 EDRLLLARADGHANLGVYSLDRRTGKAEKLS 147 (339)
T ss_pred cCEEEEEecCCcceeeeEEEcccCCceeecc
Confidence 88888887642 24566665544444443
No 113
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=46.08 E-value=1.2e+02 Score=22.58 Aligned_cols=53 Identities=19% Similarity=0.252 Sum_probs=37.2
Q ss_pred eCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCC---------CCCcee---EEEecCCCeEEEEe
Q psy959 29 PIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKD---------LSQPSG---LAIDFDDEMLYWTD 86 (109)
Q Consensus 29 d~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~---------~~~p~g---laiD~~~~~lYw~D 86 (109)
....+.+||... .+.|+.+++.|...+..-.-. --+|-| +|++...++||..-
T Consensus 192 ~~~~~~~~F~Sy-----~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvLM 256 (342)
T PF06433_consen 192 SRDGGRLYFVSY-----EGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVLM 256 (342)
T ss_dssp ETTTTEEEEEBT-----TSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEEE
T ss_pred ECCCCeEEEEec-----CCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEEe
Confidence 344678888776 489999999988765543311 123554 89999999999863
No 114
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=44.27 E-value=1.5e+02 Score=23.16 Aligned_cols=70 Identities=16% Similarity=0.098 Sum_probs=41.4
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCC------------CCeE
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAV------------REKI 92 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~------------~~~I 92 (109)
.+++|+..+.|||--... .+ .++..| --.++..-.=+|+|..++++-|.=.. ...+
T Consensus 238 ~~s~D~~~~lvy~~tGnp---~p------~~~~~r---~gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~wD~d~~~~p~l 305 (527)
T TIGR03075 238 TGSYDPETNLIYFGTGNP---SP------WNSHLR---PGDNLYTSSIVARDPDTGKIKWHYQTTPHDEWDYDGVNEMIL 305 (527)
T ss_pred ceeEcCCCCeEEEeCCCC---CC------CCCCCC---CCCCccceeEEEEccccCCEEEeeeCCCCCCccccCCCCcEE
Confidence 369999999999976541 11 233333 11233344457888888888887432 1233
Q ss_pred EEEeCCCCCeEEEE
Q psy959 93 ERSDLAGNNREALY 106 (109)
Q Consensus 93 ~~~~~dG~~~~~l~ 106 (109)
.....+|+.+++++
T Consensus 306 ~d~~~~G~~~~~v~ 319 (527)
T TIGR03075 306 FDLKKDGKPRKLLA 319 (527)
T ss_pred EEeccCCcEEEEEE
Confidence 33446777666664
No 115
>KOG3567|consensus
Probab=43.11 E-value=75 Score=24.82 Aligned_cols=91 Identities=15% Similarity=0.085 Sum_probs=48.9
Q ss_pred cCCCCCccEEEec--CCCCcceEEeeCCCCeEEEEecCC---------CCCCCeEEEEEcCCCCcEEEEe---CCCCCce
Q psy959 6 KDLLKTHRSLLLQ--AIPGRRHFMPPIRLRTLYYTDWGR---------FGTAGKILRTTMAGSLKKVIIE---KDLSQPS 71 (109)
Q Consensus 6 ~~l~G~~~~~li~--~l~~p~~iavd~~~g~lywsd~~~---------~~~~~~I~~~~~dG~~~~~l~~---~~~~~p~ 71 (109)
.+|+|-.+ ..++ .++.-..++|....|+++=..... .....+|.+..+. ++..+.. ..+..|.
T Consensus 394 e~l~gv~l-gqv~gva~d~~~~l~vfhr~g~vwd~~sfd~~f~~~~~~pi~~~~ilvi~~~--n~~~l~~~g~~~fylph 470 (501)
T KOG3567|consen 394 EALDGVYL-GQVSGVALDPKNNLVVFHRGGHVWDGNSFDSKFVYQQIGPIEEDTILVIDPN--NAAVLQSSGKNLFYLPH 470 (501)
T ss_pred hhcCcchh-heeeeeccCCccceEEEccCCceEeechhhhhhhhhcccccccceEEEEcCc--chhhhhhccCCceecCC
Confidence 34554444 3333 234445566666566653332110 0125777777776 3333322 3467899
Q ss_pred eEEEecCCCeEEEEeCCCCeEEEEeCCCC
Q psy959 72 GLAIDFDDEMLYWTDAVREKIERSDLAGN 100 (109)
Q Consensus 72 glaiD~~~~~lYw~D~~~~~I~~~~~dG~ 100 (109)
||.+| .++..|-+|...+.+..-+..++
T Consensus 471 gl~~d-kdgf~~~tdvash~v~k~k~~~~ 498 (501)
T KOG3567|consen 471 GLSID-KDGFYWVTDVASHQVFKLKPNNK 498 (501)
T ss_pred cceec-CCCcEEeecccchhhhhcccccc
Confidence 99999 45555555666776665555554
No 116
>KOG0273|consensus
Probab=42.74 E-value=1.7e+02 Score=23.07 Aligned_cols=75 Identities=9% Similarity=-0.076 Sum_probs=50.4
Q ss_pred CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC-CCCeEEEEeCCC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA-VREKIERSDLAG 99 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~-~~~~I~~~~~dG 99 (109)
....++.|||.+.-=|.+... ...|.++.+++.....-+...-...++|..|+....|-=+.. +.-+|..+.-++
T Consensus 317 ~s~~~lDVdW~~~~~F~ts~t----d~~i~V~kv~~~~P~~t~~GH~g~V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~ 392 (524)
T KOG0273|consen 317 HSAPALDVDWQSNDEFATSST----DGCIHVCKVGEDRPVKTFIGHHGEVNALKWNPTGSLLASCSDDGTLKIWSMGQSN 392 (524)
T ss_pred ccCCccceEEecCceEeecCC----CceEEEEEecCCCcceeeecccCceEEEEECCCCceEEEecCCCeeEeeecCCCc
Confidence 344578889988777777655 588999999887754434445567788888877776655433 355666655444
No 117
>KOG0650|consensus
Probab=41.93 E-value=98 Score=25.14 Aligned_cols=76 Identities=13% Similarity=0.100 Sum_probs=43.4
Q ss_pred cCCCCCccEEEecCC-------------CCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCC-CCcEEEEeCCCCCce
Q psy959 6 KDLLKTHRSLLLQAI-------------PGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAG-SLKKVIIEKDLSQPS 71 (109)
Q Consensus 6 ~~l~G~~~~~li~~l-------------~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG-~~~~~l~~~~~~~p~ 71 (109)
-..+|.++.++|..| ..|..+..+|..-+||.+-.. .+...++.- .-.+.+. .+..+..
T Consensus 539 V~~~~~~~~VliHQLSK~~sQ~PF~kskG~vq~v~FHPs~p~lfVaTq~------~vRiYdL~kqelvKkL~-tg~kwiS 611 (733)
T KOG0650|consen 539 VMPDSGNKSVLIHQLSKRKSQSPFRKSKGLVQRVKFHPSKPYLFVATQR------SVRIYDLSKQELVKKLL-TGSKWIS 611 (733)
T ss_pred eccCCCcceEEEEecccccccCchhhcCCceeEEEecCCCceEEEEecc------ceEEEehhHHHHHHHHh-cCCeeee
Confidence 344566777887432 245567889988899888654 333333321 1111221 3456667
Q ss_pred eEEEecC---------CCeEEEEeCC
Q psy959 72 GLAIDFD---------DEMLYWTDAV 88 (109)
Q Consensus 72 glaiD~~---------~~~lYw~D~~ 88 (109)
+|+|++. .++|.|.|-.
T Consensus 612 ~msihp~GDnli~gs~d~k~~WfDld 637 (733)
T KOG0650|consen 612 SMSIHPNGDNLILGSYDKKMCWFDLD 637 (733)
T ss_pred eeeecCCCCeEEEecCCCeeEEEEcc
Confidence 7777764 4566777653
No 118
>PF12814 Mcp5_PH: Meiotic cell cortex C-terminal pleckstrin homology; InterPro: IPR024774 This pleckstrin homology domain is found in eukaryotic proteins, including Mcp5, a fungal protein that anchors dynein at the cell cortex during the horsetail phase (prophase I) of meiosis. During prophase I of fission yeast all the telomeres become bundled at the spindle pole body and subsequently the nucleus undergoes a dynamic oscillation, resulting in elongated nuclear morphology known as "horsetail" nucleus. The pleckstrin homology domain is necessary for the cortical localisation of the Mcp5 protein during meiosis [].; GO: 0005515 protein binding, 0032065 cortical protein anchoring, 0005938 cell cortex
Probab=41.01 E-value=27 Score=21.69 Aligned_cols=21 Identities=38% Similarity=0.531 Sum_probs=17.4
Q ss_pred CCcceEEeeCCCCeEEEEecC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWG 41 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~ 41 (109)
.+.+=+.|||.++.|||++..
T Consensus 30 ~h~R~fwv~~~~~~L~Ws~~~ 50 (123)
T PF12814_consen 30 PHRRYFWVDPYTRTLYWSSSN 50 (123)
T ss_pred cEEEEEEEeCCCCEEEecCCC
Confidence 345678999999999999965
No 119
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=39.57 E-value=1.5e+02 Score=21.68 Aligned_cols=78 Identities=13% Similarity=0.085 Sum_probs=42.7
Q ss_pred CCCCCccEEEecCCCC---cceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC---CCCcEEEEeCCCCCceeEEEecCCC
Q psy959 7 DLLKTHRSLLLQAIPG---RRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA---GSLKKVIIEKDLSQPSGLAIDFDDE 80 (109)
Q Consensus 7 ~l~G~~~~~li~~l~~---p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d---G~~~~~l~~~~~~~p~glaiD~~~~ 80 (109)
.-+|+-+..++..+.- -.|+..|-.+|.||..... -.|....-+ |...+.|-. ..--.-|+=|...-
T Consensus 188 ~gnGkv~~k~vR~fk~~tQTEG~VaDdEtG~LYIaeEd-----vaiWK~~Aep~~G~~g~~idr--~~d~~~LtdDvEGl 260 (364)
T COG4247 188 QGNGKVGTKLVRQFKIPTQTEGMVADDETGFLYIAEED-----VAIWKYEAEPNRGNTGRLIDR--IKDLSYLTDDVEGL 260 (364)
T ss_pred cCCceEcceeeEeeecCCcccceeeccccceEEEeecc-----ceeeecccCCCCCCccchhhh--hcCchhhccccccc
Confidence 3456666666654433 3688889999999998753 455555432 333443322 11112245555555
Q ss_pred eEEEEeCCCCe
Q psy959 81 MLYWTDAVREK 91 (109)
Q Consensus 81 ~lYw~D~~~~~ 91 (109)
.||....+.+-
T Consensus 261 tiYy~pnGkGY 271 (364)
T COG4247 261 TIYYGPNGKGY 271 (364)
T ss_pred EEEEcCCCcEE
Confidence 56666555443
No 120
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=36.25 E-value=1.8e+02 Score=21.71 Aligned_cols=81 Identities=14% Similarity=0.051 Sum_probs=38.2
Q ss_pred EEeeCCCCeEEEEecC-C----CCCCCeEEEEEcCCCCcE-EEE-e--------CCCCCceeEEEecCCCeEEEEeCCCC
Q psy959 26 FMPPIRLRTLYYTDWG-R----FGTAGKILRTTMAGSLKK-VII-E--------KDLSQPSGLAIDFDDEMLYWTDAVRE 90 (109)
Q Consensus 26 iavd~~~g~lywsd~~-~----~~~~~~I~~~~~dG~~~~-~l~-~--------~~~~~p~glaiD~~~~~lYw~D~~~~ 90 (109)
+|+++.+++||..=.. . +.+...|.+.++.-..|. .+- . ..-..|.=++++..++.||.-|...+
T Consensus 243 ~A~~~~~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tG 322 (342)
T PF06433_consen 243 IAYHAASGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATG 322 (342)
T ss_dssp EEEETTTTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEEESEEEEESSSS-EEEEEETTTTEEEEEETTT-
T ss_pred eeeccccCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCC
Confidence 8999999999986311 1 113355666665433321 111 0 11234554556655566666666433
Q ss_pred -eEEEEeCCCCCeEEEE
Q psy959 91 -KIERSDLAGNNREALY 106 (109)
Q Consensus 91 -~I~~~~~dG~~~~~l~ 106 (109)
.+.++.-=|.....++
T Consensus 323 k~~~~~~~lG~~~~l~~ 339 (342)
T PF06433_consen 323 KLVRSIEQLGETPTLIL 339 (342)
T ss_dssp -EEEEE---SSS--EEE
T ss_pred cEEeehhccCCCceEEE
Confidence 4444444455555444
No 121
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=35.60 E-value=46 Score=14.83 Aligned_cols=13 Identities=8% Similarity=0.266 Sum_probs=8.1
Q ss_pred EEeeCCCCeEEEE
Q psy959 26 FMPPIRLRTLYYT 38 (109)
Q Consensus 26 iavd~~~g~lyws 38 (109)
.|+|..+|.+-|.
T Consensus 19 ~a~d~~~G~~~W~ 31 (33)
T smart00564 19 YALDAKTGEILWT 31 (33)
T ss_pred EEEEcccCcEEEE
Confidence 4566666666664
No 122
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=35.13 E-value=1.3e+02 Score=19.47 Aligned_cols=70 Identities=14% Similarity=0.052 Sum_probs=41.5
Q ss_pred CcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcC-CCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCC
Q psy959 22 GRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMA-GSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLA 98 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~d-G~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~d 98 (109)
.+.+++++|....++... . ...|...++. +.....+ ......+..+++++.+..++.+.. .+.|...++.
T Consensus 137 ~i~~~~~~~~~~~l~~~~-~----~~~i~i~d~~~~~~~~~~-~~~~~~i~~~~~~~~~~~l~~~~~-~~~i~i~d~~ 207 (289)
T cd00200 137 WVNSVAFSPDGTFVASSS-Q----DGTIKLWDLRTGKCVATL-TGHTGEVNSVAFSPDGEKLLSSSS-DGTIKLWDLS 207 (289)
T ss_pred cEEEEEEcCcCCEEEEEc-C----CCcEEEEEccccccceeE-ecCccccceEEECCCcCEEEEecC-CCcEEEEECC
Confidence 467788888644444433 2 3677777775 3333333 333335777888877766666654 5566666655
No 123
>KOG0291|consensus
Probab=35.11 E-value=2.7e+02 Score=23.38 Aligned_cols=75 Identities=12% Similarity=0.002 Sum_probs=0.0
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCCeEEEEeCCCCCeEE
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVREKIERSDLAGNNREA 104 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~~I~~~~~dG~~~~~ 104 (109)
.+|+|| .|-|..+-... .-.|.+.++......-+++..-.-..+|.++|.+..|+=..| ...|..-++-++..++
T Consensus 440 cvavD~-sGelV~AG~~d---~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SW-DkTVRiW~if~s~~~v 514 (893)
T KOG0291|consen 440 CVAVDP-SGELVCAGAQD---SFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSW-DKTVRIWDIFSSSGTV 514 (893)
T ss_pred EEEEcC-CCCEEEeeccc---eEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccc-cceEEEEEeeccCcee
No 124
>KOG2586|consensus
Probab=33.80 E-value=54 Score=22.72 Aligned_cols=32 Identities=25% Similarity=0.400 Sum_probs=20.4
Q ss_pred CeeeecCCCCC--ccEEEecCCCCcceEEeeCCCCeEEEEecC
Q psy959 1 MVSTDKDLLKT--HRSLLLQAIPGRRHFMPPIRLRTLYYTDWG 41 (109)
Q Consensus 1 ~~~~~~~l~G~--~~~~li~~l~~p~~iavd~~~g~lywsd~~ 41 (109)
|.++-++++|. .|.+|+.++ |+ +|++|+++.+
T Consensus 59 m~LsT~~~d~rvssRmvLlKgl--------~~-~gf~fytn~~ 92 (228)
T KOG2586|consen 59 MTLSTADKDGRVSSRMVLLKGL--------DH-DGFVFYTNYG 92 (228)
T ss_pred eeehhccccCCcceeeeeeecc--------cC-CCeEEEeecc
Confidence 34455566665 555665654 55 7888999875
No 125
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=33.07 E-value=2.7e+02 Score=22.72 Aligned_cols=60 Identities=13% Similarity=0.064 Sum_probs=41.3
Q ss_pred cEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC---cEEEEeCCCCCceeEEEe
Q psy959 13 RSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL---KKVIIEKDLSQPSGLAID 76 (109)
Q Consensus 13 ~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~---~~~l~~~~~~~p~glaiD 76 (109)
|++-|-++--|.=||.|+..+.+=++... ...|....+-.+. .+.|.-+...+|.||.+-
T Consensus 331 rKV~IPGILvPDliAfn~kaq~VAVASNT----cn~ilVYSv~~s~mPniQqIqLe~~ERPKGiCFl 393 (671)
T PF15390_consen 331 RKVSIPGILVPDLIAFNPKAQVVAVASNT----CNIILVYSVTPSSMPNIQQIQLESNERPKGICFL 393 (671)
T ss_pred eeeccccccccceeeeCCcCCEEEEEecC----CcEEEEEEeccccCCCeeEEEcccCCCCceeeEc
Confidence 44555567788889999988888776543 5677777665443 344444567899999864
No 126
>PHA02885 putative interleukin binding protein; Provisional
Probab=31.16 E-value=1.3e+02 Score=18.55 Aligned_cols=34 Identities=9% Similarity=-0.002 Sum_probs=22.4
Q ss_pred eEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCC
Q psy959 25 HFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSL 58 (109)
Q Consensus 25 ~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~ 58 (109)
+..+....|++||.-....++...|.--++.|.+
T Consensus 53 s~~lnsedgyiywi~pnntspgeyif~enl~g~~ 86 (135)
T PHA02885 53 SGCLNSEDGYIYWIGPNNTSPGEYIFIENLEGAN 86 (135)
T ss_pred eeeecCCCceEEEEeCCCCCCcceeeeecccccc
Confidence 3566788999999976543334556666665554
No 127
>COG0259 PdxH Pyridoxamine-phosphate oxidase [Coenzyme metabolism]
Probab=29.34 E-value=96 Score=21.50 Aligned_cols=10 Identities=30% Similarity=0.823 Sum_probs=7.0
Q ss_pred CCeEEEEecC
Q psy959 32 LRTLYYTDWG 41 (109)
Q Consensus 32 ~g~lywsd~~ 41 (109)
.|.+|+|+..
T Consensus 73 rGfvFyTN~~ 82 (214)
T COG0259 73 RGFVFYTNYG 82 (214)
T ss_pred CcEEEEeccC
Confidence 6777777754
No 128
>smart00155 PLDc Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid, and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not be
Probab=28.05 E-value=57 Score=14.43 Aligned_cols=20 Identities=25% Similarity=0.253 Sum_probs=12.2
Q ss_pred CeEEEEeCCCCeEEEEeCCC
Q psy959 80 EMLYWTDAVREKIERSDLAG 99 (109)
Q Consensus 80 ~~lYw~D~~~~~I~~~~~dG 99 (109)
.+++++|.....|.+++++.
T Consensus 7 ~K~~v~D~~~~~iGs~N~~~ 26 (28)
T smart00155 7 TKLMIVDDEIAYIGSANLDG 26 (28)
T ss_pred eEEEEEcCCEEEEeCccCCC
Confidence 45667776655666666554
No 129
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=25.82 E-value=1.9e+02 Score=18.64 Aligned_cols=47 Identities=9% Similarity=-0.010 Sum_probs=27.8
Q ss_pred ecCCCCCc--cEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCCC
Q psy959 5 DKDLLKTH--RSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAGS 57 (109)
Q Consensus 5 ~~~l~G~~--~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~ 57 (109)
...++++- ...+........|++.|+ ...-||.-.. ..|........
T Consensus 57 vn~L~~~vV~e~~~~~~~~~~~gl~~D~-~~~t~W~ys~-----~~I~ei~i~~E 105 (147)
T PF05131_consen 57 VNRLNNKVVFEESLLETGGKILGLCRDP-SSNTFWLYSS-----NSIFEIVINNE 105 (147)
T ss_pred EEecCCcEEEEEEeccCCcceeeEEEcC-CCCeEEEEeC-----CeeEEEEcCcc
Confidence 34455542 223334556788999999 4457777664 56666655433
No 130
>PF08310 LGFP: LGFP repeat; InterPro: IPR013207 This 54 amino acid repeat is found in many hypothetical proteins. Several hypothetical proteins from Corynebacterium glutamicum (Brevibacterium flavum) and Corynebacterium efficiens along with PS1 protein contain this repeat region. The N-terminal region of PS1 contains an esterase domain which transfers corynomycolic acid. The C-terminal region consists of 4 tandem LGFP repeats. It is hypothesised that the PS1 proteins in Corynebacterium, when associated with the cell wall, may be anchored via the LGFP tandem repeats that may be important for maintaining cell wall integrity. Deletion of Q01377 from SWISSPROT protein results in a 10-fold increase in the cell volume of the organism and infers the corresponding involvement of the protein in the cell shape formation []. The secondary structure of each repeat is predicted to comprise two beta-strands and one alpha-helix.
Probab=25.18 E-value=1.1e+02 Score=15.84 Aligned_cols=17 Identities=35% Similarity=0.817 Sum_probs=13.8
Q ss_pred eEEEecCCCeEEEEeCC
Q psy959 72 GLAIDFDDEMLYWTDAV 88 (109)
Q Consensus 72 glaiD~~~~~lYw~D~~ 88 (109)
|..-+..++.|||+...
T Consensus 16 G~~q~F~~G~Iywsp~t 32 (54)
T PF08310_consen 16 GRYQDFQNGTIYWSPAT 32 (54)
T ss_pred EEEEEcCCeEEEEeCCC
Confidence 46777889999999874
No 131
>PF15232 DUF4585: Domain of unknown function (DUF4585)
Probab=24.45 E-value=1.5e+02 Score=17.04 Aligned_cols=39 Identities=18% Similarity=0.141 Sum_probs=27.4
Q ss_pred cceEEeeCCCCeEEEEecCCCCCCCeEEEEEc--CCCCcEEEEe
Q psy959 23 RRHFMPPIRLRTLYYTDWGRFGTAGKILRTTM--AGSLKKVIIE 64 (109)
Q Consensus 23 p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~--dG~~~~~l~~ 64 (109)
++-+-+||.+|.-|+.|.-. ..++.+..- .|...++.+.
T Consensus 7 qrKvL~DP~SG~Yy~vd~P~---Qp~~k~lfDPETGqYVeV~iP 47 (75)
T PF15232_consen 7 QRKVLQDPESGQYYVVDAPV---QPKTKTLFDPETGQYVEVLIP 47 (75)
T ss_pred CccEeecCCCCCEEEEecCC---CcceeeeecCCCCcEEEEeCC
Confidence 45567899999999999864 455555442 3667777765
No 132
>PF14956 DUF4505: Domain of unknown function (DUF4505)
Probab=24.44 E-value=59 Score=22.03 Aligned_cols=18 Identities=28% Similarity=0.453 Sum_probs=15.2
Q ss_pred CceeEEEecCCCeEEEEe
Q psy959 69 QPSGLAIDFDDEMLYWTD 86 (109)
Q Consensus 69 ~p~glaiD~~~~~lYw~D 86 (109)
+|..|.+++.+++||==-
T Consensus 115 qPe~L~m~~~sGRvYHPa 132 (183)
T PF14956_consen 115 QPEKLCMDPSSGRVYHPA 132 (183)
T ss_pred ccceEEEcCCCCcEeccC
Confidence 789999999999998643
No 133
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=24.35 E-value=3e+02 Score=22.74 Aligned_cols=60 Identities=8% Similarity=-0.053 Sum_probs=37.3
Q ss_pred EEeeCCCCeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCC--CCCceeEEEecCCCeEEEEeCCCC
Q psy959 26 FMPPIRLRTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKD--LSQPSGLAIDFDDEMLYWTDAVRE 90 (109)
Q Consensus 26 iavd~~~g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~--~~~p~glaiD~~~~~lYw~D~~~~ 90 (109)
+|+|..+|++=|.-... .----+||+...-+|+... -...-.|+---.++.+|..|...+
T Consensus 429 vAlD~~TG~~kW~yQtv-----hhDlWDmDvp~qp~L~D~~~DG~~vpalv~ptk~G~~YVlDRrtG 490 (773)
T COG4993 429 VALDATTGKLKWVYQTV-----HHDLWDMDVPAQPTLLDITKDGKVVPALVHPTKNGFIYVLDRRTG 490 (773)
T ss_pred EEecCCCcceeeeeecc-----CcchhcccCCCCceEEEeecCCcEeeeeecccccCcEEEEEcCCC
Confidence 78899999998886542 2234567777777776521 111222333335778999998654
No 134
>PF10042 DUF2278: Uncharacterized conserved protein (DUF2278); InterPro: IPR019268 This entry consists of hypothetical proteins with no known function.
Probab=24.00 E-value=87 Score=21.59 Aligned_cols=66 Identities=15% Similarity=0.128 Sum_probs=34.1
Q ss_pred ccEEEecCCC-CcceEEeeCCC-----CeEEEEecCCCCCCCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCe-EE
Q psy959 12 HRSLLLQAIP-GRRHFMPPIRL-----RTLYYTDWGRFGTAGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEM-LY 83 (109)
Q Consensus 12 ~~~~li~~l~-~p~~iavd~~~-----g~lywsd~~~~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~-lY 83 (109)
|..+.++.-. ...-+||+-.+ ..+||.+.... .+-+....--...-..| -..|.++|+|+.++. |+
T Consensus 29 Hy~i~v~~~~~~~yr~aINV~S~~~~s~l~y~~~~~f~--~p~~~~L~~L~~G~h~L----~~~~~~~aLDYiR~~~Lf 101 (206)
T PF10042_consen 29 HYQIHVDDNNGGDYRIAINVKSKDAPSELLYWVVEDFQ--HPITAKLAALPDGFHPL----ESTPGGGALDYIRGNGLF 101 (206)
T ss_pred cEEEEEecCCCceEEEEEECCCCCCCCcEEEEEeCCCC--cHHHHHHHhcCCCceEC----cCCCCCeeeeEeeCCcCc
Confidence 4555554433 66667776544 35888876531 11111100000001111 157899999998887 54
No 135
>PF12216 m04gp34like: Immune evasion protein; InterPro: IPR022022 The proteins in this family are related to the m04 encoded protein gp34 of pathogenic microorganisms such as Murid herpesvirus 1. m06 and m152 genes are expressed earlier in the intracellular replication phases of these microorganism' life cycles. They function to inhibit MHC-1 loading and export. gp34 is theorized to prevent immune reactions from NK cells which would ordinarily recognise and attack cells lacking MHC.
Probab=23.86 E-value=82 Score=22.67 Aligned_cols=21 Identities=14% Similarity=0.121 Sum_probs=18.0
Q ss_pred CCcceEEeeCCCCeEEEEecC
Q psy959 21 PGRRHFMPPIRLRTLYYTDWG 41 (109)
Q Consensus 21 ~~p~~iavd~~~g~lywsd~~ 41 (109)
..+.|+.||+.+|+||.+...
T Consensus 135 ~~~~Gf~Vd~~sG~L~i~sna 155 (272)
T PF12216_consen 135 GNKDGFKVDPSSGNLYISSNA 155 (272)
T ss_pred CCCCCeEEcCCCceEEEecCc
Confidence 467899999999999998755
No 136
>PF00622 SPRY: SPRY domain; InterPro: IPR003877 The SPRY domain is of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin []. Ca2+-release from the sarcoplasmic or endoplasmic reticulum, the intracellular Ca2+ store, is mediated by the ryanodine receptor (RyR) and/or the inositol trisphosphate receptor (IP3R).; GO: 0005515 protein binding; PDB: 2V24_A 3EK9_A 2AFJ_A 2IWG_E 3EMW_A 2WL1_A 3TOJ_B 2VOK_A 2VOL_B 2FNJ_A ....
Probab=23.41 E-value=1.3e+02 Score=17.73 Aligned_cols=20 Identities=10% Similarity=-0.094 Sum_probs=16.9
Q ss_pred CcceEEeeCCCCeEEEEecC
Q psy959 22 GRRHFMPPIRLRTLYYTDWG 41 (109)
Q Consensus 22 ~p~~iavd~~~g~lywsd~~ 41 (109)
.--|+.+|..++.|+|+..+
T Consensus 68 dvIG~~lD~~~g~l~F~~ng 87 (124)
T PF00622_consen 68 DVIGCGLDLDNGELSFYKNG 87 (124)
T ss_dssp SEEEEEEETTTTEEEEEETT
T ss_pred cEEEEEEeecccEEEEEECC
Confidence 34588999999999999865
No 137
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=23.35 E-value=2.4e+02 Score=22.84 Aligned_cols=67 Identities=12% Similarity=0.048 Sum_probs=34.8
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCC-CCCeEEEEE-c---CCCC---cEEEEeCCCCCceeEEEecCCCeEEEEeC
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFG-TAGKILRTT-M---AGSL---KKVIIEKDLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~-~~~~I~~~~-~---dG~~---~~~l~~~~~~~p~glaiD~~~~~lYw~D~ 87 (109)
+..|.+|++|+ .|+|+.+.-+..+ ...++.-+. | ++.. ++-+....-....|.+..|..+.||..=.
T Consensus 499 f~~PDnl~fD~-~GrLWi~TDg~~s~~~~~~~G~~~m~~~~p~~g~~~rf~t~P~g~E~tG~~FspD~~TlFV~vQ 573 (616)
T COG3211 499 FNSPDNLAFDP-WGRLWIQTDGSGSTLRNRFRGVTQMLTPDPKTGTIKRFLTGPIGCEFTGPCFSPDGKTLFVNVQ 573 (616)
T ss_pred ccCCCceEECC-CCCEEEEecCCCCccCcccccccccccCCCccceeeeeccCCCcceeecceeCCCCceEEEEec
Confidence 67799999999 7777555332211 012121111 1 2221 22222222245667888888888887643
No 138
>KOG0263|consensus
Probab=23.11 E-value=4.3e+02 Score=21.91 Aligned_cols=60 Identities=7% Similarity=-0.030 Sum_probs=34.0
Q ss_pred EEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEc-CCCCcEEEEeCCCCCceeEEEecCCC
Q psy959 15 LLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTM-AGSLKKVIIEKDLSQPSGLAIDFDDE 80 (109)
Q Consensus 15 ~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~-dG~~~~~l~~~~~~~p~glaiD~~~~ 80 (109)
+.+-.+....+++++|...++ .+-.. ..++..-+. .|..++.+ .........|++.|.++
T Consensus 530 ifaghlsDV~cv~FHPNs~Y~-aTGSs----D~tVRlWDv~~G~~VRiF-~GH~~~V~al~~Sp~Gr 590 (707)
T KOG0263|consen 530 IFAGHLSDVDCVSFHPNSNYV-ATGSS----DRTVRLWDVSTGNSVRIF-TGHKGPVTALAFSPCGR 590 (707)
T ss_pred hhcccccccceEEECCccccc-ccCCC----CceEEEEEcCCCcEEEEe-cCCCCceEEEEEcCCCc
Confidence 344557778889999954444 34332 355555444 35555544 44445556677766443
No 139
>KOG0649|consensus
Probab=21.80 E-value=3.2e+02 Score=19.85 Aligned_cols=38 Identities=13% Similarity=0.347 Sum_probs=29.4
Q ss_pred CCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEc-CCCCcEEE
Q psy959 20 IPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTM-AGSLKKVI 62 (109)
Q Consensus 20 l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~-dG~~~~~l 62 (109)
+...+++.+||..+.++++- + .+.|+..++ +|+..+++
T Consensus 114 vPeINam~ldP~enSi~~Ag-G----D~~~y~~dlE~G~i~r~~ 152 (325)
T KOG0649|consen 114 VPEINAMWLDPSENSILFAG-G----DGVIYQVDLEDGRIQREY 152 (325)
T ss_pred CCccceeEeccCCCcEEEec-C----CeEEEEEEecCCEEEEEE
Confidence 34557899999999999886 3 588888888 57766655
No 140
>PF08470 NTNH_C: Nontoxic nonhaemagglutinin C-terminal; InterPro: IPR013677 Bacteria of the Clostridium genus produce protein neurotoxins, which are complexes consisting of neurotoxin (NT), haemagglutinin (HA), nontoxic nonhaemagglutinin (NTNH), and RNA [, ]. The domain described here is found at the C terminus of the NTNH component. ; PDB: 3V0B_B 3V0A_B.
Probab=21.57 E-value=2.2e+02 Score=18.66 Aligned_cols=42 Identities=17% Similarity=0.307 Sum_probs=26.4
Q ss_pred CCeEEEEEcCCCCcEEEEeCCCCCceeEEEecCCCeEEEEeC
Q psy959 46 AGKILRTTMAGSLKKVIIEKDLSQPSGLAIDFDDEMLYWTDA 87 (109)
Q Consensus 46 ~~~I~~~~~dG~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~ 87 (109)
..+|+..+-.-...+.++..++-.|+.|++...+..+|-+-.
T Consensus 99 nNriqlvd~k~~a~k~iinNdiF~sNclt~~~n~ky~~ls~k 140 (165)
T PF08470_consen 99 NNRIQLVDNKNNAKKIIINNDIFISNCLTLSYNNKYISLSMK 140 (165)
T ss_dssp TSBEEEES-GGG---EEEE--TT-TT-BEEEETTEEEEEEEE
T ss_pred CceEEEEecCCCceEEEEEcCeeccceEEEEECCEEEEEEEc
Confidence 578888776555566677888999999999988888877644
No 141
>PF09000 Cytotoxic: Cytotoxic; InterPro: IPR009105 Colicins are plasmid-encoded protein antibiotics, or bacteriocins, produced by strains of Escherichia coli that kill closely related bacteria. Colicins are classified according to the cell-surface receptor they bind to, colicin E3 binding to the BtuB receptor involved in vitamin B12 uptake. The lethal action of colicin E3 arises from its ability to inactivate the ribosome by site-specific RNase cleavage of the 16S ribosomal RNA, which is carried out by the catalytic, or ribonuclease domain. Colicin E3 is comprised of three domains, each domain being involved in a different stage of infection: receptor binding, translocation and cytotoxicity. Colicin E3 is a Y-shaped molecule with the receptor-binding middle domain forming the stalk, the N-terminal translocation domain forming the two globular heads (IPR003058 from INTERPRO), and the C-terminal catalytic domain forming the two globular arms. To neutralise the toxic effects of colicin E3, the host cell produces an immunity protein, which binds to the C-terminal end of the ribonuclease domain and effectively suppresses its activity. This entry represents the ribonuclease domain (also called catalytic or cytotoxic domain) found in various colicins. This domain confers cytotoxic activity to proteins, enabling the formation of nucleolytic breaks in 16S ribosomal RNA. The structure of the domain reveals a highly twisted central beta-sheet elaborated with a short N-terminal alpha-helix [, ]. ; GO: 0003723 RNA binding, 0016788 hydrolase activity, acting on ester bonds, 0043022 ribosome binding, 0009405 pathogenesis; PDB: 2B5U_C 1JCH_A 1E44_B 2XFZ_Y.
Probab=20.36 E-value=2e+02 Score=16.96 Aligned_cols=30 Identities=17% Similarity=0.109 Sum_probs=23.4
Q ss_pred EEEecCCCeEEEEeCCCCeEEEEeCCCCCe
Q psy959 73 LAIDFDDEMLYWTDAVREKIERSDLAGNNR 102 (109)
Q Consensus 73 laiD~~~~~lYw~D~~~~~I~~~~~dG~~~ 102 (109)
-=.|....++|=-|+..+.||..+-+|.+.
T Consensus 34 rw~~~kG~kiYewDsqHG~lEvy~~~GkHL 63 (85)
T PF09000_consen 34 RWKDKKGRKIYEWDSQHGELEVYNKRGKHL 63 (85)
T ss_dssp EEEETTTTEEEEEETTTTEEEEEETT-BEE
T ss_pred ceEcCCCCEEEEEcCCCCeEEEEcCCCcCc
Confidence 456777788999999999999999888753
No 142
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=20.35 E-value=5.1e+02 Score=21.61 Aligned_cols=14 Identities=0% Similarity=-0.278 Sum_probs=6.9
Q ss_pred EEeeCCCCeEEEEe
Q psy959 26 FMPPIRLRTLYYTD 39 (109)
Q Consensus 26 iavd~~~g~lywsd 39 (109)
+|+|..+|++-|.-
T Consensus 417 vALD~~TGk~~W~~ 430 (764)
T TIGR03074 417 VALDATTGKERWVF 430 (764)
T ss_pred EEEeCCCCceEEEe
Confidence 44555555555543
No 143
>KOG0266|consensus
Probab=20.27 E-value=4e+02 Score=20.28 Aligned_cols=83 Identities=14% Similarity=0.014 Sum_probs=51.8
Q ss_pred ccEEEecCCCCcceEEeeCCCCeEEEEecCCCCCCCeEEEEEcCC-CCcEEEEeCCCCCceeEEEecCCCeEEEEeCCCC
Q psy959 12 HRSLLLQAIPGRRHFMPPIRLRTLYYTDWGRFGTAGKILRTTMAG-SLKKVIIEKDLSQPSGLAIDFDDEMLYWTDAVRE 90 (109)
Q Consensus 12 ~~~~li~~l~~p~~iavd~~~g~lywsd~~~~~~~~~I~~~~~dG-~~~~~l~~~~~~~p~glaiD~~~~~lYw~D~~~~ 90 (109)
..+++...-....++++.|.. .+..+-.. ...|...++.+ +..+.+ ........++++.+.. .++++-...+
T Consensus 238 ~~~~l~gH~~~v~~~~f~p~g-~~i~Sgs~----D~tvriWd~~~~~~~~~l-~~hs~~is~~~f~~d~-~~l~s~s~d~ 310 (456)
T KOG0266|consen 238 NLKTLKGHSTYVTSVAFSPDG-NLLVSGSD----DGTVRIWDVRTGECVRKL-KGHSDGISGLAFSPDG-NLLVSASYDG 310 (456)
T ss_pred EEEEecCCCCceEEEEecCCC-CEEEEecC----CCcEEEEeccCCeEEEee-eccCCceEEEEECCCC-CEEEEcCCCc
Confidence 334444334455789999955 77777665 57888888864 444444 3334567778886544 4455545567
Q ss_pred eEEEEeCCCCC
Q psy959 91 KIERSDLAGNN 101 (109)
Q Consensus 91 ~I~~~~~dG~~ 101 (109)
.|..-+..+..
T Consensus 311 ~i~vwd~~~~~ 321 (456)
T KOG0266|consen 311 TIRVWDLETGS 321 (456)
T ss_pred cEEEEECCCCc
Confidence 77777766555
No 144
>PF14280 DUF4365: Domain of unknown function (DUF4365)
Probab=20.03 E-value=2.3e+02 Score=17.35 Aligned_cols=25 Identities=28% Similarity=0.440 Sum_probs=17.0
Q ss_pred CCceeE-EEecCCCeEEEEeCCCCeE
Q psy959 68 SQPSGL-AIDFDDEMLYWTDAVREKI 92 (109)
Q Consensus 68 ~~p~gl-aiD~~~~~lYw~D~~~~~I 92 (109)
.-|.=| .+|+..+.+||.+.....+
T Consensus 82 ~~PvilV~~~~~~~~~yW~~l~~~~~ 107 (138)
T PF14280_consen 82 PVPVILVLVDPDSDCAYWVSLKGYPI 107 (138)
T ss_pred CCCEEEEEEeCCCCEEEEEEcccCcc
Confidence 345544 4688888999999864443
Done!