Query psy5669
Match_columns 116
No_of_seqs 119 out of 1066
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 23:33:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5669.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5669hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00201 UDPGT: UDP-glucoronos 100.0 2.9E-31 6.4E-36 214.7 3.0 104 2-105 397-500 (500)
2 PHA03392 egt ecdysteroid UDP-g 99.9 1.8E-22 4E-27 166.0 10.5 78 2-79 420-498 (507)
3 KOG1192|consensus 99.5 3.7E-13 8.1E-18 108.6 10.0 70 4-75 412-481 (496)
4 TIGR01426 MGT glycosyltransfer 95.2 0.05 1.1E-06 42.9 5.7 45 2-48 347-391 (392)
5 PLN02670 transferase, transfer 95.2 0.055 1.2E-06 44.9 5.9 47 2-49 417-466 (472)
6 COG1819 Glycosyl transferases, 94.2 0.16 3.4E-06 41.2 6.3 48 2-51 356-403 (406)
7 PLN02554 UDP-glycosyltransfera 90.8 0.92 2E-05 37.5 6.6 30 2-31 428-458 (481)
8 PF15050 SCIMP: SCIMP protein 88.7 1.2 2.5E-05 30.9 4.7 47 62-111 1-48 (133)
9 PLN02562 UDP-glycosyltransfera 87.1 2.1 4.5E-05 35.2 6.1 30 2-31 401-430 (448)
10 PF07214 DUF1418: Protein of u 83.9 3.4 7.3E-05 27.4 4.8 34 83-116 60-93 (96)
11 PLN02207 UDP-glycosyltransfera 83.3 1.4 3E-05 36.6 3.5 30 2-31 414-445 (468)
12 PLN03004 UDP-glycosyltransfera 81.3 2.2 4.7E-05 35.3 3.9 28 2-29 412-439 (451)
13 PLN03007 UDP-glucosyltransfera 77.4 3 6.6E-05 34.5 3.6 30 2-31 428-460 (482)
14 cd03784 GT1_Gtf_like This fami 76.2 5.1 0.00011 31.4 4.4 41 2-45 360-400 (401)
15 PF07069 PRRSV_2b: Porcine rep 73.3 11 0.00024 23.1 4.4 34 69-102 26-61 (73)
16 PF05478 Prominin: Prominin; 71.2 12 0.00026 33.1 5.9 36 63-98 85-120 (806)
17 PF02038 ATP1G1_PLM_MAT8: ATP1 70.8 7 0.00015 22.9 3.0 28 64-91 8-35 (50)
18 PF13524 Glyco_trans_1_2: Glyc 70.1 14 0.00031 22.7 4.7 42 2-43 50-91 (92)
19 PLN02208 glycosyltransferase f 70.0 16 0.00034 30.1 6.0 43 2-45 389-436 (442)
20 cd07912 Tweety_N N-terminal do 70.0 22 0.00048 29.3 6.9 13 39-51 4-16 (418)
21 PF07319 DnaI_N: Primosomal pr 68.0 2 4.4E-05 27.9 0.4 32 1-32 1-32 (94)
22 PRK00726 murG undecaprenyldiph 64.8 19 0.00041 27.7 5.3 43 2-45 312-354 (357)
23 PF11628 TCR_zetazeta: T-cell 63.8 15 0.00033 19.6 3.2 19 67-85 5-23 (33)
24 PLN02863 UDP-glucoronosyl/UDP- 63.6 22 0.00048 29.5 5.8 48 2-49 421-472 (477)
25 PLN00414 glycosyltransferase f 63.1 27 0.00059 28.8 6.1 47 2-49 390-441 (446)
26 PLN02167 UDP-glycosyltransfera 62.4 10 0.00022 31.4 3.5 29 2-30 422-451 (475)
27 PLN02448 UDP-glycosyltransfera 62.3 10 0.00022 31.1 3.5 30 2-31 403-437 (459)
28 PF03401 TctC: Tripartite tric 60.5 11 0.00023 28.8 3.2 46 2-49 220-265 (274)
29 PF11057 Cortexin: Cortexin of 59.6 16 0.00034 23.3 3.2 23 73-95 31-53 (81)
30 PLN02152 indole-3-acetate beta 59.5 12 0.00026 31.0 3.5 30 2-31 405-436 (455)
31 PF05478 Prominin: Prominin; 59.0 13 0.00029 32.9 3.9 29 73-102 99-127 (806)
32 cd03820 GT1_amsD_like This fam 57.5 22 0.00049 25.7 4.4 41 2-43 307-347 (348)
33 PLN02210 UDP-glucosyl transfer 56.8 15 0.00033 30.3 3.6 28 2-29 403-433 (456)
34 PF09929 DUF2161: Uncharacteri 56.7 8.4 0.00018 26.5 1.8 26 16-41 56-82 (118)
35 PRK00025 lpxB lipid-A-disaccha 56.7 42 0.00092 25.9 6.0 46 2-47 329-375 (380)
36 PLN02410 UDP-glucoronosyl/UDP- 56.7 15 0.00032 30.4 3.5 29 2-30 398-429 (451)
37 PF11346 DUF3149: Protein of u 56.0 35 0.00076 19.1 4.6 35 64-98 3-37 (42)
38 PF05186 Dpy-30: Dpy-30 motif; 55.8 34 0.00074 19.0 3.9 22 21-42 15-36 (42)
39 PLN02764 glycosyltransferase f 53.9 45 0.00098 27.7 6.0 49 2-51 395-448 (453)
40 PF03213 Pox_P35: Poxvirus P35 50.8 33 0.00072 27.5 4.5 39 36-78 258-297 (325)
41 COG0133 TrpB Tryptophan syntha 50.5 43 0.00093 27.3 5.1 44 2-46 26-69 (396)
42 PF15013 CCSMST1: CCSMST1 fami 50.4 14 0.0003 23.5 1.9 19 55-73 21-39 (77)
43 cd03801 GT1_YqgM_like This fam 49.0 57 0.0012 23.5 5.4 44 2-45 329-372 (374)
44 PF05454 DAG1: Dystroglycan (D 47.6 6.2 0.00013 31.1 0.0 20 22-41 70-89 (290)
45 KOG4331|consensus 47.0 34 0.00073 30.8 4.4 38 63-100 98-135 (865)
46 PHA03105 EEV glycoprotein; Pro 46.2 21 0.00045 26.0 2.5 24 81-105 17-40 (188)
47 KOG4433|consensus 46.2 1.2E+02 0.0026 25.8 7.3 65 35-104 4-76 (526)
48 PLN00164 glucosyltransferase; 45.5 24 0.00052 29.3 3.1 30 2-31 419-453 (480)
49 PTZ00234 variable surface prot 45.4 17 0.00037 30.1 2.3 31 83-113 377-407 (433)
50 COG2011 AbcD ABC-type metal io 45.2 40 0.00087 25.6 4.0 34 65-98 186-219 (222)
51 PF07219 HemY_N: HemY protein 43.9 54 0.0012 21.4 4.2 23 74-96 20-42 (108)
52 cd08806 CARD_CARD14_CARMA2 Cas 43.6 82 0.0018 20.4 4.7 38 7-44 35-75 (86)
53 cd03822 GT1_ecORF704_like This 43.4 95 0.0021 22.9 5.9 43 2-45 322-364 (366)
54 PF05510 Sarcoglycan_2: Sarcog 42.7 40 0.00087 27.7 3.9 21 55-76 269-289 (386)
55 PF04328 DUF466: Protein of un 42.3 79 0.0017 19.2 4.8 44 2-47 10-53 (65)
56 PF01601 Corona_S2: Coronaviru 42.3 8.4 0.00018 33.3 0.0 33 60-92 544-576 (610)
57 PF10661 EssA: WXG100 protein 42.2 46 0.00099 23.4 3.7 14 81-94 130-143 (145)
58 cd03794 GT1_wbuB_like This fam 40.8 46 0.001 24.4 3.8 39 2-40 353-391 (394)
59 cd03821 GT1_Bme6_like This fam 40.1 62 0.0013 23.7 4.4 41 2-42 333-373 (375)
60 PF01102 Glycophorin_A: Glycop 40.0 67 0.0014 22.1 4.2 10 28-37 41-50 (122)
61 PRK08939 primosomal protein Dn 39.8 42 0.00092 26.2 3.6 32 1-32 1-32 (306)
62 PLN02992 coniferyl-alcohol glu 39.6 91 0.002 26.1 5.7 28 2-29 415-445 (481)
63 KOG3989|consensus 39.6 45 0.00097 25.3 3.6 11 41-51 162-172 (224)
64 PRK10907 intramembrane serine 39.6 1.5E+02 0.0033 23.0 6.6 31 12-42 39-69 (276)
65 PF03579 SHP: Small hydrophobi 39.4 87 0.0019 18.9 4.6 9 64-72 14-22 (64)
66 PF14851 FAM176: FAM176 family 37.7 57 0.0012 23.3 3.7 6 90-95 42-47 (153)
67 PF06345 Drf_DAD: DRF Autoregu 37.6 18 0.00038 15.7 0.7 10 1-10 3-12 (15)
68 PF04906 Tweety: Tweety; Inte 37.6 1.6E+02 0.0036 24.0 6.8 18 86-103 34-51 (406)
69 PHA02688 ORF059 IMV protein VP 37.5 52 0.0011 26.4 3.8 40 36-79 256-296 (323)
70 TIGR03798 ocin_TIGR03798 bacte 37.4 37 0.0008 20.2 2.4 24 1-24 1-24 (64)
71 PF14979 TMEM52: Transmembrane 37.3 86 0.0019 22.4 4.5 15 70-84 20-34 (154)
72 PHA02754 hypothetical protein; 37.2 55 0.0012 19.9 3.0 27 6-32 5-31 (67)
73 PF01102 Glycophorin_A: Glycop 37.1 66 0.0014 22.1 3.8 7 73-79 67-73 (122)
74 PRK13028 tryptophan synthase s 36.7 45 0.00099 27.2 3.4 38 2-40 32-69 (402)
75 smart00394 RIIa RIIalpha, Regu 36.4 70 0.0015 16.9 3.6 24 25-48 12-35 (38)
76 PF07856 Orai-1: Mediator of C 36.4 42 0.00091 24.4 2.9 56 30-85 93-153 (175)
77 PF03889 DUF331: Domain of unk 36.2 26 0.00056 19.4 1.3 19 5-23 5-23 (39)
78 PF08374 Protocadherin: Protoc 36.1 38 0.00082 25.7 2.7 7 43-49 8-14 (221)
79 cd00904 Ferritin Ferritin iron 36.1 52 0.0011 23.0 3.3 38 20-57 35-72 (160)
80 PF06837 Fijivirus_P9-2: Fijiv 36.0 94 0.002 23.2 4.6 66 30-101 43-110 (214)
81 PRK04346 tryptophan synthase s 34.9 53 0.0011 26.8 3.6 37 2-39 28-64 (397)
82 PRK11024 colicin uptake protei 34.7 33 0.00072 23.5 2.1 14 69-82 20-33 (141)
83 PLN02173 UDP-glucosyl transfer 34.7 52 0.0011 27.2 3.5 28 2-29 396-426 (449)
84 cd03814 GT1_like_2 This family 34.7 1.5E+02 0.0033 21.7 5.8 42 2-44 320-361 (364)
85 PRK14750 kdpF potassium-transp 34.5 72 0.0016 16.5 3.3 22 69-90 2-23 (29)
86 PF05225 HTH_psq: helix-turn-h 34.3 63 0.0014 17.9 2.8 23 2-24 3-26 (45)
87 PF05568 ASFV_J13L: African sw 34.3 98 0.0021 22.3 4.4 11 63-73 25-35 (189)
88 PF02197 RIIa: Regulatory subu 34.1 80 0.0017 16.9 3.2 24 25-48 12-35 (38)
89 PRK12446 undecaprenyldiphospho 33.7 82 0.0018 24.8 4.4 37 2-44 313-350 (352)
90 cd03808 GT1_cap1E_like This fa 31.9 1E+02 0.0022 22.3 4.4 39 2-40 317-355 (359)
91 TIGR02801 tolR TolR protein. T 31.8 44 0.00096 22.3 2.3 13 69-81 10-22 (129)
92 TIGR02804 ExbD_2 TonB system t 31.7 47 0.001 22.1 2.4 13 69-81 9-21 (121)
93 PF12626 PolyA_pol_arg_C: Poly 31.7 54 0.0012 22.4 2.7 38 9-47 55-92 (124)
94 PF11417 Inhibitor_G39P: Loade 31.6 56 0.0012 20.2 2.5 31 6-36 34-64 (71)
95 TIGR03147 cyt_nit_nrfF cytochr 31.5 1.8E+02 0.0039 20.1 5.2 30 33-65 74-103 (126)
96 PHA02849 putative transmembran 31.4 1E+02 0.0022 19.7 3.7 20 72-91 19-38 (82)
97 PHA00646 hypothetical protein 31.2 1.2E+02 0.0026 18.5 3.8 25 69-93 39-63 (65)
98 PF09959 DUF2193: Uncharacteri 31.1 1.5E+02 0.0032 24.9 5.4 40 7-52 93-134 (499)
99 PRK11267 biopolymer transport 30.9 45 0.00097 22.9 2.2 14 69-82 23-36 (141)
100 cd04962 GT1_like_5 This family 30.8 1.8E+02 0.004 21.8 5.8 45 2-46 324-368 (371)
101 PF14851 FAM176: FAM176 family 30.7 1.1E+02 0.0025 21.8 4.3 16 85-100 33-48 (153)
102 PRK13718 conjugal transfer pro 30.6 1.5E+02 0.0032 19.1 4.3 19 89-107 61-79 (84)
103 cd03807 GT1_WbnK_like This fam 30.3 1.5E+02 0.0033 21.4 5.2 42 2-43 320-361 (365)
104 PF05148 Methyltransf_8: Hypot 30.1 1.6E+02 0.0034 22.4 5.1 38 7-44 30-68 (219)
105 PRK13608 diacylglycerol glucos 29.7 2.8E+02 0.0061 21.9 7.1 44 2-46 326-369 (391)
106 PF00957 Synaptobrevin: Synapt 29.4 88 0.0019 19.5 3.3 11 64-74 62-72 (89)
107 cd03795 GT1_like_4 This family 29.3 78 0.0017 23.5 3.5 25 2-26 320-344 (357)
108 PRK13803 bifunctional phosphor 28.6 65 0.0014 27.7 3.2 37 2-39 241-277 (610)
109 PHA02657 hypothetical protein; 28.6 1E+02 0.0022 20.1 3.4 23 67-89 24-46 (95)
110 PF10854 DUF2649: Protein of u 28.6 1.4E+02 0.0031 18.2 4.3 33 60-92 32-64 (67)
111 PRK05749 3-deoxy-D-manno-octul 28.3 1.9E+02 0.004 22.9 5.7 43 2-44 376-418 (425)
112 PF11157 DUF2937: Protein of u 28.3 1E+02 0.0022 22.1 3.8 26 4-29 58-85 (167)
113 PF04716 ETC_C1_NDUFA5: ETC co 28.2 68 0.0015 19.0 2.4 23 3-25 13-35 (57)
114 COG3036 Uncharacterized protei 27.8 76 0.0016 19.5 2.5 20 6-25 15-34 (66)
115 PRK13609 diacylglycerol glucos 27.7 1.7E+02 0.0037 22.6 5.2 43 2-45 326-368 (380)
116 TIGR03449 mycothiol_MshA UDP-N 27.5 2.6E+02 0.0057 21.6 6.3 44 2-46 356-399 (405)
117 cd03818 GT1_ExpC_like This fam 27.3 1.2E+02 0.0027 23.6 4.4 25 2-26 354-378 (396)
118 PF04316 FlgM: Anti-sigma-28 f 26.9 72 0.0016 18.5 2.4 25 5-29 30-54 (57)
119 TIGR02803 ExbD_1 TonB system t 26.8 62 0.0013 21.5 2.3 14 69-82 9-22 (122)
120 PF10577 UPF0560: Uncharacteri 26.7 1.1E+02 0.0025 27.5 4.4 8 93-100 295-302 (807)
121 COG4455 ImpE Protein of avirul 26.4 1E+02 0.0022 23.9 3.6 35 1-35 1-35 (273)
122 cd03792 GT1_Trehalose_phosphor 26.3 2.3E+02 0.0051 21.7 5.8 45 2-46 325-369 (372)
123 KOG4482|consensus 26.2 1.7E+02 0.0036 24.4 4.9 7 105-111 330-336 (449)
124 PF06917 Pectate_lyase_2: Peri 26.1 93 0.002 26.7 3.6 38 4-42 77-114 (557)
125 COG0848 ExbD Biopolymer transp 26.0 66 0.0014 22.3 2.3 16 69-84 21-36 (137)
126 PF08388 GIIM: Group II intron 25.9 1.2E+02 0.0027 18.1 3.4 9 63-71 34-42 (80)
127 TIGR01133 murG undecaprenyldip 25.4 1.1E+02 0.0023 23.1 3.7 26 2-27 309-334 (348)
128 PTZ00443 Thioredoxin domain-co 24.7 1.6E+02 0.0035 22.0 4.4 6 105-110 211-216 (224)
129 PHA02902 putative IMV membrane 24.7 1.6E+02 0.0034 18.2 3.5 11 70-80 3-13 (70)
130 PF12122 DUF3582: Protein of u 24.7 58 0.0013 21.5 1.8 31 12-42 41-71 (101)
131 PLN02618 tryptophan synthase, 24.6 98 0.0021 25.4 3.5 43 2-45 36-78 (410)
132 PF00558 Vpu: Vpu protein; In 24.3 1.3E+02 0.0028 19.3 3.3 16 69-84 3-18 (81)
133 PF06637 PV-1: PV-1 protein (P 24.0 2.5E+02 0.0055 23.3 5.6 13 51-63 8-20 (442)
134 PF02723 NS3_envE: Non-structu 23.9 2E+02 0.0043 18.4 4.1 10 89-98 38-47 (82)
135 PF08390 TRAM1: TRAM1-like pro 23.8 1.7E+02 0.0037 17.5 4.6 21 63-83 20-40 (65)
136 TIGR02149 glgA_Coryne glycogen 23.6 3.2E+02 0.0069 20.8 6.1 44 2-45 340-383 (388)
137 PRK04125 murein hydrolase regu 23.6 2E+02 0.0043 20.2 4.4 11 86-96 108-118 (141)
138 PF00988 CPSase_sm_chain: Carb 23.5 34 0.00073 23.9 0.5 11 9-19 38-48 (131)
139 PF15179 Myc_target_1: Myc tar 23.5 1.9E+02 0.004 21.6 4.4 21 61-81 9-31 (197)
140 cd03809 GT1_mtfB_like This fam 23.5 1.8E+02 0.0039 21.3 4.5 37 2-39 324-360 (365)
141 TIGR00263 trpB tryptophan synt 23.2 89 0.0019 25.1 2.9 38 1-39 19-56 (385)
142 PHA02851 EEV glycoprotein; Pro 23.2 79 0.0017 23.1 2.4 15 88-102 20-34 (223)
143 TIGR00215 lpxB lipid-A-disacch 23.1 1.8E+02 0.0038 23.2 4.6 40 2-41 335-380 (385)
144 PLN02555 limonoid glucosyltran 22.9 1.1E+02 0.0024 25.5 3.5 29 2-30 417-448 (480)
145 PF07782 DC_STAMP: DC-STAMP-li 22.8 2.5E+02 0.0054 20.2 5.0 45 61-105 141-185 (191)
146 PF12877 DUF3827: Domain of un 22.7 57 0.0012 28.7 1.8 9 73-81 276-284 (684)
147 cd02646 R3H_G-patch R3H domain 22.3 71 0.0015 18.6 1.7 39 1-49 1-39 (58)
148 KOG3814|consensus 21.7 2.1E+02 0.0045 24.0 4.7 37 15-51 406-442 (531)
149 TIGR03088 stp2 sugar transfera 21.5 3.4E+02 0.0073 20.6 5.8 44 2-45 326-369 (374)
150 cd03825 GT1_wcfI_like This fam 21.4 2.9E+02 0.0062 20.5 5.3 19 2-20 318-336 (365)
151 PHA03054 IMV membrane protein; 21.3 2.2E+02 0.0048 17.8 7.2 25 26-50 10-34 (72)
152 cd04951 GT1_WbdM_like This fam 21.1 2.8E+02 0.006 20.5 5.1 42 2-44 314-356 (360)
153 PF02526 GBP_repeat: Glycophor 21.0 1.6E+02 0.0034 15.9 3.5 24 8-31 10-33 (38)
154 PF04659 Arch_fla_DE: Archaeal 21.0 77 0.0017 20.9 1.8 19 36-54 18-36 (99)
155 PRK10591 hypothetical protein; 21.0 2.6E+02 0.0056 18.4 4.4 12 84-95 61-72 (92)
156 PF06667 PspB: Phage shock pro 20.8 2.3E+02 0.005 17.8 4.1 21 71-91 4-24 (75)
157 PF12834 Phage_int_SAM_2: Phag 20.8 2.4E+02 0.0052 18.0 6.0 49 1-49 1-60 (91)
158 PRK10635 bacterioferritin; Pro 20.7 2.9E+02 0.0063 19.4 4.9 55 3-57 6-73 (158)
159 PRK10307 putative glycosyl tra 20.5 3.2E+02 0.0068 21.3 5.5 43 2-44 361-403 (412)
160 cd01056 Euk_Ferritin eukaryoti 20.4 82 0.0018 21.9 2.0 38 20-57 35-72 (161)
161 PTZ00046 rifin; Provisional 20.2 2.1E+02 0.0045 23.4 4.4 6 92-97 335-340 (358)
No 1
>PF00201 UDPGT: UDP-glucoronosyl and UDP-glucosyl transferase; InterPro: IPR002213 UDP glycosyltransferases (UGT) are a superfamily of enzymes that catalyzes the addition of the glycosyl group from a UTP-sugar to a small hydrophobic molecule. This family currently consist of: Mammalian UDP-glucuronosyl transferases (2.4.1.17 from EC) (UDPGT) []. A large family of membrane-bound microsomal enzymes which catalyze the transfer of glucuronic acid to a wide variety of exogenous and endogenous lipophilic substrates. These enzymes are of major importance in the detoxification and subsequent elimination of xenobiotics such as drugs and carcinogens. A large number of putative UDPGT from Caenorhabditis elegans. Mammalian 2-hydroxyacylsphingosine 1-beta-galactosyltransferase [] (2.4.1.45 from EC) (also known as UDP-galactose-ceramide galactosyltransferase). This enzyme catalyzes the transfer of galactose to ceramide, a key enzymatic step in the biosynthesis of galactocerebrosides, which are abundant sphingolipids of the myelin membrane of the central nervous system and peripheral nervous system. Plants flavonol O(3)-glucosyltransferase (2.4.1.91 from EC). An enzyme [] that catalyzes the transfer of glucose from UDP-glucose to a flavanol. This reaction is essential and one of the last steps in anthocyanin pigment biosynthesis. Baculoviruses ecdysteroid UDP-glucosyltransferase (2.4.1 from EC) [] (egt). This enzyme catalyzes the transfer of glucose from UDP-glucose to ectysteroids which are insect molting hormones. The expression of egt in the insect host interferes with the normal insect development by blocking the molting process. Prokaryotic zeaxanthin glucosyltransferase (2.4.1 from EC) (gene crtX), an enzyme involved in carotenoid biosynthesis and that catalyses the glycosylation reaction which converts zeaxanthin to zeaxanthin-beta-diglucoside. Streptomyces macrolide glycosyltransferases (2.4.1 from EC) []. These enzymes specifically inactivates macrolide anitibiotics via 2'-O-glycosylation using UDP-glucose. These enzymes share a conserved domain of about 50 amino acid residues located in their C-terminal section.; GO: 0016758 transferase activity, transferring hexosyl groups, 0008152 metabolic process; PDB: 3HBJ_A 3HBF_A 2PQ6_A 3IA7_B 3RSC_A 3IAA_B 2IYA_A 2IYF_B 2O6L_A 2VCH_A ....
Probab=99.96 E-value=2.9e-31 Score=214.66 Aligned_cols=104 Identities=38% Similarity=0.596 Sum_probs=27.0
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcccCCCCCHHHHHHHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKTKATSLPWYQTSQLDVIASCTLGLG 81 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~q~~~lDV~~~l~~v~~ 81 (116)
+++.+||++||+|++|++||+++|++++|||++|+|+|+||||||+||||++||++++.+|||+|||+|||+++++++++
T Consensus 397 ~~l~~ai~~vl~~~~y~~~a~~ls~~~~~~p~~p~~~~~~~ie~v~~~~~~~~l~~~~~~l~~~~~~~lDv~~~~~~~~~ 476 (500)
T PF00201_consen 397 EELRAAIREVLENPSYKENAKRLSSLFRDRPISPLERAVWWIEYVARHGGAPHLRSPARDLSFYQYYLLDVIAFLLLIIL 476 (500)
T ss_dssp HHHHHHHHHHHHSHHHHHHHHHHHHTTT----------------------------------------------------
T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHhcCCCcccCChhhcCCHHHHHHHHHHHHHHHHHH
Confidence 57899999999999999999999999999999999999999999999999999999999999999999999999888888
Q ss_pred HHHHHHHHHHHHHhhhhccccccc
Q psy5669 82 LVLYTLARIGTSFAEFTRGKDDKK 105 (116)
Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~k 105 (116)
++++++++++++++|+++++++||
T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~k~ 500 (500)
T PF00201_consen 477 LIIYIIFKICRFVCRKCVKKKKKK 500 (500)
T ss_dssp ------------------------
T ss_pred HHHHHHHHHHHHHHHHHhcccCCC
Confidence 888999999999999998777664
No 2
>PHA03392 egt ecdysteroid UDP-glucosyltransferase; Provisional
Probab=99.88 E-value=1.8e-22 Score=166.04 Aligned_cols=78 Identities=26% Similarity=0.382 Sum_probs=73.9
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhC-CCCCCcccCCCCCHHHHHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHR-GAPHLKTKATSLPWYQTSQLDVIASCTLG 79 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~-g~~~l~~~~~~l~~~q~~~lDV~~~l~~v 79 (116)
++|.+||+++++||+|++||+++++.++++|.+|.|.|+||||||+||+ |++||++++.+|||+|||+|||+++++++
T Consensus 420 ~~l~~ai~~vl~~~~y~~~a~~ls~~~~~~p~~~~~~av~~iE~v~r~~~g~~~lr~~~~~l~~~qy~~lDv~~~~~~~ 498 (507)
T PHA03392 420 AQLVLAIVDVIENPKYRKNLKELRHLIRHQPMTPLHKAIWYTEHVIRNKHGNTSLKTKAANVSYSDYFMSYILVPLVTF 498 (507)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhCCCCcccccccccCCCHHHHHHHHHHHHHHHH
Confidence 5789999999999999999999999999999999999999999999999 99999999999999999999998666444
No 3
>KOG1192|consensus
Probab=99.46 E-value=3.7e-13 Score=108.64 Aligned_cols=70 Identities=34% Similarity=0.570 Sum_probs=67.6
Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcccCCCCCHHHHHHHHHHHH
Q psy5669 4 SRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKTKATSLPWYQTSQLDVIAS 75 (116)
Q Consensus 4 l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~q~~~lDV~~~ 75 (116)
+..|+.+++++++|++++++++.+++|||.+| +.+++|+||+.+++++.+++.. .+++|++|+++|++.+
T Consensus 412 ~~~~~~~il~~~~y~~~~~~l~~~~~~~p~~~-~~~~~~~e~~~~~~~~~~l~~~-~~~~~~~~~~~d~~~~ 481 (496)
T KOG1192|consen 412 LLEAIKEILENEEYKEAAKRLSEILRDQPISP-ELAVKWVEFVARHGGAKHLKEA-AHLSFIEYGSLDVIAF 481 (496)
T ss_pred HHHHHHHHHcChHHHHHHHHHHHHHHcCCCCH-HHHHHHHHHHHhcCCCcccCcc-ccCChhhhhhhHHHHH
Confidence 67899999999999999999999999999999 9999999999999999999998 9999999999999976
No 4
>TIGR01426 MGT glycosyltransferase, MGT family. This model describes the MGT (macroside glycosyltransferase) subfamily of the UDP-glucuronosyltransferase family. Members include a number of glucosyl transferases for macrolide antibiotic inactivation, but also include transferases of glucose-related sugars for macrolide antibiotic production.
Probab=95.25 E-value=0.05 Score=42.94 Aligned_cols=45 Identities=20% Similarity=0.314 Sum_probs=39.1
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIA 48 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r 48 (116)
+.+.++|.++++|++|++++++++..+.+.+ +.+.++.++|.+++
T Consensus 347 ~~l~~ai~~~l~~~~~~~~~~~l~~~~~~~~--~~~~aa~~i~~~~~ 391 (392)
T TIGR01426 347 EKLREAVLAVLSDPRYAERLRKMRAEIREAG--GARRAADEIEGFLA 391 (392)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHHHHcC--CHHHHHHHHHHhhc
Confidence 4688999999999999999999999998654 77889999987654
No 5
>PLN02670 transferase, transferring glycosyl groups
Probab=95.18 E-value=0.055 Score=44.90 Aligned_cols=47 Identities=11% Similarity=0.180 Sum_probs=38.0
Q ss_pred chHHHHHHHHhcCH---HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH 49 (116)
Q Consensus 2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~ 49 (116)
+++.+||++++.|+ +|++||++++.+++++| .|.+.+---+++..+.
T Consensus 417 e~i~~av~~vm~~~~g~~~r~~a~~l~~~~~~~~-~~~~~~~~~~~~l~~~ 466 (472)
T PLN02670 417 DSVAESVRLAMVDDAGEEIRDKAKEMRNLFGDMD-RNNRYVDELVHYLREN 466 (472)
T ss_pred HHHHHHHHHHhcCcchHHHHHHHHHHHHHHhCcc-hhHHHHHHHHHHHHHh
Confidence 47899999999886 79999999999999998 5566666666665554
No 6
>COG1819 Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms]
Probab=94.22 E-value=0.16 Score=41.23 Aligned_cols=48 Identities=21% Similarity=0.243 Sum_probs=41.7
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCC
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRG 51 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g 51 (116)
+.+.++|++||+|++|++++++++..++..+- .+.+..++|-..+.++
T Consensus 356 ~~l~~av~~vL~~~~~~~~~~~~~~~~~~~~g--~~~~a~~le~~~~~~~ 403 (406)
T COG1819 356 ERLRAAVNEVLADDSYRRAAERLAEEFKEEDG--PAKAADLLEEFAREKK 403 (406)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHhhhccc--HHHHHHHHHHHHhccc
Confidence 46889999999999999999999999998864 6778999998777654
No 7
>PLN02554 UDP-glycosyltransferase family protein
Probab=90.81 E-value=0.92 Score=37.54 Aligned_cols=30 Identities=10% Similarity=0.123 Sum_probs=26.8
Q ss_pred chHHHHHHHHhc-CHHHHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACM-SPQYKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~-n~~Yk~na~~ls~~~~d~ 31 (116)
+++.+||+++++ |++|++||+++++++++.
T Consensus 428 e~l~~av~~vm~~~~~~r~~a~~l~~~~~~a 458 (481)
T PLN02554 428 EEIERGIRCLMEQDSDVRKRVKEMSEKCHVA 458 (481)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH
Confidence 468899999996 899999999999999854
No 8
>PF15050 SCIMP: SCIMP protein
Probab=88.71 E-value=1.2 Score=30.88 Aligned_cols=47 Identities=15% Similarity=0.311 Sum_probs=20.7
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc-ccCCCcc
Q psy5669 62 LPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGKDDK-KATKPDR 111 (116)
Q Consensus 62 l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~~~-k~~~~~~ 111 (116)
|+||.-+.+=++++-++++-+ ++..++.++||+..+..+| +..||-+
T Consensus 1 M~WWr~nFWiiLAVaII~vS~---~lglIlyCvcR~~lRqGkkweiakp~k 48 (133)
T PF15050_consen 1 MSWWRDNFWIILAVAIILVSV---VLGLILYCVCRWQLRQGKKWEIAKPLK 48 (133)
T ss_pred CchHHhchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHccccceeccchh
Confidence 567665544444333222111 2233445556655443333 4445543
No 9
>PLN02562 UDP-glycosyltransferase
Probab=87.10 E-value=2.1 Score=35.20 Aligned_cols=30 Identities=7% Similarity=0.059 Sum_probs=26.0
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~ 31 (116)
+.+.++|+++++|++|++||++++...+..
T Consensus 401 ~~l~~~v~~~l~~~~~r~~a~~l~~~~~~~ 430 (448)
T PLN02562 401 KEVEEGLRKVMEDSGMGERLMKLRERAMGE 430 (448)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHhc
Confidence 568899999999999999999998866543
No 10
>PF07214 DUF1418: Protein of unknown function (DUF1418); InterPro: IPR010815 This family consists of several hypothetical Enterobacterial proteins of around 100 residues in length. Members of this family are often described as YbjC. In Escherichia coli the ybjC gene is located downstream of nfsA (which encodes the major oxygen-insensitive nitroreductase). It is thought that nfsA and ybjC form an operon an its promoter is a class I SoxS-dependent promoter []. The function of this family is unknown.
Probab=83.95 E-value=3.4 Score=27.40 Aligned_cols=34 Identities=24% Similarity=0.398 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHhhhhccccccccCCCcccccCC
Q psy5669 83 VLYTLARIGTSFAEFTRGKDDKKATKPDRNKKNK 116 (116)
Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~ 116 (116)
.+.+++++.+.+.=..+..+.+++.+++|+|||.
T Consensus 60 av~ivWR~a~~lap~l~~~~~~~~~~s~~~k~~d 93 (96)
T PF07214_consen 60 AVNIVWRVAKGLAPLLIDRPPKSSSRSDREKKND 93 (96)
T ss_pred HHHHHHHHHHHhchHhhcCCCCccccchhhhccc
Confidence 4455667766555545455555555688888873
No 11
>PLN02207 UDP-glycosyltransferase
Probab=83.34 E-value=1.4 Score=36.60 Aligned_cols=30 Identities=7% Similarity=0.107 Sum_probs=26.3
Q ss_pred chHHHHHHHHhc--CHHHHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACM--SPQYKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~--n~~Yk~na~~ls~~~~d~ 31 (116)
++|.++|+++++ +++||+||++++.+.+.-
T Consensus 414 e~i~~av~~vm~~~~~~~r~~a~~l~~~a~~A 445 (468)
T PLN02207 414 NEIETAIRCVMNKDNNVVRKRVMDISQMIQRA 445 (468)
T ss_pred HHHHHHHHHHHhcchHHHHHHHHHHHHHHHHH
Confidence 578999999997 789999999999988843
No 12
>PLN03004 UDP-glycosyltransferase
Probab=81.32 E-value=2.2 Score=35.29 Aligned_cols=28 Identities=7% Similarity=0.036 Sum_probs=24.6
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFR 29 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~ 29 (116)
+.+.++|+++++|++|++++++++..-+
T Consensus 412 e~l~~av~~vm~~~~~r~~a~~~~~~a~ 439 (451)
T PLN03004 412 TEVEKRVQEIIGECPVRERTMAMKNAAE 439 (451)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHHH
Confidence 5788999999999999999999887544
No 13
>PLN03007 UDP-glucosyltransferase family protein
Probab=77.42 E-value=3 Score=34.47 Aligned_cols=30 Identities=7% Similarity=0.246 Sum_probs=26.1
Q ss_pred chHHHHHHHHhcCH---HHHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d~ 31 (116)
+.|.++|++++.|+ +|+++|++++.+.++.
T Consensus 428 ~~l~~av~~~m~~~~~~~~r~~a~~~~~~a~~a 460 (482)
T PLN03007 428 EKVEKAVREVIVGEEAEERRLRAKKLAEMAKAA 460 (482)
T ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHH
Confidence 46889999999988 9999999999977754
No 14
>cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics.
Probab=76.23 E-value=5.1 Score=31.40 Aligned_cols=41 Identities=15% Similarity=0.081 Sum_probs=27.8
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+.+.++|+++++++.|++ +.+..+.+. ...+.+.++..||-
T Consensus 360 ~~l~~al~~~l~~~~~~~-~~~~~~~~~--~~~g~~~~~~~ie~ 400 (401)
T cd03784 360 ERLAAALRRLLDPPSRRR-AAALLRRIR--EEDGVPSAADVIER 400 (401)
T ss_pred HHHHHHHHHHhCHHHHHH-HHHHHHHHH--hccCHHHHHHHHhh
Confidence 468899999998765555 555544443 34677888887763
No 15
>PF07069 PRRSV_2b: Porcine reproductive and respiratory syndrome virus 2b ; InterPro: IPR009775 This family consists of several Porcine reproductive and respiratory syndrome virus (PRRSV) ORF2b proteins. The function of this family is unknown however it is known that large amounts of 2b protein are present in the virion and it is thought that this protein may be an integral component of the virion [].
Probab=73.32 E-value=11 Score=23.13 Aligned_cols=34 Identities=9% Similarity=0.067 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHHhhhhcccc
Q psy5669 69 QLDVIASCTLGLGLVL--YTLARIGTSFAEFTRGKD 102 (116)
Q Consensus 69 ~lDV~~~l~~v~~~~~--~~~~~~~~~~~~~~~~~~ 102 (116)
-+|++.++.+.+.+.+ +++..|++.+|.-+.+.+
T Consensus 26 ivdiiiflailfgftiagwlvvfcirlv~sailr~r 61 (73)
T PF07069_consen 26 IVDIIIFLAILFGFTIAGWLVVFCIRLVCSAILRAR 61 (73)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHhcc
Confidence 4677766655554333 344556677776665544
No 16
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=71.22 E-value=12 Score=33.13 Aligned_cols=36 Identities=8% Similarity=-0.075 Sum_probs=13.8
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy5669 63 PWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFT 98 (116)
Q Consensus 63 ~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~ 98 (116)
.++.|+-.=++++++.++++++.-+.-+|.++||||
T Consensus 85 ~~l~~~~g~~v~~~i~ll~~il~P~vg~~fCcCRCc 120 (806)
T PF05478_consen 85 ELLAYEWGFLVCAVIGLLFIILMPLVGLCFCCCRCC 120 (806)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccC
Confidence 344444443333333333322332333444444443
No 17
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=70.80 E-value=7 Score=22.85 Aligned_cols=28 Identities=18% Similarity=0.237 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669 64 WYQTSQLDVIASCTLGLGLVLYTLARIG 91 (116)
Q Consensus 64 ~~q~~~lDV~~~l~~v~~~~~~~~~~~~ 91 (116)
+|.|..|-+-.++++.+++++.++..+.
T Consensus 8 ~YDy~tLrigGLi~A~vlfi~Gi~iils 35 (50)
T PF02038_consen 8 YYDYETLRIGGLIFAGVLFILGILIILS 35 (50)
T ss_dssp GGCHHHHHHHHHHHHHHHHHHHHHHHCT
T ss_pred ccchhHhhccchHHHHHHHHHHHHHHHc
Confidence 3778888887777766666666544443
No 18
>PF13524 Glyco_trans_1_2: Glycosyl transferases group 1
Probab=70.10 E-value=14 Score=22.73 Aligned_cols=42 Identities=7% Similarity=-0.008 Sum_probs=27.7
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSI 43 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wi 43 (116)
+.+.++|..+++||..++.+.+-+...-.+-.+...++-.++
T Consensus 50 ~el~~~i~~ll~~~~~~~~ia~~a~~~v~~~~t~~~~~~~il 91 (92)
T PF13524_consen 50 EELAEKIEYLLENPEERRRIAKNARERVLKRHTWEHRAEQIL 91 (92)
T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHhCCHHHHHHHHH
Confidence 567889999999997666655555444444556666655554
No 19
>PLN02208 glycosyltransferase family protein
Probab=70.03 E-value=16 Score=30.15 Aligned_cols=43 Identities=16% Similarity=0.108 Sum_probs=30.3
Q ss_pred chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+++.++|+++++|++ ++++++++++...+ +-+.-...-.-||.
T Consensus 389 ~~l~~ai~~~m~~~~e~g~~~r~~~~~~~~~~~~-~gsS~~~l~~~v~~ 436 (442)
T PLN02208 389 ESLSNAIKSVMDKDSDLGKLVRSNHTKLKEILVS-PGLLTGYVDKFVEE 436 (442)
T ss_pred HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHhc-CCcHHHHHHHHHHH
Confidence 578899999998765 99999999988754 33444433333433
No 20
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=69.97 E-value=22 Score=29.33 Aligned_cols=13 Identities=8% Similarity=-0.076 Sum_probs=8.7
Q ss_pred HHHHHHHHHHhCC
Q psy5669 39 AIRSIEYVIAHRG 51 (116)
Q Consensus 39 av~wiE~v~r~~g 51 (116)
+=||+.+.=+.|-
T Consensus 4 ~~~~~~~~h~lPh 16 (418)
T cd07912 4 APWLVKLLHSLPH 16 (418)
T ss_pred chHHHHHHHhCCC
Confidence 3478887766664
No 21
>PF07319 DnaI_N: Primosomal protein DnaI N-terminus; InterPro: IPR009928 This entry represents the N terminus (approximately 120 residues) of bacterial primosomal DnaI proteins, although one family member appears to be of viral origin. DnaI is one of the components of the Bacillus subtilis replication restart primosome, and is required for the DnaB75-dependent loading of the DnaC helicase [].; PDB: 2K7R_A.
Probab=67.96 E-value=2 Score=27.89 Aligned_cols=32 Identities=16% Similarity=0.246 Sum_probs=16.7
Q ss_pred CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCC
Q psy5669 1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEP 32 (116)
Q Consensus 1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p 32 (116)
|++|.++|.++..+++++++.+++...+.+.|
T Consensus 1 Me~I~~~l~~~~~~~~~~~~~~~l~~~vl~dp 32 (94)
T PF07319_consen 1 MESIGDSLKQLMKRRNFEERYEQLKQEVLSDP 32 (94)
T ss_dssp ---TT-S----S---HHHHHHHHHHHHHTT-H
T ss_pred CccHHHHHHHHHhcccHHHHHHHHHHHHHcCH
Confidence 89999999999999999999888877776654
No 22
>PRK00726 murG undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional
Probab=64.84 E-value=19 Score=27.65 Aligned_cols=43 Identities=12% Similarity=0.007 Sum_probs=32.2
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+.+.++|.++++|+.+++.+.+-+..+. ++.++...+--|.+.
T Consensus 312 ~~l~~~i~~ll~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 354 (357)
T PRK00726 312 EKLAEKLLELLSDPERLEAMAEAARALG-KPDAAERLADLIEEL 354 (357)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHHHHhcC-CcCHHHHHHHHHHHH
Confidence 4788999999999999988888776663 565666666555443
No 23
>PF11628 TCR_zetazeta: T-cell surface glycoprotein CD3 zeta chain; InterPro: IPR021663 The TCR complex of T-lymphocytes consists of either a TCR alpha/beta or TCR gamma/delta heterodimer co-expressed at the cell surface with the invariant subunits of CD3 labelled gamma, delta, epsilon, zeta, and eta []. The zeta subunit forms either homodimers or heterodimers with eta [], but eta homodimers have not been observed. The structure of the zetazeta transmembrane dimer consists of a left-handed coiled coil with polar contacts. Two aspartic acids are critical for zetazeta dimerisation and assembly with TCR []. The high affinity immunoglobulin epsilon receptor (IgE Fc receptor) subunit gamma associates with a variety of FcR alpha chains to form a functional signaling complex. The gamma subunit has a critical role in allowing the IgE Fc receptor to reach the cell surface and regulates several aspects of the immune response []. This family includes both CD3 zeta subunits and IgE Fc receptor gamma subunits. The gamma chain of the high affinity Fc receptor for IgE has significant structural homology to CD3 zeta and the related CD3 eta subunit and can facilitate T cell receptor expression and signaling in the absence of CD3 zeta and CD3 eta [].; PDB: 2HAC_B.
Probab=63.77 E-value=15 Score=19.58 Aligned_cols=19 Identities=26% Similarity=-0.031 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy5669 67 TSQLDVIASCTLGLGLVLY 85 (116)
Q Consensus 67 ~~~lDV~~~l~~v~~~~~~ 85 (116)
.|.||.++++..++++.++
T Consensus 5 CYiLDgiL~iYgiiiT~L~ 23 (33)
T PF11628_consen 5 CYILDGILFIYGIIITALY 23 (33)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred eeeHHHHHHHHHHHHHHHH
Confidence 4778998777666555544
No 24
>PLN02863 UDP-glucoronosyl/UDP-glucosyl transferase family protein
Probab=63.64 E-value=22 Score=29.53 Aligned_cols=48 Identities=6% Similarity=-0.030 Sum_probs=31.6
Q ss_pred chHHHHHHHHh-cCHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPAC-MSPQYKTNVQRLSTIFRSEPL---HPLQKAIRSIEYVIAH 49 (116)
Q Consensus 2 ~~l~~aI~~vl-~n~~Yk~na~~ls~~~~d~p~---~P~~~av~wiE~v~r~ 49 (116)
+.+.+++++++ ++++|++||++++...++.-. +.......-|+.+.+.
T Consensus 421 ~~v~~~v~~~m~~~~~~r~~a~~l~e~a~~Av~~gGSS~~~l~~~v~~i~~~ 472 (477)
T PLN02863 421 DELARVFMESVSENQVERERAKELRRAALDAIKERGSSVKDLDGFVKHVVEL 472 (477)
T ss_pred HHHHHHHHHHhhccHHHHHHHHHHHHHHHHHhccCCcHHHHHHHHHHHHHHh
Confidence 46778899887 789999999999987553311 2333344444554444
No 25
>PLN00414 glycosyltransferase family protein
Probab=63.08 E-value=27 Score=28.77 Aligned_cols=47 Identities=21% Similarity=0.116 Sum_probs=31.7
Q ss_pred chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH 49 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~ 49 (116)
+.+.+++++++.|++ |++++++++..+.+.--+..+ ....|+.+..+
T Consensus 390 ~~i~~~v~~~m~~~~e~g~~~r~~a~~~~~~~~~~gg~ss~-l~~~v~~~~~~ 441 (446)
T PLN00414 390 ESLRDTVKSVMDKDSEIGNLVKRNHKKLKETLVSPGLLSGY-ADKFVEALENE 441 (446)
T ss_pred HHHHHHHHHHhcCChhhHHHHHHHHHHHHHHHHcCCCcHHH-HHHHHHHHHHh
Confidence 478899999998754 999999999987644332222 34444444433
No 26
>PLN02167 UDP-glycosyltransferase family protein
Probab=62.38 E-value=10 Score=31.40 Aligned_cols=29 Identities=10% Similarity=0.071 Sum_probs=23.8
Q ss_pred chHHHHHHHHhcCH-HHHHHHHHHHHHHhc
Q psy5669 2 RSSRRKVQPACMSP-QYKTNVQRLSTIFRS 30 (116)
Q Consensus 2 ~~l~~aI~~vl~n~-~Yk~na~~ls~~~~d 30 (116)
+.+.++|++++.++ .|+++|++++...+.
T Consensus 422 ~~l~~av~~~m~~~~~~r~~a~~~~~~~~~ 451 (475)
T PLN02167 422 DEIAGAVRSLMDGEDVPRKKVKEIAEAARK 451 (475)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence 46889999999755 899999999886664
No 27
>PLN02448 UDP-glycosyltransferase family protein
Probab=62.25 E-value=10 Score=31.15 Aligned_cols=30 Identities=13% Similarity=0.118 Sum_probs=25.4
Q ss_pred chHHHHHHHHhcCH-----HHHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACMSP-----QYKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~n~-----~Yk~na~~ls~~~~d~ 31 (116)
+.|.++++++++|+ +|+++|++++...+..
T Consensus 403 ~~l~~av~~vl~~~~~~~~~~r~~a~~~~~~~~~a 437 (459)
T PLN02448 403 EEIAELVKRFMDLESEEGKEMRRRAKELQEICRGA 437 (459)
T ss_pred HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHHH
Confidence 46899999999875 7999999999887754
No 28
>PF03401 TctC: Tripartite tricarboxylate transporter family receptor; InterPro: IPR005064 Bordetella pertussis, the causative agent of human whooping cough (pertussis), is an obligate human pathogen with diverse high-affinity transport systems for the assimilation of iron, a biometal that is essential for growth []. Periplasmic binding proteins of a new family, particularly well represented in this organism (and more generally in beta-proteobacteria), have been called Bug receptors []. They adopt a characteristic Venus flytrap fold with two globular domains bisected by a ligand-binding cleft. The family is specific for carboxylated solutes, with a characteristic mode of binding involving two highly conserved beta strand-beta turn-alpha helix motifs originating from each domain. These two motifs form hydrogen bonds with a carboxylate group of the ligand, both directly and via conserved water molecules, and have thus been termed the carboxylate pincers. Domain 1 recognises the ligand and the carboxylate group serves as an initial anchoring point. Domain 2 discriminates between productively and non-productively bound ligands as proper interactions with this domain is needed for the of the closed conformation []. BugE has a glutamate bound ligand. No charged residues are involved in glutamate binding by BugE, unlike what has been described for all glutamate receptors reported so far. The Bug architecture is highly conserved despite limited sequence identity [].; GO: 0030288 outer membrane-bounded periplasmic space; PDB: 2QPQ_C 2DVZ_A 2F5X_A.
Probab=60.48 E-value=11 Score=28.84 Aligned_cols=46 Identities=7% Similarity=0.021 Sum_probs=33.5
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH 49 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~ 49 (116)
+.+.+|++++++||.|++.+++..- ...+.+|.+...++-|...++
T Consensus 220 ~~l~~a~~~~~~~pe~~~~~~~~g~--~~~~~~~~~~~~~l~~~~~~~ 265 (274)
T PF03401_consen 220 DKLADAIKKALEDPEFQEFLEKMGL--EPVYMDGEEFDAFLAEEYARY 265 (274)
T ss_dssp HHHHHHHHHHHT-HHHHHHHHHHTE--EEECESHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCHHHHHHHHHCCC--cCCCCCHHHHHHHHHHHHHHH
Confidence 5789999999999999999988653 334778877666665554443
No 29
>PF11057 Cortexin: Cortexin of kidney; InterPro: IPR020066 Cortexin is a neuron-specific, 82-residue membrane protein which is found especially in vertebrate brain cortex tissue. It may mediate extracellular or intracellular signalling of cortical neurons during forebrain development. Cortexin is present at significant levels in the foetal brain, suggesting that it may be important to neurons of both the developing and adult cerebral cortex. Cortexin has a conserved single membrane-spanning region in the middle of each sequence []. In humans, there is selective expression of Cortexin 3 (CTXN3) in the kidney as well as the brain []. This entry contains Cortexins 1, 2 and 3.; GO: 0031224 intrinsic to membrane
Probab=59.63 E-value=16 Score=23.31 Aligned_cols=23 Identities=13% Similarity=-0.122 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy5669 73 IASCTLGLGLVLYTLARIGTSFA 95 (116)
Q Consensus 73 ~~~l~~v~~~~~~~~~~~~~~~~ 95 (116)
+++++++++++..++.+|.+++.
T Consensus 31 faFV~~L~~fL~~liVRCfrIll 53 (81)
T PF11057_consen 31 FAFVGLLCLFLGLLIVRCFRILL 53 (81)
T ss_pred ehHHHHHHHHHHHHHHHHHHHHc
Confidence 44455555555566677777664
No 30
>PLN02152 indole-3-acetate beta-glucosyltransferase
Probab=59.45 E-value=12 Score=31.02 Aligned_cols=30 Identities=7% Similarity=0.117 Sum_probs=24.5
Q ss_pred chHHHHHHHHhcCHH--HHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACMSPQ--YKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~--Yk~na~~ls~~~~d~ 31 (116)
+++.++|+++++|++ |++++++++...+..
T Consensus 405 e~l~~av~~vm~~~~~~~r~~a~~~~~~~~~a 436 (455)
T PLN02152 405 GEIRRCLEAVMEEKSVELRESAEKWKRLAIEA 436 (455)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH
Confidence 578999999998776 799998888876643
No 31
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=59.02 E-value=13 Score=32.86 Aligned_cols=29 Identities=10% Similarity=-0.172 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy5669 73 IASCTLGLGLVLYTLARIGTSFAEFTRGKD 102 (116)
Q Consensus 73 ~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~ 102 (116)
+++++++++-++.+++.||| +|+.|.|+-
T Consensus 99 i~ll~~il~P~vg~~fCcCR-Cc~~CGg~~ 127 (806)
T PF05478_consen 99 IGLLFIILMPLVGLCFCCCR-CCGNCGGRM 127 (806)
T ss_pred HHHHHHHHHHHHHHHHhccc-cCCCcCCcc
Confidence 33444444444444444443 355554433
No 32
>cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran.
Probab=57.53 E-value=22 Score=25.68 Aligned_cols=41 Identities=2% Similarity=-0.152 Sum_probs=29.0
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSI 43 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wi 43 (116)
+++.++|.++++||..++++.+-+..+. +.++....+-.|.
T Consensus 307 ~~~~~~i~~ll~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 347 (348)
T cd03820 307 EALAEALLRLMEDEELRKRMGANARESA-ERFSIENIIKQWE 347 (348)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHHHHHHH-HHhCHHHHHHHhc
Confidence 5788999999999988877776554443 3466666665553
No 33
>PLN02210 UDP-glucosyl transferase
Probab=56.77 E-value=15 Score=30.29 Aligned_cols=28 Identities=14% Similarity=0.289 Sum_probs=23.8
Q ss_pred chHHHHHHHHhcCHH---HHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSPQ---YKTNVQRLSTIFR 29 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~---Yk~na~~ls~~~~ 29 (116)
++|.+||++++.|++ +|+||++++...+
T Consensus 403 ~~l~~av~~~m~~~~g~~~r~~a~~l~~~a~ 433 (456)
T PLN02210 403 EEVERCIEAVTEGPAAADIRRRAAELKHVAR 433 (456)
T ss_pred HHHHHHHHHHhcCchHHHHHHHHHHHHHHHH
Confidence 478899999998875 9999999988655
No 34
>PF09929 DUF2161: Uncharacterized conserved protein (DUF2161); InterPro: IPR018679 This family of various hypothetical prokaryotic proteins has no known function.
Probab=56.74 E-value=8.4 Score=26.45 Aligned_cols=26 Identities=15% Similarity=0.303 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHh-cCCCCHHHHHHH
Q psy5669 16 QYKTNVQRLSTIFR-SEPLHPLQKAIR 41 (116)
Q Consensus 16 ~Yk~na~~ls~~~~-d~p~~P~~~av~ 41 (116)
-|++.|.+++..+. +-|.+|.+.+..
T Consensus 56 AYRQ~Al~~A~~L~~~Gp~~~~~l~~~ 82 (118)
T PF09929_consen 56 AYRQDALRCAAALAEHGPSRPADLRKA 82 (118)
T ss_pred hhHHHHHHHHHHHHHcCCCCHHHHHHh
Confidence 49999999999999 889999987754
No 35
>PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed
Probab=56.71 E-value=42 Score=25.91 Aligned_cols=46 Identities=9% Similarity=0.052 Sum_probs=29.3
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcC-CCCHHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSE-PLHPLQKAIRSIEYVI 47 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~-p~~P~~~av~wiE~v~ 47 (116)
+.+.+++.++++|++.++.+.+-....+.. .-.+.+..+..++..+
T Consensus 329 ~~l~~~i~~ll~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~i~~~~ 375 (380)
T PRK00025 329 EKLARALLPLLADGARRQALLEGFTELHQQLRCGADERAAQAVLELL 375 (380)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHh
Confidence 367889999999998776555543222222 2246777777766554
No 36
>PLN02410 UDP-glucoronosyl/UDP-glucosyl transferase family protein
Probab=56.69 E-value=15 Score=30.38 Aligned_cols=29 Identities=14% Similarity=0.226 Sum_probs=24.1
Q ss_pred chHHHHHHHHhcCH---HHHHHHHHHHHHHhc
Q psy5669 2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRS 30 (116)
Q Consensus 2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d 30 (116)
+.+.++|++++.|+ +|++++++++..++.
T Consensus 398 ~~v~~av~~lm~~~~~~~~r~~a~~l~~~~~~ 429 (451)
T PLN02410 398 GAVERAVKRLMVEEEGEEMRKRAISLKEQLRA 429 (451)
T ss_pred HHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHH
Confidence 57889999999876 699999988887764
No 37
>PF11346 DUF3149: Protein of unknown function (DUF3149); InterPro: IPR021494 This bacterial family of proteins has no known function.
Probab=56.00 E-value=35 Score=19.14 Aligned_cols=35 Identities=11% Similarity=-0.051 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy5669 64 WYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFT 98 (116)
Q Consensus 64 ~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~ 98 (116)
|.+.+.-||-+.-++++++++.+...+.+.+.+..
T Consensus 3 w~~LF~s~vGL~Sl~vI~~~igm~~~~~~~F~~k~ 37 (42)
T PF11346_consen 3 WKDLFGSDVGLMSLIVIVFTIGMGVFFIRYFIRKM 37 (42)
T ss_pred HHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45556667766666666656665556666555543
No 38
>PF05186 Dpy-30: Dpy-30 motif; InterPro: IPR007858 This motif is about 40 residues long and is probably formed of two alpha-helices. It is found in the Dpy-30 proteins, hence the motifs name. Dpy-30 from Caenorhabditis elegans is an essential component of dosage compensation machinery and loss of dpy-30 activity results in XX-specific lethality; in XO animals, Dpy-30 is required for developmental processes other than dosage compensation []. In yeast, the homologue of DPY-30, Saf19p, functions as part of the Set1 complex that is necessary for the methylation of histone H3 at lysine residue 4; Set1 is a key part of epigenetic developmental control []. There is also a human homologue of Dpy-30 []. This Dpy-30 region may be a dimerisation motif analogous that found in the cAMP-dependent protein kinase regulator, type II PKA, R subunit IPR003117 from INTERPRO.; PDB: 3G36_D.
Probab=55.76 E-value=34 Score=18.96 Aligned_cols=22 Identities=14% Similarity=0.305 Sum_probs=14.4
Q ss_pred HHHHHHHHhcCCCCHHHHHHHH
Q psy5669 21 VQRLSTIFRSEPLHPLQKAIRS 42 (116)
Q Consensus 21 a~~ls~~~~d~p~~P~~~av~w 42 (116)
++-+..+.+.||.+|.+-...|
T Consensus 15 ~~gL~~l~~~rP~DPi~~La~~ 36 (42)
T PF05186_consen 15 TEGLAELAKERPEDPIEFLAEY 36 (42)
T ss_dssp HHHHHHHHHH--SSHHHHHHHH
T ss_pred HHHHHHHHHHCCCChHHHHHHH
Confidence 4556777889999999855544
No 39
>PLN02764 glycosyltransferase family protein
Probab=53.94 E-value=45 Score=27.69 Aligned_cols=49 Identities=14% Similarity=-0.002 Sum_probs=35.9
Q ss_pred chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCC
Q psy5669 2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRG 51 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g 51 (116)
+.+.+|++++++|++ +++++++++..+++.- +.-.....-++.+.+..+
T Consensus 395 e~i~~av~~vm~~~~~~g~~~r~~a~~~~~~~~~~G-SS~~~l~~lv~~~~~~~~ 448 (453)
T PLN02764 395 ESLRDAINSVMKRDSEIGNLVKKNHTKWRETLASPG-LLTGYVDNFIESLQDLVS 448 (453)
T ss_pred HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHHHHhcc
Confidence 578899999998753 9999999999997664 334455556666665543
No 40
>PF03213 Pox_P35: Poxvirus P35 protein; InterPro: IPR004900 The Poxvirus P35 protein is an immunodominant envelope protein. It binds to heparan sulphate on the cell surface to provide virion attachment to target cell [].; GO: 0019031 viral envelope
Probab=50.84 E-value=33 Score=27.46 Aligned_cols=39 Identities=13% Similarity=0.177 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHhCCCCCC-cccCCCCCHHHHHHHHHHHHHHH
Q psy5669 36 LQKAIRSIEYVIAHRGAPHL-KTKATSLPWYQTSQLDVIASCTL 78 (116)
Q Consensus 36 ~~~av~wiE~v~r~~g~~~l-~~~~~~l~~~q~~~lDV~~~l~~ 78 (116)
-.++..|++. ++|++.+. ..| -+||+-.+.++|+.++.+
T Consensus 258 wsrl~~Wla~--~~P~~~y~lttP--LfSfFGlfDInv~g~~ii 297 (325)
T PF03213_consen 258 WSRLGKWLAK--RFPGAYYFLTTP--LFSFFGLFDINVIGVIII 297 (325)
T ss_pred HHHHHHHHHh--hCCCchhhhhch--HHHHcccchhHHHHHHHH
Confidence 3455566542 56665543 322 345555555565544433
No 41
>COG0133 TrpB Tryptophan synthase beta chain [Amino acid transport and metabolism]
Probab=50.49 E-value=43 Score=27.29 Aligned_cols=44 Identities=20% Similarity=0.302 Sum_probs=36.7
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV 46 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v 46 (116)
+.|..|..+.-.||+|++....+.+-+.-|| +|+-.|-.-+|+.
T Consensus 26 ~eLe~ay~~~~~D~~F~~el~~~l~~Y~GRp-tpLy~a~~Lt~~~ 69 (396)
T COG0133 26 EELEKAYEKAKNDPEFQAELDYLLKDYAGRP-TPLYFAERLTEHL 69 (396)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHhCCCC-ChhHHHHHHHHhh
Confidence 4677788888899999999999999999999 8888776666655
No 42
>PF15013 CCSMST1: CCSMST1 family
Probab=50.42 E-value=14 Score=23.53 Aligned_cols=19 Identities=26% Similarity=0.609 Sum_probs=12.8
Q ss_pred CcccCCCCCHHHHHHHHHH
Q psy5669 55 LKTKATSLPWYQTSQLDVI 73 (116)
Q Consensus 55 l~~~~~~l~~~q~~~lDV~ 73 (116)
+.+....++|||.+.+=+-
T Consensus 21 ~~~~~~~~PWyq~~~is~s 39 (77)
T PF15013_consen 21 RGGKQQRMPWYQVYPISLS 39 (77)
T ss_pred cCCCCCCCcceeeehhHHH
Confidence 5555566899998765443
No 43
>cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold.
Probab=49.04 E-value=57 Score=23.54 Aligned_cols=44 Identities=11% Similarity=0.027 Sum_probs=29.2
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+++.++|.++++|+.+.+.+.+-+.....+..+....+-.+.+.
T Consensus 329 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 372 (374)
T cd03801 329 EALAEAILRLLDDPELRRRLGEAARERVAERFSWDRVAARTEEV 372 (374)
T ss_pred HHHHHHHHHHHcChHHHHHHHHHHHHHHHHhcCHHHHHHHHHHh
Confidence 56889999999999877666555543344555666666555543
No 44
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=47.62 E-value=6.2 Score=31.06 Aligned_cols=20 Identities=20% Similarity=0.326 Sum_probs=0.0
Q ss_pred HHHHHHHhcCCCCHHHHHHH
Q psy5669 22 QRLSTIFRSEPLHPLQKAIR 41 (116)
Q Consensus 22 ~~ls~~~~d~p~~P~~~av~ 41 (116)
.++++++.+.--+|.+.+..
T Consensus 70 ~~L~~~L~~~~g~~~~~f~~ 89 (290)
T PF05454_consen 70 EKLRKRLVDDDGKPSQEFVR 89 (290)
T ss_dssp --------------------
T ss_pred HHHHHHHhcCCCCcCHHHHH
Confidence 34444444443344443333
No 45
>KOG4331|consensus
Probab=47.01 E-value=34 Score=30.80 Aligned_cols=38 Identities=13% Similarity=-0.009 Sum_probs=17.4
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy5669 63 PWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRG 100 (116)
Q Consensus 63 ~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~ 100 (116)
-|..|+-.=+...+++++++++......+.++|++|++
T Consensus 98 ~lv~~~~g~lv~sV~~v~~iil~p~~~~~yccc~C~~r 135 (865)
T KOG4331|consen 98 LLVIYEAGMLVCSVILVLYIILKPSEGLCYCCCRCCRR 135 (865)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhheeeeHhh
Confidence 34444444444444444444444444555555555543
No 46
>PHA03105 EEV glycoprotein; Provisional
Probab=46.24 E-value=21 Score=25.97 Aligned_cols=24 Identities=25% Similarity=0.163 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHhhhhccccccc
Q psy5669 81 GLVLYTLARIGTSFAEFTRGKDDKK 105 (116)
Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~k 105 (116)
++++| ++.+++--++.|+.++++|
T Consensus 17 iLi~Y-ll~i~K~~iKKflkkkk~K 40 (188)
T PHA03105 17 VLILY-IFFICKNTIKKFLKKKKGK 40 (188)
T ss_pred HHHHH-HHHHHHHHHHHHHhccCCC
Confidence 33444 3445555555554444433
No 47
>KOG4433|consensus
Probab=46.19 E-value=1.2e+02 Score=25.82 Aligned_cols=65 Identities=11% Similarity=-0.017 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHHHhCCCC--------CCcccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc
Q psy5669 35 PLQKAIRSIEYVIAHRGAP--------HLKTKATSLPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGKDDK 104 (116)
Q Consensus 35 P~~~av~wiE~v~r~~g~~--------~l~~~~~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (116)
|.-.+-||+.+.-..|..+ .++++ +-.+.|- |=+.+.+.+ ...++.+++.++..+|++|+++..-
T Consensus 4 ~~y~a~~~v~~lh~lP~~~~~~~~tns~F~pe--~~~Y~Qa--L~lla~l~a-a~l~l~Ll~ll~yli~~cC~Rr~~~ 76 (526)
T KOG4433|consen 4 VMYFAPWWVNLLHVLPHLNFKLHPTNSVFRPE--DSEYQQA--LLLLAALAA-ACLGLSLLFLLFYLICRCCCRRETT 76 (526)
T ss_pred ccccchhHHHHhhcCCCCCceEeeccccCCCC--cHHHHHH--HHHHHHHHH-HHHHHHHHHHHHHHHHHHHcCCCcc
Confidence 4456778998887766432 24443 2223332 222221211 1223344455566667777665543
No 48
>PLN00164 glucosyltransferase; Provisional
Probab=45.51 E-value=24 Score=29.33 Aligned_cols=30 Identities=10% Similarity=0.120 Sum_probs=23.2
Q ss_pred chHHHHHHHHhcCHH-----HHHHHHHHHHHHhcC
Q psy5669 2 RSSRRKVQPACMSPQ-----YKTNVQRLSTIFRSE 31 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~-----Yk~na~~ls~~~~d~ 31 (116)
+.+..+|++++.|++ ++++|++++...+..
T Consensus 419 e~l~~av~~vm~~~~~~~~~~r~~a~~~~~~~~~a 453 (480)
T PLN00164 419 AELERAVRSLMGGGEEEGRKAREKAAEMKAACRKA 453 (480)
T ss_pred HHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHHH
Confidence 478899999998765 588888887776654
No 49
>PTZ00234 variable surface protein Vir12; Provisional
Probab=45.38 E-value=17 Score=30.12 Aligned_cols=31 Identities=19% Similarity=0.276 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHhhhhccccccccCCCcccc
Q psy5669 83 VLYTLARIGTSFAEFTRGKDDKKATKPDRNK 113 (116)
Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~ 113 (116)
++++.+.-.-.-++.-+.+|++|.+..-|||
T Consensus 377 ifFlfyyn~ss~lks~~~krkrkk~~~ehny 407 (433)
T PTZ00234 377 LVFLFFFFKSTPIRSQTNKGEKKKRKPQNNY 407 (433)
T ss_pred HHHhhhhhcccchhccccchhhcccchhhhh
Confidence 3344444443444444455555555556664
No 50
>COG2011 AbcD ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]
Probab=45.23 E-value=40 Score=25.57 Aligned_cols=34 Identities=18% Similarity=0.266 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy5669 65 YQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFT 98 (116)
Q Consensus 65 ~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~ 98 (116)
||.|+-||..+.++++++++.++-.+--++.+++
T Consensus 186 Y~Rf~~~Vm~~~viillilVq~iQ~~Gd~l~~r~ 219 (222)
T COG2011 186 YQRFNTDVMVVAVVILLILVQLIQSLGDRLVRRL 219 (222)
T ss_pred HHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7889999998777777777777666666666654
No 51
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=43.90 E-value=54 Score=21.42 Aligned_cols=23 Identities=30% Similarity=0.216 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q psy5669 74 ASCTLGLGLVLYTLARIGTSFAE 96 (116)
Q Consensus 74 ~~l~~v~~~~~~~~~~~~~~~~~ 96 (116)
+++++++++++++++++++.+++
T Consensus 20 ~~~l~~~~~~l~ll~~ll~~~~~ 42 (108)
T PF07219_consen 20 LILLLLLFVVLYLLLRLLRRLLS 42 (108)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 33444444455555565555543
No 52
>cd08806 CARD_CARD14_CARMA2 Caspase activation and recruitment domain of CARD14-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD14, also known as BIMP2 or CARMA2 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 2). CARD14 has been identified as a novel member of the MAGUK (membrane-associated guanylate kinase) family that functions as upstream activators of BCL10 (B-cell lymphoma 10) and NF-kB signaling. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways
Probab=43.64 E-value=82 Score=20.45 Aligned_cols=38 Identities=13% Similarity=0.211 Sum_probs=30.0
Q ss_pred HHHHHhcCHHHHHHHHH---HHHHHhcCCCCHHHHHHHHHH
Q psy5669 7 KVQPACMSPQYKTNVQR---LSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 7 aI~~vl~n~~Yk~na~~---ls~~~~d~p~~P~~~av~wiE 44 (116)
-..+|++++.+..++.+ +-.++..+--.+...++.-.|
T Consensus 35 deeeIls~~t~~~r~~k~g~LLDIL~trG~~g~~aFLeSLe 75 (86)
T cd08806 35 DEEEVLHSPRLTNRAMRVGHLLDLLKTRGKNGAIAFLESLK 75 (86)
T ss_pred hHHHHHccchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHH
Confidence 45789999999999999 777888887676666666655
No 53
>cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides.
Probab=43.40 E-value=95 Score=22.88 Aligned_cols=43 Identities=7% Similarity=0.084 Sum_probs=29.7
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+.+.++|.++++|+...+.+.+-+..+..+ ++....+-.|.+.
T Consensus 322 ~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~-~s~~~~~~~~~~~ 364 (366)
T cd03822 322 AALAEAIRRLLADPELAQALRARAREYARA-MSWERVAERYLRL 364 (366)
T ss_pred HHHHHHHHHHHcChHHHHHHHHHHHHHHhh-CCHHHHHHHHHHH
Confidence 468899999999976555555555445545 7877777766654
No 54
>PF05510 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=42.68 E-value=40 Score=27.65 Aligned_cols=21 Identities=14% Similarity=0.137 Sum_probs=9.0
Q ss_pred CcccCCCCCHHHHHHHHHHHHH
Q psy5669 55 LKTKATSLPWYQTSQLDVIASC 76 (116)
Q Consensus 55 l~~~~~~l~~~q~~~lDV~~~l 76 (116)
..++...++-- .|.-|.++.+
T Consensus 269 ~~Pp~~~~p~R-~y~~d~~vtl 289 (386)
T PF05510_consen 269 FNPPKESVPGR-DYFPDFLVTL 289 (386)
T ss_pred CCCCccccccc-ccHHHHHHHH
Confidence 44444444432 2444544333
No 55
>PF04328 DUF466: Protein of unknown function (DUF466); InterPro: IPR007423 This is a small bacterial protein of unknown function.
Probab=42.28 E-value=79 Score=19.21 Aligned_cols=44 Identities=20% Similarity=0.217 Sum_probs=34.4
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVI 47 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~ 47 (116)
+.+...++.++.+|.|..=+.....---|.| |+..-.+|=+...
T Consensus 10 ~~~~~~~r~l~G~~~Ye~Yv~H~~~~HP~~p--~ms~~eF~r~r~~ 53 (65)
T PF04328_consen 10 RRVRWYARLLVGEPDYERYVEHMRRHHPDEP--PMSEREFFRERQD 53 (65)
T ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHHCcCCC--CCCHHHHHHHHHH
Confidence 3567889999999999999999888877775 6666667766543
No 56
>PF01601 Corona_S2: Coronavirus S2 glycoprotein; InterPro: IPR002552 The type I glycoprotein S of Coronavirus, trimers of which constitute the typical viral spikes, is assembled into virions through noncovalent interactions with the M protein. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 IPR002551 from INTERPRO and S2 []. Both chimeric S proteins appeared to cause cell fusion when expressed individually, suggesting that they were biologically fully active []. The spike is a type I membrane glycoprotein that possesses a conserved transmembrane anchor and an unusual cysteine-rich (cys) domain that bridges the putative junction of the anchor and the cytoplasmic tail [].; GO: 0006944 cellular membrane fusion, 0046813 virion attachment, binding of host cell surface receptor, 0016021 integral to membrane, 0019031 viral envelope; PDB: 2BEQ_B 2FXP_A 1ZVB_A 1WNC_D 1ZV8_H 1ZV7_B 1WYY_B 1ZVA_A 2BEZ_F 1WDG_A ....
Probab=42.27 E-value=8.4 Score=33.26 Aligned_cols=33 Identities=12% Similarity=0.066 Sum_probs=0.4
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669 60 TSLPWYQTSQLDVIASCTLGLGLVLYTLARIGT 92 (116)
Q Consensus 60 ~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~ 92 (116)
..+|||....+=+.+++++.+++.+++..-||+
T Consensus 544 iKWPWyVWL~i~~~li~~~~~l~~~~~~TGCCG 576 (610)
T PF01601_consen 544 IKWPWYVWLAIILALIAFALILLWCCCMTGCCG 576 (610)
T ss_dssp H--------------------------------
T ss_pred eehHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence 378998877666555554444444444444554
No 57
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=42.20 E-value=46 Score=23.45 Aligned_cols=14 Identities=0% Similarity=-0.278 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHHH
Q psy5669 81 GLVLYTLARIGTSF 94 (116)
Q Consensus 81 ~~~~~~~~~~~~~~ 94 (116)
+++.+++|.++|-+
T Consensus 130 l~i~~giy~~~r~~ 143 (145)
T PF10661_consen 130 LAICGGIYVVLRKV 143 (145)
T ss_pred HHHHHHHHHHHHHh
Confidence 33444455555544
No 58
>cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase.
Probab=40.80 E-value=46 Score=24.43 Aligned_cols=39 Identities=8% Similarity=0.034 Sum_probs=25.1
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAI 40 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av 40 (116)
+++.++|.++++|+..++++.+-+..+..+..++...+-
T Consensus 353 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~ 391 (394)
T cd03794 353 EALAAAILELLDDPEERAEMGENGRRYVEEKFSREKLAE 391 (394)
T ss_pred HHHHHHHHHHHhChHHHHHHHHHHHHHHHHhhcHHHHHH
Confidence 467888999998888776665555544444455554443
No 59
>cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide.
Probab=40.12 E-value=62 Score=23.67 Aligned_cols=41 Identities=12% Similarity=-0.031 Sum_probs=25.8
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS 42 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~w 42 (116)
+.+.++|.++++|++..+.+.+-+.....+..+....+-.|
T Consensus 333 ~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~ 373 (375)
T cd03821 333 DALAAALRRALELPQRLKAMGENGRALVEERFSWTAIAQQL 373 (375)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhcCHHHHHHHh
Confidence 46788899999988766665555554434445555544433
No 60
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=39.98 E-value=67 Score=22.07 Aligned_cols=10 Identities=10% Similarity=0.282 Sum_probs=0.0
Q ss_pred HhcCCCCHHH
Q psy5669 28 FRSEPLHPLQ 37 (116)
Q Consensus 28 ~~d~p~~P~~ 37 (116)
+-..+.+|.+
T Consensus 41 is~tt~sP~e 50 (122)
T PF01102_consen 41 ISVTTVSPPE 50 (122)
T ss_dssp ----------
T ss_pred eccccccCCC
Confidence 3344444444
No 61
>PRK08939 primosomal protein DnaI; Reviewed
Probab=39.82 E-value=42 Score=26.23 Aligned_cols=32 Identities=13% Similarity=0.301 Sum_probs=25.7
Q ss_pred CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCC
Q psy5669 1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEP 32 (116)
Q Consensus 1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p 32 (116)
||++.+++.++..++.++++.+++...+...|
T Consensus 1 me~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 32 (306)
T PRK08939 1 MESIGKTLKKLMKRQDFRKRYEKLKQEVLADP 32 (306)
T ss_pred CccHHHHHHHHHhchhHHHHHHHHHHHHhcCH
Confidence 89999999999988888888887755554444
No 62
>PLN02992 coniferyl-alcohol glucosyltransferase
Probab=39.61 E-value=91 Score=26.10 Aligned_cols=28 Identities=11% Similarity=0.172 Sum_probs=21.1
Q ss_pred chHHHHHHHHhcCH---HHHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSP---QYKTNVQRLSTIFR 29 (116)
Q Consensus 2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~ 29 (116)
+++.++|+++++|+ .+++++++++...+
T Consensus 415 ~~l~~av~~vm~~~~g~~~r~~a~~~~~~a~ 445 (481)
T PLN02992 415 SKIEALVRKVMVEEEGEEMRRKVKKLRDTAE 445 (481)
T ss_pred HHHHHHHHHHhcCCchHHHHHHHHHHHHHHH
Confidence 57889999999774 57777777766555
No 63
>KOG3989|consensus
Probab=39.60 E-value=45 Score=25.34 Aligned_cols=11 Identities=36% Similarity=0.352 Sum_probs=5.5
Q ss_pred HHHHHHHHhCC
Q psy5669 41 RSIEYVIAHRG 51 (116)
Q Consensus 41 ~wiE~v~r~~g 51 (116)
-|+.++...+|
T Consensus 162 ~~i~~v~~~~~ 172 (224)
T KOG3989|consen 162 IWIHYVYFYNG 172 (224)
T ss_pred HHHHHhhccCC
Confidence 35555554444
No 64
>PRK10907 intramembrane serine protease GlpG; Provisional
Probab=39.60 E-value=1.5e+02 Score=22.96 Aligned_cols=31 Identities=16% Similarity=0.068 Sum_probs=26.8
Q ss_pred hcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669 12 CMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS 42 (116)
Q Consensus 12 l~n~~Yk~na~~ls~~~~d~p~~P~~~av~w 42 (116)
+.|++..+.+++.-+.|.+.|.+|.-.+.-|
T Consensus 39 l~d~~~~~~~~~~~~~f~~~p~~~~y~~asw 69 (276)
T PRK10907 39 LADESQAERVRAELARFLENPADPRYLAASW 69 (276)
T ss_pred ecCHHHHHHHHHHHHHHHhCCCchhHHhccc
Confidence 4699999999999999999999999555555
No 65
>PF03579 SHP: Small hydrophobic protein; InterPro: IPR005327 The small hydrophobic integral membrane protein, SH (previously designated 1A) is found to have a variety of glycosylated forms [, ]. This protein is a component of the mature respiratory syncytial virion [] where it may form complexes and appears to play a structural role.; GO: 0016020 membrane, 0016021 integral to membrane, 0048222 glycoprotein network
Probab=39.43 E-value=87 Score=18.93 Aligned_cols=9 Identities=11% Similarity=0.124 Sum_probs=5.1
Q ss_pred HHHHHHHHH
Q psy5669 64 WYQTSQLDV 72 (116)
Q Consensus 64 ~~q~~~lDV 72 (116)
||.|+.|-.
T Consensus 14 FW~YFtLi~ 22 (64)
T PF03579_consen 14 FWTYFTLIF 22 (64)
T ss_pred cchHHHHHH
Confidence 566666533
No 66
>PF14851 FAM176: FAM176 family
Probab=37.71 E-value=57 Score=23.32 Aligned_cols=6 Identities=0% Similarity=-0.313 Sum_probs=2.4
Q ss_pred HHHHHh
Q psy5669 90 IGTSFA 95 (116)
Q Consensus 90 ~~~~~~ 95 (116)
+.++.|
T Consensus 42 V~risc 47 (153)
T PF14851_consen 42 VIRISC 47 (153)
T ss_pred Hhhhee
Confidence 334444
No 67
>PF06345 Drf_DAD: DRF Autoregulatory Domain; InterPro: IPR010465 This domain is found in Diaphanous-related formins (Drfs). It binds the N-terminal GTPase-binding domain; this link is broken when GTP-bound Rho binds to the GBD and activates the protein. The addition of diaphanous activating domains (DAD) to mammalian cells induces actin filament formation, stabilises microtubules, and activates serum-response mediated transcription [].; PDB: 3O4X_H 3OBV_E 2BAP_C 2F31_B.
Probab=37.62 E-value=18 Score=15.74 Aligned_cols=10 Identities=30% Similarity=0.218 Sum_probs=6.5
Q ss_pred CchHHHHHHH
Q psy5669 1 MRSSRRKVQP 10 (116)
Q Consensus 1 ~~~l~~aI~~ 10 (116)
|+++.+|++.
T Consensus 3 mdsllealqt 12 (15)
T PF06345_consen 3 MDSLLEALQT 12 (15)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHHc
Confidence 6677777653
No 68
>PF04906 Tweety: Tweety; InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised. However, they are considered to be transmembrane proteins with five potential membrane-spanning regions. A number of potential functions have been suggested on the basis of homology to the yeast FTR1 and FTH1 iron transporter proteins and the mammalian neurotensin receptors 1 and 2 in that they have a similar hydrophobicity profiles although there is no detectable sequence homology to the tweety-related proteins. It has been proposed that the tweety-related proteins could be involved in transport of iron or other divalent cations or alternatively that they may be membrane-bound receptors [].
Probab=37.60 E-value=1.6e+02 Score=23.99 Aligned_cols=18 Identities=6% Similarity=-0.153 Sum_probs=9.5
Q ss_pred HHHHHHHHHhhhhccccc
Q psy5669 86 TLARIGTSFAEFTRGKDD 103 (116)
Q Consensus 86 ~~~~~~~~~~~~~~~~~~ 103 (116)
++..++..+|++||++++
T Consensus 34 Ll~ll~yl~~~CC~r~~~ 51 (406)
T PF04906_consen 34 LLFLLIYLICRCCCRRPR 51 (406)
T ss_pred HHHHHHHHHHHhhCCCCC
Confidence 344455556666655543
No 69
>PHA02688 ORF059 IMV protein VP55; Provisional
Probab=37.53 E-value=52 Score=26.35 Aligned_cols=40 Identities=10% Similarity=0.056 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHhCCCCCC-cccCCCCCHHHHHHHHHHHHHHHH
Q psy5669 36 LQKAIRSIEYVIAHRGAPHL-KTKATSLPWYQTSQLDVIASCTLG 79 (116)
Q Consensus 36 ~~~av~wiE~v~r~~g~~~l-~~~~~~l~~~q~~~lDV~~~l~~v 79 (116)
..++..|+. -++||+.+. ..| -+||+-.+.++|+.++.++
T Consensus 256 wsrl~~Wla--~~~P~~~y~lttP--LfSfFGlfDInv~gviiil 296 (323)
T PHA02688 256 WSRLGTWLA--KRYPGFYYFLTTP--LFSFFGLFDINVIGVIIIL 296 (323)
T ss_pred HHHHHHHHH--hhCCchheeecch--HHHhhccchhHHHHHHHHH
Confidence 445666754 366776553 332 3466666666665444433
No 70
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=37.39 E-value=37 Score=20.17 Aligned_cols=24 Identities=4% Similarity=0.124 Sum_probs=21.3
Q ss_pred CchHHHHHHHHhcCHHHHHHHHHH
Q psy5669 1 MRSSRRKVQPACMSPQYKTNVQRL 24 (116)
Q Consensus 1 ~~~l~~aI~~vl~n~~Yk~na~~l 24 (116)
|+.+..-+..+.+||..++.++..
T Consensus 1 ~e~l~~Fl~~~~~d~~L~~~l~~~ 24 (64)
T TIGR03798 1 EEQLKAFLEKVKTDPDLREKLKAA 24 (64)
T ss_pred CHHHHHHHHHHHcCHHHHHHHHHc
Confidence 788999999999999999987774
No 71
>PF14979 TMEM52: Transmembrane 52
Probab=37.33 E-value=86 Score=22.44 Aligned_cols=15 Identities=20% Similarity=0.197 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHHH
Q psy5669 70 LDVIASCTLGLGLVL 84 (116)
Q Consensus 70 lDV~~~l~~v~~~~~ 84 (116)
+-|++++++++++++
T Consensus 20 WyIwLill~~~llLL 34 (154)
T PF14979_consen 20 WYIWLILLIGFLLLL 34 (154)
T ss_pred hHHHHHHHHHHHHHH
Confidence 345555544444333
No 72
>PHA02754 hypothetical protein; Provisional
Probab=37.23 E-value=55 Score=19.88 Aligned_cols=27 Identities=19% Similarity=0.370 Sum_probs=22.2
Q ss_pred HHHHHHhcCHHHHHHHHHHHHHHhcCC
Q psy5669 6 RKVQPACMSPQYKTNVQRLSTIFRSEP 32 (116)
Q Consensus 6 ~aI~~vl~n~~Yk~na~~ls~~~~d~p 32 (116)
+-|.+.+.++.|++.++++..++.++-
T Consensus 5 eEi~k~i~eK~Fke~MRelkD~LSe~G 31 (67)
T PHA02754 5 EEIPKAIMEKDFKEAMRELKDILSEAG 31 (67)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHhhCc
Confidence 457788889999999999999887653
No 73
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=37.10 E-value=66 Score=22.10 Aligned_cols=7 Identities=14% Similarity=-0.391 Sum_probs=2.5
Q ss_pred HHHHHHH
Q psy5669 73 IASCTLG 79 (116)
Q Consensus 73 ~~~l~~v 79 (116)
+++++.+
T Consensus 67 ~~Ii~gv 73 (122)
T PF01102_consen 67 IGIIFGV 73 (122)
T ss_dssp HHHHHHH
T ss_pred eehhHHH
Confidence 3333333
No 74
>PRK13028 tryptophan synthase subunit beta; Provisional
Probab=36.75 E-value=45 Score=27.25 Aligned_cols=38 Identities=16% Similarity=0.257 Sum_probs=30.2
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAI 40 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av 40 (116)
+.|.++..++..||+|++....+-.-+..+| +|+..+-
T Consensus 32 ~~~~~~~~~~~~~~~f~~~~~~~~~~~~g~p-TPL~~~~ 69 (402)
T PRK13028 32 DELEAAYEEIKKDPDFIAELRYLLKHYVGRP-TPLYHAK 69 (402)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHhCCCC-CCeeehH
Confidence 5788888999999999999999888887766 5554443
No 75
>smart00394 RIIa RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).
Probab=36.44 E-value=70 Score=16.91 Aligned_cols=24 Identities=21% Similarity=0.242 Sum_probs=19.0
Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHH
Q psy5669 25 STIFRSEPLHPLQKAIRSIEYVIA 48 (116)
Q Consensus 25 s~~~~d~p~~P~~~av~wiE~v~r 48 (116)
..+++++|-+|.+-++.|.+-..+
T Consensus 12 ~~vl~~qP~d~~~f~~~yF~kL~~ 35 (38)
T smart00394 12 VEVLRAQPSDLVQFAADYFEKLEE 35 (38)
T ss_pred HHHHHHCCCcHHHHHHHHHHHHHH
Confidence 457889999999999888875543
No 76
>PF07856 Orai-1: Mediator of CRAC channel activity; InterPro: IPR012446 This entry includes Drosophila Orai and human Orai1, Orai2 and Orai3. ORAI-1 GFP reporters are co-expressed with STIM-1 (ER CA(2+) sensors) in the gonad and intestine. The protein has four predicted transmembrane domains with a highly conserved region between TM2 ad TM3. This conserved domain is thought to function in channel regulation. ORAI1-related proteins are required for the production of the calcium channel, CRAC, along with STIM1-related proteins [].
Probab=36.37 E-value=42 Score=24.41 Aligned_cols=56 Identities=16% Similarity=0.189 Sum_probs=32.8
Q ss_pred cCCCCHHHHHHHHHHHHHHhC---CCCCCcccCCCCCHHHHHHH--HHHHHHHHHHHHHHH
Q psy5669 30 SEPLHPLQKAIRSIEYVIAHR---GAPHLKTKATSLPWYQTSQL--DVIASCTLGLGLVLY 85 (116)
Q Consensus 30 d~p~~P~~~av~wiE~v~r~~---g~~~l~~~~~~l~~~q~~~l--DV~~~l~~v~~~~~~ 85 (116)
..|.+|.+..-.|+|-.-++. |..-+=..-.-++|++++.. ...+..+.++++...
T Consensus 93 ~~~~sP~~~f~~~cE~~W~~s~~lGi~lFL~~l~l~~WIKF~~~~~~~aa~~~t~i~~~~~ 153 (175)
T PF07856_consen 93 PVPPSPHRRFHRYCELAWRFSTVLGIPLFLAELALLGWIKFWDSPSPAAAIAITAILVPVL 153 (175)
T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHheeehhccchHHHHHHHHHHHHHH
Confidence 377789888888888332221 43333333346789998888 555444444443333
No 77
>PF03889 DUF331: Domain of unknown function; InterPro: IPR005589 The members of this family are uncharacterised proteins from a number of bacterial species. They range in size from 50-100 residues.
Probab=36.19 E-value=26 Score=19.41 Aligned_cols=19 Identities=16% Similarity=0.448 Sum_probs=15.0
Q ss_pred HHHHHHHhcCHHHHHHHHH
Q psy5669 5 RRKVQPACMSPQYKTNVQR 23 (116)
Q Consensus 5 ~~aI~~vl~n~~Yk~na~~ 23 (116)
.+|+..++.+|-|+...++
T Consensus 5 dNalkAlv~~~LFr~rvek 23 (39)
T PF03889_consen 5 DNALKALVTSPLFRQRVEK 23 (39)
T ss_pred hhHHHHHHhCcchhhhhhc
Confidence 3688888999988887654
No 78
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=36.13 E-value=38 Score=25.70 Aligned_cols=7 Identities=29% Similarity=0.468 Sum_probs=2.5
Q ss_pred HHHHHHh
Q psy5669 43 IEYVIAH 49 (116)
Q Consensus 43 iE~v~r~ 49 (116)
||-.+++
T Consensus 8 i~~lv~r 14 (221)
T PF08374_consen 8 IEELVRR 14 (221)
T ss_pred HHHHHHh
Confidence 3333333
No 79
>cd00904 Ferritin Ferritin iron storage proteins. Ferritins are the primary iron storage proteins of most living organisms and members of a broad superfamily of ferritin-like diiron-carboxylate proteins. The iron-free (apoferritin) ferritin molecule is a protein shell composed of 24 protein chains arranged in 432 symmetry. Iron storage involves the uptake of iron (II) at the protein shell, its oxidation by molecular oxygen at the dinuclear ferroxidase centers, and the movement of iron (III) into the cavity for deposition as ferrihydrite; the protein shell can hold up to 4500 iron atoms. In vertebrates, two types of chains (subunits) have been characterized, H or M (fast) and L (slow), which differ in rates of iron uptake and mineralization. Bacterial non-heme ferritins are composed only of H chains. Fe(II) oxidation in the H/M subunits take place initially at the ferroxidase center, a carboxylate-bridged diiron center, located within the subunit four-helix bundle. In a complementary rol
Probab=36.06 E-value=52 Score=22.99 Aligned_cols=38 Identities=13% Similarity=0.258 Sum_probs=31.1
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcc
Q psy5669 20 NVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKT 57 (116)
Q Consensus 20 na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~ 57 (116)
+...+++-|+.+-..-.+.|...++|+...||.+.+..
T Consensus 35 ~l~g~a~~f~~~s~eE~~HA~~l~~yi~~rgg~~~l~~ 72 (160)
T cd00904 35 ALKGVAHFFKEQAQEEREHAEKFYKYQNERGGRVELQD 72 (160)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHCCCccccCc
Confidence 56667777888877888999999999999999776643
No 80
>PF06837 Fijivirus_P9-2: Fijivirus P9-2 protein; InterPro: IPR009650 This family consists of several Fijivirus specific P9-2 proteins from Rice black streaked dwarf virus (RBSDV) and Fiji disease virus. The function of this family is unknown.
Probab=36.02 E-value=94 Score=23.23 Aligned_cols=66 Identities=9% Similarity=0.077 Sum_probs=35.6
Q ss_pred cCCC--CHHHHHHHHHHHHHHhCCCCCCcccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc
Q psy5669 30 SEPL--HPLQKAIRSIEYVIAHRGAPHLKTKATSLPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGK 101 (116)
Q Consensus 30 d~p~--~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~ 101 (116)
++|. |.+|.-+.-.|..++-...|-++..-...+-+ +++++..+++.++-++.+++.-++|+.|.+
T Consensus 43 e~~~sdse~Dd~vd~lE~~ve~~~~pl~rr~ygkvg~I------i~mI~sflfFGifkl~lKmfYhLfrcV~CN 110 (214)
T PF06837_consen 43 ERPLSDSELDDKVDKLETDVEDKVDPLVRRKYGKVGHI------IWMIFSFLFFGIFKLTLKMFYHLFRCVCCN 110 (214)
T ss_pred cccCcchhHHHHHHHHhhhHHhCCCHHHHhhccCccee------ehHHHHHHHHHHHHHHHHHHHHHHhhhhcC
Confidence 4564 55677788888887776555444332233221 122233333445555667776666665543
No 81
>PRK04346 tryptophan synthase subunit beta; Validated
Probab=34.88 E-value=53 Score=26.82 Aligned_cols=37 Identities=19% Similarity=0.312 Sum_probs=29.3
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA 39 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a 39 (116)
+.|.++..++.+||.|++....+-.-+.-+| +|+-.+
T Consensus 28 ~~~~~~~~~~~~~~~f~~~~~~~~~~~~grp-TPL~~~ 64 (397)
T PRK04346 28 EELEEAYEKAKNDPEFQAELDYLLKNYVGRP-TPLYFA 64 (397)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHhcCCC-CCceEh
Confidence 5688888999999999999988888776665 555444
No 82
>PRK11024 colicin uptake protein TolR; Provisional
Probab=34.72 E-value=33 Score=23.54 Aligned_cols=14 Identities=29% Similarity=0.123 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGL 82 (116)
Q Consensus 69 ~lDV~~~l~~v~~~ 82 (116)
++||.+++++++++
T Consensus 20 lIDVvfvLLiFFmv 33 (141)
T PRK11024 20 LLDVLLVLLLIFMA 33 (141)
T ss_pred HHHHHHHHHHHHHh
Confidence 78999877766543
No 83
>PLN02173 UDP-glucosyl transferase family protein
Probab=34.71 E-value=52 Score=27.22 Aligned_cols=28 Identities=7% Similarity=0.061 Sum_probs=20.9
Q ss_pred chHHHHHHHHhcCHH---HHHHHHHHHHHHh
Q psy5669 2 RSSRRKVQPACMSPQ---YKTNVQRLSTIFR 29 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~---Yk~na~~ls~~~~ 29 (116)
+.+.+++++++.|++ .+++|++++...+
T Consensus 396 e~v~~av~~vm~~~~~~~~r~~a~~~~~~a~ 426 (449)
T PLN02173 396 EEIEFSIKEVMEGEKSKEMKENAGKWRDLAV 426 (449)
T ss_pred HHHHHHHHHHhcCChHHHHHHHHHHHHHHHH
Confidence 568999999998865 4667777666555
No 84
>cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=34.69 E-value=1.5e+02 Score=21.68 Aligned_cols=42 Identities=7% Similarity=-0.069 Sum_probs=27.7
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE 44 (116)
+.+.++|.++++|++..+.+.+-+.... +..+..+.+-.+++
T Consensus 320 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 361 (364)
T cd03814 320 EAFAAALAALLADPELRRRMAARARAEA-ERRSWEAFLDNLLE 361 (364)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHHHHHHH-hhcCHHHHHHHHHH
Confidence 3478899999999987777666555554 34565555544443
No 85
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=34.45 E-value=72 Score=16.47 Aligned_cols=22 Identities=18% Similarity=0.297 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGLVLYTLARI 90 (116)
Q Consensus 69 ~lDV~~~l~~v~~~~~~~~~~~ 90 (116)
+++++.=.+++++++.++++.+
T Consensus 2 s~~vi~g~llv~lLl~YLvYAL 23 (29)
T PRK14750 2 NFSIVCGALLVLLLLGYLVYAL 23 (29)
T ss_pred cHHHHHHHHHHHHHHHHHHHHH
Confidence 3455544444444445554443
No 86
>PF05225 HTH_psq: helix-turn-helix, Psq domain; InterPro: IPR007889 This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster []. In pipsqueak this domain binds to GAGA sequence []. The pipsqueak family, which includes proteins from fungi, sea urchins, nematodes, insects, and vertebrates appear to be proteins essential for sequence-specific targeting of a polycomb group protein complex [].; GO: 0003677 DNA binding; PDB: 2COB_A.
Probab=34.28 E-value=63 Score=17.92 Aligned_cols=23 Identities=0% Similarity=0.105 Sum_probs=17.0
Q ss_pred chHHHHHHHHhcC-HHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMS-PQYKTNVQRL 24 (116)
Q Consensus 2 ~~l~~aI~~vl~n-~~Yk~na~~l 24 (116)
+.|..||..|.++ -++++.|+..
T Consensus 3 e~l~~Ai~~v~~g~~S~r~AA~~y 26 (45)
T PF05225_consen 3 EDLQKAIEAVKNGKMSIRKAAKKY 26 (45)
T ss_dssp HHHHHHHHHHHTTSS-HHHHHHHH
T ss_pred HHHHHHHHHHHhCCCCHHHHHHHH
Confidence 5688999999877 6777777664
No 87
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=34.25 E-value=98 Score=22.31 Aligned_cols=11 Identities=9% Similarity=0.501 Sum_probs=6.0
Q ss_pred CHHHHHHHHHH
Q psy5669 63 PWYQTSQLDVI 73 (116)
Q Consensus 63 ~~~q~~~lDV~ 73 (116)
+|+.-|++-++
T Consensus 25 sffsthm~tIL 35 (189)
T PF05568_consen 25 SFFSTHMYTIL 35 (189)
T ss_pred cHHHHHHHHHH
Confidence 56665555444
No 88
>PF02197 RIIa: Regulatory subunit of type II PKA R-subunit; InterPro: IPR003117 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. In the absence of cAMP, Protein Kinase A (PKA) exists as an equimolar tetramer of regulatory (R) and catalytic (C) subunits []. In addition to its role as an inhibitor of the C subunit, the R subunit anchors the holoenzyme to specific intracellular locations and prevents the C subunit from entering the nucleus. All R subunits have a conserved domain structure consisting of the N-terminal dimerization domain, inhibitory region, cAMP-binding domain A and cAMP-binding domain B. R subunits interact with C subunits primarily through the inhibitory site. The cAMP-binding domains show extensive sequence similarity and bind cAMP cooperatively. Two types of regulatory (R) subunits exist - types I and I - which differ in molecular weight, sequence, autophosphorylation cabaility, cellular location and tissue distribution. Types I and II were further sub-divided into alpha and beta subtypes, based mainly on sequence similarity. This entry represents types I-alpha, I-beta, II-alpha and II-beta regulatory subunits of PKA proteins. These subunits contain the dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).; GO: 0008603 cAMP-dependent protein kinase regulator activity, 0007165 signal transduction; PDB: 2IZY_E 1R2A_A 1L6E_A 2IZX_B 2KYG_A 2EZW_B 3IM4_B 3IM3_A 4F9K_C 2HWN_B ....
Probab=34.13 E-value=80 Score=16.95 Aligned_cols=24 Identities=25% Similarity=0.235 Sum_probs=16.5
Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHH
Q psy5669 25 STIFRSEPLHPLQKAIRSIEYVIA 48 (116)
Q Consensus 25 s~~~~d~p~~P~~~av~wiE~v~r 48 (116)
..+++++|-++++=+..|.+-..+
T Consensus 12 ~~vl~~qP~Di~~F~a~yF~~L~~ 35 (38)
T PF02197_consen 12 REVLREQPDDILQFAADYFEKLEK 35 (38)
T ss_dssp HHHHHH--S-HHHHHHHHHHHHHH
T ss_pred HHHHHHCCCcHHHHHHHHHHHHHH
Confidence 677889999999988888865544
No 89
>PRK12446 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase; Reviewed
Probab=33.73 E-value=82 Score=24.80 Aligned_cols=37 Identities=22% Similarity=0.334 Sum_probs=24.6
Q ss_pred chHHHHHHHHhcCH-HHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSP-QYKTNVQRLSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 2 ~~l~~aI~~vl~n~-~Yk~na~~ls~~~~d~p~~P~~~av~wiE 44 (116)
+.+.++|.++++|+ .|++++++++ .| ++.+..+.+++
T Consensus 313 ~~l~~~l~~ll~~~~~~~~~~~~~~-----~~-~aa~~i~~~i~ 350 (352)
T PRK12446 313 NSLIKHVEELSHNNEKYKTALKKYN-----GK-EAIQTIIDHIS 350 (352)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHHcC-----CC-CHHHHHHHHHH
Confidence 46788999999887 6776665533 33 55666666554
No 90
>cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
Probab=31.88 E-value=1e+02 Score=22.30 Aligned_cols=39 Identities=5% Similarity=0.035 Sum_probs=23.0
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAI 40 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av 40 (116)
+++.++|.++++|+...+.+.+-+.-.-.+..++...+-
T Consensus 317 ~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~s~~~~~~ 355 (359)
T cd03808 317 EALADAIERLIEDPELRARMGQAARKRAEEEFDEEIVVK 355 (359)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhcCHHHHHH
Confidence 457788888888887665554444333333445544443
No 91
>TIGR02801 tolR TolR protein. The model describes the inner membrane protein TolR, part of the TolR/TolQ complex that transduces energy from the proton-motive force, through TolA, to an outer membrane complex made up of TolB and Pal (peptidoglycan-associated lipoprotein). The complex is required to maintain outer membrane integrity, and defects may cause a defect in the import of some organic compounds in addition to the resulting morphologic. While several gene pairs homologous to talR and tolQ may be found in a single genome, but the scope of this model is set to favor finding only bone fide TolR, supported by operon structure as well as by score.
Probab=31.84 E-value=44 Score=22.31 Aligned_cols=13 Identities=15% Similarity=0.066 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLG 81 (116)
Q Consensus 69 ~lDV~~~l~~v~~ 81 (116)
++||.++++++++
T Consensus 10 lIDVvFlLLiFFm 22 (129)
T TIGR02801 10 YVDVMLVLLIIFM 22 (129)
T ss_pred HHHHHHHHHHHHH
Confidence 6899877765554
No 92
>TIGR02804 ExbD_2 TonB system transport protein ExbD, group 2. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=31.74 E-value=47 Score=22.11 Aligned_cols=13 Identities=8% Similarity=-0.069 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLG 81 (116)
Q Consensus 69 ~lDV~~~l~~v~~ 81 (116)
++||.++++++++
T Consensus 9 lIDVvflLLiFFm 21 (121)
T TIGR02804 9 FIDIMLVLLAIVL 21 (121)
T ss_pred HHHHHHHHHHHHH
Confidence 6899877765554
No 93
>PF12626 PolyA_pol_arg_C: Polymerase A arginine-rich C-terminus; PDB: 3AQN_A 3AQK_A 3AQM_B 3AQL_B.
Probab=31.68 E-value=54 Score=22.44 Aligned_cols=38 Identities=11% Similarity=-0.061 Sum_probs=21.3
Q ss_pred HHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy5669 9 QPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVI 47 (116)
Q Consensus 9 ~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~ 47 (116)
..+++.|.|+..-.-|.---.. --...+.+.||.+|.-
T Consensus 55 ~~ll~hprFRAAyDFL~LR~~~-ge~~~~l~~WW~~fq~ 92 (124)
T PF12626_consen 55 FRLLEHPRFRAAYDFLLLRAEA-GEELSELAEWWTEFQE 92 (124)
T ss_dssp HHHHHSTTHHHHHHHHHHHHHH--HHHHHHHHHHHHHTT
T ss_pred HHHHhCchHHHHHHHHHHHHHh-CCCcHHHHHHHHHHHh
Confidence 3556677777654333211111 1156788889988863
No 94
>PF11417 Inhibitor_G39P: Loader and inhibitor of phage G40P; InterPro: IPR024424 G39P inhibits the initiation of DNA replication by blocking G40P replicative helicase. G39P has a bipartite stricture consisting of a folded N-terminal domain and an unfolded C-terminal domain. The C-terminal is essential for helicase interaction [].; PDB: 1NO1_B.
Probab=31.64 E-value=56 Score=20.21 Aligned_cols=31 Identities=10% Similarity=0.073 Sum_probs=20.3
Q ss_pred HHHHHHhcCHHHHHHHHHHHHHHhcCCCCHH
Q psy5669 6 RKVQPACMSPQYKTNVQRLSTIFRSEPLHPL 36 (116)
Q Consensus 6 ~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~ 36 (116)
++=.++|+|-.|....+.+...+...+++|.
T Consensus 34 ~~W~~~L~d~~ye~v~~al~~~i~~~kfPPs 64 (71)
T PF11417_consen 34 DLWYDMLKDYDYEIVMKALKKHIATNKFPPS 64 (71)
T ss_dssp HHHHHHHTTS-HHHHHHHHHHHHHH-SS---
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHHhCCCCcC
Confidence 3344778888888888888888888887663
No 95
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=31.47 E-value=1.8e+02 Score=20.12 Aligned_cols=30 Identities=3% Similarity=-0.308 Sum_probs=18.5
Q ss_pred CCHHHHHHHHHHHHHHhCCCCCCcccCCCCCHH
Q psy5669 33 LHPLQKAIRSIEYVIAHRGAPHLKTKATSLPWY 65 (116)
Q Consensus 33 ~~P~~~av~wiE~v~r~~g~~~l~~~~~~l~~~ 65 (116)
.|..|..-+ ++.|+|+.-.++|+-...+|+
T Consensus 74 ~Sd~eI~~~---~v~RYG~~Vly~Pp~~~~t~~ 103 (126)
T TIGR03147 74 KSNQQIIDF---MTARFGDFVLYNPPFKWQTLL 103 (126)
T ss_pred CCHHHHHHH---HHHhcCCeEEecCCCCcchHH
Confidence 344444333 466888877788876655553
No 96
>PHA02849 putative transmembrane protein; Provisional
Probab=31.37 E-value=1e+02 Score=19.73 Aligned_cols=20 Identities=20% Similarity=0.247 Sum_probs=8.1
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy5669 72 VIASCTLGLGLVLYTLARIG 91 (116)
Q Consensus 72 V~~~l~~v~~~~~~~~~~~~ 91 (116)
|+.++++++.++++++..+.
T Consensus 19 vi~v~v~vI~i~~flLlyLv 38 (82)
T PHA02849 19 VILVFVLVISFLAFMLLYLI 38 (82)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33333333334444444433
No 97
>PHA00646 hypothetical protein
Probab=31.18 E-value=1.2e+02 Score=18.51 Aligned_cols=25 Identities=8% Similarity=-0.020 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGLVLYTLARIGTS 93 (116)
Q Consensus 69 ~lDV~~~l~~v~~~~~~~~~~~~~~ 93 (116)
++-++++++....+.++.++++|+.
T Consensus 39 MVgIWlvI~Fl~Wf~i~mvfKiv~~ 63 (65)
T PHA00646 39 MVGIWLVILFLTWFSLWMVFKIVGY 63 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhe
Confidence 4455666666666666777777654
No 98
>PF09959 DUF2193: Uncharacterized protein conserved in archaea (DUF2193); InterPro: IPR018694 This family of various hypothetical archaeal proteins has no known function
Probab=31.07 E-value=1.5e+02 Score=24.88 Aligned_cols=40 Identities=13% Similarity=0.172 Sum_probs=32.1
Q ss_pred HHHHHh--cCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCC
Q psy5669 7 KVQPAC--MSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGA 52 (116)
Q Consensus 7 aI~~vl--~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~ 52 (116)
+|.||| +||+|++.+.++-..+. ...++---|-+-|+||+
T Consensus 93 ~ilEILy~eD~~F~ks~~kfi~~I~------~sealIg~E~~RrygGF 134 (499)
T PF09959_consen 93 AILEILYEEDPAFRKSVEKFIEAIG------KSEALIGKESARRYGGF 134 (499)
T ss_pred HHHHHHHhcCHHHHHHHHHHHHHHh------hhHHHhhHHHHHHhcCc
Confidence 677888 89999999999988775 34566677888899884
No 99
>PRK11267 biopolymer transport protein ExbD; Provisional
Probab=30.89 E-value=45 Score=22.91 Aligned_cols=14 Identities=14% Similarity=0.107 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGL 82 (116)
Q Consensus 69 ~lDV~~~l~~v~~~ 82 (116)
++||++++++++++
T Consensus 23 lIDVvf~LLiFFmv 36 (141)
T PRK11267 23 FIDVMLVLLIIFMV 36 (141)
T ss_pred HHHHHHHHHHHHHh
Confidence 78999877766543
No 100
>cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=30.85 E-value=1.8e+02 Score=21.78 Aligned_cols=45 Identities=4% Similarity=-0.104 Sum_probs=28.9
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV 46 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v 46 (116)
+.+.++|.++++||...+.+..-+...-.+.+++...+-.|.+..
T Consensus 324 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~fs~~~~~~~~~~~y 368 (371)
T cd04962 324 EAMAEYALSLLEDDELWQEFSRAARNRAAERFDSERIVPQYEALY 368 (371)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 457788889998887655544433333335577777777776543
No 101
>PF14851 FAM176: FAM176 family
Probab=30.70 E-value=1.1e+02 Score=21.79 Aligned_cols=16 Identities=6% Similarity=-0.128 Sum_probs=10.2
Q ss_pred HHHHHHHHHHhhhhcc
Q psy5669 85 YTLARIGTSFAEFTRG 100 (116)
Q Consensus 85 ~~~~~~~~~~~~~~~~ 100 (116)
.++..+|-++.+..|+
T Consensus 33 GLlLtLcllV~risc~ 48 (153)
T PF14851_consen 33 GLLLTLCLLVIRISCR 48 (153)
T ss_pred HHHHHHHHHHhhheee
Confidence 3445566777777774
No 102
>PRK13718 conjugal transfer protein TrbE; Provisional
Probab=30.63 E-value=1.5e+02 Score=19.06 Aligned_cols=19 Identities=16% Similarity=0.023 Sum_probs=7.9
Q ss_pred HHHHHHhhhhccccccccC
Q psy5669 89 RIGTSFAEFTRGKDDKKAT 107 (116)
Q Consensus 89 ~~~~~~~~~~~~~~~~k~~ 107 (116)
++|........+.+++|+.
T Consensus 61 y~ffs~Ltkl~~~d~~ks~ 79 (84)
T PRK13718 61 YFFFSALTKLQKHDERKSD 79 (84)
T ss_pred HHHHHHHHHHHhcccccch
Confidence 3333333334444444443
No 103
>cd03807 GT1_WbnK_like This family is most closely related to the GT1 family of glycosyltransferases. WbnK in Shigella dysenteriae has been shown to be involved in the type 7 O-antigen biosynthesis.
Probab=30.33 E-value=1.5e+02 Score=21.44 Aligned_cols=42 Identities=5% Similarity=-0.051 Sum_probs=23.4
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSI 43 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wi 43 (116)
+.+.++|.++++|+...+...+-+...-.+-.++...+-.+.
T Consensus 320 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~ 361 (365)
T cd03807 320 EALAEAIEALLADPALRQALGEAARERIEENFSIEAMVEAYE 361 (365)
T ss_pred HHHHHHHHHHHhChHHHHHHHHHHHHHHHHhCCHHHHHHHHH
Confidence 467888999998876554443333333333345555444443
No 104
>PF05148 Methyltransf_8: Hypothetical methyltransferase; InterPro: IPR007823 This family consists of uncharacterised eukaryotic proteins which are related to S-adenosyl-L-methionine-dependent methyltransferases.; GO: 0008168 methyltransferase activity; PDB: 2ZFU_B.
Probab=30.13 E-value=1.6e+02 Score=22.39 Aligned_cols=38 Identities=11% Similarity=0.247 Sum_probs=22.8
Q ss_pred HHHHHhcCHH-HHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669 7 KVQPACMSPQ-YKTNVQRLSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 7 aI~~vl~n~~-Yk~na~~ls~~~~d~p~~P~~~av~wiE 44 (116)
|.+-.-+||+ |..==.-+++....-|..|+|..+.|+.
T Consensus 30 A~~lf~~dP~~F~~YH~Gfr~Qv~~WP~nPvd~iI~~l~ 68 (219)
T PF05148_consen 30 ALKLFQEDPELFDIYHEGFRQQVKKWPVNPVDVIIEWLK 68 (219)
T ss_dssp HHHHHHH-HHHHHHHHHHHHHHHCTSSS-HHHHHHHHHC
T ss_pred HHHHHHhCHHHHHHHHHHHHHHHhcCCCCcHHHHHHHHH
Confidence 3344444553 3333334455667789999999999987
No 105
>PRK13608 diacylglycerol glucosyltransferase; Provisional
Probab=29.75 E-value=2.8e+02 Score=21.87 Aligned_cols=44 Identities=7% Similarity=-0.071 Sum_probs=31.0
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV 46 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v 46 (116)
+.+.++|.++++|+.-.+.+.+-+... .+|.+..+.+-.|.+.+
T Consensus 326 ~~l~~~i~~ll~~~~~~~~m~~~~~~~-~~~~s~~~i~~~l~~l~ 369 (391)
T PRK13608 326 EEAIKIVASLTNGNEQLTNMISTMEQD-KIKYATQTICRDLLDLI 369 (391)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHh-cCCCCHHHHHHHHHHHh
Confidence 457889999999987666555544443 45778877777776655
No 106
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=29.43 E-value=88 Score=19.46 Aligned_cols=11 Identities=27% Similarity=0.830 Sum_probs=5.9
Q ss_pred HHHHHHHHHHH
Q psy5669 64 WYQTSQLDVIA 74 (116)
Q Consensus 64 ~~q~~~lDV~~ 74 (116)
||+.+-+=+++
T Consensus 62 ~~~~~k~~~i~ 72 (89)
T PF00957_consen 62 WWRNYKLYIII 72 (89)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHHhH
Confidence 56666554443
No 107
>cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=29.35 E-value=78 Score=23.47 Aligned_cols=25 Identities=4% Similarity=0.141 Sum_probs=18.3
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLST 26 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~ 26 (116)
+++.++|.++++|++.++.+.+-+.
T Consensus 320 ~~~~~~i~~l~~~~~~~~~~~~~~~ 344 (357)
T cd03795 320 AALAEAIRRLLEDPELRERLGEAAR 344 (357)
T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHH
Confidence 4688999999999976665554433
No 108
>PRK13803 bifunctional phosphoribosylanthranilate isomerase/tryptophan synthase subunit beta; Provisional
Probab=28.64 E-value=65 Score=27.72 Aligned_cols=37 Identities=19% Similarity=0.362 Sum_probs=29.5
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA 39 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a 39 (116)
+.|.++..++.+||+|++....+-.-+..+| +|+-.+
T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~grp-TPL~~~ 277 (610)
T PRK13803 241 QELQESYTKIIKSNEFQKTFKRLLQNYAGRP-TPLTEA 277 (610)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHhCCCC-CcceeH
Confidence 5678888899999999999999988886666 555433
No 109
>PHA02657 hypothetical protein; Provisional
Probab=28.60 E-value=1e+02 Score=20.10 Aligned_cols=23 Identities=13% Similarity=0.184 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy5669 67 TSQLDVIASCTLGLGLVLYTLAR 89 (116)
Q Consensus 67 ~~~lDV~~~l~~v~~~~~~~~~~ 89 (116)
+.++=|+.++++++.++++++..
T Consensus 24 ~~~imVitvfv~vI~il~flLLY 46 (95)
T PHA02657 24 FESILVFTIFIFVVCILIYLLIY 46 (95)
T ss_pred chhhhHHHHHHHHHHHHHHHHHH
Confidence 44555554444444444444433
No 110
>PF10854 DUF2649: Protein of unknown function (DUF2649); InterPro: IPR021217 This entry is represented by Spiroplasma phage 1-C74, Orf10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members in this family of proteins are also annotated as Plectrovirus orf 10 transmembrane proteins however currently no function is known.
Probab=28.59 E-value=1.4e+02 Score=18.15 Aligned_cols=33 Identities=12% Similarity=0.050 Sum_probs=19.9
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy5669 60 TSLPWYQTSQLDVIASCTLGLGLVLYTLARIGT 92 (116)
Q Consensus 60 ~~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~ 92 (116)
...+-|--.++-++++++.+..+.+..+++++.
T Consensus 32 ltqneYlt~MiGiWiVilFLtWf~lwm~fKiv~ 64 (67)
T PF10854_consen 32 LTQNEYLTIMIGIWIVILFLTWFLLWMVFKIVG 64 (67)
T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344445555666776666666666666666654
No 111
>PRK05749 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed
Probab=28.33 E-value=1.9e+02 Score=22.90 Aligned_cols=43 Identities=14% Similarity=0.196 Sum_probs=26.9
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE 44 (116)
+.+.++|.++++||.-++.+.+-+..+-.+.....+..+..++
T Consensus 376 ~~La~~l~~ll~~~~~~~~m~~~a~~~~~~~~~~~~~~~~~l~ 418 (425)
T PRK05749 376 EDLAKAVTYLLTDPDARQAYGEAGVAFLKQNQGALQRTLQLLE 418 (425)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHHHHhCccHHHHHHHHHH
Confidence 4678899999999876665554444333333455566655554
No 112
>PF11157 DUF2937: Protein of unknown function (DUF2937); InterPro: IPR022584 This family of proteins with unknown function appears to be found mainly in Proteobacteria.
Probab=28.25 E-value=1e+02 Score=22.08 Aligned_cols=26 Identities=4% Similarity=0.221 Sum_probs=16.4
Q ss_pred HHHHHHHH--hcCHHHHHHHHHHHHHHh
Q psy5669 4 SRRKVQPA--CMSPQYKTNVQRLSTIFR 29 (116)
Q Consensus 4 l~~aI~~v--l~n~~Yk~na~~ls~~~~ 29 (116)
+..+|... -+||..++.+..+..++.
T Consensus 58 ~~~li~~~~~s~dp~~~~~a~~~~~~~~ 85 (167)
T PF11157_consen 58 REALIAHYRQSSDPVFRARAESMQATIE 85 (167)
T ss_pred HHHHHHHHHhCCCHHHHhhHHHHHHHHH
Confidence 34445544 267778888777777665
No 113
>PF04716 ETC_C1_NDUFA5: ETC complex I subunit conserved region; InterPro: IPR006806 This is a family of eukaryotic NADH-ubiquinone oxidoreductase subunits (1.6.5.3 from EC) (1.6.99.3 from EC) from complex I of the electron transport chain initially identified in Neurospora crassa as a 29.9 kDa protein. The conserved region is found at the N terminus of the member proteins [].; GO: 0016651 oxidoreductase activity, acting on NADH or NADPH, 0022904 respiratory electron transport chain, 0005743 mitochondrial inner membrane
Probab=28.18 E-value=68 Score=18.97 Aligned_cols=23 Identities=4% Similarity=0.199 Sum_probs=18.2
Q ss_pred hHHHHHHHHhcCHHHHHHHHHHH
Q psy5669 3 SSRRKVQPACMSPQYKTNVQRLS 25 (116)
Q Consensus 3 ~l~~aI~~vl~n~~Yk~na~~ls 25 (116)
.+.+++.++-+|..||+.+..+-
T Consensus 13 ~~L~~L~~~P~~a~YR~~tE~it 35 (57)
T PF04716_consen 13 KTLKALKKIPEDAAYRQYTEAIT 35 (57)
T ss_pred HHHHHHHhCCCccHHHHHHHHHH
Confidence 45677788888889999988764
No 114
>COG3036 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.84 E-value=76 Score=19.48 Aligned_cols=20 Identities=10% Similarity=0.358 Sum_probs=16.4
Q ss_pred HHHHHHhcCHHHHHHHHHHH
Q psy5669 6 RKVQPACMSPQYKTNVQRLS 25 (116)
Q Consensus 6 ~aI~~vl~n~~Yk~na~~ls 25 (116)
+|+..++.||-|+...++-+
T Consensus 15 NalkALv~~pLFR~RvEk~k 34 (66)
T COG3036 15 NALKALVHDPLFRQRVEKKK 34 (66)
T ss_pred hHHHHHHcCHHHHHHHHHhh
Confidence 68888999999998877743
No 115
>PRK13609 diacylglycerol glucosyltransferase; Provisional
Probab=27.66 E-value=1.7e+02 Score=22.65 Aligned_cols=43 Identities=5% Similarity=-0.142 Sum_probs=27.2
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+.+.++|.++++||..++.+.+-+..+. +|.+..+.+-.+.+.
T Consensus 326 ~~l~~~i~~ll~~~~~~~~m~~~~~~~~-~~~s~~~i~~~i~~~ 368 (380)
T PRK13609 326 EEVFAKTEALLQDDMKLLQMKEAMKSLY-LPEPADHIVDDILAE 368 (380)
T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHhC-CCchHHHHHHHHHHh
Confidence 4678899999999988777665444333 444544444444443
No 116
>TIGR03449 mycothiol_MshA UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species.
Probab=27.51 E-value=2.6e+02 Score=21.58 Aligned_cols=44 Identities=5% Similarity=-0.101 Sum_probs=27.9
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV 46 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v 46 (116)
+.+.++|.++++|+..++.+..-+.... +.++....+-.+.+..
T Consensus 356 ~~la~~i~~~l~~~~~~~~~~~~~~~~~-~~fsw~~~~~~~~~~y 399 (405)
T TIGR03449 356 ADWADALARLLDDPRTRIRMGAAAVEHA-AGFSWAATADGLLSSY 399 (405)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHH-HhCCHHHHHHHHHHHH
Confidence 4677889999988876655544443333 2367666666665544
No 117
>cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II).
Probab=27.31 E-value=1.2e+02 Score=23.57 Aligned_cols=25 Identities=12% Similarity=0.133 Sum_probs=17.3
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLST 26 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~ 26 (116)
+.+.++|.++++|+..++.+.+-+.
T Consensus 354 ~~la~~i~~ll~~~~~~~~l~~~ar 378 (396)
T cd03818 354 DALAAAVIELLDDPARRARLRRAAR 378 (396)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHH
Confidence 4678889999988876555544333
No 118
>PF04316 FlgM: Anti-sigma-28 factor, FlgM; InterPro: IPR007412 FlgM binds and inhibits the activity of the transcription factor sigma 28. Inhibition of sigma 28 prevents the expression of genes from flagellar transcriptional class 3, which include genes for the filament and chemotaxis. Correctly assembled basal body-hook structures export FlgM, relieving inhibition of sigma 28 and allowing expression of class 3 genes. NMR studies show that free FlgM is mostly unfolded, which may facilitate its export. The C-terminal half of FlgM adopts a tertiary structure when it binds to sigma 28. All mutations in FlgM that prevent sigma 28 inhibition affect the C-terminal domain and is the region thought to constitute the binding domain. A minimal binding domain has been identified between Glu 64 and Arg 88 in Salmonella typhimurium (P26477 from SWISSPROT).The N-terminal portion remains unstructured and may be necessary for recognition by the export machinery [].; GO: 0045892 negative regulation of transcription, DNA-dependent, 0019861 flagellum; PDB: 1RP3_B 1SC5_B.
Probab=26.95 E-value=72 Score=18.47 Aligned_cols=25 Identities=16% Similarity=0.315 Sum_probs=15.2
Q ss_pred HHHHHHHhcCHHHHHHHHHHHHHHh
Q psy5669 5 RRKVQPACMSPQYKTNVQRLSTIFR 29 (116)
Q Consensus 5 ~~aI~~vl~n~~Yk~na~~ls~~~~ 29 (116)
...|++-+.+.+|+-++.+++.-+.
T Consensus 30 V~~ik~~I~~G~Y~vd~~~iA~~ml 54 (57)
T PF04316_consen 30 VAEIKAAIASGTYKVDAEKIAEKML 54 (57)
T ss_dssp HHHHHHHHHTT-----HHHHHHHHH
T ss_pred HHHHHHHHHcCCCCCCHHHHHHHHH
Confidence 4678888889999999888877554
No 119
>TIGR02803 ExbD_1 TonB system transport protein ExbD, group 1. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=26.81 E-value=62 Score=21.47 Aligned_cols=14 Identities=14% Similarity=0.107 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGL 82 (116)
Q Consensus 69 ~lDV~~~l~~v~~~ 82 (116)
++||.+++++++++
T Consensus 9 lIDVvflLLiFFmv 22 (122)
T TIGR02803 9 FIDVMLVLLIIFMV 22 (122)
T ss_pred HHHHHHHHHHHHHH
Confidence 68998777666543
No 120
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=26.71 E-value=1.1e+02 Score=27.47 Aligned_cols=8 Identities=0% Similarity=-0.147 Sum_probs=3.7
Q ss_pred HHhhhhcc
Q psy5669 93 SFAEFTRG 100 (116)
Q Consensus 93 ~~~~~~~~ 100 (116)
+.||+.|.
T Consensus 295 ~yCrrkc~ 302 (807)
T PF10577_consen 295 CYCRRKCL 302 (807)
T ss_pred HhhhcccC
Confidence 33555444
No 121
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=26.37 E-value=1e+02 Score=23.95 Aligned_cols=35 Identities=9% Similarity=0.129 Sum_probs=31.5
Q ss_pred CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCH
Q psy5669 1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHP 35 (116)
Q Consensus 1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P 35 (116)
|.++...|.++|.+.+-++...-+...++.+|-+.
T Consensus 1 ~~Tl~~t~seLL~~~sL~dai~~a~~qVkakPtda 35 (273)
T COG4455 1 MMTLRDTISELLDDNSLQDAIGLARDQVKAKPTDA 35 (273)
T ss_pred CCchHHHHHHHHHhccHHHHHHHHHHHHhcCCccc
Confidence 67889999999999999999999999999998653
No 122
>cd03792 GT1_Trehalose_phosphorylase Trehalose phosphorylase (TP) reversibly catalyzes trehalose synthesis and degradation from alpha-glucose-1-phosphate (alpha-Glc-1-P) and glucose. The catalyzing activity includes the phosphorolysis of trehalose, which produce alpha-Glc-1-P and glucose, and the subsequent synthesis of trehalose. This family is most closely related to the GT1 family of glycosyltransferases.
Probab=26.31 E-value=2.3e+02 Score=21.68 Aligned_cols=45 Identities=4% Similarity=0.006 Sum_probs=29.0
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYV 46 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v 46 (116)
+.+..+|.++++|++.++.+.+-+.....+-++....+..|.+..
T Consensus 325 ~~~a~~i~~ll~~~~~~~~~~~~a~~~~~~~~s~~~~~~~~~~~~ 369 (372)
T cd03792 325 EEAAVRILYLLRDPELRRKMGANAREHVRENFLITRHLKDYLYLI 369 (372)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHHH
Confidence 456678889998988776655544444344566666666666543
No 123
>KOG4482|consensus
Probab=26.20 E-value=1.7e+02 Score=24.35 Aligned_cols=7 Identities=14% Similarity=0.150 Sum_probs=3.3
Q ss_pred ccCCCcc
Q psy5669 105 KATKPDR 111 (116)
Q Consensus 105 k~~~~~~ 111 (116)
-.++|++
T Consensus 330 d~~ts~~ 336 (449)
T KOG4482|consen 330 DDKTSDI 336 (449)
T ss_pred cccchhh
Confidence 3345554
No 124
>PF06917 Pectate_lyase_2: Periplasmic pectate lyase; InterPro: IPR010702 This family consists of several Enterobacterial periplasmic pectate lyase proteins. A major virulence determinant of the plant-pathogenic enterobacterium Erwinia chrysanthemi is the production of pectate lyase enzymes that degrade plant cell walls [].; GO: 0016837 carbon-oxygen lyase activity, acting on polysaccharides, 0045490 pectin catabolic process, 0005737 cytoplasm; PDB: 2V8J_A 2V8K_A 2V8I_A.
Probab=26.13 E-value=93 Score=26.68 Aligned_cols=38 Identities=11% Similarity=0.085 Sum_probs=25.2
Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669 4 SRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS 42 (116)
Q Consensus 4 l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~w 42 (116)
..-++..|.+||+|++.|....+-+-|.-.++. -..+|
T Consensus 77 ~L~~LS~lTgd~~Y~q~A~~~~~yfldh~~D~~-GL~~W 114 (557)
T PF06917_consen 77 TLVGLSNLTGDPKYQQRAQEIIRYFLDHFVDDS-GLFYW 114 (557)
T ss_dssp HHHHHHHHH--THHHHHHHHHHHHHHHHSB-TT-S-BS-
T ss_pred HHHHHHhccCCHHHHHHHHHHHHHHHHhccCCC-cceee
Confidence 356788999999999999988888877755544 34444
No 125
>COG0848 ExbD Biopolymer transport protein [Intracellular trafficking and secretion]
Probab=25.97 E-value=66 Score=22.33 Aligned_cols=16 Identities=13% Similarity=0.090 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGLVL 84 (116)
Q Consensus 69 ~lDV~~~l~~v~~~~~ 84 (116)
.+||.+++++++++..
T Consensus 21 lIDVmLVLLiiFmvta 36 (137)
T COG0848 21 LIDVMLVLLIIFMVTA 36 (137)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 6899988877766443
No 126
>PF08388 GIIM: Group II intron, maturase-specific domain; InterPro: IPR013597 This region is found mainly in various bacterial and archaeal species, but a few members of this family are expressed by fungal and chlamydomonal species. It has been implicated in the binding of intron RNA during reverse transcription and splicing [].
Probab=25.88 E-value=1.2e+02 Score=18.08 Aligned_cols=9 Identities=11% Similarity=0.434 Sum_probs=5.3
Q ss_pred CHHHHHHHH
Q psy5669 63 PWYQTSQLD 71 (116)
Q Consensus 63 ~~~q~~~lD 71 (116)
+|..||..-
T Consensus 34 GW~nYy~~~ 42 (80)
T PF08388_consen 34 GWANYYRIG 42 (80)
T ss_pred HHHHhhcch
Confidence 366666653
No 127
>TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase. RL J Bacteriol 1993 Mar;175(6):1841-3
Probab=25.42 E-value=1.1e+02 Score=23.12 Aligned_cols=26 Identities=4% Similarity=-0.006 Sum_probs=19.9
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTI 27 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~ 27 (116)
+.+.++|.++++||++++.+.+-...
T Consensus 309 ~~l~~~i~~ll~~~~~~~~~~~~~~~ 334 (348)
T TIGR01133 309 EKLLEALLKLLLDPANLEAMAEAARK 334 (348)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHHHHh
Confidence 46889999999999887766654443
No 128
>PTZ00443 Thioredoxin domain-containing protein; Provisional
Probab=24.69 E-value=1.6e+02 Score=22.04 Aligned_cols=6 Identities=33% Similarity=0.595 Sum_probs=2.5
Q ss_pred ccCCCc
Q psy5669 105 KATKPD 110 (116)
Q Consensus 105 k~~~~~ 110 (116)
-+++|.
T Consensus 211 ~~~~~~ 216 (224)
T PTZ00443 211 ANKKPK 216 (224)
T ss_pred cccCCc
Confidence 334444
No 129
>PHA02902 putative IMV membrane protein; Provisional
Probab=24.68 E-value=1.6e+02 Score=18.20 Aligned_cols=11 Identities=9% Similarity=0.153 Sum_probs=5.1
Q ss_pred HHHHHHHHHHH
Q psy5669 70 LDVIASCTLGL 80 (116)
Q Consensus 70 lDV~~~l~~v~ 80 (116)
+|.+.++.+++
T Consensus 3 ~dtfvi~~v~v 13 (70)
T PHA02902 3 IDTFVILAVIV 13 (70)
T ss_pred hhhHHHHHHHH
Confidence 45554444333
No 130
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=24.65 E-value=58 Score=21.48 Aligned_cols=31 Identities=13% Similarity=0.056 Sum_probs=19.2
Q ss_pred hcCHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy5669 12 CMSPQYKTNVQRLSTIFRSEPLHPLQKAIRS 42 (116)
Q Consensus 12 l~n~~Yk~na~~ls~~~~d~p~~P~~~av~w 42 (116)
|.|+.+.+.|+.--..|...|..|.-.++-|
T Consensus 41 l~de~~~~~a~~el~~Fl~nP~~~rYqaASW 71 (101)
T PF12122_consen 41 LHDEEHLEQAEQELEEFLQNPNDPRYQAASW 71 (101)
T ss_dssp ES-GGGHHHHHHHHHHHHHS-SS--------
T ss_pred EeCHHHHHHHHHHHHHHHHCCCCHHHHHHHH
Confidence 4688999999999999999999998888777
No 131
>PLN02618 tryptophan synthase, beta chain
Probab=24.56 E-value=98 Score=25.43 Aligned_cols=43 Identities=14% Similarity=0.193 Sum_probs=28.6
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+.|.++..++.+||+|++....+-+-+--+| +|+-.+-...+.
T Consensus 36 ~~~~~~~~~~~~~~~f~~~~~~~l~~~vGr~-TPL~~~~~Ls~~ 78 (410)
T PLN02618 36 SELEAAFNALATDPEFQEELAGILKDYVGRE-TPLYFAERLTEH 78 (410)
T ss_pred HHHHHHHHHHhcChhhHHHHHHHHHHhcCCC-CceeEhhhHHHH
Confidence 5788899999999999998777655553233 445444444444
No 132
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=24.28 E-value=1.3e+02 Score=19.26 Aligned_cols=16 Identities=25% Similarity=0.513 Sum_probs=4.9
Q ss_pred HHHHHHHHHHHHHHHH
Q psy5669 69 QLDVIASCTLGLGLVL 84 (116)
Q Consensus 69 ~lDV~~~l~~v~~~~~ 84 (116)
.+|+++++.+++++++
T Consensus 3 ~l~i~~iialiv~~ii 18 (81)
T PF00558_consen 3 SLEILAIIALIVALII 18 (81)
T ss_dssp -----HHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHH
Confidence 4555554444444333
No 133
>PF06637 PV-1: PV-1 protein (PLVAP); InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=24.02 E-value=2.5e+02 Score=23.31 Aligned_cols=13 Identities=15% Similarity=0.256 Sum_probs=6.7
Q ss_pred CCCCCcccCCCCC
Q psy5669 51 GAPHLKTKATSLP 63 (116)
Q Consensus 51 g~~~l~~~~~~l~ 63 (116)
|.++.|..+..-+
T Consensus 8 ~~~y~R~G~~~kg 20 (442)
T PF06637_consen 8 GGPYSRAGGKGKG 20 (442)
T ss_pred CCcccccCCCCCC
Confidence 3456666555444
No 134
>PF02723 NS3_envE: Non-structural protein NS3/Small envelope protein E; InterPro: IPR003873 This is a family of small nonstructural proteins, well conserved among Coronavirus strains. This protein is also found in Murine hepatitis virus as small envelope protein E.; GO: 0016020 membrane
Probab=23.85 E-value=2e+02 Score=18.44 Aligned_cols=10 Identities=0% Similarity=0.066 Sum_probs=4.0
Q ss_pred HHHHHHhhhh
Q psy5669 89 RIGTSFAEFT 98 (116)
Q Consensus 89 ~~~~~~~~~~ 98 (116)
++|..+|+.+
T Consensus 38 qLC~~cc~~~ 47 (82)
T PF02723_consen 38 QLCFQCCRLC 47 (82)
T ss_pred HHHHHHhhhh
Confidence 3444344433
No 135
>PF08390 TRAM1: TRAM1-like protein; InterPro: IPR013599 This family comprises sequences that are similar to human TRAM1 (Q15629 from SWISSPROT). This is a transmembrane protein of the endoplasmic reticulum, thought to be involved in the membrane transfer of secretory proteins []. The region featured in this family is found N-terminal to the longevity-assurance protein region (IPR005547 from INTERPRO).
Probab=23.82 E-value=1.7e+02 Score=17.45 Aligned_cols=21 Identities=10% Similarity=-0.080 Sum_probs=11.1
Q ss_pred CHHHHHHHHHHHHHHHHHHHH
Q psy5669 63 PWYQTSQLDVIASCTLGLGLV 83 (116)
Q Consensus 63 ~~~q~~~lDV~~~l~~v~~~~ 83 (116)
+.|..=.-|++++++-+++++
T Consensus 20 ~~Y~~G~~D~~fV~fy~i~~t 40 (65)
T PF08390_consen 20 GLYGKGWDDLYFVFFYIIVFT 40 (65)
T ss_pred CccccCcchHHHHHHHHHHHH
Confidence 334444567776665444433
No 136
>TIGR02149 glgA_Coryne glycogen synthase, Corynebacterium family. This model describes Corynebacterium glutamicum GlgA and closely related proteins in several other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related TIGR02095 family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor.
Probab=23.64 E-value=3.2e+02 Score=20.76 Aligned_cols=44 Identities=9% Similarity=0.028 Sum_probs=26.8
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+.+.++|.++++|+.-++.+..-+...-.+.++....+..+.+.
T Consensus 340 ~~l~~~i~~l~~~~~~~~~~~~~a~~~~~~~~s~~~~~~~~~~~ 383 (388)
T TIGR02149 340 AELAKAINILLADPELAKKMGIAGRKRAEEEFSWGSIAKKTVEM 383 (388)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 56888999999888755544443333333446666666555554
No 137
>PRK04125 murein hydrolase regulator LrgA; Provisional
Probab=23.59 E-value=2e+02 Score=20.23 Aligned_cols=11 Identities=0% Similarity=-0.176 Sum_probs=4.4
Q ss_pred HHHHHHHHHhh
Q psy5669 86 TLARIGTSFAE 96 (116)
Q Consensus 86 ~~~~~~~~~~~ 96 (116)
++..+...+.+
T Consensus 108 vtg~v~~~l~r 118 (141)
T PRK04125 108 CTGLFSQFILG 118 (141)
T ss_pred HHHHHHHHHHH
Confidence 33344444443
No 138
>PF00988 CPSase_sm_chain: Carbamoyl-phosphate synthase small chain, CPSase domain; InterPro: IPR002474 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates [, ]. CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate []. The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain []. CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites []. The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein []. The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP []. There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia []. CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains []. This entry represents the N-terminal domain of the small subunit of carbamoyl phosphate synthase. The small subunit has a 3-layer beta/beta/alpha structure, and is thought to be mobile in most proteins that carry it. The C-terminal domain of the small subunit of CPSase has glutamine amidotransferase activity.; GO: 0006807 nitrogen compound metabolic process; PDB: 1CE8_B 1KEE_B 1CS0_D 1T36_D 1M6V_H 1A9X_F 1JDB_I 1BXR_F 1C3O_B 1C30_F ....
Probab=23.51 E-value=34 Score=23.85 Aligned_cols=11 Identities=27% Similarity=0.422 Sum_probs=7.4
Q ss_pred HHHhcCHHHHH
Q psy5669 9 QPACMSPQYKT 19 (116)
Q Consensus 9 ~~vl~n~~Yk~ 19 (116)
+|+|+||||.-
T Consensus 38 qE~lTDPSY~g 48 (131)
T PF00988_consen 38 QEILTDPSYAG 48 (131)
T ss_dssp HHHHT-GGGBT
T ss_pred chhhcCCcCCc
Confidence 57788888864
No 139
>PF15179 Myc_target_1: Myc target protein 1
Probab=23.48 E-value=1.9e+02 Score=21.56 Aligned_cols=21 Identities=29% Similarity=0.406 Sum_probs=10.0
Q ss_pred CCCHHHHHHH-HHH-HHHHHHHH
Q psy5669 61 SLPWYQTSQL-DVI-ASCTLGLG 81 (116)
Q Consensus 61 ~l~~~q~~~l-DV~-~~l~~v~~ 81 (116)
.+.|.+-+++ |++ +|.+++++
T Consensus 9 ~~~~~~~f~~~~lIlaF~vSm~i 31 (197)
T PF15179_consen 9 LLEWLENFDWEDLILAFCVSMAI 31 (197)
T ss_pred cccchhhcchhhHHHHHHHHHHH
Confidence 3556666644 444 34444433
No 140
>cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide.
Probab=23.47 E-value=1.8e+02 Score=21.32 Aligned_cols=37 Identities=5% Similarity=0.027 Sum_probs=24.2
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA 39 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a 39 (116)
+.+.++|.++++|+..++.+.+-+.... +..+....+
T Consensus 324 ~~~~~~i~~l~~~~~~~~~~~~~~~~~~-~~~sw~~~~ 360 (365)
T cd03809 324 EALAAAIERLLEDPALREELRERGLARA-KRFSWEKTA 360 (365)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHHHH-HhCCHHHHH
Confidence 4678899999999988777666554333 234544443
No 141
>TIGR00263 trpB tryptophan synthase, beta subunit. Tryptophan synthase catalyzes the last step in the biosynthesis of tryptophan. the beta chain contains the functional domain for or the synthesis of tryptophan from indole and serine. The enzyme requires pyridoxal-phosphate as a cofactor. The pyridoxal-P attachment site is contained within the conserved region [LIVM]-x-H-x-G-[STA]-H-K-x-N] [K is the pyridoxal-P attachment site] which is present between residues 90-100 of the model.
Probab=23.16 E-value=89 Score=25.13 Aligned_cols=38 Identities=18% Similarity=0.324 Sum_probs=28.0
Q ss_pred CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHH
Q psy5669 1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKA 39 (116)
Q Consensus 1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~a 39 (116)
++.|..+...++.|++|.+....+..-+..+| +|+..+
T Consensus 19 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-TPL~~~ 56 (385)
T TIGR00263 19 LEELEAAFEDAKADPAFWAELNELLRNYAGRP-TPLTFA 56 (385)
T ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHHhCCCC-CCceeh
Confidence 36788888999999999999877777675444 455443
No 142
>PHA02851 EEV glycoprotein; Provisional
Probab=23.15 E-value=79 Score=23.05 Aligned_cols=15 Identities=13% Similarity=0.020 Sum_probs=8.0
Q ss_pred HHHHHHHhhhhcccc
Q psy5669 88 ARIGTSFAEFTRGKD 102 (116)
Q Consensus 88 ~~~~~~~~~~~~~~~ 102 (116)
+.+|+.-|+.++.++
T Consensus 20 llIcKk~cKKFFkKK 34 (223)
T PHA02851 20 FLINKKYCKKFFKKK 34 (223)
T ss_pred HHHhHHHHHHHHhcC
Confidence 445555555555443
No 143
>TIGR00215 lpxB lipid-A-disaccharide synthase. Lipid-A precursor biosynthesis producing lipid A disaccharide in a condensation reaction. transcribed as part of an operon including lpxA
Probab=23.09 E-value=1.8e+02 Score=23.15 Aligned_cols=40 Identities=5% Similarity=-0.068 Sum_probs=26.0
Q ss_pred chHHHHHHHHhcCH----HHHHHHHHHHHHHhcCCC--CHHHHHHH
Q psy5669 2 RSSRRKVQPACMSP----QYKTNVQRLSTIFRSEPL--HPLQKAIR 41 (116)
Q Consensus 2 ~~l~~aI~~vl~n~----~Yk~na~~ls~~~~d~p~--~P~~~av~ 41 (116)
+.|.+++.++++|+ ++++.+..--..++++-- ++.++++.
T Consensus 335 ~~l~~~~~~ll~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~a~ 380 (385)
T TIGR00215 335 HPLAIALLLLLENGLKAYKEMHRERQFFEELRQRIYCNADSERAAQ 380 (385)
T ss_pred HHHHHHHHHHhcCCcccHHHHHHHHHHHHHHHHHhcCCCHHHHHHH
Confidence 46889999999999 777666555554544332 34455554
No 144
>PLN02555 limonoid glucosyltransferase
Probab=22.90 E-value=1.1e+02 Score=25.53 Aligned_cols=29 Identities=10% Similarity=0.064 Sum_probs=23.4
Q ss_pred chHHHHHHHHhcCH---HHHHHHHHHHHHHhc
Q psy5669 2 RSSRRKVQPACMSP---QYKTNVQRLSTIFRS 30 (116)
Q Consensus 2 ~~l~~aI~~vl~n~---~Yk~na~~ls~~~~d 30 (116)
+++..+|+++++++ ..|+||++++...+.
T Consensus 417 ~~v~~~v~~vm~~~~g~~~r~ra~~l~~~a~~ 448 (480)
T PLN02555 417 EEVAECLLEATVGEKAAELKQNALKWKEEAEA 448 (480)
T ss_pred HHHHHHHHHHhcCchHHHHHHHHHHHHHHHHH
Confidence 46889999999764 699999999887553
No 145
>PF07782 DC_STAMP: DC-STAMP-like protein; InterPro: IPR012858 This group of sequences is similar to a region of the dendritic cell-specific transmembrane protein (DC-STAMP, Q9H295 from SWISSPROT). This is thought to be a novel receptor protein that shares no identity with other multimembrane-spanning proteins []. It is thought to have seven putative transmembrane regions [], two of which are found in the region featured in this family. DC-STAMP is also described as having potential N-linked glycosylation sites and a potential phosphorylation site for PKC [], but these are not conserved. ; GO: 0016021 integral to membrane
Probab=22.76 E-value=2.5e+02 Score=20.18 Aligned_cols=45 Identities=11% Similarity=-0.034 Sum_probs=23.4
Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy5669 61 SLPWYQTSQLDVIASCTLGLGLVLYTLARIGTSFAEFTRGKDDKK 105 (116)
Q Consensus 61 ~l~~~q~~~lDV~~~l~~v~~~~~~~~~~~~~~~~~~~~~~~~~k 105 (116)
..++..|..+=++..++.+++++=..+.++.+.+|..++-.++++
T Consensus 141 ~p~~~~~~~i~~l~~l~~ll~~le~Y~~RLR~~I~a~fyP~re~~ 185 (191)
T PF07782_consen 141 PPDYSVYIQIGLLYLLLWLLVLLEPYALRLRRVICASFYPEREKE 185 (191)
T ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhChHHHHH
Confidence 445555555444333333322222335677777787777666553
No 146
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=22.70 E-value=57 Score=28.67 Aligned_cols=9 Identities=11% Similarity=0.006 Sum_probs=3.5
Q ss_pred HHHHHHHHH
Q psy5669 73 IASCTLGLG 81 (116)
Q Consensus 73 ~~~l~~v~~ 81 (116)
++.++++++
T Consensus 276 lvPv~vV~~ 284 (684)
T PF12877_consen 276 LVPVLVVLL 284 (684)
T ss_pred hHHHHHHHH
Confidence 333433333
No 147
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=22.26 E-value=71 Score=18.58 Aligned_cols=39 Identities=3% Similarity=0.087 Sum_probs=27.7
Q ss_pred CchHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy5669 1 MRSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAH 49 (116)
Q Consensus 1 ~~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~ 49 (116)
|+.+.+.|+..+.++ ...+.-.|++|.++++-+ +.+..+
T Consensus 1 ~~~i~~~i~~F~~~~---------~~~~~fppm~~~~R~~vH-~lA~~~ 39 (58)
T cd02646 1 IEDIKDEIEAFLLDS---------RDSLSFPPMDKHGRKTIH-KLANCY 39 (58)
T ss_pred ChHHHHHHHHHHhCC---------CceEecCCCCHHHHHHHH-HHHHHc
Confidence 567778888887666 345677899999988765 455444
No 148
>KOG3814|consensus
Probab=21.70 E-value=2.1e+02 Score=24.01 Aligned_cols=37 Identities=19% Similarity=0.460 Sum_probs=30.4
Q ss_pred HHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCC
Q psy5669 15 PQYKTNVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRG 51 (116)
Q Consensus 15 ~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g 51 (116)
|+....+++--..-+.||..|.+....++|+.+.|+-
T Consensus 406 ~smaRalqKYLrtTrqQ~rHt~esil~Hle~Clt~~l 442 (531)
T KOG3814|consen 406 PSMARALQKYLRTTRQQPRHTMESILQHLEFCLTHDL 442 (531)
T ss_pred HHHHHHHHHHHHHhhcCCCCcHHHHHHHHHHHHhcCC
Confidence 4555566666677789999999999999999999974
No 149
>TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate.
Probab=21.49 E-value=3.4e+02 Score=20.63 Aligned_cols=44 Identities=9% Similarity=-0.002 Sum_probs=25.2
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIEY 45 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE~ 45 (116)
+++.++|.++++|+..++.+.+-+.-.-.+.++....+-.|.+.
T Consensus 326 ~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~~~~ 369 (374)
T TIGR03088 326 VALARALQPYVSDPAARRAHGAAGRARAEQQFSINAMVAAYAGL 369 (374)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 46778888888888655444333332333455655555555543
No 150
>cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis.
Probab=21.38 E-value=2.9e+02 Score=20.45 Aligned_cols=19 Identities=5% Similarity=0.067 Sum_probs=12.8
Q ss_pred chHHHHHHHHhcCHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTN 20 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~n 20 (116)
+++.++|.++++|+...+.
T Consensus 318 ~~~~~~l~~l~~~~~~~~~ 336 (365)
T cd03825 318 EDLAEGIEWLLADPDEREE 336 (365)
T ss_pred HHHHHHHHHHHhCHHHHHH
Confidence 3567788888888764433
No 151
>PHA03054 IMV membrane protein; Provisional
Probab=21.31 E-value=2.2e+02 Score=17.79 Aligned_cols=25 Identities=20% Similarity=0.371 Sum_probs=16.1
Q ss_pred HHHhcCCCCHHHHHHHHHHHHHHhC
Q psy5669 26 TIFRSEPLHPLQKAIRSIEYVIAHR 50 (116)
Q Consensus 26 ~~~~d~p~~P~~~av~wiE~v~r~~ 50 (116)
..|...|-+..+..+.-+.-|+.-.
T Consensus 10 GvF~ss~d~Df~~Fi~vV~sVl~dk 34 (72)
T PHA03054 10 GVFMGSPEDDLTDFIEIVKSVLSDE 34 (72)
T ss_pred HHhhCCchHHHHHHHHHHHHHHcCC
Confidence 3455566666777777777777664
No 152
>cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have
Probab=21.15 E-value=2.8e+02 Score=20.50 Aligned_cols=42 Identities=7% Similarity=-0.068 Sum_probs=26.1
Q ss_pred chHHHHHHHHh-cCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669 2 RSSRRKVQPAC-MSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 2 ~~l~~aI~~vl-~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE 44 (116)
+++.++|.+++ .++.+++.+... .-...+.++....+-.|.+
T Consensus 314 ~~~~~~i~~ll~~~~~~~~~~~~~-~~~~~~~~s~~~~~~~~~~ 356 (360)
T cd04951 314 EALANKIDEILKMSGEERDIIGAR-RERIVKKFSINSIVQQWLT 356 (360)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHH-HHHHHHhcCHHHHHHHHHH
Confidence 45778888888 567777766655 3333355666665555544
No 153
>PF02526 GBP_repeat: Glycophorin-binding protein; InterPro: IPR003681 The glycophorin-binding protein contains a tandem repeat. The repeated sequence determines the binding domain for an erythrocyte receptor binding protein of Plasmodium falciparum, the malarial parasite []. Erythrocyte invasion by the malarial merozoite is a receptor-mediated process, an obligatory step in the development of the parasite. The P. falciparum protein binds to the erythrocyte receptor glycophorin.
Probab=21.02 E-value=1.6e+02 Score=15.90 Aligned_cols=24 Identities=17% Similarity=0.443 Sum_probs=19.4
Q ss_pred HHHHhcCHHHHHHHHHHHHHHhcC
Q psy5669 8 VQPACMSPQYKTNVQRLSTIFRSE 31 (116)
Q Consensus 8 I~~vl~n~~Yk~na~~ls~~~~d~ 31 (116)
+++-..||.|++.+.-+-+++.+.
T Consensus 10 mk~yaadpeyrkh~~v~yqil~~t 33 (38)
T PF02526_consen 10 MKAYAADPEYRKHLNVLYQILTNT 33 (38)
T ss_pred HHHHhcCHHHHHHHHHHHHHHccC
Confidence 455668999999999998888754
No 154
>PF04659 Arch_fla_DE: Archaeal flagella protein ; InterPro: IPR006752 Archaeal flagella are unique motility structures, and the absence of bacterial structural motility genes in the complete genome sequences of flagellated archaeal species has always suggested that archaeal flagellar biogenesis is likely mediated by novel components. FlaD and FlaE, are present in the cell as membrane-associated proteins but are not major components of isolated flagellar filaments. Interestingly, flaD was found to encode two proteins, each translated from a separate ribosome binding site. This group of sequences contain the archaeal flaD and flaE proteins. The conserved region that defines these sequences is found in the N-teminal region of flaE but towards the C-terminal region of flaD [].; GO: 0001539 ciliary or flagellar motility
Probab=20.99 E-value=77 Score=20.93 Aligned_cols=19 Identities=11% Similarity=0.228 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHhCCCCC
Q psy5669 36 LQKAIRSIEYVIAHRGAPH 54 (116)
Q Consensus 36 ~~~av~wiE~v~r~~g~~~ 54 (116)
.-.+..|+||++...|...
T Consensus 18 ~~~~~eWLefLve~~G~~~ 36 (99)
T PF04659_consen 18 EIVVFEWLEFLVERVGHNN 36 (99)
T ss_pred HHHHHHHHHHHHHHccccc
Confidence 4467899999999988654
No 155
>PRK10591 hypothetical protein; Provisional
Probab=20.96 E-value=2.6e+02 Score=18.37 Aligned_cols=12 Identities=17% Similarity=0.349 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHh
Q psy5669 84 LYTLARIGTSFA 95 (116)
Q Consensus 84 ~~~~~~~~~~~~ 95 (116)
+.+++++.+.+.
T Consensus 61 v~ivWR~a~~la 72 (92)
T PRK10591 61 VVIIWRVAKGLA 72 (92)
T ss_pred HHHHHHHHHHhc
Confidence 345566655443
No 156
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=20.85 E-value=2.3e+02 Score=17.76 Aligned_cols=21 Identities=5% Similarity=-0.075 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy5669 71 DVIASCTLGLGLVLYTLARIG 91 (116)
Q Consensus 71 DV~~~l~~v~~~~~~~~~~~~ 91 (116)
+++++.++++++++..++.++
T Consensus 4 ~fl~~plivf~ifVap~WL~l 24 (75)
T PF06667_consen 4 EFLFVPLIVFMIFVAPIWLIL 24 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 444555555554544444443
No 157
>PF12834 Phage_int_SAM_2: Phage integrase, N-terminal; InterPro: IPR024457 Phage integrase proteins cleave DNA substrates by a series of staggered cuts, during which the protein becomes covalently linked to the DNA through a catalytic tyrosine residue at the carboxy end of the alignment [, ]. This entry represents the N-terminal predicted alpha-helical domain found in a family of putative prophage DNA-binding integrases.
Probab=20.83 E-value=2.4e+02 Score=17.97 Aligned_cols=49 Identities=16% Similarity=0.215 Sum_probs=33.2
Q ss_pred CchHHHHHHHHhc--CHHHHHHHHHHH------HHH---hcCCCCHHHHHHHHHHHHHHh
Q psy5669 1 MRSSRRKVQPACM--SPQYKTNVQRLS------TIF---RSEPLHPLQKAIRSIEYVIAH 49 (116)
Q Consensus 1 ~~~l~~aI~~vl~--n~~Yk~na~~ls------~~~---~d~p~~P~~~av~wiE~v~r~ 49 (116)
|+.|...+.++.. +.||++.+.+.. .-+ .-++.+|...-..+||..+++
T Consensus 1 m~~L~~~l~~lar~~~gS~~T~~~R~~~~~~~~~~L~~~g~~i~~~~~lk~kHI~~lv~~ 60 (91)
T PF12834_consen 1 MSKLRYQLKTLARQAGGSFKTQADRMRILKRFARFLRELGFQIRSIRNLKPKHIEALVQH 60 (91)
T ss_pred ChHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHCCCCcCCHHHhhHHHHHHHHHH
Confidence 7778888877774 346666554432 222 356778888888888888777
No 158
>PRK10635 bacterioferritin; Provisional
Probab=20.70 E-value=2.9e+02 Score=19.42 Aligned_cols=55 Identities=16% Similarity=0.222 Sum_probs=37.5
Q ss_pred hHHHHHHHHhcCH-----HHHHHHHHH--------HHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcc
Q psy5669 3 SSRRKVQPACMSP-----QYKTNVQRL--------STIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKT 57 (116)
Q Consensus 3 ~l~~aI~~vl~n~-----~Yk~na~~l--------s~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~ 57 (116)
.+.+.+++.|.++ .|...+.-+ +..+..+-..-++.|..-+|+++..||.|.+.+
T Consensus 6 ~vi~~LN~~L~~El~Ai~QY~~ha~~~~~~G~~~la~~~~~ea~eEm~HA~~l~eRIl~LgG~P~~~~ 73 (158)
T PRK10635 6 KIINYLNKLLGNELVAINQYFLHARMFKNWGLMRLNDVEYHESIDEMKHADKYIERILFLEGIPNLQD 73 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCC
Confidence 4566666666555 344444333 445555555678889999999999999988765
No 159
>PRK10307 putative glycosyl transferase; Provisional
Probab=20.47 E-value=3.2e+02 Score=21.35 Aligned_cols=43 Identities=5% Similarity=0.038 Sum_probs=25.3
Q ss_pred chHHHHHHHHhcCHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy5669 2 RSSRRKVQPACMSPQYKTNVQRLSTIFRSEPLHPLQKAIRSIE 44 (116)
Q Consensus 2 ~~l~~aI~~vl~n~~Yk~na~~ls~~~~d~p~~P~~~av~wiE 44 (116)
+++.++|.++++|+.-++.+.+-+...-.+.++....+-.+.+
T Consensus 361 ~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~~~ 403 (412)
T PRK10307 361 EALVAAIAALARQALLRPKLGTVAREYAERTLDKENVLRQFIA 403 (412)
T ss_pred HHHHHHHHHHHhCHHHHHHHHHHHHHHHHHHcCHHHHHHHHHH
Confidence 4677888888888866655555444444444554444444433
No 160
>cd01056 Euk_Ferritin eukaryotic ferritins. Eukaryotic Ferritin (Euk_Ferritin) domain. Ferritins are the primary iron storage proteins of most living organisms and members of a broad superfamily of ferritin-like diiron-carboxylate proteins. The iron-free (apoferritin) ferritin molecule is a protein shell composed of 24 protein chains arranged in 432 symmetry. Iron storage involves the uptake of iron (II) at the protein shell, its oxidation by molecular oxygen at the dinuclear ferroxidase centers, and the movement of iron (III) into the cavity for deposition as ferrihydrite; the protein shell can hold up to 4500 iron atoms. In vertebrates, two types of chains (subunits) have been characterized, H or M (fast) and L (slow), which differ in rates of iron uptake and mineralization. Fe(II) oxidation in the H/M subunits take place initially at the ferroxidase center, a carboxylate-bridged diiron center, located within the subunit four-helix bundle. In a complementary role, negatively charged r
Probab=20.42 E-value=82 Score=21.87 Aligned_cols=38 Identities=18% Similarity=0.228 Sum_probs=31.9
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHHhCCCCCCcc
Q psy5669 20 NVQRLSTIFRSEPLHPLQKAIRSIEYVIAHRGAPHLKT 57 (116)
Q Consensus 20 na~~ls~~~~d~p~~P~~~av~wiE~v~r~~g~~~l~~ 57 (116)
+...+++.|+.+-..-.+.|...++|+...||.+.+++
T Consensus 35 ~l~g~a~~f~~~a~eE~~HA~~l~~~i~~rgg~~~~~~ 72 (161)
T cd01056 35 ALPGFAKFFRKLSDEEREHAEKLIKYQNKRGGRVVLQD 72 (161)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHcCCeeecCC
Confidence 45667788888888889999999999999999877655
No 161
>PTZ00046 rifin; Provisional
Probab=20.16 E-value=2.1e+02 Score=23.37 Aligned_cols=6 Identities=0% Similarity=0.074 Sum_probs=2.5
Q ss_pred HHHhhh
Q psy5669 92 TSFAEF 97 (116)
Q Consensus 92 ~~~~~~ 97 (116)
.+++|.
T Consensus 335 YLILRY 340 (358)
T PTZ00046 335 YLILRY 340 (358)
T ss_pred HHHHHh
Confidence 344443
Done!