Query psy1644
Match_columns 923
No_of_seqs 298 out of 373
Neff 3.9
Searched_HMMs 46136
Date Fri Aug 16 21:21:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1644.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1644hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0985|consensus 100.0 1E-190 3E-195 1620.8 27.0 499 202-725 7-895 (1666)
2 KOG0985|consensus 100.0 7.2E-68 1.6E-72 609.9 13.0 132 714-846 250-381 (1666)
3 PF13838 Clathrin_H_link: Clat 99.6 2.3E-17 4.9E-22 143.1 0.9 43 402-444 24-66 (66)
4 PF13838 Clathrin_H_link: Clat 99.2 2.8E-11 6E-16 105.5 5.2 47 820-866 1-49 (66)
5 PF01394 Clathrin_propel: Clat 98.6 4.2E-08 9.1E-13 76.7 3.0 37 353-392 1-37 (37)
6 PF09268 Clathrin-link: Clathr 98.4 1.6E-07 3.5E-12 67.4 1.5 24 795-818 1-24 (24)
7 cd00200 WD40 WD40 domain, foun 97.9 0.0057 1.2E-07 59.0 22.7 136 230-389 147-288 (289)
8 smart00299 CLH Clathrin heavy 97.0 0.011 2.3E-07 56.5 11.8 119 669-871 5-127 (140)
9 cd00200 WD40 WD40 domain, foun 96.0 1.7 3.6E-05 42.1 22.3 85 242-335 198-288 (289)
10 smart00299 CLH Clathrin heavy 95.9 0.0026 5.7E-08 60.6 0.5 107 529-637 24-133 (140)
11 TIGR03866 PQQ_ABC_repeats PQQ- 95.7 1.9 4.1E-05 43.9 20.1 228 65-339 33-280 (300)
12 PF02239 Cytochrom_D1: Cytochr 95.6 0.31 6.8E-06 55.0 15.8 268 15-393 3-303 (369)
13 PF00637 Clathrin: Region in C 95.1 0.0033 7.1E-08 59.9 -1.8 126 669-878 5-134 (143)
14 PF01394 Clathrin_propel: Clat 94.7 0.042 9.2E-07 43.3 3.8 37 126-163 1-37 (37)
15 TIGR03866 PQQ_ABC_repeats PQQ- 94.6 4 8.7E-05 41.6 18.7 90 243-340 95-189 (300)
16 PF00637 Clathrin: Region in C 92.0 0.0094 2E-07 56.8 -4.6 63 528-590 23-87 (143)
17 KOG0279|consensus 90.9 5.5 0.00012 44.5 14.3 176 80-309 81-272 (315)
18 PF04841 Vps16_N: Vps16, N-ter 90.1 42 0.00091 38.8 22.2 150 268-474 224-378 (410)
19 KOG1645|consensus 88.3 10 0.00022 44.3 14.3 211 72-334 204-457 (463)
20 PF10282 Lactonase: Lactonase, 87.8 50 0.0011 36.7 20.5 224 63-301 37-324 (345)
21 TIGR03300 assembly_YfgL outer 87.4 1.3 2.8E-05 48.8 6.8 57 734-791 320-376 (377)
22 PRK11138 outer membrane biogen 87.4 1.4 3.1E-05 49.2 7.2 58 734-792 335-392 (394)
23 TIGR02658 TTQ_MADH_Hv methylam 86.2 24 0.00052 40.5 16.0 81 29-126 26-116 (352)
24 PF13360 PQQ_2: PQQ-like domai 86.2 41 0.00088 34.0 17.3 70 222-304 166-235 (238)
25 PRK11028 6-phosphogluconolacto 85.6 58 0.0013 35.3 20.1 34 63-96 35-70 (330)
26 PF04053 Coatomer_WDAD: Coatom 83.6 3 6.6E-05 48.8 7.6 133 176-338 36-173 (443)
27 KOG0650|consensus 83.3 39 0.00085 41.4 16.3 256 36-351 378-650 (733)
28 PF07569 Hira: TUP1-like enhan 83.2 3.6 7.9E-05 43.6 7.4 43 56-100 6-48 (219)
29 KOG0292|consensus 83.0 1.6E+02 0.0034 38.2 23.2 385 73-516 261-699 (1202)
30 KOG2315|consensus 82.9 1.2E+02 0.0027 36.9 23.1 306 4-349 24-401 (566)
31 PRK03629 tolB translocation pr 82.2 66 0.0014 37.1 17.4 121 242-375 222-350 (429)
32 PLN00181 protein SPA1-RELATED; 80.6 82 0.0018 39.1 18.6 86 242-337 681-792 (793)
33 PF13360 PQQ_2: PQQ-like domai 79.5 7.4 0.00016 39.3 7.8 55 732-789 74-145 (238)
34 KOG0315|consensus 78.9 78 0.0017 35.6 15.5 206 85-393 62-289 (311)
35 PF07569 Hira: TUP1-like enhan 77.7 3.9 8.4E-05 43.4 5.4 38 728-768 18-55 (219)
36 PLN03081 pentatricopeptide (PP 77.3 12 0.00026 45.3 10.0 80 466-568 468-551 (697)
37 PF04053 Coatomer_WDAD: Coatom 77.1 15 0.00033 43.2 10.4 144 737-886 208-364 (443)
38 smart00564 PQQ beta-propeller 76.7 4.2 9E-05 29.8 3.8 27 733-759 5-31 (33)
39 PTZ00420 coronin; Provisional 74.4 2.2E+02 0.0047 35.0 21.0 110 265-392 172-293 (568)
40 PF14559 TPR_19: Tetratricopep 74.2 8.8 0.00019 31.8 5.6 34 544-577 24-57 (68)
41 PLN03081 pentatricopeptide (PP 72.6 45 0.00098 40.5 13.2 158 545-726 360-551 (697)
42 PF01011 PQQ: PQQ enzyme repea 72.0 7.3 0.00016 30.3 4.3 30 735-764 1-30 (38)
43 PTZ00421 coronin; Provisional 71.0 2.4E+02 0.0051 33.9 21.4 25 75-99 139-163 (493)
44 KOG1188|consensus 70.8 12 0.00025 43.0 7.1 126 30-191 50-184 (376)
45 PF02897 Peptidase_S9_N: Proly 70.4 24 0.00053 39.6 9.7 135 233-376 95-247 (414)
46 PTZ00421 coronin; Provisional 70.2 68 0.0015 38.3 13.6 111 265-393 173-291 (493)
47 KOG2048|consensus 70.2 1.9E+02 0.0041 36.2 17.1 234 1-331 16-256 (691)
48 smart00564 PQQ beta-propeller 69.7 6.3 0.00014 28.8 3.3 26 74-99 6-31 (33)
49 KOG0307|consensus 68.6 34 0.00074 44.2 11.1 144 239-393 135-328 (1049)
50 KOG0266|consensus 68.4 2.4E+02 0.0052 33.0 22.1 223 45-367 200-434 (456)
51 cd00216 PQQ_DH Dehydrogenases 68.2 11 0.00024 44.2 6.6 57 734-790 301-380 (488)
52 TIGR03300 assembly_YfgL outer 67.2 15 0.00033 40.6 7.2 60 734-797 241-300 (377)
53 PLN03077 Protein ECB2; Provisi 64.9 38 0.00082 42.1 10.6 88 466-588 631-722 (857)
54 KOG0264|consensus 64.8 3E+02 0.0066 32.8 18.0 141 240-397 247-409 (422)
55 PRK13844 recombination protein 64.0 16 0.00036 39.0 6.3 91 494-586 30-162 (200)
56 PRK05137 tolB translocation pr 63.9 2.7E+02 0.0059 32.0 22.3 91 241-339 312-413 (435)
57 KOG1273|consensus 62.9 47 0.001 38.2 9.7 29 76-104 167-195 (405)
58 PLN00181 protein SPA1-RELATED; 62.0 88 0.0019 38.8 12.9 92 290-393 597-691 (793)
59 TIGR01061 parC_Gpos DNA topois 61.6 74 0.0016 40.0 12.2 167 560-795 381-561 (738)
60 PRK04792 tolB translocation pr 61.5 1.2E+02 0.0027 35.3 13.3 130 243-392 198-336 (448)
61 PF08662 eIF2A: Eukaryotic tra 61.1 61 0.0013 33.5 9.8 77 244-328 84-161 (194)
62 PLN03218 maturation of RBCL 1; 60.9 45 0.00097 43.5 10.5 82 466-570 690-779 (1060)
63 PF14779 BBS1: Ciliary BBSome 60.8 23 0.0005 39.2 6.9 59 733-791 194-255 (257)
64 PLN03218 maturation of RBCL 1; 60.2 51 0.0011 43.0 10.8 27 98-128 101-127 (1060)
65 TIGR00615 recR recombination p 60.1 21 0.00046 38.0 6.3 91 494-586 26-158 (195)
66 PF13041 PPR_2: PPR repeat fam 59.1 18 0.00039 29.0 4.4 33 466-498 9-47 (50)
67 KOG1920|consensus 57.6 67 0.0014 42.2 10.9 117 15-133 17-137 (1265)
68 TIGR01062 parC_Gneg DNA topois 57.5 1E+02 0.0022 38.9 12.4 71 725-795 485-555 (735)
69 KOG0266|consensus 57.1 2.8E+02 0.0061 32.4 15.3 73 67-156 251-323 (456)
70 PLN03077 Protein ECB2; Provisi 56.9 50 0.0011 41.1 9.7 83 466-570 259-348 (857)
71 TIGR02800 propeller_TolB tol-p 55.6 2.1E+02 0.0045 32.0 13.4 145 230-392 156-308 (417)
72 PRK11138 outer membrane biogen 54.7 41 0.00089 37.8 7.8 59 734-796 256-314 (394)
73 PRK03629 tolB translocation pr 54.5 1.7E+02 0.0036 33.9 12.8 143 230-392 164-317 (429)
74 PF01011 PQQ: PQQ enzyme repea 54.4 16 0.00036 28.3 3.3 30 75-104 1-30 (38)
75 PTZ00420 coronin; Provisional 53.7 1.7E+02 0.0037 35.9 13.1 116 265-392 130-248 (568)
76 KOG2114|consensus 52.7 1.8E+02 0.004 37.3 13.2 130 741-874 262-415 (933)
77 PRK04922 tolB translocation pr 52.5 4.2E+02 0.0091 30.5 23.1 91 241-339 314-412 (433)
78 PF00780 CNH: CNH domain; Int 51.3 82 0.0018 33.2 9.0 71 11-96 99-169 (275)
79 PRK05560 DNA gyrase subunit A; 50.3 1.7E+02 0.0036 37.3 12.7 71 724-794 488-566 (805)
80 TIGR02800 propeller_TolB tol-p 48.6 4.3E+02 0.0092 29.5 18.7 121 241-375 212-341 (417)
81 PRK11028 6-phosphogluconolacto 48.4 3.9E+02 0.0085 29.0 16.0 135 242-392 101-258 (330)
82 KOG0291|consensus 48.4 7.2E+02 0.016 32.1 17.3 57 67-134 355-412 (893)
83 PRK13979 DNA topoisomerase IV 47.0 3.2E+02 0.007 35.7 14.6 70 724-795 506-582 (957)
84 PF08625 Utp13: Utp13 specific 45.5 24 0.00052 35.5 3.8 51 830-880 2-59 (141)
85 KOG2280|consensus 44.0 8.3E+02 0.018 31.5 20.4 149 689-877 669-824 (829)
86 PRK05137 tolB translocation pr 43.9 5.1E+02 0.011 29.8 14.5 131 230-374 214-352 (435)
87 TIGR03075 PQQ_enz_alc_DH PQQ-d 43.9 37 0.00079 40.7 5.6 34 730-763 468-501 (527)
88 KOG2055|consensus 43.7 96 0.0021 37.2 8.6 114 245-371 282-403 (514)
89 KOG0270|consensus 43.7 6.7E+02 0.014 30.3 15.6 54 290-350 395-461 (463)
90 PRK00178 tolB translocation pr 42.6 3.9E+02 0.0084 30.4 13.1 133 242-392 178-317 (430)
91 KOG0282|consensus 41.9 38 0.00082 40.4 5.1 58 743-801 279-337 (503)
92 KOG3678|consensus 41.5 20 0.00044 42.8 2.9 104 453-573 233-349 (832)
93 KOG0315|consensus 41.1 3E+02 0.0064 31.3 11.3 123 238-375 100-235 (311)
94 KOG0290|consensus 40.5 87 0.0019 35.8 7.4 160 11-255 157-322 (364)
95 TIGR00756 PPR pentatricopeptid 40.1 42 0.0009 23.7 3.4 26 466-491 6-34 (35)
96 PF08662 eIF2A: Eukaryotic tra 39.9 4.3E+02 0.0093 27.4 11.9 119 262-392 5-133 (194)
97 PRK01742 tolB translocation pr 39.6 5.3E+02 0.011 29.7 13.8 143 230-392 169-322 (429)
98 PF12894 Apc4_WD40: Anaphase-p 38.4 48 0.001 27.7 3.9 31 67-98 16-46 (47)
99 PRK05561 DNA topoisomerase IV 38.3 1.1E+02 0.0024 38.5 8.6 71 724-795 497-568 (742)
100 KOG1036|consensus 37.8 5.7E+02 0.012 29.6 13.1 185 7-223 87-283 (323)
101 KOG0306|consensus 37.7 1E+03 0.022 30.8 19.0 154 206-372 347-525 (888)
102 PRK00178 tolB translocation pr 37.5 6.7E+02 0.014 28.6 17.4 121 241-374 221-349 (430)
103 TIGR01063 gyrA DNA gyrase, A s 37.1 4.2E+02 0.0091 33.9 13.4 69 725-793 487-563 (800)
104 PF10433 MMS1_N: Mono-function 37.1 7.1E+02 0.015 29.4 14.5 74 214-303 403-480 (504)
105 TIGR02917 PEP_TPR_lipo putativ 37.0 4.4E+02 0.0096 31.3 12.8 105 466-570 573-694 (899)
106 PF14689 SPOB_a: Sensor_kinase 36.4 38 0.00082 29.5 3.1 21 466-486 29-49 (62)
107 TIGR03075 PQQ_enz_alc_DH PQQ-d 36.1 52 0.0011 39.4 5.2 37 70-106 468-504 (527)
108 KOG2111|consensus 36.0 2.4E+02 0.0052 32.7 9.9 139 268-417 54-198 (346)
109 PF12894 Apc4_WD40: Anaphase-p 35.4 64 0.0014 26.9 4.1 33 725-758 14-46 (47)
110 PRK00076 recR recombination pr 35.1 87 0.0019 33.6 6.1 90 494-586 26-157 (196)
111 PF13570 PQQ_3: PQQ-like domai 34.6 45 0.00097 25.9 3.0 21 733-753 20-40 (40)
112 PRK04922 tolB translocation pr 34.3 7.8E+02 0.017 28.4 19.1 121 241-374 226-354 (433)
113 TIGR02658 TTQ_MADH_Hv methylam 34.1 8.1E+02 0.017 28.5 19.3 75 28-110 75-155 (352)
114 KOG0293|consensus 34.0 1.2E+02 0.0027 36.0 7.5 91 290-396 290-388 (519)
115 TIGR02917 PEP_TPR_lipo putativ 33.9 2E+02 0.0043 34.1 9.4 28 542-569 868-895 (899)
116 KOG4649|consensus 33.8 1.1E+02 0.0024 34.7 6.8 67 729-796 100-167 (354)
117 COG1520 FOG: WD40-like repeat 33.7 1.3E+02 0.0028 33.7 7.5 62 734-795 111-172 (370)
118 PF08309 LVIVD: LVIVD repeat; 33.3 57 0.0012 26.7 3.5 26 141-169 10-35 (42)
119 PF14559 TPR_19: Tetratricopep 32.9 66 0.0014 26.5 3.9 22 849-870 36-57 (68)
120 KOG0639|consensus 32.7 3.3E+02 0.0072 33.4 10.7 116 239-376 483-612 (705)
121 PF13570 PQQ_3: PQQ-like domai 32.7 39 0.00085 26.2 2.4 21 73-93 20-40 (40)
122 KOG2063|consensus 32.0 98 0.0021 39.8 6.8 126 565-715 483-636 (877)
123 PF10313 DUF2415: Uncharacteri 29.7 79 0.0017 26.5 3.7 28 67-94 5-35 (43)
124 KOG1070|consensus 29.3 2.7E+02 0.0059 37.8 10.0 163 416-584 1439-1675(1710)
125 PHA02598 denA endonuclease II; 29.3 48 0.001 33.8 2.9 48 739-792 2-49 (138)
126 PF05131 Pep3_Vps18: Pep3/Vps1 29.0 3E+02 0.0064 28.1 8.5 93 736-851 37-129 (147)
127 KOG0319|consensus 28.9 72 0.0016 39.9 4.8 70 71-141 247-317 (775)
128 KOG1539|consensus 28.5 1.1E+02 0.0024 39.0 6.2 57 68-126 582-638 (910)
129 KOG1273|consensus 27.6 1.4E+02 0.0029 34.7 6.3 32 733-764 164-195 (405)
130 PF13429 TPR_15: Tetratricopep 27.5 88 0.0019 33.3 4.7 29 541-569 244-272 (280)
131 KOG4532|consensus 27.0 1.5E+02 0.0033 33.8 6.4 33 74-106 264-296 (344)
132 PF10366 Vps39_1: Vacuolar sor 26.9 1.3E+02 0.0027 29.1 5.2 54 560-628 13-67 (108)
133 TIGR03074 PQQ_membr_DH membran 26.8 57 0.0012 41.1 3.6 29 733-761 458-486 (764)
134 KOG1896|consensus 26.8 4.1E+02 0.0089 35.7 10.8 151 3-165 483-656 (1366)
135 KOG2003|consensus 26.4 67 0.0014 38.6 3.8 67 440-506 636-707 (840)
136 COG3172 NadR Predicted ATPase/ 26.3 1.8E+02 0.0039 31.0 6.4 55 809-866 22-80 (187)
137 KOG0305|consensus 25.7 1.7E+02 0.0038 35.3 7.1 88 313-412 180-269 (484)
138 cd01449 TST_Repeat_2 Thiosulfa 25.7 1E+02 0.0023 28.3 4.3 42 575-616 62-103 (118)
139 PF00167 FGF: Fibroblast growt 25.5 1.2E+02 0.0027 29.1 4.9 56 734-790 51-108 (122)
140 KOG0276|consensus 25.5 91 0.002 38.7 4.7 78 267-354 20-100 (794)
141 KOG0547|consensus 25.1 69 0.0015 38.8 3.7 55 831-885 121-196 (606)
142 KOG2063|consensus 25.0 1.8E+02 0.004 37.5 7.5 101 743-849 259-370 (877)
143 PRK02889 tolB translocation pr 24.7 1.1E+03 0.024 27.2 21.0 67 17-96 163-232 (427)
144 PRK04792 tolB translocation pr 24.5 1.2E+03 0.026 27.4 17.6 121 241-375 240-369 (448)
145 KOG2005|consensus 24.4 1.7E+02 0.0038 36.8 6.8 69 796-865 208-279 (878)
146 PF10415 FumaraseC_C: Fumarase 24.4 65 0.0014 27.8 2.5 42 841-884 11-55 (55)
147 PF00568 WH1: WH1 domain; Int 24.3 85 0.0018 30.0 3.5 66 701-772 5-77 (111)
148 PRK14574 hmsH outer membrane p 24.0 3.4E+02 0.0073 34.8 9.5 53 467-519 75-133 (822)
149 cd01448 TST_Repeat_1 Thiosulfa 23.8 1.4E+02 0.0029 27.9 4.7 45 572-616 60-105 (122)
150 KOG0289|consensus 23.7 1.4E+03 0.03 27.9 15.7 139 225-393 319-461 (506)
151 KOG0269|consensus 23.5 1.3E+02 0.0028 38.1 5.5 111 229-353 189-311 (839)
152 KOG2377|consensus 23.2 4.5E+02 0.0097 32.2 9.5 92 231-338 36-140 (657)
153 PF13970 DUF4221: Domain of un 23.1 1.5E+02 0.0033 32.9 5.8 63 732-795 54-134 (333)
154 PF04840 Vps16_C: Vps16, C-ter 23.1 2.1E+02 0.0045 32.5 6.8 56 807-869 251-306 (319)
155 PRK04043 tolB translocation pr 23.0 1.3E+03 0.027 27.1 14.0 118 242-377 212-340 (419)
156 KOG2110|consensus 22.9 1.3E+03 0.029 27.4 14.3 99 242-397 152-253 (391)
157 PF00568 WH1: WH1 domain; Int 22.8 1.5E+02 0.0031 28.4 4.8 29 84-112 49-77 (111)
158 smart00442 FGF Acidic and basi 22.7 2.1E+02 0.0046 28.2 6.0 59 733-792 52-112 (126)
159 KOG0269|consensus 22.3 6.7E+02 0.014 32.3 11.1 198 131-374 21-239 (839)
160 KOG0646|consensus 22.2 6.2E+02 0.014 30.7 10.4 184 70-315 131-324 (476)
161 PRK10747 putative protoheme IX 22.1 2.2E+02 0.0047 32.5 6.8 53 466-518 269-324 (398)
162 PF10395 Utp8: Utp8 family; I 21.8 3.5E+02 0.0076 34.1 8.8 71 16-91 85-158 (670)
163 PF00780 CNH: CNH domain; Int 21.6 3.1E+02 0.0067 28.9 7.4 74 721-795 34-123 (275)
164 KOG0319|consensus 21.4 1.3E+02 0.0029 37.8 5.1 73 730-802 246-319 (775)
165 KOG0282|consensus 21.0 55 0.0012 39.2 1.8 59 28-101 235-297 (503)
166 TIGR02552 LcrH_SycD type III s 21.0 1.9E+02 0.0042 26.7 5.1 98 425-559 8-115 (135)
167 KOG0288|consensus 21.0 1.6E+02 0.0035 35.0 5.4 61 729-789 394-456 (459)
168 PF01535 PPR: PPR repeat; Int 21.0 67 0.0015 22.4 1.6 20 551-570 6-25 (31)
169 smart00671 SEL1 Sel1-like repe 21.0 87 0.0019 22.8 2.3 29 530-560 8-36 (36)
170 smart00517 PolyA C-terminal do 20.9 40 0.00086 30.3 0.5 30 623-654 16-46 (64)
171 KOG0091|consensus 20.8 56 0.0012 34.7 1.6 45 868-912 96-155 (213)
172 PRK05560 DNA gyrase subunit A; 20.8 4.1E+02 0.0088 34.0 9.3 64 73-137 603-672 (805)
173 PRK15174 Vi polysaccharide exp 20.3 4.4E+02 0.0096 32.5 9.3 146 416-570 159-343 (656)
174 cd00216 PQQ_DH Dehydrogenases 20.2 2E+02 0.0044 33.9 6.2 29 734-762 61-89 (488)
175 cd00058 FGF Acidic and basic f 20.1 2E+02 0.0044 28.3 5.2 59 733-792 48-108 (123)
176 PF14583 Pectate_lyase22: Olig 20.0 3.7E+02 0.008 31.8 8.1 103 206-319 22-131 (386)
No 1
>KOG0985|consensus
Probab=100.00 E-value=1.3e-190 Score=1620.80 Aligned_cols=499 Identities=66% Similarity=1.036 Sum_probs=488.8
Q ss_pred ccceeeeccccccccCCcccccceeeecceEEEEEeccCCCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccC
Q psy1644 202 EFKNRIYLLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKA 281 (923)
Q Consensus 202 ~~~~e~~~Lt~lGI~~~si~F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~ 281 (923)
.|. ||+||+|+||+|++|+|+++||||||||||||+.|+++|||||||+++.+|+||||+||||||||+++|||||
T Consensus 7 ~f~-E~lqL~~~GI~~~~~~F~~~TmESDk~icvRe~~~~~~qVvIiDm~d~~~piRRpisadSaIm~P~kkvialk--- 82 (1666)
T KOG0985|consen 7 RFQ-EHLQLQNLGINPQNIGFSNVTMESDKFICVREKVGGQNQVVIIDMADPVNPIRRPISADSAIMNPDKKVIALK--- 82 (1666)
T ss_pred eHH-HHhcccccCcCccccccceeeeecCceEEEeccCCCcceEEEEEcCCCcccccCCCCCCchhhCchHHHHHHh---
Confidence 455 9999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccCCeEEEEecCeEEEEeccCCCCchhhhhcccCCCCCeeEee
Q psy1644 282 GNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISPNTLALVTETSVYHWSMEGDSTPEKMFDRHSTLNGCQIINY 361 (923)
Q Consensus 282 ~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~~tLglVT~taVYHWsi~g~s~P~K~FdR~~~L~~~QIInY 361 (923)
+|++|||||+|+|+|+|+|+|+|+|+||||||+++||+||+||||||+++|+++|+||||||++|+|||||||
T Consensus 83 -------a~~~lQIFniE~KsK~ksh~m~E~VifWkWIs~kti~lVT~tsV~hws~egds~PvkmfdRh~sL~gcQiIny 155 (1666)
T KOG0985|consen 83 -------AGKTLQIFNIERKSKSKAHTMDEDVIFWKWISDKTIALVTATSVYHWSKEGDSAPVKMFDRHRSLNGCQIINY 155 (1666)
T ss_pred -------hCCeEEEEehhhhhhhccCCCCCceEEEEecchhHHHHhhhhheeeecccccchHHHHHHhhhccccceEEEe
Confidence 6889999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eeCCCCCeEEEEeeecccccccceeeecccccccccccccce--------------------------------------
Q psy1644 362 RTDPKQTWLLLIGISAVSNRVVGAMQLYSVERKCSQPIEEKK-------------------------------------- 403 (923)
Q Consensus 362 r~d~~~kW~~LvGI~~~~~~i~G~mQLYS~er~~SQ~iEGk~-------------------------------------- 403 (923)
++|..++|++|+||+++++||+|+|||||++|++||+|||++
T Consensus 156 k~~~~~eWl~LVGIs~r~~rv~G~MQLyS~~rkvSQ~iegHaa~Fa~fk~egne~p~~lF~fa~r~~~~GkLhiiEvg~~ 235 (1666)
T KOG0985|consen 156 KGNLNAEWLALVGISARTNRVAGKMQLYSVDRKVSQPIEGHAASFAAFKDEGNEVPSCLFSFAVRKPTGGKLHIIEVGDP 235 (1666)
T ss_pred ccccccCceeEEeeecccchhheeeeeeeeccchhhhhHHHHHHHHHhcccCCCCceEEEEecCCCCCCCeeEEEecCCC
Confidence 999999999999999999999999999999999999999900
Q ss_pred --------------------------------------------------------------------------------
Q psy1644 404 -------------------------------------------------------------------------------- 403 (923)
Q Consensus 404 -------------------------------------------------------------------------------- 403 (923)
T Consensus 236 ~tg~~~f~kk~vDvffppda~nDFPvamqvs~k~~vIyLiTKyG~ihlYdlEt~tciy~NRIs~dtIFvta~~~~s~Gi~ 315 (1666)
T KOG0985|consen 236 PTGNMPFLKKKVDVFFPPDAINDFPVAMQVSAKYGVIYLITKYGYIHLYDLETGTCIYRNRISADTIFVTAPHDSSSGIQ 315 (1666)
T ss_pred ccCCCcchhhhhhccCChhhccccchheeeccccCeEEEEeecchhhhhhhhhhhhhhhhhccccceeEecccccCcchh
Confidence
Q ss_pred ----------------------------------------------------------------eEEeccCCCccccCHH
Q psy1644 404 ----------------------------------------------------------------SRVAANAPKGILRTPQ 419 (923)
Q Consensus 404 ----------------------------------------------------------------AkvAA~SP~giLRt~e 419 (923)
||+||+||+||||||+
T Consensus 316 ~iNrKGQVLsv~v~e~~IVpyi~~~LqnpeLal~~A~R~nLaGAe~Lfv~rFneLfaqG~Y~eAAkvAAsSPrgILRt~~ 395 (1666)
T KOG0985|consen 316 GINRKGQVLSVLVNEQNIVPYITNVLQNPELALRLAVRANLAGAENLFVRRFNELFAQGEYEEAAKVAASSPRGILRTPG 395 (1666)
T ss_pred hhhhcceEEEEEecCcceehhhhccccChHHHHHhhhhcCCccHHHHHHHHHHHHHhCccHHHHHHHHHhCchhhhcCHH
Confidence 9999999999999999
Q ss_pred HHHhhhcCCCCCCCCchHHHHHHHHHhcCCCCcccccccchhhHHH----------------------------------
Q psy1644 420 TIVKFQQVPTPAGQTSPLLQYFGILLDHGKLNKHESLELCRPVLAQ---------------------------------- 465 (923)
Q Consensus 420 TI~rFK~vp~~pGq~sPLL~YF~~LL~~g~LN~~ESlEL~rpVL~Q---------------------------------- 465 (923)
||+|||++|++|||++|||||||+||++|+||++||||||||||+|
T Consensus 396 Ti~kFq~V~a~~Gq~sPLLqYFg~LLdqGkLNk~ETLEL~RpVL~Q~RkqLlekWl~EdKLeCSEELGDlVK~~d~~lAL 475 (1666)
T KOG0985|consen 396 TINKFQSVPAQPGQPSPLLQYFGTLLDQGKLNKYETLELCRPVLQQGRKQLLEKWLKEDKLECSEELGDLVKPYDTTLAL 475 (1666)
T ss_pred HHHHHHcCCCCCCCCCcHHHHHHHHHhcccccHHHHHHHHHHHHhhhHHHHHHHHhhhhhhhhhHHhcCccccCCchHHH
Confidence 9999999999999999999999999999999999999999999999
Q ss_pred -----------HHHHHHHhcChhHHHHHHHHhCCCccHHHHHHHHHhcCCchhhHHHHhhhcCCCCCcCccc--------
Q psy1644 466 -----------VIQSFAETGQFQKIVLYAKKVGYSPDYVFLLRNVMRVNPDQGVGFAQMLVQDDEPLADINQ-------- 526 (923)
Q Consensus 466 -----------Vi~clae~Gqfdkiv~Y~~~vgytpDy~~LL~~i~~~nP~~a~efA~~L~~~~~pl~di~~-------- 526 (923)
|++||||+|||+||+.||+|+||||||+++||+++|++||+|++||.||.++++|++|+++
T Consensus 476 ~iYlrAnvp~KVi~cfAE~Gqf~KiilY~kKvGyTPdymflLq~l~r~sPD~~~qFa~~l~Q~~~~~~die~I~DlFme~ 555 (1666)
T KOG0985|consen 476 SIYLRANVPAKVIQCFAETGQFKKIILYAKKVGYTPDYMFLLQQLKRSSPDQALQFAMMLVQDEEPLADIEQIVDLFMEL 555 (1666)
T ss_pred HHHHHcCCcHHHHHHHHHhcchhHHHHHHHHcCCCccHHHHHHHHHccChhHHHHHHHHhhccCCCcccHHHHHHHHHHH
Confidence 9999999999999999999999999999999999999999999999999999999999998
Q ss_pred --------------------------------------------------------------------------------
Q psy1644 527 -------------------------------------------------------------------------------- 526 (923)
Q Consensus 527 -------------------------------------------------------------------------------- 526 (923)
T Consensus 556 N~iQq~TSFLLdaLK~~~Pd~g~LQTrLLE~NL~~aPqVADAILgN~mFtHyDra~IAqLCEKAGL~qraLehytDl~DI 635 (1666)
T KOG0985|consen 556 NLIQQCTSFLLDALKLNSPDEGHLQTRLLEMNLVHAPQVADAILGNDMFTHYDRAEIAQLCEKAGLLQRALEHYTDLYDI 635 (1666)
T ss_pred HhhhhhHHHHHHHhcCCChhhhhHHHHHHHHHhccchHHHHHHHhccccccccHHHHHHHHHhcchHHHHHHhcccHHHH
Confidence
Q ss_pred ----------------------------------------------------------------------cchhhhhhhc
Q psy1644 527 ----------------------------------------------------------------------SYEGLFYFLG 536 (923)
Q Consensus 527 ----------------------------------------------------------------------~~egLyy~Lg 536 (923)
+|||||||||
T Consensus 636 KR~vVhth~L~pEwLv~yFg~lsve~s~eclkaml~~NirqNlQi~VQvatky~eqlg~~~li~lFE~fks~eGL~yfLg 715 (1666)
T KOG0985|consen 636 KRVVVHTHLLNPEWLVNYFGSLSVEDSLECLKAMLSANIRQNLQIVVQVATKYHEQLGAQALIELFESFKSYEGLYYFLG 715 (1666)
T ss_pred HHHHHHhccCCHHHHHHHHHhcCHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCHHHHHHHHHhhccchhHHHHHH
Confidence 6999999999
Q ss_pred ccccccCCcchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhccCCCCCceEEeeccccchhHHHHHHHhcCc
Q psy1644 537 SIVNYSQDQEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNL 616 (923)
Q Consensus 537 sivn~s~d~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKeakL~d~~PLI~VCDrf~fV~dLv~YLY~n~~ 616 (923)
||||+|||||||||||||||||||+|||||||||||||||||||||||||||+||+||||||||||||||||+|||+||+
T Consensus 716 Sivn~seDpevh~KYIqAA~kt~QikEvERicresn~YdpErvKNfLkeAkL~DqlPLiiVCDRf~fVhdlvlYLyrnn~ 795 (1666)
T KOG0985|consen 716 SIVNFSEDPEVHFKYIQAACKTGQIKEVERICRESNCYDPERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNL 795 (1666)
T ss_pred HHhccccCchHHHHHHHHHHhhccHHHHHHHHhccccCCHHHHHHHHHhccccccCceEEEecccccHHHHHHHHHHhhH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hheehhhhhccCccccceeee-eccccCCHHHhhhhhccCcchhhhHHhhhccccccchhhHHHHH-hhcchhHHHHHHH
Q psy1644 617 QKYIEIYVQKGRKHLLEKWLK-EEKLECSEELGDLVKQSDPTLALSVYLRANIKQSRFVHDLVLYL-YRNNLQKYIEIYV 694 (923)
Q Consensus 617 ~k~IEvYvqkvNp~~tP~Vvg-LlD~dc~E~~Ik~LL~sv~~l~~~~~v~~~~~~~~pv~~Lv~ev-~RnrL~kll~p~L 694 (923)
+||||+|||||||+|||+||| |||+|||||+||+|++|| +|++|+||||+|| ||||| |||+|||
T Consensus 796 ~kyIE~yVQkvNps~~p~VvG~LLD~dC~E~~ik~Li~~v-------------~gq~~~deLv~EvEkRNRL-klLlp~L 861 (1666)
T KOG0985|consen 796 QKYIEIYVQKVNPSRTPQVVGALLDVDCSEDFIKNLILSV-------------RGQFPVDELVEEVEKRNRL-KLLLPWL 861 (1666)
T ss_pred HHHHHHHHhhcCCcccchhhhhhhcCCCcHHHHHHHHHHH-------------hccCChHHHHHHHHhhhhH-HHHHHHH
Confidence 999999999999999999999 999999999999999996 6777999999999 69999 9999999
Q ss_pred HHHHHhhcccchhhhHHHHhH---hcCccceeee
Q psy1644 695 QKIRIKIKQSRFVHDLVLYLY---RNNLQKYIEI 725 (923)
Q Consensus 695 E~~~~~g~q~~~v~nalakiy---~~~~~~~~~i 725 (923)
|++++||+||||+|||||||| +++|++|+.=
T Consensus 862 E~~i~eG~~d~a~hnAlaKIyIDSNNnPE~fLke 895 (1666)
T KOG0985|consen 862 ESLIQEGSQDPATHNALAKIYIDSNNNPERFLKE 895 (1666)
T ss_pred HHHHhccCcchHHHhhhhheeecCCCChHHhccc
Confidence 999999999999999999999 7779988753
No 2
>KOG0985|consensus
Probab=100.00 E-value=7.2e-68 Score=609.89 Aligned_cols=132 Identities=66% Similarity=1.084 Sum_probs=127.6
Q ss_pred hHhcCccceeeeeeeeccccccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEe
Q psy1644 714 LYRNNLQKYIEIYVQKVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSV 793 (923)
Q Consensus 714 iy~~~~~~~~~i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~I 793 (923)
++.+....|||+.+| ||.|||+||+|||+||+|+||+|||+||||||||+|+||+|++++.++||++||||||||+|+|
T Consensus 250 ffppda~nDFPvamq-vs~k~~vIyLiTKyG~ihlYdlEt~tciy~NRIs~dtIFvta~~~~s~Gi~~iNrKGQVLsv~v 328 (1666)
T KOG0985|consen 250 FFPPDAINDFPVAMQ-VSAKYGVIYLITKYGYIHLYDLETGTCIYRNRISADTIFVTAPHDSSSGIQGINRKGQVLSVLV 328 (1666)
T ss_pred cCChhhccccchhee-eccccCeEEEEeecchhhhhhhhhhhhhhhhhccccceeEecccccCcchhhhhhcceEEEEEe
Confidence 344556799999976 9999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCcccccccccCCCHHHHHHHHHHcCCCCchHHHHHHHHHHHhcCcHHHHHH
Q psy1644 794 DEDNIIPYINGTLQNPDLALRIAVRNNLAGAEDLFVRKFNLLFTNGQYSEAAK 846 (923)
Q Consensus 794 de~~iVpYI~~~L~n~~LA~kLA~R~~LPGAddLf~~qFn~l~~~G~y~eAAk 846 (923)
|+++|||||+++|+||+||+++|+|+||||||+||+++||+||++|+|+||||
T Consensus 329 ~e~~IVpyi~~~LqnpeLal~~A~R~nLaGAe~Lfv~rFneLfaqG~Y~eAAk 381 (1666)
T KOG0985|consen 329 NEQNIVPYITNVLQNPELALRLAVRANLAGAENLFVRRFNELFAQGEYEEAAK 381 (1666)
T ss_pred cCcceehhhhccccChHHHHHhhhhcCCccHHHHHHHHHHHHHhCccHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999
No 3
>PF13838 Clathrin_H_link: Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=99.65 E-value=2.3e-17 Score=143.05 Aligned_cols=43 Identities=65% Similarity=1.111 Sum_probs=35.6
Q ss_pred ceeEEeccCCCccccCHHHHHhhhcCCCCCCCCchHHHHHHHH
Q psy1644 402 KKSRVAANAPKGILRTPQTIVKFQQVPTPAGQTSPLLQYFGIL 444 (923)
Q Consensus 402 k~AkvAA~SP~giLRt~eTI~rFK~vp~~pGq~sPLL~YF~~L 444 (923)
++||+||+||+|+|||++||+|||++|++|||+||||+||++|
T Consensus 24 eAA~~AA~sP~giLRt~~Ti~rFk~~p~~pGq~splL~YF~~L 66 (66)
T PF13838_consen 24 EAAKVAANSPRGILRTPETINRFKQVPAQPGQPSPLLQYFGAL 66 (66)
T ss_dssp HHHHHHHHSGGGTT-SHHHHHHHHTS---TTS--HHHHHHHHH
T ss_pred HHHHHHHhCccchhcCHHHHHHHHcCCCCCCCCCHHHHHHhhC
Confidence 5699999999999999999999999999999999999999987
No 4
>PF13838 Clathrin_H_link: Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=99.17 E-value=2.8e-11 Score=105.47 Aligned_cols=47 Identities=49% Similarity=0.774 Sum_probs=40.4
Q ss_pred CCCCchHHHHHHHHHHHhcCcHHHHHHHHHh--cCChhHHHHHHHhccc
Q psy1644 820 NLAGAEDLFVRKFNLLFTNGQYSEAAKAACK--TGQIKEVERICRESNC 866 (923)
Q Consensus 820 ~LPGAddLf~~qFn~l~~~G~y~eAAk~A~k--tg~i~e~eri~r~~~~ 866 (923)
||||||+||++|||+||++|||+||||+|+. .|-+|+.|+|.|+.+.
T Consensus 1 ~lpGad~l~~~~F~~l~~~g~y~eAA~~AA~sP~giLRt~~Ti~rFk~~ 49 (66)
T PF13838_consen 1 NLPGADDLYVQQFNELFSQGQYEEAAKVAANSPRGILRTPETINRFKQV 49 (66)
T ss_dssp T-SS-HHHHHHHHHHHHHTT-HHHHHHHHHHSGGGTT-SHHHHHHHHTS
T ss_pred CCccHHHHHHHHHHHHHHcCCHHHHHHHHHhCccchhcCHHHHHHHHcC
Confidence 7999999999999999999999999999996 5799999999999875
No 5
>PF01394 Clathrin_propel: Clathrin propeller repeat; InterPro: IPR022365 This entry represents the propeller repeat found in clathrin heavy chains. The N terminus of the heavy chain is known as the globular domain, and is composed of seven repeats which form a beta propeller []. Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. More information about these proteins can be found at Protein of the Month: Clathrin [].; PDB: 1UTC_B 3GD1_I 3GC3_B 1BPO_C 1C9I_B 1C9L_A 2XZG_A 2XZH_A.
Probab=98.56 E-value=4.2e-08 Score=76.69 Aligned_cols=37 Identities=57% Similarity=0.935 Sum_probs=35.0
Q ss_pred CCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 353 LNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 353 L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
+++||+|+|+++++++|+.++||+ ++++|+|||||+|
T Consensus 1 ~~~~q~i~f~~~~~es~~~iigi~---~~~~G~v~l~sve 37 (37)
T PF01394_consen 1 GANSQIINFRTAPMESWKFIIGIS---GRVKGQVQLYSVE 37 (37)
T ss_dssp TCCCEEEECTTECCCEEEEEEEEE---TCTEEEEEEEETT
T ss_pred CCcccEEEEEeCCCcCeEEEEEEE---eeeeeEEEEEECC
Confidence 468999999999999999999999 8899999999986
No 6
>PF09268 Clathrin-link: Clathrin, heavy-chain linker; InterPro: IPR015348 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the core motif for the alpha-helical zigzag linker region connecting the conserved N-terminal beta-propeller region to the C-terminal alpha-alpha-superhelical region in clathrin heavy chains []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 1C9I_A 1BPO_B 1C9L_A 1UTC_B 3GD1_I 3GC3_B 2XZG_A 2XZH_A.
Probab=98.35 E-value=1.6e-07 Score=67.37 Aligned_cols=24 Identities=75% Similarity=1.156 Sum_probs=21.1
Q ss_pred CCcccccccccCCCHHHHHHHHHH
Q psy1644 795 EDNIIPYINGTLQNPDLALRIAVR 818 (923)
Q Consensus 795 e~~iVpYI~~~L~n~~LA~kLA~R 818 (923)
|++|||||.++|+|++||++||+|
T Consensus 1 e~~IVpyi~~~L~N~~LAl~lA~R 24 (24)
T PF09268_consen 1 EENIVPYILNTLQNPDLALRLASR 24 (24)
T ss_dssp TTTHHHHHHHTT--HHHHHHHHHH
T ss_pred CccchhHHHhccCCHHHHHHHhcC
Confidence 689999999999999999999998
No 7
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.88 E-value=0.0057 Score=59.04 Aligned_cols=136 Identities=9% Similarity=0.171 Sum_probs=86.1
Q ss_pred ceEEEEEeccCCCceEEEEECCCCCCCccccc---CccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccccccc
Q psy1644 230 DKFICVREKIAESAQVVIIDMNDPTNPIRRPI---SADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKA 306 (923)
Q Consensus 230 DkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi---~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs 306 (923)
.+++.+-. ....|.++|+........... ..-+..++|..+.++.-+ ....+.+||+.+.+.++.
T Consensus 147 ~~~l~~~~---~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~---------~~~~i~i~d~~~~~~~~~ 214 (289)
T cd00200 147 GTFVASSS---QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSS---------SDGTIKLWDLSTGKCLGT 214 (289)
T ss_pred CCEEEEEc---CCCcEEEEEccccccceeEecCccccceEEECCCcCEEEEec---------CCCcEEEEECCCCceecc
Confidence 34555432 234788899875443222211 123456778876666652 246899999998888888
Q ss_pred c-cCCCceeEEEeccCCeEEEEe--cCeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeeccccccc
Q psy1644 307 H-PMTDDVVFWKWISPNTLALVT--ETSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVV 383 (923)
Q Consensus 307 ~-~m~E~VvfWkWis~~tLglVT--~taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~ 383 (923)
+ ..+..+.-=.|-....+.++. +..++.|++.. ..+.+.+.-| ...|.....++++++++..| -.
T Consensus 215 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~-~~~~~~~~~~----~~~i~~~~~~~~~~~l~~~~-------~d 282 (289)
T cd00200 215 LRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT-GECVQTLSGH----TNSVTSLAWSPDGKRLASGS-------AD 282 (289)
T ss_pred hhhcCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCC-ceeEEEcccc----CCcEEEEEECCCCCEEEEec-------CC
Confidence 7 456678888888766666555 67899999864 2333334322 23577888888888876554 24
Q ss_pred ceeeec
Q psy1644 384 GAMQLY 389 (923)
Q Consensus 384 G~mQLY 389 (923)
|.+.+|
T Consensus 283 ~~i~iw 288 (289)
T cd00200 283 GTIRIW 288 (289)
T ss_pred CeEEec
Confidence 555555
No 8
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=96.96 E-value=0.011 Score=56.46 Aligned_cols=119 Identities=16% Similarity=0.209 Sum_probs=89.9
Q ss_pred ccccchhhHHHHHh-hcchhHHHHHHHHHHHHhhcccchhhhHHHHhHhcCccceeeeeeeeccccccEEEEEeccceEE
Q psy1644 669 KQSRFVHDLVLYLY-RNNLQKYIEIYVQKIRIKIKQSRFVHDLVLYLYRNNLQKYIEIYVQKVSAKYDVVYLITKFGYIH 747 (923)
Q Consensus 669 ~~~~pv~~Lv~ev~-RnrL~kll~p~LE~~~~~g~q~~~v~nalakiy~~~~~~~~~i~~~qvS~KyGvIyvITK~Gflh 747 (923)
...+|++++|+.++ +|.. ++|.+|||+.+.+|+.++++||.|..+|- . |+
T Consensus 5 ~~~~~~~~vv~~~~~~~~~-~~l~~yLe~~~~~~~~~~~~~~~li~ly~-----------~-----~~------------ 55 (140)
T smart00299 5 SDPIDVSEVVELFEKRNLL-EELIPYLESALKLNSENPALQTKLIELYA-----------K-----YD------------ 55 (140)
T ss_pred CCcCCHHHHHHHHHhCCcH-HHHHHHHHHHHccCccchhHHHHHHHHHH-----------H-----HC------------
Confidence 56789999999995 7777 99999999999999999999999997761 1 00
Q ss_pred EEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecCCcccccccccC--CCHHHHHHHHHHcCCCCch
Q psy1644 748 VYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDEDNIIPYINGTL--QNPDLALRIAVRNNLAGAE 825 (923)
Q Consensus 748 lyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide~~iVpYI~~~L--~n~~LA~kLA~R~~LPGAd 825 (923)
++.+++|+.++. =|.+-|++++.+.|+.
T Consensus 56 -----------------------------------------------~~~ll~~l~~~~~~yd~~~~~~~c~~~~l~--- 85 (140)
T smart00299 56 -----------------------------------------------PQKEIERLDNKSNHYDIEKVGKLCEKAKLY--- 85 (140)
T ss_pred -----------------------------------------------HHHHHHHHHhccccCCHHHHHHHHHHcCcH---
Confidence 225677777432 3678899999999864
Q ss_pred HHHHHHHHHHHhcCcHHHHHHHHHhc-CChhHHHHHHHhcccCChHH
Q psy1644 826 DLFVRKFNLLFTNGQYSEAAKAACKT-GQIKEVERICRESNCYNAER 871 (923)
Q Consensus 826 dLf~~qFn~l~~~G~y~eAAk~A~kt-g~i~e~eri~r~~~~y~~~~ 871 (923)
...+-=+-..|+|++|.+.+.+. +..+...+.|+.-+ +||.
T Consensus 86 ---~~~~~l~~k~~~~~~Al~~~l~~~~d~~~a~~~~~~~~--~~~l 127 (140)
T smart00299 86 ---EEAVELYKKDGNFKDAIVTLIEHLGNYEKAIEYFVKQN--NPEL 127 (140)
T ss_pred ---HHHHHHHHhhcCHHHHHHHHHHcccCHHHHHHHHHhCC--CHHH
Confidence 23333445569999999988876 77777777777644 5553
No 9
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=95.99 E-value=1.7 Score=42.15 Aligned_cols=85 Identities=13% Similarity=0.232 Sum_probs=50.2
Q ss_pred CceEEEEECCCCCCCccc---ccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccC-CCceeEEE
Q psy1644 242 SAQVVIIDMNDPTNPIRR---PISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPM-TDDVVFWK 317 (923)
Q Consensus 242 ~~qVvIVDL~n~~~v~Rr---pi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m-~E~VvfWk 317 (923)
...|.+.|+......... +-...+...+|...+++.-+ ....+.+|++++...++.+.. ...+.--.
T Consensus 198 ~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~---------~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~ 268 (289)
T cd00200 198 DGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGS---------EDGTIRVWDLRTGECVQTLSGHTNSVTSLA 268 (289)
T ss_pred CCcEEEEECCCCceecchhhcCCceEEEEEcCCCcEEEEEc---------CCCcEEEEEcCCceeEEEccccCCcEEEEE
Confidence 457888998764432221 11233455677777776651 256899999998777776654 33566666
Q ss_pred eccC-CeEEEEecC-eEEEE
Q psy1644 318 WISP-NTLALVTET-SVYHW 335 (923)
Q Consensus 318 Wis~-~tLglVT~t-aVYHW 335 (923)
|-.+ +.|+..+.. .+..|
T Consensus 269 ~~~~~~~l~~~~~d~~i~iw 288 (289)
T cd00200 269 WSPDGKRLASGSADGTIRIW 288 (289)
T ss_pred ECCCCCEEEEecCCCeEEec
Confidence 7655 344333333 45555
No 10
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=95.86 E-value=0.0026 Score=60.59 Aligned_cols=107 Identities=28% Similarity=0.402 Sum_probs=77.1
Q ss_pred hhhhhhhcccccc-cCCcchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhccCCCCCceEEeeccccchhHH
Q psy1644 529 EGLFYFLGSIVNY-SQDQEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKLTDQLPLIIVCDRFDFVHDL 607 (923)
Q Consensus 529 egLyy~Lgsivn~-s~d~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKeakL~d~~PLI~VCDrf~fV~dL 607 (923)
+-|+-||.+++.. +.++++|-.||+..++..+-+.+|.+...|++||++++-....+.++.+. .+++..+++..++.
T Consensus 24 ~~l~~yLe~~~~~~~~~~~~~~~li~ly~~~~~~~ll~~l~~~~~~yd~~~~~~~c~~~~l~~~--~~~l~~k~~~~~~A 101 (140)
T smart00299 24 EELIPYLESALKLNSENPALQTKLIELYAKYDPQKEIERLDNKSNHYDIEKVGKLCEKAKLYEE--AVELYKKDGNFKDA 101 (140)
T ss_pred HHHHHHHHHHHccCccchhHHHHHHHHHHHHCHHHHHHHHHhccccCCHHHHHHHHHHcCcHHH--HHHHHHhhcCHHHH
Confidence 4566777776654 57789999999999999999999998878999999999999999888653 45555666666665
Q ss_pred HHHHHh--cCchheehhhhhccCccccceeee
Q psy1644 608 VLYLYR--NNLQKYIEIYVQKGRKHLLEKWLK 637 (923)
Q Consensus 608 v~YLY~--n~~~k~IEvYvqkvNp~~tP~Vvg 637 (923)
+..+-+ ++..+.|+...+.-+|.--..+++
T Consensus 102 l~~~l~~~~d~~~a~~~~~~~~~~~lw~~~~~ 133 (140)
T smart00299 102 IVTLIEHLGNYEKAIEYFVKQNNPELWAEVLK 133 (140)
T ss_pred HHHHHHcccCHHHHHHHHHhCCCHHHHHHHHH
Confidence 555444 356667775555455443333333
No 11
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=95.65 E-value=1.9 Score=43.91 Aligned_cols=228 Identities=14% Similarity=0.153 Sum_probs=106.4
Q ss_pred ceeEEeccccceEEEE-eeeeeEEEEecccceeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeecccc
Q psy1644 65 PVAMQVSAKYDVVYLI-TKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDK 143 (923)
Q Consensus 65 PvamqiS~k~G~iyli-TK~G~lhl~Dlet~t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk 143 (923)
|.++.+++.-+.+|+. ...|.+.+||..++..+-.-....+.. .+.+...| +
T Consensus 33 ~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~~~-----------~~~~~~~g----------------~ 85 (300)
T TIGR03866 33 PRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPDPE-----------LFALHPNG----------------K 85 (300)
T ss_pred CCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCCcc-----------EEEECCCC----------------C
Confidence 5667778776777765 456899999999987653211111111 11222222 1
Q ss_pred eeEEEecccccceEEEEecCCCCCC--CccCccccccccCcccchhhhcCCCCCC----CCCCcccceeeeccccccccC
Q psy1644 144 FICVREKIAESAQVVIIDMNDPTNP--IRRPISADSAIMNPASKVIALKGKAGND----NNPNGEFKNRIYLLTNVGINV 217 (923)
Q Consensus 144 ~ic~~~~~~~~~~vviid~~~~~~~--~r~~~~a~~~im~p~~~i~al~~~~~~~----~~~~~~~~~e~~~Lt~lGI~~ 217 (923)
++.+-. .+..++.++|+...... +.-+....++-++|..+.++.-+.-++. +...++.. .....+-.|
T Consensus 86 ~l~~~~--~~~~~l~~~d~~~~~~~~~~~~~~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~~~~----~~~~~~~~~ 159 (300)
T TIGR03866 86 ILYIAN--EDDNLVTVIDIETRKVLAEIPVGVEPEGMAVSPDGKIVVNTSETTNMAHFIDTKTYEIV----DNVLVDQRP 159 (300)
T ss_pred EEEEEc--CCCCeEEEEECCCCeEEeEeeCCCCcceEEECCCCCEEEEEecCCCeEEEEeCCCCeEE----EEEEcCCCc
Confidence 111110 11134555555442110 0001223456678888777653321110 11111111 101122233
Q ss_pred CcccccceeeecceEEEEEeccCCCceEEEEECCCCCCCcccccC----------ccccccCCCccEEEEeccCCCCCCC
Q psy1644 218 GSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPIS----------ADSAIMNPASKVIALKGKAGNDNNP 287 (923)
Q Consensus 218 ~si~F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~----------ADSAIMnP~~~IIALra~~~~~n~~ 287 (923)
.++.|. .+.+++.+- ......|.++|+.......+.+.. .-+...+|..+.+.+-..
T Consensus 160 ~~~~~s----~dg~~l~~~--~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~------- 226 (300)
T TIGR03866 160 RFAEFT----ADGKELWVS--SEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALG------- 226 (300)
T ss_pred cEEEEC----CCCCEEEEE--cCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcC-------
Confidence 332221 123444442 222357999999876443332211 112346888776544310
Q ss_pred CCCCeEEEEecccccccccccCCCceeEEEeccCCeEEEEe---cCeEEEEeccC
Q psy1644 288 NAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISPNTLALVT---ETSVYHWSMEG 339 (923)
Q Consensus 288 ~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~~tLglVT---~taVYHWsi~g 339 (923)
....+.+||+++.+.++.......+.-=.|-.+...-+++ +..|..|++++
T Consensus 227 -~~~~i~v~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~ 280 (300)
T TIGR03866 227 -PANRVAVVDAKTYEVLDYLLVGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAA 280 (300)
T ss_pred -CCCeEEEEECCCCcEEEEEEeCCCcceEEECCCCCEEEEEcCCCCeEEEEECCC
Confidence 2346999999876655543222223222344444433344 35688888764
No 12
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=95.64 E-value=0.31 Score=55.05 Aligned_cols=268 Identities=15% Similarity=0.277 Sum_probs=130.3
Q ss_pred CCceEEEEEEecCCccEEEEEecCCCCCCCCCcccceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEEEEecccc
Q psy1644 15 EPSTLFCFAVRTPAGGKLHIIEVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYDIETA 94 (923)
Q Consensus 15 ~~stL~cfa~r~~~~gkLhiiEig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~Dlet~ 94 (923)
++.+||..+.| ..|++.||+..+... .+.++.. .+.+..+-.|.--..+|+.+..|.|.++|+.|+
T Consensus 3 ~~~~l~~V~~~--~~~~v~viD~~t~~~------~~~i~~~------~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~ 68 (369)
T PF02239_consen 3 DTGNLFYVVER--GSGSVAVIDGATNKV------VARIPTG------GAPHAGLKFSPDGRYLYVANRDGTVSVIDLATG 68 (369)
T ss_dssp -GGGEEEEEEG--GGTEEEEEETTT-SE------EEEEE-S------TTEEEEEE-TT-SSEEEEEETTSEEEEEETTSS
T ss_pred CCccEEEEEec--CCCEEEEEECCCCeE------EEEEcCC------CCceeEEEecCCCCEEEEEcCCCeEEEEECCcc
Confidence 34566666655 457899998765210 0111110 122444445554557999999999999999999
Q ss_pred eeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeecccceeEEEecccccceEEEEecCCCCCCCccCcc
Q psy1644 95 TCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPIS 174 (923)
Q Consensus 95 t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~~~~~~~r~~~~ 174 (923)
..+-+=+. |.++.+|.++ ..+||+-|--.. ..+|+|+|...- .+++ -|+
T Consensus 69 ~~v~~i~~-----------------------G~~~~~i~~s----~DG~~~~v~n~~--~~~v~v~D~~tl-e~v~-~I~ 117 (369)
T PF02239_consen 69 KVVATIKV-----------------------GGNPRGIAVS----PDGKYVYVANYE--PGTVSVIDAETL-EPVK-TIP 117 (369)
T ss_dssp SEEEEEE------------------------SSEEEEEEE------TTTEEEEEEEE--TTEEEEEETTT---EEE-EEE
T ss_pred cEEEEEec-----------------------CCCcceEEEc----CCCCEEEEEecC--CCceeEeccccc-ccee-ecc
Confidence 85543222 2234444443 223555443322 257999998663 3333 222
Q ss_pred ccccc-cCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCcccccceeeecceEE-EEEeccCCCceEEEEECCC
Q psy1644 175 ADSAI-MNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVSFNTLTMESDKFI-CVREKIAESAQVVIIDMND 252 (923)
Q Consensus 175 a~~~i-m~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~F~~lTmESDkyI-CVRE~~~~~~qVvIVDL~n 252 (923)
..+.- ..|.+|+-|+..+... +.|| +++| ..+|.+||..+
T Consensus 118 ~~~~~~~~~~~Rv~aIv~s~~~----------------------------------~~fVv~lkd----~~~I~vVdy~d 159 (369)
T PF02239_consen 118 TGGMPVDGPESRVAAIVASPGR----------------------------------PEFVVNLKD----TGEIWVVDYSD 159 (369)
T ss_dssp --EE-TTTS---EEEEEE-SSS----------------------------------SEEEEEETT----TTEEEEEETTT
T ss_pred cccccccccCCCceeEEecCCC----------------------------------CEEEEEEcc----CCeEEEEEecc
Confidence 22211 1156666666554431 2243 3443 34999999999
Q ss_pred CCCCcccccCcc----ccccCCCcc--EEEEeccCCCCCCCCCCCeEEEEecccccccccccC--------------CCc
Q psy1644 253 PTNPIRRPISAD----SAIMNPASK--VIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPM--------------TDD 312 (923)
Q Consensus 253 ~~~v~Rrpi~AD----SAIMnP~~~--IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m--------------~E~ 312 (923)
+.+..-+-++++ -+.++|+.+ ++|+. +...+-++|.+++........ ++.
T Consensus 160 ~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~----------~sn~i~viD~~~~k~v~~i~~g~~p~~~~~~~~php~~ 229 (369)
T PF02239_consen 160 PKNLKVTTIKVGRFPHDGGFDPDGRYFLVAAN----------GSNKIAVIDTKTGKLVALIDTGKKPHPGPGANFPHPGF 229 (369)
T ss_dssp SSCEEEEEEE--TTEEEEEE-TTSSEEEEEEG----------GGTEEEEEETTTTEEEEEEE-SSSBEETTEEEEEETTT
T ss_pred ccccceeeecccccccccccCcccceeeeccc----------ccceeEEEeeccceEEEEeeccccccccccccccCCCc
Confidence 866554555554 378889865 34444 234677777766544432211 111
Q ss_pred eeEEEecc--CCeEEEEecCeEEEEecc---------CCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeeccccc
Q psy1644 313 VVFWKWIS--PNTLALVTETSVYHWSME---------GDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNR 381 (923)
Q Consensus 313 VvfWkWis--~~tLglVT~taVYHWsi~---------g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~ 381 (923)
=.+|.|-- ...+.++....|.-|+.. ....| +| -++.++.+|+.+--....+
T Consensus 230 g~vw~~~~~~~~~~~~ig~~~v~v~d~~~wkvv~~I~~~G~g--lF-------------i~thP~s~~vwvd~~~~~~-- 292 (369)
T PF02239_consen 230 GPVWATSGLGYFAIPLIGTDPVSVHDDYAWKVVKTIPTQGGG--LF-------------IKTHPDSRYVWVDTFLNPD-- 292 (369)
T ss_dssp EEEEEEEBSSSSEEEEEE--TTT-STTTBTSEEEEEE-SSSS-----------------EE--TT-SEEEEE-TT-SS--
T ss_pred ceEEeeccccceecccccCCccccchhhcCeEEEEEECCCCc--ce-------------eecCCCCccEEeeccCCCC--
Confidence 12355442 333455555555433321 11222 33 3567888998775222222
Q ss_pred ccceeeeccccc
Q psy1644 382 VVGAMQLYSVER 393 (923)
Q Consensus 382 i~G~mQLYS~er 393 (923)
.+.+|++.++.
T Consensus 293 -~~~v~viD~~t 303 (369)
T PF02239_consen 293 -ADTVQVIDKKT 303 (369)
T ss_dssp -HT-EEEEECCG
T ss_pred -CceEEEEECcC
Confidence 77888888765
No 13
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=95.07 E-value=0.0033 Score=59.87 Aligned_cols=126 Identities=19% Similarity=0.292 Sum_probs=91.5
Q ss_pred ccccchhhHHHHH-hhcchhHHHHHHHHHHHHhhc-ccchhhhHHHHhHhcCccceeeeeeeeccccccEEEEEeccceE
Q psy1644 669 KQSRFVHDLVLYL-YRNNLQKYIEIYVQKIRIKIK-QSRFVHDLVLYLYRNNLQKYIEIYVQKVSAKYDVVYLITKFGYI 746 (923)
Q Consensus 669 ~~~~pv~~Lv~ev-~RnrL~kll~p~LE~~~~~g~-q~~~v~nalakiy~~~~~~~~~i~~~qvS~KyGvIyvITK~Gfl 746 (923)
...++++++|+.+ +||+. ++|.+|||+.+.+++ .++.+||+|+.+| ++.
T Consensus 5 ~~~~~~~~vi~~~~~~~~~-~~l~~yLe~~~~~~~~~~~~~~~~L~~ly-----------~~~----------------- 55 (143)
T PF00637_consen 5 TDPLEISEVISAFEERNQP-EELIEYLEALVKENKENNPDLHTLLLELY-----------IKY----------------- 55 (143)
T ss_dssp CTTSCSCCCHHHCTTTT-G-GGCTCCHHHHHHTSTC-SHHHHHHHHHHH-----------HCT-----------------
T ss_pred CCccCHHHHHHHHHhCCCH-HHHHHHHHHHHhcccccCHHHHHHHHHHH-----------Hhc-----------------
Confidence 5677889999999 68999 999999999998875 4599999999876 111
Q ss_pred EEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecC-CcccccccccCC-CHHHHHHHHHHcCCCCc
Q psy1644 747 HVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDE-DNIIPYINGTLQ-NPDLALRIAVRNNLAGA 824 (923)
Q Consensus 747 hlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide-~~iVpYI~~~L~-n~~LA~kLA~R~~LPGA 824 (923)
++ +.+..|+.+.-+ |++-|++++.+.|+
T Consensus 56 -----------------------------------------------~~~~~l~~~L~~~~~yd~~~~~~~c~~~~l--- 85 (143)
T PF00637_consen 56 -----------------------------------------------DPYEKLLEFLKTSNNYDLDKALRLCEKHGL--- 85 (143)
T ss_dssp -----------------------------------------------TTCCHHHHTTTSSSSS-CTHHHHHHHTTTS---
T ss_pred -----------------------------------------------CCchHHHHHcccccccCHHHHHHHHHhcch---
Confidence 66 789999886433 66778888888774
Q ss_pred hHHHHHHHHHHHhcCcHHHHHHHHHhcCChhHHHHHHHhcccCChHHHHhHHhh
Q psy1644 825 EDLFVRKFNLLFTNGQYSEAAKAACKTGQIKEVERICRESNCYNAERVKNFLKE 878 (923)
Q Consensus 825 ddLf~~qFn~l~~~G~y~eAAk~A~ktg~i~e~eri~r~~~~y~~~~vk~~l~~ 878 (923)
+....--+-..|++.+|.+.+.+.+.+++....|++++ +|+..+..++.
T Consensus 86 ---~~~a~~Ly~~~~~~~~al~i~~~~~~~~~a~e~~~~~~--~~~l~~~l~~~ 134 (143)
T PF00637_consen 86 ---YEEAVYLYSKLGNHDEALEILHKLKDYEEAIEYAKKVD--DPELWEQLLKY 134 (143)
T ss_dssp ---HHHHHHHHHCCTTHTTCSSTSSSTHCSCCCTTTGGGCS--SSHHHHHHHHH
T ss_pred ---HHHHHHHHHHcccHHHHHHHHHHHccHHHHHHHHHhcC--cHHHHHHHHHH
Confidence 33344445568888888887446677777777777776 67666655543
No 14
>PF01394 Clathrin_propel: Clathrin propeller repeat; InterPro: IPR022365 This entry represents the propeller repeat found in clathrin heavy chains. The N terminus of the heavy chain is known as the globular domain, and is composed of seven repeats which form a beta propeller []. Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. More information about these proteins can be found at Protein of the Month: Clathrin [].; PDB: 1UTC_B 3GD1_I 3GC3_B 1BPO_C 1C9I_B 1C9L_A 2XZG_A 2XZH_A.
Probab=94.70 E-value=0.042 Score=43.31 Aligned_cols=37 Identities=32% Similarity=0.311 Sum_probs=31.9
Q ss_pred CcceeeeeeeceeecccceeEEEecccccceEEEEecC
Q psy1644 126 GINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMN 163 (923)
Q Consensus 126 G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~ 163 (923)
|.+.+.|.|.+.+|||++|+|.++ .....+|.++||+
T Consensus 1 ~~~~q~i~f~~~~~es~~~iigi~-~~~~G~v~l~sve 37 (37)
T PF01394_consen 1 GANSQIINFRTAPMESWKFIIGIS-GRVKGQVQLYSVE 37 (37)
T ss_dssp TCCCEEEECTTECCCEEEEEEEEE-TCTEEEEEEEETT
T ss_pred CCcccEEEEEeCCCcCeEEEEEEE-eeeeeEEEEEECC
Confidence 456889999999999999999999 3355899999985
No 15
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=94.55 E-value=4 Score=41.57 Aligned_cols=90 Identities=10% Similarity=0.163 Sum_probs=54.5
Q ss_pred ceEEEEECCCCCCCcccc--cCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEecc
Q psy1644 243 AQVVIIDMNDPTNPIRRP--ISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWIS 320 (923)
Q Consensus 243 ~qVvIVDL~n~~~v~Rrp--i~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis 320 (923)
.++.++|+.....+.+.+ ....+..++|..+.++.-+. .+..+.+|+.++...+......+....=.|-.
T Consensus 95 ~~l~~~d~~~~~~~~~~~~~~~~~~~~~~~dg~~l~~~~~--------~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~s~ 166 (300)
T TIGR03866 95 NLVTVIDIETRKVLAEIPVGVEPEGMAVSPDGKIVVNTSE--------TTNMAHFIDTKTYEIVDNVLVDQRPRFAEFTA 166 (300)
T ss_pred CeEEEEECCCCeEEeEeeCCCCcceEEECCCCCEEEEEec--------CCCeEEEEeCCCCeEEEEEEcCCCccEEEECC
Confidence 478888887654333222 23456778999988887521 24567788988766655443333344445554
Q ss_pred CCeEEEEe---cCeEEEEeccCC
Q psy1644 321 PNTLALVT---ETSVYHWSMEGD 340 (923)
Q Consensus 321 ~~tLglVT---~taVYHWsi~g~ 340 (923)
+...-+++ +..|+.|++++.
T Consensus 167 dg~~l~~~~~~~~~v~i~d~~~~ 189 (300)
T TIGR03866 167 DGKELWVSSEIGGTVSVIDVATR 189 (300)
T ss_pred CCCEEEEEcCCCCEEEEEEcCcc
Confidence 44443343 568999999653
No 16
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=92.01 E-value=0.0094 Score=56.79 Aligned_cols=63 Identities=16% Similarity=0.278 Sum_probs=51.9
Q ss_pred chhhhhhhccccc--ccCCcchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhccCCC
Q psy1644 528 YEGLFYFLGSIVN--YSQDQEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKLTD 590 (923)
Q Consensus 528 ~egLyy~Lgsivn--~s~d~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKeakL~d 590 (923)
.+-|+-||-..+. ...++++|-.+++..++.+..++++++.+.++.||++++...+++.++.+
T Consensus 23 ~~~l~~yLe~~~~~~~~~~~~~~~~L~~ly~~~~~~~~l~~~L~~~~~yd~~~~~~~c~~~~l~~ 87 (143)
T PF00637_consen 23 PEELIEYLEALVKENKENNPDLHTLLLELYIKYDPYEKLLEFLKTSNNYDLDKALRLCEKHGLYE 87 (143)
T ss_dssp GGGCTCCHHHHHHTSTC-SHHHHHHHHHHHHCTTTCCHHHHTTTSSSSS-CTHHHHHHHTTTSHH
T ss_pred HHHHHHHHHHHHhcccccCHHHHHHHHHHHHhcCCchHHHHHcccccccCHHHHHHHHHhcchHH
Confidence 3566777877774 34678999999999999999999999999999999999999998887643
No 17
>KOG0279|consensus
Probab=90.88 E-value=5.5 Score=44.54 Aligned_cols=176 Identities=18% Similarity=0.269 Sum_probs=106.7
Q ss_pred EeeeeeEEEEecccceeeee----------cccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeecccceeEEEe
Q psy1644 80 ITKFGYIHVYDIETATCIYM----------NRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDKFICVRE 149 (923)
Q Consensus 80 iTK~G~lhl~Dlet~t~iy~----------~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic~~~ 149 (923)
-.-.|.+.+||+++|++..+ --||+|+-=+++.+ --.-|..-|.+| ....+.... ||+|=--|||=
T Consensus 81 ~swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qivSGS-rDkTiklwnt~g--~ck~t~~~~-~~~~WVscvrf 156 (315)
T KOG0279|consen 81 ASWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQIVSGS-RDKTIKLWNTLG--VCKYTIHED-SHREWVSCVRF 156 (315)
T ss_pred ccccceEEEEEecCCcEEEEEEecCCceEEEEecCCCceeecCC-Ccceeeeeeecc--cEEEEEecC-CCcCcEEEEEE
Confidence 34578999999999866432 23555443333212 224577778888 455555443 55666778886
Q ss_pred cccccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCccc----ccce
Q psy1644 150 KIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVS----FNTL 225 (923)
Q Consensus 150 ~~~~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~----F~~l 225 (923)
..+. .+.+|+. -++|-.. | +-+|.++-+....++ -+++
T Consensus 157 sP~~-~~p~Ivs-----------~s~Dktv-----K---------------------vWnl~~~~l~~~~~gh~~~v~t~ 198 (315)
T KOG0279|consen 157 SPNE-SNPIIVS-----------ASWDKTV-----K---------------------VWNLRNCQLRTTFIGHSGYVNTV 198 (315)
T ss_pred cCCC-CCcEEEE-----------ccCCceE-----E---------------------EEccCCcchhhccccccccEEEE
Confidence 5221 2222221 1222222 1 223333333333333 4688
Q ss_pred eeecceEEEEEeccCCCceEEEEECCCCCCCcccccC--ccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccc
Q psy1644 226 TMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPIS--ADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSK 303 (923)
Q Consensus 226 TmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~--ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~K 303 (923)
|..-|--+|.- .|...+...+||+...+.--.+=. --|-..-|++-+|+.- .+..+.|||+|+|..
T Consensus 199 ~vSpDGslcas--Ggkdg~~~LwdL~~~k~lysl~a~~~v~sl~fspnrywL~~a----------t~~sIkIwdl~~~~~ 266 (315)
T KOG0279|consen 199 TVSPDGSLCAS--GGKDGEAMLWDLNEGKNLYSLEAFDIVNSLCFSPNRYWLCAA----------TATSIKIWDLESKAV 266 (315)
T ss_pred EECCCCCEEec--CCCCceEEEEEccCCceeEeccCCCeEeeEEecCCceeEeec----------cCCceEEEeccchhh
Confidence 99999999987 556669999999987553222111 1145567988888875 678899999999999
Q ss_pred cccccC
Q psy1644 304 MKAHPM 309 (923)
Q Consensus 304 LKs~~m 309 (923)
+.+...
T Consensus 267 v~~l~~ 272 (315)
T KOG0279|consen 267 VEELKL 272 (315)
T ss_pred hhhccc
Confidence 988755
No 18
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=90.13 E-value=42 Score=38.84 Aligned_cols=150 Identities=19% Similarity=0.226 Sum_probs=87.9
Q ss_pred cCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCC--CceeEEEeccCCeEEEEecCeEEEEeccCCCCchh
Q psy1644 268 MNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMT--DDVVFWKWISPNTLALVTETSVYHWSMEGDSTPEK 345 (923)
Q Consensus 268 MnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~--E~VvfWkWis~~tLglVT~taVYHWsi~g~s~P~K 345 (923)
..|+.+.|||=. .-.++.|.+.+.++++-.+..+ +...-..|.-++.+++--++.|+-..-.|++
T Consensus 224 vSpng~~iAl~t---------~~g~l~v~ssDf~~~~~e~~~~~~~~p~~~~WCG~dav~l~~~~~l~lvg~~~~~---- 290 (410)
T PF04841_consen 224 VSPNGKFIALFT---------DSGNLWVVSSDFSEKLCEFDTDSKSPPKQMAWCGNDAVVLSWEDELLLVGPDGDS---- 290 (410)
T ss_pred ECCCCCEEEEEE---------CCCCEEEEECcccceeEEeecCcCCCCcEEEEECCCcEEEEeCCEEEEECCCCCc----
Confidence 568889999982 2246778888999999888887 5667888999988887766666655432221
Q ss_pred hhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccccccccccccceeEEeccCCCccc-cC-HHHHHh
Q psy1644 346 MFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVERKCSQPIEEKKSRVAANAPKGIL-RT-PQTIVK 423 (923)
Q Consensus 346 ~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~er~~SQ~iEGk~AkvAA~SP~giL-Rt-~eTI~r 423 (923)
+.|-.+.+ +.+.+ -++| +||--+.--.+| |- ..|-+-
T Consensus 291 -------------~~~~~~~~------~~l~~--------------------E~DG--~riit~~~~~~l~~Vp~~~~~i 329 (410)
T PF04841_consen 291 -------------ISFWYDGP------VILVS--------------------EIDG--VRIITSTSHEFLQRVPDSTENI 329 (410)
T ss_pred -------------eEEeccCc------eEEec--------------------cCCc--eEEEeCCceEEEEECCHHHHHH
Confidence 11111111 01111 1222 111111111111 11 234455
Q ss_pred hhcCCCCCCCCchHHHHHHHHHhcCCCCcccccccchhhHHH-HHHHHHHhc
Q psy1644 424 FQQVPTPAGQTSPLLQYFGILLDHGKLNKHESLELCRPVLAQ-VIQSFAETG 474 (923)
Q Consensus 424 FK~vp~~pGq~sPLL~YF~~LL~~g~LN~~ESlEL~rpVL~Q-Vi~clae~G 474 (923)
|+=-...| ..+|..-...++++.--++|.|..+|.-|.+ |-.|....|
T Consensus 330 f~igs~~p---~a~L~~a~~~~~~~~~~a~e~lr~i~~~L~~Av~~Ci~AA~ 378 (410)
T PF04841_consen 330 FRIGSTSP---GAILLDAYEEFEKRSPKADEYLRSIKDQLEEAVDDCIDAAG 378 (410)
T ss_pred hcccCCCc---cHHHHHHHHHHHcCChhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 53223333 4667777777788888899999999998887 556665544
No 19
>KOG1645|consensus
Probab=88.25 E-value=10 Score=44.29 Aligned_cols=211 Identities=18% Similarity=0.233 Sum_probs=122.1
Q ss_pred cccceEEEEeeeeeEEEEecccceeeeecccCCCcEEEeecc-CCCcceEEEecCCcceeeeeeeceeecccceeEEEec
Q psy1644 72 AKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPH-EATGGIIGLTNVGINVGSVSFNTLTMESDKFICVREK 150 (923)
Q Consensus 72 ~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~if~ta~~-~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic~~~~ 150 (923)
..+|++-+..---+|.|+|+||..++-+--.- +.+.-+|-+ .+.+=|++=...
T Consensus 204 ~~~GLl~~asl~nkiki~dlet~~~vssy~a~-~~~wSC~wDlde~h~IYaGl~n------------------------- 257 (463)
T KOG1645|consen 204 FNEGLLGLASLGNKIKIMDLETSCVVSSYIAY-NQIWSCCWDLDERHVIYAGLQN------------------------- 257 (463)
T ss_pred cccceeeeeccCceEEEEecccceeeeheecc-CCceeeeeccCCcceeEEeccC-------------------------
Confidence 46788888888889999999999887664333 555555322 223333333333
Q ss_pred ccccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCcccccceeeecc
Q psy1644 151 IAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVSFNTLTMESD 230 (923)
Q Consensus 151 ~~~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~F~~lTmESD 230 (923)
.-|.|+||..|.+|+--= ++.+-.||--+|=-+-+. .--+-+-+. +.+|+.+ .|=...|.+|
T Consensus 258 ----G~VlvyD~R~~~~~~~e~--~a~~t~~pv~~i~~~~~n---~~f~~gglL--v~~lt~l-------~f~ei~~s~~ 319 (463)
T KOG1645|consen 258 ----GMVLVYDMRQPEGPLMEL--VANVTINPVHKIAPVQPN---KIFTSGGLL--VFALTVL-------QFYEIVFSAE 319 (463)
T ss_pred ----ceEEEEEccCCCchHhhh--hhhhccCcceeecccCcc---ccccccceE--Eeeehhh-------hhhhhhcccc
Confidence 349999999998886521 222333655554444332 222222222 2233322 4555666677
Q ss_pred eEEEEEeccCCC-----------ceEEEEECCCCCCCccc---------------------------ccCccccccCCCc
Q psy1644 231 KFICVREKIAES-----------AQVVIIDMNDPTNPIRR---------------------------PISADSAIMNPAS 272 (923)
Q Consensus 231 kyICVRE~~~~~-----------~qVvIVDL~n~~~v~Rr---------------------------pi~ADSAIMnP~~ 272 (923)
.+.||-|..+.+ +.+++===.|||.|.-| +...+-||..|..
T Consensus 320 ~~p~vlele~pG~cismqy~~~snh~l~tyRs~pn~p~~r~il~~~d~~dG~pVc~~r~~~~Gs~~~kl~t~~ai~~~~~ 399 (463)
T KOG1645|consen 320 CLPCVLELEPPGICISMQYHGVSNHLLLTYRSNPNFPQSRFILGRIDFRDGFPVCGKRRTYFGSKQTKLSTTQAIRAVED 399 (463)
T ss_pred CCCcccccCCCcceeeeeecCccceEEEEecCCCCCccceeeeeeeccccCceeeeecccccCCcccccccccceecccc
Confidence 777766655322 23333222233332211 1233455655543
Q ss_pred -cEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccC---CeEEEEecCeEEE
Q psy1644 273 -KVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISP---NTLALVTETSVYH 334 (923)
Q Consensus 273 -~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~---~tLglVT~taVYH 334 (923)
++|..-|. +.+.|-++|.-+-+.+....++|+|.=-+-.+. ..||+.|+.-|-.
T Consensus 400 nn~iv~~gd--------~tn~lil~D~~s~evvQ~l~~~epv~Dicp~~~n~~syLa~LTd~~v~I 457 (463)
T KOG1645|consen 400 NNYIVVVGD--------STNELILQDPHSFEVVQTLALSEPVLDICPNDTNGSSYLALLTDDRVHI 457 (463)
T ss_pred ccEEEEecC--------CcceeEEeccchhheeeecccCcceeecceeecCCcchhhheecceEEE
Confidence 33333321 346889999999999999999998876666554 4799999996543
No 20
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=87.82 E-value=50 Score=36.74 Aligned_cols=224 Identities=17% Similarity=0.364 Sum_probs=112.1
Q ss_pred CcceeEEeccccceEEEEee----eeeEEEEeccc--ceeeeecccC--------------CCcEEEeeccCCCcceEEE
Q psy1644 63 DFPVAMQVSAKYDVVYLITK----FGYIHVYDIET--ATCIYMNRIS--------------SDTIFVTAPHEATGGIIGL 122 (923)
Q Consensus 63 DFPvamqiS~k~G~iyliTK----~G~lhl~Dlet--~t~iy~~rIS--------------~~~if~ta~~~~t~Gil~v 122 (923)
+=|-.|.++...+++|..+. .|.|..|++.. |+.-..+++. +..+|++.....+-.++.+
T Consensus 37 ~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~~g~~l~vany~~g~v~v~~l 116 (345)
T PF10282_consen 37 ENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDPDGRFLYVANYGGGSVSVFPL 116 (345)
T ss_dssp SSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECTTSSEEEEEETTTTEEEEEEE
T ss_pred CCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEEEEecCCCEEEEEEccCCeEEEEEc
Confidence 33445667888999999988 47999999886 5554444443 2344554444444444555
Q ss_pred ecCCcceeeee--ee-----------------ceeeccc-ceeEEEecccccceEEEEecCCCCCCCcc----Cccccc-
Q psy1644 123 TNVGINVGSVS--FN-----------------TLTMESD-KFICVREKIAESAQVVIIDMNDPTNPIRR----PISADS- 177 (923)
Q Consensus 123 n~~G~~~~si~--~~-----------------t~tm~sd-k~ic~~~~~~~~~~vviid~~~~~~~~r~----~~~a~~- 177 (923)
+..| .+.... +. .+.+.-| +|+-|-+...| .|.+.|++.....+.+ .+.+.+
T Consensus 117 ~~~g-~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~D--~v~~~~~~~~~~~l~~~~~~~~~~G~G 193 (345)
T PF10282_consen 117 DDDG-SLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGAD--RVYVYDIDDDTGKLTPVDSIKVPPGSG 193 (345)
T ss_dssp CTTS-EEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTTT--EEEEEEE-TTS-TEEEEEEEECSTTSS
T ss_pred cCCc-ccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCCC--EEEEEEEeCCCceEEEeeccccccCCC
Confidence 5555 333331 11 1222223 67766665444 5777777665422322 222332
Q ss_pred ---cccCcccchhhhcCCCCCC------CCCCcccceeeeccccc--cccCCcccccceeeec-ceEEEEEeccCCCceE
Q psy1644 178 ---AIMNPASKVIALKGKAGND------NNPNGEFKNRIYLLTNV--GINVGSVSFNTLTMES-DKFICVREKIAESAQV 245 (923)
Q Consensus 178 ---~im~p~~~i~al~~~~~~~------~~~~~~~~~e~~~Lt~l--GI~~~si~F~~lTmES-DkyICVRE~~~~~~qV 245 (923)
+.++|..+.+-+=+...+. +.+.++++ ++-....+ |..... .=..+.+.. .||+=|=. .+.++|
T Consensus 194 PRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~-~~~~~~~~~~~~~~~~-~~~~i~ispdg~~lyvsn--r~~~sI 269 (345)
T PF10282_consen 194 PRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLT-EIQTISTLPEGFTGEN-APAEIAISPDGRFLYVSN--RGSNSI 269 (345)
T ss_dssp EEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEE-EEEEEESCETTSCSSS-SEEEEEE-TTSSEEEEEE--CTTTEE
T ss_pred CcEEEEcCCcCEEEEecCCCCcEEEEeecccCCcee-EEEEeeeccccccccC-CceeEEEecCCCEEEEEe--ccCCEE
Confidence 6678877766554433321 11234443 22222222 111110 122334443 34433321 136799
Q ss_pred EEEECCCCC-C---CcccccCcc---ccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccc
Q psy1644 246 VIIDMNDPT-N---PIRRPISAD---SAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMK 301 (923)
Q Consensus 246 vIVDL~n~~-~---v~Rrpi~AD---SAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K 301 (923)
++++++... . +.+-|...+ .-.++|..+.+++=.. .+..+.+|+++.+
T Consensus 270 ~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~--------~s~~v~vf~~d~~ 324 (345)
T PF10282_consen 270 SVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQ--------DSNTVSVFDIDPD 324 (345)
T ss_dssp EEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEET--------TTTEEEEEEEETT
T ss_pred EEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEec--------CCCeEEEEEEeCC
Confidence 999995432 1 222333233 5567888887777421 3568999998754
No 21
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=87.44 E-value=1.3 Score=48.78 Aligned_cols=57 Identities=16% Similarity=0.223 Sum_probs=47.8
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEE
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSV 791 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V 791 (923)
-|.+|+.+..|.|+++|..||+.+.+..+..++++ +.|--.+++++..+++|.|..+
T Consensus 320 g~~l~~~~~~G~l~~~d~~tG~~~~~~~~~~~~~~-~sp~~~~~~l~v~~~dG~l~~~ 376 (377)
T TIGR03300 320 GGYLVVGDFEGYLHWLSREDGSFVARLKTDGSGIA-SPPVVVGDGLLVQTRDGDLYAF 376 (377)
T ss_pred CCEEEEEeCCCEEEEEECCCCCEEEEEEcCCCccc-cCCEEECCEEEEEeCCceEEEe
Confidence 36999999999999999999999999888776655 4454556789999999999865
No 22
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=87.41 E-value=1.4 Score=49.20 Aligned_cols=58 Identities=17% Similarity=0.305 Sum_probs=48.3
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEE
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVS 792 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~ 792 (923)
-|.||+.+..|+||.+|..||+.+.+.++....++ +.|--..+.++..+++|.|+.+.
T Consensus 335 ~g~l~v~~~~G~l~~ld~~tG~~~~~~~~~~~~~~-s~P~~~~~~l~v~t~~G~l~~~~ 392 (394)
T PRK11138 335 NGYLVVGDSEGYLHWINREDGRFVAQQKVDSSGFL-SEPVVADDKLLIQARDGTVYAIT 392 (394)
T ss_pred CCEEEEEeCCCEEEEEECCCCCEEEEEEcCCCcce-eCCEEECCEEEEEeCCceEEEEe
Confidence 48899999999999999999999999999766554 44444467888889999998875
No 23
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=86.25 E-value=24 Score=40.49 Aligned_cols=81 Identities=15% Similarity=0.080 Sum_probs=58.2
Q ss_pred ccEEEEEecCCCCCCCCCcccceeeeecCCCccCCcceeEEeccccceEEEEee----------eeeEEEEecccceeee
Q psy1644 29 GGKLHIIEVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITK----------FGYIHVYDIETATCIY 98 (923)
Q Consensus 29 ~gkLhiiEig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK----------~G~lhl~Dlet~t~iy 98 (923)
.|++.+|+..+... ...+++ +-.|-.+ +|+..+.+|+-+. .++|-+||..|+..+-
T Consensus 26 ~~~v~ViD~~~~~v------~g~i~~-------G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~ 91 (352)
T TIGR02658 26 TTQVYTIDGEAGRV------LGMTDG-------GFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIA 91 (352)
T ss_pred CceEEEEECCCCEE------EEEEEc-------cCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEe
Confidence 48999999875211 333444 4577776 9999999999988 6999999999999987
Q ss_pred ecccCCCcEEEeeccCCCcceEEEecCC
Q psy1644 99 MNRISSDTIFVTAPHEATGGIIGLTNVG 126 (923)
Q Consensus 99 ~~rIS~~~if~ta~~~~t~Gil~vn~~G 126 (923)
.=.+-+++=|.+.++ ...+.+...|
T Consensus 92 ~i~~p~~p~~~~~~~---~~~~~ls~dg 116 (352)
T TIGR02658 92 DIELPEGPRFLVGTY---PWMTSLTPDN 116 (352)
T ss_pred EEccCCCchhhccCc---cceEEECCCC
Confidence 766666655554433 3455666666
No 24
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=86.15 E-value=41 Score=34.01 Aligned_cols=70 Identities=11% Similarity=0.148 Sum_probs=46.9
Q ss_pred ccceeeecceEEEEEeccCCCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccc
Q psy1644 222 FNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMK 301 (923)
Q Consensus 222 F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K 301 (923)
..++.+..+ .|.+....+. ++.+|+.+....-++|++.-.+.+-+....+.+-. ....|..||+.+.
T Consensus 166 ~~~~~~~~~-~v~~~~~~g~---~~~~d~~tg~~~w~~~~~~~~~~~~~~~~~l~~~~---------~~~~l~~~d~~tG 232 (238)
T PF13360_consen 166 NGSPVISDG-RVYVSSGDGR---VVAVDLATGEKLWSKPISGIYSLPSVDGGTLYVTS---------SDGRLYALDLKTG 232 (238)
T ss_dssp EEEEECCTT-EEEEECCTSS---EEEEETTTTEEEEEECSS-ECECEECCCTEEEEEE---------TTTEEEEEETTTT
T ss_pred ccceEEECC-EEEEEcCCCe---EEEEECCCCCEEEEecCCCccCCceeeCCEEEEEe---------CCCEEEEEECCCC
Confidence 345566655 7777655442 67779999885567776665555777777777762 2478999999887
Q ss_pred ccc
Q psy1644 302 SKM 304 (923)
Q Consensus 302 ~KL 304 (923)
+.+
T Consensus 233 ~~~ 235 (238)
T PF13360_consen 233 KVV 235 (238)
T ss_dssp EEE
T ss_pred CEE
Confidence 543
No 25
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=85.64 E-value=58 Score=35.31 Aligned_cols=34 Identities=15% Similarity=0.058 Sum_probs=23.9
Q ss_pred CcceeEEeccccceEEEEe-eeeeEEEEecc-ccee
Q psy1644 63 DFPVAMQVSAKYDVVYLIT-KFGYIHVYDIE-TATC 96 (923)
Q Consensus 63 DFPvamqiS~k~G~iyliT-K~G~lhl~Dle-t~t~ 96 (923)
+-|..|.+|+.-..+|+.+ ..+.|.+||+. +|..
T Consensus 35 ~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l 70 (330)
T PRK11028 35 GQVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGAL 70 (330)
T ss_pred CCCccEEECCCCCEEEEEECCCCcEEEEEECCCCce
Confidence 3466777777767888865 36888889886 4543
No 26
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=83.58 E-value=3 Score=48.76 Aligned_cols=133 Identities=19% Similarity=0.342 Sum_probs=69.7
Q ss_pred cccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCccc-ccceeeecceEEEEEeccCCCceEEEE-ECCCC
Q psy1644 176 DSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVS-FNTLTMESDKFICVREKIAESAQVVII-DMNDP 253 (923)
Q Consensus 176 ~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~-F~~lTmESDkyICVRE~~~~~~qVvIV-DL~n~ 253 (923)
-+.--||+.+-+++.| +|++. -.++++......+ =.+..+-++.-..|||+ +.+|.|. ++.+.
T Consensus 36 ~~ls~npngr~v~V~g--------~geY~----iyt~~~~r~k~~G~g~~~vw~~~n~yAv~~~---~~~I~I~kn~~~~ 100 (443)
T PF04053_consen 36 QSLSHNPNGRFVLVCG--------DGEYE----IYTALAWRNKAFGSGLSFVWSSRNRYAVLES---SSTIKIYKNFKNE 100 (443)
T ss_dssp SEEEE-TTSSEEEEEE--------TTEEE----EEETTTTEEEEEEE-SEEEE-TSSEEEEE-T---TS-EEEEETTEE-
T ss_pred eeEEECCCCCEEEEEc--------CCEEE----EEEccCCcccccCceeEEEEecCccEEEEEC---CCeEEEEEcCccc
Confidence 3455699999999855 23332 3344444332222 01122333333667766 3467775 44322
Q ss_pred C-CCcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCC-ceeEEEeccC-CeEEEEecC
Q psy1644 254 T-NPIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTD-DVVFWKWISP-NTLALVTET 330 (923)
Q Consensus 254 ~-~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E-~VvfWkWis~-~tLglVT~t 330 (923)
. ...+-|.++|.- .+ .+++++++ ...|++||.++++-+.+...+. .-|+|. ++ +.+||+|.+
T Consensus 101 ~~k~i~~~~~~~~I--f~-G~LL~~~~----------~~~i~~yDw~~~~~i~~i~v~~vk~V~Ws--~~g~~val~t~~ 165 (443)
T PF04053_consen 101 VVKSIKLPFSVEKI--FG-GNLLGVKS----------SDFICFYDWETGKLIRRIDVSAVKYVIWS--DDGELVALVTKD 165 (443)
T ss_dssp TT-----SS-EEEE--E--SSSEEEEE----------TTEEEEE-TTT--EEEEESS-E-EEEEE---TTSSEEEEE-S-
T ss_pred cceEEcCCcccceE--Ec-CcEEEEEC----------CCCEEEEEhhHcceeeEEecCCCcEEEEE--CCCCEEEEEeCC
Confidence 2 123333344432 22 78999993 3479999999999999999875 445554 44 799999999
Q ss_pred eEEEEecc
Q psy1644 331 SVYHWSME 338 (923)
Q Consensus 331 aVYHWsi~ 338 (923)
++|.++-+
T Consensus 166 ~i~il~~~ 173 (443)
T PF04053_consen 166 SIYILKYN 173 (443)
T ss_dssp SEEEEEE-
T ss_pred eEEEEEec
Confidence 99999763
No 27
>KOG0650|consensus
Probab=83.30 E-value=39 Score=41.43 Aligned_cols=256 Identities=14% Similarity=0.223 Sum_probs=137.9
Q ss_pred ecCCCCCCCCCcccceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEEEEecccceeeeecccCCCcEEEeeccCC
Q psy1644 36 EVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEA 115 (923)
Q Consensus 36 Eig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~if~ta~~~~ 115 (923)
|+.+| .-=.|||....-++= .-.+--.++.+..+=-.|-=-.+.|.|.||.+.||.||+.-.+-+.+-=+.=.-.+
T Consensus 378 kLPsp-~dLrPFPt~~~lvyr---GHtg~Vr~iSvdp~G~wlasGsdDGtvriWEi~TgRcvr~~~~d~~I~~vaw~P~~ 453 (733)
T KOG0650|consen 378 KLPSP-KDLRPFPTRCALVYR---GHTGLVRSISVDPSGEWLASGSDDGTVRIWEIATGRCVRTVQFDSEIRSVAWNPLS 453 (733)
T ss_pred cCCCh-hhcCCCcceeeeeEe---ccCCeEEEEEecCCcceeeecCCCCcEEEEEeecceEEEEEeecceeEEEEecCCC
Confidence 56664 446788876655542 22555667777775444555568999999999999999987776654444422224
Q ss_pred CcceEEEecCCcce--eeeeeeceeecccceeEEEecccccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCC
Q psy1644 116 TGGIIGLTNVGINV--GSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKA 193 (923)
Q Consensus 116 t~Gil~vn~~G~~~--~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~ 193 (923)
..++|+|-.-+ .+ ..-.|++- ..++. ++=.. -+.| |+.++..|.=-|.
T Consensus 454 ~~~vLAvA~~~-~~~ivnp~~G~~-----------~e~~~-t~ell---------~~~~--------~~~~p~~~~~~W~ 503 (733)
T KOG0650|consen 454 DLCVLAVAVGE-CVLIVNPIFGDR-----------LEVGP-TKELL---------ASAP--------NESEPDAAVVTWS 503 (733)
T ss_pred CceeEEEEecC-ceEEeCccccch-----------hhhcc-hhhhh---------hcCC--------CccCCcccceeec
Confidence 45566655444 21 11112211 01100 00000 0111 2222221111111
Q ss_pred CCCCCCCcccceeeecccccc--ccCCcc-cccceeeecc-eEEEEEeccCCCceEEEEECCCCC--CCcccc-cCcccc
Q psy1644 194 GNDNNPNGEFKNRIYLLTNVG--INVGSV-SFNTLTMESD-KFICVREKIAESAQVVIIDMNDPT--NPIRRP-ISADSA 266 (923)
Q Consensus 194 ~~~~~~~~~~~~e~~~Lt~lG--I~~~si-~F~~lTmESD-kyICVRE~~~~~~qVvIVDL~n~~--~v~Rrp-i~ADSA 266 (923)
+-+ ++=.+.| +--... ..+.+|+.++ .|+.+-.-.+++.+|+|=+|.... .|-||- =.--.|
T Consensus 504 ~~~-----------~~e~~~~v~~~I~~~k~i~~vtWHrkGDYlatV~~~~~~~~VliHQLSK~~sQ~PF~kskG~vq~v 572 (733)
T KOG0650|consen 504 RAS-----------LDELEKGVCIVIKHPKSIRQVTWHRKGDYLATVMPDSGNKSVLIHQLSKRKSQSPFRKSKGLVQRV 572 (733)
T ss_pred hhh-----------hhhhccceEEEEecCCccceeeeecCCceEEEeccCCCcceEEEEecccccccCchhhcCCceeEE
Confidence 100 0000112 111111 2456777654 566666666677899999999854 344332 224578
Q ss_pred ccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccC-------CeEEEEecCeEEEE-ecc
Q psy1644 267 IMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISP-------NTLALVTETSVYHW-SME 338 (923)
Q Consensus 267 IMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~-------~tLglVT~taVYHW-si~ 338 (923)
..||++.-+-+- +.+.+.||||-...-.|. =..+=||||- +.|-+-+-..=.-| +++
T Consensus 573 ~FHPs~p~lfVa----------Tq~~vRiYdL~kqelvKk-----L~tg~kwiS~msihp~GDnli~gs~d~k~~WfDld 637 (733)
T KOG0650|consen 573 KFHPSKPYLFVA----------TQRSVRIYDLSKQELVKK-----LLTGSKWISSMSIHPNGDNLILGSYDKKMCWFDLD 637 (733)
T ss_pred EecCCCceEEEE----------eccceEEEehhHHHHHHH-----HhcCCeeeeeeeecCCCCeEEEecCCCeeEEEEcc
Confidence 899998766664 568999999965443333 2456789873 33333333344445 677
Q ss_pred CCCCchhhhhccc
Q psy1644 339 GDSTPEKMFDRHS 351 (923)
Q Consensus 339 g~s~P~K~FdR~~ 351 (923)
=.+.|.|..--|.
T Consensus 638 lsskPyk~lr~H~ 650 (733)
T KOG0650|consen 638 LSSKPYKTLRLHE 650 (733)
T ss_pred cCcchhHHhhhhh
Confidence 7788888775554
No 28
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=83.21 E-value=3.6 Score=43.62 Aligned_cols=43 Identities=16% Similarity=0.354 Sum_probs=34.9
Q ss_pred cCCCccCCcceeEEeccccceEEEEeeeeeEEEEecccceeeeec
Q psy1644 56 FPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYDIETATCIYMN 100 (923)
Q Consensus 56 Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~Dlet~t~iy~~ 100 (923)
+||=..+-=++.|+.+.+| +.++|..|.+|+||+.++.+++.+
T Consensus 6 ~P~i~Lgs~~~~l~~~~~~--Ll~iT~~G~l~vWnl~~~k~~~~~ 48 (219)
T PF07569_consen 6 LPPIVLGSPVSFLECNGSY--LLAITSSGLLYVWNLKKGKAVLPP 48 (219)
T ss_pred cCcEecCCceEEEEeCCCE--EEEEeCCCeEEEEECCCCeeccCC
Confidence 3444445555778877777 999999999999999999999988
No 29
>KOG0292|consensus
Probab=82.97 E-value=1.6e+02 Score=38.17 Aligned_cols=385 Identities=19% Similarity=0.302 Sum_probs=199.0
Q ss_pred ccceEEEEeeeeeEEEEecccceeeeecccCCCc-----------EEEeeccCCCcceE--EEecCCcceeeeeeeceee
Q psy1644 73 KYDVVYLITKFGYIHVYDIETATCIYMNRISSDT-----------IFVTAPHEATGGII--GLTNVGINVGSVSFNTLTM 139 (923)
Q Consensus 73 k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~-----------if~ta~~~~t~Gil--~vn~~G~~~~si~~~t~tm 139 (923)
++.+|---...+.+.|||++.-|++.-=|=.+|- .|+++ ++ +|++ -+.|.- .+.-+.=+.+-.
T Consensus 261 ~q~lIlSnsEDksirVwDm~kRt~v~tfrrendRFW~laahP~lNLfAAg-HD--sGm~VFkleREr-pa~~v~~n~LfY 336 (1202)
T KOG0292|consen 261 HQDLILSNSEDKSIRVWDMTKRTSVQTFRRENDRFWILAAHPELNLFAAG-HD--SGMIVFKLERER-PAYAVNGNGLFY 336 (1202)
T ss_pred ccceeEecCCCccEEEEecccccceeeeeccCCeEEEEEecCCcceeeee-cC--CceEEEEEcccC-ceEEEcCCEEEE
Confidence 6778888888999999999999999877666543 33333 23 3443 233332 233333333333
Q ss_pred cccceeEEEecccccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeecccc--ccccC
Q psy1644 140 ESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTN--VGINV 217 (923)
Q Consensus 140 ~sdk~ic~~~~~~~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~--lGI~~ 217 (923)
--|+||---+-.. +..+.++-+..|..+-.. .-|.--||+....-+=. |..|++|. ++++.+ -|.++
T Consensus 337 vkd~~i~~~d~~t-~~d~~v~~lr~~g~~~~~---~~smsYNpae~~vlics-----~~~n~~y~--L~~ipk~~~~~~~ 405 (1202)
T KOG0292|consen 337 VKDRFIRSYDLRT-QKDTAVASLRRPGTLWQP---PRSLSYNPAENAVLICS-----NLDNGEYE--LVQIPKDSDGVSD 405 (1202)
T ss_pred EccceEEeeeccc-cccceeEeccCCCcccCC---cceeeeccccCeEEEEe-----ccCCCeEE--EEEecCcccccCC
Confidence 3466666555444 334455455555322222 24556699887665433 77788864 777655 35544
Q ss_pred C-----cccccceeeecceEEEEEeccCCCceEEEEECCCCCCCccc---ccCccccccCCCccEEEEeccCCCCCCCCC
Q psy1644 218 G-----SVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRR---PISADSAIMNPASKVIALKGKAGNDNNPNA 289 (923)
Q Consensus 218 ~-----si~F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rr---pi~ADSAIMnP~~~IIALra~~~~~n~~~~ 289 (923)
. +=+-+-+-.-+.+|+.. +++ +.+|+|-||.|. ++|+ |.+.|.-..-=+.+ +-|| .
T Consensus 406 ~~~~~k~tG~~a~fvarNrfavl-~k~--~~~v~ik~l~N~--vtkkl~~~~~~~~IF~ag~g~-lll~----------~ 469 (1202)
T KOG0292|consen 406 GKDVKKGTGEGALFVARNRFAVL-DKS--NEQVVIKNLKNK--VTKKLLLPESTDDIFYAGTGN-LLLR----------S 469 (1202)
T ss_pred chhhhcCCCCceEEEEecceEEE-Eec--CcceEEecccch--hhhcccCcccccceeeccCcc-EEEE----------c
Confidence 1 11122233334444433 332 347888888764 4554 22223222222222 3355 6
Q ss_pred CCeEEEEecccccccccccCCCceeEEEecc---------CCeEEEEecCeEEEEeccCCCCchhhhhcccCCCCCeeEe
Q psy1644 290 PKTLQIFNIEMKSKMKAHPMTDDVVFWKWIS---------PNTLALVTETSVYHWSMEGDSTPEKMFDRHSTLNGCQIIN 360 (923)
Q Consensus 290 g~~LQIFNle~K~KLKs~~m~E~VvfWkWis---------~~tLglVT~taVYHWsi~g~s~P~K~FdR~~~L~~~QIIn 360 (923)
++.+++|+++.|.|+.|... ..|-|--|=+ -++|-|++.+-.+..+|-.. +=-..+--...-|.=
T Consensus 470 ~~~v~lfdvQq~~~~~si~~-s~vkyvvws~dm~~vAll~Kh~i~i~~kkL~l~~sihEt-----iriksgawde~gVfi 543 (1202)
T KOG0292|consen 470 PDSVTLFDVQQKKKVGSIKV-SKVKYVVWSNDMSRVALLSKHTITIADKKLELLCSIHET-----IRIKSGAWDEDGVFI 543 (1202)
T ss_pred CCeEEEEEeecceEEEEEec-CceeEEEEcCccchhhhcccceEEEEecchhheecchhe-----eEeeeceeccCceEE
Confidence 78999999999999999877 3566666654 34444444444333333100 000000001122333
Q ss_pred eeeCCCCCeEEEEe---eecc-c-----ccccceeeecccccc---ccccccc--ceeEEeccCCCccccC--HHHHHhh
Q psy1644 361 YRTDPKQTWLLLIG---ISAV-S-----NRVVGAMQLYSVERK---CSQPIEE--KKSRVAANAPKGILRT--PQTIVKF 424 (923)
Q Consensus 361 Yr~d~~~kW~~LvG---I~~~-~-----~~i~G~mQLYS~er~---~SQ~iEG--k~AkvAA~SP~giLRt--~eTI~rF 424 (923)
|-+..--|.++.-| |-.- + -++.|.+ +|...|. ..-.|+- -+-|.| .+|- +|.+.--
T Consensus 544 YtT~nHikYal~~GD~GIikTLd~~iyitkv~gn~-V~cl~rd~~~~~~~IDptEy~FKlA------Li~k~ydeVl~lI 616 (1202)
T KOG0292|consen 544 YTTLNHIKYALENGDSGIIKTLDKPIYITKVKGNK-VFCLNRDGEIECLTIDPTEYRFKLA------LLNKKYDEVLHLI 616 (1202)
T ss_pred EEehhhhhhhhccCCcceEEecccceEEEEeeCCE-EEEEecCCCeEEEeechHHHHHHHH------HHhhhhHHHHHHH
Confidence 44444444444422 2211 1 1344443 3333321 1111211 001111 1222 3444333
Q ss_pred hcCCCCCCCCchHHHHHHHHHhcCC------CCcccccccchhhHHHHHHHHHHhcChhHHHHHHHHhCCCccHHHHHHH
Q psy1644 425 QQVPTPAGQTSPLLQYFGILLDHGK------LNKHESLELCRPVLAQVIQSFAETGQFQKIVLYAKKVGYSPDYVFLLRN 498 (923)
Q Consensus 425 K~vp~~pGq~sPLL~YF~~LL~~g~------LN~~ESlEL~rpVL~QVi~clae~Gqfdkiv~Y~~~vgytpDy~~LL~~ 498 (923)
|+ ...+|| .++.|. -.+|. .=++|....- --.|-|+.+-++++|++.+-.-=|..|-+.
T Consensus 617 ~n-s~LvGq--aiIaYL---qKkgypeiAL~FVkD~~tRF~---------LaLe~gnle~ale~akkldd~d~w~rLge~ 681 (1202)
T KOG0292|consen 617 KN-SNLVGQ--AIIAYL---QKKGYPEIALHFVKDERTRFE---------LALECGNLEVALEAAKKLDDKDVWERLGEE 681 (1202)
T ss_pred Hh-cCcccH--HHHHHH---HhcCCcceeeeeecCcchhee---------eehhcCCHHHHHHHHHhcCcHHHHHHHHHH
Confidence 33 235565 455564 35662 1222322110 113679999999999998865569999999
Q ss_pred HHhcCCchhhHHHHhhhc
Q psy1644 499 VMRVNPDQGVGFAQMLVQ 516 (923)
Q Consensus 499 i~~~nP~~a~efA~~L~~ 516 (923)
.++.+-.+-+|++-+=++
T Consensus 682 Al~qgn~~IaEm~yQ~~k 699 (1202)
T KOG0292|consen 682 ALRQGNHQIAEMCYQRTK 699 (1202)
T ss_pred HHHhcchHHHHHHHHHhh
Confidence 999988888887655444
No 30
>KOG2315|consensus
Probab=82.92 E-value=1.2e+02 Score=36.93 Aligned_cols=306 Identities=20% Similarity=0.290 Sum_probs=168.4
Q ss_pred ceeeeecCCCCCCceEEEEEEecC-----CccEEEEEecCCCCCCCCCcccceeeeecCCCcc---CCcceeEEeccccc
Q psy1644 4 SFASFKSEGNPEPSTLFCFAVRTP-----AGGKLHIIEVGTPPAGNTPLAKKAVDVFFPPEAA---NDFPVAMQVSAKYD 75 (923)
Q Consensus 4 ~F~sfkl~gN~~~stL~cfa~r~~-----~~gkLhiiEig~~~~g~~~f~kK~vd~~Fppe~~---~DFPvamqiS~k~G 75 (923)
.+-+|...+..+||+-+||+..+. +++...|.+..+...--+-++.|+.|+-|+|... .+=|-+++...--+
T Consensus 24 ~~~~f~~~~~~~~~~v~~~S~~G~lfA~~~~~~v~i~~~~~~~~~lt~~~~~~~~L~fSP~g~yL~T~e~~~i~~~~~~~ 103 (566)
T KOG2315|consen 24 AVTVFEQNKTSRPCNVFAYSNNGRLFAYSDNQVVKVFEIATLKVVLCVELKKTYDLLFSPKGNYLLTWEPWAIYGPKNAS 103 (566)
T ss_pred cccccccCCCCCcceeEEEcCCCcEEEEEcCCeEEEEEccCCcEEEEeccceeeeeeecccccccccccccccccCCCCC
Confidence 445666666788899999887665 4457778888664333557778999999999654 44455555443322
Q ss_pred eEEEEeeeeeEEEEecccceeee--eccc-------------------CCCcEEEeeccCCCcceEEEecCCcceeeeee
Q psy1644 76 VVYLITKFGYIHVYDIETATCIY--MNRI-------------------SSDTIFVTAPHEATGGIIGLTNVGINVGSVSF 134 (923)
Q Consensus 76 ~iyliTK~G~lhl~Dlet~t~iy--~~rI-------------------S~~~if~ta~~~~t~Gil~vn~~G~~~~si~~ 134 (923)
.=.+.+|++||+...+ +.|+ +++.-|...-+ ..+=.=-+-..||.-+++|-
T Consensus 104 -------~pn~~v~~vet~~~~s~~q~k~Q~~W~~qfs~dEsl~arlv~nev~f~~~~~-f~~~~~kl~~~~i~~f~lSp 175 (566)
T KOG2315|consen 104 -------NPNVLVYNVETGVQRSQIQKKMQNGWVPQFSIDESLAARLVSNEVQFYDLGS-FKTIQHKLSVSGITMLSLSP 175 (566)
T ss_pred -------CCceeeeeeccceehhheehhhhcCcccccccchhhhhhhhcceEEEEecCC-ccceeeeeeccceeeEEecC
Confidence 3457799999954432 2222 22333333222 11111123344544444444
Q ss_pred eceeecccceeEEE--ecccccceEEEEecC--CCCCCC--ccCccccccccCcccchhhhcCCCCCCCCCCcccceeee
Q psy1644 135 NTLTMESDKFICVR--EKIAESAQVVIIDMN--DPTNPI--RRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIY 208 (923)
Q Consensus 135 ~t~tm~sdk~ic~~--~~~~~~~~vviid~~--~~~~~~--r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~ 208 (923)
+.. --|+||- |+.|--++|-|--.. .+..++ |-=.+||+|-|-=+ +.|. -+|
T Consensus 176 gp~----~~~vAvyvPe~kGaPa~vri~~~~~~~~~~~~a~ksFFkadkvqm~WN---------------~~gt---~LL 233 (566)
T KOG2315|consen 176 GPE----PPFVAVYVPEKKGAPASVRIYKYPEEGQHQPVANKSFFKADKVQMKWN---------------KLGT---ALL 233 (566)
T ss_pred CCC----CceEEEEccCCCCCCcEEEEeccccccccchhhhccccccceeEEEec---------------cCCc---eEE
Confidence 321 3567764 444444777664443 232222 22345666665322 2222 344
Q ss_pred ccccccccC---Ccccc-----------------------cceeeecce--EEEEEeccCCCceEEEEECCCCCCCcccc
Q psy1644 209 LLTNVGINV---GSVSF-----------------------NTLTMESDK--FICVREKIAESAQVVIIDMNDPTNPIRRP 260 (923)
Q Consensus 209 ~Lt~lGI~~---~si~F-----------------------~~lTmESDk--yICVRE~~~~~~qVvIVDL~n~~~v~Rrp 260 (923)
-|.+-.++. +..+= .++|+.++. |..|=-= --+.|.|.||.-. -+.--|
T Consensus 234 vLastdVDktn~SYYGEq~Lyll~t~g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGf--MPAkvtifnlr~~-~v~df~ 310 (566)
T KOG2315|consen 234 VLASTDVDKTNASYYGEQTLYLLATQGESVSVPLLKEGPVHDVTWSPSGREFAVVYGF--MPAKVTIFNLRGK-PVFDFP 310 (566)
T ss_pred EEEEEeecCCCccccccceEEEEEecCceEEEecCCCCCceEEEECCCCCEEEEEEec--ccceEEEEcCCCC-EeEeCC
Confidence 444444443 11111 122333222 2222111 1236777776532 122222
Q ss_pred cCc-cccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccCCeEEEEecCe--------
Q psy1644 261 ISA-DSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISPNTLALVTETS-------- 331 (923)
Q Consensus 261 i~A-DSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~~tLglVT~ta-------- 331 (923)
-.. -.|+.||..+||+|-|= +| -...+-|||+..+.+|.++.- ++-.+..|--+-.-=++.-||
T Consensus 311 egpRN~~~fnp~g~ii~lAGF---GN---L~G~mEvwDv~n~K~i~~~~a-~~tt~~eW~PdGe~flTATTaPRlrvdNg 383 (566)
T KOG2315|consen 311 EGPRNTAFFNPHGNIILLAGF---GN---LPGDMEVWDVPNRKLIAKFKA-ANTTVFEWSPDGEYFLTATTAPRLRVDNG 383 (566)
T ss_pred CCCccceEECCCCCEEEEeec---CC---CCCceEEEeccchhhcccccc-CCceEEEEcCCCcEEEEEeccccEEecCC
Confidence 222 25789999999999862 12 356899999999888888755 455666776555544444443
Q ss_pred EEEEeccCCCCchhhhhc
Q psy1644 332 VYHWSMEGDSTPEKMFDR 349 (923)
Q Consensus 332 VYHWsi~g~s~P~K~FdR 349 (923)
+=.|+..|.---++||.+
T Consensus 384 ~KiwhytG~~l~~~~f~s 401 (566)
T KOG2315|consen 384 IKIWHYTGSLLHEKMFKS 401 (566)
T ss_pred eEEEEecCceeehhhhhH
Confidence 556777888888899987
No 31
>PRK03629 tolB translocation protein TolB; Provisional
Probab=82.18 E-value=66 Score=37.11 Aligned_cols=121 Identities=12% Similarity=0.254 Sum_probs=70.9
Q ss_pred CceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccccccccc-CCCceeEEEe
Q psy1644 242 SAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHP-MTDDVVFWKW 318 (923)
Q Consensus 242 ~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~-m~E~VvfWkW 318 (923)
..++.+.|+..... .+..+-...+...+|..+-||.-.... ....|.++|++++. ++..+ -...+....|
T Consensus 222 ~~~i~i~dl~~G~~~~l~~~~~~~~~~~~SPDG~~La~~~~~~------g~~~I~~~d~~tg~-~~~lt~~~~~~~~~~w 294 (429)
T PRK03629 222 RSALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLAFALSKT------GSLNLYVMDLASGQ-IRQVTDGRSNNTEPTW 294 (429)
T ss_pred CcEEEEEECCCCCeEEccCCCCCcCCeEECCCCCEEEEEEcCC------CCcEEEEEECCCCC-EEEccCCCCCcCceEE
Confidence 45788888765421 111222223467899988888652110 12368888987753 23222 2345677889
Q ss_pred ccCC-eEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEee
Q psy1644 319 ISPN-TLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGI 375 (923)
Q Consensus 319 is~~-tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI 375 (923)
-.+. .|+.+++. .+|.+++++. .+.++.. . +........++|++|++.++-
T Consensus 295 SPDG~~I~f~s~~~g~~~Iy~~d~~~g-~~~~lt~-~----~~~~~~~~~SpDG~~Ia~~~~ 350 (429)
T PRK03629 295 FPDSQNLAYTSDQAGRPQVYKVNINGG-APQRITW-E----GSQNQDADVSSDGKFMVMVSS 350 (429)
T ss_pred CCCCCEEEEEeCCCCCceEEEEECCCC-CeEEeec-C----CCCccCEEECCCCCEEEEEEc
Confidence 8654 58888764 6888888653 2333321 1 122334667899999887763
No 32
>PLN00181 protein SPA1-RELATED; Provisional
Probab=80.65 E-value=82 Score=39.10 Aligned_cols=86 Identities=9% Similarity=0.156 Sum_probs=46.5
Q ss_pred CceEEEEECCCCC-----CCcccccC----ccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccC---
Q psy1644 242 SAQVVIIDMNDPT-----NPIRRPIS----ADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPM--- 309 (923)
Q Consensus 242 ~~qVvIVDL~n~~-----~v~Rrpi~----ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m--- 309 (923)
..+|.++|+..+. .+.+.--. ..+.-++|..+.||-=+ ..+.+.||+......+.++.+
T Consensus 681 D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs---------~D~~v~iw~~~~~~~~~s~~~~~~ 751 (793)
T PLN00181 681 DNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGS---------ETNEVFVYHKAFPMPVLSYKFKTI 751 (793)
T ss_pred CCEEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEe---------CCCEEEEEECCCCCceEEEecccC
Confidence 4478888886432 12221100 12334677777776642 346899999776554444322
Q ss_pred -----------CCceeEEEeccCCeEEEEecC---eEEEEec
Q psy1644 310 -----------TDDVVFWKWISPNTLALVTET---SVYHWSM 337 (923)
Q Consensus 310 -----------~E~VvfWkWis~~tLglVT~t---aVYHWsi 337 (923)
.+.|.-+.|=.+... |||.. .|..|+|
T Consensus 752 ~~~~~~~~~~~~~~V~~v~ws~~~~~-lva~~~dG~I~i~~~ 792 (793)
T PLN00181 752 DPVSGLEVDDASQFISSVCWRGQSST-LVAANSTGNIKILEM 792 (793)
T ss_pred CcccccccCCCCcEEEEEEEcCCCCe-EEEecCCCcEEEEec
Confidence 234677777544333 44433 5777765
No 33
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=79.45 E-value=7.4 Score=39.31 Aligned_cols=55 Identities=22% Similarity=0.417 Sum_probs=38.9
Q ss_pred ccccEEEEEeccceEEEEecCCcceeee-eccc---------------CCceEEeeccCCCCcEEEEc-CCCeEE
Q psy1644 732 AKYDVVYLITKFGYIHVYDIETATCIYM-NRIS---------------SDTIFVTAPHEATGGIIGVN-RKGQVL 789 (923)
Q Consensus 732 ~KyGvIyvITK~GflhlyDleTgt~Iy~-nRIS---------------~dtIFvta~~~~t~Gii~VN-rkGqVL 789 (923)
..-|.+|+.|..|.++.+|.+||..+.+ .+-+ .+.++++.. ++++.++| +.|+++
T Consensus 74 ~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~l~~~d~~tG~~~ 145 (238)
T PF13360_consen 74 VDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTS---SGKLVALDPKTGKLL 145 (238)
T ss_dssp EETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEET---CSEEEEEETTTTEEE
T ss_pred ecccccccccceeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEec---cCcEEEEecCCCcEE
Confidence 3568889999999999999999999999 4433 122333332 56777777 667664
No 34
>KOG0315|consensus
Probab=78.86 E-value=78 Score=35.58 Aligned_cols=206 Identities=16% Similarity=0.270 Sum_probs=119.9
Q ss_pred eEEEEecccceeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeecccceeEEEecccccceEEEEecCC
Q psy1644 85 YIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMND 164 (923)
Q Consensus 85 ~lhl~Dlet~t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~~ 164 (923)
-|.|||+.|+.+ +++..=..+ + + ||-+|+|.-- .++.- +.++...|=|-||..
T Consensus 62 hvRlyD~~S~np--------~Pv~t~e~h----------~-k-NVtaVgF~~d----grWMy---TgseDgt~kIWdlR~ 114 (311)
T KOG0315|consen 62 HVRLYDLNSNNP--------NPVATFEGH----------T-K-NVTAVGFQCD----GRWMY---TGSEDGTVKIWDLRS 114 (311)
T ss_pred eeEEEEccCCCC--------CceeEEecc----------C-C-ceEEEEEeec----CeEEE---ecCCCceEEEEeccC
Confidence 588999999854 222221111 1 2 6888888321 22221 334557788999998
Q ss_pred CC--CCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCcccccceeeecceEEEEEeccCCC
Q psy1644 165 PT--NPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVSFNTLTMESDKFICVREKIAES 242 (923)
Q Consensus 165 ~~--~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~F~~lTmESDkyICVRE~~~~~ 242 (923)
|. -..+-+-...++..||++ + |++. ...+
T Consensus 115 ~~~qR~~~~~spVn~vvlhpnQ----------------t----eLis-----------------------------~dqs 145 (311)
T KOG0315|consen 115 LSCQRNYQHNSPVNTVVLHPNQ----------------T----ELIS-----------------------------GDQS 145 (311)
T ss_pred cccchhccCCCCcceEEecCCc----------------c----eEEe-----------------------------ecCC
Confidence 72 223334455677888887 1 2221 1123
Q ss_pred ceEEEEECCCCCCCcccccCcc-----ccccCCCc-cEEEEeccCCCCCCCCCCCeEEEEecccccccc------cccCC
Q psy1644 243 AQVVIIDMNDPTNPIRRPISAD-----SAIMNPAS-KVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMK------AHPMT 310 (923)
Q Consensus 243 ~qVvIVDL~n~~~v~Rrpi~AD-----SAIMnP~~-~IIALra~~~~~n~~~~g~~LQIFNle~K~KLK------s~~m~ 310 (923)
..|-|.||.+.. -++-+|.-| |+-|+|.. +++|.. +-.+..+|++-.++-+. .+++.
T Consensus 146 g~irvWDl~~~~-c~~~liPe~~~~i~sl~v~~dgsml~a~n----------nkG~cyvW~l~~~~~~s~l~P~~k~~ah 214 (311)
T KOG0315|consen 146 GNIRVWDLGENS-CTHELIPEDDTSIQSLTVMPDGSMLAAAN----------NKGNCYVWRLLNHQTASELEPVHKFQAH 214 (311)
T ss_pred CcEEEEEccCCc-cccccCCCCCcceeeEEEcCCCcEEEEec----------CCccEEEEEccCCCccccceEhhheecc
Confidence 367888887652 345555433 78889995 566664 34589999986643221 13344
Q ss_pred CceeEEEeccCCeEEEEecC---eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccc-----cc
Q psy1644 311 DDVVFWKWISPNTLALVTET---SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSN-----RV 382 (923)
Q Consensus 311 E~VvfWkWis~~tLglVT~t---aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~-----~i 382 (923)
+.-..=.-+|++.==|+|.+ .|+.|+.++- |--.-.|.+.| .|.-=+..|.... .-
T Consensus 215 ~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~------~kle~~l~gh~----------rWvWdc~FS~dg~YlvTass 278 (311)
T KOG0315|consen 215 NGHILRCLLSPDVKYLATCSSDKTVKIWNTDDF------FKLELVLTGHQ----------RWVWDCAFSADGEYLVTASS 278 (311)
T ss_pred cceEEEEEECCCCcEEEeecCCceEEEEecCCc------eeeEEEeecCC----------ceEEeeeeccCccEEEecCC
Confidence 44445556788777788755 7999998753 33333344433 6766666665443 33
Q ss_pred cceeeeccccc
Q psy1644 383 VGAMQLYSVER 393 (923)
Q Consensus 383 ~G~mQLYS~er 393 (923)
.+..-||+++.
T Consensus 279 d~~~rlW~~~~ 289 (311)
T KOG0315|consen 279 DHTARLWDLSA 289 (311)
T ss_pred CCceeeccccc
Confidence 46677888763
No 35
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=77.65 E-value=3.9 Score=43.44 Aligned_cols=38 Identities=16% Similarity=0.381 Sum_probs=30.2
Q ss_pred eeccccccEEEEEeccceEEEEecCCcceeeeecccCCceE
Q psy1644 728 QKVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIF 768 (923)
Q Consensus 728 ~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIF 768 (923)
.+...+| +.+||..|.+|+||+.++++++.+ +|=-+|.
T Consensus 18 l~~~~~~--Ll~iT~~G~l~vWnl~~~k~~~~~-~Si~pll 55 (219)
T PF07569_consen 18 LECNGSY--LLAITSSGLLYVWNLKKGKAVLPP-VSIAPLL 55 (219)
T ss_pred EEeCCCE--EEEEeCCCeEEEEECCCCeeccCC-ccHHHHh
Confidence 4456666 999999999999999999999988 4433333
No 36
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=77.32 E-value=12 Score=45.34 Aligned_cols=80 Identities=14% Similarity=0.183 Sum_probs=52.3
Q ss_pred HHHHHHHhcChhHHHHHHHHhCCCcc---HHHHHHHHHh-cCCchhhHHHHhhhcCCCCCcCccccchhhhhhhcccccc
Q psy1644 466 VIQSFAETGQFQKIVLYAKKVGYSPD---YVFLLRNVMR-VNPDQGVGFAQMLVQDDEPLADINQSYEGLFYFLGSIVNY 541 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~vgytpD---y~~LL~~i~~-~nP~~a~efA~~L~~~~~pl~di~~~~egLyy~Lgsivn~ 541 (923)
+|..|++.|+++++.+..++.++.|| |..||..... -+.+.|.+.+..|...+|.
T Consensus 468 li~~l~r~G~~~eA~~~~~~~~~~p~~~~~~~Ll~a~~~~g~~~~a~~~~~~l~~~~p~--------------------- 526 (697)
T PLN03081 468 MIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPE--------------------- 526 (697)
T ss_pred HHHHHHhcCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHhCCCCC---------------------
Confidence 77888888888888888888888886 6667766532 3666666666666543321
Q ss_pred cCCcchhhHHHHHHHhcCChhhhHHhh
Q psy1644 542 SQDQEVHFKYIQAACKTGQIKEVERIC 568 (923)
Q Consensus 542 s~d~dVhfKYIeAA~k~gq~~EvERic 568 (923)
+...+.-.+..-++.|++.|++++-
T Consensus 527 --~~~~y~~L~~~y~~~G~~~~A~~v~ 551 (697)
T PLN03081 527 --KLNNYVVLLNLYNSSGRQAEAAKVV 551 (697)
T ss_pred --CCcchHHHHHHHHhCCCHHHHHHHH
Confidence 1111222355678899999987653
No 37
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=77.09 E-value=15 Score=43.19 Aligned_cols=144 Identities=17% Similarity=0.231 Sum_probs=74.7
Q ss_pred EEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecCCcccccccccC-CCHHHHHHH
Q psy1644 737 VYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDEDNIIPYINGTL-QNPDLALRI 815 (923)
Q Consensus 737 IyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide~~iVpYI~~~L-~n~~LA~kL 815 (923)
+|+-|-.+ ||..+-.|..-.-..+ +.++++.....+.+-+++++|+|.|.+.++|-+ .+-|=+..+ +|-+-++++
T Consensus 208 ~fiYtT~~--~lkYl~~Ge~~~i~~l-d~~~yllgy~~~~~~ly~~Dr~~~v~~~~ld~~-~~~fk~av~~~d~~~v~~~ 283 (443)
T PF04053_consen 208 CFIYTTSN--HLKYLVNGETGIIAHL-DKPLYLLGYLPKENRLYLIDRDGNVISYELDLS-ELEFKTAVLRGDFEEVLRM 283 (443)
T ss_dssp EEEEE-TT--EEEEEETTEEEEEEE--SS--EEEEEETTTTEEEEE-TT--EEEEE--HH-HHHHHHHHHTT-HHH----
T ss_pred EEEEEcCC--eEEEEEcCCcceEEEc-CCceEEEEEEccCCEEEEEECCCCEEEEEECHH-HHHHHHHHHcCChhhhhhh
Confidence 44445555 5555544444333333 568999888888899999999999999999876 444533322 233443332
Q ss_pred -HHHcCCCCchHHHHHHHHHHHhcCcHHHHHH-----------HHHhcCChhHHHHHHHhcccCChHHHHhHHhhcccCC
Q psy1644 816 -AVRNNLAGAEDLFVRKFNLLFTNGQYSEAAK-----------AACKTGQIKEVERICRESNCYNAERVKNFLKEAKLTD 883 (923)
Q Consensus 816 -A~R~~LPGAddLf~~qFn~l~~~G~y~eAAk-----------~A~ktg~i~e~eri~r~~~~y~~~~vk~~l~~~kl~~ 883 (923)
|..--+|--+.=..+.--.++.+-.|.|-|- +|.+.|++++...++++.+ ++++-|.-=++|=-+-
T Consensus 284 i~~~~ll~~i~~~~~~~i~~fL~~~G~~e~AL~~~~D~~~rFeLAl~lg~L~~A~~~a~~~~--~~~~W~~Lg~~AL~~g 361 (443)
T PF04053_consen 284 IAASNLLPNIPKDQGQSIARFLEKKGYPELALQFVTDPDHRFELALQLGNLDIALEIAKELD--DPEKWKQLGDEALRQG 361 (443)
T ss_dssp -HHHHTGGG--HHHHHHHHHHHHHTT-HHHHHHHSS-HHHHHHHHHHCT-HHHHHHHCCCCS--THHHHHHHHHHHHHTT
T ss_pred hhhhhhcccCChhHHHHHHHHHHHCCCHHHHHhhcCChHHHhHHHHhcCCHHHHHHHHHhcC--cHHHHHHHHHHHHHcC
Confidence 2122233222223555556666655666554 7788888888888888877 5665555444443333
Q ss_pred Ccc
Q psy1644 884 QLP 886 (923)
Q Consensus 884 ~~~ 886 (923)
..+
T Consensus 362 ~~~ 364 (443)
T PF04053_consen 362 NIE 364 (443)
T ss_dssp BHH
T ss_pred CHH
Confidence 333
No 38
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=76.73 E-value=4.2 Score=29.79 Aligned_cols=27 Identities=19% Similarity=0.268 Sum_probs=23.8
Q ss_pred cccEEEEEeccceEEEEecCCcceeee
Q psy1644 733 KYDVVYLITKFGYIHVYDIETATCIYM 759 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleTgt~Iy~ 759 (923)
.-|.+|+-|..|.+|-+|.+||..+..
T Consensus 5 ~~~~v~~~~~~g~l~a~d~~~G~~~W~ 31 (33)
T smart00564 5 SDGTVYVGSTDGTLYALDAKTGEILWT 31 (33)
T ss_pred ECCEEEEEcCCCEEEEEEcccCcEEEE
Confidence 346899999999999999999998764
No 39
>PTZ00420 coronin; Provisional
Probab=74.42 E-value=2.2e+02 Score=34.97 Aligned_cols=110 Identities=12% Similarity=0.176 Sum_probs=63.7
Q ss_pred ccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCc----eeEEE--eccCC-eEEEEecC-----eE
Q psy1644 265 SAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDD----VVFWK--WISPN-TLALVTET-----SV 332 (923)
Q Consensus 265 SAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~----VvfWk--Wis~~-tLglVT~t-----aV 332 (923)
+.-.||..+++|--+ ..+.++|||+.+.+.+.++.-.+. .++|- |-.+. .|+-+..+ .|
T Consensus 172 SlswspdG~lLat~s---------~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~V 242 (568)
T PTZ00420 172 SLKWNIKGNLLSGTC---------VGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREM 242 (568)
T ss_pred EEEECCCCCEEEEEe---------cCCEEEEEECCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEE
Confidence 566789988887642 357899999998877766554443 34553 32333 33333322 59
Q ss_pred EEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 333 YHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 333 YHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
..|++...+.|...++-+. ....++.+-- ++..=++|+| .-+|.+-+|-..
T Consensus 243 kLWDlr~~~~pl~~~~ld~--~~~~L~p~~D-~~tg~l~lsG------kGD~tIr~~e~~ 293 (568)
T PTZ00420 243 KLWDLKNTTSALVTMSIDN--ASAPLIPHYD-ESTGLIYLIG------KGDGNCRYYQHS 293 (568)
T ss_pred EEEECCCCCCceEEEEecC--CccceEEeee-CCCCCEEEEE------ECCCeEEEEEcc
Confidence 9999987667776653332 1223333322 3333334444 345667777654
No 40
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=74.15 E-value=8.8 Score=31.75 Aligned_cols=34 Identities=21% Similarity=0.223 Sum_probs=27.4
Q ss_pred CcchhhHHHHHHHhcCChhhhHHhhhcCCccChH
Q psy1644 544 DQEVHFKYIQAACKTGQIKEVERICRESNCYNAE 577 (923)
Q Consensus 544 d~dVhfKYIeAA~k~gq~~EvERicresn~Ydpe 577 (923)
++++.+.+.++..+.||+.|.+++++.---.+|+
T Consensus 24 ~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~ 57 (68)
T PF14559_consen 24 NPEARLLLAQCYLKQGQYDEAEELLERLLKQDPD 57 (68)
T ss_dssp SHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTT
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 5678888999999999999999999886666665
No 41
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=72.61 E-value=45 Score=40.52 Aligned_cols=158 Identities=9% Similarity=0.030 Sum_probs=82.1
Q ss_pred cchhhHHHHHHHhcCChhhhHHhhhcC---C--cc-----------ChHHHHhHHhhccCCCCCc-------eEEeeccc
Q psy1644 545 QEVHFKYIQAACKTGQIKEVERICRES---N--CY-----------NAERVKNFLKEAKLTDQLP-------LIIVCDRF 601 (923)
Q Consensus 545 ~dVhfKYIeAA~k~gq~~EvERicres---n--~Y-----------dpekvKnfLKeakL~d~~P-------LI~VCDrf 601 (923)
..+.--.|.+-+|.|++++.+++-++- | .| +.++..+.++++.-..-.| ||-.|-+-
T Consensus 360 ~~~~~~Li~~y~k~G~~~~A~~vf~~m~~~d~~t~n~lI~~y~~~G~~~~A~~lf~~M~~~g~~Pd~~T~~~ll~a~~~~ 439 (697)
T PLN03081 360 IVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYS 439 (697)
T ss_pred eeehHHHHHHHHHCCCHHHHHHHHHhCCCCCeeeHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcC
Confidence 334445677777777777777765541 1 11 3566667777665544455 45557777
Q ss_pred cchhHHHHHHHhcCchheehhhhhccCccccceeee-eccccCC-------HHHhhhhhccCcchhhhHHhhhccccccc
Q psy1644 602 DFVHDLVLYLYRNNLQKYIEIYVQKGRKHLLEKWLK-EEKLECS-------EELGDLVKQSDPTLALSVYLRANIKQSRF 673 (923)
Q Consensus 602 ~fV~dLv~YLY~n~~~k~IEvYvqkvNp~~tP~Vvg-LlD~dc~-------E~~Ik~LL~sv~~l~~~~~v~~~~~~~~p 673 (923)
+.+++-..++... .+.| .+.|.. ..-+ |+|+=|. +++++..... +...-
T Consensus 440 g~~~~a~~~f~~m-----~~~~--g~~p~~--~~y~~li~~l~r~G~~~eA~~~~~~~~~~--------------p~~~~ 496 (697)
T PLN03081 440 GLSEQGWEIFQSM-----SENH--RIKPRA--MHYACMIELLGREGLLDEAYAMIRRAPFK--------------PTVNM 496 (697)
T ss_pred CcHHHHHHHHHHH-----HHhc--CCCCCc--cchHhHHHHHHhcCCHHHHHHHHHHCCCC--------------CCHHH
Confidence 7777766665442 1111 222221 1222 4443332 2233322110 11111
Q ss_pred hhhHHHHH-hhcchhHHHHHHHHHHHHhhcccchhhhHHHHhH--hcCccceeeee
Q psy1644 674 VHDLVLYL-YRNNLQKYIEIYVQKIRIKIKQSRFVHDLVLYLY--RNNLQKYIEIY 726 (923)
Q Consensus 674 v~~Lv~ev-~RnrL~kll~p~LE~~~~~g~q~~~v~nalakiy--~~~~~~~~~i~ 726 (923)
..-|..-+ ..+.+ .+-.-..|..+.-+-.+...|+.|+.+| .|+.+.-.++.
T Consensus 497 ~~~Ll~a~~~~g~~-~~a~~~~~~l~~~~p~~~~~y~~L~~~y~~~G~~~~A~~v~ 551 (697)
T PLN03081 497 WAALLTACRIHKNL-ELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVV 551 (697)
T ss_pred HHHHHHHHHHcCCc-HHHHHHHHHHhCCCCCCCcchHHHHHHHHhCCCHHHHHHHH
Confidence 23344444 23444 4444555555555566678999999999 56666555554
No 42
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=72.04 E-value=7.3 Score=30.29 Aligned_cols=30 Identities=23% Similarity=0.350 Sum_probs=27.1
Q ss_pred cEEEEEeccceEEEEecCCcceeeeecccC
Q psy1644 735 DVVYLITKFGYIHVYDIETATCIYMNRISS 764 (923)
Q Consensus 735 GvIyvITK~GflhlyDleTgt~Iy~nRIS~ 764 (923)
|.||+-|..|+|+=+|..||..+.+-+...
T Consensus 1 ~~v~~~~~~g~l~AlD~~TG~~~W~~~~~~ 30 (38)
T PF01011_consen 1 GRVYVGTPDGYLYALDAKTGKVLWKFQTGP 30 (38)
T ss_dssp TEEEEETTTSEEEEEETTTTSEEEEEESSS
T ss_pred CEEEEeCCCCEEEEEECCCCCEEEeeeCCC
Confidence 679999999999999999999999877654
No 43
>PTZ00421 coronin; Provisional
Probab=71.04 E-value=2.4e+02 Score=33.88 Aligned_cols=25 Identities=16% Similarity=0.239 Sum_probs=19.5
Q ss_pred ceEEEEeeeeeEEEEecccceeeee
Q psy1644 75 DVVYLITKFGYIHVYDIETATCIYM 99 (923)
Q Consensus 75 G~iyliTK~G~lhl~Dlet~t~iy~ 99 (923)
+++.--+..|.|.|||++++.++..
T Consensus 139 ~iLaSgs~DgtVrIWDl~tg~~~~~ 163 (493)
T PTZ00421 139 NVLASAGADMVVNVWDVERGKAVEV 163 (493)
T ss_pred CEEEEEeCCCEEEEEECCCCeEEEE
Confidence 3566667789999999999987643
No 44
>KOG1188|consensus
Probab=70.80 E-value=12 Score=42.99 Aligned_cols=126 Identities=17% Similarity=0.233 Sum_probs=75.4
Q ss_pred cEEEEEecCCCCCCCCCcccceeeeecCCCccCCcceeEEecc--ccceEEEEeeeeeEEEEecccceeeeecccCCCc-
Q psy1644 30 GKLHIIEVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSA--KYDVVYLITKFGYIHVYDIETATCIYMNRISSDT- 106 (923)
Q Consensus 30 gkLhiiEig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~--k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~- 106 (923)
|.+||-..++- ++-=.+-.||.+.++ |.+.. .-..|+.-|.-|+||+||+-+...+-+-.-+.+.
T Consensus 50 gsv~lyd~~tg--------~~l~~fk~~~~~~N~----vrf~~~ds~h~v~s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~ 117 (376)
T KOG1188|consen 50 GSVRLYDKGTG--------QLLEEFKGPPATTNG----VRFISCDSPHGVISCSSDGTVRLWDIRSQAESARISWTQQSG 117 (376)
T ss_pred CeEEEEeccch--------hhhheecCCCCcccc----eEEecCCCCCeeEEeccCCeEEEEEeecchhhhheeccCCCC
Confidence 46677766651 122345567777773 33332 3345677888999999999988776655555444
Q ss_pred -EEEeeccCCCcceEEEecCCcceeeeeeeceeecccceeEEEecccccceEEEEecCCCCCCCccCcccc-----cccc
Q psy1644 107 -IFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISAD-----SAIM 180 (923)
Q Consensus 107 -if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~~~~~~~r~~~~a~-----~~im 180 (923)
.|++-..-.-+++|+. --|.+...++||+-|.....+|+|-.+.+- .+=.
T Consensus 118 ~~f~~ld~nck~~ii~~------------------------GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrF 173 (376)
T KOG1188|consen 118 TPFICLDLNCKKNIIAC------------------------GTELTRSDASVVLWDVRSEQQLLRQLNESHNDDVTQLRF 173 (376)
T ss_pred CcceEeeccCcCCeEEe------------------------ccccccCceEEEEEEeccccchhhhhhhhccCcceeEEe
Confidence 4443211112233332 235667779999999999988888776542 2345
Q ss_pred CcccchhhhcC
Q psy1644 181 NPASKVIALKG 191 (923)
Q Consensus 181 ~p~~~i~al~~ 191 (923)
||+.|=|-|-|
T Consensus 174 HP~~pnlLlSG 184 (376)
T KOG1188|consen 174 HPSDPNLLLSG 184 (376)
T ss_pred cCCCCCeEEee
Confidence 66554444333
No 45
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=70.36 E-value=24 Score=39.56 Aligned_cols=135 Identities=16% Similarity=0.233 Sum_probs=77.3
Q ss_pred EEEEeccC--CCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCC
Q psy1644 233 ICVREKIA--ESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMT 310 (923)
Q Consensus 233 ICVRE~~~--~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~ 310 (923)
+|.|.... +...-+++|.+.-..-.- .......-..|..+-+|+--..+ +..-.+|.|+|+++++-+... +
T Consensus 95 ~~r~~~~~~~~~~~evllD~n~l~~~~~-~~~~~~~~~Spdg~~la~~~s~~----G~e~~~l~v~Dl~tg~~l~d~-i- 167 (414)
T PF02897_consen 95 LYRRKTDEEDGPEEEVLLDPNELAKDGG-YVSLGGFSVSPDGKRLAYSLSDG----GSEWYTLRVFDLETGKFLPDG-I- 167 (414)
T ss_dssp EEEEETTS-TS-C-EEEEEGGGGSTTSS--EEEEEEEETTTSSEEEEEEEET----TSSEEEEEEEETTTTEEEEEE-E-
T ss_pred EEEEecccCCCCceEEEEcchHhhccCc-eEEeeeeeECCCCCEEEEEecCC----CCceEEEEEEECCCCcCcCCc-c-
Confidence 56666552 333267778755332211 23333455679999888873221 112358999999999666432 1
Q ss_pred CceeE--EEeccCC-eEEEEe------------cCeEEEEeccCC-CCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEe
Q psy1644 311 DDVVF--WKWISPN-TLALVT------------ETSVYHWSMEGD-STPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIG 374 (923)
Q Consensus 311 E~Vvf--WkWis~~-tLglVT------------~taVYHWsi~g~-s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvG 374 (923)
+.+.| -.|.++. .+-.++ +..||.|.++.+ +..+-||+-...-. ..++...+.|++|+++.-
T Consensus 168 ~~~~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~~~d~lvfe~~~~~~--~~~~~~~s~d~~~l~i~~ 245 (414)
T PF02897_consen 168 ENPKFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQSEDELVFEEPDEPF--WFVSVSRSKDGRYLFISS 245 (414)
T ss_dssp EEEESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-GGG-EEEEC-TTCTT--SEEEEEE-TTSSEEEEEE
T ss_pred cccccceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCChHhCeeEEeecCCCc--EEEEEEecCcccEEEEEE
Confidence 22334 4899884 444443 556999999654 45466786543322 366777889999987654
Q ss_pred ee
Q psy1644 375 IS 376 (923)
Q Consensus 375 I~ 376 (923)
-+
T Consensus 246 ~~ 247 (414)
T PF02897_consen 246 SS 247 (414)
T ss_dssp ES
T ss_pred Ec
Confidence 43
No 46
>PTZ00421 coronin; Provisional
Probab=70.23 E-value=68 Score=38.27 Aligned_cols=111 Identities=12% Similarity=0.154 Sum_probs=64.6
Q ss_pred ccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCc--eeEEEeccCCeEEEEe------cCeEEEEe
Q psy1644 265 SAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDD--VVFWKWISPNTLALVT------ETSVYHWS 336 (923)
Q Consensus 265 SAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~--VvfWkWis~~tLglVT------~taVYHWs 336 (923)
+.-.||..+.+|-=+ ..+++.|||+.+.+.+.+....+. +.-..|..+...-+.+ +..|..|+
T Consensus 173 sla~spdG~lLatgs---------~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWD 243 (493)
T PTZ00421 173 SLEWNLDGSLLCTTS---------KDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWD 243 (493)
T ss_pred EEEEECCCCEEEEec---------CCCEEEEEECCCCcEEEEEecCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEe
Confidence 344678877777642 346899999998776665544332 2234465554332322 34699999
Q ss_pred ccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeeccccc
Q psy1644 337 MEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVER 393 (923)
Q Consensus 337 i~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~er 393 (923)
+.....|...++-+.. +-+.--..+++.+++++.|- -.|.+.+|.++.
T Consensus 244 lr~~~~p~~~~~~d~~---~~~~~~~~d~d~~~L~lggk------gDg~Iriwdl~~ 291 (493)
T PTZ00421 244 TRKMASPYSTVDLDQS---SALFIPFFDEDTNLLYIGSK------GEGNIRCFELMN 291 (493)
T ss_pred CCCCCCceeEeccCCC---CceEEEEEcCCCCEEEEEEe------CCCeEEEEEeeC
Confidence 9765566655443321 11222234677777777662 256777777653
No 47
>KOG2048|consensus
Probab=70.16 E-value=1.9e+02 Score=36.19 Aligned_cols=234 Identities=17% Similarity=0.217 Sum_probs=136.7
Q ss_pred CccceeeeecCCCCCCceEEEEEEecCC--cc-EEEEEecCCCCCCCCCcccceeeeecCCCccCCcceeEEeccccceE
Q psy1644 1 HAASFASFKSEGNPEPSTLFCFAVRTPA--GG-KLHIIEVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVV 77 (923)
Q Consensus 1 haa~F~sfkl~gN~~~stL~cfa~r~~~--~g-kLhiiEig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~i 77 (923)
|=..|++|| |.+=+=++|++++.. .| .==-||+=.+ .+..|.++++.. |++....- +--+ .-|=+
T Consensus 16 hrcrf~d~~----Ps~I~slA~s~kS~~lAvsRt~g~IEiwN~--~~~w~~~~vi~g--~~drsIE~---L~W~-e~~RL 83 (691)
T KOG2048|consen 16 HRCRFVDYK----PSEIVSLAYSHKSNQLAVSRTDGNIEIWNL--SNNWFLEPVIHG--PEDRSIES---LAWA-EGGRL 83 (691)
T ss_pred EEEEEEeee----ccceEEEEEeccCCceeeeccCCcEEEEcc--CCCceeeEEEec--CCCCceee---EEEc-cCCeE
Confidence 445677774 333444555555553 12 3335777332 257899988887 66544332 2222 55678
Q ss_pred EEEeeeeeEEEEecccceeeeecccCCCcEEEeeccCCCcceEEEe-cCCcceeeeeeeceeecccceeEEEecccccce
Q psy1644 78 YLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGLT-NVGINVGSVSFNTLTMESDKFICVREKIAESAQ 156 (923)
Q Consensus 78 yliTK~G~lhl~Dlet~t~iy~~rIS~~~if~ta~~~~t~Gil~vn-~~G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~ 156 (923)
|=+--.|.|.=||+.++.++|....++..|.--|-+.. +.+++|- ..| =...++.++
T Consensus 84 FS~g~sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~p~-~~~l~IgcddG-vl~~~s~~p-------------------- 141 (691)
T KOG2048|consen 84 FSSGLSGSITEWDLHTLKQKYNIDSNGGAIWSIAINPE-NTILAIGCDDG-VLYDFSIGP-------------------- 141 (691)
T ss_pred EeecCCceEEEEecccCceeEEecCCCcceeEEEeCCc-cceEEeecCCc-eEEEEecCC--------------------
Confidence 88888999999999999999999999999998876643 3555555 344 122222211
Q ss_pred EEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCcccccceeeecceEEEEE
Q psy1644 157 VVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVSFNTLTMESDKFICVR 236 (923)
Q Consensus 157 vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~F~~lTmESDkyICVR 236 (923)
.-|..+..+|.-++|||-|+-... +. +-.+=.+|--|-+.
T Consensus 142 --------------~~I~~~r~l~rq~sRvLslsw~~~------~~--------------------~i~~Gs~Dg~Iriw 181 (691)
T KOG2048|consen 142 --------------DKITYKRSLMRQKSRVLSLSWNPT------GT--------------------KIAGGSIDGVIRIW 181 (691)
T ss_pred --------------ceEEEEeecccccceEEEEEecCC------cc--------------------EEEecccCceEEEE
Confidence 124456678888888888776332 22 00111233334455
Q ss_pred eccCCCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCC---CCCeEEEEecccccccccccCCCce
Q psy1644 237 EKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPN---APKTLQIFNIEMKSKMKAHPMTDDV 313 (923)
Q Consensus 237 E~~~~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~---~g~~LQIFNle~K~KLKs~~m~E~V 313 (923)
+-.- ++.+-|++|.-...-.|+|.=+ | -|+-|| |+--+ ...++|.||-+.-.-++||..
T Consensus 182 d~~~-~~t~~~~~~~~d~l~k~~~~iV------W--Sv~~Lr-----d~tI~sgDS~G~V~FWd~~~gTLiqS~~~---- 243 (691)
T KOG2048|consen 182 DVKS-GQTLHIITMQLDRLSKREPTIV------W--SVLFLR-----DSTIASGDSAGTVTFWDSIFGTLIQSHSC---- 243 (691)
T ss_pred EcCC-CceEEEeeecccccccCCceEE------E--EEEEee-----cCcEEEecCCceEEEEcccCcchhhhhhh----
Confidence 4332 2245566654322222222111 1 245556 11111 245899999999999999866
Q ss_pred eEEEeccCCeEEEEecCe
Q psy1644 314 VFWKWISPNTLALVTETS 331 (923)
Q Consensus 314 vfWkWis~~tLglVT~ta 331 (923)
.+.+.++|++.+.
T Consensus 244 -----h~adVl~Lav~~~ 256 (691)
T KOG2048|consen 244 -----HDADVLALAVADN 256 (691)
T ss_pred -----hhcceeEEEEcCC
Confidence 5677888887664
No 48
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=69.65 E-value=6.3 Score=28.82 Aligned_cols=26 Identities=19% Similarity=0.277 Sum_probs=23.0
Q ss_pred cceEEEEeeeeeEEEEecccceeeee
Q psy1644 74 YDVVYLITKFGYIHVYDIETATCIYM 99 (923)
Q Consensus 74 ~G~iyliTK~G~lhl~Dlet~t~iy~ 99 (923)
-|.+|+-|..|++|-+|.+||..+..
T Consensus 6 ~~~v~~~~~~g~l~a~d~~~G~~~W~ 31 (33)
T smart00564 6 DGTVYVGSTDGTLYALDAKTGEILWT 31 (33)
T ss_pred CCEEEEEcCCCEEEEEEcccCcEEEE
Confidence 36899999999999999999988754
No 49
>KOG0307|consensus
Probab=68.61 E-value=34 Score=44.16 Aligned_cols=144 Identities=19% Similarity=0.329 Sum_probs=95.3
Q ss_pred cCCCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCC--------CCCeEEEEecccccccccccCC
Q psy1644 239 IAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPN--------APKTLQIFNIEMKSKMKAHPMT 310 (923)
Q Consensus 239 ~~~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~--------~g~~LQIFNle~K~KLKs~~m~ 310 (923)
.++..+|.|+||+|+..+.--+ =|+|.+.|.+|. =|+.. .....-||||..|+-+-...-.
T Consensus 135 Ga~~geI~iWDlnn~~tP~~~~------~~~~~~eI~~ls-----WNrkvqhILAS~s~sg~~~iWDlr~~~pii~ls~~ 203 (1049)
T KOG0307|consen 135 GADDGEILIWDLNKPETPFTPG------SQAPPSEIKCLS-----WNRKVSHILASGSPSGRAVIWDLRKKKPIIKLSDT 203 (1049)
T ss_pred cCCCCcEEEeccCCcCCCCCCC------CCCCcccceEec-----cchhhhHHhhccCCCCCceeccccCCCcccccccC
Confidence 3455699999999988775554 466777788887 12211 1235779999888544443332
Q ss_pred Cc-----eeEEEeccCCeEEEEecC----eEEEEeccCCCCchhhhhcccC-------------C--C---CCeeEe---
Q psy1644 311 DD-----VVFWKWISPNTLALVTET----SVYHWSMEGDSTPEKMFDRHST-------------L--N---GCQIIN--- 360 (923)
Q Consensus 311 E~-----VvfWkWis~~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~-------------L--~---~~QIIn--- 360 (923)
-. ++=|.==..+.|.+-++. .+--|++.-.+.|.|+++-|.. | . ..||+-
T Consensus 204 ~~~~~~S~l~WhP~~aTql~~As~dd~~PviqlWDlR~assP~k~~~~H~~GilslsWc~~D~~lllSsgkD~~ii~wN~ 283 (1049)
T KOG0307|consen 204 PGRMHCSVLAWHPDHATQLLVASGDDSAPVIQLWDLRFASSPLKILEGHQRGILSLSWCPQDPRLLLSSGKDNRIICWNP 283 (1049)
T ss_pred CCccceeeeeeCCCCceeeeeecCCCCCceeEeecccccCCchhhhcccccceeeeccCCCCchhhhcccCCCCeeEecC
Confidence 22 555554455555555554 7889999888999999966643 1 1 235544
Q ss_pred ------eeeCCCCCeEEEEeeecccc------cccceeeeccccc
Q psy1644 361 ------YRTDPKQTWLLLIGISAVSN------RVVGAMQLYSVER 393 (923)
Q Consensus 361 ------Yr~d~~~kW~~LvGI~~~~~------~i~G~mQLYS~er 393 (923)
|..-..++||-=+--..++. .+.|.+++||+..
T Consensus 284 ~tgEvl~~~p~~~nW~fdv~w~pr~P~~~A~asfdgkI~I~sl~~ 328 (1049)
T KOG0307|consen 284 NTGEVLGELPAQGNWCFDVQWCPRNPSVMAAASFDGKISIYSLQG 328 (1049)
T ss_pred CCceEeeecCCCCcceeeeeecCCCcchhhhheeccceeeeeeec
Confidence 34445778998777776543 6779999999864
No 50
>KOG0266|consensus
Probab=68.45 E-value=2.4e+02 Score=32.98 Aligned_cols=223 Identities=15% Similarity=0.226 Sum_probs=115.9
Q ss_pred CCcccceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEEEEec-ccceeeeecccCC-CcEEEeeccCCCcceEEE
Q psy1644 45 TPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYDI-ETATCIYMNRISS-DTIFVTAPHEATGGIIGL 122 (923)
Q Consensus 45 ~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~Dl-et~t~iy~~rIS~-~~if~ta~~~~t~Gil~v 122 (923)
.+.+.-.-++-|+||-. .+-=.+..+.+++||+ +.+.++..-+ .+ +.|+.++-+..+.=++..
T Consensus 200 ~~h~~~v~~~~fs~d~~--------------~l~s~s~D~tiriwd~~~~~~~~~~l~-gH~~~v~~~~f~p~g~~i~Sg 264 (456)
T KOG0266|consen 200 SGHTRGVSDVAFSPDGS--------------YLLSGSDDKTLRIWDLKDDGRNLKTLK-GHSTYVTSVAFSPDGNLLVSG 264 (456)
T ss_pred cccccceeeeEECCCCc--------------EEEEecCCceEEEeeccCCCeEEEEec-CCCCceEEEEecCCCCEEEEe
Confidence 35566678888888654 3333456789999999 5556665555 44 445444444333223333
Q ss_pred ecCCcceeeeeeeceeecccceeEEEecccccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcc
Q psy1644 123 TNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGE 202 (923)
Q Consensus 123 n~~G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~ 202 (923)
...| .|.+=+++. .-|+|--.+.+.+|..+|.... +.
T Consensus 265 s~D~----tvriWd~~~----~~~~~~l~~hs~~is~~~f~~d-----------------------------------~~ 301 (456)
T KOG0266|consen 265 SDDG----TVRIWDVRT----GECVRKLKGHSDGISGLAFSPD-----------------------------------GN 301 (456)
T ss_pred cCCC----cEEEEeccC----CeEEEeeeccCCceEEEEECCC-----------------------------------CC
Confidence 3333 222222211 3444444444444444433222 11
Q ss_pred cceeeeccccccccCCcccccceeeecceEEEEEeccCCCce--EEEEECCCCCCCcccccCccccccCCCccEEEEecc
Q psy1644 203 FKNRIYLLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQ--VVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGK 280 (923)
Q Consensus 203 ~~~e~~~Lt~lGI~~~si~F~~lTmESDkyICVRE~~~~~~q--VvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~ 280 (923)
+. ++-+.|..|++.+...+..+ ..+.+..++. + .=++.-+|+.+-|..-.
T Consensus 302 ~l--------------------~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~-~------~~~~~fsp~~~~ll~~~- 353 (456)
T KOG0266|consen 302 LL--------------------VSASYDGTIRVWDLETGSKLCLKLLSGAENSA-P------VTSVQFSPNGKYLLSAS- 353 (456)
T ss_pred EE--------------------EEcCCCccEEEEECCCCceeeeecccCCCCCC-c------eeEEEECCCCcEEEEec-
Confidence 11 23345788888887766644 1112222221 1 23466779988777762
Q ss_pred CCCCCCCCCCCeEEEEecccccccccccC-CCceeEEEeccCC----eEEEEe---cCeEEEEeccCCCCchhhhhcccC
Q psy1644 281 AGNDNNPNAPKTLQIFNIEMKSKMKAHPM-TDDVVFWKWISPN----TLALVT---ETSVYHWSMEGDSTPEKMFDRHST 352 (923)
Q Consensus 281 ~~~~n~~~~g~~LQIFNle~K~KLKs~~m-~E~VvfWkWis~~----tLglVT---~taVYHWsi~g~s~P~K~FdR~~~ 352 (923)
.++++.+||+..-.-++.+.- ...+ +.|.+.. .-.+++ +..|+-|++.. -......+.|.
T Consensus 354 --------~d~~~~~w~l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~sg~~d~~v~~~~~~s-~~~~~~l~~h~- 421 (456)
T KOG0266|consen 354 --------LDRTLKLWDLRSGKSVGTYTGHSNLV--RCIFSPTLSTGGKLIYSGSEDGSVYVWDSSS-GGILQRLEGHS- 421 (456)
T ss_pred --------CCCeEEEEEccCCcceeeecccCCcc--eeEecccccCCCCeEEEEeCCceEEEEeCCc-cchhhhhcCCC-
Confidence 467999999986554444433 3332 5554433 223333 45899998864 33444455554
Q ss_pred CCCCeeEeeeeCCCC
Q psy1644 353 LNGCQIINYRTDPKQ 367 (923)
Q Consensus 353 L~~~QIInYr~d~~~ 367 (923)
...+..+...+.+
T Consensus 422 --~~~~~~~~~~~~~ 434 (456)
T KOG0266|consen 422 --KAAVSDLSSHPTE 434 (456)
T ss_pred --CCceeccccCCCc
Confidence 2344444444443
No 51
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=68.17 E-value=11 Score=44.16 Aligned_cols=57 Identities=21% Similarity=0.340 Sum_probs=43.3
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeeccc-------CCceEEeecc---------------CCCCcEEEEc-CCCeEEE
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRIS-------SDTIFVTAPH---------------EATGGIIGVN-RKGQVLS 790 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS-------~dtIFvta~~---------------~~t~Gii~VN-rkGqVL~ 790 (923)
..++|+-||.|.++-+|.+||..+...+++ .+.+|+.+.+ +..+.+++++ +.|+++=
T Consensus 301 ~~~V~~g~~~G~l~ald~~tG~~~W~~~~~~~~~~~~~~~vyv~~~~~~~~~~~~~~~~~~~~~~G~l~AlD~~tG~~~W 380 (488)
T cd00216 301 VPAIVHAPKNGFFYVLDRTTGKLISARPEVEQPMAYDPGLVYLGAFHIPLGLPPQKKKRCKKPGKGGLAALDPKTGKVVW 380 (488)
T ss_pred eEEEEEECCCceEEEEECCCCcEeeEeEeeccccccCCceEEEccccccccCcccccCCCCCCCceEEEEEeCCCCcEee
Confidence 368999999999999999999999987654 3678886532 2346677787 6676653
No 52
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=67.24 E-value=15 Score=40.56 Aligned_cols=60 Identities=15% Similarity=0.256 Sum_probs=40.5
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecCCc
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDEDN 797 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide~~ 797 (923)
-|.||+.+..|.++.||++||+.+.+...+... ++.-..+-+++.+.+|.|..+..+.-+
T Consensus 241 ~~~vy~~~~~g~l~a~d~~tG~~~W~~~~~~~~----~p~~~~~~vyv~~~~G~l~~~d~~tG~ 300 (377)
T TIGR03300 241 GGQVYAVSYQGRVAALDLRSGRVLWKRDASSYQ----GPAVDDNRLYVTDADGVVVALDRRSGS 300 (377)
T ss_pred CCEEEEEEcCCEEEEEECCCCcEEEeeccCCcc----CceEeCCEEEEECCCCeEEEEECCCCc
Confidence 478999999999999999999999987743311 111123345555666666666654443
No 53
>PLN03077 Protein ECB2; Provisional
Probab=64.95 E-value=38 Score=42.15 Aligned_cols=88 Identities=23% Similarity=0.378 Sum_probs=57.0
Q ss_pred HHHHHHHhcChhHHHHHHHHhCCCcc---HHHHHHHH-HhcCCchhhHHHHhhhcCCCCCcCccccchhhhhhhcccccc
Q psy1644 466 VIQSFAETGQFQKIVLYAKKVGYSPD---YVFLLRNV-MRVNPDQGVGFAQMLVQDDEPLADINQSYEGLFYFLGSIVNY 541 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~vgytpD---y~~LL~~i-~~~nP~~a~efA~~L~~~~~pl~di~~~~egLyy~Lgsivn~ 541 (923)
+|..|++.|+++++.++.++...+|| |..||... +.-|.+.|...|+.|.+-+|. ..|.|..|+.+
T Consensus 631 lv~~l~r~G~~~eA~~~~~~m~~~pd~~~~~aLl~ac~~~~~~e~~e~~a~~l~~l~p~-------~~~~y~ll~n~--- 700 (857)
T PLN03077 631 VVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPN-------SVGYYILLCNL--- 700 (857)
T ss_pred HHHHHHhCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHcCChHHHHHHHHHHHhhCCC-------CcchHHHHHHH---
Confidence 78888888888888888888888887 66777754 223666555555555554321 11334433332
Q ss_pred cCCcchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhccC
Q psy1644 542 SQDQEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKL 588 (923)
Q Consensus 542 s~d~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKeakL 588 (923)
-++.|++.|+++ |..-+|+.++
T Consensus 701 -------------ya~~g~~~~a~~------------vr~~M~~~g~ 722 (857)
T PLN03077 701 -------------YADAGKWDEVAR------------VRKTMRENGL 722 (857)
T ss_pred -------------HHHCCChHHHHH------------HHHHHHHcCC
Confidence 267899999865 5666777666
No 54
>KOG0264|consensus
Probab=64.79 E-value=3e+02 Score=32.80 Aligned_cols=141 Identities=18% Similarity=0.389 Sum_probs=90.8
Q ss_pred CCCceEEEEECCCCCCCcccccCcc-----ccccCCC-ccEEEEeccCCCCCCCCCCCeEEEEeccc-ccccccccCCCc
Q psy1644 240 AESAQVVIIDMNDPTNPIRRPISAD-----SAIMNPA-SKVIALKGKAGNDNNPNAPKTLQIFNIEM-KSKMKAHPMTDD 312 (923)
Q Consensus 240 ~~~~qVvIVDL~n~~~v~Rrpi~AD-----SAIMnP~-~~IIALra~~~~~n~~~~g~~LQIFNle~-K~KLKs~~m~E~ 312 (923)
++...+.|.|+-.+.+..+++..|- ++-.||- ..|||-=+ +-++|++||+.. +++|-++.-.++
T Consensus 247 ~dd~~L~iwD~R~~~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S---------~D~tV~LwDlRnL~~~lh~~e~H~d 317 (422)
T KOG0264|consen 247 GDDGKLMIWDTRSNTSKPSHSVKAHSAEVNCVAFNPFNEFILATGS---------ADKTVALWDLRNLNKPLHTFEGHED 317 (422)
T ss_pred cCCCeEEEEEcCCCCCCCcccccccCCceeEEEeCCCCCceEEecc---------CCCcEEEeechhcccCceeccCCCc
Confidence 5667999999987566677777773 4566777 57888752 457999999964 556666666676
Q ss_pred eeE-EEec--cCCeEEEE-ecCeEEEEecc---CC--------CCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeec
Q psy1644 313 VVF-WKWI--SPNTLALV-TETSVYHWSME---GD--------STPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISA 377 (923)
Q Consensus 313 Vvf-WkWi--s~~tLglV-T~taVYHWsi~---g~--------s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~ 377 (923)
-|| -.|- .++.||-= |+..|-.|++. +. ++|+-+|--.+-- ..|..+-=++.+-|++..- +
T Consensus 318 ev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~--~kV~DfsWnp~ePW~I~Sv--a 393 (422)
T KOG0264|consen 318 EVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHT--AKVSDFSWNPNEPWTIASV--A 393 (422)
T ss_pred ceEEEEeCCCCCceeEecccCCcEEEEeccccccccChhhhccCCcceeEEecCcc--cccccccCCCCCCeEEEEe--c
Confidence 665 3443 33344432 34579999983 22 2444444322211 3599999999999985421 2
Q ss_pred ccccccceeeeccccccccc
Q psy1644 378 VSNRVVGAMQLYSVERKCSQ 397 (923)
Q Consensus 378 ~~~~i~G~mQLYS~er~~SQ 397 (923)
++ +.+|.|...+.+-.
T Consensus 394 eD----N~LqIW~~s~~i~~ 409 (422)
T KOG0264|consen 394 ED----NILQIWQMAENIYN 409 (422)
T ss_pred CC----ceEEEeeccccccC
Confidence 22 78899887665543
No 55
>PRK13844 recombination protein RecR; Provisional
Probab=64.00 E-value=16 Score=38.98 Aligned_cols=91 Identities=19% Similarity=0.354 Sum_probs=64.0
Q ss_pred HHHHHHHhcCCchhhHHHHhhhcC--------------CCCCcCccc-------------------------cchhhhhh
Q psy1644 494 FLLRNVMRVNPDQGVGFAQMLVQD--------------DEPLADINQ-------------------------SYEGLFYF 534 (923)
Q Consensus 494 ~LL~~i~~~nP~~a~efA~~L~~~--------------~~pl~di~~-------------------------~~egLyy~ 534 (923)
.|--+++...++.+.+||..|.+- +..++++.+ .|.|+|+.
T Consensus 30 Rla~~lL~~~~~~~~~la~~i~~~~~~i~~C~~C~~lte~~~C~IC~d~~Rd~~~iCVVE~~~Dv~aiE~t~~y~G~YhV 109 (200)
T PRK13844 30 RLALYLLDKSPETAIAIANSLLDATANIKKCVYCQALTEDDVCNICSNTNRDDTKLCIIESMLDMIAIEEAGIYRGKYFV 109 (200)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHHHhCCcCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCHHHHHHHHhhCccceEEEE
Confidence 344456667888888888877551 122344443 58899999
Q ss_pred hcccccccCC---cchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhc
Q psy1644 535 LGSIVNYSQD---QEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEA 586 (923)
Q Consensus 535 Lgsivn~s~d---~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKea 586 (923)
||..++-.+. .++.++-+..=.+.|+++|| |.--|.--+.|-+-.||++.
T Consensus 110 L~G~ispl~gi~p~~l~i~~L~~Ri~~~~v~EV--IlAt~~t~EGe~Ta~yi~~~ 162 (200)
T PRK13844 110 LNGRISPLDGIGPSELKLDILQQIIADRKIDEV--ILAISPTVEGETTAHFISQM 162 (200)
T ss_pred ccCccCccCCCChhhcCHHHHHHHHhcCCCcEE--EEeCCCCccHHHHHHHHHHH
Confidence 9998776554 46677766666667788887 66667788888888888765
No 56
>PRK05137 tolB translocation protein TolB; Provisional
Probab=63.90 E-value=2.7e+02 Score=31.96 Aligned_cols=91 Identities=12% Similarity=0.217 Sum_probs=45.7
Q ss_pred CCceEEEEECCCCCCCccc---ccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEE
Q psy1644 241 ESAQVVIIDMNDPTNPIRR---PISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWK 317 (923)
Q Consensus 241 ~~~qVvIVDL~n~~~v~Rr---pi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWk 317 (923)
+..++.++|++... +.|- +-...+.-.+|..+.||..... .....|.+++++.+.. +..+-...+..-.
T Consensus 312 g~~~Iy~~d~~g~~-~~~lt~~~~~~~~~~~SpdG~~ia~~~~~------~~~~~i~~~d~~~~~~-~~lt~~~~~~~p~ 383 (435)
T PRK05137 312 GSPQLYVMNADGSN-PRRISFGGGRYSTPVWSPRGDLIAFTKQG------GGQFSIGVMKPDGSGE-RILTSGFLVEGPT 383 (435)
T ss_pred CCCeEEEEECCCCC-eEEeecCCCcccCeEECCCCCEEEEEEcC------CCceEEEEEECCCCce-EeccCCCCCCCCe
Confidence 44688999976532 2221 1111234567888888886311 0234678888765432 1111111233335
Q ss_pred ecc-CCeEEEEec-------CeEEEEeccC
Q psy1644 318 WIS-PNTLALVTE-------TSVYHWSMEG 339 (923)
Q Consensus 318 Wis-~~tLglVT~-------taVYHWsi~g 339 (923)
|-. .+.|+..+. ..+|..+++|
T Consensus 384 ~spDG~~i~~~~~~~~~~~~~~L~~~dl~g 413 (435)
T PRK05137 384 WAPNGRVIMFFRQTPGSGGAPKLYTVDLTG 413 (435)
T ss_pred ECCCCCEEEEEEccCCCCCcceEEEEECCC
Confidence 542 345666554 2466666654
No 57
>KOG1273|consensus
Probab=62.92 E-value=47 Score=38.25 Aligned_cols=29 Identities=38% Similarity=0.465 Sum_probs=27.6
Q ss_pred eEEEEeeeeeEEEEecccceeeeecccCC
Q psy1644 76 VVYLITKFGYIHVYDIETATCIYMNRISS 104 (923)
Q Consensus 76 ~iyliTK~G~lhl~Dlet~t~iy~~rIS~ 104 (923)
.||.-|.-|-+++||-||-.|+-+-||++
T Consensus 167 yIitGtsKGkllv~~a~t~e~vas~rits 195 (405)
T KOG1273|consen 167 YIITGTSKGKLLVYDAETLECVASFRITS 195 (405)
T ss_pred EEEEecCcceEEEEecchheeeeeeeech
Confidence 58999999999999999999999999988
No 58
>PLN00181 protein SPA1-RELATED; Provisional
Probab=61.96 E-value=88 Score=38.85 Aligned_cols=92 Identities=13% Similarity=0.209 Sum_probs=58.6
Q ss_pred CCeEEEEecccccccccccCCCceeEEEeccC--CeEEEEec-CeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCC
Q psy1644 290 PKTLQIFNIEMKSKMKAHPMTDDVVFWKWISP--NTLALVTE-TSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPK 366 (923)
Q Consensus 290 g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~--~tLglVT~-taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~ 366 (923)
..++.+||+.+...++.......|....|.++ +.|+.-+. ..|+.|++.....|...+..|..- |..-.. .+
T Consensus 597 Dg~v~iWd~~~~~~~~~~~~~~~v~~v~~~~~~g~~latgs~dg~I~iwD~~~~~~~~~~~~~h~~~----V~~v~f-~~ 671 (793)
T PLN00181 597 DGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKT----VSYVRF-VD 671 (793)
T ss_pred CCEEEEEECCCCcEEEEEecCCCeEEEEEeCCCCCEEEEEeCCCeEEEEECCCCCccceEecCCCCC----EEEEEE-eC
Confidence 46899999998888877776778888888764 44554443 389999997655566555554321 211111 24
Q ss_pred CCeEEEEeeecccccccceeeeccccc
Q psy1644 367 QTWLLLIGISAVSNRVVGAMQLYSVER 393 (923)
Q Consensus 367 ~kW~~LvGI~~~~~~i~G~mQLYS~er 393 (923)
..+++-.| -+|.+.+|....
T Consensus 672 ~~~lvs~s-------~D~~ikiWd~~~ 691 (793)
T PLN00181 672 SSTLVSSS-------TDNTLKLWDLSM 691 (793)
T ss_pred CCEEEEEE-------CCCEEEEEeCCC
Confidence 45543322 467888888753
No 59
>TIGR01061 parC_Gpos DNA topoisomerase IV, A subunit, Gram-positive. Operationally, topoisomerase IV is a type II topoisomerase required for the decatenation of chromosome segregation. Not every bacterium has both a topo II and a topo IV. The topo IV families of the Gram-positive bacteria and the Gram-negative bacteria appear not to represent a single clade among the type II topoisomerases, and are represented by separate models for this reason.
Probab=61.57 E-value=74 Score=39.99 Aligned_cols=167 Identities=17% Similarity=0.229 Sum_probs=93.7
Q ss_pred ChhhhHHhhhcCCccChHHHHhHHhhccCCCCCceEEeeccccchhHHHHHHHhcCchheehhhhhccCccccceeeeec
Q psy1644 560 QIKEVERICRESNCYNAERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQKYIEIYVQKGRKHLLEKWLKEE 639 (923)
Q Consensus 560 q~~EvERicresn~YdpekvKnfLKeakL~d~~PLI~VCDrf~fV~dLv~YLY~n~~~k~IEvYvqkvNp~~tP~VvgLl 639 (923)
.+.||=.|.|.|. +.+.+|..|++. |+|-++=..||-...+.+.-.-=+.+.
T Consensus 381 ~id~~i~iir~~~--~~~~~~~~l~~~--------------f~~~~~q~~~il~m~l~~lt~~e~~kl------------ 432 (738)
T TIGR01061 381 IIDEIIKLIRSSE--DKSDAKENLIDN--------------FKFTENQAEAIVSLRLYRLTNTDIFEL------------ 432 (738)
T ss_pred hhhhHhHHHHcCC--CHHHHHHHHHHh--------------cCCCHHHHHHHHhhHHHHHhHHHHHHH------------
Confidence 5677778889887 778899999864 666666667776666655432222211
Q ss_pred cccCCHHHhhhhhccCcchhhhHHhhhccccccchhhHHHHH-hhcchhHHHHHHHHHHHHhhcccchhhhHHHHhH---
Q psy1644 640 KLECSEELGDLVKQSDPTLALSVYLRANIKQSRFVHDLVLYL-YRNNLQKYIEIYVQKIRIKIKQSRFVHDLVLYLY--- 715 (923)
Q Consensus 640 D~dc~E~~Ik~LL~sv~~l~~~~~v~~~~~~~~pv~~Lv~ev-~RnrL~kll~p~LE~~~~~g~q~~~v~nalakiy--- 715 (923)
++-++.|... +++|-+-+ .+.++-+++--.|+ ++.|-|
T Consensus 433 -----~~e~~~l~~~-------------------i~~l~~iL~~~~~~~~~i~~el~--------------~ik~kfg~~ 474 (738)
T TIGR01061 433 -----KEEQNELEKK-------------------IISLEQIIASEKARNKLLKKQLE--------------EYKKQFAQQ 474 (738)
T ss_pred -----HHHHHHHHHH-------------------HHHHHHHhCCHHHHHHHHHHHHH--------------HHHHHhCCC
Confidence 0111111111 11111111 13333355555555 455556
Q ss_pred -hcCcc---ceeeeeeeeccccccEEEEEeccceEEEEecCCcceeeeecc-----cCCce-EEeeccCCCCcEEEEcCC
Q psy1644 716 -RNNLQ---KYIEIYVQKVSAKYDVVYLITKFGYIHVYDIETATCIYMNRI-----SSDTI-FVTAPHEATGGIIGVNRK 785 (923)
Q Consensus 716 -~~~~~---~~~~i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRI-----S~dtI-Fvta~~~~t~Gii~VNrk 785 (923)
|+..+ ...++.....-++..++.++||.|||.-..+..=.. .+|. .+|-+ ++... +..+=++.++++
T Consensus 475 RRT~I~~~~~~~~~~~edli~~E~~lV~lTk~G~IKrt~l~~f~~--~r~~ai~Lke~Delv~v~~~-~~~d~IllfT~~ 551 (738)
T TIGR01061 475 RRSQIEDFINQIKINESELIENEDYYVLITKAGYIKRTSNRSFAS--SKYTDFGSKDDDILFAQTIA-NTTDQILIFTSL 551 (738)
T ss_pred CceeeecccccccCCHhhcccccceEEEEccCCEEEEeehhhhhh--hcccccCCCCCCEEEEEEEe-cCCCEEEEEeCC
Confidence 44432 123454445557788899999999999877653221 0111 23432 33222 234459999999
Q ss_pred CeEEEEEecC
Q psy1644 786 GQVLSVSVDE 795 (923)
Q Consensus 786 GqVL~V~Ide 795 (923)
|+++.+.+++
T Consensus 552 Gkv~r~~~~e 561 (738)
T TIGR01061 552 GNIINIPVHK 561 (738)
T ss_pred CcEEEEEHHH
Confidence 9999999876
No 60
>PRK04792 tolB translocation protein TolB; Provisional
Probab=61.47 E-value=1.2e+02 Score=35.25 Aligned_cols=130 Identities=17% Similarity=0.185 Sum_probs=70.8
Q ss_pred ceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccc--cccccCCCceeEEEe
Q psy1644 243 AQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSK--MKAHPMTDDVVFWKW 318 (923)
Q Consensus 243 ~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~K--LKs~~m~E~VvfWkW 318 (923)
.++.+.|.+..+. .++-+...-+...+|..+-||.=.. . + ....|.++|+++.+. +.+. .....-=.|
T Consensus 198 ~~l~i~d~dG~~~~~l~~~~~~~~~p~wSPDG~~La~~s~-~--~---g~~~L~~~dl~tg~~~~lt~~--~g~~~~~~w 269 (448)
T PRK04792 198 YQLMIADYDGYNEQMLLRSPEPLMSPAWSPDGRKLAYVSF-E--N---RKAEIFVQDIYTQVREKVTSF--PGINGAPRF 269 (448)
T ss_pred eEEEEEeCCCCCceEeecCCCcccCceECCCCCEEEEEEe-c--C---CCcEEEEEECCCCCeEEecCC--CCCcCCeeE
Confidence 4788888755432 2233333345677888776765311 0 1 234788889887543 3322 211112245
Q ss_pred ccC-CeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 319 ISP-NTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 319 is~-~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
-.+ +.|+++... .+|.|++++. .+.++.. +. .-.....-++|++|++.+.- -.|..|+|..+
T Consensus 270 SPDG~~La~~~~~~g~~~Iy~~dl~tg-~~~~lt~-~~----~~~~~p~wSpDG~~I~f~s~------~~g~~~Iy~~d 336 (448)
T PRK04792 270 SPDGKKLALVLSKDGQPEIYVVDIATK-ALTRITR-HR----AIDTEPSWHPDGKSLIFTSE------RGGKPQIYRVN 336 (448)
T ss_pred CCCCCEEEEEEeCCCCeEEEEEECCCC-CeEECcc-CC----CCccceEECCCCCEEEEEEC------CCCCceEEEEE
Confidence 533 457776433 5999998653 2333332 11 11233456789999887641 13567888755
No 61
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=61.13 E-value=61 Score=33.49 Aligned_cols=77 Identities=16% Similarity=0.285 Sum_probs=48.8
Q ss_pred eEEEEECCCCCCCcccc-cCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccCC
Q psy1644 244 QVVIIDMNDPTNPIRRP-ISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISPN 322 (923)
Q Consensus 244 qVvIVDL~n~~~v~Rrp-i~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~~ 322 (923)
+|.+.|+. ...+..-+ ....+..-+|..+.+|+=|. + | ....|++||++++++++.+.- ..++...|-.+-
T Consensus 84 ~v~lyd~~-~~~i~~~~~~~~n~i~wsP~G~~l~~~g~-~--n---~~G~l~~wd~~~~~~i~~~~~-~~~t~~~WsPdG 155 (194)
T PF08662_consen 84 KVTLYDVK-GKKIFSFGTQPRNTISWSPDGRFLVLAGF-G--N---LNGDLEFWDVRKKKKISTFEH-SDATDVEWSPDG 155 (194)
T ss_pred ccEEEcCc-ccEeEeecCCCceEEEECCCCCEEEEEEc-c--C---CCcEEEEEECCCCEEeecccc-CcEEEEEEcCCC
Confidence 67777764 22111111 12335667899999999642 1 1 235699999999999988764 357788887666
Q ss_pred eEEEEe
Q psy1644 323 TLALVT 328 (923)
Q Consensus 323 tLglVT 328 (923)
..-+..
T Consensus 156 r~~~ta 161 (194)
T PF08662_consen 156 RYLATA 161 (194)
T ss_pred CEEEEE
Confidence 554443
No 62
>PLN03218 maturation of RBCL 1; Provisional
Probab=60.90 E-value=45 Score=43.46 Aligned_cols=82 Identities=6% Similarity=0.060 Sum_probs=48.7
Q ss_pred HHHHHHHhcChhHHHHHHHH---hCCCcc---HHHHHHHHHhc-CCchhhHHHHhhhcCC-CCCcCccccchhhhhhhcc
Q psy1644 466 VIQSFAETGQFQKIVLYAKK---VGYSPD---YVFLLRNVMRV-NPDQGVGFAQMLVQDD-EPLADINQSYEGLFYFLGS 537 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~---vgytpD---y~~LL~~i~~~-nP~~a~efA~~L~~~~-~pl~di~~~~egLyy~Lgs 537 (923)
+|.++++.|+++++...-++ .|..|| |..|+....+. ++++|.++...+...+ .|
T Consensus 690 LI~ay~k~G~~eeA~~lf~eM~~~g~~PdvvtyN~LI~gy~k~G~~eeAlelf~eM~~~Gi~P----------------- 752 (1060)
T PLN03218 690 LMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCP----------------- 752 (1060)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCC-----------------
Confidence 66667777777666655443 356665 56666665444 5666666655554432 11
Q ss_pred cccccCCcchhhHHHHHHHhcCChhhhHHhhhc
Q psy1644 538 IVNYSQDQEVHFKYIQAACKTGQIKEVERICRE 570 (923)
Q Consensus 538 ivn~s~d~dVhfKYIeAA~k~gq~~EvERicre 570 (923)
|....--.|.++++.|++++.+++-++
T Consensus 753 ------d~~Ty~sLL~a~~k~G~le~A~~l~~~ 779 (1060)
T PLN03218 753 ------NTITYSILLVASERKDDADVGLDLLSQ 779 (1060)
T ss_pred ------CHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 122233457888899998887766544
No 63
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=60.80 E-value=23 Score=39.18 Aligned_cols=59 Identities=19% Similarity=0.270 Sum_probs=50.5
Q ss_pred cccEEEEEeccceEEEEecCCcceeeeecccCCceEEeec--c-CCCCcEEEEcCCCeEEEE
Q psy1644 733 KYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAP--H-EATGGIIGVNRKGQVLSV 791 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~--~-~~t~Gii~VNrkGqVL~V 791 (923)
.-..+.+-|-.|.|++.|.+.-+.+..-++.+=++|+.+. + +...-|++..|+|.|+.+
T Consensus 194 a~scLViGTE~~~i~iLd~~af~il~~~~lpsvPv~i~~~G~~devdyRI~Va~Rdg~iy~i 255 (257)
T PF14779_consen 194 AVSCLVIGTESGEIYILDPQAFTILKQVQLPSVPVFISVSGQYDEVDYRIVVACRDGKIYTI 255 (257)
T ss_pred CcceEEEEecCCeEEEECchhheeEEEEecCCCceEEEEEeeeeccceEEEEEeCCCEEEEE
Confidence 3456889999999999999999999999999999997665 3 356678888899999986
No 64
>PLN03218 maturation of RBCL 1; Provisional
Probab=60.17 E-value=51 Score=42.96 Aligned_cols=27 Identities=30% Similarity=0.430 Sum_probs=19.4
Q ss_pred eecccCCCcEEEeeccCCCcceEEEecCCcc
Q psy1644 98 YMNRISSDTIFVTAPHEATGGIIGLTNVGIN 128 (923)
Q Consensus 98 y~~rIS~~~if~ta~~~~t~Gil~vn~~G~~ 128 (923)
++.|++++.+ ++| .++|+-++++...|
T Consensus 101 ~~~~~~~~~~--~~~--~~~~~~~~~~~~~~ 127 (1060)
T PLN03218 101 TRRRKSSPLV--ATP--SGGGISTLNREDVQ 127 (1060)
T ss_pred hhccccCccc--ccC--CCcccccccHHHHH
Confidence 4788888754 333 67899999988744
No 65
>TIGR00615 recR recombination protein RecR. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=60.13 E-value=21 Score=38.04 Aligned_cols=91 Identities=16% Similarity=0.315 Sum_probs=64.5
Q ss_pred HHHHHHHhcCCchhhHHHHhhhcC--------------CCCCcCccc-------------------------cchhhhhh
Q psy1644 494 FLLRNVMRVNPDQGVGFAQMLVQD--------------DEPLADINQ-------------------------SYEGLFYF 534 (923)
Q Consensus 494 ~LL~~i~~~nP~~a~efA~~L~~~--------------~~pl~di~~-------------------------~~egLyy~ 534 (923)
.|--++++..++.+.+||..|..- +.+++++.+ .|.|+|+.
T Consensus 26 RlA~~ll~~~~~~~~~la~ai~~~~~~i~~C~~C~~lse~~~C~IC~d~~Rd~~~iCVVE~~~Dv~aiE~~~~y~G~YhV 105 (195)
T TIGR00615 26 RLAFHLLKRDPSEVLRLAQALLEAKENLRTCSVCGAISDQEVCNICSDERRDNSVICVVEDPKDVFALEKTKEFRGRYHV 105 (195)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHHHcCCcCCCCCCCCCCCcCCCCCCCCCCCCEEEEECCHHHHHHHHhhCccceEEEE
Confidence 344456777888888898877651 123454444 57899999
Q ss_pred hcccccccCC---cchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhc
Q psy1644 535 LGSIVNYSQD---QEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEA 586 (923)
Q Consensus 535 Lgsivn~s~d---~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKea 586 (923)
||..++--+. .++.++-+..=.+.++++|| |.--|.--+.|-+-.||++.
T Consensus 106 L~G~iSPldgigp~~l~i~~L~~Ri~~~~v~EV--IlAt~~tvEGe~Ta~yi~~~ 158 (195)
T TIGR00615 106 LGGHISPLDGIGPEDLTIAALLKRLQEESVKEV--ILATNPTVEGEATALYIARL 158 (195)
T ss_pred ccCccCccCCCChhhcCHHHHHHHHhcCCCcEE--EEeCCCCchHHHHHHHHHHH
Confidence 9987765544 46788877776777888888 66667778888888887754
No 66
>PF13041 PPR_2: PPR repeat family
Probab=59.12 E-value=18 Score=29.00 Aligned_cols=33 Identities=30% Similarity=0.659 Sum_probs=26.3
Q ss_pred HHHHHHHhcChhHHHHHH---HHhCCCcc---HHHHHHH
Q psy1644 466 VIQSFAETGQFQKIVLYA---KKVGYSPD---YVFLLRN 498 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~---~~vgytpD---y~~LL~~ 498 (923)
++.++++.|+++++...- ++.|..|| |..||+.
T Consensus 9 li~~~~~~~~~~~a~~l~~~M~~~g~~P~~~Ty~~li~~ 47 (50)
T PF13041_consen 9 LISGYCKAGKFEEALKLFKEMKKRGIKPDSYTYNILING 47 (50)
T ss_pred HHHHHHHCcCHHHHHHHHHHHHHcCCCCCHHHHHHHHHH
Confidence 678999999999998875 45588997 6666664
No 67
>KOG1920|consensus
Probab=57.60 E-value=67 Score=42.22 Aligned_cols=117 Identities=17% Similarity=0.135 Sum_probs=73.9
Q ss_pred CCceEEEEEEecCCccEE---EEEecCCCCCCCCCccc-ceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEEEEe
Q psy1644 15 EPSTLFCFAVRTPAGGKL---HIIEVGTPPAGNTPLAK-KAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYD 90 (923)
Q Consensus 15 ~~stL~cfa~r~~~~gkL---hiiEig~~~~g~~~f~k-K~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~D 90 (923)
+|.+-.||+..+.-.+.+ ..-++|......+.+.| ..+-..|-||.-||+-++||.=....-|.++|+.|-+++.|
T Consensus 17 ~~~~~~~f~l~~~~~~~ass~~~~~~~~~~~ei~~~~kv~~~~~s~~~~~gd~~i~s~~fl~d~~~i~v~~~~G~iilvd 96 (1265)
T KOG1920|consen 17 GPGNAQAFILSPELVTVASSTINELLGLKIFEIESVGKVRLVAESFLPEDGDDEIVSVQFLADTNSICVITALGDIILVD 96 (1265)
T ss_pred CCCchhheeeccccceeeeceeeecccccceeeeccccccceeecccCcCCCcceEEEEEecccceEEEEecCCcEEEEc
Confidence 333566777776633211 11111221222222323 23455678888889999999999999999999999999999
Q ss_pred cccceeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeee
Q psy1644 91 IETATCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVS 133 (923)
Q Consensus 91 let~t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~ 133 (923)
.||+..--++-+.+-+..++ -+...-.+++-+.+.++..++
T Consensus 97 ~et~~~eivg~vd~GI~aas--wS~Dee~l~liT~~~tll~mT 137 (1265)
T KOG1920|consen 97 PETLELEIVGNVDNGISAAS--WSPDEELLALITGRQTLLFMT 137 (1265)
T ss_pred ccccceeeeeeccCceEEEe--ecCCCcEEEEEeCCcEEEEEe
Confidence 99999999988876555444 222334444444433555544
No 68
>TIGR01062 parC_Gneg DNA topoisomerase IV, A subunit, proteobacterial. Operationally, topoisomerase IV is a type II topoisomerase required for the decatenation of chromosome segregation. Not every bacterium has both a topo II and a topo IV. The topo IV families of the Gram-positive bacteria and the Gram-negative bacteria appear not to represent a single clade among the type II topoisomerases, and are represented by separate models for this reason.
Probab=57.52 E-value=1e+02 Score=38.88 Aligned_cols=71 Identities=10% Similarity=0.159 Sum_probs=45.3
Q ss_pred eeeeeccccccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecC
Q psy1644 725 IYVQKVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDE 795 (923)
Q Consensus 725 i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide 795 (923)
+.....-++..++.++||.|||.--..+.-...-+.-=.+|.+.....-+.++=++.++.+|+++.+.+++
T Consensus 485 ~~~~~~i~~e~v~VilTk~G~IKr~~~~~~~~saikLKegD~L~~~~~~~t~d~LllfTs~Gr~yrf~v~e 555 (735)
T TIGR01062 485 VSEIDMIPKEPVTIILSKMGWVRSAKGHDIDLSTLKYKAGDSEKAIIEGKSNQKVVFIDSTGRSYALDPDN 555 (735)
T ss_pred cchhhcccCcceEEEEecCCEEEeccccccchhccCcCCCCeEEEEEEecCCCEEEEEECCCeEEEEEhHh
Confidence 44445567889999999999998665543221111111234433333334455699999999999998775
No 69
>KOG0266|consensus
Probab=57.08 E-value=2.8e+02 Score=32.43 Aligned_cols=73 Identities=18% Similarity=0.203 Sum_probs=42.6
Q ss_pred eEEeccccceEEEEeeeeeEEEEecccceeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeecccceeE
Q psy1644 67 AMQVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDKFIC 146 (923)
Q Consensus 67 amqiS~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic 146 (923)
+.-++.+=-+|.--...|.+++||+++|+|+=.=..-++-| -.+..++.|- .. ++-..|.+|+
T Consensus 251 ~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~i----------s~~~f~~d~~-~l------~s~s~d~~i~ 313 (456)
T KOG0266|consen 251 SVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGI----------SGLAFSPDGN-LL------VSASYDGTIR 313 (456)
T ss_pred EEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCCce----------EEEEECCCCC-EE------EEcCCCccEE
Confidence 33334444578888899999999999988864322222211 1123333331 11 1123588899
Q ss_pred EEecccccce
Q psy1644 147 VREKIAESAQ 156 (923)
Q Consensus 147 ~~~~~~~~~~ 156 (923)
+++....+-+
T Consensus 314 vwd~~~~~~~ 323 (456)
T KOG0266|consen 314 VWDLETGSKL 323 (456)
T ss_pred EEECCCCcee
Confidence 9988877643
No 70
>PLN03077 Protein ECB2; Provisional
Probab=56.94 E-value=50 Score=41.11 Aligned_cols=83 Identities=17% Similarity=0.227 Sum_probs=45.3
Q ss_pred HHHHHHHhcChhHHHHHHHH---hCCCcc---HHHHHHHHHhc-CCchhhHHHHhhhcCCCCCcCccccchhhhhhhccc
Q psy1644 466 VIQSFAETGQFQKIVLYAKK---VGYSPD---YVFLLRNVMRV-NPDQGVGFAQMLVQDDEPLADINQSYEGLFYFLGSI 538 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~---vgytpD---y~~LL~~i~~~-nP~~a~efA~~L~~~~~pl~di~~~~egLyy~Lgsi 538 (923)
+|.++++.|+++++++.-++ .|..|| |..+|....+. +.+.|.++-..+.+.+
T Consensus 259 li~~~~~~g~~~eAl~lf~~M~~~g~~Pd~~ty~~ll~a~~~~g~~~~a~~l~~~~~~~g-------------------- 318 (857)
T PLN03077 259 MISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTG-------------------- 318 (857)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHHcCCCCChhHHHHHHHHHHhcCChHHHHHHHHHHHHhC--------------------
Confidence 67777777877777777544 467776 66666655443 3333333333332222
Q ss_pred ccccCCcchhhHHHHHHHhcCChhhhHHhhhc
Q psy1644 539 VNYSQDQEVHFKYIQAACKTGQIKEVERICRE 570 (923)
Q Consensus 539 vn~s~d~dVhfKYIeAA~k~gq~~EvERicre 570 (923)
+.-|..+.-..|.+.+|.|++.|.+++.++
T Consensus 319 --~~~d~~~~n~Li~~y~k~g~~~~A~~vf~~ 348 (857)
T PLN03077 319 --FAVDVSVCNSLIQMYLSLGSWGEAEKVFSR 348 (857)
T ss_pred --CccchHHHHHHHHHHHhcCCHHHHHHHHhh
Confidence 111223333456666666666666666544
No 71
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=55.59 E-value=2.1e+02 Score=31.96 Aligned_cols=145 Identities=10% Similarity=0.130 Sum_probs=74.3
Q ss_pred ceEEEEEecc-CCCceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccccccc
Q psy1644 230 DKFICVREKI-AESAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKA 306 (923)
Q Consensus 230 DkyICVRE~~-~~~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs 306 (923)
+++..+++.. ++...+.+.|....+. .+.-.-..-+...+|..+-||.-... . ....|.++|+++....+-
T Consensus 156 ~~~~~~~~~~~~~~~~l~~~d~~g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~---~---~~~~i~v~d~~~g~~~~~ 229 (417)
T TIGR02800 156 TRIAYVSKSGKSRRYELQVADYDGANPQTITRSREPILSPAWSPDGQKLAYVSFE---S---GKPEIYVQDLATGQREKV 229 (417)
T ss_pred CEEEEEEEeCCCCcceEEEEcCCCCCCEEeecCCCceecccCCCCCCEEEEEEcC---C---CCcEEEEEECCCCCEEEe
Confidence 4566666543 3445788888753221 11111112244577888777765211 0 235799999987643222
Q ss_pred ccCCCceeEEEeccC-CeEEEEec----CeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeeccccc
Q psy1644 307 HPMTDDVVFWKWISP-NTLALVTE----TSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNR 381 (923)
Q Consensus 307 ~~m~E~VvfWkWis~-~tLglVT~----taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~ 381 (923)
......+..=.|-.+ +.|++... ..+|.|++++.. ..++.. +... ...-.-++|++|++.+.-.
T Consensus 230 ~~~~~~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~~~-~~~l~~-~~~~----~~~~~~s~dg~~l~~~s~~----- 298 (417)
T TIGR02800 230 ASFPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQ-LTRLTN-GPGI----DTEPSWSPDGKSIAFTSDR----- 298 (417)
T ss_pred ecCCCCccceEECCCCCEEEEEECCCCCccEEEEECCCCC-EEECCC-CCCC----CCCEEECCCCCEEEEEECC-----
Confidence 223333333345433 45666543 369999986532 222222 1111 1122446788998876421
Q ss_pred ccceeeecccc
Q psy1644 382 VVGAMQLYSVE 392 (923)
Q Consensus 382 i~G~mQLYS~e 392 (923)
.|..|+|.++
T Consensus 299 -~g~~~iy~~d 308 (417)
T TIGR02800 299 -GGSPQIYMMD 308 (417)
T ss_pred -CCCceEEEEE
Confidence 2445777654
No 72
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=54.66 E-value=41 Score=37.84 Aligned_cols=59 Identities=19% Similarity=0.275 Sum_probs=40.3
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecCC
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDED 796 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide~ 796 (923)
-|.+|+.+..|.++-+|+.||+.+.+...+.-.-++ -..+-++.++.+|+|+.+....-
T Consensus 256 ~~~vy~~~~~g~l~ald~~tG~~~W~~~~~~~~~~~----~~~~~vy~~~~~g~l~ald~~tG 314 (394)
T PRK11138 256 GGVVYALAYNGNLVALDLRSGQIVWKREYGSVNDFA----VDGGRIYLVDQNDRVYALDTRGG 314 (394)
T ss_pred CCEEEEEEcCCeEEEEECCCCCEEEeecCCCccCcE----EECCEEEEEcCCCeEEEEECCCC
Confidence 478999999999999999999999887765421111 11244555666666666665443
No 73
>PRK03629 tolB translocation protein TolB; Provisional
Probab=54.48 E-value=1.7e+02 Score=33.89 Aligned_cols=143 Identities=13% Similarity=0.112 Sum_probs=80.6
Q ss_pred ceEEEEEeccCC--CceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccc--c
Q psy1644 230 DKFICVREKIAE--SAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKS--K 303 (923)
Q Consensus 230 DkyICVRE~~~~--~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~--K 303 (923)
+|.+.|....++ ..++.+.|.+..+. .++-+-..-+.-.+|..+.||.-.. . .....|.++++++.+ +
T Consensus 164 ~riayv~~~~~~~~~~~l~~~d~dg~~~~~lt~~~~~~~~p~wSPDG~~la~~s~-~-----~g~~~i~i~dl~~G~~~~ 237 (429)
T PRK03629 164 TRIAYVVQTNGGQFPYELRVSDYDGYNQFVVHRSPQPLMSPAWSPDGSKLAYVTF-E-----SGRSALVIQTLANGAVRQ 237 (429)
T ss_pred CeEEEEEeeCCCCcceeEEEEcCCCCCCEEeecCCCceeeeEEcCCCCEEEEEEe-c-----CCCcEEEEEECCCCCeEE
Confidence 566666654332 34899999875442 3333323335667788877775310 0 023578899997753 3
Q ss_pred cccccCCCceeEEEeccCC-eEEEEec----CeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecc
Q psy1644 304 MKAHPMTDDVVFWKWISPN-TLALVTE----TSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAV 378 (923)
Q Consensus 304 LKs~~m~E~VvfWkWis~~-tLglVT~----taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~ 378 (923)
+.++ ...+.-=.|-.+. .|+++.. ..+|.|++.+. .+..+++-+ ..+....-++|++|++.+.-.
T Consensus 238 l~~~--~~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg-~~~~lt~~~-----~~~~~~~wSPDG~~I~f~s~~-- 307 (429)
T PRK03629 238 VASF--PRHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASG-QIRQVTDGR-----SNNTEPTWFPDSQNLAYTSDQ-- 307 (429)
T ss_pred ccCC--CCCcCCeEECCCCCEEEEEEcCCCCcEEEEEECCCC-CEEEccCCC-----CCcCceEECCCCCEEEEEeCC--
Confidence 3332 2222223565443 5777643 25999998653 233333222 134566778899998766421
Q ss_pred cccccceeeecccc
Q psy1644 379 SNRVVGAMQLYSVE 392 (923)
Q Consensus 379 ~~~i~G~mQLYS~e 392 (923)
.|..|||.++
T Consensus 308 ----~g~~~Iy~~d 317 (429)
T PRK03629 308 ----AGRPQVYKVN 317 (429)
T ss_pred ----CCCceEEEEE
Confidence 2567888654
No 74
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=54.42 E-value=16 Score=28.32 Aligned_cols=30 Identities=23% Similarity=0.350 Sum_probs=26.9
Q ss_pred ceEEEEeeeeeEEEEecccceeeeecccCC
Q psy1644 75 DVVYLITKFGYIHVYDIETATCIYMNRISS 104 (923)
Q Consensus 75 G~iyliTK~G~lhl~Dlet~t~iy~~rIS~ 104 (923)
|.||+-|..|+|+-+|.+||..+-+-+..+
T Consensus 1 ~~v~~~~~~g~l~AlD~~TG~~~W~~~~~~ 30 (38)
T PF01011_consen 1 GRVYVGTPDGYLYALDAKTGKVLWKFQTGP 30 (38)
T ss_dssp TEEEEETTTSEEEEEETTTTSEEEEEESSS
T ss_pred CEEEEeCCCCEEEEEECCCCCEEEeeeCCC
Confidence 679999999999999999999998877655
No 75
>PTZ00420 coronin; Provisional
Probab=53.66 E-value=1.7e+02 Score=35.90 Aligned_cols=116 Identities=16% Similarity=0.148 Sum_probs=67.8
Q ss_pred ccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccCCeEEEEe--cCeEEEEeccCCCC
Q psy1644 265 SAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISPNTLALVT--ETSVYHWSMEGDST 342 (923)
Q Consensus 265 SAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~~tLglVT--~taVYHWsi~g~s~ 342 (923)
+...||...-+++=|. ...++.|||+++.+.+.+......|.--.|-.+..+-.++ +..|..|++....
T Consensus 130 sVaf~P~g~~iLaSgS--------~DgtIrIWDl~tg~~~~~i~~~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~- 200 (568)
T PTZ00420 130 IIDWNPMNYYIMCSSG--------FDSFVNIWDIENEKRAFQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQE- 200 (568)
T ss_pred EEEECCCCCeEEEEEe--------CCCeEEEEECCCCcEEEEEecCCcEEEEEECCCCCEEEEEecCCEEEEEECCCCc-
Confidence 5567887653332221 2468999999998877766666677777775444443333 4589999986533
Q ss_pred chhhhhcccCCCCCeeEee-eeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 343 PEKMFDRHSTLNGCQIINY-RTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 343 P~K~FdR~~~L~~~QIInY-r~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
+..-|..|..--++..+.. ...++.++++-+|-+.. -.+.+.||...
T Consensus 201 ~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~---~~R~VkLWDlr 248 (568)
T PTZ00420 201 IASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKN---NMREMKLWDLK 248 (568)
T ss_pred EEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCC---CccEEEEEECC
Confidence 3334555543222222222 11356677777775532 12357888764
No 76
>KOG2114|consensus
Probab=52.70 E-value=1.8e+02 Score=37.29 Aligned_cols=130 Identities=19% Similarity=0.212 Sum_probs=81.7
Q ss_pred eccceEEEEecCCcceeeeecccC-CceEEeecc-------------------CCCCcEEEEcCCCeEEEEE-ecCCccc
Q psy1644 741 TKFGYIHVYDIETATCIYMNRISS-DTIFVTAPH-------------------EATGGIIGVNRKGQVLSVS-VDEDNII 799 (923)
Q Consensus 741 TK~GflhlyDleTgt~Iy~nRIS~-dtIFvta~~-------------------~~t~Gii~VNrkGqVL~V~-Ide~~iV 799 (923)
-..|++..+-.++|+..-.-+-|+ .+||-.-.. ..+.+++.+.+.|.|..++ .|-++-+
T Consensus 262 ~~~g~~L~v~~~~~~~~~s~s~ss~~~i~~~~d~~n~~v~ys~vl~~l~d~l~~w~~~~~vltsdg~~~~L~ek~le~kL 341 (933)
T KOG2114|consen 262 FSFGLLLCVTTDKGTENTSLSNSSSNRIFKAYDLRNRYVLYSSVLEDLSDNLIEWSFDCLVLTSDGVVHELIEKDLETKL 341 (933)
T ss_pred EecCEEEEEEccCCCCCcccCccchhheeehhhhcCcccchHHhHHHHHHHHHhcCCcEEEEecCCceeeeeeccHHHHH
Confidence 355777777777777753333332 334422110 1234899999999998775 2333333
Q ss_pred ccccccCCCHHHHHHHHHHcCCCCc--hHHHHHHHHHHHhcCcHHHHHHHH-HhcCChhHHHHHHHhcccCChHHHHh
Q psy1644 800 PYINGTLQNPDLALRIAVRNNLAGA--EDLFVRKFNLLFTNGQYSEAAKAA-CKTGQIKEVERICRESNCYNAERVKN 874 (923)
Q Consensus 800 pYI~~~L~n~~LA~kLA~R~~LPGA--ddLf~~qFn~l~~~G~y~eAAk~A-~ktg~i~e~eri~r~~~~y~~~~vk~ 874 (923)
.-+..+ +=-.+|++||..-+++++ .+++.+==+-|+.+|||++|..-= -.-|-+..-|-|-++ -||+++||
T Consensus 342 ~iL~kK-~ly~~Ai~LAk~~~~d~d~~~~i~~kYgd~Ly~Kgdf~~A~~qYI~tI~~le~s~Vi~kf---Ldaq~Ikn 415 (933)
T KOG2114|consen 342 DILFKK-NLYKVAINLAKSQHLDEDTLAEIHRKYGDYLYGKGDFDEATDQYIETIGFLEPSEVIKKF---LDAQRIKN 415 (933)
T ss_pred HHHHHh-hhHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHcccCChHHHHHHh---cCHHHHHH
Confidence 333322 234799999999999875 355555557899999999998833 334666666655554 36677665
No 77
>PRK04922 tolB translocation protein TolB; Provisional
Probab=52.45 E-value=4.2e+02 Score=30.52 Aligned_cols=91 Identities=12% Similarity=0.097 Sum_probs=46.5
Q ss_pred CCceEEEEECCCCCCCcccccC---ccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEE
Q psy1644 241 ESAQVVIIDMNDPTNPIRRPIS---ADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWK 317 (923)
Q Consensus 241 ~~~qVvIVDL~n~~~v~Rrpi~---ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWk 317 (923)
+..++.++|+.... +.|-... .-+.-.+|..+.||.-.. + ..+..|.+||+++.+. +..+-......=.
T Consensus 314 g~~~iy~~dl~~g~-~~~lt~~g~~~~~~~~SpDG~~Ia~~~~---~---~~~~~I~v~d~~~g~~-~~Lt~~~~~~~p~ 385 (433)
T PRK04922 314 GRPQIYRVAASGGS-AERLTFQGNYNARASVSPDGKKIAMVHG---S---GGQYRIAVMDLSTGSV-RTLTPGSLDESPS 385 (433)
T ss_pred CCceEEEEECCCCC-eEEeecCCCCccCEEECCCCCEEEEEEC---C---CCceeEEEEECCCCCe-EECCCCCCCCCce
Confidence 34588889986643 2221111 113456788887777421 0 1234799999876542 2211111111125
Q ss_pred eccC-CeEEEEec----CeEEEEeccC
Q psy1644 318 WISP-NTLALVTE----TSVYHWSMEG 339 (923)
Q Consensus 318 Wis~-~tLglVT~----taVYHWsi~g 339 (923)
|-.+ +.|...+. ..+|-|+.+|
T Consensus 386 ~spdG~~i~~~s~~~g~~~L~~~~~~g 412 (433)
T PRK04922 386 FAPNGSMVLYATREGGRGVLAAVSTDG 412 (433)
T ss_pred ECCCCCEEEEEEecCCceEEEEEECCC
Confidence 6544 34555554 2577777755
No 78
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=51.25 E-value=82 Score=33.15 Aligned_cols=71 Identities=25% Similarity=0.405 Sum_probs=49.7
Q ss_pred CCCCCCceEEEEEEecCCccEEEEEecCCCCCCCCCcccceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEEEEe
Q psy1644 11 EGNPEPSTLFCFAVRTPAGGKLHIIEVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYD 90 (923)
Q Consensus 11 ~gN~~~stL~cfa~r~~~~gkLhiiEig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~D 90 (923)
.+.......+|.|.|+ +++|.|...+. ..|.+...++..| |-|.+|+.. .+.|.+-++.|| .++|
T Consensus 99 ~~~~~~~~~L~va~kk----~i~i~~~~~~~---~~f~~~~ke~~lp-----~~~~~i~~~--~~~i~v~~~~~f-~~id 163 (275)
T PF00780_consen 99 NGGHEGSRRLCVAVKK----KILIYEWNDPR---NSFSKLLKEISLP-----DPPSSIAFL--GNKICVGTSKGF-YLID 163 (275)
T ss_pred ccccccceEEEEEECC----EEEEEEEECCc---ccccceeEEEEcC-----CCcEEEEEe--CCEEEEEeCCce-EEEe
Confidence 3445556677777777 99999997631 2232344455555 788999998 777888888885 5779
Q ss_pred ccccee
Q psy1644 91 IETATC 96 (923)
Q Consensus 91 let~t~ 96 (923)
++|+..
T Consensus 164 l~~~~~ 169 (275)
T PF00780_consen 164 LNTGSP 169 (275)
T ss_pred cCCCCc
Confidence 997654
No 79
>PRK05560 DNA gyrase subunit A; Validated
Probab=50.26 E-value=1.7e+02 Score=37.35 Aligned_cols=71 Identities=17% Similarity=0.193 Sum_probs=45.9
Q ss_pred eeeeeeccccccEEEEEeccceEEEEecCCcce-------eeeecc-cCCceEEeeccCCCCcEEEEcCCCeEEEEEec
Q psy1644 724 EIYVQKVSAKYDVVYLITKFGYIHVYDIETATC-------IYMNRI-SSDTIFVTAPHEATGGIIGVNRKGQVLSVSVD 794 (923)
Q Consensus 724 ~i~~~qvS~KyGvIyvITK~GflhlyDleTgt~-------Iy~nRI-S~dtIFvta~~~~t~Gii~VNrkGqVL~V~Id 794 (923)
+|.....-+...++.++|+.||+.-.++.+-.. +-..+. -+|.+--...-++.+-++.++.+|+|+.+.+.
T Consensus 488 ~i~~edlI~~E~v~vllS~~GyIKri~~~~~~~~~~~~~g~~~~klKe~D~l~~~~~~~t~d~LllfTs~Grv~~l~v~ 566 (805)
T PRK05560 488 DIDDEDLIPEEDVVVTLTHGGYIKRTPLDEYRAQRRGGKGVSGAKTKEDDFVEHLFVASTHDTLLFFTNRGRVYRLKVY 566 (805)
T ss_pred ccchhhccCCCCEEEEEeCCCEEEEcchhhhhhhcccCCCccccccCCCCeeEEEEEecCCCeEEEEecCCeEEEEEhh
Confidence 344444556789999999999999987755211 111111 12333323333445679999999999999887
No 80
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=48.64 E-value=4.3e+02 Score=29.51 Aligned_cols=121 Identities=11% Similarity=0.203 Sum_probs=63.6
Q ss_pred CCceEEEEECCCCCCC--cccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccc--cccccCCCceeEE
Q psy1644 241 ESAQVVIIDMNDPTNP--IRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSK--MKAHPMTDDVVFW 316 (923)
Q Consensus 241 ~~~qVvIVDL~n~~~v--~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~K--LKs~~m~E~VvfW 316 (923)
+..+|.++|+...... ..-+-..-+...+|..+-|++-... .....|.++|++++.. +..+. ....--
T Consensus 212 ~~~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~l~~~~~~------~~~~~i~~~d~~~~~~~~l~~~~--~~~~~~ 283 (417)
T TIGR02800 212 GKPEIYVQDLATGQREKVASFPGMNGAPAFSPDGSKLAVSLSK------DGNPDIYVMDLDGKQLTRLTNGP--GIDTEP 283 (417)
T ss_pred CCcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEECC------CCCccEEEEECCCCCEEECCCCC--CCCCCE
Confidence 3457888888764321 1111112234568887766653211 0234688888877632 22221 111122
Q ss_pred Eec-cCCeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEee
Q psy1644 317 KWI-SPNTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGI 375 (923)
Q Consensus 317 kWi-s~~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI 375 (923)
.|. +.+.|++++.. .||.|++.+.. .+.+... +....+...+++++|++...-
T Consensus 284 ~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~-~~~l~~~-----~~~~~~~~~spdg~~i~~~~~ 341 (417)
T TIGR02800 284 SWSPDGKSIAFTSDRGGSPQIYMMDADGGE-VRRLTFR-----GGYNASPSWSPDGDLIAFVHR 341 (417)
T ss_pred EECCCCCEEEEEECCCCCceEEEEECCCCC-EEEeecC-----CCCccCeEECCCCCEEEEEEc
Confidence 444 44678888764 69999986532 2223211 222345567888898877653
No 81
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=48.44 E-value=3.9e+02 Score=29.02 Aligned_cols=135 Identities=14% Similarity=0.225 Sum_probs=67.6
Q ss_pred CceEEEEECCCCCCC---ccc---ccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccccccc---------
Q psy1644 242 SAQVVIIDMNDPTNP---IRR---PISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKA--------- 306 (923)
Q Consensus 242 ~~qVvIVDL~n~~~v---~Rr---pi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs--------- 306 (923)
.++|.++|++....+ .+. ....-++.++|..+.+.+-. . ....|-||++++..+++.
T Consensus 101 ~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~-~-------~~~~v~v~d~~~~g~l~~~~~~~~~~~ 172 (330)
T PRK11028 101 ANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPC-L-------KEDRIRLFTLSDDGHLVAQEPAEVTTV 172 (330)
T ss_pred CCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEee-C-------CCCEEEEEEECCCCcccccCCCceecC
Confidence 468889998642211 111 11223456899877665431 1 346899999987555532
Q ss_pred -ccCCCceeEEEeccCCeEEEEe--cCeEEEEeccCCCCchhhhhc---c-cCCCC-CeeEeeeeCCCCCeEEEEeeecc
Q psy1644 307 -HPMTDDVVFWKWISPNTLALVT--ETSVYHWSMEGDSTPEKMFDR---H-STLNG-CQIINYRTDPKQTWLLLIGISAV 378 (923)
Q Consensus 307 -~~m~E~VvfWkWis~~tLglVT--~taVYHWsi~g~s~P~K~FdR---~-~~L~~-~QIInYr~d~~~kW~~LvGI~~~ 378 (923)
...+..+.|+. +.+.+..+. ...|..|+++....-.++... . +...+ ..-..-..+++++|+.+..
T Consensus 173 ~g~~p~~~~~~p--dg~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~---- 246 (330)
T PRK11028 173 EGAGPRHMVFHP--NQQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACD---- 246 (330)
T ss_pred CCCCCceEEECC--CCCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEec----
Confidence 11223444431 223454554 478999998632111111111 1 11111 1122345578888876642
Q ss_pred cccccceeeecccc
Q psy1644 379 SNRVVGAMQLYSVE 392 (923)
Q Consensus 379 ~~~i~G~mQLYS~e 392 (923)
+-.+.+-+|.++
T Consensus 247 --~~~~~I~v~~i~ 258 (330)
T PRK11028 247 --RTASLISVFSVS 258 (330)
T ss_pred --CCCCeEEEEEEe
Confidence 233455555543
No 82
>KOG0291|consensus
Probab=48.38 E-value=7.2e+02 Score=32.05 Aligned_cols=57 Identities=19% Similarity=0.255 Sum_probs=39.8
Q ss_pred eEEeccccceEEEEeeeeeEEEEecccceeeeecccCCCcEEEe-eccCCCcceEEEecCCcceeeeee
Q psy1644 67 AMQVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVT-APHEATGGIIGLTNVGINVGSVSF 134 (923)
Q Consensus 67 amqiS~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~if~t-a~~~~t~Gil~vn~~G~~~~si~~ 134 (923)
+|+.|+-=-+|----..|-|.|||.-||.|+ +| ++++++.--++....|--++|-|+
T Consensus 355 ~l~YSpDgq~iaTG~eDgKVKvWn~~SgfC~-----------vTFteHts~Vt~v~f~~~g~~llssSL 412 (893)
T KOG0291|consen 355 SLAYSPDGQLIATGAEDGKVKVWNTQSGFCF-----------VTFTEHTSGVTAVQFTARGNVLLSSSL 412 (893)
T ss_pred eEEECCCCcEEEeccCCCcEEEEeccCceEE-----------EEeccCCCceEEEEEEecCCEEEEeec
Confidence 4444443334444446799999999999995 34 778888888888888855566666
No 83
>PRK13979 DNA topoisomerase IV subunit A; Provisional
Probab=46.96 E-value=3.2e+02 Score=35.70 Aligned_cols=70 Identities=16% Similarity=0.172 Sum_probs=48.8
Q ss_pred eeeeeeccccccEEEEEeccceEEEEecCCcceeeeec-------ccCCceEEeeccCCCCcEEEEcCCCeEEEEEecC
Q psy1644 724 EIYVQKVSAKYDVVYLITKFGYIHVYDIETATCIYMNR-------ISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDE 795 (923)
Q Consensus 724 ~i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nR-------IS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide 795 (923)
++....+-+...++.++|+.||+.-.++.+=. -.+| -..|.+.-...-++.+-++.++.+|+|+.+.+.+
T Consensus 506 ~i~~edlI~~E~v~v~lS~~GyIKr~~~~~~~--~q~~g~~~~~~ke~D~i~~~~~~~T~d~LL~FTn~Gkvy~ikvy~ 582 (957)
T PRK13979 506 KIDVEELIVVEDVVITLSNEGFIKRIPLKSYN--RSNSNVEDIEYREGDFNKFLIQSNTKDTLLIFTDKGNMYQIKGIN 582 (957)
T ss_pred cCCHhHcCCCcceEEEEecCCEEEEccccccc--cccccccccccCCCCceEEEEEEcCCCEEEEEECCCeEEEEEeee
Confidence 45544566788999999999999998765321 0111 1345544444445678899999999999998765
No 84
>PF08625 Utp13: Utp13 specific WD40 associated domain; InterPro: IPR013934 A large ribonuclear protein complex is required for the processing of the small-ribosomal-subunit rRNA - the small-subunit (SSU) processome [, ]. This preribosomal complex contains the U3 snoRNA and at least 40 proteins, which have the following properties: They are nucleolar. They are able to coimmunoprecipitate with the U3 snoRNA and Mpp10 (a protein specific to the SSU processome). They are required for 18S rRNA biogenesis. There appears to be a linkage between polymerase I transcription and the formation of the SSU processome; as some, but not all, of the SSU processome components are required for pre-rRNA transcription initiation. These SSU processome components have been termed t-Utps. They form a pre-complex with pre-18S rRNA in the absence of snoRNA U3 and other SSU processome components. It has been proposed that the t-Utp complex proteins are both rDNA and rRNA binding proteins that are involved in the initiation of pre18S rRNA transcription. Initially binding to rDNA then associating with the 5' end of the nascent pre18S rRNA. The t-Utpcomplex forms the nucleus around which the rest of the SSU processome components, including snoRNA U3, assemble []. From electron microscopy the SSU processome may correspond to the terminal knobs visualized at the 5' ends of nascent 18S rRNA. Utp13 is a nucleolar protein and component of the small subunit (SSU) processome containing the U3 snoRNA that is involved in processing of pre-18S rRNA []. Upt13 is also a component of the Pwp2 complex that forms part of a stable particle subunit independent of the U3 small nucleolar ribonucleoprotein that is essential for the initial assembly steps of the 90S pre-ribosome []. Components of the Pwp2 complex are: Utp1 (Pwp2), Utp6, Utp12 (Dip2), Utp13, Utp18, and Utp21. The relationship between the Pwp2 complex and the t-Utps complex [] that also associates with the 5' end of nascent pre-18S rRNA is unclear. ; GO: 0006364 rRNA processing, 0032040 small-subunit processome
Probab=45.53 E-value=24 Score=35.54 Aligned_cols=51 Identities=18% Similarity=0.269 Sum_probs=36.0
Q ss_pred HHHHHHHhcCcHHHHHHHHHhcCChhHHHHHHHhcc-------cCChHHHHhHHhhcc
Q psy1644 830 RKFNLLFTNGQYSEAAKAACKTGQIKEVERICRESN-------CYNAERVKNFLKEAK 880 (923)
Q Consensus 830 ~qFn~l~~~G~y~eAAk~A~ktg~i~e~eri~r~~~-------~y~~~~vk~~l~~~k 880 (923)
|+++.++.+|+|.+|.+.|.+.+|-..+=.|.+..- -..-+.+.++++.-+
T Consensus 2 Q~L~N~l~~~~y~~Al~LAl~L~~P~~ll~i~~~~~~~~~~~~~~g~~~l~~~i~~L~ 59 (141)
T PF08625_consen 2 QELSNLLRQKDYKEALRLALKLDHPFRLLKILKDLLETEEDEDSIGSEELDEVIKKLD 59 (141)
T ss_pred chHHHHHHhhhHHHHHHHHHhcCCcHHHHHHHHHHHhcccccccchHHHHHHHHHhcC
Confidence 678999999999999999999998776554444333 112355666666544
No 85
>KOG2280|consensus
Probab=44.03 E-value=8.3e+02 Score=31.52 Aligned_cols=149 Identities=17% Similarity=0.136 Sum_probs=78.9
Q ss_pred HHHHHHHHHHHhhcccchhhhHHHHhH-hcCccceeeeeee-eccccccEEEEEeccceEEEEecCCcceeeeecccCCc
Q psy1644 689 YIEIYVQKIRIKIKQSRFVHDLVLYLY-RNNLQKYIEIYVQ-KVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDT 766 (923)
Q Consensus 689 ll~p~LE~~~~~g~q~~~v~nalakiy-~~~~~~~~~i~~~-qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dt 766 (923)
-|+-=||.+...-=.|-.+|..+-+.. .||+.+--.|-.+ +|++|- +|.++=.+..-.-|-|-..-+-..|=|
T Consensus 669 ~lQ~~Le~q~~~~f~dlSl~dTv~~li~~g~~k~a~ql~~~FkipdKr--~~wLk~~aLa~~~kweeLekfAkskks--- 743 (829)
T KOG2280|consen 669 KLQRTLEDQFGGSFVDLSLHDTVTTLILIGQNKRAEQLKSDFKIPDKR--LWWLKLTALADIKKWEELEKFAKSKKS--- 743 (829)
T ss_pred HHHHHHHHHhccccccCcHHHHHHHHHHccchHHHHHHHHhcCCcchh--hHHHHHHHHHhhhhHHHHHHHHhccCC---
Confidence 345567777776667888999988887 5555533333322 566653 222222222222121111111011100
Q ss_pred eEEeeccCCCCcEEEEcCCCeEEEEEecCCcccccccccC--CCHHHHHHHHHHc-CCCCchHHHHHHHHHHHhcCcHHH
Q psy1644 767 IFVTAPHEATGGIIGVNRKGQVLSVSVDEDNIIPYINGTL--QNPDLALRIAVRN-NLAGAEDLFVRKFNLLFTNGQYSE 843 (923)
Q Consensus 767 IFvta~~~~t~Gii~VNrkGqVL~V~Ide~~iVpYI~~~L--~n~~LA~kLA~R~-~LPGAddLf~~qFn~l~~~G~y~e 843 (923)
--| --||+.--+ .|.+=|.|-..|+ ||+ .+=..++.-|+|+|
T Consensus 744 ---------PIG-------------------y~PFVe~c~~~~n~~EA~KYiprv~~l~-------ekv~ay~~~~~~~e 788 (829)
T KOG2280|consen 744 ---------PIG-------------------YLPFVEACLKQGNKDEAKKYIPRVGGLQ-------EKVKAYLRVGDVKE 788 (829)
T ss_pred ---------CCC-------------------chhHHHHHHhcccHHHHhhhhhccCChH-------HHHHHHHHhccHHH
Confidence 111 235554433 5777777755554 344 56677899999999
Q ss_pred HHHHHHhcCChhHHHHHHHhcccC--ChHHHHhHHh
Q psy1644 844 AAKAACKTGQIKEVERICRESNCY--NAERVKNFLK 877 (923)
Q Consensus 844 AAk~A~ktg~i~e~eri~r~~~~y--~~~~vk~~l~ 877 (923)
||+.|++....-++=.+.+.+.-- ..-++|++..
T Consensus 789 Aad~A~~~rd~~~L~ev~~~~~~~~~~~~~~~~~~~ 824 (829)
T KOG2280|consen 789 AADLAAEHRDGAELSEVLSKCTGAPDGATALKIQDQ 824 (829)
T ss_pred HHHHHHHhcChHHHHHHHHhcCCCCccchhhhhHhh
Confidence 999999876655554444433321 2334555443
No 86
>PRK05137 tolB translocation protein TolB; Provisional
Probab=43.88 E-value=5.1e+02 Score=29.81 Aligned_cols=131 Identities=8% Similarity=0.119 Sum_probs=69.2
Q ss_pred ceEEEEEeccCCCceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccc
Q psy1644 230 DKFICVREKIAESAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAH 307 (923)
Q Consensus 230 DkyICVRE~~~~~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~ 307 (923)
.+.+.+.... +..+|.++|+..... .+..+-..-+...+|..+-||+-.... ....|.++|++++.. +..
T Consensus 214 ~~lay~s~~~-g~~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~------g~~~Iy~~d~~~~~~-~~L 285 (435)
T PRK05137 214 QEITYMSYAN-GRPRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRKVVMSLSQG------GNTDIYTMDLRSGTT-TRL 285 (435)
T ss_pred CEEEEEEecC-CCCEEEEEECCCCcEEEeecCCCcccCcEECCCCCEEEEEEecC------CCceEEEEECCCCce-EEc
Confidence 3444444333 346899999876532 111121222456788887776542110 123466667776532 222
Q ss_pred cCCC-ceeEEEecc-CCeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEe
Q psy1644 308 PMTD-DVVFWKWIS-PNTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIG 374 (923)
Q Consensus 308 ~m~E-~VvfWkWis-~~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvG 374 (923)
+-.. ....=.|-. .+.|+.++.. .+|.|+++|.. +..++.- +..+.+..-++|++|++.+.
T Consensus 286 t~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~-~~~lt~~-----~~~~~~~~~SpdG~~ia~~~ 352 (435)
T PRK05137 286 TDSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSN-PRRISFG-----GGRYSTPVWSPRGDLIAFTK 352 (435)
T ss_pred cCCCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCC-eEEeecC-----CCcccCeEECCCCCEEEEEE
Confidence 2111 122235543 3568888754 69999986542 3444421 11233456778889988776
No 87
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=43.85 E-value=37 Score=40.67 Aligned_cols=34 Identities=21% Similarity=0.366 Sum_probs=31.0
Q ss_pred ccccccEEEEEeccceEEEEecCCcceeeeeccc
Q psy1644 730 VSAKYDVVYLITKFGYIHVYDIETATCIYMNRIS 763 (923)
Q Consensus 730 vS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS 763 (923)
+..+-|++|+.|+.|+++.||.+||..|+..|+-
T Consensus 468 l~t~g~lvf~g~~~G~l~a~D~~TGe~lw~~~~g 501 (527)
T TIGR03075 468 LATAGDLVFYGTLEGYFKAFDAKTGEELWKFKTG 501 (527)
T ss_pred eEECCcEEEEECCCCeEEEEECCCCCEeEEEeCC
Confidence 5567789999999999999999999999999984
No 88
>KOG2055|consensus
Probab=43.70 E-value=96 Score=37.20 Aligned_cols=114 Identities=16% Similarity=0.306 Sum_probs=73.0
Q ss_pred EEEEECCCCC-CCcccccCcccccc-----CCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEe
Q psy1644 245 VVIIDMNDPT-NPIRRPISADSAIM-----NPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKW 318 (923)
Q Consensus 245 VvIVDL~n~~-~v~Rrpi~ADSAIM-----nP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkW 318 (923)
..+.||.... ....+|-.-+--.| .|.+++||+-|+ ...|.+-..-|++=+.++.++--|-=.+|
T Consensus 282 ~ysyDle~ak~~k~~~~~g~e~~~~e~FeVShd~~fia~~G~---------~G~I~lLhakT~eli~s~KieG~v~~~~f 352 (514)
T KOG2055|consen 282 LYSYDLETAKVTKLKPPYGVEEKSMERFEVSHDSNFIAIAGN---------NGHIHLLHAKTKELITSFKIEGVVSDFTF 352 (514)
T ss_pred EEEeeccccccccccCCCCcccchhheeEecCCCCeEEEccc---------CceEEeehhhhhhhhheeeeccEEeeEEE
Confidence 3455665433 22333433343333 377889999863 35788888888888999999998988999
Q ss_pred ccCCeEEEEe--cCeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEE
Q psy1644 319 ISPNTLALVT--ETSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLL 371 (923)
Q Consensus 319 is~~tLglVT--~taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~ 371 (923)
-|+.+.-++. ...||||+++.++ -..-|--.+.+.++ ....+.++.|++
T Consensus 353 sSdsk~l~~~~~~GeV~v~nl~~~~-~~~rf~D~G~v~gt---s~~~S~ng~ylA 403 (514)
T KOG2055|consen 353 SSDSKELLASGGTGEVYVWNLRQNS-CLHRFVDDGSVHGT---SLCISLNGSYLA 403 (514)
T ss_pred ecCCcEEEEEcCCceEEEEecCCcc-eEEEEeecCcccee---eeeecCCCceEE
Confidence 9998887777 6699999997542 11123223333332 334455666764
No 89
>KOG0270|consensus
Probab=43.67 E-value=6.7e+02 Score=30.33 Aligned_cols=54 Identities=22% Similarity=0.436 Sum_probs=35.7
Q ss_pred CCeEEEEecccccc------------cccccCCCceeEEEeccCCeEEEEecCe-EEEEeccCCCCchhhhhcc
Q psy1644 290 PKTLQIFNIEMKSK------------MKAHPMTDDVVFWKWISPNTLALVTETS-VYHWSMEGDSTPEKMFDRH 350 (923)
Q Consensus 290 g~~LQIFNle~K~K------------LKs~~m~E~VvfWkWis~~tLglVT~ta-VYHWsi~g~s~P~K~FdR~ 350 (923)
-+++.+|+++--+- +.+++.+.++-|| +|+=.+++ +-.|++..+++-+|-|.+.
T Consensus 395 d~~Vklw~~~~~~~~~v~~~~~~~~rl~c~~~~~~~a~~-------la~GG~k~~~~vwd~~~~~~V~kaF~~~ 461 (463)
T KOG0270|consen 395 DKVVKLWKFDVDSPKSVKEHSFKLGRLHCFALDPDVAFT-------LAFGGEKAVLRVWDIFTNSPVRKAFGSR 461 (463)
T ss_pred cceEEEEeecCCCCcccccccccccceeecccCCCcceE-------EEecCccceEEEeecccChhHHHhhccc
Confidence 46788888764332 4455667777675 55555556 5679988888888888653
No 90
>PRK00178 tolB translocation protein TolB; Provisional
Probab=42.58 E-value=3.9e+02 Score=30.43 Aligned_cols=133 Identities=14% Similarity=0.107 Sum_probs=70.2
Q ss_pred CceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEec
Q psy1644 242 SAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWI 319 (923)
Q Consensus 242 ~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWi 319 (923)
..++.++|.+..+. .+...-..-+...+|..+-||.-.. . + ....|.++|+++.+.-+-......+..-.|-
T Consensus 178 ~~~l~~~d~~g~~~~~l~~~~~~~~~p~wSpDG~~la~~s~-~--~---~~~~l~~~~l~~g~~~~l~~~~g~~~~~~~S 251 (430)
T PRK00178 178 RYTLQRSDYDGARAVTLLQSREPILSPRWSPDGKRIAYVSF-E--Q---KRPRIFVQNLDTGRREQITNFEGLNGAPAWS 251 (430)
T ss_pred ceEEEEECCCCCCceEEecCCCceeeeeECCCCCEEEEEEc-C--C---CCCEEEEEECCCCCEEEccCCCCCcCCeEEC
Confidence 34788889875431 1111111123455677776665421 0 1 2357899999876432211222233345675
Q ss_pred cC-CeEEEEec----CeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 320 SP-NTLALVTE----TSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 320 s~-~tLglVT~----taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
.+ +.|++... ..+|.|++.+.. ..++.+ +. .-.....-++|++|++... . -.|..|+|..+
T Consensus 252 pDG~~la~~~~~~g~~~Iy~~d~~~~~-~~~lt~-~~----~~~~~~~~spDg~~i~f~s--~----~~g~~~iy~~d 317 (430)
T PRK00178 252 PDGSKLAFVLSKDGNPEIYVMDLASRQ-LSRVTN-HP----AIDTEPFWGKDGRTLYFTS--D----RGGKPQIYKVN 317 (430)
T ss_pred CCCCEEEEEEccCCCceEEEEECCCCC-eEEccc-CC----CCcCCeEECCCCCEEEEEE--C----CCCCceEEEEE
Confidence 43 56777653 268999886532 222332 11 1123456688899887653 1 23556787654
No 91
>KOG0282|consensus
Probab=41.95 E-value=38 Score=40.44 Aligned_cols=58 Identities=7% Similarity=0.073 Sum_probs=40.3
Q ss_pred cceEEEEecCCcceeeeecccCCceEEeeccCCC-CcEEEEcCCCeEEEEEecCCccccc
Q psy1644 743 FGYIHVYDIETATCIYMNRISSDTIFVTAPHEAT-GGIIGVNRKGQVLSVSVDEDNIIPY 801 (923)
Q Consensus 743 ~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t-~Gii~VNrkGqVL~V~Ide~~iVpY 801 (923)
.+||++||.|||+|+.+=+ ++.++++--.+.+. +=+++-.-+|.|++.-|....||.=
T Consensus 279 D~~lKlwDtETG~~~~~f~-~~~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~kvvqe 337 (503)
T KOG0282|consen 279 DRFLKLWDTETGQVLSRFH-LDKVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGKVVQE 337 (503)
T ss_pred ceeeeeeccccceEEEEEe-cCCCceeeecCCCCCcEEEEecCCCcEEEEeccchHHHHH
Confidence 4899999999999997654 34555555555544 3344445788888888877776643
No 92
>KOG3678|consensus
Probab=41.50 E-value=20 Score=42.79 Aligned_cols=104 Identities=23% Similarity=0.361 Sum_probs=69.3
Q ss_pred ccccccchhhHHH----------HHHHHHHhcChhHHHHHHHHhCCCccHHHHHHHHHhcCCchhhHHHHhhhcCC-CCC
Q psy1644 453 HESLELCRPVLAQ----------VIQSFAETGQFQKIVLYAKKVGYSPDYVFLLRNVMRVNPDQGVGFAQMLVQDD-EPL 521 (923)
Q Consensus 453 ~ESlEL~rpVL~Q----------Vi~clae~Gqfdkiv~Y~~~vgytpDy~~LL~~i~~~nP~~a~efA~~L~~~~-~pl 521 (923)
-|..|++|.|+.= +.+.|.++|-.|.++-.|++.. | .||+| -|..++.++.... ...
T Consensus 233 ~e~~e~aR~~~~il~~mFKHSeet~~~Lvaa~~lD~vl~~~rRt~--P---~lLRH-------~ALAL~N~~L~~~~a~q 300 (832)
T KOG3678|consen 233 REPVELARSVAGILEHMFKHSEETCQRLVAAGGLDAVLYWCRRTD--P---ALLRH-------CALALGNCALHGGQAVQ 300 (832)
T ss_pred cCcHHHHHHHHHHHHHHhhhhHHHHHHHHhhcccchheeecccCC--H---HHHHH-------HHHHhhhhhhhchhHHH
Confidence 4788999998864 8999999999999999999865 4 46776 3455666665542 000
Q ss_pred c-Ccc-ccchhhhhhhcccccccCCcchhhHHHHHHHhcCChhhhHHhhhcCCc
Q psy1644 522 A-DIN-QSYEGLFYFLGSIVNYSQDQEVHFKYIQAACKTGQIKEVERICRESNC 573 (923)
Q Consensus 522 ~-di~-~~~egLyy~Lgsivn~s~d~dVhfKYIeAA~k~gq~~EvERicresn~ 573 (923)
- =++ +..|.||+. .++.|.-..+.---|.|-.--.|||||-+|.|.-
T Consensus 301 rrmveKr~~EWLF~L-----A~skDel~R~~AClAV~vlat~KE~E~~VrkS~T 349 (832)
T KOG3678|consen 301 RRMVEKRAAEWLFPL-----AFSKDELLRLHACLAVAVLATNKEVEREVRKSGT 349 (832)
T ss_pred HHHHHhhhhhhhhhh-----hcchHHHHHHHHHHHHhhhhhhhhhhHHHhhccc
Confidence 0 011 166888654 2334444444333355667788999999999865
No 93
>KOG0315|consensus
Probab=41.05 E-value=3e+02 Score=31.25 Aligned_cols=123 Identities=15% Similarity=0.296 Sum_probs=79.7
Q ss_pred ccCCCceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccccccccc-CCCcee
Q psy1644 238 KIAESAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHP-MTDDVV 314 (923)
Q Consensus 238 ~~~~~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~-m~E~Vv 314 (923)
++++...|-|.||.++.- .-+.|-.--+...||++-=+-.= | +...+.||||.+-+ .+++ ++|+-+
T Consensus 100 TgseDgt~kIWdlR~~~~qR~~~~~spVn~vvlhpnQteLis~-----d----qsg~irvWDl~~~~--c~~~liPe~~~ 168 (311)
T KOG0315|consen 100 TGSEDGTVKIWDLRSLSCQRNYQHNSPVNTVVLHPNQTELISG-----D----QSGNIRVWDLGENS--CTHELIPEDDT 168 (311)
T ss_pred ecCCCceEEEEeccCcccchhccCCCCcceEEecCCcceEEee-----c----CCCcEEEEEccCCc--cccccCCCCCc
Confidence 345667999999999642 23344445678899997433322 1 34589999996553 3444 488777
Q ss_pred EEEecc----CCeEEEE-ecCeEEEEeccCC-----CCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEee
Q psy1644 315 FWKWIS----PNTLALV-TETSVYHWSMEGD-----STPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGI 375 (923)
Q Consensus 315 fWkWis----~~tLglV-T~taVYHWsi~g~-----s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI 375 (923)
+-+=+. -+.++-+ +...-|.|++-++ -.|+.-|.-| +.-|..-+-++|.|-++-++-
T Consensus 169 ~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah----~~~il~C~lSPd~k~lat~ss 235 (311)
T KOG0315|consen 169 SIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAH----NGHILRCLLSPDVKYLATCSS 235 (311)
T ss_pred ceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheecc----cceEEEEEECCCCcEEEeecC
Confidence 665442 2344444 4448999998442 2566555444 345888899999999988874
No 94
>KOG0290|consensus
Probab=40.53 E-value=87 Score=35.83 Aligned_cols=160 Identities=23% Similarity=0.340 Sum_probs=86.6
Q ss_pred CCCCCCceEEEEEEecCCccEEEEEecCCCC-C--CCCCcccceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEE
Q psy1644 11 EGNPEPSTLFCFAVRTPAGGKLHIIEVGTPP-A--GNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIH 87 (923)
Q Consensus 11 ~gN~~~stL~cfa~r~~~~gkLhiiEig~~~-~--g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lh 87 (923)
++|+-..++++-++..+.. ++-=+|.|... . -=..-.|.+.|+.|.-.-.+ +.--+...|-|.
T Consensus 157 DWne~dp~~igtSSiDTTC-TiWdie~~~~~~vkTQLIAHDKEV~DIaf~~~s~~-------------~FASvgaDGSvR 222 (364)
T KOG0290|consen 157 DWNEVDPNLIGTSSIDTTC-TIWDIETGVSGTVKTQLIAHDKEVYDIAFLKGSRD-------------VFASVGADGSVR 222 (364)
T ss_pred ccccCCcceeEeecccCeE-EEEEEeeccccceeeEEEecCcceeEEEeccCccc-------------eEEEecCCCcEE
Confidence 6676666777666655532 22233333110 0 00123355666666542222 112245678999
Q ss_pred EEec---ccceeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeecccceeEEEecccccceEEEEecCC
Q psy1644 88 VYDI---ETATCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMND 164 (923)
Q Consensus 88 l~Dl---et~t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sdk~ic~~~~~~~~~~vviid~~~ 164 (923)
+||+ |-+|+||...=++...+=-+.+ +. .|..+ .|..| |++.|+|+||..
T Consensus 223 mFDLR~leHSTIIYE~p~~~~pLlRLswn----------kq--Dpnym--ATf~~-------------dS~~V~iLDiR~ 275 (364)
T KOG0290|consen 223 MFDLRSLEHSTIIYEDPSPSTPLLRLSWN----------KQ--DPNYM--ATFAM-------------DSNKVVILDIRV 275 (364)
T ss_pred EEEecccccceEEecCCCCCCcceeeccC----------cC--CchHH--hhhhc-------------CCceEEEEEecC
Confidence 9996 5689999988765444433222 22 23322 23333 456799999988
Q ss_pred CCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCcccccceeeecceEEEEEeccCCCce
Q psy1644 165 PTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQ 244 (923)
Q Consensus 165 ~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~F~~lTmESDkyICVRE~~~~~~q 244 (923)
|..| |.-|+|--+.-| || .+ -=.|-+.||- .|+..|
T Consensus 276 P~tp-----------------va~L~~H~a~VN----------------gI-----aW---aPhS~~hict---aGDD~q 311 (364)
T KOG0290|consen 276 PCTP-----------------VARLRNHQASVN----------------GI-----AW---APHSSSHICT---AGDDCQ 311 (364)
T ss_pred CCcc-----------------eehhhcCccccc----------------ce-----Ee---cCCCCceeee---cCCcce
Confidence 7555 334555433221 11 11 0135567774 578889
Q ss_pred EEEEECCCCCC
Q psy1644 245 VVIIDMNDPTN 255 (923)
Q Consensus 245 VvIVDL~n~~~ 255 (923)
+.|.||.....
T Consensus 312 aliWDl~q~~~ 322 (364)
T KOG0290|consen 312 ALIWDLQQMPR 322 (364)
T ss_pred EEEEecccccc
Confidence 99999986543
No 95
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=40.11 E-value=42 Score=23.68 Aligned_cols=26 Identities=23% Similarity=0.503 Sum_probs=19.5
Q ss_pred HHHHHHHhcChhHHHHHHHH---hCCCcc
Q psy1644 466 VIQSFAETGQFQKIVLYAKK---VGYSPD 491 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~---vgytpD 491 (923)
++.++++.|++++++..-++ .|..||
T Consensus 6 li~~~~~~~~~~~a~~~~~~M~~~g~~p~ 34 (35)
T TIGR00756 6 LIDGLCKAGRVEEALELFKEMLERGIEPD 34 (35)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHcCCCCC
Confidence 57899999999999887543 355554
No 96
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=39.89 E-value=4.3e+02 Score=27.40 Aligned_cols=119 Identities=14% Similarity=0.240 Sum_probs=76.0
Q ss_pred CccccccCCC--ccEEEEeccCCCCC-CCCCCCeEEEEecccc-cccccccCCC--ceeEEEecc-CCeEEEEec---Ce
Q psy1644 262 SADSAIMNPA--SKVIALKGKAGNDN-NPNAPKTLQIFNIEMK-SKMKAHPMTD--DVVFWKWIS-PNTLALVTE---TS 331 (923)
Q Consensus 262 ~ADSAIMnP~--~~IIALra~~~~~n-~~~~g~~LQIFNle~K-~KLKs~~m~E--~VvfWkWis-~~tLglVT~---ta 331 (923)
+|+.|=||+. ..-++++....-|. +..-.....+|-++.+ .......+.. +|.=-.|=. .+.+|+++. ..
T Consensus 5 ~~~~~~~~W~~~G~~l~~~~~~~~~~~~ks~~~~~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~~~~ 84 (194)
T PF08662_consen 5 NVDDAKLHWQPSGDYLLVKVQTRVDKSGKSYYGEFELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSMPAK 84 (194)
T ss_pred ccceEEEEecccCCEEEEEEEEeeccCcceEEeeEEEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEccCCcc
Confidence 4677777665 56888885421111 1111345889999555 4555566633 588888864 367888864 47
Q ss_pred EEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 332 VYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 332 VYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
|-.|++. ..+..-|.. .++-.-.=+++++++++.|.. ...|.|++|.++
T Consensus 85 v~lyd~~--~~~i~~~~~------~~~n~i~wsP~G~~l~~~g~~----n~~G~l~~wd~~ 133 (194)
T PF08662_consen 85 VTLYDVK--GKKIFSFGT------QPRNTISWSPDGRFLVLAGFG----NLNGDLEFWDVR 133 (194)
T ss_pred cEEEcCc--ccEeEeecC------CCceEEEECCCCCEEEEEEcc----CCCcEEEEEECC
Confidence 8889885 334433432 222234578999999999975 355899999987
No 97
>PRK01742 tolB translocation protein TolB; Provisional
Probab=39.60 E-value=5.3e+02 Score=29.74 Aligned_cols=143 Identities=14% Similarity=0.163 Sum_probs=77.9
Q ss_pred ceEEEEEeccCC--CceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccc--c
Q psy1644 230 DKFICVREKIAE--SAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKS--K 303 (923)
Q Consensus 230 DkyICVRE~~~~--~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~--K 303 (923)
.|...|.+..++ ..++.+.|.+..+. .++-.-..-+...+|..+.||--.. . + ....|.++|+++.+ +
T Consensus 169 ~ria~v~~~~~~~~~~~i~i~d~dg~~~~~lt~~~~~v~~p~wSPDG~~la~~s~-~--~---~~~~i~i~dl~tg~~~~ 242 (429)
T PRK01742 169 TRIAYVVQKNGGSQPYEVRVADYDGFNQFIVNRSSQPLMSPAWSPDGSKLAYVSF-E--N---KKSQLVVHDLRSGARKV 242 (429)
T ss_pred CEEEEEEEEcCCCceEEEEEECCCCCCceEeccCCCccccceEcCCCCEEEEEEe-c--C---CCcEEEEEeCCCCceEE
Confidence 455555554432 34788888755431 1111111234556777777776421 1 1 23579999998764 3
Q ss_pred cccccCCCceeEEEeccC-CeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecc
Q psy1644 304 MKAHPMTDDVVFWKWISP-NTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAV 378 (923)
Q Consensus 304 LKs~~m~E~VvfWkWis~-~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~ 378 (923)
+.++. .....=+|-.+ +.|+..... .+|.|++.+.. +.++.. + ...+.+..-++|++|++.++-
T Consensus 243 l~~~~--g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~-~~~lt~-~----~~~~~~~~wSpDG~~i~f~s~--- 311 (429)
T PRK01742 243 VASFR--GHNGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGT-PSQLTS-G----AGNNTEPSWSPDGQSILFTSD--- 311 (429)
T ss_pred EecCC--CccCceeECCCCCEEEEEEecCCcEEEEEEECCCCC-eEeecc-C----CCCcCCEEECCCCCEEEEEEC---
Confidence 44332 12222366544 467776532 47888885432 333332 1 123456677889999877631
Q ss_pred cccccceeeecccc
Q psy1644 379 SNRVVGAMQLYSVE 392 (923)
Q Consensus 379 ~~~i~G~mQLYS~e 392 (923)
-.|..|+|.++
T Consensus 312 ---~~g~~~I~~~~ 322 (429)
T PRK01742 312 ---RSGSPQVYRMS 322 (429)
T ss_pred ---CCCCceEEEEE
Confidence 23667888865
No 98
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=38.37 E-value=48 Score=27.65 Aligned_cols=31 Identities=26% Similarity=0.328 Sum_probs=28.4
Q ss_pred eEEeccccceEEEEeeeeeEEEEecccceeee
Q psy1644 67 AMQVSAKYDVVYLITKFGYIHVYDIETATCIY 98 (923)
Q Consensus 67 amqiS~k~G~iyliTK~G~lhl~Dlet~t~iy 98 (923)
.|.-|++-++|-+.|..|-|.||.+ ++..|+
T Consensus 16 ~~~w~P~mdLiA~~t~~g~v~v~Rl-~~qriw 46 (47)
T PF12894_consen 16 CMSWCPTMDLIALGTEDGEVLVYRL-NWQRIW 46 (47)
T ss_pred EEEECCCCCEEEEEECCCeEEEEEC-CCcCcc
Confidence 7888999999999999999999999 888776
No 99
>PRK05561 DNA topoisomerase IV subunit A; Validated
Probab=38.26 E-value=1.1e+02 Score=38.53 Aligned_cols=71 Identities=17% Similarity=0.232 Sum_probs=46.5
Q ss_pred eeeeeeccccccEEEEEeccceEEEEecCCcceeeeecccCCce-EEeeccCCCCcEEEEcCCCeEEEEEecC
Q psy1644 724 EIYVQKVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTI-FVTAPHEATGGIIGVNRKGQVLSVSVDE 795 (923)
Q Consensus 724 ~i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtI-Fvta~~~~t~Gii~VNrkGqVL~V~Ide 795 (923)
+|....+-+...++.++||.||+.-..+.+-...-+.--.+|.+ ++... +..+-++.++++|+++.+.+++
T Consensus 497 ~i~~edlI~~e~~lVllTk~GyIKR~~l~~f~~~aikLke~D~Lv~v~~~-~t~d~LllfT~~Grv~r~~~~e 568 (742)
T PRK05561 497 AIDEEALIPDEPVTVVLSKKGWVRRAKGHSIDASGLSFKEGDSLLFAFEA-RTTDKLLLFTSTGRVYSLPVHE 568 (742)
T ss_pred ccchhhcccCcceEEEEecCCEEEeccchhhhhhccccCCCCeEEEEEEe-cCCCeEEEEECCCcEEEeEHHh
Confidence 34444556778889999999999987765533211111123433 33322 3345699999999999999987
No 100
>KOG1036|consensus
Probab=37.76 E-value=5.7e+02 Score=29.58 Aligned_cols=185 Identities=19% Similarity=0.184 Sum_probs=118.3
Q ss_pred eeecCCCCCCceEEEEEEecC---Ccc---EEEEEecC-CCCCCCCCcccceeeeecCCCccCCcceeEEeccccceEEE
Q psy1644 7 SFKSEGNPEPSTLFCFAVRTP---AGG---KLHIIEVG-TPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYL 79 (923)
Q Consensus 7 sfkl~gN~~~stL~cfa~r~~---~~g---kLhiiEig-~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyl 79 (923)
++++..+..+...++|+--.- .|| ++.+...- ....|..-=+||+. +|.++ =..|-|
T Consensus 87 ~~~igth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~~~~~~~~d~~kkVy--------------~~~v~--g~~LvV 150 (323)
T KOG1036|consen 87 EDQIGTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRNKVVVGTFDQGKKVY--------------CMDVS--GNRLVV 150 (323)
T ss_pred ceeeccCCCceEEEEeeccCCeEEEcccCccEEEEeccccccccccccCceEE--------------EEecc--CCEEEE
Confidence 456778888999999981111 233 55555542 11222222223322 22222 234555
Q ss_pred EeeeeeEEEEecccceeeeecccCCCc--EEEeeccCCCcceEEEecCCcceeeeeeece-eecccce--eEEEeccccc
Q psy1644 80 ITKFGYIHVYDIETATCIYMNRISSDT--IFVTAPHEATGGIIGLTNVGINVGSVSFNTL-TMESDKF--ICVREKIAES 154 (923)
Q Consensus 80 iTK~G~lhl~Dlet~t~iy~~rIS~~~--if~ta~~~~t~Gil~vn~~G~~~~si~~~t~-tm~sdk~--ic~~~~~~~~ 154 (923)
-|+.--+++||+-....-++-|=|+=- |=+.+.....+|+..=+-.| ++.-==|.+. .++|-|| =|-|.+.+|.
T Consensus 151 g~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~pn~eGy~~sSieG-RVavE~~d~s~~~~skkyaFkCHr~~~~~~ 229 (323)
T KOG1036|consen 151 GTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVPNGEGYVVSSIEG-RVAVEYFDDSEEAQSKKYAFKCHRLSEKDT 229 (323)
T ss_pred eecCceEEEEEcccccchhhhccccceeEEEEEEEecCCCceEEEeecc-eEEEEccCCchHHhhhceeEEeeecccCCc
Confidence 599999999999999999888888832 33335566889999888888 7765556555 7888886 5999998875
Q ss_pred ceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccccccccCCccccc
Q psy1644 155 AQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLTNVGINVGSVSFN 223 (923)
Q Consensus 155 ~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt~lGI~~~si~F~ 223 (923)
.++. |-|. +=.||-..-+|==|+-|.-|-=++.-.+.+.||...-.+-+++.|+
T Consensus 230 ~~~y------PVNa---------i~Fhp~~~tfaTgGsDG~V~~Wd~~~rKrl~q~~~~~~SI~slsfs 283 (323)
T KOG1036|consen 230 EIIY------PVNA---------IAFHPIHGTFATGGSDGIVNIWDLFNRKRLKQLAKYETSISSLSFS 283 (323)
T ss_pred eEEE------Eece---------eEeccccceEEecCCCceEEEccCcchhhhhhccCCCCceEEEEec
Confidence 4443 3333 3458888888877777766655555555666777764455555554
No 101
>KOG0306|consensus
Probab=37.74 E-value=1e+03 Score=30.78 Aligned_cols=154 Identities=19% Similarity=0.347 Sum_probs=92.4
Q ss_pred eeecccc-ccccCCcccccceeeecc----eEEEEEecc-----CCCceEEEEECCCCCCCcccccCccc---cccCCCc
Q psy1644 206 RIYLLTN-VGINVGSVSFNTLTMESD----KFICVREKI-----AESAQVVIIDMNDPTNPIRRPISADS---AIMNPAS 272 (923)
Q Consensus 206 e~~~Lt~-lGI~~~si~F~~lTmESD----kyICVRE~~-----~~~~qVvIVDL~n~~~v~Rrpi~ADS---AIMnP~~ 272 (923)
|...|.. .+..|+..+-+++++.+. +-+|+-+.. +.+.++-|++-....- + |-|..+. +-|=|-.
T Consensus 347 ~~ysl~~s~~~~p~~~~~~~i~~~GHR~dVRsl~vS~d~~~~~Sga~~SikiWn~~t~kc-i-RTi~~~y~l~~~Fvpgd 424 (888)
T KOG0306|consen 347 EWYSLENSGKTSPEADRTSNIEIGGHRSDVRSLCVSSDSILLASGAGESIKIWNRDTLKC-I-RTITCGYILASKFVPGD 424 (888)
T ss_pred EEEEeccCCCCCccccccceeeeccchhheeEEEeecCceeeeecCCCcEEEEEccCcce-e-EEeccccEEEEEecCCC
Confidence 3444444 566677777777777654 457777654 4556888888664432 2 3355443 3355665
Q ss_pred cEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEE-eccCCeEEEEecC---eEEEEecc---C-CCCch
Q psy1644 273 KVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWK-WISPNTLALVTET---SVYHWSME---G-DSTPE 344 (923)
Q Consensus 273 ~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWk-Wis~~tLglVT~t---aVYHWsi~---g-~s~P~ 344 (923)
+-|.+=. ...-||+|++-+-.-+....-.|. ..|. =++++.=|+||.. .|-.|+.+ . ++.-.
T Consensus 425 ~~Iv~G~---------k~Gel~vfdlaS~~l~Eti~AHdg-aIWsi~~~pD~~g~vT~saDktVkfWdf~l~~~~~gt~~ 494 (888)
T KOG0306|consen 425 RYIVLGT---------KNGELQVFDLASASLVETIRAHDG-AIWSISLSPDNKGFVTGSADKTVKFWDFKLVVSVPGTQK 494 (888)
T ss_pred ceEEEec---------cCCceEEEEeehhhhhhhhhcccc-ceeeeeecCCCCceEEecCCcEEEEEeEEEEeccCcccc
Confidence 5444431 124799999977765555443233 3455 3678888999975 79999863 1 12223
Q ss_pred hhhh----cccCCCCCeeEeeeeCCCCCeEEE
Q psy1644 345 KMFD----RHSTLNGCQIINYRTDPKQTWLLL 372 (923)
Q Consensus 345 K~Fd----R~~~L~~~QIInYr~d~~~kW~~L 372 (923)
|+.. |.=.| .-+|..-+.++|+|.+++
T Consensus 495 k~lsl~~~rtLel-~ddvL~v~~Spdgk~LaV 525 (888)
T KOG0306|consen 495 KVLSLKHTRTLEL-EDDVLCVSVSPDGKLLAV 525 (888)
T ss_pred eeeeeccceEEec-cccEEEEEEcCCCcEEEE
Confidence 3321 11112 247888999999998764
No 102
>PRK00178 tolB translocation protein TolB; Provisional
Probab=37.55 E-value=6.7e+02 Score=28.56 Aligned_cols=121 Identities=8% Similarity=0.096 Sum_probs=62.5
Q ss_pred CCceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCC-ceeEEE
Q psy1644 241 ESAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTD-DVVFWK 317 (923)
Q Consensus 241 ~~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E-~VvfWk 317 (923)
+..++.+.|+..... .+.-+-...+.--+|..+-||+-... + ....|.++|++++.. +..+-.. ....-.
T Consensus 221 ~~~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la~~~~~---~---g~~~Iy~~d~~~~~~-~~lt~~~~~~~~~~ 293 (430)
T PRK00178 221 KRPRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLAFVLSK---D---GNPEIYVMDLASRQL-SRVTNHPAIDTEPF 293 (430)
T ss_pred CCCEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEEEEEcc---C---CCceEEEEECCCCCe-EEcccCCCCcCCeE
Confidence 345788888865431 11111112235568888777754211 0 124677888887642 2211111 122235
Q ss_pred eccC-CeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEe
Q psy1644 318 WISP-NTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIG 374 (923)
Q Consensus 318 Wis~-~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvG 374 (923)
|-.+ +.|+.+++. .+|.|++.+.. +..+..... ....-..++|++|++.+.
T Consensus 294 ~spDg~~i~f~s~~~g~~~iy~~d~~~g~-~~~lt~~~~-----~~~~~~~Spdg~~i~~~~ 349 (430)
T PRK00178 294 WGKDGRTLYFTSDRGGKPQIYKVNVNGGR-AERVTFVGN-----YNARPRLSADGKTLVMVH 349 (430)
T ss_pred ECCCCCEEEEEECCCCCceEEEEECCCCC-EEEeecCCC-----CccceEECCCCCEEEEEE
Confidence 6543 568888754 69999986543 333321110 011224577888877765
No 103
>TIGR01063 gyrA DNA gyrase, A subunit. This model describes the common type II DNA topoisomerase (DNA gyrase). Two apparently independently arising families, one in the Proteobacteria and one in Gram-positive lineages, are both designated toposisomerase IV.
Probab=37.14 E-value=4.2e+02 Score=33.88 Aligned_cols=69 Identities=16% Similarity=0.206 Sum_probs=44.8
Q ss_pred eeeeeccccccEEEEEeccceEEEEecCCcce-------eeeecc-cCCceEEeeccCCCCcEEEEcCCCeEEEEEe
Q psy1644 725 IYVQKVSAKYDVVYLITKFGYIHVYDIETATC-------IYMNRI-SSDTIFVTAPHEATGGIIGVNRKGQVLSVSV 793 (923)
Q Consensus 725 i~~~qvS~KyGvIyvITK~GflhlyDleTgt~-------Iy~nRI-S~dtIFvta~~~~t~Gii~VNrkGqVL~V~I 793 (923)
|+....-++..++.++|+.||+.-.++.+-.. +=..+. -+|.+.-...-+.++-++.++.+|+|+.+.+
T Consensus 487 i~~edli~~e~~~vllS~~GyIKri~~~~~~~~~~~~~g~s~~klKe~D~l~~~~~~~t~d~LllfTs~Grv~~l~~ 563 (800)
T TIGR01063 487 IDIEDLIARENVVVTLSHNGYVKRVPVSAYRLQKRGGKGVSGADMKDDDFIEQLLVASTHDYLLFFTNRGKVYWLKV 563 (800)
T ss_pred cchhhccCcceEEEEEcCCCEEEecchhhhhhhcccCcCccccccCCCCeeEEEEEecCCCeEEEEeCCCcEEEEEh
Confidence 44444566788899999999999987764211 111111 2344443333345667999999999999943
No 104
>PF10433 MMS1_N: Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A ....
Probab=37.08 E-value=7.1e+02 Score=29.38 Aligned_cols=74 Identities=18% Similarity=0.249 Sum_probs=45.3
Q ss_pred cccCCcccccceeeecceEEEEEeccCCCceEEEEECCCCCCCcccccCc----cccccCCCccEEEEeccCCCCCCCCC
Q psy1644 214 GINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISA----DSAIMNPASKVIALKGKAGNDNNPNA 289 (923)
Q Consensus 214 GI~~~si~F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~A----DSAIMnP~~~IIALra~~~~~n~~~~ 289 (923)
|+..+.=+..--+|.++.+|-|-++ .|.++|+.+.....+....+ -+|..|..+=++|+.
T Consensus 403 ~f~~~~~Tl~~~~~~~~~ivQVt~~-----~i~l~~~~~~~~~~~w~~~~~~~I~~a~~~~~~v~v~~~----------- 466 (504)
T PF10433_consen 403 GFDTDEPTLAAGNVGDGRIVQVTPK-----GIRLIDLEDGKLTQEWKPPAGSIIVAASINDPQVLVALS----------- 466 (504)
T ss_dssp TS-SSS-EEEEEEETTTEEEEEESS-----EEEEEESSSTSEEEEEE-TTS---SEEEESSSEEEEEE------------
T ss_pred cCCCCCCCeEEEEcCCCeEEEEecC-----eEEEEECCCCeEEEEEeCCCCCeEEEEEECCCEEEEEEe-----------
Confidence 5555444444455557777777654 68899987665554432333 366677766677775
Q ss_pred CCeEEEEecccccc
Q psy1644 290 PKTLQIFNIEMKSK 303 (923)
Q Consensus 290 g~~LQIFNle~K~K 303 (923)
++.+..|.++...-
T Consensus 467 ~~~~~~~~~~~~~~ 480 (504)
T PF10433_consen 467 GGELVYFELDDNKI 480 (504)
T ss_dssp TTEEEEEEEETTEE
T ss_pred CCcEEEEEEECCce
Confidence 57789998866544
No 105
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=37.00 E-value=4.4e+02 Score=31.28 Aligned_cols=105 Identities=11% Similarity=0.164 Sum_probs=53.9
Q ss_pred HHHHHHHhcChhHHHHHHHHhC-CCc---c-HHHHHHHHHh-cCCchhhHHHHhhhcCCCCC----cCccc-------cc
Q psy1644 466 VIQSFAETGQFQKIVLYAKKVG-YSP---D-YVFLLRNVMR-VNPDQGVGFAQMLVQDDEPL----ADINQ-------SY 528 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~vg-ytp---D-y~~LL~~i~~-~nP~~a~efA~~L~~~~~pl----~di~~-------~~ 528 (923)
.+.++.+.|++++++.+.++.- ..| + |..+.....+ -++++|.+....+....+.- ..+.. .-
T Consensus 573 l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 652 (899)
T TIGR02917 573 LAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYA 652 (899)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcCCHH
Confidence 5677888899998888887752 223 2 2222222222 36667776665555432110 00000 00
Q ss_pred hhhhhhhcccccccCCcchhhHHHHHHHhcCChhhhHHhhhc
Q psy1644 529 EGLFYFLGSIVNYSQDQEVHFKYIQAACKTGQIKEVERICRE 570 (923)
Q Consensus 529 egLyy~Lgsivn~s~d~dVhfKYIeAA~k~gq~~EvERicre 570 (923)
+.+-+|-..+-...++.+..+.+.....+.|++.|.+++++.
T Consensus 653 ~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~ 694 (899)
T TIGR02917 653 KAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKS 694 (899)
T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 111122111222234456666777777777888877776654
No 106
>PF14689 SPOB_a: Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=36.35 E-value=38 Score=29.48 Aligned_cols=21 Identities=24% Similarity=0.498 Sum_probs=18.8
Q ss_pred HHHHHHHhcChhHHHHHHHHh
Q psy1644 466 VIQSFAETGQFQKIVLYAKKV 486 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~v 486 (923)
||.+|.+.|+++++.+|+++.
T Consensus 29 vI~gllqlg~~~~a~eYi~~~ 49 (62)
T PF14689_consen 29 VIYGLLQLGKYEEAKEYIKEL 49 (62)
T ss_dssp HHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHCCCHHHHHHHHHHH
Confidence 999999999999999999874
No 107
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=36.07 E-value=52 Score=39.43 Aligned_cols=37 Identities=22% Similarity=0.352 Sum_probs=32.2
Q ss_pred eccccceEEEEeeeeeEEEEecccceeeeecccCCCc
Q psy1644 70 VSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDT 106 (923)
Q Consensus 70 iS~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~ 106 (923)
+...-|++|+.|+.|+++.||.+||..+++-|+-+..
T Consensus 468 l~t~g~lvf~g~~~G~l~a~D~~TGe~lw~~~~g~~~ 504 (527)
T TIGR03075 468 LATAGDLVFYGTLEGYFKAFDAKTGEELWKFKTGSGI 504 (527)
T ss_pred eEECCcEEEEECCCCeEEEEECCCCCEeEEEeCCCCc
Confidence 4567789999999999999999999999999985433
No 108
>KOG2111|consensus
Probab=36.04 E-value=2.4e+02 Score=32.69 Aligned_cols=139 Identities=16% Similarity=0.245 Sum_probs=78.4
Q ss_pred cCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEeccCCeEEEEecCeEEEEeccCCCCchhhh
Q psy1644 268 MNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKWISPNTLALVTETSVYHWSMEGDSTPEKMF 347 (923)
Q Consensus 268 MnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkWis~~tLglVT~taVYHWsi~g~s~P~K~F 347 (923)
|==..+++||-| ++.+..-+...|=|||=..+.-+....++.+|.=-+ ++.+.|.+|++..+|.|+...+-.-...|
T Consensus 54 MLfR~N~laLVG--Gg~~pky~pNkviIWDD~k~~~i~el~f~~~I~~V~-l~r~riVvvl~~~I~VytF~~n~k~l~~~ 130 (346)
T KOG2111|consen 54 MLFRSNYLALVG--GGSRPKYPPNKVIIWDDLKERCIIELSFNSEIKAVK-LRRDRIVVVLENKIYVYTFPDNPKLLHVI 130 (346)
T ss_pred HhhhhceEEEec--CCCCCCCCCceEEEEecccCcEEEEEEeccceeeEE-EcCCeEEEEecCeEEEEEcCCChhheeee
Confidence 444578999996 222222256689999955555567777777765444 57889999999999999885432222222
Q ss_pred hcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecc---cccccccccccce---eEEeccCCCccccC
Q psy1644 348 DRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYS---VERKCSQPIEEKK---SRVAANAPKGILRT 417 (923)
Q Consensus 348 dR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS---~er~~SQ~iEGk~---AkvAA~SP~giLRt 417 (923)
|=.++=.| ...+....+..- -+-+|+=.|.+|+-- ...+--..|+++. |.+|.|-++..+-|
T Consensus 131 et~~NPkG--lC~~~~~~~k~~------LafPg~k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~~Gt~vAT 198 (346)
T KOG2111|consen 131 ETRSNPKG--LCSLCPTSNKSL------LAFPGFKTGQVQIVDLASTKPNAPSIINAHDSDIACVALNLQGTLVAT 198 (346)
T ss_pred ecccCCCc--eEeecCCCCceE------EEcCCCccceEEEEEhhhcCcCCceEEEcccCceeEEEEcCCccEEEE
Confidence 22221111 111111111111 112455559999654 4444346666643 66666666544433
No 109
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=35.36 E-value=64 Score=26.93 Aligned_cols=33 Identities=21% Similarity=0.261 Sum_probs=29.0
Q ss_pred eeeeeccccccEEEEEeccceEEEEecCCcceee
Q psy1644 725 IYVQKVSAKYDVVYLITKFGYIHVYDIETATCIY 758 (923)
Q Consensus 725 i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy 758 (923)
|-.+.-|++-++|.+.|..|-|.+|.+ +|..|+
T Consensus 14 v~~~~w~P~mdLiA~~t~~g~v~v~Rl-~~qriw 46 (47)
T PF12894_consen 14 VSCMSWCPTMDLIALGTEDGEVLVYRL-NWQRIW 46 (47)
T ss_pred EEEEEECCCCCEEEEEECCCeEEEEEC-CCcCcc
Confidence 445578999999999999999999999 888776
No 110
>PRK00076 recR recombination protein RecR; Reviewed
Probab=35.08 E-value=87 Score=33.57 Aligned_cols=90 Identities=18% Similarity=0.351 Sum_probs=56.7
Q ss_pred HHHHHHHhcCCchhhHHHHhhhcC--------------CCCCcCccc-------------------------cchhhhhh
Q psy1644 494 FLLRNVMRVNPDQGVGFAQMLVQD--------------DEPLADINQ-------------------------SYEGLFYF 534 (923)
Q Consensus 494 ~LL~~i~~~nP~~a~efA~~L~~~--------------~~pl~di~~-------------------------~~egLyy~ 534 (923)
.+--+++..+++.+.+||..|..- +..++++.. .|.|+|+.
T Consensus 26 Rla~~ll~~~~~~~~~la~~i~~~~~~i~~C~~C~~lse~~~C~IC~d~~Rd~~~icVVE~~~Dv~aiE~s~~y~G~YhV 105 (196)
T PRK00076 26 RLAFHLLQRDREDVLRLAQALEEAKEKIKHCSVCGNLTEQDPCEICSDPRRDQSLICVVESPADVLAIERTGEYRGLYHV 105 (196)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHHHcCCcCCCCCCcCCCCcCCCCCCCCCCCCEEEEECCHHHHHHHHhhCcCceEEEE
Confidence 344456667888888888877652 123455544 48899999
Q ss_pred hcccccccCC---cchhhHHHHHHHhcCChhhhHHhhhcCCccChHHHHhHHhhc
Q psy1644 535 LGSIVNYSQD---QEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEA 586 (923)
Q Consensus 535 Lgsivn~s~d---~dVhfKYIeAA~k~gq~~EvERicresn~YdpekvKnfLKea 586 (923)
||..++--+. .++.++=+..-. .++++|| |.--|.--+.|-+-.||++.
T Consensus 106 L~G~ispl~gi~p~~l~i~~L~~ri-~~~v~EV--IlA~~pt~EGe~Ta~yi~~~ 157 (196)
T PRK00076 106 LGGLLSPLDGIGPEDLNIDELLERL-DGEVKEV--ILATNPTVEGEATAHYIARL 157 (196)
T ss_pred ecCCcCCCCCCCccccCHHHHHHHH-hCCCCEE--EEeCCCCchHHHHHHHHHHH
Confidence 9977654333 345555333322 4667776 55557777777777777643
No 111
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=34.59 E-value=45 Score=25.88 Aligned_cols=21 Identities=29% Similarity=0.183 Sum_probs=17.4
Q ss_pred cccEEEEEeccceEEEEecCC
Q psy1644 733 KYDVVYLITKFGYIHVYDIET 753 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleT 753 (923)
.-|.+|+.|..|.+|-+|.+|
T Consensus 20 ~~g~vyv~~~dg~l~ald~~t 40 (40)
T PF13570_consen 20 AGGRVYVGTGDGNLYALDAAT 40 (40)
T ss_dssp CTSEEEEE-TTSEEEEEETT-
T ss_pred ECCEEEEEcCCCEEEEEeCCC
Confidence 468999999999999999986
No 112
>PRK04922 tolB translocation protein TolB; Provisional
Probab=34.33 E-value=7.8e+02 Score=28.39 Aligned_cols=121 Identities=9% Similarity=0.130 Sum_probs=63.7
Q ss_pred CCceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCC-ceeEEE
Q psy1644 241 ESAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTD-DVVFWK 317 (923)
Q Consensus 241 ~~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E-~VvfWk 317 (923)
+..+|.++|+..... ...-+-...+...+|..+-||+-.... ....|.++|+++++. +..+-.. ....=.
T Consensus 226 ~~~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~l~~~~s~~------g~~~Iy~~d~~~g~~-~~lt~~~~~~~~~~ 298 (433)
T PRK04922 226 GRSAIYVQDLATGQRELVASFRGINGAPSFSPDGRRLALTLSRD------GNPEIYVMDLGSRQL-TRLTNHFGIDTEPT 298 (433)
T ss_pred CCcEEEEEECCCCCEEEeccCCCCccCceECCCCCEEEEEEeCC------CCceEEEEECCCCCe-EECccCCCCccceE
Confidence 345788888865432 111111122456788887666542111 234799999987652 2222111 122235
Q ss_pred eccC-CeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEe
Q psy1644 318 WISP-NTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIG 374 (923)
Q Consensus 318 Wis~-~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvG 374 (923)
|-.+ +.|+.+++. .+|.|++++.. .+.+.. + +....+...++|++|++...
T Consensus 299 ~spDG~~l~f~sd~~g~~~iy~~dl~~g~-~~~lt~-~----g~~~~~~~~SpDG~~Ia~~~ 354 (433)
T PRK04922 299 WAPDGKSIYFTSDRGGRPQIYRVAASGGS-AERLTF-Q----GNYNARASVSPDGKKIAMVH 354 (433)
T ss_pred ECCCCCEEEEEECCCCCceEEEEECCCCC-eEEeec-C----CCCccCEEECCCCCEEEEEE
Confidence 6544 467777653 48988886533 333321 1 11222346678888877754
No 113
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=34.05 E-value=8.1e+02 Score=28.50 Aligned_cols=75 Identities=16% Similarity=0.230 Sum_probs=52.8
Q ss_pred CccEEEEEecCCCCCCCCCcccceeeeecCCC--cc-CCcceeEEeccccceEEEE--eeeeeEEEEecccceeeeeccc
Q psy1644 28 AGGKLHIIEVGTPPAGNTPLAKKAVDVFFPPE--AA-NDFPVAMQVSAKYDVVYLI--TKFGYIHVYDIETATCIYMNRI 102 (923)
Q Consensus 28 ~~gkLhiiEig~~~~g~~~f~kK~vd~~Fppe--~~-~DFPvamqiS~k~G~iyli--TK~G~lhl~Dlet~t~iy~~rI 102 (923)
....+.++.+-+- +-.-++..||+ ++ .-+|..+.+|.-=..+|+- |.-.-|.+.|++++..+-+=.+
T Consensus 75 ~~d~V~v~D~~t~--------~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~v 146 (352)
T TIGR02658 75 RTDYVEVIDPQTH--------LPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDV 146 (352)
T ss_pred CCCEEEEEECccC--------cEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeC
Confidence 4457777776542 23345555555 33 8889999999887777764 5689999999999999887776
Q ss_pred CC-CcEEEe
Q psy1644 103 SS-DTIFVT 110 (923)
Q Consensus 103 S~-~~if~t 110 (923)
-+ ..++.+
T Consensus 147 p~~~~vy~t 155 (352)
T TIGR02658 147 PDCYHIFPT 155 (352)
T ss_pred CCCcEEEEe
Confidence 55 444544
No 114
>KOG0293|consensus
Probab=33.98 E-value=1.2e+02 Score=36.03 Aligned_cols=91 Identities=16% Similarity=0.416 Sum_probs=62.6
Q ss_pred CCeEEEEecccccccccccCC--CceeEEEeccCCeEEEEecC---eEEEEeccCCCCchhhhhcccCCCCC---eeEee
Q psy1644 290 PKTLQIFNIEMKSKMKAHPMT--DDVVFWKWISPNTLALVTET---SVYHWSMEGDSTPEKMFDRHSTLNGC---QIINY 361 (923)
Q Consensus 290 g~~LQIFNle~K~KLKs~~m~--E~VvfWkWis~~tLglVT~t---aVYHWsi~g~s~P~K~FdR~~~L~~~---QIInY 361 (923)
...+-.||.++-.+..-+.-. -.+.=-.|. ++-..+||.. ..+||+.+|+- ..+-.+. .|..-
T Consensus 290 ~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~-pDg~~~V~Gs~dr~i~~wdlDgn~--------~~~W~gvr~~~v~dl 360 (519)
T KOG0293|consen 290 DEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWC-PDGFRFVTGSPDRTIIMWDLDGNI--------LGNWEGVRDPKVHDL 360 (519)
T ss_pred hHheeeccCCcchhhhhcccCcCCCcceeEEc-cCCceeEecCCCCcEEEecCCcch--------hhcccccccceeEEE
Confidence 345889999988776655444 455556675 5566788875 89999998864 1111222 37778
Q ss_pred eeCCCCCeEEEEeeecccccccceeeecccccccc
Q psy1644 362 RTDPKQTWLLLIGISAVSNRVVGAMQLYSVERKCS 396 (923)
Q Consensus 362 r~d~~~kW~~LvGI~~~~~~i~G~mQLYS~er~~S 396 (923)
....|+||+++++- .-.+-||++|...-
T Consensus 361 ait~Dgk~vl~v~~-------d~~i~l~~~e~~~d 388 (519)
T KOG0293|consen 361 AITYDGKYVLLVTV-------DKKIRLYNREARVD 388 (519)
T ss_pred EEcCCCcEEEEEec-------ccceeeechhhhhh
Confidence 88999999999993 34677788765443
No 115
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=33.89 E-value=2e+02 Score=34.08 Aligned_cols=28 Identities=14% Similarity=0.220 Sum_probs=20.7
Q ss_pred cCCcchhhHHHHHHHhcCChhhhHHhhh
Q psy1644 542 SQDQEVHFKYIQAACKTGQIKEVERICR 569 (923)
Q Consensus 542 s~d~dVhfKYIeAA~k~gq~~EvERicr 569 (923)
..|+++.+.|.++-.+.|++.|.+.+.+
T Consensus 868 ~~~~~~~~~l~~~~~~~g~~~~A~~~~~ 895 (899)
T TIGR02917 868 PEAAAIRYHLALALLATGRKAEARKELD 895 (899)
T ss_pred CCChHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 3467788888888888888888776643
No 116
>KOG4649|consensus
Probab=33.75 E-value=1.1e+02 Score=34.69 Aligned_cols=67 Identities=19% Similarity=0.306 Sum_probs=55.8
Q ss_pred eccccccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCC-CCcEEEEcCCCeEEEEEecCC
Q psy1644 729 KVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEA-TGGIIGVNRKGQVLSVSVDED 796 (923)
Q Consensus 729 qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~-t~Gii~VNrkGqVL~V~Ide~ 796 (923)
|+..--|+||.-|-.|-+|.-|..|-.|+|..++- -++|++---.. .+-++.....|.||.|+++.-
T Consensus 100 ~~d~~~glIycgshd~~~yalD~~~~~cVykskcg-G~~f~sP~i~~g~~sly~a~t~G~vlavt~~~~ 167 (354)
T KOG4649|consen 100 QCDFDGGLIYCGSHDGNFYALDPKTYGCVYKSKCG-GGTFVSPVIAPGDGSLYAAITAGAVLAVTKNPY 167 (354)
T ss_pred EEcCCCceEEEecCCCcEEEecccccceEEecccC-CceeccceecCCCceEEEEeccceEEEEccCCC
Confidence 68888999999999999999999999999997764 46787654444 456777779999999998765
No 117
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=33.71 E-value=1.3e+02 Score=33.71 Aligned_cols=62 Identities=15% Similarity=0.282 Sum_probs=48.1
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecC
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDE 795 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide 795 (923)
.|-||+=|-.|.+|.+|..||+.+..-+......+..++--..+.++..+-.|+++.+.-+.
T Consensus 111 ~G~i~~g~~~g~~y~ld~~~G~~~W~~~~~~~~~~~~~~v~~~~~v~~~s~~g~~~al~~~t 172 (370)
T COG1520 111 DGKIYVGSWDGKLYALDASTGTLVWSRNVGGSPYYASPPVVGDGTVYVGTDDGHLYALNADT 172 (370)
T ss_pred CCeEEEecccceEEEEECCCCcEEEEEecCCCeEEecCcEEcCcEEEEecCCCeEEEEEccC
Confidence 78899999999999999999999999998883234444444556666666778888777663
No 118
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=33.34 E-value=57 Score=26.74 Aligned_cols=26 Identities=31% Similarity=0.732 Sum_probs=18.6
Q ss_pred ccceeEEEecccccceEEEEecCCCCCCC
Q psy1644 141 SDKFICVREKIAESAQVVIIDMNDPTNPI 169 (923)
Q Consensus 141 sdk~ic~~~~~~~~~~vviid~~~~~~~~ 169 (923)
+++|..|-.. +..++|+|+.||++|.
T Consensus 10 ~g~yaYva~~---~~Gl~IvDISnPs~P~ 35 (42)
T PF08309_consen 10 SGNYAYVADG---NNGLVIVDISNPSNPV 35 (42)
T ss_pred ECCEEEEEeC---CCCEEEEECCCCCCCE
Confidence 3456666533 2459999999999985
No 119
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=32.89 E-value=66 Score=26.54 Aligned_cols=22 Identities=23% Similarity=0.247 Sum_probs=12.0
Q ss_pred HhcCChhHHHHHHHhcccCChH
Q psy1644 849 CKTGQIKEVERICRESNCYNAE 870 (923)
Q Consensus 849 ~ktg~i~e~eri~r~~~~y~~~ 870 (923)
.++|+..+.++++...--.+|.
T Consensus 36 ~~~g~~~~A~~~l~~~~~~~~~ 57 (68)
T PF14559_consen 36 LKQGQYDEAEELLERLLKQDPD 57 (68)
T ss_dssp HHTT-HHHHHHHHHCCHGGGTT
T ss_pred HHcCCHHHHHHHHHHHHHHCcC
Confidence 3566666666666655544444
No 120
>KOG0639|consensus
Probab=32.69 E-value=3.3e+02 Score=33.41 Aligned_cols=116 Identities=21% Similarity=0.349 Sum_probs=69.3
Q ss_pred cCCCceEEEEECCCCCCCcccccCcc---------ccccCCCccEE-EEeccCCCCCCCCCCCeEEEEeccccccccccc
Q psy1644 239 IAESAQVVIIDMNDPTNPIRRPISAD---------SAIMNPASKVI-ALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHP 308 (923)
Q Consensus 239 ~~~~~qVvIVDL~n~~~v~Rrpi~AD---------SAIMnP~~~II-ALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~ 308 (923)
.|+...|.|+||+.|.-- |+|+ +-.|+|..|+. |.= +...+-||||..+.++..++
T Consensus 483 GGeastlsiWDLAapTpr----ikaeltssapaCyALa~spDakvcFscc----------sdGnI~vwDLhnq~~Vrqfq 548 (705)
T KOG0639|consen 483 GGEASTLSIWDLAAPTPR----IKAELTSSAPACYALAISPDAKVCFSCC----------SDGNIAVWDLHNQTLVRQFQ 548 (705)
T ss_pred ccccceeeeeeccCCCcc----hhhhcCCcchhhhhhhcCCccceeeeec----------cCCcEEEEEcccceeeeccc
Confidence 456679999999987532 4544 23578888753 222 34589999999999988876
Q ss_pred CCCceeEEEeccCCeEEEEe---cCeEEEEeccCCCCchh-hhhcccCCCCCeeEeeeeCCCCCeEEEEeee
Q psy1644 309 MTDDVVFWKWISPNTLALVT---ETSVYHWSMEGDSTPEK-MFDRHSTLNGCQIINYRTDPKQTWLLLIGIS 376 (923)
Q Consensus 309 m~E~VvfWkWis~~tLglVT---~taVYHWsi~g~s~P~K-~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~ 376 (923)
=-.|=+=-==|+.+---|=| +..|=-|++....++.+ =| ..||.+....+++.|++ +|..
T Consensus 549 GhtDGascIdis~dGtklWTGGlDntvRcWDlregrqlqqhdF-------~SQIfSLg~cP~~dWla-vGMe 612 (705)
T KOG0639|consen 549 GHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQLQQHDF-------SSQIFSLGYCPTGDWLA-VGME 612 (705)
T ss_pred CCCCCceeEEecCCCceeecCCCccceeehhhhhhhhhhhhhh-------hhhheecccCCCcccee-eecc
Confidence 53321111111211111212 23566777632222221 12 36999999999999975 5654
No 121
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=32.67 E-value=39 Score=26.21 Aligned_cols=21 Identities=29% Similarity=0.183 Sum_probs=17.3
Q ss_pred ccceEEEEeeeeeEEEEeccc
Q psy1644 73 KYDVVYLITKFGYIHVYDIET 93 (923)
Q Consensus 73 k~G~iyliTK~G~lhl~Dlet 93 (923)
--|.||+.|..|.||-+|.+|
T Consensus 20 ~~g~vyv~~~dg~l~ald~~t 40 (40)
T PF13570_consen 20 AGGRVYVGTGDGNLYALDAAT 40 (40)
T ss_dssp CTSEEEEE-TTSEEEEEETT-
T ss_pred ECCEEEEEcCCCEEEEEeCCC
Confidence 358999999999999999986
No 122
>KOG2063|consensus
Probab=31.97 E-value=98 Score=39.80 Aligned_cols=126 Identities=21% Similarity=0.249 Sum_probs=80.4
Q ss_pred HHhhhcCCccChHHHHhHHhhccCCCCCceEEeeccccchhHHHHHHHhcCch-------------------------he
Q psy1644 565 ERICRESNCYNAERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQ-------------------------KY 619 (923)
Q Consensus 565 ERicresn~YdpekvKnfLKeakL~d~~PLI~VCDrf~fV~dLv~YLY~n~~~-------------------------k~ 619 (923)
+-+-+|||.=+-|.++.-||++| ...+|+.+.|.++++ +.
T Consensus 483 ~llrlen~~c~vee~e~~L~k~~---------------~y~~Li~LY~~kg~h~~AL~ll~~l~d~~~~~d~~~~~~~e~ 547 (877)
T KOG2063|consen 483 PLLRLENNHCDVEEIETVLKKSK---------------KYRELIELYATKGMHEKALQLLRDLVDEDSDTDSFQLDGLEK 547 (877)
T ss_pred hhhhccCCCcchHHHHHHHHhcc---------------cHHHHHHHHHhccchHHHHHHHHHHhccccccccchhhhHHH
Confidence 34446887778888999998775 456666666665543 23
Q ss_pred ehhhhhccCccccceeeeeccccC--CHHHhhhhhccCcchhhhHHhhhccccccchhhHHHHHhhcchhHHHHHHHHHH
Q psy1644 620 IEIYVQKGRKHLLEKWLKEEKLEC--SEELGDLVKQSDPTLALSVYLRANIKQSRFVHDLVLYLYRNNLQKYIEIYVQKI 697 (923)
Q Consensus 620 IEvYvqkvNp~~tP~VvgLlD~dc--~E~~Ik~LL~sv~~l~~~~~v~~~~~~~~pv~~Lv~ev~RnrL~kll~p~LE~~ 697 (923)
|.-|+++--+...|.+.-.-|..| +++.++.++.+.+. +-.+.+|-++..+.++-.-= +++.||||+.
T Consensus 548 ii~YL~~l~~~~~~Li~~y~~wvl~~~p~~gi~Ift~~~~---------~~~~sis~~~Vl~~l~~~~~-~l~I~YLE~l 617 (877)
T KOG2063|consen 548 IIEYLKKLGAENLDLILEYADWVLNKNPEAGIQIFTSEDK---------QEAESISRDDVLNYLKSKEP-KLLIPYLEHL 617 (877)
T ss_pred HHHHHHHhcccchhHHHHHhhhhhccCchhheeeeeccCh---------hhhccCCHHHHHHHhhhhCc-chhHHHHHHH
Confidence 566777666665566555223222 34444444444210 11355666666666643333 7899999999
Q ss_pred HHhhcc-cchhhhHHHHhH
Q psy1644 698 RIKIKQ-SRFVHDLVLYLY 715 (923)
Q Consensus 698 ~~~g~q-~~~v~nalakiy 715 (923)
+.+-.. ++-+||-|+|.|
T Consensus 618 i~~~~~~~~~lht~ll~ly 636 (877)
T KOG2063|consen 618 ISDNRLTSTLLHTVLLKLY 636 (877)
T ss_pred hHhccccchHHHHHHHHHH
Confidence 988766 899999999999
No 123
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=29.74 E-value=79 Score=26.48 Aligned_cols=28 Identities=14% Similarity=0.255 Sum_probs=20.4
Q ss_pred eEEeccccc---eEEEEeeeeeEEEEecccc
Q psy1644 67 AMQVSAKYD---VVYLITKFGYIHVYDIETA 94 (923)
Q Consensus 67 amqiS~k~G---~iyliTK~G~lhl~Dlet~ 94 (923)
.|..|+..| ++..-=-.|++||+|+-++
T Consensus 5 ~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~~ 35 (43)
T PF10313_consen 5 CCKFSPEPGGNDLLAWAEHQGRVHIVDTRSN 35 (43)
T ss_pred EEEeCCCCCcccEEEEEccCCeEEEEEcccC
Confidence 355564444 7777778899999999863
No 124
>KOG1070|consensus
Probab=29.33 E-value=2.7e+02 Score=37.82 Aligned_cols=163 Identities=20% Similarity=0.326 Sum_probs=104.9
Q ss_pred cCHHHHHhhh-cCCCCCCCCchHHHHHHHHHhcC--------------CCCcccccccchhhH-----------------
Q psy1644 416 RTPQTIVKFQ-QVPTPAGQTSPLLQYFGILLDHG--------------KLNKHESLELCRPVL----------------- 463 (923)
Q Consensus 416 Rt~eTI~rFK-~vp~~pGq~sPLL~YF~~LL~~g--------------~LN~~ESlEL~rpVL----------------- 463 (923)
|-|+|-+.|. .+-.-|+..-+.++|-.--|+.+ ++|--|.-|.-..-+
T Consensus 1439 ~~pesaeDferlvrssPNSSi~WI~YMaf~LelsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNlEn~yG~eesl~k 1518 (1710)
T KOG1070|consen 1439 RAPESAEDFERLVRSSPNSSILWIRYMAFHLELSEIEKARKIAERALKTINFREEEEKLNIWIAYLNLENAYGTEESLKK 1518 (1710)
T ss_pred cCCcCHHHHHHHHhcCCCcchHHHHHHHHHhhhhhhHHHHHHHHHHhhhCCcchhHHHHHHHHHHHhHHHhhCcHHHHHH
Confidence 6677777774 46788998889999999888876 343333333222111
Q ss_pred --HH-------------HHHHHHHhcChhHHHHH----HHHhCCCc-cHHHHHHHHHhcCCchh----------------
Q psy1644 464 --AQ-------------VIQSFAETGQFQKIVLY----AKKVGYSP-DYVFLLRNVMRVNPDQG---------------- 507 (923)
Q Consensus 464 --~Q-------------Vi~clae~Gqfdkiv~Y----~~~vgytp-Dy~~LL~~i~~~nP~~a---------------- 507 (923)
.. ..--+...++|+++.++ .++.|-++ =|+.....++|.|-++|
T Consensus 1519 VFeRAcqycd~~~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~e 1598 (1710)
T KOG1070|consen 1519 VFERACQYCDAYTVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQE 1598 (1710)
T ss_pred HHHHHHHhcchHHHHHHHHHHHHHhhcchhHHHHHHHHHHHhcchhhHHHHHHHHHhcccHHHHHHHHHHHHHhhcchhh
Confidence 11 23334566777777665 57778555 69999999999887333
Q ss_pred -hHHHHhhhcCCCCCcCccc---cchhhhhhhcccccccCCcchhhHHHHHHHhcCChhhhHHhhhc--CCccChHHHHh
Q psy1644 508 -VGFAQMLVQDDEPLADINQ---SYEGLFYFLGSIVNYSQDQEVHFKYIQAACKTGQIKEVERICRE--SNCYNAERVKN 581 (923)
Q Consensus 508 -~efA~~L~~~~~pl~di~~---~~egLyy~Lgsivn~s~d~dVhfKYIeAA~k~gq~~EvERicre--sn~YdpekvKn 581 (923)
++|...-++.+=-.-|.++ .|||| +...-...|+--=||+.-.|.|..+-|+++-+- +--..|-|.|-
T Consensus 1599 Hv~~IskfAqLEFk~GDaeRGRtlfEgl------l~ayPKRtDlW~VYid~eik~~~~~~vR~lfeRvi~l~l~~kkmKf 1672 (1710)
T KOG1070|consen 1599 HVEFISKFAQLEFKYGDAERGRTLFEGL------LSAYPKRTDLWSVYIDMEIKHGDIKYVRDLFERVIELKLSIKKMKF 1672 (1710)
T ss_pred hHHHHHHHHHHHhhcCCchhhHHHHHHH------HhhCccchhHHHHHHHHHHccCCHHHHHHHHHHHHhcCCChhHhHH
Confidence 2222222222211234455 78999 222233457888899999999999987655444 44578999998
Q ss_pred HHh
Q psy1644 582 FLK 584 (923)
Q Consensus 582 fLK 584 (923)
|.|
T Consensus 1673 ffK 1675 (1710)
T KOG1070|consen 1673 FFK 1675 (1710)
T ss_pred HHH
Confidence 887
No 125
>PHA02598 denA endonuclease II; Provisional
Probab=29.25 E-value=48 Score=33.83 Aligned_cols=48 Identities=17% Similarity=0.463 Sum_probs=38.7
Q ss_pred EEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEE
Q psy1644 739 LITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVS 792 (923)
Q Consensus 739 vITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~ 792 (923)
++|++||+.+++++=+++ .||. +-|+.+++ ...|++..-..|+|+.|.
T Consensus 2 ~~~~~~f~k~~~l~l~~~---~~i~--~~f~~~~~-~~n~VY~~~~~~~viYVG 49 (138)
T PHA02598 2 ILEEYGFIKYAELELDKN---GRID--RSFIKCPN-KKNVIYAIAVDDELVYIG 49 (138)
T ss_pred chHhhCceeEEEEEecCC---CcCc--ccccCCcc-cceEEEEEEeCCeEEEEe
Confidence 468999999999998876 5664 45888885 478888887888998886
No 126
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=29.04 E-value=3e+02 Score=28.09 Aligned_cols=93 Identities=17% Similarity=0.361 Sum_probs=55.2
Q ss_pred EEEEEeccceEEEEecCCcceeeeecccCCceEEeeccCCCCcEEEEcCCCeEEEEEecCCcccccccccCCCHHHHHHH
Q psy1644 736 VVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGVNRKGQVLSVSVDEDNIIPYINGTLQNPDLALRI 815 (923)
Q Consensus 736 vIyvITK~GflhlyDleTgt~Iy~nRIS~dtIFvta~~~~t~Gii~VNrkGqVL~V~Ide~~iVpYI~~~L~n~~LA~kL 815 (923)
+=.++|++=+|.||+ ...+-.||+++..||-=.- ....|+++.++.|..+=--++-+.-+= +++
T Consensus 37 ~si~lT~~H~llL~~---~~l~~vn~L~~~vV~e~~~---------~~~~~~~~gl~~D~~~~t~W~ys~~~I----~ei 100 (147)
T PF05131_consen 37 LSIALTEFHLLLLYS---DRLIAVNRLNNKVVFEESL---------LETGGKILGLCRDPSSNTFWLYSSNSI----FEI 100 (147)
T ss_pred ceEEeeceeeeEEeC---CEEEEEEecCCcEEEEEEe---------ccCCcceeeEEEcCCCCeEEEEeCCee----EEE
Confidence 345789999999986 6799999999999984322 445556666666654322222221100 000
Q ss_pred HHHcCCCCchHHHHHHHHHHHhcCcHHHHHHHHHhc
Q psy1644 816 AVRNNLAGAEDLFVRKFNLLFTNGQYSEAAKAACKT 851 (923)
Q Consensus 816 A~R~~LPGAddLf~~qFn~l~~~G~y~eAAk~A~kt 851 (923)
-.+ -|+ ..-..-++.+|+|.+|-+.|..+
T Consensus 101 ~i~-----~E~--r~vWk~yl~~~~fd~Al~~~~~~ 129 (147)
T PF05131_consen 101 VIN-----NED--RDVWKIYLDKGDFDEALQYCKTN 129 (147)
T ss_pred EcC-----cch--HHHHHHHHhcCcHHHHHHHccCC
Confidence 000 022 23355678889999988876653
No 127
>KOG0319|consensus
Probab=28.94 E-value=72 Score=39.93 Aligned_cols=70 Identities=20% Similarity=0.241 Sum_probs=51.7
Q ss_pred ccccceEEEEeeeeeEEEEecccceeeeecccCC-CcEEEeeccCCCcceEEEecCCcceeeeeeeceeecc
Q psy1644 71 SAKYDVVYLITKFGYIHVYDIETATCIYMNRISS-DTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMES 141 (923)
Q Consensus 71 S~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~-~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~s 141 (923)
..|-+.++..-..|.+.+||.|||.|+|..|.++ +-+--.+.-...++++.|++.- |.+-+-+.++++.+
T Consensus 247 ~~~~~~~~TaG~~g~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~~~~~~l~vtaeQ-nl~l~d~~~l~i~k 317 (775)
T KOG0319|consen 247 GGKGEYIITAGGSGVVQYWDSESGKCVYKQRQSDSEEIDHLLAIESMSQLLLVTAEQ-NLFLYDEDELTIVK 317 (775)
T ss_pred CCcceEEEEecCCceEEEEecccchhhhhhccCCchhhhcceeccccCceEEEEccc-eEEEEEccccEEeh
Confidence 4445567777789999999999999999999994 3332234456678888888775 78877777776643
No 128
>KOG1539|consensus
Probab=28.46 E-value=1.1e+02 Score=38.97 Aligned_cols=57 Identities=18% Similarity=0.134 Sum_probs=38.6
Q ss_pred EEeccccceEEEEeeeeeEEEEecccceeeeecccCCCcEEEeeccCCCcceEEEecCC
Q psy1644 68 MQVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIFVTAPHEATGGIIGLTNVG 126 (923)
Q Consensus 68 mqiS~k~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G 126 (923)
|.+|+-=..|-.-...|+|.+||+-||++|=+=.+.+..+++.- +.++.+|+..-.|
T Consensus 582 ~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~~vd~~~~sls~--SPngD~LAT~Hvd 638 (910)
T KOG1539|consen 582 MTFSPDGRWLISASMDSTIRTWDLPTGTLIDGLLVDSPCTSLSF--SPNGDFLATVHVD 638 (910)
T ss_pred eEeCCCCcEEEEeecCCcEEEEeccCcceeeeEecCCcceeeEE--CCCCCEEEEEEec
Confidence 44444555666666679999999999999987777766666552 2345555555554
No 129
>KOG1273|consensus
Probab=27.63 E-value=1.4e+02 Score=34.71 Aligned_cols=32 Identities=34% Similarity=0.414 Sum_probs=29.1
Q ss_pred cccEEEEEeccceEEEEecCCcceeeeecccC
Q psy1644 733 KYDVVYLITKFGYIHVYDIETATCIYMNRISS 764 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~ 764 (923)
+=..||.-|.-|-+++||.+|-.|+-+-||+.
T Consensus 164 ~g~yIitGtsKGkllv~~a~t~e~vas~rits 195 (405)
T KOG1273|consen 164 RGKYIITGTSKGKLLVYDAETLECVASFRITS 195 (405)
T ss_pred CCCEEEEecCcceEEEEecchheeeeeeeech
Confidence 33468999999999999999999999999998
No 130
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=27.46 E-value=88 Score=33.26 Aligned_cols=29 Identities=24% Similarity=0.281 Sum_probs=12.8
Q ss_pred ccCCcchhhHHHHHHHhcCChhhhHHhhh
Q psy1644 541 YSQDQEVHFKYIQAACKTGQIKEVERICR 569 (923)
Q Consensus 541 ~s~d~dVhfKYIeAA~k~gq~~EvERicr 569 (923)
...||.+..-|-++-...|+..|-.++-|
T Consensus 244 ~p~d~~~~~~~a~~l~~~g~~~~A~~~~~ 272 (280)
T PF13429_consen 244 NPDDPLWLLAYADALEQAGRKDEALRLRR 272 (280)
T ss_dssp STT-HHHHHHHHHHHT-------------
T ss_pred ccccccccccccccccccccccccccccc
Confidence 45688899999999999999999887744
No 131
>KOG4532|consensus
Probab=26.96 E-value=1.5e+02 Score=33.76 Aligned_cols=33 Identities=21% Similarity=0.359 Sum_probs=29.5
Q ss_pred cceEEEEeeeeeEEEEecccceeeeecccCCCc
Q psy1644 74 YDVVYLITKFGYIHVYDIETATCIYMNRISSDT 106 (923)
Q Consensus 74 ~G~iyliTK~G~lhl~Dlet~t~iy~~rIS~~~ 106 (923)
-+++|.---.||+||+|+-++.-...=+|++|.
T Consensus 264 lDLLf~sEhfs~~hv~D~R~~~~~q~I~i~~d~ 296 (344)
T KOG4532|consen 264 LDLLFISEHFSRVHVVDTRNYVNHQVIVIPDDV 296 (344)
T ss_pred ceEEEEecCcceEEEEEcccCceeeEEecCccc
Confidence 457777778999999999999999999999988
No 132
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=26.92 E-value=1.3e+02 Score=29.09 Aligned_cols=54 Identities=22% Similarity=0.232 Sum_probs=40.9
Q ss_pred ChhhhHHhhhcCCccChHHHHhHHhhccCCCCCceEEeeccccchhHHHHHHHhcCch-heehhhhhccC
Q psy1644 560 QIKEVERICRESNCYNAERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQ-KYIEIYVQKGR 628 (923)
Q Consensus 560 q~~EvERicresn~YdpekvKnfLKeakL~d~~PLI~VCDrf~fV~dLv~YLY~n~~~-k~IEvYvqkvN 628 (923)
+-.++...+|-.|+=|++.|..+|++. ++.++|+.+.|.++++ |-.|...+-.+
T Consensus 13 ~~~~l~~llr~~N~C~~~~~e~~L~~~---------------~~~~eL~~lY~~kg~h~~AL~ll~~l~~ 67 (108)
T PF10366_consen 13 NPSLLGPLLRLPNYCDLEEVEEVLKEH---------------GKYQELVDLYQGKGLHRKALELLKKLAD 67 (108)
T ss_pred CHHHHHHHHccCCcCCHHHHHHHHHHc---------------CCHHHHHHHHHccCccHHHHHHHHHHhc
Confidence 567889999998988999999999866 4679999888887775 34444443333
No 133
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=26.78 E-value=57 Score=41.13 Aligned_cols=29 Identities=28% Similarity=0.114 Sum_probs=25.7
Q ss_pred cccEEEEEeccceEEEEecCCcceeeeec
Q psy1644 733 KYDVVYLITKFGYIHVYDIETATCIYMNR 761 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleTgt~Iy~nR 761 (923)
+..++|..||.|++|++|-+||++|+..+
T Consensus 458 ~~~~v~~~~K~G~~~vlDr~tG~~l~~~~ 486 (764)
T TIGR03074 458 TVPALVAPTKQGQIYVLDRRTGEPIVPVE 486 (764)
T ss_pred EeeEEEEECCCCEEEEEECCCCCEEeece
Confidence 34499999999999999999999999743
No 134
>KOG1896|consensus
Probab=26.78 E-value=4.1e+02 Score=35.68 Aligned_cols=151 Identities=17% Similarity=0.183 Sum_probs=101.3
Q ss_pred cceeeeecCCCCCCceEEEEEEecC--CccEEEEEecCCCCCCCCCcccceeeeecCCCccCCcceeEEeccc------c
Q psy1644 3 ASFASFKSEGNPEPSTLFCFAVRTP--AGGKLHIIEVGTPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAK------Y 74 (923)
Q Consensus 3 a~F~sfkl~gN~~~stL~cfa~r~~--~~gkLhiiEig~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k------~ 74 (923)
++++++-+.|.+++.+=+|...-+- ..|+|.++--. -+|..+.++-||- -.++.-|...++.+ |
T Consensus 483 ~~~~~~~~~gl~~~~~~~elV~~sGhgkngaL~V~r~s-------I~P~i~t~fel~G-c~~iWtV~~~~~~~~~~~~~h 554 (1366)
T KOG1896|consen 483 RSSASEAVEGLSPHNKCLELVATSGHGKNGALSVIRRS-------IRPEIATEFELPG-CVDIWTVFIKGRKREEDNTQH 554 (1366)
T ss_pred ccchhhhccCCCCCCCeEEEEEeccCCCCcceEEEeec-------ccceeeEEEEecC-eeeEEEEEEeccccccccCcc
Confidence 3566777788888887333333333 23577776433 3556777776654 56788888877766 8
Q ss_pred ceEEEEeeeeeEEEEecccceeeeecc--cCCCcEEEeeccCCCcceEEEecCCcceee--------eeeec-----eee
Q psy1644 75 DVVYLITKFGYIHVYDIETATCIYMNR--ISSDTIFVTAPHEATGGIIGLTNVGINVGS--------VSFNT-----LTM 139 (923)
Q Consensus 75 G~iyliTK~G~lhl~Dlet~t~iy~~r--IS~~~if~ta~~~~t~Gil~vn~~G~~~~s--------i~~~t-----~tm 139 (923)
+++-+-|+.....++--|.-..+--.+ ..+.|||+.... ..+-|+||...|++.+- |.|+- .|-
T Consensus 555 ~~lilS~e~~t~il~tge~~~Ev~~s~f~~~~~Tl~~gnlg-~~rriVQVtp~~~rllDg~~r~lq~i~fd~~~~vv~~s 633 (1366)
T KOG1896|consen 555 LYLILSTESRTMILETGEELLEVSGSGFTRDGPTLFAGNLG-NERRIVQVTPSGLRLLDGDLRMLQRIPFDSGAIVVQTS 633 (1366)
T ss_pred eEEEeecccchhhhhccchhhhcccceeEeccceEEEEecC-CceEEEEEccceeEEecCcchheeEeccccCCcEEEEe
Confidence 888888888777766555444444333 456899998755 56789999999887653 33332 122
Q ss_pred cccceeEEEecccccceEEEEecCCC
Q psy1644 140 ESDKFICVREKIAESAQVVIIDMNDP 165 (923)
Q Consensus 140 ~sdk~ic~~~~~~~~~~vviid~~~~ 165 (923)
-.|-|||||.-.++ +++.+|.+-
T Consensus 634 v~dpyv~v~~~~g~---i~~~~l~~~ 656 (1366)
T KOG1896|consen 634 VADPYVAVRSSEGR---ITLYDLEEK 656 (1366)
T ss_pred ccCceEEEEEcCCc---eEEEEeccc
Confidence 35889999998765 999999876
No 135
>KOG2003|consensus
Probab=26.39 E-value=67 Score=38.59 Aligned_cols=67 Identities=21% Similarity=0.341 Sum_probs=48.3
Q ss_pred HHHHHHhcCCCCcccccccchhhHHH----HHHHHHHhcChhHHHHHHHHhCC-CccHHHHHHHHHhcCCch
Q psy1644 440 YFGILLDHGKLNKHESLELCRPVLAQ----VIQSFAETGQFQKIVLYAKKVGY-SPDYVFLLRNVMRVNPDQ 506 (923)
Q Consensus 440 YF~~LL~~g~LN~~ESlEL~rpVL~Q----Vi~clae~Gqfdkiv~Y~~~vgy-tpDy~~LL~~i~~~nP~~ 506 (923)
|..+-+..-.+|.+|--.|++|-.+. +..||-++|.++|+...-++..- -|.-+.-|+-++|.+-+-
T Consensus 636 yidtqf~ekai~y~ekaaliqp~~~kwqlmiasc~rrsgnyqka~d~yk~~hrkfpedldclkflvri~~dl 707 (840)
T KOG2003|consen 636 YIDTQFSEKAINYFEKAALIQPNQSKWQLMIASCFRRSGNYQKAFDLYKDIHRKFPEDLDCLKFLVRIAGDL 707 (840)
T ss_pred HHhhHHHHHHHHHHHHHHhcCccHHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCccchHHHHHHHHHhccc
Confidence 44555555677888888889987776 99999999999999988777653 344444555566666553
No 136
>COG3172 NadR Predicted ATPase/kinase involved in NAD metabolism [Coenzyme metabolism]
Probab=26.29 E-value=1.8e+02 Score=31.05 Aligned_cols=55 Identities=27% Similarity=0.290 Sum_probs=45.8
Q ss_pred HHHHHHHHHHcCCCCchHHHHHHHHHHHhcCc----HHHHHHHHHhcCChhHHHHHHHhccc
Q psy1644 809 PDLALRIAVRNNLAGAEDLFVRKFNLLFTNGQ----YSEAAKAACKTGQIKEVERICRESNC 866 (923)
Q Consensus 809 ~~LA~kLA~R~~LPGAddLf~~qFn~l~~~G~----y~eAAk~A~ktg~i~e~eri~r~~~~ 866 (923)
.-||-+||.+.|=|-|.+ |.+.|..-...|+ |.+-|++| +||++-.++..|+++.
T Consensus 22 StLv~kLA~~fnt~~~wE-Y~Re~v~~~l~gdeal~y~Dy~~ia--~Gq~~~~d~av~~a~~ 80 (187)
T COG3172 22 STLVNKLANIFNTTSAWE-YGREYVFEHLGGDEALQYSDYAKIA--LGQAAYEDAAVRYANK 80 (187)
T ss_pred HHHHHHHHHHhCCCchhH-HHHHHHHHHhCCchhhhhccHHHHH--hhhHHHHHHHHhCCCc
Confidence 458899999999888876 7788877777776 77777776 6999999999999886
No 137
>KOG0305|consensus
Probab=25.72 E-value=1.7e+02 Score=35.30 Aligned_cols=88 Identities=24% Similarity=0.365 Sum_probs=63.8
Q ss_pred eeEEEeccCCeEEEEecCeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeecccc
Q psy1644 313 VVFWKWISPNTLALVTETSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVE 392 (923)
Q Consensus 313 VvfWkWis~~tLglVT~taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~e 392 (923)
..+-.|-+.+.+|+=-+..||-|+-.... -..+++-+ ..+|..+.-+++++=+++ |...|.+|+|.++
T Consensus 180 ~nlldWss~n~laValg~~vylW~~~s~~-v~~l~~~~----~~~vtSv~ws~~G~~Lav-------G~~~g~v~iwD~~ 247 (484)
T KOG0305|consen 180 LNLLDWSSANVLAVALGQSVYLWSASSGS-VTELCSFG----EELVTSVKWSPDGSHLAV-------GTSDGTVQIWDVK 247 (484)
T ss_pred hhHhhcccCCeEEEEecceEEEEecCCCc-eEEeEecC----CCceEEEEECCCCCEEEE-------eecCCeEEEEehh
Confidence 45778999999999999999999864222 23334333 678999999999887654 3578999999976
Q ss_pred c-cccccccc-ceeEEeccCCC
Q psy1644 393 R-KCSQPIEE-KKSRVAANAPK 412 (923)
Q Consensus 393 r-~~SQ~iEG-k~AkvAA~SP~ 412 (923)
+ +.-.-+.| +.++|++-+=+
T Consensus 248 ~~k~~~~~~~~h~~rvg~laW~ 269 (484)
T KOG0305|consen 248 EQKKTRTLRGSHASRVGSLAWN 269 (484)
T ss_pred hccccccccCCcCceeEEEecc
Confidence 4 44444555 77787776644
No 138
>cd01449 TST_Repeat_2 Thiosulfate sulfurtransferase (TST), C-terminal, catalytic domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the second repeat. Only the second repeat contains the catalytically active Cys residue.
Probab=25.67 E-value=1e+02 Score=28.35 Aligned_cols=42 Identities=19% Similarity=0.244 Sum_probs=31.2
Q ss_pred ChHHHHhHHhhccCCCCCceEEeeccccchhHHHHHHHhcCc
Q psy1644 575 NAERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNL 616 (923)
Q Consensus 575 dpekvKnfLKeakL~d~~PLI~VCDrf~fV~dLv~YLY~n~~ 616 (923)
+|+..+.++++.+++...|+|++|+.=..-......|-.-+.
T Consensus 62 ~~~~~~~~~~~~~~~~~~~iv~yc~~g~~s~~~~~~l~~~G~ 103 (118)
T cd01449 62 SPEELRALFAALGITPDKPVIVYCGSGVTACVLLLALELLGY 103 (118)
T ss_pred CHHHHHHHHHHcCCCCCCCEEEECCcHHHHHHHHHHHHHcCC
Confidence 578899999999998999999999984344555555544433
No 139
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=25.46 E-value=1.2e+02 Score=29.06 Aligned_cols=56 Identities=14% Similarity=0.222 Sum_probs=41.6
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecccCC--ceEEeeccCCCCcEEEEcCCCeEEE
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRISSD--TIFVTAPHEATGGIIGVNRKGQVLS 790 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRIS~d--tIFvta~~~~t~Gii~VNrkGqVL~ 790 (923)
.+--.-+.+.|.++.=.--+..|+|...+..+ ..|.+..+ ..+-.+++|++|+...
T Consensus 51 s~~YLcmn~~G~ly~~~~~~~~C~F~e~~~~n~y~~~~s~~~-~~~~yla~~~~G~~~~ 108 (122)
T PF00167_consen 51 SCRYLCMNKCGRLYGSKNFNKDCVFREELLENGYNTYESAKY-GRGWYLAFNRRGKPRR 108 (122)
T ss_dssp TTEEEEEBTTSBEEEESSBTGGGEEEEEEETTSEEEEEESTT-GTTEBCEBCTTSBBEB
T ss_pred ceEEEEECCCCeEccccccCCCceEEEEEccCCEEEEEeccC-CccEEEEECCCCCCCC
Confidence 34456688899999988888899999998653 34444433 3578899999998754
No 140
>KOG0276|consensus
Probab=25.46 E-value=91 Score=38.69 Aligned_cols=78 Identities=21% Similarity=0.238 Sum_probs=50.7
Q ss_pred ccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCC-ceeEEEeccCCeEEEEecCeEEEEecc--CCCCc
Q psy1644 267 IMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTD-DVVFWKWISPNTLALVTETSVYHWSME--GDSTP 343 (923)
Q Consensus 267 IMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E-~VvfWkWis~~tLglVT~taVYHWsi~--g~s~P 343 (923)
=|||+++++--- .+ ...+||||-|+...+||+...| ||-==|.|--..- +||.+-=++-+|- +.-.-
T Consensus 20 d~HPtePw~la~-Ly--------nG~V~IWnyetqtmVksfeV~~~PvRa~kfiaRknW-iv~GsDD~~IrVfnynt~ek 89 (794)
T KOG0276|consen 20 DFHPTEPWILAA-LY--------NGDVQIWNYETQTMVKSFEVSEVPVRAAKFIARKNW-IVTGSDDMQIRVFNYNTGEK 89 (794)
T ss_pred ecCCCCceEEEe-ee--------cCeeEEEecccceeeeeeeecccchhhheeeeccce-EEEecCCceEEEEeccccee
Confidence 399999865322 11 2589999999999999999854 6654444433332 4555533344442 34478
Q ss_pred hhhhhcccCCC
Q psy1644 344 EKMFDRHSTLN 354 (923)
Q Consensus 344 ~K~FdR~~~L~ 354 (923)
+++|+-|+..-
T Consensus 90 V~~FeAH~DyI 100 (794)
T KOG0276|consen 90 VKTFEAHSDYI 100 (794)
T ss_pred eEEeeccccce
Confidence 88999988764
No 141
>KOG0547|consensus
Probab=25.09 E-value=69 Score=38.81 Aligned_cols=55 Identities=38% Similarity=0.536 Sum_probs=48.1
Q ss_pred HHHHHHhcCcHHHHHH-------------------HHH--hcCChhHHHHHHHhcccCChHHHHhHHhhcccCCCc
Q psy1644 831 KFNLLFTNGQYSEAAK-------------------AAC--KTGQIKEVERICRESNCYNAERVKNFLKEAKLTDQL 885 (923)
Q Consensus 831 qFn~l~~~G~y~eAAk-------------------~A~--ktg~i~e~eri~r~~~~y~~~~vk~~l~~~kl~~~~ 885 (923)
+=|++|.+|.|.+|.| +|| -+|..++|-..|-++=-.+|.-||-.|..|+-.+++
T Consensus 121 ~GN~~f~~kkY~eAIkyY~~AI~l~p~epiFYsNraAcY~~lgd~~~Vied~TkALEl~P~Y~KAl~RRA~A~E~l 196 (606)
T KOG0547|consen 121 KGNKFFRNKKYDEAIKYYTQAIELCPDEPIFYSNRAACYESLGDWEKVIEDCTKALELNPDYVKALLRRASAHEQL 196 (606)
T ss_pred hhhhhhhcccHHHHHHHHHHHHhcCCCCchhhhhHHHHHHHHhhHHHHHHHHHHHhhcCcHHHHHHHHHHHHHHhh
Confidence 3468899999999999 676 699999999999999889999999999998766654
No 142
>KOG2063|consensus
Probab=24.97 E-value=1.8e+02 Score=37.47 Aligned_cols=101 Identities=17% Similarity=0.109 Sum_probs=73.2
Q ss_pred cceEEEEecCCcceeeee-cccCCceEEeeccCCCCcEEEEcC-CCeEEEEEec--CCcccccccccCCCHHHHHHHHHH
Q psy1644 743 FGYIHVYDIETATCIYMN-RISSDTIFVTAPHEATGGIIGVNR-KGQVLSVSVD--EDNIIPYINGTLQNPDLALRIAVR 818 (923)
Q Consensus 743 ~GflhlyDleTgt~Iy~n-RIS~dtIFvta~~~~t~Gii~VNr-kGqVL~V~Id--e~~iVpYI~~~L~n~~LA~kLA~R 818 (923)
.+|+.+++..++.++-+. -++.-..++++. .|++.+.. .|--..+-+. +.+|-.++.+ ..-+.|+.||.-
T Consensus 259 ~~~veI~s~~~~qlvQSI~~~~~~~~l~s~~----~~i~~~~~~s~v~~L~p~~~~~~qi~~lL~~--k~fe~ai~L~e~ 332 (877)
T KOG2063|consen 259 DRSVEIRSKLDGQLVQSITPLSNGRSLLSAH----NGIIFVASLSNVWILVPVSNFEKQIQDLLQE--KSFEEAISLAEI 332 (877)
T ss_pred cccEEEEeccCHHHhhccccccccceeeecC----CcEEEEEeccceEEEEeccchHHHHHHHHHh--hhHHHHHHHHhc
Confidence 349999999999888887 777766665553 37777776 6666555554 3344444443 346899999999
Q ss_pred cCCCCchHHHHHHHH-------HHHhcCcHHHHHHHHH
Q psy1644 819 NNLAGAEDLFVRKFN-------LLFTNGQYSEAAKAAC 849 (923)
Q Consensus 819 ~~LPGAddLf~~qFn-------~l~~~G~y~eAAk~A~ 849 (923)
.+.|+.+..+.-.+- .+|.||+|+||-+.=.
T Consensus 333 ~~~~~p~~~~~i~~~~~l~~a~~lf~q~~f~ea~~~F~ 370 (877)
T KOG2063|consen 333 LDSPNPKEKRQISCIKILIDAFELFLQKQFEEAMSLFE 370 (877)
T ss_pred cCCCChHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 999999988765553 3788999999988433
No 143
>PRK02889 tolB translocation protein TolB; Provisional
Probab=24.67 E-value=1.1e+03 Score=27.15 Aligned_cols=67 Identities=15% Similarity=0.224 Sum_probs=36.1
Q ss_pred ceEEEEEEecCCccEEEEEecC-CCCCCCCCcccceeeeecCCCccCCcceeEEeccccceEEEEee--eeeEEEEeccc
Q psy1644 17 STLFCFAVRTPAGGKLHIIEVG-TPPAGNTPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITK--FGYIHVYDIET 93 (923)
Q Consensus 17 stL~cfa~r~~~~gkLhiiEig-~~~~g~~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK--~G~lhl~Dlet 93 (923)
++-+.|.........|.+++.. ..+..-....+......|+||. .-+.|+-.. ...|++||+.+
T Consensus 163 ~~~iayv~~~~~~~~L~~~D~dG~~~~~l~~~~~~v~~p~wSPDG-------------~~la~~s~~~~~~~I~~~dl~~ 229 (427)
T PRK02889 163 STRIAYVIKTGNRYQLQISDADGQNAQSALSSPEPIISPAWSPDG-------------TKLAYVSFESKKPVVYVHDLAT 229 (427)
T ss_pred ccEEEEEEccCCccEEEEECCCCCCceEeccCCCCcccceEcCCC-------------CEEEEEEccCCCcEEEEEECCC
Confidence 4678888865445689998873 2211111222223344444531 123333322 34699999999
Q ss_pred cee
Q psy1644 94 ATC 96 (923)
Q Consensus 94 ~t~ 96 (923)
|.+
T Consensus 230 g~~ 232 (427)
T PRK02889 230 GRR 232 (427)
T ss_pred CCE
Confidence 975
No 144
>PRK04792 tolB translocation protein TolB; Provisional
Probab=24.50 E-value=1.2e+03 Score=27.35 Aligned_cols=121 Identities=13% Similarity=0.233 Sum_probs=63.8
Q ss_pred CCceEEEEECCCCCC--CcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCC-ceeEEE
Q psy1644 241 ESAQVVIIDMNDPTN--PIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTD-DVVFWK 317 (923)
Q Consensus 241 ~~~qVvIVDL~n~~~--v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E-~VvfWk 317 (923)
+..++.++|+..... .+..+-...+.-..|..+-||+-... + ....|.++|+++++ ++..+-.. ....=.
T Consensus 240 g~~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~---~---g~~~Iy~~dl~tg~-~~~lt~~~~~~~~p~ 312 (448)
T PRK04792 240 RKAEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSK---D---GQPEIYVVDIATKA-LTRITRHRAIDTEPS 312 (448)
T ss_pred CCcEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeC---C---CCeEEEEEECCCCC-eEECccCCCCccceE
Confidence 446888888865432 11112112345678888877764211 1 12357777887653 22221111 122335
Q ss_pred eccC-CeEEEEecC----eEEEEeccCCCCchhh-hhcccCCCCCeeEeeeeCCCCCeEEEEee
Q psy1644 318 WISP-NTLALVTET----SVYHWSMEGDSTPEKM-FDRHSTLNGCQIINYRTDPKQTWLLLIGI 375 (923)
Q Consensus 318 Wis~-~tLglVT~t----aVYHWsi~g~s~P~K~-FdR~~~L~~~QIInYr~d~~~kW~~LvGI 375 (923)
|-.+ +.|+++... .+|.|++++.. +.++ |+.+ ....-..++|++|++.++-
T Consensus 313 wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~-~~~Lt~~g~------~~~~~~~SpDG~~l~~~~~ 369 (448)
T PRK04792 313 WHPDGKSLIFTSERGGKPQIYRVNLASGK-VSRLTFEGE------QNLGGSITPDGRSMIMVNR 369 (448)
T ss_pred ECCCCCEEEEEECCCCCceEEEEECCCCC-EEEEecCCC------CCcCeeECCCCCEEEEEEe
Confidence 6543 467777653 69999886533 3333 2221 1223356888899888653
No 145
>KOG2005|consensus
Probab=24.43 E-value=1.7e+02 Score=36.78 Aligned_cols=69 Identities=17% Similarity=0.343 Sum_probs=50.2
Q ss_pred CcccccccccCCCHHHHHHHHHHcC-CCCchHHH--HHHHHHHHhcCcHHHHHHHHHhcCChhHHHHHHHhcc
Q psy1644 796 DNIIPYINGTLQNPDLALRIAVRNN-LAGAEDLF--VRKFNLLFTNGQYSEAAKAACKTGQIKEVERICRESN 865 (923)
Q Consensus 796 ~~iVpYI~~~L~n~~LA~kLA~R~~-LPGAddLf--~~qFn~l~~~G~y~eAAk~A~ktg~i~e~eri~r~~~ 865 (923)
+.+++|+-.+ +-.-+++=|-++.+ +||-||.. .....=++..++|.+|-+.|.+.+.+++++.+|+-..
T Consensus 208 d~l~~~Vd~~-n~~RvclYl~sc~~~lP~Pdd~~ll~~a~~IYlKf~~~~~al~~ai~l~~~~~v~~vf~s~~ 279 (878)
T KOG2005|consen 208 DLLLDYVDEH-NYQRVCLYLTSCVPLLPGPDDVALLRTALKIYLKFNEYPRALVGAIRLDDMKEVKEVFTSCT 279 (878)
T ss_pred hHHHHHhhhh-hHHHHHHHHHHHhhcCCCchhhHHHHHHHHHHHHHHHhHHHHHHHHhcCcHHHHHHHHHhcc
Confidence 3456666543 12235555655555 79999754 4456678889999999999999999999999988644
No 146
>PF10415 FumaraseC_C: Fumarase C C-terminus; InterPro: IPR018951 Fumarase C catalyses the stereo-specific interconversion of fumarate to L-malate as part of the Krebs cycle. The full-length protein forms a tetramer with visible globular shape. FumaraseC_C is the C-terminal 65 residues referred to as domain 3. The core of the molecule consists of a bundle of 20 alpha-helices from the five-helix bundle of domain 2. The projections from the core of the tetramer are generated from domains 1 and 3 of each subunit []. This entry does not appear to be part of either the active site or the activation site but is helical in structure forming a little bundle. ; GO: 0016829 lyase activity, 0006099 tricarboxylic acid cycle; PDB: 3RRP_A 3OCE_D 3OCF_D 3E04_B 3GTD_A 3R6V_F 3R6Q_F 1J3U_B 1FUR_A 1YFE_A ....
Probab=24.35 E-value=65 Score=27.79 Aligned_cols=42 Identities=24% Similarity=0.434 Sum_probs=31.8
Q ss_pred HHHHHHHHH---hcCChhHHHHHHHhcccCChHHHHhHHhhcccCCC
Q psy1644 841 YSEAAKAAC---KTGQIKEVERICRESNCYNAERVKNFLKEAKLTDQ 884 (923)
Q Consensus 841 y~eAAk~A~---ktg~i~e~eri~r~~~~y~~~~vk~~l~~~kl~~~ 884 (923)
|..||++|. ++| +++..++.+...-++|.+.++|.=.++|.+
T Consensus 11 Ye~aa~iAk~A~~~g--~svre~v~~~g~lt~ee~d~ll~p~~mt~P 55 (55)
T PF10415_consen 11 YEKAAEIAKEALAEG--RSVREVVLEEGLLTEEELDELLDPERMTNP 55 (55)
T ss_dssp HHHHHHHHHHHHHHT----HHHHHHHTTSS-HHHHHHHTSHHHHTT-
T ss_pred cHHHHHHHHHHHHcC--CCHHHHHHHcCCCCHHHHHHHcCHHHcCCc
Confidence 788888765 466 688899999998899999999988777653
No 147
>PF00568 WH1: WH1 domain; InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][]. WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,]. Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=24.34 E-value=85 Score=29.95 Aligned_cols=66 Identities=12% Similarity=0.350 Sum_probs=43.3
Q ss_pred hcccchhhhHHHHhHhc--Cccc-eeeeeeeeccccccEEEEE----eccceEEEEecCCcceeeeecccCCceEEeec
Q psy1644 701 IKQSRFVHDLVLYLYRN--NLQK-YIEIYVQKVSAKYDVVYLI----TKFGYIHVYDIETATCIYMNRISSDTIFVTAP 772 (923)
Q Consensus 701 g~q~~~v~nalakiy~~--~~~~-~~~i~~~qvS~KyGvIyvI----TK~GflhlyDleTgt~Iy~nRIS~dtIFvta~ 772 (923)
|..-+.+..|.|.+|.. ++++ ..+. .-.|++.++ .+.=+|.+||+.++..+....|..+..|..+.
T Consensus 5 g~~~~~i~~~vA~v~~~~p~~~~~W~~~------~~~g~v~~v~d~~~~~y~I~~~~~~~~~~v~e~~l~~~~~Y~~~~ 77 (111)
T PF00568_consen 5 GRRCQSIATAVAQVYQADPDTKRQWSPV------KGTGVVCFVKDNSRRSYFIRLYDLQDGKVVWEQELYPGFVYTKAR 77 (111)
T ss_dssp ---EEEEEEEEEEEEEEETTTSESEEES------SSEEEEEEEEETTTTEEEEEEEETTTTEEEEEEEESTT-EEEEES
T ss_pred CcCceEEEEEEEEEEEEEcCCCCcEeeC------CeEEEEEEEEECCCCEEEEEEEEccccEEEEEeEecCCCEEEeCC
Confidence 45556677777777733 3333 4432 345666666 24456999999999999999999998886543
No 148
>PRK14574 hmsH outer membrane protein; Provisional
Probab=24.00 E-value=3.4e+02 Score=34.84 Aligned_cols=53 Identities=15% Similarity=0.038 Sum_probs=38.4
Q ss_pred HHHHHHhcChhHHHHHHHHhCC-Cc-cHHHHHHH--HH--hcCCchhhHHHHhhhcCCC
Q psy1644 467 IQSFAETGQFQKIVLYAKKVGY-SP-DYVFLLRN--VM--RVNPDQGVGFAQMLVQDDE 519 (923)
Q Consensus 467 i~clae~Gqfdkiv~Y~~~vgy-tp-Dy~~LL~~--i~--~~nP~~a~efA~~L~~~~~ 519 (923)
+..+...|+.++++.|+++.-- .| .|..++-. +. .-++++|+++.+.+.+.+|
T Consensus 75 l~l~~~~G~~~~A~~~~eka~~p~n~~~~~llalA~ly~~~gdyd~Aiely~kaL~~dP 133 (822)
T PRK14574 75 LQIAGWAGRDQEVIDVYERYQSSMNISSRGLASAARAYRNEKRWDQALALWQSSLKKDP 133 (822)
T ss_pred HHHHHHcCCcHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCC
Confidence 5677788999999999999873 22 35555522 22 2588999999988877653
No 149
>cd01448 TST_Repeat_1 Thiosulfate sulfurtransferase (TST), N-terminal, inactive domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the 1st repeat, which does not contain the catalytically active Cys residue. The role of the 1st repeat is uncertain, but it is believed to be involved in protein interaction.
Probab=23.81 E-value=1.4e+02 Score=27.93 Aligned_cols=45 Identities=11% Similarity=-0.009 Sum_probs=35.6
Q ss_pred CccChHHHHhHHhhccCCCCCceEEeecc-ccchhHHHHHHHhcCc
Q psy1644 572 NCYNAERVKNFLKEAKLTDQLPLIIVCDR-FDFVHDLVLYLYRNNL 616 (923)
Q Consensus 572 n~YdpekvKnfLKeakL~d~~PLI~VCDr-f~fV~dLv~YLY~n~~ 616 (923)
-..++|+...+++..+++...|+|+.|+. .........+|-..++
T Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~vv~~c~~g~~~a~~~~~~l~~~G~ 105 (122)
T cd01448 60 MLPSPEEFAELLGSLGISNDDTVVVYDDGGGFFAARAWWTLRYFGH 105 (122)
T ss_pred CCCCHHHHHHHHHHcCCCCCCEEEEECCCCCccHHHHHHHHHHcCC
Confidence 35677889999999999999999999988 5666666667766554
No 150
>KOG0289|consensus
Probab=23.69 E-value=1.4e+03 Score=27.91 Aligned_cols=139 Identities=18% Similarity=0.257 Sum_probs=75.6
Q ss_pred eeeecceEEEEEeccCCCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccccccc
Q psy1644 225 LTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKM 304 (923)
Q Consensus 225 lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KL 304 (923)
+.-.+|+++|--+...+..-.++.|= .+. +.--||..||...|.+.= +....|.|||+-...-+
T Consensus 319 lsAs~d~~w~Fsd~~~g~~lt~vs~~--~s~-----v~~ts~~fHpDgLifgtg---------t~d~~vkiwdlks~~~~ 382 (506)
T KOG0289|consen 319 LSASNDGTWAFSDISSGSQLTVVSDE--TSD-----VEYTSAAFHPDGLIFGTG---------TPDGVVKIWDLKSQTNV 382 (506)
T ss_pred EEecCCceEEEEEccCCcEEEEEeec--ccc-----ceeEEeeEcCCceEEecc---------CCCceEEEEEcCCcccc
Confidence 45567888888776655432222221 111 334589999999998875 23568999999766533
Q ss_pred ccccCCC-ceeEEEeccCCeEEEEe--cC-eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccc
Q psy1644 305 KAHPMTD-DVVFWKWISPNTLALVT--ET-SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSN 380 (923)
Q Consensus 305 Ks~~m~E-~VvfWkWis~~tLglVT--~t-aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~ 380 (923)
..|.-.+ +|-=-+ .+++--=||| +. +|.-|++.-.- -|.-...=.+-.+-.|.-|.+++.+++-|=
T Consensus 383 a~Fpght~~vk~i~-FsENGY~Lat~add~~V~lwDLRKl~----n~kt~~l~~~~~v~s~~fD~SGt~L~~~g~----- 452 (506)
T KOG0289|consen 383 AKFPGHTGPVKAIS-FSENGYWLATAADDGSVKLWDLRKLK----NFKTIQLDEKKEVNSLSFDQSGTYLGIAGS----- 452 (506)
T ss_pred ccCCCCCCceeEEE-eccCceEEEEEecCCeEEEEEehhhc----ccceeeccccccceeEEEcCCCCeEEeecc-----
Confidence 3332211 111111 1233322444 33 49999884211 222221112234777888888888755531
Q ss_pred cccceeeeccccc
Q psy1644 381 RVVGAMQLYSVER 393 (923)
Q Consensus 381 ~i~G~mQLYS~er 393 (923)
.+|.|-..+
T Consensus 453 ----~l~Vy~~~k 461 (506)
T KOG0289|consen 453 ----DLQVYICKK 461 (506)
T ss_pred ----eeEEEEEec
Confidence 466666553
No 151
>KOG0269|consensus
Probab=23.50 E-value=1.3e+02 Score=38.14 Aligned_cols=111 Identities=22% Similarity=0.366 Sum_probs=73.9
Q ss_pred cceEEEEEeccCCCceEEEEECCCCCCCcccccCccc-----cccCCCccEEEEeccCCCCCCCCCCCeEEEEecc-ccc
Q psy1644 229 SDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADS-----AIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIE-MKS 302 (923)
Q Consensus 229 SDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~ADS-----AIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle-~K~ 302 (923)
.++|..+-|. .-|--.||-.|-...++ ..|-. -=.||++..||==| --+++.||+.. .|.
T Consensus 189 ~~~F~s~~ds----G~lqlWDlRqp~r~~~k-~~AH~GpV~c~nwhPnr~~lATGG---------RDK~vkiWd~t~~~~ 254 (839)
T KOG0269|consen 189 GNKFASIHDS----GYLQLWDLRQPDRCEKK-LTAHNGPVLCLNWHPNREWLATGG---------RDKMVKIWDMTDSRA 254 (839)
T ss_pred CceEEEecCC----ceEEEeeccCchhHHHH-hhcccCceEEEeecCCCceeeecC---------CCccEEEEeccCCCc
Confidence 5566666543 35667777666544433 12221 12589998998754 24799999998 566
Q ss_pred ccccccC-CCceeEEEeccC--CeE---EEEecCeEEEEeccCCCCchhhhhcccCC
Q psy1644 303 KMKAHPM-TDDVVFWKWISP--NTL---ALVTETSVYHWSMEGDSTPEKMFDRHSTL 353 (923)
Q Consensus 303 KLKs~~m-~E~VvfWkWis~--~tL---glVT~taVYHWsi~g~s~P~K~FdR~~~L 353 (923)
|.++..- --+|-=-||=-+ .-| ++|-+++|+.|+|...--|--+|+-|-+-
T Consensus 255 ~~~~tInTiapv~rVkWRP~~~~hLAtcsmv~dtsV~VWDvrRPYIP~~t~~eH~~~ 311 (839)
T KOG0269|consen 255 KPKHTINTIAPVGRVKWRPARSYHLATCSMVVDTSVHVWDVRRPYIPYATFLEHTDS 311 (839)
T ss_pred cceeEEeecceeeeeeeccCccchhhhhhccccceEEEEeeccccccceeeeccCcc
Confidence 6655433 335555556444 444 58999999999999888999999888654
No 152
>KOG2377|consensus
Probab=23.23 E-value=4.5e+02 Score=32.19 Aligned_cols=92 Identities=18% Similarity=0.360 Sum_probs=59.8
Q ss_pred eEEEEEecc-------CCCceE-EEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEec---c
Q psy1644 231 KFICVREKI-------AESAQV-VIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNI---E 299 (923)
Q Consensus 231 kyICVRE~~-------~~~~qV-vIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNl---e 299 (923)
.-+|+|... |.+..| .-+||.+.. ++| |--..|..|+||+-- ..+.+|.||- +
T Consensus 36 qlfavrSggatgvvvkgpndDVpiSfdm~d~G-~I~------SIkFSlDnkilAVQR---------~~~~v~f~nf~~d~ 99 (657)
T KOG2377|consen 36 QLFAVRSGGATGVVVKGPNDDVPISFDMDDKG-EIK------SIKFSLDNKILAVQR---------TSKTVDFCNFIPDN 99 (657)
T ss_pred eEEEEecCCeeEEEEeCCCCCCCceeeecCCC-cee------EEEeccCcceEEEEe---------cCceEEEEecCCCc
Confidence 357777543 222222 335666554 222 334678899999972 5678999998 5
Q ss_pred ccccccc-cc-CCCceeEEEeccCCeEEEEecCeEEEEecc
Q psy1644 300 MKSKMKA-HP-MTDDVVFWKWISPNTLALVTETSVYHWSME 338 (923)
Q Consensus 300 ~K~KLKs-~~-m~E~VvfWkWis~~tLglVT~taVYHWsi~ 338 (923)
++.-+-. |. =++.+.=+-|.+...+|+|+.+-+=.+.+.
T Consensus 100 ~~l~~~~~ck~k~~~IlGF~W~~s~e~A~i~~~G~e~y~v~ 140 (657)
T KOG2377|consen 100 SQLEYTQECKTKNANILGFCWTSSTEIAFITDQGIEFYQVL 140 (657)
T ss_pred hhhHHHHHhccCcceeEEEEEecCeeEEEEecCCeEEEEEc
Confidence 5543221 22 256788889999999999999987666554
No 153
>PF13970 DUF4221: Domain of unknown function (DUF4221); PDB: 3S9J_A.
Probab=23.13 E-value=1.5e+02 Score=32.93 Aligned_cols=63 Identities=22% Similarity=0.320 Sum_probs=38.3
Q ss_pred ccccEEEEEecc-ceEEEEecCCcceeeeec----------------ccCCceEEeeccCCCCcEEEEcCCCeEEE-EEe
Q psy1644 732 AKYDVVYLITKF-GYIHVYDIETATCIYMNR----------------ISSDTIFVTAPHEATGGIIGVNRKGQVLS-VSV 793 (923)
Q Consensus 732 ~KyGvIyvITK~-GflhlyDleTgt~Iy~nR----------------IS~dtIFvta~~~~t~Gii~VNrkGqVL~-V~I 793 (923)
.|..+.+.-+.. --|++||++++.++-+-. .+.|.||+... ....++..+|.+|+|+. +.+
T Consensus 54 ~~~yL~f~n~~~~~~i~~~Dl~~~~l~~~i~~ekeGpngi~~~~~~~~~~Dsi~l~~~-~~~~~l~~~n~~G~~~~~~~~ 132 (333)
T PF13970_consen 54 GKKYLYFLNNYKSHSIDIYDLDSGKLVKKIPFEKEGPNGIGRPFGFFQNLDSIFLFNS-YAFPKLFLFNSQGEVLKKIDL 132 (333)
T ss_dssp TEEEEEEEE-ST--EEEEEETTTTEEEEEEE-BSSSTTB-TT---EEESSSTTSEEEE-GGGTEEEEE-TT--EEEEEE-
T ss_pred CcEEEEEEcCCCcceEEEEECCCCceeeeeeeeeECCCCccccccceEcCCceEEEec-CCcceEEEEcCCCeEEEEEec
Confidence 333344455554 689999999998874322 56677777766 55689999999999875 344
Q ss_pred cC
Q psy1644 794 DE 795 (923)
Q Consensus 794 de 795 (923)
.+
T Consensus 133 ~~ 134 (333)
T PF13970_consen 133 EE 134 (333)
T ss_dssp --
T ss_pred cc
Confidence 33
No 154
>PF04840 Vps16_C: Vps16, C-terminal region; InterPro: IPR006925 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=23.06 E-value=2.1e+02 Score=32.48 Aligned_cols=56 Identities=23% Similarity=0.303 Sum_probs=45.5
Q ss_pred CCHHHHHHHHHHcCCCCchHHHHHHHHHHHhcCcHHHHHHHHHhcCChhHHHHHHHhcccCCh
Q psy1644 807 QNPDLALRIAVRNNLAGAEDLFVRKFNLLFTNGQYSEAAKAACKTGQIKEVERICRESNCYNA 869 (923)
Q Consensus 807 ~n~~LA~kLA~R~~LPGAddLf~~qFn~l~~~G~y~eAAk~A~ktg~i~e~eri~r~~~~y~~ 869 (923)
.++.-|.+.-.|. | + .++.+-++.-|+|.+||+.|.+.+.+.-++.|-+..+--++
T Consensus 251 ~~~~eA~~yI~k~--~---~--~~rv~~y~~~~~~~~A~~~A~~~kd~~~L~~i~~~~~~~~~ 306 (319)
T PF04840_consen 251 GNKKEASKYIPKI--P---D--EERVEMYLKCGDYKEAAQEAFKEKDIDLLKQILKRCPGNND 306 (319)
T ss_pred CCHHHHHHHHHhC--C---h--HHHHHHHHHCCCHHHHHHHHHHcCCHHHHHHHHHHCCCCCh
Confidence 5677888877773 3 2 67788899999999999999999999999999877654443
No 155
>PRK04043 tolB translocation protein TolB; Provisional
Probab=23.02 E-value=1.3e+03 Score=27.14 Aligned_cols=118 Identities=7% Similarity=0.136 Sum_probs=68.8
Q ss_pred CceEEEEECCCCCCCcccccCcc-----ccccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCC-ceeE
Q psy1644 242 SAQVVIIDMNDPTNPIRRPISAD-----SAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTD-DVVF 315 (923)
Q Consensus 242 ~~qVvIVDL~n~~~v~Rrpi~AD-----SAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E-~Vvf 315 (923)
..+|.++|+.... ++++... +.-..|..+-||+-... + ....|-+.|+++++ ++..+-.+ ....
T Consensus 212 ~~~Iyv~dl~tg~---~~~lt~~~g~~~~~~~SPDG~~la~~~~~---~---g~~~Iy~~dl~~g~-~~~LT~~~~~d~~ 281 (419)
T PRK04043 212 KPTLYKYNLYTGK---KEKIASSQGMLVVSDVSKDGSKLLLTMAP---K---GQPDIYLYDTNTKT-LTQITNYPGIDVN 281 (419)
T ss_pred CCEEEEEECCCCc---EEEEecCCCcEEeeEECCCCCEEEEEEcc---C---CCcEEEEEECCCCc-EEEcccCCCccCc
Confidence 4588888886543 2333211 23468887766665221 0 12456666776654 33332222 2346
Q ss_pred EEeccC-CeEEEEecC----eEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeec
Q psy1644 316 WKWISP-NTLALVTET----SVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISA 377 (923)
Q Consensus 316 WkWis~-~tLglVT~t----aVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~ 377 (923)
..|-.+ +.|+++++. .+|..++++.. +.++..- +. .+...++|++|++.+.-..
T Consensus 282 p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~-~~rlt~~-g~------~~~~~SPDG~~Ia~~~~~~ 340 (419)
T PRK04043 282 GNFVEDDKRIVFVSDRLGYPNIFMKKLNSGS-VEQVVFH-GK------NNSSVSTYKNYIVYSSRET 340 (419)
T ss_pred cEECCCCCEEEEEECCCCCceEEEEECCCCC-eEeCccC-CC------cCceECCCCCEEEEEEcCC
Confidence 678765 479999876 89999997543 2333321 11 1347899999999887543
No 156
>KOG2110|consensus
Probab=22.94 E-value=1.3e+03 Score=27.40 Aligned_cols=99 Identities=15% Similarity=0.253 Sum_probs=64.8
Q ss_pred CceEEEEECCCCCCCcccc--cCcccc-ccCCCccEEEEeccCCCCCCCCCCCeEEEEecccccccccccCCCceeEEEe
Q psy1644 242 SAQVVIIDMNDPTNPIRRP--ISADSA-IMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVVFWKW 318 (923)
Q Consensus 242 ~~qVvIVDL~n~~~v~Rrp--i~ADSA-IMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~VvfWkW 318 (923)
..+|++.|+.|-..++--+ =+.=+| ..||..+.||=- ...|+.+.||.+...+|++.|
T Consensus 152 ~GdV~l~d~~nl~~v~~I~aH~~~lAalafs~~G~llATA--------SeKGTVIRVf~v~~G~kl~eF----------- 212 (391)
T KOG2110|consen 152 SGDVVLFDTINLQPVNTINAHKGPLAALAFSPDGTLLATA--------SEKGTVIRVFSVPEGQKLYEF----------- 212 (391)
T ss_pred CceEEEEEcccceeeeEEEecCCceeEEEECCCCCEEEEe--------ccCceEEEEEEcCCccEeeee-----------
Confidence 4489999998876654422 111111 257777777764 236999999999999999885
Q ss_pred ccCCeEEEEecCeEEEEeccCCCCchhhhhcccCCCCCeeEeeeeCCCCCeEEEEeeecccccccceeeeccccccccc
Q psy1644 319 ISPNTLALVTETSVYHWSMEGDSTPEKMFDRHSTLNGCQIINYRTDPKQTWLLLIGISAVSNRVVGAMQLYSVERKCSQ 397 (923)
Q Consensus 319 is~~tLglVT~taVYHWsi~g~s~P~K~FdR~~~L~~~QIInYr~d~~~kW~~LvGI~~~~~~i~G~mQLYS~er~~SQ 397 (923)
|.+.. -++|.+-.-++|.+.+...|= .|.+-+|..+....+
T Consensus 213 ------------------------------RRG~~-~~~IySL~Fs~ds~~L~~sS~-------TeTVHiFKL~~~~~~ 253 (391)
T KOG2110|consen 213 ------------------------------RRGTY-PVSIYSLSFSPDSQFLAASSN-------TETVHIFKLEKVSNN 253 (391)
T ss_pred ------------------------------eCCce-eeEEEEEEECCCCCeEEEecC-------CCeEEEEEecccccC
Confidence 22223 467888888888887655552 346677776654433
No 157
>PF00568 WH1: WH1 domain; InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][]. WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,]. Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=22.84 E-value=1.5e+02 Score=28.39 Aligned_cols=29 Identities=14% Similarity=0.465 Sum_probs=24.9
Q ss_pred eeEEEEecccceeeeecccCCCcEEEeec
Q psy1644 84 GYIHVYDIETATCIYMNRISSDTIFVTAP 112 (923)
Q Consensus 84 G~lhl~Dlet~t~iy~~rIS~~~if~ta~ 112 (923)
=+|.++|+.++..+....|.++..+..+.
T Consensus 49 y~I~~~~~~~~~~v~e~~l~~~~~Y~~~~ 77 (111)
T PF00568_consen 49 YFIRLYDLQDGKVVWEQELYPGFVYTKAR 77 (111)
T ss_dssp EEEEEEETTTTEEEEEEEESTT-EEEEES
T ss_pred EEEEEEEccccEEEEEeEecCCCEEEeCC
Confidence 46999999999999999999999888753
No 158
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=22.71 E-value=2.1e+02 Score=28.24 Aligned_cols=59 Identities=14% Similarity=0.159 Sum_probs=46.1
Q ss_pred cccEEEEEeccceEEEEecCCcceeeeecccC--CceEEeeccCCCCcEEEEcCCCeEEEEE
Q psy1644 733 KYDVVYLITKFGYIHVYDIETATCIYMNRISS--DTIFVTAPHEATGGIIGVNRKGQVLSVS 792 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~--dtIFvta~~~~t~Gii~VNrkGqVL~V~ 792 (923)
+.+.-.-+.+.|.+|.=+.-+-.|+|..++.. -..|.+..+. +.=.+++||+|++-...
T Consensus 52 ~s~~YLCmn~~G~ly~s~~~~~dC~F~E~~~~n~y~~y~S~~~~-~~~ylal~~~G~~r~~~ 112 (126)
T smart00442 52 ASCRYLCMNKCGKLYGSKNFTEDCVFREEMEENGYNTYASAKYR-KRWYVALNKKGRPRRGQ 112 (126)
T ss_pred ccceEEEECCCCCEEEcccCCCCcEEEEEeccCCeEEEEEcccC-CceEEEECCCCCEeEcc
Confidence 44555668899999998889999999999985 5677666553 13688999999987764
No 159
>KOG0269|consensus
Probab=22.28 E-value=6.7e+02 Score=32.28 Aligned_cols=198 Identities=20% Similarity=0.297 Sum_probs=119.8
Q ss_pred eeeeeceeecccceeEEEeccc-------ccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCccc
Q psy1644 131 SVSFNTLTMESDKFICVREKIA-------ESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEF 203 (923)
Q Consensus 131 si~~~t~tm~sdk~ic~~~~~~-------~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~ 203 (923)
+.+|.+..+--|+.+|-+++-. +-+|||.+- +.+= +..=+||+++.+-|.=.-|
T Consensus 21 t~~~~ds~l~~~~~~~~~k~~~nAIs~nr~~~qiv~AG-----rs~l-----klyai~~~~~~~~~~~~~k--------- 81 (839)
T KOG0269|consen 21 TSSFNDSYLPYDKMNCKLKAKANAISVNRDINQIVVAG-----RSLL-----KLYAINPNDFSEKCNHRFK--------- 81 (839)
T ss_pred ccccccccccchheeeecccccceEeecCCcceeEEec-----ccce-----eeEeeCcccCCcceeeecc---------
Confidence 4567777788889999665421 335555421 1111 2233477776665543322
Q ss_pred ceeeeccccccccCCcccccceeeecceEEEEEeccCCCceEEEEECCCCCCCccccc---------CccccccCCCccE
Q psy1644 204 KNRIYLLTNVGINVGSVSFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPI---------SADSAIMNPASKV 274 (923)
Q Consensus 204 ~~e~~~Lt~lGI~~~si~F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi---------~ADSAIMnP~~~I 274 (923)
.=+++-++.+.++++ +|-| .+|-..-+ +..|++.||..+ .|.+. +|-.-=.|++.+=
T Consensus 82 -----~kqn~~~S~~DVkW~--~~~~-NlIAT~s~---nG~i~vWdlnk~---~rnk~l~~f~EH~Rs~~~ldfh~tep~ 147 (839)
T KOG0269|consen 82 -----TKQNKFYSAADVKWG--QLYS-NLIATCST---NGVISVWDLNKS---IRNKLLTVFNEHERSANKLDFHSTEPN 147 (839)
T ss_pred -----cccceeeehhhcccc--cchh-hhheeecC---CCcEEEEecCcc---ccchhhhHhhhhccceeeeeeccCCcc
Confidence 113344566667777 4443 45555433 337999999874 33333 4556668898888
Q ss_pred EEEeccCCCCCCCCCCCeEEEEecccccccccccC-CC---ceeEEEeccCCeEEEEecC-eEEEEeccCCCCchhhhhc
Q psy1644 275 IALKGKAGNDNNPNAPKTLQIFNIEMKSKMKAHPM-TD---DVVFWKWISPNTLALVTET-SVYHWSMEGDSTPEKMFDR 349 (923)
Q Consensus 275 IALra~~~~~n~~~~g~~LQIFNle~K~KLKs~~m-~E---~VvfWkWis~~tLglVT~t-aVYHWsi~g~s~P~K~FdR 349 (923)
|-|-|+ .-.++.+||+..|.-..++.- .| ||.|-.=- +++.+=+-+. -|--|+|--...++++|.-
T Consensus 148 iliSGS--------QDg~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~-~~~F~s~~dsG~lqlWDlRqp~r~~~k~~A 218 (839)
T KOG0269|consen 148 ILISGS--------QDGTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGY-GNKFASIHDSGYLQLWDLRQPDRCEKKLTA 218 (839)
T ss_pred EEEecC--------CCceEEEEeeecccccccccccchhhhceeeccCC-CceEEEecCCceEEEeeccCchhHHHHhhc
Confidence 888763 346999999988765555544 33 56655433 4444444444 6678999988999999988
Q ss_pred ccCCCCCeeEeeeeCCCCCeEEEEe
Q psy1644 350 HSTLNGCQIINYRTDPKQTWLLLIG 374 (923)
Q Consensus 350 ~~~L~~~QIInYr~d~~~kW~~LvG 374 (923)
|-.-- ==.|| .+++.|++-.|
T Consensus 219 H~GpV--~c~nw--hPnr~~lATGG 239 (839)
T KOG0269|consen 219 HNGPV--LCLNW--HPNREWLATGG 239 (839)
T ss_pred ccCce--EEEee--cCCCceeeecC
Confidence 84321 12334 45888988777
No 160
>KOG0646|consensus
Probab=22.17 E-value=6.2e+02 Score=30.68 Aligned_cols=184 Identities=17% Similarity=0.248 Sum_probs=98.6
Q ss_pred eccccceEEEEeeeeeEEEEecccc-------eeeeecccCCCcEEEeeccCCCcceEEEecCCcceeeeeeeceeeccc
Q psy1644 70 VSAKYDVVYLITKFGYIHVYDIETA-------TCIYMNRISSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTLTMESD 142 (923)
Q Consensus 70 iS~k~G~iyliTK~G~lhl~Dlet~-------t~iy~~rIS~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~tm~sd 142 (923)
+|.-=-.|+--.|.|-||+|++-+. +.-+....++-+.=+|-.+...|| ..+ ++++.|. |
T Consensus 131 fs~dgs~iiTgskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg-----~~~-rl~TaS~-------D 197 (476)
T KOG0646|consen 131 FSDDGSHIITGSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSGG-----TNA-RLYTASE-------D 197 (476)
T ss_pred EeCCCcEEEecCCCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCCC-----ccc-eEEEecC-------C
Confidence 3333344555568999999988543 344444455555555544444444 222 3444444 7
Q ss_pred ceeEEEecccccceEEEEecCCCCCCCccCccccccccCcccchhhhcCCCCCCCCCCcccceeeeccc--cccccCCcc
Q psy1644 143 KFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNGEFKNRIYLLT--NVGINVGSV 220 (923)
Q Consensus 143 k~ic~~~~~~~~~~vviid~~~~~~~~r~~~~a~~~im~p~~~i~al~~~~~~~~~~~~~~~~e~~~Lt--~lGI~~~si 220 (923)
.-||+.+--.+ +++....-|. +| -+++|.|+.+++=+=+.-|+- |..-+.++. +-|++....
T Consensus 198 ~t~k~wdlS~g---~LLlti~fp~-----si--~av~lDpae~~~yiGt~~G~I------~~~~~~~~~~~~~~v~~k~~ 261 (476)
T KOG0646|consen 198 RTIKLWDLSLG---VLLLTITFPS-----SI--KAVALDPAERVVYIGTEEGKI------FQNLLFKLSGQSAGVNQKGR 261 (476)
T ss_pred ceEEEEEeccc---eeeEEEecCC-----cc--eeEEEcccccEEEecCCcceE------EeeehhcCCccccccccccc
Confidence 88999998776 6665555552 22 458999999988654443321 111122333 334443333
Q ss_pred cccceeeecceEEEEEeccCCCceEEEEECCCCCCCcccccCccccccCCCccEEEEeccCCCCCCCCCCCeEEEEeccc
Q psy1644 221 SFNTLTMESDKFICVREKIAESAQVVIIDMNDPTNPIRRPISADSAIMNPASKVIALKGKAGNDNNPNAPKTLQIFNIEM 300 (923)
Q Consensus 221 ~F~~lTmESDkyICVRE~~~~~~qVvIVDL~n~~~v~Rrpi~ADSAIMnP~~~IIALra~~~~~n~~~~g~~LQIFNle~ 300 (923)
.+.+++..+ -.|. .+..++++--+++|.- .++- |. ....+-||++..
T Consensus 262 ~~~~t~~~~--------~~Gh---------~~~~~ITcLais~Dgt-------lLlS-Gd--------~dg~VcvWdi~S 308 (476)
T KOG0646|consen 262 HEENTQINV--------LVGH---------ENESAITCLAISTDGT-------LLLS-GD--------EDGKVCVWDIYS 308 (476)
T ss_pred ccccceeee--------eccc---------cCCcceeEEEEecCcc-------EEEe-eC--------CCCCEEEEecch
Confidence 333332221 1111 1123444444444432 2221 11 235799999999
Q ss_pred cccccccc-CCCceeE
Q psy1644 301 KSKMKAHP-MTDDVVF 315 (923)
Q Consensus 301 K~KLKs~~-m~E~Vvf 315 (923)
|+-+|+.+ +--+|.+
T Consensus 309 ~Q~iRtl~~~kgpVtn 324 (476)
T KOG0646|consen 309 KQCIRTLQTSKGPVTN 324 (476)
T ss_pred HHHHHHHhhhccccce
Confidence 99999987 6665543
No 161
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=22.10 E-value=2.2e+02 Score=32.55 Aligned_cols=53 Identities=13% Similarity=0.110 Sum_probs=36.5
Q ss_pred HHHHHHHhcChhHHHHHHHHh-CCCccH--HHHHHHHHhcCCchhhHHHHhhhcCC
Q psy1644 466 VIQSFAETGQFQKIVLYAKKV-GYSPDY--VFLLRNVMRVNPDQGVGFAQMLVQDD 518 (923)
Q Consensus 466 Vi~clae~Gqfdkiv~Y~~~v-gytpDy--~~LL~~i~~~nP~~a~efA~~L~~~~ 518 (923)
....+...|+.+++....++. .-+||- ..+.-.+..-+|+++.+++..+.+..
T Consensus 269 ~A~~l~~~g~~~~A~~~L~~~l~~~~~~~l~~l~~~l~~~~~~~al~~~e~~lk~~ 324 (398)
T PRK10747 269 MAEHLIECDDHDTAQQIILDGLKRQYDERLVLLIPRLKTNNPEQLEKVLRQQIKQH 324 (398)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHhhccCCChHHHHHHHHHHHhhC
Confidence 677888999999998887655 323333 23334444568888988888777653
No 162
>PF10395 Utp8: Utp8 family; InterPro: IPR018843 Utp8 is an essential component of the nuclear tRNA export machinery in Saccharomyces cerevisiae (Baker's yeast). It is a tRNA binding protein that acts at a step between tRNA maturation /aminoacylation, and translocation of the tRNA across the nuclear pore complex [].
Probab=21.84 E-value=3.5e+02 Score=34.09 Aligned_cols=71 Identities=21% Similarity=0.374 Sum_probs=53.1
Q ss_pred CceEEEEEEecCCccEEEEEecC-CCCCCCCCccc--ceeeeecCCCccCCcceeEEeccccceEEEEeeeeeEEEEec
Q psy1644 16 PSTLFCFAVRTPAGGKLHIIEVG-TPPAGNTPLAK--KAVDVFFPPEAANDFPVAMQVSAKYDVVYLITKFGYIHVYDI 91 (923)
Q Consensus 16 ~stL~cfa~r~~~~gkLhiiEig-~~~~g~~~f~k--K~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK~G~lhl~Dl 91 (923)
...+.|++...-..-+|.++|.+ ..++.+..-.+ +...+. .+|=.|++.++.+...||++.+.|-+-+||.
T Consensus 85 ~~~~~~~glt~rKk~~ll~i~~~~~~~~~~~~~~e~~~~~~~k-----l~~kvv~Ik~~~~~~~I~vvl~nG~i~~~d~ 158 (670)
T PF10395_consen 85 GKKLYCVGLTERKKFKLLLIERKVGSTEDGTVNSETTNEFELK-----LDDKVVGIKFSSDGKIIYVVLENGSIQIYDF 158 (670)
T ss_pred CcEEEEEEEeeCCeeEEEEEEccCccccccccCccccceEEEE-----cccceEEEEEecCCCEEEEEEcCCcEEEEec
Confidence 56789999888888899999997 22222222222 223332 4888899999999999999999999999999
No 163
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=21.60 E-value=3.1e+02 Score=28.91 Aligned_cols=74 Identities=15% Similarity=0.230 Sum_probs=49.4
Q ss_pred ceeeeeeeeccccccEEEEEeccceEEEEecCCcceeeee---------------cccCCceEEe-eccCCCCcEEEEcC
Q psy1644 721 KYIEIYVQKVSAKYDVVYLITKFGYIHVYDIETATCIYMN---------------RISSDTIFVT-APHEATGGIIGVNR 784 (923)
Q Consensus 721 ~~~~i~~~qvS~KyGvIyvITK~GflhlyDleTgt~Iy~n---------------RIS~dtIFvt-a~~~~t~Gii~VNr 784 (923)
+-.+|....+.+.++++.+++- |.+++||+.+....... ..+....|.+ .....++-.++|-.
T Consensus 34 ~~~~I~ql~vl~~~~~llvLsd-~~l~~~~L~~l~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~ 112 (275)
T PF00780_consen 34 KLSSITQLSVLPELNLLLVLSD-GQLYVYDLDSLEPVSTSAPLAFPKSRSLPTKLPETKGVSFFAVNGGHEGSRRLCVAV 112 (275)
T ss_pred ecceEEEEEEecccCEEEEEcC-CccEEEEchhhccccccccccccccccccccccccCCeeEEeeccccccceEEEEEE
Confidence 4455777789999999999986 99999999999988843 2344566666 23333334444443
Q ss_pred CCeEEEEEecC
Q psy1644 785 KGQVLSVSVDE 795 (923)
Q Consensus 785 kGqVL~V~Ide 795 (923)
+-.++-...+.
T Consensus 113 kk~i~i~~~~~ 123 (275)
T PF00780_consen 113 KKKILIYEWND 123 (275)
T ss_pred CCEEEEEEEEC
Confidence 33555555544
No 164
>KOG0319|consensus
Probab=21.41 E-value=1.3e+02 Score=37.75 Aligned_cols=73 Identities=19% Similarity=0.282 Sum_probs=50.1
Q ss_pred ccccccEEEEEeccceEEEEecCCcceeeeecccCCceE-EeeccCCCCcEEEEcCCCeEEEEEecCCcccccc
Q psy1644 730 VSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDTIF-VTAPHEATGGIIGVNRKGQVLSVSVDEDNIIPYI 802 (923)
Q Consensus 730 vS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dtIF-vta~~~~t~Gii~VNrkGqVL~V~Ide~~iVpYI 802 (923)
...|-+.+|..--.|-+.+||.|||.|+|..|.+++.=+ -.++....++.++|...-+++-+..|+-+|+.-|
T Consensus 246 ~~~~~~~~~TaG~~g~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~~~~~~l~vtaeQnl~l~d~~~l~i~k~i 319 (775)
T KOG0319|consen 246 LGGKGEYIITAGGSGVVQYWDSESGKCVYKQRQSDSEEIDHLLAIESMSQLLLVTAEQNLFLYDEDELTIVKQI 319 (775)
T ss_pred cCCcceEEEEecCCceEEEEecccchhhhhhccCCchhhhcceeccccCceEEEEccceEEEEEccccEEehhh
Confidence 444555677777789999999999999999999952223 2334455667777777777777755554444333
No 165
>KOG0282|consensus
Probab=21.01 E-value=55 Score=39.17 Aligned_cols=59 Identities=15% Similarity=0.142 Sum_probs=36.1
Q ss_pred CccEEEEEecCCCCCCC---CCcccceeeeecCCCccCCcceeEEeccccceEEEEee-eeeEEEEecccceeeeecc
Q psy1644 28 AGGKLHIIEVGTPPAGN---TPLAKKAVDVFFPPEAANDFPVAMQVSAKYDVVYLITK-FGYIHVYDIETATCIYMNR 101 (923)
Q Consensus 28 ~~gkLhiiEig~~~~g~---~~f~kK~vd~~Fppe~~~DFPvamqiS~k~G~iyliTK-~G~lhl~Dlet~t~iy~~r 101 (923)
+.+++-|-|+=+...-- .+..|-+-|+-|.++ |.=||-+. .+||+|||.|||+|+.+..
T Consensus 235 mD~~vklW~vy~~~~~lrtf~gH~k~Vrd~~~s~~---------------g~~fLS~sfD~~lKlwDtETG~~~~~f~ 297 (503)
T KOG0282|consen 235 MDGLVKLWNVYDDRRCLRTFKGHRKPVRDASFNNC---------------GTSFLSASFDRFLKLWDTETGQVLSRFH 297 (503)
T ss_pred CCceEEEEEEecCcceehhhhcchhhhhhhhcccc---------------CCeeeeeecceeeeeeccccceEEEEEe
Confidence 44577777773211111 134455556666663 44455444 3799999999999998765
No 166
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=21.01 E-value=1.9e+02 Score=26.74 Aligned_cols=98 Identities=12% Similarity=0.132 Sum_probs=0.0
Q ss_pred hcCCCCCCCCchHHHHHHHHHhcCCCCcccccccchhhHHH----------HHHHHHHhcChhHHHHHHHHhCCCccHHH
Q psy1644 425 QQVPTPAGQTSPLLQYFGILLDHGKLNKHESLELCRPVLAQ----------VIQSFAETGQFQKIVLYAKKVGYSPDYVF 494 (923)
Q Consensus 425 K~vp~~pGq~sPLL~YF~~LL~~g~LN~~ESlEL~rpVL~Q----------Vi~clae~Gqfdkiv~Y~~~vgytpDy~~ 494 (923)
+.+...|+....+..+=..++..| +..+.++..+.+++. ...|+...|++++++.+
T Consensus 8 ~~l~~~p~~~~~~~~~a~~~~~~~--~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~------------ 73 (135)
T TIGR02552 8 DLLGLDSEQLEQIYALAYNLYQQG--RYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDA------------ 73 (135)
T ss_pred HHHcCChhhHHHHHHHHHHHHHcc--cHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHH------------
Q ss_pred HHHHHHhcCCchhhHHHHhhhcCCCCCcCccccchhhhhhhcccccccCCcchhhHHHHHHHhcC
Q psy1644 495 LLRNVMRVNPDQGVGFAQMLVQDDEPLADINQSYEGLFYFLGSIVNYSQDQEVHFKYIQAACKTG 559 (923)
Q Consensus 495 LL~~i~~~nP~~a~efA~~L~~~~~pl~di~~~~egLyy~Lgsivn~s~d~dVhfKYIeAA~k~g 559 (923)
++..+..+|+.. ..+|++|.+-....|.+--.++++.+.+..
T Consensus 74 -~~~~~~~~p~~~----------------------~~~~~la~~~~~~g~~~~A~~~~~~al~~~ 115 (135)
T TIGR02552 74 -YALAAALDPDDP----------------------RPYFHAAECLLALGEPESALKALDLAIEIC 115 (135)
T ss_pred -HHHHHhcCCCCh----------------------HHHHHHHHHHHHcCCHHHHHHHHHHHHHhc
No 167
>KOG0288|consensus
Probab=21.00 E-value=1.6e+02 Score=35.04 Aligned_cols=61 Identities=13% Similarity=0.115 Sum_probs=53.5
Q ss_pred eccccccEEEEEeccceEEEEecCCcceeeeecccCCc--eEEeeccCCCCcEEEEcCCCeEE
Q psy1644 729 KVSAKYDVVYLITKFGYIHVYDIETATCIYMNRISSDT--IFVTAPHEATGGIIGVNRKGQVL 789 (923)
Q Consensus 729 qvS~KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~dt--IFvta~~~~t~Gii~VNrkGqVL 789 (923)
++|+--..+---+-.|-+|+|+..||+|-+.-.-+... |+..+-...+.|.+...|.+-|-
T Consensus 394 vfSpd~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~~aI~s~~W~~sG~~Llsadk~~~v~ 456 (459)
T KOG0288|consen 394 VFSPDGSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSNAAITSLSWNPSGSGLLSADKQKAVT 456 (459)
T ss_pred EECCCCceeeeccCCCcEEEEEccCceEEEEeccCCCCcceEEEEEcCCCchhhcccCCcceE
Confidence 68888777777788999999999999999999999888 99888888999999999877653
No 168
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=20.98 E-value=67 Score=22.41 Aligned_cols=20 Identities=35% Similarity=0.584 Sum_probs=17.3
Q ss_pred HHHHHHhcCChhhhHHhhhc
Q psy1644 551 YIQAACKTGQIKEVERICRE 570 (923)
Q Consensus 551 YIeAA~k~gq~~EvERicre 570 (923)
.|++.++.|++.|++++-++
T Consensus 6 li~~~~~~~~~~~a~~~~~~ 25 (31)
T PF01535_consen 6 LISGYCKMGQFEEALEVFDE 25 (31)
T ss_pred HHHHHHccchHHHHHHHHHH
Confidence 58899999999999988654
No 169
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=20.97 E-value=87 Score=22.80 Aligned_cols=29 Identities=28% Similarity=0.479 Sum_probs=22.0
Q ss_pred hhhhhhcccccccCCcchhhHHHHHHHhcCC
Q psy1644 530 GLFYFLGSIVNYSQDQEVHFKYIQAACKTGQ 560 (923)
Q Consensus 530 gLyy~Lgsivn~s~d~dVhfKYIeAA~k~gq 560 (923)
|.+|+-|. ...+|++--++|.+.|++.|.
T Consensus 8 g~~~~~G~--g~~~d~~~A~~~~~~Aa~~g~ 36 (36)
T smart00671 8 GQMYEYGL--GVKKDLEKALEYYKKAAELGN 36 (36)
T ss_pred HHHHHcCC--CCCcCHHHHHHHHHHHHHccC
Confidence 44555565 457889999999999998773
No 170
>smart00517 PolyA C-terminal domain of Poly(A)-binding protein. Present also in Drosophila hyperplastics discs protein. Involved in homodimerisation (either directly or indirectly)
Probab=20.88 E-value=40 Score=30.28 Aligned_cols=30 Identities=23% Similarity=0.007 Sum_probs=24.0
Q ss_pred hhhccCccccceeee-eccccCCHHHhhhhhcc
Q psy1644 623 YVQKGRKHLLEKWLK-EEKLECSEELGDLVKQS 654 (923)
Q Consensus 623 YvqkvNp~~tP~Vvg-LlD~dc~E~~Ik~LL~s 654 (923)
-|++..|.-+++|-| ||++|-+|- -.||.|
T Consensus 16 ~V~~~~p~~A~KITGMLLEmd~~el--l~lle~ 46 (64)
T smart00517 16 KVQALEPELAGKITGMLLEMDNSEL--LHLLES 46 (64)
T ss_pred HHHhhCcccCCcCeeeeeCCCHHHH--HHHhcC
Confidence 367788999999999 999998664 466665
No 171
>KOG0091|consensus
Probab=20.79 E-value=56 Score=34.74 Aligned_cols=45 Identities=27% Similarity=0.335 Sum_probs=34.6
Q ss_pred ChHHHHhHHhhcccCCCcceEEeeh---------------hHHHHHHHHhchhhheeeee
Q psy1644 868 NAERVKNFLKEAKLTDQLPLIIVFV---------------HDLVLYLYRNNLQKYIEIYV 912 (923)
Q Consensus 868 ~~~~vk~~l~~~kl~~~~~l~~~~~---------------~~~~~~~~~~~~~~~~~~~~ 912 (923)
.=|.|+||+|||++.-|-|-=+||. -+=.+||-.-.=++|||.=+
T Consensus 96 sfehv~~w~~ea~m~~q~P~k~VFlLVGhKsDL~SqRqVt~EEaEklAa~hgM~FVETSa 155 (213)
T KOG0091|consen 96 SFEHVENWVKEAAMATQGPDKVVFLLVGHKSDLQSQRQVTAEEAEKLAASHGMAFVETSA 155 (213)
T ss_pred hHHHHHHHHHHHHHhcCCCCeeEEEEeccccchhhhccccHHHHHHHHHhcCceEEEecc
Confidence 4589999999999999988777742 23356777777888888643
No 172
>PRK05560 DNA gyrase subunit A; Validated
Probab=20.77 E-value=4.1e+02 Score=34.01 Aligned_cols=64 Identities=14% Similarity=0.166 Sum_probs=40.1
Q ss_pred ccceEEEEeeeeeEEEEecccceeeeec-----cc-CCCcEEEeeccCCCcceEEEecCCcceeeeeeece
Q psy1644 73 KYDVVYLITKFGYIHVYDIETATCIYMN-----RI-SSDTIFVTAPHEATGGIIGLTNVGINVGSVSFNTL 137 (923)
Q Consensus 73 k~G~iyliTK~G~lhl~Dlet~t~iy~~-----rI-S~~~if~ta~~~~t~Gil~vn~~G~~~~si~~~t~ 137 (923)
....++++|+.||+.-.++++=....++ ++ .+|-+.....-....-++.+++.| ++..+...++
T Consensus 603 ~e~~lvlvTk~GyiKRi~l~~~~~~~r~G~~~ikLke~D~lv~v~~~~~~d~lll~T~~G-r~~r~~~~eI 672 (805)
T PRK05560 603 DDKYLFFATKNGTVKKTSLSEFSNIRSNGIIAINLDEGDELIGVRLTDGDDDILLATKNG-KAIRFPESDV 672 (805)
T ss_pred CCCEEEEEeCCCEEEEEEhHHhhhcccCCceeeccCCCCEEEEEEEeCCCCEEEEEECCC-cEEEEEhhhc
Confidence 5678999999999998887653322221 11 234333222222334588899999 7877777655
No 173
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=20.32 E-value=4.4e+02 Score=32.47 Aligned_cols=146 Identities=16% Similarity=0.216 Sum_probs=0.0
Q ss_pred cCHHHHHhhhcCCCCCCCCchHHHHHHHHHhcCCCCcccccccchhhHHH-----------HHHHHHHhcChhHHHHHHH
Q psy1644 416 RTPQTIVKFQQVPTPAGQTSPLLQYFGILLDHGKLNKHESLELCRPVLAQ-----------VIQSFAETGQFQKIVLYAK 484 (923)
Q Consensus 416 Rt~eTI~rFK~vp~~pGq~sPLL~YF~~LL~~g~LN~~ESlEL~rpVL~Q-----------Vi~clae~Gqfdkiv~Y~~ 484 (923)
+..+-+.-++.+................++..|.+. |.++..+.++.. ...++.+.|++++++...+
T Consensus 159 ~~~eA~~~~~~~~~~~P~~~~a~~~~~~l~~~g~~~--eA~~~~~~~l~~~~~~~~~~~~~l~~~l~~~g~~~eA~~~~~ 236 (656)
T PRK15174 159 KELQAISLARTQAQEVPPRGDMIATCLSFLNKSRLP--EDHDLARALLPFFALERQESAGLAVDTLCAVGKYQEAIQTGE 236 (656)
T ss_pred ChHHHHHHHHHHHHhCCCCHHHHHHHHHHHHcCCHH--HHHHHHHHHHhcCCCcchhHHHHHHHHHHHCCCHHHHHHHHH
Q ss_pred Hh---------------------CCCcc----HHHHHHHHHhcCCchhh---HHHHhhhcCCCCCcCccccchhhhhhhc
Q psy1644 485 KV---------------------GYSPD----YVFLLRNVMRVNPDQGV---GFAQMLVQDDEPLADINQSYEGLFYFLG 536 (923)
Q Consensus 485 ~v---------------------gytpD----y~~LL~~i~~~nP~~a~---efA~~L~~~~~pl~di~~~~egLyy~Lg 536 (923)
+. |-..+ -...++.++..+|+.+. .++..+.+.+. ..|.+=+|--
T Consensus 237 ~al~~~p~~~~~~~~Lg~~l~~~G~~~eA~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~-------~~eA~~~l~~ 309 (656)
T PRK15174 237 SALARGLDGAALRRSLGLAYYQSGRSREAKLQAAEHWRHALQFNSDNVRIVTLYADALIRTGQ-------NEKAIPLLQQ 309 (656)
T ss_pred HHHhcCCCCHHHHHHHHHHHHHcCCchhhHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHCCC-------HHHHHHHHHH
Q ss_pred ccccccCCcchhhHHHHHHHhcCChhhhHHhhhc
Q psy1644 537 SIVNYSQDQEVHFKYIQAACKTGQIKEVERICRE 570 (923)
Q Consensus 537 sivn~s~d~dVhfKYIeAA~k~gq~~EvERicre 570 (923)
.+-...+++++++.+-.+..+.|++.|.....+.
T Consensus 310 al~l~P~~~~a~~~La~~l~~~G~~~eA~~~l~~ 343 (656)
T PRK15174 310 SLATHPDLPYVRAMYARALRQVGQYTAASDEFVQ 343 (656)
T ss_pred HHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH
No 174
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=20.21 E-value=2e+02 Score=33.90 Aligned_cols=29 Identities=14% Similarity=0.193 Sum_probs=26.0
Q ss_pred ccEEEEEeccceEEEEecCCcceeeeecc
Q psy1644 734 YDVVYLITKFGYIHVYDIETATCIYMNRI 762 (923)
Q Consensus 734 yGvIyvITK~GflhlyDleTgt~Iy~nRI 762 (923)
-|.+|+.+..|.++-+|++||+.+.+-+.
T Consensus 61 ~g~vy~~~~~g~l~AlD~~tG~~~W~~~~ 89 (488)
T cd00216 61 DGDMYFTTSHSALFALDAATGKVLWRYDP 89 (488)
T ss_pred CCEEEEeCCCCcEEEEECCCChhhceeCC
Confidence 57899999999999999999999987654
No 175
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=20.08 E-value=2e+02 Score=28.25 Aligned_cols=59 Identities=14% Similarity=0.132 Sum_probs=45.1
Q ss_pred cccEEEEEeccceEEEEecCCcceeeeecccC--CceEEeeccCCCCcEEEEcCCCeEEEEE
Q psy1644 733 KYDVVYLITKFGYIHVYDIETATCIYMNRISS--DTIFVTAPHEATGGIIGVNRKGQVLSVS 792 (923)
Q Consensus 733 KyGvIyvITK~GflhlyDleTgt~Iy~nRIS~--dtIFvta~~~~t~Gii~VNrkGqVL~V~ 792 (923)
+.+.-.-+.+.|++|.=+..+..|+|+.++.. -..|-+..+. .+=.+++||+|+.-...
T Consensus 48 ~s~~YLCmn~~G~ly~s~~~~~dC~F~E~~~~n~Y~~y~S~~~~-~~~ylal~~~G~~rr~~ 108 (123)
T cd00058 48 ASCRYLCMNKCGKLYGSKGFTEECLFREELLENNYNTYASAKYR-RRWYLALNKKGRPRRGQ 108 (123)
T ss_pred ccceEEEECCCCCEEECCCCCCCCEEEEEEccCCcEEEEEcccC-CCcEEEECCCCCCcccc
Confidence 44445568899999999999999999999985 4566555443 46789999999876543
No 176
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=20.02 E-value=3.7e+02 Score=31.78 Aligned_cols=103 Identities=14% Similarity=0.232 Sum_probs=59.3
Q ss_pred eeeccccccccCCcccccce--eeecceEEEEEeccCCCceEEEEECCCCCC--CcccccC-ccccccCCCccE-EEEec
Q psy1644 206 RIYLLTNVGINVGSVSFNTL--TMESDKFICVREKIAESAQVVIIDMNDPTN--PIRRPIS-ADSAIMNPASKV-IALKG 279 (923)
Q Consensus 206 e~~~Lt~lGI~~~si~F~~l--TmESDkyICVRE~~~~~~qVvIVDL~n~~~--v~Rrpi~-ADSAIMnP~~~I-IALra 279 (923)
++.+||+--....++-|.+= |=.++|++-.-+ .++..++..+||+.... .+--|-. -=.+++.|.++- +-+|
T Consensus 22 ~VtrLT~~~~~~h~~YF~~~~ft~dG~kllF~s~-~dg~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~- 99 (386)
T PF14583_consen 22 RVTRLTPPDGHSHRLYFYQNCFTDDGRKLLFASD-FDGNRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVK- 99 (386)
T ss_dssp EEEE-S-TTS-EE---TTS--B-TTS-EEEEEE--TTSS-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEE-
T ss_pred eEEEecCCCCcccceeecCCCcCCCCCEEEEEec-cCCCcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEE-
Confidence 55699998776666666554 444667777554 45667999999998642 2222210 015888898765 5688
Q ss_pred cCCCCCCCCCCCeEEEEecccccccccccCCCcee-EEEec
Q psy1644 280 KAGNDNNPNAPKTLQIFNIEMKSKMKAHPMTDDVV-FWKWI 319 (923)
Q Consensus 280 ~~~~~n~~~~g~~LQIFNle~K~KLKs~~m~E~Vv-fWkWi 319 (923)
+++.|.-.||++++.-.=++.+++-+ |=.|+
T Consensus 100 ---------~~~~l~~vdL~T~e~~~vy~~p~~~~g~gt~v 131 (386)
T PF14583_consen 100 ---------NGRSLRRVDLDTLEERVVYEVPDDWKGYGTWV 131 (386)
T ss_dssp ---------TTTEEEEEETTT--EEEEEE--TTEEEEEEEE
T ss_pred ---------CCCeEEEEECCcCcEEEEEECCccccccccee
Confidence 67899999999998777788888877 89997
Done!