Query psy18020
Match_columns 434
No_of_seqs 186 out of 372
Neff 6.7
Searched_HMMs 46136
Date Fri Aug 16 20:27:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18020.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18020hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2391|consensus 100.0 4.9E-72 1.1E-76 541.6 33.7 304 2-433 61-364 (365)
2 KOG2391|consensus 100.0 1.9E-28 4E-33 238.4 21.8 202 88-352 78-279 (365)
3 PF09454 Vps23_core: Vps23 cor 100.0 7E-29 1.5E-33 191.2 9.0 65 361-425 1-65 (65)
4 PF05743 UEV: UEV domain; Int 100.0 5.7E-29 1.2E-33 215.8 5.3 81 2-82 41-121 (121)
5 PF05743 UEV: UEV domain; Int 99.4 2.1E-14 4.5E-19 124.8 0.2 55 99-153 67-121 (121)
6 smart00212 UBCc Ubiquitin-conj 98.5 3.7E-07 8.1E-12 81.5 6.8 67 4-78 43-111 (145)
7 cd00195 UBCc Ubiquitin-conjuga 98.1 8E-06 1.7E-10 72.6 6.3 66 4-77 43-110 (141)
8 PTZ00390 ubiquitin-conjugating 97.5 0.00026 5.7E-09 64.0 6.9 63 10-78 50-113 (152)
9 PF00179 UQ_con: Ubiquitin-con 97.5 0.00018 3.9E-09 63.8 5.7 61 10-76 46-108 (140)
10 PLN00172 ubiquitin conjugating 97.5 0.00027 6E-09 63.5 6.4 67 4-78 45-112 (147)
11 KOG0819|consensus 97.4 0.00014 3.1E-09 72.3 4.0 136 270-412 22-180 (321)
12 PF07200 Mod_r: Modifier of ru 96.0 0.083 1.8E-06 47.3 10.9 53 370-425 97-149 (150)
13 KOG0424|consensus 95.3 0.036 7.8E-07 49.0 5.3 59 12-76 59-120 (158)
14 PF09304 Cortex-I_coil: Cortex 95.1 0.36 7.8E-06 40.8 10.7 66 287-352 12-77 (107)
15 COG5078 Ubiquitin-protein liga 95.0 0.044 9.5E-07 49.6 5.2 67 4-78 50-117 (153)
16 KOG0895|consensus 94.8 0.044 9.5E-07 62.1 5.8 73 11-86 900-979 (1101)
17 KOG1924|consensus 94.4 0.2 4.4E-06 55.2 9.3 36 384-419 906-947 (1102)
18 KOG1924|consensus 94.2 0.33 7.2E-06 53.6 10.4 16 397-412 891-906 (1102)
19 PF07889 DUF1664: Protein of u 94.1 1.6 3.6E-05 38.2 12.9 68 284-351 54-121 (126)
20 PF14461 Prok-E2_B: Prokaryoti 94.1 0.084 1.8E-06 46.5 5.0 66 8-76 35-104 (133)
21 PF09731 Mitofilin: Mitochondr 94.0 3.6 7.8E-05 45.0 18.3 36 380-415 406-445 (582)
22 COG1579 Zn-ribbon protein, pos 93.9 3.1 6.7E-05 40.5 15.5 113 283-395 16-133 (239)
23 KOG0971|consensus 93.6 5.3 0.00012 45.2 18.3 36 317-352 401-436 (1243)
24 PF06005 DUF904: Protein of un 93.4 2.2 4.7E-05 33.7 11.3 53 292-344 19-71 (72)
25 PF11932 DUF3450: Protein of u 93.3 3.4 7.5E-05 40.2 15.2 62 292-353 36-97 (251)
26 PRK10884 SH3 domain-containing 92.9 1.6 3.4E-05 41.6 11.7 50 303-352 116-165 (206)
27 PRK11637 AmiB activator; Provi 92.5 4.4 9.5E-05 42.6 15.7 24 322-345 99-122 (428)
28 KOG0427|consensus 92.5 0.27 5.8E-06 42.9 5.3 58 3-68 57-116 (161)
29 PRK11637 AmiB activator; Provi 92.5 11 0.00024 39.6 18.6 45 306-350 87-131 (428)
30 COG4026 Uncharacterized protei 92.4 3 6.5E-05 39.9 12.5 88 267-354 102-205 (290)
31 PF04156 IncA: IncA protein; 92.1 3.1 6.7E-05 38.5 12.5 58 295-352 92-149 (191)
32 KOG0417|consensus 92.0 0.26 5.6E-06 44.1 4.8 67 4-78 45-112 (148)
33 KOG0422|consensus 91.9 0.48 1E-05 41.9 6.2 58 13-76 53-112 (153)
34 KOG4657|consensus 91.7 4.7 0.0001 38.6 13.0 31 323-353 83-113 (246)
35 KOG0976|consensus 91.4 10 0.00022 42.6 16.7 95 317-419 136-245 (1265)
36 PF07889 DUF1664: Protein of u 91.3 2.9 6.3E-05 36.7 10.4 18 304-321 60-77 (126)
37 KOG0419|consensus 91.1 0.47 1E-05 41.5 5.3 67 4-78 48-115 (152)
38 PRK04778 septation ring format 91.1 10 0.00022 41.6 16.9 107 297-419 382-489 (569)
39 COG4942 Membrane-bound metallo 90.7 15 0.00033 38.5 16.8 35 378-412 138-173 (420)
40 PF05816 TelA: Toxic anion res 90.6 15 0.00032 37.5 16.6 146 276-428 73-218 (333)
41 PF12325 TMF_TATA_bd: TATA ele 90.5 6 0.00013 34.4 11.7 53 298-350 51-103 (120)
42 COG4942 Membrane-bound metallo 90.4 4.2 9.1E-05 42.6 12.5 36 358-395 131-170 (420)
43 KOG0933|consensus 90.2 13 0.00029 42.7 16.8 60 370-429 869-935 (1174)
44 PF09789 DUF2353: Uncharacteri 90.1 5.1 0.00011 40.6 12.5 95 298-396 133-227 (319)
45 KOG1853|consensus 90.1 20 0.00043 35.1 15.8 24 330-353 95-118 (333)
46 KOG0994|consensus 89.4 32 0.00069 40.4 18.9 33 322-354 1601-1633(1758)
47 PF04111 APG6: Autophagy prote 89.3 0.9 2E-05 45.9 6.6 76 310-396 55-130 (314)
48 PRK15422 septal ring assembly 89.2 5.8 0.00013 31.8 9.5 38 315-352 35-72 (79)
49 PHA02562 46 endonuclease subun 88.6 26 0.00055 37.8 17.7 9 62-70 40-48 (562)
50 TIGR03752 conj_TIGR03752 integ 88.6 5.8 0.00013 42.1 12.0 56 297-352 86-142 (472)
51 KOG3091|consensus 88.6 6.2 0.00013 41.9 12.1 31 396-430 455-485 (508)
52 PF09726 Macoilin: Transmembra 88.6 13 0.00028 41.8 15.4 108 317-431 543-657 (697)
53 KOG0420|consensus 88.6 0.6 1.3E-05 42.7 4.1 68 3-78 72-140 (184)
54 TIGR01541 tape_meas_lam_C phag 88.4 9.3 0.0002 39.0 13.1 40 368-409 165-204 (332)
55 PF14915 CCDC144C: CCDC144C pr 88.4 19 0.00042 36.0 14.7 110 307-420 76-196 (305)
56 PF07926 TPR_MLP1_2: TPR/MLP1/ 88.2 7.9 0.00017 34.0 11.0 75 296-374 54-128 (132)
57 TIGR00996 Mtu_fam_mce virulenc 88.2 5.7 0.00012 39.2 11.3 24 392-415 247-270 (291)
58 COG1463 Ttg2C ABC-type transpo 88.2 5.5 0.00012 40.9 11.6 89 292-381 174-262 (359)
59 PF06785 UPF0242: Uncharacteri 88.1 10 0.00022 38.4 12.7 16 381-396 207-222 (401)
60 PF11985 DUF3486: Protein of u 88.1 6 0.00013 36.6 10.7 106 324-433 52-167 (180)
61 PF04799 Fzo_mitofusin: fzo-li 88.1 1.3 2.8E-05 40.8 6.0 43 305-350 102-144 (171)
62 TIGR02449 conserved hypothetic 87.8 9.6 0.00021 29.5 9.7 57 296-352 5-61 (65)
63 PF06160 EzrA: Septation ring 87.6 48 0.001 36.3 19.1 99 299-413 380-478 (560)
64 COG5185 HEC1 Protein involved 87.5 28 0.0006 37.0 15.8 31 380-410 345-375 (622)
65 PF13094 CENP-Q: CENP-Q, a CEN 87.4 19 0.00041 32.5 13.4 59 294-352 34-92 (160)
66 PF07888 CALCOCO1: Calcium bin 87.0 37 0.0008 37.0 17.1 31 309-339 196-226 (546)
67 KOG0421|consensus 87.0 0.64 1.4E-05 41.2 3.2 67 3-78 72-140 (175)
68 PF08614 ATG16: Autophagy prot 86.9 8.5 0.00018 36.0 11.1 58 295-352 127-185 (194)
69 PF03148 Tektin: Tektin family 86.8 42 0.00092 34.9 17.5 41 375-415 320-360 (384)
70 PF11180 DUF2968: Protein of u 86.6 16 0.00034 34.3 12.3 61 290-350 125-185 (192)
71 PF09738 DUF2051: Double stran 86.5 39 0.00084 34.1 16.6 57 296-352 117-173 (302)
72 PF10473 CENP-F_leu_zip: Leuci 86.4 16 0.00035 32.6 11.9 56 297-352 51-106 (140)
73 PRK12704 phosphodiesterase; Pr 86.3 21 0.00045 38.8 15.0 20 401-420 170-189 (520)
74 COG4026 Uncharacterized protei 86.2 5.7 0.00012 38.0 9.3 64 284-347 139-205 (290)
75 PF04108 APG17: Autophagy prot 86.1 26 0.00056 36.8 15.3 57 323-382 67-123 (412)
76 PF11887 DUF3407: Protein of u 85.9 14 0.0003 36.5 12.5 116 296-412 33-152 (267)
77 PRK14140 heat shock protein Gr 85.9 16 0.00034 34.5 12.1 91 285-379 42-132 (191)
78 TIGR00996 Mtu_fam_mce virulenc 85.8 18 0.0004 35.6 13.4 97 296-393 169-266 (291)
79 PF14662 CCDC155: Coiled-coil 85.8 31 0.00068 32.4 14.1 77 317-397 72-148 (193)
80 KOG1655|consensus 85.7 4.4 9.4E-05 38.0 8.1 72 330-408 130-201 (218)
81 smart00787 Spc7 Spc7 kinetocho 85.7 32 0.00069 34.9 15.1 15 418-432 275-289 (312)
82 PF14662 CCDC155: Coiled-coil 85.6 32 0.00069 32.3 14.9 79 295-377 64-142 (193)
83 PF10146 zf-C4H2: Zinc finger- 85.6 11 0.00025 36.4 11.4 37 317-353 65-101 (230)
84 smart00787 Spc7 Spc7 kinetocho 85.6 28 0.0006 35.3 14.6 43 300-342 153-195 (312)
85 COG1579 Zn-ribbon protein, pos 85.5 38 0.00082 33.0 17.3 19 371-389 123-141 (239)
86 PRK00106 hypothetical protein; 85.4 31 0.00067 37.6 15.7 16 403-418 187-202 (535)
87 KOG0995|consensus 85.2 15 0.00032 39.8 12.8 49 302-350 277-325 (581)
88 PRK10803 tol-pal system protei 85.0 21 0.00045 35.2 13.2 31 367-397 142-172 (263)
89 PF10046 BLOC1_2: Biogenesis o 84.9 22 0.00047 29.7 11.8 23 331-353 71-93 (99)
90 PRK10884 SH3 domain-containing 84.8 11 0.00024 35.8 10.8 15 333-347 132-146 (206)
91 PRK15422 septal ring assembly 84.8 19 0.00041 28.9 10.6 48 297-344 24-78 (79)
92 COG0576 GrpE Molecular chapero 84.7 15 0.00032 34.6 11.4 59 294-354 50-108 (193)
93 PF05266 DUF724: Protein of un 84.7 34 0.00074 32.1 13.8 20 372-391 159-178 (190)
94 PF10146 zf-C4H2: Zinc finger- 84.4 19 0.0004 34.9 12.3 31 320-350 40-70 (230)
95 PRK14161 heat shock protein Gr 84.3 24 0.00052 32.8 12.5 77 300-378 39-115 (178)
96 PF12329 TMF_DNA_bd: TATA elem 84.3 14 0.00029 29.3 9.4 54 294-347 15-68 (74)
97 PRK09039 hypothetical protein; 84.3 42 0.0009 34.4 15.5 14 409-422 189-204 (343)
98 PF10475 DUF2450: Protein of u 84.2 47 0.001 33.0 17.4 75 278-352 44-121 (291)
99 KOG4460|consensus 84.2 31 0.00067 37.4 14.5 37 317-353 607-643 (741)
100 TIGR03495 phage_LysB phage lys 84.2 20 0.00044 31.8 11.3 40 311-350 60-99 (135)
101 PF09726 Macoilin: Transmembra 84.0 22 0.00048 39.9 14.3 52 288-339 422-480 (697)
102 KOG0994|consensus 84.0 58 0.0013 38.4 17.3 47 383-429 1560-1606(1758)
103 PF10018 Med4: Vitamin-D-recep 84.0 13 0.00029 34.5 10.9 18 397-414 78-95 (188)
104 PF06005 DUF904: Protein of un 83.9 19 0.00042 28.4 10.5 51 297-351 14-64 (72)
105 PF11932 DUF3450: Protein of u 83.7 18 0.0004 35.1 12.2 55 291-345 49-103 (251)
106 PRK11519 tyrosine kinase; Prov 83.7 52 0.0011 37.1 17.3 111 300-425 273-388 (719)
107 KOG4657|consensus 83.6 20 0.00042 34.5 11.6 38 315-352 89-126 (246)
108 PF12718 Tropomyosin_1: Tropom 83.6 32 0.0007 30.7 13.6 104 302-414 18-122 (143)
109 COG1196 Smc Chromosome segrega 83.6 31 0.00067 41.1 16.1 39 370-408 444-482 (1163)
110 PF13747 DUF4164: Domain of un 83.5 23 0.00051 29.0 10.9 9 281-289 20-28 (89)
111 COG1463 Ttg2C ABC-type transpo 83.5 16 0.00034 37.6 12.2 119 303-425 164-289 (359)
112 COG3074 Uncharacterized protei 83.5 20 0.00043 28.1 10.6 44 309-352 29-72 (79)
113 PF12329 TMF_DNA_bd: TATA elem 83.3 19 0.00042 28.5 9.9 57 297-353 11-67 (74)
114 PRK12704 phosphodiesterase; Pr 83.2 48 0.001 36.0 16.1 39 370-409 155-193 (520)
115 TIGR03319 YmdA_YtgF conserved 83.2 35 0.00075 37.0 15.0 21 401-421 164-184 (514)
116 KOG0250|consensus 83.0 44 0.00096 39.0 16.1 111 317-427 349-463 (1074)
117 KOG2264|consensus 83.0 11 0.00024 40.7 10.7 58 297-354 92-149 (907)
118 PF08317 Spc7: Spc7 kinetochor 82.8 58 0.0013 33.0 17.2 13 135-147 15-27 (325)
119 smart00503 SynN Syntaxin N-ter 82.7 19 0.00042 30.0 10.5 64 288-351 12-75 (117)
120 PF10805 DUF2730: Protein of u 82.6 21 0.00045 30.2 10.5 68 284-352 22-91 (106)
121 PRK00409 recombination and DNA 82.6 21 0.00045 40.8 13.6 64 281-344 503-566 (782)
122 TIGR03752 conj_TIGR03752 integ 82.5 9.8 0.00021 40.4 10.1 60 294-353 76-136 (472)
123 cd07667 BAR_SNX30 The Bin/Amph 82.4 28 0.0006 34.0 12.6 116 283-398 71-200 (240)
124 PRK10361 DNA recombination pro 82.3 38 0.00082 36.3 14.5 26 370-395 170-195 (475)
125 PF15079 DUF4546: Domain of un 82.3 8.7 0.00019 35.1 8.3 56 283-338 50-105 (205)
126 PF05384 DegS: Sensor protein 82.2 37 0.0008 31.0 12.6 101 284-391 10-110 (159)
127 PF02403 Seryl_tRNA_N: Seryl-t 82.0 12 0.00026 31.3 8.9 56 295-350 40-98 (108)
128 PRK14143 heat shock protein Gr 81.9 21 0.00046 34.7 11.6 53 298-352 85-137 (238)
129 PF07464 ApoLp-III: Apolipopho 81.9 15 0.00034 33.3 10.0 83 303-394 65-151 (155)
130 PHA02562 46 endonuclease subun 81.8 80 0.0017 34.0 18.3 15 62-76 37-51 (562)
131 KOG4360|consensus 81.8 52 0.0011 35.4 14.9 31 398-428 380-410 (596)
132 PRK14143 heat shock protein Gr 81.7 32 0.00069 33.5 12.7 31 386-417 125-158 (238)
133 PF15619 Lebercilin: Ciliary p 81.6 30 0.00066 32.6 12.2 24 370-393 123-146 (194)
134 PF14817 HAUS5: HAUS augmin-li 81.6 58 0.0013 36.2 16.1 77 276-353 83-162 (632)
135 PRK10698 phage shock protein P 81.5 39 0.00083 32.5 13.2 70 278-347 4-73 (222)
136 TIGR00606 rad50 rad50. This fa 81.5 28 0.00061 42.0 14.9 120 298-423 984-1113(1311)
137 PF08376 NIT: Nitrate and nitr 81.3 48 0.001 31.1 16.0 88 325-419 82-170 (247)
138 PRK03947 prefoldin subunit alp 81.1 15 0.00033 32.3 9.6 47 369-415 91-137 (140)
139 KOG0982|consensus 81.0 20 0.00044 37.5 11.5 97 315-411 307-417 (502)
140 PF05667 DUF812: Protein of un 80.9 95 0.0021 34.3 17.4 26 370-395 406-431 (594)
141 PRK09039 hypothetical protein; 80.8 52 0.0011 33.7 14.7 27 370-396 135-161 (343)
142 PF04012 PspA_IM30: PspA/IM30 80.8 41 0.0009 31.8 13.2 22 371-392 167-188 (221)
143 COG3074 Uncharacterized protei 80.6 26 0.00055 27.5 10.2 34 309-342 43-76 (79)
144 PF10211 Ax_dynein_light: Axon 80.6 46 0.00099 31.1 13.1 34 316-349 124-157 (189)
145 COG3883 Uncharacterized protei 80.4 43 0.00093 33.1 13.1 56 295-350 49-104 (265)
146 KOG0993|consensus 80.2 46 0.00099 34.8 13.6 108 303-411 347-455 (542)
147 TIGR02169 SMC_prok_A chromosom 80.1 60 0.0013 38.1 16.8 9 62-70 36-44 (1164)
148 PF10046 BLOC1_2: Biogenesis o 80.1 33 0.00072 28.5 11.3 21 327-347 74-94 (99)
149 PF04949 Transcrip_act: Transc 80.1 30 0.00064 31.2 10.8 92 285-384 50-145 (159)
150 PF09731 Mitofilin: Mitochondr 79.9 83 0.0018 34.3 16.8 92 323-414 368-480 (582)
151 KOG1853|consensus 79.9 51 0.0011 32.3 13.1 17 381-397 132-148 (333)
152 TIGR03319 YmdA_YtgF conserved 79.8 51 0.0011 35.8 14.8 43 307-349 82-124 (514)
153 KOG1937|consensus 79.6 47 0.001 35.2 13.6 25 360-384 433-460 (521)
154 PF04899 MbeD_MobD: MbeD/MobD 79.6 28 0.00061 27.4 10.1 50 304-353 20-69 (70)
155 COG2433 Uncharacterized conser 79.6 9.9 0.00021 41.5 9.1 68 317-399 427-494 (652)
156 PRK03947 prefoldin subunit alp 79.6 37 0.00081 29.8 11.6 47 381-427 89-135 (140)
157 PF03962 Mnd1: Mnd1 family; I 79.6 34 0.00073 32.0 11.8 27 326-352 103-129 (188)
158 KOG0995|consensus 79.5 44 0.00096 36.3 13.8 38 315-352 276-313 (581)
159 PRK00106 hypothetical protein; 79.4 58 0.0013 35.5 15.0 39 370-409 170-208 (535)
160 KOG0161|consensus 79.4 17 0.00037 45.1 12.0 75 320-401 1485-1562(1930)
161 PF10267 Tmemb_cc2: Predicted 79.2 53 0.0011 34.4 14.1 70 295-364 230-307 (395)
162 COG1196 Smc Chromosome segrega 79.2 75 0.0016 38.0 17.2 22 371-392 862-883 (1163)
163 TIGR00606 rad50 rad50. This fa 79.1 85 0.0019 38.0 17.8 25 372-396 984-1008(1311)
164 PF03148 Tektin: Tektin family 79.0 83 0.0018 32.7 15.7 60 296-355 48-107 (384)
165 PF11559 ADIP: Afadin- and alp 78.9 39 0.00085 30.0 11.7 8 278-285 37-44 (151)
166 KOG0993|consensus 78.9 25 0.00055 36.6 11.3 89 323-415 104-204 (542)
167 PRK09841 cryptic autophosphory 78.9 67 0.0014 36.3 16.0 127 285-426 258-389 (726)
168 PF02403 Seryl_tRNA_N: Seryl-t 78.8 29 0.00064 28.9 10.2 73 317-390 27-99 (108)
169 PF06785 UPF0242: Uncharacteri 78.8 69 0.0015 32.7 14.1 17 270-286 62-78 (401)
170 smart00502 BBC B-Box C-termina 78.6 38 0.00081 28.2 13.5 29 370-398 77-105 (127)
171 KOG4005|consensus 78.5 33 0.00071 33.3 11.2 28 300-327 92-119 (292)
172 PRK04863 mukB cell division pr 78.5 1.1E+02 0.0024 37.7 18.3 53 372-424 456-513 (1486)
173 TIGR01069 mutS2 MutS2 family p 78.4 21 0.00045 40.7 11.8 44 297-340 514-557 (771)
174 PF13041 PPR_2: PPR repeat fam 78.4 5.7 0.00012 28.1 5.0 48 367-414 2-50 (50)
175 PRK14155 heat shock protein Gr 78.4 23 0.0005 33.7 10.4 76 301-378 34-110 (208)
176 KOG4571|consensus 78.4 17 0.00038 36.2 9.7 37 315-351 251-287 (294)
177 PF12325 TMF_TATA_bd: TATA ele 78.4 45 0.00097 29.0 11.6 59 294-352 33-91 (120)
178 PF06248 Zw10: Centromere/kine 78.3 1.1E+02 0.0024 33.6 17.4 19 406-424 149-167 (593)
179 PF05478 Prominin: Prominin; 78.3 50 0.0011 37.8 14.9 31 383-413 354-384 (806)
180 PF10168 Nup88: Nuclear pore c 78.3 62 0.0013 36.6 15.3 35 317-351 584-618 (717)
181 KOG1029|consensus 78.3 15 0.00032 41.2 10.0 30 324-353 435-464 (1118)
182 PRK14162 heat shock protein Gr 78.2 38 0.00083 31.9 11.7 74 301-378 60-133 (194)
183 KOG0979|consensus 78.1 1.3E+02 0.0029 35.0 17.5 89 317-412 683-774 (1072)
184 TIGR03185 DNA_S_dndD DNA sulfu 77.9 1E+02 0.0022 34.2 17.0 9 62-70 41-49 (650)
185 PF15290 Syntaphilin: Golgi-lo 77.7 66 0.0014 32.0 13.3 18 336-353 120-137 (305)
186 PF05791 Bacillus_HBL: Bacillu 77.6 20 0.00043 33.3 9.6 53 300-352 109-161 (184)
187 PRK14154 heat shock protein Gr 77.5 33 0.00073 32.7 11.1 47 302-350 74-120 (208)
188 PF12126 DUF3583: Protein of u 77.5 57 0.0012 32.7 12.9 61 324-397 59-119 (324)
189 PF09851 SHOCT: Short C-termin 77.5 5 0.00011 26.3 4.0 29 384-412 1-29 (31)
190 PRK03918 chromosome segregatio 77.5 1.1E+02 0.0024 35.0 17.6 16 404-419 732-747 (880)
191 COG3883 Uncharacterized protei 77.3 16 0.00034 36.1 9.1 60 296-355 146-205 (265)
192 PF03961 DUF342: Protein of un 77.3 21 0.00045 37.9 10.9 38 315-352 371-408 (451)
193 PF10186 Atg14: UV radiation r 77.2 51 0.0011 32.2 13.1 17 379-395 192-208 (302)
194 cd00584 Prefoldin_alpha Prefol 77.1 6.8 0.00015 33.9 6.0 47 378-424 79-125 (129)
195 PHA03011 hypothetical protein; 77.1 24 0.00051 29.7 8.6 59 294-352 60-118 (120)
196 PRK14158 heat shock protein Gr 76.6 34 0.00074 32.2 10.9 87 287-378 47-133 (194)
197 PF11068 YlqD: YlqD protein; 76.6 47 0.001 29.3 11.1 64 284-347 20-88 (131)
198 TIGR01069 mutS2 MutS2 family p 76.5 15 0.00033 41.7 10.1 13 2-14 184-196 (771)
199 KOG4552|consensus 76.5 26 0.00057 33.3 9.8 75 353-427 41-119 (272)
200 PF06705 SF-assemblin: SF-asse 76.4 59 0.0013 31.5 13.0 21 378-398 221-241 (247)
201 PF02203 TarH: Tar ligand bind 76.3 53 0.0012 29.0 11.9 77 332-419 81-159 (171)
202 KOG0804|consensus 76.2 32 0.0007 36.3 11.3 36 315-350 364-399 (493)
203 PRK00888 ftsB cell division pr 76.2 11 0.00024 31.9 6.8 34 317-350 39-72 (105)
204 KOG4643|consensus 76.2 95 0.0021 36.2 15.7 99 315-422 418-518 (1195)
205 KOG2072|consensus 75.9 1.5E+02 0.0033 33.9 19.9 67 354-420 650-726 (988)
206 KOG0977|consensus 75.8 56 0.0012 35.6 13.5 37 316-352 96-132 (546)
207 PRK14139 heat shock protein Gr 75.8 70 0.0015 29.9 14.3 11 302-312 54-64 (185)
208 PRK02224 chromosome segregatio 75.8 1.6E+02 0.0034 33.9 18.5 16 62-79 36-51 (880)
209 KOG4460|consensus 75.7 71 0.0015 34.7 13.8 36 317-352 600-635 (741)
210 KOG0996|consensus 75.6 63 0.0014 38.1 14.3 31 395-425 558-588 (1293)
211 PF05064 Nsp1_C: Nsp1-like C-t 75.6 4.7 0.0001 34.7 4.5 48 303-354 52-99 (116)
212 PRK04778 septation ring format 75.6 94 0.002 34.0 15.6 53 360-412 378-430 (569)
213 KOG4005|consensus 75.5 26 0.00057 33.9 9.7 56 291-346 90-145 (292)
214 PF04108 APG17: Autophagy prot 75.5 1.1E+02 0.0024 32.1 16.6 26 403-429 346-371 (412)
215 COG3096 MukB Uncharacterized p 75.5 66 0.0014 36.3 13.8 107 311-424 1001-1111(1480)
216 COG5187 RPN7 26S proteasome re 75.5 11 0.00023 38.0 7.4 80 324-410 77-160 (412)
217 cd05022 S-100A13 S-100A13: S-1 75.4 13 0.00029 30.4 6.9 38 382-420 47-84 (89)
218 TIGR01554 major_cap_HK97 phage 75.4 21 0.00046 36.7 10.1 37 315-351 30-66 (378)
219 PRK14148 heat shock protein Gr 75.3 52 0.0011 31.1 11.7 91 285-379 45-135 (195)
220 PF12128 DUF3584: Protein of u 75.3 1.2E+02 0.0026 36.5 17.4 80 317-396 297-383 (1201)
221 TIGR02231 conserved hypothetic 75.0 13 0.00029 40.0 8.9 38 358-395 124-161 (525)
222 KOG0161|consensus 74.9 88 0.0019 39.2 16.2 26 61-86 534-559 (1930)
223 PF00038 Filament: Intermediat 74.7 39 0.00085 33.6 11.6 19 376-394 121-139 (312)
224 TIGR02231 conserved hypothetic 74.7 44 0.00096 36.1 12.7 43 312-354 131-173 (525)
225 PF08232 Striatin: Striatin fa 74.6 14 0.00031 32.6 7.4 45 307-351 27-71 (134)
226 KOG3647|consensus 74.6 40 0.00087 33.3 10.9 49 285-333 110-158 (338)
227 PRK12705 hypothetical protein; 74.6 1.2E+02 0.0026 32.9 15.7 23 317-339 93-115 (508)
228 PF10186 Atg14: UV radiation r 74.5 64 0.0014 31.5 13.0 7 373-379 206-212 (302)
229 PF04094 DUF390: Protein of un 74.5 19 0.00041 40.1 9.5 110 282-398 380-497 (828)
230 PF08826 DMPK_coil: DMPK coile 74.5 36 0.00079 26.0 9.9 28 318-345 31-58 (61)
231 PF10205 KLRAQ: Predicted coil 74.5 52 0.0011 27.8 10.8 41 301-345 33-73 (102)
232 PF11172 DUF2959: Protein of u 74.4 55 0.0012 31.0 11.4 121 277-398 26-163 (201)
233 PF05010 TACC: Transforming ac 74.3 63 0.0014 30.8 12.1 75 274-352 113-201 (207)
234 PF04048 Sec8_exocyst: Sec8 ex 74.2 63 0.0014 28.6 12.4 76 277-352 48-126 (142)
235 PF05278 PEARLI-4: Arabidopsis 74.2 74 0.0016 31.5 12.8 28 370-397 233-260 (269)
236 cd00890 Prefoldin Prefoldin is 74.2 11 0.00025 32.1 6.7 43 382-424 83-125 (129)
237 KOG3850|consensus 74.2 30 0.00065 35.8 10.3 32 333-364 324-355 (455)
238 PF04065 Not3: Not1 N-terminal 74.1 74 0.0016 30.9 12.7 110 288-404 75-198 (233)
239 TIGR03185 DNA_S_dndD DNA sulfu 74.1 82 0.0018 35.0 14.9 11 63-73 39-49 (650)
240 KOG0612|consensus 74.1 1.8E+02 0.0039 34.7 17.4 58 368-429 570-627 (1317)
241 PF10168 Nup88: Nuclear pore c 74.0 53 0.0011 37.1 13.4 10 45-54 304-313 (717)
242 PF11166 DUF2951: Protein of u 74.0 50 0.0011 27.4 9.8 56 297-352 24-79 (98)
243 COG1722 XseB Exonuclease VII s 73.8 28 0.00061 28.1 8.2 58 297-354 13-70 (81)
244 KOG2685|consensus 73.8 1.1E+02 0.0024 32.0 14.5 91 297-395 76-166 (421)
245 PRK00409 recombination and DNA 73.8 29 0.00062 39.6 11.4 45 285-329 521-565 (782)
246 PF07106 TBPIP: Tat binding pr 73.7 22 0.00047 32.4 8.7 26 329-354 112-137 (169)
247 PF07798 DUF1640: Protein of u 73.6 57 0.0012 30.0 11.5 17 337-353 117-133 (177)
248 PF11887 DUF3407: Protein of u 73.6 18 0.00038 35.7 8.6 90 301-394 17-106 (267)
249 KOG0977|consensus 73.6 1.4E+02 0.003 32.7 15.7 59 371-429 168-227 (546)
250 KOG2896|consensus 73.5 43 0.00093 34.5 11.2 51 295-345 121-171 (377)
251 KOG0250|consensus 73.5 1.9E+02 0.0042 34.0 17.7 27 129-155 157-183 (1074)
252 COG4372 Uncharacterized protei 73.3 1.1E+02 0.0023 32.0 14.0 25 372-396 154-178 (499)
253 PRK01156 chromosome segregatio 73.0 76 0.0017 36.5 14.8 29 371-400 722-750 (895)
254 COG1842 PspA Phage shock prote 73.0 92 0.002 30.0 16.0 96 303-415 33-128 (225)
255 PRK06800 fliH flagellar assemb 72.8 27 0.00059 32.6 8.8 16 303-318 43-58 (228)
256 KOG2264|consensus 72.7 34 0.00074 37.2 10.7 33 317-349 91-123 (907)
257 KOG1029|consensus 72.6 1.8E+02 0.0039 33.2 16.4 30 381-410 544-576 (1118)
258 PF10018 Med4: Vitamin-D-recep 72.6 27 0.00058 32.6 9.1 33 303-335 31-63 (188)
259 PF14712 Snapin_Pallidin: Snap 72.5 50 0.0011 26.7 10.8 19 332-350 70-88 (92)
260 PF12174 RST: RCD1-SRO-TAF4 (R 72.4 13 0.00028 29.2 5.8 42 385-427 27-68 (70)
261 TIGR00255 conserved hypothetic 72.2 1.1E+02 0.0024 30.6 14.1 65 357-429 215-286 (291)
262 PRK14145 heat shock protein Gr 72.2 64 0.0014 30.5 11.5 49 300-350 65-113 (196)
263 smart00212 UBCc Ubiquitin-conj 72.1 3.4 7.4E-05 36.6 2.9 38 111-149 72-111 (145)
264 PF09602 PhaP_Bmeg: Polyhydrox 72.1 81 0.0018 29.0 16.2 22 393-414 134-157 (165)
265 PF13863 DUF4200: Domain of un 72.0 27 0.00058 29.9 8.5 46 307-352 13-58 (126)
266 PF05700 BCAS2: Breast carcino 71.9 95 0.002 29.7 13.2 6 52-57 12-17 (221)
267 PF04111 APG6: Autophagy prote 71.9 52 0.0011 33.3 11.7 7 404-410 244-250 (314)
268 PF04799 Fzo_mitofusin: fzo-li 71.8 21 0.00045 33.0 7.9 46 296-341 103-149 (171)
269 PF08044 DUF1707: Domain of un 71.7 8.4 0.00018 28.5 4.4 35 380-414 4-38 (53)
270 PF02646 RmuC: RmuC family; I 71.7 77 0.0017 31.8 12.8 17 334-350 49-65 (304)
271 PRK05260 condesin subunit F; P 71.4 62 0.0013 33.9 12.0 27 297-323 165-191 (440)
272 PF15290 Syntaphilin: Golgi-lo 71.2 53 0.0012 32.6 10.9 37 317-353 119-169 (305)
273 PF04912 Dynamitin: Dynamitin 71.1 46 0.001 34.5 11.4 29 324-352 359-387 (388)
274 KOG4674|consensus 71.1 1.1E+02 0.0023 38.2 15.4 72 356-427 1302-1373(1822)
275 PF10267 Tmemb_cc2: Predicted 71.0 28 0.0006 36.4 9.6 17 306-322 274-290 (395)
276 PLN02939 transferase, transfer 70.9 1.4E+02 0.003 35.0 15.8 58 366-427 220-277 (977)
277 PRK13455 F0F1 ATP synthase sub 70.7 66 0.0014 29.7 11.3 14 382-395 170-183 (184)
278 PF02321 OEP: Outer membrane e 70.7 70 0.0015 27.6 13.2 41 355-395 147-187 (188)
279 KOG0804|consensus 70.7 71 0.0015 33.8 12.3 38 304-341 367-404 (493)
280 KOG4302|consensus 70.7 1.2E+02 0.0027 33.9 14.8 70 283-352 113-197 (660)
281 KOG1003|consensus 70.5 98 0.0021 29.3 13.2 56 359-414 96-151 (205)
282 PRK14144 heat shock protein Gr 70.4 41 0.00089 31.9 9.8 49 301-351 66-114 (199)
283 PF12072 DUF3552: Domain of un 70.4 96 0.0021 29.1 14.3 22 401-422 166-187 (201)
284 PRK13729 conjugal transfer pil 70.3 22 0.00047 37.9 8.7 35 317-351 88-122 (475)
285 KOG4722|consensus 70.3 1.5E+02 0.0032 31.3 15.3 35 363-397 329-363 (672)
286 PRK04863 mukB cell division pr 70.3 82 0.0018 38.7 14.5 66 365-431 1162-1235(1486)
287 KOG0964|consensus 70.2 1.8E+02 0.0038 34.1 15.9 63 366-428 412-488 (1200)
288 PF14817 HAUS5: HAUS augmin-li 70.1 31 0.00067 38.4 10.2 20 408-427 122-141 (632)
289 cd07663 BAR_SNX5 The Bin/Amphi 70.0 1.1E+02 0.0023 29.5 15.0 42 380-424 103-144 (218)
290 KOG0425|consensus 70.0 4.2 9.2E-05 36.8 2.9 39 14-57 58-96 (171)
291 PF00038 Filament: Intermediat 70.0 1.1E+02 0.0024 30.3 13.6 36 317-352 214-249 (312)
292 PF10211 Ax_dynein_light: Axon 70.0 91 0.002 29.1 12.0 36 309-344 124-159 (189)
293 PF10224 DUF2205: Predicted co 69.9 51 0.0011 26.6 8.8 56 297-352 7-63 (80)
294 PF10241 KxDL: Uncharacterized 69.9 59 0.0013 26.5 11.6 40 299-338 41-80 (88)
295 PF05266 DUF724: Protein of un 69.8 90 0.002 29.3 11.9 8 284-291 90-97 (190)
296 PF13166 AAA_13: AAA domain 69.8 1.1E+02 0.0025 34.0 14.9 47 382-428 420-466 (712)
297 KOG0933|consensus 69.6 2.3E+02 0.005 33.2 17.5 101 322-425 254-354 (1174)
298 KOG4552|consensus 69.6 69 0.0015 30.5 10.8 18 396-413 122-139 (272)
299 KOG0980|consensus 69.5 1.9E+02 0.0041 33.4 15.9 22 376-397 463-484 (980)
300 PF09763 Sec3_C: Exocyst compl 69.5 81 0.0018 35.4 13.7 16 382-397 120-135 (701)
301 COG4372 Uncharacterized protei 69.5 79 0.0017 33.0 12.1 61 291-351 158-218 (499)
302 KOG4196|consensus 69.5 80 0.0017 27.8 10.5 43 307-349 76-118 (135)
303 PF07798 DUF1640: Protein of u 69.3 94 0.002 28.6 12.1 9 388-396 140-148 (177)
304 PRK14147 heat shock protein Gr 69.3 63 0.0014 29.8 10.6 17 333-349 69-85 (172)
305 KOG2751|consensus 69.2 32 0.00069 36.1 9.4 40 308-347 186-225 (447)
306 KOG0972|consensus 69.1 65 0.0014 32.3 11.1 44 295-338 274-317 (384)
307 PF08581 Tup_N: Tup N-terminal 69.1 59 0.0013 26.2 10.9 17 289-305 2-18 (79)
308 PF10212 TTKRSYEDQ: Predicted 69.1 90 0.002 33.7 13.0 23 370-392 492-514 (518)
309 PRK14140 heat shock protein Gr 69.0 1E+02 0.0022 29.0 14.5 32 386-418 95-128 (191)
310 KOG0980|consensus 68.8 47 0.001 37.9 11.1 29 380-408 597-625 (980)
311 PRK14146 heat shock protein Gr 68.7 80 0.0017 30.3 11.5 75 301-379 75-149 (215)
312 PRK14139 heat shock protein Gr 68.7 82 0.0018 29.5 11.3 8 295-302 57-64 (185)
313 PF05335 DUF745: Protein of un 68.6 1E+02 0.0023 28.9 12.6 16 388-403 153-168 (188)
314 PF05008 V-SNARE: Vesicle tran 68.6 52 0.0011 25.7 8.8 49 302-350 26-78 (79)
315 PF08580 KAR9: Yeast cortical 68.5 2.1E+02 0.0045 32.3 17.8 118 305-427 235-365 (683)
316 TIGR01843 type_I_hlyD type I s 68.5 1.4E+02 0.0031 30.4 16.8 30 317-346 149-178 (423)
317 KOG0976|consensus 68.4 1.9E+02 0.0041 33.1 15.4 66 318-386 269-337 (1265)
318 PF06825 HSBP1: Heat shock fac 68.4 19 0.00041 26.9 5.6 37 317-353 12-48 (54)
319 PF11802 CENP-K: Centromere-as 68.3 1.3E+02 0.0028 29.8 13.0 32 318-349 88-119 (268)
320 PRK03918 chromosome segregatio 68.3 1.8E+02 0.0039 33.2 16.5 9 61-69 35-43 (880)
321 PF15070 GOLGA2L5: Putative go 68.1 2E+02 0.0044 32.0 16.7 30 325-354 121-150 (617)
322 PRK14151 heat shock protein Gr 68.1 75 0.0016 29.5 10.9 20 331-350 69-88 (176)
323 COG1293 Predicted RNA-binding 68.1 1.4E+02 0.0031 32.8 14.8 103 317-419 305-426 (564)
324 PRK14160 heat shock protein Gr 68.0 1.2E+02 0.0025 29.1 12.8 15 335-349 114-128 (211)
325 PRK14064 exodeoxyribonuclease 67.8 34 0.00074 27.2 7.4 57 296-352 8-64 (75)
326 PF04156 IncA: IncA protein; 67.8 77 0.0017 29.1 11.1 6 384-389 177-182 (191)
327 PF05384 DegS: Sensor protein 67.8 99 0.0021 28.3 11.6 34 317-350 96-129 (159)
328 TIGR02977 phageshock_pspA phag 67.8 1.1E+02 0.0025 29.0 16.2 121 279-412 5-125 (219)
329 COG1322 Predicted nuclease of 67.7 1.3E+02 0.0028 32.1 13.8 28 370-397 162-189 (448)
330 KOG0971|consensus 67.6 2.4E+02 0.0053 32.7 20.8 25 329-353 328-352 (1243)
331 PF10234 Cluap1: Clusterin-ass 67.6 97 0.0021 30.7 12.1 43 311-353 175-217 (267)
332 PRK02224 chromosome segregatio 67.5 1.3E+02 0.0029 34.5 15.2 35 317-351 263-297 (880)
333 KOG2911|consensus 67.5 78 0.0017 33.4 11.8 61 290-353 246-307 (439)
334 PTZ00464 SNF-7-like protein; P 67.2 74 0.0016 30.4 10.9 18 399-416 156-173 (211)
335 TIGR01843 type_I_hlyD type I s 67.2 1.5E+02 0.0033 30.2 17.8 41 310-350 135-175 (423)
336 PRK10869 recombination and rep 67.1 1.9E+02 0.004 31.7 15.5 13 61-73 34-46 (553)
337 PF02183 HALZ: Homeobox associ 67.0 25 0.00055 25.1 5.9 34 297-330 4-37 (45)
338 PF01920 Prefoldin_2: Prefoldi 67.0 48 0.001 27.1 8.7 35 318-352 61-95 (106)
339 PF12269 zf-CpG_bind_C: CpG bi 66.9 16 0.00034 35.5 6.3 39 315-353 25-63 (236)
340 PF06120 Phage_HK97_TLTM: Tail 66.9 1E+02 0.0023 31.1 12.3 27 317-343 79-105 (301)
341 PF04977 DivIC: Septum formati 66.7 24 0.00053 27.3 6.5 34 317-350 29-62 (80)
342 KOG2196|consensus 66.7 1.3E+02 0.0029 29.3 14.7 148 281-429 79-248 (254)
343 PF14932 HAUS-augmin3: HAUS au 66.7 23 0.00049 34.7 7.6 13 407-419 183-195 (256)
344 PF05377 FlaC_arch: Flagella a 66.5 27 0.00058 26.2 6.1 25 317-341 12-36 (55)
345 KOG2629|consensus 66.4 75 0.0016 31.8 10.9 53 298-350 140-192 (300)
346 KOG0964|consensus 66.4 2.4E+02 0.0053 33.0 16.1 20 396-415 934-953 (1200)
347 PF10174 Cast: RIM-binding pro 66.0 1.4E+02 0.003 34.2 14.4 43 384-426 477-519 (775)
348 PF07200 Mod_r: Modifier of ru 66.0 95 0.002 27.4 10.9 29 334-362 97-128 (150)
349 KOG4643|consensus 65.7 1.8E+02 0.0039 34.0 14.9 65 284-352 181-245 (1195)
350 KOG0895|consensus 65.7 14 0.0003 42.8 6.5 74 11-86 331-412 (1101)
351 PF02252 PA28_beta: Proteasome 65.6 81 0.0017 28.5 10.3 101 324-428 11-122 (150)
352 cd05025 S-100A1 S-100A1: S-100 65.5 11 0.00023 30.6 4.3 34 382-416 52-85 (92)
353 PHA01750 hypothetical protein 65.5 56 0.0012 25.4 7.7 24 317-340 47-70 (75)
354 PF10473 CENP-F_leu_zip: Leuci 65.4 1E+02 0.0022 27.6 11.8 33 317-349 78-110 (140)
355 KOG0426|consensus 65.4 10 0.00022 33.3 4.3 42 11-57 54-95 (165)
356 PRK00736 hypothetical protein; 65.4 63 0.0014 25.1 8.5 36 317-352 17-52 (68)
357 PF06595 BDV_P24: Borna diseas 65.2 65 0.0014 29.4 9.4 71 284-354 92-164 (201)
358 TIGR01844 type_I_sec_TolC type 65.0 1.1E+02 0.0024 31.1 12.7 42 354-395 357-398 (415)
359 PF09755 DUF2046: Uncharacteri 64.7 1.7E+02 0.0036 29.7 16.7 21 369-389 182-202 (310)
360 PF04102 SlyX: SlyX; InterPro 64.7 37 0.0008 26.4 7.0 37 317-353 16-52 (69)
361 PRK11091 aerobic respiration c 64.6 82 0.0018 35.4 12.6 61 318-378 130-191 (779)
362 PF13851 GAS: Growth-arrest sp 64.5 79 0.0017 29.9 10.5 48 303-350 32-79 (201)
363 cd05023 S-100A11 S-100A11: S-1 64.1 11 0.00024 30.7 4.2 34 381-415 51-84 (89)
364 TIGR00833 actII Transport prot 64.0 1.2E+02 0.0026 35.3 13.9 32 321-352 521-552 (910)
365 PRK09174 F0F1 ATP synthase sub 63.8 89 0.0019 29.6 10.8 9 294-302 87-95 (204)
366 PF08700 Vps51: Vps51/Vps67; 63.7 72 0.0016 25.2 10.6 57 285-345 27-84 (87)
367 KOG2991|consensus 63.6 70 0.0015 31.5 9.9 59 295-353 240-298 (330)
368 PRK00295 hypothetical protein; 63.5 68 0.0015 24.9 8.4 35 317-351 17-51 (68)
369 PF07106 TBPIP: Tat binding pr 63.4 1.1E+02 0.0025 27.6 11.2 76 321-407 74-149 (169)
370 KOG4196|consensus 63.2 67 0.0014 28.3 8.8 56 297-352 46-107 (135)
371 KOG2010|consensus 63.2 1.8E+02 0.0039 29.7 13.7 38 315-352 171-208 (405)
372 PF08286 Spc24: Spc24 subunit 63.2 3 6.4E-05 36.0 0.6 31 317-347 11-41 (118)
373 PF09798 LCD1: DNA damage chec 63.2 35 0.00077 38.0 8.9 59 285-343 2-64 (654)
374 KOG4279|consensus 63.0 21 0.00046 40.1 7.1 42 338-394 1148-1189(1226)
375 PF12777 MT: Microtubule-bindi 62.9 28 0.00061 35.5 7.8 38 317-354 254-291 (344)
376 KOG2150|consensus 62.8 82 0.0018 34.3 11.3 51 380-432 116-166 (575)
377 PRK14141 heat shock protein Gr 62.8 95 0.0021 29.7 10.7 21 330-350 79-99 (209)
378 PF08317 Spc7: Spc7 kinetochor 62.8 1.8E+02 0.0039 29.5 15.7 21 62-82 13-33 (325)
379 cd00179 SynN Syntaxin N-termin 62.7 1.1E+02 0.0023 26.8 13.1 60 291-350 13-72 (151)
380 TIGR01010 BexC_CtrB_KpsE polys 62.7 1.8E+02 0.004 29.6 19.5 114 302-426 178-297 (362)
381 TIGR03321 alt_F1F0_F0_B altern 62.6 1.5E+02 0.0033 28.6 14.2 12 402-413 150-161 (246)
382 COG2433 Uncharacterized conser 62.6 78 0.0017 34.8 11.1 36 317-352 472-507 (652)
383 PF04740 LXG: LXG domain of WX 62.4 1.3E+02 0.0027 27.9 11.6 25 381-405 179-203 (204)
384 COG4985 ABC-type phosphate tra 62.4 1.1E+02 0.0024 29.7 10.9 29 322-350 217-245 (289)
385 PRK02793 phi X174 lysis protei 62.4 75 0.0016 25.0 8.5 37 317-353 20-56 (72)
386 PRK10361 DNA recombination pro 62.3 2.3E+02 0.005 30.5 18.6 11 402-412 149-159 (475)
387 PF07407 Seadorna_VP6: Seadorn 62.2 63 0.0014 32.9 9.6 46 307-352 41-88 (420)
388 KOG0612|consensus 62.2 2.5E+02 0.0055 33.5 15.6 25 62-86 261-285 (1317)
389 cd05027 S-100B S-100B: S-100B 62.2 14 0.0003 30.1 4.4 34 381-415 50-83 (88)
390 PF10779 XhlA: Haemolysin XhlA 62.0 56 0.0012 25.4 7.6 29 318-346 19-47 (71)
391 TIGR00293 prefoldin, archaeal 62.0 69 0.0015 27.4 9.0 13 382-394 96-108 (126)
392 KOG0396|consensus 61.9 2E+02 0.0042 29.9 13.2 39 315-353 61-99 (389)
393 PRK05771 V-type ATP synthase s 61.8 1.4E+02 0.003 33.3 13.5 147 283-430 49-238 (646)
394 PF09755 DUF2046: Uncharacteri 61.8 1.9E+02 0.0041 29.4 14.5 69 325-407 105-173 (310)
395 TIGR03495 phage_LysB phage lys 61.8 1.2E+02 0.0025 27.0 10.8 67 287-353 19-88 (135)
396 PF01540 Lipoprotein_7: Adhesi 61.7 1.7E+02 0.0038 29.0 13.6 78 285-362 116-200 (353)
397 KOG0243|consensus 61.7 2.4E+02 0.0052 33.2 15.3 23 268-290 396-421 (1041)
398 PF06637 PV-1: PV-1 protein (P 61.6 1.3E+02 0.0029 31.3 11.9 30 284-313 285-314 (442)
399 TIGR00414 serS seryl-tRNA synt 61.6 43 0.00093 35.3 9.0 95 296-391 4-102 (418)
400 TIGR02894 DNA_bind_RsfA transc 61.5 52 0.0011 30.1 8.2 37 316-352 115-151 (161)
401 PF04740 LXG: LXG domain of WX 61.4 1.4E+02 0.003 27.7 12.0 11 383-393 153-163 (204)
402 PF05377 FlaC_arch: Flagella a 61.2 62 0.0013 24.3 7.2 33 317-349 5-37 (55)
403 cd00584 Prefoldin_alpha Prefol 61.2 1.1E+02 0.0023 26.3 10.8 28 318-345 93-120 (129)
404 PF15324 TALPID3: Hedgehog sig 61.0 15 0.00033 42.4 5.7 41 386-426 406-460 (1252)
405 PRK14153 heat shock protein Gr 61.0 1E+02 0.0023 29.0 10.5 74 301-378 54-127 (194)
406 PF04380 BMFP: Membrane fusoge 60.8 85 0.0018 25.1 9.7 27 327-353 51-77 (79)
407 PRK13729 conjugal transfer pil 60.7 39 0.00085 36.1 8.4 24 312-335 97-120 (475)
408 PF04912 Dynamitin: Dynamitin 60.6 52 0.0011 34.2 9.4 10 342-351 370-379 (388)
409 PRK02119 hypothetical protein; 60.6 82 0.0018 24.8 9.1 37 317-353 21-57 (73)
410 TIGR00634 recN DNA repair prot 60.6 1.1E+02 0.0023 33.5 12.2 126 273-415 141-277 (563)
411 KOG0810|consensus 60.4 1E+02 0.0022 31.0 11.0 61 293-353 49-109 (297)
412 PF05278 PEARLI-4: Arabidopsis 60.4 1.8E+02 0.004 28.8 12.6 24 329-352 231-254 (269)
413 PF00804 Syntaxin: Syntaxin; 60.3 87 0.0019 25.0 9.0 58 293-350 16-73 (103)
414 PF07877 DUF1661: Protein of u 60.2 6.2 0.00014 26.0 1.5 18 412-429 1-18 (31)
415 KOG4286|consensus 60.1 70 0.0015 36.1 10.3 28 388-415 310-337 (966)
416 KOG3091|consensus 60.0 1.3E+02 0.0029 32.3 12.0 28 368-395 431-458 (508)
417 PF10498 IFT57: Intra-flagella 60.0 95 0.0021 32.1 11.0 9 147-155 119-127 (359)
418 PF07851 TMPIT: TMPIT-like pro 59.9 1.8E+02 0.0038 29.8 12.6 55 299-353 5-59 (330)
419 KOG0996|consensus 59.8 1.7E+02 0.0038 34.7 13.7 51 303-353 375-425 (1293)
420 PRK14474 F0F1 ATP synthase sub 59.6 1.6E+02 0.0034 28.8 12.0 14 402-415 150-163 (250)
421 KOG4674|consensus 59.6 4.5E+02 0.0098 33.1 18.1 54 371-424 383-439 (1822)
422 PF04012 PspA_IM30: PspA/IM30 59.5 1.6E+02 0.0034 27.8 15.2 33 317-349 42-74 (221)
423 PRK14163 heat shock protein Gr 59.4 1.4E+02 0.0031 28.6 11.2 10 303-312 63-72 (214)
424 PF03915 AIP3: Actin interacti 59.4 81 0.0017 33.4 10.4 72 319-396 199-270 (424)
425 PRK04654 sec-independent trans 59.3 1.4E+02 0.0031 28.5 11.0 20 324-343 70-89 (214)
426 PF05911 DUF869: Plant protein 59.3 92 0.002 35.5 11.5 80 318-397 119-208 (769)
427 KOG0288|consensus 59.2 93 0.002 32.7 10.5 21 403-423 155-175 (459)
428 PLN02678 seryl-tRNA synthetase 59.0 63 0.0014 34.5 9.7 65 288-352 37-104 (448)
429 PF06248 Zw10: Centromere/kine 59.0 2.8E+02 0.006 30.5 15.4 17 382-398 106-122 (593)
430 PRK14068 exodeoxyribonuclease 59.0 58 0.0013 26.0 7.2 57 296-352 8-64 (76)
431 PRK11546 zraP zinc resistance 58.9 98 0.0021 27.8 9.4 19 315-333 89-107 (143)
432 PHA03161 hypothetical protein; 58.9 54 0.0012 29.6 7.7 18 272-289 8-25 (150)
433 PF08900 DUF1845: Domain of un 58.8 83 0.0018 30.1 9.7 51 364-417 111-164 (217)
434 KOG4673|consensus 58.7 3.1E+02 0.0067 30.9 14.8 140 275-414 543-686 (961)
435 PF05667 DUF812: Protein of un 58.7 2.9E+02 0.0063 30.6 15.0 29 320-348 445-473 (594)
436 KOG3976|consensus 58.6 1.4E+02 0.0031 29.0 11.1 61 290-351 139-202 (247)
437 PRK04406 hypothetical protein; 58.6 91 0.002 24.7 9.3 37 317-353 23-59 (75)
438 PF09789 DUF2353: Uncharacteri 58.5 2.2E+02 0.0047 29.1 13.9 31 399-429 188-225 (319)
439 COG1340 Uncharacterized archae 58.5 2.1E+02 0.0045 28.8 14.6 120 282-406 167-286 (294)
440 PF10174 Cast: RIM-binding pro 58.4 3.3E+02 0.0072 31.2 16.1 91 303-395 116-206 (775)
441 PRK05431 seryl-tRNA synthetase 58.2 79 0.0017 33.4 10.3 84 269-352 10-99 (425)
442 PF04568 IATP: Mitochondrial A 58.2 83 0.0018 26.5 8.4 23 327-349 77-99 (100)
443 PF13851 GAS: Growth-arrest sp 58.1 1.7E+02 0.0036 27.6 13.9 32 317-348 98-129 (201)
444 KOG1962|consensus 58.0 1.1E+02 0.0024 29.4 10.2 93 277-383 108-203 (216)
445 PRK13428 F0F1 ATP synthase sub 57.9 1.5E+02 0.0033 31.4 12.4 36 381-417 144-180 (445)
446 PF07246 Phlebovirus_NSM: Phle 57.8 61 0.0013 32.0 8.6 38 317-354 200-237 (264)
447 PF07956 DUF1690: Protein of U 57.8 79 0.0017 28.3 8.7 17 280-296 31-47 (142)
448 cd07605 I-BAR_IMD Inverse (I)- 57.8 1.5E+02 0.0033 28.5 11.3 25 391-415 176-200 (223)
449 TIGR02132 phaR_Bmeg polyhydrox 57.8 1.5E+02 0.0032 27.7 10.4 7 370-376 162-168 (189)
450 KOG1760|consensus 57.8 86 0.0019 27.4 8.5 34 319-352 81-114 (131)
451 PRK09465 tolC outer membrane c 57.8 1.4E+02 0.003 30.8 12.1 43 354-396 380-422 (446)
452 KOG4117|consensus 57.6 62 0.0013 25.0 6.7 26 327-352 35-60 (73)
453 PF04880 NUDE_C: NUDE protein, 57.6 19 0.00041 33.1 4.8 52 302-355 1-52 (166)
454 PF06156 DUF972: Protein of un 57.6 1.2E+02 0.0026 25.8 10.4 31 317-347 27-57 (107)
455 KOG3080|consensus 57.3 44 0.00096 33.5 7.5 54 356-409 120-176 (328)
456 KOG4677|consensus 57.3 2.6E+02 0.0057 29.8 13.4 128 287-434 192-322 (554)
457 PRK14149 heat shock protein Gr 57.1 1.6E+02 0.0035 27.7 11.0 46 332-379 86-131 (191)
458 PF06637 PV-1: PV-1 protein (P 57.1 1.7E+02 0.0037 30.4 11.9 12 381-392 383-394 (442)
459 KOG0239|consensus 57.0 87 0.0019 35.2 10.8 94 276-377 210-312 (670)
460 TIGR00293 prefoldin, archaeal 56.9 80 0.0017 27.0 8.5 34 381-414 81-114 (126)
461 TIGR01845 outer_NodT efflux tr 56.9 1.7E+02 0.0037 30.2 12.6 45 354-398 410-454 (454)
462 COG3096 MukB Uncharacterized p 56.9 2.9E+02 0.0064 31.5 14.2 120 279-405 1188-1311(1480)
463 PRK13169 DNA replication intia 56.8 87 0.0019 26.8 8.4 28 317-344 27-54 (110)
464 PF05700 BCAS2: Breast carcino 56.8 1.6E+02 0.0035 28.1 11.4 19 122-140 11-32 (221)
465 PF08614 ATG16: Autophagy prot 56.8 90 0.0019 29.1 9.4 18 332-349 150-167 (194)
466 PF11855 DUF3375: Protein of u 56.6 76 0.0017 34.0 10.0 81 332-412 143-233 (478)
467 KOG3231|consensus 56.5 45 0.00097 30.6 6.9 22 397-418 112-133 (208)
468 PF15397 DUF4618: Domain of un 56.3 2.1E+02 0.0046 28.2 16.7 40 303-342 65-104 (258)
469 KOG4286|consensus 56.3 1E+02 0.0023 34.7 10.8 21 395-415 274-294 (966)
470 PRK14154 heat shock protein Gr 56.2 1.9E+02 0.0041 27.6 13.6 24 387-410 111-137 (208)
471 PF05615 THOC7: Tho complex su 56.1 1.4E+02 0.003 26.1 11.3 8 332-339 94-101 (139)
472 cd07621 BAR_SNX5_6 The Bin/Amp 56.0 2E+02 0.0042 27.7 13.3 41 380-423 104-144 (219)
473 PF02183 HALZ: Homeobox associ 56.0 48 0.001 23.7 5.7 34 307-340 7-40 (45)
474 KOG4807|consensus 55.9 68 0.0015 33.4 8.8 23 62-84 44-66 (593)
475 PF05064 Nsp1_C: Nsp1-like C-t 55.9 12 0.00025 32.3 3.0 55 295-349 54-108 (116)
476 KOG0416|consensus 55.8 9.3 0.0002 35.1 2.4 61 4-73 44-107 (189)
477 cd05030 calgranulins Calgranul 55.7 57 0.0012 26.2 7.0 19 396-414 64-82 (88)
478 TIGR01844 type_I_sec_TolC type 55.7 2.1E+02 0.0046 28.9 12.9 45 354-398 151-195 (415)
479 PF05130 FlgN: FlgN protein; 55.7 1.3E+02 0.0027 25.5 10.8 30 323-352 38-67 (143)
480 cd00213 S-100 S-100: S-100 dom 55.6 75 0.0016 25.1 7.6 18 397-414 65-82 (88)
481 PRK14163 heat shock protein Gr 55.5 2E+02 0.0043 27.6 13.5 6 391-396 103-108 (214)
482 PF08542 Rep_fac_C: Replicatio 55.4 68 0.0015 25.4 7.4 65 343-417 6-70 (89)
483 PF12889 DUF3829: Protein of u 55.4 1.6E+02 0.0035 28.4 11.5 117 287-415 110-237 (276)
484 PF12128 DUF3584: Protein of u 55.4 3.4E+02 0.0074 32.7 16.1 129 282-411 811-946 (1201)
485 PRK14070 exodeoxyribonuclease 55.3 48 0.001 26.0 6.1 62 300-361 1-62 (69)
486 PF02563 Poly_export: Polysacc 55.3 12 0.00025 29.9 2.8 35 44-78 35-69 (82)
487 PF05929 Phage_GPO: Phage caps 55.1 91 0.002 31.1 9.5 72 279-350 173-252 (276)
488 KOG0946|consensus 55.1 2.2E+02 0.0048 32.6 13.1 121 282-411 655-776 (970)
489 PF05565 Sipho_Gp157: Siphovir 55.0 78 0.0017 28.8 8.5 58 297-354 25-82 (162)
490 KOG1666|consensus 54.9 1.3E+02 0.0028 28.8 9.9 70 269-338 133-207 (220)
491 KOG4677|consensus 54.9 2E+02 0.0044 30.6 12.2 131 282-412 325-457 (554)
492 PF03908 Sec20: Sec20; InterP 54.8 1.2E+02 0.0025 24.8 9.5 59 282-340 3-61 (92)
493 PF15294 Leu_zip: Leucine zipp 54.8 1.8E+02 0.0038 29.1 11.3 96 279-388 127-224 (278)
494 PF05974 DUF892: Domain of unk 54.7 95 0.0021 28.0 9.0 96 292-388 2-97 (159)
495 PF14197 Cep57_CLD_2: Centroso 54.6 1E+02 0.0022 24.1 10.2 65 283-347 4-68 (69)
496 COG3937 Uncharacterized conser 54.6 1.4E+02 0.003 25.5 10.2 73 277-349 18-106 (108)
497 PF13815 Dzip-like_N: Iguana/D 54.2 1.4E+02 0.003 25.5 9.8 61 289-349 57-117 (118)
498 PF09304 Cortex-I_coil: Cortex 54.1 1.1E+02 0.0024 26.1 8.4 63 290-352 1-63 (107)
499 PF08702 Fib_alpha: Fibrinogen 53.8 1.7E+02 0.0036 26.3 11.5 73 282-354 38-111 (146)
500 PRK05771 V-type ATP synthase s 53.8 77 0.0017 35.2 9.8 76 286-361 45-135 (646)
No 1
>KOG2391|consensus
Probab=100.00 E-value=4.9e-72 Score=541.62 Aligned_cols=304 Identities=43% Similarity=0.706 Sum_probs=246.8
Q ss_pred eeecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccccccCCCCCHHHHHHHHHHHhCCCCC
Q psy18020 2 TNLSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSALPP 81 (434)
Q Consensus 2 v~~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~W~~~~s~L~~li~~l~~~F~~~pP 81 (434)
|.|+|++|||||+|||+++||+.||+|||+||++|.||.+.|||+||+|||||||+|.+++|||++||++|+..|+++||
T Consensus 61 ~~~~G~tYnIPV~iWlldtyP~~pP~c~VnPT~~M~ik~~~hVd~nG~V~LPYLh~W~~pssdLv~Liq~l~a~f~~~pP 140 (365)
T KOG2391|consen 61 VPYQGVTYNIPVIIWLLDTYPYYPPICYVNPTSTMIIKVHEHVDPNGKVYLPYLHNWDPPSSDLVGLIQELIAAFSEDPP 140 (365)
T ss_pred ccccCCcccceEEEEecccCCCCCCeEEecCCchhhhHHhhccCCCCeEechhhccCCCccchHHHHHHHHHHHhcCCCc
Confidence 56999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCCCCCCCCCCCCCCCCCCCCccccccCcccCCCccccCCCCccccccchhHHHHHHHhhhhcCCCCCCCCCCCCC
Q psy18020 82 LYSSRTKQEPATNPTYNGNIRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQE 161 (434)
Q Consensus 82 l~~~~~~~~p~~~p~~~~~~~pt~~m~~~~~~~v~~~g~~~~pyl~~w~~~~s~l~~l~~~~~~~f~~~pp~~s~~~~~~ 161 (434)
+|+++..+.|.+.|
T Consensus 141 ~ys~~~~~~p~p~p------------------------------------------------------------------ 154 (365)
T KOG2391|consen 141 VYSRSLPSPPPPYP------------------------------------------------------------------ 154 (365)
T ss_pred cccCCCCCCCCCCC------------------------------------------------------------------
Confidence 99987332211111
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy18020 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYP 241 (434)
Q Consensus 162 ~~~~~~~~~~~~s~~p~~~~~~~~~p~~~~~~p~p~~p~~~~~~~~p~~~~~~~~~~~~pyp~~~~~~~~~~~~p~p~~p 241 (434)
++.++.++.. ..+...++|+|++++++|.+ +.||+-.+
T Consensus 155 ------------------~~~~~~~p~~-----------------------p~~~~~~~p~p~p~~~~gas-~~~~~~~d 192 (365)
T KOG2391|consen 155 ------------------QTEYNTPPLK-----------------------PKGSAYKPPLPPPPPPGGAS-ALPYMTDD 192 (365)
T ss_pred ------------------cccCCCCCCC-----------------------CCCcCcCCCCCCCCCCCccc-cCcccCCC
Confidence 0000000000 00001122233221111110 11111000
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Q psy18020 242 PGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMF 321 (434)
Q Consensus 242 ~~~~~~~~~~p~~~~~~~p~p~~~~~~~~e~~r~sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~ 321 (434)
. ..+|| .+..+...+++++.+++.++|+++..|.+.+.+++++.|+++++||+.|+.+|+.++
T Consensus 193 ~-----~~~yp------------~n~~~~~~irasvisa~~eklR~r~eeeme~~~aeq~slkRt~EeL~~G~~kL~~~~ 255 (365)
T KOG2391|consen 193 N-----AEPYP------------PNASGKLVIRASVISAVREKLRRRREEEMERLQAEQESLKRTEEELNIGKQKLVAMK 255 (365)
T ss_pred C-----CCcCC------------CCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhHHHHHHHH
Confidence 0 00111 011122338999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCC
Q psy18020 322 ERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVID 401 (434)
Q Consensus 322 ~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ 401 (434)
++||.+...|+.+++.|+++.+| +++++++.+++|||++|++++++++|+++++|+|.||||+||+|+++|++|+|+
T Consensus 256 etLEqq~~~L~~niDIL~~k~~e---al~~~~n~~~~~~D~~~~~~~~l~kq~l~~~A~d~aieD~i~~L~~~~r~G~i~ 332 (365)
T KOG2391|consen 256 ETLEQQLQSLQKNIDILKSKVRE---ALEKAENLEALDIDEAIECTAPLYKQILECYALDLAIEDAIYSLGKSLRDGVID 332 (365)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHH---HHhhhccCcCCCchhhhhccchHHHHHHHhhhhhhHHHHHHHHHHHHHhcCeee
Confidence 99999999999999999999888 777888889999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Q psy18020 402 LEVYLKQVRSLSRKQFMFRALLFKCYKHISYA 433 (434)
Q Consensus 402 ld~flK~vR~LaReQF~~Ra~~~Ki~~~~~l~ 433 (434)
||.|||+||.||||||++||+++|||+.+||+
T Consensus 333 l~~yLr~VR~lsReQF~~rat~qk~r~~~~l~ 364 (365)
T KOG2391|consen 333 LDQYLRHVRLLSREQFILRATMQKCRQTAGLA 364 (365)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC
Confidence 99999999999999999999999999999986
No 2
>KOG2391|consensus
Probab=99.96 E-value=1.9e-28 Score=238.45 Aligned_cols=202 Identities=28% Similarity=0.438 Sum_probs=143.2
Q ss_pred CCCCCCCCCCCCCCCCCCCccccccCcccCCCccccCCCCccccccchhHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC
Q psy18020 88 KQEPATNPTYNGNIRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPT 167 (434)
Q Consensus 88 ~~~p~~~p~~~~~~~pt~~m~~~~~~~v~~~g~~~~pyl~~w~~~~s~l~~l~~~~~~~f~~~pp~~s~~~~~~~~~~~~ 167 (434)
+..|+.+|+ |||+||++|.||+++|||+||+|||||||+|.+++|||+||||+|++.|+++||+||++.++
T Consensus 78 dtyP~~pP~--c~VnPT~~M~ik~~~hVd~nG~V~LPYLh~W~~pssdLv~Liq~l~a~f~~~pP~ys~~~~~------- 148 (365)
T KOG2391|consen 78 DTYPYYPPI--CYVNPTSTMIIKVHEHVDPNGKVYLPYLHNWDPPSSDLVGLIQELIAAFSEDPPVYSRSLPS------- 148 (365)
T ss_pred ccCCCCCCe--EEecCCchhhhHHhhccCCCCeEechhhccCCCccchHHHHHHHHHHHhcCCCccccCCCCC-------
Confidence 456666777 99999999999999999999999999999999999999999999999999999999954310
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy18020 168 YNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP 247 (434)
Q Consensus 168 ~~~~~~s~~p~~~~~~~~~p~~~~~~p~p~~p~~~~~~~~p~~~~~~~~~~~~pyp~~~~~~~~~~~~p~p~~p~~~~~~ 247 (434)
+++++++..+ .+|.+ ...++..++|.|+.+++++.
T Consensus 149 --------------------------~p~p~p~~~~---------------~~~p~---~p~~~~~~~p~p~p~~~~ga- 183 (365)
T KOG2391|consen 149 --------------------------PPPPYPQTEY---------------NTPPL---KPKGSAYKPPLPPPPPPGGA- 183 (365)
T ss_pred --------------------------CCCCCCcccC---------------CCCCC---CCCCcCcCCCCCCCCCCCcc-
Confidence 1112222110 01111 11122223333322121111
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Q psy18020 248 ATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327 (434)
Q Consensus 248 ~~~~p~~~~~~~p~p~~~~~~~~e~~r~sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~ 327 (434)
.+++.+.. +..+.......-+.-+++.|.+.++.++++ +.+++++++.+.|++|+++++.|+...++|+++
T Consensus 184 --s~~~~~~~----d~~~~yp~n~~~~~~irasvisa~~eklR~---r~eeeme~~~aeq~slkRt~EeL~~G~~kL~~~ 254 (365)
T KOG2391|consen 184 --SALPYMTD----DNAEPYPPNASGKLVIRASVISAVREKLRR---RREEEMERLQAEQESLKRTEEELNIGKQKLVAM 254 (365)
T ss_pred --ccCcccCC----CCCCcCCCCcccccchhHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhhHHHHHhhHHHHHHH
Confidence 11111110 111122222333445677888888888754 456899999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 328 KAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 328 k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
++.||+....|++.++-|.+|++++
T Consensus 255 ~etLEqq~~~L~~niDIL~~k~~ea 279 (365)
T KOG2391|consen 255 KETLEQQLQSLQKNIDILKSKVREA 279 (365)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHH
Confidence 9999999999999999999999974
No 3
>PF09454 Vps23_core: Vps23 core domain; InterPro: IPR017916 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ]. This entry represents the Steadiness box domain.; PDB: 2CAZ_A 2F66_D 2F6M_A 2P22_A.
Probab=99.96 E-value=7e-29 Score=191.24 Aligned_cols=65 Identities=52% Similarity=0.824 Sum_probs=62.3
Q ss_pred cccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 361 DEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFK 425 (434)
Q Consensus 361 De~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~K 425 (434)
|++|+++++++||||+++|||+|||||||+|++||++|+|++|+|||+||.|||||||+||||+|
T Consensus 1 D~~v~~~~~l~~Ql~el~Aed~AieDtiy~L~~al~~g~I~~d~~lK~vR~LaReQF~~Ral~~K 65 (65)
T PF09454_consen 1 DEIVVAEDPLSNQLYELVAEDHAIEDTIYYLDRALQRGSIDLDTFLKQVRSLAREQFLKRALIQK 65 (65)
T ss_dssp GGTEE-SSHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 79999999999999999999999999999999999999999999999999999999999999998
No 4
>PF05743 UEV: UEV domain; InterPro: IPR008883 The N-terminal ubiquitin E2 variant (UEV) domain is ~145 amino acid residues in length and shows significant sequence similarity to E2 ubiquitin ligases but is unable to catalyze ubiquitin transfer as it lacks the active site cysteine that forms the transient thioester bond with the C terminus of ubiquitin (Ub). Nevertheless, at least some UEVs have retained the ability to bind Ub, and appear to act either as cofactors in ubiquitylation reactions, or as ubiquitin sensors. UEV domains also frequently contain other protein recognition motifs, and may generally serve to couple protein and Ub binding functions to facilitate the formation of multiprotein complexes [, , , ]. The UEV domain consists of a twisted four-stranded antiparallel beta-sheet having a meander topology, with four alpha-helices packed against one face of the sheet. The UEV fold is generally similar to canonical E2 ligases in the hydrophobic core and 'active site' regions, but differs significantly at both its N- and C-termini [, ]. The UEV domain is found in the eukaryotic tumour susceptibility gene 101 protein (TSG101). Altered transcripts of this gene have been detected in sporadic breast cancers and many other Homo sapiens malignancies. However, the involvement of this gene in neoplastic transformation and tumourigenesis is still elusive. TSG101 is required for normal cell function of embryonic and adult tissues but this gene is not a tumour suppressor for sporadic forms of breast cancer [].; GO: 0006464 protein modification process, 0015031 protein transport; PDB: 3R3Q_A 3R42_A 1UZX_A 3OBX_A 3OBS_A 3P9H_A 2F0R_A 3P9G_A 3OBQ_A 3OBU_A ....
Probab=99.95 E-value=5.7e-29 Score=215.77 Aligned_cols=81 Identities=46% Similarity=0.940 Sum_probs=71.9
Q ss_pred eeecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccccccCCCCCHHHHHHHHHHHhCCCCC
Q psy18020 2 TNLSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSALPP 81 (434)
Q Consensus 2 v~~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~W~~~~s~L~~li~~l~~~F~~~pP 81 (434)
|.|+|++|||||+|||+++||+.||+|||+||.+|.|++|+|||++|+||+|||++|+.++|||++||++|+++|+++||
T Consensus 41 i~y~g~~y~iPi~Iwlp~~yP~~pP~v~v~pt~~m~I~~~~~Vd~~G~v~~pyL~~W~~~~s~L~~lv~~l~~~F~~~pP 120 (121)
T PF05743_consen 41 ITYKGSTYNIPICIWLPENYPYSPPIVYVRPTPSMVIKPSHHVDSNGRVYLPYLQNWNPPSSNLVDLVQELQAVFSEEPP 120 (121)
T ss_dssp ECCTTCCEEEEEEEEE-TTTTTSSSEEEE-GCCTECCGGCCCB-TTSBB-SHHHHT--TTTS-HHHHHHHHHHCCCHS-S
T ss_pred cccCCcccceeEEEEEcccCCCCCCEEEEeCCCCCCcCCCCeECCCCCEeCchhccCCCCCCCHHHHHHHHHHHHhHcCC
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred c
Q psy18020 82 L 82 (434)
Q Consensus 82 l 82 (434)
|
T Consensus 121 l 121 (121)
T PF05743_consen 121 L 121 (121)
T ss_dssp E
T ss_pred C
Confidence 7
No 5
>PF05743 UEV: UEV domain; InterPro: IPR008883 The N-terminal ubiquitin E2 variant (UEV) domain is ~145 amino acid residues in length and shows significant sequence similarity to E2 ubiquitin ligases but is unable to catalyze ubiquitin transfer as it lacks the active site cysteine that forms the transient thioester bond with the C terminus of ubiquitin (Ub). Nevertheless, at least some UEVs have retained the ability to bind Ub, and appear to act either as cofactors in ubiquitylation reactions, or as ubiquitin sensors. UEV domains also frequently contain other protein recognition motifs, and may generally serve to couple protein and Ub binding functions to facilitate the formation of multiprotein complexes [, , , ]. The UEV domain consists of a twisted four-stranded antiparallel beta-sheet having a meander topology, with four alpha-helices packed against one face of the sheet. The UEV fold is generally similar to canonical E2 ligases in the hydrophobic core and 'active site' regions, but differs significantly at both its N- and C-termini [, ]. The UEV domain is found in the eukaryotic tumour susceptibility gene 101 protein (TSG101). Altered transcripts of this gene have been detected in sporadic breast cancers and many other Homo sapiens malignancies. However, the involvement of this gene in neoplastic transformation and tumourigenesis is still elusive. TSG101 is required for normal cell function of embryonic and adult tissues but this gene is not a tumour suppressor for sporadic forms of breast cancer [].; GO: 0006464 protein modification process, 0015031 protein transport; PDB: 3R3Q_A 3R42_A 1UZX_A 3OBX_A 3OBS_A 3P9H_A 2F0R_A 3P9G_A 3OBQ_A 3OBU_A ....
Probab=99.43 E-value=2.1e-14 Score=124.76 Aligned_cols=55 Identities=42% Similarity=0.797 Sum_probs=47.1
Q ss_pred CCCCCCCCccccccCcccCCCccccCCCCccccccchhHHHHHHHhhhhcCCCCC
Q psy18020 99 GNIRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPL 153 (434)
Q Consensus 99 ~~~~pt~~m~~~~~~~v~~~g~~~~pyl~~w~~~~s~l~~l~~~~~~~f~~~pp~ 153 (434)
|||+||++|.|+.+.+||++|+||+|||++|..++|+|.+|+++|+..|+++||+
T Consensus 67 v~v~pt~~m~I~~~~~Vd~~G~v~~pyL~~W~~~~s~L~~lv~~l~~~F~~~pPl 121 (121)
T PF05743_consen 67 VYVRPTPSMVIKPSHHVDSNGRVYLPYLQNWNPPSSNLVDLVQELQAVFSEEPPL 121 (121)
T ss_dssp EEE-GCCTECCGGCCCB-TTSBB-SHHHHT--TTTS-HHHHHHHHHHCCCHS-SE
T ss_pred EEEeCCCCCCcCCCCeECCCCCEeCchhccCCCCCCCHHHHHHHHHHHHhHcCCC
Confidence 8999999999999999999999999999999999999999999999999999996
No 6
>smart00212 UBCc Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
Probab=98.45 E-value=3.7e-07 Score=81.53 Aligned_cols=67 Identities=24% Similarity=0.386 Sum_probs=58.2
Q ss_pred ecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccc--cccCCCCCHHHHHHHHHHHhCC
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH--TWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~--~W~~~~s~L~~li~~l~~~F~~ 78 (434)
|.|.+| .+.||++++||++||.|++... .-+++||.+|.|++++|. +|++ ..+|.+++..+...|.+
T Consensus 43 y~g~~f--~~~l~~p~~yP~~pP~v~f~~~-----i~Hp~i~~~G~icl~~l~~~~W~p-~~~l~~il~~i~~~l~~ 111 (145)
T smart00212 43 YEGGIF--KLTIEFPPDYPFKPPKVKFITK-----IYHPNVDSSGEICLDILKQEKWSP-ATTLETVLLSIQSLLSE 111 (145)
T ss_pred cCCcEE--EEEEECCcccCCCCCEEEEeCC-----ceEeeECCCCCEehhhcCCCCCCC-CCcHHHHHHHHHHHHhC
Confidence 556554 8999999999999999998653 336889999999999998 8996 69999999999999976
No 7
>cd00195 UBCc Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Probab=98.06 E-value=8e-06 Score=72.60 Aligned_cols=66 Identities=26% Similarity=0.359 Sum_probs=54.7
Q ss_pred ecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccccc--ccCCCCCHHHHHHHHHHHhC
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHT--WTPQNSKLEGLIKTLASAFS 77 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~--W~~~~s~L~~li~~l~~~F~ 77 (434)
|.|.. +.+.|+++++||++||.|.+.. ..-+++||.+|.|++.+|.. |.+ ..+|.+++..+...|.
T Consensus 43 y~g~~--~~~~~~~p~~yP~~pP~v~f~~-----~i~HpnV~~~G~icl~~l~~~~W~p-~~~l~~il~~i~~~l~ 110 (141)
T cd00195 43 YEGGI--FKLDIEFPEDYPFKPPKVRFVT-----KIYHPNVDENGKICLSILKTHGWSP-AYTLRTVLLSLQSLLN 110 (141)
T ss_pred ccCCE--EEEEEECCCccCCCCCeEEEeC-----CcccCCCCCCCCCchhhcCCCCcCC-cCcHHHHHHHHHHHHh
Confidence 44444 4569999999999999998862 12468999999999999987 766 7789999999999997
No 8
>PTZ00390 ubiquitin-conjugating enzyme; Provisional
Probab=97.50 E-value=0.00026 Score=64.01 Aligned_cols=63 Identities=22% Similarity=0.412 Sum_probs=51.2
Q ss_pred ccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccc-ccccCCCCCHHHHHHHHHHHhCC
Q psy18020 10 HIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYL-HTWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 10 ~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL-~~W~~~~s~L~~li~~l~~~F~~ 78 (434)
...+.|.++++||+.||.|... |. | =+++||.+|.|++..| ++|.+ ..++.+++..+...|..
T Consensus 50 ~f~~~i~~p~~YP~~pP~v~F~-t~---i-~HPNV~~~G~iCl~iL~~~W~p-~~ti~~iL~~i~~ll~~ 113 (152)
T PTZ00390 50 YYKLELFLPEQYPMEPPKVRFL-TK---I-YHPNIDKLGRICLDILKDKWSP-ALQIRTVLLSIQALLSA 113 (152)
T ss_pred EEEEEEECccccCCCCCEEEEe-cC---C-eeceECCCCeEECccCcccCCC-CCcHHHHHHHHHHHHhC
Confidence 4568899999999999998663 32 1 2578999999999999 67987 78999999888887753
No 9
>PF00179 UQ_con: Ubiquitin-conjugating enzyme; InterPro: IPR000608 The post-translational attachment of ubiquitin (IPR000626 from INTERPRO) to proteins (ubiquitinylation) alters the function, location or trafficking of a protein, or targets it to the 26S proteasome for degradation [, , ]. Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade []. The E1 enzyme mediates an ATP-dependent transfer of a thioester-linked ubiquitin molecule to a cysteine residue on the E2 enzyme. The E2 enzyme (6.3.2.19 from EC) then either transfers the ubiquitin moiety directly to a substrate, or to an E3 ligase, which can also ubiquitinylate a substrate. There are several different E2 enzymes (over 30 in humans), which are broadly grouped into four classes, all of which have a core catalytic domain (containing the active site cysteine), and some of which have short N- and C-terminal amino acid extensions: class I enzymes consist of just the catalytic core domain (UBC), class II possess a UBC and a C-terminal extension, class III possess a UBC and an N-terminal extension, and class IV possess a UBC and both N- and C-terminal extensions. These extensions appear to be important for some subfamily function, including E2 localisation and protein-protein interactions []. In addition, there are proteins with an E2-like fold that are devoid of catalytic activity, but which appear to assist in poly-ubiquitin chain formation.; GO: 0016881 acid-amino acid ligase activity; PDB: 2AAK_A 3SY2_C 1FBV_C 3SQV_C 1C4Z_D 1JAT_B 2GMI_B 2H2Y_D 2R0J_A 3E95_B ....
Probab=97.49 E-value=0.00018 Score=63.77 Aligned_cols=61 Identities=20% Similarity=0.335 Sum_probs=48.5
Q ss_pred ccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccc--cccCCCCCHHHHHHHHHHHh
Q psy18020 10 HIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH--TWTPQNSKLEGLIKTLASAF 76 (434)
Q Consensus 10 ~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~--~W~~~~s~L~~li~~l~~~F 76 (434)
.+-+.|.++++||+.||.|.... ..-+++||.+|+|++..|+ +|.+ ..++..++..+...|
T Consensus 46 ~f~~~i~~p~~YP~~pP~v~f~t-----~i~HPni~~~G~icl~~l~~~~W~p-~~~i~~il~~i~~ll 108 (140)
T PF00179_consen 46 IFKFRISFPPDYPFSPPKVRFLT-----PIFHPNIDENGRICLDILNPESWSP-SYTIESILLSIQSLL 108 (140)
T ss_dssp EEEEEEEETTTTTTS--EEEESS-----S-SBTTB-TTSBBGHGGGTTTTC-T-TSHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccccc-----ccccccccccccchhhhhhcccCCc-ccccccHHHHHHHHH
Confidence 45789999999999999998752 2346899999999999998 4877 899999999999888
No 10
>PLN00172 ubiquitin conjugating enzyme; Provisional
Probab=97.46 E-value=0.00027 Score=63.49 Aligned_cols=67 Identities=21% Similarity=0.362 Sum_probs=53.0
Q ss_pred ecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccc-ccccCCCCCHHHHHHHHHHHhCC
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYL-HTWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL-~~W~~~~s~L~~li~~l~~~F~~ 78 (434)
|.|.. +-+.|.++++||+.||.|... |. | -+++||.+|.|++..| ++|.+ ..+|.+++..+...|..
T Consensus 45 yegg~--f~~~i~fp~~YP~~pP~v~f~-t~---i-~HPNv~~~G~iCl~il~~~W~p-~~ti~~il~~i~~ll~~ 112 (147)
T PLN00172 45 YAGGV--FFLSILFPPDYPFKPPKVQFT-TK---I-YHPNINSNGSICLDILRDQWSP-ALTVSKVLLSISSLLTD 112 (147)
T ss_pred CCCCE--EEEEEECCcccCCCCCEEEEe-cC---c-ccceECCCCEEEcccCcCCCCC-cCcHHHHHHHHHHHHhC
Confidence 44544 468889999999999998663 32 2 2577999999999999 68987 78899999888887754
No 11
>KOG0819|consensus
Probab=97.39 E-value=0.00014 Score=72.30 Aligned_cols=136 Identities=21% Similarity=0.248 Sum_probs=114.4
Q ss_pred chHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHH
Q psy18020 270 DEHIKMSLK------SAVEDKIRRKMNEQKVQAQEEIEI------LKQTENELNQGKFK--LARMFERIDKEKAELERSI 335 (434)
Q Consensus 270 ~e~~r~sl~------sal~dkL~~~L~e~~~q~~~ele~------L~~~q~EL~~~~~~--L~~~~~eLe~~k~~Le~~i 335 (434)
-+.++.+++ .+|++.|..|-+++++.+.+..+. +..++.||...+++ +..+..--+....+|++.+
T Consensus 22 Ae~L~kA~kG~Gtde~aII~iL~~Rsn~QRq~I~~ayk~~ygkDLi~~Lk~ELsG~Fe~~i~al~~~p~~~DA~~l~~am 101 (321)
T KOG0819|consen 22 AEQLRKAMKGFGTDEQAIIDILTHRSNAQRQLIRAAYKTMYGKDLIKDLKSELSGDFERAIVALMKPPAEYDAKELKKAM 101 (321)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHccCHHHHHHHHHHHHHHHhHHHHHHHHHHhCccHHHHHHHHcCCHHHhHHHHHHHHH
Confidence 456677776 588899999999999999888876 77899999998888 5777788899999999999
Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh---------cCCCCHHHHH
Q psy18020 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR---------RGVIDLEVYL 406 (434)
Q Consensus 336 ~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~---------~g~I~ld~fl 406 (434)
++++++++.|.+++|...+.+...+.. +|...|..-.||.-..||..++.|+|- .+.||-+.--
T Consensus 102 kg~gtde~vlIEIlcTRT~~el~~i~~-------aY~~~y~~sLEeDI~s~TSG~frklLv~L~~~~R~e~~~vd~~la~ 174 (321)
T KOG0819|consen 102 KGLGTDEKVLIEILCTRTNEELRAIRQ-------AYQELYKKSLEEDIASDTSGDFRKLLVSLVQGNRDEGDRVDDALAK 174 (321)
T ss_pred hccCcchhhheeeeccCCHHHHHHHHH-------HHHHHHcccHHHHhhhccCchHHHHHHHHHhcCCccCCCcCHHHHH
Confidence 999999999999999988887777777 999999999999999999999999983 1256655554
Q ss_pred HHHHHH
Q psy18020 407 KQVRSL 412 (434)
Q Consensus 407 K~vR~L 412 (434)
..++.|
T Consensus 175 ~dA~~L 180 (321)
T KOG0819|consen 175 QDAQDL 180 (321)
T ss_pred HHHHHH
Confidence 555544
No 12
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=96.02 E-value=0.083 Score=47.29 Aligned_cols=53 Identities=21% Similarity=0.303 Sum_probs=34.3
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFK 425 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~K 425 (434)
+.+.|-..+++-..-.|.| .+.|-.|.|++|.||++-+..=+.-.+-|+-..|
T Consensus 97 l~~~L~~~~~e~eeeSe~l---ae~fl~g~~d~~~Fl~~f~~~R~~yH~R~~K~Ek 149 (150)
T PF07200_consen 97 LLARLQAAASEAEEESEEL---AEEFLDGEIDVDDFLKQFKEKRKLYHLRRAKEEK 149 (150)
T ss_dssp HHHHHHHHHHHHHHHHHHH---C-S-SSSHHHHHHHHHHHHHHHHHHHHHH---HH
T ss_pred HHHHHHHHHHHHHHHHHHH---HHHHhCCCCCHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 7777777776665555554 6677789999999999988765555555544444
No 13
>KOG0424|consensus
Probab=95.25 E-value=0.036 Score=49.05 Aligned_cols=59 Identities=19% Similarity=0.453 Sum_probs=45.2
Q ss_pred ceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccccc---ccCCCCCHHHHHHHHHHHh
Q psy18020 12 PVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHT---WTPQNSKLEGLIKTLASAF 76 (434)
Q Consensus 12 Pi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~---W~~~~s~L~~li~~l~~~F 76 (434)
+|.+=++++||..||.|-..+.- =.++|+++|.|+|-.|.+ |.+ .-+|-+++--+++..
T Consensus 59 ~l~v~F~~dyP~~PPkckF~~pl-----~HPNVypsgtVcLsiL~e~~~W~p-aitikqiL~gIqdLL 120 (158)
T KOG0424|consen 59 KLTVNFPDDYPSSPPKCKFKPPL-----FHPNVYPSGTVCLSILNEEKDWRP-AITIKQILLGIQDLL 120 (158)
T ss_pred EEEEeCCccCCCCCCccccCCCC-----cCCCcCCCCcEehhhhccccCCCc-hhhHHHHHHHHHHHh
Confidence 56777899999999999554321 147899999999999964 876 667777777766644
No 14
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=95.14 E-value=0.36 Score=40.83 Aligned_cols=66 Identities=18% Similarity=0.211 Sum_probs=56.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 287 ~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..++..+..++.++|.+..-+.+|...++.|+..+..|+.......+.+..|+.+++++.+.++.-
T Consensus 12 ~el~n~La~Le~slE~~K~S~~eL~kqkd~L~~~l~~L~~q~~s~~qr~~eLqaki~ea~~~le~e 77 (107)
T PF09304_consen 12 NELQNRLASLERSLEDEKTSQGELAKQKDQLRNALQSLQAQNASRNQRIAELQAKIDEARRNLEDE 77 (107)
T ss_dssp --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334555567788899999999999999999999999999999999999999999999998887753
No 15
>COG5078 Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones]
Probab=94.96 E-value=0.044 Score=49.61 Aligned_cols=67 Identities=21% Similarity=0.377 Sum_probs=51.1
Q ss_pred ecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccc-cccCCCCCHHHHHHHHHHHhCC
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH-TWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~-~W~~~~s~L~~li~~l~~~F~~ 78 (434)
|.|..|. +.|=+|++||+.||.|-.... .=.++||.+|+|++-.|+ +|++ .-+|...+-.|+..|.+
T Consensus 50 Yegg~f~--~~l~fP~~YP~~PPkv~F~t~-----i~HPNV~~~G~vCLdIL~~~WsP-~~~l~sILlsl~slL~~ 117 (153)
T COG5078 50 YEGGIFK--LTLEFPEDYPFKPPKVRFTTK-----IFHPNVDPSGNVCLDILKDRWSP-VYTLETILLSLQSLLLS 117 (153)
T ss_pred cCCCEEE--EEEECCCCCCCCCCeeeeccC-----CcCCCcCCCCCChhHHHhCCCCc-cccHHHHHHHHHHHHcC
Confidence 5566654 577899999999999743211 113789999999987665 6875 88999999999888876
No 16
>KOG0895|consensus
Probab=94.82 E-value=0.044 Score=62.13 Aligned_cols=73 Identities=14% Similarity=0.285 Sum_probs=57.5
Q ss_pred cceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccccccC-------CCCCHHHHHHHHHHHhCCCCCcc
Q psy18020 11 IPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHTWTP-------QNSKLEGLIKTLASAFSALPPLY 83 (434)
Q Consensus 11 iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~W~~-------~~s~L~~li~~l~~~F~~~pPl~ 83 (434)
+=+.|.+|.+||.+||+||..-+. +|-|++...+|||++-.|..|.. +.|++++++-.++...-.+=|-|
T Consensus 900 f~fd~~~~~~yp~~pp~~~~~s~~---~r~npnly~~g~vc~s~l~tw~g~~~e~w~~~s~~lq~l~s~q~l~l~~~py~ 976 (1101)
T KOG0895|consen 900 FFFDFQFPQDYPSSPPLVHYHSGG---VRLNPNLYEDGKVCLSLLNTWHGRGNEVWNPSSSILQVLVSIQGLVLNEEPYF 976 (1101)
T ss_pred EEEEeecCCCCCCCCCceEeecCc---eeeCcccccccceehhhhccccCCCccccCcchhHHHHHHHhhhhhccccccc
Confidence 346789999999999999996554 56667788899999999999963 57888888877776665666666
Q ss_pred ccC
Q psy18020 84 SSR 86 (434)
Q Consensus 84 ~~~ 86 (434)
.-+
T Consensus 977 ne~ 979 (1101)
T KOG0895|consen 977 NEA 979 (1101)
T ss_pred Ccc
Confidence 553
No 17
>KOG1924|consensus
Probab=94.35 E-value=0.2 Score=55.24 Aligned_cols=36 Identities=19% Similarity=0.356 Sum_probs=19.6
Q ss_pred hhhHHHHHHHHHhcCCC---CHHHHHHHHHHH---HHHHHHH
Q psy18020 384 TEDTIYYMGEALRRGVI---DLEVYLKQVRSL---SRKQFMF 419 (434)
Q Consensus 384 ieDtiy~L~~aL~~g~I---~ld~flK~vR~L---aReQF~~ 419 (434)
++-.|-.|.+-|..-.| +=|.|+.+++.. |||||-+
T Consensus 906 m~~~ik~Le~dlk~~~~~~~e~dkF~ekM~~F~e~a~eq~~~ 947 (1102)
T KOG1924|consen 906 MENQIKKLERDLKNFKIAGNEHDKFVEKMTSFHEKAREQYSK 947 (1102)
T ss_pred HHHHHHHHHHHHHhcCCCCcchhhHHHHhhHHHHHHHHHHHH
Confidence 34444455555555444 446666666643 5677755
No 18
>KOG1924|consensus
Probab=94.15 E-value=0.33 Score=53.63 Aligned_cols=16 Identities=6% Similarity=0.212 Sum_probs=11.2
Q ss_pred cCCCCHHHHHHHHHHH
Q psy18020 397 RGVIDLEVYLKQVRSL 412 (434)
Q Consensus 397 ~g~I~ld~flK~vR~L 412 (434)
..++..|.+-|+++.+
T Consensus 891 aSrvnad~ikK~~~~m 906 (1102)
T KOG1924|consen 891 ASRVNADEIKKNLQQM 906 (1102)
T ss_pred hccccHHHHHHHHHHH
Confidence 3577788877777654
No 19
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=94.13 E-value=1.6 Score=38.21 Aligned_cols=68 Identities=6% Similarity=0.171 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
.+...|...+..+...++.+..-.++...-.+.+++++.+++..-+.+...++.++.....|+.-|..
T Consensus 54 ~vs~~l~~tKkhLsqRId~vd~klDe~~ei~~~i~~eV~~v~~dv~~i~~dv~~v~~~V~~Le~ki~~ 121 (126)
T PF07889_consen 54 QVSESLSSTKKHLSQRIDRVDDKLDEQKEISKQIKDEVTEVREDVSQIGDDVDSVQQMVEGLEGKIDE 121 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444433333333333344444444444444444444444444444443333
No 20
>PF14461 Prok-E2_B: Prokaryotic E2 family B
Probab=94.13 E-value=0.084 Score=46.51 Aligned_cols=66 Identities=17% Similarity=0.342 Sum_probs=46.7
Q ss_pred ccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceec---c-ccccccCCCCCHHHHHHHHHHHh
Q psy18020 8 VYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYS---P-YLHTWTPQNSKLEGLIKTLASAF 76 (434)
Q Consensus 8 ~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~---p-yL~~W~~~~s~L~~li~~l~~~F 76 (434)
.-.|+|.|-++..||..||.+|+.-.. .+.--+||+.+|.+++ - .++.|++ ...+.++++....+.
T Consensus 35 ~~~~~l~l~~p~~FP~~pp~v~l~d~~--~~~~~pHv~~~G~LCl~~~~~~~D~~~P-~~~~~~~l~~a~~lL 104 (133)
T PF14461_consen 35 GGPFPLRLVFPDDFPYLPPRVYLEDPK--QFPLLPHVESDGKLCLLDEELVLDPWDP-EGIIADCLERAIRLL 104 (133)
T ss_pred CeEEEEEEEECCcccCcCCEEEecCcc--ccCccCeEcCCCeEEEecCCcccCccCH-HHHHHHHHHHHHHHH
Confidence 457899999999999999999997322 2235599999999664 2 3555655 556666665555443
No 21
>PF09731 Mitofilin: Mitochondrial inner membrane protein; InterPro: IPR019133 Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex []. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420) []. ; GO: 0031305 integral to mitochondrial inner membrane
Probab=93.96 E-value=3.6 Score=44.95 Aligned_cols=36 Identities=17% Similarity=0.136 Sum_probs=25.0
Q ss_pred HHHhhhhHHHHHHHHHhcC----CCCHHHHHHHHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRRG----VIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g----~I~ld~flK~vR~LaRe 415 (434)
.-+++.=++..|..+|.+| ..+|...|+.+|.++.+
T Consensus 406 ~~~~l~~a~~~l~~~l~~~~~~~~~p~~~el~~l~~~~~~ 445 (582)
T PF09731_consen 406 RAQQLWLAVDALKSALDSGNAGSPRPFEDELRALKELAPD 445 (582)
T ss_pred HHHHHHHHHHHHHHHHHcCCCcCCCCHHHHHHHHHHhCCC
Confidence 3445556667777777777 47777777777777665
No 22
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=93.86 E-value=3.1 Score=40.46 Aligned_cols=113 Identities=17% Similarity=0.255 Sum_probs=56.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-----
Q psy18020 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE----- 357 (434)
Q Consensus 283 dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~----- 357 (434)
|.-..++...+.....+++.++.....++.....++.++..|+.....++..|+.+.++.+.++.-+...++..+
T Consensus 16 D~e~~rl~~~~~~~~~~l~k~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl~~v~~~~e~~aL~ 95 (239)
T COG1579 16 DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALN 95 (239)
T ss_pred HHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHH
Confidence 333444444444444455555555555555555566666666666666666666666666666665543322111
Q ss_pred CCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 358 VDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 358 ~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
.+++.+-.-..-+..+|.++.-+-..++|=|-.|.+.+
T Consensus 96 ~E~~~ak~r~~~le~el~~l~~~~~~l~~~i~~l~~~~ 133 (239)
T COG1579 96 IEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERL 133 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 13333333333344455555555555555555444444
No 23
>KOG0971|consensus
Probab=93.59 E-value=5.3 Score=45.25 Aligned_cols=36 Identities=25% Similarity=0.319 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+.++++.+..+-++|++..++|.+++++++..|.++
T Consensus 401 ~~kelE~k~sE~~eL~r~kE~Lsr~~d~aEs~iadl 436 (1243)
T KOG0971|consen 401 LQKELEKKNSELEELRRQKERLSRELDQAESTIADL 436 (1243)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333334444444444444444444433
No 24
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=93.42 E-value=2.2 Score=33.75 Aligned_cols=53 Identities=26% Similarity=0.250 Sum_probs=38.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 292 QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344 (434)
Q Consensus 292 ~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~e 344 (434)
.+..++.+++.|+.-..+|......|+.+...|+.++...+..|..|-.+.++
T Consensus 19 ti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e~~~~~~rl~~LL~kl~~ 71 (72)
T PF06005_consen 19 TIALLQMENEELKEKNNELKEENEELKEENEQLKQERNAWQERLRSLLGKLEE 71 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 33455777777777777888777888888888888888888877777665543
No 25
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=93.30 E-value=3.4 Score=40.18 Aligned_cols=62 Identities=19% Similarity=0.288 Sum_probs=44.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 292 QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 292 ~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
...+++..++.+.+.+.+|......++++++.|+...+.+++.+..++++.++|++-++.++
T Consensus 36 ~~~~sQ~~id~~~~e~~~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~ 97 (251)
T PF11932_consen 36 AAQQSQKRIDQWDDEKQELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIE 97 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445666777777777777777777777777777777777777777777777777766553
No 26
>PRK10884 SH3 domain-containing protein; Provisional
Probab=92.92 E-value=1.6 Score=41.59 Aligned_cols=50 Identities=14% Similarity=0.154 Sum_probs=30.9
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..+..++|...-...++.+.+|+++.++|++.+..++.+.+.++.-++.+
T Consensus 116 ~~~~~~~l~~~~~~~~~~~~~L~~~n~~L~~~l~~~~~~~~~l~~~~~~~ 165 (206)
T PRK10884 116 WNQRTAEMQQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDK 165 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444455444445666666677777777777777777776666655544
No 27
>PRK11637 AmiB activator; Provisional
Probab=92.55 E-value=4.4 Score=42.60 Aligned_cols=24 Identities=25% Similarity=0.430 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 322 ERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 322 ~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
..++.+-+.++..|..+++++++.
T Consensus 99 ~~~~~ei~~l~~eI~~~q~~l~~~ 122 (428)
T PRK11637 99 NQLNKQIDELNASIAKLEQQQAAQ 122 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333333
No 28
>KOG0427|consensus
Probab=92.52 E-value=0.27 Score=42.90 Aligned_cols=58 Identities=17% Similarity=0.431 Sum_probs=41.0
Q ss_pred eecCcccccceEEEecCCCCCCCCEe-EEecCCCceeccCCCcCCCCceecccc-ccccCCCCCHHHH
Q psy18020 3 NLSGLVYHIPVCIWLMDTHPDHAPIC-YVRPTADMRINVSQNVDHTGKIYSPYL-HTWTPQNSKLEGL 68 (434)
Q Consensus 3 ~~~g~~y~iPi~iwl~~~yP~~~P~~-~v~pt~~m~i~~~~~vd~~G~~~~pyL-~~W~~~~s~L~~l 68 (434)
.|.|.+|.+- +=+++.||+..|.| ||-| |-..+|+.+||.|+|-.| ++|.+ .-++..+
T Consensus 57 LYa~e~~qLq--~~F~~~YP~esPqVmF~~~-----~P~HPHiYSNGHICL~iL~d~WsP-Amsv~Sv 116 (161)
T KOG0427|consen 57 LYANETYQLQ--VEFPEHYPMESPQVMFVGP-----APLHPHIYSNGHICLDILYDSWSP-AMSVQSV 116 (161)
T ss_pred eecCcEEEEE--EecCCCCCCCCCeEEEecC-----CCCCCceecCCeEEEEeecccCCc-chhhHHH
Confidence 3667777654 45799999998874 5544 444699999999998776 67987 3334333
No 29
>PRK11637 AmiB activator; Provisional
Probab=92.47 E-value=11 Score=39.61 Aligned_cols=45 Identities=18% Similarity=0.207 Sum_probs=16.6
Q ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 306 TENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 306 ~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
++.+|...+..|+....+|+..+.+|+..-..+++..+.+.+.+.
T Consensus 87 ~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlr 131 (428)
T PRK11637 87 ASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLD 131 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333333333333333333333333333
No 30
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=92.40 E-value=3 Score=39.87 Aligned_cols=88 Identities=22% Similarity=0.262 Sum_probs=44.9
Q ss_pred CCCchHHHHHHHHHHHH----HHH---------HH---HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Q psy18020 267 TITDEHIKMSLKSAVED----KIR---------RK---MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330 (434)
Q Consensus 267 ~~~~e~~r~sl~sal~d----kL~---------~~---L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~ 330 (434)
.++.+..+..++.|+.- .|+ .+ ++++++..+++.+.|..--.+|...-+.++..+.+|+.+...
T Consensus 102 hiD~elvrkEl~nAlvRAGLktL~~v~~~~d~ke~~ee~kekl~E~~~EkeeL~~eleele~e~ee~~erlk~le~E~s~ 181 (290)
T COG4026 102 HIDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR 181 (290)
T ss_pred ccCHHHHHHHHHHHHHHHHHHHHhccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46667777777766542 111 11 123333333344444443344444444455555566666666
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 331 LERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 331 Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
|+.....|..+...|++..+.+..
T Consensus 182 LeE~~~~l~~ev~~L~~r~~ELe~ 205 (290)
T COG4026 182 LEEMLKKLPGEVYDLKKRWDELEP 205 (290)
T ss_pred HHHHHHhchhHHHHHHHHHHHhcc
Confidence 666666666666666555555543
No 31
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=92.10 E-value=3.1 Score=38.53 Aligned_cols=58 Identities=26% Similarity=0.310 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+...+++++.....++...-..+++....++......+..+..+++..+++.+-+.++
T Consensus 92 ~l~~el~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~e~~~l 149 (191)
T PF04156_consen 92 QLQEELDQLQERIQELESELEKLKEDLQELRELLKSVEERLDSLDESIKELEKEIREL 149 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444555555555555555555555555555555554444
No 32
>KOG0417|consensus
Probab=92.04 E-value=0.26 Score=44.11 Aligned_cols=67 Identities=22% Similarity=0.385 Sum_probs=48.9
Q ss_pred ecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccc-cccCCCCCHHHHHHHHHHHhCC
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH-TWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~-~W~~~~s~L~~li~~l~~~F~~ 78 (434)
|.|-++. +.|.+|++||+.||-|.. -|. .=+++||.+|+|++-.|. .|.+ ..+|..++..++..+.+
T Consensus 45 YEgG~F~--l~I~~p~~YP~~PPkV~F-~Tk----IyHPNI~~~G~IclDILk~~WsP-Al~i~~VllsI~sLL~~ 112 (148)
T KOG0417|consen 45 YEGGVFF--LEIHFPEDYPFKPPKVRF-LTK----IYHPNIDSNGRICLDILKDQWSP-ALTISKVLLSICSLLSD 112 (148)
T ss_pred cCCCEEE--EEEECCCCCCCCCCceEe-ecc----cccCCcCccccchHHhhhccCCh-hhHHHHHHHHHHHHhcC
Confidence 4444443 688999999999997533 121 235789999999988874 5766 88888888888876644
No 33
>KOG0422|consensus
Probab=91.88 E-value=0.48 Score=41.90 Aligned_cols=58 Identities=21% Similarity=0.376 Sum_probs=44.7
Q ss_pred eEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccc--cccCCCCCHHHHHHHHHHHh
Q psy18020 13 VCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH--TWTPQNSKLEGLIKTLASAF 76 (434)
Q Consensus 13 i~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~--~W~~~~s~L~~li~~l~~~F 76 (434)
|.|=+|-+|||.||.+-.+- |+- .++||..|.|++|.|. +|.+ .-..-++++.|....
T Consensus 53 l~I~fp~eYPFKPP~i~f~t---kiY--HpNVDe~gqvClPiis~EnWkP-~T~teqVlqaLi~li 112 (153)
T KOG0422|consen 53 LEIDFPVEYPFKPPKIKFKT---KIY--HPNVDEKGQVCLPIISAENWKP-ATRTEQVLQALIALI 112 (153)
T ss_pred EEeeCCCCCCCCCCeeeeee---eec--cCCCCCCCceeeeeeecccccC-cccHHHHHHHHHHHh
Confidence 56778999999999875532 221 4789999999999986 6876 667788888888754
No 34
>KOG4657|consensus
Probab=91.71 E-value=4.7 Score=38.61 Aligned_cols=31 Identities=16% Similarity=0.203 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 323 RIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 323 eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
+=+..+..+++.|..++++.+.+...+..++
T Consensus 83 ek~~~q~~ieqeik~~q~elEvl~~n~Q~lk 113 (246)
T KOG4657|consen 83 EKEARQMGIEQEIKATQSELEVLRRNLQLLK 113 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334455566666666666666666666554
No 35
>KOG0976|consensus
Probab=91.36 E-value=10 Score=42.60 Aligned_cols=95 Identities=20% Similarity=0.185 Sum_probs=43.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
++....+|+....+++..++.|..+.+++...-+.|+++.. .+.+ +--|+-.+++|..-..-++.++.+-|+
T Consensus 136 ~ke~etelE~~~srlh~le~eLsAk~~eIf~~~~~L~nk~~-~lt~-------~~~q~~tkl~e~~~en~~le~k~~k~~ 207 (1265)
T KOG0976|consen 136 KKENEIEIENLNSRLHKLEDELSAKAHDIFMIGEDLHDKNE-ELNE-------FNMEFQTKLAEANREKKALEEKLEKFK 207 (1265)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHhhhhHHHHHHHHHHhhhhh-HHhH-------HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333344444444444444444444444444444433321 1111 334444555554444444444444443
Q ss_pred c-----CCCCH---------HHHHHHHHHHH-HHHHHH
Q psy18020 397 R-----GVIDL---------EVYLKQVRSLS-RKQFMF 419 (434)
Q Consensus 397 ~-----g~I~l---------d~flK~vR~La-ReQF~~ 419 (434)
. ..+.+ -.-+|.|++|+ +.|||.
T Consensus 208 e~~~~nD~~sle~~~~q~~tq~vl~ev~QLss~~q~lt 245 (1265)
T KOG0976|consen 208 EDLIEKDQKSLELHKDQENTQKVLKEVMQLSSQKQTLT 245 (1265)
T ss_pred HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh
Confidence 2 23333 34678888884 677775
No 36
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=91.26 E-value=2.9 Score=36.69 Aligned_cols=18 Identities=17% Similarity=0.181 Sum_probs=6.8
Q ss_pred HHHHHHHHhHHHHHHHHH
Q psy18020 304 KQTENELNQGKFKLARMF 321 (434)
Q Consensus 304 ~~~q~EL~~~~~~L~~~~ 321 (434)
..++.+|..+-+.|..-+
T Consensus 60 ~~tKkhLsqRId~vd~kl 77 (126)
T PF07889_consen 60 SSTKKHLSQRIDRVDDKL 77 (126)
T ss_pred HHHHHHHHHHHHHHHhhH
Confidence 333434443333333333
No 37
>KOG0419|consensus
Probab=91.13 E-value=0.47 Score=41.53 Aligned_cols=67 Identities=16% Similarity=0.392 Sum_probs=48.9
Q ss_pred ecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccccc-ccCCCCCHHHHHHHHHHHhCC
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHT-WTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~-W~~~~s~L~~li~~l~~~F~~ 78 (434)
|.|-++. +.|=+.++||..||.|-. ..+.=.++|+++|.+++-.|++ |++ ..++...+-.++..+.+
T Consensus 48 ~e~gtFk--Ltl~FteeYpnkPP~VrF-----vs~mFHPNvya~G~iClDiLqNrWsp-~Ydva~ILtsiQslL~d 115 (152)
T KOG0419|consen 48 FEGGTFK--LTLEFTEEYPNKPPTVRF-----VSKMFHPNVYADGSICLDILQNRWSP-TYDVASILTSIQSLLND 115 (152)
T ss_pred cCCceEE--EEEEcccccCCCCCeeEe-----eeeccCCCcCCCCcchHHHHhcCCCC-chhHHHHHHHHHHHhcC
Confidence 4455544 466789999999999733 1223358899999999999986 655 88888887777776643
No 38
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=91.09 E-value=10 Score=41.56 Aligned_cols=107 Identities=20% Similarity=0.263 Sum_probs=59.3
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHH
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILN 376 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~ 376 (434)
+.+++.+.....++...+.++...+..|+.....++..+..++.+..++.+.+... .-+.+.+ -|..++.
T Consensus 382 ~e~leel~e~leeie~eq~ei~e~l~~Lrk~E~eAr~kL~~~~~~L~~ikr~l~k~---~lpgip~-------~y~~~~~ 451 (569)
T PRK04778 382 QEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEKS---NLPGLPE-------DYLEMFF 451 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc---CCCCCcH-------HHHHHHH
Confidence 34444444444455555555555555555555555555555555555555554432 2223333 3333333
Q ss_pred hhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHH-HHHHHHHHH
Q psy18020 377 TFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVR-SLSRKQFMF 419 (434)
Q Consensus 377 l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR-~LaReQF~~ 419 (434)
.+++-|..|.+.|++|.|+++.=-+... .-.|=.|+.
T Consensus 452 ------~~~~~i~~l~~~L~~g~VNm~ai~~e~~e~~~~~~~L~ 489 (569)
T PRK04778 452 ------EVSDEIEALAEELEEKPINMEAVNRLLEEATEDVETLE 489 (569)
T ss_pred ------HHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHH
Confidence 4577888899999999999887763333 233444444
No 39
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=90.71 E-value=15 Score=38.55 Aligned_cols=35 Identities=14% Similarity=0.122 Sum_probs=17.5
Q ss_pred hHHHHhhhhHHHHHHHHHhcCCC-CHHHHHHHHHHH
Q psy18020 378 FTEEAATEDTIYYMGEALRRGVI-DLEVYLKQVRSL 412 (434)
Q Consensus 378 ~Aed~AieDtiy~L~~aL~~g~I-~ld~flK~vR~L 412 (434)
.-||..=.+-+-.|-++++.+.. -.|.+.+....|
T Consensus 138 ~~eda~~~~R~ai~~~~l~~~~~~~i~~l~~~~~~l 173 (420)
T COG4942 138 SPEDAQRSVRLAIYYGALNPARAERIDALKATLKQL 173 (420)
T ss_pred ChhhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Confidence 34444445666666666666654 233343333333
No 40
>PF05816 TelA: Toxic anion resistance protein (TelA); InterPro: IPR008863 This family consists of several prokaryotic TelA like proteins. TelA and KlA are associated with tellurite resistance [] and plasmid fertility inhibition [].
Probab=90.63 E-value=15 Score=37.47 Aligned_cols=146 Identities=13% Similarity=0.057 Sum_probs=71.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy18020 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355 (434)
Q Consensus 276 sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k 355 (434)
.+...+-.+++..+.+.+.+.+..-+++...-.+|+.++..|.+-...|+.+.+.....+..|+..+...+..++.++..
T Consensus 73 ~~l~klf~k~~~~~~~~~~ky~sv~~qId~I~~~L~~~~~~L~~d~~~L~~l~~~n~~~~~~L~~~I~ag~~~~~~l~~~ 152 (333)
T PF05816_consen 73 GFLGKLFGKAKNSLERYFAKYQSVQSQIDKIIAELESGQDELLRDNAMLDQLYEKNWEYYQELEKYIAAGELKLEELEAE 152 (333)
T ss_pred hHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33444444555555555554444444555566666666666666666666666666666666666666555555555322
Q ss_pred CCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 356 TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYK 428 (434)
Q Consensus 356 ~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~ 428 (434)
......+-.. .+.+- .=+...+++.++.|.+-++.=.+..-+.+-..=.+-==|...+.|++||..
T Consensus 153 ~~~~~~~~~~-~d~~~------~q~~~~~~~~l~~leqRi~DL~~~~~va~Q~~pqir~iq~nN~~Li~ki~~ 218 (333)
T PF05816_consen 153 LLPALQADAE-GDQMD------AQELADLEQALFRLEQRIQDLQLSRQVAIQTAPQIRMIQNNNRELIEKIQS 218 (333)
T ss_pred hhHHHhhccc-cCHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhHHHHHHHHH
Confidence 1100000000 11111 112344566777776666553333333333222222235555666666653
No 41
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=90.48 E-value=6 Score=34.42 Aligned_cols=53 Identities=25% Similarity=0.349 Sum_probs=26.0
Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 298 ~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+++=.|....+++......+.....+++....+.+..++.|+++.+++++.-.
T Consensus 51 ~Eiv~l~~~~e~~~~~~~~~~~L~~el~~l~~ry~t~LellGEK~E~veEL~~ 103 (120)
T PF12325_consen 51 EEIVKLMEENEELRALKKEVEELEQELEELQQRYQTLLELLGEKSEEVEELRA 103 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHH
Confidence 33333344444444444444444455555555555555555555555554433
No 42
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=90.45 E-value=4.2 Score=42.58 Aligned_cols=36 Identities=11% Similarity=0.039 Sum_probs=16.9
Q ss_pred CCccccccccch----HHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 358 VDVDEAVTTTAP----IYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 358 ~diDe~i~~~~~----l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
..+-.++.++++ ...++|..+.. ++-+.|..|.+-+
T Consensus 131 p~~~ll~~~eda~~~~R~ai~~~~l~~--~~~~~i~~l~~~~ 170 (420)
T COG4942 131 PPPALLVSPEDAQRSVRLAIYYGALNP--ARAERIDALKATL 170 (420)
T ss_pred CCchhhcChhhhhHHHHHHHHHHHhhH--HHHHHHHHHHHHH
Confidence 344445555553 34555554443 3345555555443
No 43
>KOG0933|consensus
Probab=90.17 E-value=13 Score=42.66 Aligned_cols=60 Identities=12% Similarity=0.060 Sum_probs=42.5
Q ss_pred HHHHHHHhhHHHHhhhhHHH-------HHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 370 IYKQILNTFTEEAATEDTIY-------YMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKH 429 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy-------~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~ 429 (434)
+..+|=++-++-..|++-|. .+.+....|.++...-...+-.+.+|-=-.|.-+.++.+.
T Consensus 869 ~~~el~~~k~k~~~~dt~i~~~~~~~e~~~~e~~~~~l~~kkle~e~~~~~~e~~~~~k~v~~l~~k 935 (1174)
T KOG0933|consen 869 AQAELKDQKAKQRDIDTEISGLLTSQEKCLSEKSDGELERKKLEHEVTKLESEKANARKEVEKLLKK 935 (1174)
T ss_pred HHHHHHHHHHHHHhhhHHHhhhhhHHHHHHHHhhcccchHHHHHhHHHHhhhhHHHHHHHHHHHHHh
Confidence 66666677777777776663 2334445677788888888888888888888877777654
No 44
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=90.13 E-value=5.1 Score=40.58 Aligned_cols=95 Identities=21% Similarity=0.232 Sum_probs=66.6
Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHh
Q psy18020 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNT 377 (434)
Q Consensus 298 ~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l 377 (434)
.++|.+......|......+-++++++..+++....-+.+|..+ |.-.|.. +....+|||.++...-=+..+|=.+
T Consensus 133 ~qLEk~~~q~~qLe~d~qs~lDEkeEl~~ERD~yk~K~~RLN~E---Ln~~L~g-~~~rivDIDaLi~ENRyL~erl~q~ 208 (319)
T PF09789_consen 133 EQLEKLREQIEQLERDLQSLLDEKEELVTERDAYKCKAHRLNHE---LNYILNG-DENRIVDIDALIMENRYLKERLKQL 208 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhCC-CCCCcccHHHHHHHHHHHHHHHHHH
Confidence 34444555555555555556677777777777777666666433 3333321 2346689999998888889999999
Q ss_pred hHHHHhhhhHHHHHHHHHh
Q psy18020 378 FTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 378 ~Aed~AieDtiy~L~~aL~ 396 (434)
..|-.-+.-+|--+..+|.
T Consensus 209 qeE~~l~k~~i~KYK~~le 227 (319)
T PF09789_consen 209 QEEKELLKQTINKYKSALE 227 (319)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 9999989999998888887
No 45
>KOG1853|consensus
Probab=90.07 E-value=20 Score=35.06 Aligned_cols=24 Identities=21% Similarity=0.284 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 330 ELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 330 ~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.|+..+..++...+.+.+.|.+|+
T Consensus 95 ~Leddlsqt~aikeql~kyiReLE 118 (333)
T KOG1853|consen 95 QLEDDLSQTHAIKEQLRKYIRELE 118 (333)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444555555555554
No 46
>KOG0994|consensus
Probab=89.37 E-value=32 Score=40.41 Aligned_cols=33 Identities=18% Similarity=0.267 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 322 ERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 322 ~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
.+++++....|+.+....+.+.+|+..+++|+.
T Consensus 1601 ~kv~~~t~~aE~~~~~a~q~~~eL~~~~e~lk~ 1633 (1758)
T KOG0994|consen 1601 AKVQEETAAAEKLATSATQQLGELETRMEELKH 1633 (1758)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444445555555666677777777653
No 47
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=89.33 E-value=0.9 Score=45.95 Aligned_cols=76 Identities=22% Similarity=0.288 Sum_probs=42.7
Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHH
Q psy18020 310 LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIY 389 (434)
Q Consensus 310 L~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy 389 (434)
|+..+..+..++.+|+.++.++++++..|+.+.+++++-... ....-+.+..++++..-+..+++.-+.
T Consensus 55 le~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~~eE~~-----------~~~~~n~~~~~l~~~~~e~~sl~~q~~ 123 (314)
T PF04111_consen 55 LEQEEEELLQELEELEKEREELDQELEELEEELEELDEEEEE-----------YWREYNELQLELIEFQEERDSLKNQYE 123 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444455555555555555555555555555555433322 122223367777777777788877777
Q ss_pred HHHHHHh
Q psy18020 390 YMGEALR 396 (434)
Q Consensus 390 ~L~~aL~ 396 (434)
+..+-|.
T Consensus 124 ~~~~~L~ 130 (314)
T PF04111_consen 124 YASNQLD 130 (314)
T ss_dssp HHHHHHH
T ss_pred HHHHHHH
Confidence 7666654
No 48
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=89.19 E-value=5.8 Score=31.80 Aligned_cols=38 Identities=29% Similarity=0.404 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..|.++...+...+..|++..+.|+.+....++.|..+
T Consensus 35 ~~L~~e~~~~~~~r~~L~~en~qLk~E~~~WqerLr~L 72 (79)
T PRK15422 35 NSLSQEVQNAQHQREELERENNHLKEQQNGWQERLQAL 72 (79)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455555555555555555555555555544444433
No 49
>PHA02562 46 endonuclease subunit; Provisional
Probab=88.63 E-value=26 Score=37.83 Aligned_cols=9 Identities=11% Similarity=0.150 Sum_probs=4.5
Q ss_pred CCCHHHHHH
Q psy18020 62 NSKLEGLIK 70 (434)
Q Consensus 62 ~s~L~~li~ 70 (434)
+|+|++.|.
T Consensus 40 KStll~aI~ 48 (562)
T PHA02562 40 KSTMLEALT 48 (562)
T ss_pred HHHHHHHHH
Confidence 555554443
No 50
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=88.62 E-value=5.8 Score=42.07 Aligned_cols=56 Identities=21% Similarity=0.208 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 297 QEEIEILKQTENELNQGK-FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~-~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
++|.++|++.+..+..+- +.|+.+..+++.+++.|+..++.++..+++|.+-++.+
T Consensus 86 ~~eN~~L~~r~~~id~~i~~av~~~~~~~~~~~~ql~~~~~~~~~~l~~l~~~l~~~ 142 (472)
T TIGR03752 86 KAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQLQGLIDQLQRRLAGV 142 (472)
T ss_pred HHHHHHHHHhhhhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 344444444333333332 22555556666677777777777777777776665444
No 51
>KOG3091|consensus
Probab=88.62 E-value=6.2 Score=41.91 Aligned_cols=31 Identities=19% Similarity=0.220 Sum_probs=19.4
Q ss_pred hcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 396 RRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKHI 430 (434)
Q Consensus 396 ~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~~ 430 (434)
.+..||.|.-...--.|.+|| -.|.|+.+.+
T Consensus 455 ~~~~iD~~~~~e~~e~lt~~~----e~l~~Lv~Il 485 (508)
T KOG3091|consen 455 ESYWIDFDKLIEMKEHLTQEQ----EALTKLVNIL 485 (508)
T ss_pred cceeechhhhHHHHHHHHHHH----HHHHHHHHHH
Confidence 345777777777767777777 3455555443
No 52
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=88.59 E-value=13 Score=41.82 Aligned_cols=108 Identities=22% Similarity=0.245 Sum_probs=54.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CC-CCCccccccccchHH--HHH--HHhhHHHHhhhhHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTE--KT-EVDVDEAVTTTAPIY--KQI--LNTFTEEAATEDTIY 389 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~--k~-~~diDe~i~~~~~l~--~Ql--~~l~Aed~AieDtiy 389 (434)
++.-+++||.+...|+.++...++...+|++-+.+++. ++ ..|++.++.+-..+- ++- -.|-||..--=|...
T Consensus 543 ~r~r~~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~~e~~~~~e~L~~aL~amqdk~~~LE~sLsaEtriKldLfs 622 (697)
T PF09726_consen 543 CRQRRRQLESELKKLRRELKQKEEQIRELESELQELRKYEKESEKDTEVLMSALSAMQDKNQHLENSLSAETRIKLDLFS 622 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH
Confidence 55555666666666666666666666666665554443 22 346666443333331 111 124455555555555
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 390 YMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKHIS 431 (434)
Q Consensus 390 ~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~~~ 431 (434)
+||+|-+. ++..-.++| ++-..++.|-.||.+.|.
T Consensus 623 aLg~akrq----~ei~~~~~~---~~d~ei~~lk~ki~~~~a 657 (697)
T PF09726_consen 623 ALGDAKRQ----LEIAQGQLR---KKDKEIEELKAKIAQLLA 657 (697)
T ss_pred HHHHHHHH----HHHHHHHHH---HHHHHHHHHHHHHHHHHh
Confidence 55555443 222222222 344556667777776654
No 53
>KOG0420|consensus
Probab=88.57 E-value=0.6 Score=42.69 Aligned_cols=68 Identities=21% Similarity=0.292 Sum_probs=52.9
Q ss_pred eecCcccccceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccc-cccCCCCCHHHHHHHHHHHhCC
Q psy18020 3 NLSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH-TWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 3 ~~~g~~y~iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~-~W~~~~s~L~~li~~l~~~F~~ 78 (434)
.|+|.+++ +.|=++..||+.||-|-+ -| + +=++++|.+|.|+|-.|+ +|.+ .-||.+.|--|...|-+
T Consensus 72 yY~gGkf~--F~~~v~~~Yp~~PPKVkC-lt--k--V~HPNId~~GnVCLnILRedW~P-~lnL~sIi~GL~~LF~e 140 (184)
T KOG0420|consen 72 YYQGGKFR--FKFKVPNAYPHEPPKVKC-LT--K--VYHPNIDLDGNVCLNILREDWRP-VLNLNSIIYGLQFLFLE 140 (184)
T ss_pred eecCceEE--EEEECCCCCCCCCCeeee-ee--c--cccCCcCCcchHHHHHHHhcCcc-ccchHHHHHHHHHHhcc
Confidence 36666555 468899999999999843 22 1 124789999999988886 5876 88999999999998855
No 54
>TIGR01541 tape_meas_lam_C phage tail tape measure protein, lambda family. This model represents a relatively well-conserved region near the C-terminus of the tape measure protein of a lambda and related phage. This protein, which controls phage tail length, is typically about 1000 residues in length. Both low-complexity sequence and insertion/deletion events appear common in this family. Mutational studies suggest a ruler or template role in the determination of phage tail length. Similar behavior is attributed to proteins from distantly related or unrelated families in other phage.
Probab=88.38 E-value=9.3 Score=39.03 Aligned_cols=40 Identities=10% Similarity=0.079 Sum_probs=26.8
Q ss_pred chHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHH
Q psy18020 368 APIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQV 409 (434)
Q Consensus 368 ~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~v 409 (434)
+-++.|+-+++ -.|++..-..|.+.+..|+.+|..|.+.|
T Consensus 165 ~n~a~~~~~~~--~~af~gm~dal~~fvttGk~~f~d~~~si 204 (332)
T TIGR01541 165 TNVASAAAQLA--TNAFGGMASNIAQMLTTGKANWKSFAVSV 204 (332)
T ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHHHhHhhHHHHHHHH
Confidence 33444444443 34556666667777788999999999876
No 55
>PF14915 CCDC144C: CCDC144C protein coiled-coil region
Probab=88.37 E-value=19 Score=35.99 Aligned_cols=110 Identities=21% Similarity=0.247 Sum_probs=73.6
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H--------HHHHHHHHhcCCCCCCccccccccchHHHHHH
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAELERSISFLKEKE---T--------ELDEILAVLTEKTEVDVDEAVTTTAPIYKQIL 375 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~---~--------eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~ 375 (434)
.++|+..++.-++...+++..+..|...|...+... . +-++|+. +.++-..||..+......|+.||-
T Consensus 76 ~SkLe~EKq~kerLEtEiES~rsRLaaAi~d~dqsq~skrdlelafqr~rdEw~~-lqdkmn~d~S~lkd~ne~LsQqLs 154 (305)
T PF14915_consen 76 NSKLEKEKQNKERLETEIESYRSRLAAAIQDHDQSQTSKRDLELAFQRARDEWVR-LQDKMNSDVSNLKDNNEILSQQLS 154 (305)
T ss_pred hHHHHHhHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhHHHHHHHHHHHhhHHHH-HHHHhcchHHhHHHHhHHHHHHHH
Confidence 334444444444445555556666666665554321 1 1134443 234555688888888899999999
Q ss_pred HhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHH
Q psy18020 376 NTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFR 420 (434)
Q Consensus 376 ~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~R 420 (434)
..-++-..+++=+++...+|+...+-|+. -=|.|+.-|+-..
T Consensus 155 kaesK~nsLe~elh~trdaLrEKtL~lE~---~QrdL~Qtq~q~K 196 (305)
T PF14915_consen 155 KAESKFNSLEIELHHTRDALREKTLALES---VQRDLSQTQCQIK 196 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence 99999999999999999999988877763 3466766666543
No 56
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=88.23 E-value=7.9 Score=33.96 Aligned_cols=75 Identities=23% Similarity=0.326 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHH
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQI 374 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql 374 (434)
....++.|..++.++...+..+.....+++.-+..|+..-.....+...|++-|+.++. -+|++-.-.+.|++||
T Consensus 54 Ha~~~~~L~~lr~e~~~~~~~~~~l~~~~~~a~~~l~~~e~sw~~qk~~le~e~~~~~~----r~~dL~~QN~lLh~Ql 128 (132)
T PF07926_consen 54 HAEDIKELQQLREELQELQQEINELKAEAESAKAELEESEASWEEQKEQLEKELSELEQ----RIEDLNEQNKLLHDQL 128 (132)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH
Confidence 34445555555555555444444444333333333333333333333333333333311 2233333444566665
No 57
>TIGR00996 Mtu_fam_mce virulence factor Mce family protein. Members of this paralogous family are found as six tandem homologous proteins in the same orientation per cassette, in four separate cassettes in Mycobacterium tuberculosis. The six members of each cassette represent six subfamilies. One subfamily includes the protein mce (mycobacterial cell entry), a virulence protein required for invasion of non-phagocytic cells.
Probab=88.19 E-value=5.7 Score=39.24 Aligned_cols=24 Identities=13% Similarity=0.077 Sum_probs=12.7
Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 392 GEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 392 ~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
...|...+=+++..+.....+++.
T Consensus 247 ~~~l~~~~~~l~~~l~~l~~~~~~ 270 (291)
T TIGR00996 247 RDLLAENRPNLPQALANLAPVLTL 270 (291)
T ss_pred HHHHHHhhhhHHHHHHHHHHHHHH
Confidence 344544555556666665555543
No 58
>COG1463 Ttg2C ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=88.17 E-value=5.5 Score=40.92 Aligned_cols=89 Identities=15% Similarity=0.197 Sum_probs=64.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHH
Q psy18020 292 QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIY 371 (434)
Q Consensus 292 ~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~ 371 (434)
...++.+.++.+.++-++|..+...|...+..|...-+.+....+.+.+..+.+...+..+.+.. -++++++...+.+.
T Consensus 174 ~~~~l~~~l~~l~~~~~~ln~~~~~i~~~i~~l~~~~~~~~~~~~~l~~~~~~l~~l~~~~~~~~-~~l~~~l~~~~~~~ 252 (359)
T COG1463 174 TGPQLNALLDNLAQFTDALNARDGDIGALIANLNQLLDSLAAASDQLDRLLDNLATLTAALAARR-DALDDALAALSALA 252 (359)
T ss_pred cchHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhch-HHHHHHHHHHHHHH
Confidence 34456677777888888888888888888888888888888888777777777777777665544 46666666666666
Q ss_pred HHHHHhhHHH
Q psy18020 372 KQILNTFTEE 381 (434)
Q Consensus 372 ~Ql~~l~Aed 381 (434)
++.-++++|.
T Consensus 253 ~~~~~ll~~~ 262 (359)
T COG1463 253 ATVNDLLAEN 262 (359)
T ss_pred HHHHHHHHhh
Confidence 6666666554
No 59
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=88.10 E-value=10 Score=38.42 Aligned_cols=16 Identities=25% Similarity=0.495 Sum_probs=11.8
Q ss_pred HHhhhhHHHHHHHHHh
Q psy18020 381 EAATEDTIYYMGEALR 396 (434)
Q Consensus 381 d~AieDtiy~L~~aL~ 396 (434)
+.-..|.+|-+...|.
T Consensus 207 EsKVqDLm~EirnLLQ 222 (401)
T PF06785_consen 207 ESKVQDLMYEIRNLLQ 222 (401)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5667788888777775
No 60
>PF11985 DUF3486: Protein of unknown function (DUF3486); InterPro: IPR021874 This entry is represented by Bacteriophage Mu, Gp27. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=88.09 E-value=6 Score=36.58 Aligned_cols=106 Identities=16% Similarity=0.198 Sum_probs=67.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHH--HHhhhh-----HHHHHHHHHh
Q psy18020 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTE--EAATED-----TIYYMGEALR 396 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Ae--d~AieD-----tiy~L~~aL~ 396 (434)
+......++...+.+++..+..+.|++.+.+..+.|+-+++. ..+..++++++.. +.--+| .+..|.+++.
T Consensus 52 v~RY~~~~~~~~~~lr~are~a~al~~~~~~~~~~~~~~al~--~~~~~~~f~~l~~~~~~~~~~~~~~k~l~~la~~~~ 129 (180)
T PF11985_consen 52 VHRYAQRFEEVLERLREAREIAEALAEELGDEPEDDVTEALI--EMLQTLLFEALMSAQEEDEEDDDPPKDLMKLAKALA 129 (180)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchHHHHHH--HHHHHHHHHHHHHhhcccccchhhHHHHHHHHHHHH
Confidence 334566677778888888888899999887666666666542 2344555554333 222222 5777777764
Q ss_pred ---cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Q psy18020 397 ---RGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKHISYA 433 (434)
Q Consensus 397 ---~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~~~l~ 433 (434)
+.....+.|-+++|.-.++.+-. .+.++.+..||.
T Consensus 130 ~l~~A~~~~~k~~~~ir~~~~~~~~~--~~e~~~~~~Gls 167 (180)
T PF11985_consen 130 RLSRASVYSKKRRAEIRKKAAAEAAA--AAEEAAKEGGLS 167 (180)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHhhcCCC
Confidence 44667777777777666666655 567777777764
No 61
>PF04799 Fzo_mitofusin: fzo-like conserved region; InterPro: IPR006884 This entry represents the heptad repeat domain which is conserved at the C terminus of Fzo/mitofusion family of GTPases. Fzo is a mediator of mitochondrial fusion during spermatogenesis []. This conserved region is also found in the human mitofusin protein []. This domain forms a dimeric antiparallel coiled coil structure, which has been proposed to act as a mitochodrial tether before vesicle fusion [].; GO: 0003924 GTPase activity, 0006184 GTP catabolic process, 0008053 mitochondrial fusion, 0005741 mitochondrial outer membrane, 0016021 integral to membrane; PDB: 1T3J_A.
Probab=88.06 E-value=1.3 Score=40.76 Aligned_cols=43 Identities=30% Similarity=0.417 Sum_probs=14.9
Q ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 305 QTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 305 ~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+.|.||...+..|.+.++. .+++|+.+|.+|+.+++.||+...
T Consensus 102 QVqqeL~~tf~rL~~~Vd~---~~~eL~~eI~~L~~~i~~le~~~~ 144 (171)
T PF04799_consen 102 QVQQELSSTFARLCQQVDQ---TKNELEDEIKQLEKEIQRLEEIQS 144 (171)
T ss_dssp --------HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444443333 334444444444444444444433
No 62
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=87.77 E-value=9.6 Score=29.54 Aligned_cols=57 Identities=18% Similarity=0.109 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+.+.++.|-..-.+|......|......++.++..|......-..+++.|..-+..+
T Consensus 5 Le~kle~Li~~~~~L~~EN~~Lr~q~~~~~~ER~~L~ekne~Ar~rvEamI~RLk~l 61 (65)
T TIGR02449 5 LAAQVEHLLEYLERLKSENRLLRAQEKTWREERAQLLEKNEQARQKVEAMITRLKAL 61 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Confidence 344455555566666666666777777777777777777766666666666555544
No 63
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=87.61 E-value=48 Score=36.29 Aligned_cols=99 Identities=18% Similarity=0.252 Sum_probs=61.2
Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhh
Q psy18020 299 EIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 299 ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
.++.+...=.++...+..+...+..|+......++.+..++.+..++.+.++.. .-+.+-+ -|.+++..+
T Consensus 380 ~l~~~~~~l~~ie~~q~~~~~~l~~L~~dE~~Ar~~l~~~~~~l~~ikR~lek~---nLPGlp~-------~y~~~~~~~ 449 (560)
T PF06160_consen 380 ELEEIEEQLEEIEEEQEEINESLQSLRKDEKEAREKLQKLKQKLREIKRRLEKS---NLPGLPE-------DYLDYFFDV 449 (560)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc---CCCCCCH-------HHHHHHHHH
Confidence 333333333344444444666666666666666666666666666666666544 3344433 344445444
Q ss_pred HHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy18020 379 TEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLS 413 (434)
Q Consensus 379 Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~La 413 (434)
.|-|..|.+.|++..|+++.--+.+....
T Consensus 450 ------~~~i~~l~~~L~~~pinm~~v~~~l~~a~ 478 (560)
T PF06160_consen 450 ------SDEIEELSDELNQVPINMDEVNKQLEEAE 478 (560)
T ss_pred ------HHHHHHHHHHHhcCCcCHHHHHHHHHHHH
Confidence 48899999999999999998877776544
No 64
>COG5185 HEC1 Protein involved in chromosome segregation, interacts with SMC proteins [Cell division and chromosome partitioning]
Probab=87.51 E-value=28 Score=37.01 Aligned_cols=31 Identities=23% Similarity=0.175 Sum_probs=18.3
Q ss_pred HHHhhhhHHHHHHHHHhcCCCCHHHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRRGVIDLEVYLKQVR 410 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g~I~ld~flK~vR 410 (434)
|-.|+..-+.-|..-+.+..|+.+.|-+.+-
T Consensus 345 ei~~L~~~~d~L~~q~~kq~Is~e~fe~mn~ 375 (622)
T COG5185 345 EIKALQSNIDELHKQLRKQGISTEQFELMNQ 375 (622)
T ss_pred HHHHHHhhHHHHHHHHHhcCCCHHHHHHHHH
Confidence 3345555555566666666666666665553
No 65
>PF13094 CENP-Q: CENP-Q, a CENPA-CAD centromere complex subunit
Probab=87.40 E-value=19 Score=32.49 Aligned_cols=59 Identities=15% Similarity=0.182 Sum_probs=44.9
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 294 VQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 294 ~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++....+++..+++|+...+..++++...|+..++.++.....++++..++...+...
T Consensus 34 ~qL~~~~~~l~lLq~e~~~~e~~le~d~~~L~~Le~~~~~~~~e~~~~~~~~~~vL~~~ 92 (160)
T PF13094_consen 34 RQLAANLHQLELLQEEIEKEEAALERDYEYLQELEKNAKALEREREEEEKKAHPVLQLD 92 (160)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhccc
Confidence 34567778888999999998888888888888877777777777777777766665543
No 66
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=87.01 E-value=37 Score=36.95 Aligned_cols=31 Identities=26% Similarity=0.276 Sum_probs=11.3
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 309 ELNQGKFKLARMFERIDKEKAELERSISFLK 339 (434)
Q Consensus 309 EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~ 339 (434)
+|....+.+..+...|.....+....|..|+
T Consensus 196 el~~~~e~l~~E~~~L~~q~~e~~~ri~~LE 226 (546)
T PF07888_consen 196 ELTESSEELKEERESLKEQLAEARQRIRELE 226 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333333333333333
No 67
>KOG0421|consensus
Probab=86.98 E-value=0.64 Score=41.23 Aligned_cols=67 Identities=21% Similarity=0.459 Sum_probs=46.8
Q ss_pred eecCcccccceEEEecCCCCCCCCEe-EEecCCCceeccCCCcCCCCceecccccc-ccCCCCCHHHHHHHHHHHhCC
Q psy18020 3 NLSGLVYHIPVCIWLMDTHPDHAPIC-YVRPTADMRINVSQNVDHTGKIYSPYLHT-WTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 3 ~~~g~~y~iPi~iwl~~~yP~~~P~~-~v~pt~~m~i~~~~~vd~~G~~~~pyL~~-W~~~~s~L~~li~~l~~~F~~ 78 (434)
.|.|.+|.+ .+=+|.+||+.||.| |..|- =.++||.+|.|+|-.|.+ |.. -.|....+-.++...|+
T Consensus 72 vyegl~ykl--Sl~Fp~~YPy~pP~vkFltpc------~HPNVD~~GnIcLDILkdKWSa-~YdVrTILLSiQSLLGE 140 (175)
T KOG0421|consen 72 VYEGLKYKL--SLSFPNNYPYKPPTVKFLTPC------FHPNVDLSGNICLDILKDKWSA-VYDVRTILLSIQSLLGE 140 (175)
T ss_pred cccCcEEEE--EEecCCCCCCCCCeeEeeccc------cCCCccccccchHHHHHHHHHH-HHhHHHHHHHHHHHhCC
Confidence 477888875 556799999999986 33332 147899999999998865 654 45555555555555543
No 68
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=86.86 E-value=8.5 Score=35.99 Aligned_cols=58 Identities=22% Similarity=0.262 Sum_probs=39.3
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHh
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL-DEILAVL 352 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eL-e~~I~~l 352 (434)
.++..+..+...-.++...-+.|.+++..|+-..+.++..+..|+.+.++| ++|+...
T Consensus 127 ~L~~~~~~l~~~l~ek~k~~e~l~DE~~~L~l~~~~~e~k~~~l~~En~~Lv~Rwm~~k 185 (194)
T PF08614_consen 127 QLEEKIKDLEEELKEKNKANEILQDELQALQLQLNMLEEKLRKLEEENRELVERWMQRK 185 (194)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444455555556688888888888888888888888888777 6676643
No 69
>PF03148 Tektin: Tektin family; InterPro: IPR000435 Tektin heteropolymers form unique protofilaments of flagellar microtubules []. The proteins are predicted to form extended rods composed of 2 alpha- helical segments (~180 residues long) capable of forming coiled coils, interrupted by non-helical linkers []. The 2 segments are similar in sequence, indicating a gene duplication event. Along each tektin rod, cysteine residues occur with a periodicity of ~8nm, coincident with the axial repeat of tubulin dimers in microtubules []. It is proposed that the assembly of tektin heteropolymers produces filaments with repeats of 8, 16, 24, 32, 40, 48 and 96nm, generating the basis for the complex spatial arrangements of axonemal components [].; GO: 0000226 microtubule cytoskeleton organization, 0005874 microtubule
Probab=86.81 E-value=42 Score=34.87 Aligned_cols=41 Identities=15% Similarity=0.130 Sum_probs=26.7
Q ss_pred HHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 375 LNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 375 ~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
+.|+.|=.-|.++|..|.+.|..-.-.+..-++.--.|-++
T Consensus 320 ~~L~~Ev~~l~~~i~~L~~~L~~a~~~l~~L~~~~~~Le~d 360 (384)
T PF03148_consen 320 YGLIEEVKELRESIEALQEKLDEAEASLQKLERTRLRLEED 360 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445557788899888888877666555555554444444
No 70
>PF11180 DUF2968: Protein of unknown function (DUF2968); InterPro: IPR021350 This family of proteins has no known function.
Probab=86.60 E-value=16 Score=34.30 Aligned_cols=61 Identities=10% Similarity=0.217 Sum_probs=47.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 290 ~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.+...+++++++..++.+++...++...+++...|+.++...+..+..|+..+..|+.-.+
T Consensus 125 ~~ra~~LqaDl~~~~~Q~~~va~~Q~q~r~ea~aL~~e~~aaqaQL~~lQ~qv~~Lq~q~~ 185 (192)
T PF11180_consen 125 EARANRLQADLQIARQQQQQVAARQQQARQEAQALEAERRAAQAQLRQLQRQVRQLQRQAN 185 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4445566777777777777778888888888888888888888888888888888876544
No 71
>PF09738 DUF2051: Double stranded RNA binding protein (DUF2051); InterPro: IPR019139 This entry represents transcriptional repressors which preferentially bind to the GC-rich consensus sequence (5'-AGCCCCCGGCG-3') and may regulate expression of TNF, EGFR and PDGFA. They may control smooth muscle cell proliferation following artery injury through PDGFA repression and may also bind double-stranded RNA. They interact with the leucine-rich repeat domain of human flightless-I (FliI) protein.
Probab=86.47 E-value=39 Score=34.10 Aligned_cols=57 Identities=21% Similarity=0.308 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+++.++.+..+-++|........+.++.++...+.|...+..|++++.+.++.|++.
T Consensus 117 Lkd~lee~eE~~~~~~re~~eK~~elEr~K~~~d~L~~e~~~Lre~L~~rdeli~kh 173 (302)
T PF09738_consen 117 LKDKLEELEETLAQLQREYREKIRELERQKRAHDSLREELDELREQLKQRDELIEKH 173 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence 344444444444444444444455567777777778888888888888888888765
No 72
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=86.42 E-value=16 Score=32.64 Aligned_cols=56 Identities=27% Similarity=0.388 Sum_probs=35.5
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+++++.|...-..|......|...+..|..++..|.+.++..+.++.+|+.....+
T Consensus 51 k~eie~L~~el~~lt~el~~L~~EL~~l~sEk~~L~k~lq~~q~kv~eLE~~~~~~ 106 (140)
T PF10473_consen 51 KAEIETLEEELEELTSELNQLELELDTLRSEKENLDKELQKKQEKVSELESLNSSL 106 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH
Confidence 44555555555555555555666666777777777777777777777776665544
No 73
>PRK12704 phosphodiesterase; Provisional
Probab=86.26 E-value=21 Score=38.77 Aligned_cols=20 Identities=10% Similarity=0.240 Sum_probs=9.6
Q ss_pred CHHHHHHHHHHHHHHHHHHH
Q psy18020 401 DLEVYLKQVRSLSRKQFMFR 420 (434)
Q Consensus 401 ~ld~flK~vR~LaReQF~~R 420 (434)
+...++|..-.-++++--.+
T Consensus 170 ~~~~~~~~~~~~~~~~a~~~ 189 (520)
T PRK12704 170 EAAVLIKEIEEEAKEEADKK 189 (520)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34445555555555544433
No 74
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=86.20 E-value=5.7 Score=38.04 Aligned_cols=64 Identities=20% Similarity=0.284 Sum_probs=29.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFK---LARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~---L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
-|+.+|.+......+-++.+..++++++..++. |+.+...|+.+...+...+..|+++-++|+.
T Consensus 139 e~kekl~E~~~EkeeL~~eleele~e~ee~~erlk~le~E~s~LeE~~~~l~~ev~~L~~r~~ELe~ 205 (290)
T COG4026 139 ELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEP 205 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHhcc
Confidence 344444444444344444444444444444433 3334445555555555555555555555543
No 75
>PF04108 APG17: Autophagy protein Apg17 ; InterPro: IPR007240 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Autophagy protein 17 (Apg17) is required for activating Apg1 protein kinases. This entry also contains Autophagy protein 11 which is involved in cytoplasm to vacuole transport (Cvt) and pexophagy. ; GO: 0006914 autophagy
Probab=86.06 E-value=26 Score=36.81 Aligned_cols=57 Identities=12% Similarity=0.193 Sum_probs=42.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHH
Q psy18020 323 RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382 (434)
Q Consensus 323 eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~ 382 (434)
.-+..+..+...+..|+...++|++.+..| ....|+.+|.+...-.+-|+|-+.+|.
T Consensus 67 ~~~~~~~~f~~l~~~ld~~~~~L~~~l~~L---r~t~v~~~~~~~~~~~ktL~DFVd~~~ 123 (412)
T PF04108_consen 67 VYEWGQRDFKDLVKELDPADARLEQTLDML---RNTKVPPFFRPPGEEPKTLYDFVDEDS 123 (412)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCcCCccccCCCCCCCcHHHhcCHHH
Confidence 344466667777777888888888888888 455677777777777888888887764
No 76
>PF11887 DUF3407: Protein of unknown function (DUF3407); InterPro: IPR024516 This entry represents a domain of unknown function found at the C terminus of many proteins in the mammalian cell entry family.
Probab=85.93 E-value=14 Score=36.53 Aligned_cols=116 Identities=10% Similarity=0.082 Sum_probs=67.2
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHH
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQIL 375 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~ 375 (434)
+.+.++.+.++-.+|+...+.|.+.++.|....+.+......|.+-.+.+......+.++. -++|.++...+-+.+++-
T Consensus 33 lg~~l~~l~~~l~~ln~~~~~l~~~l~~l~~v~~~~a~aapdL~~~l~~~~~~s~tL~~~~-~~L~~lL~~~~~~a~~~~ 111 (267)
T PF11887_consen 33 LGETLDDLNTLLATLNPRLPQLREDLRNLADVADTYADAAPDLLDALDNLTTTSRTLVDQR-QQLDALLLSATGLADTGT 111 (267)
T ss_pred HHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHH
Confidence 3445555666666666666666666666666666666666666666666666555554333 377777777777777777
Q ss_pred HhhHHH-HhhhhHHHHHHH---HHhcCCCCHHHHHHHHHHH
Q psy18020 376 NTFTEE-AATEDTIYYMGE---ALRRGVIDLEVYLKQVRSL 412 (434)
Q Consensus 376 ~l~Aed-~AieDtiy~L~~---aL~~g~I~ld~flK~vR~L 412 (434)
+++.+. ..|..++..|.. +|.+-.=.|.-+|+....+
T Consensus 112 ~~l~~n~~~L~~~~~~L~p~~~lL~~~~p~l~c~L~~l~~~ 152 (267)
T PF11887_consen 112 DFLADNRDNLIRALDDLRPTTDLLAKYSPELPCTLQGLAPF 152 (267)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHhCchHHHHHHHHHHH
Confidence 666655 333333333332 3334444555555555433
No 77
>PRK14140 heat shock protein GrpE; Provisional
Probab=85.88 E-value=16 Score=34.46 Aligned_cols=91 Identities=14% Similarity=0.141 Sum_probs=48.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccc
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i 364 (434)
+..++.+...++.+--+.+.+.++++..-+...++++..+ .+..+++-+..|-.-.+.|++.+... .+..+.+.++
T Consensus 42 l~~~i~~l~~ei~elkd~~lR~~Ae~eN~rkR~~rE~~~~--~~~a~~~~~~~LLpvlDnLerAl~~~--~~~~~~~~i~ 117 (191)
T PRK14140 42 EQAKIAELEAKLDELEERYLRLQADFENYKRRIQKENEAA--EKYRAQSLASDLLPALDNFERALQIE--ADDEQTKSLL 117 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcc--CccchHHHHH
Confidence 5555544444444444556666666666555555554442 34445555666666666666666532 2222344445
Q ss_pred cccchHHHHHHHhhH
Q psy18020 365 TTTAPIYKQILNTFT 379 (434)
Q Consensus 365 ~~~~~l~~Ql~~l~A 379 (434)
.+-.-.++|+.+.+.
T Consensus 118 ~Gv~mi~k~l~~~L~ 132 (191)
T PRK14140 118 KGVEMVHRQLLEALK 132 (191)
T ss_pred HHHHHHHHHHHHHHH
Confidence 454555666655554
No 78
>TIGR00996 Mtu_fam_mce virulence factor Mce family protein. Members of this paralogous family are found as six tandem homologous proteins in the same orientation per cassette, in four separate cassettes in Mycobacterium tuberculosis. The six members of each cassette represent six subfamilies. One subfamily includes the protein mce (mycobacterial cell entry), a virulence protein required for invasion of non-phagocytic cells.
Probab=85.78 E-value=18 Score=35.61 Aligned_cols=97 Identities=13% Similarity=0.186 Sum_probs=54.4
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHH
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQIL 375 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~ 375 (434)
+.+.++.+..+-+.|......|.+.+..+....+.+.++-..+.+-.+.+...+..+.+ ..-++++++..-+.+.+++=
T Consensus 169 l~~~l~~l~~l~~~l~~~~~~i~~ll~~l~~l~~~l~~~~~~l~~~v~~l~~~~~~l~~-~~~~l~~~l~~l~~~~~~~~ 247 (291)
T TIGR00996 169 LRNLLDGLAQLTAALNARDGDIGALIDNLNRVLDVLADRSDQLDRLLDNLATLTAQLAD-RDDALDDALAALSGASAQVR 247 (291)
T ss_pred HHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHh-ChHHHHHHHHHHHHHHHHHH
Confidence 44445555555566666666666666666666666666666666666666666665543 22345555544455555555
Q ss_pred HhhHHH-HhhhhHHHHHHH
Q psy18020 376 NTFTEE-AATEDTIYYMGE 393 (434)
Q Consensus 376 ~l~Aed-~AieDtiy~L~~ 393 (434)
.++++. ..|++++..|..
T Consensus 248 ~~l~~~~~~l~~~l~~l~~ 266 (291)
T TIGR00996 248 DLLAENRPNLPQALANLAP 266 (291)
T ss_pred HHHHHhhhhHHHHHHHHHH
Confidence 555433 344444444443
No 79
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=85.77 E-value=31 Score=32.38 Aligned_cols=77 Identities=14% Similarity=0.188 Sum_probs=46.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
|+.....|+++...|-.....++++...|..-|.++..... .+....+.+-++.-+|..+-.++--=|+.+...+.
T Consensus 72 Lk~~~~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~----kl~~e~~~lk~~~~eL~~~~~~Lq~Ql~~~e~l~~ 147 (193)
T PF14662_consen 72 LKTLAKSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENG----KLLAERDGLKKRSKELATEKATLQRQLCEFESLIC 147 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----HHHHhhhhHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 66666666666666666666666666666666666632111 01122233667777777777777777777766665
Q ss_pred c
Q psy18020 397 R 397 (434)
Q Consensus 397 ~ 397 (434)
+
T Consensus 148 ~ 148 (193)
T PF14662_consen 148 Q 148 (193)
T ss_pred H
Confidence 3
No 80
>KOG1655|consensus
Probab=85.74 E-value=4.4 Score=38.02 Aligned_cols=72 Identities=17% Similarity=0.214 Sum_probs=44.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHH
Q psy18020 330 ELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQ 408 (434)
Q Consensus 330 ~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~ 408 (434)
.|+..+..|-+..+|+++.+... -+.+|||+ +.|-.-|=.|-.|+...+++-.+|..+...+.+..+.|++.
T Consensus 130 dlQDem~Dlmd~a~EiQE~Lgr~--y~~peide-----~dL~aELdaL~~E~d~~~~~~~~~~psyl~p~~~~~~~~de 201 (218)
T KOG1655|consen 130 DLQDEMEDLMDQADEIQEVLGRN--YNTPDIDE-----ADLDAELDALGQELDMLEEDENYLMPSYLAPANEPPAFIDE 201 (218)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhc--cCCCCcCH-----HHHHHHHHHHHhHhhcccccccccchhhhCCCCCCCCCccc
Confidence 34455555666667777777644 34444554 22555555566667777766677777777777777777654
No 81
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=85.72 E-value=32 Score=34.86 Aligned_cols=15 Identities=7% Similarity=-0.015 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHhcCC
Q psy18020 418 MFRALLFKCYKHISY 432 (434)
Q Consensus 418 ~~Ra~~~Ki~~~~~l 432 (434)
..++.++.+.+..|+
T Consensus 275 ~Lk~~~~~Le~l~g~ 289 (312)
T smart00787 275 KLKEQLKLLQSLTGW 289 (312)
T ss_pred HHHHHHHHHHHHhCC
Confidence 344444444444443
No 82
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=85.64 E-value=32 Score=32.34 Aligned_cols=79 Identities=24% Similarity=0.255 Sum_probs=55.4
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHH
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQI 374 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql 374 (434)
.+..+++.|+..-..|+....+|-..-..++.+...|...|..|+++..++..-.+.+.. +..++....+.+-+|+
T Consensus 64 ~l~eEledLk~~~~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~kl~~e~~~lk~----~~~eL~~~~~~Lq~Ql 139 (193)
T PF14662_consen 64 ALEEELEDLKTLAKSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENGKLLAERDGLKK----RSKELATEKATLQRQL 139 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhhhHHH----HHHHHHHhhHHHHHHH
Confidence 346667777777777777777787778888888888888888888888777766665532 3344444556677777
Q ss_pred HHh
Q psy18020 375 LNT 377 (434)
Q Consensus 375 ~~l 377 (434)
+..
T Consensus 140 ~~~ 142 (193)
T PF14662_consen 140 CEF 142 (193)
T ss_pred HHH
Confidence 543
No 83
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=85.57 E-value=11 Score=36.37 Aligned_cols=37 Identities=19% Similarity=0.301 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
|+..+..++.++.+.+..+.++.++...|...|..++
T Consensus 65 lE~iIkqa~~er~~~~~~i~r~~eey~~Lk~~in~~R 101 (230)
T PF10146_consen 65 LENIIKQAESERNKRQEKIQRLYEEYKPLKDEINELR 101 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6667777777777777777777777666666666553
No 84
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=85.56 E-value=28 Score=35.27 Aligned_cols=43 Identities=26% Similarity=0.237 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 300 IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~ 342 (434)
++.|++-...|....+.++..+.+|....+.|+..+..|++..
T Consensus 153 ~~~l~~D~~~L~~~~~~l~~~~~~l~~~~~~L~~e~~~L~~~~ 195 (312)
T smart00787 153 LEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLE 195 (312)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 3333334444444444444444444444444444444443333
No 85
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=85.49 E-value=38 Score=33.03 Aligned_cols=19 Identities=16% Similarity=0.034 Sum_probs=6.9
Q ss_pred HHHHHHhhHHHHhhhhHHH
Q psy18020 371 YKQILNTFTEEAATEDTIY 389 (434)
Q Consensus 371 ~~Ql~~l~Aed~AieDtiy 389 (434)
.+++-++.-+-.++++-++
T Consensus 123 ~~~i~~l~~~~~~~e~~~~ 141 (239)
T COG1579 123 EKEIEDLKERLERLEKNLA 141 (239)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333
No 86
>PRK00106 hypothetical protein; Provisional
Probab=85.38 E-value=31 Score=37.59 Aligned_cols=16 Identities=0% Similarity=0.019 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHH
Q psy18020 403 EVYLKQVRSLSRKQFM 418 (434)
Q Consensus 403 d~flK~vR~LaReQF~ 418 (434)
..++|..-.-+++.--
T Consensus 187 ~~~i~~~e~~a~~~a~ 202 (535)
T PRK00106 187 ATRIREAEREVKDRSD 202 (535)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344444444444433
No 87
>KOG0995|consensus
Probab=85.17 E-value=15 Score=39.84 Aligned_cols=49 Identities=18% Similarity=0.290 Sum_probs=26.2
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 302 ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 302 ~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.++.--..|....+.+...++.|..+..+-+..++.|+.+.++|...|+
T Consensus 277 K~~~y~~~~~~k~~~~~~~l~~l~~Eie~kEeE~e~lq~~~d~Lk~~Ie 325 (581)
T KOG0995|consen 277 KFQAYVSQMKSKKQHMEKKLEMLKSEIEEKEEEIEKLQKENDELKKQIE 325 (581)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344444444555555555555555555555566666666655554
No 88
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=84.98 E-value=21 Score=35.17 Aligned_cols=31 Identities=10% Similarity=0.270 Sum_probs=19.4
Q ss_pred cchHHHHHHHhhHHHHhhhhHHHHHHHHHhc
Q psy18020 367 TAPIYKQILNTFTEEAATEDTIYYMGEALRR 397 (434)
Q Consensus 367 ~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~ 397 (434)
+...|++.++++.+..--+++|-.|.+.++.
T Consensus 142 e~~~Y~~A~~l~~~~~~y~~Ai~af~~fl~~ 172 (263)
T PRK10803 142 ANTDYNAAIALVQDKSRQDDAIVAFQNFVKK 172 (263)
T ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence 3556777777766544455666666666653
No 89
>PF10046 BLOC1_2: Biogenesis of lysosome-related organelles complex-1 subunit 2 ; InterPro: IPR019269 This entry represents a family of proteins that play a role in cellular proliferation, as well as in the biogenesis of specialised organelles of the endosomal-lysosomal system [].
Probab=84.86 E-value=22 Score=29.69 Aligned_cols=23 Identities=30% Similarity=0.372 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 331 LERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 331 Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
++..+..|+.-..+|++|...++
T Consensus 71 Ie~~V~~LE~~v~~LD~ysk~LE 93 (99)
T PF10046_consen 71 IEEQVTELEQTVYELDEYSKELE 93 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33366667777777777776664
No 90
>PRK10884 SH3 domain-containing protein; Provisional
Probab=84.81 E-value=11 Score=35.79 Aligned_cols=15 Identities=33% Similarity=0.401 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHHHHH
Q psy18020 333 RSISFLKEKETELDE 347 (434)
Q Consensus 333 ~~i~~L~~~~~eLe~ 347 (434)
+.+..|+++.++|.+
T Consensus 132 ~~~~~L~~~n~~L~~ 146 (206)
T PRK10884 132 SVINGLKEENQKLKN 146 (206)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333333434444433
No 91
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=84.77 E-value=19 Score=28.94 Aligned_cols=48 Identities=23% Similarity=0.291 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHH-------HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 297 QEEIEILKQTENE-------LNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344 (434)
Q Consensus 297 ~~ele~L~~~q~E-------L~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~e 344 (434)
+-+++.|+.-... +..+.+.|.++...|+.++...+.++..|-.+.++
T Consensus 24 qmEieELKekn~~L~~e~~~~~~~r~~L~~en~qLk~E~~~WqerLr~LLGkm~~ 78 (79)
T PRK15422 24 QMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERLQALLGRMEE 78 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 4445555444444 44455668888888888888888877777666543
No 92
>COG0576 GrpE Molecular chaperone GrpE (heat shock protein) [Posttranslational modification, protein turnover, chaperones]
Probab=84.70 E-value=15 Score=34.64 Aligned_cols=59 Identities=17% Similarity=0.130 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 294 VQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 294 ~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
.++.+..+.+.+.+++++.-+..+++++++.. +..+++-+..|---++.+++.|.....
T Consensus 50 ~q~~e~~~~~lr~~Ae~eN~rkR~~re~e~~~--k~a~e~~~~dlLpviDnlerAl~~~~~ 108 (193)
T COG0576 50 AQLEELKDKYLRAQAEFENLRKRTEREREEAK--KYAIEKFAKDLLPVIDNLERALEAAED 108 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 33333445566666666666666666655544 556667777777777777777765433
No 93
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=84.69 E-value=34 Score=32.11 Aligned_cols=20 Identities=25% Similarity=0.197 Sum_probs=11.0
Q ss_pred HHHHHhhHHHHhhhhHHHHH
Q psy18020 372 KQILNTFTEEAATEDTIYYM 391 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDtiy~L 391 (434)
+.+..+-++..+|++.+...
T Consensus 159 ~ei~~lks~~~~l~~~~~~~ 178 (190)
T PF05266_consen 159 KEISRLKSEAEALKEEIENA 178 (190)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44455556666666555443
No 94
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=84.43 E-value=19 Score=34.92 Aligned_cols=31 Identities=13% Similarity=0.258 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 320 MFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 320 ~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+++.|..++....+.+...+.++..|+..|.
T Consensus 40 E~~~L~~Er~~h~eeLrqI~~DIn~lE~iIk 70 (230)
T PF10146_consen 40 EMEELLQERMAHVEELRQINQDINTLENIIK 70 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333444444444333
No 95
>PRK14161 heat shock protein GrpE; Provisional
Probab=84.31 E-value=24 Score=32.82 Aligned_cols=77 Identities=14% Similarity=0.190 Sum_probs=33.8
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhh
Q psy18020 300 IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
.+.+.++++|++.-+...++++.++ .+..+++-+..|-.-.+.|++.+......++..+..++.+-.-.++|+.+.+
T Consensus 39 kd~~lR~~AefeN~rkR~~ke~~~~--~~~a~~~~~~~LLpv~DnlerAl~~~~~~~~~~~~~~~~Gv~mi~k~l~~vL 115 (178)
T PRK14161 39 KDKLIRTTAEIDNTRKRLEKARDEA--KDYAIATFAKELLNVSDNLSRALAHKPANSDVEVTNIIAGVQMTKDELDKVF 115 (178)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhHHhHHHHHHhcCccccchhHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555444444433332 2333445555555556666666653322222222333434344445554444
No 96
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=84.30 E-value=14 Score=29.34 Aligned_cols=54 Identities=30% Similarity=0.343 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 294 VQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 294 ~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
.++..+-+.|...+..+...--+|+..+.+++.....+...+..+..+.+.++.
T Consensus 15 a~L~eEGekLSk~el~~~~~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~ 68 (74)
T PF12329_consen 15 AQLMEEGEKLSKKELKLNNTIKKLRAKIKELEKQIKELKKKLEELEKELESLEE 68 (74)
T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444433444444444444444444444444444444433
No 97
>PRK09039 hypothetical protein; Validated
Probab=84.29 E-value=42 Score=34.41 Aligned_cols=14 Identities=36% Similarity=0.506 Sum_probs=8.0
Q ss_pred HHHHH--HHHHHHHHH
Q psy18020 409 VRSLS--RKQFMFRAL 422 (434)
Q Consensus 409 vR~La--ReQF~~Ra~ 422 (434)
|.+|+ |.+|+.|..
T Consensus 189 ~~~l~~~~~~~~~~l~ 204 (343)
T PRK09039 189 VQELNRYRSEFFGRLR 204 (343)
T ss_pred HHHHHHhHHHHHHHHH
Confidence 44554 467776653
No 98
>PF10475 DUF2450: Protein of unknown function N-terminal domain (DUF2450) ; InterPro: IPR019515 This entry represents Vacuolar protein sorting-associated protein 54, and is thought to be involved in retrograde transport from early and late endosomes to late Golgi found in eukaryotes, but its function is not known.
Probab=84.23 E-value=47 Score=33.04 Aligned_cols=75 Identities=17% Similarity=0.307 Sum_probs=53.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH-HH--HHHHHHHHHHHHHHHHh
Q psy18020 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER-SI--SFLKEKETELDEILAVL 352 (434)
Q Consensus 278 ~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~-~i--~~L~~~~~eLe~~I~~l 352 (434)
+..|+.+|.+.+.+.....-.++..+.+++.+|...-..++...+.|+..++.+-. .+ -.+..+.+.+.+.+..+
T Consensus 44 ldvVe~~L~~~I~~~s~~f~~a~~~v~el~~~l~~a~~~~~~~R~~L~~~~~~~~~~~L~Il~~~rkr~~l~~ll~~L 121 (291)
T PF10475_consen 44 LDVVEKKLSREISEKSDSFFQAMSSVQELQDELEEALVICKNLRRNLKSADENLTKSGLEILRLQRKRQNLKKLLEKL 121 (291)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35666667777777777778888899999999999998888888888888877655 33 34444444444554444
No 99
>KOG4460|consensus
Probab=84.22 E-value=31 Score=37.35 Aligned_cols=37 Identities=16% Similarity=0.100 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
++++++.+++....|++++++..++.+.|++.+..|.
T Consensus 607 ~~eer~~i~e~a~~La~R~eea~e~qe~L~~~~~~L~ 643 (741)
T KOG4460|consen 607 CREERKSLREMAERLADRYEEAKEKQEDLMNRMKKLL 643 (741)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 5555556666666666666666666666766666664
No 100
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family. Members of this protein family are phage lysis regulatory protein, including the well-studied protein LysB (lysis protein B) of Enterobacteria phage P2. For members of this family, genes are found in phage or in prophage regions of bacterial genomes, typically near a phage lysozyme or phage holin.
Probab=84.19 E-value=20 Score=31.82 Aligned_cols=40 Identities=18% Similarity=0.216 Sum_probs=28.6
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 311 NQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 311 ~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+..+..|+++.......-...++.|++|..+.+.+..|-.
T Consensus 60 ~~~Q~~Lr~~~~~~~~~l~~re~~i~rL~~ENe~lR~Wa~ 99 (135)
T TIGR03495 60 EEAQAQLRQQLAQARALLAQREQRIERLKRENEDLRRWAD 99 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCHHHHHHhc
Confidence 4455557777777777777777777777777777777754
No 101
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=84.04 E-value=22 Score=39.93 Aligned_cols=52 Identities=17% Similarity=0.321 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhH-------HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 288 KMNEQKVQAQEEIEILKQTENELNQG-------KFKLARMFERIDKEKAELERSISFLK 339 (434)
Q Consensus 288 ~L~e~~~q~~~ele~L~~~q~EL~~~-------~~~L~~~~~eLe~~k~~Le~~i~~L~ 339 (434)
||...+.+++.|+...++...||+.. +..++.++..|+.+.+.|+..+..|.
T Consensus 422 rLE~dvkkLraeLq~~Rq~E~ELRsqis~l~~~Er~lk~eL~qlr~ene~Lq~Kl~~L~ 480 (697)
T PF09726_consen 422 RLEADVKKLRAELQSSRQSEQELRSQISSLTNNERSLKSELSQLRQENEQLQNKLQNLV 480 (697)
T ss_pred HHHHHHHHHHHHHHhhhhhHHHHHHHHhhccccchHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34333444444444444444444443 23344444444444444444444333
No 102
>KOG0994|consensus
Probab=84.02 E-value=58 Score=38.42 Aligned_cols=47 Identities=15% Similarity=0.083 Sum_probs=26.3
Q ss_pred hhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 383 ATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKH 429 (434)
Q Consensus 383 AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~ 429 (434)
.+.++..-..+||+.+....-.--+-++.-.+.+=.++-++.||...
T Consensus 1560 ~v~~~ae~V~eaL~~Ad~Aq~~a~~ai~~a~~~~~~a~~~l~kv~~~ 1606 (1758)
T KOG0994|consen 1560 DVKGQAEDVVEALEEADVAQGEAQDAIQGADRDIRLAQQLLAKVQEE 1606 (1758)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Confidence 34455555555555544444444455556666666666677776654
No 103
>PF10018 Med4: Vitamin-D-receptor interacting Mediator subunit 4; InterPro: IPR019258 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Members of this family represent the Med4 subunit of the Mediator (Med) complex [, ]. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=83.95 E-value=13 Score=34.54 Aligned_cols=18 Identities=22% Similarity=0.372 Sum_probs=9.9
Q ss_pred cCCCCHHHHHHHHHHHHH
Q psy18020 397 RGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 397 ~g~I~ld~flK~vR~LaR 414 (434)
++.|+.+.-|+..|.||+
T Consensus 78 ~~~v~~~eLL~YA~rISk 95 (188)
T PF10018_consen 78 KRPVDYEELLSYAHRISK 95 (188)
T ss_pred cCCCCHHHHHHHHHHHHH
Confidence 345555555555555554
No 104
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=83.91 E-value=19 Score=28.39 Aligned_cols=51 Identities=22% Similarity=0.272 Sum_probs=29.3
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
+..++++..++.++.. |+.....|..+...|+.....|+.+.....+-|..
T Consensus 14 ~~aveti~~Lq~e~ee----Lke~n~~L~~e~~~L~~en~~L~~e~~~~~~rl~~ 64 (72)
T PF06005_consen 14 QQAVETIALLQMENEE----LKEKNNELKEENEELKEENEQLKQERNAWQERLRS 64 (72)
T ss_dssp HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH----HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555544 66666666666666666666666665555554443
No 105
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=83.71 E-value=18 Score=35.09 Aligned_cols=55 Identities=25% Similarity=0.308 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 291 e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
+++..+.++++.+....+.|+...+++++.+..++.+...|++.+..++....++
T Consensus 49 ~e~~~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l 103 (251)
T PF11932_consen 49 DEKQELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQEL 103 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444444444455555555555555555555444444443
No 106
>PRK11519 tyrosine kinase; Provisional
Probab=83.68 E-value=52 Score=37.13 Aligned_cols=111 Identities=15% Similarity=0.169 Sum_probs=56.7
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHH
Q psy18020 300 IEILKQTENELNQGKFKLARMFE-----RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQI 374 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~-----eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql 374 (434)
-+++..++.+|+..+.+|++... .++.+...+-+.+..++++..+++..+..+.. .+....|...
T Consensus 273 ~~ql~~l~~~L~~aE~~l~~fr~~~~~vd~~~ea~~~l~~~~~l~~ql~~l~~~~~~l~~--------~y~~~hP~v~-- 342 (719)
T PRK11519 273 AQQLPEVRSRLDVAENKLNAFRQDKDSVDLPLEAKAVLDSMVNIDAQLNELTFKEAEISK--------LYTKEHPAYR-- 342 (719)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HhcccCcHHH--
Confidence 34566666666666666555544 23344444444455555555555554444421 1223333333
Q ss_pred HHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 375 LNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFK 425 (434)
Q Consensus 375 ~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~K 425 (434)
++.++..+|+.-+..+.+-+. ++..--++.+.|-|++=..+.+-.-
T Consensus 343 -~l~~~~~~L~~~~~~l~~~~~----~lp~~e~~~~~L~Re~~~~~~lY~~ 388 (719)
T PRK11519 343 -TLLEKRKALEDEKAKLNGRVT----AMPKTQQEIVRLTRDVESGQQVYMQ 388 (719)
T ss_pred -HHHHHHHHHHHHHHHHHHHHH----hccHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445555555544443332 3444556677777777666655443
No 107
>KOG4657|consensus
Probab=83.63 E-value=20 Score=34.52 Aligned_cols=38 Identities=21% Similarity=0.366 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
-.+++++..++++-+++..+++.++++.+...++|..-
T Consensus 89 ~~ieqeik~~q~elEvl~~n~Q~lkeE~dd~keiIs~k 126 (246)
T KOG4657|consen 89 MGIEQEIKATQSELEVLRRNLQLLKEEKDDSKEIISQK 126 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH
Confidence 44677777777777888888888888777776666543
No 108
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=83.57 E-value=32 Score=30.70 Aligned_cols=104 Identities=20% Similarity=0.313 Sum_probs=51.6
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHH
Q psy18020 302 ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381 (434)
Q Consensus 302 ~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed 381 (434)
.+...-.+|+.....++.++..|+.....|+..+..+.++..++...+....... .+.+ -+-+.+ .++=++
T Consensus 18 ~~e~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~lee~~~~~-~~~E-------~l~rri-q~LEee 88 (143)
T PF12718_consen 18 ELEAKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLEESEKRK-SNAE-------QLNRRI-QLLEEE 88 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH-HhHH-------HHHhhH-HHHHHH
Confidence 3333444444444556666666666666666666666666666666555432211 1111 122211 111111
Q ss_pred -HhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy18020 382 -AATEDTIYYMGEALRRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 382 -~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaR 414 (434)
...+..+.-..+-|+.-.+-.+.|-|+|..|--
T Consensus 89 le~ae~~L~e~~ekl~e~d~~ae~~eRkv~~le~ 122 (143)
T PF12718_consen 89 LEEAEKKLKETTEKLREADVKAEHFERKVKALEQ 122 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHh
Confidence 112223333333444445678888888887754
No 109
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=83.57 E-value=31 Score=41.12 Aligned_cols=39 Identities=10% Similarity=0.058 Sum_probs=21.0
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQ 408 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~ 408 (434)
+-.++-.+.+.-..+.|.+..+.+.+..-.-.+....+.
T Consensus 444 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (1163)
T COG1196 444 LNEELEELEEQLEELRDRLKELERELAELQEELQRLEKE 482 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555555566666666666666666655443333333333
No 110
>PF13747 DUF4164: Domain of unknown function (DUF4164)
Probab=83.55 E-value=23 Score=29.05 Aligned_cols=9 Identities=22% Similarity=0.685 Sum_probs=3.3
Q ss_pred HHHHHHHHH
Q psy18020 281 VEDKIRRKM 289 (434)
Q Consensus 281 l~dkL~~~L 289 (434)
|+..|.+++
T Consensus 20 LE~~v~~r~ 28 (89)
T PF13747_consen 20 LEKAVDRRL 28 (89)
T ss_pred HHHHHHHHH
Confidence 333333333
No 111
>COG1463 Ttg2C ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=83.51 E-value=16 Score=37.57 Aligned_cols=119 Identities=14% Similarity=0.189 Sum_probs=65.9
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~ 382 (434)
+.+..+.+......|.+.+..+++..+.|..+...+...++.+...+..+.+..+ ++|.++.....+.+++-+ .+.
T Consensus 164 l~~~~~~~~g~~~~l~~~l~~l~~~~~~ln~~~~~i~~~i~~l~~~~~~~~~~~~-~l~~~~~~l~~l~~~~~~---~~~ 239 (359)
T COG1463 164 LNEAAAALAGTGPQLNALLDNLAQFTDALNARDGDIGALIANLNQLLDSLAAASD-QLDRLLDNLATLTAALAA---RRD 239 (359)
T ss_pred HHHHHHHhcccchHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHhhHH-HHHHHHHHHHHHHHHHhh---chH
Confidence 4444555555555666666666666666666666666666666666665543222 455544333333333332 335
Q ss_pred hhhhHHHHHHHH-------HhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 383 ATEDTIYYMGEA-------LRRGVIDLEVYLKQVRSLSRKQFMFRALLFK 425 (434)
Q Consensus 383 AieDtiy~L~~a-------L~~g~I~ld~flK~vR~LaReQF~~Ra~~~K 425 (434)
+|.|++-.++.+ |.+.+-.|...+++.+.++..-.-.+.-+.+
T Consensus 240 ~l~~~l~~~~~~~~~~~~ll~~~r~~l~~~l~~l~~~~~~~~~~~~~~~~ 289 (359)
T COG1463 240 ALDDALAALSALAATVNDLLAENRPNLNQALANLRPLATLLVDYLPGLEQ 289 (359)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHhhHHHHHH
Confidence 666666666555 4455667777777766666555433333333
No 112
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=83.47 E-value=20 Score=28.14 Aligned_cols=44 Identities=27% Similarity=0.367 Sum_probs=25.9
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 309 ELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 309 EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
||+.....|..+..+++..++.|++.-+.|+.+....++.|..+
T Consensus 29 ELKEknn~l~~e~q~~q~~reaL~~eneqlk~e~~~WQerlrsL 72 (79)
T COG3074 29 ELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNGWQERLRAL 72 (79)
T ss_pred HHHHHhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444666666666666666666666666666555555544
No 113
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=83.27 E-value=19 Score=28.47 Aligned_cols=57 Identities=30% Similarity=0.430 Sum_probs=46.4
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.+.|++|..--..|....-++...+..|+.....++..+..|..+.++++.-+..++
T Consensus 11 De~Ia~L~eEGekLSk~el~~~~~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~ 67 (74)
T PF12329_consen 11 DEQIAQLMEEGEKLSKKELKLNNTIKKLRAKIKELEKQIKELKKKLEELEKELESLE 67 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556666666677778878888889999999999999999999888888888877764
No 114
>PRK12704 phosphodiesterase; Provisional
Probab=83.24 E-value=48 Score=35.99 Aligned_cols=39 Identities=15% Similarity=0.181 Sum_probs=16.2
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQV 409 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~v 409 (434)
+-++|++.+ ++.+-.|+--.+.+.-+.-+-..|..-|.+
T Consensus 155 a~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~~a~~i 193 (520)
T PRK12704 155 AKEILLEKV-EEEARHEAAVLIKEIEEEAKEEADKKAKEI 193 (520)
T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444433 334444444444444433333444444443
No 115
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF.
Probab=83.16 E-value=35 Score=37.01 Aligned_cols=21 Identities=10% Similarity=0.186 Sum_probs=11.0
Q ss_pred CHHHHHHHHHHHHHHHHHHHH
Q psy18020 401 DLEVYLKQVRSLSRKQFMFRA 421 (434)
Q Consensus 401 ~ld~flK~vR~LaReQF~~Ra 421 (434)
+...++|..-.-+++.--.+|
T Consensus 164 ~~~~~~~~~~~~~~~~a~~~a 184 (514)
T TIGR03319 164 EAAKLIKEIEEEAKEEADKKA 184 (514)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555555544443
No 116
>KOG0250|consensus
Probab=83.04 E-value=44 Score=38.97 Aligned_cols=111 Identities=18% Similarity=0.170 Sum_probs=57.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----CCCccccccccchHHHHHHHhhHHHHhhhhHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKT----EVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~----~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~ 392 (434)
++++..+++.+...++.++..++.+.+.+++.|..++... ..+.+++=---.-|..+.-++--....+.+-+.-+.
T Consensus 349 ~~re~~~~~~~~~~~~n~i~~~k~~~d~l~k~I~~~~~~~~~~~~~~~~e~e~k~~~L~~evek~e~~~~~L~~e~~~~~ 428 (1074)
T KOG0250|consen 349 LRREVNDLKEEIREIENSIRKLKKEVDRLEKQIADLEKQTNNELGSELEERENKLEQLKKEVEKLEEQINSLREELNEVK 428 (1074)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5556666666777778888888888888888777765322 112222100000122222222223334444444444
Q ss_pred HHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 393 EALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 393 ~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~ 427 (434)
+++..+.=..+.-....+.|.|.+=..-.+|+-+.
T Consensus 429 ~~~~~~~ee~~~i~~~i~~l~k~i~~~~~~l~~lk 463 (1074)
T KOG0250|consen 429 EKAKEEEEEKEHIEGEILQLRKKIENISEELKDLK 463 (1074)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45554444455555566666666655555555554
No 117
>KOG2264|consensus
Probab=82.96 E-value=11 Score=40.75 Aligned_cols=58 Identities=21% Similarity=0.205 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
..|+-.+..-+.||+..-++++..+++|+++-..-+.++..|+.++++.+...+++..
T Consensus 92 s~EL~ele~krqel~seI~~~n~kiEelk~~i~~~q~eL~~Lk~~ieqaq~~~~El~~ 149 (907)
T KOG2264|consen 92 SLELTELEVKRQELNSEIEEINTKIEELKRLIPQKQLELSALKGEIEQAQRQLEELRE 149 (907)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhHHHHHHHHhHHHHHHHHHHHHHh
Confidence 3444444444445555444455555555554444445555566666666665555543
No 118
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=82.75 E-value=58 Score=33.00 Aligned_cols=13 Identities=15% Similarity=0.378 Sum_probs=6.5
Q ss_pred hhHHHHHHHhhhh
Q psy18020 135 KLEGLIKTLASAF 147 (434)
Q Consensus 135 ~l~~l~~~~~~~f 147 (434)
+|.++|..-+..|
T Consensus 15 sL~~FL~~~~I~F 27 (325)
T PF08317_consen 15 SLQDFLNMTGIRF 27 (325)
T ss_pred CHHHHHHHhCcee
Confidence 3445555555555
No 119
>smart00503 SynN Syntaxin N-terminal domain. Three-helix domain that (in Sso1p) slows the rate of its reaction with the SNAP-25 homologue Sec9p
Probab=82.71 E-value=19 Score=30.00 Aligned_cols=64 Identities=13% Similarity=0.147 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 288 ~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
.+...+.++...+++|..++..+.............|+...+.+......+...+..|++....
T Consensus 12 ~I~~~I~~i~~~v~~l~~l~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~i~~~lk~l~~~~~~ 75 (117)
T smart00503 12 EIRANIQKISQNVAELQKLHEELLTPPDADKELREKLERLIDDIKRLAKEIRAKLKELEKENLE 75 (117)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHh
Confidence 3344445555666677777766665433223344555556666666666666666666665543
No 120
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=82.64 E-value=21 Score=30.21 Aligned_cols=68 Identities=18% Similarity=0.206 Sum_probs=41.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI--DKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eL--e~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+...+...+ ..+++++.+.+..+++..+-.+++..++.| ...-..|+-.+.+++.+.++++.-++.+
T Consensus 22 ~~~~~l~~~~-a~~~~~~~l~~~~~~~~~Rl~~lE~~l~~LPt~~dv~~L~l~l~el~G~~~~l~~~l~~v 91 (106)
T PF10805_consen 22 IFWLWLRRTY-AKREDIEKLEERLDEHDRRLQALETKLEHLPTRDDVHDLQLELAELRGELKELSARLQGV 91 (106)
T ss_pred HHHHHHHHhh-ccHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 3344443333 235666666666666666666676666666 6666667777777777777766666543
No 121
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=82.57 E-value=21 Score=40.75 Aligned_cols=64 Identities=27% Similarity=0.354 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 281 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344 (434)
Q Consensus 281 l~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~e 344 (434)
|.++-+..+.+....+...++.|.+.+.+++...+.++..+.++++.+.+++...+.|+++.++
T Consensus 503 ii~~A~~~~~~~~~~~~~li~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~ 566 (782)
T PRK00409 503 IIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDK 566 (782)
T ss_pred HHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334344444444444455555555555555555555555555555555555555554444333
No 122
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=82.48 E-value=9.8 Score=40.43 Aligned_cols=60 Identities=22% Similarity=0.329 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 294 VQAQEEIEILKQTENELNQGKFKLARM-FERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 294 ~q~~~ele~L~~~q~EL~~~~~~L~~~-~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
+.+..+.+.|++..+.|.+++..++.. ...|+.++.+++++.+.|+++..++...|..|.
T Consensus 76 ~~l~~~N~~l~~eN~~L~~r~~~id~~i~~av~~~~~~~~~~~~ql~~~~~~~~~~l~~l~ 136 (472)
T TIGR03752 76 AKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQLQGLIDQLQ 136 (472)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455555666666666666555333 355666677777777777777777766666664
No 123
>cd07667 BAR_SNX30 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 30. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. The specific function of SNX30 is still unknown. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=82.43 E-value=28 Score=33.97 Aligned_cols=116 Identities=11% Similarity=0.115 Sum_probs=57.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHh--
Q psy18020 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE--------TELDEILAVL-- 352 (434)
Q Consensus 283 dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~--------~eLe~~I~~l-- 352 (434)
+++..|+......+..+.+.+...-..|..-+..|..-+..+...-+.|...+..|.+.+ .++...+..+
T Consensus 71 eki~~Rv~kr~~~l~~d~~e~~~~f~~ws~lE~~l~~~L~~~a~~~~~~s~~l~~l~~~~~~~yl~~Lke~~~Y~~slk~ 150 (240)
T cd07667 71 DRIAQRIIKEEIEYLVELREYGPVYSTWSGLEGELAEPLEGVSACIGNCSTALEELTEDMTEDFLPVLREYILYSESMKN 150 (240)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH
Confidence 455555544333344444444444444444444444444444444444444444444333 2222222222
Q ss_pred ----cCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC
Q psy18020 353 ----TEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRG 398 (434)
Q Consensus 353 ----~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g 398 (434)
++....|.++++...+.--..+=.|-++-..++|.+.++++.+++.
T Consensus 151 vlK~RdqkQ~d~E~l~E~l~~rre~~~kLe~~ie~~~~~ve~f~~~~~~E 200 (240)
T cd07667 151 VLKKRDQVQAEYEAKLEAVALRKEERPKVPTDVEKCQDRVECFNADLKAD 200 (240)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2345567777655443334455566666677777776666666654
No 124
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=82.26 E-value=38 Score=36.31 Aligned_cols=26 Identities=19% Similarity=0.139 Sum_probs=10.4
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
|..||-.|..-...|..--.-|.+||
T Consensus 170 L~~qi~~L~~~n~~i~~ea~nLt~AL 195 (475)
T PRK10361 170 LAHEIRNLQQLNAQMAQEAINLTRAL 195 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444333333333333344444
No 125
>PF15079 DUF4546: Domain of unknown function (DUF4546)
Probab=82.25 E-value=8.7 Score=35.13 Aligned_cols=56 Identities=16% Similarity=0.369 Sum_probs=43.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338 (434)
Q Consensus 283 dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L 338 (434)
..|++.|+|-+..+.+.||.+.+.+.-+...+.+|...++-+.+.+...++-+..|
T Consensus 50 ~eLkNeLREVREELkEKmeEIKQIKdiMDKDFDKL~EFVEIMKeMQkDMDEKMDvL 105 (205)
T PF15079_consen 50 QELKNELREVREELKEKMEEIKQIKDIMDKDFDKLHEFVEIMKEMQKDMDEKMDVL 105 (205)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhHHHhhhHH
Confidence 35777888888888888899999999999999888888877777666655544433
No 126
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=82.23 E-value=37 Score=31.02 Aligned_cols=101 Identities=13% Similarity=0.128 Sum_probs=49.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEA 363 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~ 363 (434)
.|...-.+.+.-.+..-+.+..++.||.. ++..+.+.-.+-+.|+..-.....+..++.+..... .+-||-++
T Consensus 10 ~ie~sK~qIf~I~E~~R~E~~~l~~EL~e----vk~~v~~~I~evD~Le~~er~aR~rL~eVS~~f~~y---sE~dik~A 82 (159)
T PF05384_consen 10 TIESSKEQIFEIAEQARQEYERLRKELEE----VKEEVSEVIEEVDKLEKRERQARQRLAEVSRNFDRY---SEEDIKEA 82 (159)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc---CHHHHHHH
Confidence 33333333333333333344445555555 444444444455555555555555555555544433 33477776
Q ss_pred ccccchHHHHHHHhhHHHHhhhhHHHHH
Q psy18020 364 VTTTAPIYKQILNTFTEEAATEDTIYYM 391 (434)
Q Consensus 364 i~~~~~l~~Ql~~l~Aed~AieDtiy~L 391 (434)
...+.-+.-+|.-+-.++.-+-.-=..|
T Consensus 83 Ye~A~~lQ~~L~~~re~E~qLr~rRD~L 110 (159)
T PF05384_consen 83 YEEAHELQVRLAMLREREKQLRERRDEL 110 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6666666555555555554443333333
No 127
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=82.02 E-value=12 Score=31.31 Aligned_cols=56 Identities=36% Similarity=0.414 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLAR---MFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~---~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
++..+++.|++.++++...-.++.. ...+|..+...+...+..++.+..++++-+.
T Consensus 40 ~l~~~~e~lr~~rN~~sk~I~~~~~~~~~~~~l~~e~~~lk~~i~~le~~~~~~e~~l~ 98 (108)
T PF02403_consen 40 ELQQELEELRAERNELSKEIGKLKKAGEDAEELKAEVKELKEEIKELEEQLKELEEELN 98 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHCHTTCCTHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHhhCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555555444333322 2344555555555555555555555554443
No 128
>PRK14143 heat shock protein GrpE; Provisional
Probab=81.92 E-value=21 Score=34.75 Aligned_cols=53 Identities=9% Similarity=0.004 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 298 ~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
...+++.++++++..-+.+.++++.++.. ..+++.+..|-.-.+.|++.+..+
T Consensus 85 elkd~~lR~~AdfeN~RKR~~kE~e~~~~--~a~~~~~~~lLpV~DnLerAl~~~ 137 (238)
T PRK14143 85 ELNSQYMRIAADFDNFRKRTSREQEDLRL--QLKCNTLSEILPVVDNFERARQQL 137 (238)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhHHHHHHhcc
Confidence 33445566666666655555555544332 445555666666666666666543
No 129
>PF07464 ApoLp-III: Apolipophorin-III precursor (apoLp-III); InterPro: IPR010009 This family consists of several insect apolipoprotein-III sequences. Exchangeable apolipoproteins constitute a functionally important family of proteins that play critical roles in lipid transport and lipoprotein metabolism. Apolipophorin III (apoLp-III) is a prototypical exchangeable apolipoprotein found in many insect species that functions in transport of diacylglycerol (DAG) from the fat body lipid storage depot to flight muscles in the adult life stage [].; GO: 0008289 lipid binding, 0006869 lipid transport, 0005576 extracellular region; PDB: 1EQ1_A.
Probab=81.91 E-value=15 Score=33.31 Aligned_cols=83 Identities=18% Similarity=0.228 Sum_probs=49.3
Q ss_pred HHHHHHHHHhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhh
Q psy18020 303 LKQTENELNQGKFKLARMFERI----DKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eL----e~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
+..++..|....++|+....++ ...++.|+..++.|-.+.+.+.+-|.....+-.-++.. +.+|+||-+
T Consensus 65 l~~~~~kl~et~~~L~k~~Pev~~qa~~l~e~lQ~~vq~l~~E~qk~~k~v~~~~~~~~e~l~~-------~~K~~~D~~ 137 (155)
T PF07464_consen 65 LKQLKTKLEETAEKLRKANPEVEKQANELQEKLQSAVQSLVQESQKLAKEVSENSEGANEKLQP-------AIKQAYDDA 137 (155)
T ss_dssp HHHHHHHHHHHHHGGGG-SHHHHHT-SSSHHHHHHHHHHHHHHHHHHHHHHHS---SS-GGGHH-------HHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHH-------HHHHHHHHH
Confidence 4555556666555555443333 33567777788888888888887777554444446666 788888876
Q ss_pred HHHHhhhhHHHHHHHH
Q psy18020 379 TEEAATEDTIYYMGEA 394 (434)
Q Consensus 379 Aed~AieDtiy~L~~a 394 (434)
.+ +.+++...|..|
T Consensus 138 ~k--~~~~~~~~l~~a 151 (155)
T PF07464_consen 138 VK--AAQKVQKQLHEA 151 (155)
T ss_dssp HH--HHHHHHHHHHHH
T ss_pred HH--HHHHHHHHHHHh
Confidence 54 345555555444
No 130
>PHA02562 46 endonuclease subunit; Provisional
Probab=81.77 E-value=80 Score=33.99 Aligned_cols=15 Identities=13% Similarity=0.246 Sum_probs=6.8
Q ss_pred CCCHHHHHHHHHHHh
Q psy18020 62 NSKLEGLIKTLASAF 76 (434)
Q Consensus 62 ~s~L~~li~~l~~~F 76 (434)
++-=..++..+.-+|
T Consensus 37 G~GKStll~aI~~~l 51 (562)
T PHA02562 37 GAGKSTMLEALTFAL 51 (562)
T ss_pred CCCHHHHHHHHHHHH
Confidence 444444454444333
No 131
>KOG4360|consensus
Probab=81.77 E-value=52 Score=35.39 Aligned_cols=31 Identities=26% Similarity=0.226 Sum_probs=25.9
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 398 GVIDLEVYLKQVRSLSRKQFMFRALLFKCYK 428 (434)
Q Consensus 398 g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~ 428 (434)
-+|..+.|=..+-.++|.||-.=-+-++|+.
T Consensus 380 l~i~~~ny~~~~~tv~~~~v~tp~lqq~~n~ 410 (596)
T KOG4360|consen 380 LVIPGSNYSSQQQTVARLQVQTPKLQQRCNM 410 (596)
T ss_pred ccCCcchhhhhhhhhhhhhccchhHHhHHHH
Confidence 4789999999999999999987767777764
No 132
>PRK14143 heat shock protein GrpE; Provisional
Probab=81.72 E-value=32 Score=33.52 Aligned_cols=31 Identities=13% Similarity=0.100 Sum_probs=15.0
Q ss_pred hHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHH
Q psy18020 386 DTIYYMGEALRRGVI---DLEVYLKQVRSLSRKQF 417 (434)
Q Consensus 386 Dtiy~L~~aL~~g~I---~ld~flK~vR~LaReQF 417 (434)
+.+.-|.|||..... +.+.|.+.+..+ ++||
T Consensus 125 pV~DnLerAl~~~~~~~~~~~~l~~Gve~i-~k~l 158 (238)
T PRK14143 125 PVVDNFERARQQLKPEGEEAQALHRSYQGL-YKQL 158 (238)
T ss_pred HHHhHHHHHHhcccccchhHHHHHHHHHHH-HHHH
Confidence 445556666654322 234555555544 3344
No 133
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=81.61 E-value=30 Score=32.56 Aligned_cols=24 Identities=8% Similarity=0.097 Sum_probs=9.6
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGE 393 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~ 393 (434)
+..+|=.+-++=.+-+.-|--|.+
T Consensus 123 L~~kL~~~~~~l~~~~~ki~~Lek 146 (194)
T PF15619_consen 123 LQRKLSQLEQKLQEKEKKIQELEK 146 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444433333333333333
No 134
>PF14817 HAUS5: HAUS augmin-like complex subunit 5
Probab=81.59 E-value=58 Score=36.24 Aligned_cols=77 Identities=17% Similarity=0.182 Sum_probs=41.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLAR---MFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 276 sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~---~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.+.|+ +|+..+.+....++..+..+.....+.+...+++.+ ..-=|++....|+....+|.+-...|...++.+
T Consensus 83 ~L~~eve-rLraei~~l~~~I~~~e~e~~~~e~~~~q~~~~~~~~~~k~~LL~Ay~q~c~~~~~~l~e~~~rl~~~~~~~ 161 (632)
T PF14817_consen 83 ELEKEVE-RLRAEIQELDKEIESREREVSRQEASREQMLDKISDSRHKQLLLEAYSQQCEEQRRILREYTKRLQGQVEQL 161 (632)
T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334343 666666555554444444444444444433333222 223356666777777777777777777766665
Q ss_pred c
Q psy18020 353 T 353 (434)
Q Consensus 353 ~ 353 (434)
.
T Consensus 162 q 162 (632)
T PF14817_consen 162 Q 162 (632)
T ss_pred H
Confidence 4
No 135
>PRK10698 phage shock protein PspA; Provisional
Probab=81.55 E-value=39 Score=32.46 Aligned_cols=70 Identities=19% Similarity=0.229 Sum_probs=43.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 278 ~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
.+.+.+.++..+++.+...++=.+.|...-.++...-.+++..+...-..+..++..+..+....++.++
T Consensus 4 f~Rl~~ii~a~in~~ldkaEDP~k~l~q~i~em~~~l~~~r~alA~~~A~~k~~er~~~~~~~~~~~~e~ 73 (222)
T PRK10698 4 FSRFADIVNANINALLEKAEDPQKLVRLMIQEMEDTLVEVRSTSARALAEKKQLTRRIEQAEAQQVEWQE 73 (222)
T ss_pred HHHHHHHHHhHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556666666666655544445555555555555557777777777777777776666666555544
No 136
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=81.54 E-value=28 Score=42.03 Aligned_cols=120 Identities=11% Similarity=0.118 Sum_probs=67.0
Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHhcC-CCCCCccccccccchHHHHH
Q psy18020 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL--KEKETELDEILAVLTE-KTEVDVDEAVTTTAPIYKQI 374 (434)
Q Consensus 298 ~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L--~~~~~eLe~~I~~l~~-k~~~diDe~i~~~~~l~~Ql 374 (434)
..++.+...+.++...-..++..+..++..+..+..++..+ ..++.+++.-|..++. ..+.|.+++-.--.-+..++
T Consensus 984 ~~ie~le~e~~~l~~~i~~l~kel~~~~~~kr~l~dnL~~~~~~~~l~el~~eI~~l~~~~~~~~~~~~~~e~~~l~~~~ 1063 (1311)
T TIGR00606 984 AQLEECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENI 1063 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHH
Confidence 33444444455555555557777788888888888888887 6667777777777652 23345554433333344444
Q ss_pred HHhh-------HHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 375 LNTF-------TEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALL 423 (434)
Q Consensus 375 ~~l~-------Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~ 423 (434)
=.+. .+-..+++-|..|.+-|+. .=.|.+..=.|++++...+.
T Consensus 1064 ~~l~~~~a~l~g~~k~le~qi~~l~~eL~e------~~yk~a~~ryrka~i~~~~~ 1113 (1311)
T TIGR00606 1064 DLIKRNHVLALGRQKGYEKEIKHFKKELRE------PQFRDAEEKYREMMIVMRTT 1113 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcc------hHHHHHHHHHHHHHHHHHHH
Confidence 3333 4444566666666666643 22344444445555444333
No 137
>PF08376 NIT: Nitrate and nitrite sensing; InterPro: IPR013587 The nitrate and nitrite-sensing (NIT) domain is a (~250 aa) sensor domain found in various receptor components of signal transduction pathways from different bacterial lineages []. The NIT domain is predicted to be all alpha-helical in structure []. Proteins containing a NIT domain belong to one of four known classes of prokaryotic signal transduction proteins: intracellular transcription anti-termination regulators, sensor histidine kinases, methyl-accepting chemotaxis proteins, diguanylate cyclases/phosphodiesterases. NIT-containing receptors regulate cellular functions such as gene expression (transcription anti-terminators and histidine kinases), cell motility (chemotaxis receptors), and enzyme activity (diguanylate cyclases/phosphodiesterases), in response to changes in nitrate and/or nitrite concentrations. The NIT domain is found as both an extracellular and an intracellular sensor. The NIT domain can be found in combination with other signalling domains, such as ANTAR, HAMP (IPR003660 from INTERPRO), MCP, Hemerythrins (IPR002063 from INTERPRO), CHASE (IPR006189 from INTERPRO), GGDEF (IPR000160 from INTERPRO), PAS (IPR000014 from INTERPRO), EAL (IPR001633 from INTERPRO), HK (IPR005467 from INTERPRO), GAF, REC and Hpt (IPR008207 from INTERPRO).; PDB: 4AKK_A.
Probab=81.26 E-value=48 Score=31.11 Aligned_cols=88 Identities=15% Similarity=0.210 Sum_probs=51.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHH
Q psy18020 325 DKEKAELERSISFLKEKETELDEILAVLTEK-TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLE 403 (434)
Q Consensus 325 e~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k-~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld 403 (434)
....-.....+..+..-++.|..++..+... .+.++...+. .-..+-...|-..+|+++ +..+|.+|.++-+
T Consensus 82 d~~~~~~~~~~~~Y~~~i~~ll~~~~~l~~~~~d~~l~~~~~-----a~~~l~~a~E~~~~era~--~~~~l~~~~~~~~ 154 (247)
T PF08376_consen 82 DNRSIDPDEAFDAYTELIDSLLDLIDALAQQSDDPELARQLR-----ALTALLRAKEYAGQERAL--LAGALAAGRLSPE 154 (247)
T ss_dssp HHT-S-HHHHHHHHHHHHHHHHTHHHHHHCC---HHHHHHHH-----HHHHHHHHHHHHHHHHHH--HHHHHHHT---HH
T ss_pred hcCCCChHHHHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHH-----HHHHHHHHHHHHHHHHHH--HHHHHhhcCCCHH
Confidence 3333344455666676777777777766443 3334433221 222234466777788874 6778899999999
Q ss_pred HHHHHHHHHHHHHHHH
Q psy18020 404 VYLKQVRSLSRKQFMF 419 (434)
Q Consensus 404 ~flK~vR~LaReQF~~ 419 (434)
.|.+-++..++.+...
T Consensus 155 ~~~~~~~~~~~~~~~l 170 (247)
T PF08376_consen 155 ELRQFASLIARQRAAL 170 (247)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 9999888888765544
No 138
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=81.10 E-value=15 Score=32.29 Aligned_cols=47 Identities=13% Similarity=0.140 Sum_probs=25.5
Q ss_pred hHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 369 PIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 369 ~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
.+-.-+=.+-.+-..+++.+.-|.+.|..-.=.++...+.+..|+.+
T Consensus 91 ~~~eA~~~l~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~~l~~l~~~ 137 (140)
T PRK03947 91 DLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQQE 137 (140)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444555566666666666666654433455555555555443
No 139
>KOG0982|consensus
Probab=81.04 E-value=20 Score=37.49 Aligned_cols=97 Identities=16% Similarity=0.127 Sum_probs=45.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHH
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEA 394 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~a 394 (434)
+.|+.+..+|+.....|+..+.+|.++...+...|+.++.....+-+.-+.-.+.|.+-.-+.-|-..-|+|.+.-|.++
T Consensus 307 qqleeentelRs~~arlksl~dklaee~qr~sd~LE~lrlql~~eq~l~~rm~d~Lrrfq~ekeatqELieelrkelehl 386 (502)
T KOG0982|consen 307 QQLEEENTELRSLIARLKSLADKLAEEDQRSSDLLEALRLQLICEQKLRVRMNDILRRFQEEKEATQELIEELRKELEHL 386 (502)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Confidence 33555556666666666666666666655555555555433333333333233333332222223333444444433332
Q ss_pred H---------hcC-----CCCHHHHHHHHHH
Q psy18020 395 L---------RRG-----VIDLEVYLKQVRS 411 (434)
Q Consensus 395 L---------~~g-----~I~ld~flK~vR~ 411 (434)
= .+| .|.|+.++|..|.
T Consensus 387 r~~kl~~a~p~rgrsSaRe~eleqevkrLrq 417 (502)
T KOG0982|consen 387 RRRKLVLANPVRGRSSAREIELEQEVKRLRQ 417 (502)
T ss_pred HHHHHHhhccccCchhHHHHHHHHHHHHhcc
Confidence 1 111 3456777776664
No 140
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=80.88 E-value=95 Score=34.34 Aligned_cols=26 Identities=4% Similarity=-0.068 Sum_probs=14.9
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
-.+.|.+|..+......-+----+.|
T Consensus 406 s~~rl~~L~~qWe~~R~pL~~e~r~l 431 (594)
T PF05667_consen 406 SEQRLVELAQQWEKHRAPLIEEYRRL 431 (594)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 45677777777766554443333333
No 141
>PRK09039 hypothetical protein; Validated
Probab=80.84 E-value=52 Score=33.71 Aligned_cols=27 Identities=19% Similarity=0.046 Sum_probs=13.9
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
.+.|+-.|-++-.|++.-|..|..+|.
T Consensus 135 ~~~~V~~L~~qI~aLr~Qla~le~~L~ 161 (343)
T PRK09039 135 ALAQVELLNQQIAALRRQLAALEAALD 161 (343)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555555555555555555555543
No 142
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=80.83 E-value=41 Score=31.78 Aligned_cols=22 Identities=9% Similarity=0.200 Sum_probs=8.7
Q ss_pred HHHHHHhhHHHHhhhhHHHHHH
Q psy18020 371 YKQILNTFTEEAATEDTIYYMG 392 (434)
Q Consensus 371 ~~Ql~~l~Aed~AieDtiy~L~ 392 (434)
+..+=+-+.+-.|--++...|.
T Consensus 167 ~er~e~ki~~~ea~a~a~~el~ 188 (221)
T PF04012_consen 167 FERMEEKIEEMEARAEASAELA 188 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc
Confidence 3334344444344444444333
No 143
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=80.65 E-value=26 Score=27.53 Aligned_cols=34 Identities=18% Similarity=0.198 Sum_probs=16.8
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 309 ELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342 (434)
Q Consensus 309 EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~ 342 (434)
++....+.|+++-+.|+.+....+..+..|-.+.
T Consensus 43 ~~q~~reaL~~eneqlk~e~~~WQerlrsLLGkm 76 (79)
T COG3074 43 NAQHQREALERENEQLKEEQNGWQERLRALLGKM 76 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3334444455555555555555555555444433
No 144
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=80.60 E-value=46 Score=31.13 Aligned_cols=34 Identities=32% Similarity=0.474 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 316 KLARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 316 ~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
.++..+..|+.+...|+..+..|..+.+.+++..
T Consensus 124 ~l~~~i~~L~~e~~~L~~~~~~l~~~~e~~ek~~ 157 (189)
T PF10211_consen 124 ELEEEIEELEEEKEELEKQVQELKNKCEQLEKRE 157 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444555555555555544444444444433
No 145
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=80.42 E-value=43 Score=33.12 Aligned_cols=56 Identities=29% Similarity=0.349 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.++.+++.|...-.++......++..+.+++.+..+++..|..+++.+.+-++.+.
T Consensus 49 ~~q~ei~~L~~qi~~~~~k~~~~~~~i~~~~~eik~l~~eI~~~~~~I~~r~~~l~ 104 (265)
T COG3883 49 NIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLK 104 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445554444444444444444455555555555555555555554444444443
No 146
>KOG0993|consensus
Probab=80.21 E-value=46 Score=34.82 Aligned_cols=108 Identities=14% Similarity=0.154 Sum_probs=65.7
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~ 382 (434)
|..+++++...+..+...+.+|-.-...+.....+|++..+-+... ..++..-.-.-+..+-++.+...|+|-+..+|.
T Consensus 347 l~tlq~~iSqaq~~vq~qma~lv~a~e~i~~e~~rl~q~nd~l~~~-~~l~t~~Qq~e~~~lp~ave~l~ql~~~~r~~~ 425 (542)
T KOG0993|consen 347 LVTLQAEISQAQSEVQKQMARLVVASETIADEDSRLRQINDLLTTV-GELETQVQQAEVQNLPAAVEQLAQLYKQRRTSL 425 (542)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-cccccchhHhhhhcchhhHHHHHHHHHHHHHHH
Confidence 6677888888888888888888877777777777777665554321 111110000122334455667788887776664
Q ss_pred -hhhhHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy18020 383 -ATEDTIYYMGEALRRGVIDLEVYLKQVRS 411 (434)
Q Consensus 383 -AieDtiy~L~~aL~~g~I~ld~flK~vR~ 411 (434)
+--||-.|..+-|....++|-.-|...|.
T Consensus 426 ~~~l~a~ehv~e~l~~ei~~L~eqle~e~~ 455 (542)
T KOG0993|consen 426 QQELDASEHVQEDLVKEIQSLQEQLEKERQ 455 (542)
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34466666666666666666555555543
No 147
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=80.15 E-value=60 Score=38.07 Aligned_cols=9 Identities=22% Similarity=0.246 Sum_probs=4.4
Q ss_pred CCCHHHHHH
Q psy18020 62 NSKLEGLIK 70 (434)
Q Consensus 62 ~s~L~~li~ 70 (434)
+|||++.|.
T Consensus 36 KS~ildAi~ 44 (1164)
T TIGR02169 36 KSNIGDAIL 44 (1164)
T ss_pred HHHHHHHHH
Confidence 455555443
No 148
>PF10046 BLOC1_2: Biogenesis of lysosome-related organelles complex-1 subunit 2 ; InterPro: IPR019269 This entry represents a family of proteins that play a role in cellular proliferation, as well as in the biogenesis of specialised organelles of the endosomal-lysosomal system [].
Probab=80.15 E-value=33 Score=28.54 Aligned_cols=21 Identities=33% Similarity=0.460 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy18020 327 EKAELERSISFLKEKETELDE 347 (434)
Q Consensus 327 ~k~~Le~~i~~L~~~~~eLe~ 347 (434)
.-.+|+..+..|++-..+|+.
T Consensus 74 ~V~~LE~~v~~LD~ysk~LE~ 94 (99)
T PF10046_consen 74 QVTELEQTVYELDEYSKELES 94 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444444443
No 149
>PF04949 Transcrip_act: Transcriptional activator; InterPro: IPR007033 Golgins are a family of coiled-coil proteins associated with the Golgi apparatus necessary for tethering events in membrane fusion and as structural supports for Golgi cisternae []. This entry represents proteins annotated as RAB6-interacting golgins.
Probab=80.14 E-value=30 Score=31.18 Aligned_cols=92 Identities=23% Similarity=0.273 Sum_probs=54.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK----EKAELERSISFLKEKETELDEILAVLTEKTEVDV 360 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~----~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~di 360 (434)
++.+...++.+.+.+-++|..++.+|+.-..=.+.++.-++. ...+|+-..+..+.+..+...|++....+..
T Consensus 50 VrekVq~~LgrveEetkrLa~ireeLE~l~dP~RkEv~~vRkkID~vNreLkpl~~~cqKKEkEykealea~nEknk--- 126 (159)
T PF04949_consen 50 VREKVQAQLGRVEEETKRLAEIREELEVLADPMRKEVEMVRKKIDSVNRELKPLGQSCQKKEKEYKEALEAFNEKNK--- 126 (159)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH---
Confidence 455556666777777777888888887655334444433333 3333444444555667777777776643221
Q ss_pred cccccccchHHHHHHHhhHHHHhh
Q psy18020 361 DEAVTTTAPIYKQILNTFTEEAAT 384 (434)
Q Consensus 361 De~i~~~~~l~~Ql~~l~Aed~Ai 384 (434)
..+-|...|.+||.+-..+
T Consensus 127 -----eK~~Lv~~L~eLv~eSE~~ 145 (159)
T PF04949_consen 127 -----EKAQLVTRLMELVSESERL 145 (159)
T ss_pred -----HHHHHHHHHHHHHHHHHHH
Confidence 1234667778888776543
No 150
>PF09731 Mitofilin: Mitochondrial inner membrane protein; InterPro: IPR019133 Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex []. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420) []. ; GO: 0031305 integral to mitochondrial inner membrane
Probab=79.90 E-value=83 Score=34.34 Aligned_cols=92 Identities=12% Similarity=0.204 Sum_probs=49.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC------------CCCcccccccc-----chHHHHH---HHhhHHHH
Q psy18020 323 RIDKEKAELERSISFLKEKETELDEILAVLTEKT------------EVDVDEAVTTT-----APIYKQI---LNTFTEEA 382 (434)
Q Consensus 323 eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~------------~~diDe~i~~~-----~~l~~Ql---~~l~Aed~ 382 (434)
.++.++..--..+..|..+++.|++.+..+.... ..++.+.+... .|+.++| -.+..+|.
T Consensus 368 ~v~~Er~~~~~~l~~~~~~~~~le~~~~~~~~~~~~~~~~~~l~~a~~~l~~~l~~~~~~~~~p~~~el~~l~~~~~~d~ 447 (582)
T PF09731_consen 368 KVEQERNGRLAKLAELNSRLKALEEALDARSEAEDENRRAQQLWLAVDALKSALDSGNAGSPRPFEDELRALKELAPDDE 447 (582)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCCCCHHHHHHHHHHhCCCCh
Confidence 3444555545555555555666655555443110 01333333333 4555444 33346677
Q ss_pred hhhhHHHHHHHHH-hcCCCCHHHHHHHHHHHHH
Q psy18020 383 ATEDTIYYMGEAL-RRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 383 AieDtiy~L~~aL-~~g~I~ld~flK~vR~LaR 414 (434)
-|+-+|..|.... .+|+.+.+.-.-+-+.|++
T Consensus 448 ~v~~~l~~l~~~a~~~Gv~s~~~L~~rf~~v~~ 480 (582)
T PF09731_consen 448 LVDAALSSLPPEAAQRGVPSEAQLRNRFERVAP 480 (582)
T ss_pred HHHHHHHhcCHHHhhCCCCCHHHHHHHHHHHHH
Confidence 7888888887755 4599887665554444444
No 151
>KOG1853|consensus
Probab=79.87 E-value=51 Score=32.35 Aligned_cols=17 Identities=24% Similarity=0.198 Sum_probs=7.4
Q ss_pred HHhhhhHHHHHHHHHhc
Q psy18020 381 EAATEDTIYYMGEALRR 397 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~ 397 (434)
...++|-=.-|+.|+.+
T Consensus 132 i~sleDfeqrLnqAIEr 148 (333)
T KOG1853|consen 132 IYSLEDFEQRLNQAIER 148 (333)
T ss_pred hhhHHHHHHHHHHHHHH
Confidence 33444444444444443
No 152
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF.
Probab=79.84 E-value=51 Score=35.75 Aligned_cols=43 Identities=37% Similarity=0.424 Sum_probs=18.8
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
...|....+.|++..+.|+.....|+...+.|+++.+++++..
T Consensus 82 E~rL~qRee~Lekr~e~Lekre~~Le~ke~~L~~re~eLee~~ 124 (514)
T TIGR03319 82 ERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKE 124 (514)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444444444444444444444444433
No 153
>KOG1937|consensus
Probab=79.64 E-value=47 Score=35.16 Aligned_cols=25 Identities=12% Similarity=0.076 Sum_probs=17.0
Q ss_pred ccccc---cccchHHHHHHHhhHHHHhh
Q psy18020 360 VDEAV---TTTAPIYKQILNTFTEEAAT 384 (434)
Q Consensus 360 iDe~i---~~~~~l~~Ql~~l~Aed~Ai 384 (434)
+|+++ .-.|..+++.|.++|.-+++
T Consensus 433 tdellf~sakhddhvR~aykllt~iH~n 460 (521)
T KOG1937|consen 433 TDELLFMSAKHDDHVRLAYKLLTRIHLN 460 (521)
T ss_pred HHHHHHHHhccCHHHHHHHHHHHHHHHH
Confidence 45554 34567788888888877764
No 154
>PF04899 MbeD_MobD: MbeD/MobD like ; InterPro: IPR006983 The MbeD and MobD proteins are plasmid encoded, and are involved in the plasmid mobilisation and transfer in the presence of conjugative plasmids [].
Probab=79.63 E-value=28 Score=27.38 Aligned_cols=50 Identities=26% Similarity=0.399 Sum_probs=38.3
Q ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 304 KQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 304 ~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.+.+.+.......|+.+...-+.+...|...+..|....+.|.+-++.++
T Consensus 20 ~~q~~~Wq~sy~~Lq~~~~~t~~~~a~L~~qv~~Ls~qv~~Ls~ql~rLs 69 (70)
T PF04899_consen 20 EKQQQEWQSSYADLQHMFEQTSQENAALSEQVNNLSQQVQRLSEQLERLS 69 (70)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 34555666666668888888888888888888888888888888777663
No 155
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=79.61 E-value=9.9 Score=41.46 Aligned_cols=68 Identities=24% Similarity=0.331 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
++.++++|+.+...|+..+..|+.++++|++-++.++.... ++ .-.--|-.+.+..|+-|.+.|.
T Consensus 427 ~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~---~~------------~~~~rei~~~~~~I~~L~~~L~ 491 (652)
T COG2433 427 LEETVERLEEENSELKRELEELKREIEKLESELERFRREVR---DK------------VRKDREIRARDRRIERLEKELE 491 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HH------------HhhhHHHHHHHHHHHHHHHHHH
Confidence 55566666666666666666666666666655555432211 11 1112234456666777777776
Q ss_pred cCC
Q psy18020 397 RGV 399 (434)
Q Consensus 397 ~g~ 399 (434)
...
T Consensus 492 e~~ 494 (652)
T COG2433 492 EKK 494 (652)
T ss_pred HHH
Confidence 553
No 156
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=79.61 E-value=37 Score=29.77 Aligned_cols=47 Identities=13% Similarity=0.071 Sum_probs=28.3
Q ss_pred HHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 381 EAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~ 427 (434)
+..++||+.+|++.++.=.-.++..-+....+.++-=-.+..++.+.
T Consensus 89 E~~~~eA~~~l~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~~l~~l~ 135 (140)
T PRK03947 89 EKDLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQ 135 (140)
T ss_pred EecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45788999999988776555555555666555554333333333333
No 157
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=79.57 E-value=34 Score=32.02 Aligned_cols=27 Identities=30% Similarity=0.345 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 326 KEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 326 ~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++..+-..+..|+.+.++|.+-++.+
T Consensus 103 ~eR~~~l~~l~~l~~~~~~l~~el~~~ 129 (188)
T PF03962_consen 103 EEREELLEELEELKKELKELKKELEKY 129 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556666666666666666666644
No 158
>KOG0995|consensus
Probab=79.48 E-value=44 Score=36.32 Aligned_cols=38 Identities=21% Similarity=0.356 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.+..+..++..+..+++.+..|..++++.++-++.+
T Consensus 276 nK~~~y~~~~~~k~~~~~~~l~~l~~Eie~kEeE~e~l 313 (581)
T KOG0995|consen 276 NKFQAYVSQMKSKKQHMEKKLEMLKSEIEEKEEEIEKL 313 (581)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444555554444444444444
No 159
>PRK00106 hypothetical protein; Provisional
Probab=79.38 E-value=58 Score=35.50 Aligned_cols=39 Identities=21% Similarity=0.084 Sum_probs=16.2
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQV 409 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~v 409 (434)
+-++|++.+ ++.+..|+--.+.+.-..-+-..|..-|.+
T Consensus 170 ak~~l~~~~-~~~~~~~~~~~i~~~e~~a~~~a~~~a~~i 208 (535)
T PRK00106 170 AREIILAET-ENKLTHEIATRIREAEREVKDRSDKMAKDL 208 (535)
T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444443 333344444444444433333444444443
No 160
>KOG0161|consensus
Probab=79.35 E-value=17 Score=45.10 Aligned_cols=75 Identities=19% Similarity=0.209 Sum_probs=45.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 320 MFERIDKEKAELERSISFLKEKETELDEILAVLTE---KTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 320 ~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~---k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
.++.|+.+...+...+..|....+++.+.+..++. .-+..+.+ +-.+|-++-+++.+-||....+.=.+.
T Consensus 1485 ~~e~l~renk~l~~ei~dl~~~~~e~~k~v~elek~~r~le~e~~e-------lQ~aLeElE~~le~eE~~~lr~~~~~~ 1557 (1930)
T KOG0161|consen 1485 QLEELRRENKNLSQEIEDLEEQKDEGGKRVHELEKEKRRLEQEKEE-------LQAALEELEAALEAEEDKKLRLQLELQ 1557 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHhhhhhhHHHHHHHHHH
Confidence 34445555555555555555555555555554431 12234445 899999999999999999877444433
Q ss_pred cCCCC
Q psy18020 397 RGVID 401 (434)
Q Consensus 397 ~g~I~ 401 (434)
+-+++
T Consensus 1558 ~~r~e 1562 (1930)
T KOG0161|consen 1558 QLRSE 1562 (1930)
T ss_pred HHHHH
Confidence 33333
No 161
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=79.25 E-value=53 Score=34.41 Aligned_cols=70 Identities=23% Similarity=0.211 Sum_probs=32.4
Q ss_pred HHHHHHHHHHH-HHHHHHhHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHhcCCCCCCccccc
Q psy18020 295 QAQEEIEILKQ-TENELNQGKFKLARMFERIDKEKAELER-------SISFLKEKETELDEILAVLTEKTEVDVDEAV 364 (434)
Q Consensus 295 q~~~ele~L~~-~q~EL~~~~~~L~~~~~eLe~~k~~Le~-------~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i 364 (434)
.++.+++.|+. .+.|+..-.+.|+.+.-+.+...+.+++ +|..|+.+.+.+++-++-......-||.|++
T Consensus 230 ~L~~~~e~Lk~~~~~e~~~~~~~LqEEr~R~erLEeqlNd~~elHq~Ei~~LKqeLa~~EEK~~Yqs~eRaRdi~E~~ 307 (395)
T PF10267_consen 230 RLEESIEKLKEQYQREYQFILEALQEERYRYERLEEQLNDLTELHQNEIYNLKQELASMEEKMAYQSYERARDIWEVM 307 (395)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhHHHHHH
Confidence 34455555543 4444444444444444444444444444 4444444444444444433334444666643
No 162
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=79.24 E-value=75 Score=37.98 Aligned_cols=22 Identities=14% Similarity=0.136 Sum_probs=9.2
Q ss_pred HHHHHHhhHHHHhhhhHHHHHH
Q psy18020 371 YKQILNTFTEEAATEDTIYYMG 392 (434)
Q Consensus 371 ~~Ql~~l~Aed~AieDtiy~L~ 392 (434)
..++-++-++-..++|.|..+.
T Consensus 862 ~~~l~~~~~~~~~l~~~l~~~~ 883 (1163)
T COG1196 862 KEELEELEAEKEELEDELKELE 883 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444333
No 163
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=79.06 E-value=85 Score=38.00 Aligned_cols=25 Identities=8% Similarity=0.155 Sum_probs=11.2
Q ss_pred HHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 372 KQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
.++=++-++-..+.+.|..|.+.++
T Consensus 984 ~~ie~le~e~~~l~~~i~~l~kel~ 1008 (1311)
T TIGR00606 984 AQLEECEKHQEKINEDMRLMRQDID 1008 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444443
No 164
>PF03148 Tektin: Tektin family; InterPro: IPR000435 Tektin heteropolymers form unique protofilaments of flagellar microtubules []. The proteins are predicted to form extended rods composed of 2 alpha- helical segments (~180 residues long) capable of forming coiled coils, interrupted by non-helical linkers []. The 2 segments are similar in sequence, indicating a gene duplication event. Along each tektin rod, cysteine residues occur with a periodicity of ~8nm, coincident with the axial repeat of tubulin dimers in microtubules []. It is proposed that the assembly of tektin heteropolymers produces filaments with repeats of 8, 16, 24, 32, 40, 48 and 96nm, generating the basis for the complex spatial arrangements of axonemal components [].; GO: 0000226 microtubule cytoskeleton organization, 0005874 microtubule
Probab=78.98 E-value=83 Score=32.73 Aligned_cols=60 Identities=20% Similarity=0.306 Sum_probs=47.7
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k 355 (434)
+.+-+..+..-+.+|.....++..++..|...+..++..+..+..-+....+.+...+++
T Consensus 48 L~~Ri~di~~wk~eL~~~l~~~~~Ei~~L~~~K~~le~aL~~~~~pl~i~~ecL~~R~~R 107 (384)
T PF03148_consen 48 LRQRIRDIRFWKNELERELEELDEEIDLLEEEKRRLEKALEALRKPLSIAQECLSLREKR 107 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHhCC
Confidence 344455567778888888888999999999999999999998888888888888766543
No 165
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=78.93 E-value=39 Score=30.03 Aligned_cols=8 Identities=13% Similarity=0.214 Sum_probs=3.2
Q ss_pred HHHHHHHH
Q psy18020 278 KSAVEDKI 285 (434)
Q Consensus 278 ~sal~dkL 285 (434)
...|.+-|
T Consensus 37 in~i~~Ll 44 (151)
T PF11559_consen 37 INCIYDLL 44 (151)
T ss_pred HHHHHHHH
Confidence 34444433
No 166
>KOG0993|consensus
Probab=78.86 E-value=25 Score=36.63 Aligned_cols=89 Identities=24% Similarity=0.329 Sum_probs=54.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------hcC-----C-CCCCccccccccchHHHHHHHhhHHHHhhhhHHHH
Q psy18020 323 RIDKEKAELERSISFLKEKETELDEILAV------LTE-----K-TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYY 390 (434)
Q Consensus 323 eLe~~k~~Le~~i~~L~~~~~eLe~~I~~------l~~-----k-~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~ 390 (434)
+|+.++..+..+...+++++..|.+.++. |++ + ++-.+.++ -+|..|++-++-++-.--||-|.-
T Consensus 104 ~leqertq~qq~~e~~erEv~~l~~llsr~~~~~~Lenem~ka~Ed~eKlrel---v~pmekeI~elk~kl~~aE~~i~E 180 (542)
T KOG0993|consen 104 NLEQERTQLQQNEEKLEREVKALMELLSRGQYQLDLENEMDKAKEDEEKLREL---VTPMEKEINELKKKLAKAEQRIDE 180 (542)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhHHHHHHHHhhHHHHHHH---HhhHHHHHHHHHHHHHhHHHHHHH
Confidence 35555555555555555555555554443 111 0 11123333 357889999999999999999999
Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 391 MGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 391 L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
|.|++++-.=+|..|-...- |+|+
T Consensus 181 l~k~~~h~a~slh~~t~lL~-L~RD 204 (542)
T KOG0993|consen 181 LSKAKHHKAESLHVFTDLLN-LARD 204 (542)
T ss_pred HHhhhcccchHHHHHHHHHH-HhcC
Confidence 99999886556666544332 4443
No 167
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=78.86 E-value=67 Score=36.28 Aligned_cols=127 Identities=19% Similarity=0.197 Sum_probs=64.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER-----IDKEKAELERSISFLKEKETELDEILAVLTEKTEVD 359 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~e-----Le~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~d 359 (434)
+.++..+..+.++-.-++|..++++|...+.+|++..++ ++.+...+-+.+..|+++..+++..+..+.
T Consensus 258 l~~k~~~a~~a~~fL~~qL~~l~~~L~~aE~~l~~fr~~~~~~d~~~ea~~~l~~~~~l~~ql~~l~~~~~~l~------ 331 (726)
T PRK09841 258 IARQAAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQLNELTFREAEIS------ 331 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH------
Confidence 334333333333344455667777777766666655542 233334444455666666666655444432
Q ss_pred ccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 360 VDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKC 426 (434)
Q Consensus 360 iDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki 426 (434)
..+....|....|-. +..+|+.-+--+. ...-.+..--++.+.|-|++=..|.+-..+
T Consensus 332 --~~~~~~hP~v~~l~~---~~~~L~~~~~~l~----~~~~~~p~~e~~~~~L~R~~~~~~~lY~~l 389 (726)
T PRK09841 332 --QLYKKDHPTYRALLE---KRQTLEQERKRLN----KRVSAMPSTQQEVLRLSRDVEAGRAVYLQL 389 (726)
T ss_pred --HHhcccCchHHHHHH---HHHHHHHHHHHHH----HHHHhccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 223344444433332 2233443333332 223345566777778888876666655443
No 168
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=78.82 E-value=29 Score=28.92 Aligned_cols=73 Identities=16% Similarity=0.217 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYY 390 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~ 390 (434)
.=+.+-+|......+...++.|+.+..++.+-|..+...+ -+.++++.-..-+-+++-++-++-..+++.++.
T Consensus 27 ~vd~i~~ld~~~r~l~~~~e~lr~~rN~~sk~I~~~~~~~-~~~~~l~~e~~~lk~~i~~le~~~~~~e~~l~~ 99 (108)
T PF02403_consen 27 DVDEIIELDQERRELQQELEELRAERNELSKEIGKLKKAG-EDAEELKAEVKELKEEIKELEEQLKELEEELNE 99 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHTT-CCTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhhCc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334445555555566666666666666666666554443 456664444444444554444444444444443
No 169
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=78.81 E-value=69 Score=32.68 Aligned_cols=17 Identities=24% Similarity=0.466 Sum_probs=8.4
Q ss_pred chHHHHHHHHHHHHHHH
Q psy18020 270 DEHIKMSLKSAVEDKIR 286 (434)
Q Consensus 270 ~e~~r~sl~sal~dkL~ 286 (434)
+..++.+...-+..|||
T Consensus 62 e~qlk~aa~~llq~kir 78 (401)
T PF06785_consen 62 EKQLKTAAGQLLQTKIR 78 (401)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34455555554555544
No 170
>smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains
Probab=78.58 E-value=38 Score=28.25 Aligned_cols=29 Identities=21% Similarity=0.313 Sum_probs=19.8
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhcC
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRRG 398 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~g 398 (434)
+..|+-.+-..-..+..++.+..++|+.|
T Consensus 77 l~~q~~~l~~~l~~l~~~~~~~e~~l~~~ 105 (127)
T smart00502 77 LEQQLESLTQKQEKLSHAINFTEEALNSG 105 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 44455555556666778888888888765
No 171
>KOG4005|consensus
Probab=78.53 E-value=33 Score=33.27 Aligned_cols=28 Identities=11% Similarity=0.092 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHH
Q psy18020 300 IEILKQTENELNQGKFKLARMFERIDKE 327 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~eLe~~ 327 (434)
|+.+...-.+|..+.++|..+-+.|++.
T Consensus 92 m~eme~~i~dL~een~~L~~en~~Lr~~ 119 (292)
T KOG4005|consen 92 MEEMEYEIKDLTEENEILQNENDSLRAI 119 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333444444444444444333333
No 172
>PRK04863 mukB cell division protein MukB; Provisional
Probab=78.48 E-value=1.1e+02 Score=37.67 Aligned_cols=53 Identities=15% Similarity=0.102 Sum_probs=32.6
Q ss_pred HHHHHhhHHHHhhhhHHHHHHHHHh-----cCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 372 KQILNTFTEEAATEDTIYYMGEALR-----RGVIDLEVYLKQVRSLSRKQFMFRALLF 424 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDtiy~L~~aL~-----~g~I~ld~flK~vR~LaReQF~~Ra~~~ 424 (434)
.++.++-.+=..+++.+.-+.++++ .|+|+-..--...|.+-|+-=-.|++..
T Consensus 456 ~qL~elE~kL~~lea~leql~~~~~~l~~~~Gkv~~~~a~~~~~~~~~~~~~~~~~~~ 513 (1486)
T PRK04863 456 EELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAE 513 (1486)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCHHHHHHHHHHHHHHhHHHHHHHH
Confidence 4555555555556666666666653 5888877776777776665544454443
No 173
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=78.45 E-value=21 Score=40.65 Aligned_cols=44 Identities=36% Similarity=0.423 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~ 340 (434)
...++.|.+.+.+++...+.++....+++..+.+++...+.|++
T Consensus 514 ~~li~~L~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~ 557 (771)
T TIGR01069 514 NVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE 557 (771)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444444444444444444444444444433333
No 174
>PF13041 PPR_2: PPR repeat family
Probab=78.44 E-value=5.7 Score=28.14 Aligned_cols=48 Identities=17% Similarity=0.318 Sum_probs=41.3
Q ss_pred cchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC-CCCHHHHHHHHHHHHH
Q psy18020 367 TAPIYKQILNTFTEEAATEDTIYYMGEALRRG-VIDLEVYLKQVRSLSR 414 (434)
Q Consensus 367 ~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g-~I~ld~flK~vR~LaR 414 (434)
....||-|++.+++..-+++++..+.++.++| +.+..+|---++.++|
T Consensus 2 ~~~~yn~li~~~~~~~~~~~a~~l~~~M~~~g~~P~~~Ty~~li~~~~k 50 (50)
T PF13041_consen 2 DVVTYNTLISGYCKAGKFEEALKLFKEMKKRGIKPDSYTYNILINGLCK 50 (50)
T ss_pred chHHHHHHHHHHHHCcCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHcC
Confidence 34679999999999999999999999999999 5688888777777664
No 175
>PRK14155 heat shock protein GrpE; Provisional
Probab=78.40 E-value=23 Score=33.74 Aligned_cols=76 Identities=13% Similarity=0.122 Sum_probs=33.0
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCCccccccccchHHHHHHHhh
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK-TEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k-~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
+.+.++++|++.-+...++++.++ .+..+++-+..|-.-.+.|++.+...... .+.+++.++.+-.-.++||++.+
T Consensus 34 d~~lR~~AefeN~RKR~~kE~e~~--~~~a~~~~~~~LLpV~DnLerAl~~~~~~~~~~~~~~i~~Gvemi~k~~~~~L 110 (208)
T PRK14155 34 DQALRYAAEAENTKRRAEREMNDA--RAYAIQKFARDLLGAADNLGRATAASPKDSADPAVKNFIIGVEMTEKELLGAF 110 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhHHhhHHHHHhcccccccchHHHHHHHHHHHHHHHHHHHH
Confidence 334444555544444444433332 23344455555555555666655533211 11123344444444444444443
No 176
>KOG4571|consensus
Probab=78.38 E-value=17 Score=36.15 Aligned_cols=37 Identities=22% Similarity=0.237 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
+.+..++..|+...++|+..+..|+.++..|.+.|.+
T Consensus 251 E~l~ge~~~Le~rN~~LK~qa~~lerEI~ylKqli~e 287 (294)
T KOG4571|consen 251 EALLGELEGLEKRNEELKDQASELEREIRYLKQLILE 287 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3355555555555555555555555555555555543
No 177
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=78.35 E-value=45 Score=29.01 Aligned_cols=59 Identities=27% Similarity=0.311 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 294 VQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 294 ~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..++.++++|.+.+.+|...--++.....+++.....+..--..+++-..+.+..++-+
T Consensus 33 ~~l~~el~~l~~~r~~l~~Eiv~l~~~~e~~~~~~~~~~~L~~el~~l~~ry~t~Lell 91 (120)
T PF12325_consen 33 ASLQEELARLEAERDELREEIVKLMEENEELRALKKEVEELEQELEELQQRYQTLLELL 91 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34455555555555555554444444444444433333333333333333333334333
No 178
>PF06248 Zw10: Centromere/kinetochore Zw10; InterPro: IPR009361 Zeste white 10 (ZW10) was initially identified as a mitotic checkpoint protein involved in chromosome segregation, and then implicated in targeting cytoplasmic dynein and dynactin to mitotic kinetochores, but it is also important in non-dividing cells. These include cytoplasmic dynein targeting to Golgi and other membranes, and SNARE-mediated ER-Golgi trafficking [, ]. Dominant-negative ZW10, anti-ZW10 antibody, and ZW10 RNA interference (RNAi) cause Golgi dispersal. ZW10 RNAi also disperse endosomes and lysosomes []. Drosophila kinetochore components Rough deal (Rod) and Zw10 are required for the proper functioning of the metaphase checkpoint in flies []. The eukaryotic spindle assembly checkpoint (SAC) monitors microtubule attachment to kinetochores and prevents anaphase onset until all kinetochores are aligned on the metaphase plate. It is an essential surveillance mechanism that ensures high fidelity chromosome segregation during mitosis. In higher eukaryotes, cytoplasmic dynein is involved in silencing the SAC by removing the checkpoint proteins Mad2 and the Rod-Zw10-Zwilch complex (RZZ) from aligned kinetochores [, , ].; GO: 0007067 mitosis, 0000775 chromosome, centromeric region, 0005634 nucleus
Probab=78.35 E-value=1.1e+02 Score=33.56 Aligned_cols=19 Identities=26% Similarity=0.200 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy18020 406 LKQVRSLSRKQFMFRALLF 424 (434)
Q Consensus 406 lK~vR~LaReQF~~Ra~~~ 424 (434)
++-.+.|..|-+..|..+.
T Consensus 149 ~~i~~~Lk~e~~~lr~~L~ 167 (593)
T PF06248_consen 149 LKILKLLKDEYSELRENLQ 167 (593)
T ss_pred cHHHHHHHHHHHHHHHHHH
Confidence 6777777777776665544
No 179
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=78.33 E-value=50 Score=37.81 Aligned_cols=31 Identities=16% Similarity=0.056 Sum_probs=22.8
Q ss_pred hhhhHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy18020 383 ATEDTIYYMGEALRRGVIDLEVYLKQVRSLS 413 (434)
Q Consensus 383 AieDtiy~L~~aL~~g~I~ld~flK~vR~La 413 (434)
.+++.+.-+.++|+++.-++..+.+.+|...
T Consensus 354 qt~~~v~~ik~~l~~~~~~i~~~a~~i~~~~ 384 (806)
T PF05478_consen 354 QTSDVVPPIKRDLDSIGKQIRSQAKQIPNQI 384 (806)
T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 3556677777888888777777887777773
No 180
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=78.32 E-value=62 Score=36.63 Aligned_cols=35 Identities=14% Similarity=0.140 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
+++++..|++..++|.+.+++..++.+.|.+.++.
T Consensus 584 l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~~ 618 (717)
T PF10168_consen 584 LQEERKSLRESAEKLAERYEEAKDKQEKLMKRVDR 618 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455555555555555555555555555554443
No 181
>KOG1029|consensus
Probab=78.29 E-value=15 Score=41.20 Aligned_cols=30 Identities=20% Similarity=0.286 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 324 IDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
|++.+..|+.+++.|+.+..+|+.-|++.+
T Consensus 435 ~nak~~ql~~eletLn~k~qqls~kl~Dvr 464 (1118)
T KOG1029|consen 435 LNAKKKQLQQELETLNFKLQQLSGKLQDVR 464 (1118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhe
Confidence 445555566666666666666666666554
No 182
>PRK14162 heat shock protein GrpE; Provisional
Probab=78.21 E-value=38 Score=31.93 Aligned_cols=74 Identities=11% Similarity=0.106 Sum_probs=34.6
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhh
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
+.+.++++|+..-+.+.++++.++. +..+++-+..|-.-.+.|++.+.... ++-++..++.+-.-+++||.+.+
T Consensus 60 d~~lR~~AEfeN~rkR~~kE~e~~~--~~a~~~~~~~LLpV~DnLerAl~~~~--~~~~~~~l~~Gvemi~k~l~~vL 133 (194)
T PRK14162 60 DKYLRSQAEIQNMQNRYAKERAQLI--KYESQSLAKDVLPAMDNLERALAVKA--DDEAAKQLKKGVQMTLDHLVKAL 133 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhHHhHHHHHHhccc--cchhHHHHHHHHHHHHHHHHHHH
Confidence 4445555555554444444444332 33345555555555666666665321 11223344444444455554444
No 183
>KOG0979|consensus
Probab=78.06 E-value=1.3e+02 Score=35.03 Aligned_cols=89 Identities=15% Similarity=0.195 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHH---HHhhhhHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTE---EAATEDTIYYMGE 393 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Ae---d~AieDtiy~L~~ 393 (434)
++..+.++++.+..|+..+..|....+++...++. .+.|.+++..-.--...++...+++ +.+.++ +|+++.
T Consensus 683 ~~~~~~~~~~r~~~ie~~~~~l~~qkee~~~~~~~----~I~~~~~~~~~~~~~~~~~~~k~~e~~i~~~~~~-~~~~~s 757 (1072)
T KOG0979|consen 683 LNSELKSYQQRKERIENLVVDLDRQEEEYAASEAK----KILDTEDMRIQSIRWHLELTDKHKEIGIKEKNES-SYMARS 757 (1072)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhH-HHHHHH
Confidence 44444455555555555555555555555444432 2345555444444455555555555 333333 222221
Q ss_pred HHhcCCCCHHHHHHHHHHH
Q psy18020 394 ALRRGVIDLEVYLKQVRSL 412 (434)
Q Consensus 394 aL~~g~I~ld~flK~vR~L 412 (434)
+..+|.+..++..+-..
T Consensus 758 --~~~~iea~~~i~~~e~~ 774 (1072)
T KOG0979|consen 758 --NKNNIEAERKIEKLEDN 774 (1072)
T ss_pred --HHHHHHHHHHHHHHHhH
Confidence 12345555555554443
No 184
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=77.92 E-value=1e+02 Score=34.24 Aligned_cols=9 Identities=22% Similarity=0.298 Sum_probs=4.2
Q ss_pred CCCHHHHHH
Q psy18020 62 NSKLEGLIK 70 (434)
Q Consensus 62 ~s~L~~li~ 70 (434)
+++|++.+.
T Consensus 41 Kttll~ai~ 49 (650)
T TIGR03185 41 KTTLLDAIQ 49 (650)
T ss_pred HHHHHHHHH
Confidence 444444443
No 185
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=77.74 E-value=66 Score=32.00 Aligned_cols=18 Identities=6% Similarity=0.259 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHHHhc
Q psy18020 336 SFLKEKETELDEILAVLT 353 (434)
Q Consensus 336 ~~L~~~~~eLe~~I~~l~ 353 (434)
++.+.++.+|...|+.++
T Consensus 120 KEARkEIkQLkQvieTmr 137 (305)
T PF15290_consen 120 KEARKEIKQLKQVIETMR 137 (305)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344555556666666554
No 186
>PF05791 Bacillus_HBL: Bacillus haemolytic enterotoxin (HBL); InterPro: IPR008414 This family consists of several Bacillus haemolytic enterotoxins (HblC, HblD, HblA, NheA, and NheB), which can cause food poisoning in humans []. Haemolysin BL (encoded by HBL) and non-haemolytic enterotoxin (encoded by NHE), represent the major enterotoxins produced by Bacillus cereus. Most of the cytotoxic activity of B. cereus isolates has been attributed to the level of Nhe, which may indicate a highly diarrheic potential []. The exact mechanism by which B. cereus causes diarrhoea is unknown. Hbl, cytotoxin K (CytK) and Nhe are all putative causes. Both Hbl and Nhe are three-component cytotoxins and maximal cytotoxicity of Nhe against epithelia is dependent on all three components. Nhe has haemolytic activity against erythrocytes from a variety of species. It is possible that the common structural and functional properties of these toxins indicate that the Hbl/Nhe and ClyA families of toxins constitute a superfamily of pore-forming cytotoxins []. The high virulence of some strains is thought to be due to the greater cytotoxic activity of CytK-1 compared to CytK-2, and to a high level of cytK expression []. Haemolysin BL and non-haemolytic enterotoxin production are both influenced by pH and micro []. This entry is found in cytotoxic proteins that form part of the enterotoxin complex and bind to erythrocytes. HblA is composed of a binding component, B, and two lytic components, L1 and L2. All three subunits act synergically to cause hemolysis.; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2NRJ_A.
Probab=77.56 E-value=20 Score=33.34 Aligned_cols=53 Identities=23% Similarity=0.345 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 300 IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.|..++.+....+.+++..+.+|+..+..|.+....|++...++...+...
T Consensus 109 ~~~i~~L~~~i~~~q~~~~~~i~~L~~f~~~l~~D~~~l~~~~~~l~~~l~~~ 161 (184)
T PF05791_consen 109 KEIIEDLQDQIQKNQDKVQALINELNDFKDKLQKDSRNLKTDVDELQSILAGE 161 (184)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHhcc
Confidence 45577788888999999999999999999999999999999999999988743
No 187
>PRK14154 heat shock protein GrpE; Provisional
Probab=77.54 E-value=33 Score=32.68 Aligned_cols=47 Identities=17% Similarity=0.068 Sum_probs=20.2
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 302 ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 302 ~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.+.++++|.+.-+...++++.++ .+...++-+..|-.-.+.|++.+.
T Consensus 74 ~~lRl~ADfeNyRKR~~kE~e~~--~~~a~e~~~~~LLpVlDnLeRAL~ 120 (208)
T PRK14154 74 QYLRAQAEMDNLRKRIEREKADI--IKFGSKQLITDLLPVADSLIHGLE 120 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhHHhHHHHHHh
Confidence 34444444444333333333332 223334444444444555555554
No 188
>PF12126 DUF3583: Protein of unknown function (DUF3583); InterPro: IPR021978 This domain is found in eukaryotes, and is typically between 302 and 338 amino acids in length. It is found in association with PF00097 from PFAM and PF00643 from PFAM. Most members are promyelocytic leukemia proteins, and this family lies towards the C terminus.
Probab=77.53 E-value=57 Score=32.65 Aligned_cols=61 Identities=15% Similarity=0.254 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhc
Q psy18020 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRR 397 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~ 397 (434)
|+.....-++.-+.|..++..|+..|..++..+ ..-|=+.++|.|.-.=|.=.+|.+||.+
T Consensus 59 Le~v~~rYqR~y~ema~~L~~LeavLqRir~G~-------------~LVekM~~YASDQEVLdMh~FlreAL~r 119 (324)
T PF12126_consen 59 LEAVEARYQRDYEEMAGQLGRLEAVLQRIRTGG-------------ALVEKMKLYASDQEVLDMHGFLREALER 119 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhHH-------------HHHHHHHHhcchHHHHHHHHHHHHHHHH
Confidence 333444444444455555555555555544333 3345567888888888888888888754
No 189
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=77.53 E-value=5 Score=26.28 Aligned_cols=29 Identities=28% Similarity=0.342 Sum_probs=25.3
Q ss_pred hhhHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy18020 384 TEDTIYYMGEALRRGVIDLEVYLKQVRSL 412 (434)
Q Consensus 384 ieDtiy~L~~aL~~g~I~ld~flK~vR~L 412 (434)
|.|.|-.|.+.+.+|.|+=+.|-+.-+.|
T Consensus 1 ~~~~L~~L~~l~~~G~IseeEy~~~k~~l 29 (31)
T PF09851_consen 1 IEDRLEKLKELYDKGEISEEEYEQKKARL 29 (31)
T ss_pred ChHHHHHHHHHHHcCCCCHHHHHHHHHHH
Confidence 57889999999999999999998876654
No 190
>PRK03918 chromosome segregation protein; Provisional
Probab=77.50 E-value=1.1e+02 Score=34.97 Aligned_cols=16 Identities=6% Similarity=-0.045 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHHHH
Q psy18020 404 VYLKQVRSLSRKQFMF 419 (434)
Q Consensus 404 ~flK~vR~LaReQF~~ 419 (434)
.|...++.-++++|-.
T Consensus 732 ~~~~~l~~~~~~~l~~ 747 (880)
T PRK03918 732 KYKALLKERALSKVGE 747 (880)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4444555555555543
No 191
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=77.33 E-value=16 Score=36.12 Aligned_cols=60 Identities=22% Similarity=0.333 Sum_probs=50.6
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k 355 (434)
-+..++.+..-+.+|+..+..+++.++.|.....+++.....|+.+..+++..+..+..+
T Consensus 146 Dk~ile~qk~dk~~Le~kq~~l~~~~e~l~al~~e~e~~~~~L~~qk~e~~~l~~~~aa~ 205 (265)
T COG3883 146 DKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAK 205 (265)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355677777888899999999999999999999999999999999999998888877543
No 192
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=77.32 E-value=21 Score=37.88 Aligned_cols=38 Identities=26% Similarity=0.311 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.......|......+.+.+..|+++..+|++.++..
T Consensus 371 ~~~~~~~~~l~~~~~~l~~~~~~l~~~~~~l~~~l~~~ 408 (451)
T PF03961_consen 371 PEKKEQLKKLKEKKKELKEELKELKEELKELKEELERS 408 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34667778888888899999999999999999999877
No 193
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=77.24 E-value=51 Score=32.16 Aligned_cols=17 Identities=6% Similarity=0.208 Sum_probs=10.2
Q ss_pred HHHHhhhhHHHHHHHHH
Q psy18020 379 TEEAATEDTIYYMGEAL 395 (434)
Q Consensus 379 Aed~AieDtiy~L~~aL 395 (434)
..+..|.-++.|...++
T Consensus 192 ~~~~~isaALgyvahlv 208 (302)
T PF10186_consen 192 LPDEEISAALGYVAHLV 208 (302)
T ss_pred CCHHHHHHHHHHHHHHH
Confidence 44556666666666653
No 194
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=77.12 E-value=6.8 Score=33.91 Aligned_cols=47 Identities=17% Similarity=0.217 Sum_probs=30.1
Q ss_pred hHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 378 FTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLF 424 (434)
Q Consensus 378 ~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~ 424 (434)
+.=+..+++|+.+|++.++.=.-.++..-+..+.|.++-=+....+.
T Consensus 79 ~~vE~~~~eA~~~l~~r~~~l~~~~~~l~~~l~~l~~~~~~~~~~l~ 125 (129)
T cd00584 79 YYVEKDLEEAIEFLDKKIEELTKQIEKLQKELAKLKDQINTLEAELQ 125 (129)
T ss_pred EEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456778888888887766555666666777766665544444443
No 195
>PHA03011 hypothetical protein; Provisional
Probab=77.09 E-value=24 Score=29.68 Aligned_cols=59 Identities=20% Similarity=0.183 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 294 VQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 294 ~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..+.+.++.|...-.+|...-..++.++..++..-+.-...+.-|+.+.++|++.++.+
T Consensus 60 Nai~e~ldeL~~qYN~L~dEYn~i~Ne~k~~~~iIQdn~d~I~~LraeIDkLK~niaN~ 118 (120)
T PHA03011 60 NAIIEILDELIAQYNELLDEYNLIENEIKDLEIIIQDNDDEIHFLRAEIDKLKENIANL 118 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHhcc
Confidence 34445566677777777777777888888888777777788888888888888887755
No 196
>PRK14158 heat shock protein GrpE; Provisional
Probab=76.64 E-value=34 Score=32.25 Aligned_cols=87 Identities=11% Similarity=0.117 Sum_probs=40.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccc
Q psy18020 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTT 366 (434)
Q Consensus 287 ~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~ 366 (434)
.++.+...++.+..+.+.++++|+..-....++++.++.. ..+++-+..|-.-.+.|++.+.... +-+.++++.+
T Consensus 47 ~~l~~le~e~~el~d~~lR~~AefeN~RkR~~kE~e~~~~--~a~~~~~~~lLpV~DnLerAl~~~~---~~~~~~i~~G 121 (194)
T PRK14158 47 EALAAKEAEAAANWDKYLRERADLENYRKRVQKEKEELLK--YGNESLILEILPAVDNMERALDHAD---EESMSAIIEG 121 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhHHhHHHHHHhccC---cchHHHHHHH
Confidence 3333333333333445556666666555555555444332 3444555555555566666554221 1123344444
Q ss_pred cchHHHHHHHhh
Q psy18020 367 TAPIYKQILNTF 378 (434)
Q Consensus 367 ~~~l~~Ql~~l~ 378 (434)
-.-+++|+.+.+
T Consensus 122 v~mi~k~l~~vL 133 (194)
T PRK14158 122 IRMTLSMLLSTL 133 (194)
T ss_pred HHHHHHHHHHHH
Confidence 444444444443
No 197
>PF11068 YlqD: YlqD protein; InterPro: IPR021297 This bacterial family of proteins has no known function. ; PDB: 4DCI_C.
Probab=76.56 E-value=47 Score=29.31 Aligned_cols=64 Identities=14% Similarity=0.221 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGK-----FKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~-----~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
.|.+.+.+.+.++..++++|.-....+.... ..+......++.+++.+...+..|..+.++++.
T Consensus 20 ~l~~~l~~~i~~~d~el~QLefq~kr~~~e~~~~~~~~~~~i~~q~~~e~~~r~e~k~~l~~ql~qv~~ 88 (131)
T PF11068_consen 20 ELLQELQEQIQQLDQELQQLEFQGKRMIKEIKKQNAQQIQSIQQQFEQEKQERLEQKNQLLQQLEQVQK 88 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3444555556666666666554333333222 345666666666666666666666666666544
No 198
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=76.51 E-value=15 Score=41.70 Aligned_cols=13 Identities=15% Similarity=0.317 Sum_probs=6.2
Q ss_pred eeecCcccccceE
Q psy18020 2 TNLSGLVYHIPVC 14 (434)
Q Consensus 2 v~~~g~~y~iPi~ 14 (434)
|++++..|-|||.
T Consensus 184 it~r~~r~vipvk 196 (771)
T TIGR01069 184 VTIRNGRYVLPLK 196 (771)
T ss_pred EEEECCEEEEEee
Confidence 4444555555444
No 199
>KOG4552|consensus
Probab=76.45 E-value=26 Score=33.28 Aligned_cols=75 Identities=17% Similarity=0.210 Sum_probs=56.3
Q ss_pred cCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC--CC-CHHHHHHHHH-HHHHHHHHHHHHHHHHH
Q psy18020 353 TEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRG--VI-DLEVYLKQVR-SLSRKQFMFRALLFKCY 427 (434)
Q Consensus 353 ~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g--~I-~ld~flK~vR-~LaReQF~~Ra~~~Ki~ 427 (434)
++.+.++|-++++..+--++-|++++-|-.-+|..|..|..-++.- .| -|..-||..- .|++--|-+|-.++-|.
T Consensus 41 ~~gE~v~il~Ll~~kd~ef~~llkla~eq~k~e~~m~~Lea~VEkrD~~IQqLqk~LK~aE~iLtta~fqA~qKLksi~ 119 (272)
T KOG4552|consen 41 KNGETVNILKLLDSKDDEFKTLLKLAPEQQKREQLMRTLEAHVEKRDEVIQQLQKNLKSAEVILTTACFQANQKLKSIK 119 (272)
T ss_pred hcchHHHHHHHHHhccHHHHHHHHHhHhHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556678888888999999999999999999999999998887643 33 5666666553 46777777776665554
No 200
>PF06705 SF-assemblin: SF-assemblin/beta giardin
Probab=76.39 E-value=59 Score=31.46 Aligned_cols=21 Identities=14% Similarity=0.172 Sum_probs=17.0
Q ss_pred hHHHHhhhhHHHHHHHHHhcC
Q psy18020 378 FTEEAATEDTIYYMGEALRRG 398 (434)
Q Consensus 378 ~Aed~AieDtiy~L~~aL~~g 398 (434)
.++|.-|=|||..+.++|.+|
T Consensus 221 ~~~Dd~Iv~aln~yt~~lQ~~ 241 (247)
T PF06705_consen 221 EQSDDDIVQALNHYTKALQDG 241 (247)
T ss_pred HhhhhHHHHHHHHHHHHHHHH
Confidence 456777888999999998877
No 201
>PF02203 TarH: Tar ligand binding domain homologue; InterPro: IPR003122 Methyl-accepting chemotaxis proteins (MCPs) are a family of bacterial receptors that mediate chemotaxis to diverse signals, responding to changes in the concentration of attractants and repellents in the environment by altering swimming behaviour []. Environmental diversity gives rise to diversity in bacterial signalling receptors, and consequently there are many genes encoding MCPs []. For example, there are four well-characterised MCPs found in Escherichia coli: Tar (taxis towards aspartate and maltose, away from nickel and cobalt), Tsr (taxis towards serine, away from leucine, indole and weak acids), Trg (taxis towards galactose and ribose) and Tap (taxis towards dipeptides). MCPs share similar topology and signalling mechanisms. MCPs either bind ligands directly or interact with ligand-binding proteins, transducing the signal to downstream signalling proteins in the cytoplasm. MCPs undergo two covalent modifications: deamidation and reversible methylation at a number of glutamate residues. Attractants increase the level of methylation, while repellents decrease it. The methyl groups are added by the methyl-transferase cheR and are removed by the methylesterase cheB. Most MCPs are homodimers that contain the following organisation: an N-terminal signal sequence that acts as a transmembrane domain in the mature protein; a poorly-conserved periplasmic receptor (ligand-binding) domain; a second transmembrane domain; and a highly-conserved C-terminal cytoplasmic domain that interacts with downstream signalling components. The C-terminal domain contains the glycosylated glutamate residues. This entry represents the ligand-binding domain found in a number of methyl-accepting chemotaxis receptors.; GO: 0004888 transmembrane signaling receptor activity, 0006935 chemotaxis, 0007165 signal transduction, 0016020 membrane; PDB: 2ASR_A 3ATP_A 2D4U_A 2LIG_A 1VLS_A 1LIH_A 1WAT_B 1VLT_B 1WAS_A 1JMW_A.
Probab=76.34 E-value=53 Score=29.02 Aligned_cols=77 Identities=9% Similarity=0.151 Sum_probs=43.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHH-Hh-hhhHHHHHHHHHhcCCCCHHHHHHHH
Q psy18020 332 ERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE-AA-TEDTIYYMGEALRRGVIDLEVYLKQV 409 (434)
Q Consensus 332 e~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed-~A-ieDtiy~L~~aL~~g~I~ld~flK~v 409 (434)
.+.+..+++..++.++.+.........+. ..+.+.+-+.+. .+ +++++.-+-.+++.| |++.|.+..
T Consensus 81 ~~~l~~~~~~l~~~~~~~~~~~~~~~~~~---------~~~~l~~~~~~~~~~~~~~~l~~~~~al~~~--d~~~~~~~~ 149 (171)
T PF02203_consen 81 AELLARAEQNLEQAEQAFDAFKALPHASP---------EERALADELEASFDAYLQQALDPLLAALRAG--DIAAFMQLP 149 (171)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHCS---GT---------GGHHHHHHHHHHHHH-HHHHHHHHHHHHHTT---HHHHHHSS
T ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCCCc---------chHHHHHHHHHHHHHHHHHhHHHHHHHHHCC--CHHHHHHHH
Confidence 34445555556666666665543222111 112223322222 23 477778888889988 999998887
Q ss_pred HHHHHHHHHH
Q psy18020 410 RSLSRKQFMF 419 (434)
Q Consensus 410 R~LaReQF~~ 419 (434)
++=....|..
T Consensus 150 ~~~~~~~~~~ 159 (171)
T PF02203_consen 150 NEKIQPLFDA 159 (171)
T ss_dssp HHHHHHHHHH
T ss_pred HHhcchhHHH
Confidence 7777766654
No 202
>KOG0804|consensus
Probab=76.18 E-value=32 Score=36.29 Aligned_cols=36 Identities=22% Similarity=0.222 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
..++.....++.+++.+++.+..++++.+++.+-+.
T Consensus 364 ~~l~~~~~~~e~~kk~~e~k~~q~q~k~~k~~kel~ 399 (493)
T KOG0804|consen 364 DSLKQESSDLEAEKKIVERKLQQLQTKLKKCQKELK 399 (493)
T ss_pred HhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445555555555555555555555555544444
No 203
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=76.18 E-value=11 Score=31.90 Aligned_cols=34 Identities=15% Similarity=0.156 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
++.++.+|+++.+.|+..|..|++..+-+++...
T Consensus 39 ~~~e~~~l~~~n~~L~~eI~~L~~~~dyiEe~AR 72 (105)
T PRK00888 39 QQQTNAKLKARNDQLFAEIDDLKGGQEAIEERAR 72 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCcHHHHHHHHH
Confidence 6666666777777777777777765555665544
No 204
>KOG4643|consensus
Probab=76.18 E-value=95 Score=36.16 Aligned_cols=99 Identities=17% Similarity=0.094 Sum_probs=52.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHH
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK--TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k--~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~ 392 (434)
+.|+..+.++.+...+|+..-+.|..+.++|.+.+..+... ..---.+...-..-++.|+..-++|-.+- |.-|+
T Consensus 418 e~Leeri~ql~qq~~eled~~K~L~~E~ekl~~e~~t~~~s~~rq~~e~e~~~q~ls~~~Q~~~et~el~~~---iknln 494 (1195)
T KOG4643|consen 418 EILEERINQLLQQLAELEDLEKKLQFELEKLLEETSTVTRSLSRQSLENEELDQLLSLQDQLEAETEELLNQ---IKNLN 494 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH---HHHHH
Confidence 44566666666666677777777777777776666555310 00001111111223566776666665554 66677
Q ss_pred HHHhcCCCCHHHHHHHHHHHHHHHHHHHHH
Q psy18020 393 EALRRGVIDLEVYLKQVRSLSRKQFMFRAL 422 (434)
Q Consensus 393 ~aL~~g~I~ld~flK~vR~LaReQF~~Ra~ 422 (434)
+.|++-..++ .|..|+.-|+++-+
T Consensus 495 k~L~~r~~el------srl~a~~~elkeQ~ 518 (1195)
T KOG4643|consen 495 KSLNNRDLEL------SRLHALKNELKEQY 518 (1195)
T ss_pred HHHHHHHHHH------HHHHHHHHHHHHHH
Confidence 7777655443 34445555554433
No 205
>KOG2072|consensus
Probab=75.86 E-value=1.5e+02 Score=33.86 Aligned_cols=67 Identities=19% Similarity=0.137 Sum_probs=41.3
Q ss_pred CCCCCCcccc--ccccchHHHHHHHhh-------HHHHhhhhHHHHHHHHHhcCCCCH-HHHHHHHHHHHHHHHHHH
Q psy18020 354 EKTEVDVDEA--VTTTAPIYKQILNTF-------TEEAATEDTIYYMGEALRRGVIDL-EVYLKQVRSLSRKQFMFR 420 (434)
Q Consensus 354 ~k~~~diDe~--i~~~~~l~~Ql~~l~-------Aed~AieDtiy~L~~aL~~g~I~l-d~flK~vR~LaReQF~~R 420 (434)
...++|+.++ +.+..+.-.|+=++- ..=.-++-.|+||.||++..-|.| +.+++.-+.-.||-|..|
T Consensus 650 ~~k~~d~ed~e~lD~d~i~~~q~eel~Ke~kElq~rL~~q~KkiDh~ERA~R~EeiPL~e~~~~~~~~~d~e~~e~~ 726 (988)
T KOG2072|consen 650 GGKEKDLEDLEKLDADQIKARQIEELEKERKELQSRLQYQEKKIDHLERAKRLEEIPLIEKAYDERQEEDRELYEAR 726 (988)
T ss_pred ccccCChHHhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHhhhhHHHHHHHHHhhhHHHHHHH
Confidence 3344444443 344445555544443 333456778999999999998864 666776666666666543
No 206
>KOG0977|consensus
Probab=75.80 E-value=56 Score=35.59 Aligned_cols=37 Identities=27% Similarity=0.402 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 316 KLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 316 ~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.+.+.+...++.+++.+|.+|..+.+++...+.+.
T Consensus 96 ~ar~~l~e~~~~ra~~e~ei~kl~~e~~elr~~~~~~ 132 (546)
T KOG0977|consen 96 TARKLLDETARERAKLEIEITKLREELKELRKKLEKA 132 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 3777777888888888888888888888887666554
No 207
>PRK14139 heat shock protein GrpE; Provisional
Probab=75.79 E-value=70 Score=29.94 Aligned_cols=11 Identities=9% Similarity=0.154 Sum_probs=4.7
Q ss_pred HHHHHHHHHHh
Q psy18020 302 ILKQTENELNQ 312 (434)
Q Consensus 302 ~L~~~q~EL~~ 312 (434)
.+.++++|++.
T Consensus 54 ~~lR~~AefeN 64 (185)
T PRK14139 54 SFLRAKAETEN 64 (185)
T ss_pred HHHHHHHHHHH
Confidence 33444444444
No 208
>PRK02224 chromosome segregation protein; Provisional
Probab=75.77 E-value=1.6e+02 Score=33.89 Aligned_cols=16 Identities=13% Similarity=0.005 Sum_probs=8.2
Q ss_pred CCCHHHHHHHHHHHhCCC
Q psy18020 62 NSKLEGLIKTLASAFSAL 79 (434)
Q Consensus 62 ~s~L~~li~~l~~~F~~~ 79 (434)
+|+|++.|.. ..|+..
T Consensus 36 KStil~ai~~--~l~g~~ 51 (880)
T PRK02224 36 KSSLLEACFF--ALYGSK 51 (880)
T ss_pred HHHHHHHHHH--HhCCCc
Confidence 6666655443 345543
No 209
>KOG4460|consensus
Probab=75.70 E-value=71 Score=34.75 Aligned_cols=36 Identities=17% Similarity=0.111 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.-+.+++++++++.++++...|.+..+|+.+.-+.|
T Consensus 600 QlQ~l~~~~eer~~i~e~a~~La~R~eea~e~qe~L 635 (741)
T KOG4460|consen 600 QLQDLSYCREERKSLREMAERLADRYEEAKEKQEDL 635 (741)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 333444444455555555555554444444443333
No 210
>KOG0996|consensus
Probab=75.63 E-value=63 Score=38.12 Aligned_cols=31 Identities=19% Similarity=0.170 Sum_probs=18.0
Q ss_pred HhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 395 LRRGVIDLEVYLKQVRSLSRKQFMFRALLFK 425 (434)
Q Consensus 395 L~~g~I~ld~flK~vR~LaReQF~~Ra~~~K 425 (434)
+....=.|+.-.+..|.|.+.---.|--+..
T Consensus 558 ~~~~~k~l~~~~~e~~~~~~~~~~~rqrveE 588 (1293)
T KOG0996|consen 558 LKEKEKELPKLRKEERNLKSQLNKLRQRVEE 588 (1293)
T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444556777777777776655544444333
No 211
>PF05064 Nsp1_C: Nsp1-like C-terminal region; InterPro: IPR007758 The NSP1-like protein appears to be an essential component of the nuclear pore complex, for example preribosome nuclear export requires the Nup82p-Nup159p-Nsp1p complex. The C-terminal of Nsp1 is involved in binding Nup82 [], probably via coiled-coil formation [, ]. The family is related to the rotavirus nonstructural protein NSP1 which is the least conserved protein in the rotavirus genome. Its function in the replication process is not fully understood.; GO: 0017056 structural constituent of nuclear pore, 0005643 nuclear pore; PDB: 3T97_C.
Probab=75.59 E-value=4.7 Score=34.72 Aligned_cols=48 Identities=21% Similarity=0.500 Sum_probs=38.3
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
|.....++.. |...+.+++..+++|++.+..++....+|+++|..++.
T Consensus 52 Lv~n~~~I~~----L~~~v~~~~~~Q~~ld~~L~~ie~qQ~eLe~~L~~lE~ 99 (116)
T PF05064_consen 52 LVENGEKISK----LYSEVQKAESEQKRLDQELDFIEAQQKELEELLDPLEK 99 (116)
T ss_dssp HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCCCCC
T ss_pred HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444 78888899999999999999999999999999987753
No 212
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=75.59 E-value=94 Score=34.04 Aligned_cols=53 Identities=15% Similarity=0.134 Sum_probs=25.2
Q ss_pred ccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy18020 360 VDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSL 412 (434)
Q Consensus 360 iDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~L 412 (434)
.+++...-.-+.+++=++-.+-..|.+.|..|.+.-..-+=.++.|-+..+.+
T Consensus 378 ysel~e~leel~e~leeie~eq~ei~e~l~~Lrk~E~eAr~kL~~~~~~L~~i 430 (569)
T PRK04778 378 YSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEI 430 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44433334444555555555555555555555544443333344444444433
No 213
>KOG4005|consensus
Probab=75.54 E-value=26 Score=33.92 Aligned_cols=56 Identities=20% Similarity=0.270 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346 (434)
Q Consensus 291 e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe 346 (434)
+.+..++.+|..|..-..-|...-+.|+..-..|-.+..+|...++.+.++..++.
T Consensus 90 aRm~eme~~i~dL~een~~L~~en~~Lr~~n~~L~~~n~el~~~le~~~~~l~~~~ 145 (292)
T KOG4005|consen 90 ARMEEMEYEIKDLTEENEILQNENDSLRAINESLLAKNHELDSELELLRQELAELK 145 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHhhH
Confidence 33444444455555544455555555555555555555555555555555555543
No 214
>PF04108 APG17: Autophagy protein Apg17 ; InterPro: IPR007240 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Autophagy protein 17 (Apg17) is required for activating Apg1 protein kinases. This entry also contains Autophagy protein 11 which is involved in cytoplasm to vacuole transport (Cvt) and pexophagy. ; GO: 0006914 autophagy
Probab=75.51 E-value=1.1e+02 Score=32.10 Aligned_cols=26 Identities=15% Similarity=0.244 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 403 EVYLKQVRSLSRKQFMFRALLFKCYKH 429 (434)
Q Consensus 403 d~flK~vR~LaReQF~~Ra~~~Ki~~~ 429 (434)
+.|=.-+.++.|+++. +..|++|...
T Consensus 346 ~aY~~LL~Ev~RRr~~-~~k~~~i~~~ 371 (412)
T PF04108_consen 346 SAYDSLLLEVERRRAV-RDKMKKIIRE 371 (412)
T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHH
Confidence 3566667788888887 5555555544
No 215
>COG3096 MukB Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=75.51 E-value=66 Score=36.34 Aligned_cols=107 Identities=14% Similarity=0.198 Sum_probs=56.0
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCCCCccccccccchHHHHHHHhhHHHHhhhh
Q psy18020 311 NQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE----KTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386 (434)
Q Consensus 311 ~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~----k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieD 386 (434)
....+.|++....+......|-+-....+.+.+.+.+.+.++.+ .++...+.+-.--|.++.+|-+.-..-.-+|-
T Consensus 1001 ~~~reqlrQ~Q~Q~sqYnqvl~~LksS~~~K~~~l~El~qEl~d~GV~AD~gAeeRA~~RRDELh~~Lst~RsRr~~~Ek 1080 (1480)
T COG3096 1001 TRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQELQDIGVRADSGAEERARIRRDELHAQLSTNRSRRNQLEK 1080 (1480)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhCCCcCcchHHHHHHHHHHHHHHHhccHhHHHHHHH
Confidence 33333344444444444444444444444455555555444432 12222233334445577776666666555555
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 387 TIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLF 424 (434)
Q Consensus 387 tiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~ 424 (434)
.|-... ...|...|.+|++-|+-|-.|-++-
T Consensus 1081 qlT~~E-------~E~~~L~~~~rK~ErDY~~~Re~VV 1111 (1480)
T COG3096 1081 QLTFCE-------AEMDNLTRKLRKLERDYFEMREQVV 1111 (1480)
T ss_pred HHHHHH-------HHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 554332 3456777888888888887776654
No 216
>COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones]
Probab=75.45 E-value=11 Score=37.96 Aligned_cols=80 Identities=20% Similarity=0.319 Sum_probs=57.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC---C
Q psy18020 324 IDKEKAELERSISFLKEKETELDEILAVLTE-KTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRG---V 399 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~eLe~~I~~l~~-k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g---~ 399 (434)
|+..+..+++.++.=+++++|+++.|++.+. .++...-+ +.+-+-|-++....++.+...++|.+++. .
T Consensus 77 ikfD~~~~n~l~kkneeki~Elde~i~~~eedngE~e~~e-------a~~n~aeyY~qi~D~~ng~~~~~~~~~~a~stg 149 (412)
T COG5187 77 IKFDRGRMNTLLKKNEEKIEELDERIREKEEDNGETEGSE-------ADRNIAEYYCQIMDIQNGFEWMRRLMRDAMSTG 149 (412)
T ss_pred eehhhHHHHHHHHhhHHHHHHHHHHHHHHhhcccchHHHH-------HHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhcc
Confidence 5556666777777777788888777776642 34555555 55666667777777888888888888865 4
Q ss_pred CCHHHHHHHHH
Q psy18020 400 IDLEVYLKQVR 410 (434)
Q Consensus 400 I~ld~flK~vR 410 (434)
...|+||-++|
T Consensus 150 ~KiDv~l~kiR 160 (412)
T COG5187 150 LKIDVFLCKIR 160 (412)
T ss_pred cchhhHHHHHH
Confidence 57789998887
No 217
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=75.41 E-value=13 Score=30.37 Aligned_cols=38 Identities=8% Similarity=0.122 Sum_probs=26.7
Q ss_pred HhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHH
Q psy18020 382 AATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFR 420 (434)
Q Consensus 382 ~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~R 420 (434)
..+++.|..+|. =..|+|+++.|++.+..|+-..+-.|
T Consensus 47 ~~v~~mi~~~D~-d~DG~I~F~EF~~l~~~l~~~~~~~~ 84 (89)
T cd05022 47 EGLEEKMKNLDV-NQDSKLSFEEFWELIGELAKAVKGEK 84 (89)
T ss_pred HHHHHHHHHhCC-CCCCCCcHHHHHHHHHHHHHHHHHHh
Confidence 344444433322 13689999999999999998888765
No 218
>TIGR01554 major_cap_HK97 phage major capsid protein, HK97 family. This family represents the major capsid protein component of the heads (capsids) of bacteriophage HK97, phi-105, P27, and related phage. This model represents one of several analogous families lacking detectable sequence similarity. The gene encoding this component is typically located in an operon encoding the small and large terminase subunits, the portal protein and the prohead or maturation protease.
Probab=75.35 E-value=21 Score=36.72 Aligned_cols=37 Identities=16% Similarity=0.278 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
.+.+.++.+|+...+++++....++.+++++++....
T Consensus 30 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (378)
T TIGR01554 30 TAAALEKEELETDVEKLKEEIKLLEDAIADLEKVTEE 66 (378)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Confidence 3355556666666666666666666666666554443
No 219
>PRK14148 heat shock protein GrpE; Provisional
Probab=75.29 E-value=52 Score=31.09 Aligned_cols=91 Identities=9% Similarity=0.035 Sum_probs=42.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccc
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i 364 (434)
|..++.+...++.+..+.+.++++|...-+...++++.++. +..+++-+..|-.-.+.|++.+..... ..++..++
T Consensus 45 l~~~l~~l~~e~~elkd~~lR~~Ae~eN~rKR~~rE~e~~~--~~a~~~~~~~LLpV~DnlerAl~~~~~--~~~~~~l~ 120 (195)
T PRK14148 45 AKDTIKELEDSCDQFKDEALRAKAEMENIRKRAERDVSNAR--KFGIEKFAKELLPVIDSIEQALKHEVK--LEEAIAMK 120 (195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhHHhHHHHHHhcccc--chhHHHHH
Confidence 34444333333333334455555555554444444444332 233445555555555666666553221 12334444
Q ss_pred cccchHHHHHHHhhH
Q psy18020 365 TTTAPIYKQILNTFT 379 (434)
Q Consensus 365 ~~~~~l~~Ql~~l~A 379 (434)
.+-.-.++||.+.+.
T Consensus 121 ~Gv~mi~k~l~~vL~ 135 (195)
T PRK14148 121 EGIELTAKMLVDILK 135 (195)
T ss_pred HHHHHHHHHHHHHHH
Confidence 455555666655553
No 220
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=75.27 E-value=1.2e+02 Score=36.48 Aligned_cols=80 Identities=11% Similarity=0.193 Sum_probs=37.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CCCCccccc---cccchHHHHHHHhhHHHHhhhhHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK----TEVDVDEAV---TTTAPIYKQILNTFTEEAATEDTIY 389 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k----~~~diDe~i---~~~~~l~~Ql~~l~Aed~AieDtiy 389 (434)
++....+.++.++++...+..++.+.+.++..|..+..+ ++.||.+++ .....+-+++-++-.+-.+++....
T Consensus 297 ~~~l~~~~~e~~~~~~~~~~~~~~~l~~~~~~L~~i~~~~~~ye~~~i~~~~~~~~~l~~~~~~~~~l~~~~~~Lt~~~~ 376 (1201)
T PF12128_consen 297 LEKLEDEIKELRDELNKELSALNADLARIKSELDEIEQQKKDYEDADIEQLIARVDQLPEWRNELENLQEQLDLLTSKHQ 376 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 333334444444444444444444444444444444321 223444443 3344455555555555555555555
Q ss_pred HHHHHHh
Q psy18020 390 YMGEALR 396 (434)
Q Consensus 390 ~L~~aL~ 396 (434)
-+...++
T Consensus 377 di~~ky~ 383 (1201)
T PF12128_consen 377 DIESKYN 383 (1201)
T ss_pred HHHHHHH
Confidence 5555444
No 221
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=74.98 E-value=13 Score=40.02 Aligned_cols=38 Identities=21% Similarity=0.135 Sum_probs=18.6
Q ss_pred CCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 358 VDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 358 ~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
.++.++..-.+-+..|+-++.++..++++.+--|.+.|
T Consensus 124 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 161 (525)
T TIGR02231 124 PDLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQL 161 (525)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444555555555555555554444443
No 222
>KOG0161|consensus
Probab=74.88 E-value=88 Score=39.23 Aligned_cols=26 Identities=8% Similarity=0.180 Sum_probs=22.5
Q ss_pred CCCCHHHHHHHHHHHhCCCCCccccC
Q psy18020 61 QNSKLEGLIKTLASAFSALPPLYSSR 86 (434)
Q Consensus 61 ~~s~L~~li~~l~~~F~~~pPl~~~~ 86 (434)
++++-..|+.-|+..|....|-|.++
T Consensus 534 PkAtd~tf~~kL~~~~~gk~~~f~~~ 559 (1930)
T KOG0161|consen 534 PKATDKTFLEKLCDQHLGKHPKFQKP 559 (1930)
T ss_pred CCCccchHHHHHHHHhhccCccccCc
Confidence 58899999999999997777888776
No 223
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=74.72 E-value=39 Score=33.57 Aligned_cols=19 Identities=0% Similarity=0.146 Sum_probs=8.0
Q ss_pred HhhHHHHhhhhHHHHHHHH
Q psy18020 376 NTFTEEAATEDTIYYMGEA 394 (434)
Q Consensus 376 ~l~Aed~AieDtiy~L~~a 394 (434)
++-++-.++.+=|.+|.+.
T Consensus 121 ~le~~i~~L~eEl~fl~~~ 139 (312)
T PF00038_consen 121 DLENQIQSLKEELEFLKQN 139 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhh
Confidence 3333344444444444433
No 224
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=74.66 E-value=44 Score=36.05 Aligned_cols=43 Identities=30% Similarity=0.228 Sum_probs=30.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 312 QGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 312 ~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
.-...+...+.+|......|+..+..+++++++|++-++.+..
T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~l~~ 173 (525)
T TIGR02231 131 QAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLT 173 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 3334455666777777777888888888888888777777754
No 225
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=74.63 E-value=14 Score=32.63 Aligned_cols=45 Identities=16% Similarity=0.070 Sum_probs=28.4
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
++||..+-..|+-+...++..+..|.+.|..|+-...+-..-...
T Consensus 27 RaEmkarIa~LEGE~r~~e~l~~dL~rrIkMLE~aLkqER~k~~~ 71 (134)
T PF08232_consen 27 RAEMKARIAFLEGERRGQENLKKDLKRRIKMLEYALKQERAKYKK 71 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 556666656666667777777777777777776665555444443
No 226
>KOG3647|consensus
Probab=74.60 E-value=40 Score=33.30 Aligned_cols=49 Identities=14% Similarity=0.217 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~ 333 (434)
|+..+++...+++..-..|.....++..-.++|++-..+|+..+..|+.
T Consensus 110 lk~aIq~i~~~~q~~~~~Lnnvasdea~L~~Kierrk~ElEr~rkRle~ 158 (338)
T KOG3647|consen 110 LKSAIQAIQVRLQSSRAQLNNVASDEAALGSKIERRKAELERTRKRLEA 158 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3433433333333333334444444444445555555555555555444
No 227
>PRK12705 hypothetical protein; Provisional
Probab=74.56 E-value=1.2e+02 Score=32.90 Aligned_cols=23 Identities=26% Similarity=0.299 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLK 339 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~ 339 (434)
|++....|+...+.|++.-+.|.
T Consensus 93 l~~~~~~l~~~~~~l~~~~~~l~ 115 (508)
T PRK12705 93 LDARAEKLDNLENQLEEREKALS 115 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333
No 228
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=74.53 E-value=64 Score=31.45 Aligned_cols=7 Identities=0% Similarity=0.425 Sum_probs=2.9
Q ss_pred HHHHhhH
Q psy18020 373 QILNTFT 379 (434)
Q Consensus 373 Ql~~l~A 379 (434)
|++.++|
T Consensus 206 hlv~lls 212 (302)
T PF10186_consen 206 HLVSLLS 212 (302)
T ss_pred HHHHHHH
Confidence 4444444
No 229
>PF04094 DUF390: Protein of unknown function (DUF390); InterPro: IPR007228 This domain is found in a family of long proteins that are currently found only in rice. They have no known function. However they may be some kind of transposable element. There is a putative gypsy type transposon domain (IPR007321 from INTERPRO) towards the N terminus of the proteins.
Probab=74.52 E-value=19 Score=40.10 Aligned_cols=110 Identities=16% Similarity=0.241 Sum_probs=64.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVD 361 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diD 361 (434)
...++.-|+.+++.+++|-..|...=+-+..|...++.|++. .++.-+..+..|+...+.|++++.+.+.. -.
T Consensus 380 ~~~v~~gl~aq~~al~~era~l~a~w~rv~egrr~v~~mv~~---grk~~~~~~~e~~ar~~~l~~v~re~eee----r~ 452 (828)
T PF04094_consen 380 NSTVREGLNAQVQALAAERAALDAEWARVDEGRRAVDAMVEV---GRKAHQAHLAEIQAREETLDSVMRETEEE----RQ 452 (828)
T ss_pred hhHHhhhhhhHHHHHHHHHHHHHHHHHHHhhccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HH
Confidence 345666677666666666665555555555566556666554 34444455566666666666665544321 11
Q ss_pred ccccccchHHHH------HHHhhHHHHh--hhhHHHHHHHHHhcC
Q psy18020 362 EAVTTTAPIYKQ------ILNTFTEEAA--TEDTIYYMGEALRRG 398 (434)
Q Consensus 362 e~i~~~~~l~~Q------l~~l~Aed~A--ieDtiy~L~~aL~~g 398 (434)
.++++++++..- -|+..++|.+ ++|+-..||-|..+.
T Consensus 453 aalias~~l~ea~~~irlqy~~~~~~l~k~~~~a~gvldaaaarE 497 (828)
T PF04094_consen 453 AALIASSVLDEALGDIRLQYEAHAEDLAKRVDDARGVLDAAAARE 497 (828)
T ss_pred HHHHHHHHHHhhccccccccchHHHHHHHHHHhhhhhhhhhhhhh
Confidence 233444444322 3788888876 789999999887654
No 230
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=74.50 E-value=36 Score=26.02 Aligned_cols=28 Identities=32% Similarity=0.444 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
+..+.+-+.....|...|..|+.+++++
T Consensus 31 e~kLqeaE~rn~eL~~ei~~L~~e~ee~ 58 (61)
T PF08826_consen 31 ESKLQEAEKRNRELEQEIERLKKEMEEL 58 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3333444444555666666666666554
No 231
>PF10205 KLRAQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019343 This entry represents a N-terminal 100 residues long domain, which contains a conserved KLRAQ motif. This domain is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain a C-terminal TTKRSYEDQ motif domain (IPR019348 from INTERPRO). The function of these proteins is not known.
Probab=74.45 E-value=52 Score=27.82 Aligned_cols=41 Identities=24% Similarity=0.364 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
+.|+.-.++|.. ++.++..|.+-.+.|.+++..|+.+.+..
T Consensus 33 e~Lk~ke~~LRk----~eqE~dSL~FrN~QL~kRV~~LQ~El~~~ 73 (102)
T PF10205_consen 33 EQLKEKEQALRK----LEQENDSLTFRNQQLTKRVEVLQEELEES 73 (102)
T ss_pred HHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444444 77788888888899999998888877754
No 232
>PF11172 DUF2959: Protein of unknown function (DUF2959); InterPro: IPR021342 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=74.36 E-value=55 Score=30.99 Aligned_cols=121 Identities=21% Similarity=0.306 Sum_probs=73.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH
Q psy18020 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQT----ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL-DEILAV 351 (434)
Q Consensus 277 l~sal~dkL~~~L~e~~~q~~~ele~L~~~----q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eL-e~~I~~ 351 (434)
|.+.|+ .-+..+++...+....+++|..+ -.+|+...++|.++.+..+.....+...|..+++..+.| ++|-.+
T Consensus 26 lvdrVe-~Ardsq~eaqeQF~sALe~f~sl~~~~ggdLe~~Y~~ln~~ye~s~~~A~~V~~RI~~vE~Va~ALF~EWe~E 104 (201)
T PF11172_consen 26 LVDRVE-DARDSQQEAQEQFKSALEQFKSLVNFDGGDLEDKYNALNDEYESSEDAAEEVSDRIDAVEDVADALFDEWEQE 104 (201)
T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444 33555566666666667766654 347888888888888888888888888888888877666 778777
Q ss_pred hcCCCCCCccc----cccccchHHHHHHHhhH--HH------HhhhhHHHHHHHHHhcC
Q psy18020 352 LTEKTEVDVDE----AVTTTAPIYKQILNTFT--EE------AATEDTIYYMGEALRRG 398 (434)
Q Consensus 352 l~~k~~~diDe----~i~~~~~l~~Ql~~l~A--ed------~AieDtiy~L~~aL~~g 398 (434)
++.=.+...-. =+..+---|.+|+.... |+ .++.|-.-+|..-||..
T Consensus 105 L~~Y~~~sLR~~S~~kL~~tr~~Y~~L~~aM~~Ae~km~PVL~~~~D~vL~LKHNLNA~ 163 (201)
T PF11172_consen 105 LDQYSNASLRRASEQKLAETRRRYAQLIKAMRRAESKMQPVLAAFRDQVLYLKHNLNAQ 163 (201)
T ss_pred HHHHcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhccccHH
Confidence 74222211111 12233334566655432 22 24556556666666543
No 233
>PF05010 TACC: Transforming acidic coiled-coil-containing protein (TACC); InterPro: IPR007707 This family contains the proteins TACC 1, 2 and 3, found concentrated in the centrosomes of eukaryotes which may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1) [].
Probab=74.33 E-value=63 Score=30.82 Aligned_cols=75 Identities=24% Similarity=0.353 Sum_probs=0.0
Q ss_pred HHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH-HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 274 KMSLKSAVED-------------KIRRKMNEQKVQAQEEIEILKQ-TENELNQGKFKLARMFERIDKEKAELERSISFLK 339 (434)
Q Consensus 274 r~sl~sal~d-------------kL~~~L~e~~~q~~~ele~L~~-~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~ 339 (434)
...|+..+.+ .|+....+.+.....+++.+++ +++++.. |+..+...+-....|+..+..-.
T Consensus 113 EE~Lkk~~~ey~~~l~~~eqry~aLK~hAeekL~~ANeei~~v~~~~~~e~~a----Lqa~lkk~e~~~~SLe~~LeQK~ 188 (207)
T PF05010_consen 113 EETLKKCIEEYEERLKKEEQRYQALKAHAEEKLEKANEEIAQVRSKHQAELLA----LQASLKKEEMKVQSLEESLEQKT 188 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHh
Q psy18020 340 EKETELDEILAVL 352 (434)
Q Consensus 340 ~~~~eLe~~I~~l 352 (434)
.+.++|.+++++|
T Consensus 189 kEn~ELtkICDeL 201 (207)
T PF05010_consen 189 KENEELTKICDEL 201 (207)
T ss_pred HHHHHHHHHHHHH
No 234
>PF04048 Sec8_exocyst: Sec8 exocyst complex component specific domain; InterPro: IPR007191 Sec8 is a component of the exocyst complex involved in the docking of exocystic vesicles with a fusion site on the plasma membrane. The exocyst complex is composed of Sec3, Sec5, Sec6, Sec8, Sec10, Sec15, Exo70 and Exo84.; GO: 0006904 vesicle docking involved in exocytosis, 0015031 protein transport, 0000145 exocyst
Probab=74.25 E-value=63 Score=28.64 Aligned_cols=76 Identities=18% Similarity=0.268 Sum_probs=49.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHh
Q psy18020 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS---FLKEKETELDEILAVL 352 (434)
Q Consensus 277 l~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~---~L~~~~~eLe~~I~~l 352 (434)
+...+...|..-+++..+.....+..+..+...+...++++......|+.-+..|...-. .|-.+..+..+.|+.|
T Consensus 48 ~~~~~~~~L~~vV~eh~q~Fn~sI~sy~~i~~~i~~sq~~i~~lK~~L~~ak~~L~~~~~eL~~L~~~s~~~~~mi~iL 126 (142)
T PF04048_consen 48 LKKRIEKALQEVVNEHYQGFNSSIGSYSQILSSISESQERIRELKESLQEAKSLLGCRREELKELWQRSQEYKEMIEIL 126 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHH
Confidence 345566666666677777777777888888888888888887777777777777654333 3333444444444444
No 235
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=74.23 E-value=74 Score=31.53 Aligned_cols=28 Identities=11% Similarity=0.075 Sum_probs=20.5
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhc
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRR 397 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~ 397 (434)
.-..|-++-.|...|+++|.++.--+++
T Consensus 233 ~~~rl~~l~~~~~~l~k~~~~~~sKV~k 260 (269)
T PF05278_consen 233 MKGRLGELEMESTRLSKTIKSIKSKVEK 260 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556778888888888888887765553
No 236
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=74.23 E-value=11 Score=32.13 Aligned_cols=43 Identities=19% Similarity=0.182 Sum_probs=23.2
Q ss_pred HhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 382 AATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLF 424 (434)
Q Consensus 382 ~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~ 424 (434)
..++||+.+|++.+..=.-.++..-+....+..+.=..+..|+
T Consensus 83 ~~~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l~ 125 (129)
T cd00890 83 KSLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEELQ 125 (129)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5566777777776654333445555555555554433343333
No 237
>KOG3850|consensus
Probab=74.19 E-value=30 Score=35.83 Aligned_cols=32 Identities=25% Similarity=0.346 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCccccc
Q psy18020 333 RSISFLKEKETELDEILAVLTEKTEVDVDEAV 364 (434)
Q Consensus 333 ~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i 364 (434)
.+|..|+.+.+.+++.++-....-.-||.|++
T Consensus 324 nEi~nLKqElasmeervaYQsyERaRdIqEal 355 (455)
T KOG3850|consen 324 NEIANLKQELASMEERVAYQSYERARDIQEAL 355 (455)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555666666655544444445666654
No 238
>PF04065 Not3: Not1 N-terminal domain, CCR4-Not complex component ; InterPro: IPR007207 The Ccr4-Not complex (Not1, Not2, Not3, Not4 and Not5) is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID []. This domain is the N-terminal region of the Not proteins.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=74.15 E-value=74 Score=30.91 Aligned_cols=110 Identities=18% Similarity=0.196 Sum_probs=65.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---C
Q psy18020 288 KMNEQKVQAQEEIEILKQTENELNQGK---------FKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE---K 355 (434)
Q Consensus 288 ~L~e~~~q~~~ele~L~~~q~EL~~~~---------~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~---k 355 (434)
.|.+.+..++..||+++....+..... .++.-...+-...+.=|...|..|..+++.++.-++.+.. +
T Consensus 75 ~L~e~Rk~IE~~MErFK~vEkesKtKafSkeGL~~~~k~dp~e~ek~e~~~wl~~~Id~L~~QiE~~E~E~E~L~~~~kK 154 (233)
T PF04065_consen 75 KLLENRKLIEEQMERFKVVEKESKTKAFSKEGLMAASKLDPKEKEKEEARDWLKDSIDELNRQIEQLEAEIESLSSQKKK 154 (233)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccchhhhhcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 356677778888888877765544322 1111122223334555667777777777777777766542 1
Q ss_pred C--CCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHH
Q psy18020 356 T--EVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEV 404 (434)
Q Consensus 356 ~--~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~ 404 (434)
+ .-...+ -...|-..+..-.--=+-|-.|-|+|.+|.|+.+.
T Consensus 155 kk~~~~~~~-------r~~~l~~~ierhk~Hi~kLE~lLR~L~N~~l~~e~ 198 (233)
T PF04065_consen 155 KKKDSTKQE-------RIEELESRIERHKFHIEKLELLLRLLDNDELDPEQ 198 (233)
T ss_pred CccCccchh-------HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHH
Confidence 1 122222 44555556666666667777888889999887643
No 239
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=74.09 E-value=82 Score=35.03 Aligned_cols=11 Identities=9% Similarity=0.220 Sum_probs=4.7
Q ss_pred CCHHHHHHHHH
Q psy18020 63 SKLEGLIKTLA 73 (434)
Q Consensus 63 s~L~~li~~l~ 73 (434)
+-=..|+..+.
T Consensus 39 ~GKttll~ai~ 49 (650)
T TIGR03185 39 AGKTTLLDAIQ 49 (650)
T ss_pred CCHHHHHHHHH
Confidence 33444444444
No 240
>KOG0612|consensus
Probab=74.07 E-value=1.8e+02 Score=34.74 Aligned_cols=58 Identities=22% Similarity=0.178 Sum_probs=36.1
Q ss_pred chHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 368 APIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKH 429 (434)
Q Consensus 368 ~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~ 429 (434)
.-..+||....-+-.-++|++-.|.... +-+..=.|.+|..++.-+-.|.++.-+...
T Consensus 570 ~e~~~~iq~~~e~~~~~~d~l~~le~~k----~~ls~~~~~~~~~~e~~~~~~~~~~e~~~~ 627 (1317)
T KOG0612|consen 570 KELSKQIQQELEENRDLEDKLSLLEESK----SKLSKENKKLRSELEKERRQRTEISEIIAE 627 (1317)
T ss_pred hhhhHHHHHHhhccccHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3355666666666667777777665553 335555666777777666666666655543
No 241
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=74.03 E-value=53 Score=37.14 Aligned_cols=10 Identities=30% Similarity=0.557 Sum_probs=5.9
Q ss_pred CCCCceeccc
Q psy18020 45 DHTGKIYSPY 54 (434)
Q Consensus 45 d~~G~~~~py 54 (434)
..+|++||=.
T Consensus 304 ~~~G~l~h~i 313 (717)
T PF10168_consen 304 TSNGKLYHCI 313 (717)
T ss_pred ecCCeEEEEE
Confidence 4567776433
No 242
>PF11166 DUF2951: Protein of unknown function (DUF2951); InterPro: IPR021337 This family of proteins has no known function. It has a highly conserved sequence.
Probab=74.01 E-value=50 Score=27.44 Aligned_cols=56 Identities=16% Similarity=0.227 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
-+.++.++.-|...+.-..++++.+.++..++..=+++...-+..+.++..||-.+
T Consensus 24 ~~~L~~Ik~gq~~qe~v~~kld~tlD~i~reRe~dee~k~~n~Knir~~KmwilGl 79 (98)
T PF11166_consen 24 FNKLDEIKDGQHDQELVNQKLDRTLDEINREREEDEENKKKNDKNIRDIKMWILGL 79 (98)
T ss_pred HHHHHHHHHhHhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 33344444444444444455677777776665544444444444466677776544
No 243
>COG1722 XseB Exonuclease VII small subunit [DNA replication, recombination, and repair]
Probab=73.83 E-value=28 Score=28.14 Aligned_cols=58 Identities=17% Similarity=0.311 Sum_probs=46.0
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
++.++.|.++=..|+.|.-.|++-+...+++..-.+..-..|......++..+.....
T Consensus 13 E~~l~eLE~IV~~LE~Gel~Le~sl~~~erG~~L~k~c~~~L~~Ae~~v~~l~~~~~~ 70 (81)
T COG1722 13 EEALAELEEIVESLESGELPLEEALKEFERGMALYKECQEKLQQAEQRVEKLLEEDEE 70 (81)
T ss_pred HHHHHHHHHHHHHHHcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 4455566677777888888899999999998888888888888888888888876543
No 244
>KOG2685|consensus
Probab=73.77 E-value=1.1e+02 Score=32.04 Aligned_cols=91 Identities=19% Similarity=0.185 Sum_probs=61.4
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHH
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILN 376 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~ 376 (434)
..-++.+..=+.+|....++|..++..|..++..|+..+..++.-....++.|...+....+ |=+ -|.+- -+
T Consensus 76 ~~R~~di~~Wk~el~~ele~l~~E~~~L~~~k~rle~~L~~~~~P~~ia~eCL~~RekR~~~--dlv---~D~Ve---~E 147 (421)
T KOG2685|consen 76 GQRLDDVNFWKGELDRELEDLAAEIDDLLHEKRRLERALNALALPLSIAEECLAHREKRQGI--DLV---HDEVE---TE 147 (421)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHhhcccc--hhh---ccccH---HH
Confidence 33445566678888888888999999999999999999999999888888888875433322 221 01122 23
Q ss_pred hhHHHHhhhhHHHHHHHHH
Q psy18020 377 TFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 377 l~Aed~AieDtiy~L~~aL 395 (434)
|+-|-..|+++.-.|.+-+
T Consensus 148 L~kE~eli~~~q~ll~~~~ 166 (421)
T KOG2685|consen 148 LHKEVELIENIQELLKKTL 166 (421)
T ss_pred HHhHHHHHHHHHHHHHHHH
Confidence 4445556666665555543
No 245
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=73.76 E-value=29 Score=39.63 Aligned_cols=45 Identities=16% Similarity=0.171 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~ 329 (434)
|-.+|.+++.+.+.+.+.+....++++..++.++...++|++.++
T Consensus 521 li~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~ 565 (782)
T PRK00409 521 LIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEED 565 (782)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444444444444444444444444444444433
No 246
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=73.69 E-value=22 Score=32.40 Aligned_cols=26 Identities=38% Similarity=0.440 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 329 AELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 329 ~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
.+|...|..|+++.++++.-|..++.
T Consensus 112 ~el~~~i~~l~~e~~~l~~kL~~l~~ 137 (169)
T PF07106_consen 112 EELREEIEELEEEIEELEEKLEKLRS 137 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34555666666666666666666654
No 247
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=73.60 E-value=57 Score=30.02 Aligned_cols=17 Identities=12% Similarity=0.237 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHHHhc
Q psy18020 337 FLKEKETELDEILAVLT 353 (434)
Q Consensus 337 ~L~~~~~eLe~~I~~l~ 353 (434)
+++++...++.-|..++
T Consensus 117 ~~r~e~~~~~~ki~e~~ 133 (177)
T PF07798_consen 117 RIREEQAKQELKIQELN 133 (177)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 248
>PF11887 DUF3407: Protein of unknown function (DUF3407); InterPro: IPR024516 This entry represents a domain of unknown function found at the C terminus of many proteins in the mammalian cell entry family.
Probab=73.59 E-value=18 Score=35.73 Aligned_cols=90 Identities=11% Similarity=0.193 Sum_probs=66.8
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHH
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTE 380 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Ae 380 (434)
..|..+-..|..+.++|.+.+..|...-.+|+.++..+.+.++.+.+..+.+.+. ..|+.+++...+.+.+.|.+ +
T Consensus 17 ~tL~ala~Al~GrG~~lg~~l~~l~~~l~~ln~~~~~l~~~l~~l~~v~~~~a~a-apdL~~~l~~~~~~s~tL~~---~ 92 (267)
T PF11887_consen 17 ATLSALATALDGRGEQLGETLDDLNTLLATLNPRLPQLREDLRNLADVADTYADA-APDLLDALDNLTTTSRTLVD---Q 92 (267)
T ss_pred HHHHHHHHHHcCcchhHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHH---h
Confidence 4566777888888999999999999999999999999999999998888887543 34777777777777766654 4
Q ss_pred HHhhhhHHHHHHHH
Q psy18020 381 EAATEDTIYYMGEA 394 (434)
Q Consensus 381 d~AieDtiy~L~~a 394 (434)
...|+..+-.+..+
T Consensus 93 ~~~L~~lL~~~~~~ 106 (267)
T PF11887_consen 93 RQQLDALLLSATGL 106 (267)
T ss_pred HHHHHHHHHHHHHH
Confidence 44444444444433
No 249
>KOG0977|consensus
Probab=73.57 E-value=1.4e+02 Score=32.71 Aligned_cols=59 Identities=10% Similarity=0.034 Sum_probs=33.6
Q ss_pred HHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHh
Q psy18020 371 YKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLS-RKQFMFRALLFKCYKH 429 (434)
Q Consensus 371 ~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~La-ReQF~~Ra~~~Ki~~~ 429 (434)
-..+-+|.+|..-|.+.|..+.+.|++..+.--.+.-++-.|= +=.|+.+-+-..|...
T Consensus 168 e~e~~~Lk~en~rl~~~l~~~r~~ld~Etllr~d~~n~~q~Lleel~f~~~~h~~eI~e~ 227 (546)
T KOG0977|consen 168 EDELKRLKAENSRLREELARARKQLDDETLLRVDLQNRVQTLLEELAFLKRIHKQEIEEE 227 (546)
T ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhccHHHHHHH
Confidence 3444556666666777777777777666544444444444433 3367776666655543
No 250
>KOG2896|consensus
Probab=73.50 E-value=43 Score=34.52 Aligned_cols=51 Identities=25% Similarity=0.361 Sum_probs=34.1
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
..+...|.|.+.++.++..+++|+.+....+..+.++..+++.|.+...++
T Consensus 121 ~l~~~~Ealsk~~~~~~k~~~kL~~kr~q~~~~q~~l~k~~k~l~e~~~~l 171 (377)
T KOG2896|consen 121 NLQRQIEALSKKRAHLEKTKQKLEDKRQQFNASQVKLQKQLKSLIELRNEL 171 (377)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 345566666666777777777777777777777777777776655444444
No 251
>KOG0250|consensus
Probab=73.46 E-value=1.9e+02 Score=33.96 Aligned_cols=27 Identities=7% Similarity=0.106 Sum_probs=16.9
Q ss_pred cccccchhHHHHHHHhhhhcCCCCCCC
Q psy18020 129 WTAQNSKLEGLIKTLASAFSALPPLYS 155 (434)
Q Consensus 129 w~~~~s~l~~l~~~~~~~f~~~pp~~s 155 (434)
|+..++||..++..|...+..+.=+.|
T Consensus 157 VStKk~dl~~vv~~f~I~veNP~~~ls 183 (1074)
T KOG0250|consen 157 VSTKKEDLDTVVDHFNIQVENPMFVLS 183 (1074)
T ss_pred ccccHHHHHHHHHHhCcCCCCcchhhc
Confidence 666677777766666666655444443
No 252
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=73.30 E-value=1.1e+02 Score=32.02 Aligned_cols=25 Identities=12% Similarity=0.122 Sum_probs=12.8
Q ss_pred HHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 372 KQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
++|.+-.++=.|--|.+|.=.+.|.
T Consensus 154 ~~l~~qr~ql~aq~qsl~a~~k~LQ 178 (499)
T COG4372 154 KTLAEQRRQLEAQAQSLQASQKQLQ 178 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444555666666665
No 253
>PRK01156 chromosome segregation protein; Provisional
Probab=73.03 E-value=76 Score=36.55 Aligned_cols=29 Identities=14% Similarity=0.140 Sum_probs=14.0
Q ss_pred HHHHHHhhHHHHhhhhHHHHHHHHHhcCCC
Q psy18020 371 YKQILNTFTEEAATEDTIYYMGEALRRGVI 400 (434)
Q Consensus 371 ~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I 400 (434)
.+++-++-.-..+++| +.-+.++|....+
T Consensus 722 ~~~~~~l~~~~~~~~~-l~~~r~~l~k~~~ 750 (895)
T PRK01156 722 NETLESMKKIKKAIGD-LKRLREAFDKSGV 750 (895)
T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHhhhccc
Confidence 3333333333445554 4556666665444
No 254
>COG1842 PspA Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]
Probab=73.03 E-value=92 Score=30.04 Aligned_cols=96 Identities=18% Similarity=0.276 Sum_probs=60.4
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~ 382 (434)
++.++.+|.. +++.+..+-..+..++..+..+....+++++.-...=.+++ ..|.++. +.+..
T Consensus 33 ird~~~~l~~----ar~~~A~~~a~~k~~e~~~~~~~~~~~k~e~~A~~Al~~g~----------E~LAr~a---l~~~~ 95 (225)
T COG1842 33 IRDMESELAK----ARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGN----------EDLAREA---LEEKQ 95 (225)
T ss_pred HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC----------HHHHHHH---HHHHH
Confidence 4555666655 66666677777777777777777666666554433222222 2244444 44666
Q ss_pred hhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 383 ATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 383 AieDtiy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
.++|.+-.++..+..-.=.++.--++++.|-++
T Consensus 96 ~le~~~~~~~~~~~~~~~~~~~l~~~~~~Le~K 128 (225)
T COG1842 96 SLEDLAKALEAELQQAEEQVEKLKKQLAALEQK 128 (225)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 788888888888776666666666666666654
No 255
>PRK06800 fliH flagellar assembly protein H; Validated
Probab=72.84 E-value=27 Score=32.60 Aligned_cols=16 Identities=19% Similarity=0.210 Sum_probs=5.8
Q ss_pred HHHHHHHHHhHHHHHH
Q psy18020 303 LKQTENELNQGKFKLA 318 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~ 318 (434)
|.++|.+|.....+|+
T Consensus 43 L~~~Q~~L~~e~~~l~ 58 (228)
T PRK06800 43 LLAQQKSLHKELNQLR 58 (228)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333333333333333
No 256
>KOG2264|consensus
Probab=72.66 E-value=34 Score=37.21 Aligned_cols=33 Identities=18% Similarity=0.358 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
+..++.+||..+++|++.|.++..+++++++.|
T Consensus 91 Vs~EL~ele~krqel~seI~~~n~kiEelk~~i 123 (907)
T KOG2264|consen 91 VSLELTELEVKRQELNSEIEEINTKIEELKRLI 123 (907)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 444555555555566666666666665555544
No 257
>KOG1029|consensus
Probab=72.64 E-value=1.8e+02 Score=33.17 Aligned_cols=30 Identities=13% Similarity=0.328 Sum_probs=18.1
Q ss_pred HHhhhhHHHHHHHHHhcCCCCHHH---HHHHHH
Q psy18020 381 EAATEDTIYYMGEALRRGVIDLEV---YLKQVR 410 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~g~I~ld~---flK~vR 410 (434)
-.+|+|.+..|.+--....-+.|. =+|..|
T Consensus 544 rq~ikdqldelskE~esk~~eidi~n~qlkelk 576 (1118)
T KOG1029|consen 544 RQAIKDQLDELSKETESKLNEIDIFNNQLKELK 576 (1118)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHH
Confidence 457788888887776654434444 455444
No 258
>PF10018 Med4: Vitamin-D-receptor interacting Mediator subunit 4; InterPro: IPR019258 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Members of this family represent the Med4 subunit of the Mediator (Med) complex [, ]. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=72.62 E-value=27 Score=32.56 Aligned_cols=33 Identities=12% Similarity=0.279 Sum_probs=17.8
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSI 335 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i 335 (434)
|.+++++......++++++..|...+..|...+
T Consensus 31 I~~L~~e~~~ld~~i~~~~~~L~~~~~~L~~~~ 63 (188)
T PF10018_consen 31 IQQLRAEIEELDEQIRDILKQLKEARKELRTLP 63 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444456666666666665555554
No 259
>PF14712 Snapin_Pallidin: Snapin/Pallidin
Probab=72.55 E-value=50 Score=26.71 Aligned_cols=19 Identities=21% Similarity=0.289 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy18020 332 ERSISFLKEKETELDEILA 350 (434)
Q Consensus 332 e~~i~~L~~~~~eLe~~I~ 350 (434)
++++..+.+....+++.+.
T Consensus 70 kkrm~~l~~~l~~lk~R~~ 88 (92)
T PF14712_consen 70 KKRMSNLHERLQKLKKRAD 88 (92)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444443
No 260
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=72.40 E-value=13 Score=29.20 Aligned_cols=42 Identities=17% Similarity=0.278 Sum_probs=33.8
Q ss_pred hhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 385 EDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 385 eDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~ 427 (434)
-|.|..+-+-|++++|+=|+|+|++|...-+|-+ +..|.+++
T Consensus 27 ~~~l~~~Y~~~k~~kIsR~~fvr~lR~IVGD~lL-~s~I~~lq 68 (70)
T PF12174_consen 27 MDLLQKHYEEFKKKKISREEFVRKLRQIVGDQLL-RSAIKSLQ 68 (70)
T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH-HHHHHHhc
Confidence 5667777788999999999999999999998855 44555544
No 261
>TIGR00255 conserved hypothetical protein TIGR00255. The apparent ortholog from Aquifex aeolicus as reported is split into two consecutive reading frames.
Probab=72.23 E-value=1.1e+02 Score=30.64 Aligned_cols=65 Identities=18% Similarity=0.320 Sum_probs=45.6
Q ss_pred CCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC-------CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 357 EVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRG-------VIDLEVYLKQVRSLSRKQFMFRALLFKCYKH 429 (434)
Q Consensus 357 ~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g-------~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~ 429 (434)
..||+|=++--....+|.-++..++.++.--+.+|-+=++|. ..+.+ +++-=--.+.++.|||.+
T Consensus 215 K~DI~EEl~RL~sHl~~f~~~L~~~~~vGrkLDFL~QEmnRE~NTigSKs~d~~--------is~~vVe~K~eiEkiREQ 286 (291)
T TIGR00255 215 RIDIAEEIDRLDSHVKEFYNILKKGEAVGRKLDFMMQELNRESNTLASKAIDAD--------ITNLAVEMKVLIEKIKEQ 286 (291)
T ss_pred hcchHHHHHHHHHHHHHHHHHHhcCCCcCcchhHHHHHHhHHHHHHHHccCcHH--------HHHHHHHHHHHHHHHHHH
Confidence 347777666666678899999999889888888888888775 22333 223333457888888854
No 262
>PRK14145 heat shock protein GrpE; Provisional
Probab=72.16 E-value=64 Score=30.51 Aligned_cols=49 Identities=14% Similarity=0.138 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 300 IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.+.+.++++|+..-+...++++.++. +..+++-+..|-.-.+.|++.+.
T Consensus 65 ~d~~lR~~AEfeN~rkR~~kE~e~~~--~~a~e~~~~~LLpV~DnLerAl~ 113 (196)
T PRK14145 65 LDIAQRLKAEFENYRKRTEKEKSEMV--EYGKEQVILELLPVMDNFERALA 113 (196)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhHHhHHHHHHh
Confidence 34455555555554444444444432 23334445555555555555554
No 263
>smart00212 UBCc Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
Probab=72.13 E-value=3.4 Score=36.56 Aligned_cols=38 Identities=29% Similarity=0.526 Sum_probs=33.4
Q ss_pred ccCcccCCCccccCCCC--ccccccchhHHHHHHHhhhhcC
Q psy18020 111 VSQNVDHTGKIYSPYLH--TWTAQNSKLEGLIKTLASAFSA 149 (434)
Q Consensus 111 ~~~~v~~~g~~~~pyl~--~w~~~~s~l~~l~~~~~~~f~~ 149 (434)
-..+||.+|++.+++|. +|.+. .+|.++|+.+...|.+
T Consensus 72 ~Hp~i~~~G~icl~~l~~~~W~p~-~~l~~il~~i~~~l~~ 111 (145)
T smart00212 72 YHPNVDSSGEICLDILKQEKWSPA-TTLETVLLSIQSLLSE 111 (145)
T ss_pred eEeeECCCCCEehhhcCCCCCCCC-CcHHHHHHHHHHHHhC
Confidence 36778999999999998 89955 8899999999999976
No 264
>PF09602 PhaP_Bmeg: Polyhydroxyalkanoic acid inclusion protein (PhaP_Bmeg); InterPro: IPR011728 This entry describes a protein found in polyhydroxyalkanoic acid (PHA) gene regions and incorporated into PHA inclusions in Bacillus cereus and Bacillus megaterium. The role of the protein may include amino acid storage [].
Probab=72.12 E-value=81 Score=28.97 Aligned_cols=22 Identities=27% Similarity=0.433 Sum_probs=15.0
Q ss_pred HHHhcCCC--CHHHHHHHHHHHHH
Q psy18020 393 EALRRGVI--DLEVYLKQVRSLSR 414 (434)
Q Consensus 393 ~aL~~g~I--~ld~flK~vR~LaR 414 (434)
+.++|+-+ -+|.|++.+|.+-.
T Consensus 134 qk~~r~e~qk~~~~yv~~~k~~q~ 157 (165)
T PF09602_consen 134 QKLTREEWQKVLDAYVEQAKSSQK 157 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556544 68899999887643
No 265
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=72.00 E-value=27 Score=29.88 Aligned_cols=46 Identities=22% Similarity=0.294 Sum_probs=19.2
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+..|......+++....+......|+..-..|.+.....+++|..+
T Consensus 13 ~~~l~~kr~e~~~~~~~~~~~e~~L~~~e~~l~~~~~~f~~flken 58 (126)
T PF13863_consen 13 QLALDTKREEIERREEQLKQREEELEKKEQELEEDVIKFDKFLKEN 58 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3333333333444444444444444444444444444444444433
No 266
>PF05700 BCAS2: Breast carcinoma amplified sequence 2 (BCAS2); InterPro: IPR008409 This family consists of several eukaryotic sequences of unknown function. The mammalian members of this family are annotated as breast carcinoma amplified sequence 2 (BCAS2) proteins []. BCAS2 is a putative spliceosome associated protein [].
Probab=71.92 E-value=95 Score=29.67 Aligned_cols=6 Identities=33% Similarity=0.921 Sum_probs=2.8
Q ss_pred cccccc
Q psy18020 52 SPYLHT 57 (434)
Q Consensus 52 ~pyL~~ 57 (434)
|||++.
T Consensus 12 LPYiD~ 17 (221)
T PF05700_consen 12 LPYIDP 17 (221)
T ss_pred CCCCCC
Confidence 444444
No 267
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=71.88 E-value=52 Score=33.27 Aligned_cols=7 Identities=14% Similarity=0.506 Sum_probs=2.7
Q ss_pred HHHHHHH
Q psy18020 404 VYLKQVR 410 (434)
Q Consensus 404 ~flK~vR 410 (434)
.||--++
T Consensus 244 ~~L~~~~ 250 (314)
T PF04111_consen 244 AFLDCLQ 250 (314)
T ss_dssp HHHHHHH
T ss_pred HHHHHHH
Confidence 3333333
No 268
>PF04799 Fzo_mitofusin: fzo-like conserved region; InterPro: IPR006884 This entry represents the heptad repeat domain which is conserved at the C terminus of Fzo/mitofusion family of GTPases. Fzo is a mediator of mitochondrial fusion during spermatogenesis []. This conserved region is also found in the human mitofusin protein []. This domain forms a dimeric antiparallel coiled coil structure, which has been proposed to act as a mitochodrial tether before vesicle fusion [].; GO: 0003924 GTPase activity, 0006184 GTP catabolic process, 0008053 mitochondrial fusion, 0005741 mitochondrial outer membrane, 0016021 integral to membrane; PDB: 1T3J_A.
Probab=71.80 E-value=21 Score=33.00 Aligned_cols=46 Identities=24% Similarity=0.258 Sum_probs=21.2
Q ss_pred HHHHHHH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 296 AQEEIEI-LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341 (434)
Q Consensus 296 ~~~ele~-L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~ 341 (434)
.+.||.. +..+....+.....|+.++.+|+.+.+.||+....++.-
T Consensus 103 VqqeL~~tf~rL~~~Vd~~~~eL~~eI~~L~~~i~~le~~~~~~k~L 149 (171)
T PF04799_consen 103 VQQELSSTFARLCQQVDQTKNELEDEIKQLEKEIQRLEEIQSKSKTL 149 (171)
T ss_dssp -------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344432 344444555555556666666666666665555544433
No 269
>PF08044 DUF1707: Domain of unknown function (DUF1707); InterPro: IPR012551 This domain is found in a variety of actinomycetales proteins. All of the proteins containing this domain are hypothetical and probably membrane bound or associated. Currently, it is unclear to the function of this domain.
Probab=71.72 E-value=8.4 Score=28.52 Aligned_cols=35 Identities=14% Similarity=0.291 Sum_probs=28.5
Q ss_pred HHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaR 414 (434)
.|.-=+.++..|+.|+..|+|++++|=..+-....
T Consensus 4 sd~dR~~~~~~L~~a~a~GrL~~~Ef~~R~~~a~~ 38 (53)
T PF08044_consen 4 SDADRERAVDLLRAAFAEGRLSLDEFDERLDAAYA 38 (53)
T ss_pred cHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHh
Confidence 34455788999999999999999999888765543
No 270
>PF02646 RmuC: RmuC family; InterPro: IPR003798 This protein contains several bacterial RmuC DNA recombination proteins. The function of the RMUC protein is unknown but it is suspected that it is either a structural protein that protects DNA against nuclease action, or is itself involved in DNA cleavage at the regions of DNA secondary structures []. Proteins in this family are predicted to contain a central endonuclease-like fold domain, surrounded by coiled coils, consistent with a direct role in DNA cleavage [, ].
Probab=71.70 E-value=77 Score=31.81 Aligned_cols=17 Identities=24% Similarity=0.358 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy18020 334 SISFLKEKETELDEILA 350 (434)
Q Consensus 334 ~i~~L~~~~~eLe~~I~ 350 (434)
.++.|.++...|.+.+.
T Consensus 49 ~~~~l~~~~~~L~~aL~ 65 (304)
T PF02646_consen 49 EIQQLSQEASNLTSALK 65 (304)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 34566666677777776
No 271
>PRK05260 condesin subunit F; Provisional
Probab=71.39 E-value=62 Score=33.90 Aligned_cols=27 Identities=7% Similarity=0.159 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHH
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFER 323 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~e 323 (434)
.+.++.+...|.-|...++.++.++..
T Consensus 165 aeifd~Idl~QR~mDeqQ~~vk~eIA~ 191 (440)
T PRK05260 165 AEIFDSIDLTQRLMDEQQQQVKDDIAQ 191 (440)
T ss_pred HHHHhhhhhHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444433
No 272
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=71.22 E-value=53 Score=32.64 Aligned_cols=37 Identities=27% Similarity=0.322 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEK--------------ETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~--------------~~eLe~~I~~l~ 353 (434)
|+.-..++++.+++++..-..|.++ ..+||..+..++
T Consensus 119 LKEARkEIkQLkQvieTmrssL~ekDkGiQKYFvDINiQN~KLEsLLqsME 169 (305)
T PF15290_consen 119 LKEARKEIKQLKQVIETMRSSLAEKDKGIQKYFVDINIQNKKLESLLQSME 169 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhchhhhhHHHHHhhhhhhHhHHHHHHHHHH
Confidence 4444445555555555544444443 455666666543
No 273
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=71.14 E-value=46 Score=34.53 Aligned_cols=29 Identities=17% Similarity=0.398 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 324 IDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
|+.....+++|+..++...+.|++.|++|
T Consensus 359 L~~ve~~~~~N~~~i~~n~~~le~Ri~~L 387 (388)
T PF04912_consen 359 LNKVEEKFKENMETIEKNVKKLEERIAKL 387 (388)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 33344445556666666666666666544
No 274
>KOG4674|consensus
Probab=71.06 E-value=1.1e+02 Score=38.17 Aligned_cols=72 Identities=8% Similarity=-0.001 Sum_probs=46.5
Q ss_pred CCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 356 TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 356 ~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~ 427 (434)
+++|..++=--.+-..+.-=++.+++..|.|+=--|++.=+.-+-.+|.+-..++.|.++==-++.+..+..
T Consensus 1302 k~~d~~~~~kL~~ei~~Lk~el~~ke~~~~el~~~~~~~q~~~k~qld~l~~e~~~lt~~~~ql~~~~~rL~ 1373 (1822)
T KOG4674|consen 1302 KDSDKNDYEKLKSEISRLKEELEEKENLIAELKKELNRLQEKIKKQLDELNNEKANLTKELEQLEDLKTRLA 1373 (1822)
T ss_pred hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555666654444556666677778888888877666665544445778888877777776555555444444
No 275
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=71.01 E-value=28 Score=36.43 Aligned_cols=17 Identities=41% Similarity=0.364 Sum_probs=6.8
Q ss_pred HHHHHHhHHHHHHHHHH
Q psy18020 306 TENELNQGKFKLARMFE 322 (434)
Q Consensus 306 ~q~EL~~~~~~L~~~~~ 322 (434)
+|+|+..-++.|..+.+
T Consensus 274 Hq~Ei~~LKqeLa~~EE 290 (395)
T PF10267_consen 274 HQNEIYNLKQELASMEE 290 (395)
T ss_pred HHHHHHHHHHHHHhHHH
Confidence 34444443333333333
No 276
>PLN02939 transferase, transferring glycosyl groups
Probab=70.93 E-value=1.4e+02 Score=34.98 Aligned_cols=58 Identities=16% Similarity=0.133 Sum_probs=40.2
Q ss_pred ccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 366 TTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 366 ~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~ 427 (434)
...+++..+-.+--|...+.|-|.+|...|.. +..-=+.++.|-||--+..+.++.+.
T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (977)
T PLN02939 220 CVHSLSKELDVLKEENMLLKDDIQFLKAELIE----VAETEERVFKLEKERSLLDASLRELE 277 (977)
T ss_pred ccccHHHHHHHHHHHhHHHHHHHHHHHHHHHH----HHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 34568888888888999999999998887743 33333455666676666665555443
No 277
>PRK13455 F0F1 ATP synthase subunit B; Provisional
Probab=70.72 E-value=66 Score=29.66 Aligned_cols=14 Identities=21% Similarity=0.240 Sum_probs=6.6
Q ss_pred HhhhhHHHHHHHHH
Q psy18020 382 AATEDTIYYMGEAL 395 (434)
Q Consensus 382 ~AieDtiy~L~~aL 395 (434)
..|++.|.-|+.+|
T Consensus 170 ~lid~~i~~l~~~~ 183 (184)
T PRK13455 170 ALIDEAIKEVEARL 183 (184)
T ss_pred HHHHHHHHHHhhhc
Confidence 44444444444444
No 278
>PF02321 OEP: Outer membrane efflux protein; InterPro: IPR003423 The OEP family (Outer membrane efflux protein) form trimeric channels that allow export of a variety of substrates in Gram negative bacteria. Each member of this family is composed of two repeats. The trimeric channel is composed of a 12 stranded all beta sheet barrel that spans the outer membrane, and a long all helical barrel that spans the periplasm. Examples include the Escherichia coli TolC outer membrane protein, which is required for proper expression of outer membrane protein genes; the Rhizobium nodulation protein; and the Pseudomonas FusA protein, which is involved in resistance to fusaric acid.; GO: 0005215 transporter activity, 0006810 transport; PDB: 3PIK_A 1YC9_A 3D5K_C 1WP1_B 2XMN_C 2WMZ_B 1EK9_B 2VDD_C 1TQQ_A 2VDE_B ....
Probab=70.70 E-value=70 Score=27.64 Aligned_cols=41 Identities=7% Similarity=0.069 Sum_probs=27.0
Q ss_pred CCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 355 KTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 355 k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
.+..+.-|++.+..-+.+-..++..-...+..++.-|..++
T Consensus 147 ~G~~~~~d~~~a~~~~~~~~~~~~~~~~~~~~~~~~L~~l~ 187 (188)
T PF02321_consen 147 AGLISELDLLQAQAQLLQAQLQLIQAQNDLQIALAQLKRLL 187 (188)
T ss_dssp TTSS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 45556655666777777777777777777777777666554
No 279
>KOG0804|consensus
Probab=70.67 E-value=71 Score=33.84 Aligned_cols=38 Identities=16% Similarity=0.283 Sum_probs=16.4
Q ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 304 KQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341 (434)
Q Consensus 304 ~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~ 341 (434)
...+..++..+..+++.+.+++...++|.++...+++.
T Consensus 367 ~~~~~~~e~~kk~~e~k~~q~q~k~~k~~kel~~~~E~ 404 (493)
T KOG0804|consen 367 KQESSDLEAEKKIVERKLQQLQTKLKKCQKELKEEREE 404 (493)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444444444444444444433
No 280
>KOG4302|consensus
Probab=70.66 E-value=1.2e+02 Score=33.85 Aligned_cols=70 Identities=19% Similarity=0.298 Sum_probs=46.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH---------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGK---------------FKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 283 dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~---------------~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
+.|+.+..+.+.++...++++..+-.+|...+ ++|+.....|...+++-..+++.+.+-..++..
T Consensus 113 e~lr~qk~eR~~ef~el~~qie~l~~~l~g~~~~~~~~~~D~~dlsl~kLeelr~~L~~L~~ek~~Rlekv~~~~~~I~~ 192 (660)
T KOG4302|consen 113 EGLRKQKDERRAEFKELYHQIEKLCEELGGPEDLPSFLIADESDLSLEKLEELREHLNELQKEKSDRLEKVLELKEEIKS 192 (660)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCCcccccCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46777778888888888888888888887761 445555555555555555555555555555555
Q ss_pred HHHHh
Q psy18020 348 ILAVL 352 (434)
Q Consensus 348 ~I~~l 352 (434)
..+.|
T Consensus 193 l~~~L 197 (660)
T KOG4302|consen 193 LCSVL 197 (660)
T ss_pred HHHHh
Confidence 55555
No 281
>KOG1003|consensus
Probab=70.48 E-value=98 Score=29.26 Aligned_cols=56 Identities=9% Similarity=0.068 Sum_probs=33.2
Q ss_pred CccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy18020 359 DVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 359 diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaR 414 (434)
|...+..-.-..-+++.+|--+..-+.-.+..|...-..-.=..|.|=+.+|.|+-
T Consensus 96 dLE~~eeraE~~Es~~~eLeEe~~~~~~nlk~l~~~ee~~~q~~d~~e~~ik~ltd 151 (205)
T KOG1003|consen 96 ELERAEERAEAAESQSEELEEDLRILDSNLKSLSAKEEKLEQKEEKYEEELKELTD 151 (205)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence 33333333344455666666666666666666666555555577778777777763
No 282
>PRK14144 heat shock protein GrpE; Provisional
Probab=70.39 E-value=41 Score=31.85 Aligned_cols=49 Identities=22% Similarity=0.217 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
+.+.++++++..-+...++++.++. +..+++.+..|-.-.+.|++.+..
T Consensus 66 dk~lR~~AefeN~RKR~~kE~e~~~--~~a~~~~~~~LLpV~DnLerAl~~ 114 (199)
T PRK14144 66 EKSVRALAELENVRRRMEREVANAH--KYGVEKLISALLPVVDSLEQALQL 114 (199)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhHHhHHHHHHHc
Confidence 4455555555554444444444332 334445555555556666666553
No 283
>PF12072 DUF3552: Domain of unknown function (DUF3552); InterPro: IPR022711 This presumed domain is functionally uncharacterised. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with PF00013 from PFAM, PF01966 from PFAM. This domain has a single completely conserved residue A that may be functionally important. ; GO: 0008663 2',3'-cyclic-nucleotide 2'-phosphodiesterase activity
Probab=70.38 E-value=96 Score=29.13 Aligned_cols=22 Identities=5% Similarity=0.161 Sum_probs=13.6
Q ss_pred CHHHHHHHHHHHHHHHHHHHHH
Q psy18020 401 DLEVYLKQVRSLSRKQFMFRAL 422 (434)
Q Consensus 401 ~ld~flK~vR~LaReQF~~Ra~ 422 (434)
+...++|.+-.-|++..-.+|-
T Consensus 166 e~a~~ir~~eeeak~~A~~~Ar 187 (201)
T PF12072_consen 166 EAAALIRRIEEEAKEEADKKAR 187 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666666665553
No 284
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=70.35 E-value=22 Score=37.95 Aligned_cols=35 Identities=20% Similarity=0.382 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
++.+++.|....+.+++.|+.|+++++.|++-++.
T Consensus 88 LrqElq~~saq~~dle~KIkeLEaE~~~Lk~Ql~a 122 (475)
T PRK13729 88 IRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKA 122 (475)
T ss_pred HHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHh
Confidence 44444444456666666666666666666666543
No 285
>KOG4722|consensus
Probab=70.35 E-value=1.5e+02 Score=31.29 Aligned_cols=35 Identities=14% Similarity=0.161 Sum_probs=27.0
Q ss_pred cccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhc
Q psy18020 363 AVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRR 397 (434)
Q Consensus 363 ~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~ 397 (434)
+|+++-..-|.-++++|+|...|.-+.-+.+-+.+
T Consensus 329 aFI~TLea~NKRhdlLaK~~e~Eekl~E~nee~~~ 363 (672)
T KOG4722|consen 329 AFIATLEADNKRHDLLAKDAEVEEKLLEKNEEFLN 363 (672)
T ss_pred HHHHHHHHhhhHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 46777777888899999999888888777766543
No 286
>PRK04863 mukB cell division protein MukB; Provisional
Probab=70.32 E-value=82 Score=38.67 Aligned_cols=66 Identities=9% Similarity=0.166 Sum_probs=48.8
Q ss_pred cccchHHHHHHHhhHHHHhhhhH--------HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 365 TTTAPIYKQILNTFTEEAATEDT--------IYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKHIS 431 (434)
Q Consensus 365 ~~~~~l~~Ql~~l~Aed~AieDt--------iy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~~~ 431 (434)
++.+...++.+.+ .||.+.-|. ..||.+-++...|-.+...+.+..|.++=-..+..|++-...++
T Consensus 1162 ~~~~~~~~~~~~~-~~~~~~~e~~~~~~~~~~~~~r~r~r~di~~~~~p~et~e~Le~ei~rl~~~L~e~Er~L~ 1235 (1486)
T PRK04863 1162 VADNEHLRDVLRL-SEDPKRPERKVQFYIAVYQHLRERIRQDIIRTDDPVEAIEQMEIELSRLTEELTSREQKLA 1235 (1486)
T ss_pred ccCcHHHHHHHhh-ccCCCchhHHHhHHHHHHHHHHHHHhhhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 4555566666666 666665543 35777778888998889999999999888888888887766555
No 287
>KOG0964|consensus
Probab=70.25 E-value=1.8e+02 Score=34.07 Aligned_cols=63 Identities=16% Similarity=0.182 Sum_probs=36.2
Q ss_pred ccchHHHHHHHhhHHHHhhhhHHHHHHHHHhc--C------------CCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 366 TTAPIYKQILNTFTEEAATEDTIYYMGEALRR--G------------VIDLEVYLKQVRSLSRKQFMFRALLFKCYK 428 (434)
Q Consensus 366 ~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~--g------------~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~ 428 (434)
-.+.+..-+.++-+++.++..-|..|..-++. | +=.+|.-...=..|.||.=-.|..|.++..
T Consensus 412 ~e~~lq~e~~~~e~~l~~~~e~i~~l~~si~e~~~r~~~~~~~~~~~k~~~del~~~Rk~lWREE~~l~~~i~~~~~ 488 (1200)
T KOG0964|consen 412 QENILQKEIEDLESELKEKLEEIKELESSINETKGRMEEFDAENTELKRELDELQDKRKELWREEKKLRSLIANLEE 488 (1200)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444555555555555555555555555442 1 224556666667788887777777766653
No 288
>PF14817 HAUS5: HAUS augmin-like complex subunit 5
Probab=70.07 E-value=31 Score=38.36 Aligned_cols=20 Identities=20% Similarity=0.406 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy18020 408 QVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 408 ~vR~LaReQF~~Ra~~~Ki~ 427 (434)
+++.+-++|||.+|.-++|.
T Consensus 122 ~~~~~~~k~~LL~Ay~q~c~ 141 (632)
T PF14817_consen 122 KISDSRHKQLLLEAYSQQCE 141 (632)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444443
No 289
>cd07663 BAR_SNX5 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 5. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. SNX5, abundantly expressed in macrophages, regulates macropinocytosis, a process that enables cells to internalize large amounts of external solutes. It may also be a component of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi, acting as a mammalian equivalent of yeast Vsp17p. It
Probab=70.01 E-value=1.1e+02 Score=29.51 Aligned_cols=42 Identities=12% Similarity=0.156 Sum_probs=22.9
Q ss_pred HHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLF 424 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~ 424 (434)
+|.-++|+|.++-+.+..-+ |.|-+++|.|+.=|=-.++|-+
T Consensus 103 d~~~L~e~L~~Y~r~~~A~K---~ll~rR~ral~~~e~A~~~L~K 144 (218)
T cd07663 103 QDLKLTELLRYYMLNIEAAK---DLLYRRARALADYENSNKALDK 144 (218)
T ss_pred HHhhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH
Confidence 34455666666666555433 5566666666655554444433
No 290
>KOG0425|consensus
Probab=69.96 E-value=4.2 Score=36.76 Aligned_cols=39 Identities=23% Similarity=0.276 Sum_probs=28.9
Q ss_pred EEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccccc
Q psy18020 14 CIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHT 57 (434)
Q Consensus 14 ~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~ 57 (434)
.+=+|.+||++||.+=. +..| =+++||.+|+|++-.||.
T Consensus 58 ~m~FP~dYP~sPP~~rF--~s~m---wHPNvy~~G~vCISILH~ 96 (171)
T KOG0425|consen 58 HMKFPQDYPLSPPTFRF--TSKM---WHPNVYEDGDVCISILHP 96 (171)
T ss_pred EEeCcccCCCCCCceee--ehhh---cCCCcCCCCCEEEEeecC
Confidence 35579999999998633 2222 148899999999888874
No 291
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=69.96 E-value=1.1e+02 Score=30.31 Aligned_cols=36 Identities=17% Similarity=0.307 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
++.++.+++..-..|+..+..|+.+.+.|++-|..+
T Consensus 214 ~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~l 249 (312)
T PF00038_consen 214 AKEELKELRRQIQSLQAELESLRAKNASLERQLREL 249 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHH
Confidence 333344444444444444444444444444444444
No 292
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=69.95 E-value=91 Score=29.13 Aligned_cols=36 Identities=28% Similarity=0.354 Sum_probs=16.1
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 309 ELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344 (434)
Q Consensus 309 EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~e 344 (434)
+|...-..|+.+..+|+.....++..++.++.+.++
T Consensus 124 ~l~~~i~~L~~e~~~L~~~~~~l~~~~e~~ek~~~e 159 (189)
T PF10211_consen 124 ELEEEIEELEEEKEELEKQVQELKNKCEQLEKREEE 159 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333334444444444444444444444444443
No 293
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=69.95 E-value=51 Score=26.62 Aligned_cols=56 Identities=23% Similarity=0.329 Sum_probs=0.0
Q ss_pred HHHHHHHHHH-HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 297 QEEIEILKQT-ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 297 ~~ele~L~~~-q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..+++.+... +++|...-..|++.+..|-..-+..+.....|+.+.+-|...|..|
T Consensus 7 ~~d~e~~~~e~k~~Li~ei~~LQ~sL~~L~~Rve~Vk~E~~kL~~EN~~Lq~YI~nL 63 (80)
T PF10224_consen 7 SEDIEKLEKEEKEELIQEILELQDSLEALSDRVEEVKEENEKLESENEYLQQYIGNL 63 (80)
T ss_pred hHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 294
>PF10241 KxDL: Uncharacterized conserved protein; InterPro: IPR019371 This entry represents a conserved region of 80 residues which defines a family of short proteins. There is a characteristic KxDL motif towards the C terminus. The function is unknown.
Probab=69.85 E-value=59 Score=26.49 Aligned_cols=40 Identities=13% Similarity=0.066 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 299 EIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338 (434)
Q Consensus 299 ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L 338 (434)
.-..|..+++++..+...+.++...|+....++...-..|
T Consensus 41 s~~rl~~~~~~f~~~~~~l~~mK~DLd~i~krir~lk~kl 80 (88)
T PF10241_consen 41 SQQRLAEARERFARHTKLLKEMKKDLDYIFKRIRSLKAKL 80 (88)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345666777777777777777777766666555443333
No 295
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=69.77 E-value=90 Score=29.29 Aligned_cols=8 Identities=25% Similarity=0.605 Sum_probs=4.2
Q ss_pred HHHHHHHH
Q psy18020 284 KIRRKMNE 291 (434)
Q Consensus 284 kL~~~L~e 291 (434)
.|+.||++
T Consensus 90 ~l~~RL~k 97 (190)
T PF05266_consen 90 FLRSRLNK 97 (190)
T ss_pred HHHHHHHH
Confidence 35555544
No 296
>PF13166 AAA_13: AAA domain
Probab=69.76 E-value=1.1e+02 Score=34.01 Aligned_cols=47 Identities=11% Similarity=0.106 Sum_probs=28.0
Q ss_pred HhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 382 AATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYK 428 (434)
Q Consensus 382 ~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~ 428 (434)
..++..|..+.+++..-.-.+...-+.+..|.++.......+..|.+
T Consensus 420 ~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~l~~~~~~~~~~~~~iN~ 466 (712)
T PF13166_consen 420 KELEKEINSLEKKLKKAKEEIKKIEKEIKELEAQLKNTEPAADRINE 466 (712)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 34455566666666555555666666677777666656555555554
No 297
>KOG0933|consensus
Probab=69.62 E-value=2.3e+02 Score=33.22 Aligned_cols=101 Identities=15% Similarity=0.166 Sum_probs=47.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCC
Q psy18020 322 ERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVID 401 (434)
Q Consensus 322 ~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ 401 (434)
.+.+..-..|+......+.+++.+++-|.++....+-.....+ ..|..|+..+.++..-.+-.+......|+...=.
T Consensus 254 ~e~~~~i~~l~e~~~k~~~ei~~le~~ikei~~~rd~em~~~~---~~L~~~~~~~~~~~tr~~t~l~~~~~tl~~e~~k 330 (1174)
T KOG0933|consen 254 EEMKDKIAKLDESLGKTDKEIESLEKEIKEIEQQRDAEMGGEV---KALEDKLDSLQNEITREETSLNLKKETLNGEEEK 330 (1174)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHhchhh---hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHH
Confidence 3333333334444444444444444444444322111111111 3467777777766666666666666666655444
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 402 LEVYLKQVRSLSRKQFMFRALLFK 425 (434)
Q Consensus 402 ld~flK~vR~LaReQF~~Ra~~~K 425 (434)
++.-.+++-.+.+.-=-.|+-+.|
T Consensus 331 ~e~i~~~i~e~~~~l~~k~~~~~~ 354 (1174)
T KOG0933|consen 331 LEEIRKNIEEDRKKLKEKEKAMAK 354 (1174)
T ss_pred HHHHHHhHHHHHHHHHHHHHHHhh
Confidence 444444444444433333333333
No 298
>KOG4552|consensus
Probab=69.60 E-value=69 Score=30.54 Aligned_cols=18 Identities=17% Similarity=0.248 Sum_probs=10.7
Q ss_pred hcCCCCHHHHHHHHHHHH
Q psy18020 396 RRGVIDLEVYLKQVRSLS 413 (434)
Q Consensus 396 ~~g~I~ld~flK~vR~La 413 (434)
+.+-|+-+..+|..-..+
T Consensus 122 ~krpvsSEelIKyAHrIS 139 (272)
T KOG4552|consen 122 EKRPVSSEELIKYAHRIS 139 (272)
T ss_pred hcCCCCHHHHHHHHHHhh
Confidence 345666676666655444
No 299
>KOG0980|consensus
Probab=69.50 E-value=1.9e+02 Score=33.36 Aligned_cols=22 Identities=18% Similarity=0.181 Sum_probs=13.3
Q ss_pred HhhHHHHhhhhHHHHHHHHHhc
Q psy18020 376 NTFTEEAATEDTIYYMGEALRR 397 (434)
Q Consensus 376 ~l~Aed~AieDtiy~L~~aL~~ 397 (434)
+.--+...++|.|--++++..+
T Consensus 463 ~~~~~~~~L~d~le~~~~~~~~ 484 (980)
T KOG0980|consen 463 DVEEENTNLNDQLEELQRAAGR 484 (980)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455667777777776654
No 300
>PF09763 Sec3_C: Exocyst complex component Sec3; InterPro: IPR019160 The exocyst complex is composed of 8 subunits: Exoc1, Exoc2, Exoc3, Exoc4, Exoc5, Exoc6, Exoc7 and Exoc8. This entry represents the subunit Exoc1 (Sec3). Sec3 binds to the C-terminal cytoplasmic domain of GLYT1 (glycine transporter protein 1). Sec3 is the exocyst component that is closest to the plasma membrane docking site and it serves as a spatial landmark in the plasma membrane for incoming secretory vesicles. Sec3 is recruited to the sites of polarised membrane growth through its interaction with Rho1p, a small GTP-binding protein.
Probab=69.47 E-value=81 Score=35.39 Aligned_cols=16 Identities=19% Similarity=0.183 Sum_probs=11.8
Q ss_pred HhhhhHHHHHHHHHhc
Q psy18020 382 AATEDTIYYMGEALRR 397 (434)
Q Consensus 382 ~AieDtiy~L~~aL~~ 397 (434)
..|++++-.|.+|++.
T Consensus 120 ~~~e~a~~~L~~Al~~ 135 (701)
T PF09763_consen 120 EKIEEAAEALYKALKA 135 (701)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4567777888888876
No 301
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=69.46 E-value=79 Score=32.95 Aligned_cols=61 Identities=15% Similarity=0.200 Sum_probs=35.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 291 e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
+++.++.++.+.|...+.+|...-..|+....+|.....+++..-+.|....+.++..-++
T Consensus 158 ~qr~ql~aq~qsl~a~~k~LQ~s~~Qlk~~~~~L~~r~~~ieQ~~~~la~r~~a~q~r~~e 218 (499)
T COG4372 158 EQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEE 218 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555556666666666666666666666666666666666665555444443333
No 302
>KOG4196|consensus
Probab=69.45 E-value=80 Score=27.84 Aligned_cols=43 Identities=26% Similarity=0.402 Sum_probs=20.5
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
+.||+..+..|.++++.|.++-..+..+..-++.+-+.|..+.
T Consensus 76 k~eLE~~k~~L~qqv~~L~~e~s~~~~E~da~k~k~e~l~~~~ 118 (135)
T KOG4196|consen 76 KHELEKEKAELQQQVEKLKEENSRLRRELDAYKSKYEALQNSA 118 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3444444444555555554444444444444444444444433
No 303
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=69.28 E-value=94 Score=28.56 Aligned_cols=9 Identities=11% Similarity=0.198 Sum_probs=3.5
Q ss_pred HHHHHHHHh
Q psy18020 388 IYYMGEALR 396 (434)
Q Consensus 388 iy~L~~aL~ 396 (434)
|.-|...+.
T Consensus 140 i~~lr~~iE 148 (177)
T PF07798_consen 140 IANLRTEIE 148 (177)
T ss_pred HHHHHHHHH
Confidence 333444433
No 304
>PRK14147 heat shock protein GrpE; Provisional
Probab=69.27 E-value=63 Score=29.83 Aligned_cols=17 Identities=24% Similarity=0.083 Sum_probs=7.0
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy18020 333 RSISFLKEKETELDEIL 349 (434)
Q Consensus 333 ~~i~~L~~~~~eLe~~I 349 (434)
+-+..|-.-.+.|++.+
T Consensus 69 ~~~~~lLpv~DnlerAl 85 (172)
T PRK14147 69 KLLGELLPVFDSLDAGL 85 (172)
T ss_pred HHHHHHhhhhhHHHHHH
Confidence 33333333444444444
No 305
>KOG2751|consensus
Probab=69.24 E-value=32 Score=36.13 Aligned_cols=40 Identities=30% Similarity=0.478 Sum_probs=22.3
Q ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 308 NELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 308 ~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
.+|...+++|-+++.+|+++.+.|...+.+++.+..++.+
T Consensus 186 ~~l~~eE~~L~q~lk~le~~~~~l~~~l~e~~~~~~~~~e 225 (447)
T KOG2751|consen 186 KNLKEEEERLLQQLEELEKEEAELDHQLKELEFKAERLNE 225 (447)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444555566666666666666666666555555433
No 306
>KOG0972|consensus
Probab=69.14 E-value=65 Score=32.29 Aligned_cols=44 Identities=9% Similarity=0.158 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L 338 (434)
++...|..+++.+.+|..-+++-++...........|.+.+..+
T Consensus 274 qL~~l~q~fr~a~~~lse~~e~y~q~~~gv~~rT~~L~eVm~e~ 317 (384)
T KOG0972|consen 274 QLASLMQKFRRATDTLSELREKYKQASVGVSSRTETLDEVMDEI 317 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHH
Confidence 33344444455555555444444333333333333333333333
No 307
>PF08581 Tup_N: Tup N-terminal; InterPro: IPR013890 The N-terminal region of the Tup protein has been shown to interact with the Ssn6 transcriptional co-repressor []. ; PDB: 3VP9_B 3VP8_B.
Probab=69.10 E-value=59 Score=26.17 Aligned_cols=17 Identities=35% Similarity=0.354 Sum_probs=7.0
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy18020 289 MNEQKVQAQEEIEILKQ 305 (434)
Q Consensus 289 L~e~~~q~~~ele~L~~ 305 (434)
|++.+..++.|++.+.+
T Consensus 2 l~elLd~ir~Ef~~~~~ 18 (79)
T PF08581_consen 2 LNELLDAIRQEFENLSQ 18 (79)
T ss_dssp HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33444444444444333
No 308
>PF10212 TTKRSYEDQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019348 This entry represents a C-terminal 500 residue region, which contains a conserved TTKRSYEDQ motif. It is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain an N-terminal domain with a KLRAQ motif (IPR019343 from INTERPRO). The function of these proteins is not known.
Probab=69.08 E-value=90 Score=33.75 Aligned_cols=23 Identities=9% Similarity=0.114 Sum_probs=11.9
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHH
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMG 392 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~ 392 (434)
++.+|-.+-.+=..-+|.|..|.
T Consensus 492 MSEHLasmNeqL~~Q~eeI~~LK 514 (518)
T PF10212_consen 492 MSEHLASMNEQLAKQREEIQTLK 514 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 44444444444455556665554
No 309
>PRK14140 heat shock protein GrpE; Provisional
Probab=69.03 E-value=1e+02 Score=28.97 Aligned_cols=32 Identities=19% Similarity=0.254 Sum_probs=18.1
Q ss_pred hHHHHHHHHHhcCC--CCHHHHHHHHHHHHHHHHH
Q psy18020 386 DTIYYMGEALRRGV--IDLEVYLKQVRSLSRKQFM 418 (434)
Q Consensus 386 Dtiy~L~~aL~~g~--I~ld~flK~vR~LaReQF~ 418 (434)
+.+.-|++||+... =++..|++-++ |-++||+
T Consensus 95 pvlDnLerAl~~~~~~~~~~~i~~Gv~-mi~k~l~ 128 (191)
T PRK14140 95 PALDNFERALQIEADDEQTKSLLKGVE-MVHRQLL 128 (191)
T ss_pred HHHHHHHHHHhccCccchHHHHHHHHH-HHHHHHH
Confidence 45666778887432 23566776666 3334444
No 310
>KOG0980|consensus
Probab=68.78 E-value=47 Score=37.92 Aligned_cols=29 Identities=14% Similarity=0.240 Sum_probs=20.0
Q ss_pred HHHhhhhHHHHHHHHHhcCCCCHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRRGVIDLEVYLKQ 408 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g~I~ld~flK~ 408 (434)
-..+|.|++|-|+.-++-+-.+...||=.
T Consensus 597 ~~q~lq~al~~ld~P~~~~~~~~p~~Lls 625 (980)
T KOG0980|consen 597 GIQALQNALYQLDSPLHWRCLTSPDFLLS 625 (980)
T ss_pred HHHHHHHHHHhcCCCcccCcCCCHHHHHH
Confidence 45667777777777777677777777644
No 311
>PRK14146 heat shock protein GrpE; Provisional
Probab=68.73 E-value=80 Score=30.26 Aligned_cols=75 Identities=16% Similarity=0.168 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhH
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFT 379 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~A 379 (434)
+.+.++++++..-+...++++.++ .+..+++-+..|-.-.+.|++.+... ...-+...++.+-.-+++||++.+.
T Consensus 75 d~~lR~~AdfeN~rkR~~kE~e~~--~~~a~e~~~~~lLpv~DnlerAl~~~--~~~~~~~~l~~Gv~mi~k~l~~~L~ 149 (215)
T PRK14146 75 DSWARERAEFQNFKRRSAQEFVSI--RKEAVKSLVSGFLNPIDNLERVGATQ--NQSEELKPFVEGVKMILKEFYSVLE 149 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhHHhHHHHHHhcc--cccchhhHHHHHHHHHHHHHHHHHH
Confidence 344445555544433333333332 22333444445555555555555422 1112334444444555566555553
No 312
>PRK14139 heat shock protein GrpE; Provisional
Probab=68.70 E-value=82 Score=29.50 Aligned_cols=8 Identities=38% Similarity=0.385 Sum_probs=3.5
Q ss_pred HHHHHHHH
Q psy18020 295 QAQEEIEI 302 (434)
Q Consensus 295 q~~~ele~ 302 (434)
+..++++.
T Consensus 57 R~~AefeN 64 (185)
T PRK14139 57 RAKAETEN 64 (185)
T ss_pred HHHHHHHH
Confidence 34444444
No 313
>PF05335 DUF745: Protein of unknown function (DUF745); InterPro: IPR007999 This family consists of several uncharacterised Drosophila melanogaster proteins of unknown function.
Probab=68.62 E-value=1e+02 Score=28.87 Aligned_cols=16 Identities=19% Similarity=0.223 Sum_probs=7.9
Q ss_pred HHHHHHHHhcCCCCHH
Q psy18020 388 IYYMGEALRRGVIDLE 403 (434)
Q Consensus 388 iy~L~~aL~~g~I~ld 403 (434)
+..|.+-|..-+.||+
T Consensus 153 ve~L~~QL~~Ar~D~~ 168 (188)
T PF05335_consen 153 VEELQRQLQAARADYE 168 (188)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444455555555544
No 314
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=68.59 E-value=52 Score=25.69 Aligned_cols=49 Identities=14% Similarity=0.297 Sum_probs=29.5
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 302 ILKQTENELNQGKFKLARMFERID----KEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 302 ~L~~~q~EL~~~~~~L~~~~~eLe----~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.++..+..|....+-|++|.-+++ ..+..+...+..++.+.+.+++-+.
T Consensus 26 ~i~~~e~~l~ea~~~l~qMe~E~~~~p~s~r~~~~~kl~~yr~~l~~lk~~l~ 78 (79)
T PF05008_consen 26 LIREIERDLDEAEELLKQMELEVRSLPPSERNQYKSKLRSYRSELKKLKKELK 78 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHCTS-HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 455566666665555555544443 4566677777777777776666543
No 315
>PF08580 KAR9: Yeast cortical protein KAR9; InterPro: IPR013889 The KAR9 protein in Saccharomyces cerevisiae (Baker's yeast) is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules []. KAR9 localises at the shmoo tip in mating cells and at the tip of the growing bud in anaphase [].
Probab=68.51 E-value=2.1e+02 Score=32.29 Aligned_cols=118 Identities=15% Similarity=0.147 Sum_probs=61.2
Q ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHH
Q psy18020 305 QTENELNQGKFKLARMFERIDKEKAELERS-------------ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIY 371 (434)
Q Consensus 305 ~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~-------------i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~ 371 (434)
...++|..+.+.|.+.-..|+.+.+.|+++ +.++....+.+++.+.++.+. +|..--+...+-+.
T Consensus 235 ~a~e~L~~r~~~L~~k~~~L~~e~~~LK~ELiedRW~~vFr~l~~q~~~m~esver~~~kl~~~--~~~~~~~~~~~~l~ 312 (683)
T PF08580_consen 235 SACEELEDRYERLEKKWKKLEKEAESLKKELIEDRWNIVFRNLGRQAQKMCESVERSLSKLQEA--IDSGIHLDNPSKLS 312 (683)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--ccccccccchHHHH
Confidence 334444444444444444444444444433 334444455555555555332 22222334445566
Q ss_pred HHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 372 KQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCY 427 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~ 427 (434)
+||=...+ .+.-++-.+.+|+.-+.|+-.+=-|-+..|+.++=-.+..+..+.
T Consensus 313 ~~i~s~~~---k~~~~~~~I~ka~~~sIi~~gv~~r~n~~L~~rW~~L~~~~d~~L 365 (683)
T PF08580_consen 313 KQIESKEK---KKSHYFPAIYKARVLSIIDKGVADRLNADLAQRWLELKEDMDSLL 365 (683)
T ss_pred HHHHHHHH---HHhccHHHHHHHHHHHhhhhhHHHHhhHHHHHHHHHHHHHHHHhh
Confidence 77655544 445555555566665566655444555678877777775555443
No 316
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=68.46 E-value=1.4e+02 Score=30.43 Aligned_cols=30 Identities=13% Similarity=0.294 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELD 346 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe 346 (434)
++..+..++.....++..+..++.+.+.++
T Consensus 149 l~~~i~~~~~~i~~~~~~l~~~~~~l~~~~ 178 (423)
T TIGR01843 149 ILAQIKQLEAELAGLQAQLQALRQQLEVIS 178 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333333333333
No 317
>KOG0976|consensus
Probab=68.41 E-value=1.9e+02 Score=33.14 Aligned_cols=66 Identities=17% Similarity=0.199 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCCCCccccccccchHHHHHHHhhHHHHhhhh
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETELDEILAVLTE---KTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~---k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieD 386 (434)
+..+++|+.-...|.+....-+....++.+.+++++. ..+-|+.+ ++-.|+---.++.++..+|.-
T Consensus 269 eE~m~qlk~kns~L~~ElSqkeelVk~~qeeLd~lkqt~t~a~gdseq---atkylh~enmkltrqkadirc 337 (1265)
T KOG0976|consen 269 EEKMRQLKAKNSVLGDELSQKEELVKELQEELDTLKQTRTRADGDSEQ---ATKYLHLENMKLTRQKADIRC 337 (1265)
T ss_pred HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHhhccHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 3333344444444444443333334444444444431 12224433 333444444555555555543
No 318
>PF06825 HSBP1: Heat shock factor binding protein 1; InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=68.38 E-value=19 Score=26.91 Aligned_cols=37 Identities=16% Similarity=0.280 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
|+++..+.+...+.|-..+..+.+++++||+.|..+-
T Consensus 12 L~qmq~kFq~mS~~I~~riDeM~~RIDdLE~si~dl~ 48 (54)
T PF06825_consen 12 LQQMQDKFQTMSDQILGRIDEMSSRIDDLEKSIADLM 48 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 4455555566667777778888888888888887764
No 319
>PF11802 CENP-K: Centromere-associated protein K; InterPro: IPR020993 Cenp-K is one of seven new Cenp-A-nucleosome distal (CAD) centromere components (the others being Cenp-L, Cenp-O, Cenp-P, Cenp-Q, Cenp-R and Cenp-S) that are identified as assembling on the Cenp-A nucleosome associated complex, NAC []. The Cenp-A NAC is essential, as disruption of the complex causes errors of chromosome alignment and segregation that preclude cell survival despite continued centromere-derived mitotic checkpoint signalling. Cenp-K is centromere-associated through its interaction with one or more components of the Cenp-A NAC.; GO: 0005634 nucleus
Probab=68.31 E-value=1.3e+02 Score=29.84 Aligned_cols=32 Identities=31% Similarity=0.283 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
....++|+.....|+..+..++.+.++|.+-+
T Consensus 88 ~lgkeelqkl~~eLe~vLs~~q~KnekLke~L 119 (268)
T PF11802_consen 88 TLGKEELQKLISELEMVLSTVQSKNEKLKEDL 119 (268)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444455555555555555555555544433
No 320
>PRK03918 chromosome segregation protein; Provisional
Probab=68.30 E-value=1.8e+02 Score=33.24 Aligned_cols=9 Identities=22% Similarity=0.154 Sum_probs=5.0
Q ss_pred CCCCHHHHH
Q psy18020 61 QNSKLEGLI 69 (434)
Q Consensus 61 ~~s~L~~li 69 (434)
.+|+|++.|
T Consensus 35 GKStil~ai 43 (880)
T PRK03918 35 GKSSILEAI 43 (880)
T ss_pred CHHHHHHHH
Confidence 356666554
No 321
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=68.12 E-value=2e+02 Score=31.99 Aligned_cols=30 Identities=27% Similarity=0.451 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 325 DKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 325 e~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
+.--..++..+.++++...+..+.++.+.+
T Consensus 121 EerL~ELE~~le~~~e~~~D~~kLLe~lqs 150 (617)
T PF15070_consen 121 EERLAELEEELERLQEQQEDRQKLLEQLQS 150 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 333344455555555555555555555543
No 322
>PRK14151 heat shock protein GrpE; Provisional
Probab=68.10 E-value=75 Score=29.46 Aligned_cols=20 Identities=25% Similarity=0.122 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy18020 331 LERSISFLKEKETELDEILA 350 (434)
Q Consensus 331 Le~~i~~L~~~~~eLe~~I~ 350 (434)
+++-+..|-.-.+.+++.+.
T Consensus 69 ~~~~~~~LLpv~DnlerAl~ 88 (176)
T PRK14151 69 LEKFAGDLLPVVDSLERGLE 88 (176)
T ss_pred HHHHHHHHhhHHhHHHHHHh
Confidence 33444444444445555443
No 323
>COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]
Probab=68.06 E-value=1.4e+02 Score=32.76 Aligned_cols=103 Identities=15% Similarity=0.144 Sum_probs=51.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccch----------HHHHH-------HHhhH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAP----------IYKQI-------LNTFT 379 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~----------l~~Ql-------~~l~A 379 (434)
+++++..++.....++...+.|.....+++..+...+.....+++++.+.-++ .+++. ..+.-
T Consensus 305 ~~~~~~~~~~~~~~~r~~g~ll~an~~~i~~~~~~v~~~~~~~~~~i~i~l~~~~~~~~~~~~~~~~~~klk~~~~~~~~ 384 (564)
T COG1293 305 QEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDR 384 (564)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhehhhhccccceeeccCcCcccchhhHHHhhhhhhccCceeehhh
Confidence 44444444444445555555555555555555554431111112222222221 12221 22333
Q ss_pred HHHhhhhHHHHHHHHHhc-CCCCHHHHHHHHH-HHHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRR-GVIDLEVYLKQVR-SLSRKQFMF 419 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~-g~I~ld~flK~vR-~LaReQF~~ 419 (434)
.-.+.++++.|+.++... .+++-..-+..+| +|..++|+.
T Consensus 385 ~~~~~~~~~~y~e~~~~~lek~~~~~~ieei~ee~ie~~~~~ 426 (564)
T COG1293 385 QLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLK 426 (564)
T ss_pred hhhhhHHHHHHHHHHHHHHHhccchhhHHHHHHHHHHHHHhh
Confidence 456778888888877653 2333256667776 677888885
No 324
>PRK14160 heat shock protein GrpE; Provisional
Probab=67.98 E-value=1.2e+02 Score=29.13 Aligned_cols=15 Identities=13% Similarity=0.045 Sum_probs=5.9
Q ss_pred HHHHHHHHHHHHHHH
Q psy18020 335 ISFLKEKETELDEIL 349 (434)
Q Consensus 335 i~~L~~~~~eLe~~I 349 (434)
+..|-.-.+.|++.+
T Consensus 114 ~~~LLpVlDnLerAl 128 (211)
T PRK14160 114 LKELLPVLDNLERAA 128 (211)
T ss_pred HHHHhhHHhHHHHHH
Confidence 333333344444443
No 325
>PRK14064 exodeoxyribonuclease VII small subunit; Provisional
Probab=67.84 E-value=34 Score=27.20 Aligned_cols=57 Identities=16% Similarity=0.169 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++.++.|..+-+.|+.+.-.|++.+..-+.+..-++..-..|+....+++..+...
T Consensus 8 fEe~l~~LE~IV~~LE~~~l~Leesl~~ye~G~~L~k~c~~~L~~ae~kv~~l~~~~ 64 (75)
T PRK14064 8 FEEAIAELETIVEALENGSASLEDSLDMYQKGIELTKLCQDKLQSAEKRMAKVVTDA 64 (75)
T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 455667777778888888888888888888888877777777777777777766543
No 326
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=67.82 E-value=77 Score=29.09 Aligned_cols=6 Identities=33% Similarity=0.412 Sum_probs=2.1
Q ss_pred hhhHHH
Q psy18020 384 TEDTIY 389 (434)
Q Consensus 384 ieDtiy 389 (434)
+++.+.
T Consensus 177 l~~~~~ 182 (191)
T PF04156_consen 177 LEEKIQ 182 (191)
T ss_pred HHHHHH
Confidence 333333
No 327
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=67.80 E-value=99 Score=28.25 Aligned_cols=34 Identities=21% Similarity=0.366 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
++.....|+..++.|+..+..|++-++.++..+.
T Consensus 96 ~re~E~qLr~rRD~LErrl~~l~~tierAE~l~s 129 (159)
T PF05384_consen 96 LREREKQLRERRDELERRLRNLEETIERAENLVS 129 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555555555555555555555555555444
No 328
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=67.78 E-value=1.1e+02 Score=28.98 Aligned_cols=121 Identities=17% Similarity=0.198 Sum_probs=60.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy18020 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEV 358 (434)
Q Consensus 279 sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~ 358 (434)
+.|.+.++..+++.+...++=.+.|.+.-.++...-.+++..+..+...+..++..+..+....+++++.....=.++.-
T Consensus 5 ~Rl~~iv~a~~n~~~dk~EDP~~~l~q~irem~~~l~~ar~~lA~~~a~~k~~e~~~~~~~~~~~~~~~~A~~Al~~G~E 84 (219)
T TIGR02977 5 SRFADIVNSNLNALLDKAEDPEKMIRLIIQEMEDTLVEVRTTSARTIADKKELERRVSRLEAQVADWQEKAELALSKGRE 84 (219)
T ss_pred HHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH
Confidence 34445555555555555444333444444444444444666666666667777776666666666665544332222221
Q ss_pred CccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy18020 359 DVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSL 412 (434)
Q Consensus 359 diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~L 412 (434)
| |.+..+. +-...+..+.-|..-+..-.-.++..-.+++.|
T Consensus 85 d----------LAr~Al~---~k~~~~~~~~~l~~~~~~~~~~v~~l~~~l~~L 125 (219)
T TIGR02977 85 D----------LARAALI---EKQKAQELAEALERELAAVEETLAKLQEDIAKL 125 (219)
T ss_pred H----------HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2 3333332 444445555555555554444444444444444
No 329
>COG1322 Predicted nuclease of restriction endonuclease-like fold, RmuC family [General function prediction only]
Probab=67.73 E-value=1.3e+02 Score=32.15 Aligned_cols=28 Identities=14% Similarity=0.112 Sum_probs=16.7
Q ss_pred HHHHHHHhhHHHHhhhhHHHHHHHHHhc
Q psy18020 370 IYKQILNTFTEEAATEDTIYYMGEALRR 397 (434)
Q Consensus 370 l~~Ql~~l~Aed~AieDtiy~L~~aL~~ 397 (434)
+..+|-.++-+..++.-=...|.++|.+
T Consensus 162 ~~~~i~~~lg~~~~la~e~~~Lt~~Lk~ 189 (448)
T COG1322 162 LLEEIDRLLGEIQQLAQEAGNLTAALKG 189 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 5555556666666655555556666654
No 330
>KOG0971|consensus
Probab=67.60 E-value=2.4e+02 Score=32.72 Aligned_cols=25 Identities=32% Similarity=0.505 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 329 AELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 329 ~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
+.|+..+..+++++++|+.-++.|+
T Consensus 328 esLQ~eve~lkEr~deletdlEILK 352 (1243)
T KOG0971|consen 328 ESLQQEVEALKERVDELETDLEILK 352 (1243)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455556666666666666666554
No 331
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=67.58 E-value=97 Score=30.74 Aligned_cols=43 Identities=19% Similarity=0.190 Sum_probs=31.2
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 311 NQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 311 ~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
...-+++++.+..|+.+...|+..|++-+.+.+..++-+..+.
T Consensus 175 ~~~~~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq 217 (267)
T PF10234_consen 175 QQQLQQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQ 217 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333447777777777888888888887777777777777775
No 332
>PRK02224 chromosome segregation protein; Provisional
Probab=67.54 E-value=1.3e+02 Score=34.46 Aligned_cols=35 Identities=26% Similarity=0.365 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
++..+.+++.....++..+..+++...+++..++.
T Consensus 263 l~~~i~~~e~~~~~l~~~i~~~~~~~~~le~e~~~ 297 (880)
T PRK02224 263 LRETIAETEREREELAEEVRDLRERLEELEEERDD 297 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444444444444444444443333
No 333
>KOG2911|consensus
Probab=67.47 E-value=78 Score=33.38 Aligned_cols=61 Identities=21% Similarity=0.295 Sum_probs=30.8
Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 290 NEQKVQAQEEIEILK-QTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 290 ~e~~~q~~~ele~L~-~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
..++..+..+++... ...+.|+.|..++...-- +.+..+++.++++......++..+....
T Consensus 246 ~kqie~L~qeie~~~~~~r~~~k~g~K~iA~~yl---r~rk~~eK~~er~~~~l~~l~~vl~~Id 307 (439)
T KOG2911|consen 246 AKQIEFLEQEIEKSKEKLRQALKEGKKQIAITYL---RARKLLEKDLERKVSSLNNLETVLSQID 307 (439)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHH---HHHHHHHhhHHHHHHHHHHHHHHHHHHH
Confidence 333344444444322 233555555544433222 2455666666666666666666665443
No 334
>PTZ00464 SNF-7-like protein; Provisional
Probab=67.21 E-value=74 Score=30.39 Aligned_cols=18 Identities=22% Similarity=0.217 Sum_probs=13.6
Q ss_pred CCCHHHHHHHHHHHHHHH
Q psy18020 399 VIDLEVYLKQVRSLSRKQ 416 (434)
Q Consensus 399 ~I~ld~flK~vR~LaReQ 416 (434)
.||-|.-...--.|-.|.
T Consensus 156 ~~DEdELe~ELe~Le~e~ 173 (211)
T PTZ00464 156 DIDEDEMLGELDALDFDM 173 (211)
T ss_pred CCCHHHHHHHHHHHHHHH
Confidence 478888888888887764
No 335
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=67.21 E-value=1.5e+02 Score=30.25 Aligned_cols=41 Identities=22% Similarity=0.212 Sum_probs=19.4
Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 310 LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 310 L~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+......++..+..++.+.+.++..+..++.+.+.+++.+.
T Consensus 135 ~~~~~~~~~~~~~~l~~~i~~~~~~i~~~~~~l~~~~~~l~ 175 (423)
T TIGR01843 135 FESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLE 175 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333344444455555555555555555544444444443
No 336
>PRK10869 recombination and repair protein; Provisional
Probab=67.12 E-value=1.9e+02 Score=31.70 Aligned_cols=13 Identities=8% Similarity=-0.187 Sum_probs=7.8
Q ss_pred CCCCHHHHHHHHH
Q psy18020 61 QNSKLEGLIKTLA 73 (434)
Q Consensus 61 ~~s~L~~li~~l~ 73 (434)
.+|+|++.|..+.
T Consensus 34 GKS~ildAi~~ll 46 (553)
T PRK10869 34 GKSIAIDALGLCL 46 (553)
T ss_pred ChHHHHHHHHHHh
Confidence 3566666666553
No 337
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=67.03 E-value=25 Score=25.14 Aligned_cols=34 Identities=21% Similarity=0.261 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~ 330 (434)
+.+.+.|++.-..|....+.|..+...|.++...
T Consensus 4 E~Dy~~LK~~yd~Lk~~~~~L~~E~~~L~aev~~ 37 (45)
T PF02183_consen 4 ERDYDALKASYDSLKAEYDSLKKENEKLRAEVQE 37 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555544444444444444444443333
No 338
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=67.01 E-value=48 Score=27.11 Aligned_cols=35 Identities=23% Similarity=0.346 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+..+..|+...+.|+..+..|+.+...+++-+.++
T Consensus 61 ~~~~~~L~~~~~~~~~~i~~l~~~~~~l~~~l~~~ 95 (106)
T PF01920_consen 61 EEAIEELEERIEKLEKEIKKLEKQLKYLEKKLKEL 95 (106)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445555555555555555555555554444433
No 339
>PF12269 zf-CpG_bind_C: CpG binding protein zinc finger C terminal domain; InterPro: IPR022056 This domain family is found in eukaryotes, and is approximately 240 amino acids in length. This domain is the zinc finger domain of a CpG binding DNA methyltransferase protein. It contains a CxxC motif which forms the zinc finger and binds to DNA.
Probab=66.88 E-value=16 Score=35.52 Aligned_cols=39 Identities=21% Similarity=0.336 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
+.-+..+++|+.++...+..+..|+++.++|+.||...+
T Consensus 25 E~~r~~Le~Ir~kq~~v~~~l~eLe~~~~el~~~i~~~k 63 (236)
T PF12269_consen 25 EQNRKLLEEIRKKQQKVRNRLQELEKRFKELEAIIARAK 63 (236)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444455555666666666666666666666666553
No 340
>PF06120 Phage_HK97_TLTM: Tail length tape measure protein; InterPro: IPR009302 This entry consists of the tail length tape measure protein from Bacteriophage HK97 and related sequences from Escherichia coli (strain K12).
Probab=66.85 E-value=1e+02 Score=31.06 Aligned_cols=27 Identities=15% Similarity=0.238 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKET 343 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~ 343 (434)
++.-+.+.+..-+.+++.|..|+.+++
T Consensus 79 ~~~si~~q~~~i~~l~~~i~~l~~~i~ 105 (301)
T PF06120_consen 79 AEESIAAQKRAIEDLQKKIDSLKDQIK 105 (301)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333333333
No 341
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=66.75 E-value=24 Score=27.33 Aligned_cols=34 Identities=26% Similarity=0.434 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
++..+++|+.+.+.|+..++.|+...+.+++...
T Consensus 29 l~~~i~~l~~e~~~L~~ei~~l~~~~~~ie~~AR 62 (80)
T PF04977_consen 29 LQKEIEELKKENEELKEEIERLKNDPDYIEKVAR 62 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Confidence 6666666777777777777777555666666555
No 342
>KOG2196|consensus
Probab=66.73 E-value=1.3e+02 Score=29.33 Aligned_cols=148 Identities=15% Similarity=0.155 Sum_probs=79.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-C
Q psy18020 281 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFK---LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK-T 356 (434)
Q Consensus 281 l~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~---L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k-~ 356 (434)
+..+-...|.++-.-....+.++.+--.+|-.+.++ |-.++..++..+++|+.+++-+....+||+..+..++.+ +
T Consensus 79 liNkWs~el~~Qe~vF~~q~~qvNaWDr~LI~ngekI~~Ly~e~~~vk~~qkrLdq~L~~I~sqQ~ELE~~L~~lE~k~~ 158 (254)
T KOG2196|consen 79 LINKWSLELEEQERVFLQQATQVNAWDRTLIENGEKISGLYNEVVKVKLDQKRLDQELEFILSQQQELEDLLDPLETKLE 158 (254)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333444445444333344444555555566666655 556667777777888888888888888888887777643 2
Q ss_pred CCCccccccccchHHHHHHHhhHH--------HHhhhhHHHHHHHHHhcCC------C----CHHHHHHHHHHHHHHHHH
Q psy18020 357 EVDVDEAVTTTAPIYKQILNTFTE--------EAATEDTIYYMGEALRRGV------I----DLEVYLKQVRSLSRKQFM 418 (434)
Q Consensus 357 ~~diDe~i~~~~~l~~Ql~~l~Ae--------d~AieDtiy~L~~aL~~g~------I----~ld~flK~vR~LaReQF~ 418 (434)
..+-+.+.-..|+---|.|..... +.-+++.|..|..+=+ +. | -|..-++..|.|=-.=|.
T Consensus 159 ~~~g~~~~~~~D~eR~qty~~a~nidsqLk~l~~dL~~ii~~lN~~~~-~~d~t~~~~qi~Kilnah~~sLqwl~d~st~ 237 (254)
T KOG2196|consen 159 LQSGHTYLSRADVEREQTYKMAENIDSQLKRLSEDLKQIIKSLNTMSK-TVDKTDPIIQIEKILNAHMDSLQWLDDNSTQ 237 (254)
T ss_pred ccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccC-ccccCCchHHHHHHHHHHHHHHHHHHhhhHH
Confidence 223333334444444555543322 2224445554443321 11 1 244455566666555566
Q ss_pred HHHHHHHHHHh
Q psy18020 419 FRALLFKCYKH 429 (434)
Q Consensus 419 ~Ra~~~Ki~~~ 429 (434)
+-..+.||.+.
T Consensus 238 ~e~k~d~i~K~ 248 (254)
T KOG2196|consen 238 LEKKLDKIKKL 248 (254)
T ss_pred HHHHHHHHHhh
Confidence 65555555543
No 343
>PF14932 HAUS-augmin3: HAUS augmin-like complex subunit 3
Probab=66.67 E-value=23 Score=34.72 Aligned_cols=13 Identities=38% Similarity=0.549 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHHH
Q psy18020 407 KQVRSLSRKQFMF 419 (434)
Q Consensus 407 K~vR~LaReQF~~ 419 (434)
|.-....++||+.
T Consensus 183 ~~L~~y~kKqF~~ 195 (256)
T PF14932_consen 183 KYLTSYTKKQFFQ 195 (256)
T ss_pred HHHHHHHHHhccc
Confidence 3344445556554
No 344
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=66.45 E-value=27 Score=26.20 Aligned_cols=25 Identities=20% Similarity=0.424 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEK 341 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~ 341 (434)
++.++..++.+-+.|...++.+++.
T Consensus 12 ~~~~i~tvk~en~~i~~~ve~i~en 36 (55)
T PF05377_consen 12 IESSINTVKKENEEISESVEKIEEN 36 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444444333333333
No 345
>KOG2629|consensus
Probab=66.41 E-value=75 Score=31.77 Aligned_cols=53 Identities=15% Similarity=0.269 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 298 ~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+.++.+.+++..|...+.+|.+.+.+|......+.+++++++.++..+...+=
T Consensus 140 ~l~~~va~v~q~~~~qq~Els~~L~~l~~~~~~~s~~~~k~esei~~Ik~lvl 192 (300)
T KOG2629|consen 140 ALMDEVAQVSQLLATQQSELSRALASLKNTLVQLSRNIEKLESEINTIKQLVL 192 (300)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHh
Confidence 33344555555666666667777777777777777788888777777765443
No 346
>KOG0964|consensus
Probab=66.40 E-value=2.4e+02 Score=32.96 Aligned_cols=20 Identities=20% Similarity=0.275 Sum_probs=13.8
Q ss_pred hcCCCCHHHHHHHHHHHHHH
Q psy18020 396 RRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 396 ~~g~I~ld~flK~vR~LaRe 415 (434)
.=|.++.|+|.|-.+.=++|
T Consensus 934 ~lG~Lp~daf~ky~~~~~~e 953 (1200)
T KOG0964|consen 934 ELGVLPEDAFEKYQDKKSKE 953 (1200)
T ss_pred hcCCCchHHHHHhccCCHHH
Confidence 34899999999855544433
No 347
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=66.03 E-value=1.4e+02 Score=34.16 Aligned_cols=43 Identities=12% Similarity=0.051 Sum_probs=27.3
Q ss_pred hhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 384 TEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKC 426 (434)
Q Consensus 384 ieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki 426 (434)
+...+..|..-|+.-.+.|+..=-.+-.|+--||-.+..|..+
T Consensus 477 lk~~~~~LQ~eLsEk~~~l~~~kee~s~l~s~~~K~~s~i~~l 519 (775)
T PF10174_consen 477 LKAKLESLQKELSEKELQLEDAKEEASKLASSQEKKDSEIERL 519 (775)
T ss_pred HHHHHHHHhhhhHHHHHHHHHhhhHHHHHhhccchhhhHHHHH
Confidence 4445555555566666677777777777777777666655443
No 348
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=66.03 E-value=95 Score=27.44 Aligned_cols=29 Identities=28% Similarity=0.353 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHhc---CCCCCCccc
Q psy18020 334 SISFLKEKETELDEILAVLT---EKTEVDVDE 362 (434)
Q Consensus 334 ~i~~L~~~~~eLe~~I~~l~---~k~~~diDe 362 (434)
....|+....++++--+.+. -.+++|+|+
T Consensus 97 l~~~L~~~~~e~eeeSe~lae~fl~g~~d~~~ 128 (150)
T PF07200_consen 97 LLARLQAAASEAEEESEELAEEFLDGEIDVDD 128 (150)
T ss_dssp HHHHHHHHHHHHHHHHHHHC-S-SSSHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCCHHH
Confidence 33444444455444444332 134456555
No 349
>KOG4643|consensus
Probab=65.75 E-value=1.8e+02 Score=34.03 Aligned_cols=65 Identities=20% Similarity=0.285 Sum_probs=42.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
-++.+++..++++++..+.|.++++||.. ++.++..|+.+...+-....+.+.-.++++...+..
T Consensus 181 dle~kir~LrqElEEK~enll~lr~eLdd----leae~~klrqe~~e~l~ea~ra~~yrdeldalre~a 245 (1195)
T KOG4643|consen 181 DLEKKIRTLRQELEEKFENLLRLRNELDD----LEAEISKLRQEIEEFLDEAHRADRYRDELDALREQA 245 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhh
Confidence 44555555555666666777777777776 666666666666666666666666666666655544
No 350
>KOG0895|consensus
Probab=65.72 E-value=14 Score=42.77 Aligned_cols=74 Identities=22% Similarity=0.305 Sum_probs=57.7
Q ss_pred cceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceeccccccccC--------CCCCHHHHHHHHHHHhCCCCCc
Q psy18020 11 IPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHTWTP--------QNSKLEGLIKTLASAFSALPPL 82 (434)
Q Consensus 11 iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~W~~--------~~s~L~~li~~l~~~F~~~pPl 82 (434)
+.+.|..|..||..+|.|-..=+.+..+.||- +-+|+|.+-.|-.|.. .++.|..++...+..-.++-|-
T Consensus 331 f~Fdiq~P~~yPa~pp~v~~lt~~~~R~nPNl--Yn~GKVcLslLgTwtg~~~e~wtp~~~sl~qvL~sIQ~Li~~e~Py 408 (1101)
T KOG0895|consen 331 FLFDIQFPDTYPAVPPHVKYLTGGGVRLNPNL--YNDGKVCLSLLGTWTGSRREKWTPNGSSLLQVLESIQGLILNEEPY 408 (1101)
T ss_pred eeeEeecCCCCCCCCceeEEeeccceeecCCc--ccCceEEeeeeeecccccccCCCccccchhhhhhhhhhhhcccCcc
Confidence 45899999999999999866545555555553 3489999988877753 4588999999999888888888
Q ss_pred cccC
Q psy18020 83 YSSR 86 (434)
Q Consensus 83 ~~~~ 86 (434)
+..+
T Consensus 409 ~ne~ 412 (1101)
T KOG0895|consen 409 FNEP 412 (1101)
T ss_pred cccc
Confidence 8765
No 351
>PF02252 PA28_beta: Proteasome activator pa28 beta subunit; InterPro: IPR003186 PA28 activator complex (also known as 11S regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha (PA28alpha) and beta (PA28beta) subunits. The catalytic properties of PA28alpha and PA28beta-activated proteosome are similar [, ]. This entry represents the beta subunit. The activator complex binds to the 20S proteasome and stimulates peptidase activity in and ATP-independent manner.; GO: 0008537 proteasome activator complex; PDB: 1AVO_N.
Probab=65.60 E-value=81 Score=28.50 Aligned_cols=101 Identities=18% Similarity=0.269 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCCCCccccccccchHHHHHHHhhHHHHhhhhHH--HHHHHHHhc
Q psy18020 324 IDKEKAELERSISFLKEKETELDEILAVL----TEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTI--YYMGEALRR 397 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~eLe~~I~~l----~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDti--y~L~~aL~~ 397 (434)
+.+.-+.++..+..|.+....+..||.-+ +++..+.|. | ...+..+|-+.-.+=.++-|+| ||..||=--
T Consensus 11 I~~l~~~vk~ei~~l~e~~~~vk~WI~l~IPkiEDGNNFGV~--V--Qeevl~~l~~v~~~a~~~~~~i~~Y~~~Ra~~v 86 (150)
T PF02252_consen 11 IVELLQKVKPEIRELIEKCNTVKMWIQLLIPKIEDGNNFGVS--V--QEEVLEELRAVESKAENFLDQISKYFSARAKAV 86 (150)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHT-----SS--HHHH--H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccccCCcccHH--H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455556667777777888888898877 233332221 1 0112333333333333333332 555443211
Q ss_pred C----CCCHHHHHHHHHHHHHHHHH-HHHHHHHHHH
Q psy18020 398 G----VIDLEVYLKQVRSLSRKQFM-FRALLFKCYK 428 (434)
Q Consensus 398 g----~I~ld~flK~vR~LaReQF~-~Ra~~~Ki~~ 428 (434)
. -=.+++|.+-|+++=++||. .|..+.-+|+
T Consensus 87 ~k~~K~p~v~DY~~~v~e~Dek~~~~lr~~~~elRn 122 (150)
T PF02252_consen 87 SKAAKYPHVEDYRQAVHELDEKEYISLRLIVLELRN 122 (150)
T ss_dssp HHHHH-TTBTHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1 11678899999999887776 4777766665
No 352
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=65.54 E-value=11 Score=30.57 Aligned_cols=34 Identities=18% Similarity=0.267 Sum_probs=23.3
Q ss_pred HhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHH
Q psy18020 382 AATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQ 416 (434)
Q Consensus 382 ~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQ 416 (434)
..+++.|..++. =+.|.|+++.|++-+..||--+
T Consensus 52 ~~v~~i~~~~D~-d~~G~I~f~eF~~l~~~~~~~~ 85 (92)
T cd05025 52 DAVDKIMKELDE-NGDGEVDFQEFVVLVAALTVAC 85 (92)
T ss_pred HHHHHHHHHHCC-CCCCcCcHHHHHHHHHHHHHHH
Confidence 445555554433 1468999999999988887543
No 353
>PHA01750 hypothetical protein
Probab=65.45 E-value=56 Score=25.37 Aligned_cols=24 Identities=25% Similarity=0.397 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKE 340 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~ 340 (434)
|..+++++...++.|++.+..++.
T Consensus 47 L~~ei~~~kikqDnl~~qv~eik~ 70 (75)
T PHA01750 47 LKTEIEELKIKQDELSRQVEEIKR 70 (75)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHH
Confidence 444444444444444444443333
No 354
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=65.42 E-value=1e+02 Score=27.57 Aligned_cols=33 Identities=27% Similarity=0.373 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
+..+...|..+....+..|..|+.....+...|
T Consensus 78 l~sEk~~L~k~lq~~q~kv~eLE~~~~~~~~~l 110 (140)
T PF10473_consen 78 LRSEKENLDKELQKKQEKVSELESLNSSLENLL 110 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 333333333333333333333333333333333
No 355
>KOG0426|consensus
Probab=65.41 E-value=10 Score=33.25 Aligned_cols=42 Identities=21% Similarity=0.286 Sum_probs=31.8
Q ss_pred cceEEEecCCCCCCCCEeEEecCCCceeccCCCcCCCCceecccccc
Q psy18020 11 IPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHT 57 (434)
Q Consensus 11 iPi~iwl~~~yP~~~P~~~v~pt~~m~i~~~~~vd~~G~~~~pyL~~ 57 (434)
.|-.+-+|.+||.+||-.-. |.+|. .+++.+||+|++-.||.
T Consensus 54 fpA~l~FP~DYPLsPPkm~F--tc~~f---HPNiy~dG~VCISILHa 95 (165)
T KOG0426|consen 54 FPARLSFPLDYPLSPPKMRF--TCEMF---HPNIYPDGRVCISILHA 95 (165)
T ss_pred cceeeecCCCCCCCCCceee--ecccc---cCcccCCCeEEEEEeeC
Confidence 46778899999999997644 33332 46789999999888873
No 356
>PRK00736 hypothetical protein; Provisional
Probab=65.39 E-value=63 Score=25.11 Aligned_cols=36 Identities=19% Similarity=0.280 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++.+++|...-.+-.+.|..|..+...|.+-+...
T Consensus 17 qe~tie~Ln~~v~~Qq~~i~~L~~ql~~L~~rl~~~ 52 (68)
T PRK00736 17 QEKTIEELSDQLAEQWKTVEQMRKKLDALTERFLSL 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333444444444444444444444444444444333
No 357
>PF06595 BDV_P24: Borna disease virus P24 protein; InterPro: IPR009517 Borna disease virus (BDV) is a non-cytolytic, neurotropic RNA virus that has a broad host range in warm-blooded animals. BDV is an enveloped virus, non-segmented, negative-stranded RNA genome and has an organisation characteristic of a member of Bornaviridae in the order of Mononegavirale. This family consists of several BDV P24 (phosphoprotein 24) proteins. They are essential components of the RNA polymerase transcription and replication complex. P24 is encoded by open reading frame II (ORF-II) and undergoes high rates of mutation in humans. They bind amphoterin-HMGB1, a multifunctional protein, directly may cause deleterious effects in cellular functions by its interference with HMGB1 []. Horse and human P24 have no species-specific amino acid residues, suggesting that the two viruses related [, ]. Numerous interactions of the immune system with the central nervous system have been described. Mood and psychotic disorders, such as severe depression and schizophrenia, are both heterogeneous disorders regarding clinical symptomatology, the acuity of symptoms, the clinical course and the treatment response []. BDV p24 RNA has been detected in the peripheral blood mononuclear cells (PBMCs) of psychiatric patients with such conditions []. Some studies find a significant difference in the prevalence of BDV p24 RNA in patients with mood disorders and schizophrenia [], whilst others find no difference between patients and control groups []. Consequently, debate about the role of BDV in psychiatric diseases remains alive.
Probab=65.15 E-value=65 Score=29.44 Aligned_cols=71 Identities=15% Similarity=0.200 Sum_probs=50.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFK--LARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~--L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
-|+.+|.+.-++++.-+|.|..+|.|+-..-++ -.+-+..|-+.-..|++.++.+.+...-+-+.|.-+-.
T Consensus 92 ~l~~~L~~i~~r~e~g~ESLSsLQ~Etl~~iq~~Dya~SIk~lGEN~kvLDRS~Ktm~eTmklMmEKiDLLY~ 164 (201)
T PF06595_consen 92 GLKGSLDDIAQRMESGLESLSSLQIETLQAIQQSDYADSIKQLGENIKVLDRSMKTMMETMKLMMEKIDLLYA 164 (201)
T ss_pred HhhccHHHHHHHHHHhHHHHHHHHHHHHHHHHhhchhHHHHHHcchhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467777777778888899999999988876665 34556666667777777777777666666666655543
No 358
>TIGR01844 type_I_sec_TolC type I secretion outer membrane protein, TolC family. Members of this model are outer membrane proteins from the TolC subfamily within the RND (Resistance-Nodulation-cell Division) efflux systems. These proteins, unlike the NodT subfamily, appear not to be lipoproteins. All are believed to participate in type I protein secretion, an ABC transporter system for protein secretion without cleavage of a signal sequence, although they may, like TolC, participate also in the efflux of smaller molecules as well. This family includes the well-documented examples TolC (E. coli), PrtF (Erwinia), and AprF (Pseudomonas aeruginosa).
Probab=64.99 E-value=1.1e+02 Score=31.07 Aligned_cols=42 Identities=5% Similarity=-0.027 Sum_probs=29.4
Q ss_pred CCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 354 EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 354 ~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
..+..+..|++.+..-+.+.-.++..-...+..+...|..++
T Consensus 357 ~~G~~~~~dll~a~~~~~~a~~~~~~a~~~~~~a~~~L~~~~ 398 (415)
T TIGR01844 357 QVGTRTLLDVLNAEQELYQARQELANARYDYLQAQLNLLSAT 398 (415)
T ss_pred hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 457778888777777777777777766666666666665554
No 359
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=64.74 E-value=1.7e+02 Score=29.73 Aligned_cols=21 Identities=14% Similarity=0.216 Sum_probs=16.8
Q ss_pred hHHHHHHHhhHHHHhhhhHHH
Q psy18020 369 PIYKQILNTFTEEAATEDTIY 389 (434)
Q Consensus 369 ~l~~Ql~~l~Aed~AieDtiy 389 (434)
-|.+|+-.|-++...+..-|.
T Consensus 182 ~L~Kqm~~l~~eKr~Lq~~l~ 202 (310)
T PF09755_consen 182 RLWKQMDKLEAEKRRLQEKLE 202 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHc
Confidence 378999999999988877644
No 360
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=64.70 E-value=37 Score=26.36 Aligned_cols=37 Identities=24% Similarity=0.243 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.++.+++|...-..-.+.|.+|+.....|.+.|..+.
T Consensus 16 qe~~ie~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~ 52 (69)
T PF04102_consen 16 QEDTIEELNDVVTEQQRQIDRLQRQLRLLRERLRELE 52 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT-----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4555555555555555555555555555555555443
No 361
>PRK11091 aerobic respiration control sensor protein ArcB; Provisional
Probab=64.59 E-value=82 Score=35.38 Aligned_cols=61 Identities=15% Similarity=0.138 Sum_probs=32.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCCccccccccchHHHHHHHhh
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK-TEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k-~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
.....+|+.+..+.++..+.|++..+.+...++...+. -..|.|.-|+-.+..+..++...
T Consensus 130 ~~~~~~L~~~i~~r~~~~~~l~~~~~~l~~il~~~~~~i~~~D~~g~i~~~N~a~~~l~G~~ 191 (779)
T PRK11091 130 QEAFEQLKNEIKEREETQIELEQQSSLLRSFLDASPDLVYYRNEDGEFSGCNRAMELLTGKS 191 (779)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEECCCCcEEeEcHHHHHHhCcC
Confidence 44444555555555555555666666666666544321 11255555666666666666544
No 362
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=64.54 E-value=79 Score=29.86 Aligned_cols=48 Identities=21% Similarity=0.320 Sum_probs=20.2
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
|...-+++.......+..+.++..+-..|..-+..+..+.++|.+-++
T Consensus 32 LKeei~emkk~e~~~~k~m~ei~~eN~~L~epL~~a~~e~~eL~k~L~ 79 (201)
T PF13851_consen 32 LKEEIAEMKKKEERNEKLMAEISQENKRLSEPLKKAEEEVEELRKQLK 79 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 333333333333334444444444444444444444444444444443
No 363
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=64.06 E-value=11 Score=30.70 Aligned_cols=34 Identities=12% Similarity=0.231 Sum_probs=24.2
Q ss_pred HHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 381 EAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
+..+++.|-.++. =+.|+|+++.|++.+..||-.
T Consensus 51 ~~~~~~ll~~~D~-d~DG~I~f~EF~~l~~~l~~~ 84 (89)
T cd05023 51 PGVLDRMMKKLDL-NSDGQLDFQEFLNLIGGLAVA 84 (89)
T ss_pred HHHHHHHHHHcCC-CCCCcCcHHHHHHHHHHHHHH
Confidence 3455555554443 246899999999999998864
No 364
>TIGR00833 actII Transport protein. Characterized members of the RND superfamily all probably catalyze substrate efflux via an H+ antiport mechanism. These proteins are found ubiquitously in bacteria, archaea and eukaryotes. This sub-family includes the S. coelicolor ActII3 protein, which may play a role in drug resistance, and the M. tuberculosis MmpL7 protein, which catalyzes export of an outer membrane lipid, phthiocerol dimycocerosate.
Probab=64.03 E-value=1.2e+02 Score=35.29 Aligned_cols=32 Identities=16% Similarity=0.245 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 321 FERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 321 ~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++|.+.-+.+.+.+..|.++..+++++++-+
T Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (910)
T TIGR00833 521 LGQVSLAVRLMQQAISKLQGSAGDVFDIFDPL 552 (910)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444444555555555554444
No 365
>PRK09174 F0F1 ATP synthase subunit B'; Validated
Probab=63.81 E-value=89 Score=29.61 Aligned_cols=9 Identities=0% Similarity=0.209 Sum_probs=3.4
Q ss_pred HHHHHHHHH
Q psy18020 294 VQAQEEIEI 302 (434)
Q Consensus 294 ~q~~~ele~ 302 (434)
..+..+++.
T Consensus 87 ~~I~~~L~~ 95 (204)
T PRK09174 87 DRIAQDLDQ 95 (204)
T ss_pred HHHHHHHHH
Confidence 333333333
No 366
>PF08700 Vps51: Vps51/Vps67; InterPro: IPR014812 The VFT tethering complex (also known as GARP complex, Golgi associated retrograde protein complex, Vps53 tethering complex) is a conserved eukaryotic docking complex which is involved in recycling of proteins from endosomes to the late Golgi. Vps51 (also known as Vps67) is a subunit of VFT and interacts with the SNARE Tlg1 [].
Probab=63.67 E-value=72 Score=25.22 Aligned_cols=57 Identities=14% Similarity=0.266 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 285 IRRKMNEQKVQAQEEIEI-LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~-L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
+..+|.......+.+++. +.....+|-. ..+++..|+.....|...+..+......+
T Consensus 27 ~~~~L~~~i~~~~~eLr~~V~~nY~~fI~----as~~I~~m~~~~~~l~~~l~~l~~~~~~l 84 (87)
T PF08700_consen 27 LENKLRQEIEEKDEELRKLVYENYRDFIE----ASDEISSMENDLSELRNLLSELQQSIQSL 84 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344444444444444443 3334444444 56666666666666666666666555544
No 367
>KOG2991|consensus
Probab=63.57 E-value=70 Score=31.50 Aligned_cols=59 Identities=20% Similarity=0.331 Sum_probs=43.7
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
++..++.--+...+||+..++.|.+.+.+|...-...++-|-.|+++..+-.+.|..++
T Consensus 240 ~Le~eLAmQKs~seElkssq~eL~dfm~eLdedVEgmqsTiliLQq~Lketr~~Iq~l~ 298 (330)
T KOG2991|consen 240 ELEIELAMQKSQSEELKSSQEELYDFMEELDEDVEGMQSTILILQQKLKETRKEIQRLK 298 (330)
T ss_pred HHHHHHHHHHhhHHHHHHhHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHH
Confidence 44445544566778888888889999988888888888888887777777766666553
No 368
>PRK00295 hypothetical protein; Provisional
Probab=63.52 E-value=68 Score=24.90 Aligned_cols=35 Identities=14% Similarity=0.075 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
.++.+++|...-.+-.+.|..|+.+...|.+-+..
T Consensus 17 qE~tie~Ln~~v~~Qq~~I~~L~~ql~~L~~rl~~ 51 (68)
T PRK00295 17 QDDTIQALNDVLVEQQRVIERLQLQMAALIKRQEE 51 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444444444444444444443333
No 369
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=63.39 E-value=1.1e+02 Score=27.59 Aligned_cols=76 Identities=22% Similarity=0.236 Sum_probs=46.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCC
Q psy18020 321 FERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVI 400 (434)
Q Consensus 321 ~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I 400 (434)
+..|+.+...|+..+..|+.+...|+.-+..+. ..+-.++ +-.++=++..|-..+++-+..|.. ....|
T Consensus 74 l~~ld~ei~~L~~el~~l~~~~k~l~~eL~~L~--~~~t~~e-------l~~~i~~l~~e~~~l~~kL~~l~~--~~~~v 142 (169)
T PF07106_consen 74 LAELDAEIKELREELAELKKEVKSLEAELASLS--SEPTNEE-------LREEIEELEEEIEELEEKLEKLRS--GSKPV 142 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCCHHH-------HHHHHHHHHHHHHHHHHHHHHHHh--CCCCC
Confidence 344444445555555555555556655555553 2223333 677778888999999999988887 44457
Q ss_pred CHHHHHH
Q psy18020 401 DLEVYLK 407 (434)
Q Consensus 401 ~ld~flK 407 (434)
+-+..-+
T Consensus 143 s~ee~~~ 149 (169)
T PF07106_consen 143 SPEEKEK 149 (169)
T ss_pred CHHHHHH
Confidence 6655443
No 370
>KOG4196|consensus
Probab=63.24 E-value=67 Score=28.33 Aligned_cols=56 Identities=32% Similarity=0.414 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHHHHHhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 297 QEEIEILKQTENELNQGK------FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~------~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
++|+=+|++-.-.|+.+. .+=-....+||.++..|.+.++.|.++...+..-+..+
T Consensus 46 reEVvrlKQrRRTLKNRGYA~sCR~KRv~Qk~eLE~~k~~L~qqv~~L~~e~s~~~~E~da~ 107 (135)
T KOG4196|consen 46 REEVVRLKQRRRTLKNRGYAQSCRVKRVQQKHELEKEKAELQQQVEKLKEENSRLRRELDAY 107 (135)
T ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555554443 11122345677777777777777777766665554444
No 371
>KOG2010|consensus
Probab=63.23 E-value=1.8e+02 Score=29.66 Aligned_cols=38 Identities=21% Similarity=0.198 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+.+..++.+++..++.|....+.|++.+.+=++.|+++
T Consensus 171 eek~kE~er~Kh~~s~Lq~~~~elKe~l~QRdeliee~ 208 (405)
T KOG2010|consen 171 EEKSKELERQKHMCSVLQHKMEELKEGLRQRDELIEEH 208 (405)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 44666666677777777777777777777777777765
No 372
>PF08286 Spc24: Spc24 subunit of Ndc80; InterPro: IPR013252 Spc24 is a component of the evolutionarily conserved kinetochore-associated Ndc80 complex and is involved in chromosome segregation [].; PDB: 2VE7_D 2FV4_B 2FTX_B.
Probab=63.19 E-value=3 Score=35.97 Aligned_cols=31 Identities=32% Similarity=0.475 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
+...+..|+.+-..|+..+..|+++.+++++
T Consensus 11 laK~~~~LE~~l~~l~~el~~L~~~l~eLe~ 41 (118)
T PF08286_consen 11 LAKELSDLESELESLQSELEELKEELEELEE 41 (118)
T ss_dssp -------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444444444444443
No 373
>PF09798 LCD1: DNA damage checkpoint protein; InterPro: IPR018622 This is a family of proteins which regulate checkpoint kinases. In Schizosaccharomyces pombe (Fission yeast) this protein is called Rad26 and in Saccharomyces cerevisiae (Baker's yeast) it is called LCD1 [].
Probab=63.16 E-value=35 Score=37.96 Aligned_cols=59 Identities=24% Similarity=0.347 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGK----FKLARMFERIDKEKAELERSISFLKEKET 343 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~----~~L~~~~~eLe~~k~~Le~~i~~L~~~~~ 343 (434)
||++|...-++.++|.+.++...++|+... ++|+.++++||.+|+=|.-+...+.....
T Consensus 2 LRdkL~~Lq~ek~~E~~~l~~~~~~lk~~~~~el~~Lk~~vqkLEDEKKFL~nE~r~~s~~~~ 64 (654)
T PF09798_consen 2 LRDKLELLQQEKQKERQALKSSVEELKESHEEELNKLKSEVQKLEDEKKFLNNELRSLSSSKR 64 (654)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Confidence 455555544555666666666666665544 34777778888887777776666554433
No 374
>KOG4279|consensus
Probab=63.02 E-value=21 Score=40.06 Aligned_cols=42 Identities=24% Similarity=0.354 Sum_probs=31.2
Q ss_pred HHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHH
Q psy18020 338 LKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEA 394 (434)
Q Consensus 338 L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~a 394 (434)
+.+..++|..|+..+ +||| ...++ ..+|+..+.|.||+..|.
T Consensus 1148 ~~~~DeeL~qWL~~~------~iDe-------~si~l--ilne~fT~~d~l~yvtrd 1189 (1226)
T KOG4279|consen 1148 ALSTDEELVQWLRGL------EIDE-------RSIAL--ILNEAFTLSDMLYYVTRD 1189 (1226)
T ss_pred cccchHHHHHHHHhc------CCch-------HHHHH--HHhccccHHHHHHHhhHh
Confidence 345567899999876 6777 44444 468899999999998764
No 375
>PF12777 MT: Microtubule-binding stalk of dynein motor; InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=62.92 E-value=28 Score=35.55 Aligned_cols=38 Identities=29% Similarity=0.417 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
|+...++...++..|+..+...+.+.+...+.+..|.+
T Consensus 254 l~~~~~~~~~e~~~l~~~~~~~~~kl~rA~~Li~~L~~ 291 (344)
T PF12777_consen 254 LQKEYEEAQKEKQELEEEIEETERKLERAEKLISGLSG 291 (344)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHhhhcc
Confidence 55555556667777777777777777777777777643
No 376
>KOG2150|consensus
Probab=62.81 E-value=82 Score=34.35 Aligned_cols=51 Identities=14% Similarity=0.077 Sum_probs=30.7
Q ss_pred HHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy18020 380 EEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKHISY 432 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~~~l 432 (434)
++..-.||+.+|...++.=.--+|.|= +-.|=|.+=.+|.|+.|...+|++
T Consensus 116 kEkek~d~~~wi~~~ideLe~q~d~~e--a~~~e~~~erh~~H~~~lEliLr~ 166 (575)
T KOG2150|consen 116 KEKEKRDTMDWISNQIDELERQVDSFE--AEELERFIERHRWHQQKLELILRL 166 (575)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445666777777766543211222221 156667777788888888877764
No 377
>PRK14141 heat shock protein GrpE; Provisional
Probab=62.79 E-value=95 Score=29.65 Aligned_cols=21 Identities=10% Similarity=0.003 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy18020 330 ELERSISFLKEKETELDEILA 350 (434)
Q Consensus 330 ~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.+++-+..|-.-.+.|+..+.
T Consensus 79 a~~~~~~dLLpViDnLerAl~ 99 (209)
T PRK14141 79 GIAGFARDMLSVSDNLRRALD 99 (209)
T ss_pred HHHHHHHHHhhhHhHHHHHHh
Confidence 334444445555555555554
No 378
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=62.79 E-value=1.8e+02 Score=29.45 Aligned_cols=21 Identities=10% Similarity=0.094 Sum_probs=14.5
Q ss_pred CCCHHHHHHHHHHHhCCCCCc
Q psy18020 62 NSKLEGLIKTLASAFSALPPL 82 (434)
Q Consensus 62 ~s~L~~li~~l~~~F~~~pPl 82 (434)
.-+|.+|++.....|-.+..+
T Consensus 13 ~isL~~FL~~~~I~F~dDl~~ 33 (325)
T PF08317_consen 13 PISLQDFLNMTGIRFYDDLEM 33 (325)
T ss_pred CcCHHHHHHHhCceeCCCccc
Confidence 456888887777777666553
No 379
>cd00179 SynN Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane; they are a family of receptors for intracellular transport vesicles; each target membrane may be identified by a specific member of the syntaxin family; syntaxins contain a moderately well conserved amino-terminal domain, called Habc, whose structure is an antiparallel three-helix bundle; a linker of about 30 amino acids connects this to the carboxy-terminal region, designated H3 (t_SNARE), of the syntaxin cytoplasmic domain; the highly conserved H3 region forms a single, long alpha-helix when it is part of the core SNARE complex and anchors the protein on the cytoplasmic surface of cellular membranes; H3 is not included in defining this domain
Probab=62.74 E-value=1.1e+02 Score=26.82 Aligned_cols=60 Identities=17% Similarity=0.160 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 291 e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
..+..+...++.|..++..|......-.....+|+...+.....+..+...+..|++...
T Consensus 13 ~~i~~i~~~v~~l~~l~~~~~t~~~~~~~~~~~l~~~~~~~~~~~~~ik~~lk~l~~~~~ 72 (151)
T cd00179 13 GNIDKISEDVEELQKLHSQLLTAPDADPELKQELESLVQEIKKLAKEIKGKLKELEESNE 72 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444555556666666555443322334445555555555666666666666655543
No 380
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=62.68 E-value=1.8e+02 Score=29.56 Aligned_cols=114 Identities=18% Similarity=0.087 Sum_probs=59.1
Q ss_pred HHHHHHHHHHhHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHH
Q psy18020 302 ILKQTENELNQGKFKLARMFE-----RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILN 376 (434)
Q Consensus 302 ~L~~~q~EL~~~~~~L~~~~~-----eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~ 376 (434)
++...+.+|...+.+|.+... .++.......+.+..|+.+..+++.-+..++.. -..-|-++..
T Consensus 178 ql~~~~~~l~~ae~~l~~fr~~~~~~d~~~~~~~~~~~i~~L~~~l~~~~~~l~~l~~~-----------~~~~~P~v~~ 246 (362)
T TIGR01010 178 EVKEAEQRLNATKAELLKYQIKNKVFDPKAQSSAQLSLISTLEGELIRVQAQLAQLRSI-----------TPEQNPQVPS 246 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------CCCCCCchHH
Confidence 344455555555544444433 233334455566777777777777766655421 0111334455
Q ss_pred hhHHHHhhhhHHHHHHHHHhcCC-CCHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 377 TFTEEAATEDTIYYMGEALRRGV-IDLEVYLKQVRSLSRKQFMFRALLFKC 426 (434)
Q Consensus 377 l~Aed~AieDtiy~L~~aL~~g~-I~ld~flK~vR~LaReQF~~Ra~~~Ki 426 (434)
+-++-.+++.-|....+-+..+. -.+..-..+-..|-||.=+.|.+.+..
T Consensus 247 l~~~i~~l~~~i~~e~~~i~~~~~~~l~~~~~~~~~L~re~~~a~~~y~~~ 297 (362)
T TIGR01010 247 LQARIKSLRKQIDEQRNQLSGGLGDSLNEQTADYQRLVLQNELAQQQLKAA 297 (362)
T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55566666666666555444332 244544555556666555555444433
No 381
>TIGR03321 alt_F1F0_F0_B alternate F1F0 ATPase, F0 subunit B. CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
Probab=62.64 E-value=1.5e+02 Score=28.63 Aligned_cols=12 Identities=25% Similarity=0.841 Sum_probs=5.7
Q ss_pred HHHHHHHHHHHH
Q psy18020 402 LEVYLKQVRSLS 413 (434)
Q Consensus 402 ld~flK~vR~La 413 (434)
+|.|++.+..|.
T Consensus 150 id~~i~~l~~l~ 161 (246)
T TIGR03321 150 VDVFVQRLRTLD 161 (246)
T ss_pred HHHHHHHhhcCC
Confidence 344555554443
No 382
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=62.55 E-value=78 Score=34.84 Aligned_cols=36 Identities=28% Similarity=0.321 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
++++++.++..-..|+.++..-.++.++|++-+..+
T Consensus 472 ~~rei~~~~~~I~~L~~~L~e~~~~ve~L~~~l~~l 507 (652)
T COG2433 472 KDREIRARDRRIERLEKELEEKKKRVEELERKLAEL 507 (652)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444455555555554444455555444444
No 383
>PF04740 LXG: LXG domain of WXG superfamily; InterPro: IPR006829 This group of putative transposases is found in Gram-positive bacteria, mostly Bacillus members and is thought to be a Cytosolic protein. However, we have also found a Bacillus subtilis bacteriophage SPbetac2 homologue (O64023 from SWISSPROT), possibly arising as a result of horizontal transfer. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=62.42 E-value=1.3e+02 Score=27.91 Aligned_cols=25 Identities=16% Similarity=0.248 Sum_probs=15.7
Q ss_pred HHhhhhHHHHHHHHHhcCCCCHHHH
Q psy18020 381 EAATEDTIYYMGEALRRGVIDLEVY 405 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~g~I~ld~f 405 (434)
..+++-.+--|..+..+|+|+...|
T Consensus 179 ~~~l~~~l~~l~~~~~~g~~~~~~y 203 (204)
T PF04740_consen 179 LQALQSGLSQLQSMWNNGSISIGSY 203 (204)
T ss_pred HHHHHHHHHHHHHhhcCCCCcccCC
Confidence 3456666677777777776654433
No 384
>COG4985 ABC-type phosphate transport system, auxiliary component [Inorganic ion transport and metabolism]
Probab=62.39 E-value=1.1e+02 Score=29.69 Aligned_cols=29 Identities=24% Similarity=0.324 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 322 ERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 322 ~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
..+..++.+|+++.+.|.++.+.|..-++
T Consensus 217 ~~~~ae~seLq~r~~~l~~~L~~L~~e~~ 245 (289)
T COG4985 217 QHYVAEKSELQKRLAQLQTELDALRAELE 245 (289)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Confidence 44555556666655555555555544433
No 385
>PRK02793 phi X174 lysis protein; Provisional
Probab=62.37 E-value=75 Score=24.97 Aligned_cols=37 Identities=16% Similarity=0.131 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.++.+.+|...-.+-.+.|..|+.+...|.+-+..++
T Consensus 20 Qe~tIe~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~ 56 (72)
T PRK02793 20 QEITIEELNVTVTAHEMEMAKLRDHLRLLTEKLKASQ 56 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4445555555555555555555555555555555443
No 386
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=62.28 E-value=2.3e+02 Score=30.53 Aligned_cols=11 Identities=27% Similarity=0.456 Sum_probs=5.0
Q ss_pred HHHHHHHHHHH
Q psy18020 402 LEVYLKQVRSL 412 (434)
Q Consensus 402 ld~flK~vR~L 412 (434)
|+.|=|+|..+
T Consensus 149 l~~f~~~v~~~ 159 (475)
T PRK10361 149 LDGFRRQVQDS 159 (475)
T ss_pred HHHHHHHHHHH
Confidence 44444444443
No 387
>PF07407 Seadorna_VP6: Seadornavirus VP6 protein; InterPro: IPR009982 This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown.
Probab=62.23 E-value=63 Score=32.86 Aligned_cols=46 Identities=13% Similarity=0.167 Sum_probs=21.1
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAE--LERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~--Le~~i~~L~~~~~eLe~~I~~l 352 (434)
...|++.-++|+.++++||++..+ +...++..+.+.+.+-+++.+.
T Consensus 41 N~~LKkEN~~Lk~eVerLE~e~l~s~V~E~vet~dv~~d~i~Kimnk~ 88 (420)
T PF07407_consen 41 NHSLKKENNDLKIEVERLENEMLRSHVCEDVETNDVIYDKIVKIMNKM 88 (420)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 333333334444455555444433 3333444455555555555544
No 388
>KOG0612|consensus
Probab=62.20 E-value=2.5e+02 Score=33.53 Aligned_cols=25 Identities=12% Similarity=0.126 Sum_probs=16.4
Q ss_pred CCCHHHHHHHHHHHhCCCCCccccC
Q psy18020 62 NSKLEGLIKTLASAFSALPPLYSSR 86 (434)
Q Consensus 62 ~s~L~~li~~l~~~F~~~pPl~~~~ 86 (434)
.||.-.|=-+|-.++=.+-|+|+.+
T Consensus 261 ecDwWSlGV~~YEMlyG~TPFYads 285 (1317)
T KOG0612|consen 261 ECDWWSLGVFMYEMLYGETPFYADS 285 (1317)
T ss_pred ccchhhhHHHHHHHHcCCCcchHHH
Confidence 4444445555667777888888765
No 389
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=62.17 E-value=14 Score=30.06 Aligned_cols=34 Identities=9% Similarity=0.232 Sum_probs=23.0
Q ss_pred HHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 381 EAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
+..+++.|..++.- ..|+|+++.|++-+..++-.
T Consensus 50 ~~~v~~~i~~~D~n-~dG~v~f~eF~~li~~~~~~ 83 (88)
T cd05027 50 QEVVDKVMETLDSD-GDGECDFQEFMAFVAMVTTA 83 (88)
T ss_pred HHHHHHHHHHhCCC-CCCcCcHHHHHHHHHHHHHH
Confidence 34455555544322 46899999999998887644
No 390
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=62.04 E-value=56 Score=25.41 Aligned_cols=29 Identities=17% Similarity=0.358 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETELD 346 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eLe 346 (434)
++.+..||...+..+..+..+.++.++++
T Consensus 19 ~~~i~~lE~~~~~~e~~i~~~~~~l~~I~ 47 (71)
T PF10779_consen 19 EERIDKLEKRDAANEKDIKNLNKQLEKIK 47 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444444444444444443
No 391
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=62.03 E-value=69 Score=27.41 Aligned_cols=13 Identities=31% Similarity=0.268 Sum_probs=5.0
Q ss_pred HhhhhHHHHHHHH
Q psy18020 382 AATEDTIYYMGEA 394 (434)
Q Consensus 382 ~AieDtiy~L~~a 394 (434)
..+++.+.-|.+.
T Consensus 96 ~~l~~~~~~l~~~ 108 (126)
T TIGR00293 96 EELEKAIEKLQEA 108 (126)
T ss_pred HHHHHHHHHHHHH
Confidence 3333333333333
No 392
>KOG0396|consensus
Probab=61.88 E-value=2e+02 Score=29.93 Aligned_cols=39 Identities=15% Similarity=0.227 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 315 FKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.+++.++..++..+..+++.++...+..+.....|+.+.
T Consensus 61 ~~id~Li~kv~~~krk~e~~iq~e~~~~~~iksRid~m~ 99 (389)
T KOG0396|consen 61 SLIDRLIRKVQCLKRKLEEYIQSEEEQLKRIKSRIDFMH 99 (389)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 446666666666666666666666555555555555443
No 393
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=61.83 E-value=1.4e+02 Score=33.27 Aligned_cols=147 Identities=13% Similarity=0.129 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH---------------HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 283 DKIRRKMNEQKVQAQEEIEILK---------------QTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 283 dkL~~~L~e~~~q~~~ele~L~---------------~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
.....++++.+..++...+... +..++.+...+++.+++.+|.+..++|++++++++++.++|+.
T Consensus 49 ~~~~~~~~~~l~~L~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~l~~~~~~L~~~~~~l~~~~~~l~~ 128 (646)
T PRK05771 49 RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEP 128 (646)
T ss_pred HHHHHHHHHHHHHHHHhccccccchhhhccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q ss_pred H------HHHhcCCCCCCccccccccchHHHHHHHh-------------------hHHHHhhhhHHHHHHHHHhcCCCCH
Q psy18020 348 I------LAVLTEKTEVDVDEAVTTTAPIYKQILNT-------------------FTEEAATEDTIYYMGEALRRGVIDL 402 (434)
Q Consensus 348 ~------I~~l~~k~~~diDe~i~~~~~l~~Ql~~l-------------------~Aed~AieDtiy~L~~aL~~g~I~l 402 (434)
| +..+.+.+-+.+---.++.+-..+...+. ++....-++.-..|.+ +.-.++++
T Consensus 129 ~~~ld~~l~~~~~~~~~~~~~G~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~~~~~~~~~~l~~-~~f~~~~~ 207 (646)
T PRK05771 129 WGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKK-LGFERLEL 207 (646)
T ss_pred hhcCCCCHHHhCCCCcEEEEEEEecchhhhhHHhhccCceEEEEEecCCcEEEEEEEEhhhHHHHHHHHHH-CCCEEecC
Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 403 ---EVYLKQVRSLSRKQFMFRALLFKCYKHI 430 (434)
Q Consensus 403 ---d~flK~vR~LaReQF~~Ra~~~Ki~~~~ 430 (434)
...-+.+++|.++.=-.+..++++.+.+
T Consensus 208 p~~~~p~~~l~~l~~~l~~l~~~~~~~~~~l 238 (646)
T PRK05771 208 EEEGTPSELIREIKEELEEIEKERESLLEEL 238 (646)
T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
No 394
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=61.78 E-value=1.9e+02 Score=29.36 Aligned_cols=69 Identities=28% Similarity=0.388 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHH
Q psy18020 325 DKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEV 404 (434)
Q Consensus 325 e~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~ 404 (434)
+...+.|.+.+..|..+..+|+..++. ..+.-|+. |.++|-.|-++..+...++. -|.+.+|+++-
T Consensus 105 E~ltn~L~rkl~qLr~EK~~lE~~Le~---EqE~~V~k-------L~k~i~~Le~e~~~~q~~le----~Lr~EKVdlEn 170 (310)
T PF09755_consen 105 EFLTNDLSRKLNQLRQEKVELENQLEQ---EQEYLVNK-------LQKKIERLEKEKSAKQEELE----RLRREKVDLEN 170 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH---hHHHHHHH-------HHHHHHHHHHHHHHhHHHHH----HHHHHHHhHHH
Confidence 344455555555555555555554432 12223333 66777777777766666555 45567777776
Q ss_pred HHH
Q psy18020 405 YLK 407 (434)
Q Consensus 405 flK 407 (434)
=|.
T Consensus 171 ~LE 173 (310)
T PF09755_consen 171 TLE 173 (310)
T ss_pred HHH
Confidence 554
No 395
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family. Members of this protein family are phage lysis regulatory protein, including the well-studied protein LysB (lysis protein B) of Enterobacteria phage P2. For members of this family, genes are found in phage or in prophage regions of bacterial genomes, typically near a phage lysozyme or phage holin.
Probab=61.76 E-value=1.2e+02 Score=27.02 Aligned_cols=67 Identities=13% Similarity=0.257 Sum_probs=32.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 287 ~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~---~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.++......++..-..+...++++......|... .++-+..+..|.............-++-|+.+.
T Consensus 19 ~~~~~l~~~~~~a~~~~~~~~~~l~~~~~qL~~l~~~a~~~~~~Q~~Lr~~~~~~~~~l~~re~~i~rL~ 88 (135)
T TIGR03495 19 QRLRNARADLERANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARALLAQREQRIERLK 88 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333344455555555544444333 333355566666666655555555555555553
No 396
>PF01540 Lipoprotein_7: Adhesin lipoprotein; InterPro: IPR002520 This family consists of the p50 and variable adherence-associated antigen (Vaa) adhesins from Mycoplasma hominis. M. hominis is a mycoplasma associated with human urogenital diseases, pneumonia, and septic arthritis []. An adhesin is a cell surface molecule that mediates adhesion to other cells or to the surrounding surface or substrate. The Vaa antigen is a 50kDa surface lipoprotein that has four tandem repetitive DNA sequences encoding a periodic peptide structure, and is highly immunogenic in the human host []. p50 is also a 50kDa lipoprotein, having three repeats A,B and C, that may be a tetramer of 191kDa in its native environment [].
Probab=61.71 E-value=1.7e+02 Score=28.98 Aligned_cols=78 Identities=28% Similarity=0.382 Sum_probs=49.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy18020 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK-------EKAELERSISFLKEKETELDEILAVLTEKTE 357 (434)
Q Consensus 285 L~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~-------~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~ 357 (434)
+-.+|.+..+.++.-++.|..+..|-..-...++..+.+|+. -+.+|+.-+..|..+-.++..+......+++
T Consensus 116 i~k~lAeeNqKIq~gi~EL~Kl~~e~~~l~kTi~~TIa~lEKKFqI~~~FKekLesfa~~L~~KS~eI~tFttv~s~k~e 195 (353)
T PF01540_consen 116 IDKKLAEENQKIQNGIEELKKLSNEAFELSKTINKTIAKLEKKFQIDKDFKEKLESFADLLNKKSREIDTFTTVQSTKEE 195 (353)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHhHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhh
Confidence 344445555555555666666666555555556677777763 4567778888888888888888777666655
Q ss_pred CCccc
Q psy18020 358 VDVDE 362 (434)
Q Consensus 358 ~diDe 362 (434)
+-+.+
T Consensus 196 F~L~E 200 (353)
T PF01540_consen 196 FVLNE 200 (353)
T ss_pred hhHHH
Confidence 54443
No 397
>KOG0243|consensus
Probab=61.67 E-value=2.4e+02 Score=33.18 Aligned_cols=23 Identities=13% Similarity=0.421 Sum_probs=12.8
Q ss_pred CCchHHHHHHHHHHH---HHHHHHHH
Q psy18020 268 ITDEHIKMSLKSAVE---DKIRRKMN 290 (434)
Q Consensus 268 ~~~e~~r~sl~sal~---dkL~~~L~ 290 (434)
++....++.++.-+. ++|++.|.
T Consensus 396 vNQkl~K~~llKd~~~EIerLK~dl~ 421 (1041)
T KOG0243|consen 396 VNQKLMKKTLLKDLYEEIERLKRDLA 421 (1041)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444556666653222 47777773
No 398
>PF06637 PV-1: PV-1 protein (PLVAP); InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=61.63 E-value=1.3e+02 Score=31.25 Aligned_cols=30 Identities=20% Similarity=0.258 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Q psy18020 284 KIRRKMNEQKVQAQEEIEILKQTENELNQG 313 (434)
Q Consensus 284 kL~~~L~e~~~q~~~ele~L~~~q~EL~~~ 313 (434)
.|-++|+..+.+..+|...|++++.+++++
T Consensus 285 elar~Lr~~I~~VarENs~LqrQKle~e~~ 314 (442)
T PF06637_consen 285 ELARSLRAGIERVARENSDLQRQKLEAEQG 314 (442)
T ss_pred HHHHHHhhhHHHHHHhhhHHHHHHHHHHHH
Confidence 344455555555555555555555555444
No 399
>TIGR00414 serS seryl-tRNA synthetase. This model represents the seryl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam model tRNA-synt_2b. The seryl-tRNA synthetases of two archaeal species, Methanococcus jannaschii and Methanobacterium thermoautotrophicum, differ considerably and are included in a different model.
Probab=61.59 E-value=43 Score=35.28 Aligned_cols=95 Identities=14% Similarity=0.141 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-ccccccccchHH
Q psy18020 296 AQEEIEILKQTENELNQGKFKLA---RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVD-VDEAVTTTAPIY 371 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~---~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~d-iDe~i~~~~~l~ 371 (434)
++...+.....++.|..+...+. +.+-+|..++.+|...++.|+.+..++.+-|..+...++ + .++++.....+-
T Consensus 4 ik~ir~n~~~v~~~l~~R~~~~~~~vd~i~~ld~~~r~~~~~~~~l~~erN~~sk~i~~~~~~~~-~~~~~l~~~~~~l~ 82 (418)
T TIGR00414 4 RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKK-DKIEEIKKELKELK 82 (418)
T ss_pred HHHHHhCHHHHHHHHHhcCCChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-chHHHHHHHHHHHH
Q ss_pred HHHHHhhHHHHhhhhHHHHH
Q psy18020 372 KQILNTFTEEAATEDTIYYM 391 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDtiy~L 391 (434)
++|-++-++-.++++.+..+
T Consensus 83 ~~~~~~~~~~~~~~~~~~~~ 102 (418)
T TIGR00414 83 EELTELSAALKALEAELQDK 102 (418)
T ss_pred HHHHHHHHHHHHHHHHHHHH
No 400
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=61.52 E-value=52 Score=30.08 Aligned_cols=37 Identities=24% Similarity=0.337 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 316 KLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 316 ~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
+|+..+..|+.+-..|++....++++-..|..+++..
T Consensus 115 ~l~~~~e~Le~e~~~L~~~~~~~~eDY~~L~~Im~RA 151 (161)
T TIGR02894 115 SLQKRNEELEKELEKLRQRLSTIEEDYQTLIDIMDRA 151 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555555555555555555555555555555555543
No 401
>PF04740 LXG: LXG domain of WXG superfamily; InterPro: IPR006829 This group of putative transposases is found in Gram-positive bacteria, mostly Bacillus members and is thought to be a Cytosolic protein. However, we have also found a Bacillus subtilis bacteriophage SPbetac2 homologue (O64023 from SWISSPROT), possibly arising as a result of horizontal transfer. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=61.42 E-value=1.4e+02 Score=27.66 Aligned_cols=11 Identities=9% Similarity=0.154 Sum_probs=5.7
Q ss_pred hhhhHHHHHHH
Q psy18020 383 ATEDTIYYMGE 393 (434)
Q Consensus 383 AieDtiy~L~~ 393 (434)
-++|++.-|..
T Consensus 153 ~l~~~lekL~~ 163 (204)
T PF04740_consen 153 KLQETLEKLRA 163 (204)
T ss_pred HHHHHHHHHHH
Confidence 44555555543
No 402
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=61.23 E-value=62 Score=24.28 Aligned_cols=33 Identities=15% Similarity=0.267 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
|+..+.+|+..-+.+++..+.+.+..+++++.+
T Consensus 5 lEn~~~~~~~~i~tvk~en~~i~~~ve~i~env 37 (55)
T PF05377_consen 5 LENELPRIESSINTVKKENEEISESVEKIEENV 37 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444444444444
No 403
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=61.18 E-value=1.1e+02 Score=26.34 Aligned_cols=28 Identities=21% Similarity=0.419 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETEL 345 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eL 345 (434)
++-+..|+...+.+++++..+.++.+.+
T Consensus 93 ~~r~~~l~~~~~~l~~~l~~l~~~~~~~ 120 (129)
T cd00584 93 DKKIEELTKQIEKLQKELAKLKDQINTL 120 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333333333333
No 404
>PF15324 TALPID3: Hedgehog signalling target
Probab=60.98 E-value=15 Score=42.38 Aligned_cols=41 Identities=22% Similarity=0.360 Sum_probs=29.0
Q ss_pred hHHHHHHHHHhcC-------CC--CHHHHHHHH-----HHHHHHHHHHHHHHHHH
Q psy18020 386 DTIYYMGEALRRG-------VI--DLEVYLKQV-----RSLSRKQFMFRALLFKC 426 (434)
Q Consensus 386 Dtiy~L~~aL~~g-------~I--~ld~flK~v-----R~LaReQF~~Ra~~~Ki 426 (434)
++||.|-.||..+ +| ++|.|+|.+ .+|||+-+..|-.-+|+
T Consensus 406 aAMYSfInaLstNRd~~EkIRIRKTVDeWIK~ISaEIQdElaRkd~Eqkr~dqk~ 460 (1252)
T PF15324_consen 406 AAMYSFINALSTNRDMSEKIRIRKTVDEWIKTISAEIQDELARKDYEQKRFDQKN 460 (1252)
T ss_pred hHHHHHHHHhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHhhhHHHHHHHHhh
Confidence 4666666666432 34 899999987 57999887777665555
No 405
>PRK14153 heat shock protein GrpE; Provisional
Probab=60.97 E-value=1e+02 Score=29.04 Aligned_cols=74 Identities=23% Similarity=0.221 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhh
Q psy18020 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378 (434)
Q Consensus 301 e~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~ 378 (434)
+.+.++++|...-+...++++.++ .+..+++-+..|-.-.+.|+..+.... ..-++..++.+-.-+++|+++.+
T Consensus 54 d~~lR~~AEfeN~rKR~~kE~e~~--~~~a~~~~~~~LLpv~DnLerAl~~~~--~~~~~~~l~~Gvemi~k~~~~vL 127 (194)
T PRK14153 54 EQLFRLAAEFDNFRKRTAREMEEN--RKFVLEQVLLDLLEVTDNFERALESAR--TAEDMNSIVEGIEMVSKQFFSIL 127 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhHHhHHHHHHhccc--ccchHHHHHHHHHHHHHHHHHHH
Confidence 334445555544444444443332 223334455555555556666554322 11233444444444555555444
No 406
>PF04380 BMFP: Membrane fusogenic activity; InterPro: IPR007475 BMFP consists of two structural domains, a coiled-coil C-terminal domain via which the protein self-associates as a trimer, and an N-terminal domain disordered at neutral pH but adopting an amphipathic alpha-helical structure in the presence of phospholipid vesicles, high ionic strength, acidic pH or SDS. BMFP interacts with phospholipid vesicles though the predicted amphipathic alpha-helix induced in the N-terminal half of the protein and promotes aggregation and fusion of vesicles in vitro.
Probab=60.85 E-value=85 Score=25.08 Aligned_cols=27 Identities=26% Similarity=0.233 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 327 EKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 327 ~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
+-+.....+.+++++++.|+.-|+.|+
T Consensus 51 EFd~q~~~L~~~r~kl~~LEarl~~LE 77 (79)
T PF04380_consen 51 EFDAQKAVLARTREKLEALEARLAALE 77 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 445555556666666666666666554
No 407
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=60.72 E-value=39 Score=36.08 Aligned_cols=24 Identities=17% Similarity=0.315 Sum_probs=11.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 312 QGKFKLARMFERIDKEKAELERSI 335 (434)
Q Consensus 312 ~~~~~L~~~~~eLe~~k~~Le~~i 335 (434)
+..+.+++.+++|+.+.+.|+.++
T Consensus 97 aq~~dle~KIkeLEaE~~~Lk~Ql 120 (475)
T PRK13729 97 KQRGDDQRRIEKLGQDNAALAEQV 120 (475)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444444443
No 408
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=60.64 E-value=52 Score=34.17 Aligned_cols=10 Identities=0% Similarity=0.049 Sum_probs=3.8
Q ss_pred HHHHHHHHHH
Q psy18020 342 ETELDEILAV 351 (434)
Q Consensus 342 ~~eLe~~I~~ 351 (434)
...++.|++.
T Consensus 370 ~~~i~~n~~~ 379 (388)
T PF04912_consen 370 METIEKNVKK 379 (388)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 409
>PRK02119 hypothetical protein; Provisional
Probab=60.62 E-value=82 Score=24.84 Aligned_cols=37 Identities=11% Similarity=0.135 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.++.+.+|...-.+-.+.|..|+.+...|.+-+..++
T Consensus 21 QE~tie~LN~~v~~Qq~~id~L~~ql~~L~~rl~~~~ 57 (73)
T PRK02119 21 QENLLEELNQALIEQQFVIDKMQVQLRYMANKLKDMQ 57 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4555555555555555555555555555555554443
No 410
>TIGR00634 recN DNA repair protein RecN. All proteins in this family for which functions are known are ATP binding proteins involved in the initiation of recombination and recombinational repair.
Probab=60.57 E-value=1.1e+02 Score=33.46 Aligned_cols=126 Identities=12% Similarity=0.116 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHH
Q psy18020 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK------ETELD 346 (434)
Q Consensus 273 ~r~sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~------~~eLe 346 (434)
+...-+-.+.|.... +.+.+...+...+.++..+.+|.. ++....+++++.+.++..++.|++. .++|+
T Consensus 141 ~~~~~~~~lLD~~~~-~~~~~~~~~~~~~~~~~~~~~L~~----l~~~~~~~~~eld~L~~ql~ELe~~~l~~~E~e~L~ 215 (563)
T TIGR00634 141 FRPDEQRQLLDTFAG-ANEKVKAYRELYQAWLKARQQLKD----RQQKEQELAQRLDFLQFQLEELEEADLQPGEDEALE 215 (563)
T ss_pred cCHHHHHHHHHHhcC-chHHHHHHHHHHHHHHHHHHHHHH----HHhhhHHHHHHHHHHHHHHHHHHhCCcCCCcHHHHH
Q ss_pred HHHHHhcCCCCCCccccccccchHHHHHHHhhHHH----H-hhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 347 EILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE----A-ATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 347 ~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed----~-AieDtiy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
+-...+.+.+ .+-+ ..++.+++..+| . .+-|.|....++|..- +|.-++.+...-++
T Consensus 216 ~e~~~L~n~e--~i~~-------~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~l~~~---~d~~~~~~~~~l~~ 277 (563)
T TIGR00634 216 AEQQRLSNLE--KLRE-------LSQNALAALRGDVDVQEGSLLEGLGEAQLALASV---IDGSLRELAEQVGN 277 (563)
T ss_pred HHHHHHhCHH--HHHH-------HHHHHHHHHhCCccccccCHHHHHHHHHHHHHHh---hhHhHHHHHHHHHH
No 411
>KOG0810|consensus
Probab=60.44 E-value=1e+02 Score=31.04 Aligned_cols=61 Identities=13% Similarity=0.110 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 293 KVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 293 ~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
+..+....+.|...++.+.......+...++|+...+.+.+....++.+++.+++..+..+
T Consensus 49 i~~l~~~~~~l~~~hs~~l~~~~~~~~~k~~l~~~~~~~~~~a~~Ik~kL~~~e~~~~~~~ 109 (297)
T KOG0810|consen 49 IEKLDEDVEKLQKLHSKSLHSPNADKELKRKLESLVDEIRRRARKIKTKLKALEKENEADE 109 (297)
T ss_pred HHHHHHHHHHHHHhhhhHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3444555666666666666655666777777777777777888888888888877776554
No 412
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=60.36 E-value=1.8e+02 Score=28.82 Aligned_cols=24 Identities=17% Similarity=0.322 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 329 AELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 329 ~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++...+.+|+.+..++.+.+..+
T Consensus 231 ~e~~~rl~~l~~~~~~l~k~~~~~ 254 (269)
T PF05278_consen 231 TEMKGRLGELEMESTRLSKTIKSI 254 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444445555555555554444
No 413
>PF00804 Syntaxin: Syntaxin; InterPro: IPR006011 Syntaxins A and B are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane. Syntaxins are a family of receptors for intracellular transport vesicles. Each target membrane may be identified by a specific member of the syntaxin family []. Members of the syntaxin family [, ] have a size ranging from 30 Kd to 40 Kd; a C-terminal extremity which is highly hydrophobic and anchors the protein on the cytoplasmic surface of cellular membranes; a central, well conserved region, which seems to be in a coiled-coil conformation. ; GO: 0016020 membrane; PDB: 1S94_B 1EZ3_A 3C98_B 1BR0_A 1FIO_A 2XHE_B.
Probab=60.26 E-value=87 Score=25.03 Aligned_cols=58 Identities=10% Similarity=0.147 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 293 KVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 293 ~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
+..++..++.|..++.........=.....+|+.....+...+..+...+..|++...
T Consensus 16 i~~i~~~~~~l~~l~~~~l~~~~~d~~~~~el~~l~~~i~~~~~~~~~~lk~l~~~~~ 73 (103)
T PF00804_consen 16 IDKIKEKLNELRKLHKKILSSPDQDSELKRELDELTDEIKQLFQKIKKRLKQLSKDNE 73 (103)
T ss_dssp HHHHHHHHHHHHHHHHHHHTSSSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhcCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3344445555555554433322211234445555555555555555555666555543
No 414
>PF07877 DUF1661: Protein of unknown function (DUF1661); InterPro: IPR012456 The proteins in this entry have not been characterised.
Probab=60.20 E-value=6.2 Score=25.98 Aligned_cols=18 Identities=33% Similarity=0.202 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy18020 412 LSRKQFMFRALLFKCYKH 429 (434)
Q Consensus 412 LaReQF~~Ra~~~Ki~~~ 429 (434)
|+||.|..||..+|+...
T Consensus 1 lare~k~sRakTKKfs~h 18 (31)
T PF07877_consen 1 LAREVKNSRAKTKKFSRH 18 (31)
T ss_pred ChHHHHhHHHHHHHHHHH
Confidence 789999999999999743
No 415
>KOG4286|consensus
Probab=60.07 E-value=70 Score=36.06 Aligned_cols=28 Identities=14% Similarity=0.144 Sum_probs=17.6
Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 388 IYYMGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 388 iy~L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
|..+..--+-=+.++++=|+++|.-+|+
T Consensus 310 le~~n~rwk~Lq~SV~~rl~qlrna~~d 337 (966)
T KOG4286|consen 310 LEDLNTRWKLLQVSVPDRLTQLRNAHRD 337 (966)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3334433333356788888888887774
No 416
>KOG3091|consensus
Probab=60.03 E-value=1.3e+02 Score=32.26 Aligned_cols=28 Identities=0% Similarity=-0.051 Sum_probs=14.2
Q ss_pred chHHHHHHHhhHHHHhhhhHHHHHHHHH
Q psy18020 368 APIYKQILNTFTEEAATEDTIYYMGEAL 395 (434)
Q Consensus 368 ~~l~~Ql~~l~Aed~AieDtiy~L~~aL 395 (434)
+.+...|=+|..-..+++|..+...+..
T Consensus 431 nq~k~Rl~~L~e~~r~q~~~~~~~~~~~ 458 (508)
T KOG3091|consen 431 NQLKARLDELYEILRMQNSQLKLQESYW 458 (508)
T ss_pred HHHHHHHHHHHHHHHhhcchhcccccee
Confidence 3344455555555555555555444443
No 417
>PF10498 IFT57: Intra-flagellar transport protein 57 ; InterPro: IPR019530 Eukaryotic cilia and flagella are specialised organelles found at the periphery of cells of diverse organisms. Intra-flagellar transport (IFT) is required for the assembly and maintenance of eukaryotic cilia and flagella, and consists of the bi-directional movement of large protein particles between the base and the distal tip of the organelle. IFT particles contain multiple copies of two distinct protein complexes, A and B, which contain at least 6 and 11 protein subunits. IFT57 is part of complex B but is not, however, required for the core subunits to stay associated []. This protein is known as Huntington-interacting protein-1 in humans.
Probab=59.99 E-value=95 Score=32.11 Aligned_cols=9 Identities=33% Similarity=0.707 Sum_probs=4.5
Q ss_pred hcCCCCCCC
Q psy18020 147 FSALPPLYS 155 (434)
Q Consensus 147 f~~~pp~~s 155 (434)
|.=..|.|.
T Consensus 119 F~~~~p~~~ 127 (359)
T PF10498_consen 119 FKWKRPIYP 127 (359)
T ss_pred cCccCCCCC
Confidence 544445554
No 418
>PF07851 TMPIT: TMPIT-like protein; InterPro: IPR012926 A number of members of this family are annotated as being transmembrane proteins induced by tumour necrosis factor alpha, but no literature was found to support this. ; GO: 0016021 integral to membrane
Probab=59.88 E-value=1.8e+02 Score=29.82 Aligned_cols=55 Identities=25% Similarity=0.256 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 299 EIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 299 ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
|.+.|.+.-.+|+.......+..+++.+.++.|.+.|..-+.+..++.+.+..++
T Consensus 5 EW~eL~~efq~Lqethr~Y~qKleel~~lQ~~C~ssI~~QkkrLk~L~~sLk~~~ 59 (330)
T PF07851_consen 5 EWEELQKEFQELQETHRSYKQKLEELSKLQDKCSSSISHQKKRLKELKKSLKRCK 59 (330)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3334444444444444445555666666666666666666666666666666553
No 419
>KOG0996|consensus
Probab=59.81 E-value=1.7e+02 Score=34.69 Aligned_cols=51 Identities=20% Similarity=0.320 Sum_probs=32.8
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
++...+++......++....+++....++++.+..+..++.+|++-+++.+
T Consensus 375 ~~e~~~~~kn~~~~~k~~~~~~e~~~vk~~E~lK~~~~k~kKleke~ek~~ 425 (1293)
T KOG0996|consen 375 IKERAKELKNKFESLKKKFQDLEREDVKREEKLKRLTSKIKKLEKEIEKAR 425 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444455556666777777777777777777777777777766553
No 420
>PRK14474 F0F1 ATP synthase subunit B; Provisional
Probab=59.59 E-value=1.6e+02 Score=28.76 Aligned_cols=14 Identities=14% Similarity=0.508 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHH
Q psy18020 402 LEVYLKQVRSLSRK 415 (434)
Q Consensus 402 ld~flK~vR~LaRe 415 (434)
+|.|++.+..|..+
T Consensus 150 id~~i~~l~~l~~~ 163 (250)
T PRK14474 150 VGIFIARLEHLSEA 163 (250)
T ss_pred HHHHHHHhcccCHH
Confidence 46677766666544
No 421
>KOG4674|consensus
Probab=59.57 E-value=4.5e+02 Score=33.05 Aligned_cols=54 Identities=9% Similarity=0.042 Sum_probs=39.2
Q ss_pred HHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHH---HHHHHHHHHHH
Q psy18020 371 YKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLS---RKQFMFRALLF 424 (434)
Q Consensus 371 ~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~La---ReQF~~Ra~~~ 424 (434)
..||-+++.+-.-+.|-+-.+---+.+-.--|.-|++.|..++ .+||---+++.
T Consensus 383 ~~sLtk~ys~~~~~qqqle~~~lele~~~~~l~s~~eev~~~~p~lk~qr~~~e~~~ 439 (1822)
T KOG4674|consen 383 GSSLTKLYSKYSKLQQQLESLKLELERLQNILSSFKEEVKQKAPILKEQRSELERMQ 439 (1822)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHH
Confidence 4688888888888888888777777777777888888888766 45554444443
No 422
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=59.46 E-value=1.6e+02 Score=27.76 Aligned_cols=33 Identities=15% Similarity=0.260 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
++..+......+..++..+..+..+.+++++-.
T Consensus 42 a~~~~a~~~a~~~~le~~~~~~~~~~~~~~~~A 74 (221)
T PF04012_consen 42 ARQALARVMANQKRLERKLDEAEEEAEKWEKQA 74 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555555555556666665555555555554433
No 423
>PRK14163 heat shock protein GrpE; Provisional
Probab=59.38 E-value=1.4e+02 Score=28.57 Aligned_cols=10 Identities=20% Similarity=0.405 Sum_probs=3.8
Q ss_pred HHHHHHHHHh
Q psy18020 303 LKQTENELNQ 312 (434)
Q Consensus 303 L~~~q~EL~~ 312 (434)
+.++++|+..
T Consensus 63 ~lR~~AEfeN 72 (214)
T PRK14163 63 LQRLQAEYQN 72 (214)
T ss_pred HHHHHHHHHH
Confidence 3333443333
No 424
>PF03915 AIP3: Actin interacting protein 3; InterPro: IPR022782 This entry represents a domain found in yeast actin interacting protein 3 and bud site selection protein 6. In these proteins it is typically found towards the C terminus. It is also found in metazoan proteins, such as the mouse enhancer trap locus 4 protein. ; PDB: 3ONX_B 3OKQ_A.
Probab=59.36 E-value=81 Score=33.40 Aligned_cols=72 Identities=15% Similarity=0.160 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 319 ~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
.....++..+.+|......|-++.++|+.+|+.|+.. =+--..-|.-+||=.+.-+-.+++-.|.-|..-+.
T Consensus 199 ~~R~~~~~~k~~L~~~sd~Ll~kVdDLQD~VE~LRkD------V~~RgvRp~~~qle~v~kdi~~a~~~L~~m~~~i~ 270 (424)
T PF03915_consen 199 SNRAYMESGKKKLSEESDRLLTKVDDLQDLVEDLRKD------VVQRGVRPSPKQLETVAKDISRASKELKKMKEYIK 270 (424)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHcCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5566777777777888888888888888888877421 11222333344444444333444444444444443
No 425
>PRK04654 sec-independent translocase; Provisional
Probab=59.35 E-value=1.4e+02 Score=28.52 Aligned_cols=20 Identities=5% Similarity=0.195 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy18020 324 IDKEKAELERSISFLKEKET 343 (434)
Q Consensus 324 Le~~k~~Le~~i~~L~~~~~ 343 (434)
++..+..++..++.+++..+
T Consensus 70 i~~~~~~lk~~~~el~q~a~ 89 (214)
T PRK04654 70 LREAEDQLRNTQQQVEQGAR 89 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33333344444444443333
No 426
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=59.29 E-value=92 Score=35.52 Aligned_cols=80 Identities=16% Similarity=0.216 Sum_probs=49.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----CCCCCccc-----cccccchHHHHHHHhhHHHHhhhhH
Q psy18020 318 ARMFERIDKEKAELERSISFLKEKETELDEILAVLTE-----KTEVDVDE-----AVTTTAPIYKQILNTFTEEAATEDT 387 (434)
Q Consensus 318 ~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~-----k~~~diDe-----~i~~~~~l~~Ql~~l~Aed~AieDt 387 (434)
..++.+|...+...+..+..|..+.+-+++-...|+. .++++|-. -.-+.+..++|=++.+=+..-+|.=
T Consensus 119 ~~~i~~l~~~~~~~e~~~~~l~~~l~~~eken~~Lkye~~~~~keleir~~E~~~~~~~ae~a~kqhle~vkkiakLEaE 198 (769)
T PF05911_consen 119 EKLIAELSEEKSQAEAEIEDLMARLESTEKENSSLKYELHVLSKELEIRNEEREYSRRAAEAASKQHLESVKKIAKLEAE 198 (769)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456666666666666666666666666655555531 12222211 1355677899999999998888887
Q ss_pred HHHHHHHHhc
Q psy18020 388 IYYMGEALRR 397 (434)
Q Consensus 388 iy~L~~aL~~ 397 (434)
-.-|.-++++
T Consensus 199 C~rLr~l~rk 208 (769)
T PF05911_consen 199 CQRLRALVRK 208 (769)
T ss_pred HHHHHHHHhc
Confidence 7766544443
No 427
>KOG0288|consensus
Probab=59.15 E-value=93 Score=32.68 Aligned_cols=21 Identities=19% Similarity=0.082 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy18020 403 EVYLKQVRSLSRKQFMFRALL 423 (434)
Q Consensus 403 d~flK~vR~LaReQF~~Ra~~ 423 (434)
|.|+=.-+-.+|-+|.+-++-
T Consensus 155 ~d~~v~~~lpS~~~~~ld~h~ 175 (459)
T KOG0288|consen 155 DDHFVEDTLPSRALFVLDAHE 175 (459)
T ss_pred Cchhhhcccchhhhhhhhccc
Confidence 344445556666666655543
No 428
>PLN02678 seryl-tRNA synthetase
Probab=59.03 E-value=63 Score=34.46 Aligned_cols=65 Identities=23% Similarity=0.225 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLA---RMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 288 ~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~---~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.+++.++..+++.|++.++++...-.+++ ...++|.++...|.+.+..|+.+.+++++-+..+
T Consensus 37 ~ld~~~r~l~~~~e~lr~erN~~sk~I~~~k~~~~~~~~l~~~~~~Lk~ei~~le~~~~~~~~~l~~~ 104 (448)
T PLN02678 37 ALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDAK 104 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 429
>PF06248 Zw10: Centromere/kinetochore Zw10; InterPro: IPR009361 Zeste white 10 (ZW10) was initially identified as a mitotic checkpoint protein involved in chromosome segregation, and then implicated in targeting cytoplasmic dynein and dynactin to mitotic kinetochores, but it is also important in non-dividing cells. These include cytoplasmic dynein targeting to Golgi and other membranes, and SNARE-mediated ER-Golgi trafficking [, ]. Dominant-negative ZW10, anti-ZW10 antibody, and ZW10 RNA interference (RNAi) cause Golgi dispersal. ZW10 RNAi also disperse endosomes and lysosomes []. Drosophila kinetochore components Rough deal (Rod) and Zw10 are required for the proper functioning of the metaphase checkpoint in flies []. The eukaryotic spindle assembly checkpoint (SAC) monitors microtubule attachment to kinetochores and prevents anaphase onset until all kinetochores are aligned on the metaphase plate. It is an essential surveillance mechanism that ensures high fidelity chromosome segregation during mitosis. In higher eukaryotes, cytoplasmic dynein is involved in silencing the SAC by removing the checkpoint proteins Mad2 and the Rod-Zw10-Zwilch complex (RZZ) from aligned kinetochores [, , ].; GO: 0007067 mitosis, 0000775 chromosome, centromeric region, 0005634 nucleus
Probab=59.03 E-value=2.8e+02 Score=30.46 Aligned_cols=17 Identities=24% Similarity=0.417 Sum_probs=7.0
Q ss_pred HhhhhHHHHHHHHHhcC
Q psy18020 382 AATEDTIYYMGEALRRG 398 (434)
Q Consensus 382 ~AieDtiy~L~~aL~~g 398 (434)
.-|++.+..+.+|+..|
T Consensus 106 ~~i~~~l~~~~~al~~~ 122 (593)
T PF06248_consen 106 QEIDELLEEVEEALKEG 122 (593)
T ss_pred HHHHHHHHHHHHHHhcC
Confidence 33444444444444433
No 430
>PRK14068 exodeoxyribonuclease VII small subunit; Provisional
Probab=59.01 E-value=58 Score=26.01 Aligned_cols=57 Identities=14% Similarity=0.188 Sum_probs=45.2
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 296 ~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.++.++.|...-+.|+.+.-.|++.+..-+++..-++..-..|+....++..++...
T Consensus 8 fEeal~~Le~IV~~LE~gdl~Leesl~lyeeG~~L~k~C~~~L~~ae~kv~~l~~~~ 64 (76)
T PRK14068 8 FEEMMQELEQIVQKLDNETVSLEESLDLYQRGMKLSAACDTTLKNAEKKVNDLIKEE 64 (76)
T ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 456677778888888888888888888888888888888888887777877777643
No 431
>PRK11546 zraP zinc resistance protein; Provisional
Probab=58.92 E-value=98 Score=27.79 Aligned_cols=19 Identities=11% Similarity=0.286 Sum_probs=8.4
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy18020 315 FKLARMFERIDKEKAELER 333 (434)
Q Consensus 315 ~~L~~~~~eLe~~k~~Le~ 333 (434)
++|+....|+...+.+|.+
T Consensus 89 ~kI~aL~kEI~~Lr~kL~e 107 (143)
T PRK11546 89 SKINAVAKEMENLRQSLDE 107 (143)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444443
No 432
>PHA03161 hypothetical protein; Provisional
Probab=58.91 E-value=54 Score=29.61 Aligned_cols=18 Identities=6% Similarity=0.215 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy18020 272 HIKMSLKSAVEDKIRRKM 289 (434)
Q Consensus 272 ~~r~sl~sal~dkL~~~L 289 (434)
.++..|..+|..++.-++
T Consensus 8 li~~~lEa~VnKr~aVS~ 25 (150)
T PHA03161 8 FLCSAFEAEINKKASVSL 25 (150)
T ss_pred HHHHHHHHHHHhhhhhhH
Confidence 344455555555544444
No 433
>PF08900 DUF1845: Domain of unknown function (DUF1845); InterPro: IPR014996 Members of this protein family, such as PFL4669, are found in integrating conjugative elements (ICE) of the PFGI-1 class as in Pseudomonas fluorescens.
Probab=58.77 E-value=83 Score=30.12 Aligned_cols=51 Identities=14% Similarity=0.187 Sum_probs=31.1
Q ss_pred ccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHH---HHHHHHHHHHHHH
Q psy18020 364 VTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEV---YLKQVRSLSRKQF 417 (434)
Q Consensus 364 i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~---flK~vR~LaReQF 417 (434)
|.-.+|+.=+++.|+.+ .|+.+..+..|-+.|.|+=++ ||+....+-|.=|
T Consensus 111 l~~~splGy~~v~LL~~---yD~L~~~v~~a~h~glis~~~~~~~l~~~~~~iR~v~ 164 (217)
T PF08900_consen 111 LFFRSPLGYRCVYLLVD---YDQLARKVLTAWHYGLISRQEREDWLRRGGRAIRRVF 164 (217)
T ss_pred eEecCHHHHHHHHHHHH---HHHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHHH
Confidence 34456666555555543 566677777788889886544 4555555555444
No 434
>KOG4673|consensus
Probab=58.73 E-value=3.1e+02 Score=30.91 Aligned_cols=140 Identities=12% Similarity=0.132 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGK----FKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 275 ~sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~----~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.++..++++++...-...-....+.-+.-+.-|.++..++ +.+.+....|..-...+-..-+.++.++.+|.+.+.
T Consensus 543 r~~~~~le~~~~a~qat~d~a~~Dlqk~nrlkQdear~~~~~lvqqv~dLR~~L~~~Eq~aarrEd~~R~Ei~~LqrRlq 622 (961)
T KOG4673|consen 543 RALAAALEAQALAEQATNDEARSDLQKENRLKQDEARERESMLVQQVEDLRQTLSKKEQQAARREDMFRGEIEDLQRRLQ 622 (961)
T ss_pred HHHHHHHHHHHHHHHHhhhhhhhhHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy18020 351 VLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 351 ~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaR 414 (434)
..+++-+--+.++-.++.||.+||=.|.+--..-.-+-.-+.+-|+..-=+-.+.|+..-.+.|
T Consensus 623 aaE~R~eel~q~v~~TTrPLlRQIE~lQ~tl~~~~tawereE~~l~~rL~dSQtllr~~v~~eq 686 (961)
T KOG4673|consen 623 AAERRCEELIQQVPETTRPLLRQIEALQETLSKAATAWEREERSLNERLSDSQTLLRINVLEEQ 686 (961)
T ss_pred HHHHHHHHHHhhccccccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHh
No 435
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=58.67 E-value=2.9e+02 Score=30.62 Aligned_cols=29 Identities=14% Similarity=0.280 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 320 MFERIDKEKAELERSISFLKEKETELDEI 348 (434)
Q Consensus 320 ~~~eLe~~k~~Le~~i~~L~~~~~eLe~~ 348 (434)
.+.+++..+.+++..+..++.+.+...+.
T Consensus 445 ~~~~ik~~r~~~k~~~~e~~~Kee~~~qL 473 (594)
T PF05667_consen 445 KLQEIKELREEIKEIEEEIRQKEELYKQL 473 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444433333
No 436
>KOG3976|consensus
Probab=58.64 E-value=1.4e+02 Score=29.01 Aligned_cols=61 Identities=15% Similarity=0.112 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 290 NEQKVQAQEEIEILKQTENELNQGKFK---LARMFERIDKEKAELERSISFLKEKETELDEILAV 351 (434)
Q Consensus 290 ~e~~~q~~~ele~L~~~q~EL~~~~~~---L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~ 351 (434)
++.+.++++.+++-.++|+ |....+- ..++--.|+.|...+|+.....++-...||.|++.
T Consensus 139 q~~ik~i~d~id~~~sqq~-~~~~~~~lfd~~keni~l~lE~~yre~~~~v~~E~K~~lDy~v~~ 202 (247)
T KOG3976|consen 139 QAHIKAISDAIDTEKSQQA-LASKTEYLFDVSKENIALQLEATYREQLVRVAKEVKRRLDYWVET 202 (247)
T ss_pred HHHHHHHHHHHhhHHHhHH-HHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444445555555445554 5443322 34555667777777777777777777777777764
No 437
>PRK04406 hypothetical protein; Provisional
Probab=58.59 E-value=91 Score=24.75 Aligned_cols=37 Identities=5% Similarity=0.096 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
.++.+++|...-..-.+.|..|..+...|.+-+..+.
T Consensus 23 QE~tIe~LN~~v~~Qq~~I~~L~~ql~~L~~rl~~~~ 59 (75)
T PRK04406 23 QEQTIEELNDALSQQQLLITKMQDQMKYVVGKVKNMD 59 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4555555555555555555556555555555554443
No 438
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=58.52 E-value=2.2e+02 Score=29.06 Aligned_cols=31 Identities=13% Similarity=0.314 Sum_probs=18.7
Q ss_pred CCCHHHHHHHHHHHH-------HHHHHHHHHHHHHHHh
Q psy18020 399 VIDLEVYLKQVRSLS-------RKQFMFRALLFKCYKH 429 (434)
Q Consensus 399 ~I~ld~flK~vR~La-------ReQF~~Ra~~~Ki~~~ 429 (434)
.||+|.-+=.-|-|- -|-=+.+..|.|....
T Consensus 188 ivDIDaLi~ENRyL~erl~q~qeE~~l~k~~i~KYK~~ 225 (319)
T PF09789_consen 188 IVDIDALIMENRYLKERLKQLQEEKELLKQTINKYKSA 225 (319)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 678988877777653 2223446666665543
No 439
>COG1340 Uncharacterized archaeal coiled-coil protein [Function unknown]
Probab=58.49 E-value=2.1e+02 Score=28.83 Aligned_cols=120 Identities=14% Similarity=0.089 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVD 361 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diD 361 (434)
.+.|+....+....+..-.+.....+.+|...+++.+....+.+.....+.......++...++...-..+ -|++
T Consensus 167 i~~lk~~~~e~~eki~~la~eaqe~he~m~k~~~~~De~Rkeade~he~~ve~~~~~~e~~ee~~~~~~el-----re~~ 241 (294)
T COG1340 167 IDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNEL-----RELE 241 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH-----HHHH
Q ss_pred ccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHH
Q psy18020 362 EAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYL 406 (434)
Q Consensus 362 e~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~fl 406 (434)
..|-+--.--++.-.....|...+.+--.+.+..+..++|+|.++
T Consensus 242 k~ik~l~~~~~~~~~~~~~ee~kera~ei~EKfk~GekLt~Eell 286 (294)
T COG1340 242 KKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEKLTTEELL 286 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHH
No 440
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=58.41 E-value=3.3e+02 Score=31.19 Aligned_cols=91 Identities=13% Similarity=0.161 Sum_probs=41.0
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~ 382 (434)
+.++++|.......+......|+.....++..-+.|+...+++++..+.+..++.. . +.....+...+++-++-+.-.
T Consensus 116 ~~rl~~E~er~~~El~~lr~~lE~~q~~~e~~q~~l~~~~eei~kL~e~L~~~g~~-~-~~~~~~~~~~~~~~~~e~~~~ 193 (775)
T PF10174_consen 116 FERLQAERERLQRELERLRKTLEELQLRIETQQQTLDKADEEIEKLQEMLQSKGLS-A-EAEEEDNEALRRIREAEARIM 193 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-c-cchhhhhHHHHHHHHHHHHHH
Confidence 33444444443333333334444444444444444444444444444444333322 1 223334445566666655555
Q ss_pred hhhhHHHHHHHHH
Q psy18020 383 ATEDTIYYMGEAL 395 (434)
Q Consensus 383 AieDtiy~L~~aL 395 (434)
-++..+--.++..
T Consensus 194 ~le~lle~~e~~~ 206 (775)
T PF10174_consen 194 RLESLLERKEKEH 206 (775)
T ss_pred HHHHHHHHHHHHh
Confidence 5555554444443
No 441
>PRK05431 seryl-tRNA synthetase; Provisional
Probab=58.21 E-value=79 Score=33.38 Aligned_cols=84 Identities=21% Similarity=0.129 Sum_probs=0.0
Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 269 TDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLAR------MFERIDKEKAELERSISFLKEKE 342 (434)
Q Consensus 269 ~~e~~r~sl~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~------~~~eLe~~k~~Le~~i~~L~~~~ 342 (434)
+++.++++++..-.+..-.++.+.-.+.+..+.++..++++.+.-..++.. ..++|.++..+|++.+..|+++.
T Consensus 10 n~~~v~~~l~~R~~~~~vd~i~~ld~~~r~l~~~~~~lr~~rn~~sk~i~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~~ 89 (425)
T PRK05431 10 NPEAVKEALAKRGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAEL 89 (425)
T ss_pred CHHHHHHHHHhcCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHh
Q psy18020 343 TELDEILAVL 352 (434)
Q Consensus 343 ~eLe~~I~~l 352 (434)
.++++-+...
T Consensus 90 ~~~~~~~~~~ 99 (425)
T PRK05431 90 DELEAELEEL 99 (425)
T ss_pred HHHHHHHHHH
No 442
>PF04568 IATP: Mitochondrial ATPase inhibitor, IATP; InterPro: IPR007648 ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=58.20 E-value=83 Score=26.50 Aligned_cols=23 Identities=22% Similarity=0.351 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 327 EKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 327 ~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
.+++|+..+..-+++++++++.|
T Consensus 77 Lk~kl~~e~~~~~k~i~~le~~I 99 (100)
T PF04568_consen 77 LKEKLKEEIEHHRKEIDELEKHI 99 (100)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Confidence 33333333334444444444433
No 443
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=58.06 E-value=1.7e+02 Score=27.63 Aligned_cols=32 Identities=22% Similarity=0.294 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEI 348 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~ 348 (434)
++..+..|+-+...|+.....|..+.++|.+-
T Consensus 98 ~ek~l~~Lk~e~evL~qr~~kle~ErdeL~~k 129 (201)
T PF13851_consen 98 LEKELKDLKWEHEVLEQRFEKLEQERDELYRK 129 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46666677777777777777777777777543
No 444
>KOG1962|consensus
Probab=58.01 E-value=1.1e+02 Score=29.36 Aligned_cols=93 Identities=16% Similarity=0.181 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy18020 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFK---LARMFERIDKEKAELERSISFLKEKETELDEILAVLT 353 (434)
Q Consensus 277 l~sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~---L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~ 353 (434)
..+-|.+++..-+.+...-... |+.......|+...++ ++.+...+.++.+.|++.++..+.+.+.+++-...++
T Consensus 108 FL~lvI~R~~~ll~~l~~l~~~--~~~~~~~~~lk~~~~~~~~~~~~~~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~ 185 (216)
T KOG1962|consen 108 FLSLVIRRLHTLLRELATLRAN--EKAMKENEALKKQLENSSKLEEENDKLKADLEKLETELEKKQKKLEKAQKKVDALK 185 (216)
T ss_pred HHHHHHHHHHHHHHHHHHHHhh--HHHHHHHHHHHHhhhcccchhhhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred CCCCCCccccccccchHHHHHHHhhHHHHh
Q psy18020 354 EKTEVDVDEAVTTTAPIYKQILNTFTEEAA 383 (434)
Q Consensus 354 ~k~~~diDe~i~~~~~l~~Ql~~l~Aed~A 383 (434)
..-+---|| ||-.+||+.
T Consensus 186 Kq~e~~~~E------------ydrLlee~~ 203 (216)
T KOG1962|consen 186 KQSEGLQDE------------YDRLLEEYS 203 (216)
T ss_pred HHHHHcccH------------HHHHHHHHH
No 445
>PRK13428 F0F1 ATP synthase subunit delta; Provisional
Probab=57.87 E-value=1.5e+02 Score=31.43 Aligned_cols=36 Identities=17% Similarity=0.253 Sum_probs=17.1
Q ss_pred HHhhhhHHHHHHHHHh-cCCCCHHHHHHHHHHHHHHHH
Q psy18020 381 EAATEDTIYYMGEALR-RGVIDLEVYLKQVRSLSRKQF 417 (434)
Q Consensus 381 d~AieDtiy~L~~aL~-~g~I~ld~flK~vR~LaReQF 417 (434)
+.-|++-|.-|+..=. ...++...- -++|.-|||-.
T Consensus 144 ~~lId~~i~~l~~~~~~~~~~~~~~~-~~m~~~sr~~~ 180 (445)
T PRK13428 144 SATVDRFLDELDAMAPSTADVDYPLL-AKMRSASRRAL 180 (445)
T ss_pred HHHHHHHHHHhhccCCCchhhcCchh-hhhhHhhHHHH
Confidence 3445555555544222 113333233 33788888843
No 446
>PF07246 Phlebovirus_NSM: Phlebovirus nonstructural protein NS-M; InterPro: IPR009879 This entry consists of several Phlebovirus nonstructural NS-M proteins, which represent the N-terminal region of the M polyprotein precursor. The function of this family is unknown.
Probab=57.85 E-value=61 Score=31.96 Aligned_cols=38 Identities=13% Similarity=0.238 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
++....++......++.....|+.+..++++-+..+++
T Consensus 200 ~~~~~~e~~~r~~~lr~~~~~l~~el~~aK~~~~~~~~ 237 (264)
T PF07246_consen 200 EKILHEELEARESGLRNESKWLEHELSDAKEDMIRLRN 237 (264)
T ss_pred HHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44455555555555666666666666666666655543
No 447
>PF07956 DUF1690: Protein of Unknown function (DUF1690) ; InterPro: IPR012471 Family of uncharacterised fungal proteins.
Probab=57.84 E-value=79 Score=28.29 Aligned_cols=17 Identities=18% Similarity=0.282 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy18020 280 AVEDKIRRKMNEQKVQA 296 (434)
Q Consensus 280 al~dkL~~~L~e~~~q~ 296 (434)
.++..|+.|+.+.+.++
T Consensus 31 ~~e~~iq~Rva~eL~~L 47 (142)
T PF07956_consen 31 TLELHIQERVAEELKRL 47 (142)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34445566655544443
No 448
>cd07605 I-BAR_IMD Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), a dimerization module that binds and bends membranes. Inverse (I)-BAR (or IMD) is a member of the Bin/Amphiphysin/Rvs (BAR) domain family. It is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions in the opposite direction compared to classical BAR and F-BAR domains, which produce membrane invaginations. IMD domains are found in Insulin Receptor tyrosine kinase Substrate p53 (IRSp53), Missing in Metastasis (MIM), and Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-like (BAIAP2L) proteins. These are multi-domain proteins that act as scaffolding proteins and transducers of a variety of signaling pathways that link membrane dynamics and the underlying actin cytoskeleton. Most members contain an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus, exccept for MIM which does not carry an SH3 domain. Some me
Probab=57.83 E-value=1.5e+02 Score=28.50 Aligned_cols=25 Identities=12% Similarity=0.118 Sum_probs=10.6
Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHH
Q psy18020 391 MGEALRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 391 L~~aL~~g~I~ld~flK~vR~LaRe 415 (434)
|.+||...+=.+--|+-.-..+..+
T Consensus 176 lr~al~EERrRyc~lv~~~c~v~~~ 200 (223)
T cd07605 176 LRDALLEERRRYCFLVDKHCSVAKH 200 (223)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444433344444444444433
No 449
>TIGR02132 phaR_Bmeg polyhydroxyalkanoic acid synthase, PhaR subunit. This model describes a protein, PhaR, localized to polyhydroxyalkanoic acid (PHA) inclusion granules in Bacillus cereus and related species. PhaR is required for PHA biosynthesis along with PhaC and may be a regulatory subunit.
Probab=57.78 E-value=1.5e+02 Score=27.70 Aligned_cols=7 Identities=29% Similarity=0.529 Sum_probs=3.4
Q ss_pred HHHHHHH
Q psy18020 370 IYKQILN 376 (434)
Q Consensus 370 l~~Ql~~ 376 (434)
+..||++
T Consensus 162 ~~~~~~~ 168 (189)
T TIGR02132 162 LQAQLLE 168 (189)
T ss_pred HHHHHHH
Confidence 4445554
No 450
>KOG1760|consensus
Probab=57.77 E-value=86 Score=27.43 Aligned_cols=34 Identities=18% Similarity=0.377 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 319 RMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 319 ~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.....|+..+..+++.|..|+.+.+.++..++.+
T Consensus 81 ~~~~~LEe~ke~l~k~i~~les~~e~I~~~m~~L 114 (131)
T KOG1760|consen 81 KLQDQLEEKKETLEKEIEELESELESISARMDEL 114 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555555555555555555554444
No 451
>PRK09465 tolC outer membrane channel protein; Reviewed
Probab=57.76 E-value=1.4e+02 Score=30.84 Aligned_cols=43 Identities=12% Similarity=0.057 Sum_probs=31.7
Q ss_pred CCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh
Q psy18020 354 EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396 (434)
Q Consensus 354 ~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~ 396 (434)
..+..+.-|++.+..-+.+-..+++.-....-.+++-|.+++.
T Consensus 380 ~~G~~~~~dvl~a~~~l~~a~~~~~~a~~~~~~a~~~L~~~~G 422 (446)
T PRK09465 380 EVGTRTIVDVLDATTTLYDAKQQLSNARYNYLINQLNLKQALG 422 (446)
T ss_pred HcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 3566778888888888888878877777777777777777653
No 452
>KOG4117|consensus
Probab=57.59 E-value=62 Score=25.00 Aligned_cols=26 Identities=15% Similarity=0.202 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 327 EKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 327 ~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
..+.+-.+|..+...++.||+.|..|
T Consensus 35 MSDQII~RiDDM~~riDDLEKnIaDL 60 (73)
T KOG4117|consen 35 MSDQIIGRIDDMSSRIDDLEKNIADL 60 (73)
T ss_pred HHHHHHHHHhhhhhhhHHHHHHHHHH
Confidence 44455556666666777777777766
No 453
>PF04880 NUDE_C: NUDE protein, C-terminal conserved region; InterPro: IPR006964 This domain represents the C-terminal conserved region of NUDE proteins. Emericella nidulans (Aspergillus nidulans) NUDE, acts in the cytoplasmic dynein/dynactin pathway and is required for distribution of nuclei []. It is a homologue of the nuclear distribution protein RO11 of Neurospora crassa. NUDE interacts with the NUDF via an N-terminal coiled coil domain; this is the only domain which is absolutely required for NUDE function.; PDB: 2V66_B 2V71_B.
Probab=57.59 E-value=19 Score=33.14 Aligned_cols=52 Identities=29% Similarity=0.375 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy18020 302 ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355 (434)
Q Consensus 302 ~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k 355 (434)
.|.+++..|+..-++-.-+..|| .+++.|...+++|+++..+|..-+ .+.++
T Consensus 1 SLeD~EsklN~AIERnalLE~EL-dEKE~L~~~~QRLkDE~RDLKqEl-~V~ek 52 (166)
T PF04880_consen 1 SLEDFESKLNQAIERNALLESEL-DEKENLREEVQRLKDELRDLKQEL-IVQEK 52 (166)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHCH--------------------
T ss_pred CHHHHHHHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHH
No 454
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=57.56 E-value=1.2e+02 Score=25.79 Aligned_cols=31 Identities=26% Similarity=0.412 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
|+..+.+|.++-..|+-.-..|++..+++++
T Consensus 27 LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~ 57 (107)
T PF06156_consen 27 LKKQLQELLEENARLRIENEHLRERLEELEQ 57 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 5555555555555555555555544444443
No 455
>KOG3080|consensus
Probab=57.32 E-value=44 Score=33.45 Aligned_cols=54 Identities=19% Similarity=0.133 Sum_probs=30.3
Q ss_pred CCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCC---CHHHHHHHH
Q psy18020 356 TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVI---DLEVYLKQV 409 (434)
Q Consensus 356 ~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I---~ld~flK~v 409 (434)
.+++.|+.|++..|--.=--+|.=--.|++-++-++.++.+-|+- +.|-|-..+
T Consensus 120 ~ei~~~~~v~~~~~eDDf~RElaFYkQa~~aVl~A~~rL~elgV~~~RP~DYfAEMa 176 (328)
T KOG3080|consen 120 VEIDTDEAVIPNNPEDDFARELAFYKQALSAVLEAFPRLHELGVPFLRPTDYFAEMA 176 (328)
T ss_pred cccccCcccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccCchHHHHHHh
Confidence 356777777766321111112222234566677889999888855 455555444
No 456
>KOG4677|consensus
Probab=57.28 E-value=2.6e+02 Score=29.80 Aligned_cols=128 Identities=12% Similarity=0.036 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCCCCcccc
Q psy18020 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT---EKTEVDVDEA 363 (434)
Q Consensus 287 ~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~---~k~~~diDe~ 363 (434)
++|.+.+....+.+-.++.+|.+|.-..+++++..+....++......+..++.+...+-+.+.... +...-.|||
T Consensus 192 ~~ll~~Rve~le~~Sal~~lq~~L~la~~~~~~~~e~~i~~~~~f~~r~~~~E~e~rn~~E~~~lA~r~l~~~kKe~de- 270 (554)
T KOG4677|consen 192 RRLLKGRVESLERFSALRSLQDKLQLAEEAVSMHDENVITAVLIFLKRTLSKEIEFRNELEVRQLALRHLIHFKKEIDE- 270 (554)
T ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Q ss_pred ccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy18020 364 VTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALLFKCYKHISYAR 434 (434)
Q Consensus 364 i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~~Ki~~~~~l~~ 434 (434)
++++++++-=..+-|..+.++.++-.-|.=+ |.--..|+.+.++|-..-+.+
T Consensus 271 -------~k~~~~l~~~l~~keeL~~s~~~e~~i~qs~------------~kstas~~E~ee~rve~~~s~ 322 (554)
T KOG4677|consen 271 -------QKLLLDLFRFLDRKEELALSHYREHLIIQSP------------DKSTASRKEFEETRVELPFSA 322 (554)
T ss_pred -------HHHHHHHHHHhhhHHHHHHHHHHHhhccCCC------------CcchhHHHHHHHHHhcccccH
No 457
>PRK14149 heat shock protein GrpE; Provisional
Probab=57.14 E-value=1.6e+02 Score=27.69 Aligned_cols=46 Identities=9% Similarity=-0.075 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhH
Q psy18020 332 ERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFT 379 (434)
Q Consensus 332 e~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~A 379 (434)
+..+..|-.-.+.|++.+.... .+.+...++.+-.-+++||+..+.
T Consensus 86 ~~~~~~LLpVlDnLerAl~~~~--~~~~~~~l~~Gv~mi~k~l~~vL~ 131 (191)
T PRK14149 86 EKIALDLLPVIDALLGALKSAA--EVDKESALTKGLELTMEKLHEVLA 131 (191)
T ss_pred HHHHHHHhhHHhHHHHHHhccc--cccchHHHHHHHHHHHHHHHHHHH
Confidence 3444444444455555544221 111223333344444455444443
No 458
>PF06637 PV-1: PV-1 protein (PLVAP); InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=57.11 E-value=1.7e+02 Score=30.44 Aligned_cols=12 Identities=17% Similarity=0.086 Sum_probs=5.8
Q ss_pred HHhhhhHHHHHH
Q psy18020 381 EAATEDTIYYMG 392 (434)
Q Consensus 381 d~AieDtiy~L~ 392 (434)
..|++++|..=.
T Consensus 383 ~saLdtCikaKs 394 (442)
T PF06637_consen 383 TSALDTCIKAKS 394 (442)
T ss_pred hhHHHHHHHhcc
Confidence 345555555433
No 459
>KOG0239|consensus
Probab=56.99 E-value=87 Score=35.16 Aligned_cols=94 Identities=17% Similarity=0.223 Sum_probs=0.0
Q ss_pred HHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 276 SLKSAVEDKIR---------RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346 (434)
Q Consensus 276 sl~sal~dkL~---------~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe 346 (434)
..+..+++.+. ..+.......+..+..|++.-.+|+.....+++....+..+...+-+.+..+..+.+++.
T Consensus 210 ~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~i~~l~~~l~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~L~~~~ 289 (670)
T KOG0239|consen 210 EERRVLADSLGNYADLRRNIKPLEGLESTIKKKIQALQQELEELKAELKELNDQVSLLTREVQEALKESNTLQSDLESLE 289 (670)
T ss_pred HHHHHHHHHhhhhhhHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHhcCCCCCCccccccccchHHHHHHHh
Q psy18020 347 EILAVLTEKTEVDVDEAVTTTAPIYKQILNT 377 (434)
Q Consensus 347 ~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l 377 (434)
..+.+-. +++..--. ++|++.+|
T Consensus 290 ~~l~~~~-~e~~~r~k-------L~N~i~eL 312 (670)
T KOG0239|consen 290 ENLVEKK-KEKEERRK-------LHNEILEL 312 (670)
T ss_pred HHHHHHH-HHHHHHHH-------HHHHHHHh
No 460
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=56.89 E-value=80 Score=27.01 Aligned_cols=34 Identities=15% Similarity=0.241 Sum_probs=19.9
Q ss_pred HHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy18020 381 EAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 381 d~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaR 414 (434)
+..+++|+.+|.+-++.=.=.++..-+....|.+
T Consensus 81 E~~~~eA~~~l~~~~~~l~~~~~~l~~~l~~l~~ 114 (126)
T TIGR00293 81 EKDAEEAIEFLKKRIEELEKAIEKLQEALAELAS 114 (126)
T ss_pred EecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3667888888888765433344444444444433
No 461
>TIGR01845 outer_NodT efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family. Members of this model comprise a subfamily of the Outer Membrane Factor (TCDB 1.B.17) porins. OMF proteins operate in conjunction with a primary transporter of the RND, MFS, ABC, or PET systems, and a MFP (membrane fusion protein) to tranport substrates across membranes. The complex thus formed allows transport (export) of various solutes (heavy metal cations; drugs, oligosaccharides, proteins, etc.) across the two envelopes of the Gram-negative bacterial cell envelope in a single energy-coupled step. Current data suggest that the OMF (and not the MFP) is largely responsible for the formation of both the trans-outer membrane and trans-periplasmic channels. The roles played by the MFP have yet to be determined.
Probab=56.88 E-value=1.7e+02 Score=30.21 Aligned_cols=45 Identities=11% Similarity=-0.048 Sum_probs=34.1
Q ss_pred CCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC
Q psy18020 354 EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRG 398 (434)
Q Consensus 354 ~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g 398 (434)
..+..+.-|++.+..-+.+.-.+++.-....--+...|-+||--|
T Consensus 410 ~~G~~~~~~vl~aq~~~~~a~~~~~~a~~~~~~a~v~L~~alGGg 454 (454)
T TIGR01845 410 RAGLDSYLTVLEAQRSLLTAQRSLATLQARRLSDSVALYKALGGG 454 (454)
T ss_pred HcCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Confidence 467788888888888888887887777777777777777777543
No 462
>COG3096 MukB Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=56.86 E-value=2.9e+02 Score=31.49 Aligned_cols=120 Identities=13% Similarity=0.176 Sum_probs=60.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI----DKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 279 sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eL----e~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
-+|-.+|+.|++..+-+..+-++.+.++.-||..-.+.|..-..+| +...+.+.+-|++=+..+..|..-+....-
T Consensus 1188 IAVYQHLRERIRQDIIrTDDPveAIEqMEiEL~RLTeELT~REqkLAISS~SVanIiRKTIqREQNRIRMLNQGLqnisF 1267 (1480)
T COG3096 1188 IAVYQHLRERIRQDIIRTDDPVEAIEQMEIELSRLTEELTSREQKLAISSRSVANIIRKTIQREQNRIRMLNQGLQNVSF 1267 (1480)
T ss_pred HHHHHHHHHHHHhhhhccCChHHHHHHHHHHHHHHHHHHhhHHHhhhcchHHHHHHHHHHHHHHHHHHHHHHhhhhhccc
Confidence 3677788888877665555556666666666555443333322222 234444445555544445555544443321
Q ss_pred CCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHH
Q psy18020 355 KTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVY 405 (434)
Q Consensus 355 k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~f 405 (434)
+.++--.+.+---.-+.-|++..++..+ -...+|+..+++|..-
T Consensus 1268 -GQV~sVRL~VnvRetHa~LLd~LS~qQ~------q~qDLF~~~R~TFSEA 1311 (1480)
T COG3096 1268 -GQVNSVRLNVNVRETHAMLLDVLSEQQE------QHQDLFNSNRLTFSEA 1311 (1480)
T ss_pred -cccceEEEEeehHHHHHHHHHHHhhhHH------HHHHHHhhccchHHHH
Confidence 1111111122222256667776665443 3445566666666543
No 463
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=56.77 E-value=87 Score=26.83 Aligned_cols=28 Identities=21% Similarity=0.278 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 317 LARMFERIDKEKAELERSISFLKEKETE 344 (434)
Q Consensus 317 L~~~~~eLe~~k~~Le~~i~~L~~~~~e 344 (434)
|+..+.+|-++-..|+-.-+.|++..++
T Consensus 27 LK~~~~el~EEN~~L~iEN~~Lr~~l~~ 54 (110)
T PRK13169 27 LKKQLAELLEENTALRLENDKLRERLEE 54 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444444444444444443
No 464
>PF05700 BCAS2: Breast carcinoma amplified sequence 2 (BCAS2); InterPro: IPR008409 This family consists of several eukaryotic sequences of unknown function. The mammalian members of this family are annotated as breast carcinoma amplified sequence 2 (BCAS2) proteins []. BCAS2 is a putative spliceosome associated protein [].
Probab=56.76 E-value=1.6e+02 Score=28.05 Aligned_cols=19 Identities=26% Similarity=0.553 Sum_probs=9.3
Q ss_pred ccCCCCc-cc-cc-cchhHHHH
Q psy18020 122 YSPYLHT-WT-AQ-NSKLEGLI 140 (434)
Q Consensus 122 ~~pyl~~-w~-~~-~s~l~~l~ 140 (434)
.|||+.. .. +. ......||
T Consensus 11 aLPYiD~~~~~~~~~~~a~~lI 32 (221)
T PF05700_consen 11 ALPYIDPDYDTPEERQAAEALI 32 (221)
T ss_pred CCCCCCCCCCCHHHHHHHHHHH
Confidence 5777754 42 21 33344555
No 465
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=56.75 E-value=90 Score=29.09 Aligned_cols=18 Identities=22% Similarity=0.268 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy18020 332 ERSISFLKEKETELDEIL 349 (434)
Q Consensus 332 e~~i~~L~~~~~eLe~~I 349 (434)
...+..|.-+...+++-+
T Consensus 150 ~DE~~~L~l~~~~~e~k~ 167 (194)
T PF08614_consen 150 QDELQALQLQLNMLEEKL 167 (194)
T ss_dssp HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333333333333333
No 466
>PF11855 DUF3375: Protein of unknown function (DUF3375); InterPro: IPR021804 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Probab=56.64 E-value=76 Score=34.03 Aligned_cols=81 Identities=25% Similarity=0.301 Sum_probs=57.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCC--CCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCC--------
Q psy18020 332 ERSISFLKEKETELDEILAVLTEK--TEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVID-------- 401 (434)
Q Consensus 332 e~~i~~L~~~~~eLe~~I~~l~~k--~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~-------- 401 (434)
+..+..|+.+++++++-|+.++.+ ..+|...+..--..++++.-+|.+.=.-.+|-+..|++.|....++
T Consensus 143 ~~Ri~~Le~e~~~i~~EI~~l~aG~~~~ld~~~~~er~~~i~~la~~L~~DFr~V~~~~r~l~r~lr~~i~~~~~~~G~v 222 (478)
T PF11855_consen 143 ERRIAELEREIAEIDAEIDRLEAGDVPVLDDTQARERARQILQLARELPADFRRVEDNFRELDRALRERIIDWDGSRGEV 222 (478)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccHHHH
Confidence 356666777777777777777654 3346666677777788888888888899999999999998765443
Q ss_pred HHHHHHHHHHH
Q psy18020 402 LEVYLKQVRSL 412 (434)
Q Consensus 402 ld~flK~vR~L 412 (434)
|+.++.....|
T Consensus 223 L~~~~~~~d~l 233 (478)
T PF11855_consen 223 LDEYFDGYDAL 233 (478)
T ss_pred HHHHHHhHHHH
Confidence 55555555554
No 467
>KOG3231|consensus
Probab=56.52 E-value=45 Score=30.61 Aligned_cols=22 Identities=18% Similarity=0.312 Sum_probs=17.0
Q ss_pred cCCCCHHHHHHHHHHHHHHHHH
Q psy18020 397 RGVIDLEVYLKQVRSLSRKQFM 418 (434)
Q Consensus 397 ~g~I~ld~flK~vR~LaReQF~ 418 (434)
+.+.+++.-+..+|..-.++..
T Consensus 112 Nk~M~pek~~~tmr~FQ~anmK 133 (208)
T KOG3231|consen 112 NKKMDPEKTLQTMRNFQKANMK 133 (208)
T ss_pred HccCCHHHHHHHHHHHHHHHHH
Confidence 4578999999999977666543
No 468
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=56.33 E-value=2.1e+02 Score=28.25 Aligned_cols=40 Identities=20% Similarity=0.272 Sum_probs=24.2
Q ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342 (434)
Q Consensus 303 L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~ 342 (434)
+.+.++||..-+++.+..+..|+..-+.|+..|....++.
T Consensus 65 l~~ak~eLqe~eek~e~~l~~Lq~ql~~l~akI~k~~~el 104 (258)
T PF15397_consen 65 LQQAKAELQEWEEKEESKLSKLQQQLEQLDAKIQKTQEEL 104 (258)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5556666666666666666666666666666555555443
No 469
>KOG4286|consensus
Probab=56.31 E-value=1e+02 Score=34.74 Aligned_cols=21 Identities=29% Similarity=0.461 Sum_probs=12.1
Q ss_pred HhcCCCCHHHHHHHHHHHHHH
Q psy18020 395 LRRGVIDLEVYLKQVRSLSRK 415 (434)
Q Consensus 395 L~~g~I~ld~flK~vR~LaRe 415 (434)
|+...-.+..|+|.|..||++
T Consensus 274 ~~~ei~p~~~~v~~vndla~q 294 (966)
T KOG4286|consen 274 LRGEIAPLKENVSHVNDLARQ 294 (966)
T ss_pred HHhhcchHhhchhhHHHHHHH
Confidence 333333566677777766663
No 470
>PRK14154 heat shock protein GrpE; Provisional
Probab=56.22 E-value=1.9e+02 Score=27.64 Aligned_cols=24 Identities=4% Similarity=-0.105 Sum_probs=11.2
Q ss_pred HHHHHHHHHhcC---CCCHHHHHHHHH
Q psy18020 387 TIYYMGEALRRG---VIDLEVYLKQVR 410 (434)
Q Consensus 387 tiy~L~~aL~~g---~I~ld~flK~vR 410 (434)
.++-|+|||... .-+++.|++-|+
T Consensus 111 VlDnLeRAL~~~~~~~~~~~~l~eGve 137 (208)
T PRK14154 111 VADSLIHGLESPASEDPQVKSMRDGMS 137 (208)
T ss_pred HHhHHHHHHhcccccchhHHHHHHHHH
Confidence 444455555421 112455555554
No 471
>PF05615 THOC7: Tho complex subunit 7; InterPro: IPR008501 The Tho complex (THOC) is involved in transcription elongation and mRNA export from the nucleus []. This entry represents the subunit THOC7, which is found in higher eukaryotes, and the non-homologous subunit Mft1p found in yeast. The funtions of these subunits are unknown, and it is not known if these subunits are functionally equivalent.
Probab=56.06 E-value=1.4e+02 Score=26.10 Aligned_cols=8 Identities=38% Similarity=0.484 Sum_probs=2.9
Q ss_pred HHHHHHHH
Q psy18020 332 ERSISFLK 339 (434)
Q Consensus 332 e~~i~~L~ 339 (434)
+.+|..|+
T Consensus 94 k~~ie~lk 101 (139)
T PF05615_consen 94 KKEIEELK 101 (139)
T ss_pred HHHHHHHH
Confidence 33333333
No 472
>cd07621 BAR_SNX5_6 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins 5 and 6. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. Members of this subfamily include SNX5, SNX6, the mammalian SNX32, and similar proteins. SNX5 and SNX6 may be components of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi, acting as a mammalian equivalent of yeast Vsp17p. The function of SNX32 is still unknown. BAR domain
Probab=55.97 E-value=2e+02 Score=27.74 Aligned_cols=41 Identities=17% Similarity=0.257 Sum_probs=27.1
Q ss_pred HHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 380 EEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALL 423 (434)
Q Consensus 380 ed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF~~Ra~~ 423 (434)
+|.-++|+|.++-|.+..=+ +.|.+++|.|..=|=-.+.|-
T Consensus 104 d~l~L~e~L~~Y~r~~~A~K---~~l~rR~ral~~~q~A~k~L~ 144 (219)
T cd07621 104 EDLKLSDTLRYYMRDTQAAK---DLLYRRLRCLANYENANKNLE 144 (219)
T ss_pred hHhHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 45567788888777776544 677788887776555444443
No 473
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=55.96 E-value=48 Score=23.68 Aligned_cols=34 Identities=24% Similarity=0.324 Sum_probs=16.5
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 307 ENELNQGKFKLARMFERIDKEKAELERSISFLKE 340 (434)
Q Consensus 307 q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~ 340 (434)
-..|....+.|+..-..|..+.+.|...+..|..
T Consensus 7 y~~LK~~yd~Lk~~~~~L~~E~~~L~aev~~L~~ 40 (45)
T PF02183_consen 7 YDALKASYDSLKAEYDSLKKENEKLRAEVQELKE 40 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444555555555555555554444443
No 474
>KOG4807|consensus
Probab=55.92 E-value=68 Score=33.40 Aligned_cols=23 Identities=17% Similarity=0.053 Sum_probs=12.0
Q ss_pred CCCHHHHHHHHHHHhCCCCCccc
Q psy18020 62 NSKLEGLIKTLASAFSALPPLYS 84 (434)
Q Consensus 62 ~s~L~~li~~l~~~F~~~pPl~~ 84 (434)
.+.|--+.........+-+|+.+
T Consensus 44 ~~~~~~~~d~~A~~~~~L~~~~~ 66 (593)
T KOG4807|consen 44 DSSLKYYRDSTAEEADELDGEID 66 (593)
T ss_pred HHHHHHHHHHHHHhcccCCcccc
Confidence 44555555555555555555543
No 475
>PF05064 Nsp1_C: Nsp1-like C-terminal region; InterPro: IPR007758 The NSP1-like protein appears to be an essential component of the nuclear pore complex, for example preribosome nuclear export requires the Nup82p-Nup159p-Nsp1p complex. The C-terminal of Nsp1 is involved in binding Nup82 [], probably via coiled-coil formation [, ]. The family is related to the rotavirus nonstructural protein NSP1 which is the least conserved protein in the rotavirus genome. Its function in the replication process is not fully understood.; GO: 0017056 structural constituent of nuclear pore, 0005643 nuclear pore; PDB: 3T97_C.
Probab=55.87 E-value=12 Score=32.27 Aligned_cols=55 Identities=18% Similarity=0.214 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 295 q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
.....+..|...-..+...+.+|++.+..++..+.+|+..+..|+++.+.+....
T Consensus 54 ~n~~~I~~L~~~v~~~~~~Q~~ld~~L~~ie~qQ~eLe~~L~~lE~~~~~l~~~~ 108 (116)
T PF05064_consen 54 ENGEKISKLYSEVQKAESEQKRLDQELDFIEAQQKELEELLDPLEKQVEKLLSNQ 108 (116)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCCCCCTT-------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455566666666677777788888888888888888888888887777765443
No 476
>KOG0416|consensus
Probab=55.82 E-value=9.3 Score=35.10 Aligned_cols=61 Identities=28% Similarity=0.519 Sum_probs=38.9
Q ss_pred ecCcccccceEEEecCCCCCCCCEe-EEecCCCceeccCCCcCC-CCceeccccc-cccCCCCCHHHHHHHHH
Q psy18020 4 LSGLVYHIPVCIWLMDTHPDHAPIC-YVRPTADMRINVSQNVDH-TGKIYSPYLH-TWTPQNSKLEGLIKTLA 73 (434)
Q Consensus 4 ~~g~~y~iPi~iwl~~~yP~~~P~~-~v~pt~~m~i~~~~~vd~-~G~~~~pyL~-~W~~~~s~L~~li~~l~ 73 (434)
|.|..+. |.+-||++||+..|-| ||+ +| =+++||. +|.|+|-.+. .|.+ -.+|+..+..+.
T Consensus 44 YegGvWk--v~V~lPd~YP~KSPSIGFvn-----KI-fHPNIDe~SGsVCLDViNQtWSp-~yDL~NIfetfL 107 (189)
T KOG0416|consen 44 YEGGVWK--VRVELPDNYPFKSPSIGFVN-----KI-FHPNIDEASGSVCLDVINQTWSP-LYDLVNIFETFL 107 (189)
T ss_pred ccCceEE--EEEECCCCCCCCCCccccee-----ec-cCCCchhccCccHHHHHhhhhhH-HHHHHHHHHHHh
Confidence 3444444 4567999999998865 442 11 1366775 8999987775 4654 556666655554
No 477
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=55.72 E-value=57 Score=26.23 Aligned_cols=19 Identities=21% Similarity=0.359 Sum_probs=16.1
Q ss_pred hcCCCCHHHHHHHHHHHHH
Q psy18020 396 RRGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 396 ~~g~I~ld~flK~vR~LaR 414 (434)
+.|+|+++.|++.+..+++
T Consensus 64 ~dG~I~f~eF~~~~~~~~~ 82 (88)
T cd05030 64 QDGQLSFEEFLVLVIKVGV 82 (88)
T ss_pred CCCcCcHHHHHHHHHHHHH
Confidence 4689999999998887764
No 478
>TIGR01844 type_I_sec_TolC type I secretion outer membrane protein, TolC family. Members of this model are outer membrane proteins from the TolC subfamily within the RND (Resistance-Nodulation-cell Division) efflux systems. These proteins, unlike the NodT subfamily, appear not to be lipoproteins. All are believed to participate in type I protein secretion, an ABC transporter system for protein secretion without cleavage of a signal sequence, although they may, like TolC, participate also in the efflux of smaller molecules as well. This family includes the well-documented examples TolC (E. coli), PrtF (Erwinia), and AprF (Pseudomonas aeruginosa).
Probab=55.71 E-value=2.1e+02 Score=28.88 Aligned_cols=45 Identities=4% Similarity=-0.029 Sum_probs=33.4
Q ss_pred CCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcC
Q psy18020 354 EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRG 398 (434)
Q Consensus 354 ~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g 398 (434)
..+..+..|+..+..-+.+-..++..-+..+..+...|..++...
T Consensus 151 ~~G~~s~~dl~~a~~~~~~~~~~~~~a~~~~~~a~~~L~~llg~~ 195 (415)
T TIGR01844 151 DVGLGTRTDVLQAEARYASARAQLIQAQNNLDDAKAQLRRLVGQP 195 (415)
T ss_pred hCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 356667777777777777777777777888888888888887753
No 479
>PF05130 FlgN: FlgN protein; InterPro: IPR007809 Flagella synthesis protein FlgN is an export chaperone involved in flagellar synthesis []. This entry represents a FlgN-like domain, consisting of a 4 long helices bundle, where the last helix is shorter than the three others.; GO: 0009296 flagellum assembly, 0019861 flagellum; PDB: 2FUP_A 3OPC_A.
Probab=55.70 E-value=1.3e+02 Score=25.49 Aligned_cols=30 Identities=23% Similarity=0.320 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 323 RIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 323 eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.|+....+.+..+..+.+-..+...++..+
T Consensus 38 ~l~~~~~~k~~l~~~l~~le~~r~~~~~~~ 67 (143)
T PF05130_consen 38 ELEELVEEKQELLEELRELEKQRQQLLAKL 67 (143)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333333333344444444444555555544
No 480
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=55.60 E-value=75 Score=25.09 Aligned_cols=18 Identities=22% Similarity=0.469 Sum_probs=15.7
Q ss_pred cCCCCHHHHHHHHHHHHH
Q psy18020 397 RGVIDLEVYLKQVRSLSR 414 (434)
Q Consensus 397 ~g~I~ld~flK~vR~LaR 414 (434)
.|.|+++.|+..+..++.
T Consensus 65 ~g~I~f~eF~~~~~~~~~ 82 (88)
T cd00213 65 DGKVDFQEFLVLIGKLAV 82 (88)
T ss_pred CCcCcHHHHHHHHHHHHH
Confidence 589999999999988865
No 481
>PRK14163 heat shock protein GrpE; Provisional
Probab=55.55 E-value=2e+02 Score=27.65 Aligned_cols=6 Identities=33% Similarity=0.512 Sum_probs=2.3
Q ss_pred HHHHHh
Q psy18020 391 MGEALR 396 (434)
Q Consensus 391 L~~aL~ 396 (434)
|.|||.
T Consensus 103 LerAl~ 108 (214)
T PRK14163 103 VGRARE 108 (214)
T ss_pred HHHHHh
Confidence 333333
No 482
>PF08542 Rep_fac_C: Replication factor C C-terminal domain; InterPro: IPR013748 Replication factor C (RFC) is a multimeric AAA+ protein complex that loads the DNA polymerase processivity clamp PCNA (Proliferating Cell Nuclear Antigen) onto DNA using ATP to drive the reaction []. PCNA functions at multiple levels in directing DNA metabolic pathways []. When bound to DNA, PCNA organises various proteins involved in DNA replication, DNA repair, DNA modification, and chromatin modelling. Replication factor C consists of five subunits in a spiral arrangement: Rfc1, Rfc2, Rfc3, Rfc4, and Rfc5 subunits. Rfc1 and Rfc2 load the PCNA sliding clamp onto DNA, while Rfc3 binds ATP and also acts as a checkpoint sensor. The RFC complex contains four ATP sites (sites A, B, C, and D) located at subunit interfaces. In each ATP site, an arginine residue from one subunit is located near the gamma-phosphate of ATP bound in the adjacent subunit. These arginine residues act as "arginine fingers" that can potentially perform two functions: sensing that ATP is bound and catalyzing ATP hydrolysis []. This entry represents the core domain found in Rfc1-5.; GO: 0003689 DNA clamp loader activity, 0005524 ATP binding, 0006260 DNA replication, 0005663 DNA replication factor C complex; PDB: 1SXJ_B 2CHG_B 2CHV_F 2CHQ_C 1IQP_A.
Probab=55.41 E-value=68 Score=25.42 Aligned_cols=65 Identities=18% Similarity=0.227 Sum_probs=0.0
Q ss_pred HHHHHHHHHhcCCCCCCccccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHH
Q psy18020 343 TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQF 417 (434)
Q Consensus 343 ~eLe~~I~~l~~k~~~diDe~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~LaReQF 417 (434)
+.+++.++.+.++ ++++ +.+.+++++.+-.+..|.|..|.+.+.+..++-..-.+-++.||.-.+
T Consensus 6 ~~i~~i~~~~~~~---~~~~-------~~~~~~~l~~~G~s~~~Il~~l~~~l~~~~~~~~~k~~i~~~la~~e~ 70 (89)
T PF08542_consen 6 EVIEEILESCLNG---DFKE-------ARKKLYELLVEGYSASDILKQLHEVLVESDIPDSQKAEILKILAEIEY 70 (89)
T ss_dssp HHHHHHHHHHHHT---CHHH-------HHHHHHHHHHTT--HHHHHHHHHHHHHTSTSSHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhC---CHHH-------HHHHHHHHHHcCCCHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHhH
No 483
>PF12889 DUF3829: Protein of unknown function (DUF3829); InterPro: IPR024291 This is a small family of proteins from several bacterial species, whose function is not known. It may, however, be related to the GvpL_GvpF family of proteins (PF06386 from PFAM).; PDB: 3RH3_B 3IEE_A.
Probab=55.40 E-value=1.6e+02 Score=28.35 Aligned_cols=117 Identities=14% Similarity=0.135 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccc
Q psy18020 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFE--RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364 (434)
Q Consensus 287 ~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~--eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i 364 (434)
.+.++...++...++.+......+...-.++..+.. +|+..++..+...--+.+-+...++++..+.+.+..+.|.
T Consensus 110 ak~kelh~~l~~~~~~~~~a~~~~~~~l~~~~~~~~~~~l~~lk~~g~~~~y~~~~~~~~a~~i~~~l~~~~~~~~d~-- 187 (276)
T PF12889_consen 110 AKAKELHAKLLAAYEEFKAAYDAFDAALEKINDERRKEQLEALKKEGKMIIYYMLKLMIDAEQIIDELQSDDNQKFDM-- 187 (276)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------TTTTTHHHHHHHHHHHHHHHHHHHHHT-TTS--HHH--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCH--
Q ss_pred cccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCC---------HHHHHHHHHHHHHH
Q psy18020 365 TTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVID---------LEVYLKQVRSLSRK 415 (434)
Q Consensus 365 ~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~---------ld~flK~vR~LaRe 415 (434)
.++-+.=..+++.+.-+.++...+.-+ ++.|++.++.+-++
T Consensus 188 ----------~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 237 (276)
T PF12889_consen 188 ----------AALDAALAELEKLLKDLKKLTKENYDDEMLSSFMSSADSFKSSAKSFIRR 237 (276)
T ss_dssp ----------HHHH--HHHHHHHHHHHHTS---TT-H----HHHHHHHHHHHHHHHHHHH
T ss_pred ----------HHHHHHHHHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHHHHHHHHHH
No 484
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=55.35 E-value=3.4e+02 Score=32.67 Aligned_cols=129 Identities=19% Similarity=0.245 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCCCc
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE-KTEVDV 360 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~-k~~~di 360 (434)
.+.++.+-.+...+....-..+..++.+|......++....+|+.....++..+..+++....+...+..+.. ....+.
T Consensus 811 ~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~le~~~~~~~~~~~~~~~~l~~l~~~~~~l~~~~~~~~~ 890 (1201)
T PF12128_consen 811 VDELREEKPELEEQLRDLEQELQELEQELNQLQKEVKQRRKELEEELKALEEQLEQLEEQLRRLRDLLEKLAELSEPPNA 890 (1201)
T ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCc
Q ss_pred cccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHh------cCCCCHHHHHHHHHH
Q psy18020 361 DEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR------RGVIDLEVYLKQVRS 411 (434)
Q Consensus 361 De~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~------~g~I~ld~flK~vR~ 411 (434)
+++-..-+.+...+-++.-.-.-+.+.+.-.-+.|+ .|.--.+.|.+. |.
T Consensus 891 ~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~~f~~~l~~~~~~~~~~~~~~~-~~ 946 (1201)
T PF12128_consen 891 EDAEGSVDERLRDLEDLLQRRKRLREELKKAVERFKGVLTKHSGSELAENWEEL-RS 946 (1201)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchHHHHHHH-HH
No 485
>PRK14070 exodeoxyribonuclease VII small subunit; Provisional
Probab=55.32 E-value=48 Score=25.98 Aligned_cols=62 Identities=10% Similarity=0.215 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc
Q psy18020 300 IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVD 361 (434)
Q Consensus 300 le~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diD 361 (434)
|+.|...-..|+.+.-.|++.+..-+++..-++..-..|++...+++..+..+.+..+.+.+
T Consensus 1 m~~LEeIV~~LE~gel~Leesl~lyeeG~~L~k~C~~~L~~aE~kI~~l~~e~~~~~~~~~~ 62 (69)
T PRK14070 1 MKELEEIVNRLENEDLPLEESIKLFERGVELYRKCKEILQQNRLKIIDVMKELEGEIDASGR 62 (69)
T ss_pred CHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc
No 486
>PF02563 Poly_export: Polysaccharide biosynthesis/export protein; InterPro: IPR003715 The extracellular polysaccharide colanic acid (CA) is produced by species of the family Enterobacteriaceae. In Escherichia coli (strain K12) the CA cluster comprises 19 genes. The wzx gene encodes a protein with multiple transmembrane segments that may function in export of the CA repeat unit from the cytoplasm into the periplasm in a process analogous to O-unit export. The CA gene clusters may be involved in the export of polysaccharide from the cell [].; GO: 0015159 polysaccharide transmembrane transporter activity, 0015774 polysaccharide transport, 0016020 membrane; PDB: 2W8I_E 2W8H_E 2J58_D.
Probab=55.30 E-value=12 Score=29.93 Aligned_cols=35 Identities=20% Similarity=0.235 Sum_probs=0.0
Q ss_pred cCCCCceeccccccccCCCCCHHHHHHHHHHHhCC
Q psy18020 44 VDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSA 78 (434)
Q Consensus 44 vd~~G~~~~pyL~~W~~~~s~L~~li~~l~~~F~~ 78 (434)
||.+|.|++|++-.-.-.+-++.++-+.+...|.+
T Consensus 35 V~~dG~I~lP~iG~v~v~G~T~~e~~~~I~~~l~~ 69 (82)
T PF02563_consen 35 VDPDGTISLPLIGPVKVAGLTLEEAEEEIKQRLQK 69 (82)
T ss_dssp --TTSEEEETTTEEEE-TT--HHHHHHHHHHHHTT
T ss_pred ECCCCcEeecccceEEECCCCHHHHHHHHHHHHHH
No 487
>PF05929 Phage_GPO: Phage capsid scaffolding protein (GPO) serine peptidase; InterPro: IPR009228 This entry is represented by Bacteriophage P2, GpO. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacteriophage capsid scaffolding protein (GpO) and some related bacterial sequences. GpO is thought to function in both the assembly of proheads and the cleavage of GpN [].; GO: 0019069 viral capsid assembly
Probab=55.10 E-value=91 Score=31.07 Aligned_cols=72 Identities=22% Similarity=0.287 Sum_probs=0.0
Q ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 279 SAVEDKIRRKM-------NEQKVQAQEEIEILKQTENELNQGKF-KLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 279 sal~dkL~~~L-------~e~~~q~~~ele~L~~~q~EL~~~~~-~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
..+.++++..+ .+++..+...++.+...+.++....+ ++......+...+..++.....+.++.++|.+.|+
T Consensus 173 ~~l~~~~~~~~~~~~~~~~~~~~~~~~ave~ia~~~~~~~~~~~~~ls~~~~~~~~~~~~l~~~~~~~~~~f~~L~~~L~ 252 (276)
T PF05929_consen 173 KSLFSKVKALFKKKEASDDEQFADLQQAVEAIAEQQQELEEAFEEQLSEQETQVAELKQELKEQHEALTEDFAALKEKLS 252 (276)
T ss_pred hhhhHHhhhhhcCCcCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHhh
No 488
>KOG0946|consensus
Probab=55.06 E-value=2.2e+02 Score=32.58 Aligned_cols=121 Identities=20% Similarity=0.241 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCCCc
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE-KTEVDV 360 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~-k~~~di 360 (434)
.+.+..+++..+.....+++.+++...+|....+.|+.++.....+...++++...|+.+...+..-...+.. .+....
T Consensus 655 l~~~~~kyK~lI~~lD~~~e~lkQ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~qLg~~~~~~~~~~q~~e~~~t 734 (970)
T KOG0946|consen 655 LDDIQQKYKGLIRELDYQIENLKQMEKELQVENEELEEEVQDFISEHSQLKDQLDLLKNQLGIISSKQRDLLQGAEASKT 734 (970)
T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccchhhHHhHHHhccC
Q ss_pred cccccccchHHHHHHHhhHHHHhhhhHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy18020 361 DEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRS 411 (434)
Q Consensus 361 De~i~~~~~l~~Ql~~l~Aed~AieDtiy~L~~aL~~g~I~ld~flK~vR~ 411 (434)
.+ +-+=.+..|..-++---.+|.+-|+++.=+.+.|=+..|.
T Consensus 735 ~~---------eel~a~~~e~k~l~~~q~~l~~~L~k~~~~~es~k~~~~~ 776 (970)
T KOG0946|consen 735 QN---------EELNAALSENKKLENDQELLTKELNKKNADIESFKATQRS 776 (970)
T ss_pred Ch---------HHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhh
No 489
>PF05565 Sipho_Gp157: Siphovirus Gp157; InterPro: IPR008840 This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophage. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage [].
Probab=55.03 E-value=78 Score=28.76 Aligned_cols=58 Identities=16% Similarity=0.360 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy18020 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354 (434)
Q Consensus 297 ~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~ 354 (434)
+.-.++|..++.++..--+.+-..+..++...+.++..+.+|.+....+++.+..+++
T Consensus 25 e~~~dtLe~i~~~~~~K~~~~~~~Ik~~ea~~e~~k~E~krL~~rkk~~e~~~~~Lk~ 82 (162)
T PF05565_consen 25 EAIADTLESIEDEIEEKADNIAKVIKNLEADIEAIKAEIKRLQERKKSIENRIDRLKE 82 (162)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 490
>KOG1666|consensus
Probab=54.94 E-value=1.3e+02 Score=28.85 Aligned_cols=70 Identities=26% Similarity=0.363 Sum_probs=0.0
Q ss_pred CchHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 269 TDEHIKMSLKSAVE-----DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338 (434)
Q Consensus 269 ~~e~~r~sl~sal~-----dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L 338 (434)
+.+-++++-+.+++ ..+-..|..++++++...+.|+.+.+.|.+....|+.|..++..++-.|--.|..+
T Consensus 133 st~rl~ds~Ria~ETEqIG~~IL~dL~~QRe~L~rar~rL~~td~~lgkS~kiL~tM~RR~~~nk~~~~aii~~l 207 (220)
T KOG1666|consen 133 STDRLKDSQRIALETEQIGSEILEDLHGQREQLERARERLRETDANLGKSRKILTTMTRRLIRNKFTLTAIIALL 207 (220)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 491
>KOG4677|consensus
Probab=54.93 E-value=2e+02 Score=30.58 Aligned_cols=131 Identities=11% Similarity=0.010 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVD 361 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diD 361 (434)
.+++.......+.++.+.=.+.+.+++.+.......-..++........|+...+.+..+.+-.+..+....++..-.|-
T Consensus 325 ~~~~q~q~~~Lrs~~~d~EAq~r~l~s~~~~q~~~~h~~ka~~~~~~~~l~~~~ec~~~e~e~~~~~~~r~~~~~qski~ 404 (554)
T KOG4677|consen 325 SAHIQDQYTLLRSQIIDIEAQDRHLESAGQTQIFRKHPRKASILNMPLVLTLFYECFYHETEAEGTFSSRVNLKKQSKIP 404 (554)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHhhhHhhhhhhchHHHHHHHHHHHHHHHHhhhhhhhccchhhccCc
Q ss_pred ccccccchHHHHHHHhhHHH--HhhhhHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy18020 362 EAVTTTAPIYKQILNTFTEE--AATEDTIYYMGEALRRGVIDLEVYLKQVRSL 412 (434)
Q Consensus 362 e~i~~~~~l~~Ql~~l~Aed--~AieDtiy~L~~aL~~g~I~ld~flK~vR~L 412 (434)
|-=..-+-+-+||-.-...| .+++|+..-|...|.+..|.|+..++.--.|
T Consensus 405 dk~~el~kl~~~l~~r~~~~s~~~l~~~~~qLt~tl~qkq~~le~v~~~~~~l 457 (554)
T KOG4677|consen 405 DKQYELTKLAARLKLRAWNDSVDALFTTKNQLTYTLKQKQIGLERVVEILHKL 457 (554)
T ss_pred chHHHHHHHHHHHHHHhhhhhHHHHhchhHHHHHHHHHHHHHHHHHHHHHhhh
No 492
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=54.80 E-value=1.2e+02 Score=24.76 Aligned_cols=59 Identities=17% Similarity=0.248 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~ 340 (434)
...+...|.+.++....++++-....++|....+.|+....++......|...-+.+++
T Consensus 3 s~~vT~~L~rt~~~m~~ev~~s~~t~~~L~~Ss~~L~~~~~e~~~~~~~l~~s~~ll~~ 61 (92)
T PF03908_consen 3 SSDVTESLRRTRQMMAQEVERSELTLQTLEESSATLRSTNDEYDGQSSLLKKSRKLLKK 61 (92)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 493
>PF15294 Leu_zip: Leucine zipper
Probab=54.77 E-value=1.8e+02 Score=29.10 Aligned_cols=96 Identities=19% Similarity=0.305 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhcCCC
Q psy18020 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI--SFLKEKETELDEILAVLTEKT 356 (434)
Q Consensus 279 sal~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i--~~L~~~~~eLe~~I~~l~~k~ 356 (434)
.++..+-..+|++....++.-+..+...-......+.+|+..+.+|+......+... ..=..+..+|+.-++.++
T Consensus 127 ~~ll~kEi~rLq~EN~kLk~rl~~le~~at~~l~Ek~kl~~~L~~lq~~~~~~~~k~~~~~~~q~l~dLE~k~a~lK--- 203 (278)
T PF15294_consen 127 SELLNKEIDRLQEENEKLKERLKSLEKQATSALDEKSKLEAQLKELQDEQGDQKGKKDLSFKAQDLSDLENKMAALK--- 203 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccchhhHHHHHHHHH---
Q ss_pred CCCccccccccchHHHHHHHhhHHHHhhhhHH
Q psy18020 357 EVDVDEAVTTTAPIYKQILNTFTEEAATEDTI 388 (434)
Q Consensus 357 ~~diDe~i~~~~~l~~Ql~~l~Aed~AieDti 388 (434)
+-+.+.+.+....-.++++++
T Consensus 204 -----------~e~ek~~~d~~~~~k~L~e~L 224 (278)
T PF15294_consen 204 -----------SELEKALQDKESQQKALEETL 224 (278)
T ss_pred -----------HHHHHHHHHHHHHHHHHHHHH
No 494
>PF05974 DUF892: Domain of unknown function (DUF892); InterPro: IPR010287 This domain is found in several hypothetical bacterial proteins of unknown function.; PDB: 4ERU_B 3OGH_A 2GS4_B 2GYQ_B 3HIU_A.
Probab=54.66 E-value=95 Score=28.02 Aligned_cols=96 Identities=20% Similarity=0.243 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccccchHH
Q psy18020 292 QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIY 371 (434)
Q Consensus 292 ~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l~~k~~~diDe~i~~~~~l~ 371 (434)
...-.-.++..+......+...-+++...... ...++.|++-+..-++..+.|++.++.+...-..--++++.+--.-.
T Consensus 2 l~~~~~~~L~d~y~aE~q~~~~l~~~~~~a~~-~~L~~~l~~h~~eT~~q~~rLe~~~~~lg~~p~~~~c~~~~gl~~e~ 80 (159)
T PF05974_consen 2 LRDLFIDELRDLYSAEKQLLKALPKLAEAASS-PELKAALEEHLEETEQQIERLEQIFEALGADPSAEKCDAMEGLVAEA 80 (159)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-SS-HHHHHHHHHHHHHHHHHHHHHHHHHHHTTS-S-CHH-HHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCccCcchHHHHHHHHH
Q ss_pred HHHHHhhHHHHhhhhHH
Q psy18020 372 KQILNTFTEEAATEDTI 388 (434)
Q Consensus 372 ~Ql~~l~Aed~AieDti 388 (434)
..+++..++|..+.|+.
T Consensus 81 ~~~~~~~~~d~~~~D~~ 97 (159)
T PF05974_consen 81 QELIEEFAEDPAVKDAA 97 (159)
T ss_dssp HHHHHT-S-SHHHHHHH
T ss_pred HHHHhcccCCchHhhHH
No 495
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=54.65 E-value=1e+02 Score=24.08 Aligned_cols=65 Identities=11% Similarity=0.113 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347 (434)
Q Consensus 283 dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~ 347 (434)
+.....|++.+.++...++..+.....|....+....-+...-.+..+|...+..|+.+.++...
T Consensus 4 ea~~~~Lr~rLd~~~rk~~~~~~~~k~L~~ERd~~~~~l~~a~~e~~~Lk~E~e~L~~el~~~r~ 68 (69)
T PF14197_consen 4 EAEIATLRNRLDSLTRKNSVHEIENKRLRRERDSAERQLGDAYEENNKLKEENEALRKELEELRA 68 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
No 496
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=54.58 E-value=1.4e+02 Score=25.54 Aligned_cols=73 Identities=19% Similarity=0.361 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHhHHHH-HHHHHHHHH----HHHHHHHHHHHHHHH
Q psy18020 277 LKSAVEDKIRRKM-----------NEQKVQAQEEIEILKQTENELNQGKFK-LARMFERID----KEKAELERSISFLKE 340 (434)
Q Consensus 277 l~sal~dkL~~~L-----------~e~~~q~~~ele~L~~~q~EL~~~~~~-L~~~~~eLe----~~k~~Le~~i~~L~~ 340 (434)
+.+.+.+++..-. .+.+.-.++.+++++..+.++....+. |+.++..++ .+.++++..|.+|++
T Consensus 18 ~~a~~~ek~~klvDelVkkGeln~eEak~~vddl~~q~k~~~~e~e~K~~r~i~~ml~~~~~~r~~~~~~l~~rvd~Ler 97 (108)
T COG3937 18 LAAETAEKVQKLVDELVKKGELNAEEAKRFVDDLLRQAKEAQGELEEKIPRKIEEMLSDLEVARQSEMDELTERVDALER 97 (108)
T ss_pred HHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHhhhHHHhhhHHHHHHHhhccccccchHHHHHHHHHHHHH
Q ss_pred HHHHHHHHH
Q psy18020 341 KETELDEIL 349 (434)
Q Consensus 341 ~~~eLe~~I 349 (434)
+...|+.-+
T Consensus 98 qv~~Lenk~ 106 (108)
T COG3937 98 QVADLENKL 106 (108)
T ss_pred HHHHHHHHh
No 497
>PF13815 Dzip-like_N: Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=54.22 E-value=1.4e+02 Score=25.54 Aligned_cols=61 Identities=20% Similarity=0.165 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 289 MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349 (434)
Q Consensus 289 L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I 349 (434)
+-..+.-.+-.+|=|...|+-|...-..++..++.+..+.++++..+..++++...+.+-+
T Consensus 57 ~~klfrLaQl~ieYLl~~q~~L~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~k~lk~E~ 117 (118)
T PF13815_consen 57 FLKLFRLAQLSIEYLLHCQEYLSSQLEQLEERLQELQQEIEKLKQKLKKQKEEIKKLKKES 117 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
No 498
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=54.09 E-value=1.1e+02 Score=26.10 Aligned_cols=63 Identities=16% Similarity=0.182 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy18020 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352 (434)
Q Consensus 290 ~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~~l 352 (434)
.+.+..+.+.-..+...-+.|...-+..+.-..+|..+++.|+.....|+.+.+.....+.++
T Consensus 1 ~ee~~~l~as~~el~n~La~Le~slE~~K~S~~eL~kqkd~L~~~l~~L~~q~~s~~qr~~eL 63 (107)
T PF09304_consen 1 KEEKEALEASQNELQNRLASLERSLEDEKTSQGELAKQKDQLRNALQSLQAQNASRNQRIAEL 63 (107)
T ss_dssp -------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcHHHHHHhhHHHHHHHHHHHHHHHHHHHhhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
No 499
>PF08702 Fib_alpha: Fibrinogen alpha/beta chain family; InterPro: IPR012290 Fibrinogen plays key roles in both blood clotting and platelet aggregation. During blood clot formation, the conversion of soluble fibrinogen to insoluble fibrin is triggered by thrombin, resulting in the polymerisation of fibrin, which forms a soft clot; this is then converted to a hard clot by factor XIIIA, which cross-links fibrin molecules. Platelet aggregation involves the binding of the platelet protein receptor integrin alpha(IIb)-beta(3) to the C-terminal D domain of fibrinogen []. In addition to platelet aggregation, platelet-fibrinogen interaction mediates both adhesion and fibrin clot retraction. Fibrinogen occurs as a dimer, where each monomer is composed of three non-identical chains, alpha, beta and gamma, linked together by several disulphide bonds []. The N-terminals of all six chains come together to form the centre of the molecule (E domain), from which the monomers extend in opposite directions as coiled coils, followed by C-terminal globular domains (D domains). Therefore, the domain composition is: D-coil-E-coil-D. At each end, the C-terminal of the alpha chain extends beyond the D domain as a protuberance that is important for cross-linking the molecule. During clot formation, the N-terminal fragments of the alpha and beta chains (within the E domain) in fibrinogen are cleaved by thrombin, releasing fibrinopeptides A and B, respectively, and producing fibrin. This cleavage results in the exposure of four binding sites on the E domain, each of which can bind to a D domain from different fibrin molecules. The binding of fibrin molecules produces a polymer consisting of a lattice network of fibrins that form a long, branching, flexible fibre [, ]. Fibrin fibres interact with platelets to increase the size of the clot, as well as with several different proteins and cells, thereby promoting the inflammatory response and concentrating the cells required for wound repair at the site of damage. This entry represents the coiled-coil domain and part of the N-terminal E domain found in all three fibrinogen polypeptides, namely the alpha, beta and gamma chains. More information about these proteins can be found at Protein of the Month: Fibrinogen [].; GO: 0005102 receptor binding, 0030674 protein binding, bridging, 0007165 signal transduction, 0030168 platelet activation, 0051258 protein polymerization, 0005577 fibrinogen complex; PDB: 1LWU_D 1N73_D 1M1J_B 1JY2_R 1JY3_R 1RF0_A 2H43_D 1RE4_D 2XNY_D 2HPC_D ....
Probab=53.82 E-value=1.7e+02 Score=26.29 Aligned_cols=73 Identities=14% Similarity=0.154 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC
Q psy18020 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI-SFLKEKETELDEILAVLTE 354 (434)
Q Consensus 282 ~dkL~~~L~e~~~q~~~ele~L~~~q~EL~~~~~~L~~~~~eLe~~k~~Le~~i-~~L~~~~~eLe~~I~~l~~ 354 (434)
.++|+..|.+.-+......+-+..++..+..++...+.--.-..+.++.|++.+ .+++.....-...|..|.+
T Consensus 38 i~~L~~~L~~~~n~t~~~~~~v~~i~~~~~~~q~~~~~n~~i~~~~s~~l~~~~~~~~e~~i~~~~~~I~~Lq~ 111 (146)
T PF08702_consen 38 IQELENLLDQISNSTSEAFEYVKNIKDSLRPRQKQAKPNDNIYNQYSKSLRKMIIYILETKIINQPSNIRVLQN 111 (146)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhccccccCCcccHHHHHHHHHHHHHHHHHHHHHhhhHhHHHHHHH
No 500
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=53.81 E-value=77 Score=35.21 Aligned_cols=76 Identities=26% Similarity=0.317 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---------------HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy18020 286 RRKMNEQKVQAQEEIEILKQTENE---------------LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350 (434)
Q Consensus 286 ~~~L~e~~~q~~~ele~L~~~q~E---------------L~~~~~~L~~~~~eLe~~k~~Le~~i~~L~~~~~eLe~~I~ 350 (434)
.++..+...+.++.++.|++...+ ++......+....+++++-.++++.++.|+++.+++++.++
T Consensus 45 ~~~~~~~~~~~~~~l~~L~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~l~~~~~~L~~~~~~l~~~~~ 124 (646)
T PRK05771 45 LRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIE 124 (646)
T ss_pred HhHHHHHHHHHHHHHHHHHHhccccccchhhhccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HhcCCCCCCcc
Q psy18020 351 VLTEKTEVDVD 361 (434)
Q Consensus 351 ~l~~k~~~diD 361 (434)
.++.=++.|+|
T Consensus 125 ~l~~~~~ld~~ 135 (646)
T PRK05771 125 RLEPWGNFDLD 135 (646)
T ss_pred HhhhhhcCCCC
Done!