Query psy2692
Match_columns 133
No_of_seqs 105 out of 168
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 22:46:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2692.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2692hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF06957 COPI_C: Coatomer (COP 100.0 8.8E-49 1.9E-53 335.9 11.6 126 7-132 124-261 (422)
2 KOG0292|consensus 100.0 2.1E-36 4.5E-41 274.5 11.0 124 8-132 913-1048(1202)
3 PF07719 TPR_2: Tetratricopept 88.1 0.94 2E-05 24.2 3.3 28 79-106 5-32 (34)
4 PF13432 TPR_16: Tetratricopep 86.7 0.6 1.3E-05 28.8 2.2 29 79-107 1-29 (65)
5 PF04212 MIT: MIT (microtubule 86.2 4.3 9.4E-05 26.0 6.2 53 76-130 6-59 (69)
6 PF00515 TPR_1: Tetratricopept 85.5 1.7 3.7E-05 23.5 3.5 28 79-106 5-32 (34)
7 PF13174 TPR_6: Tetratricopept 84.5 1.7 3.6E-05 22.9 3.1 29 78-106 3-31 (33)
8 PF13371 TPR_9: Tetratricopept 83.8 2.5 5.5E-05 26.3 4.2 41 78-123 32-72 (73)
9 KOG0624|consensus 81.5 6 0.00013 34.9 6.8 92 10-107 47-138 (504)
10 cd02684 MIT_2 MIT: domain cont 81.3 6.3 0.00014 26.3 5.5 59 71-130 2-60 (75)
11 TIGR02552 LcrH_SycD type III s 79.9 7.4 0.00016 26.7 5.8 29 78-106 88-116 (135)
12 cd02680 MIT_calpain7_2 MIT: do 79.5 14 0.00031 24.9 6.8 37 71-107 2-38 (75)
13 cd02678 MIT_VPS4 MIT: domain c 79.3 11 0.00023 24.9 6.1 57 73-130 4-60 (75)
14 cd02656 MIT MIT: domain contai 79.3 9.4 0.0002 24.8 5.8 55 75-130 6-60 (75)
15 PF13414 TPR_11: TPR repeat; P 76.1 3.9 8.5E-05 25.1 3.1 29 78-106 6-34 (69)
16 smart00745 MIT Microtubule Int 76.1 14 0.00031 23.9 6.0 56 74-130 7-62 (77)
17 smart00028 TPR Tetratricopepti 73.9 4.9 0.00011 19.2 2.6 26 79-104 5-30 (34)
18 cd02677 MIT_SNX15 MIT: domain 72.5 21 0.00046 23.8 6.2 57 73-130 4-60 (75)
19 PF13812 PPR_3: Pentatricopept 72.1 4.3 9.4E-05 21.5 2.3 20 9-28 9-28 (34)
20 PRK15331 chaperone protein Sic 71.6 7.6 0.00016 30.1 4.3 34 70-103 32-65 (165)
21 PRK10370 formate-dependent nit 70.9 9.5 0.00021 29.4 4.8 92 9-107 81-176 (198)
22 cd02681 MIT_calpain7_1 MIT: do 70.9 14 0.00031 24.9 5.0 53 77-130 8-61 (76)
23 PF13428 TPR_14: Tetratricopep 70.6 8.6 0.00019 22.2 3.5 28 79-106 5-32 (44)
24 PF14559 TPR_19: Tetratricopep 70.3 4 8.7E-05 25.0 2.1 23 85-107 1-23 (68)
25 KOG0543|consensus 68.3 10 0.00023 33.2 4.9 47 81-127 214-260 (397)
26 smart00668 CTLH C-terminal to 68.0 8.7 0.00019 23.1 3.3 38 9-46 9-53 (58)
27 cd03598 CLECT_EMBP_like C-type 67.9 2.9 6.4E-05 28.9 1.2 42 82-124 3-45 (117)
28 cd02682 MIT_AAA_Arch MIT: doma 67.6 18 0.00039 24.5 5.0 52 78-130 9-60 (75)
29 cd02683 MIT_1 MIT: domain cont 67.4 17 0.00036 24.4 4.8 54 76-130 7-60 (77)
30 PF04097 Nic96: Nup93/Nic96; 66.4 15 0.00032 33.5 5.6 82 11-106 437-535 (613)
31 PF14276 DUF4363: Domain of un 64.9 46 0.001 23.4 7.3 61 72-132 25-90 (121)
32 PRK09720 cybC cytochrome b562; 63.9 14 0.0003 26.6 4.0 28 71-98 58-85 (100)
33 PF09976 TPR_21: Tetratricopep 62.8 25 0.00053 25.1 5.3 90 9-102 56-145 (145)
34 PF07721 TPR_4: Tetratricopept 61.3 9.8 0.00021 19.9 2.3 19 8-26 8-26 (26)
35 PF13525 YfiO: Outer membrane 61.2 14 0.0003 28.3 4.0 31 78-108 8-38 (203)
36 PLN03098 LPA1 LOW PSII ACCUMUL 61.0 18 0.00038 32.4 5.0 37 71-107 71-107 (453)
37 cd00189 TPR Tetratricopeptide 60.9 12 0.00025 21.9 2.8 28 79-106 4-31 (100)
38 PLN03088 SGT1, suppressor of 60.1 33 0.00071 28.8 6.3 109 8-130 9-117 (356)
39 KOG4648|consensus 57.3 15 0.00032 32.6 3.8 51 81-131 103-166 (536)
40 PF13176 TPR_7: Tetratricopept 56.9 19 0.00041 20.0 3.1 22 81-102 5-26 (36)
41 PRK15363 pathogenicity island 56.5 26 0.00057 26.9 4.7 40 69-108 28-68 (157)
42 PF13181 TPR_8: Tetratricopept 56.1 25 0.00055 18.5 3.5 27 80-106 6-32 (34)
43 TIGR02795 tol_pal_ybgF tol-pal 55.5 24 0.00051 23.1 3.9 21 9-29 10-30 (119)
44 PF13374 TPR_10: Tetratricopep 55.4 22 0.00048 19.2 3.2 25 80-104 7-31 (42)
45 PF03986 Autophagy_N: Autophag 55.3 3.9 8.5E-05 31.0 0.0 13 5-17 27-39 (145)
46 PF13432 TPR_16: Tetratricopep 55.0 32 0.00068 20.8 4.2 58 9-106 5-62 (65)
47 TIGR03504 FimV_Cterm FimV C-te 54.3 14 0.0003 22.4 2.4 25 8-32 6-30 (44)
48 COG1729 Uncharacterized protei 53.7 15 0.00032 30.6 3.1 31 78-108 144-174 (262)
49 PRK15058 cytochrome b562; Prov 53.4 24 0.00051 26.4 3.9 88 12-101 20-116 (128)
50 PF07361 Cytochrom_B562: Cytoc 53.3 18 0.00039 25.6 3.1 31 71-101 61-91 (103)
51 TIGR00756 PPR pentatricopeptid 53.3 16 0.00036 18.7 2.4 19 10-28 9-27 (35)
52 KOG0553|consensus 53.1 28 0.00061 29.6 4.7 89 9-108 89-182 (304)
53 PF05538 Campylo_MOMP: Campylo 52.8 4.6 0.0001 35.7 0.0 37 1-48 1-37 (431)
54 PF13424 TPR_12: Tetratricopep 52.4 55 0.0012 20.4 5.2 26 79-104 9-34 (78)
55 PF10579 Rapsyn_N: Rapsyn N-te 51.8 28 0.00062 24.0 3.8 32 75-106 6-37 (80)
56 KOG1840|consensus 51.8 34 0.00073 30.9 5.3 110 8-117 206-325 (508)
57 PF09577 Spore_YpjB: Sporulati 50.9 81 0.0018 25.7 7.0 40 72-111 2-41 (232)
58 PF12895 Apc3: Anaphase-promot 50.1 13 0.00029 23.9 1.9 78 14-100 2-83 (84)
59 cd03600 CLECT_thrombomodulin_l 50.0 15 0.00033 26.2 2.4 42 82-125 6-48 (141)
60 PF13041 PPR_2: PPR repeat fam 49.4 16 0.00035 21.4 2.1 20 85-104 13-32 (50)
61 TIGR03302 OM_YfiO outer membra 48.1 28 0.0006 26.3 3.7 32 75-106 33-64 (235)
62 TIGR02795 tol_pal_ybgF tol-pal 48.1 37 0.0008 22.1 3.9 25 80-104 7-31 (119)
63 PF09976 TPR_21: Tetratricopep 47.0 30 0.00066 24.7 3.6 30 76-105 49-78 (145)
64 PF12854 PPR_1: PPR repeat 47.0 23 0.00049 19.7 2.3 17 10-26 16-32 (34)
65 PRK15359 type III secretion sy 46.0 95 0.0021 22.3 6.1 30 78-107 95-124 (144)
66 PF00059 Lectin_C: Lectin C-ty 44.7 43 0.00093 21.3 3.8 33 90-124 4-36 (105)
67 KOG2168|consensus 44.2 83 0.0018 30.3 6.8 93 9-109 643-740 (835)
68 PRK10866 outer membrane biogen 44.1 37 0.0008 27.1 4.0 32 77-108 34-65 (243)
69 PF10938 YfdX: YfdX protein; 43.5 45 0.00098 25.1 4.2 40 72-111 114-153 (155)
70 PF07720 TPR_3: Tetratricopept 43.2 24 0.00052 20.3 2.1 20 80-99 6-25 (36)
71 KOG3807|consensus 43.0 24 0.00052 31.3 2.9 37 74-112 276-312 (556)
72 TIGR02552 LcrH_SycD type III s 42.7 59 0.0013 22.1 4.4 32 75-106 17-48 (135)
73 PLN03088 SGT1, suppressor of 41.0 38 0.00083 28.4 3.8 29 78-106 5-33 (356)
74 PF01535 PPR: PPR repeat; Int 40.0 30 0.00066 17.5 2.1 19 85-103 10-28 (31)
75 PRK15359 type III secretion sy 39.7 35 0.00076 24.6 3.0 28 80-107 29-56 (144)
76 PF14561 TPR_20: Tetratricopep 39.6 1.2E+02 0.0026 20.6 5.5 34 73-106 20-53 (90)
77 cd05492 Bromo_ZMYND11 Bromodom 39.3 1.3E+02 0.0029 21.4 5.9 56 69-128 46-101 (109)
78 PF13512 TPR_18: Tetratricopep 38.7 50 0.0011 24.9 3.7 34 78-111 13-46 (142)
79 TIGR02917 PEP_TPR_lipo putativ 38.1 1.5E+02 0.0032 25.9 7.0 20 10-29 65-84 (899)
80 PF05347 Complex1_LYR: Complex 37.6 50 0.0011 20.0 3.1 16 115-130 42-57 (59)
81 KOG0545|consensus 36.5 32 0.00069 29.2 2.5 33 78-110 181-213 (329)
82 PF13512 TPR_18: Tetratricopep 35.0 67 0.0015 24.2 3.9 32 76-107 48-79 (142)
83 KOG4234|consensus 34.8 44 0.00096 27.6 3.1 31 80-110 100-130 (271)
84 KOG3677|consensus 34.8 49 0.0011 29.8 3.6 49 79-127 276-334 (525)
85 cd03594 CLECT_REG-1_like C-typ 33.7 37 0.0008 23.2 2.2 43 82-124 12-55 (129)
86 COG1484 DnaC DNA replication p 33.0 1.1E+02 0.0024 24.6 5.2 55 69-129 138-199 (254)
87 PF02064 MAS20: MAS20 protein 33.0 88 0.0019 22.9 4.2 31 76-106 64-94 (121)
88 PF09613 HrpB1_HrpK: Bacterial 32.6 88 0.0019 24.1 4.3 42 78-127 47-88 (160)
89 PRK10803 tol-pal system protei 31.3 79 0.0017 25.7 4.1 33 76-108 143-176 (263)
90 KOG2376|consensus 31.2 78 0.0017 29.6 4.3 50 78-127 178-227 (652)
91 PF12335 SBF2: Myotubularin pr 30.8 1E+02 0.0022 24.9 4.6 52 55-106 1-54 (225)
92 KOG4281|consensus 30.8 73 0.0016 26.1 3.7 45 73-117 5-49 (236)
93 KOG0553|consensus 30.8 69 0.0015 27.3 3.7 38 74-111 79-117 (304)
94 PF13525 YfiO: Outer membrane 30.5 85 0.0018 23.9 4.0 31 78-108 45-75 (203)
95 KOG0739|consensus 30.3 1.3E+02 0.0028 26.4 5.3 58 72-130 7-65 (439)
96 cd00037 CLECT C-type lectin (C 29.7 46 0.001 21.1 2.1 35 88-124 9-43 (116)
97 COG4989 Predicted oxidoreducta 29.4 97 0.0021 26.2 4.3 47 14-61 135-186 (298)
98 cd05508 Bromo_RACK7 Bromodomai 29.3 1.9E+02 0.0042 20.0 5.3 56 69-128 42-97 (99)
99 PF07304 SRA1: Steroid recepto 29.3 2.4E+02 0.0052 21.3 6.2 47 75-128 90-140 (157)
100 PF10255 Paf67: RNA polymerase 28.6 83 0.0018 27.6 4.0 50 78-127 167-226 (404)
101 PRK10049 pgaA outer membrane p 28.6 3.3E+02 0.0072 25.2 8.0 99 8-108 279-392 (765)
102 cd06131 DNA_pol_III_epsilon_Ec 28.4 48 0.001 24.0 2.1 28 2-29 53-80 (167)
103 TIGR00990 3a0801s09 mitochondr 28.3 58 0.0012 29.0 3.0 28 79-106 131-158 (615)
104 PRK11788 tetratricopeptide rep 27.6 2.1E+02 0.0046 23.0 6.0 20 10-29 116-135 (389)
105 PF14769 CLAMP: Flagellar C1a 27.5 1.6E+02 0.0034 20.3 4.6 21 87-107 34-54 (101)
106 PF09670 Cas_Cas02710: CRISPR- 27.4 1.8E+02 0.0039 24.9 5.7 45 77-129 133-177 (379)
107 PF10516 SHNi-TPR: SHNi-TPR; 27.3 71 0.0015 18.7 2.3 21 84-104 10-30 (38)
108 COG4063 MtrA Tetrahydromethano 27.1 77 0.0017 25.8 3.2 38 88-128 115-152 (238)
109 KOG0543|consensus 26.3 2.1E+02 0.0045 25.3 5.9 51 75-129 291-341 (397)
110 TIGR02878 spore_ypjB sporulati 26.2 79 0.0017 26.0 3.2 32 73-104 3-34 (233)
111 cd03602 CLECT_1 C-type lectin 26.2 1.3E+02 0.0028 20.0 3.9 36 87-124 8-43 (108)
112 PHA02841 hypothetical protein; 26.1 35 0.00077 23.9 1.0 33 21-55 6-39 (103)
113 PRK10866 outer membrane biogen 26.1 1.6E+02 0.0035 23.4 4.9 32 77-108 71-102 (243)
114 PRK11788 tetratricopeptide rep 26.1 3.5E+02 0.0075 21.7 7.1 21 10-30 44-64 (389)
115 CHL00033 ycf3 photosystem I as 25.7 98 0.0021 22.4 3.4 29 78-106 38-66 (168)
116 PRK09637 RNA polymerase sigma 25.6 84 0.0018 23.4 3.1 32 90-121 2-33 (181)
117 PRK10564 maltose regulon perip 24.7 65 0.0014 27.4 2.5 22 9-30 265-288 (303)
118 PRK09591 celC cellobiose phosp 24.3 80 0.0017 22.5 2.6 30 7-45 26-55 (104)
119 COG0329 DapA Dihydrodipicolina 24.2 3E+02 0.0066 22.7 6.4 81 14-111 167-250 (299)
120 PF04048 Sec8_exocyst: Sec8 ex 24.1 1.4E+02 0.0031 21.8 4.1 55 72-130 45-102 (142)
121 cd00215 PTS_IIA_lac PTS_IIA, P 23.9 83 0.0018 22.1 2.6 29 8-45 22-50 (97)
122 PF06580 His_kinase: Histidine 23.9 2.2E+02 0.0048 18.7 4.6 54 73-130 12-67 (82)
123 PRK10747 putative protoheme IX 23.4 2.4E+02 0.0053 23.7 5.8 89 8-107 270-360 (398)
124 PRK02603 photosystem I assembl 23.3 1.1E+02 0.0024 22.3 3.3 28 77-104 37-64 (172)
125 cd06130 DNA_pol_III_epsilon_li 22.9 1.1E+02 0.0023 21.6 3.1 28 2-29 49-76 (156)
126 cd03591 CLECT_collectin_like C 22.8 1.2E+02 0.0027 20.4 3.3 35 88-124 10-44 (114)
127 COG5379 BtaA S-adenosylmethion 22.7 1.5E+02 0.0032 25.9 4.3 43 3-50 186-228 (414)
128 TIGR00823 EIIA-LAC phosphotran 22.7 90 0.002 22.0 2.6 20 8-27 24-43 (99)
129 KOG0548|consensus 22.7 2E+02 0.0044 26.4 5.3 63 10-106 307-389 (539)
130 cd07957 Anticodon_Ia_Met Antic 22.5 2.5E+02 0.0054 18.8 6.0 57 74-130 44-107 (129)
131 smart00034 CLECT C-type lectin 22.5 1.4E+02 0.0031 19.3 3.5 41 83-125 13-54 (126)
132 KOG0547|consensus 22.3 78 0.0017 29.2 2.7 27 80-106 120-146 (606)
133 PF00140 Sigma70_r1_2: Sigma-7 22.3 1.7E+02 0.0036 16.7 4.1 19 92-110 1-19 (37)
134 TIGR02521 type_IV_pilW type IV 22.2 1.4E+02 0.0031 21.0 3.6 29 78-106 34-62 (234)
135 cd03603 CLECT_VCBS A bacterial 22.1 1.4E+02 0.003 20.6 3.4 36 87-124 8-43 (118)
136 cd06266 RNaseH_typeII Ribonucl 22.0 56 0.0012 25.1 1.5 29 5-33 132-160 (193)
137 PF11817 Foie-gras_1: Foie gra 21.9 85 0.0018 24.9 2.6 20 8-27 185-204 (247)
138 PF12793 SgrR_N: Sugar transpo 21.8 89 0.0019 22.4 2.5 18 8-25 77-94 (115)
139 PF02255 PTS_IIA: PTS system, 21.6 96 0.0021 21.6 2.5 20 8-27 21-40 (96)
140 COG1283 NptA Na+/phosphate sym 21.6 1.5E+02 0.0033 27.1 4.4 40 90-129 370-412 (533)
141 KOG0482|consensus 21.4 3.6E+02 0.0078 25.4 6.7 81 31-128 550-643 (721)
142 TIGR03302 OM_YfiO outer membra 21.2 2.3E+02 0.005 21.2 4.8 29 78-106 73-101 (235)
143 PRK11447 cellulose synthase su 21.1 2.7E+02 0.0058 27.1 6.2 25 82-106 276-300 (1157)
144 PF03705 CheR_N: CheR methyltr 21.0 1.1E+02 0.0024 18.2 2.5 41 16-57 4-44 (57)
145 cd03590 CLECT_DC-SIGN_like C-t 21.0 1.4E+02 0.0029 20.1 3.2 40 83-124 13-53 (126)
146 cd06144 REX4_like DEDDh 3'-5' 20.9 1.2E+02 0.0025 22.1 3.0 29 1-29 48-76 (152)
147 cd03592 CLECT_selectins_like C 20.9 1.4E+02 0.0031 20.0 3.3 35 88-124 9-43 (115)
148 PF10607 CLTH: CTLH/CRA C-term 20.8 3.1E+02 0.0067 19.3 7.5 19 9-27 9-27 (145)
149 PLN00416 carbonate dehydratase 20.8 1.6E+02 0.0034 24.3 4.0 73 13-101 2-74 (258)
150 KOG2076|consensus 20.6 1.4E+02 0.0031 29.0 4.1 40 72-111 135-175 (895)
151 KOG0980|consensus 20.5 91 0.002 30.4 2.8 40 20-59 208-262 (980)
152 PF05053 Menin: Menin; InterP 20.5 2.3E+02 0.0049 26.5 5.2 49 80-130 324-372 (618)
153 KOG1173|consensus 20.4 1.6E+02 0.0036 27.3 4.3 27 82-108 496-522 (611)
154 cd05513 Bromo_brd7_like Bromod 20.2 3.1E+02 0.0066 19.0 7.1 88 35-126 1-94 (98)
155 PF03564 DUF1759: Protein of u 20.1 71 0.0015 22.7 1.7 80 13-108 52-132 (145)
156 PF04184 ST7: ST7 protein; In 20.0 4.8E+02 0.01 24.0 7.1 27 82-108 266-292 (539)
No 1
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=100.00 E-value=8.8e-49 Score=335.88 Aligned_cols=126 Identities=43% Similarity=0.713 Sum_probs=110.8
Q ss_pred ChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCC------------CCChHH
Q psy2692 7 TSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKV------------DRSSID 74 (133)
Q Consensus 7 ~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~------------~~~l~~ 74 (133)
.++.|||||+||+||||||||+|||||+|||++|+++|.+||+|+|++|++|||..|++||| ++++++
T Consensus 124 laadhvAAGsFetAm~LLnrQiGivnF~PLk~~Fl~~y~~s~~~l~~~~~~p~l~~~~~r~~~~~~~~~~lP~i~~~l~~ 203 (422)
T PF06957_consen 124 LAADHVAAGSFETAMQLLNRQIGIVNFEPLKPLFLEVYQASRTYLPALPSLPPLPSYIRRNWDESNPKNGLPAIPLSLSS 203 (422)
T ss_dssp SHHHHHHCT-HHHHHHHHHHHC-B---GGGHHHHHHHHCCTEEEE-SSTTTS-EEEEEBCTTTTSSSCCG-BB----HHH
T ss_pred cHHHHHHhCCHHHHHHHHHHHhCccccHHHHHHHHHHHHhhceecccCCCCCCccccccCCccccccccCCCcCcCCHHH
Confidence 47889999999999999999999999999999999999999999999999999999999997 568999
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhhhc
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYIVG 132 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYila 132 (133)
|.++||.|||+||+|||+||++.||+|||+|||++|+|++|++|++|+|+||||||+|
T Consensus 204 L~~~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~E~~e~~eli~icrEYilg 261 (422)
T PF06957_consen 204 LEERLKEGYKLFTAGKFEEAIEIFRSILHSIPLLVVESREEEDEAKELIEICREYILG 261 (422)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHC--BSSCHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhheeeecCHHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999999999999999999999999999999999999999997
No 2
>KOG0292|consensus
Probab=100.00 E-value=2.1e-36 Score=274.53 Aligned_cols=124 Identities=48% Similarity=0.743 Sum_probs=118.5
Q ss_pred hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCC------------CCChHHH
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKV------------DRSSIDL 75 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~------------~~~l~~L 75 (133)
++.+||||+||+||++|++|+|++||.|+|.+|+++|.+||+|++++|++| +.+++-||| +.+++++
T Consensus 913 ~adhvaAGsf~tA~~lL~dqvgvv~f~p~Kt~fl~iy~~sR~~l~~~~~~~-~~~~~~R~~se~~~~~~~P~v~~~l~~l 991 (1202)
T KOG0292|consen 913 AADHVAAGSFETAMRLLHDQVGVVNFGPLKTHFLKIYAGSRTYLRATPCLP-VSLYPVRNWSETSSKQGLPAVGFKLSQL 991 (1202)
T ss_pred hhhhhhcCchHHHHHHHHhhhcceecccHHhhhhhhccccceeccCCCCcc-cccccccccccchhhccCCcccccHHHH
Confidence 678999999999999999999999999999999999999999999999999 666777775 6779999
Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhhhc
Q psy2692 76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYIVG 132 (133)
Q Consensus 76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYila 132 (133)
..+|++|||+||.|||.||+++||+||++||+++|+|++|++|++|+|.||||||+|
T Consensus 992 ~~kl~~gy~ltt~gKf~eAie~Frsii~~i~l~vvd~~~e~aea~~li~i~~eYi~g 1048 (1202)
T KOG0292|consen 992 NKKLQKGYKLTTEGKFGEAIEKFRSIIYSIPLLVVDSKEEEAEADELIKICREYIVG 1048 (1202)
T ss_pred HHHHHHHHhhhccCcHHHHHHHHHHHHhheeEEEecchhhHHHHHHHHHHHHHHHhh
Confidence 999999999999999999999999999999999999999999999999999999997
No 3
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=88.05 E-value=0.94 Score=24.21 Aligned_cols=28 Identities=21% Similarity=0.195 Sum_probs=22.9
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
...|.-++..|++++|+..|+..+..-|
T Consensus 5 ~~lg~~~~~~~~~~~A~~~~~~al~l~p 32 (34)
T PF07719_consen 5 YYLGQAYYQLGNYEEAIEYFEKALELDP 32 (34)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHCc
Confidence 3568889999999999999999987644
No 4
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=86.73 E-value=0.6 Score=28.80 Aligned_cols=29 Identities=21% Similarity=0.218 Sum_probs=25.1
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
+..|..++..|+|++|+..|+.++..-|-
T Consensus 1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~P~ 29 (65)
T PF13432_consen 1 YALARALYQQGDYDEAIAAFEQALKQDPD 29 (65)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHCCSTT
T ss_pred ChHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence 45788899999999999999999987764
No 5
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=86.24 E-value=4.3 Score=26.01 Aligned_cols=53 Identities=19% Similarity=0.338 Sum_probs=38.9
Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccC-hHHHHHHHHHHHHHHHhh
Q psy2692 76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT-KKEIEEAKQIINICKEYI 130 (133)
Q Consensus 76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~-~~e~~e~~eli~icrEYi 130 (133)
...++.|-+.=..|++++|++.+++-+..+.-..-.. .++..++ +-..+.+|+
T Consensus 6 ~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~l~~~~~~~~~~~~~~~--l~~k~~~yl 59 (69)
T PF04212_consen 6 IELIKKAVEADEAGNYEEALELYKEAIEYLMQALKSESNPERRQA--LRQKMKEYL 59 (69)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHHHHHHSTTHHHHHH--HHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHhccCCCHHHHHH--HHHHHHHHH
Confidence 3457888888889999999999999888776654443 3333333 778888886
No 6
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=85.54 E-value=1.7 Score=23.49 Aligned_cols=28 Identities=18% Similarity=0.189 Sum_probs=23.0
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
...|.-.+..|++++|+..|+..|..-|
T Consensus 5 ~~~g~~~~~~~~~~~A~~~~~~al~~~p 32 (34)
T PF00515_consen 5 YNLGNAYFQLGDYEEALEYYQRALELDP 32 (34)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHhCCchHHHHHHHHHHHHCc
Confidence 4678888999999999999999987644
No 7
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=84.51 E-value=1.7 Score=22.95 Aligned_cols=29 Identities=17% Similarity=0.236 Sum_probs=24.1
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.++.|.-....|++++|++.|+.++..-|
T Consensus 3 ~~~~a~~~~~~g~~~~A~~~~~~~~~~~P 31 (33)
T PF13174_consen 3 LYRLARCYYKLGDYDEAIEYFQRLIKRYP 31 (33)
T ss_dssp HHHHHHHHHHHCHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHHccCHHHHHHHHHHHHHHCc
Confidence 45667777778999999999999998765
No 8
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=83.77 E-value=2.5 Score=26.28 Aligned_cols=41 Identities=20% Similarity=0.353 Sum_probs=32.1
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHH
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQII 123 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli 123 (133)
....|.-++..|+|++|+..|..++..-| +..+...++..|
T Consensus 32 ~~~~a~~~~~~g~~~~A~~~l~~~l~~~p-----~~~~~~~~~a~l 72 (73)
T PF13371_consen 32 WLQRARCLFQLGRYEEALEDLERALELSP-----DDPDARALRAML 72 (73)
T ss_pred hHHHHHHHHHhccHHHHHHHHHHHHHHCC-----CcHHHHHHHHhc
Confidence 45678889999999999999999998877 455555555443
No 9
>KOG0624|consensus
Probab=81.54 E-value=6 Score=34.94 Aligned_cols=92 Identities=15% Similarity=0.132 Sum_probs=61.8
Q ss_pred HHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccC
Q psy2692 10 SINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSG 89 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~g 89 (133)
++.|.|+|..|+.--+.-| +-.| .....+|-++.+|+.-=-+-+.+. -+.|-...+.+..-.++|.|.=+.+.|
T Consensus 47 ~lla~~Q~sDALt~yHaAv---e~dp--~~Y~aifrRaT~yLAmGksk~al~-Dl~rVlelKpDF~~ARiQRg~vllK~G 120 (504)
T KOG0624|consen 47 ELLARGQLSDALTHYHAAV---EGDP--NNYQAIFRRATVYLAMGKSKAALQ-DLSRVLELKPDFMAARIQRGVVLLKQG 120 (504)
T ss_pred HHHHhhhHHHHHHHHHHHH---cCCc--hhHHHHHHHHHHHhhhcCCccchh-hHHHHHhcCccHHHHHHHhchhhhhcc
Confidence 6789999999988777654 4444 344555556666653222222211 001111345566677999999999999
Q ss_pred CHHHHHHHHHHHHHHccc
Q psy2692 90 KFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 90 KF~eAl~~Fr~iL~~i~l 107 (133)
++++|..-|+.+|..-|-
T Consensus 121 ele~A~~DF~~vl~~~~s 138 (504)
T KOG0624|consen 121 ELEQAEADFDQVLQHEPS 138 (504)
T ss_pred cHHHHHHHHHHHHhcCCC
Confidence 999999999999998874
No 10
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=81.29 E-value=6.3 Score=26.32 Aligned_cols=59 Identities=14% Similarity=0.107 Sum_probs=42.1
Q ss_pred ChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
+++.-.+.+++|-..=+.|++++|+..+.+-|......+-...++ +.-.-+-..+.||+
T Consensus 2 ~l~~Ai~lv~~Av~~D~~g~y~eA~~lY~~ale~~~~~~k~e~~~-~~k~~lr~k~~eyl 60 (75)
T cd02684 2 SLEKAIALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYETDA-QRKEALRQKVLQYV 60 (75)
T ss_pred cHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCH-HHHHHHHHHHHHHH
Confidence 345556788999999999999999999999998887765332222 22223666777775
No 11
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=79.92 E-value=7.4 Score=26.74 Aligned_cols=29 Identities=14% Similarity=0.028 Sum_probs=24.9
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
....|+-+...|++++|+..|+.++..-|
T Consensus 88 ~~~la~~~~~~g~~~~A~~~~~~al~~~p 116 (135)
T TIGR02552 88 YFHAAECLLALGEPESALKALDLAIEICG 116 (135)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhcc
Confidence 45667778889999999999999999876
No 12
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=79.46 E-value=14 Score=24.92 Aligned_cols=37 Identities=14% Similarity=0.026 Sum_probs=32.0
Q ss_pred ChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692 71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
.++.-+..+++|...=++|..++|+..|.+.+-....
T Consensus 2 ~l~kai~Lv~~A~~eD~~gny~eA~~lY~~ale~~~~ 38 (75)
T cd02680 2 DLERAHFLVTQAFDEDEKGNAEEAIELYTEAVELCIN 38 (75)
T ss_pred CHHHHHHHHHHHHHhhHhhhHHHHHHHHHHHHHHHHH
Confidence 3455667788998888999999999999999999888
No 13
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=79.32 E-value=11 Score=24.85 Aligned_cols=57 Identities=16% Similarity=0.297 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
+.-...+++|-+.=+.|++++|+..+.+.+..+...+-...++ .....+-..|.||+
T Consensus 4 ~~A~~l~~~Av~~D~~g~y~eA~~~Y~~aie~l~~~~k~e~~~-~~k~~~~~k~~eyl 60 (75)
T cd02678 4 QKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKYEKNP-KSKESIRAKCTEYL 60 (75)
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHhhCCCH-HHHHHHHHHHHHHH
Confidence 3345577888888899999999999999998887765332222 33344667777775
No 14
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=79.27 E-value=9.4 Score=24.82 Aligned_cols=55 Identities=20% Similarity=0.248 Sum_probs=40.0
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
-.+.++.|-+.=+.|++++|+..+.+.+..+....-.+.+ ...-..+-..|.+|+
T Consensus 6 a~~l~~~Av~~D~~g~~~~Al~~Y~~a~e~l~~~~~~~~~-~~~k~~l~~k~~~yl 60 (75)
T cd02656 6 AKELIKQAVKEDEDGNYEEALELYKEALDYLLQALKAEKE-PKLRKLLRKKVKEYL 60 (75)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccCCC-HHHHHHHHHHHHHHH
Confidence 3446778888888999999999999999888776533322 333444777788876
No 15
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=76.10 E-value=3.9 Score=25.15 Aligned_cols=29 Identities=17% Similarity=0.162 Sum_probs=25.7
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
-...|..++..|++++|+..|...|..-|
T Consensus 6 ~~~~g~~~~~~~~~~~A~~~~~~ai~~~p 34 (69)
T PF13414_consen 6 WYNLGQIYFQQGDYEEAIEYFEKAIELDP 34 (69)
T ss_dssp HHHHHHHHHHTTHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCC
Confidence 45779999999999999999999998765
No 16
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=76.08 E-value=14 Score=23.87 Aligned_cols=56 Identities=11% Similarity=0.216 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 74 DLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 74 ~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
.-.+.++.|-+.=+.|+.++|+..+++-+..+.-..-.. .+...-..+-..|++|+
T Consensus 7 ~A~~li~~Av~~d~~g~~~eAl~~Y~~a~e~l~~~~~~~-~~~~~~~~~~~k~~eyl 62 (77)
T smart00745 7 KAKELISKALKADEAGDYEEALELYKKAIEYLLEGIKVE-SDSKRREAVKAKAAEYL 62 (77)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccC-CCHHHHHHHHHHHHHHH
Confidence 344567788888889999999999999888877644222 22233334667777775
No 17
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=73.88 E-value=4.9 Score=19.15 Aligned_cols=26 Identities=15% Similarity=0.147 Sum_probs=21.1
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~ 104 (133)
.+.|.-+...|++++|+..|+..+..
T Consensus 5 ~~~a~~~~~~~~~~~a~~~~~~~~~~ 30 (34)
T smart00028 5 YNLGNAYLKLGDYDEALEYYEKALEL 30 (34)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHcc
Confidence 35567778889999999999988754
No 18
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=72.48 E-value=21 Score=23.85 Aligned_cols=57 Identities=16% Similarity=0.109 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
+.-.+.++.|-..=+.|++++|+..+++-|......+-.. .+.+.-.-+-..+.+|+
T Consensus 4 ~~A~~l~~~Ave~d~~~~y~eA~~~Y~~~i~~~~~~~k~e-~~~~~k~~ir~K~~eYl 60 (75)
T cd02677 4 EQAAELIRLALEKEEEGDYEAAFEFYRAGVDLLLKGVQGD-SSPERREAVKRKIAEYL 60 (75)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhccC-CCHHHHHHHHHHHHHHH
Confidence 3345678889999999999999999999998877754332 22222223566677775
No 19
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=72.14 E-value=4.3 Score=21.47 Aligned_cols=20 Identities=15% Similarity=0.280 Sum_probs=16.8
Q ss_pred HHHHhccCHHHHHHHHHhhc
Q psy2692 9 ESINKSSSFETAFKLLHDQI 28 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQi 28 (133)
..++.+|+++.|++++++-.
T Consensus 9 ~a~~~~g~~~~a~~~~~~M~ 28 (34)
T PF13812_consen 9 RACAKAGDPDAALQLFDEMK 28 (34)
T ss_pred HHHHHCCCHHHHHHHHHHHH
Confidence 35788999999999998754
No 20
>PRK15331 chaperone protein SicA; Provisional
Probab=71.56 E-value=7.6 Score=30.14 Aligned_cols=34 Identities=15% Similarity=0.151 Sum_probs=28.3
Q ss_pred CChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHH
Q psy2692 70 RSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLI 103 (133)
Q Consensus 70 ~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~ 103 (133)
++-+.|..--..||.++..||+.||...||-+..
T Consensus 32 is~~~le~iY~~Ay~~y~~Gk~~eA~~~F~~L~~ 65 (165)
T PRK15331 32 IPQDMMDGLYAHAYEFYNQGRLDEAETFFRFLCI 65 (165)
T ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 3445566677999999999999999999998765
No 21
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=70.91 E-value=9.5 Score=29.37 Aligned_cols=92 Identities=9% Similarity=-0.018 Sum_probs=57.5
Q ss_pred HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCC----CCCCccccCCCCCCCChHHHHHHHHHHHH
Q psy2692 9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLP----LLAPVTAFPTRKVDRSSIDLIAKIQQCYQ 84 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p----~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk 84 (133)
..+.+.|+++.|...+++=+- ..|-.+.+...+..+...-.+-. ....+.-.+.+ ..++.......|.-
T Consensus 81 ~~~~~~g~~~~A~~a~~~Al~---l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~----dP~~~~al~~LA~~ 153 (198)
T PRK10370 81 EYYLWRNDYDNALLAYRQALQ---LRGENAELYAALATVLYYQAGQHMTPQTREMIDKALAL----DANEVTALMLLASD 153 (198)
T ss_pred HHHHHCCCHHHHHHHHHHHHH---hCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHh----CCCChhHHHHHHHH
Confidence 356789999999999997653 44655555555544332222210 01111111111 22233445667888
Q ss_pred hcccCCHHHHHHHHHHHHHHccc
Q psy2692 85 LTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 85 ~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
++..|++++|+..|+.++...|=
T Consensus 154 ~~~~g~~~~Ai~~~~~aL~l~~~ 176 (198)
T PRK10370 154 AFMQADYAQAIELWQKVLDLNSP 176 (198)
T ss_pred HHHcCCHHHHHHHHHHHHhhCCC
Confidence 89999999999999999999886
No 22
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=70.85 E-value=14 Score=24.90 Aligned_cols=53 Identities=15% Similarity=0.194 Sum_probs=37.1
Q ss_pred HHHHHHHHhcccCCHHHHHHHHHHHHHHcccc-cccChHHHHHHHHHHHHHHHhh
Q psy2692 77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL-VVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~-vv~~~~e~~e~~eli~icrEYi 130 (133)
+-++.|-..=+.|++++|+..+++-+-.+... ..++..+.. ...+-..|.||+
T Consensus 8 ~~a~~Ave~D~~g~y~eA~~~Y~~aie~l~~~~~~~~~n~~~-k~~ir~K~~eYl 61 (76)
T cd02681 8 QFARLAVQRDQEGRYSEAVFYYKEAAQLLIYAEMAGTLNDSH-LKTIQEKSNEYL 61 (76)
T ss_pred HHHHHHHHHHHccCHHHHHHHHHHHHHHHHHHHHhcCCChHH-HHHHHHHHHHHH
Confidence 35678888889999999999999998887664 233232222 223567777775
No 23
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=70.56 E-value=8.6 Score=22.25 Aligned_cols=28 Identities=21% Similarity=0.165 Sum_probs=23.8
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
+.-|.-+...|++++|+..|+.++..-|
T Consensus 5 ~~la~~~~~~G~~~~A~~~~~~~l~~~P 32 (44)
T PF13428_consen 5 LALARAYRRLGQPDEAERLLRRALALDP 32 (44)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHCc
Confidence 4456677889999999999999998776
No 24
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=70.35 E-value=4 Score=24.96 Aligned_cols=23 Identities=35% Similarity=0.420 Sum_probs=19.2
Q ss_pred hcccCCHHHHHHHHHHHHHHccc
Q psy2692 85 LTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 85 ~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
++..|++++|+..|+.++..-|-
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p~ 23 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNPD 23 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTTT
T ss_pred ChhccCHHHHHHHHHHHHHHCCC
Confidence 35679999999999999988774
No 25
>KOG0543|consensus
Probab=68.29 E-value=10 Score=33.23 Aligned_cols=47 Identities=17% Similarity=0.199 Sum_probs=40.0
Q ss_pred HHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHH
Q psy2692 81 QCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICK 127 (133)
Q Consensus 81 ~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icr 127 (133)
.|=-+|+.|||..|...|+.++..+.--.-.+.+|.+++.++.-.|-
T Consensus 214 ~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~ 260 (397)
T KOG0543|consen 214 RGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACH 260 (397)
T ss_pred hhhHHHhhchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHh
Confidence 37789999999999999999999998776677788888888776664
No 26
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=67.97 E-value=8.7 Score=23.13 Aligned_cols=38 Identities=18% Similarity=0.177 Sum_probs=32.2
Q ss_pred HHHHhccCHHHHHHHHHhhcC-------cccccccHHHHHHHHhh
Q psy2692 9 ESINKSSSFETAFKLLHDQIG-------VKNFLPYRQMFLNSYLQ 46 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiG-------ivNf~PLk~~F~~~y~~ 46 (133)
.+.+..|+++.|+..+++-.. -..|.-.++.|++....
T Consensus 9 ~~~i~~g~~~~a~~~~~~~~~~l~~~~~~l~f~L~~q~~lell~~ 53 (58)
T smart00668 9 RELILKGDWDEALEWLSSLKPPLLERNSKLEFELRKQKFLELVRQ 53 (58)
T ss_pred HHHHHcCCHHHHHHHHHHcCHHHhccCCCchhHHHHHHHHHHHHc
Confidence 357889999999999988876 67888899999988754
No 27
>cd03598 CLECT_EMBP_like C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CLECT_EMBP_like: C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Eosinophils and basophils carry out various functions in allergic, parasitic, and inflammatory diseases. EMBP is stored in eosinophil crystalloid granules and is released upon degranulation. EMBP is also expressed in basophils. The proform of EMBP is expressed in placental X cells and breast tissue and increases significantly during human pregnancy. EMBP has cytotoxic properties and damages bacteria and mammalian cells, in vitro, as well as, helminth parasites. EMBP deposition has been observed in the inflamed tissue of all
Probab=67.88 E-value=2.9 Score=28.93 Aligned_cols=42 Identities=17% Similarity=0.217 Sum_probs=34.9
Q ss_pred HHHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 82 CYQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 82 gyk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
+|+.++ ...|.||...+++. +-.-|+.++|.+|.+.+.+++.
T Consensus 3 Cy~~~~~~~t~~~A~~~C~~~-~g~~La~i~s~~e~~~l~~~~~ 45 (117)
T cd03598 3 CYRFVKSPRTFRDAQVICRRC-YRGNLASIHSFAFNYRVQRLVS 45 (117)
T ss_pred eEEEecCCCCHHHHHHHhhcC-CCceEeeecChhHhHHHHHHHh
Confidence 466664 58899999999987 6677899999999999988874
No 28
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=67.60 E-value=18 Score=24.49 Aligned_cols=52 Identities=13% Similarity=0.135 Sum_probs=34.8
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
-++.|-++=+.|+++||+..++.-+-.+.=++-...++.. ..-+-+.+.||.
T Consensus 9 ~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~-k~~yr~ki~eY~ 60 (75)
T cd02682 9 YAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPT-RLIYEQMINEYK 60 (75)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHH-HHHHHHHHHHHH
Confidence 4678888889999999999999877666554444333332 222555666664
No 29
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=67.42 E-value=17 Score=24.40 Aligned_cols=54 Identities=17% Similarity=0.272 Sum_probs=38.9
Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
.+.++.|-..=+.|++++|+..+.+-|..+...+-... +...-.-+-..+.||+
T Consensus 7 ~~l~~~Ave~D~~g~y~eAl~~Y~~aie~l~~~lk~e~-d~~~k~~~r~ki~eY~ 60 (77)
T cd02683 7 KEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTK-DEAKKKNLRQKISEYM 60 (77)
T ss_pred HHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCC-CHHHHHHHHHHHHHHH
Confidence 45678888889999999999999999988877643332 2222333666777775
No 30
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=66.36 E-value=15 Score=33.50 Aligned_cols=82 Identities=16% Similarity=0.361 Sum_probs=47.6
Q ss_pred HHhccCHHHHHHHHHhhcCcc---------------cccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHH-
Q psy2692 11 INKSSSFETAFKLLHDQIGVK---------------NFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSID- 74 (133)
Q Consensus 11 ~~AAGsFe~A~~LLnrQiGiv---------------Nf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~- 74 (133)
|-.||++|.+++++|+++|-+ +...+-..+++.|.........+. +. ..+.
T Consensus 437 y~La~~~d~vl~lln~~Ls~~l~~~~~~~~~~s~~~~l~~la~~i~~~y~~~~~~~~~~~----------~~---~~~t~ 503 (613)
T PF04097_consen 437 YHLAEEYDKVLSLLNRLLSQVLSQPSSSSLSDSERERLIELAKEILERYKSNPHISSKVS----------RK---NRETF 503 (613)
T ss_dssp HHHTT-HHHHHHHHHHHHHHHHHCSSTSSSSSTTTTSHHHHHHHHHHHHTTSHHHHTTS-----------HH---HHHHH
T ss_pred HHHHhhHHHHHHHHHHHHHHHHcCccccccccchhhhHHHHHHHHHHHHHhCcchHhhcc----------HH---HHHHH
Confidence 456899999999999997643 222233444455544433322200 00 1222
Q ss_pred -HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 75 -LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 75 -L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
+.-+|.+.+.++.+|+.++|++.+.++ ..+|
T Consensus 504 ~~Ll~L~~ff~~~~~g~~~~AL~~i~~L-~liP 535 (613)
T PF04097_consen 504 QLLLDLAEFFDLYHAGQYEQALDIIEKL-DLIP 535 (613)
T ss_dssp HHHHHHHHHHHHHHTT-HHHHHHHHHHT-T-S-
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHhC-CCCC
Confidence 333688999999999999999877653 4566
No 31
>PF14276 DUF4363: Domain of unknown function (DUF4363)
Probab=64.92 E-value=46 Score=23.44 Aligned_cols=61 Identities=11% Similarity=0.129 Sum_probs=50.2
Q ss_pred hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHc-----ccccccChHHHHHHHHHHHHHHHhhhc
Q psy2692 72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICI-----PLLVVTTKKEIEEAKQIINICKEYIVG 132 (133)
Q Consensus 72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i-----~l~vv~~~~e~~e~~eli~icrEYila 132 (133)
-+++.+.+....+.+..++.++|...+..+.... .+...-+.+|.+++..-+...+.||-+
T Consensus 25 ~~~i~~~l~~i~~~i~~~dW~~A~~~~~~l~~~W~k~~~~~~~~~~h~eid~i~~sl~rl~~~i~~ 90 (121)
T PF14276_consen 25 TDSIEEQLEQIEEAIENEDWEKAYKETEELEKEWDKNKKRWSILIEHQEIDNIDISLARLKGYIEA 90 (121)
T ss_pred HHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhchheeeeecHHHHHHHHHHHHHHHHHHHC
Confidence 4568889999999999999999999988887653 223336799999999999999999854
No 32
>PRK09720 cybC cytochrome b562; Provisional
Probab=63.90 E-value=14 Score=26.57 Aligned_cols=28 Identities=21% Similarity=0.264 Sum_probs=25.2
Q ss_pred ChHHHHHHHHHHHHhcccCCHHHHHHHH
Q psy2692 71 SSIDLIAKIQQCYQLTTSGKFVDAIEKL 98 (133)
Q Consensus 71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~F 98 (133)
+++.|+..+..+-++...||++||...=
T Consensus 58 Gld~lI~qID~A~~La~~GkL~eAK~~a 85 (100)
T PRK09720 58 GFDILVGQIDDALKLANEGKVKEAQAAA 85 (100)
T ss_pred HHHHHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 6889999999999999999999998543
No 33
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=62.82 E-value=25 Score=25.14 Aligned_cols=90 Identities=11% Similarity=0.054 Sum_probs=46.3
Q ss_pred HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc
Q psy2692 9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS 88 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~ 88 (133)
..++..|+|+.|...|++=+.-..=.++++.. .+.. +++.+..-..-..+.. +.....-.+..... .-.|--++..
T Consensus 56 ~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a-~l~L-A~~~~~~~~~d~Al~~-L~~~~~~~~~~~~~-~~~Gdi~~~~ 131 (145)
T PF09976_consen 56 KAAYEQGDYDEAKAALEKALANAPDPELKPLA-RLRL-ARILLQQGQYDEALAT-LQQIPDEAFKALAA-ELLGDIYLAQ 131 (145)
T ss_pred HHHHHCCCHHHHHHHHHHHHhhCCCHHHHHHH-HHHH-HHHHHHcCCHHHHHHH-HHhccCcchHHHHH-HHHHHHHHHC
Confidence 35667899999999999977654222344333 2211 1222111111000000 00000111222222 2467778889
Q ss_pred CCHHHHHHHHHHHH
Q psy2692 89 GKFVDAIEKLRNVL 102 (133)
Q Consensus 89 gKF~eAl~~Fr~iL 102 (133)
|++++|+..|+..|
T Consensus 132 g~~~~A~~~y~~Al 145 (145)
T PF09976_consen 132 GDYDEARAAYQKAL 145 (145)
T ss_pred CCHHHHHHHHHHhC
Confidence 99999999998753
No 34
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=61.33 E-value=9.8 Score=19.89 Aligned_cols=19 Identities=16% Similarity=0.202 Sum_probs=16.0
Q ss_pred hHHHHhccCHHHHHHHHHh
Q psy2692 8 SESINKSSSFETAFKLLHD 26 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnr 26 (133)
+..+...|+++.|.+++.+
T Consensus 8 a~~~~~~G~~~eA~~~l~~ 26 (26)
T PF07721_consen 8 ARALLAQGDPDEAERLLRR 26 (26)
T ss_pred HHHHHHcCCHHHHHHHHhC
Confidence 4568899999999999864
No 35
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=61.16 E-value=14 Score=28.29 Aligned_cols=31 Identities=16% Similarity=0.253 Sum_probs=24.0
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
..+.|...+..|++.+|+..|..++..-|-.
T Consensus 8 lY~~a~~~~~~g~y~~Ai~~f~~l~~~~P~s 38 (203)
T PF13525_consen 8 LYQKALEALQQGDYEEAIKLFEKLIDRYPNS 38 (203)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHHHH-TTS
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC
Confidence 5578888888888888888888888887764
No 36
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=61.02 E-value=18 Score=32.38 Aligned_cols=37 Identities=16% Similarity=0.185 Sum_probs=30.8
Q ss_pred ChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692 71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
..++-....+.|..++..|++++|+..|+..|..-|=
T Consensus 71 dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd 107 (453)
T PLN03098 71 DVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPN 107 (453)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence 4455666788899999999999999999999887664
No 37
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=60.85 E-value=12 Score=21.91 Aligned_cols=28 Identities=18% Similarity=0.200 Sum_probs=19.8
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.+.|.-.+..|++++|+..|+.++..-|
T Consensus 4 ~~~a~~~~~~~~~~~A~~~~~~~~~~~~ 31 (100)
T cd00189 4 LNLGNLYYKLGDYDEALEYYEKALELDP 31 (100)
T ss_pred HHHHHHHHHHhcHHHHHHHHHHHHhcCC
Confidence 3455666677888888888888776554
No 38
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=60.10 E-value=33 Score=28.83 Aligned_cols=109 Identities=13% Similarity=0.058 Sum_probs=59.3
Q ss_pred hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcc
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTT 87 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t 87 (133)
+.+....|+|+.|+.++++-+..- ..-..-++ ..+.+++-.-..--.+ ....+......+.....++.|.-+..
T Consensus 9 a~~a~~~~~~~~Ai~~~~~Al~~~-P~~~~a~~----~~a~~~~~~g~~~eAl-~~~~~Al~l~P~~~~a~~~lg~~~~~ 82 (356)
T PLN03088 9 AKEAFVDDDFALAVDLYTQAIDLD-PNNAELYA----DRAQANIKLGNFTEAV-ADANKAIELDPSLAKAYLRKGTACMK 82 (356)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHhC-CCCHHHHH----HHHHHHHHcCCHHHHH-HHHHHHHHhCcCCHHHHHHHHHHHHH
Confidence 457788999999999999887642 22111111 1122221110000000 00000000011112234566777888
Q ss_pred cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
.|++++|+..|+..+..-|-- .++...+..|.+.+
T Consensus 83 lg~~~eA~~~~~~al~l~P~~--------~~~~~~l~~~~~kl 117 (356)
T PLN03088 83 LEEYQTAKAALEKGASLAPGD--------SRFTKLIKECDEKI 117 (356)
T ss_pred hCCHHHHHHHHHHHHHhCCCC--------HHHHHHHHHHHHHH
Confidence 999999999999999987642 23566677776554
No 39
>KOG4648|consensus
Probab=57.31 E-value=15 Score=32.62 Aligned_cols=51 Identities=20% Similarity=0.265 Sum_probs=37.9
Q ss_pred HHHHhcccCCHHHHHHHHHHHHHHcccccccC-------------hHHHHHHHHHHHHHHHhhh
Q psy2692 81 QCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT-------------KKEIEEAKQIINICKEYIV 131 (133)
Q Consensus 81 ~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~-------------~~e~~e~~eli~icrEYil 131 (133)
.|-++|+.|||+||+++|-.-+-.-|-..|-- ..-+.++...|.+-|-|+-
T Consensus 103 ~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~K 166 (536)
T KOG4648|consen 103 RGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVK 166 (536)
T ss_pred hhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHHHHH
Confidence 48999999999999999999998888444431 1234456667777777764
No 40
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=56.93 E-value=19 Score=19.98 Aligned_cols=22 Identities=23% Similarity=0.253 Sum_probs=17.1
Q ss_pred HHHHhcccCCHHHHHHHHHHHH
Q psy2692 81 QCYQLTTSGKFVDAIEKLRNVL 102 (133)
Q Consensus 81 ~gyk~~t~gKF~eAl~~Fr~iL 102 (133)
-|--....|++++|++.|+..|
T Consensus 5 Lg~~~~~~g~~~~Ai~~y~~aL 26 (36)
T PF13176_consen 5 LGRIYRQQGDYEKAIEYYEQAL 26 (36)
T ss_dssp HHHHHHHCT-HHHHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHHHHHHH
Confidence 3445678999999999999966
No 41
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=56.50 E-value=26 Score=26.86 Aligned_cols=40 Identities=18% Similarity=0.113 Sum_probs=31.5
Q ss_pred CCC-hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 69 DRS-SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 69 ~~~-l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
.++ -+.|..--.-||.+...||+.||...|+-+...=|-.
T Consensus 28 ~~~~~~~l~~lY~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~ 68 (157)
T PRK15363 28 DDDVTQPLNTLYRYAMQLMEVKEFAGAARLFQLLTIYDAWS 68 (157)
T ss_pred CCChHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccc
Confidence 445 3345556688999999999999999999988776654
No 42
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=56.10 E-value=25 Score=18.47 Aligned_cols=27 Identities=19% Similarity=0.230 Sum_probs=21.5
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
..|.-....|++++|++.|+..+..-|
T Consensus 6 ~lg~~y~~~~~~~~A~~~~~~a~~~~~ 32 (34)
T PF13181_consen 6 NLGKIYEQLGDYEEALEYFEKALELNP 32 (34)
T ss_dssp HHHHHHHHTTSHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhCC
Confidence 456666779999999999999887543
No 43
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=55.45 E-value=24 Score=23.07 Aligned_cols=21 Identities=10% Similarity=0.180 Sum_probs=16.1
Q ss_pred HHHHhccCHHHHHHHHHhhcC
Q psy2692 9 ESINKSSSFETAFKLLHDQIG 29 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiG 29 (133)
..+...|+++.|+..+.+-+.
T Consensus 10 ~~~~~~~~~~~A~~~~~~~~~ 30 (119)
T TIGR02795 10 LLVLKAGDYADAIQAFQAFLK 30 (119)
T ss_pred HHHHHcCCHHHHHHHHHHHHH
Confidence 456778999999988877653
No 44
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=55.43 E-value=22 Score=19.19 Aligned_cols=25 Identities=20% Similarity=0.349 Sum_probs=18.6
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~ 104 (133)
+-|.-+...|++++|+..++.++..
T Consensus 7 ~la~~~~~~g~~~~A~~~~~~al~~ 31 (42)
T PF13374_consen 7 NLANAYRAQGRYEEALELLEEALEI 31 (42)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHhhhhcchhhHHHHHHHHH
Confidence 4455566779999999999988754
No 45
>PF03986 Autophagy_N: Autophagocytosis associated protein (Atg3), N-terminal domain ; InterPro: IPR007134 Proteins in this entry belong to the Atg3 group of proteins and the Atg3 conjugation enzymes. Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. Atg3 is the E2 enzyme for the LC3 lipidation process []. It is essential for autophagocytosis. The super protein complex, the Atg16L complex, consists of multiple Atg12-Atg5 conjugates. Atg16L has an E3-like role in the LC3 lipidation reaction. The activated intermediate, LC3-Atg3 (E2), is recruited to the site where the lipidation takes place []. Atg3 catalyses the conjugation of Atg8 and phosphatidylethanolamine (PE). Atg3 has an alpha/beta-fold, and its core region is topologically similar to canonical E2 enzymes. Atg3 has two regions inserted in the core region and another with a long alpha-helical structure that protrudes from the core region as far as 30 A []. It interacts with atg8 through an intermediate thioester bond between Cys-288 and the C-terminal Gly of atg8. It also interacts with the C-terminal region of the E1-like atg7 enzyme. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the lysosome/vacuole. Atg3 is a ubiquitin like modifier that is topologically similar to the canonical E2 enzyme []. It catalyses the conjugation of Atg8 and phosphatidylethanolamine []. This domain is the N-terminal of Atg3 while the C-terminal is represented by IPR007135 from INTERPRO.; PDB: 3T7G_C 2DYT_A.
Probab=55.32 E-value=3.9 Score=30.99 Aligned_cols=13 Identities=15% Similarity=0.350 Sum_probs=9.6
Q ss_pred cCChHHHHhccCH
Q psy2692 5 KVTSESINKSSSF 17 (133)
Q Consensus 5 ~~~~~~~~AAGsF 17 (133)
.+||+++|+||+|
T Consensus 27 ~iTPeEFV~AGD~ 39 (145)
T PF03986_consen 27 VITPEEFVAAGDY 39 (145)
T ss_dssp ---HHHHHHHHHH
T ss_pred eeCHHHHHHhhhH
Confidence 4799999999997
No 46
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=54.96 E-value=32 Score=20.76 Aligned_cols=58 Identities=14% Similarity=0.191 Sum_probs=40.3
Q ss_pred HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc
Q psy2692 9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS 88 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~ 88 (133)
..+...|+|+.|...+++-+..- |. ........|.-+...
T Consensus 5 ~~~~~~g~~~~A~~~~~~~l~~~-----------------------P~-----------------~~~a~~~lg~~~~~~ 44 (65)
T PF13432_consen 5 RALYQQGDYDEAIAAFEQALKQD-----------------------PD-----------------NPEAWYLLGRILYQQ 44 (65)
T ss_dssp HHHHHCTHHHHHHHHHHHHHCCS-----------------------TT-----------------HHHHHHHHHHHHHHT
T ss_pred HHHHHcCCHHHHHHHHHHHHHHC-----------------------CC-----------------CHHHHHHHHHHHHHc
Confidence 45778888888888887653221 11 122345666677889
Q ss_pred CCHHHHHHHHHHHHHHcc
Q psy2692 89 GKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 89 gKF~eAl~~Fr~iL~~i~ 106 (133)
|++++|+..|+.++..-|
T Consensus 45 g~~~~A~~~~~~a~~~~P 62 (65)
T PF13432_consen 45 GRYDEALAYYERALELDP 62 (65)
T ss_dssp T-HHHHHHHHHHHHHHST
T ss_pred CCHHHHHHHHHHHHHHCc
Confidence 999999999999987655
No 47
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=54.32 E-value=14 Score=22.40 Aligned_cols=25 Identities=20% Similarity=0.225 Sum_probs=20.8
Q ss_pred hHHHHhccCHHHHHHHHHhhcCccc
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVKN 32 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGivN 32 (133)
+..|+..|+.|+|-.+|+.-+.--|
T Consensus 6 A~ayie~Gd~e~Ar~lL~evl~~~~ 30 (44)
T TIGR03504 6 ARAYIEMGDLEGARELLEEVIEEGD 30 (44)
T ss_pred HHHHHHcCChHHHHHHHHHHHHcCC
Confidence 4679999999999999998875444
No 48
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.75 E-value=15 Score=30.57 Aligned_cols=31 Identities=16% Similarity=0.244 Sum_probs=28.3
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
.-+.||+++..|||.+|...|+.-+...|=-
T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s 174 (262)
T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNS 174 (262)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCCC
Confidence 6799999999999999999999999988743
No 49
>PRK15058 cytochrome b562; Provisional
Probab=53.44 E-value=24 Score=26.39 Aligned_cols=88 Identities=18% Similarity=0.218 Sum_probs=54.8
Q ss_pred HhccCHHHHHHHHHhhcCcccc----cccHHHHHHHHhhhhhhccCCC----CCCCccccCCCC-CCCChHHHHHHHHHH
Q psy2692 12 NKSSSFETAFKLLHDQIGVKNF----LPYRQMFLNSYLQSRTAYSTLP----LLAPVTAFPTRK-VDRSSIDLIAKIQQC 82 (133)
Q Consensus 12 ~AAGsFe~A~~LLnrQiGivNf----~PLk~~F~~~y~~s~~~~~~~p----~lp~l~~~~~rn-~~~~l~~L~~~Lk~g 82 (133)
+-|.+-+.-|+-+++.++.++= +-+|.-....-..+--+-.+.| +.||=. .... ---+++.|+..|..+
T Consensus 20 a~Aa~l~~~M~~m~~~~k~~~~A~~a~~~k~al~~mr~aa~~Ak~~~P~kl~~k~~ds--~e~K~Y~~G~d~Li~qID~a 97 (128)
T PRK15058 20 SFAADLEDNMETLNDNLKVVEKTDNAAEVKDALTKMRAAALDAQKATPPKLEDKAPDS--PEMKDFRHGFDILVGQIDGA 97 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHhccCCchhhccCCCC--HHHHHHHHHHHHHHHHHHHH
Confidence 3455789999999999987754 4455544444433333332222 222210 0000 022688899999999
Q ss_pred HHhcccCCHHHHHHHHHHH
Q psy2692 83 YQLTTSGKFVDAIEKLRNV 101 (133)
Q Consensus 83 yk~~t~gKF~eAl~~Fr~i 101 (133)
-++..+||++||...-+.+
T Consensus 98 ~~la~~GkL~eAK~~a~~l 116 (128)
T PRK15058 98 LKLANEGKVKEAQAAAEQL 116 (128)
T ss_pred HHHHhCCCHHHHHHHHHHH
Confidence 9999999999998665443
No 50
>PF07361 Cytochrom_B562: Cytochrome b562; InterPro: IPR009155 Cytochrome b562 is a haem-containing protein that is expressed in the periplasm of Escherichia coli. In b-type cytochromes, the haem atom is not covalently attached to the polypeptide. Cytochrome b562 has a four-helical bundle structure that is structurally similar to that found in members of the cytochrome c family (IPR002321 from INTERPRO). Cytochrome b562 has a reduction potential of 167 mV, which sets the energy yield possible in metabolism and is also a key determinant of the rate at which redox reactions proceed [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0042597 periplasmic space; PDB: 4ER9_A 3IQ6_G 2QLA_B 3FOO_A 3M79_C 256B_A 3NMI_F 3HNK_A 3NMK_D 2BC5_A ....
Probab=53.31 E-value=18 Score=25.56 Aligned_cols=31 Identities=26% Similarity=0.373 Sum_probs=27.7
Q ss_pred ChHHHHHHHHHHHHhcccCCHHHHHHHHHHH
Q psy2692 71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNV 101 (133)
Q Consensus 71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~i 101 (133)
+++.|+..+..+-+++.+|++++|....+.+
T Consensus 61 Gl~~li~~id~a~~~~~~G~l~~AK~~l~~l 91 (103)
T PF07361_consen 61 GLDKLIDQIDKAEALAEAGKLDEAKAALKKL 91 (103)
T ss_dssp HHHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 6888999999999999999999999877665
No 51
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=53.29 E-value=16 Score=18.73 Aligned_cols=19 Identities=21% Similarity=0.461 Sum_probs=16.3
Q ss_pred HHHhccCHHHHHHHHHhhc
Q psy2692 10 SINKSSSFETAFKLLHDQI 28 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnrQi 28 (133)
.++.+|+++.|++++++-.
T Consensus 9 ~~~~~~~~~~a~~~~~~M~ 27 (35)
T TIGR00756 9 GLCKAGRVEEALELFKEML 27 (35)
T ss_pred HHHHCCCHHHHHHHHHHHH
Confidence 5788999999999998753
No 52
>KOG0553|consensus
Probab=53.10 E-value=28 Score=29.58 Aligned_cols=89 Identities=13% Similarity=0.140 Sum_probs=56.7
Q ss_pred HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccC----CC-CCCCccccCCCCCCCChHHHHHHHHHHH
Q psy2692 9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYST----LP-LLAPVTAFPTRKVDRSSIDLIAKIQQCY 83 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~----~p-~lp~l~~~~~rn~~~~l~~L~~~Lk~gy 83 (133)
-++.+.|+|+.|.+...+-|-+. |==+.|-.-=.++++.+.. +- +...+ .++..+-..-.+-|.
T Consensus 89 N~~m~~~~Y~eAv~kY~~AI~l~---P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al--------~iDp~yskay~RLG~ 157 (304)
T KOG0553|consen 89 NKLMKNKDYQEAVDKYTEAIELD---PTNAVYYCNRAAAYSKLGEYEDAVKDCESAL--------SIDPHYSKAYGRLGL 157 (304)
T ss_pred HHHHHhhhHHHHHHHHHHHHhcC---CCcchHHHHHHHHHHHhcchHHHHHHHHHHH--------hcChHHHHHHHHHHH
Confidence 36788999999999999986554 4333333222333333321 11 01111 223344444567788
Q ss_pred HhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 84 QLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 84 k~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
-+...||+.+|++.|+..|+.=|--
T Consensus 158 A~~~~gk~~~A~~aykKaLeldP~N 182 (304)
T KOG0553|consen 158 AYLALGKYEEAIEAYKKALELDPDN 182 (304)
T ss_pred HHHccCcHHHHHHHHHhhhccCCCc
Confidence 8899999999999999999887754
No 53
>PF05538 Campylo_MOMP: Campylobacter major outer membrane protein; InterPro: IPR008439 Campylobacter are Gram-negative, spiral, microaerophilic bacteria. Campylobacter jejuni is one of the main causative agents of food poisoning in the developed world. This family consists of Campylobacter major outer membrane proteins. The major outer membrane protein (MOMP), a putative porin and a multifunction surface protein of Campylobacter jejuni, may play an important role in the adaptation of the organism to various host environments [].
Probab=52.76 E-value=4.6 Score=35.74 Aligned_cols=37 Identities=19% Similarity=0.213 Sum_probs=32.7
Q ss_pred CccccCChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhh
Q psy2692 1 MNLHKVTSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSR 48 (133)
Q Consensus 1 ~~~~~~~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~ 48 (133)
|.|-|+|-.+.||+|+|.+| |-.||.+.++.|-.+-.
T Consensus 1 MKl~KlSLaAavA~gafStA-----------~A~PLeEaiKdVdvSG~ 37 (431)
T PF05538_consen 1 MKLVKLSLAAAVALGAFSTA-----------SATPLEEAIKDVDVSGF 37 (431)
T ss_pred CchHHHHHHHHHHhhhhhhc-----------cCCcHHHHhcccccceE
Confidence 88999999999999999987 77899999999976543
No 54
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=52.43 E-value=55 Score=20.36 Aligned_cols=26 Identities=12% Similarity=0.150 Sum_probs=21.5
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~ 104 (133)
..-|.-+...|++++|++.|+..+..
T Consensus 9 ~~la~~~~~~~~~~~A~~~~~~al~~ 34 (78)
T PF13424_consen 9 NNLARVYRELGRYDEALDYYEKALDI 34 (78)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 45566777999999999999999988
No 55
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=51.79 E-value=28 Score=24.01 Aligned_cols=32 Identities=22% Similarity=0.242 Sum_probs=28.8
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
-...+.+|.|+...++=++|+..++++|..++
T Consensus 6 ak~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~ 37 (80)
T PF10579_consen 6 AKQQIEKGLKLYHQNETQQALQKWRKALEKIT 37 (80)
T ss_pred HHHHHHHHHHHhccchHHHHHHHHHHHHhhcC
Confidence 34589999999999999999999999998875
No 56
>KOG1840|consensus
Probab=51.77 E-value=34 Score=30.90 Aligned_cols=110 Identities=14% Similarity=0.006 Sum_probs=56.1
Q ss_pred hHHHHhccCHHHHHHHHHhhcCcc-----cccccHHH----HHHHHhhhhhhccCCCC-CCCccccCCCCCCCChHHHHH
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVK-----NFLPYRQM----FLNSYLQSRTAYSTLPL-LAPVTAFPTRKVDRSSIDLIA 77 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGiv-----Nf~PLk~~----F~~~y~~s~~~~~~~p~-lp~l~~~~~rn~~~~l~~L~~ 77 (133)
+..++..|+||.|.++-.+-+-++ .-.|.-.. |=.+|....-+..+++- -+.+..+..---..+...-.-
T Consensus 206 a~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~ 285 (508)
T KOG1840|consen 206 AEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAAT 285 (508)
T ss_pred HHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHH
Confidence 467888999998776655544331 22232222 33344444444444442 222221110000112222222
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHH
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIE 117 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~ 117 (133)
..+-|+-+-..|||.||...+.+.+-+.-=+-..+..|+.
T Consensus 286 l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~ 325 (508)
T KOG1840|consen 286 LNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVA 325 (508)
T ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHhhccChHHHH
Confidence 3344666699999999999999887665443334444443
No 57
>PF09577 Spore_YpjB: Sporulation protein YpjB (SpoYpjB); InterPro: IPR014231 Proteins in thie entry, typified by YpjB, are restricted to a subset of the endospore-forming bacteria which includes Bacillus species, but not species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon []. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set.
Probab=50.92 E-value=81 Score=25.66 Aligned_cols=40 Identities=18% Similarity=0.111 Sum_probs=31.0
Q ss_pred hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692 72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT 111 (133)
Q Consensus 72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~ 111 (133)
+..|...--+.|++++.||++||......+--..+-....
T Consensus 2 w~eLd~~sd~~lqlvk~~~yeeA~q~l~~fs~~f~~~~~~ 41 (232)
T PF09577_consen 2 WKELDQLSDEALQLVKQGKYEEAKQLLEYFSEQFTSVDFK 41 (232)
T ss_pred hHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHhhcccc
Confidence 3456666788999999999999999888777666665553
No 58
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=50.06 E-value=13 Score=23.95 Aligned_cols=78 Identities=18% Similarity=0.260 Sum_probs=39.8
Q ss_pred ccCHHHHHHHHHhhcCcccccccHHHH----HHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccC
Q psy2692 14 SSSFETAFKLLHDQIGVKNFLPYRQMF----LNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSG 89 (133)
Q Consensus 14 AGsFe~A~~LLnrQiGivNf~PLk~~F----~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~g 89 (133)
.|+++.|..+.++=+..-...| .+.+ -.+|....-+-.++..+-.+ ..+........-.|.-++..|
T Consensus 2 ~~~y~~Ai~~~~k~~~~~~~~~-~~~~~~~la~~~~~~~~y~~A~~~~~~~--------~~~~~~~~~~~l~a~~~~~l~ 72 (84)
T PF12895_consen 2 QGNYENAIKYYEKLLELDPTNP-NSAYLYNLAQCYFQQGKYEEAIELLQKL--------KLDPSNPDIHYLLARCLLKLG 72 (84)
T ss_dssp TT-HHHHHHHHHHHHHHHCGTH-HHHHHHHHHHHHHHTTHHHHHHHHHHCH--------THHHCHHHHHHHHHHHHHHTT
T ss_pred CccHHHHHHHHHHHHHHCCCCh-hHHHHHHHHHHHHHCCCHHHHHHHHHHh--------CCCCCCHHHHHHHHHHHHHhC
Confidence 3778888888888776665543 2221 12333332222222211110 111112333333477888899
Q ss_pred CHHHHHHHHHH
Q psy2692 90 KFVDAIEKLRN 100 (133)
Q Consensus 90 KF~eAl~~Fr~ 100 (133)
+++||++.|..
T Consensus 73 ~y~eAi~~l~~ 83 (84)
T PF12895_consen 73 KYEEAIKALEK 83 (84)
T ss_dssp -HHHHHHHHHH
T ss_pred CHHHHHHHHhc
Confidence 99999998875
No 59
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved. TM exerts anti-fibrinolytic and anti-inflammatory activity. TM also regulates blood coagulation in the anticoagulant protein C pathway. In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM. TM also plays a key role in tumor biology. It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma. Loss of TM expression correlates with advanced stage and poor prognosis. Loss of function of TM func
Probab=50.03 E-value=15 Score=26.17 Aligned_cols=42 Identities=19% Similarity=0.296 Sum_probs=34.5
Q ss_pred HHHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHH
Q psy2692 82 CYQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINI 125 (133)
Q Consensus 82 gyk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~i 125 (133)
+|+.++ ...|.||...++.. -.-|.+++|.+|.+.+++++..
T Consensus 6 Cy~~~~~~~sw~~A~~~C~~~--gg~La~i~s~~E~~~v~~~l~~ 48 (141)
T cd03600 6 CYTLHPQKLTFLEAQRSCIEL--GGNLATVRSGEEADVVSLLLAA 48 (141)
T ss_pred eEEEeCCccCHHHHHHHHHhh--CCEeeecCCHHHHHHHHHHHhh
Confidence 566665 48899999999984 6788999999999999888754
No 60
>PF13041 PPR_2: PPR repeat family
Probab=49.44 E-value=16 Score=21.40 Aligned_cols=20 Identities=20% Similarity=0.391 Sum_probs=17.0
Q ss_pred hcccCCHHHHHHHHHHHHHH
Q psy2692 85 LTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 85 ~~t~gKF~eAl~~Fr~iL~~ 104 (133)
+.+.|++++|.+.|++....
T Consensus 13 ~~~~~~~~~a~~l~~~M~~~ 32 (50)
T PF13041_consen 13 YCKAGKFEEALKLFKEMKKR 32 (50)
T ss_pred HHHCcCHHHHHHHHHHHHHc
Confidence 46789999999999998754
No 61
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=48.15 E-value=28 Score=26.34 Aligned_cols=32 Identities=16% Similarity=0.220 Sum_probs=23.1
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
....+..|..++..|+|++|+..|+.++..-|
T Consensus 33 ~~~~~~~g~~~~~~~~~~~A~~~~~~~~~~~p 64 (235)
T TIGR03302 33 AEELYEEAKEALDSGDYTEAIKYFEALESRYP 64 (235)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 44467777777777888888887777776655
No 62
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=48.07 E-value=37 Score=22.11 Aligned_cols=25 Identities=20% Similarity=0.380 Sum_probs=12.2
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~ 104 (133)
+.|..++..|++++|++.|..++..
T Consensus 7 ~~~~~~~~~~~~~~A~~~~~~~~~~ 31 (119)
T TIGR02795 7 DAALLVLKAGDYADAIQAFQAFLKK 31 (119)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 3444444455555555555555443
No 63
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=47.02 E-value=30 Score=24.67 Aligned_cols=30 Identities=20% Similarity=0.243 Sum_probs=25.4
Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHc
Q psy2692 76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICI 105 (133)
Q Consensus 76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i 105 (133)
...|..|-..+..|+|++|+..|+.++-..
T Consensus 49 ~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~ 78 (145)
T PF09976_consen 49 LAALQLAKAAYEQGDYDEAKAALEKALANA 78 (145)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHhhC
Confidence 346778888899999999999999998754
No 64
>PF12854 PPR_1: PPR repeat
Probab=46.98 E-value=23 Score=19.68 Aligned_cols=17 Identities=24% Similarity=0.487 Sum_probs=14.8
Q ss_pred HHHhccCHHHHHHHHHh
Q psy2692 10 SINKSSSFETAFKLLHD 26 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnr 26 (133)
.+-.+|.+|.|++++.+
T Consensus 16 ~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 16 GYCKAGRVDEAFELFDE 32 (34)
T ss_pred HHHHCCCHHHHHHHHHh
Confidence 46789999999999976
No 65
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=46.01 E-value=95 Score=22.31 Aligned_cols=30 Identities=17% Similarity=-0.057 Sum_probs=25.9
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
..+-|.-+.+.|++++|+..|+..+..-|-
T Consensus 95 ~~~lg~~l~~~g~~~eAi~~~~~Al~~~p~ 124 (144)
T PRK15359 95 VYQTGVCLKMMGEPGLAREAFQTAIKMSYA 124 (144)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCCC
Confidence 467788888999999999999999987764
No 66
>PF00059 Lectin_C: Lectin C-type domain; InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins: C-type lectins, which are Ca+-dependent. S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel aggutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sailic acid-binding lectins, adherence lectin, and cytokins (such as tumour necrosis factor and several interleukins). C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=44.68 E-value=43 Score=21.33 Aligned_cols=33 Identities=9% Similarity=0.160 Sum_probs=27.9
Q ss_pred CHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 90 KFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 90 KF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
+|.+|...+++ +..-|++++|.+|.+-++..+.
T Consensus 4 ~~~~A~~~C~~--~~~~L~~i~~~~e~~~i~~~~~ 36 (105)
T PF00059_consen 4 TWEEAQQYCQS--MGAHLASINSEEENDFIQSQLK 36 (105)
T ss_dssp EHHHHHHHHHH--TTSEEB-GSSHHHHHHHHHHHH
T ss_pred CHHHHHHHHhc--CCCEEeEeCCHHHhhhhhhccc
Confidence 58999999997 8888999999999888888765
No 67
>KOG2168|consensus
Probab=44.22 E-value=83 Score=30.32 Aligned_cols=93 Identities=15% Similarity=0.216 Sum_probs=57.2
Q ss_pred HHHHhccCHHHHHHHHHhhcCcccccc-----cHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHH
Q psy2692 9 ESINKSSSFETAFKLLHDQIGVKNFLP-----YRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCY 83 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQiGivNf~P-----Lk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gy 83 (133)
+=|--||+||.|+.++|+-++=+=..| .+...-.+-.+-..-+-..|+.+... .++--.|.-++-..+
T Consensus 643 ~LY~lag~yd~al~link~LS~~l~~~~~~~~n~erl~~La~~~~~~y~~~~~~~~~~-------~~~t~~lLl~~~~~f 715 (835)
T KOG2168|consen 643 LLYHLAGDYDKALELINKLLSQVLHSPTLGQSNKERLGDLALSMNDIYESNKGDSAKV-------VVKTLSLLLDLVSFF 715 (835)
T ss_pred HHHHHhhhhhHHHHHHHHHHHHHHhhcccCCcchhhHHHHHHHHHHHHHhccCcchhh-------HHHHHHHHHHHHHHH
Confidence 335679999999999999887655554 34433333333333333333333211 112223455667788
Q ss_pred HhcccCCHHHHHHHHHHHHHHccccc
Q psy2692 84 QLTTSGKFVDAIEKLRNVLICIPLLV 109 (133)
Q Consensus 84 k~~t~gKF~eAl~~Fr~iL~~i~l~v 109 (133)
..-..|++++|++..+. |+.||+..
T Consensus 716 ~~y~~~~~e~aL~~le~-l~LiP~~~ 740 (835)
T KOG2168|consen 716 DLYHNGEWEEALSILEH-LDLIPLDP 740 (835)
T ss_pred HHHhhhHHHHHHHHHHH-HhccCCCh
Confidence 88889999999998765 56788743
No 68
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=44.09 E-value=37 Score=27.08 Aligned_cols=32 Identities=19% Similarity=0.158 Sum_probs=27.4
Q ss_pred HHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
.....|.+.+..|++.+|++.|+.++..-|..
T Consensus 34 ~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s 65 (243)
T PRK10866 34 EIYATAQQKLQDGNWKQAITQLEALDNRYPFG 65 (243)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC
Confidence 35588999999999999999999999988854
No 69
>PF10938 YfdX: YfdX protein; InterPro: IPR021236 YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in Escherichia coli []. ; PDB: 3DZA_C.
Probab=43.47 E-value=45 Score=25.12 Aligned_cols=40 Identities=23% Similarity=0.228 Sum_probs=32.4
Q ss_pred hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692 72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT 111 (133)
Q Consensus 72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~ 111 (133)
+..-...++.|-++..+||..||......++-.+.+..++
T Consensus 114 L~~~~~av~~A~~ll~~~k~~eA~~aL~~A~~~lv~~s~~ 153 (155)
T PF10938_consen 114 LAQTPAAVKQAAALLDEGKYYEANAALKQALDGLVFDSVD 153 (155)
T ss_dssp HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHT-EEEEEE
T ss_pred HHhhHHHHHHHHHHHHCCCHHHHHHHHHHHhcCeEEEEee
Confidence 4555668999999999999999999999999888876553
No 70
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=43.22 E-value=24 Score=20.34 Aligned_cols=20 Identities=25% Similarity=0.473 Sum_probs=16.1
Q ss_pred HHHHHhcccCCHHHHHHHHH
Q psy2692 80 QQCYQLTTSGKFVDAIEKLR 99 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr 99 (133)
--||.+-..||+++|++.|.
T Consensus 6 ~~a~~~y~~~ky~~A~~~~~ 25 (36)
T PF07720_consen 6 GLAYNFYQKGKYDEAIHFFQ 25 (36)
T ss_dssp HHHHHHHHTT-HHHHHHHHH
T ss_pred HHHHHHHHHhhHHHHHHHHH
Confidence 45888999999999999944
No 71
>KOG3807|consensus
Probab=42.99 E-value=24 Score=31.27 Aligned_cols=37 Identities=19% Similarity=0.468 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccC
Q psy2692 74 DLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT 112 (133)
Q Consensus 74 ~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~ 112 (133)
.+..+|.-+- -+.||..||+..||++..-.|++.+-|
T Consensus 276 YIKRRLAMCA--RklGrlrEA~K~~RDL~ke~pl~t~ln 312 (556)
T KOG3807|consen 276 YIKRRLAMCA--RKLGRLREAVKIMRDLMKEFPLLTMLN 312 (556)
T ss_pred HHHHHHHHHH--HHhhhHHHHHHHHHHHhhhccHHHHHH
Confidence 3455665554 467999999999999999999987766
No 72
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=42.72 E-value=59 Score=22.08 Aligned_cols=32 Identities=19% Similarity=0.313 Sum_probs=25.3
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.......|+.+...|++++|+..|+.++..-|
T Consensus 17 ~~~~~~~a~~~~~~~~~~~A~~~~~~~~~~~p 48 (135)
T TIGR02552 17 LEQIYALAYNLYQQGRYDEALKLFQLLAAYDP 48 (135)
T ss_pred HHHHHHHHHHHHHcccHHHHHHHHHHHHHhCC
Confidence 44467788888888999999999988877544
No 73
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=41.04 E-value=38 Score=28.40 Aligned_cols=29 Identities=10% Similarity=0.077 Sum_probs=22.5
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.+++|-+++..|+|.+|+..|+..|..-|
T Consensus 5 l~~~a~~a~~~~~~~~Ai~~~~~Al~~~P 33 (356)
T PLN03088 5 LEDKAKEAFVDDDFALAVDLYTQAIDLDP 33 (356)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 45678888888888888888888887655
No 74
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=40.01 E-value=30 Score=17.46 Aligned_cols=19 Identities=21% Similarity=0.361 Sum_probs=15.4
Q ss_pred hcccCCHHHHHHHHHHHHH
Q psy2692 85 LTTSGKFVDAIEKLRNVLI 103 (133)
Q Consensus 85 ~~t~gKF~eAl~~Fr~iL~ 103 (133)
+...|++++|.+.|+.+..
T Consensus 10 ~~~~~~~~~a~~~~~~M~~ 28 (31)
T PF01535_consen 10 YCKMGQFEEALEVFDEMRE 28 (31)
T ss_pred HHccchHHHHHHHHHHHhH
Confidence 3568999999999998753
No 75
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=39.66 E-value=35 Score=24.62 Aligned_cols=28 Identities=14% Similarity=0.290 Sum_probs=23.8
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
+.|+.++..|++++|+..|+..+..-|-
T Consensus 29 ~~g~~~~~~g~~~~A~~~~~~al~~~P~ 56 (144)
T PRK15359 29 ASGYASWQEGDYSRAVIDFSWLVMAQPW 56 (144)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCC
Confidence 5688999999999999999998877663
No 76
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=39.64 E-value=1.2e+02 Score=20.62 Aligned_cols=34 Identities=21% Similarity=0.247 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
+++..++.-|-.++-.|++++|++.+-.++..-+
T Consensus 20 ~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr 53 (90)
T PF14561_consen 20 DDLDARYALADALLAAGDYEEALDQLLELVRRDR 53 (90)
T ss_dssp T-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-T
T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCc
Confidence 4566678889999999999999999998887643
No 77
>cd05492 Bromo_ZMYND11 Bromodomain; ZMYND11_like sub-family. ZMYND11 or BS69 is a ubiquitously expressed nuclear protein that has been shown to associate with chromatin. It interacts with chromatin remodeling factors and might play a role in chromatin remodeling and gene expression. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=39.29 E-value=1.3e+02 Score=21.43 Aligned_cols=56 Identities=9% Similarity=0.023 Sum_probs=47.0
Q ss_pred CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692 69 DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKE 128 (133)
Q Consensus 69 ~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE 128 (133)
|..++.+..+|+.| + =..++|-..-++-|.+.....--.+..+...+++++.-|++
T Consensus 46 PmDL~tI~~kl~~~-~---Y~s~~ef~~Dv~LI~~N~~~yNg~~s~~~~~A~~l~~d~~~ 101 (109)
T cd05492 46 HLDVADIQEKINSE-K---YTSLEEFKADALLLLHNTAIFHGADSEQYDAARWLYRDTCH 101 (109)
T ss_pred CCcHHHHHHHHHcC-C---CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH
Confidence 88999999999887 2 33477778888888899888888888999999999999975
No 78
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=38.67 E-value=50 Score=24.91 Aligned_cols=34 Identities=18% Similarity=0.230 Sum_probs=28.2
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT 111 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~ 111 (133)
..++|.+++..|++.+|++.|..|.-..|+---.
T Consensus 13 ly~~a~~~l~~~~Y~~A~~~le~L~~ryP~g~ya 46 (142)
T PF13512_consen 13 LYQEAQEALQKGNYEEAIKQLEALDTRYPFGEYA 46 (142)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHhcCCCCccc
Confidence 4588888899999999999999999888886443
No 79
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=38.08 E-value=1.5e+02 Score=25.93 Aligned_cols=20 Identities=15% Similarity=0.096 Sum_probs=15.5
Q ss_pred HHHhccCHHHHHHHHHhhcC
Q psy2692 10 SINKSSSFETAFKLLHDQIG 29 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnrQiG 29 (133)
.+...|+|+.|+..+++-+.
T Consensus 65 ~~~~~g~~~~A~~~~~~~~~ 84 (899)
T TIGR02917 65 IYLALGDYAAAEKELRKALS 84 (899)
T ss_pred HHHHcCCHHHHHHHHHHHHH
Confidence 46678889999988887654
No 80
>PF05347 Complex1_LYR: Complex 1 protein (LYR family); InterPro: IPR008011 This family of short proteins includes proteins from the NADH-ubiquinone oxidoreductase complex I. The family includes the B14 subunit from bovine NADH-ubiquinone oxidoreductase B14 subunit Q02366 from SWISSPROT, and the B22 subunit from the human enzyme Q9Y6M9 from SWISSPROT. The family has been named LYR after a highly conserved tripeptide motif close to the N terminus of these proteins. Members of this family also found in yeast which do contain this complex. In these organisms they are believed to be be required for iron-sulphur custer biogenesis.
Probab=37.57 E-value=50 Score=20.04 Aligned_cols=16 Identities=13% Similarity=0.298 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHhh
Q psy2692 115 EIEEAKQIINICKEYI 130 (133)
Q Consensus 115 e~~e~~eli~icrEYi 130 (133)
+.+++.+++..|++|+
T Consensus 42 d~~~I~~~l~~g~~~l 57 (59)
T PF05347_consen 42 DPEKIEELLKKGEEEL 57 (59)
T ss_pred CHHHHHHHHHHHHHHh
Confidence 4556677777777764
No 81
>KOG0545|consensus
Probab=36.50 E-value=32 Score=29.21 Aligned_cols=33 Identities=27% Similarity=0.452 Sum_probs=27.4
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccccc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVV 110 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv 110 (133)
.=++|-++|+.|+|.||...+|+.+-++--+..
T Consensus 181 l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~l 213 (329)
T KOG0545|consen 181 LHQEGNRLFKLGRYKEASSKYREAIICLRNLQL 213 (329)
T ss_pred HHHhhhhhhhhccHHHHHHHHHHHHHHHHHHHh
Confidence 347899999999999999999998877654443
No 82
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=35.03 E-value=67 Score=24.24 Aligned_cols=32 Identities=19% Similarity=0.342 Sum_probs=28.2
Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692 76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
..+|..||-.+..|++++|+..+.+.|..=|-
T Consensus 48 qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~ 79 (142)
T PF13512_consen 48 QAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPT 79 (142)
T ss_pred HHHHHHHHHHHHccCHHHHHHHHHHHHHhCCC
Confidence 45899999999999999999999998877663
No 83
>KOG4234|consensus
Probab=34.84 E-value=44 Score=27.62 Aligned_cols=31 Identities=19% Similarity=0.284 Sum_probs=27.9
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHHcccccc
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLICIPLLVV 110 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv 110 (133)
++|-++|..|.|.+|.+.|...|-++|-+..
T Consensus 100 ~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~ 130 (271)
T KOG4234|consen 100 KEGNELFKNGDYEEANSKYQEALESCPSTST 130 (271)
T ss_pred HHHHHhhhcccHHHHHHHHHHHHHhCccccH
Confidence 5699999999999999999999999987643
No 84
>KOG3677|consensus
Probab=34.80 E-value=49 Score=29.84 Aligned_cols=49 Identities=22% Similarity=0.361 Sum_probs=35.7
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHcccc----------cccChHHHHHHHHHHHHHH
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIPLL----------VVTTKKEIEEAKQIINICK 127 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~----------vv~~~~e~~e~~eli~icr 127 (133)
-+.||...--|+|.||+..|-+||..|-=+ .-...++.+++.+++.||-
T Consensus 276 Y~VGFayLmmrryadai~~F~niLlyIqrtks~~~~~~y~~d~inKq~eqm~~llai~l 334 (525)
T KOG3677|consen 276 YQVGFAYLMMRRYADAIRVFLNILLYIQRTKSMFSRTTYQYDMINKQNEQMHHLLAICL 334 (525)
T ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhHhhhhhhHHHHHHHHHHHH
Confidence 456788888899999999999999875211 1113566777888888884
No 85
>cd03594 CLECT_REG-1_like C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates
Probab=33.74 E-value=37 Score=23.21 Aligned_cols=43 Identities=16% Similarity=0.130 Sum_probs=33.1
Q ss_pred HHHhc-ccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 82 CYQLT-TSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 82 gyk~~-t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
+|+++ +...+.||...+++.-...-|++++|.+|.+-+..++.
T Consensus 12 Cy~~~~~~~tw~~A~~~C~~~~~g~~La~i~s~~e~~~l~~~~~ 55 (129)
T cd03594 12 CYGYFRQPLSWSDAELFCQKYGPGAHLASIHSPAEAAAIASLIS 55 (129)
T ss_pred eeeEeccCcCHHHHHHHHHhcCCCceEcccCCHHHHHHHHHHHH
Confidence 56555 46789999999988632466888899999888887764
No 86
>COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair]
Probab=33.04 E-value=1.1e+02 Score=24.58 Aligned_cols=55 Identities=25% Similarity=0.335 Sum_probs=40.6
Q ss_pred CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccC-------hHHHHHHHHHHHHHHHh
Q psy2692 69 DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT-------KKEIEEAKQIINICKEY 129 (133)
Q Consensus 69 ~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~-------~~e~~e~~eli~icrEY 129 (133)
-++..++..+|+.++.. |..+ ..+...+..+++++.|+ +.+.+.+-|+|.-.+++
T Consensus 138 f~~~~el~~~Lk~~~~~---~~~~---~~l~~~l~~~dlLIiDDlG~~~~~~~~~~~~~q~I~~r~~~ 199 (254)
T COG1484 138 FITAPDLLSKLKAAFDE---GRLE---EKLLRELKKVDLLIIDDIGYEPFSQEEADLLFQLISRRYES 199 (254)
T ss_pred EEEHHHHHHHHHHHHhc---CchH---HHHHHHhhcCCEEEEecccCccCCHHHHHHHHHHHHHHHhh
Confidence 55788899999999987 4333 34556677889999984 55667888888777664
No 87
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=33.01 E-value=88 Score=22.93 Aligned_cols=31 Identities=29% Similarity=0.443 Sum_probs=25.9
Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.+.++.|=.+.+.|+..+|..+|-+.|...|
T Consensus 64 l~qV~lGE~L~~~G~~~~aa~hf~nAl~V~~ 94 (121)
T PF02064_consen 64 LQQVQLGEQLLAQGDYEEAAEHFYNALKVCP 94 (121)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHTSS
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHHHHHHhCC
Confidence 4468889999999999999999999887655
No 88
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=32.55 E-value=88 Score=24.12 Aligned_cols=42 Identities=21% Similarity=0.347 Sum_probs=32.7
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHH
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICK 127 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icr 127 (133)
.+=+|.-.+..|.+.||+..||++-..-|-... ++.++..|-
T Consensus 47 ~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~--------~kALlA~CL 88 (160)
T PF09613_consen 47 DLFDGWLHIVRGDWDDALRLLRELEERAPGFPY--------AKALLALCL 88 (160)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHhccCCCChH--------HHHHHHHHH
Confidence 345688999999999999999999777765543 466777774
No 89
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=31.34 E-value=79 Score=25.72 Aligned_cols=33 Identities=18% Similarity=0.136 Sum_probs=25.7
Q ss_pred HHHHHHHHHh-cccCCHHHHHHHHHHHHHHcccc
Q psy2692 76 IAKIQQCYQL-TTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 76 ~~~Lk~gyk~-~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
......|+.+ +..|++.+|+..|+.++...|=-
T Consensus 143 ~~~Y~~A~~l~~~~~~y~~Ai~af~~fl~~yP~s 176 (263)
T PRK10803 143 NTDYNAAIALVQDKSRQDDAIVAFQNFVKKYPDS 176 (263)
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCC
Confidence 3357778876 56789999999999999888753
No 90
>KOG2376|consensus
Probab=31.18 E-value=78 Score=29.57 Aligned_cols=50 Identities=22% Similarity=0.211 Sum_probs=37.5
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHH
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICK 127 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icr 127 (133)
-.+.|..+.+.||..+|++..+..+...-=.--++..++++.++=|.+||
T Consensus 178 ~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~Ir 227 (652)
T KOG2376|consen 178 LYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIR 227 (652)
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHH
Confidence 45889999999999999999999876654433344444666777777776
No 91
>PF12335 SBF2: Myotubularin protein ; InterPro: IPR022096 This domain family is found in eukaryotes, and is approximately 220 amino acids in length. The family is found in association with PF02141 from PFAM, PF03456 from PFAM, PF03455 from PFAM. This family is the middle region of SBF2, a member of the myotubularin family. Myotubularin-related proteins have been suggested to work in phosphoinositide-mediated signalling events that may also convey control of myelination. Mutations of SBF2 are implicated in Charcot-Marie-Tooth disease.
Probab=30.84 E-value=1e+02 Score=24.94 Aligned_cols=52 Identities=19% Similarity=0.323 Sum_probs=37.5
Q ss_pred CCCCCccccCCCCC--CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 55 PLLAPVTAFPTRKV--DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 55 p~lp~l~~~~~rn~--~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
|..||+...+.++. ..+-+...+-|+.+..+.=.||..||-+.|-.++..+-
T Consensus 1 ~~~p~~~~~~~~~~~~~~~s~rrlevlr~ci~~if~~k~~e~~k~~~av~~~lk 54 (225)
T PF12335_consen 1 PMGPPVVSIVDKNGNVFANSARRLEVLRNCISFIFDNKILEARKSLPAVLRALK 54 (225)
T ss_pred CCCCCchhHHhcccchhhhHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHc
Confidence 55677766666653 22344566778999999999999999999877666543
No 92
>KOG4281|consensus
Probab=30.83 E-value=73 Score=26.13 Aligned_cols=45 Identities=22% Similarity=0.312 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHH
Q psy2692 73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIE 117 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~ 117 (133)
..|.+-.|+.|+-..-|-|+|+++..|++|-.+-=-.|.-++|..
T Consensus 5 q~l~ntck~~fs~~~pg~~~~nlqkLr~vLd~lkp~Dvgl~~~~~ 49 (236)
T KOG4281|consen 5 QRLFNTCKEVFSTGGPGPFEENLQKLRSVLDELKPEDVGLEPEMQ 49 (236)
T ss_pred HHHHHHHHHHhcCCCCCccHHHHHHHHHHHhhCCHhhcCcChhhh
Confidence 457778899998888889999999999999988776666555543
No 93
>KOG0553|consensus
Probab=30.82 E-value=69 Score=27.27 Aligned_cols=38 Identities=16% Similarity=0.250 Sum_probs=32.4
Q ss_pred HHHHHH-HHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692 74 DLIAKI-QQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT 111 (133)
Q Consensus 74 ~L~~~L-k~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~ 111 (133)
.+.++| .+|=++...++|++|+..+-..|..-|--.|-
T Consensus 79 ~~AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVy 117 (304)
T KOG0553|consen 79 ALAESLKNEGNKLMKNKDYQEAVDKYTEAIELDPTNAVY 117 (304)
T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCCcchH
Confidence 466677 46999999999999999999999988876654
No 94
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=30.55 E-value=85 Score=23.89 Aligned_cols=31 Identities=16% Similarity=0.313 Sum_probs=25.8
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
.+..|+-.+..|++.+|+..|.+.+..-|=.
T Consensus 45 ~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~ 75 (203)
T PF13525_consen 45 QLMLAYAYYKQGDYEEAIAAYERFIKLYPNS 75 (203)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHCCCC
Confidence 6788999999999999999999999988864
No 95
>KOG0739|consensus
Probab=30.26 E-value=1.3e+02 Score=26.41 Aligned_cols=58 Identities=16% Similarity=0.197 Sum_probs=42.1
Q ss_pred hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHH-HHHHHHHhh
Q psy2692 72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQ-IINICKEYI 130 (133)
Q Consensus 72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~e-li~icrEYi 130 (133)
+..-++..++|-..=+++..+||+..|++.|....... ..+...+.+++ +-..|.||+
T Consensus 7 l~kaI~lv~kA~~eD~a~nY~eA~~lY~~aleYF~~~l-KYE~~~~kaKd~IraK~~EYL 65 (439)
T KOG0739|consen 7 LQKAIDLVKKAIDEDNAKNYEEALRLYQNALEYFLHAL-KYEANNKKAKDSIRAKFTEYL 65 (439)
T ss_pred HHHHHHHHHHHhhhcchhchHHHHHHHHHHHHHHHHHH-HhhhcChhHHHHHHHHHHHHH
Confidence 44456677888899999999999999999987766543 33434445566 446788886
No 96
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=29.74 E-value=46 Score=21.07 Aligned_cols=35 Identities=14% Similarity=0.109 Sum_probs=28.6
Q ss_pred cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
...|++|.+.+++.= .-|+.+++.+|.+-+.+.+.
T Consensus 9 ~~~~~~A~~~C~~~~--~~L~~~~~~~e~~~i~~~~~ 43 (116)
T cd00037 9 KLTWEEAQEYCRSLG--GHLASIHSEEENDFLASLLK 43 (116)
T ss_pred ccCHHHHHHHHHHcC--CEEcccCCHHHHHHHHHHHh
Confidence 678999999999854 67888899888888877664
No 97
>COG4989 Predicted oxidoreductase [General function prediction only]
Probab=29.40 E-value=97 Score=26.23 Aligned_cols=47 Identities=32% Similarity=0.458 Sum_probs=34.0
Q ss_pred ccCHHHHHHHHHhh-----cCcccccccHHHHHHHHhhhhhhccCCCCCCCcc
Q psy2692 14 SSSFETAFKLLHDQ-----IGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVT 61 (133)
Q Consensus 14 AGsFe~A~~LLnrQ-----iGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~ 61 (133)
+..--.|++.|+++ +|+-||.|-.=.+++.|..-....-++- +.|+.
T Consensus 135 ~eeVAeAf~~L~~sGKVr~fGVSNf~p~Q~~LL~s~l~~~LvtNQle-lS~~~ 186 (298)
T COG4989 135 AEEVAEAFTHLHKSGKVRHFGVSNFNPAQFELLQSRLPFTLVTNQLE-LSPLH 186 (298)
T ss_pred HHHHHHHHHHHHhcCCeeeeecCCCCHHHHHHHHHhccchhhhccee-ecccc
Confidence 34445788999865 8999999999999998887765544443 44443
No 98
>cd05508 Bromo_RACK7 Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=29.32 E-value=1.9e+02 Score=20.02 Aligned_cols=56 Identities=16% Similarity=0.151 Sum_probs=44.0
Q ss_pred CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692 69 DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKE 128 (133)
Q Consensus 69 ~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE 128 (133)
|..++.+..+|+.|. =...+|-..-++-|........-++..-...++++.+.|+.
T Consensus 42 PmDL~tI~~kl~~~~----Y~s~~ef~~Dv~LI~~Na~~YN~~~s~i~~~A~~l~~~~~~ 97 (99)
T cd05508 42 PMDLSTLEKNVRKKA----YGSTDAFLADAKWILHNAIIYNGGDHKLTQAAKAIVKICEQ 97 (99)
T ss_pred CCCHHHHHHHHhcCC----CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHh
Confidence 889999999997664 23466667777888888888777777888888999999874
No 99
>PF07304 SRA1: Steroid receptor RNA activator (SRA1); InterPro: IPR009917 This entry consists of several hypothetical mammalian steroid receptor RNA activator proteins. The SRA-RNAs encode stable proteins that are widely expressed and upregulated in breast cancer cell lines. SRA-RNA is a steroid receptor co-activator which acts as a functional RNA. This domain is also found at the C terminus of Sec31, a component of the coat protein complex II (COPII, which promotes the formation of transport vesicles from the endoplasmic reticulum (ER). COPII has two main functions, the physical deformation of the endoplasmic reticulum membrane into vesicles and the selection of cargo molecules. ; PDB: 2YRU_A.
Probab=29.32 E-value=2.4e+02 Score=21.26 Aligned_cols=47 Identities=17% Similarity=0.237 Sum_probs=34.5
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHH----HHHHHHHHHHH
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIE----EAKQIINICKE 128 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~----e~~eli~icrE 128 (133)
+.++|.+=.++..+++|+.|.++.++|+- +.-+|.. -+|+||.++|.
T Consensus 90 v~~~L~~L~~aL~~~d~~~A~~Ih~~L~t-------~h~~E~~~WmvGVKRLI~~~r~ 140 (157)
T PF07304_consen 90 VVDKLHQLAQALQARDYDAADEIHVDLMT-------DHVDECGNWMVGVKRLIAMARN 140 (157)
T ss_dssp HHHHHHHHHHHHHHT-HHHHHHHHHHHHH-------SSHHHHTTTHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHh-------ccHHHhhhHHHHHHHHHHHHHh
Confidence 55677788889999999999999988763 4344443 47888888875
No 100
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=28.59 E-value=83 Score=27.65 Aligned_cols=50 Identities=18% Similarity=0.294 Sum_probs=36.5
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHccccc--ccC--------hHHHHHHHHHHHHHH
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLV--VTT--------KKEIEEAKQIINICK 127 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~v--v~~--------~~e~~e~~eli~icr 127 (133)
--..||..+=-+++.||+..|.+||..+-=.- -.+ .+-.+++-.++.||-
T Consensus 167 ~YyvGFaylMlrRY~DAir~f~~iL~yi~r~k~~~~~~~~q~d~i~K~~eqMyaLlAic~ 226 (404)
T PF10255_consen 167 YYYVGFAYLMLRRYADAIRTFSQILLYIQRTKNQYHQRSYQYDQINKKNEQMYALLAICL 226 (404)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccchhhHHHhHHHHHHHHHHHHH
Confidence 34678999999999999999999998765443 000 145566777888874
No 101
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=28.59 E-value=3.3e+02 Score=25.22 Aligned_cols=99 Identities=15% Similarity=0.050 Sum_probs=50.1
Q ss_pred hHHHHhccCHHHHHHHHHhhcCccccc------ccHHHHH-----HHHhhhhhhccCCCCC-CC---ccccCCCCCCCCh
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVKNFL------PYRQMFL-----NSYLQSRTAYSTLPLL-AP---VTAFPTRKVDRSS 72 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGivNf~------PLk~~F~-----~~y~~s~~~~~~~p~l-p~---l~~~~~rn~~~~l 72 (133)
+..+...|+++.|.+.+++=+-.-... ....++. +-|..+...+..+... |+ +...+.++ | +-
T Consensus 279 a~~yl~~g~~e~A~~~l~~~l~~~p~~~~~~~~~~~~L~~a~~~~g~~~eA~~~l~~~~~~~P~~~~~~~~~~~~-p-~~ 356 (765)
T PRK10049 279 ASAYLKLHQPEKAQSILTELFYHPETIADLSDEELADLFYSLLESENYPGALTVTAHTINNSPPFLRLYGSPTSI-P-ND 356 (765)
T ss_pred HHHHHhcCCcHHHHHHHHHHhhcCCCCCCCChHHHHHHHHHHHhcccHHHHHHHHHHHhhcCCceEeecCCCCCC-C-Cc
Confidence 346788999999999998744322111 1122222 1122233333322222 21 11111111 1 11
Q ss_pred HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
..+...+..|.-+...|++++|+..|+.++...|--
T Consensus 357 ~~~~a~~~~a~~l~~~g~~~eA~~~l~~al~~~P~n 392 (765)
T PRK10049 357 DWLQGQSLLSQVAKYSNDLPQAEMRARELAYNAPGN 392 (765)
T ss_pred hHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCC
Confidence 123334455566667788888888888888776653
No 102
>cd06131 DNA_pol_III_epsilon_Ecoli_like DEDDh 3'-5' exonuclease domain of the epsilon subunit of Escherichia coli DNA polymerase III and similar proteins. This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (Pol III) and similar proteins. Pol III is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The Pol III epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon
Probab=28.38 E-value=48 Score=24.04 Aligned_cols=28 Identities=21% Similarity=0.296 Sum_probs=24.7
Q ss_pred ccccCChHHHHhccCHHHHHHHHHhhcC
Q psy2692 2 NLHKVTSESINKSSSFETAFKLLHDQIG 29 (133)
Q Consensus 2 ~~~~~~~~~~~AAGsFe~A~~LLnrQiG 29 (133)
.+|.+|++.+..++.|+.+++-+...+|
T Consensus 53 ~ihGIt~e~l~~~~~~~~v~~~l~~~l~ 80 (167)
T cd06131 53 KVHGITDEFLADKPKFAEIADEFLDFIR 80 (167)
T ss_pred HHhCCCHHHHhcCCCHHHHHHHHHHHHC
Confidence 4799999999999999999998887765
No 103
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=28.27 E-value=58 Score=28.96 Aligned_cols=28 Identities=18% Similarity=0.297 Sum_probs=25.5
Q ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 79 IQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
...|..++..|+|++|+..|...|...|
T Consensus 131 k~~G~~~~~~~~~~~Ai~~y~~al~~~p 158 (615)
T TIGR00990 131 KEKGNKAYRNKDFNKAIKLYSKAIECKP 158 (615)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHhcCC
Confidence 4679999999999999999999998777
No 104
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=27.59 E-value=2.1e+02 Score=22.97 Aligned_cols=20 Identities=15% Similarity=0.172 Sum_probs=13.4
Q ss_pred HHHhccCHHHHHHHHHhhcC
Q psy2692 10 SINKSSSFETAFKLLHDQIG 29 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnrQiG 29 (133)
.+...|+|+.|++.+++-+.
T Consensus 116 ~~~~~g~~~~A~~~~~~~l~ 135 (389)
T PRK11788 116 DYLKAGLLDRAEELFLQLVD 135 (389)
T ss_pred HHHHCCCHHHHHHHHHHHHc
Confidence 34566777777777777654
No 105
>PF14769 CLAMP: Flagellar C1a complex subunit C1a-32
Probab=27.46 E-value=1.6e+02 Score=20.28 Aligned_cols=21 Identities=10% Similarity=0.241 Sum_probs=18.2
Q ss_pred ccCCHHHHHHHHHHHHHHccc
Q psy2692 87 TSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 87 t~gKF~eAl~~Fr~iL~~i~l 107 (133)
....++|+++.|+++|...-.
T Consensus 34 ~~~~~~~~~~~fk~~l~~~sv 54 (101)
T PF14769_consen 34 KGMSLEDSFKYFKELLLRHSV 54 (101)
T ss_pred ccCCHHHHHHHHHHHHHHhcc
Confidence 889999999999999966544
No 106
>PF09670 Cas_Cas02710: CRISPR-associated protein (Cas_Cas02710)
Probab=27.42 E-value=1.8e+02 Score=24.88 Aligned_cols=45 Identities=16% Similarity=0.301 Sum_probs=33.8
Q ss_pred HHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHh
Q psy2692 77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEY 129 (133)
Q Consensus 77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEY 129 (133)
..++.+-.++..+.|..|...|..+...+ +.+++ .+-+-.+|+=|
T Consensus 133 ~~~~~a~~l~n~~~y~aA~~~l~~l~~rl-----~~~~~---~~~~~~l~~~y 177 (379)
T PF09670_consen 133 REWRRAKELFNRYDYGAAARILEELLRRL-----PGREE---YQRYKDLCEGY 177 (379)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHhC-----Cchhh---HHHHHHHHHHH
Confidence 36789999999999999999999999872 33333 44566666654
No 107
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=27.26 E-value=71 Score=18.66 Aligned_cols=21 Identities=24% Similarity=0.216 Sum_probs=16.8
Q ss_pred HhcccCCHHHHHHHHHHHHHH
Q psy2692 84 QLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 84 k~~t~gKF~eAl~~Fr~iL~~ 104 (133)
=....++|++|+.-|++.|-.
T Consensus 10 isle~e~f~qA~~D~~~aL~i 30 (38)
T PF10516_consen 10 ISLENENFEQAIEDYEKALEI 30 (38)
T ss_pred HHHHhccHHHHHHHHHHHHHH
Confidence 356788999999999988753
No 108
>COG4063 MtrA Tetrahydromethanopterin S-methyltransferase, subunit A [Coenzyme metabolism]
Probab=27.15 E-value=77 Score=25.75 Aligned_cols=38 Identities=18% Similarity=0.277 Sum_probs=25.9
Q ss_pred cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692 88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKE 128 (133)
Q Consensus 88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE 128 (133)
.|-=.||++.||+.+-.+-++.++|.++ ++..++-|++
T Consensus 115 ENi~~eaveRfqqqvelvdlid~eD~~~---I~~~v~ecv~ 152 (238)
T COG4063 115 ENIPDEAVERFQQQVELVDLIDVEDPDE---ITAKVEECVE 152 (238)
T ss_pred hcCCHHHHHHHHHHeeeehhcccCCHHH---HHHHHHHHHh
Confidence 3445799999998887777777766444 4455555554
No 109
>KOG0543|consensus
Probab=26.28 E-value=2.1e+02 Score=25.33 Aligned_cols=51 Identities=20% Similarity=0.207 Sum_probs=41.4
Q ss_pred HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHh
Q psy2692 75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEY 129 (133)
Q Consensus 75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEY 129 (133)
.....+.|--+.+.|.+++|...|+.++..-|- |++-..|+..+-...|+|
T Consensus 291 ~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~----Nka~~~el~~l~~k~~~~ 341 (397)
T KOG0543|consen 291 VKALYRRGQALLALGEYDLARDDFQKALKLEPS----NKAARAELIKLKQKIREY 341 (397)
T ss_pred hhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCC----cHHHHHHHHHHHHHHHHH
Confidence 344567888999999999999999999998884 466667777777777776
No 110
>TIGR02878 spore_ypjB sporulation protein YpjB. Members of this protein, YpjB, family are restricted to a subset of endospore-forming bacteria, including Bacillus species but not CLostridium or some others. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon, where sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect. This protein family is not, however, a part of the endospore formation minimal gene set.
Probab=26.23 E-value=79 Score=25.96 Aligned_cols=32 Identities=22% Similarity=0.169 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692 73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~ 104 (133)
..|.+...++.++++.|+|+||.......=-.
T Consensus 3 ~eL~~lsD~~LQl~K~~~yeeAkqvL~yFs~~ 34 (233)
T TIGR02878 3 EELNDLSDQVLQLTKQARYEEAKQVLEYFSKQ 34 (233)
T ss_pred hhHhhhhHHHHHHHHhhhHHHHHHHHHHHHHH
Confidence 45666678899999999999998765544333
No 111
>cd03602 CLECT_1 C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_1: C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in differe
Probab=26.17 E-value=1.3e+02 Score=20.00 Aligned_cols=36 Identities=14% Similarity=0.132 Sum_probs=29.8
Q ss_pred ccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 87 TSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 87 t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
....+.||...++.. -.-|++++|++|.+.++.++.
T Consensus 8 ~~~~w~~A~~~C~~~--g~~La~i~s~~e~~~l~~~~~ 43 (108)
T cd03602 8 ESKTWSEAQQYCREN--YTDLATVQNQEDNALLSNLSR 43 (108)
T ss_pred cccCHHHHHHHHHHH--CCccCeecCHHHHHHHHHHHh
Confidence 467899999999985 567888899999998888774
No 112
>PHA02841 hypothetical protein; Provisional
Probab=26.12 E-value=35 Score=23.93 Aligned_cols=33 Identities=18% Similarity=0.411 Sum_probs=25.7
Q ss_pred HHHHHhhcCcccccccHHHHHHHHhhhhhh-ccCCC
Q psy2692 21 FKLLHDQIGVKNFLPYRQMFLNSYLQSRTA-YSTLP 55 (133)
Q Consensus 21 ~~LLnrQiGivNf~PLk~~F~~~y~~s~~~-~~~~p 55 (133)
.++.+|.+...|| +|.-|...|..|+-- +.++.
T Consensus 6 ir~IKRKI~sLNF--~KK~FKK~YS~ckeigvtsin 39 (103)
T PHA02841 6 VKSIKRKIYSLNF--FKKTFKKTYSPCKEIGVTSIN 39 (103)
T ss_pred HHHHHHHHHHHHH--HHHHHHHhhChHhHhCeeeec
Confidence 5788999999998 799999999888743 33333
No 113
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=26.06 E-value=1.6e+02 Score=23.38 Aligned_cols=32 Identities=9% Similarity=0.106 Sum_probs=28.2
Q ss_pred HHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
..+..|+-.+..|++++|+..|++.+..-|=-
T Consensus 71 a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~ 102 (243)
T PRK10866 71 VQLDLIYAYYKNADLPLAQAAIDRFIRLNPTH 102 (243)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCC
Confidence 36788999999999999999999999887753
No 114
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=26.06 E-value=3.5e+02 Score=21.73 Aligned_cols=21 Identities=14% Similarity=0.186 Sum_probs=15.8
Q ss_pred HHHhccCHHHHHHHHHhhcCc
Q psy2692 10 SINKSSSFETAFKLLHDQIGV 30 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnrQiGi 30 (133)
.+...|+++.|...+.+-+..
T Consensus 44 ~~~~~~~~~~A~~~~~~al~~ 64 (389)
T PRK11788 44 NFLLNEQPDKAIDLFIEMLKV 64 (389)
T ss_pred HHHhcCChHHHHHHHHHHHhc
Confidence 445678888888888887765
No 115
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=25.73 E-value=98 Score=22.39 Aligned_cols=29 Identities=7% Similarity=0.166 Sum_probs=24.4
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.+..|.-....|++++|+..|+..+...+
T Consensus 38 ~~~~g~~~~~~g~~~~A~~~~~~al~l~~ 66 (168)
T CHL00033 38 YYRDGMSAQSEGEYAEALQNYYEAMRLEI 66 (168)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHhccc
Confidence 56778888889999999999999987643
No 116
>PRK09637 RNA polymerase sigma factor SigZ; Provisional
Probab=25.60 E-value=84 Score=23.41 Aligned_cols=32 Identities=13% Similarity=0.162 Sum_probs=21.9
Q ss_pred CHHHHHHHHHHHHHHcccccccChHHHHHHHH
Q psy2692 90 KFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQ 121 (133)
Q Consensus 90 KF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~e 121 (133)
||++..+.|...++.+....+.++++.+++-|
T Consensus 2 ~~~~l~~~y~~~l~~~~~~~~~~~~~aeDi~Q 33 (181)
T PRK09637 2 KLESIWSEYKAQLKAFLHSRVSNEADVDDLLQ 33 (181)
T ss_pred cHHHHHHHHHHHHHHHHHHHhCChhhHHHHHH
Confidence 56777777777777777766666666666544
No 117
>PRK10564 maltose regulon periplasmic protein; Provisional
Probab=24.70 E-value=65 Score=27.40 Aligned_cols=22 Identities=27% Similarity=0.517 Sum_probs=17.8
Q ss_pred HHHHhccCHHHHHHHHHh--hcCc
Q psy2692 9 ESINKSSSFETAFKLLHD--QIGV 30 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnr--QiGi 30 (133)
.+.|++||.+.||+||++ .+|+
T Consensus 265 ~~AVk~gDi~KAL~LldEAe~LG~ 288 (303)
T PRK10564 265 KQAVKKGDVDKALKLLDEAERLGS 288 (303)
T ss_pred HHHHHcCCHHHHHHHHHHHHHhCC
Confidence 356899999999999985 4554
No 118
>PRK09591 celC cellobiose phosphotransferase system IIA component; Reviewed
Probab=24.30 E-value=80 Score=22.48 Aligned_cols=30 Identities=13% Similarity=0.179 Sum_probs=23.1
Q ss_pred ChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHh
Q psy2692 7 TSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYL 45 (133)
Q Consensus 7 ~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~ 45 (133)
.+...+..|+|+.|-.+|.+- +..|.+.+.
T Consensus 26 eAl~~ak~gdf~~A~~~l~eA---------~~~l~~AH~ 55 (104)
T PRK09591 26 EAFAAMREGNFDLAEQKLNQS---------NEELLEAHH 55 (104)
T ss_pred HHHHHHHcCCHHHHHHHHHHH---------HHHHHHHHH
Confidence 356788999999999999886 555666553
No 119
>COG0329 DapA Dihydrodipicolinate synthase/N-acetylneuraminate lyase [Amino acid transport and metabolism / Cell envelope biogenesis, outer membrane]
Probab=24.24 E-value=3e+02 Score=22.67 Aligned_cols=81 Identities=17% Similarity=0.154 Sum_probs=50.7
Q ss_pred ccCHHHHHHHHHhhc--CcccccccHHHHHHHH-hhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccCC
Q psy2692 14 SSSFETAFKLLHDQI--GVKNFLPYRQMFLNSY-LQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSGK 90 (133)
Q Consensus 14 AGsFe~A~~LLnrQi--GivNf~PLk~~F~~~y-~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~gK 90 (133)
.|+++.-.++...-- ...-|...=..+.... .+++.++++..+.-| +...+=|+++.+|+
T Consensus 167 ~gd~~~~~~~~~~~~~~~f~v~~G~d~~~~~~~~~G~~G~is~~~N~~p-----------------~~~~~l~~~~~~g~ 229 (299)
T COG0329 167 SGDLDRLEEIIAALGDRDFIVLSGDDELALPALLLGADGVISVTANVAP-----------------ELAVELYRAAKAGD 229 (299)
T ss_pred CcCHHHHHHHHHhcCccCeeEEeCchHHHHHHHhCCCCeEEecccccCH-----------------HHHHHHHHHHHcCC
Confidence 455555544433322 2344445555555444 366666666665444 24566778888999
Q ss_pred HHHHHHHHHHHHHHccccccc
Q psy2692 91 FVDAIEKLRNVLICIPLLVVT 111 (133)
Q Consensus 91 F~eAl~~Fr~iL~~i~l~vv~ 111 (133)
+++|....+.++..+.++...
T Consensus 230 ~~~A~~l~~~l~~l~~~~~~~ 250 (299)
T COG0329 230 IEEARELQDRLLPLIRLLFRE 250 (299)
T ss_pred HHHHHHHHHHHHHHHHHHhcC
Confidence 999999999998877666554
No 120
>PF04048 Sec8_exocyst: Sec8 exocyst complex component specific domain; InterPro: IPR007191 Sec8 is a component of the exocyst complex involved in the docking of exocystic vesicles with a fusion site on the plasma membrane. The exocyst complex is composed of Sec3, Sec5, Sec6, Sec8, Sec10, Sec15, Exo70 and Exo84.; GO: 0006904 vesicle docking involved in exocytosis, 0015031 protein transport, 0000145 exocyst
Probab=24.12 E-value=1.4e+02 Score=21.82 Aligned_cols=55 Identities=20% Similarity=0.303 Sum_probs=39.9
Q ss_pred hHHHHHHHHHHHHhccc---CCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 72 SIDLIAKIQQCYQLTTS---GKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 72 l~~L~~~Lk~gyk~~t~---gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
+..+.+++..+.+.+-. ..|..++..|..|+..+- ++++.+.++++-+.-|+.=+
T Consensus 45 f~~~~~~~~~~L~~vV~eh~q~Fn~sI~sy~~i~~~i~----~sq~~i~~lK~~L~~ak~~L 102 (142)
T PF04048_consen 45 FEELKKRIEKALQEVVNEHYQGFNSSIGSYSQILSSIS----ESQERIRELKESLQEAKSLL 102 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH
Confidence 34444455555555543 357889999999888764 67899999999999998754
No 121
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=23.94 E-value=83 Score=22.08 Aligned_cols=29 Identities=17% Similarity=0.184 Sum_probs=22.5
Q ss_pred hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHh
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYL 45 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~ 45 (133)
+...+..|+|+.|-.+|++- +..|.+.+.
T Consensus 22 Al~~a~~g~fe~A~~~l~ea---------~~~l~~AH~ 50 (97)
T cd00215 22 ALKAAKEGDFAEAEELLEEA---------NDSLNEAHH 50 (97)
T ss_pred HHHHHHcCCHHHHHHHHHHH---------HHHHHHHHH
Confidence 56788999999999999886 555555553
No 122
>PF06580 His_kinase: Histidine kinase; InterPro: IPR010559 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Signal transducing histidine kinases are the key elements in two-component signal transduction systems, which control complex processes such as the initiation of development in microorganisms [, ]. Examples of histidine kinases are EnvZ, which plays a central role in osmoregulation [], and CheA, which plays a central role in the chemotaxis system []. Histidine kinases usually have an N-terminal ligand-binding domain and a C-terminal kinase domain, but other domains may also be present. The kinase domain is responsible for the autophosphorylation of the histidine with ATP, the phosphotransfer from the kinase to an aspartate of the response regulator, and (with bifunctional enzymes) the phosphotransfer from aspartyl phosphate back to ADP or to water []. The kinase core has a unique fold, distinct from that of the Ser/Thr/Tyr kinase superfamily. HKs can be roughly divided into two classes: orthodox and hybrid kinases [, ]. Most orthodox HKs, typified by the Escherichia coli EnvZ protein, function as periplasmic membrane receptors and have a signal peptide and transmembrane segment(s) that separate the protein into a periplasmic N-terminal sensing domain and a highly conserved cytoplasmic C-terminal kinase core. Members of this family, however, have an integral membrane sensor domain. Not all orthodox kinases are membrane bound, e.g., the nitrogen regulatory kinase NtrB (GlnL) is a soluble cytoplasmic HK []. Hybrid kinases contain multiple phosphodonor and phosphoacceptor sites and use multi-step phospho-relay schemes instead of promoting a single phosphoryl transfer. In addition to the sensor domain and kinase core, they contain a CheY-like receiver domain and a His-containing phosphotransfer (HPt) domain. This family represents a region within bacterial histidine kinase enzymes. Two-component signal transduction systems such as those mediated by histidine kinase are integral parts of bacterial cellular regulatory processes, and are used to regulate the expression of genes involved in virulence. Members of this family often contain IPR003594 from INTERPRO and/or IPR003660 from INTERPRO.; GO: 0000155 two-component sensor activity, 0000160 two-component signal transduction system (phosphorelay), 0016021 integral to membrane
Probab=23.88 E-value=2.2e+02 Score=18.74 Aligned_cols=54 Identities=17% Similarity=0.200 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHhcccC--CHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 73 IDLIAKIQQCYQLTTSG--KFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 73 ~~L~~~Lk~gyk~~t~g--KF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
..|.+-|+.-+.+...+ +-++++..|-+.++... +++++.-.+++=++.|+.|+
T Consensus 12 HFl~NtLn~I~~l~~~~~~~~~~~i~~ls~~lRy~l----~~~~~~v~l~~El~~i~~Yl 67 (82)
T PF06580_consen 12 HFLFNTLNSISWLARIDPEKASEMILSLSDLLRYSL----SSKEEFVTLEEELEFIENYL 67 (82)
T ss_pred HHHHHHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHh----CCCCCeeeHHHHHHHHHHHH
Confidence 34555555555554444 44556666666666655 33333333444445555543
No 123
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=23.44 E-value=2.4e+02 Score=23.69 Aligned_cols=89 Identities=15% Similarity=0.121 Sum_probs=54.4
Q ss_pred hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCC--CCCCccccCCCCCCCChHHHHHHHHHHHHh
Q psy2692 8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLP--LLAPVTAFPTRKVDRSSIDLIAKIQQCYQL 85 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p--~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~ 85 (133)
+..++..|+.+.|..+|.+.+. .+..| ....+|..... +-+ -+.-+..++.+. .++-.-.+-.|.=+
T Consensus 270 A~~l~~~g~~~~A~~~L~~~l~-~~~~~---~l~~l~~~l~~---~~~~~al~~~e~~lk~~----P~~~~l~l~lgrl~ 338 (398)
T PRK10747 270 AEHLIECDDHDTAQQIILDGLK-RQYDE---RLVLLIPRLKT---NNPEQLEKVLRQQIKQH----GDTPLLWSTLGQLL 338 (398)
T ss_pred HHHHHHCCCHHHHHHHHHHHHh-cCCCH---HHHHHHhhccC---CChHHHHHHHHHHHhhC----CCCHHHHHHHHHHH
Confidence 4568899999999999999987 45544 33333333211 111 011111112111 12223356668888
Q ss_pred cccCCHHHHHHHHHHHHHHccc
Q psy2692 86 TTSGKFVDAIEKLRNVLICIPL 107 (133)
Q Consensus 86 ~t~gKF~eAl~~Fr~iL~~i~l 107 (133)
...|++.+|.+.|+..+..-|-
T Consensus 339 ~~~~~~~~A~~~le~al~~~P~ 360 (398)
T PRK10747 339 MKHGEWQEASLAFRAALKQRPD 360 (398)
T ss_pred HHCCCHHHHHHHHHHHHhcCCC
Confidence 8999999999999999887554
No 124
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=23.29 E-value=1.1e+02 Score=22.30 Aligned_cols=28 Identities=14% Similarity=0.296 Sum_probs=22.6
Q ss_pred HHHHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692 77 AKIQQCYQLTTSGKFVDAIEKLRNVLIC 104 (133)
Q Consensus 77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~ 104 (133)
.....|..+...|++++|+..|+..+..
T Consensus 37 ~~~~lg~~~~~~g~~~~A~~~~~~al~~ 64 (172)
T PRK02603 37 VYYRDGMSAQADGEYAEALENYEEALKL 64 (172)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 3567788888889999999999988754
No 125
>cd06130 DNA_pol_III_epsilon_like an uncharacterized bacterial subgroup of the DEDDh 3'-5' exonuclease domain family with similarity to the epsilon subunit of DNA polymerase III. This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (Pol III), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The Pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The Pol III epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that ser
Probab=22.85 E-value=1.1e+02 Score=21.64 Aligned_cols=28 Identities=18% Similarity=0.477 Sum_probs=25.0
Q ss_pred ccccCChHHHHhccCHHHHHHHHHhhcC
Q psy2692 2 NLHKVTSESINKSSSFETAFKLLHDQIG 29 (133)
Q Consensus 2 ~~~~~~~~~~~AAGsFe~A~~LLnrQiG 29 (133)
++|.+|.+.+..++.|+.++.-+.+-++
T Consensus 49 ~i~GIt~e~l~~~~~~~~v~~~l~~~l~ 76 (156)
T cd06130 49 AIHGITPEDVADAPTFPEVWPEIKPFLG 76 (156)
T ss_pred cccCcCHHHHhcCCCHHHHHHHHHHHhC
Confidence 5899999999999999999998888766
No 126
>cd03591 CLECT_collectin_like C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, a
Probab=22.82 E-value=1.2e+02 Score=20.40 Aligned_cols=35 Identities=14% Similarity=0.062 Sum_probs=29.6
Q ss_pred cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
.-.+.+|...+++. -.-|++++|.+|.+.+.+++.
T Consensus 10 ~~~w~~A~~~C~~~--g~~La~i~s~~e~~~l~~~~~ 44 (114)
T cd03591 10 EKNFDDAQKLCSEA--GGTLAMPRNAAENAAIASYVK 44 (114)
T ss_pred eeCHHHHHHHHhhc--CCEEecCCCHHHHHHHHHHHh
Confidence 45799999999984 677888999999999988875
No 127
>COG5379 BtaA S-adenosylmethionine:diacylglycerol 3-amino-3-carboxypropyl transferase [Lipid metabolism]
Probab=22.69 E-value=1.5e+02 Score=25.91 Aligned_cols=43 Identities=19% Similarity=0.313 Sum_probs=38.0
Q ss_pred cccCChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhh
Q psy2692 3 LHKVTSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTA 50 (133)
Q Consensus 3 ~~~~~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~ 50 (133)
+|.|.+.+++++|+.+.--|.+++. |.|+...-.--+.+||-+
T Consensus 186 ~~Gvn~~~m~~~r~lgeQR~ff~~k-----~ap~f~r~vl~wlt~rks 228 (414)
T COG5379 186 LHGVNPQAMFKARDLGEQRQFFDRK-----LAPLFDRPVLRWLTSRKS 228 (414)
T ss_pred hhCCCHHHHHhhcchhHHHHHHHhh-----hhhhhhHHHHHHHhcccc
Confidence 7999999999999999999999886 789999888888887655
No 128
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=22.66 E-value=90 Score=21.97 Aligned_cols=20 Identities=20% Similarity=0.257 Sum_probs=17.5
Q ss_pred hHHHHhccCHHHHHHHHHhh
Q psy2692 8 SESINKSSSFETAFKLLHDQ 27 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQ 27 (133)
+...+..|+|+.|-++|+.-
T Consensus 24 Al~~a~~gdfe~A~~~l~eA 43 (99)
T TIGR00823 24 ALKAAKAGDFAKARALVEQA 43 (99)
T ss_pred HHHHHHcCCHHHHHHHHHHH
Confidence 56788999999999999876
No 129
>KOG0548|consensus
Probab=22.65 E-value=2e+02 Score=26.41 Aligned_cols=63 Identities=13% Similarity=0.163 Sum_probs=0.0
Q ss_pred HHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCC--------------------C
Q psy2692 10 SINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKV--------------------D 69 (133)
Q Consensus 10 ~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~--------------------~ 69 (133)
+++..|+++.|...++| .+..+|++ .
T Consensus 307 a~~k~~~~~~ai~~~~k----------------------------------aLte~Rt~~~ls~lk~~Ek~~k~~e~~a~ 352 (539)
T KOG0548|consen 307 AYTKREDYEGAIKYYQK----------------------------------ALTEHRTPDLLSKLKEAEKALKEAERKAY 352 (539)
T ss_pred hhhhHHhHHHHHHHHHH----------------------------------HhhhhcCHHHHHHHHHHHHHHHHHHHHHh
Q ss_pred CChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 70 RSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 70 ~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
++..-=.+.=..|+-+|+.|+|++|+..+-..|..-|
T Consensus 353 ~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P 389 (539)
T KOG0548|consen 353 INPEKAEEEREKGNEAFKKGDYPEAVKHYTEAIKRDP 389 (539)
T ss_pred hChhHHHHHHHHHHHHHhccCHHHHHHHHHHHHhcCC
No 130
>cd07957 Anticodon_Ia_Met Anticodon-binding domain of methionyl tRNA synthetases. This domain is found in methionyl tRNA synthetases (MetRS), which belong to the class Ia aminoacyl tRNA synthetases. It lies C-terminal to the catalytic core domain, and recognizes and specifically binds to the tRNA anticodon (CAU). MetRS catalyzes the transfer of methionine to the 3'-end of its tRNA.
Probab=22.46 E-value=2.5e+02 Score=18.81 Aligned_cols=57 Identities=11% Similarity=0.155 Sum_probs=32.9
Q ss_pred HHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccCh------H-HHHHHHHHHHHHHHhh
Q psy2692 74 DLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTK------K-EIEEAKQIINICKEYI 130 (133)
Q Consensus 74 ~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~------~-e~~e~~eli~icrEYi 130 (133)
.+.+.+++.-+.+...+|.+|+....+.+..+=-..-..+ + +.++.+..+..|.+.+
T Consensus 44 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~n~y~~~~kpw~~~~~~~~~~~~~~l~~~~~~l 107 (129)
T cd07957 44 EAEELLEEVAEAMEELEFRKALEEIMELARAANKYIDETAPWKLAKEEDPERLATVLYVLLELL 107 (129)
T ss_pred HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHhhhccCCCHHHHccCCHHHHHHHHHHHHHHH
Confidence 3444556666666788999999988877744322221110 1 2334555666666543
No 131
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=22.46 E-value=1.4e+02 Score=19.32 Aligned_cols=41 Identities=15% Similarity=0.236 Sum_probs=32.4
Q ss_pred HHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHH
Q psy2692 83 YQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINI 125 (133)
Q Consensus 83 yk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~i 125 (133)
|+.+. ...+.+|...+++. ..-|+.++|.+|.+.+.+++..
T Consensus 13 y~~~~~~~~~~~A~~~C~~~--~~~La~i~~~~e~~~i~~~~~~ 54 (126)
T smart00034 13 YKFSTEKKTWADAQAFCQSL--GAHLASIHSEAENDFVASLLKN 54 (126)
T ss_pred EEEECCccCHHHHHHHHHhc--CCEEcccCCHHHHHHHHHHHHh
Confidence 33443 47899999999998 5778888999999988887764
No 132
>KOG0547|consensus
Probab=22.34 E-value=78 Score=29.17 Aligned_cols=27 Identities=15% Similarity=0.228 Sum_probs=0.0
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
..|-+.|+.||+.+||..+...|-..|
T Consensus 120 ~~GN~~f~~kkY~eAIkyY~~AI~l~p 146 (606)
T KOG0547|consen 120 TKGNKFFRNKKYDEAIKYYTQAIELCP 146 (606)
T ss_pred hhhhhhhhcccHHHHHHHHHHHHhcCC
No 133
>PF00140 Sigma70_r1_2: Sigma-70 factor, region 1.2; InterPro: IPR009042 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. ; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1SMY_F 1IW7_P 1SIG_A 3IYD_F 2BE5_F 2A6E_F 2CW0_F 2A69_P 2A6H_P 3DXJ_P ....
Probab=22.31 E-value=1.7e+02 Score=16.71 Aligned_cols=19 Identities=21% Similarity=0.242 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHcccccc
Q psy2692 92 VDAIEKLRNVLICIPLLVV 110 (133)
Q Consensus 92 ~eAl~~Fr~iL~~i~l~vv 110 (133)
+|++..|-+-+..+|++..
T Consensus 1 ~D~l~~Yl~ei~~~~LLt~ 19 (37)
T PF00140_consen 1 SDSLRLYLKEIGRYPLLTA 19 (37)
T ss_dssp HHHHHHHHHHHHHS-EETT
T ss_pred CcHHHHHHHHHcCCCCCCH
Confidence 5889999999999999944
No 134
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=22.20 E-value=1.4e+02 Score=21.03 Aligned_cols=29 Identities=24% Similarity=0.261 Sum_probs=22.2
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
..+.|.-++..|++++|+..|+..+..-|
T Consensus 34 ~~~la~~~~~~~~~~~A~~~~~~~l~~~p 62 (234)
T TIGR02521 34 RVQLALGYLEQGDLEVAKENLDKALEHDP 62 (234)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHhCc
Confidence 55667777888888888888888876654
No 135
>cd03603 CLECT_VCBS A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Bacterial CTLDs within this group are functionally uncharacterized. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surface
Probab=22.08 E-value=1.4e+02 Score=20.59 Aligned_cols=36 Identities=8% Similarity=-0.031 Sum_probs=30.2
Q ss_pred ccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 87 TSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 87 t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
+...|.||...+++. -.-|++++|.+|.+.+.+++.
T Consensus 8 ~~~sw~~A~~~C~~~--g~~La~I~s~~E~~fv~~~~~ 43 (118)
T cd03603 8 GGMTWEAAQTLAESL--GGHLVTINSAEENDWLLSNFG 43 (118)
T ss_pred CCcCHHHHHHHHHHc--CCEEcccCCHHHHHHHHHHhc
Confidence 367899999999985 678888999999988887764
No 136
>cd06266 RNaseH_typeII Ribonuclease H type II. Ribonuclease H (RNase H) is classified into two families, type I (prokaryotic RNase HI, eukaryotic RNase H1 and viral RNase H) and type II (prokaryotic RNase HII and HIII, and eukaryotic RNase H2/HII). RNase H endonucleolytically hydrolyzes an RNA strand when it is annealed to a complementary DNA strand in the presence of divalent cations, in DNA replication and repair. The enzyme can be found in bacteria, archaea, and eukaryotes. Most prokaryotic and eukaryotic genomes contain multiple RNase H genes, however, no prokaryotic genome contained the combination of only RNase HI and HIII. Despite a lack of evidence for homology from sequence comparisons, type I and type II RNase H share a common fold and similar steric configurations of the four acidic active-site residues, suggesting identical or very similar catalytic mechanisms. It appears that type I and type II RNases H also have overlapping functions in cells, as over-expression of Escheri
Probab=22.01 E-value=56 Score=25.06 Aligned_cols=29 Identities=17% Similarity=0.177 Sum_probs=25.1
Q ss_pred cCChHHHHhccCHHHHHHHHHhhcCcccc
Q psy2692 5 KVTSESINKSSSFETAFKLLHDQIGVKNF 33 (133)
Q Consensus 5 ~~~~~~~~AAGsFe~A~~LLnrQiGivNf 33 (133)
-|.++|.+|--.+|..|+.|+++++..+|
T Consensus 132 ~VaAASIiAKv~rd~~m~~l~~~~~~~~~ 160 (193)
T cd06266 132 SVAAASILAKVTRDRLMEELDEEYPGYGF 160 (193)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHCCCCCC
Confidence 47889999999999999999999886444
No 137
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=21.91 E-value=85 Score=24.92 Aligned_cols=20 Identities=25% Similarity=0.428 Sum_probs=17.7
Q ss_pred hHHHHhccCHHHHHHHHHhh
Q psy2692 8 SESINKSSSFETAFKLLHDQ 27 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQ 27 (133)
+.++..+|+++.|.++|.+-
T Consensus 185 A~ey~~~g~~~~A~~~l~~~ 204 (247)
T PF11817_consen 185 AEEYFRLGDYDKALKLLEPA 204 (247)
T ss_pred HHHHHHCCCHHHHHHHHHHH
Confidence 57899999999999999875
No 138
>PF12793 SgrR_N: Sugar transport-related sRNA regulator N-term
Probab=21.84 E-value=89 Score=22.43 Aligned_cols=18 Identities=28% Similarity=0.549 Sum_probs=16.2
Q ss_pred hHHHHhccCHHHHHHHHH
Q psy2692 8 SESINKSSSFETAFKLLH 25 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLn 25 (133)
+.+++..|+++.|+++|.
T Consensus 77 ~~~~l~~g~~~~a~~ll~ 94 (115)
T PF12793_consen 77 AEELLEQGKYEQALQLLD 94 (115)
T ss_pred HHHHHHcCCHHHHHHHHH
Confidence 467889999999999998
No 139
>PF02255 PTS_IIA: PTS system, Lactose/Cellobiose specific IIA subunit; InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) [, ] is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC []. These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose [, , , ]. The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilised by a centrally located metal ion []. Separation into subunits is thought to occur after phosphorylation.; GO: 0005351 sugar:hydrogen symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane; PDB: 3L8R_D 2E2A_B 1E2A_C 3K1S_C 2LRK_C 2LRL_A 2WY2_A 1WCR_A 2WWV_C.
Probab=21.63 E-value=96 Score=21.60 Aligned_cols=20 Identities=25% Similarity=0.375 Sum_probs=16.2
Q ss_pred hHHHHhccCHHHHHHHHHhh
Q psy2692 8 SESINKSSSFETAFKLLHDQ 27 (133)
Q Consensus 8 ~~~~~AAGsFe~A~~LLnrQ 27 (133)
+.+.+..|+|+.|-.+|++-
T Consensus 21 Al~~a~~~~fe~A~~~l~~a 40 (96)
T PF02255_consen 21 ALKAAREGDFEEAEELLKEA 40 (96)
T ss_dssp HHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHhCCHHHHHHHHHHH
Confidence 56778899999999999876
No 140
>COG1283 NptA Na+/phosphate symporter [Inorganic ion transport and metabolism]
Probab=21.58 E-value=1.5e+02 Score=27.08 Aligned_cols=40 Identities=18% Similarity=0.224 Sum_probs=28.0
Q ss_pred CHHHHHHHHHHHHHHccccccc---ChHHHHHHHHHHHHHHHh
Q psy2692 90 KFVDAIEKLRNVLICIPLLVVT---TKKEIEEAKQIINICKEY 129 (133)
Q Consensus 90 KF~eAl~~Fr~iL~~i~l~vv~---~~~e~~e~~eli~icrEY 129 (133)
|.+|+++....-++....-+-. |++|.+...++++.||++
T Consensus 370 ~~e~~vd~~~~~Ik~YL~~ls~~~Lse~es~r~~~iid~a~~l 412 (533)
T COG1283 370 KLEDAVDRLYEEIKLYLARLSKEGLSEEESRRWAEIIDAAINL 412 (533)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHhH
Confidence 4566666666666554433222 688999999999999986
No 141
>KOG0482|consensus
Probab=21.43 E-value=3.6e+02 Score=25.37 Aligned_cols=81 Identities=17% Similarity=0.280 Sum_probs=46.2
Q ss_pred ccccccHHHHHHHH-hhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc------------CCHHHHHHH
Q psy2692 31 KNFLPYRQMFLNSY-LQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS------------GKFVDAIEK 97 (133)
Q Consensus 31 vNf~PLk~~F~~~y-~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~------------gKF~eAl~~ 97 (133)
-+|+||-+-.|.-| ..+|.+-|-.| ++|.+.+-.||--... -+-=-++-.
T Consensus 550 ~~fepl~~~~mR~yI~~ak~~~P~vp-----------------~~l~dyi~~AYv~~Rrea~~~~~~t~ttpRtLL~IlR 612 (721)
T KOG0482|consen 550 LDFEPLDPNLMRRYISLAKRKNPVVP-----------------EALADYITGAYVELRREARSSKDFTYTTPRTLLGILR 612 (721)
T ss_pred ccCCCCCHHHHHHHHHHHhhcCCCCC-----------------HHHHHHHHHHHHHHHHHhhccCCCcccCHHHHHHHHH
Confidence 57899999998888 45666655544 2344444444432221 111122223
Q ss_pred HHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692 98 LRNVLICIPLLVVTTKKEIEEAKQIINICKE 128 (133)
Q Consensus 98 Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE 128 (133)
.-..|-.+-|...-++++++|+-.|++..++
T Consensus 613 ls~AlarLRls~~V~~~DV~EALRLme~sK~ 643 (721)
T KOG0482|consen 613 LSTALARLRLSDSVEEDDVNEALRLMEMSKD 643 (721)
T ss_pred HHHHHHHhhhccccchhhHHHHHHHHHhhhc
Confidence 3333444555666678888888888887765
No 142
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=21.23 E-value=2.3e+02 Score=21.21 Aligned_cols=29 Identities=21% Similarity=0.418 Sum_probs=25.3
Q ss_pred HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
.+..|.-++..|++++|+..|+.++..-|
T Consensus 73 ~~~la~~~~~~~~~~~A~~~~~~~l~~~p 101 (235)
T TIGR03302 73 QLDLAYAYYKSGDYAEAIAAADRFIRLHP 101 (235)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHCc
Confidence 46678888899999999999999998776
No 143
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=21.15 E-value=2.7e+02 Score=27.14 Aligned_cols=25 Identities=24% Similarity=0.228 Sum_probs=18.9
Q ss_pred HHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692 82 CYQLTTSGKFVDAIEKLRNVLICIP 106 (133)
Q Consensus 82 gyk~~t~gKF~eAl~~Fr~iL~~i~ 106 (133)
|..+...|++++|+..|+.+|..-|
T Consensus 276 G~~~~~~g~~~~A~~~l~~aL~~~P 300 (1157)
T PRK11447 276 GLAAVDSGQGGKAIPELQQAVRANP 300 (1157)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCC
Confidence 6666777888888888888877655
No 144
>PF03705 CheR_N: CheR methyltransferase, all-alpha domain; InterPro: IPR022641 CheR proteins are part of the chemotaxis signaling mechanism which methylates the chemotaxis receptor at specific glutamate residues. This entry refers to the N-terminal domain of the CherR-type MCP methyltransferases, which are found in bacteria, archaea and green plants. This entry is found in association with PF01739 from PFAM. Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. Flagellated bacteria swim towards favourable chemicals and away from deleterious ones. Sensing of chemoeffector gradients involves chemotaxis receptors, transmembrane (TM) proteins that detect stimuli through their periplasmic domains and transduce the signals via their cytoplasmic domains []. Signalling outputs from these receptors are influenced both by the binding of the chemoeffector ligand to their periplasmic domains and by methylation of specific glutamate residues on their cytoplasmic domains. Methylation is catalysed by CheR, an S-adenosylmethionine-dependent methyltransferase [], which reversibly methylates specific glutamate residues within a coiled coil region, to form gamma-glutamyl methyl ester residues [, ]. The structure of the Salmonella typhimurium chemotaxis receptor methyltransferase CheR, bound to S-adenosylhomocysteine, has been determined to a resolution of 2.0 A []. The structure reveals CheR to be a two-domain protein, with a smaller N-terminal helical domain linked via a single polypeptide connection to a larger C-terminal alpha/beta domain. The C-terminal domain has the characteristics of a nucleotide-binding fold, with an insertion of a small anti-parallel beta-sheet subdomain. The S-adenosylhomocysteine-binding site is formed mainly by the large domain, with contributions from residues within the N-terminal domain and the linker region [].; PDB: 1AF7_A 1BC5_A.
Probab=20.98 E-value=1.1e+02 Score=18.25 Aligned_cols=41 Identities=24% Similarity=0.420 Sum_probs=26.2
Q ss_pred CHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCC
Q psy2692 16 SFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLL 57 (133)
Q Consensus 16 sFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~l 57 (133)
+|+.-..++++..|+ +|.+.|+.+++-=...+...-++++.
T Consensus 4 ~f~~~~~~i~~~~Gi-~l~~~K~~~l~rRl~~rm~~~~~~~~ 44 (57)
T PF03705_consen 4 EFERFRELIYRRTGI-DLSEYKRSLLERRLARRMRALGLPSF 44 (57)
T ss_dssp HHHHHHHHHHHHH------GGGHHHHHHHHHHHHHHHT---H
T ss_pred HHHHHHHHHHHHHCC-CCchhhHHHHHHHHHHHHHHcCCCCH
Confidence 477778889999997 89999999888777777766666543
No 145
>cd03590 CLECT_DC-SIGN_like C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on path
Probab=20.98 E-value=1.4e+02 Score=20.09 Aligned_cols=40 Identities=15% Similarity=0.311 Sum_probs=32.2
Q ss_pred HHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 83 YQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 83 yk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
|++++ ...+.+|...++.. ..-|++++|.+|.+-+.+++.
T Consensus 13 y~~~~~~~tw~~A~~~C~~~--g~~La~i~s~~e~~~l~~~~~ 53 (126)
T cd03590 13 YFFSTEKKSWEESRQFCEDM--GAHLVIINSQEEQEFISKILS 53 (126)
T ss_pred EEEeCCCcCHHHHHHHHHhC--CCEEEeeCCHHHHHHHHHHhC
Confidence 44444 57899999999874 788889999999999988874
No 146
>cd06144 REX4_like DEDDh 3'-5' exonuclease domain of RNA exonuclease 4, XPMC2, Interferon Stimulated Gene product of 20 kDa, and similar proteins. This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clus
Probab=20.91 E-value=1.2e+02 Score=22.09 Aligned_cols=29 Identities=14% Similarity=0.292 Sum_probs=25.8
Q ss_pred CccccCChHHHHhccCHHHHHHHHHhhcC
Q psy2692 1 MNLHKVTSESINKSSSFETAFKLLHDQIG 29 (133)
Q Consensus 1 ~~~~~~~~~~~~AAGsFe~A~~LLnrQiG 29 (133)
+.+|.+|++....|-+|+.++.-+.+-+|
T Consensus 48 ~~ihGIt~~~v~~a~~~~~~~~~l~~~l~ 76 (152)
T cd06144 48 TAVSGIRPEHLKDAPDFEEVQKKVAELLK 76 (152)
T ss_pred ccCCCCCHHHHcCCCCHHHHHHHHHHHhC
Confidence 36899999999999999999998888775
No 147
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. L- sel is expressed constitutively on most leukocytes. P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets. E- sels are present on endothelial cells. Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=20.90 E-value=1.4e+02 Score=20.03 Aligned_cols=35 Identities=11% Similarity=0.095 Sum_probs=29.0
Q ss_pred cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692 88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN 124 (133)
Q Consensus 88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~ 124 (133)
.-.|.||...+++. -.-|+.++|.+|.+-+.+++.
T Consensus 9 ~~~w~~A~~~C~~~--g~~La~i~s~~e~~~i~~~~~ 43 (115)
T cd03592 9 KMTFNEAVKYCKSR--GTDLVAIQNAEENALLNGFAL 43 (115)
T ss_pred ccCHHHHHHHHHHc--CCeEeecCCHHHHHHHHHHHH
Confidence 46799999999986 678889999999988887653
No 148
>PF10607 CLTH: CTLH/CRA C-terminal to LisH motif domain; InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined [].
Probab=20.82 E-value=3.1e+02 Score=19.28 Aligned_cols=19 Identities=11% Similarity=0.230 Sum_probs=16.6
Q ss_pred HHHHhccCHHHHHHHHHhh
Q psy2692 9 ESINKSSSFETAFKLLHDQ 27 (133)
Q Consensus 9 ~~~~AAGsFe~A~~LLnrQ 27 (133)
.+.+..|+.+.|+..+++.
T Consensus 9 ~~~I~~g~i~~Ai~w~~~~ 27 (145)
T PF10607_consen 9 RQAILNGDIDPAIEWLNEN 27 (145)
T ss_pred HHHHHcCCHHHHHHHHHHc
Confidence 4567899999999999986
No 149
>PLN00416 carbonate dehydratase
Probab=20.80 E-value=1.6e+02 Score=24.26 Aligned_cols=73 Identities=16% Similarity=0.213 Sum_probs=49.3
Q ss_pred hccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccCCHH
Q psy2692 13 KSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSGKFV 92 (133)
Q Consensus 13 AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~gKF~ 92 (133)
|+.|++.+..-|++-+ -|.+-++..--..-..-.+++.... .+.+.-.++|.+|++-|..++++
T Consensus 2 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~al~~Ll~Gn~rF~~~~~~ 65 (258)
T PLN00416 2 ATESYEAAIKGLNDLL--STKADLGNVAAAKIKALTAELKELD--------------SSNSDAIERIKTGFTQFKTEKYL 65 (258)
T ss_pred CcccHHHHHHHHHhhc--ccccccchHHHHhHHHHHHHHHHhh--------------cCHHHHHHHHHHHHHHHHhcccc
Confidence 5678999999998853 2566666554443333333343333 23456677999999999999998
Q ss_pred HHHHHHHHH
Q psy2692 93 DAIEKLRNV 101 (133)
Q Consensus 93 eAl~~Fr~i 101 (133)
+-.+.|+.+
T Consensus 66 ~~~~~~~~l 74 (258)
T PLN00416 66 KNSTLFNHL 74 (258)
T ss_pred cCHHHHHhh
Confidence 877777764
No 150
>KOG2076|consensus
Probab=20.56 E-value=1.4e+02 Score=28.98 Aligned_cols=40 Identities=20% Similarity=0.150 Sum_probs=34.3
Q ss_pred hHH-HHHHHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692 72 SID-LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT 111 (133)
Q Consensus 72 l~~-L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~ 111 (133)
++- +...|.+|--+|--|.+++|...+++|+.+.|.....
T Consensus 135 l~~~l~~ll~eAN~lfarg~~eeA~~i~~EvIkqdp~~~~a 175 (895)
T KOG2076|consen 135 LAPELRQLLGEANNLFARGDLEEAEEILMEVIKQDPRNPIA 175 (895)
T ss_pred cCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCccchhh
Confidence 444 7778899999999999999999999999999986443
No 151
>KOG0980|consensus
Probab=20.49 E-value=91 Score=30.43 Aligned_cols=40 Identities=25% Similarity=0.297 Sum_probs=29.6
Q ss_pred HHHHHHhhcCcccccccHHHHHHHHhhhh---------------hhccCCCCCCC
Q psy2692 20 AFKLLHDQIGVKNFLPYRQMFLNSYLQSR---------------TAYSTLPLLAP 59 (133)
Q Consensus 20 A~~LLnrQiGivNf~PLk~~F~~~y~~s~---------------~~~~~~p~lp~ 59 (133)
=|.-|++|+----.+|+|++|+..|..-+ ..+|.+|.-||
T Consensus 208 mlfkLHs~vp~dtLeghRdRf~~qf~rLk~FY~~~S~lqYfk~LI~IP~LP~~~P 262 (980)
T KOG0980|consen 208 MLFKLHSQVPPDTLEGHRDRFHTQFERLKQFYADCSNLQYFKRLIQIPTLPEDAP 262 (980)
T ss_pred HHHHHHcCCCHHHhhhHHHHHHHHHHHHHHHHHhcchhHHHHHHhcCCCCCCCCc
Confidence 46678888887778999999998765433 34788886554
No 152
>PF05053 Menin: Menin; InterPro: IPR007747 MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23 [].; GO: 0005634 nucleus; PDB: 3RE2_A 3U84_B 3U86_A 3U88_B 3U85_A.
Probab=20.45 E-value=2.3e+02 Score=26.48 Aligned_cols=49 Identities=16% Similarity=0.226 Sum_probs=38.9
Q ss_pred HHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692 80 QQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI 130 (133)
Q Consensus 80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi 130 (133)
+-|| +...+++.||+...-+.-.-|-...- ++++++--||+++|+-|-|
T Consensus 324 ~gg~-~yR~~~~~eA~~~Wa~aa~Vi~~YnY-~reDeEiYKEfleIAneLi 372 (618)
T PF05053_consen 324 LGGY-YYRHKRYREALRSWAEAADVIRKYNY-SREDEEIYKEFLEIANELI 372 (618)
T ss_dssp HHHH-HHHTT-HHHHHHHHHHHHHHHTTSB---GGGHHHHHHHHHHHHTHH
T ss_pred hhhH-HHHHHHHHHHHHHHHHHHHHHHHccc-CccHHHHHHHHHHHHHHHH
Confidence 4466 78889999999999988777777755 7999999999999998865
No 153
>KOG1173|consensus
Probab=20.40 E-value=1.6e+02 Score=27.34 Aligned_cols=27 Identities=22% Similarity=0.256 Sum_probs=16.0
Q ss_pred HHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 82 CYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 82 gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
||-.--.|+|+-|++.|...|..-|.-
T Consensus 496 g~iy~llgnld~Aid~fhKaL~l~p~n 522 (611)
T KOG1173|consen 496 GYIYHLLGNLDKAIDHFHKALALKPDN 522 (611)
T ss_pred HHHHHHhcChHHHHHHHHHHHhcCCcc
Confidence 455555566666666666666655543
No 154
>cd05513 Bromo_brd7_like Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown to inhibit cell growth and the progression of the cell cycle by regulating cell-cycle genes at the transcriptional level. BRD7 has been identified as a gene involved in nasopharyngeal carcinoma. The protein interacts with acetylated histone H3 via its bromodomain. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=20.21 E-value=3.1e+02 Score=18.96 Aligned_cols=88 Identities=11% Similarity=-0.001 Sum_probs=54.0
Q ss_pred ccHHHHHHHHhhhhhhccCCCCCCCccccCCCC------CCCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 35 PYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRK------VDRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 35 PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn------~~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
||+..+.++...-+..=.+.++.-|+....--+ .|..++.+..+|+.+. =..+.+-..-++-|.......
T Consensus 1 ~l~~~l~~il~~l~~~~~~~~F~~PV~~~~~pdY~~vIk~PmDL~tI~~kl~~~~----Y~s~~~f~~D~~li~~Na~~y 76 (98)
T cd05513 1 PLQKALEQLIRQLQRKDPHGFFAFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNND----YQSIEEFKDDFKLMCENAMKY 76 (98)
T ss_pred CHHHHHHHHHHHHHcCCccccccCcCCccccccHHHHHcCccCHHHHHHHHhCCC----CCCHHHHHHHHHHHHHHHHHH
Confidence 777888777765555444555544432111001 0888999988888763 334666777777777776666
Q ss_pred cccChHHHHHHHHHHHHH
Q psy2692 109 VVTTKKEIEEAKQIINIC 126 (133)
Q Consensus 109 vv~~~~e~~e~~eli~ic 126 (133)
--.+...-..+.++.+..
T Consensus 77 N~~~s~~~~~A~~L~~~~ 94 (98)
T cd05513 77 NKPDTIYYKAAKKLLHSG 94 (98)
T ss_pred CCCCCHHHHHHHHHHHhh
Confidence 656655556666665543
No 155
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=20.15 E-value=71 Score=22.74 Aligned_cols=80 Identities=18% Similarity=0.302 Sum_probs=42.2
Q ss_pred hccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc-CCH
Q psy2692 13 KSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS-GKF 91 (133)
Q Consensus 13 AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~-gKF 91 (133)
...+|+.|++.|++++|=- .- ..+.+...-..+|.+. .+.+..+..+.+.++......+. |.=
T Consensus 52 ~~~~Y~~a~~~L~~~yg~~--~~----i~~~~~~~l~~l~~~~----------~~d~~~L~~~~~~v~~~i~~L~~lg~~ 115 (145)
T PF03564_consen 52 SEENYEEAWELLEERYGNP--RR----IIQALLEELRNLPPIS----------NDDPEALRSLVDKVNNCIRALKALGVN 115 (145)
T ss_pred cchhhHHHHHHHHHHhCCc--hH----HHHHHHHHHhcccccc----------chhHHHHHHHHHHHHHHHHHHHHcCCC
Confidence 4688999999999999832 22 2223333333333311 11133566666666665554443 443
Q ss_pred HHHHHHHHHHHHHcccc
Q psy2692 92 VDAIEKLRNVLICIPLL 108 (133)
Q Consensus 92 ~eAl~~Fr~iL~~i~l~ 108 (133)
.+.-....-|+..+|.-
T Consensus 116 ~~~~~l~~~i~~KLp~~ 132 (145)
T PF03564_consen 116 VDDPLLISIILSKLPPE 132 (145)
T ss_pred CCCHHHHHHHHHHCCHH
Confidence 33333445555666653
No 156
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=20.01 E-value=4.8e+02 Score=24.03 Aligned_cols=27 Identities=26% Similarity=0.395 Sum_probs=22.3
Q ss_pred HHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692 82 CYQLTTSGKFVDAIEKLRNVLICIPLL 108 (133)
Q Consensus 82 gyk~~t~gKF~eAl~~Fr~iL~~i~l~ 108 (133)
|--+-+.||-.||++.||+++.-.|..
T Consensus 266 AmCarklGr~~EAIk~~rdLlke~p~~ 292 (539)
T PF04184_consen 266 AMCARKLGRLREAIKMFRDLLKEFPNL 292 (539)
T ss_pred HHHHHHhCChHHHHHHHHHHHhhCCcc
Confidence 334448999999999999999988864
Done!