Query psy8176
Match_columns 159
No_of_seqs 110 out of 353
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 20:32:50 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8176.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8176hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF08336 P4Ha_N: Prolyl 4-Hydr 100.0 7.2E-34 1.6E-38 211.4 10.1 90 3-94 45-134 (134)
2 PF07719 TPR_2: Tetratricopept 96.2 0.013 2.8E-07 32.1 4.3 32 102-133 1-32 (34)
3 PF08631 SPO22: Meiosis protei 96.0 0.022 4.7E-07 46.8 6.3 57 102-158 35-98 (278)
4 PF13424 TPR_12: Tetratricopep 95.5 0.018 3.9E-07 37.5 3.2 57 102-159 5-61 (78)
5 PF13414 TPR_11: TPR repeat; P 92.5 0.63 1.4E-05 29.2 5.9 50 101-158 2-51 (69)
6 TIGR02795 tol_pal_ybgF tol-pal 92.0 1.9 4E-05 29.2 8.3 54 100-158 37-90 (119)
7 PF13176 TPR_7: Tetratricopept 91.2 0.49 1.1E-05 26.6 3.8 29 105-133 2-30 (36)
8 PF13432 TPR_16: Tetratricopep 91.2 0.59 1.3E-05 29.1 4.6 46 106-159 1-46 (65)
9 smart00028 TPR Tetratricopepti 91.1 0.31 6.6E-06 24.3 2.8 28 104-131 3-30 (34)
10 smart00671 SEL1 Sel1-like repe 90.8 0.46 9.9E-06 26.0 3.4 28 103-130 2-33 (36)
11 PRK10803 tol-pal system protei 90.1 2.9 6.3E-05 34.3 8.9 58 97-159 175-232 (263)
12 PF13374 TPR_10: Tetratricopep 90.0 0.8 1.7E-05 25.5 4.1 33 102-134 2-34 (42)
13 PF12895 Apc3: Anaphase-promot 87.5 1.1 2.4E-05 29.5 4.0 49 102-159 25-73 (84)
14 PF13174 TPR_6: Tetratricopept 87.0 0.81 1.8E-05 24.3 2.6 29 106-134 4-32 (33)
15 cd00189 TPR Tetratricopeptide 86.9 1.6 3.4E-05 26.7 4.3 27 105-131 3-29 (100)
16 PF00515 TPR_1: Tetratricopept 86.0 2.2 4.7E-05 23.0 4.1 30 102-131 1-30 (34)
17 PF13181 TPR_8: Tetratricopept 85.0 2.1 4.6E-05 22.9 3.7 30 103-132 2-31 (34)
18 PF13428 TPR_14: Tetratricopep 85.0 1.8 3.9E-05 25.2 3.6 28 105-132 4-31 (44)
19 PF11817 Foie-gras_1: Foie gra 84.9 1.9 4E-05 34.9 4.8 60 98-159 174-233 (247)
20 PF08238 Sel1: Sel1 repeat; I 84.1 2 4.3E-05 23.8 3.4 29 102-130 1-36 (39)
21 PF13424 TPR_12: Tetratricopep 84.0 1.4 2.9E-05 28.4 3.0 32 102-133 46-77 (78)
22 PF13525 YfiO: Outer membrane 84.0 5 0.00011 31.2 6.7 57 96-157 36-92 (203)
23 PF13432 TPR_16: Tetratricopep 83.8 1.9 4E-05 26.7 3.5 33 101-133 30-62 (65)
24 TIGR03302 OM_YfiO outer membra 83.6 18 0.0004 27.9 10.9 49 106-159 170-218 (235)
25 PF14559 TPR_19: Tetratricopep 82.3 2.5 5.5E-05 26.2 3.7 40 112-159 1-40 (68)
26 PF08631 SPO22: Meiosis protei 81.3 16 0.00035 29.8 9.1 37 95-131 232-275 (278)
27 PLN03098 LPA1 LOW PSII ACCUMUL 79.4 6.9 0.00015 34.9 6.6 55 99-158 72-126 (453)
28 PF13371 TPR_9: Tetratricopept 77.5 4.4 9.5E-05 25.4 3.7 42 109-158 2-43 (73)
29 PRK11788 tetratricopeptide rep 76.9 28 0.00062 28.8 9.4 26 105-130 110-135 (389)
30 cd00189 TPR Tetratricopeptide 76.9 14 0.00029 22.2 6.1 26 105-130 37-62 (100)
31 COG1729 Uncharacterized protei 76.4 8.9 0.00019 31.8 6.0 58 97-159 173-230 (262)
32 KOG2002|consensus 75.6 15 0.00033 35.6 8.0 94 59-158 616-730 (1018)
33 PF07721 TPR_4: Tetratricopept 74.7 3.9 8.5E-05 21.2 2.4 23 105-127 4-26 (26)
34 PF13525 YfiO: Outer membrane 74.4 38 0.00082 26.2 11.9 119 12-159 71-193 (203)
35 PRK02603 photosystem I assembl 74.1 12 0.00025 28.0 5.8 22 109-130 79-100 (172)
36 TIGR03504 FimV_Cterm FimV C-te 73.0 6.4 0.00014 23.6 3.3 25 106-130 3-27 (44)
37 PRK02603 photosystem I assembl 72.6 12 0.00026 27.9 5.6 27 105-131 38-64 (172)
38 CHL00033 ycf3 photosystem I as 72.4 13 0.00029 27.4 5.8 52 102-158 35-86 (168)
39 cd05804 StaR_like StaR_like; a 72.2 16 0.00035 29.9 6.7 53 103-159 149-201 (355)
40 PF13512 TPR_18: Tetratricopep 71.7 20 0.00043 27.0 6.5 57 98-159 6-62 (142)
41 PF14938 SNAP: Soluble NSF att 70.4 5.4 0.00012 32.5 3.5 80 64-158 89-169 (282)
42 PRK15359 type III secretion sy 70.2 12 0.00027 27.3 5.1 50 101-158 57-106 (144)
43 PF13429 TPR_15: Tetratricopep 67.8 9.1 0.0002 30.7 4.3 47 105-159 217-263 (280)
44 TIGR03302 OM_YfiO outer membra 67.6 24 0.00052 27.2 6.5 52 100-156 68-119 (235)
45 TIGR02552 LcrH_SycD type III s 67.0 16 0.00035 25.4 5.0 53 99-159 48-100 (135)
46 PRK10866 outer membrane biogen 66.3 67 0.0015 25.8 10.7 52 102-158 175-226 (243)
47 COG3063 PilF Tfp pilus assembl 66.1 11 0.00025 30.8 4.4 45 108-158 109-153 (250)
48 TIGR02521 type_IV_pilW type IV 65.9 21 0.00045 26.2 5.7 33 99-131 28-60 (234)
49 TIGR02795 tol_pal_ybgF tol-pal 65.2 28 0.0006 23.2 5.8 53 102-159 2-54 (119)
50 PRK10866 outer membrane biogen 64.4 26 0.00056 28.2 6.2 56 97-157 64-119 (243)
51 PRK10370 formate-dependent nit 63.9 66 0.0014 24.9 9.3 50 102-156 144-193 (198)
52 TIGR00540 hemY_coli hemY prote 62.0 18 0.00039 31.1 5.2 44 107-159 340-385 (409)
53 PF13371 TPR_9: Tetratricopept 59.6 38 0.00083 20.9 5.3 31 101-131 28-58 (73)
54 KOG3617|consensus 59.5 30 0.00065 33.7 6.4 113 11-128 879-993 (1416)
55 PF12688 TPR_5: Tetratrico pep 59.4 35 0.00076 24.6 5.6 49 105-158 41-89 (120)
56 PF12455 Dynactin: Dynein asso 59.3 92 0.002 25.7 8.7 105 10-131 69-176 (274)
57 PF14559 TPR_19: Tetratricopep 57.9 14 0.00029 22.7 2.9 31 101-131 24-54 (68)
58 KOG2940|consensus 56.9 13 0.00029 30.8 3.3 53 23-77 34-91 (325)
59 TIGR02521 type_IV_pilW type IV 56.0 76 0.0017 23.1 9.3 25 106-130 139-163 (234)
60 PRK15359 type III secretion sy 55.0 45 0.00098 24.2 5.7 52 99-158 21-72 (144)
61 KOG1173|consensus 53.8 13 0.00027 34.2 2.9 33 102-134 414-446 (611)
62 PF09976 TPR_21: Tetratricopep 50.4 43 0.00093 24.1 4.9 32 100-131 46-77 (145)
63 KOG1840|consensus 49.8 48 0.001 30.0 6.0 66 93-158 190-255 (508)
64 PF13429 TPR_15: Tetratricopep 49.0 39 0.00084 27.0 4.9 50 101-158 145-194 (280)
65 PF13414 TPR_11: TPR repeat; P 47.2 25 0.00055 21.5 2.9 29 103-131 38-67 (69)
66 PF04212 MIT: MIT (microtubule 47.1 47 0.001 21.0 4.2 33 102-134 5-37 (69)
67 PF14853 Fis1_TPR_C: Fis1 C-te 46.4 39 0.00085 20.9 3.5 28 104-131 2-30 (53)
68 TIGR02552 LcrH_SycD type III s 43.3 1.1E+02 0.0023 21.1 9.3 34 101-134 84-117 (135)
69 PLN03088 SGT1, suppressor of 42.7 81 0.0017 26.7 6.1 33 99-131 67-99 (356)
70 TIGR00990 3a0801s09 mitochondr 42.6 69 0.0015 28.9 5.9 28 103-130 400-427 (615)
71 PRK11189 lipoprotein NlpI; Pro 40.9 71 0.0015 26.1 5.3 30 102-131 98-127 (296)
72 cd02679 MIT_spastin MIT: domai 40.8 49 0.0011 22.3 3.6 35 102-136 8-42 (79)
73 PRK10370 formate-dependent nit 40.7 1.1E+02 0.0023 23.7 6.1 51 101-159 106-159 (198)
74 PRK11788 tetratricopeptide rep 40.5 2.1E+02 0.0045 23.5 10.0 30 102-131 214-243 (389)
75 cd09247 BRO1_Alix_like_2 Prote 40.4 45 0.00096 28.4 4.1 35 101-135 252-286 (346)
76 TIGR00990 3a0801s09 mitochondr 40.0 92 0.002 28.1 6.3 49 102-158 365-413 (615)
77 PRK11447 cellulose synthase su 38.7 3.1E+02 0.0067 27.1 10.1 30 102-131 303-332 (1157)
78 TIGR01987 HI0074 nucleotidyltr 38.7 1.3E+02 0.0028 21.9 5.8 70 59-131 18-88 (123)
79 TIGR02917 PEP_TPR_lipo putativ 38.5 2.4E+02 0.0052 25.3 8.7 33 99-131 122-154 (899)
80 COG4649 Uncharacterized protei 37.5 31 0.00066 27.5 2.5 30 99-130 166-195 (221)
81 cd09245 BRO1_UmRIM23-like Prot 37.5 47 0.001 29.2 3.9 33 102-134 296-328 (413)
82 KOG1840|consensus 37.0 2.4E+02 0.0053 25.6 8.4 57 102-159 240-298 (508)
83 PLN03088 SGT1, suppressor of 37.0 2.2E+02 0.0047 24.1 7.8 17 117-133 102-118 (356)
84 PRK12370 invasion protein regu 36.0 82 0.0018 28.2 5.3 45 106-158 342-386 (553)
85 KOG1156|consensus 36.0 57 0.0012 30.6 4.3 47 104-158 77-123 (700)
86 cd04087 PTPA Phosphotyrosyl ph 35.6 20 0.00043 29.8 1.2 57 71-128 145-217 (266)
87 KOG1855|consensus 35.1 37 0.00081 30.2 2.8 38 4-41 165-203 (484)
88 PF09976 TPR_21: Tetratricopep 35.0 1.2E+02 0.0027 21.6 5.3 75 60-159 59-133 (145)
89 PRK11189 lipoprotein NlpI; Pro 34.8 48 0.001 27.1 3.4 28 104-131 238-265 (296)
90 COG0790 FOG: TPR repeat, SEL1 34.5 54 0.0012 26.3 3.6 31 101-131 108-142 (292)
91 TIGR02917 PEP_TPR_lipo putativ 34.3 1.2E+02 0.0026 27.3 6.1 53 99-159 53-105 (899)
92 COG4700 Uncharacterized protei 33.4 1.4E+02 0.003 24.2 5.6 89 59-158 50-138 (251)
93 PF12688 TPR_5: Tetratrico pep 31.7 1.7E+02 0.0037 21.0 5.5 49 106-159 5-53 (120)
94 COG4235 Cytochrome c biogenesi 31.7 1.2E+02 0.0026 25.5 5.3 81 59-159 132-242 (287)
95 PF04976 DmsC: DMSO reductase 31.6 25 0.00053 29.0 1.2 28 12-39 63-90 (276)
96 COG3302 DmsC DMSO reductase an 31.1 31 0.00068 28.9 1.7 26 12-37 60-85 (281)
97 PRK10747 putative protoheme IX 30.8 1.2E+02 0.0027 25.8 5.4 31 101-131 327-357 (398)
98 COG4105 ComL DNA uptake lipopr 30.0 2E+02 0.0044 23.7 6.2 56 98-158 30-85 (254)
99 PF03704 BTAD: Bacterial trans 29.6 2.1E+02 0.0044 20.2 5.7 46 105-158 65-110 (146)
100 PF09986 DUF2225: Uncharacteri 28.5 71 0.0015 25.3 3.3 28 104-131 167-194 (214)
101 PF12462 Helicase_IV_N: DNA he 28.3 2.7E+02 0.0057 21.1 8.5 67 24-90 82-151 (166)
102 PRK15363 pathogenicity island 28.2 15 0.00033 28.1 -0.6 53 99-159 66-118 (157)
103 PF12569 NARP1: NMDA receptor- 28.1 1.6E+02 0.0034 26.7 5.8 47 104-158 196-242 (517)
104 cd09034 BRO1_Alix_like Protein 27.8 82 0.0018 26.3 3.7 35 100-134 249-283 (345)
105 PF07079 DUF1347: Protein of u 27.7 1.1E+02 0.0025 27.7 4.6 86 74-159 51-143 (549)
106 TIGR03505 FimV_core FimV N-ter 27.7 28 0.00061 23.1 0.7 46 59-110 11-57 (74)
107 CHL00033 ycf3 photosystem I as 27.2 77 0.0017 23.2 3.1 27 105-131 75-101 (168)
108 PRK09400 secE preprotein trans 27.0 99 0.0021 19.8 3.2 29 22-50 8-36 (61)
109 COG0429 Predicted hydrolase of 26.7 1.8E+02 0.0038 25.2 5.5 79 8-95 175-255 (345)
110 KOG4563|consensus 26.3 89 0.0019 27.4 3.6 57 101-158 40-97 (400)
111 PF03095 PTPA: Phosphotyrosyl 25.9 27 0.00058 29.4 0.4 57 71-128 169-240 (299)
112 KOG3830|consensus 25.6 88 0.0019 28.2 3.5 41 68-116 289-331 (542)
113 PHA02763 hypothetical protein; 25.5 1.1E+02 0.0024 21.3 3.3 30 102-134 54-83 (102)
114 PF09177 Syntaxin-6_N: Syntaxi 25.3 1.3E+02 0.0029 20.4 3.9 41 43-88 41-81 (97)
115 cd02681 MIT_calpain7_1 MIT: do 25.2 1.6E+02 0.0034 19.6 4.0 33 102-134 6-38 (76)
116 PF03745 DUF309: Domain of unk 24.9 63 0.0014 20.5 2.0 25 106-130 3-27 (62)
117 COG3071 HemY Uncharacterized e 24.5 1.4E+02 0.0031 26.2 4.6 31 101-131 327-357 (400)
118 PF09722 DUF2384: Protein of u 23.9 1.3E+02 0.0029 18.0 3.3 43 108-153 3-50 (54)
119 PRK15174 Vi polysaccharide exp 23.9 5.8E+02 0.013 23.5 9.8 31 101-131 283-313 (656)
120 PF10579 Rapsyn_N: Rapsyn N-te 23.3 1.3E+02 0.0028 20.5 3.3 51 103-158 7-57 (80)
121 PF10516 SHNi-TPR: SHNi-TPR; 23.2 1.6E+02 0.0035 16.9 3.8 29 106-134 5-33 (38)
122 PRK12370 invasion protein regu 21.9 2.3E+02 0.0051 25.3 5.7 31 101-131 371-401 (553)
123 KOG2002|consensus 21.7 8.1E+02 0.017 24.4 9.6 38 97-134 302-339 (1018)
124 PF07743 HSCB_C: HSCB C-termin 21.6 97 0.0021 20.1 2.5 21 61-81 52-72 (78)
125 PRK09782 bacteriophage N4 rece 21.5 2.5E+02 0.0054 27.7 6.1 30 102-131 609-638 (987)
126 COG1803 MgsA Methylglyoxal syn 21.4 71 0.0015 23.9 1.9 22 59-84 89-110 (142)
127 COG0790 FOG: TPR repeat, SEL1 21.1 1.5E+02 0.0032 23.7 3.9 32 100-131 185-220 (292)
128 PF14718 SLT_L: Soluble lytic 21.0 1.3E+02 0.0027 19.6 2.9 54 68-123 7-60 (71)
129 PF09523 DUF2390: Protein of u 20.8 1.8E+02 0.0039 20.6 3.9 53 67-129 15-67 (109)
130 KOG1126|consensus 20.7 3.6E+02 0.0077 25.4 6.5 67 59-131 383-450 (638)
131 TIGR00327 secE_euk_arch protei 20.5 1.6E+02 0.0035 18.9 3.2 27 24-50 6-32 (61)
132 PF06160 EzrA: Septation ring 20.5 4.3E+02 0.0092 24.1 7.1 52 84-135 495-549 (560)
133 COG4667 Predicted esterase of 20.4 1.4E+02 0.0031 25.1 3.6 73 9-82 194-273 (292)
134 KOG2867|consensus 20.4 53 0.0012 28.5 1.2 17 69-85 180-196 (367)
135 PF10151 DUF2359: Uncharacteri 20.3 3.2E+02 0.0068 24.6 6.1 41 10-51 188-228 (469)
136 cd02656 MIT MIT: domain contai 20.3 2.5E+02 0.0053 17.9 5.9 33 102-134 6-38 (75)
137 PF07026 DUF1317: Protein of u 20.2 62 0.0013 20.8 1.2 22 11-32 29-50 (60)
No 1
>PF08336 P4Ha_N: Prolyl 4-Hydroxylase alpha-subunit, N-terminal region; InterPro: IPR013547 The members found in this entry are eukaryotic proteins, and include all three isoforms of the prolyl 4-hydroxylase alpha subunit. This enzyme (1.14.11.2 from EC) is important in the post-translational modification of collagen, as it catalyses the formation of 4-hydroxyproline. In vertebrates, the complete enzyme is an alpha2-beta2 tetramer; the beta-subunit is identical to protein disulphide isomerase [, , , ]. The function of the N-terminal region featured in this family does not seem to be known. ; GO: 0004656 procollagen-proline 4-dioxygenase activity, 0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen, 0055114 oxidation-reduction process, 0005783 endoplasmic reticulum
Probab=100.00 E-value=7.2e-34 Score=211.39 Aligned_cols=90 Identities=48% Similarity=0.811 Sum_probs=86.0
Q ss_pred hhhhhcChhhhhhchhHHHHHHHhhhhhHHHHHHHHHhhhhHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHHHhCCC
Q psy8176 3 HAEAAKDVPEYLANPINAFLVVKRLTLDWKQAEHYMKDHVYQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQDTYKLE 82 (159)
Q Consensus 3 ~~~~~~~~e~yl~nPlnaF~LirRl~~dW~~~~~~~~~~~~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~~Y~L~ 82 (159)
+..+..+|++|++||||||+|||||++||++|+++|+++++.+++..++..+.. +||.+|++||+.||+|||+||+|+
T Consensus 45 ~~~~~~d~e~yl~nPlnaF~LIrRl~~dW~~~~~~~~~~~~~~~~~~~~~~~~~--~Pt~~Dl~gA~~~l~RLQ~~Y~L~ 122 (134)
T PF08336_consen 45 HEKAKSDPEEYLSNPLNAFSLIRRLHQDWPKWEKLMEQPVGQEQLQNLQELRSK--LPTEEDLEGAAEGLLRLQDTYNLD 122 (134)
T ss_pred HHHhhcchhhhhhcHHHHHHHHHHHHHhhhhHHHHHHHhhhHHHHHHHHHHHhc--CCCHHHHHHHHHHHHHHHHHhCCC
Confidence 345677999999999999999999999999999999999999999999999988 999999999999999999999999
Q ss_pred hHHhhccccCCc
Q psy8176 83 TASVARGELNGV 94 (159)
Q Consensus 83 ~~dla~G~l~g~ 94 (159)
+.|||+|+|.|+
T Consensus 123 ~~dlA~G~l~G~ 134 (134)
T PF08336_consen 123 PSDLANGLLQGV 134 (134)
T ss_pred HHHHhccccCCC
Confidence 999999999986
No 2
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=96.21 E-value=0.013 Score=32.05 Aligned_cols=32 Identities=19% Similarity=0.346 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHD 133 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~ 133 (159)
|.-++.+|..++..++|..|..+++.|++...
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p 32 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELDP 32 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHST
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCc
Confidence 34589999999999999999999999997643
No 3
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=95.97 E-value=0.022 Score=46.82 Aligned_cols=57 Identities=18% Similarity=0.365 Sum_probs=45.3
Q ss_pred HHHHHHHHHHHHhCC-CchhHHHHHHHHHhhhhc-cC--CCCCC---CHHHHHHHHHHhHhhcC
Q psy8176 102 ASDCFELGRQSYNTQ-DFYHTALWMGEALKRHDM-ER--NGTST---PKWEILEYLAYSKFMQD 158 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~-dy~~A~~Wl~~Al~~~~~-~~--~~~~~---~~~~il~~la~s~~~~g 158 (159)
++=||.+|+.+++.+ +|..|..||+.|++.++. .. ..+.+ .+..||..||.++.+.+
T Consensus 35 a~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~~ 98 (278)
T PF08631_consen 35 ARVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEKPGKMDKLSPDGSELRLSILRLLANAYLEWD 98 (278)
T ss_pred HHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcCC
Confidence 677999999999999 999999999999999865 21 12221 16688999999987754
No 4
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=95.46 E-value=0.018 Score=37.54 Aligned_cols=57 Identities=12% Similarity=0.070 Sum_probs=40.9
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+.=+..||..++..++|..|..+++.|++.....++.... ...++..+|.+...+|+
T Consensus 5 a~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~-~a~~~~~lg~~~~~~g~ 61 (78)
T PF13424_consen 5 ANAYNNLARVYRELGRYDEALDYYEKALDIEEQLGDDHPD-TANTLNNLGECYYRLGD 61 (78)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTTTHHHH-HHHHHHHHHHHHHHTTH
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHHHcCC
Confidence 3446789999999999999999999999985443221122 35667788888877763
No 5
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=92.46 E-value=0.63 Score=29.19 Aligned_cols=50 Identities=14% Similarity=0.236 Sum_probs=41.5
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
++.-.+.+|..++..++|..|...+..|++.- .+ ...++-.+|.+..++|
T Consensus 2 ~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~-------p~-~~~~~~~~g~~~~~~~ 51 (69)
T PF13414_consen 2 NAEAWYNLGQIYFQQGDYEEAIEYFEKAIELD-------PN-NAEAYYNLGLAYMKLG 51 (69)
T ss_dssp SHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHS-------TT-HHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC-------CC-CHHHHHHHHHHHHHhC
Confidence 35667899999999999999999999999852 23 5678888888888776
No 6
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=92.00 E-value=1.9 Score=29.20 Aligned_cols=54 Identities=15% Similarity=0.081 Sum_probs=37.8
Q ss_pred CCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 100 LTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 100 Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
..+.-+|.+|..++..++|..|...++.++...... .. ..+++..++.++.+.|
T Consensus 37 ~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~----~~-~~~~~~~~~~~~~~~~ 90 (119)
T TIGR02795 37 YAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKS----PK-APDALLKLGMSLQELG 90 (119)
T ss_pred ccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCC----Cc-ccHHHHHHHHHHHHhC
Confidence 456677889999999999999999888888654321 12 3455666666666655
No 7
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=91.18 E-value=0.49 Score=26.60 Aligned_cols=29 Identities=21% Similarity=0.208 Sum_probs=24.3
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhhhh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKRHD 133 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~~~ 133 (159)
|..||+.+...|+|..|...++.||..-.
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~ 30 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQALALAR 30 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhcc
Confidence 57899999999999999999999885433
No 8
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=91.17 E-value=0.59 Score=29.09 Aligned_cols=46 Identities=22% Similarity=0.265 Sum_probs=35.3
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
|.+|..++..|+|..|..=++.+++.. +. ..+++-.++.+++++|+
T Consensus 1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~-------P~-~~~a~~~lg~~~~~~g~ 46 (65)
T PF13432_consen 1 YALARALYQQGDYDEAIAAFEQALKQD-------PD-NPEAWYLLGRILYQQGR 46 (65)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHCCS-------TT-HHHHHHHHHHHHHHTT-
T ss_pred ChHHHHHHHcCCHHHHHHHHHHHHHHC-------CC-CHHHHHHHHHHHHHcCC
Confidence 567888999999999988888887543 23 67888888888887774
No 9
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=91.09 E-value=0.31 Score=24.30 Aligned_cols=28 Identities=18% Similarity=0.301 Sum_probs=25.2
Q ss_pred HHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 104 DCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 104 DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
-++.+|..++..+++..|..|+..+++.
T Consensus 3 ~~~~~a~~~~~~~~~~~a~~~~~~~~~~ 30 (34)
T smart00028 3 ALYNLGNAYLKLGDYDEALEYYEKALEL 30 (34)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHcc
Confidence 4678999999999999999999999864
No 10
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=90.80 E-value=0.46 Score=25.98 Aligned_cols=28 Identities=14% Similarity=0.200 Sum_probs=23.1
Q ss_pred HHHHHHHHHHHhC----CCchhHHHHHHHHHh
Q psy8176 103 SDCFELGRQSYNT----QDFYHTALWMGEALK 130 (159)
Q Consensus 103 ~DC~~iG~~~y~~----~dy~~A~~Wl~~Al~ 130 (159)
.-||.||..+... .|+..|..|++.|.+
T Consensus 2 ~a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa~ 33 (36)
T smart00671 2 EAQYNLGQMYEYGLGVKKDLEKALEYYKKAAE 33 (36)
T ss_pred HHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHH
Confidence 4689999887764 489999999999865
No 11
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=90.08 E-value=2.9 Score=34.32 Aligned_cols=58 Identities=10% Similarity=0.095 Sum_probs=47.3
Q ss_pred CCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 97 TTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 97 ~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
++.+.+.=-|.+|..+|..++|..|...++..++.+.+.. . ..+++-.++.+..++|+
T Consensus 175 ~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~----~-~~dAl~klg~~~~~~g~ 232 (263)
T PRK10803 175 DSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSP----K-AADAMFKVGVIMQDKGD 232 (263)
T ss_pred CCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCc----c-hhHHHHHHHHHHHHcCC
Confidence 4566677779999999999999999999999998876542 3 66888888888877764
No 12
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=90.04 E-value=0.8 Score=25.52 Aligned_cols=33 Identities=18% Similarity=0.113 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+.=+-.||..++..++|..|..|+++|+.....
T Consensus 2 a~~~~~la~~~~~~g~~~~A~~~~~~al~~~~~ 34 (42)
T PF13374_consen 2 ASALNNLANAYRAQGRYEEALELLEEALEIRER 34 (42)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHHHHhhhhcchhhHHHHHHHHHHHH
Confidence 344668999999999999999999999887654
No 13
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=87.49 E-value=1.1 Score=29.51 Aligned_cols=49 Identities=14% Similarity=0.146 Sum_probs=35.8
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
..-.+.+|..+|+.++|..|..++.. +. .+. . ..++.-.+|.|++++|+
T Consensus 25 ~~~~~~la~~~~~~~~y~~A~~~~~~-~~-~~~------~-~~~~~~l~a~~~~~l~~ 73 (84)
T PF12895_consen 25 SAYLYNLAQCYFQQGKYEEAIELLQK-LK-LDP------S-NPDIHYLLARCLLKLGK 73 (84)
T ss_dssp HHHHHHHHHHHHHTTHHHHHHHHHHC-HT-HHH------C-HHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHH-hC-CCC------C-CHHHHHHHHHHHHHhCC
Confidence 34567789999999999999999988 22 222 2 34555577999999874
No 14
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=87.03 E-value=0.81 Score=24.27 Aligned_cols=29 Identities=21% Similarity=0.344 Sum_probs=25.8
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
|.+|...+..|++..|...++..++++.+
T Consensus 4 ~~~a~~~~~~g~~~~A~~~~~~~~~~~P~ 32 (33)
T PF13174_consen 4 YRLARCYYKLGDYDEAIEYFQRLIKRYPD 32 (33)
T ss_dssp HHHHHHHHHHCHHHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHccCHHHHHHHHHHHHHHCcC
Confidence 67999999999999999999999887653
No 15
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=86.90 E-value=1.6 Score=26.72 Aligned_cols=27 Identities=22% Similarity=0.357 Sum_probs=18.7
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
|+.+|..++..+++..|..+++.+++.
T Consensus 3 ~~~~a~~~~~~~~~~~A~~~~~~~~~~ 29 (100)
T cd00189 3 LLNLGNLYYKLGDYDEALEYYEKALEL 29 (100)
T ss_pred HHHHHHHHHHHhcHHHHHHHHHHHHhc
Confidence 566777777777777777777776653
No 16
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=86.01 E-value=2.2 Score=22.99 Aligned_cols=30 Identities=20% Similarity=0.367 Sum_probs=25.9
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
|.-++.+|...+..++|..|..=++.|++.
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 30 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALEL 30 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHH
Confidence 355889999999999999999999999875
No 17
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=85.00 E-value=2.1 Score=22.92 Aligned_cols=30 Identities=17% Similarity=0.306 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHhCCCchhHHHHHHHHHhhh
Q psy8176 103 SDCFELGRQSYNTQDFYHTALWMGEALKRH 132 (159)
Q Consensus 103 ~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~ 132 (159)
.=|+.+|......+++..|..=++.|++.-
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~ 31 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKALELN 31 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhC
Confidence 348999999999999999999999998753
No 18
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=84.95 E-value=1.8 Score=25.20 Aligned_cols=28 Identities=14% Similarity=0.039 Sum_probs=25.1
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhhh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKRH 132 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~~ 132 (159)
-+.+|+.+...|++..|..+++.+++..
T Consensus 4 ~~~la~~~~~~G~~~~A~~~~~~~l~~~ 31 (44)
T PF13428_consen 4 WLALARAYRRLGQPDEAERLLRRALALD 31 (44)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHC
Confidence 3678999999999999999999999753
No 19
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=84.91 E-value=1.9 Score=34.85 Aligned_cols=60 Identities=12% Similarity=0.011 Sum_probs=50.2
Q ss_pred CCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 98 TQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 98 ~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
.++...=++.||+.++..|+|..|..=++-+...+..+.+..+ ..+++..|..|.++.|+
T Consensus 174 ~R~~~~l~~~~A~ey~~~g~~~~A~~~l~~~~~~yr~egW~~l--~~~~l~~l~~Ca~~~~~ 233 (247)
T PF11817_consen 174 NRMASYLSLEMAEEYFRLGDYDKALKLLEPAASSYRREGWWSL--LTEVLWRLLECAKRLGD 233 (247)
T ss_pred chHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHhCCcHHH--HHHHHHHHHHHHHHhCC
Confidence 4566667789999999999999999999999888888776543 46889999999988875
No 20
>PF08238 Sel1: Sel1 repeat; InterPro: IPR006597 Sel1-like repeats are tetratricopeptide repeat sequences originally identified in a Caenorhabditis elegans receptor molecule which is a key negative regulator of the Notch pathway []. Mammalian homologues have since been identified although these mainly pancreatic proteins have yet to have a function assigned.; PDB: 2XM6_A 3RJV_A 1OUV_A 1KLX_A.
Probab=84.09 E-value=2 Score=23.79 Aligned_cols=29 Identities=28% Similarity=0.512 Sum_probs=20.3
Q ss_pred HHHHHHHH--HHHHhCC-----CchhHHHHHHHHHh
Q psy8176 102 ASDCFELG--RQSYNTQ-----DFYHTALWMGEALK 130 (159)
Q Consensus 102 ~~DC~~iG--~~~y~~~-----dy~~A~~Wl~~Al~ 130 (159)
|.=||.|| ...++.. |+..|..|++.|.+
T Consensus 1 a~A~~~lg~~~~~~~g~~g~~~d~~~A~~~~~~Aa~ 36 (39)
T PF08238_consen 1 AEAQYNLGMYYMYYNGKGGVPKDYEKAFKWYEKAAE 36 (39)
T ss_dssp HHHHHHHHHHHHHHHTSTSSCHHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHHHhhhhccCCccccccchHHHHHHHHH
Confidence 34578888 4334433 57999999999865
No 21
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=83.98 E-value=1.4 Score=28.37 Aligned_cols=32 Identities=22% Similarity=0.287 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHD 133 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~ 133 (159)
+.=.+.||...+..|+|..|..+++.|++..+
T Consensus 46 a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~~ 77 (78)
T PF13424_consen 46 ANTLNNLGECYYRLGDYEEALEYYQKALDIFE 77 (78)
T ss_dssp HHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHhhhc
Confidence 55577899999999999999999999998654
No 22
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=83.97 E-value=5 Score=31.17 Aligned_cols=57 Identities=14% Similarity=0.045 Sum_probs=41.5
Q ss_pred cCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhc
Q psy8176 96 YTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQ 157 (159)
Q Consensus 96 ~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~ 157 (159)
.++++.+.-.|.+|...|+.++|..|...++.-++.+... .. ..+++=.++.|.+++
T Consensus 36 P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~----~~-~~~A~Y~~g~~~~~~ 92 (203)
T PF13525_consen 36 PNSPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNS----PK-ADYALYMLGLSYYKQ 92 (203)
T ss_dssp TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-----TT-HHHHHHHHHHHHHHH
T ss_pred CCChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCC----cc-hhhHHHHHHHHHHHh
Confidence 3577888888999999999999999988888877766553 24 667777777777654
No 23
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=83.83 E-value=1.9 Score=26.71 Aligned_cols=33 Identities=18% Similarity=0.304 Sum_probs=27.5
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHD 133 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~ 133 (159)
++.=-+.+|..++..|+|..|..+++.+++...
T Consensus 30 ~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~P 62 (65)
T PF13432_consen 30 NPEAWYLLGRILYQQGRYDEALAYYERALELDP 62 (65)
T ss_dssp HHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHST
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCc
Confidence 456668899999999999999999999987643
No 24
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=83.60 E-value=18 Score=27.88 Aligned_cols=49 Identities=8% Similarity=-0.020 Sum_probs=39.4
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+.+|..+++.|+|..|..+++.+++.+... .. ..+++-.++.+..+.|+
T Consensus 170 ~~~a~~~~~~g~~~~A~~~~~~al~~~p~~----~~-~~~a~~~l~~~~~~lg~ 218 (235)
T TIGR03302 170 LYVARFYLKRGAYVAAINRFETVVENYPDT----PA-TEEALARLVEAYLKLGL 218 (235)
T ss_pred HHHHHHHHHcCChHHHHHHHHHHHHHCCCC----cc-hHHHHHHHHHHHHHcCC
Confidence 589999999999999999999999876653 23 56777778888877764
No 25
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=82.28 E-value=2.5 Score=26.17 Aligned_cols=40 Identities=20% Similarity=0.120 Sum_probs=30.9
Q ss_pred HHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 112 SYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 112 ~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
++..|+|..|...++.++..... ..++.-.||.++++.|+
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p~--------~~~~~~~la~~~~~~g~ 40 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNPD--------NPEARLLLAQCYLKQGQ 40 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTTT--------SHHHHHHHHHHHHHTT-
T ss_pred ChhccCHHHHHHHHHHHHHHCCC--------CHHHHHHHHHHHHHcCC
Confidence 46789999999999999875332 45777789999988874
No 26
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=81.27 E-value=16 Score=29.76 Aligned_cols=37 Identities=22% Similarity=0.421 Sum_probs=28.8
Q ss_pred ccCCCCCHHHHH-------HHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 95 QYTTQLTASDCF-------ELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 95 ~~~~~Ls~~DC~-------~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
....+|++...- .-|..+|+.++|..|..|++.|+..
T Consensus 232 ~~~~~ls~~~~~a~~~LLW~~~~~~~~~k~y~~A~~w~~~al~~ 275 (278)
T PF08631_consen 232 SLGKQLSAEAASAIHTLLWNKGKKHYKAKNYDEAIEWYELALHS 275 (278)
T ss_pred HhcCCCCHHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHh
Confidence 345567776533 3499999999999999999999853
No 27
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=79.43 E-value=6.9 Score=34.86 Aligned_cols=55 Identities=15% Similarity=0.067 Sum_probs=39.7
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+=++.+|+.+|..++..++|..|..+++.|+..-.... . ....+-.+|-++.++|
T Consensus 72 P~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~a----e-A~~A~yNLAcaya~LG 126 (453)
T PLN03098 72 VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPD----E-AQAAYYNKACCHAYRE 126 (453)
T ss_pred CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCch----H-HHHHHHHHHHHHHHcC
Confidence 56789999999999999999999999999997632211 0 1123455566655555
No 28
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=77.53 E-value=4.4 Score=25.41 Aligned_cols=42 Identities=14% Similarity=0.023 Sum_probs=24.4
Q ss_pred HHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 109 GRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 109 G~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
...+.+.++|..|...++.++..... ....+..+|.+++++|
T Consensus 2 ~~~~~~~~~~~~A~~~~~~~l~~~p~--------~~~~~~~~a~~~~~~g 43 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLERALELDPD--------DPELWLQRARCLFQLG 43 (73)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHhCcc--------cchhhHHHHHHHHHhc
Confidence 45566777777777777777654222 2234445555555555
No 29
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=76.92 E-value=28 Score=28.76 Aligned_cols=26 Identities=12% Similarity=0.052 Sum_probs=17.2
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
.+.+|..++..|+|..|..++..+++
T Consensus 110 ~~~La~~~~~~g~~~~A~~~~~~~l~ 135 (389)
T PRK11788 110 LQELGQDYLKAGLLDRAEELFLQLVD 135 (389)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHc
Confidence 45666666677777777766666654
No 30
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=76.85 E-value=14 Score=22.24 Aligned_cols=26 Identities=15% Similarity=0.208 Sum_probs=13.1
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
++.+|..++..+++..|..+++.+++
T Consensus 37 ~~~~~~~~~~~~~~~~a~~~~~~~~~ 62 (100)
T cd00189 37 YYNLAAAYYKLGKYEEALEDYEKALE 62 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34455555555555555555555443
No 31
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=76.40 E-value=8.9 Score=31.76 Aligned_cols=58 Identities=17% Similarity=0.130 Sum_probs=47.5
Q ss_pred CCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 97 TTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 97 ~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
++..++.--|=||...|..|+|..|..=+..+.+.+.... . -.|.|-.|+.|...+|+
T Consensus 173 ~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~----K-ApdallKlg~~~~~l~~ 230 (262)
T COG1729 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSP----K-APDALLKLGVSLGRLGN 230 (262)
T ss_pred CCcccchhHHHHHHHHHhcccchHHHHHHHHHHHhCCCCC----C-ChHHHHHHHHHHHHhcC
Confidence 6789999999999999999999999998888877665532 2 45888889998888774
No 32
>KOG2002|consensus
Probab=75.59 E-value=15 Score=35.59 Aligned_cols=94 Identities=14% Similarity=0.139 Sum_probs=67.3
Q ss_pred CCChhh---HHHHHHHHHHHHHHhCCChHHh--hccccC---C-cccC----CCCCHHHHHH--------HHHHHHhCCC
Q psy8176 59 FPTDED---LSGAATALLRLQDTYKLETASV--ARGELN---G-VQYT----TQLTASDCFE--------LGRQSYNTQD 117 (159)
Q Consensus 59 ~P~~~D---l~~A~~al~RLQ~~Y~L~~~dl--a~G~l~---g-~~~~----~~Ls~~DC~~--------iG~~~y~~~d 117 (159)
.|+.++ ..-..+||.+-++|-..+|..| |+|+-- . .+.+ +-=.+.+|.. ||..+...|.
T Consensus 616 ~~~rn~ek~kk~~~KAlq~y~kvL~~dpkN~yAANGIgiVLA~kg~~~~A~dIFsqVrEa~~~~~dv~lNlah~~~e~~q 695 (1018)
T KOG2002|consen 616 NPSRNPEKEKKHQEKALQLYGKVLRNDPKNMYAANGIGIVLAEKGRFSEARDIFSQVREATSDFEDVWLNLAHCYVEQGQ 695 (1018)
T ss_pred ccccChHHHHHHHHHHHHHHHHHHhcCcchhhhccchhhhhhhccCchHHHHHHHHHHHHHhhCCceeeeHHHHHHHHHH
Confidence 455544 5557889999999999999999 455310 0 0110 0012344544 7888889999
Q ss_pred chhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 118 FYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 118 y~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
|..|++=++-+++.+... + ..+|+.+||.+.|+.|
T Consensus 696 y~~AIqmYe~~lkkf~~~-----~-~~~vl~~Lara~y~~~ 730 (1018)
T KOG2002|consen 696 YRLAIQMYENCLKKFYKK-----N-RSEVLHYLARAWYEAG 730 (1018)
T ss_pred HHHHHHHHHHHHHHhccc-----C-CHHHHHHHHHHHHHhh
Confidence 999999999999877642 4 7899999999999876
No 33
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=74.73 E-value=3.9 Score=21.19 Aligned_cols=23 Identities=13% Similarity=0.057 Sum_probs=20.3
Q ss_pred HHHHHHHHHhCCCchhHHHHHHH
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGE 127 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~ 127 (159)
.+.+|+.++..|++..|..|+++
T Consensus 4 ~~~la~~~~~~G~~~eA~~~l~~ 26 (26)
T PF07721_consen 4 RLALARALLAQGDPDEAERLLRR 26 (26)
T ss_pred HHHHHHHHHHcCCHHHHHHHHhC
Confidence 57899999999999999998863
No 34
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=74.37 E-value=38 Score=26.15 Aligned_cols=119 Identities=14% Similarity=0.168 Sum_probs=72.6
Q ss_pred hhhhchhHHHHHHHhhhhhHHHHHHHH----HhhhhHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHHHhCCChHHhh
Q psy8176 12 EYLANPINAFLVVKRLTLDWKQAEHYM----KDHVYQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQDTYKLETASVA 87 (159)
Q Consensus 12 ~yl~nPlnaF~LirRl~~dW~~~~~~~----~~~~~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla 87 (159)
.|-.||...+.+-.+-......+...+ .+....+-+..++.+... +|+..=..+|-.-|..|.+. |
T Consensus 71 ~yP~~~~~~~A~Y~~g~~~~~~~~~~~~~~~D~~~~~~A~~~~~~li~~--yP~S~y~~~A~~~l~~l~~~-------l- 140 (203)
T PF13525_consen 71 LYPNSPKADYALYMLGLSYYKQIPGILRSDRDQTSTRKAIEEFEELIKR--YPNSEYAEEAKKRLAELRNR-------L- 140 (203)
T ss_dssp H-TT-TTHHHHHHHHHHHHHHHHHHHH-TT---HHHHHHHHHHHHHHHH---TTSTTHHHHHHHHHHHHHH-------H-
T ss_pred HCCCCcchhhHHHHHHHHHHHhCccchhcccChHHHHHHHHHHHHHHHH--CcCchHHHHHHHHHHHHHHH-------H-
Confidence 344555555554444443333333321 111122334455555556 78877777777666555432 2
Q ss_pred ccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 88 RGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 88 ~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+.-=+.||+..++.+.|.-|..=++..++.+.+. .. ..+.+-.++.+..++|.
T Consensus 141 --------------a~~e~~ia~~Y~~~~~y~aA~~r~~~v~~~yp~t----~~-~~~al~~l~~~y~~l~~ 193 (203)
T PF13525_consen 141 --------------AEHELYIARFYYKRGKYKAAIIRFQYVIENYPDT----PA-AEEALARLAEAYYKLGL 193 (203)
T ss_dssp --------------HHHHHHHHHHHHCTT-HHHHHHHHHHHHHHSTTS----HH-HHHHHHHHHHHHHHTT-
T ss_pred --------------HHHHHHHHHHHHHcccHHHHHHHHHHHHHHCCCC----ch-HHHHHHHHHHHHHHhCC
Confidence 3455889999999999999999999999887663 23 67889999999988873
No 35
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=74.10 E-value=12 Score=27.98 Aligned_cols=22 Identities=14% Similarity=0.114 Sum_probs=10.5
Q ss_pred HHHHHhCCCchhHHHHHHHHHh
Q psy8176 109 GRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 109 G~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
|..++..|+|..|..++..|++
T Consensus 79 a~~~~~~g~~~~A~~~~~~al~ 100 (172)
T PRK02603 79 GIIYASNGEHDKALEYYHQALE 100 (172)
T ss_pred HHHHHHcCCHHHHHHHHHHHHH
Confidence 3344444555555555554443
No 36
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=73.03 E-value=6.4 Score=23.60 Aligned_cols=25 Identities=16% Similarity=0.258 Sum_probs=23.2
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
|.+++.+.+.||+..|..++++.+.
T Consensus 3 LdLA~ayie~Gd~e~Ar~lL~evl~ 27 (44)
T TIGR03504 3 LDLARAYIEMGDLEGARELLEEVIE 27 (44)
T ss_pred hHHHHHHHHcCChHHHHHHHHHHHH
Confidence 6799999999999999999999884
No 37
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=72.62 E-value=12 Score=27.87 Aligned_cols=27 Identities=19% Similarity=0.165 Sum_probs=14.1
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
-+.+|..++..++|..|..+++.|+..
T Consensus 38 ~~~lg~~~~~~g~~~~A~~~~~~al~~ 64 (172)
T PRK02603 38 YYRDGMSAQADGEYAEALENYEEALKL 64 (172)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 344555555555555555555555443
No 38
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=72.44 E-value=13 Score=27.42 Aligned_cols=52 Identities=12% Similarity=0.015 Sum_probs=36.4
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+.--+.+|..+...++|..|..++..|+..... +.. ...++..+|.++.+.|
T Consensus 35 a~~~~~~g~~~~~~g~~~~A~~~~~~al~l~~~----~~~-~~~~~~~lg~~~~~~g 86 (168)
T CHL00033 35 AFTYYRDGMSAQSEGEYAEALQNYYEAMRLEID----PYD-RSYILYNIGLIHTSNG 86 (168)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHhcccc----chh-hHHHHHHHHHHHHHcC
Confidence 444467888888899999999999999876322 123 3456666777766665
No 39
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=72.23 E-value=16 Score=29.92 Aligned_cols=53 Identities=9% Similarity=-0.080 Sum_probs=35.7
Q ss_pred HHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 103 SDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 103 ~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
.=+..+|..+++.+++..|..|+..++...... ... .....-++|..+..+|+
T Consensus 149 ~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~~---~~~-~~~~~~~la~~~~~~G~ 201 (355)
T cd05804 149 WAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCS---SML-RGHNWWHLALFYLERGD 201 (355)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHhhhhccCCC---cch-hHHHHHHHHHHHHHCCC
Confidence 335778999999999999999999988754431 112 23344456666666653
No 40
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=71.66 E-value=20 Score=26.96 Aligned_cols=57 Identities=9% Similarity=0.100 Sum_probs=40.5
Q ss_pred CCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 98 TQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 98 ~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
..+++.+-|.-|..+++.|+|..|++=++.-..++..++ - ...+---|+++.|+.|+
T Consensus 6 ~~~~~~~ly~~a~~~l~~~~Y~~A~~~le~L~~ryP~g~----y-a~qAqL~l~yayy~~~~ 62 (142)
T PF13512_consen 6 PDKSPQELYQEAQEALQKGNYEEAIKQLEALDTRYPFGE----Y-AEQAQLDLAYAYYKQGD 62 (142)
T ss_pred CCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHhcCCCCc----c-cHHHHHHHHHHHHHccC
Confidence 457888999999999999999999877777666665533 1 23444467777777663
No 41
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=70.36 E-value=5.4 Score=32.55 Aligned_cols=80 Identities=16% Similarity=0.054 Sum_probs=53.4
Q ss_pred hHHHHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhC-CCchhHHHHHHHHHhhhhccCCCCCCC
Q psy8176 64 DLSGAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNT-QDFYHTALWMGEALKRHDMERNGTSTP 142 (159)
Q Consensus 64 Dl~~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~-~dy~~A~~Wl~~Al~~~~~~~~~~~~~ 142 (159)
|..+|+..+.+--++|. ..|. ...-+.=|..||..+.+. +++..|.+.++.|++.+..++ .+..
T Consensus 89 ~~~~Ai~~~~~A~~~y~------~~G~-------~~~aA~~~~~lA~~ye~~~~d~e~Ai~~Y~~A~~~y~~e~-~~~~- 153 (282)
T PF14938_consen 89 DPDEAIECYEKAIEIYR------EAGR-------FSQAAKCLKELAEIYEEQLGDYEKAIEYYQKAAELYEQEG-SPHS- 153 (282)
T ss_dssp THHHHHHHHHHHHHHHH------HCT--------HHHHHHHHHHHHHHHCCTT--HHHHHHHHHHHHHHHHHTT--HHH-
T ss_pred CHHHHHHHHHHHHHHHH------hcCc-------HHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHCC-Chhh-
Confidence 66667776666666652 2221 123366677889988888 899999999999999998865 2233
Q ss_pred HHHHHHHHHHhHhhcC
Q psy8176 143 KWEILEYLAYSKFMQD 158 (159)
Q Consensus 143 ~~~il~~la~s~~~~g 158 (159)
..+++..+|....+.|
T Consensus 154 a~~~~~~~A~l~~~l~ 169 (282)
T PF14938_consen 154 AAECLLKAADLYARLG 169 (282)
T ss_dssp HHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHhC
Confidence 5577777777776665
No 42
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=70.19 E-value=12 Score=27.30 Aligned_cols=50 Identities=12% Similarity=0.019 Sum_probs=30.5
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
++.-.+.+|..+...|+|..|..++..|+..-. . ..+.+-.++.++.+.|
T Consensus 57 ~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p-------~-~~~a~~~lg~~l~~~g 106 (144)
T PRK15359 57 SWRAHIALAGTWMMLKEYTTAINFYGHALMLDA-------S-HPEPVYQTGVCLKMMG 106 (144)
T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCC-------C-CcHHHHHHHHHHHHcC
Confidence 355556777777777777777777777775311 1 2344555566555555
No 43
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=67.77 E-value=9.1 Score=30.69 Aligned_cols=47 Identities=13% Similarity=-0.057 Sum_probs=33.4
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+..+|..++..|++..|..|++.+++... . ...++..+|.++.+.|+
T Consensus 217 ~~~la~~~~~lg~~~~Al~~~~~~~~~~p-------~-d~~~~~~~a~~l~~~g~ 263 (280)
T PF13429_consen 217 WDALAAAYLQLGRYEEALEYLEKALKLNP-------D-DPLWLLAYADALEQAGR 263 (280)
T ss_dssp CHHHHHHHHHHT-HHHHHHHHHHHHHHST-------T--HHHHHHHHHHHT----
T ss_pred HHHHHHHhccccccccccccccccccccc-------c-ccccccccccccccccc
Confidence 45779999999999999999999986422 2 45777788888887773
No 44
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=67.64 E-value=24 Score=27.19 Aligned_cols=52 Identities=10% Similarity=0.083 Sum_probs=32.6
Q ss_pred CCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhh
Q psy8176 100 LTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFM 156 (159)
Q Consensus 100 Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~ 156 (159)
....=.+.+|..++..+++..|...++.+++..... .. ....+-.++.+.++
T Consensus 68 ~~~~a~~~la~~~~~~~~~~~A~~~~~~~l~~~p~~----~~-~~~a~~~~g~~~~~ 119 (235)
T TIGR03302 68 YAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNH----PD-ADYAYYLRGLSNYN 119 (235)
T ss_pred hHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCC----Cc-hHHHHHHHHHHHHH
Confidence 333445778888888888888888888877654432 12 33455556666554
No 45
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=66.99 E-value=16 Score=25.43 Aligned_cols=53 Identities=6% Similarity=-0.160 Sum_probs=39.7
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+-++.--+.+|..++..++|..|..+++.++... .+ ..++.-++|.++.+.|+
T Consensus 48 p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~-------p~-~~~~~~~la~~~~~~g~ 100 (135)
T TIGR02552 48 PYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD-------PD-DPRPYFHAAECLLALGE 100 (135)
T ss_pred CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CC-ChHHHHHHHHHHHHcCC
Confidence 3466667788999999999999999998887642 12 34666788888888774
No 46
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=66.25 E-value=67 Score=25.77 Aligned_cols=52 Identities=6% Similarity=-0.020 Sum_probs=42.9
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+.-.+.||+..++.+.|.-|..=++..++.+... .. ..+.|-++..+..++|
T Consensus 175 a~~e~~ia~~Y~~~~~y~AA~~r~~~v~~~Yp~t----~~-~~eal~~l~~ay~~lg 226 (243)
T PRK10866 175 AKYELSVAEYYTKRGAYVAVVNRVEQMLRDYPDT----QA-TRDALPLMENAYRQLQ 226 (243)
T ss_pred HHHHHHHHHHHHHcCchHHHHHHHHHHHHHCCCC----ch-HHHHHHHHHHHHHHcC
Confidence 5667899999999999999998888888877653 34 6788989988888776
No 47
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=66.07 E-value=11 Score=30.83 Aligned_cols=45 Identities=13% Similarity=0.022 Sum_probs=31.9
Q ss_pred HHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 108 LGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 108 iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
-|-.+|.+|.|..|..||+.|+.. +.... ..+.++-+..|..+.|
T Consensus 109 YG~FLC~qg~~~eA~q~F~~Al~~-----P~Y~~-~s~t~eN~G~Cal~~g 153 (250)
T COG3063 109 YGAFLCAQGRPEEAMQQFERALAD-----PAYGE-PSDTLENLGLCALKAG 153 (250)
T ss_pred hhHHHHhCCChHHHHHHHHHHHhC-----CCCCC-cchhhhhhHHHHhhcC
Confidence 577889999999999999999852 22222 3455666666666655
No 48
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=65.88 E-value=21 Score=26.18 Aligned_cols=33 Identities=15% Similarity=0.052 Sum_probs=24.8
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
.-.+.--+.+|..++..++|..|..+++.+++.
T Consensus 28 ~~~~~~~~~la~~~~~~~~~~~A~~~~~~~l~~ 60 (234)
T TIGR02521 28 NKAAKIRVQLALGYLEQGDLEVAKENLDKALEH 60 (234)
T ss_pred CcHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 344556677888888888888888888888754
No 49
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=65.23 E-value=28 Score=23.18 Aligned_cols=53 Identities=9% Similarity=0.065 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
++.-|.+|..+++.++|..|..=+..++...... .. ..++.-.++.+.++.|+
T Consensus 2 ~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~~~~----~~-~~~~~~~l~~~~~~~~~ 54 (119)
T TIGR02795 2 EEAYYDAALLVLKAGDYADAIQAFQAFLKKYPKS----TY-APNAHYWLGEAYYAQGK 54 (119)
T ss_pred cHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCc----cc-cHHHHHHHHHHHHhhcc
Confidence 3566899999999999999999888887654321 12 24555567777766653
No 50
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=64.39 E-value=26 Score=28.22 Aligned_cols=56 Identities=7% Similarity=-0.014 Sum_probs=36.9
Q ss_pred CCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhc
Q psy8176 97 TTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQ 157 (159)
Q Consensus 97 ~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~ 157 (159)
.+++...-=+.+|...|+.++|..|..+++..++.+...+ . ...++=.++.|.+..
T Consensus 64 ~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~----~-~~~a~Y~~g~~~~~~ 119 (243)
T PRK10866 64 FGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNPTHP----N-IDYVLYMRGLTNMAL 119 (243)
T ss_pred CChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCC----c-hHHHHHHHHHhhhhc
Confidence 4456666667888888888888888888888877655532 3 445555555554443
No 51
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=63.92 E-value=66 Score=24.86 Aligned_cols=50 Identities=10% Similarity=0.212 Sum_probs=38.8
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFM 156 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~ 156 (159)
+.=.+.+|..+++.|+|..|..+++.+++.... ..+ +..+.+-...+..-
T Consensus 144 ~~al~~LA~~~~~~g~~~~Ai~~~~~aL~l~~~----~~~-r~~~i~~i~~a~~~ 193 (198)
T PRK10370 144 VTALMLLASDAFMQADYAQAIELWQKVLDLNSP----RVN-RTQLVESINMAKLL 193 (198)
T ss_pred hhHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC----Ccc-HHHHHHHHHHHHHH
Confidence 456789999999999999999999999876554 345 77777766655543
No 52
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=62.01 E-value=18 Score=31.05 Aligned_cols=44 Identities=18% Similarity=0.041 Sum_probs=33.3
Q ss_pred HHHHHHHhCCCchhHHHHHH--HHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 107 ELGRQSYNTQDFYHTALWMG--EALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 107 ~iG~~~y~~~dy~~A~~Wl~--~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
.+|+.++..++|..|..+|+ .+++.. + ..++..+|+..+.+.|+
T Consensus 340 sLg~l~~~~~~~~~A~~~le~a~a~~~~--------p-~~~~~~~La~ll~~~g~ 385 (409)
T TIGR00540 340 ALGQLLMKHGEFIEAADAFKNVAACKEQ--------L-DANDLAMAADAFDQAGD 385 (409)
T ss_pred HHHHHHHHcccHHHHHHHHHHhHHhhcC--------C-CHHHHHHHHHHHHHcCC
Confidence 68999999999999999999 565431 1 23445588888877764
No 53
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=59.60 E-value=38 Score=20.88 Aligned_cols=31 Identities=10% Similarity=0.034 Sum_probs=27.9
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
++.-.+..|...+..|+|..|...++.++..
T Consensus 28 ~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~ 58 (73)
T PF13371_consen 28 DPELWLQRARCLFQLGRYEEALEDLERALEL 58 (73)
T ss_pred cchhhHHHHHHHHHhccHHHHHHHHHHHHHH
Confidence 5677888999999999999999999999864
No 54
>KOG3617|consensus
Probab=59.52 E-value=30 Score=33.73 Aligned_cols=113 Identities=17% Similarity=0.228 Sum_probs=70.5
Q ss_pred hhhhhchhHHHHHHHhhhhhHHHHHHHHHhhhhHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHHHhCCChHHhhccc
Q psy8176 11 PEYLANPINAFLVVKRLTLDWKQAEHYMKDHVYQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQDTYKLETASVARGE 90 (159)
Q Consensus 11 e~yl~nPlnaF~LirRl~~dW~~~~~~~~~~~~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~G~ 90 (159)
+-|-.--+++|-.-|-|..+=+.++.|++....... ++.+-.+ +-+.-|+++|..=-..-.|-|.+=--.--.|+
T Consensus 879 eyyEK~~~hafev~rmL~e~p~~~e~Yv~~~~d~~L---~~WWgqY--lES~GemdaAl~~Y~~A~D~fs~VrI~C~qGk 953 (1416)
T KOG3617|consen 879 EYYEKAGVHAFEVFRMLKEYPKQIEQYVRRKRDESL---YSWWGQY--LESVGEMDAALSFYSSAKDYFSMVRIKCIQGK 953 (1416)
T ss_pred HHHHhcCChHHHHHHHHHhChHHHHHHHHhccchHH---HHHHHHH--HhcccchHHHHHHHHHhhhhhhheeeEeeccC
Confidence 344455677888877777888899999986554433 3334455 66677777776433333333333222222333
Q ss_pred cCCcc--cCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHH
Q psy8176 91 LNGVQ--YTTQLTASDCFELGRQSYNTQDFYHTALWMGEA 128 (159)
Q Consensus 91 l~g~~--~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~A 128 (159)
++.-. .+-+=+-.-||.|||++-+.+++..|+..+..|
T Consensus 954 ~~kAa~iA~esgd~AAcYhlaR~YEn~g~v~~Av~FfTrA 993 (1416)
T KOG3617|consen 954 TDKAARIAEESGDKAACYHLARMYENDGDVVKAVKFFTRA 993 (1416)
T ss_pred chHHHHHHHhcccHHHHHHHHHHhhhhHHHHHHHHHHHHH
Confidence 33100 012234568999999999999999999999765
No 55
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=59.43 E-value=35 Score=24.60 Aligned_cols=49 Identities=14% Similarity=0.143 Sum_probs=33.8
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
-+.+|..+-+.|++..|..+|+.++..+...+. ...+--++|.+++..|
T Consensus 41 ~i~lastlr~LG~~deA~~~L~~~~~~~p~~~~-----~~~l~~f~Al~L~~~g 89 (120)
T PF12688_consen 41 LIQLASTLRNLGRYDEALALLEEALEEFPDDEL-----NAALRVFLALALYNLG 89 (120)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccc-----cHHHHHHHHHHHHHCC
Confidence 456888899999999999999999876544221 2233345566666555
No 56
>PF12455 Dynactin: Dynein associated protein ; InterPro: IPR022157 This domain family is found in eukaryotes, and is approximately 280 amino acids in length. The family is found in association with PF01302 from PFAM. There is a single completely conserved residue E that may be functionally important. Dynactin has been associated with Dynein, a kinesin protein which is involved in organelle transport, mitotic spindle assembly and chromosome segregation. Dynactin anchors Dynein to specific subcellular structures.
Probab=59.25 E-value=92 Score=25.70 Aligned_cols=105 Identities=13% Similarity=0.117 Sum_probs=69.7
Q ss_pred hhhhh---hchhHHHHHHHhhhhhHHHHHHHHHhhhhHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHHHhCCChHHh
Q psy8176 10 VPEYL---ANPINAFLVVKRLTLDWKQAEHYMKDHVYQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQDTYKLETASV 86 (159)
Q Consensus 10 ~e~yl---~nPlnaF~LirRl~~dW~~~~~~~~~~~~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~~Y~L~~~dl 86 (159)
|+.|+ .++|.++.+++|+..-=.-+...+.+.. .... -.. -+..+++..++..+.+|-.+ ..+
T Consensus 69 Pd~f~~gD~D~il~lLl~~Rl~~Ka~Ll~~~i~e~~------~~~~-~~~--~~~~e~~~~~~~~~~~L~~l-----~~~ 134 (274)
T PF12455_consen 69 PDSFFRGDHDSILALLLFKRLAFKAELLASQIREKG------DREE-SLK--GHVAEQLSFACRLIYKLSWL-----QAL 134 (274)
T ss_pred ChhhhcccchhHHHHHHHHHHHHHHHHHHHHHHhhc------cccc-CCC--CchHHHHHHHHHHHHHHHHH-----HHH
Confidence 77787 5899999999999975555555554330 0000 011 45578899888888777544 333
Q ss_pred hccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 87 ARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 87 a~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
.+-.... -..-|++.+..+|..+++..-..++..=+-+.|++
T Consensus 135 ~~rf~~~---l~~Cs~E~f~k~g~~~~Em~~~Er~lD~~IdlLk~ 176 (274)
T PF12455_consen 135 CHRFESA---LSRCSVEQFLKMGGLYPEMEPVERALDSWIDLLKK 176 (274)
T ss_pred HHHHHHH---HccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 2222111 13467999999999999999999999855555544
No 57
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=57.90 E-value=14 Score=22.70 Aligned_cols=31 Identities=10% Similarity=0.108 Sum_probs=26.4
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
++.=++.+|+.++..|++..|...++.++..
T Consensus 24 ~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~ 54 (68)
T PF14559_consen 24 NPEARLLLAQCYLKQGQYDEAEELLERLLKQ 54 (68)
T ss_dssp SHHHHHHHHHHHHHTT-HHHHHHHHHCCHGG
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 4777889999999999999999999987754
No 58
>KOG2940|consensus
Probab=56.90 E-value=13 Score=30.76 Aligned_cols=53 Identities=25% Similarity=0.304 Sum_probs=40.0
Q ss_pred HHHhhhhhHHHHHH-----HHHhhhhHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHH
Q psy8176 23 VVKRLTLDWKQAEH-----YMKDHVYQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQD 77 (159)
Q Consensus 23 LirRl~~dW~~~~~-----~~~~~~~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~ 77 (159)
=+||++.||..|.. +++..+|.-....+...+.. +|+.-|+-....++-|.-.
T Consensus 34 ~~KR~qrdrAa~~~d~k~dylkeeig~rlaDrvfD~kk~--fp~a~diGcs~G~v~rhl~ 91 (325)
T KOG2940|consen 34 DLKRIQRDRAAWLSDQKNDYLKEEIGDRLADRVFDCKKS--FPTAFDIGCSLGAVKRHLR 91 (325)
T ss_pred HHHHHHHhHHhhcchhhhhHHHHHHHHHHHHHHHHHhhh--CcceeecccchhhhhHHHH
Confidence 47999999997754 55556666666667677766 9999999888888777543
No 59
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=56.02 E-value=76 Score=23.06 Aligned_cols=25 Identities=20% Similarity=0.287 Sum_probs=11.9
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
+.+|..++..+++..|..++..++.
T Consensus 139 ~~l~~~~~~~g~~~~A~~~~~~~~~ 163 (234)
T TIGR02521 139 ENAGLCALKAGDFDKAEKYLTRALQ 163 (234)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3344444455555555555544443
No 60
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=54.99 E-value=45 Score=24.23 Aligned_cols=52 Identities=13% Similarity=0.191 Sum_probs=37.0
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
++.+..=+..|..++..|+|..|..-+..++..- +. ..+++..+|.+..+.|
T Consensus 21 ~~~p~~~~~~g~~~~~~g~~~~A~~~~~~al~~~-------P~-~~~a~~~lg~~~~~~g 72 (144)
T PRK15359 21 SVDPETVYASGYASWQEGDYSRAVIDFSWLVMAQ-------PW-SWRAHIALAGTWMMLK 72 (144)
T ss_pred HcCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC-------CC-cHHHHHHHHHHHHHHh
Confidence 3455556678999999999999999999987531 12 3456666666666555
No 61
>KOG1173|consensus
Probab=53.84 E-value=13 Score=34.21 Aligned_cols=33 Identities=18% Similarity=0.248 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+.=-=++|-++|+.++|..|..||+-|+...+.
T Consensus 414 plv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~ 446 (611)
T KOG1173|consen 414 PLVLHELGVVAYTYEEYPEALKYFQKALEVIKS 446 (611)
T ss_pred chhhhhhhheeehHhhhHHHHHHHHHHHHHhhh
Confidence 334447899999999999999999999966654
No 62
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=50.41 E-value=43 Score=24.11 Aligned_cols=32 Identities=19% Similarity=0.344 Sum_probs=22.4
Q ss_pred CCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 100 LTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 100 Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
....--|.+|+.++..|+|..|..=|+.++..
T Consensus 46 ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~ 77 (145)
T PF09976_consen 46 YAALAALQLAKAAYEQGDYDEAKAALEKALAN 77 (145)
T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhh
Confidence 44555677777888888888777777666653
No 63
>KOG1840|consensus
Probab=49.75 E-value=48 Score=30.01 Aligned_cols=66 Identities=11% Similarity=-0.063 Sum_probs=45.8
Q ss_pred CcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 93 GVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 93 g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+..++.+-=...|-.||..+...++|..|..|.+.|++.+.+.......--...+.-+|.++..++
T Consensus 190 ~~~~~~P~~~~~~~~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~ 255 (508)
T KOG1840|consen 190 GLGDEDPERLRTLRNLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLG 255 (508)
T ss_pred hcccCCchHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhc
Confidence 344455566777888999999999999999999999999766433222103344555676665544
No 64
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=49.03 E-value=39 Score=27.00 Aligned_cols=50 Identities=14% Similarity=-0.083 Sum_probs=35.1
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
++.=.+.+|..+.+.|++..|...++.|++...+ ..++...|++.+...|
T Consensus 145 ~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~P~--------~~~~~~~l~~~li~~~ 194 (280)
T PF13429_consen 145 SARFWLALAEIYEQLGDPDKALRDYRKALELDPD--------DPDARNALAWLLIDMG 194 (280)
T ss_dssp -HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-TT---------HHHHHHHHHHHCTTC
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC--------CHHHHHHHHHHHHHCC
Confidence 4555668899999999999999999999876432 3455666666665554
No 65
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=47.19 E-value=25 Score=21.53 Aligned_cols=29 Identities=21% Similarity=0.217 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHhCC-CchhHHHHHHHHHhh
Q psy8176 103 SDCFELGRQSYNTQ-DFYHTALWMGEALKR 131 (159)
Q Consensus 103 ~DC~~iG~~~y~~~-dy~~A~~Wl~~Al~~ 131 (159)
.=-+.+|...+..+ +|..|..-++.|++.
T Consensus 38 ~~~~~~g~~~~~~~~~~~~A~~~~~~al~l 67 (69)
T PF13414_consen 38 EAYYNLGLAYMKLGKDYEEAIEDFEKALKL 67 (69)
T ss_dssp HHHHHHHHHHHHTTTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhCccHHHHHHHHHHHHHc
Confidence 34578999999999 799999999999864
No 66
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=47.12 E-value=47 Score=21.03 Aligned_cols=33 Identities=9% Similarity=0.031 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+..+...|..+=+.|+|..|..+...|++.+..
T Consensus 5 A~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~l~~ 37 (69)
T PF04212_consen 5 AIELIKKAVEADEAGNYEEALELYKEAIEYLMQ 37 (69)
T ss_dssp HHHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence 567888899999999999999999999988765
No 67
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=46.41 E-value=39 Score=20.89 Aligned_cols=28 Identities=18% Similarity=0.408 Sum_probs=22.9
Q ss_pred HH-HHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 104 DC-FELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 104 DC-~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
|| |.++...|+.++|..|..-....|+.
T Consensus 2 d~lY~lAig~ykl~~Y~~A~~~~~~lL~~ 30 (53)
T PF14853_consen 2 DCLYYLAIGHYKLGEYEKARRYCDALLEI 30 (53)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHhhhHHHHHHHHHHHHhh
Confidence 45 56788889999999999999998864
No 68
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=43.32 E-value=1.1e+02 Score=21.07 Aligned_cols=34 Identities=6% Similarity=-0.100 Sum_probs=28.5
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
++.-=+.+|..++..+++..|..|++.+++....
T Consensus 84 ~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~ 117 (135)
T TIGR02552 84 DPRPYFHAAECLLALGEPESALKALDLAIEICGE 117 (135)
T ss_pred ChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccc
Confidence 4555588999999999999999999999886443
No 69
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=42.66 E-value=81 Score=26.75 Aligned_cols=33 Identities=6% Similarity=0.076 Sum_probs=28.4
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
+-++.-.+.+|..++..|+|..|...++.|+..
T Consensus 67 P~~~~a~~~lg~~~~~lg~~~eA~~~~~~al~l 99 (356)
T PLN03088 67 PSLAKAYLRKGTACMKLEEYQTAKAALEKGASL 99 (356)
T ss_pred cCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHh
Confidence 346777888999999999999999999999864
No 70
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=42.56 E-value=69 Score=28.95 Aligned_cols=28 Identities=4% Similarity=0.098 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 103 SDCFELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 103 ~DC~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
.-.+.+|..++..|+|..|...++.|+.
T Consensus 400 ~~~~~lg~~~~~~g~~~~A~~~~~kal~ 427 (615)
T TIGR00990 400 DIYYHRAQLHFIKGEFAQAGKDYQKSID 427 (615)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3344455555555555555555555443
No 71
>PRK11189 lipoprotein NlpI; Provisional
Probab=40.91 E-value=71 Score=26.12 Aligned_cols=30 Identities=17% Similarity=0.021 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
+.=-+.+|..+...++|..|..++..|++.
T Consensus 98 ~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l 127 (296)
T PRK11189 98 ADAYNYLGIYLTQAGNFDAAYEAFDSVLEL 127 (296)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 444577899999999999999999999864
No 72
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=40.76 E-value=49 Score=22.32 Aligned_cols=35 Identities=9% Similarity=-0.060 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMER 136 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~ 136 (159)
|..|...|..+=+.|+-..|..|++.+++.+.++-
T Consensus 8 A~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~l~eg~ 42 (79)
T cd02679 8 AFEEISKALRADEWGDKEQALAHYRKGLRELEEGI 42 (79)
T ss_pred HHHHHHHHhhhhhcCCHHHHHHHHHHHHHHHHHHc
Confidence 67899999999999999999999999999998753
No 73
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=40.68 E-value=1.1e+02 Score=23.66 Aligned_cols=51 Identities=24% Similarity=0.178 Sum_probs=30.3
Q ss_pred CHHHHHHHHHHH-HhCCC--chhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 101 TASDCFELGRQS-YNTQD--FYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 101 s~~DC~~iG~~~-y~~~d--y~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
++.-.+.+|..+ +..++ +..|...++.|++.- ++ ..+++-.||.+.++.|+
T Consensus 106 ~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~d-------P~-~~~al~~LA~~~~~~g~ 159 (198)
T PRK10370 106 NAELYAALATVLYYQAGQHMTPQTREMIDKALALD-------AN-EVTALMLLASDAFMQAD 159 (198)
T ss_pred CHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhC-------CC-ChhHHHHHHHHHHHcCC
Confidence 445555566653 34444 356666666665431 22 45778888888888775
No 74
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=40.48 E-value=2.1e+02 Score=23.54 Aligned_cols=30 Identities=10% Similarity=0.102 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
+.--+.+|..+...+++..|..+++.++..
T Consensus 214 ~~~~~~la~~~~~~g~~~~A~~~~~~~~~~ 243 (389)
T PRK11788 214 VRASILLGDLALAQGDYAAAIEALERVEEQ 243 (389)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 344567888888888888888888887753
No 75
>cd09247 BRO1_Alix_like_2 Protein-interacting Bro1-like domain of an Uncharacterized family of the BRO1_Alix_like superfamily. This domain family is comprised of uncharacterized proteins. It belongs to the BRO1_Alix_like superfamily which includes the Bro1-like domains of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, Ustilago maydis Rim23 (also known as PalC), and related domains. Alix, HD-PTP, Brox, Bro1, Rim20 and Rim23 interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Alix participates in membrane remodeling processes during the budding of enveloped viruses, vesicle budding inside late endosomal multivesicular bodies (MVBs), and the abscission reactions of mammalian cell division. It also functions in apoptosis. HD-PTP and Bro1 function in
Probab=40.36 E-value=45 Score=28.36 Aligned_cols=35 Identities=17% Similarity=0.086 Sum_probs=31.2
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhcc
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHDME 135 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~ 135 (159)
.+.-|+.+|..+.+.+.|..|+.|++.|...++..
T Consensus 252 ~A~A~~~~a~~~~~~~k~GeaIa~L~~A~~~l~~~ 286 (346)
T cd09247 252 EARSQLYLARRLKEAGHIGVAVGVLREALRNLKKK 286 (346)
T ss_pred HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHh
Confidence 57789999999999999999999999999877653
No 76
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=39.95 E-value=92 Score=28.12 Aligned_cols=49 Identities=16% Similarity=0.070 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+..++.+|..++..++|..|..-++.|++.- +. ..+++..++.+++..|
T Consensus 365 ~~~~~~la~~~~~~g~~~eA~~~~~~al~~~-------p~-~~~~~~~lg~~~~~~g 413 (615)
T TIGR00990 365 TQSYIKRASMNLELGDPDKAEEDFDKALKLN-------SE-DPDIYYHRAQLHFIKG 413 (615)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhC-------CC-CHHHHHHHHHHHHHcC
Confidence 4455677777777777777777777776541 12 3355556666665555
No 77
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=38.73 E-value=3.1e+02 Score=27.11 Aligned_cols=30 Identities=17% Similarity=0.053 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
+.=.+.+|..++..+++..|..-++.|++.
T Consensus 303 ~~a~~~Lg~~~~~~g~~~eA~~~l~~Al~~ 332 (1157)
T PRK11447 303 SEALGALGQAYSQQGDRARAVAQFEKALAL 332 (1157)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 444566777777777777777777776654
No 78
>TIGR01987 HI0074 nucleotidyltransferase substrate binding protein, HI0074 family. The member of this family from Haemophilus influenzae, HI0074, has been shown by crystal structure to resemble nucleotidyltransferase substrate binding proteins. It forms a complex with HI0073, encoded by the adjacent gene and containing a nucleotidyltransferase nucleotide binding domain (pfam01909).
Probab=38.69 E-value=1.3e+02 Score=21.91 Aligned_cols=70 Identities=16% Similarity=0.259 Sum_probs=40.9
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCChHHhhcc-ccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 59 FPTDEDLSGAATALLRLQDTYKLETASVARG-ELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 59 ~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~G-~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
.|..+|+..-...|.|..=||.|.=+-|-.= ...|..-....|+.||+.. |++.|--.....|+.---.|
T Consensus 18 ~~~~~~~~~~~g~IqrFE~t~ELaWK~lK~~L~~~G~~~~~~~spr~~ir~---A~~~glI~d~~~W~~ml~~R 88 (123)
T TIGR01987 18 FDLTNDITIIDGAIQKFEFTFELAWKLMKRYLAQEGINDIGAYSPKDVLKE---AFRAGLIGDESLWIAMLDDR 88 (123)
T ss_pred cCccchHHHHHHHHHHhhhHHHHHHHHHHHHHHHcCCcccccCCHHHHHHH---HHHcCCcCCHHHHHHHHHHh
Confidence 4555555233344567777777654443321 0123221126789999975 56777777778899876554
No 79
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=38.50 E-value=2.4e+02 Score=25.35 Aligned_cols=33 Identities=12% Similarity=-0.142 Sum_probs=25.1
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
..++.-.+.+|..++..++|..|..+++.++..
T Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~A~~~~~~a~~~ 154 (899)
T TIGR02917 122 EGAAELLALRGLAYLGLGQLELAQKSYEQALAI 154 (899)
T ss_pred hhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHhc
Confidence 345666778888888888888888888887653
No 80
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.53 E-value=31 Score=27.50 Aligned_cols=30 Identities=23% Similarity=0.498 Sum_probs=23.8
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
+-|+.+ +||..+|+.|||..|..||.+-..
T Consensus 166 R~sArE--ALglAa~kagd~a~A~~~F~qia~ 195 (221)
T COG4649 166 RHSARE--ALGLAAYKAGDFAKAKSWFVQIAN 195 (221)
T ss_pred HHHHHH--HHhHHHHhccchHHHHHHHHHHHc
Confidence 445555 567788999999999999987654
No 81
>cd09245 BRO1_UmRIM23-like Protein-interacting, Bro1-like domain of Ustilago maydis Rim23 (PalC), and related domains. This family contains the Bro1-like domain of Ustilago maydis Rim23 (also known as PalC), and related proteins. It belongs to the BRO1_Alix_like superfamily which includes the Bro1-like domains of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, and related domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23 interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Rim20 and Rim23 participate in the response to the external pH via the Rim101 pathway. Through its Bro1-like domain, Rim23 allows the interaction between the endosomal and plasma membrane complexes. Bro1-like domains are boomerang-shape, and part of the domain is a tetratricop
Probab=37.50 E-value=47 Score=29.15 Aligned_cols=33 Identities=21% Similarity=0.214 Sum_probs=29.8
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+.-|+.+|..+.+.+.|..|+.||+.|...++.
T Consensus 296 A~A~~~~g~d~~e~~k~GeaIa~L~~A~~~L~~ 328 (413)
T cd09245 296 ALACKFLGIDAGENGKVGEAIGWLRAAKKELED 328 (413)
T ss_pred HHHHHHHHHhhHhcCCHHHHHHHHHHHHHHHHH
Confidence 778999999999999999999999999886543
No 82
>KOG1840|consensus
Probab=37.02 E-value=2.4e+02 Score=25.59 Aligned_cols=57 Identities=14% Similarity=0.046 Sum_probs=44.3
Q ss_pred HHHHHH-HHHHHHhCCCchhHHHHHHHHHhhhhccCCC-CCCCHHHHHHHHHHhHhhcCC
Q psy8176 102 ASDCFE-LGRQSYNTQDFYHTALWMGEALKRHDMERNG-TSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 102 ~~DC~~-iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~-~~~~~~~il~~la~s~~~~g~ 159 (159)
...|.. +|..+-..+.|.+|+.=++.|+......-.. .+. ...++.-||.+++++|+
T Consensus 240 va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~-va~~l~nLa~ly~~~GK 298 (508)
T KOG1840|consen 240 VASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPA-VAATLNNLAVLYYKQGK 298 (508)
T ss_pred HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHHHHHHhccCC
Confidence 444555 9999999999999999999999888753221 233 56789999999988874
No 83
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=36.96 E-value=2.2e+02 Score=24.13 Aligned_cols=17 Identities=6% Similarity=-0.138 Sum_probs=11.3
Q ss_pred CchhHHHHHHHHHhhhh
Q psy8176 117 DFYHTALWMGEALKRHD 133 (159)
Q Consensus 117 dy~~A~~Wl~~Al~~~~ 133 (159)
+...+..|+..+..++.
T Consensus 102 ~~~~~~~~l~~~~~kl~ 118 (356)
T PLN03088 102 GDSRFTKLIKECDEKIA 118 (356)
T ss_pred CCHHHHHHHHHHHHHHH
Confidence 55667778877766654
No 84
>PRK12370 invasion protein regulator; Provisional
Probab=35.99 E-value=82 Score=28.24 Aligned_cols=45 Identities=18% Similarity=0.147 Sum_probs=30.5
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+.+|..+...+++..|...++.|++.-. + ..+++-++|.+++..|
T Consensus 342 ~~lg~~~~~~g~~~~A~~~~~~Al~l~P-------~-~~~a~~~lg~~l~~~G 386 (553)
T PRK12370 342 GLLGLINTIHSEYIVGSLLFKQANLLSP-------I-SADIKYYYGWNLFMAG 386 (553)
T ss_pred HHHHHHHHHccCHHHHHHHHHHHHHhCC-------C-CHHHHHHHHHHHHHCC
Confidence 4577788888888888888888876421 2 3445556666666655
No 85
>KOG1156|consensus
Probab=35.97 E-value=57 Score=30.57 Aligned_cols=47 Identities=17% Similarity=0.135 Sum_probs=36.4
Q ss_pred HHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 104 DCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 104 DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
=|-.+|...-..++|..|..|++.|+.. ..+ ..+|+.-||.-..|+|
T Consensus 77 CwHv~gl~~R~dK~Y~eaiKcy~nAl~~-------~~d-N~qilrDlslLQ~QmR 123 (700)
T KOG1156|consen 77 CWHVLGLLQRSDKKYDEAIKCYRNALKI-------EKD-NLQILRDLSLLQIQMR 123 (700)
T ss_pred hHHHHHHHHhhhhhHHHHHHHHHHHHhc-------CCC-cHHHHHHHHHHHHHHH
Confidence 3456788889999999999999999863 123 5678888887777665
No 86
>cd04087 PTPA Phosphotyrosyl phosphatase activator (PTPA) is also known as protein phosphatase 2A (PP2A) phosphatase activator. PTPA is an essential, well conserved protein that stimulates the tyrosyl phosphatase activity of PP2A. It also reactivates the serine/threonine phosphatase activity of an inactive form of PP2A. Together, PTPA and PP2A constitute an ATPase. It has been suggested that PTPA alters the relative specificity of PP2A from phosphoserine/phosphothreonine substrates to phosphotyrosine substrates in an ATP-hydrolysis-dependent manner. Basal expression of PTPA is controlled by the transcription factor Yin Yang1 (YY1). PTPA has been suggested to play a role in the insertion of metals to the PP2A catalytic subunit (PP2Ac) active site, to act as a chaperone, and more recently, to have peptidyl prolyl cis/trans isomerase activity that specifically targets human PP2Ac.
Probab=35.60 E-value=20 Score=29.81 Aligned_cols=57 Identities=21% Similarity=0.289 Sum_probs=31.3
Q ss_pred HHHHHHHHhCCChHHhhcccc--C---------C-ccc--CCCCCHHHHHHHHHH--HHhCCCchhHHHHHHHH
Q psy8176 71 ALLRLQDTYKLETASVARGEL--N---------G-VQY--TTQLTASDCFELGRQ--SYNTQDFYHTALWMGEA 128 (159)
Q Consensus 71 al~RLQ~~Y~L~~~dla~G~l--~---------g-~~~--~~~Ls~~DC~~iG~~--~y~~~dy~~A~~Wl~~A 128 (159)
=+.|||.+|.|+|+= +.|.. + | .|. ...+++.++..-.-. ..+..-|-.|..++...
T Consensus 145 l~R~Lq~~Y~LEPAG-SHGvWGLDDy~fLPfi~GsaQL~~~~~~~P~~i~~~~~v~~~~~~ymy~~~I~fI~~v 217 (266)
T cd04087 145 LVRRLQLTYRLEPAG-SHGVWGLDDYQFLPFIFGSAQLINHKPLKPKSILDPEIVEEYKKDYLYLSCIAFINKV 217 (266)
T ss_pred HHHHHHHHhccccCC-CCCCCcccccccchhhhhHHHhccCCCCCchhhcCHHHHHHhhhcccHHHHHHHHHHc
Confidence 357999999999974 33432 1 1 111 234566666532222 22344456777777653
No 87
>KOG1855|consensus
Probab=35.09 E-value=37 Score=30.22 Aligned_cols=38 Identities=24% Similarity=0.455 Sum_probs=31.7
Q ss_pred hhhhcChhhhhh-chhHHHHHHHhhhhhHHHHHHHHHhh
Q psy8176 4 AEAAKDVPEYLA-NPINAFLVVKRLTLDWKQAEHYMKDH 41 (159)
Q Consensus 4 ~~~~~~~e~yl~-nPlnaF~LirRl~~dW~~~~~~~~~~ 41 (159)
.....+++.||. +=|.+|+=||.|+.||..+-.-++..
T Consensus 165 khvrrnkeGyVpv~~vaSFKKvK~LTrd~~~va~ALr~S 203 (484)
T KOG1855|consen 165 KHVRRNKEGYVPVKLVASFKKVKALTRDWKLVADALRKS 203 (484)
T ss_pred HHHhcCCCCceeeehhhhHHHHHHHhhhhHHHHHHHhhc
Confidence 345678899985 56789999999999999999888753
No 88
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=35.05 E-value=1.2e+02 Score=21.64 Aligned_cols=75 Identities=15% Similarity=0.033 Sum_probs=47.6
Q ss_pred CChhhHHHHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCC
Q psy8176 60 PTDEDLSGAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGT 139 (159)
Q Consensus 60 P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~ 139 (159)
-...|+++|...+..+.+.. .+..+.+.--+.||+..+..++|..|...++.. .. .
T Consensus 59 ~~~g~~~~A~~~l~~~~~~~----------------~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~~----~~----~ 114 (145)
T PF09976_consen 59 YEQGDYDEAKAALEKALANA----------------PDPELKPLARLRLARILLQQGQYDEALATLQQI----PD----E 114 (145)
T ss_pred HHCCCHHHHHHHHHHHHhhC----------------CCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhc----cC----c
Confidence 34456666666665543321 122344556678999999999999999888541 11 1
Q ss_pred CCCHHHHHHHHHHhHhhcCC
Q psy8176 140 STPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 140 ~~~~~~il~~la~s~~~~g~ 159 (159)
.- ...+.+.++.+++..|+
T Consensus 115 ~~-~~~~~~~~Gdi~~~~g~ 133 (145)
T PF09976_consen 115 AF-KALAAELLGDIYLAQGD 133 (145)
T ss_pred ch-HHHHHHHHHHHHHHCCC
Confidence 22 45677777888777774
No 89
>PRK11189 lipoprotein NlpI; Provisional
Probab=34.84 E-value=48 Score=27.13 Aligned_cols=28 Identities=21% Similarity=0.185 Sum_probs=25.4
Q ss_pred HHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 104 DCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 104 DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
-.|.+|..+.+.|++..|..|++.|+..
T Consensus 238 a~~~Lg~~~~~~g~~~~A~~~~~~Al~~ 265 (296)
T PRK11189 238 TYFYLAKYYLSLGDLDEAAALFKLALAN 265 (296)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 3589999999999999999999999853
No 90
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=34.48 E-value=54 Score=26.29 Aligned_cols=31 Identities=26% Similarity=0.249 Sum_probs=26.4
Q ss_pred CHHHHHHHHHHHHh----CCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYN----TQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~----~~dy~~A~~Wl~~Al~~ 131 (159)
.+.-|+.||..+.. ..|+..|..|++.|.++
T Consensus 108 ~~~a~~~lg~~~~~G~gv~~d~~~A~~~~~~Aa~~ 142 (292)
T COG0790 108 LAEALFNLGLMYANGRGVPLDLVKALKYYEKAAKL 142 (292)
T ss_pred cHHHHHhHHHHHhcCCCcccCHHHHHHHHHHHHHc
Confidence 36789999999988 55999999999998764
No 91
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=34.31 E-value=1.2e+02 Score=27.29 Aligned_cols=53 Identities=23% Similarity=0.203 Sum_probs=38.9
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+-++.=.+.+|..++..|+|..|..+++.+++. .+. ..++...++.++...|+
T Consensus 53 p~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~-------~~~-~~~~~~~~a~~~~~~g~ 105 (899)
T TIGR02917 53 PNDAEARFLLGKIYLALGDYAAAEKELRKALSL-------GYP-KNQVLPLLARAYLLQGK 105 (899)
T ss_pred CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc-------CCC-hhhhHHHHHHHHHHCCC
Confidence 457777788999999999999999998888753 122 34566678888877764
No 92
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=33.40 E-value=1.4e+02 Score=24.21 Aligned_cols=89 Identities=18% Similarity=0.156 Sum_probs=56.0
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCC
Q psy8176 59 FPTDEDLSGAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNG 138 (159)
Q Consensus 59 ~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~ 138 (159)
+|..--=.+|-..++++|+. |+|..=.+---. ....-=|...-+.+|..+-+.|+|..|+.=+++|+.-.
T Consensus 50 LPEl~~~R~a~~~~~a~~q~--ldP~R~~Rea~~--~~~~ApTvqnr~rLa~al~elGr~~EA~~hy~qalsG~------ 119 (251)
T COG4700 50 LPELGADRHAHTLLMALQQK--LDPERHLREATE--ELAIAPTVQNRYRLANALAELGRYHEAVPHYQQALSGI------ 119 (251)
T ss_pred hhHhcccchhHHHHHHHHHh--cChhHHHHHHHH--HHhhchhHHHHHHHHHHHHHhhhhhhhHHHHHHHhccc------
Confidence 45444445677777788776 555432211100 01122357778999999999999999999999998422
Q ss_pred CCCCHHHHHHHHHHhHhhcC
Q psy8176 139 TSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 139 ~~~~~~~il~~la~s~~~~g 158 (159)
... ...+|--+|.+.|.++
T Consensus 120 fA~-d~a~lLglA~Aqfa~~ 138 (251)
T COG4700 120 FAH-DAAMLLGLAQAQFAIQ 138 (251)
T ss_pred cCC-CHHHHHHHHHHHHhhc
Confidence 222 3455656666666654
No 93
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=31.74 E-value=1.7e+02 Score=20.97 Aligned_cols=49 Identities=14% Similarity=-0.033 Sum_probs=37.7
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
|.+|-..-..|+...|+..++.|+.. ...+.. +.+.+-.|+.++..+|+
T Consensus 5 ~~~A~a~d~~G~~~~Ai~~Y~~Al~~----gL~~~~-~~~a~i~lastlr~LG~ 53 (120)
T PF12688_consen 5 YELAWAHDSLGREEEAIPLYRRALAA----GLSGAD-RRRALIQLASTLRNLGR 53 (120)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHc----CCCchH-HHHHHHHHHHHHHHcCC
Confidence 67788888999999999999999863 122334 66777788888887774
No 94
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=31.70 E-value=1.2e+02 Score=25.48 Aligned_cols=81 Identities=17% Similarity=0.178 Sum_probs=54.9
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCC-
Q psy8176 59 FPTDEDLSGAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERN- 137 (159)
Q Consensus 59 ~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~- 137 (159)
.|.+.+++..+.+ ||..-.-+|. ++..-..||+.+...+++..|..=+..|+++..+...
T Consensus 132 ~~~~~~~~~l~a~---Le~~L~~nP~----------------d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~ 192 (287)
T COG4235 132 PPAEQEMEALIAR---LETHLQQNPG----------------DAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEI 192 (287)
T ss_pred CCCcccHHHHHHH---HHHHHHhCCC----------------CchhHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHH
Confidence 5776667665544 4444333333 3556788999999999999999988888876433210
Q ss_pred -----------------------------CCCCCHHHHHHHHHHhHhhcCC
Q psy8176 138 -----------------------------GTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 138 -----------------------------~~~~~~~~il~~la~s~~~~g~ 159 (159)
...+ ....+-+||+..|++|+
T Consensus 193 ~~g~aeaL~~~a~~~~ta~a~~ll~~al~~D~~-~iral~lLA~~afe~g~ 242 (287)
T COG4235 193 LLGLAEALYYQAGQQMTAKARALLRQALALDPA-NIRALSLLAFAAFEQGD 242 (287)
T ss_pred HHHHHHHHHHhcCCcccHHHHHHHHHHHhcCCc-cHHHHHHHHHHHHHccc
Confidence 1122 56678899999999885
No 95
>PF04976 DmsC: DMSO reductase anchor subunit (DmsC); InterPro: IPR007059 The terminal electron transfer enzyme dimethyl sulphoxide reductase of Escherichia coli is a heterotrimeric enzyme composed of a membrane extrinsic catalytic dimer (DmsAB) and a membrane intrinsic polytopic anchor subunit (DmsC) []. This family represents DmsC.; GO: 0019645 anaerobic electron transport chain, 0016021 integral to membrane
Probab=31.61 E-value=25 Score=28.95 Aligned_cols=28 Identities=25% Similarity=0.421 Sum_probs=23.8
Q ss_pred hhhhchhHHHHHHHhhhhhHHHHHHHHH
Q psy8176 12 EYLANPINAFLVVKRLTLDWKQAEHYMK 39 (159)
Q Consensus 12 ~yl~nPlnaF~LirRl~~dW~~~~~~~~ 39 (159)
-++++|.++|.-++|+.+.|-.=|-+.-
T Consensus 63 ~HLG~P~~A~~al~~~~tSwLSrEi~~~ 90 (276)
T PF04976_consen 63 FHLGRPLRAFRALSNLGTSWLSREILFA 90 (276)
T ss_pred HHcCChHHHHHHHhccCchHHHHHHHHH
Confidence 4789999999999999999987766554
No 96
>COG3302 DmsC DMSO reductase anchor subunit [General function prediction only]
Probab=31.05 E-value=31 Score=28.88 Aligned_cols=26 Identities=27% Similarity=0.452 Sum_probs=21.8
Q ss_pred hhhhchhHHHHHHHhhhhhHHHHHHH
Q psy8176 12 EYLANPINAFLVVKRLTLDWKQAEHY 37 (159)
Q Consensus 12 ~yl~nPlnaF~LirRl~~dW~~~~~~ 37 (159)
-++++|++||.-++|..+.|-.=|-.
T Consensus 60 lHLG~PlRA~naL~rvg~SwLSnEI~ 85 (281)
T COG3302 60 LHLGSPLRAFNALNRVGSSWLSNEIA 85 (281)
T ss_pred hhcCChhHHHHHHHhhhhhhHHHHHH
Confidence 46899999999999999999765433
No 97
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=30.82 E-value=1.2e+02 Score=25.84 Aligned_cols=31 Identities=23% Similarity=0.209 Sum_probs=26.7
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
++.=-+.+|+.++..++|..|...|+.++..
T Consensus 327 ~~~l~l~lgrl~~~~~~~~~A~~~le~al~~ 357 (398)
T PRK10747 327 TPLLWSTLGQLLMKHGEWQEASLAFRAALKQ 357 (398)
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHhc
Confidence 3444678999999999999999999999864
No 98
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=29.97 E-value=2e+02 Score=23.72 Aligned_cols=56 Identities=16% Similarity=0.210 Sum_probs=38.2
Q ss_pred CCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 98 TQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 98 ~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
...++..-|.=|....+.|+|..|..=++....++..++ . ..+++-.++++.++.|
T Consensus 30 ~~~p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~s~---~--~~qa~l~l~yA~Yk~~ 85 (254)
T COG4105 30 YNLPASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSP---Y--SEQAQLDLAYAYYKNG 85 (254)
T ss_pred cCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHcCCCCc---c--cHHHHHHHHHHHHhcc
Confidence 356788888888888888888888888777665544332 1 2345556777777765
No 99
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=29.60 E-value=2.1e+02 Score=20.25 Aligned_cols=46 Identities=15% Similarity=0.055 Sum_probs=29.4
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
+-.++..+...++|..|..++..++..-.- ...+...|..++...|
T Consensus 65 ~~~l~~~~~~~~~~~~a~~~~~~~l~~dP~--------~E~~~~~lm~~~~~~g 110 (146)
T PF03704_consen 65 LERLAEALLEAGDYEEALRLLQRALALDPY--------DEEAYRLLMRALAAQG 110 (146)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHSTT---------HHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHhccCHHHHHHHHHHHHhcCCC--------CHHHHHHHHHHHHHCc
Confidence 345677788999999999999988764221 2344555555555554
No 100
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=28.48 E-value=71 Score=25.33 Aligned_cols=28 Identities=7% Similarity=0.278 Sum_probs=24.8
Q ss_pred HHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 104 DCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 104 DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
=.|-||...+..|++..|..|+...+..
T Consensus 167 l~YLigeL~rrlg~~~eA~~~fs~vi~~ 194 (214)
T PF09986_consen 167 LLYLIGELNRRLGNYDEAKRWFSRVIGS 194 (214)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHcC
Confidence 4577999999999999999999988754
No 101
>PF12462 Helicase_IV_N: DNA helicase IV / RNA helicase N terminal; InterPro: IPR022161 This domain family is found in bacteria and eukaryotes, and is approximately 170 amino acids in length. The family is found in association with PF00580 from PFAM. Nucleolin unwinds nucleic acid strands in the 5' to 3' direction with respect to the bound strand. It can unwind RNA-RNA duplexes, as well as DNA-DNA and DNA-RNA duplexes. Nucleolin is modulated by phosphorylation of serine and threonine residues in its N-terminal region.
Probab=28.31 E-value=2.7e+02 Score=21.12 Aligned_cols=67 Identities=12% Similarity=0.216 Sum_probs=50.6
Q ss_pred HHhhhhhHHHHHHHHHhhh---hHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHHHhCCChHHhhccc
Q psy8176 24 VKRLTLDWKQAEHYMKDHV---YQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQDTYKLETASVARGE 90 (159)
Q Consensus 24 irRl~~dW~~~~~~~~~~~---~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~G~ 90 (159)
.+++..-|.+|..-|-... =....+.|......-++|+..++....+.+..+-+.-.+++.+++.+.
T Consensus 82 a~~l~~~wq~w~~~~~~~~~~~L~~~~~~i~~~~~~~r~lt~~~~~~~~~~~r~~~~~l~~~l~~~~~~~ 151 (166)
T PF12462_consen 82 AHHLNQAWQQWSQEQSEQLAEVLPQWLQAIQRLSAQDRWLTHSQVQPLLEKIRALFEALPLPLPELAQFD 151 (166)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCHHHHHHHHHHHHHHHHcCCCCHHHHHhcc
Confidence 4888888988887775432 233445555555444599999999999999999999999999998753
No 102
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=28.23 E-value=15 Score=28.06 Aligned_cols=53 Identities=13% Similarity=-0.016 Sum_probs=38.3
Q ss_pred CCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcCC
Q psy8176 99 QLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQDP 159 (159)
Q Consensus 99 ~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g~ 159 (159)
+.+++=.+-+|-.+-..|+|..|..=+..|...-. + ..+..-+++.|+++.|+
T Consensus 66 p~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~-------d-dp~~~~~ag~c~L~lG~ 118 (157)
T PRK15363 66 AWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKI-------D-APQAPWAAAECYLACDN 118 (157)
T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCC-------C-CchHHHHHHHHHHHcCC
Confidence 45677788888888888898888888888765321 2 33455578888887775
No 103
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=28.06 E-value=1.6e+02 Score=26.73 Aligned_cols=47 Identities=15% Similarity=0.079 Sum_probs=34.5
Q ss_pred HHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 104 DCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 104 DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
=++.+++|.-..|+|..|...+..|+.. +++ ..+.+-.-|..+-..|
T Consensus 196 ~~~~lAqhyd~~g~~~~Al~~Id~aI~h-------tPt-~~ely~~KarilKh~G 242 (517)
T PF12569_consen 196 TLYFLAQHYDYLGDYEKALEYIDKAIEH-------TPT-LVELYMTKARILKHAG 242 (517)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHhc-------CCC-cHHHHHHHHHHHHHCC
Confidence 3488999999999999999999999864 233 4555555555555554
No 104
>cd09034 BRO1_Alix_like Protein-interacting Bro1-like domain of mammalian Alix and related domains. This superfamily includes the Bro1-like domains of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), RhoA-binding proteins Rhophilin-1 and Rhophilin-2, Brox, Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, and related domains. Alix, HD-PTP, Brox, Bro1 and Rim20 interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Alix, also known as apoptosis-linked gene-2 interacting protein 1 (AIP1), participates in membrane remodeling processes during the budding of enveloped viruses, vesicle budding inside late endosomal multivesicular bodies (MVBs), and the abscission reactions of mammalian cell division. It also functions in apoptosis. HD-PTP functions in cell migration and endosomal trafficking, Bro1 in endosomal trafficking, and Rim20 in the response to
Probab=27.84 E-value=82 Score=26.28 Aligned_cols=35 Identities=11% Similarity=0.041 Sum_probs=30.9
Q ss_pred CCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 100 LTASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 100 Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
..+.-||.+|..+.+.+.+..|+.|++.|...++.
T Consensus 249 ~~a~a~~~~a~~~~e~~~~G~aia~L~~A~~~~~~ 283 (345)
T cd09034 249 FKALAYYYHGLKLDEANKIGEAIARLQAALELLKE 283 (345)
T ss_pred HHHHHHHHHHHHhhccccHHHHHHHHHHHHHHHHH
Confidence 35778899999999999999999999999887765
No 105
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=27.74 E-value=1.1e+02 Score=27.68 Aligned_cols=86 Identities=9% Similarity=-0.001 Sum_probs=52.0
Q ss_pred HHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCC-------HHHH
Q psy8176 74 RLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTP-------KWEI 146 (159)
Q Consensus 74 RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~-------~~~i 146 (159)
||-..|-|+--|+..+.+....-...=++.=|+..|.++|+.++|..|.+=+...-.+.+...++-.++ ..-.
T Consensus 51 rilnAffl~nld~Me~~l~~l~~~~~~s~~l~LF~~L~~Y~~k~~~kal~~ls~w~~~~~~~~~~~Ld~ni~~l~~df~l 130 (549)
T PF07079_consen 51 RILNAFFLNNLDLMEKQLMELRQQFGKSAYLPLFKALVAYKQKEYRKALQALSVWKEQIKGTESPWLDTNIQQLFSDFFL 130 (549)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHhhhHHHHHHHHHHHHhhhcccccchhhhhHHHHhhHHHH
Confidence 566667777777666665432211224678899999999999999999875555444444432211110 1112
Q ss_pred HHHHHHhHhhcCC
Q psy8176 147 LEYLAYSKFMQDP 159 (159)
Q Consensus 147 l~~la~s~~~~g~ 159 (159)
-++.|.|+.+.|.
T Consensus 131 ~~i~a~sLIe~g~ 143 (549)
T PF07079_consen 131 DEIEAHSLIETGR 143 (549)
T ss_pred HHHHHHHHHhcCC
Confidence 3455778877773
No 106
>TIGR03505 FimV_core FimV N-terminal domain. This region is found at, or about 200 amino acids from, the N-terminus of FimV from Pseudomonas aeruginosa, TspA of Neisseria meningitidis, and related proteins. Disruption of FimV blocks twitching motility from type IV pili; Semmler, et al. suggest a role for this family in peptidoglycan layer remodelling required by type IV fimbrial systems. Most but not all members of this protein family have a C-terminal region recognized by TIGR03504. In between is a highly variable, often repeat-filled region rich in the negatively charged amino acids Asp and Glu.
Probab=27.66 E-value=28 Score=23.13 Aligned_cols=46 Identities=20% Similarity=0.203 Sum_probs=35.3
Q ss_pred CCCh-hhHHHHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHH
Q psy8176 59 FPTD-EDLSGAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGR 110 (159)
Q Consensus 59 ~P~~-~Dl~~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~ 110 (159)
.|+. ..+..++.||.+ .+|..+..|++++.+.+..|..-+.-.|..
T Consensus 11 ~~~~~~s~~q~m~ai~~------aNp~AF~~~nin~L~~G~~L~iP~~~~i~~ 57 (74)
T TIGR03505 11 RPDNSVSLYQMMLALYR------ANPDAFIGGNINRLKVGQILRIPSEEEIQA 57 (74)
T ss_pred ccCCCCCHHHHHHHHHH------HCHHhHhcCChhhcCCCCEEeCCCHHHHhh
Confidence 3554 677778888755 899999999999888888777777666544
No 107
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=27.22 E-value=77 Score=23.24 Aligned_cols=27 Identities=15% Similarity=0.233 Sum_probs=24.1
Q ss_pred HHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 105 CFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 105 C~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
-+.||..+...+++..|..=++.|+..
T Consensus 75 ~~~lg~~~~~~g~~~eA~~~~~~Al~~ 101 (168)
T CHL00033 75 LYNIGLIHTSNGEHTKALEYYFQALER 101 (168)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 477999999999999999999999865
No 108
>PRK09400 secE preprotein translocase subunit SecE; Reviewed
Probab=27.03 E-value=99 Score=19.79 Aligned_cols=29 Identities=17% Similarity=0.101 Sum_probs=19.9
Q ss_pred HHHHhhhhhHHHHHHHHHhhhhHHHHHHH
Q psy8176 22 LVVKRLTLDWKQAEHYMKDHVYQEAMDNM 50 (159)
Q Consensus 22 ~LirRl~~dW~~~~~~~~~~~~~~~i~~l 50 (159)
.-++.+..||..+.+...+|..+|+....
T Consensus 8 e~~~~f~~d~~rvl~~~~KPd~~Ef~~ia 36 (61)
T PRK09400 8 ENVKNFLEDYKRVLKVARKPTREEFLLVA 36 (61)
T ss_pred HhHHHHHHHHHHHHHHhcCCCHHHHHHHH
Confidence 34567778888887777777766665543
No 109
>COG0429 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=26.66 E-value=1.8e+02 Score=25.19 Aligned_cols=79 Identities=13% Similarity=0.165 Sum_probs=50.4
Q ss_pred cChhhhhhchhHHHHHHHhhhhhHHHHHHHHHhhhhHHHHHHHHHhhhcCCC-CChhhH-HHHHHHHHHHHHHhCCChHH
Q psy8176 8 KDVPEYLANPINAFLVVKRLTLDWKQAEHYMKDHVYQEAMDNMTRYKQDLKF-PTDEDL-SGAATALLRLQDTYKLETAS 85 (159)
Q Consensus 8 ~~~e~yl~nPlnaF~LirRl~~dW~~~~~~~~~~~~~~~i~~l~~~~~~~~~-P~~~Dl-~~A~~al~RLQ~~Y~L~~~d 85 (159)
.+....+++|+.-..-.-||.++|.. .+-....+.-+..++...... + |+.-+- .++++++.+++++ .+
T Consensus 175 ~~aa~~vs~P~Dl~~~~~~l~~~~s~--~ly~r~l~~~L~~~~~~kl~~--l~~~~p~~~~~~ik~~~ti~eF-----D~ 245 (345)
T COG0429 175 LDAAVAVSAPFDLEACAYRLDSGFSL--RLYSRYLLRNLKRNAARKLKE--LEPSLPGTVLAAIKRCRTIREF-----DD 245 (345)
T ss_pred cceeeeeeCHHHHHHHHHHhcCchhh--hhhHHHHHHHHHHHHHHHHHh--cCcccCcHHHHHHHhhchHHhc-----cc
Confidence 34566789999999999999999984 233333444444444433333 3 333333 8889999999888 55
Q ss_pred hhccccCCcc
Q psy8176 86 VARGELNGVQ 95 (159)
Q Consensus 86 la~G~l~g~~ 95 (159)
+.-+.+.|-.
T Consensus 246 ~~Tap~~Gf~ 255 (345)
T COG0429 246 LLTAPLHGFA 255 (345)
T ss_pred eeeecccCCC
Confidence 5555566544
No 110
>KOG4563|consensus
Probab=26.28 E-value=89 Score=27.40 Aligned_cols=57 Identities=18% Similarity=0.111 Sum_probs=45.7
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccC-CCCCCCHHHHHHHHHHhHhhcC
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKRHDMER-NGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~-~~~~~~~~~il~~la~s~~~~g 158 (159)
+...-+..|+.++..+||..|+.=+.+|........ ..... ..+.|-++..++++++
T Consensus 40 ~~e~lv~~G~~~~~~~d~~~Avda~s~A~~l~~ei~Ge~~~e-~~eal~~YGkslLela 97 (400)
T KOG4563|consen 40 TLEELVQAGRRALCNNDIDKAVDALSEATELSDEIYGEKHLE-TFEALFLYGKSLLELA 97 (400)
T ss_pred HHHHHHHhhhHHHhcccHHHHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHHHHHHHH
Confidence 456788999999999999999999999998876532 12334 5788889999998865
No 111
>PF03095 PTPA: Phosphotyrosyl phosphate activator (PTPA) protein; InterPro: IPR004327 Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is relatively low when compared to the better recognised phosphoserine/ threonine protein phosphorylase activity. The specific biological role of PTPA is unknown, Basal expression of PTPA depends on the activity of a ubiquitous transcription factor, Yin Yang 1 (YY1). The tumour suppressor protein p53 can inhibit PTPA expression through an unknown mechanism that negatively controls YY1 [].; GO: 0019211 phosphatase activator activity; PDB: 2HV7_H 2HV6_A 2IXM_A 2G62_A 2IXN_B 2IXO_B 2IXP_B.
Probab=25.92 E-value=27 Score=29.45 Aligned_cols=57 Identities=19% Similarity=0.283 Sum_probs=29.3
Q ss_pred HHHHHHHHhCCChHHhhccc--cCC----------cc-cCCCCCHHHHHHHHHHHHhCC--CchhHHHHHHHH
Q psy8176 71 ALLRLQDTYKLETASVARGE--LNG----------VQ-YTTQLTASDCFELGRQSYNTQ--DFYHTALWMGEA 128 (159)
Q Consensus 71 al~RLQ~~Y~L~~~dla~G~--l~g----------~~-~~~~Ls~~DC~~iG~~~y~~~--dy~~A~~Wl~~A 128 (159)
=+.|||.+|.|+|+= +.|. |++ .| .+..+++.++..-.-.--... -|-.|..++...
T Consensus 169 l~R~Lq~~Y~LEPAG-SHGVWGLDDy~fLPfi~GsaQ~~~~~~~P~~i~~~~~v~~~~~~yly~~~I~fI~~~ 240 (299)
T PF03095_consen 169 LVRRLQSTYRLEPAG-SHGVWGLDDYQFLPFIFGSAQLDHPPIKPKSILDPDIVEEYRDDYLYLDCIDFINKV 240 (299)
T ss_dssp HHHHHHHHCT-EECS-CCCCCSSSSS-SHHHHHHHHHTTTSSGGGGGGGSHHHHHHHTTT-HHHHHHHHHHHH
T ss_pred HHHHHHHheeeccCC-CCCCCccccceeeccccccccccCCCCChHHhcCHHHHHHhcccccHHHHHHHHHHh
Confidence 457999999999963 3343 111 11 134555655554433222223 345666666553
No 112
>KOG3830|consensus
Probab=25.59 E-value=88 Score=28.20 Aligned_cols=41 Identities=17% Similarity=0.178 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHhC--CChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCC
Q psy8176 68 AATALLRLQDTYK--LETASVARGELNGVQYTTQLTASDCFELGRQSYNTQ 116 (159)
Q Consensus 68 A~~al~RLQ~~Y~--L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~ 116 (159)
+.-.+.++-++|. ++-.|||+ +.-++..+||.|++|+..++
T Consensus 289 ~np~l~~li~~~sp~~slqdma~--------~agisi~evf~La~HLv~W~ 331 (542)
T KOG3830|consen 289 QNPLLSSLIAIVSPNLSLQDMAN--------RAGISISEVFILAKHLVYWR 331 (542)
T ss_pred cchHHHHHHHHhCccccHHHHhh--------hcCCCHHHHHHHHHHHHHHH
Confidence 4556778888874 66778886 67799999999999886543
No 113
>PHA02763 hypothetical protein; Provisional
Probab=25.51 E-value=1.1e+02 Score=21.31 Aligned_cols=30 Identities=20% Similarity=0.431 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
..||+. .+.+-..|...++|+.+|.+..++
T Consensus 54 t~~~Ls---KYv~~SGFe~VEeWl~eArrLh~~ 83 (102)
T PHA02763 54 TEYCLS---KYVKFSGFENVEEWLNEARRLHND 83 (102)
T ss_pred hHHHHH---HHhhhcchhhHHHHHHHHHHHhcC
Confidence 578876 455667889999999999776554
No 114
>PF09177 Syntaxin-6_N: Syntaxin 6, N-terminal; InterPro: IPR015260 Members of this entry, which are found in the amino terminus of various SNARE proteins, adopt a structure consisting of an antiparallel three-helix bundle. Their exact function has not been determined, though it is known that they regulate the SNARE motif, as well as mediate various protein-protein interactions involved in membrane-transport []. ; GO: 0048193 Golgi vesicle transport, 0016020 membrane; PDB: 1LVF_B 2C5I_T 2C5J_A 2C5K_T 4DND_A.
Probab=25.34 E-value=1.3e+02 Score=20.43 Aligned_cols=41 Identities=20% Similarity=0.282 Sum_probs=26.8
Q ss_pred hHHHHHHHHHhhhcCCCCChhhHHHHHHHHHHHHHHhCCChHHhhc
Q psy8176 43 YQEAMDNMTRYKQDLKFPTDEDLSGAATALLRLQDTYKLETASVAR 88 (159)
Q Consensus 43 ~~~~i~~l~~~~~~~~~P~~~Dl~~A~~al~RLQ~~Y~L~~~dla~ 88 (159)
..++...+...... -+||+.|+..+.+=-.-|+|++.+|++
T Consensus 41 ~~eL~~~l~~ie~~-----L~DL~~aV~ive~np~kF~l~~~Ei~~ 81 (97)
T PF09177_consen 41 KRELRNALQSIEWD-----LEDLEEAVRIVEKNPSKFNLSEEEISR 81 (97)
T ss_dssp HHHHHHHHHHHHHH-----HHHHHHHHHHHHCCHHHHT-HHHHHHH
T ss_pred HHHHHHHHHHHHHH-----HHHHHHHHHHHHhCccccCCCHHHHHH
Confidence 34455555554433 578888888877777777888887764
No 115
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=25.23 E-value=1.6e+02 Score=19.61 Aligned_cols=33 Identities=15% Similarity=0.087 Sum_probs=27.9
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+.++...+-.+=+.|+|..|..+..+|++.+..
T Consensus 6 Ai~~a~~Ave~D~~g~y~eA~~~Y~~aie~l~~ 38 (76)
T cd02681 6 AVQFARLAVQRDQEGRYSEAVFYYKEAAQLLIY 38 (76)
T ss_pred HHHHHHHHHHHHHccCHHHHHHHHHHHHHHHHH
Confidence 566777777788899999999999999988765
No 116
>PF03745 DUF309: Domain of unknown function (DUF309); InterPro: IPR005500 This family consists of eubacterial and archaebacterial proteins of unknown function. The proteins contain a motif HXXXEXX(W/Y) where X can be any amino acid. This motif is likely to be functionally important and may be involved in metal binding.; PDB: 2CXD_B 2CWY_A 2IJQ_B.
Probab=24.88 E-value=63 Score=20.55 Aligned_cols=25 Identities=28% Similarity=0.449 Sum_probs=21.1
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHh
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALK 130 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~ 130 (159)
+.-|..+++.++|..|=+++++...
T Consensus 3 ~~~~~~l~n~g~f~EaHEvlE~~W~ 27 (62)
T PF03745_consen 3 LEEGIELFNAGDFFEAHEVLEELWK 27 (62)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHCC
T ss_pred HHHHHHHHcCCCHHHhHHHHHHHHH
Confidence 4567889999999999999999876
No 117
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=24.53 E-value=1.4e+02 Score=26.23 Aligned_cols=31 Identities=19% Similarity=0.091 Sum_probs=26.3
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
++.=-..+|+.+++++.|..|..-|+.|++.
T Consensus 327 ~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~ 357 (400)
T COG3071 327 DPLLLSTLGRLALKNKLWGKASEALEAALKL 357 (400)
T ss_pred ChhHHHHHHHHHHHhhHHHHHHHHHHHHHhc
Confidence 3444568999999999999999999999864
No 118
>PF09722 DUF2384: Protein of unknown function (DUF2384); InterPro: IPR024467 This domain is found predominantly in proteobacterial proteins. Its function in unknown.
Probab=23.91 E-value=1.3e+02 Score=18.00 Aligned_cols=43 Identities=14% Similarity=0.142 Sum_probs=26.8
Q ss_pred HHHHHHhCCCchhHHHHHHHHHhhhhccCCC----C-CCCHHHHHHHHHHh
Q psy8176 108 LGRQSYNTQDFYHTALWMGEALKRHDMERNG----T-STPKWEILEYLAYS 153 (159)
Q Consensus 108 iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~----~-~~~~~~il~~la~s 153 (159)
.+...| ++-..|..||......+....+. + .- ...|+++|...
T Consensus 3 ~a~~vf--gd~~~a~~Wl~~p~~~l~g~~Plel~~t~~G-~~~V~~~L~~~ 50 (54)
T PF09722_consen 3 QAEEVF--GDEDKARRWLRTPNPALGGRTPLELLRTEAG-AERVLDYLDRI 50 (54)
T ss_pred HHHHHH--CCHHHHHHHHHChHHHhCCCCHHHHHcChHH-HHHHHHHHHHH
Confidence 344455 77799999999876666553321 1 12 45677777654
No 119
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=23.89 E-value=5.8e+02 Score=23.53 Aligned_cols=31 Identities=6% Similarity=-0.093 Sum_probs=23.1
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
++.=...+|..+...+++..|...++.|+..
T Consensus 283 ~~~a~~~lg~~l~~~g~~~eA~~~l~~al~l 313 (656)
T PRK15174 283 NVRIVTLYADALIRTGQNEKAIPLLQQSLAT 313 (656)
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 3445567788888888888888888888764
No 120
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=23.34 E-value=1.3e+02 Score=20.49 Aligned_cols=51 Identities=16% Similarity=0.229 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHhCCCchhHHHHHHHHHhhhhccCCCCCCCHHHHHHHHHHhHhhcC
Q psy8176 103 SDCFELGRQSYNTQDFYHTALWMGEALKRHDMERNGTSTPKWEILEYLAYSKFMQD 158 (159)
Q Consensus 103 ~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~~~~~~~~~~~~il~~la~s~~~~g 158 (159)
.+=.+=|..+|+......|+.=.+-||+..... .. +..+|=+|+.++...|
T Consensus 7 k~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~~~----~~-rf~~lG~l~qA~~e~G 57 (80)
T PF10579_consen 7 KQQIEKGLKLYHQNETQQALQKWRKALEKITDR----ED-RFRVLGYLIQAHMEWG 57 (80)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHHhhcCCh----HH-HHHHHHHHHHHHHHHH
Confidence 344566788898888888888555666655442 24 6777777777776655
No 121
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=23.23 E-value=1.6e+02 Score=16.85 Aligned_cols=29 Identities=21% Similarity=0.162 Sum_probs=25.4
Q ss_pred HHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 106 FELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 106 ~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
..||.+.....+|..|..=++.||.....
T Consensus 5 ~~Lgeisle~e~f~qA~~D~~~aL~i~~~ 33 (38)
T PF10516_consen 5 DLLGEISLENENFEQAIEDYEKALEIQEE 33 (38)
T ss_pred HHHHHHHHHhccHHHHHHHHHHHHHHHHH
Confidence 46899999999999999999999887654
No 122
>PRK12370 invasion protein regulator; Provisional
Probab=21.90 E-value=2.3e+02 Score=25.34 Aligned_cols=31 Identities=16% Similarity=0.144 Sum_probs=24.6
Q ss_pred CHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 101 TASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 101 s~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
++.--+.+|..+...|++..|..|++.|++.
T Consensus 371 ~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l 401 (553)
T PRK12370 371 SADIKYYYGWNLFMAGQLEEALQTINECLKL 401 (553)
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHhc
Confidence 3455677888888889998898888888764
No 123
>KOG2002|consensus
Probab=21.72 E-value=8.1e+02 Score=24.41 Aligned_cols=38 Identities=24% Similarity=0.352 Sum_probs=32.5
Q ss_pred CCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 97 TTQLTASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 97 ~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+.++-++-||-|||..+..|||..|...+.+|++--++
T Consensus 302 ~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d 339 (1018)
T KOG2002|consen 302 NKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADND 339 (1018)
T ss_pred hhHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHccCCC
Confidence 34577888999999999999999999999999865443
No 124
>PF07743 HSCB_C: HSCB C-terminal oligomerisation domain; InterPro: IPR009073 This entry represents the C-terminal oligomerisation domain found in HscB (heat shock cognate protein B), which is also known as HSC20 (20K heat shock cognate protein). HscB acts as a co-chaperone to regulate the ATPase activity and peptide-binding specificity of the molecular chaperone HscA, also known as HSC66 (HSP70 class). HscB proteins contain two domains, an N-terminal J-domain, which is involved in interactions with HscA, connected by a short loop to the C-terminal oligomerisation domain; the two domains make contact through a hydrophobic interface. The core of the oligomerisation domain is thought to bind and target proteins to HscA and consists of an open, three-helical bundle []. HscB, along with HscA, has been shown to play a role in the biogenesis of iron-sulphur proteins.; GO: 0006457 protein folding; PDB: 1FPO_C 3BVO_B 3HHO_A 3UO2_B 3UO3_B.
Probab=21.64 E-value=97 Score=20.06 Aligned_cols=21 Identities=33% Similarity=0.319 Sum_probs=17.2
Q ss_pred ChhhHHHHHHHHHHHHHHhCC
Q psy8176 61 TDEDLSGAATALLRLQDTYKL 81 (159)
Q Consensus 61 ~~~Dl~~A~~al~RLQ~~Y~L 81 (159)
...|+..|...+.||+-.+++
T Consensus 52 ~~~d~~~A~~~~~kLky~~kl 72 (78)
T PF07743_consen 52 DAKDWEEAKEALRKLKYLQKL 72 (78)
T ss_dssp HTT-HHHHHHHHHHHHHHHHH
T ss_pred ccCcHHHHHHHHHHHHHHHHH
Confidence 678999999999999877655
No 125
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=21.53 E-value=2.5e+02 Score=27.66 Aligned_cols=30 Identities=10% Similarity=-0.044 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
+.-.+.+|..+.+.+++..|..+++.|+..
T Consensus 609 ~~a~~~LA~~l~~lG~~deA~~~l~~AL~l 638 (987)
T PRK09782 609 ANAYVARATIYRQRHNVPAAVSDLRAALEL 638 (987)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 667788899999999999999999888765
No 126
>COG1803 MgsA Methylglyoxal synthase [Carbohydrate transport and metabolism]
Probab=21.44 E-value=71 Score=23.91 Aligned_cols=22 Identities=45% Similarity=0.723 Sum_probs=17.8
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCChH
Q psy8176 59 FPTDEDLSGAATALLRLQDTYKLETA 84 (159)
Q Consensus 59 ~P~~~Dl~~A~~al~RLQ~~Y~L~~~ 84 (159)
.|-+-|+ +||+||-+||++...
T Consensus 89 qPHdpDV----kAL~Rl~~V~nIP~A 110 (142)
T COG1803 89 QPHDPDV----KALLRLATVYNIPVA 110 (142)
T ss_pred CCCCcCH----HHHHHHHHhhcccch
Confidence 5777775 799999999998753
No 127
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=21.06 E-value=1.5e+02 Score=23.67 Aligned_cols=32 Identities=19% Similarity=0.260 Sum_probs=24.3
Q ss_pred CCHHHHHHHHHHHHh----CCCchhHHHHHHHHHhh
Q psy8176 100 LTASDCFELGRQSYN----TQDFYHTALWMGEALKR 131 (159)
Q Consensus 100 Ls~~DC~~iG~~~y~----~~dy~~A~~Wl~~Al~~ 131 (159)
..+.-|+.+|..+.+ ..|+..|..|+..|.++
T Consensus 185 ~~~~a~~~lg~~y~~G~Gv~~d~~~A~~wy~~Aa~~ 220 (292)
T COG0790 185 GNPDAQLLLGRMYEKGLGVPRDLKKAFRWYKKAAEQ 220 (292)
T ss_pred cCHHHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHC
Confidence 367789999955544 34889999999998754
No 128
>PF14718 SLT_L: Soluble lytic murein transglycosylase L domain; PDB: 1QTE_A 1SLY_A 1QSA_A.
Probab=20.99 E-value=1.3e+02 Score=19.57 Aligned_cols=54 Identities=17% Similarity=0.105 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHH
Q psy8176 68 AATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTAL 123 (159)
Q Consensus 68 A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~ 123 (159)
.--+|.|+...|.|+-.--|+=...- .-..++..+-..+|..++++|=|..|+.
T Consensus 7 ~~p~l~R~~EL~~l~~~~~ArrEW~~--~l~~~~~~~~~aaA~~A~~~gW~d~AI~ 60 (71)
T PF14718_consen 7 QDPALQRARELYALGRRTEARREWRH--LLRRLDEEQQLAAAELAHQWGWHDRAIQ 60 (71)
T ss_dssp HSHHHHHHHHHHHTT-HHHHHHHHHH--HHTTS-HHHHHHHHHHHHHTT-HHHHHH
T ss_pred cCHHHHHHHHHHHcCCcHHHHHHHHH--HHHCCCHHHHHHHHHHHHHCCCHHHHHH
Confidence 34578899999999886666532211 1235789999999999999998888774
No 129
>PF09523 DUF2390: Protein of unknown function (DUF2390); InterPro: IPR012659 Members of this family are bacterial hypothetical proteins, about 160 amino acids in length, found in various proteobacteria, including members of the genera Pseudomonas and Vibrio. The C-terminal region is poorly conserved and is not included in the model.
Probab=20.82 E-value=1.8e+02 Score=20.59 Aligned_cols=53 Identities=26% Similarity=0.251 Sum_probs=37.8
Q ss_pred HHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHH
Q psy8176 67 GAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEAL 129 (159)
Q Consensus 67 ~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al 129 (159)
|...+.++|||-|.+++.-+-=+..-+. .+..+++.+.-.+-. .+..|=+..+
T Consensus 15 gV~~acL~LQd~~g~nVnllL~~~wl~~-~g~~l~~~~l~~l~~---------~~~~W~~~vv 67 (109)
T PF09523_consen 15 GVAEACLALQDRYGANVNLLLLALWLAQ-QGRSLDAERLAALDA---------AVAPWREEVV 67 (109)
T ss_pred hHHHHHHHHHHHHCCCHHHHHHHHHHHH-cCCCCCHHHHHHHHH---------HHHHHHHHHH
Confidence 5567889999999999998876665555 466788887666544 3455766543
No 130
>KOG1126|consensus
Probab=20.68 E-value=3.6e+02 Score=25.35 Aligned_cols=67 Identities=18% Similarity=0.139 Sum_probs=52.6
Q ss_pred CCCh-hhHHHHHHHHHHHHHHhCCChHHhhccccCCcccCCCCCHHHHHHHHHHHHhCCCchhHHHHHHHHHhh
Q psy8176 59 FPTD-EDLSGAATALLRLQDTYKLETASVARGELNGVQYTTQLTASDCFELGRQSYNTQDFYHTALWMGEALKR 131 (159)
Q Consensus 59 ~P~~-~Dl~~A~~al~RLQ~~Y~L~~~dla~G~l~g~~~~~~Ls~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~ 131 (159)
-|-- ++++-=...|.-||+++.| +-||++.|.-. +.+|.-=-++|...--++|+..|++-|+.|++.
T Consensus 383 ~p~rv~~meiyST~LWHLq~~v~L--s~Laq~Li~~~----~~sPesWca~GNcfSLQkdh~~Aik~f~RAiQl 450 (638)
T KOG1126|consen 383 EPYRVKGMEIYSTTLWHLQDEVAL--SYLAQDLIDTD----PNSPESWCALGNCFSLQKDHDTAIKCFKRAIQL 450 (638)
T ss_pred ccccccchhHHHHHHHHHHhhHHH--HHHHHHHHhhC----CCCcHHHHHhcchhhhhhHHHHHHHHHHHhhcc
Confidence 3443 6777788999999999888 45888877643 345666567888888999999999999999864
No 131
>TIGR00327 secE_euk_arch protein translocase SEC61 complex gamma subunit, archaeal and eukaryotic. This model describes archaeal SEC61-like and eukaryotic SEC61 but not bacterial secE proteins, for which a Pfam pfam00584 (SecE) has been created.
Probab=20.54 E-value=1.6e+02 Score=18.89 Aligned_cols=27 Identities=7% Similarity=-0.069 Sum_probs=17.9
Q ss_pred HHhhhhhHHHHHHHHHhhhhHHHHHHH
Q psy8176 24 VKRLTLDWKQAEHYMKDHVYQEAMDNM 50 (159)
Q Consensus 24 irRl~~dW~~~~~~~~~~~~~~~i~~l 50 (159)
++.+..||..+.+...++..+|+....
T Consensus 6 ~~~f~k~~~r~lk~~~KPd~~Ef~~ia 32 (61)
T TIGR00327 6 PVEFIKEGTRVLAVCKKPDLEEYLKVA 32 (61)
T ss_pred HHHHHHHHHHHHHHhcCCCHHHHHHHH
Confidence 455667788777777777666665543
No 132
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=20.48 E-value=4.3e+02 Score=24.08 Aligned_cols=52 Identities=12% Similarity=0.219 Sum_probs=39.6
Q ss_pred HHhhccccC-Cccc-CCCCCHHHHHHHHHHHHhC-CCchhHHHHHHHHHhhhhcc
Q psy8176 84 ASVARGELN-GVQY-TTQLTASDCFELGRQSYNT-QDFYHTALWMGEALKRHDME 135 (159)
Q Consensus 84 ~dla~G~l~-g~~~-~~~Ls~~DC~~iG~~~y~~-~dy~~A~~Wl~~Al~~~~~~ 135 (159)
..|+.-.|. |-.| ...-....-+.-+..+|+. ++|..|...+..|+..+++|
T Consensus 495 A~L~E~~iQYaNRYR~~~~~v~~al~~Ae~~F~~~~~Y~~ALe~i~~alE~vePG 549 (560)
T PF06160_consen 495 ATLAEQLIQYANRYRSDNPEVDEALTEAEDLFRNEYDYEKALETIATALEKVEPG 549 (560)
T ss_pred HHHHHHHHHHHhcccCCCHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHhCCC
Confidence 444444333 4445 3356788889999999999 99999999999999998875
No 133
>COG4667 Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]
Probab=20.38 E-value=1.4e+02 Score=25.08 Aligned_cols=73 Identities=12% Similarity=0.230 Sum_probs=47.3
Q ss_pred ChhhhhhchhHHHHHHHhhhhhHHHHHHHHHhh--hhHHHHHHHHHh---hh-cCCCCC-hhhHHHHHHHHHHHHHHhCC
Q psy8176 9 DVPEYLANPINAFLVVKRLTLDWKQAEHYMKDH--VYQEAMDNMTRY---KQ-DLKFPT-DEDLSGAATALLRLQDTYKL 81 (159)
Q Consensus 9 ~~e~yl~nPlnaF~LirRl~~dW~~~~~~~~~~--~~~~~i~~l~~~---~~-~~~~P~-~~Dl~~A~~al~RLQ~~Y~L 81 (159)
.|.+|-.+| -.|..+++...+.+.+.+.++.. .-.+..+.|... .. .+-.|+ .-+...--..+-+||..|++
T Consensus 194 rps~~~~~p-~~~~~~~~~lr~~~~Lv~~l~~r~~~Y~~~~q~I~~le~~~~ifvi~p~k~l~i~~~e~~~~~l~~~Y~l 272 (292)
T COG4667 194 RPSDYYKTP-QWFKRLELWLRELPHLVELLQHREKLYNDIQQFILKLENKGRIFVIRPPKPLKIKRLEKNLPKLRADYDL 272 (292)
T ss_pred chHHhccCC-CcchHHHHHHHHhHHHHHHHHHhhhhHHHHHHHHhccccCCceEEECCCCCcchhhhccchHHHHHHHHH
Confidence 488999999 88999999888899888888752 122222333322 11 001343 36666667777888888776
Q ss_pred C
Q psy8176 82 E 82 (159)
Q Consensus 82 ~ 82 (159)
-
T Consensus 273 G 273 (292)
T COG4667 273 G 273 (292)
T ss_pred H
Confidence 3
No 134
>KOG2867|consensus
Probab=20.35 E-value=53 Score=28.46 Aligned_cols=17 Identities=47% Similarity=0.636 Sum_probs=13.9
Q ss_pred HHHHHHHHHHhCCChHH
Q psy8176 69 ATALLRLQDTYKLETAS 85 (159)
Q Consensus 69 ~~al~RLQ~~Y~L~~~d 85 (159)
.+=+.|||.||+|+|+-
T Consensus 180 lev~R~Lq~tY~lEPAG 196 (367)
T KOG2867|consen 180 LEVMRRLQTTYNLEPAG 196 (367)
T ss_pred HHHHHHHHHHhccccCC
Confidence 35578999999999974
No 135
>PF10151 DUF2359: Uncharacterised conserved protein (DUF2359); InterPro: IPR019308 This is a 450 amino acid region of a family of proteins conserved from insects to humans. The function is not known.
Probab=20.31 E-value=3.2e+02 Score=24.63 Aligned_cols=41 Identities=20% Similarity=0.400 Sum_probs=30.6
Q ss_pred hhhhhhchhHHHHHHHhhhhhHHHHHHHHHhhhhHHHHHHHH
Q psy8176 10 VPEYLANPINAFLVVKRLTLDWKQAEHYMKDHVYQEAMDNMT 51 (159)
Q Consensus 10 ~e~yl~nPlnaF~LirRl~~dW~~~~~~~~~~~~~~~i~~l~ 51 (159)
..-|..|...|-.|++++..+|...-.-+.+. -.+.++.++
T Consensus 188 r~lY~knl~~S~llL~~l~~~W~~~s~KL~k~-l~~Tl~sfr 228 (469)
T PF10151_consen 188 RQLYKKNLKQSVLLLKHLDDEWKESSKKLSKS-LKETLKSFR 228 (469)
T ss_pred HHHHHHhHHHHHHHHHHHHHhHHhhhHHHHHH-HHHHHHHHH
Confidence 35688899999999999999999887777653 344444444
No 136
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=20.31 E-value=2.5e+02 Score=17.90 Aligned_cols=33 Identities=12% Similarity=0.033 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHhCCCchhHHHHHHHHHhhhhc
Q psy8176 102 ASDCFELGRQSYNTQDFYHTALWMGEALKRHDM 134 (159)
Q Consensus 102 ~~DC~~iG~~~y~~~dy~~A~~Wl~~Al~~~~~ 134 (159)
+..+..-|..+=+.|+|..|..++..|++.+..
T Consensus 6 a~~l~~~Av~~D~~g~~~~Al~~Y~~a~e~l~~ 38 (75)
T cd02656 6 AKELIKQAVKEDEDGNYEEALELYKEALDYLLQ 38 (75)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHH
Confidence 345556666677889999999999999988765
No 137
>PF07026 DUF1317: Protein of unknown function (DUF1317); InterPro: IPR009750 This entry is represented by Bacteriophage lambda, Xis. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=20.23 E-value=62 Score=20.77 Aligned_cols=22 Identities=18% Similarity=0.252 Sum_probs=17.8
Q ss_pred hhhhhchhHHHHHHHhhhhhHH
Q psy8176 11 PEYLANPINAFLVVKRLTLDWK 32 (159)
Q Consensus 11 e~yl~nPlnaF~LirRl~~dW~ 32 (159)
.+++.||+-|.++-.+++.-=.
T Consensus 29 g~vi~NPlkAqR~AE~~n~~~~ 50 (60)
T PF07026_consen 29 GKVITNPLKAQRLAEELNSKQV 50 (60)
T ss_pred CeeEcCHHHHHHHHHHHHhhHh
Confidence 4678999999999999886433
Done!