Query psy14697
Match_columns 175
No_of_seqs 111 out of 204
Neff 4.1
Searched_HMMs 46136
Date Fri Aug 16 18:18:59 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14697.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14697hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF02064 MAS20: MAS20 protein 100.0 9.3E-60 2E-64 366.7 6.3 121 51-171 1-121 (121)
2 TIGR00985 3a0801s04tom mitocho 100.0 2.5E-58 5.4E-63 369.5 14.6 127 44-170 4-148 (148)
3 KOG4056|consensus 100.0 8.7E-56 1.9E-60 352.1 13.5 132 44-175 6-143 (143)
4 TIGR00990 3a0801s09 mitochondr 97.5 0.00049 1.1E-08 63.9 8.4 45 105-149 119-163 (615)
5 PF00515 TPR_1: Tetratricopept 94.5 0.061 1.3E-06 31.3 3.4 29 117-145 5-33 (34)
6 PF07719 TPR_2: Tetratricopept 94.0 0.083 1.8E-06 30.3 3.3 29 117-145 5-33 (34)
7 PF13428 TPR_14: Tetratricopep 90.7 0.64 1.4E-05 29.0 4.4 37 117-153 5-41 (44)
8 smart00028 TPR Tetratricopepti 90.3 0.36 7.9E-06 24.8 2.6 29 117-145 5-33 (34)
9 PF13371 TPR_9: Tetratricopept 90.2 0.66 1.4E-05 30.6 4.3 39 117-155 33-71 (73)
10 PF13176 TPR_7: Tetratricopept 89.4 0.22 4.8E-06 30.1 1.4 32 117-148 3-34 (36)
11 PF13181 TPR_8: Tetratricopept 89.4 0.58 1.3E-05 26.9 3.2 28 117-144 5-32 (34)
12 PF13414 TPR_11: TPR repeat; P 88.3 0.78 1.7E-05 30.0 3.6 34 117-150 7-40 (69)
13 PF14561 TPR_20: Tetratricopep 88.3 1.1 2.5E-05 32.8 4.8 47 102-155 18-70 (90)
14 PRK10370 formate-dependent nit 84.0 13 0.00029 30.3 9.4 39 117-155 77-115 (198)
15 PRK11677 hypothetical protein; 83.7 9.5 0.00021 30.6 8.3 36 107-144 50-86 (134)
16 PRK13183 psbN photosystem II r 83.2 1 2.2E-05 30.5 2.1 25 43-67 4-28 (46)
17 PF13432 TPR_16: Tetratricopep 79.3 2.6 5.7E-05 27.3 3.1 29 117-145 35-63 (65)
18 PF14559 TPR_19: Tetratricopep 79.1 7.1 0.00015 25.2 5.1 41 102-149 21-61 (68)
19 PF04689 S1FA: DNA binding pro 79.0 2.1 4.5E-05 31.1 2.6 38 51-88 19-62 (69)
20 PF13432 TPR_16: Tetratricopep 78.7 3.5 7.6E-05 26.7 3.6 34 118-151 2-35 (65)
21 PF06295 DUF1043: Protein of u 78.2 13 0.00028 29.0 7.2 36 108-145 47-83 (128)
22 TIGR00540 hemY_coli hemY prote 78.1 34 0.00074 30.5 10.7 45 104-148 132-188 (409)
23 PF10602 RPN7: 26S proteasome 77.7 12 0.00025 30.4 7.1 49 110-158 33-81 (177)
24 PF13174 TPR_6: Tetratricopept 77.7 2.8 6.2E-05 23.4 2.6 29 117-145 4-32 (33)
25 CHL00020 psbN photosystem II p 77.6 1.8 4E-05 28.8 1.9 23 45-67 3-25 (43)
26 PF10841 DUF2644: Protein of u 75.7 2.9 6.3E-05 29.7 2.6 25 45-69 13-37 (60)
27 cd00189 TPR Tetratricopeptide 74.7 7.1 0.00015 23.9 4.0 31 117-147 38-68 (100)
28 PF13374 TPR_10: Tetratricopep 72.8 4.6 0.0001 23.4 2.7 27 117-143 6-32 (42)
29 PF02468 PsbN: Photosystem II 70.5 3.6 7.8E-05 27.4 1.9 24 45-68 3-26 (43)
30 TIGR02795 tol_pal_ybgF tol-pal 70.4 9.4 0.0002 26.3 4.2 33 119-151 82-114 (119)
31 KOG4648|consensus 69.4 2.9 6.4E-05 39.6 1.9 30 118-147 102-132 (536)
32 PF14559 TPR_19: Tetratricopep 69.3 6.4 0.00014 25.4 3.0 28 123-150 1-28 (68)
33 PF10516 SHNi-TPR: SHNi-TPR; 68.5 6.4 0.00014 25.0 2.8 25 117-141 5-29 (38)
34 PF01102 Glycophorin_A: Glycop 66.6 4.1 8.9E-05 32.2 1.9 20 46-65 68-88 (122)
35 COG4968 PilE Tfp pilus assembl 66.1 15 0.00032 30.0 5.0 33 75-116 29-61 (139)
36 KOG4234|consensus 65.4 13 0.00028 33.0 4.8 53 100-152 62-135 (271)
37 COG4235 Cytochrome c biogenesi 64.1 30 0.00064 31.1 7.0 39 117-155 160-198 (287)
38 KOG0548|consensus 63.0 6.4 0.00014 38.2 2.8 33 113-145 2-34 (539)
39 PF13424 TPR_12: Tetratricopep 61.4 7.1 0.00015 26.1 2.1 33 111-143 44-76 (78)
40 KOG1174|consensus 61.1 12 0.00025 36.3 4.1 39 116-154 474-512 (564)
41 PF08572 PRP3: pre-mRNA proces 60.6 25 0.00055 30.2 5.8 44 78-121 146-209 (223)
42 cd08323 CARD_APAF1 Caspase act 60.5 20 0.00044 26.4 4.5 40 116-158 47-86 (86)
43 PF12868 DUF3824: Domain of un 59.7 2.9 6.4E-05 33.6 -0.0 18 46-63 7-24 (137)
44 PRK02603 photosystem I assembl 59.4 27 0.00058 27.1 5.3 36 116-151 75-110 (172)
45 KOG3119|consensus 59.1 5.2 0.00011 35.0 1.4 28 65-92 194-221 (269)
46 PLN03098 LPA1 LOW PSII ACCUMUL 59.0 8.1 0.00018 36.8 2.7 41 107-147 69-109 (453)
47 PRK15359 type III secretion sy 58.8 20 0.00043 27.5 4.5 38 102-146 88-125 (144)
48 cd00189 TPR Tetratricopeptide 58.7 10 0.00022 23.2 2.3 30 116-145 71-100 (100)
49 PRK11447 cellulose synthase su 57.6 47 0.001 34.0 8.0 48 102-149 92-148 (1157)
50 KOG0624|consensus 57.4 20 0.00043 34.1 4.9 47 103-155 32-78 (504)
51 PF07721 TPR_4: Tetratricopept 56.9 14 0.0003 20.8 2.5 21 116-136 4-24 (26)
52 TIGR02795 tol_pal_ybgF tol-pal 56.2 16 0.00034 25.2 3.2 30 117-146 43-72 (119)
53 PRK10803 tol-pal system protei 55.0 29 0.00064 30.0 5.3 56 117-172 184-250 (263)
54 PF13414 TPR_11: TPR repeat; P 54.4 15 0.00033 23.8 2.7 28 117-144 41-69 (69)
55 TIGR02552 LcrH_SycD type III s 53.9 29 0.00063 25.0 4.4 38 117-154 89-126 (135)
56 PRK15363 pathogenicity island 53.9 32 0.00069 28.3 5.1 36 102-144 99-134 (157)
57 PF13424 TPR_12: Tetratricopep 53.5 17 0.00037 24.2 2.9 25 118-142 10-34 (78)
58 PF11118 DUF2627: Protein of u 51.7 17 0.00038 26.9 2.9 36 44-79 40-77 (77)
59 COG1729 Uncharacterized protei 51.4 15 0.00032 32.6 2.9 54 119-172 184-248 (262)
60 PF06522 B12D: NADH-ubiquinone 50.8 17 0.00038 25.8 2.7 16 65-82 34-49 (73)
61 PRK12370 invasion protein regu 50.3 37 0.00079 31.8 5.5 29 118-146 377-405 (553)
62 CHL00033 ycf3 photosystem I as 49.8 31 0.00067 26.5 4.3 29 119-147 41-69 (168)
63 PF06365 CD34_antigen: CD34/Po 49.7 10 0.00022 32.5 1.6 19 56-74 112-130 (202)
64 PF00731 AIRC: AIR carboxylase 48.6 7.8 0.00017 31.5 0.7 20 71-90 129-148 (150)
65 PF05942 PaREP1: Archaeal PaRE 48.4 26 0.00056 26.4 3.5 28 114-141 7-34 (115)
66 PF08066 PMC2NT: PMC2NT (NUC01 48.3 34 0.00074 25.0 4.0 58 67-124 16-78 (91)
67 KOG4555|consensus 47.4 19 0.00041 30.1 2.8 29 122-150 52-80 (175)
68 KOG0553|consensus 46.9 27 0.00058 31.8 3.9 37 111-147 147-183 (304)
69 PRK11234 nfrB bacteriophage N4 46.8 42 0.0009 33.5 5.5 57 105-163 639-698 (727)
70 TIGR00990 3a0801s09 mitochondr 45.9 27 0.00058 32.8 3.9 37 116-152 545-581 (615)
71 PRK09720 cybC cytochrome b562; 45.0 88 0.0019 24.2 6.0 60 73-133 16-82 (100)
72 PF03704 BTAD: Bacterial trans 44.8 32 0.00069 25.6 3.5 32 113-144 6-37 (146)
73 PRK10866 outer membrane biogen 43.7 1.9E+02 0.0042 24.4 8.5 42 103-148 63-104 (243)
74 PF12895 Apc3: Anaphase-promot 43.2 37 0.00081 23.1 3.4 27 113-139 58-84 (84)
75 PLN02539 glucose-6-phosphate 1 41.7 73 0.0016 30.6 6.1 115 31-172 23-150 (491)
76 PRK09400 secE preprotein trans 41.2 30 0.00064 24.3 2.6 23 44-66 32-54 (61)
77 KOG0543|consensus 41.0 82 0.0018 29.7 6.2 41 102-149 287-327 (397)
78 smart00386 HAT HAT (Half-A-TPR 40.8 51 0.0011 17.6 3.2 27 127-153 1-27 (33)
79 PRK15359 type III secretion sy 40.2 53 0.0011 25.1 4.2 30 117-146 62-91 (144)
80 PLN03088 SGT1, suppressor of 39.9 49 0.0011 29.4 4.5 33 117-149 40-72 (356)
81 PF12301 CD99L2: CD99 antigen 39.6 16 0.00035 30.3 1.3 32 40-71 109-140 (169)
82 TIGR00327 secE_euk_arch protei 39.4 33 0.00071 24.2 2.6 24 44-67 28-51 (61)
83 PF09976 TPR_21: Tetratricopep 39.3 25 0.00054 26.6 2.2 23 118-140 123-145 (145)
84 KOG4516|consensus 39.2 9.4 0.0002 30.3 -0.2 51 15-71 27-77 (118)
85 PF03605 DcuA_DcuB: Anaerobic 39.0 41 0.00088 31.3 3.9 22 64-85 184-205 (364)
86 COG5343 Uncharacterized protei 38.9 32 0.0007 30.3 3.1 24 52-75 108-131 (240)
87 PRK15174 Vi polysaccharide exp 38.5 67 0.0014 31.0 5.4 33 116-148 113-145 (656)
88 PRK02603 photosystem I assembl 37.8 65 0.0014 25.0 4.4 27 120-146 113-139 (172)
89 PTZ00459 mucin-associated surf 37.6 19 0.00042 31.9 1.5 10 40-49 1-10 (291)
90 TIGR01495 ETRAMP Plasmodium ri 37.6 36 0.00078 25.3 2.8 16 59-74 67-82 (85)
91 KOG1174|consensus 37.1 54 0.0012 31.9 4.5 74 55-154 299-372 (564)
92 PF09976 TPR_21: Tetratricopep 36.8 63 0.0014 24.3 4.1 33 114-146 49-81 (145)
93 PF15147 DUF4578: Domain of un 36.6 13 0.00029 29.7 0.4 11 4-14 7-17 (127)
94 TIGR02521 type_IV_pilW type IV 36.6 80 0.0017 23.4 4.6 29 118-146 70-98 (234)
95 PRK11788 tetratricopeptide rep 35.7 70 0.0015 27.2 4.6 39 117-155 286-324 (389)
96 TIGR02521 type_IV_pilW type IV 35.4 80 0.0017 23.4 4.4 29 117-145 35-63 (234)
97 TIGR02552 LcrH_SycD type III s 34.4 93 0.002 22.3 4.5 29 118-146 56-84 (135)
98 PRK12489 anaerobic C4-dicarbox 34.4 47 0.001 31.4 3.6 22 63-84 191-212 (443)
99 KOG1125|consensus 34.1 69 0.0015 31.6 4.7 59 99-157 403-474 (579)
100 TIGR03302 OM_YfiO outer membra 34.0 1.2E+02 0.0025 24.2 5.4 29 117-145 170-198 (235)
101 PF06305 DUF1049: Protein of u 33.4 44 0.00095 22.4 2.5 7 78-84 47-53 (68)
102 PF13371 TPR_9: Tetratricopept 33.3 54 0.0012 21.2 2.8 31 120-150 2-32 (73)
103 PRK10866 outer membrane biogen 32.9 1E+02 0.0022 26.0 5.2 33 118-150 37-69 (243)
104 PF11712 Vma12: Endoplasmic re 32.7 2E+02 0.0043 22.5 6.4 28 65-92 9-39 (142)
105 COG2443 Sss1 Preprotein transl 32.6 48 0.001 23.8 2.6 22 45-66 34-55 (65)
106 PRK11189 lipoprotein NlpI; Pro 32.5 1E+02 0.0022 26.4 5.1 28 117-144 240-267 (296)
107 PF13838 Clathrin_H_link: Clat 32.2 33 0.00072 24.5 1.8 27 110-139 6-32 (66)
108 PF06024 DUF912: Nucleopolyhed 32.1 54 0.0012 24.6 3.0 17 51-67 67-83 (101)
109 PF08359 TetR_C_4: YsiA-like p 31.8 1.8E+02 0.004 21.3 5.8 73 66-138 47-128 (133)
110 KOG0547|consensus 31.5 70 0.0015 31.6 4.3 41 106-146 108-149 (606)
111 PF11846 DUF3366: Domain of un 31.4 2.7E+02 0.0058 22.1 8.9 27 119-145 150-176 (193)
112 PF12487 DUF3703: Protein of u 31.3 88 0.0019 24.5 4.1 41 107-148 3-43 (112)
113 KOG2769|consensus 31.3 1.2E+02 0.0025 29.7 5.7 44 78-121 293-356 (522)
114 PF06703 SPC25: Microsomal sig 31.1 34 0.00074 27.1 1.9 21 51-71 30-50 (162)
115 PRK14748 kdpF potassium-transp 31.0 54 0.0012 20.2 2.3 14 53-66 8-21 (29)
116 PRK10747 putative protoheme IX 31.0 2.6E+02 0.0056 25.0 7.6 32 117-148 332-363 (398)
117 PF08802 CytB6-F_Fe-S: Cytochr 30.8 54 0.0012 21.3 2.4 22 46-67 14-35 (39)
118 TIGR03302 OM_YfiO outer membra 30.6 2.2E+02 0.0047 22.6 6.5 32 117-148 74-105 (235)
119 PRK09782 bacteriophage N4 rece 30.4 1.4E+02 0.003 30.9 6.4 95 68-171 38-140 (987)
120 PF09164 VitD-bind_III: Vitami 30.3 40 0.00087 24.5 1.9 21 68-88 5-25 (68)
121 PRK01001 putative inner membra 30.2 65 0.0014 32.9 4.0 75 46-124 3-81 (795)
122 KOG0548|consensus 29.8 2.3E+02 0.005 27.9 7.4 29 120-148 365-393 (539)
123 PF04654 DUF599: Protein of un 29.1 70 0.0015 27.0 3.5 32 55-86 6-37 (216)
124 KOG0909|consensus 29.0 55 0.0012 31.6 3.1 47 124-171 84-131 (500)
125 PLN03088 SGT1, suppressor of 29.0 1E+02 0.0022 27.5 4.6 37 117-153 74-110 (356)
126 KOG3498|consensus 28.9 65 0.0014 23.4 2.8 22 44-65 33-54 (67)
127 KOG0772|consensus 28.2 59 0.0013 32.1 3.2 57 107-165 568-624 (641)
128 PRK02898 cobalt transport prot 28.0 38 0.00083 26.2 1.6 26 47-72 68-93 (100)
129 COG1687 AzlD Predicted branche 27.8 68 0.0015 25.1 2.9 24 143-166 29-52 (106)
130 PF09716 ETRAMP: Malarial earl 27.8 59 0.0013 23.8 2.5 14 56-69 68-81 (84)
131 PRK10370 formate-dependent nit 27.6 2.9E+02 0.0063 22.4 6.8 31 117-147 148-178 (198)
132 PF13431 TPR_17: Tetratricopep 27.5 81 0.0018 18.7 2.7 25 102-133 9-33 (34)
133 PF12688 TPR_5: Tetratrico pep 27.2 1.3E+02 0.0028 23.1 4.5 31 115-145 40-70 (120)
134 PF12072 DUF3552: Domain of un 27.2 1.7E+02 0.0036 24.2 5.4 16 50-65 5-20 (201)
135 PF10828 DUF2570: Protein of u 27.2 2.7E+02 0.0059 20.9 7.2 100 49-165 5-105 (110)
136 PF13429 TPR_15: Tetratricopep 27.1 96 0.0021 25.6 3.9 36 117-152 150-185 (280)
137 PLN03098 LPA1 LOW PSII ACCUMUL 26.9 1.1E+02 0.0024 29.4 4.7 38 102-143 105-142 (453)
138 PF04415 DUF515: Protein of un 26.9 1.3E+02 0.0028 28.6 5.1 67 46-120 31-97 (416)
139 PRK11462 putative transporter; 26.5 58 0.0013 29.4 2.7 23 65-87 426-448 (460)
140 PRK11447 cellulose synthase su 26.4 2.2E+02 0.0048 29.3 7.0 27 119-145 391-417 (1157)
141 PF07862 Nif11: Nitrogen fixat 25.7 41 0.00088 21.6 1.2 12 72-83 14-25 (49)
142 KOG3289|consensus 25.2 31 0.00068 29.6 0.7 54 20-76 23-92 (199)
143 TIGR01162 purE phosphoribosyla 25.1 90 0.0019 25.7 3.3 20 71-90 127-146 (156)
144 KOG2072|consensus 24.8 3.4E+02 0.0075 28.5 7.9 89 63-155 186-280 (988)
145 PRK00888 ftsB cell division pr 24.7 1.4E+02 0.003 22.6 4.1 29 58-86 12-41 (105)
146 COG1575 MenA 1,4-dihydroxy-2-n 24.5 21 0.00047 32.3 -0.4 15 66-80 66-83 (303)
147 PF14938 SNAP: Soluble NSF att 24.4 63 0.0014 27.4 2.4 28 117-144 118-146 (282)
148 PF02526 GBP_repeat: Glycophor 24.3 41 0.00089 21.7 1.0 10 73-82 15-24 (38)
149 PRK15058 cytochrome b562; Prov 24.1 2.7E+02 0.0058 22.4 5.8 22 112-133 89-110 (128)
150 PRK10747 putative protoheme IX 23.9 5.2E+02 0.011 23.0 8.8 34 120-153 160-193 (398)
151 smart00745 MIT Microtubule Int 23.9 1E+02 0.0022 21.1 3.0 20 129-148 31-50 (77)
152 PF02234 CDI: Cyclin-dependent 23.8 58 0.0012 21.8 1.7 22 103-124 6-27 (51)
153 PF13480 Acetyltransf_6: Acety 23.7 73 0.0016 22.7 2.3 37 76-116 2-38 (142)
154 COG1453 Predicted oxidoreducta 23.6 48 0.001 31.2 1.6 49 66-119 55-105 (391)
155 PF00646 F-box: F-box domain; 23.6 62 0.0013 19.8 1.7 16 156-171 5-20 (48)
156 PF04341 DUF485: Protein of un 23.4 65 0.0014 23.6 2.0 18 72-89 2-19 (91)
157 PF06864 PAP_PilO: Pilin acces 23.3 1.5E+02 0.0032 27.1 4.7 9 133-141 274-282 (414)
158 COG4700 Uncharacterized protei 23.3 1.4E+02 0.0029 26.5 4.2 42 114-155 90-133 (251)
159 PF08631 SPO22: Meiosis protei 23.3 1.5E+02 0.0032 25.3 4.5 46 103-148 25-71 (278)
160 PF10883 DUF2681: Protein of u 23.1 75 0.0016 23.9 2.3 9 58-66 15-23 (87)
161 PF01034 Syndecan: Syndecan do 23.0 29 0.00064 24.9 0.1 9 58-66 26-34 (64)
162 PF13525 YfiO: Outer membrane 22.7 94 0.002 25.1 3.0 28 118-145 10-37 (203)
163 KOG1130|consensus 22.7 1.8E+02 0.004 28.5 5.3 60 110-169 14-87 (639)
164 PF14407 Frankia_peptide: Ribo 22.6 45 0.00096 23.8 1.0 12 73-84 11-22 (61)
165 PRK11788 tetratricopeptide rep 22.5 1.7E+02 0.0037 24.8 4.7 25 117-141 73-97 (389)
166 PF12777 MT: Microtubule-bindi 22.5 68 0.0015 28.6 2.4 23 56-84 316-338 (344)
167 PRK15179 Vi polysaccharide bio 22.4 2.1E+02 0.0046 28.4 5.9 36 115-150 156-191 (694)
168 PHA03065 Hypothetical protein; 22.4 87 0.0019 29.9 3.1 20 72-91 75-94 (438)
169 PRK10153 DNA-binding transcrip 22.3 4.6E+02 0.0099 25.0 7.9 77 53-145 376-452 (517)
170 PF14964 DUF4507: Domain of un 22.2 84 0.0018 29.2 2.9 30 129-159 325-361 (362)
171 PRK14750 kdpF potassium-transp 22.1 1E+02 0.0022 19.1 2.3 15 52-66 7-21 (29)
172 KOG0624|consensus 22.1 1.5E+02 0.0032 28.5 4.5 68 100-167 92-160 (504)
173 TIGR03798 ocin_TIGR03798 bacte 22.0 40 0.00086 23.0 0.6 15 69-83 9-23 (64)
174 TIGR02917 PEP_TPR_lipo putativ 21.8 1.7E+02 0.0037 26.7 4.8 34 117-150 60-93 (899)
175 TIGR03644 marine_trans_1 proba 21.6 97 0.0021 28.8 3.2 22 44-65 243-264 (404)
176 smart00299 CLH Clathrin heavy 21.4 2.4E+02 0.0051 20.9 4.8 39 126-167 95-133 (140)
177 PF12569 NARP1: NMDA receptor- 21.4 1.7E+02 0.0037 28.0 4.9 23 119-141 462-484 (517)
178 cd02577 PSTD1 PSTD1: Pseudouri 21.2 5.4E+02 0.012 23.1 7.8 15 59-73 133-147 (319)
179 COG1930 CbiN ABC-type cobalt t 21.0 34 0.00073 26.4 0.1 33 33-73 60-92 (97)
180 KOG0425|consensus 20.7 73 0.0016 26.8 2.0 20 70-89 145-164 (171)
181 KOG3060|consensus 20.7 1.1E+02 0.0024 27.7 3.3 37 118-154 193-233 (289)
182 PF11817 Foie-gras_1: Foie gra 20.5 96 0.0021 26.1 2.8 30 117-146 182-211 (247)
183 PF02158 Neuregulin: Neureguli 20.4 34 0.00073 32.3 0.0 33 51-88 15-48 (404)
184 PRK12370 invasion protein regu 20.4 1.5E+02 0.0033 27.7 4.3 35 117-151 342-376 (553)
185 PF08690 GET2: GET complex sub 20.3 69 0.0015 28.7 1.9 13 78-90 7-19 (302)
186 KOG4162|consensus 20.1 1.4E+02 0.0029 30.7 4.0 27 117-143 758-784 (799)
187 PF13512 TPR_18: Tetratricopep 20.0 1.4E+02 0.0031 24.1 3.5 34 115-148 49-82 (142)
No 1
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=100.00 E-value=9.3e-60 Score=366.74 Aligned_cols=121 Identities=50% Similarity=0.899 Sum_probs=68.5
Q ss_pred HHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChh
Q psy14697 51 TAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLD 130 (175)
Q Consensus 51 ~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~ 130 (175)
+++|++|++|+|||||||||||||||||||||+|||+++++++.+....+|+++|++++|+|||+||++||+|+++|+++
T Consensus 1 ~~ag~a~~~~lgYciYFD~KRR~dP~frkkL~~rrk~~~~~~~~~~~~~~~~~~d~~~~e~~Fl~qV~lGE~L~~~G~~~ 80 (121)
T PF02064_consen 1 IAAGVAAAAFLGYCIYFDYKRRSDPDFRKKLRERRKKQAKAQEQAKSEKLPDPNDPEEMERFFLQQVQLGEQLLAQGDYE 80 (121)
T ss_dssp ------------------------------------------------SSSHHHHHHHHHHHHHHHHHHHHHHHHTT-HH
T ss_pred ChHHHHHHHHHHHHhhcccccccChHHHHHHHHHHHHHHHHHhhhhcccccCcCCHHHHHHHHHHHHHHHHHHHhCCCHH
Confidence 47888999999999999999999999999999999988876556677889999999999999999999999999999999
Q ss_pred hHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHHhcCC
Q psy14697 131 NGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIEKLPP 171 (175)
Q Consensus 131 eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~kLp~ 171 (175)
+|++||||||+|||||++||+|||+||||+||++|+++||+
T Consensus 81 ~aa~hf~nAl~V~~qP~~LL~i~q~tlP~~vf~~l~~~l~e 121 (121)
T PF02064_consen 81 EAAEHFYNALKVCPQPAELLQIYQKTLPPPVFEMLVQKLPE 121 (121)
T ss_dssp HHHHHHHHHHHTSSSHHHHHHHHHHHS-HHHHHHHHHTS--
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHHhhCCHHHHHHHHHHcCC
Confidence 99999999999999999999999999999999999999996
No 2
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=100.00 E-value=2.5e-58 Score=369.53 Aligned_cols=127 Identities=45% Similarity=0.746 Sum_probs=116.3
Q ss_pred hhhhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHH--------HH---------hhcCCCCCCCCh
Q psy14697 44 ISGNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELL--------QQ---------KNKRGIPDLKDH 106 (175)
Q Consensus 44 ~~~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~--------~~---------a~~~~~p~~~d~ 106 (175)
++++++++++|++|++|+||||||||||||||||||+||+|||++++++ ++ ++....|+|+|+
T Consensus 4 ~~~~~~~~~ag~a~~~flgYciYFD~KRR~dPdFRkkLr~rr~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~p~d~ 83 (148)
T TIGR00985 4 SNKSNVVIAAGIAAAAFLGYAIYFDYKRRNDPDFRKKLRRRRKKQAGAEKQYGGLAKEKQRQRIRPAAAGLAKAPDPTDP 83 (148)
T ss_pred chHHHHHHHHHHHHHHHHHHHHhhhhhhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCH
Confidence 5677888999999999999999999999999999999999998877431 10 234568999999
Q ss_pred HHHHHHHHHHHHhHhHHhhcC-ChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHHhcC
Q psy14697 107 SAVQQYFLQEIQLGETLLAAG-DLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIEKLP 170 (175)
Q Consensus 107 e~~E~fFl~eV~lGE~L~a~G-~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~kLp 170 (175)
+++|+|||+||++||+|+++| +.++|+.||||||+|||||++||+|||+|||++||+|||.++|
T Consensus 84 ~e~E~~Fl~eV~~GE~L~~~g~~~~ega~hf~nAl~Vc~qP~~LL~iyq~tlP~~Vf~~lia~~p 148 (148)
T TIGR00985 84 SEKEAFFLQEVQLGEELMAQGTNVDEGAVHFYNALKVYPQPQQLLSIYQQTLPEPIFRLMIAQLP 148 (148)
T ss_pred HHHHHHHHHHHHHHHHHHhCCCchHHHHHHHHHHHHhCCCHHHHHHHHHhhCCHHHHHHHHhhcC
Confidence 999999999999999999999 8999999999999999999999999999999999999999987
No 3
>KOG4056|consensus
Probab=100.00 E-value=8.7e-56 Score=352.09 Aligned_cols=132 Identities=50% Similarity=0.831 Sum_probs=121.1
Q ss_pred hhhhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHH---HHHhhcCC---CCCCCChHHHHHHHHHHH
Q psy14697 44 ISGNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNREL---LQQKNKRG---IPDLKDHSAVQQYFLQEI 117 (175)
Q Consensus 44 ~~~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~---~~~a~~~~---~p~~~d~e~~E~fFl~eV 117 (175)
...+++.+++|+||++|+||||||||||||||+|||+||+||+++..+ +++++.++ .|||+|++++|+|||+||
T Consensus 6 ~v~~~~vI~agiag~af~gYciYFd~KRrsdP~fk~~lr~rrk~q~~ae~a~~~~a~sk~~~~pd~~d~~~~E~~Fmqqv 85 (143)
T KOG4056|consen 6 PVLRTSVIAAGIAGLAFIGYCIYFDKKRRSDPDFKKKLRERRKKQASAELAKERAAASKADNIPDPSDAEEVEKFFMQQV 85 (143)
T ss_pred chhhhHHHHHHHHHHHHHHHHhhcccccccChhHHHHHHHHHHHHhhcccccccchhhccccCCCCCCHHHHHHHHHHHH
Confidence 445667799999999999999999999999999999999999998843 34444555 999999999999999999
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHHhcCCCCCC
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIEKLPPAGIV 175 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~kLp~~~i~ 175 (175)
++||+|+++|++++||+||||||.|||||++||+|||+||||+||++|+.+||.+.++
T Consensus 86 ~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL~vlq~tlp~~if~~i~~~l~~~~~~ 143 (143)
T KOG4056|consen 86 QLGEELLAQGNEEEGAEHLANAIVVCGQPAQLLQVLQQTLPEAIFAMLLKKLPSASER 143 (143)
T ss_pred HhHHHHHHccCHHHHHHHHHHHHhhcCCHHHHHHHHHhhCCHHHHHHHHHHhhcccCC
Confidence 9999999999999999999999999999999999999999999999999999998753
No 4
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=97.47 E-value=0.00049 Score=63.92 Aligned_cols=45 Identities=11% Similarity=0.200 Sum_probs=39.6
Q ss_pred ChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697 105 DHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL 149 (175)
Q Consensus 105 d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L 149 (175)
++++++.+...-..+|..+...|++++|+.+|.+||...|+|.-.
T Consensus 119 ~~~~~~~~a~~~k~~G~~~~~~~~~~~Ai~~y~~al~~~p~~~~~ 163 (615)
T TIGR00990 119 SEEERKKYAAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPVYY 163 (615)
T ss_pred CHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCchHHH
Confidence 455778888888999999999999999999999999999987543
No 5
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=94.54 E-value=0.061 Score=31.34 Aligned_cols=29 Identities=24% Similarity=0.410 Sum_probs=25.5
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|..+...|++++|+.+|.+||.+.|+
T Consensus 5 ~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 33 (34)
T PF00515_consen 5 YNLGNAYFQLGDYEEALEYYQRALELDPD 33 (34)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHhCCchHHHHHHHHHHHHCcC
Confidence 46899999999999999999999999885
No 6
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=94.04 E-value=0.083 Score=30.28 Aligned_cols=29 Identities=21% Similarity=0.384 Sum_probs=25.7
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|..+...|++++|..||-+|+..+|+
T Consensus 5 ~~lg~~~~~~~~~~~A~~~~~~al~l~p~ 33 (34)
T PF07719_consen 5 YYLGQAYYQLGNYEEAIEYFEKALELDPN 33 (34)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHCcC
Confidence 46899999999999999999999999986
No 7
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=90.73 E-value=0.64 Score=29.00 Aligned_cols=37 Identities=19% Similarity=0.199 Sum_probs=32.6
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL 153 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy 153 (175)
..+|+.+...|++++|...|-++|+.+|+-.+....|
T Consensus 5 ~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~L 41 (44)
T PF13428_consen 5 LALARAYRRLGQPDEAERLLRRALALDPDDPEAWRAL 41 (44)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHh
Confidence 5789999999999999999999999999887766544
No 8
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=90.34 E-value=0.36 Score=24.76 Aligned_cols=29 Identities=31% Similarity=0.460 Sum_probs=25.0
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|..+...|++++|+.+|.+++.+.|+
T Consensus 5 ~~~a~~~~~~~~~~~a~~~~~~~~~~~~~ 33 (34)
T smart00028 5 YNLGNAYLKLGDYDEALEYYEKALELDPN 33 (34)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHccCCC
Confidence 46788899999999999999999988653
No 9
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=90.21 E-value=0.66 Score=30.61 Aligned_cols=39 Identities=18% Similarity=0.267 Sum_probs=34.6
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ 155 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq 155 (175)
...|..+...|++++|...|-+++..+|++.+...+.++
T Consensus 33 ~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~~~a~ 71 (73)
T PF13371_consen 33 LQRARCLFQLGRYEEALEDLERALELSPDDPDARALRAM 71 (73)
T ss_pred HHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHHHHHh
Confidence 347889999999999999999999999999998887654
No 10
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=89.45 E-value=0.22 Score=30.13 Aligned_cols=32 Identities=25% Similarity=0.468 Sum_probs=26.2
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
..+|..+...|++++|++.+-+||.+..+|.+
T Consensus 3 ~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~~~~~ 34 (36)
T PF13176_consen 3 NNLGRIYRQQGDYEKAIEYYEQALALARDPED 34 (36)
T ss_dssp HHHHHHHHHCT-HHHHHHHHHHHHHHHHHCT-
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHhcccccC
Confidence 36899999999999999999999988776653
No 11
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=89.39 E-value=0.58 Score=26.88 Aligned_cols=28 Identities=25% Similarity=0.395 Sum_probs=25.5
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCG 144 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcp 144 (175)
..+|..+...|++++|..+|-+|+.+.|
T Consensus 5 ~~lg~~y~~~~~~~~A~~~~~~a~~~~~ 32 (34)
T PF13181_consen 5 YNLGKIYEQLGDYEEALEYFEKALELNP 32 (34)
T ss_dssp HHHHHHHHHTTSHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHhhCC
Confidence 4689999999999999999999999876
No 12
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=88.33 E-value=0.78 Score=30.02 Aligned_cols=34 Identities=18% Similarity=0.375 Sum_probs=28.8
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
..+|..+...|++++|+.+|.+||.+.|+=.+++
T Consensus 7 ~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~ 40 (69)
T PF13414_consen 7 YNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAY 40 (69)
T ss_dssp HHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHH
Confidence 4678999999999999999999999988655543
No 13
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=88.33 E-value=1.1 Score=32.79 Aligned_cols=47 Identities=26% Similarity=0.400 Sum_probs=36.5
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCCh------HHHHHHHhh
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQP------NQLLGVLQQ 155 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP------~~LL~IyQq 155 (175)
+|+|++++ .++++.++..|++++|.++|-..++.-+.. ..||.||+-
T Consensus 18 ~P~D~~ar-------~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~~~~~ar~~ll~~f~~ 70 (90)
T PF14561_consen 18 NPDDLDAR-------YALADALLAAGDYEEALDQLLELVRRDRDYEDDAARKRLLDIFEL 70 (90)
T ss_dssp STT-HHHH-------HHHHHHHHHTT-HHHHHHHHHHHHCC-TTCCCCHHHHHHHHHHHH
T ss_pred CCCCHHHH-------HHHHHHHHHCCCHHHHHHHHHHHHHhCccccccHHHHHHHHHHHH
Confidence 46787766 678999999999999999999999987766 677777763
No 14
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=83.98 E-value=13 Score=30.30 Aligned_cols=39 Identities=31% Similarity=0.388 Sum_probs=33.3
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ 155 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq 155 (175)
..+|..+...|++++|+.+|.+|++.-|+-.+++.-|-.
T Consensus 77 ~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~ 115 (198)
T PRK10370 77 ALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALAT 115 (198)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence 668999999999999999999999999987777655433
No 15
>PRK11677 hypothetical protein; Provisional
Probab=83.73 E-value=9.5 Score=30.60 Aligned_cols=36 Identities=17% Similarity=0.216 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHH-HHHhcC
Q psy14697 107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLAN-ALTVCG 144 (175)
Q Consensus 107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~n-Al~Vcp 144 (175)
.++...|.+-.+|=+.|. .+|.+=-.|+++ |-.+||
T Consensus 50 qeV~~HFa~TA~Ll~~L~--~~Y~~Ly~HlA~~s~~Llp 86 (134)
T PRK11677 50 QELVSHFARSAELLDTMA--KDYRQLYQHMAKSSSELLP 86 (134)
T ss_pred HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHcc
Confidence 357889998888877776 467788889977 667777
No 16
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=83.23 E-value=1 Score=30.46 Aligned_cols=25 Identities=8% Similarity=0.215 Sum_probs=18.0
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhee
Q psy14697 43 LISGNKVVTAVAVGAAVVVGYCLYF 67 (175)
Q Consensus 43 m~~~~~~~~~agvaga~flGYcIYF 67 (175)
|-+.+.+++..+....+++||+||=
T Consensus 4 me~A~~~~i~i~~lL~~~TgyaiYt 28 (46)
T PRK13183 4 MSPALSLAITILAILLALTGFGIYT 28 (46)
T ss_pred cchhHHHHHHHHHHHHHHhhheeee
Confidence 3456666666666667899999993
No 17
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=79.28 E-value=2.6 Score=27.30 Aligned_cols=29 Identities=24% Similarity=0.326 Sum_probs=25.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|..+..+|++++|+..|-.++...|+
T Consensus 35 ~~lg~~~~~~g~~~~A~~~~~~a~~~~P~ 63 (65)
T PF13432_consen 35 YLLGRILYQQGRYDEALAYYERALELDPD 63 (65)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 56899999999999999999999999886
No 18
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=79.07 E-value=7.1 Score=25.17 Aligned_cols=41 Identities=20% Similarity=0.225 Sum_probs=31.8
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL 149 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L 149 (175)
+|.|++.+ +.+++.++..|++++|...|-+++...|+-.+.
T Consensus 21 ~p~~~~~~-------~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~ 61 (68)
T PF14559_consen 21 NPDNPEAR-------LLLAQCYLKQGQYDEAEELLERLLKQDPDNPEY 61 (68)
T ss_dssp TTTSHHHH-------HHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHH
T ss_pred CCCCHHHH-------HHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHH
Confidence 34566655 468999999999999999999999999984333
No 19
>PF04689 S1FA: DNA binding protein S1FA; InterPro: IPR006779 S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=78.97 E-value=2.1 Score=31.07 Aligned_cols=38 Identities=29% Similarity=0.530 Sum_probs=21.2
Q ss_pred HHHHHHHHHHhh-hhheeccC-----CCCChhhHHHhHHHHHHh
Q psy14697 51 TAVAVGAAVVVG-YCLYFDKK-----RRSDPLFKEKLKERRRKN 88 (175)
Q Consensus 51 ~~agvaga~flG-YcIYFD~K-----RR~dP~FRKkLrerRrk~ 88 (175)
.+++.+.+.++| |.+|.=-| |...|--+||++++|-+|
T Consensus 19 vV~g~ll~flvGnyvlY~Yaqk~lpp~kkkpvskkk~k~e~lkq 62 (69)
T PF04689_consen 19 VVAGLLLVFLVGNYVLYVYAQKTLPPKKKKPVSKKKMKRERLKQ 62 (69)
T ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccHHHHHHHHHhc
Confidence 455555566666 77775433 344555566666555443
No 20
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=78.72 E-value=3.5 Score=26.69 Aligned_cols=34 Identities=21% Similarity=0.261 Sum_probs=29.1
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQPNQLLG 151 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~ 151 (175)
++|..+..+|++++|..+|-.++...|+=.+.+.
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~ 35 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQALKQDPDNPEAWY 35 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHHCCSTTHHHHHH
T ss_pred hHHHHHHHcCCHHHHHHHHHHHHHHCCCCHHHHH
Confidence 5789999999999999999999999987555443
No 21
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=78.16 E-value=13 Score=29.00 Aligned_cols=36 Identities=25% Similarity=0.290 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHH-HhcCC
Q psy14697 108 AVQQYFLQEIQLGETLLAAGDLDNGVEHLANAL-TVCGQ 145 (175)
Q Consensus 108 ~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl-~Vcpq 145 (175)
++..+|.+..++-..|. .+|.+--.|+++.- .+||+
T Consensus 47 ~V~~HF~~ta~Ll~~l~--~~Y~~l~~Hla~~a~~Ll~~ 83 (128)
T PF06295_consen 47 EVNDHFAQTAELLDNLT--QDYQKLYQHLAKGAEELLPD 83 (128)
T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhccC
Confidence 58999999998888886 45777777887753 44553
No 22
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=78.09 E-value=34 Score=30.53 Aligned_cols=45 Identities=18% Similarity=0.226 Sum_probs=31.1
Q ss_pred CChHHHHHHHHHH------------HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 104 KDHSAVQQYFLQE------------IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 104 ~d~e~~E~fFl~e------------V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
.|.+...+||.+- +..++.++.+|++++|...|...+...|+-.+
T Consensus 132 g~~~~A~~~l~~a~~~~p~~~l~~~~~~a~l~l~~~~~~~Al~~l~~l~~~~P~~~~ 188 (409)
T TIGR00540 132 GDEARANQHLEEAAELAGNDNILVEIARTRILLAQNELHAARHGVDKLLEMAPRHKE 188 (409)
T ss_pred CCHHHHHHHHHHHHHhCCcCchHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHH
Confidence 5666666676552 22366666678888888888888888886553
No 23
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=77.70 E-value=12 Score=30.43 Aligned_cols=49 Identities=14% Similarity=0.301 Sum_probs=40.7
Q ss_pred HHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCC
Q psy14697 110 QQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLP 158 (175)
Q Consensus 110 E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLP 158 (175)
|.-=+-...+|+.+...|+.++|+.++.++..=|..|...++++=..+-
T Consensus 33 esir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~ir 81 (177)
T PF10602_consen 33 ESIRMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIR 81 (177)
T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHH
Confidence 3333456789999999999999999999999999999999988765443
No 24
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=77.66 E-value=2.8 Score=23.40 Aligned_cols=29 Identities=17% Similarity=0.301 Sum_probs=25.6
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|..+...|++++|...|-+.+.-||+
T Consensus 4 ~~~a~~~~~~g~~~~A~~~~~~~~~~~P~ 32 (33)
T PF13174_consen 4 YRLARCYYKLGDYDEAIEYFQRLIKRYPD 32 (33)
T ss_dssp HHHHHHHHHHCHHHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHccCHHHHHHHHHHHHHHCcC
Confidence 46788888889999999999999999986
No 25
>CHL00020 psbN photosystem II protein N
Probab=77.62 E-value=1.8 Score=28.83 Aligned_cols=23 Identities=22% Similarity=0.291 Sum_probs=17.0
Q ss_pred hhhHHHHHHHHHHHHHhhhhhee
Q psy14697 45 SGNKVVTAVAVGAAVVVGYCLYF 67 (175)
Q Consensus 45 ~~~~~~~~agvaga~flGYcIYF 67 (175)
+.+.+++..+....+++||+||=
T Consensus 3 ~A~~~~i~i~~ll~~~Tgy~iYt 25 (43)
T CHL00020 3 TATLVAIFISGLLVSFTGYALYT 25 (43)
T ss_pred chhhHHHHHHHHHHHhhheeeee
Confidence 45566666666667899999993
No 26
>PF10841 DUF2644: Protein of unknown function (DUF2644); InterPro: IPR020300 This entry is represented by Bacteriophage PY100, Orf49. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry contains membrane proteins with no known function.
Probab=75.65 E-value=2.9 Score=29.67 Aligned_cols=25 Identities=24% Similarity=0.382 Sum_probs=18.1
Q ss_pred hhhHHHHHHHHHHHHHhhhhheecc
Q psy14697 45 SGNKVVTAVAVGAAVVVGYCLYFDK 69 (175)
Q Consensus 45 ~~~~~~~~agvaga~flGYcIYFD~ 69 (175)
+...+-....++.++++-||+|.||
T Consensus 13 TT~~iQffg~lv~agil~~~vy~dr 37 (60)
T PF10841_consen 13 TTAFIQFFGALVMAGILIYCVYLDR 37 (60)
T ss_pred hHHHHHHHHHHHHHHHHHhheeeCC
Confidence 3444556666666789999999995
No 27
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=74.72 E-value=7.1 Score=23.86 Aligned_cols=31 Identities=16% Similarity=0.263 Sum_probs=22.3
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPN 147 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~ 147 (175)
..+|..+...|++++|..+|-+++...|...
T Consensus 38 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 68 (100)
T cd00189 38 YNLAAAYYKLGKYEEALEDYEKALELDPDNA 68 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCcch
Confidence 4566777777777778888888777776655
No 28
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=72.77 E-value=4.6 Score=23.44 Aligned_cols=27 Identities=26% Similarity=0.370 Sum_probs=21.9
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVC 143 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vc 143 (175)
-.+|..+..+|++++|...+-.|+.++
T Consensus 6 ~~la~~~~~~g~~~~A~~~~~~al~~~ 32 (42)
T PF13374_consen 6 NNLANAYRAQGRYEEALELLEEALEIR 32 (42)
T ss_dssp HHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhhhhcchhhHHHHHHHHHH
Confidence 467888999999999999998887754
No 29
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=70.52 E-value=3.6 Score=27.43 Aligned_cols=24 Identities=13% Similarity=0.329 Sum_probs=17.6
Q ss_pred hhhHHHHHHHHHHHHHhhhhheec
Q psy14697 45 SGNKVVTAVAVGAAVVVGYCLYFD 68 (175)
Q Consensus 45 ~~~~~~~~agvaga~flGYcIYFD 68 (175)
+.+.+++..++...+++||+||.-
T Consensus 3 ~a~~~~i~i~~~lv~~Tgy~iYta 26 (43)
T PF02468_consen 3 TATVLAIFISCLLVSITGYAIYTA 26 (43)
T ss_pred ceeeHHHHHHHHHHHHHhhhhhhe
Confidence 455666666666678999999954
No 30
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=70.45 E-value=9.4 Score=26.35 Aligned_cols=33 Identities=18% Similarity=0.187 Sum_probs=16.3
Q ss_pred hHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697 119 LGETLLAAGDLDNGVEHLANALTVCGQPNQLLG 151 (175)
Q Consensus 119 lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~ 151 (175)
+|+.+...|+.++|..++-.++..+|+.....+
T Consensus 82 ~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~ 114 (119)
T TIGR02795 82 LGMSLQELGDKEKAKATLQQVIKRYPGSSAAKL 114 (119)
T ss_pred HHHHHHHhCChHHHHHHHHHHHHHCcCChhHHH
Confidence 344444445555555555555555555444433
No 31
>KOG4648|consensus
Probab=69.45 E-value=2.9 Score=39.55 Aligned_cols=30 Identities=20% Similarity=0.461 Sum_probs=27.4
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcC-ChH
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCG-QPN 147 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~Vcp-qP~ 147 (175)
+.|.....||.++||+.|+.++|.++| +|.
T Consensus 102 E~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV 132 (536)
T KOG4648|consen 102 ERGNTYFKQGKYEEAIDCYSTAIAVYPHNPV 132 (536)
T ss_pred HhhhhhhhccchhHHHHHhhhhhccCCCCcc
Confidence 468899999999999999999999999 774
No 32
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=69.28 E-value=6.4 Score=25.41 Aligned_cols=28 Identities=32% Similarity=0.392 Sum_probs=20.8
Q ss_pred HhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 123 LLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 123 L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
|+.+|++++|+.+|-+++...|+-.++.
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p~~~~~~ 28 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNPDNPEAR 28 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTTTSHHHH
T ss_pred ChhccCHHHHHHHHHHHHHHCCCCHHHH
Confidence 4567888888888888888888655443
No 33
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=68.54 E-value=6.4 Score=25.03 Aligned_cols=25 Identities=28% Similarity=0.419 Sum_probs=22.1
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALT 141 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~ 141 (175)
..+||--+-.++++.|+..|-+||.
T Consensus 5 ~~Lgeisle~e~f~qA~~D~~~aL~ 29 (38)
T PF10516_consen 5 DLLGEISLENENFEQAIEDYEKALE 29 (38)
T ss_pred HHHHHHHHHhccHHHHHHHHHHHHH
Confidence 4689988888999999999999875
No 34
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=66.63 E-value=4.1 Score=32.23 Aligned_cols=20 Identities=20% Similarity=0.438 Sum_probs=13.8
Q ss_pred hhHHHHHHHHHHH-HHhhhhh
Q psy14697 46 GNKVVTAVAVGAA-VVVGYCL 65 (175)
Q Consensus 46 ~~~~~~~agvaga-~flGYcI 65 (175)
..+++++||++|. +++.|||
T Consensus 68 ~Ii~gv~aGvIg~Illi~y~i 88 (122)
T PF01102_consen 68 GIIFGVMAGVIGIILLISYCI 88 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred ehhHHHHHHHHHHHHHHHHHH
Confidence 4456677777764 6778988
No 35
>COG4968 PilE Tfp pilus assembly protein PilE [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=66.08 E-value=15 Score=29.99 Aligned_cols=33 Identities=15% Similarity=0.382 Sum_probs=21.0
Q ss_pred hhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHH
Q psy14697 75 PLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQE 116 (175)
Q Consensus 75 P~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~e 116 (175)
|.|+.-++|-||.+.++.= ..+...+|+||..|
T Consensus 29 PSY~~yv~rs~R~~a~A~L---------~~~a~~~Er~y~~q 61 (139)
T COG4968 29 PSYQNYVLRSRRSAAKAAL---------LENAQFMERYYALQ 61 (139)
T ss_pred HhHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHc
Confidence 8888888887777765421 11234578888544
No 36
>KOG4234|consensus
Probab=65.37 E-value=13 Score=33.03 Aligned_cols=53 Identities=17% Similarity=0.332 Sum_probs=39.9
Q ss_pred CCCCCChHHHHHHHHHHH--------------------HhHhHHhhcCChhhHHHHHHHHHHhcCC-hHHHHHH
Q psy14697 100 IPDLKDHSAVQQYFLQEI--------------------QLGETLLAAGDLDNGVEHLANALTVCGQ-PNQLLGV 152 (175)
Q Consensus 100 ~p~~~d~e~~E~fFl~eV--------------------~lGE~L~a~G~~~eaa~hf~nAl~Vcpq-P~~LL~I 152 (175)
-|+|.+.|-.+.+.|.++ ..|..|...|++++|...+..||..||. |+++-.|
T Consensus 62 se~~~~SEl~eEeLmae~E~i~~deek~k~~~kad~lK~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsI 135 (271)
T KOG4234|consen 62 SEEPQHSELKEEELMAEIEKIFSDEEKDKAIEKADSLKKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSI 135 (271)
T ss_pred CCCccchhhhHHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHhCccccHHHHHH
Confidence 367778777777777332 2478888999999999999999999986 4444443
No 37
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=64.07 E-value=30 Score=31.13 Aligned_cols=39 Identities=26% Similarity=0.347 Sum_probs=34.0
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ 155 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq 155 (175)
..||+..+..|+..+|+..+.||++.-|.-.+++.-|=.
T Consensus 160 ~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~ae 198 (287)
T COG4235 160 DLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAE 198 (287)
T ss_pred HHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence 569999999999999999999999999987777766543
No 38
>KOG0548|consensus
Probab=62.97 E-value=6.4 Score=38.23 Aligned_cols=33 Identities=12% Similarity=0.345 Sum_probs=30.0
Q ss_pred HHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 113 FLQEIQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 113 Fl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
|.+..+.|-..++.|+++.|+.||.+||.+-|.
T Consensus 2 a~e~k~kgnaa~s~~d~~~ai~~~t~ai~l~p~ 34 (539)
T KOG0548|consen 2 AVELKEKGNAAFSSGDFETAIRLFTEAIMLSPT 34 (539)
T ss_pred hhHHHHHHHhhcccccHHHHHHHHHHHHccCCC
Confidence 456788899999999999999999999999988
No 39
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=61.41 E-value=7.1 Score=26.06 Aligned_cols=33 Identities=24% Similarity=0.290 Sum_probs=28.2
Q ss_pred HHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697 111 QYFLQEIQLGETLLAAGDLDNGVEHLANALTVC 143 (175)
Q Consensus 111 ~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vc 143 (175)
....--..+|+.+...|++++|..+|-+|+.++
T Consensus 44 ~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~ 76 (78)
T PF13424_consen 44 DTANTLNNLGECYYRLGDYEEALEYYQKALDIF 76 (78)
T ss_dssp HHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhh
Confidence 345566789999999999999999999999876
No 40
>KOG1174|consensus
Probab=61.05 E-value=12 Score=36.26 Aligned_cols=39 Identities=21% Similarity=0.232 Sum_probs=33.5
Q ss_pred HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHh
Q psy14697 116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQ 154 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQ 154 (175)
++.+|.-+.+++++.+|.+||+.||..=||-+.-+.=+|
T Consensus 474 H~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~~~sl~Gl~ 512 (564)
T KOG1174|consen 474 HNHLGDIMRAQNEPQKAMEYYYKALRQDPKSKRTLRGLR 512 (564)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcCccchHHHHHHH
Confidence 478999999999999999999999999999887666554
No 41
>PF08572 PRP3: pre-mRNA processing factor 3 (PRP3); InterPro: IPR013881 Pre-mRNA processing factor 3 (PRP3) is a U4/U6-associated splicing factor. The human PRP3 has been implicated in autosomal retinitis pigmentosa [].
Probab=60.61 E-value=25 Score=30.23 Aligned_cols=44 Identities=16% Similarity=0.325 Sum_probs=26.2
Q ss_pred HHHhHHHHHHhHHHH--HHh--hcCCCCC----------------CCChHHHHHHHHHHHHhHh
Q psy14697 78 KEKLKERRRKNRELL--QQK--NKRGIPD----------------LKDHSAVQQYFLQEIQLGE 121 (175)
Q Consensus 78 RKkLrerRrk~~~~~--~~a--~~~~~p~----------------~~d~e~~E~fFl~eV~lGE 121 (175)
|||||++||+.+..+ ++. ++..-|. -.||.++|.-.-+||+.-.
T Consensus 146 rKKlRr~rR~e~~kEkq~kIrlGL~ppP~PKVKlSNLMrVL~~eAV~DPT~vE~~Vr~Q~eeR~ 209 (223)
T PF08572_consen 146 RKKLRRQRRQEKQKEKQDKIRLGLEPPPPPKVKLSNLMRVLGNEAVQDPTKVEAKVRKQMEERQ 209 (223)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCCCCcccHHHHHHHhhcchhcCcHHHHHHHHHHHHHHH
Confidence 789998776654332 222 2222121 3678888888888877543
No 42
>cd08323 CARD_APAF1 Caspase activation and recruitment domain similar to that found in Apoptotic Protease-Activating Factor 1. Caspase activation and recruitment domain (CARD) similar to that found in apoptotic protease-activating factor 1 (APAF-1), which is an activator of caspase-9. APAF-1 contains WD-40 repeats, a CARD, and an ATPase domain. Upon stimulation, APAF-1, together with caspase-9, forms the heptameric 'apoptosome', which leads to the processing and activation of caspase-9, starting a caspase cascade which leads to apoptosis. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effect
Probab=60.48 E-value=20 Score=26.43 Aligned_cols=40 Identities=23% Similarity=0.301 Sum_probs=29.9
Q ss_pred HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCC
Q psy14697 116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLP 158 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLP 158 (175)
-.++=..|...|+ .|..+|++||.=.++| .|-.+++..+|
T Consensus 47 a~~Lld~L~trG~--~Af~~F~~aL~~~~~~-~La~lL~~~~~ 86 (86)
T cd08323 47 AVMLINMILTKDN--HAYVSFYNALLHEGYK-DLALLLHDAIP 86 (86)
T ss_pred HHHHHHHHHhcCH--HHHHHHHHHHHhcCCh-HHHHHHhccCC
Confidence 3445566677887 4899999999965555 78888888776
No 43
>PF12868 DUF3824: Domain of unknwon function (DUF3824); InterPro: IPR024436 This repeating domain is proline-rich but its function is unknown.
Probab=59.75 E-value=2.9 Score=33.62 Aligned_cols=18 Identities=17% Similarity=0.040 Sum_probs=8.1
Q ss_pred hhHHHHHHHHHHHHHhhh
Q psy14697 46 GNKVVTAVAVGAAVVVGY 63 (175)
Q Consensus 46 ~~~~~~~agvaga~flGY 63 (175)
+.+.++++|+||+++.+.
T Consensus 7 R~la~~aLaAAG~G~AA~ 24 (137)
T PF12868_consen 7 RDLAEAALAAAGAGYAAH 24 (137)
T ss_pred HHHhHHHHHHHHHHHHHH
Confidence 344444445555444443
No 44
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=59.38 E-value=27 Score=27.11 Aligned_cols=36 Identities=19% Similarity=0.260 Sum_probs=23.1
Q ss_pred HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697 116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLG 151 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~ 151 (175)
...+|..+...|++++|..++-+|+...|+-...+.
T Consensus 75 ~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~ 110 (172)
T PRK02603 75 LYNMGIIYASNGEHDKALEYYHQALELNPKQPSALN 110 (172)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHH
Confidence 345566666667777777777777777665444433
No 45
>KOG3119|consensus
Probab=59.11 E-value=5.2 Score=34.97 Aligned_cols=28 Identities=21% Similarity=0.502 Sum_probs=23.1
Q ss_pred heeccCCCCChhhHHHhHHHHHHhHHHH
Q psy14697 65 LYFDKKRRSDPLFKEKLKERRRKNRELL 92 (175)
Q Consensus 65 IYFD~KRR~dP~FRKkLrerRrk~~~~~ 92 (175)
-||+++|||+..-||...++|+++.+.+
T Consensus 194 ~y~err~rNN~A~~kSR~~~k~~~~e~~ 221 (269)
T KOG3119|consen 194 EYKERRRRNNEAVRKSRDKRKQKEDEMA 221 (269)
T ss_pred HHHHHHHhhhHHHHHhhhhHHHHHHHHH
Confidence 5999999999999999887777766544
No 46
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=59.02 E-value=8.1 Score=36.76 Aligned_cols=41 Identities=20% Similarity=0.370 Sum_probs=33.4
Q ss_pred HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697 107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPN 147 (175)
Q Consensus 107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~ 147 (175)
++...+....+.+|-.|...|++++|+.+|-+||..-|+..
T Consensus 69 ~~dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~a 109 (453)
T PLN03098 69 EADVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPD 109 (453)
T ss_pred cCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCch
Confidence 34445555668889999999999999999999999988865
No 47
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=58.85 E-value=20 Score=27.51 Aligned_cols=38 Identities=21% Similarity=0.195 Sum_probs=29.6
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
+|.|++.. ..+|..+...|++++|..+|-.||.+.|+=
T Consensus 88 ~p~~~~a~-------~~lg~~l~~~g~~~eAi~~~~~Al~~~p~~ 125 (144)
T PRK15359 88 DASHPEPV-------YQTGVCLKMMGEPGLAREAFQTAIKMSYAD 125 (144)
T ss_pred CCCCcHHH-------HHHHHHHHHcCCHHHHHHHHHHHHHhCCCC
Confidence 45665554 467888888899999999999999988863
No 48
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=58.69 E-value=10 Score=23.16 Aligned_cols=30 Identities=20% Similarity=0.276 Sum_probs=25.2
Q ss_pred HHHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 116 EIQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
...+|..+...|++++|..+|.+++..+|+
T Consensus 71 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 100 (100)
T cd00189 71 YYNLGLAYYKLGKYEEALEAYEKALELDPN 100 (100)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHccCCC
Confidence 356778888889999999999999988874
No 49
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=57.64 E-value=47 Score=34.01 Aligned_cols=48 Identities=10% Similarity=0.079 Sum_probs=35.9
Q ss_pred CCCChHHHHHHHH---------HHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697 102 DLKDHSAVQQYFL---------QEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL 149 (175)
Q Consensus 102 ~~~d~e~~E~fFl---------~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L 149 (175)
.|.+++..+.... +.++++..+...|++++|+.+|-+++.--|.+.++
T Consensus 92 ~P~~~~~~~~~~~~~~~~~~~~~~l~~A~ll~~~g~~~eA~~~~~~~l~~~p~~~~l 148 (1157)
T PRK11447 92 APDSNAYRSSRTTMLLSTPEGRQALQQARLLATTGRTEEALASYDKLFNGAPPELDL 148 (1157)
T ss_pred CCCChHHHHHHHHHHhcCCchhhHHHHHHHHHhCCCHHHHHHHHHHHccCCCCChHH
Confidence 4666665443322 34888999999999999999999999987766554
No 50
>KOG0624|consensus
Probab=57.40 E-value=20 Score=34.10 Aligned_cols=47 Identities=28% Similarity=0.593 Sum_probs=39.0
Q ss_pred CCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697 103 LKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ 155 (175)
Q Consensus 103 ~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq 155 (175)
..+++++|+ +.++|.+|+++|.+.+|+.||-.||. ++|..-+.||++
T Consensus 32 ~~~~advek----hlElGk~lla~~Q~sDALt~yHaAve--~dp~~Y~aifrR 78 (504)
T KOG0624|consen 32 TASPADVEK----HLELGKELLARGQLSDALTHYHAAVE--GDPNNYQAIFRR 78 (504)
T ss_pred cCCHHHHHH----HHHHHHHHHHhhhHHHHHHHHHHHHc--CCchhHHHHHHH
Confidence 346667765 78999999999999999999999865 678888888876
No 51
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=56.93 E-value=14 Score=20.82 Aligned_cols=21 Identities=38% Similarity=0.547 Sum_probs=17.5
Q ss_pred HHHhHhHHhhcCChhhHHHHH
Q psy14697 116 EIQLGETLLAAGDLDNGVEHL 136 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf 136 (175)
.+.+|+.+..+|+.++|..++
T Consensus 4 ~~~la~~~~~~G~~~eA~~~l 24 (26)
T PF07721_consen 4 RLALARALLAQGDPDEAERLL 24 (26)
T ss_pred HHHHHHHHHHcCCHHHHHHHH
Confidence 357899999999999988765
No 52
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=56.23 E-value=16 Score=25.18 Aligned_cols=30 Identities=17% Similarity=0.271 Sum_probs=27.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
..+|..+...|++++|..+|-.++..+|+.
T Consensus 43 ~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~ 72 (119)
T TIGR02795 43 YWLGEAYYAQGKYADAAKAFLAVVKKYPKS 72 (119)
T ss_pred HHHHHHHHhhccHHHHHHHHHHHHHHCCCC
Confidence 558999999999999999999999999884
No 53
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=55.04 E-value=29 Score=30.02 Aligned_cols=56 Identities=18% Similarity=0.320 Sum_probs=39.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC----hHHHHH---HHhh-c---CCHHHHHHHHHhcCCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ----PNQLLG---VLQQ-T---LPPNVFNALIEKLPPA 172 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq----P~~LL~---IyQq-T---LPp~Vf~~Li~kLp~~ 172 (175)
..+|+.+...|++++|+.+|.+++.-||+ |.-|+. +|++ - --..+|+-+++..|..
T Consensus 184 y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s 250 (263)
T PRK10803 184 YWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGT 250 (263)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCC
Confidence 57899999999999999999999999997 555543 3321 1 1124566677776653
No 54
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=54.42 E-value=15 Score=23.75 Aligned_cols=28 Identities=25% Similarity=0.458 Sum_probs=24.1
Q ss_pred HHhHhHHhhcC-ChhhHHHHHHHHHHhcC
Q psy14697 117 IQLGETLLAAG-DLDNGVEHLANALTVCG 144 (175)
Q Consensus 117 V~lGE~L~a~G-~~~eaa~hf~nAl~Vcp 144 (175)
..+|..+...| ++++|..+|-+||++-|
T Consensus 41 ~~~g~~~~~~~~~~~~A~~~~~~al~l~P 69 (69)
T PF13414_consen 41 YNLGLAYMKLGKDYEEAIEDFEKALKLDP 69 (69)
T ss_dssp HHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence 56788888899 69999999999998765
No 55
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=53.87 E-value=29 Score=24.98 Aligned_cols=38 Identities=24% Similarity=0.476 Sum_probs=31.2
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQ 154 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQ 154 (175)
..+|..+...|++++|...|-.++.+.|+........+
T Consensus 89 ~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~ 126 (135)
T TIGR02552 89 FHAAECLLALGEPESALKALDLAIEICGENPEYSELKE 126 (135)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHhccccchHHHHHH
Confidence 44777888889999999999999999998877655544
No 56
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=53.87 E-value=32 Score=28.28 Aligned_cols=36 Identities=19% Similarity=0.365 Sum_probs=31.0
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCG 144 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcp 144 (175)
+|+||+..+ .+|+-++.-|+.++|...|..||..|.
T Consensus 99 ~~ddp~~~~-------~ag~c~L~lG~~~~A~~aF~~Ai~~~~ 134 (157)
T PRK15363 99 KIDAPQAPW-------AAAECYLACDNVCYAIKALKAVVRICG 134 (157)
T ss_pred CCCCchHHH-------HHHHHHHHcCCHHHHHHHHHHHHHHhc
Confidence 467777664 489999999999999999999999994
No 57
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=53.47 E-value=17 Score=24.18 Aligned_cols=25 Identities=20% Similarity=0.364 Sum_probs=17.6
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHh
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTV 142 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~V 142 (175)
.+|..+...|++++|..+|-+|+.+
T Consensus 10 ~la~~~~~~~~~~~A~~~~~~al~~ 34 (78)
T PF13424_consen 10 NLARVYRELGRYDEALDYYEKALDI 34 (78)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 4677777777777788877777765
No 58
>PF11118 DUF2627: Protein of unknown function (DUF2627); InterPro: IPR020138 This entry represents uncharacterised membrane proteins with no known function.
Probab=51.70 E-value=17 Score=26.94 Aligned_cols=36 Identities=19% Similarity=0.205 Sum_probs=21.9
Q ss_pred hhhhHHHHHHHHHHHHHh-hhhheeccCCCC-ChhhHH
Q psy14697 44 ISGNKVVTAVAVGAAVVV-GYCLYFDKKRRS-DPLFKE 79 (175)
Q Consensus 44 ~~~~~~~~~agvaga~fl-GYcIYFD~KRR~-dP~FRK 79 (175)
|..-++++..-++|.+|+ ||-.|=|+||.. .|-|||
T Consensus 40 wlqfl~G~~lf~~G~~Fi~GfI~~RDRKrnkV~pRF~k 77 (77)
T PF11118_consen 40 WLQFLAGLLLFAIGVGFIAGFILHRDRKRNKVQPRFRK 77 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHhHhheeeccccccchhhcC
Confidence 333445555555555444 688999999964 455553
No 59
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=51.41 E-value=15 Score=32.63 Aligned_cols=54 Identities=22% Similarity=0.355 Sum_probs=38.2
Q ss_pred hHhHHhhcCChhhHHHHHHHHHHhcCCh----HHHHHHH-------hhcCCHHHHHHHHHhcCCC
Q psy14697 119 LGETLLAAGDLDNGVEHLANALTVCGQP----NQLLGVL-------QQTLPPNVFNALIEKLPPA 172 (175)
Q Consensus 119 lGE~L~a~G~~~eaa~hf~nAl~VcpqP----~~LL~Iy-------QqTLPp~Vf~~Li~kLp~~ 172 (175)
|||.+.++|++++|+.-|.++++=||+- .-||++= |+..--.+|+-|+.+.|.+
T Consensus 184 LGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~~l~~~d~A~atl~qv~k~YP~t 248 (262)
T COG1729 184 LGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATLQQVIKRYPGT 248 (262)
T ss_pred HHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHHHhcCHHHHHHHHHHHHHHCCCC
Confidence 8999999999999999999999999974 4444321 1223334555666666543
No 60
>PF06522 B12D: NADH-ubiquinone reductase complex 1 MLRQ subunit; InterPro: IPR010530 The MLRQ subunit of mitochondrial NADH-ubiquinone reductase complex I is nuclear [] and is found in plants [], insects, fungi and higher metazoans []. It appears to act within the membrane and, in mammals, is highly expressed in muscle and neural tissue, indicative of a role in ATP generation [].
Probab=50.77 E-value=17 Score=25.82 Aligned_cols=16 Identities=38% Similarity=0.480 Sum_probs=9.0
Q ss_pred heeccCCCCChhhHHHhH
Q psy14697 65 LYFDKKRRSDPLFKEKLK 82 (175)
Q Consensus 65 IYFD~KRR~dP~FRKkLr 82 (175)
|.+|+++|.+ --.+++
T Consensus 34 V~~~k~~~~~--pw~~~~ 49 (73)
T PF06522_consen 34 VRWNKKNRPE--PWEKYK 49 (73)
T ss_pred eEEEecCCcC--hhhhcC
Confidence 3578888743 334444
No 61
>PRK12370 invasion protein regulator; Provisional
Probab=50.28 E-value=37 Score=31.81 Aligned_cols=29 Identities=21% Similarity=0.366 Sum_probs=15.8
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
.+|..+...|++++|..+|-+|+..-|.+
T Consensus 377 ~lg~~l~~~G~~~eAi~~~~~Al~l~P~~ 405 (553)
T PRK12370 377 YYGWNLFMAGQLEEALQTINECLKLDPTR 405 (553)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHhcCCCC
Confidence 34555555555555555555555555553
No 62
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=49.82 E-value=31 Score=26.50 Aligned_cols=29 Identities=14% Similarity=0.239 Sum_probs=17.5
Q ss_pred hHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697 119 LGETLLAAGDLDNGVEHLANALTVCGQPN 147 (175)
Q Consensus 119 lGE~L~a~G~~~eaa~hf~nAl~VcpqP~ 147 (175)
+|-.+...|++++|..+|.+|+..-++|.
T Consensus 41 ~g~~~~~~g~~~~A~~~~~~al~l~~~~~ 69 (168)
T CHL00033 41 DGMSAQSEGEYAEALQNYYEAMRLEIDPY 69 (168)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHhccccch
Confidence 45555556666666666666666655543
No 63
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=49.70 E-value=10 Score=32.47 Aligned_cols=19 Identities=21% Similarity=-0.029 Sum_probs=15.5
Q ss_pred HHHHHhhhhheeccCCCCC
Q psy14697 56 GAAVVVGYCLYFDKKRRSD 74 (175)
Q Consensus 56 aga~flGYcIYFD~KRR~d 74 (175)
+..++++|.+||=++||+-
T Consensus 112 lLla~~~~~~Y~~~~Rrs~ 130 (202)
T PF06365_consen 112 LLLAILLGAGYCCHQRRSW 130 (202)
T ss_pred HHHHHHHHHHHHhhhhccC
Confidence 5567788999999999984
No 64
>PF00731 AIRC: AIR carboxylase; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain []. The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme [].; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process; PDB: 3TRH_O 2YWX_A 2NSL_A 1D7A_A 2NSJ_A 1QCZ_A 2ATE_A 2NSH_A 3RG8_C 3RGG_D ....
Probab=48.56 E-value=7.8 Score=31.55 Aligned_cols=20 Identities=40% Similarity=0.594 Sum_probs=15.7
Q ss_pred CCCChhhHHHhHHHHHHhHH
Q psy14697 71 RRSDPLFKEKLKERRRKNRE 90 (175)
Q Consensus 71 RR~dP~FRKkLrerRrk~~~ 90 (175)
+.+||+.++||++.|+++++
T Consensus 129 a~~d~~l~~kl~~~~~~~~~ 148 (150)
T PF00731_consen 129 ALKDPELREKLRAYREKMKE 148 (150)
T ss_dssp HTT-HHHHHHHHHHHHHHHH
T ss_pred hcCCHHHHHHHHHHHHHHHc
Confidence 46899999999998877653
No 65
>PF05942 PaREP1: Archaeal PaREP1/PaREP8 family; InterPro: IPR010268 This family consists of several archaeal PaREP1 proteins, the function of the family is unknown.; PDB: 2Q00_A 2JPU_A.
Probab=48.39 E-value=26 Score=26.39 Aligned_cols=28 Identities=29% Similarity=0.437 Sum_probs=22.8
Q ss_pred HHHHHhHhHHhhcCChhhHHHHHHHHHH
Q psy14697 114 LQEIQLGETLLAAGDLDNGVEHLANALT 141 (175)
Q Consensus 114 l~eV~lGE~L~a~G~~~eaa~hf~nAl~ 141 (175)
..++.+++.++..|+..+|+++.++|..
T Consensus 7 ~~~l~~A~e~L~~G~~rqAaeK~~~A~~ 34 (115)
T PF05942_consen 7 EKELEEADELLEKGDLRQAAEKAFKAVE 34 (115)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 4567889999999999999998888776
No 66
>PF08066 PMC2NT: PMC2NT (NUC016) domain; InterPro: IPR012588 Exosomes are nano-compartments that function in the degradation or processing of RNA (including mRNA, rRNA, snRNA and snoRNA) [, ]. Exosomes occur in both archaea and eukaryotes, and have a similar overall structure to each other and to bacterial/organelle PNPases (polynucleotide phosphorylases; 2.7.7.8 from EC) [], consisting of a barrel structure composed of a hexameric ring of PH domains that act as a degradation chamber, and an S1-domain/KH-domain containing cap that binds the RNA substrate (and sometimes accessory proteins) in order to regulate and restrict entry into the degradation chamber []. There are two types of exosomes in eukaryotes, cytoplasmic exosomes that are responsible for 3'-5' exoribonuclease degradation of mRNAs, and nuclear exosomes that degrade pre-mRNAs (such as nonsense transcripts) and degrade rRNAs, snRNAs and snoRNAs. Unstructured RNA substrates feed in through the pore made by the S1 domains, are degraded by the PH domain ring, and exit as nucleotides via the PH pore at the opposite end of the barrel [, ]. There are several accessory proteins that help degrade, unwind or polyadenylate RNA substrate before they enter the exosome. This entry represents the N-terminal domain of Rrp6 (exosome component 10 in humans), a nuclear exosome accessory factor that interacts with the bottom of the hexameric PH-ring opposite the cap. Rrp6 functions as a hydrolytic exonuclease, and is homologous to RNase-D in Escherichia coli. More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; GO: 0006396 RNA processing, 0000176 nuclear exosome (RNase complex)
Probab=48.32 E-value=34 Score=24.97 Aligned_cols=58 Identities=26% Similarity=0.381 Sum_probs=34.2
Q ss_pred eccCCCCChhhHHHhHHHHHHhHHH-----HHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHh
Q psy14697 67 FDKKRRSDPLFKEKLKERRRKNREL-----LQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLL 124 (175)
Q Consensus 67 FD~KRR~dP~FRKkLrerRrk~~~~-----~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~ 124 (175)
|||-|--||+|++.+.+..++-... +........+.+.|.+.++..|-.=+..=+.|+
T Consensus 16 ~~Fy~s~dp~f~~~ld~~s~rll~l~n~ll~~~~~~~~~~~~~~~~d~d~~~~~vvd~~D~Ll 78 (91)
T PF08066_consen 16 YDFYRSFDPEFAESLDEQSQRLLSLINSLLKSAGSKSNISSPDDVDDVDERWDSVVDVNDSLL 78 (91)
T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCCccccHHHHHHHHHHHHHHHH
Confidence 5677888999999999876554322 112233344455566777776644333333333
No 67
>KOG4555|consensus
Probab=47.36 E-value=19 Score=30.13 Aligned_cols=29 Identities=31% Similarity=0.455 Sum_probs=22.8
Q ss_pred HHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 122 TLLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 122 ~L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
.|.-.|+.++|.+.|.+||.|||+-..-.
T Consensus 52 alaE~g~Ld~AlE~F~qal~l~P~raSay 80 (175)
T KOG4555|consen 52 ALAEAGDLDGALELFGQALCLAPERASAY 80 (175)
T ss_pred HHHhccchHHHHHHHHHHHHhcccchHhh
Confidence 34446899999999999999998755443
No 68
>KOG0553|consensus
Probab=46.86 E-value=27 Score=31.80 Aligned_cols=37 Identities=24% Similarity=0.303 Sum_probs=33.7
Q ss_pred HHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697 111 QYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPN 147 (175)
Q Consensus 111 ~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~ 147 (175)
.|+-.-..+|=.++++|++++|++.|-|||-+.|.=.
T Consensus 147 ~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP~Ne 183 (304)
T KOG0553|consen 147 HYSKAYGRLGLAYLALGKYEEAIEAYKKALELDPDNE 183 (304)
T ss_pred HHHHHHHHHHHHHHccCcHHHHHHHHHhhhccCCCcH
Confidence 6777778999999999999999999999999999865
No 69
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=46.82 E-value=42 Score=33.51 Aligned_cols=57 Identities=14% Similarity=0.242 Sum_probs=37.5
Q ss_pred ChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH---hhcCCHHHHH
Q psy14697 105 DHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL---QQTLPPNVFN 163 (175)
Q Consensus 105 d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy---QqTLPp~Vf~ 163 (175)
+.+-.++|+..|.++||.|+..|...+++ |.+||.-+..-.+-|+-| +..+.++..+
T Consensus 639 ~~~~~~~~~~~~~~lGe~L~~~g~i~~~~--l~~aL~~~~~~~~~lG~~Lv~~g~it~~~l~ 698 (727)
T PRK11234 639 AGEIWRQYVPHQFLFAEILTTLGHIPRSA--INVLLLRHERSSLPLGKFLVTEGVISQETLD 698 (727)
T ss_pred chhhhhhhhhhhhhHHHHHHHcCCCCHHH--HHHHHHHhcccCCcHHHHHHHcCCCCHHHHH
Confidence 33456899999999999999999876443 567777665544444433 2344444433
No 70
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=45.92 E-value=27 Score=32.76 Aligned_cols=37 Identities=19% Similarity=0.363 Sum_probs=31.6
Q ss_pred HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHH
Q psy14697 116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGV 152 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~I 152 (175)
.+.+|+.+...|++++|+..|-+|+.+.++-.++++.
T Consensus 545 ~~~la~~~~~~g~~~eAi~~~e~A~~l~~~~~e~~~a 581 (615)
T TIGR00990 545 VATMAQLLLQQGDVDEALKLFERAAELARTEGELVQA 581 (615)
T ss_pred HHHHHHHHHHccCHHHHHHHHHHHHHHhccHHHHHHH
Confidence 3578999999999999999999999999887776443
No 71
>PRK09720 cybC cytochrome b562; Provisional
Probab=45.03 E-value=88 Score=24.17 Aligned_cols=60 Identities=12% Similarity=0.179 Sum_probs=36.8
Q ss_pred CChhhHHHhHHHHHHhHHHHHHhhcCCC-CCCCChHHHH------HHHHHHHHhHhHHhhcCChhhHH
Q psy14697 73 SDPLFKEKLKERRRKNRELLQQKNKRGI-PDLKDHSAVQ------QYFLQEIQLGETLLAAGDLDNGV 133 (175)
Q Consensus 73 ~dP~FRKkLrerRrk~~~~~~~a~~~~~-p~~~d~e~~E------~fFl~eV~lGE~L~a~G~~~eaa 133 (175)
+..+|++.|-+=|.....+++. -.+++ --+.|.+++. .-+..||.....|..+|+++||-
T Consensus 16 ~a~~~k~al~~mr~aA~~Ak~~-~P~kL~~~~~ds~e~K~y~~Gld~lI~qID~A~~La~~GkL~eAK 82 (100)
T PRK09720 16 NAAQVKDALTKMRAAALDAQKA-TPPKLEDKSPDSPEMKDFRHGFDILVGQIDDALKLANEGKVKEAQ 82 (100)
T ss_pred CHHHHHHHHHHHHHHHHHHhhc-CChhhccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHH
Confidence 4457888887766554443321 11111 0123444453 45899999999999999988773
No 72
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=44.82 E-value=32 Score=25.64 Aligned_cols=32 Identities=25% Similarity=0.356 Sum_probs=17.1
Q ss_pred HHHHHHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697 113 FLQEIQLGETLLAAGDLDNGVEHLANALTVCG 144 (175)
Q Consensus 113 Fl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcp 144 (175)
|.+.+..|+.....|+.+.+..++.+|+..|.
T Consensus 6 F~~~~~~a~~~~~~~~~~~~~~~~~~al~ly~ 37 (146)
T PF03704_consen 6 FEALVREARAAARAGDPEEAIELLEEALALYR 37 (146)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHhC
Confidence 44455556665555666666666666666654
No 73
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=43.68 E-value=1.9e+02 Score=24.39 Aligned_cols=42 Identities=10% Similarity=-0.011 Sum_probs=33.7
Q ss_pred CCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 103 LKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 103 ~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
|.++...+. +..+|..+...+++++|+.+|-+.|..+|+-.+
T Consensus 63 P~s~~a~~a----~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~ 104 (243)
T PRK10866 63 PFGPYSQQV----QLDLIYAYYKNADLPLAQAAIDRFIRLNPTHPN 104 (243)
T ss_pred CCChHHHHH----HHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCc
Confidence 344444443 678999999999999999999999999998644
No 74
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=43.23 E-value=37 Score=23.08 Aligned_cols=27 Identities=22% Similarity=0.262 Sum_probs=20.9
Q ss_pred HHHHHHhHhHHhhcCChhhHHHHHHHH
Q psy14697 113 FLQEIQLGETLLAAGDLDNGVEHLANA 139 (175)
Q Consensus 113 Fl~eV~lGE~L~a~G~~~eaa~hf~nA 139 (175)
+.....+|..+...|++++|+.||-+|
T Consensus 58 ~~~~~l~a~~~~~l~~y~eAi~~l~~~ 84 (84)
T PF12895_consen 58 PDIHYLLARCLLKLGKYEEAIKALEKA 84 (84)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHhcC
Confidence 444456799999999999999999876
No 75
>PLN02539 glucose-6-phosphate 1-dehydrogenase
Probab=41.67 E-value=73 Score=30.59 Aligned_cols=115 Identities=16% Similarity=0.340 Sum_probs=61.7
Q ss_pred ccCCCCccchhhhhhhhHHHHHHHHHHHHHh---hh-hheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCCh
Q psy14697 31 IGPSGRLMSPMELISGNKVVTAVAVGAAVVV---GY-CLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDH 106 (175)
Q Consensus 31 ~~~~~~~~~~~~m~~~~~~~~~agvaga~fl---GY-cIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~ 106 (175)
.|.||+|- .|.++-+.........+ .. .|-+.+..-+|.+||+++++.-+... + .+.
T Consensus 23 FGAtGDLa------~RKL~PaL~~L~~~~~lpp~~~~IiG~aR~~~s~e~fr~~v~~~l~~~~---------~----~~~ 83 (491)
T PLN02539 23 LGASGDLA------KKKTFPALFNLYRQGFLPPDEVHIFGYARSKITDEELRDRIRGYLKDEK---------N----APA 83 (491)
T ss_pred eCCccHHH------HhhHHHHHHHHHHcCCCCCCCcEEEEEECCCCCHHHHHHHHHHHHHhhc---------c----ccH
Confidence 47899987 23333333333333344 22 34578888899999999987543211 0 122
Q ss_pred HHHHHHHHHHHHhHhHHhhcCChh--hHHHHHHHHHHhcC-------ChHHHHHHHhhcCCHHHHHHHHHhcCCC
Q psy14697 107 SAVQQYFLQEIQLGETLLAAGDLD--NGVEHLANALTVCG-------QPNQLLGVLQQTLPPNVFNALIEKLPPA 172 (175)
Q Consensus 107 e~~E~fFl~eV~lGE~L~a~G~~~--eaa~hf~nAl~Vcp-------qP~~LL~IyQqTLPp~Vf~~Li~kLp~~ 172 (175)
+..++| ++.+. -.+|+++ +.-.++.++|.=.. .+.. .||==.+||.+|..+++.|...
T Consensus 84 ~~~~~F-~~~~~-----Y~~~d~~~~e~y~~L~~~l~~~~~~~~~~~~~~~--rifYLA~PP~~f~~i~~~L~~~ 150 (491)
T PLN02539 84 EAVSKF-LQLIK-----YVSGAYDSEEGFRRLDKEISEHEISKNSAEGSSR--RLFYLALPPSVYPPVCKMIKKC 150 (491)
T ss_pred HHHHHH-HhhCe-----EEecCCCChHHHHHHHHHHHHHhhhccccCCCCc--eEEEEecChHHHHHHHHHHHHh
Confidence 334443 33221 1245543 44556666664110 0011 1344479999999998877544
No 76
>PRK09400 secE preprotein translocase subunit SecE; Reviewed
Probab=41.21 E-value=30 Score=24.28 Aligned_cols=23 Identities=17% Similarity=0.268 Sum_probs=17.3
Q ss_pred hhhhHHHHHHHHHHHHHhhhhhe
Q psy14697 44 ISGNKVVTAVAVGAAVVVGYCLY 66 (175)
Q Consensus 44 ~~~~~~~~~agvaga~flGYcIY 66 (175)
..+.+..+++|.+..+++||.||
T Consensus 32 f~~ia~~~~iG~~i~G~iGf~Ik 54 (61)
T PRK09400 32 FLLVAKVTGLGILLIGLIGFIIY 54 (61)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455667778888899999987
No 77
>KOG0543|consensus
Probab=40.96 E-value=82 Score=29.72 Aligned_cols=41 Identities=24% Similarity=0.343 Sum_probs=34.7
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL 149 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L 149 (175)
+|++..++ -..|+.++..|+++.|..-|-+|+++.|+-.+.
T Consensus 287 ~~~N~KAL-------yRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~ 327 (397)
T KOG0543|consen 287 DPNNVKAL-------YRRGQALLALGEYDLARDDFQKALKLEPSNKAA 327 (397)
T ss_pred CCCchhHH-------HHHHHHHHhhccHHHHHHHHHHHHHhCCCcHHH
Confidence 56776666 357999999999999999999999999987543
No 78
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=40.79 E-value=51 Score=17.64 Aligned_cols=27 Identities=19% Similarity=0.196 Sum_probs=19.7
Q ss_pred CChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697 127 GDLDNGVEHLANALTVCGQPNQLLGVL 153 (175)
Q Consensus 127 G~~~eaa~hf~nAl~VcpqP~~LL~Iy 153 (175)
|+.+.+..-|.+|+..||+..++-.-|
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y 27 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVELWLKY 27 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChHHHHHH
Confidence 456677788899999998776665443
No 79
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=40.23 E-value=53 Score=25.14 Aligned_cols=30 Identities=17% Similarity=0.251 Sum_probs=19.1
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
..+|..+...|++++|..+|-+|+..-|+-
T Consensus 62 ~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~ 91 (144)
T PRK15359 62 IALAGTWMMLKEYTTAINFYGHALMLDASH 91 (144)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcCCCC
Confidence 446666666666666666666666665544
No 80
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=39.86 E-value=49 Score=29.44 Aligned_cols=33 Identities=9% Similarity=0.145 Sum_probs=25.9
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQL 149 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L 149 (175)
..+|..+...|++++|+..+-+||.+.|+-.+.
T Consensus 40 ~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a 72 (356)
T PLN03088 40 ADRAQANIKLGNFTEAVADANKAIELDPSLAKA 72 (356)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHH
Confidence 567888888888888888888998888864443
No 81
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=39.57 E-value=16 Score=30.35 Aligned_cols=32 Identities=22% Similarity=0.125 Sum_probs=18.4
Q ss_pred hhhhhhhhHHHHHHHHHHHHHhhhhheeccCC
Q psy14697 40 PMELISGNKVVTAVAVGAAVVVGYCLYFDKKR 71 (175)
Q Consensus 40 ~~~m~~~~~~~~~agvaga~flGYcIYFD~KR 71 (175)
-+++.++++.+|+.+++.|++-+-.=||=||+
T Consensus 109 ~~~~~~g~IaGIvsav~valvGAvsSyiaYqk 140 (169)
T PF12301_consen 109 DGEAEAGTIAGIVSAVVVALVGAVSSYIAYQK 140 (169)
T ss_pred ccCcccchhhhHHHHHHHHHHHHHHHHHHHHh
Confidence 34556777777776666654444444555543
No 82
>TIGR00327 secE_euk_arch protein translocase SEC61 complex gamma subunit, archaeal and eukaryotic. This model describes archaeal SEC61-like and eukaryotic SEC61 but not bacterial secE proteins, for which a Pfam pfam00584 (SecE) has been created.
Probab=39.42 E-value=33 Score=24.19 Aligned_cols=24 Identities=17% Similarity=0.246 Sum_probs=17.8
Q ss_pred hhhhHHHHHHHHHHHHHhhhhhee
Q psy14697 44 ISGNKVVTAVAVGAAVVVGYCLYF 67 (175)
Q Consensus 44 ~~~~~~~~~agvaga~flGYcIYF 67 (175)
..+.+..+++|.+..+++||.||.
T Consensus 28 f~~iak~t~iG~~i~G~IGf~Ikl 51 (61)
T TIGR00327 28 YLKVAKVTGIGIIIVGIIGYIIKI 51 (61)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345556677788888999999873
No 83
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=39.26 E-value=25 Score=26.56 Aligned_cols=23 Identities=39% Similarity=0.564 Sum_probs=21.3
Q ss_pred HhHhHHhhcCChhhHHHHHHHHH
Q psy14697 118 QLGETLLAAGDLDNGVEHLANAL 140 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl 140 (175)
-+|+-+.++|+.++|..+|-+||
T Consensus 123 ~~Gdi~~~~g~~~~A~~~y~~Al 145 (145)
T PF09976_consen 123 LLGDIYLAQGDYDEARAAYQKAL 145 (145)
T ss_pred HHHHHHHHCCCHHHHHHHHHHhC
Confidence 59999999999999999998886
No 84
>KOG4516|consensus
Probab=39.17 E-value=9.4 Score=30.25 Aligned_cols=51 Identities=35% Similarity=0.463 Sum_probs=27.1
Q ss_pred ccccccccccccccccccCCCCccchhhhhhhhHHHHHHHHHHHHHhhhhheeccCC
Q psy14697 15 CWFGDWCGYFGLWSGRIGPSGRLMSPMELISGNKVVTAVAVGAAVVVGYCLYFDKKR 71 (175)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~~~agvaga~flGYcIYFD~KR 71 (175)
.|----||..|+-|+-+-.-|+=---..-+.+.++.+++++++ || |||.||
T Consensus 27 ~~~~~a~g~~g~~s~~~~N~~~rkP~~~gi~~~ll~i~a~~~A----Gy--y~~~~r 77 (118)
T KOG4516|consen 27 IWNPIACGVAGVGSAIFINWGFRKPVFSGIQKHLLFIAAGVGA----GY--YFDQKR 77 (118)
T ss_pred CcchhhHhhhHHHHHHHHhhhhcCchHHHHHHHHHHHHHHHhh----hH--HHHHHH
Confidence 4555558888876664432222111111244666666666654 66 677665
No 85
>PF03605 DcuA_DcuB: Anaerobic c4-dicarboxylate membrane transporter; InterPro: IPR004668 These proteins are members of the C4-Dicarboxylate Uptake (Dcu) family. Most proteins in this family are predicted to have 12 GES predicted transmembrane regions; however the one member whose membrane topology has been experimentally determined has 10 transmembrane regions, with both the N- and C-termini localized to the periplasm []. The DcuA and DcuB proteins are involved in the transport of aspartate, malate, fumarate and succinate in many species [, , ], and are thought to function as antiporters with any two of these substrates. Since DcuA is encoded in an operon with the gene for aspartase, and DcuB is encoded in an operon with the gene for fumarase, their physiological functions may be to catalyze aspartate:fumarate and fumarate:malate exchange during the anaerobic utilization of aspartate and fumarate, respectively []. The Escherichia coli DcuA and DcuB proteins have very different expression patterns []. DcuA is constitutively expressed; DcuB is strongly induced anaerobically by FNR and C4-dicarboxylates, while it is repressed by nitrate and subject to CRP-mediated catabolite repression.; GO: 0015556 C4-dicarboxylate transmembrane transporter activity, 0015740 C4-dicarboxylate transport, 0016021 integral to membrane
Probab=39.05 E-value=41 Score=31.32 Aligned_cols=22 Identities=23% Similarity=0.320 Sum_probs=17.8
Q ss_pred hheeccCCCCChhhHHHhHHHH
Q psy14697 64 CLYFDKKRRSDPLFKEKLKERR 85 (175)
Q Consensus 64 cIYFD~KRR~dP~FRKkLrerR 85 (175)
+-..+...-.||+|++||++-.
T Consensus 184 ~~~~GkeL~~Dp~yq~rl~~g~ 205 (364)
T PF03605_consen 184 SSRRGKELDDDPEYQERLADGL 205 (364)
T ss_pred HHhcCCccccCHHHHHHHhccc
Confidence 4467888899999999998643
No 86
>COG5343 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.87 E-value=32 Score=30.31 Aligned_cols=24 Identities=29% Similarity=0.443 Sum_probs=19.1
Q ss_pred HHHHHHHHHhhhhheeccCCCCCh
Q psy14697 52 AVAVGAAVVVGYCLYFDKKRRSDP 75 (175)
Q Consensus 52 ~agvaga~flGYcIYFD~KRR~dP 75 (175)
+.++++++.++|++-|+.+|+..|
T Consensus 108 ataavAvlal~~a~~~s~~~~~~P 131 (240)
T COG5343 108 ATAAVAVLALALALTFSLRRGTTP 131 (240)
T ss_pred HHHHHHHHHHHHccccccccCCCc
Confidence 333344578899999999999999
No 87
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=38.52 E-value=67 Score=30.99 Aligned_cols=33 Identities=12% Similarity=0.097 Sum_probs=27.2
Q ss_pred HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
...+|+.+...|++++|+.+|-+|+...|+..+
T Consensus 113 ~~~la~~l~~~g~~~~Ai~~l~~Al~l~P~~~~ 145 (656)
T PRK15174 113 VLLVASVLLKSKQYATVADLAEQAWLAFSGNSQ 145 (656)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCCCcHH
Confidence 356788889999999999999999998777544
No 88
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=37.85 E-value=65 Score=24.95 Aligned_cols=27 Identities=7% Similarity=0.051 Sum_probs=12.5
Q ss_pred HhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 120 GETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 120 GE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
|..+...|+...+..++..|+..+.+-
T Consensus 113 g~~~~~~g~~~~a~~~~~~A~~~~~~A 139 (172)
T PRK02603 113 AVIYHKRGEKAEEAGDQDEAEALFDKA 139 (172)
T ss_pred HHHHHHcCChHhHhhCHHHHHHHHHHH
Confidence 444444444444444455555544433
No 89
>PTZ00459 mucin-associated surface protein (MASP); Provisional
Probab=37.60 E-value=19 Score=31.87 Aligned_cols=10 Identities=10% Similarity=0.291 Sum_probs=6.7
Q ss_pred hhhhhhhhHH
Q psy14697 40 PMELISGNKV 49 (175)
Q Consensus 40 ~~~m~~~~~~ 49 (175)
|+|||++|.|
T Consensus 1 MaMmMTGRVL 10 (291)
T PTZ00459 1 MAMMMTGRVL 10 (291)
T ss_pred CccchhchHH
Confidence 4566787766
No 90
>TIGR01495 ETRAMP Plasmodium ring stage membrane protein ETRAMP. These genes have been shown to be found in the sub-telomeric regions of both P. falciparum and P. yoelii chromosomes.
Probab=37.56 E-value=36 Score=25.33 Aligned_cols=16 Identities=38% Similarity=0.684 Sum_probs=9.0
Q ss_pred HHhhhhheeccCCCCC
Q psy14697 59 VVVGYCLYFDKKRRSD 74 (175)
Q Consensus 59 ~flGYcIYFD~KRR~d 74 (175)
..+|.-+|+-.|.+.|
T Consensus 67 t~~G~g~y~~~kk~~~ 82 (85)
T TIGR01495 67 AGVGLGYYYKKKNKDD 82 (85)
T ss_pred HHHHHhhhhhcccccc
Confidence 4556666765555443
No 91
>KOG1174|consensus
Probab=37.15 E-value=54 Score=31.87 Aligned_cols=74 Identities=18% Similarity=0.187 Sum_probs=47.2
Q ss_pred HHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChhhHHH
Q psy14697 55 VGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVE 134 (175)
Q Consensus 55 vaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~ 134 (175)
++.--||.-|+-||.|| |++-|-=-.|. -.-++.+ .|.+ |-.|-.|++.|.+++|+.
T Consensus 299 ta~~wfV~~~~l~~~K~-----~~rAL~~~eK~-----------I~~~~r~---~~al----ilKG~lL~~~~R~~~A~I 355 (564)
T KOG1174|consen 299 TASHWFVHAQLLYDEKK-----FERALNFVEKC-----------IDSEPRN---HEAL----ILKGRLLIALERHTQAVI 355 (564)
T ss_pred chhhhhhhhhhhhhhhh-----HHHHHHHHHHH-----------hccCccc---chHH----HhccHHHHhccchHHHHH
Confidence 44456888899999875 44444311111 1112333 3444 457999999999999999
Q ss_pred HHHHHHHhcCChHHHHHHHh
Q psy14697 135 HLANALTVCGQPNQLLGVLQ 154 (175)
Q Consensus 135 hf~nAl~VcpqP~~LL~IyQ 154 (175)
||--|+.+-|-- |.+||
T Consensus 356 aFR~Aq~Lap~r---L~~Y~ 372 (564)
T KOG1174|consen 356 AFRTAQMLAPYR---LEIYR 372 (564)
T ss_pred HHHHHHhcchhh---HHHHH
Confidence 999999886543 34554
No 92
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=36.82 E-value=63 Score=24.34 Aligned_cols=33 Identities=30% Similarity=0.467 Sum_probs=27.1
Q ss_pred HHHHHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 114 LQEIQLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 114 l~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
+-.+.+++.+...|++++|...|-.++.-.+.|
T Consensus 49 ~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~ 81 (145)
T PF09976_consen 49 LAALQLAKAAYEQGDYDEAKAALEKALANAPDP 81 (145)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCH
Confidence 345678888999999999999999988877665
No 93
>PF15147 DUF4578: Domain of unknown function (DUF4578)
Probab=36.59 E-value=13 Score=29.68 Aligned_cols=11 Identities=64% Similarity=1.534 Sum_probs=7.9
Q ss_pred CCCcccccCcc
Q psy14697 4 CGSSWVLPSSF 14 (175)
Q Consensus 4 ~~~~~~~~~~~ 14 (175)
||-||--||.|
T Consensus 7 CggSWScPstf 17 (127)
T PF15147_consen 7 CGGSWSCPSTF 17 (127)
T ss_pred cCCCcCCchhH
Confidence 77777777765
No 94
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=36.58 E-value=80 Score=23.42 Aligned_cols=29 Identities=21% Similarity=0.269 Sum_probs=19.9
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
.+|..+...|++++|...|-+|+...|+.
T Consensus 70 ~la~~~~~~~~~~~A~~~~~~al~~~~~~ 98 (234)
T TIGR02521 70 ALALYYQQLGELEKAEDSFRRALTLNPNN 98 (234)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 45666667777777777777777766654
No 95
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=35.69 E-value=70 Score=27.17 Aligned_cols=39 Identities=13% Similarity=0.082 Sum_probs=27.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ 155 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq 155 (175)
..+|+.+..+|++++|...|.+++...|+-..+...+..
T Consensus 286 ~~la~~~~~~g~~~~A~~~l~~~l~~~P~~~~~~~l~~~ 324 (389)
T PRK11788 286 LALAQLLEEQEGPEAAQALLREQLRRHPSLRGFHRLLDY 324 (389)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHhCcCHHHHHHHHHH
Confidence 445777777788888888888888887777666555543
No 96
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=35.38 E-value=80 Score=23.41 Aligned_cols=29 Identities=34% Similarity=0.398 Sum_probs=20.6
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|..+...|++++|...|-+++...|+
T Consensus 35 ~~la~~~~~~~~~~~A~~~~~~~l~~~p~ 63 (234)
T TIGR02521 35 VQLALGYLEQGDLEVAKENLDKALEHDPD 63 (234)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHhCcc
Confidence 34677777777777777777777776654
No 97
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=34.41 E-value=93 Score=22.29 Aligned_cols=29 Identities=10% Similarity=0.084 Sum_probs=17.3
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
.+|..+...|++++|+..|-.++...|..
T Consensus 56 ~la~~~~~~~~~~~A~~~~~~~~~~~p~~ 84 (135)
T TIGR02552 56 GLAACCQMLKEYEEAIDAYALAAALDPDD 84 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Confidence 45566666666666666666666665543
No 98
>PRK12489 anaerobic C4-dicarboxylate transporter; Reviewed
Probab=34.38 E-value=47 Score=31.41 Aligned_cols=22 Identities=36% Similarity=0.575 Sum_probs=19.1
Q ss_pred hhheeccCCCCChhhHHHhHHH
Q psy14697 63 YCLYFDKKRRSDPLFKEKLKER 84 (175)
Q Consensus 63 YcIYFD~KRR~dP~FRKkLrer 84 (175)
|.-.+.+|-+.||+|.+++++.
T Consensus 191 ~~~~~gk~l~~Dp~~q~r~~~~ 212 (443)
T PRK12489 191 WSLRRGKDLDKDPEFQERLKDP 212 (443)
T ss_pred HHHHhcCccccCHHHHHHhhcc
Confidence 5666899999999999999875
No 99
>KOG1125|consensus
Probab=34.06 E-value=69 Score=31.61 Aligned_cols=59 Identities=29% Similarity=0.361 Sum_probs=43.8
Q ss_pred CCCCCCChHHHHHHHHHHH-----------H--hHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcC
Q psy14697 99 GIPDLKDHSAVQQYFLQEI-----------Q--LGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTL 157 (175)
Q Consensus 99 ~~p~~~d~e~~E~fFl~eV-----------~--lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTL 157 (175)
..++..+...++++|++=+ | ||=..-..|+++.|+.||--||.|-|+=.-|-+-|=-||
T Consensus 403 s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtL 474 (579)
T KOG1125|consen 403 SFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATL 474 (579)
T ss_pred CCCCHHHHHHHHHHHHHHHHhCCCCCChhHHhhhHHHHhcchHHHHHHHHHHHHHhcCCchHHHHHHhhHHh
Confidence 3566777789999999853 3 333333468899999999999999998877766655554
No 100
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=34.02 E-value=1.2e+02 Score=24.16 Aligned_cols=29 Identities=7% Similarity=0.159 Sum_probs=26.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
..+|+.+..+|++++|...|.+++.-||+
T Consensus 170 ~~~a~~~~~~g~~~~A~~~~~~al~~~p~ 198 (235)
T TIGR03302 170 LYVARFYLKRGAYVAAINRFETVVENYPD 198 (235)
T ss_pred HHHHHHHHHcCChHHHHHHHHHHHHHCCC
Confidence 46788889999999999999999999986
No 101
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=33.44 E-value=44 Score=22.39 Aligned_cols=7 Identities=0% Similarity=0.453 Sum_probs=2.8
Q ss_pred HHHhHHH
Q psy14697 78 KEKLKER 84 (175)
Q Consensus 78 RKkLrer 84 (175)
|+++|+.
T Consensus 47 r~~~~~~ 53 (68)
T PF06305_consen 47 RRRIRRL 53 (68)
T ss_pred HHHHHHH
Confidence 3444433
No 102
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=33.27 E-value=54 Score=21.24 Aligned_cols=31 Identities=23% Similarity=0.291 Sum_probs=23.4
Q ss_pred HhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 120 GETLLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 120 GE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
.+.++.++++++|..++-+++..-|+...+.
T Consensus 2 ~~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~ 32 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLERALELDPDDPELW 32 (73)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHhCcccchhh
Confidence 3556778888899999999988877755443
No 103
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=32.87 E-value=1e+02 Score=26.03 Aligned_cols=33 Identities=9% Similarity=0.008 Sum_probs=29.0
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
..|+.+...|++++|+..|-..+..+|.+...-
T Consensus 37 ~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s~~a~ 69 (243)
T PRK10866 37 ATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQ 69 (243)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHH
Confidence 568888899999999999999999999986554
No 104
>PF11712 Vma12: Endoplasmic reticulum-based factor for assembly of V-ATPase; InterPro: IPR021013 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. The yeast vacuolar proton-translocating ATPase (V-ATPase) is the best characterised member of the V-ATPase family. A total of thirteen genes are required for encoding the subunits of the enzyme complex itself and an additional three for providing factors necessary for the assembly of the whole. Vma12 is one of these latter, all three of which are localised to the endoplasmic reticulum [].
Probab=32.73 E-value=2e+02 Score=22.48 Aligned_cols=28 Identities=29% Similarity=0.483 Sum_probs=18.9
Q ss_pred heeccC---CCCChhhHHHhHHHHHHhHHHH
Q psy14697 65 LYFDKK---RRSDPLFKEKLKERRRKNRELL 92 (175)
Q Consensus 65 IYFD~K---RR~dP~FRKkLrerRrk~~~~~ 92 (175)
|||.-+ -|.+|||++++.+=|+++++.+
T Consensus 9 v~~p~~~~~~~~s~E~~a~le~Lr~~~ee~e 39 (142)
T PF11712_consen 9 VYFPPPPPKPRPSPELKARLERLRAEQEERE 39 (142)
T ss_pred EEeCCCCCCCCCCHHHHHHHHHHHHHHHHHH
Confidence 455543 3449999999988776666544
No 105
>COG2443 Sss1 Preprotein translocase subunit Sss1 [Intracellular trafficking and secretion]
Probab=32.59 E-value=48 Score=23.83 Aligned_cols=22 Identities=18% Similarity=0.386 Sum_probs=15.8
Q ss_pred hhhHHHHHHHHHHHHHhhhhhe
Q psy14697 45 SGNKVVTAVAVGAAVVVGYCLY 66 (175)
Q Consensus 45 ~~~~~~~~agvaga~flGYcIY 66 (175)
+..+...++|....+++||.||
T Consensus 34 ~~~aKi~~~Gi~liG~IGfiI~ 55 (65)
T COG2443 34 SKIAKITGLGILLIGIIGFIIY 55 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444556677777789999997
No 106
>PRK11189 lipoprotein NlpI; Provisional
Probab=32.51 E-value=1e+02 Score=26.37 Aligned_cols=28 Identities=29% Similarity=0.298 Sum_probs=24.5
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCG 144 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcp 144 (175)
.-+|..+...|++++|+.+|-+|+...|
T Consensus 240 ~~Lg~~~~~~g~~~~A~~~~~~Al~~~~ 267 (296)
T PRK11189 240 FYLAKYYLSLGDLDEAAALFKLALANNV 267 (296)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHhCC
Confidence 4568888999999999999999998775
No 107
>PF13838 Clathrin_H_link: Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=32.22 E-value=33 Score=24.51 Aligned_cols=27 Identities=19% Similarity=0.480 Sum_probs=19.3
Q ss_pred HHHHHHHHHhHhHHhhcCChhhHHHHHHHH
Q psy14697 110 QQYFLQEIQLGETLLAAGDLDNGVEHLANA 139 (175)
Q Consensus 110 E~fFl~eV~lGE~L~a~G~~~eaa~hf~nA 139 (175)
+..|.++. ++|+++|++++||.+=+|+
T Consensus 6 d~l~~~~F---~~l~~~g~y~eAA~~AA~s 32 (66)
T PF13838_consen 6 DDLYVQQF---NELFSQGQYEEAAKVAANS 32 (66)
T ss_dssp HHHHHHHH---HHHHHTT-HHHHHHHHHHS
T ss_pred HHHHHHHH---HHHHHcCCHHHHHHHHHhC
Confidence 34455444 6789999999999987776
No 108
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=32.06 E-value=54 Score=24.61 Aligned_cols=17 Identities=18% Similarity=0.542 Sum_probs=9.9
Q ss_pred HHHHHHHHHHhhhhhee
Q psy14697 51 TAVAVGAAVVVGYCLYF 67 (175)
Q Consensus 51 ~~agvaga~flGYcIYF 67 (175)
+.++++..+++=|+|||
T Consensus 67 ~lls~v~IlVily~IyY 83 (101)
T PF06024_consen 67 SLLSFVCILVILYAIYY 83 (101)
T ss_pred HHHHHHHHHHHHhhheE
Confidence 33344444556699986
No 109
>PF08359 TetR_C_4: YsiA-like protein, C-terminal region; InterPro: IPR013570 The members of this family are thought to be TetR-type (tetracycline resistance) transcriptional regulators that bear particular similarity to YsiA (P94548 from SWISSPROT). This entry represents the C-terminal domain.; PDB: 1VI0_B.
Probab=31.82 E-value=1.8e+02 Score=21.26 Aligned_cols=73 Identities=16% Similarity=0.206 Sum_probs=41.3
Q ss_pred eeccCCCCChhhHHHhHHHHHHhHHH-----HHHhhcCCCCCCCChHHHHHHHHHHHHh-HhHHhh-cCC--hhhHHHHH
Q psy14697 66 YFDKKRRSDPLFKEKLKERRRKNREL-----LQQKNKRGIPDLKDHSAVQQYFLQEIQL-GETLLA-AGD--LDNGVEHL 136 (175)
Q Consensus 66 YFD~KRR~dP~FRKkLrerRrk~~~~-----~~~a~~~~~p~~~d~e~~E~fFl~eV~l-GE~L~a-~G~--~~eaa~hf 136 (175)
+..--|.+++++|+++++.+++..+. ++..+...+....|++..-.+|.--+.. =..+.. .++ .++-++.+
T Consensus 47 ~~~e~~~~~~~~~~~~~~~~~~~~~~i~~iI~eG~~~Gefr~d~d~~~~~~~i~G~i~~~v~~w~~~~~~~~l~~~~~~~ 126 (133)
T PF08359_consen 47 LSLELRQSNEELRKKINEIRREYLRIIEEIIEEGQEQGEFRKDLDPELAARFIFGMINHTVLDWVLNDGKFDLEDIAEEL 126 (133)
T ss_dssp HHCTTS-SSHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSB-TT--HHHHHHHHHHHHHHHHHHHHHTTT-S-CGGGHHHH
T ss_pred hHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHcCccCCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHH
Confidence 34567888999999999876655432 2223344555557888888888877763 233443 343 34445544
Q ss_pred HH
Q psy14697 137 AN 138 (175)
Q Consensus 137 ~n 138 (175)
.+
T Consensus 127 ~~ 128 (133)
T PF08359_consen 127 MD 128 (133)
T ss_dssp HH
T ss_pred HH
Confidence 43
No 110
>KOG0547|consensus
Probab=31.51 E-value=70 Score=31.57 Aligned_cols=41 Identities=15% Similarity=0.332 Sum_probs=35.7
Q ss_pred hHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCC-h
Q psy14697 106 HSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQ-P 146 (175)
Q Consensus 106 ~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq-P 146 (175)
+|++.+|=.+==..|.....+|+|++|+.....||..||. |
T Consensus 108 ~e~~~k~A~~lK~~GN~~f~~kkY~eAIkyY~~AI~l~p~ep 149 (606)
T KOG0547|consen 108 KEERLKYAAALKTKGNKFFRNKKYDEAIKYYTQAIELCPDEP 149 (606)
T ss_pred hHHHHHHHHHHHhhhhhhhhcccHHHHHHHHHHHHhcCCCCc
Confidence 3566777777778899999999999999999999999999 6
No 111
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=31.39 E-value=2.7e+02 Score=22.13 Aligned_cols=27 Identities=22% Similarity=0.211 Sum_probs=22.6
Q ss_pred hHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 119 LGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 119 lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
+...+...|+.++|......|...||+
T Consensus 150 ~a~~l~~~G~~~eA~~~~~~~~~lyP~ 176 (193)
T PF11846_consen 150 YALALALLGDPEEARQWLARARRLYPA 176 (193)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHhCCc
Confidence 466677789999999999999999993
No 112
>PF12487 DUF3703: Protein of unknown function (DUF3703) ; InterPro: IPR022172 This family of proteins is found in bacteria. Proteins in this family are typically between 113 and 135 amino acids in length.
Probab=31.31 E-value=88 Score=24.52 Aligned_cols=41 Identities=34% Similarity=0.506 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
+++..+|..|++......++|++..|-.|+-+|=.+ +||.-
T Consensus 3 ~~~~~~~~~el~~a~~a~~~~~~~~aw~hLErAHIl-gQ~~~ 43 (112)
T PF12487_consen 3 KALRPAYDAELEAAREALAAGDLQQAWRHLERAHIL-GQPYP 43 (112)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhhHHh-cCCch
Confidence 467889999999999999999999999999998544 44433
No 113
>KOG2769|consensus
Probab=31.29 E-value=1.2e+02 Score=29.70 Aligned_cols=44 Identities=16% Similarity=0.283 Sum_probs=26.4
Q ss_pred HHHhHHHHHHhHHHHH--Hh--hcCCCC----------------CCCChHHHHHHHHHHHHhHh
Q psy14697 78 KEKLKERRRKNRELLQ--QK--NKRGIP----------------DLKDHSAVQQYFLQEIQLGE 121 (175)
Q Consensus 78 RKkLrerRrk~~~~~~--~a--~~~~~p----------------~~~d~e~~E~fFl~eV~lGE 121 (175)
|||||++||+.+..++ +- ++..-| -..||..+|+-.-.||..=+
T Consensus 293 rKKLRRQ~R~ea~KEkqekIrLGL~~ppePKVKiSNLMrVLgsEAiqDPTK~E~~VR~Q~aeR~ 356 (522)
T KOG2769|consen 293 RKKLRRQRRKEARKEKQEKIRLGLEPPPEPKVKLSNLMRVLGSEAIQDPTKLEAEVRDQMAERQ 356 (522)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcchHHHHHHHhhhccccCcHHHHHHHHHHHHHHH
Confidence 7899987776654321 21 222212 13788888888877776544
No 114
>PF06703 SPC25: Microsomal signal peptidase 25 kDa subunit (SPC25); InterPro: IPR009582 This family consists of several microsomal signal peptidase 25 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains [].; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=31.07 E-value=34 Score=27.11 Aligned_cols=21 Identities=24% Similarity=0.558 Sum_probs=15.2
Q ss_pred HHHHHHHHHHhhhhheeccCC
Q psy14697 51 TAVAVGAAVVVGYCLYFDKKR 71 (175)
Q Consensus 51 ~~agvaga~flGYcIYFD~KR 71 (175)
++.|.++.++.|+|.|||||-
T Consensus 30 L~lg~~a~~iA~~a~~~d~~~ 50 (162)
T PF06703_consen 30 LALGYLAVIIAGFAFFYDYKY 50 (162)
T ss_pred HHHHHHHHHHHHHHHHhhhcC
Confidence 455556666778999999954
No 115
>PRK14748 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=31.00 E-value=54 Score=20.23 Aligned_cols=14 Identities=21% Similarity=0.475 Sum_probs=8.8
Q ss_pred HHHHHHHHhhhhhe
Q psy14697 53 VAVGAAVVVGYCLY 66 (175)
Q Consensus 53 agvaga~flGYcIY 66 (175)
.++...+.+||-||
T Consensus 8 G~ilv~lLlgYLvy 21 (29)
T PRK14748 8 GVLLVFLLLGYLVY 21 (29)
T ss_pred HHHHHHHHHHHHHH
Confidence 33444567888877
No 116
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=30.95 E-value=2.6e+02 Score=24.95 Aligned_cols=32 Identities=19% Similarity=0.253 Sum_probs=28.8
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
+.+|..++.++++++|.++|-+++..-|++..
T Consensus 332 l~lgrl~~~~~~~~~A~~~le~al~~~P~~~~ 363 (398)
T PRK10747 332 STLGQLLMKHGEWQEASLAFRAALKQRPDAYD 363 (398)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHH
Confidence 67899999999999999999999999888877
No 117
>PF08802 CytB6-F_Fe-S: Cytochrome B6-F complex Fe-S subunit ; InterPro: IPR014909 The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI), cyclic electron flow around PSI, and state transitions. The cytochrome b6-f complex has 4 large subunits, these are: cytochrome b6, subunit IV (17 kDa polypeptide, PetD), cytochrome f and the Rieske protein, while the 4 small subunits are: PetG, PetL, PetM and PetN. The complex functions as a dimer. This protein corresponds to the alpha helical transmembrane domain of the cytochrome b6-f complex Rieske iron-sulphur subunit. ; GO: 0009496 plastoquinol-plastocyanin reductase activity, 0051537 2 iron, 2 sulfur cluster binding, 0055114 oxidation-reduction process, 0042651 thylakoid membrane; PDB: 1Q90_R 1VF5_D 2E75_D 2E74_D 2E76_D 2D2C_Q 2ZT9_D.
Probab=30.78 E-value=54 Score=21.35 Aligned_cols=22 Identities=9% Similarity=0.048 Sum_probs=11.6
Q ss_pred hhHHHHHHHHHHHHHhhhhhee
Q psy14697 46 GNKVVTAVAVGAAVVVGYCLYF 67 (175)
Q Consensus 46 ~~~~~~~agvaga~flGYcIYF 67 (175)
..+.++++++++.....|.-||
T Consensus 14 ~ll~Gava~~a~~~lyP~~~ff 35 (39)
T PF08802_consen 14 LLLGGAVAVPAGGMLYPYVKFF 35 (39)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhhHHHHHHHHhhhheeEe
Confidence 3344455555555555565554
No 118
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=30.55 E-value=2.2e+02 Score=22.57 Aligned_cols=32 Identities=9% Similarity=0.201 Sum_probs=28.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
..+|+.+...|++++|...|-+++..+|+...
T Consensus 74 ~~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~ 105 (235)
T TIGR03302 74 LDLAYAYYKSGDYAEAIAAADRFIRLHPNHPD 105 (235)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHCcCCCc
Confidence 56799999999999999999999999996554
No 119
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=30.35 E-value=1.4e+02 Score=30.94 Aligned_cols=95 Identities=14% Similarity=0.114 Sum_probs=60.9
Q ss_pred ccCC-CCChhhHHHhHHHHHHhHH-HH--HHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697 68 DKKR-RSDPLFKEKLKERRRKNRE-LL--QQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVC 143 (175)
Q Consensus 68 D~KR-R~dP~FRKkLrerRrk~~~-~~--~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vc 143 (175)
|++| ++++.|.+-++...+..-. +- =+.... -+|++++.. ..+++.++..|+.++|..++-+|+..-
T Consensus 38 ~~~~~~~~~~f~~a~~~~~~Gd~~~A~~~l~~Al~--~dP~n~~~~-------~~LA~~yl~~g~~~~A~~~~~kAv~ld 108 (987)
T PRK09782 38 DYRHFVIYPRLDKALKAQKNNDEATAIREFEYIHQ--QVPDNIPLT-------LYLAEAYRHFGHDDRARLLLEDQLKRH 108 (987)
T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHH--hCCCCHHHH-------HHHHHHHHHCCCHHHHHHHHHHHHhcC
Confidence 5555 6778888877765432221 11 011222 245666655 788999999999999999999999998
Q ss_pred CChHHHHHHHh--hcCCH--HHHHHHHHhcCC
Q psy14697 144 GQPNQLLGVLQ--QTLPP--NVFNALIEKLPP 171 (175)
Q Consensus 144 pqP~~LL~IyQ--qTLPp--~Vf~~Li~kLp~ 171 (175)
|....+...+- +--|+ ++|+-+++.-|.
T Consensus 109 P~n~~~~~~La~i~~~~kA~~~ye~l~~~~P~ 140 (987)
T PRK09782 109 PGDARLERSLAAIPVEVKSVTTVEELLAQQKA 140 (987)
T ss_pred cccHHHHHHHHHhccChhHHHHHHHHHHhCCC
Confidence 87665554221 22333 557777776664
No 120
>PF09164 VitD-bind_III: Vitamin D binding protein, domain III; InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=30.30 E-value=40 Score=24.51 Aligned_cols=21 Identities=29% Similarity=0.393 Sum_probs=15.8
Q ss_pred ccCCCCChhhHHHhHHHHHHh
Q psy14697 68 DKKRRSDPLFKEKLKERRRKN 88 (175)
Q Consensus 68 D~KRR~dP~FRKkLrerRrk~ 88 (175)
||.--.=-||||||.++-+++
T Consensus 5 dYse~tFtEyKKrL~e~l~~k 25 (68)
T PF09164_consen 5 DYSENTFTEYKKRLAERLRAK 25 (68)
T ss_dssp TTTTS-HHHHHHHHHHHHHHH
T ss_pred hhhhccHHHHHHHHHHHHHHH
Confidence 666667789999999877653
No 121
>PRK01001 putative inner membrane protein translocase component YidC; Provisional
Probab=30.23 E-value=65 Score=32.93 Aligned_cols=75 Identities=17% Similarity=0.220 Sum_probs=38.2
Q ss_pred hhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHH----hhcCCCCCCCChHHHHHHHHHHHHhHh
Q psy14697 46 GNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQ----KNKRGIPDLKDHSAVQQYFLQEIQLGE 121 (175)
Q Consensus 46 ~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~----a~~~~~p~~~d~e~~E~fFl~eV~lGE 121 (175)
++++..+..+ +++|+|--+||-|+ |-.=.+.++++.|+..+.+.+ -+.+..|-.++.++-|.==-..+..|.
T Consensus 3 ~r~~~~v~~~-~~~~~~~~~~f~~~---~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (795)
T PRK01001 3 KRTLLFVSLV-SAAFLGCQIFFGYR---DFRSCQDLAEKQRAISEQILAATESVGLSVVPWTTSLEEEESKNQYAVRVGD 78 (795)
T ss_pred ceeehHHHHH-HHHHHHHHHHhhcc---cHHHHHHHHHHHHHHHHHHHHhhccccccccccccccchhhhhhhhhhhccc
Confidence 4444334333 35699999999997 333345566654443322211 234445555555443333333455565
Q ss_pred HHh
Q psy14697 122 TLL 124 (175)
Q Consensus 122 ~L~ 124 (175)
.|+
T Consensus 79 ~~~ 81 (795)
T PRK01001 79 RLL 81 (795)
T ss_pred eEE
Confidence 554
No 122
>KOG0548|consensus
Probab=29.77 E-value=2.3e+02 Score=27.90 Aligned_cols=29 Identities=21% Similarity=0.371 Sum_probs=24.7
Q ss_pred HhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 120 GETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 120 GE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
|-.....|++.+|+.|+..||+-=|+-..
T Consensus 365 Gne~Fk~gdy~~Av~~YteAIkr~P~Da~ 393 (539)
T KOG0548|consen 365 GNEAFKKGDYPEAVKHYTEAIKRDPEDAR 393 (539)
T ss_pred HHHHHhccCHHHHHHHHHHHHhcCCchhH
Confidence 99999999999999999999986654433
No 123
>PF04654 DUF599: Protein of unknown function, DUF599; InterPro: IPR006747 This family includes several uncharacterised proteins.
Probab=29.12 E-value=70 Score=26.97 Aligned_cols=32 Identities=28% Similarity=0.528 Sum_probs=23.6
Q ss_pred HHHHHHhhhhheeccCCCCChhhHHHhHHHHH
Q psy14697 55 VGAAVVVGYCLYFDKKRRSDPLFKEKLKERRR 86 (175)
Q Consensus 55 vaga~flGYcIYFD~KRR~dP~FRKkLrerRr 86 (175)
...+..+||-.|.+|+-|.+|--....++.|+
T Consensus 6 ~~l~~~~gY~~~l~~~~~~~pt~~~~~~~~R~ 37 (216)
T PF04654_consen 6 LFLLCWVGYHLYLERRVRRRPTLSGIMNQYRR 37 (216)
T ss_pred HHHHHHHHHHHHHhhhccCCChHHHHHHHHHH
Confidence 33356889999999999999955555555444
No 124
>KOG0909|consensus
Probab=29.04 E-value=55 Score=31.59 Aligned_cols=47 Identities=19% Similarity=0.196 Sum_probs=39.0
Q ss_pred hhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCC-HHHHHHHHHhcCC
Q psy14697 124 LAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLP-PNVFNALIEKLPP 171 (175)
Q Consensus 124 ~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLP-p~Vf~~Li~kLp~ 171 (175)
.+.+.+.+.+....+.+.||.+| .|..+=..|+| +++++...++++.
T Consensus 84 ~~~~~Fle~l~~~sqhvlvYEDP-~lq~~Als~iPvdEL~~kA~ekla~ 131 (500)
T KOG0909|consen 84 RATNAFLEVLRSYSQHVLVYEDP-VLQAKALSTIPVDELKEKASEKLAK 131 (500)
T ss_pred hcchHHHHHHHHhccCceeecCH-HHHHHHHhcCCHHHHHHHHHHhhcc
Confidence 34455677888889999999999 67777788999 8999999999884
No 125
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=28.98 E-value=1e+02 Score=27.48 Aligned_cols=37 Identities=5% Similarity=0.196 Sum_probs=30.8
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL 153 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy 153 (175)
..+|..+...|++++|..+|-+|+..-|+-.++...+
T Consensus 74 ~~lg~~~~~lg~~~eA~~~~~~al~l~P~~~~~~~~l 110 (356)
T PLN03088 74 LRKGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLI 110 (356)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHH
Confidence 4679999999999999999999999998766654443
No 126
>KOG3498|consensus
Probab=28.93 E-value=65 Score=23.37 Aligned_cols=22 Identities=14% Similarity=0.199 Sum_probs=16.6
Q ss_pred hhhhHHHHHHHHHHHHHhhhhh
Q psy14697 44 ISGNKVVTAVAVGAAVVVGYCL 65 (175)
Q Consensus 44 ~~~~~~~~~agvaga~flGYcI 65 (175)
.++...+.+.|.+..+|+||.|
T Consensus 33 f~ki~~~~aiGf~~mG~iGf~v 54 (67)
T KOG3498|consen 33 FTKIAKATAIGFVIMGFIGFFV 54 (67)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666777777788999987
No 127
>KOG0772|consensus
Probab=28.24 E-value=59 Score=32.14 Aligned_cols=57 Identities=16% Similarity=0.361 Sum_probs=46.8
Q ss_pred HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHH
Q psy14697 107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNAL 165 (175)
Q Consensus 107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~L 165 (175)
-.+..|...||.+-......-|.-||+...+++-. .+|.=+-..|.||-|+.||+-+
T Consensus 568 g~L~sy~vk~iglak~~~~d~DpREAILKyAk~A~--~nP~~iapaYskTQPk~IF~~~ 624 (641)
T KOG0772|consen 568 GLLSSYLVKQIGLAKNFMDDEDPREAILKYAKVAE--ENPYFIAPAYSKTQPKTIFQEV 624 (641)
T ss_pred ccHHHHHHhhcccccccccccCHHHHHHHHHhhhh--hCccccchhhhccCCchhhccc
Confidence 46899999999998887777778899998888765 4565555799999999999865
No 128
>PRK02898 cobalt transport protein CbiN; Provisional
Probab=27.97 E-value=38 Score=26.17 Aligned_cols=26 Identities=27% Similarity=0.318 Sum_probs=18.4
Q ss_pred hHHHHHHHHHHHHHhhhhheeccCCC
Q psy14697 47 NKVVTAVAVGAAVVVGYCLYFDKKRR 72 (175)
Q Consensus 47 ~~~~~~agvaga~flGYcIYFD~KRR 72 (175)
+++.+.=++.|++++||++-+-+-||
T Consensus 68 SLLFaLQAAiGAgiIgY~lG~~~gr~ 93 (100)
T PRK02898 68 SLLFALQAALGAGIIGYILGYYKGRS 93 (100)
T ss_pred HHHHHHHHHHhhhhhheeeeehhhhh
Confidence 45556667777899999987655443
No 129
>COG1687 AzlD Predicted branched-chain amino acid permeases (azaleucine resistance) [Amino acid transport and metabolism]
Probab=27.82 E-value=68 Score=25.14 Aligned_cols=24 Identities=38% Similarity=0.778 Sum_probs=21.4
Q ss_pred cCChHHHHHHHhhcCCHHHHHHHH
Q psy14697 143 CGQPNQLLGVLQQTLPPNVFNALI 166 (175)
Q Consensus 143 cpqP~~LL~IyQqTLPp~Vf~~Li 166 (175)
-.+|.|+.+.+.+++|+.|+-+|.
T Consensus 29 ~~~ppq~v~~lgk~lP~~v~~~Lv 52 (106)
T COG1687 29 SGRPPQFVGYLGKVLPASVMGMLV 52 (106)
T ss_pred CCCchHHHHHHHHhcCHHHHHHHH
Confidence 357899999999999999999884
No 130
>PF09716 ETRAMP: Malarial early transcribed membrane protein (ETRAMP); InterPro: IPR006389 These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii []. A homologue from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane []. Members have an initial hydrophobic, Phe/Tyr-rich stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region, and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes.
Probab=27.76 E-value=59 Score=23.78 Aligned_cols=14 Identities=36% Similarity=0.470 Sum_probs=8.7
Q ss_pred HHHHHhhhhheecc
Q psy14697 56 GAAVVVGYCLYFDK 69 (175)
Q Consensus 56 aga~flGYcIYFD~ 69 (175)
+.++.+||-+|.-+
T Consensus 68 li~~~~G~g~y~~~ 81 (84)
T PF09716_consen 68 LIATALGYGYYKKK 81 (84)
T ss_pred HHHHHHHHHHHhhc
Confidence 33456688888543
No 131
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=27.60 E-value=2.9e+02 Score=22.40 Aligned_cols=31 Identities=13% Similarity=0.190 Sum_probs=24.9
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPN 147 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~ 147 (175)
..+|-.+...|++++|+.++-+++..-|.=.
T Consensus 148 ~~LA~~~~~~g~~~~Ai~~~~~aL~l~~~~~ 178 (198)
T PRK10370 148 MLLASDAFMQADYAQAIELWQKVLDLNSPRV 178 (198)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHhhCCCCc
Confidence 3457777788999999999999998877643
No 132
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=27.50 E-value=81 Score=18.75 Aligned_cols=25 Identities=20% Similarity=0.369 Sum_probs=18.8
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHH
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGV 133 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa 133 (175)
+|+|+++. -.+|..+...|+.++|.
T Consensus 9 ~P~n~~a~-------~nla~~~~~~g~~~~A~ 33 (34)
T PF13431_consen 9 NPNNAEAY-------NNLANLYLNQGDYEEAI 33 (34)
T ss_pred CCCCHHHH-------HHHHHHHHHCcCHHhhc
Confidence 46776665 35899999999988774
No 133
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=27.24 E-value=1.3e+02 Score=23.12 Aligned_cols=31 Identities=32% Similarity=0.338 Sum_probs=27.3
Q ss_pred HHHHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 115 QEIQLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 115 ~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
-.|++|..+...|++++|+.-|-+++.-.|.
T Consensus 40 a~i~lastlr~LG~~deA~~~L~~~~~~~p~ 70 (120)
T PF12688_consen 40 ALIQLASTLRNLGRYDEALALLEEALEEFPD 70 (120)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence 4568999999999999999999999988887
No 134
>PF12072 DUF3552: Domain of unknown function (DUF3552); InterPro: IPR022711 This presumed domain is functionally uncharacterised. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with PF00013 from PFAM, PF01966 from PFAM. This domain has a single completely conserved residue A that may be functionally important. ; GO: 0008663 2',3'-cyclic-nucleotide 2'-phosphodiesterase activity
Probab=27.20 E-value=1.7e+02 Score=24.23 Aligned_cols=16 Identities=25% Similarity=0.505 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHhhhhh
Q psy14697 50 VTAVAVGAAVVVGYCL 65 (175)
Q Consensus 50 ~~~agvaga~flGYcI 65 (175)
.++++++.++++||.+
T Consensus 5 ~~i~~~~vG~~~G~~~ 20 (201)
T PF12072_consen 5 IAIVALIVGIGIGYLV 20 (201)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444444456788888
No 135
>PF10828 DUF2570: Protein of unknown function (DUF2570); InterPro: IPR022538 This entry is represented by Bacteriophage IME08, pseT.3. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a family of proteins with unknown function.
Probab=27.15 E-value=2.7e+02 Score=20.91 Aligned_cols=100 Identities=21% Similarity=0.268 Sum_probs=50.7
Q ss_pred HHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHH-HHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcC
Q psy14697 49 VVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNREL-LQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAG 127 (175)
Q Consensus 49 ~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~-~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G 127 (175)
+.+++++...+..||..|--+ .-.+|+.+-+++++. +..+.. .-.+.+..+-++..+++.+.=|..+.+
T Consensus 5 ~~~~l~~lvl~L~~~l~~qs~-------~i~~L~a~n~~q~~tI~qq~~~--~~~L~~~~~~~r~~~~~~~~~~qq~r~- 74 (110)
T PF10828_consen 5 IYIALAVLVLGLGGWLWYQSQ-------RIDRLRAENKAQAQTIQQQEDA--NQELKAQLQQNRQAVEEQQKREQQLRQ- 74 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence 344445555556667666432 234455443333322 111111 011233344567777777777776654
Q ss_pred ChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHH
Q psy14697 128 DLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNAL 165 (175)
Q Consensus 128 ~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~L 165 (175)
+.++..+-+-+||+== | -+...+|.+|.+.|
T Consensus 75 ~~e~~~e~ik~~lk~d--~-----Ca~~~~P~~V~d~L 105 (110)
T PF10828_consen 75 QSEERRESIKTALKDD--P-----CANTAVPDAVIDSL 105 (110)
T ss_pred HHHHHHHHHHHHHccC--c-----cccCCCCHHHHHHH
Confidence 4456677777777621 1 23345777776654
No 136
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=27.05 E-value=96 Score=25.65 Aligned_cols=36 Identities=19% Similarity=0.227 Sum_probs=27.9
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGV 152 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~I 152 (175)
+..|+.+...|+.++|...+-+||..-|+-.+++.-
T Consensus 150 ~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~~ 185 (280)
T PF13429_consen 150 LALAEIYEQLGDPDKALRDYRKALELDPDDPDARNA 185 (280)
T ss_dssp HHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHH
Confidence 456788888999999999999999999985555543
No 137
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=26.92 E-value=1.1e+02 Score=29.37 Aligned_cols=38 Identities=13% Similarity=0.050 Sum_probs=30.5
Q ss_pred CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697 102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVC 143 (175)
Q Consensus 102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vc 143 (175)
+|+++++...| ..+|-.+...|+.++|+.+|-+||..+
T Consensus 105 ~Pd~aeA~~A~----yNLAcaya~LGr~dEAla~LrrALels 142 (453)
T PLN03098 105 NPNPDEAQAAY----YNKACCHAYREEGKKAADCLRTALRDY 142 (453)
T ss_pred CCCchHHHHHH----HHHHHHHHHcCCHHHHHHHHHHHHHhc
Confidence 45666553333 678999999999999999999999985
No 138
>PF04415 DUF515: Protein of unknown function (DUF515) ; InterPro: IPR007509 This is a family of hypothetical archaeal proteins.
Probab=26.90 E-value=1.3e+02 Score=28.64 Aligned_cols=67 Identities=13% Similarity=0.180 Sum_probs=35.3
Q ss_pred hhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhH
Q psy14697 46 GNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLG 120 (175)
Q Consensus 46 ~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lG 120 (175)
+.+++++.-+...+|++|.+|+..+=|-- ++ |.. .|+++ .++-...-+-+.|+|++.. -|..||+.+
T Consensus 31 riiiGa~Vl~iIii~~~~~~Y~~~~~~~~---~e-Le~-aK~~k-I~evN~~F~g~l~nD~~K~--~li~qI~~A 97 (416)
T PF04415_consen 31 RIIIGAAVLIIIIIFIVYNIYYFLQNQPA---QE-LEN-AKAAK-INEVNKYFKGELPNDPQKI--ILINQIQSA 97 (416)
T ss_pred hhhhhhhhHhHHHHHHHHHHHHHhhhhHH---HH-HHH-HHHHH-HHHHHHHhcCcCCCChHHH--HHHHHHHhc
Confidence 34454444555568899999998875432 22 332 11111 1111111222567888743 467777665
No 139
>PRK11462 putative transporter; Provisional
Probab=26.53 E-value=58 Score=29.42 Aligned_cols=23 Identities=17% Similarity=0.332 Sum_probs=16.3
Q ss_pred heeccCCCCChhhHHHhHHHHHH
Q psy14697 65 LYFDKKRRSDPLFKEKLKERRRK 87 (175)
Q Consensus 65 IYFD~KRR~dP~FRKkLrerRrk 87 (175)
-||+--|+...+.+++|++||++
T Consensus 426 ~~y~l~~~~~~~i~~~l~~~~~~ 448 (460)
T PRK11462 426 RYYSLTTHNLKTVMEQLAQGKRR 448 (460)
T ss_pred HhccCCHHHHHHHHHHHHHHHhh
Confidence 35677677788888888766543
No 140
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=26.44 E-value=2.2e+02 Score=29.33 Aligned_cols=27 Identities=22% Similarity=0.236 Sum_probs=12.9
Q ss_pred hHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 119 LGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 119 lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
+|..+..+|++++|..+|-+|+..-|.
T Consensus 391 Lg~~~~~~g~~~eA~~~y~~aL~~~p~ 417 (1157)
T PRK11447 391 LGDVAMARKDYAAAERYYQQALRMDPG 417 (1157)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHhCCC
Confidence 444444444444444444444444444
No 141
>PF07862 Nif11: Nitrogen fixation protein of unknown function; InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned [].
Probab=25.66 E-value=41 Score=21.60 Aligned_cols=12 Identities=50% Similarity=0.872 Sum_probs=10.7
Q ss_pred CCChhhHHHhHH
Q psy14697 72 RSDPLFKEKLKE 83 (175)
Q Consensus 72 R~dP~FRKkLre 83 (175)
++||+||.+|+.
T Consensus 14 ~~d~~l~~~l~~ 25 (49)
T PF07862_consen 14 KSDPELREQLKA 25 (49)
T ss_pred hcCHHHHHHHHh
Confidence 499999999986
No 142
>KOG3289|consensus
Probab=25.18 E-value=31 Score=29.62 Aligned_cols=54 Identities=22% Similarity=0.268 Sum_probs=34.2
Q ss_pred cccccccccccccCCCCccc----------------hhhhhhhhHHHHHHHHHHHHHhhhhheeccCCCCChh
Q psy14697 20 WCGYFGLWSGRIGPSGRLMS----------------PMELISGNKVVTAVAVGAAVVVGYCLYFDKKRRSDPL 76 (175)
Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~----------------~~~m~~~~~~~~~agvaga~flGYcIYFD~KRR~dP~ 76 (175)
-|+.+||.-|+++++|.-.. |+| +.-+.+=.=..-.|..++|| ||-.-|-+|-.
T Consensus 23 h~aVnGLLla~~~~kg~~v~itdcVPLfH~~laLaPmlE-vAl~lId~~~~~~GlviaGy--y~Ane~~~D~s 92 (199)
T KOG3289|consen 23 HAAVNGLLLAPATGKGECVEITDCVPLFHSHLALAPMLE-VALNLIDVWGAQAGLVIAGY--YHANERVNDQS 92 (199)
T ss_pred ccceeeEEEeccCCCCCeEEEEecchhhccccccccHHH-HHHHHHHHHHHhcCeEEEEE--eecCCCccccC
Confidence 47889999999999997543 444 22222222222344556666 88888888744
No 143
>TIGR01162 purE phosphoribosylaminoimidazole carboxylase, PurE protein. Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. This model represents PurK, an N5-CAIR mutase.
Probab=25.08 E-value=90 Score=25.73 Aligned_cols=20 Identities=45% Similarity=0.609 Sum_probs=16.9
Q ss_pred CCCChhhHHHhHHHHHHhHH
Q psy14697 71 RRSDPLFKEKLKERRRKNRE 90 (175)
Q Consensus 71 RR~dP~FRKkLrerRrk~~~ 90 (175)
+.+||+.++||++.|+++++
T Consensus 127 ~~~d~~l~~kl~~~r~~~~~ 146 (156)
T TIGR01162 127 GIKDPELAEKLKEYRENQKE 146 (156)
T ss_pred cCCCHHHHHHHHHHHHHHHH
Confidence 57899999999998877664
No 144
>KOG2072|consensus
Probab=24.83 E-value=3.4e+02 Score=28.47 Aligned_cols=89 Identities=17% Similarity=0.259 Sum_probs=48.0
Q ss_pred hhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChh------hHHHHH
Q psy14697 63 YCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLD------NGVEHL 136 (175)
Q Consensus 63 YcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~------eaa~hf 136 (175)
+|. -|+|++ +|||=--.=|..-....+....+..++++|||.+|.|.=.-.++=..-+..+-+. |-+.|+
T Consensus 186 FCL--kYqRkt--EFRrLCe~LR~HL~~i~k~~nq~~~v~Ln~~Etlql~LDtRf~QLdvAi~lELWQEAyrSiEDIhgL 261 (988)
T KOG2072|consen 186 FCL--KYQRKT--EFRRLCELLRMHLDNINKHQNQSTRVDLNDPETLQLYLDTRFQQLDVAIELELWQEAYRSIEDIHGL 261 (988)
T ss_pred HHH--HHhhhH--HHHHHHHHHHHHHHHHHHhhccCcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 665 456664 8887433223332233333445778899999999988655444333333322211 223344
Q ss_pred HHHHHhcCChHHHHHHHhh
Q psy14697 137 ANALTVCGQPNQLLGVLQQ 155 (175)
Q Consensus 137 ~nAl~VcpqP~~LL~IyQq 155 (175)
..--+=-|.|.-|-+-|++
T Consensus 262 m~lSKrtPkp~~laNYY~K 280 (988)
T KOG2072|consen 262 MKLSKRTPKPSTLANYYEK 280 (988)
T ss_pred HHHhcCCCchhHHHHHHHH
Confidence 4444455666666555554
No 145
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=24.70 E-value=1.4e+02 Score=22.63 Aligned_cols=29 Identities=14% Similarity=0.237 Sum_probs=14.9
Q ss_pred HHHhhhhhee-ccCCCCChhhHHHhHHHHH
Q psy14697 58 AVVVGYCLYF-DKKRRSDPLFKEKLKERRR 86 (175)
Q Consensus 58 a~flGYcIYF-D~KRR~dP~FRKkLrerRr 86 (175)
.++++|-+.| |+-.++--+.++++.+.++
T Consensus 12 l~~l~y~l~~g~~G~~~~~~l~~q~~~~~~ 41 (105)
T PRK00888 12 LVWLQYSLWFGKNGILDYWRVNDQVAAQQQ 41 (105)
T ss_pred HHHHHHHHhccCCcHHHHHHHHHHHHHHHH
Confidence 3567777766 3333344445555554333
No 146
>COG1575 MenA 1,4-dihydroxy-2-naphthoate octaprenyltransferase [Coenzyme metabolism]
Probab=24.45 E-value=21 Score=32.27 Aligned_cols=15 Identities=40% Similarity=0.656 Sum_probs=12.4
Q ss_pred eeccCCCCC---hhhHHH
Q psy14697 66 YFDKKRRSD---PLFKEK 80 (175)
Q Consensus 66 YFD~KRR~d---P~FRKk 80 (175)
||||||-+| |+.+++
T Consensus 66 Y~D~~KG~D~~g~~~~~~ 83 (303)
T COG1575 66 YFDYKKGTDTHGPDRLKQ 83 (303)
T ss_pred hhhhhcCCCCCCCccccc
Confidence 999999999 666554
No 147
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=24.43 E-value=63 Score=27.43 Aligned_cols=28 Identities=21% Similarity=0.312 Sum_probs=20.8
Q ss_pred HHhHhHHhhc-CChhhHHHHHHHHHHhcC
Q psy14697 117 IQLGETLLAA-GDLDNGVEHLANALTVCG 144 (175)
Q Consensus 117 V~lGE~L~a~-G~~~eaa~hf~nAl~Vcp 144 (175)
..+|+.+-.. |++++|++++-+|+.+|.
T Consensus 118 ~~lA~~ye~~~~d~e~Ai~~Y~~A~~~y~ 146 (282)
T PF14938_consen 118 KELAEIYEEQLGDYEKAIEYYQKAAELYE 146 (282)
T ss_dssp HHHHHHHCCTT--HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 5677777777 788888888888888774
No 148
>PF02526 GBP_repeat: Glycophorin-binding protein; InterPro: IPR003681 The glycophorin-binding protein contains a tandem repeat. The repeated sequence determines the binding domain for an erythrocyte receptor binding protein of Plasmodium falciparum, the malarial parasite []. Erythrocyte invasion by the malarial merozoite is a receptor-mediated process, an obligatory step in the development of the parasite. The P. falciparum protein binds to the erythrocyte receptor glycophorin.
Probab=24.25 E-value=41 Score=21.71 Aligned_cols=10 Identities=30% Similarity=0.833 Sum_probs=8.4
Q ss_pred CChhhHHHhH
Q psy14697 73 SDPLFKEKLK 82 (175)
Q Consensus 73 ~dP~FRKkLr 82 (175)
.||+|||-|.
T Consensus 15 adpeyrkh~~ 24 (38)
T PF02526_consen 15 ADPEYRKHLN 24 (38)
T ss_pred cCHHHHHHHH
Confidence 5999999875
No 149
>PRK15058 cytochrome b562; Provisional
Probab=24.05 E-value=2.7e+02 Score=22.35 Aligned_cols=22 Identities=14% Similarity=0.212 Sum_probs=19.1
Q ss_pred HHHHHHHhHhHHhhcCChhhHH
Q psy14697 112 YFLQEIQLGETLLAAGDLDNGV 133 (175)
Q Consensus 112 fFl~eV~lGE~L~a~G~~~eaa 133 (175)
-+..||..-+.|+.+|+.++|-
T Consensus 89 ~Li~qID~a~~la~~GkL~eAK 110 (128)
T PRK15058 89 ILVGQIDGALKLANEGKVKEAQ 110 (128)
T ss_pred HHHHHHHHHHHHHhCCCHHHHH
Confidence 3788999999999999998774
No 150
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=23.94 E-value=5.2e+02 Score=23.03 Aligned_cols=34 Identities=15% Similarity=0.123 Sum_probs=22.7
Q ss_pred HhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697 120 GETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL 153 (175)
Q Consensus 120 GE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy 153 (175)
++..+.+|++++|..++-+++...|+..+.+..+
T Consensus 160 a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll 193 (398)
T PRK10747 160 VRIQLARNENHAARHGVDKLLEVAPRHPEVLRLA 193 (398)
T ss_pred HHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHH
Confidence 5555667777777777777777777666444433
No 151
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=23.90 E-value=1e+02 Score=21.06 Aligned_cols=20 Identities=15% Similarity=0.298 Sum_probs=13.9
Q ss_pred hhhHHHHHHHHHHhcCChHH
Q psy14697 129 LDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 129 ~~eaa~hf~nAl~VcpqP~~ 148 (175)
|.+|+++|-+|+.+.++|..
T Consensus 31 Y~~a~e~l~~~~~~~~~~~~ 50 (77)
T smart00745 31 YKKAIEYLLEGIKVESDSKR 50 (77)
T ss_pred HHHHHHHHHHHhccCCCHHH
Confidence 45667777788888777644
No 152
>PF02234 CDI: Cyclin-dependent kinase inhibitor; InterPro: IPR003175 Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase.; GO: 0004861 cyclin-dependent protein kinase inhibitor activity, 0007050 cell cycle arrest, 0005634 nucleus; PDB: 1H27_E 1JSU_C.
Probab=23.80 E-value=58 Score=21.79 Aligned_cols=22 Identities=23% Similarity=0.439 Sum_probs=14.9
Q ss_pred CCChHHHHHHHHHHHHhHhHHh
Q psy14697 103 LKDHSAVQQYFLQEIQLGETLL 124 (175)
Q Consensus 103 ~~d~e~~E~fFl~eV~lGE~L~ 124 (175)
|.|+++++++|.++.+.-.+-.
T Consensus 6 p~d~~e~~~~~~~~l~~~~e~~ 27 (51)
T PF02234_consen 6 PVDHEELERFFQEELQEQREEF 27 (51)
T ss_dssp ---HHHHHHHHHHHHTTTTHHH
T ss_pred CCCHHHHHHHHHHHHHHHHHHH
Confidence 7899999999999877654443
No 153
>PF13480 Acetyltransf_6: Acetyltransferase (GNAT) domain
Probab=23.74 E-value=73 Score=22.68 Aligned_cols=37 Identities=11% Similarity=0.228 Sum_probs=19.8
Q ss_pred hhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHH
Q psy14697 76 LFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQE 116 (175)
Q Consensus 76 ~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~e 116 (175)
.+|+++|+++|+.++ .+.-.+-..+++++.++++-.-
T Consensus 2 k~r~~~rr~~r~~~~----~g~~~~~~~~~~~~~~~~~~~~ 38 (142)
T PF13480_consen 2 KFRKKIRRAIRRAEK----LGGVRFEVATDPADLEAFYELY 38 (142)
T ss_pred cHHHHHHHHHHHHHh----cCCEEEEEeCCHHHHHHHHHHH
Confidence 467888877766543 1211112235667776655443
No 154
>COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]
Probab=23.58 E-value=48 Score=31.20 Aligned_cols=49 Identities=16% Similarity=0.318 Sum_probs=36.1
Q ss_pred eeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCC--CCCChHHHHHHHHHHHHh
Q psy14697 66 YFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIP--DLKDHSAVQQYFLQEIQL 119 (175)
Q Consensus 66 YFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p--~~~d~e~~E~fFl~eV~l 119 (175)
|-+|--.+.|-++|-|+++-|.+-. -++++| ...|.++++++|-+|.+.
T Consensus 55 ~~Yh~g~sE~~lgkaL~~~~Rekv~-----LaTKlp~~~~~~~edm~r~fneqLek 105 (391)
T COG1453 55 WPYHGGESEEFLGKALKDGYREKVK-----LATKLPSWPVKDREDMERIFNEQLEK 105 (391)
T ss_pred ccccCCCchHHHHHHhhhcccceEE-----EEeecCCccccCHHHHHHHHHHHHHH
Confidence 4456677888888889887643322 236777 789999999999999753
No 155
>PF00646 F-box: F-box domain; InterPro: IPR001810 The F-box domain was first described as a sequence motif found in cyclin-F that interacts with the protein SKP1 [, ]. This relatively conserved structural motif is present in numerous proteins and serves as a link between a target protein and a ubiquitin-conjugating enzyme. The SCF complex (e.g., Skp1-Cullin-F-box) plays a similar role as an E3 ligase in the ubiquitin protein degradation pathway [, ]. Different F-box proteins as a part of SCF complex recruit particular substrates for ubiquitination through specific protein-protein interaction domains. Many mammalian F-box domains contain leucine-rich or WD-40 repeats (IPR001680 from INTERPRO). However, several F-box proteins either have other previously described domains such as Sec7 domain found in FBS protein or do not contain defined protein-protein interaction domains or motifs.; GO: 0005515 protein binding; PDB: 2E32_A 2E31_A 3V7D_B 1NEX_B 3MKS_D 3L2O_B.
Probab=23.58 E-value=62 Score=19.78 Aligned_cols=16 Identities=25% Similarity=0.671 Sum_probs=11.6
Q ss_pred cCCHHHHHHHHHhcCC
Q psy14697 156 TLPPNVFNALIEKLPP 171 (175)
Q Consensus 156 TLPp~Vf~~Li~kLp~ 171 (175)
-||++|...++..|+.
T Consensus 5 ~LP~~il~~Il~~l~~ 20 (48)
T PF00646_consen 5 DLPDEILQEILSYLDP 20 (48)
T ss_dssp HS-HHHHHHHHHTS-H
T ss_pred HCCHHHHHHHHHHCcH
Confidence 4899999999888874
No 156
>PF04341 DUF485: Protein of unknown function, DUF485; InterPro: IPR007436 This family includes several putative integral membrane proteins.
Probab=23.42 E-value=65 Score=23.59 Aligned_cols=18 Identities=33% Similarity=0.518 Sum_probs=13.1
Q ss_pred CCChhhHHHhHHHHHHhH
Q psy14697 72 RSDPLFKEKLKERRRKNR 89 (175)
Q Consensus 72 R~dP~FRKkLrerRrk~~ 89 (175)
.+||+|++=.|+||+-.-
T Consensus 2 ~~~p~f~~L~r~r~r~~~ 19 (91)
T PF04341_consen 2 LRSPEFQELVRRRRRLAW 19 (91)
T ss_pred CCCHHHHHHHHHHHHHHH
Confidence 479999987777666543
No 157
>PF06864 PAP_PilO: Pilin accessory protein (PilO); InterPro: IPR009663 This family consists of several enterobacterial PilO proteins. The function of PilO is unknown although it has been suggested that it is a cytoplasmic protein in the absence of other Pil proteins, but PilO protein is translocated to the outer membrane in the presence of other Pil proteins. Alternatively, PilO protein may form a complex with other Pil protein(s). PilO has been predicted to function as a component of the pilin transport apparatus and thin-pilus basal body []. This family does not seem to be related to IPR007445 from INTERPRO.
Probab=23.31 E-value=1.5e+02 Score=27.14 Aligned_cols=9 Identities=11% Similarity=0.235 Sum_probs=5.2
Q ss_pred HHHHHHHHH
Q psy14697 133 VEHLANALT 141 (175)
Q Consensus 133 a~hf~nAl~ 141 (175)
+..|..+++
T Consensus 274 v~dF~~~~~ 282 (414)
T PF06864_consen 274 VADFIRRVR 282 (414)
T ss_pred HHHHHHHHH
Confidence 456666554
No 158
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=23.29 E-value=1.4e+02 Score=26.49 Aligned_cols=42 Identities=26% Similarity=0.399 Sum_probs=34.3
Q ss_pred HHHHHhHhHHhhcCChhhHHHHHHHHHH--hcCChHHHHHHHhh
Q psy14697 114 LQEIQLGETLLAAGDLDNGVEHLANALT--VCGQPNQLLGVLQQ 155 (175)
Q Consensus 114 l~eV~lGE~L~a~G~~~eaa~hf~nAl~--VcpqP~~LL~IyQq 155 (175)
...+.+|+.|..-|++-||+.||..|+. .-..|+-||+.=|-
T Consensus 90 qnr~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~A 133 (251)
T COG4700 90 QNRYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQA 133 (251)
T ss_pred HHHHHHHHHHHHhhhhhhhHHHHHHHhccccCCCHHHHHHHHHH
Confidence 3568999999999999999999999997 45677777766553
No 159
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=23.27 E-value=1.5e+02 Score=25.25 Aligned_cols=46 Identities=17% Similarity=0.292 Sum_probs=38.1
Q ss_pred CCChHHHHHHHHHHHHhHhHHhhcC-ChhhHHHHHHHHHHhcCChHH
Q psy14697 103 LKDHSAVQQYFLQEIQLGETLLAAG-DLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 103 ~~d~e~~E~fFl~eV~lGE~L~a~G-~~~eaa~hf~nAl~VcpqP~~ 148 (175)
..||+..+.--.---..|..+..++ ++++|+.=+-+|+.+|.+|.+
T Consensus 25 ~~~~~~~~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~~~~ 71 (278)
T PF08631_consen 25 SLDPDMAEELARVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEKPGK 71 (278)
T ss_pred cCCcHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHhhhh
Confidence 4566667766666778999999999 999999999999999988744
No 160
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=23.10 E-value=75 Score=23.88 Aligned_cols=9 Identities=22% Similarity=0.715 Sum_probs=4.4
Q ss_pred HHHhhhhhe
Q psy14697 58 AVVVGYCLY 66 (175)
Q Consensus 58 a~flGYcIY 66 (175)
++++||..|
T Consensus 15 ~~i~~y~~~ 23 (87)
T PF10883_consen 15 ALILAYLWW 23 (87)
T ss_pred HHHHHHHHH
Confidence 345555543
No 161
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=23.04 E-value=29 Score=24.87 Aligned_cols=9 Identities=11% Similarity=0.582 Sum_probs=0.0
Q ss_pred HHHhhhhhe
Q psy14697 58 AVVVGYCLY 66 (175)
Q Consensus 58 a~flGYcIY 66 (175)
.+++.+.||
T Consensus 26 ilLIlf~iy 34 (64)
T PF01034_consen 26 ILLILFLIY 34 (64)
T ss_dssp ---------
T ss_pred HHHHHHHHH
Confidence 356667777
No 162
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=22.71 E-value=94 Score=25.10 Aligned_cols=28 Identities=14% Similarity=0.233 Sum_probs=15.3
Q ss_pred HhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697 118 QLGETLLAAGDLDNGVEHLANALTVCGQ 145 (175)
Q Consensus 118 ~lGE~L~a~G~~~eaa~hf~nAl~Vcpq 145 (175)
+.|..++..|++.+|+..|...+.-||.
T Consensus 10 ~~a~~~~~~g~y~~Ai~~f~~l~~~~P~ 37 (203)
T PF13525_consen 10 QKALEALQQGDYEEAIKLFEKLIDRYPN 37 (203)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHHHHH-TT
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCC
Confidence 4455555666666666666666665554
No 163
>KOG1130|consensus
Probab=22.66 E-value=1.8e+02 Score=28.51 Aligned_cols=60 Identities=23% Similarity=0.279 Sum_probs=42.3
Q ss_pred HHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH--------------HHHHhhcCCHHHHHHHHHhc
Q psy14697 110 QQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL--------------LGVLQQTLPPNVFNALIEKL 169 (175)
Q Consensus 110 E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L--------------L~IyQqTLPp~Vf~~Li~kL 169 (175)
++--++=...||.|+..|+...|+.-|-.||.|=..--.+ |.=|.|.|-=+.+++.++++
T Consensus 14 ~~SCleLalEGERLck~gdcraGv~ff~aA~qvGTeDl~tLSAIYsQLGNAyfyL~DY~kAl~yH~hDltlar~ 87 (639)
T KOG1130|consen 14 DRSCLELALEGERLCKMGDCRAGVDFFKAALQVGTEDLSTLSAIYSQLGNAYFYLKDYEKALKYHTHDLTLARL 87 (639)
T ss_pred hhHHHHHHHHHHHHHhccchhhhHHHHHHHHHhcchHHHHHHHHHHHhcchhhhHhhHHHHHhhhhhhHHHHHH
Confidence 3334555778999999999999999999999886554443 44455566555666655543
No 164
>PF14407 Frankia_peptide: Ribosomally synthesized peptide prototyped by Frankia Franean1_4349.
Probab=22.61 E-value=45 Score=23.77 Aligned_cols=12 Identities=25% Similarity=0.528 Sum_probs=10.0
Q ss_pred CChhhHHHhHHH
Q psy14697 73 SDPLFKEKLKER 84 (175)
Q Consensus 73 ~dP~FRKkLrer 84 (175)
+||+||++|.-.
T Consensus 11 ~D~~FRqqllad 22 (61)
T PF14407_consen 11 TDEAFRQQLLAD 22 (61)
T ss_pred cCHHHHHHHhcC
Confidence 699999998753
No 165
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=22.48 E-value=1.7e+02 Score=24.80 Aligned_cols=25 Identities=24% Similarity=0.472 Sum_probs=16.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALT 141 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~ 141 (175)
..+|..+..+|++++|..++-+++.
T Consensus 73 ~~la~~~~~~g~~~~A~~~~~~~l~ 97 (389)
T PRK11788 73 LALGNLFRRRGEVDRAIRIHQNLLS 97 (389)
T ss_pred HHHHHHHHHcCcHHHHHHHHHHHhc
Confidence 3456666667777777776666665
No 166
>PF12777 MT: Microtubule-binding stalk of dynein motor; InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=22.46 E-value=68 Score=28.61 Aligned_cols=23 Identities=26% Similarity=0.453 Sum_probs=18.0
Q ss_pred HHHHHhhhhheeccCCCCChhhHHHhHHH
Q psy14697 56 GAAVVVGYCLYFDKKRRSDPLFKEKLKER 84 (175)
Q Consensus 56 aga~flGYcIYFD~KRR~dP~FRKkLrer 84 (175)
.+++|++||=+|| ++||++|-++
T Consensus 316 laaa~isY~G~f~------~~~R~~l~~~ 338 (344)
T PF12777_consen 316 LAAAFISYLGPFT------PEYRQELLKK 338 (344)
T ss_dssp HHHHHHHCCCCTS------HHHHHHHHHH
T ss_pred HHHHHHHHcCCCC------HHHHHHHHHH
Confidence 3467999999996 6888888653
No 167
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=22.43 E-value=2.1e+02 Score=28.44 Aligned_cols=36 Identities=3% Similarity=-0.092 Sum_probs=26.4
Q ss_pred HHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 115 QEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 115 ~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
.+..+|+.|...|++++|.+.|-++++--|++.+.+
T Consensus 156 ~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~ 191 (694)
T PRK15179 156 EILLEAKSWDEIGQSEQADACFERLSRQHPEFENGY 191 (694)
T ss_pred HHHHHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHH
Confidence 346788888889999999999999998333333443
No 168
>PHA03065 Hypothetical protein; Provisional
Probab=22.41 E-value=87 Score=29.93 Aligned_cols=20 Identities=35% Similarity=0.501 Sum_probs=15.4
Q ss_pred CCChhhHHHhHHHHHHhHHH
Q psy14697 72 RSDPLFKEKLKERRRKNREL 91 (175)
Q Consensus 72 R~dP~FRKkLrerRrk~~~~ 91 (175)
|.+=.-|..||+|||+..+.
T Consensus 75 RG~I~IK~~lReKRr~a~~~ 94 (438)
T PHA03065 75 RGSIPIKQSLREKRRKASKN 94 (438)
T ss_pred cCccchhhHHHHHHHHHHHH
Confidence 56778899999998876543
No 169
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=22.31 E-value=4.6e+02 Score=25.04 Aligned_cols=77 Identities=9% Similarity=-0.040 Sum_probs=0.0
Q ss_pred HHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChhhH
Q psy14697 53 VAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLDNG 132 (175)
Q Consensus 53 agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~ea 132 (175)
+-+-+...++|.+-+.+.. +.+....+.....+.......++.++..-+.. |-....+|++++|
T Consensus 376 a~a~A~la~~~~~~~~~~~---------~~~~~l~~a~~~~~~a~al~~~~~~~~~~~al-------a~~~~~~g~~~~A 439 (517)
T PRK10153 376 TYAQAEKALADIVRHSQQP---------LDEKQLAALSTELDNIVALPELNVLPRIYEIL-------AVQALVKGKTDEA 439 (517)
T ss_pred HHHHHHHHHHHHHHHhcCC---------ccHHHHHHHHHHHHHhhhcccCcCChHHHHHH-------HHHHHhcCCHHHH
Q ss_pred HHHHHHHHHhcCC
Q psy14697 133 VEHLANALTVCGQ 145 (175)
Q Consensus 133 a~hf~nAl~Vcpq 145 (175)
..||-+|+..=|+
T Consensus 440 ~~~l~rAl~L~ps 452 (517)
T PRK10153 440 YQAINKAIDLEMS 452 (517)
T ss_pred HHHHHHHHHcCCC
No 170
>PF14964 DUF4507: Domain of unknown function (DUF4507)
Probab=22.25 E-value=84 Score=29.15 Aligned_cols=30 Identities=33% Similarity=0.618 Sum_probs=26.2
Q ss_pred hhhHHHHHHHHHH-------hcCChHHHHHHHhhcCCH
Q psy14697 129 LDNGVEHLANALT-------VCGQPNQLLGVLQQTLPP 159 (175)
Q Consensus 129 ~~eaa~hf~nAl~-------VcpqP~~LL~IyQqTLPp 159 (175)
++.|+++|+.|+. +|+++.+|+..++ +||+
T Consensus 325 ~~laldRlAQalQVamasgcl~~n~~~L~~~l~-~LP~ 361 (362)
T PF14964_consen 325 IQLALDRLAQALQVAMASGCLYGNRDDLLALLS-QLPH 361 (362)
T ss_pred HHHHHHHHHHHHHHHHHhCcccCCHHHHHHHHH-hCCC
Confidence 6789999999997 5899999999998 6664
No 171
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=22.13 E-value=1e+02 Score=19.06 Aligned_cols=15 Identities=27% Similarity=0.529 Sum_probs=9.9
Q ss_pred HHHHHHHHHhhhhhe
Q psy14697 52 AVAVGAAVVVGYCLY 66 (175)
Q Consensus 52 ~agvaga~flGYcIY 66 (175)
+.++.....+||-+|
T Consensus 7 ~g~llv~lLl~YLvY 21 (29)
T PRK14750 7 CGALLVLLLLGYLVY 21 (29)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444578888887
No 172
>KOG0624|consensus
Probab=22.08 E-value=1.5e+02 Score=28.51 Aligned_cols=68 Identities=18% Similarity=0.222 Sum_probs=52.1
Q ss_pred CCCCCCh-HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHH
Q psy14697 100 IPDLKDH-SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIE 167 (175)
Q Consensus 100 ~p~~~d~-e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~ 167 (175)
++|++.+ +.+-.|--.-+|.|-.|+.+|++|+|..-|-.-|.--|+-..-+..+.++.|-+-...|.+
T Consensus 92 l~Dl~rVlelKpDF~~ARiQRg~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ 160 (504)
T KOG0624|consen 92 LQDLSRVLELKPDFMAARIQRGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQ 160 (504)
T ss_pred hhhHHHHHhcCccHHHHHHHhchhhhhcccHHHHHHHHHHHHhcCCCcchhHHHHHHHHhHHHHHHHHH
Confidence 4444443 2234566677999999999999999999999999999988888888888877655444443
No 173
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=21.95 E-value=40 Score=23.03 Aligned_cols=15 Identities=47% Similarity=0.747 Sum_probs=12.7
Q ss_pred cCCCCChhhHHHhHH
Q psy14697 69 KKRRSDPLFKEKLKE 83 (175)
Q Consensus 69 ~KRR~dP~FRKkLre 83 (175)
.+-++||+||++|+.
T Consensus 9 ~~~~~d~~L~~~l~~ 23 (64)
T TIGR03798 9 EKVKTDPDLREKLKA 23 (64)
T ss_pred HHHHcCHHHHHHHHH
Confidence 356799999999997
No 174
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=21.80 E-value=1.7e+02 Score=26.72 Aligned_cols=34 Identities=35% Similarity=0.453 Sum_probs=29.6
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLL 150 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL 150 (175)
..+|..+..+|++++|..+|.+++...|.+.+.+
T Consensus 60 ~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~ 93 (899)
T TIGR02917 60 FLLGKIYLALGDYAAAEKELRKALSLGYPKNQVL 93 (899)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHcCCChhhhH
Confidence 4589999999999999999999999988776544
No 175
>TIGR03644 marine_trans_1 probable ammonium transporter, marine subtype. Members of this protein family are well conserved subclass of putative ammonimum transporters, belonging to the much broader set of ammonium/methylammonium transporter described by TIGR00836. Species with this transporter tend to be marine bacteria. Partial phylogenetic profiling (PPP) picks a member of this protein family as the single best-scoring protein vs. a reference profile for the marine environment Genome Property for a large number of different query genomes. This finding by PPP suggests that this transporter family represents an important adaptation to the marine environment.
Probab=21.56 E-value=97 Score=28.83 Aligned_cols=22 Identities=18% Similarity=0.143 Sum_probs=13.8
Q ss_pred hhhhHHHHHHHHHHHHHhhhhh
Q psy14697 44 ISGNKVVTAVAVGAAVVVGYCL 65 (175)
Q Consensus 44 ~~~~~~~~~agvaga~flGYcI 65 (175)
..+|.++.++|...+.++.|..
T Consensus 243 ~~NT~la~a~g~l~~~~~~~~~ 264 (404)
T TIGR03644 243 FANTNAAAAGGAIAALLLTKLL 264 (404)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 5666666666666666666654
No 176
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=21.44 E-value=2.4e+02 Score=20.86 Aligned_cols=39 Identities=13% Similarity=0.392 Sum_probs=30.3
Q ss_pred cCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHH
Q psy14697 126 AGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIE 167 (175)
Q Consensus 126 ~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~ 167 (175)
.|++++|+.+.-+.+ .++...++..++...++++..++.
T Consensus 95 ~~~~~~Al~~~l~~~---~d~~~a~~~~~~~~~~~lw~~~~~ 133 (140)
T smart00299 95 DGNFKDAIVTLIEHL---GNYEKAIEYFVKQNNPELWAEVLK 133 (140)
T ss_pred hcCHHHHHHHHHHcc---cCHHHHHHHHHhCCCHHHHHHHHH
Confidence 466777777655444 679999999999999999888765
No 177
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=21.43 E-value=1.7e+02 Score=28.05 Aligned_cols=23 Identities=30% Similarity=0.342 Sum_probs=15.4
Q ss_pred hHhHHhhcCChhhHHHHHHHHHH
Q psy14697 119 LGETLLAAGDLDNGVEHLANALT 141 (175)
Q Consensus 119 lGE~L~a~G~~~eaa~hf~nAl~ 141 (175)
.||.|++.-+..+.|.+|.+=|.
T Consensus 462 ~GekL~~t~dPLe~A~kfl~pL~ 484 (517)
T PF12569_consen 462 LGEKLLKTEDPLEEAMKFLKPLL 484 (517)
T ss_pred cHHHHhcCCcHHHHHHHHHHHHH
Confidence 58888887776555666665443
No 178
>cd02577 PSTD1 PSTD1: Pseudouridine synthase, a subgroup of the TruD family. This group consists of several hypothetical archeal pseudouridine synthases assigned to the TruD family of psuedouridine synthases. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). The TruD family is comprised of proteins related to Escherichia coli TruD.
Probab=21.23 E-value=5.4e+02 Score=23.14 Aligned_cols=15 Identities=27% Similarity=0.381 Sum_probs=11.3
Q ss_pred HHhhhhheeccCCCC
Q psy14697 59 VVVGYCLYFDKKRRS 73 (175)
Q Consensus 59 ~flGYcIYFD~KRR~ 73 (175)
.--|+-=||++||=.
T Consensus 133 ~~~GfpNYfG~QRFG 147 (319)
T cd02577 133 EKGGFPNYFGSQRFG 147 (319)
T ss_pred HhcCCCccccccccC
Confidence 355778899999854
No 179
>COG1930 CbiN ABC-type cobalt transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=21.00 E-value=34 Score=26.41 Aligned_cols=33 Identities=24% Similarity=0.408 Sum_probs=22.4
Q ss_pred CCCCccchhhhhhhhHHHHHHHHHHHHHhhhhheeccCCCC
Q psy14697 33 PSGRLMSPMELISGNKVVTAVAVGAAVVVGYCLYFDKKRRS 73 (175)
Q Consensus 33 ~~~~~~~~~~m~~~~~~~~~agvaga~flGYcIYFD~KRR~ 73 (175)
|||.+= +++...=++.|++++||.+-+++-||.
T Consensus 60 pSGEIE--------SLLFslQaaiGa~IIgY~lGyyrgk~~ 92 (97)
T COG1930 60 PSGEIE--------SLLFSLQAAIGAGIIGYFLGYYRGKGQ 92 (97)
T ss_pred CCccHH--------HHHHHHHHHhcceeeeeeeeeecchhh
Confidence 666654 344455566678899999988876543
No 180
>KOG0425|consensus
Probab=20.71 E-value=73 Score=26.85 Aligned_cols=20 Identities=20% Similarity=0.363 Sum_probs=15.0
Q ss_pred CCCCChhhHHHhHHHHHHhH
Q psy14697 70 KRRSDPLFKEKLKERRRKNR 89 (175)
Q Consensus 70 KRR~dP~FRKkLrerRrk~~ 89 (175)
-|-|+-|||||+++.-++.+
T Consensus 145 ~Ren~~EykkkV~r~vr~s~ 164 (171)
T KOG0425|consen 145 WRENPEEYKKKVRRCVRRSQ 164 (171)
T ss_pred HhhCHHHHHHHHHHHHHHHH
Confidence 47889999999997544433
No 181
>KOG3060|consensus
Probab=20.70 E-value=1.1e+02 Score=27.71 Aligned_cols=37 Identities=24% Similarity=0.359 Sum_probs=31.3
Q ss_pred HhHhHHhhcCC---hhhHHHHHHHHHHhcC-ChHHHHHHHh
Q psy14697 118 QLGETLLAAGD---LDNGVEHLANALTVCG-QPNQLLGVLQ 154 (175)
Q Consensus 118 ~lGE~L~a~G~---~~eaa~hf~nAl~Vcp-qP~~LL~IyQ 154 (175)
.++|.+-.+|. ++-+-.|+.+||+.+| +-..|.+||-
T Consensus 193 rlae~~Yt~gg~eN~~~arkyy~~alkl~~~~~ral~GI~l 233 (289)
T KOG3060|consen 193 RLAEVLYTQGGAENLELARKYYERALKLNPKNLRALFGIYL 233 (289)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHhChHhHHHHHHHHH
Confidence 67898888874 6678889999999999 8888888874
No 182
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=20.54 E-value=96 Score=26.12 Aligned_cols=30 Identities=20% Similarity=0.381 Sum_probs=26.4
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQP 146 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP 146 (175)
++++|++...|++++|+..|-.+...|.+.
T Consensus 182 ~~~A~ey~~~g~~~~A~~~l~~~~~~yr~e 211 (247)
T PF11817_consen 182 LEMAEEYFRLGDYDKALKLLEPAASSYRRE 211 (247)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHhC
Confidence 689999999999999999999998877653
No 183
>PF02158 Neuregulin: Neuregulin family; InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers. The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission. The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=20.43 E-value=34 Score=32.30 Aligned_cols=33 Identities=24% Similarity=0.365 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhhhh-heeccCCCCChhhHHHhHHHHHHh
Q psy14697 51 TAVAVGAAVVVGYC-LYFDKKRRSDPLFKEKLKERRRKN 88 (175)
Q Consensus 51 ~~agvaga~flGYc-IYFD~KRR~dP~FRKkLrerRrk~ 88 (175)
|+.|+..++++.-+ .||--|++ ||||+++.|+.
T Consensus 15 IcvaLlVVGi~Cvv~aYCKTKKQ-----Rkklh~hLkqs 48 (404)
T PF02158_consen 15 ICVALLVVGIVCVVDAYCKTKKQ-----RKKLHEHLKQS 48 (404)
T ss_dssp ---------------------------------------
T ss_pred hhHHHHHHHHHHHHHHHHHhHHH-----HHHHHHHHhhh
Confidence 44444445556666 78888877 78888765543
No 184
>PRK12370 invasion protein regulator; Provisional
Probab=20.42 E-value=1.5e+02 Score=27.72 Aligned_cols=35 Identities=11% Similarity=-0.095 Sum_probs=29.7
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLG 151 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~ 151 (175)
..+|..+...|++++|..+|-+|+...|+..+...
T Consensus 342 ~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~ 376 (553)
T PRK12370 342 GLLGLINTIHSEYIVGSLLFKQANLLSPISADIKY 376 (553)
T ss_pred HHHHHHHHHccCHHHHHHHHHHHHHhCCCCHHHHH
Confidence 46788888999999999999999999998766543
No 185
>PF08690 GET2: GET complex subunit GET2; InterPro: IPR014802 This family corresponds to the GET complex subunit GET2. The GET complex is involved in the retrieval of ER resident proteins from the Golgi []. ; PDB: 3SJD_D 3ZS9_C.
Probab=20.25 E-value=69 Score=28.74 Aligned_cols=13 Identities=31% Similarity=0.618 Sum_probs=8.8
Q ss_pred HHHhHHHHHHhHH
Q psy14697 78 KEKLKERRRKNRE 90 (175)
Q Consensus 78 RKkLrerRrk~~~ 90 (175)
|||||++||+++-
T Consensus 7 krRLrRERReAKi 19 (302)
T PF08690_consen 7 KRRLRRERREAKI 19 (302)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 6778777766653
No 186
>KOG4162|consensus
Probab=20.12 E-value=1.4e+02 Score=30.74 Aligned_cols=27 Identities=30% Similarity=0.373 Sum_probs=23.9
Q ss_pred HHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697 117 IQLGETLLAAGDLDNGVEHLANALTVC 143 (175)
Q Consensus 117 V~lGE~L~a~G~~~eaa~hf~nAl~Vc 143 (175)
-++||.+-++|+...|+++|.-|+..=
T Consensus 758 ~~LG~v~k~~Gd~~~Aaecf~aa~qLe 784 (799)
T KOG4162|consen 758 YYLGEVFKKLGDSKQAAECFQAALQLE 784 (799)
T ss_pred HHHHHHHHHccchHHHHHHHHHHHhhc
Confidence 368999999999999999999998753
No 187
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=20.04 E-value=1.4e+02 Score=24.07 Aligned_cols=34 Identities=9% Similarity=0.223 Sum_probs=29.6
Q ss_pred HHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697 115 QEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ 148 (175)
Q Consensus 115 ~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~ 148 (175)
-|+.+|.....++++++|+.-+-+-|+..|+-.+
T Consensus 49 AqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~ 82 (142)
T PF13512_consen 49 AQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPN 82 (142)
T ss_pred HHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCC
Confidence 3688999999999999999999999999987543
Done!