Query psy5551
Match_columns 172
No_of_seqs 114 out of 254
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 20:26:28 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5551.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5551hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00777 Mad3_BUB1_I Mad3/BU 100.0 2.5E-54 5.3E-59 331.6 14.1 123 34-156 2-125 (125)
2 PF08311 Mad3_BUB1_I: Mad3/BUB 100.0 4.1E-51 8.9E-56 313.0 13.8 125 33-157 1-126 (126)
3 KOG1166|consensus 100.0 4.5E-31 9.8E-36 254.7 12.8 139 33-171 14-155 (974)
4 PF15297 CKAP2_C: Cytoskeleton 96.8 0.0067 1.4E-07 54.1 8.5 58 113-170 119-180 (353)
5 PF05843 Suf: Suppressor of fo 96.6 0.018 3.9E-07 49.1 9.1 122 20-159 7-136 (280)
6 KOG1915|consensus 96.4 0.0094 2E-07 55.7 6.9 107 36-159 58-170 (677)
7 KOG1258|consensus 95.0 0.18 3.9E-06 47.8 9.3 113 35-160 273-396 (577)
8 KOG2396|consensus 94.5 0.28 6.1E-06 46.0 9.2 120 31-163 31-173 (568)
9 KOG1915|consensus 94.5 0.41 8.8E-06 45.1 10.2 120 36-162 307-436 (677)
10 KOG2047|consensus 94.2 0.13 2.9E-06 49.6 6.6 127 31-163 401-544 (835)
11 PF05843 Suf: Suppressor of fo 93.1 0.17 3.7E-06 43.1 4.9 58 100-159 35-99 (280)
12 KOG1070|consensus 92.2 1.3 2.8E-05 46.2 10.4 119 30-160 1471-1594(1710)
13 PF07719 TPR_2: Tetratricopept 90.1 1 2.2E-05 25.2 4.6 31 130-160 1-31 (34)
14 PF13429 TPR_15: Tetratricopep 89.3 1.6 3.5E-05 36.2 7.0 95 56-162 81-178 (280)
15 PF13428 TPR_14: Tetratricopep 89.1 0.94 2E-05 27.6 4.2 31 132-162 3-33 (44)
16 PF13181 TPR_8: Tetratricopept 88.6 1.4 3.1E-05 24.7 4.5 30 130-159 1-30 (34)
17 COG5191 Uncharacterized conser 86.7 1 2.2E-05 40.5 4.4 119 31-162 31-174 (435)
18 smart00028 TPR Tetratricopepti 85.3 1.9 4.1E-05 21.8 3.5 29 131-159 2-30 (34)
19 PF02631 RecX: RecX family; I 84.6 9.6 0.00021 28.2 8.3 76 78-160 9-86 (121)
20 PF12895 Apc3: Anaphase-promot 84.4 6.7 0.00015 26.6 6.9 72 77-155 9-83 (84)
21 TIGR02521 type_IV_pilW type IV 84.2 10 0.00022 28.5 8.5 45 113-159 154-198 (234)
22 KOG2047|consensus 84.0 15 0.00032 36.0 11.0 55 112-167 568-622 (835)
23 PF13176 TPR_7: Tetratricopept 83.9 1.1 2.5E-05 26.2 2.4 25 133-157 2-26 (36)
24 PF13414 TPR_11: TPR repeat; P 82.1 2.8 6E-05 27.1 4.0 32 129-160 2-33 (69)
25 cd02682 MIT_AAA_Arch MIT: doma 81.9 3.7 8E-05 29.0 4.7 32 128-159 4-35 (75)
26 PRK14136 recX recombination re 81.3 9.4 0.0002 33.8 8.0 74 79-157 195-268 (309)
27 PF13432 TPR_16: Tetratricopep 80.0 11 0.00023 24.0 6.3 49 110-160 13-61 (65)
28 KOG0495|consensus 79.9 4.9 0.00011 39.4 6.2 93 53-162 651-751 (913)
29 PF00515 TPR_1: Tetratricopept 79.2 6.9 0.00015 21.9 4.6 31 130-160 1-31 (34)
30 PF08424 NRDE-2: NRDE-2, neces 78.7 43 0.00094 29.0 11.6 113 33-160 46-184 (321)
31 PF04212 MIT: MIT (microtubule 77.0 6.7 0.00015 26.2 4.7 32 128-159 3-34 (69)
32 smart00386 HAT HAT (Half-A-TPR 76.4 6.7 0.00014 21.0 3.9 30 33-64 3-32 (33)
33 PF13374 TPR_10: Tetratricopep 76.2 8.9 0.00019 21.8 4.6 30 130-159 2-31 (42)
34 PF13371 TPR_9: Tetratricopept 74.6 8.7 0.00019 24.9 4.7 54 113-168 14-67 (73)
35 PF13424 TPR_12: Tetratricopep 74.3 9.4 0.0002 25.2 4.9 32 128-159 3-34 (78)
36 KOG1914|consensus 73.4 5.4 0.00012 38.2 4.5 97 53-160 237-361 (656)
37 PF13424 TPR_12: Tetratricopep 71.5 9.3 0.0002 25.2 4.3 31 128-158 44-74 (78)
38 TIGR02795 tol_pal_ybgF tol-pal 70.6 31 0.00068 23.6 8.2 53 111-163 56-109 (119)
39 smart00745 MIT Microtubule Int 70.3 11 0.00023 25.6 4.5 32 128-159 6-37 (77)
40 cd02683 MIT_1 MIT: domain cont 70.3 11 0.00024 26.3 4.6 33 127-159 3-35 (77)
41 PRK15359 type III secretion sy 70.1 13 0.00029 28.1 5.4 65 95-161 53-123 (144)
42 cd00189 TPR Tetratricopeptide 67.8 25 0.00055 21.5 7.0 48 111-160 51-98 (100)
43 KOG2366|consensus 66.8 2.8 6E-05 38.3 1.1 24 1-24 190-214 (414)
44 cd02681 MIT_calpain7_1 MIT: do 65.5 17 0.00037 25.5 4.7 32 128-159 4-35 (76)
45 cd02656 MIT MIT: domain contai 63.7 18 0.00038 24.5 4.5 31 128-158 4-34 (75)
46 PRK14134 recX recombination re 63.5 17 0.00037 31.4 5.4 54 77-131 92-145 (283)
47 PF05596 Taeniidae_ag: Taeniid 63.0 19 0.00041 24.7 4.4 46 114-159 17-63 (64)
48 PF03012 PP_M1: Phosphoprotein 60.4 8.1 0.00017 33.8 2.7 45 41-107 78-122 (298)
49 PRK15179 Vi polysaccharide bio 60.1 47 0.001 32.4 8.2 77 75-162 104-186 (694)
50 TIGR02552 LcrH_SycD type III s 59.7 43 0.00093 23.9 6.3 50 111-162 68-117 (135)
51 cd02678 MIT_VPS4 MIT: domain c 59.7 23 0.00049 24.3 4.5 31 129-159 5-35 (75)
52 PF14559 TPR_19: Tetratricopep 59.4 13 0.00028 23.7 3.0 49 112-162 9-57 (68)
53 smart00299 CLH Clathrin heavy 59.3 35 0.00075 25.2 5.8 25 132-156 84-108 (140)
54 PRK02603 photosystem I assembl 58.8 31 0.00067 26.5 5.6 51 111-161 52-103 (172)
55 COG5107 RNA14 Pre-mRNA 3'-end 56.9 34 0.00073 32.5 6.3 120 33-168 58-199 (660)
56 KOG1840|consensus 56.8 41 0.0009 31.7 7.0 46 113-160 351-397 (508)
57 PRK15179 Vi polysaccharide bio 56.8 72 0.0016 31.1 8.8 57 108-166 168-224 (694)
58 PRK00117 recX recombination re 55.9 41 0.00089 25.9 5.9 53 78-131 43-96 (157)
59 TIGR02552 LcrH_SycD type III s 54.1 39 0.00085 24.2 5.2 51 110-162 33-83 (135)
60 PF13174 TPR_6: Tetratricopept 53.1 24 0.00052 19.0 3.1 29 130-160 2-30 (33)
61 PRK14135 recX recombination re 52.5 43 0.00093 28.0 5.8 54 77-131 88-143 (263)
62 COG2956 Predicted N-acetylgluc 52.5 26 0.00055 31.8 4.6 45 118-162 168-212 (389)
63 cd02679 MIT_spastin MIT: domai 52.2 32 0.00068 24.4 4.2 33 127-159 5-37 (79)
64 KOG1166|consensus 51.8 69 0.0015 32.7 8.0 104 46-163 8-111 (974)
65 PF12688 TPR_5: Tetratrico pep 50.4 98 0.0021 23.2 7.0 49 112-160 19-68 (120)
66 PF13432 TPR_16: Tetratricopep 50.4 24 0.00052 22.3 3.2 26 135-160 2-27 (65)
67 TIGR00756 PPR pentatricopeptid 50.3 39 0.00084 18.0 4.5 27 137-163 7-33 (35)
68 PF02259 FAT: FAT domain; Int 49.4 1.5E+02 0.0033 24.6 10.4 95 39-158 110-212 (352)
69 PF13414 TPR_11: TPR repeat; P 48.6 41 0.00088 21.4 4.1 45 113-159 22-67 (69)
70 TIGR02917 PEP_TPR_lipo putativ 48.4 1.8E+02 0.0039 26.8 9.7 66 96-163 665-736 (899)
71 PRK11788 tetratricopeptide rep 47.2 1.3E+02 0.0029 25.4 8.2 49 112-160 159-210 (389)
72 PF09976 TPR_21: Tetratricopep 44.6 1.3E+02 0.0028 22.4 12.2 47 111-157 99-145 (145)
73 cd02684 MIT_2 MIT: domain cont 44.2 51 0.0011 22.8 4.3 29 130-158 6-34 (75)
74 cd02677 MIT_SNX15 MIT: domain 44.0 46 0.001 23.1 4.0 25 134-158 10-34 (75)
75 PF13041 PPR_2: PPR repeat fam 43.9 69 0.0015 19.4 4.5 33 133-165 6-38 (50)
76 KOG1914|consensus 43.8 1.9E+02 0.0041 28.1 9.1 119 34-164 310-436 (656)
77 KOG2002|consensus 43.3 1.4E+02 0.003 30.6 8.5 101 50-159 38-193 (1018)
78 PF13812 PPR_3: Pentatricopept 43.3 54 0.0012 17.7 4.4 31 133-163 4-34 (34)
79 PRK15359 type III secretion sy 43.1 85 0.0018 23.6 5.8 51 110-162 40-90 (144)
80 KOG3617|consensus 42.8 33 0.00071 35.0 4.1 28 131-158 913-940 (1416)
81 TIGR02521 type_IV_pilW type IV 42.7 1.4E+02 0.003 22.2 7.9 53 110-162 115-167 (234)
82 PRK14720 transcript cleavage f 42.2 1E+02 0.0022 31.3 7.5 81 77-160 169-253 (906)
83 PRK15363 pathogenicity island 41.7 57 0.0012 26.1 4.7 67 96-164 65-138 (157)
84 COG2137 OraA Uncharacterized p 41.7 70 0.0015 25.9 5.3 49 76-129 54-103 (174)
85 PF02330 MAM33: Mitochondrial 40.1 48 0.001 26.8 4.2 31 113-143 161-191 (204)
86 PRK10747 putative protoheme IX 39.2 1.8E+02 0.004 25.6 8.0 46 112-160 346-391 (398)
87 KOG2076|consensus 39.1 1.4E+02 0.003 30.3 7.7 113 47-165 103-242 (895)
88 PRK02603 photosystem I assembl 38.2 1.8E+02 0.0039 22.2 7.7 45 113-159 91-135 (172)
89 PF12663 DUF3788: Protein of u 38.0 30 0.00065 26.6 2.5 25 43-68 10-34 (133)
90 TIGR02795 tol_pal_ybgF tol-pal 37.7 1.3E+02 0.0028 20.4 5.9 51 112-162 20-71 (119)
91 PLN03218 maturation of RBCL 1; 37.4 2.5E+02 0.0053 29.0 9.4 52 111-163 631-682 (1060)
92 PRK14137 recX recombination re 37.4 86 0.0019 25.7 5.2 74 78-157 72-145 (195)
93 cd02680 MIT_calpain7_2 MIT: do 36.8 65 0.0014 22.6 3.8 31 129-159 5-35 (75)
94 KOG0495|consensus 36.8 53 0.0012 32.6 4.4 60 101-160 652-715 (913)
95 PF12854 PPR_1: PPR repeat 35.7 79 0.0017 18.1 3.6 24 132-155 9-32 (34)
96 PRK11189 lipoprotein NlpI; Pro 35.7 2.7E+02 0.0058 23.5 8.8 36 125-160 231-266 (296)
97 KOG1070|consensus 35.6 2.3E+02 0.0049 30.7 8.8 104 51-158 1562-1699(1710)
98 PRK10803 tol-pal system protei 35.5 2.8E+02 0.006 23.6 8.4 83 77-163 163-250 (263)
99 PRK09857 putative transposase; 35.3 2.5E+02 0.0055 24.2 8.1 83 75-162 187-272 (292)
100 PLN03088 SGT1, suppressor of 35.3 2.2E+02 0.0047 25.0 7.8 48 111-160 53-100 (356)
101 PRK11788 tetratricopeptide rep 35.2 2.8E+02 0.006 23.5 11.7 27 133-159 252-278 (389)
102 KOG0890|consensus 34.9 75 0.0016 35.4 5.5 63 96-159 1666-1731(2382)
103 PF08424 NRDE-2: NRDE-2, neces 34.7 3E+02 0.0065 23.8 13.1 111 36-159 4-131 (321)
104 PF07721 TPR_4: Tetratricopept 34.6 77 0.0017 17.0 3.2 22 133-154 4-25 (26)
105 PRK10370 formate-dependent nit 34.1 1.3E+02 0.0029 24.0 5.9 64 95-160 102-174 (198)
106 PF14938 SNAP: Soluble NSF att 33.9 93 0.002 26.1 5.1 44 112-158 99-143 (282)
107 PRK15331 chaperone protein Sic 33.7 89 0.0019 25.2 4.7 46 112-159 89-134 (165)
108 TIGR02917 PEP_TPR_lipo putativ 33.1 1.5E+02 0.0032 27.3 6.6 46 113-160 822-867 (899)
109 CHL00033 ycf3 photosystem I as 32.4 1.7E+02 0.0036 22.2 5.9 51 112-162 53-104 (168)
110 PLN03218 maturation of RBCL 1; 32.1 1.4E+02 0.003 30.7 6.8 73 91-163 365-470 (1060)
111 PF04733 Coatomer_E: Coatomer 31.9 3.2E+02 0.007 23.4 8.2 84 77-165 147-236 (290)
112 PLN03081 pentatricopeptide (PP 31.2 3.1E+02 0.0066 26.0 8.6 53 110-163 306-358 (697)
113 KOG1258|consensus 30.6 64 0.0014 31.0 3.9 33 129-161 78-110 (577)
114 PF12895 Apc3: Anaphase-promot 30.4 93 0.002 20.7 3.8 43 113-155 8-50 (84)
115 PF09005 DUF1897: Domain of un 29.8 37 0.00081 21.0 1.5 25 130-154 9-33 (38)
116 PF02184 HAT: HAT (Half-A-TPR) 29.2 1E+02 0.0022 18.3 3.2 26 145-170 2-27 (32)
117 KOG2076|consensus 28.9 2.7E+02 0.0059 28.3 7.9 99 47-167 677-778 (895)
118 TIGR00990 3a0801s09 mitochondr 28.7 4.5E+02 0.0097 24.5 9.1 47 112-160 526-572 (615)
119 PF13431 TPR_17: Tetratricopep 28.4 65 0.0014 18.5 2.3 22 129-150 12-33 (34)
120 PF04911 ATP-synt_J: ATP synth 28.3 34 0.00073 22.8 1.2 18 82-99 26-43 (54)
121 PF10602 RPN7: 26S proteasome 28.0 1.6E+02 0.0034 23.4 5.2 46 121-166 27-72 (177)
122 PHA01810 hypothetical protein 27.7 33 0.00071 24.7 1.1 13 47-59 76-88 (100)
123 PF01535 PPR: PPR repeat; Int 27.1 1E+02 0.0022 16.0 3.0 22 136-157 6-27 (31)
124 PRK11447 cellulose synthase su 26.9 4.6E+02 0.01 26.8 9.4 51 110-162 285-335 (1157)
125 TIGR00540 hemY_coli hemY prote 25.8 4.6E+02 0.01 23.1 11.3 77 75-158 317-398 (409)
126 PF07720 TPR_3: Tetratricopept 25.8 1.5E+02 0.0033 17.5 4.1 23 132-154 3-25 (36)
127 PF09976 TPR_21: Tetratricopep 25.7 2.8E+02 0.006 20.5 6.5 80 75-155 29-110 (145)
128 TIGR00990 3a0801s09 mitochondr 24.9 2.4E+02 0.0052 26.3 6.6 47 113-159 313-360 (615)
129 PF13429 TPR_15: Tetratricopep 24.5 3.9E+02 0.0085 21.8 9.1 106 51-167 144-252 (280)
130 PF08631 SPO22: Meiosis protei 24.3 4.1E+02 0.0089 22.3 7.4 109 37-160 36-151 (278)
131 PF10366 Vps39_1: Vacuolar sor 24.3 1.7E+02 0.0037 21.5 4.5 40 112-158 28-67 (108)
132 PF08328 ASL_C: Adenylosuccina 23.7 39 0.00085 25.8 0.9 38 122-159 13-50 (115)
133 PF12569 NARP1: NMDA receptor- 22.7 2.8E+02 0.0061 26.1 6.6 54 110-163 17-71 (517)
134 smart00648 SWAP Suppressor-of- 22.4 1.3E+02 0.0028 19.1 3.1 33 35-67 13-52 (54)
135 PF13311 DUF4080: Protein of u 22.2 4.2E+02 0.009 21.3 7.1 74 77-156 30-112 (190)
136 cd06224 REM Guanine nucleotide 22.2 2E+02 0.0044 20.4 4.5 41 77-122 3-43 (122)
137 PF04910 Tcf25: Transcriptiona 22.0 3.5E+02 0.0077 24.0 6.8 61 93-158 4-68 (360)
138 PF08640 U3_assoc_6: U3 small 21.9 1.6E+02 0.0034 20.9 3.7 31 31-63 23-53 (83)
139 KOG1586|consensus 21.9 97 0.0021 27.1 3.1 42 112-156 98-140 (288)
140 PF00618 RasGEF_N: RasGEF N-te 21.7 1.4E+02 0.003 20.9 3.5 44 75-123 6-49 (104)
141 PF12843 DUF3820: Protein of u 20.9 1.6E+02 0.0034 20.5 3.4 26 56-85 28-54 (67)
No 1
>smart00777 Mad3_BUB1_I Mad3/BUB1 hoMad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of the binding of BUB1 and MAD3 to CDC20p.
Probab=100.00 E-value=2.5e-54 Score=331.56 Aligned_cols=123 Identities=40% Similarity=0.833 Sum_probs=120.2
Q ss_pred HHHHHHHHHHH-hcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCC
Q psy5551 34 CLSSREFELQI-RTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPN 112 (172)
Q Consensus 34 ~~~r~~fE~~i-~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~d 112 (172)
+++|++||++| +.++|||||++|++||+|++++||+|+++|+|.++|||||++|+++++||||+|||||||+||++++|
T Consensus 2 ~~~r~~~e~~i~~~~~~dDPL~~w~~yI~W~~~~~p~g~~~s~L~~lLerc~~~f~~~~~YknD~RyLkiWi~ya~~~~d 81 (125)
T smart00777 2 EQQRQAFEQELQDLYEGDDPLDLWLRYIKWTEENYPQGGKESGLLTLLERCIRYFEDDERYKNDPRYLKIWLKYADNCDE 81 (125)
T ss_pred hHHHHHHHHHHHhcccCCCChHHHHHHHHHHHHhCCCCCchhhHHHHHHHHHHHhhhhhhhcCCHHHHHHHHHHHHhcCC
Confidence 46799999999 66999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLG 156 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~G 156 (172)
|.+||+||+++|||+++|+||++||.++|.+|++++|++||++|
T Consensus 82 p~~if~~L~~~~IG~~~AlfYe~~A~~lE~~g~~~~A~~iy~~G 125 (125)
T smart00777 82 PRELFQFLYSKGIGTKLALFYEEWAQLLEAAGRYKKADEVYQLG 125 (125)
T ss_pred HHHHHHHHHHCCcchhhHHHHHHHHHHHHHcCCHHHHHHHHHcc
Confidence 99999999999999999999999999999999999999999998
No 2
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=100.00 E-value=4.1e-51 Score=313.03 Aligned_cols=125 Identities=42% Similarity=0.872 Sum_probs=119.0
Q ss_pred HHHHHHHHHHHHhcCC-CCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCC
Q psy5551 33 ICLSSREFELQIRTYD-GPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQP 111 (172)
Q Consensus 33 l~~~r~~fE~~i~~~~-~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~ 111 (172)
|+++|++||++|++++ |||||++|++||+|++++||+++.+++|.++|||||++|+++++|+||+|||+|||+||++++
T Consensus 1 ~~~~r~~~e~~i~~~~~~dDPL~~w~~yI~w~~~~~p~~~~~~~L~~lLer~~~~f~~~~~Y~nD~RylkiWi~ya~~~~ 80 (126)
T PF08311_consen 1 LEQQRQEFEEQIRSYEEGDDPLDPWLRYIKWIEENYPSGGKQSGLLELLERCIRKFKDDERYKNDERYLKIWIKYADLSS 80 (126)
T ss_dssp -HHHHHHHHHHHHCCGGSS-CHHHHHHHHHHHHHHCTTCCCCHHHHHHHHHHHHHHTTSGGGTT-HHHHHHHHHHHTTBS
T ss_pred CHHHHHHHHHHHHHccCCCCChHHHHHHHHHHHHHCCCCCchhHHHHHHHHHHHHHhhhHhhcCCHHHHHHHHHHHHHcc
Confidence 5789999999999998 999999999999999999999888899999999999999999999999999999999999999
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGL 157 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi 157 (172)
||.+||+||+++|||+++|+||++||.++|..|++++|++||++||
T Consensus 81 ~~~~if~~l~~~~IG~~~A~fY~~wA~~le~~~~~~~A~~I~~~Gi 126 (126)
T PF08311_consen 81 DPREIFKFLYSKGIGTKLALFYEEWAEFLEKRGNFKKADEIYQLGI 126 (126)
T ss_dssp HHHHHHHHHHHHTTSTTBHHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCccHHHHHHHHHHHHHHHHcCCHHHHHHHHHhhC
Confidence 9999999999999999999999999999999999999999999998
No 3
>KOG1166|consensus
Probab=99.97 E-value=4.5e-31 Score=254.65 Aligned_cols=139 Identities=28% Similarity=0.549 Sum_probs=133.0
Q ss_pred HHHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCC-CCChHHHHHHHHHHhhcccccccchhHHHHHHH--Hhhc
Q psy5551 33 ICLSSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGP-ETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIK--YIGT 109 (172)
Q Consensus 33 l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~-~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~--Ya~~ 109 (172)
+....+.++..+++|.++|||++|++||.|++++||+|+. .+.+..+||||+..+.+.++|+||+||+++|+. .++.
T Consensus 14 ~~n~eq~li~el~~~~~~DPl~~w~ryi~wv~~~~~~~~~~~~~l~~~lerc~~~~~~lk~Y~nD~Rfl~~~~~~~~~e~ 93 (974)
T KOG1166|consen 14 PLNYEQRLIYELESYAGNDPLDKWLRYIEWVLEVYPEGKENQSLLRNLLERCLEELEDLKRYRNDPRFLILWCSLELREE 93 (974)
T ss_pred HHHHHHHHHHHHHhhcCCCchhhhHhHhhhhhhccccCCchhhhHHHHHHHHHHhccchhhccccHHHHHHHHhHHHHHH
Confidence 4556777888889999999999999999999999999987 889999999999999999999999999999995 8899
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchHHHHHhc
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIEDLEAAQ 171 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~~L~~~h 171 (172)
+.|+.++|.||+++|||+.+|+||++||.++|.++.+++|.+||++||+++|+|.++|+++|
T Consensus 94 ~~d~~d~f~~m~~kgIg~~lalfYe~~a~~lE~k~~~keA~~v~q~Giq~~aeP~~rL~~~~ 155 (974)
T KOG1166|consen 94 LQDAEDFFSYLENKGIGTTLALFYEAYAKHLERKEYFKEAKEVFQLGIQNKAEPLERLLRQY 155 (974)
T ss_pred HhhHHHHHHHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999986
No 4
>PF15297 CKAP2_C: Cytoskeleton-associated protein 2 C-terminus
Probab=96.81 E-value=0.0067 Score=54.14 Aligned_cols=58 Identities=22% Similarity=0.279 Sum_probs=49.6
Q ss_pred HHHHHHHHH----HCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchHHHHHh
Q psy5551 113 PLEIYNLVY----SENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIEDLEAA 170 (172)
Q Consensus 113 p~~if~~L~----~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~~L~~~ 170 (172)
+.+|+.-|- +-=-..+||-||+.+|.+++..|.+++...||..+|..+|+|++.|+..
T Consensus 119 ~eei~~~L~~li~~IP~A~K~aKYWIC~Arl~~~~~~~e~vi~iyEeAi~agAqPieElR~~ 180 (353)
T PF15297_consen 119 KEEILATLSDLIKNIPDAKKLAKYWICLARLEPRTGPIEDVIAIYEEAILAGAQPIEELRHV 180 (353)
T ss_pred HHHHHHHHHHHHhcCchHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHcCCChHHHHHHH
Confidence 456666665 2223689999999999999999999999999999999999999999763
No 5
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=96.56 E-value=0.018 Score=49.11 Aligned_cols=122 Identities=13% Similarity=0.168 Sum_probs=84.2
Q ss_pred hhhHHHHhhhHHHHHHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhH
Q psy5551 20 LINIEILTYLFTAICLSSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRY 99 (172)
Q Consensus 20 ~~~~~~~~~~~~~l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~Ry 99 (172)
++|+.- ....++..|.-|+..+.. +.-.-++|..|-.+ | |-.++....=..++|+.++.|.++.
T Consensus 7 ~m~~~~---r~~g~~~aR~vF~~a~~~--~~~~~~vy~~~A~~-E--~~~~~d~~~A~~Ife~glk~f~~~~-------- 70 (280)
T PF05843_consen 7 YMRFMR---RTEGIEAARKVFKRARKD--KRCTYHVYVAYALM-E--YYCNKDPKRARKIFERGLKKFPSDP-------- 70 (280)
T ss_dssp HHHHHH---HHHHHHHHHHHHHHHHCC--CCS-THHHHHHHHH-H--HHTCS-HHHHHHHHHHHHHHHTT-H--------
T ss_pred HHHHHH---HhCChHHHHHHHHHHHcC--CCCCHHHHHHHHHH-H--HHhCCCHHHHHHHHHHHHHHCCCCH--------
Confidence 445544 235588899999999732 22233566555443 2 2222222235899999999877763
Q ss_pred HHHHHHHhhcC------CCHHHHHHHHHHCCcchHH--HHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 100 VSLVIKYIGTQ------PNPLEIYNLVYSENIGTKL--AMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 100 LkiWi~Ya~~~------~dp~~if~~L~~~~IG~~~--AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.+|+.|+++. ++.+.+|+-..++ ++..- -.+|..|..+-...|+...+.+|+++-.+.
T Consensus 71 -~~~~~Y~~~l~~~~d~~~aR~lfer~i~~-l~~~~~~~~iw~~~i~fE~~~Gdl~~v~~v~~R~~~~ 136 (280)
T PF05843_consen 71 -DFWLEYLDFLIKLNDINNARALFERAISS-LPKEKQSKKIWKKFIEFESKYGDLESVRKVEKRAEEL 136 (280)
T ss_dssp -HHHHHHHHHHHHTT-HHHHHHHHHHHCCT-SSCHHHCHHHHHHHHHHHHHHS-HHHHHHHHHHHHHH
T ss_pred -HHHHHHHHHHHHhCcHHHHHHHHHHHHHh-cCchhHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 5699999873 3489999999887 55554 789999999999999999999999887664
No 6
>KOG1915|consensus
Probab=96.41 E-value=0.0094 Score=55.69 Aligned_cols=107 Identities=21% Similarity=0.292 Sum_probs=80.7
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhc------
Q psy5551 36 SSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT------ 109 (172)
Q Consensus 36 ~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~------ 109 (172)
.|+.||..|+--.. -+..|.+|-+|=++ ++.. ..--.+.||++. .|-|++.+||+||++
T Consensus 58 kRkefEd~irrnR~--~~~~WikYaqwEes---q~e~-~RARSv~ERALd---------vd~r~itLWlkYae~Emknk~ 122 (677)
T KOG1915|consen 58 KRKEFEDQIRRNRL--NMQVWIKYAQWEES---QKEI-QRARSVFERALD---------VDYRNITLWLKYAEFEMKNKQ 122 (677)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHh---HHHH-HHHHHHHHHHHh---------cccccchHHHHHHHHHHhhhh
Confidence 46689999965332 26799999999763 2221 124667888874 588999999999986
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
+++++.|+.--.. |--.+-.||-.|-..-|..|+..-|.+||.+=+.-
T Consensus 123 vNhARNv~dRAvt--~lPRVdqlWyKY~ymEE~LgNi~gaRqiferW~~w 170 (677)
T KOG1915|consen 123 VNHARNVWDRAVT--ILPRVDQLWYKYIYMEEMLGNIAGARQIFERWMEW 170 (677)
T ss_pred HhHHHHHHHHHHH--hcchHHHHHHHHHHHHHHhcccHHHHHHHHHHHcC
Confidence 4678888875543 45667789999999999999999999999876654
No 7
>KOG1258|consensus
Probab=94.96 E-value=0.18 Score=47.80 Aligned_cols=113 Identities=19% Similarity=0.185 Sum_probs=78.5
Q ss_pred HHHHHHHHHHhc-CCCCCC-----hHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhh
Q psy5551 35 LSSREFELQIRT-YDGPDP-----LQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIG 108 (172)
Q Consensus 35 ~~r~~fE~~i~~-~~~dDP-----L~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~ 108 (172)
..|-.||+.|.. |..-+| |..|..|....+. .|..+.+.-+.|||+... .+|..+||+|+.
T Consensus 273 ~kr~~fE~~IkrpYfhvkpl~~aql~nw~~yLdf~i~----~g~~~~~~~l~ercli~c---------A~Y~efWiky~~ 339 (577)
T KOG1258|consen 273 EKRWGFEEGIKRPYFHVKPLDQAQLKNWRYYLDFEIT----LGDFSRVFILFERCLIPC---------ALYDEFWIKYAR 339 (577)
T ss_pred HHHHhhhhhccccccccCcccHHHHHHHHHHhhhhhh----cccHHHHHHHHHHHHhHH---------hhhHHHHHHHHH
Confidence 456679999976 433333 4568888877652 344567899999999875 579999999999
Q ss_pred cCCC--HHHHH--HHHHHCCcchHHH-HHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 109 TQPN--PLEIY--NLVYSENIGTKLA-MFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 109 ~~~d--p~~if--~~L~~~~IG~~~A-lfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
.+.. -.++= .......|-.+-+ ..-.-||.+-|..|++..|..+|+.=++..
T Consensus 340 ~m~~~~~~~~~~~~~~~~~~i~~k~~~~i~L~~a~f~e~~~n~~~A~~~lq~i~~e~ 396 (577)
T KOG1258|consen 340 WMESSGDVSLANNVLARACKIHVKKTPIIHLLEARFEESNGNFDDAKVILQRIESEY 396 (577)
T ss_pred HHHHcCchhHHHHHHHhhhhhcCCCCcHHHHHHHHHHHhhccHHHHHHHHHHHHhhC
Confidence 8632 12222 2334445544433 334567889999999999999999877665
No 8
>KOG2396|consensus
Probab=94.46 E-value=0.28 Score=46.02 Aligned_cols=120 Identities=19% Similarity=0.264 Sum_probs=80.4
Q ss_pred HHHHHHHHHHHHHHhcCCCCCChHHHHHHHHHH---HHhCC------CCC-------CCCChHHHHHHHHHHhhcccccc
Q psy5551 31 TAICLSSREFELQIRTYDGPDPLQLRFDYVQWL---EQSYP------CLG-------PETNIIPFLEETLVAFKNIEQYK 94 (172)
Q Consensus 31 ~~l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~---~~~yp------~~~-------~~s~l~~lLErc~~~f~~~e~Y~ 94 (172)
.++-.+|+.||.+|+.... -+.-+++||+.= ++.-- ..+ ....+..+..++|.+|..
T Consensus 31 ~~Ivk~Rr~fE~kL~rr~~--~i~Dfi~YI~YE~nl~~lr~kR~Kk~~~k~S~sd~si~~rIv~lyr~at~rf~~----- 103 (568)
T KOG2396|consen 31 REIVKKRRDFELKLQRRTL--SIEDFINYIQYEINLEELRAKRRKKKRVKYSFSDDSIPNRIVFLYRRATNRFNG----- 103 (568)
T ss_pred HHHHHHHHHHHHHHccCcc--cHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHHHHhcCC-----
Confidence 5567789999999976432 256688888762 11100 001 122467777788887766
Q ss_pred cchhHHHHHHHHhhcC---C---CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcC-CHHHHHHHHHHHHHcCCCc
Q psy5551 95 QDPRYVSLVIKYIGTQ---P---NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHN-DIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 95 nD~RyLkiWi~Ya~~~---~---dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~-~~~~A~~I~~~Gi~~~A~P 163 (172)
| +++|..|+.+| . .-..||..|.... ..-+.+|+-=|.++...+ +.+.|.+++.+|++.+.+-
T Consensus 104 -D---~~lW~~yi~f~kk~~~~~~v~ki~~~~l~~H--p~~~dLWI~aA~wefe~n~ni~saRalflrgLR~npds 173 (568)
T KOG2396|consen 104 -D---VKLWLSYIAFCKKKKTYGEVKKIFAAMLAKH--PNNPDLWIYAAKWEFEINLNIESARALFLRGLRFNPDS 173 (568)
T ss_pred -C---HHHHHHHHHHHHHhcchhHHHHHHHHHHHhC--CCCchhHHhhhhhHHhhccchHHHHHHHHHHhhcCCCC
Confidence 4 46777777765 2 3577888887654 345667777777666555 4999999999999988653
No 9
>KOG1915|consensus
Probab=94.46 E-value=0.41 Score=45.11 Aligned_cols=120 Identities=15% Similarity=0.186 Sum_probs=86.7
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhc------
Q psy5551 36 SSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT------ 109 (172)
Q Consensus 36 ~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~------ 109 (172)
.|-+||..+.....+ -|.|.+|++-++.. |. ...+.++.||+|..---...=+-=-||+=|||.||=+
T Consensus 307 Rk~qYE~~v~~np~n--YDsWfdylrL~e~~---g~-~~~Ire~yErAIanvpp~~ekr~W~RYIYLWinYalyeEle~e 380 (677)
T KOG1915|consen 307 RKFQYEKEVSKNPYN--YDSWFDYLRLEESV---GD-KDRIRETYERAIANVPPASEKRYWRRYIYLWINYALYEELEAE 380 (677)
T ss_pred hhhHHHHHHHhCCCC--chHHHHHHHHHHhc---CC-HHHHHHHHHHHHccCCchhHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 344788888642111 36899999988852 33 3468899999986544333333346999999999854
Q ss_pred -CCCHHHHHHHHHHCCcchH---HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 110 -QPNPLEIYNLVYSENIGTK---LAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 110 -~~dp~~if~~L~~~~IG~~---~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
+.-.++||+-... =|-++ +|..|.=+|.+.-.+.+...|.+|+-.+|-....
T Consensus 381 d~ertr~vyq~~l~-lIPHkkFtFaKiWlmyA~feIRq~~l~~ARkiLG~AIG~cPK 436 (677)
T KOG1915|consen 381 DVERTRQVYQACLD-LIPHKKFTFAKIWLMYAQFEIRQLNLTGARKILGNAIGKCPK 436 (677)
T ss_pred hHHHHHHHHHHHHh-hcCcccchHHHHHHHHHHHHHHHcccHHHHHHHHHHhccCCc
Confidence 2336899988777 45543 6888999999999999999999998777765543
No 10
>KOG2047|consensus
Probab=94.21 E-value=0.13 Score=49.61 Aligned_cols=127 Identities=15% Similarity=0.236 Sum_probs=89.6
Q ss_pred HHHHHHHHHHHHHHhc-CC-CCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhc--ccccccc-------hhH
Q psy5551 31 TAICLSSREFELQIRT-YD-GPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKN--IEQYKQD-------PRY 99 (172)
Q Consensus 31 ~~l~~~r~~fE~~i~~-~~-~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~--~e~Y~nD-------~Ry 99 (172)
..|...|.-||....- |. .+|=-.+| +.|.+-..-..+- ..-+.+|++++--=.. .+.|.|. .|=
T Consensus 401 ~~l~~aRvifeka~~V~y~~v~dLa~vw---~~waemElrh~~~-~~Al~lm~~A~~vP~~~~~~~yd~~~pvQ~rlhrS 476 (835)
T KOG2047|consen 401 GDLDDARVIFEKATKVPYKTVEDLAEVW---CAWAEMELRHENF-EAALKLMRRATHVPTNPELEYYDNSEPVQARLHRS 476 (835)
T ss_pred CcHHHHHHHHHHhhcCCccchHHHHHHH---HHHHHHHHhhhhH-HHHHHHHHhhhcCCCchhhhhhcCCCcHHHHHHHh
Confidence 5578899999999864 65 45544455 5665432222111 1346688877643222 1334443 367
Q ss_pred HHHHHHHhhcC------CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 100 VSLVIKYIGTQ------PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 100 LkiWi~Ya~~~------~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
||||-.|||+- ...+.+|.-+..-+|.|. ..=+.||.+||...-|.+|-++|.+||.---=|
T Consensus 477 lkiWs~y~DleEs~gtfestk~vYdriidLriaTP--qii~NyAmfLEeh~yfeesFk~YErgI~LFk~p 544 (835)
T KOG2047|consen 477 LKIWSMYADLEESLGTFESTKAVYDRIIDLRIATP--QIIINYAMFLEEHKYFEESFKAYERGISLFKWP 544 (835)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHHHhcCCH--HHHHHHHHHHHhhHHHHHHHHHHHcCCccCCCc
Confidence 99999999983 457999999999999985 456899999999999999999999999754333
No 11
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=93.07 E-value=0.17 Score=43.08 Aligned_cols=58 Identities=16% Similarity=0.242 Sum_probs=40.7
Q ss_pred HHHHHHHhhc----CCC---HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 100 VSLVIKYIGT----QPN---PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 100 LkiWi~Ya~~----~~d---p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
-.||+.||.+ ..| +..||+-.... ...-..|+..|..+|...|+...|..+|++++..
T Consensus 35 ~~vy~~~A~~E~~~~~d~~~A~~Ife~glk~--f~~~~~~~~~Y~~~l~~~~d~~~aR~lfer~i~~ 99 (280)
T PF05843_consen 35 YHVYVAYALMEYYCNKDPKRARKIFERGLKK--FPSDPDFWLEYLDFLIKLNDINNARALFERAISS 99 (280)
T ss_dssp THHHHHHHHHHHHTCS-HHHHHHHHHHHHHH--HTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHCCT
T ss_pred HHHHHHHHHHHHHhCCCHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHhCcHHHHHHHHHHHHHh
Confidence 4778888876 244 57788877743 3344677888888888888888888888888876
No 12
>KOG1070|consensus
Probab=92.22 E-value=1.3 Score=46.22 Aligned_cols=119 Identities=17% Similarity=0.293 Sum_probs=88.4
Q ss_pred HHHHHHHHHHHHHHHhcC---CCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHH
Q psy5551 30 FTAICLSSREFELQIRTY---DGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKY 106 (172)
Q Consensus 30 ~~~l~~~r~~fE~~i~~~---~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Y 106 (172)
.++++..|.-+|+.|.+- +++.-|.+|.-|++- |..|. .+..+..+.||+.++. +-|+ =|+++.=-|
T Consensus 1471 lsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNl-En~yG---~eesl~kVFeRAcqyc---d~~~---V~~~L~~iy 1540 (1710)
T KOG1070|consen 1471 LSEIEKARKIAERALKTINFREEEEKLNIWIAYLNL-ENAYG---TEESLKKVFERACQYC---DAYT---VHLKLLGIY 1540 (1710)
T ss_pred hhhhHHHHHHHHHHhhhCCcchhHHHHHHHHHHHhH-HHhhC---cHHHHHHHHHHHHHhc---chHH---HHHHHHHHH
Confidence 478899999999999874 799999999999995 44454 3446888899888662 2222 244444444
Q ss_pred hhc--CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 107 IGT--QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 107 a~~--~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
... .+++.++|..|..+ -| +.-.-|..||.+|-.+..-+.|..|++++++.-
T Consensus 1541 ~k~ek~~~A~ell~~m~KK-F~-q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~l 1594 (1710)
T KOG1070|consen 1541 EKSEKNDEADELLRLMLKK-FG-QTRKVWIMYADFLLRQNEAEAARELLKRALKSL 1594 (1710)
T ss_pred HHhhcchhHHHHHHHHHHH-hc-chhhHHHHHHHHHhcccHHHHHHHHHHHHHhhc
Confidence 433 45788888888754 45 566678889999999988899999998888764
No 13
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=90.13 E-value=1 Score=25.15 Aligned_cols=31 Identities=13% Similarity=0.104 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
|..|-..|..+...|++++|...|+++++-.
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~ 31 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELD 31 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHC
Confidence 4567778899999999999999999998753
No 14
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=89.30 E-value=1.6 Score=36.20 Aligned_cols=95 Identities=18% Similarity=0.203 Sum_probs=40.8
Q ss_pred HHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCH---HHHHHHHHHCCcchHHHHH
Q psy5551 56 RFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNP---LEIYNLVYSENIGTKLAMF 132 (172)
Q Consensus 56 w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp---~~if~~L~~~~IG~~~Alf 132 (172)
..+++..+...-|. .-..++++..+++ .|++++..++.......+. .+++..+....-...-+.|
T Consensus 81 ~~~l~~l~~~~~~~-----~A~~~~~~~~~~~-------~~~~~l~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 148 (280)
T PF13429_consen 81 YERLIQLLQDGDPE-----EALKLAEKAYERD-------GDPRYLLSALQLYYRLGDYDEAEELLEKLEELPAAPDSARF 148 (280)
T ss_dssp -------------------------------------------------H-HHHTT-HHHHHHHHHHHHH-T---T-HHH
T ss_pred cccccccccccccc-----ccccccccccccc-------cccchhhHHHHHHHHHhHHHHHHHHHHHHHhccCCCCCHHH
Confidence 45555554333332 1244555554332 4678888888777666654 5555555543333467889
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 133 YKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 133 Ye~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
|...|.++...|+.++|..+|++++.....
T Consensus 149 ~~~~a~~~~~~G~~~~A~~~~~~al~~~P~ 178 (280)
T PF13429_consen 149 WLALAEIYEQLGDPDKALRDYRKALELDPD 178 (280)
T ss_dssp HHHHHHHHHHCCHHHHHHHHHHHHHHH-TT
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Confidence 999999999999999999999999997654
No 15
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=89.13 E-value=0.94 Score=27.62 Aligned_cols=31 Identities=13% Similarity=0.172 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 132 FYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 132 fYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
.+..+|..+...|++++|.++|++.++....
T Consensus 3 ~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~ 33 (44)
T PF13428_consen 3 AWLALARAYRRLGQPDEAERLLRRALALDPD 33 (44)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 4677899999999999999999999997644
No 16
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=88.61 E-value=1.4 Score=24.67 Aligned_cols=30 Identities=20% Similarity=0.279 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
|..|..-|..+...|++++|.+.|+++++-
T Consensus 1 a~~~~~lg~~y~~~~~~~~A~~~~~~a~~~ 30 (34)
T PF13181_consen 1 AEAYYNLGKIYEQLGDYEEALEYFEKALEL 30 (34)
T ss_dssp -HHHHHHHHHHHHTTSHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHhh
Confidence 346777899999999999999999999874
No 17
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=86.70 E-value=1 Score=40.55 Aligned_cols=119 Identities=16% Similarity=0.127 Sum_probs=76.0
Q ss_pred HHHHHHHHHHHHHHhcCCCCCChHHHHHHHHHHH--Hh--------CCCCCC--------CCChHHHHHHHHHHhhcccc
Q psy5551 31 TAICLSSREFELQIRTYDGPDPLQLRFDYVQWLE--QS--------YPCLGP--------ETNIIPFLEETLVAFKNIEQ 92 (172)
Q Consensus 31 ~~l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~~--~~--------yp~~~~--------~s~l~~lLErc~~~f~~~e~ 92 (172)
+.+-..|+.||.+|+.-+ --|.-+++||+.=. +. --++++ .....-++.|.|.+|-+|
T Consensus 31 ~~IvktRr~fE~rL~rr~--~klnDf~~YI~yE~nleklRaKR~Kr~~v~~K~s~sD~sipqk~~f~~~R~tnkff~D-- 106 (435)
T COG5191 31 RRIVKTRRKFELRLQRRE--KKLNDFMRYIKYECNLEKLRAKRVKRKKVGKKASFSDMSIPQKKIFELYRSTNKFFND-- 106 (435)
T ss_pred HHHHHHHHHHHHHHhccc--chHHHHHHHHHHHhhHHHHHHHHHHHHHhcccccchhccccceeeEeeehhhhcCCCC--
Confidence 456678999999997643 23666889998621 11 111221 112344567777777665
Q ss_pred cccchhHHHHHHHHhhcC------CCHHHHHHHHHHCCcchHHHHHHHH-HHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 93 YKQDPRYVSLVIKYIGTQ------PNPLEIYNLVYSENIGTKLAMFYKA-WAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 93 Y~nD~RyLkiWi~Ya~~~------~dp~~if~~L~~~~IG~~~AlfYe~-wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
+++|..|+.+. ..-..||..+..+.=- =+.+|+. =+.-++..++++-|.+++++|+..+.+
T Consensus 107 -------~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~--nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~ 174 (435)
T COG5191 107 -------PKIWSQYAAYVIKKKMYGEMKNIFAECLTKHPL--NVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSR 174 (435)
T ss_pred -------cHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CceeeeeeccchhhhhccHHHHHHHHHhhhccCCC
Confidence 57899998875 3455666655544311 1233443 455678889999999999999998754
No 18
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=85.35 E-value=1.9 Score=21.81 Aligned_cols=29 Identities=10% Similarity=0.025 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 131 MFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 131 lfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
..|...|..+...|++++|...|+.+++.
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~~~ 30 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALEL 30 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHHcc
Confidence 35677899999999999999999999875
No 19
>PF02631 RecX: RecX family; InterPro: IPR003783 RecX is a putative bacterial regulatory protein []. The gene encoding RecX is found downstream of recA, and it is suggested that the RecX protein might be regulator of RecA activity by interaction with the RecA protein or filament [].; GO: 0006282 regulation of DNA repair; PDB: 3DFG_A 3D5L_B 3C1D_B 3E3V_A.
Probab=84.58 E-value=9.6 Score=28.17 Aligned_cols=76 Identities=17% Similarity=0.207 Sum_probs=47.3
Q ss_pred HHHHHHHHHhhcccccccchhHHHHHHHHhhc--CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy5551 78 PFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT--QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQL 155 (172)
Q Consensus 78 ~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~--~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~ 155 (172)
+.++..+.++.+ ..|-||.||.+-|+...-. ..-|..|-.-|..+||-..+.. ..++..-..+.|..+.++
T Consensus 9 e~I~~vi~~l~~-~gyidD~~ya~~~v~~~~~~~~~G~~~I~~~L~~kGi~~~~i~------~~l~~~~~~e~a~~~~~k 81 (121)
T PF02631_consen 9 EAIEEVIDRLKE-LGYIDDERYAESYVRSRLRRKGKGPRRIRQKLKQKGIDREIIE------EALEEYDEEEEALELAEK 81 (121)
T ss_dssp HHHHHHHHHHHH-TTSS-HHHHHHHHHHHHHHHTT--HHHHHHHHHHTT--HHHHH------HHHTCS-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHH-cCCCCHHHHHHHHHHHhcccccccHHHHHHHHHHHCCChHHHH------HHHHHhhHHHHHHHHHHH
Confidence 344444444333 3899999999999997764 4569999999999999988654 333322333446666666
Q ss_pred HHHcC
Q psy5551 156 GLNAH 160 (172)
Q Consensus 156 Gi~~~ 160 (172)
-....
T Consensus 82 k~~~~ 86 (121)
T PF02631_consen 82 KYRRY 86 (121)
T ss_dssp HHHHT
T ss_pred HHhcc
Confidence 55554
No 20
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=84.36 E-value=6.7 Score=26.56 Aligned_cols=72 Identities=13% Similarity=0.026 Sum_probs=42.9
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhc---CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT---QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVF 153 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~---~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~ 153 (172)
+.++++++..-.+++ ++. +|+..|.. ..+-...+..+-..+++.......--.|..+...|++++|..+|
T Consensus 9 i~~~~k~~~~~~~~~---~~~----~~~~la~~~~~~~~y~~A~~~~~~~~~~~~~~~~~~l~a~~~~~l~~y~eAi~~l 81 (84)
T PF12895_consen 9 IKYYEKLLELDPTNP---NSA----YLYNLAQCYFQQGKYEEAIELLQKLKLDPSNPDIHYLLARCLLKLGKYEEAIKAL 81 (84)
T ss_dssp HHHHHHHHHHHCGTH---HHH----HHHHHHHHHHHTTHHHHHHHHHHCHTHHHCHHHHHHHHHHHHHHTT-HHHHHHHH
T ss_pred HHHHHHHHHHCCCCh---hHH----HHHHHHHHHHHCCCHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHhCCHHHHHHHH
Confidence 445566655433311 222 45555554 24455555555555566655444445599999999999999999
Q ss_pred HH
Q psy5551 154 QL 155 (172)
Q Consensus 154 ~~ 155 (172)
++
T Consensus 82 ~~ 83 (84)
T PF12895_consen 82 EK 83 (84)
T ss_dssp HH
T ss_pred hc
Confidence 76
No 21
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=84.16 E-value=10 Score=28.55 Aligned_cols=45 Identities=9% Similarity=-0.071 Sum_probs=27.6
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
+.+.|.-.....-+ .+..|...|..+...|++++|...++..+..
T Consensus 154 A~~~~~~~~~~~~~--~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~ 198 (234)
T TIGR02521 154 AEKYLTRALQIDPQ--RPESLLELAELYYLRGQYKDARAYLERYQQT 198 (234)
T ss_pred HHHHHHHHHHhCcC--ChHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 45555554443322 3445556777777777777777777777665
No 22
>KOG2047|consensus
Probab=83.96 E-value=15 Score=36.05 Aligned_cols=55 Identities=9% Similarity=0.215 Sum_probs=37.2
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchHHH
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIEDL 167 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~~L 167 (172)
.++|+|..-...==|.-.--+|..||.+-|..|--..|.+||+++ .....|.++|
T Consensus 568 raRdLFEqaL~~Cpp~~aKtiyLlYA~lEEe~GLar~amsiyera-t~~v~~a~~l 622 (835)
T KOG2047|consen 568 RARDLFEQALDGCPPEHAKTIYLLYAKLEEEHGLARHAMSIYERA-TSAVKEAQRL 622 (835)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH-HhcCCHHHHH
Confidence 356666666665455555667888888888888888888888873 4444555544
No 23
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=83.95 E-value=1.1 Score=26.24 Aligned_cols=25 Identities=12% Similarity=0.271 Sum_probs=20.6
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Q psy5551 133 YKAWAEVLDAHNDIKQANHVFQLGL 157 (172)
Q Consensus 133 Ye~wA~~lE~~~~~~~A~~I~~~Gi 157 (172)
|..-|.++...|+|++|.++|+..+
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~aL 26 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQAL 26 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 4467889999999999999999944
No 24
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=82.10 E-value=2.8 Score=27.06 Aligned_cols=32 Identities=13% Similarity=0.149 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 129 LAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 129 ~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
-|..|...|..+...|++++|...|.++|+..
T Consensus 2 ~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~ 33 (69)
T PF13414_consen 2 NAEAWYNLGQIYFQQGDYEEAIEYFEKAIELD 33 (69)
T ss_dssp SHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Confidence 36788999999999999999999999999865
No 25
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=81.92 E-value=3.7 Score=28.99 Aligned_cols=32 Identities=13% Similarity=0.153 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
..|..|..=|.-+++.|++.+|..-|+.||.-
T Consensus 4 ~~A~~~a~~AVe~D~~gr~~eAi~~Y~~aIe~ 35 (75)
T cd02682 4 EMARKYAINAVKAEKEGNAEDAITNYKKAIEV 35 (75)
T ss_pred HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence 46888999999999999999999999999974
No 26
>PRK14136 recX recombination regulator RecX; Provisional
Probab=81.32 E-value=9.4 Score=33.81 Aligned_cols=74 Identities=16% Similarity=0.245 Sum_probs=48.0
Q ss_pred HHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy5551 79 FLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGL 157 (172)
Q Consensus 79 lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi 157 (172)
+++..|.+++ ...|-||.||.+.|+..-....-|..|=..|..+||...+..= ..+.+-| ..++.|.+++++-.
T Consensus 195 ~IE~VIerLk-e~gYLDDeRFAesyVr~R~~kkGp~rIrqELrQKGId~eLIEq--ALeeieE--DE~E~A~~L~eKK~ 268 (309)
T PRK14136 195 SVEPLLDALE-REGWLSDARFAESLVHRRASRVGSARIVSELKRHAVGDALVES--VGAQLRE--TEFERAQAVWRKKF 268 (309)
T ss_pred HHHHHHHHHH-HcCCcCHHHHHHHHHHHHhhchhHHHHHHHHHHcCCCHHHHHH--HHHhccH--hHHHHHHHHHHHHh
Confidence 4444444432 3399999999999997632223489999999999999887642 2222211 23566666766644
No 27
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=80.02 E-value=11 Score=24.02 Aligned_cols=49 Identities=6% Similarity=0.121 Sum_probs=38.9
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
.+++..+|..+.... -..+..+-..+..+...|++++|...|+..++..
T Consensus 13 ~~~A~~~~~~~l~~~--P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~ 61 (65)
T PF13432_consen 13 YDEAIAAFEQALKQD--PDNPEAWYLLGRILYQQGRYDEALAYYERALELD 61 (65)
T ss_dssp HHHHHHHHHHHHCCS--TTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHHHHHC--CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHC
Confidence 346788888888888 4466777778888889999999999999988654
No 28
>KOG0495|consensus
Probab=79.85 E-value=4.9 Score=39.41 Aligned_cols=93 Identities=10% Similarity=0.126 Sum_probs=71.3
Q ss_pred hHHHHHHHH--HHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCC------HHHHHHHHHHCC
Q psy5551 53 LQLRFDYVQ--WLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPN------PLEIYNLVYSEN 124 (172)
Q Consensus 53 L~~w~~YI~--W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~d------p~~if~~L~~~~ 124 (172)
-.+|..|++ |..++. ..-..|||+|++.|-+ |-|+|+..-+...+ +++. |+..-+
T Consensus 651 eRv~mKs~~~er~ld~~------eeA~rllEe~lk~fp~---------f~Kl~lmlGQi~e~~~~ie~aR~a--Y~~G~k 713 (913)
T KOG0495|consen 651 ERVWMKSANLERYLDNV------EEALRLLEEALKSFPD---------FHKLWLMLGQIEEQMENIEMAREA--YLQGTK 713 (913)
T ss_pred chhhHHHhHHHHHhhhH------HHHHHHHHHHHHhCCc---------hHHHHHHHhHHHHHHHHHHHHHHH--HHhccc
Confidence 368998876 444322 2358899999999854 67999998776432 4443 455666
Q ss_pred cchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 125 IGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 125 IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
-+-.+-.+|..-|.+-|..|...+|..|+.+|.-+|..
T Consensus 714 ~cP~~ipLWllLakleEk~~~~~rAR~ildrarlkNPk 751 (913)
T KOG0495|consen 714 KCPNSIPLWLLLAKLEEKDGQLVRARSILDRARLKNPK 751 (913)
T ss_pred cCCCCchHHHHHHHHHHHhcchhhHHHHHHHHHhcCCC
Confidence 67788999999999999999999999999999887743
No 29
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=79.23 E-value=6.9 Score=21.86 Aligned_cols=31 Identities=19% Similarity=0.151 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
|..|-.-+..+...|++++|..-|+++|+-.
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 31 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALELD 31 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHHC
Confidence 3456677888999999999999999999754
No 30
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=78.73 E-value=43 Score=29.04 Aligned_cols=113 Identities=16% Similarity=0.235 Sum_probs=75.6
Q ss_pred HHHHHH-HHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcC-
Q psy5551 33 ICLSSR-EFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ- 110 (172)
Q Consensus 33 l~~~r~-~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~- 110 (172)
+.+.|. .+|+.|.. ..|=-.+++.|++=..+..|. ..+...-++++.+ +|....||+.|++.+
T Consensus 46 ~~E~klsilerAL~~--np~~~~L~l~~l~~~~~~~~~----~~l~~~we~~l~~---------~~~~~~LW~~yL~~~q 110 (321)
T PF08424_consen 46 LAERKLSILERALKH--NPDSERLLLGYLEEGEKVWDS----EKLAKKWEELLFK---------NPGSPELWREYLDFRQ 110 (321)
T ss_pred HHHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHhCCH----HHHHHHHHHHHHH---------CCCChHHHHHHHHHHH
Confidence 334444 47777766 235567888888877766653 3466677777766 455788888888863
Q ss_pred --------CCHHHHHH----HHHHCCcc------------hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 111 --------PNPLEIYN----LVYSENIG------------TKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 111 --------~dp~~if~----~L~~~~IG------------~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
++-+++|. .|....=| ..+...+...+.++-..|..+.|.+++|.=+.-+
T Consensus 111 ~~~~~f~v~~~~~~y~~~l~~L~~~~~~~~~~~~~~~~~e~~~l~v~~r~~~fl~~aG~~E~Ava~~Qa~lE~n 184 (321)
T PF08424_consen 111 SNFASFTVSDVRDVYEKCLRALSRRRSGRMTSHPDLPELEEFMLYVFLRLCRFLRQAGYTERAVALWQALLEFN 184 (321)
T ss_pred HHhccCcHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHHHHCCchHHHHHHHHHHHHHH
Confidence 13344443 34444333 3555667788999999999999999999887754
No 31
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=76.96 E-value=6.7 Score=26.21 Aligned_cols=32 Identities=13% Similarity=0.074 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.-|..+..=|.-++..|++++|...|..||..
T Consensus 3 ~~A~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~ 34 (69)
T PF04212_consen 3 DKAIELIKKAVEADEAGNYEEALELYKEAIEY 34 (69)
T ss_dssp HHHHHHHHHHHHHHHTTSHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 34677788899999999999999999999873
No 32
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=76.35 E-value=6.7 Score=21.03 Aligned_cols=30 Identities=17% Similarity=0.412 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHhcCCCCCChHHHHHHHHHHH
Q psy5551 33 ICLSSREFELQIRTYDGPDPLQLRFDYVQWLE 64 (172)
Q Consensus 33 l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~~ 64 (172)
.+..|.-||..|.... .-.+.|..|+++..
T Consensus 3 ~~~~r~i~e~~l~~~~--~~~~~W~~y~~~e~ 32 (33)
T smart00386 3 IERARKIYERALEKFP--KSVELWLKYAEFEE 32 (33)
T ss_pred HHHHHHHHHHHHHHCC--CChHHHHHHHHHHh
Confidence 3567889999997654 77899999999853
No 33
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=76.17 E-value=8.9 Score=21.83 Aligned_cols=30 Identities=17% Similarity=0.164 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
|.-+...|..+-..|++.+|..+++..+.-
T Consensus 2 a~~~~~la~~~~~~g~~~~A~~~~~~al~~ 31 (42)
T PF13374_consen 2 ASALNNLANAYRAQGRYEEALELLEEALEI 31 (42)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhhcchhhHHHHHHHHH
Confidence 445667788999999999999999998764
No 34
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=74.62 E-value=8.7 Score=24.86 Aligned_cols=54 Identities=11% Similarity=0.059 Sum_probs=41.3
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchHHHH
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIEDLE 168 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~~L~ 168 (172)
+.+++..+... ......+|...|..+...|++.+|.+.|+..++....+.+...
T Consensus 14 A~~~~~~~l~~--~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~ 67 (73)
T PF13371_consen 14 ALEVLERALEL--DPDDPELWLQRARCLFQLGRYEEALEDLERALELSPDDPDARA 67 (73)
T ss_pred HHHHHHHHHHh--CcccchhhHHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHH
Confidence 44455555443 4446777888999999999999999999999998877766544
No 35
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=74.32 E-value=9.4 Score=25.17 Aligned_cols=32 Identities=19% Similarity=0.206 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.+|..|...|..+...|++++|...|++.++-
T Consensus 3 ~~a~~~~~la~~~~~~~~~~~A~~~~~~al~~ 34 (78)
T PF13424_consen 3 DTANAYNNLARVYRELGRYDEALDYYEKALDI 34 (78)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 46778889999999999999999999999875
No 36
>KOG1914|consensus
Probab=73.40 E-value=5.4 Score=38.19 Aligned_cols=97 Identities=15% Similarity=0.259 Sum_probs=61.5
Q ss_pred hHHHHHHHHHHHHhCCCC---C--CCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCC---------------
Q psy5551 53 LQLRFDYVQWLEQSYPCL---G--PETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPN--------------- 112 (172)
Q Consensus 53 L~~w~~YI~W~~~~yp~~---~--~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~d--------------- 112 (172)
++.|.++|+|=.++ |-. | ..+.+.=+.|.|+..+.-.+ .||..|+.+...
T Consensus 237 v~~W~n~I~wEksN-pL~t~~~~~~~~Rv~yayeQ~ll~l~~~p---------eiWy~~s~yl~~~s~l~~~~~d~~~a~ 306 (656)
T KOG1914|consen 237 VELWKNWIKWEKSN-PLRTLDGTMLTRRVMYAYEQCLLYLGYHP---------EIWYDYSMYLIEISDLLTEKGDVPDAK 306 (656)
T ss_pred HHHHHHHHHHHhcC-CcccccccHHHHHHHHHHHHHHHHHhcCH---------HHHHHHHHHHHHhhHHHHHhcccccch
Confidence 37899999995542 422 2 24456667899998876544 588888776422
Q ss_pred -----HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCC---HHHHHHHHHHHHHcC
Q psy5551 113 -----PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHND---IKQANHVFQLGLNAH 160 (172)
Q Consensus 113 -----p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~---~~~A~~I~~~Gi~~~ 160 (172)
...+|+-...--- ...+++|-++|.+-|..-+ ++.-..+|++-+...
T Consensus 307 ~~t~e~~~~yEr~I~~l~-~~~~~Ly~~~a~~eE~~~~~n~~~~~~~~~~~ll~~~ 361 (656)
T KOG1914|consen 307 SLTDEAASIYERAIEGLL-KENKLLYFALADYEESRYDDNKEKKVHEIYNKLLKIE 361 (656)
T ss_pred hhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhHHHhcccchhhhhHHHHHHHHhhh
Confidence 3444444433222 2366777777777777766 777777777665543
No 37
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=71.47 E-value=9.3 Score=25.21 Aligned_cols=31 Identities=19% Similarity=0.155 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
.+|..|-.-|..+...|++++|.+.|++.++
T Consensus 44 ~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~ 74 (78)
T PF13424_consen 44 DTANTLNNLGECYYRLGDYEEALEYYQKALD 74 (78)
T ss_dssp HHHHHHHHHHHHHHHTTHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHh
Confidence 4688899999999999999999999999875
No 38
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=70.56 E-value=31 Score=23.60 Aligned_cols=53 Identities=11% Similarity=0.039 Sum_probs=39.6
Q ss_pred CCHHHHHHHHHHCCcch-HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 111 PNPLEIYNLVYSENIGT-KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG~-~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
+.+.+.|.-+....-+. ..+..+...+..+...|++.+|...|..-+......
T Consensus 56 ~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~ 109 (119)
T TIGR02795 56 ADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKRYPGS 109 (119)
T ss_pred HHHHHHHHHHHHHCCCCCcccHHHHHHHHHHHHhCChHHHHHHHHHHHHHCcCC
Confidence 45777888777654443 235567777888899999999999999999876544
No 39
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=70.34 E-value=11 Score=25.58 Aligned_cols=32 Identities=16% Similarity=0.071 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.-|..++.=|.-.+..|++++|...|..|+..
T Consensus 6 ~~A~~li~~Av~~d~~g~~~eAl~~Y~~a~e~ 37 (77)
T smart00745 6 SKAKELISKALKADEAGDYEEALELYKKAIEY 37 (77)
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 44667777888899999999999999999873
No 40
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=70.29 E-value=11 Score=26.31 Aligned_cols=33 Identities=21% Similarity=0.134 Sum_probs=28.0
Q ss_pred hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 127 TKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 127 ~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
..-|.-+..=|.-++..|+|++|...|..||.-
T Consensus 3 ~~~a~~l~~~Ave~D~~g~y~eAl~~Y~~aie~ 35 (77)
T cd02683 3 ELAAKEVLKRAVELDQEGRFQEALVCYQEGIDL 35 (77)
T ss_pred hHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHH
Confidence 345667778889999999999999999999973
No 41
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=70.07 E-value=13 Score=28.13 Aligned_cols=65 Identities=14% Similarity=0.102 Sum_probs=51.4
Q ss_pred cchhHHHHHHHHhhcC------CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCC
Q psy5551 95 QDPRYVSLVIKYIGTQ------PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHA 161 (172)
Q Consensus 95 nD~RyLkiWi~Ya~~~------~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A 161 (172)
.||.....|...+..+ .++.+.|..... +....+..+-.++..+...|++++|...|+..|....
T Consensus 53 ~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~--l~p~~~~a~~~lg~~l~~~g~~~eAi~~~~~Al~~~p 123 (144)
T PRK15359 53 AQPWSWRAHIALAGTWMMLKEYTTAINFYGHALM--LDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSY 123 (144)
T ss_pred cCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHh--cCCCCcHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 3666777787777653 468888888886 4566677777888889999999999999999998653
No 42
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=67.83 E-value=25 Score=21.45 Aligned_cols=48 Identities=15% Similarity=0.119 Sum_probs=35.4
Q ss_pred CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 111 PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
+++.+.|....... ......+...|..+...|++++|...+..++...
T Consensus 51 ~~a~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 98 (100)
T cd00189 51 EEALEDYEKALELD--PDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98 (100)
T ss_pred HHHHHHHHHHHhCC--CcchhHHHHHHHHHHHHHhHHHHHHHHHHHHccC
Confidence 44666666665543 3333667788899999999999999999887644
No 43
>KOG2366|consensus
Probab=66.79 E-value=2.8 Score=38.27 Aligned_cols=24 Identities=38% Similarity=0.717 Sum_probs=17.2
Q ss_pred Cccc-ccccccccccCCCchhhhHH
Q psy5551 1 MVYS-CSWPVYQIYSGMTVDLINIE 24 (172)
Q Consensus 1 ~~~~-~~~pv~~~~sg~~~~~~~~~ 24 (172)
|+|| ||||++..--+-.++.-|..
T Consensus 190 i~ySlC~W~~~~~~~~~~pny~~i~ 214 (414)
T KOG2366|consen 190 IFYSLCSWPAYHPGLPHHPNYKNIS 214 (414)
T ss_pred eEEEeccCcccccCccCCCcchhhh
Confidence 6899 99999997666555544433
No 44
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=65.52 E-value=17 Score=25.53 Aligned_cols=32 Identities=13% Similarity=0.086 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.-|.-|..=|.-+++.|+|.+|...|+.||.-
T Consensus 4 ~~Ai~~a~~Ave~D~~g~y~eA~~~Y~~aie~ 35 (76)
T cd02681 4 RDAVQFARLAVQRDQEGRYSEAVFYYKEAAQL 35 (76)
T ss_pred HHHHHHHHHHHHHHHccCHHHHHHHHHHHHHH
Confidence 34667788899999999999999999999873
No 45
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=63.74 E-value=18 Score=24.52 Aligned_cols=31 Identities=16% Similarity=0.108 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 128 KLAMFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 128 ~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
.-|.++..=|.-.+..|++++|...|..|+.
T Consensus 4 ~~a~~l~~~Av~~D~~g~~~~Al~~Y~~a~e 34 (75)
T cd02656 4 QQAKELIKQAVKEDEDGNYEEALELYKEALD 34 (75)
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3467777888889999999999999999986
No 46
>PRK14134 recX recombination regulator RecX; Provisional
Probab=63.53 E-value=17 Score=31.36 Aligned_cols=54 Identities=11% Similarity=0.156 Sum_probs=40.5
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHCCcchHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSENIGTKLAM 131 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~~IG~~~Al 131 (172)
..++++++.+++. ..|-||.||.+.++..--...-|..|-.-|..+||...+..
T Consensus 92 ~~~Ie~vI~~L~e-~~yldD~ryA~~yv~~~~~~~G~~~I~~eL~qKGI~~~iIe 145 (283)
T PRK14134 92 EDAVNRVIRFLKE-YNFIDDDKYCDMYIREKINSYGRNKIKYTLLNKGIKENIII 145 (283)
T ss_pred HHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHhhhHHHHHHHHHHCCCCHHHHH
Confidence 3455555555333 39999999999999864333459999999999999988654
No 47
>PF05596 Taeniidae_ag: Taeniidae antigen; InterPro: IPR008860 This family consists of several antigen proteins from Taenia and Echinococcus (tapeworm) species.
Probab=63.00 E-value=19 Score=24.73 Aligned_cols=46 Identities=28% Similarity=0.551 Sum_probs=38.5
Q ss_pred HHHHHHHHHCCcchHHHHHHHHHHHH-HHHcCCHHHHHHHHHHHHHc
Q psy5551 114 LEIYNLVYSENIGTKLAMFYKAWAEV-LDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 114 ~~if~~L~~~~IG~~~AlfYe~wA~~-lE~~~~~~~A~~I~~~Gi~~ 159 (172)
..|..|.++.=||.++|.++.+|-.. .+..+++..|..=|-+|+.+
T Consensus 17 ~~v~~FF~~DPlGqkIa~l~kdw~~~~~~~r~KiR~~L~ey~k~L~~ 63 (64)
T PF05596_consen 17 EEVRNFFYEDPLGQKIAQLAKDWNEICQEVRKKIRAALAEYCKGLKN 63 (64)
T ss_pred HHHHHHhccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 45778889999999999999999765 56668888888888888764
No 48
>PF03012 PP_M1: Phosphoprotein; InterPro: IPR004259 This family includes the M1 phosphoprotein non-structural RNA polymerase alpha subunit (2.7.7.48 from EC) from various strains of Rabies virus []. The M1 phosphoprotein is thought to be a component of the active polymerase, and may be involved in template binding.; GO: 0003968 RNA-directed RNA polymerase activity, 0019083 viral transcription; PDB: 1VYI_A 3L32_B 3OA1_B 2WZL_A.
Probab=60.42 E-value=8.1 Score=33.81 Aligned_cols=45 Identities=16% Similarity=0.234 Sum_probs=20.0
Q ss_pred HHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHh
Q psy5551 41 ELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYI 107 (172)
Q Consensus 41 E~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya 107 (172)
|++....+|.||+-++..|+.=+= -+.++ +-|.-+||+|||=.-.
T Consensus 78 eEDfym~e~eDP~v~FQsyLD~vG----------------~qiVr------kMKsGE~FfKIWSq~v 122 (298)
T PF03012_consen 78 EEDFYMDEGEDPLVPFQSYLDEVG----------------AQIVR------KMKSGEGFFKIWSQTV 122 (298)
T ss_dssp -------------HHHHHHHHHHH----------------HHHHH------HHCTT--HHHHHHHHH
T ss_pred ccccccccccCchhHHHHHHHHHH----------------HHHHH------HHhccccHHHHHHHHH
Confidence 333444579999999999987541 12333 3466789999997654
No 49
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=60.14 E-value=47 Score=32.39 Aligned_cols=77 Identities=17% Similarity=0.054 Sum_probs=49.7
Q ss_pred ChHHHHHHHHHHhhcccccccchhHHHHHHHHhhc------CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHH
Q psy5551 75 NIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT------QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQ 148 (172)
Q Consensus 75 ~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~------~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~ 148 (172)
.-..+|++|+.. +|.+...++.|+.. .+++...+..+... ....+.+.-..|..|...|++++
T Consensus 104 ea~~~l~~~~~~---------~Pd~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~--~p~~~~~~~~~a~~l~~~g~~~~ 172 (694)
T PRK15179 104 EGLAVWRGIHQR---------FPDSSEAFILMLRGVKRQQGIEAGRAEIELYFSG--GSSSAREILLEAKSWDEIGQSEQ 172 (694)
T ss_pred HHHHHHHHHHhh---------CCCcHHHHHHHHHHHHHhccHHHHHHHHHHHhhc--CCCCHHHHHHHHHHHHHhcchHH
Confidence 446666777665 45566667777654 24566666666554 34556666677777777777777
Q ss_pred HHHHHHHHHHcCCC
Q psy5551 149 ANHVFQLGLNAHAE 162 (172)
Q Consensus 149 A~~I~~~Gi~~~A~ 162 (172)
|.++|++.+..++.
T Consensus 173 A~~~y~~~~~~~p~ 186 (694)
T PRK15179 173 ADACFERLSRQHPE 186 (694)
T ss_pred HHHHHHHHHhcCCC
Confidence 77777777766544
No 50
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=59.72 E-value=43 Score=23.95 Aligned_cols=50 Identities=20% Similarity=0.205 Sum_probs=38.2
Q ss_pred CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 111 PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
+++..+|.-.. +++.....+|-.-|..+...|++++|...|+.+++....
T Consensus 68 ~~A~~~~~~~~--~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~ 117 (135)
T TIGR02552 68 EEAIDAYALAA--ALDPDDPRPYFHAAECLLALGEPESALKALDLAIEICGE 117 (135)
T ss_pred HHHHHHHHHHH--hcCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccc
Confidence 34555666544 356667777778888999999999999999999987643
No 51
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=59.65 E-value=23 Score=24.27 Aligned_cols=31 Identities=19% Similarity=0.084 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 129 LAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 129 ~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
-|.++..=|.-.+..|++++|...|..||..
T Consensus 5 ~A~~l~~~Av~~D~~g~y~eA~~~Y~~aie~ 35 (75)
T cd02678 5 KAIELVKKAIEEDNAGNYEEALRLYQHALEY 35 (75)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 4667777788899999999999999999873
No 52
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=59.45 E-value=13 Score=23.69 Aligned_cols=49 Identities=10% Similarity=0.089 Sum_probs=35.8
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
++.++|+-+....=+ .+..+..+|..+-..|++++|..++...+.....
T Consensus 9 ~A~~~~~~~l~~~p~--~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~ 57 (68)
T PF14559_consen 9 EAIELLEKALQRNPD--NPEARLLLAQCYLKQGQYDEAEELLERLLKQDPD 57 (68)
T ss_dssp HHHHHHHHHHHHTTT--SHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTT
T ss_pred HHHHHHHHHHHHCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 355666666554333 4455567999999999999999999999887665
No 53
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=59.30 E-value=35 Score=25.19 Aligned_cols=25 Identities=16% Similarity=0.197 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Q psy5551 132 FYKAWAEVLDAHNDIKQANHVFQLG 156 (172)
Q Consensus 132 fYe~wA~~lE~~~~~~~A~~I~~~G 156 (172)
+|++-..++.+.|.+++|..++-..
T Consensus 84 l~~~~~~l~~k~~~~~~Al~~~l~~ 108 (140)
T smart00299 84 LYEEAVELYKKDGNFKDAIVTLIEH 108 (140)
T ss_pred cHHHHHHHHHhhcCHHHHHHHHHHc
Confidence 7889999999999999998887653
No 54
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=58.79 E-value=31 Score=26.54 Aligned_cols=51 Identities=10% Similarity=0.133 Sum_probs=38.0
Q ss_pred CCHHHHHHHHHHCCcc-hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCC
Q psy5551 111 PNPLEIYNLVYSENIG-TKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHA 161 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG-~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A 161 (172)
.++.+.|.......-. ...+..|...|..+...|++++|...|+++++...
T Consensus 52 ~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p 103 (172)
T PRK02603 52 AEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNP 103 (172)
T ss_pred HHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCc
Confidence 3466666665543222 12467889999999999999999999999998654
No 55
>COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification]
Probab=56.92 E-value=34 Score=32.55 Aligned_cols=120 Identities=12% Similarity=0.106 Sum_probs=73.6
Q ss_pred HHHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcC--
Q psy5551 33 ICLSSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ-- 110 (172)
Q Consensus 33 l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~-- 110 (172)
..+.|..+|+-.+...--. -.|..||+=-. .-.+ -..+-.|+-||+.+-.+ |.+|..|.+++
T Consensus 58 ~~~~re~yeq~~~pfp~~~--~aw~ly~s~EL---A~~d-f~svE~lf~rCL~k~l~----------ldLW~lYl~YIRr 121 (660)
T COG5107 58 MDAEREMYEQLSSPFPIME--HAWRLYMSGEL---ARKD-FRSVESLFGRCLKKSLN----------LDLWMLYLEYIRR 121 (660)
T ss_pred HHHHHHHHHHhcCCCcccc--HHHHHHhcchh---hhhh-HHHHHHHHHHHHhhhcc----------HhHHHHHHHHHHh
Confidence 3445666665554422111 36888876210 0000 11366788899988554 88999998874
Q ss_pred -C-----C----HHHHHHHHHHC-CcchHHHHHHHHHHHHHHH---------cCCHHHHHHHHHHHHHcCCCchHHHH
Q psy5551 111 -P-----N----PLEIYNLVYSE-NIGTKLAMFYKAWAEVLDA---------HNDIKQANHVFQLGLNAHAEPIEDLE 168 (172)
Q Consensus 111 -~-----d----p~~if~~L~~~-~IG~~~AlfYe~wA~~lE~---------~~~~~~A~~I~~~Gi~~~A~P~~~L~ 168 (172)
+ . --+.|+|.... +.-.+...||-+|..++|. +.+.+.-..+|+++|+---.-++.|-
T Consensus 122 ~n~~~tGq~r~~i~~ayefv~~~~~~e~~s~~~W~ey~~fle~~~~~~kwEeQqrid~iR~~Y~ral~tP~~nleklW 199 (660)
T COG5107 122 VNNLITGQKRFKIYEAYEFVLGCAIFEPQSENYWDEYGLFLEYIEELGKWEEQQRIDKIRNGYMRALQTPMGNLEKLW 199 (660)
T ss_pred hCcccccchhhhhHHHHHHHHhcccccccccchHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHcCccccHHHHH
Confidence 2 1 35678887772 2234678899999988865 44566677788888876444444443
No 56
>KOG1840|consensus
Probab=56.78 E-value=41 Score=31.67 Aligned_cols=46 Identities=22% Similarity=0.307 Sum_probs=39.0
Q ss_pred HHHHHHHHHHCCcch-HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 113 PLEIYNLVYSENIGT-KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 113 p~~if~~L~~~~IG~-~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
..+|+. ...|.-. .+|-.|...|..+-.+|+|++|..+|+..|++.
T Consensus 351 al~i~~--~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~ 397 (508)
T KOG1840|consen 351 ALKIYL--DAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQIL 397 (508)
T ss_pred HHHHHH--hhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHH
Confidence 344444 5567777 899999999999999999999999999999875
No 57
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=56.78 E-value=72 Score=31.14 Aligned_cols=57 Identities=19% Similarity=0.305 Sum_probs=47.9
Q ss_pred hcCCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchHH
Q psy5551 108 GTQPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIED 166 (172)
Q Consensus 108 ~~~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~~ 166 (172)
.-..++.++|.-+.. =+...+.-+.+||..|...|+.++|...|+.+|...+--...
T Consensus 168 g~~~~A~~~y~~~~~--~~p~~~~~~~~~a~~l~~~G~~~~A~~~~~~a~~~~~~~~~~ 224 (694)
T PRK15179 168 GQSEQADACFERLSR--QHPEFENGYVGWAQSLTRRGALWRARDVLQAGLDAIGDGARK 224 (694)
T ss_pred cchHHHHHHHHHHHh--cCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhhCcchHH
Confidence 345689999999998 455777888999999999999999999999999987654433
No 58
>PRK00117 recX recombination regulator RecX; Reviewed
Probab=55.91 E-value=41 Score=25.85 Aligned_cols=53 Identities=15% Similarity=0.223 Sum_probs=40.5
Q ss_pred HHHHHHHHHhhcccccccchhHHHHHHHHhhcC-CCHHHHHHHHHHCCcchHHHH
Q psy5551 78 PFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ-PNPLEIYNLVYSENIGTKLAM 131 (172)
Q Consensus 78 ~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~-~dp~~if~~L~~~~IG~~~Al 131 (172)
.+++.++..+. ...|-||.||.+-++...... .-|..|-..|..+||...+..
T Consensus 43 ~~i~~vl~~l~-~~~~ldD~~~a~~~~~~~~~~~~g~~~I~~~L~~kGi~~~~I~ 96 (157)
T PRK00117 43 EVIEAVLDRLK-EEGLLDDERFAESFVRSRARKGYGPRRIRQELRQKGVDREIIE 96 (157)
T ss_pred HHHHHHHHHHH-HcCCCCHHHHHHHHHHHHHhCCchHHHHHHHHHHcCCCHHHHH
Confidence 45555555544 448899999999999887332 358999999999999988754
No 59
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=54.14 E-value=39 Score=24.17 Aligned_cols=51 Identities=10% Similarity=0.205 Sum_probs=40.8
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
..++...|..+... +.....+|...|..+...|++.+|..+|+.++...-.
T Consensus 33 ~~~A~~~~~~~~~~--~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~ 83 (135)
T TIGR02552 33 YDEALKLFQLLAAY--DPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPD 83 (135)
T ss_pred HHHHHHHHHHHHHh--CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Confidence 34577777777664 4556788888999999999999999999999987644
No 60
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=53.09 E-value=24 Score=19.00 Aligned_cols=29 Identities=21% Similarity=0.157 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
|+|..+ ..+...|++++|..+|+.=|..-
T Consensus 2 a~~~~a--~~~~~~g~~~~A~~~~~~~~~~~ 30 (33)
T PF13174_consen 2 ALYRLA--RCYYKLGDYDEAIEYFQRLIKRY 30 (33)
T ss_dssp HHHHHH--HHHHHHCHHHHHHHHHHHHHHHS
T ss_pred HHHHHH--HHHHHccCHHHHHHHHHHHHHHC
Confidence 344444 44556899999999999877653
No 61
>PRK14135 recX recombination regulator RecX; Provisional
Probab=52.50 E-value=43 Score=27.98 Aligned_cols=54 Identities=17% Similarity=0.220 Sum_probs=42.9
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhcCC--CHHHHHHHHHHCCcchHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQP--NPLEIYNLVYSENIGTKLAM 131 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~--dp~~if~~L~~~~IG~~~Al 131 (172)
..+++.++..+. ...|-||.||.+-++...-... -|..|-.-|..+||...+..
T Consensus 88 ~~~Ie~vl~~l~-~~~~ldD~~~a~~~~~~~~~~~~~g~~~I~~kL~~kGi~~~~Ie 143 (263)
T PRK14135 88 EEIISEVIDKLK-EEKYIDDKEYAESYVRTNINTGDKGPRVIKQKLLQKGIEDEIIE 143 (263)
T ss_pred HHHHHHHHHHHH-HcCCCCHHHHHHHHHHHHHhccccchHHHHHHHHHcCCCHHHHH
Confidence 466777777755 4588999999999998764433 58999999999999988765
No 62
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=52.48 E-value=26 Score=31.83 Aligned_cols=45 Identities=20% Similarity=0.272 Sum_probs=38.4
Q ss_pred HHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 118 NLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 118 ~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
.-+-.+.-+..+|.||-+.|.......++++|...+.++++.++.
T Consensus 168 ~k~~~q~~~~eIAqfyCELAq~~~~~~~~d~A~~~l~kAlqa~~~ 212 (389)
T COG2956 168 VKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKK 212 (389)
T ss_pred HHcCCccchhHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhhCcc
Confidence 334456678899999999999999999999999999999987643
No 63
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=52.25 E-value=32 Score=24.41 Aligned_cols=33 Identities=27% Similarity=0.121 Sum_probs=28.3
Q ss_pred hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 127 TKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 127 ~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.+.|.=|++=|.-.|..|..++|...|++||..
T Consensus 5 ~~~A~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~ 37 (79)
T cd02679 5 YKQAFEEISKALRADEWGDKEQALAHYRKGLRE 37 (79)
T ss_pred HHHHHHHHHHHhhhhhcCCHHHHHHHHHHHHHH
Confidence 456777888888889999999999999999874
No 64
>KOG1166|consensus
Probab=51.85 E-value=69 Score=32.70 Aligned_cols=104 Identities=14% Similarity=0.229 Sum_probs=63.2
Q ss_pred cCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHCCc
Q psy5551 46 TYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSENI 125 (172)
Q Consensus 46 ~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~~I 125 (172)
...+++|+..|.+||.|++. |-.+++.....+-++.|...+..... +.+-++.|+++.-..-++.+-|.-..+
T Consensus 8 ~~~~~~~~n~eq~li~el~~-~~~~DPl~~w~ryi~wv~~~~~~~~~---~~~~l~~~lerc~~~~~~lk~Y~nD~R--- 80 (974)
T KOG1166|consen 8 EQQNPTPLNYEQRLIYELES-YAGNDPLDKWLRYIEWVLEVYPEGKE---NQSLLRNLLERCLEELEDLKRYRNDPR--- 80 (974)
T ss_pred hhccCcHHHHHHHHHHHHHh-hcCCCchhhhHhHhhhhhhccccCCc---hhhhHHHHHHHHHHhccchhhccccHH---
Confidence 56789999999999999985 44444545567777777777666555 455566666654332222222222222
Q ss_pred chHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 126 GTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 126 G~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
+..+|.. ++....+++|..+|..=-+.++-+
T Consensus 81 ---fl~~~~~----~~~~e~~~d~~d~f~~m~~kgIg~ 111 (974)
T KOG1166|consen 81 ---FLILWCS----LELREELQDAEDFFSYLENKGIGT 111 (974)
T ss_pred ---HHHHHHh----HHHHHHHhhHHHHHHHHHhccccc
Confidence 2222222 677777788888877655554433
No 65
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=50.39 E-value=98 Score=23.24 Aligned_cols=49 Identities=12% Similarity=0.184 Sum_probs=39.3
Q ss_pred CHHHHHHHHHHCCc-chHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 112 NPLEIYNLVYSENI-GTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 112 dp~~if~~L~~~~I-G~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
.+..+|.--..+|. |....--++..|..+-..|++++|..+++.++...
T Consensus 19 ~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~ 68 (120)
T PF12688_consen 19 EAIPLYRRALAAGLSGADRRRALIQLASTLRNLGRYDEALALLEEALEEF 68 (120)
T ss_pred HHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHC
Confidence 46778888888774 55567788888899999999999999999888764
No 66
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=50.37 E-value=24 Score=22.33 Aligned_cols=26 Identities=27% Similarity=0.321 Sum_probs=23.3
Q ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 135 AWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 135 ~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
..|..+...|++++|..+|+..+...
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l~~~ 27 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQALKQD 27 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHHCCS
T ss_pred hHHHHHHHcCCHHHHHHHHHHHHHHC
Confidence 35788899999999999999999887
No 67
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=50.30 E-value=39 Score=17.99 Aligned_cols=27 Identities=22% Similarity=0.266 Sum_probs=21.3
Q ss_pred HHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 137 AEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 137 A~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
=..+.+.|++++|.++|+.=.+.+-.|
T Consensus 7 i~~~~~~~~~~~a~~~~~~M~~~g~~p 33 (35)
T TIGR00756 7 IDGLCKAGRVEEALELFKEMLERGIEP 33 (35)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHcCCCC
Confidence 345677899999999999877777666
No 68
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=49.41 E-value=1.5e+02 Score=24.63 Aligned_cols=95 Identities=12% Similarity=0.116 Sum_probs=63.1
Q ss_pred HHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCC------C
Q psy5551 39 EFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQP------N 112 (172)
Q Consensus 39 ~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~------d 112 (172)
.+..++... .|+++.|..-+.|=.-.++. + .-..-....|+++|+.+. -
T Consensus 110 ~W~~Rl~~~--~~~~~~~~~il~~R~~~l~~----------~-------------~~~~~~~~~~l~~a~~aRk~g~~~~ 164 (352)
T PF02259_consen 110 RWRSRLPNM--QDDFSVWEPILSLRRLVLSL----------I-------------LLPEELAETWLKFAKLARKAGNFQL 164 (352)
T ss_pred HHHHHHHHh--ccchHHHHHHHHHHHHHHhc----------c-------------cchhHHHHHHHHHHHHHHHCCCcHH
Confidence 344444333 77777887777774433331 1 333445678999998752 2
Q ss_pred HHHHHHHHHHCCc--chHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 113 PLEIYNLVYSENI--GTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 113 p~~if~~L~~~~I--G~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
+...+..+...+. +........+.|.++-..|+..+|...++..+.
T Consensus 165 A~~~l~~~~~~~~~~~~~~~~v~~e~akllw~~g~~~~Ai~~L~~~~~ 212 (352)
T PF02259_consen 165 ALSALNRLFQLNPSSESLLPRVFLEYAKLLWAQGEQEEAIQKLRELLK 212 (352)
T ss_pred HHHHHHHHhccCCcccCCCcchHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 5555555555332 222677888999999999999999999999998
No 69
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=48.62 E-value=41 Score=21.36 Aligned_cols=45 Identities=16% Similarity=0.155 Sum_probs=34.6
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcC-CHHHHHHHHHHHHHc
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHN-DIKQANHVFQLGLNA 159 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~-~~~~A~~I~~~Gi~~ 159 (172)
+...|.-... +....+..|...|..+...| ++++|.+.++++|+-
T Consensus 22 A~~~~~~ai~--~~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l 67 (69)
T PF13414_consen 22 AIEYFEKAIE--LDPNNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKL 67 (69)
T ss_dssp HHHHHHHHHH--HSTTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHH--cCCCCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHc
Confidence 4444444444 34667778889999999999 799999999999874
No 70
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=48.42 E-value=1.8e+02 Score=26.77 Aligned_cols=66 Identities=11% Similarity=0.106 Sum_probs=45.2
Q ss_pred chhHHHHHHHHhhc------CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 96 DPRYVSLVIKYIGT------QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 96 D~RyLkiWi~Ya~~------~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
+|.....|...+.. .+++.+++..|...+... +..+..-+..+...|++++|.+.|+..+.....+
T Consensus 665 ~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~--~~~~~~~~~~~~~~g~~~~A~~~~~~~~~~~~~~ 736 (899)
T TIGR02917 665 KPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKA--ALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSS 736 (899)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCcCC--hHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCc
Confidence 34444556655554 235778888887776544 4445556778888899999999999888876554
No 71
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=47.19 E-value=1.3e+02 Score=25.42 Aligned_cols=49 Identities=18% Similarity=0.245 Sum_probs=27.7
Q ss_pred CHHHHHHHHHHCCcc---hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 112 NPLEIYNLVYSENIG---TKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 112 dp~~if~~L~~~~IG---~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
++.++|.-+...+-. ...+.+|...+..+...|++++|..+|++.++..
T Consensus 159 ~A~~~~~~~~~~~~~~~~~~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~ 210 (389)
T PRK11788 159 KAIDVAERLEKLGGDSLRVEIAHFYCELAQQALARGDLDAARALLKKALAAD 210 (389)
T ss_pred HHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHhHC
Confidence 355566655543321 1245566666666666666666666666666543
No 72
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=44.63 E-value=1.3e+02 Score=22.35 Aligned_cols=47 Identities=15% Similarity=0.039 Sum_probs=32.8
Q ss_pred CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy5551 111 PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGL 157 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi 157 (172)
.++.+-...|....-..-.+.+++--+.++..+|++++|...|+.+|
T Consensus 99 ~~~d~Al~~L~~~~~~~~~~~~~~~~Gdi~~~~g~~~~A~~~y~~Al 145 (145)
T PF09976_consen 99 GQYDEALATLQQIPDEAFKALAAELLGDIYLAQGDYDEARAAYQKAL 145 (145)
T ss_pred CCHHHHHHHHHhccCcchHHHHHHHHHHHHHHCCCHHHHHHHHHHhC
Confidence 34555555554433344456677888888999999999999998764
No 73
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=44.24 E-value=51 Score=22.76 Aligned_cols=29 Identities=21% Similarity=0.124 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
|.+-..=|.-.+..|++++|...|..||.
T Consensus 6 Ai~lv~~Av~~D~~g~y~eA~~lY~~ale 34 (75)
T cd02684 6 AIALVVQAVKKDQRGDAAAALSLYCSALQ 34 (75)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHH
Confidence 34445556778899999999999999987
No 74
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=44.04 E-value=46 Score=23.12 Aligned_cols=25 Identities=16% Similarity=0.222 Sum_probs=20.4
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 134 KAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 134 e~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
..-|.-.+..|+|++|...|+.||.
T Consensus 10 ~~~Ave~d~~~~y~eA~~~Y~~~i~ 34 (75)
T cd02677 10 IRLALEKEEEGDYEAAFEFYRAGVD 34 (75)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHH
Confidence 4456778888999999999998886
No 75
>PF13041 PPR_2: PPR repeat family
Probab=43.93 E-value=69 Score=19.41 Aligned_cols=33 Identities=12% Similarity=0.104 Sum_probs=26.0
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchH
Q psy5551 133 YKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIE 165 (172)
Q Consensus 133 Ye~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~ 165 (172)
|-..=..+.+.|++++|.++|+.=.+++-+|-.
T Consensus 6 yn~li~~~~~~~~~~~a~~l~~~M~~~g~~P~~ 38 (50)
T PF13041_consen 6 YNTLISGYCKAGKFEEALKLFKEMKKRGIKPDS 38 (50)
T ss_pred HHHHHHHHHHCcCHHHHHHHHHHHHHcCCCCCH
Confidence 444445677889999999999998888888853
No 76
>KOG1914|consensus
Probab=43.82 E-value=1.9e+02 Score=28.12 Aligned_cols=119 Identities=16% Similarity=0.148 Sum_probs=79.0
Q ss_pred HHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcC---
Q psy5551 34 CLSSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ--- 110 (172)
Q Consensus 34 ~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~--- 110 (172)
.+.+.-+|..|....+.+ ..++.-|...=+..+-.. +........++.+..+..+ +-=+|+.|++..
T Consensus 310 ~e~~~~yEr~I~~l~~~~-~~Ly~~~a~~eE~~~~~n-~~~~~~~~~~~ll~~~~~~--------~tLv~~~~mn~irR~ 379 (656)
T KOG1914|consen 310 DEAASIYERAIEGLLKEN-KLLYFALADYEESRYDDN-KEKKVHEIYNKLLKIEDID--------LTLVYCQYMNFIRRA 379 (656)
T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHhhHHHhcccc-hhhhhHHHHHHHHhhhccC--------CceehhHHHHHHHHh
Confidence 356778999888766553 445666776666555433 3445777777777665444 334678887764
Q ss_pred ---CCHHHHHHHHHHCCcchHHHHHHHHHHHHH-HHcCCHHHHHHHHHHHHHcC-CCch
Q psy5551 111 ---PNPLEIYNLVYSENIGTKLAMFYKAWAEVL-DAHNDIKQANHVFQLGLNAH-AEPI 164 (172)
Q Consensus 111 ---~dp~~if~~L~~~~IG~~~AlfYe~wA~~l-E~~~~~~~A~~I~~~Gi~~~-A~P~ 164 (172)
+.++.||.-.-.-+-+.-+ -|++=|..- -..++.+-|-.||.+|+..- -.|.
T Consensus 380 eGlkaaR~iF~kaR~~~r~~hh--VfVa~A~mEy~cskD~~~AfrIFeLGLkkf~d~p~ 436 (656)
T KOG1914|consen 380 EGLKAARKIFKKAREDKRTRHH--VFVAAALMEYYCSKDKETAFRIFELGLKKFGDSPE 436 (656)
T ss_pred hhHHHHHHHHHHHhhccCCcch--hhHHHHHHHHHhcCChhHHHHHHHHHHHhcCCChH
Confidence 5689999999888877733 344433322 23489999999999999863 4443
No 77
>KOG2002|consensus
Probab=43.33 E-value=1.4e+02 Score=30.63 Aligned_cols=101 Identities=17% Similarity=0.204 Sum_probs=73.2
Q ss_pred CCChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHh----------------------
Q psy5551 50 PDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYI---------------------- 107 (172)
Q Consensus 50 dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya---------------------- 107 (172)
.-||+.|+..- ..|+.+|+-...+.|||..+ .+...+.-|++...+|+.++
T Consensus 38 ~a~le~wi~~A----leYy~~gk~eefi~iLE~g~---~~~~~~y~d~~~~~~~a~~~laay~s~~a~kek~~~~k~e~~ 110 (1018)
T KOG2002|consen 38 QAPLEAWIEIA----LEYYKQGKTEEFIKILESGL---IDANEEYADVKSDQMKALDILAAYYSQLAMKEKKKDEKDELF 110 (1018)
T ss_pred cCchhHHHHHH----HHHHhcccHHHHHHHHHhhh---hcccchhcchHHHHHHHHHHHHHHHHHHHHHHHhcchhHHHH
Confidence 45888887542 34666665557888898888 55566677888888888764
Q ss_pred -------------hc--------------------CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHH
Q psy5551 108 -------------GT--------------------QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQ 154 (172)
Q Consensus 108 -------------~~--------------------~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~ 154 (172)
++ .+++..-|.|+.++.=+--+++| .=|.+...+|+|..|..+|+
T Consensus 111 ~~at~~~~~A~ki~m~~~~~l~~~~~~~l~~~~~~~~~A~a~F~~Vl~~sp~Nil~Ll--GkA~i~ynkkdY~~al~yyk 188 (1018)
T KOG2002|consen 111 DKATLLFDLADKIDMYEDSHLLVQRGFLLLEGDKSMDDADAQFHFVLKQSPDNILALL--GKARIAYNKKDYRGALKYYK 188 (1018)
T ss_pred HHHHHHhhHHHHhhccCcchhhhhhhhhhhcCCccHHHHHHHHHHHHhhCCcchHHHH--HHHHHHhccccHHHHHHHHH
Confidence 00 13567778888887766665554 67899999999999999999
Q ss_pred HHHHc
Q psy5551 155 LGLNA 159 (172)
Q Consensus 155 ~Gi~~ 159 (172)
..+.-
T Consensus 189 ~al~i 193 (1018)
T KOG2002|consen 189 KALRI 193 (1018)
T ss_pred HHHhc
Confidence 96654
No 78
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=43.26 E-value=54 Score=17.66 Aligned_cols=31 Identities=19% Similarity=0.212 Sum_probs=22.7
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 133 YKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 133 Ye~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
|..--..+-+.|+++.|.++|+.=.+.+-+|
T Consensus 4 y~~ll~a~~~~g~~~~a~~~~~~M~~~gv~P 34 (34)
T PF13812_consen 4 YNALLRACAKAGDPDAALQLFDEMKEQGVKP 34 (34)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC
Confidence 5556666777888888888888766666555
No 79
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=43.15 E-value=85 Score=23.63 Aligned_cols=51 Identities=10% Similarity=-0.065 Sum_probs=42.0
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
.+++.+.|..+.... ...+.+|..-|..+...|++++|...|+.++.....
T Consensus 40 ~~~A~~~~~~al~~~--P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~ 90 (144)
T PRK15359 40 YSRAVIDFSWLVMAQ--PWSWRAHIALAGTWMMLKEYTTAINFYGHALMLDAS 90 (144)
T ss_pred HHHHHHHHHHHHHcC--CCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCC
Confidence 356788888877655 446788899999999999999999999999986643
No 80
>KOG3617|consensus
Probab=42.81 E-value=33 Score=34.98 Aligned_cols=28 Identities=14% Similarity=0.418 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 131 MFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 131 lfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
.+|..|++++|+.|..+.|..+|..+..
T Consensus 913 ~L~~WWgqYlES~GemdaAl~~Y~~A~D 940 (1416)
T KOG3617|consen 913 SLYSWWGQYLESVGEMDAALSFYSSAKD 940 (1416)
T ss_pred HHHHHHHHHHhcccchHHHHHHHHHhhh
Confidence 7899999999999999999999988754
No 81
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=42.69 E-value=1.4e+02 Score=22.19 Aligned_cols=53 Identities=8% Similarity=0.061 Sum_probs=39.7
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
.+++...|.-......-......+...+..+...|++++|...|.++++....
T Consensus 115 ~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~ 167 (234)
T TIGR02521 115 YEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQ 167 (234)
T ss_pred HHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcC
Confidence 34567777777664333344556777899999999999999999999987644
No 82
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=42.19 E-value=1e+02 Score=31.26 Aligned_cols=81 Identities=15% Similarity=0.210 Sum_probs=62.3
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHC---Ccc-hHHHHHHHHHHHHHHHcCCHHHHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSE---NIG-TKLAMFYKAWAEVLDAHNDIKQANHV 152 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~---~IG-~~~AlfYe~wA~~lE~~~~~~~A~~I 152 (172)
..+..+++..|.+..+|.. .+.+|-+|..+..+-.++|..+.++ .|| ++.+.+++--=..+...++|.++..|
T Consensus 169 ~~m~~KAV~~~i~~kq~~~---~~e~W~k~~~~~~~d~d~f~~i~~ki~~~~~~~~~~~~~~~l~~~y~~~~~~~~~i~i 245 (906)
T PRK14720 169 ITYLKKAIYRFIKKKQYVG---IEEIWSKLVHYNSDDFDFFLRIERKVLGHREFTRLVGLLEDLYEPYKALEDWDEVIYI 245 (906)
T ss_pred HHHHHHHHHHHHhhhcchH---HHHHHHHHHhcCcccchHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhHHHHH
Confidence 6778888888988887764 7899999999988766666555443 223 55666777777788889999999999
Q ss_pred HHHHHHcC
Q psy5551 153 FQLGLNAH 160 (172)
Q Consensus 153 ~~~Gi~~~ 160 (172)
+++-+...
T Consensus 246 LK~iL~~~ 253 (906)
T PRK14720 246 LKKILEHD 253 (906)
T ss_pred HHHHHhcC
Confidence 99988753
No 83
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=41.70 E-value=57 Score=26.10 Aligned_cols=67 Identities=12% Similarity=-0.009 Sum_probs=43.5
Q ss_pred chhHHHHHHHHhhcC------CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC-CCch
Q psy5551 96 DPRYVSLVIKYIGTQ------PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH-AEPI 164 (172)
Q Consensus 96 D~RyLkiWi~Ya~~~------~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~-A~P~ 164 (172)
||-...-|+..+-.+ .++.+.|.......+-.-.+.|| -+..+-..|+...|.+-|+..|... ..|.
T Consensus 65 Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~~~--ag~c~L~lG~~~~A~~aF~~Ai~~~~~~~~ 138 (157)
T PRK15363 65 DAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWA--AAECYLACDNVCYAIKALKAVVRICGEVSE 138 (157)
T ss_pred CcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHHHH--HHHHHHHcCCHHHHHHHHHHHHHHhccChh
Confidence 555555566555432 35777777777777655555554 4566667788888888888888765 3443
No 84
>COG2137 OraA Uncharacterized protein conserved in bacteria [General function prediction only]
Probab=41.68 E-value=70 Score=25.90 Aligned_cols=49 Identities=16% Similarity=0.295 Sum_probs=38.0
Q ss_pred hHHHHHHHHHHhhcccccccchhHHHHHHHHhhcC-CCHHHHHHHHHHCCcchHH
Q psy5551 76 IIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ-PNPLEIYNLVYSENIGTKL 129 (172)
Q Consensus 76 l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~-~dp~~if~~L~~~~IG~~~ 129 (172)
+-.+|.+|.. ..|-||.||...|+.=-... .-|.-+-+-|..+||+...
T Consensus 54 Ie~Vi~~l~~-----~~~ldD~~fAe~~i~~r~~~g~G~~rl~qeL~qkGi~~~~ 103 (174)
T COG2137 54 IEEVIDRLAE-----EGYLDDTRFAEAYIRSRSRKGKGPARLKQELKQKGIDDEI 103 (174)
T ss_pred HHHHHHHHHH-----cCcccHHHHHHHHHHHHHhcccChHHHHHHHHHcCCCHHH
Confidence 3445555544 49999999999999865554 4699999999999988765
No 85
>PF02330 MAM33: Mitochondrial glycoprotein; InterPro: IPR003428 This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q.; GO: 0005759 mitochondrial matrix; PDB: 3QV0_A 1YQF_F 3JV1_A 1P32_A 3RPX_A.
Probab=40.15 E-value=48 Score=26.79 Aligned_cols=31 Identities=16% Similarity=0.221 Sum_probs=26.9
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHc
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAH 143 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~ 143 (172)
-..+..||..+||...+|.|=..|+...|++
T Consensus 161 q~~~~~yLeeRGId~~la~fl~~y~~~kEq~ 191 (204)
T PF02330_consen 161 QDAFMNYLEERGIDEELANFLHDYSTDKEQR 191 (204)
T ss_dssp HHHHHHHHHHTT-SHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHHHHHH
Confidence 3678899999999999999999999988875
No 86
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=39.16 E-value=1.8e+02 Score=25.65 Aligned_cols=46 Identities=17% Similarity=0.130 Sum_probs=30.1
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
.+++.|+-+.+.+=. ..=|...|..++..|+.++|.+.|++|+.-.
T Consensus 346 ~A~~~le~al~~~P~---~~~~~~La~~~~~~g~~~~A~~~~~~~l~~~ 391 (398)
T PRK10747 346 EASLAFRAALKQRPD---AYDYAWLADALDRLHKPEEAAAMRRDGLMLT 391 (398)
T ss_pred HHHHHHHHHHhcCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHhhh
Confidence 366666666655322 2235567778888888888888888877643
No 87
>KOG2076|consensus
Probab=39.08 E-value=1.4e+02 Score=30.31 Aligned_cols=113 Identities=15% Similarity=0.165 Sum_probs=65.9
Q ss_pred CCCCCChHHHHHHHHHHHHhCCCCC----CCCC----hHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCC-------
Q psy5551 47 YDGPDPLQLRFDYVQWLEQSYPCLG----PETN----IIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQP------- 111 (172)
Q Consensus 47 ~~~dDPL~~w~~YI~W~~~~yp~~~----~~s~----l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~------- 111 (172)
+++..|+....-.-+|-.. .|..| +++. +..+|-.+-..|.. -|+.+-.+|+..-+..-.
T Consensus 103 ~e~p~~~~~~~~e~~~s~~-~~k~~~~~r~~~~l~~~l~~ll~eAN~lfar----g~~eeA~~i~~EvIkqdp~~~~ay~ 177 (895)
T KOG2076|consen 103 YEEPEGLKQFKGEGEKSTG-TKKRGRRSRGKSKLAPELRQLLGEANNLFAR----GDLEEAEEILMEVIKQDPRNPIAYY 177 (895)
T ss_pred cccCchhhhhhhhheeccc-CCccCCCCCcccccCHHHHHHHHHHHHHHHh----CCHHHHHHHHHHHHHhCccchhhHH
Confidence 3456666666666555433 34322 2233 44455555555543 455555555555554422
Q ss_pred ----------C--HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchH
Q psy5551 112 ----------N--PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIE 165 (172)
Q Consensus 112 ----------d--p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~ 165 (172)
| ..-.|-++..+-+-... .||..||.+.++.|++.+|.-.|.++|+.+..-..
T Consensus 178 tL~~IyEqrGd~eK~l~~~llAAHL~p~d~-e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~ 242 (895)
T KOG2076|consen 178 TLGEIYEQRGDIEKALNFWLLAAHLNPKDY-ELWKRLADLSEQLGNINQARYCYSRAIQANPSNWE 242 (895)
T ss_pred HHHHHHHHcccHHHHHHHHHHHHhcCCCCh-HHHHHHHHHHHhcccHHHHHHHHHHHHhcCCcchH
Confidence 1 11223333333333333 89999999999999999999999999997754433
No 88
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=38.15 E-value=1.8e+02 Score=22.17 Aligned_cols=45 Identities=11% Similarity=0.025 Sum_probs=25.8
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
+...|.-..+. ....+..+...+..+...|+...|..=+...+..
T Consensus 91 A~~~~~~al~~--~p~~~~~~~~lg~~~~~~g~~~~a~~~~~~A~~~ 135 (172)
T PRK02603 91 ALEYYHQALEL--NPKQPSALNNIAVIYHKRGEKAEEAGDQDEAEAL 135 (172)
T ss_pred HHHHHHHHHHh--CcccHHHHHHHHHHHHHcCChHhHhhCHHHHHHH
Confidence 45555544443 3334455556677777777777766656555543
No 89
>PF12663 DUF3788: Protein of unknown function (DUF3788); InterPro: IPR024265 This family of functionally uncharacterised proteins is found in bacteria and archaea. Proteins in this family are typically between 137 and 149 amino acids in length and may be distantly related to RelE proteins.
Probab=38.03 E-value=30 Score=26.60 Aligned_cols=25 Identities=40% Similarity=0.913 Sum_probs=19.8
Q ss_pred HHhcCCCCCChHHHHHHHHHHHHhCC
Q psy5551 43 QIRTYDGPDPLQLRFDYVQWLEQSYP 68 (172)
Q Consensus 43 ~i~~~~~dDPL~~w~~YI~W~~~~yp 68 (172)
.|..+-|. |+.+|.+...|++++|+
T Consensus 10 ~i~~~lg~-~~~~w~~l~~~i~~~Y~ 34 (133)
T PF12663_consen 10 EISEYLGK-PLELWDELCSWIEETYP 34 (133)
T ss_pred HHHHHHCc-cHHHHHHHHHHHHHHcC
Confidence 34444444 77999999999999998
No 90
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=37.71 E-value=1.3e+02 Score=20.38 Aligned_cols=51 Identities=8% Similarity=0.025 Sum_probs=35.3
Q ss_pred CHHHHHHHHHHCCcch-HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 112 NPLEIYNLVYSENIGT-KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 112 dp~~if~~L~~~~IG~-~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
.+.+.|.-+.+..=+. ..+..+-..+..+...|++++|...|+.++.....
T Consensus 20 ~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~ 71 (119)
T TIGR02795 20 DAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPK 71 (119)
T ss_pred HHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCC
Confidence 3556666665443222 12334556899999999999999999999986544
No 91
>PLN03218 maturation of RBCL 1; Provisional
Probab=37.42 E-value=2.5e+02 Score=28.97 Aligned_cols=52 Identities=17% Similarity=0.274 Sum_probs=33.7
Q ss_pred CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 111 PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
+++.++|..|...|+-.. ...|......+.+.|++++|.+++..-++.+-.|
T Consensus 631 deAl~lf~eM~~~Gv~PD-~~TynsLI~a~~k~G~~eeA~~l~~eM~k~G~~p 682 (1060)
T PLN03218 631 DFALSIYDDMKKKGVKPD-EVFFSALVDVAGHAGDLDKAFEILQDARKQGIKL 682 (1060)
T ss_pred HHHHHHHHHHHHcCCCCC-HHHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCC
Confidence 356777777777666544 3456666666666777777777777766666555
No 92
>PRK14137 recX recombination regulator RecX; Provisional
Probab=37.40 E-value=86 Score=25.73 Aligned_cols=74 Identities=12% Similarity=0.155 Sum_probs=43.3
Q ss_pred HHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy5551 78 PFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGL 157 (172)
Q Consensus 78 ~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi 157 (172)
.+++.+|.++++ ..|-||.||.+.... ..---|.-|=.-|..+||...+..=. .+.+ +....++.|..+.++-.
T Consensus 72 e~Ie~vI~rL~e-~gyLDD~rfAe~~~~--~k~~Gp~rI~~eL~qKGI~~~lI~~a--l~~~-d~ede~e~a~~l~~KK~ 145 (195)
T PRK14137 72 ALVTEVLERVQE-LGYQDDAQVARAENS--RRGVGALRVRQTLRRRGVEETLIEET--LAAR-DPQEEQQEARNLLERRW 145 (195)
T ss_pred HHHHHHHHHHHH-cCCCCHHHHHHHHHH--hcCchHHHHHHHHHHcCCCHHHHHHH--HHhc-CchhHHHHHHHHHHHhc
Confidence 334444444332 389999999997521 11125889999999999998876522 2221 11123455555555544
No 93
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=36.81 E-value=65 Score=22.59 Aligned_cols=31 Identities=16% Similarity=0.120 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 129 LAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 129 ~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
-|.|...=|.-....|+|++|..+|..|+.-
T Consensus 5 kai~Lv~~A~~eD~~gny~eA~~lY~~ale~ 35 (75)
T cd02680 5 RAHFLVTQAFDEDEKGNAEEAIELYTEAVEL 35 (75)
T ss_pred HHHHHHHHHHHhhHhhhHHHHHHHHHHHHHH
Confidence 3566677777788899999999999999874
No 94
>KOG0495|consensus
Probab=36.78 E-value=53 Score=32.57 Aligned_cols=60 Identities=12% Similarity=0.112 Sum_probs=42.5
Q ss_pred HHHHHHhhc---CCCHHHHHHHHH-HCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 101 SLVIKYIGT---QPNPLEIYNLVY-SENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 101 kiWi~Ya~~---~~dp~~if~~L~-~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
|+|++++.+ .++-.+--.+|. .-++.-.+--||.--.+++|++++.+.|.+-|..|++..
T Consensus 652 Rv~mKs~~~er~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~c 715 (913)
T KOG0495|consen 652 RVWMKSANLERYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTKKC 715 (913)
T ss_pred hhhHHHhHHHHHhhhHHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHhccccC
Confidence 577777765 334444444442 124555667788888999999999999999999998875
No 95
>PF12854 PPR_1: PPR repeat
Probab=35.68 E-value=79 Score=18.10 Aligned_cols=24 Identities=13% Similarity=0.176 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHH
Q psy5551 132 FYKAWAEVLDAHNDIKQANHVFQL 155 (172)
Q Consensus 132 fYe~wA~~lE~~~~~~~A~~I~~~ 155 (172)
-|-..-.-+.+.|+.++|.+++..
T Consensus 9 ty~~lI~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 9 TYNTLIDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHh
Confidence 455666778899999999999864
No 96
>PRK11189 lipoprotein NlpI; Provisional
Probab=35.65 E-value=2.7e+02 Score=23.47 Aligned_cols=36 Identities=22% Similarity=0.338 Sum_probs=29.5
Q ss_pred cchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 125 IGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 125 IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
++.+++.-|--.+..++..|++++|...|++.+..+
T Consensus 231 l~~~~~ea~~~Lg~~~~~~g~~~~A~~~~~~Al~~~ 266 (296)
T PRK11189 231 LAERLCETYFYLAKYYLSLGDLDEAAALFKLALANN 266 (296)
T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhC
Confidence 344555667777888999999999999999999776
No 97
>KOG1070|consensus
Probab=35.64 E-value=2.3e+02 Score=30.65 Aligned_cols=104 Identities=20% Similarity=0.313 Sum_probs=72.4
Q ss_pred CChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHH------------hhccc-ccccchh--------------HHHHH
Q psy5551 51 DPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVA------------FKNIE-QYKQDPR--------------YVSLV 103 (172)
Q Consensus 51 DPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~------------f~~~e-~Y~nD~R--------------yLkiW 103 (172)
.-..+|..|+..+.. +.. +..-..+|+|++.- |...| +|-+-+| =..+|
T Consensus 1562 q~~~vW~~y~~fLl~---~ne-~~aa~~lL~rAL~~lPk~eHv~~IskfAqLEFk~GDaeRGRtlfEgll~ayPKRtDlW 1637 (1710)
T KOG1070|consen 1562 QTRKVWIMYADFLLR---QNE-AEAARELLKRALKSLPKQEHVEFISKFAQLEFKYGDAERGRTLFEGLLSAYPKRTDLW 1637 (1710)
T ss_pred chhhHHHHHHHHHhc---ccH-HHHHHHHHHHHHhhcchhhhHHHHHHHHHHHhhcCCchhhHHHHHHHHhhCccchhHH
Confidence 567889999888752 211 12345555555432 22221 2334444 57899
Q ss_pred HHHhhcC------CCHHHHHHHHHHCCcchHHH-HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 104 IKYIGTQ------PNPLEIYNLVYSENIGTKLA-MFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 104 i~Ya~~~------~dp~~if~~L~~~~IG~~~A-lfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
.-|+|+. ...+++|+-..+.++-.+=+ .||..|=.+-.+.|+=+.+++|=.+++.
T Consensus 1638 ~VYid~eik~~~~~~vR~lfeRvi~l~l~~kkmKfffKkwLeyEk~~Gde~~vE~VKarA~E 1699 (1710)
T KOG1070|consen 1638 SVYIDMEIKHGDIKYVRDLFERVIELKLSIKKMKFFFKKWLEYEKSHGDEKNVEYVKARAKE 1699 (1710)
T ss_pred HHHHHHHHccCCHHHHHHHHHHHHhcCCChhHhHHHHHHHHHHHHhcCchhhHHHHHHHHHH
Confidence 9999983 34799999999999987665 5799999999999999988888766653
No 98
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=35.47 E-value=2.8e+02 Score=23.55 Aligned_cols=83 Identities=11% Similarity=0.134 Sum_probs=52.7
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhc----CCCHHHHHHHHHHC-CcchHHHHHHHHHHHHHHHcCCHHHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT----QPNPLEIYNLVYSE-NIGTKLAMFYKAWAEVLDAHNDIKQANH 151 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~----~~dp~~if~~L~~~-~IG~~~AlfYe~wA~~lE~~~~~~~A~~ 151 (172)
+..++..+..+-++ .|.-+.. .|+-.+-+ .+++...|.-+..+ --+.+....+..-+..+...|++++|..
T Consensus 163 i~af~~fl~~yP~s-~~a~~A~---y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A~~ 238 (263)
T PRK10803 163 IVAFQNFVKKYPDS-TYQPNAN---YWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKA 238 (263)
T ss_pred HHHHHHHHHHCcCC-cchHHHH---HHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHHHH
Confidence 45566666655443 2333444 45544433 34578888888754 3334444445455677889999999999
Q ss_pred HHHHHHHcCCCc
Q psy5551 152 VFQLGLNAHAEP 163 (172)
Q Consensus 152 I~~~Gi~~~A~P 163 (172)
+|+..|..-...
T Consensus 239 ~~~~vi~~yP~s 250 (263)
T PRK10803 239 VYQQVIKKYPGT 250 (263)
T ss_pred HHHHHHHHCcCC
Confidence 999999875443
No 99
>PRK09857 putative transposase; Provisional
Probab=35.33 E-value=2.5e+02 Score=24.24 Aligned_cols=83 Identities=17% Similarity=0.179 Sum_probs=52.2
Q ss_pred ChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcC---CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHH
Q psy5551 75 NIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ---PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANH 151 (172)
Q Consensus 75 ~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~---~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~ 151 (172)
++..++......+... +.++ ..++.|+.|+-.. +++.++++.+....= +...--..+|+-+++.|.-++|.+
T Consensus 187 dl~~~~~~l~~ll~~~--~~~~-~~~~~ll~Yi~~~~~~~~~~~~~~~l~~~~~--~~~e~iMTiAEqL~qeG~qe~~~~ 261 (292)
T PRK09857 187 DLMGLVEQMACLLSSG--YAND-RQIKGLFNYILQTGDAVRFNDFIDGVAERSP--KHKESLMTIAERLRQEGEQSKALH 261 (292)
T ss_pred hHHHHHHHHHHHHHhc--cCCH-HHHHHHHHHHhhccccchHHHHHHHHHHhCc--cccHHHHHHHHHHHHHHHHHHHHH
Confidence 4666666665555543 2233 5588999998443 357888887766421 122223488999999887776766
Q ss_pred HHHHHHHcCCC
Q psy5551 152 VFQLGLNAHAE 162 (172)
Q Consensus 152 I~~~Gi~~~A~ 162 (172)
+-+.-+..+..
T Consensus 262 ia~~ml~~g~~ 272 (292)
T PRK09857 262 IAKIMLESGVP 272 (292)
T ss_pred HHHHHHHcCCC
Confidence 66666665554
No 100
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=35.31 E-value=2.2e+02 Score=24.99 Aligned_cols=48 Identities=13% Similarity=0.049 Sum_probs=36.7
Q ss_pred CCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 111 PNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 111 ~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
+++...+...... ....+..|-..+..+...|+|.+|...|+.++.-.
T Consensus 53 ~eAl~~~~~Al~l--~P~~~~a~~~lg~~~~~lg~~~eA~~~~~~al~l~ 100 (356)
T PLN03088 53 TEAVADANKAIEL--DPSLAKAYLRKGTACMKLEEYQTAKAALEKGASLA 100 (356)
T ss_pred HHHHHHHHHHHHh--CcCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhC
Confidence 3466666666554 34456667788899999999999999999999865
No 101
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=35.22 E-value=2.8e+02 Score=23.46 Aligned_cols=27 Identities=15% Similarity=0.138 Sum_probs=13.7
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 133 YKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 133 Ye~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
+...+..+...|++++|...++..++.
T Consensus 252 ~~~l~~~~~~~g~~~~A~~~l~~~~~~ 278 (389)
T PRK11788 252 LPKLMECYQALGDEAEGLEFLRRALEE 278 (389)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 334444455555555555555555544
No 102
>KOG0890|consensus
Probab=34.92 E-value=75 Score=35.38 Aligned_cols=63 Identities=16% Similarity=0.000 Sum_probs=55.7
Q ss_pred chhHHHHHHHHhhcCCC---HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 96 DPRYVSLVIKYIGTQPN---PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 96 D~RyLkiWi~Ya~~~~d---p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
+.+--++|+.+|+.... -.-.+.++.+.+-+. .+..|.+-|.++-..|+-..|..|++.|+..
T Consensus 1666 ~~~~ge~wLqsAriaR~aG~~q~A~nall~A~e~r-~~~i~~E~AK~lW~~gd~~~Al~~Lq~~l~~ 1731 (2382)
T KOG0890|consen 1666 KSRLGECWLQSARIARLAGHLQRAQNALLNAKESR-LPEIVLERAKLLWQTGDELNALSVLQEILSK 1731 (2382)
T ss_pred cchhHHHHHHHHHHHHhcccHHHHHHHHHhhhhcc-cchHHHHHHHHHHhhccHHHHHHHHHHHHHh
Confidence 67889999999999763 466788888888777 9999999999999999999999999999953
No 103
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=34.74 E-value=3e+02 Score=23.76 Aligned_cols=111 Identities=12% Similarity=0.284 Sum_probs=68.7
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCCCCC-CC-------ChHHHHHHHHHHhhcccccccchhHHHHHHHHh
Q psy5551 36 SSREFELQIRTYDGPDPLQLRFDYVQWLEQSYPCLGP-ET-------NIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYI 107 (172)
Q Consensus 36 ~r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~~~~-~s-------~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya 107 (172)
....|+++++....| ++.|++||+.-.+.++.+.. .. .-+.+|||+++. +|.=.+||+.|+
T Consensus 4 r~~el~~~v~~~P~d--i~~Wl~li~~Qd~~~~~~~~~~~~~~a~~E~klsilerAL~~---------np~~~~L~l~~l 72 (321)
T PF08424_consen 4 RTAELNRRVRENPHD--IEAWLELIEFQDELFRLQSSSKAERRALAERKLSILERALKH---------NPDSERLLLGYL 72 (321)
T ss_pred HHHHHHHHHHhCccc--HHHHHHHHHHHHHhccccccchhhHHHHHHHHHHHHHHHHHh---------CCCCHHHHHHHH
Confidence 344677777654333 78999999999888774432 11 235667777766 344456667776
Q ss_pred hcC---CCH---HHHHHHHHHCCcchHHHHHHHHHHHHHHH---cCCHHHHHHHHHHHHHc
Q psy5551 108 GTQ---PNP---LEIYNLVYSENIGTKLAMFYKAWAEVLDA---HNDIKQANHVFQLGLNA 159 (172)
Q Consensus 108 ~~~---~dp---~~if~~L~~~~IG~~~AlfYe~wA~~lE~---~~~~~~A~~I~~~Gi~~ 159 (172)
+.+ .++ .+-++-+....-| ...+|.+|-.+... .-.+.+...+|.+.++.
T Consensus 73 ~~~~~~~~~~~l~~~we~~l~~~~~--~~~LW~~yL~~~q~~~~~f~v~~~~~~y~~~l~~ 131 (321)
T PF08424_consen 73 EEGEKVWDSEKLAKKWEELLFKNPG--SPELWREYLDFRQSNFASFTVSDVRDVYEKCLRA 131 (321)
T ss_pred HHHHHhCCHHHHHHHHHHHHHHCCC--ChHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHH
Confidence 653 333 4556666665544 34455555555544 23577888888877764
No 104
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=34.64 E-value=77 Score=16.97 Aligned_cols=22 Identities=27% Similarity=0.277 Sum_probs=18.1
Q ss_pred HHHHHHHHHHcCCHHHHHHHHH
Q psy5551 133 YKAWAEVLDAHNDIKQANHVFQ 154 (172)
Q Consensus 133 Ye~wA~~lE~~~~~~~A~~I~~ 154 (172)
...-|..+-..|+..+|..+++
T Consensus 4 ~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 4 RLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHcCCHHHHHHHHh
Confidence 3456788899999999999875
No 105
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=34.13 E-value=1.3e+02 Score=24.02 Aligned_cols=64 Identities=16% Similarity=0.099 Sum_probs=47.6
Q ss_pred cchhHHHHHHHHhhc---------CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 95 QDPRYVSLVIKYIGT---------QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 95 nD~RyLkiWi~Ya~~---------~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
-+|....++..|+.. .+++..++.......-...-+++.- |..+...|++++|...|++-++..
T Consensus 102 l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~~L--A~~~~~~g~~~~Ai~~~~~aL~l~ 174 (198)
T PRK10370 102 LRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLL--ASDAFMQADYAQAIELWQKVLDLN 174 (198)
T ss_pred hCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHHHH--HHHHHHcCCHHHHHHHHHHHHhhC
Confidence 345566677777762 1357889999888888877777665 445568999999999999987764
No 106
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=33.87 E-value=93 Score=26.11 Aligned_cols=44 Identities=23% Similarity=0.304 Sum_probs=32.6
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHH
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAH-NDIKQANHVFQLGLN 158 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~-~~~~~A~~I~~~Gi~ 158 (172)
.+.++| ...|=....|..+..-|.++|.. |++++|.+.|+.++.
T Consensus 99 ~A~~~y---~~~G~~~~aA~~~~~lA~~ye~~~~d~e~Ai~~Y~~A~~ 143 (282)
T PF14938_consen 99 KAIEIY---REAGRFSQAAKCLKELAEIYEEQLGDYEKAIEYYQKAAE 143 (282)
T ss_dssp HHHHHH---HHCT-HHHHHHHHHHHHHHHCCTT--HHHHHHHHHHHHH
T ss_pred HHHHHH---HhcCcHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 344444 46777788888999999999998 999999999988875
No 107
>PRK15331 chaperone protein SicA; Provisional
Probab=33.71 E-value=89 Score=25.23 Aligned_cols=46 Identities=11% Similarity=0.213 Sum_probs=29.1
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.+.+.|.+-...+...-...||-+ ..+-..|+...|..-|...|.+
T Consensus 89 ~Ai~~Y~~A~~l~~~dp~p~f~ag--qC~l~l~~~~~A~~~f~~a~~~ 134 (165)
T PRK15331 89 KACDLYAVAFTLLKNDYRPVFFTG--QCQLLMRKAAKARQCFELVNER 134 (165)
T ss_pred HHHHHHHHHHHcccCCCCccchHH--HHHHHhCCHHHHHHHHHHHHhC
Confidence 466666666666666666666654 3344566677777776666663
No 108
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=33.06 E-value=1.5e+02 Score=27.32 Aligned_cols=46 Identities=11% Similarity=0.096 Sum_probs=26.5
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
+.+++.-+....- ....++...+..+...|++++|..+|++.++..
T Consensus 822 A~~~~~~~~~~~~--~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~ 867 (899)
T TIGR02917 822 ALEYAEKALKLAP--NIPAILDTLGWLLVEKGEADRALPLLRKAVNIA 867 (899)
T ss_pred HHHHHHHHHhhCC--CCcHHHHHHHHHHHHcCCHHHHHHHHHHHHhhC
Confidence 4445554444322 233445556666677777777777777777654
No 109
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=32.37 E-value=1.7e+02 Score=22.18 Aligned_cols=51 Identities=10% Similarity=0.077 Sum_probs=37.3
Q ss_pred CHHHHHHHHHHCCcch-HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 112 NPLEIYNLVYSENIGT-KLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 112 dp~~if~~L~~~~IG~-~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
++...|.-.....-.+ ..+..|...|..+...|++++|...|++.+.....
T Consensus 53 ~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~ 104 (168)
T CHL00033 53 EALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPF 104 (168)
T ss_pred HHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcC
Confidence 4556665554433232 25668889999999999999999999999976543
No 110
>PLN03218 maturation of RBCL 1; Provisional
Probab=32.07 E-value=1.4e+02 Score=30.67 Aligned_cols=73 Identities=12% Similarity=0.191 Sum_probs=48.9
Q ss_pred cccccchhHHHHHHHHhhc--CCCHHHHHHHHHHCCcchHHHHHHHHHHH------------------------------
Q psy5551 91 EQYKQDPRYVSLVIKYIGT--QPNPLEIYNLVYSENIGTKLAMFYKAWAE------------------------------ 138 (172)
Q Consensus 91 e~Y~nD~RyLkiWi~Ya~~--~~dp~~if~~L~~~~IG~~~AlfYe~wA~------------------------------ 138 (172)
..-+|.+-|+.+.-.+.+. ..++.++|..|...|+-.--...|...-.
T Consensus 365 ~~~~~~~~~~~~y~~l~r~G~l~eAl~Lfd~M~~~gvv~~~~v~~~~li~~~~~~g~~~eAl~lf~~M~~pd~~Tyn~LL 444 (1060)
T PLN03218 365 SGKRKSPEYIDAYNRLLRDGRIKDCIDLLEDMEKRGLLDMDKIYHAKFFKACKKQRAVKEAFRFAKLIRNPTLSTFNMLM 444 (1060)
T ss_pred CCCCCchHHHHHHHHHHHCcCHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHCCCHHHHHHHHHHcCCCCHHHHHHHH
Confidence 4456789999999988764 46899999999999986433333332222
Q ss_pred -HHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 139 -VLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 139 -~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
.+...|++++|.+||+.-++.+-.|
T Consensus 445 ~a~~k~g~~e~A~~lf~~M~~~Gl~p 470 (1060)
T PLN03218 445 SVCASSQDIDGALRVLRLVQEAGLKA 470 (1060)
T ss_pred HHHHhCcCHHHHHHHHHHHHHcCCCC
Confidence 3344566777777776666665555
No 111
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=31.89 E-value=3.2e+02 Score=23.39 Aligned_cols=84 Identities=12% Similarity=0.171 Sum_probs=53.8
Q ss_pred HHHHHHHHHHhhcccccccchhHHHH---HHHHhhc---CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSL---VIKYIGT---QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQAN 150 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLki---Wi~Ya~~---~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~ 150 (172)
..+-++-+..++ +--.|.=-+.+ |+.++.- ..++.-+|+.|..+ -|. .....-.-|...-..|+|++|.
T Consensus 147 ~dlA~k~l~~~~---~~~eD~~l~qLa~awv~l~~g~e~~~~A~y~f~El~~~-~~~-t~~~lng~A~~~l~~~~~~eAe 221 (290)
T PF04733_consen 147 PDLAEKELKNMQ---QIDEDSILTQLAEAWVNLATGGEKYQDAFYIFEELSDK-FGS-TPKLLNGLAVCHLQLGHYEEAE 221 (290)
T ss_dssp HHHHHHHHHHHH---CCSCCHHHHHHHHHHHHHHHTTTCCCHHHHHHHHHHCC-S---SHHHHHHHHHHHHHCT-HHHHH
T ss_pred HHHHHHHHHHHH---hcCCcHHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhc-cCC-CHHHHHHHHHHHHHhCCHHHHH
Confidence 344444444443 33456655555 5555433 46799999998665 444 4555678888899999999999
Q ss_pred HHHHHHHHcCCCchH
Q psy5551 151 HVFQLGLNAHAEPIE 165 (172)
Q Consensus 151 ~I~~~Gi~~~A~P~~ 165 (172)
++++.++.....-.+
T Consensus 222 ~~L~~al~~~~~~~d 236 (290)
T PF04733_consen 222 ELLEEALEKDPNDPD 236 (290)
T ss_dssp HHHHHHCCC-CCHHH
T ss_pred HHHHHHHHhccCCHH
Confidence 999999876654433
No 112
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=31.16 E-value=3.1e+02 Score=26.03 Aligned_cols=53 Identities=6% Similarity=0.041 Sum_probs=34.6
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
.+++.++|..|..+|+.-... -|..--..+...|++++|.+|+..-++.+-.|
T Consensus 306 ~~eA~~lf~~M~~~g~~pd~~-t~~~ll~a~~~~g~~~~a~~i~~~m~~~g~~~ 358 (697)
T PLN03081 306 SEEALCLYYEMRDSGVSIDQF-TFSIMIRIFSRLALLEHAKQAHAGLIRTGFPL 358 (697)
T ss_pred HHHHHHHHHHHHHcCCCCCHH-HHHHHHHHHHhccchHHHHHHHHHHHHhCCCC
Confidence 346788888888887765433 45555556666677777777776666655443
No 113
>KOG1258|consensus
Probab=30.60 E-value=64 Score=31.00 Aligned_cols=33 Identities=18% Similarity=0.276 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCC
Q psy5551 129 LAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHA 161 (172)
Q Consensus 129 ~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A 161 (172)
|=-||..||.+--+.|....+.+||++|++.-+
T Consensus 78 ~~gyW~kfA~~E~klg~~~~s~~Vfergv~aip 110 (577)
T KOG1258|consen 78 CYGYWKKFADYEYKLGNAENSVKVFERGVQAIP 110 (577)
T ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhhh
Confidence 445788888888888888888888888887543
No 114
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=30.45 E-value=93 Score=20.74 Aligned_cols=43 Identities=9% Similarity=0.121 Sum_probs=30.9
Q ss_pred HHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy5551 113 PLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQL 155 (172)
Q Consensus 113 p~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~ 155 (172)
+..+|.-+....-...-+.++-..|..+-..|+|++|..+++.
T Consensus 8 Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~ 50 (84)
T PF12895_consen 8 AIKYYEKLLELDPTNPNSAYLYNLAQCYFQQGKYEEAIELLQK 50 (84)
T ss_dssp HHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTTHHHHHHHHHHC
T ss_pred HHHHHHHHHHHCCCChhHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 4555666655555432444555589999999999999999987
No 115
>PF09005 DUF1897: Domain of unknown function (DUF1897); InterPro: IPR015096 This domain is found in Psi proteins produced by Drosophila, and in various eukaryotic hypothetical proteins. It has no known function. ; PDB: 2BN6_A 2BN5_A.
Probab=29.78 E-value=37 Score=21.04 Aligned_cols=25 Identities=20% Similarity=0.304 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHH
Q psy5551 130 AMFYKAWAEVLDAHNDIKQANHVFQ 154 (172)
Q Consensus 130 AlfYe~wA~~lE~~~~~~~A~~I~~ 154 (172)
+.|=..|++++-+.|...+|+.|=+
T Consensus 9 ~DYSaqW~eYYr~~G~~~~A~~i~~ 33 (38)
T PF09005_consen 9 PDYSAQWAEYYRQIGQHEEAEAIEQ 33 (38)
T ss_dssp ---CCHHHHHHCCCT-HHHHHHHHH
T ss_pred CCHHHHHHHHHHHcCcHHHHHHHHH
Confidence 4555689999999999999998754
No 116
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=29.24 E-value=1e+02 Score=18.35 Aligned_cols=26 Identities=15% Similarity=0.201 Sum_probs=21.6
Q ss_pred CHHHHHHHHHHHHHcCCCchHHHHHh
Q psy5551 145 DIKQANHVFQLGLNAHAEPIEDLEAA 170 (172)
Q Consensus 145 ~~~~A~~I~~~Gi~~~A~P~~~L~~~ 170 (172)
.++.|..||++-+.....|...|+=|
T Consensus 2 E~dRAR~IyeR~v~~hp~~k~WikyA 27 (32)
T PF02184_consen 2 EFDRARSIYERFVLVHPEVKNWIKYA 27 (32)
T ss_pred hHHHHHHHHHHHHHhCCCchHHHHHH
Confidence 47899999999999998888777643
No 117
>KOG2076|consensus
Probab=28.87 E-value=2.7e+02 Score=28.30 Aligned_cols=99 Identities=18% Similarity=0.295 Sum_probs=62.5
Q ss_pred CCCCCChHHHHHHHHHHHHhCCCCCCCCChHHHHHH-HHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHCCc
Q psy5551 47 YDGPDPLQLRFDYVQWLEQSYPCLGPETNIIPFLEE-TLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSENI 125 (172)
Q Consensus 47 ~~~dDPL~~w~~YI~W~~~~yp~~~~~s~l~~lLEr-c~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~~I 125 (172)
...+|| ..|.+||.|+...+--- ...+..-+-. |.+.| -.+-++++|+|+.+. .|..+.-
T Consensus 677 ~~~~d~-~~a~~~lR~~i~~~~~~--~~~~q~~l~n~~~s~~---~~~~q~v~~~R~~~~-------------~~~~~~~ 737 (895)
T KOG2076|consen 677 LYARDP-GDAFSYLRSVITQFQFY--LDVYQLNLWNLDFSYF---SKYGQRVCYLRLIMR-------------LLVKNKD 737 (895)
T ss_pred HhcCCH-HHHHHHHHHHHHHHhhh--hhhHHHHHHHHHHHHH---HHHHHHHHHHHHHHH-------------HhccCcc
Confidence 346666 67899999998754321 1234444445 44444 467889999999888 3333333
Q ss_pred c-hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC-chHHH
Q psy5551 126 G-TKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE-PIEDL 167 (172)
Q Consensus 126 G-~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~-P~~~L 167 (172)
+ ..++++ .+.++...+.|+.|..+|...-.-+.. |+.-|
T Consensus 738 ~~~~l~~i---~gh~~~~~~s~~~Al~~y~ra~~~~pd~Pl~nl 778 (895)
T KOG2076|consen 738 DTPPLALI---YGHNLFVNASFKHALQEYMRAFRQNPDSPLINL 778 (895)
T ss_pred CCcceeee---echhHhhccchHHHHHHHHHHHHhCCCCcHHHH
Confidence 3 222222 355667788888998888888777655 76543
No 118
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=28.73 E-value=4.5e+02 Score=24.48 Aligned_cols=47 Identities=11% Similarity=0.128 Sum_probs=35.6
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
.+.+.|...... .......+...|..+...|++++|...|+.++.-.
T Consensus 526 eA~~~~~kAl~l--~p~~~~a~~~la~~~~~~g~~~eAi~~~e~A~~l~ 572 (615)
T TIGR00990 526 EAENLCEKALII--DPECDIAVATMAQLLLQQGDVDEALKLFERAAELA 572 (615)
T ss_pred HHHHHHHHHHhc--CCCcHHHHHHHHHHHHHccCHHHHHHHHHHHHHHh
Confidence 456666655544 34555677888999999999999999999998763
No 119
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=28.44 E-value=65 Score=18.51 Aligned_cols=22 Identities=23% Similarity=0.178 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHcCCHHHHH
Q psy5551 129 LAMFYKAWAEVLDAHNDIKQAN 150 (172)
Q Consensus 129 ~AlfYe~wA~~lE~~~~~~~A~ 150 (172)
-+..|-..|.++-..|++++|.
T Consensus 12 n~~a~~nla~~~~~~g~~~~A~ 33 (34)
T PF13431_consen 12 NAEAYNNLANLYLNQGDYEEAI 33 (34)
T ss_pred CHHHHHHHHHHHHHCcCHHhhc
Confidence 3556777788888999999885
No 120
>PF04911 ATP-synt_J: ATP synthase j chain; InterPro: IPR006995 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit J found in the F0 complex of F-ATPases from fungal mitochondria. This subunit does not appear to display sequence similarity with subunits of F-ATPases found in other organisms []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0045263 proton-transporting ATP synthase complex, coupling factor F(o)
Probab=28.26 E-value=34 Score=22.82 Aligned_cols=18 Identities=33% Similarity=0.586 Sum_probs=12.9
Q ss_pred HHHHHhhcccccccchhH
Q psy5551 82 ETLVAFKNIEQYKQDPRY 99 (172)
Q Consensus 82 rc~~~f~~~e~Y~nD~Ry 99 (172)
+.-..-.+.+.|+||||=
T Consensus 26 k~~~a~~ns~E~~NDPRN 43 (54)
T PF04911_consen 26 KAQNAMMNSDEFKNDPRN 43 (54)
T ss_pred HHHHHHhcCHHHhcCCCC
Confidence 344445678999999983
No 121
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=28.00 E-value=1.6e+02 Score=23.41 Aligned_cols=46 Identities=13% Similarity=0.074 Sum_probs=36.3
Q ss_pred HHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCchHH
Q psy5551 121 YSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEPIED 166 (172)
Q Consensus 121 ~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P~~~ 166 (172)
.+|.|-.....=+++-|.++.+.|++++|.+.|.+-......|-..
T Consensus 27 ~~n~~kesir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~ 72 (177)
T PF10602_consen 27 KSNLGKESIRMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHK 72 (177)
T ss_pred HhccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHH
Confidence 3677778888888889999999999999999888876666555444
No 122
>PHA01810 hypothetical protein
Probab=27.68 E-value=33 Score=24.71 Aligned_cols=13 Identities=23% Similarity=0.263 Sum_probs=10.4
Q ss_pred CCCCCChHHHHHH
Q psy5551 47 YDGPDPLQLRFDY 59 (172)
Q Consensus 47 ~~~dDPL~~w~~Y 59 (172)
..||||++.|--.
T Consensus 76 iagddpieqwnmh 88 (100)
T PHA01810 76 IAGDDPIEQWNMH 88 (100)
T ss_pred HhCCChHHHHHhh
Confidence 3799999999643
No 123
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=27.14 E-value=1e+02 Score=16.00 Aligned_cols=22 Identities=14% Similarity=-0.023 Sum_probs=16.5
Q ss_pred HHHHHHHcCCHHHHHHHHHHHH
Q psy5551 136 WAEVLDAHNDIKQANHVFQLGL 157 (172)
Q Consensus 136 wA~~lE~~~~~~~A~~I~~~Gi 157 (172)
.=..+-+.|++++|.++|++=.
T Consensus 6 li~~~~~~~~~~~a~~~~~~M~ 27 (31)
T PF01535_consen 6 LISGYCKMGQFEEALEVFDEMR 27 (31)
T ss_pred HHHHHHccchHHHHHHHHHHHh
Confidence 3355677899999999997643
No 124
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=26.88 E-value=4.6e+02 Score=26.77 Aligned_cols=51 Identities=14% Similarity=0.085 Sum_probs=40.1
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCC
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAE 162 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~ 162 (172)
.+++...|....... ...+..+...+..+...|++++|...|++.++....
T Consensus 285 ~~~A~~~l~~aL~~~--P~~~~a~~~Lg~~~~~~g~~~eA~~~l~~Al~~~p~ 335 (1157)
T PRK11447 285 GGKAIPELQQAVRAN--PKDSEALGALGQAYSQQGDRARAVAQFEKALALDPH 335 (1157)
T ss_pred HHHHHHHHHHHHHhC--CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCC
Confidence 345778888777643 345667778899999999999999999999986644
No 125
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=25.79 E-value=4.6e+02 Score=23.08 Aligned_cols=77 Identities=12% Similarity=0.022 Sum_probs=46.5
Q ss_pred ChHHHHHHHHHHhhcccccccch--hH--HHHHHHHhhc-CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHH
Q psy5551 75 NIIPFLEETLVAFKNIEQYKQDP--RY--VSLVIKYIGT-QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQA 149 (172)
Q Consensus 75 ~l~~lLErc~~~f~~~e~Y~nD~--Ry--LkiWi~Ya~~-~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A 149 (172)
.+...+|+.++. .-+|+ .+ --=|+.|..- .+.+.+.|+-.....+...... +...+.++++.|+.++|
T Consensus 317 ~~~~~~e~~lk~------~p~~~~~~ll~sLg~l~~~~~~~~~A~~~le~a~a~~~~p~~~~-~~~La~ll~~~g~~~~A 389 (409)
T TIGR00540 317 KLEKLIEKQAKN------VDDKPKCCINRALGQLLMKHGEFIEAADAFKNVAACKEQLDAND-LAMAADAFDQAGDKAEA 389 (409)
T ss_pred HHHHHHHHHHHh------CCCChhHHHHHHHHHHHHHcccHHHHHHHHHHhHHhhcCCCHHH-HHHHHHHHHHcCCHHHH
Confidence 466666666654 34455 32 2234444322 2346666662123333333333 55889999999999999
Q ss_pred HHHHHHHHH
Q psy5551 150 NHVFQLGLN 158 (172)
Q Consensus 150 ~~I~~~Gi~ 158 (172)
.++|++|+.
T Consensus 390 ~~~~~~~l~ 398 (409)
T TIGR00540 390 AAMRQDSLG 398 (409)
T ss_pred HHHHHHHHH
Confidence 999999965
No 126
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=25.77 E-value=1.5e+02 Score=17.54 Aligned_cols=23 Identities=22% Similarity=0.258 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHH
Q psy5551 132 FYKAWAEVLDAHNDIKQANHVFQ 154 (172)
Q Consensus 132 fYe~wA~~lE~~~~~~~A~~I~~ 154 (172)
+.-..|..+-.+|++++|..+|+
T Consensus 3 ~~y~~a~~~y~~~ky~~A~~~~~ 25 (36)
T PF07720_consen 3 YLYGLAYNFYQKGKYDEAIHFFQ 25 (36)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHH
T ss_pred HHHHHHHHHHHHhhHHHHHHHHH
Confidence 34567888999999999999954
No 127
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=25.66 E-value=2.8e+02 Score=20.48 Aligned_cols=80 Identities=10% Similarity=0.019 Sum_probs=53.7
Q ss_pred ChHHHHHHHHHHhhcccccccchhHHHHHHHHhh-cCCCHHHHHHHHHHCCcchHH-HHHHHHHHHHHHHcCCHHHHHHH
Q psy5551 75 NIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIG-TQPNPLEIYNLVYSENIGTKL-AMFYKAWAEVLDAHNDIKQANHV 152 (172)
Q Consensus 75 ~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~-~~~dp~~if~~L~~~~IG~~~-AlfYe~wA~~lE~~~~~~~A~~I 152 (172)
.+...++.++..+.+. .|....++..-=+.+.. -.+.+...|.....+.-...+ +.-....|.++...|++++|..+
T Consensus 29 ~~~~~~~~l~~~~~~s-~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~~~d~Al~~ 107 (145)
T PF09976_consen 29 KAEAAAEQLAKDYPSS-PYAALAALQLAKAAYEQGDYDEAKAALEKALANAPDPELKPLARLRLARILLQQGQYDEALAT 107 (145)
T ss_pred HHHHHHHHHHHHCCCC-hHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcCCHHHHHHH
Confidence 3566777777775555 46655555433333322 235689999999988743322 34455678999999999999999
Q ss_pred HHH
Q psy5551 153 FQL 155 (172)
Q Consensus 153 ~~~ 155 (172)
++.
T Consensus 108 L~~ 110 (145)
T PF09976_consen 108 LQQ 110 (145)
T ss_pred HHh
Confidence 966
No 128
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=24.89 E-value=2.4e+02 Score=26.32 Aligned_cols=47 Identities=9% Similarity=0.022 Sum_probs=25.1
Q ss_pred HHHHHHHHHHCC-cchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 113 PLEIYNLVYSEN-IGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 113 p~~if~~L~~~~-IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
+...|.-....+ ++...+..|...+.++...|++++|...|++.+..
T Consensus 313 A~~~~~~al~~~~~~~~~a~a~~~lg~~~~~~g~~~eA~~~~~kal~l 360 (615)
T TIGR00990 313 AARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIEL 360 (615)
T ss_pred HHHHHHHHHhcCCCChhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Confidence 444444444432 34455555555555555566666666666655554
No 129
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=24.47 E-value=3.9e+02 Score=21.79 Aligned_cols=106 Identities=14% Similarity=0.103 Sum_probs=60.1
Q ss_pred CChHHHHHHHHHHHHhCCCCCCCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcC--CCHHHHHHHHHHCCcchH
Q psy5551 51 DPLQLRFDYVQWLEQSYPCLGPETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQ--PNPLEIYNLVYSENIGTK 128 (172)
Q Consensus 51 DPL~~w~~YI~W~~~~yp~~~~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~--~dp~~if~~L~~~~IG~~ 128 (172)
+--..|..+..+..+ .|....=..+++++++..-+++.. +.--+|+. ++.- ++..++..-+.... ..
T Consensus 144 ~~~~~~~~~a~~~~~----~G~~~~A~~~~~~al~~~P~~~~~----~~~l~~~l-i~~~~~~~~~~~l~~~~~~~--~~ 212 (280)
T PF13429_consen 144 DSARFWLALAEIYEQ----LGDPDKALRDYRKALELDPDDPDA----RNALAWLL-IDMGDYDEAREALKRLLKAA--PD 212 (280)
T ss_dssp T-HHHHHHHHHHHHH----CCHHHHHHHHHHHHHHH-TT-HHH----HHHHHHHH-CTTCHHHHHHHHHHHHHHH---HT
T ss_pred CCHHHHHHHHHHHHH----cCCHHHHHHHHHHHHHcCCCCHHH----HHHHHHHH-HHCCChHHHHHHHHHHHHHC--cC
Confidence 444566666666553 221123467777777766655443 33334432 2221 22345665554443 33
Q ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC-CCchHHH
Q psy5551 129 LAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH-AEPIEDL 167 (172)
Q Consensus 129 ~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~-A~P~~~L 167 (172)
-+.++..+|..+...|++++|...|++.+..+ ..|.-.+
T Consensus 213 ~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~d~~~~~ 252 (280)
T PF13429_consen 213 DPDLWDALAAAYLQLGRYEEALEYLEKALKLNPDDPLWLL 252 (280)
T ss_dssp SCCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT-HHHHH
T ss_pred HHHHHHHHHHHhcccccccccccccccccccccccccccc
Confidence 44678899999999999999999999999855 3555443
No 130
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=24.31 E-value=4.1e+02 Score=22.25 Aligned_cols=109 Identities=11% Similarity=0.067 Sum_probs=57.3
Q ss_pred HHHHHHHHhcCCCCCChHHHHHHHHHHHHhCCC----CC---CCCChHHHHHHHHHHhhcccccccchhHHHHHHHHhhc
Q psy5551 37 SREFELQIRTYDGPDPLQLRFDYVQWLEQSYPC----LG---PETNIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGT 109 (172)
Q Consensus 37 r~~fE~~i~~~~~dDPL~~w~~YI~W~~~~yp~----~~---~~s~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~ 109 (172)
+.-|+--...+.-. +.+..=++|++++|-- +. ...+..++--+|++.+...----+++-++.-++.+.+.
T Consensus 36 ~~~yn~G~~l~~~~---~~~~~a~~wL~~a~~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~~~~~~~~ka~~~l~~ 112 (278)
T PF08631_consen 36 RVCYNIGKSLLSKK---DKYEEAVKWLQRAYDILEKPGKMDKLSPDGSELRLSILRLLANAYLEWDTYESVEKALNALRL 112 (278)
T ss_pred HHHHHHHHHHHHcC---CChHHHHHHHHHHHHHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHH
Confidence 44444444333322 3344557777766532 21 22344455555555544443333444444444444443
Q ss_pred CCCHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC
Q psy5551 110 QPNPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAH 160 (172)
Q Consensus 110 ~~dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~ 160 (172)
. .+..|.+...|+-...-+.- .++..++++++..-|..-
T Consensus 113 l-----------~~e~~~~~~~~~L~l~il~~-~~~~~~~~~~L~~mi~~~ 151 (278)
T PF08631_consen 113 L-----------ESEYGNKPEVFLLKLEILLK-SFDEEEYEEILMRMIRSV 151 (278)
T ss_pred H-----------HHhCCCCcHHHHHHHHHHhc-cCChhHHHHHHHHHHHhc
Confidence 3 56666666666555554444 677777777777777664
No 131
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=24.29 E-value=1.7e+02 Score=21.49 Aligned_cols=40 Identities=10% Similarity=0.090 Sum_probs=32.5
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
|+.++-..|.++|--..+..|| ..+|...+|.+++..=..
T Consensus 28 ~~~~~e~~L~~~~~~~eL~~lY-------~~kg~h~~AL~ll~~l~~ 67 (108)
T PF10366_consen 28 DLEEVEEVLKEHGKYQELVDLY-------QGKGLHRKALELLKKLAD 67 (108)
T ss_pred CHHHHHHHHHHcCCHHHHHHHH-------HccCccHHHHHHHHHHhc
Confidence 7899999998888666666665 589999999999986555
No 132
>PF08328 ASL_C: Adenylosuccinate lyase C-terminal; InterPro: IPR013539 This domain is found at the C terminus of adenylosuccinate lyase(ASL; PurB in Escherichia coli). It has been identified in bacteria, eukaryotes and archaea and is found together with the lyase domain IPR000362 from INTERPRO. ASL catalyses the cleavage of succinylaminoimidazole carboxamide ribotide to aminoimidazole carboxamide ribotide and fumarate and the cleavage of adenylosuccinate to adenylate and fumarate []. ; GO: 0004018 N6-(1,2-dicarboxyethyl)AMP AMP-lyase (fumarate-forming) activity, 0006188 IMP biosynthetic process; PDB: 2HVG_A 2QGA_C 2PTS_A 2PTR_A 2PTQ_B 3BHG_A 3GZH_A.
Probab=23.67 E-value=39 Score=25.82 Aligned_cols=38 Identities=16% Similarity=0.243 Sum_probs=26.0
Q ss_pred HCCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc
Q psy5551 122 SENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNA 159 (172)
Q Consensus 122 ~~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~ 159 (172)
.+.||+.+|....+|...+--.++..=-.+....-+..
T Consensus 13 lRNiGva~~~sliA~~s~lkGl~Kl~vn~~~l~~dL~~ 50 (115)
T PF08328_consen 13 LRNIGVAFGHSLIAYKSLLKGLGKLEVNEERLAEDLDE 50 (115)
T ss_dssp HTTHHHHHHHHHHHHHHHHHHHHTEEE-HHHHHHHHCT
T ss_pred HHhhhHHHHHHHHHHHHHHHHHhcccCCHHHHHHHHHH
Confidence 46899999999999999987766544333344444443
No 133
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=22.70 E-value=2.8e+02 Score=26.12 Aligned_cols=54 Identities=11% Similarity=0.055 Sum_probs=42.3
Q ss_pred CCCHHHHHHHHHH-CCcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCc
Q psy5551 110 QPNPLEIYNLVYS-ENIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLNAHAEP 163 (172)
Q Consensus 110 ~~dp~~if~~L~~-~~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~~~A~P 163 (172)
..++.+...+|.. .+.-..-..+++.=|.++-+.|++++|..+|+.-|.+|..=
T Consensus 17 ~g~~~~AL~~L~~~~~~I~Dk~~~~E~rA~ll~kLg~~~eA~~~y~~Li~rNPdn 71 (517)
T PF12569_consen 17 AGDYEEALEHLEKNEKQILDKLAVLEKRAELLLKLGRKEEAEKIYRELIDRNPDN 71 (517)
T ss_pred CCCHHHHHHHHHhhhhhCCCHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCc
Confidence 3566677777755 33444556789999999999999999999999999998653
No 134
>smart00648 SWAP Suppressor-of-White-APricot splicing regulator. domain present in regulators which are responsible for pre-mRNA splicing processes
Probab=22.41 E-value=1.3e+02 Score=19.06 Aligned_cols=33 Identities=18% Similarity=0.253 Sum_probs=18.5
Q ss_pred HHHHHHHHHHhcCCCCCCh-------HHHHHHHHHHHHhC
Q psy5551 35 LSSREFELQIRTYDGPDPL-------QLRFDYVQWLEQSY 67 (172)
Q Consensus 35 ~~r~~fE~~i~~~~~dDPL-------~~w~~YI~W~~~~y 67 (172)
+.-..||+.|..-..+||. ++...|-.|..+.|
T Consensus 13 ~~G~~fe~~l~~~~~~n~~F~FL~~~~~~h~yy~~~l~~~ 52 (54)
T smart00648 13 RNGPEFEAKLMERERNNPQFDFLKPNDPYHAYYRKKLAEY 52 (54)
T ss_pred HhhHHHHHHHHHhcCCCCCCccCCCCCCCcHHHHHHHHHH
Confidence 3345799999764444444 33445555555443
No 135
>PF13311 DUF4080: Protein of unknown function (DUF4080)
Probab=22.18 E-value=4.2e+02 Score=21.27 Aligned_cols=74 Identities=11% Similarity=0.260 Sum_probs=50.7
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHH-------CCcchHHHHHHHHHHHHHHHcC--CHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYS-------ENIGTKLAMFYKAWAEVLDAHN--DIK 147 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~-------~~IG~~~AlfYe~wA~~lE~~~--~~~ 147 (172)
+.-+|+++..+-|..++.+=.+|| .....+|-+.|.-|.. .++|+.....|+-...++...+ +.+
T Consensus 30 Lk~~e~~le~yyNSgrF~~tl~~l------~~~~~spF~ff~~l~~~~~~~g~~~~~~s~~~l~~~L~~F~~~~~~~~~~ 103 (190)
T PF13311_consen 30 LKRFEDMLEKYYNSGRFDYTLEYL------LKEFGSPFDFFEALADYWEEQGYFKRSHSLEDLFEILYDFLKEKLGEDKE 103 (190)
T ss_pred HHHHHHHHHHHhhhhhHHHHHHHH------HHhcCCHHHHHHHHHHHHHHCCCcccCCCHHHHHHHHHHHHHHhccccHH
Confidence 455677776666666555555554 3444889888877664 4668999999999999997553 566
Q ss_pred HHHHHHHHH
Q psy5551 148 QANHVFQLG 156 (172)
Q Consensus 148 ~A~~I~~~G 156 (172)
.+.+++..-
T Consensus 104 ~~~elL~~D 112 (190)
T PF13311_consen 104 LIKELLKFD 112 (190)
T ss_pred HHHHHHHHH
Confidence 666665544
No 136
>cd06224 REM Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal domain (RasGef_N), also called REM domain (Ras exchanger motif). This domain is common in nucleotide exchange factors for Ras-like small GTPases and is typically found immediately N-terminal to the RasGef (Cdc25-like) domain. REM contacts the GTPase and is assumed to participate in the catalytic activity of the exchange factor. Proteins with the REM domain include Sos1 and Sos2, which relay signals from tyrosine-kinase mediated signalling to Ras, RasGRP1-4, RasGRF1,2, CNrasGEF, and RAP-specific nucleotide exchange factors, to name a few.
Probab=22.16 E-value=2e+02 Score=20.37 Aligned_cols=41 Identities=20% Similarity=0.329 Sum_probs=30.5
Q ss_pred HHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHH
Q psy5551 77 IPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYS 122 (172)
Q Consensus 77 ~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~ 122 (172)
..|+++++.... ..|+.|+..++.=-..+..|.+++..|..
T Consensus 3 ~~Li~~L~~~~~-----~~d~~f~~~FllTyrsF~s~~~ll~~L~~ 43 (122)
T cd06224 3 EALIEHLTSTFD-----MPDPSFVSTFLLTYRSFTTPTELLEKLIE 43 (122)
T ss_pred HHHHHHHcCCCc-----cccHHHHHHHHHHhhhhCCHHHHHHHHHH
Confidence 455666665432 68999999998877777889888888743
No 137
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=22.04 E-value=3.5e+02 Score=24.03 Aligned_cols=61 Identities=21% Similarity=0.319 Sum_probs=46.0
Q ss_pred cccchhHHHHHHHHhhcC--CCHHHHHHHHHHC--CcchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy5551 93 YKQDPRYVSLVIKYIGTQ--PNPLEIYNLVYSE--NIGTKLAMFYKAWAEVLDAHNDIKQANHVFQLGLN 158 (172)
Q Consensus 93 Y~nD~RyLkiWi~Ya~~~--~dp~~if~~L~~~--~IG~~~AlfYe~wA~~lE~~~~~~~A~~I~~~Gi~ 158 (172)
|.++..|-++=-.|...+ -||..++.+|.++ .|.+- ..-|.++..+|+...|.+..++++-
T Consensus 4 f~hs~~Y~~~q~~F~~~v~~~Dp~~l~~ll~~~PyHidtL-----lqls~v~~~~gd~~~A~~lleRALf 68 (360)
T PF04910_consen 4 FEHSKAYQEAQEQFYAAVQSHDPNALINLLQKNPYHIDTL-----LQLSEVYRQQGDHAQANDLLERALF 68 (360)
T ss_pred EeCCHHHHHHHHHHHHHHHccCHHHHHHHHHHCCCcHHHH-----HHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 446666766666665543 3999999999777 55554 4558899999999999999998864
No 138
>PF08640 U3_assoc_6: U3 small nucleolar RNA-associated protein 6; InterPro: IPR013949 This entry represents U3 nucleolar RNA-associated proteins which are involved in nucleolar processing of pre-18S ribosomal RNA [].
Probab=21.91 E-value=1.6e+02 Score=20.87 Aligned_cols=31 Identities=16% Similarity=0.306 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHhcCCCCCChHHHHHHHHHH
Q psy5551 31 TAICLSSREFELQIRTYDGPDPLQLRFDYVQWL 63 (172)
Q Consensus 31 ~~l~~~r~~fE~~i~~~~~dDPL~~w~~YI~W~ 63 (172)
..+-..|..||-.|..-.. ....|++||.+=
T Consensus 23 ~~IvkkR~~fEy~L~rr~~--~~~Dfl~YI~yE 53 (83)
T PF08640_consen 23 REIVKKRRDFEYKLQRRGK--KKSDFLRYIEYE 53 (83)
T ss_pred HHHHHHHHHHHHHHHcCCC--CHHHHHHHHHHH
Confidence 5577789999999975321 233788999883
No 139
>KOG1586|consensus
Probab=21.88 E-value=97 Score=27.08 Aligned_cols=42 Identities=21% Similarity=0.238 Sum_probs=28.1
Q ss_pred CHHHHHHHHHHCCcchHHHHHHHHHHHHHHHc-CCHHHHHHHHHHH
Q psy5551 112 NPLEIYNLVYSENIGTKLAMFYKAWAEVLDAH-NDIKQANHVFQLG 156 (172)
Q Consensus 112 dp~~if~~L~~~~IG~~~AlfYe~wA~~lE~~-~~~~~A~~I~~~G 156 (172)
.+.+||.-| |=.+.-|-++.+.|.++|.. .++++|..-|+.+
T Consensus 98 ~aieIyt~~---Grf~~aAk~~~~iaEiyEsdl~d~ekaI~~YE~A 140 (288)
T KOG1586|consen 98 KAIEIYTDM---GRFTMAAKHHIEIAEIYESDLQDFEKAIAHYEQA 140 (288)
T ss_pred HHHHHHHhh---hHHHHHHhhhhhHHHHHhhhHHHHHHHHHHHHHH
Confidence 345555544 44455677788888888887 7777777777654
No 140
>PF00618 RasGEF_N: RasGEF N-terminal motif; InterPro: IPR000651 The crystal structure of the guanine nucleotide exchange factor (GEF) region of human Sos1 complexes with Ras has been solved []. The structure consists of two distinct alpha helical structural domains: the N-terminal domain which seems to have a purely structural role and the C-terminal domain which is sufficient for catalytic activity and contains all residues that interact with Ras. A main feature of the catalytic domain is the protrusion of a helical hairpin important for the nucleotide-exchange mechanism. The N-terminal domain is likely to be important for the stability and correct placement of the hairpin structure. This entry represents a domain found in several GEF for Ras-like small GTPases which lies N-terminal to the RasGef (Cdc25-like) domain. ; GO: 0005085 guanyl-nucleotide exchange factor activity, 0051056 regulation of small GTPase mediated signal transduction, 0005622 intracellular; PDB: 3CF6_E 2BYV_E 1NVW_S 1BKD_S 1XDV_A 2II0_A 1NVU_S 1NVX_S 1NVV_S 1XD4_B ....
Probab=21.70 E-value=1.4e+02 Score=20.88 Aligned_cols=44 Identities=14% Similarity=0.253 Sum_probs=29.9
Q ss_pred ChHHHHHHHHHHhhcccccccchhHHHHHHHHhhcCCCHHHHHHHHHHC
Q psy5551 75 NIIPFLEETLVAFKNIEQYKQDPRYVSLVIKYIGTQPNPLEIYNLVYSE 123 (172)
Q Consensus 75 ~l~~lLErc~~~f~~~e~Y~nD~RyLkiWi~Ya~~~~dp~~if~~L~~~ 123 (172)
.+-.|+++++..+... |..|+...+.=-...-.|.+++..|..+
T Consensus 6 tl~~Li~~L~~~~~~~-----d~~f~~~FllTyr~F~tp~~Ll~~L~~r 49 (104)
T PF00618_consen 6 TLEKLIERLTSSFNSD-----DEEFVDTFLLTYRSFTTPEELLDKLIQR 49 (104)
T ss_dssp -HHHHHHHHCHC-SS------HHHHHHHHHHHHHHCS-HHHHHHHHHHH
T ss_pred CHHHHHHHHhcccccC-----CHHHHHHHHHHhHhhCCHHHHHHHHHHH
Confidence 3566677776553333 7799999988777777999999887653
No 141
>PF12843 DUF3820: Protein of unknown function (DUF3820); InterPro: IPR024530 This protein family is mostly found in bacteria and is currently functionally uncharacterised.
Probab=20.90 E-value=1.6e+02 Score=20.47 Aligned_cols=26 Identities=15% Similarity=0.437 Sum_probs=17.5
Q ss_pred HHHHHHHHHH-hCCCCCCCCChHHHHHHHHH
Q psy5551 56 RFDYVQWLEQ-SYPCLGPETNIIPFLEETLV 85 (172)
Q Consensus 56 w~~YI~W~~~-~yp~~~~~s~l~~lLErc~~ 85 (172)
=..|+.|+.+ -|| +..|-.+|+.+..
T Consensus 28 P~~YL~W~~~kGfP----~GkLG~~l~~~~e 54 (67)
T PF12843_consen 28 PESYLVWFARKGFP----KGKLGELLALMYE 54 (67)
T ss_pred CHHHHHHHHHcCCC----chHHHHHHHHHHH
Confidence 3579999864 344 4467777777654
Done!