Query psy6079
Match_columns 79
No_of_seqs 123 out of 1392
Neff 10.3
Searched_HMMs 46136
Date Fri Aug 16 17:05:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6079.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6079hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4441|consensus 99.8 2E-20 4.4E-25 118.8 8.6 75 3-79 399-476 (571)
2 KOG4441|consensus 99.8 2.2E-20 4.8E-25 118.6 7.6 75 3-79 352-428 (571)
3 PHA02713 hypothetical protein; 99.8 5.4E-19 1.2E-23 112.2 9.4 75 3-79 323-399 (557)
4 PF13964 Kelch_6: Kelch motif 99.8 6.7E-19 1.4E-23 80.2 6.3 49 19-69 1-50 (50)
5 PHA02713 hypothetical protein; 99.8 2.1E-18 4.5E-23 109.5 8.5 74 4-79 436-513 (557)
6 PHA02790 Kelch-like protein; P 99.8 1.3E-17 2.8E-22 104.5 9.2 73 3-79 334-408 (480)
7 PHA02790 Kelch-like protein; P 99.7 1.4E-16 3.1E-21 99.8 9.1 72 3-79 290-363 (480)
8 PHA03098 kelch-like protein; P 99.7 1.2E-16 2.6E-21 100.8 8.8 75 3-79 314-390 (534)
9 PLN02153 epithiospecifier prot 99.7 4.6E-16 1E-20 93.9 9.5 75 3-79 53-138 (341)
10 PF01344 Kelch_1: Kelch motif; 99.7 8.6E-17 1.9E-21 72.2 4.1 46 19-66 1-47 (47)
11 PHA03098 kelch-like protein; P 99.7 7.5E-16 1.6E-20 97.3 9.2 75 3-79 361-438 (534)
12 PLN02153 epithiospecifier prot 99.7 1.1E-15 2.4E-20 92.2 9.4 75 3-79 104-194 (341)
13 smart00612 Kelch Kelch domain. 99.7 3.6E-16 7.8E-21 69.8 5.3 47 33-79 1-47 (47)
14 TIGR03548 mutarot_permut cycli 99.7 1.3E-15 2.8E-20 91.3 9.1 74 4-79 92-172 (323)
15 TIGR03547 muta_rot_YjhT mutatr 99.7 1.6E-15 3.4E-20 91.6 8.8 75 3-79 88-201 (346)
16 PLN02193 nitrile-specifier pro 99.6 3.4E-15 7.3E-20 93.5 9.7 75 3-79 196-279 (470)
17 PLN02193 nitrile-specifier pro 99.6 4.2E-15 9.1E-20 93.1 9.7 75 3-79 247-329 (470)
18 PF13415 Kelch_3: Galactose ox 99.6 2E-15 4.3E-20 68.4 5.8 47 31-77 1-49 (49)
19 TIGR03547 muta_rot_YjhT mutatr 99.6 2.1E-14 4.6E-19 86.7 8.9 74 3-78 32-118 (346)
20 PF07646 Kelch_2: Kelch motif; 99.6 1.3E-14 2.8E-19 65.7 6.1 46 19-66 1-49 (49)
21 PRK14131 N-acetylneuraminic ac 99.6 3.9E-14 8.4E-19 86.6 8.8 75 3-79 109-222 (376)
22 KOG0379|consensus 99.6 4.4E-14 9.4E-19 88.8 8.5 75 3-79 142-225 (482)
23 TIGR03548 mutarot_permut cycli 99.5 3.4E-13 7.5E-18 80.9 8.7 67 11-79 52-124 (323)
24 PF13418 Kelch_4: Galactose ox 99.5 3.9E-14 8.4E-19 64.0 3.4 47 19-66 1-48 (49)
25 KOG4693|consensus 99.5 2E-13 4.4E-18 80.2 6.8 75 3-79 47-140 (392)
26 PRK14131 N-acetylneuraminic ac 99.5 5.6E-13 1.2E-17 81.6 8.9 74 3-78 53-139 (376)
27 KOG4693|consensus 99.4 1.1E-12 2.3E-17 77.3 6.2 73 5-79 221-303 (392)
28 KOG1230|consensus 99.3 7.8E-12 1.7E-16 76.9 6.9 72 4-77 158-240 (521)
29 KOG0379|consensus 99.3 2.4E-11 5.1E-16 76.7 7.9 74 3-78 193-275 (482)
30 PLN02772 guanylate kinase 99.1 4.3E-10 9.3E-15 69.3 7.6 61 17-79 22-86 (398)
31 PF13854 Kelch_5: Kelch motif 99.1 3.3E-10 7.2E-15 49.6 5.0 39 17-57 2-42 (42)
32 KOG1230|consensus 99.1 5.1E-10 1.1E-14 69.1 6.7 73 3-76 101-185 (521)
33 KOG4152|consensus 99.0 7.1E-10 1.5E-14 70.3 5.5 72 5-78 235-332 (830)
34 COG3055 Uncharacterized protei 98.8 1.7E-08 3.6E-13 61.4 5.9 67 10-78 70-145 (381)
35 COG3055 Uncharacterized protei 98.4 4E-07 8.7E-12 55.5 4.3 70 3-74 116-224 (381)
36 KOG4152|consensus 98.1 5.3E-06 1.1E-10 53.3 4.0 64 10-76 17-89 (830)
37 PF07250 Glyoxal_oxid_N: Glyox 98.0 4.2E-05 9.1E-10 45.0 6.3 74 3-78 49-127 (243)
38 smart00612 Kelch Kelch domain. 97.8 2.7E-05 5.9E-10 33.9 2.8 26 3-28 18-45 (47)
39 KOG2437|consensus 97.5 9.5E-05 2.1E-09 47.6 3.0 66 9-74 238-318 (723)
40 PF13415 Kelch_3: Galactose ox 97.3 0.00035 7.7E-09 31.1 2.6 26 3-28 22-49 (49)
41 KOG2437|consensus 97.1 0.00093 2E-08 43.3 4.1 58 4-63 350-420 (723)
42 PF13964 Kelch_6: Kelch motif 97.0 0.00064 1.4E-08 30.3 1.8 18 3-20 31-48 (50)
43 PLN02772 guanylate kinase 97.0 0.0046 9.9E-08 39.0 6.0 51 3-56 54-109 (398)
44 TIGR01640 F_box_assoc_1 F-box 96.4 0.048 1E-06 31.5 7.4 59 3-63 73-135 (230)
45 PF01344 Kelch_1: Kelch motif; 96.4 0.001 2.3E-08 29.0 0.3 17 3-19 31-47 (47)
46 TIGR01640 F_box_assoc_1 F-box 95.9 0.15 3.2E-06 29.5 7.8 62 3-64 17-87 (230)
47 PF13418 Kelch_4: Galactose ox 95.8 0.0058 1.2E-07 26.9 1.3 17 3-19 32-48 (49)
48 PF07646 Kelch_2: Kelch motif; 95.4 0.013 2.8E-07 25.9 1.6 17 3-19 33-49 (49)
49 PF12768 Rax2: Cortical protei 95.2 0.16 3.4E-06 30.9 6.2 60 4-64 20-81 (281)
50 PF07893 DUF1668: Protein of u 89.9 3.3 7.2E-05 25.8 8.1 36 4-41 90-127 (342)
51 PF07250 Glyoxal_oxid_N: Glyox 88.5 0.95 2E-05 27.0 3.5 34 30-70 179-214 (243)
52 COG4257 Vgb Streptogramin lyas 85.4 5.8 0.00013 24.7 5.6 55 3-63 257-313 (353)
53 PF13570 PQQ_3: PQQ-like domai 81.0 3.1 6.7E-05 17.2 4.0 37 11-56 4-40 (40)
54 PF08268 FBA_3: F-box associat 78.1 8.1 0.00018 20.3 6.1 33 31-64 5-37 (129)
55 PF06058 DCP1: Dcp1-like decap 69.0 6.8 0.00015 20.9 2.4 16 47-62 29-44 (122)
56 PRK09511 nirD nitrite reductas 66.1 16 0.00035 18.9 3.4 31 4-36 76-106 (108)
57 PF07443 HARP: HepA-related pr 65.5 2.8 6.1E-05 19.3 0.5 12 50-61 21-32 (55)
58 PF05096 Glu_cyclase_2: Glutam 63.1 33 0.00071 21.0 6.4 46 31-79 55-100 (264)
59 PF08450 SGL: SMP-30/Gluconola 62.9 28 0.00061 20.2 5.0 54 3-65 25-78 (246)
60 PF14830 Haemocyan_bet_s: Haem 62.4 7.7 0.00017 20.2 1.8 24 31-54 36-60 (103)
61 PF00614 PLDc: Phospholipase D 61.9 10 0.00022 14.8 2.0 16 24-41 8-23 (28)
62 PF14157 YmzC: YmzC-like prote 61.5 14 0.00031 17.5 2.4 34 24-62 23-56 (63)
63 PF03089 RAG2: Recombination a 55.2 25 0.00054 22.1 3.3 19 21-41 156-174 (337)
64 PRK11408 hypothetical protein; 54.9 30 0.00065 19.2 3.3 29 34-63 34-63 (145)
65 PF13806 Rieske_2: Rieske-like 54.1 20 0.00043 18.4 2.5 31 4-36 73-103 (104)
66 COG4880 Secreted protein conta 51.4 18 0.00039 24.1 2.4 29 25-64 571-599 (603)
67 KOG0278|consensus 44.1 41 0.00089 20.8 3.0 30 23-57 227-256 (334)
68 TIGR00547 lolA periplasmic cha 43.8 55 0.0012 19.0 3.5 24 31-61 77-100 (204)
69 cd01207 Ena-Vasp Enabled-VASP- 43.7 30 0.00064 18.3 2.2 18 47-64 9-26 (111)
70 PF03835 Rad4: Rad4 transgluta 43.6 34 0.00073 18.6 2.5 18 49-66 39-56 (145)
71 KOG2321|consensus 42.4 53 0.0012 22.7 3.5 38 17-60 131-168 (703)
72 PF03088 Str_synth: Strictosid 42.4 47 0.001 16.7 2.9 33 4-37 41-73 (89)
73 cd05398 NT_ClassII-CCAase Nucl 41.5 21 0.00044 19.4 1.4 11 31-41 16-26 (139)
74 cd03583 NTR_complement_C3 NTR/ 40.9 52 0.0011 18.2 2.9 32 31-62 86-118 (149)
75 smart00155 PLDc Phospholipase 40.6 26 0.00056 13.2 2.7 19 21-41 5-23 (28)
76 PF10946 DUF2625: Protein of u 40.1 57 0.0012 19.3 3.1 18 46-63 123-140 (208)
77 KOG2315|consensus 39.4 1.3E+02 0.0027 20.8 4.8 49 4-57 298-346 (566)
78 cd03584 NTR_complement_C4 NTR/ 38.8 64 0.0014 17.9 3.1 32 31-62 90-122 (153)
79 cd01206 Homer Homer type EVH1 38.1 42 0.00091 17.8 2.2 18 47-64 11-29 (111)
80 PF07893 DUF1668: Protein of u 37.8 1E+02 0.0023 19.4 6.6 41 31-77 76-116 (342)
81 PF10902 DUF2693: Protein of u 36.7 53 0.0012 16.4 2.3 18 45-62 47-64 (83)
82 PF11958 DUF3472: Domain of un 36.5 36 0.00077 19.5 1.9 24 47-70 80-103 (184)
83 PF13540 RCC1_2: Regulator of 35.7 34 0.00074 13.2 2.5 16 23-39 10-25 (30)
84 PF06335 DUF1054: Protein of u 35.5 71 0.0015 18.7 3.0 34 4-39 59-97 (194)
85 cd00837 EVH1 EVH1 (Enabled, Va 34.5 52 0.0011 16.8 2.2 16 47-62 9-24 (104)
86 PF06739 SBBP: Beta-propeller 33.2 33 0.00072 14.1 1.2 11 31-41 23-33 (38)
87 PF11090 DUF2833: Protein of u 31.9 77 0.0017 16.0 2.6 13 30-42 7-19 (86)
88 KOG0294|consensus 31.9 1.4E+02 0.0031 19.1 4.2 23 31-58 52-74 (362)
89 KOG1324|consensus 31.8 30 0.00066 20.0 1.1 9 32-40 111-119 (190)
90 KOG2705|consensus 31.6 28 0.00061 22.4 1.1 18 43-60 304-321 (471)
91 smart00120 HX Hemopexin-like r 30.2 51 0.0011 13.5 4.0 19 31-56 9-27 (45)
92 COG2380 Uncharacterized protei 29.7 56 0.0012 20.3 2.1 19 47-65 22-40 (327)
93 PF14741 GH114_assoc: N-termin 29.5 1E+02 0.0022 16.9 2.8 22 45-66 61-83 (129)
94 KOG2006|consensus 27.9 92 0.002 23.0 3.0 53 6-66 321-380 (1023)
95 PF00186 DHFR_1: Dihydrofolate 27.6 39 0.00085 18.9 1.1 9 32-40 92-100 (161)
96 COG4334 Uncharacterized protei 27.2 76 0.0016 16.9 2.0 14 25-40 41-54 (131)
97 KOG0282|consensus 27.1 1.9E+02 0.004 19.7 4.1 12 31-42 310-322 (503)
98 PLN03215 ascorbic acid mannose 27.0 1.4E+02 0.0031 19.4 3.6 27 9-37 189-215 (373)
99 KOG2967|consensus 26.9 39 0.00084 21.3 1.1 13 31-43 200-212 (314)
100 PLN03113 DNA ligase 1; Provisi 26.6 1.9E+02 0.0042 20.7 4.3 56 3-63 561-624 (744)
101 COG0262 FolA Dihydrofolate red 26.5 49 0.0011 18.5 1.4 9 32-40 96-104 (167)
102 PF00415 RCC1: Regulator of ch 26.3 52 0.0011 13.8 1.3 9 31-39 1-9 (51)
103 TIGR02608 delta_60_rpt delta-6 26.1 80 0.0017 14.4 3.9 26 30-55 10-37 (55)
104 PF10049 DUF2283: Protein of u 26.0 38 0.00083 14.9 0.8 11 1-11 1-11 (50)
105 smart00284 OLF Olfactomedin-li 25.9 1.7E+02 0.0036 18.0 5.2 54 3-60 156-213 (255)
106 cd03474 Rieske_T4moC Toluene-4 25.2 1.1E+02 0.0023 15.5 3.5 6 31-36 92-97 (108)
107 TIGR03352 VI_chp_3 type VI sec 24.9 1.3E+02 0.0029 16.5 3.7 24 49-72 113-136 (146)
108 PF00352 TBP: Transcription fa 24.6 1E+02 0.0022 15.1 2.3 10 31-40 57-66 (86)
109 PRK10769 folA dihydrofolate re 24.5 55 0.0012 18.3 1.3 9 32-40 89-97 (159)
110 COG4946 Uncharacterized protei 24.2 1.2E+02 0.0026 20.7 2.9 17 47-63 287-303 (668)
111 PF00045 Hemopexin: Hemopexin; 23.9 74 0.0016 13.2 4.1 18 31-55 9-26 (45)
112 PF15494 SRCR_2: Scavenger rec 23.1 1.2E+02 0.0025 15.2 2.3 17 47-63 4-20 (98)
113 smart00461 WH1 WASP homology r 23.1 1.2E+02 0.0026 15.6 2.4 17 47-63 11-27 (106)
114 PF12118 SprA-related: SprA-re 22.8 52 0.0011 20.5 1.1 10 30-39 182-191 (294)
115 PF06462 Hyd_WA: Propeller; I 22.7 68 0.0015 12.7 1.1 12 4-15 20-31 (32)
116 cd03475 Rieske_SoxF_SoxL SoxF 22.2 1.7E+02 0.0037 16.8 4.4 7 32-38 154-160 (171)
117 PRK09965 3-phenylpropionate di 22.1 1.3E+02 0.0027 15.3 3.9 6 31-36 94-99 (106)
118 PF04096 Nucleoporin2: Nucleop 21.9 1.2E+02 0.0026 16.6 2.3 16 47-62 122-137 (141)
119 cd00094 HX Hemopexin-like repe 21.8 1.7E+02 0.0037 16.6 4.4 22 31-59 110-131 (194)
120 COG2131 ComEB Deoxycytidylate 21.7 1.6E+02 0.0035 16.8 2.8 25 18-44 25-49 (164)
121 PF01743 PolyA_pol: Poly A pol 21.5 38 0.00082 17.9 0.3 7 34-40 1-7 (126)
122 PF12091 DUF3567: Protein of u 21.3 1.3E+02 0.0029 15.2 3.0 39 1-39 1-45 (85)
123 PF00568 WH1: WH1 domain; Int 20.7 1.4E+02 0.0031 15.3 3.9 15 48-62 17-32 (111)
124 PF04122 CW_binding_2: Putativ 20.4 79 0.0017 15.5 1.3 11 31-41 73-83 (92)
125 KOG2445|consensus 20.4 2.3E+02 0.0051 18.2 3.5 36 31-66 183-220 (361)
No 1
>KOG4441|consensus
Probab=99.84 E-value=2e-20 Score=118.77 Aligned_cols=75 Identities=43% Similarity=0.641 Sum_probs=69.6
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
.+|||.+++|+.+++|+ |.+++++++ +++||++||.++.. .++++++|||.+++|+.+++|+.+|..++++++++
T Consensus 399 E~YDp~~~~W~~va~m~~~r~~~gv~~~--~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~ 476 (571)
T KOG4441|consen 399 ECYDPVTNKWTPVAPMLTRRSGHGVAVL--GGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAVLNG 476 (571)
T ss_pred EEecCCCCcccccCCCCcceeeeEEEEE--CCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcccccccceEEEECC
Confidence 48999999999999885 788999999 99999999988776 79999999999999999999999999999999875
No 2
>KOG4441|consensus
Probab=99.83 E-value=2.2e-20 Score=118.60 Aligned_cols=75 Identities=43% Similarity=0.685 Sum_probs=70.5
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
-+||+++++|+.+++|. |..++++++ ++.||++||+++...++++|+|||.+++|+.+++|+.+|..+++++++|
T Consensus 352 e~YD~~~~~W~~~a~M~~~R~~~~v~~l--~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g 428 (571)
T KOG4441|consen 352 ERYDPRTNQWTPVAPMNTKRSDFGVAVL--DGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGG 428 (571)
T ss_pred EEecCCCCceeccCCccCccccceeEEE--CCEEEEEeccccccccccEEEecCCCCcccccCCCCcceeeeEEEEECC
Confidence 37999999999999886 889999999 9999999999998889999999999999999999999999999999875
No 3
>PHA02713 hypothetical protein; Provisional
Probab=99.80 E-value=5.4e-19 Score=112.18 Aligned_cols=75 Identities=21% Similarity=0.356 Sum_probs=67.9
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
.+||+.+++|..+++|+ |..++++++ +++||++||.++....+++++|||.+++|..+++|+.+|..+++++++|
T Consensus 323 ~~Yd~~~n~W~~~~~m~~~R~~~~~~~~--~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g 399 (557)
T PHA02713 323 YKINIENKIHVELPPMIKNRCRFSLAVI--DDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQ 399 (557)
T ss_pred EEEECCCCeEeeCCCCcchhhceeEEEE--CCEEEEECCcCCCCCCceEEEEECCCCeEEECCCCCcccccccEEEECC
Confidence 47999999999999876 788889999 9999999998766667899999999999999999999999999988775
No 4
>PF13964 Kelch_6: Kelch motif
Probab=99.79 E-value=6.7e-19 Score=80.16 Aligned_cols=49 Identities=35% Similarity=0.667 Sum_probs=45.2
Q ss_pred ccceeeeEEECCCCeEEEEcCCCC-CCCCCeEEEEeCCCCcEEeccCCCCCC
Q psy6079 19 SYMCLGVGVLEPEGPLYAVGGHDG-WSYLNTTERFDPRLRQWSFIAPMLSPR 69 (79)
Q Consensus 19 ~r~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~yd~~~~~W~~~~~~~~~r 69 (79)
||.+++++++ +++|||+||..+ ....+++++||+++++|+.+++||.+|
T Consensus 1 pR~~~s~v~~--~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVV--GGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR 50 (50)
T ss_pred CCccCEEEEE--CCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence 5888999999 999999999887 566899999999999999999999987
No 5
>PHA02713 hypothetical protein; Provisional
Probab=99.77 E-value=2.1e-18 Score=109.52 Aligned_cols=74 Identities=15% Similarity=0.113 Sum_probs=65.2
Q ss_pred eecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCC-CCeEEEEeCCC-CcEEeccCCCCCCCceeEEEECC
Q psy6079 4 SICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSY-LNTTERFDPRL-RQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~-~~~~~~yd~~~-~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
+|||.+++|+.+++|+ |..++++++ +++||++||.++... .+.+++|||.+ ++|+.+++|+.+|..+++++++|
T Consensus 436 ~YDP~td~W~~v~~m~~~r~~~~~~~~--~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~~~~ 513 (557)
T PHA02713 436 RYDTVNNIWETLPNFWTGTIRPGVVSH--KDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTILHDN 513 (557)
T ss_pred EECCCCCeEeecCCCCcccccCcEEEE--CCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEEECC
Confidence 6999999999999885 788888999 999999999875433 35689999999 89999999999999999999875
No 6
>PHA02790 Kelch-like protein; Provisional
Probab=99.75 E-value=1.3e-17 Score=104.47 Aligned_cols=73 Identities=22% Similarity=0.296 Sum_probs=64.5
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
.+||+.+++|+.+++|+ |..++++++ +++||++||..+. .+++++|||++++|..+++|+.+|..+++++++|
T Consensus 334 e~ydp~~n~W~~~~~l~~~r~~~~~~~~--~g~IYviGG~~~~--~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~ 408 (480)
T PHA02790 334 ERWFHGDAAWVNMPSLLKPRCNPAVASI--NNVIYVIGGHSET--DTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGR 408 (480)
T ss_pred EEEECCCCeEEECCCCCCCCcccEEEEE--CCEEEEecCcCCC--CccEEEEeCCCCEEEeCCCCCCccccceEEEECC
Confidence 47999999999999886 778888889 9999999997643 3679999999999999999999999998888775
No 7
>PHA02790 Kelch-like protein; Provisional
Probab=99.70 E-value=1.4e-16 Score=99.78 Aligned_cols=72 Identities=22% Similarity=0.356 Sum_probs=63.4
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
.+|||.+++|..+++|+ |..++++++ +++||++||.++. +++++||+.+++|..+++|+.+|..+++++++|
T Consensus 290 ~~Ydp~~~~W~~~~~m~~~r~~~~~v~~--~~~iYviGG~~~~---~sve~ydp~~n~W~~~~~l~~~r~~~~~~~~~g 363 (480)
T PHA02790 290 IAVNYISNNWIPIPPMNSPRLYASGVPA--NNKLYVVGGLPNP---TSVERWFHGDAAWVNMPSLLKPRCNPAVASINN 363 (480)
T ss_pred EEEECCCCEEEECCCCCchhhcceEEEE--CCEEEEECCcCCC---CceEEEECCCCeEEECCCCCCCCcccEEEEECC
Confidence 36999999999999775 777888888 9999999997532 579999999999999999999999999988765
No 8
>PHA03098 kelch-like protein; Provisional
Probab=99.70 E-value=1.2e-16 Score=100.81 Aligned_cols=75 Identities=27% Similarity=0.361 Sum_probs=66.9
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
.+||+.+++|..+++|+ |.+++++++ +++||++||..+....+++++||+.+++|..+++||.+|..+++++++|
T Consensus 314 ~~yd~~~~~W~~~~~~~~~R~~~~~~~~--~~~lyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~ 390 (534)
T PHA03098 314 VSYDTKTKSWNKVPELIYPRKNPGVTVF--NNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNN 390 (534)
T ss_pred EEEeCCCCeeeECCCCCcccccceEEEE--CCEEEEEeCCCCCEecceEEEEcCCCCceeeCCCcCcCCccceEEEECC
Confidence 47999999999999664 788899999 9999999998765567899999999999999999999999999887764
No 9
>PLN02153 epithiospecifier protein
Probab=99.69 E-value=4.6e-16 Score=93.86 Aligned_cols=75 Identities=15% Similarity=0.225 Sum_probs=63.2
Q ss_pred ceecCCCCCeeecCCC---ccc---eeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCC-----CCCCCc
Q psy6079 3 SSICLHLVFWFPLLLL---SYM---CLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPM-----LSPRST 71 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~---~r~---~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~-----~~~r~~ 71 (79)
.+||+.+++|+.++++ ||. +++++++ +++||+|||..+...++++++||+++++|+.+++| |.+|..
T Consensus 53 ~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~ 130 (341)
T PLN02153 53 YVFDFNTHTWSIAPANGDVPRISCLGVRMVAV--GTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTF 130 (341)
T ss_pred EEEECCCCEEEEcCccCCCCCCccCceEEEEE--CCEEEEECCCCCCCccCcEEEEECCCCEEEEeccCCCCCCCCCcee
Confidence 4799999999998754 342 5777888 99999999987666678999999999999999887 789999
Q ss_pred eeEEEECC
Q psy6079 72 AGVAVLNS 79 (79)
Q Consensus 72 ~~~~~~~~ 79 (79)
|+++++++
T Consensus 131 ~~~~~~~~ 138 (341)
T PLN02153 131 HSMASDEN 138 (341)
T ss_pred eEEEEECC
Confidence 99887754
No 10
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.68 E-value=8.6e-17 Score=72.18 Aligned_cols=46 Identities=28% Similarity=0.508 Sum_probs=41.6
Q ss_pred ccceeeeEEECCCCeEEEEcCCCC-CCCCCeEEEEeCCCCcEEeccCCC
Q psy6079 19 SYMCLGVGVLEPEGPLYAVGGHDG-WSYLNTTERFDPRLRQWSFIAPML 66 (79)
Q Consensus 19 ~r~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~yd~~~~~W~~~~~~~ 66 (79)
||.+++++++ +++|||+||..+ ....+++++||+++++|+.+++||
T Consensus 1 pR~~~~~~~~--~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp 47 (47)
T PF01344_consen 1 PRSGHAAVVV--GNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP 47 (47)
T ss_dssp -BBSEEEEEE--TTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred CCccCEEEEE--CCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence 5888999999 999999999887 566999999999999999999886
No 11
>PHA03098 kelch-like protein; Provisional
Probab=99.67 E-value=7.5e-16 Score=97.26 Aligned_cols=75 Identities=19% Similarity=0.135 Sum_probs=64.5
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCC-CCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGW-SYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
.+||+.+++|+.++++| |..++++++ +++||++||.... ..++++++||+.+++|..+++||.+|..++++++++
T Consensus 361 ~~yd~~~~~W~~~~~lp~~r~~~~~~~~--~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~ 438 (534)
T PHA03098 361 ESWKPGESKWREEPPLIFPRYNPCVVNV--NNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDG 438 (534)
T ss_pred EEEcCCCCceeeCCCcCcCCccceEEEE--CCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCCccccCceEEEECC
Confidence 47999999999998664 788888888 9999999996533 346899999999999999999999999998887664
No 12
>PLN02153 epithiospecifier protein
Probab=99.67 E-value=1.1e-15 Score=92.24 Aligned_cols=75 Identities=21% Similarity=0.200 Sum_probs=61.6
Q ss_pred ceecCCCCCeeecCCC-------ccceeeeEEECCCCeEEEEcCCCCCC------CCCeEEEEeCCCCcEEeccCCC---
Q psy6079 3 SSICLHLVFWFPLLLL-------SYMCLGVGVLEPEGPLYAVGGHDGWS------YLNTTERFDPRLRQWSFIAPML--- 66 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~-------~r~~~~~~~~~~~~~iyv~GG~~~~~------~~~~~~~yd~~~~~W~~~~~~~--- 66 (79)
.+||+.+++|+.++++ +|..+++++. +++|||+||..... .++++++||+++++|..++++.
T Consensus 104 ~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~~W~~l~~~~~~~ 181 (341)
T PLN02153 104 YSYDTVKNEWTFLTKLDEEGGPEARTFHSMASD--ENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENF 181 (341)
T ss_pred EEEECCCCEEEEeccCCCCCCCCCceeeEEEEE--CCEEEEECCccCCCccCCCcccceEEEEECCCCeEeeCCCCCCCC
Confidence 4799999999988743 5888888888 99999999976321 3578999999999999998764
Q ss_pred CCCCceeEEEECC
Q psy6079 67 SPRSTAGVAVLNS 79 (79)
Q Consensus 67 ~~r~~~~~~~~~~ 79 (79)
.+|..++++++++
T Consensus 182 ~~r~~~~~~~~~~ 194 (341)
T PLN02153 182 EKRGGAGFAVVQG 194 (341)
T ss_pred CCCCcceEEEECC
Confidence 7888888877654
No 13
>smart00612 Kelch Kelch domain.
Probab=99.66 E-value=3.6e-16 Score=69.78 Aligned_cols=47 Identities=43% Similarity=0.800 Sum_probs=42.7
Q ss_pred eEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 33 PLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 33 ~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
+||++||..+....+++++||+.+++|+.+++|+.+|..+++++++|
T Consensus 1 ~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g 47 (47)
T smart00612 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47 (47)
T ss_pred CEEEEeCCCCCceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence 48999998765568899999999999999999999999999999875
No 14
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.66 E-value=1.3e-15 Score=91.32 Aligned_cols=74 Identities=19% Similarity=0.179 Sum_probs=61.3
Q ss_pred eecCCCCCe----eecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCC-CCCCceeEEE
Q psy6079 4 SICLHLVFW----FPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPML-SPRSTAGVAV 76 (79)
Q Consensus 4 ~~~~~~~~W----~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~-~~r~~~~~~~ 76 (79)
+||+.+++| +.++++| |..++++++ +++||++||.......+++++||+.+++|+.+++|+ .+|..+++++
T Consensus 92 ~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~--~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~~~r~~~~~~~ 169 (323)
T TIGR03548 92 RITLDESKEELICETIGNLPFTFENGSACYK--DGTLYVGGGNRNGKPSNKSYLFNLETQEWFELPDFPGEPRVQPVCVK 169 (323)
T ss_pred EEEEcCCceeeeeeEcCCCCcCccCceEEEE--CCEEEEEeCcCCCccCceEEEEcCCCCCeeECCCCCCCCCCcceEEE
Confidence 688888887 6777665 677888888 999999999765555789999999999999999988 4788888777
Q ss_pred ECC
Q psy6079 77 LNS 79 (79)
Q Consensus 77 ~~~ 79 (79)
+++
T Consensus 170 ~~~ 172 (323)
T TIGR03548 170 LQN 172 (323)
T ss_pred ECC
Confidence 654
No 15
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.65 E-value=1.6e-15 Score=91.61 Aligned_cols=75 Identities=13% Similarity=0.118 Sum_probs=58.9
Q ss_pred ceecCCCCCeeecC-CCcc--ceeeeE-EECCCCeEEEEcCCCCCC----------------------------------
Q psy6079 3 SSICLHLVFWFPLL-LLSY--MCLGVG-VLEPEGPLYAVGGHDGWS---------------------------------- 44 (79)
Q Consensus 3 ~~~~~~~~~W~~~~-~~~r--~~~~~~-~~~~~~~iyv~GG~~~~~---------------------------------- 44 (79)
.+||+.+++|+.++ ++|| .+++++ ++ +++||++||.+...
T Consensus 88 ~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~--~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (346)
T TIGR03547 88 YRYDPKKNSWQKLDTRSPVGLLGASGFSLH--NGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYF 165 (346)
T ss_pred EEEECCCCEEecCCCCCCCcccceeEEEEe--CCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcC
Confidence 47999999999997 5565 344444 56 99999999976321
Q ss_pred CCCeEEEEeCCCCcEEeccCCCC-CCCceeEEEECC
Q psy6079 45 YLNTTERFDPRLRQWSFIAPMLS-PRSTAGVAVLNS 79 (79)
Q Consensus 45 ~~~~~~~yd~~~~~W~~~~~~~~-~r~~~~~~~~~~ 79 (79)
..+++++||+.+++|..+++||. +|..++++++++
T Consensus 166 ~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~ 201 (346)
T TIGR03547 166 WNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHKGN 201 (346)
T ss_pred ccceEEEEECCCCceeECccCCCCcCCCceEEEECC
Confidence 13789999999999999999986 688888877764
No 16
>PLN02193 nitrile-specifier protein
Probab=99.64 E-value=3.4e-15 Score=93.48 Aligned_cols=75 Identities=21% Similarity=0.344 Sum_probs=63.2
Q ss_pred ceecCCCCCeeecCC---Cc---cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCC---CCCCCcee
Q psy6079 3 SSICLHLVFWFPLLL---LS---YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPM---LSPRSTAG 73 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~---~~---r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~---~~~r~~~~ 73 (79)
.+||+.+++|+.+++ +| |..++++++ +++|||+||.+....++++++||+.+++|..++++ |.+|..|+
T Consensus 196 ~~yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~--~~~lYvfGG~~~~~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~ 273 (470)
T PLN02193 196 YVFDLETRTWSISPATGDVPHLSCLGVRMVSI--GSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHS 273 (470)
T ss_pred EEEECCCCEEEeCCCCCCCCCCcccceEEEEE--CCEEEEECCCCCCCCCccEEEEECCCCEEEEcCcCCCCCCCccceE
Confidence 479999999998763 33 356778888 99999999988766689999999999999999888 78999999
Q ss_pred EEEECC
Q psy6079 74 VAVLNS 79 (79)
Q Consensus 74 ~~~~~~ 79 (79)
+++.++
T Consensus 274 ~~~~~~ 279 (470)
T PLN02193 274 MAADEE 279 (470)
T ss_pred EEEECC
Confidence 887653
No 17
>PLN02193 nitrile-specifier protein
Probab=99.63 E-value=4.2e-15 Score=93.06 Aligned_cols=75 Identities=20% Similarity=0.254 Sum_probs=64.2
Q ss_pred ceecCCCCCeeecCCC-----ccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC---CCCCCCceeE
Q psy6079 3 SSICLHLVFWFPLLLL-----SYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP---MLSPRSTAGV 74 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~-----~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~---~~~~r~~~~~ 74 (79)
.+||+.+++|+.++++ +|..+++++. +++|||+||.+....++++++||+.+++|..++. ++.+|..+++
T Consensus 247 ~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~--~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~~~~~~~~~~~~R~~~~~ 324 (470)
T PLN02193 247 YSFDTTTNEWKLLTPVEEGPTPRSFHSMAAD--EENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGL 324 (470)
T ss_pred EEEECCCCEEEEcCcCCCCCCCccceEEEEE--CCEEEEECCCCCCCCcceEEEEECCCCEEEeCCCCCCCCCCCCCcEE
Confidence 4799999999998754 4888888888 9999999998876668899999999999999864 5678988888
Q ss_pred EEECC
Q psy6079 75 AVLNS 79 (79)
Q Consensus 75 ~~~~~ 79 (79)
+++++
T Consensus 325 ~~~~g 329 (470)
T PLN02193 325 EVVQG 329 (470)
T ss_pred EEECC
Confidence 87653
No 18
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=99.63 E-value=2e-15 Score=68.37 Aligned_cols=47 Identities=34% Similarity=0.527 Sum_probs=42.4
Q ss_pred CCeEEEEcCCC--CCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEE
Q psy6079 31 EGPLYAVGGHD--GWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVL 77 (79)
Q Consensus 31 ~~~iyv~GG~~--~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~ 77 (79)
+++|||+||.. ....++++++||+.+++|++++++|.+|..|+++++
T Consensus 1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i 49 (49)
T PF13415_consen 1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI 49 (49)
T ss_pred CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence 47899999988 455689999999999999999999999999999874
No 19
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.58 E-value=2.1e-14 Score=86.70 Aligned_cols=74 Identities=18% Similarity=0.222 Sum_probs=57.2
Q ss_pred ceecC--CCCCeeecCCCc---cceeeeEEECCCCeEEEEcCCCCCC------CCCeEEEEeCCCCcEEecc-CCCCCCC
Q psy6079 3 SSICL--HLVFWFPLLLLS---YMCLGVGVLEPEGPLYAVGGHDGWS------YLNTTERFDPRLRQWSFIA-PMLSPRS 70 (79)
Q Consensus 3 ~~~~~--~~~~W~~~~~~~---r~~~~~~~~~~~~~iyv~GG~~~~~------~~~~~~~yd~~~~~W~~~~-~~~~~r~ 70 (79)
.+||+ .+++|..+++|| |..++++++ +++|||+||..... .++++++||+.+++|+.++ +++.+|.
T Consensus 32 ~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~--~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~~~p~~~~ 109 (346)
T TIGR03547 32 YKLDLKKPSKGWQKIADFPGGPRNQAVAAAI--DGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLL 109 (346)
T ss_pred EEEECCCCCCCceECCCCCCCCcccceEEEE--CCEEEEEeCCCCCCCCCcceecccEEEEECCCCEEecCCCCCCCccc
Confidence 35665 578999999764 677888889 99999999975321 3678999999999999997 4566676
Q ss_pred ceeEE-EEC
Q psy6079 71 TAGVA-VLN 78 (79)
Q Consensus 71 ~~~~~-~~~ 78 (79)
.++++ +++
T Consensus 110 ~~~~~~~~~ 118 (346)
T TIGR03547 110 GASGFSLHN 118 (346)
T ss_pred ceeEEEEeC
Confidence 66655 444
No 20
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=99.58 E-value=1.3e-14 Score=65.66 Aligned_cols=46 Identities=33% Similarity=0.488 Sum_probs=40.4
Q ss_pred ccceeeeEEECCCCeEEEEcCC---CCCCCCCeEEEEeCCCCcEEeccCCC
Q psy6079 19 SYMCLGVGVLEPEGPLYAVGGH---DGWSYLNTTERFDPRLRQWSFIAPML 66 (79)
Q Consensus 19 ~r~~~~~~~~~~~~~iyv~GG~---~~~~~~~~~~~yd~~~~~W~~~~~~~ 66 (79)
||.+++++++ +++||++||+ ......+++++||+++++|+.+++|+
T Consensus 1 ~r~~hs~~~~--~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g 49 (49)
T PF07646_consen 1 PRYGHSAVVL--DGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG 49 (49)
T ss_pred CccceEEEEE--CCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence 5788999999 9999999999 34455899999999999999998874
No 21
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.56 E-value=3.9e-14 Score=86.63 Aligned_cols=75 Identities=15% Similarity=0.118 Sum_probs=57.8
Q ss_pred ceecCCCCCeeecCC-Ccc--ceeeeEE-ECCCCeEEEEcCCCCC----------------------------------C
Q psy6079 3 SSICLHLVFWFPLLL-LSY--MCLGVGV-LEPEGPLYAVGGHDGW----------------------------------S 44 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~-~~r--~~~~~~~-~~~~~~iyv~GG~~~~----------------------------------~ 44 (79)
.+||+.+++|+.+++ +|+ .++++++ . +++||++||.... .
T Consensus 109 ~~YD~~~n~W~~~~~~~p~~~~~~~~~~~~--~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~ 186 (376)
T PRK14131 109 YKYDPKTNSWQKLDTRSPVGLAGHVAVSLH--NGKAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKPEDYF 186 (376)
T ss_pred EEEeCCCCEEEeCCCCCCCcccceEEEEee--CCEEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCChhhcC
Confidence 369999999999984 354 4455555 6 9999999997531 0
Q ss_pred CCCeEEEEeCCCCcEEeccCCCC-CCCceeEEEECC
Q psy6079 45 YLNTTERFDPRLRQWSFIAPMLS-PRSTAGVAVLNS 79 (79)
Q Consensus 45 ~~~~~~~yd~~~~~W~~~~~~~~-~r~~~~~~~~~~ 79 (79)
..+++++||+.+++|..+++||. +|..++++++++
T Consensus 187 ~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~ 222 (376)
T PRK14131 187 FNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIKGN 222 (376)
T ss_pred cCceEEEEECCCCeeeECCcCCCCCCCcceEEEECC
Confidence 14689999999999999999996 677777777654
No 22
>KOG0379|consensus
Probab=99.55 E-value=4.4e-14 Score=88.82 Aligned_cols=75 Identities=24% Similarity=0.272 Sum_probs=65.9
Q ss_pred ceecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEec---cCCCCCCCcee
Q psy6079 3 SSICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFI---APMLSPRSTAG 73 (79)
Q Consensus 3 ~~~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~---~~~~~~r~~~~ 73 (79)
|.||+.+.+|..+. |.+|.+|++++. ++++||+||..... ..+++++||+++.+|.++ ...|.||.+|+
T Consensus 142 ~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~--g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~P~pR~gH~ 219 (482)
T KOG0379|consen 142 HSLDLSTRTWSLLSPTGDPPPPRAGHSATVV--GTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEAPSPRYGHA 219 (482)
T ss_pred EeccCCCCcEEEecCcCCCCCCcccceEEEE--CCEEEEECCccCcccceeeeeeeccccccceecccCCCCCCCCCCce
Confidence 67999999999986 556999999999 99999999987665 689999999999999988 45678999999
Q ss_pred EEEECC
Q psy6079 74 VAVLNS 79 (79)
Q Consensus 74 ~~~~~~ 79 (79)
+++.++
T Consensus 220 ~~~~~~ 225 (482)
T KOG0379|consen 220 MVVVGN 225 (482)
T ss_pred EEEECC
Confidence 988753
No 23
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.49 E-value=3.4e-13 Score=80.94 Aligned_cols=67 Identities=12% Similarity=0.079 Sum_probs=57.4
Q ss_pred CeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcE----EeccCCCCCCCceeEEEECC
Q psy6079 11 FWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQW----SFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 11 ~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W----~~~~~~~~~r~~~~~~~~~~ 79 (79)
+|..+++|| |..++++++ +++||++||..+...++++++||+.+++| ..+++||.+|..++++++++
T Consensus 52 ~W~~~~~lp~~r~~~~~~~~--~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~ 124 (323)
T TIGR03548 52 KWVKDGQLPYEAAYGASVSV--ENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDG 124 (323)
T ss_pred eEEEcccCCccccceEEEEE--CCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECC
Confidence 799988665 666777888 99999999987766688999999999998 78899999999999888764
No 24
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=99.49 E-value=3.9e-14 Score=64.01 Aligned_cols=47 Identities=23% Similarity=0.342 Sum_probs=30.1
Q ss_pred ccceeeeEEECCCCeEEEEcCCCCC-CCCCeEEEEeCCCCcEEeccCCC
Q psy6079 19 SYMCLGVGVLEPEGPLYAVGGHDGW-SYLNTTERFDPRLRQWSFIAPML 66 (79)
Q Consensus 19 ~r~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~yd~~~~~W~~~~~~~ 66 (79)
||.+|+++.+. +++|||+||.+.. ..++++++||+++++|++++++|
T Consensus 1 pR~~h~~~~~~-~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P 48 (49)
T PF13418_consen 1 PRYGHSAVSIG-DNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP 48 (49)
T ss_dssp --BS-EEEEE--TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred CcceEEEEEEe-CCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence 57888888872 4799999998876 46899999999999999998887
No 25
>KOG4693|consensus
Probab=99.48 E-value=2e-13 Score=80.22 Aligned_cols=75 Identities=16% Similarity=0.158 Sum_probs=63.5
Q ss_pred ceecCCCCCeeecCC---------------CccceeeeEEECCCCeEEEEcCCCCC-CCCCeEEEEeCCCCcEEec---c
Q psy6079 3 SSICLHLVFWFPLLL---------------LSYMCLGVGVLEPEGPLYAVGGHDGW-SYLNTTERFDPRLRQWSFI---A 63 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~---------------~~r~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~yd~~~~~W~~~---~ 63 (79)
|.+|.++-+|++++| ..|++++++.+ ++++|+.||+++. ...+.+++|||++++|.+. .
T Consensus 47 H~lNa~~~RWtk~pp~~~ka~i~~~yp~VPyqRYGHtvV~y--~d~~yvWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G 124 (392)
T KOG4693|consen 47 HVLNAENYRWTKMPPGITKATIESPYPAVPYQRYGHTVVEY--QDKAYVWGGRNDDEGACNLLYEFDPETNVWKKPEVEG 124 (392)
T ss_pred EEeeccceeEEecCcccccccccCCCCccchhhcCceEEEE--cceEEEEcCccCcccccceeeeeccccccccccceee
Confidence 567888889999873 23788999988 9999999999874 4578999999999999976 5
Q ss_pred CCCCCCCceeEEEECC
Q psy6079 64 PMLSPRSTAGVAVLNS 79 (79)
Q Consensus 64 ~~~~~r~~~~~~~~~~ 79 (79)
..|.+|.+|++|++++
T Consensus 125 ~vPgaRDGHsAcV~gn 140 (392)
T KOG4693|consen 125 FVPGARDGHSACVWGN 140 (392)
T ss_pred ecCCccCCceeeEECc
Confidence 6788999999999864
No 26
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.48 E-value=5.6e-13 Score=81.57 Aligned_cols=74 Identities=15% Similarity=0.240 Sum_probs=56.9
Q ss_pred ceecCC--CCCeeecCCC---ccceeeeEEECCCCeEEEEcCCCC-C-----CCCCeEEEEeCCCCcEEeccC-CCCCCC
Q psy6079 3 SSICLH--LVFWFPLLLL---SYMCLGVGVLEPEGPLYAVGGHDG-W-----SYLNTTERFDPRLRQWSFIAP-MLSPRS 70 (79)
Q Consensus 3 ~~~~~~--~~~W~~~~~~---~r~~~~~~~~~~~~~iyv~GG~~~-~-----~~~~~~~~yd~~~~~W~~~~~-~~~~r~ 70 (79)
.+||+. +++|..++++ +|.+++++++ +++|||+||... . ..++++++||+.+++|..+++ ++.+|.
T Consensus 53 ~~~d~~~~~~~W~~l~~~p~~~r~~~~~v~~--~~~IYV~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~~p~~~~ 130 (376)
T PRK14131 53 YKLDLNAPSKGWTKIAAFPGGPREQAVAAFI--DGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLA 130 (376)
T ss_pred EEEECCCCCCCeEECCcCCCCCcccceEEEE--CCEEEEEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCCCCCccc
Confidence 357775 4789999865 4778888888 999999999764 1 236789999999999999985 466666
Q ss_pred ceeEEE-EC
Q psy6079 71 TAGVAV-LN 78 (79)
Q Consensus 71 ~~~~~~-~~ 78 (79)
.+++++ .+
T Consensus 131 ~~~~~~~~~ 139 (376)
T PRK14131 131 GHVAVSLHN 139 (376)
T ss_pred ceEEEEeeC
Confidence 666655 44
No 27
>KOG4693|consensus
Probab=99.40 E-value=1.1e-12 Score=77.25 Aligned_cols=73 Identities=25% Similarity=0.293 Sum_probs=60.2
Q ss_pred ecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCCCC--CCCCeEEEEeCCCCcEEecc---CCCCCCCceeE
Q psy6079 5 ICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHDGW--SYLNTTERFDPRLRQWSFIA---PMLSPRSTAGV 74 (79)
Q Consensus 5 ~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~~~--~~~~~~~~yd~~~~~W~~~~---~~~~~r~~~~~ 74 (79)
+|..|+.|..-+ |-.|..++..+. ++++|+|||+++. ..++++++|||++..|..|. .-|.+|...++
T Consensus 221 ld~~T~aW~r~p~~~~~P~GRRSHS~fvY--ng~~Y~FGGYng~ln~HfndLy~FdP~t~~W~~I~~~Gk~P~aRRRqC~ 298 (392)
T KOG4693|consen 221 LDLATGAWTRTPENTMKPGGRRSHSTFVY--NGKMYMFGGYNGTLNVHFNDLYCFDPKTSMWSVISVRGKYPSARRRQCS 298 (392)
T ss_pred EeccccccccCCCCCcCCCcccccceEEE--cceEEEecccchhhhhhhcceeecccccchheeeeccCCCCCcccceeE
Confidence 578899998876 223778889999 9999999999873 35899999999999999984 34788888888
Q ss_pred EEECC
Q psy6079 75 AVLNS 79 (79)
Q Consensus 75 ~~~~~ 79 (79)
++.++
T Consensus 299 ~v~g~ 303 (392)
T KOG4693|consen 299 VVSGG 303 (392)
T ss_pred EEECC
Confidence 87654
No 28
>KOG1230|consensus
Probab=99.33 E-value=7.8e-12 Score=76.87 Aligned_cols=72 Identities=24% Similarity=0.257 Sum_probs=61.4
Q ss_pred eecCCCCCeeecC----CCccceeeeEEECCCCeEEEEcCCCCCC----CCCeEEEEeCCCCcEEeccC---CCCCCCce
Q psy6079 4 SICLHLVFWFPLL----LLSYMCLGVGVLEPEGPLYAVGGHDGWS----YLNTTERFDPRLRQWSFIAP---MLSPRSTA 72 (79)
Q Consensus 4 ~~~~~~~~W~~~~----~~~r~~~~~~~~~~~~~iyv~GG~~~~~----~~~~~~~yd~~~~~W~~~~~---~~~~r~~~ 72 (79)
.+|..+.+|+++. |.||.+|.+++. .++|++|||+.+.. +++++++||+.+-+|.++.+ -|.+|+++
T Consensus 158 ~fd~~trkweql~~~g~PS~RSGHRMvaw--K~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRSGc 235 (521)
T KOG1230|consen 158 LFDLKTRKWEQLEFGGGPSPRSGHRMVAW--KRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRSGC 235 (521)
T ss_pred eeeeccchheeeccCCCCCCCccceeEEe--eeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCCcc
Confidence 4788999999997 888999999999 99999999976533 48999999999999999844 27899988
Q ss_pred eEEEE
Q psy6079 73 GVAVL 77 (79)
Q Consensus 73 ~~~~~ 77 (79)
...+.
T Consensus 236 q~~vt 240 (521)
T KOG1230|consen 236 QFSVT 240 (521)
T ss_pred eEEec
Confidence 77654
No 29
>KOG0379|consensus
Probab=99.29 E-value=2.4e-11 Score=76.65 Aligned_cols=74 Identities=26% Similarity=0.336 Sum_probs=64.3
Q ss_pred ceecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCC-CCCCCCeEEEEeCCCCcEEecc---CCCCCCCcee
Q psy6079 3 SSICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHD-GWSYLNTTERFDPRLRQWSFIA---PMLSPRSTAG 73 (79)
Q Consensus 3 ~~~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~-~~~~~~~~~~yd~~~~~W~~~~---~~~~~r~~~~ 73 (79)
+.||+++.+|.++. |.||.+|+++++ ++++++|||.. +...+++++++|..+.+|..++ ..|.+|..|.
T Consensus 193 ~i~d~~~~~W~~~~~~g~~P~pR~gH~~~~~--~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~~~g~~p~~R~~h~ 270 (482)
T KOG0379|consen 193 HIYDLETSTWSELDTQGEAPSPRYGHAMVVV--GNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLPTGGDLPSPRSGHS 270 (482)
T ss_pred eeeccccccceecccCCCCCCCCCCceEEEE--CCeEEEEeccccCCceecceEeeecccceeeeccccCCCCCCcceee
Confidence 67999999999986 668999999999 99999999877 5667999999999999999664 5679999998
Q ss_pred EEEEC
Q psy6079 74 VAVLN 78 (79)
Q Consensus 74 ~~~~~ 78 (79)
.++.+
T Consensus 271 ~~~~~ 275 (482)
T KOG0379|consen 271 LTVSG 275 (482)
T ss_pred eEEEC
Confidence 87543
No 30
>PLN02772 guanylate kinase
Probab=99.13 E-value=4.3e-10 Score=69.29 Aligned_cols=61 Identities=15% Similarity=0.117 Sum_probs=52.3
Q ss_pred CCccceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEec---cCCCCCCCceeEEEECC
Q psy6079 17 LLSYMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFI---APMLSPRSTAGVAVLNS 79 (79)
Q Consensus 17 ~~~r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~---~~~~~~r~~~~~~~~~~ 79 (79)
.-|+..++++++ ++++||+||.++.. ..+.+++||..+++|... ...|.+|.+|+++++++
T Consensus 22 ~~~~~~~tav~i--gdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~ 86 (398)
T PLN02772 22 VKPKNRETSVTI--GDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNK 86 (398)
T ss_pred CCCCCcceeEEE--CCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECC
Confidence 346888999999 99999999988754 578999999999999975 46689999999999864
No 31
>PF13854 Kelch_5: Kelch motif
Probab=99.12 E-value=3.3e-10 Score=49.56 Aligned_cols=39 Identities=23% Similarity=0.169 Sum_probs=33.4
Q ss_pred CCccceeeeEEECCCCeEEEEcCCCC--CCCCCeEEEEeCCCC
Q psy6079 17 LLSYMCLGVGVLEPEGPLYAVGGHDG--WSYLNTTERFDPRLR 57 (79)
Q Consensus 17 ~~~r~~~~~~~~~~~~~iyv~GG~~~--~~~~~~~~~yd~~~~ 57 (79)
|.+|.+|+++++ +++||++||.+. ...++++++||..+.
T Consensus 2 P~~R~~hs~~~~--~~~iyi~GG~~~~~~~~~~d~~~l~l~sf 42 (42)
T PF13854_consen 2 PSPRYGHSAVVV--GNNIYIFGGYSGNNNSYSNDLYVLDLPSF 42 (42)
T ss_pred CCCccceEEEEE--CCEEEEEcCccCCCCCEECcEEEEECCCC
Confidence 567999999999 999999999883 556899999998763
No 32
>KOG1230|consensus
Probab=99.09 E-value=5.1e-10 Score=69.08 Aligned_cols=73 Identities=18% Similarity=0.153 Sum_probs=58.6
Q ss_pred ceecCCCCCeeecC----CCccceeeeEEECCCCeEEEEcCCCCC------CCCCeEEEEeCCCCcEEec--cCCCCCCC
Q psy6079 3 SSICLHLVFWFPLL----LLSYMCLGVGVLEPEGPLYAVGGHDGW------SYLNTTERFDPRLRQWSFI--APMLSPRS 70 (79)
Q Consensus 3 ~~~~~~~~~W~~~~----~~~r~~~~~~~~~~~~~iyv~GG~~~~------~~~~~~~~yd~~~~~W~~~--~~~~~~r~ 70 (79)
.+||.++++|..+. |.||..|.+++++ .+.+|++||--.. ....++|.||..+++|+++ +.-|.+|+
T Consensus 101 y~Yn~k~~eWkk~~spn~P~pRsshq~va~~-s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RS 179 (521)
T KOG1230|consen 101 YSYNTKKNEWKKVVSPNAPPPRSSHQAVAVP-SNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRS 179 (521)
T ss_pred eEEeccccceeEeccCCCcCCCccceeEEec-cCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCCCc
Confidence 37899999999985 6779888888874 4799999993211 1378999999999999998 45689999
Q ss_pred ceeEEE
Q psy6079 71 TAGVAV 76 (79)
Q Consensus 71 ~~~~~~ 76 (79)
+|-.++
T Consensus 180 GHRMva 185 (521)
T KOG1230|consen 180 GHRMVA 185 (521)
T ss_pred cceeEE
Confidence 997665
No 33
>KOG4152|consensus
Probab=99.03 E-value=7.1e-10 Score=70.27 Aligned_cols=72 Identities=11% Similarity=0.170 Sum_probs=58.4
Q ss_pred ecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCC------------CC--CCCCeEEEEeCCCCcEEecc--
Q psy6079 5 ICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHD------------GW--SYLNTTERFDPRLRQWSFIA-- 63 (79)
Q Consensus 5 ~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~------------~~--~~~~~~~~yd~~~~~W~~~~-- 63 (79)
.|++|..|.+.. ||||.-|++.++ ++++|||||.- +. ...+++.++|..+..|+.+-
T Consensus 235 Ldl~Tl~W~kp~~~G~~PlPRSLHsa~~I--GnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~clNldt~~W~tl~~d 312 (830)
T KOG4152|consen 235 LDLDTLTWNKPSLSGVAPLPRSLHSATTI--GNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLACLNLDTMAWETLLMD 312 (830)
T ss_pred EecceeecccccccCCCCCCcccccceee--cceeEEecceeeeeccccccccccceeeeccceeeeeecchheeeeeec
Confidence 477888898764 999999999999 99999999941 11 12678889999999999872
Q ss_pred -----CCCCCCCceeEEEEC
Q psy6079 64 -----PMLSPRSTAGVAVLN 78 (79)
Q Consensus 64 -----~~~~~r~~~~~~~~~ 78 (79)
..|.+|.+|++++++
T Consensus 313 ~~ed~tiPR~RAGHCAvAig 332 (830)
T KOG4152|consen 313 TLEDNTIPRARAGHCAVAIG 332 (830)
T ss_pred cccccccccccccceeEEec
Confidence 368899999998875
No 34
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.82 E-value=1.7e-08 Score=61.37 Aligned_cols=67 Identities=22% Similarity=0.296 Sum_probs=50.4
Q ss_pred CCeeecCCCc---cceeeeEEECCCCeEEEEcCCCCCC-----CCCeEEEEeCCCCcEEeccCC-CCCCCceeEEEEC
Q psy6079 10 VFWFPLLLLS---YMCLGVGVLEPEGPLYAVGGHDGWS-----YLNTTERFDPRLRQWSFIAPM-LSPRSTAGVAVLN 78 (79)
Q Consensus 10 ~~W~~~~~~~---r~~~~~~~~~~~~~iyv~GG~~~~~-----~~~~~~~yd~~~~~W~~~~~~-~~~r~~~~~~~~~ 78 (79)
..|+.++..| |.+...+++ +++||+|||..... .++++++|||.+++|+++... |+.-..+.++.++
T Consensus 70 k~W~~~a~FpG~~rnqa~~a~~--~~kLyvFgG~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~~ 145 (381)
T COG3055 70 KGWTKIADFPGGARNQAVAAVI--GGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLN 145 (381)
T ss_pred CCceEcccCCCcccccchheee--CCeEEEeeccccCCCCCceEeeeeEEecCCCChhheeccccccccccceeEecC
Confidence 5899998544 899999999 99999999965322 278999999999999998543 3343444555544
No 35
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.43 E-value=4e-07 Score=55.54 Aligned_cols=70 Identities=17% Similarity=0.195 Sum_probs=50.7
Q ss_pred ceecCCCCCeeecC-CCcc--ceeeeEEECCCC-eEEEEcCCCCC----------------------------------C
Q psy6079 3 SSICLHLVFWFPLL-LLSY--MCLGVGVLEPEG-PLYAVGGHDGW----------------------------------S 44 (79)
Q Consensus 3 ~~~~~~~~~W~~~~-~~~r--~~~~~~~~~~~~-~iyv~GG~~~~----------------------------------~ 44 (79)
++|||.+|+|.++. ..|+ .++.++.+ ++ +||++||++.. .
T Consensus 116 Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~--~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i~~~yf~~~~~dy~ 193 (381)
T COG3055 116 YRYDPSTNSWHKLDTRSPTGLVGASTFSL--NGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKIIAHYFDKKAEDYF 193 (381)
T ss_pred EEecCCCChhheeccccccccccceeEec--CCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHHHHHHhCCCHHHhc
Confidence 58999999999998 3344 56666666 77 99999996321 0
Q ss_pred CCCeEEEEeCCCCcEEeccCCC-CCCCceeE
Q psy6079 45 YLNTTERFDPRLRQWSFIAPML-SPRSTAGV 74 (79)
Q Consensus 45 ~~~~~~~yd~~~~~W~~~~~~~-~~r~~~~~ 74 (79)
....+..|+|.+++|+.....| .+++..+.
T Consensus 194 ~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~ 224 (381)
T COG3055 194 FNKEVLSYDPSTNQWRNLGENPFYGNAGSAV 224 (381)
T ss_pred ccccccccccccchhhhcCcCcccCccCcce
Confidence 1356778999999999997555 56665433
No 36
>KOG4152|consensus
Probab=98.09 E-value=5.3e-06 Score=53.30 Aligned_cols=64 Identities=11% Similarity=0.135 Sum_probs=49.5
Q ss_pred CCeeecC------CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEec---cCCCCCCCceeEEE
Q psy6079 10 VFWFPLL------LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFI---APMLSPRSTAGVAV 76 (79)
Q Consensus 10 ~~W~~~~------~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~---~~~~~~r~~~~~~~ 76 (79)
-+|.++. |-||.+|.++++ .+.|.||||-++. ...++++||..+++|..- .++|.+...|+++.
T Consensus 17 ~rWrrV~~~tGPvPrpRHGHRAVai--kELiviFGGGNEG-iiDELHvYNTatnqWf~PavrGDiPpgcAA~Gfvc 89 (830)
T KOG4152|consen 17 VRWRRVQQSTGPVPRPRHGHRAVAI--KELIVIFGGGNEG-IIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVC 89 (830)
T ss_pred cceEEEecccCCCCCccccchheee--eeeEEEecCCccc-chhhhhhhccccceeecchhcCCCCCchhhcceEe
Confidence 4788875 445799999999 9999999986543 468899999999999864 45666666666654
No 37
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.00 E-value=4.2e-05 Score=45.00 Aligned_cols=74 Identities=20% Similarity=0.231 Sum_probs=54.4
Q ss_pred ceecCCCCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCC----CcEEecc-CCCCCCCceeEEEE
Q psy6079 3 SSICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRL----RQWSFIA-PMLSPRSTAGVAVL 77 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~----~~W~~~~-~~~~~r~~~~~~~~ 77 (79)
..||+.+++++.+....-..++..++-.++.+.++||..+. ...+..|++.+ ..|...+ .|..+|..+++..+
T Consensus 49 ~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~G--~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L 126 (243)
T PF07250_consen 49 VEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDNDG--NKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTL 126 (243)
T ss_pred EEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCcc--ccceEEEecCCCCCCCCceECcccccCCCccccceEC
Confidence 57999999999987544333333333348899999997652 45678888876 6798775 58999999988876
Q ss_pred C
Q psy6079 78 N 78 (79)
Q Consensus 78 ~ 78 (79)
.
T Consensus 127 ~ 127 (243)
T PF07250_consen 127 P 127 (243)
T ss_pred C
Confidence 4
No 38
>smart00612 Kelch Kelch domain.
Probab=97.82 E-value=2.7e-05 Score=33.86 Aligned_cols=26 Identities=23% Similarity=0.240 Sum_probs=20.9
Q ss_pred ceecCCCCCeeecCCCc--cceeeeEEE
Q psy6079 3 SSICLHLVFWFPLLLLS--YMCLGVGVL 28 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~ 28 (79)
.+||+.+++|+.+++|+ |..++++++
T Consensus 18 ~~yd~~~~~W~~~~~~~~~r~~~~~~~~ 45 (47)
T smart00612 18 EVYDPETNKWTPLPSMPTPRSGHGVAVI 45 (47)
T ss_pred EEECCCCCeEccCCCCCCccccceEEEe
Confidence 47999999999999776 666666665
No 39
>KOG2437|consensus
Probab=97.53 E-value=9.5e-05 Score=47.61 Aligned_cols=66 Identities=18% Similarity=0.154 Sum_probs=51.0
Q ss_pred CCCeeecCC------------CccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEecc---CCCCCCCcee
Q psy6079 9 LVFWFPLLL------------LSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIA---PMLSPRSTAG 73 (79)
Q Consensus 9 ~~~W~~~~~------------~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~---~~~~~r~~~~ 73 (79)
+.+|++.++ -.|.+|-++....++.+|+-||.++...+.++|.|+...++|..+. ..|..|..|-
T Consensus 238 ~~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGGWdG~~~l~DFW~Y~v~e~~W~~iN~~t~~PG~RsCHR 317 (723)
T KOG2437|consen 238 KPRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGGWDGTQDLADFWAYSVKENQWTCINRDTEGPGARSCHR 317 (723)
T ss_pred cccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecCcccchhHHHHHhhcCCcceeEEeecCCCCCcchhhhh
Confidence 567887652 2378888888744558999999999888999999999999999873 3566777664
Q ss_pred E
Q psy6079 74 V 74 (79)
Q Consensus 74 ~ 74 (79)
.
T Consensus 318 M 318 (723)
T KOG2437|consen 318 M 318 (723)
T ss_pred h
Confidence 4
No 40
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=97.28 E-value=0.00035 Score=31.07 Aligned_cols=26 Identities=12% Similarity=-0.068 Sum_probs=20.7
Q ss_pred ceecCCCCCeeecC--CCccceeeeEEE
Q psy6079 3 SSICLHLVFWFPLL--LLSYMCLGVGVL 28 (79)
Q Consensus 3 ~~~~~~~~~W~~~~--~~~r~~~~~~~~ 28 (79)
..||+.+++|++++ |.+|.+|+++++
T Consensus 22 ~~~~~~~~~W~~~~~~P~~R~~h~~~~i 49 (49)
T PF13415_consen 22 WVFDLDTNTWTRIGDLPPPRSGHTATVI 49 (49)
T ss_pred EEEECCCCEEEECCCCCCCccceEEEEC
Confidence 47999999999998 445888887763
No 41
>KOG2437|consensus
Probab=97.12 E-value=0.00093 Score=43.26 Aligned_cols=58 Identities=14% Similarity=0.180 Sum_probs=44.6
Q ss_pred eecCCCCCeeecC--------CCccceeeeEEECCCCe--EEEEcCCCC--C-CCCCeEEEEeCCCCcEEecc
Q psy6079 4 SICLHLVFWFPLL--------LLSYMCLGVGVLEPEGP--LYAVGGHDG--W-SYLNTTERFDPRLRQWSFIA 63 (79)
Q Consensus 4 ~~~~~~~~W~~~~--------~~~r~~~~~~~~~~~~~--iyv~GG~~~--~-~~~~~~~~yd~~~~~W~~~~ 63 (79)
+||..++.|..+. |-....|.+++. .++ +||+||+.- . ..+..++.||..-..|+...
T Consensus 350 ~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd--~~k~~iyVfGGr~~~~~e~~f~GLYaf~~~~~~w~~l~ 420 (723)
T KOG2437|consen 350 RFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVD--SEKHMIYVFGGRILTCNEPQFSGLYAFNCQCQTWKLLR 420 (723)
T ss_pred EEecCCceeEEecccccccCCcceeecceeeEe--cCcceEEEecCeeccCCCccccceEEEecCCccHHHHH
Confidence 5788899998886 222467888888 666 999999652 2 34788999999999998764
No 42
>PF13964 Kelch_6: Kelch motif
Probab=96.97 E-value=0.00064 Score=30.25 Aligned_cols=18 Identities=11% Similarity=-0.011 Sum_probs=15.9
Q ss_pred ceecCCCCCeeecCCCcc
Q psy6079 3 SSICLHLVFWFPLLLLSY 20 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~r 20 (79)
.+||+++++|+.+++||.
T Consensus 31 ~~yd~~t~~W~~~~~mp~ 48 (50)
T PF13964_consen 31 ERYDPETNTWEQLPPMPT 48 (50)
T ss_pred EEEcCCCCcEEECCCCCC
Confidence 589999999999998873
No 43
>PLN02772 guanylate kinase
Probab=96.97 E-value=0.0046 Score=38.96 Aligned_cols=51 Identities=6% Similarity=-0.052 Sum_probs=37.1
Q ss_pred ceecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCC
Q psy6079 3 SSICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRL 56 (79)
Q Consensus 3 ~~~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~ 56 (79)
+.||..+++|.... |.||.+|+++++. +++|+|+++.... ..++|.....|
T Consensus 54 ~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~-~~rilv~~~~~~~--~~~~w~l~~~t 109 (398)
T PLN02772 54 QILDKITNNWVSPIVLGTGPKPCKGYSAVVLN-KDRILVIKKGSAP--DDSIWFLEVDT 109 (398)
T ss_pred EEEECCCCcEecccccCCCCCCCCcceEEEEC-CceEEEEeCCCCC--ccceEEEEcCC
Confidence 46899999999875 7779999999883 5689999764332 25566555444
No 44
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=96.43 E-value=0.048 Score=31.54 Aligned_cols=59 Identities=14% Similarity=-0.013 Sum_probs=39.1
Q ss_pred ceecCCCCCeeecCCCc---cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEe-cc
Q psy6079 3 SSICLHLVFWFPLLLLS---YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSF-IA 63 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~---r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~-~~ 63 (79)
..|+..++.|..+...+ ......+.+ ++.+|-+...........+..||..+.+|.. ++
T Consensus 73 ~Vys~~~~~Wr~~~~~~~~~~~~~~~v~~--~G~lyw~~~~~~~~~~~~IvsFDl~~E~f~~~i~ 135 (230)
T TIGR01640 73 QVYTLGSNSWRTIECSPPHHPLKSRGVCI--NGVLYYLAYTLKTNPDYFIVSFDVSSERFKEFIP 135 (230)
T ss_pred EEEEeCCCCccccccCCCCccccCCeEEE--CCEEEEEEEECCCCCcEEEEEEEcccceEeeeee
Confidence 47888999999987322 111225667 9998887642221112268999999999995 53
No 45
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=96.38 E-value=0.001 Score=29.01 Aligned_cols=17 Identities=12% Similarity=0.004 Sum_probs=15.0
Q ss_pred ceecCCCCCeeecCCCc
Q psy6079 3 SSICLHLVFWFPLLLLS 19 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~ 19 (79)
.+||+++++|+.+++||
T Consensus 31 ~~yd~~~~~W~~~~~mp 47 (47)
T PF01344_consen 31 EVYDPETNTWEELPPMP 47 (47)
T ss_dssp EEEETTTTEEEEEEEES
T ss_pred EEEeCCCCEEEEcCCCC
Confidence 47999999999998875
No 46
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=95.92 E-value=0.15 Score=29.50 Aligned_cols=62 Identities=8% Similarity=-0.122 Sum_probs=35.9
Q ss_pred ceecCCCCCeeecCCCcc----ceeeeEEECCCC-----eEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC
Q psy6079 3 SSICLHLVFWFPLLLLSY----MCLGVGVLEPEG-----PLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP 64 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~r----~~~~~~~~~~~~-----~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~ 64 (79)
...||.|.+|..+|+.+. .......+.++. |+..+...........+++|+..++.|+.+..
T Consensus 17 ~V~NP~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~~~~~~~~~Vys~~~~~Wr~~~~ 87 (230)
T TIGR01640 17 VVWNPSTGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSGNRNQSEHQVYTLGSNSWRTIEC 87 (230)
T ss_pred EEECCCCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecCCCCCccEEEEEeCCCCcccccc
Confidence 367999999999974331 111011121122 44444332111124578999999999999864
No 47
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=95.83 E-value=0.0058 Score=26.91 Aligned_cols=17 Identities=12% Similarity=0.082 Sum_probs=12.4
Q ss_pred ceecCCCCCeeecCCCc
Q psy6079 3 SSICLHLVFWFPLLLLS 19 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~ 19 (79)
.+||+++++|++++++|
T Consensus 32 ~~~d~~~~~W~~~~~~P 48 (49)
T PF13418_consen 32 WIFDIETNTWTRLPSMP 48 (49)
T ss_dssp EEEETTTTEEEE--SS-
T ss_pred EEEECCCCEEEECCCCC
Confidence 47999999999998776
No 48
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=95.37 E-value=0.013 Score=25.90 Aligned_cols=17 Identities=12% Similarity=0.018 Sum_probs=14.8
Q ss_pred ceecCCCCCeeecCCCc
Q psy6079 3 SSICLHLVFWFPLLLLS 19 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~ 19 (79)
+.||+++++|+.+++++
T Consensus 33 ~~~d~~t~~W~~~~~~g 49 (49)
T PF07646_consen 33 WVFDTETNQWTELSPMG 49 (49)
T ss_pred EEEECCCCEEeecCCCC
Confidence 57999999999998775
No 49
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=95.19 E-value=0.16 Score=30.87 Aligned_cols=60 Identities=12% Similarity=-0.008 Sum_probs=39.3
Q ss_pred eecCCCCCeeecCCC-ccceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEeccC
Q psy6079 4 SICLHLVFWFPLLLL-SYMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFIAP 64 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~-~r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~~~ 64 (79)
.||+...+|..+..- .-.-..+... -++++|+.|-+.-.. ....+..||.++.+|..++.
T Consensus 20 ~yd~~~~qW~~~g~~i~G~V~~l~~~-~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~ 81 (281)
T PF12768_consen 20 LYDTDNSQWSSPGNGISGTVTDLQWA-SNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGG 81 (281)
T ss_pred EEECCCCEeecCCCCceEEEEEEEEe-cCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCC
Confidence 589999999998732 1111222222 256777777544333 35678999999999988865
No 50
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=89.88 E-value=3.3 Score=25.82 Aligned_cols=36 Identities=14% Similarity=-0.101 Sum_probs=24.8
Q ss_pred eecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCC
Q psy6079 4 SICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHD 41 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~ 41 (79)
.||+++..-..+|.+. .....++.+ +++||++....
T Consensus 90 vyDt~t~av~~~P~l~~pk~~pisv~V--G~~LY~m~~~~ 127 (342)
T PF07893_consen 90 VYDTDTRAVATGPRLHSPKRCPISVSV--GDKLYAMDRSP 127 (342)
T ss_pred EEECCCCeEeccCCCCCCCcceEEEEe--CCeEEEeeccC
Confidence 5888888877777554 233345556 88999997643
No 51
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=88.51 E-value=0.95 Score=27.03 Aligned_cols=34 Identities=18% Similarity=0.401 Sum_probs=26.2
Q ss_pred CCCeEEEEcCCCCCCCCCeEEEEeCCCCcE-EeccCCCC-CCC
Q psy6079 30 PEGPLYAVGGHDGWSYLNTTERFDPRLRQW-SFIAPMLS-PRS 70 (79)
Q Consensus 30 ~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W-~~~~~~~~-~r~ 70 (79)
.+++||+++. ....+||+.++++ ..+|.+|. +|.
T Consensus 179 PdG~lFi~an-------~~s~i~d~~~n~v~~~lP~lPg~~R~ 214 (243)
T PF07250_consen 179 PDGNLFIFAN-------RGSIIYDYKTNTVVRTLPDLPGGPRN 214 (243)
T ss_pred CCCCEEEEEc-------CCcEEEeCCCCeEEeeCCCCCCCcee
Confidence 4999999975 3457789999987 77888884 454
No 52
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=85.37 E-value=5.8 Score=24.69 Aligned_cols=55 Identities=13% Similarity=0.173 Sum_probs=33.6
Q ss_pred ceecCCCCCeeecC--CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEecc
Q psy6079 3 SSICLHLVFWFPLL--LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIA 63 (79)
Q Consensus 3 ~~~~~~~~~W~~~~--~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~ 63 (79)
++|||....|..-+ ...-..+++.+-. .+.+++- ....+.+.+|||++.+...++
T Consensus 257 ~rfdPs~~sW~eypLPgs~arpys~rVD~-~grVW~s-----ea~agai~rfdpeta~ftv~p 313 (353)
T COG4257 257 HRFDPSVTSWIEYPLPGSKARPYSMRVDR-HGRVWLS-----EADAGAIGRFDPETARFTVLP 313 (353)
T ss_pred eEeCcccccceeeeCCCCCCCcceeeecc-CCcEEee-----ccccCceeecCcccceEEEec
Confidence 68999999998754 2222233444331 3345542 112467889999998877663
No 53
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=81.01 E-value=3.1 Score=17.18 Aligned_cols=37 Identities=16% Similarity=0.074 Sum_probs=18.5
Q ss_pred CeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCC
Q psy6079 11 FWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRL 56 (79)
Q Consensus 11 ~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~ 56 (79)
.|+.-.+- ....+.++. ++.+|+.+. ...++++|+++
T Consensus 4 ~W~~~~~~-~~~~~~~v~--~g~vyv~~~------dg~l~ald~~t 40 (40)
T PF13570_consen 4 LWSYDTGG-PIWSSPAVA--GGRVYVGTG------DGNLYALDAAT 40 (40)
T ss_dssp EEEEE-SS----S--EEC--TSEEEEE-T------TSEEEEEETT-
T ss_pred eEEEECCC-CcCcCCEEE--CCEEEEEcC------CCEEEEEeCCC
Confidence 46664322 222334455 778887654 35788888764
No 54
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=78.15 E-value=8.1 Score=20.33 Aligned_cols=33 Identities=30% Similarity=0.301 Sum_probs=24.1
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP 64 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~ 64 (79)
+|-+|-..-. .......+-+||.++.+|..+..
T Consensus 5 nGvly~~a~~-~~~~~~~IvsFDv~~E~f~~i~~ 37 (129)
T PF08268_consen 5 NGVLYWLAWS-EDSDNNVIVSFDVRSEKFRFIKL 37 (129)
T ss_pred CcEEEeEEEE-CCCCCcEEEEEEcCCceEEEEEe
Confidence 7777776543 22235789999999999998854
No 55
>PF06058 DCP1: Dcp1-like decapping family; InterPro: IPR010334 An essential step in mRNA turnover is decapping. In yeast, two proteins have been identified that are essential for decapping, Dcp1 (this family) and Dcp2 (IPR007722 from INTERPRO). The precise role of these proteins in the decapping reaction has not been established. Evidence suggests that the Dcp1 may enhance the function of Dcp2 [].; PDB: 1Q67_A 2QKM_C 2QKL_A.
Probab=69.01 E-value=6.8 Score=20.91 Aligned_cols=16 Identities=19% Similarity=0.418 Sum_probs=14.0
Q ss_pred CeEEEEeCCCCcEEec
Q psy6079 47 NTTERFDPRLRQWSFI 62 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~ 62 (79)
..++.||+.+++|++.
T Consensus 29 v~vY~f~~~~~~W~K~ 44 (122)
T PF06058_consen 29 VVVYKFDHETNEWEKT 44 (122)
T ss_dssp EEEEEEETTTTEEEEE
T ss_pred EEEEeecCCCCcEeec
Confidence 5688899999999986
No 56
>PRK09511 nirD nitrite reductase small subunit; Provisional
Probab=66.12 E-value=16 Score=18.86 Aligned_cols=31 Identities=13% Similarity=-0.039 Sum_probs=12.6
Q ss_pred eecCCCCCeeecCCCccceeeeEEECCCCeEEE
Q psy6079 4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYA 36 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv 36 (79)
+||.+|++-...+..+..-+-+.+. ++.|||
T Consensus 76 ~Fdl~TG~~~~~~~~~l~typV~ve--~g~V~v 106 (108)
T PRK09511 76 RFRLSDGLCMEDEQFSVKHYDARVK--DGVVQL 106 (108)
T ss_pred EEECCCcccCCCCCccEeeEeEEEE--CCEEEE
Confidence 4455554443322222233333444 555554
No 57
>PF07443 HARP: HepA-related protein (HARP); InterPro: IPR010003 This entry represents a conserved region approximately 60 residues long within eukaryotic HepA-related protein (HARP). This exhibits single-stranded DNA-dependent ATPase activity, and is ubiquitously expressed in human and mouse tissues []. Family members may contain more than one copy of this region.; GO: 0004386 helicase activity, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0016568 chromatin modification, 0005634 nucleus
Probab=65.49 E-value=2.8 Score=19.28 Aligned_cols=12 Identities=42% Similarity=1.326 Sum_probs=10.1
Q ss_pred EEEeCCCCcEEe
Q psy6079 50 ERFDPRLRQWSF 61 (79)
Q Consensus 50 ~~yd~~~~~W~~ 61 (79)
..||+.|+.|.-
T Consensus 21 r~YD~~Tr~W~F 32 (55)
T PF07443_consen 21 RNYDPKTRKWNF 32 (55)
T ss_pred cccCccceeeee
Confidence 468999999984
No 58
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=63.06 E-value=33 Score=21.04 Aligned_cols=46 Identities=20% Similarity=0.317 Sum_probs=31.4
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS 79 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~ 79 (79)
++.+|.--|..+ .+.+..||+.+.+-....+++..-.+-+.+.++|
T Consensus 55 ~g~LyESTG~yG---~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d 100 (264)
T PF05096_consen 55 DGTLYESTGLYG---QSSLRKVDLETGKVLQSVPLPPRYFGEGITILGD 100 (264)
T ss_dssp TTEEEEEECSTT---EEEEEEEETTTSSEEEEEE-TTT--EEEEEEETT
T ss_pred CCEEEEeCCCCC---cEEEEEEECCCCcEEEEEECCccccceeEEEECC
Confidence 567777656544 3789999999998777777776666667777664
No 59
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=62.86 E-value=28 Score=20.21 Aligned_cols=54 Identities=17% Similarity=0.137 Sum_probs=30.2
Q ss_pred ceecCCCCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCC
Q psy6079 3 SSICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPM 65 (79)
Q Consensus 3 ~~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~ 65 (79)
+++|+.+.+-....... ..+++....++.+|+... .....+|+.+.+++.+...
T Consensus 25 ~~~~~~~~~~~~~~~~~--~~G~~~~~~~g~l~v~~~-------~~~~~~d~~~g~~~~~~~~ 78 (246)
T PF08450_consen 25 YRVDPDTGEVEVIDLPG--PNGMAFDRPDGRLYVADS-------GGIAVVDPDTGKVTVLADL 78 (246)
T ss_dssp EEEETTTTEEEEEESSS--EEEEEEECTTSEEEEEET-------TCEEEEETTTTEEEEEEEE
T ss_pred EEEECCCCeEEEEecCC--CceEEEEccCCEEEEEEc-------CceEEEecCCCcEEEEeec
Confidence 35566665544433222 222333213678888753 2346669999988877655
No 60
>PF14830 Haemocyan_bet_s: Haemocyanin beta-sandwich; PDB: 1JS8_B 3QJO_A 1LNL_A.
Probab=62.44 E-value=7.7 Score=20.21 Aligned_cols=24 Identities=17% Similarity=0.451 Sum_probs=11.9
Q ss_pred CCeEEEEcCCCCC-CCCCeEEEEeC
Q psy6079 31 EGPLYAVGGHDGW-SYLNTTERFDP 54 (79)
Q Consensus 31 ~~~iyv~GG~~~~-~~~~~~~~yd~ 54 (79)
.+.++|.||..+. +.+..++.||.
T Consensus 36 AG~F~vLGG~~EMpW~FdRlykydI 60 (103)
T PF14830_consen 36 AGTFFVLGGEKEMPWAFDRLYKYDI 60 (103)
T ss_dssp EEEEEE---TTS---B-SS-EEEE-
T ss_pred ccEEEEcCCCccCccccCccchhhH
Confidence 4568999997764 34778888886
No 61
>PF00614 PLDc: Phospholipase D Active site motif; InterPro: IPR001736 Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs [, , , ].; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3HSI_C.
Probab=61.89 E-value=10 Score=14.78 Aligned_cols=16 Identities=25% Similarity=0.378 Sum_probs=9.4
Q ss_pred eeEEECCCCeEEEEcCCC
Q psy6079 24 GVGVLEPEGPLYAVGGHD 41 (79)
Q Consensus 24 ~~~~~~~~~~iyv~GG~~ 41 (79)
-..++ ++++..+||.+
T Consensus 8 K~~vv--D~~~a~vGg~n 23 (28)
T PF00614_consen 8 KFVVV--DDRVAFVGGAN 23 (28)
T ss_dssp -EEEE--TTTEEEEE---
T ss_pred EEEEE--cCCEEEECcee
Confidence 35567 88888888865
No 62
>PF14157 YmzC: YmzC-like protein; PDB: 3KVP_E.
Probab=61.55 E-value=14 Score=17.45 Aligned_cols=34 Identities=18% Similarity=0.147 Sum_probs=19.4
Q ss_pred eeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEec
Q psy6079 24 GVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFI 62 (79)
Q Consensus 24 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~ 62 (79)
.++.+ .+.-|.+-|-. ..-.++.||+++++-..+
T Consensus 23 ~m~ql--~~N~Fav~~e~---~~iKIfkyd~~tNei~L~ 56 (63)
T PF14157_consen 23 NMTQL--EHNHFAVVDED---GQIKIFKYDEDTNEITLK 56 (63)
T ss_dssp EEEE---STTEEEEE-ET---TEEEEEEEETTTTEEEEE
T ss_pred ceEEe--cCCEEEEEecC---CeEEEEEeCCCCCeEEEE
Confidence 45555 44444443211 135789999999987654
No 63
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=55.20 E-value=25 Score=22.06 Aligned_cols=19 Identities=21% Similarity=0.235 Sum_probs=14.6
Q ss_pred ceeeeEEECCCCeEEEEcCCC
Q psy6079 21 MCLGVGVLEPEGPLYAVGGHD 41 (79)
Q Consensus 21 ~~~~~~~~~~~~~iyv~GG~~ 41 (79)
..|.+.+. ++.+|++||..
T Consensus 156 SFHvslar--~D~VYilGGHs 174 (337)
T PF03089_consen 156 SFHVSLAR--NDCVYILGGHS 174 (337)
T ss_pred EEEEEEec--CceEEEEccEE
Confidence 55666666 88999999954
No 64
>PRK11408 hypothetical protein; Provisional
Probab=54.91 E-value=30 Score=19.24 Aligned_cols=29 Identities=24% Similarity=0.528 Sum_probs=20.0
Q ss_pred EEEE-cCCCCCCCCCeEEEEeCCCCcEEecc
Q psy6079 34 LYAV-GGHDGWSYLNTTERFDPRLRQWSFIA 63 (79)
Q Consensus 34 iyv~-GG~~~~~~~~~~~~yd~~~~~W~~~~ 63 (79)
+|.+ ||.-+.. ...++.|-|++-+|+.+.
T Consensus 34 ~FAiNgG~lg~~-~G~v~Y~aPDtL~WE~l~ 63 (145)
T PRK11408 34 FFALNGGGLGED-VGEVYYFAPDTLKWESLE 63 (145)
T ss_pred eEEeccCccCCC-CCeEEEeCCCccceeccC
Confidence 4455 4433322 578999999999999763
No 65
>PF13806 Rieske_2: Rieske-like [2Fe-2S] domain; PDB: 2JO6_A 3C0D_A 3D89_A 2JZA_A.
Probab=54.08 E-value=20 Score=18.45 Aligned_cols=31 Identities=19% Similarity=0.066 Sum_probs=13.9
Q ss_pred eecCCCCCeeecCCCccceeeeEEECCCCeEEE
Q psy6079 4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYA 36 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv 36 (79)
+||++|++....+..+..-+-+-+. ++.|||
T Consensus 73 ~f~L~tG~~~~~~~~~l~~ypvrv~--~g~V~V 103 (104)
T PF13806_consen 73 RFDLRTGECLEDPDVSLRTYPVRVE--DGQVYV 103 (104)
T ss_dssp EEETTTTEESSECSEBSBEEEEEEC--TTEEEE
T ss_pred eEECCCcCcCCCCCCcEEeEEEEEE--CCEEEE
Confidence 4555555443333323333444444 555554
No 66
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=51.43 E-value=18 Score=24.06 Aligned_cols=29 Identities=28% Similarity=0.544 Sum_probs=20.2
Q ss_pred eEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC
Q psy6079 25 VGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP 64 (79)
Q Consensus 25 ~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~ 64 (79)
+..+ ++.+|++|| +.++.+| .|.|+.+..
T Consensus 571 A~fi--~dylY~vg~-------~ev~~ld--enswe~Vge 599 (603)
T COG4880 571 AFFI--KDYLYLVGG-------NEVWKLD--ENSWEVVGE 599 (603)
T ss_pred eEEe--cceEEEecc-------ceeEEec--cchHhhhhh
Confidence 4455 888999987 5677775 456877643
No 67
>KOG0278|consensus
Probab=44.09 E-value=41 Score=20.85 Aligned_cols=30 Identities=23% Similarity=0.449 Sum_probs=18.1
Q ss_pred eeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCC
Q psy6079 23 LGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLR 57 (79)
Q Consensus 23 ~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~ 57 (79)
-.++.+.++..+||.||-+ -.++.||-.|.
T Consensus 227 V~SASL~P~k~~fVaGged-----~~~~kfDy~Tg 256 (334)
T KOG0278|consen 227 VESASLHPKKEFFVAGGED-----FKVYKFDYNTG 256 (334)
T ss_pred cccccccCCCceEEecCcc-----eEEEEEeccCC
Confidence 3445555577899999843 23455555444
No 68
>TIGR00547 lolA periplasmic chaperone LolA. This protein, LolA, is known so far only in the gamma and beta subdivisions of the Proteobacteria. The E. coli major outer lipoprotein (Lpp) of E. coli is released from the inner membrane as a complex with this chaperone in an energy-requiring process, and is then delivered to LolB for insertion into the outer membrane. LolA is involved in the delivery of lipoproteins generally, rather than just Lpp, and is an essential protein in E. coli, unlike Lpp itself.
Probab=43.80 E-value=55 Score=19.02 Aligned_cols=24 Identities=21% Similarity=0.259 Sum_probs=16.6
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEe
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSF 61 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~ 61 (79)
.+.+.|..| ..++.|||+.++...
T Consensus 77 ~~q~iVsdG-------~~vw~Ydp~leQvt~ 100 (204)
T TIGR00547 77 DESIIISDG-------KTLWFYDPFVEQATA 100 (204)
T ss_pred CceEEEECC-------CEEEEECCCCceeee
Confidence 445555554 578999998888653
No 69
>cd01207 Ena-Vasp Enabled-VASP-type homology (EVH1) domain. Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=43.71 E-value=30 Score=18.30 Aligned_cols=18 Identities=11% Similarity=0.276 Sum_probs=14.7
Q ss_pred CeEEEEeCCCCcEEeccC
Q psy6079 47 NTTERFDPRLRQWSFIAP 64 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~~~ 64 (79)
..+..||+.++.|.+...
T Consensus 9 A~Vm~~d~~tk~W~P~~~ 26 (111)
T cd01207 9 ASVMVYDDSNKKWVPAGG 26 (111)
T ss_pred EEeeEEcCCCCcEEcCCC
Confidence 467889999999988644
No 70
>PF03835 Rad4: Rad4 transglutaminase-like domain; InterPro: IPR018325 RAD4/Xp-C proteins contain an ancient transglutaminase fold that is also found in peptide-N-glycanases (PNGases), which remove glycans from glycoproteins during their degradation. The PNGases retain the catalytic triad that is typical of this fold and are predicted to have a reaction mechanism similar to that involved in transglutamination. In contrast, the RAD4/Xp-C proteins are predicted to be inactive and are likely to only possess the interaction function in DNA repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0005634 nucleus; PDB: 2QSG_A 2QSF_A 2QSH_A 1X3W_A 1X3Z_A 3ESW_A.
Probab=43.57 E-value=34 Score=18.58 Aligned_cols=18 Identities=22% Similarity=0.791 Sum_probs=15.2
Q ss_pred EEEEeCCCCcEEeccCCC
Q psy6079 49 TERFDPRLRQWSFIAPML 66 (79)
Q Consensus 49 ~~~yd~~~~~W~~~~~~~ 66 (79)
+|+|++..++|-.+.++.
T Consensus 39 ~EV~~~~~~rWI~VDp~~ 56 (145)
T PF03835_consen 39 VEVYSPEEKRWIHVDPVV 56 (145)
T ss_dssp EEEEETTTTEEEEEETTT
T ss_pred EEEEecCCCeEEEeeeec
Confidence 688999999999886654
No 71
>KOG2321|consensus
Probab=42.43 E-value=53 Score=22.73 Aligned_cols=38 Identities=16% Similarity=0.198 Sum_probs=24.7
Q ss_pred CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEE
Q psy6079 17 LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWS 60 (79)
Q Consensus 17 ~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~ 60 (79)
.+|+.+.-++.-...-.||+.|. ..+++++|.+...|-
T Consensus 131 RIP~~GRDm~y~~~scDly~~gs------g~evYRlNLEqGrfL 168 (703)
T KOG2321|consen 131 RIPKFGRDMKYHKPSCDLYLVGS------GSEVYRLNLEQGRFL 168 (703)
T ss_pred ecCcCCccccccCCCccEEEeec------CcceEEEEccccccc
Confidence 34555555554433556888763 367899999988874
No 72
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=42.40 E-value=47 Score=16.71 Aligned_cols=33 Identities=15% Similarity=-0.086 Sum_probs=17.8
Q ss_pred eecCCCCCeeecCCCccceeeeEEECCCCeEEEE
Q psy6079 4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAV 37 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~ 37 (79)
+|||.+++.+.+.. .......+++..++...++
T Consensus 41 ~ydp~t~~~~vl~~-~L~fpNGVals~d~~~vlv 73 (89)
T PF03088_consen 41 RYDPSTKETTVLLD-GLYFPNGVALSPDESFVLV 73 (89)
T ss_dssp EEETTTTEEEEEEE-EESSEEEEEE-TTSSEEEE
T ss_pred EEECCCCeEEEehh-CCCccCeEEEcCCCCEEEE
Confidence 68888888877642 1223334444445554333
No 73
>cd05398 NT_ClassII-CCAase Nucleotidyltransferase (NT) domain of ClassII CCA-adding enzymes. CCA-adding enzymes add the sequence [cytidine(C)-cytidine-adenosine (A)], one nucleotide at a time, onto the 3' end of tRNA, in a template-independent reaction. This Class II group is comprised mainly of eubacterial and eukaryotic enzymes and includes Bacillus stearothermophilus CCAase, Escherichia coli poly(A) polymerase I, human mitochondrial CCAase, and Saccharomyces cerevisiae CCAase (CCA1). CCA-adding enzymes have a single catalytic pocket, which recognizes both ATP and CTP substrates. Included in this subgroup are CC- and A-adding enzymes from various ancient species of bacteria such as Aquifex aeolicus; these enzymes collaborate to add CCA to tRNAs. This family belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal io
Probab=41.50 E-value=21 Score=19.36 Aligned_cols=11 Identities=36% Similarity=0.395 Sum_probs=8.9
Q ss_pred CCeEEEEcCCC
Q psy6079 31 EGPLYAVGGHD 41 (79)
Q Consensus 31 ~~~iyv~GG~~ 41 (79)
+.++|++||+-
T Consensus 16 g~~~ylVGG~V 26 (139)
T cd05398 16 GYEAYLVGGAV 26 (139)
T ss_pred CceEEEECChH
Confidence 66899999953
No 74
>cd03583 NTR_complement_C3 NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C3 plays a pivotal role in the activation of the complement systems, as all pathways (classical, alternative, and lectin) result in the processing of C3 by C3 convertase. The larger fragment, activated C3b, contains the NTR/C345C domain and binds covalently, via a reactive thioester, to cell surface carbohydrates including components of bacterial cell walls and immune aggregates. The smaller cleavage product, C3a, acts independently as a diffusible signal to mediate local inflammatory processes. The structure of C3 shows that the NTR/C345C domain is located in an exposed position relative to the rest of the molecule. The function of the domain in complement C3 is poorly understood.
Probab=40.86 E-value=52 Score=18.23 Aligned_cols=32 Identities=16% Similarity=0.200 Sum_probs=21.5
Q ss_pred CCeEEEEcCCCCCCC-CCeEEEEeCCCCcEEec
Q psy6079 31 EGPLYAVGGHDGWSY-LNTTERFDPRLRQWSFI 62 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~-~~~~~~yd~~~~~W~~~ 62 (79)
.++.|++-|.++... ...-+.|-..+++|-..
T Consensus 86 ~gk~YLIMG~~~~~~~~~~~~~YvL~~~TWvE~ 118 (149)
T cd03583 86 EGKDYLIMGLSSDLWRIKDKYSYVIGKDTWIEY 118 (149)
T ss_pred CCCEEEEEeCCCCccccCCcEEEEeCCCeEEEE
Confidence 467788877664432 23347888899999765
No 75
>smart00155 PLDc Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid, and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not be
Probab=40.59 E-value=26 Score=13.17 Aligned_cols=19 Identities=5% Similarity=0.256 Sum_probs=12.9
Q ss_pred ceeeeEEECCCCeEEEEcCCC
Q psy6079 21 MCLGVGVLEPEGPLYAVGGHD 41 (79)
Q Consensus 21 ~~~~~~~~~~~~~iyv~GG~~ 41 (79)
.+.-..++ +++..++|+.+
T Consensus 5 ~H~K~~v~--D~~~~~iGs~N 23 (28)
T smart00155 5 LHTKLMIV--DDEIAYIGSAN 23 (28)
T ss_pred EEeEEEEE--cCCEEEEeCcc
Confidence 34445667 88888888765
No 76
>PF10946 DUF2625: Protein of unknown function DUF2625; InterPro: IPR021239 Some members in this family of proteins are annotated as ybfG however currently no function is known.
Probab=40.06 E-value=57 Score=19.30 Aligned_cols=18 Identities=17% Similarity=0.359 Sum_probs=15.7
Q ss_pred CCeEEEEeCCCCcEEecc
Q psy6079 46 LNTTERFDPRLRQWSFIA 63 (79)
Q Consensus 46 ~~~~~~yd~~~~~W~~~~ 63 (79)
...++.|.|++-+|+.+.
T Consensus 123 ~G~v~YfAPDtL~WE~l~ 140 (208)
T PF10946_consen 123 PGKVYYFAPDTLEWEPLE 140 (208)
T ss_pred CCeEEEECCCcccccccC
Confidence 578999999999999763
No 77
>KOG2315|consensus
Probab=39.38 E-value=1.3e+02 Score=20.75 Aligned_cols=49 Identities=16% Similarity=0.120 Sum_probs=27.9
Q ss_pred eecCCCCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCC
Q psy6079 4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLR 57 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~ 57 (79)
.||++.+---.++.-||.. .. ....+.|.++.|+.+. -.+++++|..+.
T Consensus 298 ifnlr~~~v~df~egpRN~-~~--fnp~g~ii~lAGFGNL--~G~mEvwDv~n~ 346 (566)
T KOG2315|consen 298 IFNLRGKPVFDFPEGPRNT-AF--FNPHGNIILLAGFGNL--PGDMEVWDVPNR 346 (566)
T ss_pred EEcCCCCEeEeCCCCCccc-eE--ECCCCCEEEEeecCCC--CCceEEEeccch
Confidence 4555555555555445533 12 2226667777777653 367888887663
No 78
>cd03584 NTR_complement_C4 NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the component classical pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin, and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.
Probab=38.80 E-value=64 Score=17.91 Aligned_cols=32 Identities=22% Similarity=0.326 Sum_probs=20.7
Q ss_pred CCeEEEEcCCCCCCC-CCeEEEEeCCCCcEEec
Q psy6079 31 EGPLYAVGGHDGWSY-LNTTERFDPRLRQWSFI 62 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~-~~~~~~yd~~~~~W~~~ 62 (79)
-++-|++-|.++... ...-+.|-..+++|-..
T Consensus 90 ~gk~YLImG~~~~~~~~~~~~~Y~L~~~tWvE~ 122 (153)
T cd03584 90 KGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEK 122 (153)
T ss_pred CCCEEEEEcCCCCCcCcCCcEEEEeCCCceEEE
Confidence 556787766554322 23346788899999765
No 79
>cd01206 Homer Homer type EVH1 domain. Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=38.14 E-value=42 Score=17.80 Aligned_cols=18 Identities=17% Similarity=0.434 Sum_probs=13.7
Q ss_pred CeEEEEeCCCC-cEEeccC
Q psy6079 47 NTTERFDPRLR-QWSFIAP 64 (79)
Q Consensus 47 ~~~~~yd~~~~-~W~~~~~ 64 (79)
..+..||+.++ .|.+..+
T Consensus 11 A~V~~yd~~tKk~WvPs~~ 29 (111)
T cd01206 11 AHVFQIDPKTKKNWIPASK 29 (111)
T ss_pred eEEEEECCCCcceeEeCCC
Confidence 56888999875 8987654
No 80
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=37.79 E-value=1e+02 Score=19.36 Aligned_cols=41 Identities=17% Similarity=0.130 Sum_probs=29.7
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEE
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVL 77 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~ 77 (79)
+.+|++++.. ...-+||.++..-...|.+..+.....++.+
T Consensus 76 gskIv~~d~~------~~t~vyDt~t~av~~~P~l~~pk~~pisv~V 116 (342)
T PF07893_consen 76 GSKIVAVDQS------GRTLVYDTDTRAVATGPRLHSPKRCPISVSV 116 (342)
T ss_pred CCeEEEEcCC------CCeEEEECCCCeEeccCCCCCCCcceEEEEe
Confidence 7788887543 3477899999998888888776665544443
No 81
>PF10902 DUF2693: Protein of unknown function (DUF2693); InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown.
Probab=36.72 E-value=53 Score=16.45 Aligned_cols=18 Identities=17% Similarity=0.362 Sum_probs=15.1
Q ss_pred CCCeEEEEeCCCCcEEec
Q psy6079 45 YLNTTERFDPRLRQWSFI 62 (79)
Q Consensus 45 ~~~~~~~yd~~~~~W~~~ 62 (79)
...++.+||.+.+.|...
T Consensus 47 s~~s~~yfDve~~~WRSF 64 (83)
T PF10902_consen 47 STTSVRYFDVEKKGWRSF 64 (83)
T ss_pred CcceEEEEEeccCceeee
Confidence 467799999999999864
No 82
>PF11958 DUF3472: Domain of unknown function (DUF3472); InterPro: IPR021862 This presumed domain is functionally uncharacterised. This domain is found in bacteria, eukaryotes and viruses. This domain is typically between 174 to 190 amino acids in length. This domain has a single completely conserved residue G that may be functionally important.
Probab=36.50 E-value=36 Score=19.49 Aligned_cols=24 Identities=25% Similarity=0.542 Sum_probs=18.8
Q ss_pred CeEEEEeCCCCcEEeccCCCCCCC
Q psy6079 47 NTTERFDPRLRQWSFIAPMLSPRS 70 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~~~~~~~r~ 70 (79)
-..+++|+.+++|..|+.+..|+.
T Consensus 80 ~~~~v~d~~t~~~~~Igs~~~p~~ 103 (184)
T PF11958_consen 80 YTGYVTDPTTGKWTLIGSIKRPKT 103 (184)
T ss_pred EEEEEEECCCCcEEEEEEEEEcCc
Confidence 356789999999999987765543
No 83
>PF13540 RCC1_2: Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=35.74 E-value=34 Score=13.15 Aligned_cols=16 Identities=19% Similarity=0.443 Sum_probs=9.6
Q ss_pred eeeEEECCCCeEEEEcC
Q psy6079 23 LGVGVLEPEGPLYAVGG 39 (79)
Q Consensus 23 ~~~~~~~~~~~iyv~GG 39 (79)
|++++.. ++++|.+|-
T Consensus 10 ht~al~~-~g~v~~wG~ 25 (30)
T PF13540_consen 10 HTCALTS-DGEVYCWGD 25 (30)
T ss_dssp EEEEEE--TTEEEEEE-
T ss_pred EEEEEEc-CCCEEEEcC
Confidence 3444332 889999984
No 84
>PF06335 DUF1054: Protein of unknown function (DUF1054); InterPro: IPR009403 This entry consists of several hypothetical bacterial proteins of unknown function.; PDB: 2A8E_A 3MQZ_A.
Probab=35.52 E-value=71 Score=18.71 Aligned_cols=34 Identities=9% Similarity=0.105 Sum_probs=24.5
Q ss_pred eecCCCCCeeecCCCcc-----ceeeeEEECCCCeEEEEcC
Q psy6079 4 SICLHLVFWFPLLLLSY-----MCLGVGVLEPEGPLYAVGG 39 (79)
Q Consensus 4 ~~~~~~~~W~~~~~~~r-----~~~~~~~~~~~~~iyv~GG 39 (79)
+-||..++|-...+-+| .++.+... ++.+||.=+
T Consensus 59 tvNPP~dTWvAf~~nKRGYK~~PHFqiGl~--~d~lFi~~a 97 (194)
T PF06335_consen 59 TVNPPKDTWVAFSPNKRGYKMLPHFQIGLW--DDHLFIWFA 97 (194)
T ss_dssp SSS--SEEEEEEESSSS-STTS-EEEEEEE--TTEEEEEEE
T ss_pred cCCCCccchhhccCCCCccccCCceEEeec--CceEEEEEE
Confidence 35788899999987776 67788888 888888654
No 85
>cd00837 EVH1 EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. The EVH1 domain binds to other proteins at proline rich sequences in either FPPPP or PPXXF motifs. It is found in the cytoskeletal reorganization proteins Enabled VASP, and WASP, and in the synaptic scaffolding protein Homer. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=34.51 E-value=52 Score=16.84 Aligned_cols=16 Identities=19% Similarity=0.302 Sum_probs=13.3
Q ss_pred CeEEEEeCCCCcEEec
Q psy6079 47 NTTERFDPRLRQWSFI 62 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~ 62 (79)
..++.+|+.++.|...
T Consensus 9 a~v~~~~~~~~~W~~~ 24 (104)
T cd00837 9 AQVYTADPSTGKWVPA 24 (104)
T ss_pred EEEEEECCCCCceEEC
Confidence 4678889999999886
No 86
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=33.22 E-value=33 Score=14.14 Aligned_cols=11 Identities=27% Similarity=0.661 Sum_probs=8.0
Q ss_pred CCeEEEEcCCC
Q psy6079 31 EGPLYAVGGHD 41 (79)
Q Consensus 31 ~~~iyv~GG~~ 41 (79)
.+.+||.|-.+
T Consensus 23 ~GNiYv~G~T~ 33 (38)
T PF06739_consen 23 NGNIYVTGYTN 33 (38)
T ss_pred CCCEEEEEeec
Confidence 56799998543
No 87
>PF11090 DUF2833: Protein of unknown function (DUF2833); InterPro: IPR020335 This entry contains proteins with no known function.
Probab=31.92 E-value=77 Score=16.03 Aligned_cols=13 Identities=38% Similarity=0.825 Sum_probs=10.6
Q ss_pred CCCeEEEEcCCCC
Q psy6079 30 PEGPLYAVGGHDG 42 (79)
Q Consensus 30 ~~~~iyv~GG~~~ 42 (79)
+.+.++.+||..+
T Consensus 7 ~~g~~lAiGG~~g 19 (86)
T PF11090_consen 7 HKGRPLAIGGNNG 19 (86)
T ss_pred cCCeEEEEccccC
Confidence 4889999999764
No 88
>KOG0294|consensus
Probab=31.87 E-value=1.4e+02 Score=19.15 Aligned_cols=23 Identities=30% Similarity=0.536 Sum_probs=14.8
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCCCc
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRLRQ 58 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~ 58 (79)
++ .|++.|.++ ..+.+||..++.
T Consensus 52 s~-~~~aSGssD----etI~IYDm~k~~ 74 (362)
T KOG0294|consen 52 SG-PYVASGSSD----ETIHIYDMRKRK 74 (362)
T ss_pred cc-eeEeccCCC----CcEEEEeccchh
Confidence 55 466655443 468899987765
No 89
>KOG1324|consensus
Probab=31.77 E-value=30 Score=20.04 Aligned_cols=9 Identities=22% Similarity=0.737 Sum_probs=7.4
Q ss_pred CeEEEEcCC
Q psy6079 32 GPLYAVGGH 40 (79)
Q Consensus 32 ~~iyv~GG~ 40 (79)
+++||+||.
T Consensus 111 e~vfvIGG~ 119 (190)
T KOG1324|consen 111 EMVFVIGGS 119 (190)
T ss_pred eEEEEEcCH
Confidence 579999993
No 90
>KOG2705|consensus
Probab=31.61 E-value=28 Score=22.43 Aligned_cols=18 Identities=22% Similarity=0.508 Sum_probs=14.6
Q ss_pred CCCCCeEEEEeCCCCcEE
Q psy6079 43 WSYLNTTERFDPRLRQWS 60 (79)
Q Consensus 43 ~~~~~~~~~yd~~~~~W~ 60 (79)
+...+.++.||..||+|-
T Consensus 304 ~~~~~~v~SFN~nTN~wv 321 (471)
T KOG2705|consen 304 HDYNSVVESFNCNTNTWV 321 (471)
T ss_pred CccceeeeeeecChhHHH
Confidence 344678999999999995
No 91
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs).
Probab=30.25 E-value=51 Score=13.46 Aligned_cols=19 Identities=37% Similarity=0.655 Sum_probs=13.9
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCC
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRL 56 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~ 56 (79)
++++|++-| +..+.||..+
T Consensus 9 ~~~~yfFkg-------~~yw~~~~~~ 27 (45)
T smart00120 9 NGKTYFFKG-------DKYWRFDPKR 27 (45)
T ss_pred CCeEEEEeC-------CEEEEEcCCc
Confidence 568999976 5678888654
No 92
>COG2380 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.67 E-value=56 Score=20.28 Aligned_cols=19 Identities=26% Similarity=0.536 Sum_probs=15.2
Q ss_pred CeEEEEeCCCCcEEeccCC
Q psy6079 47 NTTERFDPRLRQWSFIAPM 65 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~~~~ 65 (79)
.....+|.+|+.|+.++.+
T Consensus 22 f~~~~~~vEtdew~~i~~~ 40 (327)
T COG2380 22 FAEESFDVETDEWEAIPTL 40 (327)
T ss_pred ccceeeecccccccccCcc
Confidence 4567789999999998654
No 93
>PF14741 GH114_assoc: N-terminal glycosyl-hydrolase-114-associated domain
Probab=29.53 E-value=1e+02 Score=16.86 Aligned_cols=22 Identities=18% Similarity=0.273 Sum_probs=17.7
Q ss_pred CCCeEEEEeC-CCCcEEeccCCC
Q psy6079 45 YLNTTERFDP-RLRQWSFIAPML 66 (79)
Q Consensus 45 ~~~~~~~yd~-~~~~W~~~~~~~ 66 (79)
..-.++.||. .+++|..+.++.
T Consensus 61 ~~W~vd~~~~~~~~~w~~vGD~t 83 (129)
T PF14741_consen 61 MRWTVDMFDFDDTSTWVKVGDLT 83 (129)
T ss_pred ceeEEEEEEccCCCeeEEecccc
Confidence 3567888999 899999998764
No 94
>KOG2006|consensus
Probab=27.85 E-value=92 Score=23.03 Aligned_cols=53 Identities=17% Similarity=0.218 Sum_probs=35.9
Q ss_pred cCCCCCeeecC-CCc------cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCC
Q psy6079 6 CLHLVFWFPLL-LLS------YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPML 66 (79)
Q Consensus 6 ~~~~~~W~~~~-~~~------r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~ 66 (79)
+....+|..+. |.+ ..++.+.-. +++.+++.|.. -...|+..+++|...+...
T Consensus 321 ~~~~~~w~~~~~P~~y~ssnwpiry~s~~~--dg~~lavAgl~------gla~y~~~~~~Wk~f~~~s 380 (1023)
T KOG2006|consen 321 ITDIEKWDHLALPLTYTSSNWPIRYVSIDY--DGKHLAVAGLH------GLARYSLSSKRWKLFGDES 380 (1023)
T ss_pred ccchhhhheeecccccccccCceeEEeecC--CccEEEEeecc------cceEEEeecceeecCCccc
Confidence 45667888876 333 133444444 88889888854 3567899999999876543
No 95
>PF00186 DHFR_1: Dihydrofolate reductase; InterPro: IPR001796 Dihydrofolate reductase (DHFR) (1.5.1.3 from EC) catalyses the NADPH-dependent reduction of dihydrofolate to tetrahydrofolate, an essential step in de novo synthesis both of glycine and of purines and deoxythymidine phosphate (the precursors of DNA synthesis) [], and important also in the conversion of deoxyuridine monophosphate to deoxythymidine monophosphate. Although DHFR is found ubiquitously in prokaryotes and eukaryotes, and is found in all dividing cells, maintaining levels of fully reduced folate coenzymes, the catabolic steps are still not well understood []. Bacterial species possesses distinct DHFR enzymes (based on their pattern of binding diaminoheterocyclic molecules), but mammalian DHFRs are highly similar []. The active site is situated in the N-terminal half of the sequence, which includes a conserved Pro-Trp dipeptide; the tryptophan has been shown [] to be involved in the binding of substrate by the enzyme. Its central role in DNA precursor synthesis, coupled with its inhibition by antagonists such as trimethoprim and methotrexate, which are used as anti-bacterial or anti-cancer agents, has made DHFR a target of anticancer chemotherapy. However, resistance has developed against some drugs, as a result of changes in DHFR itself [].; GO: 0004146 dihydrofolate reductase activity, 0006545 glycine biosynthetic process, 0009165 nucleotide biosynthetic process, 0055114 oxidation-reduction process; PDB: 1ZDR_B 3SA2_B 3JWK_B 3E0B_A 3S9U_B 3FL9_H 3FL8_F 2QK8_A 3JW3_A 3SA1_B ....
Probab=27.56 E-value=39 Score=18.88 Aligned_cols=9 Identities=22% Similarity=0.822 Sum_probs=6.5
Q ss_pred CeEEEEcCC
Q psy6079 32 GPLYAVGGH 40 (79)
Q Consensus 32 ~~iyv~GG~ 40 (79)
+.+||+||.
T Consensus 92 ~~i~ViGG~ 100 (161)
T PF00186_consen 92 EEIFVIGGA 100 (161)
T ss_dssp SEEEEEE-H
T ss_pred CcEEEECCH
Confidence 579999993
No 96
>COG4334 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.16 E-value=76 Score=16.92 Aligned_cols=14 Identities=29% Similarity=0.458 Sum_probs=11.2
Q ss_pred eEEECCCCeEEEEcCC
Q psy6079 25 VGVLEPEGPLYAVGGH 40 (79)
Q Consensus 25 ~~~~~~~~~iyv~GG~ 40 (79)
++.+ ++.+||-|+.
T Consensus 41 ~v~v--DgdLyVrgy~ 54 (131)
T COG4334 41 FVYV--DGDLYVRGYR 54 (131)
T ss_pred EEEE--CCceEEeecC
Confidence 4567 9999999874
No 97
>KOG0282|consensus
Probab=27.12 E-value=1.9e+02 Score=19.65 Aligned_cols=12 Identities=33% Similarity=0.803 Sum_probs=9.3
Q ss_pred CC-eEEEEcCCCC
Q psy6079 31 EG-PLYAVGGHDG 42 (79)
Q Consensus 31 ~~-~iyv~GG~~~ 42 (79)
++ .+|++||.+.
T Consensus 310 d~~n~fl~G~sd~ 322 (503)
T KOG0282|consen 310 DNQNIFLVGGSDK 322 (503)
T ss_pred CCCcEEEEecCCC
Confidence 44 8999999764
No 98
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=27.01 E-value=1.4e+02 Score=19.38 Aligned_cols=27 Identities=22% Similarity=0.192 Sum_probs=15.5
Q ss_pred CCCeeecCCCccceeeeEEECCCCeEEEE
Q psy6079 9 LVFWFPLLLLSYMCLGVGVLEPEGPLYAV 37 (79)
Q Consensus 9 ~~~W~~~~~~~r~~~~~~~~~~~~~iyv~ 37 (79)
.+.|+.+..+.-..--++.. +|++|++
T Consensus 189 ~~~Wt~l~~~~~~~~DIi~~--kGkfYAv 215 (373)
T PLN03215 189 GNVLKALKQMGYHFSDIIVH--KGQTYAL 215 (373)
T ss_pred CCeeeEccCCCceeeEEEEE--CCEEEEE
Confidence 37899886533333334444 6666665
No 99
>KOG2967|consensus
Probab=26.87 E-value=39 Score=21.30 Aligned_cols=13 Identities=23% Similarity=0.409 Sum_probs=9.5
Q ss_pred CCeEEEEcCCCCC
Q psy6079 31 EGPLYAVGGHDGW 43 (79)
Q Consensus 31 ~~~iyv~GG~~~~ 43 (79)
.+++|||||.-+.
T Consensus 200 p~~vYiIGglVD~ 212 (314)
T KOG2967|consen 200 PDKVYIIGGLVDK 212 (314)
T ss_pred CCcEEEEEEEecC
Confidence 4579999996543
No 100
>PLN03113 DNA ligase 1; Provisional
Probab=26.64 E-value=1.9e+02 Score=20.70 Aligned_cols=56 Identities=13% Similarity=0.141 Sum_probs=28.8
Q ss_pred ceecCC--CCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCC----CCCe--EEEEeCCCCcEEecc
Q psy6079 3 SSICLH--LVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWS----YLNT--TERFDPRLRQWSFIA 63 (79)
Q Consensus 3 ~~~~~~--~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~----~~~~--~~~yd~~~~~W~~~~ 63 (79)
+.|-|. .+.|-++.+-...+.+- .-.+.|+||+.+.. .... +-+||+.++.|..++
T Consensus 561 S~Y~pGkRs~~WlKlK~dy~~~~~d-----tlDlVvIGa~~G~GkR~g~~g~fLla~yd~~~~~~~~v~ 624 (744)
T PLN03113 561 ATYEPSKRSNNWLKLKKDYMESIGD-----SLDLVPIAAFHGRGKRTGVYGAFLLACYDSNKEEFQSIC 624 (744)
T ss_pred CCccCCCCCCCeEEEechhhccccc-----cccEEEEEEEeCCCCcCCccceEEEEEEcCCCCEEEEee
Confidence 356663 45788876432221110 11255566654321 1233 447999888887664
No 101
>COG0262 FolA Dihydrofolate reductase [Coenzyme metabolism]
Probab=26.55 E-value=49 Score=18.51 Aligned_cols=9 Identities=22% Similarity=0.759 Sum_probs=7.3
Q ss_pred CeEEEEcCC
Q psy6079 32 GPLYAVGGH 40 (79)
Q Consensus 32 ~~iyv~GG~ 40 (79)
+.|||+||.
T Consensus 96 ~~i~IiGG~ 104 (167)
T COG0262 96 EDIFIIGGG 104 (167)
T ss_pred CeEEEEcCH
Confidence 569999993
No 102
>PF00415 RCC1: Regulator of chromosome condensation (RCC1) repeat; InterPro: IPR000408 The regulator of chromosome condensation (RCC1) [] is a eukaryotic protein which binds to chromatin and interacts with ran, a nuclear GTP-binding protein IPR002041 from INTERPRO, to promote the loss of bound GDP and the uptake of fresh GTP, thus acting as a guanine-nucleotide dissociation stimulator (GDS). The interaction of RCC1 with ran probably plays an important role in the regulation of gene expression. RCC1, known as PRP20 or SRM1 in yeast, pim1 in fission yeast and BJ1 in Drosophila, is a protein that contains seven tandem repeats of a domain of about 50 to 60 amino acids. As shown in the following schematic representation, the repeats make up the major part of the length of the protein. Outside the repeat region, there is just a small N-terminal domain of about 40 to 50 residues and, in the Drosophila protein only, a C-terminal domain of about 130 residues. +----+-------+-------+-------+-------+-------+-------+-------+-------------+ |N-t.|Rpt. 1 |Rpt. 2 |Rpt. 3 |Rpt. 4 |Rpt. 5 |Rpt. 6 |Rpt. 7 | C-terminal | +----+-------+-------+-------+-------+-------+-------+-------+-------------+ The RCC1-type of repeat is also found in the X-linked retinitis pigmentosa GTPase regulator []. The RCC repeats form a beta-propeller structure.; PDB: 3MVD_L 3OF7_A 1I2M_D 1A12_B 3KCI_A 4DNV_B 4DNU_A 4D9S_B 4DNW_A.
Probab=26.29 E-value=52 Score=13.85 Aligned_cols=9 Identities=33% Similarity=0.870 Sum_probs=6.9
Q ss_pred CCeEEEEcC
Q psy6079 31 EGPLYAVGG 39 (79)
Q Consensus 31 ~~~iyv~GG 39 (79)
++++|.+|-
T Consensus 1 dG~vy~wG~ 9 (51)
T PF00415_consen 1 DGRVYSWGS 9 (51)
T ss_dssp TSEEEEEEE
T ss_pred CCcEEEEEC
Confidence 468899983
No 103
>TIGR02608 delta_60_rpt delta-60 repeat domain. This domain occurs in tandem repeats, as many as 13, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The periodicity of the repeat ranges from about 57 to 61 amino acids, and a core region of about 54 is represented by this model and seed alignment.
Probab=26.05 E-value=80 Score=14.37 Aligned_cols=26 Identities=15% Similarity=0.179 Sum_probs=15.0
Q ss_pred CCCeEEEEcCCCC--CCCCCeEEEEeCC
Q psy6079 30 PEGPLYAVGGHDG--WSYLNTTERFDPR 55 (79)
Q Consensus 30 ~~~~iyv~GG~~~--~~~~~~~~~yd~~ 55 (79)
.+++|++.|-... ......+.+|++.
T Consensus 10 ~DGkIlv~G~~~~~~~~~~~~l~Rln~D 37 (55)
T TIGR02608 10 SDGKILVAGYVDNSSGNNDFVLARLNAD 37 (55)
T ss_pred CCCcEEEEEEeecCCCcccEEEEEECCC
Confidence 4889998885432 1223346666654
No 104
>PF10049 DUF2283: Protein of unknown function (DUF2283); InterPro: IPR019270 Members of this family of hypothetical proteins have no known function.
Probab=25.98 E-value=38 Score=14.88 Aligned_cols=11 Identities=9% Similarity=-0.239 Sum_probs=7.3
Q ss_pred CcceecCCCCC
Q psy6079 1 MQSSICLHLVF 11 (79)
Q Consensus 1 ~~~~~~~~~~~ 11 (79)
|.-+||++++.
T Consensus 1 Mki~YD~~~D~ 11 (50)
T PF10049_consen 1 MKIEYDPEADA 11 (50)
T ss_pred CEeEEcCcCCE
Confidence 56677777664
No 105
>smart00284 OLF Olfactomedin-like domains.
Probab=25.90 E-value=1.7e+02 Score=17.97 Aligned_cols=54 Identities=11% Similarity=0.060 Sum_probs=27.9
Q ss_pred ceecCCCC----CeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEE
Q psy6079 3 SSICLHLV----FWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWS 60 (79)
Q Consensus 3 ~~~~~~~~----~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~ 60 (79)
++.||.+- .|..--+-+..+ .+.++ =+.+|++-.... ....-.+.||..+++=+
T Consensus 156 SkLnp~tL~ve~tW~T~~~k~sa~-naFmv--CGvLY~~~s~~~-~~~~I~yayDt~t~~~~ 213 (255)
T smart00284 156 SKLNPATLTIENTWITTYNKRSAS-NAFMI--CGILYVTRSLGS-KGEKVFYAYDTNTGKEG 213 (255)
T ss_pred EeeCcccceEEEEEEcCCCccccc-ccEEE--eeEEEEEccCCC-CCcEEEEEEECCCCccc
Confidence 45666653 555522222222 23333 356888853221 12345778999988743
No 106
>cd03474 Rieske_T4moC Toluene-4-monooxygenase effector protein complex (T4mo), Rieske ferredoxin subunit; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. T4mo is a four-protein complex that catalyzes the NADH- and O2-dependent hydroxylation of toluene to form p-cresol. T4mo consists of an NADH oxidoreductase (T4moF), a diiron hydroxylase (T4moH), a catalytic effector protein (T4moD), and a Rieske ferredoxin (T4moC). T4moC contains a Rieske domain and functions as an obligate electron carrier between T4moF and T4moH. Rieske ferredoxins are found as subunits of membrane oxidase complexes, cis-dihydrodiol-forming aromatic dioxygenases, bacterial assimilatory nitrite reductases, and arsenite oxidase. Rieske ferredoxins are also found as soluble electron carriers in bacterial dioxygenase and monooxygenase complexes.
Probab=25.19 E-value=1.1e+02 Score=15.46 Aligned_cols=6 Identities=17% Similarity=0.446 Sum_probs=2.5
Q ss_pred CCeEEE
Q psy6079 31 EGPLYA 36 (79)
Q Consensus 31 ~~~iyv 36 (79)
++.|||
T Consensus 92 ~g~v~v 97 (108)
T cd03474 92 GGDILV 97 (108)
T ss_pred CCEEEE
Confidence 444443
No 107
>TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PubMed:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al.
Probab=24.85 E-value=1.3e+02 Score=16.51 Aligned_cols=24 Identities=8% Similarity=0.083 Sum_probs=17.7
Q ss_pred EEEEeCCCCcEEeccCCCCCCCce
Q psy6079 49 TERFDPRLRQWSFIAPMLSPRSTA 72 (79)
Q Consensus 49 ~~~yd~~~~~W~~~~~~~~~r~~~ 72 (79)
...+|+...+|..+-+.+..+...
T Consensus 113 A~fr~~~~~~Wr~~~~~~~~~~~~ 136 (146)
T TIGR03352 113 AAYRDIDNAEWRVVYKVPPKGKYK 136 (146)
T ss_pred EEeecCCCCceEEEEecCCCCcee
Confidence 445788889999987777666443
No 108
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=24.64 E-value=1e+02 Score=15.07 Aligned_cols=10 Identities=20% Similarity=0.494 Sum_probs=7.7
Q ss_pred CCeEEEEcCC
Q psy6079 31 EGPLYAVGGH 40 (79)
Q Consensus 31 ~~~iyv~GG~ 40 (79)
.|++.+.|..
T Consensus 57 sGki~itGak 66 (86)
T PF00352_consen 57 SGKIVITGAK 66 (86)
T ss_dssp TSEEEEEEES
T ss_pred CCEEEEEecC
Confidence 7888888863
No 109
>PRK10769 folA dihydrofolate reductase; Provisional
Probab=24.47 E-value=55 Score=18.26 Aligned_cols=9 Identities=22% Similarity=0.575 Sum_probs=7.2
Q ss_pred CeEEEEcCC
Q psy6079 32 GPLYAVGGH 40 (79)
Q Consensus 32 ~~iyv~GG~ 40 (79)
+.|||+||.
T Consensus 89 ~~I~viGG~ 97 (159)
T PRK10769 89 PEIMVIGGG 97 (159)
T ss_pred CCEEEECcH
Confidence 469999993
No 110
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=24.17 E-value=1.2e+02 Score=20.74 Aligned_cols=17 Identities=12% Similarity=0.198 Sum_probs=14.3
Q ss_pred CeEEEEeCCCCcEEecc
Q psy6079 47 NTTERFDPRLRQWSFIA 63 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~~ 63 (79)
.+++.|||.++.-+++.
T Consensus 287 GdIylydP~td~lekld 303 (668)
T COG4946 287 GDIYLYDPETDSLEKLD 303 (668)
T ss_pred CcEEEeCCCcCcceeee
Confidence 57899999999988763
No 111
>PF00045 Hemopexin: Hemopexin; InterPro: IPR018487 Hemopexin (3.2.1.35 from EC) is a serum glycoprotein that binds haem and transports it to the liver for breakdown and iron recovery, after which the free hemopexin returns to the circulation []. Hemopexin prevents haem-mediated oxidative stress. Structurally hemopexin consists of two similar halves of approximately two hundred amino acid residues connected by a histidine-rich hinge region. Each half is itself formed by the repetition of a basic unit of some 35 to 45 residues. Hemopexin-like domains have been found in two other types of proteins, vitronectin [], a cell adhesion and spreading factor found in plasma and tissues, and matrixins MMP-1, MMP-2, MMP-3, MMP-9, MMP-10, MMP-11, MMP-12, MMP-14, MMP-15 and MMP-16, members of the matrix metalloproteinase family that cleave extracellular matrix constituents []. These zinc endopeptidases, which belong to MEROPS peptidase subfamily M10A, have a single hemopexin-like domain in their C-terminal section. It is suggested that the hemopexin domain facilitates binding to a variety of molecules and proteins, for example the HX repeats of some matrixins bind tissue inhibitor of metallopeptidases (TIMPs). This entry represents the repeat found in hempoxein and related domains.; PDB: 1SU3_B 2CLT_B 3V6N_B 3C7X_A 1GXD_A 1RTG_A 1CK7_A 1GEN_A 1ITV_A 2JXY_A ....
Probab=23.89 E-value=74 Score=13.24 Aligned_cols=18 Identities=39% Similarity=0.556 Sum_probs=13.8
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCC
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPR 55 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~ 55 (79)
++++|+|-| +..++||..
T Consensus 9 ~g~~yfFkg-------~~ywr~~~~ 26 (45)
T PF00045_consen 9 NGKTYFFKG-------NQYWRFDES 26 (45)
T ss_dssp TTEEEEEET-------TEEEEEETT
T ss_pred CCcEEEEEC-------CEEEEEcCc
Confidence 389999976 567888776
No 112
>PF15494 SRCR_2: Scavenger receptor cysteine-rich domain
Probab=23.14 E-value=1.2e+02 Score=15.21 Aligned_cols=17 Identities=6% Similarity=0.243 Sum_probs=14.0
Q ss_pred CeEEEEeCCCCcEEecc
Q psy6079 47 NTTERFDPRLRQWSFIA 63 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~~ 63 (79)
+.+++|...+..|..+.
T Consensus 4 s~LQV~~~~~~~W~~VC 20 (98)
T PF15494_consen 4 SLLQVYSAASGSWLPVC 20 (98)
T ss_pred cEEEEEecCCCCEeeec
Confidence 56889999999999774
No 113
>smart00461 WH1 WASP homology region 1. Region of the Wiskott-Aldrich syndrome protein (WASp) that contains point mutations in the majority of patients with WAS. Unknown function. Ena-like WH1 domains bind polyproline-containing peptides, and that Homer contains a WH1 domain.
Probab=23.11 E-value=1.2e+02 Score=15.62 Aligned_cols=17 Identities=12% Similarity=0.417 Sum_probs=13.5
Q ss_pred CeEEEEeCCCCcEEecc
Q psy6079 47 NTTERFDPRLRQWSFIA 63 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~~ 63 (79)
..+..|+..++.|.+..
T Consensus 11 avV~~y~~~~~~W~~~~ 27 (106)
T smart00461 11 AVVQLYDADTKKWVPTG 27 (106)
T ss_pred EEEEEEeCCCCCeEECC
Confidence 46788999999998754
No 114
>PF12118 SprA-related: SprA-related family; InterPro: IPR021973 Most members of this uncharacterised protein family are annotated as being SprA-related.
Probab=22.78 E-value=52 Score=20.54 Aligned_cols=10 Identities=70% Similarity=1.361 Sum_probs=8.9
Q ss_pred CCCeEEEEcC
Q psy6079 30 PEGPLYAVGG 39 (79)
Q Consensus 30 ~~~~iyv~GG 39 (79)
.|++.|.+||
T Consensus 182 pDg~~YavgG 191 (294)
T PF12118_consen 182 PDGKRYAVGG 191 (294)
T ss_pred CCCceeeccc
Confidence 5889999998
No 115
>PF06462 Hyd_WA: Propeller; InterPro: IPR006624 Tectonins I and II are two dominant proteins in the nuclei and nuclear matrix from plasmodia of Physarum polycephalum (Slime mold) which encode 217 and 353 amino acids, respectively. Tectonin I is homologous to the C-terminal two-thirds of tectonin II. Both proteins contain six tandem repeats that are each 33-37 amino acids in length and define a new consensus sequence. Homologous repeats are found in L-6, a bacterial lipopolysaccharide-binding lectin from horseshoe crab hemocytes. The repetitive sequences of the tectonins and L-6 are reminiscent of the WD repeats of the beta-subunit of G proteins, suggesting that they form beta-propeller domains. The tectonins may be lectins that function as part of a transmembrane signalling complex during phagocytosis [].
Probab=22.70 E-value=68 Score=12.67 Aligned_cols=12 Identities=8% Similarity=-0.006 Sum_probs=7.1
Q ss_pred eecCCCCCeeec
Q psy6079 4 SICLHLVFWFPL 15 (79)
Q Consensus 4 ~~~~~~~~W~~~ 15 (79)
..+|....|..+
T Consensus 20 ~~~P~G~~W~~i 31 (32)
T PF06462_consen 20 PSNPEGTSWEHI 31 (32)
T ss_pred CCCCCCCCcEEC
Confidence 345666667654
No 116
>cd03475 Rieske_SoxF_SoxL SoxF and SoxL family, Rieske domain; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. SoxF is a subunit of the terminal oxidase supercomplex SoxM in the plasma membrane of Sulfolobus acidocaldarius that combines features of a cytochrome bc(1) complex and a cytochrome. The Rieske domain of SoxF has a 12 residue insertion which is not found in eukaryotic and bacterial Rieske proteins and is thought to influence the redox properties of the iron-sulfur cluster. SoxL is a Rieske protein which may be part of an archaeal bc-complex homologue whose physiological function is still unknown. SoxL has two features not seen in other Rieske proteins; (i) a significantly greater distance between the two cluster-binding sites and (ii) an unexpected Pro - Asp substitution at one of the cluster binding sites. SoxF and SoxL are found in archaea and in bacteria.
Probab=22.18 E-value=1.7e+02 Score=16.79 Aligned_cols=7 Identities=71% Similarity=1.123 Sum_probs=4.8
Q ss_pred CeEEEEc
Q psy6079 32 GPLYAVG 38 (79)
Q Consensus 32 ~~iyv~G 38 (79)
+.||++|
T Consensus 154 d~iyAvG 160 (171)
T cd03475 154 DDLYAVG 160 (171)
T ss_pred CcEEEEe
Confidence 4677776
No 117
>PRK09965 3-phenylpropionate dioxygenase ferredoxin subunit; Provisional
Probab=22.12 E-value=1.3e+02 Score=15.25 Aligned_cols=6 Identities=17% Similarity=0.678 Sum_probs=2.5
Q ss_pred CCeEEE
Q psy6079 31 EGPLYA 36 (79)
Q Consensus 31 ~~~iyv 36 (79)
++.|||
T Consensus 94 ~g~v~v 99 (106)
T PRK09965 94 GGDIFI 99 (106)
T ss_pred CCEEEE
Confidence 444443
No 118
>PF04096 Nucleoporin2: Nucleoporin autopeptidase; InterPro: IPR007230 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of autocatalytic serine endopeptidases belong to MEROPS peptidase family S59 (clan SP). The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is comprised of approximately 50 unique proteins, collectively known as nucleoporins. A number of the peptides are synthesised as precursors and undergo self-catalyzed cleavage. The proteolytic cleavage site of yeast Nup145p has been mapped upstream of an evolutionary conserved serine residue. Cleavage occurs at the same site when a precursor is artificially expressed in Escherichia coli. A hydroxyl-containing residue is critical for the reaction, although a thiol-containing residue offers an acceptable replacement. In vitro kinetics experiments using a purified precursor molecule demonstrate that the cleavage is self-catalyzed and that the catalytic domain lies within the N-terminal moiety. Taken altogether, the data are consistent with a proteolytic mechanism involving an N>O acyl rearrangement and a subsequent ester intermediate uncovered in other self-processing proteins []. Nup98 is a component of the nuclear pore that plays its primary role in the export of RNAs. Nup98 is expressed in two forms, derived from alternate mRNA splicing. Both forms are processed into two peptides through autoproteolysis mediated by the C-terminal domain of hNup98. The three-dimensional structure of the C-terminal domain reveals a novel protein fold, and thus a new class of autocatalytic proteases. The structure further reveals that the suggested nucleoporin RNA binding motif is unlikely to bind to RNA []. The following nucleoporins share an ~150-residue C-terminal domain responsible for NPC targeting [, ]: Vertebrate Nup98, a component of the nuclear pore that plays its primary role in the export of RNAs. Yeast Nup100, plays an important role in several nuclear export and import pathways including poly(A)+ RNA and protein transport. Yeast Nup116, involved in mRNA export and protein transport. Yeast Nup145, involved in nuclear poly(A)+ RNA and tRNA export. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains are necessary for the proper targeting to the nuclear pore [, ]. The NUP C-terminal domain adopts a predominantly beta-strand structure. The molecule consists of a six-stranded beta-sheet sandwiched against a two-stranded beta-sheet and flanked by alpha-helical regions. The N-terminal helical region consists of two short helices, whereas the stretch on the opposite side of molecule consists of a single, longer helix [, ].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3PBP_E 2AIV_A 1KO6_A 2Q5X_A 2Q5Y_C 3KEP_A 3KES_A 3NF5_B 3TKN_I.
Probab=21.95 E-value=1.2e+02 Score=16.64 Aligned_cols=16 Identities=25% Similarity=0.596 Sum_probs=12.7
Q ss_pred CeEEEEeCCCCcEEec
Q psy6079 47 NTTERFDPRLRQWSFI 62 (79)
Q Consensus 47 ~~~~~yd~~~~~W~~~ 62 (79)
..+..||+.+..|.-.
T Consensus 122 ~~FvsYd~~tG~W~F~ 137 (141)
T PF04096_consen 122 AEFVSYDPETGTWVFR 137 (141)
T ss_dssp SEEEEEETTTTEEEEE
T ss_pred CEEEEEeCCCcEEEEE
Confidence 4567899999999753
No 119
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=21.77 E-value=1.7e+02 Score=16.59 Aligned_cols=22 Identities=23% Similarity=0.459 Sum_probs=15.3
Q ss_pred CCeEEEEcCCCCCCCCCeEEEEeCCCCcE
Q psy6079 31 EGPLYAVGGHDGWSYLNTTERFDPRLRQW 59 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W 59 (79)
++++|++.| +..+.||..+++.
T Consensus 110 ~~~~yfFkg-------~~y~ry~~~~~~v 131 (194)
T cd00094 110 NGKTYFFKG-------DKYWRYDEKTQKM 131 (194)
T ss_pred CCEEEEEeC-------CEEEEEeCCCccc
Confidence 578999977 4567777655543
No 120
>COG2131 ComEB Deoxycytidylate deaminase [Nucleotide transport and metabolism]
Probab=21.74 E-value=1.6e+02 Score=16.80 Aligned_cols=25 Identities=12% Similarity=0.110 Sum_probs=16.0
Q ss_pred CccceeeeEEECCCCeEEEEcCCCCCC
Q psy6079 18 LSYMCLGVGVLEPEGPLYAVGGHDGWS 44 (79)
Q Consensus 18 ~~r~~~~~~~~~~~~~iyv~GG~~~~~ 44 (79)
-+|...+++.+ .++-.|..|+++..
T Consensus 25 c~r~~VGAvIv--kd~rIiatGYNG~p 49 (164)
T COG2131 25 CPRRQVGAVIV--KDGRIIATGYNGAP 49 (164)
T ss_pred CcccceeEEEE--eCCeEEEeecCCCC
Confidence 35666777777 55556666777643
No 121
>PF01743 PolyA_pol: Poly A polymerase head domain; InterPro: IPR002646 This group includes nucleic acid independent RNA polymerases, such as polynucleotide adenylyltransferase (2.7.7.19 from EC), which adds the poly (A) tail to mRNA. This group also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA 2.7.7.25 from EC.; GO: 0003723 RNA binding, 0016779 nucleotidyltransferase activity, 0006396 RNA processing; PDB: 1VFG_A 3H38_A 3H3A_B 3H39_B 3H37_A 1MIY_A 1MIV_B 1MIW_B 1OU5_B 3AQN_A ....
Probab=21.50 E-value=38 Score=17.93 Aligned_cols=7 Identities=57% Similarity=1.009 Sum_probs=4.8
Q ss_pred EEEEcCC
Q psy6079 34 LYAVGGH 40 (79)
Q Consensus 34 iyv~GG~ 40 (79)
+|++||+
T Consensus 1 ~ylVGG~ 7 (126)
T PF01743_consen 1 AYLVGGA 7 (126)
T ss_dssp EEEETHH
T ss_pred CEEecHH
Confidence 4777774
No 122
>PF12091 DUF3567: Protein of unknown function (DUF3567); InterPro: IPR021951 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 90 amino acids in length. This protein has a conserved EIVDK sequence motif.
Probab=21.33 E-value=1.3e+02 Score=15.17 Aligned_cols=39 Identities=10% Similarity=-0.055 Sum_probs=20.2
Q ss_pred CcceecCCCCCeeecC------CCccceeeeEEECCCCeEEEEcC
Q psy6079 1 MQSSICLHLVFWFPLL------LLSYMCLGVGVLEPEGPLYAVGG 39 (79)
Q Consensus 1 ~~~~~~~~~~~W~~~~------~~~r~~~~~~~~~~~~~iyv~GG 39 (79)
|+-.||...-.--.++ .+.|.++-.+=-..+..||+-|-
T Consensus 1 M~miYnSd~y~VV~~~~~~~~~~l~~gGyEIVDK~~~rEifi~G~ 45 (85)
T PF12091_consen 1 MQMIYNSDNYCVVEFPPDAGHPALARGGYEIVDKNARREIFIDGS 45 (85)
T ss_pred CceeecCCceEEEEecCCCCccchhcCCcEEeecCCCceEEeCcH
Confidence 5666776653333333 23355655554323456777654
No 123
>PF00568 WH1: WH1 domain; InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][]. WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,]. Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=20.72 E-value=1.4e+02 Score=15.33 Aligned_cols=15 Identities=27% Similarity=0.603 Sum_probs=10.8
Q ss_pred eEEEEeCCCCc-EEec
Q psy6079 48 TTERFDPRLRQ-WSFI 62 (79)
Q Consensus 48 ~~~~yd~~~~~-W~~~ 62 (79)
.++.+||.+++ |...
T Consensus 17 ~v~~~~p~~~~~W~~~ 32 (111)
T PF00568_consen 17 QVYQADPDTKRQWSPV 32 (111)
T ss_dssp EEEEEETTTSESEEES
T ss_pred EEEEEEcCCCCcEeeC
Confidence 46677887777 8874
No 124
>PF04122 CW_binding_2: Putative cell wall binding repeat 2; InterPro: IPR007253 This repeat is found in multiple tandem copies in proteins including amidase enhancers [] and adhesins [].
Probab=20.38 E-value=79 Score=15.46 Aligned_cols=11 Identities=27% Similarity=0.643 Sum_probs=8.0
Q ss_pred CCeEEEEcCCC
Q psy6079 31 EGPLYAVGGHD 41 (79)
Q Consensus 31 ~~~iyv~GG~~ 41 (79)
..++|++||..
T Consensus 73 ~~~v~iiGg~~ 83 (92)
T PF04122_consen 73 IKKVYIIGGEG 83 (92)
T ss_pred CCEEEEECCCC
Confidence 35789999853
No 125
>KOG2445|consensus
Probab=20.36 E-value=2.3e+02 Score=18.22 Aligned_cols=36 Identities=28% Similarity=0.457 Sum_probs=21.8
Q ss_pred CCeEEEEcCCCCCCCCC--eEEEEeCCCCcEEeccCCC
Q psy6079 31 EGPLYAVGGHDGWSYLN--TTERFDPRLRQWSFIAPML 66 (79)
Q Consensus 31 ~~~iyv~GG~~~~~~~~--~~~~yd~~~~~W~~~~~~~ 66 (79)
...++++|--......+ .++.|+...++|.+++.++
T Consensus 183 ~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~ 220 (361)
T KOG2445|consen 183 HEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELP 220 (361)
T ss_pred cCceEEEEcccCCccccceEEEEecCCcceeeeehhcC
Confidence 34566666433333333 4566777788999986553
Done!