Query psy2980
Match_columns 92
No_of_seqs 115 out of 1475
Neff 10.9
Searched_HMMs 46136
Date Fri Aug 16 22:29:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2980.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2980hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4441|consensus 99.9 1.5E-22 3.3E-27 129.6 8.7 90 1-92 385-475 (571)
2 KOG4441|consensus 99.9 2.6E-22 5.5E-27 128.5 8.7 91 1-92 337-427 (571)
3 PHA02713 hypothetical protein; 99.9 9.8E-22 2.1E-26 125.8 10.8 91 1-92 308-398 (557)
4 PHA02713 hypothetical protein; 99.9 8.4E-21 1.8E-25 121.6 10.1 81 12-92 431-512 (557)
5 PHA03098 kelch-like protein; P 99.8 8E-20 1.7E-24 116.5 10.2 90 1-91 299-388 (534)
6 PHA03098 kelch-like protein; P 99.8 1.5E-19 3.3E-24 115.3 10.0 90 1-91 347-436 (534)
7 PHA02790 Kelch-like protein; P 99.8 2.6E-19 5.7E-24 113.2 10.5 85 1-92 323-407 (480)
8 PHA02790 Kelch-like protein; P 99.8 6.4E-19 1.4E-23 111.4 10.8 87 1-92 276-362 (480)
9 PLN02153 epithiospecifier prot 99.8 2.4E-18 5.2E-23 104.9 11.0 90 1-91 90-192 (341)
10 TIGR03548 mutarot_permut cycli 99.8 2.4E-18 5.3E-23 104.2 10.5 89 1-91 77-170 (323)
11 PLN02153 epithiospecifier prot 99.8 4.4E-18 9.6E-23 103.7 10.7 90 1-91 37-136 (341)
12 PLN02193 nitrile-specifier pro 99.8 1E-17 2.2E-22 105.8 11.3 89 1-91 233-327 (470)
13 TIGR03547 muta_rot_YjhT mutatr 99.8 6.2E-18 1.4E-22 103.1 9.8 91 1-91 68-199 (346)
14 PF13964 Kelch_6: Kelch motif 99.7 1.9E-17 4E-22 75.5 6.6 50 34-83 1-50 (50)
15 PLN02193 nitrile-specifier pro 99.7 6.5E-17 1.4E-21 102.2 11.1 90 1-91 180-277 (470)
16 PRK14131 N-acetylneuraminic ac 99.7 5.7E-17 1.2E-21 100.0 9.8 91 1-91 89-220 (376)
17 KOG0379|consensus 99.7 5.3E-17 1.1E-21 102.8 9.5 92 1-92 127-224 (482)
18 TIGR03548 mutarot_permut cycli 99.7 8.4E-17 1.8E-21 97.5 9.7 77 1-80 128-205 (323)
19 TIGR03547 muta_rot_YjhT mutatr 99.7 5.6E-16 1.2E-20 94.6 10.0 84 1-89 22-114 (346)
20 PF01344 Kelch_1: Kelch motif; 99.6 7.8E-16 1.7E-20 69.2 4.6 47 34-80 1-47 (47)
21 KOG0379|consensus 99.6 1.1E-14 2.4E-19 92.3 9.2 91 1-91 178-274 (482)
22 PRK14131 N-acetylneuraminic ac 99.6 5.9E-15 1.3E-19 91.2 7.2 92 1-92 226-343 (376)
23 KOG4693|consensus 99.6 4.8E-15 1E-19 87.1 5.0 91 1-91 93-190 (392)
24 PF13415 Kelch_3: Galactose ox 99.6 2.4E-14 5.2E-19 64.9 6.2 48 44-91 1-49 (49)
25 KOG4693|consensus 99.6 2.2E-14 4.7E-19 84.4 7.3 91 1-91 196-301 (392)
26 PF07646 Kelch_2: Kelch motif; 99.5 1.3E-13 2.9E-18 62.4 6.3 47 34-80 1-49 (49)
27 PF13418 Kelch_4: Galactose ox 99.5 5.6E-14 1.2E-18 63.6 4.4 47 34-80 1-48 (49)
28 smart00612 Kelch Kelch domain. 99.5 1.2E-13 2.7E-18 61.7 4.7 46 46-92 1-46 (47)
29 KOG1230|consensus 99.4 7.5E-13 1.6E-17 81.6 7.0 78 13-90 154-239 (521)
30 KOG1230|consensus 99.4 1E-12 2.2E-17 81.0 7.0 79 13-91 98-186 (521)
31 smart00612 Kelch Kelch domain. 99.2 2.2E-11 4.7E-16 54.3 4.1 44 1-45 4-47 (47)
32 PF13415 Kelch_3: Galactose ox 99.2 2.5E-11 5.3E-16 54.9 4.1 43 1-43 6-49 (49)
33 KOG4152|consensus 99.2 1.2E-10 2.6E-15 74.0 6.8 90 1-92 220-332 (830)
34 PF13964 Kelch_6: Kelch motif 99.2 4.5E-11 9.9E-16 54.2 3.4 35 1-35 16-50 (50)
35 PF13854 Kelch_5: Kelch motif 99.1 3.8E-10 8.1E-15 49.4 5.5 40 31-70 1-41 (42)
36 PLN02772 guanylate kinase 99.0 6.4E-09 1.4E-13 64.7 7.7 60 33-92 23-85 (398)
37 COG3055 Uncharacterized protei 98.9 3.4E-09 7.4E-14 64.6 5.4 90 1-90 97-225 (381)
38 PF01344 Kelch_1: Kelch motif; 98.8 1.5E-09 3.3E-14 48.4 1.7 32 1-32 16-47 (47)
39 KOG4152|consensus 98.8 1.6E-09 3.5E-14 69.1 2.1 89 1-91 47-145 (830)
40 COG3055 Uncharacterized protei 98.7 4.5E-08 9.7E-13 59.8 5.9 75 15-89 60-142 (381)
41 PF13418 Kelch_4: Galactose ox 98.7 7.4E-09 1.6E-13 46.6 1.2 32 1-32 17-48 (49)
42 PF07646 Kelch_2: Kelch motif; 98.5 3.2E-07 7E-12 41.2 3.3 32 1-32 16-49 (49)
43 PLN02772 guanylate kinase 98.4 1.8E-06 3.9E-11 54.1 7.2 54 1-54 39-96 (398)
44 KOG2437|consensus 98.3 2.5E-07 5.3E-12 59.2 0.9 90 1-91 277-382 (723)
45 KOG2437|consensus 97.7 0.00013 2.8E-09 47.3 5.7 65 13-77 346-420 (723)
46 TIGR01640 F_box_assoc_1 F-box 97.3 0.004 8.7E-08 36.4 8.1 68 13-81 70-138 (230)
47 PF12768 Rax2: Cortical protei 97.0 0.0067 1.5E-07 36.9 7.1 76 2-78 3-81 (281)
48 PF07250 Glyoxal_oxid_N: Glyox 96.1 0.044 9.5E-07 32.8 6.2 74 14-90 47-125 (243)
49 PF07893 DUF1668: Protein of u 95.9 0.11 2.5E-06 32.5 7.4 71 14-84 87-171 (342)
50 PF07250 Glyoxal_oxid_N: Glyox 95.6 0.039 8.5E-07 33.0 4.6 50 2-54 83-138 (243)
51 TIGR01640 F_box_assoc_1 F-box 95.1 0.33 7.2E-06 28.4 7.3 65 13-78 14-87 (230)
52 PF03089 RAG2: Recombination a 93.1 0.37 8E-06 29.8 4.7 54 1-54 43-111 (337)
53 PF07893 DUF1668: Protein of u 93.0 0.84 1.8E-05 28.7 6.4 54 22-81 157-221 (342)
54 PF08268 FBA_3: F-box associat 91.8 1.2 2.5E-05 23.8 8.3 66 11-76 18-87 (129)
55 PF05096 Glu_cyclase_2: Glutam 88.6 4 8.6E-05 25.0 6.8 52 13-70 68-119 (264)
56 COG4257 Vgb Streptogramin lyas 88.3 2.8 6E-05 26.2 5.3 59 13-77 254-313 (353)
57 smart00284 OLF Olfactomedin-li 80.8 7 0.00015 23.9 4.7 45 31-81 70-114 (255)
58 PF03089 RAG2: Recombination a 77.8 5.6 0.00012 24.9 3.7 41 14-54 132-174 (337)
59 KOG2321|consensus 76.9 7 0.00015 26.8 4.2 53 12-72 154-208 (703)
60 PF02191 OLF: Olfactomedin-lik 74.4 9.7 0.00021 23.1 4.1 46 30-81 64-109 (250)
61 PF06433 Me-amine-dh_H: Methyl 73.4 14 0.0003 23.7 4.7 56 10-73 266-324 (342)
62 KOG0316|consensus 71.4 18 0.00038 22.3 4.6 52 14-71 124-175 (307)
63 PF03178 CPSF_A: CPSF A subuni 68.3 26 0.00056 21.7 6.6 71 13-88 107-178 (321)
64 PF14830 Haemocyan_bet_s: Haem 65.3 8.5 0.00018 20.2 2.2 28 42-69 34-61 (103)
65 PF14157 YmzC: YmzC-like prote 61.5 17 0.00037 17.2 2.7 15 14-28 42-56 (63)
66 PF05096 Glu_cyclase_2: Glutam 61.3 37 0.00079 21.0 6.3 50 38-91 48-98 (264)
67 PF07443 HARP: HepA-related pr 59.2 4.7 0.0001 18.6 0.6 13 16-28 21-33 (55)
68 PF11958 DUF3472: Domain of un 58.1 35 0.00075 19.7 4.1 23 14-36 81-103 (184)
69 PF12768 Rax2: Cortical protei 58.0 43 0.00094 20.8 6.1 63 11-78 62-130 (281)
70 PF06058 DCP1: Dcp1-like decap 57.4 15 0.00033 19.7 2.5 17 60-76 28-44 (122)
71 KOG2394|consensus 55.0 18 0.0004 24.8 2.9 13 43-55 342-354 (636)
72 cd01206 Homer Homer type EVH1 53.9 33 0.00072 18.3 5.1 20 11-30 9-29 (111)
73 PF08450 SGL: SMP-30/Gluconola 51.8 48 0.001 19.5 7.5 56 12-79 21-78 (246)
74 PF03178 CPSF_A: CPSF A subuni 49.7 61 0.0013 20.1 6.0 60 13-80 62-127 (321)
75 PF03088 Str_synth: Strictosid 48.0 26 0.00056 17.8 2.3 20 10-29 34-53 (89)
76 COG2380 Uncharacterized protei 45.9 28 0.0006 21.7 2.5 21 11-31 20-40 (327)
77 cd01207 Ena-Vasp Enabled-VASP- 45.1 49 0.0011 17.6 5.6 18 13-30 9-26 (111)
78 PRK11138 outer membrane biogen 42.3 91 0.002 20.0 7.4 57 13-76 79-147 (394)
79 PRK11408 hypothetical protein; 40.6 68 0.0015 18.0 3.4 30 47-77 34-63 (145)
80 PF13570 PQQ_3: PQQ-like domai 40.4 31 0.00067 14.0 4.8 26 38-70 15-40 (40)
81 PF07734 FBA_1: F-box associat 40.1 67 0.0015 17.8 4.7 40 41-81 2-41 (164)
82 PF03835 Rad4: Rad4 transgluta 39.5 46 0.00099 18.2 2.6 21 12-32 36-56 (145)
83 PF10902 DUF2693: Protein of u 38.2 46 0.001 16.7 2.2 20 57-76 45-64 (83)
84 PRK09511 nirD nitrite reductas 37.7 55 0.0012 17.0 2.7 32 16-49 75-106 (108)
85 PF08662 eIF2A: Eukaryotic tra 36.5 87 0.0019 18.0 4.2 25 44-71 111-135 (194)
86 PF00614 PLDc: Phospholipase D 35.7 35 0.00075 13.2 1.4 16 39-54 8-23 (28)
87 KOG2055|consensus 34.5 1.5E+02 0.0033 20.2 4.9 27 47-79 272-298 (514)
88 KOG2958|consensus 33.0 35 0.00076 21.6 1.6 20 11-30 9-28 (354)
89 PRK00178 tolB translocation pr 33.0 1.4E+02 0.003 19.4 7.2 57 14-76 268-326 (430)
90 PRK05137 tolB translocation pr 32.9 1.4E+02 0.0031 19.5 7.5 60 13-77 226-286 (435)
91 PRK04792 tolB translocation pr 32.8 1.5E+02 0.0032 19.6 7.2 59 13-77 242-302 (448)
92 PF09272 Hepsin-SRCR: Hepsin, 31.7 70 0.0015 16.9 2.3 20 60-79 11-30 (110)
93 PF14298 DUF4374: Domain of un 31.5 1.7E+02 0.0036 19.8 6.8 61 12-74 366-429 (435)
94 cd00837 EVH1 EVH1 (Enabled, Va 31.5 62 0.0013 16.7 2.2 17 60-76 8-24 (104)
95 KOG0179|consensus 31.2 35 0.00075 20.5 1.3 19 1-21 129-147 (235)
96 PF13952 DUF4216: Domain of un 30.8 75 0.0016 15.6 3.6 19 57-75 54-73 (75)
97 cd03476 Rieske_ArOX_small Smal 30.6 95 0.0021 16.7 4.3 35 16-51 78-115 (126)
98 TIGR02800 propeller_TolB tol-p 30.3 1.5E+02 0.0033 18.9 7.6 60 13-77 214-274 (417)
99 PF08950 DUF1861: Protein of u 30.0 1.5E+02 0.0032 18.8 4.1 42 11-53 52-95 (298)
100 cd03529 Rieske_NirD Assimilato 29.6 86 0.0019 15.9 2.8 6 18-23 73-78 (103)
101 TIGR03074 PQQ_membr_DH membran 29.1 2.3E+02 0.005 20.7 7.9 58 13-76 641-719 (764)
102 PF05237 MoeZ_MoeB: MoeZ/MoeB 29.1 81 0.0018 15.5 2.8 19 12-30 53-71 (84)
103 PF00568 WH1: WH1 domain; Int 29.0 88 0.0019 16.3 2.6 16 13-28 16-32 (111)
104 KOG1898|consensus 28.6 2.8E+02 0.006 21.4 6.0 31 60-90 997-1027(1205)
105 PF04096 Nucleoporin2: Nucleop 28.0 86 0.0019 17.4 2.5 16 12-27 121-136 (141)
106 KOG0294|consensus 27.6 1.8E+02 0.0039 18.9 4.8 27 39-71 47-73 (362)
107 PF10946 DUF2625: Protein of u 27.3 1.3E+02 0.0028 18.0 3.2 30 47-77 111-140 (208)
108 PF10282 Lactonase: Lactonase, 27.3 1.7E+02 0.0036 18.5 6.6 76 2-80 4-83 (345)
109 PF09910 DUF2139: Uncharacteri 27.2 1.8E+02 0.0039 18.8 6.5 17 57-73 169-185 (339)
110 PF14741 GH114_assoc: N-termin 27.1 1E+02 0.0022 17.0 2.6 20 14-33 64-84 (129)
111 TIGR02377 MocE_fam_FeS Rieske 27.0 98 0.0021 15.7 3.4 13 15-27 26-38 (101)
112 PRK01029 tolB translocation pr 26.5 2E+02 0.0042 19.0 7.3 59 14-78 352-412 (428)
113 KOG2111|consensus 26.4 1.9E+02 0.0041 18.8 6.5 51 1-51 63-119 (346)
114 PF13360 PQQ_2: PQQ-like domai 26.1 1.4E+02 0.003 17.2 8.3 55 14-77 4-64 (238)
115 PF03022 MRJP: Major royal jel 25.4 1.8E+02 0.0038 18.1 9.3 62 12-79 33-107 (287)
116 TIGR02378 nirD_assim_sml nitri 24.5 1.1E+02 0.0024 15.5 2.7 7 17-23 73-79 (105)
117 PF07453 NUMOD1: NUMOD1 domain 24.1 68 0.0015 12.8 1.9 12 14-25 2-13 (37)
118 PF09816 EAF: RNA polymerase I 23.9 1.2E+02 0.0027 15.8 3.5 28 47-74 65-92 (109)
119 PRK09965 3-phenylpropionate di 23.2 1.2E+02 0.0026 15.5 3.7 10 40-49 90-99 (106)
120 TIGR02658 TTQ_MADH_Hv methylam 22.8 2.3E+02 0.0049 18.5 6.0 54 11-72 277-333 (352)
121 cd03583 NTR_complement_C3 NTR/ 22.8 1.5E+02 0.0034 16.5 3.1 36 44-79 86-121 (149)
122 PF12904 Collagen_bind_2: Puta 22.7 1.3E+02 0.0027 15.5 2.5 18 13-30 49-66 (93)
123 TIGR03352 VI_chp_3 type VI sec 22.6 1.4E+02 0.003 16.6 2.7 23 63-85 113-135 (146)
124 cd03584 NTR_complement_C4 NTR/ 22.5 1.6E+02 0.0034 16.5 3.2 36 44-79 90-125 (153)
125 PF15232 DUF4585: Domain of un 22.4 1.1E+02 0.0024 15.1 2.0 15 16-30 32-46 (75)
126 cd03548 Rieske_RO_Alpha_OMO_CA 22.3 1.5E+02 0.0032 16.1 4.5 37 16-52 81-124 (136)
127 KOG4499|consensus 21.9 2.2E+02 0.0047 17.9 4.1 39 11-51 231-273 (310)
128 PRK04043 tolB translocation pr 21.8 2.5E+02 0.0054 18.6 7.2 63 13-80 213-276 (419)
129 TIGR03032 conserved hypothetic 21.2 2.5E+02 0.0054 18.3 4.0 33 42-80 210-242 (335)
130 PRK04922 tolB translocation pr 21.1 2.5E+02 0.0055 18.4 7.2 58 14-76 229-287 (433)
131 KOG2445|consensus 21.0 2.5E+02 0.0054 18.3 4.7 22 10-31 198-219 (361)
132 PF15525 DUF4652: Domain of un 20.9 2E+02 0.0044 17.1 8.2 68 10-79 85-158 (200)
133 PF08309 LVIVD: LVIVD repeat; 20.4 95 0.0021 13.2 4.5 26 37-69 4-29 (42)
No 1
>KOG4441|consensus
Probab=99.88 E-value=1.5e-22 Score=129.56 Aligned_cols=90 Identities=33% Similarity=0.760 Sum_probs=81.2
Q ss_pred CccccCC-CCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 1 MLRGWHA-STENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 1 v~GG~~~-~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
++||.++ ..+++ +++|||.+++|..+++|+.+|.++++++++++||++||.++....++++++|||.+++|+.+++|
T Consensus 385 avGG~dg~~~l~s--vE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M 462 (571)
T KOG4441|consen 385 AVGGFDGEKSLNS--VECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPM 462 (571)
T ss_pred EEecccccccccc--EEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCc
Confidence 4799883 34555 99999999999999999999999999999999999999987444899999999999999999999
Q ss_pred CCCccceeeEEcC
Q psy2980 80 PSARSWLGCVPLQ 92 (92)
Q Consensus 80 ~~~~~~~~~~~~~ 92 (92)
+.+|..+++++++
T Consensus 463 ~~~R~~~g~a~~~ 475 (571)
T KOG4441|consen 463 NTRRSGFGVAVLN 475 (571)
T ss_pred ccccccceEEEEC
Confidence 9999999998764
No 2
>KOG4441|consensus
Probab=99.88 E-value=2.6e-22 Score=128.54 Aligned_cols=91 Identities=33% Similarity=0.644 Sum_probs=82.4
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
++||.+......+.+++||+.+++|..+++|+.+|..++++++++.||++||.++ ...++++++|||.+++|..+++|+
T Consensus 337 v~GG~~~~~~~l~~ve~YD~~~~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg-~~~l~svE~YDp~~~~W~~va~m~ 415 (571)
T KOG4441|consen 337 VVGGYDSGSDRLSSVERYDPRTNQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDG-EKSLNSVECYDPVTNKWTPVAPML 415 (571)
T ss_pred EEccccCCCcccceEEEecCCCCceeccCCccCccccceeEEECCEEEEEecccc-ccccccEEEecCCCCcccccCCCC
Confidence 5799874344555699999999999999999999999999999999999999986 677999999999999999999999
Q ss_pred CCccceeeEEcC
Q psy2980 81 SARSWLGCVPLQ 92 (92)
Q Consensus 81 ~~~~~~~~~~~~ 92 (92)
.+|..+++++++
T Consensus 416 ~~r~~~gv~~~~ 427 (571)
T KOG4441|consen 416 TRRSGHGVAVLG 427 (571)
T ss_pred cceeeeEEEEEC
Confidence 999999998864
No 3
>PHA02713 hypothetical protein; Provisional
Probab=99.88 E-value=9.8e-22 Score=125.79 Aligned_cols=91 Identities=31% Similarity=0.586 Sum_probs=79.5
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
|+||.+......+++++||+.+++|..+++||.+|..+++++++++||++||.++ ....+++++|||.+++|..+++||
T Consensus 308 viGG~~~~~~~~~~v~~Yd~~~n~W~~~~~m~~~R~~~~~~~~~g~IYviGG~~~-~~~~~sve~Ydp~~~~W~~~~~mp 386 (557)
T PHA02713 308 IAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQNG-TNVERTIECYTMGDDKWKMLPDMP 386 (557)
T ss_pred EEcCCCCCCCccceEEEEECCCCeEeeCCCCcchhhceeEEEECCEEEEECCcCC-CCCCceEEEEECCCCeEEECCCCC
Confidence 5788753333456799999999999999999999999999999999999999865 445789999999999999999999
Q ss_pred CCccceeeEEcC
Q psy2980 81 SARSWLGCVPLQ 92 (92)
Q Consensus 81 ~~~~~~~~~~~~ 92 (92)
.+|..+++++++
T Consensus 387 ~~r~~~~~~~~~ 398 (557)
T PHA02713 387 IALSSYGMCVLD 398 (557)
T ss_pred cccccccEEEEC
Confidence 999999888764
No 4
>PHA02713 hypothetical protein; Provisional
Probab=99.85 E-value=8.4e-21 Score=121.55 Aligned_cols=81 Identities=23% Similarity=0.384 Sum_probs=72.1
Q ss_pred cceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCC-CceEEcCCCCCCccceeeEE
Q psy2980 12 TNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN-DSWEIMSHLPSARSWLGCVP 90 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~-~~W~~~~~~~~~~~~~~~~~ 90 (92)
.+++++|||.+++|..+++|+.+|..+++++++++||++||.++.....+.+++|||.+ ++|..+++||.+|..+++++
T Consensus 431 ~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~ 510 (557)
T PHA02713 431 SNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTIL 510 (557)
T ss_pred cceEEEECCCCCeEeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEE
Confidence 45699999999999999999999999999999999999999864333346789999999 89999999999999999988
Q ss_pred cC
Q psy2980 91 LQ 92 (92)
Q Consensus 91 ~~ 92 (92)
++
T Consensus 511 ~~ 512 (557)
T PHA02713 511 HD 512 (557)
T ss_pred EC
Confidence 64
No 5
>PHA03098 kelch-like protein; Provisional
Probab=99.83 E-value=8e-20 Score=116.54 Aligned_cols=90 Identities=21% Similarity=0.385 Sum_probs=80.0
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
|+||.+......+++++||+.+++|..+++||.+|..+++++++++||++||... ....+++++||+.+++|+.+++||
T Consensus 299 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~-~~~~~~v~~yd~~~~~W~~~~~lp 377 (534)
T PHA03098 299 FIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYN-SISLNTVESWKPGESKWREEPPLI 377 (534)
T ss_pred EECCCcCCCCeeccEEEEeCCCCeeeECCCCCcccccceEEEECCEEEEEeCCCC-CEecceEEEEcCCCCceeeCCCcC
Confidence 5788875555556799999999999999999999999999999999999999874 556889999999999999999999
Q ss_pred CCccceeeEEc
Q psy2980 81 SARSWLGCVPL 91 (92)
Q Consensus 81 ~~~~~~~~~~~ 91 (92)
.+|..++++++
T Consensus 378 ~~r~~~~~~~~ 388 (534)
T PHA03098 378 FPRYNPCVVNV 388 (534)
T ss_pred cCCccceEEEE
Confidence 99999888765
No 6
>PHA03098 kelch-like protein; Provisional
Probab=99.82 E-value=1.5e-19 Score=115.27 Aligned_cols=90 Identities=24% Similarity=0.477 Sum_probs=77.9
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
++||.+. ....+++++||+.+++|..++++|.+|..+++++++++||++||.......++++++||+.+++|..++++|
T Consensus 347 v~GG~~~-~~~~~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p 425 (534)
T PHA03098 347 VIGGIYN-SISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLP 425 (534)
T ss_pred EEeCCCC-CEecceEEEEcCCCCceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCC
Confidence 4788763 234556999999999999999999999999999999999999997543556789999999999999999999
Q ss_pred CCccceeeEEc
Q psy2980 81 SARSWLGCVPL 91 (92)
Q Consensus 81 ~~~~~~~~~~~ 91 (92)
.+|..++++++
T Consensus 426 ~~r~~~~~~~~ 436 (534)
T PHA03098 426 ISHYGGCAIYH 436 (534)
T ss_pred ccccCceEEEE
Confidence 99998887765
No 7
>PHA02790 Kelch-like protein; Provisional
Probab=99.82 E-value=2.6e-19 Score=113.16 Aligned_cols=85 Identities=19% Similarity=0.341 Sum_probs=73.5
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
++||.+. . +.+++||+.+++|..+++||.+|..+++++++++||++||... ..+.+++|||.+++|+.+++|+
T Consensus 323 viGG~~~--~--~sve~ydp~~n~W~~~~~l~~~r~~~~~~~~~g~IYviGG~~~---~~~~ve~ydp~~~~W~~~~~m~ 395 (480)
T PHA02790 323 VVGGLPN--P--TSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSE---TDTTTEYLLPNHDQWQFGPSTY 395 (480)
T ss_pred EECCcCC--C--CceEEEECCCCeEEECCCCCCCCcccEEEEECCEEEEecCcCC---CCccEEEEeCCCCEEEeCCCCC
Confidence 4688642 2 3499999999999999999999999999999999999999753 2367899999999999999999
Q ss_pred CCccceeeEEcC
Q psy2980 81 SARSWLGCVPLQ 92 (92)
Q Consensus 81 ~~~~~~~~~~~~ 92 (92)
.+|..+++++++
T Consensus 396 ~~r~~~~~~~~~ 407 (480)
T PHA02790 396 YPHYKSCALVFG 407 (480)
T ss_pred CccccceEEEEC
Confidence 999998887653
No 8
>PHA02790 Kelch-like protein; Provisional
Probab=99.81 E-value=6.4e-19 Score=111.41 Aligned_cols=87 Identities=23% Similarity=0.397 Sum_probs=75.3
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
++||.+. ....+.+++||+.+++|..+++|+.+|..+++++++++||++||.+. .+++++|||.+++|..+++||
T Consensus 276 viGG~~~-~~~~~~v~~Ydp~~~~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~~----~~sve~ydp~~n~W~~~~~l~ 350 (480)
T PHA02790 276 LIGGWMN-NEIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPN----PTSVERWFHGDAAWVNMPSLL 350 (480)
T ss_pred EEcCCCC-CCcCCeEEEEECCCCEEEECCCCCchhhcceEEEECCEEEEECCcCC----CCceEEEECCCCeEEECCCCC
Confidence 4788753 23445699999999999999999999999999999999999999753 367899999999999999999
Q ss_pred CCccceeeEEcC
Q psy2980 81 SARSWLGCVPLQ 92 (92)
Q Consensus 81 ~~~~~~~~~~~~ 92 (92)
.+|..+++++++
T Consensus 351 ~~r~~~~~~~~~ 362 (480)
T PHA02790 351 KPRCNPAVASIN 362 (480)
T ss_pred CCCcccEEEEEC
Confidence 999998887753
No 9
>PLN02153 epithiospecifier protein
Probab=99.79 E-value=2.4e-18 Score=104.88 Aligned_cols=90 Identities=19% Similarity=0.389 Sum_probs=72.7
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCC-----CCCcccceEEEECCEEEEEcCcCCCC-----CccceEEEEeCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPM-----KEKRYRPGIAVIDGKIYVLGGEEGWD-----GYHDSIECYDVDN 70 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~-----~~~~~~~~~~~~~~~iy~~GG~~~~~-----~~~~~~~~y~~~~ 70 (92)
||||.+... ..+++++||+.+++|..++++ |.+|..+++++.+++||++||..... ..++++++||+.+
T Consensus 90 v~GG~~~~~-~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~ 168 (341)
T PLN02153 90 IFGGRDEKR-EFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIAD 168 (341)
T ss_pred EECCCCCCC-ccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEEEEECCC
Confidence 589976432 345699999999999999876 78899999999999999999985311 2357899999999
Q ss_pred CceEEcCCCC---CCccceeeEEc
Q psy2980 71 DSWEIMSHLP---SARSWLGCVPL 91 (92)
Q Consensus 71 ~~W~~~~~~~---~~~~~~~~~~~ 91 (92)
++|..++++. .+|..++++++
T Consensus 169 ~~W~~l~~~~~~~~~r~~~~~~~~ 192 (341)
T PLN02153 169 GKWVQLPDPGENFEKRGGAGFAVV 192 (341)
T ss_pred CeEeeCCCCCCCCCCCCcceEEEE
Confidence 9999998764 67888877654
No 10
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.79 E-value=2.4e-18 Score=104.17 Aligned_cols=89 Identities=20% Similarity=0.285 Sum_probs=74.2
Q ss_pred CccccCCCCCCcceEEEeeCCCCeE----EeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEc
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTW----EYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM 76 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W----~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~ 76 (92)
++||.+.. ...+++++||+.+++| ..++++|.+|..+++++++++||++||... ....+++++||+.+++|+++
T Consensus 77 viGG~~~~-~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~-~~~~~~v~~yd~~~~~W~~~ 154 (323)
T TIGR03548 77 YIGGSNSS-ERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRN-GKPSNKSYLFNLETQEWFEL 154 (323)
T ss_pred EEcCCCCC-CCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCC-CccCceEEEEcCCCCCeeEC
Confidence 47887632 2456799999999987 778999999999999999999999999754 45579999999999999999
Q ss_pred CCCCC-CccceeeEEc
Q psy2980 77 SHLPS-ARSWLGCVPL 91 (92)
Q Consensus 77 ~~~~~-~~~~~~~~~~ 91 (92)
++||. +|..+.++++
T Consensus 155 ~~~p~~~r~~~~~~~~ 170 (323)
T TIGR03548 155 PDFPGEPRVQPVCVKL 170 (323)
T ss_pred CCCCCCCCCcceEEEE
Confidence 99884 7777776654
No 11
>PLN02153 epithiospecifier protein
Probab=99.78 E-value=4.4e-18 Score=103.70 Aligned_cols=90 Identities=17% Similarity=0.380 Sum_probs=72.0
Q ss_pred CccccCCC-CCCcceEEEeeCCCCeEEeeCCCC-CCc---ccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEE
Q psy2980 1 MLRGWHAS-TENTNRVECYHIAENTWEYKSPMK-EKR---YRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEI 75 (92)
Q Consensus 1 v~GG~~~~-~~~~~~v~~yd~~~~~W~~~~~~~-~~~---~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~ 75 (92)
|+||.... ....+++++||+.+++|..+++++ .++ ..+++++++++||++||... ...++++++||+.+++|+.
T Consensus 37 v~GG~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~-~~~~~~v~~yd~~t~~W~~ 115 (341)
T PLN02153 37 SFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISCLGVRMVAVGTKLYIFGGRDE-KREFSDFYSYDTVKNEWTF 115 (341)
T ss_pred EECCccCCCCceeCcEEEEECCCCEEEEcCccCCCCCCccCceEEEEECCEEEEECCCCC-CCccCcEEEEECCCCEEEE
Confidence 57887532 223457999999999999987653 344 36788899999999999865 4557899999999999999
Q ss_pred cCCC-----CCCccceeeEEc
Q psy2980 76 MSHL-----PSARSWLGCVPL 91 (92)
Q Consensus 76 ~~~~-----~~~~~~~~~~~~ 91 (92)
+++| |.+|..|+++++
T Consensus 116 ~~~~~~~~~p~~R~~~~~~~~ 136 (341)
T PLN02153 116 LTKLDEEGGPEARTFHSMASD 136 (341)
T ss_pred eccCCCCCCCCCceeeEEEEE
Confidence 9877 788998888765
No 12
>PLN02193 nitrile-specifier protein
Probab=99.77 E-value=1e-17 Score=105.77 Aligned_cols=89 Identities=21% Similarity=0.376 Sum_probs=74.2
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCC---CCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPM---KEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
|+||.+.. ...+++++||+.+++|.+++++ |.+|..+++++.+++||++||.+. ...+++++.||+.+++|+.++
T Consensus 233 vfGG~~~~-~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~-~~~~~~~~~yd~~t~~W~~~~ 310 (470)
T PLN02193 233 VFGGRDAS-RQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSA-TARLKTLDSYNIVDKKWFHCS 310 (470)
T ss_pred EECCCCCC-CCCccEEEEECCCCEEEEcCcCCCCCCCccceEEEEECCEEEEECCCCC-CCCcceEEEEECCCCEEEeCC
Confidence 57887632 2345699999999999999877 789999999999999999999865 445789999999999999986
Q ss_pred C---CCCCccceeeEEc
Q psy2980 78 H---LPSARSWLGCVPL 91 (92)
Q Consensus 78 ~---~~~~~~~~~~~~~ 91 (92)
+ ++.+|..++++++
T Consensus 311 ~~~~~~~~R~~~~~~~~ 327 (470)
T PLN02193 311 TPGDSFSIRGGAGLEVV 327 (470)
T ss_pred CCCCCCCCCCCcEEEEE
Confidence 4 5678888877664
No 13
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.77 E-value=6.2e-18 Score=103.14 Aligned_cols=91 Identities=15% Similarity=0.226 Sum_probs=69.3
Q ss_pred CccccCCCC-----CCcceEEEeeCCCCeEEeeC-CCCCCcccceEE-EECCEEEEEcCcCCCC----------------
Q psy2980 1 MLRGWHAST-----ENTNRVECYHIAENTWEYKS-PMKEKRYRPGIA-VIDGKIYVLGGEEGWD---------------- 57 (92)
Q Consensus 1 v~GG~~~~~-----~~~~~v~~yd~~~~~W~~~~-~~~~~~~~~~~~-~~~~~iy~~GG~~~~~---------------- 57 (92)
|+||.+... ...+++++||+.+++|..++ .+|.++..++++ +++++||++||.+...
T Consensus 68 v~GG~~~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~ 147 (346)
T TIGR03547 68 VFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSE 147 (346)
T ss_pred EEeCCCCCCCCCcceecccEEEEECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccch
Confidence 578875321 13456999999999999997 456666666665 6899999999975310
Q ss_pred -----------------CccceEEEEeCCCCceEEcCCCCC-CccceeeEEc
Q psy2980 58 -----------------GYHDSIECYDVDNDSWEIMSHLPS-ARSWLGCVPL 91 (92)
Q Consensus 58 -----------------~~~~~~~~y~~~~~~W~~~~~~~~-~~~~~~~~~~ 91 (92)
...+++++|||.+++|+.+++||. +|..++++++
T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~ 199 (346)
T TIGR03547 148 PKDKLIAAYFSQPPEDYFWNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHK 199 (346)
T ss_pred hhhhhHHHHhCCChhHcCccceEEEEECCCCceeECccCCCCcCCCceEEEE
Confidence 124789999999999999999996 6777777665
No 14
>PF13964 Kelch_6: Kelch motif
Probab=99.74 E-value=1.9e-17 Score=75.49 Aligned_cols=50 Identities=36% Similarity=0.798 Sum_probs=46.0
Q ss_pred CcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCc
Q psy2980 34 KRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSAR 83 (92)
Q Consensus 34 ~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~ 83 (92)
+|..+++++++++||++||........+++++||+.+++|+++++||.+|
T Consensus 1 pR~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR 50 (50)
T ss_pred CCccCEEEEECCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence 58889999999999999999765678999999999999999999999887
No 15
>PLN02193 nitrile-specifier protein
Probab=99.74 E-value=6.5e-17 Score=102.18 Aligned_cols=90 Identities=14% Similarity=0.397 Sum_probs=71.5
Q ss_pred CccccCCCC-CCcceEEEeeCCCCeEEeeCC---CCC-CcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEE
Q psy2980 1 MLRGWHAST-ENTNRVECYHIAENTWEYKSP---MKE-KRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEI 75 (92)
Q Consensus 1 v~GG~~~~~-~~~~~v~~yd~~~~~W~~~~~---~~~-~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~ 75 (92)
|+||..... ...+++++||+.+++|..++. +|. .|..+++++++++||++||... ...++++++||+.+++|++
T Consensus 180 v~GG~~~~~~~~~~~v~~yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~~~~lYvfGG~~~-~~~~ndv~~yD~~t~~W~~ 258 (470)
T PLN02193 180 SFGGEFTPNQPIDKHLYVFDLETRTWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDA-SRQYNGFYSFDTTTNEWKL 258 (470)
T ss_pred EECCcCCCCCCeeCcEEEEECCCCEEEeCCCCCCCCCCcccceEEEEECCEEEEECCCCC-CCCCccEEEEECCCCEEEE
Confidence 578875322 233569999999999998864 333 2457788899999999999865 4568999999999999999
Q ss_pred cCCC---CCCccceeeEEc
Q psy2980 76 MSHL---PSARSWLGCVPL 91 (92)
Q Consensus 76 ~~~~---~~~~~~~~~~~~ 91 (92)
+.++ |.+|..|+++++
T Consensus 259 l~~~~~~P~~R~~h~~~~~ 277 (470)
T PLN02193 259 LTPVEEGPTPRSFHSMAAD 277 (470)
T ss_pred cCcCCCCCCCccceEEEEE
Confidence 9888 889999988765
No 16
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.73 E-value=5.7e-17 Score=100.04 Aligned_cols=91 Identities=16% Similarity=0.219 Sum_probs=69.5
Q ss_pred CccccCC-C----CCCcceEEEeeCCCCeEEeeCC-CCCCcccceEEE-ECCEEEEEcCcCCC-----------------
Q psy2980 1 MLRGWHA-S----TENTNRVECYHIAENTWEYKSP-MKEKRYRPGIAV-IDGKIYVLGGEEGW----------------- 56 (92)
Q Consensus 1 v~GG~~~-~----~~~~~~v~~yd~~~~~W~~~~~-~~~~~~~~~~~~-~~~~iy~~GG~~~~----------------- 56 (92)
|+||... . ....+++++||+.+++|..++. +|.++..+++++ .+++||++||....
T Consensus 89 V~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~ 168 (376)
T PRK14131 89 VFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDKT 168 (376)
T ss_pred EEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccchh
Confidence 5788753 1 1234679999999999999985 466666777666 89999999997521
Q ss_pred ----------------CCccceEEEEeCCCCceEEcCCCCC-CccceeeEEc
Q psy2980 57 ----------------DGYHDSIECYDVDNDSWEIMSHLPS-ARSWLGCVPL 91 (92)
Q Consensus 57 ----------------~~~~~~~~~y~~~~~~W~~~~~~~~-~~~~~~~~~~ 91 (92)
....+++++||+.+++|..++++|. +|..++++++
T Consensus 169 ~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~ 220 (376)
T PRK14131 169 PKDKINDAYFDKKPEDYFFNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIK 220 (376)
T ss_pred hhhhhHHHHhcCChhhcCcCceEEEEECCCCeeeECCcCCCCCCCcceEEEE
Confidence 0124789999999999999999996 6777777654
No 17
>KOG0379|consensus
Probab=99.73 E-value=5.3e-17 Score=102.80 Aligned_cols=92 Identities=22% Similarity=0.379 Sum_probs=79.7
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEc-
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM- 76 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~- 76 (92)
+|||.+....+.++++.||+.|++|..+. .+|.+|.+|++++.++++|++||........+++++||+.+.+|+++
T Consensus 127 lfGG~~~~~~~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~ 206 (482)
T KOG0379|consen 127 LFGGTDKKYRNLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELD 206 (482)
T ss_pred EEccccCCCCChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccceecc
Confidence 58998754556678999999999998885 46889999999999999999999977555899999999999999987
Q ss_pred --CCCCCCccceeeEEcC
Q psy2980 77 --SHLPSARSWLGCVPLQ 92 (92)
Q Consensus 77 --~~~~~~~~~~~~~~~~ 92 (92)
.+.|.||..|++++++
T Consensus 207 ~~g~~P~pR~gH~~~~~~ 224 (482)
T KOG0379|consen 207 TQGEAPSPRYGHAMVVVG 224 (482)
T ss_pred cCCCCCCCCCCceEEEEC
Confidence 4668899999998763
No 18
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.72 E-value=8.4e-17 Score=97.51 Aligned_cols=77 Identities=16% Similarity=0.295 Sum_probs=62.5
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCC-CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMK-EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~-~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
|+||... ....+++++||+.+++|..++++| .+|..+.+++++++||++||.+. ....++++||+.+++|+.+++|
T Consensus 128 v~GG~~~-~~~~~~v~~yd~~~~~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~--~~~~~~~~yd~~~~~W~~~~~~ 204 (323)
T TIGR03548 128 VGGGNRN-GKPSNKSYLFNLETQEWFELPDFPGEPRVQPVCVKLQNELYVFGGGSN--IAYTDGYKYSPKKNQWQKVADP 204 (323)
T ss_pred EEeCcCC-CccCceEEEEcCCCCCeeECCCCCCCCCCcceEEEECCEEEEEcCCCC--ccccceEEEecCCCeeEECCCC
Confidence 4788642 223566999999999999999887 47888888899999999999853 2345789999999999999876
Q ss_pred C
Q psy2980 80 P 80 (92)
Q Consensus 80 ~ 80 (92)
+
T Consensus 205 ~ 205 (323)
T TIGR03548 205 T 205 (323)
T ss_pred C
Confidence 3
No 19
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.69 E-value=5.6e-16 Score=94.63 Aligned_cols=84 Identities=21% Similarity=0.312 Sum_probs=66.8
Q ss_pred CccccCCCCCCcceEEEeeC--CCCeEEeeCCCC-CCcccceEEEECCEEEEEcCcCCCC-----CccceEEEEeCCCCc
Q psy2980 1 MLRGWHASTENTNRVECYHI--AENTWEYKSPMK-EKRYRPGIAVIDGKIYVLGGEEGWD-----GYHDSIECYDVDNDS 72 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~--~~~~W~~~~~~~-~~~~~~~~~~~~~~iy~~GG~~~~~-----~~~~~~~~y~~~~~~ 72 (92)
|+||.. .+.+++||+ .+++|..+++|| .+|..+++++++++||++||..... ..++++++|||.+++
T Consensus 22 v~GG~~-----~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~~ 96 (346)
T TIGR03547 22 VGLGSA-----GTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNS 96 (346)
T ss_pred EEcccc-----CCeeEEEECCCCCCCceECCCCCCCCcccceEEEECCEEEEEeCCCCCCCCCcceecccEEEEECCCCE
Confidence 356753 245899996 578899999998 5899999999999999999985311 257899999999999
Q ss_pred eEEcC-CCCCCccceeeE
Q psy2980 73 WEIMS-HLPSARSWLGCV 89 (92)
Q Consensus 73 W~~~~-~~~~~~~~~~~~ 89 (92)
|+.++ ++|.+|..++++
T Consensus 97 W~~~~~~~p~~~~~~~~~ 114 (346)
T TIGR03547 97 WQKLDTRSPVGLLGASGF 114 (346)
T ss_pred EecCCCCCCCcccceeEE
Confidence 99997 566777666555
No 20
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.63 E-value=7.8e-16 Score=69.20 Aligned_cols=47 Identities=34% Similarity=0.698 Sum_probs=43.1
Q ss_pred CcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 34 KRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 34 ~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
+|..+++++++++||++||.......++++++||+.+++|+.+++||
T Consensus 1 pR~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp 47 (47)
T PF01344_consen 1 PRSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP 47 (47)
T ss_dssp -BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred CCccCEEEEECCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence 57889999999999999999876788999999999999999999886
No 21
>KOG0379|consensus
Probab=99.60 E-value=1.1e-14 Score=92.35 Aligned_cols=91 Identities=23% Similarity=0.462 Sum_probs=78.0
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
||||.+......+++++||+++.+|.++. ..|.||.+|++++.+++++++||.+.....+++++.+|..+.+|+.+.
T Consensus 178 vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~P~pR~gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~ 257 (482)
T KOG0379|consen 178 VFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEAPSPRYGHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLP 257 (482)
T ss_pred EECCccCcccceeeeeeeccccccceecccCCCCCCCCCCceEEEECCeEEEEeccccCCceecceEeeecccceeeecc
Confidence 68999865556777999999999999985 457899999999999999999998744778999999999999999764
Q ss_pred ---CCCCCccceeeEEc
Q psy2980 78 ---HLPSARSWLGCVPL 91 (92)
Q Consensus 78 ---~~~~~~~~~~~~~~ 91 (92)
..|.+|+.|..+..
T Consensus 258 ~~g~~p~~R~~h~~~~~ 274 (482)
T KOG0379|consen 258 TGGDLPSPRSGHSLTVS 274 (482)
T ss_pred ccCCCCCCcceeeeEEE
Confidence 56889999887753
No 22
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.59 E-value=5.9e-15 Score=91.18 Aligned_cols=92 Identities=17% Similarity=0.189 Sum_probs=67.2
Q ss_pred CccccCCCCCCcceEE--EeeCCCCeEEeeCCCCCCccc--------ceEEEECCEEEEEcCcCCCC-------------
Q psy2980 1 MLRGWHASTENTNRVE--CYHIAENTWEYKSPMKEKRYR--------PGIAVIDGKIYVLGGEEGWD------------- 57 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~--~yd~~~~~W~~~~~~~~~~~~--------~~~~~~~~~iy~~GG~~~~~------------- 57 (92)
|+||.......+.+++ .||+++++|..+++||.+|.. +.+++++++||++||.....
T Consensus 226 v~GG~~~~~~~~~~~~~~~~~~~~~~W~~~~~~p~~~~~~~~~~~~~~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~ 305 (376)
T PRK14131 226 LINGEIKPGLRTDAVKQGKFTGNNLKWQKLPDLPPAPGGSSQEGVAGAFAGYSNGVLLVAGGANFPGARENYQNGKLYAH 305 (376)
T ss_pred EEeeeECCCcCChhheEEEecCCCcceeecCCCCCCCcCCcCCccceEeceeECCEEEEeeccCCCCChhhhhcCCcccc
Confidence 4788654334444454 557789999999999877632 22567899999999975311
Q ss_pred ---CccceEEEEeCCCCceEEcCCCCCCccceeeEEcC
Q psy2980 58 ---GYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPLQ 92 (92)
Q Consensus 58 ---~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~~ 92 (92)
.....+++||+.+++|+.+++||.+|..+++++++
T Consensus 306 ~~~~~~~~~e~yd~~~~~W~~~~~lp~~r~~~~av~~~ 343 (376)
T PRK14131 306 EGLKKSWSDEIYALVNGKWQKVGELPQGLAYGVSVSWN 343 (376)
T ss_pred cCCcceeehheEEecCCcccccCcCCCCccceEEEEeC
Confidence 01235789999999999999999999988877653
No 23
>KOG4693|consensus
Probab=99.57 E-value=4.8e-15 Score=87.09 Aligned_cols=91 Identities=16% Similarity=0.293 Sum_probs=74.2
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcCC-CCCccceEEEEeCCCCceEEc
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEEG-WDGYHDSIECYDVDNDSWEIM 76 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~~-~~~~~~~~~~y~~~~~~W~~~ 76 (92)
|+||.+.....++.+++||++++.|.+.. ..|..|.+|++|++++..|+|||+.. ....+++++.+|..+-+|+.+
T Consensus 93 vWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~gn~MyiFGGye~~a~~FS~d~h~ld~~TmtWr~~ 172 (392)
T KOG4693|consen 93 VWGGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVWGNQMYIFGGYEEDAQRFSQDTHVLDFATMTWREM 172 (392)
T ss_pred EEcCccCcccccceeeeeccccccccccceeeecCCccCCceeeEECcEEEEecChHHHHHhhhccceeEeccceeeeeh
Confidence 57998865666777999999999997764 47889999999999999999999943 467889999999999999998
Q ss_pred CCC---CCCccceeeEEc
Q psy2980 77 SHL---PSARSWLGCVPL 91 (92)
Q Consensus 77 ~~~---~~~~~~~~~~~~ 91 (92)
... |.=|..|+++++
T Consensus 173 ~Tkg~PprwRDFH~a~~~ 190 (392)
T KOG4693|consen 173 HTKGDPPRWRDFHTASVI 190 (392)
T ss_pred hccCCCchhhhhhhhhhc
Confidence 654 333556666554
No 24
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=99.57 E-value=2.4e-14 Score=64.87 Aligned_cols=48 Identities=25% Similarity=0.522 Sum_probs=43.5
Q ss_pred CCEEEEEcCcC-CCCCccceEEEEeCCCCceEEcCCCCCCccceeeEEc
Q psy2980 44 DGKIYVLGGEE-GWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPL 91 (92)
Q Consensus 44 ~~~iy~~GG~~-~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~ 91 (92)
++++|++||.. .....+++++.||+.+++|++++++|.+|..|+++++
T Consensus 1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i 49 (49)
T PF13415_consen 1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI 49 (49)
T ss_pred CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence 47899999997 3477899999999999999999999999999999875
No 25
>KOG4693|consensus
Probab=99.56 E-value=2.2e-14 Score=84.40 Aligned_cols=91 Identities=22% Similarity=0.456 Sum_probs=75.0
Q ss_pred CccccCCC--------CCCcceEEEeeCCCCeEEeeCC---CCCCcccceEEEECCEEEEEcCcCC-CCCccceEEEEeC
Q psy2980 1 MLRGWHAS--------TENTNRVECYHIAENTWEYKSP---MKEKRYRPGIAVIDGKIYVLGGEEG-WDGYHDSIECYDV 68 (92)
Q Consensus 1 v~GG~~~~--------~~~~~~v~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~iy~~GG~~~-~~~~~~~~~~y~~ 68 (92)
||||...+ ...++++..+|..|..|...+. .|.+|..|+..++++++|+|||+++ .+.-.+++++|||
T Consensus 196 iFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRRSHS~fvYng~~Y~FGGYng~ln~HfndLy~FdP 275 (392)
T KOG4693|consen 196 IFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPGGRRSHSTFVYNGKMYMFGGYNGTLNVHFNDLYCFDP 275 (392)
T ss_pred EeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCCcccccceEEEcceEEEecccchhhhhhhcceeeccc
Confidence 57886532 2456789999999999988864 4788999999999999999999976 3567899999999
Q ss_pred CCCceEEcC---CCCCCccceeeEEc
Q psy2980 69 DNDSWEIMS---HLPSARSWLGCVPL 91 (92)
Q Consensus 69 ~~~~W~~~~---~~~~~~~~~~~~~~ 91 (92)
++..|+.+. .-|.+|+.+.+++.
T Consensus 276 ~t~~W~~I~~~Gk~P~aRRRqC~~v~ 301 (392)
T KOG4693|consen 276 KTSMWSVISVRGKYPSARRRQCSVVS 301 (392)
T ss_pred ccchheeeeccCCCCCcccceeEEEE
Confidence 999999885 45788888877765
No 26
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=99.51 E-value=1.3e-13 Score=62.40 Aligned_cols=47 Identities=30% Similarity=0.630 Sum_probs=42.0
Q ss_pred CcccceEEEECCEEEEEcCc--CCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 34 KRYRPGIAVIDGKIYVLGGE--EGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 34 ~~~~~~~~~~~~~iy~~GG~--~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
+|..|++++++++||++||+ .......++++.||+.+++|+.+++|+
T Consensus 1 ~r~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g 49 (49)
T PF07646_consen 1 PRYGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG 49 (49)
T ss_pred CccceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence 57889999999999999999 344678999999999999999998875
No 27
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=99.50 E-value=5.6e-14 Score=63.63 Aligned_cols=47 Identities=26% Similarity=0.630 Sum_probs=31.8
Q ss_pred CcccceEEEE-CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 34 KRYRPGIAVI-DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 34 ~~~~~~~~~~-~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
||..|+++.. ++.||++||.+.....+++++.||+.+++|++++++|
T Consensus 1 pR~~h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P 48 (49)
T PF13418_consen 1 PRYGHSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP 48 (49)
T ss_dssp --BS-EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred CcceEEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence 5888999988 5899999999875678999999999999999998887
No 28
>smart00612 Kelch Kelch domain.
Probab=99.48 E-value=1.2e-13 Score=61.69 Aligned_cols=46 Identities=35% Similarity=0.695 Sum_probs=41.0
Q ss_pred EEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCccceeeEEcC
Q psy2980 46 KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPLQ 92 (92)
Q Consensus 46 ~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~~ 92 (92)
+||++||... ....+++++||+.+++|+.+++|+.+|..+++++++
T Consensus 1 ~iyv~GG~~~-~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~ 46 (47)
T smart00612 1 KIYVVGGFDG-GQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVIN 46 (47)
T ss_pred CEEEEeCCCC-CceeeeEEEECCCCCeEccCCCCCCccccceEEEeC
Confidence 4899999865 567899999999999999999999999999988864
No 29
>KOG1230|consensus
Probab=99.43 E-value=7.5e-13 Score=81.56 Aligned_cols=78 Identities=15% Similarity=0.245 Sum_probs=66.3
Q ss_pred ceEEEeeCCCCeEEeeC--CCCCCcccceEEEECCEEEEEcCcCC---CCCccceEEEEeCCCCceEEcCC-C--CCCcc
Q psy2980 13 NRVECYHIAENTWEYKS--PMKEKRYRPGIAVIDGKIYVLGGEEG---WDGYHDSIECYDVDNDSWEIMSH-L--PSARS 84 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~--~~~~~~~~~~~~~~~~~iy~~GG~~~---~~~~~~~~~~y~~~~~~W~~~~~-~--~~~~~ 84 (92)
+++|.||.++++|.++. .-|.+|.+|.+++...+|++|||..+ ...+++++++||..+-+|.++.+ - |.||+
T Consensus 154 kD~W~fd~~trkweql~~~g~PS~RSGHRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRS 233 (521)
T KOG1230|consen 154 KDLWLFDLKTRKWEQLEFGGGPSPRSGHRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRS 233 (521)
T ss_pred hheeeeeeccchheeeccCCCCCCCccceeEEeeeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCC
Confidence 56999999999999995 35889999999999999999999843 34589999999999999999854 2 77888
Q ss_pred ceeeEE
Q psy2980 85 WLGCVP 90 (92)
Q Consensus 85 ~~~~~~ 90 (92)
++.+.+
T Consensus 234 Gcq~~v 239 (521)
T KOG1230|consen 234 GCQFSV 239 (521)
T ss_pred cceEEe
Confidence 776544
No 30
>KOG1230|consensus
Probab=99.42 E-value=1e-12 Score=80.98 Aligned_cols=79 Identities=20% Similarity=0.325 Sum_probs=64.9
Q ss_pred ceEEEeeCCCCeEEee--CCCCCCcccceEEEEC-CEEEEEcCcCC-CCC----ccceEEEEeCCCCceEEc--CCCCCC
Q psy2980 13 NRVECYHIAENTWEYK--SPMKEKRYRPGIAVID-GKIYVLGGEEG-WDG----YHDSIECYDVDNDSWEIM--SHLPSA 82 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~--~~~~~~~~~~~~~~~~-~~iy~~GG~~~-~~~----~~~~~~~y~~~~~~W~~~--~~~~~~ 82 (92)
++++.||+++++|..+ |+.|.||..|.++++. +.+|++||.-. .++ -.++++.||..+++|+++ +.-|.+
T Consensus 98 ndLy~Yn~k~~eWkk~~spn~P~pRsshq~va~~s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~ 177 (521)
T KOG1230|consen 98 NDLYSYNTKKNEWKKVVSPNAPPPRSSHQAVAVPSNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSP 177 (521)
T ss_pred eeeeEEeccccceeEeccCCCcCCCccceeEEeccCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCC
Confidence 5699999999999888 4568899998888875 88999999732 122 368999999999999998 466899
Q ss_pred ccceeeEEc
Q psy2980 83 RSWLGCVPL 91 (92)
Q Consensus 83 ~~~~~~~~~ 91 (92)
|++|-+++-
T Consensus 178 RSGHRMvaw 186 (521)
T KOG1230|consen 178 RSGHRMVAW 186 (521)
T ss_pred CccceeEEe
Confidence 999988763
No 31
>smart00612 Kelch Kelch domain.
Probab=99.23 E-value=2.2e-11 Score=54.25 Aligned_cols=44 Identities=30% Similarity=0.545 Sum_probs=36.7
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG 45 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~ 45 (92)
|+||.+. ....+++++||+.+++|..+++|+.+|..++++++++
T Consensus 4 v~GG~~~-~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g 47 (47)
T smart00612 4 VVGGFDG-GQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47 (47)
T ss_pred EEeCCCC-CceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence 5799864 3345679999999999999999999999998887764
No 32
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=99.23 E-value=2.5e-11 Score=54.89 Aligned_cols=43 Identities=21% Similarity=0.360 Sum_probs=37.0
Q ss_pred CccccC-CCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEE
Q psy2980 1 MLRGWH-ASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVI 43 (92)
Q Consensus 1 v~GG~~-~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~ 43 (92)
|+||.+ ......+++++||+.+++|++++++|.+|..|+++++
T Consensus 6 vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i 49 (49)
T PF13415_consen 6 VFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI 49 (49)
T ss_pred EECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence 689988 3455667899999999999999999999999988763
No 33
>KOG4152|consensus
Probab=99.18 E-value=1.2e-10 Score=74.03 Aligned_cols=90 Identities=22% Similarity=0.324 Sum_probs=72.0
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcC-------------CCCCccceEE
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEE-------------GWDGYHDSIE 64 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~-------------~~~~~~~~~~ 64 (92)
|+||..+-.+ .++|.+|+++-+|.+.. -.|.||..|++.++++++|+|||.- .+....+++-
T Consensus 220 vyGGM~G~RL--gDLW~Ldl~Tl~W~kp~~~G~~PlPRSLHsa~~IGnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~ 297 (830)
T KOG4152|consen 220 VYGGMSGCRL--GDLWTLDLDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLA 297 (830)
T ss_pred EEcccccccc--cceeEEecceeecccccccCCCCCCcccccceeecceeEEecceeeeeccccccccccceeeecccee
Confidence 4788765444 44999999999998874 3578999999999999999999961 1234678899
Q ss_pred EEeCCCCceEEcC-------CCCCCccceeeEEcC
Q psy2980 65 CYDVDNDSWEIMS-------HLPSARSWLGCVPLQ 92 (92)
Q Consensus 65 ~y~~~~~~W~~~~-------~~~~~~~~~~~~~~~ 92 (92)
++|..+..|..+. ..|.+|++|.+++++
T Consensus 298 clNldt~~W~tl~~d~~ed~tiPR~RAGHCAvAig 332 (830)
T KOG4152|consen 298 CLNLDTMAWETLLMDTLEDNTIPRARAGHCAVAIG 332 (830)
T ss_pred eeeecchheeeeeeccccccccccccccceeEEec
Confidence 9999999999762 268899999888764
No 34
>PF13964 Kelch_6: Kelch motif
Probab=99.16 E-value=4.5e-11 Score=54.19 Aligned_cols=35 Identities=37% Similarity=0.625 Sum_probs=30.1
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCc
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKR 35 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~ 35 (92)
|+||.......++++++||+.+++|+++++||.+|
T Consensus 16 v~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR 50 (50)
T PF13964_consen 16 VFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR 50 (50)
T ss_pred EECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence 58998754556777999999999999999999887
No 35
>PF13854 Kelch_5: Kelch motif
Probab=99.13 E-value=3.8e-10 Score=49.39 Aligned_cols=40 Identities=25% Similarity=0.581 Sum_probs=35.5
Q ss_pred CCCCcccceEEEECCEEEEEcCcCC-CCCccceEEEEeCCC
Q psy2980 31 MKEKRYRPGIAVIDGKIYVLGGEEG-WDGYHDSIECYDVDN 70 (92)
Q Consensus 31 ~~~~~~~~~~~~~~~~iy~~GG~~~-~~~~~~~~~~y~~~~ 70 (92)
+|.+|..|++++++++||++||... ....+++++++|..+
T Consensus 1 ~P~~R~~hs~~~~~~~iyi~GG~~~~~~~~~~d~~~l~l~s 41 (42)
T PF13854_consen 1 IPSPRYGHSAVVVGNNIYIFGGYSGNNNSYSNDLYVLDLPS 41 (42)
T ss_pred CCCCccceEEEEECCEEEEEcCccCCCCCEECcEEEEECCC
Confidence 4788999999999999999999983 577899999999865
No 36
>PLN02772 guanylate kinase
Probab=98.96 E-value=6.4e-09 Score=64.68 Aligned_cols=60 Identities=20% Similarity=0.283 Sum_probs=52.3
Q ss_pred CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEc---CCCCCCccceeeEEcC
Q psy2980 33 EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM---SHLPSARSWLGCVPLQ 92 (92)
Q Consensus 33 ~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~---~~~~~~~~~~~~~~~~ 92 (92)
.++..++++++++++|++||.++.....+.+++||+.+.+|..- .+.|.+|.+|++|+++
T Consensus 23 ~~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~ 85 (398)
T PLN02772 23 KPKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLN 85 (398)
T ss_pred CCCCcceeEEECCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEEC
Confidence 46788999999999999999887555789999999999999864 5779999999999873
No 37
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.92 E-value=3.4e-09 Score=64.57 Aligned_cols=90 Identities=19% Similarity=0.320 Sum_probs=65.5
Q ss_pred CccccCCCC----CCcceEEEeeCCCCeEEeeCC-CCCCcccceEEEECC-EEEEEcCcCCC--C---------------
Q psy2980 1 MLRGWHAST----ENTNRVECYHIAENTWEYKSP-MKEKRYRPGIAVIDG-KIYVLGGEEGW--D--------------- 57 (92)
Q Consensus 1 v~GG~~~~~----~~~~~v~~yd~~~~~W~~~~~-~~~~~~~~~~~~~~~-~iy~~GG~~~~--~--------------- 57 (92)
+|||..... ...+++++|||.+|+|+++.. .|.....++++.+++ +||++||++.. +
T Consensus 97 vFgG~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~ 176 (381)
T COG3055 97 VFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEA 176 (381)
T ss_pred EeeccccCCCCCceEeeeeEEecCCCChhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHH
Confidence 467765222 233669999999999999974 577778888888887 89999998521 0
Q ss_pred ----------------CccceEEEEeCCCCceEEcCCCCCCccceeeEE
Q psy2980 58 ----------------GYHDSIECYDVDNDSWEIMSHLPSARSWLGCVP 90 (92)
Q Consensus 58 ----------------~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~ 90 (92)
.....+..|+|++++|+.+...|......++++
T Consensus 177 ~~~i~~~yf~~~~~dy~~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~~ 225 (381)
T COG3055 177 VDKIIAHYFDKKAEDYFFNKEVLSYDPSTNQWRNLGENPFYGNAGSAVV 225 (381)
T ss_pred HHHHHHHHhCCCHHHhcccccccccccccchhhhcCcCcccCccCccee
Confidence 134678889999999999998776544434433
No 38
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=98.85 E-value=1.5e-09 Score=48.42 Aligned_cols=32 Identities=38% Similarity=0.696 Sum_probs=28.1
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMK 32 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~ 32 (92)
|+||.+.....++++++||+.+++|..+++||
T Consensus 16 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp 47 (47)
T PF01344_consen 16 VIGGYDGNNQPTNSVEVYDPETNTWEELPPMP 47 (47)
T ss_dssp EEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred EEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence 58999876677888999999999999999876
No 39
>KOG4152|consensus
Probab=98.84 E-value=1.6e-09 Score=69.06 Aligned_cols=89 Identities=16% Similarity=0.253 Sum_probs=69.4
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEee---CCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYK---SPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~---~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
||||-+ ....+++..||..+++|..- .++|.+...++.+..+.+||+|||.-..+.+.+++|-..-..-.|+++.
T Consensus 47 iFGGGN--EGiiDELHvYNTatnqWf~PavrGDiPpgcAA~GfvcdGtrilvFGGMvEYGkYsNdLYELQasRWeWkrlk 124 (830)
T KOG4152|consen 47 IFGGGN--EGIIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVCDGTRILVFGGMVEYGKYSNDLYELQASRWEWKRLK 124 (830)
T ss_pred EecCCc--ccchhhhhhhccccceeecchhcCCCCCchhhcceEecCceEEEEccEeeeccccchHHHhhhhhhhHhhcC
Confidence 578854 33556799999999999654 4788899999999999999999998777888999866655555677763
Q ss_pred -------CCCCCccceeeEEc
Q psy2980 78 -------HLPSARSWLGCVPL 91 (92)
Q Consensus 78 -------~~~~~~~~~~~~~~ 91 (92)
+.|-+|.+|+...+
T Consensus 125 p~~p~nG~pPCPRlGHSFsl~ 145 (830)
T KOG4152|consen 125 PKTPKNGPPPCPRLGHSFSLV 145 (830)
T ss_pred CCCCCCCCCCCCccCceeEEe
Confidence 35778999987654
No 40
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.74 E-value=4.5e-08 Score=59.84 Aligned_cols=75 Identities=21% Similarity=0.348 Sum_probs=55.6
Q ss_pred EEEeeCCC--CeEEeeCCCCC-CcccceEEEECCEEEEEcCcCCC----CCccceEEEEeCCCCceEEcCCC-CCCccce
Q psy2980 15 VECYHIAE--NTWEYKSPMKE-KRYRPGIAVIDGKIYVLGGEEGW----DGYHDSIECYDVDNDSWEIMSHL-PSARSWL 86 (92)
Q Consensus 15 v~~yd~~~--~~W~~~~~~~~-~~~~~~~~~~~~~iy~~GG~~~~----~~~~~~~~~y~~~~~~W~~~~~~-~~~~~~~ 86 (92)
.+..|++. ..|.++...|- +|-....++++++||+|||.... .+..+++++|||++++|+++... |..-..+
T Consensus 60 fy~ldL~~~~k~W~~~a~FpG~~rnqa~~a~~~~kLyvFgG~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~ 139 (381)
T COG3055 60 FYVLDLKKPGKGWTKIADFPGGARNQAVAAVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGA 139 (381)
T ss_pred ceehhhhcCCCCceEcccCCCcccccchheeeCCeEEEeeccccCCCCCceEeeeeEEecCCCChhheeccccccccccc
Confidence 56666653 47999998874 67778888899999999998432 34679999999999999998533 4443333
Q ss_pred eeE
Q psy2980 87 GCV 89 (92)
Q Consensus 87 ~~~ 89 (92)
.++
T Consensus 140 ~~~ 142 (381)
T COG3055 140 STF 142 (381)
T ss_pred eeE
Confidence 333
No 41
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.69 E-value=7.4e-09 Score=46.60 Aligned_cols=32 Identities=25% Similarity=0.402 Sum_probs=20.1
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeCCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKSPMK 32 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~ 32 (92)
|+||.+......++++.||+.+++|.+++++|
T Consensus 17 v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P 48 (49)
T PF13418_consen 17 VFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP 48 (49)
T ss_dssp EE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred EECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence 58998866567777999999999999998776
No 42
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.46 E-value=3.2e-07 Score=41.22 Aligned_cols=32 Identities=25% Similarity=0.509 Sum_probs=26.1
Q ss_pred Ccccc--CCCCCCcceEEEeeCCCCeEEeeCCCC
Q psy2980 1 MLRGW--HASTENTNRVECYHIAENTWEYKSPMK 32 (92)
Q Consensus 1 v~GG~--~~~~~~~~~v~~yd~~~~~W~~~~~~~ 32 (92)
|+||. +.....++++++||+++++|..++.++
T Consensus 16 v~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g 49 (49)
T PF07646_consen 16 VFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG 49 (49)
T ss_pred EECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence 58998 344566678999999999999998764
No 43
>PLN02772 guanylate kinase
Probab=98.43 E-value=1.8e-06 Score=54.07 Aligned_cols=54 Identities=15% Similarity=0.165 Sum_probs=43.6
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEE-CCEEEEEcCcC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVI-DGKIYVLGGEE 54 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~-~~~iy~~GG~~ 54 (92)
||||.+.....++.+++||..+++|..-. ..|.+|.+|+++++ +++|+++++..
T Consensus 39 v~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~ 96 (398)
T PLN02772 39 VIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGS 96 (398)
T ss_pred EEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCC
Confidence 58887654446778999999999997764 57899999999998 57888888653
No 44
>KOG2437|consensus
Probab=98.28 E-value=2.5e-07 Score=59.19 Aligned_cols=90 Identities=18% Similarity=0.316 Sum_probs=66.0
Q ss_pred CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEE--CCEEEEEcCcCC-----CCCccceEEEEeCCC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVI--DGKIYVLGGEEG-----WDGYHDSIECYDVDN 70 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~--~~~iy~~GG~~~-----~~~~~~~~~~y~~~~ 70 (92)
+.||+++- ....++|.|+...+.|..+- ..|-.|..|..+.. +.++|++|-+-+ .....++++.||..+
T Consensus 277 LYGGWdG~-~~l~DFW~Y~v~e~~W~~iN~~t~~PG~RsCHRMVid~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~ 355 (723)
T KOG2437|consen 277 LYGGWDGT-QDLADFWAYSVKENQWTCINRDTEGPGARSCHRMVIDISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDT 355 (723)
T ss_pred EecCcccc-hhHHHHHhhcCCcceeEEeecCCCCCcchhhhhhhhhhhHhHHhhhhhccccccccccccccceEEEecCC
Confidence 35888742 23344999999999998883 46788888877775 458999997732 124578999999999
Q ss_pred CceEEcCC------CCCCccceeeEEc
Q psy2980 71 DSWEIMSH------LPSARSWLGCVPL 91 (92)
Q Consensus 71 ~~W~~~~~------~~~~~~~~~~~~~ 91 (92)
+.|.-+.- -|...+.|.+++.
T Consensus 356 ~~W~~ls~dt~~dGGP~~vfDHqM~Vd 382 (723)
T KOG2437|consen 356 NTWMLLSEDTAADGGPKLVFDHQMCVD 382 (723)
T ss_pred ceeEEecccccccCCcceeecceeeEe
Confidence 99998742 2556677777764
No 45
>KOG2437|consensus
Probab=97.74 E-value=0.00013 Score=47.27 Aligned_cols=65 Identities=20% Similarity=0.376 Sum_probs=51.1
Q ss_pred ceEEEeeCCCCeEEeeCC------CCCCcccceEEEECCE--EEEEcCcCC--CCCccceEEEEeCCCCceEEcC
Q psy2980 13 NRVECYHIAENTWEYKSP------MKEKRYRPGIAVIDGK--IYVLGGEEG--WDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~------~~~~~~~~~~~~~~~~--iy~~GG~~~--~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
+++|.||..++.|..+.- =|...+.|..++..++ ||++||+.- +......++.||.....|+.+.
T Consensus 346 sDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~~~~~e~~f~GLYaf~~~~~~w~~l~ 420 (723)
T KOG2437|consen 346 SDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSEKHMIYVFGGRILTCNEPQFSGLYAFNCQCQTWKLLR 420 (723)
T ss_pred cceEEEecCCceeEEecccccccCCcceeecceeeEecCcceEEEecCeeccCCCccccceEEEecCCccHHHHH
Confidence 479999999999988842 1444577888888776 999999842 2346788999999999998764
No 46
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=97.34 E-value=0.004 Score=36.41 Aligned_cols=68 Identities=18% Similarity=0.300 Sum_probs=44.7
Q ss_pred ceEEEeeCCCCeEEeeCCCCCC-cccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEK-RYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS 81 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~-~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~ 81 (92)
..+++|+..+++|+.+...+.. ......+.+++.+|-+..... ......+..||..+++|+...++|.
T Consensus 70 ~~~~Vys~~~~~Wr~~~~~~~~~~~~~~~v~~~G~lyw~~~~~~-~~~~~~IvsFDl~~E~f~~~i~~P~ 138 (230)
T TIGR01640 70 SEHQVYTLGSNSWRTIECSPPHHPLKSRGVCINGVLYYLAYTLK-TNPDYFIVSFDVSSERFKEFIPLPC 138 (230)
T ss_pred ccEEEEEeCCCCccccccCCCCccccCCeEEECCEEEEEEEECC-CCCcEEEEEEEcccceEeeeeecCc
Confidence 4689999999999998743321 112226678888888764321 1112379999999999996224443
No 47
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=97.05 E-value=0.0067 Score=36.92 Aligned_cols=76 Identities=13% Similarity=0.209 Sum_probs=51.5
Q ss_pred ccccC--CCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEE-CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980 2 LRGWH--ASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVI-DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH 78 (92)
Q Consensus 2 ~GG~~--~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~ 78 (92)
+||-- ...+++..+..||+.+.+|.....--.+.. ..+... ++++|+.|-..........+-.||..+.+|..+..
T Consensus 3 VGG~F~~aGsL~C~~lC~yd~~~~qW~~~g~~i~G~V-~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~ 81 (281)
T PF12768_consen 3 VGGSFTSAGSLPCPGLCLYDTDNSQWSSPGNGISGTV-TDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGG 81 (281)
T ss_pred EeeecCCCCCcCCCEEEEEECCCCEeecCCCCceEEE-EEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCC
Confidence 66733 233467789999999999999875322221 222223 67888887554323346778999999999988865
No 48
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=96.15 E-value=0.044 Score=32.81 Aligned_cols=74 Identities=16% Similarity=0.243 Sum_probs=48.0
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCC----CceEEcC-CCCCCccceee
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN----DSWEIMS-HLPSARSWLGC 88 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~----~~W~~~~-~~~~~~~~~~~ 88 (92)
.-..||+.++++..+.-..-.-....+..-++.+.+.||..+ -.+.+..|+|.. ..|.+.. .|..+|=..++
T Consensus 47 ~s~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~---G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~ 123 (243)
T PF07250_consen 47 HSVEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDND---GNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTA 123 (243)
T ss_pred EEEEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCc---cccceEEEecCCCCCCCCceECcccccCCCccccc
Confidence 357899999999888643222222223334778999999743 245677788865 5798875 57777766555
Q ss_pred EE
Q psy2980 89 VP 90 (92)
Q Consensus 89 ~~ 90 (92)
..
T Consensus 124 ~~ 125 (243)
T PF07250_consen 124 TT 125 (243)
T ss_pred eE
Confidence 44
No 49
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=95.85 E-value=0.11 Score=32.50 Aligned_cols=71 Identities=15% Similarity=0.222 Sum_probs=46.3
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCc----cceEEE--EeC--------CCCceEEcCCC
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGY----HDSIEC--YDV--------DNDSWEIMSHL 79 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~----~~~~~~--y~~--------~~~~W~~~~~~ 79 (92)
....||+.+..-...|.+..+.....++.+++.||++......... ...++. |++ ..-.|+.+++.
T Consensus 87 ~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~l~~~~~~~~~~~~~~w~W~~LP~P 166 (342)
T PF07893_consen 87 RTLVYDTDTRAVATGPRLHSPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEALVYRPPPDDPSPEESWSWRSLPPP 166 (342)
T ss_pred CeEEEECCCCeEeccCCCCCCCcceEEEEeCCeEEEeeccCccccccCccceeEEEeccccccccccCCCcceEEcCCCC
Confidence 4789999999988888887777666677789999999876431111 013333 442 22258888776
Q ss_pred CCCcc
Q psy2980 80 PSARS 84 (92)
Q Consensus 80 ~~~~~ 84 (92)
|..+.
T Consensus 167 Pf~~~ 171 (342)
T PF07893_consen 167 PFVRD 171 (342)
T ss_pred Ccccc
Confidence 65443
No 50
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=95.62 E-value=0.039 Score=32.99 Aligned_cols=50 Identities=18% Similarity=0.479 Sum_probs=37.4
Q ss_pred ccccCCCCCCcceEEEeeCCC----CeEEeeC-CCCCCcccceEEEE-CCEEEEEcCcC
Q psy2980 2 LRGWHASTENTNRVECYHIAE----NTWEYKS-PMKEKRYRPGIAVI-DGKIYVLGGEE 54 (92)
Q Consensus 2 ~GG~~~~~~~~~~v~~yd~~~----~~W~~~~-~~~~~~~~~~~~~~-~~~iy~~GG~~ 54 (92)
.||... ... .+..|++.+ ..|.+.+ .|..+|...++..+ +++++++||..
T Consensus 83 tGG~~~-G~~--~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L~DG~vlIvGG~~ 138 (243)
T PF07250_consen 83 TGGDND-GNK--AIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTLPDGRVLIVGGSN 138 (243)
T ss_pred eCCCCc-ccc--ceEEEecCCCCCCCCceECcccccCCCccccceECCCCCEEEEeCcC
Confidence 566642 333 378888876 5798876 58999999888886 67899999974
No 51
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=95.09 E-value=0.33 Score=28.40 Aligned_cols=65 Identities=14% Similarity=0.301 Sum_probs=39.0
Q ss_pred ceEEEeeCCCCeEEeeCCCCCCcccc----eEEEEC----C-EEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEKRYRP----GIAVID----G-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH 78 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~~~~~----~~~~~~----~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~ 78 (92)
..+..+||.|++|..+|.++.++... ....++ + ++..+..... ......++.|+..++.|+.+..
T Consensus 14 ~~~~V~NP~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~-~~~~~~~~Vys~~~~~Wr~~~~ 87 (230)
T TIGR01640 14 KRLVVWNPSTGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSG-NRNQSEHQVYTLGSNSWRTIEC 87 (230)
T ss_pred CcEEEECCCCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecC-CCCCccEEEEEeCCCCcccccc
Confidence 45899999999999998654421111 111122 1 2333322111 1234578899999999999864
No 52
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=93.09 E-value=0.37 Score=29.79 Aligned_cols=54 Identities=19% Similarity=0.330 Sum_probs=34.6
Q ss_pred CccccCCCCCCcceEEEeeCCCCe--------EE---eeCCCCCCcccceEEEEC--C--EEEEEcCcC
Q psy2980 1 MLRGWHASTENTNRVECYHIAENT--------WE---YKSPMKEKRYRPGIAVID--G--KIYVLGGEE 54 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~--------W~---~~~~~~~~~~~~~~~~~~--~--~iy~~GG~~ 54 (92)
|-||...+..-+++++........ .. .+..+|.+|++|++-++. + ...+|||..
T Consensus 43 IHGGrTPNNElS~~LY~ls~~s~~cNkK~tl~C~EKeLvGdvP~aRYGHt~~vV~SrGKta~VlFGGRS 111 (337)
T PF03089_consen 43 IHGGRTPNNELSSSLYILSVDSRGCNKKVTLCCQEKELVGDVPEARYGHTINVVHSRGKTACVLFGGRS 111 (337)
T ss_pred ecCCcCCCcccccceEEEEeecCCCCceeEEEEecceecCCCCcccccceEEEEEECCcEEEEEECCcc
Confidence 457877555556667776654332 21 125689999999887753 2 378889874
No 53
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=93.02 E-value=0.84 Score=28.74 Aligned_cols=54 Identities=24% Similarity=0.409 Sum_probs=34.8
Q ss_pred CCeEEeeCCCCCCccc-------ceEEEE-CCEEEEE-cCcCCCCCccceEEEEeCCCCceEEcC--CCCC
Q psy2980 22 ENTWEYKSPMKEKRYR-------PGIAVI-DGKIYVL-GGEEGWDGYHDSIECYDVDNDSWEIMS--HLPS 81 (92)
Q Consensus 22 ~~~W~~~~~~~~~~~~-------~~~~~~-~~~iy~~-GG~~~~~~~~~~~~~y~~~~~~W~~~~--~~~~ 81 (92)
.-.|+.+|++|..... .+-+++ +..||+- -+. ....++||+.+.+|++.. .||+
T Consensus 157 ~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~------~~GTysfDt~~~~W~~~GdW~LPF 221 (342)
T PF07893_consen 157 SWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGR------RWGTYSFDTESHEWRKHGDWMLPF 221 (342)
T ss_pred cceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCC------ceEEEEEEcCCcceeeccceecCc
Confidence 3368999876654432 244455 6667773 221 135899999999999986 4553
No 54
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=91.77 E-value=1.2 Score=23.80 Aligned_cols=66 Identities=9% Similarity=0.186 Sum_probs=42.3
Q ss_pred CcceEEEeeCCCCeEEeeCCC---CCCcccceEEEECCEEEEEcCcCCCCCccceEEEE-eCCCCceEEc
Q psy2980 11 NTNRVECYHIAENTWEYKSPM---KEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECY-DVDNDSWEIM 76 (92)
Q Consensus 11 ~~~~v~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y-~~~~~~W~~~ 76 (92)
....+.+||..+.+|..++.+ ........++.++++|-++.-........-+++.. |-.+.+|.+.
T Consensus 18 ~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~~~iWvLeD~~k~~Wsk~ 87 (129)
T PF08268_consen 18 DNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGEPDSIDIWVLEDYEKQEWSKK 87 (129)
T ss_pred CCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCCcceEEEEEeeccccceEEEE
Confidence 345699999999999988653 23445567778899987775332211112345544 3456789865
No 55
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=88.64 E-value=4 Score=25.04 Aligned_cols=52 Identities=15% Similarity=0.291 Sum_probs=37.3
Q ss_pred ceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN 70 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~ 70 (92)
+.+..||+.+++-....++|..-.+=+++.++++||..--. ....+.||+.+
T Consensus 68 S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLTWk------~~~~f~yd~~t 119 (264)
T PF05096_consen 68 SSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLTWK------EGTGFVYDPNT 119 (264)
T ss_dssp EEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEESS------SSEEEEEETTT
T ss_pred EEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEEec------CCeEEEEcccc
Confidence 46999999999877666777766777899999999998543 34566677653
No 56
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=88.34 E-value=2.8 Score=26.17 Aligned_cols=59 Identities=8% Similarity=0.203 Sum_probs=38.9
Q ss_pred ceEEEeeCCCCeEEeeCCC-CCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 13 NRVECYHIAENTWEYKSPM-KEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~-~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
..+.+|||.+.+|.+-+-+ ..+|....-+--.+++++- ....+.+.+|||.+.+...++
T Consensus 254 g~l~rfdPs~~sW~eypLPgs~arpys~rVD~~grVW~s------ea~agai~rfdpeta~ftv~p 313 (353)
T COG4257 254 GSLHRFDPSVTSWIEYPLPGSKARPYSMRVDRHGRVWLS------EADAGAIGRFDPETARFTVLP 313 (353)
T ss_pred ceeeEeCcccccceeeeCCCCCCCcceeeeccCCcEEee------ccccCceeecCcccceEEEec
Confidence 4589999999999877532 2334332222234567763 223678999999999888763
No 57
>smart00284 OLF Olfactomedin-like domains.
Probab=80.79 E-value=7 Score=23.89 Aligned_cols=45 Identities=16% Similarity=0.294 Sum_probs=32.3
Q ss_pred CCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980 31 MKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS 81 (92)
Q Consensus 31 ~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~ 81 (92)
+|.+-.+.+.+++++.+|.-- .....+.+||..+.+-.....+|.
T Consensus 70 Lp~~~~GtG~VVYngslYY~~------~~s~~iiKydL~t~~v~~~~~Lp~ 114 (255)
T smart00284 70 LPHAGQGTGVVVYNGSLYFNK------FNSHDICRFDLTTETYQKEPLLNG 114 (255)
T ss_pred CCCccccccEEEECceEEEEe------cCCccEEEEECCCCcEEEEEecCc
Confidence 555566778889999998863 235789999999987654444543
No 58
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=77.83 E-value=5.6 Score=24.92 Aligned_cols=41 Identities=12% Similarity=0.142 Sum_probs=27.4
Q ss_pred eEEEeeCCCCeE--EeeCCCCCCcccceEEEECCEEEEEcCcC
Q psy2980 14 RVECYHIAENTW--EYKSPMKEKRYRPGIAVIDGKIYVLGGEE 54 (92)
Q Consensus 14 ~v~~yd~~~~~W--~~~~~~~~~~~~~~~~~~~~~iy~~GG~~ 54 (92)
.|+..|++-.-. ..+|.+.-+...|.+.+-++.+|++||-.
T Consensus 132 ~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D~VYilGGHs 174 (337)
T PF03089_consen 132 QVFLIDLEFGCCTAHTLPELQDGQSFHVSLARNDCVYILGGHS 174 (337)
T ss_pred eEEEEeccccccccccchhhcCCeEEEEEEecCceEEEEccEE
Confidence 356666654433 33455555667777778899999999963
No 59
>KOG2321|consensus
Probab=76.93 E-value=7 Score=26.82 Aligned_cols=53 Identities=13% Similarity=0.225 Sum_probs=32.7
Q ss_pred cceEEEeeCCCCeEEeeCCCCCCcccceEEEEC--CEEEEEcCcCCCCCccceEEEEeCCCCc
Q psy2980 12 TNRVECYHIAENTWEYKSPMKEKRYRPGIAVID--GKIYVLGGEEGWDGYHDSIECYDVDNDS 72 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~--~~iy~~GG~~~~~~~~~~~~~y~~~~~~ 72 (92)
..+|+++|.+.++|..--....+..+ ++.++ ..|+.+|+.+ ..++.|||..+.
T Consensus 154 g~evYRlNLEqGrfL~P~~~~~~~lN--~v~in~~hgLla~Gt~~------g~VEfwDpR~ks 208 (703)
T KOG2321|consen 154 GSEVYRLNLEQGRFLNPFETDSGELN--VVSINEEHGLLACGTED------GVVEFWDPRDKS 208 (703)
T ss_pred CcceEEEEccccccccccccccccce--eeeecCccceEEecccC------ceEEEecchhhh
Confidence 45699999999988543332222222 33332 3477888754 378888887653
No 60
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=74.44 E-value=9.7 Score=23.12 Aligned_cols=46 Identities=22% Similarity=0.232 Sum_probs=31.8
Q ss_pred CCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980 30 PMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS 81 (92)
Q Consensus 30 ~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~ 81 (92)
.+|.+-.+.+.+++++.+|.-- ...+.+.+||..++.-.....+|.
T Consensus 64 ~Lp~~~~GtG~vVYngslYY~~------~~s~~IvkydL~t~~v~~~~~L~~ 109 (250)
T PF02191_consen 64 KLPYPWQGTGHVVYNGSLYYNK------YNSRNIVKYDLTTRSVVARRELPG 109 (250)
T ss_pred EEeceeccCCeEEECCcEEEEe------cCCceEEEEECcCCcEEEEEECCc
Confidence 3455556677888899888863 247899999999887553334443
No 61
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=73.38 E-value=14 Score=23.67 Aligned_cols=56 Identities=18% Similarity=0.297 Sum_probs=30.2
Q ss_pred CCcceEEEeeCCCCeEEeeCCCCCCcccceEEEE-CC--EEEEEcCcCCCCCccceEEEEeCCCCce
Q psy2980 10 ENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVI-DG--KIYVLGGEEGWDGYHDSIECYDVDNDSW 73 (92)
Q Consensus 10 ~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~--~iy~~GG~~~~~~~~~~~~~y~~~~~~W 73 (92)
.+..+||.||+++.+ ++..++......++.+- ++ .||..-+. -..++.||..+.+-
T Consensus 266 dpgteVWv~D~~t~k--rv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~~------~~~l~v~D~~tGk~ 324 (342)
T PF06433_consen 266 DPGTEVWVYDLKTHK--RVARIPLEHPIDSIAVSQDDKPLLYALSAG------DGTLDVYDAATGKL 324 (342)
T ss_dssp S-EEEEEEEETTTTE--EEEEEEEEEEESEEEEESSSS-EEEEEETT------TTEEEEEETTT--E
T ss_pred CCceEEEEEECCCCe--EEEEEeCCCccceEEEccCCCcEEEEEcCC------CCeEEEEeCcCCcE
Confidence 356789999999886 22222222222234443 33 45554221 35799999988754
No 62
>KOG0316|consensus
Probab=71.39 E-value=18 Score=22.29 Aligned_cols=52 Identities=15% Similarity=0.268 Sum_probs=36.3
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCC
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND 71 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~ 71 (92)
++.++|...++..++.-+...+.......+.+..++.|..++ .+..||....
T Consensus 124 s~r~wDCRS~s~ePiQildea~D~V~Si~v~~heIvaGS~DG------tvRtydiR~G 175 (307)
T KOG0316|consen 124 SVRLWDCRSRSFEPIQILDEAKDGVSSIDVAEHEIVAGSVDG------TVRTYDIRKG 175 (307)
T ss_pred eeEEEEcccCCCCccchhhhhcCceeEEEecccEEEeeccCC------cEEEEEeecc
Confidence 467777777777777666667777777778888877776554 5667777554
No 63
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=68.30 E-value=26 Score=21.71 Aligned_cols=71 Identities=17% Similarity=0.158 Sum_probs=42.4
Q ss_pred ceEEEeeCCCCe-EEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCccceee
Q psy2980 13 NRVECYHIAENT-WEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGC 88 (92)
Q Consensus 13 ~~v~~yd~~~~~-W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~ 88 (92)
+.+..|+...++ +.....+..+-...++.+.++.|++---. ..-.+..|+...++-..++.-..++...++
T Consensus 107 ~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~~~~I~vgD~~-----~sv~~~~~~~~~~~l~~va~d~~~~~v~~~ 178 (321)
T PF03178_consen 107 NKLYVYDLDNSKTLLKKAFYDSPFYITSLSVFKNYILVGDAM-----KSVSLLRYDEENNKLILVARDYQPRWVTAA 178 (321)
T ss_dssp TEEEEEEEETTSSEEEEEEE-BSSSEEEEEEETTEEEEEESS-----SSEEEEEEETTTE-EEEEEEESS-BEEEEE
T ss_pred CEEEEEEccCcccchhhheecceEEEEEEeccccEEEEEEcc-----cCEEEEEEEccCCEEEEEEecCCCccEEEE
Confidence 357888888777 88887665555666777788866553221 134455677766666666554444443333
No 64
>PF14830 Haemocyan_bet_s: Haemocyanin beta-sandwich; PDB: 1JS8_B 3QJO_A 1LNL_A.
Probab=65.31 E-value=8.5 Score=20.16 Aligned_cols=28 Identities=32% Similarity=0.506 Sum_probs=14.6
Q ss_pred EECCEEEEEcCcCCCCCccceEEEEeCC
Q psy2980 42 VIDGKIYVLGGEEGWDGYHDSIECYDVD 69 (92)
Q Consensus 42 ~~~~~iy~~GG~~~~~~~~~~~~~y~~~ 69 (92)
...+.+++.||...+.=....++.||+.
T Consensus 34 ~~AG~F~vLGG~~EMpW~FdRlykydIT 61 (103)
T PF14830_consen 34 HKAGTFFVLGGEKEMPWAFDRLYKYDIT 61 (103)
T ss_dssp EEEEEEEE---TTS---B-SS-EEEE-H
T ss_pred eEccEEEEcCCCccCccccCccchhhHH
Confidence 5567899999987655567888888873
No 65
>PF14157 YmzC: YmzC-like protein; PDB: 3KVP_E.
Probab=61.47 E-value=17 Score=17.23 Aligned_cols=15 Identities=20% Similarity=0.206 Sum_probs=12.0
Q ss_pred eEEEeeCCCCeEEee
Q psy2980 14 RVECYHIAENTWEYK 28 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~ 28 (92)
+++.||+++|.-..+
T Consensus 42 KIfkyd~~tNei~L~ 56 (63)
T PF14157_consen 42 KIFKYDEDTNEITLK 56 (63)
T ss_dssp EEEEEETTTTEEEEE
T ss_pred EEEEeCCCCCeEEEE
Confidence 589999999986544
No 66
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=61.27 E-value=37 Score=21.03 Aligned_cols=50 Identities=24% Similarity=0.264 Sum_probs=35.2
Q ss_pred ceEEE-ECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCccceeeEEc
Q psy2980 38 PGIAV-IDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPL 91 (92)
Q Consensus 38 ~~~~~-~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~ 91 (92)
.+... .++.+|---|..+ .+.+..+|+.+.+-....++|..-++-|.+.+
T Consensus 48 QGL~~~~~g~LyESTG~yG----~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~ 98 (264)
T PF05096_consen 48 QGLEFLDDGTLYESTGLYG----QSSLRKVDLETGKVLQSVPLPPRYFGEGITIL 98 (264)
T ss_dssp EEEEEEETTEEEEEECSTT----EEEEEEEETTTSSEEEEEE-TTT--EEEEEEE
T ss_pred ccEEecCCCEEEEeCCCCC----cEEEEEEECCCCcEEEEEECCccccceeEEEE
Confidence 34444 5678887766544 68899999999988877888887777777765
No 67
>PF07443 HARP: HepA-related protein (HARP); InterPro: IPR010003 This entry represents a conserved region approximately 60 residues long within eukaryotic HepA-related protein (HARP). This exhibits single-stranded DNA-dependent ATPase activity, and is ubiquitously expressed in human and mouse tissues []. Family members may contain more than one copy of this region.; GO: 0004386 helicase activity, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0016568 chromatin modification, 0005634 nucleus
Probab=59.23 E-value=4.7 Score=18.57 Aligned_cols=13 Identities=15% Similarity=0.626 Sum_probs=10.8
Q ss_pred EEeeCCCCeEEee
Q psy2980 16 ECYHIAENTWEYK 28 (92)
Q Consensus 16 ~~yd~~~~~W~~~ 28 (92)
..||+.|..|.-.
T Consensus 21 r~YD~~Tr~W~F~ 33 (55)
T PF07443_consen 21 RNYDPKTRKWNFS 33 (55)
T ss_pred cccCccceeeeee
Confidence 6799999999654
No 68
>PF11958 DUF3472: Domain of unknown function (DUF3472); InterPro: IPR021862 This presumed domain is functionally uncharacterised. This domain is found in bacteria, eukaryotes and viruses. This domain is typically between 174 to 190 amino acids in length. This domain has a single completely conserved residue G that may be functionally important.
Probab=58.11 E-value=35 Score=19.74 Aligned_cols=23 Identities=9% Similarity=0.144 Sum_probs=17.8
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcc
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRY 36 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~ 36 (92)
..+.+|+.+++|..|..+..|..
T Consensus 81 ~~~v~d~~t~~~~~Igs~~~p~~ 103 (184)
T PF11958_consen 81 TGYVTDPTTGKWTLIGSIKRPKT 103 (184)
T ss_pred EEEEEECCCCcEEEEEEEEEcCc
Confidence 35788999999999987655554
No 69
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=57.97 E-value=43 Score=20.83 Aligned_cols=63 Identities=16% Similarity=0.296 Sum_probs=38.2
Q ss_pred CcceEEEeeCCCCeEEeeCC-----CCCCcccceEEEE-CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980 11 NTNRVECYHIAENTWEYKSP-----MKEKRYRPGIAVI-DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH 78 (92)
Q Consensus 11 ~~~~v~~yd~~~~~W~~~~~-----~~~~~~~~~~~~~-~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~ 78 (92)
....+-.||..+.+|..++. +|.+......... .+.+++.|.... -..-+..| .-.+|..+..
T Consensus 62 ~~~~la~yd~~~~~w~~~~~~~s~~ipgpv~a~~~~~~d~~~~~~aG~~~~---g~~~l~~~--dGs~W~~i~~ 130 (281)
T PF12768_consen 62 NSSNLATYDFKNQTWSSLGGGSSNSIPGPVTALTFISNDGSNFWVAGRSAN---GSTFLMKY--DGSSWSSIGS 130 (281)
T ss_pred CceeEEEEecCCCeeeecCCcccccCCCcEEEEEeeccCCceEEEeceecC---CCceEEEE--cCCceEeccc
Confidence 34458999999999988865 2333322222222 345777775421 24456677 4568988865
No 70
>PF06058 DCP1: Dcp1-like decapping family; InterPro: IPR010334 An essential step in mRNA turnover is decapping. In yeast, two proteins have been identified that are essential for decapping, Dcp1 (this family) and Dcp2 (IPR007722 from INTERPRO). The precise role of these proteins in the decapping reaction has not been established. Evidence suggests that the Dcp1 may enhance the function of Dcp2 [].; PDB: 1Q67_A 2QKM_C 2QKL_A.
Probab=57.43 E-value=15 Score=19.71 Aligned_cols=17 Identities=24% Similarity=0.554 Sum_probs=14.5
Q ss_pred cceEEEEeCCCCceEEc
Q psy2980 60 HDSIECYDVDNDSWEIM 76 (92)
Q Consensus 60 ~~~~~~y~~~~~~W~~~ 76 (92)
.-.++.||+.+++|.+.
T Consensus 28 ~v~vY~f~~~~~~W~K~ 44 (122)
T PF06058_consen 28 HVVVYKFDHETNEWEKT 44 (122)
T ss_dssp EEEEEEEETTTTEEEEE
T ss_pred eEEEEeecCCCCcEeec
Confidence 55788999999999975
No 71
>KOG2394|consensus
Probab=54.98 E-value=18 Score=24.76 Aligned_cols=13 Identities=54% Similarity=0.825 Sum_probs=10.8
Q ss_pred ECCEEEEEcCcCC
Q psy2980 43 IDGKIYVLGGEEG 55 (92)
Q Consensus 43 ~~~~iy~~GG~~~ 55 (92)
-+++.++.||.++
T Consensus 342 PDGKyIvtGGEDD 354 (636)
T KOG2394|consen 342 PDGKYIVTGGEDD 354 (636)
T ss_pred CCccEEEecCCcc
Confidence 4888999999875
No 72
>cd01206 Homer Homer type EVH1 domain. Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=53.88 E-value=33 Score=18.25 Aligned_cols=20 Identities=15% Similarity=0.089 Sum_probs=15.8
Q ss_pred CcceEEEeeCCCC-eEEeeCC
Q psy2980 11 NTNRVECYHIAEN-TWEYKSP 30 (92)
Q Consensus 11 ~~~~v~~yd~~~~-~W~~~~~ 30 (92)
....|..||+.++ .|....+
T Consensus 9 arA~V~~yd~~tKk~WvPs~~ 29 (111)
T cd01206 9 TRAHVFQIDPKTKKNWIPASK 29 (111)
T ss_pred eeeEEEEECCCCcceeEeCCC
Confidence 3456999999875 9988865
No 73
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=51.78 E-value=48 Score=19.51 Aligned_cols=56 Identities=21% Similarity=0.270 Sum_probs=34.6
Q ss_pred cceEEEeeCCCCeEEeeCCCCCCcccceEEEE--CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 12 TNRVECYHIAENTWEYKSPMKEKRYRPGIAVI--DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~--~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
..+++++|+.+.+-..+.... ..+++.. ++.+|+... ..+..+|+.+.+++.+...
T Consensus 21 ~~~i~~~~~~~~~~~~~~~~~----~~G~~~~~~~g~l~v~~~--------~~~~~~d~~~g~~~~~~~~ 78 (246)
T PF08450_consen 21 GGRIYRVDPDTGEVEVIDLPG----PNGMAFDRPDGRLYVADS--------GGIAVVDPDTGKVTVLADL 78 (246)
T ss_dssp TTEEEEEETTTTEEEEEESSS----EEEEEEECTTSEEEEEET--------TCEEEEETTTTEEEEEEEE
T ss_pred CCEEEEEECCCCeEEEEecCC----CceEEEEccCCEEEEEEc--------CceEEEecCCCcEEEEeec
Confidence 356888888887765543322 2334444 577777653 2345568888888877554
No 74
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=49.68 E-value=61 Score=20.08 Aligned_cols=60 Identities=20% Similarity=0.323 Sum_probs=38.0
Q ss_pred ceEEEeeCCCC-----eEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCc-eEEcCCCC
Q psy2980 13 NRVECYHIAEN-----TWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDS-WEIMSHLP 80 (92)
Q Consensus 13 ~~v~~yd~~~~-----~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~-W~~~~~~~ 80 (92)
-.+..|+..+. ++..+...+....-.+++.+++.+.+..| +.+..|+...++ +...+.+.
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g--------~~l~v~~l~~~~~l~~~~~~~ 127 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVG--------NKLYVYDLDNSKTLLKKAFYD 127 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEET--------TEEEEEEEETTSSEEEEEEE-
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeec--------CEEEEEEccCcccchhhheec
Confidence 56889998874 66666544444444667778888555443 467777777766 77665443
No 75
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=48.02 E-value=26 Score=17.77 Aligned_cols=20 Identities=15% Similarity=0.041 Sum_probs=14.4
Q ss_pred CCcceEEEeeCCCCeEEeeC
Q psy2980 10 ENTNRVECYHIAENTWEYKS 29 (92)
Q Consensus 10 ~~~~~v~~yd~~~~~W~~~~ 29 (92)
.++-.+..|||.+++...+.
T Consensus 34 ~~~GRll~ydp~t~~~~vl~ 53 (89)
T PF03088_consen 34 RPTGRLLRYDPSTKETTVLL 53 (89)
T ss_dssp ---EEEEEEETTTTEEEEEE
T ss_pred CCCcCEEEEECCCCeEEEeh
Confidence 35567999999999986664
No 76
>COG2380 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.86 E-value=28 Score=21.68 Aligned_cols=21 Identities=14% Similarity=0.420 Sum_probs=16.7
Q ss_pred CcceEEEeeCCCCeEEeeCCC
Q psy2980 11 NTNRVECYHIAENTWEYKSPM 31 (92)
Q Consensus 11 ~~~~v~~yd~~~~~W~~~~~~ 31 (92)
.+.....+|.+|++|..++.+
T Consensus 20 ~~f~~~~~~vEtdew~~i~~~ 40 (327)
T COG2380 20 KPFAEESFDVETDEWEAIPTL 40 (327)
T ss_pred ccccceeeecccccccccCcc
Confidence 344578899999999999754
No 77
>cd01207 Ena-Vasp Enabled-VASP-type homology (EVH1) domain. Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=45.13 E-value=49 Score=17.62 Aligned_cols=18 Identities=17% Similarity=0.294 Sum_probs=15.3
Q ss_pred ceEEEeeCCCCeEEeeCC
Q psy2980 13 NRVECYHIAENTWEYKSP 30 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~ 30 (92)
+.|..||+.++.|.....
T Consensus 9 A~Vm~~d~~tk~W~P~~~ 26 (111)
T cd01207 9 ASVMVYDDSNKKWVPAGG 26 (111)
T ss_pred EEeeEEcCCCCcEEcCCC
Confidence 468999999999988755
No 78
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=42.32 E-value=91 Score=19.98 Aligned_cols=57 Identities=16% Similarity=0.396 Sum_probs=33.2
Q ss_pred ceEEEeeCCCC--eEEeeCCCC--------CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCC--ceEEc
Q psy2980 13 NRVECYHIAEN--TWEYKSPMK--------EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND--SWEIM 76 (92)
Q Consensus 13 ~~v~~yd~~~~--~W~~~~~~~--------~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~--~W~~~ 76 (92)
..+.+||..+. .|+.-..-. ..+...+.++.++.+|+... ...++++|..+. .|+.-
T Consensus 79 g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~-------~g~l~ald~~tG~~~W~~~ 147 (394)
T PRK11138 79 GLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSE-------KGQVYALNAEDGEVAWQTK 147 (394)
T ss_pred CeEEEEECCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcC-------CCEEEEEECCCCCCccccc
Confidence 35889998765 486542211 11223345667888887532 236888887553 57654
No 79
>PRK11408 hypothetical protein; Provisional
Probab=40.55 E-value=68 Score=18.00 Aligned_cols=30 Identities=17% Similarity=0.326 Sum_probs=20.9
Q ss_pred EEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 47 IYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
+|.+.|..- ......++.|-|.+-+|+.+.
T Consensus 34 ~FAiNgG~l-g~~~G~v~Y~aPDtL~WE~l~ 63 (145)
T PRK11408 34 FFALNGGGL-GEDVGEVYYFAPDTLKWESLE 63 (145)
T ss_pred eEEeccCcc-CCCCCeEEEeCCCccceeccC
Confidence 455543221 445788999999999999763
No 80
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=40.37 E-value=31 Score=14.02 Aligned_cols=26 Identities=27% Similarity=0.450 Sum_probs=15.6
Q ss_pred ceEEEECCEEEEEcCcCCCCCccceEEEEeCCC
Q psy2980 38 PGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN 70 (92)
Q Consensus 38 ~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~ 70 (92)
.+.++.++.+|+... -..++++|+.+
T Consensus 15 ~~~~v~~g~vyv~~~-------dg~l~ald~~t 40 (40)
T PF13570_consen 15 SSPAVAGGRVYVGTG-------DGNLYALDAAT 40 (40)
T ss_dssp S--EECTSEEEEE-T-------TSEEEEEETT-
T ss_pred cCCEEECCEEEEEcC-------CCEEEEEeCCC
Confidence 334667888888753 34788888753
No 81
>PF07734 FBA_1: F-box associated; InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=40.08 E-value=67 Score=17.82 Aligned_cols=40 Identities=20% Similarity=0.337 Sum_probs=23.8
Q ss_pred EEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980 41 AVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS 81 (92)
Q Consensus 41 ~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~ 81 (92)
+.+++.+|=+.-... ......+.+||..+++.++..++|.
T Consensus 2 V~vnG~~hW~~~~~~-~~~~~~IlsFDl~~E~F~~~~~lP~ 41 (164)
T PF07734_consen 2 VFVNGALHWLAYDEN-NDEKDFILSFDLSTEKFGRSLPLPF 41 (164)
T ss_pred EEECCEEEeeEEecC-CCCceEEEEEeccccccCCEECCCC
Confidence 345677766654322 1112279999999999944445543
No 82
>PF03835 Rad4: Rad4 transglutaminase-like domain; InterPro: IPR018325 RAD4/Xp-C proteins contain an ancient transglutaminase fold that is also found in peptide-N-glycanases (PNGases), which remove glycans from glycoproteins during their degradation. The PNGases retain the catalytic triad that is typical of this fold and are predicted to have a reaction mechanism similar to that involved in transglutamination. In contrast, the RAD4/Xp-C proteins are predicted to be inactive and are likely to only possess the interaction function in DNA repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0005634 nucleus; PDB: 2QSG_A 2QSF_A 2QSH_A 1X3W_A 1X3Z_A 3ESW_A.
Probab=39.47 E-value=46 Score=18.25 Aligned_cols=21 Identities=29% Similarity=0.439 Sum_probs=16.8
Q ss_pred cceEEEeeCCCCeEEeeCCCC
Q psy2980 12 TNRVECYHIAENTWEYKSPMK 32 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~~~ 32 (92)
.--+|+|++.+++|..+-++.
T Consensus 36 ~~W~EV~~~~~~rWI~VDp~~ 56 (145)
T PF03835_consen 36 NFWVEVYSPEEKRWIHVDPVV 56 (145)
T ss_dssp CEEEEEEETTTTEEEEEETTT
T ss_pred eEEEEEEecCCCeEEEeeeec
Confidence 335899999999999996554
No 83
>PF10902 DUF2693: Protein of unknown function (DUF2693); InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown.
Probab=38.19 E-value=46 Score=16.75 Aligned_cols=20 Identities=20% Similarity=0.579 Sum_probs=16.2
Q ss_pred CCccceEEEEeCCCCceEEc
Q psy2980 57 DGYHDSIECYDVDNDSWEIM 76 (92)
Q Consensus 57 ~~~~~~~~~y~~~~~~W~~~ 76 (92)
.+....+..||...+.|+..
T Consensus 45 kes~~s~~yfDve~~~WRSF 64 (83)
T PF10902_consen 45 KESTTSVRYFDVEKKGWRSF 64 (83)
T ss_pred cCCcceEEEEEeccCceeee
Confidence 44567799999999999854
No 84
>PRK09511 nirD nitrite reductase small subunit; Provisional
Probab=37.74 E-value=55 Score=16.98 Aligned_cols=32 Identities=9% Similarity=0.087 Sum_probs=14.5
Q ss_pred EEeeCCCCeEEeeCCCCCCcccceEEEECCEEEE
Q psy2980 16 ECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYV 49 (92)
Q Consensus 16 ~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~ 49 (92)
++||..|.+....+. .+...+.+-+-++.||+
T Consensus 75 ~~Fdl~TG~~~~~~~--~~l~typV~ve~g~V~v 106 (108)
T PRK09511 75 QRFRLSDGLCMEDEQ--FSVKHYDARVKDGVVQL 106 (108)
T ss_pred CEEECCCcccCCCCC--ccEeeEeEEEECCEEEE
Confidence 556666665433221 12222334444666655
No 85
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=36.46 E-value=87 Score=18.04 Aligned_cols=25 Identities=32% Similarity=0.611 Sum_probs=15.5
Q ss_pred CCEEEEEcCcCCCCCccceEEEEeCCCC
Q psy2980 44 DGKIYVLGGEEGWDGYHDSIECYDVDND 71 (92)
Q Consensus 44 ~~~iy~~GG~~~~~~~~~~~~~y~~~~~ 71 (92)
++.+.+++|..+ ...++..||..+.
T Consensus 111 ~G~~l~~~g~~n---~~G~l~~wd~~~~ 135 (194)
T PF08662_consen 111 DGRFLVLAGFGN---LNGDLEFWDVRKK 135 (194)
T ss_pred CCCEEEEEEccC---CCcEEEEEECCCC
Confidence 566777777643 2246778888643
No 86
>PF00614 PLDc: Phospholipase D Active site motif; InterPro: IPR001736 Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs [, , , ].; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3HSI_C.
Probab=35.69 E-value=35 Score=13.21 Aligned_cols=16 Identities=25% Similarity=0.644 Sum_probs=8.9
Q ss_pred eEEEECCEEEEEcCcC
Q psy2980 39 GIAVIDGKIYVLGGEE 54 (92)
Q Consensus 39 ~~~~~~~~iy~~GG~~ 54 (92)
-.+++++.+.++||.+
T Consensus 8 K~~vvD~~~a~vGg~n 23 (28)
T PF00614_consen 8 KFVVVDDRVAFVGGAN 23 (28)
T ss_dssp -EEEETTTEEEEE---
T ss_pred EEEEEcCCEEEECcee
Confidence 3566777777777754
No 87
>KOG2055|consensus
Probab=34.47 E-value=1.5e+02 Score=20.22 Aligned_cols=27 Identities=11% Similarity=0.249 Sum_probs=19.8
Q ss_pred EEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 47 IYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
..+++|. .+-++.||..+.+-.++.++
T Consensus 272 ~i~~s~r------rky~ysyDle~ak~~k~~~~ 298 (514)
T KOG2055|consen 272 VIFTSGR------RKYLYSYDLETAKVTKLKPP 298 (514)
T ss_pred EEEeccc------ceEEEEeeccccccccccCC
Confidence 5555554 45789999998888887655
No 88
>KOG2958|consensus
Probab=33.02 E-value=35 Score=21.63 Aligned_cols=20 Identities=25% Similarity=0.478 Sum_probs=15.4
Q ss_pred CcceEEEeeCCCCeEEeeCC
Q psy2980 11 NTNRVECYHIAENTWEYKSP 30 (92)
Q Consensus 11 ~~~~v~~yd~~~~~W~~~~~ 30 (92)
+...-.+|||.+++|-.+.+
T Consensus 9 ~q~~hrRynPltd~wVlvSp 28 (354)
T KOG2958|consen 9 NQHSHRRYNPLTDEWVLVSP 28 (354)
T ss_pred ccCchhccCCccceeEEech
Confidence 34457899999999987743
No 89
>PRK00178 tolB translocation protein TolB; Provisional
Probab=32.95 E-value=1.4e+02 Score=19.36 Aligned_cols=57 Identities=12% Similarity=0.085 Sum_probs=30.9
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcccceEE-EEC-CEEEEEcCcCCCCCccceEEEEeCCCCceEEc
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRYRPGIA-VID-GKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM 76 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~-~~~-~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~ 76 (92)
+++.+|..+.....+...+. ...... .-+ ..|++.....+ ...++.+|..+..++++
T Consensus 268 ~Iy~~d~~~~~~~~lt~~~~--~~~~~~~spDg~~i~f~s~~~g----~~~iy~~d~~~g~~~~l 326 (430)
T PRK00178 268 EIYVMDLASRQLSRVTNHPA--IDTEPFWGKDGRTLYFTSDRGG----KPQIYKVNVNGGRAERV 326 (430)
T ss_pred eEEEEECCCCCeEEcccCCC--CcCCeEECCCCCEEEEEECCCC----CceEEEEECCCCCEEEe
Confidence 58888998888777654221 111111 123 34544432111 34678888777766655
No 90
>PRK05137 tolB translocation protein TolB; Provisional
Probab=32.88 E-value=1.4e+02 Score=19.47 Aligned_cols=60 Identities=17% Similarity=0.210 Sum_probs=31.0
Q ss_pred ceEEEeeCCCCeEEeeCCCCCCcccceEEEECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
..++.+|+.+.+...+...+...... ...-++ .|++....++ ..+++.+|..+..-+++.
T Consensus 226 ~~i~~~dl~~g~~~~l~~~~g~~~~~-~~SPDG~~la~~~~~~g----~~~Iy~~d~~~~~~~~Lt 286 (435)
T PRK05137 226 PRVYLLDLETGQRELVGNFPGMTFAP-RFSPDGRKVVMSLSQGG----NTDIYTMDLRSGTTTRLT 286 (435)
T ss_pred CEEEEEECCCCcEEEeecCCCcccCc-EECCCCCEEEEEEecCC----CceEEEEECCCCceEEcc
Confidence 35888888888777665443221111 111233 3433332221 346777777666655554
No 91
>PRK04792 tolB translocation protein TolB; Provisional
Probab=32.77 E-value=1.5e+02 Score=19.64 Aligned_cols=59 Identities=12% Similarity=0.120 Sum_probs=31.4
Q ss_pred ceEEEeeCCCCeEEeeCCCCCCcccceEEE-EC-CEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEKRYRPGIAV-ID-GKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~-~~-~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
..++.+|+.+++-..+...+... ...+. -+ ..|++....++ ...++.+|..+.+.+++.
T Consensus 242 ~~L~~~dl~tg~~~~lt~~~g~~--~~~~wSPDG~~La~~~~~~g----~~~Iy~~dl~tg~~~~lt 302 (448)
T PRK04792 242 AEIFVQDIYTQVREKVTSFPGIN--GAPRFSPDGKKLALVLSKDG----QPEIYVVDIATKALTRIT 302 (448)
T ss_pred cEEEEEECCCCCeEEecCCCCCc--CCeeECCCCCEEEEEEeCCC----CeEEEEEECCCCCeEECc
Confidence 35788888777665555433211 11222 23 34544432222 356888888777776654
No 92
>PF09272 Hepsin-SRCR: Hepsin, SRCR; InterPro: IPR015352 This entry represents the extracellular domain of the serine protease hepsin. The domain is formed primarily by three elements of regular secondary structure: a 12-residue alpha helix, a twisted five-stranded antiparallel beta sheet, and a second, two-stranded, antiparallel sheet. The two beta-sheets lie at roughly right angles to each other, with the helix nestled between the two, adopting an SRCR fold. The exact function of this domain has not been identified, though it probably may serve to orient the protease domain or place it in the vicinity of its substrate []. ; GO: 0004252 serine-type endopeptidase activity, 0070008 serine-type exopeptidase activity; PDB: 3T2N_B 1Z8G_A 1P57_A 1O5E_L 1O5F_L.
Probab=31.69 E-value=70 Score=16.94 Aligned_cols=20 Identities=10% Similarity=0.521 Sum_probs=14.8
Q ss_pred cceEEEEeCCCCceEEcCCC
Q psy2980 60 HDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 60 ~~~~~~y~~~~~~W~~~~~~ 79 (92)
-..+..||..+++|+.+...
T Consensus 11 D~rL~vfD~te~~WRllCss 30 (110)
T PF09272_consen 11 DQRLMVFDSTEGTWRLLCSS 30 (110)
T ss_dssp T-BEEEEETTTTEEEEEB--
T ss_pred cceEEEEeccCCEEEEEeec
Confidence 45688899999999988643
No 93
>PF14298 DUF4374: Domain of unknown function (DUF4374)
Probab=31.49 E-value=1.7e+02 Score=19.79 Aligned_cols=61 Identities=16% Similarity=0.277 Sum_probs=36.4
Q ss_pred cceEEEeeCCCCeEEeeCCCCCC---cccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceE
Q psy2980 12 TNRVECYHIAENTWEYKSPMKEK---RYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWE 74 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~~~~~---~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~ 74 (92)
..++-+||..+.+...+..+|.. .......+-++.+|+--.-. .....-++.+||.+.+-.
T Consensus 366 ~~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vtt~--~g~~~~IY~iDp~TatAt 429 (435)
T PF14298_consen 366 AKKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVTTE--DGSDPYIYKIDPATATAT 429 (435)
T ss_pred cceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEeec--CCCceeEEEEcCcccccc
Confidence 34577888888877777666654 22223344566666653221 111357999999877543
No 94
>cd00837 EVH1 EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. The EVH1 domain binds to other proteins at proline rich sequences in either FPPPP or PPXXF motifs. It is found in the cytoskeletal reorganization proteins Enabled VASP, and WASP, and in the synaptic scaffolding protein Homer. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=31.48 E-value=62 Score=16.70 Aligned_cols=17 Identities=12% Similarity=0.173 Sum_probs=14.6
Q ss_pred cceEEEEeCCCCceEEc
Q psy2980 60 HDSIECYDVDNDSWEIM 76 (92)
Q Consensus 60 ~~~~~~y~~~~~~W~~~ 76 (92)
...++.+|+.++.|...
T Consensus 8 ~a~v~~~~~~~~~W~~~ 24 (104)
T cd00837 8 VAQVYTADPSTGKWVPA 24 (104)
T ss_pred EEEEEEECCCCCceEEC
Confidence 56788999999999987
No 95
>KOG0179|consensus
Probab=31.20 E-value=35 Score=20.47 Aligned_cols=19 Identities=21% Similarity=0.333 Sum_probs=13.8
Q ss_pred CccccCCCCCCcceEEEeeCC
Q psy2980 1 MLRGWHASTENTNRVECYHIA 21 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~ 21 (92)
|++|.+.++... |+.|||.
T Consensus 129 ilaGiDeeGKG~--VySyDPv 147 (235)
T KOG0179|consen 129 ILAGIDEEGKGA--VYSYDPV 147 (235)
T ss_pred eeecccccCcee--EEeecCC
Confidence 467887666555 8999875
No 96
>PF13952 DUF4216: Domain of unknown function (DUF4216)
Probab=30.80 E-value=75 Score=15.57 Aligned_cols=19 Identities=11% Similarity=0.298 Sum_probs=11.7
Q ss_pred CCccceEEEEeCC-CCceEE
Q psy2980 57 DGYHDSIECYDVD-NDSWEI 75 (92)
Q Consensus 57 ~~~~~~~~~y~~~-~~~W~~ 75 (92)
.+...-.+.-||. ...|..
T Consensus 54 sqa~QVfYv~dp~~~~~w~v 73 (75)
T PF13952_consen 54 SQAKQVFYVPDPSSDKKWHV 73 (75)
T ss_pred hheeEEEEEeCCCCCCCcEE
Confidence 4445566667777 666764
No 97
>cd03476 Rieske_ArOX_small Small subunit of Arsenite oxidase (ArOX) family, Rieske domain; ArOX is a molybdenum/iron protein involved in the detoxification of arsenic, oxidizing it to arsenate. It consists of two subunits, a large subunit similar to members of the DMSO reductase family of molybdenum enzymes and a small subunit with a Rieske-type [2Fe-2S] cluster. The large subunit of ArOX contains the molybdenum site at which the oxidation of arsenite occurs. The small subunit contains a domain homologous to the Rieske domains of the cytochrome bc(1) and cytochrome b6f complexes as well as naphthalene 1,2-dioxygenase. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer.
Probab=30.61 E-value=95 Score=16.73 Aligned_cols=35 Identities=14% Similarity=0.166 Sum_probs=17.6
Q ss_pred EEeeCCC-CeEEeeCCCCCCcccceEEEE--CCEEEEEc
Q psy2980 16 ECYHIAE-NTWEYKSPMKEKRYRPGIAVI--DGKIYVLG 51 (92)
Q Consensus 16 ~~yd~~~-~~W~~~~~~~~~~~~~~~~~~--~~~iy~~G 51 (92)
+.||+.+ ..-...|. +.+...+.+.+- ++.||+.|
T Consensus 78 ~~Fdl~tgG~~~~gPa-~~~L~~ypv~ve~~~g~V~~~~ 115 (126)
T cd03476 78 SQFDPARGGQMVSGQA-TQNLPQIVLEYDEASGDIYAVG 115 (126)
T ss_pred CEEeCCCCCeEEcCCC-CCCCCeEEEEEECCCCEEEEEE
Confidence 4566653 24433332 223333445555 67788876
No 98
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=30.34 E-value=1.5e+02 Score=18.93 Aligned_cols=60 Identities=17% Similarity=0.160 Sum_probs=31.7
Q ss_pred ceEEEeeCCCCeEEeeCCCCCCcccceEEEECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
..++.+|..+++-..+...+..... ....-++ .|++.....+ ...++.+|..+...+++.
T Consensus 214 ~~i~v~d~~~g~~~~~~~~~~~~~~-~~~spDg~~l~~~~~~~~----~~~i~~~d~~~~~~~~l~ 274 (417)
T TIGR02800 214 PEIYVQDLATGQREKVASFPGMNGA-PAFSPDGSKLAVSLSKDG----NPDIYVMDLDGKQLTRLT 274 (417)
T ss_pred cEEEEEECCCCCEEEeecCCCCccc-eEECCCCCEEEEEECCCC----CccEEEEECCCCCEEECC
Confidence 4588889887766555443322211 1112233 4554432222 356888888777666553
No 99
>PF08950 DUF1861: Protein of unknown function (DUF1861); InterPro: IPR015045 This hypothetical protein, found in bacteria and in the eukaryote Leishmania, has no known function. ; PDB: 2B4W_A.
Probab=30.01 E-value=1.5e+02 Score=18.80 Aligned_cols=42 Identities=17% Similarity=0.379 Sum_probs=25.9
Q ss_pred CcceEEEeeCC-CCeEEeeCCCC-CCcccceEEEECCEEEEEcCc
Q psy2980 11 NTNRVECYHIA-ENTWEYKSPMK-EKRYRPGIAVIDGKIYVLGGE 53 (92)
Q Consensus 11 ~~~~v~~yd~~-~~~W~~~~~~~-~~~~~~~~~~~~~~iy~~GG~ 53 (92)
..+.|..|.-. .++|..++.-+ ......-.+.+++.+ ++||.
T Consensus 52 e~S~V~fF~e~g~~~w~~v~~~~~~~LqDPF~t~I~gel-ifGGv 95 (298)
T PF08950_consen 52 EHSEVRFFEETGKDEWTPVEGAPVFQLQDPFVTRIQGEL-IFGGV 95 (298)
T ss_dssp S--EEEEEEEEETTEEEE-TT---BS-EEEEEEEETTEE-EEEEE
T ss_pred hccEEEEEEEeCCCeEEECCCcceEEecCcceeeECCEE-EEeeE
Confidence 34567777655 88999998744 455566677888988 45554
No 100
>cd03529 Rieske_NirD Assimilatory nitrite reductase (NirD) family, Rieske domain; Assimilatory nitrate and nitrite reductases convert nitrate through nitrite to ammonium. Members include bacterial and fungal proteins. The bacterial NirD contains a single Rieske domain while fungal proteins have a C-terminal Rieske domain in addition to several other domains. The fungal NirD is involved in nutrient acquisition, functioning at the soil/fungus interface to control nutrient exchange between the fungus and the host plant. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. The Rieske [2Fe-2S] cluster is liganded to two histidine and two cysteine residues present in conserved sequences called Rieske motifs. In this family, only a few members contain these residues. Other members may have lost the ability to bind the Rieske [2Fe-2S] cluster.
Probab=29.55 E-value=86 Score=15.88 Aligned_cols=6 Identities=0% Similarity=0.423 Sum_probs=2.3
Q ss_pred eeCCCC
Q psy2980 18 YHIAEN 23 (92)
Q Consensus 18 yd~~~~ 23 (92)
||+.+.
T Consensus 73 Fdl~tG 78 (103)
T cd03529 73 FSLKTG 78 (103)
T ss_pred EEcCCC
Confidence 333333
No 101
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=29.15 E-value=2.3e+02 Score=20.67 Aligned_cols=58 Identities=10% Similarity=0.349 Sum_probs=33.3
Q ss_pred ceEEEeeCCCCe--EEeeCC---------------CC--CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCC--
Q psy2980 13 NRVECYHIAENT--WEYKSP---------------MK--EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND-- 71 (92)
Q Consensus 13 ~~v~~yd~~~~~--W~~~~~---------------~~--~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~-- 71 (92)
-.+..+|+.|.+ |+.--. ++ .+..+..+++-++.+|+-|..+ ..+..||.++.
T Consensus 641 G~l~AiDl~tGk~~W~~~~g~~~~~~p~~~~~~~~~~~g~p~~gG~l~TagglvF~~gt~d------~~l~A~D~~tGk~ 714 (764)
T TIGR03074 641 GYMAAIDLKTGKVVWQHPNGTVRDTGPMGIRMPLPIPIGVPTLGGPLATAGGLVFIGATQD------NYLRAYDLSTGKE 714 (764)
T ss_pred EEEEEEECCCCcEeeeeECCccccccccccccccccccCCcccCCcEEEcCCEEEEEeCCC------CEEEEEECCCCce
Confidence 458889988775 755420 01 1222333555566666533322 47899998664
Q ss_pred ceEEc
Q psy2980 72 SWEIM 76 (92)
Q Consensus 72 ~W~~~ 76 (92)
-|+..
T Consensus 715 lW~~~ 719 (764)
T TIGR03074 715 LWKAR 719 (764)
T ss_pred eeEee
Confidence 48754
No 102
>PF05237 MoeZ_MoeB: MoeZ/MoeB domain; InterPro: IPR007901 This putative domain is found in the MoeZ protein and the MoeB protein. The domain has two CXXC motifs that are only partly conserved. MoeZ is necessary for the synthesis of pyridine-2,6-bis(thiocarboxylic acid), a small secreted metabolite that has a high affinity for transition metals, increases iron uptake efficiency by 20% in Pseudomonas stutzeri, has the ability to reduce both soluble and mineral forms of iron, and has antimicrobial activity towards several species of bacteria. MoeB is the molybdopterin synthase activating enzyme in the molybdopterin cofactor biosynthesis pathway. Both these enzymes are members of a superfamily consisting of related but structurally distinct proteins that are members of pathways involved in the transfer of sulphur-containing moieties to metabolites [] and both also contain the UBA/THIF-type NAD/FAD binding fold (IPR000594 from INTERPRO). ; PDB: 1JWA_B 1JW9_B 1JWB_B 1ZKM_D 1ZUD_3 1ZFN_D.
Probab=29.13 E-value=81 Score=15.46 Aligned_cols=19 Identities=0% Similarity=0.034 Sum_probs=15.6
Q ss_pred cceEEEeeCCCCeEEeeCC
Q psy2980 12 TNRVECYHIAENTWEYKSP 30 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~ 30 (92)
..++..||..+.+|+.+.-
T Consensus 53 ~~~l~~~D~~~~~~~~i~~ 71 (84)
T PF05237_consen 53 SGKLLTIDLLNMSFRSIRI 71 (84)
T ss_dssp BTEEEEEETTTTEEEEEE-
T ss_pred hhheeeEECCCCeEEEEec
Confidence 4469999999999998863
No 103
>PF00568 WH1: WH1 domain; InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][]. WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,]. Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=29.04 E-value=88 Score=16.27 Aligned_cols=16 Identities=13% Similarity=0.104 Sum_probs=13.4
Q ss_pred ceEEEeeCCCCe-EEee
Q psy2980 13 NRVECYHIAENT-WEYK 28 (92)
Q Consensus 13 ~~v~~yd~~~~~-W~~~ 28 (92)
..|+.+|+.+++ |...
T Consensus 16 A~v~~~~p~~~~~W~~~ 32 (111)
T PF00568_consen 16 AQVYQADPDTKRQWSPV 32 (111)
T ss_dssp EEEEEEETTTSESEEES
T ss_pred EEEEEEEcCCCCcEeeC
Confidence 467899999888 9986
No 104
>KOG1898|consensus
Probab=28.56 E-value=2.8e+02 Score=21.39 Aligned_cols=31 Identities=10% Similarity=0.083 Sum_probs=21.7
Q ss_pred cceEEEEeCCCCceEEcCCCCCCccceeeEE
Q psy2980 60 HDSIECYDVDNDSWEIMSHLPSARSWLGCVP 90 (92)
Q Consensus 60 ~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~ 90 (92)
.-.+++|++..|+......-|.+|....+..
T Consensus 997 SV~~~~y~~~~n~l~~fadD~~pR~Vt~~~~ 1027 (1205)
T KOG1898|consen 997 SVHFVRYRREDNQLIVFADDPVPRHVTALEL 1027 (1205)
T ss_pred eEEEEEEecCCCeEEEEeCCCccceeeEEEE
Confidence 4456778888888888777777776554443
No 105
>PF04096 Nucleoporin2: Nucleoporin autopeptidase; InterPro: IPR007230 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of autocatalytic serine endopeptidases belong to MEROPS peptidase family S59 (clan SP). The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is comprised of approximately 50 unique proteins, collectively known as nucleoporins. A number of the peptides are synthesised as precursors and undergo self-catalyzed cleavage. The proteolytic cleavage site of yeast Nup145p has been mapped upstream of an evolutionary conserved serine residue. Cleavage occurs at the same site when a precursor is artificially expressed in Escherichia coli. A hydroxyl-containing residue is critical for the reaction, although a thiol-containing residue offers an acceptable replacement. In vitro kinetics experiments using a purified precursor molecule demonstrate that the cleavage is self-catalyzed and that the catalytic domain lies within the N-terminal moiety. Taken altogether, the data are consistent with a proteolytic mechanism involving an N>O acyl rearrangement and a subsequent ester intermediate uncovered in other self-processing proteins []. Nup98 is a component of the nuclear pore that plays its primary role in the export of RNAs. Nup98 is expressed in two forms, derived from alternate mRNA splicing. Both forms are processed into two peptides through autoproteolysis mediated by the C-terminal domain of hNup98. The three-dimensional structure of the C-terminal domain reveals a novel protein fold, and thus a new class of autocatalytic proteases. The structure further reveals that the suggested nucleoporin RNA binding motif is unlikely to bind to RNA []. The following nucleoporins share an ~150-residue C-terminal domain responsible for NPC targeting [, ]: Vertebrate Nup98, a component of the nuclear pore that plays its primary role in the export of RNAs. Yeast Nup100, plays an important role in several nuclear export and import pathways including poly(A)+ RNA and protein transport. Yeast Nup116, involved in mRNA export and protein transport. Yeast Nup145, involved in nuclear poly(A)+ RNA and tRNA export. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains are necessary for the proper targeting to the nuclear pore [, ]. The NUP C-terminal domain adopts a predominantly beta-strand structure. The molecule consists of a six-stranded beta-sheet sandwiched against a two-stranded beta-sheet and flanked by alpha-helical regions. The N-terminal helical region consists of two short helices, whereas the stretch on the opposite side of molecule consists of a single, longer helix [, ].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3PBP_E 2AIV_A 1KO6_A 2Q5X_A 2Q5Y_C 3KEP_A 3KES_A 3NF5_B 3TKN_I.
Probab=28.05 E-value=86 Score=17.36 Aligned_cols=16 Identities=19% Similarity=0.482 Sum_probs=13.0
Q ss_pred cceEEEeeCCCCeEEe
Q psy2980 12 TNRVECYHIAENTWEY 27 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~ 27 (92)
..++..||+.+.+|.-
T Consensus 121 ~~~FvsYd~~tG~W~F 136 (141)
T PF04096_consen 121 GAEFVSYDPETGTWVF 136 (141)
T ss_dssp TSEEEEEETTTTEEEE
T ss_pred CCEEEEEeCCCcEEEE
Confidence 3577889999999964
No 106
>KOG0294|consensus
Probab=27.57 E-value=1.8e+02 Score=18.91 Aligned_cols=27 Identities=19% Similarity=0.427 Sum_probs=16.0
Q ss_pred eEEEECCEEEEEcCcCCCCCccceEEEEeCCCC
Q psy2980 39 GIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND 71 (92)
Q Consensus 39 ~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~ 71 (92)
.++++++...+-||.+ .+++.||..++
T Consensus 47 tavAVs~~~~aSGssD------etI~IYDm~k~ 73 (362)
T KOG0294|consen 47 TALAVSGPYVASGSSD------ETIHIYDMRKR 73 (362)
T ss_pred eEEEecceeEeccCCC------CcEEEEeccch
Confidence 3444566444445443 37888888765
No 107
>PF10946 DUF2625: Protein of unknown function DUF2625; InterPro: IPR021239 Some members in this family of proteins are annotated as ybfG however currently no function is known.
Probab=27.33 E-value=1.3e+02 Score=18.04 Aligned_cols=30 Identities=13% Similarity=0.314 Sum_probs=20.4
Q ss_pred EEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980 47 IYVLGGEEGWDGYHDSIECYDVDNDSWEIMS 77 (92)
Q Consensus 47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~ 77 (92)
+|.+.|..- ......++.|.|.+-+|+.+.
T Consensus 111 ~FAiNgG~l-g~d~G~v~YfAPDtL~WE~l~ 140 (208)
T PF10946_consen 111 FFAINGGGL-GEDPGKVYYFAPDTLEWEPLE 140 (208)
T ss_pred EEEEccCCc-CCCCCeEEEECCCcccccccC
Confidence 455533221 345778999999999999773
No 108
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=27.31 E-value=1.7e+02 Score=18.50 Aligned_cols=76 Identities=14% Similarity=0.255 Sum_probs=38.4
Q ss_pred ccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEE--ECCEEEEEcCcCCCCCccceEEEEeCCC--CceEEcC
Q psy2980 2 LRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAV--IDGKIYVLGGEEGWDGYHDSIECYDVDN--DSWEIMS 77 (92)
Q Consensus 2 ~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~--~~~~iy~~GG~~~~~~~~~~~~~y~~~~--~~W~~~~ 77 (92)
+|++.......=.++.||..+.++..+......-.-.-++. -++.||+..... .....+..|.... .+.+.+.
T Consensus 4 vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~---~~~g~v~~~~i~~~~g~L~~~~ 80 (345)
T PF10282_consen 4 VGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGS---GDSGGVSSYRIDPDTGTLTLLN 80 (345)
T ss_dssp EEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTS---STTTEEEEEEEETTTTEEEEEE
T ss_pred EEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccc---cCCCCEEEEEECCCcceeEEee
Confidence 56665322222125666678899987764322111111333 356788885432 1244566666544 4666654
Q ss_pred CCC
Q psy2980 78 HLP 80 (92)
Q Consensus 78 ~~~ 80 (92)
..+
T Consensus 81 ~~~ 83 (345)
T PF10282_consen 81 SVP 83 (345)
T ss_dssp EEE
T ss_pred eec
Confidence 433
No 109
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=27.21 E-value=1.8e+02 Score=18.79 Aligned_cols=17 Identities=24% Similarity=0.552 Sum_probs=14.7
Q ss_pred CCccceEEEEeCCCCce
Q psy2980 57 DGYHDSIECYDVDNDSW 73 (92)
Q Consensus 57 ~~~~~~~~~y~~~~~~W 73 (92)
..-.+.+.++|..+.+|
T Consensus 169 ~~g~~~i~~~Dli~~~~ 185 (339)
T PF09910_consen 169 HKGVSGIHCLDLISGKW 185 (339)
T ss_pred ccCCceEEEEEccCCeE
Confidence 44578899999999999
No 110
>PF14741 GH114_assoc: N-terminal glycosyl-hydrolase-114-associated domain
Probab=27.06 E-value=1e+02 Score=16.95 Aligned_cols=20 Identities=15% Similarity=0.415 Sum_probs=15.9
Q ss_pred eEEEeeC-CCCeEEeeCCCCC
Q psy2980 14 RVECYHI-AENTWEYKSPMKE 33 (92)
Q Consensus 14 ~v~~yd~-~~~~W~~~~~~~~ 33 (92)
.+..||. .+++|.++..+..
T Consensus 64 ~vd~~~~~~~~~w~~vGD~t~ 84 (129)
T PF14741_consen 64 TVDMFDFDDTSTWVKVGDLTG 84 (129)
T ss_pred EEEEEEccCCCeeEEeccccc
Confidence 4788888 8889999987643
No 111
>TIGR02377 MocE_fam_FeS Rieske [2Fe-2S] domain protein, MocE subfamily. This model describes a subfamily of the Rieske-like [2Fe-2S] family of ferredoxins that includes MocE, part of the rhizopine (3-O-methyl-scyllo-inosamine) catabolic cluster in Rhizobium. Members of this family are related to, yet distinct from, the small subunit of nitrite reductase [NAD(P)H].
Probab=26.97 E-value=98 Score=15.68 Aligned_cols=13 Identities=8% Similarity=0.289 Sum_probs=5.2
Q ss_pred EEEeeCCCCeEEe
Q psy2980 15 VECYHIAENTWEY 27 (92)
Q Consensus 15 v~~yd~~~~~W~~ 27 (92)
+..|....++..-
T Consensus 26 i~l~r~~~g~~~A 38 (101)
T TIGR02377 26 FAIYRTPDDQYYA 38 (101)
T ss_pred EEEEEeCCCEEEE
Confidence 4444443334433
No 112
>PRK01029 tolB translocation protein TolB; Provisional
Probab=26.54 E-value=2e+02 Score=19.02 Aligned_cols=59 Identities=8% Similarity=0.064 Sum_probs=34.1
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcccceEE-EECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRYRPGIA-VIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH 78 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~-~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~ 78 (92)
.++.||+.+++...+...+ ....... .-++ .|++..... ....++.+|....+.+.+..
T Consensus 352 ~I~v~dl~~g~~~~Lt~~~--~~~~~p~wSpDG~~L~f~~~~~----g~~~L~~vdl~~g~~~~Lt~ 412 (428)
T PRK01029 352 QICVYDLATGRDYQLTTSP--ENKESPSWAIDSLHLVYSAGNS----NESELYLISLITKKTRKIVI 412 (428)
T ss_pred EEEEEECCCCCeEEccCCC--CCccceEECCCCCEEEEEECCC----CCceEEEEECCCCCEEEeec
Confidence 5788898888877775432 1111122 2233 454443221 13678899998887777753
No 113
>KOG2111|consensus
Probab=26.38 E-value=1.9e+02 Score=18.77 Aligned_cols=51 Identities=22% Similarity=0.203 Sum_probs=29.2
Q ss_pred CccccCCCCCCcceEEEeeCCCCe-EEeeCC---CC--CCcccceEEEECCEEEEEc
Q psy2980 1 MLRGWHASTENTNRVECYHIAENT-WEYKSP---MK--EKRYRPGIAVIDGKIYVLG 51 (92)
Q Consensus 1 v~GG~~~~~~~~~~v~~yd~~~~~-W~~~~~---~~--~~~~~~~~~~~~~~iy~~G 51 (92)
++||-+....+.++|..+|=...+ -.++.- .. .-|...-+++..++||++-
T Consensus 63 LVGGg~~pky~pNkviIWDD~k~~~i~el~f~~~I~~V~l~r~riVvvl~~~I~Vyt 119 (346)
T KOG2111|consen 63 LVGGGSRPKYPPNKVIIWDDLKERCIIELSFNSEIKAVKLRRDRIVVVLENKIYVYT 119 (346)
T ss_pred EecCCCCCCCCCceEEEEecccCcEEEEEEeccceeeEEEcCCeEEEEecCeEEEEE
Confidence 467776666777789988843332 222221 11 1133456777788887763
No 114
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=26.13 E-value=1.4e+02 Score=17.16 Aligned_cols=55 Identities=16% Similarity=0.327 Sum_probs=33.6
Q ss_pred eEEEeeCCCCe--EEeeCCCCCCcccce--EEEECCEEEEEcCcCCCCCccceEEEEeCCCC--ceEEcC
Q psy2980 14 RVECYHIAENT--WEYKSPMKEKRYRPG--IAVIDGKIYVLGGEEGWDGYHDSIECYDVDND--SWEIMS 77 (92)
Q Consensus 14 ~v~~yd~~~~~--W~~~~~~~~~~~~~~--~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~--~W~~~~ 77 (92)
.+.++|+.+.+ |+.-.. .+..... .+..++.+|+..+ ...++++|+.+. .|+.-.
T Consensus 4 ~l~~~d~~tG~~~W~~~~~--~~~~~~~~~~~~~~~~v~~~~~-------~~~l~~~d~~tG~~~W~~~~ 64 (238)
T PF13360_consen 4 TLSALDPRTGKELWSYDLG--PGIGGPVATAVPDGGRVYVASG-------DGNLYALDAKTGKVLWRFDL 64 (238)
T ss_dssp EEEEEETTTTEEEEEEECS--SSCSSEEETEEEETTEEEEEET-------TSEEEEEETTTSEEEEEEEC
T ss_pred EEEEEECCCCCEEEEEECC--CCCCCccceEEEeCCEEEEEcC-------CCEEEEEECCCCCEEEEeec
Confidence 47889987664 766321 1122222 3347888888732 457899998665 476654
No 115
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=25.36 E-value=1.8e+02 Score=18.12 Aligned_cols=62 Identities=15% Similarity=0.290 Sum_probs=34.7
Q ss_pred cceEEEeeCCCCeEEeeCCCC----CCccc-ceEEEEC-------CEEEEEcCcCCCCCccceEEEEeCCCC-ceEEcCC
Q psy2980 12 TNRVECYHIAENTWEYKSPMK----EKRYR-PGIAVID-------GKIYVLGGEEGWDGYHDSIECYDVDND-SWEIMSH 78 (92)
Q Consensus 12 ~~~v~~yd~~~~~W~~~~~~~----~~~~~-~~~~~~~-------~~iy~~GG~~~~~~~~~~~~~y~~~~~-~W~~~~~ 78 (92)
..++..||+.+++-.+.-.+| .+... ..+++-. ..+|+.- .....+.+||..++ .|+.+..
T Consensus 33 ~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD------~~~~glIV~dl~~~~s~Rv~~~ 106 (287)
T PF03022_consen 33 PPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITD------SGGPGLIVYDLATGKSWRVLHN 106 (287)
T ss_dssp --EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEE------TTTCEEEEEETTTTEEEEEETC
T ss_pred CcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeC------CCcCcEEEEEccCCcEEEEecC
Confidence 356999999999854332222 12222 2233322 3567763 22348999999886 5777655
Q ss_pred C
Q psy2980 79 L 79 (92)
Q Consensus 79 ~ 79 (92)
.
T Consensus 107 ~ 107 (287)
T PF03022_consen 107 S 107 (287)
T ss_dssp G
T ss_pred C
Confidence 3
No 116
>TIGR02378 nirD_assim_sml nitrite reductase [NAD(P)H], small subunit. This model describes NirD, the small subunit of nitrite reductase [NAD(P)H] (the assimilatory nitrite reductase), which associates with NirB, the large subunit (TIGR02374). In a few bacteria such as Klebsiella pneumoniae and in Fungi, the two regions are fused.
Probab=24.50 E-value=1.1e+02 Score=15.47 Aligned_cols=7 Identities=0% Similarity=0.468 Sum_probs=2.7
Q ss_pred EeeCCCC
Q psy2980 17 CYHIAEN 23 (92)
Q Consensus 17 ~yd~~~~ 23 (92)
.||+.+.
T Consensus 73 ~Fdl~tG 79 (105)
T TIGR02378 73 NFRLEDG 79 (105)
T ss_pred EEEcCCc
Confidence 3443333
No 117
>PF07453 NUMOD1: NUMOD1 domain; InterPro: IPR010896 This helix-turn-helix-containing DNA-binding domain is found associated in homing nucleases [].
Probab=24.06 E-value=68 Score=12.84 Aligned_cols=12 Identities=25% Similarity=0.343 Sum_probs=8.9
Q ss_pred eEEEeeCCCCeE
Q psy2980 14 RVECYHIAENTW 25 (92)
Q Consensus 14 ~v~~yd~~~~~W 25 (92)
.|++||..++.-
T Consensus 2 ~V~~yd~~~~~i 13 (37)
T PF07453_consen 2 PVYVYDLNTNEI 13 (37)
T ss_pred eEEEEECCCCeE
Confidence 478888887763
No 118
>PF09816 EAF: RNA polymerase II transcription elongation factor; InterPro: IPR019194 This entry represents the N-terminal domain of ELL-associated factor (Eaf) proteins, which act as transcriptional transactivators of ELL and ELL2 RNA Polymerase II (Pol II) transcriptional elongation factors [, , , ]. Eaf proteins form a stable heterodimer complex with ELL proteins to facilitate the binding of RNA polymerase II to activate transcription elongation. ELL and EAF1 are components of Cajal bodies, which have a role in leukemogenesis []. EAF1 also has the capacity to interact with ELL1 and ELL2. The N terminus of approx 120 of EAF1 has a region of high serine, aspartic acid, and glutamic acid residues [, ].
Probab=23.89 E-value=1.2e+02 Score=15.77 Aligned_cols=28 Identities=14% Similarity=0.317 Sum_probs=19.1
Q ss_pred EEEEcCcCCCCCccceEEEEeCCCCceE
Q psy2980 47 IYVLGGEEGWDGYHDSIECYDVDNDSWE 74 (92)
Q Consensus 47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~ 74 (92)
-|+|-|.........-+.+||+.++.+.
T Consensus 65 ~~~f~G~~~~~~~~ecVLifD~~~~~f~ 92 (109)
T PF09816_consen 65 TYVFKGSQRPSKEKECVLIFDPETGEFV 92 (109)
T ss_pred cEEEEeccCCCCCcEEEEEEECCCCEEE
Confidence 4677664333444678999999997654
No 119
>PRK09965 3-phenylpropionate dioxygenase ferredoxin subunit; Provisional
Probab=23.16 E-value=1.2e+02 Score=15.46 Aligned_cols=10 Identities=30% Similarity=0.747 Sum_probs=4.5
Q ss_pred EEEECCEEEE
Q psy2980 40 IAVIDGKIYV 49 (92)
Q Consensus 40 ~~~~~~~iy~ 49 (92)
+.+.++.||+
T Consensus 90 v~v~~g~v~v 99 (106)
T PRK09965 90 VHVEGGDIFI 99 (106)
T ss_pred EEEECCEEEE
Confidence 3334445544
No 120
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=22.85 E-value=2.3e+02 Score=18.47 Aligned_cols=54 Identities=9% Similarity=0.087 Sum_probs=34.1
Q ss_pred CcceEEEeeCCCCeEEeeCCCCCCcccceEEEEC--C-EEEEEcCcCCCCCccceEEEEeCCCCc
Q psy2980 11 NTNRVECYHIAENTWEYKSPMKEKRYRPGIAVID--G-KIYVLGGEEGWDGYHDSIECYDVDNDS 72 (92)
Q Consensus 11 ~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~--~-~iy~~GG~~~~~~~~~~~~~y~~~~~~ 72 (92)
..++|+.+|..+. ..+..++.++...+++.-. . .+|+..+ ..+++..+|..+.+
T Consensus 277 ~~~~V~ViD~~t~--kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~------~s~~VsViD~~t~k 333 (352)
T TIGR02658 277 ASRFLFVVDAKTG--KRLRKIELGHEIDSINVSQDAKPLLYALST------GDKTLYIFDAETGK 333 (352)
T ss_pred CCCEEEEEECCCC--eEEEEEeCCCceeeEEECCCCCeEEEEeCC------CCCcEEEEECcCCe
Confidence 3357999998754 4555555555555555543 3 5676643 35678899987763
No 121
>cd03583 NTR_complement_C3 NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C3 plays a pivotal role in the activation of the complement systems, as all pathways (classical, alternative, and lectin) result in the processing of C3 by C3 convertase. The larger fragment, activated C3b, contains the NTR/C345C domain and binds covalently, via a reactive thioester, to cell surface carbohydrates including components of bacterial cell walls and immune aggregates. The smaller cleavage product, C3a, acts independently as a diffusible signal to mediate local inflammatory processes. The structure of C3 shows that the NTR/C345C domain is located in an exposed position relative to the rest of the molecule. The function of the domain in complement C3 is poorly understood.
Probab=22.83 E-value=1.5e+02 Score=16.54 Aligned_cols=36 Identities=19% Similarity=0.179 Sum_probs=22.4
Q ss_pred CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 44 DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 44 ~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
.++-|++-|.+......+.-+.|=..+++|-+.=|.
T Consensus 86 ~gk~YLIMG~~~~~~~~~~~~~YvL~~~TWvE~wP~ 121 (149)
T cd03583 86 EGKDYLIMGLSSDLWRIKDKYSYVIGKDTWIEYWPT 121 (149)
T ss_pred CCCEEEEEeCCCCccccCCcEEEEeCCCeEEEECCC
Confidence 466777777653222223357888899999876433
No 122
>PF12904 Collagen_bind_2: Putative collagen-binding domain of a collagenase ; InterPro: IPR024749 This domain is likely to be the collagen-binding domain of a family of bacterial collagenase enzymes. The structure of one family member, Q8A905 from SWISSPROT, has been characterised. The domain occurs in the C-terminal region of the protein.; PDB: 3KZS_D.
Probab=22.70 E-value=1.3e+02 Score=15.45 Aligned_cols=18 Identities=6% Similarity=0.143 Sum_probs=13.7
Q ss_pred ceEEEeeCCCCeEEeeCC
Q psy2980 13 NRVECYHIAENTWEYKSP 30 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~ 30 (92)
.+++.|||.+++...+..
T Consensus 49 ~~a~WfdPR~G~~~~~g~ 66 (93)
T PF12904_consen 49 VKAWWFDPRTGKYTYIGE 66 (93)
T ss_dssp EEEEEEETTT-BEEEEEE
T ss_pred eeEEEEcCCCCCEEEeee
Confidence 468999999999887754
No 123
>TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PubMed:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al.
Probab=22.55 E-value=1.4e+02 Score=16.60 Aligned_cols=23 Identities=22% Similarity=0.634 Sum_probs=16.9
Q ss_pred EEEEeCCCCceEEcCCCCCCccc
Q psy2980 63 IECYDVDNDSWEIMSHLPSARSW 85 (92)
Q Consensus 63 ~~~y~~~~~~W~~~~~~~~~~~~ 85 (92)
...+|+...+|+.+-+.+..+..
T Consensus 113 A~fr~~~~~~Wr~~~~~~~~~~~ 135 (146)
T TIGR03352 113 AAYRDIDNAEWRVVYKVPPKGKY 135 (146)
T ss_pred EEeecCCCCceEEEEecCCCCce
Confidence 34578888999998877766543
No 124
>cd03584 NTR_complement_C4 NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the component classical pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin, and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.
Probab=22.47 E-value=1.6e+02 Score=16.53 Aligned_cols=36 Identities=22% Similarity=0.370 Sum_probs=21.4
Q ss_pred CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 44 DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 44 ~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
.++-|++-|.+......+.-+.|=..+++|-+.=|.
T Consensus 90 ~gk~YLImG~~~~~~~~~~~~~Y~L~~~tWvE~wP~ 125 (153)
T cd03584 90 KGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPS 125 (153)
T ss_pred CCCEEEEEcCCCCCcCcCCcEEEEeCCCceEEECCC
Confidence 455666666543222223347888899999876433
No 125
>PF15232 DUF4585: Domain of unknown function (DUF4585)
Probab=22.43 E-value=1.1e+02 Score=15.12 Aligned_cols=15 Identities=0% Similarity=-0.115 Sum_probs=11.9
Q ss_pred EEeeCCCCeEEeeCC
Q psy2980 16 ECYHIAENTWEYKSP 30 (92)
Q Consensus 16 ~~yd~~~~~W~~~~~ 30 (92)
..||++|++.-++.-
T Consensus 32 ~lfDPETGqYVeV~i 46 (75)
T PF15232_consen 32 TLFDPETGQYVEVLI 46 (75)
T ss_pred eeecCCCCcEEEEeC
Confidence 568999999887753
No 126
>cd03548 Rieske_RO_Alpha_OMO_CARDO Rieske non-heme iron oxygenase (RO) family, 2-Oxoquinoline 8-monooxygenase (OMO) and Carbazole 1,9a-dioxygenase (CARDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OMO catalyzes the NADH-dependent oxidation of the N-heterocyclic aromatic compound 2-oxoquinoline to 8-hydroxy-2-oxoquinoline, the second step in the bacterial degradation of quinoline. OMO consists of a reductase component (OMR) and an oxygenase component (OMO) that together function to shuttle electrons from the
Probab=22.30 E-value=1.5e+02 Score=16.06 Aligned_cols=37 Identities=14% Similarity=0.132 Sum_probs=22.2
Q ss_pred EEeeCCCCeEEeeCCCCC-------CcccceEEEECCEEEEEcC
Q psy2980 16 ECYHIAENTWEYKSPMKE-------KRYRPGIAVIDGKIYVLGG 52 (92)
Q Consensus 16 ~~yd~~~~~W~~~~~~~~-------~~~~~~~~~~~~~iy~~GG 52 (92)
+.||+.+.+-..++..|. ....+.+.+.++.||+.-+
T Consensus 81 w~Fdl~tG~~~~~~~~p~~~~~~~~~L~~ypv~~~~g~V~v~~~ 124 (136)
T cd03548 81 WTYRLDDGKLVTILANPDDPLIGRTGLKTYPVEEAKGMIFVFVG 124 (136)
T ss_pred cEEeCCCccEEEcccCCCccccccCCCceEeEEEECCEEEEEeC
Confidence 778988888776543321 1223445556777777644
No 127
>KOG4499|consensus
Probab=21.89 E-value=2.2e+02 Score=17.88 Aligned_cols=39 Identities=10% Similarity=0.076 Sum_probs=24.0
Q ss_pred CcceEEEeeCCCCeEEeeCCCCCCcccceEEEEC----CEEEEEc
Q psy2980 11 NTNRVECYHIAENTWEYKSPMKEKRYRPGIAVID----GKIYVLG 51 (92)
Q Consensus 11 ~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~----~~iy~~G 51 (92)
+...|.++||.+.+-..--.+|.++.. +|.++ +.+|+..
T Consensus 231 ng~~V~~~dp~tGK~L~eiklPt~qit--sccFgGkn~d~~yvT~ 273 (310)
T KOG4499|consen 231 NGGTVQKVDPTTGKILLEIKLPTPQIT--SCCFGGKNLDILYVTT 273 (310)
T ss_pred cCcEEEEECCCCCcEEEEEEcCCCceE--EEEecCCCccEEEEEe
Confidence 445688999998876544456766654 33333 3356654
No 128
>PRK04043 tolB translocation protein TolB; Provisional
Probab=21.80 E-value=2.5e+02 Score=18.56 Aligned_cols=63 Identities=16% Similarity=0.215 Sum_probs=34.9
Q ss_pred ceEEEeeCCCCeEEeeCCCCCCcccceEEEECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
..++.+|+.+++=..+...+. ........-++ +|.+.-...+ ..+++.+|.....++++.+.+
T Consensus 213 ~~Iyv~dl~tg~~~~lt~~~g-~~~~~~~SPDG~~la~~~~~~g----~~~Iy~~dl~~g~~~~LT~~~ 276 (419)
T PRK04043 213 PTLYKYNLYTGKKEKIASSQG-MLVVSDVSKDGSKLLLTMAPKG----QPDIYLYDTNTKTLTQITNYP 276 (419)
T ss_pred CEEEEEECCCCcEEEEecCCC-cEEeeEECCCCCEEEEEEccCC----CcEEEEEECCCCcEEEcccCC
Confidence 368888888776666654221 11111122233 4544332221 468999998888888875443
No 129
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=21.16 E-value=2.5e+02 Score=18.28 Aligned_cols=33 Identities=18% Similarity=0.411 Sum_probs=23.8
Q ss_pred EECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980 42 VIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP 80 (92)
Q Consensus 42 ~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~ 80 (92)
-+++++|+. .....++..+|+.+.+.+.+...|
T Consensus 210 WhdgrLwvl------dsgtGev~~vD~~~G~~e~Va~vp 242 (335)
T TIGR03032 210 WYQGKLWLL------NSGRGELGYVDPQAGKFQPVAFLP 242 (335)
T ss_pred EeCCeEEEE------ECCCCEEEEEcCCCCcEEEEEECC
Confidence 456888887 334568888898888887776554
No 130
>PRK04922 tolB translocation protein TolB; Provisional
Probab=21.08 E-value=2.5e+02 Score=18.37 Aligned_cols=58 Identities=10% Similarity=0.097 Sum_probs=28.9
Q ss_pred eEEEeeCCCCeEEeeCCCCCCcccceEEEEC-CEEEEEcCcCCCCCccceEEEEeCCCCceEEc
Q psy2980 14 RVECYHIAENTWEYKSPMKEKRYRPGIAVID-GKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM 76 (92)
Q Consensus 14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~ 76 (92)
.++.+|..+++-..+...+.... .....-+ ..|++....++ ..+++.+|..+..-+++
T Consensus 229 ~l~~~dl~~g~~~~l~~~~g~~~-~~~~SpDG~~l~~~~s~~g----~~~Iy~~d~~~g~~~~l 287 (433)
T PRK04922 229 AIYVQDLATGQRELVASFRGING-APSFSPDGRRLALTLSRDG----NPEIYVMDLGSRQLTRL 287 (433)
T ss_pred EEEEEECCCCCEEEeccCCCCcc-CceECCCCCEEEEEEeCCC----CceEEEEECCCCCeEEC
Confidence 47888887777666654432111 1111122 34443322221 24677777776665554
No 131
>KOG2445|consensus
Probab=21.03 E-value=2.5e+02 Score=18.26 Aligned_cols=22 Identities=14% Similarity=0.238 Sum_probs=15.8
Q ss_pred CCcceEEEeeCCCCeEEeeCCC
Q psy2980 10 ENTNRVECYHIAENTWEYKSPM 31 (92)
Q Consensus 10 ~~~~~v~~yd~~~~~W~~~~~~ 31 (92)
.+...++.|+-..++|..+..+
T Consensus 198 ~~~~~Iye~~e~~rKw~kva~L 219 (361)
T KOG2445|consen 198 LNKVKIYEYNENGRKWLKVAEL 219 (361)
T ss_pred ccceEEEEecCCcceeeeehhc
Confidence 4455677788778899988654
No 132
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=20.87 E-value=2e+02 Score=17.12 Aligned_cols=68 Identities=10% Similarity=0.130 Sum_probs=40.2
Q ss_pred CCcceEEEeeCCCCeEEeeC--CCC---CCcccceEEEECCE-EEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980 10 ENTNRVECYHIAENTWEYKS--PMK---EKRYRPGIAVIDGK-IYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL 79 (92)
Q Consensus 10 ~~~~~v~~yd~~~~~W~~~~--~~~---~~~~~~~~~~~~~~-iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~ 79 (92)
.-.-+++.+|..++.|..+. +.+ .|. ...-.-+.. ++++|...+.-+....++.|++.+..-..+-+.
T Consensus 85 EgiGkIYIkn~~~~~~~~L~i~~~~~k~sPK--~i~WiDD~~L~vIIG~a~GTvS~GGnLy~~nl~tg~~~~ly~~ 158 (200)
T PF15525_consen 85 EGIGKIYIKNLNNNNWWSLQIDQNEEKYSPK--YIEWIDDNNLAVIIGYAHGTVSKGGNLYKYNLNTGNLTELYEW 158 (200)
T ss_pred ccceeEEEEecCCCceEEEEecCcccccCCc--eeEEecCCcEEEEEccccceEccCCeEEEEEccCCceeEeeec
Confidence 34456899999888886552 221 233 223333444 455553333233456799999998887776554
No 133
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=20.42 E-value=95 Score=13.21 Aligned_cols=26 Identities=23% Similarity=0.392 Sum_probs=16.8
Q ss_pred cceEEEECCEEEEEcCcCCCCCccceEEEEeCC
Q psy2980 37 RPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVD 69 (92)
Q Consensus 37 ~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~ 69 (92)
...+++.++.+|+..+. ..+..+|..
T Consensus 4 a~~v~v~g~yaYva~~~-------~Gl~IvDIS 29 (42)
T PF08309_consen 4 ARDVAVSGNYAYVADGN-------NGLVIVDIS 29 (42)
T ss_pred EEEEEEECCEEEEEeCC-------CCEEEEECC
Confidence 45577788899988443 235666663
Done!