Query psy5522
Match_columns 1397
No_of_seqs 428 out of 1892
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 19:41:25 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5522.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5522hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1366|consensus 100.0 7E-157 1E-161 1521.3 91.1 1180 1-1339 22-1433(1436)
2 COG2373 Large extracellular al 100.0 1.7E-64 3.8E-69 666.6 73.5 812 97-997 393-1269(1621)
3 cd02896 complement_C3_C4_C5 Pr 100.0 2.6E-56 5.6E-61 516.2 13.5 279 880-1279 2-287 (297)
4 cd02897 A2M_2 Proteins similar 100.0 1.4E-50 3.1E-55 469.2 15.1 280 879-1279 1-282 (292)
5 PF07678 A2M_comp: A-macroglob 100.0 1.3E-49 2.8E-54 447.5 1.5 233 931-1279 1-234 (246)
6 cd02891 A2M_like Proteins simi 100.0 2E-36 4.4E-41 350.2 13.8 270 880-1279 2-272 (282)
7 PF07703 A2M_N_2: Alpha-2-macr 99.9 8.7E-23 1.9E-27 209.5 18.3 135 419-555 1-136 (136)
8 PF00207 A2M: Alpha-2-macroglo 99.9 1.8E-22 3.9E-27 191.6 14.5 91 660-751 1-92 (92)
9 PF07677 A2M_recep: A-macroglo 99.8 7E-22 1.5E-26 186.2 5.0 49 1091-1141 1-51 (91)
10 KOG1366|consensus 99.8 4.2E-20 9.2E-25 243.4 20.6 199 954-1157 1154-1390(1436)
11 PF01835 A2M_N: MG2 domain; I 99.8 3.6E-19 7.9E-24 171.9 12.6 93 99-191 1-99 (99)
12 cd00688 ISOPREN_C2_like This g 99.7 1.4E-17 3E-22 193.2 8.9 262 881-1279 3-290 (300)
13 cd02889 SQCY Squalene cyclase 99.5 6.6E-14 1.4E-18 167.3 13.6 259 899-1279 23-322 (348)
14 PF10569 Thiol-ester_cl: Alpha 99.4 2.4E-13 5.2E-18 98.6 2.1 31 882-912 1-31 (31)
15 COG2373 Large extracellular al 99.3 1E-07 2.2E-12 129.5 50.9 282 95-394 285-599 (1621)
16 cd02892 SQCY_1 Squalene cyclas 99.1 7.5E-10 1.6E-14 141.7 14.6 118 1157-1279 475-608 (634)
17 cd02896 complement_C3_C4_C5 Pr 98.6 7.2E-08 1.6E-12 112.3 8.7 71 936-1011 214-296 (297)
18 TIGR01507 hopene_cyclase squal 98.6 9.7E-07 2.1E-11 112.2 16.8 150 1156-1353 469-633 (635)
19 PF07678 A2M_comp: A-macroglob 98.3 5E-07 1.1E-11 102.3 5.3 51 956-1011 190-243 (246)
20 cd02897 A2M_2 Proteins similar 98.0 2.8E-05 6.1E-10 90.7 10.1 54 955-1011 235-291 (292)
21 TIGR03463 osq_cycl 2,3-oxidosq 97.9 4.9E-05 1.1E-09 97.0 12.5 114 1157-1275 476-604 (634)
22 TIGR01787 squalene_cyclas squa 97.9 9.8E-05 2.1E-09 94.2 13.7 119 1156-1279 458-592 (621)
23 PF05326 SVA: Seminal vesicle 97.8 0.00017 3.6E-09 70.2 10.5 84 711-809 30-113 (124)
24 PLN03012 Camelliol C synthase 97.6 0.00017 3.8E-09 91.7 10.1 70 931-1001 517-611 (759)
25 PLN02993 lupeol synthase 97.4 0.0011 2.4E-08 84.8 13.2 114 1157-1274 589-721 (763)
26 cd00688 ISOPREN_C2_like This g 97.4 0.00031 6.6E-09 81.3 7.5 85 1175-1276 138-228 (300)
27 cd02889 SQCY Squalene cyclase 97.3 0.00067 1.5E-08 81.3 9.4 91 904-1004 222-321 (348)
28 cd02890 PTase Protein prenyltr 97.0 0.01 2.2E-07 69.1 15.0 91 902-1002 28-120 (286)
29 PF13249 Prenyltrans_2: Prenyl 97.0 0.001 2.2E-08 65.6 5.8 91 898-1000 20-112 (113)
30 PF13249 Prenyltrans_2: Prenyl 96.9 0.00057 1.2E-08 67.4 2.9 60 936-1003 4-65 (113)
31 PF07703 A2M_N_2: Alpha-2-macr 96.7 0.044 9.5E-07 56.1 14.8 122 101-238 2-128 (136)
32 cd02894 GGTase-II Geranylgeran 96.6 0.024 5.2E-07 66.0 13.9 104 889-1002 18-122 (287)
33 PRK14081 triple tyrosine motif 96.3 5.6 0.00012 50.7 54.2 417 68-544 155-608 (667)
34 COG1470 Predicted membrane pro 96.2 0.25 5.4E-06 58.7 18.7 158 707-876 259-472 (513)
35 cd02893 FTase Protein farnesyl 96.1 0.1 2.3E-06 60.9 15.3 77 917-1002 44-120 (299)
36 cd02892 SQCY_1 Squalene cyclas 95.9 0.023 5E-07 73.5 9.6 90 904-1004 507-607 (634)
37 TIGR03503 conserved hypothetic 95.7 1.1 2.4E-05 53.0 21.4 190 6-207 98-311 (374)
38 TIGR01507 hopene_cyclase squal 95.7 0.029 6.4E-07 71.9 9.3 86 906-1002 504-599 (635)
39 COG1470 Predicted membrane pro 95.6 1.1 2.3E-05 53.6 20.3 171 10-182 280-469 (513)
40 PLN02710 farnesyltranstransfer 95.5 0.2 4.3E-06 60.8 14.7 73 917-1001 89-164 (439)
41 PF13243 Prenyltrans_1: Prenyl 95.3 0.0063 1.4E-07 59.6 0.8 65 932-1002 4-68 (109)
42 PLN02993 lupeol synthase 95.1 0.037 8.1E-07 71.2 7.3 70 921-1002 591-661 (763)
43 PLN03201 RAB geranylgeranyl tr 95.1 0.2 4.2E-06 59.1 12.5 84 904-1002 43-126 (316)
44 TIGR01787 squalene_cyclas squa 95.0 0.073 1.6E-06 68.4 9.4 71 920-1002 460-531 (621)
45 TIGR03463 osq_cycl 2,3-oxidosq 94.8 0.082 1.8E-06 68.0 9.3 88 904-1001 508-604 (634)
46 PF10633 NPCBM_assoc: NPCBM-as 94.7 0.15 3.3E-06 46.7 8.4 72 725-811 1-76 (78)
47 TIGR03503 conserved hypothetic 93.8 19 0.00042 42.9 27.6 234 136-397 32-287 (374)
48 PLN03012 Camelliol C synthase 93.7 0.22 4.8E-06 64.3 9.4 88 906-1001 623-722 (759)
49 PF10670 DUF4198: Domain of un 93.1 9.3 0.0002 42.1 20.4 174 98-293 4-213 (215)
50 cd02890 PTase Protein prenyltr 93.0 0.3 6.5E-06 56.8 8.6 81 908-1000 134-215 (286)
51 cd02894 GGTase-II Geranylgeran 92.4 0.5 1.1E-05 55.0 9.3 74 915-1000 143-216 (287)
52 cd02891 A2M_like Proteins simi 92.1 0.49 1.1E-05 54.7 8.9 52 955-1011 227-281 (282)
53 PRK13211 N-acetylglucosamine-b 92.1 34 0.00073 42.5 24.5 222 60-308 158-404 (478)
54 PF01835 A2M_N: MG2 domain; I 92.1 2.9 6.4E-05 40.0 12.8 78 319-399 6-86 (99)
55 PF07919 Gryzun: Gryzun, putat 91.9 0.48 1E-05 60.6 9.2 79 712-805 468-546 (554)
56 PF07705 CARDB: CARDB; InterP 91.6 2.5 5.5E-05 40.2 11.8 76 718-812 8-85 (101)
57 PF05753 TRAP_beta: Translocon 91.1 1.9 4.2E-05 46.3 11.2 85 705-808 19-107 (181)
58 COG5029 CAL1 Prenyltransferase 90.5 0.79 1.7E-05 52.0 7.8 97 890-1002 93-196 (342)
59 cd02895 GGTase-I Geranylgerany 89.6 1.2 2.5E-05 52.5 8.8 110 888-1003 14-135 (307)
60 PF13115 YtkA: YtkA-like 89.1 3.2 6.9E-05 38.7 9.7 75 98-178 7-85 (86)
61 PF10670 DUF4198: Domain of un 89.0 1.8 4E-05 47.7 9.5 76 314-399 135-212 (215)
62 PLN03201 RAB geranylgeranyl tr 88.3 1.5 3.3E-05 51.7 8.6 73 917-1001 149-221 (316)
63 cd02895 GGTase-I Geranylgerany 88.0 1.1 2.4E-05 52.6 7.1 70 922-1000 167-236 (307)
64 PLN02710 farnesyltranstransfer 87.9 1.1 2.4E-05 54.6 7.1 67 922-1002 145-213 (439)
65 cd02893 FTase Protein farnesyl 87.9 2 4.2E-05 50.4 9.1 74 916-1001 142-216 (299)
66 smart00557 IG_FLMN Filamin-typ 87.5 3.6 7.7E-05 39.1 9.0 64 110-182 15-78 (93)
67 COG5029 CAL1 Prenyltransferase 87.5 2.2 4.8E-05 48.6 8.6 94 892-1001 151-244 (342)
68 KOG0367|consensus 87.2 1.1 2.3E-05 50.4 5.9 105 886-1001 136-249 (347)
69 PF00207 A2M: Alpha-2-macroglo 87.1 6 0.00013 37.5 10.3 75 153-234 10-90 (92)
70 smart00634 BID_1 Bacterial Ig- 85.9 8.5 0.00018 36.5 10.6 67 328-400 18-84 (92)
71 PF00630 Filamin: Filamin/ABP2 84.3 6.1 0.00013 37.8 9.1 70 110-182 18-92 (101)
72 KOG0366|consensus 83.4 3.3 7.2E-05 45.8 7.3 90 899-1002 93-184 (329)
73 PF00927 Transglut_C: Transglu 82.8 6.7 0.00014 38.3 8.7 79 717-808 3-86 (107)
74 PF13243 Prenyltrans_1: Prenyl 81.2 0.58 1.3E-05 45.7 0.6 60 922-994 48-107 (109)
75 PF02369 Big_1: Bacterial Ig-l 79.8 9.7 0.00021 36.8 8.5 69 326-401 21-91 (100)
76 PF06030 DUF916: Bacterial pro 79.5 11 0.00023 37.9 8.9 75 725-799 23-105 (121)
77 PF09315 DUF1973: Domain of un 78.8 27 0.00059 37.6 12.3 46 133-181 42-87 (179)
78 PF14545 DBB: Dof, BCAP, and B 76.0 18 0.0004 37.1 9.4 84 3-95 1-88 (142)
79 PF10633 NPCBM_assoc: NPCBM-as 75.1 15 0.00033 33.5 8.0 54 817-871 22-75 (78)
80 PF14524 Wzt_C: Wzt C-terminal 73.6 19 0.0004 36.6 9.2 73 106-180 28-107 (142)
81 PF12690 BsuPI: Intracellular 73.3 15 0.00032 34.2 7.4 64 115-178 2-82 (82)
82 PF12975 DUF3859: Domain of un 73.0 13 0.00029 37.5 7.6 58 134-191 59-125 (125)
83 PF04734 Ceramidase_alk: Neutr 72.8 16 0.00036 47.4 10.3 82 98-181 557-653 (674)
84 KOG0366|consensus 71.7 11 0.00024 42.0 7.0 80 907-1001 56-135 (329)
85 PF10648 Gmad2: Immunoglobulin 69.7 39 0.00084 31.9 9.4 51 130-181 27-81 (88)
86 PRK10301 hypothetical protein; 69.0 14 0.0003 37.3 6.7 61 132-192 59-124 (124)
87 PF13620 CarboxypepD_reg: Carb 68.5 30 0.00064 31.6 8.5 57 117-182 2-58 (82)
88 PF11589 DUF3244: Domain of un 67.9 30 0.00065 33.8 8.7 59 131-192 47-106 (106)
89 smart00634 BID_1 Bacterial Ig- 67.8 41 0.00089 31.8 9.4 80 100-182 3-84 (92)
90 KOG0497|consensus 67.7 5.7 0.00012 49.9 4.3 67 931-1002 593-660 (760)
91 PF15418 DUF4625: Domain of un 67.6 23 0.0005 36.1 8.0 74 105-182 28-117 (132)
92 KOG0365|consensus 66.5 22 0.00048 41.2 8.2 100 891-1002 139-241 (423)
93 PF13860 FlgD_ig: FlgD Ig-like 66.1 24 0.00052 32.6 7.2 50 131-180 24-77 (81)
94 PF01483 P_proprotein: Proprot 64.6 20 0.00043 33.5 6.5 51 130-180 14-70 (87)
95 PTZ00487 ceramidase; Provision 63.9 25 0.00055 45.6 9.2 73 108-181 607-692 (715)
96 PF00432 Prenyltrans: Prenyltr 63.0 6.9 0.00015 31.6 2.6 21 982-1002 3-23 (44)
97 PRK06655 flgD flagellar basal 61.5 47 0.001 37.2 9.7 75 100-180 100-178 (225)
98 PF04234 CopC: CopC domain; I 61.2 9.4 0.0002 36.6 3.7 60 132-192 33-97 (97)
99 KOG4386|consensus 60.7 17 0.00036 44.1 6.2 86 706-806 685-770 (809)
100 PF13115 YtkA: YtkA-like 59.8 87 0.0019 29.0 9.9 69 311-384 5-76 (86)
101 PF02369 Big_1: Bacterial Ig-l 59.3 1.3E+02 0.0028 29.0 11.2 78 101-182 11-90 (100)
102 PRK14081 triple tyrosine motif 56.2 5.8E+02 0.013 33.3 44.6 340 68-463 253-625 (667)
103 PRK13211 N-acetylglucosamine-b 55.4 1.1E+02 0.0024 38.1 12.3 86 26-115 338-432 (478)
104 PF00432 Prenyltrans: Prenyltr 54.6 8.5 0.00019 31.0 1.9 20 1257-1276 4-23 (44)
105 PF14326 DUF4384: Domain of un 54.5 65 0.0014 29.9 8.0 63 107-178 1-65 (83)
106 PF13199 Glyco_hydro_66: Glyco 53.9 64 0.0014 41.0 10.2 93 104-199 2-105 (559)
107 PF08400 phage_tail_N: Prophag 51.9 1E+02 0.0022 31.4 9.3 96 328-430 1-100 (134)
108 TIGR03437 Soli_cterm Solibacte 51.6 3.4E+02 0.0074 30.2 14.2 119 59-182 51-204 (215)
109 PF11614 FixG_C: IG-like fold 51.4 1.8E+02 0.0038 28.8 11.1 63 733-811 35-101 (118)
110 PF13620 CarboxypepD_reg: Carb 50.4 35 0.00077 31.0 5.6 57 332-400 2-58 (82)
111 PF00868 Transglut_N: Transglu 49.9 1.2E+02 0.0026 30.3 9.5 82 96-180 33-116 (118)
112 PRK12812 flgD flagellar basal 48.7 1.8E+02 0.0039 33.3 11.7 63 113-181 128-194 (259)
113 PF14900 DUF4493: Domain of un 46.6 4.7E+02 0.01 29.4 19.1 44 131-182 32-75 (235)
114 PF05751 FixH: FixH; InterPro 46.1 90 0.002 32.1 8.4 63 113-180 69-132 (146)
115 PRK12634 flgD flagellar basal 45.2 1.2E+02 0.0026 33.9 9.6 51 131-181 121-175 (221)
116 COG2372 CopC Uncharacterized p 42.5 79 0.0017 31.8 6.7 62 131-192 59-125 (127)
117 PRK05842 flgD flagellar basal 41.7 1.5E+02 0.0032 34.6 9.8 69 110-180 146-220 (295)
118 PF15432 Sec-ASP3: Accessory S 41.6 2.3E+02 0.0049 28.8 10.0 80 104-193 45-126 (128)
119 PF13715 DUF4480: Domain of un 41.2 2.5E+02 0.0053 25.9 9.8 52 117-182 2-54 (88)
120 PF00775 Dioxygenase_C: Dioxyg 41.1 66 0.0014 34.8 6.6 28 324-353 24-51 (183)
121 PRK12633 flgD flagellar basal 40.4 96 0.0021 34.9 7.9 51 131-181 128-182 (230)
122 KOG0367|consensus 39.6 91 0.002 35.8 7.4 55 939-1000 137-194 (347)
123 cd00421 intradiol_dioxygenase 38.8 1.1E+02 0.0023 31.9 7.5 22 325-346 7-28 (146)
124 COG1689 Uncharacterized protei 37.3 44 0.00094 36.4 4.3 41 955-1000 208-248 (274)
125 PF12584 TRAPPC10: Trafficking 36.5 2.8E+02 0.0061 28.8 10.2 70 722-806 24-105 (147)
126 PF04151 PPC: Bacterial pre-pe 35.5 1E+02 0.0023 27.3 6.0 44 133-179 25-69 (70)
127 KOG0365|consensus 35.4 85 0.0018 36.7 6.4 73 917-1001 69-141 (423)
128 PF05688 DUF824: Salmonella re 34.4 62 0.0013 26.8 3.8 31 323-355 8-38 (47)
129 COG1657 SqhC Squalene cyclase 32.7 1E+02 0.0023 38.5 7.2 111 1158-1274 352-479 (517)
130 PF01122 Cobalamin_bind: Eukar 31.5 96 0.0021 36.6 6.3 87 1174-1279 142-240 (326)
131 PF06280 DUF1034: Fn3-like dom 31.1 2.4E+02 0.0051 27.7 8.3 66 728-798 7-80 (112)
132 PF07177 Neuralized: Neuralize 30.8 1E+02 0.0022 27.7 5.0 28 98-126 29-56 (69)
133 TIGR01451 B_ant_repeat conserv 30.8 83 0.0018 26.6 4.2 37 104-144 3-41 (53)
134 KOG0497|consensus 30.4 54 0.0012 41.7 4.2 134 854-1000 572-720 (760)
135 PF07233 DUF1425: Protein of u 30.2 3E+02 0.0064 26.3 8.4 62 720-794 15-78 (94)
136 PF09608 Alph_Pro_TM: Putative 29.9 1.1E+02 0.0024 34.5 6.3 102 85-193 66-196 (236)
137 TIGR02186 alph_Pro_TM conserve 28.7 1.2E+02 0.0026 34.5 6.3 51 136-193 166-221 (261)
138 PF05751 FixH: FixH; InterPro 28.6 3E+02 0.0066 28.2 9.0 65 331-398 70-134 (146)
139 cd05822 TLP_HIUase HIUase (5-h 27.9 1.9E+02 0.0042 28.7 6.8 60 121-182 7-68 (112)
140 cd05790 S1_Rrp40 S1_Rrp40: Rrp 27.9 84 0.0018 29.6 4.1 32 106-143 47-78 (86)
141 KOG1948|consensus 27.6 1.6E+03 0.035 30.0 27.8 57 369-437 258-314 (1165)
142 PF07705 CARDB: CARDB; InterP 27.2 5.1E+02 0.011 24.0 10.6 75 422-498 9-88 (101)
143 PF01273 LBP_BPI_CETP: LBP / B 27.2 4.4E+02 0.0095 27.5 10.1 46 186-231 41-87 (164)
144 PF12863 DUF3821: Domain of un 26.2 1.9E+02 0.0041 32.0 7.1 39 109-150 102-148 (209)
145 PF11974 MG1: Alpha-2-macroglo 26.0 1.3E+02 0.0029 28.8 5.3 45 332-382 15-60 (97)
146 cd03459 3,4-PCD Protocatechuat 25.7 2E+02 0.0043 30.4 6.9 49 325-380 11-78 (158)
147 TIGR02439 catechol_proteo cate 25.4 1.7E+02 0.0037 33.9 6.9 22 325-346 124-145 (285)
148 PF05688 DUF824: Salmonella re 25.4 1.4E+02 0.0031 24.7 4.4 35 108-142 8-42 (47)
149 PF14874 PapD-like: Flagellar- 25.2 5.9E+02 0.013 24.1 11.9 68 724-809 15-85 (102)
150 cd03463 3,4-PCD_alpha Protocat 25.1 1.9E+02 0.0042 31.3 6.9 48 326-380 33-98 (185)
151 PF00576 Transthyretin: HIUase 25.1 2.1E+02 0.0046 28.4 6.5 62 121-182 7-69 (112)
152 PF01122 Cobalamin_bind: Eukar 25.1 88 0.0019 36.9 4.5 76 923-1011 214-292 (326)
153 COG4935 Regulatory P domain of 25.0 5.4E+02 0.012 27.6 9.7 53 130-182 50-110 (177)
154 PRK12813 flgD flagellar basal 24.0 2.1E+02 0.0046 32.0 7.0 48 131-181 123-174 (223)
155 TIGR01451 B_ant_repeat conserv 23.8 2.2E+02 0.0047 24.0 5.5 40 722-761 5-45 (53)
156 TIGR02438 catachol_actin catec 23.5 2E+02 0.0042 33.4 6.8 23 325-347 128-150 (281)
157 cd03458 Catechol_intradiol_dio 23.2 2E+02 0.0044 32.8 6.9 22 325-346 100-121 (256)
158 PRK15036 hydroxyisourate hydro 23.2 3.6E+02 0.0079 27.7 8.1 64 116-182 28-93 (137)
159 COG5266 CbiK ABC-type Co2+ tra 23.2 2.1E+02 0.0046 32.3 6.7 57 320-381 164-228 (264)
160 cd03462 1,2-CCD chlorocatechol 23.1 2.1E+02 0.0046 32.5 6.9 22 325-346 95-116 (247)
161 PF05506 DUF756: Domain of unk 23.0 4E+02 0.0088 24.9 7.9 58 732-810 21-78 (89)
162 cd05469 Transthyretin_like Tra 23.0 4.2E+02 0.0092 26.3 8.1 59 121-182 7-68 (113)
163 cd03460 1,2-CTD Catechol 1,2 d 22.9 2E+02 0.0043 33.4 6.8 22 325-346 120-141 (282)
164 PF14782 BBS2_C: Ciliary BBSom 22.8 2.7E+02 0.0059 34.4 8.3 81 780-871 93-173 (431)
165 TIGR02962 hdxy_isourate hydrox 22.3 4.2E+02 0.0092 26.3 8.0 60 121-182 7-68 (112)
166 COG1689 Uncharacterized protei 22.1 76 0.0016 34.7 3.0 66 921-999 7-73 (274)
167 PF07718 Coatamer_beta_C: Coat 21.9 2.6E+02 0.0056 28.8 6.6 67 714-799 57-124 (140)
168 smart00736 CADG Dystroglycan-t 21.8 5.1E+02 0.011 24.6 8.5 47 133-182 31-77 (97)
169 cd05703 S1_Rrp5_repeat_hs12_sc 21.7 1.2E+02 0.0026 27.3 3.9 18 108-125 46-63 (73)
170 PF01345 DUF11: Domain of unkn 20.8 2.1E+02 0.0046 25.7 5.4 40 721-760 33-73 (76)
171 COG2372 CopC Uncharacterized p 20.8 2.2E+02 0.0048 28.7 5.7 58 452-510 68-126 (127)
No 1
>KOG1366|consensus
Probab=100.00 E-value=6.8e-157 Score=1521.27 Aligned_cols=1180 Identities=35% Similarity=0.543 Sum_probs=925.7
Q ss_pred CEEEEeeCeeecCCeEEEEEEeecCCccEEEEEEEEeecCCCCeeEEeEEEEECCCceeEEEEEcCCCCC--ceeEEEEE
Q psy5522 1 YYTVVAPKVLRPNGEYHVAVSTQAVSEATQVSVEVGGKQDSGGEFRTKQLLSVDPFSTRIVKLDIGDLGP--GQYNLTVK 78 (1397)
Q Consensus 1 ~y~v~~P~~lr~g~~~~v~v~l~~~~~~v~v~v~L~~~~~~~~~~~~~~~~~~~~~~~~~~~f~vP~~~~--~~~~l~v~ 78 (1397)
+|+|++|+++|+|+++++|. ..+.+.+++++...+.+. .........-......|+.|++|.... ..|.....
T Consensus 22 ~y~v~~P~iir~~~~~~~~~----~~~~~~~~v~v~~~~~~~-~~~~~~~~~~~~~~~~~v~~~ip~~~~~~~~~~~~~~ 96 (1436)
T KOG1366|consen 22 TYMLLAPSIIRVESPESLLE----VQSDVDVTVSVLLRPPSN-ARSIVTDTEGIFQDSSLVTFQIPSLPPNSAEYGRSEK 96 (1436)
T ss_pred ceeeecceeEecCCCCceee----ecCCcceEEEEEecCccc-ccceeeccccccccccceeccccccCCCcceeEEeee
Confidence 59999999999999999444 344444444444332222 222333332334557899999996543 34788888
Q ss_pred ecCceeeEeeEEEEEEecccEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCce
Q psy5522 79 GSGSLNFYNSTGLVYVHKSYSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGV 158 (1397)
Q Consensus 79 g~~~~~f~~~~~v~~~~~~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi 158 (1397)
| +++.|.+++.+++..+++++|||||||||+|||||+||+|+++.+++|. + +.|+|+||+||+|+||+......|+
T Consensus 97 ~-~~l~f~~~~~l~~~~~~~~vfIqTDkpiY~pgqtV~~Rvv~~~~~~~~~-~--~~v~~~dp~~~~ikq~~~~~~~~gv 172 (1436)
T KOG1366|consen 97 G-PELEFENSTRLLFSQRRGSVFIQTDKPIYKPGQTVRYRVVTLNLKLKPF-K--VTVYIEDPEGNRIKQWLSVQSDGGV 172 (1436)
T ss_pred c-ceEEEeeeeeeeeecccceEEEecCCCccCCCCEEEEEEEEeccccccc-c--ceEEEECCCCCchhhhhhccccCce
Confidence 8 8899999999999999999999999999999999999999999999995 4 9999999999999999999999999
Q ss_pred eeeeeecCCCCCCeeEEEEEEEcCeEEEEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecCccccceEEEEE
Q psy5522 159 FSADLQLSKSPVLGDWNITINVLDQKFTKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYGKPVKGEATITA 238 (1397)
Q Consensus 159 ~~~sf~L~~~~~lG~y~I~v~~~~~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v 238 (1397)
++++|+||++|.+|.|+|+|+..++..+..++|+|||||+|||.+++|+.+++. +++.++|||+|+|||||+|.+++++
T Consensus 173 ~~~~f~ls~~~~~G~w~I~v~~~~~~~~~~~~v~eyvlp~FeV~v~~~~~~~~~-~~~~~~v~a~y~ygkpV~g~~~v~~ 251 (1436)
T KOG1366|consen 173 YSLSFQLSSEPILGDWKIRVQYKNGLQTKSSTVEEYVLPKFEVNVEPPSIITIN-KELEVTVCARYTYGKPVTGKATVSF 251 (1436)
T ss_pred EEeeecccCCCcCCceEEEEEEcCCCccccceEEEEEecCceEeeecCcceeec-ceeEEEEeeeeecCCcccceEEEEE
Confidence 999999999999999999999988555555599999999999999999998887 9999999999999999999999999
Q ss_pred EeccccCcc----ccccccceeEEeecCceEEEEEechhhhcccccc---ceEEEEEEEEEECCcceEEeeEEEEEEEEe
Q psy5522 239 YPTIFSGVI----QPLFQTPVRKVVPIDGKTVIEFDVVKELQLTDEY---ERNIHFDVAVEEALTGRRQNNTGSVVFHKH 311 (1397)
Q Consensus 239 ~~~~~~g~~----~~~~~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~---~~~l~V~a~VtE~gTG~~~~~~~~i~i~~~ 311 (1397)
++....+.. ....|.....+...++|.....+... +...... .....+.+..+..-++...+.....+..+.
T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~i~~v~s~~~~~~~~~~~~~~~~~~~ 330 (1436)
T KOG1366|consen 252 GLLDSDYKKCQPKIQSNCKNFEQQLNVEGCKVLNVDTQN-LQLYYNFEEGAEPVGITAVRSPFTTDIEETPLHLVPGGPA 330 (1436)
T ss_pred ecccccccccccchhccccccccccceeeEEEEecCchh-heeehhhhccCcceeEEEEcccchhhhhhcceeecCCcce
Confidence 987432221 12234444555666677766655544 3332111 124555565565555666666667777788
Q ss_pred eeEEEEEecCCcccCCCeEEEEEEEEccCCCccCCC-CceEEEEEeeeccccccceeeEeeCCCeeEEEE----------
Q psy5522 312 KYKMDLIKSSEYFKPGLKYTAYMKLTHHDGTPVTDN-NNMVQVRHGFSYDESKYEANQYKLDRNGMIKLV---------- 380 (1397)
Q Consensus 312 ~~~i~f~~~~~~~kpG~p~~~~v~v~~~dG~P~~~~-~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~---------- 380 (1397)
..++.+.+....+|+|+|+.+++++.+.+|.|+... .....+. .....+++++..+...+.
T Consensus 331 ~~~v~v~~~~~~~k~gvp~~v~v~v~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 402 (1436)
T KOG1366|consen 331 ELKVSVEDEDDSFKPGVPFTVQVKVVSKDGVPLDLDLPKEVQVD--------TSVQRPITTDRSKLVESNETMLATQNPS 402 (1436)
T ss_pred eeeEEEeecCCcccCCcceEEEEEEecCCCcccccccccccccc--------ceEEEeeeccccccceeceeeccccCCc
Confidence 888999999999999999999999999999998732 1111111 111222223222222221
Q ss_pred -----EeC-CCCCceeEEEEEEEecCcceEEEEEEeecCCCCCeEEEEecCCCCCCCCeEEEEEEecC--------Ccce
Q psy5522 381 -----YYP-PANENVTTLGIEAEYLDIKEWFSTISASESPSNSFIQAALLTQNPKVNKDVELEINSTA--------PLKY 446 (1397)
Q Consensus 381 -----i~~-~~~~~~~~l~l~a~~~~~~~~~~~v~~~~S~s~s~L~i~~~~~~~~~G~~~~v~v~s~~--------~~~~ 446 (1397)
... +.+... +....+.. ..+.+......+...++++.......+-++-...+.+.+.. ....
T Consensus 403 ~~~~~~~~~~~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (1436)
T KOG1366|consen 403 LTLVRSLAPPSECPP--FLSDDETD--NRAVRLFRKTFNIVQSYIEKAKNSYKLYLQLNKGVKVLSGLGKRVTGILKSVS 478 (1436)
T ss_pred ceeeEeeccCCCCcc--eEEeeecc--ccccccccccccccccccccccccceeeccccccceEEeeccceeeeEEEEec
Confidence 111 111100 10000000 01111122222333344444333333333332222222110 0012
Q ss_pred EEEEEEeCCeEEEE----EEEEecCCceeEEEEEeeCCCCCceeEEEEEEEecCCeEEEEeEEEEEcccccceeeEEecC
Q psy5522 447 ISYQVLGRGDVIMA----DTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVREDGEVVADGLDLELEGGLQNFVSANVSP 522 (1397)
Q Consensus 447 ~~ylV~srG~Iv~~----~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~gevvads~~~~V~~~f~nkV~l~~s~ 522 (1397)
..|.++.++-++.. ..+..............++-.-.|.+++..+++.++|+++.|+..|.++.+++|+|.+.+++
T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~~~~~n~v~l~~~~ 558 (1436)
T KOG1366|consen 479 PLYHFTYSGMIVFTITHGSHMVARVPTETVNDSACVKVLDKPVGLLVSFGIDSSGEVVGDSLKFKVEGDLKNKVALVANK 558 (1436)
T ss_pred ceeEEeeccceeEEEEEeeEEecccceeeeccceeEEeeeeceeeEEEEEeccccccccceeEEEeeccccceEEEEEcc
Confidence 33444444444333 33322222345566667777777877888889999999999999999999999999999999
Q ss_pred CCCCCCceEEEEEEecCCcEEEEEEEeeecccccCCCCCChhHHhHhccccCCCCCCCCccccccccccC----------
Q psy5522 523 DETEPGSNIQINLEAKPNSYIGLLAVDQKVLLLKTGNDIGKEDVMRELRSYDETDTSKLPLVENLRERYP---------- 592 (1397)
Q Consensus 523 ~~~~Pg~~v~l~V~a~p~S~v~L~aVD~sVllL~~~~~lt~~~v~~~l~~y~~~~~~~~~~~~~~~~~~p---------- 592 (1397)
....||+++.+++...|+|+||+++||+|+++|+..|+++.+++...|..|+...... .. .+++|
T Consensus 559 ~~~~~~~~~~~~~~~~~~s~vgl~~vd~s~~~m~~~n~~~~~~~~~~l~~~~~~~~~~----~~-~~~~~~~~~~~~~~~ 633 (1436)
T KOG1366|consen 559 SAKEPGNNVALEVTDSPDSFVGLSAVDQSVLLMDAANDLTLEGVTFLLSEYDLTNQGG----KS-RGRNPGELVNKKKVT 633 (1436)
T ss_pred cccCCCCcceeeecCCCceeecchhhhhhhhhcccccchhhhcceeeecccccccccc----Cc-cccCchhhhhhhhhh
Confidence 9999999999999999999999999999999999999999999999999998665311 00 11111
Q ss_pred CCcchhhHhhhcCeEEEecccccccCcccccccC---------CCCCCCcccchhhccccccccccccceecccCcceeE
Q psy5522 593 GSFTAQATFEKAGAIVMTNGYVHERNPWVYYKSL---------NDPPDDMLDGEEQLLSQVTTSVTQLTVRKHFPETWLF 663 (1397)
Q Consensus 593 ~~~d~~~~f~~~Gl~v~Tn~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~vR~~FpETwiW 663 (1397)
.+.+.+..++.+|+.+++|..+..+..+..+... .....++..........+......++||++|||||||
T Consensus 634 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~VRk~FPEtWlw 713 (1436)
T KOG1366|consen 634 NAKYKYSTSKGDGLRFFCNPGIRSPSDASRRILVGRECSSVRRDLSGAGMPILAHALTNEDLSVPSKPRVRKHFPETWLW 713 (1436)
T ss_pred cccchhhccccccceeeecCCccchhhhhhhccccccceeeeccccCCCcchheeecccccccCCCCcchhccCCcceee
Confidence 2356678888999999999888765443211000 0001111110000000011234678999999999999
Q ss_pred EEEEeccCceEEEEEEcCCccccEEEEEEEEeCCCceeeeeCCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCc
Q psy5522 664 QMEETGFDGKVMVNEKVPDSITSWVLSAFSVDSLYGLGLMDMPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQ 743 (1397)
Q Consensus 664 ~~~~~~~~G~a~v~~tvPDtiT~W~~~Afa~s~~~G~G~a~~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~ 743 (1397)
+.+.++.. ..++.+++||+||+|.++|||+|++.|+|++ ++.++++||||||+++|||||+|||+|.|+++||||+++
T Consensus 714 ~~~~v~~~-~~~~~v~~PDtiT~W~~~~f~ls~~~Gl~v~-~~~~l~~fQpfFi~l~lPySV~RgE~i~l~~tv~NYl~k 791 (1436)
T KOG1366|consen 714 DDVNVGRR-VKELTVTVPDTITTWVASGFSLSEDKGLGVA-PTTSLRVFQPFFIELNLPYSVRRGEQIALRVTVFNYLTK 791 (1436)
T ss_pred eeEecCCc-eEEEEEEcCCccceEEEEEEEEcCCceeEec-CCceEEEeeceeEEecCceeEEeCcEeEEeEEEecccCc
Confidence 99999888 9999999999999999999999999999999 577799999999999999999999999999999999999
Q ss_pred cEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEeceeeeEEEEEEEEeCCCCceeEEEE
Q psy5522 744 DLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPKELGYIGIKVTATSNLAGDSMEGKL 823 (1397)
Q Consensus 744 ~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~~lG~v~itV~A~~~~~~D~v~k~L 823 (1397)
+++|.|.+..+++++....+..+..... .....+.....++++.+.+.|+|+|..+|.++++++|.+..++|+|+|+|
T Consensus 792 ~~~v~V~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~s~~~~~~~v~p~~~g~~~~~~~~~t~~~~dav~k~l 869 (1436)
T KOG1366|consen 792 ELDVSVILLSSEDFCYDATSKANEQVGV--SQRSVQVSRVTPAQSGKLVYFPVRPGVIGLILVTISALTLLGGDAVHKTL 869 (1436)
T ss_pred ceEEEEEEccCCCeeeecccccCccccc--ccceEEEEEEcccccceEEEEEeccccccccceEEEEeecCCCccEEEEE
Confidence 9999999999998887654433221110 01222334455699999999999999999999999999999999999999
Q ss_pred EecCCCeeeEEEEEEEEecCCCceeEEEEEecCCCCcCCCCcEEEEEEEcccccCcccCcchhhccCCCchhhhhcccch
Q psy5522 824 LVKPEGETQYKNKAIFVDLRKNKTFSVNVTLDMPKNIVPGSEHVEVSAVGDLLGPSIPNLANLIKMPFGCGEQNMLNFVP 903 (1397)
Q Consensus 824 ~V~peG~~~~~t~s~~l~~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv~gd~l~~~~~~l~~Ll~~P~GCgEQ~m~~~~p 903 (1397)
+|+|||++++..++.+||++. +.++++++.+|+++|+||+++++++.||++|++++|+++|++||||||||||+.|+|
T Consensus 870 ~V~~eG~~~~~~~~~~i~l~~--~~~~~~~~~~p~~~V~~S~~~~~~~vgD~lGps~~~l~~Ll~lP~GcGEQnMi~faP 947 (1436)
T KOG1366|consen 870 LVEPEGITQERNFSVLIDLSV--SASKELSLGVPNDVVPGSERARISVVGDVLGPSMNNLSNLLRLPYGCGEQNMINFAP 947 (1436)
T ss_pred EEeccCccceeeeeEEeccCC--CcCcceeecCCCCcCccccceeeEEEecccccChhhhHHhccCCCCcccceeecccc
Confidence 999999999999999999874 578999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHH
Q psy5522 904 NIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVIL 983 (1397)
Q Consensus 904 ~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~ 983 (1397)
|||+++||+.++|++++++++|+++|++||||+|+|+|+|||||+|+..+.++++||||||+++|.+|+++|.||+..+.
T Consensus 948 ni~v~~YL~~t~q~~~~~k~ka~~~l~~GyqrqL~yk~~DgSySaFg~~~~~~stWLtafvlr~f~~a~~~i~id~~~i~ 1027 (1436)
T KOG1366|consen 948 NIYVLKYLPKTNQLTPELKRKALKFLEQGYQRQLTYKRADGSYSAFGSSDRSGSTWLTAFVLRVFSQAKEYIFIDPNVIT 1027 (1436)
T ss_pred hhhHHHHHhhhhccChhHHHHHHHHHHHHHHHHHhhhccCCChhhhcCCCCcccHHHHHHHHHHhhhccCceEecHHHHH
Confidence 99999999999999999999999999999999999999999999999977899999999999999999999999999999
Q ss_pred HHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccceeEEEEeeeeEEEEEEEEEEeeccCCCCCcee
Q psy5522 984 EALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRAVNMTATGSGFAVVYISYQYNINVTGAWPMFT 1063 (1397)
Q Consensus 984 ~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~ 1063 (1397)
++++||.++|+.+|+|.+..+|++..+++++...+.+++....
T Consensus 1028 ~a~~wl~~~Qk~~GsF~e~~~v~~~~~qg~l~~~~~l~~~~l~------------------------------------- 1070 (1436)
T KOG1366|consen 1028 QALNWLSQQQKENGSFKEVGEVLHNEMQGGLQDIVALTAIVLS------------------------------------- 1070 (1436)
T ss_pred HHHHHHHHhhccCceEeccccccchhhhcCCCCcceeeeEeee-------------------------------------
Confidence 9999999999999999999999999999887666677776544
Q ss_pred eccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchhhcccCceeEEEEeCCeEEEEEEeeecCCcc
Q psy5522 1064 LDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQNVKRVETKNGNTMVVLYFAYHQVLP 1143 (1397)
Q Consensus 1064 l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~ 1143 (1397)
+++++ +
T Consensus 1071 ------------------------------~~~~~-----------------------------~--------------- 1076 (1436)
T KOG1366|consen 1071 ------------------------------VILDV-----------------------------D--------------- 1076 (1436)
T ss_pred ------------------------------eeccc-----------------------------h---------------
Confidence 55555 1
Q ss_pred ceeeecccccccchhhHHHHHHHHHhhcCCC-CchHHHHHHHHHHhhCCCCchHHHHHHHhhcccccCC-cccccccCCC
Q psy5522 1144 WTLMHYLVSKYPRMNTINKAVDYIVKNLAGT-EDAYAIAICSYALHLAQHPVKDVAFNLLESKAHNEDG-KKWWKRAERP 1221 (1397)
Q Consensus 1144 ~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~-~~~y~~al~aYal~la~~~~~~~~~~~L~~~a~~~~~-~~~W~~~~~~ 1221 (1397)
...+.+|..|+++.++.. .++|+.|++||||.+++++.+..++.+|+.+|+.+++ ..||..+.+.
T Consensus 1077 -------------~~~~~~a~~~~~~~~~~~~~~~~~~a~~ayAl~l~~sp~a~~~~~~l~~~a~~~~d~~~~~~~~~~~ 1143 (1436)
T KOG1366|consen 1077 -------------NASIQKAVAYLEESLDSGSMDVYTVAITAYALQLAKSPQAAKALAKLKSLARVEGDRRYWWASALKA 1143 (1436)
T ss_pred -------------hhHHHHHHHHHhhccCccccccchHHHHHHHHHhccCchHHHHHHhhhhhhcccCCceeeeeccccc
Confidence 135788999999999865 7899999999999999999999988889999999877 5666665443
Q ss_pred CCCCCCCCCCCCcccchhhHHHHHHhhccCCcccchhhhhhhhhcccCCCCccccccC----------------------
Q psy5522 1222 EDKKNPWAQVPNSVDVEMTSYALLSYLDRGLVEDTLPILTWLVTQQNDQGGFASTQMS---------------------- 1279 (1397)
Q Consensus 1222 ~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~Wl~~q~n~~Ggf~stQdT---------------------- 1279 (1397)
.+... |.+++++++|||||||||+++...+..++.||+|||++|||..|||+|||||
T Consensus 1144 ~~~~~-~~~q~~s~~VE~tsYaLL~~~~~~~~~~~~pivrWl~~qr~~~GGf~STQdTvvalqaLs~y~~~~~~~~~~~~ 1222 (1436)
T KOG1366|consen 1144 KNAVK-YSPQARSIDVETTAYALLAYLLLAQVDYALPIVRWLVEQRNALGGFSSTQDTVVALQALSEYAALSHTETLSLQ 1222 (1436)
T ss_pred cCccc-CCCccchhhccchHHHHHHHHHhcccCcCchhHhhhhhhhcccCceeehHHHHHHHHHHHHhhhhcccceeEEE
Confidence 33212 5678899999999999999999888889999999999999999999999999
Q ss_pred --------------------------------------------------------------------------------
Q psy5522 1280 -------------------------------------------------------------------------------- 1279 (1397)
Q Consensus 1280 -------------------------------------------------------------------------------- 1279 (1397)
T Consensus 1223 V~i~~~~~~~~~~~~~~~~~N~lllQ~~~~p~~~~~v~~ta~G~g~~~~q~~~~Ynv~~~~~~~~f~l~~~~~~~~~~~~ 1302 (1436)
T KOG1366|consen 1223 VTIQVPGKSSGTSQVDSTPNNLLLLQSVSLPSLHKTVCFTAKGQGCATVQISYTYNLLTPKSDPRFFLIVSTKPEPVEFD 1302 (1436)
T ss_pred EEEEecccCCCceeeeccccchhhcceeecccCCCceEEEEecceeEEEEeeEEEeecCcccCCCceeeeeeccccccCc
Confidence
Q ss_pred -----------------------------------------------------------------------CCcccccee
Q psy5522 1280 -----------------------------------------------------------------------KKELCPTVS 1288 (1397)
Q Consensus 1280 -----------------------------------------------------------------------~~~~c~~~~ 1288 (1397)
++..|+.|+
T Consensus 1303 ~s~~~~~~~~~~~~~~~~~~SnMAIldVklpSGf~~~~~sl~~le~~~~v~~~~~~~~~~~v~~Y~D~v~~~~~~cv~F~ 1382 (1436)
T KOG1366|consen 1303 LSNIVVVYTTCCERTGSSGRSNMAILDVKLPSGFRADHESLGLLEQLEDVYRVEVENDHRRVLIYFDSVSGNETFCVRFE 1382 (1436)
T ss_pred cccceEEEEEEEeecCCCCCCCceeeeeecccCcccChhhHHHhhccceEEEEEEecCCcceEEEeeccCCCCceEEEEE
Confidence 567799999
Q ss_pred ccccccccccCceeEEEeeccccchhheeeeccccccccccccCCCCCCcc
Q psy5522 1289 AFRTHKVAQQKPVPVSIYDYYDQSRRARMFYEPRVATLCDICEGEDCSKDT 1339 (1397)
Q Consensus 1289 ~~~~~~v~~~~p~~v~~ydyy~~~~~~~~~y~~~~~~~c~~c~~~~c~~~~ 1339 (1397)
|.|.++|+++|||+|+|||||+|+++|++||+++..++||+|++++|...|
T Consensus 1383 a~~~~~V~~~kpa~v~VYDYY~pee~~t~~Y~~~~~~~~~~c~~~~c~~~c 1433 (1436)
T KOG1366|consen 1383 AVQSFPVANLKPASVTVYDYYEPEEQATRFYNAESSSSCDVCEGAQCKCPC 1433 (1436)
T ss_pred EEEEeehhccCCceEEEeeccCchhhhhhhhcccccchhhhcccccCCcCC
Confidence 999999999999999999999999999999999888899999999998755
No 2
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=100.00 E-value=1.7e-64 Score=666.61 Aligned_cols=812 Identities=18% Similarity=0.239 Sum_probs=611.9
Q ss_pred ccEEEEEcCCCcccCCceeEEEEEEEcCCCc-cCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEE
Q psy5522 97 SYSVFIQLDKAIYKPGDLVRFRAIVLNSHLK-PSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWN 175 (1397)
Q Consensus 97 ~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~-P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~ 175 (1397)
...+|++|||+||||||+||+.++.++.+-+ +..+.++.+.+.+|+|..+++-.....+.|..+++|+||++++.|.|+
T Consensus 393 ~~k~y~ftDRglYRpGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~ 472 (1621)
T COG2373 393 GLKVYLFTDRGLYRPGETVHVNALLRDFDGKTALDNQPLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYT 472 (1621)
T ss_pred ceEEEEecCcccCCCCceeeeeeeehhhcccccccCCCeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEE
Confidence 5789999999999999999999999999887 667899999999999999999988889999999999999999999999
Q ss_pred EEEEEcC--eEEEEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecCccccc-eEEEEEEec-----------
Q psy5522 176 ITINVLD--QKFTKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYGKPVKG-EATITAYPT----------- 241 (1397)
Q Consensus 176 I~v~~~~--~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyGkPV~G-~v~v~v~~~----------- 241 (1397)
|++...+ ...+.+|+|+||++|+|+|.++..+..+..++.+.++|.++|+||.|++| +++-.+..+
T Consensus 473 l~~~~~~~~~~~s~~f~V~df~p~r~~i~l~~~k~~~~~g~~v~~~v~~~yL~GaPa~g~~~~~~l~lr~~~~~~~~~~~ 552 (1621)
T COG2373 473 LELYTGGKSAVISMSFRVEDFIPDRFKINLTLDKTEWVPGKDVKIKVDLRYLYGAPAAGLTVQGELDLRPTRFSVPGFPG 552 (1621)
T ss_pred EEEEeCCccceeeeeEEhhHhCCceEEEecccccccccCCCcEEEEEEEEecCCCcccCceeeeEEEeecccccccCCcc
Confidence 9999987 68899999999999999999999998888999999999999999999998 455323221
Q ss_pred cccCccccc-----cccce-eEEeecCceEEEEEechhhhccccccceEEEEEEE-EEECCcceEEeeEEEEEEEEeeeE
Q psy5522 242 IFSGVIQPL-----FQTPV-RKVVPIDGKTVIEFDVVKELQLTDEYERNIHFDVA-VEEALTGRRQNNTGSVVFHKHKYK 314 (1397)
Q Consensus 242 ~~~g~~~~~-----~~~~~-~~~~d~~G~~~~~~~~~~~~~~~~~~~~~l~V~a~-VtE~gTG~~~~~~~~i~i~~~~~~ 314 (1397)
+.||...++ +.+.. ...+|.+|++.+.+++.+ .. ..+.+.+.+. +.| +.|+.++.+.++.|.....-
T Consensus 553 ~~fg~~~~~~~~~~~~~~~~~~~~D~~G~a~~~l~~~~-~~----s~~~~~~~~~~~~~-~ggR~v~~s~~~~v~p~~~l 626 (1621)
T COG2373 553 FTFGLADEEPNSLTFEEELELTVTDGKGKASLDLDLAE-TP----SPYQLLVASSEVEE-DGGRAVTRSLSVRVSPLDEL 626 (1621)
T ss_pred eeeeccccccccccccccccccccCCCCcEeEeccccc-CC----CceEEEeecceeec-cCCceeccceeEEEEcCcce
Confidence 122322111 11111 256889999999998866 21 1233333333 545 55999999999999999988
Q ss_pred EEEEe--cCCcccCCCeEEEEEEEEccCCCccCCCCceEEEE-E--eeecc--cc------------ccceeeEeeCCCe
Q psy5522 315 MDLIK--SSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVR-H--GFSYD--ES------------KYEANQYKLDRNG 375 (1397)
Q Consensus 315 i~f~~--~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~-~--~~~~~--~~------------~~~~~~~ttd~~G 375 (1397)
+.+.. ...+.+-..+..+.+.+.+++|+.++..+..+.+. . .+.|. +. .....+..+..++
T Consensus 627 iGvr~~f~~~~~~~~~~~~~~~i~v~~~g~~~a~~~l~~~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 706 (1621)
T COG2373 627 IGVRPKFAADYVKDITPAKFSIIAVSADGSKVALKGLKLVLIEVKRNYNWTYSEGESWKYESVEIETVILEDTLDISANE 706 (1621)
T ss_pred eeeecccchhhhcccCceEEEEEEecccccccccCCceeEEEEEeecceEEEecCcceecccccchhhhcccceeecCCC
Confidence 88776 34555555889999999999999998654444422 1 11221 11 0112223456777
Q ss_pred eEEEEEeCCCCCceeEEEEEEEecC-cceE--EEEEEe-e---cCCCCCeEEEEecCCCCCCCCeEEEEEEecCCcceEE
Q psy5522 376 MIKLVYYPPANENVTTLGIEAEYLD-IKEW--FSTISA-S---ESPSNSFIQAALLTQNPKVNKDVELEINSTAPLKYIS 448 (1397)
Q Consensus 376 ~~~f~i~~~~~~~~~~l~l~a~~~~-~~~~--~~~v~~-~---~S~s~s~L~i~~~~~~~~~G~~~~v~v~s~~~~~~~~ 448 (1397)
.+.|++++. +|.|+|.|+ +.+ ...+ .|.+.. + +..++..|.|..+++.|++||++++.|+.+.+.. -.
T Consensus 707 ~~~it~p~~--~G~Yrl~v~--~~d~~~~~~~~f~v~~~~~~~~~~~~d~l~i~ldK~~Y~~Gdtakl~I~~p~ag~-~~ 781 (1621)
T COG2373 707 TTIITVPVG--WGDYRLEVE--DSDGGILASVSFDVGGSGNDTVTGTPDVLKLKLDKPEYQPGDTAKLLITTPYAGE-EA 781 (1621)
T ss_pred ceEEEeccc--CCceEEEEE--cCCCCeEEEEEEeccccccccccCCccceEEecCccccCCCceEEEEeccCCCCc-ce
Confidence 788888664 477666555 444 2211 222211 1 1223678999999999999999999999999886 67
Q ss_pred EEEEeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEecCCeE--------EEEeEEEEEcccccceeeEEe
Q psy5522 449 YQVLGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVREDGEV--------VADGLDLELEGGLQNFVSANV 520 (1397)
Q Consensus 449 ylV~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~gev--------vads~~~~V~~~f~nkV~l~~ 520 (1397)
++++++.+++....+.+.. ....+++++++.|.+.+++.|-.++| |+. -...++|.|... .+++.|++
T Consensus 782 litve~Dk~~~~~~~~v~~--~~~~~~i~v~~~~~~~~yV~v~f~r~-~~~~~~~~~~~~yG~~~~~v~~~-~r~L~v~l 857 (1621)
T COG2373 782 LITVESDKVIQVQLLKVPA--NGLTLEIPVPAAWEHNAYVSVSFVRP-GDASQSFTPARAYGFVPLPVDVK-DRRLDVQL 857 (1621)
T ss_pred eEEEecccceeeEEEecCC--CCceEEEEccHHhcCCeEEEEEEEcC-CChhhhccCccceEEEecccCCc-cceeEEEe
Confidence 7888999999999888754 44689999999999999999999985 322 134567888766 67888887
Q ss_pred c-CCCCCCCceEEEEEEecC-----CcEEEEEEEeeecccccCCCCCChhHHhHhccccCCCCCCCCccccccccccCCC
Q psy5522 521 S-PDETEPGSNIQINLEAKP-----NSYIGLLAVDQKVLLLKTGNDIGKEDVMRELRSYDETDTSKLPLVENLRERYPGS 594 (1397)
Q Consensus 521 s-~~~~~Pg~~v~l~V~a~p-----~S~v~L~aVD~sVllL~~~~~lt~~~v~~~l~~y~~~~~~~~~~~~~~~~~~p~~ 594 (1397)
. +.+.+||+.++++|+..- .+.+.+.|||++||.|.+....++.+.|-.-... +
T Consensus 858 ~~p~k~~Pg~~l~i~l~~~~~~~~~~a~vtV~aVDegIl~lt~y~~Pdp~~~ff~~r~l--------------------~ 917 (1621)
T COG2373 858 KTPKKVKPGEPLTIKLSVSGAGGPKSAYVTVAAVDEGILNLTGYEAPDPVKFFFGQRRL--------------------G 917 (1621)
T ss_pred cCccccCCCCeeEEEEEEcCCCCCcceEEEEEEeeeehhhccCCCCCCHHHhhhccccc--------------------c
Confidence 6 678889999999999643 7999999999999999886655554443111110 1
Q ss_pred cchhhHhhhcCeEEEecccccccCcccccccCCCCCCCcccchhhccccccccccccceecccCcceeEEE-EEeccCce
Q psy5522 595 FTAQATFEKAGAIVMTNGYVHERNPWVYYKSLNDPPDDMLDGEEQLLSQVTTSVTQLTVRKHFPETWLFQM-EETGFDGK 673 (1397)
Q Consensus 595 ~d~~~~f~~~Gl~v~Tn~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vR~~FpETwiW~~-~~~~~~G~ 673 (1397)
.+.+++ .|..+--. .. .+...+.+|..+. ......+..|+ |..+++|.. +.+|++|+
T Consensus 918 v~t~di---~~~ii~~~-------~~---~~~~~~~Ggd~~~--------~~~~~~~p~r~-~~~va~~~g~v~~d~~G~ 975 (1621)
T COG2373 918 VETRDI---YGRIIDGF-------QG---VAKPLRTGGDGGR--------AAKQGEPPTRR-EKPVAAFSGIVETDAQGK 975 (1621)
T ss_pred eEeeec---cccccccc-------cc---cccccccCCCccc--------cccccCCCccc-ccchhhhccceEEcCCCc
Confidence 111111 11110000 00 0000111111100 00112334555 666677775 56999999
Q ss_pred EEEEEEcCCccccEEEEEEEEeCCCceeeeeCCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEee
Q psy5522 674 VMVNEKVPDSITSWVLSAFSVDSLYGLGLMDMPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLEN 753 (1397)
Q Consensus 674 a~v~~tvPDtiT~W~~~Afa~s~~~G~G~a~~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~ 753 (1397)
++|+|.+||..|+||+||+|++ +..+|-++ ++++|++|+.++++||+|++.||++.+.+.|||..++...+ +..
T Consensus 976 a~vt~~iPdfng~~RvmavA~t-~~~vG~~~--~~~~v~~p~i~~~s~PrflapGD~~~~~l~v~N~~~~~g~~---~~~ 1049 (1621)
T COG2373 976 ATVTFPIPDFNGRLRVMAVAWT-DGAVGQAE--AELIVRKPVIVEASMPRFLAPGDRSSAGLDVFNLTGKAGDV---LTV 1049 (1621)
T ss_pred EEEEEecCCcCceEEEEEEEec-Cccccccc--eeEEecCCeEEecCCCceecCCccceeEEEEeccCCCCcce---eec
Confidence 9999999999999999999999 67999994 99999999999999999999999999999999999999873 333
Q ss_pred cCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEece-eeeEEEEEEEEeCCCCceeEEEEEecCCCeee
Q psy5522 754 VGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPKE-LGYIGIKVTATSNLAGDSMEGKLLVKPEGETQ 832 (1397)
Q Consensus 754 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~~-lG~v~itV~A~~~~~~D~v~k~L~V~peG~~~ 832 (1397)
+..+.+.. ....+++.+..+++.++.|++.+.. .|.+.++++... ..+|.+++.+.|.|...++
T Consensus 1050 ~~~l~~~~--------------~~~~~~~~l~~g~~~~l~~~l~a~~g~g~~~~~~~~~~-~~~~~~~~~~~vr~~~~~~ 1114 (1621)
T COG2373 1050 EGTLKFNG--------------KEAPQTLTLAEGSATTLFFPLRALDGSGKIDATLSVSG-QVGDKLERHLNVRPAVPPQ 1114 (1621)
T ss_pred cceecccc--------------cccceEEEecCCCceEEEEEeeccCCCceeeEEEEecC-cccceeeeeeccccCccce
Confidence 33343321 1246899999999999999999986 555554444432 2689999999999999999
Q ss_pred EEEEEEEEecCCCceeEEEEEecCCCCcCCCCcEEEEEEEcccccCcccCcchhhccCCCchhhhhcccchHHHHHHHHH
Q psy5522 833 YKNKAIFVDLRKNKTFSVNVTLDMPKNIVPGSEHVEVSAVGDLLGPSIPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLK 912 (1397)
Q Consensus 833 ~~t~s~~l~~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv~gd~l~~~~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~ 912 (1397)
+.++...+.+. ..++..++.++.+++..+.+++++.+....-..++.|.+|||||.||+.|++.|.+|+.....
T Consensus 1115 ~~~~~~~l~~~------~~~~~~~l~~l~~~~~~~~l~ls~~~~~~~~~~~~~L~~YPygc~EQt~S~~~pll~~~~~~~ 1188 (1621)
T COG2373 1115 TQVKTVVLAPG------FTISGDLLAGLSPQAVNAQLQLSGTPAADIAALLDVLDDYPYGCAEQTASRLLPLLYAQKATA 1188 (1621)
T ss_pred eeeEEEecccc------cccCchhhhhcCccccceEEEeccCccccHHHHHHHHHhCCccchhhhhhhHHHHHhhhhhhc
Confidence 99888877642 455677888999999999999999988877778999999999999999999999999887665
Q ss_pred HhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCC-CccCHHHHHHHHHHHHh
Q psy5522 913 KTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASH-TTIDESVILEALAWLSS 991 (1397)
Q Consensus 913 ~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~-i~vd~~~~~~~~~WL~~ 991 (1397)
.. ....++....|+..+.|++.+|..+|+|++|++. .+++.|+||||..++++|+.. +.++...+.++..|+.+
T Consensus 1189 ~~----~~~~~~~~~~l~~a~~rL~~~Q~~~G~F~~W~~~-~~~d~~ltaYa~~Fl~~A~e~g~~vp~~~~~~~~~~~~~ 1263 (1621)
T COG2373 1189 DP----GAADNDLRARLQDAIGRLLSLQGSNGAFGLWGGN-GSGDPWLTAYAVDFLLRAREQGYSVPSDALNQMLERLLE 1263 (1621)
T ss_pred cc----cccchhHHHHHHHHHHHHHhhhhcCCceeecCCC-CCcchhhhHHHHHHHhhhhhcCcCCCHHHHHHHHHHHHH
Confidence 44 1223344445667788999999999999999886 689999999999999999877 89999999999898876
Q ss_pred cccCCC
Q psy5522 992 NQAVNG 997 (1397)
Q Consensus 992 ~q~~~G 997 (1397)
.-...|
T Consensus 1264 ~l~n~~ 1269 (1621)
T COG2373 1264 YLQNPG 1269 (1621)
T ss_pred HHhCcC
Confidence 544433
No 3
>cd02896 complement_C3_C4_C5 Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.
Probab=100.00 E-value=2.6e-56 Score=516.16 Aligned_cols=279 Identities=39% Similarity=0.644 Sum_probs=242.6
Q ss_pred ccCcchhhccCCCchhhhhcccchHHHHHHHHHHhcccc---HHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCC
Q psy5522 880 IPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLKKTYQLT---DAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNG 956 (1397)
Q Consensus 880 ~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~---~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~ 956 (1397)
.+||++|++|||||+||||++|+|++|+++||+.+++++ ++.++++.++|++||+|+++||++||||++|++. .+
T Consensus 2 ~~~l~~L~~~P~GC~EQ~~~~~~p~~~~~~yl~~~~~~~~~~~~~~~~~~~~i~~g~~r~l~~q~~dGsf~~w~~~--~~ 79 (297)
T cd02896 2 PEGLEKLIRLPTGCGEQTMIKLAPTVYALRYLDTTNQWEKLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNR--PS 79 (297)
T ss_pred chhHHHHHhCCCCcHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHHHHHHHHHHHHHHhccCCCCCccCCCCC--Cc
Confidence 468999999999999999999999999999999999865 4667899999999999999999999999999764 68
Q ss_pred CcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCC---ccEEEEEEEEecccccee
Q psy5522 957 STWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAA---KGLALTAYTLLPKKTRAV 1033 (1397)
Q Consensus 957 s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~---~~v~lt~~v~lp~~~~~~ 1033 (1397)
++||||||+++|.+|++++.||+..++++++||.++|+++|||.+...+++.-+.++++ .++.+|+++.+
T Consensus 80 s~wlTA~v~~~l~~a~~~~~v~~~~l~~a~~wL~~~Q~~dG~f~e~~~~~~~~m~gg~~~~~~~~~lTA~vl~------- 152 (297)
T cd02896 80 STWLTAFVVKVFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSEGDVSLTAFVLI------- 152 (297)
T ss_pred chhhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHhcCCCCCeeCCCCCccChhccCCccccCCCccchHHHHH-------
Confidence 99999999999999999999999999999999999999999999877777766666543 57889999877
Q ss_pred EEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchh
Q psy5522 1034 NMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPS 1113 (1397)
Q Consensus 1034 ~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~ 1113 (1397)
+|+|.+-+.+
T Consensus 153 ------------------------------------------------------------aL~~~~~~~~---------- 162 (297)
T cd02896 153 ------------------------------------------------------------ALQEARSICP---------- 162 (297)
T ss_pred ------------------------------------------------------------HHHhhhcccc----------
Confidence 7777721100
Q ss_pred hcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCCCCchHHHHHHHHHHhhCCCC
Q psy5522 1114 LQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAGTEDAYAIAICSYALHLAQHP 1193 (1397)
Q Consensus 1114 l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~y~~al~aYal~la~~~ 1193 (1397)
.. ... ....+.+|+.||+++.+++.++|++||++|||+|+|++
T Consensus 163 --------------~~---------~~~--------------~~~~i~rA~~yL~~~~~~~~~~Y~~Al~ayALal~~~~ 205 (297)
T cd02896 163 --------------PE---------VQN--------------LDQSIRKAISYLENQLPNLQRPYALAITAYALALADSP 205 (297)
T ss_pred --------------cc---------chh--------------hHHHHHHHHHHHHHhcccCCChHHHHHHHHHHHHcCCh
Confidence 00 011 45689999999999999999999999999999999999
Q ss_pred chHHHHHHHhhcccccCCcccccccCCCCCCCCCCC-CCCCcccchhhHHHHHHhhccCCcccchhhhhhhhhcccCCCC
Q psy5522 1194 VKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWA-QVPNSVDVEMTSYALLSYLDRGLVEDTLPILTWLVTQQNDQGG 1272 (1397)
Q Consensus 1194 ~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~-~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~Wl~~q~n~~Gg 1272 (1397)
.+..++++|+++++.+++..||..+.. ..|+ ..+++++||+||||||++|..+++++|.|||+||++|||++||
T Consensus 206 ~~~~a~~~L~~~~~~d~~~~~~~~~~~-----~~~~~~~~~~~~vE~TAYALLall~~~~~~~a~~iv~WL~~qr~~~Gg 280 (297)
T cd02896 206 LSHAANRKLLSLAKRDGNGWYWWTIDS-----PYWPVPGPSAITVETTAYALLALLKLGDIEYANPIARWLTEQRNYGGG 280 (297)
T ss_pred hhHHHHHHHHHHhhhCCCcceeccCcC-----ccCCCCCCchhhhHHHHHHHHHHHhcCCchhHHHHHHHHHhcCCCCCC
Confidence 888899999999999998877755311 1232 3567889999999999999999999999999999999999999
Q ss_pred ccccccC
Q psy5522 1273 FASTQMS 1279 (1397)
Q Consensus 1273 f~stQdT 1279 (1397)
|+|||||
T Consensus 281 f~sTQdT 287 (297)
T cd02896 281 FGSTQDT 287 (297)
T ss_pred eehHHHH
Confidence 9999999
No 4
>cd02897 A2M_2 Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.
Probab=100.00 E-value=1.4e-50 Score=469.24 Aligned_cols=280 Identities=49% Similarity=0.798 Sum_probs=242.6
Q ss_pred cccCcchhhccCCCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCc
Q psy5522 879 SIPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGST 958 (1397)
Q Consensus 879 ~~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~ 958 (1397)
++++|++|+++||||+||||++++|++|+++||+.++++++++++++.++|++||+|+++||++||||++|++.+..+++
T Consensus 1 ~~~~l~~L~~~P~gc~Eq~~~~~~p~~~~~~yl~~~~~~~~~~~~~~~~~l~~g~~~~~~~q~~dGsf~~w~~~~~~~~~ 80 (292)
T cd02897 1 ALQNLDNLLRMPYGCGEQNMVNFAPNIYVLDYLKATGQLTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDKSGST 80 (292)
T ss_pred ChhhHHHHHhCCCCcHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHhccCCCCCeecccCCCCCcch
Confidence 35789999999999999999999999999999999999999999999999999999999999999999999765457899
Q ss_pred chhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccceeEEEEe
Q psy5522 959 WLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRAVNMTAT 1038 (1397)
Q Consensus 959 wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~~~v~~~ 1038 (1397)
|+||||+++|.+|++++.+|+..++++++||.++|+++|||.+...+++..+++.....+.+|+++.+
T Consensus 81 wlTa~v~~~L~~a~~~~~v~~~~i~ra~~wL~~~Q~~dG~f~~~~~~~~~~~~~~~~~~~~~TA~vl~------------ 148 (292)
T cd02897 81 WLTAFVLKSFAQARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQGGVDDEVALTAYVLI------------ 148 (292)
T ss_pred hhHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHhcCCCCCCCCCCcccChhhcCCCCCCcchHHHHHH------------
Confidence 99999999999999889999999999999999999999999876656555454443455677887766
Q ss_pred eeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchhhcccC
Q psy5522 1039 GSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQ 1118 (1397)
Q Consensus 1039 G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~ 1118 (1397)
|+++.
T Consensus 149 -------------------------------------------------------aL~~~-------------------- 153 (292)
T cd02897 149 -------------------------------------------------------ALLEA-------------------- 153 (292)
T ss_pred -------------------------------------------------------HHHhc--------------------
Confidence 77776
Q ss_pred ceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCCCCchHHHHHHHHHHhhCCCCchHHH
Q psy5522 1119 NVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAGTEDAYAIAICSYALHLAQHPVKDVA 1198 (1397)
Q Consensus 1119 ~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~y~~al~aYal~la~~~~~~~~ 1198 (1397)
+. +. .++++.+|+.||++....+.++|.+||++|||++++++....+
T Consensus 154 ---------g~----------~~--------------~~~~i~~a~~yL~~~~~~~~~~y~~al~a~AL~~~~~~~~~~~ 200 (292)
T cd02897 154 ---------GL----------PS--------------ERPVVEKALSCLEAALDSISDPYTLALAAYALTLAGSEKRPEA 200 (292)
T ss_pred ---------CC----------cc--------------ccHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHcCCccHHHH
Confidence 32 11 4578999999999998888999999999999999997666779
Q ss_pred HHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccC--CcccchhhhhhhhhcccCCCCcccc
Q psy5522 1199 FNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRG--LVEDTLPILTWLVTQQNDQGGFAST 1276 (1397)
Q Consensus 1199 ~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~--~~~~a~~i~~Wl~~q~n~~Ggf~st 1276 (1397)
+++|+.+++.+++..||..... ......+++.+.+.+||+||||||++++.+ +++.+.+|++||.+|||++|||+||
T Consensus 201 ~~~l~~~~~~~~~~~~W~~~~~-~~~~~~~~~~~~~~~ve~TAyaLlall~~~~~~~~~~~~~v~WL~~~q~~~Ggf~sT 279 (292)
T cd02897 201 LKKLDELAISEDGTKHWSRPPP-SEEGPSYYWQAPSAEVEMTAYALLALLSAGGEDLAEALPIVKWLAKQRNSLGGFSST 279 (292)
T ss_pred HHHHHHHHhccCCCCCCCcCCC-cccccccCCCCCcchHHHHHHHHHHHHHcCCccHhHHHHHHHHHHHcCCCCCCcccH
Confidence 9999999999889999987521 111223334556789999999999999877 8899999999999999999999999
Q ss_pred ccC
Q psy5522 1277 QMS 1279 (1397)
Q Consensus 1277 QdT 1279 (1397)
|||
T Consensus 280 Qdt 282 (292)
T cd02897 280 QDT 282 (292)
T ss_pred HHH
Confidence 999
No 5
>PF07678 A2M_comp: A-macroglobulin complement component; InterPro: IPR011626 This domain covers the complement component region of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0005615 extracellular space; PDB: 1QSJ_D 1QQF_A 4ACQ_C 2B39_B 2WIN_H 2I07_B 2ICF_B 2XWJ_D 3G6J_B 2NOJ_C ....
Probab=100.00 E-value=1.3e-49 Score=447.46 Aligned_cols=233 Identities=45% Similarity=0.767 Sum_probs=199.7
Q ss_pred hccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhh
Q psy5522 931 TGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADM 1010 (1397)
Q Consensus 931 ~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~ 1010 (1397)
+||||+|+||++||||++| ..+.++|+||||||+++|.+|+++++||+..|+++++||.++|+++|||.+.+.+++..+
T Consensus 1 ~GYqr~L~y~~~DGsfs~f-~~~~~~s~WLTAfv~k~f~~a~~~i~vd~~~i~~a~~wL~~~Q~~dG~F~e~~~~~~~~~ 79 (246)
T PF07678_consen 1 QGYQRQLSYRRSDGSFSAF-SSDSPSSTWLTAFVVKVFSQAKKYIFVDENVICRAVKWLISQQQPDGSFEEDGPVIHREM 79 (246)
T ss_dssp HHHHHHHTTB-TTSSBBSS-TTTSSBBHHHHHHHHHHHHHHTTTS-CEHHHHHHHHHHHHHHBETTSEB--SSS-SSGGG
T ss_pred CchHHHhcCCCCCCCeecc-ccCCcccHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHhhcCCCccccCCCcccccc
Confidence 5999999999999999999 555689999999999999999999999999999999999999999999999888988888
Q ss_pred cCCCCccEEEEEEEEeccccceeEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCC
Q psy5522 1011 QGGAAKGLALTAYTLLPKKTRAVNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGD 1090 (1397)
Q Consensus 1011 ~~~~~~~v~lt~~v~lp~~~~~~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~ 1090 (1397)
+++++.++.||+||.+
T Consensus 80 ~g~~~~~~~lTA~Vli---------------------------------------------------------------- 95 (246)
T PF07678_consen 80 QGGVEDDIALTAYVLI---------------------------------------------------------------- 95 (246)
T ss_dssp SGGGTHHHHHHHHHHH----------------------------------------------------------------
T ss_pred CCCCCCCeeehHHHHH----------------------------------------------------------------
Confidence 8887888999999988
Q ss_pred ccEEEEEecCCcccccCCCCchhhcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhh
Q psy5522 1091 SNMAVMEVSLPSGFTVDSDALPSLQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKN 1170 (1397)
Q Consensus 1091 s~Mailev~l~SGf~~~~~~l~~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~ 1170 (1397)
|++|.+=..+ .. .+. ...+++||+.||+++
T Consensus 96 ---AL~e~~~~~~------------------------~~---------~~~--------------~~~~i~kA~~~L~~~ 125 (246)
T PF07678_consen 96 ---ALLEAGSLCD------------------------SE---------KPE--------------YENAINKALNYLERH 125 (246)
T ss_dssp ---HHHHCHCCHT------------------------TT---------HHC--------------HHHHHHHHHHHHHHH
T ss_pred ---HHHhhhhhcc------------------------cc---------chh--------------hHHHHHHHHHHHHHh
Confidence 8888821000 00 123 567999999999999
Q ss_pred cCCCCchHHHHHHHHHHhhCC-CCchHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhc
Q psy5522 1171 LAGTEDAYAIAICSYALHLAQ-HPVKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLD 1249 (1397)
Q Consensus 1171 ~~~~~~~y~~al~aYal~la~-~~~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~ 1249 (1397)
++++.++|++||+||||+|++ ++.+..++++|++.|+.+++.+||+......+...+|+ .+++.+||+||||||+++.
T Consensus 126 ~~~~~~~Y~lAl~aYAL~la~~~~~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~-~~~s~~vEtTaYaLLa~l~ 204 (246)
T PF07678_consen 126 LDNIQDPYTLALVAYALALAGDSPQASKLLNKLNSMATTEGGLRYWSSDESSSSSSSPWS-RGSSLDVETTAYALLALLK 204 (246)
T ss_dssp HGCTSSHHHHHHHHHHHHHTTTCHHHHHHHHHHHCHCEETTTTCEE-SSSSSSSSSSTTT--SHHHHHHHHHHHHHHHHH
T ss_pred ccccCCHHHHHHHHHHHHhhcccchHHHHHHHHHHhhhhccccCcccCCccccccccccc-ccchHHHHHHHHHHHHHHh
Confidence 999999999999999999996 44457799999999999999999999876554433333 4789999999999999999
Q ss_pred cCCcccchhhhhhhhhcccCCCCccccccC
Q psy5522 1250 RGLVEDTLPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus 1250 ~~~~~~a~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
.+++.+|.|||+||++|||++|||+|||||
T Consensus 205 ~~~~~~~~~iv~WL~~qr~~~Ggf~STQdT 234 (246)
T PF07678_consen 205 RGDLEEASPIVRWLISQRNSGGGFGSTQDT 234 (246)
T ss_dssp HTCHHHHHHHHHHHHHCTTTTSSTSSHHHH
T ss_pred cccHHHHHHHHHHHHHhcCCCCccCcHHHH
Confidence 999999999999999999999999999999
No 6
>cd02891 A2M_like Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement is an effector of both the acquired and innate immune systems The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propaga
Probab=100.00 E-value=2e-36 Score=350.22 Aligned_cols=270 Identities=45% Similarity=0.718 Sum_probs=226.5
Q ss_pred ccCcchhhccCCCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcc
Q psy5522 880 IPNLANLIKMPFGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTW 959 (1397)
Q Consensus 880 ~~~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~w 959 (1397)
.+++++|++|||||+||+|++++|++++.+||...+++++..++++...|++|+++++++|++||||++|++.+ .+++|
T Consensus 2 ~~~l~~L~~yPygC~EQ~~s~~~p~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~Q~~dGgf~~w~~~~-~~~~~ 80 (282)
T cd02891 2 LGNLDYLLRYPYGCGEQTMSRAAPNLYVLKYLDATGQLTPEIREKALEYIRKGYQRLLTYQRSDGSFSAWGNSD-SGSTW 80 (282)
T ss_pred cchHHHHHcCCCCcHHHHHHHHHHHHHHHHHHHHhCcCCHHHHHHHHHHHHHHHHHHHhhcCCCCCccccCCCC-CCchH
Confidence 46799999999999999999999999999999999999888888999999999999999999999999998753 68999
Q ss_pred hhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccceeEEEEee
Q psy5522 960 LTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRAVNMTATG 1039 (1397)
Q Consensus 960 lTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~~~v~~~G 1039 (1397)
+||||+++|..|++...+|+..+.++++||.++|+.+|+|.......+.-+.+.....+.+++++..
T Consensus 81 ~Ta~~~~~L~~a~~~~~v~~~~i~ra~~~L~~~q~~~g~~~~~~~~~~~~~~~~~~~~~~~tA~al~------------- 147 (282)
T cd02891 81 LTAYVVKFLSQARKYIDVDENVLARALGWLVPQQKEDGSFRELGPVIHREMKGGVDDSVSLTAYVLI------------- 147 (282)
T ss_pred HHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHhccCCCCCcCCCCCccCHhhcCCcCCCcchHHHHHH-------------
Confidence 9999999999999887899999999999999999999999877654222112222233445555443
Q ss_pred eeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCchhhcccCc
Q psy5522 1040 SGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQN 1119 (1397)
Q Consensus 1040 ~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~~ 1119 (1397)
++.+.
T Consensus 148 ------------------------------------------------------~L~~~--------------------- 152 (282)
T cd02891 148 ------------------------------------------------------ALAEA--------------------- 152 (282)
T ss_pred ------------------------------------------------------HHHHh---------------------
Confidence 44444
Q ss_pred eeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCCCCchHHHHHHHHHHhhCCCCc-hHHH
Q psy5522 1120 VKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAGTEDAYAIAICSYALHLAQHPV-KDVA 1198 (1397)
Q Consensus 1120 i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~y~~al~aYal~la~~~~-~~~~ 1198 (1397)
+.. ....+.+|+.||.++.....++|.+|++||||++.|+.. +.++
T Consensus 153 --------g~~-------------------------~~~~~~~a~~~L~~~~~~~~~~~~~a~la~al~~~g~~~~~~~~ 199 (282)
T cd02891 153 --------GKA-------------------------CDASIEKALAYLETQLDGLLDPYALAILAYALALAGDSTRADEA 199 (282)
T ss_pred --------ccc-------------------------chHHHHHHHHHHHHhcccCCChHHHHHHHHHHHHcCccHHHHHH
Confidence 321 245789999999999887889999999999999999755 4679
Q ss_pred HHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccCCcccchhhhhhhhhcccCCCCcccccc
Q psy5522 1199 FNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRGLVEDTLPILTWLVTQQNDQGGFASTQM 1278 (1397)
Q Consensus 1199 ~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~Wl~~q~n~~Ggf~stQd 1278 (1397)
+++|.+.+...++..+|... .+.+..+++.+|+|+|+||+.+..++...+.+|++||.+||+..|+|.||||
T Consensus 200 l~~l~~~~~~~~~~~~~~~~--------~~~~~~s~~~~~a~a~all~~~~~~~~~~~~~~~~~L~~~~~~~~~~~sTq~ 271 (282)
T cd02891 200 LKKLLEAAREKGGTAHWSLS--------WPGDYGSSLRVEATAYALLALLKLGDLEEAGPIAKWLAQQRNSGGGFLSTQD 271 (282)
T ss_pred HHHHHHHhhhcCCcccccCC--------CCCCCCchhhHHHHHHHHHHHHhcCChhhHHHHHHHHHHcCCCCCCcccHHH
Confidence 99999999988888888211 2233446688999999999999888888999999999999999999999999
Q ss_pred C
Q psy5522 1279 S 1279 (1397)
Q Consensus 1279 T 1279 (1397)
|
T Consensus 272 t 272 (282)
T cd02891 272 T 272 (282)
T ss_pred H
Confidence 9
No 7
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=99.90 E-value=8.7e-23 Score=209.48 Aligned_cols=135 Identities=36% Similarity=0.517 Sum_probs=124.9
Q ss_pred EEEEecCCCCCCCCeEEEEEEecCCcceEEEEEEeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEec-CC
Q psy5522 419 IQAALLTQNPKVNKDVELEINSTAPLKYISYQVLGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVRE-DG 497 (1397)
Q Consensus 419 L~i~~~~~~~~~G~~~~v~v~s~~~~~~~~ylV~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~-~g 497 (1397)
|+|.++++.|+|||++++.+.++.+...++|+|++||+|+..+.+++.. +..+++|+++++|+|++++++|++++ +|
T Consensus 1 l~i~~~~~~~~~Ge~~~v~v~~~~~~~~~~~~v~s~g~I~~~~~~~~~~--~~~~~~~~v~~~~~P~~~v~~~~v~~~~g 78 (136)
T PF07703_consen 1 LQISTDKDSYKPGETAKVTVQSPFPNGTFLYLVESRGKIVSTGSVELKN--GSTTFEFPVTPDMAPNFYVLAYYVRPADG 78 (136)
T ss_dssp EEEEE-SSSB-TTSEEEEEEEEESCESEEEEEEEETTEEEEEEEEECTT--TSSEEEEEE-GGGTSEEEEEEEEETTCTC
T ss_pred CEEEcCCCCcCCCCEEEEEEEcCCCccEEEEEEEECCeEEEEEEEEecC--CcEEEEEecchhcCCcEEEEEEEEcCCCC
Confidence 6899999999999999999999998667999999999999999999765 34589999999999999999999998 99
Q ss_pred eEEEEeEEEEEcccccceeeEEecCCCCCCCceEEEEEEecCCcEEEEEEEeeecccc
Q psy5522 498 EVVADGLDLELEGGLQNFVSANVSPDETEPGSNIQINLEAKPNSYIGLLAVDQKVLLL 555 (1397)
Q Consensus 498 evvads~~~~V~~~f~nkV~l~~s~~~~~Pg~~v~l~V~a~p~S~v~L~aVD~sVllL 555 (1397)
|+++|+++|.|+.|+.++|++++++++++||++++++|++.++|+|+|+|||||||+|
T Consensus 79 ~~~~~s~~i~V~~~~~~~v~l~~~~~~~~Pg~~~~~~i~~~~~s~v~l~~VD~sv~~L 136 (136)
T PF07703_consen 79 EVVADSVWIEVEPCFELKVELTASPDEYKPGEEVTLRIKAPPNSLVGLSAVDESVLLL 136 (136)
T ss_dssp EEEEEEEEEEBGCSGSSSEEEEESSSSBTTTSEEEEEEEESTTEEEEEEEEEHHHHHH
T ss_pred eEEEEEEEEEecccccceEEEEEecceeCCCCEEEEEEEeCCCCEEEEEEeEhhhhcC
Confidence 9999999999999999999999999999999999999999999999999999999976
No 8
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=99.89 E-value=1.8e-22 Score=191.56 Aligned_cols=91 Identities=44% Similarity=0.804 Sum_probs=81.2
Q ss_pred ceeEEEEEe-ccCceEEEEEEcCCccccEEEEEEEEeCCCceeeeeCCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEE
Q psy5522 660 TWLFQMEET-GFDGKVMVNEKVPDSITSWVLSAFSVDSLYGLGLMDMPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVF 738 (1397)
Q Consensus 660 TwiW~~~~~-~~~G~a~v~~tvPDtiT~W~~~Afa~s~~~G~G~a~~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~Vf 738 (1397)
||+|.+..+ |++|++++++++||+||+|+++|||++++.|||++ ++.+++++||||++++||||+++||++.++++||
T Consensus 1 T~~w~~~~~~d~~G~~~~~~~lPd~it~w~v~a~a~s~~~~~g~~-~~~~~~v~~p~~i~~~lP~~l~~GD~~~i~v~v~ 79 (92)
T PF00207_consen 1 TWFWSPIVVTDANGKATFSFTLPDSITSWRVTAFAVSPTGGFGIA-EPPEITVFKPFFIQLNLPRSLRRGDQIQIPVTVF 79 (92)
T ss_dssp EEEEEEEEECCTTEEEEEEEE-SSSSSEEEEEEEEEETTTEEEEE-CCEEEEEB-SEEEEEE--SEEETTSEEEEEEEEE
T ss_pred CeEEeeeeccCCCCEEEEEEECCCCccEEEEEEEEECCCCcceEe-cceEEEEEeeEEEEcCCCcEEecCCEEEEEEEEE
Confidence 799999975 69999999999999999999999999999999999 4559999999999999999999999999999999
Q ss_pred cCCCccEEEEEEE
Q psy5522 739 NYLSQDLVADVTL 751 (1397)
Q Consensus 739 Nyl~~~~~v~v~L 751 (1397)
||++++++|+|+|
T Consensus 80 N~~~~~~~v~V~l 92 (92)
T PF00207_consen 80 NYTDKDQEVTVTL 92 (92)
T ss_dssp E-SSS-EEEEEEE
T ss_pred eCCCCCEEEEEEC
Confidence 9999999999986
No 9
>PF07677 A2M_recep: A-macroglobulin receptor; InterPro: IPR009048 This entry represents the receptor-binding domain (RBD) of alpha-2-macroglobulin proteins. The RBD is located at the C terminus, its structure having an immunoglobulin-like fold consists of a sandwich of nine strands in two sheets with a Greek-key topology [, ]. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0005576 extracellular region; PDB: 1AYO_A 2B39_B 3HS0_C 3PRX_B 3PVM_D 3HRZ_C 3FRP_B 1EDY_A 3KLS_B 3KM9_B ....
Probab=99.84 E-value=7e-22 Score=186.17 Aligned_cols=49 Identities=49% Similarity=0.739 Sum_probs=43.7
Q ss_pred ccEEEEEecCCcccccCCCCchhhcccCc--eeEEEEeCCeEEEEEEeeecCC
Q psy5522 1091 SNMAVMEVSLPSGFTVDSDALPSLQVSQN--VKRVETKNGNTMVVLYFAYHQV 1141 (1397)
Q Consensus 1091 s~Mailev~l~SGf~~~~~~l~~l~~~~~--i~~~e~~~~~~~v~lY~~~~~~ 1141 (1397)
||||||||+|||||.|+.++|++|..... |+|+|++++++ +||||+++.
T Consensus 1 S~Mai~eV~lpSGf~~~~~~l~~l~~~~~~~~~r~E~~~~~V--~lY~d~i~~ 51 (91)
T PF07677_consen 1 SNMAIMEVGLPSGFVPDQDSLKQLQNSNDVKLKRYEVDDNKV--ILYFDEIPS 51 (91)
T ss_dssp BSSEEEEEE-STTEEEEHHHHHHHHCSTTESESEEEEETTEE--EEEESEBCS
T ss_pred CceEEEEEcCCCceeEChhHhhccccCCceeEEEEEEcCCEE--EEEEeeccc
Confidence 79999999999999999999999988654 56999999985 999999985
No 10
>KOG1366|consensus
Probab=99.84 E-value=4.2e-20 Score=243.37 Aligned_cols=199 Identities=26% Similarity=0.366 Sum_probs=165.1
Q ss_pred CCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhcCCC-------CccEEEEE-
Q psy5522 954 PNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQGGA-------AKGLALTA- 1022 (1397)
Q Consensus 954 ~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~~~~-------~~~v~lt~- 1022 (1397)
.+.++++|||||++++..++. +.+.++++||.+||+..|||.||||. ++||++... ...+++..
T Consensus 1154 ~s~~VE~tsYaLL~~~~~~~~-----~~~~pivrWl~~qr~~~GGf~STQdTvvalqaLs~y~~~~~~~~~~~~V~i~~~ 1228 (1436)
T KOG1366|consen 1154 RSIDVETTAYALLAYLLLAQV-----DYALPIVRWLVEQRNALGGFSSTQDTVVALQALSEYAALSHTETLSLQVTIQVP 1228 (1436)
T ss_pred chhhccchHHHHHHHHHhccc-----CcCchhHhhhhhhhcccCceeehHHHHHHHHHHHHhhhhcccceeEEEEEEEec
Confidence 357899999999999988764 27788999999999999999999983 889888421 11233221
Q ss_pred --------------------EEEeccccceeEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccC-C----CCCce
Q psy5522 1023 --------------------YTLLPKKTRAVNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKN-S----DSNHL 1077 (1397)
Q Consensus 1023 --------------------~v~lp~~~~~~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~-~----~~~~~ 1077 (1397)
.+.+|..++.+.++|+|.||+++|++++||+.++++++.|.|.+++.+. + +....
T Consensus 1229 ~~~~~~~~~~~~~~N~lllQ~~~~p~~~~~v~~ta~G~g~~~~q~~~~Ynv~~~~~~~~f~l~~~~~~~~~~~~~s~~~~ 1308 (1436)
T KOG1366|consen 1229 GKSSGTSQVDSTPNNLLLLQSVSLPSLHKTVCFTAKGQGCATVQISYTYNLLTPKSDPRFFLIVSTKPEPVEFDLSNIVV 1308 (1436)
T ss_pred ccCCCceeeeccccchhhcceeecccCCCceEEEEecceeEEEEeeEEEeecCcccCCCceeeeeeccccccCccccceE
Confidence 2466788999999999999999999999999999999999999987432 2 23345
Q ss_pred EEEEEEeec-CCCCccEEEEEecCCcccccCCCCchhhcccCceeEEEEeCCeEEEEEEeeecC-Cccceeeeccccccc
Q psy5522 1078 QLSICSGFI-GEGDSNMAVMEVSLPSGFTVDSDALPSLQVSQNVKRVETKNGNTMVVLYFAYHQ-VLPWTLMHYLVSKYP 1155 (1397)
Q Consensus 1078 ~l~vc~~y~-~~~~s~Mailev~l~SGf~~~~~~l~~l~~~~~i~~~e~~~~~~~v~lY~~~~~-~~~~~~~~~~~~~~~ 1155 (1397)
....|..+. .+++||||||||+|||||.++.++|.+|....++.+.|.+.++..|++|||+++ .+++.+.|+...-+|
T Consensus 1309 ~~~~~~~~~~~~~~SnMAIldVklpSGf~~~~~sl~~le~~~~v~~~~~~~~~~~v~~Y~D~v~~~~~~cv~F~a~~~~~ 1388 (1436)
T KOG1366|consen 1309 VYTTCCERTGSSGRSNMAILDVKLPSGFRADHESLGLLEQLEDVYRVEVENDHRRVLIYFDSVSGNETFCVRFEAVQSFP 1388 (1436)
T ss_pred EEEEEEeecCCCCCCCceeeeeecccCcccChhhHHHhhccceEEEEEEecCCcceEEEeeccCCCCceEEEEEEEEEee
Confidence 556666555 467899999999999999999999999999999999999999877899999999 999999998888776
Q ss_pred ch
Q psy5522 1156 RM 1157 (1397)
Q Consensus 1156 ~~ 1157 (1397)
..
T Consensus 1389 V~ 1390 (1436)
T KOG1366|consen 1389 VA 1390 (1436)
T ss_pred hh
Confidence 43
No 11
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=99.80 E-value=3.6e-19 Score=171.94 Aligned_cols=93 Identities=37% Similarity=0.716 Sum_probs=80.3
Q ss_pred EEEEEcCCCcccCCceeEEEEEEEcCC--CccCCCcceEEEEECCCCCeEEeeee-eccCCceeeeeeecCCCCCCeeEE
Q psy5522 99 SVFIQLDKAIYKPGDLVRFRAIVLNSH--LKPSVTGALEIFITDGKGNRIKQWTR-ALTTRGVFSADLQLSKSPVLGDWN 175 (1397)
Q Consensus 99 ~vfIqTDKpIYkPGqtV~fRvv~ld~~--l~P~~~~~~~v~I~DP~g~~I~qw~~-~~~~~Gi~~~sf~L~~~~~lG~y~ 175 (1397)
++|||||||+|||||+|+||+++++.+ +.|..+.++.|+|.||+|+.|.+|.. ..+..|+++++|+||+++.+|.|+
T Consensus 1 ~~~i~TDr~iYrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~ 80 (99)
T PF01835_consen 1 KIFIQTDRPIYRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYT 80 (99)
T ss_dssp EEEEEESSSEE-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEE
T ss_pred CEEEECCccCcCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEE
Confidence 589999999999999999999999887 66667899999999999999999999 789999999999999999999999
Q ss_pred EEEEEc---CeEEEEEEEE
Q psy5522 176 ITINVL---DQKFTKRFTV 191 (1397)
Q Consensus 176 I~v~~~---~~~~~~~F~V 191 (1397)
|++... +...+++|+|
T Consensus 81 i~~~~~~~~~~~~~~~F~V 99 (99)
T PF01835_consen 81 IRVKTDDDGGQSFSKTFQV 99 (99)
T ss_dssp EEEEETTTTCEEEEEEEEE
T ss_pred EEEEEccCCCCEEEEEEEC
Confidence 999993 3778999987
No 12
>cd00688 ISOPREN_C2_like This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein.
Probab=99.71 E-value=1.4e-17 Score=193.16 Aligned_cols=262 Identities=34% Similarity=0.497 Sum_probs=180.2
Q ss_pred cCcchhhccCCC--------chhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCC
Q psy5522 881 PNLANLIKMPFG--------CGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTT 952 (1397)
Q Consensus 881 ~~l~~Ll~~P~G--------CgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~ 952 (1397)
.++++|+++|++ |+||+++.++|.+....+....+ .+.+....++.+++.++++|+.||+|++|...
T Consensus 3 ~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~q~~dG~~~~~~~~ 77 (300)
T cd00688 3 KHLKYLLRYPYGDGHWYQSLCGEQTWSTAWPLLALLLLLAATG-----IRDKADENIEKGIQRLLSYQLSDGGFSGWGGN 77 (300)
T ss_pred hHHHHHHHccccCCCccccCCcCcHHHHHHHHHHHHHhCCccc-----ccccchHHHHHHHHHHHhccCCCCCccCCCCC
Confidence 468899999999 99999999999888776654321 11122233445555666788999999999653
Q ss_pred CCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHhHHHhhcCCCCccEEEEEEEEeccccce
Q psy5522 953 DPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKVSHADMQGGAAKGLALTAYTLLPKKTRA 1032 (1397)
Q Consensus 953 ~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdvl~al~~~~~~~~v~lt~~v~lp~~~~~ 1032 (1397)
. .+++|.|+|++++|..+++...++...+.++++||.++|+.+|+|......... ......++..++++..
T Consensus 78 ~-~~~~~~T~~~~~~l~~~~~~~~~~~~~~~~~~~~l~~~q~~dG~~~~~~~~~~~--~~~~~~~~~~t~~al~------ 148 (300)
T cd00688 78 D-YPSLWLTAYALKALLLAGDYIAVDRIDLARALNWLLSLQNEDGGFREDGPGNHR--IGGDESDVRLTAYALI------ 148 (300)
T ss_pred C-CcchHhHHHHHHHHHHcCCccccCHHHHHHHHHHHHHccCCCCCeeeecCCCCc--ccCCCCcccHHHHHHH------
Confidence 2 679999999999999887654567788999999999999999999753211000 0000111222222111
Q ss_pred eEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCch
Q psy5522 1033 VNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALP 1112 (1397)
Q Consensus 1033 ~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~ 1112 (1397)
++.+.
T Consensus 149 -------------------------------------------------------------aL~~~-------------- 153 (300)
T cd00688 149 -------------------------------------------------------------ALALL-------------- 153 (300)
T ss_pred -------------------------------------------------------------HHHHc--------------
Confidence 22221
Q ss_pred hhcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCC--------CCchHHHHHHH
Q psy5522 1113 SLQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAG--------TEDAYAIAICS 1184 (1397)
Q Consensus 1113 ~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~--------~~~~y~~al~a 1184 (1397)
+. .. ....+.+++.||.++... ..++|..|.++
T Consensus 154 ---------------~~----------~~--------------~~~~~~~~~~~l~~~q~~~g~~~~~~~~~~~~t~~~~ 194 (300)
T cd00688 154 ---------------GK----------LD--------------PDPLIEKALDYLLSCQNYDGGFGPGGESHGYGTACAA 194 (300)
T ss_pred ---------------CC----------CC--------------CcHHHHHHHHHHHHHhcCCCCcCCCccccHHHHHHHH
Confidence 11 00 135688999999887642 47899999999
Q ss_pred HHHhhCCCC---chHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccCCcccchhhhh
Q psy5522 1185 YALHLAQHP---VKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRGLVEDTLPILT 1261 (1397)
Q Consensus 1185 Yal~la~~~---~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~~~a~~i~~ 1261 (1397)
+||++++.. ..+.+++.|.+....+++..+|.... ...+......++.|+|++.....+.....++++
T Consensus 195 ~aL~~~~~~~~~~~~~~~~~L~~~q~~~g~~~~~~~~~---------~~~~~~~~~~~~~~aL~~~~~~~~~~~~~~~~~ 265 (300)
T cd00688 195 AALALLGDLDSPDAKKALRWLLSRQRPDGGWGEGRDRT---------NKLSDSCYTEWAAYALLALGKLGDLEDAEKLVK 265 (300)
T ss_pred HHHHHcCCcchHHHHHHHHHHHHhcCCCCCcCccccCC---------CCcCchHHHHHHHHHHHHHhhhcCcccHHHHHH
Confidence 999999973 34668888888887777665543210 112233445566666666554446678899999
Q ss_pred hhhhcccCCCCccc-------cccC
Q psy5522 1262 WLVTQQNDQGGFAS-------TQMS 1279 (1397)
Q Consensus 1262 Wl~~q~n~~Ggf~s-------tQdT 1279 (1397)
||..||+..|||.+ ||+|
T Consensus 266 ~L~~~q~~~G~w~~~~~~~~~~~~t 290 (300)
T cd00688 266 WLLSQQNEDGGFSSKPGKSYDTQHT 290 (300)
T ss_pred HHHhccCCCCCcCcCCCCCCcchhh
Confidence 99999999999998 7877
No 13
>cd02889 SQCY Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Probab=99.51 E-value=6.6e-14 Score=167.34 Aligned_cols=259 Identities=19% Similarity=0.198 Sum_probs=163.4
Q ss_pred cccchHHHHHHHHHHhcc--ccHHHHHHHHHHHHhccccce---------ee-ecCCCCccccCCCCCCCCcchhHHHHH
Q psy5522 899 LNFVPNIVVLEYLKKTYQ--LTDAIEAKASRYLETGYQQEL---------TY-RRPDGSFSAFGTTDPNGSTWLTAFVAK 966 (1397)
Q Consensus 899 ~~~~p~i~~l~yL~~~~~--l~~~~~~~a~~~l~~gy~~~l---------~y-~~~dGsfs~~~~~~~~~s~wlTAyvl~ 966 (1397)
+.+....+++..|...++ ...+..+||++||.+. ++ .| +.++|+|.+-.......+.-.||||++
T Consensus 23 ~~~~~Ta~~~~al~~~g~~~~~~~~~~ka~~~l~~~---q~~~~~~~~~~~~~~~~~Ggw~y~~~~~~~~~~~~Ta~~l~ 99 (348)
T cd02889 23 SQVWDTALALQALLEAGLAPEFDPALKKALEWLLKS---QIRDNPDDWKVKYRHLRKGGWAFSTANQGYPDSDDTAEALK 99 (348)
T ss_pred CchHHHHHHHHHHHHcCCCCccCHHHHHHHHHHHhc---CCCCCCCchhhcCCCCCCCcCcccCcCCCCCCCCChHHHHH
Confidence 444556677788877765 3456667999999873 21 12 256888764321111245569999999
Q ss_pred HHHHccCCC----ccCHHHHHHHHHHHHhcccCCCccccchHh----H-H--HhhcCC--C-CccEEEEEEEEeccccce
Q psy5522 967 SFRQAASHT----TIDESVILEALAWLSSNQAVNGSFPEVGKV----S-H--ADMQGG--A-AKGLALTAYTLLPKKTRA 1032 (1397)
Q Consensus 967 ~~~~a~~~i----~vd~~~~~~~~~WL~~~q~~~G~F~stqdv----l-~--al~~~~--~-~~~v~lt~~v~lp~~~~~ 1032 (1397)
+|..++.+. .++.+.+.++++||.++|+++|+|.+.... + . ..+.+. . ...+.+|+++.+
T Consensus 100 al~~~~~~~~~~~~~~~~~i~~a~~~L~~~Q~~dG~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ta~~l~------ 173 (348)
T cd02889 100 ALLRLQKKPPDGKKVSRERLYDAVDWLLSMQNSNGGFAAFEPDNTYKYLELIPEVDGDIMIDPPYVECTGSVLE------ 173 (348)
T ss_pred HHHHhhccCcccchhhHHHHHHHHHHHHHhccCCCCEeeecCCccHHHHhcCchhhcCCccCCCCcchHHHHHH------
Confidence 999988774 678899999999999999999999753210 1 0 000000 0 011222222211
Q ss_pred eEEEEeeeeEEEEEEEEEEeeccCCCCCceeeccccccCCCCCceEEEEEEeecCCCCccEEEEEecCCcccccCCCCch
Q psy5522 1033 VNMTATGSGFAVVYISYQYNINVTGAWPMFTLDPQVDKNSDSNHLQLSICSGFIGEGDSNMAVMEVSLPSGFTVDSDALP 1112 (1397)
Q Consensus 1033 ~~v~~~G~G~~~~q~~~~Yn~~~~~~~~~f~l~v~~~~~~~~~~~~l~vc~~y~~~~~s~Mailev~l~SGf~~~~~~l~ 1112 (1397)
+|++.
T Consensus 174 -------------------------------------------------------------aL~~~-------------- 178 (348)
T cd02889 174 -------------------------------------------------------------ALGLF-------------- 178 (348)
T ss_pred -------------------------------------------------------------HHHHh--------------
Confidence 33332
Q ss_pred hhcccCceeEEEEeCCeEEEEEEeeecCCccceeeecccccccchhhHHHHHHHHHhhcCC---------CCchHHHHHH
Q psy5522 1113 SLQVSQNVKRVETKNGNTMVVLYFAYHQVLPWTLMHYLVSKYPRMNTINKAVDYIVKNLAG---------TEDAYAIAIC 1183 (1397)
Q Consensus 1113 ~l~~~~~i~~~e~~~~~~~v~lY~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~---------~~~~y~~al~ 1183 (1397)
+.. .. ..|+ ...+.+.+|+.||.+.... ....|..|++
T Consensus 179 ---------------~~~---------~~-------~~~~--~~~~~i~~a~~~L~~~q~~dG~w~~~~~~~~~y~ta~a 225 (348)
T cd02889 179 ---------------GKL---------YP-------EHRR--EIDPAIRRAVKYLEREQEPDGSWYGRWGVCFIYGTWFA 225 (348)
T ss_pred ---------------hhc---------CC-------chHH--HHHHHHHHHHHHHHHhCCCCCCccccCCCcchHHHHHH
Confidence 110 00 0000 0236789999999876441 2468999999
Q ss_pred HHHHhhCCCCc----hHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhccC--Ccccch
Q psy5522 1184 SYALHLAQHPV----KDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDRG--LVEDTL 1257 (1397)
Q Consensus 1184 aYal~la~~~~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~--~~~~a~ 1257 (1397)
.+||.++|... .+.+++-|.+....+++ |....... ....+..+...++++|||||++++..+ +...+.
T Consensus 226 ~~aL~~~g~~~~~~~~~~~~~~L~~~Q~~dG~---w~~~~~~~--~~~~~~~~~~~~~~~Ta~al~aL~~~g~~~~~~v~ 300 (348)
T cd02889 226 LEALAAAGEDENSPYVRKACDWLLSKQNPDGG---WGESYESY--EDPSYAGGGRSTVVQTAWALLALMAAGEPDSEAVK 300 (348)
T ss_pred HHHHHHcCCCcCcHHHHHHHHHHHHccCCCCC---cCCccccc--ccccccCCCCCcHHHHHHHHHHHHhcCCCCHHHHH
Confidence 99999999753 23466666666555554 54321100 000112344568999999999999876 345678
Q ss_pred hhhhhhhhcccCCCCccccccC
Q psy5522 1258 PILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus 1258 ~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
++++||.+|||+.|||.|+|++
T Consensus 301 ~a~~wL~~~Q~~dG~f~~~~~~ 322 (348)
T cd02889 301 RGVKYLLNTQQEDGDWPQEEIT 322 (348)
T ss_pred HHHHHHHHhcCCCCCcCCceee
Confidence 8899999999999999999987
No 14
>PF10569 Thiol-ester_cl: Alpha-macro-globulin thiol-ester bond-forming region; InterPro: IPR019565 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ]. This short highly conserved region of proteinase-binding alpha-macro-globulins contains the cysteine and a glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding, and mediates the covalent binding of the alpha-macro-globulin to the proteinase. The GCGEQ motif is highly conserved. ; PDB: 2B39_B 2PN5_A 4ACQ_C 1HZF_A 2WIN_H 2I07_B 2ICF_B 2XWJ_D 3G6J_B 2NOJ_C ....
Probab=99.36 E-value=2.4e-13 Score=98.60 Aligned_cols=31 Identities=55% Similarity=1.112 Sum_probs=27.7
Q ss_pred CcchhhccCCCchhhhhcccchHHHHHHHHH
Q psy5522 882 NLANLIKMPFGCGEQNMLNFVPNIVVLEYLK 912 (1397)
Q Consensus 882 ~l~~Ll~~P~GCgEQ~m~~~~p~i~~l~yL~ 912 (1397)
||++|+++|+|||||||++|+|++++++||+
T Consensus 1 ~l~~Li~~P~GCgEQtm~~~~P~v~~~~YL~ 31 (31)
T PF10569_consen 1 NLDSLIRYPYGCGEQTMSSFAPNVYALRYLD 31 (31)
T ss_dssp TGGGGSSS--SSTTHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHhcCCCcHHHHHHHHhHHHHHHHHhC
Confidence 6899999999999999999999999999995
No 15
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=99.29 E-value=1e-07 Score=129.45 Aligned_cols=282 Identities=15% Similarity=0.128 Sum_probs=187.6
Q ss_pred ecccEEEEEcCCCcccCCceeEEEEEEEc-CCCccCCCcceEEEEECCCCCeEEeeeeeccCCc-------eeeeeeecC
Q psy5522 95 HKSYSVFIQLDKAIYKPGDLVRFRAIVLN-SHLKPSVTGALEIFITDGKGNRIKQWTRALTTRG-------VFSADLQLS 166 (1397)
Q Consensus 95 ~~~~~vfIqTDKpIYkPGqtV~fRvv~ld-~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~G-------i~~~sf~L~ 166 (1397)
.....+|+.||.+++++..+.++.+++.. .+.+|+ ..+.+.+.|-+|..+.+- .++..| .....|.+.
T Consensus 285 ~~a~~~~~VSD~g~~~~~~~~~l~v~v~sl~sg~P~--~g~~v~~~~~ngevl~~~--~tds~G~~~~~~~~~~~~~l~~ 360 (1621)
T COG2373 285 ARATRLFFVSDIGIVAKAYHDGLDVFVQSLSSGKPV--PGVEVSVLAGNGEVLVSG--TTDSQGHAKFSNAKKPAALLLA 360 (1621)
T ss_pred ccceEEEEEeCceeEeecccccEEEEEEeccCCCCC--CCcEEEEEecCCcEEeee--EEccccceecccccCCceEEEE
Confidence 56777999999999999999999998875 456775 345677889999777654 455566 444456665
Q ss_pred CCCCCeeEEEEEEEcCeEEEEEEEEeeecCCc--EEEEEeCCCceEeeCCeEEEEEEEEEecCc-cccc-eEEEEEEecc
Q psy5522 167 KSPVLGDWNITINVLDQKFTKRFTVAEYVLPK--FEVNVNVPPHATFKDSKVVISVNAKYTYGK-PVKG-EATITAYPTI 242 (1397)
Q Consensus 167 ~~~~lG~y~I~v~~~~~~~~~~F~VeEYvLPk--FeV~v~~p~~i~~~~~~~~v~v~AkYtyGk-PV~G-~v~v~v~~~~ 242 (1397)
...- +.|...-..++......|.|+.+..|- +.+-+-+++-++..||.+.+.+-+|+..|+ ++.+ .+++++...
T Consensus 361 ~~~~-~~~~~~d~~~~~~d~s~f~V~g~~~~~~~~k~y~ftDRglYRpGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~P- 438 (1621)
T COG2373 361 RKED-GDFLGLDLTGGVFDLSDFDVEGRAAPGYGLKVYLFTDRGLYRPGETVHVNALLRDFDGKTALDNQPLKLRVLDP- 438 (1621)
T ss_pred ecCC-CceEEEEcCCCccccccccccceecCCCceEEEEecCcccCCCCceeeeeeeehhhcccccccCCCeEEEEECC-
Confidence 5432 334433333334456789999998887 888889999999999999999999999999 5655 366666532
Q ss_pred ccCccccccccceeEEeecCceEEEEEechhhhccccccceEEEEEEEEEECCcceEEeeEEEE-EEEEeeeEEEEEecC
Q psy5522 243 FSGVIQPLFQTPVRKVVPIDGKTVIEFDVVKELQLTDEYERNIHFDVAVEEALTGRRQNNTGSV-VFHKHKYKMDLIKSS 321 (1397)
Q Consensus 243 ~~g~~~~~~~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~l~V~a~VtE~gTG~~~~~~~~i-~i~~~~~~i~f~~~~ 321 (1397)
.|. .......+++++|-+++.+++.++ .....+++++.+.+.. ...+.+..| .+.+..++|......
T Consensus 439 -dG~----~~~~~~~~~~~~G~~~~~~~l~~n-----a~tG~w~l~~~~~~~~--~~~s~~f~V~df~p~r~~i~l~~~k 506 (1621)
T COG2373 439 -DGS----VLRTLTITLDEEGLYELSFPLPEN-----ALTGGYTLELYTGGKS--AVISMSFRVEDFIPDRFKINLTLDK 506 (1621)
T ss_pred -CCc----EEEEEEEeccccCceEEeeeCCCC-----CCcceEEEEEEeCCcc--ceeeeeEEhhHhCCceEEEeccccc
Confidence 121 112345678899999999998661 1223556666654322 333333333 334555666655555
Q ss_pred CcccCCCeEEEEEEEEccCCCccCCCCce--EEEEEee----ecc---------cc-----ccceeeEeeCCCeeEEEEE
Q psy5522 322 EYFKPGLKYTAYMKLTHHDGTPVTDNNNM--VQVRHGF----SYD---------ES-----KYEANQYKLDRNGMIKLVY 381 (1397)
Q Consensus 322 ~~~kpG~p~~~~v~v~~~dG~P~~~~~~~--V~v~~~~----~~~---------~~-----~~~~~~~ttd~~G~~~f~i 381 (1397)
..+.+|.++.++|.+.+..|.|+.+...+ +.++... .|. .. .......++|.+|.+.|.+
T Consensus 507 ~~~~~g~~v~~~v~~~yL~GaPa~g~~~~~~l~lr~~~~~~~~~~~~~fg~~~~~~~~~~~~~~~~~~~~D~~G~a~~~l 586 (1621)
T COG2373 507 TEWVPGKDVKIKVDLRYLYGAPAAGLTVQGELDLRPTRFSVPGFPGFTFGLADEEPNSLTFEEELELTVTDGKGKASLDL 586 (1621)
T ss_pred ccccCCCcEEEEEEEEecCCCcccCceeeeEEEeecccccccCCcceeeeccccccccccccccccccccCCCCcEeEec
Confidence 66899999999999999999999964322 2222110 111 00 0111236789999999999
Q ss_pred eCCCCCceeEEEE
Q psy5522 382 YPPANENVTTLGI 394 (1397)
Q Consensus 382 ~~~~~~~~~~l~l 394 (1397)
+..+....+.+.+
T Consensus 587 ~~~~~~s~~~~~~ 599 (1621)
T COG2373 587 DLAETPSPYQLLV 599 (1621)
T ss_pred ccccCCCceEEEe
Confidence 9875543333333
No 16
>cd02892 SQCY_1 Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. This group contains bacterial SQCY which catalyzes the convertion of squalene to hopene or diplopterol and eukaryotic OSQCY which transforms the 2,3-epoxide of squalene to compounds such as, lanosterol in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain.
Probab=99.08 E-value=7.5e-10 Score=141.68 Aligned_cols=118 Identities=20% Similarity=0.247 Sum_probs=85.9
Q ss_pred hhhHHHHHHHHHhhcCC---------CCchHHHHHHHHHHhhCCCC-c----hHHHHHHHhhcccccCCcccccccCCCC
Q psy5522 1157 MNTINKAVDYIVKNLAG---------TEDAYAIAICSYALHLAQHP-V----KDVAFNLLESKAHNEDGKKWWKRAERPE 1222 (1397)
Q Consensus 1157 ~~~~~~a~~~l~~~~~~---------~~~~y~~al~aYal~la~~~-~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~ 1222 (1397)
...+++|+.||++.-.. ....|..+.+.+||..+|.. . .+.+.+-|.+....+|| |......-
T Consensus 475 ~~~i~rAv~~L~~~Q~~DGsW~g~wg~~~~Y~T~~al~AL~~~G~~~~~~~~i~~a~~~L~s~Q~~DGg---Wge~~~s~ 551 (634)
T cd02892 475 DPAIRRAVKYLLREQEPDGSWYGRWGVCYIYGTWFALEALAAAGEDYENSPYIRKACDFLLSKQNPDGG---WGESYLSY 551 (634)
T ss_pred HHHHHHHHHHHHHccCCCCCccccCCCccHHHHHHHHHHHHHhCCcccCcHHHHHHHHHHHhcCCCCCC---CCCccccc
Confidence 46899999999986431 35578999999999998875 2 35678888887665555 54321100
Q ss_pred CCCCCCCCCCCcccchhhHHHHHHhhccCCc--ccchhhhhhhhhcccCCCCccccccC
Q psy5522 1223 DKKNPWAQVPNSVDVEMTSYALLSYLDRGLV--EDTLPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus 1223 ~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~~--~~a~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
. ...+..+...+++.||||||+++..++. ....+.++||.++|++.|||.++|+|
T Consensus 552 ~--~~~~~~~~~s~~~~TA~AllaLl~~g~~~~~~i~r~i~wL~~~Q~~~G~w~~~~~~ 608 (634)
T cd02892 552 E--DKSYAGGGRSTVVQTAWALLALMAAGEPDSEAVERGIKYLLNTQLPDGDWPQEEIT 608 (634)
T ss_pred c--CcccCCCCCCcHHHHHHHHHHHHHcCCCChHHHHHHHHHHHHcCCCCCCCCCcccc
Confidence 0 0112223456789999999999987755 34667799999999999999999998
No 17
>cd02896 complement_C3_C4_C5 Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.
Probab=98.62 E-value=7.2e-08 Score=112.34 Aligned_cols=71 Identities=14% Similarity=0.123 Sum_probs=55.2
Q ss_pred ceeeecCCCCcccc---------CCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh-
Q psy5522 936 ELTYRRPDGSFSAF---------GTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV- 1005 (1397)
Q Consensus 936 ~l~y~~~dGsfs~~---------~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv- 1005 (1397)
++..++.||++..| +....+.+++.|||||++++..+. ...+.++++||.+||+++|+|+||||.
T Consensus 214 L~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~vE~TAYALLall~~~~-----~~~a~~iv~WL~~qr~~~Ggf~sTQdTv 288 (297)
T cd02896 214 LLSLAKRDGNGWYWWTIDSPYWPVPGPSAITVETTAYALLALLKLGD-----IEYANPIARWLTEQRNYGGGFGSTQDTV 288 (297)
T ss_pred HHHHhhhCCCcceeccCcCccCCCCCCchhhhHHHHHHHHHHHhcCC-----chhHHHHHHHHHhcCCCCCCeehHHHHH
Confidence 44456678876543 222123479999999999998874 357889999999999999999999983
Q ss_pred --HHHhhc
Q psy5522 1006 --SHADMQ 1011 (1397)
Q Consensus 1006 --l~al~~ 1011 (1397)
||||++
T Consensus 289 val~AL~~ 296 (297)
T cd02896 289 VALQALAE 296 (297)
T ss_pred HHHHHHhc
Confidence 899986
No 18
>TIGR01507 hopene_cyclase squalene-hopene cyclase. SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products.
Probab=98.55 E-value=9.7e-07 Score=112.20 Aligned_cols=150 Identities=23% Similarity=0.332 Sum_probs=93.9
Q ss_pred chhhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCCc----hHHHHHHHhhcccccCCcccccccCCCC
Q psy5522 1156 RMNTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHPV----KDVAFNLLESKAHNEDGKKWWKRAERPE 1222 (1397)
Q Consensus 1156 ~~~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~ 1222 (1397)
.++.|++|+.||++.-+ +....|..+.+.-||+.+|-.. .+.+++-|.+.- +.|| -|......-
T Consensus 469 ~~~~i~rav~~L~~~Q~~dG~W~g~wg~~~~Y~T~~al~aL~~~g~~~~~~~i~rAv~wL~~~Q-~~DG--GWge~~~sy 545 (635)
T TIGR01507 469 AWPVIERAVEYLKREQEPDGSWFGRWGVNYLYGTGAVLSALKAVGIDTREPYIQKALAWLESHQ-NPDG--GWGEDCRSY 545 (635)
T ss_pred hhHHHHHHHHHHHHccCCCCCCccCCCCccccHHHHHHHHHHHcCCCcccHHHHHHHHHHHHhc-CCCC--CCCCCCccc
Confidence 35689999999988532 2345777777777777776432 345666677653 3343 243221000
Q ss_pred CCCCCCCCCCCcccchhhHHHHHHhhccCC--cccchhhhhhhhhcccCCCCccccccCCCccccceeccccccccccCc
Q psy5522 1223 DKKNPWAQVPNSVDVEMTSYALLSYLDRGL--VEDTLPILTWLVTQQNDQGGFASTQMSKKELCPTVSAFRTHKVAQQKP 1300 (1397)
Q Consensus 1223 ~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~--~~~a~~i~~Wl~~q~n~~Ggf~stQdT~~~~c~~~~~~~~~~v~~~~p 1300 (1397)
. .+.+.....-++..||+||+++...++ -....+.++||.++||+.||+....-|. +| .-+-
T Consensus 546 ~--~~~~~g~g~s~~s~TA~AL~AL~~ag~~~~~~I~rav~~L~~~Q~~dG~W~e~~~~g--~g------------fp~~ 609 (635)
T TIGR01507 546 E--DPAYAGKGASTASQTAWALIALIAAGRAESEAARRGVQYLVETQRPDGGWDEPYYTG--TG------------FPGD 609 (635)
T ss_pred c--cccccCCCCCcHHHHHHHHHHHHHhCCCCcHHHHHHHHHHHHhcCCCCCCCCccccc--Cc------------ccce
Confidence 0 001111112378899999999997663 3456779999999999999997666652 11 1233
Q ss_pred eeEEEeeccccchhheeeeccccccccccccCCCCCCccchhHHHHHHHhhhh
Q psy5522 1301 VPVSIYDYYDQSRRARMFYEPRVATLCDICEGEDCSKDTVIGLYALAQLAEKV 1353 (1397)
Q Consensus 1301 ~~v~~ydyy~~~~~~~~~y~~~~~~~c~~c~~~~c~~~~~~~~~~~~~~~~~~ 1353 (1397)
-+++ |.||. .+--|-||+.|....
T Consensus 610 ~yi~-Y~~Y~----------------------------~~fpl~ALg~y~~~~ 633 (635)
T TIGR01507 610 FYLG-YHMYR----------------------------HVFPLLALARYKQAI 633 (635)
T ss_pred eeec-ccchh----------------------------hHhHHHHHHHHHHhh
Confidence 4566 77774 157788999998754
No 19
>PF07678 A2M_comp: A-macroglobulin complement component; InterPro: IPR011626 This domain covers the complement component region of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0005615 extracellular space; PDB: 1QSJ_D 1QQF_A 4ACQ_C 2B39_B 2WIN_H 2I07_B 2ICF_B 2XWJ_D 3G6J_B 2NOJ_C ....
Probab=98.32 E-value=5e-07 Score=102.35 Aligned_cols=51 Identities=18% Similarity=0.180 Sum_probs=45.6
Q ss_pred CCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhc
Q psy5522 956 GSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQ 1011 (1397)
Q Consensus 956 ~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~ 1011 (1397)
.++++|||+|++++..+ |.....++++||.+||+.+|||.||||. |+||++
T Consensus 190 ~~vEtTaYaLLa~l~~~-----~~~~~~~iv~WL~~qr~~~Ggf~STQdTvvaL~AL~~ 243 (246)
T PF07678_consen 190 LDVETTAYALLALLKRG-----DLEEASPIVRWLISQRNSGGGFGSTQDTVVALQALAE 243 (246)
T ss_dssp HHHHHHHHHHHHHHHHT-----CHHHHHHHHHHHHHCTTTTSSTSSHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhcc-----cHHHHHHHHHHHHHhcCCCCccCcHHHHHHHHHHHHH
Confidence 35899999999999984 5688999999999999999999999983 899986
No 20
>cd02897 A2M_2 Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.
Probab=97.95 E-value=2.8e-05 Score=90.68 Aligned_cols=54 Identities=19% Similarity=0.265 Sum_probs=46.4
Q ss_pred CCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhc
Q psy5522 955 NGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQ 1011 (1397)
Q Consensus 955 ~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~ 1011 (1397)
.++++.|||+|++++.++. -+...+.++++||.++|+++|+|.||||. |+||++
T Consensus 235 ~~~ve~TAyaLlall~~~~---~~~~~~~~~v~WL~~~q~~~Ggf~sTQdt~~al~AL~~ 291 (292)
T cd02897 235 SAEVEMTAYALLALLSAGG---EDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAK 291 (292)
T ss_pred cchHHHHHHHHHHHHHcCC---ccHhHHHHHHHHHHHcCCCCCCcccHHHHHHHHHHHHc
Confidence 4689999999999998874 23467888999999999999999999983 889875
No 21
>TIGR03463 osq_cycl 2,3-oxidosqualene cyclase. This model identifies 2,3-oxidosqualene cyclases from Stigmatella aurantiaca which produces cycloartenol, and Gemmata obscuriglobus and Methylococcus capsulatus which each produce the closely related sterol, lanosterol.
Probab=97.93 E-value=4.9e-05 Score=96.97 Aligned_cols=114 Identities=20% Similarity=0.253 Sum_probs=73.4
Q ss_pred hhhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCCc----hHHHHHHHhhcccccCCcccccccCCCCC
Q psy5522 1157 MNTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHPV----KDVAFNLLESKAHNEDGKKWWKRAERPED 1223 (1397)
Q Consensus 1157 ~~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~~----~~~~~~~L~~~a~~~~~~~~W~~~~~~~~ 1223 (1397)
..++.+|+.||.+.-. +....|..+.+.-||..+|... .+.+.+-|.+.-. .||. |...-..-.
T Consensus 476 ~~ai~rav~~L~~~Q~~dGsW~g~Wg~~~~Y~T~~al~aL~~~G~~~~~~~i~rA~~~Ll~~Q~-~DGg--Wg~~~~s~~ 552 (634)
T TIGR03463 476 TRAISRGVRFLRSRQREDGSFPGSWGVCFTYGTFHGVMGLRAAGASPDDMALQRAAAWLRSYQR-ADGG--WGEVYESCL 552 (634)
T ss_pred HHHHHHHHHHHHHhcCCCCCccccCCCCCcHHHHHHHHHHHHcCCCcCcHHHHHHHHHHHHccC-CCCC--ccCccCccc
Confidence 4679999999988532 2345788888888888877642 3557777776533 3442 432110000
Q ss_pred CCCCCCCCCCcccchhhHHHHHHhhccCC--cccchhhhhhhhhcccCCCCccc
Q psy5522 1224 KKNPWAQVPNSVDVEMTSYALLSYLDRGL--VEDTLPILTWLVTQQNDQGGFAS 1275 (1397)
Q Consensus 1224 ~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~--~~~a~~i~~Wl~~q~n~~Ggf~s 1275 (1397)
...|-....-++..||+||++++..+. -....+-++||.++|++.|||..
T Consensus 553 --~~~y~~~~~S~~~~TA~Al~aL~~~g~~~~~~i~rgi~~L~~~Q~~dG~W~~ 604 (634)
T TIGR03463 553 --QARYVEGKQSQAVMTSWALLALAEAGEGGHDAVQRGVAWLRSRQQEDGRWPR 604 (634)
T ss_pred --cccccCCCCCcHHHHHHHHHHHHHcCCcCCHHHHHHHHHHHHhCCCCCCCCC
Confidence 011111122457899999999998663 34567789999999999999953
No 22
>TIGR01787 squalene_cyclas squalene/oxidosqualene cyclases. This family of enzymes catalyzes the cyclization of the triterpenes squalene or 2-3-oxidosqualene to a variety of products including hopene, lanosterol, cycloartenol, amyrin, lupeol and isomultiflorenol.
Probab=97.87 E-value=9.8e-05 Score=94.16 Aligned_cols=119 Identities=18% Similarity=0.281 Sum_probs=77.7
Q ss_pred chhhHHHHHHHHHhhcC--C-------CCchHHHHHHHHHHhhCCCC-----chHHHHHHHhhcccccCCcccccccCCC
Q psy5522 1156 RMNTINKAVDYIVKNLA--G-------TEDAYAIAICSYALHLAQHP-----VKDVAFNLLESKAHNEDGKKWWKRAERP 1221 (1397)
Q Consensus 1156 ~~~~~~~a~~~l~~~~~--~-------~~~~y~~al~aYal~la~~~-----~~~~~~~~L~~~a~~~~~~~~W~~~~~~ 1221 (1397)
....+++|+.||.+.-. + ....|..+.+.-||+.+|.. ..+.+++-|.+.-..+|| |...-..
T Consensus 458 ~~~~i~rAl~~L~~~Q~~DGsw~g~wg~~y~YgT~~al~aL~~~G~~~~~~~~i~rA~~~L~~~Q~~DGG---Wge~~~s 534 (621)
T TIGR01787 458 IRNVLERALEYLRREQRADGSWFGRWGVNYTYGTGFVLSALAAAGRTYRNCPEVQKACDWLLSRQMPDGG---WGEDCFS 534 (621)
T ss_pred HHHHHHHHHHHHHHhcCCCCCCcccCCCCCchhHHHHHHHHHHhCCcccCCHHHHHHHHHHHhhcCCCCC---CCcCCcc
Confidence 34689999999987532 2 23467888888888888753 235577777766444444 3211000
Q ss_pred CCCCCCCCCCCCcccchhhHHHHHHhhccCC--cccchhhhhhhhhcccCCCCccccccC
Q psy5522 1222 EDKKNPWAQVPNSVDVEMTSYALLSYLDRGL--VEDTLPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus 1222 ~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~~--~~~a~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
.....|.....-++..|||||++++..++ .....+.++||.+.|++.|||.....|
T Consensus 535 --~~~~~y~~~~~S~~s~Ta~AL~AL~~ag~~~~~ai~rgv~~L~~~Q~~dG~w~~~~~~ 592 (621)
T TIGR01787 535 --YEDPSYVGSGGSTPSQTGWALMALIAAGEADSEAIERGVKYLLETQRPDGDWPQEYIT 592 (621)
T ss_pred --ccccccCCCCCCCHHHHHHHHHHHHHcCccchHHHHHHHHHHHHhCCCCCCCCCcccc
Confidence 00011111122478899999999997663 346778999999999999999776655
No 23
>PF05326 SVA: Seminal vesicle autoantigen (SVA); InterPro: IPR007990 This family consists of seminal vesicle autoantigen and prolactin-inducible (PIP) proteins. Seminal vesicle autoantigen (SVA) is specifically present in the seminal plasma of mice. This 19 kDa secretory glycoprotein suppresses the motility of spermatozoa by interacting with phospholipid. PIP has several known functions. In saliva, this protein plays a role in host defence by binding to microorganisms such as Streptococcus. PIP is an aspartyl proteinase and it acts as a factor capable of suppressing T-cell apoptosis through its interaction with CD4 [].; GO: 0005576 extracellular region; PDB: 3ES6_B.
Probab=97.79 E-value=0.00017 Score=70.24 Aligned_cols=84 Identities=17% Similarity=0.150 Sum_probs=50.0
Q ss_pred EeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCee
Q psy5522 711 VFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGS 790 (1397)
Q Consensus 711 vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~ 790 (1397)
..||+-+++.+|...++||+++++++|-|++.+|+.|++.|..+..++. .....++-|+|.+..+
T Consensus 30 ~~k~l~l~l~~~~~~~~~eev~v~l~V~t~~~eCmvVk~yl~sn~~i~~---------------~fny~YTaCLC~d~~r 94 (124)
T PF05326_consen 30 SRKPLSLNLQVPQTAKANEEVTVTLTVTTELRECMVVKIYLESNPPIDG---------------SFNYKYTACLCDDYPR 94 (124)
T ss_dssp -----EE-----SEE-SS--EEEEEEEEE--SS-EEEEEEEEESS---S---------------GGG-EEEEEE-SSS-E
T ss_pred cCccEEEEeecCCCCCCCCEEEEEEEEEcchheeEEEEEEeccCCCccc---------------cccceEEEEeCCCCCc
Confidence 8999999999999999999999999999999999999999998876642 2244789999999999
Q ss_pred EEEEEEEeceeeeEEEEEE
Q psy5522 791 TTTFVITPKELGYIGIKVT 809 (1397)
Q Consensus 791 tv~f~v~p~~lG~v~itV~ 809 (1397)
++.|.|...+-+.+...+.
T Consensus 95 ~FyWDi~~~~t~~i~~~ad 113 (124)
T PF05326_consen 95 TFYWDIQVNRTVTIAAVAD 113 (124)
T ss_dssp EEEEEE--SS-EEEEEEEE
T ss_pred cEEEEEEECceEEEEEEEE
Confidence 9999999877665544433
No 24
>PLN03012 Camelliol C synthase
Probab=97.62 E-value=0.00017 Score=91.74 Aligned_cols=70 Identities=14% Similarity=0.179 Sum_probs=52.6
Q ss_pred hccccceeeecCCCCccccCCCC-------------------CCCCcchhHHHHHHHHHccCCCc------cCHHHHHHH
Q psy5522 931 TGYQQELTYRRPDGSFSAFGTTD-------------------PNGSTWLTAFVAKSFRQAASHTT------IDESVILEA 985 (1397)
Q Consensus 931 ~gy~~~l~y~~~dGsfs~~~~~~-------------------~~~s~wlTAyvl~~~~~a~~~i~------vd~~~~~~~ 985 (1397)
.+..-+|++|+.||||++|.... +....+.||-|+++|...+++.. | ...+.+|
T Consensus 517 ~av~wlL~mQn~dGGwaafe~~~~~~~le~lnp~E~F~d~mid~~y~dcTa~~l~aL~~f~~~~~~~r~~~i-~~~i~rA 595 (759)
T PLN03012 517 DAVNILLSLQSKNGGMTAWEPAGAPEWLELLNPTEMFADIVIEHEYNECTSSAIQALILFKQLYPDHRTEEI-NAFIKKA 595 (759)
T ss_pred HHHHHHHhccCCCCCEeeecCCcchHHHHhcChhhhhcCeecCCCcccHHHHHHHHHHHHhhhCcccchhhh-HHHHHHH
Confidence 44445677999999999986421 13445669999999987665422 2 4678999
Q ss_pred HHHHHhcccCCCcccc
Q psy5522 986 LAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus 986 ~~WL~~~q~~~G~F~s 1001 (1397)
++||.+.|++||+|..
T Consensus 596 v~~L~~~Q~~DGsW~G 611 (759)
T PLN03012 596 AEYIENIQMLDGSWYG 611 (759)
T ss_pred HHHHHHhcCCCCCCcc
Confidence 9999999999999973
No 25
>PLN02993 lupeol synthase
Probab=97.40 E-value=0.0011 Score=84.79 Aligned_cols=114 Identities=19% Similarity=0.294 Sum_probs=73.1
Q ss_pred hhhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCC-----chHHHHHHHhhcccccCCcccccccCCCC
Q psy5522 1157 MNTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHP-----VKDVAFNLLESKAHNEDGKKWWKRAERPE 1222 (1397)
Q Consensus 1157 ~~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~-----~~~~~~~~L~~~a~~~~~~~~W~~~~~~~ 1222 (1397)
+..+++|+.||++.=. ++.-.|..+-+--||+.+|.. ..+.+.+=|.+....+|| |..+-..-
T Consensus 589 ~~~i~rAv~yL~~~Q~~DGSW~G~Wgv~y~YgT~~aL~aL~a~G~~~~~~~~IrrAv~fLls~Q~~DGG---WGEs~~S~ 665 (763)
T PLN02993 589 IKSIEKAVQFIESKQTPDGSWYGNWGICFIYATWFALGGLAAAGKTYNDCLAMRKGVHFLLTIQRDDGG---WGESYLSC 665 (763)
T ss_pred HHHHHHHHHHHHHhcCCCCCcccccccccCcHHHHHHHHHHHcCCCCCCcHHHHHHHHHHHHhcCCCCC---cCcCcCcC
Confidence 4689999999987421 245678777777788888764 235577777776555544 43221000
Q ss_pred CCCCCCCC-CCCcccchhhHHHHHHhhccCCcc----cchhhhhhhhhcccCCCCcc
Q psy5522 1223 DKKNPWAQ-VPNSVDVEMTSYALLSYLDRGLVE----DTLPILTWLVTQQNDQGGFA 1274 (1397)
Q Consensus 1223 ~~~~~~~~-~~~s~~vE~t~Y~lL~~~~~~~~~----~a~~i~~Wl~~q~n~~Ggf~ 1274 (1397)
.. ..+-. ....-++.-||+|||+++..|.-+ .-.+-++||.+.|...|+|-
T Consensus 666 ~~-~~y~~~~~~~St~~qTAwAllaL~~aG~~~~~~~~l~Rgi~~L~~~Q~~~G~W~ 721 (763)
T PLN02993 666 PE-QRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDLIPLHRAAKLIITSQLENGDFP 721 (763)
T ss_pred CC-cccccCCCCCCchhhHHHHHHHHHHcCCCCCCcHHHHHHHHHHHhccCCCCCCC
Confidence 00 00111 112235678999999999887532 23468999999999999983
No 26
>cd00688 ISOPREN_C2_like This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein.
Probab=97.38 E-value=0.00031 Score=81.28 Aligned_cols=85 Identities=22% Similarity=0.185 Sum_probs=60.3
Q ss_pred CchHHHHHHHHHHhhCCCC----chHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhhcc
Q psy5522 1175 EDAYAIAICSYALHLAQHP----VKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYLDR 1250 (1397)
Q Consensus 1175 ~~~y~~al~aYal~la~~~----~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~~~ 1250 (1397)
.+....|.+.+||++++.. ....+++-|.+.-..+++...| ...+...|+.++.++...
T Consensus 138 ~~~~~t~~al~aL~~~~~~~~~~~~~~~~~~l~~~q~~~g~~~~~-----------------~~~~~~~t~~~~~aL~~~ 200 (300)
T cd00688 138 SDVRLTAYALIALALLGKLDPDPLIEKALDYLLSCQNYDGGFGPG-----------------GESHGYGTACAAAALALL 200 (300)
T ss_pred CcccHHHHHHHHHHHcCCCCCcHHHHHHHHHHHHHhcCCCCcCCC-----------------ccccHHHHHHHHHHHHHc
Confidence 4566677888889998875 3355666666665544443111 113557889999998877
Q ss_pred CC--cccchhhhhhhhhcccCCCCcccc
Q psy5522 1251 GL--VEDTLPILTWLVTQQNDQGGFAST 1276 (1397)
Q Consensus 1251 ~~--~~~a~~i~~Wl~~q~n~~Ggf~st 1276 (1397)
++ -......++||.+.|+..|||...
T Consensus 201 ~~~~~~~~~~~~~~L~~~q~~~g~~~~~ 228 (300)
T cd00688 201 GDLDSPDAKKALRWLLSRQRPDGGWGEG 228 (300)
T ss_pred CCcchHHHHHHHHHHHHhcCCCCCcCcc
Confidence 76 567889999999999999999654
No 27
>cd02889 SQCY Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Probab=97.31 E-value=0.00067 Score=81.26 Aligned_cols=91 Identities=19% Similarity=0.248 Sum_probs=63.5
Q ss_pred HHHHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeeecCCCCccccCC--------CCCCCCcchhHHHHHHHHHccCC
Q psy5522 904 NIVVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGT--------TDPNGSTWLTAFVAKSFRQAASH 974 (1397)
Q Consensus 904 ~i~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~--------~~~~~s~wlTAyvl~~~~~a~~~ 974 (1397)
+.+++.-|...+. ...+..++++++|.+ .|++||+|+.... ....+++..|||++++|+.++..
T Consensus 222 ta~a~~aL~~~g~~~~~~~~~~~~~~L~~-------~Q~~dG~w~~~~~~~~~~~~~~~~~~~~~~Ta~al~aL~~~g~~ 294 (348)
T cd02889 222 TWFALEALAAAGEDENSPYVRKACDWLLS-------KQNPDGGWGESYESYEDPSYAGGGRSTVVQTAWALLALMAAGEP 294 (348)
T ss_pred HHHHHHHHHHcCCCcCcHHHHHHHHHHHH-------ccCCCCCcCCcccccccccccCCCCCcHHHHHHHHHHHHhcCCC
Confidence 3444444444443 222333456665543 5778998764210 12356889999999999998764
Q ss_pred CccCHHHHHHHHHHHHhcccCCCccccchH
Q psy5522 975 TTIDESVILEALAWLSSNQAVNGSFPEVGK 1004 (1397)
Q Consensus 975 i~vd~~~~~~~~~WL~~~q~~~G~F~stqd 1004 (1397)
|...+.++++||.++|+++|+|.|+|+
T Consensus 295 ---~~~~v~~a~~wL~~~Q~~dG~f~~~~~ 321 (348)
T cd02889 295 ---DSEAVKRGVKYLLNTQQEDGDWPQEEI 321 (348)
T ss_pred ---CHHHHHHHHHHHHHhcCCCCCcCCcee
Confidence 667899999999999999999999985
No 28
>cd02890 PTase Protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). The protein prenyltransferase family of lipid-modifying enzymes includes protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II). They catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between the C1 atom of farnesyl (15-carbon by FTase) or geranylgeranyl (20-carbon by GGTase-I, II) isoprenoid lipids and cysteine residues at or near the C-terminus of protein acceptors. FTase and GGTase-I prenylate the cysteine in the terminal sequence, "CAAX"; and GGTase-II prenylates both cysteines in the "CC" (or "CXC") terminal sequence. These enzymes are heterodimeric with both alpha and beta subunits re
Probab=97.01 E-value=0.01 Score=69.14 Aligned_cols=91 Identities=18% Similarity=0.097 Sum_probs=65.1
Q ss_pred chHHHHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeee-cCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCH
Q psy5522 902 VPNIVVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYR-RPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDE 979 (1397)
Q Consensus 902 ~p~i~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~-~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~ 979 (1397)
.-..+++.-|.--+. +....+++++++|.. .| ++||+|+.... ..++++.|.+++.++...+.- ..+.
T Consensus 28 ~~~y~~l~~l~ll~~~~~~~~~~~~i~~l~~-------~q~~~~Ggf~~~~~--~~~~~~~T~~al~~l~llg~~-~~~~ 97 (286)
T cd02890 28 WLLYWILSSLDLLGEDLDDENKDEIIDFIYS-------CQVNEDGGFGGGPG--QDPHLASTYAAVLSLAILGDD-ALSR 97 (286)
T ss_pred HHHHHHHHHHHHhCCCcchHHHHHHHHHHHH-------hhcCCCCCCCCCCC--CCccHHHHHHHHHHHHHcCcc-ccch
Confidence 333444444443343 566778889988775 45 78999988643 367899999999999988752 1233
Q ss_pred HHHHHHHHHHHhcccCCCccccc
Q psy5522 980 SVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 980 ~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
....++++||.+.|+++|+|...
T Consensus 98 ~~~~~~~~~l~~~q~~dGgf~~~ 120 (286)
T cd02890 98 IDREKIYKFLSSLQNPDGSFRGD 120 (286)
T ss_pred hhHHHHHHHHHHhcCCCCCcccC
Confidence 34567899999999999999753
No 29
>PF13249 Prenyltrans_2: Prenyltransferase-like; PDB: 1O6R_B 1O6Q_B 1H35_C 1H3A_C 1SQC_A 1UMP_A 1O6H_C 1O79_B 1GSZ_C 1H37_C ....
Probab=97.01 E-value=0.001 Score=65.58 Aligned_cols=91 Identities=19% Similarity=0.230 Sum_probs=60.2
Q ss_pred hcccchHHHHHHHHHHhccccHHHHHHH--HHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCC
Q psy5522 898 MLNFVPNIVVLEYLKKTYQLTDAIEAKA--SRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHT 975 (1397)
Q Consensus 898 m~~~~p~i~~l~yL~~~~~l~~~~~~~a--~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i 975 (1397)
.+....+.+++.-|...+...+ ++++ .++|.+ -|++||+|+..... ...+++.|++++.++......
T Consensus 20 ~~~~~~T~~al~aL~~~g~~~~--~~~~~~~~~L~~-------~q~~dGg~~~~~~~-~~~~~~~t~~~l~~l~~~~~~- 88 (113)
T PF13249_consen 20 PSDVWDTAFALLALAALGEEPD--RDRAAAVEWLLS-------QQNPDGGWGSNPDG-GPPDVYTTYVALAALELLGRP- 88 (113)
T ss_dssp -BEHHHHHHHHHHHHHHTSHHC--HHHHHHHHHHHH-------HB-TTSGBBSSTTT-T-BSHHHHHHHHHHHHHHT-G-
T ss_pred CCCHHHHHHHHHHHHHhCCccc--HHHHHHHHHHHH-------hCCCCCCccCCCCC-CCccHHHHHHHHHHHHHcCCC-
Confidence 3444555566666665554322 4455 777765 47799999887532 356789999999988766543
Q ss_pred ccCHHHHHHHHHHHHhcccCCCccc
Q psy5522 976 TIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus 976 ~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
-+...+.++++||.++|+++|+|.
T Consensus 89 -~~~~~~~~a~~~l~~~Q~~dGg~~ 112 (113)
T PF13249_consen 89 -DDEEAVRKAVDWLLSCQNPDGGWG 112 (113)
T ss_dssp -GCHTTHCCHHHHHHHTB-TTSSB-
T ss_pred -cccHHHHHHHHHHHHhcCCCCCCC
Confidence 224678999999999999999995
No 30
>PF13249 Prenyltrans_2: Prenyltransferase-like; PDB: 1O6R_B 1O6Q_B 1H35_C 1H3A_C 1SQC_A 1UMP_A 1O6H_C 1O79_B 1GSZ_C 1H37_C ....
Probab=96.91 E-value=0.00057 Score=67.43 Aligned_cols=60 Identities=25% Similarity=0.417 Sum_probs=46.1
Q ss_pred ceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHH--HHHHHhcccCCCccccch
Q psy5522 936 ELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEA--LAWLSSNQAVNGSFPEVG 1003 (1397)
Q Consensus 936 ~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~--~~WL~~~q~~~G~F~stq 1003 (1397)
+++.|++||+|+.+ . ..++++.|++++.+|..+... ....++ ++||.++|+++|+|.+..
T Consensus 4 L~~~Q~~dGgw~~~-~--~~~~~~~T~~al~aL~~~g~~-----~~~~~~~~~~~L~~~q~~dGg~~~~~ 65 (113)
T PF13249_consen 4 LLSRQNPDGGWGGF-G--GPSDVWDTAFALLALAALGEE-----PDRDRAAAVEWLLSQQNPDGGWGSNP 65 (113)
T ss_dssp HHHHB-TTSSBBSS-T--S-BEHHHHHHHHHHHHHHTSH-----HCHHHHHHHHHHHHHB-TTSGBBSST
T ss_pred hHHHcCCCCCCcCC-C--CCCCHHHHHHHHHHHHHhCCc-----ccHHHHHHHHHHHHhCCCCCCccCCC
Confidence 45579999999886 2 367899999999999988753 225556 999999999999998754
No 31
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=96.67 E-value=0.044 Score=56.05 Aligned_cols=122 Identities=15% Similarity=0.217 Sum_probs=79.8
Q ss_pred EEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEE
Q psy5522 101 FIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINV 180 (1397)
Q Consensus 101 fIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~ 180 (1397)
-|.+||..|+|||++++.+.+-.. ...+.+.|.. +|..+. +.......|.-..+|+++++...+-|-+....
T Consensus 2 ~i~~~~~~~~~Ge~~~v~v~~~~~------~~~~~~~v~s-~g~I~~-~~~~~~~~~~~~~~~~v~~~~~P~~~v~~~~v 73 (136)
T PF07703_consen 2 QISTDKDSYKPGETAKVTVQSPFP------NGTFLYLVES-RGKIVS-TGSVELKNGSTTFEFPVTPDMAPNFYVLAYYV 73 (136)
T ss_dssp EEEE-SSSB-TTSEEEEEEEEESC------ESEEEEEEEE-TTEEEE-EEEEECTTTSSEEEEEE-GGGTSEEEEEEEEE
T ss_pred EEEcCCCCcCCCCEEEEEEEcCCC------ccEEEEEEEE-CCeEEE-EEEEEecCCcEEEEEecchhcCCcEEEEEEEE
Confidence 478999999999999999887543 3555666654 444443 33444666777999999988777665555554
Q ss_pred ---cC--eEEEEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecCccccceEEEEE
Q psy5522 181 ---LD--QKFTKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYGKPVKGEATITA 238 (1397)
Q Consensus 181 ---~~--~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v 238 (1397)
++ ...+..|.|+.-. +.+|+++..+..+..++++++++.+ |..+.+.+.+
T Consensus 74 ~~~~g~~~~~s~~i~V~~~~--~~~v~l~~~~~~~~Pg~~~~~~i~~------~~~s~v~l~~ 128 (136)
T PF07703_consen 74 RPADGEVVADSVWIEVEPCF--ELKVELTASPDEYKPGEEVTLRIKA------PPNSLVGLSA 128 (136)
T ss_dssp TTCTCEEEEEEEEEEBGCSG--SSSEEEEESSSSBTTTSEEEEEEEE------STTEEEEEEE
T ss_pred cCCCCeEEEEEEEEEecccc--cceEEEEEecceeCCCCEEEEEEEe------CCCCEEEEEE
Confidence 23 3467789998822 4555555655556689999999998 4444454443
No 32
>cd02894 GGTase-II Geranylgeranyltransferase type II (GGTase-II)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-IIs are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-II ). GGTase-II catalyzes alkylation of both cysteine residues in Rab proteins containing carboxy-terminal "CC", "CXCX" or "CXC" motifs. PTases are heterodimeric with both alpha and beta subunits required for catalytic activity. In contrast to other prenyltr
Probab=96.62 E-value=0.024 Score=65.95 Aligned_cols=104 Identities=15% Similarity=0.074 Sum_probs=66.1
Q ss_pred cCCCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeee-cCCCCccccCCCCCCCCcchhHHHHHH
Q psy5522 889 MPFGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYR-RPDGSFSAFGTTDPNGSTWLTAFVAKS 967 (1397)
Q Consensus 889 ~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~-~~dGsfs~~~~~~~~~s~wlTAyvl~~ 967 (1397)
+|+.=..+..+++.-.-+++.-|.--+++....+++.++++.+ .+ ..+|+|...... .+++..|-+++.+
T Consensus 18 ~~~~~~~~~~~r~~~~y~~l~~l~lL~~~~~~~~~~~i~fl~~-------~q~~~~GgF~~~~~~--~~~~~~t~~a~~~ 88 (287)
T cd02894 18 DDYEYILTEHLRMSGIYWGLTALDLLGQLERLNREEIIEFVKS-------CQDNEDGGFGGSPGH--DPHILSTLSAIQI 88 (287)
T ss_pred CChhhhhHHhhHHHHHHHHHHHHHHhCCCchhhHHHHHHHHHH-------HhcCCCCCCCCCCCC--cchHHHHHHHHHH
Confidence 3554344444555555555544444455444445777777765 34 568999875432 4577778777777
Q ss_pred HHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 968 FRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 968 ~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
+...+.+-.||+ ..+++++||.+.|+++|||...
T Consensus 89 L~ll~~~~~i~~-~~~~~~~~i~~~q~~dGgf~~~ 122 (287)
T cd02894 89 LALYDLLNKIDE-NKEKIAKFIKGLQNEDGSFSGD 122 (287)
T ss_pred HHHhhhhhhccH-HHHHHHHHHHHHcCCCCCeecC
Confidence 776543323453 4678999999999999999854
No 33
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=96.27 E-value=5.6 Score=50.72 Aligned_cols=417 Identities=15% Similarity=0.151 Sum_probs=212.9
Q ss_pred CCCceeEEEEEec---CceeeEeeEEEEEEeccc-----EEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEE
Q psy5522 68 LGPGQYNLTVKGS---GSLNFYNSTGLVYVHKSY-----SVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFIT 139 (1397)
Q Consensus 68 ~~~~~~~l~v~g~---~~~~f~~~~~v~~~~~~~-----~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~ 139 (1397)
..+|.|.|.+... +.-.|..++.+.+.-+.. .-|.....|+.. |+.|.|.+-+....-+- --....+.
T Consensus 155 ~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~~~v~I~~F~~lns~~i~-~~eI~f~~~a~~~~g~~---~LYKF~~i 230 (667)
T PRK14081 155 NKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEIDKVEITDFKCLNKELIC-DEELVFEVESVYEEDRT---ILYKFVKI 230 (667)
T ss_pred cCCCcEEEEEEEecCCCccccCcceEEEEEcccCcceEEEeccccCcceec-CcEEEEEEEEEeCCCce---EEEEEEEE
Confidence 3567788877664 224566666666655433 233346667766 88888887665433211 12233455
Q ss_pred CCCCCeEEeeeeecc--CCceeeeeeecCCCCCCeeEEEEEEEcC--------eEEEEEEEEeeecCCcEE-EEEeCCCc
Q psy5522 140 DGKGNRIKQWTRALT--TRGVFSADLQLSKSPVLGDWNITINVLD--------QKFTKRFTVAEYVLPKFE-VNVNVPPH 208 (1397)
Q Consensus 140 DP~g~~I~qw~~~~~--~~Gi~~~sf~L~~~~~lG~y~I~v~~~~--------~~~~~~F~VeEYvLPkFe-V~v~~p~~ 208 (1397)
|++|.... .++ ...++ ++. | ...|.|+|.+.+.+ ......|.|.-|..++.+ +.+...+.
T Consensus 231 ~~~G~~~~----~qdYst~n~~--~y~-~--~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~Vk~~~~vkI~~~~~d~~s~ 301 (667)
T PRK14081 231 DSDGKQTC----IQDYSTKNIV--SYK-E--KKSGDYKLLCLVKDMYSNNEFDDRAVLVYTVKPYKDIKIRNFTTDLSSP 301 (667)
T ss_pred CCCCCEEE----ecCccccceE--EEE-e--CCCccEEEEEEEeccCcccccccceEEEEEEecCCCcEEEEEEEcCCCC
Confidence 77774332 211 12222 222 2 25799999999976 345668999888877753 23333333
Q ss_pred eEeeCCeEEEEEEEEEecCccccceEEEEEEeccccCccc--ccc--ccceeEEeecCceEEEEEechhhhccccccceE
Q psy5522 209 ATFKDSKVVISVNAKYTYGKPVKGEATITAYPTIFSGVIQ--PLF--QTPVRKVVPIDGKTVIEFDVVKELQLTDEYERN 284 (1397)
Q Consensus 209 i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v~~~~~~g~~~--~~~--~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~ 284 (1397)
+ +.++.+.+++.|. -|. .+.++.-.+ |... .++ +..+.=+-...|...+.+-+++ --..+.|...
T Consensus 302 q-l~g~~I~ika~a~--GG~----~llYrf~I~---G~~~e~~~Y~~~n~~~w~P~~~G~Y~I~v~VKD-k~S~~eyD~~ 370 (667)
T PRK14081 302 Q-LTDTDIELKAVAE--GGK----ELLYRFIIK---GKESEDSGYIRNNIYTWKPKIAGKYSITLWVKD-ISSKGEYEDK 370 (667)
T ss_pred e-ecCceEEEEEEec--CCC----ceEEEEEEC---CcEEeeccccccceEEEeeCCCceEEEEEEEEc-CcCcccccce
Confidence 3 4678888888776 232 222332211 1100 000 0111112223455544444433 1111122222
Q ss_pred EEEEEEEEECCcceEEeeEEEEEEEEeeeEEEEE--ecCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc
Q psy5522 285 IHFDVAVEEALTGRRQNNTGSVVFHKHKYKMDLI--KSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES 362 (1397)
Q Consensus 285 l~V~a~VtE~gTG~~~~~~~~i~i~~~~~~i~f~--~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~ 362 (1397)
-.++-+|.|.. ..|.+|.-. +....+--|.++.+.+.+.+.++ +. ....+..+
T Consensus 371 ~~v~f~i~~~~--------------~~p~~I~~vl~d~~~~~lvG~~i~i~v~a~gg~~--~l-----Y~f~ik~n---- 425 (667)
T PRK14081 371 SSIDYTIEEKS--------------KEPIKIEDVILDKGKHILKGEEIKIRVIAEGGTN--LR-----YSFIIKKD---- 425 (667)
T ss_pred EEEEEEEcccC--------------CCCeEEEEEEECCCCceEeCCeEEEEEEecCCCe--EE-----EEEEEEEC----
Confidence 22222222211 013333322 22345556888877776655432 21 11111100
Q ss_pred ccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcce-----EEEEEEee-cCCCC-CeEEEEecCCCCCCCCeEE
Q psy5522 363 KYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKE-----WFSTISAS-ESPSN-SFIQAALLTQNPKVNKDVE 435 (1397)
Q Consensus 363 ~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~-----~~~~v~~~-~S~s~-s~L~i~~~~~~~~~G~~~~ 435 (1397)
+.... .++=.-.-.+.|.| ...|.|.|.|.|+++-+.. ....+..+ +.|-. +++ |.+.++.+.||+++.
T Consensus 426 g~~ve--~~~Y~~~~~~~f~P-~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e~~P~~i~~i-l~~~~~~~~vg~~i~ 501 (667)
T PRK14081 426 GKEEE--KIDYGKNNWVNFIP-EEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHEYIPAEIDYI-LLPVKEYYLVGDDIE 501 (667)
T ss_pred CEEEE--EeecccccEEEEEE-CCCeeEEEEEEEecccCchhcccceEEEEEEeccCceeeeeE-EecccccEEeCCEEE
Confidence 01001 11111111344544 3557888888887765432 11222222 34532 333 456677899999999
Q ss_pred EEEEecCC-cceEEEEEEeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEecCCeEEEE---eEEEEEccc
Q psy5522 436 LEINSTAP-LKYISYQVLGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVREDGEVVAD---GLDLELEGG 511 (1397)
Q Consensus 436 v~v~s~~~-~~~~~ylV~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~gevvad---s~~~~V~~~ 511 (1397)
+.+.+... .-..-|.++.+|..+...-..- +. .++| .|.-+-.-.+.||+-+.-.....| .+.|.|...
T Consensus 502 ~~~~~~~~k~v~y~y~~~~NG~~v~~t~Ys~---~~--~ysf--~P~~~GkY~I~V~aKn~~s~~~~D~~k~v~~~V~e~ 574 (667)
T PRK14081 502 IEVIIQNTKDVLIKYILKINGHKVEETDYIK---NK--KYKF--IPKCSGKYTIEVLAKNIKSTEEYDSKKEVKFYVREA 574 (667)
T ss_pred EEEEEeCCCeEEEEEEEEECCEEEEEeeccc---cc--eEEE--eecCCceEEEEEEEcccccccccccceEEEEEEcCC
Confidence 99986532 2236688899998875332221 12 2333 244555566777765433333333 356777644
Q ss_pred cc-ceeeEEecCCCCCCCceEEEEEEecCCcEEE
Q psy5522 512 LQ-NFVSANVSPDETEPGSNIQINLEAKPNSYIG 544 (1397)
Q Consensus 512 f~-nkV~l~~s~~~~~Pg~~v~l~V~a~p~S~v~ 544 (1397)
++ ....+..++....+|+.+++.+.+.-|.-|.
T Consensus 575 ~pi~nt~~~~~~~~~~~n~~~t~~~~~~gg~~v~ 608 (667)
T PRK14081 575 LPITNTKIKTSKKKFKCNEEVTFSVKSEGGKDVC 608 (667)
T ss_pred CCceeeEEEeecceEEcCCeEEEEEEccCCCcEE
Confidence 32 2356666778899999999999988766553
No 34
>COG1470 Predicted membrane protein [Function unknown]
Probab=96.23 E-value=0.25 Score=58.72 Aligned_cols=158 Identities=18% Similarity=0.259 Sum_probs=110.6
Q ss_pred eeEEEeeeEEEEecCC--c-eeeeCCEEEEEEEEEcCCCccEEEEEEEe-ecCCeeeeecCCCccCCCCCCcceeeeEEE
Q psy5522 707 KKLRVFRPFFISLDLP--Y-SVMRGEVVAIPVVVFNYLSQDLVADVTLE-NVGQFDFADFSNEVDAAPQPKFEVFRRKKL 782 (1397)
Q Consensus 707 ~~l~vfkpffv~l~LP--y-sv~rGE~~~l~~~VfNyl~~~~~v~v~L~-~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 782 (1397)
..+.-.+.||.+++.- | -+..++.+.+.+++-|...++.++.+.+. .++++...-.+.+ ...-++
T Consensus 259 ls~~~k~~~ftEl~~s~~~~~i~~~~t~sf~V~IeN~g~~~d~y~Le~~g~pe~w~~~Fteg~-----------~~vt~v 327 (513)
T COG1470 259 LSVTEKKSYFTELNSSDIYLEISPSTTASFTVSIENRGKQDDEYALELSGLPEGWTAEFTEGE-----------LRVTSV 327 (513)
T ss_pred EEEEeccceEEEeecccceeEEccCCceEEEEEEccCCCCCceeEEEeccCCCCcceEEeeCc-----------eEEEEE
Confidence 4455567888888765 3 34568888999999999999999998888 5565543322221 335589
Q ss_pred EEeCCCeeEEEEEEEec---eeeeEEEEEEEEeCC---------------------------------------------
Q psy5522 783 TIKANSGSTTTFVITPK---ELGYIGIKVTATSNL--------------------------------------------- 814 (1397)
Q Consensus 783 ~v~a~~~~tv~f~v~p~---~lG~v~itV~A~~~~--------------------------------------------- 814 (1397)
.|.+|+.+.+...|.|. ..|.-+++|.|.+..
T Consensus 328 kL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~s~s~v~~e~~lki~~~g~~~~~v~l~~g~~~lt~taGee~~i~i~I~N 407 (513)
T COG1470 328 KLKPGEEKDVTLEVYPSLNATPGTYNVTITASSSSGVTRELPLKIKNTGSYNELVKLDNGPYRLTITAGEEKTIRISIEN 407 (513)
T ss_pred EecCCCceEEEEEEecCCCCCCCceeEEEEEeccccceeeeeEEEEeccccceeEEccCCcEEEEecCCccceEEEEEEe
Confidence 99999999999999984 589999999987630
Q ss_pred -CCceeE-EEEEec-CCCeeeEEEEEEEEe-cCCCceeEEEEEecCCCCcCCCCcEEEEEEEcccc
Q psy5522 815 -AGDSME-GKLLVK-PEGETQYKNKAIFVD-LRKNKTFSVNVTLDMPKNIVPGSEHVEVSAVGDLL 876 (1397)
Q Consensus 815 -~~D~v~-k~L~V~-peG~~~~~t~s~~l~-~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv~gd~l 876 (1397)
..+.++ -.|.|+ |-|++.+...+ .|+ ++.+...+-++++++|+++.+|--++++++..|.+
T Consensus 408 sGna~LtdIkl~v~~PqgWei~Vd~~-~I~sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~ksDq~ 472 (513)
T COG1470 408 SGNAPLTDIKLTVNGPQGWEIEVDES-TIPSLEPGESKTVSLTITVPEDAGAGDYRITITAKSDQA 472 (513)
T ss_pred cCCCccceeeEEecCCccceEEECcc-cccccCCCCcceEEEEEEcCCCCCCCcEEEEEEEeeccc
Confidence 111111 233443 66666665554 332 22223456777889999999999999999999943
No 35
>cd02893 FTase Protein farnesyltransferase (FTase)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). FTases are a subgroup of PTase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. These proteins are heterodimers of alpha and beta subunits. Both subunits are required for catalytic activity. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids. Ftase attaches a 15-carbon farnesyl group to the cysteine within the C-terminal CaaX motif of substrate proteins when X is Ala, Met, Ser, Cys or Gln. Protein farnesylation has been shown to play critical roles in a variety of cellular pro
Probab=96.10 E-value=0.1 Score=60.93 Aligned_cols=77 Identities=16% Similarity=0.122 Sum_probs=53.7
Q ss_pred ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCC
Q psy5522 917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVN 996 (1397)
Q Consensus 917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~ 996 (1397)
+....+++.+++|.+ .|++||+|+.-.. ..++++.|=+++.++...+.--..+.-..+++++||.+.|+++
T Consensus 44 ~~~~~~~~~i~~i~~-------~q~~~GgF~~~~~--~~~h~~~Ty~A~~~L~ll~~~~~~~~id~~~~~~~l~~~q~~d 114 (299)
T cd02893 44 LDQSYADDVISFLRR-------CQNPSGGFGGGPG--QLPHLATTYAAVNALAIIGTEEAYDVIDREALYKFLLSLKQPD 114 (299)
T ss_pred ccHHHHHHHHHHHHH-------hcCCCCCCCCCCC--CCccHHHHHHHHHHHHHhCCchhhhHhhHHHHHHHHHHhcCCC
Confidence 455667788887765 4678999976322 3678899988888887766410011122345899999999999
Q ss_pred Cccccc
Q psy5522 997 GSFPEV 1002 (1397)
Q Consensus 997 G~F~st 1002 (1397)
|+|...
T Consensus 115 Ggf~~~ 120 (299)
T cd02893 115 GSFRMH 120 (299)
T ss_pred CCeeCC
Confidence 999865
No 36
>cd02892 SQCY_1 Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. This group contains bacterial SQCY which catalyzes the convertion of squalene to hopene or diplopterol and eukaryotic OSQCY which transforms the 2,3-epoxide of squalene to compounds such as, lanosterol in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain.
Probab=95.91 E-value=0.023 Score=73.45 Aligned_cols=90 Identities=19% Similarity=0.291 Sum_probs=63.9
Q ss_pred HHHHHHHHHHhccc--cHHHHHHHHHHHHhccccceeeecCCCCccc---------cCCCCCCCCcchhHHHHHHHHHcc
Q psy5522 904 NIVVLEYLKKTYQL--TDAIEAKASRYLETGYQQELTYRRPDGSFSA---------FGTTDPNGSTWLTAFVAKSFRQAA 972 (1397)
Q Consensus 904 ~i~~l~yL~~~~~l--~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~---------~~~~~~~~s~wlTAyvl~~~~~a~ 972 (1397)
+.+++.-|...+.. ..+..++|.++|.+ .|+.||+|+. +.+ ...+++..|||++++|+.++
T Consensus 507 T~~al~AL~~~G~~~~~~~~i~~a~~~L~s-------~Q~~DGgWge~~~s~~~~~~~~-~~~s~~~~TA~AllaLl~~g 578 (634)
T cd02892 507 TWFALEALAAAGEDYENSPYIRKACDFLLS-------KQNPDGGWGESYLSYEDKSYAG-GGRSTVVQTAWALLALMAAG 578 (634)
T ss_pred HHHHHHHHHHhCCcccCcHHHHHHHHHHHh-------cCCCCCCCCCccccccCcccCC-CCCCcHHHHHHHHHHHHHcC
Confidence 33455555554432 23344567776654 5788998752 111 13467889999999999887
Q ss_pred CCCccCHHHHHHHHHHHHhcccCCCccccchH
Q psy5522 973 SHTTIDESVILEALAWLSSNQAVNGSFPEVGK 1004 (1397)
Q Consensus 973 ~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqd 1004 (1397)
.. |...+.++++||.++|+++|+|.+++.
T Consensus 579 ~~---~~~~i~r~i~wL~~~Q~~~G~w~~~~~ 607 (634)
T cd02892 579 EP---DSEAVERGIKYLLNTQLPDGDWPQEEI 607 (634)
T ss_pred CC---ChHHHHHHHHHHHHcCCCCCCCCCccc
Confidence 63 667899999999999999999998875
No 37
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=95.75 E-value=1.1 Score=53.04 Aligned_cols=190 Identities=16% Similarity=0.229 Sum_probs=111.8
Q ss_pred eeCeeecCCeEEEEEEeecCCcc---------EEEEEEEEeecCCCC-eeEEeEEEE---------EC-CCceeEEEEEc
Q psy5522 6 APKVLRPNGEYHVAVSTQAVSEA---------TQVSVEVGGKQDSGG-EFRTKQLLS---------VD-PFSTRIVKLDI 65 (1397)
Q Consensus 6 ~P~~lr~g~~~~v~v~l~~~~~~---------v~v~v~L~~~~~~~~-~~~~~~~~~---------~~-~~~~~~~~f~v 65 (1397)
.|..|..|...++...|++.+++ +.+++++...++.+. +......++ ++ +..-..++-.+
T Consensus 98 lP~~l~~gE~lk~ta~L~~d~~~i~~~~~l~~v~~~v~f~~~~~~~~e~~~~~~~vvg~f~DdG~g~DE~p~DGvFT~~l 177 (374)
T TIGR03503 98 LPSPLFQGETLKVTAKLLNDGEPLKLPDFRDVVKLTVRFVSTNNAEDENFGADPIVVGEFEDDGEGLDERPGDGIFTGEF 177 (374)
T ss_pred CCccccCCCeEEEEEEEecCCEEeechhhhheEEEEEEEEecCCccccccccCCEEEEeeccCCccCCCCCCCceEEEEe
Confidence 58899999999999999986665 345555544322111 111111111 00 01112333344
Q ss_pred C-CCCCceeEEEEEecCc-eeeEeeEEEEEEec-ccEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCC
Q psy5522 66 G-DLGPGQYNLTVKGSGS-LNFYNSTGLVYVHK-SYSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGK 142 (1397)
Q Consensus 66 P-~~~~~~~~l~v~g~~~-~~f~~~~~v~~~~~-~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~ 142 (1397)
| ..++|.|++.+...++ +.=..+..+.+.+. =...|+|.+.| ++.=++.+-.-+..++| .+..+++++..|+
T Consensus 178 ~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~~~----~~~h~l~v~~d~~~i~~-~s~~~~~~~~~P~ 252 (374)
T TIGR03503 178 NLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSEDE----SGAHQLMVDADAGHIDP-GSLVIHGELVFPN 252 (374)
T ss_pred eccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccCCC----CCceEEEEEcccceecc-ccEEEEEEEECCC
Confidence 5 4467899999888544 22233456777654 34678898854 55434444444456778 5889999999999
Q ss_pred CCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-eEEEEEEEEeeecCCcEEEEEeCCC
Q psy5522 143 GNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-QKFTKRFTVAEYVLPKFEVNVNVPP 207 (1397)
Q Consensus 143 g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-~~~~~~F~VeEYvLPkFeV~v~~p~ 207 (1397)
|. +.+-.... .-....+.||.....|.|++....-+ ....+.|. +.||.+...+..|.
T Consensus 253 g~-~~~~~~~~---~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~---~~lpe~~F~~~~p~ 311 (374)
T TIGR03503 253 GQ-IQQFSIEL---EEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQ---LTLPEFSFNVEEPP 311 (374)
T ss_pred Cc-eEEecccC---ccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEE---EEcCCEeEEecCCC
Confidence 97 55433221 22344777888888999999988754 22233332 34566655555543
No 38
>TIGR01507 hopene_cyclase squalene-hopene cyclase. SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products.
Probab=95.72 E-value=0.029 Score=71.94 Aligned_cols=86 Identities=21% Similarity=0.296 Sum_probs=62.1
Q ss_pred HHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeeecCCCCccc---------cCCCCCCCCcchhHHHHHHHHHccCCC
Q psy5522 906 VVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYRRPDGSFSA---------FGTTDPNGSTWLTAFVAKSFRQAASHT 975 (1397)
Q Consensus 906 ~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~---------~~~~~~~~s~wlTAyvl~~~~~a~~~i 975 (1397)
.++.-|...+. ...+..++|+++|.+ -|++||+|+. +.+. ..+++..|||++.+|..++..
T Consensus 504 ~al~aL~~~g~~~~~~~i~rAv~wL~~-------~Q~~DGGWge~~~sy~~~~~~g~-g~s~~s~TA~AL~AL~~ag~~- 574 (635)
T TIGR01507 504 AVLSALKAVGIDTREPYIQKALAWLES-------HQNPDGGWGEDCRSYEDPAYAGK-GASTASQTAWALIALIAAGRA- 574 (635)
T ss_pred HHHHHHHHcCCCcccHHHHHHHHHHHH-------hcCCCCCCCCCCcccccccccCC-CCCcHHHHHHHHHHHHHhCCC-
Confidence 44445554442 233455678888765 4889999752 1111 246788999999999988753
Q ss_pred ccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 976 TIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 976 ~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
+...+.++++||.++|+++|+|.+.
T Consensus 575 --~~~~I~rav~~L~~~Q~~dG~W~e~ 599 (635)
T TIGR01507 575 --ESEAARRGVQYLVETQRPDGGWDEP 599 (635)
T ss_pred --CcHHHHHHHHHHHHhcCCCCCCCCc
Confidence 5577999999999999999999765
No 39
>COG1470 Predicted membrane protein [Function unknown]
Probab=95.59 E-value=1.1 Score=53.56 Aligned_cols=171 Identities=20% Similarity=0.311 Sum_probs=105.4
Q ss_pred eecCCeEEEEEEeecCC-ccEEEEEEEEeecC-CCCee----EEeEEEEECCCceeEEEEEc-C--CCCCceeEEEEEec
Q psy5522 10 LRPNGEYHVAVSTQAVS-EATQVSVEVGGKQD-SGGEF----RTKQLLSVDPFSTRIVKLDI-G--DLGPGQYNLTVKGS 80 (1397)
Q Consensus 10 lr~g~~~~v~v~l~~~~-~~v~v~v~L~~~~~-~~~~~----~~~~~~~~~~~~~~~~~f~v-P--~~~~~~~~l~v~g~ 80 (1397)
++++....+.|.+.+.. ++-++.+.+.+-++ .+..+ .....+.++++....+.++| | +..+|.|.+.|...
T Consensus 280 i~~~~t~sf~V~IeN~g~~~d~y~Le~~g~pe~w~~~Fteg~~~vt~vkL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~ 359 (513)
T COG1470 280 ISPSTTASFTVSIENRGKQDDEYALELSGLPEGWTAEFTEGELRVTSVKLKPGEEKDVTLEVYPSLNATPGTYNVTITAS 359 (513)
T ss_pred EccCCceEEEEEEccCCCCCceeEEEeccCCCCcceEEeeCceEEEEEEecCCCceEEEEEEecCCCCCCCceeEEEEEe
Confidence 55666656666665522 34455555553222 11111 12344567888888888877 3 33567776666554
Q ss_pred CceeeEeeEEEEEEecc-cEEEEEcCCCcc----cCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeee----e
Q psy5522 81 GSLNFYNSTGLVYVHKS-YSVFIQLDKAIY----KPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWT----R 151 (1397)
Q Consensus 81 ~~~~f~~~~~v~~~~~~-~~vfIqTDKpIY----kPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~----~ 151 (1397)
++-....+..|.+.... .-..+-+|-..| +||+..-+|+..-+..--|. ..+.++|.+|+|=.++--. .
T Consensus 360 s~s~v~~e~~lki~~~g~~~~~v~l~~g~~~lt~taGee~~i~i~I~NsGna~L--tdIkl~v~~PqgWei~Vd~~~I~s 437 (513)
T COG1470 360 SSSGVTRELPLKIKNTGSYNELVKLDNGPYRLTITAGEEKTIRISIENSGNAPL--TDIKLTVNGPQGWEIEVDESTIPS 437 (513)
T ss_pred ccccceeeeeEEEEeccccceeEEccCCcEEEEecCCccceEEEEEEecCCCcc--ceeeEEecCCccceEEECcccccc
Confidence 32233444455555443 235555666555 59998777777766654453 6788999999995554111 1
Q ss_pred -eccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 152 -ALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 152 -~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
...+.+-+......|+++..|+|.|.++...
T Consensus 438 L~pge~~tV~ltI~vP~~a~aGdY~i~i~~ks 469 (513)
T COG1470 438 LEPGESKTVSLTITVPEDAGAGDYRITITAKS 469 (513)
T ss_pred cCCCCcceEEEEEEcCCCCCCCcEEEEEEEee
Confidence 2335678999999999999999999998753
No 40
>PLN02710 farnesyltranstransferase subunit beta
Probab=95.51 E-value=0.2 Score=60.82 Aligned_cols=73 Identities=15% Similarity=0.221 Sum_probs=52.8
Q ss_pred ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccC---CCccCHHHHHHHHHHHHhcc
Q psy5522 917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAAS---HTTIDESVILEALAWLSSNQ 993 (1397)
Q Consensus 917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~---~i~vd~~~~~~~~~WL~~~q 993 (1397)
++++.+++.+++|.+ .|++||+|+-..+ ..++...|=.++.++...+. +-.||. .++++||.+.|
T Consensus 89 l~~~~~~~ii~~l~~-------cQ~~dGGFgg~pg--~~~hl~~TY~Av~~L~iLg~~~~l~~Idr---~~l~~fl~s~q 156 (439)
T PLN02710 89 LDDELENDTIDFLSR-------CQDPNGGYGGGPG--QLPHLATTYAAVNTLVTIGGERALSSINR---EKLYTFLLRMK 156 (439)
T ss_pred ccHHHHHHHHHHHHH-------hcCCCcCCCCCCC--CCccHHHHHHHHHHHHHcCCchhhcccCH---HHHHHHHHHcC
Confidence 667778888888775 5778999976543 24677777777777766653 223443 55799999999
Q ss_pred cCCCcccc
Q psy5522 994 AVNGSFPE 1001 (1397)
Q Consensus 994 ~~~G~F~s 1001 (1397)
+++|+|..
T Consensus 157 ~~dGgF~~ 164 (439)
T PLN02710 157 DPSGGFRM 164 (439)
T ss_pred CCCCCccc
Confidence 99999975
No 41
>PF13243 Prenyltrans_1: Prenyltransferase-like; PDB: 3SDR_A 3SAE_A 3SDV_A 3SDT_A 3SDQ_A 3SDU_A.
Probab=95.26 E-value=0.0063 Score=59.64 Aligned_cols=65 Identities=23% Similarity=0.324 Sum_probs=16.5
Q ss_pred ccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 932 GYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 932 gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
...-++.-|++||||..- ...+...|++++.++..+..- .....+.++++||.++|+++|+|...
T Consensus 4 ~~~~l~~~Q~~dG~W~~~----~~~~~~~t~~~~~al~~~~~~--~~~~ai~ka~~~l~~~Q~~dG~w~~~ 68 (109)
T PF13243_consen 4 AAEWLLSQQNPDGSWGYN----WGSDVFVTAALILALAAAGDA--AVDEAIKKAIDWLLSHQNPDGGWGYS 68 (109)
T ss_dssp -----------------------------------------TS---SSBSSHHHHHHHHH---TTS--S-T
T ss_pred cccccccccccccccccc----ccccccccccccccccccCCC--CcHHHHHHHHHHHHHhcCCCCCCCCc
Confidence 334556679999998532 124577788888888877643 33456799999999999999999754
No 42
>PLN02993 lupeol synthase
Probab=95.15 E-value=0.037 Score=71.24 Aligned_cols=70 Identities=23% Similarity=0.345 Sum_probs=53.5
Q ss_pred HHHHHHHHHHhccccceeeecCCCCc-cccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCcc
Q psy5522 921 IEAKASRYLETGYQQELTYRRPDGSF-SAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSF 999 (1397)
Q Consensus 921 ~~~~a~~~l~~gy~~~l~y~~~dGsf-s~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F 999 (1397)
..++|++||.+ -|++|||| +.|+. .-+.-|++||.+|..++.-.. +...+.++++||.+.|+.||||
T Consensus 591 ~i~rAv~yL~~-------~Q~~DGSW~G~Wgv----~y~YgT~~aL~aL~a~G~~~~-~~~~IrrAv~fLls~Q~~DGGW 658 (763)
T PLN02993 591 SIEKAVQFIES-------KQTPDGSWYGNWGI----CFIYATWFALGGLAAAGKTYN-DCLAMRKGVHFLLTIQRDDGGW 658 (763)
T ss_pred HHHHHHHHHHH-------hcCCCCCccccccc----ccCcHHHHHHHHHHHcCCCCC-CcHHHHHHHHHHHHhcCCCCCc
Confidence 44567776654 59999998 55654 346789999999987765322 3467999999999999999999
Q ss_pred ccc
Q psy5522 1000 PEV 1002 (1397)
Q Consensus 1000 ~st 1002 (1397)
+++
T Consensus 659 GEs 661 (763)
T PLN02993 659 GES 661 (763)
T ss_pred CcC
Confidence 864
No 43
>PLN03201 RAB geranylgeranyl transferase beta-subunit; Provisional
Probab=95.06 E-value=0.2 Score=59.11 Aligned_cols=84 Identities=15% Similarity=0.203 Sum_probs=56.5
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHH
Q psy5522 904 NIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVIL 983 (1397)
Q Consensus 904 ~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~ 983 (1397)
.+-.++.|.. +....+++++++|.+ .|++||+|+.+... .++.-.|-.++.++....++-.|| .+
T Consensus 43 ~l~~L~lL~~---~~~~~~~~~i~~l~~-------cq~~~GGF~~~~~~--~~h~~~Ty~al~~L~ll~~~~~id---~~ 107 (316)
T PLN03201 43 GLTALDLLGK---LDDVDRDEVVSWVMR-------CQHESGGFGGNTGH--DPHILYTLSAVQILALFDRLDLLD---AD 107 (316)
T ss_pred HHHHHHHhCC---CccccHHHHHHHHHH-------hcCCCCCcCCCCCC--cccHHHHHHHHHHHHHhhhhhhhh---HH
Confidence 3344555542 334445677777665 47789999887543 456777777777776655442233 45
Q ss_pred HHHHHHHhcccCCCccccc
Q psy5522 984 EALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 984 ~~~~WL~~~q~~~G~F~st 1002 (1397)
++++||.+.|+++|||...
T Consensus 108 ~~~~~l~s~Q~~dGgF~~~ 126 (316)
T PLN03201 108 KVASYVAGLQNEDGSFSGD 126 (316)
T ss_pred HHHHHHHHhcCCCCCccCC
Confidence 6899999999999999853
No 44
>TIGR01787 squalene_cyclas squalene/oxidosqualene cyclases. This family of enzymes catalyzes the cyclization of the triterpenes squalene or 2-3-oxidosqualene to a variety of products including hopene, lanosterol, cycloartenol, amyrin, lupeol and isomultiflorenol.
Probab=94.98 E-value=0.073 Score=68.36 Aligned_cols=71 Identities=28% Similarity=0.441 Sum_probs=53.2
Q ss_pred HHHHHHHHHHHhccccceeeecCCCCc-cccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCc
Q psy5522 920 AIEAKASRYLETGYQQELTYRRPDGSF-SAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGS 998 (1397)
Q Consensus 920 ~~~~~a~~~l~~gy~~~l~y~~~dGsf-s~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~ 998 (1397)
+..++|++||.. .|++||+| +.|+. ..+.-|++|+.+|..++.... +...+.++++||.++|+++||
T Consensus 460 ~~i~rAl~~L~~-------~Q~~DGsw~g~wg~----~y~YgT~~al~aL~~~G~~~~-~~~~i~rA~~~L~~~Q~~DGG 527 (621)
T TIGR01787 460 NVLERALEYLRR-------EQRADGSWFGRWGV----NYTYGTGFVLSALAAAGRTYR-NCPEVQKACDWLLSRQMPDGG 527 (621)
T ss_pred HHHHHHHHHHHH-------hcCCCCCCcccCCC----CCchhHHHHHHHHHHhCCccc-CCHHHHHHHHHHHhhcCCCCC
Confidence 344577777664 58899995 45543 235669999999998876432 237789999999999999999
Q ss_pred cccc
Q psy5522 999 FPEV 1002 (1397)
Q Consensus 999 F~st 1002 (1397)
|.++
T Consensus 528 Wge~ 531 (621)
T TIGR01787 528 WGED 531 (621)
T ss_pred CCcC
Confidence 9754
No 45
>TIGR03463 osq_cycl 2,3-oxidosqualene cyclase. This model identifies 2,3-oxidosqualene cyclases from Stigmatella aurantiaca which produces cycloartenol, and Gemmata obscuriglobus and Methylococcus capsulatus which each produce the closely related sterol, lanosterol.
Probab=94.84 E-value=0.082 Score=68.00 Aligned_cols=88 Identities=19% Similarity=0.397 Sum_probs=62.4
Q ss_pred HHHHHHHHHHhcc-ccHHHHHHHHHHHHhccccceeeecCCCCccc-cCC-------CCCCCCcchhHHHHHHHHHccCC
Q psy5522 904 NIVVLEYLKKTYQ-LTDAIEAKASRYLETGYQQELTYRRPDGSFSA-FGT-------TDPNGSTWLTAFVAKSFRQAASH 974 (1397)
Q Consensus 904 ~i~~l~yL~~~~~-l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~-~~~-------~~~~~s~wlTAyvl~~~~~a~~~ 974 (1397)
+.+++.-|...+. ......++|+++|.+ .|++||||+- |.. ....+++..||+++.+|..++..
T Consensus 508 T~~al~aL~~~G~~~~~~~i~rA~~~Ll~-------~Q~~DGgWg~~~~s~~~~~y~~~~~S~~~~TA~Al~aL~~~g~~ 580 (634)
T TIGR03463 508 TFHGVMGLRAAGASPDDMALQRAAAWLRS-------YQRADGGWGEVYESCLQARYVEGKQSQAVMTSWALLALAEAGEG 580 (634)
T ss_pred HHHHHHHHHHcCCCcCcHHHHHHHHHHHH-------ccCCCCCccCccCccccccccCCCCCcHHHHHHHHHHHHHcCCc
Confidence 4455555555453 233455677777764 5899999842 221 11246788999999999988742
Q ss_pred CccCHHHHHHHHHHHHhcccCCCcccc
Q psy5522 975 TTIDESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus 975 i~vd~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
+...+.++++||.++|+++|+|..
T Consensus 581 ---~~~~i~rgi~~L~~~Q~~dG~W~~ 604 (634)
T TIGR03463 581 ---GHDAVQRGVAWLRSRQQEDGRWPR 604 (634)
T ss_pred ---CCHHHHHHHHHHHHhCCCCCCCCC
Confidence 556789999999999999999975
No 46
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=94.75 E-value=0.15 Score=46.74 Aligned_cols=72 Identities=21% Similarity=0.205 Sum_probs=45.4
Q ss_pred eeeCCEEEEEEEEEcCCCcc-EEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEec---e
Q psy5522 725 VMRGEVVAIPVVVFNYLSQD-LVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPK---E 800 (1397)
Q Consensus 725 v~rGE~~~l~~~VfNyl~~~-~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~---~ 800 (1397)
|.+||.+.+.++|.|..+.. ..+++.|...++.... . .......|++|++.++.|.|+|- .
T Consensus 1 v~~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~-~--------------~~~~~~~l~pG~s~~~~~~V~vp~~a~ 65 (78)
T PF10633_consen 1 VTPGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVS-A--------------SPASVPSLPPGESVTVTFTVTVPADAA 65 (78)
T ss_dssp --TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE------------------EEEEE--B-TTSEEEEEEEEEE-TT--
T ss_pred CCCCCEEEEEEEEEECCCCceeeEEEEEeCCCCcccc-C--------------CccccccCCCCCEEEEEEEEECCCCCC
Confidence 46899999999999998765 3578888877777521 0 11223389999999999999973 4
Q ss_pred eeeEEEEEEEE
Q psy5522 801 LGYIGIKVTAT 811 (1397)
Q Consensus 801 lG~v~itV~A~ 811 (1397)
.|.-.|++.|.
T Consensus 66 ~G~y~v~~~a~ 76 (78)
T PF10633_consen 66 PGTYTVTVTAR 76 (78)
T ss_dssp SEEEEEEEEEE
T ss_pred CceEEEEEEEE
Confidence 69988888875
No 47
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=93.76 E-value=19 Score=42.91 Aligned_cols=234 Identities=12% Similarity=0.096 Sum_probs=119.2
Q ss_pred EEEECCCCCeEEeee-eeccCCceeeeeeec--CCCCCCeeEEEEEEEcCeEEEEEEEEeeecCCcEEEEEeCCCceEee
Q psy5522 136 IFITDGKGNRIKQWT-RALTTRGVFSADLQL--SKSPVLGDWNITINVLDQKFTKRFTVAEYVLPKFEVNVNVPPHATFK 212 (1397)
Q Consensus 136 v~I~DP~g~~I~qw~-~~~~~~Gi~~~sf~L--~~~~~lG~y~I~v~~~~~~~~~~F~VeEYvLPkFeV~v~~p~~i~~~ 212 (1397)
+.|..|+|.+...|. ...+-.-.-..++.| -++|+.|.|.+.-.....+ ... ++-..++.+++-+..++.
T Consensus 32 vvLV~PDGsk~ya~~~~p~~V~W~~~~~~DlItI~~PmpGpWq~~G~v~p~s---ri~----viS~L~L~v~plP~~l~~ 104 (374)
T TIGR03503 32 VILVRPDGSKYYAWRVHPEDVKWYDESTMDIISIKNPMPGPWQAIGKITPGN---RVK----VISNLRLEVEPLPSPLFQ 104 (374)
T ss_pred eEEECCCCcEEeccCCCCCCceEEecCCceEEEeCCCCCCCcEEeeeeCCCC---eEE----EEeccEEEEecCCccccC
Confidence 567789999988887 433333333344444 7899999999988875411 111 244566667665555567
Q ss_pred CCeEEEEEEEEEec-Cccccc-----eEEEEEEeccccCccccccc------ccee--E----EeecCceEEEEEechhh
Q psy5522 213 DSKVVISVNAKYTY-GKPVKG-----EATITAYPTIFSGVIQPLFQ------TPVR--K----VVPIDGKTVIEFDVVKE 274 (1397)
Q Consensus 213 ~~~~~v~v~AkYty-GkPV~G-----~v~v~v~~~~~~g~~~~~~~------~~~~--~----~~d~~G~~~~~~~~~~~ 274 (1397)
++.+ +++|+-+. |+|+.. .+++++.........++++. .++. + ....||.++..+++..
T Consensus 105 gE~l--k~ta~L~~d~~~i~~~~~l~~v~~~v~f~~~~~~~~e~~~~~~~vvg~f~DdG~g~DE~p~DGvFT~~l~l~~- 181 (374)
T TIGR03503 105 GETL--KVTAKLLNDGEPLKLPDFRDVVKLTVRFVSTNNAEDENFGADPIVVGEFEDDGEGLDERPGDGIFTGEFNLDV- 181 (374)
T ss_pred CCeE--EEEEEEecCCEEeechhhhheEEEEEEEEecCCccccccccCCEEEEeeccCCccCCCCCCCceEEEEeeccC-
Confidence 7755 45555444 788874 23344322111110011110 0110 0 1235788888877633
Q ss_pred hccccccceEEEEEEEEEECCcceEEee-EEEEEEEEeeeEEEEEecCCcccCCCeEEEEEEEEccCCCccCCCCceEEE
Q psy5522 275 LQLTDEYERNIHFDVAVEEALTGRRQNN-TGSVVFHKHKYKMDLIKSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQV 353 (1397)
Q Consensus 275 ~~~~~~~~~~l~V~a~VtE~gTG~~~~~-~~~i~i~~~~~~i~f~~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v 353 (1397)
. +..+.+.+.+ +++.-... +..+.+++.|..++|..... ++.+..+.|.+-+..=.|- .+.+
T Consensus 182 --~----~G~Y~~~v~~---~n~vF~R~~~q~v~V~p~Pi~~~~~~~~~---~~~~h~l~v~~d~~~i~~~-----s~~~ 244 (374)
T TIGR03503 182 --A----PGEYRPTYQS---RNPVFLREVEQPVLVYPLPVSYTVIQSED---ESGAHQLMVDADAGHIDPG-----SLVI 244 (374)
T ss_pred --C----CceEEEEEEE---cCceEEEEEEEeEEEECCCeeEEEEccCC---CCCceEEEEEcccceeccc-----cEEE
Confidence 1 1233443333 22433333 45788899999988876543 4666666555322221221 2322
Q ss_pred EEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEE
Q psy5522 354 RHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAE 397 (1397)
Q Consensus 354 ~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~ 397 (1397)
........ +.........+.+...+.+......|.|++...+.
T Consensus 245 ~~~~~~P~-g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~ 287 (374)
T TIGR03503 245 HGELVFPN-GQIQQFSIELEEPETRVDLPANYEFGKYRVKGTVF 287 (374)
T ss_pred EEEEECCC-CceEEecccCccCceEEeccCcCCCeEEEEEEEEE
Confidence 22111111 11111112223444556665555677877766653
No 48
>PLN03012 Camelliol C synthase
Probab=93.66 E-value=0.22 Score=64.26 Aligned_cols=88 Identities=17% Similarity=0.198 Sum_probs=60.7
Q ss_pred HHHHHHHHhccc--cHHHHHHHHHHHHhccccceeeecCCCCccc-cCCC---------CCCCCcchhHHHHHHHHHccC
Q psy5522 906 VVLEYLKKTYQL--TDAIEAKASRYLETGYQQELTYRRPDGSFSA-FGTT---------DPNGSTWLTAFVAKSFRQAAS 973 (1397)
Q Consensus 906 ~~l~yL~~~~~l--~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~-~~~~---------~~~~s~wlTAyvl~~~~~a~~ 973 (1397)
.++.-|...+.. +.+..+||.++|.+ -|++||||+- |..- ...+.+.-||||+++|+.|+.
T Consensus 623 ~aL~aL~a~g~~~~~~~~Irrav~fLls-------~Q~~DGGWGEs~~Sc~~~~y~~~~~~~S~~~qTaWAl~aLi~ag~ 695 (759)
T PLN03012 623 FALAGLAAAGKTFNDCEAIRKGVHFLLA-------AQKDNGGWGESYLSCPKKIYIAQEGEISNLVQTAWALMGLIHAGQ 695 (759)
T ss_pred HHHHHHHHhCccCCCcHHHHHHHHHHHH-------hcCCCCCcCCCCCCCCCccccCCCCCCCcHHHHHHHHHHHHHcCC
Confidence 344444444431 22445678888775 4889999752 2210 113567889999999998864
Q ss_pred CCccCHHHHHHHHHHHHhcccCCCcccc
Q psy5522 974 HTTIDESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus 974 ~i~vd~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
..-|+..+.++++||.++|.++|+|..
T Consensus 696 -~~~~~~~i~Rg~~~Ll~~Q~~dG~W~q 722 (759)
T PLN03012 696 -AERDPIPLHRAAKLIINSQLENGDFPQ 722 (759)
T ss_pred -CCCCcHHHHHHHHHHHHcccCCCCCCC
Confidence 234566899999999999999999964
No 49
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=93.08 E-value=9.3 Score=42.11 Aligned_cols=174 Identities=13% Similarity=0.128 Sum_probs=91.2
Q ss_pred cEEEEEcCCCcccCCceeEEEEEEEcCCCccC--CCcceEEEEECCCCCeEEeeeeecc----------CCceeeeeeec
Q psy5522 98 YSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPS--VTGALEIFITDGKGNRIKQWTRALT----------TRGVFSADLQL 165 (1397)
Q Consensus 98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~--~~~~~~v~I~DP~g~~I~qw~~~~~----------~~Gi~~~sf~L 165 (1397)
..++|.+++...++|+.+.+++..=. .+... .... .+.+.+|+|.++........ ..|.+...+.+
T Consensus 4 H~~Wl~~~~~~~~~g~~~~v~~~~Gh-~~~~~~~~~~~-~l~~~~~dg~~~~l~~~~~~~~~~~~~~~~~~g~~~~~~~~ 81 (215)
T PF10670_consen 4 HELWLEPDAFQVSPGEEVTVELYFGH-PFPGDEFPPDK-SLQVISPDGKRIDLPPTVEDGKYALVFDVEQPGTYKLTVAL 81 (215)
T ss_pred ccEEEECCccccCCCCEEEEEEEECC-CCCCccccccC-ceEEECCCCeeecccccccccccceeEeecCCCcEEEEEEc
Confidence 46799999999999999999887532 22110 0111 78889999984443221111 23333333322
Q ss_pred CCCCCCeeEEEEEEEcC----eE------EEEEEEE-eeec----------CC---cEEEEEeCCCceEeeCCeEEEEEE
Q psy5522 166 SKSPVLGDWNITINVLD----QK------FTKRFTV-AEYV----------LP---KFEVNVNVPPHATFKDSKVVISVN 221 (1397)
Q Consensus 166 ~~~~~lG~y~I~v~~~~----~~------~~~~F~V-eEYv----------LP---kFeV~v~~p~~i~~~~~~~~v~v~ 221 (1397)
.. .|.|+....... .. ..+.++. +-|+ .| .||+.-.....-+..++.+.+.|-
T Consensus 82 ~~---~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aK~~v~~~~~~~~~~~~~g~~lEIvPl~~P~~l~~g~~~~~~vl 158 (215)
T PF10670_consen 82 RD---PGYYSFTPEGWKRYSKREGPDAESSERYQKYAKTLVNVGGPSEDWSKPVGLPLEIVPLTNPYKLKAGDPLPFQVL 158 (215)
T ss_pred cC---CEEEEEeccCcccccccccCccceEEEEEEEEEEEEEccCCccccccccCCcEEEEECcCcccccCCCEEEEEEE
Confidence 21 355555444321 00 1111211 1122 12 234433222233346777777775
Q ss_pred EEEecCccccceEEEEEEeccccCccccccccceeEEeecCceEEEEEechhhhccccccceEEEEEEEEEE
Q psy5522 222 AKYTYGKPVKGEATITAYPTIFSGVIQPLFQTPVRKVVPIDGKTVIEFDVVKELQLTDEYERNIHFDVAVEE 293 (1397)
Q Consensus 222 AkYtyGkPV~G~v~v~v~~~~~~g~~~~~~~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~l~V~a~VtE 293 (1397)
+-|+|++| +.|.+......... .......++|.+|.++|.++-+. ...|.|...+
T Consensus 159 ---~~GkPl~~-a~V~~~~~~~~~~~---~~~~~~~~TD~~G~~~~~~~~~G----------~wli~a~~~~ 213 (215)
T PF10670_consen 159 ---FDGKPLAG-AEVEAFSPGGWYDV---EHEAKTLKTDANGRATFTLPRPG----------LWLIRASHKD 213 (215)
T ss_pred ---ECCeEccc-EEEEEEECCCcccc---ccceEEEEECCCCEEEEecCCCE----------EEEEEEEEec
Confidence 77999999 44444332111000 00144678999999998876533 4566666544
No 50
>cd02890 PTase Protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). The protein prenyltransferase family of lipid-modifying enzymes includes protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II). They catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between the C1 atom of farnesyl (15-carbon by FTase) or geranylgeranyl (20-carbon by GGTase-I, II) isoprenoid lipids and cysteine residues at or near the C-terminus of protein acceptors. FTase and GGTase-I prenylate the cysteine in the terminal sequence, "CAAX"; and GGTase-II prenylates both cysteines in the "CC" (or "CXC") terminal sequence. These enzymes are heterodimeric with both alpha and beta subunits re
Probab=92.97 E-value=0.3 Score=56.81 Aligned_cols=81 Identities=23% Similarity=0.232 Sum_probs=57.2
Q ss_pred HHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHH
Q psy5522 908 LEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALA 987 (1397)
Q Consensus 908 l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~ 987 (1397)
+.-|...+......+++++++|.+ .|.+||+|+.-++ .++....|++++.++..++.. +...+.++++
T Consensus 134 l~~l~ll~~~~~~~~~~~~~~l~~-------~Q~~dGGf~~~~~--~es~~~~t~~av~sL~~l~~~---~~~~~~~~~~ 201 (286)
T cd02890 134 LSILSLLNILTDIDKEKLIDYILS-------CQNYDGGFGGVPG--AESHGGYTFCAVASLALLGRL---DLIDKERLLR 201 (286)
T ss_pred HHHHHHhCCchhhhHHHHHHHHHH-------hCCCCCCcCCCCC--CCCCccHhHHHHHHHHHcCCC---cccCHHHHHH
Confidence 333333344334456677777764 4788999976543 356788999999999888764 3445788899
Q ss_pred HHHhcccCCC-ccc
Q psy5522 988 WLSSNQAVNG-SFP 1000 (1397)
Q Consensus 988 WL~~~q~~~G-~F~ 1000 (1397)
||.++|+.+| ||.
T Consensus 202 ~L~~~q~~~ggGf~ 215 (286)
T cd02890 202 WLVERQLASGGGFN 215 (286)
T ss_pred HHHHhCCCCCCCcC
Confidence 9999998777 785
No 51
>cd02894 GGTase-II Geranylgeranyltransferase type II (GGTase-II)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-IIs are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-II ). GGTase-II catalyzes alkylation of both cysteine residues in Rab proteins containing carboxy-terminal "CC", "CXCX" or "CXC" motifs. PTases are heterodimeric with both alpha and beta subunits required for catalytic activity. In contrast to other prenyltr
Probab=92.38 E-value=0.5 Score=55.03 Aligned_cols=74 Identities=19% Similarity=0.081 Sum_probs=54.9
Q ss_pred ccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhccc
Q psy5522 915 YQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQA 994 (1397)
Q Consensus 915 ~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~ 994 (1397)
+.+...-.++++++|.+ .|++||+|+.-++ .+.++..|.+++-++..++..-.+ .++++++||.++|.
T Consensus 143 ~~~~~i~~~~~~~~l~~-------~q~~dGGF~~~~~--~es~~~~t~cavasL~llg~~~~~---~~~~~~~~L~~~q~ 210 (287)
T cd02894 143 GKLDLIDVDKAVDYLLS-------CYNFDGGFGCRPG--AESHAGQIFCCVGALAILGSLDLI---DRDRLGWWLCERQL 210 (287)
T ss_pred CCcchhhHHHHHHHHHH-------cCCCCCCcCCCCC--CCCchhHHHHHHHHHHHcCccccc---CHHHHHHHHHHhCC
Confidence 33433345788888775 3678999976433 356788899999999888765233 37778999999999
Q ss_pred CCCccc
Q psy5522 995 VNGSFP 1000 (1397)
Q Consensus 995 ~~G~F~ 1000 (1397)
.+|||.
T Consensus 211 ~~GGf~ 216 (287)
T cd02894 211 PSGGLN 216 (287)
T ss_pred CCCCcC
Confidence 999996
No 52
>cd02891 A2M_like Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement is an effector of both the acquired and innate immune systems The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propaga
Probab=92.14 E-value=0.49 Score=54.73 Aligned_cols=52 Identities=15% Similarity=0.132 Sum_probs=42.8
Q ss_pred CCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccchHh---HHHhhc
Q psy5522 955 NGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEVGKV---SHADMQ 1011 (1397)
Q Consensus 955 ~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~stqdv---l~al~~ 1011 (1397)
...++.|||++++..... +.....++++||.++++.+|+|.|||+. ++||.+
T Consensus 227 ~~~~~a~a~all~~~~~~-----~~~~~~~~~~~L~~~~~~~~~~~sTq~t~~al~AL~~ 281 (282)
T cd02891 227 SLRVEATAYALLALLKLG-----DLEEAGPIAKWLAQQRNSGGGFLSTQDTVVALQALAA 281 (282)
T ss_pred hhhHHHHHHHHHHHHhcC-----ChhhHHHHHHHHHHcCCCCCCcccHHHHHHHHHHHHh
Confidence 457899999999998654 4567788999999999999999999983 777753
No 53
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=92.11 E-value=34 Score=42.51 Aligned_cols=222 Identities=18% Similarity=0.236 Sum_probs=112.3
Q ss_pred EEEEEcCCCCCcee-EEEEE--ecCceeeEeeEEEEEEecccE------EEEEcCCCcccCCceeEEEEEEEcCCCccCC
Q psy5522 60 IVKLDIGDLGPGQY-NLTVK--GSGSLNFYNSTGLVYVHKSYS------VFIQLDKAIYKPGDLVRFRAIVLNSHLKPSV 130 (1397)
Q Consensus 60 ~~~f~vP~~~~~~~-~l~v~--g~~~~~f~~~~~v~~~~~~~~------vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~ 130 (1397)
..+..+|. .+|.. -+.+. +...-.|.+-..|.|...... +--+....--+.||+|+||++--.... +
T Consensus 158 ~~~~~lP~-RsG~hVI~~vWq~~Dt~~aFY~c~DV~f~~~~~~~p~w~~~g~i~~~~~l~~Gd~V~~RvFd~~Ge~-~-- 233 (478)
T PRK13211 158 SHECNVPE-RTGYQVILAVWEVGDTANSFYNVIDVNFDGGGGVVPEWSKVGQINPSMDLKAGDKVMTRVFDANGEN-P-- 233 (478)
T ss_pred EEEEECCC-CCccEEEEEEEEecCCcCCceEEEEEEEcCCCCCCCccccCccCccCCCCCCCCEEEEEEecCCCcc-c--
Confidence 33556775 55542 22222 212236888888888753211 001122225678999999999665321 2
Q ss_pred CcceEEEEECCCCCeEEeeeeec----------cCCceeee--eeecCCCCCCeeEEEEEEEcC--eEEEEEEEEeeecC
Q psy5522 131 TGALEIFITDGKGNRIKQWTRAL----------TTRGVFSA--DLQLSKSPVLGDWNITINVLD--QKFTKRFTVAEYVL 196 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~----------~~~Gi~~~--sf~L~~~~~lG~y~I~v~~~~--~~~~~~F~VeEYvL 196 (1397)
.....+.|.+.....-.+|.... ..-|.... .+. |..|.-.|-++.++ ...+..|+.+. .-
T Consensus 234 ~~~~~l~I~~~~~~~~~~W~~~LA~~iN~~~~~~~~G~~~~~g~i~----p~~g~N~iy~~~~s~~~~vei~~~~~~-p~ 308 (478)
T PRK13211 234 ALQTELTIDSATQGKKNNWSHALASKINQEQQQLRAGQLNADGQIE----PVYGKNPIYLKAGSGLERVEIGYDIEA-PA 308 (478)
T ss_pred cceeEEEEeccccCChhhhHHHHHHHHhhcchhhheeeecCCCCEe----eccCccceEEeCCCCceEEEEEEEecC-CC
Confidence 33456777764323334565211 11232222 222 67888888888776 44455555444 46
Q ss_pred CcEEEEEeC-CCceEeeCCeEEEEEEEEEecCccccceEEEEEEeccccCccccccccceeEEeecCceEEEEEechhhh
Q psy5522 197 PKFEVNVNV-PPHATFKDSKVVISVNAKYTYGKPVKGEATITAYPTIFSGVIQPLFQTPVRKVVPIDGKTVIEFDVVKEL 275 (1397)
Q Consensus 197 PkFeV~v~~-p~~i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v~~~~~~g~~~~~~~~~~~~~~d~~G~~~~~~~~~~~~ 275 (1397)
|.+++.|.- .....+.+...++++... ..|.+++++......|.. ......+-.++...+++.+.+ +
T Consensus 309 p~~~lsVsGL~~eY~I~dG~~~i~ftv~------a~g~~~vta~V~d~~g~~-----~~~~~~~v~d~s~~vtL~Ls~-~ 376 (478)
T PRK13211 309 PDYELEVSGLAKEYKIGDGAATLDFTVT------ATGDMNVEATVYNHDGEA-----LGSKSQTVNDGSQSVSLDLSK-L 376 (478)
T ss_pred CCCceEEecCcceeEEcCCcEEEEEEEE------eccceEEEEEEEcCCCCe-----eeeeeEEecCCceeEEEeccc-C
Confidence 778888774 233344566676666555 345555554432211111 011223333455777777765 3
Q ss_pred ccccccceEEEEEEEEEECCcceEE-eeEEEEEE
Q psy5522 276 QLTDEYERNIHFDVAVEEALTGRRQ-NNTGSVVF 308 (1397)
Q Consensus 276 ~~~~~~~~~l~V~a~VtE~gTG~~~-~~~~~i~i 308 (1397)
.. + .+.|.|.|++ +. |... ..+..+.+
T Consensus 377 ~A-G--~y~Lvv~~t~--~d-G~~~~q~~~~~~v 404 (478)
T PRK13211 377 KA-G--HHMLVVKAKP--KD-GELIKQQTLDFML 404 (478)
T ss_pred CC-c--eEEEEEEEEe--CC-CceeeeeeEEEEE
Confidence 31 1 2345555554 33 4443 34444443
No 54
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=92.06 E-value=2.9 Score=40.05 Aligned_cols=78 Identities=14% Similarity=0.152 Sum_probs=47.5
Q ss_pred ecCCcccCCCeEEEEEEEEccCC--CccCCCCceEEEEEeeeccccccceeeE-eeCCCeeEEEEEeCCCCCceeEEEEE
Q psy5522 319 KSSEYFKPGLKYTAYMKLTHHDG--TPVTDNNNMVQVRHGFSYDESKYEANQY-KLDRNGMIKLVYYPPANENVTTLGIE 395 (1397)
Q Consensus 319 ~~~~~~kpG~p~~~~v~v~~~dG--~P~~~~~~~V~v~~~~~~~~~~~~~~~~-ttd~~G~~~f~i~~~~~~~~~~l~l~ 395 (1397)
.....|+||....+++.+.+.++ .+..+ ..+.+.+... +......... +++++|.+.+++.+|++...-.++|+
T Consensus 6 TDr~iYrPGetV~~~~~~~~~~~~~~~~~~--~~~~v~i~dp-~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~ 82 (99)
T PF01835_consen 6 TDRPIYRPGETVHFRAIVRDLDNDFKPPAN--SPVTVTIKDP-SGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIR 82 (99)
T ss_dssp ESSSEE-TTSEEEEEEEEEEECTTCSCESS--EEEEEEEEET-TSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEE
T ss_pred CCccCcCCCCEEEEEEEEeccccccccccC--CceEEEEECC-CCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEE
Confidence 35578999999999999999984 44443 3555554321 1111233445 67899999999998877633446666
Q ss_pred EEec
Q psy5522 396 AEYL 399 (1397)
Q Consensus 396 a~~~ 399 (1397)
+...
T Consensus 83 ~~~~ 86 (99)
T PF01835_consen 83 VKTD 86 (99)
T ss_dssp EEET
T ss_pred EEEc
Confidence 6664
No 55
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function. The region in question is approximately 150 residues long.
Probab=91.86 E-value=0.48 Score=60.63 Aligned_cols=79 Identities=16% Similarity=0.226 Sum_probs=70.0
Q ss_pred eeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeE
Q psy5522 712 FRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGST 791 (1397)
Q Consensus 712 fkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~t 791 (1397)
--|+-|.+.+|.+...|+.|.+..+|.|....-++..++|+.+++|-|. +.++.++.|-+.++.+
T Consensus 468 ~~~~~v~~~~p~~~~~~~~~~l~~~I~N~T~~~~~~~~~me~s~~F~fs---------------G~k~~~~~llP~s~~~ 532 (554)
T PF07919_consen 468 SSPLRVLASVPPSAIVGEPFTLSYTIENPTNHFQTFELSMEPSDDFMFS---------------GPKQTTFSLLPFSRHT 532 (554)
T ss_pred CCCcEEEEecCCccccCcEEEEEEEEECCCCccEEEEEEEccCCCEEEE---------------CCCcCceEECCCCcEE
Confidence 3467788889999999999999999999999999999999999998773 2335688999999999
Q ss_pred EEEEEEeceeeeEE
Q psy5522 792 TTFVITPKELGYIG 805 (1397)
Q Consensus 792 v~f~v~p~~lG~v~ 805 (1397)
+.|.+.|...|...
T Consensus 533 ~~y~l~pl~~G~~~ 546 (554)
T PF07919_consen 533 VRYNLLPLVAGWWI 546 (554)
T ss_pred EEEEEEEccCCcEE
Confidence 99999999999764
No 56
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=91.58 E-value=2.5 Score=40.21 Aligned_cols=76 Identities=28% Similarity=0.257 Sum_probs=54.6
Q ss_pred EecCCceeeeCCEEEEEEEEEcCCCcc-EEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEE-EEeCCCeeEEEEE
Q psy5522 718 SLDLPYSVMRGEVVAIPVVVFNYLSQD-LVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKL-TIKANSGSTTTFV 795 (1397)
Q Consensus 718 ~l~LPysv~rGE~~~l~~~VfNyl~~~-~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~a~~~~tv~f~ 795 (1397)
....|..+..|+.+.+.++|.|..... -.+.|.+...+... ....+ .|.+|+..++.|.
T Consensus 8 ~~~~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~-------------------~~~~i~~L~~g~~~~v~~~ 68 (101)
T PF07705_consen 8 ITVSPSNVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSV-------------------STVTIPSLAPGESETVTFT 68 (101)
T ss_dssp EEEC-SEEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEE-------------------EEEEESEB-TTEEEEEEEE
T ss_pred EeeCCCcccCCCEEEEEEEEEECCCCCCCCEEEEEEECCcee-------------------ccEEECCcCCCcEEEEEEE
Confidence 566799999999999999999987654 55677766554331 13445 7899999999999
Q ss_pred EEeceeeeEEEEEEEEe
Q psy5522 796 ITPKELGYIGIKVTATS 812 (1397)
Q Consensus 796 v~p~~lG~v~itV~A~~ 812 (1397)
+.+...|...|++.+..
T Consensus 69 ~~~~~~G~~~i~~~iD~ 85 (101)
T PF07705_consen 69 WTPPSPGSYTIRVVIDP 85 (101)
T ss_dssp EE-SS-CEEEEEEEEST
T ss_pred EEeCCCCeEEEEEEEee
Confidence 99999999988887743
No 57
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=91.10 E-value=1.9 Score=46.32 Aligned_cols=85 Identities=24% Similarity=0.375 Sum_probs=62.7
Q ss_pred CCeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEee----cCCeeeeecCCCccCCCCCCcceeeeE
Q psy5522 705 MPKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLEN----VGQFDFADFSNEVDAAPQPKFEVFRRK 780 (1397)
Q Consensus 705 ~~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~----s~~~~~~~~~~~~~~~~~~~~~~~~~~ 780 (1397)
.+..+.+.|-+ ++..+..|+.+.+..+|+|..+... ..|+|.. +++|+++... ....
T Consensus 19 ~~a~llv~K~i-----l~~~~v~g~~v~V~~~iyN~G~~~A-~dV~l~D~~fp~~~F~lvsG~-------------~s~~ 79 (181)
T PF05753_consen 19 SPARLLVSKQI-----LNKYLVEGEDVTVTYTIYNVGSSAA-YDVKLTDDSFPPEDFELVSGS-------------LSAS 79 (181)
T ss_pred CCcEEEEEEee-----ccccccCCcEEEEEEEEEECCCCeE-EEEEEECCCCCccccEeccCc-------------eEEE
Confidence 46777777754 6888999999999999999987643 3444444 2467765321 1122
Q ss_pred EEEEeCCCeeEEEEEEEeceeeeEEEEE
Q psy5522 781 KLTIKANSGSTTTFVITPKELGYIGIKV 808 (1397)
Q Consensus 781 ~~~v~a~~~~tv~f~v~p~~lG~v~itV 808 (1397)
--.|++|+..+..+.++|++.|..+++-
T Consensus 80 ~~~i~pg~~vsh~~vv~p~~~G~f~~~~ 107 (181)
T PF05753_consen 80 WERIPPGENVSHSYVVRPKKSGYFNFTP 107 (181)
T ss_pred EEEECCCCeEEEEEEEeeeeeEEEEccC
Confidence 3489999999999999999999888763
No 58
>COG5029 CAL1 Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones]
Probab=90.48 E-value=0.79 Score=52.03 Aligned_cols=97 Identities=21% Similarity=0.307 Sum_probs=59.0
Q ss_pred CCCchhhhhcccchHHHHHHHHHH---hccccHHHHHHHHHHHHhccccceeeecCCCCcc--ccCCCCCCCCcch--hH
Q psy5522 890 PFGCGEQNMLNFVPNIVVLEYLKK---TYQLTDAIEAKASRYLETGYQQELTYRRPDGSFS--AFGTTDPNGSTWL--TA 962 (1397)
Q Consensus 890 P~GCgEQ~m~~~~p~i~~l~yL~~---~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs--~~~~~~~~~s~wl--TA 962 (1397)
++||++-..++++.++++.+-|.- ..-++...|....+|+.. -|++||+|. .|+. .++-. +|
T Consensus 93 ~f~~~~g~~~hL~sT~~Ai~~L~~~d~~~~~~~idr~~l~~fi~~-------lk~pdGsF~~~~~ge----vDtr~~Y~a 161 (342)
T COG5029 93 GFGGGPGQDSHLASTVFAIQSLAMLDSLDVLSRIDRDSLASFISG-------LKNPDGSFRSDLEGE----VDTRFLYIA 161 (342)
T ss_pred CcCCCCccchhHHHHHHHHHHHHHhccccccchhhHHHHHHHHHh-------ccCCCCceecccCCc----chHHHHHHH
Confidence 456666666677777766654443 333343344445555543 589999985 3332 22222 33
Q ss_pred HHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 963 FVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 963 yvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
-.+++++. . .|....+.+++||.+.|+.+|||+.-
T Consensus 162 l~ilsllg--~---~~~~~~e~~vdyl~kCqnyeGGFg~~ 196 (342)
T COG5029 162 LSILSLLG--D---LDKELFEGAVDYLKKCQNYEGGFGLC 196 (342)
T ss_pred HHHHHHHh--h---cchhhhHHHHHHHHHhhccCCcccCC
Confidence 33333332 2 36677888999999999999999853
No 59
>cd02895 GGTase-I Geranylgeranyltransferase types I (GGTase-I)-like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-I s are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-I ). GGTase-I prenylates the cysteine in the terminal sequence, "CAAX" when X is Leu or Phe. Substrates for GTTase-I include the gamma subunit of neural G-proteins and several Ras-related G-proteins. PTases are heterodimeric with both alpha and beta subunits r
Probab=89.62 E-value=1.2 Score=52.45 Aligned_cols=110 Identities=20% Similarity=0.149 Sum_probs=69.0
Q ss_pred ccCCCchhhhhcccchHHHHHHHHHHhccc---cHHHHHHHHHHHHhccccceeeecCCCCccccCCC--------CCCC
Q psy5522 888 KMPFGCGEQNMLNFVPNIVVLEYLKKTYQL---TDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTT--------DPNG 956 (1397)
Q Consensus 888 ~~P~GCgEQ~m~~~~p~i~~l~yL~~~~~l---~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~--------~~~~ 956 (1397)
.+|......-..+++-..+++.-|.--+.. +...+++.+++|.+- |.. -.++||+|+.-... ...+
T Consensus 14 ~lp~~~~~~d~~r~~~~yf~l~~L~lL~~~~~~~~~~r~~~i~~i~~~-q~~--~~~~~GgF~~~~~~~~~~~~~~~~~~ 90 (307)
T cd02895 14 LLPSSYQSLDTNRLTIAFFALSGLDLLGALDSILVEEKDDIIEWIYSL-QVL--SNLPRGGFRGSSTLGLPGTASKYDTG 90 (307)
T ss_pred hCCchhhhhhhhhHHHHHHHHHHHHHhCCCccccHHHHHHHHHHHHHH-hcc--cCCCCCCCCCCCCCccccccccCCcc
Confidence 357776666666655555555433333322 355667777777652 100 12789999764321 1257
Q ss_pred CcchhHHHHHHHHHccCC-CccCHHHHHHHHHHHHhcccCCCccccch
Q psy5522 957 STWLTAFVAKSFRQAASH-TTIDESVILEALAWLSSNQAVNGSFPEVG 1003 (1397)
Q Consensus 957 s~wlTAyvl~~~~~a~~~-i~vd~~~~~~~~~WL~~~q~~~G~F~stq 1003 (1397)
+++.|=+++.++...+.- -.+| -+++++||.+.|+++|+|.+..
T Consensus 91 ~l~~ty~Al~~L~lL~~~~~~id---r~~i~~~l~~~q~~dGgF~~~~ 135 (307)
T cd02895 91 NLAMTYFALLSLLILGDDLSRVD---RKAILNFLSKLQLPDGSFGSVL 135 (307)
T ss_pred cHHHHHHHHHHHHHhCCchhhhh---HHHHHHHHHHhCCCCCCccCCc
Confidence 899999998888776542 1233 3567999999999999998763
No 60
>PF13115 YtkA: YtkA-like
Probab=89.10 E-value=3.2 Score=38.72 Aligned_cols=75 Identities=23% Similarity=0.344 Sum_probs=53.7
Q ss_pred cEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCC--C-CeEEeeeeec-cCCceeeeeeecCCCCCCee
Q psy5522 98 YSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGK--G-NRIKQWTRAL-TTRGVFSADLQLSKSPVLGD 173 (1397)
Q Consensus 98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~--g-~~I~qw~~~~-~~~Gi~~~sf~L~~~~~lG~ 173 (1397)
..+-++++.++ +.|+. -|.+- ++.+.+|+.+..+.+.+..|. | ..-..+.... ...|.|..+..|+. .|.
T Consensus 7 ~~v~l~~~~~~-~~g~~-~i~v~-~~~~g~pv~~a~V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~m---~G~ 80 (86)
T PF13115_consen 7 YTVELVSPEPP-KVGEN-TITVT-VDQGGKPVTDADVQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFSM---AGT 80 (86)
T ss_pred EEEEEecCCCC-cCCce-EEEEE-ECCCCCCCCCCEEEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecCC---Cee
Confidence 34455555444 88888 57666 888999997778999998885 3 2223333333 47899999988865 699
Q ss_pred EEEEE
Q psy5522 174 WNITI 178 (1397)
Q Consensus 174 y~I~v 178 (1397)
|.|++
T Consensus 81 W~i~v 85 (86)
T PF13115_consen 81 WQITV 85 (86)
T ss_pred EEEEE
Confidence 99987
No 61
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=88.95 E-value=1.8 Score=47.74 Aligned_cols=76 Identities=20% Similarity=0.232 Sum_probs=48.8
Q ss_pred EEEEEe--cCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeE
Q psy5522 314 KMDLIK--SSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTT 391 (1397)
Q Consensus 314 ~i~f~~--~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~ 391 (1397)
.+++++ .|..++.|.++.++|. .||+|+++ ..|.+.....|.........++||++|.+.|.+..+ |.
T Consensus 135 ~lEIvPl~~P~~l~~g~~~~~~vl---~~GkPl~~--a~V~~~~~~~~~~~~~~~~~~~TD~~G~~~~~~~~~---G~-- 204 (215)
T PF10670_consen 135 PLEIVPLTNPYKLKAGDPLPFQVL---FDGKPLAG--AEVEAFSPGGWYDVEHEAKTLKTDANGRATFTLPRP---GL-- 204 (215)
T ss_pred cEEEEECcCcccccCCCEEEEEEE---ECCeEccc--EEEEEEECCCccccccceEEEEECCCCEEEEecCCC---EE--
Confidence 345555 4556789999887774 79999995 356554332222111226789999999999887554 45
Q ss_pred EEEEEEec
Q psy5522 392 LGIEAEYL 399 (1397)
Q Consensus 392 l~l~a~~~ 399 (1397)
.-|+|.+.
T Consensus 205 wli~a~~~ 212 (215)
T PF10670_consen 205 WLIRASHK 212 (215)
T ss_pred EEEEEEEe
Confidence 44555544
No 62
>PLN03201 RAB geranylgeranyl transferase beta-subunit; Provisional
Probab=88.26 E-value=1.5 Score=51.65 Aligned_cols=73 Identities=18% Similarity=0.183 Sum_probs=51.1
Q ss_pred ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCC
Q psy5522 917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVN 996 (1397)
Q Consensus 917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~ 996 (1397)
++....+++++||.+ -|+.||+|+.-++. ++....|..++-++..++..-.+|. +...+||.++|...
T Consensus 149 ~~~i~~~~~~~~i~s-------cq~~dGGF~~~p~~--esh~g~T~caiaaL~llg~~~~~d~---~~l~~wL~~rQ~~~ 216 (316)
T PLN03201 149 LDKINVEKAVDYIVS-------CKNFDGGFGCTPGG--ESHAGQIFCCVGALAITGSLHHVDK---DLLGWWLCERQVKS 216 (316)
T ss_pred cchhHHHHHHHHHHH-------hcCCCCCcCCCCCC--CCccceehHHHHHHHHcCccccCCH---HHHHHHHHHhCCCC
Confidence 333335678888764 46789999875443 4566668777777777765544564 44589999999989
Q ss_pred Ccccc
Q psy5522 997 GSFPE 1001 (1397)
Q Consensus 997 G~F~s 1001 (1397)
|||..
T Consensus 217 GGf~g 221 (316)
T PLN03201 217 GGLNG 221 (316)
T ss_pred CCcCC
Confidence 99974
No 63
>cd02895 GGTase-I Geranylgeranyltransferase types I (GGTase-I)-like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-I s are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-I ). GGTase-I prenylates the cysteine in the terminal sequence, "CAAX" when X is Leu or Phe. Substrates for GTTase-I include the gamma subunit of neural G-proteins and several Ras-related G-proteins. PTases are heterodimeric with both alpha and beta subunits r
Probab=88.01 E-value=1.1 Score=52.64 Aligned_cols=70 Identities=17% Similarity=0.196 Sum_probs=46.9
Q ss_pred HHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccc
Q psy5522 922 EAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus 922 ~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
+++++++|.+ -|..||+|+.-+.. ++....|-.++-++......-..+.....++++||.++|+.+|||.
T Consensus 167 ~~~li~~l~s-------~Q~~dGGF~~~~~~--Esh~g~Tyca~asL~lL~~~~~~~~~~~~~l~~wL~~rQ~~~GGF~ 236 (307)
T cd02895 167 KEKLIDYIKS-------SQSYDGGFGQGPGL--ESHGGSTFCAIASLSLLGKLEELSEKFLERLKRWLVHRQVSGTGFN 236 (307)
T ss_pred HHHHHHHHHH-------ccCCCCCccCCCCC--CccccHHHHHHHHHHHcCCccccccccHHHHHHHHHHhcCCCCCcC
Confidence 4567777664 47779999865543 4456666666666655554311113446778999999999999996
No 64
>PLN02710 farnesyltranstransferase subunit beta
Probab=87.93 E-value=1.1 Score=54.59 Aligned_cols=67 Identities=16% Similarity=0.231 Sum_probs=43.5
Q ss_pred HHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchh--HHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCcc
Q psy5522 922 EAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLT--AFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSF 999 (1397)
Q Consensus 922 ~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlT--Ayvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F 999 (1397)
+++..+||.+ .|++||||+.-... +.+++.| |-+++.++.. ++....+++++||.+.|+.+|||
T Consensus 145 r~~l~~fl~s-------~q~~dGgF~~~~~g--E~D~R~tYcAlail~LL~~-----l~~~~~e~~~~~I~scQ~~dGGF 210 (439)
T PLN02710 145 REKLYTFLLR-------MKDPSGGFRMHDGG--EMDVRACYTAISVASLLNI-----LDDELVKGVGDYILSCQTYEGGI 210 (439)
T ss_pred HHHHHHHHHH-------cCCCCCCcccCCCC--CCCcCCcHHHHHHHHHhCc-----CchhhHHHHHHHHHHhCCCCCCC
Confidence 4566666654 57789999763322 3344433 4444444432 24556788999999999999999
Q ss_pred ccc
Q psy5522 1000 PEV 1002 (1397)
Q Consensus 1000 ~st 1002 (1397)
...
T Consensus 211 ~g~ 213 (439)
T PLN02710 211 GGE 213 (439)
T ss_pred CCC
Confidence 855
No 65
>cd02893 FTase Protein farnesyltransferase (FTase)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). FTases are a subgroup of PTase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. These proteins are heterodimers of alpha and beta subunits. Both subunits are required for catalytic activity. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids. Ftase attaches a 15-carbon farnesyl group to the cysteine within the C-terminal CaaX motif of substrate proteins when X is Ala, Met, Ser, Cys or Gln. Protein farnesylation has been shown to play critical roles in a variety of cellular pro
Probab=87.86 E-value=2 Score=50.39 Aligned_cols=74 Identities=19% Similarity=0.302 Sum_probs=53.5
Q ss_pred cccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccC
Q psy5522 916 QLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAV 995 (1397)
Q Consensus 916 ~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~ 995 (1397)
.++++..+++++||.+ -|..||+|+..++. ++..-.|.+++-+|..++..-.+| .++.++||.++|..
T Consensus 142 ~~~~~~~~~~~~~l~~-------cQ~~dGGF~~~p~~--e~h~~yTfcavasL~llg~~~~~d---~~~l~~wl~~~q~~ 209 (299)
T cd02893 142 ILTDELFEGVAEYILS-------CQTYEGGFGGVPGN--EAHGGYTFCALAALAILGKPDKLD---LESLLRWLVARQMR 209 (299)
T ss_pred CCchhhHHHHHHHHHH-------cCCCCCCcCCCCCC--CCCccHHHHHHHHHHHcCCccccC---HHHHHHHHHhhcCC
Confidence 3456667788888775 46779999875443 556677888888887776643344 56679999998876
Q ss_pred -CCcccc
Q psy5522 996 -NGSFPE 1001 (1397)
Q Consensus 996 -~G~F~s 1001 (1397)
.|||.-
T Consensus 210 ~~GGf~g 216 (299)
T cd02893 210 FEGGFQG 216 (299)
T ss_pred CCCCcCC
Confidence 899963
No 66
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=87.48 E-value=3.6 Score=39.15 Aligned_cols=64 Identities=17% Similarity=0.270 Sum_probs=50.6
Q ss_pred cCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 110 KPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 110 kPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+.|+...|.+-++|. ....+.+.|.+|+|+.+.-. .....+|.+..+|. |.. .|.|+|.+..++
T Consensus 15 ~vg~~~~f~v~~~d~-----G~~~~~v~i~~p~g~~~~~~-v~d~~dGty~v~y~-P~~--~G~~~i~V~~~g 78 (93)
T smart00557 15 VVGEPAEFTIDTRGA-----GGGELEVEVTGPSGKKVPVE-VKDNGDGTYTVSYT-PTE--PGDYTVTVKFGG 78 (93)
T ss_pred ecCCCEEEEEEcCCC-----CCCcEEEEEECCCCCeeEeE-EEeCCCCEEEEEEE-eCC--CEeEEEEEEECC
Confidence 567788888888876 35788999999999766543 35677799999999 444 399999999987
No 67
>COG5029 CAL1 Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones]
Probab=87.47 E-value=2.2 Score=48.59 Aligned_cols=94 Identities=17% Similarity=0.303 Sum_probs=59.6
Q ss_pred CchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHc
Q psy5522 892 GCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQA 971 (1397)
Q Consensus 892 GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a 971 (1397)
||..+.+...+-+ ++..| +.++.+..+-+++|+.+ -|+.||+|+.=+..+..+----+|.+.++++
T Consensus 151 gevDtr~~Y~al~--ilsll---g~~~~~~~e~~vdyl~k-------CqnyeGGFg~~p~aEaHag~tFcalaalalL-- 216 (342)
T COG5029 151 GEVDTRFLYIALS--ILSLL---GDLDKELFEGAVDYLKK-------CQNYEGGFGLCPYAEAHAGYTFCALAALALL-- 216 (342)
T ss_pred CcchHHHHHHHHH--HHHHH---hhcchhhhHHHHHHHHH-------hhccCCcccCCCchhhccchHHHHHHHHHHH--
Confidence 6666666333222 23334 35666777788887776 4777999998766433333333455555444
Q ss_pred cCCCccCHHHHHHHHHHHHhcccCCCcccc
Q psy5522 972 ASHTTIDESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus 972 ~~~i~vd~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
.+...++. .++..+||.++|-+.|||..
T Consensus 217 ~~Ld~ls~--~E~l~~Wl~~RQ~ssgGl~G 244 (342)
T COG5029 217 GKLDKLSD--VEKLIRWLAERQLSSGGLNG 244 (342)
T ss_pred hcccccch--HHHHHHHHHHcccccCCcCC
Confidence 44444443 66779999999999999963
No 68
>KOG0367|consensus
Probab=87.21 E-value=1.1 Score=50.45 Aligned_cols=105 Identities=19% Similarity=0.327 Sum_probs=57.9
Q ss_pred hhccCCCchh------hhhcccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcc
Q psy5522 886 LIKMPFGCGE------QNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTW 959 (1397)
Q Consensus 886 Ll~~P~GCgE------Q~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~w 959 (1397)
..|.|-||+- .+-.+|+-...+.-|+-..-+ ---+++++.||.+- |+.||+|+.-++.++.+-.-
T Consensus 136 ~~Q~~dGsF~~~~~GSe~DmRFvYcA~aI~ymLd~~s--~iD~ek~~~yI~~~-------q~YdgGfg~~pg~EsHgG~T 206 (347)
T KOG0367|consen 136 ACQRPDGSFVSINVGSESDMRFVYCAVAICYMLDFWS--GIDKEKLIGYIRSS-------QRYDGGFGQHPGGESHGGAT 206 (347)
T ss_pred HhcCCCCceeecCCCCchhhHHHHHHHHHHHHhcccc--ccCHHHHHHHHHHh-------hccccccccCCCCCCCcchh
Confidence 4566767653 222344433333344432111 11246788888753 66799999876543332222
Q ss_pred hhHHHHHHHHHccCCC--cc-CHHHHHHHHHHHHhcccCCCcccc
Q psy5522 960 LTAFVAKSFRQAASHT--TI-DESVILEALAWLSSNQAVNGSFPE 1001 (1397)
Q Consensus 960 lTAyvl~~~~~a~~~i--~v-d~~~~~~~~~WL~~~q~~~G~F~s 1001 (1397)
--|.+-++| ..+.+ .+ +...+++.++||.++|..+|||..
T Consensus 207 fCAlAsL~L--~~~l~~e~l~~~~~~erlirWli~RQ~~sgGfqG 249 (347)
T KOG0367|consen 207 FCALASLAL--MGKLIPEELSNTSKVERLIRWLIQRQVSSGGFQG 249 (347)
T ss_pred HHHHHHHHH--HhhhhhhhhccccCHHHHHHHHHHHhhccCCcCC
Confidence 223333333 33331 12 333488999999999999999964
No 69
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=87.07 E-value=6 Score=37.50 Aligned_cols=75 Identities=17% Similarity=0.251 Sum_probs=51.2
Q ss_pred ccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEE-EEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEec
Q psy5522 153 LTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTK-RFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTY 226 (1397)
Q Consensus 153 ~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~-~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYty 226 (1397)
++++|-...+|.||+. +|.|+|.+..-+ ...+. .|.|.. .|-++...|..+. .+|++.+.+...-+-
T Consensus 10 ~d~~G~~~~~~~lPd~--it~w~v~a~a~s~~~~~g~~~~~~~~v~~----p~~i~~~lP~~l~-~GD~~~i~v~v~N~~ 82 (92)
T PF00207_consen 10 TDANGKATFSFTLPDS--ITSWRVTAFAVSPTGGFGIAEPPEITVFK----PFFIQLNLPRSLR-RGDQIQIPVTVFNYT 82 (92)
T ss_dssp CCTTEEEEEEEE-SSS--SSEEEEEEEEEETTTEEEEECCEEEEEB-----SEEEEEE--SEEE-TTSEEEEEEEEEE-S
T ss_pred cCCCCEEEEEEECCCC--ccEEEEEEEEECCCCcceEecceEEEEEe----eEEEEcCCCcEEe-cCCEEEEEEEEEeCC
Confidence 3578999999999997 689999998743 22333 566533 6888999988875 688999998887666
Q ss_pred CccccceE
Q psy5522 227 GKPVKGEA 234 (1397)
Q Consensus 227 GkPV~G~v 234 (1397)
++....++
T Consensus 83 ~~~~~v~V 90 (92)
T PF00207_consen 83 DKDQEVTV 90 (92)
T ss_dssp SS-EEEEE
T ss_pred CCCEEEEE
Confidence 66554433
No 70
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=85.86 E-value=8.5 Score=36.49 Aligned_cols=67 Identities=16% Similarity=0.161 Sum_probs=44.8
Q ss_pred CeEEEEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecC
Q psy5522 328 LKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLD 400 (1397)
Q Consensus 328 ~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~ 400 (1397)
....+++.++|.+|.|+++. .|.+.+...- ..........||++|.+.+.+... ..+. +.|+|+..+
T Consensus 18 d~~~i~v~v~D~~Gnpv~~~--~V~f~~~~~~-~~~~~~~~~~Td~~G~a~~~l~~~-~~G~--~~vta~~~~ 84 (92)
T smart00634 18 DAITLTATVTDANGNPVAGQ--EVTFTTPSGG-ALTLSKGTATTDANGIATVTLTST-TAGV--YTVTASLEN 84 (92)
T ss_pred ccEEEEEEEECCCCCCcCCC--EEEEEECCCc-eeeccCCeeeeCCCCEEEEEEECC-CCcE--EEEEEEECC
Confidence 67889999999999999964 4555542110 000112456899999999999854 4455 666666543
No 71
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=84.28 E-value=6.1 Score=37.84 Aligned_cols=70 Identities=13% Similarity=0.201 Sum_probs=54.6
Q ss_pred cCCceeEEEEEEEcCCCccC--CCcceEEEEECCCCCe---EEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 110 KPGDLVRFRAIVLNSHLKPS--VTGALEIFITDGKGNR---IKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 110 kPGqtV~fRvv~ld~~l~P~--~~~~~~v~I~DP~g~~---I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
..|+...|.|.++|..-.+. ....+.+.|.+|++.. -..+.......|.+..+|.... .|.|+|.+..++
T Consensus 18 ~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~---~G~y~i~V~~~g 92 (101)
T PF00630_consen 18 VVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTE---PGKYKISVKING 92 (101)
T ss_dssp ETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESS---SEEEEEEEEESS
T ss_pred ECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCc---cEeEEEEEEECC
Confidence 68999999999999865543 3467889999999872 2233445678899999999554 399999999976
No 72
>KOG0366|consensus
Probab=83.44 E-value=3.3 Score=45.80 Aligned_cols=90 Identities=18% Similarity=0.252 Sum_probs=53.2
Q ss_pred cccchHHHHHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCcc--ccCCCCCCCCcchhHHHHHHHHHccCCCc
Q psy5522 899 LNFVPNIVVLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFS--AFGTTDPNGSTWLTAFVAKSFRQAASHTT 976 (1397)
Q Consensus 899 ~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs--~~~~~~~~~s~wlTAyvl~~~~~a~~~i~ 976 (1397)
.++.-++.+.|.|.--..+..--+.+...|+. | .|++||||+ .||. -++-.+--++.++...++.
T Consensus 93 ~hll~TlsAvQiL~~ydsi~~~d~d~v~~yi~-g------Lq~edGsF~gD~wGE----vDTRfs~~av~~L~lLg~l-- 159 (329)
T KOG0366|consen 93 PHLLYTLSAVQILALYDSINVLDRDKVASYIK-G------LQQEDGSFSGDIWGE----VDTRFSYCAVACLALLGKL-- 159 (329)
T ss_pred hHHHHHHHHHHHHHHHcccccccHHHHHHHHH-h------hcCcCCcccCCcccc----cchhhhHHHHHHHHHHhhH--
Confidence 34444555666665444443333445556664 4 478899995 4543 2333333333333333332
Q ss_pred cCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 977 IDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 977 vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
|.-.+++|++|+.+--|-||||...
T Consensus 160 -d~~nve~aVd~~~~CyN~DGGFG~~ 184 (329)
T KOG0366|consen 160 -DTINVEKAVDFVLSCYNFDGGFGCR 184 (329)
T ss_pred -HHhhHHHHHHHHHhhcccCCCcCCC
Confidence 3455788999999999999999864
No 73
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=82.79 E-value=6.7 Score=38.27 Aligned_cols=79 Identities=19% Similarity=0.195 Sum_probs=53.5
Q ss_pred EEecCCceeeeCCEEEEEEEEEcCCCcc-EEEEEEEeecC----CeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeE
Q psy5522 717 ISLDLPYSVMRGEVVAIPVVVFNYLSQD-LVADVTLENVG----QFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGST 791 (1397)
Q Consensus 717 v~l~LPysv~rGE~~~l~~~VfNyl~~~-~~v~v~L~~s~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~t 791 (1397)
+++.++-.+..|+.+.+.+++.|-++.. -.|++.|.... +..... .........|.+++..+
T Consensus 3 ~~i~~~~~~~vG~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~~~-------------~~~~~~~~~l~p~~~~~ 69 (107)
T PF00927_consen 3 IKIKLPGDPVVGQDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTRDQ-------------FKKEKFEVTLKPGETKS 69 (107)
T ss_dssp EEEEEESEEBTTSEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEEEE-------------EEEEEEEEEE-TTEEEE
T ss_pred EEEEECCCccCCCCEEEEEEEEeCCcCccccceeEEEEEEEEECCccccc-------------EeEEEcceeeCCCCEEE
Confidence 4566777888999999999999999999 55777764321 111000 12335578999999999
Q ss_pred EEEEEEeceeeeEEEEE
Q psy5522 792 TTFVITPKELGYIGIKV 808 (1397)
Q Consensus 792 v~f~v~p~~lG~v~itV 808 (1397)
+.+.|.|...|.-.+-+
T Consensus 70 ~~~~i~p~~yG~~~~l~ 86 (107)
T PF00927_consen 70 VEVTITPSQYGPKQLLV 86 (107)
T ss_dssp EEEEE-HHSHEEECCEE
T ss_pred EEEEEEceeEecchhcc
Confidence 99999999998744333
No 74
>PF13243 Prenyltrans_1: Prenyltransferase-like; PDB: 3SDR_A 3SAE_A 3SDV_A 3SDT_A 3SDQ_A 3SDU_A.
Probab=81.20 E-value=0.58 Score=45.66 Aligned_cols=60 Identities=25% Similarity=0.333 Sum_probs=31.8
Q ss_pred HHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhccc
Q psy5522 922 EAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQA 994 (1397)
Q Consensus 922 ~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~ 994 (1397)
.+++.+||.+ .|++||+|..++. .+...|+-++.++..+... -+...+.++++||.++|.
T Consensus 48 i~ka~~~l~~-------~Q~~dG~w~~~~~----~~~~~t~~~~~~l~~~~~~--~~~~~~~r~~~wi~~~~~ 107 (109)
T PF13243_consen 48 IKKAIDWLLS-------HQNPDGGWGYSGG----EYVSMTAAAIAALALAGVY--PDDEAVERGLEWILSHQL 107 (109)
T ss_dssp SHHHHHHHHH----------TTS--S-TS------HHHHHHHHHHHHHHHHTT----HHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHH-------hcCCCCCCCCcCC----CCHHHHHHHHHHHHHhCCC--CCCHHHHHHHHHHHHccC
Confidence 3678887765 5889999987732 2344555444444433221 266779999999998753
No 75
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=79.83 E-value=9.7 Score=36.77 Aligned_cols=69 Identities=16% Similarity=0.139 Sum_probs=40.6
Q ss_pred CCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccccccee--eEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCc
Q psy5522 326 PGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEAN--QYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDI 401 (1397)
Q Consensus 326 pG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~--~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~ 401 (1397)
-+....+++.++|.+|+|+++. .|.+..... ....... ..+||++|.+.+.+... ..+. ..|+|+..+.
T Consensus 21 g~~~~tltatV~D~~gnpv~g~--~V~f~~~~~--~~~l~~~~~~~~Td~~G~a~~tltst-~aG~--~~VtA~~~~~ 91 (100)
T PF02369_consen 21 GSDTNTLTATVTDANGNPVPGQ--PVTFSSSSS--GGTLSPTNTSATTDSNGIATVTLTST-KAGT--YTVTATVDGG 91 (100)
T ss_dssp SSS-EEEEEEEEETTSEB-TS---EEEE--EES--SSEES-CEE-EEE-TTSEEEEEEE-S-S-EE--EEEEEEETTE
T ss_pred CcCcEEEEEEEEcCCCCCCCCC--EEEEEEcCC--CcEEecCccccEECCCEEEEEEEEec-CceE--EEEEEEECCc
Confidence 4667899999999999999964 455511111 0001111 47899999999999754 4455 6677776654
No 76
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=79.51 E-value=11 Score=37.95 Aligned_cols=75 Identities=23% Similarity=0.283 Sum_probs=47.8
Q ss_pred eeeCCEEEEEEEEEcCCCccEEEEEEEeec---C--CeeeeecCCCccCC--CCCCcceeeeEEEEEeCCCeeEEEEEEE
Q psy5522 725 VMRGEVVAIPVVVFNYLSQDLVADVTLENV---G--QFDFADFSNEVDAA--PQPKFEVFRRKKLTIKANSGSTTTFVIT 797 (1397)
Q Consensus 725 v~rGE~~~l~~~VfNyl~~~~~v~v~L~~s---~--~~~~~~~~~~~~~~--~~~~~~~~~~~~~~v~a~~~~tv~f~v~ 797 (1397)
+.+|++-.|.+.|.|..++.+.+.+.+... + .+............ +....-....+.+.|+|++++.|.|.|+
T Consensus 23 ~~P~q~~~l~v~i~N~s~~~~tv~v~~~~A~Tn~nG~I~Y~~~~~~~d~sl~~~~~~~v~~~~~Vtl~~~~sk~V~~~i~ 102 (121)
T PF06030_consen 23 VKPGQKQTLEVRITNNSDKEITVKVSANTATTNDNGVIDYSQNNPKKDKSLKYPFSDLVKIPKEVTLPPNESKTVTFTIK 102 (121)
T ss_pred eCCCCEEEEEEEEEeCCCCCEEEEEEEeeeEecCCEEEEECCCCcccCcccCcchHHhccCCcEEEECCCCEEEEEEEEE
Confidence 678999999999999999999999887642 1 12222221111000 0000001123349999999999999998
Q ss_pred -ec
Q psy5522 798 -PK 799 (1397)
Q Consensus 798 -p~ 799 (1397)
|.
T Consensus 103 ~P~ 105 (121)
T PF06030_consen 103 MPK 105 (121)
T ss_pred cCC
Confidence 54
No 77
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=78.80 E-value=27 Score=37.56 Aligned_cols=46 Identities=24% Similarity=0.389 Sum_probs=35.0
Q ss_pred ceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEc
Q psy5522 133 ALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVL 181 (1397)
Q Consensus 133 ~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~ 181 (1397)
.+.|.|.||+|+...... .+...-...|.||..+..|.|++.+...
T Consensus 42 ~p~i~L~~P~G~~~~~~~---~d~~~~~~~i~ipg~ae~G~W~y~i~~~ 87 (179)
T PF09315_consen 42 PPSITLTDPSGTVYTTFT---TDSNSKTARIQIPGTAEVGTWTYSITNT 87 (179)
T ss_pred CceEEEECCCCCEEeeeE---EcccccEEEEECCCCcccccEEEEEecC
Confidence 567889999999998832 2234455667788889999999999543
No 78
>PF14545 DBB: Dof, BCAP, and BANK (DBB) motif,
Probab=76.01 E-value=18 Score=37.05 Aligned_cols=84 Identities=17% Similarity=0.199 Sum_probs=54.9
Q ss_pred EEEeeCeeecCCeEEEEEEeec-CCccEEEEEEEEeecCCCCeeEEeEEEEECCCceeEEEEEcCCC---CCceeEEEEE
Q psy5522 3 TVVAPKVLRPNGEYHVAVSTQA-VSEATQVSVEVGGKQDSGGEFRTKQLLSVDPFSTRIVKLDIGDL---GPGQYNLTVK 78 (1397)
Q Consensus 3 ~v~~P~~lr~g~~~~v~v~l~~-~~~~v~v~v~L~~~~~~~~~~~~~~~~~~~~~~~~~~~f~vP~~---~~~~~~l~v~ 78 (1397)
+++.|+.++.|.+ +|.|-|.+ ..+..+++|.+... + .. .+.+.....+...+.|++|+. ++|.+.++|.
T Consensus 1 f~V~P~rI~cg~~-~vfIiL~~~l~~~~~~eVef~~~---n--~~-~~~~~~~~~N~yt~~~~aPd~~~~pag~V~v~v~ 73 (142)
T PF14545_consen 1 FVVQPSRIRCGQP-EVFIILRDPLDEEDTVEVEFESN---N--KP-IRRVPAKWENPYTLQFKAPDFCLEPAGSVNVRVY 73 (142)
T ss_pred CeecCceeecCCC-EEEEEEeCCCCCCCeEEEEEEeC---C--Ce-eEeccceEECCEEEEEECchhcCCCCceEEEEEE
Confidence 4678999999999 99997766 33345677777654 1 11 112222333567889999998 8898888888
Q ss_pred ecCceeeEeeEEEEEEe
Q psy5522 79 GSGSLNFYNSTGLVYVH 95 (1397)
Q Consensus 79 g~~~~~f~~~~~v~~~~ 95 (1397)
- +|.. .-.++|.+.+
T Consensus 74 ~-~g~~-~~~~~ikyys 88 (142)
T PF14545_consen 74 C-DGVS-LGTRQIKYYS 88 (142)
T ss_pred E-CCEE-EEEEeEEEEe
Confidence 7 4442 2344555443
No 79
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=75.13 E-value=15 Score=33.54 Aligned_cols=54 Identities=28% Similarity=0.391 Sum_probs=28.1
Q ss_pred ceeEEEEEecCCCeeeEEEEEEEEecCCCceeEEEEEecCCCCcCCCCcEEEEEE
Q psy5522 817 DSMEGKLLVKPEGETQYKNKAIFVDLRKNKTFSVNVTLDMPKNIVPGSEHVEVSA 871 (1397)
Q Consensus 817 D~v~k~L~V~peG~~~~~t~s~~l~~~~~~~~~~~i~l~~P~~~v~gs~~~~vsv 871 (1397)
..+.-.|.+ |+|......-...-.+..+...+.++.+..|+++.+|+-.+.+.+
T Consensus 22 ~~v~~~l~~-P~GW~~~~~~~~~~~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a 75 (78)
T PF10633_consen 22 TNVSLSLSL-PEGWTVSASPASVPSLPPGESVTVTFTVTVPADAAPGTYTVTVTA 75 (78)
T ss_dssp SS-EEEEE---TTSE---EEEEE--B-TTSEEEEEEEEEE-TT--SEEEEEEEEE
T ss_pred eeEEEEEeC-CCCccccCCccccccCCCCCEEEEEEEEECCCCCCCceEEEEEEE
Confidence 334444555 899883333223224444457788899999999999987766654
No 80
>PF14524 Wzt_C: Wzt C-terminal domain; PDB: 2R5O_B.
Probab=73.58 E-value=19 Score=36.59 Aligned_cols=73 Identities=15% Similarity=0.284 Sum_probs=48.4
Q ss_pred CCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeee------ccC-CceeeeeeecCCCCCCeeEEEEE
Q psy5522 106 KAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRA------LTT-RGVFSADLQLSKSPVLGDWNITI 178 (1397)
Q Consensus 106 KpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~------~~~-~Gi~~~sf~L~~~~~lG~y~I~v 178 (1397)
......||++.|++-..-. ++..+-.+.+.|.|.+|..|...... ... .|.....+.+|..-..|.|.|.+
T Consensus 28 ~~~~~~ge~~~i~i~~~~~--~~i~~~~~~~~i~~~~g~~v~~~~t~~~~~~~~~~~~g~~~~~~~i~~~L~~G~Y~i~v 105 (142)
T PF14524_consen 28 TSSFESGEPIRIRIDYEVN--EDIDDPVFGFAIRDSDGQRVFGTNTYDSGFPIPLSEGGTYEVTFTIPKPLNPGEYSISV 105 (142)
T ss_dssp -SSEETTSEEEEEEEEEES--S-EEEEEEEEEEEETT--EEEEEEHHHHT--EEE-TT-EEEEEEEEE--B-SEEEEEEE
T ss_pred eeEEeCCCEEEEEEEEEEC--CCCCccEEEEEEEcCCCCEEEEECccccCccccccCCCEEEEEEEEcCccCCCeEEEEE
Confidence 3567899999999876542 23334568899999999998853221 112 78888888888878889999999
Q ss_pred EE
Q psy5522 179 NV 180 (1397)
Q Consensus 179 ~~ 180 (1397)
..
T Consensus 106 ~l 107 (142)
T PF14524_consen 106 GL 107 (142)
T ss_dssp EE
T ss_pred EE
Confidence 98
No 81
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=73.30 E-value=15 Score=34.22 Aligned_cols=64 Identities=14% Similarity=0.312 Sum_probs=31.5
Q ss_pred eEEEEEEEcCCCccC-----CCcceEEEEECCCCCeEEeeeeec----------c-CCceeeeeeecCCCCC-CeeEEEE
Q psy5522 115 VRFRAIVLNSHLKPS-----VTGALEIFITDGKGNRIKQWTRAL----------T-TRGVFSADLQLSKSPV-LGDWNIT 177 (1397)
Q Consensus 115 V~fRvv~ld~~l~P~-----~~~~~~v~I~DP~g~~I~qw~~~~----------~-~~Gi~~~sf~L~~~~~-lG~y~I~ 177 (1397)
|.|++...+..-+|+ .+....+.|.|++|+.|-+|..-. . ..-....++.++..+. .|.|++.
T Consensus 2 v~~~l~v~N~s~~~v~l~f~sgq~~D~~v~d~~g~~vwrwS~~~~FtQal~~~~l~pGe~~~~~~~~~~~~~~~G~Y~~~ 81 (82)
T PF12690_consen 2 VEFTLTVTNNSDEPVTLQFPSGQRYDFVVKDKEGKEVWRWSDGKMFTQALQEETLEPGESLTYEETWDLKDLSPGEYTLE 81 (82)
T ss_dssp EEEEEEEEE-SSS-EEEEESSS--EEEEEE-TT--EEEETTTT-------EEEEE-TT-EEEEEEEESS----SEEEEEE
T ss_pred EEEEEEEEeCCCCeEEEEeCCCCEEEEEEECCCCCEEEEecCCchhhheeeEEEECCCCEEEEEEEECCCCCCCceEEEe
Confidence 455555555443332 467899999999999999996421 1 1112333344444332 7888876
Q ss_pred E
Q psy5522 178 I 178 (1397)
Q Consensus 178 v 178 (1397)
|
T Consensus 82 a 82 (82)
T PF12690_consen 82 A 82 (82)
T ss_dssp E
T ss_pred C
Confidence 5
No 82
>PF12975 DUF3859: Domain of unknown function (DUF3859); InterPro: IPR024331 This entry represents a functionally uncharacterised domain, which is found in bacterial proteins.; PDB: 3N6Y_B.
Probab=73.00 E-value=13 Score=37.48 Aligned_cols=58 Identities=26% Similarity=0.433 Sum_probs=35.4
Q ss_pred eEEEEECC------CCCeEEeeeeecc--CCceeeeeeecCCCCCCeeEEEEEEEcC-eEEEEEEEE
Q psy5522 134 LEIFITDG------KGNRIKQWTRALT--TRGVFSADLQLSKSPVLGDWNITINVLD-QKFTKRFTV 191 (1397)
Q Consensus 134 ~~v~I~DP------~g~~I~qw~~~~~--~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-~~~~~~F~V 191 (1397)
+++.|.-| +|..+..|..... .+-..-..|.-+.++..|.|++.+..++ ...+++|+|
T Consensus 59 lt~~i~HP~ip~~~~g~~~~~~~~~v~~~~~~~~~~~f~~~~e~v~G~Wr~~i~~~g~~i~~ktF~V 125 (125)
T PF12975_consen 59 LTVCITHPGIPDGDKGDTVEPWQGEVYVGSNDWDFYTFDTPWEPVPGEWRFTIELDGKVIAEKTFEV 125 (125)
T ss_dssp EEEEEEEEEETTS-EEEEEEEEE---TT-S-EEEEEE--SGGG---EEEEEEEEETTEEEEEEEEEE
T ss_pred EEEEEeCCCCCCCCCCCEECcEEEEecCCCcceEEEEcCcccccCCceEEEEEEECCEEEEEEEEeC
Confidence 55555543 5666677775422 2334555677778888999999999998 667999987
No 83
>PF04734 Ceramidase_alk: Neutral/alkaline non-lysosomal ceramidase; InterPro: IPR006823 This family represents a group of neutral/alkaline ceramidases found in both bacteria and eukaryotes [, , ]. They hydrolyse the sphingolipid ceramide into sphingosine and free fatty acid.; PDB: 2ZXC_A 2ZWS_A.
Probab=72.76 E-value=16 Score=47.36 Aligned_cols=82 Identities=18% Similarity=0.182 Sum_probs=48.6
Q ss_pred cEEEEEcCCCcccCCceeEEEEEEEcCC--CccCCCcceEEEEE-----------CCCCCeEEeeeeeccCC--ceeeee
Q psy5522 98 YSVFIQLDKAIYKPGDLVRFRAIVLNSH--LKPSVTGALEIFIT-----------DGKGNRIKQWTRALTTR--GVFSAD 162 (1397)
Q Consensus 98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~--l~P~~~~~~~v~I~-----------DP~g~~I~qw~~~~~~~--Gi~~~s 162 (1397)
+.+..| -++.|+|||+|.+..+.-++. ++- .+.++.|+-. |.+-....+|....... ..+..+
T Consensus 557 Gdv~~~-~~~~y~~g~~v~~~F~~a~Prn~l~~-~~tfl~Ver~~~~~~W~~v~~D~dw~t~f~W~r~~~~~~~S~~ti~ 634 (674)
T PF04734_consen 557 GDVLQQ-PNPSYRPGDTVSATFVGANPRNNLRL-EGTFLTVERLESGGSWQTVADDADWSTRFRWKRTGSLLGTSEVTIE 634 (674)
T ss_dssp T-EEE----SEEETT-EEEEEEEE--GGG---T-TS-SEEEEEEES-S--EEEEETTSTTEEEEEEEETTT--EEEEEEE
T ss_pred CceEeC-CcCccCCCCeEEEEEeeeCCCCccCC-CCCeEEEEEecCCCCeEEEEeCCCccEEEEEEecCCccccEEEEEE
Confidence 445555 568899999999998887653 333 3456655543 45556677898665553 356778
Q ss_pred eecCCCCCCeeEEEEEEEc
Q psy5522 163 LQLSKSPVLGDWNITINVL 181 (1397)
Q Consensus 163 f~L~~~~~lG~y~I~v~~~ 181 (1397)
+.+|.+.+.|.|+|+..-.
T Consensus 635 W~ip~~~~~G~YRi~~~G~ 653 (674)
T PF04734_consen 635 WEIPPDTPPGTYRIRHFGD 653 (674)
T ss_dssp EE--TT--SEEEEEEEEEE
T ss_pred EECCCCCCCCCEEEEEEee
Confidence 8899999999999998764
No 84
>KOG0366|consensus
Probab=71.71 E-value=11 Score=41.95 Aligned_cols=80 Identities=18% Similarity=0.234 Sum_probs=52.1
Q ss_pred HHHHHHHhccccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHH
Q psy5522 907 VLEYLKKTYQLTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEAL 986 (1397)
Q Consensus 907 ~l~yL~~~~~l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~ 986 (1397)
+++.|. |+..-.+++.+.|+. +-|+.+|+|+...+. .+.+--|-+|+..+..-...-.+| .++.+
T Consensus 56 al~ll~---~le~~~~eeiv~~v~-------~C~~~~GGfa~~~Gh--d~hll~TlsAvQiL~~ydsi~~~d---~d~v~ 120 (329)
T KOG0366|consen 56 ALDLLG---QLERMSREEIVSFVL-------SCQHEDGGFAGCPGH--DPHLLYTLSAVQILALYDSINVLD---RDKVA 120 (329)
T ss_pred HHHHhc---ccchhhHHHHHHHHH-------heecCCCCcCCCCCC--ChHHHHHHHHHHHHHHHccccccc---HHHHH
Confidence 444443 333333556666554 478999999988764 345666777777666544332234 34468
Q ss_pred HHHHhcccCCCcccc
Q psy5522 987 AWLSSNQAVNGSFPE 1001 (1397)
Q Consensus 987 ~WL~~~q~~~G~F~s 1001 (1397)
.|+..-|+++|+|..
T Consensus 121 ~yi~gLq~edGsF~g 135 (329)
T KOG0366|consen 121 SYIKGLQQEDGSFSG 135 (329)
T ss_pred HHHHhhcCcCCcccC
Confidence 999999999999974
No 85
>PF10648 Gmad2: Immunoglobulin-like domain of bacterial spore germination; InterPro: IPR018911 This domain is found linked to IPR019606 from INTERPRO in some bacterial proteins. It is predicted to contain an immunoglobulin-like all-beta fold.
Probab=69.66 E-value=39 Score=31.92 Aligned_cols=51 Identities=18% Similarity=0.320 Sum_probs=40.5
Q ss_pred CCcceEEEEECCCCCeEEeeeee----ccCCceeeeeeecCCCCCCeeEEEEEEEc
Q psy5522 130 VTGALEIFITDGKGNRIKQWTRA----LTTRGVFSADLQLSKSPVLGDWNITINVL 181 (1397)
Q Consensus 130 ~~~~~~v~I~DP~g~~I~qw~~~----~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~ 181 (1397)
.+..+.+.|+|.+|+.+.+-... ....|-|+.++.++..+ .|.++|.+...
T Consensus 27 FEgtv~~rv~D~~g~vl~e~~~~a~~g~~~~g~F~~tv~~~~~~-~~~g~l~v~~~ 81 (88)
T PF10648_consen 27 FEGTVNIRVRDGHGEVLAEGFVTATGGAPSWGPFEGTVSFPPPP-PGKGTLEVFED 81 (88)
T ss_pred eeeEEEEEEEcCCCcEEEEeeEEeccCCCcccceEEEEEeCCCC-CCceEEEEEEe
Confidence 47889999999999998554322 34568899999999887 89999998754
No 86
>PRK10301 hypothetical protein; Provisional
Probab=68.99 E-value=14 Score=37.31 Aligned_cols=61 Identities=21% Similarity=0.177 Sum_probs=40.1
Q ss_pred cceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEEEEEEe
Q psy5522 132 GALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTKRFTVA 192 (1397)
Q Consensus 132 ~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~~F~Ve 192 (1397)
....|.+.|++|+.+..-........--....+|+.....|.|+|+...-+ ...+..|+|+
T Consensus 59 ~~s~i~v~~~~g~~v~~~~~~~~~~~~~~~~v~l~~~L~~G~YtV~Wrvvs~DGH~~~G~~~F~V~ 124 (124)
T PRK10301 59 GFSGATITGPKQENIKTLPAKRNEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGHYTFSVK 124 (124)
T ss_pred cccEEEEEcCCCCEeccCCccccCCCCcEEEEECCCCCCCccEEEEEEEEecCCCccCCeEEEEEC
Confidence 456688899999876533222222333456778877778899999988743 3456677764
No 87
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=68.48 E-value=30 Score=31.55 Aligned_cols=57 Identities=21% Similarity=0.372 Sum_probs=36.7
Q ss_pred EEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 117 FRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 117 fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+++.++|.+-+|+ ....|.|.+..+..+. ...++..|-|.+.- | ++ |.|+|++...+
T Consensus 2 I~G~V~d~~g~pv--~~a~V~l~~~~~~~~~--~~~Td~~G~f~~~~-l---~~-g~Y~l~v~~~g 58 (82)
T PF13620_consen 2 ISGTVTDATGQPV--PGATVTLTDQDGGTVY--TTTTDSDGRFSFEG-L---PP-GTYTLRVSAPG 58 (82)
T ss_dssp EEEEEEETTSCBH--TT-EEEET--TTTECC--EEE--TTSEEEEEE-E----S-EEEEEEEEBTT
T ss_pred EEEEEEcCCCCCc--CCEEEEEEEeeCCCEE--EEEECCCceEEEEc-c---CC-EeEEEEEEECC
Confidence 4677778888886 4456778887776543 35678888877661 2 22 99999998876
No 88
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=67.87 E-value=30 Score=33.79 Aligned_cols=59 Identities=22% Similarity=0.281 Sum_probs=39.9
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcCe-EEEEEEEEe
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLDQ-KFTKRFTVA 192 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~~-~~~~~F~Ve 192 (1397)
...++|.|.|..|+.|.+-....... -..++.|... ..|.|.|.+..+.. .....|.+|
T Consensus 47 ~~~vtI~I~d~~G~vVy~~~~~~~~~--~~~~I~L~~~-~~G~Y~l~i~~~~g~~l~G~F~ie 106 (106)
T PF11589_consen 47 IGDVTITIKDSTGNVVYSETVSNSAG--QSITIDLNGL-PSGEYTLEITNGNGTYLYGEFTIE 106 (106)
T ss_dssp -SEEEEEEEETT--EEEEEEESCGGT--TEEEEE-TTS--SEEEEEEEEECTC-EEEEEEEE-
T ss_pred CCCEEEEEEeCCCCEEEEEEccCCCC--cEEEEEeCCC-CCccEEEEEEeCCCCEEEEEEEEC
Confidence 46689999999999998865443332 3457777665 48999999998864 677788775
No 89
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=67.80 E-value=41 Score=31.79 Aligned_cols=80 Identities=16% Similarity=0.235 Sum_probs=57.2
Q ss_pred EEEEcCCCcccCC--ceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEE
Q psy5522 100 VFIQLDKAIYKPG--DLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNIT 177 (1397)
Q Consensus 100 vfIqTDKpIYkPG--qtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~ 177 (1397)
+-+..||...+++ +.+.+.+-++|.+-.|+.+..+.+.+..+.-..+..-...++.+|.....+. +. ..|.++|+
T Consensus 3 i~l~~~~~~~~Adg~d~~~i~v~v~D~~Gnpv~~~~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~-~~--~~G~~~vt 79 (92)
T smart00634 3 TTLTADKDTAVANGSDAITLTATVTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLT-ST--TAGVYTVT 79 (92)
T ss_pred EEEEeCCCcEEEcCcccEEEEEEEECCCCCCcCCCEEEEEECCCceeeccCCeeeeCCCCEEEEEEE-CC--CCcEEEEE
Confidence 3455677777777 8899999999999999877777777754420111122235677999999988 32 67999999
Q ss_pred EEEcC
Q psy5522 178 INVLD 182 (1397)
Q Consensus 178 v~~~~ 182 (1397)
+...+
T Consensus 80 a~~~~ 84 (92)
T smart00634 80 ASLEN 84 (92)
T ss_pred EEECC
Confidence 98875
No 90
>KOG0497|consensus
Probab=67.66 E-value=5.7 Score=49.94 Aligned_cols=67 Identities=24% Similarity=0.310 Sum_probs=49.4
Q ss_pred hccccceeeecCCCCc-cccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 931 TGYQQELTYRRPDGSF-SAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 931 ~gy~~~l~y~~~dGsf-s~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
.+.+-+-+.|.+|||| +.|+- =-+..|=|++..+..|++.+. +-..+.+++.+|.+.|+.+||+.+.
T Consensus 593 ~av~~ie~~Q~~DGSWyGsWgv----CFtY~t~Fa~~gl~aaGkty~-nc~~irka~~Fll~~Q~~~GGWgEs 660 (760)
T KOG0497|consen 593 KAVEFIEKLQLPDGSWYGSWGV----CFTYGTWFALRGLAAAGKTYE-NCEAIRKACDFLLSKQNPDGGWGES 660 (760)
T ss_pred HHHHHHHHcCCCCCcccchhhH----HHHHHHHHhcchhhhcchhhh-ccHHHHHHHHHHHhhhcccCCCccc
Confidence 3333333469999985 34432 236789999999998876544 3567899999999999999999864
No 91
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=67.55 E-value=23 Score=36.11 Aligned_cols=74 Identities=19% Similarity=0.332 Sum_probs=50.7
Q ss_pred CCCcccCCceeEEEEEEEcCCCccCCCcceEEEEE---CC--CCCe----EEeeeeec-------cCCceeeeeeecCCC
Q psy5522 105 DKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFIT---DG--KGNR----IKQWTRAL-------TTRGVFSADLQLSKS 168 (1397)
Q Consensus 105 DKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~---DP--~g~~----I~qw~~~~-------~~~Gi~~~sf~L~~~ 168 (1397)
+--+++.|++++|++-..+... =+.+.|+|- |- .+.. ...|.... .....+...+.+|.+
T Consensus 28 ~~~~~~~G~~ihfe~~i~d~~~----i~si~VeIH~nfd~H~h~~~~~~~~~~~~~~~~~~~~~g~~~~~~h~~i~IPa~ 103 (132)
T PF15418_consen 28 NCKVATRGDDIHFEADISDNSA----IKSIKVEIHNNFDHHTHSTEAGECEKPWVFEQDYDIYGGKKNYDFHEHIDIPAD 103 (132)
T ss_pred CCeEEecCCcEEEEEEEEcccc----eeEEEEEEecCcCcccccccccccccCcEEEEEEcccCCcccEeEEEeeeCCCC
Confidence 4567789999999998877532 245677771 21 1111 33444321 224578999999999
Q ss_pred CCCeeEEEEEEEcC
Q psy5522 169 PVLGDWNITINVLD 182 (1397)
Q Consensus 169 ~~lG~y~I~v~~~~ 182 (1397)
+..|.|.+.+..-+
T Consensus 104 a~~G~YH~~i~VtD 117 (132)
T PF15418_consen 104 APAGDYHFMITVTD 117 (132)
T ss_pred CCCcceEEEEEEEE
Confidence 99999999998865
No 92
>KOG0365|consensus
Probab=66.47 E-value=22 Score=41.22 Aligned_cols=100 Identities=23% Similarity=0.348 Sum_probs=59.7
Q ss_pred CCchhhhhcccchHHHHHHHHHHhccccHHHHHHHHHHH--HhccccceeeecCCCCccccCCCCCCCCc-chhHHHHHH
Q psy5522 891 FGCGEQNMLNFVPNIVVLEYLKKTYQLTDAIEAKASRYL--ETGYQQELTYRRPDGSFSAFGTTDPNGST-WLTAFVAKS 967 (1397)
Q Consensus 891 ~GCgEQ~m~~~~p~i~~l~yL~~~~~l~~~~~~~a~~~l--~~gy~~~l~y~~~dGsfs~~~~~~~~~s~-wlTAyvl~~ 967 (1397)
||-|--...++||+-.+..-|---+. ++|.+-| +.=||-+.+.|++||||..-.+ |.+ --.||...+
T Consensus 139 fgGGPGQl~HLA~TYAAVnaL~~~~~------e~A~~~InR~~l~~fL~slK~~dGgFrmh~~----GE~DvRs~YcA~s 208 (423)
T KOG0365|consen 139 FGGGPGQLPHLAPTYAAVNALCLCGS------EDAYSSINREKLYQFLFSLKDPDGGFRMHVE----GEVDVRSAYCALS 208 (423)
T ss_pred CCCCCccchhhhHHHHHHHHHHhcCc------HHHHHHhhHHHHHHHHHHhcCCCCCeEeecC----CcchHHHHHHHHH
Confidence 44455556788887544433332221 1122222 1223344557999999977533 233 236888777
Q ss_pred HHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 968 FRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 968 ~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
....-. |. +++..+...+||.+-|+..|||++-
T Consensus 209 vaslln-i~-~deL~eG~~~wi~~CQtyEGG~GG~ 241 (423)
T KOG0365|consen 209 VASLLN-IP-MDELFEGTLDWIASCQTYEGGFGGE 241 (423)
T ss_pred HHHHHC-CC-cHHHHHHHHHHHHhcccccCCcCCC
Confidence 654432 22 3466788999999999999999864
No 93
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=66.06 E-value=24 Score=32.59 Aligned_cols=50 Identities=12% Similarity=0.191 Sum_probs=35.9
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCC----CCCCeeEEEEEEE
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSK----SPVLGDWNITINV 180 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~----~~~lG~y~I~v~~ 180 (1397)
-..+.|.|.|.+|+.|+++.......|.....+.=-+ ..+-|.|++++..
T Consensus 24 a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~a 77 (81)
T PF13860_consen 24 ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVTA 77 (81)
T ss_dssp CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEEE
T ss_pred ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEEE
Confidence 4568999999999999998776667788777777221 2456999999875
No 94
>PF01483 P_proprotein: Proprotein convertase P-domain; InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=64.60 E-value=20 Score=33.54 Aligned_cols=51 Identities=18% Similarity=0.270 Sum_probs=34.6
Q ss_pred CCcceEEEEECCCCCeEEeeeeecc---CCceeeeeeec---CCCCCCeeEEEEEEE
Q psy5522 130 VTGALEIFITDGKGNRIKQWTRALT---TRGVFSADLQL---SKSPVLGDWNITINV 180 (1397)
Q Consensus 130 ~~~~~~v~I~DP~g~~I~qw~~~~~---~~Gi~~~sf~L---~~~~~lG~y~I~v~~ 180 (1397)
.-..+.|+|..|.|.++.=+..... ..+.+...|.+ -.+.+.|+|++++.-
T Consensus 14 ~~gdL~i~L~SP~Gt~~~L~~~~~~d~~~~~~~~~~f~~~~f~Ge~~~G~W~L~v~D 70 (87)
T PF01483_consen 14 YRGDLRITLISPSGTRSTLKDRRGSDDSGSGFLNWTFTSVAFWGESANGTWTLRVTD 70 (87)
T ss_dssp SGGGEEEEEE-TT--EEEEE-SSTTHHHSTSEEEEEEEESTTTT-B--EEEEEEEEE
T ss_pred CcCCEEEEEECCCCCEEEEECCcCCCcccccccccEEEEEeecCccCCCEEEEEEEE
Confidence 4577899999999998876655444 56778888875 478889999999987
No 95
>PTZ00487 ceramidase; Provisional
Probab=63.88 E-value=25 Score=45.59 Aligned_cols=73 Identities=14% Similarity=0.116 Sum_probs=52.7
Q ss_pred cccCCceeEEEEEEEcC--CCccCCCcceEEEEEC-----------CCCCeEEeeeeeccCCceeeeeeecCCCCCCeeE
Q psy5522 108 IYKPGDLVRFRAIVLNS--HLKPSVTGALEIFITD-----------GKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDW 174 (1397)
Q Consensus 108 IYkPGqtV~fRvv~ld~--~l~P~~~~~~~v~I~D-----------P~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y 174 (1397)
.|++||+|....+.-++ |++. .+.+..|+-++ .+-+...+|.......--+..++.+|++...|.|
T Consensus 607 ~y~~g~~v~~~F~~a~Prn~l~~-~~tf~~Ve~~~~~~~W~~v~~D~dw~t~~~W~r~~~~~S~~ti~W~i~~~~~~G~Y 685 (715)
T PTZ00487 607 VYSNNDTVSAEFYGGNPRNNFMT-ESSFLTVDKLNEKNQWTTILVDGDWDTKWHWKMHDLGFSLITIIWSIGPTTEPGTY 685 (715)
T ss_pred ccCCCCEEEEEEEecCCCCcccc-CcceEEEEEecCCCceeEeccCCCcceEEEEeccCCCceeEEEEEECCCCCCCeee
Confidence 49999999988888765 4544 46677777643 3445677887652222346778999999999999
Q ss_pred EEEEEEc
Q psy5522 175 NITINVL 181 (1397)
Q Consensus 175 ~I~v~~~ 181 (1397)
+|+-.-.
T Consensus 686 Ri~~~G~ 692 (715)
T PTZ00487 686 RITHSGY 692 (715)
T ss_pred EEEEeec
Confidence 9988654
No 96
>PF00432 Prenyltrans: Prenyltransferase and squalene oxidase repeat This Prosite family is a subset of the Pfam family.; InterPro: IPR001330 The beta subunit of the farnesyltransferases is responsible for peptide binding. Squalene-hopene cyclase is a bacterial enzyme that catalyzes the cyclization of squalene into hopene, a key step in hopanoid (triterpenoid) metabolism []. Lanosterol synthase (5.4.99.7 from EC) (oxidosqualene-lanosterol cyclase) catalyzes the cyclization of (S)-2,3-epoxysqualene to lanosterol, the initial precursor of cholesterol, steroid hormones and vitamin D in vertebrates and of ergosterol in fungi []. Cycloartenol synthase () (2,3-epoxysqualene-cycloartenol cyclase) is a plant enzyme that catalyzes the cyclization of (S)-2,3-epoxysqualene to cycloartenol.; GO: 0003824 catalytic activity; PDB: 2IEJ_B 1LD7_B 1LD8_B 2H6G_B 1TN6_B 1S63_B 1MZC_B 2H6I_B 2H6F_B 1JCQ_B ....
Probab=63.02 E-value=6.9 Score=31.56 Aligned_cols=21 Identities=24% Similarity=0.332 Sum_probs=18.4
Q ss_pred HHHHHHHHHhcccCCCccccc
Q psy5522 982 ILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 982 ~~~~~~WL~~~q~~~G~F~st 1002 (1397)
++++.+||.+.|+++|||...
T Consensus 3 ~~~~~~~l~~~Q~~dGGf~~~ 23 (44)
T PF00432_consen 3 VEKLIRFLLSCQNPDGGFGGR 23 (44)
T ss_dssp HHHHHHHHHHTBBTTSSBBSS
T ss_pred HHHHHHHHHHHCCCCCCCCCC
Confidence 467899999999999999865
No 97
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=61.50 E-value=47 Score=37.18 Aligned_cols=75 Identities=21% Similarity=0.320 Sum_probs=46.1
Q ss_pred EEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEE
Q psy5522 100 VFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWN 175 (1397)
Q Consensus 100 vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~ 175 (1397)
+.+.+|.-....|..+.+++. + |.....+.|+|.|.+|+.|+.-.......|...+++.=-+. .+-|.|+
T Consensus 100 V~~~~~~~~~~~~~~~~~~~~-l-----~~~a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt 173 (225)
T PRK06655 100 VLVPGDTVLVGTGGTTPFGVE-L-----PSAADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYT 173 (225)
T ss_pred EEEecceEEecCCCceEEEEE-c-----CCCCcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEE
Confidence 334444433344556666633 2 21245689999999999998765434556776665543222 4579999
Q ss_pred EEEEE
Q psy5522 176 ITINV 180 (1397)
Q Consensus 176 I~v~~ 180 (1397)
++|..
T Consensus 174 ~~V~A 178 (225)
T PRK06655 174 IKASA 178 (225)
T ss_pred EEEEE
Confidence 88865
No 98
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=61.16 E-value=9.4 Score=36.63 Aligned_cols=60 Identities=18% Similarity=0.224 Sum_probs=38.5
Q ss_pred cceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEEEEEEe
Q psy5522 132 GALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTKRFTVA 192 (1397)
Q Consensus 132 ~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~~F~Ve 192 (1397)
....|.|.|++|+++..-...... +--.....|+.....|.|+|....-+ ...+..|.|+
T Consensus 33 ~~s~v~v~~~~g~~v~~~~~~~~~-~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~~~F~V~ 97 (97)
T PF04234_consen 33 GFSSVTVTDPDGKRVDLGEPTVDG-DGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGSFSFTVK 97 (97)
T ss_dssp CC-EEEEEEEEETTSCTCEEEEEE-STTEEEEEESS---SEEEEEEEEEEETTSCEEEEEEEEEE-
T ss_pred CccEEEEEcCCCceeecCcceecC-CceEEEEECCCCCCCceEEEEEEEEecCCCCcCCEEEEEEC
Confidence 566788999999877533222222 34577888888788899999988742 4567788774
No 99
>KOG4386|consensus
Probab=60.65 E-value=17 Score=44.06 Aligned_cols=86 Identities=17% Similarity=0.242 Sum_probs=73.1
Q ss_pred CeeEEEeeeEEEEecCCceeeeCCEEEEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEe
Q psy5522 706 PKKLRVFRPFFISLDLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIK 785 (1397)
Q Consensus 706 ~~~l~vfkpffv~l~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 785 (1397)
|.-+.-.=|+-|...||.+-+..|.+.++--+-|-.+..++|++.++.+|.|-|. +..+-++.|-
T Consensus 685 phviVe~iPlhvnadlpsfgrVReslpvkyhLqnktdlvqdveisvepsDaFMFS---------------GlkqirlriL 749 (809)
T KOG4386|consen 685 PHVIVEAIPLHVNADLPSFGRVRESLPVKYHLQNKTDLVQDVEISVEPSDAFMFS---------------GLKQIRLRIL 749 (809)
T ss_pred ccceeeeccceeecCCCCcceecccccEEEEeccccceeeeEEeecccchhheec---------------ccceEEEEEc
Confidence 4455567799999999999999999999999999999999999999999999773 2345678888
Q ss_pred CCCeeEEEEEEEeceeeeEEE
Q psy5522 786 ANSGSTTTFVITPKELGYIGI 806 (1397)
Q Consensus 786 a~~~~tv~f~v~p~~lG~v~i 806 (1397)
+|..+.+.+.+-|...|..++
T Consensus 750 PGteqemlynfypLmAGyqql 770 (809)
T KOG4386|consen 750 PGTEQEMLYNFYPLMAGYQQL 770 (809)
T ss_pred CCCceEEEEEEehhhchhhhC
Confidence 999999999999988775443
No 100
>PF13115 YtkA: YtkA-like
Probab=59.80 E-value=87 Score=28.98 Aligned_cols=69 Identities=19% Similarity=0.177 Sum_probs=39.9
Q ss_pred eeeEEEEEecCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeee-cc-cc-ccceeeEeeCCCeeEEEEEeCC
Q psy5522 311 HKYKMDLIKSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFS-YD-ES-KYEANQYKLDRNGMIKLVYYPP 384 (1397)
Q Consensus 311 ~~~~i~f~~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~-~~-~~-~~~~~~~ttd~~G~~~f~i~~~ 384 (1397)
..|.|.+...+ ..+.|.+ .+.+. .+.+|+|+.+. .|.+..... .. .. .......+..+.|.....+..+
T Consensus 5 ~~~~v~l~~~~-~~~~g~~-~i~v~-~~~~g~pv~~a--~V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~ 76 (86)
T PF13115_consen 5 GGYTVELVSPE-PPKVGEN-TITVT-VDQGGKPVTDA--DVQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFS 76 (86)
T ss_pred ccEEEEEecCC-CCcCCce-EEEEE-ECCCCCCCCCC--EEEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecC
Confidence 34667776554 5688888 78888 89999999965 455543211 00 00 0112223334567666665554
No 101
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=59.27 E-value=1.3e+02 Score=29.04 Aligned_cols=78 Identities=19% Similarity=0.251 Sum_probs=49.8
Q ss_pred EEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEee--eeeccCCceeeeeeecCCCCCCeeEEEEE
Q psy5522 101 FIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQW--TRALTTRGVFSADLQLSKSPVLGDWNITI 178 (1397)
Q Consensus 101 fIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw--~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v 178 (1397)
.+..|+.+=--++.+.+.+.+.|.+..|+.+..+.+. .++.+-.+..- ...++..|.+...|. +. ..|.++|.+
T Consensus 11 ~~~~~~~~a~g~~~~tltatV~D~~gnpv~g~~V~f~-~~~~~~~l~~~~~~~~Td~~G~a~~tlt-st--~aG~~~VtA 86 (100)
T PF02369_consen 11 SIVADKAVADGSDTNTLTATVTDANGNPVPGQPVTFS-SSSSGGTLSPTNTSATTDSNGIATVTLT-ST--KAGTYTVTA 86 (100)
T ss_dssp EEEES-EESSSSS-EEEEEEEEETTSEB-TS-EEEE---EESSSEES-CEE-EEE-TTSEEEEEEE--S--S-EEEEEEE
T ss_pred eeeecceEeCCcCcEEEEEEEEcCCCCCCCCCEEEEE-EcCCCcEEecCccccEECCCEEEEEEEE-ec--CceEEEEEE
Confidence 3444444444788899999999999999977777773 33444444332 357889999988876 22 679999999
Q ss_pred EEcC
Q psy5522 179 NVLD 182 (1397)
Q Consensus 179 ~~~~ 182 (1397)
...+
T Consensus 87 ~~~~ 90 (100)
T PF02369_consen 87 TVDG 90 (100)
T ss_dssp EETT
T ss_pred EECC
Confidence 9987
No 102
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=56.24 E-value=5.8e+02 Score=33.28 Aligned_cols=340 Identities=16% Similarity=0.154 Sum_probs=159.5
Q ss_pred CCCceeEEEEEecCc---eeeEeeEEEEEEe--cc-c--EEEE-EcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEE
Q psy5522 68 LGPGQYNLTVKGSGS---LNFYNSTGLVYVH--KS-Y--SVFI-QLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFI 138 (1397)
Q Consensus 68 ~~~~~~~l~v~g~~~---~~f~~~~~v~~~~--~~-~--~vfI-qTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I 138 (1397)
..+|.|+|.+...+. -.|..+..+.+.= .. . .-|+ -.++|+ -.|+.|.|++.+-... +......|
T Consensus 253 ~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~Vk~~~~vkI~~~~~d~~s~q-l~g~~I~ika~a~GG~-----~llYrf~I 326 (667)
T PRK14081 253 KKSGDYKLLCLVKDMYSNNEFDDRAVLVYTVKPYKDIKIRNFTTDLSSPQ-LTDTDIELKAVAEGGK-----ELLYRFII 326 (667)
T ss_pred CCCccEEEEEEEeccCcccccccceEEEEEEecCCCcEEEEEEEcCCCCe-ecCceEEEEEEecCCC-----ceEEEEEE
Confidence 356778777666432 2344444444331 11 1 1111 124666 4677777777765432 11233344
Q ss_pred ECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC--------eEEEEEEEEeeec-CCcEEEEEeCC-Cc
Q psy5522 139 TDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD--------QKFTKRFTVAEYV-LPKFEVNVNVP-PH 208 (1397)
Q Consensus 139 ~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~--------~~~~~~F~VeEYv-LPkFeV~v~~p-~~ 208 (1397)
. |..+. ....| -..+|.-.++ ..|.|.|.+.+.+ ....-.|.|++|. .|-.=-.|..| +.
T Consensus 327 ~---G~~~e-----~~~Y~-~~n~~~w~P~-~~G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d~~~ 396 (667)
T PRK14081 327 K---GKESE-----DSGYI-RNNIYTWKPK-IAGKYSITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILDKGK 396 (667)
T ss_pred C---CcEEe-----ecccc-ccceEEEeeC-CCceEEEEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEECCCC
Confidence 2 43332 11111 2223333222 4799999999976 2345588898883 34322333444 34
Q ss_pred eEeeCCeEEEEEEEEEecCccccceEEEEEEeccccCccc--ccc--ccceeEEeecCceEEEEEechhhhccccccceE
Q psy5522 209 ATFKDSKVVISVNAKYTYGKPVKGEATITAYPTIFSGVIQ--PLF--QTPVRKVVPIDGKTVIEFDVVKELQLTDEYERN 284 (1397)
Q Consensus 209 i~~~~~~~~v~v~AkYtyGkPV~G~v~v~v~~~~~~g~~~--~~~--~~~~~~~~d~~G~~~~~~~~~~~~~~~~~~~~~ 284 (1397)
-.+.|+.+.+++.|.=+ . .+.++.-.+. .|... .++ ...+.-+-...|..++.+-.++ -.....|...
T Consensus 397 ~~lvG~~i~i~v~a~gg--~----~~lY~f~ik~-ng~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKd-k~S~~~yD~~ 468 (667)
T PRK14081 397 HILKGEEIKIRVIAEGG--T----NLRYSFIIKK-DGKEEEKIDYGKNNWVNFIPEEKGNYELEVRVKD-KYSDKEYDAH 468 (667)
T ss_pred ceEeCCeEEEEEEecCC--C----eEEEEEEEEE-CCEEEEEeecccccEEEEEECCCeeEEEEEEEec-ccCchhcccc
Confidence 44568888888875432 2 1222211110 01100 000 0111122233444444444433 1111122222
Q ss_pred EEEEEEEEECCcceEEeeEEEEEEEEeeeEEEEE--ecCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc
Q psy5522 285 IHFDVAVEEALTGRRQNNTGSVVFHKHKYKMDLI--KSSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES 362 (1397)
Q Consensus 285 l~V~a~VtE~gTG~~~~~~~~i~i~~~~~~i~f~--~~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~ 362 (1397)
-.|...|.|. .|..+... +...++.-|.+.++.+...+.++.-. .-.+..+..|..
T Consensus 469 k~v~l~V~e~----------------~P~~i~~il~~~~~~~~vg~~i~~~~~~~~~k~v~y-----~y~~~~NG~~v~- 526 (667)
T PRK14081 469 TIVYIKVHEY----------------IPAEIDYILLPVKEYYLVGDDIEIEVIIQNTKDVLI-----KYILKINGHKVE- 526 (667)
T ss_pred eEEEEEEecc----------------CceeeeeEEecccccEEeCCEEEEEEEEeCCCeEEE-----EEEEEECCEEEE-
Confidence 2222233221 23333332 33466778998888888877543221 111221112210
Q ss_pred ccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcce-------EEEEEEeecCCCCCeEEEEecCCCCCCCCeEE
Q psy5522 363 KYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKE-------WFSTISASESPSNSFIQAALLTQNPKVNKDVE 435 (1397)
Q Consensus 363 ~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~-------~~~~v~~~~S~s~s~L~i~~~~~~~~~G~~~~ 435 (1397)
.+.+. .+. .++|.| ...|.|.+.+-|+..-+.. -.+.+. ....-.-..|..++....||++++
T Consensus 527 ---~t~Ys--~~~--~ysf~P-~~~GkY~I~V~aKn~~s~~~~D~~k~v~~~V~--e~~pi~nt~~~~~~~~~~~n~~~t 596 (667)
T PRK14081 527 ---ETDYI--KNK--KYKFIP-KCSGKYTIEVLAKNIKSTEEYDSKKEVKFYVR--EALPITNTKIKTSKKKFKCNEEVT 596 (667)
T ss_pred ---Eeecc--ccc--eEEEee-cCCceEEEEEEEcccccccccccceEEEEEEc--CCCCceeeEEEeecceEEcCCeEE
Confidence 11111 122 344443 3456766666665443321 112222 122233455667788999999999
Q ss_pred EEEEecCCcc-eEEEEEEeCCeEEEEEEE
Q psy5522 436 LEINSTAPLK-YISYQVLGRGDVIMADTI 463 (1397)
Q Consensus 436 v~v~s~~~~~-~~~ylV~srG~Iv~~~~~ 463 (1397)
+.+.+...-. ..-|.||.+|.-.....-
T Consensus 597 ~~~~~~gg~~v~Yef~v~~~g~w~~vq~y 625 (667)
T PRK14081 597 FSVKSEGGKDVCYEFYIMEKGEWKLVQKY 625 (667)
T ss_pred EEEEccCCCcEEEEEEEEECCcEEEEeec
Confidence 9999763222 355678999987665543
No 103
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=55.45 E-value=1.1e+02 Score=38.12 Aligned_cols=86 Identities=23% Similarity=0.232 Sum_probs=51.9
Q ss_pred CccEEEEEEEEeecCCCCeeEEeEEEEECCCceeEEEEEcCCCCCceeEEEEEec--Cc-eeeEeeEEEEEEecc-----
Q psy5522 26 SEATQVSVEVGGKQDSGGEFRTKQLLSVDPFSTRIVKLDIGDLGPGQYNLTVKGS--GS-LNFYNSTGLVYVHKS----- 97 (1397)
Q Consensus 26 ~~~v~v~v~L~~~~~~~~~~~~~~~~~~~~~~~~~~~f~vP~~~~~~~~l~v~g~--~~-~~f~~~~~v~~~~~~----- 97 (1397)
++.+.|+++|... ..+........++.+. +.+++.+..+..|.|.|.|++. +| .....+..+++....
T Consensus 338 ~g~~~vta~V~d~---~g~~~~~~~~~v~d~s-~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~~~~~~~~~ 413 (478)
T PRK13211 338 TGDMNVEATVYNH---DGEALGSKSQTVNDGS-QSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLEAKDPPPSG 413 (478)
T ss_pred ccceEEEEEEEcC---CCCeeeeeeEEecCCc-eeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEEecCCCCCC
Confidence 4456666666543 1122333334444433 5667777788889998888875 33 445677777774221
Q ss_pred cEEEEEcCC-CcccCCcee
Q psy5522 98 YSVFIQLDK-AIYKPGDLV 115 (1397)
Q Consensus 98 ~~vfIqTDK-pIYkPGqtV 115 (1397)
..=|.+-+- ..|+.|++|
T Consensus 414 ~~~~~~~~g~~~Y~aGd~V 432 (478)
T PRK13211 414 DYDFVFPEGLKSYTAGTKV 432 (478)
T ss_pred CcCccccCCcccccCCCEE
Confidence 112455556 889999987
No 104
>PF00432 Prenyltrans: Prenyltransferase and squalene oxidase repeat This Prosite family is a subset of the Pfam family.; InterPro: IPR001330 The beta subunit of the farnesyltransferases is responsible for peptide binding. Squalene-hopene cyclase is a bacterial enzyme that catalyzes the cyclization of squalene into hopene, a key step in hopanoid (triterpenoid) metabolism []. Lanosterol synthase (5.4.99.7 from EC) (oxidosqualene-lanosterol cyclase) catalyzes the cyclization of (S)-2,3-epoxysqualene to lanosterol, the initial precursor of cholesterol, steroid hormones and vitamin D in vertebrates and of ergosterol in fungi []. Cycloartenol synthase () (2,3-epoxysqualene-cycloartenol cyclase) is a plant enzyme that catalyzes the cyclization of (S)-2,3-epoxysqualene to cycloartenol.; GO: 0003824 catalytic activity; PDB: 2IEJ_B 1LD7_B 1LD8_B 2H6G_B 1TN6_B 1S63_B 1MZC_B 2H6I_B 2H6F_B 1JCQ_B ....
Probab=54.64 E-value=8.5 Score=31.00 Aligned_cols=20 Identities=30% Similarity=0.635 Sum_probs=17.8
Q ss_pred hhhhhhhhhcccCCCCcccc
Q psy5522 1257 LPILTWLVTQQNDQGGFAST 1276 (1397)
Q Consensus 1257 ~~i~~Wl~~q~n~~Ggf~st 1276 (1397)
.++++||.+.|++.|||+..
T Consensus 4 ~~~~~~l~~~Q~~dGGf~~~ 23 (44)
T PF00432_consen 4 EKLIRFLLSCQNPDGGFGGR 23 (44)
T ss_dssp HHHHHHHHHTBBTTSSBBSS
T ss_pred HHHHHHHHHHCCCCCCCCCC
Confidence 57899999999999999765
No 105
>PF14326 DUF4384: Domain of unknown function (DUF4384)
Probab=54.52 E-value=65 Score=29.88 Aligned_cols=63 Identities=19% Similarity=0.382 Sum_probs=37.7
Q ss_pred CcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceee--eeeecCCCCCCeeEEEEE
Q psy5522 107 AIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFS--ADLQLSKSPVLGDWNITI 178 (1397)
Q Consensus 107 pIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~--~sf~L~~~~~lG~y~I~v 178 (1397)
+.|+.||+|+|++=+ ....++.|.-.|++|+...=..+.......+. ..+.+|+. -..|.+++
T Consensus 1 ~~~~~Ge~v~~~~~~-------~~~~Yl~l~~~~~~G~v~~L~Pn~~~~~~~v~ag~~~~iP~~--~~~~~~~v 65 (83)
T PF14326_consen 1 TVYRVGERVRFRVTS-------NRDGYLYLFYIDADGKVTLLFPNRYQPDNFVKAGQTYTIPDP--GDRFSFTV 65 (83)
T ss_pred CcccCCCEEEEEEEe-------CCCeEEEEEEECCCCCEEEEecCccccCceEcCCceEEcCCC--CCceEEEE
Confidence 579999999998765 12457777778999977654444322223222 24566643 23444444
No 106
>PF13199 Glyco_hydro_66: Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=53.90 E-value=64 Score=41.01 Aligned_cols=93 Identities=16% Similarity=0.194 Sum_probs=0.0
Q ss_pred cCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeee----eecCCCCCCeeEEEEEE
Q psy5522 104 LDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSAD----LQLSKSPVLGDWNITIN 179 (1397)
Q Consensus 104 TDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~s----f~L~~~~~lG~y~I~v~ 179 (1397)
|||..|+|||+|.|.+=..+..-..- ...+.|++.-=+ +.|.+-.....-.+.-+.+ +..|.+-- .-|-+.+.
T Consensus 2 tDKA~Y~PGe~V~l~~~~~~~~~~~~-~g~~~v~~~~l~-~~V~~~~~~~~~~~~~~~~~~~~w~~P~~df-~GYlv~i~ 78 (559)
T PF13199_consen 2 TDKARYRPGEKVTLTASLKNTTGSDF-SGTVKVRYYHLN-EVVGETKQSVDLTSGASWTLTIEWTAPADDF-TGYLVEIY 78 (559)
T ss_dssp ES-SSB-TTS-EEEE-EEE--SSS-E-EEEEEEEEEETT-EEEEEEEEEEEE-TT-EEE-TTSEEE-TTSS-EEEEEEEE
T ss_pred CCcceeCCCCeEEEEEEeccCccccc-cceEEEEEEEcC-eEeeeeeeEEeecCCCcceeeEEEECCcccC-eeEEEEEE
Q ss_pred EcC-----eEEEEEEEEee--ecCCcE
Q psy5522 180 VLD-----QKFTKRFTVAE--YVLPKF 199 (1397)
Q Consensus 180 ~~~-----~~~~~~F~VeE--YvLPkF 199 (1397)
... ...+..+.|.. .+-||+
T Consensus 79 ~~~~g~~~~~~t~aiDVSsdW~~fPRY 105 (559)
T PF13199_consen 79 ADSGGKEVDSATIAIDVSSDWTRFPRY 105 (559)
T ss_dssp E-TT--EEEEEEEEEEE-SSTTSS--E
T ss_pred EecCCceeeeeeEEEEecCCcccCCcc
No 107
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=51.87 E-value=1e+02 Score=31.43 Aligned_cols=96 Identities=14% Similarity=0.079 Sum_probs=0.0
Q ss_pred CeEEEEEEEEccCCCccCCCCceEEEEEeeecccc-ccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcce-EE
Q psy5522 328 LKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES-KYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKE-WF 405 (1397)
Q Consensus 328 ~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~-~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~-~~ 405 (1397)
+...+.=.+.|+.|+|++ +..|.+....+-... ........|+++|...|.+.+ |.|.+.+...-..... ..
T Consensus 1 MsV~ISGvL~dg~G~pv~--g~~I~L~A~~tS~~Vv~~t~as~~t~~~G~Ys~~~ep----G~Y~V~l~~~g~~~~~vG~ 74 (134)
T PF08400_consen 1 MSVKISGVLKDGAGKPVP--GCTITLKARRTSSTVVVGTVASVVTGEAGEYSFDVEP----GVYRVTLKVEGRPPVYVGD 74 (134)
T ss_pred CeEEEEEEEeCCCCCcCC--CCEEEEEEccCchheEEEEEEEEEcCCCceEEEEecC----CeEEEEEEECCCCceeEEE
Q ss_pred EEEEeecCCC--CCeEEEEecCCCCCC
Q psy5522 406 STISASESPS--NSFIQAALLTQNPKV 430 (1397)
Q Consensus 406 ~~v~~~~S~s--~s~L~i~~~~~~~~~ 430 (1397)
.++..-..|. |.|| ..+....++|
T Consensus 75 I~V~~dS~pGTLN~fL-~~~~e~dl~P 100 (134)
T PF08400_consen 75 ITVYEDSKPGTLNDFL-TAPDEDDLRP 100 (134)
T ss_pred EEEecCCCCCcHHHHh-hccccccCCH
No 108
>TIGR03437 Soli_cterm Solibacter uncharacterized C-terminal domain. This model describes a protein domain found in 90 proteins of Solibacter usitatus Ellin6076, nearly always as the C-terminal domain of a much larger protein. No homologs to this domain are detected outside of S. usitatus, a member of the Acidobacteria.
Probab=51.61 E-value=3.4e+02 Score=30.19 Aligned_cols=119 Identities=15% Similarity=0.239 Sum_probs=72.3
Q ss_pred eEEEEEcC-CCCCceeEEEEEecCceeeEeeEEEEEEecccEEEEE--------------------cCCCcccCCceeEE
Q psy5522 59 RIVKLDIG-DLGPGQYNLTVKGSGSLNFYNSTGLVYVHKSYSVFIQ--------------------LDKAIYKPGDLVRF 117 (1397)
Q Consensus 59 ~~~~f~vP-~~~~~~~~l~v~g~~~~~f~~~~~v~~~~~~~~vfIq--------------------TDKpIYkPGqtV~f 117 (1397)
..++|++| .+..+...+.|+-.++ -.....+.+.....-+|.. .+.|. ||||.|.+
T Consensus 51 ~QIN~qvP~~~~~g~~~v~V~~~g~--~s~~~~v~va~~aPgiFt~~~~g~g~~a~~~~~d~s~~~~~~PA-~~G~~i~i 127 (215)
T TIGR03437 51 GQINAQVPYEVAPGAATVTVTYNGG--ASAAVTVTVAAAAPGIFTLDGSGTGQAAALNNQDGSVNSAANPA-APGDVVVL 127 (215)
T ss_pred ceEEEEeCCCcCCCcEEEEEEeCCc--ccceEEEEeeccccEEEEecCCCCcceeEEEcCCCccccCCCCC-CCCCEEEE
Confidence 45889999 4555666666664222 1344455555555555542 35566 89999888
Q ss_pred EEEEEcC------CCccC-------CCcceEEEEECCCCCeEEeeeeecc-CCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 118 RAIVLNS------HLKPS-------VTGALEIFITDGKGNRIKQWTRALT-TRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 118 Rvv~ld~------~l~P~-------~~~~~~v~I~DP~g~~I~qw~~~~~-~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
-+--+.. +..+. ...++++.|=.-.=. -++.-... .-|++|..+.||.....|.+.|.+..++
T Consensus 128 y~TG~G~~~p~~~~G~~~~~~~~~~~~~~v~vtigG~~a~--V~yaGlaPg~~Gl~QvNv~vP~~~~~G~~~v~itvgg 204 (215)
T TIGR03437 128 YATGLGPTSPAVADGAPAPSSPLAPALAPVTVTIGGVPAT--VLYAGLAPGFVGLYQVNVRVPAGLATGAVPVVITVGG 204 (215)
T ss_pred EEeCCCCCCCccccCccccCCccccccCCeEEEECCEEee--EEEeccCCCCCceEEEEEEcCCCCCCCcEeEEEEECC
Confidence 7765421 11111 134567776221111 12222222 3499999999999999999999999987
No 109
>PF11614 FixG_C: IG-like fold at C-terminal of FixG, putative oxidoreductase; PDB: 2R39_A.
Probab=51.40 E-value=1.8e+02 Score=28.84 Aligned_cols=63 Identities=14% Similarity=0.190 Sum_probs=40.8
Q ss_pred EEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEe-ce---eeeEEEEE
Q psy5522 733 IPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITP-KE---LGYIGIKV 808 (1397)
Q Consensus 733 l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p-~~---lG~v~itV 808 (1397)
-.+.|.|-.++...+++++...+++++.. ....+.|+|++...+.+.|.. .. .|..+|++
T Consensus 35 Y~lkl~Nkt~~~~~~~i~~~g~~~~~l~~----------------~~~~i~v~~g~~~~~~v~v~~p~~~~~~~~~~i~f 98 (118)
T PF11614_consen 35 YTLKLTNKTNQPRTYTISVEGLPGAELQG----------------PENTITVPPGETREVPVFVTAPPDALKSGSTPITF 98 (118)
T ss_dssp EEEEEEE-SSS-EEEEEEEES-SS-EE-E----------------S--EEEE-TT-EEEEEEEEEE-GGG-SSSEEEEEE
T ss_pred EEEEEEECCCCCEEEEEEEecCCCeEEEC----------------CCcceEECCCCEEEEEEEEEECHHHccCCCeeEEE
Confidence 45668899999999999999877777621 145789999999999988874 22 35556666
Q ss_pred EEE
Q psy5522 809 TAT 811 (1397)
Q Consensus 809 ~A~ 811 (1397)
.+.
T Consensus 99 ~v~ 101 (118)
T PF11614_consen 99 TVT 101 (118)
T ss_dssp EEE
T ss_pred EEE
Confidence 665
No 110
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=50.43 E-value=35 Score=31.05 Aligned_cols=57 Identities=16% Similarity=0.170 Sum_probs=31.6
Q ss_pred EEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEecC
Q psy5522 332 AYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEYLD 400 (1397)
Q Consensus 332 ~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~ 400 (1397)
+.-+|+|.+|+|+++ ..|.+. .. .........||++|...|.--++ +. +.|++....
T Consensus 2 I~G~V~d~~g~pv~~--a~V~l~--~~---~~~~~~~~~Td~~G~f~~~~l~~---g~--Y~l~v~~~g 58 (82)
T PF13620_consen 2 ISGTVTDATGQPVPG--ATVTLT--DQ---DGGTVYTTTTDSDGRFSFEGLPP---GT--YTLRVSAPG 58 (82)
T ss_dssp EEEEEEETTSCBHTT---EEEET-------TTTECCEEE--TTSEEEEEEE-S---EE--EEEEEEBTT
T ss_pred EEEEEEcCCCCCcCC--EEEEEE--Ee---eCCCEEEEEECCCceEEEEccCC---Ee--EEEEEEECC
Confidence 344677899999995 345443 11 12345677899999888772222 45 455554443
No 111
>PF00868 Transglut_N: Transglutaminase family; InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=49.85 E-value=1.2e+02 Score=30.33 Aligned_cols=82 Identities=5% Similarity=0.197 Sum_probs=45.1
Q ss_pred cccEEEEEcCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeee--ccCCceeeeeeecCCCCCCee
Q psy5522 96 KSYSVFIQLDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRA--LTTRGVFSADLQLSKSPVLGD 173 (1397)
Q Consensus 96 ~~~~vfIqTDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~--~~~~Gi~~~sf~L~~~~~lG~ 173 (1397)
+...+-|.-+||.....|.+.|-.- +.++=....+..+.+-+.... .-..|... ..+.+.+..+...|.+++.|.
T Consensus 33 Q~F~i~l~f~r~~~~~~d~l~l~~~-~G~~P~~~~gT~~~~~~~~~~--~~~~W~a~v~~~~~~~~tv~V~spa~A~VG~ 109 (118)
T PF00868_consen 33 QPFTITLRFNRPFDPSKDQLSLEFE-TGPNPSESKGTKVVFPVSSSL--DSSSWSARVESQDGNSVTVSVTSPANAPVGR 109 (118)
T ss_dssp SEEEEEEEESSS--TTTEEEEEEEE-ESSS--TTTTSEEEEEECSSS---TSSSEEEEEEEETTEEEEEEE--TTS--EE
T ss_pred CEEEEEEEEcCCcCCCCcEEEEEEE-EecccccCCCcEEEEEEccCC--CCCCEEEEEEecCCCEEEEEEECCCCCceEE
Confidence 4456777788885555666665443 232211123456666662221 12367643 334566899999999999999
Q ss_pred EEEEEEE
Q psy5522 174 WNITINV 180 (1397)
Q Consensus 174 y~I~v~~ 180 (1397)
|++.+..
T Consensus 110 y~l~v~~ 116 (118)
T PF00868_consen 110 YKLSVET 116 (118)
T ss_dssp EEEEEEE
T ss_pred EEEEEEE
Confidence 9999875
No 112
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=48.67 E-value=1.8e+02 Score=33.32 Aligned_cols=63 Identities=13% Similarity=0.222 Sum_probs=42.9
Q ss_pred ceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEEEEEEEc
Q psy5522 113 DLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWNITINVL 181 (1397)
Q Consensus 113 qtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~I~v~~~ 181 (1397)
..+.|++- + |..-..+.|.|.|.+|+.|+.........|...+++.=-+. .+-|.|+|++...
T Consensus 128 ~~~~~~~~-l-----~~~a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~A~ 194 (259)
T PRK12812 128 ELIALKLY-F-----PEDSDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAVYN 194 (259)
T ss_pred ceeEEEEe-c-----CCcCceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEEEE
Confidence 46666633 2 22235689999999999998776445567877666655332 4569999999854
No 113
>PF14900 DUF4493: Domain of unknown function (DUF4493)
Probab=46.59 E-value=4.7e+02 Score=29.37 Aligned_cols=44 Identities=16% Similarity=0.395 Sum_probs=30.0
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
.....|.|.+..|..+..|...... ...+.|| .|.|+|.|..++
T Consensus 32 ~~~~~v~I~~~~~~~~~~~~~~~~~----~~~i~L~----~G~Ytv~A~~g~ 75 (235)
T PF14900_consen 32 VDDFTVEIYNADGTVVKYWKYSEMP----GESIELP----VGSYTVKASYGD 75 (235)
T ss_pred cCceEEEEEeCCCcEEEecchhccc----cceEeec----CCcEEEEEEcCC
Confidence 5678899999997666666433222 3556655 499999999653
No 114
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=46.08 E-value=90 Score=32.10 Aligned_cols=63 Identities=14% Similarity=0.262 Sum_probs=44.1
Q ss_pred ceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeee-eccCCceeeeeeecCCCCCCeeEEEEEEE
Q psy5522 113 DLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTR-ALTTRGVFSADLQLSKSPVLGDWNITINV 180 (1397)
Q Consensus 113 qtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~-~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~ 180 (1397)
.++.+++ .|.+-.|.....+++.+.-|....-.+-.. .....|.+....+++ ..|.|.|++..
T Consensus 69 ~~~~i~~--~d~~g~~~~~~~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~---~~G~W~l~l~~ 132 (146)
T PF05751_consen 69 NSLTIRL--TDPNGAPVSGAKLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLL---KKGRWYLRLDW 132 (146)
T ss_pred CeEEEEE--EcCCCCcCcCceEEEEEECCCCccCCeeEEeeECCCceEEEEcCCC---CCccEEEEEEE
Confidence 4454444 346677777778889888887655544333 346789888888777 67999999944
No 115
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=45.20 E-value=1.2e+02 Score=33.89 Aligned_cols=51 Identities=22% Similarity=0.273 Sum_probs=38.3
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEEEEEEEc
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWNITINVL 181 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~I~v~~~ 181 (1397)
...++|.|.|.+|+.|+.........|...+++.=-+. .+-|.|+++|...
T Consensus 121 a~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~a~ 175 (221)
T PRK12634 121 AGFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTATQT 175 (221)
T ss_pred CCeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEEEE
Confidence 46789999999999998876545566877777765332 4569999999753
No 116
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=42.51 E-value=79 Score=31.79 Aligned_cols=62 Identities=16% Similarity=0.199 Sum_probs=45.8
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-----eEEEEEEEEe
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD-----QKFTKRFTVA 192 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~-----~~~~~~F~Ve 192 (1397)
..+..+.|.+|+|.++.--.....+.---+...+|+....-|.|++.-..-+ ...+..|.|+
T Consensus 59 ~~fs~~~l~~~d~~~v~t~~~~~~~~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~~sFsV~ 125 (127)
T COG2372 59 PGFSGAKLTGPDGEEVATAGTKLDEQNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGSISFSVG 125 (127)
T ss_pred CCcceeEEECCCCCccccCcccccccCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccEEEEEec
Confidence 4557789999999998754444444434678899999999999998877642 4567788875
No 117
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=41.75 E-value=1.5e+02 Score=34.57 Aligned_cols=69 Identities=12% Similarity=0.181 Sum_probs=42.1
Q ss_pred cCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeecc--CCceeeeeeecCCC----CCCeeEEEEEEE
Q psy5522 110 KPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALT--TRGVFSADLQLSKS----PVLGDWNITINV 180 (1397)
Q Consensus 110 kPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~--~~Gi~~~sf~L~~~----~~lG~y~I~v~~ 180 (1397)
..|..+.|++. ++... |.....+.|.|.|.+|+.|+....... ..|...+++.=-+. .+-|.|+++|..
T Consensus 146 ~~g~~~~~~~~-l~~~~-~~~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~Yt~~V~a 220 (295)
T PRK05842 146 DGNNKLSFSLF-FDEKI-DASKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGNYKIKAEY 220 (295)
T ss_pred cCCCceEEEEe-ccccc-cccCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcceEEEEEE
Confidence 34446666632 22211 223457899999999999987653222 34766666544222 456999999864
No 118
>PF15432 Sec-ASP3: Accessory Sec secretory system ASP3
Probab=41.57 E-value=2.3e+02 Score=28.85 Aligned_cols=80 Identities=15% Similarity=0.215 Sum_probs=58.3
Q ss_pred cCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC-
Q psy5522 104 LDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD- 182 (1397)
Q Consensus 104 TDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~- 182 (1397)
-+=|+-|+|++=++. ++.+..|.+...+.|...|-+|..|..-..... +.+|..|+++ =.|+|..-..|
T Consensus 45 P~LPlLk~G~~Y~l~---~~~~~~P~~svylki~F~dr~~e~i~~~i~k~~-----~~~F~yP~~a--ysY~I~LinaG~ 114 (128)
T PF15432_consen 45 PSLPLLKRGHTYQLK---FNIDVVPENSVYLKIIFFDRQGEEIEEQIIKND-----SFEFTYPEEA--YSYTISLINAGC 114 (128)
T ss_pred CCCCEecCCCEEEEE---EEEEEccCCeEEEEEEEEccCCCEeeEEEEecC-----ceEEeCCCCc--eEEEEEEeeCCC
Confidence 466999999996653 455667877788999999999999987653322 3688889886 67888887766
Q ss_pred -eEEEEEEEEee
Q psy5522 183 -QKFTKRFTVAE 193 (1397)
Q Consensus 183 -~~~~~~F~VeE 193 (1397)
.-.-+.|.++|
T Consensus 115 ~~l~F~~i~I~e 126 (128)
T PF15432_consen 115 QSLTFHSIEISE 126 (128)
T ss_pred CeeEEeEEEEEE
Confidence 34455555544
No 119
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=41.24 E-value=2.5e+02 Score=25.90 Aligned_cols=52 Identities=21% Similarity=0.413 Sum_probs=37.1
Q ss_pred EEEEEEcCC-CccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 117 FRAIVLNSH-LKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 117 fRvv~ld~~-l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+++.++|.+ -.|. ..+.|.+.+.+ + ...+++.|-|... ++ .|.|+|.+..-|
T Consensus 2 i~G~V~d~~t~~pl--~~a~V~~~~~~-----~-~~~Td~~G~F~i~--~~----~g~~~l~is~~G 54 (88)
T PF13715_consen 2 ISGKVVDSDTGEPL--PGATVYLKNTK-----K-GTVTDENGRFSIK--LP----EGDYTLKISYIG 54 (88)
T ss_pred EEEEEEECCCCCCc--cCeEEEEeCCc-----c-eEEECCCeEEEEE--Ec----CCCeEEEEEEeC
Confidence 567788876 6776 45667777665 1 2456788887777 43 588999999877
No 120
>PF00775 Dioxygenase_C: Dioxygenase; InterPro: IPR000627 This entry represents the C-terminal domain common to several intradiol ring-cleavage dioxygenases. Dioxygenases catalyse the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms. Cleavage of aromatic rings is one of the most important functions of dioxygenases, which play key roles in the degradation of aromatic compounds. The substrates of ring-cleavage dioxygenases can be classified into two groups according to the mode of scission of the aromatic ring. Intradiol enzymes use a non-haem Fe(III) to cleave the aromatic ring between two hydroxyl groups (ortho-cleavage), whereas extradiol enzymes (IPR000486 from INTERPRO) use a non-haem Fe(II) to cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage) []. These two subfamilies differ in sequence, structural fold, iron ligands, and the orientation of second sphere active site amino acid residues. Enzymes that belong to the intradiol family include catechol 1,2-dioxygenase (1,2-CTD) (1.13.11.1 from EC); protocatechuate 3,4-dioxygenase (3,4-PCD) (1.13.11.3 from EC); and chlorocatechol 1,2-dioxygenase (1.13.11.1 from EC) [].; GO: 0003824 catalytic activity, 0008199 ferric iron binding, 0006725 cellular aromatic compound metabolic process, 0055114 oxidation-reduction process; PDB: 2BUV_A 2BUX_A 2BUU_A 2BUR_A 1EO9_A 2BUZ_A 2BV0_A 1EO2_A 1EOC_A 1EOA_A ....
Probab=41.13 E-value=66 Score=34.82 Aligned_cols=28 Identities=25% Similarity=0.433 Sum_probs=21.6
Q ss_pred ccCCCeEEEEEEEEccCCCccCCCCceEEE
Q psy5522 324 FKPGLKYTAYMKLTHHDGTPVTDNNNMVQV 353 (1397)
Q Consensus 324 ~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v 353 (1397)
-.+|.++.+..+|+|.+|+|+++ ..|.+
T Consensus 24 ~~~G~~l~l~G~V~D~~g~Pv~~--A~vei 51 (183)
T PF00775_consen 24 DAPGEPLVLHGRVIDTDGKPVPG--ALVEI 51 (183)
T ss_dssp TSSS-EEEEEEEEEETTSSB-TT--EEEEE
T ss_pred CCCCCEEEEEEEEECCCCCCCCC--cEEEE
Confidence 56999999999999999999995 34544
No 121
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=40.40 E-value=96 Score=34.89 Aligned_cols=51 Identities=14% Similarity=0.205 Sum_probs=38.7
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCC----CCCeeEEEEEEEc
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKS----PVLGDWNITINVL 181 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~----~~lG~y~I~v~~~ 181 (1397)
-..+.|.|.|.+|+.|+.........|...+.+.--+. .+-|.|+++|...
T Consensus 128 a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a~ 182 (230)
T PRK12633 128 ATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSAS 182 (230)
T ss_pred CcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEEE
Confidence 45689999999999999776545567887777766432 4679999999763
No 122
>KOG0367|consensus
Probab=39.64 E-value=91 Score=35.77 Aligned_cols=55 Identities=25% Similarity=0.466 Sum_probs=31.8
Q ss_pred eecCCCCccccCCCCCCCCcchhHHHHHH--HHHc-cCCCccCHHHHHHHHHHHHhcccCCCccc
Q psy5522 939 YRRPDGSFSAFGTTDPNGSTWLTAFVAKS--FRQA-ASHTTIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus 939 y~~~dGsfs~~~~~~~~~s~wlTAyvl~~--~~~a-~~~i~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
-|++||||..-... ++.+ | -||.-+ .+-. ...-.+|.+ +++.|+.++|+++|||+
T Consensus 137 ~Q~~dGsF~~~~~G-Se~D--m-RFvYcA~aI~ymLd~~s~iD~e---k~~~yI~~~q~YdgGfg 194 (347)
T KOG0367|consen 137 CQRPDGSFVSINVG-SESD--M-RFVYCAVAICYMLDFWSGIDKE---KLIGYIRSSQRYDGGFG 194 (347)
T ss_pred hcCCCCceeecCCC-Cchh--h-HHHHHHHHHHHHhccccccCHH---HHHHHHHHhhccccccc
Confidence 68999999654322 1222 2 122111 1111 112346654 45899999999999997
No 123
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. This family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases which are mononuclear non-heme iron enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings. The members are intradiol-cleaving enzymes which break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. Catechol 1,2-dioxygenases are mostly homodimers with one catalytic ferric ion per monomer. Protocatechuate 3,4-dioxygenases form more diverse oligomers.
Probab=38.81 E-value=1.1e+02 Score=31.88 Aligned_cols=22 Identities=36% Similarity=0.493 Sum_probs=20.5
Q ss_pred cCCCeEEEEEEEEccCCCccCC
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTD 346 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~ 346 (1397)
.+|.|..+..+|+|.+|.|+++
T Consensus 7 ~~G~~l~l~G~V~D~~g~pv~~ 28 (146)
T cd00421 7 APGEPLTLTGTVLDGDGCPVPD 28 (146)
T ss_pred CCCCEEEEEEEEECCCCCCCCC
Confidence 5899999999999999999985
No 124
>COG1689 Uncharacterized protein conserved in archaea [Function unknown]
Probab=37.28 E-value=44 Score=36.44 Aligned_cols=41 Identities=17% Similarity=0.194 Sum_probs=32.6
Q ss_pred CCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccc
Q psy5522 955 NGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFP 1000 (1397)
Q Consensus 955 ~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~ 1000 (1397)
++-++-|=|+++.+...++-+. +.+.++++.+-||.||||+
T Consensus 208 PPYiE~t~ya~r~lelL~~k~~-----i~~~~rFI~slqN~nGGFR 248 (274)
T COG1689 208 PPYIEPTFYALRGLELLGGKYC-----ISDHIRFIRSLQNQNGGFR 248 (274)
T ss_pred CCccchHHHHHhHHHHHccCcC-----chHHHHHHHHhhcCCCCee
Confidence 4568889999999887765433 4556789999999999997
No 125
>PF12584 TRAPPC10: Trafficking protein particle complex subunit 10, TRAPPC10; InterPro: IPR022233 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway. To date, two kinds of TRAPP complexes have been studied, TRAPPI and TRAPP II. These complexes differ in subunit composition []. TRAPP I binds vesicles derived from the endoplasmic reticulum bringing them closer to the acceptor membrane. This entry represents a domain which forms part of the TRAPP complex for mediating vesicle docking and fusion in the Golgi apparatus. The fungal version is referred to as Trs130, and an alternative vertebrate alias is TMEM1 [, ].
Probab=36.51 E-value=2.8e+02 Score=28.77 Aligned_cols=70 Identities=14% Similarity=0.190 Sum_probs=47.2
Q ss_pred CceeeeCCEEEEEEEEEcCC-----------CccEEEEEEEeec-CCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCe
Q psy5522 722 PYSVMRGEVVAIPVVVFNYL-----------SQDLVADVTLENV-GQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSG 789 (1397)
Q Consensus 722 Pysv~rGE~~~l~~~VfNyl-----------~~~~~v~v~L~~s-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~ 789 (1397)
|-.++.|+-+.+...|.|.. ++..+....+..+ +... + .+.++..+.+..++.
T Consensus 24 ~~~~~vGqpi~~~l~I~~~~~W~~~~~~~~~~~~~~~~yei~a~~~~Wl-V--------------~Grrrg~f~~~~~~~ 88 (147)
T PF12584_consen 24 PPPCRVGQPIPAELRIKNSRKWSSEDQEESSNEDTEFMYEIVADSDNWL-V--------------SGRRRGVFSLSDGSE 88 (147)
T ss_pred CcceEeCCeEEEEEEEEEcccCCccccccccCCCccEEEEEecCCCcEE-E--------------eccCcceEEecCCCe
Confidence 44678999999999999952 2223333344322 2221 1 233455778888889
Q ss_pred eEEEEEEEeceeeeEEE
Q psy5522 790 STTTFVITPKELGYIGI 806 (1397)
Q Consensus 790 ~tv~f~v~p~~lG~v~i 806 (1397)
..+.+.+.|++.|.+.+
T Consensus 89 ~~~~l~LIPL~~G~L~l 105 (147)
T PF12584_consen 89 HEIPLTLIPLRAGYLPL 105 (147)
T ss_pred EEEEEEEEecccceecC
Confidence 99999999999998764
No 126
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=35.49 E-value=1e+02 Score=27.27 Aligned_cols=44 Identities=18% Similarity=0.425 Sum_probs=29.3
Q ss_pred ceEEEEECCCCCeEEeeeeeccCCce-eeeeeecCCCCCCeeEEEEEE
Q psy5522 133 ALEIFITDGKGNRIKQWTRALTTRGV-FSADLQLSKSPVLGDWNITIN 179 (1397)
Q Consensus 133 ~~~v~I~DP~g~~I~qw~~~~~~~Gi-~~~sf~L~~~~~lG~y~I~v~ 179 (1397)
-..+.|.|++|+.+..........|. -...|.. +..|+|-|+|.
T Consensus 25 d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~---~~~GtYyi~V~ 69 (70)
T PF04151_consen 25 DADLYLYDSNGNSLASYDDSSQSGGNDESITFTA---PAAGTYYIRVY 69 (70)
T ss_dssp SEEEEEEETTSSSCEECCCCTCETTSEEEEEEEE---SSSEEEEEEEE
T ss_pred CeEEEEEcCCCCchhhheecCCCCCCccEEEEEc---CCCEEEEEEEE
Confidence 45688999999877765433322333 3445555 56799999985
No 127
>KOG0365|consensus
Probab=35.38 E-value=85 Score=36.68 Aligned_cols=73 Identities=19% Similarity=0.338 Sum_probs=56.2
Q ss_pred ccHHHHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCC
Q psy5522 917 LTDAIEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVN 996 (1397)
Q Consensus 917 l~~~~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~ 996 (1397)
+.+.+++|-..||..++.++-. .|..- +.+..||-=|.+.++..... .+|.+..++++.+|.+-|.+.
T Consensus 69 ~~~~~r~kH~~YL~~~Lr~Lp~------~~~~L----DASR~Wm~YWil~sl~lL~~--~~dd~v~~~~i~fL~~c~~Pe 136 (423)
T KOG0365|consen 69 LEPIFRQKHLMYLDKMLRQLPS------QYTCL----DASRPWMCYWILNSLALLDE--WLDDDVKENAIDFLFTCQGPE 136 (423)
T ss_pred HHHHHHHHHHHHHHHHHHhCCC------ccccc----ccCcchhHHHHHHHHHHhcC--cCCHHHHHHHHHHHHhcCCCC
Confidence 4566688888899888755321 23322 24568999999999888775 589999999999999999999
Q ss_pred Ccccc
Q psy5522 997 GSFPE 1001 (1397)
Q Consensus 997 G~F~s 1001 (1397)
|||..
T Consensus 137 GGfgG 141 (423)
T KOG0365|consen 137 GGFGG 141 (423)
T ss_pred CCCCC
Confidence 99975
No 128
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=34.36 E-value=62 Score=26.75 Aligned_cols=31 Identities=16% Similarity=0.245 Sum_probs=25.1
Q ss_pred cccCCCeEEEEEEEEccCCCccCCCCceEEEEE
Q psy5522 323 YFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRH 355 (1397)
Q Consensus 323 ~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~ 355 (1397)
..|.|....+.|.+.|.+|+|+++. .+.|..
T Consensus 8 kaK~Ge~I~ltVt~kda~G~pv~n~--~f~l~r 38 (47)
T PF05688_consen 8 KAKVGETIPLTVTVKDANGNPVPNA--PFTLTR 38 (47)
T ss_pred heecCCeEEEEEEEECCCCCCcCCc--eEEEEe
Confidence 4578999999999999999999954 455543
No 129
>COG1657 SqhC Squalene cyclase [Lipid metabolism]
Probab=32.67 E-value=1e+02 Score=38.51 Aligned_cols=111 Identities=21% Similarity=0.223 Sum_probs=66.5
Q ss_pred hhHHHHHHHHHhhcC---------CCCchHHHHHHHHHHhhCCCCc----h-HHHHHHHhhcccccCC-cccccccCCCC
Q psy5522 1158 NTINKAVDYIVKNLA---------GTEDAYAIAICSYALHLAQHPV----K-DVAFNLLESKAHNEDG-KKWWKRAERPE 1222 (1397)
Q Consensus 1158 ~~~~~a~~~l~~~~~---------~~~~~y~~al~aYal~la~~~~----~-~~~~~~L~~~a~~~~~-~~~W~~~~~~~ 1222 (1397)
..+++|+.||-+... .+...|.-+++.+|+++.|... . +++..-|..+-..++| .-+|.....
T Consensus 352 ~~i~~a~e~LL~~Q~~~GsW~g~w~v~~iY~~s~a~~~l~~~g~~~~~~~~v~~~~~~l~~~~~~~~Gw~e~~~~~~~-- 429 (517)
T COG1657 352 QPIERALEWLLSDQEPDGSWYGRWGVCYIYGTSGALSALALVGETDENEVLVRKLISWLVSKQMPDGGWGEAKEAISD-- 429 (517)
T ss_pred CcccHHHhhhhhhccccCceeeEEEEEEEEehhhhhhhhhccCccccchHHHHHHHHHhhhccccCCCcccccccccc--
Confidence 347788888865432 2346899999999999999887 2 4455555555444443 233433221
Q ss_pred CCCCCCCCCCCcccchhhHHHHHHhhccC--CcccchhhhhhhhhcccCCCCcc
Q psy5522 1223 DKKNPWAQVPNSVDVEMTSYALLSYLDRG--LVEDTLPILTWLVTQQNDQGGFA 1274 (1397)
Q Consensus 1223 ~~~~~~~~~~~s~~vE~t~Y~lL~~~~~~--~~~~a~~i~~Wl~~q~n~~Ggf~ 1274 (1397)
+.+....+--+=-|+.||..++... +-+.-.++++-|..+|-+.|-..
T Consensus 430 ----~~~~~t~~sl~~~~~wal~~~~~a~~~~~~~i~~~~~~~~~~~~~~g~~~ 479 (517)
T COG1657 430 ----PVYTGTESSLLVQTNWALIALLTALEPNQEAIKPGINLLVSDQEPDGSWR 479 (517)
T ss_pred ----cccccccchhhcchhHHHHHHHHhcccchhhhcccccccccCcCCCCccc
Confidence 1111112222235778888887532 33446778888888888777543
No 130
>PF01122 Cobalamin_bind: Eukaryotic cobalamin-binding protein; InterPro: IPR002157 Cobalamin (Cbl or vitamin B12) is only accessible through diet in mammals. Absorption, plasma transport and cellular uptake of Cbl in mammals involves three Cbl-transporting proteins, which are listed below in order of increasing Cbl-specificity: Haptocorrin (cobalophilin), which binds Cbl and Cbl-derivatives such as cobinamide; it may play a role in preventing the absorption of cobalamin analogues produced by bacteria. Transcobalamin (TC), which transport Cbl from blood to cells. Intrinsic factor (IF), which promotes Cbl absorption in the ileum by specific receptor-mediated endocytosis. The structure of TC reveals a two-domain structure, an N-terminal alpha(6)-alpha(6) barrel, and a smaller C-terminal domain []. Many interactions between Cbl and its binding site in the interface of the two domains are conserved among the other Cbl transporters. Specificity for Cbl between the different transporters may reside in a beta-hairpin motif found in the smaller C-terminal domain []. ; GO: 0031419 cobalamin binding, 0015889 cobalamin transport; PDB: 3KQ4_A 2PMV_A 2BB5_A 2V3N_A 2BBC_A 2BB6_D 2V3P_A.
Probab=31.46 E-value=96 Score=36.57 Aligned_cols=87 Identities=14% Similarity=0.178 Sum_probs=57.8
Q ss_pred CCchHHHHHHHHHHhhCCCCchHHHHHHHhhcccccCCcccccccCCCCCCCCCCCCCCCcccchhhHHHHHHhh--ccC
Q psy5522 1174 TEDAYAIAICSYALHLAQHPVKDVAFNLLESKAHNEDGKKWWKRAERPEDKKNPWAQVPNSVDVEMTSYALLSYL--DRG 1251 (1397)
Q Consensus 1174 ~~~~y~~al~aYal~la~~~~~~~~~~~L~~~a~~~~~~~~W~~~~~~~~~~~~~~~~~~s~~vE~t~Y~lL~~~--~~~ 1251 (1397)
.++-|-.+|..-||.+.+.........+|-..+..+ |+.....+|-+.|-|.||+- .+.
T Consensus 142 ~TnyYq~sL~vLALCv~~~~~~~~~v~kL~~~~~~~-------------------~~~~~~~sVDT~AmA~LALtCv~~~ 202 (326)
T PF01122_consen 142 LTNYYQYSLGVLALCVHNKRVSLSVVAKLLKAENHN-------------------FYHGSQFSVDTGAMAVLALTCVKNS 202 (326)
T ss_dssp SSGHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHSS-------------------TSS-STCHHHHHHHHHHHHHHHHTT
T ss_pred CCcccchHHHHHHHHccCCCcCHHHHHHHHHHHHhh-------------------cccCCCCCccHHHHHHHHHHHHhcc
Confidence 456799999999999999877655556655443311 12223467889999999965 221
Q ss_pred ---------Ccccc-hhhhhhhhhcccCCCCccccccC
Q psy5522 1252 ---------LVEDT-LPILTWLVTQQNDQGGFASTQMS 1279 (1397)
Q Consensus 1252 ---------~~~~a-~~i~~Wl~~q~n~~Ggf~stQdT 1279 (1397)
.+..| ..++++|.+||.+.|-|...=-|
T Consensus 203 ~~~~~~~~~~i~~~i~~~~~kIl~~q~~~G~~GNiyST 240 (326)
T PF01122_consen 203 NPNGPELRRRIQQAIRSLVEKILSQQKPNGLFGNIYST 240 (326)
T ss_dssp TSTTGGGHHHHHHHHHHHHHHHHHTB-TTS-BSSTTTH
T ss_pred CcCcHhHHHHHHHHHHHHHHHHHHhcCCCCcccchhhH
Confidence 23333 34889999999999999876666
No 131
>PF06280 DUF1034: Fn3-like domain (DUF1034); InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=31.14 E-value=2.4e+02 Score=27.66 Aligned_cols=66 Identities=18% Similarity=0.240 Sum_probs=34.9
Q ss_pred CCEEEEEEEEEcCCCccEEEEEEEeec--------CCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEe
Q psy5522 728 GEVVAIPVVVFNYLSQDLVADVTLENV--------GQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITP 798 (1397)
Q Consensus 728 GE~~~l~~~VfNyl~~~~~v~v~L~~s--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p 798 (1397)
|......++|.|+.++.+..++.-... ++......... . .........+++|+||+++.+.+.|.+
T Consensus 7 ~~~~~~~itl~N~~~~~~ty~~~~~~~~t~~~~~~~~~~~~~~~~~--~---~~~~~~~~~~vTV~ag~s~~v~vti~~ 80 (112)
T PF06280_consen 7 GNKFSFTITLHNYGDKPVTYTLSHVPVLTDKTDTEEGYSILVPPVP--S---ISTVSFSPDTVTVPAGQSKTVTVTITP 80 (112)
T ss_dssp -SEEEEEEEEEE-SSS-EEEEEEEE-EEEEEE--ETTEEEEEEEE---------EEE---EEEEE-TTEEEEEEEEEE-
T ss_pred CCceEEEEEEEECCCCCEEEEEeeEEEEeeEeeccCCccccccccc--c---eeeEEeCCCeEEECCCCEEEEEEEEEe
Confidence 566889999999999998877765510 11111000000 0 000223456889999999998888886
No 132
>PF07177 Neuralized: Neuralized; InterPro: IPR006573 NEUZ is a domain of unknown function found in neuralized proteins, i.e. proteins involved in the specification of the neuroblast during cellular differentiation. ; PDB: 2YUE_A 2E63_A.
Probab=30.84 E-value=1e+02 Score=27.67 Aligned_cols=28 Identities=7% Similarity=0.124 Sum_probs=19.2
Q ss_pred cEEEEEcCCCcccCCceeEEEEEEEcCCC
Q psy5522 98 YSVFIQLDKAIYKPGDLVRFRAIVLNSHL 126 (1397)
Q Consensus 98 ~~vfIqTDKpIYkPGqtV~fRvv~ld~~l 126 (1397)
....++|+||+ ++||++++|+.-++...
T Consensus 29 ~~giVFS~rPl-~~~E~~~v~I~~~~~~w 56 (69)
T PF07177_consen 29 NNGIVFSSRPL-RIGEKFEVRIDEVEPSW 56 (69)
T ss_dssp SS-EEEESS-B--TT-EEEEEEEEE-SSS
T ss_pred CceEEEecCCc-cCCCEEEEEEEecCCCc
Confidence 34789999998 69999999998887643
No 133
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=30.76 E-value=83 Score=26.57 Aligned_cols=37 Identities=27% Similarity=0.480 Sum_probs=27.4
Q ss_pred cCCCcccCCceeEEEEEEEcCCCccCCCcceEEEEEC--CCCC
Q psy5522 104 LDKAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITD--GKGN 144 (1397)
Q Consensus 104 TDKpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~D--P~g~ 144 (1397)
.||...+|||+|.|++...+..-.|+ -.|.|.| |.|-
T Consensus 3 ~d~~~~~~Gd~v~Yti~v~N~g~~~a----~~v~v~D~lP~g~ 41 (53)
T TIGR01451 3 VDKTVATIGDTITYTITVTNNGNVPA----TNVVVTDILPSGT 41 (53)
T ss_pred cCccccCCCCEEEEEEEEEECCCCce----EeEEEEEcCCCCC
Confidence 58999999999999999988765554 1355555 4443
No 134
>KOG0497|consensus
Probab=30.38 E-value=54 Score=41.75 Aligned_cols=134 Identities=20% Similarity=0.241 Sum_probs=76.7
Q ss_pred ecCCCCcCCCCcEEEEEEEcccccCcccCcchhhccCCCchhhhh-c-ccchHHHHHHHHHHhccc---cHHHHHHHHHH
Q psy5522 854 LDMPKNIVPGSEHVEVSAVGDLLGPSIPNLANLIKMPFGCGEQNM-L-NFVPNIVVLEYLKKTYQL---TDAIEAKASRY 928 (1397)
Q Consensus 854 l~~P~~~v~gs~~~~vsv~gd~l~~~~~~l~~Ll~~P~GCgEQ~m-~-~~~p~i~~l~yL~~~~~l---~~~~~~~a~~~ 928 (1397)
|..|....||+.+..|.- .+-.+++-+.+ .|+|+|...=+- + -+...-++++=|.+.++- -+++ ++|..|
T Consensus 572 L~~F~k~~p~~r~~Ei~~---~i~~av~~ie~-~Q~~DGSWyGsWgvCFtY~t~Fa~~gl~aaGkty~nc~~i-rka~~F 646 (760)
T KOG0497|consen 572 LVYFHKLFPGHRKKEIEK---SIEKAVEFIEK-LQLPDGSWYGSWGVCFTYGTWFALRGLAAAGKTYENCEAI-RKACDF 646 (760)
T ss_pred HHhhcccCccHHHHHHHH---HHHHHHHHHHH-cCCCCCcccchhhHHHHHHHHHhcchhhhcchhhhccHHH-HHHHHH
Confidence 455666666665544331 11122333333 477888655332 1 112222333333333321 2223 355554
Q ss_pred HHhccccceeeecCCCCccccCCC----------CCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCc
Q psy5522 929 LETGYQQELTYRRPDGSFSAFGTT----------DPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGS 998 (1397)
Q Consensus 929 l~~gy~~~l~y~~~dGsfs~~~~~----------~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~ 998 (1397)
||+.|.+||+|+--..+ ..+.++--|++++.++..+++. ..|+-.+.++++-|.+.|.++|.
T Consensus 647 -------ll~~Q~~~GGWgEs~lscp~~~Yi~~~gn~s~vv~T~wAlm~Li~~~q~-~rd~~P~hr~ak~linsQ~~nGd 718 (760)
T KOG0497|consen 647 -------LLSKQNPDGGWGESYLSCPEKRYIPLEGNKSNVVQTAWALMALIMAGQA-ERDPLPLHRAAKVLINSQLENGD 718 (760)
T ss_pred -------HHhhhcccCCCccccccCccccccccccccccchhHHHHHHHHHhcCCc-ccccchHHHHHHHHHhcccccCC
Confidence 56689999986522111 0134566799999999987653 45666789999999999999999
Q ss_pred cc
Q psy5522 999 FP 1000 (1397)
Q Consensus 999 F~ 1000 (1397)
|.
T Consensus 719 fp 720 (760)
T KOG0497|consen 719 FP 720 (760)
T ss_pred cc
Confidence 96
No 135
>PF07233 DUF1425: Protein of unknown function (DUF1425); InterPro: IPR010824 This family consists of several hypothetical bacterial proteins of around 125 residues in length. Several members of this family are described as putative lipoproteins and are often known as YcfL. The function of this family is unknown.; PDB: 3O0L_A.
Probab=30.15 E-value=3e+02 Score=26.27 Aligned_cols=62 Identities=11% Similarity=0.048 Sum_probs=37.9
Q ss_pred cCCceeeeCCEEEEEEEEEcCCCccEEEEEEEe--ecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEE
Q psy5522 720 DLPYSVMRGEVVAIPVVVFNYLSQDLVADVTLE--NVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTF 794 (1397)
Q Consensus 720 ~LPysv~rGE~~~l~~~VfNyl~~~~~v~v~L~--~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f 794 (1397)
.=|.....++.....+.+.|-.+..+.+.-.+. ..++++..+. ...-+.+.|.+++..++.-
T Consensus 15 ~~~~~~~~~g~~~~~~~l~N~~~~~~~l~Yrf~WyD~~G~~v~~~-------------~~~w~~~~l~~~~~~~l~~ 78 (94)
T PF07233_consen 15 ENPSTSRVNGLLRAQATLSNKSSKPLTLQYRFYWYDKQGLEVDPE-------------QSPWQSLTLPGGQTVTLSA 78 (94)
T ss_dssp EEEEEEECCCEEEEEEEEEE-SSS-EEEEEEEEEE-TTS-EE--T-------------T---EEEEE-TT-EEEEEE
T ss_pred eccEEEeeCCeEEEEEEEEECCCCcEEEEEEEEEECCCCCCcCCC-------------CCCCEEEEEcCCCEEEEEE
Confidence 345566779999999999999999877776665 6668875322 1235688999888765543
No 136
>PF09608 Alph_Pro_TM: Putative transmembrane protein (Alph_Pro_TM); InterPro: IPR019088 This entry consists of predicted transmembrane proteins of about 270 amino acids. They are found predominantly, though not exclusively, in alphaproteobacteria, generally only once in each genome.
Probab=29.90 E-value=1.1e+02 Score=34.45 Aligned_cols=102 Identities=13% Similarity=0.232 Sum_probs=60.7
Q ss_pred eEeeEEEEEEecccEEEEEcCCCccc-----------CCce-eEEEEEEEcCCCc-cC-----------CCcceEEEEEC
Q psy5522 85 FYNSTGLVYVHKSYSVFIQLDKAIYK-----------PGDL-VRFRAIVLNSHLK-PS-----------VTGALEIFITD 140 (1397)
Q Consensus 85 f~~~~~v~~~~~~~~vfIqTDKpIYk-----------PGqt-V~fRvv~ld~~l~-P~-----------~~~~~~v~I~D 140 (1397)
+.|..++.|..-...--|-|.||+-. .|=. ++.++-....... +. ....-.++-.+
T Consensus 66 WiN~~~~~f~~vPsfY~vassrPl~~i~~~~~~~~~~ig~~~l~~~i~~~~~~~~~~~~~~~f~~alirlk~~~gLY~~~ 145 (236)
T PF09608_consen 66 WINTDSVEFDNVPSFYAVASSRPLDEIASPEELLRLGIGLDNLRLRIRRVGSDQSDPDEQDDFREALIRLKERAGLYQEN 145 (236)
T ss_pred EEecCceEEccCCchheeeecCCHHHhCCHHHHHhhCcCcchhhhcccccCcccCChhhHHHHHHHHHHHHHhCCCceec
Confidence 46777888888888888999999543 3322 1111000001000 00 01122334445
Q ss_pred CCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEc--C---eEEEEEEEEee
Q psy5522 141 GKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVL--D---QKFTKRFTVAE 193 (1397)
Q Consensus 141 P~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~--~---~~~~~~F~VeE 193 (1397)
+.+..+ .+.+.|..+++||.+.+.|.|++++..- | ...+..++|+.
T Consensus 146 ~~~V~~-------~~~~lFra~i~LPanvp~G~Y~v~v~l~rdG~vv~~~~~~l~V~K 196 (236)
T PF09608_consen 146 EGGVQF-------LEGTLFRARIPLPANVPPGDYTVRVYLFRDGQVVASQETPLRVRK 196 (236)
T ss_pred CCeEEE-------cCCCeEEEEeEcCCCCCcceEEEEEEEEECCEEEEEEeeEEEEEE
Confidence 554333 4568999999999999999999999873 3 34566677654
No 137
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=28.74 E-value=1.2e+02 Score=34.54 Aligned_cols=51 Identities=14% Similarity=0.292 Sum_probs=37.0
Q ss_pred EEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEc--C---eEEEEEEEEee
Q psy5522 136 IFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVL--D---QKFTKRFTVAE 193 (1397)
Q Consensus 136 v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~--~---~~~~~~F~VeE 193 (1397)
++-.|+.|.. -.+.++|..++.||.+.+.|.|++++..- | ...+..|+|++
T Consensus 166 LY~~~~~gV~-------~~~~~LFra~i~LPAnvp~G~Y~v~v~L~r~G~vv~~~~t~l~V~K 221 (261)
T TIGR02186 166 LYREDAGGVT-------FISPTLFRATLRLPANVPNGTHEVRAYLFRGGVFIARTELALEIVK 221 (261)
T ss_pred CeeecCCeEE-------EcCCceEEEeeecCCCCCCceEEEEEEEEeCCEEEEEEEeEEEEEE
Confidence 4445555532 24568899999999999999999999873 3 34566777754
No 138
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=28.57 E-value=3e+02 Score=28.16 Aligned_cols=65 Identities=14% Similarity=0.060 Sum_probs=36.6
Q ss_pred EEEEEEEccCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEeCCCCCceeEEEEEEEe
Q psy5522 331 TAYMKLTHHDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYYPPANENVTTLGIEAEY 398 (1397)
Q Consensus 331 ~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~ 398 (1397)
.+.++++|++|.|+.+. .+++.+...-+........++..++|.....+..+ ..|.+.++|....
T Consensus 70 ~~~i~~~d~~g~~~~~~--~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~-~~G~W~l~l~~~~ 134 (146)
T PF05751_consen 70 SLTIRLTDPNGAPVSGA--KLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLL-KKGRWYLRLDWEP 134 (146)
T ss_pred eEEEEEEcCCCCcCcCc--eEEEEEECCCCccCCeeEEeeECCCceEEEEcCCC-CCccEEEEEEEec
Confidence 55777888999998743 45554422111112334455567777777666654 3456566664433
No 139
>cd05822 TLP_HIUase HIUase (5-hydroxyisourate hydrolase) catalyzes the second step in a three-step ureide pathway in which 5-hydroxyisourate (HIU), a product of the uricase (urate oxidase) reaction, is hydrolyzed to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU). HIUase has high sequence similarity with transthyretins and is a member of the transthyretin-like protein (TLP) family. HIUase is distinguished from transthyretins by a conserved signature motif at its C-terminus that forms part of the active site. In HIUase, this motif is YRGS, while transthyretins have a conserved TAVV sequence in the same location. Most HIUases are cytosolic but in plants and slime molds, they are peroxisomal based on the presence of N-terminal periplasmic localization sequences. HIUase forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located betw
Probab=27.94 E-value=1.9e+02 Score=28.65 Aligned_cols=60 Identities=20% Similarity=0.301 Sum_probs=39.5
Q ss_pred EEcC-CCccCCCcceEEEEECCCC-CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 121 VLNS-HLKPSVTGALEIFITDGKG-NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 121 ~ld~-~l~P~~~~~~~v~I~DP~g-~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+||. ..+|+.+..+.+.-.+.++ ..|.+ .+++..|-+..-+.-.+....|.|+|+...++
T Consensus 7 VLDt~~G~PAagv~V~L~~~~~~~~~~i~~--~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~ 68 (112)
T cd05822 7 VLDTATGKPAAGVAVTLYRLDGNGWTLLAT--GVTNADGRCDDLLPPGAQLAAGTYKLTFDTGA 68 (112)
T ss_pred EEeCCCCcccCCCEEEEEEecCCCeEEEEE--EEECCCCCccCcccccccCCCeeEEEEEEhhh
Confidence 4563 5688877788887776654 23443 46788888754333235566799999988765
No 140
>cd05790 S1_Rrp40 S1_Rrp40: Rrp40 S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
Probab=27.89 E-value=84 Score=29.57 Aligned_cols=32 Identities=31% Similarity=0.508 Sum_probs=27.8
Q ss_pred CCcccCCceeEEEEEEEcCCCccCCCcceEEEEECCCC
Q psy5522 106 KAIYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGKG 143 (1397)
Q Consensus 106 KpIYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~g 143 (1397)
||..++||-|.-||...+.++.| .++..|+.|
T Consensus 47 rp~L~~GDlV~ArV~~~~~~~~~------eLtc~~~~~ 78 (86)
T cd05790 47 RPNLNVGDLVYARVVKANRDMEP------ELSCVDSSG 78 (86)
T ss_pred cccCCCCCEEEEEEEecCCCCCe------EEEEeCCCC
Confidence 99999999999999998877655 588888876
No 141
>KOG1948|consensus
Probab=27.56 E-value=1.6e+03 Score=30.01 Aligned_cols=57 Identities=12% Similarity=0.190 Sum_probs=32.6
Q ss_pred EeeCCCeeEEEEEeCCCCCceeEEEEEEEecCcceEEEEEEeecCCCCCeEEEEecCCCCCCCCeEEEE
Q psy5522 369 YKLDRNGMIKLVYYPPANENVTTLGIEAEYLDIKEWFSTISASESPSNSFIQAALLTQNPKVNKDVELE 437 (1397)
Q Consensus 369 ~ttd~~G~~~f~i~~~~~~~~~~l~l~a~~~~~~~~~~~v~~~~S~s~s~L~i~~~~~~~~~G~~~~v~ 437 (1397)
..+|++|..+|.-- |.+ . +.+.+.|.++... +.+ || +-|.+...++.+++-+..+|.
T Consensus 258 ~vsd~~G~fsfksv-PsG--k--Y~l~a~y~ge~~~-fdv----SP--~~l~v~Vehd~lqi~~ef~vt 314 (1165)
T KOG1948|consen 258 GVSDPRGRFSFKSV-PSG--K--YYLAASYVGEPKS-FDV----SP--NPLKVVVEHDHLQIASEFRVT 314 (1165)
T ss_pred EEEcCCceEEEEEc-CCC--C--EEEEEEecCCceE-EEe----CC--CceeEEEeccceeccceeEEE
Confidence 45799998777643 333 4 6678888876422 222 33 344555555556655555544
No 142
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=27.20 E-value=5.1e+02 Score=24.04 Aligned_cols=75 Identities=17% Similarity=0.226 Sum_probs=42.0
Q ss_pred EecCCCCCCCCeEEEEEEe----cCCcceEEEEEEeCCeEEEEEEE-EecCCceeEEEEEeeCCCCCceeEEEEEEEecC
Q psy5522 422 ALLTQNPKVNKDVELEINS----TAPLKYISYQVLGRGDVIMADTI-TVPGNKMSTVIRFLATYAMAPTAHVIVQYVRED 496 (1397)
Q Consensus 422 ~~~~~~~~~G~~~~v~v~s----~~~~~~~~ylV~srG~Iv~~~~~-~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~~~ 496 (1397)
.........|++..|.+.= ......+...+...|..+....+ .+..+ ....+.|.+... .|+.+-+...++++
T Consensus 9 ~~~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g-~~~~v~~~~~~~-~~G~~~i~~~iD~~ 86 (101)
T PF07705_consen 9 TVSPSNVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPG-ESETVTFTWTPP-SPGSYTIRVVIDPD 86 (101)
T ss_dssp EEC-SEEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TT-EEEEEEEEEE-S-S-CEEEEEEEESTT
T ss_pred eeCCCcccCCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCC-cEEEEEEEEEeC-CCCeEEEEEEEeeC
Confidence 3445566789988887751 12223355666777877755555 45554 455666666666 66655555556654
Q ss_pred Ce
Q psy5522 497 GE 498 (1397)
Q Consensus 497 ge 498 (1397)
+.
T Consensus 87 n~ 88 (101)
T PF07705_consen 87 ND 88 (101)
T ss_dssp TS
T ss_pred Cc
Confidence 43
No 143
>PF01273 LBP_BPI_CETP: LBP / BPI / CETP family, N-terminal domain; InterPro: IPR017942 This entry represents the N-terminal domain found in several lipid-binding serum glycoproteins. The N- and C-terminal domains share a similar two-layer alpha/beta structure, but they show little sequence identity. Proteins containing this N-terminal domain include: Bactericidal permeability-increasing protein (BPI) Lipopolysaccharide-binding protein (LBP) Cholesteryl ester transfer protein (CETP) Phospholipid transfer protein (PLTP) Palate, lung and nasal epithelium carcinoma-associated protein (PLUNC) Bactericidal permeability-increasing protein (BPI) is a potent antimicrobial protein of 456 residues that binds to and neutralises lipopolysaccharides from the outer membrane of Gram-negative bacteria []. BPI contains two domains that adopt the same structural fold, even though they have little sequence similarity []. Lipopolysaccharide-binding protein (LBP) is an endotoxin-binding protein that is closely related to, and functions in a co-ordinated manner with BPI to facilitate an integrated host response to invading Gram-negative bacteria []. Cholesteryl ester transfer protein (CETP) is a glycoprotein that facilitates the transfer of lipids (cholesteryl esters and triglycerides) between the different lipoproteins that transport them through plasma, including HDL, LDL, VLDL and chylomicrons. These lipoproteins shield the lipids from water by encapsulating them within a coating of polar lipids and proteins []. Phospholipid transfer protein (PLTP) exchanges phospholipids between lipoproteins and remodels high-density lipoproteins (HDLs) []. Palate, lung and nasal epithelium carcinoma-associated protein (PLUNC) is a potential host defensive protein that is secreted from the submucosal gland to the saliva and nasal lavage fluid. PLUNC appears to be a secreted product of neutrophil granules that participates in an aspect of the inflammatory response that contributes to host defence []. Short palate, lung and nasal epithelium clone 1 (SPLUNC1) may bind the lipopolysaccharide of Gram-negative nanobacteria, thereby playing an important role in the host defence of nasopharyngeal epithelium [].; GO: 0008289 lipid binding; PDB: 1EWF_A 1BP1_A 2OBD_A.
Probab=27.15 E-value=4.4e+02 Score=27.46 Aligned_cols=46 Identities=11% Similarity=0.229 Sum_probs=36.0
Q ss_pred EEEEEEeeecCCcEEEEEeCCCceEeeCCeEEEEEEEEEecC-cccc
Q psy5522 186 TKRFTVAEYVLPKFEVNVNVPPHATFKDSKVVISVNAKYTYG-KPVK 231 (1397)
Q Consensus 186 ~~~F~VeEYvLPkFeV~v~~p~~i~~~~~~~~v~v~AkYtyG-kPV~ 231 (1397)
-...+|.++.+|...++..++..+...-..+.+++.++|.+. .+++
T Consensus 41 i~~i~I~~~~~~~~~l~~~p~~~~~~~~~~~~~~~~~~~~~~~~~i~ 87 (164)
T PF01273_consen 41 ISNIKISNFRLPQSSLELSPDGGIVLSIIPASASLSGNWPYKGGFIK 87 (164)
T ss_dssp EEEEEEEEEEEEEEEEEEETTTEEEEEEEEEEEEEEEEEEEEECCEE
T ss_pred EEeEEEEEeecCCccceECCCCceEEEEeeeEEEEEEEEecCCChhh
Confidence 446889999999999999999887666667788888877776 4444
No 144
>PF12863 DUF3821: Domain of unknown function (DUF3821); InterPro: IPR024277 This is a domain largely confined to sequences from Methanomicrobiales. It is found in putative lipases but the function is unknown.
Probab=26.20 E-value=1.9e+02 Score=32.00 Aligned_cols=39 Identities=26% Similarity=0.355 Sum_probs=28.1
Q ss_pred ccCCceeEEEEEEEcCCCccC--------CCcceEEEEECCCCCeEEeee
Q psy5522 109 YKPGDLVRFRAIVLNSHLKPS--------VTGALEIFITDGKGNRIKQWT 150 (1397)
Q Consensus 109 YkPGqtV~fRvv~ld~~l~P~--------~~~~~~v~I~DP~g~~I~qw~ 150 (1397)
-.+|+-|.||+ +.||-.. ....+.|.|++|.|.....-.
T Consensus 102 V~~G~~v~FrI---~tNL~~~~~R~g~~~~~~~v~I~V~~P~G~~~t~L~ 148 (209)
T PF12863_consen 102 VPRGDNVNFRI---DTNLYSIFQRGGYTPGDGPVDIKVTTPSGATYTSLF 148 (209)
T ss_pred eccCCeEEEEE---cccHHHHhhcCCCCCCcceEEEEEeCCCCcEEEEEE
Confidence 45789999998 6666332 123499999999998776544
No 145
>PF11974 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: IPR021868 This is the N-terminal MG1 domain from alpha-2-macroglobulin [].
Probab=25.97 E-value=1.3e+02 Score=28.83 Aligned_cols=45 Identities=18% Similarity=0.171 Sum_probs=30.6
Q ss_pred EEEEEEc-cCCCccCCCCceEEEEEeeeccccccceeeEeeCCCeeEEEEEe
Q psy5522 332 AYMKLTH-HDGTPVTDNNNMVQVRHGFSYDESKYEANQYKLDRNGMIKLVYY 382 (1397)
Q Consensus 332 ~~v~v~~-~dG~P~~~~~~~V~v~~~~~~~~~~~~~~~~ttd~~G~~~f~i~ 382 (1397)
..|.+++ .+|+|+++. .|++.- + ..+....+.+||++|.+.|...
T Consensus 15 ~~v~v~~L~tg~Pv~ga--~V~l~~---~-~~~~~l~~g~TD~~G~a~~~~~ 60 (97)
T PF11974_consen 15 LLVWVTSLSTGKPVAGA--EVELYD---S-RNGQVLASGKTDADGFASFDST 60 (97)
T ss_pred EEEEEeeCCCCCccCCC--EEEEEE---C-CCCcEeeeeeeCCCceEEecCC
Confidence 3555665 489999953 565532 0 1134667889999999998875
No 146
>cd03459 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=25.65 E-value=2e+02 Score=30.38 Aligned_cols=49 Identities=16% Similarity=0.273 Sum_probs=0.0
Q ss_pred cCCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc-------------------ccceeeEeeCCCeeEEEE
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES-------------------KYEANQYKLDRNGMIKLV 380 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~-------------------~~~~~~~ttd~~G~~~f~ 380 (1397)
.+|.+..+..+|+|.+|+|++ +..|.+ |+-. -.......||++|...|.
T Consensus 11 ~~G~~l~l~g~V~D~~g~Pv~--~A~vei-----Wqad~~G~Y~~~~~~~~~~~~~~f~~rG~~~Td~~G~~~f~ 78 (158)
T cd03459 11 AIGERIILEGRVLDGDGRPVP--DALVEI-----WQADAAGRYRHPRDSHRAPLDPNFTGFGRVLTDADGRYRFR 78 (158)
T ss_pred CCCcEEEEEEEEECCCCCCCC--CCEEEE-----EccCCCCccCCccCCcccccCCCCCceeEEEECCCCcEEEE
No 147
>TIGR02439 catechol_proteo catechol 1,2-dioxygenase, proteobacterial. Members of this family known so far are catechol 1,2-dioxygenases of the Proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the Actinobacteria, which are quite similar to each other and resolved by separate models. This enzyme catalyzes intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogs 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.
Probab=25.44 E-value=1.7e+02 Score=33.94 Aligned_cols=22 Identities=27% Similarity=0.472 Sum_probs=20.2
Q ss_pred cCCCeEEEEEEEEccCCCccCC
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTD 346 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~ 346 (1397)
.+|.|..+..+|+|.+|+|+++
T Consensus 124 ~~G~pl~v~G~V~D~~G~PI~g 145 (285)
T TIGR02439 124 DPGETLFLHGQVTDADGKPIAG 145 (285)
T ss_pred CCCcEEEEEEEEECCCCCCcCC
Confidence 3699999999999999999995
No 148
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=25.42 E-value=1.4e+02 Score=24.74 Aligned_cols=35 Identities=17% Similarity=0.233 Sum_probs=29.5
Q ss_pred cccCCceeEEEEEEEcCCCccCCCcceEEEEECCC
Q psy5522 108 IYKPGDLVRFRAIVLNSHLKPSVTGALEIFITDGK 142 (1397)
Q Consensus 108 IYkPGqtV~fRvv~ld~~l~P~~~~~~~v~I~DP~ 142 (1397)
.-|-||++.++|-+.|.+..|+.+.++.|..-|..
T Consensus 8 kaK~Ge~I~ltVt~kda~G~pv~n~~f~l~r~~~~ 42 (47)
T PF05688_consen 8 KAKVGETIPLTVTVKDANGNPVPNAPFTLTRGDAK 42 (47)
T ss_pred heecCCeEEEEEEEECCCCCCcCCceEEEEecCcc
Confidence 34789999999999999999998888888776654
No 149
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=25.21 E-value=5.9e+02 Score=24.10 Aligned_cols=68 Identities=15% Similarity=0.062 Sum_probs=46.4
Q ss_pred eeeeCCEEEEEEEEEcCCCccEEEEEEEee--cCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEE-ece
Q psy5522 724 SVMRGEVVAIPVVVFNYLSQDLVADVTLEN--VGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVIT-PKE 800 (1397)
Q Consensus 724 sv~rGE~~~l~~~VfNyl~~~~~v~v~L~~--s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~-p~~ 800 (1397)
.+..|+.....++|.|......+.++.... ++.|.+.+. .-.|++|....+...+. +..
T Consensus 15 ~v~~g~~~~~~v~l~N~s~~p~~f~v~~~~~~~~~~~v~~~------------------~g~l~PG~~~~~~V~~~~~~~ 76 (102)
T PF14874_consen 15 NVFVGQTYSRTVTLTNTSSIPARFRVRQPESLSSFFSVEPP------------------SGFLAPGESVELEVTFSPTKP 76 (102)
T ss_pred EEccCCEEEEEEEEEECCCCCEEEEEEeCCcCCCCEEEECC------------------CCEECCCCEEEEEEEEEeCCC
Confidence 456799999999999999998888887654 223333211 12488888888888888 455
Q ss_pred eeeEEEEEE
Q psy5522 801 LGYIGIKVT 809 (1397)
Q Consensus 801 lG~v~itV~ 809 (1397)
+|...-++.
T Consensus 77 ~g~~~~~l~ 85 (102)
T PF14874_consen 77 LGDYEGSLV 85 (102)
T ss_pred CceEEEEEE
Confidence 676554443
No 150
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=25.15 E-value=1.9e+02 Score=31.30 Aligned_cols=48 Identities=19% Similarity=0.367 Sum_probs=0.0
Q ss_pred CCCeEEEEEEEEccCCCccCCCCceEEEEEeeecccc----------cc--------ceeeEeeCCCeeEEEE
Q psy5522 326 PGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGFSYDES----------KY--------EANQYKLDRNGMIKLV 380 (1397)
Q Consensus 326 pG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~~~~~~----------~~--------~~~~~ttd~~G~~~f~ 380 (1397)
+|.+..+..+|+|.+|+|++ +..|.+ |.-. .. ......||++|...|.
T Consensus 33 ~G~~l~l~G~V~D~~g~Pi~--gA~Vei-----Wqad~~G~Y~~~~~~~~~~~~~f~~rGr~~TD~~G~y~F~ 98 (185)
T cd03463 33 AGERITLEGRVYDGDGAPVP--DAMLEI-----WQADAAGRYAHPADSRRRLDPGFRGFGRVATDADGRFSFT 98 (185)
T ss_pred CCCEEEEEEEEECCCCCCCC--CCEEEE-----EcCCCCCccCCcCCcccccCCCCCcEEEEEECCCCCEEEE
No 151
>PF00576 Transthyretin: HIUase/Transthyretin family; InterPro: IPR023416 This family includes transthyretin that is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. However, most of the sequences listed in this family do not bind thyroid hormones. They are actually enzymes of the purine catabolism that catalyse the conversion of 5-hydroxyisourate (HIU) to OHCU [, ]. HIU hydrolysis is the original function of the family and is conserved from bacteria to mammals; transthyretins arose by gene duplications in the vertebrate lineage [, ]. HIUases are distinguished in the alignment from the conserved C-terminal YRGS sequence. Transthyretin (formerly prealbumin) is one of 3 thyroid hormone-binding proteins found in the blood of vertebrates []. It is produced in the liver and circulates in the bloodstream, where it binds retinol and thyroxine (T4) []. It differs from the other 2 hormone-binding proteins (T4-binding globulin and albumin) in 3 distinct ways: (1) the gene is expressed at a high rate in the brain choroid plexus; (2) it is enriched in cerebrospinal fluid; and (3) no genetically caused absence has been observed, suggesting an essential role in brain function, distinct from that played in the bloodstream []. The protein consists of around 130 amino acids, which assemble as a homotetramer that contains an internal channel in which T4 is bound. Within this complex, T4 appears to be transported across the blood-brain barrier, where, in the choroid plexus, the hormone stimulates further synthesis of transthyretin. The protein then diffuses back into the bloodstream, where it binds T4 for transport back to the brain [].; PDB: 1TFP_B 1KGJ_D 1IE4_C 1GKE_C 1KGI_D 2H0J_B 2H0E_B 2H0F_B 1ZD6_A 3DGD_D ....
Probab=25.12 E-value=2.1e+02 Score=28.35 Aligned_cols=62 Identities=18% Similarity=0.204 Sum_probs=38.4
Q ss_pred EEcC-CCccCCCcceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 121 VLNS-HLKPSVTGALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 121 ~ld~-~l~P~~~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+||. ..+|+.+..+.+.-.++.+....--...++..|-+...+.-......|.|+|+...++
T Consensus 7 VLDt~~G~PA~gv~V~L~~~~~~~~~~~l~~~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~ 69 (112)
T PF00576_consen 7 VLDTTTGKPAAGVPVTLYRLDSDGSWTLLAEGVTDADGRIKQPLLEGESLEPGIYKLVFDTGD 69 (112)
T ss_dssp EEETTTTEE-TT-EEEEEEEETTSCEEEEEEEEBETTSEESSTSSETTTS-SEEEEEEEEHHH
T ss_pred EeeCCCCCCccCCEEEEEEecCCCCcEEEEEEEECCCCcccccccccccccceEEEEEEEHHH
Confidence 4554 4688877888887777544333323456888888753333356667899999887754
No 152
>PF01122 Cobalamin_bind: Eukaryotic cobalamin-binding protein; InterPro: IPR002157 Cobalamin (Cbl or vitamin B12) is only accessible through diet in mammals. Absorption, plasma transport and cellular uptake of Cbl in mammals involves three Cbl-transporting proteins, which are listed below in order of increasing Cbl-specificity: Haptocorrin (cobalophilin), which binds Cbl and Cbl-derivatives such as cobinamide; it may play a role in preventing the absorption of cobalamin analogues produced by bacteria. Transcobalamin (TC), which transport Cbl from blood to cells. Intrinsic factor (IF), which promotes Cbl absorption in the ileum by specific receptor-mediated endocytosis. The structure of TC reveals a two-domain structure, an N-terminal alpha(6)-alpha(6) barrel, and a smaller C-terminal domain []. Many interactions between Cbl and its binding site in the interface of the two domains are conserved among the other Cbl transporters. Specificity for Cbl between the different transporters may reside in a beta-hairpin motif found in the smaller C-terminal domain []. ; GO: 0031419 cobalamin binding, 0015889 cobalamin transport; PDB: 3KQ4_A 2PMV_A 2BB5_A 2V3N_A 2BBC_A 2BB6_D 2V3P_A.
Probab=25.10 E-value=88 Score=36.90 Aligned_cols=76 Identities=22% Similarity=0.254 Sum_probs=48.6
Q ss_pred HHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhcccCCCccccc
Q psy5522 923 AKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSNQAVNGSFPEV 1002 (1397)
Q Consensus 923 ~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~q~~~G~F~st 1002 (1397)
.++++.+.+ +++.-|.+||.| ||++-|+-|+.+|..+..++.--...-.++++||.++. .+|.|...
T Consensus 214 ~~~i~~~~~---kIl~~q~~~G~~---------GNiySTglAmQAL~~~~~~~~~~~w~C~k~~~~ll~~i-~~G~F~nP 280 (326)
T PF01122_consen 214 QQAIRSLVE---KILSQQKPNGLF---------GNIYSTGLAMQALSVSPSPPSESEWNCQKALDALLKEI-SQGAFQNP 280 (326)
T ss_dssp HHHHHHHHH---HHHHTB-TTS-B---------SSTTTHHHHHHHHTT-SS-SSHHHHHHHHHHHHHHHHH-TTTTT-SH
T ss_pred HHHHHHHHH---HHHHhcCCCCcc---------cchhhHHHHHHHHhcCCCCCcchhhHHHHHHHHHHHHh-hcCCCCCH
Confidence 455554444 566678889976 47888999999999887652211244677899998764 59999864
Q ss_pred hH---hHHHhhc
Q psy5522 1003 GK---VSHADMQ 1011 (1397)
Q Consensus 1003 qd---vl~al~~ 1011 (1397)
-. ||.+|..
T Consensus 281 ~a~aQiLPaL~g 292 (326)
T PF01122_consen 281 MAIAQILPALNG 292 (326)
T ss_dssp HHHHHHHHHHTT
T ss_pred HHHHHHHHHHcC
Confidence 33 4666655
No 153
>COG4935 Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones]
Probab=24.98 E-value=5.4e+02 Score=27.59 Aligned_cols=53 Identities=21% Similarity=0.334 Sum_probs=35.9
Q ss_pred CCcceEEEEECCCCCeEEeeee---ec--cCCceeeeee---ecCCCCCCeeEEEEEEEcC
Q psy5522 130 VTGALEIFITDGKGNRIKQWTR---AL--TTRGVFSADL---QLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 130 ~~~~~~v~I~DP~g~~I~qw~~---~~--~~~Gi~~~sf---~L~~~~~lG~y~I~v~~~~ 182 (1397)
+-+-+.|++..|.|..|..-+. .. ..+|....+| ++=.+++.|+|++.+.-..
T Consensus 50 yrGDL~I~L~sP~g~~Il~~~~~~~~~D~~~q~~~~~s~~l~~~~ge~~~G~W~L~V~D~~ 110 (177)
T COG4935 50 YRGDLRITLISPSGQGILSNLTLGRRYDDDSQGYNDQSFMLVQLWGERAEGNWRLEVQDLA 110 (177)
T ss_pred ccccEEEEEeCCCCcceEEeeccCCccccccceeeeeeeeeeeeccCCCCceEEEEEEecC
Confidence 4567788888998655543221 12 2456777777 5567788999999998754
No 154
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=23.96 E-value=2.1e+02 Score=32.01 Aligned_cols=48 Identities=13% Similarity=0.199 Sum_probs=34.0
Q ss_pred CcceEEEEECCCCCeEEeeeeeccCCceeeeeee-cCCC---CCCeeEEEEEEEc
Q psy5522 131 TGALEIFITDGKGNRIKQWTRALTTRGVFSADLQ-LSKS---PVLGDWNITINVL 181 (1397)
Q Consensus 131 ~~~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~-L~~~---~~lG~y~I~v~~~ 181 (1397)
-..+.|.|.|.+|+.|+.... ..|....++. ...+ .+-|.|+|.|...
T Consensus 123 a~~v~v~I~D~~G~vV~t~~~---~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~ 174 (223)
T PRK12813 123 ADKAELVVRDAAGAEVARETV---PVGAGPVEWAGEDADGNPLPNGAYSFVVESY 174 (223)
T ss_pred CceEEEEEEcCCCCEEEEEee---CCCceeEEeCCcCCCCCcCCCccEEEEEEEE
Confidence 467899999999999987542 4566666665 2222 4469999998764
No 155
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=23.80 E-value=2.2e+02 Score=24.05 Aligned_cols=40 Identities=15% Similarity=0.021 Sum_probs=30.9
Q ss_pred CceeeeCCEEEEEEEEEcCCCccEE-EEEEEeecCCeeeee
Q psy5522 722 PYSVMRGEVVAIPVVVFNYLSQDLV-ADVTLENVGQFDFAD 761 (1397)
Q Consensus 722 Pysv~rGE~~~l~~~VfNyl~~~~~-v~v~L~~s~~~~~~~ 761 (1397)
+..+.+||.+.-.++|.|..+.... |.|+-....++.+++
T Consensus 5 ~~~~~~Gd~v~Yti~v~N~g~~~a~~v~v~D~lP~g~~~v~ 45 (53)
T TIGR01451 5 KTVATIGDTITYTITVTNNGNVPATNVVVTDILPSGTTFVS 45 (53)
T ss_pred ccccCCCCEEEEEEEEEECCCCceEeEEEEEcCCCCCEEEe
Confidence 4567899999999999999887654 666655666777764
No 156
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial. Members of this family are catechol 1,2-dioxygenases of the Actinobacteria. They are more closely related to actinobacterial chlorocatechol 1,2-dioxygenases than to proteobacterial catechol 1,2-dioxygenases, and so are built in this separate model. The member from Rhodococcus rhodochrous NCIMB 13259 (GB|AAC33003.1) is described as a homodimer with bound Fe, similarly active on catechol, 3-methylcatechol and 4-methylcatechol.
Probab=23.46 E-value=2e+02 Score=33.37 Aligned_cols=23 Identities=17% Similarity=0.222 Sum_probs=20.7
Q ss_pred cCCCeEEEEEEEEccCCCccCCC
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTDN 347 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~~ 347 (1397)
.+|.|..+..+|+|.+|+|+++.
T Consensus 128 ~~G~pl~v~G~V~D~~G~Pv~gA 150 (281)
T TIGR02438 128 EAGTPLVFSGQVTDLDGNGLAGA 150 (281)
T ss_pred CCCCEEEEEEEEEcCCCCCcCCC
Confidence 47999999999999999999953
No 157
>cd03458 Catechol_intradiol_dioxygenases Catechol intradiol dioxygenases can be divided into several subgroups according to their substrate specificity for catechol, chlorocatechols and hydroxyquinols. Almost all members of this family are homodimers containing one ferric ion (Fe3+) per monomer. They belong to the intradiol dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=23.23 E-value=2e+02 Score=32.78 Aligned_cols=22 Identities=23% Similarity=0.309 Sum_probs=20.6
Q ss_pred cCCCeEEEEEEEEccCCCccCC
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTD 346 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~ 346 (1397)
.+|.|..+..+|+|.+|+|+++
T Consensus 100 ~~G~~l~l~G~V~D~~G~Pv~~ 121 (256)
T cd03458 100 ADGEPLFVHGTVTDTDGKPLAG 121 (256)
T ss_pred CCCcEEEEEEEEEcCCCCCCCC
Confidence 6799999999999999999995
No 158
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=23.21 E-value=3.6e+02 Score=27.73 Aligned_cols=64 Identities=16% Similarity=0.235 Sum_probs=41.6
Q ss_pred EEEEEEEcC-CCccCCCcceEEEEECCCC-CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 116 RFRAIVLNS-HLKPSVTGALEIFITDGKG-NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 116 ~fRvv~ld~-~l~P~~~~~~~v~I~DP~g-~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
.++.-++|. .-+|+.+-.+.++-.++++ ..+.+ ..++..|-+..-+. +++...|.|+|+...++
T Consensus 28 ~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~--~~Td~dGR~~~l~~-~~~~~~G~Y~L~F~t~~ 93 (137)
T PRK15036 28 ILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNT--AKTDKDGRIKALWP-EQTATTGDYRVVFKTGD 93 (137)
T ss_pred CeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEE--EEECCCCCCccccC-cccCCCeeEEEEEEcch
Confidence 477777886 4689877777776554432 22322 35788888865222 33356899999998765
No 159
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=23.20 E-value=2.1e+02 Score=32.34 Aligned_cols=57 Identities=16% Similarity=0.188 Sum_probs=35.9
Q ss_pred cCCcccCCCeEEEEEEEEccCCCccCCCCceEEEEEee-----eccc---cccceeeEeeCCCeeEEEEE
Q psy5522 320 SSEYFKPGLKYTAYMKLTHHDGTPVTDNNNMVQVRHGF-----SYDE---SKYEANQYKLDRNGMIKLVY 381 (1397)
Q Consensus 320 ~~~~~kpG~p~~~~v~v~~~dG~P~~~~~~~V~v~~~~-----~~~~---~~~~~~~~ttd~~G~~~f~i 381 (1397)
.|...+.|.+|.++|. + +|+|+++ .+|.+...+ +|-. .+.......||++|.+.|..
T Consensus 164 nP~~a~~ge~f~~~vl--~-~GkPv~n--A~V~v~~~n~~~~d~~a~~~~~ek~~~~~~TD~kG~~~fip 228 (264)
T COG5266 164 NPANAWVGEVFRGKVL--D-NGKPVPN--ATVEVEFDNIDTKDNRAKTGNTEKTALVQFTDDKGEVSFIP 228 (264)
T ss_pred CccccccCCeEEEEEE--E-CCccCCC--cEEEEEEecccccccccccCCCCCcceEEEcCCCceEEEEE
Confidence 3556778999987663 3 3999995 466665321 1211 12334567899999877764
No 160
>cd03462 1,2-CCD chlorocatechol 1,2-dioxygenases (1,2-CCDs) (type II enzymes) are homodimeric intradiol dioxygenases that degrade chlorocatechols via the addition of molecular oxygen and the subsequent cleavage between two adjacent hydroxyl groups. This reaction is part of the modified ortho-cleavage pathway which is a central oxidative bacterial pathway that channels chlorocatechols, derived from the degradation of chlorinated benzoic acids, phenoxyacetic acids, phenols, benzenes, and other aromatics into the energy-generating tricarboxylic acid pathway.
Probab=23.15 E-value=2.1e+02 Score=32.48 Aligned_cols=22 Identities=14% Similarity=0.092 Sum_probs=20.2
Q ss_pred cCCCeEEEEEEEEccCCCccCC
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTD 346 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~ 346 (1397)
.+|.+..+.-+|+|.+|+|+++
T Consensus 95 ~~G~~l~l~G~V~D~~G~Pv~~ 116 (247)
T cd03462 95 DDHKPLLFRGTVKDLAGAPVAG 116 (247)
T ss_pred CCCCEEEEEEEEEcCCCCCcCC
Confidence 5699999999999999999995
No 161
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=23.00 E-value=4e+02 Score=24.87 Aligned_cols=58 Identities=16% Similarity=0.083 Sum_probs=39.1
Q ss_pred EEEEEEEcCCCccEEEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEEEEEEEeceeeeEEEEEEE
Q psy5522 732 AIPVVVFNYLSQDLVADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTTTFVITPKELGYIGIKVTA 810 (1397)
Q Consensus 732 ~l~~~VfNyl~~~~~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv~f~v~p~~lG~v~itV~A 810 (1397)
.|.+++.|-....+.++|.- . .+ ......++.|++|+..++.|++ ...-|--.|+|++
T Consensus 21 ~l~l~l~N~g~~~~~~~v~~--~-~y-----------------~~~~~~~~~v~ag~~~~~~w~l-~~s~gwYDl~v~~ 78 (89)
T PF05506_consen 21 NLRLTLSNPGSAAVTFTVYD--N-AY-----------------GGGGPWTYTVAAGQTVSLTWPL-AASGGWYDLTVTG 78 (89)
T ss_pred EEEEEEEeCCCCcEEEEEEe--C-Cc-----------------CCCCCEEEEECCCCEEEEEEee-cCCCCcEEEEEEc
Confidence 67777788766666555542 1 11 0122568899999999999998 5566777777776
No 162
>cd05469 Transthyretin_like Transthyretin_like. This domain is present in the transthyretin-like protein (TLP) family which includes transthyretin (TTR) and a transthyretin-related protein called 5-hydroxyisourate hydrolase (HIUase). TTR and HIUase are homotetrameric proteins with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits. TTR transports thyroid hormones and retinol in the blood serum of vertebrates while HIUase catalyzes the second step in a three-step ureide pathway. TTRs are highly conserved and found only in vertebrates while the HIUases are found in a wide range of bacterial, plant, fungal, slime mold and vertebrate organisms.
Probab=22.95 E-value=4.2e+02 Score=26.33 Aligned_cols=59 Identities=14% Similarity=0.265 Sum_probs=37.2
Q ss_pred EEcC-CCccCCCcceEEEEECCCC--CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 121 VLNS-HLKPSVTGALEIFITDGKG--NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 121 ~ld~-~l~P~~~~~~~v~I~DP~g--~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+||. ..+|+.+..+.+.-.++.+ ..|.+ ..+++.|-+..-+. .++...|.|+|+...++
T Consensus 7 VLDt~~G~PAagv~V~L~~~~~~~~w~~l~~--~~Tn~DGR~~~~l~-~~~~~~G~Y~l~F~t~~ 68 (113)
T cd05469 7 VLDAVRGSPAANVAIKVFRKTADGSWEIFAT--GKTNEDGELHGLIT-EEEFXAGVYRVEFDTKS 68 (113)
T ss_pred EEeCCCCccCCCCEEEEEEecCCCceEEEEE--EEECCCCCccCccc-cccccceEEEEEEehHH
Confidence 4554 4578777788887776532 23332 45778888743222 45556799999887764
No 163
>cd03460 1,2-CTD Catechol 1,2 dioxygenase (1,2-CTD) catalyzes an intradiol cleavage reaction of catechol to form cis,cis-muconate. 1,2-CTDs is homodimers with one catalytic non-heme ferric ion per monomer. They belong to the aromatic dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=22.95 E-value=2e+02 Score=33.37 Aligned_cols=22 Identities=27% Similarity=0.371 Sum_probs=20.2
Q ss_pred cCCCeEEEEEEEEccCCCccCC
Q psy5522 325 KPGLKYTAYMKLTHHDGTPVTD 346 (1397)
Q Consensus 325 kpG~p~~~~v~v~~~dG~P~~~ 346 (1397)
.+|.|..+..+|+|.+|+|+++
T Consensus 120 ~~Gepl~l~G~V~D~~G~PI~~ 141 (282)
T cd03460 120 DDGETLVMHGTVTDTDGKPVPG 141 (282)
T ss_pred CCCCEEEEEEEEECCCCCCcCC
Confidence 4799999999999999999995
No 164
>PF14782 BBS2_C: Ciliary BBSome complex subunit 2, C-terminal
Probab=22.81 E-value=2.7e+02 Score=34.38 Aligned_cols=81 Identities=17% Similarity=0.325 Sum_probs=58.1
Q ss_pred EEEEEeCCCeeEEEEEEEeceeeeEEEEEEEEeCCCCceeEEEEEecCCCeeeEEEEEEEEecCCCceeEEEEEecCCCC
Q psy5522 780 KKLTIKANSGSTTTFVITPKELGYIGIKVTATSNLAGDSMEGKLLVKPEGETQYKNKAIFVDLRKNKTFSVNVTLDMPKN 859 (1397)
Q Consensus 780 ~~~~v~a~~~~tv~f~v~p~~lG~v~itV~A~~~~~~D~v~k~L~V~peG~~~~~t~s~~l~~~~~~~~~~~i~l~~P~~ 859 (1397)
....+|++-.-.+.|.+.+ ..|.+.+.|++. .|+++|.++|..||+-..+++. .++.. .+.+-.+.|..|+|
T Consensus 93 ~~~~ipa~T~l~~~l~~n~-~~~~~eL~vst~----ndtiIr~vlIfaEglF~gEs~v--~~~~~-pss~l~ipL~~~kd 164 (431)
T PF14782_consen 93 DTGVIPANTQLQVSLSVNP-ETGCVELVVSTS----NDTIIRAVLIFAEGLFEGESHV--PSPQN-PSSTLRIPLRPPKD 164 (431)
T ss_pred cCCccCCCCEEEEEEeeCC-cCCcEEEEEEeC----CCcEEEEEEEEeccccCCcccc--ccCCC-CCCeEEEEeeCCCC
Confidence 3557888988889998888 778888888885 3999999999999996665554 44432 23456677777765
Q ss_pred cCCCCcEEEEEE
Q psy5522 860 IVPGSEHVEVSA 871 (1397)
Q Consensus 860 ~v~gs~~~~vsv 871 (1397)
. ...+.+.+
T Consensus 165 ~---~vdL~iKv 173 (431)
T PF14782_consen 165 V---PVDLHIKV 173 (431)
T ss_pred C---ceEEEEEE
Confidence 4 23455554
No 165
>TIGR02962 hdxy_isourate hydroxyisourate hydrolase. Members of this family, hydroxyisourate hydrolase, represent a distinct clade of transthyretin-related proteins. Bacterial members typically are encoded next to ureidoglycolate hydrolase and often near either xanthine dehydrogenase or xanthine/uracil permease genes and have been demonstrated to have hydroxyisourate hydrolase activity. In eukaryotes, a clade separate from the transthyretins (a family of thyroid-hormone binding proteins) has also been shown to have HIU hydrolase activity in urate catabolizing organisms. Transthyretin, then, would appear to be the recently diverged paralog of the more ancient HIUH family.
Probab=22.27 E-value=4.2e+02 Score=26.27 Aligned_cols=60 Identities=18% Similarity=0.297 Sum_probs=37.5
Q ss_pred EEcC-CCccCCCcceEEEEECCCC-CeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 121 VLNS-HLKPSVTGALEIFITDGKG-NRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 121 ~ld~-~l~P~~~~~~~v~I~DP~g-~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
+||. ..+|+.+..+.++-.++++ ..|. ..+++..|-+..-+.-.+....|.|+|+...++
T Consensus 7 VLDt~~G~PAagv~V~L~~~~~~~~~~i~--~~~Tn~DGR~~~~l~~~~~~~~G~Y~l~F~~g~ 68 (112)
T TIGR02962 7 VLDTTSGKPAAGVPVTLYRLDGSGWTPLA--EGVTNADGRCPDLLPEGETLAAGIYKLRFDTGD 68 (112)
T ss_pred EEeCCCCccCCCCEEEEEEecCCCeEEEE--EEEECCCCCCcCcccCcccCCCeeEEEEEEhhh
Confidence 4663 5688777777777666543 2232 346788888764333233345699999987764
No 166
>COG1689 Uncharacterized protein conserved in archaea [Function unknown]
Probab=22.10 E-value=76 Score=34.69 Aligned_cols=66 Identities=12% Similarity=0.152 Sum_probs=46.2
Q ss_pred HHHHHHHHHHhccccceeeecCCCCccccCCCCCCCCcchhHHHHHHHHHccCCCccCHHHHHHHHHHHHhc-ccCCCcc
Q psy5522 921 IEAKASRYLETGYQQELTYRRPDGSFSAFGTTDPNGSTWLTAFVAKSFRQAASHTTIDESVILEALAWLSSN-QAVNGSF 999 (1397)
Q Consensus 921 ~~~~a~~~l~~gy~~~l~y~~~dGsfs~~~~~~~~~s~wlTAyvl~~~~~a~~~i~vd~~~~~~~~~WL~~~-q~~~G~F 999 (1397)
..++.++|++. -||.||+|....+- +.++-=|=||++.+...+ .|....++.+.||.+| |...++|
T Consensus 7 ~l~rvi~fi~~-------RrhedGGy~f~~~L--pdti~~TyYAi~i~s~lg----~evPr~EktiefL~d~~qt~~~~~ 73 (274)
T COG1689 7 NLNRVIEFIEK-------RRHEDGGYCFVSQL--PDTINDTYYAIKIYSLLG----HEVPRKEKTIEFLYDQMQTAGVGV 73 (274)
T ss_pred hHHHHHHHHHH-------hhcCCCCeEEeccC--cchhhhhhhhhhhhhhcC----CcCchHHHHHHHHHHHHHHhhhHH
Confidence 45677788876 48999999766442 223466889999988765 3556678999999874 3444444
No 167
>PF07718 Coatamer_beta_C: Coatomer beta C-terminal region; InterPro: IPR011710 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C-terminal domain of the beta subunit from coatomer proteins (Beta-coat proteins). The C-terminal domain probably adapts the function of the N-terminal IPR002553 from INTERPRO domain. Coatomer protein complex I (COPI)-coated vesicles are involved in transport between the endoplasmic reticulum and the Golgi but also participate in transport from early to late endosomes within the endocytic pathway []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat
Probab=21.90 E-value=2.6e+02 Score=28.83 Aligned_cols=67 Identities=15% Similarity=0.281 Sum_probs=48.2
Q ss_pred eEEEEecCCceeeeCCEEEEEEEEEcCCCccE-EEEEEEeecCCeeeeecCCCccCCCCCCcceeeeEEEEEeCCCeeEE
Q psy5522 714 PFFISLDLPYSVMRGEVVAIPVVVFNYLSQDL-VADVTLENVGQFDFADFSNEVDAAPQPKFEVFRRKKLTIKANSGSTT 792 (1397)
Q Consensus 714 pffv~l~LPysv~rGE~~~l~~~VfNyl~~~~-~v~v~L~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~~~tv 792 (1397)
|.|++..+= +-.=.+.|.+.+.|..+..+ .+++.|....+++++. +.+.+++.|++...+
T Consensus 57 PvYaEA~v~---v~q~DIvLDvllvNqT~~tLqNl~vElat~gdLklve----------------~p~~~tL~P~~~~~i 117 (140)
T PF07718_consen 57 PVYAEAYVT---VHQYDIVLDVLLVNQTNETLQNLTVELATLGDLKLVE----------------RPQPITLAPHGFARI 117 (140)
T ss_pred CeEEEEEEE---EEeeeEEEEEEEEeCChhhhhcEEEEEEecCCcEEcc----------------CCCceeeCCCcEEEE
Confidence 555554432 22335678899999998765 4999999999988752 234678999999988
Q ss_pred EEEEEec
Q psy5522 793 TFVITPK 799 (1397)
Q Consensus 793 ~f~v~p~ 799 (1397)
...|+..
T Consensus 118 ~~~iKVs 124 (140)
T PF07718_consen 118 KATIKVS 124 (140)
T ss_pred EEEEEEE
Confidence 8887754
No 168
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=21.84 E-value=5.1e+02 Score=24.59 Aligned_cols=47 Identities=17% Similarity=0.327 Sum_probs=31.0
Q ss_pred ceEEEEECCCCCeEEeeeeeccCCceeeeeeecCCCCCCeeEEEEEEEcC
Q psy5522 133 ALEIFITDGKGNRIKQWTRALTTRGVFSADLQLSKSPVLGDWNITINVLD 182 (1397)
Q Consensus 133 ~~~v~I~DP~g~~I~qw~~~~~~~Gi~~~sf~L~~~~~lG~y~I~v~~~~ 182 (1397)
.+.+.+.-.+|..+-.|..-....+.+.+. |..-..|.|+|++...+
T Consensus 31 ~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt---P~~~~~g~~~i~v~a~D 77 (97)
T smart00736 31 TLTYSATLSDGSALPSWLSFDSDTGTLSGT---PTNSDVGSLSLKVTATD 77 (97)
T ss_pred eEEEEEEeCCCCCCCCeEEEeCCCCEEEEE---CCCCCCcEEEEEEEEEE
Confidence 444444433444445688777778888883 44444799999998865
No 169
>cd05703 S1_Rrp5_repeat_hs12_sc9 S1_Rrp5_repeat_hs12_sc9: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 12 (hs12) and S. cerevisiae S1 repeat 9 (sc9). Rrp5 is found in eukaryotes but not in prokaryotes or archaea.
Probab=21.71 E-value=1.2e+02 Score=27.31 Aligned_cols=18 Identities=6% Similarity=0.287 Sum_probs=16.1
Q ss_pred cccCCceeEEEEEEEcCC
Q psy5522 108 IYKPGDLVRFRAIVLNSH 125 (1397)
Q Consensus 108 IYkPGqtV~fRvv~ld~~ 125 (1397)
.|++||.|.++++.+|.+
T Consensus 46 ~~~vG~~v~~kV~~id~~ 63 (73)
T cd05703 46 KFPIGQALKAKVVGVDKE 63 (73)
T ss_pred hCCCCCEEEEEEEEEeCC
Confidence 499999999999998864
No 170
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=20.84 E-value=2.1e+02 Score=25.75 Aligned_cols=40 Identities=20% Similarity=0.142 Sum_probs=31.8
Q ss_pred CCceeeeCCEEEEEEEEEcCCCccEE-EEEEEeecCCeeee
Q psy5522 721 LPYSVMRGEVVAIPVVVFNYLSQDLV-ADVTLENVGQFDFA 760 (1397)
Q Consensus 721 LPysv~rGE~~~l~~~VfNyl~~~~~-v~v~L~~s~~~~~~ 760 (1397)
-+..+..||.+...++|.|..+.... |.|+-....+++++
T Consensus 33 ~~~~~~~Gd~v~ytitvtN~G~~~a~nv~v~D~lp~g~~~v 73 (76)
T PF01345_consen 33 NPSTANPGDTVTYTITVTNTGPAPATNVVVTDTLPAGLTFV 73 (76)
T ss_pred CCCcccCCCEEEEEEEEEECCCCeeEeEEEEEcCCCCCEEe
Confidence 36778999999999999999988765 66666566677665
No 171
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=20.77 E-value=2.2e+02 Score=28.73 Aligned_cols=58 Identities=12% Similarity=0.137 Sum_probs=43.0
Q ss_pred EeCCeEEEEEEEEecCCceeEEEEEeeCCCCCceeEEEEEEEe-cCCeEEEEeEEEEEcc
Q psy5522 452 LGRGDVIMADTITVPGNKMSTVIRFLATYAMAPTAHVIVQYVR-EDGEVVADGLDLELEG 510 (1397)
Q Consensus 452 ~srG~Iv~~~~~~~~~~~~~~~~~l~vt~~maP~a~l~vy~v~-~~gevvads~~~~V~~ 510 (1397)
...|.-+..+..+... +....++++....+.+..+.+.+-+. .||.++..++.|.|..
T Consensus 68 ~~d~~~v~t~~~~~~~-~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~~sFsV~~ 126 (127)
T COG2372 68 GPDGEEVATAGTKLDE-QNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGSISFSVGA 126 (127)
T ss_pred CCCCCccccCcccccc-cCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccEEEEEecC
Confidence 3445555444444432 23467999999999999999999865 5899999999999875
Done!