Query psy11447
Match_columns 360
No_of_seqs 115 out of 133
Neff 3.9
Searched_HMMs 46136
Date Fri Aug 16 18:42:45 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11447.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11447hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd01455 vWA_F11C1-5a_type Von 100.0 3.5E-48 7.7E-53 353.7 13.9 190 104-359 1-191 (191)
2 KOG1808|consensus 99.9 8.5E-26 1.8E-30 254.5 -1.8 235 1-324 1606-1856(1856)
3 PF13519 VWA_2: von Willebrand 98.5 1.5E-06 3.2E-11 72.4 12.0 128 211-353 40-172 (172)
4 KOG1808|consensus 98.5 1.3E-07 2.8E-12 109.2 5.2 117 21-141 919-1043(1856)
5 smart00327 VWA von Willebrand 97.6 0.0025 5.5E-08 53.3 13.4 140 199-355 29-174 (177)
6 cd01453 vWA_transcription_fact 97.4 0.0023 5.1E-08 57.5 11.6 113 232-352 64-177 (183)
7 cd00198 vWFA Von Willebrand fa 97.3 0.0073 1.6E-07 48.6 12.8 114 199-320 28-141 (161)
8 cd01470 vWA_complement_factors 97.0 0.02 4.3E-07 51.1 13.3 144 199-350 28-197 (198)
9 TIGR03436 acidobact_VWFA VWFA- 96.9 0.021 4.6E-07 54.1 13.1 97 254-355 138-252 (296)
10 PRK13685 hypothetical protein; 96.7 0.045 9.8E-07 53.4 14.5 167 108-355 92-288 (326)
11 cd01454 vWA_norD_type norD typ 96.3 0.056 1.2E-06 47.2 11.1 113 199-322 29-154 (174)
12 cd01476 VWA_integrin_invertebr 96.2 0.071 1.5E-06 45.5 11.0 114 199-321 27-142 (163)
13 cd01465 vWA_subgroup VWA subgr 96.2 0.087 1.9E-06 44.8 11.2 92 255-350 73-170 (170)
14 cd01461 vWA_interalpha_trypsin 96.0 0.42 9E-06 40.5 14.5 135 204-350 32-169 (171)
15 PF00092 VWA: von Willebrand f 95.8 0.091 2E-06 44.3 9.5 143 199-353 27-178 (178)
16 cd01467 vWA_BatA_type VWA BatA 95.6 0.3 6.4E-06 42.2 12.2 85 256-346 82-180 (180)
17 cd01469 vWA_integrins_alpha_su 95.5 0.22 4.7E-06 44.1 11.4 140 199-349 28-177 (177)
18 cd01450 vWFA_subfamily_ECM Von 95.1 0.5 1.1E-05 39.1 11.6 112 199-320 28-142 (161)
19 cd01475 vWA_Matrilin VWA_Matri 95.0 0.47 1E-05 43.5 12.1 145 199-355 30-181 (224)
20 cd01472 vWA_collagen von Wille 94.9 0.54 1.2E-05 40.4 11.5 111 199-321 28-141 (164)
21 PTZ00441 sporozoite surface pr 94.9 1.1 2.3E-05 48.4 15.9 151 199-355 71-228 (576)
22 cd01480 vWA_collagen_alpha_1-V 94.7 0.66 1.4E-05 41.2 12.0 92 251-347 82-177 (186)
23 cd01471 vWA_micronemal_protein 93.9 2 4.2E-05 37.7 13.1 121 199-325 29-152 (186)
24 cd01482 vWA_collagen_alphaI-XI 93.6 2.1 4.6E-05 37.0 12.6 131 199-342 28-163 (164)
25 cd01462 VWA_YIEM_type VWA YIEM 92.9 1.7 3.7E-05 36.8 10.8 108 199-323 28-137 (152)
26 cd01474 vWA_ATR ATR (Anthrax T 91.9 3.8 8.2E-05 36.3 12.1 98 255-356 78-180 (185)
27 cd01456 vWA_ywmD_type VWA ywmD 91.9 6.9 0.00015 35.2 13.9 86 252-343 112-203 (206)
28 TIGR02640 gas_vesic_GvpN gas v 91.4 0.54 1.2E-05 44.6 6.6 65 38-107 173-237 (262)
29 COG4548 NorD Nitric oxide redu 91.1 0.87 1.9E-05 49.0 8.3 97 255-356 529-634 (637)
30 cd01466 vWA_C3HC4_type VWA C3H 90.6 10 0.00023 32.7 13.3 75 258-340 78-154 (155)
31 cd01463 vWA_VGCC_like VWA Volt 89.7 9 0.00019 34.0 12.4 122 211-340 50-186 (190)
32 cd01473 vWA_CTRP CTRP for CS 85.6 28 0.00061 31.6 14.7 145 199-354 29-187 (192)
33 TIGR03788 marine_srt_targ mari 82.5 53 0.0012 35.0 15.6 155 191-356 284-444 (596)
34 PF14062 DUF4253: Domain of un 82.3 2.4 5.3E-05 36.2 4.7 67 199-268 32-101 (111)
35 cd01477 vWA_F09G8-8_type VWA F 81.2 31 0.00068 31.6 11.8 65 256-323 106-174 (193)
36 cd01451 vWA_Magnesium_chelatas 80.7 39 0.00085 29.7 14.3 84 258-345 77-169 (178)
37 PF08406 CbbQ_C: CbbQ/NirQ/Nor 80.7 8.5 0.00018 31.7 7.1 73 57-137 3-81 (86)
38 TIGR00868 hCaCC calcium-activa 78.6 48 0.001 37.8 14.3 129 211-354 343-477 (863)
39 cd01458 vWA_ku Ku70/Ku80 N-ter 75.3 24 0.00051 32.3 9.2 88 234-322 76-174 (218)
40 PF08683 CAMSAP_CKK: Microtubu 74.0 9.3 0.0002 33.7 5.9 81 241-346 15-111 (123)
41 cd00951 KDGDH 5-dehydro-4-deox 69.6 1.1E+02 0.0023 29.6 12.7 44 213-266 39-87 (289)
42 cd01455 vWA_F11C1-5a_type Von 68.5 4.3 9.3E-05 38.1 2.8 65 120-189 19-84 (191)
43 TIGR02031 BchD-ChlD magnesium 68.0 1.5E+02 0.0032 32.1 14.4 85 257-344 482-584 (589)
44 TIGR02442 Cob-chelat-sub cobal 66.5 64 0.0014 35.0 11.4 119 206-339 499-631 (633)
45 PRK13406 bchD magnesium chelat 63.2 55 0.0012 35.5 10.1 124 206-343 434-570 (584)
46 TIGR01650 PD_CobS cobaltochela 60.1 18 0.0004 36.5 5.6 67 40-106 209-280 (327)
47 KOG2454|consensus 59.9 37 0.00081 36.0 7.8 86 251-353 226-322 (583)
48 PF13768 VWA_3: von Willebrand 58.2 1.1E+02 0.0025 25.7 10.6 110 203-323 28-138 (155)
49 cd08567 GDPD_SpGDE_like Glycer 57.5 15 0.00032 34.0 4.2 35 257-292 15-54 (263)
50 cd00408 DHDPS-like Dihydrodipi 56.9 1.8E+02 0.0038 27.5 12.0 106 212-333 35-174 (281)
51 cd00615 Orn_deC_like Ornithine 56.6 1.1E+02 0.0023 29.2 9.9 23 294-317 139-161 (294)
52 cd08601 GDPD_SaGlpQ_like Glyce 56.1 19 0.00042 33.6 4.7 35 257-292 15-54 (256)
53 PF11775 CobT_C: Cobalamin bio 52.9 55 0.0012 31.7 7.2 88 260-357 119-216 (219)
54 cd00300 LDH_like L-lactate deh 51.7 49 0.0011 32.2 6.9 86 221-324 66-174 (300)
55 cd08600 GDPD_EcGlpQ_like Glyce 51.1 21 0.00046 35.2 4.3 36 257-293 15-55 (318)
56 COG0773 MurC UDP-N-acetylmuram 50.6 2.6E+02 0.0057 29.9 12.3 156 191-355 294-455 (459)
57 PF03456 uDENN: uDENN domain; 49.7 4.3 9.3E-05 30.7 -0.6 36 250-295 24-60 (65)
58 PRK00421 murC UDP-N-acetylmura 47.9 1.9E+02 0.0041 29.5 10.6 154 191-355 293-454 (461)
59 COG3414 SgaB Phosphotransferas 47.8 46 0.00099 27.9 5.2 79 253-355 8-89 (93)
60 TIGR03249 KdgD 5-dehydro-4-deo 47.8 2.7E+02 0.0059 27.0 12.3 60 196-266 28-92 (296)
61 PF06888 Put_Phosphatase: Puta 47.4 9 0.00019 36.8 1.1 78 224-305 21-121 (234)
62 TIGR01082 murC UDP-N-acetylmur 47.1 2.2E+02 0.0047 29.0 10.9 153 193-353 289-447 (448)
63 cd06267 PBP1_LacI_sugar_bindin 46.6 1.7E+02 0.0036 25.5 8.8 48 296-347 164-218 (264)
64 PRK10997 yieM hypothetical pro 46.2 2.1E+02 0.0047 30.7 11.0 76 238-323 381-460 (487)
65 cd05294 LDH-like_MDH_nadp A la 45.8 69 0.0015 31.4 6.9 88 220-325 71-181 (309)
66 PRK03620 5-dehydro-4-deoxygluc 45.1 3E+02 0.0066 26.8 12.7 60 196-267 30-95 (303)
67 TIGR02313 HpaI-NOT-DapA 2,4-di 44.9 2.6E+02 0.0056 27.3 10.6 95 213-321 39-168 (294)
68 cd08605 GDPD_GDE5_like_1_plant 44.2 42 0.00091 31.9 5.1 35 257-292 25-64 (282)
69 PF05762 VWA_CoxE: VWA domain 42.8 1.3E+02 0.0028 28.0 7.9 99 238-346 111-219 (222)
70 PLN02602 lactate dehydrogenase 40.6 1E+02 0.0022 31.2 7.4 87 221-325 105-214 (350)
71 PRK14093 UDP-N-acetylmuramoyla 40.4 69 0.0015 33.1 6.3 89 258-353 348-440 (479)
72 TIGR01763 MalateDH_bact malate 39.9 80 0.0017 31.0 6.3 87 221-325 69-178 (305)
73 PF02796 HTH_7: Helix-turn-hel 39.3 52 0.0011 23.4 3.7 44 230-306 1-44 (45)
74 PF00072 Response_reg: Respons 39.1 1.4E+02 0.0031 22.8 6.6 93 235-341 6-100 (112)
75 cd05126 Mth938 Mth938 domain. 39.0 21 0.00046 30.8 2.0 52 294-350 45-101 (117)
76 cd08562 GDPD_EcUgpQ_like Glyce 38.7 29 0.00062 31.4 2.9 35 257-292 13-52 (229)
77 cd08584 PI-PLCc_GDPD_SF_unchar 38.2 1.5E+02 0.0033 28.0 7.6 85 258-353 7-97 (192)
78 PF01075 Glyco_transf_9: Glyco 38.0 1E+02 0.0022 28.0 6.3 76 278-354 104-182 (247)
79 PF06144 DNA_pol3_delta: DNA p 37.7 2E+02 0.0042 24.6 7.8 77 278-356 56-139 (172)
80 PF00701 DHDPS: Dihydrodipicol 37.6 3.1E+02 0.0067 26.2 9.8 89 212-315 39-161 (289)
81 cd05560 Xcc1710_like Xcc1710_l 37.5 25 0.00054 29.8 2.1 47 294-349 39-93 (109)
82 smart00800 uDENN Domain always 37.1 17 0.00036 29.2 1.0 38 249-295 46-84 (89)
83 PRK04147 N-acetylneuraminate l 36.6 79 0.0017 30.5 5.7 90 212-316 42-165 (293)
84 TIGR01143 murF UDP-N-acetylmur 36.6 1.1E+02 0.0023 30.9 6.8 84 258-353 307-393 (417)
85 cd08563 GDPD_TtGDE_like Glycer 36.5 57 0.0012 29.9 4.5 37 256-293 14-55 (230)
86 cd06275 PBP1_PurR Ligand-bindi 36.4 1.1E+02 0.0024 27.2 6.3 36 223-272 79-114 (269)
87 cd08602 GDPD_ScGlpQ1_like Glyc 36.4 52 0.0011 32.4 4.5 35 257-292 15-54 (309)
88 cd01460 vWA_midasin VWA_Midasi 36.2 4.3E+02 0.0094 26.0 13.8 140 199-352 89-254 (266)
89 PF09625 VP9: VP9 protein; In 35.8 3.4 7.5E-05 34.1 -3.1 36 250-287 5-46 (79)
90 PF09827 CRISPR_Cas2: CRISPR a 35.3 1.1E+02 0.0024 23.7 5.3 54 234-287 11-69 (78)
91 cd00954 NAL N-Acetylneuraminic 34.7 4.2E+02 0.0092 25.5 11.4 106 212-333 39-179 (288)
92 cd08603 GDPD_SHV3_repeat_1 Gly 34.6 38 0.00082 33.8 3.2 36 257-292 15-56 (299)
93 cd03007 PDI_a_ERp29_N PDIa fam 34.4 41 0.00089 29.1 3.0 33 250-288 27-59 (116)
94 cd08582 GDPD_like_2 Glyceropho 34.2 38 0.00083 31.1 3.0 36 256-292 12-52 (233)
95 cd08555 PI-PLCc_GDPD_SF Cataly 33.7 60 0.0013 28.8 4.1 49 257-306 13-68 (179)
96 cd01459 vWA_copine_like VWA Co 33.0 3.6E+02 0.0078 26.3 9.5 119 196-320 68-193 (254)
97 cd00950 DHDPS Dihydrodipicolin 32.3 4.5E+02 0.0097 25.0 12.2 45 212-266 38-88 (284)
98 PF03586 Herpes_UL36: Herpesvi 32.3 30 0.00066 34.1 2.1 35 92-127 38-77 (253)
99 cd08575 GDPD_GDE4_like Glycero 32.1 73 0.0016 30.3 4.6 36 257-293 15-55 (264)
100 PRK10423 transcriptional repre 32.0 2.8E+02 0.0061 25.8 8.4 18 64-81 23-40 (327)
101 cd08579 GDPD_memb_like Glycero 31.9 43 0.00094 30.5 2.9 36 257-293 13-53 (220)
102 COG0584 UgpQ Glycerophosphoryl 30.6 55 0.0012 30.3 3.4 40 257-297 20-64 (257)
103 TIGR01856 hisJ_fam histidinol 30.5 31 0.00066 32.6 1.8 22 246-267 3-25 (253)
104 cd08606 GDPD_YPL110cp_fungi Gl 30.5 1.1E+02 0.0025 29.0 5.6 40 256-296 23-67 (286)
105 PRK04663 murD UDP-N-acetylmura 29.8 4.8E+02 0.01 26.5 10.2 114 191-351 283-399 (438)
106 PF11181 YflT: Heat induced st 29.7 59 0.0013 26.9 3.1 24 3-26 70-93 (103)
107 TIGR01651 CobT cobaltochelatas 29.3 2.2E+02 0.0047 31.5 8.0 139 199-355 420-594 (600)
108 PRK10773 murF UDP-N-acetylmura 29.0 1.6E+02 0.0035 30.2 6.7 86 258-355 336-423 (453)
109 COG2884 FtsE Predicted ATPase 29.0 21 0.00045 34.6 0.3 50 188-244 58-122 (223)
110 cd08571 GDPD_SHV3_plant Glycer 28.9 54 0.0012 32.2 3.2 35 257-292 15-54 (302)
111 cd08573 GDPD_GDE1 Glycerophosp 28.8 80 0.0017 30.0 4.3 36 257-293 13-53 (258)
112 cd01339 LDH-like_MDH L-lactate 28.8 2.1E+02 0.0047 27.5 7.2 86 221-324 66-174 (300)
113 smart00631 Zn_pept Zn_pept. 28.7 12 0.00025 36.0 -1.4 71 196-272 7-78 (277)
114 smart00481 POLIIIAc DNA polyme 28.7 54 0.0012 24.4 2.5 36 246-289 2-38 (67)
115 PF04447 DUF550: Protein of un 28.7 25 0.00054 30.3 0.7 61 250-312 10-76 (100)
116 COG3148 Uncharacterized conser 28.6 87 0.0019 30.6 4.4 45 298-344 92-138 (231)
117 cd05780 DNA_polB_Kod1_like_exo 28.5 20 0.00044 32.5 0.2 30 199-228 59-89 (195)
118 PRK11930 putative bifunctional 27.7 2.2E+02 0.0049 31.6 8.0 87 258-354 340-430 (822)
119 TIGR00683 nanA N-acetylneurami 27.6 1.4E+02 0.003 29.0 5.7 44 212-265 39-88 (290)
120 cd06277 PBP1_LacI_like_1 Ligan 27.5 4.1E+02 0.0089 23.8 8.4 87 254-344 98-187 (268)
121 PF10996 Beta-Casp: Beta-Casp 27.3 54 0.0012 26.9 2.5 37 279-320 80-116 (126)
122 cd01452 VWA_26S_proteasome_sub 27.0 5.2E+02 0.011 24.1 11.7 100 232-343 64-174 (187)
123 cd01536 PBP1_ABC_sugar_binding 27.0 4.3E+02 0.0093 23.1 10.2 24 331-354 178-201 (267)
124 cd08559 GDPD_periplasmic_GlpQ_ 26.7 86 0.0019 30.4 4.1 35 257-292 15-54 (296)
125 cd08566 GDPD_AtGDE_like Glycer 26.4 72 0.0016 29.9 3.5 36 257-293 15-55 (240)
126 cd08610 GDPD_GDE6 Glycerophosp 26.4 99 0.0021 30.9 4.5 36 257-293 37-77 (316)
127 TIGR02690 resist_ArsH arsenica 26.4 90 0.002 29.8 4.1 54 294-347 8-69 (219)
128 PRK03803 murD UDP-N-acetylmura 26.3 6.5E+02 0.014 25.5 10.5 113 192-347 288-404 (448)
129 PRK07328 histidinol-phosphatas 26.3 52 0.0011 31.3 2.5 18 245-262 5-22 (269)
130 cd06288 PBP1_sucrose_transcrip 26.2 3.2E+02 0.007 24.2 7.4 33 253-289 95-127 (269)
131 PF09606 Med15: ARC105 or Med1 25.9 23 0.00049 40.0 0.0 103 3-138 536-646 (799)
132 PF01674 Lipase_2: Lipase (cla 25.8 38 0.00082 32.2 1.5 23 198-221 64-86 (219)
133 cd08318 Death_NMPP84 Death dom 25.8 1.6E+02 0.0034 23.8 4.9 54 7-80 22-75 (86)
134 KOG2125|consensus 25.6 43 0.00093 37.5 2.0 65 212-286 202-276 (760)
135 cd06304 PBP1_BmpA_like Peripla 25.5 4.6E+02 0.01 23.7 8.5 113 222-353 79-198 (260)
136 PF04155 Ground-like: Ground-l 24.8 1.4E+02 0.0031 23.4 4.3 32 258-291 22-53 (76)
137 cd08612 GDPD_GDE4 Glycerophosp 24.1 1.2E+02 0.0026 29.5 4.6 55 207-293 22-81 (300)
138 cd01537 PBP1_Repressors_Sugar_ 24.0 3.7E+02 0.008 23.2 7.2 15 333-347 205-219 (264)
139 COG1378 Predicted transcriptio 23.9 68 0.0015 30.9 2.8 23 6-28 5-27 (247)
140 PF08862 DUF1829: Domain of un 23.8 1.3E+02 0.0028 24.7 4.0 31 259-289 34-64 (88)
141 cd07079 ALDH_F18-19_ProA-GPR G 23.6 7.9E+02 0.017 25.0 10.5 74 276-359 165-246 (406)
142 cd03860 M14_CP_A-B_like The Pe 23.5 18 0.00039 35.1 -1.2 38 196-233 7-45 (294)
143 cd08583 PI-PLCc_GDPD_SF_unchar 23.2 89 0.0019 28.9 3.4 37 256-293 14-55 (237)
144 PF02811 PHP: PHP domain; Int 22.8 56 0.0012 27.6 1.8 35 246-288 3-38 (175)
145 cd08319 Death_RAIDD Death doma 22.8 2.1E+02 0.0045 23.5 5.1 54 7-80 17-71 (83)
146 cd01840 SGNH_hydrolase_yrhL_li 22.8 3.2E+02 0.007 23.2 6.5 79 254-343 4-87 (150)
147 cd07077 ALDH-like NAD(P)+-depe 22.7 7.9E+02 0.017 24.7 10.7 59 298-359 170-237 (397)
148 PF13660 DUF4147: Domain of un 22.4 79 0.0017 30.7 2.9 81 199-302 55-145 (238)
149 PRK14573 bifunctional D-alanyl 22.3 6E+02 0.013 28.3 10.0 150 193-351 287-445 (809)
150 cd07127 ALDH_PAD-PaaZ Phenylac 22.1 5.8E+02 0.013 27.6 9.5 72 276-359 251-333 (549)
151 PF02875 Mur_ligase_C: Mur lig 21.9 2.1E+02 0.0045 22.4 4.8 59 256-320 20-80 (91)
152 PF09936 Methyltrn_RNA_4: SAM- 21.7 5.3E+02 0.011 24.6 8.1 84 236-326 51-147 (185)
153 PRK03806 murD UDP-N-acetylmura 21.7 4.1E+02 0.0088 26.8 7.9 78 258-354 322-399 (438)
154 cd08613 GDPD_GDE4_like_1 Glyce 21.6 1.8E+02 0.004 29.3 5.4 36 256-292 59-99 (309)
155 cd06534 ALDH-SF NAD(P)+-depend 21.2 7.8E+02 0.017 24.0 10.3 58 298-358 160-228 (367)
156 KOG4384|consensus 20.8 48 0.001 34.3 1.1 46 175-220 218-288 (361)
157 cd03859 M14_CPT Peptidase M14- 20.6 70 0.0015 31.2 2.2 44 191-234 4-49 (295)
158 cd05160 DEDDy_DNA_polB_exo DED 20.5 36 0.00079 30.3 0.2 30 199-228 66-96 (199)
159 PF12983 DUF3867: Protein of u 20.2 48 0.001 31.4 0.9 17 290-306 68-84 (186)
160 PRK14338 (dimethylallyl)adenos 20.1 8.3E+02 0.018 25.4 9.9 61 241-306 160-230 (459)
161 PRK00066 ldh L-lactate dehydro 20.1 2.8E+02 0.006 27.4 6.3 87 221-325 73-182 (315)
162 PRK04405 prsA peptidylprolyl i 20.0 1E+02 0.0022 30.3 3.3 44 294-338 128-171 (298)
No 1
>cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=100.00 E-value=3.5e-48 Score=353.73 Aligned_cols=190 Identities=55% Similarity=0.971 Sum_probs=184.6
Q ss_pred hhhhhhHHHhhhhhccccCCchhhHHHHHHHHHHHHHhhhhhccceeeeeccCCCCccccccccccCCChhhHHHHHHHh
Q psy11447 104 KHLQEGVLGQHENTRKFNGYDGRLDRELEAVIMVLEAFQDFDQRIRLDIIGHSGETHSIPFVTSIKNLPRDNKQRLEVIK 183 (360)
Q Consensus 104 KhL~~~v~dv~~nv~rFd~yD~rl~R~lE~v~~Vme~~~~~e~kl~~di~~HSG~~~siP~v~kvk~~are~k~RL~ei~ 183 (360)
||| -.+.|+|+|||+|++||+++.+.||++.+
T Consensus 1 ~~l-~lavDlSgSM~~~~~~dg~~~~RL~a~k~----------------------------------------------- 32 (191)
T cd01455 1 KRL-KLVVDVSGSMYRFNGYDGRLDRSLEAVVM----------------------------------------------- 32 (191)
T ss_pred Cce-EEEEECcHhHHHHhccCCccccHHHHHHH-----------------------------------------------
Confidence 566 77899999999999999999999999999
Q ss_pred hccccccccCCchhhHHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHH
Q psy11447 184 VHHNNLRNTGSDLGTSLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESI 263 (360)
Q Consensus 184 m~~~~lrfn~~DL~rvMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~ 263 (360)
|||+|.+|.+|++||++||+|+++.+|||+.+.||++|++||++++.|++|+||||+||+|.+|+
T Consensus 33 ---------------v~~~f~~f~~~r~~DriG~~g~~~~~~~lt~d~p~t~d~~~~~~l~~~l~~~q~g~ag~~TadAi 97 (191)
T cd01455 33 ---------------VMEAFDGFEDKIQYDIIGHSGDGPCVPFVKTNHPPKNNKERLETLKMMHAHSQFCWSGDHTVEAT 97 (191)
T ss_pred ---------------HHHHHHHHHHhCccceeeecCcccccCccccccCcccchhHHHHHHHHHHhcccCccCccHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHH-hhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecC
Q psy11447 264 RASIQDLT-TECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMD 342 (360)
Q Consensus 264 ~~ai~~~a-~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~d 342 (360)
..||+.+. +. ++++++||+|||+|..+|+|+|.+.++.|.++ ++|+.|.|+||+.++++.++..++|.|+.|+|.|
T Consensus 98 ~~av~rl~~~~--~a~~kvvILLTDG~n~~~~i~P~~aAa~lA~~-~gV~iytIgiG~~d~~~l~~iA~~tgG~~F~A~d 174 (191)
T cd01455 98 EFAIKELAAKE--DFDEAIVIVLSDANLERYGIQPKKLADALARE-PNVNAFVIFIGSLSDEADQLQRELPAGKAFVCMD 174 (191)
T ss_pred HHHHHHHHhcC--cCCCcEEEEEeCCCcCCCCCChHHHHHHHHHh-CCCEEEEEEecCCCHHHHHHHHhCCCCcEEEeCC
Confidence 99999997 77 99999999999999999999999999999997 9999999999999999999999999999999999
Q ss_pred CCchHHHHHHHHHhccC
Q psy11447 343 LTEIPQILQQIFSASLL 359 (360)
Q Consensus 343 t~~lP~il~qIf~s~~l 359 (360)
+++||+|+++||+|++|
T Consensus 175 ~~~L~~iy~~I~~~~~~ 191 (191)
T cd01455 175 TSELPHIMQQIFTSTLL 191 (191)
T ss_pred HHHHHHHHHHHHHHhcC
Confidence 99999999999999986
No 2
>KOG1808|consensus
Probab=99.90 E-value=8.5e-26 Score=254.48 Aligned_cols=235 Identities=24% Similarity=0.209 Sum_probs=210.8
Q ss_pred CchHHHHHHhHHcCCCHHhHHHHHHhhHhhhhHHHHHhhhhhcccccCCCC-chhHHH-HHHhcCCCCChhHHHHHHHhh
Q psy11447 1 MGLKAFKQKLKEIQMSEYDAKVYSEYSLPVQNQQNDQHRDLFSVHVIDNPS-PESELA-LLSSYGPDVPRDTLIRLVRAF 78 (360)
Q Consensus 1 m~~ka~~~rLkEi~Ms~~da~~Y~~~~~~vgndf~~~lgd~fs~h~v~np~-~esE~~-~L~~ygp~v~~~~l~~Lv~af 78 (360)
+++.++.++|++++|++|++..|...+..+..+... + ..++++.. ...++. |.++ ++.
T Consensus 1606 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-----~~~~~~~e~q~~~~~d~~~~--------------~~~ 1665 (1856)
T KOG1808|consen 1606 PARAAKLQGLREGQRLNYREEIPEIASQFRKDIIWL-R-----RTKLDKREYQIKIAEDDSSS--------------MAE 1665 (1856)
T ss_pred HHHHHHhcccchhhhhhhhhhhhHHhhhhhhhHHHH-H-----HHHhhhhhhhheeehhhhhh--------------hhh
Confidence 367889999999999999999999999999999987 4 45566666 666777 8999 899
Q ss_pred hhhhhhh-ccCCC---C--CCccchhhh-h-------hhhhhhhhHHHhhhhhccccCCchhhHHHHHHHHHHHHHhhhh
Q psy11447 79 GEFRELS-DKSLL---P--YPYSTREAV-H-------IVKHLQEGVLGQHENTRKFNGYDGRLDRELEAVIMVLEAFQDF 144 (360)
Q Consensus 79 geLr~~~-deGli---~--ypYs~Re~v-~-------ivKhL~~~v~dv~~nv~rFd~yD~rl~R~lE~v~~Vme~~~~~ 144 (360)
++++.++ ..+.+ + -.|..|..+ + +.+.+ +-+.++++|||.|++++.++.++.|+.++
T Consensus 1666 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~f~~~~~~~~~~l-~~~~d~s~s~~~~~~~~~~~~~t~e~~~~-------- 1736 (1856)
T KOG1808|consen 1666 NKIKQNDIESLTVQGVALKILYLGRIAVCSFGGSVALLHPFL-EEVSDYSGSMLKFLSKDESLQKTDEANLL-------- 1736 (1856)
T ss_pred cccchhheeeehhhhhHHHHHhccccccCCcccchhhccccc-ceeeeechhHHHHHhhhhhhhhhHHHHHH--------
Confidence 9999888 66665 2 445556533 2 55566 88999999999999999999999999999
Q ss_pred hccceeeeeccCCCCccccccccccCCChhhHHHHHHHhhccccccccCCchhhHHHHhhccccceeEeeeeccCCCCcc
Q psy11447 145 DQRIRLDIIGHSGETHSIPFVTSIKNLPRDNKQRLEVIKVHHNNLRNTGSDLGTSLEAFQDFDQRISLDIIGHSGETHSI 224 (360)
Q Consensus 145 e~kl~~di~~HSG~~~siP~v~kvk~~are~k~RL~ei~m~~~~lrfn~~DL~rvMEaf~g~e~k~~ydivGHSGd~~~i 224 (360)
.|+++.+...++.||++||||+++++
T Consensus 1737 ------------------------------------------------------~~~~~l~~~~~~~~~~~~~~~~~~~~ 1762 (1856)
T KOG1808|consen 1737 ------------------------------------------------------AASQQLSKNIKFQYDQVLHSVSDGRG 1762 (1856)
T ss_pred ------------------------------------------------------HHHHhhhhhhhhhHhhhccccccCcc
Confidence 89999999999999999999999999
Q ss_pred ccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHH
Q psy11447 225 PFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEAL 304 (360)
Q Consensus 225 ~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l 304 (360)
-|++-..-|+++++ .+++|+||+|||++||+|.+.+.++|+.+... +.++++|+++||+||.||+|-|+.++..+
T Consensus 1763 ~~~~~~~~~~~~~~---~~~~~~~~~~f~~~~~~s~~~~~~~ik~~~~~--~~~~~v~~~~~~~~~~~~~~~~k~~a~~~ 1837 (1856)
T KOG1808|consen 1763 LLLPGIELVLAAKQ---LAKNANAHSTFCVLGNLSYESSIHDIKVLVFK--GNIEYVVTVLSDAILSRYYILPKSLALPL 1837 (1856)
T ss_pred cccchhhhhhhHHH---HhhhcccCceEEEecCCcccccHHHHHHHhhc--cCCceEEEehhhccchhhhhhhhhhhhhh
Confidence 99999999999999 99999999999999999999999999999976 77899999999999999999999999999
Q ss_pred hcCcccccEEEEEecCchHH
Q psy11447 305 NAEAASVRAYAIFIGSLGDQ 324 (360)
Q Consensus 305 ~~~~p~V~a~~IfIgslgdq 324 (360)
+.. |+++.+.+||+|++++
T Consensus 1838 ti~-~~i~~~~~~i~s~s~~ 1856 (1856)
T KOG1808|consen 1838 TIS-ERIRQWFEFITSLSDP 1856 (1856)
T ss_pred hhh-HHHHHHHHHhhccCCC
Confidence 997 9999999999998864
No 3
>PF13519 VWA_2: von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=98.54 E-value=1.5e-06 Score=72.39 Aligned_cols=128 Identities=18% Similarity=0.349 Sum_probs=80.1
Q ss_pred eEeeeeccCCCCccccccCCCCCCCchHHH-HHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC
Q psy11447 211 SLDIIGHSGETHSIPFVTSNNLPRDNKQRL-EVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN 289 (360)
Q Consensus 211 ~ydivGHSGd~~~i~lV~~~~~Pk~~keRl-~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN 289 (360)
++.+++.++.+... . |+.++++.+ +.++++.... .+-.|....+|+..|.+.++.. +....++|++||..
T Consensus 40 ~v~l~~f~~~~~~~--~----~~t~~~~~~~~~l~~~~~~~-~~~~~t~~~~al~~a~~~~~~~--~~~~~~iv~iTDG~ 110 (172)
T PF13519_consen 40 RVGLVSFSDSSRTL--S----PLTSDKDELKNALNKLSPQG-MPGGGTNLYDALQEAAKMLASS--DNRRRAIVLITDGE 110 (172)
T ss_dssp EEEEEEESTSCEEE--E----EEESSHHHHHHHHHTHHHHG---SSS--HHHHHHHHHHHHHC---SSEEEEEEEEES-T
T ss_pred EEEEEEeccccccc--c----cccccHHHHHHHhhcccccc-cCccCCcHHHHHHHHHHHHHhC--CCCceEEEEecCCC
Confidence 77777777754211 1 233444333 3444443221 2345566678888999888765 56789999999974
Q ss_pred ccccCCChhHHHHHHhcCcccccEEEEEecCchHH---HHHHHhhCCCCceEEe-cCCCchHHHHHHH
Q psy11447 290 LDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQ---AKRLTQELPAGRGFVC-MDLTEIPQILQQI 353 (360)
Q Consensus 290 L~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdq---A~~l~~~Lp~Gr~fv~-~dt~~lP~il~qI 353 (360)
-. +-.+ +..+.+. + ++|+.|+|.||+-++. .++|.+.. .|+.|.+ .+..+|+++|+||
T Consensus 111 ~~--~~~~-~~~~~~~-~-~~i~i~~v~~~~~~~~~~~l~~la~~t-gG~~~~~~~~~~~l~~~~~~I 172 (172)
T PF13519_consen 111 DN--SSDI-EAAKALK-Q-QGITIYTVGIGSDSDANEFLQRLAEAT-GGRYFHVDNDPEDLDDAFQQI 172 (172)
T ss_dssp TH--CHHH-HHHHHHH-C-TTEEEEEEEES-TT-EHHHHHHHHHHT-EEEEEEE-SSSHHHHHHHHH-
T ss_pred CC--cchh-HHHHHHH-H-cCCeEEEEEECCCccHHHHHHHHHHhc-CCEEEEecCCHHHHHHHHhcC
Confidence 44 1112 2444454 4 8899999999998874 56676666 5999998 7999999999998
No 4
>KOG1808|consensus
Probab=98.47 E-value=1.3e-07 Score=109.21 Aligned_cols=117 Identities=28% Similarity=0.397 Sum_probs=108.9
Q ss_pred HHHHHhhHhhhhHHHHHhhhhhccc----ccCCCCchhHHHHHHhcCCCCChhHHHHHHHhhhhhhhhhccCCCCCCccc
Q psy11447 21 KVYSEYSLPVQNQQNDQHRDLFSVH----VIDNPSPESELALLSSYGPDVPRDTLIRLVRAFGEFRELSDKSLLPYPYST 96 (360)
Q Consensus 21 ~~Y~~~~~~vgndf~~~lgd~fs~h----~v~np~~esE~~~L~~ygp~v~~~~l~~Lv~afgeLr~~~deGli~ypYs~ 96 (360)
.-|....+.++|.+++.+||+|++| .|.++...+|..++++++|+++.+.+.+++..+++|+...+++...|||++
T Consensus 919 rd~~~~v~fl~~~~~~~~~~~~al~~~l~~vf~d~l~~~~~~~k~~~~~~~~~~l~~~~~~~~~l~~~~~~~~~~~~~~~ 998 (1856)
T KOG1808|consen 919 RDYETWVEFLNNMFQAILGDLFALHLGLEMVFLDALQGELTMLKQFNENVPVKSLTELVAELGELRGRTKQLERQYPYST 998 (1856)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcchhhHHHhhccccchHHHHHHHHHHhhhhcccchhhhccccch
Confidence 3455566778999999999999999 999999999999999999999999999999999999999999999999999
Q ss_pred hhhhhhhhhhh----hhHHHhhhhhccccCCchhhHHHHHHHHHHHHHh
Q psy11447 97 REAVHIVKHLQ----EGVLGQHENTRKFNGYDGRLDRELEAVIMVLEAF 141 (360)
Q Consensus 97 Re~v~ivKhL~----~~v~dv~~nv~rFd~yD~rl~R~lE~v~~Vme~~ 141 (360)
|+.++.++|+. ...-++..| ++||.|.+.+ .+....+|.+|
T Consensus 999 ~~~v~~~~~~g~f~~~~~~~~~ss-~~f~~~a~t~---~~nl~~~l~~~ 1043 (1856)
T KOG1808|consen 999 EEAVNDVFHVGHFPIKTLPLVASS-FDFDSYAPTT---SENLRRVLRAL 1043 (1856)
T ss_pred hhheeeeeeeccCCCCCCCccccc-ccccccChHH---HHHHHHHHHhc
Confidence 99999999998 667788888 9999999999 99999999888
No 5
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=97.58 E-value=0.0025 Score=53.25 Aligned_cols=140 Identities=13% Similarity=0.219 Sum_probs=86.7
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCC
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHD 278 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daD 278 (360)
++..+.....+..+-|++.++... +.+... .+++-.+-.+.++.+..+ +-.+.++..|++.+++.+.+.. ...
T Consensus 29 ~~~~~~~~~~~~~i~ii~f~~~~~--~~~~~~-~~~~~~~~~~~i~~~~~~---~~~~~~~~~al~~~~~~~~~~~-~~~ 101 (177)
T smart00327 29 LVEQLDIGPDGDRVGLVTFSDDAT--VLFPLN-DSRSKDALLEALASLSYK---LGGGTNLGAALQYALENLFSKS-AGS 101 (177)
T ss_pred HHHhcCCCCCCcEEEEEEeCCCce--EEEccc-ccCCHHHHHHHHHhcCCC---CCCCchHHHHHHHHHHHhcCcC-CCC
Confidence 455555445688899999998432 222221 122333333334433322 5566789999999999985210 111
Q ss_pred ----CeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC--chHHHHHHHhhCCCCceEEecCCCchHHHHHH
Q psy11447 279 ----DSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS--LGDQAKRLTQELPAGRGFVCMDLTEIPQILQQ 352 (360)
Q Consensus 279 ----e~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs--lgdqA~~l~~~Lp~Gr~fv~~dt~~lP~il~q 352 (360)
...+|++||.+-..+ -.+....+.+.+ .+|..++|.+|. .-+....|...-+....| .+.+++.
T Consensus 102 ~~~~~~~iviitDg~~~~~-~~~~~~~~~~~~--~~i~i~~i~~~~~~~~~~l~~~~~~~~~~~~~-------~~~~~~~ 171 (177)
T smart00327 102 RRGAPKVLILITDGESNDG-GDLLKAAKELKR--SGVKVFVVGVGNDVDEEELKKLASAPGGVYVF-------LPELFDL 171 (177)
T ss_pred CCCCCeEEEEEcCCCCCCC-ccHHHHHHHHHH--CCCEEEEEEccCccCHHHHHHHhCCCcceEEe-------cHHHhhh
Confidence 458999999987766 456666666666 359999999998 567777888877733444 4455555
Q ss_pred HHH
Q psy11447 353 IFS 355 (360)
Q Consensus 353 If~ 355 (360)
+..
T Consensus 172 ~~~ 174 (177)
T smart00327 172 LID 174 (177)
T ss_pred hhh
Confidence 544
No 6
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=97.40 E-value=0.0023 Score=57.55 Aligned_cols=113 Identities=16% Similarity=0.193 Sum_probs=72.9
Q ss_pred CCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC-ccccCCChhHHHHHHhcCccc
Q psy11447 232 LPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN-LDRYGIAPREMGEALNAEAAS 310 (360)
Q Consensus 232 ~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN-L~RYgI~p~~l~~~l~~~~p~ 310 (360)
|++.|.+ .++..+... .-+-=|-+.-.|++.|.+.+......+--.+||++||.. .+. -.+.++++.+.+ .+
T Consensus 64 PlT~D~~--~~~~~L~~~-~~~~G~t~l~~aL~~A~~~l~~~~~~~~~~iiil~sd~~~~~~--~~~~~~~~~l~~--~~ 136 (183)
T cd01453 64 DLTGNPR--KHIQALKTA-RECSGEPSLQNGLEMALESLKHMPSHGSREVLIIFSSLSTCDP--GNIYETIDKLKK--EN 136 (183)
T ss_pred CCCCCHH--HHHHHhhcc-cCCCCchhHHHHHHHHHHHHhcCCccCceEEEEEEcCCCcCCh--hhHHHHHHHHHH--cC
Confidence 6677776 445555433 222223677889999999986431122345888888832 111 123456666765 47
Q ss_pred ccEEEEEecCchHHHHHHHhhCCCCceEEecCCCchHHHHHH
Q psy11447 311 VRAYAIFIGSLGDQAKRLTQELPAGRGFVCMDLTEIPQILQQ 352 (360)
Q Consensus 311 V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~dt~~lP~il~q 352 (360)
|+-++|=||+--+.=++|++.= .|+.|.+.|..+||+++++
T Consensus 137 I~v~~IgiG~~~~~L~~ia~~t-gG~~~~~~~~~~l~~~~~~ 177 (183)
T cd01453 137 IRVSVIGLSAEMHICKEICKAT-NGTYKVILDETHLKELLLE 177 (183)
T ss_pred cEEEEEEechHHHHHHHHHHHh-CCeeEeeCCHHHHHHHHHh
Confidence 8888888886223345666665 5999999999999999887
No 7
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=97.34 E-value=0.0073 Score=48.62 Aligned_cols=114 Identities=14% Similarity=0.191 Sum_probs=75.6
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCC
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHD 278 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daD 278 (360)
++..+.....+..+-+++.|+... .++.... +.+..+-.+.+..+.. -+..|.++..|...+++.+........
T Consensus 28 ~~~~~~~~~~~~~i~v~~f~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~---~~~~~t~~~~al~~~~~~~~~~~~~~~ 101 (161)
T cd00198 28 LVSSLSASPPGDRVGLVTFGSNAR--VVLPLTT-DTDKADLLEAIDALKK---GLGGGTNIGAALRLALELLKSAKRPNA 101 (161)
T ss_pred HHHhcccCCCCcEEEEEEecCccc--eeecccc-cCCHHHHHHHHHhccc---CCCCCccHHHHHHHHHHHhcccCCCCC
Confidence 455555445678888999987321 1222222 2344555555665543 356778899999999999864211456
Q ss_pred CeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC
Q psy11447 279 DSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS 320 (360)
Q Consensus 279 e~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs 320 (360)
...+|++||..-.-.+-.+.+..+.+.. .+|+-++|.+|+
T Consensus 102 ~~~lvvitDg~~~~~~~~~~~~~~~~~~--~~v~v~~v~~g~ 141 (161)
T cd00198 102 RRVIILLTDGEPNDGPELLAEAARELRK--LGITVYTIGIGD 141 (161)
T ss_pred ceEEEEEeCCCCCCCcchhHHHHHHHHH--cCCEEEEEEcCC
Confidence 7999999999865554466666677765 478889999986
No 8
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=97.01 E-value=0.02 Score=51.08 Aligned_cols=144 Identities=13% Similarity=0.292 Sum_probs=86.6
Q ss_pred HHHHhhccccceeEeeeeccCCCCc-cccccCCCCCCCchHHHHHHHHHHHhhcccc----CCcchHHHHHHHHHHHHhh
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHS-IPFVTSNNLPRDNKQRLEVIKIMHAHAQFCA----SGDYTLESIRASIQDLTTE 273 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~-i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~----sGD~Tlea~~~ai~~~a~~ 273 (360)
+.+.+..+.....+-||.-|++... +|+ ..+..++++ +++..+ ...+|+. .|-+|-.|++.+.+.+...
T Consensus 28 l~~~l~~~~~~~~v~li~Fs~~~~~~~~~---~~~~~~~~~--~~~~~l-~~~~~~~~~~~ggT~~~~Al~~~~~~l~~~ 101 (198)
T cd01470 28 LIEKISSYEVSPRYEIISYASDPKEIVSI---RDFNSNDAD--DVIKRL-EDFNYDDHGDKTGTNTAAALKKVYERMALE 101 (198)
T ss_pred HHHHccccCCCceEEEEEecCCceEEEec---ccCCCCCHH--HHHHHH-HhCCcccccCccchhHHHHHHHHHHHHHHH
Confidence 6666665655677888988887543 344 333344444 344433 3445543 5678888888888776421
Q ss_pred cc---c---CCCeEEEEEecCCccccCCChhHHHHHHhc-----------CcccccEEEEEecCchH--HHHHHHhhCCC
Q psy11447 274 CA---Q---HDDSILIVLSDANLDRYGIAPREMGEALNA-----------EAASVRAYAIFIGSLGD--QAKRLTQELPA 334 (360)
Q Consensus 274 ~~---d---aDe~fVivlSDANL~RYgI~p~~l~~~l~~-----------~~p~V~a~~IfIgslgd--qA~~l~~~Lp~ 334 (360)
.. . .-...||+|||-.=. .|-+|....+.+.. ....|.-|+|=||+--| +=+.|+. -|.
T Consensus 102 ~~~~~~~~~~~~~~iillTDG~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~v~i~~iGvG~~~~~~~L~~iA~-~~~ 179 (198)
T cd01470 102 KVRNKEAFNETRHVIILFTDGKSN-MGGSPLPTVDKIKNLVYKNNKSDNPREDYLDVYVFGVGDDVNKEELNDLAS-KKD 179 (198)
T ss_pred HhcCccchhhcceEEEEEcCCCcC-CCCChhHHHHHHHHHHhcccccccchhcceeEEEEecCcccCHHHHHHHhc-CCC
Confidence 00 0 113679999996322 24466655444421 11368899998987323 3344544 455
Q ss_pred C--ceEEecCCCchHHHH
Q psy11447 335 G--RGFVCMDLTEIPQIL 350 (360)
Q Consensus 335 G--r~fv~~dt~~lP~il 350 (360)
| +.|.+.|.++|+.|+
T Consensus 180 g~~~~f~~~~~~~l~~v~ 197 (198)
T cd01470 180 NERHFFKLKDYEDLQEVF 197 (198)
T ss_pred CCceEEEeCCHHHHHHhc
Confidence 7 689999998888765
No 9
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=96.88 E-value=0.021 Score=54.12 Aligned_cols=97 Identities=24% Similarity=0.316 Sum_probs=64.3
Q ss_pred cCCcchHHHHHHHHHHHHhhcccC---CCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCc---------
Q psy11447 254 ASGDYTLESIRASIQDLTTECAQH---DDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSL--------- 321 (360)
Q Consensus 254 ~sGD~Tlea~~~ai~~~a~~~~da---De~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgsl--------- 321 (360)
..|-++-.|+..|..++.... .+ ....+|+|||-.=.....+.+++.+.+++ .+|..|.|-+|.+
T Consensus 138 ~g~T~l~~al~~aa~~~~~~~-~~~~p~rk~iIllTDG~~~~~~~~~~~~~~~~~~--~~v~vy~I~~~~~~~~~~~~~~ 214 (296)
T TIGR03436 138 GGGTALYDAITLAALEQLANA-LAGIPGRKALIVISDGGDNRSRDTLERAIDAAQR--ADVAIYSIDARGLRAPDLGAGA 214 (296)
T ss_pred CCcchhHHHHHHHHHHHHHHh-hcCCCCCeEEEEEecCCCcchHHHHHHHHHHHHH--cCCEEEEeccCccccCCccccc
Confidence 345556677666665554321 22 36889999997433322344555555654 6899999988743
Q ss_pred -----h-HHHHHHHhhCCCCceEEecCCCchHHHHHHHHH
Q psy11447 322 -----G-DQAKRLTQELPAGRGFVCMDLTEIPQILQQIFS 355 (360)
Q Consensus 322 -----g-dqA~~l~~~Lp~Gr~fv~~dt~~lP~il~qIf~ 355 (360)
+ +.-+.|++. ..|+.|.. |.++|+.++++|-.
T Consensus 215 ~~~~~~~~~L~~iA~~-TGG~~~~~-~~~~l~~~f~~i~~ 252 (296)
T TIGR03436 215 KAGLGGPEALERLAEE-TGGRAFYV-NSNDLDGAFAQIAE 252 (296)
T ss_pred ccCCCcHHHHHHHHHH-hCCeEecc-cCccHHHHHHHHHH
Confidence 1 334555554 47999999 99999999999875
No 10
>PRK13685 hypothetical protein; Provisional
Probab=96.75 E-value=0.045 Score=53.44 Aligned_cols=167 Identities=17% Similarity=0.199 Sum_probs=103.3
Q ss_pred hhHHHhhhhhccccCCchhhHHHHHHHHHHHHHhhhhhccceeeeeccCCCCccccccccccCCChhhHHHHHHHhhccc
Q psy11447 108 EGVLGQHENTRKFNGYDGRLDRELEAVIMVLEAFQDFDQRIRLDIIGHSGETHSIPFVTSIKNLPRDNKQRLEVIKVHHN 187 (360)
Q Consensus 108 ~~v~dv~~nv~rFd~yD~rl~R~lE~v~~Vme~~~~~e~kl~~di~~HSG~~~siP~v~kvk~~are~k~RL~ei~m~~~ 187 (360)
=.|.|+|+||-.=|....|+++..+++..
T Consensus 92 vlvlD~S~SM~~~D~~p~RL~~ak~~~~~--------------------------------------------------- 120 (326)
T PRK13685 92 MLVIDVSQSMRATDVEPNRLAAAQEAAKQ--------------------------------------------------- 120 (326)
T ss_pred EEEEECCccccCCCCCCCHHHHHHHHHHH---------------------------------------------------
Confidence 57899999998777555677766666665
Q ss_pred cccccCCchhhHHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcch--HHHHHH
Q psy11447 188 NLRNTGSDLGTSLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYT--LESIRA 265 (360)
Q Consensus 188 ~lrfn~~DL~rvMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~T--lea~~~ 265 (360)
+.+.+.. .+| .-+|.=+|+...+ -||++|++.++ .....-+ .|..| =.|+..
T Consensus 121 -----------~l~~l~~-~d~--vglv~Fa~~a~~~------~p~t~d~~~l~---~~l~~l~---~~~~T~~g~al~~ 174 (326)
T PRK13685 121 -----------FADELTP-GIN--LGLIAFAGTATVL------VSPTTNREATK---NAIDKLQ---LADRTATGEAIFT 174 (326)
T ss_pred -----------HHHhCCC-CCe--EEEEEEcCceeec------CCCCCCHHHHH---HHHHhCC---CCCCcchHHHHHH
Confidence 4444321 233 3456666664421 26777766442 2232222 23444 356777
Q ss_pred HHHHHHhh---c---ccCCCeEEEEEecCCccccCCCh------hHHHHHHhcCcccccEEEEEecCch-----------
Q psy11447 266 SIQDLTTE---C---AQHDDSILIVLSDANLDRYGIAP------REMGEALNAEAASVRAYAIFIGSLG----------- 322 (360)
Q Consensus 266 ai~~~a~~---~---~daDe~fVivlSDANL~RYgI~p------~~l~~~l~~~~p~V~a~~IfIgslg----------- 322 (360)
|++.+.+. . .......||++||-+= ..|.+| ...++.+. + ..|+-|+|-||+-+
T Consensus 175 A~~~l~~~~~~~~~~~~~~~~~IILlTDG~~-~~~~~~~~~~~~~~aa~~a~-~-~gi~i~~Ig~G~~~g~~~~~g~~~~ 251 (326)
T PRK13685 175 ALQAIATVGAVIGGGDTPPPARIVLMSDGKE-TVPTNPDNPRGAYTAARTAK-D-QGVPISTISFGTPYGSVEINGQRQP 251 (326)
T ss_pred HHHHHHhhhcccccccCCCCCEEEEEcCCCC-CCCCCCCCcccHHHHHHHHH-H-cCCeEEEEEECCCCCCcCcCCceee
Confidence 77766421 0 0223467999999753 334322 23344343 3 68999999999832
Q ss_pred -----HHHHHHHhhCCCCceEEecCCCchHHHHHHHHH
Q psy11447 323 -----DQAKRLTQELPAGRGFVCMDLTEIPQILQQIFS 355 (360)
Q Consensus 323 -----dqA~~l~~~Lp~Gr~fv~~dt~~lP~il~qIf~ 355 (360)
+.-++|.+.- .|+.|..-|.++|+.++++|-.
T Consensus 252 ~~~d~~~L~~iA~~t-gG~~~~~~~~~~L~~if~~I~~ 288 (326)
T PRK13685 252 VPVDDESLKKIAQLS-GGEFYTAASLEELRAVYATLQQ 288 (326)
T ss_pred ecCCHHHHHHHHHhc-CCEEEEcCCHHHHHHHHHHHHH
Confidence 3456666655 5999999999999999999854
No 11
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=96.34 E-value=0.056 Score=47.17 Aligned_cols=113 Identities=16% Similarity=0.267 Sum_probs=64.3
Q ss_pred HHHHhhccccceeEeeeeccCCC---CccccccCCCCCCCchHHHHHHHHHHHhhccccCC-cchHHHHHHHHHHHHhhc
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGET---HSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASG-DYTLESIRASIQDLTTEC 274 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~---~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sG-D~Tlea~~~ai~~~a~~~ 274 (360)
+++++.. ......|++.+++. ...+++........... ++.+.+. +..-.| =++-+|++.|.+.+.+.
T Consensus 29 l~~~l~~--~~d~~~l~~F~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~---~~~~~g~T~~~~al~~a~~~l~~~- 100 (174)
T cd01454 29 LAEALEA--CGVPHAILGFTTDAGGRERVRWIKIKDFDESLHE--RARKRLA---ALSPGGNTRDGAAIRHAAERLLAR- 100 (174)
T ss_pred HHHHHHH--cCCcEEEEEecCCCCCccceEEEEecCcccccch--hHHHHHH---ccCCCCCCcHHHHHHHHHHHHhcC-
Confidence 6666665 35667889988873 22334443221111111 2222221 111123 35588999999999765
Q ss_pred ccCCCeEEEEEecCCc--ccc---CC-ChhHH---HHHHhcCcccccEEEEEecCch
Q psy11447 275 AQHDDSILIVLSDANL--DRY---GI-APREM---GEALNAEAASVRAYAIFIGSLG 322 (360)
Q Consensus 275 ~daDe~fVivlSDANL--~RY---gI-~p~~l---~~~l~~~~p~V~a~~IfIgslg 322 (360)
.....++|+|||-.= ..+ ++ .+++. .+.+. ...|..|+|.||+-.
T Consensus 101 -~~~~~~iiliTDG~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gi~v~~igig~~~ 154 (174)
T cd01454 101 -PEKRKILLVISDGEPNDLDYYEGNVFATEDALRAVIEAR--KLGIEVFGITIDRDA 154 (174)
T ss_pred -CCcCcEEEEEeCCCcCcccccCcchhHHHHHHHHHHHHH--hCCcEEEEEEecCcc
Confidence 667899999999742 221 23 33333 22232 278999999999954
No 12
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=96.25 E-value=0.071 Score=45.48 Aligned_cols=114 Identities=16% Similarity=0.201 Sum_probs=68.1
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcc--c
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECA--Q 276 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~--d 276 (360)
+++.|.--..+..+-+++-|++......+.. +..+.+-.+++.+.. -+||=.|=++-+|++.|.+.+.+... .
T Consensus 27 ~~~~l~~~~~~~~v~lv~f~~~~~~~~~~~l----~~~~~~~~l~~~i~~-l~~~gg~T~l~~aL~~a~~~l~~~~~~r~ 101 (163)
T cd01476 27 IVEGLEIGPTATRVALITYSGRGRQRVRFNL----PKHNDGEELLEKVDN-LRFIGGTTATGAAIEVALQQLDPSEGRRE 101 (163)
T ss_pred HHHhcCCCCCCcEEEEEEEcCCCceEEEecC----CCCCCHHHHHHHHHh-CccCCCCccHHHHHHHHHHHhccccCCCC
Confidence 4555543355788889998886333222222 112222234443332 33443344678899999888841100 1
Q ss_pred CCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCc
Q psy11447 277 HDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSL 321 (360)
Q Consensus 277 aDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgsl 321 (360)
.....||++||-.-. .+ |..+++.|... ++|..|+|-||.-
T Consensus 102 ~~~~~villTDG~~~-~~--~~~~~~~l~~~-~~v~v~~vg~g~~ 142 (163)
T cd01476 102 GIPKVVVVLTDGRSH-DD--PEKQARILRAV-PNIETFAVGTGDP 142 (163)
T ss_pred CCCeEEEEECCCCCC-Cc--hHHHHHHHhhc-CCCEEEEEECCCc
Confidence 224799999995321 22 78888888884 8899999998874
No 13
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=96.18 E-value=0.087 Score=44.76 Aligned_cols=92 Identities=11% Similarity=0.178 Sum_probs=62.4
Q ss_pred CCcchHHHHHHHHHHHHhhcccCC--CeEEEEEecCCccccCCC-hhHHHHHHhc-CcccccEEEEEecCchHH--HHHH
Q psy11447 255 SGDYTLESIRASIQDLTTECAQHD--DSILIVLSDANLDRYGIA-PREMGEALNA-EAASVRAYAIFIGSLGDQ--AKRL 328 (360)
Q Consensus 255 sGD~Tlea~~~ai~~~a~~~~daD--e~fVivlSDANL~RYgI~-p~~l~~~l~~-~~p~V~a~~IfIgslgdq--A~~l 328 (360)
-|=++-.|.+.|.+.+.+. ... ...+|++||..-.- |.. ++.+.+.++. ...+|.-++|=||+-.+. -++|
T Consensus 73 g~T~~~~al~~a~~~~~~~--~~~~~~~~ivl~TDG~~~~-~~~~~~~~~~~~~~~~~~~v~i~~i~~g~~~~~~~l~~i 149 (170)
T cd01465 73 GSTAGGAGIQLGYQEAQKH--FVPGGVNRILLATDGDFNV-GETDPDELARLVAQKRESGITLSTLGFGDNYNEDLMEAI 149 (170)
T ss_pred CCCCHHHHHHHHHHHHHhh--cCCCCeeEEEEEeCCCCCC-CCCCHHHHHHHHHHhhcCCeEEEEEEeCCCcCHHHHHHH
Confidence 3446678888898888643 222 26789999996432 333 3666666543 227899999999964332 3344
Q ss_pred HhhCCCCceEEecCCCchHHHH
Q psy11447 329 TQELPAGRGFVCMDLTEIPQIL 350 (360)
Q Consensus 329 ~~~Lp~Gr~fv~~dt~~lP~il 350 (360)
.+ .+.|+.|..-|.++||+|+
T Consensus 150 a~-~~~g~~~~~~~~~~~~~~~ 170 (170)
T cd01465 150 AD-AGNGNTAYIDNLAEARKVF 170 (170)
T ss_pred Hh-cCCceEEEeCCHHHHHhhC
Confidence 44 6679999999999999774
No 14
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=96.01 E-value=0.42 Score=40.52 Aligned_cols=135 Identities=16% Similarity=0.273 Sum_probs=80.5
Q ss_pred hccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEE
Q psy11447 204 QDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILI 283 (360)
Q Consensus 204 ~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVi 283 (360)
..+...-.+.|++-|++.... .....|.+...+.+.++.+-.... ..|=+...|+..|.+.+... +.-...||
T Consensus 32 ~~l~~~~~~~l~~Fs~~~~~~---~~~~~~~~~~~~~~~~~~l~~~~~--~g~T~l~~al~~a~~~l~~~--~~~~~~ii 104 (171)
T cd01461 32 KDLPPGDYFNIIGFSDTVEEF---SPSSVSATAENVAAAIEYVNRLQA--LGGTNMNDALEAALELLNSS--PGSVPQII 104 (171)
T ss_pred HhCCCCCEEEEEEeCCCceee---cCcceeCCHHHHHHHHHHHHhcCC--CCCcCHHHHHHHHHHhhccC--CCCccEEE
Confidence 344444456788888775432 111123333334444443333222 45556788899998888642 33358999
Q ss_pred EEecCCccccCCChhHHHHHHhcC-cccccEEEEEecC-chHH-HHHHHhhCCCCceEEecCCCchHHHH
Q psy11447 284 VLSDANLDRYGIAPREMGEALNAE-AASVRAYAIFIGS-LGDQ-AKRLTQELPAGRGFVCMDLTEIPQIL 350 (360)
Q Consensus 284 vlSDANL~RYgI~p~~l~~~l~~~-~p~V~a~~IfIgs-lgdq-A~~l~~~Lp~Gr~fv~~dt~~lP~il 350 (360)
++||... -.+.++.+.++.. .+.|.-|+|=||+ .+.+ .++|.+. ..|+.+.+-|+++|-+-|
T Consensus 105 llTDG~~----~~~~~~~~~~~~~~~~~i~i~~i~~g~~~~~~~l~~ia~~-~gG~~~~~~~~~~~~~~~ 169 (171)
T cd01461 105 LLTDGEV----TNESQILKNVREALSGRIRLFTFGIGSDVNTYLLERLARE-GRGIARRIYETDDIESQL 169 (171)
T ss_pred EEeCCCC----CCHHHHHHHHHHhcCCCceEEEEEeCCccCHHHHHHHHHc-CCCeEEEecChHHHHHHh
Confidence 9999973 2345555555432 2478888888886 2222 3444444 469999999988876544
No 15
>PF00092 VWA: von Willebrand factor type A domain; InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=95.77 E-value=0.091 Score=44.34 Aligned_cols=143 Identities=14% Similarity=0.242 Sum_probs=88.4
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccC--CcchHHHHHHHHHHHHhh---
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCAS--GDYTLESIRASIQDLTTE--- 273 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~s--GD~Tlea~~~ai~~~a~~--- 273 (360)
+++.|.-...+.++-++-.|.... .++..... +..-+++... ..+-..+ |=++-.|+..|.+.+...
T Consensus 27 ~i~~~~~~~~~~rv~iv~f~~~~~--~~~~~~~~----~~~~~~~~~i--~~~~~~~~g~t~~~~aL~~a~~~l~~~~~~ 98 (178)
T PF00092_consen 27 IISRLSISNNGTRVGIVTFSDSAR--VLFSLTDY----QSKNDLLNAI--NDSIPSSGGGTNLGAALKFAREQLFSSNNG 98 (178)
T ss_dssp HHHHSTBSTTSEEEEEEEESSSEE--EEEETTSH----SSHHHHHHHH--HTTGGCCBSSB-HHHHHHHHHHHTTSGGGT
T ss_pred HHHhhhccccccccceeeeecccc--cccccccc----cccccccccc--cccccccchhhhHHHHHhhhhhcccccccc
Confidence 667776667888888888877765 33343332 2233333333 2222333 334578888888887521
Q ss_pred cccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC---chHHHHHHHhhC-CCCceEEecCCCchHHH
Q psy11447 274 CAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS---LGDQAKRLTQEL-PAGRGFVCMDLTEIPQI 349 (360)
Q Consensus 274 ~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs---lgdqA~~l~~~L-p~Gr~fv~~dt~~lP~i 349 (360)
....-..++|++||.+-.-....+........+ ++..+|.||. -..+-..|...- ..|+.|...|..+++++
T Consensus 99 ~r~~~~~~iiliTDG~~~~~~~~~~~~~~~~~~----~~i~~~~ig~~~~~~~~l~~la~~~~~~~~~~~~~~~~~l~~~ 174 (178)
T PF00092_consen 99 GRPNSPKVIILITDGNSNDSDSPSEEAANLKKS----NGIKVIAIGIDNADNEELRELASCPTSEGHVFYLADFSDLSQI 174 (178)
T ss_dssp TGTTSEEEEEEEESSSSSSHSGHHHHHHHHHHH----CTEEEEEEEESCCHHHHHHHHSHSSTCHHHEEEESSHHHHHHH
T ss_pred cccccccceEEEEeecccCCcchHHHHHHHHHh----cCcEEEEEecCcCCHHHHHHHhCCCCCCCcEEEcCCHHHHHHH
Confidence 114567899999999987765555555555444 3555555555 344555555443 34789999999999998
Q ss_pred HHHH
Q psy11447 350 LQQI 353 (360)
Q Consensus 350 l~qI 353 (360)
.+|.
T Consensus 175 ~~~~ 178 (178)
T PF00092_consen 175 IQQM 178 (178)
T ss_dssp HHHH
T ss_pred HhcC
Confidence 8873
No 16
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=95.61 E-value=0.3 Score=42.23 Aligned_cols=85 Identities=25% Similarity=0.437 Sum_probs=59.2
Q ss_pred CcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccC-CChhHHHHHHhcCcccccEEEEEecC------------ch
Q psy11447 256 GDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYG-IAPREMGEALNAEAASVRAYAIFIGS------------LG 322 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYg-I~p~~l~~~l~~~~p~V~a~~IfIgs------------lg 322 (360)
|-++-.|+..|.+.+.+. ..-...||++||-.-. .| .+|.+..+.+.+ ..|.-|+|.||+ ..
T Consensus 82 ~T~l~~al~~a~~~l~~~--~~~~~~iiliTDG~~~-~g~~~~~~~~~~~~~--~gi~i~~i~ig~~~~~~~~~~~~~~~ 156 (180)
T cd01467 82 GTAIGDAIGLAIKRLKNS--EAKERVIVLLTDGENN-AGEIDPATAAELAKN--KGVRIYTIGVGKSGSGPKPDGSTILD 156 (180)
T ss_pred CCcHHHHHHHHHHHHHhc--CCCCCEEEEEeCCCCC-CCCCCHHHHHHHHHH--CCCEEEEEEecCCCCCcCCCCcccCC
Confidence 334567888888888654 3345899999998543 34 567777665543 689999999998 22
Q ss_pred H-HHHHHHhhCCCCceEEecCCCch
Q psy11447 323 D-QAKRLTQELPAGRGFVCMDLTEI 346 (360)
Q Consensus 323 d-qA~~l~~~Lp~Gr~fv~~dt~~l 346 (360)
. +.++|.+.= .|+.|.+.|..+|
T Consensus 157 ~~~l~~la~~t-gG~~~~~~~~~~~ 180 (180)
T cd01467 157 EDSLVEIADKT-GGRIFRALDGFEL 180 (180)
T ss_pred HHHHHHHHHhc-CCEEEEecCcccC
Confidence 2 345665554 6899999887764
No 17
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=95.55 E-value=0.22 Score=44.06 Aligned_cols=140 Identities=16% Similarity=0.240 Sum_probs=87.9
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhh---cc
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTE---CA 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~---~~ 275 (360)
+++.|.--+.....-+|.-|+... +. +..... .+..+-++.++.| +|--.|=+|-+|++.|.+.+-.. .-
T Consensus 28 ~i~~l~~~~~~~rvgvv~fs~~~~-~~-~~l~~~-~~~~~~~~~i~~~----~~~~g~T~~~~AL~~a~~~l~~~~~g~R 100 (177)
T cd01469 28 VMKKLDIGPTKTQFGLVQYSESFR-TE-FTLNEY-RTKEEPLSLVKHI----SQLLGLTNTATAIQYVVTELFSESNGAR 100 (177)
T ss_pred HHHHcCcCCCCcEEEEEEECCcee-EE-EecCcc-CCHHHHHHHHHhC----ccCCCCccHHHHHHHHHHHhcCcccCCC
Confidence 556665546667788888887742 22 223322 2333334444433 23233457889999999987321 01
Q ss_pred cCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCch------HHHHHHHhhCCC-CceEEecCCCchHH
Q psy11447 276 QHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLG------DQAKRLTQELPA-GRGFVCMDLTEIPQ 348 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslg------dqA~~l~~~Lp~-Gr~fv~~dt~~lP~ 348 (360)
+....++|||||..=.--..+ ...++.++. ..|.-|+|-||.-. .|=..|+.. |. .|.|..-|-++|+.
T Consensus 101 ~~~~kv~illTDG~~~~~~~~-~~~~~~~k~--~gv~v~~Vgvg~~~~~~~~~~~L~~ias~-p~~~h~f~~~~~~~l~~ 176 (177)
T cd01469 101 KDATKVLVVITDGESHDDPLL-KDVIPQAER--EGIIRYAIGVGGHFQRENSREELKTIASK-PPEEHFFNVTDFAALKD 176 (177)
T ss_pred CCCCeEEEEEeCCCCCCcccc-HHHHHHHHH--CCcEEEEEEecccccccccHHHHHHHhcC-CcHHhEEEecCHHHhcc
Confidence 235789999999987765544 555555553 68999999999854 444556666 64 58898888888775
Q ss_pred H
Q psy11447 349 I 349 (360)
Q Consensus 349 i 349 (360)
|
T Consensus 177 ~ 177 (177)
T cd01469 177 I 177 (177)
T ss_pred C
Confidence 4
No 18
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=95.12 E-value=0.5 Score=39.08 Aligned_cols=112 Identities=17% Similarity=0.339 Sum_probs=70.0
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhcccc-CCcchHHHHHHHHHHHHhhc--c
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCA-SGDYTLESIRASIQDLTTEC--A 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~-sGD~Tlea~~~ai~~~a~~~--~ 275 (360)
+...|...+.+....++.=++... ..+.....+ ++-++++.+-. -.+.. +|=++-+|...|.+.+.+.. .
T Consensus 28 ~~~~~~~~~~~~~~~li~f~~~~~--~~~~~~~~~----~~~~~~~~i~~-~~~~~~~~t~~~~al~~a~~~~~~~~~~~ 100 (161)
T cd01450 28 LVEKLDIGPDKTRVGLVQYSDDVR--VEFSLNDYK----SKDDLLKAVKN-LKYLGGGGTNTGKALQYALEQLFSESNAR 100 (161)
T ss_pred HHHheeeCCCceEEEEEEEcCCce--EEEECCCCC----CHHHHHHHHHh-cccCCCCCccHHHHHHHHHHHhccccccc
Confidence 555666556677888888666432 223333322 33344333321 11233 36777999999999987541 0
Q ss_pred cCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC
Q psy11447 276 QHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS 320 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs 320 (360)
+....++|++||..=.-.+ .|..+.+-++.. +|..++|.||+
T Consensus 101 ~~~~~~iiliTDG~~~~~~-~~~~~~~~~~~~--~v~v~~i~~g~ 142 (161)
T cd01450 101 ENVPKVIIVLTDGRSDDGG-DPKEAAAKLKDE--GIKVFVVGVGP 142 (161)
T ss_pred CCCCeEEEEECCCCCCCCc-chHHHHHHHHHC--CCEEEEEeccc
Confidence 2566799999997433322 677777777763 79999999998
No 19
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=94.98 E-value=0.47 Score=43.50 Aligned_cols=145 Identities=13% Similarity=0.221 Sum_probs=83.7
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhc--c-
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTEC--A- 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~--~- 275 (360)
++..|.--.+....-||-.|.+... . +. +...+.+-++.+.+. .-+|.-.|-+|-.|.+.|++.+-.+. +
T Consensus 30 l~~~l~~~~~~~rvglv~fs~~~~~-~-~~----l~~~~~~~~l~~~i~-~i~~~~~~t~tg~AL~~a~~~~~~~~~g~r 102 (224)
T cd01475 30 IIDSLDVGPDATRVGLVQYSSTVKQ-E-FP----LGRFKSKADLKRAVR-RMEYLETGTMTGLAIQYAMNNAFSEAEGAR 102 (224)
T ss_pred HHHhcccCCCccEEEEEEecCceeE-E-ec----ccccCCHHHHHHHHH-hCcCCCCCChHHHHHHHHHHHhCChhcCCC
Confidence 4455533244556667777666421 1 11 211122223333222 23454456678888899988753210 0
Q ss_pred -cCC--CeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHH-HHHHHhhCCCCceEEecCCCchHHHHH
Q psy11447 276 -QHD--DSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQ-AKRLTQELPAGRGFVCMDLTEIPQILQ 351 (360)
Q Consensus 276 -daD--e~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdq-A~~l~~~Lp~Gr~fv~~dt~~lP~il~ 351 (360)
... ..+|||+||-.-. - .+.+.++.|+. ..|.-|+|=||+...+ =..|+..-..++.|..-|-++||.+..
T Consensus 103 ~~~~~~~kvvillTDG~s~-~--~~~~~a~~lk~--~gv~i~~VgvG~~~~~~L~~ias~~~~~~~f~~~~~~~l~~~~~ 177 (224)
T cd01475 103 PGSERVPRVGIVVTDGRPQ-D--DVSEVAAKARA--LGIEMFAVGVGRADEEELREIASEPLADHVFYVEDFSTIEELTK 177 (224)
T ss_pred CCCCCCCeEEEEEcCCCCc-c--cHHHHHHHHHH--CCcEEEEEeCCcCCHHHHHHHhCCCcHhcEEEeCCHHHHHHHhh
Confidence 111 5799999998532 1 35666777764 6899999999985322 122222222357899999999999998
Q ss_pred HHHH
Q psy11447 352 QIFS 355 (360)
Q Consensus 352 qIf~ 355 (360)
+|-.
T Consensus 178 ~l~~ 181 (224)
T cd01475 178 KFQG 181 (224)
T ss_pred hccc
Confidence 8754
No 20
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=94.87 E-value=0.54 Score=40.41 Aligned_cols=111 Identities=15% Similarity=0.215 Sum_probs=62.7
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhh---cc
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTE---CA 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~---~~ 275 (360)
+++.|.-.+....+-|+--|++.... +.... +.+... + .....+.+++-.|=++=+|...|.+.+... ..
T Consensus 28 ~~~~l~~~~~~~~~giv~Fs~~~~~~--~~~~~-~~~~~~---~-~~~l~~l~~~~g~T~~~~al~~a~~~l~~~~~~~~ 100 (164)
T cd01472 28 VVERLDIGPDGVRVGVVQYSDDPRTE--FYLNT-YRSKDD---V-LEAVKNLRYIGGGTNTGKALKYVRENLFTEASGSR 100 (164)
T ss_pred HHhhcccCCCCeEEEEEEEcCceeEE--EecCC-CCCHHH---H-HHHHHhCcCCCCCchHHHHHHHHHHHhCCcccCCC
Confidence 45555433566778888888765432 22222 233222 2 223334455523346788999999988641 01
Q ss_pred cCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCc
Q psy11447 276 QHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSL 321 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgsl 321 (360)
..-..++|++||-.- ..+.. .....+.. ..|..|+|=||..
T Consensus 101 ~~~~~~iiliTDG~~-~~~~~--~~~~~l~~--~gv~i~~ig~g~~ 141 (164)
T cd01472 101 EGVPKVLVVITDGKS-QDDVE--EPAVELKQ--AGIEVFAVGVKNA 141 (164)
T ss_pred CCCCEEEEEEcCCCC-CchHH--HHHHHHHH--CCCEEEEEECCcC
Confidence 345789999999832 22221 12223332 6788888888874
No 21
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=94.85 E-value=1.1 Score=48.39 Aligned_cols=151 Identities=13% Similarity=0.144 Sum_probs=85.6
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchH-HHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhh-ccc
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQ-RLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTE-CAQ 276 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~ke-Rl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~-~~d 276 (360)
+...|..-.+++..-|+.-|+.... ++........+|+ -+..|..+.. ..+..-|=+|-.|...+.+.+... .-+
T Consensus 71 LV~~l~Is~D~V~VgiV~FSd~~r~--vfpL~s~~s~Dk~~aL~~I~sL~~-~~~pgGgTnig~AL~~Aae~L~sr~~R~ 147 (576)
T PTZ00441 71 LIQQLNLSDDAINLYMSLFSNNTTE--LIRLGSGASKDKEQALIIVKSLRK-TYLPYGKTNMTDALLEVRKHLNDRVNRE 147 (576)
T ss_pred HHHHhccCCCceEEEEEEeCCCceE--EEecCCCccccHHHHHHHHHHHHh-hccCCCCccHHHHHHHHHHHHhhccccc
Confidence 5566665577777777776665432 2333333223333 2333443332 123334445677887777766421 002
Q ss_pred CCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHH--HHHHh---hCCCCceEEecCCCchHHHHH
Q psy11447 277 HDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQA--KRLTQ---ELPAGRGFVCMDLTEIPQILQ 351 (360)
Q Consensus 277 aDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA--~~l~~---~Lp~Gr~fv~~dt~~lP~il~ 351 (360)
+--.+||||||-.=.- +-.+.+.++.|.. ..|..|+|=||+-.+.. ..|+- .-+.|+.|+..|-++|=.+.+
T Consensus 148 nvpKVVILLTDG~sns-~~dvleaAq~LR~--~GVeI~vIGVG~g~n~e~LrlIAgC~p~~g~c~~Y~vadf~eL~~ivk 224 (576)
T PTZ00441 148 NAIQLVILMTDGIPNS-KYRALEESRKLKD--RNVKLAVIGIGQGINHQFNRLLAGCRPREGKCKFYSDADWEEAKNLIK 224 (576)
T ss_pred CCceEEEEEecCCCCC-cccHHHHHHHHHH--CCCEEEEEEeCCCcCHHHHHHHhccCCCCCCCceEEeCCHHHHHHHHH
Confidence 2347999999987421 1223344555654 69999999999844433 22431 224467788888877777777
Q ss_pred HHHH
Q psy11447 352 QIFS 355 (360)
Q Consensus 352 qIf~ 355 (360)
.|..
T Consensus 225 ~Lik 228 (576)
T PTZ00441 225 PFIA 228 (576)
T ss_pred HHHH
Confidence 6654
No 22
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=94.69 E-value=0.66 Score=41.24 Aligned_cols=92 Identities=11% Similarity=0.149 Sum_probs=57.0
Q ss_pred ccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCC---ChhHHHHHHhcCcccccEEEEEecCchHHHHH
Q psy11447 251 QFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGI---APREMGEALNAEAASVRAYAIFIGSLGDQAKR 327 (360)
Q Consensus 251 Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI---~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~ 327 (360)
+|.-.|=+|-+|+..|.+.+.......-...||++||-.= .+. .+.+..+.++ + .+|+.|+|-||+ .++..-
T Consensus 82 ~~~gg~T~~~~AL~~a~~~l~~~~~~~~~~~iillTDG~~--~~~~~~~~~~~~~~~~-~-~gi~i~~vgig~-~~~~~L 156 (186)
T cd01480 82 EYIGGGTFTDCALKYATEQLLEGSHQKENKFLLVITDGHS--DGSPDGGIEKAVNEAD-H-LGIKIFFVAVGS-QNEEPL 156 (186)
T ss_pred ccCCCCccHHHHHHHHHHHHhccCCCCCceEEEEEeCCCc--CCCcchhHHHHHHHHH-H-CCCEEEEEecCc-cchHHH
Confidence 4544555668999999999864111234589999999853 232 2334444444 3 799999999999 555542
Q ss_pred -HHhhCCCCceEEecCCCchH
Q psy11447 328 -LTQELPAGRGFVCMDLTEIP 347 (360)
Q Consensus 328 -l~~~Lp~Gr~fv~~dt~~lP 347 (360)
--..-|.|+-|+.-..+-+|
T Consensus 157 ~~IA~~~~~~~~~~~~~~l~~ 177 (186)
T cd01480 157 SRIACDGKSALYRENFAELLW 177 (186)
T ss_pred HHHHcCCcchhhhcchhhhcc
Confidence 22344556644444444445
No 23
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=93.90 E-value=2 Score=37.73 Aligned_cols=121 Identities=12% Similarity=0.151 Sum_probs=70.0
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchH-HHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhh--cc
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQ-RLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTE--CA 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~ke-Rl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~--~~ 275 (360)
+++.|.--.+....-||-=|++.. .+++...++.++++ -+.+++.+... .+-..|=++-+|...|.+.+.+. ..
T Consensus 29 ~~~~~~~~~~~~~vglv~Fs~~~~--~~~~l~~~~~~~~~~~~~~i~~l~~~-~~~~G~T~l~~aL~~a~~~l~~~~~~r 105 (186)
T cd01471 29 FVQNLNISPDEINLYLVTFSTNAK--ELIRLSSPNSTNKDLALNAIRALLSL-YYPNGSTNTTSALLVVEKHLFDTRGNR 105 (186)
T ss_pred HHHhcccCCCceEEEEEEecCCce--EEEECCCccccchHHHHHHHHHHHhC-cCCCCCccHHHHHHHHHHHhhccCCCc
Confidence 555554335667777777665433 34444444444443 33455544322 23345567889999999888641 00
Q ss_pred cCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHH
Q psy11447 276 QHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQA 325 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA 325 (360)
..-..+||++||-.=.-. -.+...++.|.. ..|+.++|=||+--|..
T Consensus 106 ~~~~~~villTDG~~~~~-~~~~~~a~~l~~--~gv~v~~igiG~~~d~~ 152 (186)
T cd01471 106 ENAPQLVIIMTDGIPDSK-FRTLKEARKLRE--RGVIIAVLGVGQGVNHE 152 (186)
T ss_pred ccCceEEEEEccCCCCCC-cchhHHHHHHHH--CCCEEEEEEeehhhCHH
Confidence 112368999999875433 233345556644 57888888888744433
No 24
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=93.57 E-value=2.1 Score=37.01 Aligned_cols=131 Identities=15% Similarity=0.190 Sum_probs=72.0
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhc---c
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTEC---A 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~---~ 275 (360)
++..|+-......+-++..|++... .+ ..+. .++..+=++.+..+ .|+-.|=+|-+|.+.+.+.+-+.. .
T Consensus 28 l~~~~~~~~~~~rvgli~fs~~~~~-~~-~l~~-~~~~~~l~~~l~~~----~~~~g~T~~~~aL~~a~~~~~~~~~~~r 100 (164)
T cd01482 28 VVEAFEIGPDGVQVGLVQYSDDPRT-EF-DLNA-YTSKEDVLAAIKNL----PYKGGNTRTGKALTHVREKNFTPDAGAR 100 (164)
T ss_pred HHhheeeCCCceEEEEEEECCCeeE-EE-ecCC-CCCHHHHHHHHHhC----cCCCCCChHHHHHHHHHHHhcccccCCC
Confidence 5666654456788999999998532 11 1121 12222222333332 344445566777777777663320 0
Q ss_pred cCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCch-HHHHHHHhhCCCC-ceEEecC
Q psy11447 276 QHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLG-DQAKRLTQELPAG-RGFVCMD 342 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslg-dqA~~l~~~Lp~G-r~fv~~d 342 (360)
..-...+|++||-+=. -.+.+.++.|.+ -.|.-|+|-||.-. .+-+.|+.. |.+ |.|...|
T Consensus 101 ~~~~k~iillTDG~~~---~~~~~~a~~lk~--~gi~i~~ig~g~~~~~~L~~ia~~-~~~~~~~~~~d 163 (164)
T cd01482 101 PGVPKVVILITDGKSQ---DDVELPARVLRN--LGVNVFAVGVKDADESELKMIASK-PSETHVFNVAD 163 (164)
T ss_pred CCCCEEEEEEcCCCCC---chHHHHHHHHHH--CCCEEEEEecCcCCHHHHHHHhCC-CchheEEEcCC
Confidence 1234689999998632 245666777765 57888888887642 222333333 422 5555443
No 25
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=92.93 E-value=1.7 Score=36.78 Aligned_cols=108 Identities=13% Similarity=0.162 Sum_probs=60.5
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhcccc-CCcchHHHHHHHHHHHHhhcccC
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCA-SGDYTLESIRASIQDLTTECAQH 277 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~-sGD~Tlea~~~ai~~~a~~~~da 277 (360)
+++++....+ .+.++.-+++ ...... ++ ...+.++++.+. +... .|-+.-.|...+.+.+.+. ..
T Consensus 28 l~~~~~~~~~--~v~li~F~~~-~~~~~~----~~--~~~~~~~~~~l~---~~~~~ggT~l~~al~~a~~~l~~~--~~ 93 (152)
T cd01462 28 LLRIALAENR--DTYLILFDSE-FQTKIV----DK--TDDLEEPVEFLS---GVQLGGGTDINKALRYALELIERR--DP 93 (152)
T ss_pred HHHHHHHcCC--cEEEEEeCCC-ceEEec----CC--cccHHHHHHHHh---cCCCCCCcCHHHHHHHHHHHHHhc--CC
Confidence 4555443233 3456766666 222111 12 223444555443 2222 4455677888888877654 44
Q ss_pred CCeEEEEEecCCccccCCChhHH-HHHHhcCcccccEEEEEecCchH
Q psy11447 278 DDSILIVLSDANLDRYGIAPREM-GEALNAEAASVRAYAIFIGSLGD 323 (360)
Q Consensus 278 De~fVivlSDANL~RYgI~p~~l-~~~l~~~~p~V~a~~IfIgslgd 323 (360)
-...||++||-.= =++++..+ ...+... -+|.-++|.||+-++
T Consensus 94 ~~~~ivliTDG~~--~~~~~~~~~~~~~~~~-~~~~v~~~~~g~~~~ 137 (152)
T cd01462 94 RKADIVLITDGYE--GGVSDELLREVELKRS-RVARFVALALGDHGN 137 (152)
T ss_pred CCceEEEECCCCC--CCCCHHHHHHHHHHHh-cCcEEEEEEecCCCC
Confidence 4689999999851 24555443 1333333 579999999999543
No 26
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=91.91 E-value=3.8 Score=36.25 Aligned_cols=98 Identities=13% Similarity=0.172 Sum_probs=64.0
Q ss_pred CCcchHHHHHHHHHHHHhhcccC--CCeEEEEEecCCcc-ccCCChhHHHHHHhcCcccccEEEEEecCch-HHHHHHHh
Q psy11447 255 SGDYTLESIRASIQDLTTECAQH--DDSILIVLSDANLD-RYGIAPREMGEALNAEAASVRAYAIFIGSLG-DQAKRLTQ 330 (360)
Q Consensus 255 sGD~Tlea~~~ai~~~a~~~~da--De~fVivlSDANL~-RYgI~p~~l~~~l~~~~p~V~a~~IfIgslg-dqA~~l~~ 330 (360)
-|=+|-+|.+.|.+.+.+....+ .-..||+|||-.-. .-+..|..-++.+.. ..|.-|+|=||... .+-..|+.
T Consensus 78 g~T~~~~aL~~a~~~l~~~~~~~r~~~~~villTDG~~~~~~~~~~~~~a~~l~~--~gv~i~~vgv~~~~~~~L~~iA~ 155 (185)
T cd01474 78 GQTYIHEGLENANEQIFNRNGGGRETVSVIIALTDGQLLLNGHKYPEHEAKLSRK--LGAIVYCVGVTDFLKSQLINIAD 155 (185)
T ss_pred CCCcHHHHHHHHHHHHHhhccCCCCCCeEEEEEcCCCcCCCCCcchHHHHHHHHH--cCCEEEEEeechhhHHHHHHHhC
Confidence 34567899999998884321112 23789999998753 334456666666655 56777777775422 12244443
Q ss_pred hCCCCceE-EecCCCchHHHHHHHHHh
Q psy11447 331 ELPAGRGF-VCMDLTEIPQILQQIFSA 356 (360)
Q Consensus 331 ~Lp~Gr~f-v~~dt~~lP~il~qIf~s 356 (360)
-| ++.| +..|..+|+.++++|...
T Consensus 156 -~~-~~~f~~~~~~~~l~~~~~~~~~~ 180 (185)
T cd01474 156 -SK-EYVFPVTSGFQALSGIIESVVKK 180 (185)
T ss_pred -CC-CeeEecCccHHHHHHHHHHHHHh
Confidence 34 8899 667889999999998764
No 27
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=91.87 E-value=6.9 Score=35.20 Aligned_cols=86 Identities=10% Similarity=0.125 Sum_probs=50.5
Q ss_pred cccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCc---ccccEEEEEecCch--HHHH
Q psy11447 252 FCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEA---ASVRAYAIFIGSLG--DQAK 326 (360)
Q Consensus 252 fc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~---p~V~a~~IfIgslg--dqA~ 326 (360)
+..-|=+.-+|.+.|.+.+. . .-...||+|||-.-.-.+ .|.+....+.... +.|..|+|=||+-- ++-+
T Consensus 112 ~~~G~T~l~~aL~~a~~~l~-~---~~~~~iillTDG~~~~~~-~~~~~~~~~~~~~~~~~~i~i~~igiG~~~~~~~l~ 186 (206)
T cd01456 112 TPTGWTPLAAALAEAAAYVD-P---GRVNVVVLITDGEDTCGP-DPCEVARELAKRRTPAPPIKVNVIDFGGDADRAELE 186 (206)
T ss_pred CCCCcChHHHHHHHHHHHhC-C---CCcceEEEEcCCCccCCC-CHHHHHHHHHHhcCCCCCceEEEEEecCcccHHHHH
Confidence 33334455677777777663 1 112799999998644333 5544444444332 58999999999842 3345
Q ss_pred HHHhhCCCCce-EEecCC
Q psy11447 327 RLTQELPAGRG-FVCMDL 343 (360)
Q Consensus 327 ~l~~~Lp~Gr~-fv~~dt 343 (360)
+|.+.- .|.. |...|.
T Consensus 187 ~iA~~t-gG~~~~~~~~~ 203 (206)
T cd01456 187 AIAEAT-GGTYAYNQSDL 203 (206)
T ss_pred HHHHhc-CCeEecccccc
Confidence 555554 3665 544443
No 28
>TIGR02640 gas_vesic_GvpN gas vesicle protein GvpN. Members of this family are the GvpN protein associated with the production of gas vesicles produced in some prokaryotes to give cells buoyancy. This family belongs to a larger family of ATPases (pfam07728).
Probab=91.42 E-value=0.54 Score=44.56 Aligned_cols=65 Identities=26% Similarity=0.338 Sum_probs=54.2
Q ss_pred hhhhhcccccCCCCchhHHHHHHhcCCCCChhHHHHHHHhhhhhhhhhccCCCCCCccchhhhhhhhhhh
Q psy11447 38 HRDLFSVHVIDNPSPESELALLSSYGPDVPRDTLIRLVRAFGEFRELSDKSLLPYPYSTREAVHIVKHLQ 107 (360)
Q Consensus 38 lgd~fs~h~v~np~~esE~~~L~~ygp~v~~~~l~~Lv~afgeLr~~~deGli~ypYs~Re~v~ivKhL~ 107 (360)
+-|-|..+-++.|+.+.|.+.++++- .++++++.+++..+.++|.... ...+|+|+.+.+.+++.
T Consensus 173 L~~R~~~i~i~~P~~~~e~~Il~~~~-~~~~~~~~~iv~~~~~~R~~~~----~~~~~~r~~i~~~~~~~ 237 (262)
T TIGR02640 173 LLDRLITIFMDYPDIDTETAILRAKT-DVAEDSAATIVRLVREFRASGD----EITSGLRASLMIAEVAT 237 (262)
T ss_pred HHhhcEEEECCCCCHHHHHHHHHHhh-CCCHHHHHHHHHHHHHHHhhCC----ccCCcHHHHHHHHHHHH
Confidence 45557788899999999999999874 7999999999999999993222 24668999999999884
No 29
>COG4548 NorD Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]
Probab=91.09 E-value=0.87 Score=48.97 Aligned_cols=97 Identities=22% Similarity=0.366 Sum_probs=72.8
Q ss_pred CCcchH--HHHHHHHHHHHhhcccCCCeEEEEEecC---Ccc----ccCCChhHHHHHHhcCcccccEEEEEecCchHHH
Q psy11447 255 SGDYTL--ESIRASIQDLTTECAQHDDSILIVLSDA---NLD----RYGIAPREMGEALNAEAASVRAYAIFIGSLGDQA 325 (360)
Q Consensus 255 sGD~Tl--ea~~~ai~~~a~~~~daDe~fVivlSDA---NL~----RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA 325 (360)
.|++|- -|||+|-+.|... .=--.|.||+||- +++ ||||.-..=|-+..+. ..|..|.|||.+ ....
T Consensus 529 Pg~ytR~G~AIR~As~kL~~r--pq~qklLivlSDGkPnd~d~YEgr~gIeDTr~AV~eaRk-~Gi~VF~Vtld~-ea~~ 604 (637)
T COG4548 529 PGYYTRDGAAIRHASAKLMER--PQRQKLLIVLSDGKPNDFDHYEGRFGIEDTREAVIEARK-SGIEVFNVTLDR-EAIS 604 (637)
T ss_pred ccccccccHHHHHHHHHHhcC--cccceEEEEecCCCcccccccccccchhhHHHHHHHHHh-cCceEEEEEecc-hhhh
Confidence 466773 5889999999754 3345999999994 455 9999988888888887 999999999998 3333
Q ss_pred HHHHhhCCCCceEEecCCCchHHHHHHHHHh
Q psy11447 326 KRLTQELPAGRGFVCMDLTEIPQILQQIFSA 356 (360)
Q Consensus 326 ~~l~~~Lp~Gr~fv~~dt~~lP~il~qIf~s 356 (360)
.--......|=+|| -+..+||-.|-+|+-.
T Consensus 605 y~p~~fgqngYa~V-~~v~~LP~~L~~lyrk 634 (637)
T COG4548 605 YLPALFGQNGYAFV-ERVAQLPGALPPLYRK 634 (637)
T ss_pred hhHHHhccCceEEc-cchhhcchhHHHHHHH
Confidence 33344455566665 5888999888888754
No 30
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most,
Probab=90.62 E-value=10 Score=32.66 Aligned_cols=75 Identities=13% Similarity=0.188 Sum_probs=43.7
Q ss_pred chHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC-chHH-HHHHHhhCCCC
Q psy11447 258 YTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS-LGDQ-AKRLTQELPAG 335 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs-lgdq-A~~l~~~Lp~G 335 (360)
++-+|.+.|.+.+.......-...||+|||-.= .+| ..++.....+|..|+|-||. ...+ -++|+. ...|
T Consensus 78 ~~~~al~~a~~~~~~~~~~~~~~~iillTDG~~-~~~------~~~~~~~~~~v~v~~igig~~~~~~~l~~iA~-~t~G 149 (155)
T cd01466 78 NVVGGLKKALKVLGDRRQKNPVASIMLLSDGQD-NHG------AVVLRADNAPIPIHTFGLGASHDPALLAFIAE-ITGG 149 (155)
T ss_pred cHHHHHHHHHHHHhhcccCCCceEEEEEcCCCC-Ccc------hhhhcccCCCceEEEEecCCCCCHHHHHHHHh-ccCc
Confidence 447888888887753210112368999999742 233 22333333678999999984 3323 344443 4457
Q ss_pred ceEEe
Q psy11447 336 RGFVC 340 (360)
Q Consensus 336 r~fv~ 340 (360)
+.|.+
T Consensus 150 ~~~~~ 154 (155)
T cd01466 150 TFSYV 154 (155)
T ss_pred eEEEe
Confidence 77654
No 31
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=89.68 E-value=9 Score=34.01 Aligned_cols=122 Identities=14% Similarity=0.194 Sum_probs=62.8
Q ss_pred eEeeeeccCCCCcc-ccccCCCCCCCchHHHHHHHHHHHhhccccCC-cchHHHHHHHHHHHHhhc----c---cCCCeE
Q psy11447 211 SLDIIGHSGETHSI-PFVTSNNLPRDNKQRLEVIKIMHAHAQFCASG-DYTLESIRASIQDLTTEC----A---QHDDSI 281 (360)
Q Consensus 211 ~ydivGHSGd~~~i-~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sG-D~Tlea~~~ai~~~a~~~----~---daDe~f 281 (360)
.+-|+.-|++...+ |+......+.+...+-++.+.+ .+.+..| -++-.|.+.|++.+.+.. . ..--..
T Consensus 50 ~v~lv~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l---~~l~~~G~T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~ 126 (190)
T cd01463 50 FFNIITFSNEVNPVVPCFNDTLVQATTSNKKVLKEAL---DMLEAKGIANYTKALEFAFSLLLKNLQSNHSGSRSQCNQA 126 (190)
T ss_pred EEEEEEeCCCeeEEeeecccceEecCHHHHHHHHHHH---hhCCCCCcchHHHHHHHHHHHHHHhhhcccccccCCceeE
Confidence 46677777774432 2221111111222333333333 2333344 466789999999887510 0 011257
Q ss_pred EEEEecCCccccCCChhHHHHHHhcCc---ccccEEEEEecCch---HHHHHHHhhCCCCceEEe
Q psy11447 282 LIVLSDANLDRYGIAPREMGEALNAEA---ASVRAYAIFIGSLG---DQAKRLTQELPAGRGFVC 340 (360)
Q Consensus 282 VivlSDANL~RYgI~p~~l~~~l~~~~---p~V~a~~IfIgslg---dqA~~l~~~Lp~Gr~fv~ 340 (360)
+|+|||-.=. .+..+.+.++... ..|.-|.|-||+-- ++-++|++.- .|..+..
T Consensus 127 iillTDG~~~----~~~~~~~~~~~~~~~~~~v~i~tigiG~~~~d~~~L~~lA~~~-~G~~~~i 186 (190)
T cd01463 127 IMLITDGVPE----NYKEIFDKYNWDKNSEIPVRVFTYLIGREVTDRREIQWMACEN-KGYYSHI 186 (190)
T ss_pred EEEEeCCCCC----cHhHHHHHhcccccCCCcEEEEEEecCCccccchHHHHHHhhc-CCeEEEc
Confidence 9999997533 2344444444331 25888888888732 3445555543 3554443
No 32
>cd01473 vWA_CTRP CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=85.58 E-value=28 Score=31.62 Aligned_cols=145 Identities=14% Similarity=0.288 Sum_probs=85.3
Q ss_pred HHHHhhccccceeEeeeeccCCCCc-cccccCCCCCCCchHHHHHHHHHHHhhc-cc-cCCcchHHHHHHHHHHHHhhcc
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHS-IPFVTSNNLPRDNKQRLEVIKIMHAHAQ-FC-ASGDYTLESIRASIQDLTTECA 275 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~-i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-fc-~sGD~Tlea~~~ai~~~a~~~~ 275 (360)
+++.|.--++....-+|=-|++... +||-. ....+|+ ++++.+..-.+ |- -.|=+|-+|++.|.+.+....+
T Consensus 29 lv~~l~i~~~~~rvgvv~fs~~~~~~~~~~~---~~~~~~~--~l~~~i~~l~~~~~~~g~T~~~~AL~~a~~~~~~~~~ 103 (192)
T cd01473 29 IINNLNISKDKVHVGILLFAEKNRDVVPFSD---EERYDKN--ELLKKINDLKNSYRSGGETYIVEALKYGLKNYTKHGN 103 (192)
T ss_pred HHHhCccCCCccEEEEEEecCCceeEEecCc---ccccCHH--HHHHHHHHHHhccCCCCcCcHHHHHHHHHHHhccCCC
Confidence 6677765577777777777766542 34422 1122333 33333332221 22 3466789999999998853210
Q ss_pred -cCC-CeEEEEEecCCccccC-CChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCc--------eEEecCCC
Q psy11447 276 -QHD-DSILIVLSDANLDRYG-IAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGR--------GFVCMDLT 344 (360)
Q Consensus 276 -daD-e~fVivlSDANL~RYg-I~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr--------~fv~~dt~ 344 (360)
..+ ..++|||||-.=.--. -.+.+.++.|+. ..|..|+|=||. .++++ .+.| ||. -+...+-.
T Consensus 104 ~r~~~~kv~IllTDG~s~~~~~~~~~~~a~~lk~--~gV~i~~vGiG~-~~~~e--l~~i-a~~~~~~~~~~~~~~~~f~ 177 (192)
T cd01473 104 RRKDAPKVTMLFTDGNDTSASKKELQDISLLYKE--ENVKLLVVGVGA-ASENK--LKLL-AGCDINNDNCPNVIKTEWN 177 (192)
T ss_pred CcccCCeEEEEEecCCCCCcchhhHHHHHHHHHH--CCCEEEEEEecc-ccHHH--HHHh-cCCCCCCCCCCeEEecchh
Confidence 122 6899999997543211 113344444544 799999999998 56654 3344 242 23445788
Q ss_pred chHHHHHHHH
Q psy11447 345 EIPQILQQIF 354 (360)
Q Consensus 345 ~lP~il~qIf 354 (360)
+|+.+-+++-
T Consensus 178 ~l~~~~~~l~ 187 (192)
T cd01473 178 NLNGISKFLT 187 (192)
T ss_pred hHHHHHHHHH
Confidence 8988877763
No 33
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=82.51 E-value=53 Score=34.99 Aligned_cols=155 Identities=15% Similarity=0.200 Sum_probs=88.7
Q ss_pred ccCCchhh----HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHH
Q psy11447 191 NTGSDLGT----SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRAS 266 (360)
Q Consensus 191 fn~~DL~r----vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~a 266 (360)
|.|-++++ +..++..+...=.+-|++=+++...+ .....|.+...+-++++ ....-+ ..-|=+-..|++.|
T Consensus 284 M~g~~i~~ak~al~~~l~~L~~~d~~~ii~F~~~~~~~---~~~~~~~~~~~~~~a~~-~i~~l~-a~GgT~l~~aL~~a 358 (596)
T TIGR03788 284 MAGESIEQAKSALLLALDQLRPGDRFNIIQFDSDVTLL---FPVPVPATAHNLARARQ-FVAGLQ-ADGGTEMAGALSAA 358 (596)
T ss_pred CCCccHHHHHHHHHHHHHhCCCCCEEEEEEECCcceEe---ccccccCCHHHHHHHHH-HHhhCC-CCCCccHHHHHHHH
Confidence 56655555 22344555545566677665553321 12223444444433332 332222 23455557778888
Q ss_pred HHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHH--HHHHHhhCCCCceEEecCCC
Q psy11447 267 IQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQ--AKRLTQELPAGRGFVCMDLT 344 (360)
Q Consensus 267 i~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdq--A~~l~~~Lp~Gr~fv~~dt~ 344 (360)
++..... .+.--+.||+|||-... .+..+.+.+.....+++-|.|-||+--+. -++|++. -.|..+..-|..
T Consensus 359 ~~~~~~~-~~~~~~~iillTDG~~~----~~~~~~~~~~~~~~~~ri~tvGiG~~~n~~lL~~lA~~-g~G~~~~i~~~~ 432 (596)
T TIGR03788 359 LRDDGPE-SSGALRQVVFLTDGAVG----NEDALFQLIRTKLGDSRLFTVGIGSAPNSYFMRKAAQF-GRGSFTFIGSTD 432 (596)
T ss_pred HHhhccc-CCCceeEEEEEeCCCCC----CHHHHHHHHHHhcCCceEEEEEeCCCcCHHHHHHHHHc-CCCEEEECCCHH
Confidence 7653221 02223579999998631 24456666655435678899999984332 2344444 347778888999
Q ss_pred chHHHHHHHHHh
Q psy11447 345 EIPQILQQIFSA 356 (360)
Q Consensus 345 ~lP~il~qIf~s 356 (360)
+++..|.+.|..
T Consensus 433 ~~~~~~~~~l~~ 444 (596)
T TIGR03788 433 EVQRKMSQLFAK 444 (596)
T ss_pred HHHHHHHHHHHh
Confidence 999999998875
No 34
>PF14062 DUF4253: Domain of unknown function (DUF4253)
Probab=82.30 E-value=2.4 Score=36.16 Aligned_cols=67 Identities=15% Similarity=0.290 Sum_probs=50.4
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhcc-ccCC--cchHHHHHHHHH
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQF-CASG--DYTLESIRASIQ 268 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qf-c~sG--D~Tlea~~~ai~ 268 (360)
+|--++..++||---|+|.++|.-++ .+.+||++..+-+++.+...+-..- ..-| ..|+++...++.
T Consensus 32 ~~a~lr~W~er~ga~i~~i~~d~le~---~v~~pP~~~~ea~~lA~E~y~fCpD~v~qg~~~~~l~~la~~L~ 101 (111)
T PF14062_consen 32 IIAVLRYWEERYGAEIVGIGFDTLEL---SVARPPQTPEEAEALAAEHYAFCPDIVDQGYGNFTLEELAEELR 101 (111)
T ss_pred HHHHHHHHHHHhCEEEEEEECCEEEE---EECCCCCCHHHHHHHHHHHHHhCccHHhcCCCCCCHHHHHHHHh
Confidence 77777888999999999999996653 6788999999999998876653222 1223 448887776664
No 35
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=81.17 E-value=31 Score=31.64 Aligned_cols=65 Identities=15% Similarity=0.207 Sum_probs=45.3
Q ss_pred CcchHHHHHHHHHHHHhhccc---CCCeEEEEEecC-CccccCCChhHHHHHHhcCcccccEEEEEecCchH
Q psy11447 256 GDYTLESIRASIQDLTTECAQ---HDDSILIVLSDA-NLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGD 323 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~d---aDe~fVivlSDA-NL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgd 323 (360)
|=+|-.|++.|.+.+...... .=-..||||||- +-+.- -.|...++.|.. ..|..|+|=||+-.|
T Consensus 106 gT~ig~aL~~A~~~l~~~~~~~R~~v~kvvIllTDg~~~~~~-~~~~~~a~~l~~--~GI~i~tVGiG~~~d 174 (193)
T cd01477 106 ASYLDTGLQAAEQMLAAGKRTSRENYKKVVIVFASDYNDEGS-NDPRPIAARLKS--TGIAIITVAFTQDES 174 (193)
T ss_pred cchHHHHHHHHHHHHHhhhccccCCCCeEEEEEecCccCCCC-CCHHHHHHHHHH--CCCEEEEEEeCCCCC
Confidence 577888998888877532001 125889999974 22221 567888888864 799999999999544
No 36
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=80.71 E-value=39 Score=29.70 Aligned_cols=84 Identities=19% Similarity=0.228 Sum_probs=51.7
Q ss_pred chHHHHHHHHHHHHhhcccC-CCeEEEEEecCCccccCCCh-----hHHHHHHhcCcccccEEEEEecCch---HHHHHH
Q psy11447 258 YTLESIRASIQDLTTECAQH-DDSILIVLSDANLDRYGIAP-----REMGEALNAEAASVRAYAIFIGSLG---DQAKRL 328 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~da-De~fVivlSDANL~RYgI~p-----~~l~~~l~~~~p~V~a~~IfIgslg---dqA~~l 328 (360)
++-+|.+.|.+.+....... ....||+|||-.=.. |.+| .++++.+.. ..|..++|.+|+-+ ..-++|
T Consensus 77 ~l~~aL~~a~~~l~~~~~~~~~~~~ivliTDG~~~~-g~~~~~~~~~~~~~~l~~--~gi~v~~I~~~~~~~~~~~l~~i 153 (178)
T cd01451 77 PLAAGLLAAYELAAEQARDPGQRPLIVVITDGRANV-GPDPTADRALAAARKLRA--RGISALVIDTEGRPVRRGLAKDL 153 (178)
T ss_pred cHHHHHHHHHHHHHHHhcCCCCceEEEEECCCCCCC-CCCchhHHHHHHHHHHHh--cCCcEEEEeCCCCccCccHHHHH
Confidence 45778888888872210011 237999999985433 4444 344555654 45777777776532 235777
Q ss_pred HhhCCCCceEEecCCCc
Q psy11447 329 TQELPAGRGFVCMDLTE 345 (360)
Q Consensus 329 ~~~Lp~Gr~fv~~dt~~ 345 (360)
.+.- .|+.|..-|...
T Consensus 154 A~~t-gG~~~~~~d~~~ 169 (178)
T cd01451 154 ARAL-GGQYVRLPDLSA 169 (178)
T ss_pred HHHc-CCeEEEcCcCCH
Confidence 7766 488887776653
No 37
>PF08406 CbbQ_C: CbbQ/NirQ/NorQ C-terminal ; InterPro: IPR013615 This domain is found at the C terminus of proteins of the CbbQ/NirQ/NorQ family of proteins which play a role in the post-translational activation of Rubisco []. It is also found in the Thauera aromaticaTutH protein which is similar to the CbbQ/NirQ/NorQ family [], as well as in putative chaperones. The ATPase domain associated with various cellular activities (AAA) IPR011704 from INTERPRO is found in the same bacterial and archaeal proteins as the domain described here.
Probab=80.70 E-value=8.5 Score=31.69 Aligned_cols=73 Identities=15% Similarity=0.140 Sum_probs=50.0
Q ss_pred HHHHhcCCCCChhHHHHHHHhhhhhhhhhccCCCCCCccchhhhhhhhhhh------hhHHHhhhhhccccCCchhhHHH
Q psy11447 57 ALLSSYGPDVPRDTLIRLVRAFGEFRELSDKSLLPYPYSTREAVHIVKHLQ------EGVLGQHENTRKFNGYDGRLDRE 130 (360)
Q Consensus 57 ~~L~~ygp~v~~~~l~~Lv~afgeLr~~~deGli~ypYs~Re~v~ivKhL~------~~v~dv~~nv~rFd~yD~rl~R~ 130 (360)
+.+.+=+ .++.+..++||..=+.+|.+++.| +..+-|||.++...+=+. +.+.-++...+. -|+.+
T Consensus 3 ~Iv~~es-gl~~~~a~~LV~~A~~iR~l~~~~-L~e~~STR~Li~~A~Li~~G~~~~~A~~~ai~~~Lt---dd~~~--- 74 (86)
T PF08406_consen 3 EIVARES-GLDPEIAEKLVKLANAIRNLKGGG-LEETVSTRLLIYAARLIAAGMDPRQACRAAIVEPLT---DDPDV--- 74 (86)
T ss_pred HHHHHHc-CCCHHHHHHHHHHHHHHHhccCCC-CCCCCcHHHHHHHHHHHHcCCCHHHHHHHHHhccCC---CCHHH---
Confidence 3444333 799999999999999999997777 678889999999766554 333333333332 37777
Q ss_pred HHHHHHH
Q psy11447 131 LEAVIMV 137 (360)
Q Consensus 131 lE~v~~V 137 (360)
.+++..+
T Consensus 75 ~~al~~~ 81 (86)
T PF08406_consen 75 REALREL 81 (86)
T ss_pred HHHHHHH
Confidence 4444443
No 38
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=78.63 E-value=48 Score=37.80 Aligned_cols=129 Identities=11% Similarity=0.135 Sum_probs=75.3
Q ss_pred eEeeeeccCCCCc-cccccCCCCCCCchHHHHHHHHHHHhhccccCCcch--HHHHHHHHHHHHhhcccCCCeEEEEEec
Q psy11447 211 SLDIIGHSGETHS-IPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYT--LESIRASIQDLTTECAQHDDSILIVLSD 287 (360)
Q Consensus 211 ~ydivGHSGd~~~-i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~T--lea~~~ai~~~a~~~~daDe~fVivlSD 287 (360)
..-||.=|+.... .||+.. .+..+|-.+.+.+-.. +|..| -.|++.|++.|.+......-..+|+|||
T Consensus 343 rVGLVtFsssA~vl~pLt~I----ts~~dr~aL~~~L~~~-----A~GGT~I~~GL~~Alq~L~~~~~~~~~~~IILLTD 413 (863)
T TIGR00868 343 WVGMVTFDSAAYIKNELIQI----TSSAERDALTANLPTA-----ASGGTSICSGLKAAFQVIKKSYQSTDGSEIVLLTD 413 (863)
T ss_pred EEEEEEECCceeEeeccccC----CcHHHHHHHHHhhccc-----cCCCCcHHHHHHHHHHHHHhcccccCCCEEEEEeC
Confidence 4455655655332 234332 2234555555554321 34445 6888889988865411224578999999
Q ss_pred CCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecCCCch---HHHHHHHH
Q psy11447 288 ANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMDLTEI---PQILQQIF 354 (360)
Q Consensus 288 ANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~dt~~l---P~il~qIf 354 (360)
-.=. . ...+.+.+.. ..|.-|.|=+|+-.++.-+=-...-.|+.|.+-|.+++ ...+.+|-
T Consensus 414 Gedn--~--~~~~l~~lk~--~gVtI~TIg~G~dad~~L~~IA~~TGG~~f~asd~~dl~~L~dAF~~is 477 (863)
T TIGR00868 414 GEDN--T--ISSCFEEVKQ--SGAIIHTIALGPSAAKELEELSDMTGGLRFYASDQADNNGLIDAFGALS 477 (863)
T ss_pred CCCC--C--HHHHHHHHHH--cCCEEEEEEeCCChHHHHHHHHHhcCCEEEEeCCHHHHHHHHHHHHHHh
Confidence 6421 2 2344445543 57889999999866554333344568999998876654 55555553
No 39
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=75.26 E-value=24 Score=32.25 Aligned_cols=88 Identities=9% Similarity=0.121 Sum_probs=55.0
Q ss_pred CCchHHHHHHHHHHHhhcc-------ccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccC---CChhHHHHH
Q psy11447 234 RDNKQRLEVIKIMHAHAQF-------CASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYG---IAPREMGEA 303 (360)
Q Consensus 234 k~~keRl~vl~~M~aH~Qf-------c~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYg---I~p~~l~~~ 303 (360)
..+.++++.++.+..-... +-.|=+=.+|+..|.+.+.......-.+-||++||.. .-+| ++++++..+
T Consensus 76 ~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~l~~aL~~a~~~~~~~~~~~~~k~IvL~TDg~-~p~~~~~~~~~~~~~~ 154 (218)
T cd01458 76 TPGAERVEDLKELIEPGGLSFAGQVGDSGQVSLSDALWVCLDLFSKGKKKKSHKRIFLFTNND-DPHGGDSIKDSQAAVK 154 (218)
T ss_pred CCCHHHHHHHHHHhhcchhhhcccCCCCCCccHHHHHHHHHHHHHhccccccccEEEEECCCC-CCCCCCHHHHHHHHHH
Confidence 5577888888887643311 1123334889999999886411144568899999964 2222 445554433
Q ss_pred HhcC-cccccEEEEEecCch
Q psy11447 304 LNAE-AASVRAYAIFIGSLG 322 (360)
Q Consensus 304 l~~~-~p~V~a~~IfIgslg 322 (360)
...- .-+|+-++|+||+-+
T Consensus 155 a~~l~~~gI~i~~i~i~~~~ 174 (218)
T cd01458 155 AEDLKDKGIELELFPLSSPG 174 (218)
T ss_pred HHHHHhCCcEEEEEecCCCC
Confidence 3321 147999999999965
No 40
>PF08683 CAMSAP_CKK: Microtubule-binding calmodulin-regulated spectrin-associated; InterPro: IPR014797 This is the C-terminal domain of a family of eumetazoan proteins collectively defined as calmodulin-regulated spectrin-associated, or CAMSAP, proteins. CAMSAP proteins carry an N-terminal region that includes the CH domain, a central region including a predicted coiled-coil and this C-terminal, or CKK, domain - defined as being present in CAMSAP, KIAA1078 and KIAA1543, The C-terminal domain is the part of the CAMSAP proteins that binds to microtubules. The domain appears to act by producing inhibition of neurite extension, probably by blocking microtubule function. CKK represents a domain that has evolved with the metazoa []. The structure of a murine hypothetical protein from RIKEN cDNA has shown the domain to adopt a mainly beta barrel structure with an associated alpha-helical hairpin. ; PDB: 1UGJ_A.
Probab=73.96 E-value=9.3 Score=33.70 Aligned_cols=81 Identities=17% Similarity=0.384 Sum_probs=52.1
Q ss_pred HHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCcc----------------ccCCChhHHHHHH
Q psy11447 241 EVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLD----------------RYGIAPREMGEAL 304 (360)
Q Consensus 241 ~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~----------------RYgI~p~~l~~~l 304 (360)
.|++.-..| -|.+|.+--+.=..+++++++. +++ +|+|++.|++.. =||..|+.+..
T Consensus 15 ~iI~nAL~~--~~LaG~vN~~~r~~~l~~~~~s--~~~-hflILfrd~~~~fRglY~~~~~~~~~~ki~G~gP~~i~~-- 87 (123)
T PF08683_consen 15 RIIHNALSH--CCLAGPVNEKQRNKALEEMSKS--EAN-HFLILFRDAGCQFRGLYSYDPESEELVKIYGTGPRVITP-- 87 (123)
T ss_dssp HHHHHHHHH--TTT-SSTTHHHHHHHHHHHHHS----S--EEEEESSSS-SEEEEEEE-TTSS-EEEEESSS-SEE-T--
T ss_pred HHHHHHHHH--hcCCCccCHHHHHHHHHHHhhC--CCC-eEEEEEecCCCceEEEEEEeCCCCeEEEEEccCcCccCH--
Confidence 456655555 7999999999999999999886 665 999999998754 25666766643
Q ss_pred hcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecCCCch
Q psy11447 305 NAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMDLTEI 346 (360)
Q Consensus 305 ~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~dt~~l 346 (360)
.-|.++.-|=.+ ++.|-..+++.+
T Consensus 88 ----~mv~~~~KYdSg--------------~K~F~~i~sk~~ 111 (123)
T PF08683_consen 88 ----SMVDKFYKYDSG--------------SKQFKPIPSKHL 111 (123)
T ss_dssp ----TTEEEEEEEETT--------------TTEEEE-SSSS-
T ss_pred ----HHHHHHhccccc--------------CceeeeccCccc
Confidence 345566666544 666666666654
No 41
>cd00951 KDGDH 5-dehydro-4-deoxyglucarate dehydratase, also called 5-keto-4-deoxy-glucarate dehydratase (KDGDH), which is member of dihydrodipicolinate synthase (DHDPS) family that comprises several pyruvate-dependent class I aldolases. The enzyme is involved in glucarate metabolism, and its mechanism presumbly involves a Schiff-base intermediate similar to members of DHDPS family. While in the case of Pseudomonas sp. 5-dehydro-4-deoxy-D-glucarate is degraded by KDGDH to 2,5-dioxopentanoate, in certain species of Enterobacteriaceae it is degraded instead to pyruvate and glycerate.
Probab=69.63 E-value=1.1e+02 Score=29.64 Aligned_cols=44 Identities=18% Similarity=0.218 Sum_probs=33.4
Q ss_pred eeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCcchHHHHHHH
Q psy11447 213 DIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGDYTLESIRAS 266 (360)
Q Consensus 213 divGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD~Tlea~~~a 266 (360)
=+.|++|+.+.+ +..||.++++....++. +|-.|.+|-++++.+
T Consensus 39 ~v~GstGE~~~L----------s~eEr~~l~~~~~~~~~~~~pvi~gv~~~t~~~i~~a 87 (289)
T cd00951 39 FAAGGTGEFFSL----------TPDEYAQVVRAAVEETAGRVPVLAGAGYGTATAIAYA 87 (289)
T ss_pred EECcCCcCcccC----------CHHHHHHHHHHHHHHhCCCCCEEEecCCCHHHHHHHH
Confidence 378999999874 56799999999888764 565666777776643
No 42
>cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=68.53 E-value=4.3 Score=38.11 Aligned_cols=65 Identities=37% Similarity=0.740 Sum_probs=48.3
Q ss_pred ccCC-chhhHHHHHHHHHHHHHhhhhhccceeeeeccCCCCccccccccccCCChhhHHHHHHHhhccccc
Q psy11447 120 FNGY-DGRLDRELEAVIMVLEAFQDFDQRIRLDIIGHSGETHSIPFVTSIKNLPRDNKQRLEVIKVHHNNL 189 (360)
Q Consensus 120 Fd~y-D~rl~R~lE~v~~Vme~~~~~e~kl~~di~~HSG~~~siP~v~kvk~~are~k~RL~ei~m~~~~l 189 (360)
||+- ..||+.-...+-. |.++.++.++|++||+|+...+|++. +..||...++||+.+..+..++
T Consensus 19 ~dg~~~~RL~a~k~v~~~----f~~f~~~r~~DriG~~g~~~~~~~lt-~d~p~t~d~~~~~~l~~~l~~~ 84 (191)
T cd01455 19 YDGRLDRSLEAVVMVMEA----FDGFEDKIQYDIIGHSGDGPCVPFVK-TNHPPKNNKERLETLKMMHAHS 84 (191)
T ss_pred cCCccccHHHHHHHHHHH----HHHHHHhCccceeeecCcccccCccc-cccCcccchhHHHHHHHHHHhc
Confidence 4444 4566554444333 23346688999999999999999998 5999999999999999877743
No 43
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=68.02 E-value=1.5e+02 Score=32.09 Aligned_cols=85 Identities=13% Similarity=0.128 Sum_probs=54.1
Q ss_pred cchHHHHHHHHHHHHhhcccCCCeEEEEEec--CCccccCC----Ch---------hHHHHHHhcCcccccEEEEEecCc
Q psy11447 257 DYTLESIRASIQDLTTECAQHDDSILIVLSD--ANLDRYGI----AP---------REMGEALNAEAASVRAYAIFIGSL 321 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daDe~fVivlSD--ANL~RYgI----~p---------~~l~~~l~~~~p~V~a~~IfIgsl 321 (360)
=++-.|+..|.+.+.......-...||+||| +|-...|. .| ..+++.+.. ..|.+.||=+|+-
T Consensus 482 TpL~~gL~~A~~~~~~~~~~~~~~~ivllTDG~~nv~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~gi~~~vid~~~~ 559 (589)
T TIGR02031 482 TPLAAGLAAAFQTALQARSSGGTPTIVLITDGRGNIPLDGDPESIKADREQAAEEALALARKIRE--AGMPALVIDTAMR 559 (589)
T ss_pred CcHHHHHHHHHHHHHHhcccCCceEEEEECCCCCCCCCCcccccccccchhHHHHHHHHHHHHHh--cCCeEEEEeCCCC
Confidence 3447888888888764310112358999999 56544442 21 334566654 5688888888752
Q ss_pred -h--HHHHHHHhhCCCCceEEecCCC
Q psy11447 322 -G--DQAKRLTQELPAGRGFVCMDLT 344 (360)
Q Consensus 322 -g--dqA~~l~~~Lp~Gr~fv~~dt~ 344 (360)
. ..+.+|++.+- |+.|-+-|.+
T Consensus 560 ~~~~~~~~~lA~~~~-g~y~~l~~~~ 584 (589)
T TIGR02031 560 FVSTGFAQKLARKMG-AHYIYLPNAT 584 (589)
T ss_pred CccchHHHHHHHhcC-CcEEeCCCCC
Confidence 1 35999999995 8877665543
No 44
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=66.51 E-value=64 Score=34.96 Aligned_cols=119 Identities=20% Similarity=0.264 Sum_probs=66.4
Q ss_pred cccceeEeeeeccCCCCccccccCCCCCCCchHHH-HHHHHHHHhhccccCCcch--HHHHHHHHHHHHhh--cccCCCe
Q psy11447 206 FDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRL-EVIKIMHAHAQFCASGDYT--LESIRASIQDLTTE--CAQHDDS 280 (360)
Q Consensus 206 ~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl-~vl~~M~aH~Qfc~sGD~T--lea~~~ai~~~a~~--~~daDe~ 280 (360)
+..+-..-+|+=+|+...+- -||+.+.+.. ..|..| -.|..| -.|+..|.+.+... ....-..
T Consensus 499 ~~~~D~v~lI~F~g~~a~~~-----~p~t~~~~~~~~~L~~l-------~~gG~Tpl~~aL~~A~~~l~~~~~~~~~~~~ 566 (633)
T TIGR02442 499 YQKRDKVALITFRGEEAEVL-----LPPTSSVELAARRLEEL-------PTGGRTPLAAGLLKAAEVLSNELLRDDDGRP 566 (633)
T ss_pred hcCCCEEEEEEECCCCceEE-----cCCCCCHHHHHHHHHhC-------CCCCCCCHHHHHHHHHHHHHHhhccCCCCce
Confidence 44556667777776643321 2455444433 222222 236566 78888888887621 0033458
Q ss_pred EEEEEecC--CccccCCCh----hHHHHHHhcCcccccEEEEEecCc---hHHHHHHHhhCCCCceEE
Q psy11447 281 ILIVLSDA--NLDRYGIAP----REMGEALNAEAASVRAYAIFIGSL---GDQAKRLTQELPAGRGFV 339 (360)
Q Consensus 281 fVivlSDA--NL~RYgI~p----~~l~~~l~~~~p~V~a~~IfIgsl---gdqA~~l~~~Lp~Gr~fv 339 (360)
.||+|||- |-.-.|-+| ..+++.|.. ..|.+.+|-+++- -.-+..|++.+- |+.|.
T Consensus 567 ~vvliTDG~~n~~~~~~~~~~~~~~~a~~l~~--~~i~~~vIdt~~~~~~~~~~~~lA~~~g-g~y~~ 631 (633)
T TIGR02442 567 LLVVITDGRANVADGGEPPTDDARTIAAKLAA--RGILFVVIDTESGFVRLGLAEDLARALG-GEYVR 631 (633)
T ss_pred EEEEECCCCCCCCCCCCChHHHHHHHHHHHHh--cCCeEEEEeCCCCCcchhHHHHHHHhhC-CeEEe
Confidence 99999995 332012121 334555543 4677777766541 145888888884 77663
No 45
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=63.25 E-value=55 Score=35.48 Aligned_cols=124 Identities=19% Similarity=0.192 Sum_probs=74.5
Q ss_pred cccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcch--HHHHHHHHHHHHhhcccCCCeEEE
Q psy11447 206 FDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYT--LESIRASIQDLTTECAQHDDSILI 283 (360)
Q Consensus 206 ~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~T--lea~~~ai~~~a~~~~daDe~fVi 283 (360)
|..+=+.-+|.=+|+....- -||+.+.++. .+.+- +. -.|.-| -.|+..|.+.+.......-...||
T Consensus 434 y~~rD~v~lI~F~g~~a~~~-----lppT~~~~~~--~~~L~---~l-~~gGgTpL~~gL~~A~~~l~~~~~~~~~~~iV 502 (584)
T PRK13406 434 YVRRDQVALVAFRGRGAELL-----LPPTRSLVRA--KRSLA---GL-PGGGGTPLAAGLDAAAALALQVRRKGMTPTVV 502 (584)
T ss_pred cCCCCEEEEEEECCCceeEE-----cCCCcCHHHH--HHHHh---cC-CCCCCChHHHHHHHHHHHHHHhccCCCceEEE
Confidence 44455677888888865432 2455544433 22221 11 155666 566777777664321122358999
Q ss_pred EEec--CCcccc---C-----CChhHHHHHHhcCcccccEEEEEecCchH-HHHHHHhhCCCCceEEecCC
Q psy11447 284 VLSD--ANLDRY---G-----IAPREMGEALNAEAASVRAYAIFIGSLGD-QAKRLTQELPAGRGFVCMDL 343 (360)
Q Consensus 284 vlSD--ANL~RY---g-----I~p~~l~~~l~~~~p~V~a~~IfIgslgd-qA~~l~~~Lp~Gr~fv~~dt 343 (360)
+||| +|...= | ..+.+++..+.. ..|.+++|-+|.-.. .+.+|.+.+- |+-|-.-+.
T Consensus 503 LlTDG~~n~~~~~~~~~~~~~~~~~~~a~~~~~--~gi~~~vId~g~~~~~~~~~LA~~~g-g~y~~l~~~ 570 (584)
T PRK13406 503 LLTDGRANIARDGTAGRAQAEEDALAAARALRA--AGLPALVIDTSPRPQPQARALAEAMG-ARYLPLPRA 570 (584)
T ss_pred EEeCCCCCCCccccccccchhhHHHHHHHHHHh--cCCeEEEEecCCCCcHHHHHHHHhcC-CeEEECCCC
Confidence 9999 454321 1 223445555544 579999999987664 8999999995 887765443
No 46
>TIGR01650 PD_CobS cobaltochelatase, CobS subunit. This model describes the aerobic cobalamin pathway Pseudomonas denitrificans CobS gene product, which is a cobalt chelatase subunit, with a MW ~37 kDa. The aerobic pathway cobalt chelatase is a heterotrimeric, ATP-dependent enzyme that catalyzes cobalt insertion during cobalamin biosynthesis. The other two subunits are the P. denitrificans CobT (TIGR01651) and CobN (pfam02514 CobN/Magnesium Chelatase) proteins. To avoid potential confusion with the nonhomologous Salmonella typhimurium/E.coli cobS gene product, the P. denitrificans gene symbol is not used in the name of this model.
Probab=60.11 E-value=18 Score=36.52 Aligned_cols=67 Identities=15% Similarity=0.177 Sum_probs=56.2
Q ss_pred hhhcc-cccCCCCchhHHHHHHhcCCCCC----hhHHHHHHHhhhhhhhhhccCCCCCCccchhhhhhhhhh
Q psy11447 40 DLFSV-HVIDNPSPESELALLSSYGPDVP----RDTLIRLVRAFGEFRELSDKSLLPYPYSTREAVHIVKHL 106 (360)
Q Consensus 40 d~fs~-h~v~np~~esE~~~L~~ygp~v~----~~~l~~Lv~afgeLr~~~deGli~ypYs~Re~v~ivKhL 106 (360)
|=|.. ..++-|+.+.|...|.+..|.++ +.++..+|+.-...|..+..|-+..+-|+|.++...+..
T Consensus 209 DRF~i~~~~~Yp~~e~E~~Il~~~~~~~~~~~~~~i~~~mV~la~~tR~~~~~~~i~~~~SpR~li~w~~~~ 280 (327)
T TIGR01650 209 DRWSIVTTLNYLEHDNEAAIVLAKAKGFDDTEGKDIINAMVRVADMTRNAFINGDISTVMSPRTVITWAENA 280 (327)
T ss_pred hheeeEeeCCCCCHHHHHHHHHhhccCCCccchHHHHHHHHHHHHHHHhhhccCCccccccHHHHHHHHHHH
Confidence 44653 45999999999999988766653 688999999999999988788899999999999987765
No 47
>KOG2454|consensus
Probab=59.89 E-value=37 Score=35.99 Aligned_cols=86 Identities=23% Similarity=0.366 Sum_probs=65.4
Q ss_pred ccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHh
Q psy11447 251 QFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQ 330 (360)
Q Consensus 251 Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~ 330 (360)
|-.|||-.-.|-+|.+...+ ++++.+|=+++ +.+ +-+++|+++ |.|+ +++|||| -.-|..|.+
T Consensus 226 ~~~WS~~fy~e~ir~~L~a~-----g~~p~LVq~it-------clp--d~a~~ltSh-~g~k-hitFiGS-qpvak~i~~ 288 (583)
T KOG2454|consen 226 HASWSGCFYFEIIRAALAAV-----GAPPNLVQVIT-------CLP--DTAEALTSH-SGVK-HITFIGS-QPVAKMIMR 288 (583)
T ss_pred ceeeehhhHHHHHHHHHHHc-----CCCcchhheee-------cCc--chHhHhhcC-CCcc-eEEEecC-cHHHHHHHH
Confidence 56799999999999987753 67788888874 444 789999997 9997 7899999 555666544
Q ss_pred ----h-------CCCCceEEecCCCchHHHHHHH
Q psy11447 331 ----E-------LPAGRGFVCMDLTEIPQILQQI 353 (360)
Q Consensus 331 ----~-------Lp~Gr~fv~~dt~~lP~il~qI 353 (360)
+ |-.--+|+..|..+|-++.+-+
T Consensus 289 ~AAk~LTPv~lELGGKDafIi~dda~l~~v~si~ 322 (583)
T KOG2454|consen 289 NAAKTLTPVTLELGGKDAFIICDDADLSHVASIA 322 (583)
T ss_pred hhhhhcCcEEEeecCcccEEEeccccHHHHHHHH
Confidence 2 3333589988888888776544
No 48
>PF13768 VWA_3: von Willebrand factor type A domain
Probab=58.19 E-value=1.1e+02 Score=25.73 Aligned_cols=110 Identities=18% Similarity=0.288 Sum_probs=68.8
Q ss_pred hhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccc-cCCcchHHHHHHHHHHHHhhcccCCCeE
Q psy11447 203 FQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFC-ASGDYTLESIRASIQDLTTECAQHDDSI 281 (360)
Q Consensus 203 f~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc-~sGD~Tlea~~~ai~~~a~~~~daDe~f 281 (360)
++.+..+=++.|+.=+. .+ -++. .+-.|.++..+-+.++.|.. ..+ ..|=+...|.+.|.+... . .....-
T Consensus 28 l~~L~~~d~fnii~f~~-~~-~~~~-~~~~~~~~~~~~~a~~~I~~--~~~~~G~t~l~~aL~~a~~~~~-~--~~~~~~ 99 (155)
T PF13768_consen 28 LRSLPPGDRFNIIAFGS-SV-RPLF-PGLVPATEENRQEALQWIKS--LEANSGGTDLLAALRAALALLQ-R--PGCVRA 99 (155)
T ss_pred HHhCCCCCEEEEEEeCC-Ee-eEcc-hhHHHHhHHHHHHHHHHHHH--hcccCCCccHHHHHHHHHHhcc-c--CCCccE
Confidence 34454455666665333 21 2333 44456666777666666654 334 566666788998888752 2 345688
Q ss_pred EEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchH
Q psy11447 282 LIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGD 323 (360)
Q Consensus 282 VivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgd 323 (360)
+|+|||-+... ++.++-+.......+++-|++=||+-.+
T Consensus 100 IilltDG~~~~---~~~~i~~~v~~~~~~~~i~~~~~g~~~~ 138 (155)
T PF13768_consen 100 IILLTDGQPVS---GEEEILDLVRRARGHIRIFTFGIGSDAD 138 (155)
T ss_pred EEEEEeccCCC---CHHHHHHHHHhcCCCceEEEEEECChhH
Confidence 99999998733 3456666665543789999999998554
No 49
>cd08567 GDPD_SpGDE_like Glycerophosphodiester phosphodiesterase domain of putative Silicibacter pomeroyi glycerophosphodiester phosphodiesterase and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized bacterial glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46) and similar proteins. The prototype of this CD is a putative GP-GDE from Silicibacter pomeroyi (SpGDE). It shows high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=57.54 E-value=15 Score=34.01 Aligned_cols=35 Identities=20% Similarity=0.324 Sum_probs=25.4
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
.||++|.+.|++.=+. -.|.| |..+||.-|.+|+|
T Consensus 15 ENT~~Af~~A~~~Gad-~vE~DV~~TkDg~~Vv~HD~~l~r 54 (263)
T cd08567 15 ENTLPAFAKALDLGVD-TLELDLVLTKDGVIVVSHDPKLNP 54 (263)
T ss_pred cchHHHHHHHHHcCCC-EEEEEEEEcCCCCEEEeCCCccCc
Confidence 5899999999974221 11222 57788999999996
No 50
>cd00408 DHDPS-like Dihydrodipicolinate synthase family. A member of the class I aldolases, which use an active-site lysine which stablilzes a reaction intermediate via Schiff base formation, and have TIM beta/alpha barrel fold. The dihydrodipicolinate synthase family comprises several pyruvate-dependent class I aldolases that use the same catalytic step to catalyze different reactions in different pathways and includes such proteins as N-acetylneuraminate lyase, MosA protein, 5-keto-4-deoxy-glucarate dehydratase, trans-o-hydroxybenzylidenepyruvate hydratase-aldolase, trans-2'-carboxybenzalpyruvate hydratase-aldolase, and 2-keto-3-deoxy- gluconate aldolase. The family is also referred to as the N-acetylneuraminate lyase (NAL) family.
Probab=56.94 E-value=1.8e+02 Score=27.53 Aligned_cols=106 Identities=20% Similarity=0.304 Sum_probs=60.5
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCc-chHHHHHHHHHHHHhhcccCCCe-----
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGD-YTLESIRASIQDLTTECAQHDDS----- 280 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD-~Tlea~~~ai~~~a~~~~daDe~----- 280 (360)
.-+.|++|+.+.+ +..||.++++....+.. +|-.|. +|-|+++.|- .+.+- -||--
T Consensus 35 i~~~GstGE~~~l----------s~~Er~~l~~~~~~~~~~~~~vi~gv~~~~~~~~i~~a~-~a~~~--Gad~v~v~pP 101 (281)
T cd00408 35 LVVLGTTGEAPTL----------TDEERKEVIEAVVEAVAGRVPVIAGVGANSTREAIELAR-HAEEA--GADGVLVVPP 101 (281)
T ss_pred EEECCCCcccccC----------CHHHHHHHHHHHHHHhCCCCeEEEecCCccHHHHHHHHH-HHHHc--CCCEEEECCC
Confidence 3588999999874 57999999999998874 444554 4545555443 22221 22211
Q ss_pred ----------------------EEEEEecCC-ccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCC
Q psy11447 281 ----------------------ILIVLSDAN-LDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELP 333 (360)
Q Consensus 281 ----------------------fVivlSDAN-L~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp 333 (360)
+=|++-+.- ...+.++|+.+.++.. . |+|-+.=.--+.+ .+-.++.+..|
T Consensus 102 ~y~~~~~~~~~~~~~~ia~~~~~pi~iYn~P~~tg~~l~~~~~~~L~~-~-~~v~giK~s~~d~-~~~~~~~~~~~ 174 (281)
T cd00408 102 YYNKPSQEGIVAHFKAVADASDLPVILYNIPGRTGVDLSPETIARLAE-H-PNIVGIKDSSGDL-DRLTRLIALLG 174 (281)
T ss_pred cCCCCCHHHHHHHHHHHHhcCCCCEEEEECccccCCCCCHHHHHHHhc-C-CCEEEEEeCCCCH-HHHHHHHHhcC
Confidence 111222221 2357888999988874 4 7665544333332 33334444444
No 51
>cd00615 Orn_deC_like Ornithine decarboxylase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to ornithine decarboxylase (ODC), arginine decarboxylase (ADC) and lysine decarboxylase (LDC). ODC is a dodecamer composed of six homodimers and catalyzes the decarboxylation of tryptophan. ADC catalyzes the decarboxylation of arginine and LDC catalyzes the decarboxylation of lysine. Members of this family are widely found in all three forms of life.
Probab=56.64 E-value=1.1e+02 Score=29.15 Aligned_cols=23 Identities=30% Similarity=0.439 Sum_probs=19.1
Q ss_pred CCChhHHHHHHhcCcccccEEEEE
Q psy11447 294 GIAPREMGEALNAEAASVRAYAIF 317 (360)
Q Consensus 294 gI~p~~l~~~l~~~~p~V~a~~If 317 (360)
+|+|+.+.++++.+ |++++.+|.
T Consensus 139 ~i~~~~l~~~l~~~-~~~k~v~l~ 161 (294)
T cd00615 139 GIPPETFKKALIEH-PDAKAAVIT 161 (294)
T ss_pred CCCHHHHHHHHHhC-CCceEEEEE
Confidence 78999999999886 777776665
No 52
>cd08601 GDPD_SaGlpQ_like Glycerophosphodiester phosphodiesterase domain of Staphylococcus aureus and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46) from Staphylococcus aureus, Bacillus subtilis and similar proteins. Members in this family show very high sequence similarity to Escherichia coli periplasmic phosphodiesterase GlpQ, which catalyzes the Ca2+-dependent degradation of periplasmic glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=56.08 E-value=19 Score=33.58 Aligned_cols=35 Identities=23% Similarity=0.357 Sum_probs=25.7
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
.||++|.++|++.=+. -.|.| |..+||.-|.+|+|
T Consensus 15 ENT~~af~~A~~~G~d-~vE~DV~lTkDg~~Vv~HD~~l~r 54 (256)
T cd08601 15 EHTFAAYDLAREMGAD-YIELDLQMTKDGVLVAMHDETLDR 54 (256)
T ss_pred CchHHHHHHHHHcCCC-EEEEEeeECCCCeEEEeCCCcccc
Confidence 6899999999984221 01222 57889999999998
No 53
>PF11775 CobT_C: Cobalamin biosynthesis protein CobT VWA domain
Probab=52.88 E-value=55 Score=31.67 Aligned_cols=88 Identities=22% Similarity=0.242 Sum_probs=55.8
Q ss_pred HHHHHHHHHHHHhhcccCCCeEEEEEecCCcc---ccCCC-h----hHHHHHHhcC--cccccEEEEEecCchHHHHHHH
Q psy11447 260 LESIRASIQDLTTECAQHDDSILIVLSDANLD---RYGIA-P----REMGEALNAE--AASVRAYAIFIGSLGDQAKRLT 329 (360)
Q Consensus 260 lea~~~ai~~~a~~~~daDe~fVivlSDANL~---RYgI~-p----~~l~~~l~~~--~p~V~a~~IfIgslgdqA~~l~ 329 (360)
=||++.|.+.|..- ..+.+|.||+||-+=. -++-. + ..|.++...- ...|.-..|=|| ..+.+
T Consensus 119 GeAl~~a~~rL~~r--~e~rkiLiViSDG~P~d~st~~~n~~~~L~~HLr~vi~~ie~~~~Vel~aiGIg---~D~~~-- 191 (219)
T PF11775_consen 119 GEALRWAAERLLAR--PEQRKILIVISDGAPADDSTLSANDGDYLDAHLRQVIAEIETRSDVELIAIGIG---HDVSR-- 191 (219)
T ss_pred HHHHHHHHHHHHcC--CccceEEEEEeCCCcCcccccccCChHHHHHHHHHHHHHHhccCCcEEEEEEcC---CCchh--
Confidence 58999999999876 7889999999996533 12222 2 3444554432 145665555544 34443
Q ss_pred hhCCCCceEEecCCCchHHHHHHHHHhc
Q psy11447 330 QELPAGRGFVCMDLTEIPQILQQIFSAS 357 (360)
Q Consensus 330 ~~Lp~Gr~fv~~dt~~lP~il~qIf~s~ 357 (360)
--| ++-+..|..+||-.|=..++..
T Consensus 192 -yY~--~~~~i~~~e~l~~~~~~~l~~l 216 (219)
T PF11775_consen 192 -YYR--RAVTIDDVEELGGALFEQLARL 216 (219)
T ss_pred -hcc--cceecCCHHHHHHHHHHHHHHH
Confidence 333 6677888899997765555443
No 54
>cd00300 LDH_like L-lactate dehydrogenase-like enzymes. Members of this subfamily are tetrameric NAD-dependent 2-hydroxycarboxylate dehydrogenases including LDHs, L-2-hydroxyisocaproate dehydrogenases (L-HicDH), and LDH-like malate dehydrogenases (MDH). Dehydrogenases catalyze the conversion of carbonyl compounds to alcohols or amino acids. LDHs catalyze the last step of glycolysis in which pyruvate is converted to L-lactate. Vertebrate LDHs are non-allosteric, but some bacterial LDHs are activated by an allosteric effector such as fructose-1,6-bisphosphate. L-HicDH catalyzes the conversion of a variety of 2-oxo carboxylic acids with medium-sized aliphatic or aromatic side chains. MDH is one of the key enzymes in the citric acid cycle, facilitating both the conversion of malate to oxaloacetate and replenishing levels of oxalacetate by reductive carboxylation of pyruvate. The LDH-like subfamily is part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of prot
Probab=51.75 E-value=49 Score=32.22 Aligned_cols=86 Identities=26% Similarity=0.464 Sum_probs=56.7
Q ss_pred CCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC------cccc-
Q psy11447 221 THSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN------LDRY- 293 (360)
Q Consensus 221 ~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN------L~RY- 293 (360)
+.+|-++.+|.|++....|++.+.. +++-++.-++.+.+ -+-+.+||++|... +.+|
T Consensus 66 ~aDiVIitag~p~~~~~~R~~l~~~-------------n~~i~~~~~~~i~~---~~p~~~viv~sNP~d~~~~~~~~~s 129 (300)
T cd00300 66 DADIVVITAGAPRKPGETRLDLINR-------------NAPILRSVITNLKK---YGPDAIILVVSNPVDILTYVAQKLS 129 (300)
T ss_pred CCCEEEEcCCCCCCCCCCHHHHHHH-------------HHHHHHHHHHHHHH---hCCCeEEEEccChHHHHHHHHHHHh
Confidence 4678899999999998899998775 55555555566654 34567888888532 2233
Q ss_pred CCChh----------------HHHHHHhcCcccccEEEEEecCchHH
Q psy11447 294 GIAPR----------------EMGEALNAEAASVRAYAIFIGSLGDQ 324 (360)
Q Consensus 294 gI~p~----------------~l~~~l~~~~p~V~a~~IfIgslgdq 324 (360)
|++|. .+++.|.-+..+|++++| |.-|+.
T Consensus 130 g~~~~kviG~gt~lDs~r~~~~la~~l~v~~~~v~~~vi--GeHg~s 174 (300)
T cd00300 130 GLPKNRVIGSGTLLDSARFRSLLAEKLDVDPQSVHAYVL--GEHGDS 174 (300)
T ss_pred CcCHHHEEecCCcHHHHHHHHHHHHHhCCCcccEEEEEE--eccCCc
Confidence 44433 345556666566888755 887764
No 55
>cd08600 GDPD_EcGlpQ_like Glycerophosphodiester phosphodiesterase domain of Escherichia coli (GlpQ) and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Escherichia coli periplasmic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46), GlpQ, and similar proteins. GP-GDE plays an essential role in the metabolic pathway of E. coli. It catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. E. coli possesses two major G3P uptake systems: Glp and Ugp, which contain genes coding for two different GP-GDEs. GlpQ gene from the E. coli glp operon codes for a periplasmic phosphodiesterase GlpQ, which is the prototype of this family. GlpQ is a dimeric enzyme that hydrolyzes periplasmic glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GP
Probab=51.12 E-value=21 Score=35.24 Aligned_cols=36 Identities=28% Similarity=0.349 Sum_probs=27.1
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
.||++|.+.|++.=+. ..|.| |..+||.-|+||+|-
T Consensus 15 ENTl~Af~~A~~~Gad-~iE~DV~lTkDg~lVv~HD~~l~rt 55 (318)
T cd08600 15 EHTLEAKALAYAQGAD-YLEQDVVLTKDDKLVVIHDHYLDNV 55 (318)
T ss_pred ccHHHHHHHHHHcCCC-EEEeeeeECcCCcEEEeCCchhhcc
Confidence 6899999999975221 12333 688999999999985
No 56
>COG0773 MurC UDP-N-acetylmuramate-alanine ligase [Cell envelope biogenesis, outer membrane]
Probab=50.57 E-value=2.6e+02 Score=29.93 Aligned_cols=156 Identities=16% Similarity=0.265 Sum_probs=107.7
Q ss_pred ccCCchhhHHHHhhcc-ccceeEeeeeccCCCCcccccc-CCCCCCCchHHHHHHHHHHHh--hccccCCcchHHHHHHH
Q psy11447 191 NTGSDLGTSLEAFQDF-DQRISLDIIGHSGETHSIPFVT-SNNLPRDNKQRLEVIKIMHAH--AQFCASGDYTLESIRAS 266 (360)
Q Consensus 191 fn~~DL~rvMEaf~g~-e~k~~ydivGHSGd~~~i~lV~-~~~~Pk~~keRl~vl~~M~aH--~Qfc~sGD~Tlea~~~a 266 (360)
--|+|.+.+-++|.+| --|=+|++.|+.+. +-++. .+..|++-+-=|+-+|...-+ -=-+.-==||--=|+.-
T Consensus 294 ~~Gi~~~~i~~aL~~F~GvkRRfe~~g~~~~---~~viDDYaHHPtEI~aTL~aaR~~~~~~~rIvaifQPHrySRt~~~ 370 (459)
T COG0773 294 ELGIDPEAIAEALASFQGVKRRFELKGEVNG---VTVIDDYAHHPTEIKATLAAARQKVPGGKRIVAVFQPHRYSRTRDL 370 (459)
T ss_pred HcCCCHHHHHHHHHhCCCcceeeEEeeeECC---EEEEecCCCCHHHHHHHHHHHHHhcCCCceEEEEECCCchHhHHHH
Confidence 5677866677777666 23556678888887 55554 789999888777777766532 00111112554445555
Q ss_pred HHHHHhhcccCCCeEEEEEecCCcccc-C-CChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecCCC
Q psy11447 267 IQDLTTECAQHDDSILIVLSDANLDRY-G-IAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMDLT 344 (360)
Q Consensus 267 i~~~a~~~~daDe~fVivlSDANL~RY-g-I~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~dt~ 344 (360)
.++-++--.+||+=|+.=+-.|.=+.. | ++-+.|++.++.. -++. +..+.+=++.|.+.+-+|--+++|-..
T Consensus 371 ~~dF~~~l~~AD~v~l~~VY~A~e~~~~g~~~~~~l~~~i~~~-~~~~-----~~~~~~~~~~l~~~~~~gD~il~mGAG 444 (459)
T COG0773 371 LDDFAKALSDADEVILLDVYAAGEEPIEGDVSSEDLAEKIRQP-GHVD-----VPDLDDLVELLAKVAQPGDVILFMGAG 444 (459)
T ss_pred HHHHHHHHhcCCEEEEecccccCCCCCcCCccHHHHHHHhhcC-Cccc-----CCCHHHHHHHHHhhCCCCCEEEEecCC
Confidence 555553223888644433334666654 7 9999999999884 3333 788899999999999999999999999
Q ss_pred chHHHHHHHHH
Q psy11447 345 EIPQILQQIFS 355 (360)
Q Consensus 345 ~lP~il~qIf~ 355 (360)
+|-++.+++..
T Consensus 445 di~~~~~~~~~ 455 (459)
T COG0773 445 DIGKIARELLE 455 (459)
T ss_pred cHHHHHHHHHH
Confidence 99999888765
No 57
>PF03456 uDENN: uDENN domain; InterPro: IPR005113 This region is always found associated with IPR001194 from INTERPRO. It is predicted to form an all beta domain [].; PDB: 3TW8_A.
Probab=49.69 E-value=4.3 Score=30.69 Aligned_cols=36 Identities=22% Similarity=0.348 Sum_probs=23.8
Q ss_pred hccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCc-cccCC
Q psy11447 250 AQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANL-DRYGI 295 (360)
Q Consensus 250 ~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL-~RYgI 295 (360)
.+||+.+.-.+.... . ...+.|++||+|.+= ..||.
T Consensus 24 ~~FCfP~G~~~~~~~--------~--~~~~~f~FvLT~~~G~r~Yg~ 60 (65)
T PF03456_consen 24 PMFCFPDGIEISSQS--------R--PPPQFFSFVLTDEDGSRLYGY 60 (65)
T ss_dssp HHHHS-S-CCCCGGG-----------GSSCEEEEEEE-TTS-EEEEE
T ss_pred CccCCCCCcEeeccc--------c--CCCeEEEEEEECCCCCEEEEE
Confidence 789999986655444 2 557899999999984 46764
No 58
>PRK00421 murC UDP-N-acetylmuramate--L-alanine ligase; Provisional
Probab=47.90 E-value=1.9e+02 Score=29.54 Aligned_cols=154 Identities=16% Similarity=0.248 Sum_probs=89.2
Q ss_pred ccCCchhhHHHHh---hccccceeEeeeeccCCCCcccccc-CCCCCCCchHHHHHHHHHHH-hhccccCCcchHHHHHH
Q psy11447 191 NTGSDLGTSLEAF---QDFDQRISLDIIGHSGETHSIPFVT-SNNLPRDNKQRLEVIKIMHA-HAQFCASGDYTLESIRA 265 (360)
Q Consensus 191 fn~~DL~rvMEaf---~g~e~k~~ydivGHSGd~~~i~lV~-~~~~Pk~~keRl~vl~~M~a-H~Qfc~sGD~Tlea~~~ 265 (360)
.-|.+.+.+.+++ .+...|+.+ ++.. ++ +.++. ...-|.+-+.=++.++.+.. ..=.+..|.|+....+.
T Consensus 293 ~lgv~~~~i~~~l~~f~~~~~R~e~--~~~~-~g--~~~i~D~aHnp~~~~a~~~al~~~~~~~~i~~v~gp~~~~r~kd 367 (461)
T PRK00421 293 ELGIDDEAIREALATFKGVKRRFEE--KGEV-GG--VVLIDDYAHHPTEIKATLKAARQGYPDKRIVAVFQPHRYSRTRD 367 (461)
T ss_pred HcCCCHHHHHHHHHhCCCCCcccEE--EEec-CC--cEEEEeCCCCHHHHHHHHHHHHhhCCCCeEEEEECCCCCccHHH
Confidence 3367755555554 455677764 5532 22 23443 33346666666666665432 11245556666555444
Q ss_pred HHHHHHhhcccCCCeEEEEEec-CCccc--cCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecC
Q psy11447 266 SIQDLTTECAQHDDSILIVLSD-ANLDR--YGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMD 342 (360)
Q Consensus 266 ai~~~a~~~~daDe~fVivlSD-ANL~R--YgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~d 342 (360)
-...+++....+|. |++.++ +.-+. =|+++++|.+.+.. +..+ ++++.++-+-.+.+.+...+|-..++|=
T Consensus 368 ~~~~~~~~l~~~d~--vi~~~~~~~~e~~~~~~~~~~l~~~~~~--~~~~--~~~~~~~~~a~~~a~~~a~~gD~vlv~G 441 (461)
T PRK00421 368 LLDEFAEALSDADE--VILLDIYAAGEEPIGGVDSEDLARKIKR--GHRD--PIFVPDLEDLAELLAEVLKPGDLVLTMG 441 (461)
T ss_pred HHHHHHHHHHHCCE--EEEcCccCCCCCCCCCCCHHHHHHHHhc--cCCc--eEEeCCHHHHHHHHHHhcCCCCEEEEEC
Confidence 33333321114554 223222 11121 26889999999865 2233 4567887666666777778899999999
Q ss_pred CCchHHHHHHHHH
Q psy11447 343 LTEIPQILQQIFS 355 (360)
Q Consensus 343 t~~lP~il~qIf~ 355 (360)
..++..+.+.|+.
T Consensus 442 ~g~~~~~~~~~~~ 454 (461)
T PRK00421 442 AGDITKLARALLE 454 (461)
T ss_pred CCCHHHHHHHHHH
Confidence 9999998888764
No 59
>COG3414 SgaB Phosphotransferase system, galactitol-specific IIB component [Carbohydrate transport and metabolism]
Probab=47.81 E-value=46 Score=27.87 Aligned_cols=79 Identities=22% Similarity=0.327 Sum_probs=53.3
Q ss_pred ccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhC
Q psy11447 253 CASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQEL 332 (360)
Q Consensus 253 c~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~L 332 (360)
|=+|=.|==-++..|+++.++ +||++..=.-..+.-....+++=||+.| .++..++
T Consensus 8 CG~GvgSS~~ik~kve~~l~~-------------------~gi~~~~~~~~v~~~~~~~~~aDiiv~s-----~~l~~~~ 63 (93)
T COG3414 8 CGNGVGSSTMIKMKVEEVLKE-------------------LGIDVDVEQCAVDEIKALTDGADIIVTS-----TKLADEF 63 (93)
T ss_pred CCCCccHHHHHHHHHHHHHHH-------------------cCCCceeeeEEecccccCCCcccEEEEe-----hHhhhhc
Confidence 666666666777788877664 4444332222333334566778899988 4455555
Q ss_pred ---CCCceEEecCCCchHHHHHHHHH
Q psy11447 333 ---PAGRGFVCMDLTEIPQILQQIFS 355 (360)
Q Consensus 333 ---p~Gr~fv~~dt~~lP~il~qIf~ 355 (360)
|.|+.|+.-+.-+.+.+-++|+.
T Consensus 64 ~~~~~~~v~~~~~~~d~~ei~~~l~~ 89 (93)
T COG3414 64 EDIPKGYVVITGNGMDIEEIKQKLLE 89 (93)
T ss_pred CcCCCceEEEEcccCCHHHHHHHHHH
Confidence 55899999999999999998875
No 60
>TIGR03249 KdgD 5-dehydro-4-deoxyglucarate dehydratase. 5-dehydro-4-deoxyglucarate dehydratase not only catalyzes the dehydration of the substrate (diol to ketone + water), but causes the decarboxylation of the intermediate product to yield 2-oxoglutarate semialdehyde (2,5-dioxopentanoate). The gene for the enzyme is usually observed in the vicinity of transporters and dehydratases handling D-galactarate and D-gluconate as well as aldehyde dehydrogenases which convert the product to alpha-ketoglutarate.
Probab=47.80 E-value=2.7e+02 Score=26.97 Aligned_cols=60 Identities=18% Similarity=0.264 Sum_probs=38.0
Q ss_pred hhhHHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCcchHHHHHHH
Q psy11447 196 LGTSLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGDYTLESIRAS 266 (360)
Q Consensus 196 L~rvMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD~Tlea~~~a 266 (360)
|.++.+-+-..--+-- =+.|++|+.+.+ +..||.+|++....+.+ ++-.|.+|-+|++.|
T Consensus 28 l~~li~~l~~~Gv~gi-~v~GstGE~~~L----------t~eEr~~v~~~~~~~~~g~~pvi~gv~~~t~~ai~~a 92 (296)
T TIGR03249 28 YRENIEWLLGYGLEAL-FAAGGTGEFFSL----------TPAEYEQVVEIAVSTAKGKVPVYTGVGGNTSDAIEIA 92 (296)
T ss_pred HHHHHHHHHhcCCCEE-EECCCCcCcccC----------CHHHHHHHHHHHHHHhCCCCcEEEecCccHHHHHHHH
Confidence 5555555432211112 278999999874 57899999998877654 344455676666654
No 61
>PF06888 Put_Phosphatase: Putative Phosphatase; InterPro: IPR016965 This group represents phosphatases related to PHOSPHO1 and PHOSPHO2 []. It includes plant phosphatases with homology to the haloacid dehalogenase (HAD) superfamily [, ]. PHOSPHO1 is a phosphoethanolamine/phosphocholine phosphatase [], while PHOSPHO2 has high activity toward pyridoxal 5'-phosphate (PLP), and it is active at much lower level toward pyrophosphate, phosphoethanolamine (PEA)and phosphocholine (PCho) []. ; GO: 0016791 phosphatase activity
Probab=47.44 E-value=9 Score=36.81 Aligned_cols=78 Identities=24% Similarity=0.373 Sum_probs=41.1
Q ss_pred cccccCCCCCCCchHHHH---HHHHHHHhhccccCCcchHHHHHHHHHH-------------HHhhcccCCCeEEEEEec
Q psy11447 224 IPFVTSNNLPRDNKQRLE---VIKIMHAHAQFCASGDYTLESIRASIQD-------------LTTECAQHDDSILIVLSD 287 (360)
Q Consensus 224 i~lV~~~~~Pk~~keRl~---vl~~M~aH~Qfc~sGD~Tlea~~~ai~~-------------~a~~~~daDe~fVivlSD 287 (360)
++....+..|+.-+++++ ...-|-..-++-.+-.-|.+.++.+++. +++ .....=+++|||
T Consensus 21 ~~~l~~~~~~~~l~~~~~~~~wt~~m~~vl~~L~~~gvt~~~I~~~l~~ip~~pgm~~~l~~l~~---~~~~~~~~IiSD 97 (234)
T PF06888_consen 21 IELLPPEELPEELRESYPKGGWTEYMDRVLQLLHEQGVTPEDIRDALRSIPIDPGMKELLRFLAK---NQRGFDLIIISD 97 (234)
T ss_pred HHhcCCcccHHHHHHhccccchHHHHHHHHHHHHHcCCCHHHHHHHHHcCCCCccHHHHHHHHHh---cCCCceEEEEeC
Confidence 333334444444444444 3333333333333334677766665554 432 123455999999
Q ss_pred CC-------ccccCCChhHHHHHHh
Q psy11447 288 AN-------LDRYGIAPREMGEALN 305 (360)
Q Consensus 288 AN-------L~RYgI~p~~l~~~l~ 305 (360)
|| |+.|||. ..+.++.+
T Consensus 98 aNs~fI~~iL~~~gl~-~~f~~I~T 121 (234)
T PF06888_consen 98 ANSFFIETILEHHGLR-DCFSEIFT 121 (234)
T ss_pred CcHhHHHHHHHhCCCc-cccceEEe
Confidence 99 8889986 33434333
No 62
>TIGR01082 murC UDP-N-acetylmuramate--alanine ligase. UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase (murein tripeptide ligase) is described by TIGR01081.
Probab=47.11 E-value=2.2e+02 Score=28.98 Aligned_cols=153 Identities=17% Similarity=0.292 Sum_probs=89.5
Q ss_pred CCchhhHHHHhhcc---ccceeEeeeeccCCCCccccccC-CCCCCCchHHHHHHHHHHHh-hccccCCcchHHHHHHHH
Q psy11447 193 GSDLGTSLEAFQDF---DQRISLDIIGHSGETHSIPFVTS-NNLPRDNKQRLEVIKIMHAH-AQFCASGDYTLESIRASI 267 (360)
Q Consensus 193 ~~DL~rvMEaf~g~---e~k~~ydivGHSGd~~~i~lV~~-~~~Pk~~keRl~vl~~M~aH-~Qfc~sGD~Tlea~~~ai 267 (360)
|.+.+.+-+++..| ..|+. +++.. +.+.++.. ..-|++-+.=++.++.+... .=++..|-|+.-..+.-.
T Consensus 289 gi~~~~i~~~l~~f~~~~~R~e--~~~~~---~gv~~i~D~ahn~~~~~a~~~al~~~~~~~~ii~i~g~~~~~r~k~~~ 363 (448)
T TIGR01082 289 GIDFEAILRALANFQGVKRRFE--ILGEF---GGVLLIDDYAHHPTEIKATLKAARQGYPDKRIVVVFQPHRYSRTRDLF 363 (448)
T ss_pred CCCHHHHHHHHHhCCCCCccce--EEEEe---CCeEEEEcCCCCHHHHHHHHHHHHHhcCCCeEEEEECCCCCccHHHHH
Confidence 77777766666655 55665 45432 22555554 33577777777777765311 113555555554433323
Q ss_pred HHHHhhcccCCCeEEEEEecCCccc-cCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecCCCch
Q psy11447 268 QDLTTECAQHDDSILIVLSDANLDR-YGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMDLTEI 346 (360)
Q Consensus 268 ~~~a~~~~daDe~fVivlSDANL~R-YgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~dt~~l 346 (360)
+++++.-..+|..+++-+.+++=+- -|.++++|.+.+.+. ..++ ++++.++-+-.+.+.+...+|-..++|=..++
T Consensus 364 ~~~~~~l~~~d~v~l~~~~~~~~~~~~g~~~~~i~~~~~~~-~~~~--~~~~~~~~~a~~~a~~~a~~gD~VLl~G~g~~ 440 (448)
T TIGR01082 364 DDFAKVLSDADELILLDIYAAGEEPINGIDGKSLARKITQL-GKIE--PYFVPDLAELVEFLAAVLQSGDLILTMGAGDI 440 (448)
T ss_pred HHHHHHHHhCCEEEEecccCCCCCCCCCCCHHHHHHHHhhc-CCCc--eEEeCCHHHHHHHHHHhcCCCCEEEEECCCCH
Confidence 4443321145643333333333222 389999999999753 2122 34567766666667677788999999998888
Q ss_pred HHHHHHH
Q psy11447 347 PQILQQI 353 (360)
Q Consensus 347 P~il~qI 353 (360)
-.+-++|
T Consensus 441 ~~~~~~~ 447 (448)
T TIGR01082 441 IKLAREL 447 (448)
T ss_pred HHHHHhh
Confidence 8877665
No 63
>cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain.
Probab=46.59 E-value=1.7e+02 Score=25.50 Aligned_cols=48 Identities=15% Similarity=0.016 Sum_probs=24.3
Q ss_pred ChhHHHHHHhcCcccccEEEEEecCchHHHHHHH-------hhCCCCceEEecCCCchH
Q psy11447 296 APREMGEALNAEAASVRAYAIFIGSLGDQAKRLT-------QELPAGRGFVCMDLTEIP 347 (360)
Q Consensus 296 ~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~-------~~Lp~Gr~fv~~dt~~lP 347 (360)
....+.+.|.++ |++.+.+.. + +..|..+. ..+|..-..++.|.....
T Consensus 164 ~~~~~~~~l~~~-~~~~~i~~~--~-~~~a~~~~~al~~~g~~~~~~i~i~~~d~~~~~ 218 (264)
T cd06267 164 GYEAARELLASG-ERPTAIFAA--N-DLMAIGALRALRELGLRVPEDVSVVGFDDIPLA 218 (264)
T ss_pred HHHHHHHHHhcC-CCCcEEEEc--C-cHHHHHHHHHHHHhCCCCCCceEEEeeCCCchh
Confidence 345666777776 766654322 2 22332221 123444467777766543
No 64
>PRK10997 yieM hypothetical protein; Provisional
Probab=46.23 E-value=2.1e+02 Score=30.65 Aligned_cols=76 Identities=20% Similarity=0.261 Sum_probs=43.4
Q ss_pred HHHHHHHHHHHhhccccCCcchH-HHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHH---hcCcccccE
Q psy11447 238 QRLEVIKIMHAHAQFCASGDYTL-ESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEAL---NAEAASVRA 313 (360)
Q Consensus 238 eRl~vl~~M~aH~Qfc~sGD~Tl-ea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l---~~~~p~V~a 313 (360)
.+-++++.+. .. +.|...+ .|.+.+++.+.+. ..-...|||+||=-- -++ |.++-+.+ +.. -+.+-
T Consensus 381 gl~~ll~fL~--~~--f~GGTDl~~aL~~al~~l~~~--~~r~adIVVISDF~~--~~~-~eel~~~L~~Lk~~-~~~rf 450 (487)
T PRK10997 381 GLEQAIRFLS--QS--FRGGTDLAPCLRAIIEKMQGR--EWFDADAVVISDFIA--QRL-PDELVAKVKELQRQ-HQHRF 450 (487)
T ss_pred CHHHHHHHHH--Hh--cCCCCcHHHHHHHHHHHHccc--ccCCceEEEECCCCC--CCC-hHHHHHHHHHHHHh-cCcEE
Confidence 3444555553 22 3555444 4556677776532 333466999999521 122 55555544 332 57788
Q ss_pred EEEEecCchH
Q psy11447 314 YAIFIGSLGD 323 (360)
Q Consensus 314 ~~IfIgslgd 323 (360)
|.++||.-|.
T Consensus 451 ~~l~i~~~~~ 460 (487)
T PRK10997 451 HAVAMSAHGK 460 (487)
T ss_pred EEEEeCCCCC
Confidence 9999997554
No 65
>cd05294 LDH-like_MDH_nadp A lactate dehydrogenases-like structure with malate dehydrogenase enzymatic activity. The LDH-like MDH proteins have a lactate dehyhydrogenase-like (LDH-like) structure and malate dehydrogenase (MDH) enzymatic activity. This subgroup is composed of some archaeal LDH-like MDHs that prefer NADP(H) rather than NAD(H) as a cofactor. One member, MJ0490 from Methanococcus jannaschii, has been observed to form dimers and tetramers during crystalization, although it is believed to exist primarilly as a tetramer in solution. In addition to its MDH activity, MJ0490 also possesses fructose-1,6-bisphosphate-activated LDH activity. Members of this subgroup have a higher sequence similarity to LDHs than to other MDHs. LDH catalyzes the last step of glycolysis in which pyruvate is converted to L-lactate. MDH is one of the key enzymes in the citric acid cycle, facilitating both the conversion of malate to oxaloacetate and replenishing levels of oxalacetate by reductive carbox
Probab=45.79 E-value=69 Score=31.44 Aligned_cols=88 Identities=16% Similarity=0.312 Sum_probs=59.5
Q ss_pred CCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecC-C-----c-cc
Q psy11447 220 ETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDA-N-----L-DR 292 (360)
Q Consensus 220 d~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDA-N-----L-~R 292 (360)
.+.+|-++..|.|++.+..|++.++. +++-++..++.+.+. +.+.+|||.|-. | + +.
T Consensus 71 ~~aDiViitag~p~~~~~~r~dl~~~-------------n~~i~~~~~~~i~~~---~~~~~viv~~npvd~~t~~~~~~ 134 (309)
T cd05294 71 AGSDIVIITAGVPRKEGMSRLDLAKK-------------NAKIVKKYAKQIAEF---APDTKILVVTNPVDVMTYKALKE 134 (309)
T ss_pred CCCCEEEEecCCCCCCCCCHHHHHHH-------------HHHHHHHHHHHHHHH---CCCeEEEEeCCchHHHHHHHHHh
Confidence 45667889999999988889888765 556667777777654 346778888742 1 1 22
Q ss_pred cCCChhH----------------HHHHHhcCcccccEEEEEecCchHHH
Q psy11447 293 YGIAPRE----------------MGEALNAEAASVRAYAIFIGSLGDQA 325 (360)
Q Consensus 293 YgI~p~~----------------l~~~l~~~~p~V~a~~IfIgslgdqA 325 (360)
-|++|.+ |++.|.-+..+|++++| |.-||..
T Consensus 135 ~g~~~~~viG~gt~LDs~R~~~~la~~l~v~~~~v~~~vi--GeHg~s~ 181 (309)
T cd05294 135 SGFDKNRVFGLGTHLDSLRFKVAIAKHFNVHISEVHTRII--GEHGDSM 181 (309)
T ss_pred cCCCHHHEeeccchHHHHHHHHHHHHHHCcChHHeEEEEE--ecCCCce
Confidence 2566554 45556666566887775 9988853
No 66
>PRK03620 5-dehydro-4-deoxyglucarate dehydratase; Provisional
Probab=45.11 E-value=3e+02 Score=26.81 Aligned_cols=60 Identities=18% Similarity=0.197 Sum_probs=38.9
Q ss_pred hhhHHHHhhcccccee-EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCcchHHHHHHHH
Q psy11447 196 LGTSLEAFQDFDQRIS-LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGDYTLESIRASI 267 (360)
Q Consensus 196 L~rvMEaf~g~e~k~~-ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD~Tlea~~~ai 267 (360)
|.++.+-+-. .-++ +=+-|++|+.+.+ +..||.++++....+++ +|-.|.+|-++++.+-
T Consensus 30 l~~li~~l~~--~Gv~Gi~~~GstGE~~~L----------t~eEr~~~~~~~~~~~~~~~pvi~gv~~~t~~~i~~~~ 95 (303)
T PRK03620 30 YREHLEWLAP--YGAAALFAAGGTGEFFSL----------TPDEYSQVVRAAVETTAGRVPVIAGAGGGTAQAIEYAQ 95 (303)
T ss_pred HHHHHHHHHH--cCCCEEEECcCCcCcccC----------CHHHHHHHHHHHHHHhCCCCcEEEecCCCHHHHHHHHH
Confidence 5555554432 1221 2377999999874 57799999998877764 4555667777766543
No 67
>TIGR02313 HpaI-NOT-DapA 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase. This model represents a subset of the DapA (dihydrodipicolinate synthase) family which has apparently evolved a separate function. The product of DapA, dihydrodipicolinate, results from the non-enzymatic cyclization and dehydration of 6-amino-2,4-dihydroxyhept-2-ene-1,7-dioic acid, which is different from the substrate of this reaction only in the presence of the amino group. In the absence of this amino group, and running the reaction in the opposite direction, the reaction corresponds to the HpaI aldolase component of the 4-hydroxyphenylacetic acid catabolism pathway (see TIGR02311). At present, this variant of DapA is found only in Oceanobacillus iheyensis HTE831 and Thermus thermophilus HB27. In both of these cases, one or more other DapA genes can be found and the one identified by this model is part of an operon for 4-hydroxyphenylacetic acid catabolism.
Probab=44.85 E-value=2.6e+02 Score=27.26 Aligned_cols=95 Identities=16% Similarity=0.123 Sum_probs=56.2
Q ss_pred eeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCc-chHHHHHHHHHHHHhhcccCCCe------
Q psy11447 213 DIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGD-YTLESIRASIQDLTTECAQHDDS------ 280 (360)
Q Consensus 213 divGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD-~Tlea~~~ai~~~a~~~~daDe~------ 280 (360)
=+.|.+|+.+. -+..||.++++....++. ++-.|. +|-++++.|-. .++- -||--
T Consensus 39 ~v~GstGE~~~----------Ls~~Er~~l~~~~~~~~~g~~pvi~gv~~~~t~~ai~~a~~-A~~~--Gad~v~v~pP~ 105 (294)
T TIGR02313 39 SVGGTSGEPGS----------LTLEERKQAIENAIDQIAGRIPFAPGTGALNHDETLELTKF-AEEA--GADAAMVIVPY 105 (294)
T ss_pred EECccCccccc----------CCHHHHHHHHHHHHHHhCCCCcEEEECCcchHHHHHHHHHH-HHHc--CCCEEEEcCcc
Confidence 37899999886 468999999998887664 344454 44455554433 2221 23311
Q ss_pred ----------------------EEEEEecCC-ccccCCChhHHHHHHhcCcccccEEEEEecCc
Q psy11447 281 ----------------------ILIVLSDAN-LDRYGIAPREMGEALNAEAASVRAYAIFIGSL 321 (360)
Q Consensus 281 ----------------------fVivlSDAN-L~RYgI~p~~l~~~l~~~~p~V~a~~IfIgsl 321 (360)
+=|++-+.- ..-+.|+|..+.++.... |+|.+.=.--|.+
T Consensus 106 y~~~~~~~l~~~f~~ia~a~~~lpv~iYn~P~~tg~~l~~~~l~~L~~~~-pnv~giK~ss~d~ 168 (294)
T TIGR02313 106 YNKPNQEALYDHFAEVADAVPDFPIIIYNIPGRAAQEIAPKTMARLRKDC-PNIVGAKESNKDF 168 (294)
T ss_pred CCCCCHHHHHHHHHHHHHhccCCCEEEEeCchhcCcCCCHHHHHHHHhhC-CCEEEEEeCCCCH
Confidence 112222211 234667888888887665 8877665544443
No 68
>cd08605 GDPD_GDE5_like_1_plant Glycerophosphodiester phosphodiesterase domain of uncharacterized plant glycerophosphodiester phosphodiesterase-like proteins similar to mammalian GDE5. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized plant glycerophosphodiester phosphodiesterase (GP-PDE)-like proteins. Members in this family show very high sequence homology to mammalian glycerophosphodiester phosphodiesterase GDE5 and are distantly related to plant GP-PDEs.
Probab=44.19 E-value=42 Score=31.93 Aligned_cols=35 Identities=17% Similarity=0.176 Sum_probs=25.6
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
-||+.|.+.|++.=+. -.|.| |..+||+=|.+|+|
T Consensus 25 ENTl~Af~~A~~~Gad-~vE~DV~lTkDg~~VV~HD~~l~r 64 (282)
T cd08605 25 ENTIASFIAASKFGAD-FVEFDVQVTRDGVPVIWHDDFIVV 64 (282)
T ss_pred CcHHHHHHHHHHcCCC-EEEEEEEECcCCeEEEECCCceec
Confidence 5899999999974221 11222 67889999999987
No 69
>PF05762 VWA_CoxE: VWA domain containing CoxE-like protein; InterPro: IPR008912 This group of proteins contains a VWA type domain and the function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon in several bacteria [].
Probab=42.76 E-value=1.3e+02 Score=28.00 Aligned_cols=99 Identities=18% Similarity=0.245 Sum_probs=59.6
Q ss_pred HHHHHHHHHHHhhccccCCcch--HHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEE
Q psy11447 238 QRLEVIKIMHAHAQFCASGDYT--LESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYA 315 (360)
Q Consensus 238 eRl~vl~~M~aH~Qfc~sGD~T--lea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~ 315 (360)
+..+.+..+.+-.+. |.| .| -+|.+.+++..+.. ..-...||++||+ + + .-.+..+.+.|..-..++ ..+
T Consensus 111 ~~~~~l~~~~~~~~~-~~G-gTdi~~aL~~~~~~~~~~--~~~~t~vvIiSDg-~-~-~~~~~~~~~~l~~l~~r~-~rv 182 (222)
T PF05762_consen 111 DPEEALARLSALVQS-FGG-GTDIGQALREFLRQYARP--DLRRTTVVIISDG-W-D-TNDPEPLAEELRRLRRRG-RRV 182 (222)
T ss_pred CHHHHHHHHHhhccC-CCC-ccHHHHHHHHHHHHhhcc--cccCcEEEEEecc-c-c-cCChHHHHHHHHHHHHhC-CEE
Confidence 334566666543444 666 56 57778888877643 3346899999997 1 1 223444455554332222 356
Q ss_pred EEecCc--------hHHHHHHHhhCCCCceEEecCCCch
Q psy11447 316 IFIGSL--------GDQAKRLTQELPAGRGFVCMDLTEI 346 (360)
Q Consensus 316 IfIgsl--------gdqA~~l~~~Lp~Gr~fv~~dt~~l 346 (360)
|.+-.+ ...+..++.-+| +.+.|.+..+|
T Consensus 183 iwLnP~~~~~~~~~~~~~~~~~~~~~--~v~~~~~~~~l 219 (222)
T PF05762_consen 183 IWLNPLPRAGWPGYDPVARGYRAALP--YVDACRPLHQL 219 (222)
T ss_pred EEECCcccccCCCCChHHHHHHHhCC--hhhhcCChhhh
Confidence 777665 335677777777 56777777655
No 70
>PLN02602 lactate dehydrogenase
Probab=40.60 E-value=1e+02 Score=31.24 Aligned_cols=87 Identities=23% Similarity=0.406 Sum_probs=58.7
Q ss_pred CCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC-------cccc
Q psy11447 221 THSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN-------LDRY 293 (360)
Q Consensus 221 ~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN-------L~RY 293 (360)
+.+|-++.+|.|++...+|++.+.. +++-++.-++.+.+- +-+.++|++|... .+.-
T Consensus 105 daDiVVitAG~~~k~g~tR~dll~~-------------N~~I~~~i~~~I~~~---~p~~ivivvtNPvdv~t~~~~k~s 168 (350)
T PLN02602 105 GSDLCIVTAGARQIPGESRLNLLQR-------------NVALFRKIIPELAKY---SPDTILLIVSNPVDVLTYVAWKLS 168 (350)
T ss_pred CCCEEEECCCCCCCcCCCHHHHHHH-------------HHHHHHHHHHHHHHH---CCCeEEEEecCchHHHHHHHHHHh
Confidence 6778899999999988899998875 555555666666543 4557888888521 3334
Q ss_pred CCChhHH----------------HHHHhcCcccccEEEEEecCchHHH
Q psy11447 294 GIAPREM----------------GEALNAEAASVRAYAIFIGSLGDQA 325 (360)
Q Consensus 294 gI~p~~l----------------~~~l~~~~p~V~a~~IfIgslgdqA 325 (360)
|++|+++ ++.|.-+..+|+++|| |.-||..
T Consensus 169 g~p~~rviG~gt~LDs~R~r~~lA~~l~v~~~~V~~~Vi--GeHGds~ 214 (350)
T PLN02602 169 GFPANRVIGSGTNLDSSRFRFLIADHLDVNAQDVQAYIV--GEHGDSS 214 (350)
T ss_pred CCCHHHEEeecchHHHHHHHHHHHHHhCCCccceeeeEE--ecCCCce
Confidence 5555543 4445566567888855 9888654
No 71
>PRK14093 UDP-N-acetylmuramoylalanyl-D-glutamyl-2,6-diaminopimelate--D-alanyl-D-alanine ligase; Provisional
Probab=40.45 E-value=69 Score=33.06 Aligned_cols=89 Identities=19% Similarity=0.197 Sum_probs=45.6
Q ss_pred chHHHHHHHHHHHHhhccc-CCCeEEEEEecC-CccccCC-ChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCC
Q psy11447 258 YTLESIRASIQDLTTECAQ-HDDSILIVLSDA-NLDRYGI-APREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPA 334 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~d-aDe~fVivlSDA-NL~RYgI-~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~ 334 (360)
||.++++.|++.+.+.. . ...+.++|+.|= .|..+.. -..++++.+... . --.||++|. .+..+.+.+..
T Consensus 348 hnP~s~~aaL~~l~~~~-~~~~~r~i~V~G~m~elg~~~~~~h~~~~~~~~~~--~-~d~v~~~G~---~~~~~~~~~~~ 420 (479)
T PRK14093 348 ANPASMAAALGVLGRAP-VGPQGRRIAVLGDMLELGPRGPELHRGLAEAIRAN--A-IDLVFCCGP---LMRNLWDALSS 420 (479)
T ss_pred CCHHHHHHHHHHHHhhh-ccCCCCEEEEECChHHcCcHHHHHHHHHHHHHHHc--C-CCEEEEEch---hHHHHHHhhcc
Confidence 56777777777776430 1 244677777662 3322211 124555555421 1 136666653 44556555654
Q ss_pred Cce-EEecCCCchHHHHHHH
Q psy11447 335 GRG-FVCMDLTEIPQILQQI 353 (360)
Q Consensus 335 Gr~-fv~~dt~~lP~il~qI 353 (360)
|+. +.+.|.+++.+.|+..
T Consensus 421 ~~~~~~~~~~~~~~~~l~~~ 440 (479)
T PRK14093 421 GKRGGYAEDAAALESQVVAA 440 (479)
T ss_pred cccceeeCCHHHHHHHHHHh
Confidence 432 3445666665555543
No 72
>TIGR01763 MalateDH_bact malate dehydrogenase, NAD-dependent. The annotation of Botryococcus braunii as lactate dehydrogenase appears top be in error. This was initially annotated as MDH by Swiss-Prot and then changed. The rationale for either of these annotations is not traceable.
Probab=39.86 E-value=80 Score=31.01 Aligned_cols=87 Identities=16% Similarity=0.251 Sum_probs=57.5
Q ss_pred CCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC------cccc-
Q psy11447 221 THSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN------LDRY- 293 (360)
Q Consensus 221 ~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN------L~RY- 293 (360)
..++-++..+.|++.+..|++.++. .++.++..++.+++- +.+.++|++|-.. +.+|
T Consensus 69 ~aDiVIitag~p~~~~~sR~~l~~~-------------N~~iv~~i~~~I~~~---~p~~~iIv~tNP~di~t~~~~~~s 132 (305)
T TIGR01763 69 NSDIVVITAGLPRKPGMSREDLLSM-------------NAGIVREVTGRIMEH---SPNPIIVVVSNPLDAMTYVAWQKS 132 (305)
T ss_pred CCCEEEEcCCCCCCcCCCHHHHHHH-------------HHHHHHHHHHHHHHH---CCCeEEEEecCcHHHHHHHHHHHH
Confidence 4567789999999988899987764 556677777777653 3457777777511 1111
Q ss_pred CCC----------------hhHHHHHHhcCcccccEEEEEecCchHHH
Q psy11447 294 GIA----------------PREMGEALNAEAASVRAYAIFIGSLGDQA 325 (360)
Q Consensus 294 gI~----------------p~~l~~~l~~~~p~V~a~~IfIgslgdqA 325 (360)
|.+ -..+++.|.-+..+|+++ .||.-||..
T Consensus 133 g~~~~rviG~g~~lds~R~~~~la~~l~v~~~~v~~~--v~GeHg~s~ 178 (305)
T TIGR01763 133 GFPKERVIGQAGVLDSARFRTFIAMELGVSVQDVTAC--VLGGHGDAM 178 (305)
T ss_pred CcCHHHEEEeccchHHHHHHHHHHHHhCcCHHHeeee--EEecCCCcE
Confidence 222 234566677775668888 589888763
No 73
>PF02796 HTH_7: Helix-turn-helix domain of resolvase; InterPro: IPR006120 Site-specific recombination plays an important role in DNA rearrangement in prokaryotic organisms. Two types of site-specific recombination are known to occur: Recombination between inverted repeats resulting in the reversal of a DNA segment. Recombination between repeat sequences on two DNA molecules resulting in their cointegration, or between repeats on one DNA molecule resulting in the excision of a DNA fragment. Site-specific recombination is characterised by a strand exchange mechanism that requires no DNA synthesis or high energy cofactor; the phosphodiester bond energy is conserved in a phospho-protein linkage during strand cleavage and re-ligation. Two unrelated families of recombinases are currently known []. The first, called the 'phage integrase' family, groups a number of bacterial phage and yeast plasmid enzymes. The second [], called the 'resolvase' family, groups enzymes which share the following structural characteristics: an N-terminal catalytic and dimerization domain that contains a conserved serine residue involved in the transient covalent attachment to DNA IPR006119 from INTERPRO, and a C-terminal helix-turn-helix DNA-binding domain. ; GO: 0000150 recombinase activity, 0003677 DNA binding, 0006310 DNA recombination; PDB: 1ZR2_A 2GM4_B 1RES_A 1ZR4_A 1RET_A 1GDT_B 2R0Q_C 1JKP_C 1IJW_C 1JJ6_C ....
Probab=39.34 E-value=52 Score=23.35 Aligned_cols=44 Identities=27% Similarity=0.471 Sum_probs=30.5
Q ss_pred CCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhc
Q psy11447 230 NNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNA 306 (360)
Q Consensus 230 ~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~ 306 (360)
|+||+-+++..+-+..|++. | .+|.++|+. +||+..-+-++|+.
T Consensus 1 GRp~~~~~~~~~~i~~l~~~------G--------~si~~IA~~-------------------~gvsr~TvyR~l~~ 44 (45)
T PF02796_consen 1 GRPPKLSKEQIEEIKELYAE------G--------MSIAEIAKQ-------------------FGVSRSTVYRYLNK 44 (45)
T ss_dssp SSSSSSSHCCHHHHHHHHHT------T----------HHHHHHH-------------------TTS-HHHHHHHHCC
T ss_pred CcCCCCCHHHHHHHHHHHHC------C--------CCHHHHHHH-------------------HCcCHHHHHHHHhc
Confidence 68999999888888888853 2 467777765 67777777766654
No 74
>PF00072 Response_reg: Response regulator receiver domain; InterPro: IPR001789 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Bipartite response regulator proteins are involved in a two-component signal transduction system in bacteria, and certain eukaryotes like protozoa, that functions to detect and respond to environmental changes []. These systems have been detected during host invasion, drug resistance, motility, phosphate uptake, osmoregulation, and nitrogen fixation, amongst others []. The two-component system consists of a histidine protein kinase environmental sensor that phosphorylates the receiver domain of a response regulator protein; phosphorylation induces a conformational change in the response regulator, which activates the effector domain, triggering the cellular response []. The domains of the two-component proteins are highly modular, but the core structures and activities are maintained. The response regulators act as phosphorylation-activated switches to affect a cellular response, usually by transcriptional regulation. Most of these proteins consist of two domains, an N-terminal response regulator receiver domain, and a variable C-terminal effector domain with DNA-binding activity. This entry represents the response regulator receiver domain, which belongs to the CheY family, and receives the signal from the sensor partner in the two-component system.; GO: 0000156 two-component response regulator activity, 0000160 two-component signal transduction system (phosphorelay), 0006355 regulation of transcription, DNA-dependent; PDB: 2QR3_A 2QXY_A 1I3C_A 1JLK_A 2PKX_A 2PL1_A 3H1F_A 3H1E_A 3GWG_A 3H1G_A ....
Probab=39.13 E-value=1.4e+02 Score=22.81 Aligned_cols=93 Identities=16% Similarity=0.187 Sum_probs=59.1
Q ss_pred CchHHHHHHHHHHHhhcc-ccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccE
Q psy11447 235 DNKQRLEVIKIMHAHAQF-CASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRA 313 (360)
Q Consensus 235 ~~keRl~vl~~M~aH~Qf-c~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a 313 (360)
++....+.+++...+..| ... +......|...+.+. .+=+++.|.++.. .+..++.+.|... + -+.
T Consensus 6 ~~~~~~~~l~~~l~~~~~~~v~---~~~~~~~~~~~~~~~------~~d~iiid~~~~~--~~~~~~~~~i~~~-~-~~~ 72 (112)
T PF00072_consen 6 DDPEIRELLEKLLERAGYEEVT---TASSGEEALELLKKH------PPDLIIIDLELPD--GDGLELLEQIRQI-N-PSI 72 (112)
T ss_dssp SSHHHHHHHHHHHHHTTEEEEE---EESSHHHHHHHHHHS------TESEEEEESSSSS--SBHHHHHHHHHHH-T-TTS
T ss_pred CCHHHHHHHHHHHHhCCCCEEE---EECCHHHHHHHhccc------CceEEEEEeeecc--ccccccccccccc-c-ccc
Confidence 466778888888887777 333 333344555555554 3668888988887 6777777777775 3 466
Q ss_pred EEEEecCchHHHHHHHhhCCCCc-eEEec
Q psy11447 314 YAIFIGSLGDQAKRLTQELPAGR-GFVCM 341 (360)
Q Consensus 314 ~~IfIgslgdqA~~l~~~Lp~Gr-~fv~~ 341 (360)
-+|++++-.+.+. ..+.+.+|= .|+..
T Consensus 73 ~ii~~t~~~~~~~-~~~~~~~g~~~~l~k 100 (112)
T PF00072_consen 73 PIIVVTDEDDSDE-VQEALRAGADDYLSK 100 (112)
T ss_dssp EEEEEESSTSHHH-HHHHHHTTESEEEES
T ss_pred cEEEecCCCCHHH-HHHHHHCCCCEEEEC
Confidence 7888887666443 333344453 34444
No 75
>cd05126 Mth938 Mth938 domain. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. The function of the protein has not been determined.
Probab=39.01 E-value=21 Score=30.78 Aligned_cols=52 Identities=23% Similarity=0.327 Sum_probs=36.1
Q ss_pred CCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhh-----CCCCceEEecCCCchHHHH
Q psy11447 294 GIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQE-----LPAGRGFVCMDLTEIPQIL 350 (360)
Q Consensus 294 gI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~-----Lp~Gr~fv~~dt~~lP~il 350 (360)
.++|++|..+|... |+ +|.||+--.++-.+... ...|-.+.||||..-=...
T Consensus 45 ~l~~~~l~~ll~~~-pe----ivliGTG~~~~~~~~~~~~~~l~~~Gi~ve~m~T~aAcrTY 101 (117)
T cd05126 45 GLQPEELEELLEEG-VE----VIVIGTGQSGALKVPPETVEKLEKRGVEVLVLPTEEAVKRY 101 (117)
T ss_pred cCCHHHHHHHHhcC-CC----EEEEcCCCCccccCCHHHHHHHHhcCCEEEEcChHHHHHHH
Confidence 58999999999887 77 89999943334221211 1368899999997654443
No 76
>cd08562 GDPD_EcUgpQ_like Glycerophosphodiester phosphodiesterase domain in Escherichia coli cytosolic glycerophosphodiester phosphodiesterase UgpQ and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Escherichia coli cytosolic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46), UgpQ, and similar proteins. GP-GDE plays an essential role in the metabolic pathway of E. coli. It catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. E. coli possesses two major G3P uptake systems: Glp and Ugp, which contain genes coding for two distinct GP-GDEs. UgpQ gene from the E. coli ugp operon codes for a cytosolic phosphodiesterase GlpQ, which is the prototype of this family. Various glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GPG)
Probab=38.68 E-value=29 Score=31.44 Aligned_cols=35 Identities=31% Similarity=0.478 Sum_probs=25.1
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
.||++|.+.|++.=+. ..|.| |..+||.-|.+|+|
T Consensus 13 ENT~~af~~A~~~gad-~iE~Dv~~TkDg~lvv~HD~~l~r 52 (229)
T cd08562 13 ENTLAAFRAAAELGVR-WVEFDVKLSGDGTLVLIHDDTLDR 52 (229)
T ss_pred chHHHHHHHHHHcCCC-EEEEEEeECCCCCEEEEcCCCCcc
Confidence 5899999999975332 11222 46688999999987
No 77
>cd08584 PI-PLCc_GDPD_SF_unchar2 Uncharacterized hypothetical proteins similar to the catalytic domains of Phosphoinositide-specific phospholipaseand Glycerophosphodiester phosphodiesterases. This subfamily corresponds to a group of uncharacterized hypothetical proteins similar to the catalytic domains of Phosphoinositide-specific phospholipase C (PI-PLC), and glycerophosphodiester phosphodiesterases (GP-GDE), and also sphingomyelinases D (SMases D) and similar proteins. They hydrolyze the 3'-5' phosphodiester bonds in different substrates, utilizing a similar mechanism of general base and acid catalysis involving two conserved histidine residues.
Probab=38.21 E-value=1.5e+02 Score=28.02 Aligned_cols=85 Identities=14% Similarity=0.228 Sum_probs=51.8
Q ss_pred chHHHHHHHHHHHHhhcccCC----CeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchH--HHHHHHhh
Q psy11447 258 YTLESIRASIQDLTTECAQHD----DSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGD--QAKRLTQE 331 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daD----e~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgd--qA~~l~~~ 331 (360)
||+.|.+.|++. -- .|.| +.-+||.=|++++|. | .|.++|..= |+.. ..|-|=+-|- +...+.++
T Consensus 7 NTl~AF~~A~~~-dg--vE~DVr~tDg~lVV~HD~~l~~~---P-tLeEvL~~~-~~~~-l~inIK~~~l~~~l~~li~~ 77 (192)
T cd08584 7 NTITALKRTFEN-FG--VETDIRDYGGQLVISHDPFVKNG---E-LLEDWLKEY-NHGT-LILNIKAEGLELRLKKLLAE 77 (192)
T ss_pred HHHHHHHHHHHC-CE--EEEEEEeeCCeEEEECCCCCCCC---C-CHHHHHHhc-cccc-EEEEECchhHHHHHHHHHHh
Confidence 899999999996 21 1222 456888999999974 3 377777664 5443 5566765442 33344444
Q ss_pred CCCCceEEecCCCchHHHHHHH
Q psy11447 332 LPAGRGFVCMDLTEIPQILQQI 353 (360)
Q Consensus 332 Lp~Gr~fv~~dt~~lP~il~qI 353 (360)
.... ..|+. .+..|.+++.+
T Consensus 78 ~~~~-~~vi~-ssf~~~~l~~~ 97 (192)
T cd08584 78 YGIT-NYFFL-DMSVPDIIKYL 97 (192)
T ss_pred cCCc-ceEEE-EcCCHHHHHHH
Confidence 4433 34444 45567776654
No 78
>PF01075 Glyco_transf_9: Glycosyltransferase family 9 (heptosyltransferase); InterPro: IPR002201 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 9 GT9 from CAZY comprises enzymes with two known activity; lipopolysaccharide N-acetylglucosaminyltransferase (2.4.1.56 from EC), heptosyltransferase (2.4 from EC). Heptosyltransferase I is thought to add L-glycero-D-manno-heptose to the inner 3-deoxy-D-manno-octulosonic acid (Kdo) residue of the lipopolysaccharide core []. Heptosyltransferase II is a glycosyltransferase involved in the synthesis of the inner core region of lipopolysaccharide []. Lipopolysaccharide is a major component of the outer leaflet of the outer membrane in Gram-negative bacteria. It is composed of three domains; lipid A, Core oligosaccharide and the O-antigen. These enzymes transfer heptose to the lipopolysaccharide core [].; GO: 0016757 transferase activity, transferring glycosyl groups, 0008152 metabolic process; PDB: 1PSW_A 2H1F_A 2GT1_A 3TOV_A 2H1H_A.
Probab=38.00 E-value=1e+02 Score=27.99 Aligned_cols=76 Identities=13% Similarity=0.153 Sum_probs=40.7
Q ss_pred CCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCC---ceEEecCCCchHHHHHHHH
Q psy11447 278 DDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAG---RGFVCMDLTEIPQILQQIF 354 (360)
Q Consensus 278 De~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~G---r~fv~~dt~~lP~il~qIf 354 (360)
+..+|++.-=|.-..=..+++.+++++..= .+-...+|++|+-+++.+.+.+.+.++ +......+..|.+...-|-
T Consensus 104 ~~~~i~i~~~a~~~~k~wp~e~~~~l~~~l-~~~~~~vvl~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~e~~ali~ 182 (247)
T PF01075_consen 104 DKPYIGINPGASWPSKRWPAEKWAELIERL-KERGYRVVLLGGPEEQEKEIADQIAAGLQNPVINLAGKTSLRELAALIS 182 (247)
T ss_dssp TSSEEEEE---SSGGGS--HHHHHHHHHHH-CCCT-EEEE--SSHHHHHHHHHHHHTTHTTTTEEETTTS-HHHHHHHHH
T ss_pred cCCeEEEeecCCCccccCCHHHHHHHHHHH-HhhCceEEEEccchHHHHHHHHHHHHhcccceEeecCCCCHHHHHHHHh
Confidence 445555555565577788998888887765 555577899999777444544444444 2344445566776554443
No 79
>PF06144 DNA_pol3_delta: DNA polymerase III, delta subunit; InterPro: IPR010372 DNA polymerase III, delta subunit (2.7.7.7 from EC) is required for, along with delta' subunit, the assembly of the processivity factor beta(2) onto primed DNA in the DNA polymerase III holoenzyme-catalysed reaction []. The delta subunit is also known as HolA.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0009360 DNA polymerase III complex; PDB: 3GLG_F 1XXH_A 1JQL_B 3GLF_F 1JQJ_C 3GLI_F.
Probab=37.66 E-value=2e+02 Score=24.60 Aligned_cols=77 Identities=16% Similarity=0.147 Sum_probs=44.1
Q ss_pred CCeEEEEEecCCc-cccC--CChhHHHHHHhcCcccccEEEEEecC-chHHHHHHHhhCCCC-c--eEEecCCCchHHHH
Q psy11447 278 DDSILIVLSDANL-DRYG--IAPREMGEALNAEAASVRAYAIFIGS-LGDQAKRLTQELPAG-R--GFVCMDLTEIPQIL 350 (360)
Q Consensus 278 De~fVivlSDANL-~RYg--I~p~~l~~~l~~~~p~V~a~~IfIgs-lgdqA~~l~~~Lp~G-r--~fv~~dt~~lP~il 350 (360)
.+.=||++.++++ .+.. =..+.|...+.. |.-+..+|++.. -.|....+.+.+... . .|...+..+++.++
T Consensus 56 ~~~klvii~~~~~l~~~~~~~~~~~l~~~l~~--~~~~~~lii~~~~~~~~~~k~~k~l~~~~~~~~~~~~~~~~~~~~i 133 (172)
T PF06144_consen 56 GDKKLVIIKNAPFLKDKLKKKEIKALIEYLSN--PPPDCILIIFSEEKLDKRKKLYKALKKQAIVIECKKPKEQELPRWI 133 (172)
T ss_dssp SSEEEEEEE-----TT-S-TTHHHHHHHHTTT----SSEEEEEEES-S--HHHHHHHHHTTTEEEEEE----TTTHHHHH
T ss_pred CCCeEEEEecCccccccccHHHHHHHHHHHhC--CCCCEEEEEEeCCchhhhhhHHHHHhcccceEEecCCCHHHHHHHH
Confidence 4566888888866 3221 223456666644 556778888877 667777888888742 2 23356899999999
Q ss_pred HHHHHh
Q psy11447 351 QQIFSA 356 (360)
Q Consensus 351 ~qIf~s 356 (360)
++.|.+
T Consensus 134 ~~~~~~ 139 (172)
T PF06144_consen 134 KERAKK 139 (172)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 999875
No 80
>PF00701 DHDPS: Dihydrodipicolinate synthetase family; InterPro: IPR002220 Dihydropicolinate synthase (DHDPS) is the key enzyme in lysine biosynthesis via the diaminopimelate pathway of prokaryotes, some phycomycetes and higher plants. The enzyme catalyses the condensation of L-aspartate-beta- semialdehyde and pyruvate to dihydropicolinic acid via a ping-pong mechanism in which pyruvate binds to the enzyme by forming a Schiff-base with a lysine residue []. Three other proteins are structurally related to DHDPS and probably also act via a similar catalytic mechanism. These are Escherichia coli N-acetylneuraminate lyase (4.1.3.3 from EC) (gene nanA), which catalyzes the condensation of N-acetyl-D-mannosamine and pyruvate to form N-acetylneuraminate; Rhizobium meliloti (Sinorhizobium meliloti) protein mosA [], which is involved in the biosynthesis of the rhizopine 3-o-methyl-scyllo-inosamine; and E. coli hypothetical protein yjhH. The sequences of DHDPS from different sources are well-conserved. The structure takes the form of a homotetramer, in which 2 monomers are related by an approximate 2-fold symmetry []. Each monomer comprises 2 domains: an 8-fold alpha-/beta-barrel, and a C-terminal alpha-helical domain. The fold resembles that of N-acetylneuraminate lyase. The active site lysine is located in the barrel domain, and has access via 2 channels on the C-terminal side of the barrel.; GO: 0016829 lyase activity, 0008152 metabolic process; PDB: 3B4U_B 3S8H_A 3QZE_B 1XXX_F 3L21_F 3IRD_A 3A5F_B 3G0S_B 3DAQ_C 3UQN_A ....
Probab=37.56 E-value=3.1e+02 Score=26.19 Aligned_cols=89 Identities=18% Similarity=0.286 Sum_probs=55.1
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc------cccCCcchHHHHHHHHHHHHhhcccCCCeEEE--
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ------FCASGDYTLESIRASIQDLTTECAQHDDSILI-- 283 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q------fc~sGD~Tlea~~~ai~~~a~~~~daDe~fVi-- 283 (360)
+=+.|++|+++.. +..||.++++....|.. .+.++.+|-++++.|-.- .+- .||-.+|+
T Consensus 39 l~~~GstGE~~~L----------t~~Er~~l~~~~~~~~~~~~~vi~gv~~~st~~~i~~a~~a-~~~--Gad~v~v~~P 105 (289)
T PF00701_consen 39 LVVLGSTGEFYSL----------TDEERKELLEIVVEAAAGRVPVIAGVGANSTEEAIELARHA-QDA--GADAVLVIPP 105 (289)
T ss_dssp EEESSTTTTGGGS-----------HHHHHHHHHHHHHHHTTSSEEEEEEESSSHHHHHHHHHHH-HHT--T-SEEEEEES
T ss_pred EEECCCCcccccC----------CHHHHHHHHHHHHHHccCceEEEecCcchhHHHHHHHHHHH-hhc--CceEEEEecc
Confidence 4567999999874 57899999999988765 233444666666665442 221 45533222
Q ss_pred -------------------------EEecCC-ccccCCChhHHHHHHhcCcccccEEE
Q psy11447 284 -------------------------VLSDAN-LDRYGIAPREMGEALNAEAASVRAYA 315 (360)
Q Consensus 284 -------------------------vlSDAN-L~RYgI~p~~l~~~l~~~~p~V~a~~ 315 (360)
++-+.- ...+.++|+.+.++.+ . |+|.+.=
T Consensus 106 ~~~~~s~~~l~~y~~~ia~~~~~pi~iYn~P~~tg~~ls~~~l~~L~~-~-~nv~giK 161 (289)
T PF00701_consen 106 YYFKPSQEELIDYFRAIADATDLPIIIYNNPARTGNDLSPETLARLAK-I-PNVVGIK 161 (289)
T ss_dssp TSSSCCHHHHHHHHHHHHHHSSSEEEEEEBHHHHSSTSHHHHHHHHHT-S-TTEEEEE
T ss_pred ccccchhhHHHHHHHHHHhhcCCCEEEEECCCccccCCCHHHHHHHhc-C-CcEEEEE
Confidence 222222 3466788888888777 4 8877654
No 81
>cd05560 Xcc1710_like Xcc1710_like family, specific to proteobacteria. Xcc1710 is a hypothetical protein from Xanthomonas campestris pv. campestris str. ATCC 33913, similar to Mth938, a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. Their three-dimensional structures have been determined, but their functions are unknown.
Probab=37.49 E-value=25 Score=29.80 Aligned_cols=47 Identities=19% Similarity=0.355 Sum_probs=33.7
Q ss_pred CCChhHHHHHHhcCcccccEEEEEecCchHHH--------HHHHhhCCCCceEEecCCCchHHH
Q psy11447 294 GIAPREMGEALNAEAASVRAYAIFIGSLGDQA--------KRLTQELPAGRGFVCMDLTEIPQI 349 (360)
Q Consensus 294 gI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA--------~~l~~~Lp~Gr~fv~~dt~~lP~i 349 (360)
-|+++.|..++..+ |+ +|.||. |..= +.|.+ .|-...||||..-=..
T Consensus 39 ~l~~e~l~~l~~~~-pe----iliiGT-G~~~~~~~~~~~~~l~~---~gi~vE~m~T~~AcrT 93 (109)
T cd05560 39 DLTAAHFEALLALQ-PE----VILLGT-GERQRFPPPALLAPLLA---RGIGVEVMDTQAACRT 93 (109)
T ss_pred cCCHHHHHHHHhcC-CC----EEEEec-CCCCCcCCHHHHHHHHH---cCCeEEEECHHHHHHH
Confidence 39999999999887 76 889998 4331 22333 5889999999754333
No 82
>smart00800 uDENN Domain always found upstream of DENN domain, found in a variety of signalling proteins. The uDENN domain is part of the tripartite DENN domain. It is always found upstream of the DENN domain itself, which is found in a variety of signalling proteins involved in Rab-mediated processes or regulation of MAPKs signalling pathways. The DENN domain is always encircled on both sides by more divergent domains, called uDENN (for upstream DENN) and dDENN (for downstream DENN). The function of the DENN domain remains to date unclear, although it appears to represent a good candidate for a GTP/GDP exchange activity.
Probab=37.12 E-value=17 Score=29.22 Aligned_cols=38 Identities=21% Similarity=0.414 Sum_probs=26.9
Q ss_pred hhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC-ccccCC
Q psy11447 249 HAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN-LDRYGI 295 (360)
Q Consensus 249 H~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN-L~RYgI 295 (360)
-.+||+..+-++...+ .. ...+.|++||+|.| -.|||+
T Consensus 46 i~~FCfP~G~~~~~~~-------~~--~~~~~f~FvLT~~dG~r~yG~ 84 (89)
T smart00800 46 IPLFCFPEGLLFVTQK-------SS--KDPQFFSFVLTDIDGSRRYGF 84 (89)
T ss_pred CCeeECCCCeEeeccc-------CC--CCCcEEEEEEECCCCCEEEEE
Confidence 3899998666554311 11 44679999999997 778885
No 83
>PRK04147 N-acetylneuraminate lyase; Provisional
Probab=36.63 E-value=79 Score=30.52 Aligned_cols=90 Identities=17% Similarity=0.209 Sum_probs=55.1
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCcc-hHHHHHHHHHHHHhhcccCCCeE----
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGDY-TLESIRASIQDLTTECAQHDDSI---- 281 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD~-Tlea~~~ai~~~a~~~~daDe~f---- 281 (360)
+=+.|++|+.+.+ +..||.++++....+++ ++-.|++ |-++++.|- ...+- .||--.
T Consensus 42 i~v~GstGE~~~L----------s~eEr~~~~~~~~~~~~~~~~viagvg~~~t~~ai~~a~-~a~~~--Gad~v~v~~P 108 (293)
T PRK04147 42 LYVGGSTGEAFLL----------STEEKKQVLEIVAEEAKGKVKLIAQVGSVNTAEAQELAK-YATEL--GYDAISAVTP 108 (293)
T ss_pred EEECCCccccccC----------CHHHHHHHHHHHHHHhCCCCCEEecCCCCCHHHHHHHHH-HHHHc--CCCEEEEeCC
Confidence 3488999999875 45999999999999887 5666664 555544432 22221 233111
Q ss_pred -----------------------EEEEecC-CccccCCChhHHHHHHhcCcccccEEEE
Q psy11447 282 -----------------------LIVLSDA-NLDRYGIAPREMGEALNAEAASVRAYAI 316 (360)
Q Consensus 282 -----------------------VivlSDA-NL~RYgI~p~~l~~~l~~~~p~V~a~~I 316 (360)
=|++-+. -...+.|+|+.+.++. .. |+|.+.=-
T Consensus 109 ~y~~~~~~~l~~~f~~va~a~~lPv~iYn~P~~tg~~l~~~~l~~L~-~~-pnvvgiK~ 165 (293)
T PRK04147 109 FYYPFSFEEICDYYREIIDSADNPMIVYNIPALTGVNLSLDQFNELF-TL-PKVIGVKQ 165 (293)
T ss_pred cCCCCCHHHHHHHHHHHHHhCCCCEEEEeCchhhccCCCHHHHHHHh-cC-CCEEEEEe
Confidence 1222221 1346678999999887 44 87755443
No 84
>TIGR01143 murF UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase. This family consists of the strictly bacterial MurF gene of peptidoglycan biosynthesis. This enzyme is almost always UDP-N-acetylmuramoylalanyl-D-glutamyl-2,6-diaminopimelate--D-alanyl-D-alanyl ligase, but in a few species, MurE adds lysine rather than diaminopimelate. This enzyme acts on the product from MurE activity, and so is also subfamily rather than equivalog. Staphylococcus aureus is an example of species in this MurF protein would differ.
Probab=36.63 E-value=1.1e+02 Score=30.91 Aligned_cols=84 Identities=24% Similarity=0.357 Sum_probs=53.0
Q ss_pred chHHHHHHHHHHHHhhcccCCCeEEEEEecC-CccccCC-ChhHHHHHHhcCccccc-EEEEEecCchHHHHHHHhhCCC
Q psy11447 258 YTLESIRASIQDLTTECAQHDDSILIVLSDA-NLDRYGI-APREMGEALNAEAASVR-AYAIFIGSLGDQAKRLTQELPA 334 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daDe~fVivlSDA-NL~RYgI-~p~~l~~~l~~~~p~V~-a~~IfIgslgdqA~~l~~~Lp~ 334 (360)
||.+|++.|++.+.+. . .+.++|+.+- .+..|.. -...+++.+.+. + ..+|++|. .+..+.+.++.
T Consensus 307 ~np~s~~~al~~l~~~--~--~r~i~VlG~~~e~G~~~~~~~~~l~~~~~~~----~~d~vi~~g~---~~~~~~~~~~~ 375 (417)
T TIGR01143 307 ANPDSMRAALDALARF--P--GKKILVLGDMAELGEYSEELHAEVGRYANSL----GIDLVFLVGE---EAAVIYDSLGC 375 (417)
T ss_pred CCHHHHHHHHHHHHhC--C--CCEEEEEcCchhcChHHHHHHHHHHHHHHHc----CCCEEEEECH---HHHHHHHhccc
Confidence 5889999999999653 3 4777888763 2333321 123466666442 3 56778764 55667777763
Q ss_pred CceEEecCCCchHHHHHHH
Q psy11447 335 GRGFVCMDLTEIPQILQQI 353 (360)
Q Consensus 335 Gr~fv~~dt~~lP~il~qI 353 (360)
..+.+.|.+++.+.+++.
T Consensus 376 -~~~~~~~~~~~~~~l~~~ 393 (417)
T TIGR01143 376 -KGFHFADKDELLAFLKLE 393 (417)
T ss_pred -CcEEECCHHHHHHHHHHh
Confidence 345567777777777654
No 85
>cd08563 GDPD_TtGDE_like Glycerophosphodiester phosphodiesterase domain of Thermoanaerobacter tengcongensis and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Thermoanaerobacter tengcongensis glycerophosphodiester phosphodiesterase (TtGDE, EC 3.1.4.46) and its uncharacterized homologs. Members in this family show high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. Despite the fact that most of GDPD family members exist as the monomer, TtGDE can function as a dimeric unit. Its catalytic mechanism is based on the general base-acid catalysis, which is similar to that of phosphoinositide-specific phospholipases C (PI-PLCs, EC 3.1.4.11). A divalent metal cation is required for the enzyme activity of TtGDE.
Probab=36.47 E-value=57 Score=29.91 Aligned_cols=37 Identities=27% Similarity=0.389 Sum_probs=26.6
Q ss_pred CcchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 256 GDYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
..||++|.+.|++.=+. ..|+| |..+||.-|.+|+|-
T Consensus 14 pENT~~Af~~A~~~g~~-~vE~DV~~TkDg~~Vv~HD~~l~r~ 55 (230)
T cd08563 14 PENTLLAFKKAIEAGAD-GIELDVHLTKDGQLVVIHDETVDRT 55 (230)
T ss_pred CchhHHHHHHHHHcCCC-EEEEEeeEcCCCCEEEECCCCcccc
Confidence 57999999999874221 11222 467889999999986
No 86
>cd06275 PBP1_PurR Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. This dimeric PurR belongs to the LacI-GalR family of transcription regulators and is activated to bind to DNA operator sites by initially binding either of high affinity corepressors, hypoxanthine or guanine. PurR is composed of two functional domains: aan N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the purine transcription repressor undergoes a
Probab=36.39 E-value=1.1e+02 Score=27.24 Aligned_cols=36 Identities=22% Similarity=0.136 Sum_probs=20.6
Q ss_pred ccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHh
Q psy11447 223 SIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTT 272 (360)
Q Consensus 223 ~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~ 272 (360)
.||+|..++.+.+ .++++.|.+--++.+.+.+.|.+
T Consensus 79 ~ipvV~i~~~~~~--------------~~~~~V~~d~~~~~~~~~~~l~~ 114 (269)
T cd06275 79 HIPMVVMDWGPED--------------DFADKIQDNSEEGGYLATRHLIE 114 (269)
T ss_pred CCCEEEEecccCC--------------CCCCeEeeCcHHHHHHHHHHHHH
Confidence 3677766665432 23455566666666666666654
No 87
>cd08602 GDPD_ScGlpQ1_like Glycerophosphodiester phosphodiesterase domain of Streptomycin coelicolor (GlpQ1) and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of putative bacterial and eukaryotic glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46) similar to Escherichia coli periplasmic phosphodiesterase GlpQ, as well as plant glycerophosphodiester phosphodiesterases (GP-PDEs), all of which catalyzes the Ca2+-dependent degradation of periplasmic glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. The prototypes of this family include putative secreted phosphodiesterase encoded by gene glpQ1 (SCO1565) from the pho regulon in Streptomyces coelicolor genome, and in plants, two distinct Arabidopsis thaliana genes, AT5G08030 and AT1G74210, coding putative GP-PDEs from the cell walls and vacuoles, respectively.
Probab=36.35 E-value=52 Score=32.38 Aligned_cols=35 Identities=23% Similarity=0.349 Sum_probs=25.8
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
.||++|.+.|++.=+. -.|.| |..+||+-|.+|+|
T Consensus 15 ENTl~Af~~A~~~Gad-~iE~DVqlTkDg~lVv~HD~~l~r 54 (309)
T cd08602 15 EHTLAAYQLAIEQGAD-FIEPDLVSTKDGVLICRHEPELSG 54 (309)
T ss_pred ccHHHHHHHHHHcCCC-EEEEeeeECCCCcEEEeCCCcccc
Confidence 6899999999984221 11223 67899999999998
No 88
>cd01460 vWA_midasin VWA_Midasin: Midasin is a member of the AAA ATPase family. The proteins of this family are unified by their common archetectural organization that is based upon a conserved ATPase domain. The AAA domain of midasin contains six tandem AAA protomers. The AAA domains in midasin is followed by a D/E rich domain that is following by a VWA domain. The members of this subgroup have a conserved MIDAS motif. The function of this domain is not exactly known although it has been speculated to play a crucial role in midasin function.
Probab=36.16 E-value=4.3e+02 Score=26.00 Aligned_cols=140 Identities=16% Similarity=0.216 Sum_probs=79.2
Q ss_pred HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcch--HHHHHHHHHHHHhhc--
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYT--LESIRASIQDLTTEC-- 274 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~T--lea~~~ai~~~a~~~-- 274 (360)
|.++++..+. -..-|++=+|+.. .+.+.+. |.+.+.=++++ ++|.+..+.| ..|+..|++.+....
T Consensus 89 Is~al~~Le~-g~vgVv~Fg~~~~--~v~Plt~-d~~~~a~~~~l------~~~~f~~~~Tni~~aL~~a~~~f~~~~~~ 158 (266)
T cd01460 89 VSKALTLLEV-GQLGVCSFGEDVQ--ILHPFDE-QFSSQSGPRIL------NQFTFQQDKTDIANLLKFTAQIFEDARTQ 158 (266)
T ss_pred HHHHHHhCcC-CcEEEEEeCCCce--EeCCCCC-CchhhHHHHHh------CcccCCCCCCcHHHHHHHHHHHHHhhhcc
Confidence 5666666544 3445666666532 2223333 22222222322 6677777777 566677777663210
Q ss_pred ccCC--CeEEEEEecCC--ccccCCChhHHHHHHhcCcccccEEEEEecCc-h----------------H-HHHHHHhhC
Q psy11447 275 AQHD--DSILIVLSDAN--LDRYGIAPREMGEALNAEAASVRAYAIFIGSL-G----------------D-QAKRLTQEL 332 (360)
Q Consensus 275 ~daD--e~fVivlSDAN--L~RYgI~p~~l~~~l~~~~p~V~a~~IfIgsl-g----------------d-qA~~l~~~L 332 (360)
.... ...+|+|||-- ... |+.+....++ .. .+|.-+.|.|-+- + . +-..--+.+
T Consensus 159 ~~s~~~~qlilLISDG~~~~~e-~~~~~~~r~a-~e--~~i~l~~I~ld~~~~~~SI~d~~~~~~~~~~~~~l~~Yl~~f 234 (266)
T cd01460 159 SSSGSLWQLLLIISDGRGEFSE-GAQKVRLREA-RE--QNVFVVFIIIDNPDNKQSILDIKVVSFKNDKSGVITPYLDEF 234 (266)
T ss_pred ccccccccEEEEEECCCcccCc-cHHHHHHHHH-HH--cCCeEEEEEEcCCCCCCCcccccccccCCCCccHHHHHHhcC
Confidence 0122 38999999976 222 3333333333 33 5688888888874 0 1 333445577
Q ss_pred CCCceEEecCCCchHHHHHH
Q psy11447 333 PAGRGFVCMDLTEIPQILQQ 352 (360)
Q Consensus 333 p~Gr~fv~~dt~~lP~il~q 352 (360)
|-----|+-|..+||.+|-.
T Consensus 235 pfpYy~~~~~~~~lp~~l~~ 254 (266)
T cd01460 235 PFPYYVIVRDLNQLPSVLSD 254 (266)
T ss_pred CCCeEEEecChhHhHHHHHH
Confidence 76666688999999997643
No 89
>PF09625 VP9: VP9 protein; InterPro: IPR018587 VP9 is a protein containing a ferredoxin fold. Two dimers come together to form one asymmetric unit which possesses a DNA recognition fold and specific metal binding sites possibly for zinc. It is postulated that being a non-structural protein VP9 is involved in the transcriptional regulation of the White spot syndrome virus (WSSV), from which it comes. WSSV is the major viral pathogen in shrimp aquaculture []. VP9 is found N-terminal to the IPR009766 from INTERPRO. ; PDB: 2ZUG_A 2GJ2_A 2GJI_A.
Probab=35.84 E-value=3.4 Score=34.07 Aligned_cols=36 Identities=19% Similarity=0.297 Sum_probs=19.0
Q ss_pred hccccCCcch-----HHHHHHHHHHHHhhcccCCC-eEEEEEec
Q psy11447 250 AQFCASGDYT-----LESIRASIQDLTTECAQHDD-SILIVLSD 287 (360)
Q Consensus 250 ~Qfc~sGD~T-----lea~~~ai~~~a~~~~daDe-~fVivlSD 287 (360)
+-||++||+| +-.+=.||+.+.+. |-+| -|++-|-|
T Consensus 5 ~gF~l~G~~~~~YE~Vm~~f~sv~siRkS--El~d~~~iV~Lk~ 46 (79)
T PF09625_consen 5 SGFLLSGEKTSKYERVMLTFESVKSIRKS--ELRDGVYIVQLKK 46 (79)
T ss_dssp S-EEEESS-GTTHHHHHHTSTTEEEEEE---SS-TTEEEEEE-T
T ss_pred cceEEecCchHHHHHHHHHHHHHHHHHHh--hcccceEEEEEec
Confidence 5699999987 33345566666665 4444 45544444
No 90
>PF09827 CRISPR_Cas2: CRISPR associated protein Cas2; InterPro: IPR019199 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. Members of this family of bacterial proteins comprise various hypothetical proteins, as well as CRISPR (clustered regularly interspaced short palindromic repeats) associated proteins, conferring resistance to infection by certain bacteriophages. ; PDB: 3EXC_X 2I0X_A 3OQ2_B 3UI3_A 1ZPW_X 2I8E_A 2IVY_A.
Probab=35.35 E-value=1.1e+02 Score=23.72 Aligned_cols=54 Identities=28% Similarity=0.359 Sum_probs=30.7
Q ss_pred CCchHHHHHHHHHHHhh---cc-ccCCcchHHHHHHHHHHHHhhcccC-CCeEEEEEec
Q psy11447 234 RDNKQRLEVIKIMHAHA---QF-CASGDYTLESIRASIQDLTTECAQH-DDSILIVLSD 287 (360)
Q Consensus 234 k~~keRl~vl~~M~aH~---Qf-c~sGD~Tlea~~~ai~~~a~~~~da-De~fVivlSD 287 (360)
+++|.|-++.+.+..|. || +|.|+-|-.-.+.-.+++.+.-.+. |.=+++-+++
T Consensus 11 ~~~k~~~kv~k~L~~~g~~iQ~SVf~~~~~~~~~~~l~~~l~~~i~~~~d~i~i~~l~~ 69 (78)
T PF09827_consen 11 SDNKRRNKVRKILKSYGTRIQYSVFEGNLTNAELRKLRRELEKLIDPDEDSIRIYPLCD 69 (78)
T ss_dssp HSHHHHHHHHHHHHHTTEEEETTEEEEEE-HHHHHHHHHHHHHHSCTTTCEEEEEEEEH
T ss_pred CCcHHHHHHHHHHHHhCccccceEEEEEcCHHHHHHHHHHHHhhCCCCCCEEEEEEeCC
Confidence 56799999998888876 33 5666655444444455554432122 4444444444
No 91
>cd00954 NAL N-Acetylneuraminic acid aldolase, also called N-acetylneuraminate lyase (NAL), which catalyses the reversible aldol reaction of N-acetyl-D-mannosamine and pyruvate to give N-acetyl-D-neuraminic acid (D-sialic acid). It has a widespread application as biocatalyst for the synthesis of sialic acid and its derivatives. This enzyme has been shown to be quite specific for pyruvate as the donor, but flexible to a variety of D- and, to some extent, L-hexoses and pentoses as acceptor substrates. NAL is member of dihydrodipicolinate synthase family that comprises several pyruvate-dependent class I aldolases.
Probab=34.74 E-value=4.2e+02 Score=25.47 Aligned_cols=106 Identities=16% Similarity=0.183 Sum_probs=59.7
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCc-chHHHHHHHHHHHHhhcccCCCeE----
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGD-YTLESIRASIQDLTTECAQHDDSI---- 281 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD-~Tlea~~~ai~~~a~~~~daDe~f---- 281 (360)
+=+.|++|+.+.. +..||.++++....++. ++-.|. +|-++++.|-.- .+- -||--.
T Consensus 39 i~~~GstGE~~~L----------t~~Er~~~~~~~~~~~~~~~~viagv~~~~~~~ai~~a~~a-~~~--Gad~v~~~~P 105 (288)
T cd00954 39 LYVNGSTGEGFLL----------SVEERKQIAEIVAEAAKGKVTLIAHVGSLNLKESQELAKHA-EEL--GYDAISAITP 105 (288)
T ss_pred EEECcCCcCcccC----------CHHHHHHHHHHHHHHhCCCCeEEeccCCCCHHHHHHHHHHH-HHc--CCCEEEEeCC
Confidence 4578999999873 57899999998877644 333444 566666554321 111 222111
Q ss_pred ------------------------EEEEecC-CccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCC
Q psy11447 282 ------------------------LIVLSDA-NLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELP 333 (360)
Q Consensus 282 ------------------------VivlSDA-NL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp 333 (360)
=|++-+. ....+.++|+.+.++.. . |+|.+.=---|.+ ....++.+..|
T Consensus 106 ~y~~~~~~~i~~~~~~v~~a~~~lpi~iYn~P~~tg~~l~~~~~~~L~~-~-pnivgiK~s~~d~-~~~~~~~~~~~ 179 (288)
T cd00954 106 FYYKFSFEEIKDYYREIIAAAASLPMIIYHIPALTGVNLTLEQFLELFE-I-PNVIGVKFTATDL-YDLERIRAASP 179 (288)
T ss_pred CCCCCCHHHHHHHHHHHHHhcCCCCEEEEeCccccCCCCCHHHHHHHhc-C-CCEEEEEeCCCCH-HHHHHHHHhCC
Confidence 1111211 14577888988888874 4 8776544433332 23444555554
No 92
>cd08603 GDPD_SHV3_repeat_1 Glycerophosphodiester phosphodiesterase domain repeat 1 of glycerophosphodiester phosphodiesterase-like protein SHV3 and SHV3-like proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) repeat 1 present in glycerophosphodiester phosphodiesterase (GP-GDE)-like protein SHV3 and SHV3-like proteins (SVLs), which may play an important role in cell wall organization. The prototype of this family is a glycosylphosphatidylinositol (GPI) anchored protein SHV3 encoded by shaven3 (shv3) gene from Arabidopsis thaliana. Members in this family show sequence homology to bacterial GP-GDEs (EC 3.1.4.46) that catalyze the hydrolysis of various glycerophosphodiesters, and produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. Both, SHV3 and SVLs, have two tandemly repeated GDPD domains whose biochemical functions remain unclear. The residues essential for interactions with the substrates and calcium ions in bacterial GP
Probab=34.59 E-value=38 Score=33.85 Aligned_cols=36 Identities=28% Similarity=0.332 Sum_probs=26.4
Q ss_pred cchHHHHHHHHHHHHhh-cccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTE-CAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~-~~daD-----e~fVivlSDANL~R 292 (360)
+||++|-+.|++.=+.. ..|+| |...||..|.||+|
T Consensus 15 EnTl~Ay~~Ai~~Ga~~d~IE~DV~lTkDgvlVv~HD~~L~r 56 (299)
T cd08603 15 DSSLFAYQFAASSSSPDVALWCDLQLTKDGVGICLPDLNLDN 56 (299)
T ss_pred cchHHHHHHHHHcCCCCCEEEEEeeECcCCcEEEeCCccccc
Confidence 68999999999842210 01233 67889999999998
No 93
>cd03007 PDI_a_ERp29_N PDIa family, endoplasmic reticulum protein 29 (ERp29) subfamily; ERp29 is a ubiquitous ER-resident protein expressed in high levels in secretory cells. It forms homodimers and higher oligomers in vitro and in vivo. It contains a redox inactive TRX-like domain at the N-terminus, which is homologous to the redox active TRX (a) domains of PDI, and a C-terminal helical domain similar to the C-terminal domain of P5. The expression profile of ERp29 suggests a role in secretory protein production distinct from that of PDI. It has also been identified as a member of the thyroglobulin folding complex. The Drosophila homolog, Wind, is the product of windbeutel, an essential gene in the development of dorsal-ventral patterning. Wind is required for correct targeting of Pipe, a Golgi-resident type II transmembrane protein with homology to 2-O-sulfotransferase.
Probab=34.37 E-value=41 Score=29.12 Aligned_cols=33 Identities=12% Similarity=0.176 Sum_probs=20.9
Q ss_pred hccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecC
Q psy11447 250 AQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDA 288 (360)
Q Consensus 250 ~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDA 288 (360)
+.|||.|.| -..+.||.+-+.|++.+||.--|+
T Consensus 27 A~~Pwc~k~------~~~~~LA~e~~~aa~~v~lakVd~ 59 (116)
T cd03007 27 TAYPYGEKH------EAFTRLAESSASATDDLLVAEVGI 59 (116)
T ss_pred CCCCCCCCh------HHHHHHHHHHHhhcCceEEEEEec
Confidence 457999952 345555533335555688888887
No 94
>cd08582 GDPD_like_2 Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in a group of uncharacterized bacterial glycerophosphodiester phosphodiesterase and similar proteins. They show high sequence similarity to Escherichia coli glycerophosphodiester phosphodiesterase, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols.
Probab=34.21 E-value=38 Score=31.09 Aligned_cols=36 Identities=22% Similarity=0.323 Sum_probs=25.2
Q ss_pred CcchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 256 GDYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
-.||++|.+.|++.=+. ..|.| |..+||+-|.+|+|
T Consensus 12 pENTl~af~~A~~~G~~-~vE~Dv~lTkDg~~Vv~HD~~l~r 52 (233)
T cd08582 12 PENTLAAFELAWEQGAD-GIETDVRLTKDGELVCVHDPTLKR 52 (233)
T ss_pred CchHHHHHHHHHHcCCC-EEEEEEEEccCCCEEEecCCcccc
Confidence 36899999999884221 11222 45678889999987
No 95
>cd08555 PI-PLCc_GDPD_SF Catalytic domain of phosphoinositide-specific phospholipase C-like phosphodiesterases superfamily. The PI-PLC-like phosphodiesterases superfamily represents the catalytic domains of bacterial phosphatidylinositol-specific phospholipase C (PI-PLC, EC 4.6.1.13), eukaryotic phosphoinositide-specific phospholipase C (PI-PLC, EC 3.1.4.11), glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46), sphingomyelinases D (SMases D) (sphingomyelin phosphodiesterase D, EC 3.1.4.41) from spider venom, SMases D-like proteins, and phospholipase D (PLD) from several pathogenic bacteria, as well as their uncharacterized homologs found in organisms ranging from bacteria and archaea to metazoans, plants, and fungi. PI-PLCs are ubiquitous enzymes hydrolyzing the membrane lipid phosphoinositides to yield two important second messengers, inositol phosphates and diacylglycerol (DAG). GP-GDEs play essential roles in glycerol metabolism and catalyze the hydrolysis of glycerophosph
Probab=33.72 E-value=60 Score=28.84 Aligned_cols=49 Identities=29% Similarity=0.410 Sum_probs=33.8
Q ss_pred cchHHHHHHHHHHHHhhcccC-----CCeEEEEEecCCccc--cCCChhHHHHHHhc
Q psy11447 257 DYTLESIRASIQDLTTECAQH-----DDSILIVLSDANLDR--YGIAPREMGEALNA 306 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~da-----De~fVivlSDANL~R--YgI~p~~l~~~l~~ 306 (360)
.||+.|.+.|++.=+.. .|. .|.-.||.-|.+++| .|.+.-.|.++|..
T Consensus 13 eNT~~af~~a~~~G~~~-iE~DV~lt~Dg~lvv~HD~~~~r~~~~~~~ptl~evl~~ 68 (179)
T cd08555 13 ENTLEAFYRALDAGARG-LELDVRLTKDGELVVYHGPTLDRTTAGILPPTLEEVLEL 68 (179)
T ss_pred ccHHHHHHHHHHcCCCE-EEEEEeEcCCCeEEEECCCccccccCCCCCCCHHHHHHH
Confidence 78999999999853220 021 135677889999998 46666677777754
No 96
>cd01459 vWA_copine_like VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
Probab=32.95 E-value=3.6e+02 Score=26.32 Aligned_cols=119 Identities=16% Similarity=0.186 Sum_probs=66.3
Q ss_pred hhhHHHHhhccccceeEeeeeccCCCCccccccC---CCCCCCchHH-HHHHHHH-HHhhccccCCcchHH-HHHHHHHH
Q psy11447 196 LGTSLEAFQDFDQRISLDIIGHSGETHSIPFVTS---NNLPRDNKQR-LEVIKIM-HAHAQFCASGDYTLE-SIRASIQD 269 (360)
Q Consensus 196 L~rvMEaf~g~e~k~~ydivGHSGd~~~i~lV~~---~~~Pk~~keR-l~vl~~M-~aH~Qfc~sGD~Tle-a~~~ai~~ 269 (360)
+++|.+-+..|.+-=.++++|=.+..+.-+-|.. ++|....-.= =.|++.- ++-.+..+||--..+ .+++|++
T Consensus 68 I~~vg~il~~yD~D~~ip~~GFGa~~~~~~~v~~~f~~~~~~p~~~Gi~gvl~aY~~~l~~v~lsGpT~fapvI~~a~~- 146 (254)
T cd01459 68 IRIVGEVLQPYDSDKLIPAFGFGAIVTKDQSVFSFFPGYSESPECQGFEGVLRAYREALPNVSLSGPTNFAPVIRAAAN- 146 (254)
T ss_pred HHHHHHHHHhcCCCCceeeEeecccCCCCCccccccCCCCCCCcccCHHHHHHHHHHHhceeeecCcchHHHHHHHHHH-
Confidence 6668888888876668888887654322211111 2221111111 1222221 133567788975544 4455544
Q ss_pred HHhhcccCCCeE-EEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC
Q psy11447 270 LTTECAQHDDSI-LIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS 320 (360)
Q Consensus 270 ~a~~~~daDe~f-VivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs 320 (360)
.++.......|+ +++|||-.... -++-.++|..- .+---++||||-
T Consensus 147 ~a~~~~~~~~Y~VLLIiTDG~i~D----~~~t~~aIv~A-S~~PlSIiiVGV 193 (254)
T cd01459 147 IAKASNSQSKYHILLIITDGEITD----MNETIKAIVEA-SKYPLSIVIVGV 193 (254)
T ss_pred HHHHhcCCCceEEEEEECCCCccc----HHHHHHHHHHH-hcCCeEEEEEEe
Confidence 343311222365 78899987763 45556666553 677899999997
No 97
>cd00950 DHDPS Dihydrodipicolinate synthase (DHDPS) is a key enzyme in lysine biosynthesis. It catalyzes the aldol condensation of L-aspartate-beta- semialdehyde and pyruvate to dihydropicolinic acid via a Schiff base formation between pyruvate and a lysine residue. The functional enzyme is a homotetramer consisting of a dimer of dimers. DHDPS is member of dihydrodipicolinate synthase family that comprises several pyruvate-dependent class I aldolases that use the same catalytic step to catalyze different reactions in different pathways.
Probab=32.32 E-value=4.5e+02 Score=24.99 Aligned_cols=45 Identities=22% Similarity=0.383 Sum_probs=32.2
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCc-chHHHHHHH
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGD-YTLESIRAS 266 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD-~Tlea~~~a 266 (360)
+-+.|++|+.+. -+..||.++++.+..++. ++-.|. +|-++++.|
T Consensus 38 l~v~GstGE~~~----------lt~~Er~~l~~~~~~~~~~~~~vi~gv~~~~~~~~~~~a 88 (284)
T cd00950 38 LVVCGTTGESPT----------LSDEEHEAVIEAVVEAVNGRVPVIAGTGSNNTAEAIELT 88 (284)
T ss_pred EEECCCCcchhh----------CCHHHHHHHHHHHHHHhCCCCcEEeccCCccHHHHHHHH
Confidence 456789999876 568999999999999875 233444 555665544
No 98
>PF03586 Herpes_UL36: Herpesvirus UL36 tegument protein; InterPro: IPR005210 The UL36 open reading frame (ORF) encodes the largest Human herpesvirus 1 (HHV-1) protein, a 270 kDa polypeptide designated VP1/2, which is also a component of the virion tegument. A null mutation in the UL36 gene of herpes simplex virus type 1 results in accumulation of unenveloped DNA-filled capsids in the cytoplasm of infected cells []. The region which defines these sequences only covers a small central part of this large protein.; GO: 0004197 cysteine-type endopeptidase activity, 0008242 omega peptidase activity
Probab=32.30 E-value=30 Score=34.13 Aligned_cols=35 Identities=20% Similarity=0.237 Sum_probs=27.6
Q ss_pred CCccchhhhh-----hhhhhhhhHHHhhhhhccccCCchhh
Q psy11447 92 YPYSTREAVH-----IVKHLQEGVLGQHENTRKFNGYDGRL 127 (360)
Q Consensus 92 ypYs~Re~v~-----ivKhL~~~v~dv~~nv~rFd~yD~rl 127 (360)
+.|.+|++-+ +--|. +.+..++.+||.||.|.|+-
T Consensus 38 ~gyd~~~~~k~Ae~~v~a~a-~~~~~al~tvl~fNPyspeN 77 (253)
T PF03586_consen 38 HGYDTRDFRKRAEQAVAANA-RTATNALETVLAFNPYSPEN 77 (253)
T ss_pred cCCChHHHHHHHHHHHHHHH-HHHHHHHHHHHhhCCCCccc
Confidence 5566666544 44566 88999999999999999987
No 99
>cd08575 GDPD_GDE4_like Glycerophosphodiester phosphodiesterase domain of mammalian glycerophosphodiester phosphodiesterase GDE4-like proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in mammalian GDE4 (also known as glycerophosphodiester phosphodiesterase domain-containing protein 1 (GDPD1)) and similar proteins. Mammalian GDE4 is a transmembrane protein whose cellular function is not elucidated. It is expressed widely, including in placenta, liver, kidney, pancreas, spleen, thymus, ovary, small intestine and peripheral blood leukocytes. It is also expressed in the growth cones in neuroblastoma Neuro2a cells, which suggests mammalian GDE4 may play some distinct role from other members of mammalian GDEs family. Also included in this subfamily are uncharacterized mammalian glycerophosphodiester phosphodiesterase domain-containing protein 3 (GDPD3) and similar proteins which display very high sequence homology to mammalian GDE4.
Probab=32.14 E-value=73 Score=30.28 Aligned_cols=36 Identities=25% Similarity=0.456 Sum_probs=26.0
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
.||++|.+.|++.=+. ..|.| |..+||+-|.+|+|-
T Consensus 15 ENTl~af~~A~~~G~d-~iE~DV~lT~Dg~~Vv~HD~~l~r~ 55 (264)
T cd08575 15 ENTIAAFRHAVKNGAD-MLELDVQLTKDGQVVVFHDWDLDRL 55 (264)
T ss_pred ccHHHHHHHHHHcCCC-EEEEEEEECCCCCEEEEcCCcccce
Confidence 6899999999985321 11222 577889999999984
No 100
>PRK10423 transcriptional repressor RbsR; Provisional
Probab=32.04 E-value=2.8e+02 Score=25.79 Aligned_cols=18 Identities=17% Similarity=0.108 Sum_probs=14.0
Q ss_pred CCCChhHHHHHHHhhhhh
Q psy11447 64 PDVPRDTLIRLVRAFGEF 81 (360)
Q Consensus 64 p~v~~~~l~~Lv~afgeL 81 (360)
|.|++++-.|+.++-.+|
T Consensus 23 ~~vs~~tr~rV~~~a~~l 40 (327)
T PRK10423 23 RFVSEAITAKVEAAIKEL 40 (327)
T ss_pred CCCCHHHHHHHHHHHHHH
Confidence 568888888888877666
No 101
>cd08579 GDPD_memb_like Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized bacterial glycerophosphodiester phosphodiesterases. In addition to a C-terminal GDPD domain, most members in this family have an N-terminus that functions as a membrane anchor.
Probab=31.88 E-value=43 Score=30.46 Aligned_cols=36 Identities=36% Similarity=0.562 Sum_probs=25.4
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
.||++|.+.|++.=+. ..|+| |.-.||+-|.+|+|-
T Consensus 13 ENT~~af~~A~~~Gad-~vE~DV~~T~Dg~~vv~HD~~l~r~ 53 (220)
T cd08579 13 ENTLEALEAAIKAKPD-YVEIDVQETKDGQFVVMHDANLKRL 53 (220)
T ss_pred ccHHHHHHHHHHcCCC-EEEEEeeEcCCCCEEEEcCCchhhc
Confidence 6899999999986332 12433 344667999999984
No 102
>COG0584 UgpQ Glycerophosphoryl diester phosphodiesterase [Energy production and conversion]
Probab=30.55 E-value=55 Score=30.32 Aligned_cols=40 Identities=25% Similarity=0.366 Sum_probs=25.5
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccccCCCh
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRYGIAP 297 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RYgI~p 297 (360)
-||++|.+.|++.=+. ..|.| |..+||+=|..++|=...-
T Consensus 20 ENTl~Af~~A~~~gad-~iE~Dv~lTkDg~lVv~HD~~~drt~~~~ 64 (257)
T COG0584 20 ENTLAAFELAAEQGAD-YIELDVQLTKDGVLVVIHDETLDRTTNGL 64 (257)
T ss_pred cchHHHHHHHHHcCCC-EEEeeccCccCCcEEEecccchhhhccCc
Confidence 7999999999954221 11222 4667788888766654433
No 103
>TIGR01856 hisJ_fam histidinol phosphate phosphatase HisJ family. This model represents the histidinol phosphate phosphatase HisJ of Bacillus subtilis, and related proteins from a number of species within a larger family of phosphatases in the PHP hydrolase family. HisJ catalyzes the penultimate step of histidine biosynthesis but shows no homology to the functionally equivalent sequence in E. coli, a domain of the bifunctional HisB protein. Note, however, that many species have two members and that Clostridium perfringens, predicted not to make histidine, has five members of this family; this family is designated subfamily rather than equivalog to indicate that members may not all act as HisJ.
Probab=30.53 E-value=31 Score=32.60 Aligned_cols=22 Identities=45% Similarity=0.833 Sum_probs=14.5
Q ss_pred HHHhhccccCCcchHHHH-HHHH
Q psy11447 246 MHAHAQFCASGDYTLESI-RASI 267 (360)
Q Consensus 246 M~aH~Qfc~sGD~Tlea~-~~ai 267 (360)
+|.||+||.=|-.|+|.. +.|+
T Consensus 3 ~H~Ht~~s~d~~~~~ee~v~~A~ 25 (253)
T TIGR01856 3 GHSHSPFCAHGTDTLEEVVQEAI 25 (253)
T ss_pred cccCcCCCCCCCCCHHHHHHHHH
Confidence 799999995555565544 4444
No 104
>cd08606 GDPD_YPL110cp_fungi Glycerophosphodiester phosphodiesterase domain of Saccharomyces cerevisiae YPL110cp and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Saccharomyces cerevisiae YPL110cp and other uncharacterized fungal homologs. The product of S. cerevisiae ORF YPL110c (GDE1), YPL110cp (Gde1p), displays homology to bacterial and mammalian glycerophosphodiester phosphodiesterases (GP-GDE, EC 3.1.4.46), which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. S. cerevisiae YPL110cp has been characterized as a cytoplasmic glycerophosphocholine (GPC)-specific phosphodiesterase that selectively hydrolyzes GPC, not glycerophosphoinositol (GPI), to generate choline and glycerolphosphate. YPL110cp has multi-domain architecture, including not only C-terminal GDPD, but also an SPX N-terminal domain along with several ankyrin repeats, which imp
Probab=30.48 E-value=1.1e+02 Score=29.03 Aligned_cols=40 Identities=18% Similarity=0.222 Sum_probs=28.1
Q ss_pred CcchHHHHHHHHHHHHhhccc-----CCCeEEEEEecCCccccCCC
Q psy11447 256 GDYTLESIRASIQDLTTECAQ-----HDDSILIVLSDANLDRYGIA 296 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~d-----aDe~fVivlSDANL~RYgI~ 296 (360)
.-||++|.+.|++-=+. -.| .-|..+||+-|.+|+|-|.+
T Consensus 23 pENTl~af~~A~~~g~d-~vE~DV~lTkDg~~VV~HD~~l~rt~~~ 67 (286)
T cd08606 23 GENTVESFILAASLGAS-YVEVDVQLTKDLVPVIYHDFLVSETGTD 67 (286)
T ss_pred CcchHHHHHHHHHcCCC-EEEEEEEEccCCEEEEeCCCeeccCCCC
Confidence 36899999999874221 012 22578899999999997543
No 105
>PRK04663 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional
Probab=29.79 E-value=4.8e+02 Score=26.48 Aligned_cols=114 Identities=14% Similarity=0.246 Sum_probs=63.2
Q ss_pred ccCCchhhHHHHh---hccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHH
Q psy11447 191 NTGSDLGTSLEAF---QDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASI 267 (360)
Q Consensus 191 fn~~DL~rvMEaf---~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai 267 (360)
.-|++++.+.+++ .+.+.|+.+ ++. +. .+-+|...+ | +..+|+..|+
T Consensus 283 ~lGi~~~~i~~~L~~f~g~~~R~e~--v~~-~~--g~~~idDs~-~------------------------tn~~s~~~Al 332 (438)
T PRK04663 283 AAGVDYRKALDALKSYTGLTHRCQV--VAD-NH--GIKWVNDSK-A------------------------TNVASTLAAL 332 (438)
T ss_pred HcCCCHHHHHHHHHhCCCCCCceEE--eee-eC--CcEEEeCCC-c------------------------CCHHHHHHHH
Confidence 3477866655555 555787765 443 22 344554433 2 2467778888
Q ss_pred HHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceEEecCCCchH
Q psy11447 268 QDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGFVCMDLTEIP 347 (360)
Q Consensus 268 ~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~fv~~dt~~lP 347 (360)
+.+.. +.++++|+-+- +=|...++|.+.+... +..+|++|.-.. ++...+. ..++|.|.+++.
T Consensus 333 ~~~~~-----~~~~i~IlGg~---~~~~~~~~l~~~~~~~----~~~vi~~G~~~~---~~~~~~~--~~~~~~~~e~av 395 (438)
T PRK04663 333 SGLEI-----EGKLYLLVGGV---GKGADFSPLKPVLATL----NLQLCCFGEDGD---QFMPLHP--SARRFDTMEDAI 395 (438)
T ss_pred Hhccc-----CCcEEEEECCc---cCCCCHHHHHHHHHhh----CcEEEEECCCHH---HHHHHhc--cCeecCCHHHHH
Confidence 87641 23677777762 2245667888887653 236888887543 3333222 244555544444
Q ss_pred HHHH
Q psy11447 348 QILQ 351 (360)
Q Consensus 348 ~il~ 351 (360)
+.++
T Consensus 396 ~~~~ 399 (438)
T PRK04663 396 ESIS 399 (438)
T ss_pred HHHH
Confidence 4443
No 106
>PF11181 YflT: Heat induced stress protein YflT
Probab=29.68 E-value=59 Score=26.88 Aligned_cols=24 Identities=29% Similarity=0.423 Sum_probs=20.7
Q ss_pred hHHHHHHhHHcCCCHHhHHHHHHh
Q psy11447 3 LKAFKQKLKEIQMSEYDAKVYSEY 26 (360)
Q Consensus 3 ~ka~~~rLkEi~Ms~~da~~Y~~~ 26 (360)
...+..+|.++|+|+.+++.|++-
T Consensus 70 ~d~~~~~l~~lGl~~~ea~~y~~~ 93 (103)
T PF11181_consen 70 GDELRSKLESLGLSEDEAERYEEE 93 (103)
T ss_pred cHHHHHHHHHcCCCHHHHHHHHHH
Confidence 346788999999999999999863
No 107
>TIGR01651 CobT cobaltochelatase, CobT subunit. This model describes the aerobic cobalamin pathway Pseudomonas denitrificans CobT gene product, which is a cobalt chelatase subunit, with a MW ~70 kDa. The aerobic pathway cobalt chelatase is a heterotrimeric, ATP-dependent enzyme that catalyzes cobalt insertion during cobalamin biosynthesis. The other two subunits are the P. denitrificans CobS (TIGR01650) and CobN (pfam02514 CobN/Magnesium Chelatase) proteins. To avoid potential confusion with the nonhomologous Salmonella typhimurium/E.coli cobT gene product, the P. denitrificans gene symbol is not used in the name of this model.
Probab=29.28 E-value=2.2e+02 Score=31.53 Aligned_cols=139 Identities=16% Similarity=0.247 Sum_probs=85.6
Q ss_pred HHHHhhccccceeEeeeeccCCCC-----ccccccCCCCCCCchHHHHHHHHHHHhhccccCC-----------------
Q psy11447 199 SLEAFQDFDQRISLDIIGHSGETH-----SIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASG----------------- 256 (360)
Q Consensus 199 vMEaf~g~e~k~~ydivGHSGd~~-----~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sG----------------- 256 (360)
+-++++ .-.+...|.|++-+.+ .=..+..|+||.- -||.-++.++ |.-+.
T Consensus 420 La~aL~--~~gIp~eVlGFtt~aw~gg~~re~w~~~g~p~~P--gRlN~l~hii----yk~ad~~wr~~r~~l~~mm~~~ 491 (600)
T TIGR01651 420 LARTLE--RCGVKVEILGFTTRAWKGGQSREKWLKAGKPAAP--GRLNDLRHII----YKSADAPWRRARRNLGLMMREG 491 (600)
T ss_pred HHHHHH--HCCCCeEEEeecccccccccchHHHHhcCCCCCC--cccchhhhhh----hhccccchhhhccchhhhhhcc
Confidence 334555 3478899999996532 2257788888653 2333222222 22111
Q ss_pred ---cc-hHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCC----Ch----hHHHHHHhcCc--ccccEEEEEecCch
Q psy11447 257 ---DY-TLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGI----AP----REMGEALNAEA--ASVRAYAIFIGSLG 322 (360)
Q Consensus 257 ---D~-Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI----~p----~~l~~~l~~~~--p~V~a~~IfIgslg 322 (360)
+| -=||++.|.+.|.+- ...-.|.|||||.-=.-.+- ++ +.|.+++..-. +.|.-.+|=||.
T Consensus 492 ~~~eN~DGeAl~wa~~rL~~R--~e~rKiL~ViSDG~P~D~~TlsvN~~~~l~~hLr~vi~~~e~~~~vel~aigIg~-- 567 (600)
T TIGR01651 492 LLKENIDGEALMWAHQRLIAR--PEQRRILMMISDGAPVDDSTLSVNPGNYLERHLRAVIEEIETRSPVELLAIGIGH-- 567 (600)
T ss_pred ccccCCchHHHHHHHHHHhcC--cccceEEEEEeCCCcCCccccccCchhHHHHHHHHHHHHHhccCCceEEEeeccc--
Confidence 11 148999999999876 78899999999964221111 12 23444433321 468888888876
Q ss_pred HHHHHHHhhCCCCceEEecCCCchHHHHHHHHH
Q psy11447 323 DQAKRLTQELPAGRGFVCMDLTEIPQILQQIFS 355 (360)
Q Consensus 323 dqA~~l~~~Lp~Gr~fv~~dt~~lP~il~qIf~ 355 (360)
| +.+-- .|+++..|..+||..|-+=++
T Consensus 568 D----v~r~Y--~~~v~i~~~~eL~~~~~~qLa 594 (600)
T TIGR01651 568 D----VTRYY--RRAVTIVDAEELAGAMTEQLA 594 (600)
T ss_pred c----HHHHc--cccceecCHHHHHHHHHHHHH
Confidence 2 44444 588999999999988765444
No 108
>PRK10773 murF UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase; Reviewed
Probab=29.03 E-value=1.6e+02 Score=30.16 Aligned_cols=86 Identities=23% Similarity=0.280 Sum_probs=50.8
Q ss_pred chHHHHHHHHHHHHhhcccCCCeEEEEEec-CCccccCCC-hhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCC
Q psy11447 258 YTLESIRASIQDLTTECAQHDDSILIVLSD-ANLDRYGIA-PREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAG 335 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daDe~fVivlSD-ANL~RYgI~-p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~G 335 (360)
||.++++.|++.+.+. . .+-++||.| ..|..+... .+++++.+... . --.+||+| +.+..+.... |
T Consensus 336 ~nP~s~~aaL~~l~~~--~--~r~i~VlG~m~elG~~~~~~h~~~~~~~~~~--~-~d~v~~~G---~~~~~~~~~~--~ 403 (453)
T PRK10773 336 ANVGSMTAAAQVLAEM--P--GYRVMVVGDMAELGAESEACHRQVGEAAKAA--G-IDKVLSVG---KLSHAISEAS--G 403 (453)
T ss_pred CCHHHHHHHHHHHHhC--C--CCEEEEECChhhcchHHHHHHHHHHHHHHHc--C-CCEEEEEC---hhHHHHHHhc--C
Confidence 5678888888888643 2 356788888 666544221 23444444322 1 13567776 4555565542 4
Q ss_pred ceEEecCCCchHHHHHHHHH
Q psy11447 336 RGFVCMDLTEIPQILQQIFS 355 (360)
Q Consensus 336 r~fv~~dt~~lP~il~qIf~ 355 (360)
++..+-|.+++...|+.++.
T Consensus 404 ~~~~~~~~~~~~~~l~~~~~ 423 (453)
T PRK10773 404 VGEHFADKTALIARLKALLA 423 (453)
T ss_pred CCeeECCHHHHHHHHHHhhc
Confidence 33456788888877777654
No 109
>COG2884 FtsE Predicted ATPase involved in cell division [Cell division and chromosome partitioning]
Probab=29.01 E-value=21 Score=34.61 Aligned_cols=50 Identities=18% Similarity=0.396 Sum_probs=35.2
Q ss_pred cccccCCchhh---------------HHHHhhccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHH
Q psy11447 188 NLRNTGSDLGT---------------SLEAFQDFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIK 244 (360)
Q Consensus 188 ~lrfn~~DL~r---------------vMEaf~g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~ 244 (360)
++.|||.||.| |..-|.=+.+|-.|| ....||.-.|.+|++-++|..-+-
T Consensus 58 ~i~~~~~dl~~l~~~~iP~LRR~IGvVFQD~rLL~~~tvye-------NVA~pL~v~G~~~~~i~~rV~~~L 122 (223)
T COG2884 58 KILVNGHDLSRLKGREIPFLRRQIGVVFQDFRLLPDRTVYE-------NVALPLRVIGKPPREIRRRVSEVL 122 (223)
T ss_pred eEEECCeecccccccccchhhheeeeEeeeccccccchHhh-------hhhhhhhccCCCHHHHHHHHHHHH
Confidence 57788888443 444444447888887 456788889999999888865443
No 110
>cd08571 GDPD_SHV3_plant Glycerophosphodiester phosphodiesterase domain of glycerophosphodiester phosphodiesterase-like protein SHV3 and SHV3-like proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase (GDPD) domain present in glycerophosphodiester phosphodiesterase (GP-GDE)-like protein SHV3 and SHV3-like proteins (SVLs), which may play an important role in cell wall organization. The prototype of this family is a glycosylphosphatidylinositol (GPI) anchored protein SHV3 encoded by shaven3 (shv3) gene from Arabidopsis thaliana. Members in this family show sequence homology to bacterial GP-GDEs (EC 3.1.4.46) that catalyze the hydrolysis of various glycerophosphodiesters, and produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. Both, SHV3 and SVLs, have two tandemly repeated GDPD domains whose biochemical functions remain unclear. The residues essential for interactions with the substrates and calcium ions in bacterial GP-GDEs are not conserv
Probab=28.89 E-value=54 Score=32.16 Aligned_cols=35 Identities=31% Similarity=0.345 Sum_probs=26.0
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
.||++|.+.|++.=+. -.|.| |..+||.=|.+|+|
T Consensus 15 ENTl~Af~~A~~~Gad-~IE~DV~lTkDg~lVv~HD~~l~r 54 (302)
T cd08571 15 DSTDLAYQKAISDGAD-VLDCDVQLTKDGVPICLPSINLDN 54 (302)
T ss_pred cchHHHHHHHHHcCCC-EEEeeeeEcCCCcEEEeCCchhcC
Confidence 6899999999985322 11333 57788999999988
No 111
>cd08573 GDPD_GDE1 Glycerophosphodiester phosphodiesterase domain of mammalian glycerophosphodiester phosphodiesterase GDE1 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in mammalian GDE1 (also known as MIR16, membrane interacting protein of RGS16) and their metazoan homologs. GDE1 is widely expressed in mammalian tissues, including the heart, brain, liver, and kidney. It shows sequence homology to bacterial glycerophosphodiester phosphodiesterases (GP-GDEs, EC 3.1.4.46), which catalyzes the hydrolysis of various glycerophosphodiesters, and produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. GDE1 has been characterized as GPI-GDE (EC 3.1.4.44) that selectively hydrolyzes extracellular glycerophosphoinositol (GPI) to generate glycerol phosphate and inositol. It functions as an integral membrane-bound glycoprotein interacting with regulator of G protein signaling protein RGS16, and is modulated by G
Probab=28.85 E-value=80 Score=30.01 Aligned_cols=36 Identities=22% Similarity=0.421 Sum_probs=25.4
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
-||+.|.+.|++.=+. ..|.| |.-+||+-|.||+|-
T Consensus 13 ENTl~af~~A~~~Gad-~iE~DV~lTkDg~~Vv~HD~~l~R~ 53 (258)
T cd08573 13 ENTLAAFRQAKKNGAD-GVEFDLEFTKDGVPVLMHDDTVDRT 53 (258)
T ss_pred ccHHHHHHHHHHcCCC-EEEEEeeECCCCcEEEECCCCccee
Confidence 5899999999984221 11222 466889999999984
No 112
>cd01339 LDH-like_MDH L-lactate dehydrogenase-like malate dehydrogenase proteins. Members of this subfamily have an LDH-like structure and an MDH enzymatic activity. Some members, like MJ0490 from Methanococcus jannaschii, exhibit both MDH and LDH activities. Tetrameric MDHs, including those from phototrophic bacteria, are more similar to LDHs than to other MDHs. LDH catalyzes the last step of glycolysis in which pyruvate is converted to L-lactate. MDH is one of the key enzymes in the citric acid cycle, facilitating both the conversion of malate to oxaloacetate and replenishing levels of oxalacetate by reductive carboxylation of pyruvate. The LDH-like MDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenas
Probab=28.82 E-value=2.1e+02 Score=27.54 Aligned_cols=86 Identities=15% Similarity=0.277 Sum_probs=55.1
Q ss_pred CCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC-------cccc
Q psy11447 221 THSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN-------LDRY 293 (360)
Q Consensus 221 ~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN-------L~RY 293 (360)
+.++-++..+.|.+.+.+|.+.+.. +++..+..++++.+- +-+..+|+.|... .+-.
T Consensus 66 dADiVIit~g~p~~~~~~r~e~~~~-------------n~~i~~~i~~~i~~~---~p~~~iIv~sNP~di~t~~~~~~s 129 (300)
T cd01339 66 GSDVVVITAGIPRKPGMSRDDLLGT-------------NAKIVKEVAENIKKY---APNAIVIVVTNPLDVMTYVAYKAS 129 (300)
T ss_pred CCCEEEEecCCCCCcCCCHHHHHHH-------------HHHHHHHHHHHHHHH---CCCeEEEEecCcHHHHHHHHHHHh
Confidence 3456677788888888888876663 567777777777653 3446667777422 2222
Q ss_pred CCCh----------------hHHHHHHhcCcccccEEEEEecCchHH
Q psy11447 294 GIAP----------------REMGEALNAEAASVRAYAIFIGSLGDQ 324 (360)
Q Consensus 294 gI~p----------------~~l~~~l~~~~p~V~a~~IfIgslgdq 324 (360)
|++| ..|++.|.-+..+|++++| |.-|+-
T Consensus 130 ~~~~~rviGlgt~lds~r~~~~la~~l~v~~~~v~~~v~--G~hg~~ 174 (300)
T cd01339 130 GFPRNRVIGMAGVLDSARFRYFIAEELGVSVKDVQAMVL--GGHGDT 174 (300)
T ss_pred CCCHHHEEEecchHHHHHHHHHHHHHhCCCccceEEEEE--eCCCCc
Confidence 4443 3456677777566777665 887774
No 113
>smart00631 Zn_pept Zn_pept.
Probab=28.75 E-value=12 Score=35.96 Aligned_cols=71 Identities=18% Similarity=0.129 Sum_probs=44.6
Q ss_pred hhhHHHHhh-ccccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHh
Q psy11447 196 LGTSLEAFQ-DFDQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTT 272 (360)
Q Consensus 196 L~rvMEaf~-g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~ 272 (360)
+...|++|. .+.+..++..+|+|-+|..|+.+..++++..+|-+.=|.=.||+-- + .+.+++..-++.|..
T Consensus 7 ~~~~l~~l~~~~~~~v~~~~iG~S~~Gr~i~~~~i~~~~~~~k~~v~i~a~~Hg~E---~---~g~~~~~~~i~~L~~ 78 (277)
T smart00631 7 IEAWLKELAARYPDLVRLVSIGKSVEGRPIWVLKISDGGSHNKPAIFIDAGIHARE---W---IGPATALYLINQLLE 78 (277)
T ss_pred HHHHHHHHHHHCCCceEEEeceecCCCCceEEEEEeCCCCCCCcEEEEECCccccc---c---ccHHHHHHHHHHHHH
Confidence 444455553 4677899999999999999999999987744444333333333211 1 134455556666654
No 114
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=28.73 E-value=54 Score=24.43 Aligned_cols=36 Identities=17% Similarity=0.338 Sum_probs=22.2
Q ss_pred HHHhhcccc-CCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC
Q psy11447 246 MHAHAQFCA-SGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN 289 (360)
Q Consensus 246 M~aH~Qfc~-sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN 289 (360)
+|.||+||. -|..+++- -++.+.+. ..=.+.+||=|
T Consensus 2 lH~Ht~~S~~~~~~~~~~---~~~~a~~~-----g~~~v~iTDh~ 38 (67)
T smart00481 2 LHVHSDYSLLDGALSPEE---LVKRAKEL-----GLKAIAITDHG 38 (67)
T ss_pred CccccCCccccccCCHHH---HHHHHHHc-----CCCEEEEeeCC
Confidence 688999996 55555442 33333322 23478899988
No 115
>PF04447 DUF550: Protein of unknown function (DUF550); InterPro: IPR007538 This entry represents the N terminus of a protein of unknown function, found in a range of Proteobacteria and a few P22-like dsDNA virus particles.
Probab=28.70 E-value=25 Score=30.34 Aligned_cols=61 Identities=16% Similarity=0.272 Sum_probs=37.2
Q ss_pred hccccC-CcchHHHHHHHHHHHHh---hcccCCC--eEEEEEecCCccccCCChhHHHHHHhcCccccc
Q psy11447 250 AQFCAS-GDYTLESIRASIQDLTT---ECAQHDD--SILIVLSDANLDRYGIAPREMGEALNAEAASVR 312 (360)
Q Consensus 250 ~Qfc~s-GD~Tlea~~~ai~~~a~---~~~daDe--~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~ 312 (360)
|+-.|- |=+|+.-.++.-+|+-+ .-.|-.| .++|++=|+ ..|+||+|++|.++|... -.||
T Consensus 10 S~~TFGp~~~p~g~lkHl~kE~~E~~~~p~Dl~EwaDv~~Ll~D~-~~RaGis~~~i~~A~~~K-~~iN 76 (100)
T PF04447_consen 10 SDRTFGPGVGPVGPLKHLSKEALEAEAAPGDLSEWADVQILLWDG-ARRAGISPEQIIDAMEAK-LAIN 76 (100)
T ss_pred HHhhcCCCCCcchHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHH-HHHcCCCHHHHHHHHHHH-HHhc
Confidence 334444 66676666666666532 1001112 245556565 369999999999999876 5555
No 116
>COG3148 Uncharacterized conserved protein [Function unknown]
Probab=28.60 E-value=87 Score=30.59 Aligned_cols=45 Identities=22% Similarity=0.368 Sum_probs=36.6
Q ss_pred hHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCc--eEEecCCC
Q psy11447 298 REMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGR--GFVCMDLT 344 (360)
Q Consensus 298 ~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr--~fv~~dt~ 344 (360)
.+|.+.| ++ |.++.||||=|+-..+...+...+|.++ -||..|.+
T Consensus 92 ~eLl~ll-~~-P~~~p~lvfP~e~a~e~t~v~~~~p~~k~plfIllDgT 138 (231)
T COG3148 92 PELLALL-AN-PDYQPYLVFPAEYAEELTEVISTAPAEKPPLFILLDGT 138 (231)
T ss_pred HHHHHHH-hC-CCCceEEEcchHHHHHHHHHhhcccccCCceEEEecCc
Confidence 3455555 44 9999999999999999999999999884 58988864
No 117
>cd05780 DNA_polB_Kod1_like_exo DEDDy 3'-5' exonuclease domain of Pyrococcus kodakaraensis Kod1 and similar archaeal family-B DNA polymerases. The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show
Probab=28.53 E-value=20 Score=32.46 Aligned_cols=30 Identities=17% Similarity=0.415 Sum_probs=24.7
Q ss_pred HHHHhhccccceeEe-eeeccCCCCcccccc
Q psy11447 199 SLEAFQDFDQRISLD-IIGHSGETHSIPFVT 228 (360)
Q Consensus 199 vMEaf~g~e~k~~yd-ivGHSGd~~~i~lV~ 228 (360)
++.+|-.+=.++..| ||||+|++.++|++.
T Consensus 59 lL~~F~~~i~~~dpdiivgyN~~~FD~pyL~ 89 (195)
T cd05780 59 MIKRFIEIVKEKDPDVIYTYNGDNFDFPYLK 89 (195)
T ss_pred HHHHHHHHHHHcCCCEEEecCCCCCcHHHHH
Confidence 677777776677788 999999999999764
No 118
>PRK11930 putative bifunctional UDP-N-acetylmuramoyl-tripeptide:D-alanyl-D-alanine ligase/alanine racemase; Provisional
Probab=27.65 E-value=2.2e+02 Score=31.56 Aligned_cols=87 Identities=13% Similarity=0.144 Sum_probs=52.6
Q ss_pred chHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCC----hhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCC
Q psy11447 258 YTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIA----PREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELP 333 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~----p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp 333 (360)
||.+|++.|++.+.+. ....+.++||.|= ..-|-. ...+++.+.. .++ ..+|++|+ ++..+...++
T Consensus 340 ~nP~s~~aaL~~l~~~--~~~~~~ilIlG~m--~elG~~~~~~~~~l~~~l~~--~~i-~~vi~~G~---~~~~~~~~~~ 409 (822)
T PRK11930 340 SDLQSLDIALDFLNRR--SQSKKKTLILSDI--LQSGQSPEELYRKVAQLISK--RGI-DRLIGIGE---EISSEASKFE 409 (822)
T ss_pred CCHHHHHHHHHHHHhc--ccCCCEEEEECCh--HhcCchHHHHHHHHHHHHHH--cCC-CEEEEECH---HHHHHHHhcC
Confidence 5789999999998653 2224667777743 222332 3445555553 223 45666654 5666766676
Q ss_pred CCceEEecCCCchHHHHHHHH
Q psy11447 334 AGRGFVCMDLTEIPQILQQIF 354 (360)
Q Consensus 334 ~Gr~fv~~dt~~lP~il~qIf 354 (360)
.+..+.+.|.+++.+.++++.
T Consensus 410 ~~~~~~~~~~e~a~~~l~~~~ 430 (822)
T PRK11930 410 GTEKEFFKTTEAFLKSFAFLK 430 (822)
T ss_pred ccccEEECCHHHHHHHHHHhc
Confidence 566666777777766666553
No 119
>TIGR00683 nanA N-acetylneuraminate lyase. N-acetylneuraminate lyase is also known as N-acetylneuraminic acid aldolase, sialic acid aldolase, or sialate lyase. It is an intracellular enzyme. The structure of this homotetrameric enzyme related to dihydrodipicolinate synthase is known. In Clostridium tertium, the enzyme appears to be in an operon with a secreted sialidase that releases sialic acid from host sialoglycoconjugates. In several E. coli strains, however, this enzyme is responsible for N-acetyl-D-neuraminic acid synthesis for capsule production by condensing N-acetyl-D-mannosamine and pyruvate.
Probab=27.61 E-value=1.4e+02 Score=29.00 Aligned_cols=44 Identities=23% Similarity=0.420 Sum_probs=31.7
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc-----cccCCcchH-HHHHH
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ-----FCASGDYTL-ESIRA 265 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q-----fc~sGD~Tl-ea~~~ 265 (360)
.=+.|++|+++. -+..||.++++....+++ ++-.|.++. ++++.
T Consensus 39 i~~~GstGE~~~----------Lt~eEr~~~~~~~~~~~~~~~pvi~gv~~~~t~~~i~l 88 (290)
T TIGR00683 39 LYVGGSTGENFM----------LSTEEKKEIFRIAKDEAKDQIALIAQVGSVNLKEAVEL 88 (290)
T ss_pred EEECCccccccc----------CCHHHHHHHHHHHHHHhCCCCcEEEecCCCCHHHHHHH
Confidence 457899999886 357799999999998876 456665544 44443
No 120
>cd06277 PBP1_LacI_like_1 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=27.52 E-value=4.1e+02 Score=23.76 Aligned_cols=87 Identities=14% Similarity=0.072 Sum_probs=37.4
Q ss_pred cCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC--chHHHHHHHhh
Q psy11447 254 ASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS--LGDQAKRLTQE 331 (360)
Q Consensus 254 ~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs--lgdqA~~l~~~ 331 (360)
...|| .++.+.|.+.+.+. ...+++++-++..-....-+-.-+.+++.....++....++++. --+.+..+.+.
T Consensus 98 V~~d~-~~~~~~a~~~l~~~---g~~~i~~i~~~~~~~~~~~R~~gf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 173 (268)
T cd06277 98 VLTDN-YSGAYAATEYLIEK---GHRKIGFVGDPLYSPSFEERYEGYKKALLDHGIPFNEDYDITEKEEDEEDIGKFIDE 173 (268)
T ss_pred EEecc-hHHHHHHHHHHHHC---CCCcEEEECCCCCCcchHHHHHHHHHHHHHcCCCCCcceEEEcchhHHHHHHHHHhc
Confidence 44444 55677777776542 34566666444321111111222445554432344444444432 11222233333
Q ss_pred C-CCCceEEecCCC
Q psy11447 332 L-PAGRGFVCMDLT 344 (360)
Q Consensus 332 L-p~Gr~fv~~dt~ 344 (360)
+ +.-.+++|.+..
T Consensus 174 ~~~~~~ai~~~~d~ 187 (268)
T cd06277 174 LKPLPTAFFCSNDG 187 (268)
T ss_pred CCCCCCEEEECCcH
Confidence 3 223566666443
No 121
>PF10996 Beta-Casp: Beta-Casp domain; InterPro: IPR022712 The beta-CASP domain is found C-terminal to the beta-lactamase domain in pre-mRNA 3'-end-processing endonuclease. The active site of this enzyme is located at the interface of these two domains []. ; PDB: 2YCB_B 2XR1_B 2I7T_A 2I7V_A 2I7X_A 3A4Y_A 3IE2_D 3IE1_B 3IE0_D 2DKF_D ....
Probab=27.32 E-value=54 Score=26.94 Aligned_cols=37 Identities=19% Similarity=0.231 Sum_probs=29.3
Q ss_pred CeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecC
Q psy11447 279 DSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGS 320 (360)
Q Consensus 279 e~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgs 320 (360)
+..||+.|+++| ..|-....|.+..... +..+||+|-
T Consensus 80 ~p~Vvias~gml-~~G~s~~~l~~~~~d~----~n~Ii~~gy 116 (126)
T PF10996_consen 80 GPKVVIASSGML-EGGRSRHYLKRLASDP----RNTIIFTGY 116 (126)
T ss_dssp SSEEEEESSTTS-SSSHHHHHHHHHTTST----TSEEEESSS
T ss_pred CCeEEEeCCCCC-CCCHHHHHHHHHcCCC----CCeEEEecC
Confidence 689999999999 7897777776666443 566999986
No 122
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=27.03 E-value=5.2e+02 Score=24.06 Aligned_cols=100 Identities=13% Similarity=0.248 Sum_probs=55.4
Q ss_pred CCCCchHHHHHHHHHHHhhccccCCcc-hHHHHHHHHHHHHhhcccCC-CeEEEEE-ecCCccccCCChhHH---HHHHh
Q psy11447 232 LPRDNKQRLEVIKIMHAHAQFCASGDY-TLESIRASIQDLTTECAQHD-DSILIVL-SDANLDRYGIAPREM---GEALN 305 (360)
Q Consensus 232 ~Pk~~keRl~vl~~M~aH~Qfc~sGD~-Tlea~~~ai~~~a~~~~daD-e~fVivl-SDANL~RYgI~p~~l---~~~l~ 305 (360)
||++|.+ .++..++. --|- |.- --.|++.|...+........ .++||++ ||.. ..|..+ ++.|+
T Consensus 64 plT~D~~--~~~~~L~~--i~~~-g~~~l~~AL~~A~~~L~~~~~~~~~~rivi~v~S~~~-----~d~~~i~~~~~~lk 133 (187)
T cd01452 64 TLTNDQG--KILSKLHD--VQPK-GKANFITGIQIAQLALKHRQNKNQKQRIVAFVGSPIE-----EDEKDLVKLAKRLK 133 (187)
T ss_pred CCCCCHH--HHHHHHHh--CCCC-CcchHHHHHHHHHHHHhcCCCcCCcceEEEEEecCCc-----CCHHHHHHHHHHHH
Confidence 6777733 33333332 1242 443 56788888888853311223 3655554 5433 244444 45554
Q ss_pred cCcccccEEEEEecCchHHHHH---HHhhCCC--CceEEecCC
Q psy11447 306 AEAASVRAYAIFIGSLGDQAKR---LTQELPA--GRGFVCMDL 343 (360)
Q Consensus 306 ~~~p~V~a~~IfIgslgdqA~~---l~~~Lp~--Gr~fv~~dt 343 (360)
. .+|+.++|.+|+..+-.+. +.+.++. |-.||-...
T Consensus 134 k--~~I~v~vI~~G~~~~~~~~l~~~~~~~~~~~~s~~~~~~~ 174 (187)
T cd01452 134 K--NNVSVDIINFGEIDDNTEKLTAFIDAVNGKDGSHLVSVPP 174 (187)
T ss_pred H--cCCeEEEEEeCCCCCCHHHHHHHHHHhcCCCCceEEEeCC
Confidence 4 5899999999997665554 4456653 355555444
No 123
>cd01536 PBP1_ABC_sugar_binding_like Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. The members of this family function as the primary receptors for chemotaxis and transport of many sugar based solutes in bacteria and archaea. The sugar binding domain is also homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Moreover, this periplasmic binding domain, also known as Venus flytrap domain, undergoes transition from an open to a closed conformational state upon the binding of ligands such as lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. This family also includes the periplasmic binding domain of autoinducer-2 (AI-2
Probab=26.99 E-value=4.3e+02 Score=23.11 Aligned_cols=24 Identities=4% Similarity=-0.072 Sum_probs=11.9
Q ss_pred hCCCCceEEecCCCchHHHHHHHH
Q psy11447 331 ELPAGRGFVCMDLTEIPQILQQIF 354 (360)
Q Consensus 331 ~Lp~Gr~fv~~dt~~lP~il~qIf 354 (360)
.-|.=.+++|.+...-.-+++.+-
T Consensus 178 ~~~~~~~i~~~~d~~a~~~~~~l~ 201 (267)
T cd01536 178 ANPDIDAIFAANDSMALGAVAALK 201 (267)
T ss_pred hCCCccEEEEecCCchHHHHHHHH
Confidence 334334666666554454555443
No 124
>cd08559 GDPD_periplasmic_GlpQ_like Periplasmic glycerophosphodiester phosphodiesterase domain (GlpQ) and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in bacterial and eukaryotic glycerophosphodiester phosphodiesterase (GP-GDE, EC 3.1.4.46) similar to Escherichia coli periplasmic phosphodiesterase GlpQ. GP-GDEs are involved in glycerol metabolism and catalyze the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols, which are major sources of carbon and phosphate. In E. coli, there are two major G3P uptake systems: Glp and Ugp, which contain genes coding for two different GP-GDEs. GlpQ gene from the glp operon codes for a periplasmic phosphodiesterase GlpQ. GlpQ is a dimeric enzyme that hydrolyzes periplasmic glycerophosphodiesters, such as glycerophosphocholine (GPC), glycerophosphoethanolanmine (GPE), glycerophosphoglycerol (GPG), glycerophosphoinositol (GPI),
Probab=26.72 E-value=86 Score=30.36 Aligned_cols=35 Identities=29% Similarity=0.418 Sum_probs=26.2
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
.||++|.+.|++.=+. ..|.| |..+||.-|.+|+|
T Consensus 15 ENTl~Af~~A~~~Gad-~iE~DV~lTkDg~lVv~HD~~l~r 54 (296)
T cd08559 15 EHTLAAYALAIEMGAD-YIEQDLVMTKDGVLVARHDPTLDR 54 (296)
T ss_pred cchHHHHHHHHHhCCC-EEEEeeEEccCCCEEEeccchhhc
Confidence 5899999999985332 12333 57889999999998
No 125
>cd08566 GDPD_AtGDE_like Glycerophosphodiester phosphodiesterase domain of Agrobacterium tumefaciens and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in Agrobacterium tumefaciens glycerophosphodiester phosphodiesterase (AtGDE, EC 3.1.4.46) and its uncharacterized eukaryotic homolgoues. Members in this family shows high sequence similarity to Escherichia coli GP-GDE, which catalyzes the degradation of glycerophosphodiesters to produce sn-glycerol-3-phosphate (G3P) and the corresponding alcohols. AtGDE exists as a hexamer that is a trimer of dimers, which is unique among current known GDPD family members. However, it remains unclear if the hexamer plays a physiological role in AtGDE enzymatic function.
Probab=26.42 E-value=72 Score=29.86 Aligned_cols=36 Identities=31% Similarity=0.484 Sum_probs=25.6
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
.||++|.+.|++.=+. -.|.| |...||+-|.||+|-
T Consensus 15 ENTl~af~~A~~~g~d-~iE~DV~~T~Dg~~vv~HD~~l~r~ 55 (240)
T cd08566 15 ENSLAAIEAAIDLGAD-IVEIDVRRTKDGVLVLMHDDTLDRT 55 (240)
T ss_pred ccHHHHHHHHHHcCCC-EEEEEeeEcCCCCEEEECCCCCccc
Confidence 5899999999985321 11222 466788899999993
No 126
>cd08610 GDPD_GDE6 Glycerophosphodiester phosphodiesterase domain of mammalian glycerophosphodiester phosphodiesterase GDE6 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in mammalian GDE6 (also known as glycerophosphodiester phosphodiesterase domain-containing protein 4 (GDPD4)) and their metazoan homologs. Mammalian GDE6 is a transmembrane protein predominantly expressed in the spermatocytes of testis. Although the specific physiological function of mammalian GDE6 has not been elucidated, its different pattern of tissue distribution suggests it might play a critical role in the completion of meiosis during male germ cell differentiation.
Probab=26.38 E-value=99 Score=30.90 Aligned_cols=36 Identities=22% Similarity=0.328 Sum_probs=26.2
Q ss_pred cchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 257 DYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 257 D~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
.||+.|.+.|++.=+. -.|.| |.-+||+-|.+|+|-
T Consensus 37 ENTl~AF~~A~~~Gad-~IE~DV~lTkDG~lVV~HD~tL~Rt 77 (316)
T cd08610 37 ENTMMSFEKAIEHGAH-GLETDVTLSYDGVPFLMHDFTLKRT 77 (316)
T ss_pred ccHHHHHHHHHHcCCC-EEEEEEEEccCCCEEEeCCCccccc
Confidence 6899999999984221 11222 577899999999994
No 127
>TIGR02690 resist_ArsH arsenical resistance protein ArsH. Members of this protein family occur in arsenate resistance operons that include at least two different types of arsenate reductase. ArsH is not required for arsenate resistance in some systems. This family belongs to the larger family of NADPH-dependent FMN reductases (Pfam model pfam03358). The function of ArsH is not known.
Probab=26.36 E-value=90 Score=29.78 Aligned_cols=54 Identities=22% Similarity=0.294 Sum_probs=41.6
Q ss_pred CCChhHHHHHHhcC-cccccEEEEEecCc------hHHHHHHHhhCC-CCceEEecCCCchH
Q psy11447 294 GIAPREMGEALNAE-AASVRAYAIFIGSL------GDQAKRLTQELP-AGRGFVCMDLTEIP 347 (360)
Q Consensus 294 gI~p~~l~~~l~~~-~p~V~a~~IfIgsl------gdqA~~l~~~Lp-~Gr~fv~~dt~~lP 347 (360)
||....|..+-.+. -|..=--++++||+ .--|+++.+.+| .|..+.+.|..+||
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~kI~~I~GSlR~~S~n~~la~~~~~~~~~~g~~v~~idl~~lP 69 (219)
T TIGR02690 8 GIDEPALRPLFSATHKPHIPRILLLYGSLRERSYSRLLAEEAARLLGCEGRETRIFDPPGLP 69 (219)
T ss_pred cCCccchhhccCCCCCCCCCEEEEEECCCCCcchHHHHHHHHHHHHhhcCCEEEEeCcccCC
Confidence 56666666666532 25555678899998 678999999999 59999999999888
No 128
>PRK03803 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional
Probab=26.35 E-value=6.5e+02 Score=25.47 Aligned_cols=113 Identities=16% Similarity=0.248 Sum_probs=62.1
Q ss_pred cCCchhhHHHHhhcc---ccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHH
Q psy11447 192 TGSDLGTSLEAFQDF---DQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQ 268 (360)
Q Consensus 192 n~~DL~rvMEaf~g~---e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~ 268 (360)
-|.+.+.+.+++..| +.|+. ++|.. +.+-+|...++ |..+|+..|++
T Consensus 288 lgi~~~~i~~~L~~f~g~~~R~e--~v~~~---~gv~~idDs~a-------------------------tN~~a~~~al~ 337 (448)
T PRK03803 288 AGLPKEAMLEVLRTFTGLPHRCE--WVREV---AGVDYYNDSKG-------------------------TNVGATVAAIE 337 (448)
T ss_pred cCCCHHHHHHHHhhCCCCCCceE--EEEEe---CCeEEEEcCCc-------------------------CCHHHHHHHHH
Confidence 378877766666554 67765 45532 22445554332 56788888888
Q ss_pred HHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCC-CceEEecCCCchH
Q psy11447 269 DLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPA-GRGFVCMDLTEIP 347 (360)
Q Consensus 269 ~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~-Gr~fv~~dt~~lP 347 (360)
.+.+. .+.++++|+..-. + |...+.+.+.+... +. .+|++|. .++.|.+.+.. +..++|-|..+..
T Consensus 338 ~l~~~---~~~~iilI~Gg~~--k-~~d~~~l~~~l~~~---~~-~vil~G~---~~~~i~~~l~~~~~~~~~~~~~~a~ 404 (448)
T PRK03803 338 GLGAH---IQGKLVLIAGGDG--K-GADFSPLREPVAKY---VR-AVVLIGR---DADKIAAALGGAVPLVRVATLAEAV 404 (448)
T ss_pred hhhhc---CCCCEEEEECCCC--C-CCCHHHHHHHHHhh---CC-EEEEECC---CHHHHHHHHhcCCCEEEeCCHHHHH
Confidence 88532 1236666765321 1 55556677777443 33 3577764 34556655542 1345555544433
No 129
>PRK07328 histidinol-phosphatase; Provisional
Probab=26.29 E-value=52 Score=31.29 Aligned_cols=18 Identities=28% Similarity=0.355 Sum_probs=14.4
Q ss_pred HHHHhhccccCCcchHHH
Q psy11447 245 IMHAHAQFCASGDYTLES 262 (360)
Q Consensus 245 ~M~aH~Qfc~sGD~Tlea 262 (360)
.+|.||+||.=|..|+|-
T Consensus 5 D~H~HT~~s~~~~~~~ee 22 (269)
T PRK07328 5 DYHMHTPLCGHAVGTPEE 22 (269)
T ss_pred eeccCCCCCCCCCCCHHH
Confidence 359999999888888653
No 130
>cd06288 PBP1_sucrose_transcription_regulator Ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=26.16 E-value=3.2e+02 Score=24.24 Aligned_cols=33 Identities=9% Similarity=0.032 Sum_probs=15.1
Q ss_pred ccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC
Q psy11447 253 CASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN 289 (360)
Q Consensus 253 c~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN 289 (360)
|..-||+ ++.+.|.+.+.+. .-..++++-++.+
T Consensus 95 ~v~~d~~-~~~~~a~~~l~~~---g~~~i~~l~~~~~ 127 (269)
T cd06288 95 SVVPDEE-QGGYDATRHLLAA---GHRRIAFINGEPW 127 (269)
T ss_pred eEEEccH-HHHHHHHHHHHHc---CCceEEEEeCCcc
Confidence 3444554 4555555555432 2234555444433
No 131
>PF09606 Med15: ARC105 or Med15 subunit of Mediator complex non-fungal; InterPro: IPR019087 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. The proteins in this entry represent subunit Med15 of the Mediator complex. They contain a single copy of the approximately 70 residue ARC105 domain. The ARC105 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, ARC105 is a critical transducer of gene activation signals that control early metazoan development []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex; PDB: 2GUT_A.
Probab=25.94 E-value=23 Score=40.00 Aligned_cols=103 Identities=18% Similarity=0.317 Sum_probs=0.0
Q ss_pred hHHHHHHhHHcCCCHHhHHHHHHhhHhhhhHHHHHhhhhhcccccCCCCchhHHHHHHhc-----CCC--CChhHHHHHH
Q psy11447 3 LKAFKQKLKEIQMSEYDAKVYSEYSLPVQNQQNDQHRDLFSVHVIDNPSPESELALLSSY-----GPD--VPRDTLIRLV 75 (360)
Q Consensus 3 ~ka~~~rLkEi~Ms~~da~~Y~~~~~~vgndf~~~lgd~fs~h~v~np~~esE~~~L~~y-----gp~--v~~~~l~~Lv 75 (360)
..+|.||||++ ++|-++|++-+.+ + -.|.+..+++..+++. +|+ +|.++|.|..
T Consensus 536 ~~~y~eK~~qL----------skYiePL~rmi~k-~--------~~~~~~~~~l~kmk~ll~iL~~p~~r~pl~tL~kce 596 (799)
T PF09606_consen 536 EQAYLEKLRQL----------SKYIEPLRRMINK-M--------DKDEGRKKDLSKMKSLLDILSNPSKRCPLETLQKCE 596 (799)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHh----------cccccHHHHHHHh-c--------ccCCCchhHHHhHHHHHHHhcCCccCCchHHHHHHH
Confidence 35778888875 4667777776665 3 1233333444444444 565 5899999998
Q ss_pred HhhhhhhhhhccCCCC-CCccchhhhhhhhhhhhhHHHhhhhhccccCCchhhHHHHHHHHHHH
Q psy11447 76 RAFGEFRELSDKSLLP-YPYSTREAVHIVKHLQEGVLGQHENTRKFNGYDGRLDRELEAVIMVL 138 (360)
Q Consensus 76 ~afgeLr~~~deGli~-ypYs~Re~v~ivKhL~~~v~dv~~nv~rFd~yD~rl~R~lE~v~~Vm 138 (360)
.+...|+. +-|..+ .| .. -.+.|++.+=.++..|+.-|-|.+--+..++
T Consensus 597 ~~l~kl~~--~~~~~~~~p-----------~~-~pll~~v~~~l~sp~~nhtL~rtf~p~~~~~ 646 (799)
T PF09606_consen 597 IVLEKLKN--DMGVPTPPP-----------MC-QPLLDAVMSNLQSPVFNHTLQRTFGPAMTAL 646 (799)
T ss_dssp ----------------------------------------------------------------
T ss_pred HHHHHhhc--ccCCCCCCC-----------CC-chHHHHHHhhcCCcccchhHHHHHHHHHhhc
Confidence 88876662 334433 33 22 6677888887887778888877666666644
No 132
>PF01674 Lipase_2: Lipase (class 2); InterPro: IPR002918 Lipases or triacylglycerol acylhydrolases hydrolyse ester bonds in triacylglycerol giving diacylglycerol, monoacylglycerol, glycerol and free fatty acids []. This group of lipases has been called class 2 as they are not clearly related to other lipase families, and includes LipA and LipB from Bacillus subtilis [] and uncharacterised proteins from Caenorhabditis.; PDB: 2VTV_B 2X76_A 2X5X_A 2QXU_A 3QMM_A 1I6W_A 3D2C_J 2QXT_B 1R50_A 1T2N_A ....
Probab=25.82 E-value=38 Score=32.16 Aligned_cols=23 Identities=26% Similarity=0.443 Sum_probs=15.3
Q ss_pred hHHHHhhccccceeEeeeeccCCC
Q psy11447 198 TSLEAFQDFDQRISLDIIGHSGET 221 (360)
Q Consensus 198 rvMEaf~g~e~k~~ydivGHSGd~ 221 (360)
+..++...+... +-||||||=-+
T Consensus 64 ~fI~~Vl~~TGa-kVDIVgHS~G~ 86 (219)
T PF01674_consen 64 AFIDAVLAYTGA-KVDIVGHSMGG 86 (219)
T ss_dssp HHHHHHHHHHT---EEEEEETCHH
T ss_pred HHHHHHHHhhCC-EEEEEEcCCcC
Confidence 355666667888 99999999544
No 133
>cd08318 Death_NMPP84 Death domain of Nuclear Matrix Protein P84. Death domain (DD) found in the Nuclear Matrix Protein P84 (also known as HPR1 or THOC1). HPR1/p84 resides in the nuclear matrix and is part of the THO complex, also called TREX (transcription/export) complex, which functions in mRNP biogenesis at the interface between transcription and export of mRNA from the nucleus. Mice lacking THOC1 have abnormal testis development and are sterile. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.
Probab=25.82 E-value=1.6e+02 Score=23.84 Aligned_cols=54 Identities=17% Similarity=0.183 Sum_probs=37.2
Q ss_pred HHHhHHcCCCHHhHHHHHHhhHhhhhHHHHHhhhhhcccccCCCCchhHHHHHHhcCCCCChhHHHHHHHhhhh
Q psy11447 7 KQKLKEIQMSEYDAKVYSEYSLPVQNQQNDQHRDLFSVHVIDNPSPESELALLSSYGPDVPRDTLIRLVRAFGE 80 (360)
Q Consensus 7 ~~rLkEi~Ms~~da~~Y~~~~~~vgndf~~~lgd~fs~h~v~np~~esE~~~L~~ygp~v~~~~l~~Lv~afge 80 (360)
++=-+++|+|+.|-+..+.=...+..+.++.| +.|.++.|. ..++..|+++..+
T Consensus 22 k~Lar~LGls~~dI~~i~~~~~~~~eq~~~mL-----------------~~W~~r~g~---~AT~~~L~~aL~~ 75 (86)
T cd08318 22 KTLAPHLEMKDKEIRAIESDSEDIKMQAKQLL-----------------VAWQDREGS---QATPETLITALNA 75 (86)
T ss_pred HHHHHHcCCCHHHHHHHHhcCCCHHHHHHHHH-----------------HHHHHhcCc---cccHHHHHHHHHH
Confidence 33457889999988888755555566666655 478888774 4567777777654
No 134
>KOG2125|consensus
Probab=25.62 E-value=43 Score=37.52 Aligned_cols=65 Identities=28% Similarity=0.324 Sum_probs=40.1
Q ss_pred EeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhc---------cccCCcch-HHHHHHHHHHHHhhcccCCCeE
Q psy11447 212 LDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQ---------FCASGDYT-LESIRASIQDLTTECAQHDDSI 281 (360)
Q Consensus 212 ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Q---------fc~sGD~T-lea~~~ai~~~a~~~~daDe~f 281 (360)
.|-|||++- +.-|+ +|..-||.=+|+++||-..| -|..|||- .|+.-+....-+ |-+-..
T Consensus 202 lDHIGH~~G-~~Sp~-----vp~KLkEmDeiv~~I~~~~~~~~s~d~tllil~gDHGM~e~GnHGGss~~----ET~s~l 271 (760)
T KOG2125|consen 202 LDHIGHVLG-PSSPL-----VPAKLKEMDEIVKRIHDYLMEHRSGDQTLLILCGDHGMTESGNHGGSSPG----ETSSPL 271 (760)
T ss_pred cceeccccC-Ccchh-----hhHHHHHHHHHHHHHHHHHhhcCCCCceEEEEEccccccccCCCCCCCcc----cccccE
Confidence 567999974 44454 57889999999999998443 35555553 233334333322 334456
Q ss_pred EEEEe
Q psy11447 282 LIVLS 286 (360)
Q Consensus 282 VivlS 286 (360)
++..+
T Consensus 272 ~~~~~ 276 (760)
T KOG2125|consen 272 LFLLP 276 (760)
T ss_pred EEEec
Confidence 66666
No 135
>cd06304 PBP1_BmpA_like Periplasmic binding component of a family of basic membrane lipoproteins from Borrelia and various putative lipoproteins from other bacteria. Periplasmic binding component of a family of basic membrane lipoproteins from Borrelia and various putative lipoproteins from other bacteria. These outer membrane proteins include Med, a cell-surface localized protein regulating the competence transcription factor gene comK in Bacillus subtilis, and PnrA, a periplasmic purine nucleoside binding protein of an ATP-binding cassette (ABC) transport system in Treponema pallidum. All contain the type I periplasmic sugar-binding protein-like fold.
Probab=25.54 E-value=4.6e+02 Score=23.72 Aligned_cols=113 Identities=9% Similarity=-0.030 Sum_probs=55.3
Q ss_pred CccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccC-CChhHH
Q psy11447 222 HSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYG-IAPREM 300 (360)
Q Consensus 222 ~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYg-I~p~~l 300 (360)
+.+|+|..+.++.. +. .-.|..+|| -+|.+.|.. ++... ..-..+.++..+.+ ... -+-+-+
T Consensus 79 ~~ipvv~~~~~~~~-~~-----------~~~~v~~d~-~~~~~~a~~-l~~~~-~g~~~I~~i~~~~~--~~~~~R~~Gf 141 (260)
T cd06304 79 PDVKFAIIDGVVDA-PP-----------NVASYVFRE-YEGSYLAGV-LAALM-TKTGKVGFVGGMPI--PEVNRFINGF 141 (260)
T ss_pred CCCEEEEecCccCC-CC-----------Ceeeeecch-HHHHHHHHH-HHHHh-ccCCceEEEecccc--HHHHHHHHHH
Confidence 34677777765422 10 113666777 445555554 44321 13346666655332 111 112345
Q ss_pred HHHHhcCcccccEEEEEecCch--HHHHHHHh----hCCCCceEEecCCCchHHHHHHH
Q psy11447 301 GEALNAEAASVRAYAIFIGSLG--DQAKRLTQ----ELPAGRGFVCMDLTEIPQILQQI 353 (360)
Q Consensus 301 ~~~l~~~~p~V~a~~IfIgslg--dqA~~l~~----~Lp~Gr~fv~~dt~~lP~il~qI 353 (360)
.++|....+.++...++-++.. +++....+ .-| .+++|.+..--.-+++.+
T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~ai~~~~d~~A~gv~~al 198 (260)
T cd06304 142 AAGAKSVNPDITVLVIYTGSFFDPAKGKEAALALIDQGA--DVIFAAAGGTGPGVIQAA 198 (260)
T ss_pred HHHHHHhCCCcEEEEEEecCccCcHHHHHHHHHHHhCCC--CEEEEcCCCCchHHHHHH
Confidence 5677765566665555444442 33322222 224 678887766555555543
No 136
>PF04155 Ground-like: Ground-like domain; InterPro: IPR007284 This group of proteins contain one or more copies of the ground-like domain, which are specific to Caenorhabditis elegans and Caenorhabditis briggsae. It has been proposed that the ground-like domain containing proteins may bind and modulate the activity of Patched-like membrane molecules, reminiscent of the modulating activities of neuropeptides [].
Probab=24.77 E-value=1.4e+02 Score=23.41 Aligned_cols=32 Identities=16% Similarity=0.296 Sum_probs=27.5
Q ss_pred chHHHHHHHHHHHHhhcccCCCeEEEEEecCCcc
Q psy11447 258 YTLESIRASIQDLTTECAQHDDSILIVLSDANLD 291 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~ 291 (360)
.++..+..+|...++. .....|.++.|..++.
T Consensus 22 ~~~~~s~~~Iq~~~e~--~f~~~f~vIcs~~~Fs 53 (76)
T PF04155_consen 22 CNLSISKRAIQKAAEK--RFGGSFEVICSEGDFS 53 (76)
T ss_pred CCHHHHHHHHHHHHHH--HhCCCEEEEEeCCCce
Confidence 7888889999998876 7777999999999885
No 137
>cd08612 GDPD_GDE4 Glycerophosphodiester phosphodiesterase domain of mammalian glycerophosphodiester phosphodiesterase GDE4 and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in mammalian GDE4 (also known as glycerophosphodiester phosphodiesterase domain-containing protein 1 (GDPD1)) and similar proteins. Mammalian GDE4 is a transmembrane protein whose cellular function has not yet been elucidated. It is expressed widely, including in placenta, liver, kidney, pancreas, spleen, thymus, ovary, small intestine and peripheral blood leukocytes. It is also expressed in the growth cones in neuroblastoma Neuro2a cells, which suggests GDE4 may play some distinct role from other members of the GDE family.
Probab=24.12 E-value=1.2e+02 Score=29.47 Aligned_cols=55 Identities=24% Similarity=0.269 Sum_probs=38.6
Q ss_pred ccceeEeeeeccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCC-----CeE
Q psy11447 207 DQRISLDIIGHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHD-----DSI 281 (360)
Q Consensus 207 e~k~~ydivGHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daD-----e~f 281 (360)
..++...|+||-|.+... | .||++|.+.|++.=+. ..|.| |..
T Consensus 22 ~~~~~~~iiAHRG~~~~~--------P-----------------------ENTl~Af~~A~~~Gad-~iE~DV~lTkDG~ 69 (300)
T cd08612 22 KSPFPCRHISHRGGSGEN--------L-----------------------ENTMEAFEHAVKVGTD-MLELDVHLTKDGQ 69 (300)
T ss_pred ccCCCCCEEECCCCCCCC--------C-----------------------ccHHHHHHHHHHcCCC-EEEEEeeECcCCe
Confidence 457888899999964321 1 5899999999975321 11222 567
Q ss_pred EEEEecCCcccc
Q psy11447 282 LIVLSDANLDRY 293 (360)
Q Consensus 282 VivlSDANL~RY 293 (360)
+||+=|.+|+|-
T Consensus 70 lVV~HD~~l~Rt 81 (300)
T cd08612 70 VVVSHDENLLRS 81 (300)
T ss_pred EEEECCcccccc
Confidence 888889999884
No 138
>cd01537 PBP1_Repressors_Sugar_Binding_like Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems. Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems, all of which contain the type I periplasmic binding protein-like fold. Their specific ligands include lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor; in general the sugar binding domain in this family binds a sugar, which in turn changes the DNA binding activity of the repressor domain. The core structure of the periplasmic binding proteins is classified into two types and they differ in number and order of beta strands in each domain: type I, which has six beta strands, and type II, which has five beta strands. These two distinct structural arrangem
Probab=23.95 E-value=3.7e+02 Score=23.25 Aligned_cols=15 Identities=20% Similarity=0.182 Sum_probs=8.0
Q ss_pred CCCceEEecCCCchH
Q psy11447 333 PAGRGFVCMDLTEIP 347 (360)
Q Consensus 333 p~Gr~fv~~dt~~lP 347 (360)
|..-..++.|.....
T Consensus 205 ~~~i~i~~~d~~~~~ 219 (264)
T cd01537 205 PDDISVIGFDGTPEA 219 (264)
T ss_pred CCCeEEEeecCccHH
Confidence 434456666655543
No 139
>COG1378 Predicted transcriptional regulators [Transcription]
Probab=23.91 E-value=68 Score=30.95 Aligned_cols=23 Identities=30% Similarity=0.634 Sum_probs=19.4
Q ss_pred HHHHhHHcCCCHHhHHHHHHhhH
Q psy11447 6 FKQKLKEIQMSEYDAKVYSEYSL 28 (360)
Q Consensus 6 ~~~rLkEi~Ms~~da~~Y~~~~~ 28 (360)
..+.|+++|+|+|||+.|-.+..
T Consensus 5 ~~~~L~~lGlt~yEa~vY~aLl~ 27 (247)
T COG1378 5 LEENLQKLGLTEYEAKVYLALLC 27 (247)
T ss_pred HHHHHHHcCCCHHHHHHHHHHHH
Confidence 45789999999999999987643
No 140
>PF08862 DUF1829: Domain of unknown function DUF1829; InterPro: IPR014961 This short protein is usually associated with IPR014960 from INTERPRO.
Probab=23.76 E-value=1.3e+02 Score=24.74 Aligned_cols=31 Identities=13% Similarity=0.272 Sum_probs=17.3
Q ss_pred hHHHHHHHHHHHHhhcccCCCeEEEEEecCC
Q psy11447 259 TLESIRASIQDLTTECAQHDDSILIVLSDAN 289 (360)
Q Consensus 259 Tlea~~~ai~~~a~~~~daDe~fVivlSDAN 289 (360)
++.++-.+..++.+.....+..|.|++-|-+
T Consensus 34 ~~~~~~f~w~D~~~~~~~~~~~~~ii~ND~e 64 (88)
T PF08862_consen 34 NIANAIFAWEDTKKRNRKSNSKFYIILNDSE 64 (88)
T ss_pred HHHHHHHHHhhhhhhccCCCceEEEEECCCC
Confidence 4455555655662222266667777776654
No 141
>cd07079 ALDH_F18-19_ProA-GPR Gamma-glutamyl phosphate reductase (GPR), aldehyde dehydrogenase families 18 and 19. Gamma-glutamyl phosphate reductase (GPR), a L-proline biosynthetic pathway (PBP) enzyme that catalyzes the NADPH dependent reduction of L-gamma-glutamyl 5-phosphate into L-glutamate 5-semialdehyde and phosphate. The glutamate route of the PBP involves two enzymatic steps catalyzed by gamma-glutamyl kinase (GK, EC 2.7.2.11) and GPR (EC 1.2.1.41). These enzymes are fused into the bifunctional enzyme, ProA or delta(1)-pyrroline-5-carboxylate synthetase (P5CS) in plants and animals, whereas they are separate enzymes in bacteria and yeast. In humans, the P5CS (ALDH18A1), an inner mitochondrial membrane enzyme, is essential to the de novo synthesis of the amino acids proline and arginine. Tomato (Lycopersicon esculentum) has both the prokaryotic-like polycistronic operons encoding GK and GPR (PRO1, ALDH19) and the full-length, bifunctional P5CS (PRO2, ALDH18B1).
Probab=23.62 E-value=7.9e+02 Score=25.02 Aligned_cols=74 Identities=7% Similarity=0.067 Sum_probs=50.1
Q ss_pred cCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHh--------hCCCCceEEecCCCchH
Q psy11447 276 QHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQ--------ELPAGRGFVCMDLTEIP 347 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~--------~Lp~Gr~fv~~dt~~lP 347 (360)
...+..|-++. |-.++...+.|..+ |.|+. +.|.|| -.-+..+.+ +|-.--..|++++.+|.
T Consensus 165 G~P~gvv~~v~-------g~~~~~~~~~l~~~-~~v~~-i~fTGS-~~~G~~i~~~a~~pv~lELGGk~p~IV~~dADl~ 234 (406)
T cd07079 165 GLPEDAVQLIP-------DTDREAVQELLKLD-DYIDL-IIPRGG-AGLIRFVVENATIPVIKHGDGNCHVYVDESADLE 234 (406)
T ss_pred CCCcccEEEec-------CCChHHHHHHHcCC-CCccE-EEeCCC-HHHHHHHHHhcCCCEEeecCCcceEEEeCCCCHH
Confidence 45566666663 32245678889987 98875 899999 334444443 34323346788889999
Q ss_pred HHHHHHHHhccC
Q psy11447 348 QILQQIFSASLL 359 (360)
Q Consensus 348 ~il~qIf~s~~l 359 (360)
...++|..|.+.
T Consensus 235 ~Aa~~i~~~~f~ 246 (406)
T cd07079 235 MAVRIVVNAKTQ 246 (406)
T ss_pred HHHHHHHHcccC
Confidence 999999887653
No 142
>cd03860 M14_CP_A-B_like The Peptidase M14 Carboxypeptidase (CP) A/B subfamily is one of two main M14 CP subfamilies defined by sequence and structural homology, the other being the N/E subfamily. CPs hydrolyze single, C-terminal amino acids from polypeptide chains. They have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. There are nine members in the A/B family: CPA1, CPA2, CPA3, CPA4, CPA5, CPA6, CPB, CPO and CPU. CPA1, CPA2 and CPB are produced by the
Probab=23.49 E-value=18 Score=35.07 Aligned_cols=38 Identities=21% Similarity=0.249 Sum_probs=31.9
Q ss_pred hhhHHHHhh-ccccceeEeeeeccCCCCccccccCCCCC
Q psy11447 196 LGTSLEAFQ-DFDQRISLDIIGHSGETHSIPFVTSNNLP 233 (360)
Q Consensus 196 L~rvMEaf~-g~e~k~~ydivGHSGd~~~i~lV~~~~~P 233 (360)
+...|+++. .+.+..++..+|+|-++..|+.++.++++
T Consensus 7 i~~~l~~l~~~~p~~~~~~~iG~S~egr~i~~l~i~~~~ 45 (294)
T cd03860 7 IYAWLDELAQKYPDLVTVETIGRSYEGRPIKGVKISNGG 45 (294)
T ss_pred HHHHHHHHHHHCCCceEEEeeeeCCCCCeEEEEEEecCC
Confidence 555666663 57789999999999999999999999876
No 143
>cd08583 PI-PLCc_GDPD_SF_unchar1 Uncharacterized hypothetical proteins similar to the catalytic domains of Phosphoinositide-specific phospholipaseand Glycerophosphodiester phosphodiesterases. This subfamily corresponds to a group of uncharacterized hypothetical proteins similar to the catalytic domains of Phosphoinositide-specific phospholipase C (PI-PLC), and glycerophosphodiester phosphodiesterases (GP-GDE), and also sphingomyelinases D (SMases D) and similar proteins. They hydrolyze the 3'-5' phosphodiester bonds in different substrates, utilizing a similar mechanism of general base and acid catalysis involving two conserved histidine residues.
Probab=23.21 E-value=89 Score=28.90 Aligned_cols=37 Identities=11% Similarity=0.091 Sum_probs=26.2
Q ss_pred CcchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCcccc
Q psy11447 256 GDYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDRY 293 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~RY 293 (360)
-.||+.|.+.|++.=+. -.|.| |..+||+=|.++.|-
T Consensus 14 pENTl~Af~~A~~~G~d-~iE~DV~lTkDg~lVv~HD~~~~r~ 55 (237)
T cd08583 14 YTNSLDAFEHNYKKGYR-VFEVDLSLTSDGVLVARHSWDESLL 55 (237)
T ss_pred CccHHHHHHHHHHhCCC-EEEEEeeEccCCCEEEEECCcCchh
Confidence 37899999999985332 12444 577788899988653
No 144
>PF02811 PHP: PHP domain; InterPro: IPR004013 The PHP (Polymerase and Histidinol Phosphatase) domain is a putative phosphoesterase domain. This family is often associated with an N-terminal region IPR003141 from INTERPRO.; GO: 0003824 catalytic activity; PDB: 2WJE_A 3QY8_A 2WJD_A 2WJF_A 1PB0_B 1M68_A 1M65_A 3E38_B 2W9M_A 3E0F_A ....
Probab=22.81 E-value=56 Score=27.57 Aligned_cols=35 Identities=17% Similarity=0.315 Sum_probs=20.4
Q ss_pred HHHhhccc-cCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecC
Q psy11447 246 MHAHAQFC-ASGDYTLESIRASIQDLTTECAQHDDSILIVLSDA 288 (360)
Q Consensus 246 M~aH~Qfc-~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDA 288 (360)
+|-||.|| .=|..|++-+- +...+. . .=.|.+||=
T Consensus 3 lH~HT~~s~~dg~~~~~e~v---~~A~~~--G---l~~i~iTDH 38 (175)
T PF02811_consen 3 LHVHTKYSILDGKDSPEEYV---EQAKEK--G---LDAIAITDH 38 (175)
T ss_dssp EEB--TTTSSTSSSSHHHHH---HHHHHT--T---ESEEEEEEE
T ss_pred ccccccCcchhhcCCHHHHH---HHHHHc--C---CCEEEEcCC
Confidence 57899988 66667666444 333222 2 337889998
No 145
>cd08319 Death_RAIDD Death domain of RIP-associated ICH-1 homologous protein with a death domain. Death domain (DD) of RAIDD (RIP-associated ICH-1 homologous protein with a death domain), also known as CRADD (Caspase and RIP adaptor). RAIDD is an adaptor protein that together with the p53-inducible protein PIDD and caspase-2, forms the PIDDosome complex, which is required for caspase-2 activation and plays a role in mediating stress-induced apoptosis. RAIDD contains an N-terminal Caspase Activation and Recruitment Domain (CARD), which interacts with the caspase-2 CARD, and a C-terminal DD, which interacts with the DD of PIDD. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD, DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other pr
Probab=22.76 E-value=2.1e+02 Score=23.45 Aligned_cols=54 Identities=20% Similarity=0.271 Sum_probs=38.0
Q ss_pred HHHhHHcCCCHHhHHHHHHh-hHhhhhHHHHHhhhhhcccccCCCCchhHHHHHHhcCCCCChhHHHHHHHhhhh
Q psy11447 7 KQKLKEIQMSEYDAKVYSEY-SLPVQNQQNDQHRDLFSVHVIDNPSPESELALLSSYGPDVPRDTLIRLVRAFGE 80 (360)
Q Consensus 7 ~~rLkEi~Ms~~da~~Y~~~-~~~vgndf~~~lgd~fs~h~v~np~~esE~~~L~~ygp~v~~~~l~~Lv~afge 80 (360)
++=.+++|||+.|-...+.- ...+.-|+++.| +.|-.+.| ..+++.+|+++...
T Consensus 17 ~~Lar~Lgls~~~I~~i~~~~p~~l~eQv~~mL-----------------~~W~~r~G---~~ATv~~L~~aL~~ 71 (83)
T cd08319 17 EQVLLDLGLSQTDIYRCKENHPHNVQSQIVEAL-----------------VKWRQRFG---KKATVQSLIQSLKA 71 (83)
T ss_pred HHHHHHcCCCHHHHHHHHHhCCCCHHHHHHHHH-----------------HHHHHhcC---CCCcHHHHHHHHHH
Confidence 33457889998888877664 446777777766 47888887 46667777777544
No 146
>cd01840 SGNH_hydrolase_yrhL_like yrhL-like subfamily of SGNH-hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Most members of this sub-family appear to co-occur with N-terminal acyltransferase domains. Might be involved in lipid metabolism.
Probab=22.76 E-value=3.2e+02 Score=23.21 Aligned_cols=79 Identities=19% Similarity=0.226 Sum_probs=41.9
Q ss_pred cCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCCh-hHHHHHHhcCcccccEEEEEecCch----HHHHHH
Q psy11447 254 ASGDYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAP-REMGEALNAEAASVRAYAIFIGSLG----DQAKRL 328 (360)
Q Consensus 254 ~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p-~~l~~~l~~~~p~V~a~~IfIgslg----dqA~~l 328 (360)
+.||+-..++..++.+.-.. ..| ||--.|=.... ..|.+++... ......+|++|+++ ++-+.|
T Consensus 4 ~~GDSv~~~~~~~~~~~~p~------~~i----~a~~g~~~~~~~~~l~~~~~~~-~~~d~vvi~lGtNd~~~~~nl~~i 72 (150)
T cd01840 4 AIGDSVMLDSSPALQEIFPN------IQI----DAKVGRQMSEAPDLIRQLKDSG-KLRKTVVIGLGTNGPFTKDQLDEL 72 (150)
T ss_pred EEeehHHHchHHHHHHHCCC------CEE----EeeecccHHHHHHHHHHHHHcC-CCCCeEEEEecCCCCCCHHHHHHH
Confidence 45888877777776664322 223 34433322211 2233333332 35677888888877 455566
Q ss_pred HhhCCCCceEEecCC
Q psy11447 329 TQELPAGRGFVCMDL 343 (360)
Q Consensus 329 ~~~Lp~Gr~fv~~dt 343 (360)
.+.+++|+--+.+.+
T Consensus 73 i~~~~~~~~ivlv~~ 87 (150)
T cd01840 73 LDALGPDRQVYLVNP 87 (150)
T ss_pred HHHcCCCCEEEEEEC
Confidence 666766554444333
No 147
>cd07077 ALDH-like NAD(P)+-dependent aldehyde dehydrogenase-like (ALDH-like) family. The aldehyde dehydrogenase-like (ALDH-like) group of the ALDH superfamily of NAD(P)+-dependent enzymes which, in general, oxidize a wide range of endogenous and exogenous aliphatic and aromatic aldehydes to their corresponding carboxylic acids and play an important role in detoxification. This group includes families ALDH18, ALDH19, and ALDH20 and represents such proteins as gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA reductase, and coenzyme A acylating aldehyde dehydrogenase. All of these proteins have a conserved cysteine that aligns with the catalytic cysteine of the ALDH group.
Probab=22.74 E-value=7.9e+02 Score=24.69 Aligned_cols=59 Identities=17% Similarity=0.300 Sum_probs=42.3
Q ss_pred hHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCC--------CCceEEecC-CCchHHHHHHHHHhccC
Q psy11447 298 REMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELP--------AGRGFVCMD-LTEIPQILQQIFSASLL 359 (360)
Q Consensus 298 ~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp--------~Gr~fv~~d-t~~lP~il~qIf~s~~l 359 (360)
..+...|..+ |.|. .++|.|| .+-...+.+..| .|.+++..+ ..++....+.|..|.+.
T Consensus 170 ~~~~~~l~~~-~~vd-~v~ftGs-~~~~~~v~~~~~~~~~~~~~gg~~~~iv~~dad~~~a~~~~~~~~~~ 237 (397)
T cd07077 170 DELAEELLSH-PKID-LIVATGG-RDAVDAAVKHSPHIPVIGFGAGNSPVVVDETADEERASGSVHDSKFF 237 (397)
T ss_pred HHHHHHHHcC-CCCC-EEEecCC-HHHHHHHHHcCCCCceEEecCCcceEEEcCCCCHHHHHHHHHHhhcc
Confidence 3577888887 8765 5899999 455666666664 366665554 78999999888877653
No 148
>PF13660 DUF4147: Domain of unknown function (DUF4147); PDB: 1X3L_A 2B8N_A.
Probab=22.38 E-value=79 Score=30.69 Aligned_cols=81 Identities=20% Similarity=0.305 Sum_probs=41.9
Q ss_pred HHHHhhcc-ccceeEeee----eccCCCCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhh
Q psy11447 199 SLEAFQDF-DQRISLDII----GHSGETHSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTE 273 (360)
Q Consensus 199 vMEaf~g~-e~k~~ydiv----GHSGd~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~ 273 (360)
+.+|++.. .+++..=+| ||..+...|.+++.++|= ....+++|++..++-+.+-
T Consensus 55 MA~a~~~~lg~~i~~G~vv~~~g~~~~~~~i~v~~~~HP~---------------------Pd~~s~~aa~~il~~~~~~ 113 (238)
T PF13660_consen 55 MAEAAEEILGDRIVGGLVVVPYGHESPLPRIEVLEGGHPL---------------------PDENSVRAARRILELAREL 113 (238)
T ss_dssp HHHHHHHHCGGCEEEEEEEEETT-----TTSEEEEE-SSS-----------------------HHHHHHHHHHHHHHCC-
T ss_pred HHHHHHHHhcccccCceEEeCCcccCCCCCEEEEECCCCC---------------------CCHHHHHHHHHHHHHHhcC
Confidence 55666443 556533332 677788888899988763 3445788888877776543
Q ss_pred cccCCCeEEEEEecC-----CccccCCChhHHHH
Q psy11447 274 CAQHDDSILIVLSDA-----NLDRYGIAPREMGE 302 (360)
Q Consensus 274 ~~daDe~fVivlSDA-----NL~RYgI~p~~l~~ 302 (360)
..|+-++++||=- .+-.=|||-+++.+
T Consensus 114 --~~~dlvl~LiSGGgSALl~~P~~gisLed~~~ 145 (238)
T PF13660_consen 114 --TEDDLVLVLISGGGSALLELPADGISLEDKQE 145 (238)
T ss_dssp ---TTSEEEEEE-TTHHHHS--B-TT--HHHHHH
T ss_pred --CCCCeEEEEecCChHHhhcCCCCCCCHHHHHH
Confidence 6677888888741 22333666665543
No 149
>PRK14573 bifunctional D-alanyl-alanine synthetase A/UDP-N-acetylmuramate--L-alanine ligase; Provisional
Probab=22.27 E-value=6e+02 Score=28.28 Aligned_cols=150 Identities=14% Similarity=0.219 Sum_probs=80.2
Q ss_pred CCchhhHHHHh---hccccceeEeeeeccCCCCccccc-cCCCCCCCchHHHHHHHHHHHhh-ccccCCcchHHHHHHHH
Q psy11447 193 GSDLGTSLEAF---QDFDQRISLDIIGHSGETHSIPFV-TSNNLPRDNKQRLEVIKIMHAHA-QFCASGDYTLESIRASI 267 (360)
Q Consensus 193 ~~DL~rvMEaf---~g~e~k~~ydivGHSGd~~~i~lV-~~~~~Pk~~keRl~vl~~M~aH~-Qfc~sGD~Tlea~~~ai 267 (360)
|.+.+.+.+++ .+...|+. ++++.+ .+.++ ....-|..-+.=++.++...... =.|.-|+|+...++.-+
T Consensus 287 gi~~~~i~~~L~~f~~~~~R~e--~~~~~~---~~~~i~D~ahnP~~~~a~l~~l~~~~~~~rli~vf~~~~~~~~~~~~ 361 (809)
T PRK14573 287 GIDEGAIRNALKGFSGVQRRLE--RKNSSE---TFLFLEDYAHHPSEISCTLRAVRDAVGLRRIIAICQPHRFSRLRECL 361 (809)
T ss_pred CCCHHHHHHHHHhCCCCCCCCE--EEeccC---CcEEEEECCCCHHHHHHHHHHHHhhcCCCEEEEEEcCCcchhHHHHH
Confidence 67755555555 45577775 465432 23333 34444554444455554432111 25777999999999988
Q ss_pred HHHHhhcccCCCeEEEEEecCCc-cccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCce---EEecCC
Q psy11447 268 QDLTTECAQHDDSILIVLSDANL-DRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRG---FVCMDL 343 (360)
Q Consensus 268 ~~~a~~~~daDe~fVivlSDANL-~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~---fv~~dt 343 (360)
++++..-..+|..++.-+..|.- .|-|+++++|++.+... ..++.. ++ ...+=.+.|.+.+..+.. .+|-+.
T Consensus 362 ~~~~~~l~~~d~vilt~~~~~~e~~~~~~~~~~L~~~~~~~-~~~~~~--~~-~~~~~~~al~~~~~~~d~~~il~~GSl 437 (809)
T PRK14573 362 DSFPSAFQDADEVILTDVYSAGEEPEDSISYQKLAEAISQS-SIVKCT--YV-PFHEIQRYLEQSIRVHDVCVSLGAGNI 437 (809)
T ss_pred HHHHHHHHHCCEEEECCccCCCCCCCCCCCHHHHHHHHhhc-CCCcee--ec-CHHHHHHHHHHhcCCCCEEEEECCCCH
Confidence 77654211466533332332323 35589999999999643 222221 22 334555566555543443 344454
Q ss_pred CchHHHHH
Q psy11447 344 TEIPQILQ 351 (360)
Q Consensus 344 ~~lP~il~ 351 (360)
--+..+++
T Consensus 438 y~i~~~~~ 445 (809)
T PRK14573 438 YTLGEALK 445 (809)
T ss_pred HHHHHHHH
Confidence 45555544
No 150
>cd07127 ALDH_PAD-PaaZ Phenylacetic acid degradation proteins PaaZ (Escherichia coli) and PaaN (Pseudomonas putida)-like. Phenylacetic acid degradation (PAD) proteins PaaZ (Escherichia coli) and PaaN (Pseudomonas putida) are putative aromatic ring cleavage enzymes of the aerobic PA catabolic pathway. PaaZ mutants were defective for growth with PA as a sole carbon source due to interruption of the putative ring opening system. This CD is limited to bacterial monofunctional enzymes.
Probab=22.14 E-value=5.8e+02 Score=27.56 Aligned_cols=72 Identities=19% Similarity=0.301 Sum_probs=49.5
Q ss_pred cCCCeEEEEEecCCccccCCCh-hHHHHHHhcCcccccEEEEEecCchHHHHHHHh---------hCCCCce-EEecCCC
Q psy11447 276 QHDDSILIVLSDANLDRYGIAP-REMGEALNAEAASVRAYAIFIGSLGDQAKRLTQ---------ELPAGRG-FVCMDLT 344 (360)
Q Consensus 276 daDe~fVivlSDANL~RYgI~p-~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~---------~Lp~Gr~-fv~~dt~ 344 (360)
..+...|-++.. .| ..+++.|..+ |.| ..+.|.|| -.-..+|.+ +|. |+. .|+.++.
T Consensus 251 GlP~gvv~~v~g--------~~~~~~~~~L~~~-p~v-~~I~FTGS-~~~G~~i~~~a~~~~v~~ElG-Gkn~~IV~~dA 318 (549)
T cd07127 251 GFDPNLVTLAAD--------TPEEPIAQTLATR-PEV-RIIDFTGS-NAFGDWLEANARQAQVYTEKA-GVNTVVVDSTD 318 (549)
T ss_pred CcCcccEEEEeC--------CCcHHHHHHHHhC-CCC-CEEEEECC-HHHHHHHHHHhccCcEEEecC-CcCeEEECCCC
Confidence 456666666653 12 4678999997 999 67899999 344454544 443 554 4566778
Q ss_pred chHHHHHHHHHhccC
Q psy11447 345 EIPQILQQIFSASLL 359 (360)
Q Consensus 345 ~lP~il~qIf~s~~l 359 (360)
+|....+.|..|.++
T Consensus 319 Dl~~aa~~i~~~~f~ 333 (549)
T cd07127 319 DLKAMLRNLAFSLSL 333 (549)
T ss_pred CHHHHHHHHHHHHHc
Confidence 999999998877654
No 151
>PF02875 Mur_ligase_C: Mur ligase family, glutamate ligase domain This Prosite entry is a subset of the Pfam family.; InterPro: IPR004101 The bacterial cell wall provides strength and rigidity to counteract internal osmotic pressure, and protection against the environment. The peptidoglycan layer gives the cell wall its strength, and helps maintain the overall shape of the cell. The basic peptidoglycan structure of both Gram-positive and Gram-negative bacteria is comprised of a sheet of glycan chains connected by short cross-linking polypeptides. Biosynthesis of peptidoglycan is a multi-step (11-12 steps) process comprising three main stages: (1) formation of UDP-N-acetylmuramic acid (UDPMurNAc) from N-acetylglucosamine (GlcNAc). (2) addition of a short polypeptide chain to the UDPMurNAc. (3) addition of a second GlcNAc to the disaccharide-pentapeptide building block and transport of this unit through the cytoplasmic membrane and incorporation into the growing peptidoglycan layer. Stage two involves four key Mur ligase enzymes: MurC (6.3.2.8 from EC) [], MurD (6.3.2.9 from EC) [], MurE (6.3.2.13 from EC) [] and MurF (6.3.2.10 from EC) []. These four Mur ligases are responsible for the successive additions of L-alanine, D-glutamate, meso-diaminopimelate or L-lysine, and D-alanyl-D-alanine to UDP-N-acetylmuramic acid. All four Mur ligases are topologically similar to one another, even though they display low sequence identity. They are each composed of three domains: an N-terminal Rossmann-fold domain responsible for binding the UDPMurNAc substrate; a central domain (similar to ATP-binding domains of several ATPases and GTPases); and a C-terminal domain (similar to dihydrofolate reductase fold) that appears to be associated with binding the incoming amino acid. The conserved sequence motifs found in the four Mur enzymes also map to other members of the Mur ligase family, including folylpolyglutamate synthetase, cyanophycin synthetase and the capB enzyme from Bacillales []. This entry represents the C-terminal domain from all four stage 2 Mur enzymes: UDP-N-acetylmuramate-L-alanine ligase (MurC), UDP-N-acetylmuramoylalanine-D-glutamate ligase (MurD), UDP-N-acetylmuramoylalanyl-D-glutamate-2,6-diaminopimelate ligase (MurE), and UDP-N-acetylmuramoyl-tripeptide-D-alanyl-D-alanine ligase (MurF). This entry also includes the C-terminal domain of folylpolyglutamate synthase that transfers glutamate to folylpolyglutamate and cyanophycin synthetase that catalyses the biosynthesis of the cyanobacterial reserve material multi-L-arginyl-poly-L-aspartate (cyanophycin) []. The C-terminal domain is almost always associated with the cytoplasmic peptidoglycan synthetases, N-terminal domain (see IPR000713 from INTERPRO).; GO: 0005524 ATP binding, 0016874 ligase activity, 0009058 biosynthetic process; PDB: 2Y68_A 3UAG_A 4UAG_A 2UAG_A 1E0D_A 2XPC_A 2WJP_A 2VTE_A 2Y67_A 1EEH_A ....
Probab=21.85 E-value=2.1e+02 Score=22.45 Aligned_cols=59 Identities=22% Similarity=0.244 Sum_probs=34.7
Q ss_pred CcchHHHHHHHHHHHHhhcccCCCeEEEEEec-CCccccCCC-hhHHHHHHhcCcccccEEEEEecC
Q psy11447 256 GDYTLESIRASIQDLTTECAQHDDSILIVLSD-ANLDRYGIA-PREMGEALNAEAASVRAYAIFIGS 320 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~daDe~fVivlSD-ANL~RYgI~-p~~l~~~l~~~~p~V~a~~IfIgs 320 (360)
|=|+.+|++.+++.+.+. -.+..+++|+.+ .+|...+.. ...++.++... .-.+|+.|.
T Consensus 20 ~ahNp~s~~a~l~~l~~~--~~~~~~i~V~G~~~d~g~~~~~~~~~~~~~~~~~----~d~vi~~~~ 80 (91)
T PF02875_consen 20 YAHNPDSIRALLEALKEL--YPKGRIIAVFGAMGDLGSKDKDFHEEIGELAAQL----ADVVILTGD 80 (91)
T ss_dssp T--SHHHHHHHHHHHHHH--CTTSEEEEEEEEBTT-HTSHHHCHHHHHHHHTTC----SSEEEEETS
T ss_pred CCCCHHHHHHHHHHHHHh--ccCCcEEEEEccccccccccHHHHHHHHHHHHhc----CCEEEEcCC
Confidence 347889999999998754 456788888886 344333222 23566666543 333555543
No 152
>PF09936 Methyltrn_RNA_4: SAM-dependent RNA methyltransferase; InterPro: IPR019230 This entry contains proteins that have no known function. They are found as separate proteins and as a C-terminal domain to tRNA (guanine-N(1)-)-methyltransferases to which they are structurally related. ; PDB: 3DCM_X.
Probab=21.69 E-value=5.3e+02 Score=24.59 Aligned_cols=84 Identities=20% Similarity=0.261 Sum_probs=47.3
Q ss_pred chHHHHHHHHHHHhhccccCCcc------------hHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHH
Q psy11447 236 NKQRLEVIKIMHAHAQFCASGDY------------TLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEA 303 (360)
Q Consensus 236 ~keRl~vl~~M~aH~Qfc~sGD~------------Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~ 303 (360)
-++..++++++..|=|--+-+.+ .++.++.|++++.+. + -..-.+|-+||...---|+..++.+.
T Consensus 51 l~~Q~~l~~ril~hW~~G~G~~yNp~R~eAl~~v~~~~sle~a~~~I~~~--~-G~~P~~v~TsAr~~~~~is~~~lr~~ 127 (185)
T PF09936_consen 51 LEAQRELAERILGHWQEGYGAEYNPDRKEALSLVRVVDSLEEAIEDIEEE--E-GKRPLLVATSARKYPNTISYAELRRM 127 (185)
T ss_dssp -HHHHHHHHHHHHHHHTSGGGGT-SSSHHHHTTEEEESSHHHHHHHHHHH--H-SS--EEEE--SS--SS-B-HHHHHHH
T ss_pred hHHHHHHHHHHHHhcccCCCcCcCcCHHHHHhHhccHhhHHHHHHHHHHH--h-CCCCEEEEecCcCCCCCcCHHHHHHH
Confidence 45667788888888776555542 244556777777654 2 34678889999987788999999999
Q ss_pred H-hcCcccccEEEEEecCchHHHH
Q psy11447 304 L-NAEAASVRAYAIFIGSLGDQAK 326 (360)
Q Consensus 304 l-~~~~p~V~a~~IfIgslgdqA~ 326 (360)
| +.+ +-++|..|.---=++
T Consensus 128 l~~~~----~P~LllFGTGwGL~~ 147 (185)
T PF09936_consen 128 LEEED----RPVLLLFGTGWGLAP 147 (185)
T ss_dssp HHH------S-EEEEE--TT---H
T ss_pred HhccC----CeEEEEecCCCCCCH
Confidence 9 444 678888887433333
No 153
>PRK03806 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional
Probab=21.69 E-value=4.1e+02 Score=26.80 Aligned_cols=78 Identities=15% Similarity=0.161 Sum_probs=47.7
Q ss_pred chHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCce
Q psy11447 258 YTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRG 337 (360)
Q Consensus 258 ~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~ 337 (360)
|+.+|++.|++.+.. +.+.++|+.. .. -+.....|.+.+... +. .++++|+- ++.|++.+. +..
T Consensus 322 ~n~~a~~~al~~l~~-----~~~~i~IlG~--~~-k~~d~~~l~~~l~~~---~~-~v~~~g~~---~~~l~~~~~-~~~ 385 (438)
T PRK03806 322 TNVGSTEAALNGLHV-----DGTLHLLLGG--DG-KSADFSPLARYLNGD---NI-RLYCFGRD---GAQLAALRP-EVS 385 (438)
T ss_pred CCHHHHHHHHHhCcc-----CCcEEEEECC--cC-CCCCHHHHHHHHHhh---Cc-EEEEECCC---HHHHHHHhh-cce
Confidence 789999999998852 1367777775 21 245667788887532 23 68899874 445555443 223
Q ss_pred EEecCCCchHHHHHHHH
Q psy11447 338 FVCMDLTEIPQILQQIF 354 (360)
Q Consensus 338 fv~~dt~~lP~il~qIf 354 (360)
+++ .+++..++.+.
T Consensus 386 ~~~---~~~~~av~~a~ 399 (438)
T PRK03806 386 QLT---ETMEQAMRLIA 399 (438)
T ss_pred Eec---CCHHHHHHHHH
Confidence 333 34666665554
No 154
>cd08613 GDPD_GDE4_like_1 Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial homologs of mammalian glycerophosphodiester phosphodiesterase GDE4. This subfamily corresponds to the glycerophosphodiester phosphodiesterase domain (GDPD) present in uncharacterized bacterial homologs of mammalian GDE4, a transmembrane protein whose cellular function has not been elucidated yet.
Probab=21.64 E-value=1.8e+02 Score=29.28 Aligned_cols=36 Identities=25% Similarity=0.290 Sum_probs=26.2
Q ss_pred CcchHHHHHHHHHHHHhhcccCC-----CeEEEEEecCCccc
Q psy11447 256 GDYTLESIRASIQDLTTECAQHD-----DSILIVLSDANLDR 292 (360)
Q Consensus 256 GD~Tlea~~~ai~~~a~~~~daD-----e~fVivlSDANL~R 292 (360)
--||++|.+.|++.=+. -.|.| |.-+||+-|++|+|
T Consensus 59 pENTl~Af~~A~~~Gad-~IE~DV~lTkDg~lVV~HD~tL~R 99 (309)
T cd08613 59 LENTIASMQAAFDAGAD-VVELDVHPTKDGEFAVFHDWTLDC 99 (309)
T ss_pred CchHHHHHHHHHHcCCC-EEEEEEEEccCCeEEEEecCcccc
Confidence 47999999999875221 11222 67788999999987
No 155
>cd06534 ALDH-SF NAD(P)+-dependent aldehyde dehydrogenase superfamily. The aldehyde dehydrogenase superfamily (ALDH-SF) of NAD(P)+-dependent enzymes, in general, oxidize a wide range of endogenous and exogenous aliphatic and aromatic aldehydes to their corresponding carboxylic acids and play an important role in detoxification. Besides aldehyde detoxification, many ALDH isozymes possess multiple additional catalytic and non-catalytic functions such as participating in metabolic pathways, or as binding proteins, or osmoregulants, to mention a few. The enzyme has three domains, a NAD(P)+ cofactor-binding domain, a catalytic domain, and a bridging domain; and the active enzyme is generally either homodimeric or homotetrameric. The catalytic mechanism is proposed to involve cofactor binding, resulting in a conformational change and activation of an invariant catalytic cysteine nucleophile. The cysteine and aldehyde substrate form an oxyanion thiohemiacetal intermediate resulting in hydri
Probab=21.24 E-value=7.8e+02 Score=24.03 Aligned_cols=58 Identities=19% Similarity=0.261 Sum_probs=38.8
Q ss_pred hHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCC----------CceEE-ecCCCchHHHHHHHHHhcc
Q psy11447 298 REMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPA----------GRGFV-CMDLTEIPQILQQIFSASL 358 (360)
Q Consensus 298 ~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~----------Gr~fv-~~dt~~lP~il~qIf~s~~ 358 (360)
.++.+.|..+ |.|. .|+|.|| -+-+..+.+.++. |++++ +..+.++....++|..+.+
T Consensus 160 ~~~~~~l~~~-~~vd-~v~~tGs-~~~~~~v~~~~~~~~~~~~~e~~g~~~~iV~~~ad~~~aa~~i~~~~~ 228 (367)
T cd06534 160 DEVGAALLSH-PRVD-KISFTGS-TAVGKAIMKAAAENLKPVTLELGGKSPVIVDEDADLDAAVEGAVFGAF 228 (367)
T ss_pred hhHHHHHhcC-CCcC-EEEEECC-HHHHHHHHHHHhcCCCeEEEEcCCCCeEEECCCCCHHHHHHHHHHHHH
Confidence 3567777776 7654 6899998 4445555554332 66665 4555789999999887754
No 156
>KOG4384|consensus
Probab=20.77 E-value=48 Score=34.29 Aligned_cols=46 Identities=22% Similarity=0.108 Sum_probs=35.0
Q ss_pred hHHHHHHHhhccc--cccccCCc-hhh----------------------HHHHhhccccceeEeeeeccCC
Q psy11447 175 NKQRLEVIKVHHN--NLRNTGSD-LGT----------------------SLEAFQDFDQRISLDIIGHSGE 220 (360)
Q Consensus 175 ~k~RL~ei~m~~~--~lrfn~~D-L~r----------------------vMEaf~g~e~k~~ydivGHSGd 220 (360)
+.+=|+-|+|.+| .|=+|||| |++ +|+|.+.+.+.-.+-++=-.++
T Consensus 218 ~~ewL~~i~le~y~~~~L~nGYd~le~~k~i~e~dL~~lgI~nP~Hr~kLL~av~~~~e~d~~~~~~~~~e 288 (361)
T KOG4384|consen 218 LEEWLRRIGLEEYIETLLENGYDTLEDLKDITEEDLEELGIDNPDHRKKLLSAVELLKEIDSGSEQVGENE 288 (361)
T ss_pred HHHHHHHhhHHHHHHHHHHcchHHHHHHHhccHHHHHHhCCCCHHHHHHHHHHHHHHHhccccccccCCCc
Confidence 8888999999999 57799999 877 8888888866655554433333
No 157
>cd03859 M14_CPT Peptidase M14-like domain of carboxypeptidase (CP) T (CPT), CPT belongs to the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPT has moderate similarity to CPA and CPB, and exhibits dual-substrate specificity by cleaving C-terminal hydrophobic amino acid residues like CPA and C-terminal positively charged residues like CPB. CPA and CPB are M14 family peptidases but do not belong to this CPT group. The substrate specificity difference between CPT and CPA and CPB is ascribed to a few amino acid substitutions at the substrate-binding pocket while the spatial organization of the binding site remains the same as in all Zn-CPs. CPT has increased thermal stability in presence of Ca2+ ions, and two disulfide bridges which give an additional stabilization factor.
Probab=20.65 E-value=70 Score=31.18 Aligned_cols=44 Identities=18% Similarity=0.089 Sum_probs=34.6
Q ss_pred ccCCc-hhhHHHHhh-ccccceeEeeeeccCCCCccccccCCCCCC
Q psy11447 191 NTGSD-LGTSLEAFQ-DFDQRISLDIIGHSGETHSIPFVTSNNLPR 234 (360)
Q Consensus 191 fn~~D-L~rvMEaf~-g~e~k~~ydivGHSGd~~~i~lV~~~~~Pk 234 (360)
|-.|| +...|+.+. .+.+..+...+|.|-+|..|..++.++.|.
T Consensus 4 y~~y~ei~~~l~~l~~~~p~~v~~~~iG~S~eGR~i~~l~i~~~~~ 49 (295)
T cd03859 4 YHNYLEMVDELNAAAAAYPNLTKVKSIGKSYEGRDIIAVKISDNVA 49 (295)
T ss_pred CCCHHHHHHHHHHHHHHCCCceEEEeeeecCCCCeEEEEEEecCCC
Confidence 33455 566667664 678889999999999999999999887663
No 158
>cd05160 DEDDy_DNA_polB_exo DEDDy 3'-5' exonuclease domain of family-B DNA polymerases. The 3'-5' exonuclease domain of family-B DNA polymerases. This domain has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The exonuclease domain of family B polymerase also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members include Escherichia coli DNA polymerase II, some eubacterial phage DNA polymerases, nuclear replicative
Probab=20.49 E-value=36 Score=30.31 Aligned_cols=30 Identities=20% Similarity=0.390 Sum_probs=22.3
Q ss_pred HHHHhhccccceeEe-eeeccCCCCcccccc
Q psy11447 199 SLEAFQDFDQRISLD-IIGHSGETHSIPFVT 228 (360)
Q Consensus 199 vMEaf~g~e~k~~yd-ivGHSGd~~~i~lV~ 228 (360)
++++|-.+=+.+.-| ||||+|++.++|++.
T Consensus 66 lL~~f~~~i~~~dpdiivg~N~~~FD~~~L~ 96 (199)
T cd05160 66 LLKRFFDIIREYDPDILTGYNIDDFDLPYLL 96 (199)
T ss_pred HHHHHHHHHHhcCCCEEEEeccCCCcHHHHH
Confidence 666666654455556 899999999999764
No 159
>PF12983 DUF3867: Protein of unknown function (DUF3867); InterPro: IPR024218 This entry represents a family of functionally uncharacterised proteins that are found in bacteria. Proteins in this family are approximately 190 amino acids in length.
Probab=20.17 E-value=48 Score=31.37 Aligned_cols=17 Identities=29% Similarity=0.857 Sum_probs=15.7
Q ss_pred ccccCCChhHHHHHHhc
Q psy11447 290 LDRYGIAPREMGEALNA 306 (360)
Q Consensus 290 L~RYgI~p~~l~~~l~~ 306 (360)
|+|||+.|+.+..-|..
T Consensus 68 mERYGfd~~~iE~q~K~ 84 (186)
T PF12983_consen 68 MERYGFDPSEIEKQMKS 84 (186)
T ss_pred HHHhCCCHHHHHHHHHH
Confidence 68999999999999977
No 160
>PRK14338 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=20.08 E-value=8.3e+02 Score=25.42 Aligned_cols=61 Identities=18% Similarity=0.317 Sum_probs=39.8
Q ss_pred HHHHHHHHhhccccCC-------cchHHHHHHHHHHHHhhcccCCCeEEEEEecCCccccCCC---hhHHHHHHhc
Q psy11447 241 EVIKIMHAHAQFCASG-------DYTLESIRASIQDLTTECAQHDDSILIVLSDANLDRYGIA---PREMGEALNA 306 (360)
Q Consensus 241 ~vl~~M~aH~Qfc~sG-------D~Tlea~~~ai~~~a~~~~daDe~fVivlSDANL~RYgI~---p~~l~~~l~~ 306 (360)
+|.+-+--+--||... ..+++-+..-|+.+.+. ..--|+|+|.|+..||.+ ...|.++|..
T Consensus 160 ~I~rGC~~~CsfC~~p~~~G~~rsr~~e~Il~ei~~l~~~-----G~keI~l~g~~~~~yG~d~~~~~~l~~Ll~~ 230 (459)
T PRK14338 160 PIIYGCNMSCSYCVIPLRRGRERSRPLAEIVEEVRRIAAR-----GAKEITLLGQIVDSYGHDLPGRPDLADLLEA 230 (459)
T ss_pred EcccCCCCCCCcCCeeccCCCCccCCHHHHHHHHHHHHHC-----CCeEEEEeeecCCCcccccCChHHHHHHHHH
Confidence 4555555667788753 45677777777777643 234578899999999963 2345555543
No 161
>PRK00066 ldh L-lactate dehydrogenase; Reviewed
Probab=20.06 E-value=2.8e+02 Score=27.44 Aligned_cols=87 Identities=22% Similarity=0.404 Sum_probs=55.9
Q ss_pred CCccccccCCCCCCCchHHHHHHHHHHHhhccccCCcchHHHHHHHHHHHHhhcccCCCeEEEEEecCC-------cccc
Q psy11447 221 THSIPFVTSNNLPRDNKQRLEVIKIMHAHAQFCASGDYTLESIRASIQDLTTECAQHDDSILIVLSDAN-------LDRY 293 (360)
Q Consensus 221 ~~~i~lV~~~~~Pk~~keRl~vl~~M~aH~Qfc~sGD~Tlea~~~ai~~~a~~~~daDe~fVivlSDAN-------L~RY 293 (360)
+.+|-.+..|.|.+....|++.++. +.+-++.-++++++. +-+.+||++|... .+--
T Consensus 73 ~adivIitag~~~k~g~~R~dll~~-------------N~~i~~~i~~~i~~~---~~~~~vivvsNP~d~~~~~~~k~s 136 (315)
T PRK00066 73 DADLVVITAGAPQKPGETRLDLVEK-------------NLKIFKSIVGEVMAS---GFDGIFLVASNPVDILTYATWKLS 136 (315)
T ss_pred CCCEEEEecCCCCCCCCCHHHHHHH-------------HHHHHHHHHHHHHHh---CCCeEEEEccCcHHHHHHHHHHHh
Confidence 4567788899998888899998876 444455556666653 3357788888522 1223
Q ss_pred CCChhH----------------HHHHHhcCcccccEEEEEecCchHHH
Q psy11447 294 GIAPRE----------------MGEALNAEAASVRAYAIFIGSLGDQA 325 (360)
Q Consensus 294 gI~p~~----------------l~~~l~~~~p~V~a~~IfIgslgdqA 325 (360)
|++|++ +++.|.-+..+|++++| |.-|+-.
T Consensus 137 g~p~~~viG~gt~LDs~R~~~~la~~l~v~~~~V~~~vi--GeHG~s~ 182 (315)
T PRK00066 137 GFPKERVIGSGTSLDSARFRYMLSEKLDVDPRSVHAYII--GEHGDTE 182 (315)
T ss_pred CCCHHHEeecCchHHHHHHHHHHHHHhCCCcccEEEEEE--ecCCCcc
Confidence 555554 34455666566888754 8877653
No 162
>PRK04405 prsA peptidylprolyl isomerase; Provisional
Probab=20.01 E-value=1e+02 Score=30.26 Aligned_cols=44 Identities=16% Similarity=0.333 Sum_probs=33.7
Q ss_pred CCChhHHHHHHhcCcccccEEEEEecCchHHHHHHHhhCCCCceE
Q psy11447 294 GIAPREMGEALNAEAASVRAYAIFIGSLGDQAKRLTQELPAGRGF 338 (360)
Q Consensus 294 gI~p~~l~~~l~~~~p~V~a~~IfIgslgdqA~~l~~~Lp~Gr~f 338 (360)
-||++++........|+|++.-|+|.. ..+|+.+.+.|-.|-.|
T Consensus 128 ~Vtd~ei~~~y~~~~~~~~v~hIlv~~-~~~A~~v~~~l~~G~~F 171 (298)
T PRK04405 128 KVTNSQLKKAWKSYQPKVTVQHILVSK-KSTAETVIKKLKDGKDF 171 (298)
T ss_pred CCCHHHHHHHHHHhhhhEEEEEEEecC-hHHHHHHHHHHHCCCCH
Confidence 366666666666655889999999986 78899888888777555
Done!