Query gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 374 No_of_seqs 175 out of 2058 Neff 9.7 Searched_HMMs 39220 Date Mon May 30 01:23:03 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780934.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK13685 hypothetical protein; 100.0 3.3E-28 8.5E-33 186.5 19.7 190 170-370 87-295 (326) 2 cd01465 vWA_subgroup VWA subgr 100.0 2.9E-27 7.3E-32 181.0 17.0 165 172-358 1-170 (170) 3 cd01467 vWA_BatA_type VWA BatA 100.0 3.7E-27 9.6E-32 180.3 17.3 164 171-354 2-180 (180) 4 cd01461 vWA_interalpha_trypsin 99.9 6.4E-24 1.6E-28 161.0 17.3 162 170-358 1-169 (171) 5 cd01466 vWA_C3HC4_type VWA C3H 99.9 9.2E-24 2.3E-28 160.1 15.1 149 172-349 1-155 (155) 6 cd01475 vWA_Matrilin VWA_Matri 99.9 1.7E-22 4.4E-27 152.5 18.9 180 170-367 1-185 (224) 7 cd01456 vWA_ywmD_type VWA ywmD 99.9 4.5E-23 1.2E-27 156.0 15.6 169 165-352 14-204 (206) 8 cd01480 vWA_collagen_alpha_1-V 99.9 9.1E-22 2.3E-26 148.2 18.6 172 170-358 1-179 (186) 9 cd01470 vWA_complement_factors 99.9 1.8E-21 4.7E-26 146.4 19.3 176 172-359 1-198 (198) 10 cd01474 vWA_ATR ATR (Anthrax T 99.9 3.8E-21 9.8E-26 144.5 19.5 178 170-368 3-184 (185) 11 cd01463 vWA_VGCC_like VWA Volt 99.9 7.7E-22 2E-26 148.6 15.9 166 169-352 11-190 (190) 12 cd01464 vWA_subfamily VWA subf 99.9 2.9E-21 7.5E-26 145.2 15.1 173 171-359 3-175 (176) 13 cd01451 vWA_Magnesium_chelatas 99.9 1.3E-19 3.2E-24 135.5 18.2 166 174-358 3-176 (178) 14 cd01469 vWA_integrins_alpha_su 99.9 6.6E-20 1.7E-24 137.1 16.3 167 172-356 1-176 (177) 15 smart00327 VWA von Willebrand 99.8 2.8E-19 7.2E-24 133.4 17.5 160 171-348 1-165 (177) 16 cd01473 vWA_CTRP CTRP for CS 99.8 1.2E-18 3E-23 129.6 19.5 178 172-367 1-192 (192) 17 cd01471 vWA_micronemal_protein 99.8 8.5E-19 2.2E-23 130.5 18.3 170 172-359 1-183 (186) 18 cd01472 vWA_collagen von Wille 99.8 4.4E-19 1.1E-23 132.2 16.0 157 173-350 2-163 (164) 19 pfam00092 VWA von Willebrand f 99.8 1.5E-18 3.8E-23 129.1 18.5 170 173-361 1-177 (177) 20 cd01450 vWFA_subfamily_ECM Von 99.8 6.9E-19 1.7E-23 131.1 16.4 152 172-343 1-155 (161) 21 TIGR03436 acidobact_VWFA VWFA- 99.8 4.4E-18 1.1E-22 126.3 18.8 182 168-371 50-260 (296) 22 cd01482 vWA_collagen_alphaI-XI 99.8 6.9E-18 1.8E-22 125.1 16.6 157 173-350 2-163 (164) 23 cd01481 vWA_collagen_alpha3-VI 99.8 2.2E-17 5.7E-22 122.1 17.3 159 173-350 2-164 (165) 24 TIGR00868 hCaCC calcium-activa 99.8 3.1E-18 8E-23 127.2 11.7 165 172-363 308-481 (874) 25 cd00198 vWFA Von Willebrand fa 99.8 1.8E-17 4.7E-22 122.6 15.5 151 172-342 1-154 (161) 26 cd01476 VWA_integrin_invertebr 99.8 5.5E-17 1.4E-21 119.7 16.5 153 172-346 1-161 (163) 27 cd01462 VWA_YIEM_type VWA YIEM 99.7 1.6E-15 4.1E-20 111.0 15.4 143 173-340 2-145 (152) 28 COG4245 TerY Uncharacterized p 99.6 8.5E-15 2.2E-19 106.7 13.6 185 171-369 3-188 (207) 29 COG1240 ChlD Mg-chelatase subu 99.6 1.1E-13 2.7E-18 100.2 16.1 173 168-358 75-256 (261) 30 cd01454 vWA_norD_type norD typ 99.6 4.9E-13 1.3E-17 96.2 16.2 153 172-346 1-172 (174) 31 COG4961 TadG Flp pilus assembl 99.5 1.5E-12 3.9E-17 93.3 14.0 68 4-71 8-75 (185) 32 cd01477 vWA_F09G8-8_type VWA F 99.4 8.6E-11 2.2E-15 82.9 15.8 164 168-346 16-191 (193) 33 TIGR02442 Cob-chelat-sub cobal 99.3 5.9E-11 1.5E-15 83.8 13.5 161 170-348 507-687 (688) 34 PRK13406 bchD magnesium chelat 99.3 8.7E-10 2.2E-14 76.9 17.3 168 170-359 400-580 (584) 35 TIGR02031 BchD-ChlD magnesium 99.3 1.1E-10 2.8E-15 82.2 12.3 172 164-352 503-700 (705) 36 KOG2353 consensus 99.2 5.2E-10 1.3E-14 78.2 12.4 181 168-367 222-414 (1104) 37 cd01453 vWA_transcription_fact 99.2 8.5E-09 2.2E-13 71.0 16.8 171 172-360 4-177 (183) 38 cd01457 vWA_ORF176_type VWA OR 99.0 2.3E-08 5.9E-13 68.4 12.9 157 171-340 2-163 (199) 39 COG4548 NorD Nitric oxide redu 98.8 6.9E-08 1.8E-12 65.6 11.4 176 172-365 447-635 (637) 40 COG2425 Uncharacterized protei 98.8 4.3E-08 1.1E-12 66.8 10.3 159 173-360 274-434 (437) 41 pfam11775 CobT_C Cobalamin bio 98.8 6.6E-07 1.7E-11 59.7 15.0 166 170-363 11-216 (220) 42 pfam04056 Ssl1 Ssl1-like. Ssl1 98.5 3.1E-05 7.9E-10 49.8 16.4 174 169-360 50-228 (250) 43 pfam00362 Integrin_beta Integr 98.5 6.2E-05 1.6E-09 48.0 17.3 185 165-365 94-343 (424) 44 KOG2807 consensus 98.4 3.3E-05 8.3E-10 49.6 14.2 175 169-361 58-235 (378) 45 smart00187 INB Integrin beta s 98.4 0.00017 4.4E-09 45.3 17.4 185 165-365 93-342 (423) 46 KOG3768 consensus 98.3 1.2E-05 3E-10 52.3 10.7 191 174-370 4-232 (888) 47 cd01455 vWA_F11C1-5a_type Von 98.3 0.00012 3.1E-09 46.2 15.7 175 174-364 3-188 (191) 48 PRK10997 yieM hypothetical pro 98.2 0.00011 2.8E-09 46.5 13.0 142 173-340 322-465 (484) 49 cd01452 VWA_26S_proteasome_sub 98.2 0.0003 7.7E-09 43.9 15.0 169 173-357 5-182 (187) 50 pfam05762 VWA_CoxE VWA domain 98.1 7.7E-05 2E-09 47.4 11.1 131 169-327 55-188 (223) 51 pfam06707 DUF1194 Protein of u 98.0 0.00049 1.3E-08 42.6 13.9 180 171-367 3-204 (206) 52 COG4655 Predicted membrane pro 98.0 5.7E-06 1.4E-10 54.2 3.0 57 9-65 2-58 (565) 53 pfam04285 DUF444 Protein of un 97.9 0.0012 3E-08 40.4 13.5 164 173-364 248-418 (421) 54 PRK05325 hypothetical protein; 97.7 0.0026 6.6E-08 38.3 13.8 167 172-366 235-411 (414) 55 COG2304 Uncharacterized protei 97.7 0.00035 9E-09 43.5 8.9 160 168-348 34-198 (399) 56 cd01460 vWA_midasin VWA_Midasi 97.6 0.0045 1.1E-07 36.9 16.6 180 169-364 58-258 (266) 57 pfam09967 DUF2201 Predicted me 97.6 0.00031 7.8E-09 43.9 7.1 96 172-302 288-384 (412) 58 KOG1226 consensus 97.5 0.00082 2.1E-08 41.3 8.6 145 166-326 127-330 (783) 59 COG4867 Uncharacterized protei 97.4 0.0056 1.4E-07 36.3 11.5 163 163-357 455-641 (652) 60 KOG2884 consensus 97.3 0.011 2.8E-07 34.5 14.1 169 173-358 5-183 (259) 61 COG4547 CobT Cobalamin biosynt 97.2 0.0012 3E-08 40.3 6.6 81 260-358 520-610 (620) 62 COG5151 SSL1 RNA polymerase II 96.8 0.032 8.2E-07 31.8 12.7 175 170-362 86-267 (421) 63 cd01458 vWA_ku Ku70/Ku80 N-ter 96.7 0.04 1E-06 31.2 12.5 155 173-340 3-187 (218) 64 COG2718 Uncharacterized conser 96.7 0.042 1.1E-06 31.1 13.4 165 172-364 247-417 (423) 65 TIGR01651 CobT cobaltochelatas 96.7 0.0054 1.4E-07 36.4 6.0 83 258-358 502-595 (606) 66 pfam07811 TadE TadE-like prote 96.6 0.008 2E-07 35.4 6.3 42 17-58 1-42 (43) 67 LOAD_ku consensus 96.1 0.097 2.5E-06 29.0 13.1 144 173-329 1-174 (521) 68 TIGR02877 spore_yhbH sporulati 95.9 0.077 2E-06 29.6 8.4 163 171-361 214-391 (392) 69 pfam11443 DUF2828 Domain of un 95.7 0.14 3.5E-06 28.0 11.0 139 172-325 327-470 (524) 70 COG3552 CoxE Protein containin 95.2 0.035 8.9E-07 31.6 4.6 127 171-325 218-347 (395) 71 cd01459 vWA_copine_like VWA Co 95.1 0.21 5.4E-06 26.9 12.3 151 169-339 29-204 (254) 72 KOG1327 consensus 94.9 0.25 6.5E-06 26.5 12.0 151 170-339 284-461 (529) 73 COG5148 RPN10 26S proteasome r 94.7 0.28 7E-06 26.3 12.0 146 173-336 5-153 (243) 74 pfam03731 Ku_N Ku70/Ku80 N-ter 94.4 0.33 8.4E-06 25.8 15.6 187 173-369 1-220 (222) 75 COG3864 Uncharacterized protei 94.2 0.12 3E-06 28.4 5.3 94 172-300 262-355 (396) 76 pfam11265 Med25_VWA Mediator c 93.8 0.45 1.1E-05 25.0 12.1 152 171-326 5-178 (219) 77 pfam07002 Copine Copine. This 92.0 0.79 2E-05 23.5 9.9 118 192-328 10-145 (145) 78 KOG4465 consensus 91.6 0.88 2.3E-05 23.2 8.9 141 171-340 427-581 (598) 79 cd01479 Sec24-like Sec24-like: 91.0 1 2.6E-05 22.9 16.4 174 171-366 3-241 (244) 80 TIGR00873 gnd 6-phosphoglucona 89.7 0.7 1.8E-05 23.8 4.7 46 285-332 62-129 (480) 81 COG3847 Flp Flp pilus assembly 88.4 0.81 2.1E-05 23.5 4.3 29 6-34 3-31 (58) 82 KOG2487 consensus 85.7 2.3 5.9E-05 20.8 6.4 74 284-360 162-237 (314) 83 TIGR00627 tfb4 transcription f 85.2 2.5 6.3E-05 20.6 6.6 87 283-369 151-243 (295) 84 cd01468 trunk_domain trunk dom 84.8 2.6 6.5E-05 20.5 17.1 160 171-351 3-224 (239) 85 COG4726 PilX Tfp pilus assembl 84.0 2.8 7.1E-05 20.3 5.9 55 10-65 7-69 (196) 86 pfam04811 Sec23_trunk Sec23/Se 83.7 2.8 7.3E-05 20.2 17.6 158 171-350 3-223 (241) 87 cd01478 Sec23-like Sec23-like: 75.9 5.2 0.00013 18.7 15.3 46 173-232 5-50 (267) 88 COG1721 Uncharacterized conser 75.0 5.5 0.00014 18.5 8.1 81 167-253 220-300 (416) 89 pfam03850 Tfb4 Transcription f 74.5 5.6 0.00014 18.5 15.6 76 285-362 139-216 (271) 90 COG2984 ABC-type uncharacteriz 72.8 6.2 0.00016 18.2 9.4 83 285-370 157-240 (322) 91 PRK08643 acetoin reductase; Va 69.7 7.3 0.00019 17.8 9.1 22 308-329 163-184 (256) 92 PRK10538 3-hydroxy acid dehydr 69.0 7.5 0.00019 17.7 9.1 52 309-363 159-221 (248) 93 PRK08265 short chain dehydroge 67.9 8 0.0002 17.6 9.1 22 308-329 161-182 (261) 94 PRK05872 short chain dehydroge 67.4 8.1 0.00021 17.5 8.9 21 309-329 166-186 (296) 95 PRK06720 hypothetical protein; 63.0 9.9 0.00025 17.0 6.5 65 309-373 30-99 (169) 96 PRK07523 gluconate 5-dehydroge 60.1 11 0.00028 16.7 7.5 22 308-329 166-187 (251) 97 PRK06138 short chain dehydroge 58.2 12 0.00031 16.5 9.1 22 308-329 164-185 (252) 98 PRK12826 3-ketoacyl-(acyl-carr 57.2 13 0.00032 16.4 9.1 21 309-329 168-188 (253) 99 PRK05717 oxidoreductase; Valid 56.4 13 0.00033 16.3 9.1 19 310-329 170-188 (255) 100 PRK07454 short chain dehydroge 55.8 13 0.00034 16.2 9.1 55 309-364 167-223 (241) 101 TIGR02472 sucr_P_syn_N sucrose 54.7 13 0.00032 16.4 2.9 38 333-371 382-419 (445) 102 PRK08277 D-mannonate oxidoredu 54.2 14 0.00036 16.1 8.7 22 308-329 185-206 (278) 103 PRK06139 short chain dehydroge 54.1 14 0.00036 16.1 8.3 33 194-226 154-186 (324) 104 pfam02060 ISK_Channel Slow vol 53.9 14 0.00036 16.1 4.6 39 9-47 32-70 (129) 105 TIGR02921 PEP_integral PEP-CTE 53.5 14 0.00037 16.0 6.7 16 172-187 637-652 (952) 106 TIGR03206 benzo_BadH 2-hydroxy 52.8 15 0.00038 16.0 8.7 22 308-329 163-184 (250) 107 PRK05867 short chain dehydroge 52.6 15 0.00038 15.9 7.3 22 308-329 172-193 (253) 108 PRK12829 short chain dehydroge 52.4 15 0.00038 15.9 9.2 22 308-329 171-192 (264) 109 PRK07109 short chain dehydroge 51.2 16 0.0004 15.8 8.6 34 194-227 156-190 (338) 110 cd03132 GATase1_catalase Type 49.8 17 0.00042 15.7 5.3 59 292-354 66-129 (142) 111 pfam04964 Flp_Fap Flp/Fap pili 48.3 17 0.00044 15.5 5.8 29 11-39 1-29 (47) 112 PRK07576 short chain dehydroge 48.0 18 0.00045 15.5 9.0 21 309-329 168-188 (260) 113 PRK06124 gluconate 5-dehydroge 47.4 18 0.00046 15.4 8.7 22 308-329 174-195 (259) 114 PRK12384 sorbitol-6-phosphate 47.1 18 0.00046 15.4 9.2 22 308-329 165-186 (259) 115 PRK07067 sorbitol dehydrogenas 47.0 18 0.00047 15.4 9.1 22 308-329 163-184 (256) 116 PRK07775 short chain dehydroge 46.8 18 0.00047 15.4 9.2 20 309-328 171-190 (275) 117 COG1681 FlaB Archaeal flagelli 46.1 19 0.00048 15.3 2.7 39 14-52 1-41 (209) 118 PRK09186 flagellin modificatio 44.4 20 0.00051 15.2 9.0 53 308-363 178-236 (255) 119 PRK07326 short chain dehydroge 44.1 20 0.00052 15.1 9.5 53 309-363 164-216 (235) 120 PRK05650 short chain dehydroge 43.9 20 0.00052 15.1 8.8 18 311-328 163-180 (270) 121 PRK12936 3-ketoacyl-(acyl-carr 42.4 22 0.00055 15.0 7.0 22 308-329 163-184 (245) 122 PRK09242 tropinone reductase; 42.0 22 0.00056 14.9 7.4 22 308-329 172-193 (258) 123 COG1991 Uncharacterized conser 41.9 8.1 0.00021 17.5 0.2 35 6-40 3-37 (131) 124 PRK06181 short chain dehydroge 41.6 22 0.00057 14.9 8.4 21 309-329 162-182 (263) 125 PRK12828 short chain dehydroge 41.4 22 0.00057 14.9 9.5 54 308-362 165-218 (239) 126 pfam11411 DNA_ligase_IV DNA li 41.3 21 0.00053 15.1 2.3 22 342-363 14-35 (36) 127 PRK07479 consensus 41.0 23 0.00058 14.8 8.5 21 308-328 166-186 (252) 128 PRK07677 short chain dehydroge 40.8 23 0.00058 14.8 8.4 20 310-329 166-186 (254) 129 PRK07825 short chain dehydroge 40.5 23 0.00059 14.8 8.3 19 310-328 163-181 (273) 130 PRK07831 short chain dehydroge 39.9 24 0.0006 14.7 7.3 56 308-363 180-243 (261) 131 PRK05866 short chain dehydroge 38.7 25 0.00063 14.6 7.9 21 309-329 204-224 (290) 132 cd06325 PBP1_ABC_uncharacteriz 38.4 25 0.00064 14.6 6.8 30 290-324 187-216 (281) 133 TIGR01382 PfpI intracellular p 38.1 25 0.00064 14.6 4.7 21 345-365 168-188 (189) 134 PRK08340 glucose-1-dehydrogena 37.9 25 0.00065 14.6 8.8 21 309-329 163-183 (259) 135 PRK08217 fabG 3-ketoacyl-(acyl 37.7 26 0.00065 14.5 8.5 51 309-362 175-235 (253) 136 PRK07774 short chain dehydroge 37.5 26 0.00066 14.5 8.6 21 309-329 167-187 (250) 137 PRK07666 fabG 3-ketoacyl-(acyl 37.1 26 0.00067 14.5 9.0 50 308-358 166-216 (238) 138 pfam04392 ABC_sub_bind ABC tra 36.2 27 0.00069 14.4 7.0 32 289-325 185-216 (292) 139 PRK05653 fabG 3-ketoacyl-(acyl 35.8 27 0.0007 14.4 8.8 52 309-363 166-227 (246) 140 COG5242 TFB4 RNA polymerase II 35.8 27 0.0007 14.4 15.0 44 315-360 179-224 (296) 141 PRK07814 short chain dehydroge 35.7 28 0.0007 14.3 8.4 19 310-329 173-191 (263) 142 PRK08324 short chain dehydroge 35.6 28 0.00071 14.3 9.6 81 287-373 419-503 (676) 143 pfam06415 iPGM_N BPG-independe 35.1 28 0.00072 14.3 5.5 19 322-340 172-190 (223) 144 PRK13394 3-hydroxybutyrate deh 35.1 28 0.00072 14.3 6.7 22 308-329 168-189 (262) 145 PRK06113 7-alpha-hydroxysteroi 33.0 31 0.00078 14.1 9.5 22 308-329 170-191 (255) 146 PRK12938 acetyacetyl-CoA reduc 33.0 31 0.00078 14.1 8.2 22 308-329 164-185 (246) 147 PRK12429 3-hydroxybutyrate deh 32.4 31 0.0008 14.0 8.7 21 309-329 165-185 (258) 148 pfam10526 NADH_ub_rd_NUML NADH 31.7 32 0.00082 13.9 2.6 25 26-50 12-36 (80) 149 PRK06949 short chain dehydroge 31.3 33 0.00083 13.9 8.5 22 308-329 177-198 (258) 150 TIGR01963 PHB_DH 3-hydroxybuty 31.3 33 0.00083 13.9 5.9 22 235-256 111-132 (258) 151 PRK06484 short chain dehydroge 30.9 33 0.00085 13.9 9.5 65 309-373 288-354 (530) 152 PRK06057 short chain dehydroge 30.8 33 0.00085 13.9 8.9 22 308-329 165-186 (255) 153 PRK07776 consensus 30.6 34 0.00085 13.8 9.6 20 309-329 168-187 (252) 154 PRK08213 gluconate 5-dehydroge 30.1 34 0.00087 13.8 8.5 22 308-329 177-198 (259) 155 PRK07231 fabG 3-ketoacyl-(acyl 29.9 34 0.00088 13.8 9.5 22 308-329 165-186 (250) 156 PRK08267 short chain dehydroge 29.5 35 0.00089 13.7 9.0 21 309-329 161-181 (258) 157 PRK06346 consensus 29.4 35 0.0009 13.7 8.0 22 308-329 166-187 (251) 158 PRK07097 gluconate 5-dehydroge 29.0 36 0.00091 13.7 8.9 22 308-329 170-191 (265) 159 PRK06172 short chain dehydroge 28.5 36 0.00093 13.6 8.3 22 308-329 168-189 (253) 160 PRK07890 short chain dehydroge 28.5 36 0.00093 13.6 8.8 22 308-329 165-186 (258) 161 PRK06227 consensus 28.4 37 0.00094 13.6 8.6 22 308-329 165-186 (256) 162 PRK06463 fabG 3-ketoacyl-(acyl 28.3 37 0.00094 13.6 8.7 21 309-329 163-183 (254) 163 PRK08085 gluconate 5-dehydroge 28.2 37 0.00094 13.6 8.6 22 308-329 169-190 (254) 164 TIGR01822 2am3keto_CoA 2-amino 27.5 38 0.00097 13.5 3.6 59 291-365 327-392 (395) 165 PRK06483 short chain dehydroge 27.3 38 0.00098 13.5 8.6 49 310-362 161-217 (236) 166 PRK06182 short chain dehydroge 26.9 39 0.001 13.4 9.3 19 310-328 159-177 (273) 167 PRK07063 short chain dehydroge 26.3 40 0.001 13.4 8.8 22 308-329 168-189 (259) 168 TIGR01371 met_syn_B12ind 5-met 25.5 41 0.0011 13.3 2.1 69 301-373 667-748 (778) 169 TIGR02764 spore_ybaN_pdaB poly 24.9 43 0.0011 13.2 4.0 37 290-326 159-195 (198) 170 KOG4169 consensus 24.7 43 0.0011 13.2 6.1 25 314-339 169-193 (261) 171 PRK05693 short chain dehydroge 24.5 43 0.0011 13.2 9.1 16 313-328 159-174 (274) 172 PRK12744 short chain dehydroge 24.1 44 0.0011 13.1 10.1 22 308-329 170-191 (257) 173 PRK07035 short chain dehydroge 23.3 46 0.0012 13.0 8.5 53 308-363 169-233 (252) 174 PRK05958 8-amino-7-oxononanoat 22.0 48 0.0012 12.9 3.8 56 305-365 328-384 (387) 175 PRK05786 fabG 3-ketoacyl-(acyl 21.6 49 0.0013 12.8 8.6 22 308-329 161-182 (238) 176 PRK12827 short chain dehydroge 20.8 51 0.0013 12.7 9.6 22 308-329 171-192 (251) 177 PRK07832 short chain dehydroge 20.5 52 0.0013 12.7 6.3 19 310-328 164-182 (272) 178 PRK07062 short chain dehydroge 20.4 52 0.0013 12.7 9.2 22 308-329 170-191 (265) 179 PRK08415 enoyl-(acyl carrier p 20.3 53 0.0013 12.7 7.5 21 309-329 169-189 (274) 180 COG0362 Gnd 6-phosphogluconate 20.1 53 0.0014 12.6 6.6 10 31-40 6-15 (473) No 1 >PRK13685 hypothetical protein; Provisional Probab=99.97 E-value=3.3e-28 Score=186.52 Aligned_cols=190 Identities=18% Similarity=0.228 Sum_probs=154.9 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHH Q ss_conf 87249999657864511313531379999998876322100236656734899998268505610204786899999860 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~ 249 (374) ...+|++++|.|+||...+ -..+|++.+|..+..|++... ...|+|++.|.+.+....|+|.|+..++..++. T Consensus 87 ~~~~i~l~lD~S~SM~a~D-~~p~Rl~~ak~~~~~fi~~~~------~~driGlv~Fa~~a~~~~plT~D~~~~~~~l~~ 159 (326) T PRK13685 87 NRAVVMLVIDVSQSMRATD-VEPNRLAAAQEAAKQFADQLT------PGINLGLIAFAGTATVLVSPTTNREATKNALDK 159 (326) T ss_pred CCCCEEEEEECCCCCCCCC-CCCCHHHHHHHHHHHHHHHCC------CCCEEEEEEECCCCEECCCCCCCHHHHHHHHHH T ss_conf 7886799998975655878-895689999999999997379------888289999658720148987539999999984 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCH---HHHHHHHHHHHCCCEEEEEE Q ss_conf 133687744420367677643222333310257886564279998068778888806---89999999997898799999 Q gi|254780934|r 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ---QSLYYCNEAKKRGAIVYAIG 326 (374) Q Consensus 250 l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~---~~~~~~~~~k~~gi~IytIg 326 (374) +... .+|.++.++..+.+.+...... ...++..+.+.|||+|||++|.+.... .+...++.+|+.||+||||| T Consensus 160 l~~~-~~taiG~ai~~Al~~l~~~~~~---~~~~~~~~~~~IILLTDG~~n~g~~~~~p~~~~~AA~~A~~~gi~IyTIg 235 (326) T PRK13685 160 LQLA-DRTATGEGIFTALQAIATVGAV---IGGGDTPPPARIVLFSDGKETVPTNPDNPKGAYTAARTAKDQGVPISTIS 235 (326) T ss_pred CCCC-CCCCCCHHHHHHHHHHHHHHHH---CCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEE T ss_conf 6878-8886406899999999863320---14567778867999748997778898873029999999998599489999 Q ss_pred ECCC--------------CCHHHHHHHC--CCCCEEEECCHHHHHHHHHHHHHHHHEEEE Q ss_conf 2482--------------2278899823--898179838989999999999984200166 Q gi|254780934|r 327 IRVI--------------RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 (374) Q Consensus 327 ~~~~--------------~~~~~L~~~A--s~~~~~~a~~~~~L~~~f~~I~~~i~~~~~ 370 (374) +|.+ -|++.|+++| ++|+||.+.|.++|.++|++|.+.|..+.+ T Consensus 236 vGt~~g~~~~~g~~~~~~lDe~~L~~IA~~TGG~yfrA~d~~~L~~Iy~~i~~~i~~~~~ 295 (326) T PRK13685 236 FGTPYGFVEINGQRQPVPVDDETLKKIAQLSGGEFYTAASLEELRAVYATLQQQIGYETI 295 (326) T ss_pred ECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHHHCCEEE T ss_conf 779988435478403456899999999997298799719999999999996333160331 No 2 >cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n Probab=99.96 E-value=2.9e-27 Score=180.97 Aligned_cols=165 Identities=16% Similarity=0.251 Sum_probs=138.8 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHHHHH Q ss_conf 249999657864511313531379999998876322100236656734899998268505610204--786899999860 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKY 249 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~i~~ 249 (374) +||+||+|.||||... |++.+|+++..+++.+.+ ..|++++.|++.+....+++ .+...+...++. T Consensus 1 ldiv~vlD~SGSM~g~------~~~~~k~a~~~~l~~l~~------~dr~~iv~F~~~~~~~~~~~~~~~~~~~~~~i~~ 68 (170) T cd01465 1 LNLVFVIDRSGSMDGP------KLPLVKSALKLLVDQLRP------DDRLAIVTYDGAAETVLPATPVRDKAAILAAIDR 68 (170) T ss_pred CCEEEEEECCCCCCCC------HHHHHHHHHHHHHHHCCC------CCEEEEEEECCCCEECCCCCCHHHHHHHHHHHHC T ss_conf 9199999088688971------999999999999985898------7879999835861551587866679999998743 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCC-CCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 133687744420367677643222333310257886564279998068778888-8068999999999789879999924 Q gi|254780934|r 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 250 l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~-~~~~~~~~~~~~k~~gi~IytIg~~ 328 (374) +.+.| +|+++.++..+++.+.... .++..+.|||||||++|.+. +.......+...++.||+||||||| T Consensus 69 l~~~G-~T~~~~~l~~a~~~~~~~~---------~~~~~~~iillTDG~~~~~~~~~~~~~~~~~~~~~~~i~i~tiGiG 138 (170) T cd01465 69 LTAGG-STAGGAGIQLGYQEAQKHF---------VPGGVNRILLATDGDFNVGETDPDELARLVAQKRESGITLSTLGFG 138 (170) T ss_pred CCCCC-CCCHHHHHHHHHHHHHHCC---------CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 89899-8527799999999998633---------7887506999815885679889899999999987438862489808 Q ss_pred CCCCHHHHHHHC--CCCCEEEECCHHHHHHHH Q ss_conf 822278899823--898179838989999999 Q gi|254780934|r 329 VIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 (374) Q Consensus 329 ~~~~~~~L~~~A--s~~~~~~a~~~~~L~~~f 358 (374) .+.+.++|+.+| ++|+||+++|+++|.++| T Consensus 139 ~~~~~~~L~~iA~~~~G~~~~v~~~~~l~~~f 170 (170) T cd01465 139 DNYNEDLMEAIADAGNGNTAYIDNLAEARKVF 170 (170) T ss_pred CCCCHHHHHHHHHCCCCEEEECCCHHHHHHHC T ss_conf 87999999999975798899849999999639 No 3 >cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if Probab=99.96 E-value=3.7e-27 Score=180.27 Aligned_cols=164 Identities=21% Similarity=0.264 Sum_probs=136.7 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHC Q ss_conf 72499996578645113135313799999988763221002366567348999982685056102047868999998601 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l 250 (374) .+||+||+|.||||.........|++.+|+++..+++... ..|++++.|++.+....|+|.+...++..++.+ T Consensus 2 G~dvvlvlD~SgSM~~~d~~~~~rl~~ak~~~~~~i~~~~-------~drvglv~Fs~~a~~~~plT~d~~~~~~~l~~i 74 (180) T cd01467 2 GRDIMIALDVSGSMLAQDFVKPSRLEAAKEVLSDFIDRRE-------NDRIGLVVFAGAAFTQAPLTLDRESLKELLEDI 74 (180) T ss_pred CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHCC-------CCEEEEEEECCCCEEECCCCCCHHHHHHHHHCC T ss_conf 6279999989847578666785899999999999997199-------975999997287367337665689999998622 Q ss_pred CC--CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 33--6877444203676776432223333102578865642799980687788888068999999999789879999924 Q gi|254780934|r 251 SK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 251 ~~--~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~ 328 (374) .. .+++|+++.++..+.+.|... ....|.|||||||++|.+... +...++.+++.||+||+|||| T Consensus 75 ~~~~~~ggT~i~~al~~a~~~l~~~-----------~~~~~~ivLlTDG~~n~g~~~--~~~~~~~a~~~gi~v~tIGvG 141 (180) T cd01467 75 KIGLAGQGTAIGDAIGLAIKRLKNS-----------EAKERVIVLLTDGENNAGEID--PATAAELAKNKGVRIYTIGVG 141 (180) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHCC-----------CCCCCEEEEEECCCCCCCCCC--HHHHHHHHHHCCCEEEEEEEC T ss_conf 4453236860899999999976424-----------766637999805886678769--999999999769989999977 Q ss_pred CCC-----------CHHHHHHHC--CCCCEEEECCHHHH Q ss_conf 822-----------278899823--89817983898999 Q gi|254780934|r 329 VIR-----------SHEFLRACA--SPNSFYLVENPHSM 354 (374) Q Consensus 329 ~~~-----------~~~~L~~~A--s~~~~~~a~~~~~L 354 (374) .+. +++.|+++| ++|+||++.|+++| T Consensus 142 ~~~~~~~~~~~~~~d~~~L~~iA~~tgG~yy~a~~~~eL 180 (180) T cd01467 142 KSGSGPKPDGSTILDEDSLVEIADKTGGRIFRALDGFEL 180 (180) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCHHHC T ss_conf 898887688876559999999999619979972874649 No 4 >cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix. Probab=99.93 E-value=6.4e-24 Score=161.01 Aligned_cols=162 Identities=20% Similarity=0.219 Sum_probs=128.5 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEC--CCCH---HHHH Q ss_conf 8724999965786451131353137999999887632210023665673489999826850561020--4786---8999 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGV---SHLQ 244 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--t~~~---~~~~ 244 (374) .|.||+||+|.||||... |++.+|+++..+++.+.. ..|++++.|++.+....+. ..+. .... T Consensus 1 ~P~div~viD~SgSM~g~------~l~~ak~a~~~~l~~l~~------~d~~~iv~F~~~~~~~~~~~~~~~~~~~~~a~ 68 (171) T cd01461 1 LPKEVVFVIDTSGSMSGT------KIEQTKEALLTALKDLPP------GDYFNIIGFSDTVEEFSPSSVSATAENVAAAI 68 (171) T ss_pred CCCEEEEEECCCCCCCCH------HHHHHHHHHHHHHHHCCC------CCEEEEEEECCEEEEECCCCEECCHHHHHHHH T ss_conf 984699999179889863------999999999999982998------78799999878065980775307999999999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEE Q ss_conf 99860133687744420367677643222333310257886564279998068778888806899999999978987999 Q gi|254780934|r 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 (374) Q Consensus 245 ~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~Iyt 324 (374) ..++.+.+.| +|+++.++.++++.+.. .....+.|||||||+.+. .......+.++++.+|+||| T Consensus 69 ~~i~~l~~~G-~T~i~~aL~~a~~~l~~-----------~~~~~~~iillTDG~~~~---~~~~~~~~~~~~~~~i~i~t 133 (171) T cd01461 69 EYVNRLQALG-GTNMNDALEAALELLNS-----------SPGSVPQIILLTDGEVTN---ESQILKNVREALSGRIRLFT 133 (171) T ss_pred HHHHCCCCCC-CCHHHHHHHHHHHHHHH-----------CCCCCCEEEEECCCCCCC---HHHHHHHHHHHHCCCCEEEE T ss_conf 8875478899-86699999999998863-----------579861899975788688---68999999997448963999 Q ss_pred EEECCCCCHHHHHHHC--CCCCEEEECCHHHHHHHH Q ss_conf 9924822278899823--898179838989999999 Q gi|254780934|r 325 IGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 (374) Q Consensus 325 Ig~~~~~~~~~L~~~A--s~~~~~~a~~~~~L~~~f 358 (374) ||||.+.+.++|+.+| ++|.||+++++++|.+.+ T Consensus 134 ig~G~~~~~~~L~~iA~~~~G~~~~v~~~~~l~~~~ 169 (171) T cd01461 134 FGIGSDVNTYLLERLAREGRGIARRIYETDDIESQL 169 (171) T ss_pred EEECCCCCHHHHHHHHHCCCCEEEECCCHHHHHHHH T ss_conf 997897999999999972898899889878999976 No 5 >cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, Probab=99.92 E-value=9.2e-24 Score=160.08 Aligned_cols=149 Identities=23% Similarity=0.285 Sum_probs=122.0 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCC----CHHHHHHHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685056102047----868999998 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKI 247 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~----~~~~~~~~i 247 (374) +|++||||+||||... |++.+|.++..+++.+.+ ..|++++.|++.+....|++. ++..++..+ T Consensus 1 ~div~vlD~SGSM~g~------~l~~~k~a~~~~~~~L~~------~d~v~iV~F~~~a~~~~pl~~~~~~~~~~~~~~i 68 (155) T cd01466 1 VDLVAVLDVSGSMAGD------KLQLVKHALRFVISSLGD------ADRLSIVTFSTSAKRLSPLRRMTAKGKRSAKRVV 68 (155) T ss_pred CEEEEEEECCCCCCCH------HHHHHHHHHHHHHHHCCC------CCEEEEEEECCCCEEEECCEECCHHHHHHHHHHH T ss_conf 9399999089898873------899999999999984897------6748999956874262046037999999999987 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 60133687744420367677643222333310257886564279998068778888806899999999978987999992 Q gi|254780934|r 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 (374) Q Consensus 248 ~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~ 327 (374) +.+.+.| +|+++.|+..|.+.+.... ..++.+.|||||||++|.+ ..+.++++.+|+|||||| T Consensus 69 ~~l~~~G-gT~i~~gl~~a~~~l~~~~---------~~~~~~~IiLlTDG~~n~~-------~~~~~~~~~~i~i~tiGi 131 (155) T cd01466 69 DGLQAGG-GTNVVGGLKKALKVLGDRR---------QKNPVASIMLLSDGQDNHG-------AVVLRADNAPIPIHTFGL 131 (155) T ss_pred HCCCCCC-CCCHHHHHHHHHHHHHHCC---------CCCCCEEEEEECCCCCCHH-------HHHHHHHCCCCEEEEEEE T ss_conf 5377688-8726799999999998436---------6898308999826986405-------778998717973999997 Q ss_pred CCCCCHHHHHHHC--CCCCEEEEC Q ss_conf 4822278899823--898179838 Q gi|254780934|r 328 RVIRSHEFLRACA--SPNSFYLVE 349 (374) Q Consensus 328 ~~~~~~~~L~~~A--s~~~~~~a~ 349 (374) |.+.+.++|+.+| ++|.||+++ T Consensus 132 G~~~d~~lL~~iA~~~gG~~~~v~ 155 (155) T cd01466 132 GASHDPALLAFIAEITGGTFSYVK 155 (155) T ss_pred CCCCCHHHHHHHHHCCCCEEEEEC T ss_conf 886789999999976997799949 No 6 >cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands. Probab=99.91 E-value=1.7e-22 Score=152.50 Aligned_cols=180 Identities=19% Similarity=0.323 Sum_probs=148.0 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHHH Q ss_conf 87249999657864511313531379999998876322100236656734899998268505610204--7868999998 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKI 247 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~i 247 (374) +++||+|+||.|+|++.. .++.+|+.+..+++.+.- ..+.+|++++.|+..+...+++. .++..+...| T Consensus 1 gplDlvFllD~S~Svg~~------nF~~~k~Fv~~lv~~f~I---~~~~trVgvv~ys~~~~~~f~l~~~~~k~~l~~aI 71 (224) T cd01475 1 GPTDLVFLIDSSRSVRPE------NFELVKQFLNQIIDSLDV---GPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAV 71 (224) T ss_pred CCEEEEEEEECCCCCCHH------HHHHHHHHHHHHHHHCCC---CCCCEEEEEEEECCCEEEEEECCCCCCHHHHHHHH T ss_conf 974399999488998989------999999999999985687---99852999999658278999668867889999999 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 60133687744420367677643222333310257886564279998068778888806899999999978987999992 Q gi|254780934|r 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 (374) Q Consensus 248 ~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~ 327 (374) +.+.+.|++|+++.|+..+.+.+.......++ ...+.+|++|+||||..+. .....+..+|++||+||+||+ T Consensus 72 ~~i~~~gggT~Tg~AL~~~~~~~f~~~~G~Rp---~~~~vpkvlIviTDG~s~D-----~v~~~A~~lr~~GV~ifaVGV 143 (224) T cd01475 72 RRMEYLETGTMTGLAIQYAMNNAFSEAEGARP---GSERVPRVGIVVTDGRPQD-----DVSEVAAKARALGIEMFAVGV 143 (224) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCC---CCCCCCEEEEEECCCCCCC-----CHHHHHHHHHHCCCEEEEEEC T ss_conf 86361388446999999999972770239987---5568985999971798766-----389999999987988999963 Q ss_pred CCCCCHHHHHHHCCC---CCEEEECCHHHHHHHHHHHHHHHHE Q ss_conf 482227889982389---8179838989999999999984200 Q gi|254780934|r 328 RVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 (374) Q Consensus 328 ~~~~~~~~L~~~As~---~~~~~a~~~~~L~~~f~~I~~~i~~ 367 (374) |. .+.+.|+.+||. .|+|.+++.++|...-+.|.+.|+- T Consensus 144 g~-~~~~eL~~IAs~P~~~hvf~v~~F~~l~~l~~~l~~~iC~ 185 (224) T cd01475 144 GR-ADEEELREIASEPLADHVFYVEDFSTIEELTKKFQGKICV 185 (224) T ss_pred CC-CCHHHHHHHHCCCCHHCEEEECCHHHHHHHHHHHHHHHCC T ss_conf 74-7989999985597375689947988999999998761189 No 7 >cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if Probab=99.91 E-value=4.5e-23 Score=155.96 Aligned_cols=169 Identities=19% Similarity=0.126 Sum_probs=123.9 Q ss_pred ECCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEE--------- Q ss_conf 10377872499996578645113135313799999988763221002366567348999982685056102--------- Q gi|254780934|r 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--------- 235 (374) Q Consensus 165 ~~~~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 235 (374) ..+.+.|.++++|||.||||......+..||+.+|.++..+++.+.. ..|++++.|+.......+ T Consensus 14 ~~~~~~P~~~~lVlD~SGSM~~~~~~g~~rl~~ak~a~~~~v~~l~~------~drvgLv~F~~~~~~~~d~~~~~~~~~ 87 (206) T cd01456 14 ETEPQLPPNVAIVLDNSGSMREVDGGGETRLDNAKAALDETANALPD------GTRLGLWTFSGDGDNPLDVRVLVPKGC 87 (206) T ss_pred CCCCCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC------CCEEEEEEECCCCCCCCCCCEECCCCC T ss_conf 56898987389999798787787877645999999999999985799------987999997786777888513214565 Q ss_pred --------CCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHH Q ss_conf --------047868999998601336877444203676776432223333102578865642799980687788888068 Q gi|254780934|r 236 --------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 (374) Q Consensus 236 --------~t~~~~~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~ 307 (374) ...++..+...++.+....++|.++.++..+.+.+. +...+.|||||||++|++..... T Consensus 88 ~~~~~~~~~~~~r~~l~~~i~~l~~~~G~T~l~~al~~a~~~~~-------------~~~~~~IvLlTDG~~~~g~~~~~ 154 (206) T cd01456 88 LTAPVNGFPSAQRSALDAALNSLQTPTGWTPLAAALAEAAAYVD-------------PGRVNVVVLITDGEDTCGPDPCE 154 (206) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHC-------------CCCCCEEEEEECCCCCCCCCHHH T ss_conf 44434552377899999999745778896479999999998627-------------78764799992376446888599 Q ss_pred HHHHHHH--HHHCCCEEEEEEECCCCCHHHHHHHC--CCCCE-EEECCHH Q ss_conf 9999999--99789879999924822278899823--89817-9838989 Q gi|254780934|r 308 SLYYCNE--AKKRGAIVYAIGIRVIRSHEFLRACA--SPNSF-YLVENPH 352 (374) Q Consensus 308 ~~~~~~~--~k~~gi~IytIg~~~~~~~~~L~~~A--s~~~~-~~a~~~~ 352 (374) ....... .+..+|+||+||||.+.+.++|+++| ++|.| |.++++. T Consensus 155 ~~~~l~~~~~~~~~v~V~tig~G~d~d~~~L~~IA~~tgG~y~y~~~d~~ 204 (206) T cd01456 155 VARELAKRRTPAPPIKVNVIDFGGDADRAELEAIAEATGGTYAYNQSDLA 204 (206) T ss_pred HHHHHHHHCCCCCCEEEEEEEECCCCCHHHHHHHHHCCCCEEEEECCCCC T ss_conf 99999983177999589999718865899999999742978995167602 No 8 >cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions. Probab=99.90 E-value=9.1e-22 Score=148.20 Aligned_cols=172 Identities=22% Similarity=0.234 Sum_probs=137.5 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC---CCCCCCEEEEEEEECCCCEEEEECC---CCHHHH Q ss_conf 8724999965786451131353137999999887632210023---6656734899998268505610204---786899 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI---PDVNNVVQSGLVTFSNKIEEFFLLE---WGVSHL 243 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~t---~~~~~~ 243 (374) +|+|++|++|.|+|++.. .++.+|+.+..+++.+... +..+..+|++++.|++.+....++. .++..+ T Consensus 1 gpvDlvFllD~S~Sv~~~------~F~~~k~Fv~~lv~~f~~~~~~~i~~~~~rVgvv~ys~~~~~~~~~~~~~~~~~~l 74 (186) T cd01480 1 GPVDITFVLDSSESVGLQ------NFDITKNFVKRVAERFLKDYYRKDPAGSWRVGVVQYSDQQEVEAGFLRDIRNYTSL 74 (186) T ss_pred CCEEEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCEEEEECCCCCCCCHHHH T ss_conf 974699999688987878------99999999999999985301345687743898998558427986047775889999 Q ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEE Q ss_conf 99986013368774442036767764322233331025788656427999806877888880689999999997898799 Q gi|254780934|r 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 (374) Q Consensus 244 ~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~Iy 323 (374) +..|+.++..|++|+++.|+.++.+.+... .....+|++||||||+.+. .++..+...++.+|+.||+|| T Consensus 75 ~~~I~~i~y~gG~T~tg~AL~~a~~~~~~~---------~r~~~~kvlvliTDG~S~~-~~~~~~~~aa~~lr~~GV~if 144 (186) T cd01480 75 KEAVDNLEYIGGGTFTDCALKYATEQLLEG---------SHQKENKFLLVITDGHSDG-SPDGGIEKAVNEADHLGIKIF 144 (186) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHC---------CCCCCCEEEEEEECCCCCC-CCCHHHHHHHHHHHHCCCEEE T ss_conf 999975013589862999999999998613---------6789853899984587666-740669999999998798999 Q ss_pred EEEECCCCCHHHHHHHCC-CCCEEEECCHHHHHHHH Q ss_conf 999248222788998238-98179838989999999 Q gi|254780934|r 324 AIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAF 358 (374) Q Consensus 324 tIg~~~~~~~~~L~~~As-~~~~~~a~~~~~L~~~f 358 (374) +||+|.. +.+.|+.+|| +++.|.++|.++|.+.| T Consensus 145 aVGVG~~-~~~eL~~IAs~p~~~~~~~~f~~L~~~~ 179 (186) T cd01480 145 FVAVGSQ-NEEPLSRIACDGKSALYRENFAELLWSF 179 (186) T ss_pred EEEECCC-CHHHHHHHHCCCCCEEEECCHHHHHCCH T ss_conf 9994748-8799999858997389736899870111 No 9 >cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains. Probab=99.90 E-value=1.8e-21 Score=146.39 Aligned_cols=176 Identities=17% Similarity=0.261 Sum_probs=133.0 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEC----CCCHHHHHHHH Q ss_conf 24999965786451131353137999999887632210023665673489999826850561020----47868999998 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKI 247 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----t~~~~~~~~~i 247 (374) +||+||||.|||++.. .++.+|+.+..+++.+..... ..|++++.|+..+.....+ +.+...+...| T Consensus 1 lDivfllD~SgSIg~~------nF~~~k~Fv~~lv~~~~~~~~---~~rvgvv~ys~~~~~~f~l~~~~~~~~~~~~~~i 71 (198) T cd01470 1 LNIYIALDASDSIGEE------DFDEAKNAIKTLIEKISSYEV---SPRYEIISYASDPKEIVSIRDFNSNDADDVIKRL 71 (198) T ss_pred CEEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHCCCCC---CCEEEEEEECCCCEEEEECCCCCCCCHHHHHHHH T ss_conf 9199999798988878------899999999999998446687---7538999815885389971576666899999999 Q ss_pred HHCCCCC----CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHH----------HHHH Q ss_conf 6013368----7744420367677643222333310257886564279998068778888806899----------9999 Q gi|254780934|r 248 KYLSKFG----VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL----------YYCN 313 (374) Q Consensus 248 ~~l~~~g----~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~----------~~~~ 313 (374) +.+...+ +||++..++...+..+...... ......+.+|++||||||..|.+.++.... ..+. T Consensus 72 ~~i~y~~~~~~~gT~t~~AL~~~~~~~~~~~~~---~~~~~~~v~~v~illTDG~sn~g~~P~~~~~~~~~~~~~~~~a~ 148 (198) T cd01470 72 EDFNYDDHGDKTGTNTAAALKKVYERMALEKVR---NKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLVYKNNKSD 148 (198) T ss_pred HHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC---CCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 846033577886468999999999986555304---66444567559999737854578993367888777664101456 Q ss_pred HHHHCCCEEEEEEECCCCCHHHHHHHCCC--C--CEEEECCHHHHHHHHH Q ss_conf 99978987999992482227889982389--8--1798389899999999 Q gi|254780934|r 314 EAKKRGAIVYAIGIRVIRSHEFLRACASP--N--SFYLVENPHSMYDAFS 359 (374) Q Consensus 314 ~~k~~gi~IytIg~~~~~~~~~L~~~As~--~--~~~~a~~~~~L~~~f~ 359 (374) .+++.||.||+||+|.+.+.+.|+.+||. + |+|.++|.++|.++|+ T Consensus 149 ~~r~~gi~ifaiGVG~~~d~~eL~~IAS~~~~e~hvf~v~df~~L~~i~d 198 (198) T cd01470 149 NPREDYLDVYVFGVGDDVNKEELNDLASKKDNERHFFKLKDYEDLQEVFD 198 (198) T ss_pred HHHHCCCEEEEEEECCCCCHHHHHHHHCCCCCCCEEEEECCHHHHHHHHC T ss_conf 78873947999996661599999998579998716999689999998639 No 10 >cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells. Probab=99.89 E-value=3.8e-21 Score=144.48 Aligned_cols=178 Identities=20% Similarity=0.263 Sum_probs=137.1 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHH Q ss_conf 87249999657864511313531379999998876322100236656734899998268505610204786899999860 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~ 249 (374) .++||+|++|.|||++.+| ...++.++.+++.+. ....|++++.|+..+....+++.+++.....+.. T Consensus 3 ~~~DivFllD~S~Sv~~~f-------~~~~~Fv~~lv~~f~-----~~~~rvgvv~fS~~~~~~f~l~~~~~~~~~~~~~ 70 (185) T cd01474 3 GHFDLYFVLDKSGSVAANW-------IEIYDFVEQLVDRFN-----SPGLRFSFITFSTRATKILPLTDDSSAIIKGLEV 70 (185) T ss_pred CCEEEEEEEECCCCCCCCH-------HHHHHHHHHHHHHCC-----CCCEEEEEEEECCCCCEEEECCCCHHHHHHHHHH T ss_conf 8613899997899876576-------999999999998569-----9874999999869831898457870788999999 Q ss_pred CC--CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 13--3687744420367677643222333310257886564279998068778888806899999999978987999992 Q gi|254780934|r 250 LS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 (374) Q Consensus 250 l~--~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~ 327 (374) +. ..+++|+++.|+..+...+..... ++....|++|++|||..+.. ....+...++.+|+.||.||+||+ T Consensus 71 ~~~~~~~G~T~tg~AL~~a~~~~f~~~~-------g~R~~~kvlivlTDG~s~~~-~~~~~~~~a~~lr~~gV~i~aVGV 142 (185) T cd01474 71 LKKVTPSGQTYIHEGLENANEQIFNRNG-------GGRETVSVIIALTDGQLLLN-GHKYPEHEAKLSRKLGAIVYCVGV 142 (185) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHCCCC-------CCCCCCEEEEEEECCCCCCC-CCHHHHHHHHHHHHCCCEEEEEEC T ss_conf 8876158937899999999997503236-------99887628999932665676-214179999999978948999971 Q ss_pred CCCCCHHHHHHHCC-CCCEEEECC-HHHHHHHHHHHHHHHHEE Q ss_conf 48222788998238-981798389-899999999999842001 Q gi|254780934|r 328 RVIRSHEFLRACAS-PNSFYLVEN-PHSMYDAFSHIGKDIVTK 368 (374) Q Consensus 328 ~~~~~~~~L~~~As-~~~~~~a~~-~~~L~~~f~~I~~~i~~~ 368 (374) + +.+++.|..+|| +.+.|.+++ .++|....+.|.+.+|+. T Consensus 143 ~-~~~~~eL~~IAs~p~~vf~v~~~F~~L~~i~~~l~~~iC~~ 184 (185) T cd01474 143 T-DFLKSQLINIADSKEYVFPVTSGFQALSGIIESVVKKACIE 184 (185) T ss_pred C-CCCHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHHCCC T ss_conf 6-25999999871998648983475777899999999852879 No 11 >cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex. Probab=99.89 E-value=7.7e-22 Score=148.63 Aligned_cols=166 Identities=18% Similarity=0.235 Sum_probs=125.3 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEE---------CCCC Q ss_conf 7872499996578645113135313799999988763221002366567348999982685056102---------0478 Q gi|254780934|r 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL---------LEWG 239 (374) Q Consensus 169 ~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~t~~ 239 (374) ..|.|++||||.||||... |++.+|+++..+++.+.+. .+++++.|++.+....| ...+ T Consensus 11 ~~Pkdvv~vlD~SGSM~g~------kl~~ak~a~~~il~~L~~~------D~~~iv~Fs~~~~~~~p~~~~~~~~~t~~n 78 (190) T cd01463 11 TSPKDIVILLDVSGSMTGQ------RLHLAKQTVSSILDTLSDN------DFFNIITFSNEVNPVVPCFNDTLVQATTSN 78 (190) T ss_pred CCCCEEEEEEECCCCCCCC------HHHHHHHHHHHHHHHCCCC------CEEEEEEECCCCEEEECCCCCCEEECCHHH T ss_conf 8982699999799988973------4999999999999819987------799999968975363024568433689999 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHH--H Q ss_conf 68999998601336877444203676776432223333102578865642799980687788888068999999999--7 Q gi|254780934|r 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--K 317 (374) Q Consensus 240 ~~~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k--~ 317 (374) +..++..|+.|.+.| +|++..|+..|++.+........ ........+.|||+|||.+++ ...........+ . T Consensus 79 ~~~~~~~i~~l~~~G-~Tn~~~al~~A~~~l~~~~~~~~--~~~~~~~~~~IillTDG~~~~---~~~i~~~~~~~~~~~ 152 (190) T cd01463 79 KKVLKEALDMLEAKG-IANYTKALEFAFSLLLKNLQSNH--SGSRSQCNQAIMLITDGVPEN---YKEIFDKYNWDKNSE 152 (190) T ss_pred HHHHHHHHHHCCCCC-CCHHHHHHHHHHHHHHHHHCCCC--CCCCCCCCCEEEEEECCCCCC---HHHHHHHHHHHHCCC T ss_conf 999999998285798-72489999999999987420155--665555551599983698875---788999999975579 Q ss_pred CCCEEEEEEECCC-CCHHHHHHHC--CCCCEEEECCHH Q ss_conf 8987999992482-2278899823--898179838989 Q gi|254780934|r 318 RGAIVYAIGIRVI-RSHEFLRACA--SPNSFYLVENPH 352 (374) Q Consensus 318 ~gi~IytIg~~~~-~~~~~L~~~A--s~~~~~~a~~~~ 352 (374) .+|+|||+|||.+ .+.++|+.+| +.|.|++..+.+ T Consensus 153 ~~i~ift~G~G~~~~d~~~L~~iA~~~~G~y~~I~~~d 190 (190) T cd01463 153 IPVRVFTYLIGREVTDRREIQWMACENKGYYSHIQSLD 190 (190) T ss_pred CCEEEEEEEECCCCCCHHHHHHHHHCCCCEEEECCCCC T ss_conf 98799999967997787999999980995699788898 No 12 >cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if Probab=99.88 E-value=2.9e-21 Score=145.17 Aligned_cols=173 Identities=18% Similarity=0.205 Sum_probs=131.5 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHC Q ss_conf 72499996578645113135313799999988763221002366567348999982685056102047868999998601 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l 250 (374) .+.++||+|.||||... +|+.+++++..+++.+...+.....++++++.|++.+....|++.- ....+..| T Consensus 3 rlpvvlvlD~SGSM~G~------~i~~~k~al~~~~~~L~~d~~a~~~~~vsVItF~s~a~~~~pl~~~---~~~~~~~L 73 (176) T cd01464 3 RLPIYLLLDTSGSMAGE------PIEALNQGLQMLQSELRQDPYALESVEISVITFDSAARVIVPLTPL---ESFQPPRL 73 (176) T ss_pred CCCEEEEEECCCCCCCH------HHHHHHHHHHHHHHHHHCCCCCHHEEEEEEEEECCCEEEECCCCCH---HHCCCCCC T ss_conf 35789999789999984------7999999999999997118310113269999978951780586347---66475547 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 33687744420367677643222333310257886564279998068778888806899999999978987999992482 Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~ 330 (374) .+.| +|+++.|+..+.+.|........ .......+++|||||||+++. +.......+...+..++.|+++|+|.+ T Consensus 74 ~a~G-~T~~g~al~~a~~~l~~~~~~~~--~~~~~~~~P~I~LlTDG~PtD--~~~~~~~~~~~~~~~~~~i~a~giG~d 148 (176) T cd01464 74 TASG-GTSMGAALELALDCIDRRVQRYR--ADQKGDWRPWVFLLTDGEPTD--DLTAAIERIKEARDSKGRIVACAVGPK 148 (176) T ss_pred CCCC-CCHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCEEEEEECCCCCCC--CHHHHHHHHHHHHHCCCEEEEEEEECC T ss_conf 7789-98199999999999998652236--556677531799966899887--589999999988863976999997387 Q ss_pred CCHHHHHHHCCCCCEEEECCHHHHHHHHH Q ss_conf 22788998238981798389899999999 Q gi|254780934|r 331 RSHEFLRACASPNSFYLVENPHSMYDAFS 359 (374) Q Consensus 331 ~~~~~L~~~As~~~~~~a~~~~~L~~~f~ 359 (374) ++.++|++++.. .....++.++.+.|+ T Consensus 149 ad~~~L~~is~~--~~~~~~~~~f~~ff~ 175 (176) T cd01464 149 ADLDTLKQITEG--VPLLDDALSGLNFFK 175 (176) T ss_pred CCHHHHHHHHCC--CCCCCCHHHHHHHHC T ss_conf 189999988577--745345345888508 No 13 >cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions. Probab=99.86 E-value=1.3e-19 Score=135.46 Aligned_cols=166 Identities=19% Similarity=0.254 Sum_probs=129.8 Q ss_pred EEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEC-CCCEEEEECCCCHHHHHHHHHHCCC Q ss_conf 999965786451131353137999999887632210023665673489999826-8505610204786899999860133 Q gi|254780934|r 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSK 252 (374) Q Consensus 174 i~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~~~~~~~i~~l~~ 252 (374) ++||+|.||||... .|++.+|.++..++... .....+++++.|. ..+....|+|.+...++..|+.|++ T Consensus 3 vvfvvD~SGSM~~~-----~rl~~aK~a~~~ll~d~-----~~~~D~v~lv~F~g~~a~~~lppT~~~~~~~~~l~~L~~ 72 (178) T cd01451 3 VIFVVDASGSMAAR-----HRMAAAKGAVLSLLRDA-----YQRRDKVALIAFRGTEAEVLLPPTRSVELAKRRLARLPT 72 (178) T ss_pred EEEEEECCCCCCCC-----CHHHHHHHHHHHHHHHH-----CCCCCEEEEEEECCCCCEEECCCCCCHHHHHHHHHCCCC T ss_conf 99999898788875-----67999999999999974-----346788999997597555856887657999998721678 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHH--HHHHHHHHHHCCCEEEEEEECCC Q ss_conf 6877444203676776432223333102578865642799980687788888068--99999999978987999992482 Q gi|254780934|r 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ--SLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 253 ~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~--~~~~~~~~k~~gi~IytIg~~~~ 330 (374) .| +|++..|+..+++.+.... .++...+++||+|||..|.+.++.. ...++...++.||...+|+|+.. T Consensus 73 gG-~T~l~~gL~~a~~~~~~~~--------~~~~~~~~iiLlTDG~~N~g~~~~~~~~~~~a~~~~~~gi~~~vId~~~~ 143 (178) T cd01451 73 GG-GTPLAAGLLAAYELAAEQA--------RDPGQRPLIVVITDGRANVGPDPTADRALAAARKLRARGISALVIDTEGR 143 (178) T ss_pred CC-CCCHHHHHHHHHHHHHHHC--------CCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 89-8519999999999999850--------27898439999846986679995126999999999866997899979999 Q ss_pred C-CHHHHHHHC--CCCCEEEECCH--HHHHHHH Q ss_conf 2-278899823--89817983898--9999999 Q gi|254780934|r 331 R-SHEFLRACA--SPNSFYLVENP--HSMYDAF 358 (374) Q Consensus 331 ~-~~~~L~~~A--s~~~~~~a~~~--~~L~~~f 358 (374) . ...+|+++| .+|+||+.++. ++|.++. T Consensus 144 ~~~~~~~~~LA~~~~g~Y~~id~l~~~~i~~~v 176 (178) T cd01451 144 PVRRGLAKDLARALGGQYVRLPDLSADAIASAV 176 (178) T ss_pred CCCHHHHHHHHHHCCCCEEECCCCCHHHHHHHH T ss_conf 767489999999429969989979988999987 No 14 >cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions. Probab=99.85 E-value=6.6e-20 Score=137.14 Aligned_cols=167 Identities=24% Similarity=0.311 Sum_probs=133.0 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCC--CHHHHHHHHHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685056102047--86899999860 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY 249 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--~~~~~~~~i~~ 249 (374) +|++|++|.|+||... .++.+++.+..+++.+... ....|++++.|+..+.....+.. ++..+...++. T Consensus 1 lDl~fllD~S~Sv~~~------~F~~~k~fi~~lv~~f~i~---~~~~rvglv~ys~~~~~~~~l~~~~~~~~~~~~i~~ 71 (177) T cd01469 1 MDIVFVLDGSGSIYPD------DFQKVKNFLSTVMKKLDIG---PTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKH 71 (177) T ss_pred CCEEEEEECCCCCCHH------HHHHHHHHHHHHHHHCCCC---CCCCEEEEEEECCCEEEEEECCCCCCHHHHHHHHHH T ss_conf 9099999688999989------9999999999999866769---987489999936824999823556778999999862 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 13368774442036767764322233331025788656427999806877888880689999999997898799999248 Q gi|254780934|r 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 250 l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~ 329 (374) +...|++|+++.|+.++.+.+...... ..++.+|++|++|||..+.+ ......++.+|+.||+||+||+|. T Consensus 72 i~~~~g~t~t~~AL~~a~~~~f~~~~g------~R~~~~kv~ivlTDG~s~d~---~~~~~~~~~lk~~gv~vf~VGvG~ 142 (177) T cd01469 72 ISQLLGLTNTATAIQYVVTELFSESNG------ARKDATKVLVVITDGESHDD---PLLKDVIPQAEREGIIRYAIGVGG 142 (177) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCC------CCCCCEEEEEEEECCCCCCC---CCHHHHHHHHHHCCEEEEEEEECC T ss_conf 303689752527999999985364558------86787169999978986775---014999999997990899999555 Q ss_pred CC----CHHHHHHHCCC---CCEEEECCHHHHHH Q ss_conf 22----27889982389---81798389899999 Q gi|254780934|r 330 IR----SHEFLRACASP---NSFYLVENPHSMYD 356 (374) Q Consensus 330 ~~----~~~~L~~~As~---~~~~~a~~~~~L~~ 356 (374) .. +.+.|+.+||. .|+|.+++.++|.+ T Consensus 143 ~~~~~~~~~eL~~iAs~P~~~hvf~~~~f~~L~~ 176 (177) T cd01469 143 HFQRENSREELKTIASKPPEEHFFNVTDFAALKD 176 (177) T ss_pred CCCCCCCHHHHHHHHCCCCHHCEEEECCHHHHCC T ss_conf 1467451999999967985871998379777646 No 15 >smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods. Probab=99.84 E-value=2.8e-19 Score=133.38 Aligned_cols=160 Identities=27% Similarity=0.388 Sum_probs=127.8 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEC--CCCHHHHHHHHH Q ss_conf 724999965786451131353137999999887632210023665673489999826850561020--478689999986 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIK 248 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--t~~~~~~~~~i~ 248 (374) ++||+||+|.||||.. .+++.+|.++..++..+.... ...|++++.|++.+....++ ..+...+...|+ T Consensus 1 ~~di~~vvD~S~SM~~------~~~~~~k~~~~~~i~~l~~~~---~~~~v~vv~f~~~~~~~~~~~~~~~~~~~~~~i~ 71 (177) T smart00327 1 PLDVVFLLDGSGSMGP------NRFEKAKEFVLKLVEQLDIGP---DGDRVGLVTFSDDATVLFPLNDSRSKDALLEALA 71 (177) T ss_pred CCEEEEEEECCCCCCC------HHHHHHHHHHHHHHHHHHCCC---CCCEEEEEEECCCEEEEECCCCCCCHHHHHHHHH T ss_conf 9489999928899882------899999999999999864179---9878999996372689976888689999999997 Q ss_pred HCCC-CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 0133-687744420367677643222333310257886564279998068778888806899999999978987999992 Q gi|254780934|r 249 YLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 (374) Q Consensus 249 ~l~~-~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~ 327 (374) .+.. .+++|+++.++.++++.+....... ....+|++|++|||.++.+ ......+..+++.||.||+||+ T Consensus 72 ~l~~~~~g~t~~~~al~~a~~~~~~~~~~~------~~~~~~~iil~TDG~~~~~---~~~~~~~~~~~~~~v~i~~ig~ 142 (177) T smart00327 72 SLSYKLGGGTNLGAALQYALENLFSKSAGS------RRGAPKVLILITDGESNDG---GDLLKAAKELKRSGVKVFVVGV 142 (177) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHHHHHCCC------CCCCCEEEEEEECCCCCCC---HHHHHHHHHHHHCCCEEEEEEE T ss_conf 141557887764289999999997665037------7887428999805887872---5299999999867948999995 Q ss_pred CCCCCHHHHHHHCC--CCCEEEE Q ss_conf 48222788998238--9817983 Q gi|254780934|r 328 RVIRSHEFLRACAS--PNSFYLV 348 (374) Q Consensus 328 ~~~~~~~~L~~~As--~~~~~~a 348 (374) |.+.+...|+.+|+ ++.|++. T Consensus 143 g~~~~~~~l~~ia~~~~~~~~~~ 165 (177) T smart00327 143 GNDVDEEELKKLASAPGGVYVFL 165 (177) T ss_pred CCCCCHHHHHHHHHCCCCEEEEE T ss_conf 88479999999984899659995 No 16 >cd01473 vWA_CTRP CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions. Probab=99.84 E-value=1.2e-18 Score=129.64 Aligned_cols=178 Identities=17% Similarity=0.241 Sum_probs=132.7 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHH-HHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEC----CCCHHHHHHH Q ss_conf 24999965786451131353137999-999887632210023665673489999826850561020----4786899999 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDM-AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRK 246 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----t~~~~~~~~~ 246 (374) .|++|++|.|||.+.. .+.. +++.+..+++.+. -.++.+|++++.|+..+....++ +.++..+... T Consensus 1 ~DivFllD~S~SIg~~------nf~~~v~~F~~~lv~~f~---Ig~~~~rvgvv~yS~~~~~~~~f~~~~~~~k~~~l~~ 71 (192) T cd01473 1 YDLTLILDESASIGYS------NWRKDVIPFTEKIINNLN---ISKDKVHVGILLFAEKNRDVVPFSDEERYDKNELLKK 71 (192) T ss_pred CCEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHCC---CCCCCEEEEEEEECCCCCEEEECCCCCCCCHHHHHHH T ss_conf 9789999389986667------769999999999998756---5989619999995588740132355443489999999 Q ss_pred HHHCC---CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEE Q ss_conf 86013---368774442036767764322233331025788656427999806877888880689999999997898799 Q gi|254780934|r 247 IKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 (374) Q Consensus 247 i~~l~---~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~Iy 323 (374) |+.|. ..|++|+++.|+..+.+.+.... ...++.+|++|++|||..+.. +.......+..+|++||.|| T Consensus 72 i~~l~~~~~~gg~T~tg~AL~~~~~~~~~~~-------g~R~~vpkv~IvlTDG~s~~~-~~~~~~~~a~~lr~~gV~i~ 143 (192) T cd01473 72 INDLKNSYRSGGETYIVEALKYGLKNYTKHG-------NRRKDAPKVTMLFTDGNDTSA-SKKELQDISLLYKEENVKLL 143 (192) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHCCCC-------CCCCCCCEEEEEEECCCCCCC-CHHHHHHHHHHHHHCCCEEE T ss_conf 9998731468982479999999999863467-------888899749999956998873-16789999999998797899 Q ss_pred EEEECCCCCHHHHHHHCC-C--C---CEEEECCHHHHHHHHHHHHHHHHE Q ss_conf 999248222788998238-9--8---179838989999999999984200 Q gi|254780934|r 324 AIGIRVIRSHEFLRACAS-P--N---SFYLVENPHSMYDAFSHIGKDIVT 367 (374) Q Consensus 324 tIg~~~~~~~~~L~~~As-~--~---~~~~a~~~~~L~~~f~~I~~~i~~ 367 (374) +||+|. .+.+.|+.+|+ + + .+|...+-++|..+.+.|.+++|. T Consensus 144 avGVg~-~~~~eL~~iag~~~~~~~c~~~~~~~fd~l~~i~~~l~~~vC~ 192 (192) T cd01473 144 VVGVGA-ASENKLKLLAGCDINNDNCPNVIKTEWNNLNGISKFLTDKICD 192 (192) T ss_pred EEEECC-CCHHHHHHHHCCCCCCCCCCEEEECCHHHHHHHHHHHHHHHCC T ss_conf 998063-7999999986999889977579947978999999999997249 No 17 >cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners. Probab=99.84 E-value=8.5e-19 Score=130.52 Aligned_cols=170 Identities=22% Similarity=0.273 Sum_probs=130.6 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCC----CHH---HHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685056102047----868---999 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVS---HLQ 244 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~----~~~---~~~ 244 (374) +|++|++|.|||++.. .+.+.+|+.+..+++.+. -.++.+|++++.|+..+....++.. ++. ... T Consensus 1 lDlvFllD~S~SVg~~-----n~f~~~k~F~~~lv~~f~---I~~~~~rVgvv~ys~~~~~~~~l~~~~~~~~~~~~~~~ 72 (186) T cd01471 1 LDLYLLVDGSGSIGYS-----NWVTHVVPFLHTFVQNLN---ISPDEINLYLVTFSTNAKELIRLSSPNSTNKDLALNAI 72 (186) T ss_pred CEEEEEEECCCCCCCC-----CHHHHHHHHHHHHHHHCC---CCCCCEEEEEEEECCCCEEEEECCCCCCCCHHHHHHHH T ss_conf 9099999488988861-----318999999999999749---69884499999954870599875775544656799999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEE Q ss_conf 99860133687744420367677643222333310257886564279998068778888806899999999978987999 Q gi|254780934|r 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 (374) Q Consensus 245 ~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~Iyt 324 (374) ..+..++..|++|+++.|+..+.+.+.... ...++.+|++||+|||..+. +..+...++.+|++||+||+ T Consensus 73 ~~i~~~~y~gg~T~Tg~AL~~a~~~~f~~~-------g~R~~vpkv~illTDG~s~d---~~~~~~~a~~Lr~~GV~ifa 142 (186) T cd01471 73 RALLSLYYPNGSTNTTSALLVVEKHLFDTR-------GNRENAPQLVIIMTDGIPDS---KFRTLKEARKLRERGVIIAV 142 (186) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHCCCC-------CCCCCCCEEEEEEECCCCCC---CCHHHHHHHHHHHCCCEEEE T ss_conf 999837778996779999999999721146-------88999985999990698778---52589999999988999999 Q ss_pred EEECCCCCHHHHHHHCCC-CC-----EEEECCHHHHHHHHH Q ss_conf 992482227889982389-81-----798389899999999 Q gi|254780934|r 325 IGIRVIRSHEFLRACASP-NS-----FYLVENPHSMYDAFS 359 (374) Q Consensus 325 Ig~~~~~~~~~L~~~As~-~~-----~~~a~~~~~L~~~f~ 359 (374) ||+|...+.+.|+.+|+. +. .|..++-++|.++-+ T Consensus 143 vGVG~~v~~~eL~~Iag~~~~~~~c~~~~~~~~~~l~~~~~ 183 (186) T cd01471 143 LGVGQGVNHEENRSLVGCDPDDSPCPLYLQSSWSEVQNVIK 183 (186) T ss_pred EECCCCCCHHHHHHHCCCCCCCCCCCEEEECCHHHHHHHHH T ss_conf 98343249999999709998889986575178888874775 No 18 >cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif. Probab=99.83 E-value=4.4e-19 Score=132.22 Aligned_cols=157 Identities=23% Similarity=0.349 Sum_probs=125.4 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHHHHHC Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204--7868999998601 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL 250 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~i~~l 250 (374) ||+||+|.|+||... .++.+|+.+..+++.+... ....|++++.|+..+....++. .++..+...|+.+ T Consensus 2 Di~fvlD~S~Sv~~~------~f~~~k~fi~~li~~~~i~---~~~~rvgvv~fs~~~~~~~~l~~~~~~~~l~~~i~~i 72 (164) T cd01472 2 DIVFLVDGSESIGLS------NFNLVKDFVKRVVERLDIG---PDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNL 72 (164) T ss_pred CEEEEEECCCCCCHH------HHHHHHHHHHHHHHHCCCC---CCCCEEEEEEECCCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 799999797998879------9999999999999964768---8860899998247415874454669889999999861 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 33687744420367677643222333310257886564279998068778888806899999999978987999992482 Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~ 330 (374) ...+++|+++.|+.++.+.+..... ...++.+|++||||||..+. .....+..+|+.||+||+||+|. T Consensus 73 ~~~~g~t~~~~AL~~~~~~~~~~~~------~~r~~~~kvvvllTDG~s~~-----~~~~~a~~lr~~Gi~v~~VGig~- 140 (164) T cd01472 73 RYIGGGTNTGKALKYVRENLFTEAS------GSREGVPKVLVVITDGKSQD-----DVEEPAVELKQAGIEVFAVGVKN- 140 (164) T ss_pred HCCCCCCHHHHHHHHHHHHHHCCCC------CCCCCCEEEEEEEECCCCCC-----HHHHHHHHHHHCCCEEEEEECCC- T ss_conf 1668975299999999998635357------87678515999983799864-----08899999998898899997884- Q ss_pred CCHHHHHHHCCCC---CEEEECC Q ss_conf 2278899823898---1798389 Q gi|254780934|r 331 RSHEFLRACASPN---SFYLVEN 350 (374) Q Consensus 331 ~~~~~L~~~As~~---~~~~a~~ 350 (374) .+.+.|+.+||.+ ++|.+.+ T Consensus 141 ~~~~~L~~iAs~p~~~~~~~~~~ 163 (164) T cd01472 141 ADEEELKQIASDPKELYVFNVAD 163 (164) T ss_pred CCHHHHHHHHCCCCHHEEEECCC T ss_conf 79999999967993783896588 No 19 >pfam00092 VWA von Willebrand factor type A domain. Probab=99.83 E-value=1.5e-18 Score=129.07 Aligned_cols=170 Identities=23% Similarity=0.318 Sum_probs=129.3 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCH--HHHHHHH-HH Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204786--8999998-60 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKI-KY 249 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~--~~~~~~i-~~ 249 (374) |++||+|.|+||... +++.+|+++..++..+. ......|++++.|++.+....+++... ......+ .. T Consensus 1 Di~fvlD~S~Sm~~~------~~~~~k~~~~~~i~~~~---~~~~~~rv~lv~f~~~~~~~~~l~~~~~~~~~~~~~~~~ 71 (177) T pfam00092 1 DIVFLLDGSGSIGEA------NFEKVKEFIKKLVENLD---IGPDGTRVGLVQYSSDVTTEFSLNDYKSKDDLLSAVLRN 71 (177) T ss_pred CEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHC---CCCCCCEEEEEEECCCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 989999687998868------89999999999999836---588752899999458458996178868999999998643 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 13368774442036767764322233331025788656427999806877888880689999999997898799999248 Q gi|254780934|r 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 250 l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~ 329 (374) ....+++|+++.|+..+.+.+.... ...++.+|++||+|||.++.+..... .....+++.||.||+||+| T Consensus 72 ~~~~~g~t~~~~al~~a~~~~~~~~-------~~r~~~~k~vvllTDG~~~~~~~~~~--~~~~~~~~~gI~v~~vG~g- 141 (177) T pfam00092 72 IYYLGGGTNTGKALKYALENLFRSA-------GSRPNAPKVVILLTDGKSNDGGLVPA--AAAALRRKVGIIVFGVGVG- 141 (177) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCC-------CCCCCCEEEEEEEECCCCCCCCCCHH--HHHHHHHHCCCEEEEEECC- T ss_conf 1578995659999999999986354-------78878726899983698788864699--9999999789589999747- Q ss_pred CCCHHHHHHHCC----CCCEEEECCHHHHHHHHHHH Q ss_conf 222788998238----98179838989999999999 Q gi|254780934|r 330 IRSHEFLRACAS----PNSFYLVENPHSMYDAFSHI 361 (374) Q Consensus 330 ~~~~~~L~~~As----~~~~~~a~~~~~L~~~f~~I 361 (374) +.+.+.|+.+|+ .+++|.+.+.++|.++++.| T Consensus 142 ~~~~~~L~~ia~~~~~~~~~~~~~~~~~l~~~~~~i 177 (177) T pfam00092 142 DVDEEELRLIASEPCSEGHVFYVTDFDALSDIQEEL 177 (177) T ss_pred CCCHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHC T ss_conf 448999999968999898599958989999999619 No 20 >cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A Probab=99.83 E-value=6.9e-19 Score=131.08 Aligned_cols=152 Identities=26% Similarity=0.377 Sum_probs=122.8 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCC--CHHHHHHHHHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685056102047--86899999860 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY 249 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--~~~~~~~~i~~ 249 (374) +||+||+|.|+||... +++.+|+++..+++.+... ....|++++.|++.+....+++. ++..+...++. T Consensus 1 ~DivfvlD~S~Sm~~~------~~~~~k~~i~~~i~~~~~~---~~~~rv~lv~fs~~~~~~~~l~~~~~~~~l~~~i~~ 71 (161) T cd01450 1 LDIVFLLDGSESVGPE------NFEKVKDFIEKLVEKLDIG---PDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKN 71 (161) T ss_pred CEEEEEEECCCCCCHH------HHHHHHHHHHHHHHHCCCC---CCCCEEEEEEECCCEEEEECCCCCCCHHHHHHHHHH T ss_conf 9699999798998858------9999999999999970568---878589999955731687146564669999999984 Q ss_pred CCCCCC-CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 133687-7444203676776432223333102578865642799980687788888068999999999789879999924 Q gi|254780934|r 250 LSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 250 l~~~g~-~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~ 328 (374) +...++ +|+++.|+.++.+.+.... ...++.+|++||+|||.++.+. .+...+..+|+.||+||+||+| T Consensus 72 l~~~~~~~t~~~~AL~~~~~~~~~~~-------~~r~~~~kvivllTDG~~~~~~---~~~~~a~~lk~~gi~v~~vgiG 141 (161) T cd01450 72 LKYLGGGGTNTGKALQYALEQLFSES-------NARENVPKVIIVLTDGRSDDGG---DPKEAAAKLKDEGIKVFVVGVG 141 (161) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHCC-------CCCCCCCEEEEEEECCCCCCCC---CHHHHHHHHHHCCCEEEEEEEC T ss_conf 21368998548999999999986144-------6666675499998258878874---7999999999889989999826 Q ss_pred CCCCHHHHHHHCCCC Q ss_conf 822278899823898 Q gi|254780934|r 329 VIRSHEFLRACASPN 343 (374) Q Consensus 329 ~~~~~~~L~~~As~~ 343 (374) . .+.+.|+.+|+.+ T Consensus 142 ~-~~~~~L~~iA~~p 155 (161) T cd01450 142 P-ADEEELREIASCP 155 (161) T ss_pred C-CCHHHHHHHHCCC T ss_conf 4-8999999997799 No 21 >TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196. Probab=99.82 E-value=4.4e-18 Score=126.26 Aligned_cols=182 Identities=20% Similarity=0.224 Sum_probs=135.4 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHH Q ss_conf 77872499996578645113135313799999988763221002366567348999982685056102047868999998 Q gi|254780934|r 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 (374) Q Consensus 168 ~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i 247 (374) ...|+.+++|+|.||||.. ++..++++...|+..... ...++.++.|+..+....++|+|...+...+ T Consensus 50 ~~~P~sv~l~~D~S~s~~~-------~~~~~~~a~~~fl~~~l~-----p~d~~avv~F~~~~~l~~~fT~d~~~l~~al 117 (296) T TIGR03436 50 TDLPLTVGLVIDTSGSMFN-------DLARARAAAIRFLKTVLR-----PNDEVFVVTFSTQLRLLQDFTSDPRLLEAAL 117 (296) T ss_pred CCCCCEEEEEEECCCCCHH-------HHHHHHHHHHHHHHHHCC-----CCCEEEEEEECCCEEECCCCCCCHHHHHHHH T ss_conf 8898469999978999145-------399999999999986368-----8867999994895457278988999999999 Q ss_pred HHCCCC--------------CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 601336--------------877444203676776432223333102578865642799980687788888068999999 Q gi|254780934|r 248 KYLSKF--------------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 (374) Q Consensus 248 ~~l~~~--------------g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~ 313 (374) +.+.+. +++|.++.++..+...+.. ........+|.||++|||.++.... ...++.+ T Consensus 118 ~~l~~~~~~~~~~~~~~~~~~g~tal~dAi~laa~~~~~-------~~~~~~~gRK~li~iSdG~d~~s~~--~~~~~~~ 188 (296) T TIGR03436 118 NKLKPPLRTDYNSSGAFVADAGGTALYDAITLAALQQLA-------NALAGIPGRKALIVISDGEDNSSRD--TLERAIE 188 (296) T ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-------HHCCCCCCCEEEEEEECCCCCCCCC--CHHHHHH T ss_conf 861567654333345323578741027889999999987-------5404798867999992698863304--8999999 Q ss_pred HHHHCCCEEEEEEECCC-------------CCHHHHHHHC--CCCCEEEECCHHHHHHHHHHHHHHHHEEEEE Q ss_conf 99978987999992482-------------2278899823--8981798389899999999999842001666 Q gi|254780934|r 314 EAKKRGAIVYAIGIRVI-------------RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 (374) Q Consensus 314 ~~k~~gi~IytIg~~~~-------------~~~~~L~~~A--s~~~~~~a~~~~~L~~~f~~I~~~i~~~~~~ 371 (374) .+++.+|.||+|++... .+++.|+.+| +||++|..+ ..+|.++|++|++++.+..++ T Consensus 189 ~a~~a~v~IY~I~~~~~~~~~~~~~~~~~~~~~~~L~~lA~~TGG~~f~~~-~~dl~~~~~~i~~~lr~qY~l 260 (296) T TIGR03436 189 AAQRADVLIYSIDARGLRAPDLGAGAKAGLSGPETLERLAAETGGRAFYVN-SNDIDEAFAQIAEELRSQYVI 260 (296) T ss_pred HHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCEEECCC-CCCHHHHHHHHHHHHHHEEEE T ss_conf 999849779995467656656444445567627999999997399675547-410899999999987523899 No 22 >cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions. Probab=99.80 E-value=6.9e-18 Score=125.11 Aligned_cols=157 Identities=20% Similarity=0.286 Sum_probs=124.1 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHHHHHC Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204--7868999998601 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL 250 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~i~~l 250 (374) |++|++|.|+||... .++.+|+.+..+++.+.- .....|++++.|+..+....++. .+...+...|+.+ T Consensus 2 DlvfllD~S~Si~~~------~f~~~k~fi~~lv~~f~i---~~~~~rvgvv~ys~~~~~~~~l~~~~~~~~l~~~i~~i 72 (164) T cd01482 2 DIVFLVDGSWSIGRS------NFNLVRSFLSSVVEAFEI---GPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNL 72 (164) T ss_pred CEEEEEECCCCCCHH------HHHHHHHHHHHHHHHCCC---CCCCEEEEEEEECCCCEEEECCCCCCCHHHHHHHHHHC T ss_conf 799999698998889------999999999999996476---88862899999447512787343469989999998640 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 33687744420367677643222333310257886564279998068778888806899999999978987999992482 Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~ 330 (374) ...|++|+++.++.++.+.+...... ..++.+|++|++|||..+ ......+..+|+.||+||+||+|. T Consensus 73 ~~~~g~t~~~~AL~~~~~~~f~~~~g------~R~~~~kvlvliTDG~s~-----d~~~~~a~~lr~~gv~i~~VGVg~- 140 (164) T cd01482 73 PYKGGNTRTGKALTHVREKNFTPDAG------ARPGVPKVVILITDGKSQ-----DDVELPARVLRNLGVNVFAVGVKD- 140 (164) T ss_pred CCCCCCCCHHHHHHHHHHHHHCHHCC------CCCCCCEEEEEECCCCCC-----CHHHHHHHHHHHCCCEEEEEECCC- T ss_conf 26689972899999999986150028------988886079996079884-----338999999998893899997883- Q ss_pred CCHHHHHHHCCC---CCEEEECC Q ss_conf 227889982389---81798389 Q gi|254780934|r 331 RSHEFLRACASP---NSFYLVEN 350 (374) Q Consensus 331 ~~~~~L~~~As~---~~~~~a~~ 350 (374) .+.+.|+.+||. .|+|.++| T Consensus 141 ~~~~eL~~IAs~P~~~hvf~~~~ 163 (164) T cd01482 141 ADESELKMIASKPSETHVFNVAD 163 (164) T ss_pred CCHHHHHHHHCCCCHHCEEECCC T ss_conf 78999999968985661797479 No 23 >cd01481 vWA_collagen_alpha3-VI-like VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions. Probab=99.79 E-value=2.2e-17 Score=122.07 Aligned_cols=159 Identities=18% Similarity=0.274 Sum_probs=124.8 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHHHHHC Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204--7868999998601 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL 250 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~i~~l 250 (374) |++|++|.|+|+... .++..|+.++.+++.+.- .++..|++++.|+..+.....+. .++..+...|+.+ T Consensus 2 DlvFllD~S~si~~~------~F~~~k~Fv~~lv~~f~i---~~~~trVgvi~ys~~~~~~f~l~~~~~~~~l~~~I~~i 72 (165) T cd01481 2 DIVFLIDGSDNVGSG------NFPAIRDFIERIVQSLDV---GPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRL 72 (165) T ss_pred CEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHCC---CCCCEEEEEEEECCCEEEEEECCCCCCHHHHHHHHHHH T ss_conf 789999688998989------999999999999996046---88862788999868647999767768999999999841 Q ss_pred CCCCC-CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 33687-74442036767764322233331025788656427999806877888880689999999997898799999248 Q gi|254780934|r 251 SKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 251 ~~~g~-~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~ 329 (374) .+.++ +|+++.++.++.+.+..... .....++.+|++|++|||..+ ......+..+|+.||+||+||++. T Consensus 73 ~~~~g~~t~tg~AL~~a~~~~f~~~~----g~R~r~~v~kvlvviTdG~s~-----d~~~~~a~~lr~~gV~i~aVGvg~ 143 (165) T cd01481 73 RLRGGSQLNTGSALDYVVKNLFTKSA----GSRIEEGVPQFLVLITGGKSQ-----DDVERPAVALKRAGIVPFAIGARN 143 (165) T ss_pred HCCCCCCEEHHHHHHHHHHHHCCCCC----CCCCCCCCCEEEEEEECCCCC-----CHHHHHHHHHHHCCCEEEEEECCC T ss_conf 04589843699999999997167567----887557998699998489885-----378999999998897899996897 Q ss_pred CCCHHHHHHHCC-CCCEEEECC Q ss_conf 222788998238-981798389 Q gi|254780934|r 330 IRSHEFLRACAS-PNSFYLVEN 350 (374) Q Consensus 330 ~~~~~~L~~~As-~~~~~~a~~ 350 (374) .+...|+.+|| |.+.|.+++ T Consensus 144 -~~~~eL~~IAs~p~~vf~~~~ 164 (165) T cd01481 144 -ADLAELQQIAFDPSFVFQVSD 164 (165) T ss_pred -CCHHHHHHHHCCCCCEEECCC T ss_conf -999999998589877697389 No 24 >TIGR00868 hCaCC calcium-activated chloride channel protein 1; InterPro: IPR004727 This entry represents a family of Ca(2+)-regulated chloride channels (CLCA) which includes bovine, murine and human proteins , . Each CLCA exhibits a distinct, often overlapping, tissue expression pattern. With the exception of the truncated, secreted protein hCLCA3 , they are synthesized as an approximately 125 kDa precursor transmembrane glycoprotein that is rapidly cleaved into 90 and 35 kDa subunits. The human proteins have been shown to affect a large number of cell functions including chloride conductance, epithelial secretion, cell-cell adhesion, apoptosis, cell cycle control, mucus production in asthma, and blood pressure. The CLCA proteins expressed on the luminal surface of lung vascular endothelia (bCLCA2; mCLCA1; hCLCA2) serve as adhesion molecules for lung metastatic cancer cells, mediating vascular arrest and lung colonization. Expression of hCLCA2 in normal mammary epithelium is consistently lost in human breast cancer and in all tumorigenic breast cancer cell lines. Re-expression of hCLCA2 in human breast cancer cells abrogates tumorigenicity in nude mice, implying that hCLCA2 acts as a tumour suppressor in breast cancer.. Probab=99.78 E-value=3.1e-18 Score=127.16 Aligned_cols=165 Identities=22% Similarity=0.242 Sum_probs=119.5 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCC--CHHHHHHHHHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685056102047--86899999860 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY 249 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--~~~~~~~~i~~ 249 (374) .=++||||.||||...+ |+..+.+|.+-|+-+.-+ ...-+|+|.|.+-++....+.. +.......... T Consensus 308 RiVCLVLDKSGSM~~~d-----RL~RmNQAa~lFL~Q~vE-----~gs~VGmV~FDS~A~i~n~L~~I~s~~~~~~l~a~ 377 (874) T TIGR00868 308 RIVCLVLDKSGSMTKED-----RLKRMNQAAKLFLLQIVE-----KGSWVGMVTFDSAAEIKNELIKITSSDERDALTAN 377 (874) T ss_pred EEEEEEECCCCCCCCCC-----HHHHHHHHHHHHEEEEEE-----CCCEEEEEECCCEEEEEEEEEEECCHHHHHHHHHH T ss_conf 58999863443379885-----334555566430123554-----15267766306445765420775266899899870 Q ss_pred CCC-CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHH-HHHHCCCEEEEEEE Q ss_conf 133-6877444203676776432223333102578865642799980687788888068999999-99978987999992 Q gi|254780934|r 250 LSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN-EAKKRGAIVYAIGI 327 (374) Q Consensus 250 l~~-~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~-~~k~~gi~IytIg~ 327 (374) |+. ..|||-+..|++.|++.+....+...+ --||||||||+|.-.. |= +.|+.|+.||||++ T Consensus 378 LP~~a~GGTSIC~Gl~~aFq~I~~~~~~t~G---------SEi~LLTDGEDN~i~s-------C~~eVkqsGaIiHtiAL 441 (874) T TIGR00868 378 LPTEASGGTSICSGLKAAFQVIKKSDQSTDG---------SEIVLLTDGEDNTISS-------CIEEVKQSGAIIHTIAL 441 (874) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC---------CEEEEEECCCCCCEEE-------CHHHHHCCCEEEEEEEC T ss_conf 7787876803656676665433312666675---------3699830687576231-------30554109808998507 Q ss_pred CCCCCHHHHHHHC--CCCCEEEECC---HHHHHHHHHHHHH Q ss_conf 4822278899823--8981798389---8999999999998 Q gi|254780934|r 328 RVIRSHEFLRACA--SPNSFYLVEN---PHSMYDAFSHIGK 363 (374) Q Consensus 328 ~~~~~~~~L~~~A--s~~~~~~a~~---~~~L~~~f~~I~~ 363 (374) |..+++++ .... +||+.|+|++ -+.|.+||..|.. T Consensus 442 GpsAa~el-e~lS~mTGG~~fYa~D~~~~NgLidAFg~lsS 481 (874) T TIGR00868 442 GPSAAKEL-EELSDMTGGLRFYASDEADNNGLIDAFGALSS 481 (874) T ss_pred CHHHHHHH-HHHHHHCCCCEEEEECHHHCCCHHHHHHHHCC T ss_conf 84589999-99873338711334133331414546642214 No 25 >cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. Probab=99.78 E-value=1.8e-17 Score=122.56 Aligned_cols=151 Identities=28% Similarity=0.361 Sum_probs=120.0 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCC--CHHHHHHHHHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685056102047--86899999860 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY 249 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--~~~~~~~~i~~ 249 (374) .|++||+|.||||.. .+++.+++++..+++.+... ....+++++.|+..+....+++. +...+...++. T Consensus 1 ~div~vlD~S~Sm~~------~~~~~~k~~~~~~~~~l~~~---~~~~~v~vv~f~~~~~~~~~~~~~~~~~~~~~~i~~ 71 (161) T cd00198 1 ADIVFLLDVSGSMGG------EKLDKAKEALKALVSSLSAS---PPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDA 71 (161) T ss_pred CEEEEEEECCCCCCC------HHHHHHHHHHHHHHHHHHHC---CCCCEEEEEEECCCEEEEECCCCHHHHHHHHHHHHH T ss_conf 909999918899880------79999999999999987655---999889999937951488147412579999997751 Q ss_pred CCC-CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 133-6877444203676776432223333102578865642799980687788888068999999999789879999924 Q gi|254780934|r 250 LSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 250 l~~-~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~ 328 (374) +.. .+++|+++.++..+.+.+.... ..+.+|++||+|||.++.+. .........+|+.||.||+||+| T Consensus 72 ~~~~~~g~t~~~~al~~a~~~~~~~~---------~~~~~~~iiliTDG~~~~~~--~~~~~~~~~~~~~~v~i~~igig 140 (161) T cd00198 72 LKKGLGGGTNIGAALRLALELLKSAK---------RPNARRVIILLTDGEPNDGP--ELLAEAARELRKLGITVYTIGIG 140 (161) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHC---------CCCCCEEEEEECCCCCCCCH--HHHHHHHHHHHHCCCEEEEEEEC T ss_conf 35689998389999999999987532---------55565179996789989873--67999999999779989999966 Q ss_pred CCCCHHHHHHHCCC Q ss_conf 82227889982389 Q gi|254780934|r 329 VIRSHEFLRACASP 342 (374) Q Consensus 329 ~~~~~~~L~~~As~ 342 (374) .+.+.+.|+.+|+. T Consensus 141 ~~~~~~~l~~ia~~ 154 (161) T cd00198 141 DDANEDELKEIADK 154 (161) T ss_pred HHHCHHHHHHHHHC T ss_conf 11199999999838 No 26 >cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands. Probab=99.77 E-value=5.5e-17 Score=119.74 Aligned_cols=153 Identities=21% Similarity=0.284 Sum_probs=116.1 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEE--EEC--CCCHHHHHHHH Q ss_conf 24999965786451131353137999999887632210023665673489999826850561--020--47868999998 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF--FLL--EWGVSHLQRKI 247 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--t~~~~~~~~~i 247 (374) +|++|+||.|+|+.. .++..|+.+..+++.+.- .....|++++.|++..... ..+ ..++..+.+.| T Consensus 1 lDl~fllD~S~Sv~~-------~f~~~k~F~~~lv~~f~i---~~~~~rVgvv~ys~~~~~~i~f~l~~~~~~~~l~~~I 70 (163) T cd01476 1 LDLLFVLDSSGSVRG-------KFEKYKKYIERIVEGLEI---GPTATRVALITYSGRGRQRVRFNLPKHNDGEELLEKV 70 (163) T ss_pred CCEEEEEECCCCHHH-------HHHHHHHHHHHHHHHHCC---CCCCEEEEEEEECCCCCEEEEECCCCCCCHHHHHHHH T ss_conf 929999918888667-------399999999999996146---8885389999966987078887577779999999999 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 6013368774442036767764322233331025788656427999806877888880689999999997-898799999 Q gi|254780934|r 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIG 326 (374) Q Consensus 248 ~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~-~gi~IytIg 326 (374) +.++..|++|+++.|+..+.+.+.... ...++.+|++|++|||..+.. +...+..+|+ .||+||+|| T Consensus 71 ~~i~~~~g~T~tg~AL~~a~~~~~~~~-------g~R~~~~kv~vviTDG~s~d~-----~~~~a~~lr~~~gv~v~avg 138 (163) T cd01476 71 DNLRFIGGTTATGAAIEVALQQLDPSE-------GRREGIPKVVVVLTDGRSHDD-----PEKQARILRAVPNIETFAVG 138 (163) T ss_pred HHEECCCCCCCHHHHHHHHHHHHHHHC-------CCCCCCEEEEEEEECCCCCCC-----HHHHHHHHHHHCCCEEEEEE T ss_conf 752036898548999999999721420-------678996169999818987664-----88999999970998999998 Q ss_pred ECC--CCCHHHHHHHCCC-CCEE Q ss_conf 248--2227889982389-8179 Q gi|254780934|r 327 IRV--IRSHEFLRACASP-NSFY 346 (374) Q Consensus 327 ~~~--~~~~~~L~~~As~-~~~~ 346 (374) +|. +.+.+.|..+|++ +++| T Consensus 139 VG~~~~~d~~eL~~Ia~~~~~Vf 161 (163) T cd01476 139 TGDPGTVDTEELHSITGNEDHIF 161 (163) T ss_pred ECCCCCCCHHHHHHHCCCCCCCC T ss_conf 38865015999998649972545 No 27 >cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if Probab=99.70 E-value=1.6e-15 Score=111.03 Aligned_cols=143 Identities=19% Similarity=0.129 Sum_probs=108.9 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE-EEECCCCHHHHHHHHHHCC Q ss_conf 499996578645113135313799999988763221002366567348999982685056-1020478689999986013 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-FFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~t~~~~~~~~~i~~l~ 251 (374) .+++|||.||||... |+..+|.++..++..+... ..+++++.|++.... ..+.+.+.......+..+. T Consensus 2 pvV~vlD~SGSM~G~------~~~~ak~~~~~l~~~l~~~-----~~~~~lv~F~~~~~~~~~~~~~~~~~~~~~i~~~~ 70 (152) T cd01462 2 PVILLVDQSGSMYGA------PEEVAKAVALALLRIALAE-----NRDTYLILFDSEFQTKIVDKTDDLEEPVEFLSGVQ 70 (152) T ss_pred CEEEEEECCCCCCCC------HHHHHHHHHHHHHHHHHHC-----CCEEEEEEECCCCEEEECCCCCCHHHHHHHHHHCC T ss_conf 999999799998980------6999999999999973233-----98099999168735771587645999999997253 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCC Q ss_conf 36877444203676776432223333102578865642799980687788888068999999999789879999924822 Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~ 331 (374) +.| ||++..++..+...+... ...+..|||+|||+.... .......+...++.++++|++++|.+. T Consensus 71 ~~G-GT~i~~aL~~A~~~l~~~-----------~~~~~~IvlITDG~~~~~--~~~~~~~~~~~~~~~~r~~~~~iG~~~ 136 (152) T cd01462 71 LGG-GTDINKALRYALELIERR-----------DPRKADIVLITDGYEGGV--SDELLREVELKRSRVARFVALALGDHG 136 (152) T ss_pred CCC-CCCHHHHHHHHHHHHHCC-----------CCCCCEEEEEECCCCCCC--HHHHHHHHHHHHHCCEEEEEEEECCCC T ss_conf 689-865799999999987425-----------765646999826756798--399999999998389199999989998 Q ss_pred CHHHHHHHC Q ss_conf 278899823 Q gi|254780934|r 332 SHEFLRACA 340 (374) Q Consensus 332 ~~~~L~~~A 340 (374) +..+++..+ T Consensus 137 ~p~~~~~~~ 145 (152) T cd01462 137 NPGYDRISA 145 (152) T ss_pred CCHHHHHHH T ss_conf 827878766 No 28 >COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only] Probab=99.64 E-value=8.5e-15 Score=106.69 Aligned_cols=185 Identities=19% Similarity=0.215 Sum_probs=138.8 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHC Q ss_conf 72499996578645113135313799999988763221002366567348999982685056102047868999998601 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l 250 (374) .+-++|+||.||||... +|+.++..+..+++++...+.....+.+++++|++.+....|++.-.+. ..-.| T Consensus 3 RlP~~lllDtSgSM~Ge------~IealN~Glq~m~~~Lkqdp~Ale~v~lsIVTF~~~a~~~~pf~~~~nF---~~p~L 73 (207) T COG4245 3 RLPCYLLLDTSGSMIGE------PIEALNAGLQMMIDTLKQDPYALERVELSIVTFGGPARVIQPFTDAANF---NPPIL 73 (207) T ss_pred CCCEEEEEECCCCCCCC------CHHHHHHHHHHHHHHHHHCHHHHHEEEEEEEEECCCCEEEECHHHHHHC---CCCCE T ss_conf 77889999367542456------1799989999999998748465440578999826850687331557544---88701 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 33687744420367677643222333310257886564279998068778888806899999999978987999992482 Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~ 330 (374) .+.| +|.++.++..+.+++.+..+..... .-..++.+++|||||+++. +.+.....-..-....-.+-..++|.. T Consensus 74 ~a~G-gT~lGaAl~~a~d~Ie~~~~~~~a~--~kgdyrP~vfLiTDG~PtD--~w~~~~~~~~~~~~~~k~v~a~~~G~~ 148 (207) T COG4245 74 TAQG-GTPLGAALTLALDMIEERKRKYDAN--GKGDYRPWVFLITDGEPTD--DWQAGAALVFQGERRAKSVAAFSVGVQ 148 (207) T ss_pred ECCC-CCCHHHHHHHHHHHHHHHHHHCCCC--CCCCCCEEEEEECCCCCCH--HHHHHHHHHHHCCCCCCEEEEEEECCC T ss_conf 3699-9806799999999998777650567--7555441799953899665--777677776403310052899995354 Q ss_pred -CCHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHHHEEE Q ss_conf -227889982389817983898999999999998420016 Q gi|254780934|r 331 -RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 (374) Q Consensus 331 -~~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~~i~~~~ 369 (374) ++.+.|+++.+.-.-|+..+..++.+.|+.+..+|+.-+ T Consensus 149 ~ad~~~L~qit~~V~~~~t~d~~~f~~fFkW~SaSisagS 188 (207) T COG4245 149 GADNKTLNQITEKVRQFLTLDGLQFREFFKWLSASISAGS 188 (207) T ss_pred CCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 3441899998876525234534889999999987751323 No 29 >COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism] Probab=99.60 E-value=1.1e-13 Score=100.17 Aligned_cols=173 Identities=18% Similarity=0.219 Sum_probs=135.4 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEC-CCCEEEEECCCCHHHHHHH Q ss_conf 778724999965786451131353137999999887632210023665673489999826-8505610204786899999 Q gi|254780934|r 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSHLQRK 246 (374) Q Consensus 168 ~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~~~~~~~ 246 (374) .+....|+||+|.||||... .||..+|-++-.++..... ...+++++.|. .+++.+.++|.+.+.++.. T Consensus 75 ~r~g~lvvfvVDASgSM~~~-----~Rm~aaKG~~~~lL~dAYq-----~RdkvavI~F~G~~A~lll~pT~sv~~~~~~ 144 (261) T COG1240 75 GRAGNLIVFVVDASGSMAAR-----RRMAAAKGAALSLLRDAYQ-----RRDKVAVIAFRGEKAELLLPPTSSVELAERA 144 (261) T ss_pred CCCCCCEEEEEECCCCCHHH-----HHHHHHHHHHHHHHHHHHH-----CCCEEEEEEECCCCCEEEECCCCCHHHHHHH T ss_conf 47677489999476542057-----8999999999999999997-----0354899996377653884786539999999 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCC---HHHHHHHHHHHHCCCEEE Q ss_conf 86013368774442036767764322233331025788656427999806877888880---689999999997898799 Q gi|254780934|r 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED---QQSLYYCNEAKKRGAIVY 323 (374) Q Consensus 247 i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~---~~~~~~~~~~k~~gi~Iy 323 (374) +..+++.| .|.+..|+..++..+.... ..++....++|++|||..|.+... ..+..+|..+...|+.+- T Consensus 145 L~~l~~GG-~TPL~~aL~~a~ev~~r~~-------r~~p~~~~~~vviTDGr~n~~~~~~~~~e~~~~a~~~~~~g~~~l 216 (261) T COG1240 145 LERLPTGG-KTPLADALRQAYEVLAREK-------RRGPDRRPVMVVITDGRANVPIPLGPKAETLEAASKLRLRGIQLL 216 (261) T ss_pred HHHCCCCC-CCCHHHHHHHHHHHHHHHH-------CCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCEE T ss_conf 98389999-8843999999999999751-------048876538999737965888898657799999999852688479 Q ss_pred EEEECCCC-CHHHHHHHC--CCCCEEEECCHHH--HHHHH Q ss_conf 99924822-278899823--8981798389899--99999 Q gi|254780934|r 324 AIGIRVIR-SHEFLRACA--SPNSFYLVENPHS--MYDAF 358 (374) Q Consensus 324 tIg~~~~~-~~~~L~~~A--s~~~~~~a~~~~~--L~~~f 358 (374) .|.++.+. ...+.+.+| ++|.||+.++..+ |..+. T Consensus 217 vid~e~~~~~~g~~~~iA~~~Gg~~~~L~~l~~~~i~~~~ 256 (261) T COG1240 217 VIDTEGSEVRLGLAEEIARASGGEYYHLDDLSDDSIVSAV 256 (261) T ss_pred EEECCCCCCCCCHHHHHHHHHCCEEEECCCCCCHHHHHHH T ss_conf 9955785233447999999739907865556404899988 No 30 >cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif. Probab=99.56 E-value=4.9e-13 Score=96.20 Aligned_cols=153 Identities=22% Similarity=0.306 Sum_probs=108.3 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC--------EEEEECCCC-HHH Q ss_conf 24999965786451131353137999999887632210023665673489999826850--------561020478-689 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--------EEFFLLEWG-VSH 242 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~t~~-~~~ 242 (374) +.+.+++|.||||... .+++.+++++..+.+.+... .+++.++.|.+.. .....+... ... T Consensus 1 ~aV~lLlD~SgSM~~~-----~~i~~a~~a~~~l~~aL~~~-----g~~~~v~gF~s~~~~r~~~~~~~~k~f~e~~~~~ 70 (174) T cd01454 1 LAVTLLLDLSGSMRSD-----RRIDVAKKAAVLLAEALEAC-----GVPHAILGFTTDAGGRERVRWIKIKDFDESLHER 70 (174) T ss_pred CEEEEEEECCCCCCCC-----CHHHHHHHHHHHHHHHHHHC-----CCCEEEEECCCCCCCCCCEEEEECCCCCCCHHHH T ss_conf 9899999898688998-----48999999999999999976-----9956999515788984434789323667421145 Q ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCC-------CHHHHHHHHHH Q ss_conf 99998601336877444203676776432223333102578865642799980687788888-------06899999999 Q gi|254780934|r 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-------DQQSLYYCNEA 315 (374) Q Consensus 243 ~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~-------~~~~~~~~~~~ 315 (374) .+..+..+.+.| +|..+.++.|+...|... +..+|++|++|||+|+.... -..+...+.++ T Consensus 71 ~~~~i~~l~~~g-~Tr~G~Air~a~~~L~~~-----------~~~rkiliviSDG~P~D~~~~~~~~~~~~D~~~av~e~ 138 (174) T cd01454 71 ARKRLAALSPGG-NTRDGAAIRHAAERLLAR-----------PEKRKILLVISDGEPNDLDYYEGNVFATEDALRAVIEA 138 (174) T ss_pred HHHHHHCCCCCC-CCCCHHHHHHHHHHHHHC-----------CCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 688885118789-896179999999998639-----------76667999983899766777887553899999999999 Q ss_pred HHCCCEEEEEEECCCC---CHHHHHHHCCCCCEE Q ss_conf 9789879999924822---278899823898179 Q gi|254780934|r 316 KKRGAIVYAIGIRVIR---SHEFLRACASPNSFY 346 (374) Q Consensus 316 k~~gi~IytIg~~~~~---~~~~L~~~As~~~~~ 346 (374) +..||.+|+|+++.+. +.+.|+.+-+..+|- T Consensus 139 ~~~GI~~~~i~i~~~~~~~~~~~l~~i~g~~~~~ 172 (174) T cd01454 139 RKLGIEVFGITIDRDATTVDKEYLKNIFGEEGYA 172 (174) T ss_pred HHCCCEEEEEEECCCCCHHHHHHHHHHCCCCCEE T ss_conf 9879889999989855566999999842878778 No 31 >COG4961 TadG Flp pilus assembly protein TadG [Intracellular trafficking and secretion] Probab=99.49 E-value=1.5e-12 Score=93.30 Aligned_cols=68 Identities=22% Similarity=0.303 Sum_probs=60.5 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH Q ss_conf 68788864123686299999999999999999999999999999999999999999988763113310 Q gi|254780934|r 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 (374) Q Consensus 4 ~~~~~~~f~~~~~G~vaiifAl~l~~ll~~~g~avD~~~~~~~ks~Lq~A~DaAaLAaa~~~~~~~~~ 71 (374) +-+..+||.|||+|+++|.|||++|||++++++.||++.++..|.+||+|+|+|+++++.....+... T Consensus 8 ~~~~~~rF~rdr~Ga~AVeFAlvap~ll~l~~g~ve~~~~~~~~~~l~~a~d~aara~~~~~~~~~~~ 75 (185) T COG4961 8 LRGLLRRFRRDRRGAAAVEFALVAPPLLLLVFGIVEFGIAFLAKQSLQNAADAAARAAARGLTTDAAD 75 (185) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH T ss_conf 79999988764876899999999999999999999999999999999999999999998507644202 No 32 >cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo Probab=99.36 E-value=8.6e-11 Score=82.85 Aligned_cols=164 Identities=20% Similarity=0.188 Sum_probs=110.5 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH---CCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHH Q ss_conf 778724999965786451131353137999999887632210---0236656734899998268505610204--78689 Q gi|254780934|r 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV---KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSH 242 (374) Q Consensus 168 ~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~ 242 (374) .+..+|+++|+|.|..|.... +..++..+..++... ......+...|+|+++|+..++....+. ..... T Consensus 16 ~nLWLDVv~VVD~S~~mt~~g------l~~V~~~I~s~f~~~t~iGt~~~~pr~TRVGlVTYn~~AtvvAdLn~~~S~dd 89 (193) T cd01477 16 KNLWLDIVFVVDNSKGMTQGG------LWQVRATISSLFGSSSQIGTDYDDPRSTRVGLVTYNSNATVVADLNDLQSFDD 89 (193) T ss_pred HHEEEEEEEEEECCCCCCCCC------HHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCEEEECCCCCCCHHH T ss_conf 222378999996787656210------99999999999713540357889987338999996787459863454565788 Q ss_pred HHHHHHH----CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 9999860----133687744420367677643222333310257886564279998068778888806899999999978 Q gi|254780934|r 243 LQRKIKY----LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 (374) Q Consensus 243 ~~~~i~~----l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~ 318 (374) +...|.. .... ..+++..|+..|-++|... ......+++|+||+.+-.-...+. ..+..+++++|.. T Consensus 90 l~~~i~~~l~~vsss-~~SyL~~GL~aA~~~l~~~------~~~~R~nykKVVIVyAs~y~~~g~--~dp~pvA~rLK~~ 160 (193) T cd01477 90 LYSQIQGSLTDVSST-NASYLDTGLQAAEQMLAAG------KRTSRENYKKVVIVFASDYNDEGS--NDPRPIAARLKST 160 (193) T ss_pred HHHHHHHHHCCCCCC-CHHHHHHHHHHHHHHHHHC------CCCCCCCCCEEEEEEECCCCCCCC--CCHHHHHHHHHHC T ss_conf 999998875146666-3127999999999999833------266424862799999502467898--8869999999876 Q ss_pred CCEEEEEEECCCCCHH---HHHHHCCCCCEE Q ss_conf 9879999924822278---899823898179 Q gi|254780934|r 319 GAIVYAIGIRVIRSHE---FLRACASPNSFY 346 (374) Q Consensus 319 gi~IytIg~~~~~~~~---~L~~~As~~~~~ 346 (374) |+.|-||+|+.+.+.+ .|++|||+|.-| T Consensus 161 Gv~IiTVa~~q~~~~~~~~~L~~IASpg~nF 191 (193) T cd01477 161 GIAIITVAFTQDESSNLLDKLGKIASPGMNF 191 (193) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHCCCCCCC T ss_conf 9789999826887588999888757998887 No 33 >TIGR02442 Cob-chelat-sub cobaltochelatase subunit; InterPro: IPR012804 Cobalamin (vitamin B12) can be complexed with metal via ATP-dependent reactions (aerobic pathway) (e.g., in Pseudomonas denitrificans) or via ATP-independent reactions (anaerobic pathway) (e.g., in Salmonella typhimurium) , . The corresponding cobalt chelatases are not homologous. Cobaltochelatase is responsible for the insertion of cobalt into the corrin ring of coenzyme B12 during its biosynthesis. Two versions have been well described. CbiK/CbiX is a monomeric, anaerobic version which acts early in the biosynthesis (IPR010388 from INTERPRO). CobNST is a trimeric, ATP-dependent, aerobic version which acts late in the biosynthesis, (IPR011953 from INTERPRO, IPR006537 from INTERPRO, IPR006538 from INTERPRO) . The two pathways differ in the point of cobalt insertion during corrin ring formation . There are apparently a number of variations on these two pathways, where the major differences seem to be concerned with the process of ring contraction . Cobaltochelatase shows similarities with magnesium chelatase, which is also a complex ATP-dependent enzyme made up of two separable components. However, unlike the situation in cobaltochelatase, one of these two components is membrane bound in magnesium chelatase . . Probab=99.33 E-value=5.9e-11 Score=83.84 Aligned_cols=161 Identities=18% Similarity=0.163 Sum_probs=117.9 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEC-CCCEEEEECCCCHHHHHHHHH Q ss_conf 8724999965786451131353137999999887632210023665673489999826-850561020478689999986 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSHLQRKIK 248 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~~~~~~~i~ 248 (374) ..--|.||+|-||||... .||..+|-++-.++..... ...++++++|- .+++.+.|+|++.......+. T Consensus 507 ~G~LviFvVDASGSM~ar-----~RM~~~KGavLsLL~DAYq-----~RDkValI~FrG~~AevlLPPT~sv~~A~r~L~ 576 (688) T TIGR02442 507 AGNLVIFVVDASGSMAAR-----GRMAAAKGAVLSLLRDAYQ-----KRDKVALITFRGEEAEVLLPPTSSVELAARRLE 576 (688) T ss_pred HCCCEEEEEECCHHHHHH-----HHHHHHHHHHHHHHHHHHH-----HCCEEEEEECCCCEEEEECCCCCHHHHHHHHHH T ss_conf 115222353353204423-----5789989999998888886-----277688862367343576587884899999997 Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCC------C-C-CCHHHHHHHHHHHHC-- Q ss_conf 01336877444203676776432223333102578865642799980687788------8-8-806899999999978-- Q gi|254780934|r 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS------T-K-EDQQSLYYCNEAKKR-- 318 (374) Q Consensus 249 ~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~------~-~-~~~~~~~~~~~~k~~-- 318 (374) .|+..| .|....|+..|+..+..... .+...+.++|++|||--|- + . ....+..+...+... T Consensus 577 ~lPtGG-rTPLa~gL~~A~~v~~~~~~-------~~~~~~pl~V~iTDGRaNv~L~~~~g~~qp~~~~~~~a~~L~~~~~ 648 (688) T TIGR02442 577 ELPTGG-RTPLAAGLLKAAEVLSNELL-------RDDDRRPLVVVITDGRANVALDVSLGEPQPLDDARTIASKLAARAS 648 (688) T ss_pred HCCCCC-CCHHHHHHHHHHHHHHHHHH-------CCCCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 288989-87458999999999999861-------1689942899870786354266678884157789999999887504 Q ss_pred -----CCEEEEEEECC-CC-CHHHHHHHC--CCCCEEEE Q ss_conf -----98799999248-22-278899823--89817983 Q gi|254780934|r 319 -----GAIVYAIGIRV-IR-SHEFLRACA--SPNSFYLV 348 (374) Q Consensus 319 -----gi~IytIg~~~-~~-~~~~L~~~A--s~~~~~~a 348 (374) |+..-.|=-.. .. .--+-+++| .++.||.- T Consensus 649 R~R~Lg~~~vV~DTE~~~~v~lGlA~~~A~~lgg~~~~l 687 (688) T TIGR02442 649 RIRSLGIKFVVIDTENPGFVRLGLAEDLASALGGEYLRL 687 (688) T ss_pred CEEECCCEEEEEECCCCCCCCCCHHHHHHHHHCCCEECC T ss_conf 301116227899726887542223899999829832247 No 34 >PRK13406 bchD magnesium chelatase subunit D; Provisional Probab=99.29 E-value=8.7e-10 Score=76.88 Aligned_cols=168 Identities=17% Similarity=0.203 Sum_probs=122.8 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC-CCEEEEECCCCHHHHHHHHH Q ss_conf 87249999657864511313531379999998876322100236656734899998268-50561020478689999986 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGVSHLQRKIK 248 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~~~i~ 248 (374) ...-++||+|.||||.- .|+..+|-++..++..... ...+++++.|.. ..+.+.|+|......+..+. T Consensus 400 ~~~lviFvVDASGS~A~------~Rm~~aKGAV~~LL~dAY~-----~RD~ValIaFRG~~AevlLPPTrSv~~A~r~L~ 468 (584) T PRK13406 400 SETTTIFVVDASGSAAL------HRLAEAKGAVELLLAECYV-----RRDHVALVAFRGRGAELLLPPTRSLVRAKRSLA 468 (584) T ss_pred CCEEEEEEEECCCCHHH------HHHHHHHHHHHHHHHHHHH-----HHCEEEEEEECCCCCEEEECCCCCHHHHHHHHH T ss_conf 66069999828862799------9999999999999999996-----004478998768763074188655999999996 Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCC--------HHHHHHHHHHHHCCC Q ss_conf 013368774442036767764322233331025788656427999806877888880--------689999999997898 Q gi|254780934|r 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED--------QQSLYYCNEAKKRGA 320 (374) Q Consensus 249 ~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~--------~~~~~~~~~~k~~gi 320 (374) .|+.. ++|....|+..+++....... ......+||+|||--|-+-+. .....++..++..|| T Consensus 469 ~LP~G-G~TPLA~GL~~A~~l~~~~r~---------~~~~p~~VllTDGRaNv~ldg~~~r~~a~~da~~~A~~l~~~g~ 538 (584) T PRK13406 469 GLPGG-GGTPLAAGLDAALALALSVRR---------KGQTPTVVLLTDGRANIARDGAGGRAQAEEDALAAARALRAAGL 538 (584) T ss_pred CCCCC-CCCHHHHHHHHHHHHHHHHHC---------CCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 29999-988599999999999999755---------79954899982798777877788711489999999999997699 Q ss_pred EEEEEEECCCCCHHHHHHHC--CCCCEEEEC--CHHHHHHHHH Q ss_conf 79999924822278899823--898179838--9899999999 Q gi|254780934|r 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVE--NPHSMYDAFS 359 (374) Q Consensus 321 ~IytIg~~~~~~~~~L~~~A--s~~~~~~a~--~~~~L~~~f~ 359 (374) ..-.|-.+. ......+..| -++.||.-+ +++.|..+.+ T Consensus 539 ~~vVIDT~~-~~~~~a~~LA~~l~a~Y~~Lp~~~A~~l~~~V~ 580 (584) T PRK13406 539 PALVIDTSP-RPQPQARALAEAMGARYLPLPRADATRLSQAVR 580 (584) T ss_pred CEEEEECCC-CCCHHHHHHHHHCCCCEEECCCCCHHHHHHHHH T ss_conf 789994898-886269999998399189789789899999999 No 35 >TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D; InterPro: IPR011776 This entry represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (IPR011775 from INTERPRO), this subunit is not found in archaea.; GO: 0005524 ATP binding, 0016851 magnesium chelatase activity, 0015995 chlorophyll biosynthetic process. Probab=99.28 E-value=1.1e-10 Score=82.21 Aligned_cols=172 Identities=18% Similarity=0.227 Sum_probs=124.3 Q ss_pred EECCCCCCEEEEEEECCCC-CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC-CCEEEEECCCCHH Q ss_conf 3103778724999965786-4511313531379999998876322100236656734899998268-5056102047868 Q gi|254780934|r 164 VNSQTDARLDMMIVLDVSR-SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGVS 241 (374) Q Consensus 164 ~~~~~~~~~di~~VlD~Sg-SM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~~ 241 (374) +.....-..=++|++|-|| ||-. .|+..+|=||..++...... .+.+.++++.|-+ .+..+.|+|.... T Consensus 503 Kr~~~ksg~L~IF~VDASGSsaa~------~Rm~~AKGAV~~LL~~AYv~---RD~vkVaLi~FRG~~Ae~LLPPsrSv~ 573 (705) T TIGR02031 503 KRYRRKSGALLIFVVDASGSSAAV------ARMSEAKGAVELLLGEAYVH---RDQVKVALIAFRGTAAEVLLPPSRSVE 573 (705) T ss_pred HHHHCCCCCEEEEEEECCHHHHHH------HHHHHHHHHHHHHHHHHHHH---CCCEEEEEEECCCCHHHHCCCCHHHHH T ss_conf 443302882799976063578999------99987789999998765441---360357763044430000378523589 Q ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCC----------C------- Q ss_conf 99999860133687744420367677643222333310257886564279998068778888----------8------- Q gi|254780934|r 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK----------E------- 304 (374) Q Consensus 242 ~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~----------~------- 304 (374) ..++.++.|+. ||||....||..||..=..+ -..+.-.+-.|||+|||-.|-.- . T Consensus 574 ~aKr~L~~LP~-GGGtPLA~gL~~A~~~a~qa-------r~~GD~~~~~ivliTDGRgNvpL~~~~DP~~~~~~r~PrPt 645 (705) T TIGR02031 574 LAKRRLDVLPG-GGGTPLAAGLAAAVEVAKQA-------RSRGDVGRITIVLITDGRGNVPLDASVDPKAAKADRLPRPT 645 (705) T ss_pred HHHHHHCCCCC-CCCCHHHHHHHHHHHHHHHH-------HCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCH T ss_conf 99999715899-98567899999999999851-------02688524556776077877467656786100235678726 Q ss_pred ----CHHHHHHHHHHHHCCCEEEEEEEC-CCCCHHHHHHHCC--CCCEEEECCHH Q ss_conf ----068999999999789879999924-8222788998238--98179838989 Q gi|254780934|r 305 ----DQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACAS--PNSFYLVENPH 352 (374) Q Consensus 305 ----~~~~~~~~~~~k~~gi~IytIg~~-~~~~~~~L~~~As--~~~~~~a~~~~ 352 (374) ..+...++..+++.||-.-.|=-. .+...-+++++|. .++||+-+++. T Consensus 646 s~~l~~e~~~lA~~i~~~G~~~lVIDT~~~f~s~G~a~~lA~~~~a~Y~yLP~a~ 700 (705) T TIGR02031 646 SEELKEEVLALARKIREAGISALVIDTANKFVSTGFAKKLARKLGARYIYLPNAT 700 (705) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHCCCEEECCCCC T ss_conf 8999999999999988718865898267786676448999998589067136888 No 36 >KOG2353 consensus Probab=99.20 E-value=5.2e-10 Score=78.20 Aligned_cols=181 Identities=20% Similarity=0.273 Sum_probs=129.4 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEC---------CC Q ss_conf 778724999965786451131353137999999887632210023665673489999826850561020---------47 Q gi|254780934|r 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---------EW 238 (374) Q Consensus 168 ~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------t~ 238 (374) +..+-||++.+|.||||... +++.+|..+..+++++...+ .+.+++|+....+..++ .. T Consensus 222 At~pKdiviLlD~SgSm~g~------~~~lak~tv~~iLdtLs~~D------fvni~tf~~~~~~v~pc~~~~lvqAt~~ 289 (1104) T KOG2353 222 ATSPKDIVILLDVSGSMSGL------RLDLAKQTVNEILDTLSDND------FVNILTFNSEVNPVSPCFNGTLVQATMR 289 (1104) T ss_pred CCCCCCEEEEEECCCCCCCH------HHHHHHHHHHHHHHHCCCCC------EEEEEEECCCCCCCCCCCCCCEEECCHH T ss_conf 67866459999656555443------16999999999997615477------6878762135675652025852204567 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 86899999860133687744420367677643222333310257886564279998068778888806899999999978 Q gi|254780934|r 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 (374) Q Consensus 239 ~~~~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~ 318 (374) ++..+++.++.+.+.| .++...|+..++..|.+..... ....+..-.+.++++|||.++.. ....+.-+ .-++ T Consensus 290 nk~~~~~~i~~l~~k~-~a~~~~~~e~aF~lL~~~n~s~--~~~~~~~C~~~iml~tdG~~~~~---~~If~~yn-~~~~ 362 (1104) T KOG2353 290 NKKVFKEAIETLDAKG-IANYTAALEYAFSLLRDYNDSR--ANTQRSPCNQAIMLITDGVDENA---KEIFEKYN-WPDK 362 (1104) T ss_pred HHHHHHHHHHHHCCCC-CCCHHHHHHHHHHHHHHHCCCC--CCCCCCCCCEEEEEEECCCCCCH---HHHHHHHC-CCCC T ss_conf 7999999986414125-4124355778999998744455--44322500104577624775108---99998603-6777 Q ss_pred CCEEEEEEECCCC---CHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHHHE Q ss_conf 9879999924822---278899823898179838989999999999984200 Q gi|254780934|r 319 GAIVYAIGIRVIR---SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 (374) Q Consensus 319 gi~IytIg~~~~~---~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~~i~~ 367 (374) .|+|||.=+|... +....-.|++.|.|++..+..++.+--.+...=+.. T Consensus 363 ~Vrvftflig~~~~~~~~~~wmac~n~gyy~~I~~~~~v~~~~~~y~~vlsR 414 (1104) T KOG2353 363 KVRVFTFLIGDEVYDLDEIQWMACANKGYYVHIISIADVRENVLEYLDVLSR 414 (1104) T ss_pred CEEEEEEEECCCCCCCCCCHHHHHHCCCCEEECCCHHHCCHHHHHHHHHHCC T ss_conf 3599999924421345412122540788558646656458676556645320 No 37 >cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group. Probab=99.15 E-value=8.5e-09 Score=70.96 Aligned_cols=171 Identities=16% Similarity=0.225 Sum_probs=126.8 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCC-CEEEEECCCCHHHHHHHHHHC Q ss_conf 2499996578645113135313799999988763221002366567348999982685-056102047868999998601 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEWGVSHLQRKIKYL 250 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~t~~~~~~~~~i~~l 250 (374) .-+++|+|.|.||... +-.+.|+....+.+..|+.++....|. ..+|++...++ ++.+.+++.+.......+..+ T Consensus 4 R~l~iiiD~S~am~~~-D~~PtRl~~~~~~l~~Fi~effdqNPi---sqlGii~~rn~~a~~ls~lsgn~~~hi~~l~~~ 79 (183) T cd01453 4 RHLIIVIDCSRSMEEQ-DLKPSRLAVVLKLLELFIEEFFDQNPI---SQLGIISIKNGRAEKLTDLTGNPRKHIQALKTA 79 (183) T ss_pred EEEEEEEECCHHHHHC-CCCCCHHHHHHHHHHHHHHHHHCCCCC---CEEEEEEEECCEEEEEEECCCCHHHHHHHHHHC T ss_conf 6999999883767756-589549999999999999998707974---048999994681699764689989999999854 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 33687744420367677643222333310257886564279998068778888806899999999978987999992482 Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~ 330 (374) ....|....+.|+..|...|.... ....|.++|+ ...-.++ ++....+..+.+|+.+|++..||+..+ T Consensus 80 ~~~~G~~SLqN~Le~A~~~L~~~P---------~~~sREILiI-~~Sl~t~--DpgdI~~ti~~lk~~~IrvsvI~l~aE 147 (183) T cd01453 80 RECSGEPSLQNGLEMALESLKHMP---------SHGSREVLII-FSSLSTC--DPGNIYETIDKLKKENIRVSVIGLSAE 147 (183) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCC---------CCCCEEEEEE-ECCCCCC--CCCCHHHHHHHHHHCCCEEEEEEECHH T ss_conf 589998139999999999982089---------8784489999-7565347--976499999999983978999974278 Q ss_pred CCHHHHHHHC--CCCCEEEECCHHHHHHHHHH Q ss_conf 2278899823--89817983898999999999 Q gi|254780934|r 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 (374) Q Consensus 331 ~~~~~L~~~A--s~~~~~~a~~~~~L~~~f~~ 360 (374) . ..+|.++ ++|.|+-+-|..-+.+.+.+ T Consensus 148 v--~I~k~l~~~TgG~y~V~lde~H~~~ll~~ 177 (183) T cd01453 148 M--HICKEICKATNGTYKVILDETHLKELLLE 177 (183) T ss_pred H--HHHHHHHHHHCCEEEEECCHHHHHHHHHH T ss_conf 9--99999999839976875399999999995 No 38 >cd01457 vWA_ORF176_type VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most Probab=98.98 E-value=2.3e-08 Score=68.39 Aligned_cols=157 Identities=17% Similarity=0.149 Sum_probs=96.1 Q ss_pred CEEEEEEECCCCCCCCCCCC-CHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHH Q ss_conf 72499996578645113135-31379999998876322100236656734899998268505610204786899999860 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~-~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~ 249 (374) ..|.+|++|.||||.+.... +..||+.++.++..+.......++..- .++-+++..... ..-+.+.+....+. T Consensus 2 ~rD~v~lIDdSgSM~~~d~~~~~sRW~~a~~al~~iA~~c~~~D~DGI----dvyfln~~~~~~--~~~~~~~V~~iF~~ 75 (199) T cd01457 2 NRDYTLLIDKSGSMAEADEAKERSRWEEAQESTRALARKCEEYDSDGI----TVYLFSGDFRRY--DNVNSSKVDQLFAE 75 (199) T ss_pred CCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC----EEEEEECCCCCC--CCCCHHHHHHHHHC T ss_conf 977799996888766776788876299999999999999987488998----799962776456--88899999999855 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC-EEEEEEECCCCCCCCCCH-HHHHHHHHHH-HCCCEEEEEE Q ss_conf 133687744420367677643222333310257886564-279998068778888806-8999999999-7898799999 Q gi|254780934|r 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-KIIVFMTDGENLSTKEDQ-QSLYYCNEAK-KRGAIVYAIG 326 (374) Q Consensus 250 l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~-k~iillTDG~~~~~~~~~-~~~~~~~~~k-~~gi~IytIg 326 (374) ..+.| +|+++..+....+........ +.+.|+ -.+|++|||+++....-. .....+.++. ...+-|..+. T Consensus 76 ~~P~G-~T~~g~~L~~il~~y~~r~~~------~~~kp~g~~iIVITDG~p~D~~av~~~Ii~aa~kLd~~~qlgIqF~Q 148 (199) T cd01457 76 NSPDG-GTNLAAVLQDALNNYFQRKEN------GATCPEGETFLVITDGAPDDKDAVERVIIKASDELDADNELAISFLQ 148 (199) T ss_pred CCCCC-CCCHHHHHHHHHHHHHHHHHC------CCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 89899-796379999998999873200------68999860799982799798288999999999863440100367778 Q ss_pred ECCCC-CHHHHHHHC Q ss_conf 24822-278899823 Q gi|254780934|r 327 IRVIR-SHEFLRACA 340 (374) Q Consensus 327 ~~~~~-~~~~L~~~A 340 (374) +|.+. ..++|+..- T Consensus 149 VG~D~~A~~fL~~LD 163 (199) T cd01457 149 IGRDPAATAFLKALD 163 (199) T ss_pred ECCCHHHHHHHHHHC T ss_conf 559688999999858 No 39 >COG4548 NorD Nitric oxide reductase activation protein [Inorganic ion transport and metabolism] Probab=98.85 E-value=6.9e-08 Score=65.57 Aligned_cols=176 Identities=17% Similarity=0.249 Sum_probs=116.2 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCC------CEEEEECC-CCHHHHH Q ss_conf 2499996578645113135313799999988763221002366567348999982685------05610204-7868999 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK------IEEFFLLE-WGVSHLQ 244 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~t-~~~~~~~ 244 (374) +.+-+.+|.|-||..+++....=++..++++..+..-.....+ ...+..|.+. ++++.++. +-...+. T Consensus 447 la~TLLvD~S~St~a~mdetrRvidl~~eaL~~la~~~qa~gd-----~~~~~~fts~rr~~vri~tvk~FDes~~~~~~ 521 (637) T COG4548 447 LAFTLLVDVSASTDAKMDETRRVIDLFHEALLVLAHGHQALGD-----SEDILDFTSRRRPWVRINTVKDFDESMGETVG 521 (637) T ss_pred CEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCC-----HHHHCCCHHHCCCCEEEEEEECCCCCCCCCCC T ss_conf 1467876234336777652212578789999986132655178-----88743721213763123111034300045533 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCC------HHHHHHHHHHHHC Q ss_conf 9986013368774442036767764322233331025788656427999806877888880------6899999999978 Q gi|254780934|r 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED------QQSLYYCNEAKKR 318 (374) Q Consensus 245 ~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~------~~~~~~~~~~k~~ 318 (374) ..|-+|.+. .-|.++.++..+...|. ..++.+|.+|++|||++|.-..+ ..+..+...+++. T Consensus 522 ~RImALePg-~ytR~G~AIR~As~kL~-----------~rpq~qklLivlSDGkPnd~d~YEgr~gIeDTr~AV~eaRk~ 589 (637) T COG4548 522 PRIMALEPG-YYTRDGAAIRHASAKLM-----------ERPQRQKLLIVLSDGKPNDFDHYEGRFGIEDTREAVIEARKS 589 (637) T ss_pred HHHEECCCC-CCCCCCHHHHHHHHHHH-----------CCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 122133766-44431099999999983-----------474112489994489854344323321115379999999865 Q ss_pred CCEEEEEEECCCCCHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHH Q ss_conf 98799999248222788998238981798389899999999999842 Q gi|254780934|r 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 (374) Q Consensus 319 gi~IytIg~~~~~~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~~i 365 (374) ||.+|.|.+..+.. .++...-.-+-|-.+++...|..++-.|-+.+ T Consensus 590 Gi~VF~Vtld~ea~-~y~p~~fgqngYa~V~~v~~LP~~L~~lyrkL 635 (637) T COG4548 590 GIEVFNVTLDREAI-SYLPALFGQNGYAFVERVAQLPGALPPLYRKL 635 (637) T ss_pred CCEEEEEEECCHHH-HHHHHHHCCCCEEECCCHHHCCHHHHHHHHHH T ss_conf 83479998333055-55288852674697024001605579999996 No 40 >COG2425 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only] Probab=98.84 E-value=4.3e-08 Score=66.81 Aligned_cols=159 Identities=24% Similarity=0.243 Sum_probs=95.3 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCCC Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204786899999860133 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~~ 252 (374) .+++++|.||||.+. +.+.+|..+-.++..... ..-++.+.-|.+... ...++.....++..|+-|.. T Consensus 274 pvilllD~SGSM~G~------~e~~AKAvalAl~~~ala-----enR~~~~~lF~s~~~-~~el~~k~~~~~e~i~fL~~ 341 (437) T COG2425 274 PVILLLDKSGSMSGF------KEQWAKAVALALMRIALA-----ENRDCYVILFDSEVI-EYELYEKKIDIEELIEFLSY 341 (437) T ss_pred CEEEEEECCCCCCCC------HHHHHHHHHHHHHHHHHH-----HCCCEEEEEECCCCE-EEEECCCCCCHHHHHHHHHH T ss_conf 879999588885782------889999999999999988-----430538999525202-55505774579999999965 Q ss_pred -CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCC Q ss_conf -6877444203676776432223333102578865642799980687788888068999999999789879999924822 Q gi|254780934|r 253 -FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 (374) Q Consensus 253 -~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~ 331 (374) .||||+...++..+.+.+..... .+--+|++|||+..-. +.-...+-...|..+.++|+|-++.. T Consensus 342 ~f~GGTD~~~~l~~al~~~k~~~~-----------~~adiv~ITDg~~~~~--~~~~~~v~e~~k~~~~rl~aV~I~~~- 407 (437) T COG2425 342 VFGGGTDITKALRSALEDLKSREL-----------FKADIVVITDGEDERL--DDFLRKVKELKKRRNARLHAVLIGGY- 407 (437) T ss_pred HCCCCCCHHHHHHHHHHHHHCCCC-----------CCCCEEEEECCHHHHH--HHHHHHHHHHHHHHHCEEEEEEECCC- T ss_conf 068988858999999998643665-----------6777899804376654--67899999998875434899996478- Q ss_pred CHHHHHHHCCCCC-EEEECCHHHHHHHHHH Q ss_conf 2788998238981-7983898999999999 Q gi|254780934|r 332 SHEFLRACASPNS-FYLVENPHSMYDAFSH 360 (374) Q Consensus 332 ~~~~L~~~As~~~-~~~a~~~~~L~~~f~~ 360 (374) +...|+.+. ++ .|..+ +.+...+++. T Consensus 408 ~~~~l~~Is--d~~i~~~~-~~~~~kv~~~ 434 (437) T COG2425 408 GKPGLMRIS--DHIIYRVE-PRDRVKVVKR 434 (437) T ss_pred CCCCCCEEC--EEEEEEEC-CHHHHHHHHC T ss_conf 986600011--14678727-4777677734 No 41 >pfam11775 CobT_C Cobalamin biosynthesis protein CobT VWA domain. This family consists of several bacterial cobalamin biosynthesis (CobT) proteins. CobT is involved in the transformation of precorrin-3 into cobyrinic acid. Probab=98.79 E-value=6.6e-07 Score=59.71 Aligned_cols=166 Identities=13% Similarity=0.147 Sum_probs=89.3 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC-----E---------EE-- Q ss_conf 8724999965786451131353137999999887632210023665673489999826850-----5---------61-- Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-----E---------EF-- 233 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~---------~~-- 233 (374) ..+-+.+++|.||||.+. +|+.+...+.-+.+.+..... .+-+..|.... . +. T Consensus 11 ~Dt~VtlLID~SGSMrgr------~i~~Aa~~adiL~~~Ler~gv-----~~EILGFtT~~wkGg~~r~~w~~~G~p~~p 79 (220) T pfam11775 11 RDACVQLLIDLSGSMGGR------KIQLAAACADIIADALDRCGV-----KNEILGFTTFAWKGGPDREAMLAAGFPAFE 79 (220) T ss_pred CCEEEEEEEECCCCCCCC------HHHHHHHHHHHHHHHHHHCCC-----CEEEEEECCCCCCCCCHHHHHHHCCCCCCC T ss_conf 752899988688888988------678999999999999987699-----879985404765775029999875997672 Q ss_pred ---------EECCCCHH--HHHHHHHHCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCC- Q ss_conf ---------02047868--9999986013368--774442036767764322233331025788656427999806877- Q gi|254780934|r 234 ---------FLLEWGVS--HLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN- 299 (374) Q Consensus 234 ---------~~~t~~~~--~~~~~i~~l~~~g--~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~- 299 (374) .....+.. ..+..+..+-..| ..--=++++.||.+.|... +..+|+++++|||.| T Consensus 80 gRlndl~hiiyk~ad~~wrrar~~lg~m~~~gllkENiDGEAL~wA~~RL~~R-----------~e~RkILmViSDGaP~ 148 (220) T pfam11775 80 ALLLDIIHIINEKADAPEIRARKNLGCMCEEFLLKENIDGEALAQAAKLFAGR-----------MEDKKILLMISDGAPC 148 (220) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCC-----------CCCCEEEEEECCCCCC T ss_conf 36655664212446883077899987776631443189719999999998639-----------3124699997589967 Q ss_pred -------CCCCCCH-HHHHHHHHH-HHCCCEEEEEEECCCCCHHHHHHHCCCCCEEEECCHHHHHHHH-HHHHH Q ss_conf -------8888806-899999999-9789879999924822278899823898179838989999999-99998 Q gi|254780934|r 300 -------LSTKEDQ-QSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAF-SHIGK 363 (374) Q Consensus 300 -------~~~~~~~-~~~~~~~~~-k~~gi~IytIg~~~~~~~~~L~~~As~~~~~~a~~~~~L~~~f-~~I~~ 363 (374) |.+..-. ......... +..+|++..||+|.+...++.+.+ ..+.+.++|..+. +++++ T Consensus 149 ddst~s~n~~~yL~~hLr~vi~~ie~~~~iel~aIGIghDv~r~yY~~a------v~i~d~eeL~~~~~~~L~~ 216 (220) T pfam11775 149 DDSTLSVAAGDGFEQHLRHIIEEIETLSEIDLIAIGIGHDAPRRYYKNA------ALINDAEELGGAITEELAE 216 (220) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHCC------EEECCHHHHHHHHHHHHHH T ss_conf 7641125877767999999999985068826999874777686665065------6860388865999999999 No 42 >pfam04056 Ssl1 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex. Probab=98.49 E-value=3.1e-05 Score=49.77 Aligned_cols=174 Identities=14% Similarity=0.138 Sum_probs=120.3 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEE-CCCCEEEEECCCCHHHHHHHH Q ss_conf 7872499996578645113135313799999988763221002366567348999982-685056102047868999998 Q gi|254780934|r 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRKI 247 (374) Q Consensus 169 ~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~t~~~~~~~~~i 247 (374) ..-.-+++|+|.|-+|...+ -.+.|+....+.+..|+.++....|.. .+|++.. +..+..+.+++.+.......+ T Consensus 50 GiiRhl~iilD~S~aM~e~D-lkP~R~~~~l~~l~~Fi~efFdqNPiS---Qlgii~~rn~~a~~ls~lsgnp~~hi~aL 125 (250) T pfam04056 50 GIIRHLYIVLDCSRAMEEKD-LRPSRFACTIKYLETFVEEFFDQNPIS---QIGLITCKDGRAHRLTDLTGNPRVHIKAL 125 (250) T ss_pred CCEEEEEEEEECCHHHHHCC-CCCCHHHHHHHHHHHHHHHHHHCCCCC---CEEEEEEECCEEEEEEECCCCHHHHHHHH T ss_conf 20168999998827676351-586489999999999999987439830---22799996571378332579989999999 Q ss_pred HHCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 6013368--77444203676776432223333102578865642799980687788888068999999999789879999 Q gi|254780934|r 248 KYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 (374) Q Consensus 248 ~~l~~~g--~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytI 325 (374) ..+...+ |.-..+.|+..+...|... +....+-++|++.-= .+ .++.......+.+|+.+|++..| T Consensus 126 ~~~~~~~~~G~pSLqN~Le~a~~~L~~~---------P~~~sREILii~gSL-~T--~DPgdI~~tI~~l~~~~IrvsvI 193 (250) T pfam04056 126 KSLREAECGGDPSLQNALELARASLKHV---------PSHGSREVLIIFGSL-ST--CDPGDIYSTIDTLKKEKIRCSVI 193 (250) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHCC---------CCCCCEEEEEEEEEC-CC--CCCCCHHHHHHHHHHCCCEEEEE T ss_conf 9874069999920899999999887508---------987854899998204-44--58865999999999759079998 Q ss_pred EECCCCCHHHHHHHC--CCCCEEEECCHHHHHHHHHH Q ss_conf 924822278899823--89817983898999999999 Q gi|254780934|r 326 GIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 (374) Q Consensus 326 g~~~~~~~~~L~~~A--s~~~~~~a~~~~~L~~~f~~ 360 (374) |+.... ..+|.++ ++|.|.-+-|..-+.+.+.+ T Consensus 194 ~LaaEv--~Ick~l~~~T~G~y~V~lde~Hfk~ll~~ 228 (250) T pfam04056 194 GLSAEV--FICKELCKATNGTYSVALDETHLKELLLE 228 (250) T ss_pred EECHHH--HHHHHHHHHHCCEEEEECCHHHHHHHHHH T ss_conf 733899--99999999749988875699999999995 No 43 >pfam00362 Integrin_beta Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF. Probab=98.46 E-value=6.2e-05 Score=47.97 Aligned_cols=185 Identities=16% Similarity=0.182 Sum_probs=108.7 Q ss_pred ECCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCE------------- Q ss_conf 1037787249999657864511313531379999998876322100236656734899998268505------------- Q gi|254780934|r 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE------------- 231 (374) Q Consensus 165 ~~~~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 231 (374) ....+-|+|++++.|.|.||... ++.++.-...+...+... ...+|+|.-+|-+++. T Consensus 94 ~~aedYPVDLYYLMDLS~SM~dD-------l~~lk~LG~~La~~m~~i---T~nfrlGFGSFVDK~v~P~~~t~p~~l~~ 163 (424) T pfam00362 94 RQAEDYPVDLYYLMDLSYSMKDD-------LENLKTLGTDLAKEMANI---TSNFRLGFGSFVDKTVSPYVSTVPEKLKN 163 (424) T ss_pred ECCCCCCCEEEEEEECCHHHHHH-------HHHHHHHHHHHHHHHHHH---CCCCEEECCCCCCCCCCCCCCCCHHHHCC T ss_conf 71356971369986054146778-------999999999999999861---40435630200047668853379778539 Q ss_pred ----------------EEEECCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEE Q ss_conf ----------------6102047868999998601336877444203676776432223333102578865642799980 Q gi|254780934|r 232 ----------------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 (374) Q Consensus 232 ----------------~~~~~t~~~~~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillT 295 (374) -..++|.+...+...++....+| +-+..+|---|.-+. ..=.....+ ....++.+||.| T Consensus 164 PC~~~~~~C~~~fgf~~~l~LT~~~~~F~~~v~~q~iSg-NlD~PEGGfDAlmQ~---aVC~~~IGW-R~~arrllv~~T 238 (424) T pfam00362 164 PCSSKNPGCQPPFGFRHVLSLTDDTDRFNEEVKKQKISG-NLDAPEGGFDAIMQA---AVCGEEIGW-RNEARRLLVFTT 238 (424) T ss_pred CCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCC-CCCCCCCCHHHHHHH---HHHCCCCCC-CCCCEEEEEEEC T ss_conf 875778898898022200246777899999987463646-777875017777788---761423377-778528999985 Q ss_pred CCCCC--------------CC---------------CCCHHHHHHHHHHHHCCCE-EEEEEECCCCCHHHHHHHCCCCCE Q ss_conf 68778--------------88---------------8806899999999978987-999992482227889982389817 Q gi|254780934|r 296 DGENL--------------ST---------------KEDQQSLYYCNEAKKRGAI-VYAIGIRVIRSHEFLRACASPNSF 345 (374) Q Consensus 296 DG~~~--------------~~---------------~~~~~~~~~~~~~k~~gi~-IytIg~~~~~~~~~L~~~As~~~~ 345 (374) |+..+ .+ .++...-++...+++++|. ||+|.=..-.-.+-|.+.- +|.+ T Consensus 239 Da~fH~AgDGkL~GIv~PNDg~CHL~~~g~Yt~s~~~DYPSv~ql~~kl~ennI~~IFAVt~~~~~~Y~~Ls~~i-~gs~ 317 (424) T pfam00362 239 DAGFHFAGDGKLGGIVEPNDGQCHLDDNGEYTASTTLDYPSVGQLAEKLSENNINPIFAVTENVVDLYKELSELI-PGST 317 (424) T ss_pred CCCCCCCCCCCEEEEECCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHC-CCCE T ss_conf 887513577633435348887304489876144566788888999999986492599997502458999999757-7652 Q ss_pred E--EECCHHHH----HHHHHHHHHHH Q ss_conf 9--83898999----99999999842 Q gi|254780934|r 346 Y--LVENPHSM----YDAFSHIGKDI 365 (374) Q Consensus 346 ~--~a~~~~~L----~~~f~~I~~~i 365 (374) . -..+...+ .+++++|...+ T Consensus 318 vg~L~~DSsNIv~LI~~aY~ki~S~V 343 (424) T pfam00362 318 VGVLSSDSSNVVQLIKDAYNKISSKV 343 (424) T ss_pred EEEECCCCHHHHHHHHHHHHHHHEEE T ss_conf 56624675028999999998752289 No 44 >KOG2807 consensus Probab=98.38 E-value=3.3e-05 Score=49.63 Aligned_cols=175 Identities=14% Similarity=0.116 Sum_probs=123.4 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEC-CCCEEEEECCCCHHHHHHHH Q ss_conf 78724999965786451131353137999999887632210023665673489999826-85056102047868999998 Q gi|254780934|r 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSHLQRKI 247 (374) Q Consensus 169 ~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~~~~~~~i 247 (374) ..-.-+++|+|.|..|...+ =..+|...+.+.+..|+..+....|.. .++++.-. ..+..+..++.+......++ T Consensus 58 GiiRhl~iviD~S~am~e~D-f~P~r~a~~~K~le~Fv~eFFdQNPiS---Qigii~~k~g~A~~lt~ltgnp~~hI~aL 133 (378) T KOG2807 58 GIIRHLYIVIDCSRAMEEKD-FRPSRFANVIKYLEGFVPEFFDQNPIS---QIGIISIKDGKADRLTDLTGNPRIHIHAL 133 (378) T ss_pred HHHEEEEEEEEHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHCCCCHH---HEEEEEEECCHHHHHHHHCCCHHHHHHHH T ss_conf 22236899987345566444-780489999999999999986149620---33589970553268887148878899997 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 60133687744420367677643222333310257886564279998068778888806899999999978987999992 Q gi|254780934|r 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 (374) Q Consensus 248 ~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~ 327 (374) ..+..-++.-..+.++..+...|..- .+-..+.++|+++-=-. .++....+..+.+|..+|++..||. T Consensus 134 ~~~~~~~g~fSLqNaLe~a~~~Lk~~---------p~H~sREVLii~sslsT---~DPgdi~~tI~~lk~~kIRvsvIgL 201 (378) T KOG2807 134 KGLTECSGDFSLQNALELAREVLKHM---------PGHVSREVLIIFSSLST---CDPGDIYETIDKLKAYKIRVSVIGL 201 (378) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHCCC---------CCCCCEEEEEEEEEECC---CCCCCHHHHHHHHHHHCEEEEEEEE T ss_conf 31224488867887999999985178---------76563279999854035---5852099999999861727999850 Q ss_pred CCCCCHHHHHHH--CCCCCEEEECCHHHHHHHHHHH Q ss_conf 482227889982--3898179838989999999999 Q gi|254780934|r 328 RVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHI 361 (374) Q Consensus 328 ~~~~~~~~L~~~--As~~~~~~a~~~~~L~~~f~~I 361 (374) ... -.+-|.+ |++|.|+.+-+..-|.+.|.+- T Consensus 202 saE--v~icK~l~kaT~G~Y~V~lDe~HlkeLl~e~ 235 (378) T KOG2807 202 SAE--VFICKELCKATGGRYSVALDEGHLKELLLEH 235 (378) T ss_pred CHH--HHHHHHHHHHHCCEEEEEECHHHHHHHHHHC T ss_conf 055--8999999886188579875789999999845 No 45 >smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed). Probab=98.35 E-value=0.00017 Score=45.30 Aligned_cols=185 Identities=17% Similarity=0.197 Sum_probs=107.9 Q ss_pred ECCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCE------------- Q ss_conf 1037787249999657864511313531379999998876322100236656734899998268505------------- Q gi|254780934|r 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE------------- 231 (374) Q Consensus 165 ~~~~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 231 (374) ....+-|+|++++.|.|.||... ++.++.--..+...+... ...+|+|.-.|-+++. T Consensus 93 ~~aedYPVDLYYLMDLS~SM~dD-------l~~l~~LG~~La~~m~~i---T~nfrlGFGsFVDK~v~Py~~t~p~~l~~ 162 (423) T smart00187 93 RQAEDYPVDLYYLMDLSYSMKDD-------LDNLKSLGDDLAREMKGL---TSNFRLGFGSFVDKTVSPFVSTRPEKLEN 162 (423) T ss_pred ECCCCCCCEEEEEEECCHHHHHH-------HHHHHHHHHHHHHHHHHH---CCCCEEEEEECCCCCCCCCCCCCHHHHCC T ss_conf 03246971268885054457888-------999999999999999864---00545511112036657754489788549 Q ss_pred ----------------EEEECCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEE Q ss_conf ----------------6102047868999998601336877444203676776432223333102578865642799980 Q gi|254780934|r 232 ----------------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 (374) Q Consensus 232 ----------------~~~~~t~~~~~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillT 295 (374) -..++|.|...+...++....+| +-+..+|---|.-+. ..=.....+ ..+.++.+||.| T Consensus 163 PC~~~~~~C~ppfgf~n~l~LT~d~~~F~~~V~~q~iSg-NlD~PEGGfDAlmQ~---avC~~~IGW-R~~arrllVf~T 237 (423) T smart00187 163 PCPNYNLTCEPPYGFKHVLSLTDDTDEFNEEVKKQRISG-NLDAPEGGFDAIMQA---AVCTEQIGW-REDARRLLVFST 237 (423) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCC-CCCCCCCCHHHHHHH---HHHCCCCCC-CCCCEEEEEEEC T ss_conf 987889876798130111236788899999986253634-668876127788888---752000376-557438999983 Q ss_pred CCCCC--------------CCC---------------CCHHHHHHHHHHHHCCCE-EEEEEECCCCCHHHHHHHCCCCCE Q ss_conf 68778--------------888---------------806899999999978987-999992482227889982389817 Q gi|254780934|r 296 DGENL--------------STK---------------EDQQSLYYCNEAKKRGAI-VYAIGIRVIRSHEFLRACASPNSF 345 (374) Q Consensus 296 DG~~~--------------~~~---------------~~~~~~~~~~~~k~~gi~-IytIg~~~~~~~~~L~~~As~~~~ 345 (374) |+..+ .+. ++...-++...+++++|. ||+|.=..-.-.+-|...= +|.. T Consensus 238 Da~fH~AgDGkL~GIv~PNDg~CHLd~~g~Yt~s~~~DYPSi~ql~~kl~ennI~~IFAVT~~~~~~Y~~Ls~~i-pgs~ 316 (423) T smart00187 238 DAGFHFAGDGKLAGIVQPNDGQCHLDNNGEYTMSTTQDYPSIGQLNQKLAENNINPIFAVTKKQVSLYKELSALI-PGSS 316 (423) T ss_pred CCCCCCCCCCCEEEEECCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHC-CCCE T ss_conf 786302367624435437887303278885244565678878999999985393279985220456999998757-7540 Q ss_pred E--EECCHH---H-HHHHHHHHHHHH Q ss_conf 9--838989---9-999999999842 Q gi|254780934|r 346 Y--LVENPH---S-MYDAFSHIGKDI 365 (374) Q Consensus 346 ~--~a~~~~---~-L~~~f~~I~~~i 365 (374) . -.++.. + +.+++++|...+ T Consensus 317 vg~L~~DSsNVv~LI~~aY~ki~S~V 342 (423) T smart00187 317 VGVLSEDSSNVVELIKDAYNKISSRV 342 (423) T ss_pred EEEECCCCHHHHHHHHHHHHHHCEEE T ss_conf 35524575138999999998750189 No 46 >KOG3768 consensus Probab=98.32 E-value=1.2e-05 Score=52.27 Aligned_cols=191 Identities=15% Similarity=0.130 Sum_probs=119.0 Q ss_pred EEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEE-EECCCCHHHHHHHHHHCCC Q ss_conf 999965786451131353137999999887632210023665673489999826850561-0204786899999860133 Q gi|254780934|r 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK 252 (374) Q Consensus 174 i~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~t~~~~~~~~~i~~l~~ 252 (374) +.|++|+||||...-.....-+|.+|.++..|+........ ....|+-+.+|.....-. ..+...-..+...|..|.+ T Consensus 4 ~lFllDTS~SM~qrah~~~tylD~AKgaVEtFiK~R~r~~~-~~gdryml~TfeepP~~vk~~~~~~~a~~~~eik~l~a 82 (888) T KOG3768 4 FLFLLDTSGSMSQRAHPQFTYLDLAKGAVETFIKQRTRVGR-ETGDRYMLTTFEEPPKNVKVACEKLGAVVIEEIKKLHA 82 (888) T ss_pred EEEEEECCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCC-CCCCEEEEEECCCCCHHHHHHHHHCCCHHHHHHHHHCC T ss_conf 99997066412220367831567777799999998741541-24865899862458602556776503079888776157 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHH----HCCCCCC-CCCCEEEEEEECCCCCCCCCC----------------HHHHHH Q ss_conf 6877444203676776432223333----1025788-656427999806877888880----------------689999 Q gi|254780934|r 253 FGVSTNSTPGLKYAYNQIFDMQGMR----QHCNTED-ANYKKIIVFMTDGENLSTKED----------------QQSLYY 311 (374) Q Consensus 253 ~g~~T~~~~~~~~a~~~l~~~~~~~----~~~~~~~-~~~~k~iillTDG~~~~~~~~----------------~~~~~~ 311 (374) .++.+....+...|++.|.-.--.. .+..+.- .--.-+||++|||.--.+... +.+++. T Consensus 83 ~~~s~~~~~~~t~AFdlLnlnR~qtGID~yGqGR~pf~lEP~~iI~iTDG~r~s~~~GV~~e~~Lpl~~p~pGse~Tkep 162 (888) T KOG3768 83 PYGSCQLHHAITEAFDLLNLNRVQTGIDGYGQGRLPFNLEPVTIILITDGGRYSGVAGVPIEFRLPLDPPFPGSEMTKEP 162 (888) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCCCC T ss_conf 63124556788877654315555305545555657456686489998248721344677515885068999862023563 Q ss_pred HHHHHHCCCEEEEEEECCC--------------C-CHHHHHHHC-CCCCEEEECCHHHHHHHHHHHHHHHHEEEE Q ss_conf 9999978987999992482--------------2-278899823-898179838989999999999984200166 Q gi|254780934|r 312 CNEAKKRGAIVYAIGIRVI--------------R-SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 (374) Q Consensus 312 ~~~~k~~gi~IytIg~~~~--------------~-~~~~L~~~A-s~~~~~~a~~~~~L~~~f~~I~~~i~~~~~ 370 (374) .. . .-+.|++-+.-+ . +.-.-+-|+ ++|+-|.+-+.-.|....+.+.+. .+-.+ T Consensus 163 FR-W---DQrlftlVlRiPgt~~~~~~qlt~Vp~Dds~IermCevTGGRSysV~Spr~lnqciesLvqk-vQ~gV 232 (888) T KOG3768 163 FR-W---DQRLFTLVLRIPGTPYPTISQLTAVPIDDSVIERMCEVTGGRSYSVVSPRQLNQCIESLVQK-VQYGV 232 (888) T ss_pred CH-H---HHHHHEEEEECCCCCCCCHHHHCCCCCCCHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHH-HCCCE T ss_conf 02-3---43120046755899986675505777773266776653178503553799999999999986-44683 No 47 >cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A Probab=98.30 E-value=0.00012 Score=46.23 Aligned_cols=175 Identities=12% Similarity=0.164 Sum_probs=106.1 Q ss_pred EEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCE------EEEECCCCHHHHH--H Q ss_conf 9999657864511313531379999998876322100236656734899998268505------6102047868999--9 Q gi|254780934|r 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE------EFFLLEWGVSHLQ--R 245 (374) Q Consensus 174 i~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~t~~~~~~~--~ 245 (374) +-+|+|.|||| +-+++.-.|++..-+++.-++..+...+ ......++..++... ...++.++++.++ + T Consensus 3 lr~v~DvSgSM-YRFNg~DgRL~R~lEa~~MvMEaf~g~e---~k~~ydIvGHSGd~~~I~lV~~~~~Pk~~keRl~vl~ 78 (191) T cd01455 3 LKLVVDVSGSM-YRFNGYDGRLDRSLEAVVMVMEAFDGFE---DKIQYDIIGHSGDGPCVPFVKTNHPPKNNKERLETLK 78 (191) T ss_pred EEEEEECCCCE-EEECCCCHHHHHHHHHHHHHHHHHHCCC---CEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 69999735442-3304753289999999999999861754---0057887502688775102348999986689999999 Q ss_pred HHHHCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHH-HHCCCEEE Q ss_conf 98601336-87744420367677643222333310257886564279998068778888806899999999-97898799 Q gi|254780934|r 246 KIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVY 323 (374) Q Consensus 246 ~i~~l~~~-g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~-k~~gi~Iy 323 (374) .+-+-+.. ..|-++-+++..+...+..... .-..++|++||-.-..+... +..+...+ ++.+|.-| T Consensus 79 ~M~AHsQyC~sGD~Tlea~~~Ai~~l~a~~d----------~De~fVivlSDANL~RYgI~--p~~l~~~l~~~p~V~a~ 146 (191) T cd01455 79 MMHAHSQFCWSGDHTVEATEFAIKELAAKED----------FDEAIVIVLSDANLERYGIQ--PKKLADALAREPNVNAF 146 (191) T ss_pred HHHHHHHHEECCCCHHHHHHHHHHHHHHCCC----------CCCCEEEEECCCCHHHCCCC--HHHHHHHHHCCCCCCEE T ss_conf 8631201002588448999999998753026----------77608999814764431889--89999997338776689 Q ss_pred EEEECCCCCH-HHHHHHCCCCCEEEECCHHHHHHHHHHHHHH Q ss_conf 9992482227-8899823898179838989999999999984 Q gi|254780934|r 324 AIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 (374) Q Consensus 324 tIg~~~~~~~-~~L~~~As~~~~~~a~~~~~L~~~f~~I~~~ 364 (374) .|-+|.-.++ +-|++-=-.|+-|..-+..+|..+|++|... T Consensus 147 ~IfIgslg~eA~~l~~~lP~G~~fVc~dt~~lP~il~qIfts 188 (191) T cd01455 147 VIFIGSLSDEADQLQRELPAGKAFVCMDTSELPHIMQQIFTS 188 (191) T ss_pred EEEEECHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHH T ss_conf 999735167999999748997417853653678999999887 No 48 >PRK10997 yieM hypothetical protein; Provisional Probab=98.17 E-value=0.00011 Score=46.53 Aligned_cols=142 Identities=16% Similarity=0.166 Sum_probs=87.8 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCCC Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204786899999860133 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~~ 252 (374) -+++++|+||||.+. +=..+|..+-.++...... .-.+-++.|+...... .++. ...+...++-|.. T Consensus 322 P~IvCVDTSGSM~G~------pE~~AKA~~Lal~r~Al~e-----~R~CyvI~FSte~~t~-eLt~-~~gl~~l~~FL~~ 388 (484) T PRK10997 322 PFIVCVDTSGSMGGF------NEQCAKAFCLALMRIALAE-----NRRCYIMLFSTEVITY-ELSG-PDGLEQAIRFLSQ 388 (484) T ss_pred CEEEEEECCCCCCCC------HHHHHHHHHHHHHHHHHHC-----CCCEEEEEECCCEEEE-EECC-CCCHHHHHHHHCC T ss_conf 979999588888997------6889999999999999962-----8987999812651789-8048-7887999998528 Q ss_pred -CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHH-HHCCCEEEEEEECCC Q ss_conf -687744420367677643222333310257886564279998068778888806899999999-978987999992482 Q gi|254780934|r 253 -FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVI 330 (374) Q Consensus 253 -~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~-k~~gi~IytIg~~~~ 330 (374) ..|||+...++..+...+....... -=++++||-.-. .-+......+... |..+-+.|+|.++.- T Consensus 389 sF~GGTD~~~~L~~~l~~m~~~~y~~-----------ADllvISDFIa~--~lp~~l~~kv~~lqk~~~nrFhav~is~~ 455 (484) T PRK10997 389 SFRGGTDLAPCLRAIIEKMQGREWFD-----------ADAVVISDFIAQ--RLPDELVAKVKELQRVHQHRFHAVAMSAH 455 (484) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCC-----------CCEEEECHHCCC--CCCHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 88898457999999999862324465-----------887997122065--69999999999999850683588840123 Q ss_pred CCHHHHHHHC Q ss_conf 2278899823 Q gi|254780934|r 331 RSHEFLRACA 340 (374) Q Consensus 331 ~~~~~L~~~A 340 (374) ....+|+..- T Consensus 456 g~p~~m~iFD 465 (484) T PRK10997 456 GKPGIMRIFD 465 (484) T ss_pred CCHHHHHHHH T ss_conf 5857999970 No 49 >cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association. Probab=98.15 E-value=0.0003 Score=43.88 Aligned_cols=169 Identities=20% Similarity=0.195 Sum_probs=114.4 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC-CCEEEEECCCCHHHHHHHHHHCC Q ss_conf 49999657864511313531379999998876322100236656734899998268-50561020478689999986013 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) ..++++|+|-.|.. .+-.+.|+++-++|+..+...-....+. ..+|+.+.++ .+..+..+|.|...+...+..+. T Consensus 5 Atmi~iDNSe~~RN-GDy~PtR~~AQ~dAvn~i~~~k~~~NpE---n~VGl~tmag~~~~Vl~TlT~D~gkiL~~lh~i~ 80 (187) T cd01452 5 ATMICIDNSEYMRN-GDYPPTRFQAQADAVNLICQAKTRSNPE---NNVGLMTMAGNSPEVLVTLTNDQGKILSKLHDVQ 80 (187) T ss_pred EEEEEEECCHHHCC-CCCCCCHHHHHHHHHHHHHHHHHHCCCC---CCEEEEEECCCCCEEEEECCCCHHHHHHHCCCCC T ss_conf 89999978566505-8989718999999999999777514953---3113576158986689844865788987532677 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC- Q ss_conf 3687744420367677643222333310257886564279998068778888806899999999978987999992482- Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI- 330 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~- 330 (374) ..| ..+...|++-|.=.|.-. .+.+.++.||++--... . .+......++..+|+++|-|-.|.||.. T Consensus 81 ~~G-~~~~~~~IqiA~LALKHR---------qnk~~~qRIv~FVgSPi-~-~~ek~l~~laKklKKnnV~vDII~FGe~~ 148 (187) T cd01452 81 PKG-KANFITGIQIAQLALKHR---------QNKNQKQRIVAFVGSPI-E-EDEKDLVKLAKRLKKNNVSVDIINFGEID 148 (187) T ss_pred CCC-EECHHHHHHHHHHHHHCC---------CCCCCCEEEEEEECCCC-C-CCHHHHHHHHHHHHHCCCCEEEEEECCCC T ss_conf 187-651887999999997234---------67775447999978987-5-57899999999875558535899946888 Q ss_pred CCHHHHHHH----CCC--CCEEEECCHHHH-HHH Q ss_conf 227889982----389--817983898999-999 Q gi|254780934|r 331 RSHEFLRAC----ASP--NSFYLVENPHSM-YDA 357 (374) Q Consensus 331 ~~~~~L~~~----As~--~~~~~a~~~~~L-~~~ 357 (374) .+.+.|+.. -++ .++-.++.+..| .|+ T Consensus 149 ~n~~kL~~f~~~vn~~~~Shlv~ippg~~lLSd~ 182 (187) T cd01452 149 DNTEKLTAFIDAVNGKDGSHLVSVPPGENLLSDA 182 (187) T ss_pred CCHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHH T ss_conf 9989999999984589982599947998645676 No 50 >pfam05762 VWA_CoxE VWA domain containing CoxE-like protein. This family is annotated by SMART as containing a VWA type domain. The exact function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon is several bacteria. Probab=98.10 E-value=7.7e-05 Score=47.42 Aligned_cols=131 Identities=15% Similarity=0.198 Sum_probs=75.4 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHH Q ss_conf 787249999657864511313531379999998876322100236656734899998268505610204--786899999 Q gi|254780934|r 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK 246 (374) Q Consensus 169 ~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~ 246 (374) ..+..+++++|.||||..+. ++ .-..+..+...+ -++-.+.|+.....+.+.- .+....... T Consensus 55 ~~p~~lVvl~DVSGSM~~ys-----~~--~L~~~~al~~~~---------~rv~~F~F~t~l~~vT~~l~~~d~~~al~~ 118 (223) T pfam05762 55 RKPRRLVLLLDVSGSMADYS-----RI--FLALLHALLAGR---------PRTRLFAFGTRLTDLTRALRERDPAEALLR 118 (223) T ss_pred CCCCCEEEEECCCCCCHHHH-----HH--HHHHHHHHHHCC---------CCCEEEEEECCHHHHHHHHHCCCHHHHHHH T ss_conf 89875899973788749999-----99--999999998546---------861599983648988888712899999999 Q ss_pred HHHC-CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 8601-336877444203676776432223333102578865642799980687788888068999999999789879999 Q gi|254780934|r 247 IKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 (374) Q Consensus 247 i~~l-~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytI 325 (374) +... ...|+||+++.++........ ...-..+-++|++|||.++++ ..........++..+.+|..+ T Consensus 119 ~~~~~~~~~GgT~ig~al~~f~~~~~----------~~~l~~~t~ViilsDg~~~~~--~~~l~~~l~~L~~~~~rviWL 186 (223) T pfam05762 119 VSARVEDWGGGTRIGAALAYFNELWT----------RPALSRGAVVVLVSDGLERGD--SEELLAEVARLVRSARRLVWL 186 (223) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHHCC----------CCCCCCCCEEEEEECCCCCCC--HHHHHHHHHHHHHHCCEEEEE T ss_conf 99860366799749999999998503----------034678867999723010388--318999999999837879998 Q ss_pred EE Q ss_conf 92 Q gi|254780934|r 326 GI 327 (374) Q Consensus 326 g~ 327 (374) -- T Consensus 187 NP 188 (223) T pfam05762 187 NP 188 (223) T ss_pred CC T ss_conf 99 No 51 >pfam06707 DUF1194 Protein of unknown function (DUF1194). This family consists of several hypothetical Rhizobiales specific proteins of around 270 residues in length. The function of this family is unknown. Probab=98.02 E-value=0.00049 Score=42.62 Aligned_cols=180 Identities=14% Similarity=0.164 Sum_probs=106.3 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHH--HHHHHHHH--HHHHHCCCCCCCCCEEEEEEEECCCCE--EEEECC--CCHHH Q ss_conf 72499996578645113135313799--99998876--322100236656734899998268505--610204--78689 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKID--MAIKSINA--MLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLE--WGVSH 242 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~--~~~~a~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~t--~~~~~ 242 (374) .++++|.+|.|+||+.. ..+++ -.-.|+.. ++..+.. .....+.+.++.+++... ...+++ .+.+. T Consensus 3 dlaLvLavDvS~SVD~~----E~~lQr~G~A~Al~dp~V~~Ai~~--g~~g~Iava~~eWsg~~~q~~vv~Wt~I~~~~d 76 (206) T pfam06707 3 DLALVLAVDVSGSVDEE----EYRLQREGYAAALRDPEVLDALLS--GPHGRIAVTYVEWSGPDDQRVVVPWTLIDSAED 76 (206) T ss_pred HHHHHHHHHCCCCCCHH----HHHHHHHHHHHHHCCHHHHHHHHC--CCCCEEEEEEEEECCCCCCEEEECCEEECCHHH T ss_conf 47786773323686999----999999999999759999999962--899718999998027887448869989589999 Q ss_pred HHHH---HHHCC-CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 9999---86013-3687744420367677643222333310257886564279998068778888806899999999978 Q gi|254780934|r 243 LQRK---IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 (374) Q Consensus 243 ~~~~---i~~l~-~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~ 318 (374) .... |...+ ..+..|.++.++..+...+.... ..-.+|+|=+=.||.+|.+..+.. ..-+.+-.. T Consensus 77 a~a~A~~i~~~~r~~~~~Taig~Al~~a~~l~~~~~---------~~~~RrvIDiSGDG~nN~G~~p~~--~ard~~~~~ 145 (206) T pfam06707 77 AEAFAARLAAAPRRAGRRTAIGGALGFAAALLAQNP---------YECLRRVIDVSGDGPNNQGFPPVT--AARDAAVAA 145 (206) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCC---------CCCCEEEEEEECCCCCCCCCCCHH--HHHHHHHHC T ss_conf 999999997588788999769999999999998299---------876179999607998889998137--898767775 Q ss_pred CCEEEEEEECCCC------CHHHHHHHC--CCCCE-EEECCHHHHHHHH-HHHHHHHHE Q ss_conf 9879999924822------278899823--89817-9838989999999-999984200 Q gi|254780934|r 319 GAIVYAIGIRVIR------SHEFLRACA--SPNSF-YLVENPHSMYDAF-SHIGKDIVT 367 (374) Q Consensus 319 gi~IytIg~~~~~------~~~~L~~~A--s~~~~-~~a~~~~~L~~~f-~~I~~~i~~ 367 (374) ||+|..+.++.+. -..+.+.|. ++|.| -.+.+.++..+++ +++..||.. T Consensus 146 GitINgL~I~~~~~~~~~~L~~yy~~~VIgGpgAFV~~a~~~~df~~AirrKL~rEIag 204 (206) T pfam06707 146 GVTINGLAIMGAEAPTSDDLDAYYRDCVIGGPGAFVEPANGFEDFAEAIRRKLVREIAG 204 (206) T ss_pred CEEEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHC T ss_conf 92896677747898762369999973202389844997388799999999999998732 No 52 >COG4655 Predicted membrane protein [Function unknown] Probab=97.96 E-value=5.7e-06 Score=54.16 Aligned_cols=57 Identities=18% Similarity=0.351 Sum_probs=52.1 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 864123686299999999999999999999999999999999999999999988763 Q gi|254780934|r 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 (374) Q Consensus 9 ~~f~~~~~G~vaiifAl~l~~ll~~~g~avD~~~~~~~ks~Lq~A~DaAaLAaa~~~ 65 (374) +.|.|-+|+-+.++.|+.++..++..+++|||++.|..|.+||..+|-|+++++..+ T Consensus 2 ~g~~r~~rs~~gvltal~~~lal~~l~l~VD~G~l~leqR~LQ~~ADlAAiaAAs~~ 58 (565) T COG4655 2 NGWPRRQRSMVGVLTALFVPLALATLLLGVDYGYLYLEQRELQRVADLAAIAAASNL 58 (565) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCEEEEEHHHHHHHHHHHHHHHHHHC T ss_conf 842376766778999999999999886502201244117878887769988877627 No 53 >pfam04285 DUF444 Protein of unknown function (DUF444). Bacterial protein of unknown function. One family member is predicted to contain a von Willebrand factor (vWF) type A domain (Smart:VWA). Probab=97.86 E-value=0.0012 Score=40.40 Aligned_cols=164 Identities=18% Similarity=0.205 Sum_probs=87.7 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCCC Q ss_conf 49999657864511313531379999998876322100236656734899998268505610204786899999860133 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~~ 252 (374) -+.|+.|+||||+.. +-+.+|...--+...+.. .+ ..+-+-.+.....+.-.. -+..= . .. T Consensus 248 Vmfc~MDVSGSM~e~------~K~lAk~FfflLy~FL~r--~Y-~~VEvVFI~H~t~AkEVd-----Ee~FF----~-~~ 308 (421) T pfam04285 248 VVFCLMDVSGSMGES------EKDLAKRFFFLLYLFLTR--KY-ENVEIVFIAHHTEAKEVD-----ETDFF----Y-KQ 308 (421) T ss_pred EEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHHH--CC-CCEEEEEEEECCCEEEEC-----HHHHC----C-CC T ss_conf 999998557687788------999999999999999971--57-854899997138347836-----79932----5-48 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE-ECCCC Q ss_conf 68774442036767764322233331025788656427999806877888880689999999997898799999-24822 Q gi|254780934|r 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-IRVIR 331 (374) Q Consensus 253 ~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg-~~~~~ 331 (374) .+|||-...|+..+.+.+....+. ....-+.+=.|||.|+..........+.+.+-. -+..|.-+ +..+. T Consensus 309 EsGGT~vSSal~l~~~II~~RYpp--------~~WNiY~f~aSDGDNw~~D~~~c~~lL~~~llp-~~~~f~Y~EI~~~~ 379 (421) T pfam04285 309 ESGGTIVSSALELALEIIDERYPP--------AEWNIYAFQASDGDNWTDDSERCVKLLMNKLMP-NAQYYGYVEITQRR 379 (421) T ss_pred CCCCEEEEHHHHHHHHHHHHHCCH--------HHCEEEEEEECCCCCCCCCHHHHHHHHHHHHHH-HHHEEEEEEECCCC T ss_conf 989758727999999999855886--------445046798037766434649999999998988-74158999945887 Q ss_pred CHHH---HHHH-CCCCCE--EEECCHHHHHHHHHHHHHH Q ss_conf 2788---9982-389817--9838989999999999984 Q gi|254780934|r 332 SHEF---LRAC-ASPNSF--YLVENPHSMYDAFSHIGKD 364 (374) Q Consensus 332 ~~~~---L~~~-As~~~~--~~a~~~~~L~~~f~~I~~~ 364 (374) .+.+ ++.. ...++| ..+.+.+++-.+|+++.+. T Consensus 380 ~~~~~~~y~~~~~~~~nf~~~~I~~k~dIypvfr~lf~k 418 (421) T pfam04285 380 SHSTWRKYEAVKGVKDNFAMYTIREKDDVYPVFRTLFQK 418 (421) T ss_pred CCCHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHH T ss_conf 652799999863248975799958888889999999864 No 54 >PRK05325 hypothetical protein; Provisional Probab=97.75 E-value=0.0026 Score=38.31 Aligned_cols=167 Identities=17% Similarity=0.132 Sum_probs=89.7 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCC Q ss_conf 24999965786451131353137999999887632210023665673489999826850561020478689999986013 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) .-|.|+.|+||||+.. +-+.++...--+...+.. .+ ..+-+-.+.....+.-.. -. ..-. .. T Consensus 235 AV~f~lMDvSGSM~~~------~K~lak~ff~lLy~fL~~--~Y-~~vevvFI~H~t~AkEVd-----Ee---~FF~-~~ 296 (414) T PRK05325 235 AVMFCLMDVSGSMDEA------EKDLAKRFFFLLYLFLRR--KY-ENVEVVFIRHHTEAKEVD-----EE---EFFY-SR 296 (414) T ss_pred EEEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHHH--HC-CCEEEEEEEECCCEEECC-----HH---HHCC-CC T ss_conf 6999998556677678------999999999999999985--15-754899997159426747-----89---8315-58 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE-ECCC Q ss_conf 368774442036767764322233331025788656427999806877888880689999999997898799999-2482 Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-IRVI 330 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg-~~~~ 330 (374) .+|||-...|+..+...+....+. ....-+.+=.|||.|+..........+...+-.. +..|+.+ +... T Consensus 297 -esGGT~vSSa~~l~~eII~~rYpp--------~~WNIY~f~aSDGDNw~~D~~~~~~~L~~~llp~-~~~f~Y~Ei~~~ 366 (414) T PRK05325 297 -ESGGTIVSSALKLMLEIIEERYPP--------AEWNIYAFQASDGDNWSDDSPRCVELLVEELLPV-VNYFAYIEITPR 366 (414) T ss_pred -CCCCEEEEHHHHHHHHHHHHHCCH--------HHCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHH-HHEEEEEEEECC T ss_conf -989848508999999999854887--------5652788991377675446699999999988887-536899997179 Q ss_pred ---CCHHHHHH---HC-CCCCE--EEECCHHHHHHHHHHHHHHHH Q ss_conf ---22788998---23-89817--983898999999999998420 Q gi|254780934|r 331 ---RSHEFLRA---CA-SPNSF--YLVENPHSMYDAFSHIGKDIV 366 (374) Q Consensus 331 ---~~~~~L~~---~A-s~~~~--~~a~~~~~L~~~f~~I~~~i~ 366 (374) ....+++. .. ..++| ..+.+.+++-.+|+++.+.-. T Consensus 367 ~~~~~~~l~~~y~~~~~~~~~f~~~~I~~~~dI~p~fr~lf~k~~ 411 (414) T PRK05325 367 AYYRHQTLWREYEKLQDEFDNFAMQHIRDKADIYPVFRELFKKEL 411 (414) T ss_pred CCCCCHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHHHH T ss_conf 888756899999997554888679994888888999999985555 No 55 >COG2304 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only] Probab=97.71 E-value=0.00035 Score=43.49 Aligned_cols=160 Identities=16% Similarity=0.146 Sum_probs=101.2 Q ss_pred CCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHH Q ss_conf 7787249999657864511313531379999998876322100236656734899998268505610204--78689999 Q gi|254780934|r 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQR 245 (374) Q Consensus 168 ~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~ 245 (374) ...+....+++|.||||... . +...+.+...++..+... ..+.++.|........+.+ .++..+.. T Consensus 34 ~~~~~~~~~~~~~~~s~~~~-----~-~~~~~~~~~~~v~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 101 (399) T COG2304 34 LLVPANLTLAIDTSGSMTGA-----L-LELAKSAAIELVNGLNPG------DLLSIVTFAGSADVLIPPTGATNKESITA 101 (399) T ss_pred CCCCCHHHHHHCCCCCCHHH-----H-HHHHHHHHHHHHHCCCCH------HHHEEEECCCCCCEECCCCCCCCHHHHHH T ss_conf 00221002221366410055-----6-776789988877315621------11204611565550125642232277888 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHH-HHHHHHHHHCCCEEEE Q ss_conf 986013368774442036767764322233331025788656427999806877888880689-9999999978987999 Q gi|254780934|r 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS-LYYCNEAKKRGAIVYA 324 (374) Q Consensus 246 ~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~-~~~~~~~k~~gi~Iyt 324 (374) .++.....++.|....++.|+...+.... .......+.+.|||.++.+..+... ....+.....+|.+.+ T Consensus 102 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~tdg~~~~~~~d~~~~~~~~~~~~~~~i~~~~ 172 (399) T COG2304 102 AIDQSLQAGGATAVEASLSLAVELAAKAL---------PRGTLNRILLLTDGENNLGLVDPSRLSALAKLAAGKGIVLDT 172 (399) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHC---------CCCCCCEEEEECCCCHHCCCCCHHHHHHHHCCCCCCCEEEEE T ss_conf 87640264554305778999999876423---------545532333303641202766788999986345567627863 Q ss_pred EEECCCCCHHHHHHHC--CCCCEEEE Q ss_conf 9924822278899823--89817983 Q gi|254780934|r 325 IGIRVIRSHEFLRACA--SPNSFYLV 348 (374) Q Consensus 325 Ig~~~~~~~~~L~~~A--s~~~~~~a 348 (374) +|++.+.+.+.+...+ ..|.+... T Consensus 173 ~g~~~~~n~~~~~~~~~~~~g~l~~~ 198 (399) T COG2304 173 LGLGDDVNEDELTGIAAAANGNLAFI 198 (399) T ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 13552267777776553036641102 No 56 >cd01460 vWA_midasin VWA_Midasin: Midasin is a member of the AAA ATPase family. The proteins of this family are unified by their common archetectural organization that is based upon a conserved ATPase domain. The AAA domain of midasin contains six tandem AAA protomers. The AAA domains in midasin is followed by a D/E rich domain that is following by a VWA domain. The members of this subgroup have a conserved MIDAS motif. The function of this domain is not exactly known although it has been speculated to play a crucial role in midasin function. Probab=97.62 E-value=0.0045 Score=36.89 Aligned_cols=180 Identities=16% Similarity=0.243 Sum_probs=105.6 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHH--HHHHH Q ss_conf 7872499996578645113135313799999988763221002366567348999982685056102047868--99999 Q gi|254780934|r 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRK 246 (374) Q Consensus 169 ~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~--~~~~~ 246 (374) ....-|++.+|.|.||..+-. .+.+-+++.-+...+... +.-.+++..|+..+....++..... .--.. T Consensus 58 KR~YqI~lAiDdSkSM~~~~~-----~~lAlesl~lvs~Als~L----EvG~l~V~~FGe~v~~lh~f~~~f~~~~g~~i 128 (266) T cd01460 58 KRDYQILIAIDDSKSMSENNS-----KKLALESLCLVSKALTLL----EVGQLGVCSFGEDVQILHPFDEQFSSQSGPRI 128 (266) T ss_pred CCCEEEEEEECCCHHHHHHHH-----HHHHHHHHHHHHHHHHHH----CCCCEEEEECCCCEEEEEECCCCCCCCHHHHH T ss_conf 542589999726033101047-----889999999999999872----77656899848872786226887654227999 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 86013368774442036767764322233331025788656427999806877888880689999999997898799999 Q gi|254780934|r 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 (374) Q Consensus 247 i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg 326 (374) ++.++-....|++..-+....+.+..... ..++....+.++++|||....... ........+.+++|.+-.|= T Consensus 129 l~~f~F~q~~T~v~~ll~~~~~~~~~a~~-----~~~~~~~~qL~lIiSDG~~~~~~~--~~r~~vr~a~~~~i~~vfiI 201 (266) T cd01460 129 LNQFTFQQDKTDIANLLKFTAQIFEDART-----QSSSGSLWQLLLIISDGRGEFSEG--AQKVRLREAREQNVFVVFII 201 (266) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHHHC-----CCCCCCHHHEEEEEECCCCCCCCC--HHHHHHHHHHHCCCEEEEEE T ss_conf 98398876776199999999999999752-----668756212799996898633530--68999999997697699999 Q ss_pred ECCCCCHH-H--HHH--------------HCC-CCCEEE-ECCHHHHHHHHHHHHHH Q ss_conf 24822278-8--998--------------238-981798-38989999999999984 Q gi|254780934|r 327 IRVIRSHE-F--LRA--------------CAS-PNSFYL-VENPHSMYDAFSHIGKD 364 (374) Q Consensus 327 ~~~~~~~~-~--L~~--------------~As-~~~~~~-a~~~~~L~~~f~~I~~~ 364 (374) +..+..++ . |+. .-+ |-.||- +.+-++|..++.++-++ T Consensus 202 iD~~~~~~SIldmk~~~f~~~~~~~~~~YLd~FPFpyYvivrdi~~LP~~Lsd~LRQ 258 (266) T cd01460 202 IDNPDNKQSILDIKVVSFKNDKSGVITPYLDEFPFPYYVIVRDLNQLPSVLSDALRQ 258 (266) T ss_pred ECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCEEEEECCHHHHHHHHHHHHHH T ss_conf 708988776331137777179840664724439986489976887808999999999 No 57 >pfam09967 DUF2201 Predicted metallopeptidase (DUF2201). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=97.58 E-value=0.00031 Score=43.85 Aligned_cols=96 Identities=18% Similarity=0.150 Sum_probs=58.7 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCC Q ss_conf 24999965786451131353137999999887632210023665673489999826850561020478689999986013 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) .+|++++|+||||... .++..+.++........ ..+-++.+...+.....+..... .+..+. T Consensus 288 ~~i~v~iDTSGSis~~---------~l~~flsEi~~I~~~~~-----~~i~vi~~D~~V~~~~~~~~~~~----~~~~~~ 349 (412) T pfam09967 288 PRLAVAIDTSGSISDA---------ELARFAAEIAAILRRTR-----AEVHVLACDEKVSSVQKFEPGDS----EISEVE 349 (412) T ss_pred CCEEEEEECCCCCCHH---------HHHHHHHHHHHHHHHCC-----CCEEEEEECCEECCEEEECCCCC----CCCCCC T ss_conf 5579999698998899---------99999999999998489-----97799996888564078634667----644141 Q ss_pred C-CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCC Q ss_conf 3-68774442036767764322233331025788656427999806877888 Q gi|254780934|r 252 K-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 (374) Q Consensus 252 ~-~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~ 302 (374) . .||||+....+.+.-.. ....+|++|||.-+.+ T Consensus 350 ~~GgGGTdf~pvf~~~~~~-----------------~p~~~i~fTDG~g~~p 384 (412) T pfam09967 350 LTGGGGTDFRPVLEAALRL-----------------RPDAAVVLTDLEGWPA 384 (412) T ss_pred CCCCCCCCCHHHHHHHHHC-----------------CCCEEEEEECCCCCCC T ss_conf 3578998784899999826-----------------9976999838998988 No 58 >KOG1226 consensus Probab=97.52 E-value=0.00082 Score=41.31 Aligned_cols=145 Identities=12% Similarity=0.193 Sum_probs=81.3 Q ss_pred CCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC--------------- Q ss_conf 03778724999965786451131353137999999887632210023665673489999826850--------------- Q gi|254780934|r 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--------------- 230 (374) Q Consensus 166 ~~~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------- 230 (374) ...+-|+|++..+|.|.||.... .++..+-..+..-+..+. ..+|+|.-.|-+++ T Consensus 127 ~a~~yPVDLYyLMDlS~SM~DDl----~~l~~LG~~L~~~m~~lT------~nfrlGFGSFVDK~v~P~i~~~pekl~np 196 (783) T KOG1226 127 QAEDYPVDLYYLMDLSYSMKDDL----ENLKSLGTDLAREMRKLT------SNFRLGFGSFVDKTVSPYISTTPEKLRNP 196 (783) T ss_pred ECCCCCEEEEEEEECCHHHHHHH----HHHHHHHHHHHHHHHHHH------CCCCCCCCCHHCCCCCCCCCCCCHHHCCC T ss_conf 03579703799861302456569----999999999999999875------46776653111244354111680775389 Q ss_pred -------------EEEEECCCCHHHHHHHHHHCCCCCCCCCCCCH-HHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEC Q ss_conf -------------56102047868999998601336877444203-6767764322233331025788656427999806 Q gi|254780934|r 231 -------------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG-LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 (374) Q Consensus 231 -------------~~~~~~t~~~~~~~~~i~~l~~~g~~T~~~~~-~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTD 296 (374) ....++|.+...+.+.+..-..+| +-+..+| +..-.+.. .=.....+ ....++.+||.|| T Consensus 197 c~~~~~C~ppfgfkhvLsLT~~~~~F~~~V~~q~ISg-NlDaPEGGfDAimQaa----vC~~~IGW-R~~a~~LLVF~td 270 (783) T KOG1226 197 CPNYKNCAPPFGFKHVLSLTNDAEEFNEEVGKQRISG-NLDAPEGGFDAIMQAA----VCTEKIGW-RNDATRLLVFSTD 270 (783) T ss_pred CCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHCEECC-CCCCCCCHHHHHHHHH----HCCCCCCC-CCCCEEEEEEECC T ss_conf 9874457798554002106887699999875435326-8898982298887664----14655220-1265168999707 Q ss_pred CCCC--------------CC---------------CCCHHHHHHHHHHHHCCC-EEEEEE Q ss_conf 8778--------------88---------------880689999999997898-799999 Q gi|254780934|r 297 GENL--------------ST---------------KEDQQSLYYCNEAKKRGA-IVYAIG 326 (374) Q Consensus 297 G~~~--------------~~---------------~~~~~~~~~~~~~k~~gi-~IytIg 326 (374) ...+ .+ .++.....+...+.+++| .||+|. T Consensus 271 ~~~H~a~DgkLaGiv~pnDG~CHL~~~g~Yt~S~~qdyPSia~l~~kl~~~ni~~IFAVt 330 (783) T KOG1226 271 AGFHFAGDGKLAGIVQPNDGQCHLDKNGEYTQSTTQDYPSIAQLAQKLADNNINTIFAVT 330 (783) T ss_pred CCEEEECCCCEEEEECCCCCCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHCCHHHHHHH T ss_conf 513451155312585378874156877751201377777389999987660453477876 No 59 >COG4867 Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only] Probab=97.39 E-value=0.0056 Score=36.34 Aligned_cols=163 Identities=15% Similarity=0.114 Sum_probs=95.0 Q ss_pred EEECCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHH---HHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCC Q ss_conf 331037787249999657864511313531379999998876---32210023665673489999826850561020478 Q gi|254780934|r 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA---MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 (374) Q Consensus 163 ~~~~~~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~ 239 (374) ...++.+..-.+++.+|.|-||--+. ||--+|+..-. ++.+-.. ..-+.++.|+..+.... T Consensus 455 v~etE~rt~aAvallvDtS~SM~~eG-----Rw~PmKQtALALhHLv~Trfr------GD~l~~i~Fgr~A~~v~----- 518 (652) T COG4867 455 VAETETRTQAAVALLVDTSFSMVMEG-----RWLPMKQTALALHHLVCTRFR------GDALQIIAFGRYARTVT----- 518 (652) T ss_pred EEHHHHCCCCCEEEEEECCHHHHHHC-----CCCCHHHHHHHHHHHHHHCCC------CCEEEEEECCCCCCCCC----- T ss_conf 30433203132454341516777741-----667168899999999983178------86058876043020177----- Q ss_pred HHHHHHHHHHCC-CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCC-------------CC Q ss_conf 689999986013-3687744420367677643222333310257886564279998068778888-------------80 Q gi|254780934|r 240 VSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-------------ED 305 (374) Q Consensus 240 ~~~~~~~i~~l~-~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~-------------~~ 305 (374) .+.+..+. ..-.+||...++..+-+.|. -....+|.|+++|||+++.-- .+ T Consensus 519 ----v~eLt~l~~v~eqgTNlhhaL~LA~r~l~-----------Rh~~~~~~il~vTDGePtAhle~~DG~~~~f~yp~D 583 (652) T COG4867 519 ----AAELTGLAGVYEQGTNLHHALALAGRHLR-----------RHAGAQPVVLVVTDGEPTAHLEDGDGTSVFFDYPPD 583 (652) T ss_pred ----HHHHHCCCCCCCCCCCHHHHHHHHHHHHH-----------HCCCCCCEEEEEECCCCCCCCCCCCCCEEECCCCCC T ss_conf ----99982488767455545889999999987-----------375657628998379863013478985661689987 Q ss_pred H----HHHHHHHHHHHCCCEEEEEEECCCC-CHHHHHHHC--CCCCEEEECCHHHHHHH Q ss_conf 6----8999999999789879999924822-278899823--89817983898999999 Q gi|254780934|r 306 Q----QSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACA--SPNSFYLVENPHSMYDA 357 (374) Q Consensus 306 ~----~~~~~~~~~k~~gi~IytIg~~~~~-~~~~L~~~A--s~~~~~~a~~~~~L~~~ 357 (374) . .+....+++...|+.|-+.-.+.+. -..++.+.| ..|..|.. +.+.|-.+ T Consensus 584 P~t~~~Tvr~~d~~~r~G~q~t~FrLg~DpgL~~Fv~qva~rv~G~vv~p-dldglGaa 641 (652) T COG4867 584 PRTIAHTVRGFDDMARLGAQVTIFRLGSDPGLARFIDQVARRVQGRVVVP-DLDGLGAA 641 (652) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHCCEEEEC-CCCHHHHH T ss_conf 77998989988888751641367752277768999999999858848813-82213589 No 60 >KOG2884 consensus Probab=97.31 E-value=0.011 Score=34.54 Aligned_cols=169 Identities=20% Similarity=0.226 Sum_probs=112.3 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC-CCEEEEECCCCHHHHHHHHHHCC Q ss_conf 49999657864511313531379999998876322100236656734899998268-50561020478689999986013 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) ..++++|+|--|.. .+-.++|+++-++++..+...-....+. ..+|+.+... .+..+..+|.++..+.+.+..+. T Consensus 5 atmi~iDNse~mrN-gDy~PtRf~aQ~daVn~v~~~K~~snpE---ntvGiitla~a~~~vLsT~T~d~gkils~lh~i~ 80 (259) T KOG2884 5 ATMICIDNSEYMRN-GDYLPTRFQAQKDAVNLVCQAKLRSNPE---NTVGIITLANASVQVLSTLTSDRGKILSKLHGIQ 80 (259) T ss_pred EEEEEEECHHHHHC-CCCCHHHHHHHHHHHHHHHHHHHCCCCC---CCEEEEECCCCCCEEEEECCCCCHHHHHHHCCCC T ss_conf 27999847677624-8977188898899999998755027954---3154686368985044303430048987732778 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC-EEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 3687744420367677643222333310257886564-279998068778888806899999999978987999992482 Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~-k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~ 330 (374) +.| +-+...++.-+.-.|.-. .+++-+ ++++|+ |-+-.... ......+.++|+.+|-|-.|-||.. T Consensus 81 ~~g-~~~~~~~i~iA~lalkhR---------qnk~~~~riVvFv--GSpi~e~e-keLv~~akrlkk~~Vaidii~FGE~ 147 (259) T KOG2884 81 PHG-KANFMTGIQIAQLALKHR---------QNKNQKQRIVVFV--GSPIEESE-KELVKLAKRLKKNKVAIDIINFGEA 147 (259) T ss_pred CCC-CCCHHHHHHHHHHHHHHH---------CCCCCCEEEEEEE--CCCCHHHH-HHHHHHHHHHHHCCEEEEEEEECCC T ss_conf 577-612888899999998710---------3888636999993--68322338-9999999998754802789872434 Q ss_pred CCH-HHHHH-H-C--C---CCCEEEECCHHHHHHHH Q ss_conf 227-88998-2-3--8---98179838989999999 Q gi|254780934|r 331 RSH-EFLRA-C-A--S---PNSFYLVENPHSMYDAF 358 (374) Q Consensus 331 ~~~-~~L~~-~-A--s---~~~~~~a~~~~~L~~~f 358 (374) .+. +.|.. + + + +.|...++.+.-|.++. T Consensus 148 ~~~~e~l~~fida~N~~~~gshlv~Vppg~~L~d~l 183 (259) T KOG2884 148 ENNTEKLFEFIDALNGKGDGSHLVSVPPGPLLSDAL 183 (259) T ss_pred CCCHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHH T ss_conf 333788999999853898874489858984077776 No 61 >COG4547 CobT Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) [Coenzyme metabolism] Probab=97.23 E-value=0.0012 Score=40.33 Aligned_cols=81 Identities=17% Similarity=0.259 Sum_probs=48.0 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCC--------CCCCHH-HHHHHHHHH-HCCCEEEEEEECC Q ss_conf 203676776432223333102578865642799980687788--------888068-999999999-7898799999248 Q gi|254780934|r 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS--------TKEDQQ-SLYYCNEAK-KRGAIVYAIGIRV 329 (374) Q Consensus 260 ~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~--------~~~~~~-~~~~~~~~k-~~gi~IytIg~~~ 329 (374) ++++.|+-+.|. +.+..+|++.+|+||.+-. ++.... ..+..+.+- ...|.+.+||+|- T Consensus 520 GEal~wah~rl~-----------gRpEqrkIlmmiSDGAPvddstlsvnpGnylerHLRaVieeIEtrSpveLlAIGigh 588 (620) T COG4547 520 GEALMWAHQRLI-----------GRPEQRKILMMISDGAPVDDSTLSVNPGNYLERHLRAVIEEIETRSPVELLAIGIGH 588 (620) T ss_pred HHHHHHHHHHHH-----------CCHHHCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHEEEECCC T ss_conf 199999999873-----------584241378883489855554345588607999999999997037840330331255 Q ss_pred CCCHHHHHHHCCCCCEEEECCHHHHHHHH Q ss_conf 22278899823898179838989999999 Q gi|254780934|r 330 IRSHEFLRACASPNSFYLVENPHSMYDAF 358 (374) Q Consensus 330 ~~~~~~L~~~As~~~~~~a~~~~~L~~~f 358 (374) +.-.-+-|..+ ..+.++|..++ T Consensus 589 DvtRyYrravt-------iVdaeeL~gam 610 (620) T COG4547 589 DVTRYYRRAVT-------IVDAEELAGAM 610 (620) T ss_pred CCCHHHHHHEE-------EECHHHHCHHH T ss_conf 53066662013-------74288856589 No 62 >COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair] Probab=96.83 E-value=0.032 Score=31.79 Aligned_cols=175 Identities=11% Similarity=0.115 Sum_probs=106.8 Q ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC-CCEEEEECCCCHHHHHHHHH Q ss_conf 87249999657864511313531379999998876322100236656734899998268-50561020478689999986 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGVSHLQRKIK 248 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~~~i~ 248 (374) .-..+.++||.|.+|...+-- .+|-+...+.+..|+..+....|.. .++++...+ -+........|.......+. T Consensus 86 IiRhl~l~lD~Seam~e~Df~-p~r~a~vikya~~Fv~eFf~qNPiS---qlsii~irdg~a~~~s~~~gnpq~hi~~lk 161 (421) T COG5151 86 IIRHLHLILDVSEAMDESDFL-PTRRANVIKYAEGFVPEFFSQNPIS---QLSIISIRDGCAKYTSSMDGNPQAHIGQLK 161 (421) T ss_pred HHHEEEEEEEHHHHHHHHHCC-CHHHHHHHHHHHHHHHHHHCCCCCH---HEEEEEHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 021057888735444332036-0588889999987768875259700---324433354688876534799899998750 Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 01336877444203676776432223333102578865642799980687788888068999999999789879999924 Q gi|254780934|r 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 249 ~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~ 328 (374) +++.-.|+-....|+..+.-.|... ..-..+.++|++--=- ..++....+..+.+...+|+|..||+- T Consensus 162 S~rd~~gnfSLqNaLEmar~~l~~~---------~~H~trEvLiifgS~s---t~DPgdi~~tid~Lv~~~IrV~~igL~ 229 (421) T COG5151 162 SKRDCSGNFSLQNALEMARIELMKN---------TMHGTREVLIIFGSTS---TRDPGDIAETIDKLVAYNIRVHFIGLC 229 (421) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCC---------CCCCCEEEEEEEEECC---CCCCCCHHHHHHHHHHHCEEEEEEEEH T ss_conf 1004688853776887766540345---------4556227999984215---589741899999987505279997501 Q ss_pred CCCCHHHHHH-H-CC----CCCEEEECCHHHHHHHHHHHH Q ss_conf 8222788998-2-38----981798389899999999999 Q gi|254780934|r 329 VIRSHEFLRA-C-AS----PNSFYLVENPHSMYDAFSHIG 362 (374) Q Consensus 329 ~~~~~~~L~~-~-As----~~~~~~a~~~~~L~~~f~~I~ 362 (374) ... ...|. | |+ .+.||..-+..-|.+.|.+.. T Consensus 230 aev--aicKeickaTn~~~e~~y~v~vde~Hl~el~~E~~ 267 (421) T COG5151 230 AEV--AICKEICKATNSSTEGRYYVPVDEGHLSELMRELS 267 (421) T ss_pred HHH--HHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHCC T ss_conf 589--99999986147676750676604788999998627 No 63 >cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif. Probab=96.71 E-value=0.04 Score=31.25 Aligned_cols=155 Identities=15% Similarity=0.168 Sum_probs=82.2 Q ss_pred EEEEEECCCCCCCCCCCC-CHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCE----------EEEECCC-CH Q ss_conf 499996578645113135-31379999998876322100236656734899998268505----------6102047-86 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE----------EFFLLEW-GV 240 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~-~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~t~-~~ 240 (374) .++|++|.|-||-....+ ....++.+.+.+..++...- -......+|++-|+..-. ...++.. +. T Consensus 3 ~ivflID~s~sM~~~~~~~~~s~~~~al~~i~~~~~~ki---is~~~d~vGvv~~~T~~~~n~~~~~~i~vl~~l~~~~a 79 (218) T cd01458 3 SVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKI---ISSPKDLVGVVFYGTEESKNPVGYENIYVLLDLDTPGA 79 (218) T ss_pred EEEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHE---ECCCCCEEEEEEECCCCCCCCCCCCEEEEEECCCCCCH T ss_conf 799999799778477678888839999999999998650---67899869999976788888789872699633887677 Q ss_pred HHHHHHHHHCCCC-----------CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCC---H Q ss_conf 8999998601336-----------8774442036767764322233331025788656427999806877888880---6 Q gi|254780934|r 241 SHLQRKIKYLSKF-----------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED---Q 306 (374) Q Consensus 241 ~~~~~~i~~l~~~-----------g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~---~ 306 (374) ..++. +..+... .....+..++..+.+++... ......|.|+|+||..+=.+.+. . T Consensus 80 ~~i~~-l~~~~~~~~~~~~~~~~~~~~~~l~~aL~~~~~~f~~~---------~~~~~~krI~lfTdnD~P~~~~~~~~~ 149 (218) T cd01458 80 ERVED-LKELIEPGGLSFAGQVGDSGQVSLSDALWVCLDLFSKG---------KKKKSHKRIFLFTNNDDPHGGDSIKDS 149 (218) T ss_pred HHHHH-HHHHHHCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHC---------CCCCCCCEEEEECCCCCCCCCCHHHHH T ss_conf 99999-99986010235566448888867999999999999855---------534577779998689989998879999 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCCC----HHHHHHHC Q ss_conf 89999999997898799999248222----78899823 Q gi|254780934|r 307 QSLYYCNEAKKRGAIVYAIGIRVIRS----HEFLRACA 340 (374) Q Consensus 307 ~~~~~~~~~k~~gi~IytIg~~~~~~----~~~L~~~A 340 (374) .....+..+++.||.|-.+.+..+.. ..+.+.+- T Consensus 150 ~a~~~a~DL~d~gI~iel~~l~~~~~~Fd~s~FY~dii 187 (218) T cd01458 150 QAAVKAEDLKDKGIELELFPLSSPGKKFDVSKFYKDII 187 (218) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCCCCCHHHHHHHH T ss_conf 99999988987796899984489988688067788752 No 64 >COG2718 Uncharacterized conserved protein [Function unknown] Probab=96.68 E-value=0.042 Score=31.13 Aligned_cols=165 Identities=18% Similarity=0.183 Sum_probs=86.9 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCC Q ss_conf 24999965786451131353137999999887632210023665673489999826850561020478689999986013 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) .-++|+.|+||||+.. +-+.++....-+.-.+... -..+-+-.+.....+.... ....- . . T Consensus 247 AVmfclMDvSGSM~~~------~KdlAkrFF~lL~~FL~~k---YenveivfIrHht~A~EVd-----E~dFF----~-~ 307 (423) T COG2718 247 AVMFCLMDVSGSMDQS------EKDLAKRFFFLLYLFLRRK---YENVEIVFIRHHTEAKEVD-----ETDFF----Y-S 307 (423) T ss_pred EEEEEEEECCCCCCHH------HHHHHHHHHHHHHHHHHCC---CCEEEEEEEEECCCCEECC-----HHHCE----E-E T ss_conf 5899977457774467------8999999999999998444---0025899996037431423-----22402----3-2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE-ECCC Q ss_conf 368774442036767764322233331025788656427999806877888880689999999997898799999-2482 Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-IRVI 330 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg-~~~~ 330 (374) ..+|||-...++..+.+.+....+. .....+..-.+||+|+...+..+...+.+.+-. .+.-|+-+ +... T Consensus 308 ~esGGTivSSAl~~m~evi~ErYp~--------aeWNIY~fqaSDGDN~~dDserc~~ll~~~im~-~~~~y~Y~Eitq~ 378 (423) T COG2718 308 QESGGTIVSSALKLMLEVIKERYPP--------AEWNIYAFQASDGDNWADDSERCVELLAKKLMP-VVQYYGYIEITQR 378 (423) T ss_pred CCCCCEEEHHHHHHHHHHHHHHCCH--------HHEEEEEEEECCCCCCCCCCHHHHHHHHHHHHH-HHHHEEEEEEEEC T ss_conf 4789768688999999999703885--------350255454057766568778899999999998-6334388742002 Q ss_pred CCHHHHH--HHC-CCC--CEEEECCHHHHHHHHHHHHHH Q ss_conf 2278899--823-898--179838989999999999984 Q gi|254780934|r 331 RSHEFLR--ACA-SPN--SFYLVENPHSMYDAFSHIGKD 364 (374) Q Consensus 331 ~~~~~L~--~~A-s~~--~~~~a~~~~~L~~~f~~I~~~ 364 (374) .....|+ ..- .-+ .++...+++++-.+|+++... T Consensus 379 ~~H~t~~y~~~~~~~dnFa~~~I~~~~Diypvfr~lf~k 417 (423) T COG2718 379 RTHQTLEYEALQGVFDNFAMQTIREPDDIYPVFRELFSK 417 (423) T ss_pred CCCHHHHHHHHHCCCCCHHEEEECCHHHHHHHHHHHHHC T ss_conf 431124334443357531303406778779999999841 No 65 >TIGR01651 CobT cobaltochelatase, CobT subunit; InterPro: IPR006538 These proteins are CobT subunits of the aerobic cobalt chelatase (aerobic cobalamin biosynthesis pathway). Pseudomonas denitrificans CobT has been experimentally characterised , . Aerobic cobalt chelatase consists of three subunits, CobT, CobN (IPR003672 from INTERPRO) and CobS (IPR006537 from INTERPRO). Cobalamin (vitamin B12) can be complexed with metal via the ATP-dependent reactions (aerobic pathway) (e.g., in Pseudomonas denitrificans) or via ATP-independent reactions (anaerobic pathway) (e.g., in Salmonella typhimurium) , . The corresponding cobalt chelatases are not homologous. However, aerobic cobalt chelatase subunits CobN and CobS are homologous to Mg-chelatase subunits BchH and BchI, respectively . CobT, too, has been found to be remotely related to the third subunit of Mg-chelatase, BchD (involved in bacteriochlorophyll synthesis, e.g., in Rhodobacter capsulatus) . Nomenclature note: CobT of the aerobic pathway Pseudomonas denitrificans is not a homolog of CobT of the anaerobic pathway (Salmonella typhimurium, Escherichia coli). Therefore, annotation of any members of this family as nicotinate-mononucleotide--5,6-dimethylbenzimidazole phosphoribosyltransferases is erroneous. . Probab=96.68 E-value=0.0054 Score=36.43 Aligned_cols=83 Identities=16% Similarity=0.262 Sum_probs=47.5 Q ss_pred CC-CCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCC--------CCCCCH-HHHHHHHHH-HHCCCEEEEEE Q ss_conf 44-20367677643222333310257886564279998068778--------888806-899999999-97898799999 Q gi|254780934|r 258 NS-TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--------STKEDQ-QSLYYCNEA-KKRGAIVYAIG 326 (374) Q Consensus 258 ~~-~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~--------~~~~~~-~~~~~~~~~-k~~gi~IytIg 326 (374) |+ ++++.||.+-|... +..+|++.++|||.|- .|+.-. ...++.+.+ -...|++-+|| T Consensus 502 NIDGEAL~WAH~RliAR-----------~EQRrILM~ISDGAPVDDSTLSVN~G~YLERHLR~VI~~IEtrSPVELlAIG 570 (606) T TIGR01651 502 NIDGEALLWAHERLIAR-----------PEQRRILMMISDGAPVDDSTLSVNPGNYLERHLRAVIEEIETRSPVELLAIG 570 (606) T ss_pred CCCHHHHHHHHHHHHCC-----------HHHCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEC T ss_conf 56467999886664147-----------2047587776278886645235478506789999999862377870002323 Q ss_pred ECCCCCHHHHHHHCCCCCEEEECCHHHHHHHH Q ss_conf 24822278899823898179838989999999 Q gi|254780934|r 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAF 358 (374) Q Consensus 327 ~~~~~~~~~L~~~As~~~~~~a~~~~~L~~~f 358 (374) +|-+.-. +.+...+ .-+.++|.-++ T Consensus 571 IGHDVTR-YYRRAVT------IVD~e~LaG~~ 595 (606) T TIGR01651 571 IGHDVTR-YYRRAVT------IVDAEELAGAL 595 (606) T ss_pred CCCCCCC-EECCEEE------EEEHHHHHHHH T ss_conf 4434220-0000143------76144530189 No 66 >pfam07811 TadE TadE-like protein. The members of this family are similar to a region of the protein product of the bacterial tadE locus. In various bacterial species, the tad locus is closely linked to flp-like genes, which encode proteins required for the production of pili involved in adherence to surfaces. It is thought that the tad loci encode proteins that act to assemble or export an Flp pilus in various bacteria. All tad loci but TadA have putative transmembrane regions, and in fact the region in question is this family has a high proportion of hydrophobic amino acid residues. Probab=96.58 E-value=0.008 Score=35.39 Aligned_cols=42 Identities=26% Similarity=0.551 Sum_probs=39.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 629999999999999999999999999999999999999999 Q gi|254780934|r 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL 58 (374) Q Consensus 17 G~vaiifAl~l~~ll~~~g~avD~~~~~~~ks~Lq~A~DaAa 58 (374) |..+|=|++++|+++.+....+|++++...+..++.|+..++ T Consensus 1 G~a~VEfalv~p~~l~l~~~~~~~~~~~~~~~~~~~Aa~~aa 42 (43) T pfam07811 1 GAAAVEFALVLPVLLLLLFGIVELGRLFYARQVLQNAAREAA 42 (43) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 916999999999999999999999999999999999998674 No 67 >LOAD_ku consensus Probab=96.05 E-value=0.097 Score=28.95 Aligned_cols=144 Identities=15% Similarity=0.214 Sum_probs=69.5 Q ss_pred EEEEEECCCCCCCCCCCCC-HHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE----------EEECCC-CH Q ss_conf 4999965786451131353-13799999988763221002366567348999982685056----------102047-86 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE----------FFLLEW-GV 240 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~-~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~t~-~~ 240 (374) .++|++|.|-||....++. ...++.+.+.+..++...--. .....+|++-|++.-+. ..++.. +. T Consensus 1 AivflID~s~sM~~~~~~~~~s~l~~al~~v~~~~~~ki~~---~~~D~vGvvl~gT~~t~n~~~~~~i~~l~~l~~p~~ 77 (521) T LOAD_ku 1 AILFCIDVSPAMFESSDGEELSPFEQALKCIRTLMQRKVIS---RPKDLIGVVLYGTDKSDNSEGFKNQYVLQDIDVPGA 77 (521) T ss_pred CEEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHEEC---CCCCEEEEEEECCCCCCCCCCCCCEEEEEECCCCCH T ss_conf 98999979988878789988886999999999999864458---999869999982799887568774488551585107 Q ss_pred HHHHHHHHHCC-------------CCCC--CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCC Q ss_conf 89999986013-------------3687--74442036767764322233331025788656427999806877888880 Q gi|254780934|r 241 SHLQRKIKYLS-------------KFGV--STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 (374) Q Consensus 241 ~~~~~~i~~l~-------------~~g~--~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~ 305 (374) ..+.. +..+. ..+. ..+...++..+.+++.... .....|.|+|+||+.+-...+. T Consensus 78 ~~i~~-l~~l~~~~~~~~~~~~~~~~~~~~~~~l~~al~~~~~~~~~~~---------~k~~~krI~l~Tn~d~P~~~~~ 147 (521) T LOAD_ku 78 ERLLK-LFRLKQFDDLEGKEFFQQPGGHGADFSLSDVLWVCLDLFQTVG---------KKLSHKRIFLFTDNDNPHDNDS 147 (521) T ss_pred HHHHH-HHHHHHCCCCCCHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHH---------HCCCCCEEEEEECCCCCCCCCH T ss_conf 89999-9866530210002321367887656558999999999998742---------2126865999947998888746 Q ss_pred HHH---HHHHHHHHHCCCEEEEEEECC Q ss_conf 689---999999997898799999248 Q gi|254780934|r 306 QQS---LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 306 ~~~---~~~~~~~k~~gi~IytIg~~~ 329 (374) ... ...+..+++.||.+-...++. T Consensus 148 ~~~~~~~~~~~dl~~~gi~~~~~~~~~ 174 (521) T LOAD_ku 148 DLRDIALTKAKDLRDLGIFLDLMHLKK 174 (521) T ss_pred HHHHHHHHHHHHHHHCCEEECCCCCCC T ss_conf 789999987436765481110455586 No 68 >TIGR02877 spore_yhbH sporulation protein YhbH; InterPro: IPR014230 Proteins in this entry, typified by YhbH from Bacillus subtilis, are found in the genomes of nearly every endospore-forming bacterium, and in no other genomes. The gene in Bacillus subtilis was shown to be a member of the sigma-E regulon, with mutation leading to a sporulation defect .. Probab=95.90 E-value=0.077 Score=29.55 Aligned_cols=163 Identities=15% Similarity=0.188 Sum_probs=85.8 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHC Q ss_conf 72499996578645113135313799999988763221002366567348999982685056102047868999998601 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l 250 (374) .--|+++.|.||||+.. +--.++...--++..+... -..|-+..++..+.+.-...- ..-..- T Consensus 214 ~AVvi~mMDtSGSMg~~------kKYiARSfFFw~~kFlr~K---Y~~VeI~FisH~TeAkEV~Ee--------~FF~kg 276 (392) T TIGR02877 214 KAVVIAMMDTSGSMGEF------KKYIARSFFFWMVKFLRTK---YENVEIVFISHHTEAKEVTEE--------EFFTKG 276 (392) T ss_pred CEEEEEEECCCCCCCCC------HHHHHHHHHHHHHHHHHHE---EEEEEEEEEEECCCCEEECHH--------HCCCCC T ss_conf 73777764478898873------1678888999999886320---014789997104521240166--------601325 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHH-HHCCC----EE--- Q ss_conf 33687744420367677643222333310257886564279998068778888806899999999-97898----79--- Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGA----IV--- 322 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~-k~~gi----~I--- 322 (374) ..|||..-.+...|.+.+.... +-....-+-+-+|||.|....+....+++...+ +-.+. +| T Consensus 277 --ESGGT~~SS~Y~~ALeiI~~RY--------nP~~yNiY~FHfSDGDNl~~Dn~Rlav~l~~~L~~~cNL~GYgEIEtq 346 (392) T TIGR02877 277 --ESGGTRCSSAYKLALEIIDERY--------NPARYNIYAFHFSDGDNLSSDNERLAVKLVRKLLEVCNLFGYGEIETQ 346 (392) T ss_pred --CCCCCCHHHHHHHHHHHHHCCC--------CCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf --6677430167889999974278--------831006565355337788988646899999999887611110566056 Q ss_pred ---EEEEECCCC--CHHHHHHHCCCCCE--EEECCHHHHHHHHHHH Q ss_conf ---999924822--27889982389817--9838989999999999 Q gi|254780934|r 323 ---YAIGIRVIR--SHEFLRACASPNSF--YLVENPHSMYDAFSHI 361 (374) Q Consensus 323 ---ytIg~~~~~--~~~~L~~~As~~~~--~~a~~~~~L~~~f~~I 361 (374) .+..++... -..+.+.+- .++| +...+-.+|-.|++++ T Consensus 347 Pqyls~~Y~y~~tL~~~f~~ei~-~~~Fv~~~I~~K~d~y~ALk~~ 391 (392) T TIGR02877 347 PQYLSMPYGYSSTLKSKFKKEIK-DPNFVLLIIKDKEDVYPALKKF 391 (392) T ss_pred CCEECCCCCCCHHHHHHHHHHHC-CCCCEEEEECCHHHHHHHHHHH T ss_conf 51103788665577888887405-8883587650414689999983 No 69 >pfam11443 DUF2828 Domain of unknown function (DUF2828). This is a uncharacterized domain found in eukaryotes and viruses. Probab=95.69 E-value=0.14 Score=28.02 Aligned_cols=139 Identities=19% Similarity=0.179 Sum_probs=75.4 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCC Q ss_conf 24999965786451131353137999999887632210023665673489999826850561020478689999986013 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) -++..|.|+||||.+... ...-++.+. ++.-++.+... + -.+-.+++|+..+....-...+...-...+...+ T Consensus 327 ~n~iav~DvSGSM~g~~~-~~~p~~vai-~Lgl~ise~~~-~----~fk~~~iTFs~~P~~~~l~g~~l~ekv~~~~~~~ 399 (524) T pfam11443 327 TNCIAVCDVSGSMSGPVF-SITPMDVCI-ALGLLVSELSE-G----PFKGKVITFSSNPQLHHIKGDSLREKVSFVRRMP 399 (524) T ss_pred CCEEEEEECCCCCCCCCC-CCCHHHHHH-HHHHHHHHHCC-C----CCCCCEEEECCCCEEEECCCCCHHHHHHHHHHCC T ss_conf 544899956877778888-887499999-99999998535-0----0058189844997589707988999999999586 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCC-----CCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 368774442036767764322233331025788656427999806877888-----88068999999999789879999 Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST-----KEDQQSLYYCNEAKKRGAIVYAI 325 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~-----~~~~~~~~~~~~~k~~gi~IytI 325 (374) -|++||....+. .+.+.... + .-......|.|+++||=+.+.. .+.+....++..-++.|.++=-| T Consensus 400 -wg~nTnf~~vf~----lIL~~av~--~-~l~~eempk~l~VfSDMqFD~a~~~~~~~~t~~e~i~~~f~~aGY~~P~I 470 (524) T pfam11443 400 -WGMSTNFQKVFD----LILETAVE--N-KLPQEDMPKRLFVFSDMEFDQASTGTSGWETDYEAIQRKFKEAGYEVPEL 470 (524) T ss_pred -CCCCCHHHHHHH----HHHHHHHH--C-CCCHHHCCCEEEEEECCCHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCEE T ss_conf -763533999999----99999998--6-99978867738998454231203799876238999999999839999838 No 70 >COG3552 CoxE Protein containing von Willebrand factor type A (vWA) domain [General function prediction only] Probab=95.20 E-value=0.035 Score=31.60 Aligned_cols=127 Identities=10% Similarity=0.142 Sum_probs=62.6 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEC--CCCHHHHHHHHH Q ss_conf 724999965786451131353137999999887632210023665673489999826850561020--478689999986 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIK 248 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--t~~~~~~~~~i~ 248 (374) +-.+++.+|+||||.++..- +-.+..|+.+. ..|+-...|+...+-.... ..|.+.....+. T Consensus 218 ~~~lvvL~DVSGSm~~ys~~----~L~l~hAl~q~------------~~R~~~F~F~TRLt~vT~~l~~rD~~~Al~~~~ 281 (395) T COG3552 218 KPPLVVLCDVSGSMSGYSRI----FLHLLHALRQQ------------RSRVHVFLFGTRLTRVTHMLRERDLEDALRRLS 281 (395) T ss_pred CCCEEEEEECCCCHHHHHHH----HHHHHHHHHHC------------CCCEEEEEEECHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99859998346634566999----99999999852------------266059973015877788762489999999998 Q ss_pred H-CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 0-1336877444203676776432223333102578865642799980687788888068999999999789879999 Q gi|254780934|r 249 Y-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 (374) Q Consensus 249 ~-l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytI 325 (374) . .....+||.++..+..-... +.. ..-...-+|+++|||-++... .........+...-=.+... T Consensus 282 a~v~dw~ggTrig~tl~aF~~~----~~~------~~L~~gA~VlilsDg~drd~~--~~l~~~~~rl~rrarrlvwL 347 (395) T COG3552 282 AQVKDWDGGTRIGNTLAAFLRR----WHG------NVLSGGAVVLILSDGLDRDDI--PELVTAMARLRRRARRLVWL 347 (395) T ss_pred HHCCCCCCCCCHHHHHHHHHCC----CCC------CCCCCCEEEEEEECCCCCCCC--HHHHHHHHHHHHHHCCEEEC T ss_conf 6411335776234899999715----444------455786179997054224781--57999999999861403643 No 71 >cd01459 vWA_copine_like VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding. Probab=95.15 E-value=0.21 Score=26.92 Aligned_cols=151 Identities=15% Similarity=0.151 Sum_probs=82.6 Q ss_pred CCCEEEEEEECCCCCCCCCC---------CCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEE----EE Q ss_conf 78724999965786451131---------353137999999887632210023665673489999826850561----02 Q gi|254780934|r 169 DARLDMMIVLDVSRSMESFF---------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF----FL 235 (374) Q Consensus 169 ~~~~di~~VlD~SgSM~~~~---------~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 235 (374) ...+++++.+|.|+|.++.. ++...-.+.+..++..++...+. .-++.++.|+.+.... .. T Consensus 29 G~einlivaIDFT~SNg~p~~~~SLHy~~~~~~N~Y~~aI~~vg~iL~~YD~------Dk~~p~yGFGa~~~~~~~~~~~ 102 (254) T cd01459 29 GLESNLIVAIDFTKSNGWPGEKRSLHYISPGRLNPYQKAIRIVGEVLQPYDS------DKLIPAFGFGAIVTKDQSVFSF 102 (254) T ss_pred CCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC------CCCEEEECCCCCCCCCCEEECC T ss_conf 9688589999707868998999885458999989999999999999863277------8811301123113999757357 Q ss_pred C--------CCCHH----HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCC Q ss_conf 0--------47868----99999860133687744420367677643222333310257886564279998068778888 Q gi|254780934|r 236 L--------EWGVS----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 (374) Q Consensus 236 ~--------t~~~~----~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~ 303 (374) + -.+.+ ..+..+..+.-.| -|+..+-+..+.+.-.... ....--+++++|||+-+. T Consensus 103 ~~~~~~~p~~~G~~gvl~aY~~~l~~v~lsG-PT~FapiI~~a~~~a~~~~---------~~~~Y~ILlIiTDG~i~D-- 170 (254) T cd01459 103 FPGYSESPECQGFEGVLRAYREALPNVSLSG-PTNFAPVIRAAANIAKASN---------SQSKYHILLIITDGEITD-- 170 (254) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCCEECC-CCCHHHHHHHHHHHHHHHC---------CCCEEEEEEEECCCCCCC-- T ss_conf 7799999965599999999998607437648-8605999999999999732---------487089999980796367-- Q ss_pred CCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHH Q ss_conf 806899999999978987999992482227889982 Q gi|254780934|r 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 (374) Q Consensus 304 ~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~ 339 (374) -..+..+.-++-+..+.|-.||+|. .+=+.|+.. T Consensus 171 -~~~Ti~aIv~AS~~PlSIIiVGVGd-~dF~~M~~l 204 (254) T cd01459 171 -MNETIKAIVEASKYPLSIVIVGVGD-GPFDAMERL 204 (254) T ss_pred -HHHHHHHHHHHCCCCEEEEEEEECC-CCHHHHHHH T ss_conf -8999999999717981799997368-882778873 No 72 >KOG1327 consensus Probab=94.88 E-value=0.25 Score=26.47 Aligned_cols=151 Identities=16% Similarity=0.225 Sum_probs=82.0 Q ss_pred CCEEEEEEECCCCCCCCCCC---------CCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE------EE Q ss_conf 87249999657864511313---------5313799999988763221002366567348999982685056------10 Q gi|254780934|r 170 ARLDMMIVLDVSRSMESFFD---------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE------FF 234 (374) Q Consensus 170 ~~~di~~VlD~SgSM~~~~~---------~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 234 (374) ..+++.+-+|.+.|..+... ......+.+..++...+...+ .+-++..+.|+.+..+ .+ T Consensus 284 ~~lnf~vgIDfTaSNg~p~~~sSLHyi~p~~~N~Y~~Ai~~vG~~lq~yd------sdk~fpa~GFGakip~~~~vs~~f 357 (529) T KOG1327 284 EQLNFTVGIDFTASNGDPRNPSSLHYIDPHQPNPYEQAIRSVGETLQDYD------SDKLFPAFGFGAKIPPDGQVSHEF 357 (529) T ss_pred CEEEEEEEEEEECCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHCCCC------CCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 56616999997124789999975245278899889999999713410558------777642345466579986524202 Q ss_pred ECCC--------CH----HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCC Q ss_conf 2047--------86----89999986013368774442036767764322233331025788656427999806877888 Q gi|254780934|r 235 LLEW--------GV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 (374) Q Consensus 235 ~~t~--------~~----~~~~~~i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~ 302 (374) .+.. +. +..+..+-.+...| .|+..+-+..+.+.-..... ....--+++++|||+-+. T Consensus 358 ~ln~~~~~~~c~Gi~gVl~aY~~~lp~v~l~G-PTnFaPII~~va~~a~~~~~--------~~~qY~VLlIitDG~vTd- 427 (529) T KOG1327 358 VLNFNPEDPECRGIEGVLEAYRKALPNVQLYG-PTNFSPIINHVARIAQQSGN--------TAGQYHVLLIITDGVVTD- 427 (529) T ss_pred EECCCCCCCCCCCHHHHHHHHHHHCCCCCCCC-CCCCHHHHHHHHHHHHHHCC--------CCCCEEEEEEEECCCCCC- T ss_conf 20378899854347889999986456642058-87618999999999997256--------786249999993782344- Q ss_pred CCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHH Q ss_conf 8806899999999978987999992482227889982 Q gi|254780934|r 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 (374) Q Consensus 303 ~~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~ 339 (374) -..+..+.-.+-+....|--||+| +.+-+.|+.. T Consensus 428 --m~~T~~AIV~AS~lPlSIIiVGVG-d~df~~M~~l 461 (529) T KOG1327 428 --MKETRDAIVSASDLPLSIIIVGVG-DADFDMMREL 461 (529) T ss_pred --HHHHHHHHHHHCCCCEEEEEEEEC-CCCHHHHHHH T ss_conf --889999998631598079999737-9787899975 No 73 >COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones] Probab=94.74 E-value=0.28 Score=26.25 Aligned_cols=146 Identities=10% Similarity=0.092 Sum_probs=96.3 Q ss_pred EEEEEECCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC-CCEEEEECCCCHHHHHHHHHHC Q ss_conf 499996578645-11313531379999998876322100236656734899998268-5056102047868999998601 Q gi|254780934|r 173 DMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGVSHLQRKIKYL 250 (374) Q Consensus 173 di~~VlD~SgSM-~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~~~i~~l 250 (374) .+++++|+|--| +..| -..|.++-|+++..++..-....+. ..+|+++... .+..+.-+|..+.++...+..+ T Consensus 5 atvvliDNse~s~NgDy--~ptRFeAQkd~ve~if~~K~ndnpE---ntiGli~~~~a~p~vlsT~T~~~gkilt~lhd~ 79 (243) T COG5148 5 ATVVLIDNSEASQNGDY--LPTRFEAQKDAVESIFSKKFNDNPE---NTIGLIPLVQAQPNVLSTPTKQRGKILTFLHDI 79 (243) T ss_pred EEEEEEECHHHHHCCCC--CCHHHHHHHHHHHHHHHHHHCCCCC---CEEEEEECCCCCCCHHCCCHHHHHHHHHHHCCC T ss_conf 28999847066524897--7078888788999999877238963---315445635688512113065412888772365 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCC-CEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 3368774442036767764322233331025788656-427999806877888880689999999997898799999248 Q gi|254780934|r 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY-KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 251 ~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~-~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~ 329 (374) +..| +-.+..++.-+.-.|... .++.. ++++.|+---. ..+ .......+..+|+.|+-|--|-||. T Consensus 80 ~~~g-~a~~~~~lqiaql~lkhR---------~nk~q~qriVaFvgSpi--~es-edeLirlak~lkknnVAidii~fGE 146 (243) T COG5148 80 RLHG-GADIMRCLQIAQLILKHR---------DNKGQRQRIVAFVGSPI--QES-EDELIRLAKQLKKNNVAIDIIFFGE 146 (243) T ss_pred CCCC-CCHHHHHHHHHHHHHHCC---------CCCCCCEEEEEEECCCC--CCC-HHHHHHHHHHHHHCCEEEEEEEHHH T ss_conf 1247-640888999999998601---------48765058999946845--326-7999999999986683589986034 Q ss_pred CCCHHHH Q ss_conf 2227889 Q gi|254780934|r 330 IRSHEFL 336 (374) Q Consensus 330 ~~~~~~L 336 (374) -.+...| T Consensus 147 ~~n~~~l 153 (243) T COG5148 147 AANMAGL 153 (243) T ss_pred HHHHHHH T ss_conf 5556678 No 74 >pfam03731 Ku_N Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold. Probab=94.40 E-value=0.33 Score=25.78 Aligned_cols=187 Identities=16% Similarity=0.181 Sum_probs=93.1 Q ss_pred EEEEEECCCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCE----------EEEECCC-CH Q ss_conf 49999657864511313531-379999998876322100236656734899998268505----------6102047-86 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSI-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE----------EFFLLEW-GV 240 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~-~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~t~-~~ 240 (374) .++|++|.|-||-...+... ..++.+.+.+..++...--. .....+|++-|+..-. +..++.. +. T Consensus 1 avvf~ID~s~sM~~~~~~~~~s~~~~al~~i~~~~~~kIis---~~kD~vGvv~~~T~~~~n~~~~~ni~vl~~l~~p~a 77 (222) T pfam03731 1 STVFLIDASPAMFESVKGLEASPFEQALKCIDEILSRKIIS---NDKDLIGVVLYGTDESENSEGFENVTVLRDLDLPGA 77 (222) T ss_pred CEEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHEEC---CCCCEEEEEEECCCCCCCCCCCCEEEEECCCCCCCH T ss_conf 97999979988868788998783999999999999877137---899858899970567777678860698505788788 Q ss_pred HHHHHHHHHCCC---------CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHH-- Q ss_conf 899999860133---------687744420367677643222333310257886564279998068778888806899-- Q gi|254780934|r 241 SHLQRKIKYLSK---------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-- 309 (374) Q Consensus 241 ~~~~~~i~~l~~---------~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~-- 309 (374) ..++.-.+-+.. .........++..+.+++.... .......|.|+|+||..+=.+.+...-. T Consensus 78 ~~ik~L~~~~~~~~~~~~~~~~~~~~~~~~aL~~~~~~~~~~~-------~~~k~~~krI~LfTdnD~P~~~~~~~~~~~ 150 (222) T pfam03731 78 ELLKELDQFLEPLADVFGFNGDSSDGDLLSALWVCMDLLQKQT-------GKKKLSKKRILLFTNLDDPFEDDDQLDTIR 150 (222) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHH-------CCCCCCCCEEEEECCCCCCCCCCCHHHHHH T ss_conf 9999999985103444411689776648889999999988510-------343457867999899998988751778999 Q ss_pred --HHHHHHHHCCCEEEEEEECCCC---CHHHHHHHC---C--CCCEEEECCHHHHHHHHHHHHHHHHEEE Q ss_conf --9999999789879999924822---278899823---8--9817983898999999999998420016 Q gi|254780934|r 310 --YYCNEAKKRGAIVYAIGIRVIR---SHEFLRACA---S--PNSFYLVENPHSMYDAFSHIGKDIVTKR 369 (374) Q Consensus 310 --~~~~~~k~~gi~IytIg~~~~~---~~~~L~~~A---s--~~~~~~a~~~~~L~~~f~~I~~~i~~~~ 369 (374) ..+..+.+.||.|-.+.++.+. ...+.+.+- + ...+.....++.|.+....|-....++| T Consensus 151 ~~~~a~Dl~d~gi~i~lf~i~~~~~f~~~~FY~dii~~~~~~~~~~~~~~~~~~l~~l~~~i~~k~~~kR 220 (222) T pfam03731 151 QKLLAEDLRDEGIEFNLIHLPNSGGFDPNIFYKEIIKLGEDEENEVMLDLEGEKLEDLLSRLRAKQTAKR 220 (222) T ss_pred HHHHHCCHHHCCCEEEEEECCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 9998523787497799961498888877788886426776434566777303169999999974303565 No 75 >COG3864 Uncharacterized protein conserved in bacteria [Function unknown] Probab=94.21 E-value=0.12 Score=28.43 Aligned_cols=94 Identities=22% Similarity=0.376 Sum_probs=45.6 Q ss_pred EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHHHHHCC Q ss_conf 24999965786451131353137999999887632210023665673489999826850561020478689999986013 Q gi|254780934|r 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 (374) Q Consensus 172 ~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~i~~l~ 251 (374) +-|+.++|.||||... .++++ +.++.+-+. ...++..++.-...++........+..-. . + T Consensus 262 ~~i~vaVDtSGS~~d~------ei~a~---~~Ei~~Il~-----~~~~eltli~~D~~v~~~~~~r~g~~~~~-~---~- 322 (396) T COG3864 262 IKIVVAVDTSGSMTDA------EIDAA---MTEIFDILK-----NKNYELTLIECDNIVRRMYRVRKGRDMKK-K---L- 322 (396) T ss_pred HHEEEEEECCCCCCHH------HHHHH---HHHHHHHHH-----CCCCEEEEEEECCHHHHHHCCCCCCCCCC-C---C- T ss_conf 1147898157874489------99999---999999870-----78807999980301210310577445786-5---5- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCC Q ss_conf 3687744420367677643222333310257886564279998068778 Q gi|254780934|r 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 (374) Q Consensus 252 ~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~ 300 (374) ..||+|+..+.+.+--+.. +.-++|.+|||.-. T Consensus 323 ~ggG~Tdf~Pvfeylek~~----------------~~~~lIyfTDG~gd 355 (396) T COG3864 323 DGGGGTDFSPVFEYLEKNR----------------MECFLIYFTDGMGD 355 (396) T ss_pred CCCCCCCCCHHHHHHHHHC----------------CCCEEEEECCCCCC T ss_conf 7898766417999997616----------------33359998168887 No 76 >pfam11265 Med25_VWA Mediator complex subunit 25 von Willebrand factor type A. The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA domain which is this one, an SD2 domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This VWA or von Willebrand factor type A domain when bound to RAR and the histone acetyltransferase CBP is responsible for recruiting Med1 to the rest of the Mediator complex. Probab=93.75 E-value=0.45 Score=25.01 Aligned_cols=152 Identities=14% Similarity=0.161 Sum_probs=91.0 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHCCCC--CCCCCEEEEEEEECCCCE------EEEECCCCHH Q ss_conf 724999965786451131353137999999887632-2100236--656734899998268505------6102047868 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML-EEVKLIP--DVNNVVQSGLVTFSNKIE------EFFLLEWGVS 241 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~-~~~~~~~--~~~~~~~~~~~~~~~~~~------~~~~~t~~~~ 241 (374) --|+++|+..+.-|+.+|+.- |-+-+...+..|- ..+...+ ........+++.|..... ....+|.+.. T Consensus 5 ~~dvVfviEgTA~~g~y~~~l--kt~Yi~p~ieyF~~g~~~~~~~~~~~~~~~y~LVvf~t~~~~p~~~~q~~gpt~~~~ 82 (219) T pfam11265 5 VKDVVFVIEGTANLGPYFETL--KTDYILPIIEYFNGGPLAETDFGGEYGGTQYSLVVFNTHASYPECLVQRSGPTRDVD 82 (219) T ss_pred CEEEEEEEECCCCCCCHHHHH--HHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCHHHHHHCCCCCCHH T ss_conf 004899995543452128999--887689999986189965332356678845899996046877623666068858999 Q ss_pred HHHHHHHHCCCCCCCCC----CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCC-------CHHHHH Q ss_conf 99999860133687744----4203676776432223333102578865642799980687788888-------068999 Q gi|254780934|r 242 HLQRKIKYLSKFGVSTN----STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-------DQQSLY 310 (374) Q Consensus 242 ~~~~~i~~l~~~g~~T~----~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~-------~~~~~~ 310 (374) .+..+++.|+-.|||-. +.+|+..|...+.+....+. ..+..+.+|.-||+..-.+..-.. .....+ T Consensus 83 ~fl~~Ld~i~f~GGG~es~A~iaEGLa~ALq~Fdd~~~~r~--~~~~~~~qkhCILIcnSpPy~lP~~e~~~y~g~t~dq 160 (219) T pfam11265 83 EFLQWLSSIPFMGGGFESCALIAEGLAEALQMFDDFSKMRQ--QQGQTDVHRHCILICNSPPYPLPTVESWQYEGKTSDQ 160 (219) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCEEEEEEECCCCCCCCCCHHHHHCCCCHHH T ss_conf 99999973501678714668899899999987313665164--6898875304899968999768651204343876899 Q ss_pred HHHHHH--HCCCEEEEEE Q ss_conf 999999--7898799999 Q gi|254780934|r 311 YCNEAK--KRGAIVYAIG 326 (374) Q Consensus 311 ~~~~~k--~~gi~IytIg 326 (374) ++..+. +++|-+..|. T Consensus 161 la~~~~f~e~~i~lSIis 178 (219) T pfam11265 161 LAAAINFAERSISLSIIC 178 (219) T ss_pred HHHHHCCCCCCEEEEEEC T ss_conf 998741200462589976 No 77 >pfam07002 Copine Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth. Probab=92.02 E-value=0.79 Score=23.52 Aligned_cols=118 Identities=14% Similarity=0.189 Sum_probs=65.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC------EEEEECCCCH------------HHHHHHHHHCCCC Q ss_conf 137999999887632210023665673489999826850------5610204786------------8999998601336 Q gi|254780934|r 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI------EEFFLLEWGV------------SHLQRKIKYLSKF 253 (374) Q Consensus 192 ~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~t~~~------------~~~~~~i~~l~~~ 253 (374) .+-.+.+..++...+...+. +-++.++.|+.+. ...++++.+. +..+..+..+... T Consensus 10 ~N~Y~~AI~~vg~il~~YD~------Dk~~p~yGFGa~~~~~~~vsh~F~ln~~~~~p~~~G~~gvl~aY~~~~~~v~l~ 83 (145) T pfam07002 10 PNPYEQAIRIVGEILQPYDS------DKRFPAFGFGARLPPDYEVSHDFPLNFNPENPECNGIEGVLNAYREALPNLQLS 83 (145) T ss_pred CCHHHHHHHHHHHHHHHCCC------CCCEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCEEEEC T ss_conf 88999999999999872689------881365435556699986330123568989997669999999999985810654 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 877444203676776432223333102578865642799980687788888068999999999789879999924 Q gi|254780934|r 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 254 g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~ 328 (374) | -|+..+-+..+.+.... . .....-.+++++|||+-+ +-..+....-++-+..+.|-.||+| T Consensus 84 g-PT~fapiI~~a~~~a~~----~-----~~~~~Y~VLlIiTDG~i~---D~~~Ti~aIv~AS~~PlSIIiVGVG 145 (145) T pfam07002 84 G-PTNFAPIIDAAARIAEA----T-----QKSGQYHVLLIITDGQVT---DMKATIDAIVRASHLPLSIIIVGVG 145 (145) T ss_pred C-CCCHHHHHHHHHHHHHH----H-----CCCCEEEEEEEECCCCCC---CHHHHHHHHHHHHCCCEEEEEEEEC T ss_conf 8-75279999999999997----2-----237718999996389735---6999999999982799279999519 No 78 >KOG4465 consensus Probab=91.59 E-value=0.88 Score=23.24 Aligned_cols=141 Identities=15% Similarity=0.179 Sum_probs=79.3 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--HHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECC--CCHHHHHHH Q ss_conf 724999965786451131353137999999887--6322100236656734899998268505610204--786899999 Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN--AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK 246 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--~~~~~~~~~ 246 (374) +-...+.+|+|+||+.-..++... .+.+.. .++..-.. .-...+.|.++.... |+| +....+..+ T Consensus 427 gkr~~laldvs~sm~~rv~~s~ln---~reaaa~m~linlhne-------ad~~~vaf~d~lte~-pftkd~kigqv~~~ 495 (598) T KOG4465 427 GKRFCLALDVSASMNQRVLGSILN---AREAAAAMCLINLHNE-------ADSRCVAFCDELTEC-PFTKDMKIGQVLDA 495 (598) T ss_pred CCEEEEEEECCHHHHHHHHCCCCC---HHHHHHHHHEEEECCC-------CCEEEEEECCCCCCC-CCCCCCCHHHHHHH T ss_conf 745899985104444556436555---6788755512451244-------440378860545468-87553449999999 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCE---EE Q ss_conf 860133687744420367677643222333310257886564279998068778888806899999999978987---99 Q gi|254780934|r 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI---VY 323 (374) Q Consensus 247 i~~l~~~g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~---Iy 323 (374) ++.+++ | +|+-+-.+.|+...- -+-.+.|+.||-+...+..... .++-+.-+..+|. +. T Consensus 496 ~nni~~-g-~tdcglpm~wa~enn---------------lk~dvfii~tdndt~ageihp~-~aik~yrea~~i~dakli 557 (598) T KOG4465 496 MNNIDA-G-GTDCGLPMIWAQENN---------------LKADVFIIFTDNDTFAGEIHPA-EAIKEYREAMDIHDAKLI 557 (598) T ss_pred HHCCCC-C-CCCCCCCEEEHHHCC---------------CCCCEEEEEECCCCCCCCCCHH-HHHHHHHHHCCCCCCEEE T ss_conf 855888-8-876687204321058---------------8754799983586324666778-999999985489865179 Q ss_pred EE-------EECCCCCHHHHHHHC Q ss_conf 99-------924822278899823 Q gi|254780934|r 324 AI-------GIRVIRSHEFLRACA 340 (374) Q Consensus 324 tI-------g~~~~~~~~~L~~~A 340 (374) .. .+.+|.+...|..|. T Consensus 558 v~amqa~d~siadp~dagmldi~g 581 (598) T KOG4465 558 VCAMQANDFSIADPDDAGMLDICG 581 (598) T ss_pred EEEEECCCCEECCCCCCCCEEECC T ss_conf 998633883224855466332036 No 79 >cd01479 Sec24-like Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 Probab=90.96 E-value=1 Score=22.86 Aligned_cols=174 Identities=13% Similarity=0.128 Sum_probs=89.2 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE------------------ Q ss_conf 72499996578645113135313799999988763221002366567348999982685056------------------ Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE------------------ 232 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------ 232 (374) |--.+||+|.|-.--.. .-.+.+.+++...++.+.. .....+++++.|+..+.- T Consensus 3 Pp~y~FvIDvS~~av~s-----G~l~~~~~sI~~~L~~lp~---~~~rt~VgiiTfds~vhfy~l~~~l~~pqm~vv~Dl 74 (244) T cd01479 3 PAVYVFLIDVSYNAIKS-----GLLATACEALLSNLDNLPG---DDPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDL 74 (244) T ss_pred CCEEEEEEECCHHHHHH-----CHHHHHHHHHHHHHHHCCC---CCCCEEEEEEEECCEEEEEECCCCCCCCEEEEEECC T ss_conf 98899999898999876-----8799999999999985778---888628999996887999966888888716873066 Q ss_pred -----------EEECCCCHHHHHHHHHHCCCC-----CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEC Q ss_conf -----------102047868999998601336-----8774442036767764322233331025788656427999806 Q gi|254780934|r 233 -----------FFLLEWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 (374) Q Consensus 233 -----------~~~~t~~~~~~~~~i~~l~~~-----g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillTD 296 (374) ..++...+..+...++.|+.. ..+...+.|++.|...|... + -|+++ +.- T Consensus 75 ~d~f~P~~~~llv~l~e~~~~i~~lL~~lp~~f~~~~~~~~~~G~Al~aA~~~l~~~---------G----GkI~~-f~s 140 (244) T cd01479 75 DDPFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKET---------G----GKIIV-FQS 140 (244) T ss_pred CCCCCCCCCCEEECHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHC---------C----CEEEE-EEC T ss_conf 555678874436525998999999999878875068988643889999999999734---------9----88999-964 Q ss_pred CCCCCCC-----------------------CCHHHHHHHHHHHHCCCEEEEEEECCC-CCHHHHHHHC--CCCC-EEEE- Q ss_conf 8778888-----------------------806899999999978987999992482-2278899823--8981-7983- Q gi|254780934|r 297 GENLSTK-----------------------EDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACA--SPNS-FYLV- 348 (374) Q Consensus 297 G~~~~~~-----------------------~~~~~~~~~~~~k~~gi~IytIg~~~~-~~~~~L~~~A--s~~~-~~~a- 348 (374) +.++.|. ...-.+.++..+-+.+|.|.-.-|... .+-..|..++ |+|. ||+- T Consensus 141 s~Pt~G~G~l~~r~~~~~~~~~~e~~ll~p~~~fY~~la~~~~~~~isvDlF~~~~~~~Dlatl~~l~~~TGG~~~~Yp~ 220 (244) T cd01479 141 SLPTLGAGKLKSREDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPS 220 (244) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHCCCEEEEEECC T ss_conf 79988887550054434568602222038660899999999998592189996268766544430044316724898078 Q ss_pred ---CCHHHHHHHHHHHHHHHH Q ss_conf ---898999999999998420 Q gi|254780934|r 349 ---ENPHSMYDAFSHIGKDIV 366 (374) Q Consensus 349 ---~~~~~L~~~f~~I~~~i~ 366 (374) ....+-....+++...++ T Consensus 221 f~~~~~~d~~kl~~dl~~~lt 241 (244) T cd01479 221 FNFSAPNDVEKLVNELARYLT 241 (244) T ss_pred CCCCCHHHHHHHHHHHHHHHC T ss_conf 777677688999999999834 No 80 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=89.67 E-value=0.7 Score=23.83 Aligned_cols=46 Identities=17% Similarity=0.330 Sum_probs=33.5 Q ss_pred CCCCEEEEEEECCCCC--C--------------------CCCCHHHHHHHHHHHHCCCEEEEEEECCCCC Q ss_conf 6564279998068778--8--------------------8880689999999997898799999248222 Q gi|254780934|r 285 ANYKKIIVFMTDGENL--S--------------------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 (374) Q Consensus 285 ~~~~k~iillTDG~~~--~--------------------~~~~~~~~~~~~~~k~~gi~IytIg~~~~~~ 332 (374) ++|||++++++-|.+. . .+.++.+..-+++++++||. .||.|...+ T Consensus 62 e~PRKImLMVkAG~pVdaD~~I~~L~P~LE~GDiIIDGGNS~Y~DT~RR~~eL~~~Gi~--FvG~GvSGG 129 (480) T TIGR00873 62 ERPRKIMLMVKAGAPVDADAVINSLLPLLEKGDIIIDGGNSHYKDTERRYKELKAKGIL--FVGVGVSGG 129 (480) T ss_pred CCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHHCCCC--EEEEEEECC T ss_conf 68872888775388537789999964435899888758878846657899999864981--673013245 No 81 >COG3847 Flp Flp pilus assembly protein, pilin Flp [Intracellular trafficking and secretion] Probab=88.38 E-value=0.81 Score=23.48 Aligned_cols=29 Identities=21% Similarity=0.371 Sum_probs=20.7 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 78886412368629999999999999999 Q gi|254780934|r 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVL 34 (374) Q Consensus 6 ~~~~~f~~~~~G~vaiifAl~l~~ll~~~ 34 (374) -.++||+||++|.-+|=.+++...+-.++ T Consensus 3 ~~~~rF~rDE~GAtaiEYglia~lIav~i 31 (58) T COG3847 3 KLLRRFLRDEDGATAIEYGLIAALIAVVI 31 (58) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 78999977456518999999999999999 No 82 >KOG2487 consensus Probab=85.73 E-value=2.3 Score=20.75 Aligned_cols=74 Identities=14% Similarity=0.167 Sum_probs=47.8 Q ss_pred CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHC--CCCCEEEECCHHHHHHHHHH Q ss_conf 865642799980687788888068999999999789879999924822278899823--89817983898999999999 Q gi|254780934|r 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 (374) Q Consensus 284 ~~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~A--s~~~~~~a~~~~~L~~~f~~ 360 (374) ++..+..++++|=+.+-.. .+-......-.+.+.+|.|-.+-+|.+ ..+|++|+ ++|.|.++++++.|-+.+-. T Consensus 162 ~~~lkSRilV~t~t~d~~~-qyi~~MNciFaAqKq~I~Idv~~l~~~--s~~LqQa~D~TGG~YL~v~~~~gLLqyLlt 237 (314) T KOG2487 162 SEKLKSRILVFTLTRDRAL-QYIPYMNCIFAAQKQNIPIDVVSLGGD--SGFLQQACDITGGDYLHVEKPDGLLQYLLT 237 (314) T ss_pred HHHHHCEEEEEEECHHHHH-HHHHHHHHHHHHHHCCCEEEEEEECCC--CHHHHHHHHHCCCEEEECCCCCHHHHHHHH T ss_conf 1321232899991517776-553577877778753961589995698--439999875028704714885259999999 No 83 >TIGR00627 tfb4 transcription factor tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0016251 general RNA polymerase II transcription factor activity, 0006281 DNA repair, 0006355 regulation of transcription DNA-dependent, 0000439 core TFIIH complex. Probab=85.16 E-value=2.5 Score=20.59 Aligned_cols=87 Identities=15% Similarity=0.164 Sum_probs=57.5 Q ss_pred CCCCCCEEEEEEECCCCCCCC---CCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHC--CCCCEEEECCHHHHHHH Q ss_conf 886564279998068778888---8068999999999789879999924822278899823--89817983898999999 Q gi|254780934|r 283 EDANYKKIIVFMTDGENLSTK---EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDA 357 (374) Q Consensus 283 ~~~~~~k~iillTDG~~~~~~---~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~A--s~~~~~~a~~~~~L~~~ 357 (374) .....+..+++++=|.-..+. .+-..--..=.+.+++|-|-.+-+|.+...-+|+|.| |+|-|-+++++..|-+. T Consensus 151 ~~~~l~sR~lv~~~GsGst~d~~~qY~~~MN~iFsA~K~~i~idvv~~~~~~~~~~LqQAaD~TGG~YL~v~~~~~LL~y 230 (295) T TIGR00627 151 AKEKLKSRILVISIGSGSTEDVALQYIPLMNCIFSAQKQNIPIDVVKIGGDFESGFLQQAADITGGVYLKVEKPKGLLQY 230 (295) T ss_pred HHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHCCEEEEECCCHHHHHH T ss_conf 00110011789973578762101002005569998516984158998089830206777776638745742787468999 Q ss_pred H-HHHHHHHHEEE Q ss_conf 9-99998420016 Q gi|254780934|r 358 F-SHIGKDIVTKR 369 (374) Q Consensus 358 f-~~I~~~i~~~~ 369 (374) + ..+.-+=..|. T Consensus 231 L~~~~l~D~~~R~ 243 (295) T TIGR00627 231 LMTAMLPDPDLRA 243 (295) T ss_pred HHHHHCCCCCCCC T ss_conf 9998578902112 No 84 >cd01468 trunk_domain trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins. Probab=84.77 E-value=2.6 Score=20.48 Aligned_cols=160 Identities=16% Similarity=0.110 Sum_probs=86.6 Q ss_pred CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE------------------ Q ss_conf 72499996578645113135313799999988763221002366567348999982685056------------------ Q gi|254780934|r 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE------------------ 232 (374) Q Consensus 171 ~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------ 232 (374) +--.+||+|.|-.--.. .-.+.+++++...++.+.. ....+++++.|+..+.. T Consensus 3 pp~~~FvIDvs~~ai~~-----g~l~~~~~si~~~l~~lp~----~~~~~VgiiTfd~~v~~y~l~~~~~~~~~~vv~d~ 73 (239) T cd01468 3 PPVFVFVIDVSYEAIKE-----GLLQALKESLLASLDLLPG----DPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDL 73 (239) T ss_pred CCEEEEEEECCHHHHHC-----HHHHHHHHHHHHHHHHCCC----CCCCEEEEEEECCEEEEEECCCCCCCCCEEEECCC T ss_conf 98899999886665334-----0999999999999985778----99978999997877899975888778823675388 Q ss_pred -----------EEECCCCHHHHHHHHHHCCCCCC-------CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE Q ss_conf -----------10204786899999860133687-------744420367677643222333310257886564279998 Q gi|254780934|r 233 -----------FFLLEWGVSHLQRKIKYLSKFGV-------STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 (374) Q Consensus 233 -----------~~~~t~~~~~~~~~i~~l~~~g~-------~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iill 294 (374) +.++...+..+...++.|..... ..-.+.++..+...|.... .+ -.|+++ T Consensus 74 ~d~f~P~~~~~lv~~~e~~~~i~~lL~~l~~~~~~~~~~~~~~~~GsAl~~A~~~L~~~~-------~G-----GkI~~f 141 (239) T cd01468 74 KDVFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTF-------AG-----GRIIVF 141 (239) T ss_pred CCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC-------CC-----CEEEEE T ss_conf 556678875457618998999999999877511454788887437999999999972457-------89-----669999 Q ss_pred ECCCCCCCC-----------------------CCHHHHHHHHHHHHCCCEEEEEEECCC-CCHHHHHHHC--CCCCEEEE Q ss_conf 068778888-----------------------806899999999978987999992482-2278899823--89817983 Q gi|254780934|r 295 TDGENLSTK-----------------------EDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACA--SPNSFYLV 348 (374) Q Consensus 295 TDG~~~~~~-----------------------~~~~~~~~~~~~k~~gi~IytIg~~~~-~~~~~L~~~A--s~~~~~~a 348 (374) +-|.++.|. ...-...++..+.+.||-|.-..+..+ .+-..|+.++ |+|..|+- T Consensus 142 ~s~~pt~GpG~l~~r~~~~~~~s~~e~~~~~~~~~fY~~la~~~~~~~isvDlF~~s~~~~dlatl~~l~~~TGG~~~~y 221 (239) T cd01468 142 QGGLPTVGPGKLKSREDKEPIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLY 221 (239) T ss_pred ECCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHCCEEEEEEECCCCCCCHHHHHHHHHHCCCEEEEE T ss_conf 57899788986546554557897115665352078999999999986915999972586678388875887269679985 Q ss_pred CCH Q ss_conf 898 Q gi|254780934|r 349 ENP 351 (374) Q Consensus 349 ~~~ 351 (374) ++. T Consensus 222 ~~F 224 (239) T cd01468 222 DSF 224 (239) T ss_pred CCC T ss_conf 888 No 85 >COG4726 PilX Tfp pilus assembly protein PilX [Cell motility and secretion / Intracellular trafficking and secretion] Probab=84.00 E-value=2.8 Score=20.28 Aligned_cols=55 Identities=13% Similarity=0.227 Sum_probs=29.8 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 641236862999999999999999999999--------99999999999999999999988763 Q gi|254780934|r 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIE--------VSHIFFMKTVLHSMIDRSLVHAATQI 65 (374) Q Consensus 10 ~f~~~~~G~vaiifAl~l~~ll~~~g~avD--------~~~~~~~ks~Lq~A~DaAaLAaa~~~ 65 (374) +-.|.+|| ++++++|++++++.++|++.- .+.-+..|+..++|+++|.-.+=..+ T Consensus 7 r~~r~qRG-~~LivvL~~LvvltLl~l~~~r~~llqeRiSaN~~D~~lAfqaAEaaLr~~E~~i 69 (196) T COG4726 7 RGSRRQRG-FALIVVLMVLVVLTLLGLAAARSVLLQERISANERDRSLAFQAAEAALREGELQI 69 (196) T ss_pred CCCCCCCC-EEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87645676-4738999999999999999999999898875206778999999999998778998 No 86 >pfam04811 Sec23_trunk Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Probab=83.73 E-value=2.8 Score=20.21 Aligned_cols=158 Identities=16% Similarity=0.115 Sum_probs=87.8 Q ss_pred CEEEEEEECCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE----------------- Q ss_conf 7249999657864-5113135313799999988763221002366567348999982685056----------------- Q gi|254780934|r 171 RLDMMIVLDVSRS-MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE----------------- 232 (374) Q Consensus 171 ~~di~~VlD~SgS-M~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------- 232 (374) |--.+||+|.|-. .... -.+.+++++...++.+.. ....+++++.|+..+.. T Consensus 3 PP~f~FvIDvS~~ai~~g------~l~~~~~si~~~l~~lp~----~~~~~VgiITf~~~v~~y~l~~~~~~~~~~vv~d 72 (241) T pfam04811 3 PPVFLFVIDVSYNAIKSG------LLAALKESLLQSLDLLPG----DPRALVGFITFDSTVHFFNLSSSLRQPKMLVVSD 72 (241) T ss_pred CCEEEEEEECCHHHHHCC------HHHHHHHHHHHHHHHCCC----CCCCEEEEEEECCEEEEEECCCCCCCCEEEEHHH T ss_conf 988999998865653343------999999999999984779----9986899999688689998778888872650356 Q ss_pred ------------EEECCCCHHHHHHHHHHCCCC-----CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEE Q ss_conf ------------102047868999998601336-----877444203676776432223333102578865642799980 Q gi|254780934|r 233 ------------FFLLEWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 (374) Q Consensus 233 ------------~~~~t~~~~~~~~~i~~l~~~-----g~~T~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~k~iillT 295 (374) ..++...+..+...++.|+.. ......+.++..|...|... ..+ -.|++++ T Consensus 73 l~d~f~P~~~~~lv~~~e~~~~i~~lL~~L~~~~~~~~~~~~~~G~Al~~A~~lL~~~-------~~G-----GkI~~F~ 140 (241) T pfam04811 73 LQDMFLPLPDRFLVPLSECRFVLEDLLEELPRMFPVTKRPERCLGPALQAAVLLLKAA-------FTG-----GKIMLFQ 140 (241) T ss_pred HHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHC-------CCC-----CEEEEEE T ss_conf 5430367755546446999999999999757643467887422489999999998516-------899-----7699994 Q ss_pred CCCCCCCC-------------------------CCHHHHHHHHHHHHCCCEEEEEEECCC-CCHHHHHHHC--CCCCEEE Q ss_conf 68778888-------------------------806899999999978987999992482-2278899823--8981798 Q gi|254780934|r 296 DGENLSTK-------------------------EDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACA--SPNSFYL 347 (374) Q Consensus 296 DG~~~~~~-------------------------~~~~~~~~~~~~k~~gi~IytIg~~~~-~~~~~L~~~A--s~~~~~~ 347 (374) -|.++.|. .+.-...++..+-+.||.+.-..+..+ .+-..|+.++ |+|..|+ T Consensus 141 s~~pt~Gpg~~~~~~~~~~~~~~~~e~~~~~~~~~~fY~~la~~~~~~~isvDlF~~s~~~~dlatl~~l~~~TGG~i~~ 220 (241) T pfam04811 141 GGLPTVGPGGKLKSRLDESHHDTDKEKAKLVKKADKFYKSLAKECVAQGHSVDLFAFSLDYVDVAELGCLSRLTGGQVYL 220 (241) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHCCCCEEEE T ss_conf 78998797544555444344676300454166217999999999998694599995278667828877687606936998 Q ss_pred ECC Q ss_conf 389 Q gi|254780934|r 348 VEN 350 (374) Q Consensus 348 a~~ 350 (374) -++ T Consensus 221 y~~ 223 (241) T pfam04811 221 YPS 223 (241) T ss_pred ECC T ss_conf 678 No 87 >cd01478 Sec23-like Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24 Probab=75.95 E-value=5.2 Score=18.68 Aligned_cols=46 Identities=20% Similarity=0.264 Sum_probs=32.2 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEE Q ss_conf 499996578645113135313799999988763221002366567348999982685056 Q gi|254780934|r 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 (374) Q Consensus 173 di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (374) -.+||+|.+-. .. .++.+|+.+...+..+. ...++|+++|+..+.. T Consensus 5 vf~fVVDtc~~--ee------eL~~LK~sL~~~ls~LP------~~alVGLITFG~~V~v 50 (267) T cd01478 5 VFLFVVDTCMD--EE------ELDALKESLIMSLSLLP------PNALVGLITFGTMVQV 50 (267) T ss_pred EEEEEEECCCC--HH------HHHHHHHHHHHHHHHCC------CCCEEEEEECCCEEEE T ss_conf 99999987779--99------99999999999998489------9777999984987999 No 88 >COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only] Probab=75.02 E-value=5.5 Score=18.53 Aligned_cols=81 Identities=23% Similarity=0.278 Sum_probs=57.3 Q ss_pred CCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEECCCCHHHHHHH Q ss_conf 37787249999657864511313531379999998876322100236656734899998268505610204786899999 Q gi|254780934|r 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 (374) Q Consensus 167 ~~~~~~di~~VlD~SgSM~~~~~~~~~ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~ 246 (374) +.....++++++|.|.||...... ..+.+.+..++..+.-... ....++++..++.......+...+....... T Consensus 220 e~er~~~v~l~lD~~~~m~~~~~~-~~~~e~av~~a~~la~~~l-----~~gd~vg~~~~~~~~~~~~~p~~G~~~l~~~ 293 (416) T COG1721 220 EEERGRTVVLVLDASRSMLFGSGV-ASKFEEAVRAAASLAYAAL-----KNGDRVGLLIFGGGGPKWIPPSRGRRHLARI 293 (416) T ss_pred ECCCCCCEEEEEECCCCCCCCCCC-CHHHHHHHHHHHHHHHHHH-----HCCCCEEEEEECCCCCEEECCCCCHHHHHHH T ss_conf 302675289999089755456553-0089999999999999999-----6399379999668863343787777899999 Q ss_pred HHHCCCC Q ss_conf 8601336 Q gi|254780934|r 247 IKYLSKF 253 (374) Q Consensus 247 i~~l~~~ 253 (374) ++.+... T Consensus 294 l~~l~~~ 300 (416) T COG1721 294 LKALALL 300 (416) T ss_pred HHHHHHC T ss_conf 9997413 No 89 >pfam03850 Tfb4 Transcription factor Tfb4. Probab=74.55 E-value=5.6 Score=18.46 Aligned_cols=76 Identities=13% Similarity=0.102 Sum_probs=51.1 Q ss_pred CCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHC--CCCCEEEECCHHHHHHHHHHHH Q ss_conf 65642799980687788888068999999999789879999924822278899823--8981798389899999999999 Q gi|254780934|r 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 (374) Q Consensus 285 ~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~A--s~~~~~~a~~~~~L~~~f~~I~ 362 (374) ...+..|++++=.. .....+-......-.+++.+|.|-+.-++.. +..+|++.+ ++|-|+.+++.++|.+..-.+. T Consensus 139 ~~~~~RILiis~S~-d~~~QYi~~MN~iFaAqk~~I~IDvc~L~~~-~s~fLQQA~diT~G~Yl~~~~~~gLlQyL~~~f 216 (271) T pfam03850 139 TSLKSRILVLSGSP-DSASQYIPIMNSIFAAQKLKIPIDVCKLGGE-DSSFLQQAADITGGVYLHVTEPDGLLQYLMTAF 216 (271) T ss_pred CCCEEEEEEEECCC-CCHHHHHHHHHHHHHHHHCCCEEEEEEECCC-CCHHHHHHHHHHCCEEECCCCCCHHHHHHHHHH T ss_conf 65302599998788-8447789999999999855974799993699-858999999974977751478333899999996 No 90 >COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=72.75 E-value=6.2 Score=18.20 Aligned_cols=83 Identities=18% Similarity=0.109 Sum_probs=46.4 Q ss_pred CCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC-CCCHHHHHHHCCCCCEEEECCHHHHHHHHHHHHH Q ss_conf 656427999806877888880689999999997898799999248-2227889982389817983898999999999998 Q gi|254780934|r 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 285 ~~~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~-~~~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~ 363 (374) ++.+++-++..-||.|+. .....+-..+++.|++++..++.. ...+..++...+....++...-..+...++.+.+ T Consensus 157 Pnak~Igv~Y~p~E~ns~---~l~eelk~~A~~~Gl~vve~~v~~~ndi~~a~~~l~g~~d~i~~p~dn~i~s~~~~l~~ 233 (322) T COG2984 157 PNAKSIGVLYNPGEANSV---SLVEELKKEARKAGLEVVEAAVTSVNDIPRAVQALLGKVDVIYIPTDNLIVSAIESLLQ 233 (322) T ss_pred CCCEEEEEEECCCCCCCH---HHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHCCCCCEEEEECCHHHHHHHHHHHH T ss_conf 887069999579886608---99999999998779889998347632008999973478767998660677888999999 Q ss_pred HHHEEEE Q ss_conf 4200166 Q gi|254780934|r 364 DIVTKRI 370 (374) Q Consensus 364 ~i~~~~~ 370 (374) ...+.++ T Consensus 234 ~a~~~ki 240 (322) T COG2984 234 VANKAKI 240 (322) T ss_pred HHHHHCC T ss_conf 9887089 No 91 >PRK08643 acetoin reductase; Validated Probab=69.65 E-value=7.3 Score=17.78 Aligned_cols=22 Identities=14% Similarity=0.110 Sum_probs=17.2 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 163 tkslA~ela~~gIrVN~V~PG~ 184 (256) T PRK08643 163 TQTAARDLASEGITVNAYAPGI 184 (256) T ss_pred HHHHHHHHHHHCCEEEEEEECC T ss_conf 9999999877591899996066 No 92 >PRK10538 3-hydroxy acid dehydrogenase; Provisional Probab=68.98 E-value=7.5 Score=17.69 Aligned_cols=52 Identities=10% Similarity=0.033 Sum_probs=27.1 Q ss_pred HHHHHHHHHCCCEEEEEEECCCC-----------CHHHHHHHCCCCCEEEECCHHHHHHHHHHHHH Q ss_conf 99999999789879999924822-----------27889982389817983898999999999998 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVIR-----------SHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~~-----------~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~ 363 (374) ..++.++...||++.+|.-|.-. +.+.+...-..+.. -.++|+.++.--++. T Consensus 159 ~~La~El~~~gIrVn~v~PG~v~t~~~~~~~~~~~~~~~~~~~~~~~~---l~PedVA~av~fl~s 221 (248) T PRK10538 159 LNLRTDLHGTAVRVTDIEPGLVGGTEFSNVRFKGDDGKAEKTYQNTVA---LTPEDVSEAVWWVAT 221 (248) T ss_pred HHHHHHHCCCCEEEEEEECCCCCCCCHHCCCCCCCHHHHHHHHCCCCC---CCHHHHHHHHHHHHC T ss_conf 999998478685999998475768411114556768889740357899---999999999999982 No 93 >PRK08265 short chain dehydrogenase; Provisional Probab=67.85 E-value=8 Score=17.55 Aligned_cols=22 Identities=9% Similarity=-0.005 Sum_probs=17.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.+....||+|.+|.-|. T Consensus 161 tk~lA~e~a~~gIrVN~IaPG~ 182 (261) T PRK08265 161 TRSMAMDLAPDGIRVNSVSPGW 182 (261) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999741092998885587 No 94 >PRK05872 short chain dehydrogenase; Provisional Probab=67.39 E-value=8.1 Score=17.50 Aligned_cols=21 Identities=19% Similarity=-0.016 Sum_probs=15.1 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) ..+..+++..||++-+|.-|. T Consensus 166 ~sLa~Ela~~GIrVn~V~PG~ 186 (296) T PRK05872 166 NALRLEVAHRGVSVGSAYLSW 186 (296) T ss_pred HHHHHHHHHHCCEEEEEECCC T ss_conf 999998400193899997088 No 95 >PRK06720 hypothetical protein; Provisional Probab=63.04 E-value=9.9 Score=17.00 Aligned_cols=65 Identities=14% Similarity=0.074 Sum_probs=29.8 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCC-CC--CE--EEECCHHHHHHHHHHHHHHHHEEEEEEE Q ss_conf 999999997898799999248222788998238-98--17--9838989999999999984200166620 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PN--SF--YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As-~~--~~--~~a~~~~~L~~~f~~I~~~i~~~~~~~~ 373 (374) ...|..+.+.|.+|...+...+..++..+++.. ++ .+ .++.+.+++.+.++++.++..+.-+++| T Consensus 30 ~a~A~~la~~Ga~Vvi~d~~~~~~~~~~~~i~~~g~~a~~~~~Dvs~~~~v~~~i~~~~~~~g~iDiLvN 99 (169) T PRK06720 30 RNTALLLAKQGAKVIVTDIDQESGQATVEEITNLGGEALFVSYDMEKQGDWQRVISITLNAFSRIDMLFQ 99 (169) T ss_pred HHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 9999999986998999527636599999999974995378975889999999999999997598998998 No 96 >PRK07523 gluconate 5-dehydrogenase; Provisional Probab=60.07 E-value=11 Score=16.68 Aligned_cols=22 Identities=18% Similarity=0.048 Sum_probs=16.3 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 166 Tr~lA~e~a~~gIrVNaVaPG~ 187 (251) T PRK07523 166 TKGMATDWAKHGLQCNAIAPGY 187 (251) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999702094999997378 No 97 >PRK06138 short chain dehydrogenase; Provisional Probab=58.21 E-value=12 Score=16.49 Aligned_cols=22 Identities=14% Similarity=0.055 Sum_probs=16.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 164 Tk~lA~e~a~~gIrVNaI~PG~ 185 (252) T PRK06138 164 TRAMALDHATDGIRVNAVAPGT 185 (252) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999998622291999997588 No 98 >PRK12826 3-ketoacyl-(acyl-carrier-protein) reductase; Reviewed Probab=57.21 E-value=13 Score=16.39 Aligned_cols=21 Identities=14% Similarity=0.071 Sum_probs=15.6 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.+....||+|.+|..|. T Consensus 168 k~lA~e~~~~gIrvN~I~PG~ 188 (253) T PRK12826 168 RALALELARRNITVNSVHPGM 188 (253) T ss_pred HHHHHHHHHHCEEEEEEEECC T ss_conf 999998532095999996287 No 99 >PRK05717 oxidoreductase; Validated Probab=56.38 E-value=13 Score=16.30 Aligned_cols=19 Identities=11% Similarity=-0.164 Sum_probs=12.3 Q ss_pred HHHHHHHHCCCEEEEEEECC Q ss_conf 99999997898799999248 Q gi|254780934|r 310 YYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 310 ~~~~~~k~~gi~IytIg~~~ 329 (374) .++.++.. +|++.+|..|. T Consensus 170 slA~e~a~-~IRvN~I~PG~ 188 (255) T PRK05717 170 ALAISLGP-EIRVNAVSPGW 188 (255) T ss_pred HHHHHHCC-CCEEEEEECCC T ss_conf 99999779-99899996271 No 100 >PRK07454 short chain dehydrogenase; Provisional Probab=55.79 E-value=13 Score=16.24 Aligned_cols=55 Identities=9% Similarity=0.138 Sum_probs=28.9 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCCCCCEE--EECCHHHHHHHHHHHHHH Q ss_conf 99999999789879999924822278899823898179--838989999999999984 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY--LVENPHSMYDAFSHIGKD 364 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As~~~~~--~a~~~~~L~~~f~~I~~~ 364 (374) ..+..+++..||++-+|.-|. .+.+++..-.-...+- ..-.++++.++.--++.+ T Consensus 167 ~~la~E~~~~gIrVn~V~PG~-v~T~m~~~~~~~~~~~~~~~l~PedVA~~v~flas~ 223 (241) T PRK07454 167 KCLAEEERSHGIRVCTLTLGA-VNTPLWDSETVQADFDRSAMLSPEQVAQTILYLAQL 223 (241) T ss_pred HHHHHHHCCCCCEEEEEEECC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 999998384593899997388-988988863333554556899999999999999769 No 101 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=54.72 E-value=13 Score=16.38 Aligned_cols=38 Identities=13% Similarity=0.251 Sum_probs=34.2 Q ss_pred HHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHHHEEEEE Q ss_conf 788998238981798389899999999999842001666 Q gi|254780934|r 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 (374) Q Consensus 333 ~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~~i~~~~~~ 371 (374) .+...+|- +|--+++.|.++|..|++++..+-.+++.| T Consensus 382 ~dI~~~C~-NGLLvd~ld~e~i~~AL~~alsd~~QW~~W 419 (445) T TIGR02472 382 RDIIANCR-NGLLVDVLDLEAIASALEQALSDSSQWQTW 419 (445) T ss_pred HHHHHHCC-CCCEECCCCHHHHHHHHHHHCCCHHHHHHH T ss_conf 68884288-875005789899999999733890667899 No 102 >PRK08277 D-mannonate oxidoreductase; Provisional Probab=54.17 E-value=14 Score=16.08 Aligned_cols=22 Identities=27% Similarity=0.202 Sum_probs=15.8 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 185 Tk~lA~e~a~~gIrVNaIaPG~ 206 (278) T PRK08277 185 TQWLAVEFAKVGIRVNAIAPGF 206 (278) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999653594999985288 No 103 >PRK06139 short chain dehydrogenase; Provisional Probab=54.13 E-value=14 Score=16.08 Aligned_cols=33 Identities=6% Similarity=-0.148 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEE Q ss_conf 799999988763221002366567348999982 Q gi|254780934|r 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 (374) Q Consensus 194 ki~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (374) -....|-++..|.+.+...-.....+++..+.- T Consensus 154 aY~ASK~Av~gftesLr~EL~~~~gI~Vt~V~P 186 (324) T PRK06139 154 AYSASKFGLRGFSEALRAELTDFPDIHVCDVYP 186 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEC T ss_conf 198999999999999999837998918999857 No 104 >pfam02060 ISK_Channel Slow voltage-gated potassium channel. Probab=53.88 E-value=14 Score=16.06 Aligned_cols=39 Identities=21% Similarity=0.097 Sum_probs=31.8 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 864123686299999999999999999999999999999 Q gi|254780934|r 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMK 47 (374) Q Consensus 9 ~~f~~~~~G~vaiifAl~l~~ll~~~g~avD~~~~~~~k 47 (374) ||.-+...|...++..++++-+++++.++|-++..-..| T Consensus 32 rr~p~~~dg~le~lYiLmvlGfFgFft~GImlsyiRSkk 70 (129) T pfam02060 32 RRSPLGDDGKLEALYILMVLGFFGFFTLGIMLSYIRSKK 70 (129) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 678899986026889999999999999999999999886 No 105 >TIGR02921 PEP_integral PEP-CTERM family integral membrane protein; InterPro: IPR014270 Members of this protein family, found in three different species so far, have a PEP-CTERM sequence at the carboxyl-terminus (IPR013424 from INTERPRO), but are unusual among PEP-CTERM proteins in having multiple predicted transmembrane segments. The function is unknown. It is proposed that a member of the EpsH family, to be designated exosortase (see IPR013426 from INTERPRO), recognises and cleaves PEP-CTERM proteins in a manner analogous to the cleavage of LPXTG proteins by sortase .. Probab=53.48 E-value=14 Score=16.02 Aligned_cols=16 Identities=38% Similarity=0.476 Sum_probs=12.1 Q ss_pred EEEEEEECCCCCCCCC Q ss_conf 2499996578645113 Q gi|254780934|r 172 LDMMIVLDVSRSMESF 187 (374) Q Consensus 172 ~di~~VlD~SgSM~~~ 187 (374) .-+++|+|.|.||+.. T Consensus 637 ~~~A~~~D~S~SMGE~ 652 (952) T TIGR02921 637 TRVALLLDTSRSMGEQ 652 (952) T ss_pred CCEEEEEECCHHHHHH T ss_conf 5268998466002456 No 106 >TIGR03206 benzo_BadH 2-hydroxycyclohexanecarboxyl-CoA dehydrogenase. Members of this protein family are the enzyme 2-hydroxycyclohexanecarboxyl-CoA dehydrogenase. The enzymatic properties were confirmed experimentally in Rhodopseudomonas palustris; the enzyme is homotetrameric, and not sensitive to oxygen. This enzyme is part of proposed pathway for degradation of benzoyl-CoA to 3-hydroxypimeloyl-CoA that differs from the analogous in Thauera aromatica. It also may occur in degradation of the non-aromatic compound cyclohexane-1-carboxylate. Probab=52.80 E-value=15 Score=15.95 Aligned_cols=22 Identities=18% Similarity=0.116 Sum_probs=17.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.+|.++...||++-+|.-|. T Consensus 163 tk~lA~ela~~gIrVNaV~PG~ 184 (250) T TIGR03206 163 SKTMAREHARHGITVNVVCPGP 184 (250) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999653291899997688 No 107 >PRK05867 short chain dehydrogenase; Provisional Probab=52.58 E-value=15 Score=15.93 Aligned_cols=22 Identities=9% Similarity=-0.074 Sum_probs=16.1 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 172 tr~lA~ela~~gIrVN~VaPG~ 193 (253) T PRK05867 172 TKAMAVELAPHKIRVNSVSPGY 193 (253) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999700092999996588 No 108 >PRK12829 short chain dehydrogenase; Provisional Probab=52.36 E-value=15 Score=15.91 Aligned_cols=22 Identities=23% Similarity=-0.026 Sum_probs=16.9 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 171 tr~lA~E~a~~gIrVNaV~PG~ 192 (264) T PRK12829 171 VKSLAIELGPLGIRVNAILPGI 192 (264) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999854094999886288 No 109 >PRK07109 short chain dehydrogenase; Provisional Probab=51.20 E-value=16 Score=15.80 Aligned_cols=34 Identities=12% Similarity=0.045 Sum_probs=17.3 Q ss_pred HHHHHHHHHHHHHHHHCCC-CCCCCCEEEEEEEEC Q ss_conf 7999999887632210023-665673489999826 Q gi|254780934|r 194 KIDMAIKSINAMLEEVKLI-PDVNNVVQSGLVTFS 227 (374) Q Consensus 194 ki~~~~~a~~~~~~~~~~~-~~~~~~~~~~~~~~~ 227 (374) -.-..|-++..|.+.+... ......+++..+.-+ T Consensus 156 aY~ASK~Av~GftesLr~EL~~~~s~I~Vt~V~Pg 190 (338) T PRK07109 156 AYCAAKHAIRGFTDSLRCELLHDGSPVSVTMVQPP 190 (338) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 79999999999999999999867998189997579 No 110 >cd03132 GATase1_catalase Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Catalase catalyzes the dismutation of hydrogen peroxide (H2O2) to water and oxygen. This group includes the large catalases: Neurospora crassa Catalase-1 and Catalase-3 and, Escherichia coli HP-II. This GATase1-like domain has an essential role in HP-II catalase activity. However, it lacks enzymatic activity and the catalytic triad typical of GATase1 domains. Catalase-1 and -3 are homotetrameric, HP-II is homohexameric. It has been proposed that this domain may facilitate the folding and oligomerization process. The interface between this GATase1-like domain of HP-II and the core of the subunit forms part of a channel which provides access to the deeply buried catalase active sites of HPII. Catalase-1 is associated with non-growing cells; C Probab=49.83 E-value=17 Score=15.67 Aligned_cols=59 Identities=17% Similarity=0.256 Sum_probs=25.1 Q ss_pred EEEECCCCCCC--CCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHC---CCCCEEEECCHHHH Q ss_conf 99806877888--88068999999999789879999924822278899823---89817983898999 Q gi|254780934|r 292 VFMTDGENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA---SPNSFYLVENPHSM 354 (374) Q Consensus 292 illTDG~~~~~--~~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~A---s~~~~~~a~~~~~L 354 (374) +++-+|..+.. ..+........++-..+=.|-.+ ..+.++|.... ...-.+..++...+ T Consensus 66 vvipgG~~~~~~L~~~~~~~~fv~eay~h~KpI~a~----~~~~~lL~~agi~~~~~Gvv~~~~~~~~ 129 (142) T cd03132 66 VVVPGGAEAAFALAPSGRALHFVTEAFKHGKPIGAV----GEGSDLLEAAGIPLEDPGVVTADDVKDV 129 (142) T ss_pred EEECCCCCCHHHHCCCHHHHHHHHHHHHCCCEEEEE----CCHHHHHHHCCCCCCCCCEEEECCCCHH T ss_conf 884388779888644867999999999769979993----7729999976979999857981586677 No 111 >pfam04964 Flp_Fap Flp/Fap pilin component. Probab=48.31 E-value=17 Score=15.53 Aligned_cols=29 Identities=17% Similarity=0.299 Sum_probs=18.9 Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 41236862999999999999999999999 Q gi|254780934|r 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIE 39 (374) Q Consensus 11 f~~~~~G~vaiifAl~l~~ll~~~g~avD 39 (374) |+|||+|.-+|=-+|....+-.++-.++. T Consensus 1 F~kde~GaTAIEYgLIaalIav~iI~~~~ 29 (47) T pfam04964 1 FLKDESGATAIEYGLIAALIAVVIIAYVT 29 (47) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 96565641599999999999999999999 No 112 >PRK07576 short chain dehydrogenase; Provisional Probab=47.97 E-value=18 Score=15.49 Aligned_cols=21 Identities=19% Similarity=-0.021 Sum_probs=15.8 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.++..+||++.+|..|. T Consensus 168 k~lA~e~a~~gIrVN~IaPG~ 188 (260) T PRK07576 168 RTLALEWGPEGVRVNSISPGP 188 (260) T ss_pred HHHHHHHHHCCEEEEEEEECC T ss_conf 999999713392999983477 No 113 >PRK06124 gluconate 5-dehydrogenase; Provisional Probab=47.40 E-value=18 Score=15.44 Aligned_cols=22 Identities=18% Similarity=-0.003 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 174 tr~lA~ela~~gIrVN~I~PG~ 195 (259) T PRK06124 174 MRALAAEFGPHGITSNAIAPGY 195 (259) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999651397999997588 No 114 >PRK12384 sorbitol-6-phosphate dehydrogenase; Provisional Probab=47.12 E-value=18 Score=15.41 Aligned_cols=22 Identities=9% Similarity=0.039 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 165 tk~lA~e~a~~gIrVN~I~PG~ 186 (259) T PRK12384 165 TQSLALDLAEYGITVHSLMLGN 186 (259) T ss_pred HHHHHHHHHHHCEEEEEEECCC T ss_conf 9999999623197999983887 No 115 >PRK07067 sorbitol dehydrogenase; Provisional Probab=46.98 E-value=18 Score=15.40 Aligned_cols=22 Identities=23% Similarity=0.104 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 163 Tr~lA~ela~~gIrVNaV~PG~ 184 (256) T PRK07067 163 TQSAALALARHGINVNAIAPGV 184 (256) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999999704292899995488 No 116 >PRK07775 short chain dehydrogenase; Provisional Probab=46.80 E-value=18 Score=15.38 Aligned_cols=20 Identities=10% Similarity=-0.084 Sum_probs=14.7 Q ss_pred HHHHHHHHHCCCEEEEEEEC Q ss_conf 99999999789879999924 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~ 328 (374) ..++.++...||++.+|.-| T Consensus 171 ~~La~El~~~gIrVn~V~PG 190 (275) T PRK07775 171 TNLQMELEGTGVRASIVHPG 190 (275) T ss_pred HHHHHHHCCCCEEEEEEECC T ss_conf 99999856569089999726 No 117 >COG1681 FlaB Archaeal flagellins [Cell motility and secretion] Probab=46.06 E-value=19 Score=15.31 Aligned_cols=39 Identities=18% Similarity=0.230 Sum_probs=22.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHH Q ss_conf 36862999999999999999999--9999999999999999 Q gi|254780934|r 14 NYKGGMTILTAIFLPIIFLVLGM--IIEVSHIFFMKTVLHS 52 (374) Q Consensus 14 ~~~G~vaiifAl~l~~ll~~~g~--avD~~~~~~~ks~Lq~ 52 (374) +|||-+.|=+++.++.|+++++. +|=.......+.|-|. T Consensus 1 ~rrG~~GIgtlIVfIAmVlVAAVaA~VlInt~g~lqQKa~~ 41 (209) T COG1681 1 DRRGATGIGTLIVFIAMVLVAAVAAYVLINTGGFLQQKAKA 41 (209) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHH T ss_conf 98411043289999999999999999986534202245677 No 118 >PRK09186 flagellin modification protein A; Provisional Probab=44.41 E-value=20 Score=15.16 Aligned_cols=53 Identities=15% Similarity=0.222 Sum_probs=29.1 Q ss_pred HHHHHHHHHHCCCEEEEEEECCC---CCHHHHHHHC---CCCCEEEECCHHHHHHHHHHHHH Q ss_conf 99999999978987999992482---2278899823---89817983898999999999998 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRACA---SPNSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~~---~~~~~L~~~A---s~~~~~~a~~~~~L~~~f~~I~~ 363 (374) ++.++.++...||+|.+|..|.- ..+.+++... ...+. -+++|+..+.-=++. T Consensus 178 tr~lA~e~a~~gIrVN~VaPG~i~~~~~~~~~~~~~~~~~~~~~---~~p~dia~~v~fL~S 236 (255) T PRK09186 178 TKYLAKYFKDSNIRVNCVSPGGILDNQPEAFLNAYKKSCNGKGM---LDPEDICGSLVFLLS 236 (255) T ss_pred HHHHHHHHCCCCEEEEEEEECCCCCCCCHHHHHHHHHHCCCCCC---CCHHHHHHHHHHHHC T ss_conf 99999996758989999855768899989999999863557799---899999999999957 No 119 >PRK07326 short chain dehydrogenase; Provisional Probab=44.08 E-value=20 Score=15.13 Aligned_cols=53 Identities=9% Similarity=-0.001 Sum_probs=28.6 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCCCCCEEEECCHHHHHHHHHHHHH Q ss_conf 9999999978987999992482227889982389817983898999999999998 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~ 363 (374) ..++.+++..||++.+|.-|. .+.++....-.....+. -.++++.++.--+.. T Consensus 164 ~~la~El~~~gIrVn~v~PG~-v~T~~~~~~~~~~~~~~-l~PedVA~av~flls 216 (235) T PRK07326 164 EALMLDLRDYDVRVSTIMPGS-VATHFNGHPPGEDDAWK-IQPEDVAQAVLDLLR 216 (235) T ss_pred HHHHHHHCCCCEEEEEEEECC-EECCCCCCCCCCCCCCC-CCHHHHHHHHHHHHC T ss_conf 999998474694999998058-90788899986221137-999999999999984 No 120 >PRK05650 short chain dehydrogenase; Provisional Probab=43.95 E-value=20 Score=15.12 Aligned_cols=18 Identities=11% Similarity=-0.145 Sum_probs=10.3 Q ss_pred HHHHHHHCCCEEEEEEEC Q ss_conf 999999789879999924 Q gi|254780934|r 311 YCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 311 ~~~~~k~~gi~IytIg~~ 328 (374) +..+++..||+|.+|--| T Consensus 163 L~~El~~~gI~V~~v~PG 180 (270) T PRK05650 163 LLVELADDEIGVHVVCPS 180 (270) T ss_pred HHHHHCCCCCEEEEEECC T ss_conf 999853219689999738 No 121 >PRK12936 3-ketoacyl-(acyl-carrier-protein) reductase NodG; Reviewed Probab=42.42 E-value=22 Score=14.98 Aligned_cols=22 Identities=18% Similarity=0.140 Sum_probs=16.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 163 trslA~ela~~gIrVN~IaPG~ 184 (245) T PRK12936 163 SKSLAQEIATRNVTVNCVAPGF 184 (245) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999705292999997576 No 122 >PRK09242 tropinone reductase; Provisional Probab=42.03 E-value=22 Score=14.94 Aligned_cols=22 Identities=18% Similarity=0.045 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 172 tr~lA~ela~~gIrVN~V~PG~ 193 (258) T PRK09242 172 TRNLAVEWAEDGIRVNAVAPWY 193 (258) T ss_pred HHHHHHHHHCCCEEEEEEEECC T ss_conf 9999999802798999983588 No 123 >COG1991 Uncharacterized conserved protein [Function unknown] Probab=41.91 E-value=8.1 Score=17.51 Aligned_cols=35 Identities=17% Similarity=0.233 Sum_probs=28.9 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 78886412368629999999999999999999999 Q gi|254780934|r 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEV 40 (374) Q Consensus 6 ~~~~~f~~~~~G~vaiifAl~l~~ll~~~g~avD~ 40 (374) +-+..|..+-||.+.+=|.|+++.++++++.++-| T Consensus 3 ~~i~~~~~~nkgQiSLEf~Ll~l~ivla~~i~~~y 37 (131) T COG1991 3 TYITKIILSNKGQISLEFSLLLLAIVLAASIAGAY 37 (131) T ss_pred CEEEEEEECCCCCEEEEHHHHHHHHHHHHHHEEEE T ss_conf 16625422466625641389999999973121148 No 124 >PRK06181 short chain dehydrogenase; Provisional Probab=41.65 E-value=22 Score=14.90 Aligned_cols=21 Identities=14% Similarity=-0.053 Sum_probs=15.3 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) ..++.+++..||+|.+|.-|. T Consensus 162 ~~la~El~~~gIrVn~v~PG~ 182 (263) T PRK06181 162 DSLRIELADTGVAVTVVCPGF 182 (263) T ss_pred HHHHHHHCCCCEEEEEEECCC T ss_conf 999998475593999997288 No 125 >PRK12828 short chain dehydrogenase; Provisional Probab=41.37 E-value=22 Score=14.88 Aligned_cols=54 Identities=19% Similarity=0.111 Sum_probs=28.2 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCCCCCEEEECCHHHHHHHHHHHH Q ss_conf 9999999997898799999248222788998238981798389899999999999 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~ 362 (374) ++.++.++...||++.+|.-|.-. .+.++.-...+.+=.-.+++|+.++.-=++ T Consensus 165 tk~lA~e~~~~gIrVN~V~PG~v~-T~~~~~~~~~~~~~r~~~p~diA~~v~fL~ 218 (239) T PRK12828 165 TEALAAELLDRGITVNAVLPSIID-TPPNRADMPDADFSRWVTPEQIAAVIAFLL 218 (239) T ss_pred HHHHHHHHHHHCEEEEEEEECCCC-CCCHHCCCCHHHCCCCCCHHHHHHHHHHHH T ss_conf 999999861309089999738788-820024185646179899999999999995 No 126 >pfam11411 DNA_ligase_IV DNA ligase IV. DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface. Probab=41.28 E-value=21 Score=15.07 Aligned_cols=22 Identities=23% Similarity=0.514 Sum_probs=19.2 Q ss_pred CCCEEEECCHHHHHHHHHHHHH Q ss_conf 9817983898999999999998 Q gi|254780934|r 342 PNSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 342 ~~~~~~a~~~~~L~~~f~~I~~ 363 (374) ++.||.-++.++|.++|..|.+ T Consensus 14 GDSy~~dt~~~qLk~vF~~i~~ 35 (36) T pfam11411 14 GDSYFVDTDEQQLKDVFHRIKK 35 (36) T ss_pred CCCEEECCCHHHHHHHHHHHCC T ss_conf 5400104858999999987504 No 127 >PRK07479 consensus Probab=40.95 E-value=23 Score=14.84 Aligned_cols=21 Identities=10% Similarity=-0.051 Sum_probs=15.3 Q ss_pred HHHHHHHHHHCCCEEEEEEEC Q ss_conf 999999999789879999924 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~ 328 (374) ++.++.++...||++.+|.-| T Consensus 166 tr~lA~el~~~gIrVN~I~Pg 186 (252) T PRK07479 166 TKAMAAELAPDNIRVNCLNPV 186 (252) T ss_pred HHHHHHHHHHHCEEEEEEECC T ss_conf 999999951409699999669 No 128 >PRK07677 short chain dehydrogenase; Provisional Probab=40.80 E-value=23 Score=14.82 Aligned_cols=20 Identities=30% Similarity=0.125 Sum_probs=12.9 Q ss_pred HHHHHH-HHCCCEEEEEEECC Q ss_conf 999999-97898799999248 Q gi|254780934|r 310 YYCNEA-KKRGAIVYAIGIRV 329 (374) Q Consensus 310 ~~~~~~-k~~gi~IytIg~~~ 329 (374) .++.++ ++.||++.+|..|. T Consensus 166 ~lA~ela~~~gIrvN~I~PG~ 186 (254) T PRK07677 166 TLAVEWGRKYGIRVNAIAPGP 186 (254) T ss_pred HHHHHHCCCCCEEEEEEEECC T ss_conf 999985723398999994276 No 129 >PRK07825 short chain dehydrogenase; Provisional Probab=40.55 E-value=23 Score=14.80 Aligned_cols=19 Identities=16% Similarity=-0.017 Sum_probs=10.9 Q ss_pred HHHHHHHHCCCEEEEEEEC Q ss_conf 9999999789879999924 Q gi|254780934|r 310 YYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 310 ~~~~~~k~~gi~IytIg~~ 328 (374) .+..+++..||++.+|.-| T Consensus 163 sLa~El~~~gIrVn~V~PG 181 (273) T PRK07825 163 ALRLELRPTGVHVSVVLPT 181 (273) T ss_pred HHHHHHCCCCCEEEEEEEC T ss_conf 9999852309599999709 No 130 >PRK07831 short chain dehydrogenase; Provisional Probab=39.92 E-value=24 Score=14.74 Aligned_cols=56 Identities=21% Similarity=0.256 Sum_probs=28.8 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCC--------CHHHHHHHCCCCCEEEECCHHHHHHHHHHHHH Q ss_conf 999999999789879999924822--------27889982389817983898999999999998 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRVIR--------SHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~~~--------~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~ 363 (374) ++.++.+....||++.+|..|.-. .++.++......-+=..-.++|+.++.-=++. T Consensus 180 Tk~lA~e~a~~gIrVNaI~PG~i~t~~~~~~~~~~~~~~~~~~~p~gR~g~pediA~~v~fLaS 243 (261) T PRK07831 180 TRCSAIEAAEYGVRINAVAPSIARHKFLKKVTSAELLDRLASGEAFGRAAEPWEVAAVIAFLAS 243 (261) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCCCHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 9999999845290899995587677022213999999998707997897599999999999958 No 131 >PRK05866 short chain dehydrogenase; Provisional Probab=38.67 E-value=25 Score=14.62 Aligned_cols=21 Identities=14% Similarity=-0.016 Sum_probs=15.8 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) ..++.++...||++.+|.-|. T Consensus 204 ~sLa~El~~~gIrVn~V~PG~ 224 (290) T PRK05866 204 RVIETEWGDRGVHSTTLYYPL 224 (290) T ss_pred HHHHHHHCCCCEEEEEEECCC T ss_conf 999998526196999997688 No 132 >cd06325 PBP1_ABC_uncharacterized_transporter Type I periplasmic ligand-binding domain of uncharacterized ABC-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); its ligand specificity has not been determined experimentally. Probab=38.44 E-value=25 Score=14.60 Aligned_cols=30 Identities=20% Similarity=-0.001 Sum_probs=16.9 Q ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEE Q ss_conf 79998068778888806899999999978987999 Q gi|254780934|r 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 (374) Q Consensus 290 ~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~Iyt 324 (374) .+++.+|.-.. .....+...+.+.+|.+|+ T Consensus 187 al~~~~d~~v~-----s~~~~i~~~a~~~~iPv~~ 216 (281) T cd06325 187 AIYVPTDNTVA-----SAMEAVVKVANEAKIPVIA 216 (281) T ss_pred EEEEECCCHHH-----HHHHHHHHHHHHCCCCEEE T ss_conf 99991881277-----7999999999874998893 No 133 >TIGR01382 PfpI intracellular protease, PfpI family; InterPro: IPR006286 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad . This group of cysteine peptidases belong to MEROPS peptidase family C56 (Pfp1 endopeptidase, clan PC(C)). The member of this family from Pyrococcus horikoshii has been solved to 2 Angstrom resolution. It is an ATP-independent intracellular protease that crystallises as a hexameric ring. Cys-101 is proposed as the active site residue in a catalytic triad with the adjacent His-102 and a Glu residue from an adjacent monomer. A member of this family from Bacillus subtilis, GSP18, has been shown to be expressed in response to several forms of stress. A role in the degradation of small peptides has been suggested. This family is contained in a larger one of the thiamine biosynthesis protein ThiJ and its homologs.; GO: 0016798 hydrolase activity acting on glycosyl bonds. Probab=38.13 E-value=25 Score=14.57 Aligned_cols=21 Identities=19% Similarity=0.063 Sum_probs=12.4 Q ss_pred EEEECCHHHHHHHHHHHHHHH Q ss_conf 798389899999999999842 Q gi|254780934|r 345 FYLVENPHSMYDAFSHIGKDI 365 (374) Q Consensus 345 ~~~a~~~~~L~~~f~~I~~~i 365 (374) +-.+..+++|....++|.+.| T Consensus 168 ~itar~p~~l~aF~~~~~~~L 188 (189) T TIGR01382 168 LITARVPADLPAFNSEILKLL 188 (189) T ss_pred EEECCCCHHHHHHHHHHHHHC T ss_conf 994288112788999999971 No 134 >PRK08340 glucose-1-dehydrogenase; Provisional Probab=37.91 E-value=25 Score=14.55 Aligned_cols=21 Identities=14% Similarity=0.078 Sum_probs=15.5 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.+....||++.+|..|. T Consensus 163 k~lA~e~~~~gIrvN~v~pG~ 183 (259) T PRK08340 163 KGVSRTYGGRGIRAYTVLLGS 183 (259) T ss_pred HHHHHHHHHHCEEEEEEEECC T ss_conf 999999842291999985488 No 135 >PRK08217 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=37.69 E-value=26 Score=14.53 Aligned_cols=51 Identities=24% Similarity=0.298 Sum_probs=28.6 Q ss_pred HHHHHHHHHCCCEEEEEEECCCC-------CHHHHHHHCC--C-CCEEEECCHHHHHHHHHHHH Q ss_conf 99999999789879999924822-------2788998238--9-81798389899999999999 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRACAS--P-NSFYLVENPHSMYDAFSHIG 362 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~~-------~~~~L~~~As--~-~~~~~a~~~~~L~~~f~~I~ 362 (374) +.++.++...||++.+|..|.-. .++.+++... | +++ -.++|+.++..=+. T Consensus 175 k~lA~el~~~gIrVN~I~PG~i~T~~~~~~~~~~~~~~~~~~pl~R~---g~p~dva~~v~fL~ 235 (253) T PRK08217 175 VTWAKELARYGIRVAAIAPGVIETEMTAAMKPEALERLEKMIPVGRL---GEPEEIAHTVRFII 235 (253) T ss_pred HHHHHHHHHCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHCCCCCCC---CCHHHHHHHHHHHH T ss_conf 99999953219599999738898733111799999999857999998---49999999999999 No 136 >PRK07774 short chain dehydrogenase; Provisional Probab=37.46 E-value=26 Score=14.51 Aligned_cols=21 Identities=19% Similarity=-0.050 Sum_probs=16.0 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.++-..||+|.+|..|. T Consensus 167 k~lA~el~~~gIrVN~V~PG~ 187 (250) T PRK07774 167 QQLARELGGMNIRVNAIAPGP 187 (250) T ss_pred HHHHHHHHHCCEEEEEEEECC T ss_conf 999999706494899997387 No 137 >PRK07666 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=37.11 E-value=26 Score=14.48 Aligned_cols=50 Identities=12% Similarity=0.059 Sum_probs=26.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHC-CCCCEEEECCHHHHHHHH Q ss_conf 999999999789879999924822278899823-898179838989999999 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVENPHSMYDAF 358 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~A-s~~~~~~a~~~~~L~~~f 358 (374) +..++.+++..||++.+|.-|. ...++.+... ..+.--..-.++++.++. T Consensus 166 t~~la~El~~~gIrVn~v~PG~-v~T~m~~~~~~~~~~~~~~~~PedVA~~v 216 (238) T PRK07666 166 TESLMMEVRKHNIRVTALTPST-VATDMAVDLGLTDGNPDKVMQPEDLAEFI 216 (238) T ss_pred HHHHHHHHCCCCEEEEEEEECC-CCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 9999998541396999998588-98624678777878830257999999999 No 138 >pfam04392 ABC_sub_bind ABC transporter substrate binding protein. This family contains many hypothetical proteins and some ABC transporter substrate binding proteins. Probab=36.23 E-value=27 Score=14.39 Aligned_cols=32 Identities=22% Similarity=0.017 Sum_probs=17.1 Q ss_pred EEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 2799980687788888068999999999789879999 Q gi|254780934|r 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 (374) Q Consensus 289 k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytI 325 (374) ..+++++|.--. .....+...+.+.++.+|+. T Consensus 185 Dal~i~~d~~v~-----s~~~~i~~~a~~~kiPv~~~ 216 (292) T pfam04392 185 DAIFIPTDNLIA-----SAFTAVLQEANKAKIPVITS 216 (292) T ss_pred CEEEEECCCCHH-----HHHHHHHHHHHHCCCCEEEC T ss_conf 889993781078-----89999999999749998957 No 139 >PRK05653 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Validated Probab=35.84 E-value=27 Score=14.35 Aligned_cols=52 Identities=21% Similarity=0.260 Sum_probs=27.7 Q ss_pred HHHHHHHHHCCCEEEEEEECCC-------CCHHHHHHHCC--C-CCEEEECCHHHHHHHHHHHHH Q ss_conf 9999999978987999992482-------22788998238--9-817983898999999999998 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVI-------RSHEFLRACAS--P-NSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~-------~~~~~L~~~As--~-~~~~~a~~~~~L~~~f~~I~~ 363 (374) +.++.++...||++.+|..|.- ..++..+.... | +++ -+++|+.++..=+.. T Consensus 166 ~~la~e~~~~~IrvN~I~PG~i~T~~~~~~~~~~~~~~~~~~Pl~R~---~~p~dia~~v~fL~S 227 (246) T PRK05653 166 KALALELASRGITVNAVAPGFIDTDMTRALPEEVKEALLKQIPLGRL---GTPEEVANAVAFLAS 227 (246) T ss_pred HHHHHHHHHHCEEEEEEEECCCCCCCCCCCCHHHHHHHHHCCCCCCC---CCHHHHHHHHHHHHC T ss_conf 99999950439399999638887723111689999999847998998---399999999999968 No 140 >COG5242 TFB4 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription / DNA replication, recombination, and repair] Probab=35.83 E-value=27 Score=14.35 Aligned_cols=44 Identities=23% Similarity=0.334 Sum_probs=32.2 Q ss_pred HHHCCCEEEEEEECCCCCHHHHHHH--CCCCCEEEECCHHHHHHHHHH Q ss_conf 9978987999992482227889982--389817983898999999999 Q gi|254780934|r 315 AKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 (374) Q Consensus 315 ~k~~gi~IytIg~~~~~~~~~L~~~--As~~~~~~a~~~~~L~~~f~~ 360 (374) +.+.||.|..+-++.. ..+|.+| |++|-|..+++.+.|.+.+-. T Consensus 179 Aqk~~ipI~v~~i~g~--s~fl~Q~~daTgG~Yl~ve~~eGllqyL~~ 224 (296) T COG5242 179 AQKFGIPISVFSIFGN--SKFLLQCCDATGGDYLTVEDTEGLLQYLLS 224 (296) T ss_pred HHHCCCCEEEEEECCC--CHHHHHHHHCCCCEEEEECCCHHHHHHHHH T ss_conf 6434981489982486--178998763448726862482069999999 No 141 >PRK07814 short chain dehydrogenase; Provisional Probab=35.69 E-value=28 Score=14.34 Aligned_cols=19 Identities=16% Similarity=-0.111 Sum_probs=12.6 Q ss_pred HHHHHHHHCCCEEEEEEECC Q ss_conf 99999997898799999248 Q gi|254780934|r 310 YYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 310 ~~~~~~k~~gi~IytIg~~~ 329 (374) .++.++.+ +|++-+|..|. T Consensus 173 ~lA~e~a~-~IrVN~V~PG~ 191 (263) T PRK07814 173 LAALDLCP-RIRVNAIAPGS 191 (263) T ss_pred HHHHHHCC-CCEEEEEEECC T ss_conf 99999779-97899997798 No 142 >PRK08324 short chain dehydrogenase; Validated Probab=35.64 E-value=28 Score=14.33 Aligned_cols=81 Identities=12% Similarity=0.042 Sum_probs=54.0 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCCCCC----EEEECCHHHHHHHHHHHH Q ss_conf 6427999806877888880689999999997898799999248222788998238981----798389899999999999 Q gi|254780934|r 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS----FYLVENPHSMYDAFSHIG 362 (374) Q Consensus 287 ~~k~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As~~~----~~~a~~~~~L~~~f~~I~ 362 (374) ..-.++|+|=|.. .--.+.|..+.+.|..|...+...+.-++..+.+..++. ..++++.+++.++|+++. T Consensus 419 L~GKVALVTGga~------GIG~A~A~~fa~eGA~Vvl~D~~~~~l~~~a~el~~~~~~~~~~~DVtd~~~v~~~v~~~~ 492 (676) T PRK08324 419 LAGKVALVTGAAG------GIGLATAKRLAAEGACVVLADIDEEAAEAAAAELGGRDRALGVACDVTDEAAVQAAFEEAA 492 (676) T ss_pred CCCCEEEEECCCC------CHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHH T ss_conf 8998799947988------1629999999987998999958889999999997079947999806899999999999999 Q ss_pred HHHHEEEEEEE Q ss_conf 84200166620 Q gi|254780934|r 363 KDIVTKRIWYD 373 (374) Q Consensus 363 ~~i~~~~~~~~ 373 (374) ++--..-+++| T Consensus 493 ~~fGgIDiLVn 503 (676) T PRK08324 493 LAFGGVDIVVS 503 (676) T ss_pred HHHCCCCEEEE T ss_conf 98599888997 No 143 >pfam06415 iPGM_N BPG-independent PGAM N-terminus (iPGM_N). This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (EC:5.4.2.1). The family is found in conjunction with pfam01676 (located in the C-terminal region of the protein). Probab=35.14 E-value=28 Score=14.28 Aligned_cols=19 Identities=21% Similarity=0.216 Sum_probs=8.8 Q ss_pred EEEEEECCCCCHHHHHHHC Q ss_conf 9999924822278899823 Q gi|254780934|r 322 VYAIGIRVIRSHEFLRACA 340 (374) Q Consensus 322 IytIg~~~~~~~~~L~~~A 340 (374) |...-|..+...++.+... T Consensus 172 vif~NFR~DRaRqi~~a~~ 190 (223) T pfam06415 172 VIFFNFRPDRARQLSHAFV 190 (223) T ss_pred EEEEECCCHHHHHHHHHHC T ss_conf 9997357578999999963 No 144 >PRK13394 3-hydroxybutyrate dehydrogenase; Provisional Probab=35.09 E-value=28 Score=14.28 Aligned_cols=22 Identities=9% Similarity=-0.010 Sum_probs=16.2 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.+....||++.+|.-|. T Consensus 168 tk~lA~E~a~~gIrVN~V~PG~ 189 (262) T PRK13394 168 ARVLAKEGAKHNVRSHVVCPGF 189 (262) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999998523196999997587 No 145 >PRK06113 7-alpha-hydroxysteroid dehydrogenase; Validated Probab=33.03 E-value=31 Score=14.08 Aligned_cols=22 Identities=9% Similarity=-0.034 Sum_probs=17.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++-+|..|. T Consensus 170 tk~lA~ela~~gIrVN~V~PG~ 191 (255) T PRK06113 170 VRNMAFDLGEKNIRVNGIAPGA 191 (255) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999999826494999986488 No 146 >PRK12938 acetyacetyl-CoA reductase; Provisional Probab=32.98 E-value=31 Score=14.07 Aligned_cols=22 Identities=14% Similarity=0.262 Sum_probs=16.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||+|.+|.-|. T Consensus 164 tk~lA~Ela~~gIrVN~VaPG~ 185 (246) T PRK12938 164 TMSLAQEVATKGVTVNTVSPGY 185 (246) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999604398999996687 No 147 >PRK12429 3-hydroxybutyrate dehydrogenase; Provisional Probab=32.36 E-value=31 Score=14.01 Aligned_cols=21 Identities=24% Similarity=0.088 Sum_probs=16.3 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.++...||++.+|..|. T Consensus 165 ~~lA~e~~~~gIrvN~V~PG~ 185 (258) T PRK12429 165 KVVALEGATHGVTVNAICPGY 185 (258) T ss_pred HHHHHHHHHHCEEEEEEEECC T ss_conf 999998532097999997487 No 148 >pfam10526 NADH_ub_rd_NUML NADH-ubiquinone reductase complex 1 MLRQ subunit. This subunit appears to be a recent vertebrate addition to the MADH-ubiquinone reductase complex 1, acting within the membrane. its exact function is not known, but it is highly expressed in muscle and neural tissue, indicative of a role in ATP generation. Probab=31.67 E-value=32 Score=13.94 Aligned_cols=25 Identities=16% Similarity=0.364 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999 Q gi|254780934|r 26 FLPIIFLVLGMIIEVSHIFFMKTVL 50 (374) Q Consensus 26 ~l~~ll~~~g~avD~~~~~~~ks~L 50 (374) .|+||++++|++.-.+-.|..|.-| T Consensus 12 ~LIPLfv~ig~g~~gA~~y~~rlal 36 (80) T pfam10526 12 ALIPLFVFIGAGATGATLYLLRLAL 36 (80) T ss_pred CHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 1324899994138899999999980 No 149 >PRK06949 short chain dehydrogenase; Provisional Probab=31.33 E-value=33 Score=13.91 Aligned_cols=22 Identities=23% Similarity=0.022 Sum_probs=16.8 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 177 tr~lA~ela~~gIrVN~IaPG~ 198 (258) T PRK06949 177 TRAMALEWGRHGINVNAICPGY 198 (258) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999999622197999996578 No 150 >TIGR01963 PHB_DH 3-hydroxybutyrate dehydrogenase; InterPro: IPR011294 This entry represents a subfamily of the short chain dehydrogenases. Characterised members so far as 3-hydroxybutyrate dehydrogenases and are found in species that accumulate ester polymers called polyhydroxyalkanoic acids (PHAs) under certain conditions. Several members of the family are from species not known to accumulate PHAs, including Oceanobacillus iheyensis and Bacillus subtilis. However, polymer formation is not required for there be a role for 3-hydroxybutyrate dehydrogenase; it may be members of this family have the same function in those species.; GO: 0003858 3-hydroxybutyrate dehydrogenase activity. Probab=31.33 E-value=33 Score=13.91 Aligned_cols=22 Identities=14% Similarity=0.224 Sum_probs=12.0 Q ss_pred ECCCCHHHHHHHHHHCCCCCCC Q ss_conf 2047868999998601336877 Q gi|254780934|r 235 LLEWGVSHLQRKIKYLSKFGVS 256 (374) Q Consensus 235 ~~t~~~~~~~~~i~~l~~~g~~ 256 (374) .+|+-+..++..+=-+...|+| T Consensus 111 ~LtsaF~t~raAlP~Mk~~gwG 132 (258) T TIGR01963 111 MLTSAFHTIRAALPHMKKQGWG 132 (258) T ss_pred CCCHHHHHHHHCCCCCCCCCCC T ss_conf 1688899997506432137855 No 151 >PRK06484 short chain dehydrogenase; Validated Probab=30.86 E-value=33 Score=13.86 Aligned_cols=65 Identities=11% Similarity=0.008 Sum_probs=29.3 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCCCCCE--EEECCHHHHHHHHHHHHHHHHEEEEEEE Q ss_conf 9999999978987999992482227889982389817--9838989999999999984200166620 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF--YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As~~~~--~~a~~~~~L~~~f~~I~~~i~~~~~~~~ 373 (374) ..++..+.+.|.+|.......+....+.+.+...... .++.+.+++...++++.++--..-+++| T Consensus 288 ~aiA~~la~~GA~Vvi~d~~~~~~~~~~~~~g~~~~~~~~Dv~~~~~v~~~v~~~~~~fG~iDiLVN 354 (530) T PRK06484 288 AAIADRFAALGDRVAIIDSDGEEAVKLREILGGEHLSWQVDITDEASVESAFAGIQGRLGPLDVLVN 354 (530) T ss_pred HHHHHHHHHCCCEEEEEECCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 9999999988798999958889999999973997369995389999999999999998299889998 No 152 >PRK06057 short chain dehydrogenase; Provisional Probab=30.84 E-value=33 Score=13.86 Aligned_cols=22 Identities=18% Similarity=-0.005 Sum_probs=16.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.+....||++.+|..|. T Consensus 165 Tr~lA~e~a~~gIrVN~IaPG~ 186 (255) T PRK06057 165 SRELGVQFARQGIRVNALCPGP 186 (255) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999998603193999997387 No 153 >PRK07776 consensus Probab=30.63 E-value=34 Score=13.84 Aligned_cols=20 Identities=20% Similarity=-0.004 Sum_probs=13.3 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.++.. +|+|.+|..|. T Consensus 168 k~lA~e~a~-~IrVN~V~PG~ 187 (252) T PRK07776 168 KQLALELAP-RVRVNAVAPGV 187 (252) T ss_pred HHHHHHHCC-CCEEEEEEECC T ss_conf 999999869-98899996457 No 154 >PRK08213 gluconate 5-dehydrogenase; Provisional Probab=30.06 E-value=34 Score=13.78 Aligned_cols=22 Identities=23% Similarity=0.078 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 177 tr~lA~e~a~~gIrVNaIaPG~ 198 (259) T PRK08213 177 TRALAAEWGPHGIRVNAIAPGF 198 (259) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999999610391999997798 No 155 >PRK07231 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=29.92 E-value=34 Score=13.76 Aligned_cols=22 Identities=18% Similarity=0.015 Sum_probs=16.6 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 165 t~~lA~el~~~gIrVN~I~PG~ 186 (250) T PRK07231 165 TKSLAVELAPDNIRVNAVNPVV 186 (250) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999999534095999996387 No 156 >PRK08267 short chain dehydrogenase; Provisional Probab=29.53 E-value=35 Score=13.72 Aligned_cols=21 Identities=14% Similarity=-0.092 Sum_probs=14.5 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) ..++.+++..||+|.+|.-|. T Consensus 161 ~sla~El~~~gIrVn~v~PG~ 181 (258) T PRK08267 161 EALDLEWRRHGIRVADVMPLF 181 (258) T ss_pred HHHHHHHCCCCCEEEEEEECC T ss_conf 999998430191899997188 No 157 >PRK06346 consensus Probab=29.39 E-value=35 Score=13.71 Aligned_cols=22 Identities=14% Similarity=-0.068 Sum_probs=16.1 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.+....||++-+|..|. T Consensus 166 tr~lA~e~a~~gIrvN~I~PG~ 187 (251) T PRK06346 166 TKNTGFMYANKGIRCNAIAPGA 187 (251) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999998624195999987688 No 158 >PRK07097 gluconate 5-dehydrogenase; Provisional Probab=29.03 E-value=36 Score=13.67 Aligned_cols=22 Identities=14% Similarity=0.037 Sum_probs=16.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++.+.||+|.+|.-|. T Consensus 170 tr~lA~e~a~~gIrVN~V~PG~ 191 (265) T PRK07097 170 TKNIASEYGEANIQCNGIGPGY 191 (265) T ss_pred HHHHHHHHHHCCEEEEEEEECC T ss_conf 9999999702495999996588 No 159 >PRK06172 short chain dehydrogenase; Provisional Probab=28.52 E-value=36 Score=13.62 Aligned_cols=22 Identities=27% Similarity=0.138 Sum_probs=16.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 168 tr~lA~e~a~~gIrVNaV~PG~ 189 (253) T PRK06172 168 TKSAAIEYAKKGIRVNAVCPAV 189 (253) T ss_pred HHHHHHHHHHHCCEEEEEEECC T ss_conf 9999998633187899997797 No 160 >PRK07890 short chain dehydrogenase; Provisional Probab=28.52 E-value=36 Score=13.62 Aligned_cols=22 Identities=18% Similarity=0.113 Sum_probs=16.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||+|.+|..|. T Consensus 165 tk~lA~ela~~gIrVN~V~PG~ 186 (258) T PRK07890 165 SQSLATELGPQGIRVNSVAPGY 186 (258) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999714095999995187 No 161 >PRK06227 consensus Probab=28.39 E-value=37 Score=13.60 Aligned_cols=22 Identities=14% Similarity=-0.091 Sum_probs=16.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 165 Tr~lA~ela~~gIrVNaVaPG~ 186 (256) T PRK06227 165 THSLAVSLSKYKIRVVSISPGW 186 (256) T ss_pred HHHHHHHHCCCCEEEEEEECCC T ss_conf 9999999620294999996186 No 162 >PRK06463 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=28.32 E-value=37 Score=13.60 Aligned_cols=21 Identities=24% Similarity=0.025 Sum_probs=15.7 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.++...||++.+|.-|. T Consensus 163 r~lA~ela~~gIrVNaVaPG~ 183 (254) T PRK06463 163 KRLAFELGKYGIRVNAVAPGW 183 (254) T ss_pred HHHHHHHHHCCEEEEEEEECC T ss_conf 999999702395999998688 No 163 >PRK08085 gluconate 5-dehydrogenase; Provisional Probab=28.24 E-value=37 Score=13.59 Aligned_cols=22 Identities=18% Similarity=0.149 Sum_probs=16.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.+|.++...||++.+|..|. T Consensus 169 tr~lA~e~a~~~IrvN~IaPG~ 190 (254) T PRK08085 169 TRGMCVELARHNIQVNGIAPGY 190 (254) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999672796999997688 No 164 >TIGR01822 2am3keto_CoA 2-amino-3-ketobutyrate coenzyme A ligase; InterPro: IPR011282 This entry represents a narrowly defined clade of animal and bacterial (almost exclusively proteobacterial) 2-amino-3-ketobutyrate--CoA ligase. This enzyme can act in threonine catabolism. The closest homologue from Bacillus subtilis, and sequences like it, may be functionally equivalent but were not included in the model because of difficulty in finding reports of function.; GO: 0008890 glycine C-acetyltransferase activity. Probab=27.53 E-value=38 Score=13.51 Aligned_cols=59 Identities=19% Similarity=0.399 Sum_probs=31.0 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCC---CHHHHH-HHCCCCCEEEECCHHHH---HHHHHHHHH Q ss_conf 99980687788888068999999999789879999924822---278899-82389817983898999---999999998 Q gi|254780934|r 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---SHEFLR-ACASPNSFYLVENPHSM---YDAFSHIGK 363 (374) Q Consensus 291 iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~---~~~~L~-~~As~~~~~~a~~~~~L---~~~f~~I~~ 363 (374) -|+|-|- ......++++-+.|| |.|||-.|. ++--.| |+. .+.+.++| .++|.+|.+ T Consensus 327 pvMlyDA--------~~Aq~~A~~Ll~~Gi--Yv~GFfYPVVPKGqARIRvQ~S------A~H~~e~l~~a~~AF~~~G~ 390 (395) T TIGR01822 327 PVMLYDA--------KLAQRFAERLLEEGI--YVIGFFYPVVPKGQARIRVQIS------AVHTEEQLDRAVEAFTRVGR 390 (395) T ss_pred EEECCCH--------HHHHHHHHHHHHCCE--EEEEEEECCCCCCCCEEEEEEC------CCCCHHHHHHHHHHHHHHHH T ss_conf 2321147--------899999999987792--8997452643788612444102------56898899999999999878 Q ss_pred HH Q ss_conf 42 Q gi|254780934|r 364 DI 365 (374) Q Consensus 364 ~i 365 (374) ++ T Consensus 391 ~l 392 (395) T TIGR01822 391 EL 392 (395) T ss_pred HH T ss_conf 62 No 165 >PRK06483 short chain dehydrogenase; Provisional Probab=27.25 E-value=38 Score=13.48 Aligned_cols=49 Identities=8% Similarity=0.009 Sum_probs=23.0 Q ss_pred HHHHHHHHCCCEEEEEEECC-----CCCHHHHHHH-C-CC-CCEEEECCHHHHHHHHHHHH Q ss_conf 99999997898799999248-----2227889982-3-89-81798389899999999999 Q gi|254780934|r 310 YYCNEAKKRGAIVYAIGIRV-----IRSHEFLRAC-A-SP-NSFYLVENPHSMYDAFSHIG 362 (374) Q Consensus 310 ~~~~~~k~~gi~IytIg~~~-----~~~~~~L~~~-A-s~-~~~~~a~~~~~L~~~f~~I~ 362 (374) .++.++.. +|+|.+|..|. ..+....+.. + ++ +++ ..++|+.++..=+. T Consensus 161 ~lA~ela~-~IrVN~V~PG~i~~~~~~~~~~~~~~~~~~~~~r~---~~p~eia~~v~fL~ 217 (236) T PRK06483 161 SFAAKLAP-EVKVNSIAPALILFNEGDDAAYRQKALAKSLLKIE---PGEEEIIQLVDYLL 217 (236) T ss_pred HHHHHHCC-CCEEEEEEECEEECCCCCCHHHHHHHHHHCCCCCC---CCHHHHHHHHHHHH T ss_conf 99999758-99899996070621899989999999861888899---89899999999999 No 166 >PRK06182 short chain dehydrogenase; Validated Probab=26.88 E-value=39 Score=13.44 Aligned_cols=19 Identities=21% Similarity=-0.066 Sum_probs=12.9 Q ss_pred HHHHHHHHCCCEEEEEEEC Q ss_conf 9999999789879999924 Q gi|254780934|r 310 YYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 310 ~~~~~~k~~gi~IytIg~~ 328 (374) .++.+++..||+|.+|.-| T Consensus 159 ~La~El~~~gI~V~~v~PG 177 (273) T PRK06182 159 ALRLEVAPFGIDVVVIEPG 177 (273) T ss_pred HHHHHHCHHCCEEEEEECC T ss_conf 9999844038789999738 No 167 >PRK07063 short chain dehydrogenase; Provisional Probab=26.28 E-value=40 Score=13.38 Aligned_cols=22 Identities=23% Similarity=0.001 Sum_probs=16.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 168 Tr~lA~e~a~~gIrVNaI~PG~ 189 (259) T PRK07063 168 TRALGIEYAARNVRVNAIAPGY 189 (259) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999714192999897677 No 168 >TIGR01371 met_syn_B12ind 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; InterPro: IPR006276 This group represents cobalamin-independent methionine synthase. A family of uncharacterised archaeal proteins having substantial homology to the C-terminal region of this family, are not included in this group, though they are contained in a larger family of vitamin-B12 independent methionine synthase.; GO: 0003871 5-methyltetrahydropteroyltriglutamate-homocysteine S-methyltransferase activity, 0009086 methionine biosynthetic process. Probab=25.46 E-value=41 Score=13.28 Aligned_cols=69 Identities=19% Similarity=0.181 Sum_probs=40.6 Q ss_pred CCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCC--------CCCEEE-----ECCHHHHHHHHHHHHHHHHE Q ss_conf 88880689999999997898799999248222788998238--------981798-----38989999999999984200 Q gi|254780934|r 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--------PNSFYL-----VENPHSMYDAFSHIGKDIVT 367 (374) Q Consensus 301 ~~~~~~~~~~~~~~~k~~gi~IytIg~~~~~~~~~L~~~As--------~~~~~~-----a~~~~~L~~~f~~I~~~i~~ 367 (374) |+++.... .+...+-..-|-||=..- .+.++|+..-. |.-+|+ +++.+++.+..+++.+.+-+ T Consensus 667 CYSeFn~I---I~~I~~LDADVIsIE~SR-S~~~lL~~f~~~~~Y~~giGpGVYDIHSPRvPS~eE~~~~~~~aL~~~p~ 742 (778) T TIGR01371 667 CYSEFNDI---IEAIAALDADVISIEASR-SDMELLDAFKNIFGYPNGIGPGVYDIHSPRVPSVEEMADLIEKALQVLPA 742 (778) T ss_pred CCCCHHHH---HHHHHHCCCCCEEHHHCC-CCHHHHHHHHHHCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHHCCH T ss_conf 33774379---998843258710031104-75579999852036432348721322688867389999999999974005 Q ss_pred EEEEEE Q ss_conf 166620 Q gi|254780934|r 368 KRIWYD 373 (374) Q Consensus 368 ~~~~~~ 373 (374) .++|+| T Consensus 743 ~~lWVN 748 (778) T TIGR01371 743 ERLWVN 748 (778) T ss_pred HHCEEC T ss_conf 508057 No 169 >TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB; InterPro: IPR014132 This entry describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reduced sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterised polysaccharide deacetylase; the exact function this protein family is unknown.. Probab=24.87 E-value=43 Score=13.22 Aligned_cols=37 Identities=14% Similarity=0.173 Sum_probs=30.6 Q ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 7999806877888880689999999997898799999 Q gi|254780934|r 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 (374) Q Consensus 290 ~iillTDG~~~~~~~~~~~~~~~~~~k~~gi~IytIg 326 (374) -||||=|+-+++.+-......+...+|++|.+.-+|. T Consensus 159 dIiL~HDASd~~KqT~~ALp~Ii~~LK~~GY~fv~is 195 (198) T TIGR02764 159 DIILLHDASDSAKQTVKALPEIIKKLKEKGYEFVTIS 195 (198) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEHH T ss_conf 5698763487974417789999899875495343422 No 170 >KOG4169 consensus Probab=24.65 E-value=43 Score=13.19 Aligned_cols=25 Identities=8% Similarity=0.054 Sum_probs=15.2 Q ss_pred HHHHCCCEEEEEEECCCCCHHHHHHH Q ss_conf 99978987999992482227889982 Q gi|254780934|r 314 EAKKRGAIVYAIGIRVIRSHEFLRAC 339 (374) Q Consensus 314 ~~k~~gi~IytIg~~~~~~~~~L~~~ 339 (374) ...+.||++++|--|. -..++.+.+ T Consensus 169 yy~~sGV~~~avCPG~-t~t~l~~~~ 193 (261) T KOG4169 169 YYQRSGVRFNAVCPGF-TRTDLAENI 193 (261) T ss_pred HHHHCCEEEEEECCCC-CHHHHHHHH T ss_conf 6765587999977873-148999988 No 171 >PRK05693 short chain dehydrogenase; Provisional Probab=24.46 E-value=43 Score=13.17 Aligned_cols=16 Identities=19% Similarity=0.038 Sum_probs=8.1 Q ss_pred HHHHHCCCEEEEEEEC Q ss_conf 9999789879999924 Q gi|254780934|r 313 NEAKKRGAIVYAIGIR 328 (374) Q Consensus 313 ~~~k~~gi~IytIg~~ 328 (374) .+++..||+|.+|.-| T Consensus 159 ~El~~~gI~V~~v~PG 174 (274) T PRK05693 159 LELAPFGVQVMEVQPG 174 (274) T ss_pred HHHCCCCCEEEEEECC T ss_conf 9842028789999718 No 172 >PRK12744 short chain dehydrogenase; Provisional Probab=24.15 E-value=44 Score=13.13 Aligned_cols=22 Identities=27% Similarity=0.191 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 170 tr~lA~ela~~gIrVNaVaPG~ 191 (257) T PRK12744 170 TRAASKEFGARGISVTAVGPGP 191 (257) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999654496999996388 No 173 >PRK07035 short chain dehydrogenase; Provisional Probab=23.33 E-value=46 Score=13.04 Aligned_cols=53 Identities=17% Similarity=0.193 Sum_probs=28.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCC---------CHHHHHHHCC--C-CCEEEECCHHHHHHHHHHHHH Q ss_conf 999999999789879999924822---------2788998238--9-817983898999999999998 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRVIR---------SHEFLRACAS--P-NSFYLVENPHSMYDAFSHIGK 363 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~~~---------~~~~L~~~As--~-~~~~~a~~~~~L~~~f~~I~~ 363 (374) ++.++.++...||+|.+|..|.-. +++.++.... | ++ .-+++|+.++..=+.. T Consensus 169 tr~lA~ela~~gIrVN~V~PG~i~T~~~~~~~~~~~~~~~~~~~~Pl~R---~g~pedia~~v~fL~S 233 (252) T PRK07035 169 TKAFAKECAPFGIRVNALLPGLTDTKFASALFKNDAILKQALAHIPLRR---HAEPSEMAGAVLYLVS 233 (252) T ss_pred HHHHHHHHHHHCEEEEEEEECCCCCHHHHCCCCCHHHHHHHHHCCCCCC---CCCHHHHHHHHHHHHC T ss_conf 9999998603295999996287887424302489999999985699999---8299999999999968 No 174 >PRK05958 8-amino-7-oxononanoate synthase; Reviewed Probab=22.04 E-value=48 Score=12.88 Aligned_cols=56 Identities=14% Similarity=0.150 Sum_probs=39.9 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEEC-CCCCHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHH Q ss_conf 068999999999789879999924-8222788998238981798389899999999999842 Q gi|254780934|r 305 DQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 (374) Q Consensus 305 ~~~~~~~~~~~k~~gi~IytIg~~-~~~~~~~L~~~As~~~~~~a~~~~~L~~~f~~I~~~i 365 (374) ......+++.++++||-+..|.+. .|.++..||-|-+..+ +.++|....+.+.+-+ T Consensus 328 ~~~~~~~~~~L~~~Gi~v~~i~~PtVp~~~~rlRi~i~a~h-----t~~dId~l~~~l~e~~ 384 (387) T PRK05958 328 NERALALAAALQAQGFWVGAIRPPTVPAGTSRLRITLTAAH-----TEADIDRLLEALAEAL 384 (387) T ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEEEECCCC-----CHHHHHHHHHHHHHHH T ss_conf 99999999999977952831789988999844999978779-----9999999999999999 No 175 >PRK05786 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=21.62 E-value=49 Score=12.83 Aligned_cols=22 Identities=18% Similarity=0.072 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 161 tk~lA~ela~~gIrVN~IaPG~ 182 (238) T PRK05786 161 VEILAAELLDRGIRVNGVAPSG 182 (238) T ss_pred HHHHHHHHCCCCEEEEEEECCC T ss_conf 9999999641795999996288 No 176 >PRK12827 short chain dehydrogenase; Provisional Probab=20.82 E-value=51 Score=12.73 Aligned_cols=22 Identities=27% Similarity=0.268 Sum_probs=16.8 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|.-|. T Consensus 171 Tr~lA~e~a~~gIrVNaV~PG~ 192 (251) T PRK12827 171 TKTLANELAPRGITVNAVAPGA 192 (251) T ss_pred HHHHHHHHCCCCEEEEEEEECC T ss_conf 9999999650496999996488 No 177 >PRK07832 short chain dehydrogenase; Provisional Probab=20.55 E-value=52 Score=12.70 Aligned_cols=19 Identities=11% Similarity=-0.048 Sum_probs=13.3 Q ss_pred HHHHHHHHCCCEEEEEEEC Q ss_conf 9999999789879999924 Q gi|254780934|r 310 YYCNEAKKRGAIVYAIGIR 328 (374) Q Consensus 310 ~~~~~~k~~gi~IytIg~~ 328 (374) .+..+++..||+|.+|.-| T Consensus 164 sL~~El~~~gI~V~~v~PG 182 (272) T PRK07832 164 VLRFDLARHGIGVSVVVPG 182 (272) T ss_pred HHHHHHCCCCCEEEEEECC T ss_conf 9999852109789999748 No 178 >PRK07062 short chain dehydrogenase; Provisional Probab=20.41 E-value=52 Score=12.68 Aligned_cols=22 Identities=18% Similarity=0.129 Sum_probs=16.3 Q ss_pred HHHHHHHHHHCCCEEEEEEECC Q ss_conf 9999999997898799999248 Q gi|254780934|r 308 SLYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 308 ~~~~~~~~k~~gi~IytIg~~~ 329 (374) ++.++.++...||++.+|..|. T Consensus 170 tk~lA~ela~~gIrVN~V~PG~ 191 (265) T PRK07062 170 VKSLATELAPEGVRVNSILLGL 191 (265) T ss_pred HHHHHHHHHHHCEEEEEEEECC T ss_conf 9999999766493999896085 No 179 >PRK08415 enoyl-(acyl carrier protein) reductase; Provisional Probab=20.29 E-value=53 Score=12.66 Aligned_cols=21 Identities=29% Similarity=0.300 Sum_probs=13.9 Q ss_pred HHHHHHHHHCCCEEEEEEECC Q ss_conf 999999997898799999248 Q gi|254780934|r 309 LYYCNEAKKRGAIVYAIGIRV 329 (374) Q Consensus 309 ~~~~~~~k~~gi~IytIg~~~ 329 (374) +.++.++...||+|.+|.-|. T Consensus 169 k~lA~Ela~~gIRVN~IaPG~ 189 (274) T PRK08415 169 RYLAVDLGKKGIRVNAISAGP 189 (274) T ss_pred HHHHHHHHHCCEEEEEEEECC T ss_conf 999999835496999987687 No 180 >COG0362 Gnd 6-phosphogluconate dehydrogenase [Carbohydrate transport and metabolism] Probab=20.10 E-value=53 Score=12.64 Aligned_cols=10 Identities=20% Similarity=0.308 Sum_probs=4.4 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780934|r 31 FLVLGMIIEV 40 (374) Q Consensus 31 l~~~g~avD~ 40 (374) ++++|++|-. T Consensus 6 iGviGLaVMG 15 (473) T COG0362 6 IGVIGLAVMG 15 (473) T ss_pred CCEEEHHHHH T ss_conf 0057310302 Done!