Query gi|254780474|ref|YP_003064887.1| ubiquinol oxidase, subunit II [Candidatus Liberibacter asiaticus str. psy62] Match_columns 332 No_of_seqs 221 out of 1420 Neff 5.6 Searched_HMMs 39220 Date Sun May 29 20:15:13 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780474.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10525 cytochrome o ubiquino 100.0 0 0 756.3 31.4 299 11-318 5-305 (305) 2 TIGR01433 CyoA ubiquinol oxida 100.0 0 0 689.9 15.8 226 19-253 1-228 (228) 3 COG1622 CyoA Heme/copper-type 100.0 0 0 526.6 22.1 246 18-280 2-247 (247) 4 TIGR02866 CoxB cytochrome c ox 100.0 0 0 468.2 20.8 208 38-250 1-249 (249) 5 TIGR01432 QOXA cytochrome aa3 100.0 0 0 443.4 8.8 225 20-253 2-229 (229) 6 MTH00140 COX2 cytochrome c oxi 100.0 0 0 422.1 23.3 200 32-252 6-226 (227) 7 MTH00129 COX2 cytochrome c oxi 100.0 0 0 419.7 23.8 204 32-256 6-230 (230) 8 MTH00038 COX2 cytochrome c oxi 100.0 0 0 420.5 21.7 202 32-254 6-228 (229) 9 MTH00154 COX2 cytochrome c oxi 100.0 0 0 417.5 23.4 199 32-251 6-225 (226) 10 MTH00185 COX2 cytochrome c oxi 100.0 0 0 415.8 23.6 203 32-255 6-229 (229) 11 MTH00051 COX2 cytochrome c oxi 100.0 0 0 411.5 22.5 214 21-255 6-246 (247) 12 MTH00117 COX2 cytochrome c oxi 100.0 0 0 409.6 23.3 200 32-252 6-226 (227) 13 MTH00139 COX2 cytochrome c oxi 100.0 0 0 412.3 21.3 197 34-251 4-223 (223) 14 MTH00023 COX2 cytochrome c oxi 100.0 0 0 410.5 22.1 212 23-255 4-247 (247) 15 MTH00168 COX2 cytochrome c oxi 100.0 0 0 407.6 22.3 199 32-251 6-225 (225) 16 MTH00076 COX2 cytochrome c oxi 100.0 0 0 407.9 21.7 201 32-253 6-227 (228) 17 MTH00098 COX2 cytochrome c oxi 100.0 0 0 405.7 23.1 200 32-252 6-226 (227) 18 MTH00008 COX2 cytochrome c oxi 100.0 0 0 408.0 20.5 201 32-253 6-227 (228) 19 MTH00027 COX2 cytochrome c oxi 100.0 0 0 396.1 22.0 205 32-253 21-248 (249) 20 MTH00080 COX2 cytochrome c oxi 100.0 0 0 392.9 20.2 201 35-252 8-229 (231) 21 MTH00047 COX2 cytochrome c oxi 100.0 0 0 374.9 19.8 169 56-243 15-192 (193) 22 KOG4767 consensus 100.0 9.4E-38 2.4E-42 288.2 18.3 200 32-252 9-229 (231) 23 pfam00116 COX2 Cytochrome C ox 100.0 8.2E-37 2.1E-41 281.5 11.1 99 142-240 1-120 (120) 24 PTZ00047 cytochrome C oxidase 100.0 1.2E-33 3.1E-38 258.9 9.4 91 162-252 70-160 (163) 25 pfam06481 COX_ARM COX Aromatic 99.0 2E-10 5E-15 94.0 2.2 47 253-299 1-47 (47) 26 pfam02790 COX2_TM Cytochrome C 98.5 4.4E-07 1.1E-11 70.2 8.1 77 32-115 6-82 (83) 27 PRK02888 nitrous-oxide reducta 98.4 5.5E-07 1.4E-11 69.5 5.8 77 165-241 557-637 (637) 28 TIGR03096 nitroso_cyanin nitro 97.3 0.00084 2.1E-08 46.9 6.2 58 166-223 61-119 (135) 29 COG4263 NosZ Nitrous oxide red 96.3 0.007 1.8E-07 40.3 4.6 74 167-240 559-636 (637) 30 PRK10263 DNA translocase FtsK; 93.2 0.77 2E-05 25.8 7.9 50 35-84 60-109 (1355) 31 TIGR01477 RIFIN variant surfac 92.9 0.13 3.4E-06 31.2 3.7 26 53-79 356-381 (395) 32 PTZ00042 stevor; Provisional 92.3 0.51 1.3E-05 27.1 6.0 52 30-81 242-293 (304) 33 pfam02009 Rifin_STEVOR Rifin/s 91.4 0.5 1.3E-05 27.2 5.1 14 67-80 263-276 (289) 34 PTZ00046 rifin; Provisional 90.2 0.39 9.9E-06 27.9 3.8 13 68-80 323-335 (348) 35 TIGR03095 rusti_cyanin rusticy 89.9 1.6 4.2E-05 23.5 7.5 75 164-239 50-147 (148) 36 pfam07731 Cu-oxidase_2 Multico 85.4 3 7.6E-05 21.6 6.1 77 163-240 30-133 (136) 37 TIGR02376 Cu_nitrite_red nitri 81.5 1.3 3.4E-05 24.1 2.7 80 167-246 60-152 (331) 38 pfam00127 Copper-bind Copper b 80.9 4.4 0.00011 20.4 7.1 68 165-240 16-99 (99) 39 TIGR03388 ascorbase L-ascorbat 80.8 3.6 9.1E-05 21.1 4.7 27 165-191 250-277 (541) 40 PRK06231 F0F1 ATP synthase sub 78.9 5.1 0.00013 20.0 6.3 52 21-74 18-71 (201) 41 PRK10654 dcuC C4-dicarboxylate 78.7 3 7.7E-05 21.6 3.8 84 30-113 165-258 (452) 42 TIGR01478 STEVOR variant surfa 76.9 5.8 0.00015 19.6 6.9 35 49-83 272-306 (315) 43 PRK02463 OxaA-like protein pre 76.1 6 0.00015 19.5 6.6 31 12-42 5-37 (307) 44 PRK02710 plastocyanin; Provisi 75.7 6.2 0.00016 19.4 7.4 68 165-240 46-119 (119) 45 COG3017 LolB Outer membrane li 67.7 6.2 0.00016 19.3 3.2 25 11-35 2-26 (206) 46 pfam06716 DUF1201 Protein of u 67.6 9.2 0.00024 18.1 5.3 47 47-93 6-53 (54) 47 COG4148 ModC ABC-type molybdat 66.9 4 0.0001 20.7 2.1 47 165-211 260-306 (352) 48 TIGR03389 laccase laccase, pla 64.9 10 0.00026 17.8 5.0 28 165-192 234-261 (539) 49 COG4858 Uncharacterized membra 64.5 11 0.00027 17.7 6.4 29 51-79 127-155 (226) 50 TIGR02657 amicyanin amicyanin; 62.3 4.8 0.00012 20.2 1.8 70 166-240 11-83 (83) 51 TIGR02887 spore_ger_x_C germin 62.1 5.1 0.00013 20.0 1.9 19 14-32 1-19 (400) 52 COG4633 Plastocyanin domain co 62.0 12 0.0003 17.4 4.4 72 164-241 95-169 (272) 53 pfam11346 DUF3149 Protein of u 61.8 10 0.00026 17.8 3.4 30 48-77 5-35 (42) 54 PRK10802 peptidoglycan-associa 61.8 7.3 0.00019 18.9 2.6 24 10-33 1-24 (173) 55 pfam06572 DUF1131 Protein of u 61.2 4.6 0.00012 20.3 1.6 25 14-39 2-32 (192) 56 PRK11023 hypothetical protein; 59.5 6.4 0.00016 19.3 2.0 21 13-33 1-21 (191) 57 smart00719 Plus3 Short conserv 59.3 11 0.00028 17.6 3.2 49 196-253 61-109 (109) 58 PRK12700 flgH flagellar basal 57.6 12 0.00032 17.2 3.2 24 10-33 1-24 (230) 59 pfam12501 DUF3708 Phosphate AT 56.4 6.9 0.00018 19.0 1.8 49 61-109 4-58 (127) 60 PRK09859 multidrug efflux syst 55.1 9.9 0.00025 17.9 2.4 19 14-32 4-22 (385) 61 PRK10796 LPS-assembly lipoprot 54.6 10 0.00025 17.9 2.3 20 14-33 2-21 (196) 62 PRK10718 hypothetical protein; 54.0 4.6 0.00012 20.3 0.5 24 14-38 2-31 (191) 63 PRK01622 OxaA-like protein pre 53.9 16 0.0004 16.5 8.4 22 12-33 4-25 (266) 64 TIGR02722 lp_ uncharacterized 53.9 10 0.00026 17.9 2.3 20 14-33 2-21 (215) 65 PRK09915 putative outer membra 52.7 13 0.00032 17.1 2.6 26 8-33 1-26 (488) 66 COG1276 PcoD Putative copper e 52.6 16 0.00042 16.4 5.6 24 87-113 259-282 (289) 67 PRK09804 putative cryptic C4-d 52.1 17 0.00043 16.3 3.8 25 57-81 200-224 (455) 68 PRK09967 putative outer membra 50.3 18 0.00045 16.1 3.3 19 15-33 3-21 (160) 69 pfam08292 RNA_pol_Rbc25 RNA po 50.1 12 0.00031 17.2 2.2 30 148-184 46-75 (120) 70 KOG3297 consensus 49.4 18 0.00047 16.0 4.4 34 148-184 124-157 (202) 71 PRK04405 prsA peptidylprolyl i 49.4 17 0.00043 16.2 2.8 39 8-46 1-43 (298) 72 COG4259 Uncharacterized protei 48.7 14 0.00037 16.8 2.4 24 10-33 1-24 (121) 73 PRK09578 periplasmic multidrug 48.4 17 0.00043 16.3 2.7 20 13-32 5-24 (385) 74 pfam05454 DAG1 Dystroglycan (D 48.4 11 0.00028 17.6 1.8 33 47-79 142-176 (290) 75 TIGR03102 halo_cynanin halocya 48.2 19 0.00049 15.9 7.3 94 137-240 19-115 (115) 76 TIGR02747 TraV type IV conjuga 47.9 12 0.00031 17.3 1.9 20 14-33 2-22 (174) 77 COG2980 RlpB Rare lipoprotein 47.9 13 0.00033 17.1 2.0 19 15-33 3-21 (178) 78 pfam09851 DUF2078 Predicted me 47.8 19 0.00049 15.9 3.7 21 55-75 10-30 (73) 79 pfam08085 Entericidin Enterici 47.7 15 0.00038 16.6 2.3 21 13-33 2-23 (42) 80 PRK12701 flgH flagellar basal 46.9 14 0.00036 16.8 2.1 20 14-33 2-21 (230) 81 pfam07358 DUF1482 Protein of u 46.4 5.5 0.00014 19.7 -0.1 11 183-193 46-56 (57) 82 TIGR02544 III_secr_YscJ type I 46.2 16 0.00041 16.5 2.3 22 17-38 3-26 (203) 83 COG3417 FlgN Collagen-binding 45.8 20 0.00052 15.7 2.8 19 15-33 4-22 (200) 84 COG2268 Uncharacterized protei 45.5 13 0.00033 17.1 1.8 16 142-159 59-74 (548) 85 PRK11627 hypothetical protein; 45.4 17 0.00043 16.3 2.3 25 19-43 6-34 (192) 86 COG4669 EscJ Type III secretor 44.8 21 0.00055 15.5 3.3 64 155-221 131-195 (246) 87 pfam12262 Lipase_bact_N Bacter 44.5 16 0.0004 16.5 2.0 20 14-33 2-21 (269) 88 PRK13562 acetolactate synthase 43.6 8 0.0002 18.6 0.4 37 227-263 40-76 (84) 89 pfam11694 DUF3290 Protein of u 43.4 22 0.00057 15.4 5.3 32 54-85 16-47 (149) 90 pfam06474 MLTD_N MLTD_N. 43.3 14 0.00036 16.8 1.7 15 19-33 4-18 (93) 91 TIGR00039 6PTHBS putative 6-py 43.2 7.8 0.0002 18.6 0.3 51 217-272 29-79 (153) 92 TIGR02163 napH_ ferredoxin-typ 42.3 23 0.00059 15.3 5.7 46 1-46 91-150 (263) 93 PRK13316 heme-degrading monoox 42.0 24 0.0006 15.2 4.2 69 164-251 4-77 (121) 94 TIGR02588 TIGR02588 conserved 41.9 24 0.0006 15.2 2.9 18 150-167 50-67 (122) 95 PRK13733 conjugal transfer pro 41.8 2.1 5.5E-05 22.7 -2.7 20 14-33 2-21 (171) 96 PRK10144 formate-dependent nit 41.7 22 0.00056 15.5 2.4 74 33-116 49-122 (126) 97 COG5000 NtrY Signal transducti 41.7 24 0.00061 15.2 5.6 72 36-114 29-104 (712) 98 COG3462 Predicted membrane pro 41.6 24 0.00061 15.2 5.6 48 33-80 31-80 (117) 99 pfam00394 Cu-oxidase Multicopp 41.4 24 0.00061 15.2 7.7 74 163-236 42-134 (142) 100 PRK12697 flgH flagellar basal 41.0 22 0.00055 15.5 2.3 24 10-33 3-26 (227) 101 pfam05767 Pox_A14 Poxvirus vir 40.8 24 0.00062 15.1 5.3 62 13-77 8-73 (92) 102 PRK02944 OxaA-like protein pre 40.7 25 0.00063 15.1 6.9 31 12-42 1-36 (255) 103 COG3056 Uncharacterized lipopr 40.1 25 0.00064 15.0 2.7 158 11-223 11-172 (204) 104 PRK12788 flgH flagellar basal 39.8 16 0.00041 16.4 1.5 17 16-32 1-17 (231) 105 PRK13731 conjugal transfer sur 39.0 23 0.0006 15.3 2.2 20 13-32 3-22 (243) 106 PRK02939 hypothetical protein; 38.5 22 0.00055 15.5 2.0 31 19-49 5-36 (236) 107 PRK13279 arnT 4-amino-4-deoxy- 38.5 21 0.00054 15.6 2.0 49 207-257 451-514 (551) 108 TIGR02699 archaeo_AfpA archaeo 38.3 22 0.00057 15.4 2.0 27 167-193 133-159 (176) 109 PRK10866 outer membrane protei 38.3 27 0.00068 14.9 3.4 19 15-33 4-22 (243) 110 TIGR03659 IsdE heme ABC transp 37.8 24 0.00061 15.2 2.1 21 13-33 1-21 (289) 111 pfam11874 DUF3394 Domain of un 37.8 7.7 0.0002 18.7 -0.4 35 151-185 48-87 (183) 112 COG3317 NlpB Uncharacterized l 36.8 28 0.00071 14.7 2.4 21 13-33 2-22 (342) 113 KOG3653 consensus 36.6 28 0.00072 14.7 3.1 34 47-80 152-185 (534) 114 pfam07901 DUF1672 Protein of u 36.3 24 0.00061 15.2 1.9 17 17-33 4-20 (304) 115 PRK12698 flgH flagellar basal 35.9 29 0.00074 14.6 2.3 14 20-33 5-18 (224) 116 pfam10643 Cytochrome-c551 Phot 35.9 29 0.00074 14.6 3.2 25 9-33 11-35 (213) 117 pfam07856 Orai-1 Mediator of C 35.6 29 0.00075 14.6 5.0 62 49-113 112-176 (205) 118 PRK10598 hypothetical protein; 35.6 27 0.00069 14.8 2.1 21 232-252 115-135 (186) 119 COG2976 Uncharacterized protei 35.3 25 0.00065 15.0 1.9 24 57-80 23-46 (207) 120 cd00454 Trunc_globin Truncated 35.3 14 0.00037 16.7 0.7 36 215-258 56-91 (116) 121 PRK11372 lysozyme inhibitor; P 35.3 29 0.00073 14.6 2.2 21 138-158 38-58 (109) 122 pfam05590 DUF769 Xylella fasti 35.2 24 0.00061 15.2 1.8 15 19-33 9-23 (279) 123 PRK12407 flgH flagellar basal 35.0 26 0.00066 14.9 1.9 17 17-33 2-18 (220) 124 PRK13684 Ycf48-like protein; P 35.0 30 0.00076 14.5 3.1 26 8-33 1-26 (333) 125 pfam06788 UPF0257 Uncharacteri 34.5 20 0.00051 15.8 1.3 27 136-162 126-152 (236) 126 PRK11653 hypothetical protein; 34.3 29 0.00073 14.7 2.1 33 1-33 1-39 (223) 127 pfam06024 DUF912 Nucleopolyhed 34.2 31 0.00078 14.4 3.2 31 48-78 62-92 (101) 128 PRK00249 flgH flagellar basal 34.1 31 0.00079 14.4 2.5 19 15-33 3-21 (231) 129 PRK10781 rcsF outer membrane l 34.0 27 0.00069 14.8 1.9 22 17-38 2-23 (133) 130 PRK09973 putative outer membra 33.7 20 0.00051 15.7 1.2 21 13-33 2-22 (85) 131 pfam09577 Spore_YpjB Sporulati 33.5 32 0.0008 14.3 3.0 16 101-116 206-221 (232) 132 COG5510 Predicted small secret 33.5 30 0.00076 14.5 2.1 21 13-33 3-24 (44) 133 pfam07010 Endomucin Endomucin. 33.4 32 0.00081 14.3 2.9 14 117-130 214-227 (259) 134 pfam03126 Plus-3 Plus-3 domain 32.6 33 0.00083 14.2 4.4 46 197-251 59-104 (105) 135 TIGR00005 rluA_subfam pseudour 32.4 30 0.00077 14.5 1.9 31 215-252 304-336 (337) 136 PRK00022 lolB outer membrane l 32.2 30 0.00077 14.5 1.9 15 19-33 6-20 (203) 137 TIGR03302 OM_YfiO outer membra 32.2 33 0.00084 14.2 2.2 18 16-33 2-19 (235) 138 pfam06007 PhnJ Phosphonate met 31.8 21 0.00053 15.6 1.0 85 161-246 158-270 (279) 139 KOG3846 consensus 31.7 20 0.00051 15.7 0.9 20 277-296 406-425 (465) 140 TIGR02302 aProt_lowcomp conser 31.3 34 0.00087 14.1 4.6 41 164-204 219-260 (899) 141 PRK11443 hypothetical protein; 31.2 32 0.00081 14.3 1.9 16 19-34 4-19 (120) 142 TIGR01675 plant-AP plant acid 31.1 30 0.00076 14.5 1.7 46 228-273 101-148 (245) 143 pfam05915 DUF872 Eukaryotic pr 30.8 33 0.00085 14.2 1.9 14 51-64 74-87 (112) 144 COG5567 Predicted small peripl 30.4 35 0.0009 14.0 2.3 21 13-33 2-22 (58) 145 PRK05978 hypothetical protein; 30.2 36 0.00091 14.0 4.0 24 53-76 73-96 (149) 146 COG3065 Slp Starvation-inducib 29.5 37 0.00093 13.9 2.6 15 19-33 11-25 (191) 147 PRK11459 multidrug resistance 29.4 37 0.00093 13.9 2.1 21 12-32 3-23 (478) 148 TIGR02142 modC_ABC molybdate A 29.4 30 0.00077 14.5 1.5 68 142-209 238-308 (361) 149 cd01225 PH_Cool_Pix Cool (clon 29.3 37 0.00094 13.9 3.1 24 233-256 86-109 (111) 150 PRK03002 prsA peptidylprolyl i 29.2 37 0.00094 13.9 2.4 20 13-32 3-22 (285) 151 KOG3637 consensus 29.2 37 0.00094 13.9 2.8 32 143-174 149-180 (1030) 152 TIGR01647 ATPase-IIIA_H plasma 29.1 37 0.00094 13.8 7.0 87 90-189 260-350 (835) 153 pfam06832 BiPBP_C Penicillin-B 28.8 37 0.00095 13.8 4.7 73 154-229 13-85 (90) 154 KOG0174 consensus 28.8 37 0.00095 13.8 1.9 63 182-254 114-180 (224) 155 PRK11189 lipoprotein NlpI; Pro 28.5 38 0.00096 13.8 4.6 20 13-32 2-21 (297) 156 PRK07363 NAD(P)H-quinone oxido 28.5 38 0.00097 13.8 7.8 18 186-203 207-224 (499) 157 PRK10175 hypothetical protein; 28.3 22 0.00057 15.4 0.7 18 17-34 2-19 (75) 158 TIGR03502 lipase_Pla1_cef extr 28.3 37 0.00095 13.8 1.8 19 14-32 2-20 (792) 159 COG1113 AnsP Gamma-aminobutyra 27.4 39 0.001 13.7 7.1 28 51-78 362-389 (462) 160 KOG1483 consensus 27.4 39 0.001 13.6 5.0 95 10-115 3-97 (404) 161 TIGR03147 cyt_nit_nrfF cytochr 27.4 40 0.001 13.6 2.9 74 33-116 49-122 (126) 162 PRK09240 thiH thiamine biosynt 27.3 40 0.001 13.6 4.7 16 238-253 271-286 (371) 163 pfam11353 DUF3153 Protein of u 26.6 25 0.00064 15.0 0.7 71 140-211 101-179 (210) 164 PRK11616 hypothetical protein; 26.5 38 0.00097 13.8 1.6 22 14-35 3-24 (109) 165 PRK11029 FtsH protease regulat 26.3 41 0.0011 13.5 2.4 20 162-181 79-98 (334) 166 PRK09810 entericidin A; Provis 26.2 41 0.0011 13.5 2.3 19 14-33 3-21 (41) 167 pfam06604 consensus 26.1 42 0.0011 13.5 2.2 21 13-33 3-23 (181) 168 pfam00479 G6PD_N Glucose-6-pho 25.4 43 0.0011 13.4 3.6 60 260-319 83-153 (183) 169 KOG3039 consensus 25.3 21 0.00054 15.6 0.1 50 274-323 235-291 (303) 170 PRK12450 foldase protein PrsA; 25.3 43 0.0011 13.4 2.7 35 12-46 3-44 (309) 171 TIGR03546 conserved hypothetic 25.0 44 0.0011 13.3 3.9 30 49-78 109-138 (154) 172 PRK08475 F0F1 ATP synthase sub 24.8 44 0.0011 13.3 3.5 10 65-74 38-47 (170) 173 pfam00585 Thr_dehydrat_C C-ter 24.7 19 0.00049 15.9 -0.2 58 211-268 17-87 (91) 174 PRK09739 hypothetical protein; 24.7 37 0.00095 13.8 1.3 72 173-260 120-198 (201) 175 PRK13312 heme-degrading monoox 24.7 44 0.0011 13.3 4.9 54 183-251 17-70 (107) 176 pfam06666 DUF1173 Protein of u 24.7 19 0.00049 15.9 -0.2 35 173-207 324-361 (386) 177 KOG2927 consensus 24.6 44 0.0011 13.3 3.4 26 53-78 190-215 (372) 178 TIGR02878 spore_ypjB sporulati 24.6 44 0.0011 13.3 2.6 24 54-77 202-226 (234) 179 COG3794 PetE Plastocyanin [Ene 24.4 45 0.0011 13.3 7.1 71 166-240 54-127 (128) 180 TIGR01004 PulS_OutS lipoprotei 24.3 45 0.0011 13.3 4.4 41 10-50 3-51 (136) 181 PRK10449 heat-inducible protei 24.2 45 0.0011 13.3 1.9 35 222-256 67-112 (140) 182 pfam02430 AMA-1 Apical membran 23.8 38 0.00098 13.7 1.2 32 220-257 209-240 (462) 183 cd01245 PH_RasGAP_CG5898 RAS G 23.8 46 0.0012 13.2 2.1 16 240-255 83-98 (98) 184 pfam10066 DUF2304 Uncharacteri 23.6 46 0.0012 13.2 2.3 11 110-120 78-88 (115) 185 PRK10783 mltD membrane-bound l 23.1 47 0.0012 13.1 1.6 14 20-33 5-18 (449) 186 TIGR02898 spore_YhcN_YlaJ spor 23.0 47 0.0012 13.1 1.8 16 18-33 2-19 (185) 187 COG1585 Membrane protein impli 22.9 47 0.0012 13.1 3.4 36 46-81 39-74 (140) 188 PRK12699 flgH flagellar basal 22.8 48 0.0012 13.1 1.8 16 19-34 21-36 (246) 189 PRK10523 lipoprotein involved 22.3 49 0.0012 13.0 2.8 23 12-34 2-24 (234) 190 pfam02411 MerT MerT mercuric t 22.1 49 0.0013 13.0 4.7 24 92-115 89-112 (116) 191 PRK01326 prsA foldase protein 22.1 49 0.0013 13.0 2.6 20 13-32 2-22 (310) 192 KOG4789 consensus 22.0 49 0.0013 13.0 2.3 22 168-189 602-623 (668) 193 PRK13751 putative mercuric tra 22.0 49 0.0013 13.0 4.4 29 85-115 84-112 (116) 194 pfam08139 LPAM_1 Prokaryotic m 21.8 50 0.0013 12.9 2.0 14 21-34 7-20 (26) 195 PRK00447 hypothetical protein; 21.8 50 0.0013 12.9 2.5 55 206-261 83-141 (146) 196 PHA01080 hypothetical protein 21.8 50 0.0013 12.9 1.7 24 238-261 28-51 (81) 197 KOG3325 consensus 21.7 50 0.0013 12.9 1.7 82 144-236 56-157 (183) 198 KOG1094 consensus 21.7 50 0.0013 12.9 3.7 38 60-100 400-437 (807) 199 TIGR02683 upstrm_HI1419 probab 21.6 33 0.00084 14.2 0.5 22 238-259 2-23 (95) 200 PRK02998 prsA peptidylprolyl i 21.6 50 0.0013 12.9 2.8 33 14-46 5-40 (283) 201 pfam01152 Bac_globin Bacterial 21.6 50 0.0013 12.9 1.6 25 229-258 71-95 (120) 202 pfam08490 DUF1744 Domain of un 21.6 43 0.0011 13.4 1.1 15 211-225 194-208 (396) 203 pfam01235 Na_Ala_symp Sodium:a 21.5 50 0.0013 12.9 5.4 30 55-84 55-84 (415) 204 pfam07273 DUF1439 Protein of u 21.5 51 0.0013 12.9 2.1 22 232-253 114-135 (177) 205 TIGR02023 BchP-ChlP geranylger 21.4 51 0.0013 12.9 1.5 40 240-279 93-137 (408) 206 pfam09351 DUF1993 Domain of un 21.2 50 0.0013 12.9 1.4 30 165-195 109-138 (162) 207 pfam00767 Poty_coat Potyvirus 21.1 51 0.0013 12.8 2.9 24 240-264 56-79 (237) 208 pfam02402 Lysis_col Lysis prot 20.9 33 0.00084 14.2 0.4 17 17-33 5-21 (46) 209 KOG4390 consensus 20.7 50 0.0013 12.9 1.3 10 57-66 183-192 (632) 210 pfam11344 DUF3146 Protein of u 20.7 52 0.0013 12.8 4.6 45 138-182 15-79 (80) 211 PRK10672 rare lipoprotein A; P 20.6 53 0.0013 12.8 1.8 17 17-33 4-20 (369) 212 TIGR02843 CyoB cytochrome o ub 20.5 53 0.0013 12.8 4.7 48 49-98 494-542 (646) 213 pfam11368 DUF3169 Protein of u 20.5 53 0.0013 12.7 4.9 32 45-76 42-73 (248) 214 PRK12696 flgH flagellar basal 20.5 53 0.0014 12.7 2.7 17 17-33 4-20 (238) 215 TIGR00979 fumC_II fumarate hyd 20.4 21 0.00054 15.6 -0.7 18 293-310 294-311 (459) 216 pfam03767 Acid_phosphat_B HAD 20.4 53 0.0014 12.7 1.9 53 237-291 102-154 (230) 217 PRK10477 outer membrane lipopr 20.3 48 0.0012 13.1 1.1 52 207-260 116-167 (177) 218 cd00260 Sialidase Sialidases o 20.3 53 0.0014 12.7 2.9 99 142-251 132-238 (351) 219 PRK06473 NAD(P)H-quinone oxido 20.2 54 0.0014 12.7 7.5 19 185-203 206-224 (500) 220 pfam11444 DUF2895 Protein of u 20.1 54 0.0014 12.7 7.6 69 48-119 8-76 (199) No 1 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=756.25 Aligned_cols=299 Identities=43% Similarity=0.819 Sum_probs=290.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 35668899999999998741364326898989999999999999999999999999999998853046777777875345 Q gi|254780474|r 11 SVGKSLKFFFLFFSSVMLSGFEFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCH 90 (332) Q Consensus 11 ~~~k~l~~~~l~~~~~lLsGC~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~h 90 (332) +-.|.++++++++++++||||+.++|||+|++|++|++|+++++++|++|+++|++++++|+||||++|++++|.|+|+| T Consensus 5 ~~~~~~~~l~l~~~~~lLsGC~~~~LdPkGpva~~q~~Li~~s~~lMliVvvpVi~l~~~~~~ryR~~~~~a~y~P~w~h 84 (305) T PRK10525 5 KYNKSLGWLSLFAGTVLLSGCNSALLDPKGQIGLEQRSLILTAFGLMLIVVIPAILMAVGFAWKYRASNKDAKYSPNWSH 84 (305) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCCCCC T ss_conf 77778999999999998667876328999877899998999999999868999999999988411136899887997557 Q ss_pred CCEEEEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEEE Q ss_conf 66125530227899999999999988775134442000012446788888668999998436999962678774306896 Q gi|254780474|r 91 STLLELFVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQVAIINELVV 170 (332) Q Consensus 91 s~~lEi~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~giatvneL~l 170 (332) |+++|.++|+||++||++|+++||+++|+||||+|+ .++++|++|+|+|+||||+|+|||+||+|+|||++ T Consensus 85 s~~lE~viWtIPilIIi~Lav~T~~~t~~Ldp~~p~---------~~~~~pl~V~Vv~~qWkW~F~YPe~gIat~NeL~i 155 (305) T PRK10525 85 SNKVEAVVWTVPILIIIFLAVLTWKTTHALEPSKPL---------AHDEKPITIEVVSMDWKWFFIYPEQGIATVNEIAF 155 (305) T ss_pred CCEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC---------CCCCCCEEEEEEEEEEEEEEEECCCCCEEEEEEEE T ss_conf 874530468999999999999999887752677878---------88889779999988415999957988356678998 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC-CHHHHHHHH Q ss_conf 3698289999835810110220107403116985148999942774896661332657611365899970-889999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK-SEKGFEDWI 249 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~-s~~~f~~Wv 249 (332) |+|+||+|+|||+|||||||||+||||+||||||+|++|++||++|+|+|+||||||.|||+|+|+++++ +++|||+|| T Consensus 156 P~~~pV~f~lTS~dVihSF~IP~L~gk~dampG~~t~l~~~a~~~G~Y~G~cae~cG~ghs~M~F~v~a~~~~~~Fd~WV 235 (305) T PRK10525 156 PANTPVYFKVTSNSVMNSFFIPRLGSQIYAMAGMQTRLHLIANEPGTYDGISASYSGPGFSGMKFKAIATPDRAEFDQWV 235 (305) T ss_pred ECCCEEEEEEEECCEEEEEEECCCCCCEECCCCCCEEEEEECCCCEEEEEECHHCCCCCCCCCEEEEEECCCHHHHHHHH T ss_conf 78987999997277111032131156410379862389998489757888723004803058877999768989999999 Q ss_pred HHHHCCCCCC-CHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHCCCCCCCCCCCCHHHCCCCCCCCCHHH Q ss_conf 9986072456-98999997610212632172047835426873152444885660210246668852344 Q gi|254780474|r 250 SKVKRQGTFL-NRQEYLLLKKPSERDSVRYFSPIEKGLYYSILNLCVHPGKICMDEMMRIDALGGGGMRG 318 (332) Q Consensus 250 ~~~ka~~~~l-~~~~y~~l~~ps~~~~~~~f~~~~~~lf~~i~~~~~~~~~~~~~~~m~~~~~~g~~~~~ 318 (332) +++|+++++| |.++|++|++||+++||+|||+|+|+||++|+++|+++|+.|.+.+++.+.++|.|||+ T Consensus 236 ~~vk~s~~~l~~~~~y~~l~~ps~~~~v~yf~~v~~~Lf~~iv~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (305) T PRK10525 236 AKAKQSPNTMNDMAAFEKLAAPSEYNPVEYFSSVKPDLFKDVINKFMGHGKSMDMTQPEGEHSAHEGMEG 305 (305) T ss_pred HHHHCCCCCCCCHHHHHHHHCCCCCCCCCEECCCCCHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCCCC T ss_conf 9986187557758899998588868996462468930899999997128857665431343003566589 No 2 >TIGR01433 CyoA ubiquinol oxidase, subunit II; InterPro: IPR006333 Members of this domain catalyze the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in Gram-positive bacteria but which is in complex with subunits which utilise cytochrome a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cytochrome c as the oxidant . ; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006118 electron transport, 0016020 membrane. Probab=100.00 E-value=0 Score=689.93 Aligned_cols=226 Identities=54% Similarity=1.046 Sum_probs=222.1 Q ss_pred HHHHHHHHHHHCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 99999999874136-43268989899999999999999999999999999999988530467777778753456612553 Q gi|254780474|r 19 FFLFFSSVMLSGFE-FIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELF 97 (332) Q Consensus 19 ~~l~~~~~lLsGC~-~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~ 97 (332) ++++++.++||||| ..+|||+|+||.|||+||++++++||+|+|||++|++||+||||++|++++|.|+|+||+|+|++ T Consensus 1 L~~~~~~~lLsGCNn~~LL~PkG~ig~eersLIL~a~~~MLlvviPVI~~Tl~FAWKYR~tN~~~~Y~P~W~HS~kiE~V 80 (228) T TIGR01433 1 LSLILASLLLSGCNNLVLLDPKGQIGLEERSLILTAFGLMLLVVIPVILLTLFFAWKYRETNKKADYKPNWHHSTKIEIV 80 (228) T ss_pred CHHHHHHHHHHCCCCCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEE T ss_conf 85778999861256765538888653024668999999999999999999998778866655530378885667777860 Q ss_pred EEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEEEECCCEEE Q ss_conf 02278999999999999887751344420000124467888886689999984369999626787743068963698289 Q gi|254780474|r 98 VWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQVAIINELVVPIDRPLE 177 (332) Q Consensus 98 wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~giatvneL~lPv~~pV~ 177 (332) +|.||++||++||++||.+||+||||||++ ++.+|++|||+|+||||+|+||||||||+||+.+|||+||. T Consensus 81 VW~iP~~Ii~fLg~ltW~~THkLDP~rPL~---------s~~kPI~iEVVAlDWKWLFIYP~~GIAtvNE~A~PVn~Pin 151 (228) T TIGR01433 81 VWTIPILIIVFLGVLTWISTHKLDPYRPLA---------SDVKPITIEVVALDWKWLFIYPDYGIATVNEIAFPVNVPIN 151 (228) T ss_pred EHHHHHHHHHHHHHHHHHHCCCCCCCCCHH---------HCCCCEEEEEEEECCEEEEECCCCCHHHHHHHHCCCCCCEE T ss_conf 212348888787785310001458887642---------45895279998706701341166642416744135988657 Q ss_pred EEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC-CHHHHHHHHHHHH Q ss_conf 999835810110220107403116985148999942774896661332657611365899970-8899999999986 Q gi|254780474|r 178 FRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK-SEKGFEDWISKVK 253 (332) Q Consensus 178 f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~-s~~~f~~Wv~~~k 253 (332) |||||++||||||||+||+|+||||||+|+||++||++|+|+|+||+|||.|||+||||++++ +++.||+||+++| T Consensus 152 Fk~Ts~s~MNsFFIP~LGSqiYaM~GMqTkLHLIAN~~G~Y~GISAnYSG~GFS~M~FKa~a~~~~~~FD~WV~kvK 228 (228) T TIGR01433 152 FKITSNSVMNSFFIPQLGSQIYAMAGMQTKLHLIANEEGVYDGISANYSGPGFSDMKFKAIATLDRAAFDQWVAKVK 228 (228) T ss_pred EEECCCCCCHHHCCCCCCCCEEECCCCCCCHHCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHCC T ss_conf 87754546212220122452244665321011044689872440124467771107453430423445678875059 No 3 >COG1622 CyoA Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion] Probab=100.00 E-value=0 Score=526.55 Aligned_cols=246 Identities=43% Similarity=0.817 Sum_probs=233.2 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 99999999987413643268989899999999999999999999999999999988530467777778753456612553 Q gi|254780474|r 18 FFFLFFSSVMLSGFEFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELF 97 (332) Q Consensus 18 ~~~l~~~~~lLsGC~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~ 97 (332) ++++++..++++||+..+++|+|+++.++.+++..++++|++|+++|+++++||+||||+++++++ |+++||++||++ T Consensus 2 ~~~l~~~~~~l~g~~~~~l~p~~~~~~~~~~~~~~~~~~~~~I~~~V~~~~~~~~~k~R~~~~~~~--~~~~~n~~lEi~ 79 (247) T COG1622 2 ALLLLLVALLLSGCNLTLLDPAGAVAAEQRDLIILSTLLMLVIVLPVIVLLVYFAWKYRASNNAAP--PQWHHNTKLEIV 79 (247) T ss_pred CHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC--CCCCCCEEEEEE T ss_conf 459999999861677544586513567888899999999999999999999998764321478899--874686178987 Q ss_pred EEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEEEECCCEEE Q ss_conf 02278999999999999887751344420000124467888886689999984369999626787743068963698289 Q gi|254780474|r 98 VWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQVAIINELVVPIDRPLE 177 (332) Q Consensus 98 wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~giatvneL~lPv~~pV~ 177 (332) ||++|++|+++|++++|.+++++++++|. ++.|++|+|+|+||+|.|+||+++++++|||++|+|+||+ T Consensus 80 ~t~iP~iiv~~la~~s~~~~~~l~~~~~~-----------~~~~l~i~v~~~qw~W~f~Yp~~~~~t~n~l~lPv~~~V~ 148 (247) T COG1622 80 WTVIPILIVIILAVPTVITTHKLDPVPPG-----------PEIPLTIEVTAYQWKWLFIYPDYGIATVNELVLPVGRPVR 148 (247) T ss_pred EEEHHHHHEEEEEEECHHHEEECCCCCCC-----------CCCCEEEEEEEEEEEEEEECCCCCCCCCCEEEEECCCEEE T ss_conf 24110253112343004221111235888-----------8786799999999999998067675556619985898699 Q ss_pred EEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHCCCC Q ss_conf 99983581011022010740311698514899994277489666133265761136589997088999999999860724 Q gi|254780474|r 178 FRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVKRQGT 257 (332) Q Consensus 178 f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~ 257 (332) |++||+|||||||||+||+|+|+||||++++|+++||+|+|+|+||||||.|||.|+|+++|++++||++|++++|+.+ T Consensus 149 f~ltS~DViHsF~IP~l~~k~d~iPG~~~~~~~~~~~~G~Y~g~Cae~CG~gH~~M~~~v~vvs~~~f~~W~~~~~~~~- 227 (247) T COG1622 149 FKLTSADVIHSFWIPQLGGKIDAIPGMTTELWLTANKPGTYRGICAEYCGPGHSFMRFKVIVVSQEDFDAWVAEVKAAA- 227 (247) T ss_pred EEEEECHHCEEEEECCCCCEEEECCCCEEEEEEECCCCEEEEEECHHHCCCCCCCCEEEEEEECHHHHHHHHHHHHHCC- T ss_conf 9998322402588537786045049925899995589837989827656987245567999975899789998623031- Q ss_pred CCCHHHHHHHHHHHHCCCCEEEC Q ss_conf 56989999976102126321720 Q gi|254780474|r 258 FLNRQEYLLLKKPSERDSVRYFS 280 (332) Q Consensus 258 ~l~~~~y~~l~~ps~~~~~~~f~ 280 (332) .+.+.++++|+++.+.++|+ T Consensus 228 ---~~~~~~~~~~~~~~~~~~~~ 247 (247) T COG1622 228 ---ASAYLALAAPSEALPVTYFA 247 (247) T ss_pred ---HHHHHHCCCCCCCCCCCCCC T ss_conf ---12212204775567776789 No 4 >TIGR02866 CoxB cytochrome c oxidase, subunit II; InterPro: IPR014222 This entry contains subunit II (CoxB) of cytochrome c oxidase. Cytochrome c oxidase is the terminal electron acceptor of mitochondria (and one of several possible acceptors in prokaryotes) in the electron transport chain of aerobic respiration. The enzyme couples the oxidation of reduced cytochrome c to the reduction of molecular oxygen to water. This process results in the pumping of four protons across the membrane that are used in the proton gradient powered synthesis of ATP. The oxidase contains two heme a cofactors and three copper atoms as well as other bound ions .. Probab=100.00 E-value=0 Score=468.18 Aligned_cols=208 Identities=30% Similarity=0.493 Sum_probs=188.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CC--CCCC------------CCCCCCCCCEEEEEEEHHH Q ss_conf 9898999999999999999999999999999999885304-67--7777------------7875345661255302278 Q gi|254780474|r 38 PYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRS-TN--KKAR------------YDPKWCHSTLLELFVWLVP 102 (332) Q Consensus 38 P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~-~~--~~a~------------~~p~~~hs~~lEi~wt~iP 102 (332) |+|++++++.+++++.+.+|++|.+.|.++++|++||||+ +. .+.+ .++++|||++||++|+++| T Consensus 1 p~~~~~~~~~~~~~~~~~~~~~~~l~V~a~~~Y~~~KfR~~~~al~~~~G~A~~~~R~M~E~p~~~~~n~~le~~w~v~p 80 (249) T TIGR02866 1 PAGEIAEQIAKLYLFLLAVAVLISLLVAALLAYVVFKFRRDGKALGDEEGYADKVERIMVEKPSKIHGNRKLEILWTVIP 80 (249) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCEEEEEEEEHHH T ss_conf 94178999997659999999999999999988876035898764334688652365301124675346627775320178 Q ss_pred --HHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCC-------------------- Q ss_conf --9999999999998877513444200001244678888866899999843699996267-------------------- Q gi|254780474|r 103 --LVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQ-------------------- 160 (332) Q Consensus 103 --~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~-------------------- 160 (332) ++|+++|..+++..+..+.-.-| ..+..+..+++++|+|+|+||+|+|+||+. T Consensus 81 a~~~i~~~l~a~~~~~~~~~~~~~p-----dvqa~~a~e~al~v~V~G~Qw~W~f~Y~~~~~~~~~~~~~~kddfvkGdl 155 (249) T TIGR02866 81 AVLIIVVGLIAATVKGLLYLERSIP-----DVQAASANEDALKVKVEGYQWYWDFEYPESRESSSDSAIKDKDDFVKGDL 155 (249) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC-----CCCCCCCCCCCEEEEEEEEEEEEEEEECCCCEEEECCCCCCCCCEECCCE T ss_conf 8999999999987877887641875-----12586235897289999999888998547764330134454310120312 Q ss_pred -CCCCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEE-EECCCE-EEEEECCCCCCCCCCCCEEEE Q ss_conf -87743068963698289999835810110220107403116985148999-942774-896661332657611365899 Q gi|254780474|r 161 -QVAIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHA-VMNKKG-AYSGFSANYSGRGFSHMRFKF 237 (332) Q Consensus 161 -giatvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~-~a~~~G-~y~G~cae~cG~ghs~M~f~v 237 (332) +..+.|+|+||+|++|+|++||+|||||||||+||+|+||+||++|.+|| +||||| +|.|+|+||||.|||.|.|+| T Consensus 156 k~~~~~N~lv~P~~~~V~~~vTs~DVIHsF~vP~l~~K~DAiPG~~n~~~~k~a~~pGk~Y~g~CaElCG~gHs~M~~~v 235 (249) T TIGR02866 156 KGFTTVNELVVPADTDVELQVTSKDVIHSFWVPELGGKIDAIPGQLNALWFKNADEPGKVYYGYCAELCGAGHSLMLFKV 235 (249) T ss_pred EEEECCCEEEECCCCEEEEEEECCCEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEEEEEEECCCCCCCCEEE T ss_conf 46640545894168538999943741564303554884205887211588964687884430075543178746461079 Q ss_pred EECCHHH-HHHHHH Q ss_conf 9708899-999999 Q gi|254780474|r 238 YGKSEKG-FEDWIS 250 (332) Q Consensus 238 ~v~s~~~-f~~Wv~ 250 (332) +|+++|| ||+|++ T Consensus 236 ~v~~~eeffd~~~~ 249 (249) T TIGR02866 236 VVVDKEEFFDAYIE 249 (249) T ss_pred EEECCCHHHHHHCC T ss_conf 98177005677509 No 5 >TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II; InterPro: IPR006332 Members of this family catalyze the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in Gram-positive bacteria of the Bacillus/Staphylococcus group . Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes similar to the cytochrome c oxidases (as opposed to cytochromes b).; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006122 mitochondrial electron transport ubiquinol to cytochrome c. Probab=100.00 E-value=0 Score=443.38 Aligned_cols=225 Identities=35% Similarity=0.651 Sum_probs=211.9 Q ss_pred HHHHHHHHHHCC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--CCCCCCCCCCCCCEEEE Q ss_conf 999999987413-643268989899999999999999999999999999999988530467--77777875345661255 Q gi|254780474|r 20 FLFFSSVMLSGF-EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTN--KKARYDPKWCHSTLLEL 96 (332) Q Consensus 20 ~l~~~~~lLsGC-~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~--~~a~~~p~~~hs~~lEi 96 (332) +|+....+|||| +..+|+|+||+|.+|.+|++.+.+.||+|+..|++|...+++|||.++ +...|.|+.|||..||+ T Consensus 2 ~Llg~~~llsGCs~~~vl~~KGPVA~~Q~~LIlySI~FMLvIv~vv~~lF~I~l~KY~~~KgkD~~~~~p~~H~~~~lE~ 81 (229) T TIGR01432 2 ALLGVVILLSGCSNIEVLNPKGPVAESQSDLILYSIVFMLVIVLVVFVLFTIILVKYRERKGKDNSAYEPEMHGNAILEV 81 (229) T ss_pred HHHEEEEEEECCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEHH T ss_conf 02226642335766347668888445542015678999999999999988875322000135454445876455420003 Q ss_pred EEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEEEECCCEE Q ss_conf 30227899999999999988775134442000012446788888668999998436999962678774306896369828 Q gi|254780474|r 97 FVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQVAIINELVVPIDRPL 176 (332) Q Consensus 97 ~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~giatvneL~lPv~~pV 176 (332) +|++||++|+++|+++|.++.|.++-- -....+++|+.|-.++-+|||.|.|||+.|.|+|.|.+|+|||| T Consensus 82 ~W~vIPi~IV~ALaiPTV~T~Y~~E~a---------P~~t~~KdP~vvYA~S~d~KW~FsYPd~~IETVN~l~IP~DRPv 152 (229) T TIGR01432 82 VWTVIPIIIVIALAIPTVKTIYDLEKA---------PKATKEKDPLVVYATSADWKWVFSYPDEDIETVNYLNIPKDRPV 152 (229) T ss_pred HHHHHHHHHHHHHCCCHHHHHHCCCCC---------CCCCCCCCCEEEEEEECCEEEEEECCCCCCCEEEEEECCCCCCE T ss_conf 545899999987303524555224678---------75556899879998751817887468987223444527899856 Q ss_pred EEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 99998358101102201074031169851489999427748966613326576113658999708899999999986 Q gi|254780474|r 177 EFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVK 253 (332) Q Consensus 177 ~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~k 253 (332) .|+|+|.|-|.||||||||||.|||.||...+.|+|++.|+|+|+.|||.|.|||..+|+|.|+|+.||+.|+++.| T Consensus 153 lfKl~s~DtMtS~WiPQLGGQKYAM~GM~M~~~L~A~~~GTy~GRNaNF~GEgFa~Q~F~V~A~s~KDf~~WvK~~~ 229 (229) T TIGR01432 153 LFKLSSADTMTSFWIPQLGGQKYAMAGMTMNLYLQADKVGTYKGRNANFNGEGFADQEFDVNAVSEKDFDKWVKETK 229 (229) T ss_pred EEEECCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHCCCC T ss_conf 88721101012202234676156665445654420045565235466657876222325788631211342300379 No 6 >MTH00140 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=422.11 Aligned_cols=200 Identities=20% Similarity=0.323 Sum_probs=182.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ...+.+|+++++++...++...+.++++|++.|..+++++++++|.++ ...||++||++||++|++|+++|++ T Consensus 6 ql~~q~~~S~~~~~i~~~h~~~~~i~~~I~~~V~~~~~~~~~~~~~~~-------~~~~~~~LEi~WTiiP~lil~~l~~ 78 (227) T MTH00140 6 QLGFQDAASPLMEELIFFHDHAMVVLIMILTFVMYMGLLLMFNKFSCR-------NILEAQRLETIWTIVPALILVFLAL 78 (227) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-------CCCCCCEEEEHHHHHHHHHHHHHHH T ss_conf 079988689899999999999999999999999999999999537646-------3667734320399999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC---------------------CCCEEEEE Q ss_conf 999887751344420000124467888886689999984369999626787---------------------74306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV---------------------AIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi---------------------atvneL~l 170 (332) ++++.++.+|. ..+++++|+|+|+||+|.|+|||.+. .++|+|++ T Consensus 79 ps~~~ly~~~~--------------~~~~~~tikv~G~QW~W~yeY~~~~~~~~ds~m~~~~~~~~g~~~ll~vdn~lvl 144 (227) T MTH00140 79 PSLRLLYLLDE--------------TNNPLLTVKSIGHQWYWSYEYSDFLDIEFDSYMIPTNELELGDFRLLEVDHRLVL 144 (227) T ss_pred HHHHHHHHCCC--------------CCCCCEEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEECCCCCEEE T ss_conf 99999971142--------------6788567898885216899950687754311356632136798510224551577 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||||+||.+||.|+++|+|++++||++|++ T Consensus 145 P~~~~vr~~vtS~DVIHSf~vP~lg~K~DavPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~Mpi~V~vv~~~~f~~Wv~ 224 (227) T MTH00140 145 PMKTNTRVIVTSADVLHSWTVPSLGVKVDAIPGRLNQLSFKVSRPGVFYGQCSEICGANHSFMPIVVEAVPMKYFIKWLE 224 (227) T ss_pred EECCEEEEEEECCCCHHCCCCCCCCEEEECCCCCCEEEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEECHHHHHHHHH T ss_conf 60577898873263012013542160553379850799998189887999846753813068817999988999999999 Q ss_pred HH Q ss_conf 98 Q gi|254780474|r 251 KV 252 (332) Q Consensus 251 ~~ 252 (332) .. T Consensus 225 ~~ 226 (227) T MTH00140 225 AM 226 (227) T ss_pred HC T ss_conf 65 No 7 >MTH00129 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=419.73 Aligned_cols=204 Identities=19% Similarity=0.330 Sum_probs=180.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ....-+|+++++++...++...+.++++|.+.|+.+++++..+.+. ....+||++||++||+||++|++++++ T Consensus 6 Ql~fqd~~S~ime~i~~~hd~vm~ii~~I~~~V~~~l~~~~~~~~~-------~k~~~~n~~LEiiWTiiP~lIL~~ia~ 78 (230) T MTH00129 6 QLGFQDAASPVMEELLHFHDHALMIVFLISTLVLYIIVAMVSTKLT-------NKYILDSQEIEIIWTVLPAVILILIAL 78 (230) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------CCCCCCCCEEEEHHHHHHHHHHHHHHH T ss_conf 0899898998999999999889999999999999999999971677-------555568833440168999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++.++.+|. .+++.++|+|+|+||+|.|+|||.+ ..++|+|++ T Consensus 79 pS~~~ly~~~~--------------~~~~~ltIkv~G~QW~W~yeY~d~~~~~~~sym~~~~~~~~~~~rll~vdn~lvl 144 (230) T MTH00129 79 PSLRILYLMDE--------------INDPHLTIKAMGHQWYWSYEYTDYEDLGFDSYMIPTQDLTPGQFRLLETDHRMVV 144 (230) T ss_pred HHHHHHHHHCC--------------CCCCCEEEEEEEEEEEEEEECCCCCCCCCEECCCCHHHCCCCCCEEECCCCCEEE T ss_conf 99999998212--------------5799769998758999999738988753110467856706787257603573277 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++||+++++|+|+||||||||.+||.|+++|++++++||++|++ T Consensus 145 P~~~~v~~~~tS~DViHsf~vP~l~~k~davPG~~n~~~~~~~~~G~y~g~CsElCG~~Hs~M~~~v~~v~~~~f~~Wl~ 224 (230) T MTH00129 145 PVESPIRVLVSAEDVLHSWAVPALGVKMDAVPGRLNQTAFIASRPGVFYGQCSEICGANHSFMPIVVEAVPLEHFENWSS 224 (230) T ss_pred ECCCEEEEEEECCCHHHHEECHHHCCEECCCCCCCEEEEEECCCCEEEEEECHHHCCCCCCCCEEEEEEECHHHHHHHHH T ss_conf 50668999961474245043101162210579842589998289888999817642813158968999987999999997 Q ss_pred HHHCCC Q ss_conf 986072 Q gi|254780474|r 251 KVKRQG 256 (332) Q Consensus 251 ~~ka~~ 256 (332) .+++++ T Consensus 225 ~~~~~a 230 (230) T MTH00129 225 LMLEDA 230 (230) T ss_pred HHHHCC T ss_conf 566449 No 8 >MTH00038 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=420.48 Aligned_cols=202 Identities=19% Similarity=0.368 Sum_probs=184.6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ...+.+|+++++++...++...+.++++|++.|+.+++++.++++.++ ...||++||++||++|++|+++|++ T Consensus 6 ql~fqd~~Sp~me~i~~fHd~~m~i~~~I~~~V~~~l~~~~~~~~~~~-------~~~~n~~LEi~WTiiP~iil~~i~~ 78 (229) T MTH00038 6 QLGLQDASSPLMEELIYFHDYALIILTLITILVFYGLASLLFSSNTNR-------FFLEGQGLETIWTVIPAFILIFIAL 78 (229) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------CCCCCCEEHHHHHHHHHHHHHHHHH T ss_conf 179969789899999999999999999999999999999999737655-------3457823004399889999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++..+.+|. ..++.++|+|+|+||+|.|+|||++ ..++|+|++ T Consensus 79 ps~~~ly~~~~--------------~~~~~~~ikv~G~QW~W~yeY~d~~~~~~ds~~~~~~d~~~g~~rll~vdn~lvl 144 (229) T MTH00038 79 PSLQLLYLMDE--------------VNNPFLTIKAIGHQWYWSYEYTDYNDLEFDSYMVPTSDLSSGNPRLLEVDNRLVL 144 (229) T ss_pred HHHHHHHHCCC--------------CCCCCCCEEEEEEEEEEEEECCCCCCCEEEECCCCHHHCCCCCCEEECCCCCEEE T ss_conf 99999982543--------------5687643013687667999814787522576156821203688413303663355 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|+.++++|+|+|||||+||.+||.|+++|+|++++||++|++ T Consensus 145 P~~~~vrl~vtS~DVIHSf~vP~lg~K~DaiPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~Mpi~V~~v~~~~f~~Wv~ 224 (229) T MTH00038 145 PSQTPIRVLVSSSDVLHSWAVPSLGVKMDAVPGRLNQVTFFISRPGLFYGQCSEICGANHSFMPIVIESVPFNTFENWVS 224 (229) T ss_pred EEECCCEEEEEECCCCCCCCCCCCCEEEECCCCCCEEEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEECHHHHHHHHH T ss_conf 54013168876030000243541162543479852799998289667999826663804068816999979999999997 Q ss_pred HHHC Q ss_conf 9860 Q gi|254780474|r 251 KVKR 254 (332) Q Consensus 251 ~~ka 254 (332) +.+. T Consensus 225 ~~~e 228 (229) T MTH00038 225 NFLE 228 (229) T ss_pred HHHC T ss_conf 4122 No 9 >MTH00154 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=417.55 Aligned_cols=199 Identities=19% Similarity=0.359 Sum_probs=182.6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ...+.+|+++++++...++...++++++|++.|.++++++.++++.++ ...||++||++||++|++|++++++ T Consensus 6 ql~fq~~aS~~me~i~~~Hd~~m~i~~~I~~~V~~~l~~~~~~~~~~~-------~~~~n~~LEi~WTiiP~liL~~ia~ 78 (226) T MTH00154 6 SLSFQDSASPLMEQLIFFHDHTMMILILITVLVGYMMISLLFNKFTNR-------FLLEGQEIEIIWTILPAIILIFIAL 78 (226) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-------CCCCCCEEEEEHHHHHHHHHHHHHH T ss_conf 279999899899999999999999999999999999999998477757-------7758833563247779999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC---------------------CCCEEEEE Q ss_conf 999887751344420000124467888886689999984369999626787---------------------74306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV---------------------AIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi---------------------atvneL~l 170 (332) ++++.++.+|- .+++.++|+|+|+||+|.|+|||.+. .++|+|++ T Consensus 79 pS~~~ly~~~~--------------~~~~~ltikv~G~QW~W~yeY~d~~~~~fdSym~~~~~~~~~~~rll~vdn~lvl 144 (226) T MTH00154 79 PSLRLLYLLDE--------------VNNPSITLKTIGHQWYWSYEYSDFNNIEFDSYMIPTNELENNGFRLLDVDNRLVL 144 (226) T ss_pred HHHHHHHHCCC--------------CCCCCEEEEEEEEEEEEEEECCCCCCCEEEEECCCCCCCCCCCEEEEECCCCEEE T ss_conf 99999997030--------------5798679998775446998758877730564044443013697213303661465 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||+||||.+||.|+++|+|+++++|++|++ T Consensus 145 P~~~~ir~~itS~DViHS~~vP~lgvK~DavPGr~n~~~~~~~~~g~y~g~CsE~CG~~Hs~Mpi~v~vv~~~~f~~Wv~ 224 (226) T MTH00154 145 PMNTQIRILVTSADVIHSWTVPSLGVKVDAVPGRLNQLNFLINRPGLFFGQCSEICGANHSFMPIVIESVSVKNFINWIK 224 (226) T ss_pred EEEEEEEEEEECCCHHHHEECCCCCEEEECCCCCEEEEEEEECCCEEEEEEEHHHCCCCCCCCEEEEEEECHHHHHHHHH T ss_conf 55124568874585266153553136741489843899999689667999816663804167907999988999999997 Q ss_pred H Q ss_conf 9 Q gi|254780474|r 251 K 251 (332) Q Consensus 251 ~ 251 (332) + T Consensus 225 ~ 225 (226) T MTH00154 225 S 225 (226) T ss_pred H T ss_conf 2 No 10 >MTH00185 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=415.83 Aligned_cols=203 Identities=19% Similarity=0.343 Sum_probs=181.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) .....+|+++++++...++..+++++++|.+.|..+++++..++|.++ ...||++||++||+||++|++++++ T Consensus 6 ql~fqd~aSpime~i~~fHd~~m~i~~~I~~~V~~~~~~~~~~~~~~~-------~~~~~~~LEi~WTiiP~iiL~~ia~ 78 (229) T MTH00185 6 QLGLQDAASPVMEELIHFHDHTLMIVFLISTLVLYIIVAMVTTKLTNK-------YILDSQEIEIVWTILPAIILIMIAL 78 (229) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------CCCCCCEEEHHHHHHHHHHHHHHHH T ss_conf 279988589899999999989999999999999999999999647888-------6768712002179999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++..+.+|. ..++.++|+|+|+||+|.|+|||.+ ..++|+|++ T Consensus 79 pS~~~ly~~d~--------------~~~~~~~ikv~G~QW~W~yeY~d~~~~~~ds~~~~~~~~~~~~~rlL~vdn~lvl 144 (229) T MTH00185 79 PSLRILYLMDE--------------INDPHLTIKAMGHQWYWSYEYTDYEDLEFDSYMIPTQDLTPGQFRLLETDHRMVV 144 (229) T ss_pred HHHHHHHHCCC--------------CCCCCEEEEEEEEEEEEEEEECCCCCCCCEECCCCHHHCCCCCCCCCCCCCCEEE T ss_conf 99999872442--------------4688758999999877789804677742200246622216677312246663598 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||||+||.+||.|+++|+|+++++|++|++ T Consensus 145 P~~~~Vr~~itS~DVIHSf~vP~l~~K~DaiPGr~n~~~~~~~~~G~y~g~Cse~CG~~Hs~M~~~v~~v~~~~f~~W~~ 224 (229) T MTH00185 145 PMESPIRVLITAEDVLHSWTVPALGVKMDAVPGRLNQTAFIISRPGVYYGQCSEICGANHSFMPIVVEAVPLEHFENWSS 224 (229) T ss_pred EEEEEEEEEEEECCCCEEEECCHHCCEECCCCCCCEEEEEEECCCEEEEEECHHHCCCCCCCCEEEEEEECHHHHHHHHH T ss_conf 63027999987064310022411053200579830799999379887999857873947068817999988999999997 Q ss_pred HHHCC Q ss_conf 98607 Q gi|254780474|r 251 KVKRQ 255 (332) Q Consensus 251 ~~ka~ 255 (332) +++++ T Consensus 225 ~~~~e 229 (229) T MTH00185 225 LMLEE 229 (229) T ss_pred HHHCC T ss_conf 41329 No 11 >MTH00051 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=411.54 Aligned_cols=214 Identities=21% Similarity=0.346 Sum_probs=184.7 Q ss_pred HHHHHHHHHCC----CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEE Q ss_conf 99999987413----64326898989999999999999999999999999999998853046777777875345661255 Q gi|254780474|r 21 LFFSSVMLSGF----EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLEL 96 (332) Q Consensus 21 l~~~~~lLsGC----~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi 96 (332) ++...+.++.. ....-+|++|++++...++...+++++.|.+.|+.+++++.... .+.+...||+.||+ T Consensus 6 ~~~~~~~~~~~p~~wql~fQd~~Sp~me~i~~fHd~~m~ii~~I~~~V~~~~~~~~~~~-------~~~~~~~~n~~lEi 78 (247) T MTH00051 6 LFCLLLGLKDAPEPWQLGFQDAASPVMEEIIFFHDQIMFILTIIITTVLWLIIRALTTK-------HYHRYLFEGTLIEI 78 (247) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-------CCCCCCCCCHHHHH T ss_conf 46878876489961042898989989999999987999999999999999999999637-------88673406711446 Q ss_pred EEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC-------------- Q ss_conf 302278999999999999887751344420000124467888886689999984369999626787-------------- Q gi|254780474|r 97 FVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV-------------- 162 (332) Q Consensus 97 ~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi-------------- 162 (332) +||++|++|+++|++++.+..+.+|. ..+..++|+|+|+||+|.|+|||.+. T Consensus 79 ~WTiiP~lILi~ia~PS~~~Ly~~de--------------~~~p~~tiKviG~QWyWsYeY~d~~~~~~~FdSym~~~~~ 144 (247) T MTH00051 79 IWTLIPAIILIFIAFPSLKLLYLMDE--------------VIDPALTIKAIGHQWYWSYEYSDYGTDTIEFDSYMVPTSD 144 (247) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC--------------CCCCCCEEEEEEEEEEEEEECCCCCCCEEEEEECCCCHHH T ss_conf 69999999999999999999882344--------------7888511689998567999757688751247742478556 Q ss_pred ---------CCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCC Q ss_conf ---------74306896369828999983581011022010740311698514899994277489666133265761136 Q gi|254780474|r 163 ---------AIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHM 233 (332) Q Consensus 163 ---------atvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M 233 (332) .++|+|++|+|++|+|.+||+|||||||||+||+|+||+|||.|++|+.+++||+|+|||+|+||.+||.| T Consensus 145 l~~~~~rLLevDn~lvlP~~~~Ir~lvTS~DViHS~~vPslgvK~DaiPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~M 224 (247) T MTH00051 145 LNSGDLRLLEVDNRLVVPIQTQVRVLVTGADVLHSFAVPSLSVKIDAVPGRLNQTSFFIKRPGVFYGQCSEICGANHSFM 224 (247) T ss_pred HCCCCCCCCCCCCCEEEEEECCCEEEEECCCCCEEEECCCCCCEECCCCCCCEEEEEECCCCEEEEEEEHHHCCCCCCCC T ss_conf 24577520146783699860440478861653111213322643014798216899984897658998077528572788 Q ss_pred EEEEEECCHHHHHHHHHHHHCC Q ss_conf 5899970889999999998607 Q gi|254780474|r 234 RFKFYGKSEKGFEDWISKVKRQ 255 (332) Q Consensus 234 ~f~v~v~s~~~f~~Wv~~~ka~ 255 (332) +++|+|+++++|++|++++|++ T Consensus 225 pi~v~~v~~~~f~~W~~~~~~e 246 (247) T MTH00051 225 PIVIEGVSLDKYINWVASQSEE 246 (247) T ss_pred EEEEEEECHHHHHHHHHHHHHC T ss_conf 3899998899999999976543 No 12 >MTH00117 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=409.61 Aligned_cols=200 Identities=20% Similarity=0.346 Sum_probs=179.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ...+.+|++|++++...++..+++++++|++.|+.+++++.++.+..+ ...||.+||++||++|++|+++|++ T Consensus 6 ql~fqd~aSp~me~i~~~Hd~~m~i~~~I~~~V~~~l~~~~~~k~~~~-------~~~~~~~LEiiWTviP~liL~~ia~ 78 (227) T MTH00117 6 QLGFQDASSPIMEELLEFHDHALMVALLISSLVLYLLTLMLTTKLTHT-------NTVDAQEVELIWTILPAIVLILIAL 78 (227) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-------CCCCCEEEEEEHHHHHHHHHHHHHH T ss_conf 289999899899999999988999999999999999999998266778-------6768705775458889999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++.++.+|. .+++.++|+|+|+||+|.|+|||.+ ..++|+|++ T Consensus 79 pS~~~ly~~~~--------------~~~~~~tikv~G~QWyW~yeY~d~~~~~f~s~~~~~~~~~~g~~rll~vdn~lvl 144 (227) T MTH00117 79 PSLRILYLMDE--------------INEPDLTIKAIGHQWYWSYEYTDFKDLSFDSYMIPTQDLPNGHFRLLEVDHRMVI 144 (227) T ss_pred HHHHHHHHCCC--------------CCCCCEEEEEEEEEEEEEEECCCCCCCEEEECCCCHHHHCCCCCEEEECCCCEEE T ss_conf 99999996334--------------7998768997457535898746887623353367634512587326612682488 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||+||||.+||.|+++|++++++||++|++ T Consensus 145 P~~~~v~~~vtS~DViHsf~vP~lg~k~davPG~~n~~~~~~~~~G~y~g~CsElCG~~Hs~M~~~v~~v~~~~f~~W~~ 224 (227) T MTH00117 145 PMESPIRILITAEDVLHSWAVPSLGVKTDAIPGRLNQTSFITTRPGVFYGQCSEICGANHSFMPIVVESTPLKHFENWSS 224 (227) T ss_pred EEEEEEEEEEECCCHHHHEECCHHCCEECCCCCCCEEEEEECCCCEEEEEECHHHCCCCCCCCEEEEEEECHHHHHHHHH T ss_conf 77546888986485276430511162210469832689997489778999816661837168867999978999999998 Q ss_pred HH Q ss_conf 98 Q gi|254780474|r 251 KV 252 (332) Q Consensus 251 ~~ 252 (332) .. T Consensus 225 ~~ 226 (227) T MTH00117 225 LL 226 (227) T ss_pred HC T ss_conf 41 No 13 >MTH00139 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=412.30 Aligned_cols=197 Identities=19% Similarity=0.326 Sum_probs=177.3 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHHH Q ss_conf 32689898999999999999999999999999999999885304677777787534566125530227899999999999 Q gi|254780474|r 34 IVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAVIT 113 (332) Q Consensus 34 ~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~~t 113 (332) ..-+|+++++++...++...+.++++|++.|..+++++.++.|.++. ..||++||++||++|++|+++|++++ T Consensus 4 ~fq~~~S~~me~i~~fhd~~~~i~~~I~~~V~~~~~~~~~~~~~~~~-------~~~n~~lEi~WTiiP~iiL~~la~ps 76 (223) T MTH00139 4 WFQDGASPLMEQTIFFHDYVMIFLVVVLTLVLYASIWLLYNSFTSRT-------LLESHQLEVAWTLLPAVVLIFLALPS 76 (223) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-------CCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 27798998999999999999999999999999999999994678677-------74750123578999999999999999 Q ss_pred HHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC-----------------------CCCCEEEEE Q ss_conf 988775134442000012446788888668999998436999962678-----------------------774306896 Q gi|254780474|r 114 WDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ-----------------------VAIINELVV 170 (332) Q Consensus 114 ~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g-----------------------iatvneL~l 170 (332) ++.++.+|. ..+++++|+|+|+||+|.|+|||.. ..++|+|++ T Consensus 77 ~~~ly~~~~--------------~~~~~~~ikV~G~QWyW~yeY~d~~~~~~~~dsym~~~~~~~~g~~rll~vdn~lvl 142 (223) T MTH00139 77 LNLLYLSDN--------------VLDPSITLKVVGHQWFWSYEYSDFPNGPLEFDSYMVPSSDLSSGDFRLLEVDNRIVL 142 (223) T ss_pred HHHHHHHCC--------------CCCCCEEEEEEEEEEEEEEECCCCCCCCCEEEECCCCHHHCCCCCCCEECCCCCEEE T ss_conf 999986121--------------689862589999877788864777898321550336700104576320314773253 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|+.++++|+|+|||||+||.+||.|+++|+++++++|++|++ T Consensus 143 P~~~~vr~~itS~DViHSf~vP~lgvK~DavPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~Mpi~v~~v~~~~f~~Wv~ 222 (223) T MTH00139 143 PSGSNIRALVTSADVIHSWSVPSLGVKIDAIPGRLNSLSFFSNRVGVYYGQCSEICGANHSFMPICVEFILPQSFIEWVL 222 (223) T ss_pred ECCCEEEEEEEECCCCEEEECCCCCEEECCCCCCEEEEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEECHHHHHHHHC T ss_conf 01301489996075432232341261510699962899999289677999885445815068937999988999999965 Q ss_pred H Q ss_conf 9 Q gi|254780474|r 251 K 251 (332) Q Consensus 251 ~ 251 (332) | T Consensus 223 E 223 (223) T MTH00139 223 E 223 (223) T ss_pred C T ss_conf 9 No 14 >MTH00023 COX2 cytochrome c oxidase subunit II; Validated Probab=100.00 E-value=0 Score=410.46 Aligned_cols=212 Identities=21% Similarity=0.329 Sum_probs=185.9 Q ss_pred HHHHHHHCCC---------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCE Q ss_conf 9999874136---------4326898989999999999999999999999999999998853046777777875345661 Q gi|254780474|r 23 FSSVMLSGFE---------FIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTL 93 (332) Q Consensus 23 ~~~~lLsGC~---------~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~ 93 (332) +.+.+.|+|. ...-+|++|++++...++...++++++|++.|+.++++.....+ +..+..||+. T Consensus 4 ~~~~~~s~~~~~d~P~~wql~fQd~aSpime~i~~fHd~~m~ii~~I~~~V~~~l~~~~~~~~-------~~~~~~~n~~ 76 (247) T MTH00023 4 LFFYIVNVCGKRDIPEPWQLGFQDAADPVMEEIVFFHDQIMFLLIIIVTVVLWLIVEALKGKF-------YDRYLVDGTL 76 (247) T ss_pred HHHHHHCCCCCCCCCCHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-------CCCCCCCCHH T ss_conf 245365262226899511259988489899999999989999999999999999999986378-------8763557633 Q ss_pred EEEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC----------- Q ss_conf 255302278999999999999887751344420000124467888886689999984369999626787----------- Q gi|254780474|r 94 LELFVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV----------- 162 (332) Q Consensus 94 lEi~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi----------- 162 (332) ||++||++|++|++++++++.+..+.+|. ..+..++|+|+|+||+|.|+|||++. T Consensus 77 lEi~WTiiP~lILv~ia~PS~~lLy~~d~--------------~~~p~ltiKviG~QWyWsYeY~d~~~~~~~FdSymi~ 142 (247) T MTH00023 77 LEIVWTIIPAVILVFIALPSLKLLYLMDE--------------VVSPALTIKAIGHQWYWSYEYSDYEGETLEFDSYMVP 142 (247) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCC--------------CCCCCEEEEEEEEEEEEEEECCCCCCCEEEEEECCCC T ss_conf 44669999999999999999998723445--------------7998679999998678999757778872357833367 Q ss_pred ------------CCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCC Q ss_conf ------------74306896369828999983581011022010740311698514899994277489666133265761 Q gi|254780474|r 163 ------------AIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGF 230 (332) Q Consensus 163 ------------atvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~gh 230 (332) .++|+|++|+|+||+|.+||+|||||||||+||+|+||+|||.|++|+.++++|+|+|||+|+||.+| T Consensus 143 ~~~~~~~~~rLLevDn~lvlP~~~~ir~~vtS~DViHS~~vP~lgvK~DavPGr~n~~~~~~~~~G~y~GqCsElCG~~H 222 (247) T MTH00023 143 TSDLNSGDFRLLEVDNRLVVPINTHVRILVTGADVLHSFAVPSLGLKIDAVPGRLNQTGVFIKRPGVFYGQCSEICGANH 222 (247) T ss_pred HHHCCCCCCHHHCCCCCEEEEECCCCEEEEECCCCCCEEECCCCCCEECCCCCCEEEEEEECCCCEEEEEECHHHCCCCC T ss_conf 43325675222067886899974121799962764100332311513103698106899974897789998186709671 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCC Q ss_conf 1365899970889999999998607 Q gi|254780474|r 231 SHMRFKFYGKSEKGFEDWISKVKRQ 255 (332) Q Consensus 231 s~M~f~v~v~s~~~f~~Wv~~~ka~ 255 (332) |.|++.++++++++|++|+++.+.+ T Consensus 223 s~Mpi~v~~vs~~~f~~Wv~~~~~s 247 (247) T MTH00023 223 SFMPIVIEAVSLDRYINWLLSLSNT 247 (247) T ss_pred CCCEEEEEEECHHHHHHHHHHHHCC T ss_conf 6880799998999999999971548 No 15 >MTH00168 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=407.61 Aligned_cols=199 Identities=19% Similarity=0.348 Sum_probs=172.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) +....+|+++++++...++...++++++|.+.|+.+++++.. ++.. .....||++||++||++|++|+++|++ T Consensus 6 ql~fq~~~S~~m~~i~~~Hd~~m~ii~~I~~~V~~~~~~~~~----~~~~---~~~~~~~~~LEiiWTiiP~iiL~~la~ 78 (225) T MTH00168 6 QLGLQDAASPVMEEMVLFHDHVLLVLVFILCLIFYSMLVLSS----SKYV---YRFYTDSHVIETVWTVIPAFILVVLAL 78 (225) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HCCC---CCCCCCCCEEEEHHHHHHHHHHHHHHH T ss_conf 079988689999999999879999999999999999999997----2688---553557826751288889999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC---------------------CCCEEEEE Q ss_conf 999887751344420000124467888886689999984369999626787---------------------74306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV---------------------AIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi---------------------atvneL~l 170 (332) ++++..+.+|. .++.+++|+|+|+||+|.|+|||.+. .++|+|++ T Consensus 79 ps~~~ly~~~~--------------~~~~~~~ikv~G~QW~W~yeY~d~~~~~~dsym~~~~~~~~g~~rll~vDn~lvl 144 (225) T MTH00168 79 PSLKLLYLMDE--------------LDSPDLTIKAVGHQWYWSYEYGDYYDIEFDSYMLPLGDLSGGDFRLLEVDNRVVL 144 (225) T ss_pred HHHHHHHCCCC--------------CCCCCEEEEEEEEEEEEEEEECCCCCCCCHHCCCCCCCCCCCCEEEEECCCCEEE T ss_conf 99999854564--------------6888557999998778998804777641201056722125775057604671687 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||||+||.+||.|+++|+|++++||++|++ T Consensus 145 P~~~~Vr~~itS~DViHS~~vP~lg~K~DavPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~Mpi~v~~vs~~~f~~Wv~ 224 (225) T MTH00168 145 PVDTSIRMLVTSADVIHSWTLPSLGLKVDAVPGRLNQLALQSSRPGSFYGQCSEICGANHSFMPIHVEAVPWETFESWCD 224 (225) T ss_pred EEEEEEEEEEECCCCCEEEECCCCCCEECCCCCCEEEEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEECHHHHHHHHC T ss_conf 65038999965374311243341251501699867999998699777999717766832368957999988999999866 Q ss_pred H Q ss_conf 9 Q gi|254780474|r 251 K 251 (332) Q Consensus 251 ~ 251 (332) . T Consensus 225 t 225 (225) T MTH00168 225 T 225 (225) T ss_pred C T ss_conf 7 No 16 >MTH00076 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=407.85 Aligned_cols=201 Identities=21% Similarity=0.338 Sum_probs=178.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ...+-+|+++++++...++..++.++++|.+.|..++.++.++.|.+++ ..++..||++||++|++|+++|++ T Consensus 6 qlgfqd~aSpime~i~~fhd~~m~i~~~I~~~V~~~~~~~~~~k~~~~~-------~~~~~~LEi~WTiiP~iIl~~ia~ 78 (228) T MTH00076 6 QLGFQDAASPIMEELLHFHDHALMAVFLISTLVLYIITIMMTTKLTNTN-------AMDAQEIEMVWTIMPAIILIVIAL 78 (228) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-------CCCCCEEEHHHHHHHHHHHHHHHH T ss_conf 1799886899999999999899999999999999999999997147787-------878955302489999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++..+.+|. ..+..++|+|+|+||+|.|+|||.+ ..+.|+|++ T Consensus 79 pS~~~ly~~~~--------------~~~~~~tikv~G~QWyW~yeY~d~~~~~~dSym~~~~~~~~g~~rll~vdn~lvl 144 (228) T MTH00076 79 PSLRILYLMDE--------------INDPHLTVKAIGHQWYWSYEYTNYEDLSFDSYMIPTQDLTPGQFRLLEVDNRMVV 144 (228) T ss_pred HHHHHHHHCCC--------------CCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECCCCCEEE T ss_conf 99999982534--------------6798633100144457778834677723245667601026787234215675287 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|+.++++|+|+|||||+||.+||.|+++|+|++++||++|++ T Consensus 145 P~~~~Vr~~itS~DViHsf~vP~lg~K~DavPG~~n~~~~~~~~~G~y~g~CsElCG~~Hs~M~~~v~~v~~~~f~~W~~ 224 (228) T MTH00076 145 PMESPIRMLITAEDVLHSWAVPSLGIKTDAIPGRLNQTTFIASRPGVYYGQCSEICGANHSFMPIVVEATPLKNFLNWSS 224 (228) T ss_pred EECCEEEEEEECCCCCEEEECCCCCCEEECCCCCEEEEEEECCCCEEEEEEEHHHCCCCCCCCCEEEEEECHHHHHHHHH T ss_conf 60002889988153212243441273310479821789998289887999808873957278827999989999999998 Q ss_pred HHH Q ss_conf 986 Q gi|254780474|r 251 KVK 253 (332) Q Consensus 251 ~~k 253 (332) +++ T Consensus 225 ~~~ 227 (228) T MTH00076 225 SML 227 (228) T ss_pred HCC T ss_conf 545 No 17 >MTH00098 COX2 cytochrome c oxidase subunit II; Validated Probab=100.00 E-value=0 Score=405.66 Aligned_cols=200 Identities=19% Similarity=0.317 Sum_probs=177.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ...+.+|+++++++...++...++++++|.+.|+.+++++..+.+.. ....||++||++||++|++|++++++ T Consensus 6 Ql~fqd~aSp~~e~l~~fHd~~m~i~~~I~~~V~~~l~~~~~~k~~~-------~~~~~~~~LEi~WTiiP~liL~~ia~ 78 (227) T MTH00098 6 QLGFQDATSPIMEELLHFHDHTLMIVFLISSLVLYIISLMLTTKLTH-------TSTMDAQEVETIWTILPAIILILIAL 78 (227) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------CCCCCCCEEEEEHHHHHHHHHHHHHH T ss_conf 36888989989999999985799999999999999999999714565-------65667724430069999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++..+.+|. .+++.++|+|+|+||+|.|+|||.+ ..++|+|++ T Consensus 79 ps~~~ly~~~~--------------~~~~~ltvkv~G~QWyW~yeY~d~~~~~fdSym~~~~~~~~g~~rll~vdn~lvl 144 (227) T MTH00098 79 PSLRILYMMDE--------------INNPSLTVKTMGHQWYWSYEYTDYEDLSFDSYMIPTQDLKPGELRLLEVDNRVVL 144 (227) T ss_pred HHHHHHHHHCC--------------CCCCCEEEEEEEEEEEEEEECCCCCCEEEEECCCCHHHCCCCCCEEEECCCCEEE T ss_conf 99999997023--------------6899879999998757999758888605552346710148887137722684598 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|+||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||+|+||.+||.|+++++|+++++|++|++ T Consensus 145 P~~~~v~~~itS~DViHSf~vP~lg~K~DavPGr~~~~~~~~~~~G~~~g~CsE~CG~~Hs~M~~~v~v~~~~~f~~W~~ 224 (227) T MTH00098 145 PMEMPIRMLISSEDVLHSWAVPSLGLKTDAIPGRLNQTTLTSTRPGLYYGQCSEICGSNHSFMPIVLELVPLKYFEKWSA 224 (227) T ss_pred EEEEEEEEEEECCCCHHEEECCCCCEEEECCCCCEEEEEEECCCCEEEEEEEHHHHCCCCCCCEEEEEEECHHHHHHHHH T ss_conf 52208999965596132375442045640469821689998189888999808881904128837999988999999999 Q ss_pred HH Q ss_conf 98 Q gi|254780474|r 251 KV 252 (332) Q Consensus 251 ~~ 252 (332) .. T Consensus 225 ~~ 226 (227) T MTH00098 225 SM 226 (227) T ss_pred HH T ss_conf 51 No 18 >MTH00008 COX2 cytochrome c oxidase subunit II; Validated Probab=100.00 E-value=0 Score=408.03 Aligned_cols=201 Identities=19% Similarity=0.328 Sum_probs=178.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) .....+|+++++++...++....+++++|++.|..++++..++.+.++ ...+|++||++||++|++|+++|++ T Consensus 6 ql~fqd~~S~~m~~l~~fHd~vm~ii~~I~~~V~~~l~~~~~~~~~~~-------~~~~~~~LE~~WTiiP~liL~~la~ 78 (228) T MTH00008 6 QLSFQDAASPVMLQLISFHDHALLVLTLVTTVVGYALTSLMFNSYSNR-------YILEAQQIETIWTILPALILLFLAF 78 (228) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-------CCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 379978689899999999979999999999999999999997256645-------4458701258899999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEE Q ss_conf 99988775134442000012446788888668999998436999962678---------------------774306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~l 170 (332) ++++..+.+|- ..+..++|+|+|+||+|+|+|||.+ ..++|+|++ T Consensus 79 pS~~~ly~~~~--------------~~~~~~tikv~G~QWyW~YeY~d~~~~~~dsym~~~~~~~~g~~rll~vdn~lvl 144 (228) T MTH00008 79 PSLHLLYLMDE--------------ISNPSITVKTIGHQWYWSYEYTDFSKLEFDSYMLPDSDLSPGDFRLLEVDHRAVA 144 (228) T ss_pred HHHHHHHCCCC--------------CCCCCCEEEEEEEEEEEEEECCCCCCCCCHHHHCCCCCCCCCCCEEECCCCEEEE T ss_conf 99999864654--------------6888614887788534899813676642235317744347887434413551776 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |+|++|+|.+||+|||||||||+||+|+||+|||.|++|+++++||+|+|||+|+||.+||.|+++|+|+++++|++|++ T Consensus 145 P~~~~ir~lvtS~DVIHSw~vPslgiK~DavPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~Mpi~V~~v~~~~f~~W~~ 224 (228) T MTH00008 145 PMNTEIRMLVTAADVLHSWTIPSLGVKVDAVPGRLNQIGFTADRPGVFYGQCSEICGANHSFMPIALEIIDTKSFIKWVL 224 (228) T ss_pred EEEEEEEEEECCCCCCEEEECCCCCCEEECCCCCEEEEEEEECCCEEEEEEEHHHCCCCCCCCCEEEEEECHHHHHHHHH T ss_conf 55212666301565412032441262320358822899998179755899828880978156957999989999999998 Q ss_pred HHH Q ss_conf 986 Q gi|254780474|r 251 KVK 253 (332) Q Consensus 251 ~~k 253 (332) +++ T Consensus 225 ~~~ 227 (228) T MTH00008 225 SNS 227 (228) T ss_pred HCC T ss_conf 656 No 19 >MTH00027 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=396.08 Aligned_cols=205 Identities=20% Similarity=0.305 Sum_probs=175.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) ....-+|++|++++...++.....++.+|++.|+.++++..++.+.. .+..+..||+.||++||++|++|++++++ T Consensus 21 Ql~fQd~aSpime~i~~fHd~~m~ii~~I~~~v~~~l~~~~~~~~~~----~~~~~~~~n~~lEiiWTiiP~lILi~ia~ 96 (249) T MTH00027 21 QLGFQDAGSPVMEEIIMLHDQILFILTIIVGVVLWLIIRILLGNNYY----SYYWNKLDGSLIEVIWTLIPAFILILIAF 96 (249) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----CCCCCCCCCCEEHHHHHHHHHHHHHHHHH T ss_conf 34898979989999999986999999999999999999999851444----44344557705048999999999999999 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC-----------------------CCCEEE Q ss_conf 999887751344420000124467888886689999984369999626787-----------------------743068 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV-----------------------AIINEL 168 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi-----------------------atvneL 168 (332) ++.+..+.+|-. ..+..++|+|+|+||+|.|+|||.|. .+.|+| T Consensus 97 PS~~lLy~~de~-------------~~~p~~tiKv~G~QWyWsYeY~d~~~~~~~fdSymi~~~~~~~g~~rLL~vDn~l 163 (249) T MTH00027 97 PSLRLLYIMDEC-------------GFSANITIKVTGHQWYWSYSYEDYGEKNIEFDSYMIPTADLEFGDLRLLEVDNRL 163 (249) T ss_pred HHHHHHHHHCCC-------------CCCCCEEEEEEEEEEEEEEECCCCCCCCCEEEECCCCCCCCCCCCCEECCCCCCE T ss_conf 999999983214-------------7998779999998767889867677765314532257021357874010577627 Q ss_pred EEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHH Q ss_conf 96369828999983581011022010740311698514899994277489666133265761136589997088999999 Q gi|254780474|r 169 VVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDW 248 (332) Q Consensus 169 ~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~W 248 (332) ++|+|++|+|.+||+|||||||||+||+|+||+|||.|++|+.++++|+|+|||+|+||.+||.|++.|+++++++|++| T Consensus 164 VlP~~~~Ir~lvTS~DViHS~~vP~lg~K~DavPGr~n~~~~~~~~~G~y~GqCsElCG~~Hs~Mpi~v~~v~~~~f~~W 243 (249) T MTH00027 164 ILPVDTNVRVLITAADVLHSWTVPSLAVKMDAVPGRINETGFLIKRPGIFYGQCSEICGANHSFMPIVVESVSLSKYIDW 243 (249) T ss_pred EEEECCEEEEEEECCCCCEEECCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEEHHHCCCCCCCCCEEEEEECHHHHHHH T ss_conf 88515549999571422032015644732125798307899981797779997088818781568479999899999999 Q ss_pred HHHHH Q ss_conf 99986 Q gi|254780474|r 249 ISKVK 253 (332) Q Consensus 249 v~~~k 253 (332) +++++ T Consensus 244 l~~q~ 248 (249) T MTH00027 244 IGRED 248 (249) T ss_pred HHHCC T ss_conf 82168 No 20 >MTH00080 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=392.92 Aligned_cols=201 Identities=21% Similarity=0.301 Sum_probs=161.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHHHH Q ss_conf 26898989999999999999999999999999999998853046777777875345661255302278999999999999 Q gi|254780474|r 35 VMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAVITW 114 (332) Q Consensus 35 ~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~~t~ 114 (332) +-+|.+..+.....++.+...++++++ +++++++...++|.+++. ......||+.||++||++|++|++++++++. T Consensus 8 l~~~~s~~~~~~~~~h~f~~~~i~~i~--ii~~~v~~~i~~~~~~~~--~~~~~~~n~~LEiiWTiiP~lILi~la~pS~ 83 (231) T MTH00080 8 LNFSNSLFSSYMDWFHNFNCSVLFGEF--VLAFVVFIFVYLISSNFY--FKSKKIHYQFLELLWTVFPVLILLMLGVPSL 83 (231) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHCCC--CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 778887766899999999999999999--999999999999983689--8662537864279999999999999999999 Q ss_pred HHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCC---------------------CCCCEEEEEECC Q ss_conf 88775134442000012446788888668999998436999962678---------------------774306896369 Q gi|254780474|r 115 DATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQ---------------------VAIINELVVPID 173 (332) Q Consensus 115 ~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~g---------------------iatvneL~lPv~ 173 (332) +..+.+|.. ..+.+++|+|+|+||+|.|+|||.+ ..++|+|++|+| T Consensus 84 ~~ly~~~~~-------------~~~~~ltIkv~G~QWyW~yeY~d~~~~~fdSym~~~~~~~~g~~rlL~vdn~lvlP~~ 150 (231) T MTH00080 84 KLLYYYGLM-------------NFDSNLTVKVTGHQWYWSYEFSDIPGLEFDSYMKSLDQLRLGEPRLLEVDNRCVLPCD 150 (231) T ss_pred HHHHHHCCC-------------CCCCCEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEEC T ss_conf 999983552-------------6899858999999633899957998831224425443124688604623462366542 Q ss_pred CEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHH Q ss_conf 8289999835810110220107403116985148999942774896661332657611365899970889999999998 Q gi|254780474|r 174 RPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKV 252 (332) Q Consensus 174 ~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ 252 (332) +||+|.+||+|||||||||+||+|+||+|||.|++|++++++|+|+|||||+||.+||.|+++|+|+++++|++|++.. T Consensus 151 ~~vr~~itS~DViHSf~vP~lg~K~DavPGr~n~~~~~~~~~G~y~gqCsE~CG~~Hs~M~i~v~~~~~~~f~~Wlk~~ 229 (231) T MTH00080 151 TNIRFCITSSDVIHSWALPSLSIKMDAMPGRLNTLSYSFPMPGVFYGQCSEICGANHSFMPIALEVTSLDNFKEWCKGL 229 (231) T ss_pred CEEEEEEEECCCCEEEECCCCCCEEECCCCCCEEEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEECHHHHHHHHHHH T ss_conf 4379888607532014213117330047983178999838988999985775081615793599998899999999863 No 21 >MTH00047 COX2 cytochrome c oxidase subunit II; Provisional Probab=100.00 E-value=0 Score=374.92 Aligned_cols=169 Identities=20% Similarity=0.363 Sum_probs=136.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCC Q ss_conf 99999999999999998853046777777875345661255302278999999999999887751344420000124467 Q gi|254780474|r 56 LMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSI 135 (332) Q Consensus 56 l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~ 135 (332) +++++++.+++ ++...|++|++++++.+. .||++||++||+||++|+++|++++++..+... T Consensus 15 i~l~v~v~~~v-~~~~~~~~~~~~~~~~~~---~~n~~LEi~WTiiP~iil~~l~~~s~~~l~~~~-------------- 76 (193) T MTH00047 15 LALCVFIPCWV-YVVLCWQVYVGNGSVNLG---SENQVLEFLWTVVPTLIVLVLCFLNLNFITYDL-------------- 76 (193) T ss_pred HHHHHHHHHHH-HHHHHHHHCCCCCCCCCC---CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHCC-------------- T ss_conf 99999999999-999999980476656647---888177402799999999999999999998354-------------- Q ss_pred CCCCCCEEEEEEEEEEEEEEEECCCC---------CCCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCE Q ss_conf 88888668999998436999962678---------774306896369828999983581011022010740311698514 Q gi|254780474|r 136 AKNSKPLVVEVVALDWKWLFLLPEQQ---------VAIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMET 206 (332) Q Consensus 136 ~~~~~~l~I~V~a~qW~W~F~YP~~g---------iatvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t 206 (332) ...+..+|+|+|+||+|+|+|||.. ..+.|+|++|+|+||+|.+||+|||||||||+||+|+||+|||.| T Consensus 77 -~~~~~~~ikV~G~QW~W~~~Ypd~~~ds~~~~~~~~vdn~l~lP~~~~v~~~ltS~DViHSf~VP~l~~K~DavPGr~n 155 (193) T MTH00047 77 -DCFSSETIKVVGHQWYWSYEYSNGSYDSFMTDDIFGVDKPLRLVYGVPYHLLVTSSDVIHSFSVPDLNLKMDAIPGRIN 155 (193) T ss_pred -CCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCEEEECCCEEEEEEEECCCCCCEECCCCCCEEECCCCCEE T ss_conf -6798705999999899999968976445655422157774897169879999996874322651322727625898359 Q ss_pred EEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHH Q ss_conf 8999942774896661332657611365899970889 Q gi|254780474|r 207 KLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEK 243 (332) Q Consensus 207 ~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~ 243 (332) ++|++|+++|+|+||||||||.+||.|+|.|+|+|++ T Consensus 156 ~~~~~~~~~G~y~g~CsElCG~~Hs~M~~~Ve~v~~~ 192 (193) T MTH00047 156 HLFFCPDRHGVFVGYCSELCGVGHSYMPIVIEVVKNS 192 (193) T ss_pred EEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEECCC T ss_conf 9999889967899987788371606792799997279 No 22 >KOG4767 consensus Probab=100.00 E-value=9.4e-38 Score=288.24 Aligned_cols=200 Identities=20% Similarity=0.332 Sum_probs=166.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) +...-|.+.|+.+....+.-..+....+|.+.|.-+++-..|.|-. | ....|...+|++||++|++|++.+++ T Consensus 9 ~l~~Qdsasp~me~~~~~hd~~l~~L~~I~~lv~y~l~~~~~~~~~-n------~~i~hgq~iE~iwTI~PaiIll~ial 81 (231) T KOG4767 9 NLGFQDSASPIMEGIISFHDHLLFYLVFITVLVGYLLFGTFYTKSK-N------KYIIHGQTIELIWTIFPAIILLLIAL 81 (231) T ss_pred HHCCCCCCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHC-C------EEEECCEEEHHHHHHHHHHHHHHHHH T ss_conf 7416555875787666724027999999999999999999998640-2------02322543013699889999999984 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCC---------------------CCCEEEEE Q ss_conf 999887751344420000124467888886689999984369999626787---------------------74306896 Q gi|254780474|r 112 ITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQV---------------------AIINELVV 170 (332) Q Consensus 112 ~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~gi---------------------atvneL~l 170 (332) ++.+..+.+|-. ....++|+++|+||+|.|+|-|.+. .+.|.+++ T Consensus 82 PS~~LLY~~dev--------------~~ps~TiKaiGhQWywtYEysD~~~l~FdSYMip~e~Le~g~~RlLevDnr~Vl 147 (231) T KOG4767 82 PSLRLLYLLDEV--------------NLPSLTIKAIGHQWYWTYEYSDFNGLEFDSYMIPEEDLELGQLRLLEVDNRVVL 147 (231) T ss_pred HHHHHHHHHHHC--------------CCCCEEEEECCCEEEEEEEECCCCCEEEEECCCCHHHCCCCCCEEEEECCEEEE T ss_conf 788999986220--------------477447653140589998704577603652118878828798247752463899 Q ss_pred ECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHH Q ss_conf 36982899998358101102201074031169851489999427748966613326576113658999708899999999 Q gi|254780474|r 171 PIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWIS 250 (332) Q Consensus 171 Pv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~ 250 (332) |.++++||.+||.||||||-||.||+|.||+||+-|+..+.++++|+|+|||+|.||++||.|+..+.+++-++|..|+. T Consensus 148 P~dt~IR~~vTsaDViHswAvpslgvK~Da~pGrLnq~s~~i~r~GvfYGqCSEiCGanHsfmpIv~E~v~l~~f~~w~~ 227 (231) T KOG4767 148 PIDTHIRFIVTSADVIHSWAVPSLGVKCDAIPGRLNQVSFSIQREGVFYGQCSEICGANHSFMPIVLEAVPLKDFLSWLS 227 (231) T ss_pred ECCCCEEEEEECCCCEECCCCCCCCCEECCCCCEEEEEEEEECCCCEEEEHHHHHHCCCCCCCCEEEEEEEHHHHHHHHH T ss_conf 64785699997064100002465550422567600006897316835752346764878666737999867899987665 Q ss_pred HH Q ss_conf 98 Q gi|254780474|r 251 KV 252 (332) Q Consensus 251 ~~ 252 (332) +. T Consensus 228 ~~ 229 (231) T KOG4767 228 ET 229 (231) T ss_pred HC T ss_conf 31 No 23 >pfam00116 COX2 Cytochrome C oxidase subunit II, periplasmic domain. Probab=100.00 E-value=8.2e-37 Score=281.54 Aligned_cols=99 Identities=21% Similarity=0.462 Sum_probs=94.4 Q ss_pred EEEEEEEEEEEEEEEECCCCC---------------------CCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECC Q ss_conf 689999984369999626787---------------------74306896369828999983581011022010740311 Q gi|254780474|r 142 LVVEVVALDWKWLFLLPEQQV---------------------AIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYA 200 (332) Q Consensus 142 l~I~V~a~qW~W~F~YP~~gi---------------------atvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~da 200 (332) ++|+|+|+||+|.|+||+.+. .+.|+|++|+|+||+|.|||.|||||||||+||+|+|| T Consensus 1 vtIkV~G~QW~W~y~y~~~~~~~~~s~~~~~~d~~~g~~~ll~v~n~lvlP~~~~v~~~ltS~DViHsF~iP~l~~K~Da 80 (120) T pfam00116 1 VTIKAIGHQWYWSYEYSDFKNIEFDSYMIPEDDLEEGQFRLLEVDNRIVLPMGTEVRFLVTSADVIHSFAVPSLGVKMDA 80 (120) T ss_pred CEEEEEEEECCEEEEECCCCCCEEEECCCCHHHCCCCCCCEEECCCEEEEECCCEEEEEEEECCCCEEEECCCCCEEEEC T ss_conf 98999999024799968987841100205721235798633501670899767809799998974540123667758773 Q ss_pred CCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 6985148999942774896661332657611365899970 Q gi|254780474|r 201 MAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 201 iPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) +||+.|++|++++++|+|+|+||||||.|||.|+|+|+|+ T Consensus 81 iPG~~~~~~~~~~~~G~y~g~C~e~CG~~H~~M~~~v~vV 120 (120) T pfam00116 81 VPGRLNQLTFTINRPGVFYGQCSEICGANHSFMPIVIEVV 120 (120) T ss_pred CCCCEEEEEEECCCCEEEEEECHHHCCCCCCCCCEEEEEC T ss_conf 7983699999548963699982879284606890899989 No 24 >PTZ00047 cytochrome C oxidase subunit II; Provisional Probab=100.00 E-value=1.2e-33 Score=258.95 Aligned_cols=91 Identities=21% Similarity=0.354 Sum_probs=88.2 Q ss_pred CCCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECC Q ss_conf 77430689636982899998358101102201074031169851489999427748966613326576113658999708 Q gi|254780474|r 162 VAIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKS 241 (332) Q Consensus 162 iatvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s 241 (332) ..+.|+||+|+|+||+|.+||+|||||||||+||+|+||+||+.|++|+.++++|+|+||||||||.+||.|+|.|+|++ T Consensus 70 LeVDNrLvlPvg~pVr~liTS~DVIHSF~VPslgiK~DaIPGR~n~~~~~~~k~G~Y~GqCaElCG~~Hs~Mpi~Vevvs 149 (163) T PTZ00047 70 LEVDKRLTLPTRTHIRFLVTATDVIHSWSIPSLGIKADAVPGRLHRINTFILREGVYYGQCSEMCGTLHGFMPIVVEAVS 149 (163) T ss_pred ECCCCCEEEECCCEEEEEEEECCCHHHCCCCCCCEEEEECCCCEEEEEEEECCCCEEEEEEHHHHCCCCCCCCEEEEEEC T ss_conf 52348617028981799998341001100565443655169832799999488957999828871958265837999978 Q ss_pred HHHHHHHHHHH Q ss_conf 89999999998 Q gi|254780474|r 242 EKGFEDWISKV 252 (332) Q Consensus 242 ~~~f~~Wv~~~ 252 (332) +|+|+||+++- T Consensus 150 pE~Y~A~~kk~ 160 (163) T PTZ00047 150 PEKYAAHAKKY 160 (163) T ss_pred HHHHHHHHHHH T ss_conf 99999999987 No 25 >pfam06481 COX_ARM COX Aromatic Rich Motif. COX2 (Cytochrome O ubiquinol OXidase 2) is a major component of the respiratory complex during vegetative growth. It transfers electrons from a quinol to the binuclear centre of the catalytic subunit 1. The function of this region is not known. Probab=98.97 E-value=2e-10 Score=94.02 Aligned_cols=47 Identities=43% Similarity=0.699 Sum_probs=44.6 Q ss_pred HCCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHCCCCCCCC Q ss_conf 60724569899999761021263217204783542687315244488 Q gi|254780474|r 253 KRQGTFLNRQEYLLLKKPSERDSVRYFSPIEKGLYYSILNLCVHPGK 299 (332) Q Consensus 253 ka~~~~l~~~~y~~l~~ps~~~~~~~f~~~~~~lf~~i~~~~~~~~~ 299 (332) |+++++||+++|++|++||+++||+|||+|+|+||+.|+++|+.+++ T Consensus 1 kas~~~L~~~~Y~~La~PSe~~pV~yfssv~~~Lf~~Iv~~~~~~~~ 47 (47) T pfam06481 1 KASPKTLDRATYLALAKPSENNPVEYFSSVDPGLFDDIVNKYMGGGK 47 (47) T ss_pred CCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCHHHHHHHHHHCCCC T ss_conf 98867489999999857764567335525883189999999864899 No 26 >pfam02790 COX2_TM Cytochrome C oxidase subunit II, transmembrane domain. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices. Probab=98.54 E-value=4.4e-07 Score=70.18 Aligned_cols=77 Identities=12% Similarity=0.193 Sum_probs=64.6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHH Q ss_conf 64326898989999999999999999999999999999998853046777777875345661255302278999999999 Q gi|254780474|r 32 EFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAV 111 (332) Q Consensus 32 ~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~ 111 (332) +....|+++|++++...++..+..++..|.+.|..+.++...+.+.+ ....||+.+|++||++|++|++++|+ T Consensus 6 ql~fqd~~Sp~me~i~~fHd~~m~i~~~I~~~V~~~l~~~~~~~~~~-------~~~~~~~~lE~iWTiiP~lILl~ia~ 78 (83) T pfam02790 6 GLNFQDSASPLMEGLIEFHDHTMMILTLILILVSYILVSLLFNKLTN-------RYLLEGQTIEIIWTILPAIILIFIAL 78 (83) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-------CCCCCCCEEEHHHHHHHHHHHHHHHH T ss_conf 26998979999999999988999999999999999999999847475-------55507866521399789999999982 Q ss_pred HHHH Q ss_conf 9998 Q gi|254780474|r 112 ITWD 115 (332) Q Consensus 112 ~t~~ 115 (332) ++.+ T Consensus 79 PSl~ 82 (83) T pfam02790 79 PSLR 82 (83) T ss_pred HHCC T ss_conf 2325 No 27 >PRK02888 nitrous-oxide reductase; Validated Probab=98.41 E-value=5.5e-07 Score=69.49 Aligned_cols=77 Identities=14% Similarity=0.205 Sum_probs=70.9 Q ss_pred CEEEEEECCCEEEEEEEE----CCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 306896369828999983----5810110220107403116985148999942774896661332657611365899970 Q gi|254780474|r 165 INELVVPIDRPLEFRITA----SSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS----~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) .++..+-.|-.|.+.||- .|+.|.|-||.-++.+-..||.+..+.|+||+||+|.-.|++||-+=|-.|+.+..|. T Consensus 557 ~~~f~Vk~GDeVtv~lTNle~~~D~tHGFai~~~nin~~v~P~eT~svtF~AdkpGVy~yYCt~FC~ALH~EM~g~m~Ve 636 (637) T PRK02888 557 LREFTVKQGDEVTVIVTNLDKVEDLTHGFAIPNYGVNMEVAPQATASVTFTADKPGVYWYYCTWFCHALHMEMRGRMLVE 636 (637) T ss_pred CCEEEECCCCEEEEEEECCCCCCCCCCCEEECCCCCCEEECCCCEEEEEEECCCCCEEEEHHHHHHHHHHHHHHEEEEEC T ss_conf 86179826999999994566465566502523776136988997078999879995767555456668888752257726 Q ss_pred C Q ss_conf 8 Q gi|254780474|r 241 S 241 (332) Q Consensus 241 s 241 (332) + T Consensus 637 p 637 (637) T PRK02888 637 P 637 (637) T ss_pred C T ss_conf 9 No 28 >TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins. Probab=97.28 E-value=0.00084 Score=46.88 Aligned_cols=58 Identities=21% Similarity=0.295 Sum_probs=55.5 Q ss_pred EEEEEECCCEEEEEEEECC-CCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECC Q ss_conf 0689636982899998358-101102201074031169851489999427748966613 Q gi|254780474|r 166 NELVVPIDRPLEFRITASS-VMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSA 223 (332) Q Consensus 166 neL~lPv~~pV~f~ltS~D-ViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~ca 223 (332) +.|++-.|.+|++.+.+.| +-|.|-|-++|.|-..-+|.+....|+|||+|.|+..|- T Consensus 61 etivVkKGd~VKitienkspis~GFsId~YGI~evik~GetktvsF~ADKaGtf~I~Cq 119 (135) T TIGR03096 61 EALVVKKGTPVKVTVENKSPISEGFSIDAYGISEVIKAGETKTISFKADKAGAFTIWCQ 119 (135) T ss_pred CCEEEECCCEEEEEEEECCCCCCCEEEECCCEEEEECCCCEEEEEEEECCCCEEEEEEE T ss_conf 51898059879999971698887717721450025638973899998258852787765 No 29 >COG4263 NosZ Nitrous oxide reductase [Energy production and conversion] Probab=96.28 E-value=0.007 Score=40.30 Aligned_cols=74 Identities=12% Similarity=0.100 Sum_probs=63.2 Q ss_pred EEEEECCCEEEEEEEE----CCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 6896369828999983----5810110220107403116985148999942774896661332657611365899970 Q gi|254780474|r 167 ELVVPIDRPLEFRITA----SSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 167 eL~lPv~~pV~f~ltS----~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) |..+-.+-.|...+|- +|++|.|-+|+-+.-+-.-|-+....||.++++|+..-.|+.||-+=|..|..+..|. T Consensus 559 ef~Vkq~DEVt~l~tnld~Ved~thgfv~p~~~v~~~v~pq~tasvtf~a~kpgv~w~ycs~fchalh~em~~rmlve 636 (637) T COG4263 559 EFKVKQGDEVTVLTTNLDEVEDLTHGFVIPNYGVNMEVKPQRTASVTFYADKPGVAWYYCSWFCHALHMEMAGRMLVE 636 (637) T ss_pred EEEEECCCEEEEEECCCCEECCCCCEEEECCCCEEEEECCCCCEEEEEECCCCEEEEHHHHHHHHHHHHHHCCCEEEC T ss_conf 999943868899851641110342215631674679974688547999816870210326667888877533432204 No 30 >PRK10263 DNA translocase FtsK; Provisional Probab=93.15 E-value=0.77 Score=25.81 Aligned_cols=50 Identities=16% Similarity=0.262 Sum_probs=36.9 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 26898989999999999999999999999999999998853046777777 Q gi|254780474|r 35 VMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARY 84 (332) Q Consensus 35 ~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~ 84 (332) +-+-.|.++.=..|.++++|+++...+=+++++..|++||+|.....-+| T Consensus 60 IhNlGG~~GAWlADTLFf~FGvLAY~iP~im~~~cW~~fR~r~~~~~iDy 109 (1355) T PRK10263 60 IHNLGGMPGAWLADTLFFIFGVMAYTIPVIIVGGCWFAWRHQSSDEYIDY 109 (1355) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 00157635189998999998999988999999999999986156766008 No 31 >TIGR01477 RIFIN variant surface antigen, rifin family; InterPro: IPR006373 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the rifin family of variant surface antigens, encoded by the rif (short for "repetetive interspersed family") genes. This is the largest variant surface antigen family in P. falciparum, with roughly 150 genes present in the genome of this parasite . The function of these proteins is not yet known, but rifins are strongly immunogenic and high levels of anti-rifin antibodies are correlated with rapid parasite clearance and asymptomatic infections . . Probab=92.87 E-value=0.13 Score=31.22 Aligned_cols=26 Identities=27% Similarity=0.652 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999999988530467 Q gi|254780474|r 53 AVALMLLIVVPVFFSILFFAWKYRSTN 79 (332) Q Consensus 53 ~~~l~liV~v~V~~l~~~~~~kyR~~~ 79 (332) +.+++.+|+|.|++ ++|-.+||||++ T Consensus 356 aSiiAIliIVLIMv-IIYLILRYRRKK 381 (395) T TIGR01477 356 ASIIAILIIVLIMV-IIYLILRYRRKK 381 (395) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHH T ss_conf 98999999999999-999987543112 No 32 >PTZ00042 stevor; Provisional Probab=92.27 E-value=0.51 Score=27.10 Aligned_cols=52 Identities=17% Similarity=0.278 Sum_probs=28.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 1364326898989999999999999999999999999999998853046777 Q gi|254780474|r 30 GFEFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKK 81 (332) Q Consensus 30 GC~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~ 81 (332) ||....-|-+|+++..-...|...-..+++.++.+.+|++...|-||+|+.. T Consensus 242 ~~~~~~sD~ag~aa~~A~~~F~P~GIaalVLlilaVvLIILYIWLyrRRKnS 293 (304) T PTZ00042 242 GCAAGASDLAGTAATAAMAAFYPCGIAALVLLILAVVLIILYIWLYRRRKNS 293 (304) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 6355110000078888886411629999999999999999999999841242 No 33 >pfam02009 Rifin_STEVOR Rifin/stevor family. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens. Probab=91.35 E-value=0.5 Score=27.17 Aligned_cols=14 Identities=29% Similarity=0.700 Sum_probs=10.7 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 99999885304677 Q gi|254780474|r 67 SILFFAWKYRSTNK 80 (332) Q Consensus 67 l~~~~~~kyR~~~~ 80 (332) +++|..+||||+++ T Consensus 263 vIIYLILRYRRKKK 276 (289) T pfam02009 263 LIIYLILRYRRKKK 276 (289) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999998732125 No 34 >PTZ00046 rifin; Provisional Probab=90.25 E-value=0.39 Score=27.92 Aligned_cols=13 Identities=31% Similarity=0.807 Sum_probs=10.3 Q ss_pred HHHHHHHHCCCCC Q ss_conf 9999885304677 Q gi|254780474|r 68 ILFFAWKYRSTNK 80 (332) Q Consensus 68 ~~~~~~kyR~~~~ 80 (332) ++|..+||||+++ T Consensus 323 IIYLILRYRRKKK 335 (348) T PTZ00046 323 IIYLILRYRRKKK 335 (348) T ss_pred HHHHHHHHHHHHH T ss_conf 9999998732014 No 35 >TIGR03095 rusti_cyanin rusticyanin. Rusticyanin is a blue copper protein, described in an obligate acidophilic chemolithoautroph, Acidithiobacillus ferrooxidans, as an electron transfer protein. It can constitute up to 5 percent of protein in cells grown on Fe(II) and is thought to be part of an electron chain for Fe(II) oxidation, with two c-type cytochromes, an aa3-type cytochrome oxidase, and 02 as terminal electron acceptor. It is rather closely related to sulfocyanin (TIGR03094). Probab=89.91 E-value=1.6 Score=23.47 Aligned_cols=75 Identities=20% Similarity=0.247 Sum_probs=56.3 Q ss_pred CCEEEEEECCCEEEEEEEE--CCCCHHCCCHHHCCEECCCCCCC---------------------EEEEEEECCCEEEEE Q ss_conf 4306896369828999983--58101102201074031169851---------------------489999427748966 Q gi|254780474|r 164 IINELVVPIDRPLEFRITA--SSVMNSFYIPGLAGQIYAMAGME---------------------TKLHAVMNKKGAYSG 220 (332) Q Consensus 164 tvneL~lPv~~pV~f~ltS--~DViHSF~IP~l~~k~daiPG~~---------------------t~l~~~a~~~G~y~G 220 (332) +-..|++|.|..|.+..-- .|--|||-|-.-|---..|||+. .......+.+|+|.- T Consensus 50 ~NPTLvip~GatV~itvIN~d~~~~Hnf~it~~GPPY~ymp~~~~~~~v~~~p~lP~~~sg~~~y~~ft~~f~~~GtYwY 129 (148) T TIGR03095 50 KNPTIVIPEGVTVHFTVINTDTDSGHNFDISKRGPPYPYMPGMDGLGFVAGTGFLPPPKSGKFGYTDFTYHFSTAGTYWY 129 (148) T ss_pred CCCCEEECCCCEEEEEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEEEEEECCCCEEEEE T ss_conf 38728954898899999979898622487864899986563345556176457346756785047989996289800899 Q ss_pred ECCCCCCCCCCCCEEEEEE Q ss_conf 6133265761136589997 Q gi|254780474|r 221 FSANYSGRGFSHMRFKFYG 239 (332) Q Consensus 221 ~cae~cG~ghs~M~f~v~v 239 (332) .|. +-|..-..|-.+++| T Consensus 130 lC~-yPGHAe~GMYGkiIV 147 (148) T TIGR03095 130 LCT-YPGHAENGMYGKIVV 147 (148) T ss_pred EEC-CCCCHHCCCEEEEEE T ss_conf 953-688221384367986 No 36 >pfam07731 Cu-oxidase_2 Multicopper oxidase. This entry contains many divergent copper oxidase-like domains that are not recognized by the pfam00394 model. Probab=85.37 E-value=3 Score=21.62 Aligned_cols=77 Identities=13% Similarity=0.074 Sum_probs=53.3 Q ss_pred CCCEEEEEECCCEEEEEEEECCCC--------HHCCCHHHCCEE-------------------CCCCCCCEEEEEEECCC Q ss_conf 743068963698289999835810--------110220107403-------------------11698514899994277 Q gi|254780474|r 163 AIINELVVPIDRPLEFRITASSVM--------NSFYIPGLAGQI-------------------YAMAGMETKLHAVMNKK 215 (332) Q Consensus 163 atvneL~lPv~~pV~f~ltS~DVi--------HSF~IP~l~~k~-------------------daiPG~~t~l~~~a~~~ 215 (332) .+.+.+.++.|+.|++.+...+-+ |+|+|-.-++.. ..-||-...+.|++|.+ T Consensus 30 ~~~~~~~v~~g~~v~~~~~N~~~~~Hp~HlHG~~F~vl~~~~~~~~~~~~~~~~~~p~~rDtv~v~~g~~~~i~~~adnp 109 (136) T pfam07731 30 GTTEVITLPNGDRVEIVLQNNTMGPHPFHLHGHSFQVLGRGGGPWDPDRATYNLVDPVRRDTVQVPPGGWVAIRFKADNP 109 (136) T ss_pred CCCCEEEECCCCEEEEEEECCCCCCCCEEECCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCC T ss_conf 99986993699999999988998731168638729999616986786433445468873313775999589999992898 Q ss_pred EEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 4896661332657611365899970 Q gi|254780474|r 216 GAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 216 G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) |.|.--|-..- ---+.|-..+.+. T Consensus 110 G~w~~HCHi~~-H~~~Gm~~~~~v~ 133 (136) T pfam07731 110 GPWLFHCHILW-HLDQGMMGQFVVD 133 (136) T ss_pred EEEEEECCCHH-HHHCCCEEEEEEC T ss_conf 57999378568-9768992999986 No 37 >TIGR02376 Cu_nitrite_red nitrite reductase, copper-containing; InterPro: IPR012746 This family consists of copper-type nitrite reductase. It reduces nitrite to nitric oxide, the first step in denitrification.. Probab=81.47 E-value=1.3 Score=24.10 Aligned_cols=80 Identities=16% Similarity=0.111 Sum_probs=62.1 Q ss_pred EEEEECCCEEEEEEEE---CCCCHHCCCHH----HCCE--ECCCCCCCEEEEEEECCCEEEEEECCCCC--CCCC--CCC Q ss_conf 6896369828999983---58101102201----0740--31169851489999427748966613326--5761--136 Q gi|254780474|r 167 ELVVPIDRPLEFRITA---SSVMNSFYIPG----LAGQ--IYAMAGMETKLHAVMNKKGAYSGFSANYS--GRGF--SHM 233 (332) Q Consensus 167 eL~lPv~~pV~f~ltS---~DViHSF~IP~----l~~k--~daiPG~~t~l~~~a~~~G~y~G~cae~c--G~gh--s~M 233 (332) -+++=.|=-|+++|.- +++-|+-=.=+ +||- --..||.+..|+|+|+++|.|---||-=. =.-| |.| T Consensus 60 l~~V~~gD~Vel~l~N~~~nt~pHNvDfHAa~G~~GGa~~t~~~PG~~~~lrFKA~~pG~fvYHCA~~~~m~~~Hv~sGM 139 (331) T TIGR02376 60 LIVVREGDYVELTLVNPETNTMPHNVDFHAATGALGGAALTMVNPGETATLRFKATRPGAFVYHCAVPGPMVPMHVSSGM 139 (331) T ss_pred EEEEEECCEEEEEEEECCCCCCCCCCCEEEECCCCCCEEEEEECCCCEEEEEEECCCCCCEEEEECCCCCCCCCEECCCC T ss_conf 78984198899998838878625552025512888860124664996047888706888379960697580050000245 Q ss_pred EEEEEECCHHHHH Q ss_conf 5899970889999 Q gi|254780474|r 234 RFKFYGKSEKGFE 246 (332) Q Consensus 234 ~f~v~v~s~~~f~ 246 (332) ...+-|.++|... T Consensus 140 ~G~ilV~P~eGL~ 152 (331) T TIGR02376 140 FGLILVLPREGLP 152 (331) T ss_pred CCEEEEECCCCCC T ss_conf 5348982788897 No 38 >pfam00127 Copper-bind Copper binding proteins, plastocyanin/azurin family. Probab=80.91 E-value=4.4 Score=20.42 Aligned_cols=68 Identities=15% Similarity=0.130 Sum_probs=40.3 Q ss_pred CEEEEEECCCEEEEEEEECCCCHHCCC-----HH----HCCE-----ECCCCCCCEEEEEEECCCEEEEEECCCCCCCCC Q ss_conf 306896369828999983581011022-----01----0740-----311698514899994277489666133265761 Q gi|254780474|r 165 INELVVPIDRPLEFRITASSVMNSFYI-----PG----LAGQ-----IYAMAGMETKLHAVMNKKGAYSGFSANYSGRGF 230 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS~DViHSF~I-----P~----l~~k-----~daiPG~~t~l~~~a~~~G~y~G~cae~cG~gh 230 (332) -.+|.+++|..|.|.-. ++.-|+.-. |+ +.+. .-..||-. ...+.+++|+|.-+|+ . | T Consensus 16 P~~ltI~~GdtV~w~n~-~~~~Hnvv~~~~~~~~g~~~~~~~~~~~~~~~~~g~~--~s~tF~~~G~Y~Y~C~----p-H 87 (99) T pfam00127 16 PKEITVKKGEKVTFVNN-AGMPHNVVFDKDGVPAGVDADKVKMGDHTKLIGGGET--YSVTFDLAGTYGFFCT----P-H 87 (99) T ss_pred CCEEEECCCCEEEEEEC-CCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCE--EEEEECCCCEEEEEEC----C-C T ss_conf 78799889999999989-9997218988888876542334566655332379857--9999656854999847----6-8 Q ss_pred C--CCEEEEEEC Q ss_conf 1--365899970 Q gi|254780474|r 231 S--HMRFKFYGK 240 (332) Q Consensus 231 s--~M~f~v~v~ 240 (332) . .|..+++|. T Consensus 88 ~~~gM~G~I~Ve 99 (99) T pfam00127 88 QGAGMVGKVTVE 99 (99) T ss_pred CCCCCEEEEEEC T ss_conf 888767999989 No 39 >TIGR03388 ascorbase L-ascorbate oxidase, plant type. Members of this protein family are the copper-containing enzyme L-ascorbate oxidase (EC 1.10.3.3), also called ascorbase. This family is found in flowering plants, and shows greater sequence similarity to a family of laccases (EC 1.10.3.2) from plants than to other known ascorbate oxidases. Probab=80.76 E-value=3.6 Score=21.08 Aligned_cols=27 Identities=7% Similarity=0.250 Sum_probs=13.8 Q ss_pred CEEEEEECCCEEEEEEEECCC-CHHCCC Q ss_conf 306896369828999983581-011022 Q gi|254780474|r 165 INELVVPIDRPLEFRITASSV-MNSFYI 191 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS~DV-iHSF~I 191 (332) +++|.|.+|+...+.++.+.- --++|| T Consensus 250 vd~i~i~~GqRydvlv~~~~~~~~~y~i 277 (541) T TIGR03388 250 VKDIDIYSGETYSVLLTTDQDPSRNYWI 277 (541) T ss_pred ECEEEECCCCCEEEEEECCCCCCCCEEE T ss_conf 5659864888089999768997765699 No 40 >PRK06231 F0F1 ATP synthase subunit B; Validated Probab=78.94 E-value=5.1 Score=19.99 Aligned_cols=52 Identities=15% Similarity=0.193 Sum_probs=26.4 Q ss_pred HHHHHHHHHCCCCCC--CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999874136432--689898999999999999999999999999999999885 Q gi|254780474|r 21 LFFSSVMLSGFEFIV--MDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWK 74 (332) Q Consensus 21 l~~~~~lLsGC~~~~--l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~k 74 (332) .+...++++.|+..+ +.|+.-+..---++ +.|+..++-|+.+++++..++|| T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~iin~lfPn~--~~Fia~lIaF~ILl~lL~k~awK 71 (201) T PRK06231 18 FLIISLFLVSCTENVEELKSKSIINELFPNF--WVFIAHLIAFSILLLLGIFLFWK 71 (201) T ss_pred HHHHHHHHHHHCCCCHHCCHHHHHHHHCCCH--HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999988069802106898999977886--99999999999999999999787 No 41 >PRK10654 dcuC C4-dicarboxylate transporter DcuC; Provisional Probab=78.69 E-value=3 Score=21.58 Aligned_cols=84 Identities=12% Similarity=0.063 Sum_probs=35.5 Q ss_pred CCCCCCCCCCCHH---HHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCCCCCCCCCEEEEEEE Q ss_conf 1364326898989---999999-----9999999999999999999999988530467777--77875345661255302 Q gi|254780474|r 30 GFEFIVMDPYGDI---ALQQAS-----LIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKA--RYDPKWCHSTLLELFVW 99 (332) Q Consensus 30 GC~~~~l~P~G~i---a~~~~~-----L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a--~~~p~~~hs~~lEi~wt 99 (332) +|....+.|.+.. +.+..+ .+........+..+.++....||.-||.++++.. +.....+...+....+- T Consensus 165 tt~~~~~~P~s~~~i~aA~~~g~~v~~y~~~~ql~~~i~~~i~~ai~h~f~q~y~Dkk~~~~~e~~~~~~~~~~~p~~Ya 244 (452) T PRK10654 165 SPAAIILSPTSGDVVLAAQASEMPLIDFAFKTTLPISIAAIIGMAIAHFFWQRYLDKKENISHEMLDVSEITTTAPAFYA 244 (452) T ss_pred HHHHCCCCCCCCCCHHHHHHCCCCHHHEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCHHH T ss_conf 87432568974210224877299877802111189999999999999999999820213641434210002355655307 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 27899999999999 Q gi|254780474|r 100 LVPLVIVGFLAVIT 113 (332) Q Consensus 100 ~iP~iIi~~L~~~t 113 (332) ..|.+-++.+..++ T Consensus 245 llP~lPlilll~fs 258 (452) T PRK10654 245 ILPFTPIIGVLIFD 258 (452) T ss_pred HHHHHHHHHHHHHC T ss_conf 86789999999961 No 42 >TIGR01478 STEVOR variant surface antigen, stevor family; InterPro: IPR006374 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the stevor (short for "subtelomeric variable open reading frame") family of predicted variant surface antigens. This is the third largest variant surface antigen family in P. falciparum, with 33 genes identified in the genome of this parasite . The function of these proteins is not known, but they are expressed during several stages of the parasite life-cycle and may play a variety of roles in parasite survival and infection .. Probab=76.86 E-value=5.8 Score=19.59 Aligned_cols=35 Identities=17% Similarity=0.361 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999999999999999885304677777 Q gi|254780474|r 49 LIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKAR 83 (332) Q Consensus 49 L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~ 83 (332) .|..-=.++++.++.+.+|++.-+|-||||++.-. T Consensus 272 tf~PyGIAAlVLi~L~V~LIiLYIWLYrRRK~SWK 306 (315) T TIGR01478 272 TFLPYGIAALVLIILTVVLIILYIWLYRRRKKSWK 306 (315) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 86135899999999999999998888564030356 No 43 >PRK02463 OxaA-like protein precursor; Provisional Probab=76.12 E-value=6 Score=19.45 Aligned_cols=31 Identities=19% Similarity=0.068 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCC-C-CCCCCCCHH Q ss_conf 566889999999999874136-4-326898989 Q gi|254780474|r 12 VGKSLKFFFLFFSSVMLSGFE-F-IVMDPYGDI 42 (332) Q Consensus 12 ~~k~l~~~~l~~~~~lLsGC~-~-~~l~P~G~i 42 (332) .+|.+...++++..++||||. . ..-+|.|-+ T Consensus 5 ~k~~~~~~~~~~~~l~LsgC~~~~~~~~~~G~~ 37 (307) T PRK02463 5 LKRILFSGLALSMLLTLTGCVGRDKHGNPTGMI 37 (307) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHH T ss_conf 999999999999999996346899999977408 No 44 >PRK02710 plastocyanin; Provisional Probab=75.73 E-value=6.2 Score=19.38 Aligned_cols=68 Identities=12% Similarity=0.131 Sum_probs=42.1 Q ss_pred CEEEEEECCCEEEEEEEECCCCHHCCC---HHHCCEECCC-CCCCEEEEEEECCCEEEEEECCCCCCCCC--CCCEEEEE Q ss_conf 306896369828999983581011022---0107403116-98514899994277489666133265761--13658999 Q gi|254780474|r 165 INELVVPIDRPLEFRITASSVMNSFYI---PGLAGQIYAM-AGMETKLHAVMNKKGAYSGFSANYSGRGF--SHMRFKFY 238 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS~DViHSF~I---P~l~~k~dai-PG~~t~l~~~a~~~G~y~G~cae~cG~gh--s~M~f~v~ 238 (332) -.+|.+.+|..|+|.=+ +..-|+-.. +.+...-... ||. ....+.+++|+|.-+|. -| ..|...++ T Consensus 46 P~~ltI~~GDTV~W~n~-d~~~Hnv~~~~~~~~~~~~~~~~~g~--~~~~TF~~~G~Y~Y~C~-----PH~~~gMvG~I~ 117 (119) T PRK02710 46 PSTLTIKAGDTVKWVNN-KLAPHNAVFDGADELSHKDLAFAPGE--SWEETFSEAGTYTYYCE-----PHRGAGMVGKIT 117 (119) T ss_pred CCEEEECCCCEEEEEEC-CCCCCEEEECCCCCCCCCCCCCCCCC--EEEEECCCCEEEEEECC-----CCCCCCCEEEEE T ss_conf 89889899999999869-99996378679875574211247896--79997789857999826-----588999869999 Q ss_pred EC Q ss_conf 70 Q gi|254780474|r 239 GK 240 (332) Q Consensus 239 v~ 240 (332) |. T Consensus 118 V~ 119 (119) T PRK02710 118 VE 119 (119) T ss_pred EC T ss_conf 89 No 45 >COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane] Probab=67.73 E-value=6.2 Score=19.34 Aligned_cols=25 Identities=24% Similarity=0.436 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 3566889999999999874136432 Q gi|254780474|r 11 SVGKSLKFFFLFFSSVMLSGFEFIV 35 (332) Q Consensus 11 ~~~k~l~~~~l~~~~~lLsGC~~~~ 35 (332) .+.|+++++++.+++++|+||.... T Consensus 2 ~~~~~~~~~l~~~As~LL~aC~~~~ 26 (206) T COG3017 2 PMMKRLLFLLLALASLLLTACTLTA 26 (206) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 5277787899999999997565667 No 46 >pfam06716 DUF1201 Protein of unknown function (DUF1201). This family consists of several Sugar beet yellow virus (SBYV) putative membrane-binding proteins of around 54 residues in length. The function of this family is unknown. Probab=67.62 E-value=9.2 Score=18.14 Aligned_cols=47 Identities=17% Similarity=0.377 Sum_probs=25.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCE Q ss_conf 999999999999999999999999988530-46777777875345661 Q gi|254780474|r 47 ASLIRIAVALMLLIVVPVFFSILFFAWKYR-STNKKARYDPKWCHSTL 93 (332) Q Consensus 47 ~~L~~~~~~l~liV~v~V~~l~~~~~~kyR-~~~~~a~~~p~~~hs~~ 93 (332) +...+.+++.....+..-++..+||++|-- -+|+.+.-...+.||+. T Consensus 6 rsylllafgfliclflfclvvfiwfvykqilfrntapsnearfnhstv 53 (54) T pfam06716 6 RSYLLLAFGFLICLFLFCLVVFIWFVYKQILFRNTAPSNEARFNHSTV 53 (54) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCC T ss_conf 999999999999999999999999999999981679962211466643 No 47 >COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=66.86 E-value=4 Score=20.73 Aligned_cols=47 Identities=19% Similarity=0.243 Sum_probs=41.3 Q ss_pred CEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEE Q ss_conf 30689636982899998358101102201074031169851489999 Q gi|254780474|r 165 INELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAV 211 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~ 211 (332) ++.+..++|.++++++.++||+-+.-.|+=-.-....||..+.+.-. T Consensus 260 v~~~~a~~g~~~R~~I~a~DVslal~~P~~~SirNiLp~~v~~i~~~ 306 (352) T COG4148 260 VPKLDAPVGARLRIRIQARDVSLALQKPEQTSIRNILPGKVVGIEDD 306 (352) T ss_pred ECCCCCCCCCCEEEEEECCCEEEEECCCCCCCHHHCCCEEEEEEECC T ss_conf 50467788983799998141378854865221332154069999648 No 48 >TIGR03389 laccase laccase, plant. Members of this protein family include the copper-containing enzyme laccase (EC 1.10.3.2), often several from a single plant species, and additional, uncharacterized, closely related plant proteins termed laccase-like multicopper oxidases. This protein family shows considerable sequence similarity to the L-ascorbate oxidase (EC 1.10.3.3) family. Laccases are enzymes of rather broad specificity, and classification of all proteins scoring about the trusted cutoff of this model as laccases may be appropriate. Probab=64.90 E-value=10 Score=17.78 Aligned_cols=28 Identities=11% Similarity=0.271 Sum_probs=17.6 Q ss_pred CEEEEEECCCEEEEEEEECCCCHHCCCH Q ss_conf 3068963698289999835810110220 Q gi|254780474|r 165 INELVVPIDRPLEFRITASSVMNSFYIP 192 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS~DViHSF~IP 192 (332) +++|.+..|++..+.++.+.-.-++|+- T Consensus 234 v~~l~i~~GqRydvlv~~~~~~~~y~i~ 261 (539) T TIGR03389 234 TKTIVIGPGQTTNVLLTADQSPGRYFMA 261 (539) T ss_pred ECEEEECCCCEEEEEEECCCCCCCEEEE T ss_conf 3649963888799999779988725799 No 49 >COG4858 Uncharacterized membrane-bound protein conserved in bacteria [Function unknown] Probab=64.51 E-value=11 Score=17.73 Aligned_cols=29 Identities=17% Similarity=0.415 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999999988530467 Q gi|254780474|r 51 RIAVALMLLIVVPVFFSILFFAWKYRSTN 79 (332) Q Consensus 51 ~~~~~l~liV~v~V~~l~~~~~~kyR~~~ 79 (332) +++..+..+|.-+++-++-|+++|+|+.| T Consensus 127 lItlll~a~vgGfamy~my~y~yr~~ad~ 155 (226) T COG4858 127 LITLLLTAVVGGFAMYIMYYYAYRMRADN 155 (226) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 79999999866699999999998762434 No 50 >TIGR02657 amicyanin amicyanin; InterPro: IPR013475 Amicyanin is a type I blue copper protein that accepts electrons from the tryptophan tryptophylquinone (TTQ) cofactor of the methylamine dehydrogenase light chain and then transfers them to the heme group of cytochrome c-551i. Amicyanin, methylamine dehydrogenase, and cytochrome c-551i are located in the periplasm and form a complex. This system has been studied primarily in Paracoccus denitrificans and Methylobacterium extorquens. Related type I blue copper proteins include plastocyanin, pseudoazurin, halocyanin, etc.; GO: 0005507 copper ion binding, 0009055 electron carrier activity, 0006118 electron transport. Probab=62.29 E-value=4.8 Score=20.18 Aligned_cols=70 Identities=16% Similarity=0.147 Sum_probs=44.0 Q ss_pred EEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCC---CEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 068963698289999835810110220107403116985---148999942774896661332657611365899970 Q gi|254780474|r 166 NELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGM---ETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 166 neL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~---~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) .||++-+|.-|.-.=|-.=.=|--|+---++-.|.--=| .....++.+++|+|+-.|+ =|..|+.+|+|. T Consensus 11 PE~~~K~G~~VTW~N~EAlPHNVHF~~G~~~E~~~~GPMl~~~Q~YS~~F~~~GTYDY~CT-----PHPFM~GKVVVE 83 (83) T TIGR02657 11 PELRVKVGDAVTWINREALPHNVHFKAGVLVEKDLKGPMLKKEQAYSLKFTEAGTYDYICT-----PHPFMKGKVVVE 83 (83) T ss_pred CEEEEEECCEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCEEC-----CCCCCCCEEEEC T ss_conf 8258862890333004678745035407763111458724556504787479986153017-----998655515749 No 51 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=62.14 E-value=5.1 Score=19.96 Aligned_cols=19 Identities=32% Similarity=0.581 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 6889999999999874136 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~ 32 (332) |.+..+++++++++|+||- T Consensus 1 ~~~~~~~~~~~~~lL~GCw 19 (400) T TIGR02887 1 KKLKILLLILALLLLTGCW 19 (400) T ss_pred CHHHHHHHHHHHHHHCCCC T ss_conf 9466789999999832665 No 52 >COG4633 Plastocyanin domain containing protein [General function prediction only] Probab=62.04 E-value=12 Score=17.43 Aligned_cols=72 Identities=15% Similarity=0.163 Sum_probs=57.6 Q ss_pred CCEEEEEECCCEEEEEEEECCC---CHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 4306896369828999983581---0110220107403116985148999942774896661332657611365899970 Q gi|254780474|r 164 IINELVVPIDRPLEFRITASSV---MNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 164 tvneL~lPv~~pV~f~ltS~DV---iHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) +-..+++-.|+|+++...-+|. .-+--.|.|++--+.=-+..+...++++.+|+|. ..||. ..|+....+. T Consensus 95 ~p~~IvV~~~v~~rl~f~Rkdpspcle~i~~pdfgiaanlpl~q~ssIe~T~~s~ge~a----f~cgm--nm~~G~~~ve 168 (272) T COG4633 95 IPSRIVVVDGVPVRLTFKRKDPSPCLESIMSPDFGIAANLPLNQVSSIEFTPISKGEYA----FLCGM--NMFRGNIQVE 168 (272) T ss_pred CCEEEEEECCCCEEEEECCCCCCCCHHHCCCCCCCCCCCCCCCCEEEEEECCCCCCCHH----HHCCH--HHCCCEEEEE T ss_conf 63269996385037643157998660102155555524687674368883356664102----11124--4416705787 Q ss_pred C Q ss_conf 8 Q gi|254780474|r 241 S 241 (332) Q Consensus 241 s 241 (332) + T Consensus 169 t 169 (272) T COG4633 169 T 169 (272) T ss_pred E T ss_conf 3 No 53 >pfam11346 DUF3149 Protein of unknown function (DUF3149). This bacterial family of proteins has no known function. Probab=61.81 E-value=10 Score=17.82 Aligned_cols=30 Identities=20% Similarity=0.291 Sum_probs=20.3 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999-999999999999999885304 Q gi|254780474|r 48 SLIRIAVAL-MLLIVVPVFFSILFFAWKYRS 77 (332) Q Consensus 48 ~L~~~~~~l-~liV~v~V~~l~~~~~~kyR~ 77 (332) +|+-...++ .++|+++++++..|+.+-|-+ T Consensus 5 ~LFg~~iGL~Sl~VI~~~igm~~f~~~~fi~ 35 (42) T pfam11346 5 DLFGNDIGLMSLLVIFFTIGMMAFFGRMFIY 35 (42) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8737606699999999999999999889997 No 54 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=61.80 E-value=7.3 Score=18.87 Aligned_cols=24 Identities=13% Similarity=0.218 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 235668899999999998741364 Q gi|254780474|r 10 VSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 10 ~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) |.+.|.++.++++++.++|++|.. T Consensus 1 M~~nk~~K~l~ia~~~l~LaACSS 24 (173) T PRK10802 1 MQLNKVLKGLMIALPVMAIAACSS 24 (173) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 960789999999999999985579 No 55 >pfam06572 DUF1131 Protein of unknown function (DUF1131). This family consists of several hypothetical bacterial proteins of unknown function. Probab=61.25 E-value=4.6 Score=20.26 Aligned_cols=25 Identities=32% Similarity=0.733 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHCCC------CCCCCCC Q ss_conf 6889999999999874136------4326898 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFE------FIVMDPY 39 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~------~~~l~P~ 39 (332) |.+++ .++.+.++|+||. ++.|+|- T Consensus 2 ~~~r~-~l~~~~llL~GCst~s~fsWSslsP~ 32 (192) T pfam06572 2 KSLRL-FLLALPLLLTGCSTLSNFSWSSLSPW 32 (192) T ss_pred CCHHH-HHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 43013-68776688606556688542226872 No 56 >PRK11023 hypothetical protein; Provisional Probab=59.52 E-value=6.4 Score=19.29 Aligned_cols=21 Identities=24% Similarity=0.308 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 668899999999998741364 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~ 33 (332) .|.++.+++++++++|+||-. T Consensus 1 mk~~~~l~~l~~~~~L~GC~~ 21 (191) T PRK11023 1 MKALSPLAVLISALLLQGCVA 21 (191) T ss_pred CCHHHHHHHHHHHHHHCCCHH T ss_conf 944889999999999726823 No 57 >smart00719 Plus3 Short conserved domain in transcriptional regulators. Plus3 domains occur in the Saccharomyces cerevisiae Rtf1p protein, which interacts with Spt6p, and in parsley CIP, which interacts with the bZIP protein CPRF1. Probab=59.30 E-value=11 Score=17.64 Aligned_cols=49 Identities=18% Similarity=0.276 Sum_probs=30.5 Q ss_pred CEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 4031169851489999427748966613326576113658999708899999999986 Q gi|254780474|r 196 GQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVK 253 (332) Q Consensus 196 ~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~k 253 (332) .|-|.+.|..|..+|....- +...+|.=...|+..| +++||+.|.+..| T Consensus 61 ~k~Y~~~~~~T~~~L~l~~g----~~~k~~~i~~vSn~~f-----te~E~~~w~~~~k 109 (109) T smart00719 61 DKPYELGGKTTNVLLEVLNG----DSEKVVQINFISNQDF-----TEEEFQRWKQAIK 109 (109) T ss_pred CCCEECCCEEEEEEEEEECC----CEEEEEEEEEEECCCC-----CHHHHHHHHHHHC T ss_conf 83146298773289999609----8056999888638989-----9999999999719 No 58 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=57.64 E-value=12 Score=17.23 Aligned_cols=24 Identities=8% Similarity=0.070 Sum_probs=18.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 235668899999999998741364 Q gi|254780474|r 10 VSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 10 ~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) |.++..+++.++++.+++|+||.. T Consensus 1 ~~~~t~~Rl~~~~~~~~ll~GCa~ 24 (230) T PRK12700 1 MMLKTVLRLPVCAALLALAAGCAM 24 (230) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 915889999999999999744468 No 59 >pfam12501 DUF3708 Phosphate ATP-binding cassette transporter. This domain family is found in bacteria, and is typically between 143 and 173 amino acids in length. The family is found in association with pfam00528. There is a single completely conserved residue P that may be functionally important. Probab=56.38 E-value=6.9 Score=19.02 Aligned_cols=49 Identities=12% Similarity=0.017 Sum_probs=27.7 Q ss_pred HHHHHHHHHHHHHHHCCCCC--CCC----CCCCCCCCCEEEEEEEHHHHHHHHHH Q ss_conf 99999999999885304677--777----78753456612553022789999999 Q gi|254780474|r 61 VVPVFFSILFFAWKYRSTNK--KAR----YDPKWCHSTLLELFVWLVPLVIVGFL 109 (332) Q Consensus 61 ~v~V~~l~~~~~~kyR~~~~--~a~----~~p~~~hs~~lEi~wt~iP~iIi~~L 109 (332) .+.++.+..|+.-|.|+.+- ... -..-.++-...-.+|+++|.++++++ T Consensus 4 ~ll~l~~~~y~~gr~RA~~l~~~~g~~~~LHSlP~YyG~~~alw~~lPal~ll~l 58 (127) T pfam12501 4 LLLLLSAIGFFLGRRRALALARGGGDSRALHSLPNYYGAYVALWAALPALLLLAL 58 (127) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999986369867766666888999999999999999999 No 60 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=55.14 E-value=9.9 Score=17.92 Aligned_cols=19 Identities=21% Similarity=0.410 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 6889999999999874136 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~ 32 (332) |+..++++++.+++|+||+ T Consensus 4 k~~~li~ll~~~~lL~gC~ 22 (385) T PRK09859 4 RRKLLIPLLFCGAMLTACD 22 (385) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 0678999999999995379 No 61 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=54.56 E-value=10 Score=17.90 Aligned_cols=20 Identities=15% Similarity=0.356 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 68899999999998741364 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~ 33 (332) |.|.-+++.++.++++||.+ T Consensus 2 r~l~~l~l~lavll~agCGF 21 (196) T PRK10796 2 RYLATLLLSLAVLVTAGCGW 21 (196) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 56999999999999747671 No 62 >PRK10718 hypothetical protein; Provisional Probab=53.97 E-value=4.6 Score=20.27 Aligned_cols=24 Identities=29% Similarity=0.628 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHCCC------CCCCCC Q ss_conf 6889999999999874136------432689 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFE------FIVMDP 38 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~------~~~l~P 38 (332) +.|++ +++.+.++|+||. ++.|+| T Consensus 2 ~~lr~-~l~~~~llLsGCst~s~~sWSsl~P 31 (191) T PRK10718 2 KSLRL-LLLALPLLLTGCSTMSSVNWSAANP 31 (191) T ss_pred CCCCC-HHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 85310-2312247640656668864223686 No 63 >PRK01622 OxaA-like protein precursor; Validated Probab=53.88 E-value=16 Score=16.50 Aligned_cols=22 Identities=14% Similarity=0.128 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5668899999999998741364 Q gi|254780474|r 12 VGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 12 ~~k~l~~~~l~~~~~lLsGC~~ 33 (332) ..|.+-..++++..++||||.. T Consensus 4 ~~~~~l~~~~~~~~~~lsgc~~ 25 (266) T PRK01622 4 SYRAVLISLSLLLVFVLSGCSN 25 (266) T ss_pred HHHHHHHHHHHHHHHHHHCCCC T ss_conf 8899999999999999954689 No 64 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=53.86 E-value=10 Score=17.88 Aligned_cols=20 Identities=25% Similarity=0.326 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 68899999999998741364 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~ 33 (332) |.+-++++++++++|+||.. T Consensus 2 ~~~~l~~~~~~Al~L~GC~~ 21 (215) T TIGR02722 2 KYKILIFVALLALLLSGCVS 21 (215) T ss_pred HHHHHHHHHHHHHHHHCCCC T ss_conf 37878999999999854688 No 65 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=52.69 E-value=13 Score=17.15 Aligned_cols=26 Identities=19% Similarity=0.138 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 20235668899999999998741364 Q gi|254780474|r 8 MGVSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 8 ~~~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) |.-+..|++.+..++.++++|+||.. T Consensus 1 ~~~~~~~r~~~~~~l~~~~lLagC~~ 26 (488) T PRK09915 1 MINRQLSRLLLCSILGSTTLISGCAL 26 (488) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 91157769999999999999752668 No 66 >COG1276 PcoD Putative copper export protein [Inorganic ion transport and metabolism] Probab=52.57 E-value=16 Score=16.36 Aligned_cols=24 Identities=29% Similarity=0.397 Sum_probs=11.5 Q ss_pred CCCCCCEEEEEEEHHHHHHHHHHHHHH Q ss_conf 534566125530227899999999999 Q gi|254780474|r 87 KWCHSTLLELFVWLVPLVIVGFLAVIT 113 (332) Q Consensus 87 ~~~hs~~lEi~wt~iP~iIi~~L~~~t 113 (332) .+..|.++|+ ++-++|+.+-+.++ T Consensus 259 ~l~rsi~iE~---~ig~~VL~l~a~l~ 282 (289) T COG1276 259 WLVRSLKLEW---VIGVVVLGVAAVLG 282 (289) T ss_pred HHHHHHHHHH---HHHHHHHHHHHHHC T ss_conf 9999999999---99999999999981 No 67 >PRK09804 putative cryptic C4-dicarboxylate transporter DcuD; Provisional Probab=52.05 E-value=17 Score=16.30 Aligned_cols=25 Identities=16% Similarity=0.085 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999999998853046777 Q gi|254780474|r 57 MLLIVVPVFFSILFFAWKYRSTNKK 81 (332) Q Consensus 57 ~liV~v~V~~l~~~~~~kyR~~~~~ 81 (332) ..+..+.+++...||.-||.+++++ T Consensus 200 ~~i~~ii~iai~h~f~qr~~Dkk~~ 224 (455) T PRK09804 200 VASCVIISVAISHFFVQRAFDKKDK 224 (455) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999999999875124 No 68 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=50.28 E-value=18 Score=16.11 Aligned_cols=19 Identities=26% Similarity=0.357 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8899999999998741364 Q gi|254780474|r 15 SLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 15 ~l~~~~l~~~~~lLsGC~~ 33 (332) +..+..+++++++|+||.. T Consensus 3 k~~~~~~l~~~l~lsgC~s 21 (160) T PRK09967 3 KHLVAPLIFTSLILTGCQS 21 (160) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 7899999999999984589 No 69 >pfam08292 RNA_pol_Rbc25 RNA polymerase III subunit Rpc25. Rpc25 is a strongly conserved subunit of RNA polymerase III and has homology to Rpa43 in RNA polymerase I, Rpb7 in RNA polymerase II and the archaeal RpoE subunit. Rpc25 is required for transcription initiation and is not essential for the elongating properties of RNA polymerase III. Probab=50.07 E-value=12 Score=17.24 Aligned_cols=30 Identities=23% Similarity=0.566 Sum_probs=23.3 Q ss_pred EEEEEEEEEECCCCCCCCEEEEEECCCEEEEEEEECC Q ss_conf 9843699996267877430689636982899998358 Q gi|254780474|r 148 ALDWKWLFLLPEQQVAIINELVVPIDRPLEFRITASS 184 (332) Q Consensus 148 a~qW~W~F~YP~~giatvneL~lPv~~pV~f~ltS~D 184 (332) -+-|.|++. + -++|.+-.|..|||++.+.. T Consensus 46 eq~WvW~~d--~-----~~~l~~d~~e~IRFRV~~~~ 75 (120) T pfam08292 46 EQAWVWEYD--E-----ETELYFDVGEEIRFRVESEI 75 (120) T ss_pred CCEEEECCC--C-----CCCCCCCCCCEEEEEEEEEE T ss_conf 467992489--8-----64010368996999998879 No 70 >KOG3297 consensus Probab=49.38 E-value=18 Score=16.02 Aligned_cols=34 Identities=18% Similarity=0.406 Sum_probs=25.8 Q ss_pred EEEEEEEEEECCCCCCCCEEEEEECCCEEEEEEEECC Q ss_conf 9843699996267877430689636982899998358 Q gi|254780474|r 148 ALDWKWLFLLPEQQVAIINELVVPIDRPLEFRITASS 184 (332) Q Consensus 148 a~qW~W~F~YP~~giatvneL~lPv~~pV~f~ltS~D 184 (332) .+-|-|++.-.| | .-.+|.+-+|.+|+|++++.+ T Consensus 124 e~vWVWey~~Ed-g--~~~~Ly~D~~e~IRFRV~~e~ 157 (202) T KOG3297 124 EQVWVWEYEQED-G--PGTKLYFDVGEEIRFRVEDES 157 (202) T ss_pred CEEEEEEECCCC-C--CCCEEEECCCCEEEEEEEEEC T ss_conf 479999851667-8--872269628985899985201 No 71 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=49.38 E-value=17 Score=16.24 Aligned_cols=39 Identities=21% Similarity=-0.050 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCC---CCC-CCCHHHHHH Q ss_conf 202356688999999999987413643---268-989899999 Q gi|254780474|r 8 MGVSVGKSLKFFFLFFSSVMLSGFEFI---VMD-PYGDIALQQ 46 (332) Q Consensus 8 ~~~~~~k~l~~~~l~~~~~lLsGC~~~---~l~-P~G~ia~~~ 46 (332) |...++|.+..++.+++.+.|+||... +.. -.|.|..++ T Consensus 1 m~~~MKK~~l~~~~~~~~l~LaaCss~~~~Vat~kg~~IT~~e 43 (298) T PRK04405 1 MKKKMKKWALAAASAGLLLSLAGCSSNNKTVATYKGGKITESQ 43 (298) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCHHH T ss_conf 9067899999999999999998717999769986398725999 No 72 >COG4259 Uncharacterized protein conserved in bacteria [Function unknown] Probab=48.72 E-value=14 Score=16.78 Aligned_cols=24 Identities=25% Similarity=0.216 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 235668899999999998741364 Q gi|254780474|r 10 VSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 10 ~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) |+..|.++.+.+++++++|+||.. T Consensus 1 M~~~ktlsr~al~~av~~LagC~~ 24 (121) T COG4259 1 MSKLKTLSRLALLLAVAALAGCGG 24 (121) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 951488999999999999988057 No 73 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=48.43 E-value=17 Score=16.29 Aligned_cols=20 Identities=10% Similarity=0.122 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 66889999999999874136 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~ 32 (332) +|+.-.+++++++++|+||. T Consensus 5 ~r~~~~l~~l~~~~~L~gC~ 24 (385) T PRK09578 5 RRRRLALAALVAAFALAGCG 24 (385) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 87999999999999981679 No 74 >pfam05454 DAG1 Dystroglycan (Dystrophin-associated glycoprotein 1). Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in human. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton. [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin rece Probab=48.42 E-value=11 Score=17.57 Aligned_cols=33 Identities=6% Similarity=0.184 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHCCCC Q ss_conf 99999999999999999999--9999988530467 Q gi|254780474|r 47 ASLIRIAVALMLLIVVPVFF--SILFFAWKYRSTN 79 (332) Q Consensus 47 ~~L~~~~~~l~liV~v~V~~--l~~~~~~kyR~~~ 79 (332) .+.++.+++.+.+|.+..++ +++.+.+|.|+++ T Consensus 142 ~d~yL~TviPavivaaiLliA~iia~icYrrrRkG 176 (290) T pfam05454 142 DDVYLHTVIPAVVVAAILLIAGIIAMICYRKKRKG 176 (290) T ss_pred CCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 21011006789999999999999999965021257 No 75 >TIGR03102 halo_cynanin halocyanin domain. Halocyanins are blue (type I) copper redox proteins found in halophilic archaea such as Natronobacterium pharaonis. This model represents a domain duplicated in some halocyanins, while appearing once in others. This domain includes the characteristic copper ligand residues. This family does not include plastocyanins, and does not include certain divergent paralogs of halocyanin. Probab=48.22 E-value=19 Score=15.90 Aligned_cols=94 Identities=11% Similarity=-0.029 Sum_probs=55.5 Q ss_pred CCCCCEEEEEEEEEEEEEEEECCCCCC-CCEEEEEECCCEEEEEEEECCCCHHCC-CHH-HCCEECCCCCCCEEEEEEEC Q ss_conf 888866899999843699996267877-430689636982899998358101102-201-07403116985148999942 Q gi|254780474|r 137 KNSKPLVVEVVALDWKWLFLLPEQQVA-IINELVVPIDRPLEFRITASSVMNSFY-IPG-LAGQIYAMAGMETKLHAVMN 213 (332) Q Consensus 137 ~~~~~l~I~V~a~qW~W~F~YP~~gia-tvneL~lPv~~pV~f~ltS~DViHSF~-IP~-l~~k~daiPG~~t~l~~~a~ 213 (332) ..++.++|+|-..- ++.+.+ .-..|.|..|+.|+|+-|.+.--|+-- .|. ....-+.....-.....+.+ T Consensus 19 t~~devtV~vg~~~-------~gg~~~F~P~~i~I~~GtTV~w~Wt~~g~~HNVv~~~~~~~~s~~~~~~~g~t~~~tF~ 91 (115) T TIGR03102 19 TGQDEVTVDVGAEA-------NGGGFAFDPPAIRVDPGTTVVWEWTGEGGGHNVVSDGDGDLDESERVSEEGTTYEHTFE 91 (115) T ss_pred CCCCEEEEEEECCC-------CCCCEEECCCEEEECCCCEEEEEECCCCCCCCEEECCCCCCCCCCCCCCCCCEEEEECC T ss_conf 79866899987154-------89867875997898999989999999998745587689876667512579967999918 Q ss_pred CCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 774896661332657611365899970 Q gi|254780474|r 214 KKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 214 ~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) ++|+|.-+|.---+ ..|...++|. T Consensus 92 ~~G~y~Y~C~PH~~---~gM~G~I~Ve 115 (115) T TIGR03102 92 EPGIYLYVCVPHEA---LGMKGAVVVE 115 (115) T ss_pred CCEEEEEEECCCCC---CCCEEEEEEC T ss_conf 98479998243867---8975999989 No 76 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=47.94 E-value=12 Score=17.28 Aligned_cols=20 Identities=25% Similarity=0.452 Sum_probs=12.3 Q ss_pred HHHHHHHHH-HHHHHHHCCCC Q ss_conf 688999999-99998741364 Q gi|254780474|r 14 KSLKFFFLF-FSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~-~~~~lLsGC~~ 33 (332) +++.+++++ ..+++|+||.. T Consensus 2 mr~~~L~~~~~~~f~LtGCsa 22 (174) T TIGR02747 2 MRLKVLLLLACVAFLLTGCSA 22 (174) T ss_pred CHHHHHHHHHHHHHHHHHHHH T ss_conf 067899999999872002577 No 77 >COG2980 RlpB Rare lipoprotein B [Cell envelope biogenesis, outer membrane] Probab=47.92 E-value=13 Score=17.09 Aligned_cols=19 Identities=26% Similarity=0.501 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8899999999998741364 Q gi|254780474|r 15 SLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 15 ~l~~~~l~~~~~lLsGC~~ 33 (332) +|..+++.+...+|+||.+ T Consensus 3 ~L~~~lL~~a~~~L~aCGF 21 (178) T COG2980 3 KLKTLLLLAAVLLLAACGF 21 (178) T ss_pred CHHHHHHHHHHHHHHHCCE T ss_conf 0589999999999733453 No 78 >pfam09851 DUF2078 Predicted membrane protein (DUF2078). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=47.84 E-value=19 Score=15.86 Aligned_cols=21 Identities=19% Similarity=0.395 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999998853 Q gi|254780474|r 55 ALMLLIVVPVFFSILFFAWKY 75 (332) Q Consensus 55 ~l~liV~v~V~~l~~~~~~ky 75 (332) ++|.++.+.++++++|.++|. T Consensus 10 ~~~~i~~~lli~~vv~l~~~~ 30 (73) T pfam09851 10 LLMLLFWLLLIAGVVYLLVRL 30 (73) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999 No 79 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=47.72 E-value=15 Score=16.64 Aligned_cols=21 Identities=19% Similarity=0.384 Sum_probs=12.0 Q ss_pred HHHHHH-HHHHHHHHHHHCCCC Q ss_conf 668899-999999998741364 Q gi|254780474|r 13 GKSLKF-FFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~-~~l~~~~~lLsGC~~ 33 (332) +|.+.+ +++++..+.|+|||- T Consensus 2 k~~~~~~~~l~~~~~~lagCNT 23 (42) T pfam08085 2 KKLIALLLALLLLALVLAGCNT 23 (42) T ss_pred HHHHHHHHHHHHHHHHHHCCHH T ss_conf 2369999999999999861222 No 80 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=46.92 E-value=14 Score=16.80 Aligned_cols=20 Identities=20% Similarity=0.286 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 68899999999998741364 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~ 33 (332) .++++.+.++++++|+||.. T Consensus 2 ~r~~~~~~~la~~lL~GC~~ 21 (230) T PRK12701 2 NRLNIAVSCLATALLFGCEA 21 (230) T ss_pred CHHHHHHHHHHHHHHHCCCC T ss_conf 25899999999999755668 No 81 >pfam07358 DUF1482 Protein of unknown function (DUF1482). This family consists of several Enterobacterial proteins of around 60 residues in length. The function of this family is unknown. Probab=46.42 E-value=5.5 Score=19.75 Aligned_cols=11 Identities=36% Similarity=0.779 Sum_probs=9.4 Q ss_pred CCCCHHCCCHH Q ss_conf 58101102201 Q gi|254780474|r 183 SSVMNSFYIPG 193 (332) Q Consensus 183 ~DViHSF~IP~ 193 (332) +||||+||+|+ T Consensus 46 ~~~i~~~e~PA 56 (57) T pfam07358 46 EKVIDNFEIPA 56 (57) T ss_pred HHHCCCCCCCC T ss_conf 46013673469 No 82 >TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family; InterPro: IPR003282 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia. The type III secretion system is of great interest as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis . However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself, type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. One of the outer membrane protein subunit families, termed "K" here for nomenclature purposes, aids in the structural assembly of the invasion complex . It is also described as a lipoprotein. Members of this family include the Salmonella PrgK and SsaJ genes, MxiJ from Shigella, YscJ from Yersinia, and from the plant enteropathogens NolT (Rhizobium) and HrcJ Erwinia). The flagellar M-ring protein FliF also shares a low level of similarity, presumably due to evolution of the type III secretion system from the flagellar biosynthetic pathway.; GO: 0009306 protein secretion. Probab=46.18 E-value=16 Score=16.45 Aligned_cols=22 Identities=14% Similarity=0.321 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHCCCCC--CCCC Q ss_conf 999999999987413643--2689 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEFI--VMDP 38 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~~--~l~P 38 (332) ..+++++..++|+||+.. ++.- T Consensus 3 ~~~~~~~l~llL~GC~~~~~Ly~g 26 (203) T TIGR02544 3 RKLLLLLLLLLLTGCKVDLELYSG 26 (203) T ss_pred HHHHHHHHHHHHHCCCCEEEEECC T ss_conf 588999999997178740565047 No 83 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=45.85 E-value=20 Score=15.71 Aligned_cols=19 Identities=16% Similarity=0.230 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8899999999998741364 Q gi|254780474|r 15 SLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 15 ~l~~~~l~~~~~lLsGC~~ 33 (332) .=+..++++++++|+||.. T Consensus 4 ~k~~~~il~~al~l~GCs~ 22 (200) T COG3417 4 MKIYASILLLALFLSGCSS 22 (200) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 7999999999999841136 No 84 >COG2268 Uncharacterized protein conserved in bacteria [Function unknown] Probab=45.54 E-value=13 Score=17.09 Aligned_cols=16 Identities=19% Similarity=-0.039 Sum_probs=8.0 Q ss_pred EEEEEEEEEEEEEEEECC Q ss_conf 689999984369999626 Q gi|254780474|r 142 LVVEVVALDWKWLFLLPE 159 (332) Q Consensus 142 l~I~V~a~qW~W~F~YP~ 159 (332) ...+|+-. .|.|.||- T Consensus 59 ~~~~vV~g--Gg~~v~Pi 74 (548) T COG2268 59 GGQKVVRG--GGAIVMPI 74 (548) T ss_pred CCCCEEEC--CCEEEECE T ss_conf 77537855--86488430 No 85 >PRK11627 hypothetical protein; Provisional Probab=45.37 E-value=17 Score=16.27 Aligned_cols=25 Identities=20% Similarity=0.374 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHCCCC----CCCCCCCHHH Q ss_conf 999999998741364----3268989899 Q gi|254780474|r 19 FFLFFSSVMLSGFEF----IVMDPYGDIA 43 (332) Q Consensus 19 ~~l~~~~~lLsGC~~----~~l~P~G~ia 43 (332) ++.+++.++|+||.. ..+.|.-+.. T Consensus 6 ~~~l~a~~lL~gCas~p~~l~l~P~~~~~ 34 (192) T PRK11627 6 LFPLVALFMLAGCATPPTTLEVSPKITLP 34 (192) T ss_pred HHHHHHHHHHHHHCCCCCEEEECCCCCCC T ss_conf 99999999998606997637867887664 No 86 >COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion] Probab=44.80 E-value=21 Score=15.54 Aligned_cols=64 Identities=17% Similarity=0.171 Sum_probs=34.8 Q ss_pred EEECCCCCCCCEEEEEECCCEEEEEEEECCCCHHCCCHHH-CCEECCCCCCCEEEEEEECCCEEEEEE Q ss_conf 9962678774306896369828999983581011022010-740311698514899994277489666 Q gi|254780474|r 155 FLLPEQQVAIINELVVPIDRPLEFRITASSVMNSFYIPGL-AGQIYAMAGMETKLHAVMNKKGAYSGF 221 (332) Q Consensus 155 F~YP~~giatvneL~lPv~~pV~f~ltS~DViHSF~IP~l-~~k~daiPG~~t~l~~~a~~~G~y~G~ 221 (332) ..||..+ ..++=..|.---| |...+.|+=.+-+||+. +.-.+++||-.-+---..-.++.|.-. T Consensus 131 I~lp~~~--~~g~~~~P~saSV-fIky~~~~nl~~~v~~IK~LV~nSv~gL~YenISVVl~~~~~~~~ 195 (246) T COG4669 131 ISLPEDD--DEGKNALPSSASV-FIKYSPDVNLSIYVSQIKRLVANSVPGLQYENISVVLVPASDSDP 195 (246) T ss_pred EECCCCC--CCCCCCCCCEEEE-EEEECCCCCHHHHHHHHHHHHHHCCCCCCHHCEEEEEEECCCCCC T ss_conf 9757778--6678888740689-999568898167689999999815677760006999820443444 No 87 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=44.47 E-value=16 Score=16.52 Aligned_cols=20 Identities=25% Similarity=0.281 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 68899999999998741364 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~ 33 (332) |++-+.+++.++++|+||.. T Consensus 2 Kk~~l~~~i~SAL~LaGCg~ 21 (269) T pfam12262 2 KKKFLALLLASALLLAGCGD 21 (269) T ss_pred CHHHHHHHHHHHHHHCCCCC T ss_conf 24799999999864111579 No 88 >PRK13562 acetolactate synthase 1 regulatory subunit; Provisional Probab=43.60 E-value=8 Score=18.60 Aligned_cols=37 Identities=22% Similarity=0.236 Sum_probs=29.0 Q ss_pred CCCCCCCEEEEEECCHHHHHHHHHHHHCCCCCCCHHH Q ss_conf 5761136589997088999999999860724569899 Q gi|254780474|r 227 GRGFSHMRFKFYGKSEKGFEDWISKVKRQGTFLNRQE 263 (332) Q Consensus 227 G~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~~l~~~~ 263 (332) -.|-|.|.|.|.+...+.+++-+++.++.=.-|+-++ T Consensus 40 ~~gIS~mt~vv~v~d~~~~eqlikqL~KqInVL~V~~ 76 (84) T PRK13562 40 QPGISNMEIQVDIQDDTSLHILIKKLKQQINVLTVEC 76 (84) T ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHHEEEEEEEE T ss_conf 9883426999964867899999999985455798999 No 89 >pfam11694 DUF3290 Protein of unknown function (DUF3290). This family of proteins with unknown function appears to be restricted to Firmicutes. Probab=43.36 E-value=22 Score=15.39 Aligned_cols=32 Identities=22% Similarity=0.485 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999999999999988530467777778 Q gi|254780474|r 54 VALMLLIVVPVFFSILFFAWKYRSTNKKARYD 85 (332) Q Consensus 54 ~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~ 85 (332) -++..++++.+++++++++++|-+.+.+..|. T Consensus 16 ~~~~~~i~~~~l~~li~~~~~Y~r~r~~tKYR 47 (149) T pfam11694 16 NWLRYILIIILLVVIIIFAFKYLRHKFDTKYR 47 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 89999999999999999999999852553278 No 90 >pfam06474 MLTD_N MLTD_N. Probab=43.28 E-value=14 Score=16.80 Aligned_cols=15 Identities=13% Similarity=0.233 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999998741364 Q gi|254780474|r 19 FFLFFSSVMLSGFEF 33 (332) Q Consensus 19 ~~l~~~~~lLsGC~~ 33 (332) ..++++++||+||.. T Consensus 4 kail~~alLLaGCQs 18 (93) T pfam06474 4 KAVLAAALLLVGCQS 18 (93) T ss_pred HHHHHHHHHHHHHCC T ss_conf 999999999998137 No 91 >TIGR00039 6PTHBS putative 6-pyruvoyl tetrahydropterin synthase; InterPro: IPR007116 The complex organic chemistry involved in the transformation of GTP to tetrahydrobiopterin is catalysed by only three enzymes: GTP cyclohydrolase I, 6-pyruvoyltetrahydropterin synthase and sepiapterin reductase. Tetrahydrobiopterin is the cofactor for several aromatic amino acid monooxygenases and the nitric oxide synthases. 6-Pyruvoyl tetrahydropterin synthase (PTPS) is a Zn-dependent metalloprotein, transforms dihydroneopterin triphosphate into 6-pyruvoyltetrahydropterin in the presence of Mg(II) and for which the crystal structure is known. The enzyme is a homohexameric, composed of a dimer of trimers. A transition metal binding site formed by the three histidine residues 23, 48 and 50 is present in each subunit, and bound Zn(II) is responsible for the enzymatic activity. Site-directed mutagenesis of each of these three histidine residues results in a complete loss of metal binding and enzymatic activity , . The function of the bacterial branch of the sequence lineage appears not to have been established. ; GO: 0003874 6-pyruvoyltetrahydropterin synthase activity, 0006729 tetrahydrobiopterin biosynthetic process. Probab=43.22 E-value=7.8 Score=18.64 Aligned_cols=51 Identities=12% Similarity=0.064 Sum_probs=28.2 Q ss_pred EEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 89666133265761136589997088999999999860724569899999761021 Q gi|254780474|r 217 AYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVKRQGTFLNRQEYLLLKKPSE 272 (332) Q Consensus 217 ~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~~l~~~~y~~l~~ps~ 272 (332) .--|.|.|+||-||+++- +|.+...+=| +..+.+|.-+|-...+++.++-. T Consensus 29 ~~FGKC~N~~~HGH~Y~v-~V~v~Gd~L~----~~~~~~GmV~DF~~LK~~v~~~~ 79 (153) T TIGR00039 29 EVFGKCGNSRLHGHSYKV-EVEVSGDDLY----ERDPKTGMVMDFSDLKKIVKEVI 79 (153) T ss_pred HCCCCCCCCCCCCEEEEE-EEEEECCCCC----CCCCCCCEEECHHHHHHHHHHHH T ss_conf 015552888335501289-9998523221----11687220663899999998873 No 92 >TIGR02163 napH_ ferredoxin-type protein, NapH/MauN family; InterPro: IPR011886 Most members of this family are the NapH protein, found next to NapG,in operons that encode the periplasmic nitrate reductase. Some species with this reductase lack NapC but accomplish electron transfer to NapAB in some other manner, likely to involve NapH, NapG, and/or some other protein. A few members of this protein are designated MauN and are found in methylamine utilization operons in species that appear to lack a periplasmic nitrate reductase.; GO: 0005506 iron ion binding, 0006118 electron transport. Probab=42.32 E-value=23 Score=15.28 Aligned_cols=46 Identities=20% Similarity=0.450 Sum_probs=25.4 Q ss_pred CCHHHHHHHHH-----------HHHHHHHHHHHHHH--HHHHCCC-CCCCCCCCHHHHHH Q ss_conf 91023442023-----------56688999999999--9874136-43268989899999 Q gi|254780474|r 1 MSDCLNWMGVS-----------VGKSLKFFFLFFSS--VMLSGFE-FIVMDPYGDIALQQ 46 (332) Q Consensus 1 ~~~~~~~~~~~-----------~~k~l~~~~l~~~~--~lLsGC~-~~~l~P~G~ia~~~ 46 (332) .+|+++|.+.+ +.|++|.+++.+.. .+++|-- +...||-|-+.+-+ T Consensus 91 vTD~A~wLr~KL~~~~~~~~r~l~r~~RY~~L~~~LllS~~~G~l~wE~~nPVg~l~Rgi 150 (263) T TIGR02163 91 VTDLAAWLRRKLGIKKSIKERLLPRNLRYWVLVLVLLLSFLSGLLIWEWVNPVGLLHRGI 150 (263) T ss_pred HHHHHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH T ss_conf 112589999861341235640388540499999999999997067776744277898999 No 93 >PRK13316 heme-degrading monooxygenase IsdG; Provisional Probab=41.99 E-value=24 Score=15.25 Aligned_cols=69 Identities=16% Similarity=0.228 Sum_probs=42.4 Q ss_pred CCEEEEEECCCEEEEEEEECCCCHHCCCHH----HCCEECCCCCCCE-EEEEEECCCEEEEEECCCCCCCCCCCCEEEEE Q ss_conf 430689636982899998358101102201----0740311698514-89999427748966613326576113658999 Q gi|254780474|r 164 IINELVVPIDRPLEFRITASSVMNSFYIPG----LAGQIYAMAGMET-KLHAVMNKKGAYSGFSANYSGRGFSHMRFKFY 238 (332) Q Consensus 164 tvneL~lPv~~pV~f~ltS~DViHSF~IP~----l~~k~daiPG~~t-~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~ 238 (332) ..|.|.+|.|. ...++.-|.=+. -..||..|||..- ++|.+..+.-. |-.|.+... T Consensus 4 VtN~i~V~KG~-------ae~l~~RFt~~~g~g~~~~kie~~~GF~~~ev~~~~~ed~d------------ydEv~V~T~ 64 (121) T PRK13316 4 VTNTIKVEKGA-------AEHVIRQFTGANGDGHPTKDIAEVEGFLGFELWHSKPEDKD------------YEEVVVTSK 64 (121) T ss_pred EEEEEEECCCH-------HHHHHHHHHCCCCCCCCCCCCEECCCCEEEEEEECCCCCCC------------CCEEEEEEE T ss_conf 97556605634-------88999876435777654444102678017999862688775------------665899989 Q ss_pred ECCHHHHHHHHHH Q ss_conf 7088999999999 Q gi|254780474|r 239 GKSEKGFEDWISK 251 (332) Q Consensus 239 v~s~~~f~~Wv~~ 251 (332) =.+.++|++|+.+ T Consensus 65 Weske~F~~W~~S 77 (121) T PRK13316 65 WESEEAQRNWVKS 77 (121) T ss_pred ECCHHHHHHHHHH T ss_conf 8888898877633 No 94 >TIGR02588 TIGR02588 conserved hypothetical protein TIGR02588; InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.. Probab=41.90 E-value=24 Score=15.24 Aligned_cols=18 Identities=6% Similarity=0.041 Sum_probs=10.7 Q ss_pred EEEEEEEECCCCCCCCEE Q ss_conf 436999962678774306 Q gi|254780474|r 150 DWKWLFLLPEQQVAIINE 167 (332) Q Consensus 150 qW~W~F~YP~~giatvne 167 (332) |+.=.|.=-+.+..|+.. T Consensus 50 Qy~v~Fai~N~~g~ta~~ 67 (122) T TIGR02588 50 QYYVPFAIENRGGKTAAA 67 (122) T ss_pred CEEEEEEEECCCCCEEEH T ss_conf 758788888178974105 No 95 >PRK13733 conjugal transfer protein TraV; Provisional Probab=41.78 E-value=2.1 Score=22.65 Aligned_cols=20 Identities=20% Similarity=0.438 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 68899999999998741364 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~ 33 (332) |++.+++.+.+.++|+||.+ T Consensus 2 k~~~~l~~l~~~llLtGCAG 21 (171) T PRK13733 2 KKISLLIPLLGTLLLSGCAG 21 (171) T ss_pred CEEEEEHHHCCEEEEECCCC T ss_conf 51675112026002602457 No 96 >PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional Probab=41.72 E-value=22 Score=15.49 Aligned_cols=74 Identities=22% Similarity=0.381 Sum_probs=43.5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHH Q ss_conf 43268989899999999999999999999999999999988530467777778753456612553022789999999999 Q gi|254780474|r 33 FIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAVI 112 (332) Q Consensus 33 ~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~~ 112 (332) .++-+...++|.+.+..+.-..---- -=--..-|+.=|| ++.--|.|....+| .+.|..|++++++.++. T Consensus 49 qsi~~S~s~iA~dmR~~Ir~~i~~G~----sd~eI~~~l~~rY---G~~Il~~Pp~~~~t---~lLW~~P~i~l~~g~~~ 118 (126) T PRK10144 49 QNLLESNAPVAVSMRHQVYSMVAEGK----SEVEIIGWMTERY---GDFVRYNPPLTGQT---LVLWALPVVLLLLMALI 118 (126) T ss_pred CCHHHCCHHHHHHHHHHHHHHHHCCC----CHHHHHHHHHHHC---CCEEEEECCCCCCH---HHHHHHHHHHHHHHHHH T ss_conf 88411577999999999999998699----9999999999864---88498618999561---89999999999999999 Q ss_pred HHHH Q ss_conf 9988 Q gi|254780474|r 113 TWDA 116 (332) Q Consensus 113 t~~~ 116 (332) .|+. T Consensus 119 ~~r~ 122 (126) T PRK10144 119 LWRV 122 (126) T ss_pred HHHH T ss_conf 9854 No 97 >COG5000 NtrY Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation [Signal transduction mechanisms] Probab=41.66 E-value=24 Score=15.21 Aligned_cols=72 Identities=24% Similarity=0.239 Sum_probs=33.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEE---EEEEHHHHHHHHHHHH Q ss_conf 68989899999999999999999999-9999999999885304677777787534566125---5302278999999999 Q gi|254780474|r 36 MDPYGDIALQQASLIRIAVALMLLIV-VPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLE---LFVWLVPLVIVGFLAV 111 (332) Q Consensus 36 l~P~G~ia~~~~~L~~~~~~l~liV~-v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lE---i~wt~iP~iIi~~L~~ 111 (332) .+|-++... .-++...+-++++++ +.+++--++-..+-|++++.+ .+. | +++- .++-++|.+|+.+.+. T Consensus 29 ~t~i~~~~~--~~~~~v~vnl~l~l~L~~vi~r~v~~l~~arr~Gka~---sRL-h-~Riv~lFslvav~Pavivaifs~ 101 (712) T COG5000 29 TTPISPAFD--YILALVAVNLALLLILSAVIGRKVIRLLKARRLGKAG---SRL-H-VRIVGLFSLVAVIPAVIVAIFSA 101 (712) T ss_pred CCCCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH---HHH-H-HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 787886106--6689999879999999999999999999988627257---899-9-99999999999988999999999 Q ss_pred HHH Q ss_conf 999 Q gi|254780474|r 112 ITW 114 (332) Q Consensus 112 ~t~ 114 (332) .+. T Consensus 102 ~~i 104 (712) T COG5000 102 QFI 104 (712) T ss_pred HHH T ss_conf 997 No 98 >COG3462 Predicted membrane protein [Function unknown] Probab=41.55 E-value=24 Score=15.20 Aligned_cols=48 Identities=8% Similarity=-0.012 Sum_probs=25.5 Q ss_pred CCCCCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 4326898989--99999999999999999999999999999885304677 Q gi|254780474|r 33 FIVMDPYGDI--ALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNK 80 (332) Q Consensus 33 ~~~l~P~G~i--a~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~ 80 (332) ....-|-|-- +---+..++++..+.+++.+.+.+++++.+-+.|+.+. T Consensus 31 gg~~y~~gy~gm~GG~yGm~lImpI~~~vvli~lvvfm~~~~g~~r~~~~ 80 (117) T COG3462 31 GGAFYPGGYRGMMGGLYGMWLIMPIFWAVVLIFLVVFMFYILGAVRRGSD 80 (117) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 67556897410014322349999999999999999999999987315420 No 99 >pfam00394 Cu-oxidase Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain. Probab=41.44 E-value=24 Score=15.19 Aligned_cols=74 Identities=18% Similarity=0.189 Sum_probs=49.0 Q ss_pred CCCEEEEEECCCEEEEEEEECCC--CHHCCCHHH----------------CCEECCCCCCCEEEEEEECC-CEEEEEECC Q ss_conf 74306896369828999983581--011022010----------------74031169851489999427-748966613 Q gi|254780474|r 163 AIINELVVPIDRPLEFRITASSV--MNSFYIPGL----------------AGQIYAMAGMETKLHAVMNK-KGAYSGFSA 223 (332) Q Consensus 163 atvneL~lPv~~pV~f~ltS~DV--iHSF~IP~l----------------~~k~daiPG~~t~l~~~a~~-~G~y~G~ca 223 (332) .....+.+..|+..+|++-+... .+.|.|..- --.+..-||.+-..-+++|+ +|.|..+|+ T Consensus 42 ~~~~~~~v~~gk~yrlR~IN~~~~~~~~~~idgH~~~vi~~DG~~v~p~~~~~i~i~~GqRydvlv~a~q~~~nY~i~a~ 121 (142) T pfam00394 42 ADLATLTVTPGKTYRLRIINVALDDSLNFSIEGHKMTVVEVDGVYVNPFTVDSLDIFPGQRYSVLVTANQDPGNYWIVAS 121 (142) T ss_pred CCCEEEEECCCCEEEEEEEEECCCCEEEEEECCCEEEEEEECCEECCCEEECEEEECCCCEEEEEEECCCCCCCEEEEEE T ss_conf 87479998789999999997166940599999977999996984025349686997678889999989999875899975 Q ss_pred CCCCCCCCCCEEE Q ss_conf 3265761136589 Q gi|254780474|r 224 NYSGRGFSHMRFK 236 (332) Q Consensus 224 e~cG~ghs~M~f~ 236 (332) ..++......... T Consensus 122 ~~~~~~~~~~~~~ 134 (142) T pfam00394 122 PRLIPAFDNGTAA 134 (142) T ss_pred CCCCCCCCCCEEE T ss_conf 3767766686078 No 100 >PRK12697 flgH flagellar basal body L-ring protein; Reviewed Probab=41.03 E-value=22 Score=15.52 Aligned_cols=24 Identities=8% Similarity=-0.013 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 235668899999999998741364 Q gi|254780474|r 10 VSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 10 ~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) +...|+-...++.+++++|+||.. T Consensus 3 ~~~~~~~~~~~~al~~l~LsGCa~ 26 (227) T PRK12697 3 SAAVRRPGAAACALAALALAGCAQ 26 (227) T ss_pred HHHHCCCCHHHHHHHHHHHHCCCC T ss_conf 055414038999999999726567 No 101 >pfam05767 Pox_A14 Poxvirus virion envelope protein A14. This family consists of several Poxvirus virion envelope protein A14 like sequences. A14 is a component of the virion membrane and has been found to be an H1 phosphatase substrate in vivo and in vitro. A14 is hyperphosphorylated on serine residues in the absence of H1 expression. Probab=40.82 E-value=24 Score=15.13 Aligned_cols=62 Identities=15% Similarity=0.097 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCC--CCCCCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHCC Q ss_conf 668899999999998741364--32689898999-99999999999999999999999999988-5304 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF--IVMDPYGDIAL-QQASLIRIAVALMLLIVVPVFFSILFFAW-KYRS 77 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~--~~l~P~G~ia~-~~~~L~~~~~~l~liV~v~V~~l~~~~~~-kyR~ 77 (332) ++..+..++....++...|-. .-+.-.+..+. .-+.|-.++|.+.. +..+++++|..| ||=+ T Consensus 8 ~ny~S~vli~GIiLL~~aCIfAfidfSK~~~~~~~~wRalSii~FIlgi---v~~lG~~ifs~ygr~C~ 73 (92) T pfam05767 8 KNYYSGVLIAGIILLILACIFAFIEFSKSTKPTDYTWRALSIICFILGI---VLLLGIVLFSMYGRYCA 73 (92) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHCC T ss_conf 8445368999999999999985101203799415689999999999999---99999999998731127 No 102 >PRK02944 OxaA-like protein precursor; Validated Probab=40.74 E-value=25 Score=15.12 Aligned_cols=31 Identities=16% Similarity=0.174 Sum_probs=17.9 Q ss_pred HHHHHHHHHH-HHHHHHHHCCCCC----CCCCCCHH Q ss_conf 5668899999-9999987413643----26898989 Q gi|254780474|r 12 VGKSLKFFFL-FFSSVMLSGFEFI----VMDPYGDI 42 (332) Q Consensus 12 ~~k~l~~~~l-~~~~~lLsGC~~~----~l~P~G~i 42 (332) ++|++.++++ ++.+++||||... .-++.|-. T Consensus 1 mkk~~~~~~~~~~~~~~lsgC~~~~~~~~~~~~g~~ 36 (255) T PRK02944 1 MKKKLGLLAMVVALMAILAGCSEVNQPITPKSTGIW 36 (255) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCH T ss_conf 928999999999999999624688999999999818 No 103 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=40.05 E-value=25 Score=15.05 Aligned_cols=158 Identities=20% Similarity=0.185 Sum_probs=67.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC----CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 35668899999999998741364----32689898999999999999999999999999999999885304677777787 Q gi|254780474|r 11 SVGKSLKFFFLFFSSVMLSGFEF----IVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDP 86 (332) Q Consensus 11 ~~~k~l~~~~l~~~~~lLsGC~~----~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p 86 (332) +....++.+.++++.++|+||.. ....|.-+.+.+--.+ |..++- ..- +..+.+ .+-. T Consensus 11 ~k~~t~k~L~~laa~~lLagC~a~~~tl~~tpp~~~~sqn~v~-------~~~~vs--------it~--kD~R~~-q~vA 72 (204) T COG3056 11 SKNMTKKILFPLAAIFLLAGCAAPPTTLIVTPPAPLPSQNPVL-------MGVTVS--------ITG--KDQRSD-QAVA 72 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCHH-------CCEEEE--------EEC--CCCHHH-HHHH T ss_conf 1788999999999999997537997503136997664437233-------141899--------970--550166-7887 Q ss_pred CCCCCCEEEEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCCCCCE Q ss_conf 53456612553022789999999999998877513444200001244678888866899999843699996267877430 Q gi|254780474|r 87 KWCHSTLLELFVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQVAIIN 166 (332) Q Consensus 87 ~~~hs~~lEi~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~giatvn 166 (332) .....+.|+.+ +..|-+-.++=-+.-=.-+-+ --..+...+.++.++-.|.+=.. ++| T Consensus 73 ~v~~~~~lv~L-~as~~vr~llq~vl~~q~~sr-------------Gfrig~n~~~n~~i~V~qlya~V---~qg----- 130 (204) T COG3056 73 KVTRDNQLVKL-TASRDVRFLLQEVLEKQMTSR-------------GFRVGPNGEVNLQITVDQLYADV---EQG----- 130 (204) T ss_pred HHHCCCCEEEE-CCCCCHHHHHHHHHHHHHHHC-------------CEEECCCCCCEEEEEHHHHHHHH---HHC----- T ss_conf 62047978983-478139999999999986534-------------36753689831899849966355---633----- Q ss_pred EEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECC Q ss_conf 689636982899998358101102201074031169851489999427748966613 Q gi|254780474|r 167 ELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSA 223 (332) Q Consensus 167 eL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~ca 223 (332) -|.--.+.+|...++...- .|| |....+.+.+.+|.|..-.- T Consensus 131 nlryni~tkv~i~I~A~~~---------~Gk------~~Kny~at~~~~GafsAsN~ 172 (204) T COG3056 131 NLRYNIDTKVDITITATAA---------NGK------MVKNYRATYNVEGAFSASND 172 (204) T ss_pred CEEECCCCEEEEEEEEECC---------CCC------EEEEEEEEEEEEEEEECCCH T ss_conf 1233025458899999737---------871------55324556666655405886 No 104 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=39.77 E-value=16 Score=16.43 Aligned_cols=17 Identities=12% Similarity=0.483 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 89999999999874136 Q gi|254780474|r 16 LKFFFLFFSSVMLSGFE 32 (332) Q Consensus 16 l~~~~l~~~~~lLsGC~ 32 (332) +++++++++.++|+||. T Consensus 1 mr~l~~~~a~l~LsGCa 17 (231) T PRK12788 1 MRLLVAILACLALAGCA 17 (231) T ss_pred CCHHHHHHHHHHHHHHC T ss_conf 90899999999976403 No 105 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=39.02 E-value=23 Score=15.27 Aligned_cols=20 Identities=35% Similarity=0.360 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 66889999999999874136 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~ 32 (332) .|++-+..++.+.++|+||. T Consensus 3 ~kk~~~~~~~~~~l~lsGC~ 22 (243) T PRK13731 3 TKKLMMVALVSSTLALSGCG 22 (243) T ss_pred HHHHHHHHHHHHHHHHCCHH T ss_conf 37757999999999971568 No 106 >PRK02939 hypothetical protein; Reviewed Probab=38.54 E-value=22 Score=15.53 Aligned_cols=31 Identities=10% Similarity=0.127 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHCCC-CCCCCCCCHHHHHHHHH Q ss_conf 99999999874136-43268989899999999 Q gi|254780474|r 19 FFLFFSSVMLSGFE-FIVMDPYGDIALQQASL 49 (332) Q Consensus 19 ~~l~~~~~lLsGC~-~~~l~P~G~ia~~~~~L 49 (332) +++.+.+++|+||+ ...+.|-.|....-.++ T Consensus 5 ~~~~~l~~~L~gCD~~~al~~ftP~masfSn~ 36 (236) T PRK02939 5 LLPCLLAILLTGCDRTEVTLSFTPEMASFSNE 36 (236) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCHHHHCCCCC T ss_conf 47899999876268888765348667242100 No 107 >PRK13279 arnT 4-amino-4-deoxy-L-arabinose transferase; Provisional Probab=38.53 E-value=21 Score=15.57 Aligned_cols=49 Identities=12% Similarity=0.263 Sum_probs=30.0 Q ss_pred EEEEEECCCEE--------------EEEECCCC-CCCCCCCCEEEEEECCHHHHHHHHHHHHCCCC Q ss_conf 89999427748--------------96661332-65761136589997088999999999860724 Q gi|254780474|r 207 KLHAVMNKKGA--------------YSGFSANY-SGRGFSHMRFKFYGKSEKGFEDWISKVKRQGT 257 (332) Q Consensus 207 ~l~~~a~~~G~--------------y~G~cae~-cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~ 257 (332) .-.+.+|..|+ -.++--|+ -|.+..+.+-+ -++.+||++|+++.|+++. T Consensus 451 ~~~~l~~~~g~~~~law~l~r~di~l~~~~gel~ygl~y~d~~~~--~~~~~~~~~wl~~~~~~g~ 514 (551) T PRK13279 451 SRYILSDSVGVAAGLAWELKRSDITLYETKGELKYGLSYPDAKGR--FVSLDDFPAWLAQARQEGI 514 (551) T ss_pred CEEEEECCHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHCCC--CCCHHHHHHHHHHHCCCCC T ss_conf 708876571088888777247737761675501247898254277--2069999999998255786 No 108 >TIGR02699 archaeo_AfpA archaeoflavoprotein AfpA; InterPro: IPR014072 The prototypical member of this archaeal protein family is AF1518 from Archaeoglobus fulgidus. This homodimer with two non-covalently bound FMN cofactors can receive electrons from ferredoxin, but not from a number of other electron donors such as NADH or rubredoxin. It can then donate electrons to various reductases.. Probab=38.33 E-value=22 Score=15.40 Aligned_cols=27 Identities=22% Similarity=0.295 Sum_probs=22.3 Q ss_pred EEEEECCCEEEEEEEECCCCHHCCCHH Q ss_conf 689636982899998358101102201 Q gi|254780474|r 167 ELVVPIDRPLEFRITASSVMNSFYIPG 193 (332) Q Consensus 167 eL~lPv~~pV~f~ltS~DViHSF~IP~ 193 (332) +=.+|-||..++++|-.||=|+=-+-+ T Consensus 133 ~T~lP~gk~L~L~~r~~DVe~v~kL~~ 159 (176) T TIGR02699 133 KTALPDGKKLELRMRKVDVENVEKLAQ 159 (176) T ss_pred EEECCCCCEEEEEEEEECHHHHHHHHH T ss_conf 985698961689876412024688765 No 109 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=38.28 E-value=27 Score=14.86 Aligned_cols=19 Identities=16% Similarity=-0.034 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8899999999998741364 Q gi|254780474|r 15 SLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 15 ~l~~~~l~~~~~lLsGC~~ 33 (332) .-.++++++.+++|+||.. T Consensus 4 ~k~l~~~~~l~l~l~gCs~ 22 (243) T PRK10866 4 MKYLVAAATLSLFLAGCSG 22 (243) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 8999999999999987689 No 110 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=37.83 E-value=24 Score=15.17 Aligned_cols=21 Identities=29% Similarity=0.322 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 668899999999998741364 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~ 33 (332) +|.+.++++++.++.|+||.. T Consensus 1 Kk~~~~~~~~~~~~~l~~C~~ 21 (289) T TIGR03659 1 KKILSLILLALLSLGLTGCSS 21 (289) T ss_pred CCHHHHHHHHHHHHHHHHCCC T ss_conf 912999999999999843289 No 111 >pfam11874 DUF3394 Domain of unknown function (DUF3394). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is about 190 amino acids in length. This domain is found associated with pfam06808. Probab=37.76 E-value=7.7 Score=18.69 Aligned_cols=35 Identities=17% Similarity=0.317 Sum_probs=23.3 Q ss_pred EEEEEEECCCCCCCCEEE-----EEECCCEEEEEEEECCC Q ss_conf 369999626787743068-----96369828999983581 Q gi|254780474|r 151 WKWLFLLPEQQVAIINEL-----VVPIDRPLEFRITASSV 185 (332) Q Consensus 151 W~W~F~YP~~giatvneL-----~lPv~~pV~f~ltS~DV 185 (332) |+|.-.||.+.....+++ .+|.|+.|+++++..|- T Consensus 48 f~ld~i~P~y~~~~~~~~~~~~~~~~~~~~lri~v~g~~~ 87 (183) T pfam11874 48 FWLDMIYPPYEERPPTELVQAAEALPAGEELRLRVEGEDL 87 (183) T ss_pred HHHHCCCCCHHCCCHHHHHHHHHCCCCCCEEEEEEEEECC T ss_conf 8885036852017988999999718999869999982178 No 112 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=36.84 E-value=28 Score=14.71 Aligned_cols=21 Identities=14% Similarity=0.102 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 668899999999998741364 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~ 33 (332) +....+++.++..++|+||+. T Consensus 2 k~~~~lv~~al~v~~LaaCSs 22 (342) T COG3317 2 KSSAKLVLGALLVLLLAACSS 22 (342) T ss_pred CHHHHHHHHHHHHHHHHHCCC T ss_conf 117889999999998741468 No 113 >KOG3653 consensus Probab=36.57 E-value=28 Score=14.68 Aligned_cols=34 Identities=18% Similarity=0.448 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999999999999999999885304677 Q gi|254780474|r 47 ASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNK 80 (332) Q Consensus 47 ~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~ 80 (332) ..+++..+.+.++.+..+++..+|.+||+|+... T Consensus 152 ~~~~~al~~~~~v~~l~~lvi~~~~~~r~~k~~~ 185 (534) T KOG3653 152 EVLIYALIPLLLVSLLAALVILAFLGYRQRKNAR 185 (534) T ss_pred CEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 5321357778899999999999999999862454 No 114 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=36.29 E-value=24 Score=15.22 Aligned_cols=17 Identities=18% Similarity=0.405 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999998741364 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEF 33 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~ 33 (332) ...+++.+.++|+||.. T Consensus 4 ~i~~il~~~llL~GCs~ 20 (304) T pfam07901 4 LIKLILIATLLLSGCST 20 (304) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 89999999999713344 No 115 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=35.93 E-value=29 Score=14.61 Aligned_cols=14 Identities=21% Similarity=0.396 Sum_probs=9.4 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999998741364 Q gi|254780474|r 20 FLFFSSVMLSGFEF 33 (332) Q Consensus 20 ~l~~~~~lLsGC~~ 33 (332) ++++.+++|+||.. T Consensus 5 ~l~~~~l~L~gCa~ 18 (224) T PRK12698 5 ILLALALLLAGCSS 18 (224) T ss_pred HHHHHHHHHHCCCC T ss_conf 99999999836568 No 116 >pfam10643 Cytochrome-c551 Photosystem P840 reaction-centre cytochrome c-551. A photosynthetic reaction-centre complex is found in certain green sulphur bacteria such as Chlorobium vibrioforme which are anaerobic photo-auto-trophic organisms. The primary electron donor is P840, a probable B-Chl a dimer, and the primary electron acceptor is a B-Chl monomer. Also on the donor side c-type cytochromes are known to function as electron donors to photo-oxidized P840. This family is thus the secondary endogenous donor of the photosynthetic reaction-centre complex and is a membrane-bound cytochrome containing a single haem group. Probab=35.91 E-value=29 Score=14.61 Aligned_cols=25 Identities=24% Similarity=0.579 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 0235668899999999998741364 Q gi|254780474|r 9 GVSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 9 ~~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) ..+++-..-...|+.+..+|+|-.. T Consensus 11 alAiGGAvlmgaLFfgvSflTGyk~ 35 (213) T pfam10643 11 ALAIGGAVLMGALFFGVSFLTGYKV 35 (213) T ss_pred EEHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 2035159999999999999844878 No 117 >pfam07856 Orai-1 Mediator of CRAC channel activity. ORAI-1 is a protein homologue of Drosophila Orai and human Orai1, Orai2 and Orai3. ORAI-1 GFP reporters are co- expressed with STIM-1 (ER CA(2+) sensors) in the gonad and intestine. The protein has four predicted transmembrane domains with a highly conserved region between TM2 ad TM3. This conserved domain is thought to function in channel regulation. ORAI1- related proteins are required for the production of the calcium channel, CRAC, along with STIM1-related proteins. Probab=35.62 E-value=29 Score=14.58 Aligned_cols=62 Identities=13% Similarity=0.156 Sum_probs=28.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC-C--CEEEEEEEHHHHHHHHHHHHHH Q ss_conf 999999999999999999999998853046777777875345-6--6125530227899999999999 Q gi|254780474|r 49 LIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCH-S--TLLELFVWLVPLVIVGFLAVIT 113 (332) Q Consensus 49 L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~h-s--~~lEi~wt~iP~iIi~~L~~~t 113 (332) -|.+++++.++.|..-++++.|. ||-.-+.+.. ++...+ + ..+......||+.++.+.+..- T Consensus 112 AW~FST~lG~~LFL~EI~il~WV--KF~~v~~~~~-~~~~~~~~~~AA~a~t~iliPv~ivFv~Faih 176 (205) T pfam07856 112 AWAFSTVLGLLLFLAEVVLLCWV--KFLPVGAPDQ-PTTHSPPGWQAAYASTAIMVPVGLVFVVFAIH 176 (205) T ss_pred HHHHHHHHHHHHHHHHHHHHHEE--EEEECCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999998815--6664477656-66667603889999999999999999999999 No 118 >PRK10598 hypothetical protein; Provisional Probab=35.60 E-value=27 Score=14.84 Aligned_cols=21 Identities=5% Similarity=0.018 Sum_probs=14.5 Q ss_pred CCEEEEEECCHHHHHHHHHHH Q ss_conf 365899970889999999998 Q gi|254780474|r 232 HMRFKFYGKSEKGFEDWISKV 252 (332) Q Consensus 232 ~M~f~v~v~s~~~f~~Wv~~~ 252 (332) .|+..=.-.+|++|+.=++.. T Consensus 115 ~L~L~~~~~~P~~~~~~l~~l 135 (186) T PRK10598 115 EMELVDATVQPEKMQTVMQTL 135 (186) T ss_pred EEEEEEECCCHHHHHHHHHHH T ss_conf 137876026989998889988 No 119 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=35.32 E-value=25 Score=15.02 Aligned_cols=24 Identities=25% Similarity=0.412 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999999999999885304677 Q gi|254780474|r 57 MLLIVVPVFFSILFFAWKYRSTNK 80 (332) Q Consensus 57 ~liV~v~V~~l~~~~~~kyR~~~~ 80 (332) -.+++.+++++..+|-|||+.++. T Consensus 23 k~li~gviLg~~~lfGW~ywq~~q 46 (207) T COG2976 23 KALIVGVILGLGGLFGWRYWQSHQ 46 (207) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 078999999999999889999987 No 120 >cd00454 Trunc_globin Truncated hemoglobins (trHbs) are a family of oxygen-binding heme proteins found in cyanobacteria, eubacteria, unicellular eukaryotes, and plants. The truncated hemoglobins have a characteristic two-over-two alpha helical folding pattern that is distinct from the three-over-three pattern found in other globins. A subset of these have been demonstrated to form homodimers. Probab=35.31 E-value=14 Score=16.75 Aligned_cols=36 Identities=19% Similarity=0.368 Sum_probs=24.8 Q ss_pred CEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHCCCCC Q ss_conf 74896661332657611365899970889999999998607245 Q gi|254780474|r 215 KGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVKRQGTF 258 (332) Q Consensus 215 ~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~~ 258 (332) |+.|.|.+-. .-|+.| ..++++|++|++-.+++-.. T Consensus 56 p~~Y~G~~m~---~~H~~l-----~I~~~~fd~wl~~~~~al~~ 91 (116) T cd00454 56 PGLYRGHPML---RRHLPF-----PITEEEFDAWLELLRDALDE 91 (116) T ss_pred CCCCCCCCHH---HHHHCC-----CCCHHHHHHHHHHHHHHHHH T ss_conf 9887744399---987379-----98999999999999999988 No 121 >PRK11372 lysozyme inhibitor; Provisional Probab=35.28 E-value=29 Score=14.63 Aligned_cols=21 Identities=24% Similarity=0.325 Sum_probs=10.6 Q ss_pred CCCCEEEEEEEEEEEEEEEEC Q ss_conf 888668999998436999962 Q gi|254780474|r 138 NSKPLVVEVVALDWKWLFLLP 158 (332) Q Consensus 138 ~~~~l~I~V~a~qW~W~F~YP 158 (332) +..|++|..-..+=.=.|.|- T Consensus 38 gt~pLTV~~~n~~~~VsfvlD 58 (109) T PRK11372 38 DEKPLTVKLNNPRQEVSFVYD 58 (109) T ss_pred CCEEEEEEECCCCCEEEEEEC T ss_conf 988379995387637999985 No 122 >pfam05590 DUF769 Xylella fastidiosa protein of unknown function (DUF769). This family consists of several uncharacterized hypothetical proteins of unknown function from Xylella fastidiosa, the organism that causes Pierce's disease in plants. Probab=35.24 E-value=24 Score=15.21 Aligned_cols=15 Identities=27% Similarity=0.363 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999998741364 Q gi|254780474|r 19 FFLFFSSVMLSGFEF 33 (332) Q Consensus 19 ~~l~~~~~lLsGC~~ 33 (332) .+++++++||+||+. T Consensus 9 ~SllaAslLLagCSs 23 (279) T pfam05590 9 CSLLAASLLLAGCSS 23 (279) T ss_pred HHHHHHHHHHHCCCC T ss_conf 699999999723788 No 123 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=35.01 E-value=26 Score=14.95 Aligned_cols=17 Identities=29% Similarity=0.395 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999998741364 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEF 33 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~ 33 (332) +++++.+..++|+||.. T Consensus 2 r~l~l~~~~l~LsGC~~ 18 (220) T PRK12407 2 RFLILTPMVLALCGCES 18 (220) T ss_pred CCHHHHHHHHHHHCCCC T ss_conf 15799999999736588 No 124 >PRK13684 Ycf48-like protein; Provisional Probab=34.97 E-value=30 Score=14.51 Aligned_cols=26 Identities=23% Similarity=0.309 Sum_probs=19.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 20235668899999999998741364 Q gi|254780474|r 8 MGVSVGKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 8 ~~~~~~k~l~~~~l~~~~~lLsGC~~ 33 (332) |...+++.+.++++++..++|+||.. T Consensus 1 m~~~~~~~~~l~l~~~~~~~~~~c~~ 26 (333) T PRK13684 1 MKRLLKSLLNLLLLLALGLVLSGCST 26 (333) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 92688889999999999986322356 No 125 >pfam06788 UPF0257 Uncharacterized protein family (UPF0257). Probab=34.46 E-value=20 Score=15.76 Aligned_cols=27 Identities=11% Similarity=0.144 Sum_probs=17.7 Q ss_pred CCCCCCEEEEEEEEEEEEEEEECCCCC Q ss_conf 888886689999984369999626787 Q gi|254780474|r 136 AKNSKPLVVEVVALDWKWLFLLPEQQV 162 (332) Q Consensus 136 ~~~~~~l~I~V~a~qW~W~F~YP~~gi 162 (332) ..+++-..+++.+.+-+-.|+|-++|- T Consensus 126 ~~dD~GfV~~a~~~e~~~~YrYD~~G~ 152 (236) T pfam06788 126 ETDDNGFVVSAHGKEMKVEYRYDDEGY 152 (236) T ss_pred EECCCCEEEEEECCCCEEEEEECCCCC T ss_conf 957983588830164047888648996 No 126 >PRK11653 hypothetical protein; Provisional Probab=34.30 E-value=29 Score=14.65 Aligned_cols=33 Identities=21% Similarity=0.098 Sum_probs=17.4 Q ss_pred CCHHHHHHHHHHHHHHH-----HH-HHHHHHHHHHCCCC Q ss_conf 91023442023566889-----99-99999998741364 Q gi|254780474|r 1 MSDCLNWMGVSVGKSLK-----FF-FLFFSSVMLSGFEF 33 (332) Q Consensus 1 ~~~~~~~~~~~~~k~l~-----~~-~l~~~~~lLsGC~~ 33 (332) ||-.-|.+.-.++|.++ .+ +.+.++++|+||+- T Consensus 1 mKRt~~In~~~~rK~w~~~~l~P~alav~avF~LagCe~ 39 (223) T PRK11653 1 MKRTKSINLASFRKNWSARHLTPVALAVAAVFMLAGCEK 39 (223) T ss_pred CCCHHHHCHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 974123267888613654431278889889998650478 No 127 >pfam06024 DUF912 Nucleopolyhedrovirus protein of unknown function (DUF912). This family consists of several Nucleopolyhedrovirus proteins of unknown function. Probab=34.19 E-value=31 Score=14.42 Aligned_cols=31 Identities=16% Similarity=0.322 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999999999999998853046 Q gi|254780474|r 48 SLIRIAVALMLLIVVPVFFSILFFAWKYRST 78 (332) Q Consensus 48 ~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~ 78 (332) +.++++....+.|++.+++..-|+..|-|++ T Consensus 62 ~~ili~ll~~~~ilvlly~IyYFVILRer~~ 92 (101) T pfam06024 62 NIILIGLLAFFCVLVLLYAIYYFVILRERRK 92 (101) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHEEEECCC T ss_conf 4589999999999999999998623641230 No 128 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=34.09 E-value=31 Score=14.41 Aligned_cols=19 Identities=16% Similarity=0.219 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8899999999998741364 Q gi|254780474|r 15 SLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 15 ~l~~~~l~~~~~lLsGC~~ 33 (332) .+.+.++++..++|+||.. T Consensus 3 ~~~~~~~~~~~~~LsgCa~ 21 (231) T PRK00249 3 MMLIALALLLLLLLSGCAS 21 (231) T ss_pred CHHHHHHHHHHHHHHHCCC T ss_conf 3899999999999875348 No 129 >PRK10781 rcsF outer membrane lipoprotein; Reviewed Probab=34.02 E-value=27 Score=14.83 Aligned_cols=22 Identities=18% Similarity=0.440 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9999999999874136432689 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEFIVMDP 38 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~~~l~P 38 (332) |.+.+.+.+++|+||...-..| T Consensus 2 R~lpi~llal~ltGCs~l~~~p 23 (133) T PRK10781 2 RALPICLLALMLTGCSMLSRSP 23 (133) T ss_pred CHHHHHHHHHHHHCCCCCCCCC T ss_conf 2089999999984313445488 No 130 >PRK09973 putative outer membrane lipoprotein; Provisional Probab=33.68 E-value=20 Score=15.74 Aligned_cols=21 Identities=14% Similarity=0.132 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 668899999999998741364 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~ 33 (332) ++++.+-.+++++.+|+||.. T Consensus 2 rtk~vlGaviLaS~LLaGCsn 22 (85) T PRK09973 2 KTIFTVGAVVLATCLLSGCVN 22 (85) T ss_pred CCEEHHHHHHHHHHHHHCCCC T ss_conf 602110699999999981555 No 131 >pfam09577 Spore_YpjB Sporulation protein YpjB (SpoYpjB). These proteins are found in the endospore-forming bacteria which include Bacillus species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set. Probab=33.50 E-value=32 Score=14.35 Aligned_cols=16 Identities=19% Similarity=0.347 Sum_probs=9.0 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 7899999999999988 Q gi|254780474|r 101 VPLVIVGFLAVITWDA 116 (332) Q Consensus 101 iP~iIi~~L~~~t~~~ 116 (332) +-.+||+.|.+.+|+- T Consensus 206 tGg~Iv~tLTYVGwRK 221 (232) T pfam09577 206 TGGIIILTLTYVGWRK 221 (232) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999987766765 No 132 >COG5510 Predicted small secreted protein [Function unknown] Probab=33.46 E-value=30 Score=14.50 Aligned_cols=21 Identities=14% Similarity=0.325 Sum_probs=13.3 Q ss_pred HHHH-HHHHHHHHHHHHHCCCC Q ss_conf 6688-99999999998741364 Q gi|254780474|r 13 GKSL-KFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l-~~~~l~~~~~lLsGC~~ 33 (332) +|.+ ...++++.+.+|++||- T Consensus 3 k~t~l~i~~vll~s~llaaCNT 24 (44) T COG5510 3 KKTILLIALVLLASTLLAACNT 24 (44) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 4899999999999999988663 No 133 >pfam07010 Endomucin Endomucin. This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells. Probab=33.41 E-value=32 Score=14.34 Aligned_cols=14 Identities=21% Similarity=0.264 Sum_probs=5.3 Q ss_pred HHHHCCCCCCHHCC Q ss_conf 77513444200001 Q gi|254780474|r 117 THRMDPYAPLERIS 130 (332) Q Consensus 117 ~~~ldp~~p~~~~~ 130 (332) .++-||..|....+ T Consensus 214 c~k~~p~~~eng~~ 227 (259) T pfam07010 214 CWKRDPGTPENGND 227 (259) T ss_pred HHCCCCCCCCCCCC T ss_conf 85169899777766 No 134 >pfam03126 Plus-3 Plus-3 domain. This domain is about 90 residues in length and is often found associated with the pfam02213 domain. The function of this domain is uncertain. It is possible that this domain is involved in DNA binding as it has three conserved positively charged residues, hence this domain has been named the plus-3 domain. It is found in yeast Rtf1 which may be a transcription elongation factor. Probab=32.57 E-value=33 Score=14.24 Aligned_cols=46 Identities=20% Similarity=0.282 Sum_probs=29.0 Q ss_pred EECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHH Q ss_conf 0311698514899994277489666133265761136589997088999999999 Q gi|254780474|r 197 QIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISK 251 (332) Q Consensus 197 k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~ 251 (332) |.|.+.+..|..++....-+ ....|.=...|+++| +++||++|++. T Consensus 59 k~Y~l~~~~T~~~L~v~~g~----~~~~~~i~~vSn~~f-----te~E~~~w~~~ 104 (105) T pfam03126 59 KPYKLGKKTTDKYLVVTHGK----SEKLFRISFVSNSPF-----TEEEFERWLQS 104 (105) T ss_pred CCCCCCCEEEEEEEEEEECC----EEEEEEEEEEECCCC-----CHHHHHHHHHH T ss_conf 42333986854899998799----888999777628999-----99999999971 No 135 >TIGR00005 rluA_subfam pseudouridine synthase, RluA family; InterPro: IPR006225 This is the RluD subfamily of pseudouridine synthases. In Escherichia coli, RluD (SfhB) modifies uridine to pseudouridine at 23S RNA U1911, 1915, and 1917. RluC modifies 955, 2504 and 2580, and RluA modifies U746 and tRNA U32. An additional homologue from E. coli outside this family, TruC (Q46918 from SWISSPROT), modifies uracil-65 in transfer RNAs to pseudouridine. ; GO: 0003723 RNA binding, 0009982 pseudouridine synthase activity, 0001522 pseudouridine synthesis. Probab=32.40 E-value=30 Score=14.47 Aligned_cols=31 Identities=6% Similarity=0.140 Sum_probs=20.6 Q ss_pred CEEEEE--ECCCCCCCCCCCCEEEEEECCHHHHHHHHHHH Q ss_conf 748966--61332657611365899970889999999998 Q gi|254780474|r 215 KGAYSG--FSANYSGRGFSHMRFKFYGKSEKGFEDWISKV 252 (332) Q Consensus 215 ~G~y~G--~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ 252 (332) .|++-. +|+| ||+....+++ |+||..|++.. T Consensus 304 ~ge~~~~~~C~~-Ce~~~~~AP~------p~~~~~~l~~L 336 (337) T TIGR00005 304 KGELLEGKKCSE-CETELFEAPL------PADLKELLEAL 336 (337) T ss_pred CCCEECCCCCCC-CCCCHHHCCC------CHHHHHHHHHC T ss_conf 871432650532-4762110588------07999999960 No 136 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=32.20 E-value=30 Score=14.49 Aligned_cols=15 Identities=20% Similarity=0.354 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999998741364 Q gi|254780474|r 19 FFLFFSSVMLSGFEF 33 (332) Q Consensus 19 ~~l~~~~~lLsGC~~ 33 (332) .++++++++|+||.. T Consensus 6 ~~l~~~~llLsgCat 20 (203) T PRK00022 6 RLLPLAALLLAGCAS 20 (203) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999999866148 No 137 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=32.19 E-value=33 Score=14.20 Aligned_cols=18 Identities=22% Similarity=0.359 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 899999999998741364 Q gi|254780474|r 16 LKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 16 l~~~~l~~~~~lLsGC~~ 33 (332) +..+++++.+++|+||.. T Consensus 2 ~~~~~~~~~~~~l~sCs~ 19 (235) T TIGR03302 2 LLLILLLALLLLLAGCSS 19 (235) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 799999999999987389 No 138 >pfam06007 PhnJ Phosphonate metabolism protein PhnJ. This family consists of several bacterial phosphonate metabolism (PhnJ) sequences. The exact role that PhnJ plays in phosphonate utilisation is unknown. Probab=31.75 E-value=21 Score=15.61 Aligned_cols=85 Identities=18% Similarity=0.221 Sum_probs=53.6 Q ss_pred CCCCCEEEEEECCCEEEE---EEEECCCCHHCCCHHHCC-------------EECCCCCCCEEEEEEEC----CCEEEEE Q ss_conf 877430689636982899---998358101102201074-------------03116985148999942----7748966 Q gi|254780474|r 161 QVAIINELVVPIDRPLEF---RITASSVMNSFYIPGLAG-------------QIYAMAGMETKLHAVMN----KKGAYSG 220 (332) Q Consensus 161 giatvneL~lPv~~pV~f---~ltS~DViHSF~IP~l~~-------------k~daiPG~~t~l~~~a~----~~G~y~G 220 (332) +|+--+++..-.+-||.. .+.+-+.|--|=+|.|.. +|||+|-.+.---|..+ ++-.+.+ T Consensus 158 di~r~g~i~t~~~YPv~VngrYvm~PSPIPrfDnPKl~~~paL~LfGAGREkRIYAvPPyT~V~sL~FeD~PF~ve~~~~ 237 (279) T pfam06007 158 DIARFGEIATSYDYPVMVNGRYVMDPSPIPRFDNPKLHMSPALQLFGAGREKRIYAVPPYTRVESLAFEDHPFRVEDFDD 237 (279) T ss_pred HHHHCCCEEECCCCCEEECCEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99871846530358767878375189998888871324670654422773103675399775222454578865103557 Q ss_pred ECCCCCCCCCCCCEEEE--------EECCHHHHH Q ss_conf 61332657611365899--------970889999 Q gi|254780474|r 221 FSANYSGRGFSHMRFKF--------YGKSEKGFE 246 (332) Q Consensus 221 ~cae~cG~ghs~M~f~v--------~v~s~~~f~ 246 (332) .|+ +||..+|.+.=.+ .+.|..||= T Consensus 238 ~C~-lCGs~~syLdEvi~dd~G~r~~~CSDtdyC 270 (279) T pfam06007 238 PCA-LCGSTDSFLDEVILDDDGSRMFVCSDTDYC 270 (279) T ss_pred CCC-CCCCCCEECCCEEECCCCCEEEEECCHHHH T ss_conf 200-018987114104554999778996875888 No 139 >KOG3846 consensus Probab=31.70 E-value=20 Score=15.74 Aligned_cols=20 Identities=20% Similarity=0.250 Sum_probs=16.8 Q ss_pred EEECCCCCCHHHHHHCCCCC Q ss_conf 17204783542687315244 Q gi|254780474|r 277 RYFSPIEKGLYYSILNLCVH 296 (332) Q Consensus 277 ~~f~~~~~~lf~~i~~~~~~ 296 (332) -|||+-.|..|+....+.+. T Consensus 406 l~Fssp~~~vfqeL~krGv~ 425 (465) T KOG3846 406 LYFSSPTPKVFQELHKRGVI 425 (465) T ss_pred EEECCCCHHHHHHHHHCCEE T ss_conf 99679951889999873701 No 140 >TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302; InterPro: IPR012683 Members of this family are bacterial proteins from the alphaproteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted to be outside the membrane. Bradyrhizobium japonicum contains two tandem copies that are, together, homologous to the single members found in other species. . Probab=31.29 E-value=34 Score=14.10 Aligned_cols=41 Identities=12% Similarity=0.187 Sum_probs=28.0 Q ss_pred CCEEEEEECCCEEEEEEEECC-CCHHCCCHHHCCEECCCCCC Q ss_conf 430689636982899998358-10110220107403116985 Q gi|254780474|r 164 IINELVVPIDRPLEFRITASS-VMNSFYIPGLAGQIYAMAGM 204 (332) Q Consensus 164 tvneL~lPv~~pV~f~ltS~D-ViHSF~IP~l~~k~daiPG~ 204 (332) ....|-||++--|.++++--| .--..---+-+++.-.|+=- T Consensus 219 ~~~pitvP~gS~l~VR~~GG~Gt~E~~~~~~agg~~~eikP~ 260 (899) T TIGR02302 219 ESDPITVPQGSTLLVRVSGGDGTEERLLDIEAGGEVLEIKPD 260 (899) T ss_pred CCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCCCEECCCCC T ss_conf 567735775652788852588712456666417863303760 No 141 >PRK11443 hypothetical protein; Provisional Probab=31.21 E-value=32 Score=14.32 Aligned_cols=16 Identities=19% Similarity=0.362 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 9999999987413643 Q gi|254780474|r 19 FFLFFSSVMLSGFEFI 34 (332) Q Consensus 19 ~~l~~~~~lLsGC~~~ 34 (332) +.+++.+++|+||..+ T Consensus 4 ~~~~~~~lll~GCq~~ 19 (120) T PRK11443 4 FIAPLLALLVSGCQID 19 (120) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 8999999999666789 No 142 >TIGR01675 plant-AP plant acid phosphatase; InterPro: IPR010028 This entry represents a family of acid phosphatase , from plants which are closely related to the class B non-specific acid phosphatase OlpA (IPR006423 from INTERPRO, which is believed to be a 5'-nucleotide phosphatase) and somewhat more distantly to another class B phosphatase, AphA (IPR010025 from INTERPRO). Together these three clades define a subfamily of Acid phosphatase (Class B), which corresponds to the IIIB subfamily of the haloacid dehalogenase (HAD) superfamily of aspartate nucleophile hydrolases. It has been reported that the best substrates were purine 5'-nucleoside phosphates . This is in concordance with the assignment of the Haemophilus influenzae hel protein (from IPR006423 from INTERPRO) as a 5'-nucleotidase, however there is presently no other evidence to support this specific function for this family of plant phosphatases. Many genes from this family have been annotated as vegetative storage proteins (VSPs) due to their close homology with these earlier-characterised gene products which are highly expressed in leaves. There are significant differences however, including expression levels and distribution . The most important difference is the lack in authentic VSPs of the nucleophilic aspartate residue, which is instead replaced by serine, glycine or asparagine. Thus these proteins can not be expected to be active phosphatases. This issue was confused by the publication in 1992 of an article claiming activity for the Glycine max VSP . In 1994 this assertion was refuted by the separation of the activity from the VSP. This entry explicitly excludes the VSPs which lack the nucleophilic aspartate. The possibility exists, however, that some members of this family may, while containing all of the conserved HAD-superfamily catalytic residues, lack activity and have a function related to the function of the VSPs rather than the acid phosphatases.; GO: 0003993 acid phosphatase activity. Probab=31.14 E-value=30 Score=14.52 Aligned_cols=46 Identities=13% Similarity=0.185 Sum_probs=36.2 Q ss_pred CCCCCCE-EEEEECCHHHHHHHHHHHHC-CCCCCCHHHHHHHHHHHHC Q ss_conf 7611365-89997088999999999860-7245698999997610212 Q gi|254780474|r 228 RGFSHMR-FKFYGKSEKGFEDWISKVKR-QGTFLNRQEYLLLKKPSER 273 (332) Q Consensus 228 ~ghs~M~-f~v~v~s~~~f~~Wv~~~ka-~~~~l~~~~y~~l~~ps~~ 273 (332) ..|+.-+ |=.+...+-+|+.|+.+-+| .+-+-...-|.+|.+-+.+ T Consensus 101 pYY~k~gfyG~e~~D~~~F~~W~~~G~atPAlp~~l~LY~~~~~lG~K 148 (245) T TIGR01675 101 PYYKKHGFYGLEKTDPTAFEEWLEKGKATPALPEVLKLYQKILELGIK 148 (245) T ss_pred CHHHCCCCCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCE T ss_conf 111137621576578546899986586788863267999999981870 No 143 >pfam05915 DUF872 Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterized eukaryotic proteins. The function of this family is unknown. Probab=30.81 E-value=33 Score=14.18 Aligned_cols=14 Identities=21% Similarity=0.384 Sum_probs=5.1 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254780474|r 51 RIAVALMLLIVVPV 64 (332) Q Consensus 51 ~~~~~l~liV~v~V 64 (332) +....+..+.++|- T Consensus 74 ~~~l~lG~l~fiPG 87 (112) T pfam05915 74 YVFLFLGILLFIPG 87 (112) T ss_pred HHHHHHHHHHHCCC T ss_conf 38799899982664 No 144 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=30.43 E-value=35 Score=14.00 Aligned_cols=21 Identities=14% Similarity=0.221 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 668899999999998741364 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~ 33 (332) ++-++++++++..+.|+||.. T Consensus 2 k~~~~s~~ala~l~sLA~CG~ 22 (58) T COG5567 2 KNVFKSLLALATLFSLAGCGL 22 (58) T ss_pred HHHHHHHHHHHHHHHHHHCCC T ss_conf 048999999999999985266 No 145 >PRK05978 hypothetical protein; Provisional Probab=30.18 E-value=36 Score=13.97 Aligned_cols=24 Identities=8% Similarity=0.005 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999988530 Q gi|254780474|r 53 AVALMLLIVVPVFFSILFFAWKYR 76 (332) Q Consensus 53 ~~~l~liV~v~V~~l~~~~~~kyR 76 (332) ++..+++|..+|..+.+++-..|+ T Consensus 73 A~ftIlIvGhivvpl~l~~e~~~~ 96 (149) T PRK05978 73 AYLVIVIVGHIVVGLFMGVETTFA 96 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 137899999999999999999718 No 146 >COG3065 Slp Starvation-inducible outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=29.47 E-value=37 Score=13.89 Aligned_cols=15 Identities=20% Similarity=0.111 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999998741364 Q gi|254780474|r 19 FFLFFSSVMLSGFEF 33 (332) Q Consensus 19 ~~l~~~~~lLsGC~~ 33 (332) +++.+.+++||||.. T Consensus 11 ~l~~~laflLsgC~t 25 (191) T COG3065 11 ALIGTLAFLLSGCVT 25 (191) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999999763035 No 147 >PRK11459 multidrug resistance outer membrane protein MdtQ; Provisional Probab=29.44 E-value=37 Score=13.89 Aligned_cols=21 Identities=10% Similarity=-0.058 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 566889999999999874136 Q gi|254780474|r 12 VGKSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 12 ~~k~l~~~~l~~~~~lLsGC~ 32 (332) -++.+.....+..+++|+||. T Consensus 3 ~~~f~~~~a~l~~~llLaGCa 23 (478) T PRK11459 3 RDSFYAATASLPLFILLAGCA 23 (478) T ss_pred CCCHHHHHHHHHHHHHHHCCC T ss_conf 740588899999999983788 No 148 >TIGR02142 modC_ABC molybdate ABC transporter, ATP-binding protein; InterPro: IPR011868 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP-binding cassette (ABC) protein of the three subunit molybdate ABC transporter. The three proteins of this complex are homologous to proteins of the sulphate ABC transporter. Molybdenum may be used in nitrogenases of nitrogen-fixing bacteria and in molybdopterin cofactors. In some cases, molybdate may be transported by a sulphate transporter rather than by a specific molybdate transporter.; GO: 0005524 ATP binding, 0015098 molybdate ion transmembrane transporter activity, 0015689 molybdate ion transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=29.38 E-value=30 Score=14.49 Aligned_cols=68 Identities=18% Similarity=0.107 Sum_probs=49.6 Q ss_pred EEEEEEE-EEEEEEEEEC--CCCCCCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEE Q ss_conf 6899999-8436999962--678774306896369828999983581011022010740311698514899 Q gi|254780474|r 142 LVVEVVA-LDWKWLFLLP--EQQVAIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLH 209 (332) Q Consensus 142 l~I~V~a-~qW~W~F~YP--~~giatvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~ 209 (332) +...|.+ .|=+=++.|+ +.+--.|++.-=|+|++.+|++.|+||+=.--=|+=-.=.++.|+++..+. T Consensus 238 ~~~~v~~~~~~ygL~~l~l~e~~~l~V~~~~~~~G~~~R~~v~A~DVsL~~~~P~~~Si~N~L~a~v~~~~ 308 (361) T TIGR02142 238 LEGVVAEHDQHYGLTALRLGEDGHLWVPEKKGPVGARLRLRVPARDVSLALQKPEATSIRNVLPARVVEIE 308 (361) T ss_pred EEEEEECCCCCCCCEECCCCCCCEEEEECCCCCCCCCEEEEEEECCEEEECCCCCCCCHHCCCCEEEEEEE T ss_conf 56541103865320120158883899726767658523568741428985278530302004612688877 No 149 >cd01225 PH_Cool_Pix Cool (cloned out of library)/Pix (PAK-interactive exchange factor) pleckstrin homology (PH) domain. Cool/Pix contains an N-terminal SH3 domain followed by a RhoGEF (DH) and PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. Probab=29.33 E-value=37 Score=13.88 Aligned_cols=24 Identities=8% Similarity=0.313 Sum_probs=19.0 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCC Q ss_conf 658999708899999999986072 Q gi|254780474|r 233 MRFKFYGKSEKGFEDWISKVKRQG 256 (332) Q Consensus 233 M~f~v~v~s~~~f~~Wv~~~ka~~ 256 (332) =+..|.--+++||++|++..++.. T Consensus 86 e~i~V~C~s~~e~~~Wl~hL~~~~ 109 (111) T cd01225 86 ERIVVVCNNPQDAQEWVELLNANN 109 (111) T ss_pred CCEEEEECCHHHHHHHHHHHHCCC T ss_conf 568999699899999999875029 No 150 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=29.23 E-value=37 Score=13.86 Aligned_cols=20 Identities=30% Similarity=0.373 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 66889999999999874136 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~ 32 (332) +|.+..++.++++++|+||. T Consensus 3 kK~i~~~~~~~svl~LaaC~ 22 (285) T PRK03002 3 GKHIFIITALISILMLSACG 22 (285) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 01899999999999999845 No 151 >KOG3637 consensus Probab=29.19 E-value=37 Score=13.86 Aligned_cols=32 Identities=13% Similarity=0.096 Sum_probs=14.8 Q ss_pred EEEEEEEEEEEEEEECCCCCCCCEEEEEECCC Q ss_conf 89999984369999626787743068963698 Q gi|254780474|r 143 VVEVVALDWKWLFLLPEQQVAIINELVVPIDR 174 (332) Q Consensus 143 ~I~V~a~qW~W~F~YP~~giatvneL~lPv~~ 174 (332) .|.+-+.+|.|.+.=.......+.+++.=.+. T Consensus 149 ~I~~Ca~~~~~~~~~~~~~~~~~g~cy~i~s~ 180 (1030) T KOG3637 149 VIVVCAHRYVIRYYIGVGERLPVGECYLIASD 180 (1030) T ss_pred EEEECCCHHEEEECCCCCCCCCCCEEEECCCC T ss_conf 58864520068852455665557568983688 No 152 >TIGR01647 ATPase-IIIA_H plasma-membrane proton-efflux P-type ATPase; InterPro: IPR006534 These sequences represent the plasma membrane proton efflux P-type ATPase found in plants, fungi, protozoa, slime molds and archaea. The best studied representative is from yeast . ; GO: 0016887 ATPase activity, 0015992 proton transport, 0016021 integral to membrane. Probab=29.09 E-value=37 Score=13.85 Aligned_cols=87 Identities=13% Similarity=0.106 Sum_probs=53.4 Q ss_pred CCCEEEEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCCCCCCEEEE Q ss_conf 56612553022789999999999998877513444200001244678888866899999843699996267877430689 Q gi|254780474|r 90 HSTLLELFVWLVPLVIVGFLAVITWDATHRMDPYAPLERISESKSIAKNSKPLVVEVVALDWKWLFLLPEQQVAIINELV 169 (332) Q Consensus 90 hs~~lEi~wt~iP~iIi~~L~~~t~~~~~~ldp~~p~~~~~~~~~~~~~~~~l~I~V~a~qW~W~F~YP~~giatvneL~ 169 (332) =++.|=+++-+||+.+=++|.+.-..+-++|--.+.+-..-... .+=.- +++-=-| .-|--|-|.|. T Consensus 260 l~f~LVLLvggIPiAmP~VlSvTmAvGA~~LAk~~AIV~rL~AI---EElAG--mdILCSD--------KTGTLTlNKL~ 326 (835) T TIGR01647 260 LQFALVLLVGGIPIAMPAVLSVTMAVGAAELAKKKAIVTRLTAI---EELAG--MDILCSD--------KTGTLTLNKLS 326 (835) T ss_pred HHCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHH---HHHHC--CCCCCCC--------CCCCCCCCEEE T ss_conf 13015545640000131455677889999986537750148888---88764--8621337--------73432144457 Q ss_pred EECCCEEEEE---EEECCCC-HHC Q ss_conf 6369828999---9835810-110 Q gi|254780474|r 170 VPIDRPLEFR---ITASSVM-NSF 189 (332) Q Consensus 170 lPv~~pV~f~---ltS~DVi-HSF 189 (332) +-..-|+.-. ++-+||| |++ T Consensus 327 i~~~~~~~~~G~~~~~~Dv~l~Aa 350 (835) T TIGR01647 327 IDEILPFFGGGPGFDKDDVLLYAA 350 (835) T ss_pred ECCEEEEECCCCCCCHHHHHHHHH T ss_conf 511124406573204888999887 No 153 >pfam06832 BiPBP_C Penicillin-Binding Protein C-terminus Family. This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (pfam00905). It is predicted by PROF to be an all beta fold. Probab=28.81 E-value=37 Score=13.81 Aligned_cols=73 Identities=12% Similarity=0.126 Sum_probs=44.5 Q ss_pred EEEECCCCCCCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCC Q ss_conf 9996267877430689636982899998358101102201074031169851489999427748966613326576 Q gi|254780474|r 154 LFLLPEQQVAIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRG 229 (332) Q Consensus 154 ~F~YP~~giatvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~g 229 (332) .+.||..|-.-.-.--.|..+++.|+......-..||+ .++--.--....++...++++|.|+..+-.=.|.. T Consensus 13 ~I~~P~~G~~~~l~~~~~~~~~l~l~a~~~~~~~~W~l---dg~~l~~~~~~~~~~~~p~~~G~~~L~vvD~~G~~ 85 (90) T pfam06832 13 RIIYPPDGAIIALDPDIPARQRLVLKAAGGEGPLYWFL---DGKPLGETKRGRELFWQPPGPGFHRLTVVDAAGRS 85 (90) T ss_pred EEECCCCCCEEEECCCCCCCCCEEEEECCCCCCEEEEE---CCEEECCCCCCCCEEECCCCCCEEEEEEECCCCCE T ss_conf 77758999999835899877634899617876579999---99990036788715855899950899998899988 No 154 >KOG0174 consensus Probab=28.75 E-value=37 Score=13.83 Aligned_cols=63 Identities=13% Similarity=0.181 Sum_probs=45.0 Q ss_pred ECCCCHHCCCHHHCCEECCCC--CCCEEEEEEECCCEE--EEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHC Q ss_conf 358101102201074031169--851489999427748--9666133265761136589997088999999999860 Q gi|254780474|r 182 ASSVMNSFYIPGLAGQIYAMA--GMETKLHAVMNKKGA--YSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVKR 254 (332) Q Consensus 182 S~DViHSF~IP~l~~k~daiP--G~~t~l~~~a~~~G~--y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka 254 (332) |..+|-+=|=++.|||+|.+| |+.++.-+.+--.|. -+|+|-.=- - .-+++||-.+.++++-+ T Consensus 114 ~AgliVAGwD~~~gGqVY~iplGG~l~rq~~aIgGSGStfIYGf~D~~~---r-------~nMt~EE~~~fvk~Av~ 180 (224) T KOG0174 114 SAGLIVAGWDEKEGGQVYSIPLGGSLTRQPFAIGGSGSTFIYGFCDANW---R-------PNMTLEECVRFVKNAVS 180 (224) T ss_pred HCCEEEEECCCCCCCEEEEEECCCEEEECCEEECCCCCEEEEEEEHHHC---C-------CCCCHHHHHHHHHHHHH T ss_conf 2142786325566963898614865762634322677414554433221---7-------99999999999999999 No 155 >PRK11189 lipoprotein NlpI; Provisional Probab=28.55 E-value=38 Score=13.78 Aligned_cols=20 Identities=20% Similarity=0.451 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 66889999999999874136 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~ 32 (332) ++.+++.+++.++++|+||. T Consensus 2 ~~~~r~~~~~~~~l~LsGC~ 21 (297) T PRK11189 2 KPFLRWCFVATAALLLAGCA 21 (297) T ss_pred CHHHHHHHHHHHHHHHHHHC T ss_conf 51899999999999998640 No 156 >PRK07363 NAD(P)H-quinone oxidoreductase subunit M; Validated Probab=28.49 E-value=38 Score=13.78 Aligned_cols=18 Identities=11% Similarity=0.133 Sum_probs=8.5 Q ss_pred CHHCCCHHHCCEECCCCC Q ss_conf 011022010740311698 Q gi|254780474|r 186 MNSFYIPGLAGQIYAMAG 203 (332) Q Consensus 186 iHSF~IP~l~~k~daiPG 203 (332) +-.+..=.++.|.=..|= T Consensus 207 ~~~l~l~gf~vK~~~~P~ 224 (499) T PRK07363 207 LLTLLLLGFGIKIPLVPL 224 (499) T ss_pred HHHHHHHHHHHHHCCCCC T ss_conf 999999999986261644 No 157 >PRK10175 hypothetical protein; Provisional Probab=28.32 E-value=22 Score=15.43 Aligned_cols=18 Identities=22% Similarity=0.346 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHCCCCC Q ss_conf 999999999987413643 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEFI 34 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~~ 34 (332) |.+++....++||||... T Consensus 2 r~i~~~~~~lllsGC~SI 19 (75) T PRK10175 2 RLIVVSIMVTLLSGCGSI 19 (75) T ss_pred EEHHHHHHHHHHCCCHHH T ss_conf 344888899998164244 No 158 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=28.32 E-value=37 Score=13.81 Aligned_cols=19 Identities=26% Similarity=0.275 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 6889999999999874136 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFE 32 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~ 32 (332) |++-+.+++.+++.|+||. T Consensus 2 Kk~~l~~~iasal~LagCG 20 (792) T TIGR03502 2 KKLLLSLAIASALGLAGCG 20 (792) T ss_pred CHHHHHHHHHHHHHHHCCC T ss_conf 3378999999987500457 No 159 >COG1113 AnsP Gamma-aminobutyrate permease and related permeases [Amino acid transport and metabolism] Probab=27.43 E-value=39 Score=13.65 Aligned_cols=28 Identities=21% Similarity=0.220 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999999999998853046 Q gi|254780474|r 51 RIAVALMLLIVVPVFFSILFFAWKYRST 78 (332) Q Consensus 51 ~~~~~l~liV~v~V~~l~~~~~~kyR~~ 78 (332) ...+.+....++.|+.+++.+-+|||+. T Consensus 362 ~~v~s~s~~~~l~vW~~I~~s~l~~rk~ 389 (462) T COG1113 362 ELVTSSSGLGLLFVWLMILLSQLKLRKA 389 (462) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999999999999861 No 160 >KOG1483 consensus Probab=27.41 E-value=39 Score=13.65 Aligned_cols=95 Identities=21% Similarity=0.249 Sum_probs=38.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 23566889999999999874136432689898999999999999999999999999999999885304677777787534 Q gi|254780474|r 10 VSVGKSLKFFFLFFSSVMLSGFEFIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWC 89 (332) Q Consensus 10 ~~~~k~l~~~~l~~~~~lLsGC~~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~ 89 (332) .-.+|++++++++.....+-.-+..+-.+-...|.--..+.++.-.++|+|. +.+.|.-++.....|.+.|. T Consensus 3 ~~~gk~~rli~~l~ltiiFfvLEli~gyv~~sLaLiadSfHML~dIiaLiva--------f~~ik~a~~~~~~k~tyGw~ 74 (404) T KOG1483 3 KYGGKSLRLISVLVLTIIFFVLELITGYVTNSLALIADSFHMLNDIIALIVA--------FWAIKEAKRIPLQKYTYGWA 74 (404) T ss_pred CCCCCCCCEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHCCCCCCCCCCHH T ss_conf 4356544356348888988774130020242177774578899999999999--------99987641276100486343 Q ss_pred CCCEEEEEEEHHHHHHHHHHHHHHHH Q ss_conf 56612553022789999999999998 Q gi|254780474|r 90 HSTLLELFVWLVPLVIVGFLAVITWD 115 (332) Q Consensus 90 hs~~lEi~wt~iP~iIi~~L~~~t~~ 115 (332) +-|+.=-.+-.++++.+.+..|. T Consensus 75 ---rAEilGalvN~ifl~alc~~I~~ 97 (404) T KOG1483 75 ---RAEILGALVNAIFLTALCVSILI 97 (404) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHH T ss_conf ---99998655588999999999999 No 161 >TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF. Probab=27.37 E-value=40 Score=13.64 Aligned_cols=74 Identities=22% Similarity=0.336 Sum_probs=42.1 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHH Q ss_conf 43268989899999999999999999999999999999988530467777778753456612553022789999999999 Q gi|254780474|r 33 FIVMDPYGDIALQQASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAVI 112 (332) Q Consensus 33 ~~~l~P~G~ia~~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~~ 112 (332) .++-+...++|.+.+..+--..---- -=-=..-|+.=|| ++.--|.|....+| .+.|..|++++++-+++ T Consensus 49 qsi~~S~a~~A~dmR~~I~~~i~~G~----sd~eI~~~l~~RY---G~~Il~~Pp~~~~t---~~LW~~P~~~l~~g~~~ 118 (126) T TIGR03147 49 QNLVESNSPIAYDLRHEVYSMVNEGK----SNQQIIDFMTARF---GDFVLYNPPFKWQT---LLLWLLPVVLLLLAFVI 118 (126) T ss_pred CCHHHCCHHHHHHHHHHHHHHHHCCC----CHHHHHHHHHHHC---CCEEEECCCCCCCH---HHHHHHHHHHHHHHHHH T ss_conf 77400761999999999999998599----9899999999970---88287539999641---89999999999999999 Q ss_pred HHHH Q ss_conf 9988 Q gi|254780474|r 113 TWDA 116 (332) Q Consensus 113 t~~~ 116 (332) .|+. T Consensus 119 ~~~~ 122 (126) T TIGR03147 119 LWRV 122 (126) T ss_pred HHHH T ss_conf 9998 No 162 >PRK09240 thiH thiamine biosynthesis protein ThiH; Reviewed Probab=27.29 E-value=40 Score=13.63 Aligned_cols=16 Identities=13% Similarity=0.113 Sum_probs=9.9 Q ss_pred EECCHHHHHHHHHHHH Q ss_conf 9708899999999986 Q gi|254780474|r 238 YGKSEKGFEDWISKVK 253 (332) Q Consensus 238 ~v~s~~~f~~Wv~~~k 253 (332) ..++..+|-+-|.-.| T Consensus 271 ~~vsD~~l~q~i~a~R 286 (371) T PRK09240 271 SIVSDRQLVQLICAFR 286 (371) T ss_pred CCCCHHHHHHHHHHHH T ss_conf 6578899999999999 No 163 >pfam11353 DUF3153 Protein of unknown function (DUF3153). This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed. Probab=26.62 E-value=25 Score=15.04 Aligned_cols=71 Identities=14% Similarity=-0.041 Sum_probs=33.4 Q ss_pred CCEEEEEEEEEEEEEEEEC--------CCCCCCCEEEEEECCCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEE Q ss_conf 8668999998436999962--------67877430689636982899998358101102201074031169851489999 Q gi|254780474|r 140 KPLVVEVVALDWKWLFLLP--------EQQVAIINELVVPIDRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAV 211 (332) Q Consensus 140 ~~l~I~V~a~qW~W~F~YP--------~~giatvneL~lPv~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~ 211 (332) .+.++..+-..|----.|- +-+....=||.+-...|=..+.+..+-...=-. .=...--..||..|++.+. T Consensus 101 ~~~~i~l~erN~l~~~~~~l~l~lDLr~L~~i~~l~L~~~l~~p~~~~~~~~~~~~~~~~-~~~l~W~L~~GeiN~Le~s 179 (210) T pfam11353 101 PAPELNLVERNWLLGVQQQLSLDLDLRSLPDLPGLELALSLNTPGGARNINSDPTAAEES-KGQLVWQLEPGEINHLEAS 179 (210) T ss_pred CCCCEEEEEEEEEEEEEEEEEEEECHHHCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC-CCEEEEECCCCCCCEEEEE T ss_conf 884101632116999855799998225748999845999956965343257999875335-7778873588872369999 No 164 >PRK11616 hypothetical protein; Provisional Probab=26.46 E-value=38 Score=13.78 Aligned_cols=22 Identities=23% Similarity=0.378 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 6889999999999874136432 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEFIV 35 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~~~ 35 (332) +.+..+.+..+.++++||.... T Consensus 3 n~l~~f~~~~g~l~~sGCSSvM 24 (109) T PRK11616 3 NVLLAFMICSGMLLLSGCSSVM 24 (109) T ss_pred CHHHHHHHHHHHHHHCCCHHHH T ss_conf 3033799985899875751230 No 165 >PRK11029 FtsH protease regulator HflC; Provisional Probab=26.34 E-value=41 Score=13.52 Aligned_cols=20 Identities=15% Similarity=0.270 Sum_probs=9.3 Q ss_pred CCCCEEEEEECCCEEEEEEE Q ss_conf 77430689636982899998 Q gi|254780474|r 162 VAIINELVVPIDRPLEFRIT 181 (332) Q Consensus 162 iatvneL~lPv~~pV~f~lt 181 (332) +.|.++..+=++.-|+++|+ T Consensus 79 vlT~D~k~i~VD~~v~wrI~ 98 (334) T PRK11029 79 FVTKEKKDLIVDSYIKWRIS 98 (334) T ss_pred EEECCCCEEEEEEEEEEEEC T ss_conf 88517848998558999973 No 166 >PRK09810 entericidin A; Provisional Probab=26.24 E-value=41 Score=13.51 Aligned_cols=19 Identities=26% Similarity=0.508 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 68899999999998741364 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~ 33 (332) |++.. ++++...+|+||+. T Consensus 3 krli~-lil~~~~ll~gcnt 21 (41) T PRK09810 3 KRLIV-LVLLASTLLTGCNT 21 (41) T ss_pred HHHHH-HHHHHHHHHHCCCC T ss_conf 89999-99999999825621 No 167 >pfam06604 consensus Probab=26.08 E-value=42 Score=13.49 Aligned_cols=21 Identities=10% Similarity=0.013 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 668899999999998741364 Q gi|254780474|r 13 GKSLKFFFLFFSSVMLSGFEF 33 (332) Q Consensus 13 ~k~l~~~~l~~~~~lLsGC~~ 33 (332) ..+.+++..+...++|+||.- T Consensus 3 ~sk~~~~~~~~~v~~laGCq~ 23 (181) T pfam06604 3 FSKAGLVAGLAAVLVLAGCQR 23 (181) T ss_pred CCHHHHHHHHHHHHHHHCCCC T ss_conf 206778999999999734453 No 168 >pfam00479 G6PD_N Glucose-6-phosphate dehydrogenase, NAD binding domain. Probab=25.39 E-value=43 Score=13.40 Aligned_cols=60 Identities=8% Similarity=0.109 Sum_probs=33.8 Q ss_pred CHHHHHHHHHHH--------HCCCCEEECCCCCCHHHHHHCCCCCCCCCCCC---HHHCCCCCCCCCHHHC Q ss_conf 989999976102--------12632172047835426873152444885660---2102466688523440 Q gi|254780474|r 260 NRQEYLLLKKPS--------ERDSVRYFSPIEKGLYYSILNLCVHPGKICMD---EMMRIDALGGGGMRGI 319 (332) Q Consensus 260 ~~~~y~~l~~ps--------~~~~~~~f~~~~~~lf~~i~~~~~~~~~~~~~---~~m~~~~~~g~~~~~~ 319 (332) +.++|..|.+-- ...+.-+|-+++|.+|..|+..--..|-...+ .-.++.--=|...++. T Consensus 83 ~~~~~~~L~~~l~~~~~~~~~~~~rifYLA~pP~~f~~i~~~L~~~~L~~~~~g~~RiVvEKPfG~Dl~Sa 153 (183) T pfam00479 83 DPEGYERLAERLEELEKKYGTGGNRLFYLALPPSVFGTVAENLKKAGLNKEEGGWTRVVIEKPFGHDLESA 153 (183) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHH T ss_conf 80369999999998876408887558999558899999999999828998888824799857888977889 No 169 >KOG3039 consensus Probab=25.32 E-value=21 Score=15.57 Aligned_cols=50 Identities=14% Similarity=0.090 Sum_probs=27.2 Q ss_pred CCCEEECCCCCCHHHHHHCCCC-------CCCCCCCCHHHCCCCCCCCCHHHCCCCH Q ss_conf 6321720478354268731524-------4488566021024666885234401100 Q gi|254780474|r 274 DSVRYFSPIEKGLYYSILNLCV-------HPGKICMDEMMRIDALGGGGMRGIDRHS 323 (332) Q Consensus 274 ~~~~~f~~~~~~lf~~i~~~~~-------~~~~~~~~~~m~~~~~~g~~~~~~~~~~ 323 (332) .|+.+.++...-.-...+-+.+ -.+..|.+..+..=.+||+|-+..|.+. T Consensus 235 ~~ca~Lr~sg~Vv~~ecvEklir~D~v~pv~d~plkdrdiI~LqrGGTGfa~tndhl 291 (303) T KOG3039 235 TPCAVLRPSGHVVTKECVEKLIRKDMVDPVTDKPLKDRDIIGLQRGGTGFAETNDHL 291 (303) T ss_pred CCEEEECCCCCEEEHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCHHH T ss_conf 513786167867549999986232462557798576455576603654545556054 No 170 >PRK12450 foldase protein PrsA; Reviewed Probab=25.29 E-value=43 Score=13.39 Aligned_cols=35 Identities=14% Similarity=0.065 Sum_probs=19.0 Q ss_pred HHHHH-HHHHHHHHHHHHHCCCCC-----CCC-CCCHHHHHH Q ss_conf 56688-999999999987413643-----268-989899999 Q gi|254780474|r 12 VGKSL-KFFFLFFSSVMLSGFEFI-----VMD-PYGDIALQQ 46 (332) Q Consensus 12 ~~k~l-~~~~l~~~~~lLsGC~~~-----~l~-P~G~ia~~~ 46 (332) ..|++ .-++.++++++|+||... +.. -.|.|-.++ T Consensus 3 ~mKK~~~~~~~~~svl~LaAC~s~~~~~~v~t~kgg~IT~~e 44 (309) T PRK12450 3 QMNKLITGVVTLATVVTLSACQSSHNNTKLVSMKGDTITVSD 44 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHH T ss_conf 799999999999999999860589998368983599240999 No 171 >TIGR03546 conserved hypothetical protein TIGR03546. Members of this family are uncharacterized proteins, usually encoded by a gene adjacent to a member of family TIGR03545, which is also uncharacterized. Probab=24.96 E-value=44 Score=13.35 Aligned_cols=30 Identities=20% Similarity=0.311 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999999999998853046 Q gi|254780474|r 49 LIRIAVALMLLIVVPVFFSILFFAWKYRST 78 (332) Q Consensus 49 L~~~~~~l~liV~v~V~~l~~~~~~kyR~~ 78 (332) +.+-+..+.++..+|+..+.-+.+.|||++ T Consensus 109 lvmGs~ii~lvl~~p~y~~~~~~V~~YR~~ 138 (154) T TIGR03546 109 IVMGSFVVGLILLPPAFAISKVIIAKYRKR 138 (154) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 888999999999999999999999999999 No 172 >PRK08475 F0F1 ATP synthase subunit B; Validated Probab=24.81 E-value=44 Score=13.33 Aligned_cols=10 Identities=30% Similarity=0.311 Sum_probs=4.6 Q ss_pred HHHHHHHHHH Q ss_conf 9999999885 Q gi|254780474|r 65 FFSILFFAWK 74 (332) Q Consensus 65 ~~l~~~~~~k 74 (332) ++++.||+++ T Consensus 38 ~gIL~yf~~k 47 (170) T PRK08475 38 VGILWYFAAK 47 (170) T ss_pred HHHHHHHHHH T ss_conf 9999999287 No 173 >pfam00585 Thr_dehydrat_C C-terminal regulatory domain of Threonine dehydratase. Threonine dehydratases pfam00291 all contain a carboxy terminal region. This region may have a regulatory role. Some members contain two copies of this region. This family is homologous to the pfam01842 domain. Probab=24.73 E-value=19 Score=15.90 Aligned_cols=58 Identities=12% Similarity=0.104 Sum_probs=30.2 Q ss_pred EECCCEEEEEECCCCCCC-------------CCCCCEEEEEECCHHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 942774896661332657-------------6113658999708899999999986072456989999976 Q gi|254780474|r 211 VMNKKGAYSGFSANYSGR-------------GFSHMRFKFYGKSEKGFEDWISKVKRQGTFLNRQEYLLLK 268 (332) Q Consensus 211 ~a~~~G~y~G~cae~cG~-------------ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~~l~~~~y~~l~ 268 (332) .|++||.++=+|..+.+. ..+.--.-+.+...+|++++++..++.+-....-+-++++ T Consensus 17 iPErpGal~~Fl~~l~~~~~ITeF~YR~~~~~~a~VlvGi~~~~~~e~~~l~~~L~~~gy~~~dls~ne~~ 87 (91) T pfam00585 17 FPERPGALLKFLDLLGGRNNITLFHYRKHGAKNGSILVGVQLSQASDLDEFIERLEKLGYDYVDLSDNEAA 87 (91) T ss_pred CCCCCCHHHHHHHHHCCCCCEEEEEEECCCCCCEEEEEEEEECCHHHHHHHHHHHHHCCCCEEECCCCHHH T ss_conf 88887499999998299555558998545897035999999798889999999999759980888799998 No 174 >PRK09739 hypothetical protein; Provisional Probab=24.69 E-value=37 Score=13.83 Aligned_cols=72 Identities=17% Similarity=0.154 Sum_probs=41.8 Q ss_pred CCEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCC-------EEEEEECCHHHH Q ss_conf 9828999983581011022010740311698514899994277489666133265761136-------589997088999 Q gi|254780474|r 173 DRPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHM-------RFKFYGKSEKGF 245 (332) Q Consensus 173 ~~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M-------~f~v~v~s~~~f 245 (332) ++++...+|+..--.+|---.+...|+ +.| +.| ..+|||..-+.. -+...-+.++.| T Consensus 120 ~k~~~~~~t~G~~~~~y~~~g~~~~~~------~~L-----~~G-----i~~ycGi~~~~~~~l~~~~~~~~~~~~~~~~ 183 (201) T PRK09739 120 FNKVRWVALVGGDKESFVKRGWEKNMT------DYL-----NVG-----IASYCGIEDSDVTFLYNTLVFDGEELHASHY 183 (201) T ss_pred CCEEEEEEECCCCHHHHCCCCHHHHHH------HHH-----HCC-----CEECCCCCCCCEEEECCCEEEECCCCCHHHH T ss_conf 983899994279979957223314589------998-----528-----3744871105217865716740453588999 Q ss_pred HHHHHHHHCCCCCCC Q ss_conf 999999860724569 Q gi|254780474|r 246 EDWISKVKRQGTFLN 260 (332) Q Consensus 246 ~~Wv~~~ka~~~~l~ 260 (332) +.|+++.+.--.+|+ T Consensus 184 ~~~~~~~~~~~~~~~ 198 (201) T PRK09739 184 QSLLSQARDMVDALE 198 (201) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999866 No 175 >PRK13312 heme-degrading monooxygenase IsdG; Provisional Probab=24.68 E-value=44 Score=13.31 Aligned_cols=54 Identities=19% Similarity=0.433 Sum_probs=34.3 Q ss_pred CCCCHHCCCHHHCCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHH Q ss_conf 581011022010740311698514899994277489666133265761136589997088999999999 Q gi|254780474|r 183 SSVMNSFYIPGLAGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISK 251 (332) Q Consensus 183 ~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~ 251 (332) ++++--|.= +++++.|||..----+...+.. +|..|.....=.|.++|++|.++ T Consensus 17 ~~~~erF~~---~~~IE~~eGF~~~evl~~~~~~------------d~dev~V~T~Wes~e~Fe~W~kS 70 (107) T PRK13312 17 KDIIERFYT---RHGIETLEGFDGMFVTQTLEQE------------DFDEVKILTVWKSKQAFTDWLKS 70 (107) T ss_pred HHHHHHHCC---CCCCEECCCCCEEEEEECCCCC------------CCEEEEEEEEECCHHHHHHHHHH T ss_conf 889998626---8981024882438999637888------------72489999898788898777653 No 176 >pfam06666 DUF1173 Protein of unknown function (DUF1173). This family contains a group of hypothetical bacterial proteins that contain three conserved cysteine residues towards the N-terminal. The function of these proteins is unknown. Probab=24.67 E-value=19 Score=15.87 Aligned_cols=35 Identities=11% Similarity=0.365 Sum_probs=28.5 Q ss_pred CCEEEEEEEECCCCHHCCCHHHCC---EECCCCCCCEE Q ss_conf 982899998358101102201074---03116985148 Q gi|254780474|r 173 DRPLEFRITASSVMNSFYIPGLAG---QIYAMAGMETK 207 (332) Q Consensus 173 ~~pV~f~ltS~DViHSF~IP~l~~---k~daiPG~~t~ 207 (332) -||+++++-+++|+=+|+...-+. -++.+||+.+. T Consensus 324 ~KpLRyn~~~~~~~pdfvL~Dt~~~~~~~~vv~~~~~~ 361 (386) T pfam06666 324 VKPLRYNLDSDQPFPDFVLTDTGPEPTALMVVPGRADE 361 (386) T ss_pred CCCCCCCCCCCCCCCCEEEECCCCCCEEEEEECCCCCH T ss_conf 66546568756654868985589986048994699998 No 177 >KOG2927 consensus Probab=24.62 E-value=44 Score=13.30 Aligned_cols=26 Identities=27% Similarity=0.355 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999999999998853046 Q gi|254780474|r 53 AVALMLLIVVPVFFSILFFAWKYRST 78 (332) Q Consensus 53 ~~~l~liV~v~V~~l~~~~~~kyR~~ 78 (332) .+++..+.++.++++.+|-+|=.+-+ T Consensus 190 ~~vl~~~fvl~tlaivLFPLWP~~mR 215 (372) T KOG2927 190 WQVLGVLFVLVTLAIVLFPLWPRRMR 215 (372) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99999999999999973135838874 No 178 >TIGR02878 spore_ypjB sporulation protein YpjB; InterPro: IPR014231 Proteins in thie entry, typified by YpjB, are restricted to a subset of the endospore-forming bacteria which includes Bacillus species, but not species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon . Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set.. Probab=24.62 E-value=44 Score=13.30 Aligned_cols=24 Identities=13% Similarity=0.458 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HCC Q ss_conf 999999999999999999885-304 Q gi|254780474|r 54 VALMLLIVVPVFFSILFFAWK-YRS 77 (332) Q Consensus 54 ~~l~liV~v~V~~l~~~~~~k-yR~ 77 (332) +|+|+.++-+.++-+-|.=|| ||+ T Consensus 202 lWvm~~tGG~I~~~LtYVGwRKYKg 226 (234) T TIGR02878 202 LWVMISTGGLIVATLTYVGWRKYKG 226 (234) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999998799999976544666688 No 179 >COG3794 PetE Plastocyanin [Energy production and conversion] Probab=24.35 E-value=45 Score=13.27 Aligned_cols=71 Identities=18% Similarity=0.178 Sum_probs=46.1 Q ss_pred EEEEEECCCEEEEEEEECCCCHHCCCHHH---CCEECCCCCCCEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEC Q ss_conf 06896369828999983581011022010---7403116985148999942774896661332657611365899970 Q gi|254780474|r 166 NELVVPIDRPLEFRITASSVMNSFYIPGL---AGQIYAMAGMETKLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGK 240 (332) Q Consensus 166 neL~lPv~~pV~f~ltS~DViHSF~IP~l---~~k~daiPG~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~ 240 (332) -++.+-+|..|+|.=+..- -|+--...- .+......+.--....+.+++|+|.-+|+==-|. .|+.+++|- T Consensus 54 A~v~v~pGDTVtw~~~d~~-~Hnv~~~~~~~~~g~~~~~~~~~~s~~~Tfe~~G~Y~Y~C~PH~~~---gM~G~IvV~ 127 (128) T COG3794 54 AEVTVKPGDTVTWVNTDSV-GHNVTAVGGMDPEGSGTLKAGINESFTHTFETPGEYTYYCTPHPGM---GMKGKIVVG 127 (128) T ss_pred CEEEECCCCEEEEEECCCC-CCEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEECCCCCC---CCEEEEEEC T ss_conf 6789789998999989988-7107973787745643323679853689943563698994437778---847999837 No 180 >TIGR01004 PulS_OutS lipoprotein, PulS/OutS family; InterPro: IPR005699 This family comprises lipoproteins from four gamma proteobacterial species: PulS protein of Klebsiella pneumoniae (P20440 from SWISSPROT), the OutS protein of Erwinia chrysanthemi (Q01567 from SWISSPROT) and Pectobacterium chrysanthemi, and the functionally uncharacterised E. coli protein EtpO. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family.; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport. Probab=24.34 E-value=45 Score=13.27 Aligned_cols=41 Identities=20% Similarity=0.126 Sum_probs=25.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCC--CC------CCCCCCHHHHHHHHHH Q ss_conf 23566889999999999874136--43------2689898999999999 Q gi|254780474|r 10 VSVGKSLKFFFLFFSSVMLSGFE--FI------VMDPYGDIALQQASLI 50 (332) Q Consensus 10 ~~~~k~l~~~~l~~~~~lLsGC~--~~------~l~P~G~ia~~~~~L~ 50 (332) .++-|+|-.++..+..+.||||. .. -..|+=|-.+|...|- T Consensus 3 ~n~lKkln~~~~~~~~v~lsgCqtPaPvqn~~~~~~a~Vp~~eQleQlA 51 (136) T TIGR01004 3 RNILKKLNVVLSLLVVVLLSGCQTPAPVQNRAIRKAAAVPASEQLEQLA 51 (136) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 1378999999999999986404789988887555578788777899999 No 181 >PRK10449 heat-inducible protein; Provisional Probab=24.22 E-value=45 Score=13.25 Aligned_cols=35 Identities=20% Similarity=0.313 Sum_probs=16.7 Q ss_pred CCCCCCCCC-CCCEEEEE-------ECC-H--HHHHHHHHHHHCCC Q ss_conf 133265761-13658999-------708-8--99999999986072 Q gi|254780474|r 222 SANYSGRGF-SHMRFKFY-------GKS-E--KGFEDWISKVKRQG 256 (332) Q Consensus 222 cae~cG~gh-s~M~f~v~-------v~s-~--~~f~~Wv~~~ka~~ 256 (332) |..|.|.+. ++..+++. +-. | .++|..+.++=..+ T Consensus 67 CN~f~G~~~l~~~~L~v~~lasT~m~C~dp~l~~~E~~i~~~L~~~ 112 (140) T PRK10449 67 CNRFSGEGKLSDGELTVKGLAMTRMMCADPQLNELDNTISEMLKKG 112 (140) T ss_pred CCCEEEEEEECCCEEEECHHHCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 4650603898598799830110054689868999999999996247 No 182 >pfam02430 AMA-1 Apical membrane antigen 1. Apical membrane antigen 1 (AMA-1) is a Plasmodium asexual blood-stage antigen. It has been suggested that positive selection operates on the AMA-1 gene in regions coding for antigenic sites. Probab=23.85 E-value=38 Score=13.73 Aligned_cols=32 Identities=9% Similarity=0.113 Sum_probs=13.6 Q ss_pred EECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHCCCC Q ss_conf 66133265761136589997088999999999860724 Q gi|254780474|r 220 GFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVKRQGT 257 (332) Q Consensus 220 G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~ 257 (332) |.|.++- =.|...+.+-++-.+=+=+.-|+.+ T Consensus 209 G~C~~i~------~~~ee~a~s~~eC~~ivFe~sASDq 240 (462) T pfam02430 209 GNCVDIP------PVFEEYANDLEECNKIVFENSASDQ 240 (462) T ss_pred CCEEECC------HHHHHHCCCHHHHHHHHHCCCCCCC T ss_conf 9577773------5655532459999999734565558 No 183 >cd01245 PH_RasGAP_CG5898 RAS GTPase-activating protein (GAP) CG5898 Pleckstrin homology (PH) domain. This protein has a domain architecture of SH2-SH3-SH2-PH-C2-Ras_GAP. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes. Probab=23.81 E-value=46 Score=13.20 Aligned_cols=16 Identities=25% Similarity=0.613 Sum_probs=13.2 Q ss_pred CCHHHHHHHHHHHHCC Q ss_conf 0889999999998607 Q gi|254780474|r 240 KSEKGFEDWISKVKRQ 255 (332) Q Consensus 240 ~s~~~f~~Wv~~~ka~ 255 (332) -+.++.|+|++..|+. T Consensus 83 ~s~~er~~Wi~~Lr~~ 98 (98) T cd01245 83 RSSEERDKWIESLQAQ 98 (98) T ss_pred CCHHHHHHHHHHHHCC T ss_conf 9999999999998609 No 184 >pfam10066 DUF2304 Uncharacterized conserved protein (DUF2304). Members of this family of hypothetical archaeal proteins have no known function. Probab=23.62 E-value=46 Score=13.17 Aligned_cols=11 Identities=0% Similarity=0.075 Sum_probs=3.8 Q ss_pred HHHHHHHHHHH Q ss_conf 99999887751 Q gi|254780474|r 110 AVITWDATHRM 120 (332) Q Consensus 110 ~~~t~~~~~~l 120 (332) .+.+++.+.++ T Consensus 78 ~~~~f~l~~~i 88 (115) T pfam10066 78 FYLLFKLYLKI 88 (115) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 185 >PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional Probab=23.13 E-value=47 Score=13.11 Aligned_cols=14 Identities=29% Similarity=0.517 Sum_probs=9.3 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999998741364 Q gi|254780474|r 20 FLFFSSVMLSGFEF 33 (332) Q Consensus 20 ~l~~~~~lLsGC~~ 33 (332) ++++++++|+||.. T Consensus 5 ~~l~~~~ll~gcq~ 18 (449) T PRK10783 5 AILLASVLLVGCQS 18 (449) T ss_pred HHHHHHHHHHHCCC T ss_conf 99999999972557 No 186 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=23.00 E-value=47 Score=13.09 Aligned_cols=16 Identities=19% Similarity=0.528 Sum_probs=10.5 Q ss_pred HHHHHHHHHHH--HCCCC Q ss_conf 99999999987--41364 Q gi|254780474|r 18 FFFLFFSSVML--SGFEF 33 (332) Q Consensus 18 ~~~l~~~~~lL--sGC~~ 33 (332) |++++++.++| +||.. T Consensus 2 l~~~ll~~~~l~~tgC~~ 19 (185) T TIGR02898 2 LFIILLLLLLLLATGCTN 19 (185) T ss_pred HHHHHHHHHHHHHHCCCC T ss_conf 688999999998731341 No 187 >COG1585 Membrane protein implicated in regulation of membrane protease activity [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion] Probab=22.91 E-value=47 Score=13.08 Aligned_cols=36 Identities=14% Similarity=0.173 Sum_probs=25.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999999999999999999998853046777 Q gi|254780474|r 46 QASLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKK 81 (332) Q Consensus 46 ~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~ 81 (332) ...+...++.+-++.+.+..++.++..+||+++..+ T Consensus 39 ~~~l~~~~~~~q~v~f~~lsv~~~~l~rr~~~~~~~ 74 (140) T COG1585 39 LALLLLLSWWLQLVLFAILSVLLALLGRRFVRRRLK 74 (140) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 888725249999999999999999999998765037 No 188 >PRK12699 flgH flagellar basal body L-ring protein; Reviewed Probab=22.84 E-value=48 Score=13.07 Aligned_cols=16 Identities=13% Similarity=0.177 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 9999999987413643 Q gi|254780474|r 19 FFLFFSSVMLSGFEFI 34 (332) Q Consensus 19 ~~l~~~~~lLsGC~~~ 34 (332) .+++...++|+||... T Consensus 21 ~~~~~~~~~l~gC~~~ 36 (246) T PRK12699 21 PVLIVMLALVGGCSLP 36 (246) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 8999999986044689 No 189 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=22.26 E-value=49 Score=12.99 Aligned_cols=23 Identities=13% Similarity=0.018 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 56688999999999987413643 Q gi|254780474|r 12 VGKSLKFFFLFFSSVMLSGFEFI 34 (332) Q Consensus 12 ~~k~l~~~~l~~~~~lLsGC~~~ 34 (332) ++|.+-.+++..+.+.|.||+.. T Consensus 2 mkk~~~~~~~a~g~~~l~GC~~r 24 (234) T PRK10523 2 MKKAILTALAAVGLFALMGCNNR 24 (234) T ss_pred HHHHHHHHHHHHHHHHHHCCCCC T ss_conf 05899999999999987634665 No 190 >pfam02411 MerT MerT mercuric transport protein. MerT is an mercuric transport integral membrane protein and is responsible for transport of the Hg2+ iron from periplasmic MerP (also part of the transport system) to mercuric reductase (MerE). Probab=22.14 E-value=49 Score=12.98 Aligned_cols=24 Identities=17% Similarity=0.246 Sum_probs=17.1 Q ss_pred CEEEEEEEHHHHHHHHHHHHHHHH Q ss_conf 612553022789999999999998 Q gi|254780474|r 92 TLLELFVWLVPLVIVGFLAVITWD 115 (332) Q Consensus 92 ~~lEi~wt~iP~iIi~~L~~~t~~ 115 (332) ...-++.|++-+++++.++++-|. T Consensus 89 ~~~k~~~Wivt~~vl~~l~fPy~~ 112 (116) T pfam02411 89 KTYKVIFWLVALLALVAVTFPYVI 112 (116) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 378889999999999999737668 No 191 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=22.06 E-value=49 Score=12.97 Aligned_cols=20 Identities=20% Similarity=0.135 Sum_probs=13.1 Q ss_pred HHHHHH-HHHHHHHHHHHCCC Q ss_conf 668899-99999999874136 Q gi|254780474|r 13 GKSLKF-FFLFFSSVMLSGFE 32 (332) Q Consensus 13 ~k~l~~-~~l~~~~~lLsGC~ 32 (332) +|++-+ .+.++++++|+||. T Consensus 2 KKKl~~~~~~~~svl~LaaCs 22 (310) T PRK01326 2 KKKLIAGAVTLLSVATLAACS 22 (310) T ss_pred CHHHHHHHHHHHHHHHHHHHC T ss_conf 088999999999999998716 No 192 >KOG4789 consensus Probab=22.01 E-value=49 Score=12.96 Aligned_cols=22 Identities=9% Similarity=0.107 Sum_probs=11.7 Q ss_pred EEEECCCEEEEEEEECCCCHHC Q ss_conf 8963698289999835810110 Q gi|254780474|r 168 LVVPIDRPLEFRITASSVMNSF 189 (332) Q Consensus 168 L~lPv~~pV~f~ltS~DViHSF 189 (332) +.=|..|.=|.++|+-.-+|++ T Consensus 602 ~~sptsKrkRVkfttf~t~p~~ 623 (668) T KOG4789 602 ANSPTSKRKRVKFTTFTTIPPE 623 (668) T ss_pred CCCCCCCCCCEEEEEECCCCHH T ss_conf 7688877663588860367814 No 193 >PRK13751 putative mercuric transport protein; Provisional Probab=22.00 E-value=49 Score=12.96 Aligned_cols=29 Identities=17% Similarity=0.240 Sum_probs=17.8 Q ss_pred CCCCCCCCEEEEEEEHHHHHHHHHHHHHHHH Q ss_conf 8753456612553022789999999999998 Q gi|254780474|r 85 DPKWCHSTLLELFVWLVPLVIVGFLAVITWD 115 (332) Q Consensus 85 ~p~~~hs~~lEi~wt~iP~iIi~~L~~~t~~ 115 (332) .|+...+. .++.|++.+++++.++++-|. T Consensus 84 ~P~~~~~~--k~~~Wivtvlv~~al~fPy~a 112 (116) T PRK13751 84 IPQVRATY--KLIFWIVAALVLVALGFPYVM 112 (116) T ss_pred CCCHHHHH--HHHHHHHHHHHHHHHHHHHHH T ss_conf 85066799--999999999999999828778 No 194 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=21.84 E-value=50 Score=12.94 Aligned_cols=14 Identities=21% Similarity=0.470 Sum_probs=8.1 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 99999987413643 Q gi|254780474|r 21 LFFSSVMLSGFEFI 34 (332) Q Consensus 21 l~~~~~lLsGC~~~ 34 (332) +++..+.|+||... T Consensus 7 l~~~~l~LagCas~ 20 (26) T pfam08139 7 LLLALLLLAGCASX 20 (26) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999998233200 No 195 >PRK00447 hypothetical protein; Provisional Probab=21.80 E-value=50 Score=12.93 Aligned_cols=55 Identities=16% Similarity=0.328 Sum_probs=39.2 Q ss_pred EEEEEEECCCEEEEEECCCCCCCCC----CCCEEEEEECCHHHHHHHHHHHHCCCCCCCH Q ss_conf 4899994277489666133265761----1365899970889999999998607245698 Q gi|254780474|r 206 TKLHAVMNKKGAYSGFSANYSGRGF----SHMRFKFYGKSEKGFEDWISKVKRQGTFLNR 261 (332) Q Consensus 206 t~l~~~a~~~G~y~G~cae~cG~gh----s~M~f~v~v~s~~~f~~Wv~~~ka~~~~l~~ 261 (332) +..||..+|...|.|..| ||=..+ ..-++.+....+++|-.|++-....|.++.+ T Consensus 83 ~~t~F~lnKQAA~~gkvs-f~~~~~espLG~I~v~i~~~~ie~~idwlaP~T~~G~pv~e 141 (146) T PRK00447 83 DEITIMVHKQAAYVGKLS-FVDSEEESPLGPITITIETKDPQELIDWLAPRTKDGVPLWE 141 (146) T ss_pred CEEEEEEEHHHHHCCEEE-ECCCCCCCCCCCEEEEEECCCHHHHHHHHCCCCCCCEECCC T ss_conf 989999971231264355-03788878888889999739989988750687879838477 No 196 >PHA01080 hypothetical protein Probab=21.76 E-value=50 Score=12.93 Aligned_cols=24 Identities=8% Similarity=0.036 Sum_probs=16.7 Q ss_pred EECCHHHHHHHHHHHHCCCCCCCH Q ss_conf 970889999999998607245698 Q gi|254780474|r 238 YGKSEKGFEDWISKVKRQGTFLNR 261 (332) Q Consensus 238 ~v~s~~~f~~Wv~~~ka~~~~l~~ 261 (332) +.+++.|||.|++..+-.+....+ T Consensus 28 v~Vt~~eY~~~~~~~~iTpt~~~q 51 (81) T PHA01080 28 VAVTIQDYDYLMSYTRITPTDAGQ 51 (81) T ss_pred EEEEHHHHHHHHHHHCCCHHHHHH T ss_conf 997378988899985757068887 No 197 >KOG3325 consensus Probab=21.71 E-value=50 Score=12.92 Aligned_cols=82 Identities=21% Similarity=0.360 Sum_probs=52.7 Q ss_pred EEEEEEEEEEEEEECCCCCCCCEE---------EEEECCCE-----------EEEEEEECCCCHHCCCHHHCCEECCCCC Q ss_conf 999998436999962678774306---------89636982-----------8999983581011022010740311698 Q gi|254780474|r 144 VEVVALDWKWLFLLPEQQVAIINE---------LVVPIDRP-----------LEFRITASSVMNSFYIPGLAGQIYAMAG 203 (332) Q Consensus 144 I~V~a~qW~W~F~YP~~giatvne---------L~lPv~~p-----------V~f~ltS~DViHSF~IP~l~~k~daiPG 203 (332) |..+--.+-=.-.||++++.|+.+ .++|-|-| |...+|.. -|.|--=++.|+-+..|| T Consensus 56 vhiVrGeFD~~~~yP~~kvvtvGqfkIG~chGhqViP~gd~~sL~~LaRqldvDILl~G~--Th~f~Aye~eg~ffvnPG 133 (183) T KOG3325 56 VHIVRGEFDENLKYPENKVVTVGQFKIGLCHGHQVIPWGDPESLALLARQLDVDILLTGH--THKFEAYEHEGKFFVNPG 133 (183) T ss_pred CEEEECCCCCCCCCCCCCEEEECCEEEEEECCCEEECCCCHHHHHHHHHHCCCCEEEECC--CEEEEEEEECCCEEECCC T ss_conf 279852457545587643588541788765475762389989999999754975999678--135899984780996787 Q ss_pred CCEEEEEEECCCEEEEEECCCCCCCCCCCCEEE Q ss_conf 514899994277489666133265761136589 Q gi|254780474|r 204 METKLHAVMNKKGAYSGFSANYSGRGFSHMRFK 236 (332) Q Consensus 204 ~~t~l~~~a~~~G~y~G~cae~cG~ghs~M~f~ 236 (332) -.| |.|--..+|---.+|..|... T Consensus 134 SaT---------GAfn~~~t~~~~PSFvLmDiq 157 (183) T KOG3325 134 SAT---------GAFNVSDTDIIVPSFVLMDIQ 157 (183) T ss_pred CCC---------CCCCCCCCCCCCCCEEEEEEC T ss_conf 555---------777632357777746999712 No 198 >KOG1094 consensus Probab=21.71 E-value=50 Score=12.92 Aligned_cols=38 Identities=21% Similarity=0.313 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEH Q ss_conf 99999999999988530467777778753456612553022 Q gi|254780474|r 60 IVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWL 100 (332) Q Consensus 60 V~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~ 100 (332) ++..|...+...+||.|+-..+|.- ..-|.+.+++.++ T Consensus 400 if~iva~ii~~~L~R~rr~~~ka~s---~~~n~k~~v~lti 437 (807) T KOG1094 400 IFLIVALIIALMLWRWRRLLSKASS---RVLNEKLTVHLTV 437 (807) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC---CCCCCCEEEEEEC T ss_conf 9999999999999999998755236---7756400589831 No 199 >TIGR02683 upstrm_HI1419 probable addiction module killer protein; InterPro: IPR014056 Members of this strictly bacterial protein family are small, at roughly 100 amino acids. The gene is almost invariably the upstream member of a gene pair, where the downstream member is a predicted DNA-binding protein from a clade within Pfam helix-turn-helix family IPR001387 from INTERPRO. These gene pairs, when found on the bacterial chromosome, often are located with prophage regions, but also in both integrated plasmid regions and near housekeeping genes. Analysis suggests that the gene pair may serve as an addiction module.. Probab=21.63 E-value=33 Score=14.20 Aligned_cols=22 Identities=18% Similarity=0.336 Sum_probs=15.7 Q ss_pred EECCHHHHHHHHHHHHCCCCCC Q ss_conf 9708899999999986072456 Q gi|254780474|r 238 YGKSEKGFEDWISKVKRQGTFL 259 (332) Q Consensus 238 ~v~s~~~f~~Wv~~~ka~~~~l 259 (332) .+...+.|++|+++.|-..... T Consensus 2 ~~~~t~~F~~Wl~~LKD~~Aka 23 (95) T TIGR02683 2 TIKRTDTFDKWLDGLKDPRAKA 23 (95) T ss_pred CCCCCHHHHHHHHHCCCHHHHH T ss_conf 4565324789877316888999 No 200 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=21.60 E-value=50 Score=12.90 Aligned_cols=33 Identities=18% Similarity=0.081 Sum_probs=16.8 Q ss_pred HHHHHHHHHHHHHHHHCCCC--CC-CCCCCHHHHHH Q ss_conf 68899999999998741364--32-68989899999 Q gi|254780474|r 14 KSLKFFFLFFSSVMLSGFEF--IV-MDPYGDIALQQ 46 (332) Q Consensus 14 k~l~~~~l~~~~~lLsGC~~--~~-l~P~G~ia~~~ 46 (332) |.+-..+++++++.|++|.. .+ -.-.|.|.+++ T Consensus 5 k~~~~~~~~~~~l~LaaC~s~~~vat~kgg~IT~~e 40 (283) T PRK02998 5 KLFIGTIISCVVLALSACGSSDNVVTSKVGNITEKE 40 (283) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHH T ss_conf 999999999999999866899817984399283999 No 201 >pfam01152 Bac_globin Bacterial-like globin. This family of heme binding proteins are found mainly in bacteria. However they can also be found in some protozoa and plants as well. Probab=21.60 E-value=50 Score=12.90 Aligned_cols=25 Identities=8% Similarity=0.234 Sum_probs=17.9 Q ss_pred CCCCCEEEEEECCHHHHHHHHHHHHCCCCC Q ss_conf 611365899970889999999998607245 Q gi|254780474|r 229 GFSHMRFKFYGKSEKGFEDWISKVKRQGTF 258 (332) Q Consensus 229 ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~~ 258 (332) -|+.| -.++++||+|++-.+++=.. T Consensus 71 ~H~~~-----~I~~~~fd~~l~~~~~al~e 95 (120) T pfam01152 71 RHAPF-----PITNAEFDAWLRHLADALAD 95 (120) T ss_pred HHHCC-----CCCHHHHHHHHHHHHHHHHH T ss_conf 87289-----99999999999999999988 No 202 >pfam08490 DUF1744 Domain of unknown function (DUF1744). This domain is found on the epsilon catalytic subunit of DNA polymerase. It is found C terminal to pfam03104 and pfam00136. Probab=21.58 E-value=43 Score=13.39 Aligned_cols=15 Identities=27% Similarity=0.503 Sum_probs=9.1 Q ss_pred EECCCEEEEEECCCC Q ss_conf 942774896661332 Q gi|254780474|r 211 VMNKKGAYSGFSANY 225 (332) Q Consensus 211 ~a~~~G~y~G~cae~ 225 (332) ..|.||.|+-.|-|+ T Consensus 194 ~in~pG~Y~svCvEl 208 (396) T pfam08490 194 VINNPGFYDTVCVEL 208 (396) T ss_pred EEECCCCCEEEEEEE T ss_conf 574587302689999 No 203 >pfam01235 Na_Ala_symp Sodium:alanine symporter family. Probab=21.54 E-value=50 Score=12.89 Aligned_cols=30 Identities=17% Similarity=0.219 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 999999999999999998853046777777 Q gi|254780474|r 55 ALMLLIVVPVFFSILFFAWKYRSTNKKARY 84 (332) Q Consensus 55 ~l~liV~v~V~~l~~~~~~kyR~~~~~a~~ 84 (332) |++.++...+-..-.+.+.|||.++++.+| T Consensus 55 Wv~al~Gmatkf~E~~La~~yR~~~~~G~~ 84 (415) T pfam01235 55 WVTAFIGMATAFVESTLAQLYKERDKDGNF 84 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHEEECCCCCC T ss_conf 999999999999999989885067899998 No 204 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=21.47 E-value=51 Score=12.89 Aligned_cols=22 Identities=9% Similarity=0.102 Sum_probs=15.5 Q ss_pred CCEEEEEECCHHHHHHHHHHHH Q ss_conf 3658999708899999999986 Q gi|254780474|r 232 HMRFKFYGKSEKGFEDWISKVK 253 (332) Q Consensus 232 ~M~f~v~v~s~~~f~~Wv~~~k 253 (332) .|+..=..++|++|+.=++... T Consensus 114 ~l~l~~~~v~p~~~~~~l~~l~ 135 (177) T pfam07273 114 DLRLVDSSVDPEKFKGELKPLL 135 (177) T ss_pred EEEEEEEECCHHHHHHHHHHHH T ss_conf 6688872149799899999989 No 205 >TIGR02023 BchP-ChlP geranylgeranyl reductase; InterPro: IPR010253 This entry represents a group of geranylgeranyl reductases specific for the biosyntheses of bacteriochlorophyll and chlorophyll . It is unclear whether the processes of isoprenoid ligation to the chlorin ring and reduction of the geranylgeranyl chain to a phytyl chain are necessarily ordered the same way in all species (see introduction to ).; GO: 0045550 geranylgeranyl reductase activity, 0015979 photosynthesis, 0015995 chlorophyll biosynthetic process. Probab=21.37 E-value=51 Score=12.87 Aligned_cols=40 Identities=15% Similarity=0.369 Sum_probs=31.1 Q ss_pred CCHHHHHHHHHH-HHCCCCCCCHHHHHHHHHHHH----CCCCEEE Q ss_conf 088999999999-860724569899999761021----2632172 Q gi|254780474|r 240 KSEKGFEDWISK-VKRQGTFLNRQEYLLLKKPSE----RDSVRYF 279 (332) Q Consensus 240 ~s~~~f~~Wv~~-~ka~~~~l~~~~y~~l~~ps~----~~~~~~f 279 (332) |.-|-||.|+.+ +..+|..+=...|..|..++. ...++|+ T Consensus 93 VrREvFD~~LReRA~kaGAe~~~g~f~~~~~d~~GWds~~~~~~~ 137 (408) T TIGR02023 93 VRREVFDEYLRERAQKAGAELIEGLFKKLERDEDGWDSRVTLQYR 137 (408) T ss_pred EEHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE T ss_conf 551441578999987621476656755201567777552589987 No 206 >pfam09351 DUF1993 Domain of unknown function (DUF1993). This family of proteins are functionally uncharacterized. Probab=21.18 E-value=50 Score=12.93 Aligned_cols=30 Identities=23% Similarity=0.473 Sum_probs=23.8 Q ss_pred CEEEEEECCCEEEEEEEECCCCHHCCCHHHC Q ss_conf 3068963698289999835810110220107 Q gi|254780474|r 165 INELVVPIDRPLEFRITASSVMNSFYIPGLA 195 (332) Q Consensus 165 vneL~lPv~~pV~f~ltS~DViHSF~IP~l~ 195 (332) ..++.+|.+ ...+.+|+.|-+++|=+|.|- T Consensus 109 ~~~v~~~~~-~~~~~~tg~~yl~~falPNfy 138 (162) T pfam09351 109 DRDITLPLG-GKEFDFTGEAYLLSFALPNFY 138 (162) T ss_pred CCEEEECCC-CEEEEECHHHHHHHHCCCHHH T ss_conf 866883179-803661099999980153489 No 207 >pfam00767 Poty_coat Potyvirus coat protein. Probab=21.14 E-value=51 Score=12.84 Aligned_cols=24 Identities=17% Similarity=0.441 Sum_probs=13.4 Q ss_pred CCHHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 0889999999998607245698999 Q gi|254780474|r 240 KSEKGFEDWISKVKRQGTFLNRQEY 264 (332) Q Consensus 240 ~s~~~f~~Wv~~~ka~~~~l~~~~y 264 (332) -++++|++|.+.+|.+=.. |.+.+ T Consensus 56 AT~~Qf~~W~e~V~~~y~v-~d~~m 79 (237) T pfam00767 56 ATQEQFNTWYEGVKEEYDV-TDEQM 79 (237) T ss_pred HHHHHHHHHHHHHHHHHCC-CHHHH T ss_conf 7899999999999998598-99999 No 208 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=20.94 E-value=33 Score=14.21 Aligned_cols=17 Identities=24% Similarity=0.362 Sum_probs=9.4 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999998741364 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEF 33 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~ 33 (332) .+..+++..++|++|.. T Consensus 5 ~~~~i~~~~~~L~aCQa 21 (46) T pfam02402 5 LFIGILLLTVLLSACQA 21 (46) T ss_pred EEHHHHHHHHHHHHHHH T ss_conf 01399999999998555 No 209 >KOG4390 consensus Probab=20.68 E-value=50 Score=12.94 Aligned_cols=10 Identities=20% Similarity=-0.094 Sum_probs=3.5 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780474|r 57 MLLIVVPVFF 66 (332) Q Consensus 57 ~liV~v~V~~ 66 (332) +.+|+--|.+ T Consensus 183 ~ALVFYYVtG 192 (632) T KOG4390 183 AALVFYYVTG 192 (632) T ss_pred HHHHHHHHHH T ss_conf 0134654533 No 210 >pfam11344 DUF3146 Protein of unknown function (DUF3146). This family of proteins with unknown function appear to be restricted to Cyanobacteria. Probab=20.65 E-value=52 Score=12.77 Aligned_cols=45 Identities=22% Similarity=0.293 Sum_probs=27.7 Q ss_pred CCCCEEEEEEEEEEEEEEEEC----------CCCCCCC----------EEEEEECCCEEEEEEEE Q ss_conf 888668999998436999962----------6787743----------06896369828999983 Q gi|254780474|r 138 NSKPLVVEVVALDWKWLFLLP----------EQQVAII----------NELVVPIDRPLEFRITA 182 (332) Q Consensus 138 ~~~~l~I~V~a~qW~W~F~YP----------~~giatv----------neL~lPv~~pV~f~ltS 182 (332) .++-++=+|.|-||+|.|..- ..|.|-+ ..-.+-.|-.-.|.|++ T Consensus 15 ~~~~i~G~V~A~~f~W~F~W~F~~G~L~V~PsLGRALI~d~L~RFL~k~DY~LE~GgdY~Ftira 79 (80) T pfam11344 15 SEGCLEGEVSAGEFEWTFRWHFRRGELLVEPSLGRALIKDALLRFLEKSDYQLEPGGDYSFTVRA 79 (80) T ss_pred CCCEEEEEEECCEEEEEEEEEECCCEEEECCCCCHHHHHHHHHHHHHHCCEEECCCCCEEEEEEC T ss_conf 67808899983204899999976982998686027888879997886514060589947899965 No 211 >PRK10672 rare lipoprotein A; Provisional Probab=20.55 E-value=53 Score=12.76 Aligned_cols=17 Identities=6% Similarity=0.169 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999998741364 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEF 33 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~ 33 (332) -|+.+..++++|+||.. T Consensus 4 ~wl~~~~~~~ll~~c~~ 20 (369) T PRK10672 4 QWLGICIAAGLLAACTS 20 (369) T ss_pred HHHHHHHHHHHHHHCCC T ss_conf 35899999999865258 No 212 >TIGR02843 CyoB cytochrome o ubiquinol oxidase, subunit I; InterPro: IPR014207 This entry represents subunit 1 (CyoB) of the cytochrome o terminal oxidase complex, which is a component of the aerobic respiratory chain that reacts with oxygen, reducing it to water with the concomitant transport of 4 protons across the membrane. Also known as the cytochrome bo complex, cytochrome o ubiquinol oxidase contains four subunits, two heme b cofactors and a copper atom which is believed to be the oxygen active site. This complex is structurally related to the cytochrome caa3 oxidases which utilise cytochrome c as the reductant and contain heme a cofactors, as well as the intermediate form aa3 oxidases which also react directly with quinones as the reductant.. Probab=20.54 E-value=53 Score=12.75 Aligned_cols=48 Identities=23% Similarity=0.291 Sum_probs=26.6 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 9999999999-9999999999999885304677777787534566125530 Q gi|254780474|r 49 LIRIAVALML-LIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFV 98 (332) Q Consensus 49 L~~~~~~l~l-iV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~w 98 (332) |++.+++..+ .+++....+-++...|.|+++.+..=+ .|. .+.||... T Consensus 494 l~~a~~Ga~lI~~Gi~~~~~q~~vSi~~R~~~~d~~GD-~W~-gRTLEWs~ 542 (646) T TIGR02843 494 LIVAAIGAVLIAIGILCQLLQIFVSIRDRDQLRDTTGD-PWN-GRTLEWST 542 (646) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CCC-CEEECCCC T ss_conf 99999999999999999999999888723551024688-877-62312346 No 213 >pfam11368 DUF3169 Protein of unknown function (DUF3169). Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function. Probab=20.50 E-value=53 Score=12.75 Aligned_cols=32 Identities=9% Similarity=0.290 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999999999999999988530 Q gi|254780474|r 45 QQASLIRIAVALMLLIVVPVFFSILFFAWKYR 76 (332) Q Consensus 45 ~~~~L~~~~~~l~liV~v~V~~l~~~~~~kyR 76 (332) ...+......+++.+++++...+++++..+.+ T Consensus 42 ~~~~~v~~i~~i~~liiiil~~~~~~~~~~a~ 73 (248) T pfam11368 42 STIDNVQLIIILARIIIIILTVLLFIFLYQAL 73 (248) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 45643999999999999999999999999999 No 214 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=20.45 E-value=53 Score=12.74 Aligned_cols=17 Identities=24% Similarity=0.303 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999998741364 Q gi|254780474|r 17 KFFFLFFSSVMLSGFEF 33 (332) Q Consensus 17 ~~~~l~~~~~lLsGC~~ 33 (332) +++++++++++|+||.. T Consensus 4 ~~l~~~~~~~~L~GC~~ 20 (238) T PRK12696 4 KLLAASCAVLLLSGCNA 20 (238) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999516568 No 215 >TIGR00979 fumC_II fumarate hydratase, class II; InterPro: IPR005677 Fumarase C catalyzes the stereospecific interconversion of fumarate to L-malate as part of the metabolic citric acid or Kreb's cycle. The recent three-dimensional structure of fumarase C from Escherichia coli has identified a binding site for anions which is generated by side chains from three of the four subunits within the tetramer. These same side chains are found in the three most highly conserved regions within the class II fumarate hydratase family. Putative fumarases from several species (Mycobacterium tuberculosis, Streptomyces coelicolor, Pseudomonas aeruginosa) branch deeply, although within the same branch of a phylogenetic tree rooted by aspartate ammonia-lyase sequences.; GO: 0004333 fumarate hydratase activity, 0006106 fumarate metabolic process, 0045239 tricarboxylic acid cycle enzyme complex. Probab=20.41 E-value=21 Score=15.57 Aligned_cols=18 Identities=11% Similarity=0.078 Sum_probs=7.0 Q ss_pred CCCCCCCCCCCHHHCCCC Q ss_conf 524448856602102466 Q gi|254780474|r 293 LCVHPGKICMDEMMRIDA 310 (332) Q Consensus 293 ~~~~~~~~~~~~~m~~~~ 310 (332) |+.+-||+|+=-.+.+.. T Consensus 294 R~LgSGPRCGlGEl~~PE 311 (459) T TIGR00979 294 RWLGSGPRCGLGELFLPE 311 (459) T ss_pred HHHHCCCCCCCCCCCCCC T ss_conf 785078864200256777 No 216 >pfam03767 Acid_phosphat_B HAD superfamily, subfamily IIIB (Acid phosphatase). This family proteins includes acid phosphatases and a number of vegetative storage proteins. Probab=20.39 E-value=53 Score=12.73 Aligned_cols=53 Identities=13% Similarity=0.111 Sum_probs=28.9 Q ss_pred EEECCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHHHHHH Q ss_conf 9970889999999998607245698999997610212632172047835426873 Q gi|254780474|r 237 FYGKSEKGFEDWISKVKRQGTFLNRQEYLLLKKPSERDSVRYFSPIEKGLYYSIL 291 (332) Q Consensus 237 v~v~s~~~f~~Wv~~~ka~~~~l~~~~y~~l~~ps~~~~~~~f~~~~~~lf~~i~ 291 (332) ...-+++.|++|+.+.++.+-+-...-|+.+.+-+. .+-|-+.=...+....+ T Consensus 102 ~~~~~~~~~~~wv~~~~apaipg~l~l~~~~~~~Gv--kif~lT~R~e~~r~~T~ 154 (230) T pfam03767 102 GEPFDPEKFDEWVNKGEAPALPGALELYNYLVELGV--KIFFVSGRSEDLRAATV 154 (230) T ss_pred CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCC--EEEEECCCCHHHHHHHH T ss_conf 875780658999961768677359999999997792--79997287378889879 No 217 >PRK10477 outer membrane lipoprotein Blc; Provisional Probab=20.32 E-value=48 Score=13.05 Aligned_cols=52 Identities=8% Similarity=0.024 Sum_probs=32.8 Q ss_pred EEEEEECCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHCCCCCCC Q ss_conf 899994277489666133265761136589997088999999999860724569 Q gi|254780474|r 207 KLHAVMNKKGAYSGFSANYSGRGFSHMRFKFYGKSEKGFEDWISKVKRQGTFLN 260 (332) Q Consensus 207 ~l~~~a~~~G~y~G~cae~cG~ghs~M~f~v~v~s~~~f~~Wv~~~ka~~~~l~ 260 (332) +.-+..|..-.|-..|. +...+...--+-..++++.+++.++.+|+.+-..+ T Consensus 116 YwIl~lD~dY~~aiVg~--p~r~~lWILSR~p~l~~~~~~~ll~~a~~~GyD~~ 167 (177) T PRK10477 116 YNVIALDREYRHALVCG--PDRDYLWILSRTPTISDEVKQQMLAVATREGFDVS 167 (177) T ss_pred EEEEEECCCCCEEEEEE--CCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCHH T ss_conf 48999868998999995--69977999976899999999999999998299779 No 218 >cd00260 Sialidase Sialidases or neuraminidases function to bind and hydrolyze terminal sialic acid residues from various glycoconjugates as well as playing roles in pathogenesis, bacterial nutrition and cellular interactions. They have a six-bladed, beta-propeller fold with the non-viral sialidases containing 2-5 Asp-box motifs (most commonly Ser/Thr-X-Asp-[X]-Gly-X-Thr- Trp/Phe). This CD includes eubacterial, eukaryotic, and viral sialidases. Probab=20.26 E-value=53 Score=12.71 Aligned_cols=99 Identities=18% Similarity=0.182 Sum_probs=54.7 Q ss_pred EEEEEEEEEEEEEEEECCCCCCCC-EEEEEECC------CEEEEEEEECCCCHHCCCHHHCCEECCCCCCCEEEEEEECC Q ss_conf 689999984369999626787743-06896369------82899998358101102201074031169851489999427 Q gi|254780474|r 142 LVVEVVALDWKWLFLLPEQQVAII-NELVVPID------RPLEFRITASSVMNSFYIPGLAGQIYAMAGMETKLHAVMNK 214 (332) Q Consensus 142 l~I~V~a~qW~W~F~YP~~giatv-neL~lPv~------~pV~f~ltS~DViHSF~IP~l~~k~daiPG~~t~l~~~a~~ 214 (332) ++-++....|+|.|.-|..||..- +.|++|+. ......|.|+|==.++-..+... +. ...++-.+.--. T Consensus 132 it~~~~~~~~~~~~~g~G~GI~l~~GrLv~p~~~~~~~~~~~~~~iySdD~G~tW~~g~~~~--~~--~~~~E~~ivEl~ 207 (351) T cd00260 132 LTPSVKGDNWAALFTGPGSGIQMKDGRLVFPVYGGNAGGRVSSAIIYSDDSGKTWKLGEGVN--DA--GGCSECSVVELS 207 (351) T ss_pred CCCCCCCCCCCEEEECCCCEEEEECCCEEEEEEEECCCCCEEEEEEEECCCCCCEEECCCCC--CC--CCCCCCEEEEEC T ss_conf 56303678641353269745997089789999998579969999999899999829898789--98--898778899973 Q ss_pred CEEEEEECCC-CCCCCCCCCEEEEEECCHHHHHHHHHH Q ss_conf 7489666133-265761136589997088999999999 Q gi|254780474|r 215 KGAYSGFSAN-YSGRGFSHMRFKFYGKSEKGFEDWISK 251 (332) Q Consensus 215 ~G~y~G~cae-~cG~ghs~M~f~v~v~s~~~f~~Wv~~ 251 (332) .|.-.-.+-+ .|| -+.++.|.+.=+.|-+. T Consensus 208 dG~l~~~~R~~~~~-------~R~v~~S~DgG~TW~~~ 238 (351) T cd00260 208 DGKLYMYTRDNSGG-------RRPVYESRDMGTTWTEA 238 (351) T ss_pred CCEEEEEEECCCCC-------CEEEEEECCCCCCCCCC T ss_conf 99799999707999-------79999986898576646 No 219 >PRK06473 NAD(P)H-quinone oxidoreductase subunit D4; Validated Probab=20.21 E-value=54 Score=12.71 Aligned_cols=19 Identities=11% Similarity=0.032 Sum_probs=9.9 Q ss_pred CCHHCCCHHHCCEECCCCC Q ss_conf 1011022010740311698 Q gi|254780474|r 185 VMNSFYIPGLAGQIYAMAG 203 (332) Q Consensus 185 ViHSF~IP~l~~k~daiPG 203 (332) ++-....=.++.|.=..|= T Consensus 206 ~~~~~~l~gf~~K~~~~P~ 224 (500) T PRK06473 206 LLLAPILVGFGIKIPLVPF 224 (500) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 9999999999986213121 No 220 >pfam11444 DUF2895 Protein of unknown function (DUF2895). This is a bacterial family of uncharacterized proteins. Probab=20.11 E-value=54 Score=12.69 Aligned_cols=69 Identities=12% Similarity=0.071 Sum_probs=36.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999885304677777787534566125530227899999999999988775 Q gi|254780474|r 48 SLIRIAVALMLLIVVPVFFSILFFAWKYRSTNKKARYDPKWCHSTLLELFVWLVPLVIVGFLAVITWDATHR 119 (332) Q Consensus 48 ~L~~~~~~l~liV~v~V~~l~~~~~~kyR~~~~~a~~~p~~~hs~~lEi~wt~iP~iIi~~L~~~t~~~~~~ 119 (332) +-++.+..+.+.+.+.+.+++.+..++...+= .--.+|+..- .--..||-||.--+-..+++.|+..+. T Consensus 8 ~~hi~tlr~~~~~L~~l~~~l~~g~~~aP~~l-tih~PPDLrs--Gstr~~~eVP~~~Vy~Fa~yIfQQlNr 76 (199) T pfam11444 8 QAHILTLRLAIGLLFVILLALIVGWMTAPKRL-TIHNPPDLRS--GSTRKWWEVPPENVYAFAFYIFQQLNR 76 (199) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCE-EEECCCCCCC--CCCEECCCCCHHHHHHHHHHHHHHHHC T ss_conf 98999999999999999999999998698542-8877987677--771076638858867899999999700 Done!