Query gi|254781029|ref|YP_003065442.1| chemotaxis sensory transducer [Candidatus Liberibacter asiaticus str. psy62] Match_columns 1828 No_of_seqs 3111 out of 5405 Neff 11.9 Searched_HMMs 39220 Date Mon May 30 09:31:47 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781029.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam01442 Apolipoprotein Apoli 98.0 0.0023 6E-08 43.2 27.2 180 266-445 8-191 (191) 2 KOG0161 consensus 96.4 0.075 1.9E-06 30.5 124.2 29 1605-1633 1863-1891(1930) 3 PRK10361 DNA recombination pro 95.7 0.14 3.7E-06 28.1 29.6 92 226-335 66-157 (475) 4 pfam05656 DUF805 Protein of un 95.4 0.15 3.9E-06 27.9 8.4 69 91-169 4-72 (112) 5 pfam04156 IncA IncA protein. C 93.4 0.44 1.1E-05 24.0 21.5 26 97-122 4-29 (186) 6 KOG4674 consensus 93.3 0.45 1.2E-05 23.9 155.2 10 1741-1750 1745-1754(1822) 7 COG3152 Predicted membrane pro 90.9 0.82 2.1E-05 21.7 7.9 71 87-169 9-79 (125) 8 TIGR02121 Na_Pro_sym sodium/pr 90.2 0.81 2.1E-05 21.8 5.3 48 108-158 159-206 (517) 9 pfam07900 DUF1670 Protein of u 89.0 0.46 1.2E-05 23.9 3.3 25 1765-1792 167-191 (220) 10 pfam07415 Herpes_LMP2 Gammaher 87.9 1.3 3.3E-05 20.1 10.6 24 27-50 25-48 (489) 11 PRK09752 adhesin; Provisional 82.4 1.3 3.4E-05 19.9 3.0 12 1775-1786 1002-1013(1250) 12 PRK08476 F0F1 ATP synthase sub 81.7 2.3 5.8E-05 18.0 19.4 26 137-165 6-31 (141) 13 KOG2365 consensus 80.8 2.5 6.2E-05 17.7 4.3 21 131-151 48-68 (808) 14 COG3898 Uncharacterized membra 80.4 2.5 6.4E-05 17.6 6.9 61 102-165 3-68 (531) 15 pfam05653 DUF803 Protein of un 79.0 2.7 7E-05 17.3 5.8 51 97-154 48-98 (300) 16 COG3158 Kup K+ transporter [In 78.9 2.1 5.4E-05 18.3 3.0 17 145-161 170-186 (627) 17 TIGR01992 PTS-IIBC-Tre PTS sys 76.7 3.1 8E-05 16.8 4.6 14 145-158 474-487 (489) 18 PRK10745 trkD potassium transp 76.6 3.2 8E-05 16.8 7.2 17 146-162 166-182 (622) 19 pfam03976 PPK2 Polyphosphate k 75.7 3.3 8.4E-05 16.6 3.5 12 1746-1757 89-100 (229) 20 pfam07739 TipAS TipAS antibiot 75.6 3.3 8.5E-05 16.6 6.0 52 1736-1796 67-118 (122) 21 COG4420 Predicted membrane pro 75.6 3.3 8.5E-05 16.6 6.1 68 81-154 39-108 (191) 22 pfam05977 DUF894 Bacterial pro 74.7 3.5 8.9E-05 16.4 7.5 60 96-165 10-72 (524) 23 pfam10066 DUF2304 Uncharacteri 74.1 3.6 9.2E-05 16.3 9.6 71 108-182 33-106 (115) 24 pfam04276 DUF443 Protein of un 73.6 3.7 9.4E-05 16.2 3.8 10 147-156 97-106 (197) 25 pfam11044 TMEMspv1-c74-12 Plec 73.6 3.7 9.4E-05 16.2 3.3 22 100-122 2-23 (49) 26 TIGR00869 sec62 protein transl 72.9 3.8 9.7E-05 16.1 6.3 27 137-163 159-185 (259) 27 pfam11833 DUF3353 Protein of u 72.9 3.8 9.7E-05 16.1 5.3 34 86-119 125-161 (193) 28 COG3071 HemY Uncharacterized e 72.8 3.8 9.8E-05 16.1 5.4 24 102-125 3-26 (400) 29 pfam07245 Phlebovirus_G2 Phleb 72.5 3.9 9.9E-05 16.0 4.0 71 75-169 426-498 (504) 30 pfam11998 DUF3493 Protein of u 71.4 4.1 0.0001 15.8 5.6 14 97-110 20-33 (75) 31 pfam11990 DUF3487 Protein of u 71.4 4.1 0.0001 15.8 6.5 72 85-165 4-76 (122) 32 COG5615 Predicted integral mem 70.3 3.3 8.3E-05 16.7 2.2 51 130-188 79-129 (161) 33 PRK10414 biopolymer transport 70.1 4.3 0.00011 15.6 9.2 53 131-184 7-59 (239) 34 KOG4673 consensus 69.9 4.4 0.00011 15.6 59.9 20 1677-1696 808-827 (961) 35 TIGR03348 VI_IcmF type VI secr 66.7 5 0.00013 15.1 8.4 17 99-115 2-18 (1169) 36 COG4239 ABC-type uncharacteriz 66.2 5.1 0.00013 15.0 6.1 58 106-164 152-213 (341) 37 PRK01637 rbn ribonuclease BN; 65.7 5.2 0.00013 15.0 7.6 17 141-157 31-47 (284) 38 TIGR01946 rnfD electron transp 65.3 5.3 0.00013 14.9 5.0 39 95-142 82-120 (346) 39 KOG3590 consensus 65.2 4.3 0.00011 15.7 2.0 17 146-162 117-133 (602) 40 TIGR00328 flhB flagellar biosy 65.1 5.3 0.00014 14.9 8.1 19 101-119 31-49 (352) 41 pfam11124 Pho86 Inorganic phos 64.9 5.4 0.00014 14.9 8.6 89 85-176 35-134 (296) 42 PRK13825 conjugal transfer pro 64.1 5.5 0.00014 14.7 3.4 16 147-162 136-151 (389) 43 pfam02526 GBP_repeat Glycophor 64.1 5 0.00013 15.1 2.1 19 1766-1784 8-26 (38) 44 PRK10525 cytochrome o ubiquino 64.1 5.5 0.00014 14.7 3.9 16 150-165 95-110 (305) 45 PRK12505 putative monovalent c 63.5 5.6 0.00014 14.7 3.5 20 141-160 124-143 (156) 46 TIGR00828 EIID-AGA PTS system, 63.4 5 0.00013 15.1 2.0 20 139-159 126-145 (272) 47 MTH00213 ND6 NADH dehydrogenas 63.3 5.7 0.00015 14.6 5.3 52 101-162 43-96 (312) 48 PRK10633 hypothetical protein; 62.8 5.8 0.00015 14.6 6.7 54 103-164 14-67 (80) 49 PRK10999 malF maltose transpor 61.9 6 0.00015 14.4 7.6 15 150-164 329-343 (520) 50 COG4219 MecR1 Antirepressor re 61.4 6.1 0.00016 14.4 2.7 16 103-118 97-112 (337) 51 pfam07181 VirC2 VirC2 protein. 60.7 6.3 0.00016 14.3 4.0 38 1758-1795 93-130 (202) 52 PRK00302 lnt apolipoprotein N- 60.6 6.3 0.00016 14.3 8.7 25 88-112 43-67 (509) 53 pfam10028 DUF2270 Predicted in 60.5 6.3 0.00016 14.3 4.0 33 85-118 112-144 (185) 54 COG3336 Predicted membrane pro 60.4 6.3 0.00016 14.2 4.9 52 106-159 47-98 (299) 55 COG3402 Uncharacterized conser 59.9 6.4 0.00016 14.2 6.1 56 100-156 18-74 (161) 56 COG3462 Predicted membrane pro 59.5 6.5 0.00017 14.1 3.1 40 147-186 61-102 (117) 57 PRK06251 V-type ATP synthase s 59.4 6.5 0.00017 14.1 3.3 48 108-157 42-99 (101) 58 pfam11712 Vma12 Endoplasmic re 59.4 6.6 0.00017 14.1 10.2 77 79-166 58-134 (137) 59 PRK13862 putative crown gall t 59.3 6.6 0.00017 14.1 4.0 39 1757-1795 92-130 (201) 60 TIGR03008 pepcterm_CAAX CAAX p 59.3 6.6 0.00017 14.1 4.8 63 94-156 72-143 (233) 61 pfam07098 DUF1360 Protein of u 58.9 6.7 0.00017 14.1 5.0 32 81-125 50-81 (105) 62 pfam06738 DUF1212 Protein of u 58.6 6.7 0.00017 14.0 6.5 46 82-131 81-129 (193) 63 COG2966 Uncharacterized conser 58.4 6.8 0.00017 14.0 6.3 111 81-202 102-224 (250) 64 pfam03907 Spo7 Spo7-like prote 58.3 6.8 0.00017 14.0 4.9 68 90-162 20-87 (169) 65 pfam11234 DUF3036 Protein of u 57.7 6.9 0.00018 13.9 7.2 23 95-117 85-107 (155) 66 TIGR03109 exosortase_1 exosort 57.7 6.9 0.00018 13.9 7.1 108 89-198 59-168 (267) 67 pfam08566 Pam17 Mitochondrial 57.6 7 0.00018 13.9 5.1 63 102-164 39-110 (174) 68 PRK10649 hypothetical protein; 56.6 7.2 0.00018 13.8 8.8 63 146-208 47-109 (564) 69 COG2807 CynX Cyanate permease 56.4 7.3 0.00019 13.7 4.4 39 150-204 55-97 (395) 70 PRK10490 sensor protein KdpD; 55.3 7.5 0.00019 13.6 6.5 20 1729-1748 820-839 (895) 71 PRK09458 pspB phage shock prot 54.4 6.2 0.00016 14.3 1.3 19 134-155 5-23 (75) 72 KOG1029 consensus 54.2 7.8 0.0002 13.5 38.4 28 230-257 320-347 (1118) 73 TIGR00927 2A1904 K+-dependent 54.2 7.8 0.0002 13.5 4.2 16 150-165 509-524 (1226) 74 COG0004 AmtB Ammonia permease 53.7 7.8 0.0002 13.5 1.7 22 101-122 44-65 (409) 75 pfam10725 DUF2517 Protein of u 53.4 3.7 9.3E-05 16.2 -0.0 14 143-156 18-31 (63) 76 pfam05745 CRPA Chlamydia 15 kD 53.4 8 0.0002 13.4 5.3 55 96-155 60-115 (150) 77 KOG4298 consensus 53.1 8.1 0.00021 13.3 5.5 21 145-165 185-206 (245) 78 PRK02935 hypothetical protein; 52.7 8.2 0.00021 13.3 6.1 49 103-157 12-60 (117) 79 TIGR01194 cyc_pep_trnsptr cycl 52.4 8.3 0.00021 13.3 7.2 36 82-122 2-38 (555) 80 PRK09647 RNA polymerase sigma 52.4 6.1 0.00015 14.4 0.9 12 156-167 100-111 (222) 81 TIGR02976 phageshock_pspB phag 51.0 8.6 0.00022 13.1 3.0 18 149-166 7-24 (75) 82 PRK13657 cyclic beta-1,2-gluca 50.8 8.7 0.00022 13.1 10.1 29 94-122 14-42 (585) 83 TIGR02139 permease_CysT sulfat 50.5 8.8 0.00022 13.0 5.8 51 99-149 54-110 (265) 84 smart00502 BBC B-Box C-termina 50.5 8.8 0.00022 13.0 15.7 13 322-334 54-66 (127) 85 PRK12519 RNA polymerase sigma 50.0 8.9 0.00023 13.0 7.8 37 84-124 15-51 (194) 86 pfam09788 Tmemb_55A Transmembr 49.5 9.1 0.00023 12.9 2.1 21 103-123 202-222 (258) 87 pfam01034 Syndecan Syndecan do 49.4 9.1 0.00023 12.9 3.9 35 131-165 143-178 (207) 88 TIGR03108 eps_aminotran_1 exos 48.8 9.3 0.00024 12.8 2.3 37 1722-1759 583-619 (628) 89 pfam09803 DUF2346 Uncharacteri 48.7 9.3 0.00024 12.8 2.9 20 150-169 17-43 (80) 90 KOG2922 consensus 47.6 9.6 0.00025 12.7 5.5 32 98-129 63-94 (335) 91 pfam06667 PspB Phage shock pro 47.6 9.6 0.00025 12.7 3.0 12 147-158 15-26 (75) 92 COG2245 Predicted membrane pro 46.8 9.9 0.00025 12.6 8.8 21 101-121 55-75 (182) 93 pfam03239 FTR1 Iron permease F 46.3 10 0.00026 12.6 11.6 86 85-173 19-106 (284) 94 PRK10472 low affinity gluconat 46.0 10 0.00026 12.5 7.8 38 146-183 59-96 (445) 95 TIGR01837 PHA_granule_1 poly(h 45.4 10 0.00026 12.4 15.1 72 322-393 19-90 (121) 96 PRK05415 hypothetical protein; 45.3 10 0.00026 12.4 12.0 41 128-173 73-113 (338) 97 PRK10153 DNA-binding transcrip 45.1 10 0.00027 12.4 5.4 23 101-124 162-184 (512) 98 COG0795 Predicted permeases [G 45.1 10 0.00027 12.4 6.3 20 148-167 343-362 (364) 99 PRK12270 kgd alpha-ketoglutara 44.8 11 0.00027 12.4 4.6 33 158-190 251-283 (1234) 100 TIGR00946 2a69 Auxin Efflux Ca 44.6 11 0.00027 12.4 8.8 87 92-191 453-542 (608) 101 PRK09535 btuC corrinoid ABC tr 44.6 11 0.00027 12.4 8.0 29 85-114 112-140 (366) 102 COG3448 CBS-domain-containing 44.4 11 0.00027 12.3 2.0 15 124-138 128-142 (382) 103 PRK02542 photosystem I assembl 44.3 11 0.00027 12.3 5.4 63 94-162 21-89 (188) 104 COG4117 Thiosulfate reductase 44.0 11 0.00027 12.3 8.0 33 67-99 99-131 (221) 105 KOG3091 consensus 43.8 11 0.00028 12.3 17.2 52 172-236 252-303 (508) 106 pfam06196 DUF997 Protein of un 43.6 11 0.00028 12.2 6.8 55 102-164 8-62 (75) 107 TIGR03061 pip_yhgE_Nterm YhgE/ 43.5 11 0.00028 12.2 4.1 30 134-163 6-35 (164) 108 TIGR02353 NRPS_term_dom non-ri 43.2 11 0.00028 12.2 7.0 81 75-162 215-310 (719) 109 PRK09977 putative Mg(2+) trans 43.1 11 0.00028 12.2 5.9 18 105-122 96-113 (215) 110 PRK10160 taurine transporter s 43.0 11 0.00028 12.2 8.3 31 83-113 53-88 (275) 111 PRK11486 flagellar biosynthesi 42.9 11 0.00028 12.2 3.5 17 149-165 26-42 (124) 112 COG4477 EzrA Negative regulato 42.0 11 0.00029 12.1 59.5 55 145-199 5-61 (570) 113 PRK12468 flhB flagellar biosyn 41.9 12 0.00029 12.1 8.2 12 103-114 39-50 (383) 114 TIGR02978 phageshock_pspC phag 41.9 12 0.00029 12.1 4.9 42 113-162 15-57 (128) 115 COG4662 TupA ABC-type tungstat 41.4 12 0.0003 12.0 7.3 70 107-177 69-138 (227) 116 TIGR02226 two_anch N-terminal 40.4 12 0.00031 11.9 2.5 20 135-155 3-22 (84) 117 PRK06433 NADH dehydrogenase su 40.3 12 0.00031 11.9 5.9 28 132-164 52-79 (89) 118 PRK09292 Na(+)-translocating N 40.0 12 0.00031 11.8 2.8 14 150-163 183-196 (208) 119 PRK10983 putative inner membra 39.8 12 0.00031 11.8 5.4 11 145-155 75-85 (363) 120 PRK00247 putative inner membra 39.0 13 0.00032 11.7 7.7 38 140-177 33-70 (422) 121 KOG4702 consensus 38.8 13 0.00032 11.7 1.7 20 150-169 17-36 (77) 122 PRK10699 phosphatidylglyceroph 38.7 13 0.00032 11.7 5.8 58 94-159 6-69 (254) 123 pfam09472 MtrF Tetrahydrometha 38.4 13 0.00033 11.6 2.3 11 146-156 54-64 (64) 124 KOG3514 consensus 38.3 13 0.00033 11.6 4.5 22 1723-1744 1517-1538(1591) 125 pfam09678 Caa3_CtaG Cytochrome 38.1 13 0.00033 11.6 6.3 69 90-164 9-77 (241) 126 pfam11023 DUF2614 Protein of u 37.6 13 0.00034 11.6 6.1 48 104-157 13-60 (114) 127 PRK02251 putative septation in 37.5 13 0.00034 11.5 5.0 28 98-125 26-53 (84) 128 COG5617 Predicted integral mem 37.5 13 0.00034 11.5 10.4 93 85-179 77-183 (801) 129 pfam02683 DsbD Cytochrome C bi 37.3 13 0.00034 11.5 8.9 72 95-168 111-182 (206) 130 COG4252 Predicted transmembran 37.3 13 0.00034 11.5 7.2 38 85-122 303-343 (400) 131 pfam06791 TMP_2 Prophage tail 37.2 13 0.00034 11.5 14.4 69 82-154 11-79 (210) 132 KOG0916 consensus 37.0 13 0.00034 11.5 5.5 29 246-274 222-250 (1679) 133 COG2962 RarD Predicted permeas 37.0 13 0.00034 11.5 4.6 18 96-113 4-21 (293) 134 PRK00159 putative septation in 36.9 14 0.00034 11.5 6.6 31 95-125 25-55 (87) 135 PRK12438 hypothetical protein; 36.7 14 0.00035 11.5 6.9 25 140-164 62-86 (979) 136 PRK10489 enterobactin exporter 36.4 14 0.00035 11.4 8.3 70 92-171 13-85 (416) 137 PRK10952 glycine betaine trans 36.4 14 0.00035 11.4 7.4 108 96-208 52-161 (354) 138 PRK09821 inner membrane permea 36.4 14 0.00035 11.4 6.7 41 145-185 58-98 (454) 139 PRK13105 ubiA prenyltransferas 36.3 14 0.00035 11.4 5.3 43 105-147 112-163 (291) 140 KOG2533 consensus 36.2 14 0.00035 11.4 1.8 15 152-166 95-109 (495) 141 COG3838 VirB2 Type IV secretor 36.0 14 0.00035 11.4 5.6 37 127-164 41-77 (108) 142 COG2456 Uncharacterized conser 35.5 14 0.00036 11.3 9.1 82 95-187 36-118 (121) 143 COG5530 Predicted integral mem 35.3 14 0.00036 11.3 3.3 27 182-208 126-152 (247) 144 pfam11155 DUF2935 Protein of u 35.0 14 0.00037 11.2 2.7 21 1773-1793 23-43 (44) 145 PRK10522 multidrug transporter 34.8 14 0.00037 11.2 6.9 14 100-113 10-23 (547) 146 pfam08405 Calici_PP_N Viral po 34.7 14 0.00037 11.2 2.7 11 106-116 171-181 (358) 147 KOG3142 consensus 34.2 15 0.00038 11.1 3.0 51 105-156 81-140 (187) 148 KOG4050 consensus 34.1 15 0.00038 11.1 6.2 21 144-164 133-153 (188) 149 pfam02447 GntP_permease GntP f 34.0 15 0.00038 11.1 7.5 39 145-183 54-92 (441) 150 pfam05579 Peptidase_S32 Equine 33.6 15 0.00038 11.1 0.7 11 83-93 169-179 (426) 151 KOG4580 consensus 33.5 15 0.00038 11.1 6.9 20 150-169 56-75 (112) 152 PRK13453 F0F1 ATP synthase sub 33.4 15 0.00039 11.0 19.1 13 151-163 28-40 (173) 153 TIGR02141 modB_ABC molybdate A 33.3 15 0.00039 11.0 5.5 24 101-124 4-27 (212) 154 PRK09705 cynX putative cyanate 33.1 15 0.00039 11.0 7.2 65 95-165 1-67 (393) 155 pfam05915 DUF872 Eukaryotic pr 32.7 15 0.0004 11.0 3.1 62 101-165 38-100 (112) 156 COG3190 FliO Flagellar biogene 32.6 16 0.0004 11.0 4.3 22 145-166 28-49 (137) 157 PRK10591 hypothetical protein; 32.6 16 0.0004 11.0 5.5 33 131-165 35-67 (96) 158 PRK06638 NADH dehydrogenase su 32.4 16 0.0004 10.9 6.7 27 132-163 51-77 (196) 159 COG0785 CcdA Cytochrome c biog 32.4 16 0.0004 10.9 8.4 65 98-164 127-191 (220) 160 pfam03116 NQR2_RnfD_RnfE NQR2, 32.2 16 0.0004 10.9 5.1 50 85-144 77-126 (319) 161 pfam06644 ATP11 ATP11 protein. 32.0 16 0.0004 10.9 10.0 58 1715-1777 69-127 (250) 162 COG1173 DppC ABC-type dipeptid 31.4 16 0.00041 10.8 10.4 55 97-151 85-146 (289) 163 COG1743 Adenine-specific DNA m 31.3 16 0.00041 10.8 0.6 27 72-99 51-77 (875) 164 COG1862 YajC Preprotein transl 31.0 16 0.00042 10.8 3.3 27 147-173 11-37 (97) 165 pfam06966 DUF1295 Protein of u 30.9 16 0.00042 10.7 3.6 14 100-113 39-52 (235) 166 PRK11176 lipid transporter ATP 30.9 16 0.00042 10.7 12.1 44 78-122 2-47 (581) 167 cd03498 SQR_TypeB_2_TM Succina 30.8 16 0.00042 10.7 8.0 76 78-157 127-208 (209) 168 TIGR00886 2A0108 nitrite trans 30.6 17 0.00042 10.7 8.6 75 90-165 86-166 (472) 169 pfam10762 DUF2583 Protein of u 30.6 17 0.00042 10.7 3.2 22 127-148 34-55 (89) 170 PRK10847 hypothetical protein; 30.5 17 0.00042 10.7 7.6 68 84-156 140-207 (219) 171 pfam07298 NnrU NnrU protein. T 30.4 17 0.00043 10.7 5.9 33 87-119 24-56 (184) 172 COG3686 Predicted membrane pro 30.4 17 0.00043 10.7 5.0 51 89-143 68-119 (125) 173 TIGR01987 HI0074 nucleotidyltr 30.3 17 0.00043 10.7 1.8 10 1784-1793 117-126 (130) 174 PRK09586 murP N-acetylmuramic 30.0 17 0.00043 10.6 7.6 14 99-112 122-135 (474) 175 pfam04632 FUSC Fusaric acid re 30.0 17 0.00043 10.6 4.6 13 150-162 73-85 (649) 176 TIGR02745 ccoG_rdxA_fixG cytoc 29.8 17 0.00044 10.6 7.9 69 91-160 113-185 (474) 177 pfam02392 Ycf4 Ycf4. This fami 29.6 17 0.00044 10.6 5.5 45 100-148 18-68 (180) 178 PRK10692 hypothetical protein; 29.6 17 0.00044 10.6 4.3 22 127-148 34-55 (92) 179 PRK13499 rhamnose-proton sympo 29.6 17 0.00044 10.6 3.7 28 92-120 65-92 (345) 180 TIGR00769 AAA ADP/ATP carrier 29.3 17 0.00044 10.5 4.6 37 110-146 359-396 (491) 181 pfam11242 DUF2774 Protein of u 29.1 17 0.00045 10.5 2.9 35 1724-1758 16-59 (63) 182 COG1079 Uncharacterized ABC-ty 29.1 18 0.00045 10.5 3.9 34 93-126 182-215 (304) 183 pfam06532 DUF1109 Protein of u 29.1 18 0.00045 10.5 9.4 90 95-189 78-176 (204) 184 COG3242 Uncharacterized protei 29.0 18 0.00045 10.5 4.0 30 108-139 10-39 (62) 185 PRK10144 formate-dependent nit 29.0 18 0.00045 10.5 2.0 19 148-166 105-123 (126) 186 pfam00122 E1-E2_ATPase E1-E2 A 28.9 18 0.00045 10.5 7.9 77 82-165 120-211 (222) 187 TIGR03638 cas1_ECOLI CRISPR-as 28.9 7.7 0.0002 13.5 -1.6 13 171-185 111-123 (269) 188 pfam06781 UPF0233 Uncharacteri 28.8 18 0.00045 10.5 6.6 29 96-124 26-54 (87) 189 PRK10747 putative protoheme IX 28.8 18 0.00045 10.5 7.9 17 149-165 49-65 (398) 190 KOG1923 consensus 28.8 18 0.00045 10.5 13.1 77 82-160 18-103 (830) 191 COG4325 Predicted membrane pro 28.5 18 0.00045 10.4 5.7 25 94-122 32-56 (464) 192 pfam04956 TrbC TrbC/VIRB2 fami 28.5 18 0.00046 10.4 4.9 75 100-180 9-88 (99) 193 TIGR01667 YCCS_YHJK integral m 28.4 18 0.00046 10.4 8.0 92 94-189 79-192 (724) 194 TIGR02005 PTS-IIBC-alpha PTS s 28.3 18 0.00046 10.4 2.0 10 114-123 290-299 (533) 195 PRK10209 acid-resistance membr 28.1 18 0.00046 10.4 6.8 15 100-114 78-92 (190) 196 KOG2563 consensus 27.9 18 0.00047 10.4 3.4 43 157-200 97-149 (480) 197 PRK05760 F0F1 ATP synthase sub 27.9 18 0.00047 10.4 8.7 27 149-175 45-71 (124) 198 pfam04901 RAMP Receptor activi 27.8 18 0.00047 10.4 2.4 28 137-166 81-109 (115) 199 pfam07300 consensus 27.7 18 0.00047 10.3 4.4 22 95-116 12-33 (111) 200 pfam08627 CRT-like CRT-like. T 27.5 19 0.00047 10.3 7.8 68 82-154 36-103 (130) 201 pfam11897 DUF3417 Protein of u 26.9 18 0.00045 10.5 -0.0 82 1721-1815 30-115 (119) 202 KOG1925 consensus 26.8 19 0.00049 10.2 2.9 19 1732-1750 753-771 (817) 203 COG0609 FepD ABC-type Fe3+-sid 26.7 19 0.00049 10.2 7.2 36 85-123 71-106 (334) 204 PRK12671 putative monovalent c 26.7 19 0.00049 10.2 7.1 27 98-124 6-32 (120) 205 COG5232 SEC62 Preprotein trans 26.6 19 0.00049 10.2 3.4 20 139-158 188-207 (259) 206 pfam02508 Rnf-Nqr Rnf-Nqr subu 26.5 19 0.00049 10.2 3.4 16 102-117 4-19 (188) 207 PRK13109 flhB flagellar biosyn 26.4 19 0.00049 10.2 7.7 30 143-182 89-118 (358) 208 PRK00816 rnfD electron transpo 26.3 19 0.00049 10.1 5.1 49 85-143 79-127 (351) 209 pfam11139 DUF2910 Protein of u 26.2 19 0.0005 10.1 8.7 35 85-122 15-49 (212) 210 KOG2257 consensus 26.1 19 0.0005 10.1 6.8 106 94-220 3-114 (135) 211 pfam05982 DUF897 Domain of unk 26.1 19 0.0005 10.1 4.4 51 135-200 53-105 (327) 212 TIGR02587 TIGR02587 putative i 26.1 20 0.0005 10.1 6.4 84 77-161 157-253 (284) 213 pfam09716 ETRAMP Malarial earl 26.0 20 0.0005 10.1 2.7 36 88-123 40-76 (84) 214 pfam06757 Ins_allergen_rp Inse 26.0 20 0.0005 10.1 7.3 65 1718-1784 103-167 (181) 215 pfam07214 DUF1418 Protein of u 26.0 20 0.0005 10.1 5.7 34 130-165 34-67 (96) 216 PRK11562 nitrite transporter N 25.7 20 0.0005 10.1 7.0 28 95-122 21-48 (268) 217 TIGR00957 MRP_assoc_pro multi 25.7 20 0.0005 10.1 6.1 39 105-143 71-110 (1542) 218 PRK05349 Na(+)-translocating N 25.7 20 0.00051 10.1 4.9 14 100-113 59-72 (407) 219 pfam05478 Prominin Prominin. T 25.5 20 0.00051 10.0 31.8 15 244-258 187-201 (809) 220 TIGR02916 PEP_his_kin putative 25.1 20 0.00052 10.0 6.6 33 92-124 24-56 (696) 221 COG3162 Predicted membrane pro 25.1 20 0.00052 10.0 5.5 17 152-168 72-89 (102) 222 pfam04120 Iron_permease Low af 25.1 20 0.00052 10.0 3.3 25 95-119 12-36 (133) 223 TIGR02163 napH_ ferredoxin-typ 25.0 20 0.00052 9.9 5.2 30 127-159 45-76 (263) 224 TIGR02800 propeller_TolB Tol-P 24.9 21 0.00052 9.9 1.2 34 1722-1756 137-170 (439) 225 KOG1962 consensus 24.9 21 0.00053 9.9 20.7 20 103-122 2-21 (216) 226 PRK05470 fumarate reductase su 24.8 21 0.00053 9.9 3.3 39 102-140 23-63 (118) 227 pfam02313 Fumarate_red_D Fumar 24.7 21 0.00053 9.9 3.6 38 103-140 24-63 (118) 228 PRK02889 tolB translocation pr 24.6 21 0.00054 9.9 0.9 25 1729-1754 141-165 (430) 229 PRK04897 heat shock protein Ht 24.6 21 0.00054 9.8 11.4 29 204-233 121-149 (298) 230 pfam10755 DUF2585 Protein of u 24.5 21 0.00054 9.8 6.8 59 93-154 56-125 (165) 231 CHL00036 ycf4 photosystem I as 24.5 21 0.00054 9.8 5.5 42 100-145 21-68 (184) 232 PRK10973 glycerol-3-phosphate 24.5 21 0.00054 9.8 7.5 29 82-110 52-82 (281) 233 COG3924 Predicted membrane pro 24.4 21 0.00054 9.8 6.2 55 101-163 12-66 (80) 234 TIGR00917 2A060601 Niemann-Pic 24.4 21 0.00055 9.8 0.5 24 101-125 235-258 (1471) 235 COG4942 Membrane-bound metallo 24.2 22 0.00055 9.7 35.2 16 1682-1697 313-328 (420) 236 KOG4327 consensus 24.1 22 0.00056 9.7 3.5 13 1633-1645 124-136 (218) 237 COG1174 OpuBB ABC-type proline 23.7 22 0.00057 9.6 7.0 18 177-194 137-154 (221) 238 PRK08665 ribonucleotide-diphos 23.7 22 0.00057 9.6 3.6 10 1722-1731 632-641 (733) 239 COG4720 Predicted membrane pro 23.6 23 0.00057 9.6 7.3 23 99-121 106-128 (177) 240 pfam05664 DUF810 Protein of un 23.6 23 0.00058 9.6 7.0 15 190-204 52-66 (674) 241 KOG0072 consensus 23.6 23 0.00058 9.6 0.5 33 71-104 49-81 (182) 242 cd01209 SHC SHC phosphotyrosin 23.4 23 0.00058 9.5 1.6 12 1715-1726 133-144 (160) 243 PRK09881 ATP-dependent peptide 23.3 23 0.00059 9.5 11.1 51 90-141 88-143 (296) 244 PRK00944 hypothetical protein; 23.3 23 0.00059 9.5 6.9 17 95-111 85-101 (193) 245 pfam05569 Peptidase_M56 BlaR1 23.3 23 0.00059 9.5 3.9 56 231-287 186-250 (299) 246 COG1495 DsbB Disulfide bond fo 23.2 23 0.00059 9.5 6.8 24 100-123 13-36 (170) 247 KOG0206 consensus 23.2 23 0.00059 9.5 8.0 12 148-159 83-94 (1151) 248 TIGR00897 2A0118 polyol permea 23.2 23 0.00059 9.5 4.2 65 97-162 321-399 (411) 249 CHL00016 ndhG NADH dehydrogena 23.2 23 0.00059 9.5 7.4 27 132-163 55-81 (177) 250 COG1808 Predicted membrane pro 23.1 23 0.0006 9.5 6.4 30 85-114 10-41 (334) 251 pfam07926 TPR_MLP1_2 TPR/MLP1/ 23.1 23 0.0006 9.5 20.1 23 228-250 11-33 (132) 252 KOG2265 consensus 23.1 23 0.0006 9.5 1.6 63 1737-1820 99-161 (179) 253 pfam06210 DUF1003 Protein of u 23.1 23 0.0006 9.4 4.9 50 99-155 5-55 (108) 254 COG4425 Predicted membrane pro 23.1 23 0.0006 9.4 9.7 96 93-197 64-167 (588) 255 COG4149 ModC ABC-type molybdat 22.9 24 0.0006 9.4 4.6 41 98-138 10-52 (225) 256 KOG0243 consensus 22.9 24 0.00061 9.4 56.6 17 149-165 163-179 (1041) 257 pfam12089 DUF3566 Transmembran 22.9 24 0.00061 9.4 8.2 65 98-162 19-99 (120) 258 COG1295 Rbn Ribonuclease BN fa 22.9 24 0.00061 9.4 7.3 60 93-162 215-274 (303) 259 pfam10164 DUF2367 Uncharacteri 22.9 24 0.00061 9.4 2.8 26 139-164 65-90 (98) 260 PRK12342 putative electron tra 22.9 10 0.00026 12.4 -1.8 30 177-208 42-77 (254) 261 TIGR02415 23BDH acetoin reduct 22.8 24 0.00061 9.4 0.3 10 181-190 222-231 (258) 262 pfam11297 DUF3098 Protein of u 22.8 24 0.00061 9.4 1.3 49 110-163 9-57 (64) 263 pfam08510 PIG-P PIG-P. PIG-P ( 22.8 24 0.00061 9.4 7.9 66 96-163 1-66 (123) 264 TIGR01912 TatC-Arch Sec-indepe 22.8 24 0.00061 9.4 1.8 25 136-162 225-249 (255) 265 PRK12837 3-ketosteroid-delta-1 22.8 24 0.00061 9.4 2.8 22 1763-1784 379-401 (515) 266 pfam05758 Ycf1 Ycf1. The chlor 22.7 24 0.00062 9.3 6.1 63 87-158 10-83 (832) 267 PRK12844 3-ketosteroid-delta-1 22.7 24 0.00062 9.3 2.7 22 1763-1784 410-432 (552) 268 pfam07461 NADase_NGA Nicotine 22.7 24 0.00062 9.3 2.2 29 1747-1784 196-227 (446) 269 PRK09174 F0F1 ATP synthase sub 22.6 24 0.00062 9.3 17.8 108 242-349 77-184 (204) 270 TIGR00847 ccoS cytochrome oxid 22.5 24 0.00062 9.3 2.7 18 145-162 3-20 (53) 271 TIGR00903 2A0129 major facilit 22.5 24 0.00062 9.3 1.3 42 110-151 194-247 (368) 272 PRK12835 3-ketosteroid-delta-1 22.5 24 0.00062 9.3 2.7 22 1763-1784 431-453 (584) 273 PRK09414 glutamate dehydrogena 22.5 25 0.00063 9.3 4.5 34 1761-1794 5-38 (446) 274 TIGR02908 CoxD_Bacillus cytoch 22.4 25 0.00063 9.3 2.8 24 138-163 86-109 (110) 275 KOG0720 consensus 22.4 8.1 0.00021 13.3 -2.5 85 94-185 62-146 (490) 276 TIGR00955 3a01204 Pigment prec 22.4 25 0.00063 9.3 4.2 58 100-157 484-543 (671) 277 TIGR01163 rpe ribulose-phospha 22.4 25 0.00063 9.3 1.5 11 1781-1791 70-80 (216) 278 PRK11922 RNA polymerase sigma 22.2 25 0.00064 9.2 4.3 41 1705-1745 1-41 (231) 279 PRK01024 Na(+)-translocating N 22.2 25 0.00064 9.2 4.6 52 96-156 163-217 (503) 280 pfam06965 Na_H_antiport_1 Na+/ 22.2 25 0.00064 9.2 4.9 54 111-164 253-308 (374) 281 TIGR01713 typeII_sec_gspC gene 22.2 25 0.00064 9.2 1.3 20 145-164 2-21 (281) 282 KOG3738 consensus 22.2 25 0.00064 9.2 0.7 31 102-133 7-37 (559) 283 COG2311 Predicted membrane pro 22.2 25 0.00064 9.2 11.6 96 85-180 129-245 (394) 284 COG1377 FlhB Flagellar biosynt 22.1 25 0.00064 9.2 8.9 77 101-177 153-229 (363) 285 KOG0015 consensus 22.1 12 0.0003 12.0 -1.7 8 1811-1818 110-117 (338) 286 KOG1010 consensus 22.1 25 0.00064 9.2 1.9 23 1763-1785 741-763 (920) 287 pfam07798 DUF1640 Protein of u 21.9 26 0.00065 9.1 22.5 160 180-339 8-167 (177) 288 COG1298 FlhA Flagellar biosynt 21.9 26 0.00065 9.1 2.0 86 88-183 64-152 (696) 289 COG4171 SapC ABC-type antimicr 21.9 26 0.00065 9.1 6.9 66 97-162 101-167 (296) 290 PRK09824 beta-glucoside-specif 21.9 26 0.00065 9.1 4.8 63 100-162 378-449 (625) 291 COG1272 Predicted membrane pro 21.9 26 0.00065 9.1 5.0 60 102-163 116-175 (226) 292 pfam05568 ASFV_J13L African sw 21.9 26 0.00065 9.1 3.0 18 140-158 35-52 (189) 293 TIGR00892 2A0113 monocarboxyla 21.9 26 0.00065 9.1 3.5 60 84-154 102-163 (475) 294 PRK09776 putative sensor prote 21.8 26 0.00066 9.1 7.6 79 101-183 127-205 (1116) 295 KOG0559 consensus 21.8 26 0.00066 9.1 5.0 75 9-84 151-225 (457) 296 pfam04290 DctQ Tripartite ATP- 21.8 26 0.00066 9.1 7.2 69 96-164 58-126 (133) 297 KOG0832 consensus 21.7 26 0.00066 9.1 0.8 19 76-94 217-235 (251) 298 PRK09500 potC spermidine/putre 21.7 26 0.00066 9.1 7.8 68 97-164 181-254 (257) 299 TIGR00844 c_cpa1 sodium/hydrog 21.7 26 0.00066 9.1 0.0 44 114-157 17-60 (923) 300 pfam10701 consensus 21.7 26 0.00066 9.1 7.5 68 102-169 28-95 (118) 301 PRK06945 flgK flagellar hook-a 21.7 26 0.00066 9.1 10.5 84 251-336 127-210 (649) 302 pfam04403 PqiA Paraquat-induci 21.7 26 0.00066 9.1 11.0 79 100-178 7-108 (162) 303 PRK12883 ubiA prenyltransferas 21.7 26 0.00066 9.1 7.6 67 102-168 104-172 (275) 304 PRK02391 heat shock protein Ht 21.6 26 0.00066 9.1 8.1 67 101-169 150-216 (297) 305 PRK01265 heat shock protein Ht 21.6 26 0.00066 9.1 6.6 73 107-179 10-95 (326) 306 KOG3145 consensus 21.6 26 0.00066 9.1 3.0 44 113-156 306-349 (372) 307 PRK00220 putative glycerol-3-p 21.6 26 0.00067 9.0 7.4 62 101-167 120-181 (198) 308 COG3417 FlgN Collagen-binding 21.6 26 0.00067 9.0 0.4 59 1694-1756 16-76 (200) 309 KOG3626 consensus 21.6 26 0.00067 9.0 3.1 68 95-163 230-298 (735) 310 PRK06743 flagellar motor prote 21.6 26 0.00067 9.0 8.6 87 109-203 2-88 (254) 311 PRK13852 type IV secretion sys 21.6 26 0.00067 9.0 7.7 56 96-164 137-192 (295) 312 PRK09561 nhaA pH-dependent sod 21.5 26 0.00067 9.0 5.1 54 111-164 258-313 (388) 313 PRK13850 type IV secretion sys 21.5 26 0.00067 9.0 3.3 78 96-173 8-97 (670) 314 COG4789 EscV Type III secretor 21.5 26 0.00067 9.0 4.5 38 109-156 281-318 (689) 315 cd01042 DMQH Demethoxyubiquino 21.5 26 0.00067 9.0 3.6 33 85-117 49-82 (165) 316 PRK10941 putative transcriptio 21.4 27 0.00068 9.0 3.5 100 1719-1825 13-113 (269) 317 COG3416 Uncharacterized protei 21.4 27 0.00068 9.0 6.2 66 42-122 93-159 (233) 318 pfam10081 DUF2319 Predicted me 21.4 27 0.00068 9.0 11.3 111 85-197 52-162 (534) 319 TIGR00680 kdpA K+-transporting 21.4 27 0.00068 9.0 4.6 39 85-123 153-193 (571) 320 COG0586 DedA Uncharacterized m 21.4 27 0.00068 9.0 9.0 74 83-164 125-198 (208) 321 cd00378 SHMT Serine-glycine hy 21.3 27 0.00068 9.0 3.3 43 156-207 357-402 (402) 322 pfam09726 Macoilin Transmembra 21.3 27 0.00068 9.0 34.8 270 1400-1680 406-679 (680) 323 COG1972 NupC Nucleoside permea 21.3 27 0.00068 9.0 7.4 108 101-209 48-163 (404) 324 pfam07355 GRDB Glycine/sarcosi 21.3 27 0.00068 8.9 0.4 51 165-215 141-192 (349) 325 TIGR01165 cbiN cobalt transpor 21.2 27 0.00069 8.9 2.5 16 140-155 11-26 (94) 326 KOG0075 consensus 21.2 27 0.00069 8.9 1.4 12 1735-1746 77-88 (186) 327 pfam06898 YqfD Putative stage 21.2 27 0.00069 8.9 1.1 21 143-163 89-109 (383) 328 pfam03264 Cytochrom_NNT NapC/N 21.2 27 0.00069 8.9 4.0 24 139-162 7-30 (174) 329 PHA01399 membrane protein P6 21.2 27 0.00069 8.9 3.0 47 102-148 128-183 (242) 330 PRK06649 V-type ATP synthase s 21.2 27 0.00069 8.9 8.2 79 85-163 61-141 (144) 331 PRK09649 RNA polymerase sigma 21.1 27 0.00069 8.9 3.8 42 1732-1776 7-48 (185) 332 COG1842 PspA Phage shock prote 21.1 27 0.00069 8.9 25.0 218 1312-1536 4-221 (225) 333 PRK10755 sensor protein BasS/P 21.1 27 0.00069 8.9 16.4 169 100-291 21-195 (355) 334 pfam07074 TRAP-gamma Transloco 21.1 27 0.00069 8.9 2.7 20 142-161 23-42 (170) 335 pfam06930 DUF1282 Protein of u 21.1 27 0.0007 8.9 6.8 60 101-162 70-130 (170) 336 TIGR01828 pyru_phos_dikin pyru 20.9 28 0.0007 8.9 0.7 7 1739-1745 135-141 (920) 337 pfam08802 CytB6-F_Fe-S Cytochr 20.9 26 0.00067 9.0 -0.1 19 139-157 16-34 (36) 338 COG1175 UgpA ABC-type sugar tr 20.9 28 0.00071 8.8 8.5 64 97-164 62-125 (295) 339 PRK11383 hypothetical protein; 20.9 28 0.00071 8.8 4.6 42 102-149 80-121 (145) 340 pfam02554 CstA Carbon starvati 20.8 28 0.00071 8.8 7.8 69 83-164 114-182 (382) 341 PRK00665 petG cytochrome b6-f 20.8 28 0.00071 8.8 4.1 27 138-164 4-30 (38) 342 PRK10369 heme lyase subunit Nr 20.8 28 0.00071 8.8 5.5 84 84-169 237-320 (552) 343 TIGR02772 Ku_bact Ku protein; 20.8 28 0.00071 8.8 0.6 14 1814-1827 121-134 (271) 344 pfam09852 DUF2079 Predicted me 20.8 28 0.00071 8.8 9.6 79 85-163 159-261 (451) 345 pfam12127 YdfA_immunity SigmaW 20.8 28 0.00071 8.8 4.0 27 137-163 3-29 (321) 346 pfam06040 Adeno_E3 Adenovirus 20.7 28 0.00071 8.8 0.5 34 92-125 74-108 (126) 347 pfam09793 AD Anticodon-binding 20.7 28 0.00071 8.8 3.6 47 1719-1771 1-47 (90) 348 PRK05880 F0F1 ATP synthase sub 20.7 28 0.00071 8.8 7.4 61 103-163 2-73 (81) 349 PRK12911 bifunctional preprote 20.7 28 0.00071 8.8 6.9 64 88-158 1334-1397(1405) 350 COG1230 CzcD Co/Zn/Cd efflux s 20.7 28 0.00071 8.8 8.7 93 85-180 66-172 (296) 351 PRK05137 tolB translocation pr 20.7 28 0.00071 8.8 0.8 27 1729-1756 146-172 (437) 352 PRK05572 sporulation sigma fac 20.7 28 0.00072 8.8 3.5 41 1731-1773 12-52 (251) 353 PRK00011 glyA serine hydroxyme 20.7 28 0.00072 8.8 3.6 42 157-207 361-405 (415) 354 PRK12523 RNA polymerase sigma 20.7 28 0.00072 8.8 3.3 81 1731-1819 40-130 (172) 355 pfam11658 DUF3260 Protein of u 20.7 28 0.00072 8.8 5.6 52 111-163 47-115 (516) 356 KOG2677 consensus 20.7 28 0.00072 8.8 4.3 63 16-78 55-123 (922) 357 COG4987 CydC ABC-type transpor 20.7 28 0.00072 8.8 12.5 113 102-236 15-132 (573) 358 PRK08156 surface presentation 20.6 28 0.00072 8.8 7.9 70 93-162 21-90 (367) 359 TIGR00785 dass transporter, di 20.6 28 0.00072 8.8 6.8 53 109-162 329-388 (473) 360 cd06480 ACD_HspB8_like Alpha-c 20.6 28 0.00072 8.8 1.6 16 67-82 1-16 (91) 361 PRK12527 RNA polymerase sigma 20.6 28 0.00072 8.7 4.1 45 1726-1770 20-64 (159) 362 PRK13672 hypothetical protein; 20.6 28 0.00072 8.7 1.5 10 1735-1744 60-69 (72) 363 pfam10943 DUF2632 Protein of u 20.5 28 0.00072 8.7 4.6 56 107-164 60-115 (233) 364 PRK13021 secF preprotein trans 20.5 29 0.00073 8.7 3.4 55 95-158 229-283 (297) 365 COG0733 Na+-dependent transpor 20.5 29 0.00073 8.7 5.7 58 102-159 216-279 (439) 366 TIGR01148 mtrC tetrahydrometha 20.5 29 0.00073 8.7 5.2 52 102-153 173-228 (276) 367 KOG1025 consensus 20.5 29 0.00073 8.7 5.8 50 136-185 624-677 (1177) 368 TIGR01197 nramp metal ion tran 20.4 29 0.00073 8.7 5.0 80 84-178 19-98 (429) 369 pfam01271 Granin Granin (chrom 20.4 29 0.00073 8.7 3.1 37 107-153 4-42 (610) 370 KOG4517 consensus 20.4 29 0.00073 8.7 2.6 26 139-164 83-108 (117) 371 TIGR02718 sider_RhtX_FptX side 20.4 29 0.00073 8.7 7.2 85 66-165 186-272 (391) 372 pfam02011 Glyco_hydro_48 Glyco 20.4 27 0.0007 8.9 -0.1 57 1728-1784 506-586 (615) 373 pfam11693 DUF2990 Protein of u 20.3 29 0.00074 8.7 2.9 38 1755-1792 6-43 (64) 374 TIGR03580 EF_0832 conserved hy 20.3 29 0.00074 8.7 4.2 64 100-163 147-218 (246) 375 PRK06030 hypothetical protein; 20.3 29 0.00074 8.7 3.0 21 1771-1795 108-128 (136) 376 TIGR02476 BluB cob(II)yrinic a 20.3 29 0.00074 8.7 1.0 25 96-120 124-151 (206) 377 PRK02101 hypothetical protein; 20.3 29 0.00074 8.7 2.0 46 1726-1771 45-103 (257) 378 PRK07079 hypothetical protein; 20.2 29 0.00074 8.7 2.5 49 1731-1788 1-49 (468) 379 COG3748 Predicted membrane pro 20.2 29 0.00074 8.7 4.4 51 111-165 87-140 (407) 380 COG1721 Uncharacterized conser 20.1 29 0.00075 8.6 5.4 49 94-165 1-49 (416) 381 PRK08301 sporulation sigma fac 20.1 29 0.00075 8.6 3.3 42 1730-1773 38-79 (239) 382 COG4218 MtrF Tetrahydromethano 20.1 29 0.00075 8.6 2.9 36 102-157 38-73 (73) 383 TIGR02251 HIF-SF_euk Dullard-l 20.1 23 0.00058 9.6 -0.6 11 1747-1757 103-114 (193) 384 PRK09304 arginine exporter pro 20.1 29 0.00075 8.6 6.7 55 110-164 94-155 (197) 385 PRK07037 extracytoplasmic-func 20.1 29 0.00075 8.6 3.1 42 1733-1774 44-85 (183) 386 PRK05886 yajC preprotein trans 20.0 30 0.00075 8.6 4.2 30 146-175 4-35 (108) 387 COG2928 Uncharacterized conser 20.0 30 0.00075 8.6 7.8 71 101-173 9-86 (222) 388 TIGR00056 TIGR00056 conserved 20.0 30 0.00076 8.6 3.2 56 101-158 179-235 (275) No 1 >pfam01442 Apolipoprotein Apolipoprotein A1/A4/E domain. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, and Apolipoprotein E. Probab=97.96 E-value=0.0023 Score=43.18 Aligned_cols=180 Identities=14% Similarity=0.224 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 266 TSIAEVHESLKEELSLTSEEISVHLSRAIDSFQSIVDVRIAKVTEKTTRIVQESAQTISSKIDQLLEVLHSTSIVITKDF 345 (1828) Q Consensus 266 ~~i~~~~e~l~~el~~~s~~i~~~~~~~~e~~~~~le~r~~~l~~~~~~~~~~~~~~l~~~~~~l~~~L~~~~~~~~~~~ 345 (1828) ..+...-..|...+.-+++.+..++....+.+...+......+...+....+.+...++..++++...+.....++...+ T Consensus 8 ~e~~~y~~~l~~~l~P~t~~~~~~l~~~~~~lr~~l~~dleelr~~l~p~~~el~~~~~~~~eelr~~l~p~~~el~~~~ 87 (191) T pfam01442 8 DELNSYAEELQEQLGPYAQEFWAQLSKETEALREELQKDLEEVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRERL 87 (191) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999770516899998888999999999985899999987899999999999899999998777899999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999----999999999999999999999999999999999999999999999 Q gi|254781029|r 346 DNRIESLSNTLNNSGRSLANQVGNYTLMLGN----NTDKVSIALKEQSQQFMQAFTSHICEMSNFFSEKQKSITVTLNDV 421 (1828) Q Consensus 346 ~~~~e~l~~~L~~~~~~l~~~~~~~~~~l~~----~~e~l~~~l~e~~~~l~e~l~~~~~e~~~~l~~~~~~l~~~l~~~ 421 (1828) ....+.+...|....+++.+.+......+.. ..+.+...+.++...+.+.+.....++.+.+......+...+... T Consensus 88 ~~~~eeLr~~l~~~~eelr~~~~~~ve~lr~~l~P~~ee~r~kl~~~leel~~~~~p~a~e~~~~l~~~~~elr~~l~~~ 167 (191) T pfam01442 88 NRDAEELRRKLAPYAEELRDRLRQNVEALRARLGPYVEELRQKLAERLEELKESVGPYAEEYKAQLSEQVEELREKLEPY 167 (191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 86599999999999999999999779999999773899999999999999999867668999999999999999986899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999 Q gi|254781029|r 422 LQSLRISLQEKEDSFCSNLKSTTD 445 (1828) Q Consensus 422 ~~~l~~~l~e~~~~~~~~l~~~~~ 445 (1828) ...+...+.-....+...+....+ T Consensus 168 ~e~lr~~l~p~~e~lk~~l~~~~e 191 (191) T pfam01442 168 AEDLREKLEPYAEDLKKQLAQLLE 191 (191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999999867669999999986529 No 2 >KOG0161 consensus Probab=96.37 E-value=0.075 Score=30.48 Aligned_cols=29 Identities=17% Similarity=0.134 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999849999999999999999 Q gi|254781029|r 1605 NNSRDILKRDSVSLAKEAKESADTIRSAI 1633 (1828) Q Consensus 1605 e~~r~eLkr~~~eL~~Ea~e~~~~~rk~v 1633 (1828) ...-..++|.+-+.++++......+|+.. T Consensus 1863 ~~k~~~~krQleeaE~~~~~~~~k~R~~q 1891 (1930) T KOG0161 1863 QAKIKQYKRQLEEAEEEANQNLSKYRKLQ 1891 (1930) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999767877778888999999889 No 3 >PRK10361 DNA recombination protein RmuC; Provisional Probab=95.68 E-value=0.14 Score=28.12 Aligned_cols=92 Identities=12% Similarity=0.230 Sum_probs=37.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 226 RSEIEVLENNYTKSEMRIDNITQNLKQEREAIINHGTQLCTSIAEVHESLKEELSLTSEEISVHLSRAIDSFQSIVDVRI 305 (1828) Q Consensus 226 ~~ev~~ler~y~dnE~rir~L~~el~~ere~i~~~ae~~r~~i~~~~e~l~~el~~~s~~i~~~~~~~~e~~~~~le~r~ 305 (1828) .++...+....+..|...+.+--.+..+|... +.-...+....+.|+.++.....+| ++++. T Consensus 66 ~~el~~~~~~~~~lea~l~~~~~rle~~~~~~----eEk~~~l~~a~~~L~~eFenLAn~I--------------le~k~ 127 (475) T PRK10361 66 NNEVRSLQSINTSLEADLREVTTRMEAAQQHA----DDKIRQMINSEQRLSEQFENLANRI--------------FEHSN 127 (475) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHH T ss_conf 99999999999989999999999899999999----9999999999999999999999999--------------99988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999 Q gi|254781029|r 306 AKVTEKTTRIVQESAQTISSKIDQLLEVLH 335 (1828) Q Consensus 306 ~~l~~~~~~~~~~~~~~l~~~~~~l~~~L~ 335 (1828) ....+......+.+...|.++++.+....+ T Consensus 128 ~~f~eqn~e~l~~LL~Pl~E~l~~F~~~V~ 157 (475) T PRK10361 128 RRVDEQNRQSLNSLLSPLREQLDGFRRQVQ 157 (475) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 888999899999998789999999999999 No 4 >pfam05656 DUF805 Protein of unknown function (DUF805). This family consists of several bacterial proteins of unknown function. Probab=95.41 E-value=0.15 Score=27.87 Aligned_cols=69 Identities=10% Similarity=-0.026 Sum_probs=37.2 Q ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8541873227999999999999999999998316214555765640648899999999999999999999999999999 Q gi|254781029|r 91 NIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMH 169 (1828) Q Consensus 91 ~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~ 169 (1828) +.++|.+|..||+..++..+-......+....+.... .....++ ...++..+++.-.+|+.+||=+|.- T Consensus 4 ~f~GR~~R~efw~~~l~~~i~~~i~~~i~~~~~~~~~---------~~~~~~~-~~~~~~~~~~ip~ia~~vRRlhD~g 72 (112) T pfam05656 4 DFSGRASRSEYWWFVLFNILITLVLLILDALLGISSS---------LLSGGGL-LLGLISLAIFIPSLAVTVRRLHDTG 72 (112) T ss_pred ECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 0118776899999999999999999999999942013---------5578899-9999999999999999999998669 No 5 >pfam04156 IncA IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is non-fusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA, is functionally associated with the homotypic fusion of inclusions. This family probably includes members of the wider Inc family rather than just IncA. Probab=93.43 E-value=0.44 Score=24.03 Aligned_cols=26 Identities=19% Similarity=0.291 Sum_probs=15.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32279999999999999999999983 Q gi|254781029|r 97 LNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) ++....++.+++++-+++|+++...+ T Consensus 4 ~~i~~i~~iilgilLi~~gi~~l~~~ 29 (186) T pfam04156 4 SRIISIILIILGLLLIASGIALLVLF 29 (186) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999999999999999999997 No 6 >KOG4674 consensus Probab=93.34 E-value=0.45 Score=23.92 Aligned_cols=10 Identities=30% Similarity=0.086 Sum_probs=3.5 Q ss_pred HHHHHCCCCC Q ss_conf 9975074562 Q gi|254781029|r 1741 KSYTLGEDDI 1750 (1828) Q Consensus 1741 ~~y~~g~~~~ 1750 (1828) .+...|++++ T Consensus 1745 ~~~~~~e~d~ 1754 (1822) T KOG4674 1745 EQGTDVENDT 1754 (1822) T ss_pred CCCCCCCCCC T ss_conf 1356555541 No 7 >COG3152 Predicted membrane protein [Function unknown] Probab=90.89 E-value=0.82 Score=21.74 Aligned_cols=71 Identities=10% Similarity=0.065 Sum_probs=36.6 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89998541873227999999999999999999998316214555765640648899999999999999999999999999 Q gi|254781029|r 87 HILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRAR 166 (1828) Q Consensus 87 ~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~ 166 (1828) ...-+..+|.+|.+||+..+ +|++.++.+....+. ...++.. .+ ++...++.+..|+-..|+.+||-+ T Consensus 9 ~~yf~f~GR~~R~~fW~~~L---~~~ii~~vl~~~~~~--~~~~i~~------~~-~~~~~l~~la~~~p~lal~vrRLH 76 (125) T COG3152 9 KNYFSFSGRIGRSEFWYVIL---FNLIIGVVLFVLAFI--LGLSIAG------VL-VLLTALYLLALFLPTLALTVRRLH 76 (125) T ss_pred HHHCCCCCCCCHHHHHHHHH---HHHHHHHHHHHHHHH--HHHHHHH------HH-HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99438667664398999999---999999999999999--9888888------99-999999999999999999999998 Q ss_pred HHH Q ss_conf 999 Q gi|254781029|r 167 DMH 169 (1828) Q Consensus 167 ~~~ 169 (1828) |.. T Consensus 77 D~g 79 (125) T COG3152 77 DRG 79 (125) T ss_pred HCC T ss_conf 669 No 8 >TIGR02121 Na_Pro_sym sodium/proline symporter; InterPro: IPR011851 This family consists of the sodium/proline symporter (proline permease) from a number of Gram-negative and Gram-positive bacteria and from the archaeal genus Methanosarcina. ; GO: 0005298 proline:sodium symporter activity, 0031402 sodium ion binding, 0006814 sodium ion transport, 0015824 proline transport, 0016021 integral to membrane. Probab=90.15 E-value=0.81 Score=21.75 Aligned_cols=48 Identities=17% Similarity=0.396 Sum_probs=36.6 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999983162145557656406488999999999999999999 Q gi|254781029|r 108 SLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 108 s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 158 (1828) -++|+++++=+.+.|||+=..=|+.|++-. .| .|++.+++||+.+..+ T Consensus 159 ~gl~iga~~~v~YTFfGGfLAVSwTD~vQG--~l-M~~AL~~VPiva~~~~ 206 (517) T TIGR02121 159 TGLLIGALIVVIYTFFGGFLAVSWTDFVQG--LL-MFAALVLVPIVALLNL 206 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HH-HHHHHHHHHHHHHHCC T ss_conf 999999999886686557999999999999--99-9999999999987358 No 9 >pfam07900 DUF1670 Protein of unknown function (DUF1670). The hypothetical eukaryotic proteins found in this family are of unknown function. Probab=89.00 E-value=0.46 Score=23.88 Aligned_cols=25 Identities=24% Similarity=0.465 Sum_probs=11.2 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 9999985078878997899999999999 Q gi|254781029|r 1765 LNLQEQYKADSALRNAIDRYISNFEEML 1792 (1828) Q Consensus 1765 ~~i~~~y~~d~~f~~~v~ry~~~fe~~l 1792 (1828) .+|+|.-.--+ ++|||||.||.|-+ T Consensus 167 ~eiaR~t~HS~---~AvdRYi~df~rV~ 191 (220) T pfam07900 167 SEIARLTNHSP---ESIDRYIKDFTRVE 191 (220) T ss_pred HHHHHHHCCCH---HHHHHHHHHHHHHH T ss_conf 89999853798---99999998768999 No 10 >pfam07415 Herpes_LMP2 Gammaherpesvirus latent membrane protein (LMP2) protein. This family consists of several Gammaherpesvirus latent membrane protein (LMP2) proteins. Epstein-Barr virus is a human Gammaherpesvirus that infects and establishes latency in B lymphocytes in vivo. The latent membrane protein 2 (LMP2) gene is expressed in latently infected B cells and encodes two protein isoforms, LMP2A and LMP2B, that are identical except for an additional N-terminal 119 aa cytoplasmic domain which is present in the LMP2A isoform. LMP2A is thought to play a key role in either the establishment or the maintenance of latency and/or the reactivation of productive infection from the latent state. The significance of LMP2B and its role in pathogenesis remain unclear. Probab=87.94 E-value=1.3 Score=20.10 Aligned_cols=24 Identities=8% Similarity=0.083 Sum_probs=9.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 766666332001236888766310 Q gi|254781029|r 27 HENKKRHHSRKKHKYSRRDDSIES 50 (1828) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~ 50 (1828) +....+.++.......+++|+.++ T Consensus 25 g~~n~~yPS~fgss~~~pgpP~~e 48 (489) T pfam07415 25 GDSNPYYPSSFGSSWDRPGPPVPE 48 (489) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 888999987778778999998967 No 11 >PRK09752 adhesin; Provisional Probab=82.45 E-value=1.3 Score=19.91 Aligned_cols=12 Identities=17% Similarity=0.052 Sum_probs=6.1 Q ss_pred HHHHHHHHHHHH Q ss_conf 878997899999 Q gi|254781029|r 1775 SALRNAIDRYIS 1786 (1828) Q Consensus 1775 ~~f~~~v~ry~~ 1786 (1828) ..++..-|||+- T Consensus 1002 GQLkTQsNrYVv 1013 (1250) T PRK09752 1002 GQLAQHEDTSTV 1013 (1250) T ss_pred CCEEECCCEEEE T ss_conf 545623771377 No 12 >PRK08476 F0F1 ATP synthase subunit B'; Validated Probab=81.69 E-value=2.3 Score=17.96 Aligned_cols=26 Identities=23% Similarity=0.318 Sum_probs=15.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 64889999999999999999999999999 Q gi|254781029|r 137 RPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 137 ~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) .|++ |+..+|.=++|+|++--++|+- T Consensus 6 n~~~---~i~~~i~FliL~~iL~kflykP 31 (141) T PRK08476 6 NPYL---MLLTFVVFLLLIVILNSWLYKP 31 (141) T ss_pred CHHH---HHHHHHHHHHHHHHHHHHHHHH T ss_conf 6779---9999999999999999995878 No 13 >KOG2365 consensus Probab=80.75 E-value=2.5 Score=17.72 Aligned_cols=21 Identities=14% Similarity=0.351 Sum_probs=12.9 Q ss_pred HHHHHHCHHHHHHHHHHHHHH Q ss_conf 765640648899999999999 Q gi|254781029|r 131 FYDFAMRPEVITLFLVVSVVP 151 (1828) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~p 151 (1828) .--.|.+|.+-|+.+|+.+.| T Consensus 48 mmymFLsPmLWAVLvGt~L~P 68 (808) T KOG2365 48 MMYMFLSPMLWAVLVGTELVP 68 (808) T ss_pred HHHHHHHHHHHHHHHCCEEEC T ss_conf 999998689999986355311 No 14 >COG3898 Uncharacterized membrane-bound protein [Function unknown] Probab=80.44 E-value=2.5 Score=17.64 Aligned_cols=61 Identities=20% Similarity=0.223 Sum_probs=34.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHH-----HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999831621-----455576564064889999999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNS-----PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) ++.+++-+| ++.|+||+|+--.|+ +.+....++.+....+ .+++|+-|+++|-+...+|-+ T Consensus 3 RvlfflilV-~alg~gfawLadrPG~lsl~w~G~~~emslm~Aa~~--iva~vaav~llwwlv~~iw~s 68 (531) T COG3898 3 RVLFFLILV-AALGFGFAWLADRPGELSLIWQGQQYEMSLMVAASI--IVALVAAVLLLWWLVRSIWES 68 (531) T ss_pred HHHHHHHHH-HHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHC T ss_conf 689999999-999867899708986536886213688899999999--999999999999999999968 No 15 >pfam05653 DUF803 Protein of unknown function (DUF803). This family consists of several eukaryotic proteins of unknown function. Probab=78.99 E-value=2.7 Score=17.30 Aligned_cols=51 Identities=12% Similarity=0.031 Sum_probs=29.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 3227999999999999999999998316214555765640648899999999999999 Q gi|254781029|r 97 LNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILL 154 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 154 (1828) -+.|+||+..+..+=.-.+-+.++++.|.+...-++ .+++.+.++++|++| T Consensus 48 l~~~~Ww~G~~~m~~Gei~nF~Ay~FAPasLVtPLg-------alsvi~naila~~~L 98 (300) T pfam05653 48 LKEWLWWAGLLTMIVGEAANFAAYAFAPATLVTPLG-------ALSVIISAVLSSFFL 98 (300) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH-------HHHHHHHHHHHHHHH T ss_conf 508299987899998668889999831799886241-------689999999999997 No 16 >COG3158 Kup K+ transporter [Inorganic ion transport and metabolism] Probab=78.90 E-value=2.1 Score=18.29 Aligned_cols=17 Identities=12% Similarity=0.268 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 145 LVVSVVPILLFFSFFIM 161 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~ 161 (1828) +|-+|+|||++||.++- T Consensus 170 Vg~~FGPVm~lWF~~l~ 186 (627) T COG3158 170 VGKLFGPVMLLWFLALA 186 (627) T ss_pred HHHEECCHHHHHHHHHH T ss_conf 43300569999999999 No 17 >TIGR01992 PTS-IIBC-Tre PTS system, trehalose-specific IIBC component; InterPro: IPR011296 This entry represents the fused enzyme II B and C components of the trehalose-specific PTS sugar transporter system . Trehalose is converted to trehalose-6-phosphate in the process of translocation into the cell. These transporters lack their own IIA domains and instead use the glucose IIA protein (IIAglc or Crr) . The exceptions to this rule are Staphylococci and Streptococci which contain their own A domain as a C-terminal fusion. This family is closely related to the sucrose transporting PTS IIBC enzymes described by the IIB component and the IIC component (IPR004719 from INTERPRO), respectively. In Escherichia coli, Bacillus subtilis and Pseudomonas fluorescens the presence of this gene is associated with the presence of trehalase which degrades T6P to glucose and glucose-6-P. Trehalose may also be transported (in Salmonella) via the mannose PTS or galactose permease systems , or (in Sinorhizobium, Thermococcus and Sulfolobus, for instance) by ABC transporters , , .; GO: 0015574 trehalose transmembrane transporter activity, 0015771 trehalose transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=76.75 E-value=3.1 Score=16.82 Aligned_cols=14 Identities=36% Similarity=0.560 Sum_probs=6.5 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254781029|r 145 LVVSVVPILLFFSF 158 (1828) Q Consensus 145 ~~~~~~p~~~~~~~ 158 (1828) +.++|+|.+|-++| T Consensus 474 aIaIvVP~vLT~~~ 487 (489) T TIGR01992 474 AIAIVVPFVLTVFF 487 (489) T ss_pred HHHHHHHHHHHHHC T ss_conf 77776654885412 No 18 >PRK10745 trkD potassium transport protein Kup; Provisional Probab=76.63 E-value=3.2 Score=16.79 Aligned_cols=17 Identities=18% Similarity=0.360 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 146 VVSVVPILLFFSFFIMI 162 (1828) Q Consensus 146 ~~~~~p~~~~~~~a~~~ 162 (1828) |.+|+|||+.||+..-+ T Consensus 166 g~~FgPim~vWF~~l~~ 182 (622) T PRK10745 166 GKLFAPIMLTWFLILAV 182 (622) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 64650599999999999 No 19 >pfam03976 PPK2 Polyphosphate kinase 2 (PPK2). Inorganic polyphosphate (polyP) plays a role in metabolism and regulation and has been proposed to serve as a energy source in a pre-ATP world. In prokaryotes, the synthesis and utilisation of polyP are catalysed by PPK1, PPK2 and polyphosphatases. Proteins with a single PPK2 domain catalyse polyP-dependent phosphorylation of ADP to ATP, whereas proteins containing 2 fused PPK2 domains phosphorylate AMP to ADP. The structure of PPK2 from Pseudomonas aeruginosa has revealed a a 3-layer alpha/beta/alpha sandwich fold with an alpha-helical lid similar to the structures of microbial thymidylate kinases. Probab=75.73 E-value=3.3 Score=16.62 Aligned_cols=12 Identities=33% Similarity=0.664 Sum_probs=5.4 Q ss_pred CCCCCHHHHHHH Q ss_conf 745620677412 Q gi|254781029|r 1746 GEDDIFSKRLYT 1757 (1828) Q Consensus 1746 g~~~~f~~~~~~ 1757 (1828) |+=+||.|.=|+ T Consensus 89 G~I~iFdrSWY~ 100 (229) T pfam03976 89 GEIVLFDRSWYN 100 (229) T ss_pred CEEEEECCCHHH T ss_conf 818997462556 No 20 >pfam07739 TipAS TipAS antibiotic-recognition domain. This domain is found at the C-terminus of some MerR family transcription factors. The domain has an alpha-helical globin-like fold. The family includes Mta a central regulator of multidrug resistance in Bacillus subtilis. Probab=75.58 E-value=3.3 Score=16.59 Aligned_cols=52 Identities=19% Similarity=0.298 Sum_probs=31.7 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999750745620677412525998999999850788789978999999999999998 Q gi|254781029|r 1736 FVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSALRNAIDRYISNFEEMLSEIA 1796 (1828) Q Consensus 1736 ~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~~ 1796 (1828) .+..|..+...- |+ -....+.-|..-|..||.|+...|.|-..+-.-|+++. T Consensus 67 l~~~h~~~l~~~---~~------~~~e~~~~L~~mY~~D~rF~~~yd~~~~gla~fl~~Ai 118 (122) T pfam07739 67 LAERHRAWLNRF---FG------YSPEAHAGLGEMYVADPRFTAYYDKYGPGLAEFLRDAI 118 (122) T ss_pred HHHHHHHHHHHC---CC------CCHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHH T ss_conf 999999999870---69------99999999999971688899877312876999999999 No 21 >COG4420 Predicted membrane protein [Function unknown] Probab=75.57 E-value=3.3 Score=16.58 Aligned_cols=68 Identities=15% Similarity=0.225 Sum_probs=41.1 Q ss_pred CCCCHH-HHHHHHHC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 677989-89998541-873227999999999999999999998316214555765640648899999999999999 Q gi|254781029|r 81 DNGGIT-HILQNIGK-NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILL 154 (1828) Q Consensus 81 ~r~~~~-~~~~~l~~-~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 154 (1828) +-.|+| .+-..+-+ ..|.+-..+.+++=++|+++-+++... .-..-.-|+.+-.++.+ .+++.+|++| T Consensus 39 ~~~t~gdR~ad~ia~f~Gsw~fil~~~~~ll~Wi~lNl~~~~~-----~~wDpyPFi~LnLllS~-~AaiqAp~Il 108 (191) T COG4420 39 DGETFGDRVADKIARFGGSWAFILTFTLLLLLWIVLNLFLVPG-----LAWDPYPFILLNLLLST-LAAIQAPLIL 108 (191) T ss_pred CCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCC-----CCCCCCCHHHHHHHHHH-HHHHHHHHHH T ss_conf 2441425699999997377699999999999999997755217-----76789669999999999-9999876999 No 22 >pfam05977 DUF894 Bacterial protein of unknown function (DUF894). This family consists of several bacterial proteins, many of which are annotated as putative transmembrane transport proteins. Probab=74.70 E-value=3.5 Score=16.42 Aligned_cols=60 Identities=13% Similarity=0.077 Sum_probs=40.5 Q ss_pred CCCHHHHHHHHHHHH--HHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q ss_conf 732279999999999--99999999998316214555765640648899999999999999999999-99999 Q gi|254781029|r 96 SLNIALRNAFMSSLI--WLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFI-MISRA 165 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~--w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~-~~~r~ 165 (1828) |.=..||++.++|.+ |+- .++..|..+ ++..+|..+|+...+..+|+++|=.++= +.+|- T Consensus 10 ~~FR~lw~a~lvS~~Gtwmq-~VA~~Wlv~---------~lT~S~~~vglV~~a~~LP~~ll~l~aGvlADr~ 72 (524) T pfam05977 10 GTFRWIWIASLVSNLGWLMQ-TVAASWLMT---------DISASPLMVALVQAAATLPIFLLSIPAGAIADNF 72 (524) T ss_pred CCHHHHHHHHHHHHHHHHHH-HHHHHHHHH---------HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 22899999999999999999-999999987---------6508999999999999988999999999999822 No 23 >pfam10066 DUF2304 Uncharacterized conserved protein (DUF2304). Members of this family of hypothetical archaeal proteins have no known function. Probab=74.06 E-value=3.6 Score=16.30 Aligned_cols=71 Identities=17% Similarity=0.279 Sum_probs=32.7 Q ss_pred HHHHHHHHHHHHHH-HCCHHHHHHHHHHH--HCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999998-31621455576564--06488999999999999999999999999999999999999999973 Q gi|254781029|r 108 SLIWLACELGFLIK-FYTNSPMDSFYDFA--MRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRL 182 (1828) Q Consensus 108 s~~w~~~~~~~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl 182 (1828) ++.|+..++++.+. .|| .....++++. +.|.=+ +|..+ +.+++|-.|-..++-++=-+-+-+--.++|+|= T Consensus 33 ~l~Wl~~~~~~li~~ifP-~i~~~iA~~lGi~rg~n~-lf~~~--i~~l~~~~f~l~~~is~le~~it~L~~~iAi~e 106 (115) T pfam10066 33 LLFWLFFWIVVLLFALFP-KISGRIANLLGIGRGLDA-LFYLG--IGLLFYLLFKLYLKIDKLEREITKLVREIAIRE 106 (115) T ss_pred HHHHHHHHHHHHHHHHCC-CHHHHHHHHHCCCCCHHH-HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999875-188999999557730159-99999--999999999999999999999999999999872 No 24 >pfam04276 DUF443 Protein of unknown function (DUF443). Family of uncharacterized proteins. Probab=73.60 E-value=3.7 Score=16.22 Aligned_cols=10 Identities=10% Similarity=0.471 Sum_probs=3.8 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254781029|r 147 VSVVPILLFF 156 (1828) Q Consensus 147 ~~~~p~~~~~ 156 (1828) .+++||++|+ T Consensus 97 i~~l~vl~~r 106 (197) T pfam04276 97 IVILGVLILR 106 (197) T ss_pred HHHHHHHHHH T ss_conf 9999999886 No 25 >pfam11044 TMEMspv1-c74-12 Plectrovirus spv1-c74 ORF 12 transmembrane protein. This is a family of proteins expressed by Plectroviruses. The plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. Except that it is a putative transmembrane protein the function is not known. Probab=73.59 E-value=3.7 Score=16.22 Aligned_cols=22 Identities=9% Similarity=-0.041 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 79999999999999999999983 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) |.|++++||++-+. |+++.+++ T Consensus 2 ptwlt~ifsvviil-~if~~~gl 23 (49) T pfam11044 2 PTWLTTIFSVVIIL-GVFAWIGL 23 (49) T ss_pred CHHHHHHHHHHHHH-HHHHHHHH T ss_conf 14999999999999-99999999 No 26 >TIGR00869 sec62 protein translocation protein, Sec62 family; InterPro: IPR011553 Members of the NSCC2 family have been sequenced from various fungal and animal species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions. This family of Sec62 proteins is restricted to the Ascomycota. ; GO: 0008565 protein transporter activity, 0015031 protein transport, 0030176 integral to endoplasmic reticulum membrane. Probab=72.93 E-value=3.8 Score=16.10 Aligned_cols=27 Identities=19% Similarity=0.544 Sum_probs=20.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 648899999999999999999999999 Q gi|254781029|r 137 RPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 137 ~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) .=|+||+|++++|+=.+||-..+..++ T Consensus 159 ~lG~ig~FF~~ailRl~lfvlt~~~vk 185 (259) T TIGR00869 159 ALGIIGVFFAVAILRLILFVLTLVVVK 185 (259) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 788999999999999999997354201 No 27 >pfam11833 DUF3353 Protein of unknown function (DUF3353). This family of proteins are functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length. Probab=72.93 E-value=3.8 Score=16.10 Aligned_cols=34 Identities=9% Similarity=-0.200 Sum_probs=19.3 Q ss_pred HHHHHHHH---CCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98999854---18732279999999999999999999 Q gi|254781029|r 86 THILQNIG---KNSLNIALRNAFMSSLIWLACELGFL 119 (1828) Q Consensus 86 ~~~~~~l~---~~~s~~~~~~~~~~s~~w~~~~~~~~ 119 (1828) +...+.|. +++-|+..|.-..+++=|+.||+... T Consensus 125 ~~~iyfl~rk~~~~~rA~~~tl~~L~~G~~~Gs~l~~ 161 (193) T pfam11833 125 GACIYFLNRKGRRFGRALLWSLGGLVVGLLLGSLLAV 161 (193) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999988885031789999999999999999999987 No 28 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=72.80 E-value=3.8 Score=16.08 Aligned_cols=24 Identities=17% Similarity=0.170 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 999999999999999999983162 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTN 125 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~ 125 (1828) |+..+|-++=+|.|+|.+...+.+ T Consensus 3 ~vl~l~~ll~agi~~g~~~~~qqg 26 (400) T COG3071 3 RVLLLFVLLLAGIGVGLAIAGQQG 26 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 059999999999999999834678 No 29 >pfam07245 Phlebovirus_G2 Phlebovirus glycoprotein G2. This family consists of several Phlebovirus glycoprotein G2 sequences. Members of the Bunyaviridae family acquire an envelope by budding through the lipid bilayer of the Golgi complex. The budding compartment is thought to be determined by the accumulation of the two heterodimeric membrane glycoproteins G1 and G2 in the Golgi. Probab=72.54 E-value=3.9 Score=16.03 Aligned_cols=71 Identities=17% Similarity=0.054 Sum_probs=37.8 Q ss_pred CCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHH--HHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHH Q ss_conf 88775567798989998541873227999999999--9999999999983162145557656406488999999999999 Q gi|254781029|r 75 YPANDLDNGGITHILQNIGKNSLNIALRNAFMSSL--IWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPI 152 (1828) Q Consensus 75 ~pa~~d~r~~~~~~~~~l~~~~s~~~~~~~~~~s~--~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 152 (1828) +-+++|||+--|.. +.-.+|.....=+-.-||+ =|.+|.+-.. + ++.|.+++-| T Consensus 426 am~p~DdR~h~gg~--S~vvnPk~g~wd~~~Wfsgl~sW~ggpLkti---------------------l-~Ilg~i~l~i 481 (504) T pfam07245 426 AMNPFDDRRHEGGS--STVVNPKTGRWDFSSWFSGLVDWLGGPLKTI---------------------L-VILGFIALGI 481 (504) T ss_pred EECCCCHHHCCCCC--CCCCCCCCCCCCHHHHHHHHHHHCCCHHHHH---------------------H-HHHHHHHHHH T ss_conf 96475311015886--6317788885061576655566515508999---------------------9-9999999999 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 153 LLFFSFFIMISRARDMH 169 (1828) Q Consensus 153 ~~~~~~a~~~~r~~~~~ 169 (1828) +||+++-++++-+-.|- T Consensus 482 ~~~~~~i~l~~~~v~~~ 498 (504) T pfam07245 482 VLFVLLIILVRTGVGMI 498 (504) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999766465 No 30 >pfam11998 DUF3493 Protein of unknown function (DUF3493). This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 79 to 331 amino acids in length. Probab=71.41 E-value=4.1 Score=15.84 Aligned_cols=14 Identities=36% Similarity=0.123 Sum_probs=7.4 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 32279999999999 Q gi|254781029|r 97 LNIALRNAFMSSLI 110 (1828) Q Consensus 97 s~~~~~~~~~~s~~ 110 (1828) -|..+|+|+.+|+. T Consensus 20 lR~~ly~Af~aSa~ 33 (75) T pfam11998 20 LRRFLYLAFAASAG 33 (75) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999998779 No 31 >pfam11990 DUF3487 Protein of unknown function (DUF3487). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 121 to 136 amino acids in length. This protein has a conserved RLN sequence motif. Probab=71.35 E-value=4.1 Score=15.83 Aligned_cols=72 Identities=17% Similarity=0.193 Sum_probs=31.3 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH-HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8989998541873227999999999999999999998316214555-765640648899999999999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDS-FYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) |-.|...|.+-|. .+--.-.+=+|++++++++..+.-.-++.- ++.++..|+ .+.+.|++..|+-+-++. T Consensus 4 i~FLP~RLN~ePv---V~rGlT~~El~~~a~~g~~~gl~~g~~la~l~g~~~~~pt------~~ll~~~~~v~~gg~~L~ 74 (122) T pfam11990 4 IRFLPDRLNREPV---VFRGLTADELGVAAGLGAAAGLVVGLPLALLTGWIAMIPT------LALLGPILVVLVGGGLLR 74 (122) T ss_pred CCCCHHHCCCCCC---EECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHH T ss_conf 1117022178995---4306574799999999999999999999999987999989------999999999999689999 Q ss_pred HH Q ss_conf 99 Q gi|254781029|r 164 RA 165 (1828) Q Consensus 164 r~ 165 (1828) |- T Consensus 75 r~ 76 (122) T pfam11990 75 RL 76 (122) T ss_pred HH T ss_conf 98 No 32 >COG5615 Predicted integral membrane protein [Function unknown] Probab=70.30 E-value=3.3 Score=16.66 Aligned_cols=51 Identities=16% Similarity=0.294 Sum_probs=26.1 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 57656406488999999999999999999999999999999999999999973092555 Q gi|254781029|r 130 SFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLIDPEEY 188 (1828) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~~p~~~ 188 (1828) +..+|.+.|+-++++.+. .-+++||++-.+.---.+.|. -.+.|+..|+.. T Consensus 79 ~~~gf~~lt~ti~g~ill--~k~il~l~m~V~~~~~~~~~~------kli~a~d~e~~~ 129 (161) T COG5615 79 IFGGFALLTTTIGGMILL--YKVILVLIMIVLMYANNIYHG------KLIMALDWEKKY 129 (161) T ss_pred ECCCCCHHHHHHHHHHHH--HHHHHHHHHHHHHHHCCHHHH------HHHHCCCCHHHH T ss_conf 327701102478899999--999999999999986507778------999844924324 No 33 >PRK10414 biopolymer transport protein ExbB; Provisional Probab=70.09 E-value=4.3 Score=15.62 Aligned_cols=53 Identities=8% Similarity=0.186 Sum_probs=29.1 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 765640648899999999999999999999999999999999999999997309 Q gi|254781029|r 131 FYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLID 184 (1828) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~~ 184 (1828) +-+++.....++= .+.+++=++=+|.-++++.|.-.++.+-|....---.|.. T Consensus 7 i~~Lf~qad~vvk-~VmiiLli~SI~SW~II~~K~~~~~~~~r~~~~f~~~f~~ 59 (239) T PRK10414 7 VWGMYQHADIVVK-CVMIGLILASVVTWAIFFSKSVEFFNQKRRLKREQQLLAE 59 (239) T ss_pred HHHHHHCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999980899999-9999999999999999999999999999999999999836 No 34 >KOG4673 consensus Probab=69.93 E-value=4.4 Score=15.60 Aligned_cols=20 Identities=20% Similarity=0.132 Sum_probs=8.0 Q ss_pred CCCCCCCCCCCCCHHHHHHH Q ss_conf 87765444443352678753 Q gi|254781029|r 1677 SGKKTKNNHAIKEWFNKILS 1696 (1828) Q Consensus 1677 ~~~~~~~~~~~~~w~sdll~ 1696 (1828) +.......++...|.+|=+. T Consensus 808 SgSnee~ag~~~~f~~dd~s 827 (961) T KOG4673 808 SGSNEEIAGQNSAFENDDFS 827 (961) T ss_pred CCCHHHHHCCCCHHHCCCHH T ss_conf 87537674131013125302 No 35 >TIGR03348 VI_IcmF type VI secretion protein IcmF. Members of this protein family are IcmF homologs and tend to be associated with type VI secretion systems. Probab=66.73 E-value=5 Score=15.11 Aligned_cols=17 Identities=29% Similarity=0.407 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 27999999999999999 Q gi|254781029|r 99 IALRNAFMSSLIWLACE 115 (1828) Q Consensus 99 ~~~~~~~~~s~~w~~~~ 115 (1828) +.+++..+..++|.+|. T Consensus 2 ~~lg~~~l~~~iW~~GP 18 (1169) T TIGR03348 2 SLLGLILLCILIWWAGP 18 (1169) T ss_pred CHHHHHHHHHHHHHHCC T ss_conf 58999999999999711 No 36 >COG4239 ABC-type uncharacterized transport system, permease component [General function prediction only] Probab=66.22 E-value=5.1 Score=15.04 Aligned_cols=58 Identities=14% Similarity=0.181 Sum_probs=38.6 Q ss_pred HHHHHHHHHHHHHHHHHCCHH----HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999831621----45557656406488999999999999999999999999 Q gi|254781029|r 106 MSSLIWLACELGFLIKFYTNS----PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 106 ~~s~~w~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++|++ ++.++|...++||+- ....+.-..++|.+.-+...+.++|.-|+|.+++|.=- T Consensus 152 ~~Sal-iGv~~GA~qGyfgg~vdL~~QRfIEvws~mP~lyllii~as~~~p~f~~ll~i~llf 213 (341) T COG4239 152 LISAL-IGVLAGALQGYFGGWVDLLGQRFIEVWSGMPTLYLLIILASIFQPNFWWLLGIMLLF 213 (341) T ss_pred HHHHH-HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999-999888775341300488770599998447299999999999685479999999999 No 37 >PRK01637 rbn ribonuclease BN; Reviewed Probab=65.69 E-value=5.2 Score=14.96 Aligned_cols=17 Identities=18% Similarity=0.424 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 141 ITLFLVVSVVPILLFFS 157 (1828) Q Consensus 141 ~~~~~~~~~~p~~~~~~ 157 (1828) ++-+....+.|.+++.+ T Consensus 31 lay~~lLSlfP~l~~~~ 47 (284) T PRK01637 31 LAYVTLLSLVPLVAVVF 47 (284) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999999 No 38 >TIGR01946 rnfD electron transport complex, RnfABCDGE type, D subunit; InterPro: IPR011303 The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation . A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized . This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase, but is distinct. This entry describes the D subunit.; GO: 0006118 electron transport, 0016021 integral to membrane. Probab=65.34 E-value=5.3 Score=14.91 Aligned_cols=39 Identities=5% Similarity=0.036 Sum_probs=25.3 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHH Q ss_conf 873227999999999999999999998316214555765640648899 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVIT 142 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (1828) =|...|||..-+..++= +.++..+||+ +|+-+.+|+++| T Consensus 82 iPp~~PwW~~~iG~~fA----i~~gKhlYGG-----LGqN~FNPAmvG 120 (346) T TIGR01946 82 IPPLAPWWMVVIGTVFA----IVLGKHLYGG-----LGQNIFNPAMVG 120 (346) T ss_pred HHHHHHHHHHHHHHHHH----HHHHHHHCCC-----CCCCCCCHHHHH T ss_conf 53310899999999999----9998754066-----688743179999 No 39 >KOG3590 consensus Probab=65.24 E-value=4.3 Score=15.66 Aligned_cols=17 Identities=18% Similarity=0.022 Sum_probs=9.4 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 146 VVSVVPILLFFSFFIMI 162 (1828) Q Consensus 146 ~~~~~p~~~~~~~a~~~ 162 (1828) -++.-|+.|.||.-.+- T Consensus 117 ~il~D~~~LSYFiQYLD 133 (602) T KOG3590 117 QVLHDTIVLSYFIQYLD 133 (602) T ss_pred HHHCCHHHHHHHHHHHH T ss_conf 99427199999999984 No 40 >TIGR00328 flhB flagellar biosynthetic protein FlhB; InterPro: IPR006136 The flhB and flhA genes constitute an operon called flhB operon on the Salmonella typhimurium chromosome. They, together with FliI and FliH, may constitute the export apparatus of flagellin, the component protein of flagellar filament. The flhB gene encodes a highly hydrophobic polypeptide with several potential membrane-spanning segments, suggesting that it may be an integral membrane protein .; GO: 0009306 protein secretion, 0016020 membrane. Probab=65.08 E-value=5.3 Score=14.88 Aligned_cols=19 Identities=5% Similarity=-0.171 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFL 119 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~ 119 (1828) ..+..+++++|..|+-++. T Consensus 31 ~~ll~~~~~l~~~~~~~~~ 49 (352) T TIGR00328 31 AVLLAGVLVLLFFGSEGLA 49 (352) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999999999998899999 No 41 >pfam11124 Pho86 Inorganic phosphate transporter Pho86. Pho86p is an ER protein which is produced in response to phosphate starvation. It is essential for growth when phosphate levels are limiting. Pho86p is also involved in the regulation of Pho84p, a high-affinity phosphate transporter which is localized to the endoplasmic reticulum (ER) in low phosphate medium. When the level of phosphate increases Pho84p is transported to the vacuole. Pho86p is required for packaging of Pho84p in to COPII vesicles. Probab=64.91 E-value=5.4 Score=14.85 Aligned_cols=89 Identities=10% Similarity=0.092 Sum_probs=51.6 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHH-HHHHH----HHH--HCHHHHHHHHHHHHHHHHHHHH Q ss_conf 8989998541873227999999999999999999998316214-55576----564--0648899999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSP-MDSFY----DFA--MRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~-~~~~~----~~~--~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) .|--++-+|..-.+..+|--..+|++|+++.+.+++.++-+-. ..++. .++ -.+..+- .+++.=+..+++ T Consensus 35 ~~Df~kQ~Qsl~Nk~v~wHPi~~~v~~~~~~~~~~~~l~~~~~is~s~~e~l~~~~~~nkk~~~~---~~~~~~~~~~~v 111 (296) T pfam11124 35 HGDFYKQLQALLNKYLFWHPISISVLVIALPLYLAYKLWDPINISNSLIEWLYQFFLKNKKDFIF---QLIYTLPSTSFV 111 (296) T ss_pred HHHHHHHHHHHHCCEEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHH---HHHHHHHHHHHH T ss_conf 08899999987420355362789999999999978775000133533789999999971336899---999999999999 Q ss_pred HHHHHH----HHHHHHHHHHHHH Q ss_conf 999999----9999999999999 Q gi|254781029|r 158 FFIMIS----RARDMHDASQSIA 176 (1828) Q Consensus 158 ~a~~~~----r~~~~~~~a~~m~ 176 (1828) |+.+-+ -+-+|+.++-.+. T Consensus 112 Ft~~~~~a~lvtd~l~~~~~k~~ 134 (296) T pfam11124 112 FTLLGRLAYLVSDELFKIAIKLV 134 (296) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999987589998787641 No 42 >PRK13825 conjugal transfer protein TraB; Provisional Probab=64.15 E-value=5.5 Score=14.75 Aligned_cols=16 Identities=6% Similarity=0.032 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781029|r 147 VSVVPILLFFSFFIMI 162 (1828) Q Consensus 147 ~~~~p~~~~~~~a~~~ 162 (1828) +++.|-...|+++.|+ T Consensus 136 GvlFPG~Gw~GL~a~~ 151 (389) T PRK13825 136 GVLFPGWGWWGLGALT 151 (389) T ss_pred HHHHHCCHHHHHHHHH T ss_conf 6441101799999999 No 43 >pfam02526 GBP_repeat Glycophorin-binding protein. This family contains glycophorin binding proteins from P. falciparum the malarial parasite. Glycophorin is a cell surface protein of erythrocytes. The Glycophorin binding protein contains a tandem 38 residue repeat. In GBP-130 from Plasmodium falciparum the repeat occurs 11 times. Probab=64.13 E-value=5 Score=15.13 Aligned_cols=19 Identities=21% Similarity=0.459 Sum_probs=16.3 Q ss_pred HHHHHHHCCHHHHHHHHHH Q ss_conf 9999850788789978999 Q gi|254781029|r 1766 NLQEQYKADSALRNAIDRY 1784 (1828) Q Consensus 1766 ~i~~~y~~d~~f~~~v~ry 1784 (1828) .|-+-|-+||++|-||+-+ T Consensus 8 qimk~yaadpeyrkh~~v~ 26 (38) T pfam02526 8 QIMKAYAADPEYRKHLNVL 26 (38) T ss_pred HHHHHHHCCHHHHHHHHHH T ss_conf 8999983287999999999 No 44 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=64.12 E-value=5.5 Score=14.74 Aligned_cols=16 Identities=25% Similarity=0.542 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781029|r 150 VPILLFFSFFIMISRA 165 (1828) Q Consensus 150 ~p~~~~~~~a~~~~r~ 165 (1828) +|++...+++.+.|++ T Consensus 95 IPilIIi~Lav~T~~~ 110 (305) T PRK10525 95 VPILIIIFLAVLTWKT 110 (305) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999988 No 45 >PRK12505 putative monovalent cation/H+ antiporter subunit B; Reviewed Probab=63.53 E-value=5.6 Score=14.66 Aligned_cols=20 Identities=30% Similarity=0.368 Sum_probs=9.3 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781029|r 141 ITLFLVVSVVPILLFFSFFI 160 (1828) Q Consensus 141 ~~~~~~~~~~p~~~~~~~a~ 160 (1828) +|+.+.+.-+=+.+||.||. T Consensus 124 lGVgltV~~vmi~lF~afa~ 143 (156) T PRK12505 124 LFIGAIVASVIIGLFFALAA 143 (156) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 68999999999999999856 No 46 >TIGR00828 EIID-AGA PTS system, mannose/fructose/sorbose family, IID component; InterPro: IPR004704 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . Bacterial PTS transporters transport and concomitantly phosphorylate their sugar substrates, and typically consist of multiple subunits or protein domains.The Man family is unique in several respects among PTS permease families. It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. The mannose permease of Escherichia coli, for example, can transport and phosphorylate glucose, mannose, fructose, glucosamine,N-acetylglucosamine, and other sugars. Other members of this can transport sorbose, fructose and N-acetylglucosamine. This family is specific for the IID subunits of this family of PTS transporters.; GO: 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016021 integral to membrane. Probab=63.42 E-value=5 Score=15.12 Aligned_cols=20 Identities=30% Similarity=0.458 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 889999999999999999999 Q gi|254781029|r 139 EVITLFLVVSVVPILLFFSFF 159 (1828) Q Consensus 139 ~~~~~~~~~~~~p~~~~~~~a 159 (1828) ..+|+ -|-+++|++||++|- T Consensus 126 aslA~-~GnllGPlLFF~~~N 145 (272) T TIGR00828 126 ASLAL-SGNLLGPLLFFFLFN 145 (272) T ss_pred HHHHH-CCHHHHHHHHHHHHH T ss_conf 77861-320355899999999 No 47 >MTH00213 ND6 NADH dehydrogenase subunit 6; Provisional Probab=63.27 E-value=5.7 Score=14.63 Aligned_cols=52 Identities=19% Similarity=0.282 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999831621--455576564064889999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNS--PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) |-.-++.|++|+.. +++ +.+ .+--..+|+++-=+| |++|++ - ++|.|.-||+ T Consensus 43 ~s~n~~~SVfwLIL-vF~-----nsSgLFILLG~dFlALIFLI-VYVGAI--A-ILFLFVIMML 96 (312) T MTH00213 43 YSHNFLASVFWLIL-AFI-----GSSGLFIVLGMEFLGLIFLI-VYVGAI--C-IIFLFVIMMI 96 (312) T ss_pred CCCCHHHHHHHHHH-HHH-----HHHHHHHHHHHHHHHHHHHH-HHHHHH--H-HHHHHHHHHC T ss_conf 34441999999999-997-----24699999821999999999-999799--9-9999999943 No 48 >PRK10633 hypothetical protein; Provisional Probab=62.76 E-value=5.8 Score=14.56 Aligned_cols=54 Identities=17% Similarity=0.268 Sum_probs=30.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 103 NAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) +|..+.++..++ ++..++++++. .++.+.|.=. +..-+++|++|+...-.|++. T Consensus 14 ~al~Ltl~yf~~--W~~~AY~~~~~----~~~~GlP~WF--~~SCi~~p~lf~~l~~~mVk~ 67 (80) T PRK10633 14 WALGLTLLYLAG--WLVAAYLPGVA----PGFTGLPRWF--EMACLLTPLLFILLCWAMVKF 67 (80) T ss_pred HHHHHHHHHHHH--HHHHHCCCCCC----CCCCCCHHHH--HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999--99998077886----5556770999--999999999999999999999 No 49 >PRK10999 malF maltose transporter membrane protein; Provisional Probab=61.94 E-value=6 Score=14.45 Aligned_cols=15 Identities=20% Similarity=0.197 Sum_probs=5.1 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254781029|r 150 VPILLFFSFFIMISR 164 (1828) Q Consensus 150 ~p~~~~~~~a~~~~r 164 (1828) +|-++=.+.+.|+|| T Consensus 329 lP~avP~~is~lvw~ 343 (520) T PRK10999 329 LPYAVPSFISILIFK 343 (520) T ss_pred HHHHHHHHHHHHHHH T ss_conf 998999999999999 No 50 >COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms] Probab=61.38 E-value=6.1 Score=14.37 Aligned_cols=16 Identities=25% Similarity=0.231 Sum_probs=6.8 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781029|r 103 NAFMSSLIWLACELGF 118 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~ 118 (1828) ++-+.+++|++|++.. T Consensus 97 i~~i~l~lWi~g~li~ 112 (337) T COG4219 97 IANILLLLWIVGALIG 112 (337) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999999 No 51 >pfam07181 VirC2 VirC2 protein. This family consists of several VirC2 proteins which seem to be found exclusively in Agrobacterium species and Rhizobium etli. VirC2 is known to be involved in virulence in Agrobacterium species but its exact function is unclear. Probab=60.71 E-value=6.3 Score=14.29 Aligned_cols=38 Identities=32% Similarity=0.464 Sum_probs=26.3 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 52599899999985078878997899999999999999 Q gi|254781029|r 1758 IKGQKVFLNLQEQYKADSALRNAIDRYISNFEEMLSEI 1795 (1828) Q Consensus 1758 ~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~ 1795 (1828) +.--++||.+-..|..-..+.--.-|-+.|||.||.+- T Consensus 93 p~vSk~yD~Lilqys~sksLQmILRRAL~Dfe~MLadG 130 (202) T pfam07181 93 PEVSKIYDNLILQYSASKSLQMILRRALDDFENMLADG 130 (202) T ss_pred CCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 52788999898870704689999999998999997268 No 52 >PRK00302 lnt apolipoprotein N-acyltransferase; Reviewed Probab=60.62 E-value=6.3 Score=14.27 Aligned_cols=25 Identities=16% Similarity=0.114 Sum_probs=13.2 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9998541873227999999999999 Q gi|254781029|r 88 ILQNIGKNSLNIALRNAFMSSLIWL 112 (1828) Q Consensus 88 ~~~~l~~~~s~~~~~~~~~~s~~w~ 112 (1828) ++..+.+.+.+..++.+.++...|- T Consensus 43 l~~~~~~~~~~~~f~~g~~~G~~~f 67 (509) T PRK00302 43 LLLLLRGASPKQAALLGFLWGFGYF 67 (509) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 9999938876899999999999999 No 53 >pfam10028 DUF2270 Predicted integral membrane protein (DUF2270). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=60.49 E-value=6.3 Score=14.26 Aligned_cols=33 Identities=21% Similarity=0.270 Sum_probs=25.7 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 8989998541873227999999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGF 118 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~ 118 (1828) .-.++.++.+|=-|+=.|+=++..+.|++- +.+ T Consensus 112 ~is~~eAl~rRLrRnY~wi~~ill~AW~~K-i~~ 144 (185) T pfam10028 112 KVSLLEAIGRRLRRNYLWILAILLLAWLLK-IAI 144 (185) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHC T ss_conf 365999999999988999999999999999-961 No 54 >COG3336 Predicted membrane protein [Function unknown] Probab=60.42 E-value=6.3 Score=14.25 Aligned_cols=52 Identities=17% Similarity=0.245 Sum_probs=28.0 Q ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999831621455576564064889999999999999999999 Q gi|254781029|r 106 MSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFF 159 (1828) Q Consensus 106 ~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a 159 (1828) .=-+.|+.|++++..+.- +++...+.....--....++...|+|++|..+.- T Consensus 47 ~r~~~~~lG~val~~a~~--Spv~~~g~~mfs~HMa~hm~L~mvappLlVLg~P 98 (299) T COG3336 47 GRVILWLLGCVALYGAVG--SPVRAYGHAMFSLHMAEHMLLMMVAPPLLVLGAP 98 (299) T ss_pred HHHHHHHHHHHHHHHHHC--CCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 899999999999999707--7155534532799999999999888899997061 No 55 >COG3402 Uncharacterized conserved protein [Function unknown] Probab=59.85 E-value=6.4 Score=14.18 Aligned_cols=56 Identities=11% Similarity=0.148 Sum_probs=26.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 799999999999999999999831-621455576564064889999999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFY-TNSPMDSFYDFAMRPEVITLFLVVSVVPILLFF 156 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 156 (1828) +.|....++++|+..-++..+.+| ++++-.+...+.+.- ++|+++-.+|+|.++-| T Consensus 18 ~~~~~i~~~l~Ll~av~~~~~~~~~~~~~~w~~~a~~av~-l~~~vv~l~iiP~~~Ry 74 (161) T COG3402 18 RVQEWIPIALVLLIAVAAGVLLYFVGLDPNWSSVAAVAVI-LLAAVVTLFIIPQLVRY 74 (161) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHH-HHHHHHHHHHHHHHHHH T ss_conf 8999999999999999999787400468850789999999-99999986223688754 No 56 >COG3462 Predicted membrane protein [Function unknown] Probab=59.51 E-value=6.5 Score=14.13 Aligned_cols=40 Identities=13% Similarity=0.160 Sum_probs=18.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHCCHH Q ss_conf 99999999999999999999999--99999999999730925 Q gi|254781029|r 147 VSVVPILLFFSFFIMISRARDMH--DASQSIAGIALRLIDPE 186 (1828) Q Consensus 147 ~~~~p~~~~~~~a~~~~r~~~~~--~~a~~m~~~a~rl~~p~ 186 (1828) .+++-|++||.+...-|-.+.++ +.+|.|.=.--|+|--| T Consensus 61 li~lvvfm~~~~g~~r~~~~~d~~e~~sRA~eIlkER~AkGE 102 (117) T COG3462 61 LIFLVVFMFYILGAVRRGSDDDDAERGSRAEEILKERYAKGE 102 (117) T ss_pred HHHHHHHHHHHHHHHCCCCHHCCCCCCCHHHHHHHHHHHCCC T ss_conf 999999999999873154200033253179999999973478 No 57 >PRK06251 V-type ATP synthase subunit K; Validated Probab=59.44 E-value=6.5 Score=14.12 Aligned_cols=48 Identities=15% Similarity=0.269 Sum_probs=25.6 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHH----------HHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999998316214555765----------640648899999999999999999 Q gi|254781029|r 108 SLIWLACELGFLIKFYTNSPMDSFYD----------FAMRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 108 s~~w~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) ++-.++.|.|++.+.-+...++.+.. |.+.|..|+++ +.+ +-+++.|+ T Consensus 42 avGLA~iGaG~AvG~~gaAaig~iaE~~e~fg~~Lifv~laEgIaiy-Gll-iaiLilf~ 99 (101) T PRK06251 42 AVGLAAIGAGVAVGMAAAAGIGVLTERRDMFGTVLIFVAIGEGIAVY-GIL-FAVLMLFG 99 (101) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHCCEEEEEEHHHHHHHH-HHH-HHHHHHHC T ss_conf 88888850013200002756634303746517464443125689999-999-99999954 No 58 >pfam11712 Vma12 Endoplasmic reticulum-based factor for assembly of V-ATPase. The yeast vacuolar proton-translocating ATPase (V-ATPase) is the best characterized member of the V-ATPase family. A total of thirteen genes are required for encoding the subunits of the enzyme complex itself and an additional three for providing factors necessary for the assembly of the whole. Vma12 is one of these latter, all three of which are localized to the endoplasmic reticulum. Probab=59.35 E-value=6.6 Score=14.11 Aligned_cols=77 Identities=8% Similarity=0.109 Sum_probs=39.7 Q ss_pred CCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 55677989899985418732279999999999999999999983162145557656406488999999999999999999 Q gi|254781029|r 79 DLDNGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 79 ~d~r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 158 (1828) .++..++++..+++.+.++- .+=.++|++=++.++++...+.. +. . + ..-.++|++ +++++=|+=+|++ T Consensus 58 ~~~~~t~~~~~~~~~~q~~~---v~Ni~vSV~~~~~~~~~~~~~~~--~~---s-~-~~Rvll~l~-~allV~vAEv~vy 126 (137) T pfam11712 58 SDEDDTIGQEYKEVKRQLST---VLNILVSVVSTFFAGWYWARYSF--GL---S-I-PARVLLGLF-FALLVLVAEVVVY 126 (137) T ss_pred CCCCCCHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHC--CC---C-H-HHHHHHHHH-HHHHHHHHHHHHH T ss_conf 77765668888899877999---99999999999999999999813--75---4-4-699999999-9999999999999 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254781029|r 159 FIMISRAR 166 (1828) Q Consensus 159 a~~~~r~~ 166 (1828) +...|+-- T Consensus 127 ~~yl~kve 134 (137) T pfam11712 127 VIYLRKVE 134 (137) T ss_pred HHHHHHHH T ss_conf 99998875 No 59 >PRK13862 putative crown gall tumor protein VirC2; Provisional Probab=59.34 E-value=6.6 Score=14.11 Aligned_cols=39 Identities=28% Similarity=0.440 Sum_probs=26.5 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 252599899999985078878997899999999999999 Q gi|254781029|r 1757 TIKGQKVFLNLQEQYKADSALRNAIDRYISNFEEMLSEI 1795 (1828) Q Consensus 1757 ~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~ 1795 (1828) -+.--++||.+-..|..-..+.--.-|-+.|||.||.+- T Consensus 92 Ap~vSk~yD~Lilqys~sksLQmILRRAL~Dfe~MLadG 130 (201) T PRK13862 92 APEVSKIYDNLILQYSPSKSLQMILRRALDDFENMLADG 130 (201) T ss_pred CCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 852788999898870704689999999998999997267 No 60 >TIGR03008 pepcterm_CAAX CAAX prenyl protease-related protein; InterPro: IPR014346 In eukaryotes, CAAX prenyl protease catalyzes three covalent modifications, including the cleavage and acylation at the C-terminus of certain proteins in a process connected to protein sorting. This entry describes a bacterial protein homologous to one domain of the CAAX-processing enzyme. Members of this protein are found in genomes that carry a predicted protein sorting system, PEP-CTERM/exosortase, usually in the vicinity of the EpsH homologue that is the hallmark of the system. The function of this protein is unknown, but it may relate to protein modification.. Probab=59.32 E-value=6.6 Score=14.11 Aligned_cols=63 Identities=19% Similarity=0.230 Sum_probs=35.8 Q ss_pred CCCCCHHHHHHHHHH--HHHHHHHHHHHHHHCC-HH--HHHHHHHHHHCHHHHH--HHHHHHHHHHH--HHH Q ss_conf 187322799999999--9999999999998316-21--4555765640648899--99999999999--999 Q gi|254781029|r 94 KNSLNIALRNAFMSS--LIWLACELGFLIKFYT-NS--PMDSFYDFAMRPEVIT--LFLVVSVVPIL--LFF 156 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s--~~w~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~--~~~~~~~~p~~--~~~ 156 (1828) .||+..++=+++-+. ++|+..=..+++.--| +. +-.-..+....+.+++ ++=++.||||| +|| T Consensus 72 ~r~~~~~~s~~vGv~Vf~lW~~lD~~l~f~g~pG~Gf~p~~i~~~gl~r~~L~afRl~GAtLvVPv~EElFW 143 (233) T TIGR03008 72 LRPRHLLFSIAVGVAVFVLWVNLDWLLAFQGEPGAGFDPSQIGNDGLIRWVLIAFRLAGATLVVPVMEELFW 143 (233) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 558899999999999999999852533205888627883335645542268999999989999999999898 No 61 >pfam07098 DUF1360 Protein of unknown function (DUF1360). This family consists of several bacterial proteins of around 115 residues in length. Members of this family are found in Bacillus species and Streptomyces coelicolor, the function of the family is unknown. Probab=58.95 E-value=6.7 Score=14.06 Aligned_cols=32 Identities=19% Similarity=0.155 Sum_probs=26.0 Q ss_pred CCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 677989899985418732279999999999999999999983162 Q gi|254781029|r 81 DNGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTN 125 (1828) Q Consensus 81 ~r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~ 125 (1828) -|+.+|.|| +-||++. +|+++++.+.+.++|. T Consensus 50 ~r~~iGeLl---------sC~wC~g----vW~aa~l~~gy~~~P~ 81 (105) T pfam07098 50 LRHWIGELL---------SCYWCTG----VWVAAFLALGYVFAPR 81 (105) T ss_pred HHHHHHHHH---------CCCCHHH----HHHHHHHHHHHHHHHH T ss_conf 477898883---------4800899----9999999999998798 No 62 >pfam06738 DUF1212 Protein of unknown function (DUF1212). This family represents a conserved region within a number of hypothetical proteins of unknown function found in eukaryotes, bacteria and archaea. Some family members are membrane proteins. Probab=58.62 E-value=6.7 Score=14.02 Aligned_cols=46 Identities=4% Similarity=0.021 Sum_probs=25.7 Q ss_pred CCCHHHHHHHH---HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 77989899985---418732279999999999999999999983162145557 Q gi|254781029|r 82 NGGITHILQNI---GKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSF 131 (1828) Q Consensus 82 r~~~~~~~~~l---~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~ 131 (1828) .-++-.....| .+.|.+-|+|+-.++ |+.++++++.. |++++...+ T Consensus 81 ~~~~~ea~~~L~~I~~~~~~y~~~~~~l~---~~~~~~~fa~l-~gG~~~~~~ 129 (193) T pfam06738 81 EISLEEAHERLDEIDRAPPPYPRWLVVLA---AGLASAAFAVL-FGGDWIDFL 129 (193) T ss_pred CCCHHHHHHHHHHHHHCCCCCCHHHHHHH---HHHHHHHHHHH-HCCCHHHHH T ss_conf 99999999999999857889987999999---99999999999-879779999 No 63 >COG2966 Uncharacterized conserved protein [Function unknown] Probab=58.38 E-value=6.8 Score=13.99 Aligned_cols=111 Identities=16% Similarity=0.175 Sum_probs=59.5 Q ss_pred CCCCHHH---HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHH--HHHHHHHHHHHHH Q ss_conf 6779898---9998541873227999999999999999999998316214555765640648899--9999999999999 Q gi|254781029|r 81 DNGGITH---ILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVIT--LFLVVSVVPILLF 155 (1828) Q Consensus 81 ~r~~~~~---~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~p~~~~ 155 (1828) ++=++.. -|.-|++.|.+-|+|+..+++++ .++++++.++| .++..+..|++.-.+.. .++.-..-|-.|| T Consensus 102 ~~~~~e~a~~~l~~i~~~~~~y~~~l~~~~~g~---~~~~f~~l~gG-~w~d~~iaf~~~~~~~~~~~~l~r~~~~~~~~ 177 (250) T COG2966 102 GRLDLEEAHKKLDEIQKQPLRYSRWLVLLMAGL---AAAAFALLFGG-GWLDFLIAFFAGLLGFLLRQYLSRKGNPDFFF 177 (250) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHH---HHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 887799999999876508655666899999999---99999999089-53888999999999999999999861422699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-CCHHHHHHHHHH------HHHHHHHH Q ss_conf 999999999999999999999999973-092555699999------99999999 Q gi|254781029|r 156 FSFFIMISRARDMHDASQSIAGIALRL-IDPEEYSSEKMQ------SISSAVRK 202 (1828) Q Consensus 156 ~~~a~~~~r~~~~~~~a~~m~~~a~rl-~~p~~~a~~~~~------~~~qa~r~ 202 (1828) =++|..+- -.+|+-..-+ ++ .-|..+..-.|| .+-.|||+ T Consensus 178 ~~~a~~~~-----~~~a~~~~~~--~~~~~~~~~i~~si~~lvPG~~l~nav~d 224 (250) T COG2966 178 EVLASFIA-----SIVAVLFGSL--GLGITPDALIIGSIMLLVPGVPLTNAVRD 224 (250) T ss_pred HHHHHHHH-----HHHHHHHHHC--CCCCCCHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999999-----9999999870--34677029999888987486488999999 No 64 >pfam03907 Spo7 Spo7-like protein. S. cerevisiae Spo7 has an unknown function, but has a role in formation of a spherical nucleus and meiotic division. Probab=58.27 E-value=6.8 Score=13.97 Aligned_cols=68 Identities=16% Similarity=0.047 Sum_probs=34.4 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9854187322799999999999999999999831621455576564064889999999999999999999999 Q gi|254781029|r 90 QNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 90 ~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) |.++-|-.+--|.+=...=++|++. ....++|.|..-++. .+.-.-.-+++ .+.++-++|||+-.++- T Consensus 20 Q~l~lrarrrk~t~FL~~L~~~ia~--~~y~Lf~~p~~~~~~-~~~~~~~k~~l--~~~vvTl~Lf~~SG~y~ 87 (169) T pfam03907 20 QYLQLRARRRQFTFFLSLLAGWAAY--FFYELYFRPREDGSG-GLVRMILKFCL--IFGVVTLLLFHLSGQYR 87 (169) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHEEECCCCCCCC-HHHHHHHHHHH--HHHHHHHHHHHHCCEEE T ss_conf 9999999999999999999999999--988335457344740-79999999999--99999999985046155 No 65 >pfam11234 DUF3036 Protein of unknown function (DUF3036). Some members in this family of proteins are annotated as yoaS. Currently no function is known. Probab=57.75 E-value=6.9 Score=13.91 Aligned_cols=23 Identities=22% Similarity=0.065 Sum_probs=13.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 87322799999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELG 117 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~ 117 (1828) |.-+.+-.+|.+.|++|++++.. T Consensus 85 ~~lk~Ik~ca~~is~l~~~~~p~ 107 (155) T pfam11234 85 RALKNIKYCAIAISLLYFALLPL 107 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999899 No 66 >TIGR03109 exosortase_1 exosortase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. We designate this, the most common type so far, exosortase 1. We propose the gene symbol xrtA, analogous to srtA for the most common type of sortase in Gram-positive bacteria. Probab=57.74 E-value=6.9 Score=13.91 Aligned_cols=108 Identities=14% Similarity=0.083 Sum_probs=43.5 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH--HH Q ss_conf 9985418732279999999999999999999983162145557656406488999999999999999999999999--99 Q gi|254781029|r 89 LQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR--AR 166 (1828) Q Consensus 89 ~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r--~~ 166 (1828) ++.+..+||........+..++|+.|..+-...+..-+....+.+....-.+... .-.+.++++|.+|+.=++- .. T Consensus 59 l~~~~~~~~~~gl~~l~~~~~~~~~g~~~~~~~~~~~sl~~~l~g~~~~~~G~~~--~r~~~FPl~fL~f~vP~p~~l~~ 136 (267) T TIGR03109 59 LAKLAPQPSPLGLLLLIAAGLAWLLGEAASVSLVAQLSLVALLVFSVIALLGPRV--ARALAFPLAFLLFAVPVGDFLIP 136 (267) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHH--HHHHHHHHHHHHHHCCCHHHHHH T ss_conf 5037888873799999999999999998668999999999999999999905999--99999999999998575899999 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 99999999999999730925556999999999 Q gi|254781029|r 167 DMHDASQSIAGIALRLIDPEEYSSEKMQSISS 198 (1828) Q Consensus 167 ~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~q 198 (1828) -|+..+-..+.-.+++.-=...-+.-+..+++ T Consensus 137 pLq~~~a~~~~~~l~~~Gipv~reG~~i~l~~ 168 (267) T TIGR03109 137 PLQDITAFFTVALLSATGIPVYREGVFIHIPS 168 (267) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEEEEEECC T ss_conf 99999999999999984998799868999888 No 67 >pfam08566 Pam17 Mitochondrial import protein Pam17. The presequence translocase-associated motor (PAM) drives the completion of preprotein translocation into the mitochondrial matrix. The Pam17 subunit is required for formation of a stable complex between cochaperones Pam16 and Pam18 and promotes the association of Pam16-Pam18 with the presequence translocase. Mitochondria lacking Pam17 are selectively impaired in the import of matrix proteins. Probab=57.56 E-value=7 Score=13.89 Aligned_cols=63 Identities=13% Similarity=-0.015 Sum_probs=36.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHH-------HHHHHHHC--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999831621455-------57656406--488999999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMD-------SFYDFAMR--PEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~-------~~~~~~~~--~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++.++.|++=..+|.++.+.++....+. .+.-|+.. -++.+.++|-+++|++--.+|-++-++ T Consensus 39 r~~~~~si~t~l~~~~~~~~yl~~~~~D~~~~~I~GlDP~~~~g~~~~a~~~~G~L~GP~iG~~vf~l~~r~ 110 (174) T pfam08566 39 RINTGSSVFTALGGAFAGLAYLSTLEIDSPTQQIMGLDPFMVLGLATIACGALGWLLGPFIGNAVFRLLNRS 110 (174) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 785573899999987778998634333542030138588999999999974255751636789999997377 No 68 >PRK10649 hypothetical protein; Provisional Probab=56.60 E-value=7.2 Score=13.77 Aligned_cols=63 Identities=16% Similarity=0.181 Sum_probs=48.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999997309255569999999999999999723 Q gi|254781029|r 146 VVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLIDPEEYSSEKMQSISSAVRKEIVLMT 208 (1828) Q Consensus 146 ~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev~a~~ 208 (1828) ..-++|++||=..+-++|+-=-.=+.+-|.+-+..-...=-.+-.+-|..|.|+=.+|+..+= T Consensus 47 ~~~~~~~~l~p~~~r~~~~~l~~~l~~~s~~a~~y~~~Yg~~~d~smI~nv~ETN~~EA~eyl 109 (564) T PRK10649 47 SLWLIPVFLFPKRIKIIAAVIGVVLWAASLAALSYYVIYGQEFSQSVIFVMFETNTNEASEYL 109 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHH T ss_conf 999999998202579999999999999999999999995887789999999837999999999 No 69 >COG2807 CynX Cyanate permease [Inorganic ion transport and metabolism] Probab=56.39 E-value=7.3 Score=13.74 Aligned_cols=39 Identities=15% Similarity=0.124 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999----99999999999999999730925556999999999999999 Q gi|254781029|r 150 VPILLFFSFFIMIS----RARDMHDASQSIAGIALRLIDPEEYSSEKMQSISSAVRKEI 204 (1828) Q Consensus 150 ~p~~~~~~~a~~~~----r~~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev 204 (1828) +|++.|=+|+-..- |=-+=|.+.-+| =+.++|..+|-.. T Consensus 55 LPll~fg~~ap~a~~Lar~~g~er~l~~~l----------------lli~~G~~iR~~~ 97 (395) T COG2807 55 LPLLAFGLFAPAAPRLARRFGEERSLFLAL----------------LLIAAGILIRSLG 97 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHHHCC T ss_conf 899999988754079999875688999999----------------9999999998205 No 70 >PRK10490 sensor protein KdpD; Provisional Probab=55.30 E-value=7.5 Score=13.61 Aligned_cols=20 Identities=10% Similarity=0.099 Sum_probs=8.7 Q ss_pred HHHCCCHHHHHHHHHHHCCC Q ss_conf 87180138999999750745 Q gi|254781029|r 1729 KFIDYDAFVQLWKSYTLGED 1748 (1828) Q Consensus 1729 ~~~~~~~~~~~w~~y~~g~~ 1748 (1828) .-|..+--..+|++|.||++ T Consensus 820 ~GIp~~~~~~IFe~F~R~~~ 839 (895) T PRK10490 820 PGIPPGQEQLIFDKFARGNK 839 (895) T ss_pred CCCCHHHHHHHCCCCCCCCC T ss_conf 98797999975568712899 No 71 >PRK09458 pspB phage shock protein B; Provisional Probab=54.35 E-value=6.2 Score=14.29 Aligned_cols=19 Identities=47% Similarity=0.807 Sum_probs=7.4 Q ss_pred HHHCHHHHHHHHHHHHHHHHHH Q ss_conf 6406488999999999999999 Q gi|254781029|r 134 FAMRPEVITLFLVVSVVPILLF 155 (1828) Q Consensus 134 ~~~~~~~~~~~~~~~~~p~~~~ 155 (1828) |+..|..+ | .++|+||=++ T Consensus 5 fl~vP~ii--F-~~fVaPiWLi 23 (75) T PRK09458 5 FLAIPLII--F-VLFVAPIWLW 23 (75) T ss_pred EEHHHHHH--H-HHHHHHHHHH T ss_conf 43281999--9-9999999999 No 72 >KOG1029 consensus Probab=54.25 E-value=7.8 Score=13.48 Aligned_cols=28 Identities=25% Similarity=0.364 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999 Q gi|254781029|r 230 EVLENNYTKSEMRIDNITQNLKQEREAI 257 (1828) Q Consensus 230 ~~ler~y~dnE~rir~L~~el~~ere~i 257 (1828) .++++--.+-|.|-+.|.++-..+|+.+ T Consensus 320 eNy~kGqaELerRRq~leeqqqreree~ 347 (1118) T KOG1029 320 ENYEKGQAELERRRQALEEQQQREREEV 347 (1118) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7675324779999999998889999999 No 73 >TIGR00927 2A1904 K+-dependent Na+/Ca+ exchanger; InterPro: IPR004817 This family represents the K+-dependent Na+/Ca+ exchanger 1 proteins. They are critical components of the visual transduction cascade, and control the calcium concentration of outer segments during light and darkness. Light causes a rapid lowering of cytosolic free calcium caused by extrusion via this protein, which plays a key role in light adaptation. It is an integral membrane protein which transports one calcium and one potassium ion in exchange for four sodium ions.; GO: 0008273 calcium potassium:sodium antiporter activity, 0006816 calcium ion transport, 0007601 visual perception, 0016021 integral to membrane. Probab=54.23 E-value=7.8 Score=13.48 Aligned_cols=16 Identities=25% Similarity=0.258 Sum_probs=7.9 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781029|r 150 VPILLFFSFFIMISRA 165 (1828) Q Consensus 150 ~p~~~~~~~a~~~~r~ 165 (1828) +|-+|.-...++|..+ T Consensus 509 aPelftsliGvfishs 524 (1226) T TIGR00927 509 APELFTSLIGVFISHS 524 (1226) T ss_pred CHHHHHHHHHHHHHHC T ss_conf 1678888876665202 No 74 >COG0004 AmtB Ammonia permease [Inorganic ion transport and metabolism] Probab=53.65 E-value=7.8 Score=13.49 Aligned_cols=22 Identities=9% Similarity=0.246 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999983 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) +-...+.+++|+.-|-++++.. T Consensus 44 ~~~~~i~~~~w~~~Gyslafg~ 65 (409) T COG0004 44 FVAFAIVTLLWIFVGYSLAFGP 65 (409) T ss_pred HHHHHHHHHHHHHHHHHHEECC T ss_conf 9999999999999756530168 No 75 >pfam10725 DUF2517 Protein of unknown function (DUF2517). This family is conserved in Proteobacteria. Several members are annotated as being protein YbfA. The function is not known. Probab=53.45 E-value=3.7 Score=16.25 Aligned_cols=14 Identities=29% Similarity=0.567 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254781029|r 143 LFLVVSVVPILLFF 156 (1828) Q Consensus 143 ~~~~~~~~p~~~~~ 156 (1828) +.+|++..|||+|| T Consensus 18 v~~Gvla~Pvmlf~ 31 (63) T pfam10725 18 VLAGVLALPVMLFR 31 (63) T ss_pred HHHHHHHHEEEEEC T ss_conf 99887551134441 No 76 >pfam05745 CRPA Chlamydia 15 kDa cysteine-rich outer membrane protein (CRPA). This family consists of several Chlamydia 15 kDa cysteine-rich outer membrane proteins which are associated with differentiation of reticulate bodies (RBs) into elementary bodies (EBs). Probab=53.36 E-value=8 Score=13.38 Aligned_cols=55 Identities=13% Similarity=0.202 Sum_probs=31.6 Q ss_pred CCCHHHHHHHHH-HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH Q ss_conf 732279999999-999999999999983162145557656406488999999999999999 Q gi|254781029|r 96 SLNIALRNAFMS-SLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLF 155 (1828) Q Consensus 96 ~s~~~~~~~~~~-s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 155 (1828) +||+ |-++.++ .++-+.+|+++.+.+ ++.++..+=.+..|+.||++ -+.+-+++| T Consensus 60 ~SrA-fqitlvVlGilLvIaglal~Fil--~~qlg~Naf~l~IPAiiGlV--kllvtsl~m 115 (150) T pfam05745 60 SSRA-FQITLVVLGILLVIAGLALTFIL--QAQLGANAFYLAIPAIIGLV--KLLVTSLSM 115 (150) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHH--HHHHCCCHHHHHHHHHHHHH--HHHHHHHHC T ss_conf 6424-54679999999999999999752--25523760565678999799--998876524 No 77 >KOG4298 consensus Probab=53.10 E-value=8.1 Score=13.35 Aligned_cols=21 Identities=33% Similarity=0.493 Sum_probs=9.7 Q ss_pred HHHHHHHHHHHH-HHHHHHHHH Q ss_conf 999999999999-999999999 Q gi|254781029|r 145 LVVSVVPILLFF-SFFIMISRA 165 (1828) Q Consensus 145 ~~~~~~p~~~~~-~~a~~~~r~ 165 (1828) ..++.+||+++| +||+.+.|+ T Consensus 185 st~ilvPvgliFvvFa~hfyrs 206 (245) T KOG4298 185 STAILVPVGLIFVVFAIHFYRS 206 (245) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 4134332458999999999998 No 78 >PRK02935 hypothetical protein; Provisional Probab=52.74 E-value=8.2 Score=13.30 Aligned_cols=49 Identities=18% Similarity=0.323 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999998316214555765640648899999999999999999 Q gi|254781029|r 103 NAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) -+.+.|++.++-.+-+.-.+|..+++ +..++..-|.+. .+.--++.||. T Consensus 12 RtfaL~lvfiG~ivMY~GifF~~s~i--~m~ifmilG~l~----i~~StvvYFwi 60 (117) T PRK02935 12 RTFALSLVFIGLIIMYLGVFFRESII--IMTTFMILGFLA----VIASTVVYFWI 60 (117) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHH--HHHHHHHHHHHH----HHHHHHHHHHH T ss_conf 99999999999999999999725799--999999999999----99999999999 No 79 >TIGR01194 cyc_pep_trnsptr cyclic peptide transporter; InterPro: IPR005898 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Bacteria have elaborate pathways for the production of toxins and secondary metabolites. Many such compounds, including syringomycin and pyoverdine are synthesized on non-ribosomal templates consisting of a multienzyme complex. On several occasions the proteins of the complex and transporter protein are present on the same operon. Often times these compounds cross the biological membrane by specific transporters. This model describes a family of cyclic peptide transporters in bacteria. Syringomycin is an amphipathic, cyclic lipodepsipeptide when inserted into host causes formation of channels, permeable to a variety of cations. On the other hand, pyoverdine is a cyclic octa-peptidyl dihydroxyquinoline, which is efficient in sequestering iron for uptake.; GO: 0005524 ATP binding, 0015197 peptide transporter activity, 0015833 peptide transport, 0016021 integral to membrane. Probab=52.41 E-value=8.3 Score=13.26 Aligned_cols=36 Identities=17% Similarity=0.270 Sum_probs=17.4 Q ss_pred CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHH-HHHHHHHHHH Q ss_conf 7798989998541873227999999999999-9999999983 Q gi|254781029|r 82 NGGITHILQNIGKNSLNIALRNAFMSSLIWL-ACELGFLIKF 122 (1828) Q Consensus 82 r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~-~~~~~~~~~~ 122 (1828) |-.||.+++-| ++||.+-+++|++-. ++|++....+ T Consensus 2 ~~~~~~~l~LL-----~s~~~~~~~~~~~~Gl~~G~A~~~~~ 38 (555) T TIGR01194 2 KAAIRELLALL-----RSSFRALVAVSLLLGLISGLAIILLL 38 (555) T ss_pred CCCHHHHHHHH-----HCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87368999987-----25518999999999888899999999 No 80 >PRK09647 RNA polymerase sigma factor SigE; Reviewed Probab=52.40 E-value=6.1 Score=14.40 Aligned_cols=12 Identities=17% Similarity=-0.000 Sum_probs=6.6 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254781029|r 156 FSFFIMISRARD 167 (1828) Q Consensus 156 ~~~a~~~~r~~~ 167 (1828) |.|.+.....-+ T Consensus 100 WL~rIA~N~~iD 111 (222) T PRK09647 100 WLHRITTNLFLD 111 (222) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 81 >TIGR02976 phageshock_pspB phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages . Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one .. Probab=51.03 E-value=8.6 Score=13.10 Aligned_cols=18 Identities=17% Similarity=0.510 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254781029|r 149 VVPILLFFSFFIMISRAR 166 (1828) Q Consensus 149 ~~p~~~~~~~a~~~~r~~ 166 (1828) ++|++.|.+|..++|.-= T Consensus 7 ~~P~i~f~~fv~~~wl~l 24 (75) T TIGR02976 7 AIPLIIFVIFVALLWLLL 24 (75) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999 No 82 >PRK13657 cyclic beta-1,2-glucan ABC transporter; Provisional Probab=50.80 E-value=8.7 Score=13.08 Aligned_cols=29 Identities=7% Similarity=-0.058 Sum_probs=20.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 18732279999999999999999999983 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) -+|-+..||++.+++++..+++++..+.+ T Consensus 14 ~~~~k~~~~~~~~~~i~~~~~~l~~P~~~ 42 (585) T PRK13657 14 LGAEKRLGWLLAVANVLLAAATFAEPVLF 42 (585) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999 No 83 >TIGR02139 permease_CysT sulfate ABC transporter, permease protein CysT; InterPro: IPR011865 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents CysT, one of two homologous, tandem permeases in the sulphate ABC transporter system; the other is CysW (IPR011866 from INTERPRO). The sulphate transporter has been described in Escherichia coli as transporting sulphate, thiosulphate, selenate, and selenite. Sulphate transporters may also transport molybdate ion if a specific molybdate transporter is not present.; GO: 0015116 sulfate transmembrane transporter activity, 0008272 sulfate transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=50.45 E-value=8.8 Score=13.03 Aligned_cols=51 Identities=12% Similarity=0.226 Sum_probs=38.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----CCH-HHHHHHHHH-HHCHHHHHHHHHHHH Q ss_conf 279999999999999999999983----162-145557656-406488999999999 Q gi|254781029|r 99 IALRNAFMSSLIWLACELGFLIKF----YTN-SPMDSFYDF-AMRPEVITLFLVVSV 149 (1828) Q Consensus 99 ~~~~~~~~~s~~w~~~~~~~~~~~----~~~-~~~~~~~~~-~~~~~~~~~~~~~~~ 149 (1828) .-+..|.+++++=+.-|..+||-+ ||+ ..+.++-|+ |++|+..|+++.+.+ T Consensus 54 ~sf~~Al~Aa~~N~vFG~l~AWVLVRY~FPGKri~DAlVDLPFALPTAVAGIaL~Tl 110 (265) T TIGR02139 54 LSFGTALIAALINAVFGTLLAWVLVRYRFPGKRIVDALVDLPFALPTAVAGIALATL 110 (265) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 999999999999999999998898612475057776662335420347999999999 No 84 >smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains Probab=50.45 E-value=8.8 Score=13.03 Aligned_cols=13 Identities=38% Similarity=0.537 Sum_probs=4.1 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q gi|254781029|r 322 TISSKIDQLLEVL 334 (1828) Q Consensus 322 ~l~~~~~~l~~~L 334 (1828) .|..+...|+..| T Consensus 54 ~l~~rk~~Ll~~l 66 (127) T smart00502 54 ALNKRKKQLLEDL 66 (127) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999999 No 85 >PRK12519 RNA polymerase sigma factor; Provisional Probab=49.98 E-value=8.9 Score=12.98 Aligned_cols=37 Identities=5% Similarity=-0.000 Sum_probs=15.1 Q ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 98989998541873227999999999999999999998316 Q gi|254781029|r 84 GITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYT 124 (1828) Q Consensus 84 ~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~ 124 (1828) |=..|+..++.+-..+ +..++.--|-.. .++++.+.+ T Consensus 15 tD~eLi~~~~~GD~~A---f~~L~~ry~~~l-~~~a~~~~~ 51 (194) T PRK12519 15 SDAELFSALKAGQSAA---LGVLYDRHAGLV-YGLALKILG 51 (194) T ss_pred CHHHHHHHHHHCCHHH---HHHHHHHHHHHH-HHHHHHHCC T ss_conf 0999999999385999---999999999999-999999939 No 86 >pfam09788 Tmemb_55A Transmembrane protein 55A. Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate. Probab=49.55 E-value=9.1 Score=12.93 Aligned_cols=21 Identities=14% Similarity=0.456 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999831 Q gi|254781029|r 103 NAFMSSLIWLACELGFLIKFY 123 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~~~~~~ 123 (1828) +-++++++.+.+|+|+..+.| T Consensus 202 ~~~il~lifl~~~iglt~gT~ 222 (258) T pfam09788 202 LCFLLALLFIVVAVGLTAGTW 222 (258) T ss_pred EHHHHHHHHHHHHHHHEEEEH T ss_conf 039999999997453136517 No 87 >pfam01034 Syndecan Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors. Probab=49.39 E-value=9.1 Score=12.91 Aligned_cols=35 Identities=17% Similarity=0.242 Sum_probs=13.3 Q ss_pred HHHHHHCHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Q ss_conf 7656406488999999-9999999999999999999 Q gi|254781029|r 131 FYDFAMRPEVITLFLV-VSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 131 ~~~~~~~~~~~~~~~~-~~~~p~~~~~~~a~~~~r~ 165 (1828) ...||..|++||.+++ ++|+=+..+|.+-.|+.|. T Consensus 143 S~~~~~~~~~l~~~i~~~~~~~~~a~~l~~~~~yr~ 178 (207) T pfam01034 143 SKNLLERKEVLAAVIAGGVVGLLFAVFLVMFLVYRM 178 (207) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 522102413789988512899999999999999864 No 88 >TIGR03108 eps_aminotran_1 exosortase 1 system-associated amidotransferase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins. Probab=48.79 E-value=9.3 Score=12.84 Aligned_cols=37 Identities=16% Similarity=0.184 Sum_probs=27.6 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 57999998718013899999975074562067741252 Q gi|254781029|r 1722 SLVENISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIK 1759 (1828) Q Consensus 1722 s~s~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~ 1759 (1828) +-++.....+|++....+|+.++.|.++ |+++|+++. T Consensus 583 ~~~~~~~~~~~~~~v~~l~~~~~~g~~~-~~~~lW~l~ 619 (628) T TIGR03108 583 GETLAETGLFDPAFIRKLVDQHQSGRRD-YSAPLWSLL 619 (628) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHCCCCC-CHHHHHHHH T ss_conf 0337666880999999999998627976-078999999 No 89 >pfam09803 DUF2346 Uncharacterized conserved protein (DUF2346). Members of this family of proteins have no known function. Probab=48.68 E-value=9.3 Score=12.83 Aligned_cols=20 Identities=25% Similarity=0.610 Sum_probs=12.1 Q ss_pred HHHHHHHHHHH-------HHHHHHHHH Q ss_conf 99999999999-------999999999 Q gi|254781029|r 150 VPILLFFSFFI-------MISRARDMH 169 (1828) Q Consensus 150 ~p~~~~~~~a~-------~~~r~~~~~ 169 (1828) -||++||.|-. ++.|-.||. T Consensus 17 FPVamF~~fNQpe~FEe~V~k~kRel~ 43 (80) T pfam09803 17 FPVAMFWIFNQPEYFEEWVTKRKRELY 43 (80) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCC T ss_conf 599999980574999999999876307 No 90 >KOG2922 consensus Probab=47.59 E-value=9.6 Score=12.70 Aligned_cols=32 Identities=19% Similarity=-0.100 Sum_probs=18.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 22799999999999999999999831621455 Q gi|254781029|r 98 NIALRNAFMSSLIWLACELGFLIKFYTNSPMD 129 (1828) Q Consensus 98 ~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~ 129 (1828) +.|.||+..+.-+=.-.+-+.++++.|.+.+. T Consensus 63 ~~~~Ww~G~ltm~vGei~NFaAYaFAPasLVt 94 (335) T KOG2922 63 KEPLWWAGMLTMIVGEIANFAAYAFAPASLVT 94 (335) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHC T ss_conf 63789978899998867557887503476635 No 91 >pfam06667 PspB Phage shock protein B. This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one. Probab=47.57 E-value=9.6 Score=12.70 Aligned_cols=12 Identities=33% Similarity=0.703 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254781029|r 147 VSVVPILLFFSF 158 (1828) Q Consensus 147 ~~~~p~~~~~~~ 158 (1828) ++|+||-+++-+ T Consensus 15 ifVaP~WL~LHY 26 (75) T pfam06667 15 LFVAPIWLILHY 26 (75) T ss_pred HHHHHHHHHHHH T ss_conf 999999999965 No 92 >COG2245 Predicted membrane protein [Function unknown] Probab=46.79 E-value=9.9 Score=12.61 Aligned_cols=21 Identities=10% Similarity=0.150 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIK 121 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~ 121 (1828) ||+..+.-++|+.|-+.+.++ T Consensus 55 F~~yl~~~v~~l~~~vvvifa 75 (182) T COG2245 55 FNNYLIAVVLGLVGFVVVIFA 75 (182) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999 No 93 >pfam03239 FTR1 Iron permease FTR1 family. Probab=46.35 E-value=10 Score=12.56 Aligned_cols=86 Identities=8% Similarity=0.042 Sum_probs=36.7 Q ss_pred HHHHHHHHH--CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 898999854--187322799999999999999999999831621455576564064889999999999999999999999 Q gi|254781029|r 85 ITHILQNIG--KNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 85 ~~~~~~~l~--~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) +|-++..|+ .++.+.++.+..++.++=+...++........ .+++...+.+.+..=|+ ..+++-+++.|+...|. T Consensus 19 V~IllA~l~~~~~~~~~~~Vw~Gv~~gv~~s~~~g~~~~~~~~-~~~~~~~~~~~el~eg~--~~liA~~lit~m~~WM~ 95 (284) T pfam03239 19 VSVLLAYLKQTQRETDDLQVWAGIALAVVLCLAIGQAFIGIFA-GLGRDIWFIEKELFEGV--LSYIAVVVISWMIFWMR 95 (284) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHCCCCHHHHHHHHHHH--HHHHHHHHHHHHHHHHH T ss_conf 9999999982586312368699999999999999999999999-84653204689999999--99999999999999999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254781029|r 163 SRARDMHDASQ 173 (1828) Q Consensus 163 ~r~~~~~~~a~ 173 (1828) +-+..|+---+ T Consensus 96 r~~~~~k~~~~ 106 (284) T pfam03239 96 RVGKKLKGEIE 106 (284) T ss_pred HHHHHHHHHHH T ss_conf 98799999999 No 94 >PRK10472 low affinity gluconate transporter; Provisional Probab=45.96 E-value=10 Score=12.52 Aligned_cols=38 Identities=13% Similarity=0.023 Sum_probs=23.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999999999999999999999999730 Q gi|254781029|r 146 VVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLI 183 (1828) Q Consensus 146 ~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~ 183 (1828) |..+.-|.+.-+|..|+=+-=|---.|+.|++.-+|+- T Consensus 59 G~tlg~igiiI~lG~~iG~~L~~SGaa~~ia~~~l~~~ 96 (445) T PRK10472 59 GGTLGFLAVVVALGAMFGKILHETGAVDQIAVKMLKSF 96 (445) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 88888999999999999999998595999999999986 No 95 >TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein; InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.. Probab=45.36 E-value=10 Score=12.45 Aligned_cols=72 Identities=14% Similarity=0.161 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 322 TISSKIDQLLEVLHSTSIVITKDFDNRIESLSNTLNNSGRSLANQVGNYTLMLGNNTDKVSIALKEQSQQFM 393 (1828) Q Consensus 322 ~l~~~~~~l~~~L~~~~~~~~~~~~~~~e~l~~~L~~~~~~l~~~~~~~~~~l~~~~e~l~~~l~e~~~~l~ 393 (1828) .+...+..+.+.|=..|....+.-..+...-.+.+.+..++...++..........+++|+..|..+...+. T Consensus 19 rv~~EGs~~F~~LV~~GE~~ek~~~~~~~~~~d~~r~~~~~~~~~~~~~~d~~~~~w~~l~~~FD~R~~~~L 90 (121) T TIGR01837 19 RVQEEGSKFFDELVKEGEKAEKRGKKRVDKSVDAARDSIEEVKTKLEQVRDKVQRNWDKLEKAFDERVEQAL 90 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 307789999999999889998999999999999987479999988887799998889889999999999999 No 96 >PRK05415 hypothetical protein; Provisional Probab=45.32 E-value=10 Score=12.44 Aligned_cols=41 Identities=7% Similarity=0.018 Sum_probs=20.3 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5557656406488999999999999999999999999999999999 Q gi|254781029|r 128 MDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) ..-+..++..|- ||.+ +++++-++.++++.++++ |++-..| T Consensus 73 ~~l~~~~~~~~w-Lg~~-~~~l~~~~~~~~~~~~~r---E~~~L~R 113 (338) T PRK05415 73 QWLIDALQQQDW-LGLG-AAVVGALIVLAGLGIVVR---EWRRLRR 113 (338) T ss_pred HHHHHHHHHCCH-HHHH-HHHHHHHHHHHHHHHHHH---HHHHHHH T ss_conf 999999985848-9999-999999999999999999---9999999 No 97 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=45.10 E-value=10 Score=12.42 Aligned_cols=23 Identities=17% Similarity=0.140 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999999998316 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYT 124 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~ 124 (1828) -|..+++.++|++ .+.++++.|. T Consensus 162 ~w~~~~~~al~~~-v~~v~~~~~~ 184 (512) T PRK10153 162 VWKLFLLSALGIC-VLLVAMSSLT 184 (512) T ss_pred HHHHHHHHHHHHH-HHHHEEEEEE T ss_conf 8999999999999-8864024531 No 98 >COG0795 Predicted permeases [General function prediction only] Probab=45.05 E-value=10 Score=12.41 Aligned_cols=20 Identities=20% Similarity=0.497 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781029|r 148 SVVPILLFFSFFIMISRARD 167 (1828) Q Consensus 148 ~~~p~~~~~~~a~~~~r~~~ 167 (1828) +.+|.++|++++.+..+..+ T Consensus 343 ~~~p~~v~l~~~~~l~~~~~ 362 (364) T COG0795 343 AWLPNLVFLALGLWLLRRKE 362 (364) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999998510 No 99 >PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed Probab=44.78 E-value=11 Score=12.38 Aligned_cols=33 Identities=21% Similarity=0.425 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 999999999999999999999997309255569 Q gi|254781029|r 158 FFIMISRARDMHDASQSIAGIALRLIDPEEYSS 190 (1828) Q Consensus 158 ~a~~~~r~~~~~~~a~~m~~~a~rl~~p~~~a~ 190 (1828) +--+++|++.=.+..--..-+-+-|+-|-++.+ T Consensus 251 y~d~v~~ar~~~l~~~df~g~t~sltnpg~igt 283 (1234) T PRK12270 251 YEDIVRRARDGKLTADDFQGVTISLTNPGGIGT 283 (1234) T ss_pred HHHHHHHHHCCCCCHHHCCCCEEEECCCCCCCC T ss_conf 999999986598767760786698217986453 No 100 >TIGR00946 2a69 Auxin Efflux Carrier; InterPro: IPR014024 This entry is mostly composed of known or predicted PIN proteins from plants, though some homologous prokaryotic proteins are also included. The PIN proteins are components of auxin efflux systems from plants. These carriers are saturable, auxin-specific, and localized to the basal ends of auxin transport-competent cells , . Plants typically posses several of these proteins, each displaying a unique tissue-specific expression pattern. They are expressed in almost all plant tissues including vascular tissues and roots, and influence many processes including the establishment of embryonic polarity, plant growth, apical hook formation in seedlings and the photo- and gravitrophic responses. These plant proteins are typically 600-700 amino acyl residues long and exhibit 8-12 transmembrane segments.. Probab=44.60 E-value=11 Score=12.36 Aligned_cols=87 Identities=20% Similarity=0.268 Sum_probs=58.6 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHH-HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 541873227999999999999999999998316214-5557656406488999999999999999999999999999999 Q gi|254781029|r 92 IGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSP-MDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHD 170 (1828) Q Consensus 92 l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~ 170 (1828) .=++.-+.|..++.++|++|.. +++=|.+-=|.. ..++. +++.+.+|++||..--.|+--...|+. T Consensus 453 ~~k~~~~~P~~~~~Lls~il~l--v~~~~n~~~P~~~~~s~~-----------~Ls~a~~p~alfslGlfl~L~~~~~~~ 519 (608) T TIGR00946 453 VWKKLIRFPPLIASLLSLILSL--VGFKWNFTMPGLILKSIS-----------ILSGAALPMALFSLGLFLALQPRKLKL 519 (608) T ss_pred HHHHHHCCCHHHHHHHHHHHHH--HHHCCCCHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999732542689999999999--861146604789999987-----------532789999999999999753568762 Q ss_pred H-HHHHHHHHHHH-CCHHHHHHH Q ss_conf 9-99999999973-092555699 Q gi|254781029|r 171 A-SQSIAGIALRL-IDPEEYSSE 191 (1828) Q Consensus 171 ~-a~~m~~~a~rl-~~p~~~a~~ 191 (1828) = +-+..=.++|| ..|--++.= T Consensus 520 ~~~~~~~a~~~rf~~~Pa~~~~~ 542 (608) T TIGR00946 520 GVADAILALIVRFLVGPAVMAVI 542 (608) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH T ss_conf 41489999999999999999999 No 101 >PRK09535 btuC corrinoid ABC transporter permease; Reviewed Probab=44.60 E-value=11 Score=12.36 Aligned_cols=29 Identities=14% Similarity=-0.049 Sum_probs=19.5 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 898999854187322799999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLAC 114 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~ 114 (1828) .|.++|++-|+|-..||.+ .+.|+..++. T Consensus 112 aG~lmQ~l~RNPLAdP~iL-GVssGA~lga 140 (366) T PRK09535 112 AGAVMQGFFRNPMADPSII-GVSSGAAVGA 140 (366) T ss_pred HHHHHHHHHCCCCCCCCHH-HHHHHHHHHH T ss_conf 9999999728888697287-9999999999 No 102 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=44.37 E-value=11 Score=12.33 Aligned_cols=15 Identities=13% Similarity=-0.017 Sum_probs=4.9 Q ss_pred CHHHHHHHHHHHHCH Q ss_conf 621455576564064 Q gi|254781029|r 124 TNSPMDSFYDFAMRP 138 (1828) Q Consensus 124 ~~~~~~~~~~~~~~~ 138 (1828) ||+..-.+.-..+.| T Consensus 128 PPsgAvALtavlgGp 142 (382) T COG3448 128 PPSGAVALTAVLGGP 142 (382) T ss_pred CCCHHHHHHHHHCCC T ss_conf 984078899884684 No 103 >PRK02542 photosystem I assembly protein Ycf4; Provisional Probab=44.29 E-value=11 Score=12.32 Aligned_cols=63 Identities=13% Similarity=0.208 Sum_probs=29.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHC----CHH--HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 187322799999999999999999999831----621--455576564064889999999999999999999999 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKFY----TNS--PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) ||.| .||||+++.+ ||+|+...-+ +-. +.....++.--|+++.+.+=++++=.+=||...... T Consensus 21 Rr~S--Ny~wA~ii~~----G~lGFll~g~SSYl~~~llpf~~~~~i~FiPQGivM~FYGi~gl~ls~Ylw~~I~ 89 (188) T PRK02542 21 RRLS--NYLWASMVTI----GGIGFLLAGLSSYLGRDLLPVGDPSTLIFIPQGLAMGFYGVAATLLASYLWLVIA 89 (188) T ss_pred CHHH--HHHHHHHHHH----HHHHHHHHHHHHHHCCCCCCCCCHHHCCEECCCHHHHHHHHHHHHHHHHHHHHHE T ss_conf 4157--8899999986----5589997464887587701458854562526408998988999999999987512 No 104 >COG4117 Thiosulfate reductase cytochrome B subunit (membrane anchoring protein) [Energy production and conversion] Probab=44.00 E-value=11 Score=12.29 Aligned_cols=33 Identities=12% Similarity=0.010 Sum_probs=21.2 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCH Q ss_conf 788888788877556779898999854187322 Q gi|254781029|r 67 NKQTYSGFYPANDLDNGGITHILQNIGKNSLNI 99 (1828) Q Consensus 67 ~~~~~~~~~pa~~d~r~~~~~~~~~l~~~~s~~ 99 (1828) ++..+..|.|.+||.++-.--++.+|+.|-+.. T Consensus 99 tgh~Rr~f~p~~~~~~~~~rd~v~~l~~~~~H~ 131 (221) T COG4117 99 TGHYRRRFSPLLDRAARQTRDYVFALMKREEHP 131 (221) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 553577214201266660489999997631187 No 105 >KOG3091 consensus Probab=43.79 E-value=11 Score=12.27 Aligned_cols=52 Identities=12% Similarity=0.162 Sum_probs=24.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999973092555699999999999999997236899999999999999999999999999 Q gi|254781029|r 172 SQSIAGIALRLIDPEEYSSEKMQSISSAVRKEIVLMTEEIDRAISRASELEKTVRSEIEVLENNY 236 (1828) Q Consensus 172 a~~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev~a~~~~~er~~~ra~ele~lv~~ev~~ler~y 236 (1828) ++.-+.++-|-..|..++.--.+++=-.. .+.+.-+.+ +.|++-|+..++.| T Consensus 252 ~k~~~~~a~~~~~p~~~~~~~q~~~~~~~------~n~~~t~~~-------afv~~~~~q~e~~L 303 (508) T KOG3091 252 AKTQGFVAGRIPAPQSLNDQVQKTLKEWL------LNTPKTRVL-------AFVYLSVAQTEAYL 303 (508) T ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHHH------HCCCCCHHH-------HHHCCCHHHHHHHH T ss_conf 31322003677865667888999888775------348831334-------21126788877876 No 106 >pfam06196 DUF997 Protein of unknown function (DUF997). Family of predicted bacterial membrane protein with unknown function. Probab=43.58 E-value=11 Score=12.24 Aligned_cols=55 Identities=18% Similarity=0.135 Sum_probs=30.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++|..+.++..++ ++.+++.+++ ..+.++.|.=. +..-+++|++|....-.|++. T Consensus 8 ~~al~L~l~Yf~~--W~~~AY~~~~----~~~i~GlP~WF--~~SCi~~p~lf~~l~~~mVk~ 62 (75) T pfam06196 8 RWALGLTLLYFIW--WYGFAYGLGS----GNGILGFPLWF--EYSCIVGPILFIVLCWLMVKF 62 (75) T ss_pred HHHHHHHHHHHHH--HHHHHHCCCC----CCCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999--9999862688----77777883999--999999999999999999999 No 107 >TIGR03061 pip_yhgE_Nterm YhgE/Pip N-terminal domain. This family contains the N-terminal domain of a family of multiple membrane-spanning proteins of Gram-positive bacteria. One member was shown to be a host protein essential for phage infection, so many members of this family are called "phage infection protein". A separate model, TIGR03062, represents the conserved C-terminal domain. The domains are separated by regions highly variable in both length and sequence, often containing extended heptad repeats as described in model TIGR03057. Probab=43.45 E-value=11 Score=12.23 Aligned_cols=30 Identities=10% Similarity=0.136 Sum_probs=13.6 Q ss_pred HHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 640648899999999999999999999999 Q gi|254781029|r 134 FAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 134 ~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) ++..|..+.++++.+++|.++-|++-.=.| T Consensus 6 ~~~~~~~~~~~~~~~liP~ly~~~~l~a~w 35 (164) T TIGR03061 6 LRKNKLLRIALIAIMLIPLLYGGLFLWAFW 35 (164) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 763828999999999999999999984636 No 108 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=43.24 E-value=11 Score=12.20 Aligned_cols=81 Identities=11% Similarity=0.104 Sum_probs=37.5 Q ss_pred CCCCCCCCCCHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHH-HHHHHHCCHH-------------HHHHHHHHHHCHH Q ss_conf 8877556779898999854-18732279999999999999999-9999831621-------------4555765640648 Q gi|254781029|r 75 YPANDLDNGGITHILQNIG-KNSLNIALRNAFMSSLIWLACEL-GFLIKFYTNS-------------PMDSFYDFAMRPE 139 (1828) Q Consensus 75 ~pa~~d~r~~~~~~~~~l~-~~~s~~~~~~~~~~s~~w~~~~~-~~~~~~~~~~-------------~~~~~~~~~~~~~ 139 (1828) +||.+ +|.--...+ .+|.+.++|....++++-++.-+ -++.+++-+. .+..+-..++.|+ T Consensus 215 SPA~~-----~g~~~~~~~~~~p~~~~r~~~~~~~~~~~~~llp~la~L~~~~~~~~~~~~~~~~~~~~~G~~eGsG~~~ 289 (719) T TIGR02353 215 SPAQK-----VGAAKRKVQPARPARLRRRLYVLGALFVVLVLLPLLAVLFAIPVAIVFVEIDWLLGPDLLGFFEGSGSLT 289 (719) T ss_pred CCCCC-----CCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHH T ss_conf 98744-----4601113565677622678778878887678776787888988876555330378863113044788788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999999999999999 Q gi|254781029|r 140 VITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 140 ~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) ....+ .+..++.||+++|+-+ T Consensus 290 ~y~~~--~~~~~~ala~~i~~~v 310 (719) T TIGR02353 290 LYALL--LVLTFVALAGFIAYAV 310 (719) T ss_pred HHHHH--HHHHHHHHHHHHHHHH T ss_conf 99999--9999999999999999 No 109 >PRK09977 putative Mg(2+) transport ATPase; Provisional Probab=43.10 E-value=11 Score=12.19 Aligned_cols=18 Identities=22% Similarity=0.545 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999983 Q gi|254781029|r 105 FMSSLIWLACELGFLIKF 122 (1828) Q Consensus 105 ~~~s~~w~~~~~~~~~~~ 122 (1828) |-++.+|+.+++|++.+. T Consensus 96 TTAA~iW~~aaiG~a~G~ 113 (215) T PRK09977 96 TTAADIWVTAAIGMVIGS 113 (215) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 899999999999999854 No 110 >PRK10160 taurine transporter subunit; Provisional Probab=43.05 E-value=11 Score=12.18 Aligned_cols=31 Identities=13% Similarity=0.170 Sum_probs=12.9 Q ss_pred CCHHHHHHHHHC-----CCCCHHHHHHHHHHHHHHH Q ss_conf 798989998541-----8732279999999999999 Q gi|254781029|r 83 GGITHILQNIGK-----NSLNIALRNAFMSSLIWLA 113 (1828) Q Consensus 83 ~~~~~~~~~l~~-----~~s~~~~~~~~~~s~~w~~ 113 (1828) ++|..+..++-. ......+|..+..|+.=+. T Consensus 53 P~P~~V~~~l~~~~~~~~~~~g~l~~~~~~Tl~r~~ 88 (275) T PRK10160 53 PPPQQVLAKLLTIAGPQGFMDATLWQHLAASLTRIV 88 (275) T ss_pred CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH T ss_conf 998999999999843356677549999999999999 No 111 >PRK11486 flagellar biosynthesis protein FliO; Provisional Probab=42.93 E-value=11 Score=12.17 Aligned_cols=17 Identities=18% Similarity=0.288 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 149 VVPILLFFSFFIMISRA 165 (1828) Q Consensus 149 ~~p~~~~~~~a~~~~r~ 165 (1828) ++=|.++|++|++++|- T Consensus 26 ~~Vi~lI~~~aWL~rR~ 42 (124) T PRK11486 26 IAIIALILAAAWLVKRL 42 (124) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999982 No 112 >COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning] Probab=42.01 E-value=11 Score=12.06 Aligned_cols=55 Identities=13% Similarity=0.073 Sum_probs=36.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCCHHHHHHHH-HHHHHHH Q ss_conf 9999999999999999999999999999999999997-30925556999-9999999 Q gi|254781029|r 145 LVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALR-LIDPEEYSSEK-MQSISSA 199 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~r-l~~p~~~a~~~-~~~~~qa 199 (1828) ++++++-++.||+++.++|+...=|+-+=-=.--.++ |.=|+.++.=. +-.+||+ T Consensus 5 iv~llVilv~~~~~g~~lRkk~~~rI~~LEe~K~el~~lPv~dEi~kVK~L~L~GQT 61 (570) T COG4477 5 IVALLVILVAAYAVGYLLRKKNYQRIDKLEERKNELLNLPVNDEISKVKKLHLTGQT 61 (570) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCC T ss_conf 999999999999999999973898999999989999819944579997527013741 No 113 >PRK12468 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=41.90 E-value=12 Score=12.05 Aligned_cols=12 Identities=8% Similarity=0.357 Sum_probs=5.0 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254781029|r 103 NAFMSSLIWLAC 114 (1828) Q Consensus 103 ~~~~~s~~w~~~ 114 (1828) +.+.+.++|+.| T Consensus 39 L~~~~~~l~~~~ 50 (383) T PRK12468 39 LGAGLTILWMSG 50 (383) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 114 >TIGR02978 phageshock_pspC phage shock protein C; InterPro: IPR014320 All members of this protein are the phage shock protein PspC. The phage shock regulon is restricted to the Proteobacteria and somewhat sparsely distributed there. It is expressed, under positive control of a sigma-54-dependent transcription factor; PspF, which binds and is modulated by PspA. Stresses that induce the psp regulon include phage secretin over expression, ethanol, heat shock and protein export defects.. Probab=41.90 E-value=12 Score=12.05 Aligned_cols=42 Identities=12% Similarity=0.110 Sum_probs=18.2 Q ss_pred HHHHHHHHHHCCHH-HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999831621-455576564064889999999999999999999999 Q gi|254781029|r 113 ACELGFLIKFYTNS-PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 113 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) |-|+|||.-+-=+- +++-+. +++++++++ .|+++.|+.+.|+ T Consensus 15 GVCAGlA~Yfg~ev~LVRil~-------v~~~lfg~~-~~~~~AYia~~~~ 57 (128) T TIGR02978 15 GVCAGLAEYFGVEVWLVRILV-------VSALLFGGG-FFVLVAYIALWLL 57 (128) T ss_pred CHHHHHHHHHCCCHHHHHHHH-------HHHHHHHHH-HHHHHHHHHHHHH T ss_conf 224577766278688899999-------999999879-9999999999996 No 115 >COG4662 TupA ABC-type tungstate transport system, periplasmic component [Coenzyme metabolism] Probab=41.37 E-value=12 Score=11.99 Aligned_cols=70 Identities=13% Similarity=0.207 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999831621455576564064889999999999999999999999999999999999999 Q gi|254781029|r 107 SSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAG 177 (1828) Q Consensus 107 ~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~ 177 (1828) .|.--+..|+.+..++...++++.++=+|+..+.|-+ -+..+.|++.-|+...+-.--..++..|++..- T Consensus 69 ~s~PTVlvGLlLylLlSr~GPlG~f~LLfT~~amILG-q~iL~lPlvia~~l~ale~~dpr~~ela~~lga 138 (227) T COG4662 69 LSMPTVLVGLLLYLLLSRSGPLGWFNLLFTQDAMILG-QAILILPLVIAFVLTALESVDPRLKELARSLGA 138 (227) T ss_pred HCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCC T ss_conf 3163999999999999646998630367621069988-799999999999999998459789999998099 No 116 >TIGR02226 two_anch N-terminal double-transmembrane domain; InterPro: IPR011933 This entry represents a prokaryotic N-terminal region of about 80 amino acids. The predicted membrane topology by TMHMM puts the N-terminus outside and spans the membrane twice, with a cytosolic region of about 25 amino acids between the two transmembrane regions. Member proteins tend to be between 600 and 1000 amino acids in length.. Probab=40.37 E-value=12 Score=11.87 Aligned_cols=20 Identities=20% Similarity=0.574 Sum_probs=12.7 Q ss_pred HHCHHHHHHHHHHHHHHHHHH Q ss_conf 406488999999999999999 Q gi|254781029|r 135 AMRPEVITLFLVVSVVPILLF 155 (1828) Q Consensus 135 ~~~~~~~~~~~~~~~~p~~~~ 155 (1828) |++|..+++. .++++||++- T Consensus 3 Fl~p~~L~gl-la~~~P~l~H 22 (84) T TIGR02226 3 FLNPALLAGL-LAVVLPLLIH 22 (84) T ss_pred HHHHHHHHHH-HHHHHHHHHH T ss_conf 6779999999-9999999999 No 117 >PRK06433 NADH dehydrogenase subunit J; Provisional Probab=40.33 E-value=12 Score=11.87 Aligned_cols=28 Identities=18% Similarity=0.535 Sum_probs=13.2 Q ss_pred HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 656406488999999999999999999999999 Q gi|254781029|r 132 YDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++|.+.-+.+ +++| -|+..|.|+.|.-| T Consensus 52 AeFla~vQil-VY~G----Ai~vL~lF~vMLt~ 79 (89) T PRK06433 52 ASFLGVVQVL-VYIG----AVLILITFAVMLTK 79 (89) T ss_pred CHHHHHHHHH-HHHH----HHHHHHHHHHHHCC T ss_conf 8899999999-9980----99999999999828 No 118 >PRK09292 Na(+)-translocating NADH-quinone reductase subunit D; Validated Probab=39.95 E-value=12 Score=11.83 Aligned_cols=14 Identities=29% Similarity=0.417 Sum_probs=5.2 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254781029|r 150 VPILLFFSFFIMIS 163 (1828) Q Consensus 150 ~p~~~~~~~a~~~~ 163 (1828) .|+.-|+.+..|+| T Consensus 183 lppGaFi~lG~lia 196 (208) T PRK09292 183 LPPSAFFLIGLLIW 196 (208) T ss_pred CCCHHHHHHHHHHH T ss_conf 39789999999999 No 119 >PRK10983 putative inner membrane protein; Provisional Probab=39.78 E-value=12 Score=11.81 Aligned_cols=11 Identities=36% Similarity=0.772 Sum_probs=5.6 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254781029|r 145 LVVSVVPILLF 155 (1828) Q Consensus 145 ~~~~~~p~~~~ 155 (1828) +..+++|++++ T Consensus 75 ~l~vi~Pl~~l 85 (363) T PRK10983 75 VLLFVIPIALL 85 (363) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 120 >PRK00247 putative inner membrane protein translocase component YidC; Validated Probab=38.99 E-value=13 Score=11.72 Aligned_cols=38 Identities=18% Similarity=0.155 Sum_probs=24.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999999999999999999999999999999 Q gi|254781029|r 140 VITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAG 177 (1828) Q Consensus 140 ~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~ 177 (1828) +|++++.++++=++||+++...++=++-|+.+.=-|.. T Consensus 33 ~lSIi~Lti~VR~il~Pl~~kq~kS~r~Mq~lqPemk~ 70 (422) T PRK00247 33 FFSLFGLVITVRAIIAPFTWQMYKSGRTAAHIRPKRAA 70 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 99999999999999997799999999999982958999 No 121 >KOG4702 consensus Probab=38.83 E-value=13 Score=11.70 Aligned_cols=20 Identities=15% Similarity=0.064 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781029|r 150 VPILLFFSFFIMISRARDMH 169 (1828) Q Consensus 150 ~p~~~~~~~a~~~~r~~~~~ 169 (1828) -||.+||+|-+--|=--.|+ T Consensus 17 FPV~~FwlfNQpe~Fee~v~ 36 (77) T KOG4702 17 FPVGAFWLFNQPEIFEEFVR 36 (77) T ss_pred HHHHHHHHCCCHHHHHHHHH T ss_conf 16788997068089999998 No 122 >PRK10699 phosphatidylglycerophosphatase B; Provisional Probab=38.68 E-value=13 Score=11.68 Aligned_cols=58 Identities=9% Similarity=0.052 Sum_probs=24.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH------HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 187322799999999999999999999831621------455576564064889999999999999999999 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKFYTNS------PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFF 159 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a 159 (1828) +|-+-....+..+=.++|+.| |..+|+. .+.-+..-.+.|-+|. ..++.-..|+|.+- T Consensus 6 kR~~l~tlLLlliP~~~w~sg-----WqWqp~~~~~~~k~LfwlTeTgt~Pygii---T~vvl~~~f~~~lR 69 (254) T PRK10699 6 KRTAVGAALLLVMPLAVWISG-----WQWQPGEQSWWLKGLFWVTETVTQPWGVL---THVLLCGWFLWCLR 69 (254) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----CCCCCCCCCCHHHHHEEEECCCCCCHHHH---HHHHHHHHHHHHHH T ss_conf 999999999999999999970-----71677777343421214541788645999---99999999999997 No 123 >pfam09472 MtrF Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF). Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C terminal region of what appears to be a fusion of the MtrA and MtrF proteins. The function of these proteins is unknown, though it is likely Probab=38.37 E-value=13 Score=11.64 Aligned_cols=11 Identities=36% Similarity=0.990 Sum_probs=7.0 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254781029|r 146 VVSVVPILLFF 156 (1828) Q Consensus 146 ~~~~~p~~~~~ 156 (1828) +.+++|++|+| T Consensus 54 vlv~ip~~l~~ 64 (64) T pfam09472 54 LLVLIPILLLW 64 (64) T ss_pred HHHHHHHHHHC T ss_conf 99999999829 No 124 >KOG3514 consensus Probab=38.30 E-value=13 Score=11.64 Aligned_cols=22 Identities=18% Similarity=0.268 Sum_probs=10.5 Q ss_pred HHHHHHHHHCCCHHHHHHHHHH Q ss_conf 7999998718013899999975 Q gi|254781029|r 1723 LVENISKFIDYDAFVQLWKSYT 1744 (1828) Q Consensus 1723 ~s~di~~~~~~~~~~~~w~~y~ 1744 (1828) .-+.|--+.---+..=+|.-|. T Consensus 1517 mVvGIvaAaaLcILilL~am~k 1538 (1591) T KOG3514 1517 MVVGIVAAAALCILILLYAMYK 1538 (1591) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 2467899999999999861002 No 125 >pfam09678 Caa3_CtaG Cytochrome c oxidase caa3 assembly factor (Caa3_CtaG). Members of this family are the CtaG protein required for assembly of active cytochrome c oxidase of the caa3 type, as found in Bacillus subtilis. Probab=38.13 E-value=13 Score=11.61 Aligned_cols=69 Identities=13% Similarity=0.199 Sum_probs=33.9 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 985418732279999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 90 QNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 90 ~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ..+.+|..+-|.|=+ +.|.+|.+.+.+... +++...+....+.-.+--++...|+|+++..+.-.-..+ T Consensus 9 ~rlr~~g~~wp~~R~----~~f~~G~~~l~~a~~--spl~~~~~~lfs~HMvqhllL~~vaPpLLvlg~P~~l~~ 77 (241) T pfam09678 9 RRLRRRGDRWPVWRT----VAFLLGLALLYLATS--SPLDAYGHALFSAHMVQHLLLSMVAPPLLVLGAPVTLLL 77 (241) T ss_pred HHHHHCCCCCCHHHH----HHHHHHHHHHHHHHC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 999875799984899----999999999999980--749888888899999999999999999999528099999 No 126 >pfam11023 DUF2614 Protein of unknown function (DUF2614). This is a family of proteins conserved in the Bacillaceae family. Some members are annotated as being protein YgzB. The function is not known. Probab=37.61 E-value=13 Score=11.55 Aligned_cols=48 Identities=17% Similarity=0.301 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999998316214555765640648899999999999999999 Q gi|254781029|r 104 AFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 104 ~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) +.+.|++.++-++-+.-.+|..+++ +..++..-|.| ..+.--|+.||. T Consensus 13 t~aL~lvfiG~~vMy~G~ff~~~~i--~m~ifmi~G~l----~ii~StvvYFwi 60 (114) T pfam11023 13 TFALSLVFIGFIVMYGGIFFRTSPI--IMVIFMILGFL----CIIASTAVYFWI 60 (114) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHH--HHHHHHHHHHH----HHHHHHHHHHHH T ss_conf 9999999999999999998615799--99999999999----999999999999 No 127 >PRK02251 putative septation inhibitor protein; Reviewed Probab=37.46 E-value=13 Score=11.54 Aligned_cols=28 Identities=7% Similarity=-0.057 Sum_probs=21.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 2279999999999999999999983162 Q gi|254781029|r 98 NIALRNAFMSSLIWLACELGFLIKFYTN 125 (1828) Q Consensus 98 ~~~~~~~~~~s~~w~~~~~~~~~~~~~~ 125 (1828) .+|-|++.++=++++.|-+++...+..+ T Consensus 26 ~~P~W~~p~m~~lm~iGL~Wivv~Yis~ 53 (84) T PRK02251 26 VNPRWFAPVMVALMIIGLIWLVVYYVTS 53 (84) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9984089999999999899984531136 No 128 >COG5617 Predicted integral membrane protein [Function unknown] Probab=37.46 E-value=13 Score=11.54 Aligned_cols=93 Identities=12% Similarity=0.073 Sum_probs=47.6 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHH-HHHH--HHHHHCHHH---------HHHHHHHHHHHH Q ss_conf 8989998541873227999999999999999999998316214-5557--656406488---------999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSP-MDSF--YDFAMRPEV---------ITLFLVVSVVPI 152 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~---------~~~~~~~~~~p~ 152 (1828) ||.++-.+-+.++.+=-|+...++ -+++|++++|.+.+-.. +..+ -=++++|.- |--+.+..++|. T Consensus 77 i~aal~~l~~d~~~t~~v~~~la~--llG~~~~~~~r~~g~t~~ia~I~alL~ltsp~~l~vlf~EGniP~v~~i~f~pl 154 (801) T COG5617 77 IGAALNFLLGDVVTTYAVFLMLAF--LLGAGGWLLWRLRGRTGFIALISALLWLTSPENLKVLFIEGNIPRVLAIGFGPL 154 (801) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHH--HHHHHHHHHHHHHCCCCCHHHHHHHHHHHCHHHEEEEEECCCCCHHHHHHHHHH T ss_conf 999987762585699999999999--998799999876145550479999999818434688773476418799999999 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHH Q ss_conf 9999999999--99999999999999999 Q gi|254781029|r 153 LLFFSFFIMI--SRARDMHDASQSIAGIA 179 (1828) Q Consensus 153 ~~~~~~a~~~--~r~~~~~~~a~~m~~~a 179 (1828) +|++.-..|- +|..++--+|=-|+=|+ T Consensus 155 ~l~~l~~~~~~Gkk~r~~l~~allmslv~ 183 (801) T COG5617 155 ALGLLERFLERGKKERSLLRMALLMSLVL 183 (801) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99999999980735888999999999999 No 129 >pfam02683 DsbD Cytochrome C biogenesis protein transmembrane region. This family consists of the transmembrane (i.e. non-catalytic) region of Cytochrome C biogenesis proteins also known as disulphide interchange proteins. These proteins posses a protein disulphide isomerase like domain that is not found within the aligned region of this family. Probab=37.33 E-value=13 Score=11.52 Aligned_cols=72 Identities=15% Similarity=0.134 Sum_probs=31.1 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87322799999999999999999999831621455576564064889999999999999999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDM 168 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~ 168 (1828) ++...+|+....++++|.-+...+.-.... ...+.++....-..+.++..+...|.+++-.+.....+...+ T Consensus 111 ~~~~g~Fl~G~~~~l~~~PC~~P~l~~iL~--~aa~~g~~~~g~~~l~~y~lG~~lP~ll~~~~~~~l~~~~~~ 182 (206) T pfam02683 111 GPMLGAFLLGMLFALGWTPCIGPILASVLA--LAASSGSLLLAALLLVVYVLGLGIPFLLASLFAGSLLFLRWL 182 (206) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHH--HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 850679999999998735453579999999--998489689999999999999999999999999999875777 No 130 >COG4252 Predicted transmembrane sensor domain [Signal transduction mechanisms] Probab=37.27 E-value=13 Score=11.51 Aligned_cols=38 Identities=8% Similarity=0.084 Sum_probs=23.0 Q ss_pred HHHHHHHH-HCCC--CCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89899985-4187--32279999999999999999999983 Q gi|254781029|r 85 ITHILQNI-GKNS--LNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 85 ~~~~~~~l-~~~~--s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) +-+++.++ .+|| ..-|-|+..+.-.+|...|+.++|.. T Consensus 303 vsQilsa~ldgR~ll~~w~~~~e~l~i~~w~~~g~~~aw~~ 343 (400) T COG4252 303 VSQILSALLDGRPLLPVWPDGAELLWIFAWSLLGGLLAWRL 343 (400) T ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999988558951002177999999999999988872111 No 131 >pfam06791 TMP_2 Prophage tail length tape measure protein. This family represents a conserved region located towards the N-terminal end of prophage tail length tape measure protein (TMP). TMP is important for assembly of phage tails and involved in tail length determination. Mutated forms TMP cause tail fibres to be shortened. Probab=37.16 E-value=13 Score=11.50 Aligned_cols=69 Identities=7% Similarity=-0.034 Sum_probs=28.6 Q ss_pred CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 7798989998541873227999999999999999999998316214555765640648899999999999999 Q gi|254781029|r 82 NGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILL 154 (1828) Q Consensus 82 r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 154 (1828) -.+.++.-.+++. .|+=++-|+.-|=-+-..+.....-++..-.+++++...-..++.+++.++.|+-+ T Consensus 11 g~Sa~q~~~Amr~----lpaQ~tDI~~sLa~Gq~p~~Vl~QQGgQikd~fGGig~a~~~~~~~i~~~v~p~~~ 79 (210) T pfam06791 11 GKSAKQYAAAMRG----LPAQITDIAVSLAGGQSPLTVLLQQGGQLKDMFGGIGPAAKALGGYILGLVNPYTL 79 (210) T ss_pred CCCHHHHHHHHHC----CCHHHHHHHHHHCCCCCHHHHHHHHCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 7249999999876----55799999999806997999999874118874177699999999999999989999 No 132 >KOG0916 consensus Probab=36.99 E-value=13 Score=11.48 Aligned_cols=29 Identities=10% Similarity=0.188 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999 Q gi|254781029|r 246 ITQNLKQEREAIINHGTQLCTSIAEVHES 274 (1828) Q Consensus 246 L~~el~~ere~i~~~ae~~r~~i~~~~e~ 274 (1828) |-.+.+-+++++.++-|++--.+++++.. T Consensus 222 l~~~fGFq~~nv~nqReh~il~lan~~~r 250 (1679) T KOG0916 222 LDDEFGFQNMNVANQREHLILKLANAQIR 250 (1679) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHCCHHHH T ss_conf 48774621230344566788874033432 No 133 >COG2962 RarD Predicted permeases [General function prediction only] Probab=36.98 E-value=13 Score=11.48 Aligned_cols=18 Identities=17% Similarity=0.307 Sum_probs=11.8 Q ss_pred CCCHHHHHHHHHHHHHHH Q ss_conf 732279999999999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLA 113 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~ 113 (1828) -|+.-+.++..++++|.. T Consensus 4 ~~~~Gil~~l~Ay~lwG~ 21 (293) T COG2962 4 DSRKGILLALLAYLLWGL 21 (293) T ss_pred CCCCHHHHHHHHHHHHHH T ss_conf 432221699999999989 No 134 >PRK00159 putative septation inhibitor protein; Reviewed Probab=36.87 E-value=14 Score=11.47 Aligned_cols=31 Identities=3% Similarity=-0.068 Sum_probs=22.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 8732279999999999999999999983162 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTN 125 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~ 125 (1828) .+..+|-|++.++=++++.|-+++...+..+ T Consensus 25 ~~~~sp~W~~p~m~~lmliGL~Wivv~Yis~ 55 (87) T PRK00159 25 KVGPSSVWYVVLMLGLMLIGLAWLMVFYLAG 55 (87) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 6899985099999999999899998986515 No 135 >PRK12438 hypothetical protein; Provisional Probab=36.72 E-value=14 Score=11.45 Aligned_cols=25 Identities=12% Similarity=0.258 Sum_probs=9.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999999999999999 Q gi|254781029|r 140 VITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 140 ~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) -+++|+++.++=.+++|+...+.+| T Consensus 62 ri~Lf~~~~l~~~~~v~~~l~lA~R 86 (979) T PRK12438 62 RLAIVAAVALVVAGIVLAALLLAYR 86 (979) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999984 No 136 >PRK10489 enterobactin exporter EntS; Provisional Probab=36.44 E-value=14 Score=11.42 Aligned_cols=70 Identities=9% Similarity=-0.032 Sum_probs=42.2 Q ss_pred HHCCCCCHHHHHHHHHHHH--HHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q ss_conf 5418732279999999999--99999999998316214555765640648899999999999999999-99999999999 Q gi|254781029|r 92 IGKNSLNIALRNAFMSSLI--WLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFS-FFIMISRARDM 168 (1828) Q Consensus 92 l~~~~s~~~~~~~~~~s~~--w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~-~a~~~~r~~~~ 168 (1828) |..+|.=.-||++.++|.+ |+. .+++.|..| +...+|..+|++..+..+|.++|-. ...+++|-.-= T Consensus 13 Lr~nr~Fr~l~~~~~~S~lG~~~~-~iAl~~~v~---------~~TgS~~~~glv~~~~~lp~ll~~~~~GvlaDR~~Rr 82 (416) T PRK10489 13 LKTHPAFRAVFLARFISIVSLGLL-GVAVPVQIQ---------MMTHSTWQVGLSVTLTGGAMFVGLMVGGVLADRYERK 82 (416) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHH-HHHHHHHHH---------HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 204926999999999999999999-999999999---------8859999999999999999999999999999944842 Q ss_pred HHH Q ss_conf 999 Q gi|254781029|r 169 HDA 171 (1828) Q Consensus 169 ~~~ 171 (1828) |++ T Consensus 83 ~vm 85 (416) T PRK10489 83 KVI 85 (416) T ss_pred HHH T ss_conf 999 No 137 >PRK10952 glycine betaine transporter membrane protein; Provisional Probab=36.43 E-value=14 Score=11.42 Aligned_cols=108 Identities=8% Similarity=0.178 Sum_probs=54.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7322799999999999999999999831621--45557656406488999999999999999999999999999999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYTNS--PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) +.-.||.-+.+==.-|+--|+-.....|++- .+...-++... +.-.++ ..+=|++++.+|+.+.|+..-.|+..= T Consensus 52 ~~~~~~~~~~iPl~~Wv~~~vdWl~~~f~p~F~~I~~~i~~~l~-~~~~~L--~~~P~~~~i~~~~~law~~~g~~la~~ 128 (354) T PRK10952 52 NILDPFHKTLIPLDSWVTEGIDWVVTHFRPVFQGIRVPVDYILN-GFQQLL--LGMPAPVAIIVFALIAWQISGVGMGVA 128 (354) T ss_pred CCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--CCCCHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 63265642771679999999999999859999999999999999-999998--029889999999999999804799999 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999997309255569999999999999999723 Q gi|254781029|r 174 SIAGIALRLIDPEEYSSEKMQSISSAVRKEIVLMT 208 (1828) Q Consensus 174 ~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev~a~~ 208 (1828) .. +.+=|.--=..=.+.|.|+.--+--=+-++- T Consensus 129 t~--~~l~~I~~~G~W~~aM~TLalvlvav~~~~i 161 (354) T PRK10952 129 TL--ISLIAIGAIGAWSQAMVTLALVLTALLFCIV 161 (354) T ss_pred HH--HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99--9999999966329999999999999999999 No 138 >PRK09821 inner membrane permease YgbN; Provisional Probab=36.35 E-value=14 Score=11.41 Aligned_cols=41 Identities=12% Similarity=0.157 Sum_probs=30.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 99999999999999999999999999999999999973092 Q gi|254781029|r 145 LVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLIDP 185 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~~p 185 (1828) +|..+.-|.+.-+|..|+=+-=|---.|..|++..+|+-=+ T Consensus 58 fG~tlg~igiiI~lG~~iG~~Le~SGaa~~ia~~~l~~~G~ 98 (454) T PRK09821 58 MGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFTRKLGD 98 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf 88888999999999999999999848099999999998488 No 139 >PRK13105 ubiA prenyltransferase; Reviewed Probab=36.25 E-value=14 Score=11.39 Aligned_cols=43 Identities=19% Similarity=0.123 Sum_probs=25.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCHH-------HHHHH--HHHHHCHHHHHHHHHH Q ss_conf 9999999999999999831621-------45557--6564064889999999 Q gi|254781029|r 105 FMSSLIWLACELGFLIKFYTNS-------PMDSF--YDFAMRPEVITLFLVV 147 (1828) Q Consensus 105 ~~~s~~w~~~~~~~~~~~~~~~-------~~~~~--~~~~~~~~~~~~~~~~ 147 (1828) ...+.+|+++++++...+..|. .+.|+ +.-+..|...|..+++ T Consensus 112 ~~~~~~~l~~~~f~~vaYSaPpLRFKe~P~LDS~ss~~~~v~pa~~~~~l~g 163 (291) T PRK13105 112 TWMSWAVLAVSVFAVIAYSAAGLRFKERPFLDALTSSTHFTSPALYGLTLAG 163 (291) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 7889999999999985025886034666553211221156448999999807 No 140 >KOG2533 consensus Probab=36.18 E-value=14 Score=11.39 Aligned_cols=15 Identities=20% Similarity=0.053 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254781029|r 152 ILLFFSFFIMISRAR 166 (1828) Q Consensus 152 ~~~~~~~a~~~~r~~ 166 (1828) +++=|.+-.|..|-. T Consensus 95 ii~~~p~~~L~~r~~ 109 (495) T KOG2533 95 IIGQFPSGLLGDRFP 109 (495) T ss_pred HHHHHHHHHHHHHCC T ss_conf 999856999987478 No 141 >COG3838 VirB2 Type IV secretory pathway, VirB2 components (pilins) [Intracellular trafficking and secretion] Probab=36.01 E-value=14 Score=11.37 Aligned_cols=37 Identities=19% Similarity=0.299 Sum_probs=17.3 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 45557656406488999999999999999999999999 Q gi|254781029|r 127 PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++..+-+.++.|-.-.+ ++.+|+-+-+-|.|..+-|| T Consensus 41 ~lq~I~~sltGp~A~~i-AVi~Ii~~Gi~~~fG~~~~~ 77 (108) T COG3838 41 PLQQILQSLTGPVAKTI-AVIGIIAIGIMLIFGRLDWR 77 (108) T ss_pred HHHHHHHHHHCHHHHHH-HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999636088999-99999999999998450167 No 142 >COG2456 Uncharacterized conserved protein [Function unknown] Probab=35.48 E-value=14 Score=11.30 Aligned_cols=82 Identities=16% Similarity=0.368 Sum_probs=45.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87322799999999999999999999831621455576564064-88999999999999999999999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRP-EVITLFLVVSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) -||-..+|.+. |+. -+.+++||+ .-+-++.+++.. |+=++++.++.+.-.|.|-+-.|+.+-+ |-+.. T Consensus 36 s~~~ll~W~~~-----wv~---vlifal~P~-fs~~Ia~ilGlGRGlDaL~vitI~~ayyLlfrlYl~Idkie--reITe 104 (121) T COG2456 36 SLSDLLFWEAF-----WVF---VLIFALFPE-FSGEIAEILGLGRGLDALFVITIGLAYYLLFRLYLDIDKIE--REITE 104 (121) T ss_pred CCCCHHHHHHH-----HHH---HHHHHHCCH-HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH T ss_conf 82105999999-----999---999995622-88999999563656430467999999999999999987899--99999 Q ss_pred HHHHHHHHHCCHHH Q ss_conf 99999997309255 Q gi|254781029|r 174 SIAGIALRLIDPEE 187 (1828) Q Consensus 174 ~m~~~a~rl~~p~~ 187 (1828) -..|.|+|+-|-|+ T Consensus 105 lvreIAiR~~~~~~ 118 (121) T COG2456 105 LVREIAIRLKEEED 118 (121) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999886011311 No 143 >COG5530 Predicted integral membrane protein [Function unknown] Probab=35.34 E-value=14 Score=11.29 Aligned_cols=27 Identities=30% Similarity=0.220 Sum_probs=14.8 Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 309255569999999999999999723 Q gi|254781029|r 182 LIDPEEYSSEKMQSISSAVRKEIVLMT 208 (1828) Q Consensus 182 l~~p~~~a~~~~~~~~qa~r~ev~a~~ 208 (1828) |.---+.++|=-..|+-.+|.----|+ T Consensus 126 l~p~~d~~~~W~~~v~~~Lr~P~flm~ 152 (247) T COG5530 126 LGPKVDAGSDWHQLVLAELRKPRFLMT 152 (247) T ss_pred HCCCCCCCHHHHHHHHHHHCCCCCCCC T ss_conf 289998521089999998627744120 No 144 >pfam11155 DUF2935 Protein of unknown function (DUF2935). This family of proteins with unknown function appears to be restricted to Firmicutes. Probab=35.00 E-value=14 Score=11.25 Aligned_cols=21 Identities=24% Similarity=0.463 Sum_probs=12.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 788789978999999999999 Q gi|254781029|r 1773 ADSALRNAIDRYISNFEEMLS 1793 (1828) Q Consensus 1773 ~d~~f~~~v~ry~~~fe~~l~ 1793 (1828) ++.++-..+|.|+.+|++||. T Consensus 23 ~e~~li~~A~~F~~~Fd~Ll~ 43 (44) T pfam11155 23 KEKELIEKADDFKKEFEQLLL 43 (44) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 479999999999999999972 No 145 >PRK10522 multidrug transporter membrane component/ATP-binding component; Provisional Probab=34.83 E-value=14 Score=11.23 Aligned_cols=14 Identities=21% Similarity=0.238 Sum_probs=6.8 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 79999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLA 113 (1828) Q Consensus 100 ~~~~~~~~s~~w~~ 113 (1828) ||+|..+++++-.. T Consensus 10 ~yr~~~i~~~~l~~ 23 (547) T PRK10522 10 QYRWPFISVMALSL 23 (547) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 146 >pfam08405 Calici_PP_N Viral polyprotein N-terminal. This domain is found at the N-terminus of non-structural viral polyproteins of the Caliciviridae subfamily. Probab=34.75 E-value=14 Score=11.22 Aligned_cols=11 Identities=27% Similarity=0.413 Sum_probs=4.3 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254781029|r 106 MSSLIWLACEL 116 (1828) Q Consensus 106 ~~s~~w~~~~~ 116 (1828) +.+-=|...|+ T Consensus 171 ls~CDWTF~Gi 181 (358) T pfam08405 171 LSSCDWTFMGI 181 (358) T ss_pred HHHCCCCHHHH T ss_conf 87088513769 No 147 >KOG3142 consensus Probab=34.20 E-value=15 Score=11.15 Aligned_cols=51 Identities=22% Similarity=0.240 Sum_probs=19.6 Q ss_pred HHHHHHHH------HHHHHHHHH---HCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999------999999998---31621455576564064889999999999999999 Q gi|254781029|r 105 FMSSLIWL------ACELGFLIK---FYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFF 156 (1828) Q Consensus 105 ~~~s~~w~------~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 156 (1828) .++|++|- .+++..+|. ++.++++--.+.-+..-..++++. .+-+|++||= T Consensus 81 ~~~sLi~~P~~Livl~~lv~~w~~LY~~rd~pLvlfgr~i~d~~~l~~L~-~~ti~~lflt 140 (187) T KOG3142 81 LFLSLITHPLSLIVLLALVAAWLFLYFLRDEPLVLFGRQISDREVLIGLV-LITIPVLFLT 140 (187) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHEEEECCCCEEEEEEEECCCCHHHHHH-HHHHHHHHHH T ss_conf 99999986799999999999988403425898588407836822134379-9999999995 No 148 >KOG4050 consensus Probab=34.09 E-value=15 Score=11.14 Aligned_cols=21 Identities=24% Similarity=0.393 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999 Q gi|254781029|r 144 FLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 144 ~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++-++..||+|.|+-|++--| T Consensus 133 ~~F~il~Pv~L~lvHASLRLR 153 (188) T KOG4050 133 FAFAILFPVLLVLVHASLRLR 153 (188) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999983 No 149 >pfam02447 GntP_permease GntP family permease. This is a family of integral membrane permeases that are involved in gluconate uptake. E. coli contains several members of this family including GntU a low affinity transporter and GntT, a high affinity transporter. Probab=33.98 E-value=15 Score=11.12 Aligned_cols=39 Identities=8% Similarity=0.113 Sum_probs=26.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999999999999999730 Q gi|254781029|r 145 LVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLI 183 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~ 183 (1828) +|..+.-|.+.-+|..|+=+-=|--..|+.|++.-+|+- T Consensus 54 fG~tlg~igivI~lG~iiG~~Le~SGaa~~IA~~il~~~ 92 (441) T pfam02447 54 FGGTLGSLAIIIGLGAMLGKLLADSGGAERIATTLIKKF 92 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 989989999999999999999998391999999999985 No 150 >pfam05579 Peptidase_S32 Equine arteritis virus serine endopeptidase S32. Serine peptidases involved in processing nidovirus polyprotein. Probab=33.62 E-value=15 Score=11.08 Aligned_cols=11 Identities=9% Similarity=0.416 Sum_probs=6.0 Q ss_pred CCHHHHHHHHH Q ss_conf 79898999854 Q gi|254781029|r 83 GGITHILQNIG 93 (1828) Q Consensus 83 ~~~~~~~~~l~ 93 (1828) .+..+++-++- T Consensus 169 sdLa~ll~s~p 179 (426) T pfam05579 169 SDLAALLESLP 179 (426) T ss_pred HHHHHHHHCCC T ss_conf 66999984387 No 151 >KOG4580 consensus Probab=33.50 E-value=15 Score=11.07 Aligned_cols=20 Identities=20% Similarity=0.399 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781029|r 150 VPILLFFSFFIMISRARDMH 169 (1828) Q Consensus 150 ~p~~~~~~~a~~~~r~~~~~ 169 (1828) +-...||.+-...||++-.| T Consensus 56 aif~~~ya~~lYlwRa~~I~ 75 (112) T KOG4580 56 AIFCGFYALFLYLWRASMIR 75 (112) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999998 No 152 >PRK13453 F0F1 ATP synthase subunit B; Provisional Probab=33.35 E-value=15 Score=11.05 Aligned_cols=13 Identities=15% Similarity=-0.057 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q gi|254781029|r 151 PILLFFSFFIMIS 163 (1828) Q Consensus 151 p~~~~~~~a~~~~ 163 (1828) =++|||++--++| T Consensus 28 F~il~~ll~kf~~ 40 (173) T PRK13453 28 FIVLLALLKKFAW 40 (173) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999989 No 153 >TIGR02141 modB_ABC molybdate ABC transporter, permease protein; InterPro: IPR011867 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry describes the permease protein, ModB, of the molybdate ABC transporter. This system has been characterised in Escherichia coli , Staphylococcus carnosus Rhodobacter capsulatus and Azotobacter vinlandii . Molybdate is chemically similar to sulphate, thiosulphate, and selenate. These related substrates, and sometimes molybdate itself, can be transported by the homologous sulphate receptor. Some apparent molybdenum transport operons include a permease related to this ModB, although less similar than some sulphate permease proteins and not included in this model.; GO: 0015098 molybdate ion transmembrane transporter activity, 0015689 molybdate ion transport, 0016021 integral to membrane. Probab=33.31 E-value=15 Score=11.04 Aligned_cols=24 Identities=17% Similarity=0.224 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999999998316 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYT 124 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~ 124 (1828) ..+|++.-++-...|+.++|.++. T Consensus 4 lk~A~~~Tl~~~~LGi~~A~~La~ 27 (212) T TIGR02141 4 LKVAVIATLLLLPLGIPVAWLLAR 27 (212) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999988 No 154 >PRK09705 cynX putative cyanate transporter; Provisional Probab=33.09 E-value=15 Score=11.02 Aligned_cols=65 Identities=14% Similarity=0.138 Sum_probs=34.4 Q ss_pred CCCCHHHHHHHHHHHHHHH-HHHHHHHHHCCHHHHHHHHHHH-HCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8732279999999999999-9999999831621455576564-064889999999999999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLA-CELGFLIKFYTNSPMDSFYDFA-MRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) +|||.|.|+..+. +++. ..+=......+| .+..+.+-+ .+....|+ .. . +|++.|=.++.+.-+- T Consensus 1 ~p~~~~~~l~l~~--i~lva~nLR~~itsv~P-ll~~I~~dl~ls~~~~Gl-Lt-t-lP~l~fgl~s~~a~~l 67 (393) T PRK09705 1 KPSKFPLNFMLLV--LVLIGLNMRPLLTSVGP-LLPQLRQASGMSFSVAAL-LT-A-LPVVTMGGLALAGSWL 67 (393) T ss_pred CCCHHHHHHHHHH--HHHHHHHCCHHHHCCHH-HHHHHHHHHCCCHHHHHH-HH-H-HHHHHHHHHHHHHHHH T ss_conf 9971679999999--99999705145413052-199999881998889999-99-9-9999999998878999 No 155 >pfam05915 DUF872 Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterized eukaryotic proteins. The function of this family is unknown. Probab=32.66 E-value=15 Score=10.96 Aligned_cols=62 Identities=10% Similarity=0.081 Sum_probs=25.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHH-HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999831621-455576564064889999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNS-PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) +.-.+++-++-+.|.+.+..+.+-.. .+.+ ...-+-|-++.+++ .++|=+----.|...||+ T Consensus 38 ~kaI~lA~~Lli~G~~Ll~~g~~~~~~~~~~-~~~~~~~~l~lG~l--~fiPG~Y~~~iay~a~rG 100 (112) T pfam05915 38 WKVIAAALILLLLGAVLLVIGLFIFANPMEG-DASQGYVFLFLGIL--LFIPGFYHVRLAYYAYKG 100 (112) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CCCCHHHHHHHHHH--HHCCCHHHHHHHHHHHCC T ss_conf 8999999999999999999999996156688-87500387998999--826645421122222028 No 156 >COG3190 FliO Flagellar biogenesis protein [Cell motility and secretion] Probab=32.60 E-value=16 Score=10.96 Aligned_cols=22 Identities=9% Similarity=0.155 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254781029|r 145 LVVSVVPILLFFSFFIMISRAR 166 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~~~r~~ 166 (1828) ++..+.=++||.+++++++|-+ T Consensus 28 ~gsL~~iL~lil~~~wl~kr~~ 49 (137) T COG3190 28 FGSLILILALILFLAWLVKRLG 49 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999984 No 157 >PRK10591 hypothetical protein; Provisional Probab=32.58 E-value=16 Score=10.95 Aligned_cols=33 Identities=15% Similarity=0.201 Sum_probs=16.2 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 76564064889999999999999999999999999 Q gi|254781029|r 131 FYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) +.+.++.|+..-+ -.+++=..|+=....++||+ T Consensus 35 lp~~l~~~~a~i~--Mif~GI~lmiPAav~ivWR~ 67 (96) T PRK10591 35 LPEPFNTPTAAIL--MIFLGVLLMLPAAVVIIWRV 67 (96) T ss_pred CCHHHCCCHHHHH--HHHHHHHHHHHHHHHHHHHH T ss_conf 7633248308999--99999999748999999999 No 158 >PRK06638 NADH dehydrogenase subunit J; Provisional Probab=32.38 E-value=16 Score=10.93 Aligned_cols=27 Identities=19% Similarity=0.523 Sum_probs=13.1 Q ss_pred HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65640648899999999999999999999999 Q gi|254781029|r 132 YDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) .+|.+.-+.+ +++|+ |+.+|.|++|.- T Consensus 51 AeFla~iqIi-VYaGA----I~VLfLFvIMll 77 (196) T PRK06638 51 AEFLGVLQII-VYVGA----VMVLFLFVVMML 77 (196) T ss_pred CHHHHHHHHH-HHHHH----HHHHHHHHHHHC T ss_conf 4699999999-99999----999999999981 No 159 >COG0785 CcdA Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones] Probab=32.35 E-value=16 Score=10.93 Aligned_cols=65 Identities=14% Similarity=0.099 Sum_probs=30.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2279999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 98 NIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 98 ~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) -.+|-+-..++++|.=+-..+.-.... ...+.++.......+..+..+..+|++++-.|...+.+ T Consensus 127 ~~~f~lGl~f~~~wtPC~gPil~sil~--laa~~~~~~~g~~ll~~Y~lGl~lP~~~~~~~~~~~~~ 191 (220) T COG0785 127 LGAFLLGLLFALGWTPCIGPILGSILA--LAASTGSVVLGALLLAAYALGLALPFLLLALLSGRALK 191 (220) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 678899999999985544679999999--99748846899999999999989999999999999999 No 160 >pfam03116 NQR2_RnfD_RnfE NQR2, RnfD, RnfE family. This family of bacterial proteins includes a sodium-translocating NADH-ubiquinone oxidoreductase (i.e. a respiration linked sodium pump). In Vibrio cholerae, it negatively regulates the expression of virulence factors through inhibiting (by an unknown mechanism) the transcription of the transcriptional activator ToxT. The family also includes proteins involved in nitrogen fixation, RnfD and RnfE. The similarity of these proteins to NADH-ubiquinone oxidoreductases was previously noted. Probab=32.21 E-value=16 Score=10.91 Aligned_cols=50 Identities=6% Similarity=0.048 Sum_probs=26.4 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHH Q ss_conf 898999854187322799999999999999999999831621455576564064889999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLF 144 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (1828) +..+|-+|- =|...|+|...+.+++ ++.+....||+ ++.-+.+|.++|-. T Consensus 77 vTglLlal~-lPp~~P~w~~~ig~~f----aIvigK~~fGG-----lG~N~fNPAlvgra 126 (319) T pfam03116 77 VTGLLLALI-LPPTAPWWMVAIGTVF----AIVIGKQVFGG-----LGQNIFNPALVGRA 126 (319) T ss_pred HHHHHHHHH-CCCCCCHHHHHHHHHH----HHHHHHHHHCC-----CCCCCCCHHHHHHH T ss_conf 999999986-5788858999999999----99989983178-----68885689999999 No 161 >pfam06644 ATP11 ATP11 protein. This family consists of several eukaryotic ATP11 proteins. In Saccharomyces cerevisiae, expression of functional F1-ATPase requires two proteins encoded by the ATP11 and ATP12 genes. Atp11p is a molecular chaperone of the mitochondrial matrix that participates in the biogenesis pathway to form F1, the catalytic unit of the ATP synthase. Probab=32.04 E-value=16 Score=10.89 Aligned_cols=58 Identities=12% Similarity=0.292 Sum_probs=39.5 Q ss_pred CCCCCCHHHH-HHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 7788421579-99998718013899999975074562067741252599899999985078878 Q gi|254781029|r 1715 DSLSSIDSLV-ENISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSAL 1777 (1828) Q Consensus 1715 ~~~~~l~s~s-~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f 1777 (1828) .+..+|+|.- ++-.+-++++-...+|+.|..|... |.--.--.+|+.|-.+=+.-|-| T Consensus 69 ~~~K~L~s~l~leki~~l~~~eI~~iW~~~h~~~~~-----l~AviP~~~y~~m~~~A~~~P~F 127 (250) T pfam06644 69 KPIKPLSSILDVEKIKDLSAEEIEFIWRAYHAKKDS-----LCAVIPAETYELMEARAKECPQF 127 (250) T ss_pred CCCCCHHHHHCHHHHCCCCHHHHHHHHHHHHCCCCE-----EEEEECHHHHHHHHHHHHHCCCE T ss_conf 898849998676987018999999999998278885-----99980599999999999759914 No 162 >COG1173 DppC ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=31.41 E-value=16 Score=10.81 Aligned_cols=55 Identities=25% Similarity=0.354 Sum_probs=28.7 Q ss_pred CCHHHHHHHHHHHHHHHH--HHHHHHHHCCH---H-HHHHHHHHHHCHHHHH-HHHHHHHHH Q ss_conf 322799999999999999--99999983162---1-4555765640648899-999999999 Q gi|254781029|r 97 LNIALRNAFMSSLIWLAC--ELGFLIKFYTN---S-PMDSFYDFAMRPEVIT-LFLVVSVVP 151 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~--~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~-~~~~~~~~p 151 (1828) .|....++.++.++++.. .+|+..++|++ . .++-..-+...|..+= +++++++.| T Consensus 85 ~r~SL~I~~~~~~~~~~iG~~lG~iaGy~gG~vD~ilmri~di~ls~P~lll~i~l~~~lg~ 146 (289) T COG1173 85 ARISLLIGLLAVLISLVIGTLLGLLAGYFGGWVDRVLMRITDIFLAFPSLLLAILLVAILGP 146 (289) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC T ss_conf 99999999999999999999999999880761999999999999976599999999999772 No 163 >COG1743 Adenine-specific DNA methylase containing a Zn-ribbon [DNA replication, recombination, and repair] Probab=31.29 E-value=16 Score=10.79 Aligned_cols=27 Identities=11% Similarity=0.234 Sum_probs=13.7 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCCCH Q ss_conf 8788877556779898999854187322 Q gi|254781029|r 72 SGFYPANDLDNGGITHILQNIGKNSLNI 99 (1828) Q Consensus 72 ~~~~pa~~d~r~~~~~~~~~l~~~~s~~ 99 (1828) .++.|-+. |-..++..++...++|-+. T Consensus 51 ~sL~P~d~-d~~~~~~~l~~~~~~~h~~ 77 (875) T COG1743 51 ASLLPEDT-DENAFLSSLRLSEGEPHRL 77 (875) T ss_pred HHHCCCCC-CHHHHHHHHHCCCCCCCCC T ss_conf 86477767-7677655542037887876 No 164 >COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion] Probab=30.99 E-value=16 Score=10.76 Aligned_cols=27 Identities=19% Similarity=0.392 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999 Q gi|254781029|r 147 VSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 147 ~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) .++.+|+||-+|-.|+||-|-=|+-++ T Consensus 11 ~ll~~vl~~~ifyFli~RPQrKr~K~~ 37 (97) T COG1862 11 LLLPLVLIFAIFYFLIIRPQRKRMKEH 37 (97) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 999999999999986167889999999 No 165 >pfam06966 DUF1295 Protein of unknown function (DUF1295). This family contains a number of bacterial and eukaryotic proteins of unknown function that are approximately 300 residues long. Probab=30.92 E-value=16 Score=10.75 Aligned_cols=14 Identities=7% Similarity=0.007 Sum_probs=8.3 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 79999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLA 113 (1828) Q Consensus 100 ~~~~~~~~s~~w~~ 113 (1828) .-++.+++-.+|.. T Consensus 39 r~~l~~~lv~lW~~ 52 (235) T pfam06966 39 RRMLVTLLVTLWGL 52 (235) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 166 >PRK11176 lipid transporter ATP-binding/permease protein; Provisional Probab=30.89 E-value=16 Score=10.74 Aligned_cols=44 Identities=14% Similarity=0.057 Sum_probs=23.8 Q ss_pred CCCCCCCHHHHHHHHH--CCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7556779898999854--18732279999999999999999999983 Q gi|254781029|r 78 NDLDNGGITHILQNIG--KNSLNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 78 ~~d~r~~~~~~~~~l~--~~~s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) || ..=+....++.|= -+|-+..++++.+++++++++++++.+.+ T Consensus 2 ~~-~~~s~~~~~rrL~~~lkpyk~~l~~~~l~~il~~~~~~~~p~l~ 47 (581) T PRK11176 2 ND-KDLSTWQTFRRLWPTIAPFKAGLIVAGVALILNAASDTFMLSLL 47 (581) T ss_pred CC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98-86777999999999999999999999999999999999999999 No 167 >cd03498 SQR_TypeB_2_TM Succinate:quinone oxidoreductase (SQR)-like Type B subfamily 2, transmembrane subunit; composed of proteins with similarity to the SQRs of Geobacter metallireducens and Corynebacterium glutamicum. SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol. C. glutamicum SQR reduces low potential quinones such as menaquinone. SQR is also called succinate dehydrogenase (Sdh) or Complex II and is part of the citric acid cycle and the aerobic respiratory chain. SQR is composed of a flavoprotein catalytic subunit, an iron-sulfur protein and one or two hydrophobic transmembrane subunits. Members of this subfamily are classified as Type B as they contain one transmembrane subunit and two heme groups. The heme and quinone binding sites reside in the transmembrane subunit. The transmembrane subunit of members of this subfamily is also called Sdh cytochrome b558 subunit based on the Bacillus subtilis protein. The structural arrangem Probab=30.83 E-value=16 Score=10.74 Aligned_cols=76 Identities=8% Similarity=0.025 Sum_probs=30.7 Q ss_pred CCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC---HH---HHHHHHHHHHCHHHHHHHHHHHHHH Q ss_conf 75567798989998541873227999999999999999999998316---21---4555765640648899999999999 Q gi|254781029|r 78 NDLDNGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYT---NS---PMDSFYDFAMRPEVITLFLVVSVVP 151 (1828) Q Consensus 78 ~~d~r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~~~~~~p 151 (1828) .++.+..+..+..+++.-.-...|.++.++-+.=+.=|++=++..+| +. .+..++.+++ +.+++|-+.+| T Consensus 127 ~~~~~d~y~~v~~~F~~p~~~~~Yvvam~~L~~HL~HG~~Sa~qTlG~~~~~~~~~~~~i~~~~a----~~i~~GF~~iP 202 (209) T cd03498 127 GVGRGDVYELLVASFQQPWIAALYVLAMVALGLHLSHGFWSAFQTLGLNNPRYRPALKAVGRVVA----ILIAGGFISIP 202 (209) T ss_pred CCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHH T ss_conf 66501599999999516499999999999999999989999999827998135899999999999----99999999999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781029|r 152 ILLFFS 157 (1828) Q Consensus 152 ~~~~~~ 157 (1828) +.++++ T Consensus 203 i~i~~g 208 (209) T cd03498 203 LLILFG 208 (209) T ss_pred HHHHCC T ss_conf 999818 No 168 >TIGR00886 2A0108 nitrite transporter; InterPro: IPR004737 This family represents nitrate/nitrite antiporters in bacteria, fungi, algae and marine diatoms. Nitrate transporters in Escherichia coli are involved in excretion of nitrite produced by the dissimilatory reduction of nitrate. NarK is polytopic membrane protein with 12 transmembrane domains which is involved in nitrate uptake and nitrite excretion and is thought to function as a nitrate/nitrite antiporter. At low concentrations of nitrate, NarK mediates the electrogenic excretion of nitrite rather than nitrate/nitrite exchange. This process prevents intracellular accumulation of toxic levels of nitrite and allows further detoxification in the periplasm through the action of nitrite reductase . In the opportunistic fungal pathogen Aspergillus fumigatus the nitrate transporter crnA clusters with nitrite reductase (niiA) and nitrate reductase (niaD). NiaD, niiA and crnA are induced by nitrate and repressed by ammonium at the transcriptional level . In Hebeloma cylindrosporum the nitrate transporter polypeptide (NRT2) is characterised by 12 transmembrane domains and presents both a long putative intracellular loop and a short C-terminal tail, two structural features which distinguish fungal high-affinity transporters from their plant homologues. Transcription is repressed by ammonium and strongly stimulated by nitrate but also by organic nitrogen or nitrogen deficiency. ; GO: 0015103 inorganic anion transmembrane transporter activity, 0015698 inorganic anion transport, 0016021 integral to membrane. Probab=30.64 E-value=17 Score=10.71 Aligned_cols=75 Identities=12% Similarity=0.033 Sum_probs=48.7 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH------CCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 985418732279999999999999999999983------16214555765640648899999999999999999999999 Q gi|254781029|r 90 QNIGKNSLNIALRNAFMSSLIWLACELGFLIKF------YTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 90 ~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) -.|=+.-.++||+.-.++.++--.+|+.++... ||+..++..-++.+.-|=+|+.+..+|.| .+|+.+...+. T Consensus 86 ~~~~~~~p~~s~~~L~~~rlf~Gi~Gg~Fa~~~~~~s~fFp~~~~GtA~gl~aG~GN~G~~~~~~~~P-a~f~i~~~~~~ 164 (472) T TIGR00886 86 AGLAVQDPSTSYSVLLLLRLFIGIAGGSFASCMPWISFFFPKKIQGTALGLAAGWGNMGGGVAQFVMP-ALFAIIASLIK 164 (472) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCHHHHHHHHHHHHHH-HHHHHHHHHHC T ss_conf 98412689989999999999996755468876656630688220146758861432678999999999-99999998622 Q ss_pred HH Q ss_conf 99 Q gi|254781029|r 164 RA 165 (1828) Q Consensus 164 r~ 165 (1828) +. T Consensus 165 ~~ 166 (472) T TIGR00886 165 GA 166 (472) T ss_pred CC T ss_conf 11 No 169 >pfam10762 DUF2583 Protein of unknown function (DUF2583). Some members in this family of proteins are annotated as YchH however currently no function is known. Probab=30.58 E-value=17 Score=10.71 Aligned_cols=22 Identities=14% Similarity=0.277 Sum_probs=13.4 Q ss_pred HHHHHHHHHHCHHHHHHHHHHH Q ss_conf 4555765640648899999999 Q gi|254781029|r 127 PMDSFYDFAMRPEVITLFLVVS 148 (1828) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~ 148 (1828) +.-.+.+|++-..++++|+|++ T Consensus 34 ~~L~lp~~~~~gal~~IFiGAl 55 (89) T pfam10762 34 PQFNLPQFFAHGALLSIFVGAL 55 (89) T ss_pred CCCCCHHHHHHHHHHHHHHHHH T ss_conf 2479719998648999999999 No 170 >PRK10847 hypothetical protein; Provisional Probab=30.55 E-value=17 Score=10.70 Aligned_cols=68 Identities=15% Similarity=0.177 Sum_probs=31.9 Q ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 9898999854187322799999999999999999999831621455576564064889999999999999999 Q gi|254781029|r 84 GITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFF 156 (1828) Q Consensus 84 ~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 156 (1828) +...++.-.-+=|-+.=..+.++.+++|+.+-+++.+.+ | .+....+.+. +-.+++ +.+.++|+++-| T Consensus 140 t~vp~~AG~~~M~~~rF~~~n~lGa~lW~~~~~~~Gy~~-G--~~p~v~~~~~-~iil~i-i~~s~~p~~~~~ 207 (219) T PRK10847 140 TFAPFVAGMGHMSYRHFAAYNVIGALLWVLLFTYAGYFF-G--TLPFVQDNLK-LLIVGI-IVVSILPGVIEI 207 (219) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-C--CCHHHHHHHH-HHHHHH-HHHHHHHHHHHH T ss_conf 675487560589889999999999999999999999998-6--8889998799-999999-999999999999 No 171 >pfam07298 NnrU NnrU protein. This family consists of several plant and bacterial NnrU proteins. NnrU is thought to be involved in the reduction of nitric oxide. The exact function of NnrU is unclear. It is thought however that NnrU and perhaps NnrT are required for expression of both nirK and nor. Probab=30.38 E-value=17 Score=10.68 Aligned_cols=33 Identities=9% Similarity=0.198 Sum_probs=23.7 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899985418732279999999999999999999 Q gi|254781029|r 87 HILQNIGKNSLNIALRNAFMSSLIWLACELGFL 119 (1828) Q Consensus 87 ~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~ 119 (1828) .+...+-.++=+..|-+..+++++|+.-|-+.+ T Consensus 24 ~l~~~lG~~~yk~~ysl~sl~gl~lii~g~~~a 56 (184) T pfam07298 24 RLIARLGERGYKGLYSLLSLAGLVLIIIGYGAA 56 (184) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999984740066899999999999999999978 No 172 >COG3686 Predicted membrane protein [Function unknown] Probab=30.37 E-value=17 Score=10.68 Aligned_cols=51 Identities=12% Similarity=0.139 Sum_probs=32.0 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHCHHHHHH Q ss_conf 9985418732279999999999999999999983162-145557656406488999 Q gi|254781029|r 89 LQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTN-SPMDSFYDFAMRPEVITL 143 (1828) Q Consensus 89 ~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 143 (1828) ..-|.++||...- ...++|+..-+.+.+.+..+ ..++|+-=+.++-+..++ T Consensus 68 ~a~l~G~~~~~~~----~~A~l~i~~RllYi~~Yla~~p~lRSlvW~~gmvg~~~l 119 (125) T COG3686 68 VAHLTGNPSQAAN----TLAILFIAARLLYIPCYLADIPYLRSLVWLGGMVGTGAL 119 (125) T ss_pred HHHHCCCHHHHHH----HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 9998388268899----999999999999999999486999999999999999999 No 173 >TIGR01987 HI0074 nucleotidyltransferase substrate binding protein, HI0074 family; InterPro: IPR010235 The member of this family from Haemophilus influenzae, HI0074, has been shown by crystal structure to resemble nucleotidyltransferase substrate binding proteins . It forms a complex with HI0073 (P43933 from SWISSPROT), encoded by the adjacent gene, which contains a nucleotidyltransferase nucleotide binding domain (IPR002934 from INTERPRO). Double- and single-stranded DNA binding assays showed no evidence of DNA binding to HI0074 or to HI0073/HI0074 complex despite the suggestive shape of the putative binding cleft formed by the HI0074 dimer . . Probab=30.31 E-value=17 Score=10.67 Aligned_cols=10 Identities=40% Similarity=1.032 Sum_probs=3.6 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254781029|r 1784 YISNFEEMLS 1793 (1828) Q Consensus 1784 y~~~fe~~l~ 1793 (1828) ||.+|++|+. T Consensus 117 Yv~E~~~l~~ 126 (130) T TIGR01987 117 YVPEFEKLLE 126 (130) T ss_pred CCHHHHHHHH T ss_conf 0267999987 No 174 >PRK09586 murP N-acetylmuramic acid phosphotransfer permease; Reviewed Probab=30.02 E-value=17 Score=10.63 Aligned_cols=14 Identities=21% Similarity=0.342 Sum_probs=5.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 27999999999999 Q gi|254781029|r 99 IALRNAFMSSLIWL 112 (1828) Q Consensus 99 ~~~~~~~~~s~~w~ 112 (1828) +|+.-+.+.++++. T Consensus 122 ~PiIP~liaaGllk 135 (474) T PRK09586 122 TPLIPGFIAAGLLL 135 (474) T ss_pred HHHHHHHHHHHHHH T ss_conf 60499999999999 No 175 >pfam04632 FUSC Fusaric acid resistance protein family. This family includes a conserved region found in two proteins associated with fusaric acid resistance from Burkholderia cepacia and Klebsiella oxytoca. These proteins are likely to be membrane transporter proteins. Probab=29.96 E-value=17 Score=10.63 Aligned_cols=13 Identities=23% Similarity=0.695 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q gi|254781029|r 150 VPILLFFSFFIMI 162 (1828) Q Consensus 150 ~p~~~~~~~a~~~ 162 (1828) -|++|+.++|.++ T Consensus 73 ~P~L~~~~lalWi 85 (649) T pfam04632 73 APLLFLLALALWI 85 (649) T ss_pred CHHHHHHHHHHHH T ss_conf 7799999999999 No 176 >TIGR02745 ccoG_rdxA_fixG cytochrome c oxidase accessory protein CcoG; InterPro: IPR014116 Member of this ferredoxin-like protein family, include FixG, CcoG and RdxA,. They are found exclusively in species with an operon encoding the cbb3 type of cytochrome c oxidase (cco-cbb3), and near the cco-cbb3 operon in about half the cases. The cco-cbb3 is found in a variety of proteobacteria and almost nowhere else; it is associated with oxygen use under microaerobic conditions. Some (but not all) of these proteobacteria are also nitrogen-fixing, hence the gene symbol fixG. FixG was shown essential for functional cco-cbb3 expression in Bradyrhizobium japonicum.. Probab=29.76 E-value=17 Score=10.60 Aligned_cols=69 Identities=16% Similarity=0.098 Sum_probs=38.1 Q ss_pred HHHCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHCCH-H-HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 854187--32279999999999999999999983162-1-4555765640648899999999999999999999 Q gi|254781029|r 91 NIGKNS--LNIALRNAFMSSLIWLACELGFLIKFYTN-S-PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFI 160 (1828) Q Consensus 91 ~l~~~~--s~~~~~~~~~~s~~w~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~ 160 (1828) .|+..| |-..+|=.++-=++|++.|++.++.+..- . .-.-+.+++++|..-..++.+ ++=-+.+|+||. T Consensus 113 kl~~~p~~~~~~~~kk~lKh~~w~~i~~~tg~~f~~YF~p~~~L~~~~~~~~a~~~~~f~~-~~~t~~ty~fag 185 (474) T TIGR02745 113 KLDKAPKWTFDKVWKKTLKHLIWLVISLVTGGTFIFYFVPAPELFAYLFTGPADHIAYFWV-LFFTAATYIFAG 185 (474) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCHHHHHHH-HHHHHHHHHHHH T ss_conf 1022435898899999999999999999999878735058589999984478535899999-999999999889 No 177 >pfam02392 Ycf4 Ycf4. This family consists of hypothetical Ycf4 proteins from various chloroplast genomes. It has been suggested that Ycf4 is involved in the assembly and/or stability of the photosystem I complex in chloroplasts. Probab=29.59 E-value=17 Score=10.58 Aligned_cols=45 Identities=18% Similarity=0.123 Sum_probs=22.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC----CHH--HHHHHHHHHHCHHHHHHHHHHH Q ss_conf 799999999999999999999831----621--4555765640648899999999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFY----TNS--PMDSFYDFAMRPEVITLFLVVS 148 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~ 148 (1828) .||||.++++ ||+|+...-+ +-. +.....++.--|+++.+.+=++ T Consensus 18 Ny~wA~i~~~----G~lGFll~g~SSYl~~dLlpf~~~~~i~FiPQGlvM~FYGi 68 (180) T pfam02392 18 NYFWASILLL----GGLGFLLVGISSYLGKNLLPFLDPSQIIFIPQGLVMCFYGI 68 (180) T ss_pred HHHHHHHHHH----HHHHHHHHHHHHHHCCCEECCCCHHHCCCCCCCHHHHHHHH T ss_conf 8899999987----55999998878975767231478411834064176656889 No 178 >PRK10692 hypothetical protein; Provisional Probab=29.57 E-value=17 Score=10.58 Aligned_cols=22 Identities=14% Similarity=0.297 Sum_probs=13.6 Q ss_pred HHHHHHHHHHCHHHHHHHHHHH Q ss_conf 4555765640648899999999 Q gi|254781029|r 127 PMDSFYDFAMRPEVITLFLVVS 148 (1828) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~ 148 (1828) +.-++.+|++-..++++|+|++ T Consensus 34 ~~l~lP~f~~~~al~~IFiGAl 55 (92) T PRK10692 34 PQFNLPQFFAHGAILSIFVGAL 55 (92) T ss_pred CCCCCHHHHHHHHHHHHHHHHH T ss_conf 5678608998757999999999 No 179 >PRK13499 rhamnose-proton symporter; Provisional Probab=29.55 E-value=17 Score=10.58 Aligned_cols=28 Identities=18% Similarity=0.343 Sum_probs=17.8 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 54187322799999999999999999999 Q gi|254781029|r 92 IGKNSLNIALRNAFMSSLIWLACELGFLI 120 (1828) Q Consensus 92 l~~~~s~~~~~~~~~~s~~w~~~~~~~~~ 120 (1828) +..-|+. ..+++.++.++|-.||+.+.. T Consensus 65 ~~~~~~~-~l~~~~lfG~lWGiGgi~fGl 92 (345) T PRK13499 65 YSSFSLS-TLLPVFLFGAMWGIGNINYGL 92 (345) T ss_pred HHCCCHH-HHHHHHHHHHHHCCCCCHHHH T ss_conf 8609843-899999999997245202420 No 180 >TIGR00769 AAA ADP/ATP carrier protein family; InterPro: IPR004667 These proteins are members of the ATP:ADP Antiporter (AAA) family, which consists of nucleotide transporters that have 12 GES predicted transmembrane regions. One protein from Rickettsia prowazekii functions to take up ATP from the eukaryotic cell cytoplasm into the bacterium in exchange for ADP. Five AAA family paralogues are encoded within the genome of R. prowazekii. This organism transports UMP and GMP but not CMP, and it seems likely that one or more of the AAA family paralogues are responsible. The genome of Chlamydia trachomatis encodes two AAA family members, Npt1 and Npt2, which catalyse ATP/ADP exchange and GTP, CTP, ATP and UTP uptake probably employing a proton symport mechanism. Two homologous adenylate translocators of Arabidopsis thaliana are postulated to be localized to the intracellular plastid membrane where they function as ATP importers.; GO: 0005471 ATP:ADP antiporter activity, 0005524 ATP binding, 0006810 transport, 0016021 integral to membrane. Probab=29.32 E-value=17 Score=10.55 Aligned_cols=37 Identities=11% Similarity=0.209 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHCCHHHHH-HHHHHHHCHHHHHHHHH Q ss_conf 99999999999831621455-57656406488999999 Q gi|254781029|r 110 IWLACELGFLIKFYTNSPMD-SFYDFAMRPEVITLFLV 146 (1828) Q Consensus 110 ~w~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 146 (1828) +-+.|-++++..+|+..... -.+-|.+.|..+|+++| T Consensus 359 ~L~TG~~FFsli~f~~~~~~~l~a~LG~TPL~lAv~vG 396 (491) T TIGR00769 359 LLLTGVAFFSLILFGKPAAPILVAVLGMTPLLLAVYVG 396 (491) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHHHH T ss_conf 99999999999983550357889887311899999984 No 181 >pfam11242 DUF2774 Protein of unknown function (DUF2774). This is a viral family of proteins with unknown function. Probab=29.10 E-value=17 Score=10.52 Aligned_cols=35 Identities=26% Similarity=0.412 Sum_probs=17.4 Q ss_pred HHHHHHHHCCCHH--HHHH-------HHHHHCCCCCHHHHHHHH Q ss_conf 9999987180138--9999-------997507456206774125 Q gi|254781029|r 1724 VENISKFIDYDAF--VQLW-------KSYTLGEDDIFSKRLYTI 1758 (1828) Q Consensus 1724 s~di~~~~~~~~~--~~~w-------~~y~~g~~~~f~~~~~~~ 1758 (1828) .++|||++|-.+. ..+| ..|..-|.=||-+|+..+ T Consensus 16 FveIAk~~~i~a~dva~~w~~ve~Ar~~f~~rEkVVYRKr~iN~ 59 (63) T pfam11242 16 FVEIAKKYDIRAKDVAKLWVMVETAREKFKTREKVVYRKRHINK 59 (63) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCEEHHHHHCCC T ss_conf 99999995988788999999999999987501764302331562 No 182 >COG1079 Uncharacterized ABC-type transport system, permease component [General function prediction only] Probab=29.08 E-value=18 Score=10.51 Aligned_cols=34 Identities=12% Similarity=-0.099 Sum_probs=24.4 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 4187322799999999999999999999831621 Q gi|254781029|r 93 GKNSLNIALRNAFMSSLIWLACELGFLIKFYTNS 126 (1828) Q Consensus 93 ~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~ 126 (1828) .+-+....=|.|.++|++-++.|+++...-+.+. T Consensus 182 ~Gi~V~r~R~~av~~sG~laGlaGa~Lsl~~~~~ 215 (304) T COG1079 182 MGINVYRYRYLAVLVSGALAGLAGAYLSLAYSGQ 215 (304) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHEEEEECCC T ss_conf 7987567889999999999875352135550456 No 183 >pfam06532 DUF1109 Protein of unknown function (DUF1109). This family consists of several hypothetical bacterial proteins of unknown function. Probab=29.06 E-value=18 Score=10.51 Aligned_cols=90 Identities=17% Similarity=0.170 Sum_probs=45.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------- Q ss_conf 873227999999999999999999998316214555765640648899999999999999999999999999-------- Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRAR-------- 166 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~-------- 166 (1828) ++++...|.+..+.++|+++.+.+... +++... ..+.+..... .+....++=+..|-.+-.+.||+. T Consensus 78 ~~~~~~~~l~~~~~~~~~~~~~~~~~~--~~~~~~--~~~~g~~~~~-C~~~I~llslp~~~a~~~~lR~~APt~p~~aG 152 (204) T pfam06532 78 SRGRGWLLLAAPVALLWLAALLELALA--PPGARA--ALLLGHTWLV-CLLAIPLLSLPPLAALLWALRRGAPTRPALAG 152 (204) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHC--CHHHHH--HHHCCCCHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 865017999999999999999999828--858764--4213754658-99999999999999999999766889837899 Q ss_pred -HHHHHHHHHHHHHHHHCCHHHHH Q ss_conf -99999999999999730925556 Q gi|254781029|r 167 -DMHDASQSIAGIALRLIDPEEYS 189 (1828) Q Consensus 167 -~~~~~a~~m~~~a~rl~~p~~~a 189 (1828) =--+.|-+++-+++.|--||+-. T Consensus 153 a~aGLaaga~~a~~y~lhCped~~ 176 (204) T pfam06532 153 AAAGLAAGALAAVAYALHCPEDSP 176 (204) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999999999999999814877660 No 184 >COG3242 Uncharacterized protein conserved in bacteria [Function unknown] Probab=29.05 E-value=18 Score=10.51 Aligned_cols=30 Identities=10% Similarity=-0.007 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHH Q ss_conf 99999999999998316214555765640648 Q gi|254781029|r 108 SLIWLACELGFLIKFYTNSPMDSFYDFAMRPE 139 (1828) Q Consensus 108 s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (1828) +++-+.- |+...+||..+-.-..++.-+|. T Consensus 10 ~LVlVlE--GL~p~l~P~~wkk~~~~l~~lpd 39 (62) T COG3242 10 GLVLVLE--GLGPFLAPRAWKKMATALLELPD 39 (62) T ss_pred HHHHHHH--CCCCCCCHHHHHHHHHHHHHCCH T ss_conf 9999996--10100385999999999987878 No 185 >PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional Probab=29.04 E-value=18 Score=10.51 Aligned_cols=19 Identities=21% Similarity=0.537 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999 Q gi|254781029|r 148 SVVPILLFFSFFIMISRAR 166 (1828) Q Consensus 148 ~~~p~~~~~~~a~~~~r~~ 166 (1828) -++|++|+.+.+.++||.. T Consensus 105 W~~P~i~l~~g~~~~~r~~ 123 (126) T PRK10144 105 WALPVVLLLLMALILWRVR 123 (126) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999998542 No 186 >pfam00122 E1-E2_ATPase E1-E2 ATPase. Probab=28.93 E-value=18 Score=10.50 Aligned_cols=77 Identities=14% Similarity=0.113 Sum_probs=35.2 Q ss_pred CCCHHHHHHHHHC-CCCCHHH---------HHHHHHHHHHHHHHHHHHHHHCCHHHH----HHHHH-HHHCHHHHHHHHH Q ss_conf 7798989998541-8732279---------999999999999999999983162145----55765-6406488999999 Q gi|254781029|r 82 NGGITHILQNIGK-NSLNIAL---------RNAFMSSLIWLACELGFLIKFYTNSPM----DSFYD-FAMRPEVITLFLV 146 (1828) Q Consensus 82 r~~~~~~~~~l~~-~~s~~~~---------~~~~~~s~~w~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~ 146 (1828) ...+|.+.+.++. ++.++|+ |++.++-++. ...++.|.+++.... .++.= ..+-|..|++ T Consensus 120 ~t~~~~i~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~--~~~~~~~~~~~~~~~~~~~~~lsvlv~~~P~aL~l--- 194 (222) T pfam00122 120 DTELGKIARLVEEAKSAKTPLQRLLDKLAKILVPIVLALA--ILVFLIWFFRGGPLLEALLRALAVLVAACPEALPL--- 194 (222) T ss_pred CCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHH--- T ss_conf 2488899985653347899699999999999999999999--99999999936879999999999899545565899--- Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999 Q gi|254781029|r 147 VSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 147 ~~~~p~~~~~~~a~~~~r~ 165 (1828) ++|+.+.++...|.+++ T Consensus 195 --a~~~~l~~~~~~~~k~g 211 (222) T pfam00122 195 --AVPLALAVGAGRLAKKG 211 (222) T ss_pred --HHHHHHHHHHHHHHHCC T ss_conf --99999999999999899 No 187 >TIGR03638 cas1_ECOLI CRISPR-associated protein Cas1, ECOLI subtype. The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes the Cas1 protein particular to the ECOLI subtype of CRISPR/Cas system. Probab=28.92 E-value=7.7 Score=13.52 Aligned_cols=13 Identities=15% Similarity=0.358 Sum_probs=5.5 Q ss_pred HHHHHHHHHHHHCCH Q ss_conf 999999999973092 Q gi|254781029|r 171 ASQSIAGIALRLIDP 185 (1828) Q Consensus 171 ~a~~m~~~a~rl~~p 185 (1828) +||.| .++||-+| T Consensus 111 vAr~m--y~~RF~~~ 123 (269) T TIGR03638 111 VARRM--YAMRFGEE 123 (269) T ss_pred HHHHH--HHHHCCCC T ss_conf 99999--99866997 No 188 >pfam06781 UPF0233 Uncharacterized protein family (UPF0233). Probab=28.83 E-value=18 Score=10.48 Aligned_cols=29 Identities=7% Similarity=-0.041 Sum_probs=21.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 73227999999999999999999998316 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYT 124 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~ 124 (1828) .-.+|-|++.++-++++.|-+++...+.. T Consensus 26 ~~~sp~W~~p~m~~lm~iGL~Wivv~Yls 54 (87) T pfam06781 26 KKPSPVWFVPLMVGLMLLGLLWLVVYYLS 54 (87) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89998619999999999999999999861 No 189 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=28.82 E-value=18 Score=10.48 Aligned_cols=17 Identities=12% Similarity=0.510 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 149 VVPILLFFSFFIMISRA 165 (1828) Q Consensus 149 ~~p~~~~~~~a~~~~r~ 165 (1828) |+=+++||++-.+++|. T Consensus 49 v~~~~~l~~l~~ll~~~ 65 (398) T PRK10747 49 ILAMVVLFAIEWLLRRI 65 (398) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999999 No 190 >KOG1923 consensus Probab=28.78 E-value=18 Score=10.48 Aligned_cols=77 Identities=10% Similarity=0.031 Sum_probs=45.3 Q ss_pred CCCHHHHHHHHHCCCCCH-HHHHHHHHHHHHHHHHHHHHHHHCCH-H-------HHHHHHHHHHCHHHHHHHHHHHHHHH Q ss_conf 779898999854187322-79999999999999999999983162-1-------45557656406488999999999999 Q gi|254781029|r 82 NGGITHILQNIGKNSLNI-ALRNAFMSSLIWLACELGFLIKFYTN-S-------PMDSFYDFAMRPEVITLFLVVSVVPI 152 (1828) Q Consensus 82 r~~~~~~~~~l~~~~s~~-~~~~~~~~s~~w~~~~~~~~~~~~~~-~-------~~~~~~~~~~~~~~~~~~~~~~~~p~ 152 (1828) +..|..+-.++....+++ ...+-..+|+.|+.||..++|..|+. + ....+..++..|..+-+=+ .+.-+ T Consensus 18 p~A~~~ia~S~~~~n~rtktsvl~~lasic~v~gg~E~~~~a~~~f~~v~qe~~rf~~lm~~~s~~~~fn~df--~~a~l 95 (830) T KOG1923 18 PPAIACIANSSKHTNLRTKTSVLGSLASICYVIGGSEQSWPAAPVFSKVNQEPRRFNTLMYVHSLPPFFNADF--SAAKL 95 (830) T ss_pred CHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCEEECCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCHHH--HHHHH T ss_conf 3377776675226785417789999999999855723431347544666347644563332214730017078--99998 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254781029|r 153 LLFFSFFI 160 (1828) Q Consensus 153 ~~~~~~a~ 160 (1828) =||++|.. T Consensus 96 Qf~~ifv~ 103 (830) T KOG1923 96 QFYDVFVG 103 (830) T ss_pred HHHHHHHC T ss_conf 99999834 No 191 >COG4325 Predicted membrane protein [Function unknown] Probab=28.55 E-value=18 Score=10.45 Aligned_cols=25 Identities=12% Similarity=-0.048 Sum_probs=13.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 18732279999999999999999999983 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) -++++. ||+-.++ |+..++++.+.+ T Consensus 32 d~l~~~-~WvipA~---~vv~al~fgf~L 56 (464) T COG4325 32 DYLQGA-VWVIPAF---GVVIALGFGFVL 56 (464) T ss_pred HHHCCC-EEEEHHH---HHHHHHHHHHHH T ss_conf 764276-1662578---999999999843 No 192 >pfam04956 TrbC TrbC/VIRB2 family. Conjugal transfer protein, TrbC has been identified as a subunit of the pilus precursor in bacteria. The protein undergoes three processing steps before gaining its mature cyclic structure. This family also contains several VIRB2 type IV secretion proteins. The virB2 gene encodes a putative type IV secretion system and is known to be a pathogenicity factor in Bartonella species. Probab=28.52 E-value=18 Score=10.44 Aligned_cols=75 Identities=9% Similarity=0.074 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC---H--HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7999999999999999999998316---2--1455576564064889999999999999999999999999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFYT---N--SPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQS 174 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~~---~--~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~ 174 (1828) ....+..+.+++......-+++.-. + ..+.++.++...|...+++ ..+++=+-+.|.| +..++|-...- T Consensus 9 ~~~~~~~~~~~~~~l~~~pA~A~~~~~~~~~~~l~~i~~~l~gpi~~~ia-~iaii~~G~~~~f-----g~~~~~~~~~i 82 (99) T pfam04956 9 RRLALALALAALLLLLASPALAQGSGLDPVETVLQNIVDFLTGPLARVIA-TIAIIIAGIMAAF-----GRLSWRTLVMV 82 (99) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH-HHHHHHHHHHHHH-----CCCCHHHHHHH T ss_conf 99999999999999966877773279996899999999998471899999-9999999999998-----34889999999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781029|r 175 IAGIAL 180 (1828) Q Consensus 175 m~~~a~ 180 (1828) .--+++ T Consensus 83 viGi~i 88 (99) T pfam04956 83 VIGIVI 88 (99) T ss_pred HHHHHH T ss_conf 989974 No 193 >TIGR01667 YCCS_YHJK integral membrane protein, YccS/YhfK family; InterPro: IPR010020 This entry identifies a clade of sequences from gamma and beta proteobacteria. These proteins are more than 700 amino acids long and many have been annotated as putative membrane proteins. The gene from Salmonella has been annotated as a putative efflux transporter. The gene from Escherichia coli has been annotated as yccS. The YccS hypothetical equivalog is found in beta and gamma proteobacteria, while the smaller YhfK group is only found in E. coli, Salmonella and Yersinia. . Probab=28.39 E-value=18 Score=10.43 Aligned_cols=92 Identities=14% Similarity=0.153 Sum_probs=40.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH-HHHHHHHHH-------------HHCHHHHHHHH-HHHHHHHHHHHHH Q ss_conf 18732279999999999999999999983162-145557656-------------40648899999-9999999999999 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKFYTN-SPMDSFYDF-------------AMRPEVITLFL-VVSVVPILLFFSF 158 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~-~~~~~~~~~-------------~~~~~~~~~~~-~~~~~p~~~~~~~ 158 (1828) ..|-.-|+-.|...+++-+.|++|=-|+...+ +.+-++-++ +-.|..| ++ |++. =-+|=.+ T Consensus 79 ~~p~~~~~llt~lt~~~~l~g~lG~rYA~ia~asLl~AiYT~~~~~~rGG~~~~w~~~Plll--~AlGt~w--Y~ll~~~ 154 (724) T TIGR01667 79 PKPWLFVVLLTLLTFGFILLGALGQRYATIAFASLLVAIYTMLGAGQRGGEVEVWFIEPLLL--LALGTLW--YGLLTLI 154 (724) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHH--HHHHHHH--HHHHHHH T ss_conf 66068999999999999999736889999999999999999961477887120777405999--9999999--9999999 Q ss_pred HHHHHHHHHHH-HHHHHHHHHH------HHHCCHHHHH Q ss_conf 99999999999-9999999999------9730925556 Q gi|254781029|r 159 FIMISRARDMH-DASQSIAGIA------LRLIDPEEYS 189 (1828) Q Consensus 159 a~~~~r~~~~~-~~a~~m~~~a------~rl~~p~~~a 189 (1828) ..++|+-|=+| ++||.-.+.| .+|-+|.... T Consensus 155 ~~~lF~~~Plq~sls~~Y~~La~Yl~~Ks~lfdP~~~~ 192 (724) T TIGR01667 155 WFLLFPNRPLQESLSRLYRELAEYLELKSSLFDPDQET 192 (724) T ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 99880799736789999999999998876524873001 No 194 >TIGR02005 PTS-IIBC-alpha PTS system, alpha-glucoside-specific IIBC component; InterPro: IPR010975 This entry represents the fused PTS enzyme II B and C domains. A gene from Clostridium has been partially characterised as a maltose transporter, while genes from Fusobacterium and Klebsiella , have been proposed to transport the five non-standard isomers of sucrose.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=28.27 E-value=18 Score=10.41 Aligned_cols=10 Identities=20% Similarity=0.501 Sum_probs=3.4 Q ss_pred HHHHHHHHHC Q ss_conf 9999999831 Q gi|254781029|r 114 CELGFLIKFY 123 (1828) Q Consensus 114 ~~~~~~~~~~ 123 (1828) ||+|++.+.| T Consensus 290 Gc~GIALAmY 299 (533) T TIGR02005 290 GCVGIALAMY 299 (533) T ss_pred HHHHHHHHHH T ss_conf 6789999763 No 195 >PRK10209 acid-resistance membrane protein; Provisional Probab=28.08 E-value=18 Score=10.38 Aligned_cols=15 Identities=13% Similarity=0.452 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 799999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLAC 114 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~ 114 (1828) +.++..+..++|+.+ T Consensus 78 ~~~~~~L~gi~yl~~ 92 (190) T PRK10209 78 PMLSGFLVAVAYLVI 92 (190) T ss_pred HHHHHHHHHHHHHHH T ss_conf 899999999999999 No 196 >KOG2563 consensus Probab=27.93 E-value=18 Score=10.36 Aligned_cols=43 Identities=14% Similarity=0.091 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHHHHHHHHH-----HHHHHH--CCH---HHHHHHHHHHHHHHH Q ss_conf 99999999999999999999-----999973--092---555699999999999 Q gi|254781029|r 157 SFFIMISRARDMHDASQSIA-----GIALRL--IDP---EEYSSEKMQSISSAV 200 (1828) Q Consensus 157 ~~a~~~~r~~~~~~~a~~m~-----~~a~rl--~~p---~~~a~~~~~~~~qa~ 200 (1828) +.||++=.-.-+|....-=+ -..||+ +-| ...-- .+..+||+| T Consensus 97 f~~mw~ldk~GLR~a~llgt~ln~iGa~Ir~iss~p~v~~~f~~-~l~~~Gq~i 149 (480) T KOG2563 97 FAAMWILDKFGLRTALLLGTVLNGIGAWIRLISSLPFVPPLFRR-PLTHTGQSI 149 (480) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCHH-HHHHHHHHH T ss_conf 77887620131488999999999998987410157555642113-556776889 No 197 >PRK05760 F0F1 ATP synthase subunit I; Validated Probab=27.91 E-value=18 Score=10.36 Aligned_cols=27 Identities=22% Similarity=0.140 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999 Q gi|254781029|r 149 VVPILLFFSFFIMISRARDMHDASQSI 175 (1828) Q Consensus 149 ~~p~~~~~~~a~~~~r~~~~~~~a~~m 175 (1828) ++|=++|=..|-...-++--+.+.+|| T Consensus 45 ~iPn~~Fa~~aFr~~GAr~a~~vv~sf 71 (124) T PRK05760 45 VLANAYFAHKAFRFSGARAAKAIVRSF 71 (124) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 994899999999875031489999999 No 198 >pfam04901 RAMP Receptor activity modifying family. The calcitonin-receptor-like receptor can function as either a calcitonin-gene-related peptide or an adrenomedullin receptor. The receptors function is modified by receptor-activity-modifying protein or RAMP. RAMPs are single-transmembrane-domain proteins. Probab=27.83 E-value=18 Score=10.35 Aligned_cols=28 Identities=32% Similarity=0.527 Sum_probs=18.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q ss_conf 6488999999999999999999-99999999 Q gi|254781029|r 137 RPEVITLFLVVSVVPILLFFSF-FIMISRAR 166 (1828) Q Consensus 137 ~~~~~~~~~~~~~~p~~~~~~~-a~~~~r~~ 166 (1828) -|..| ....|++||++.-++ +.++||+. T Consensus 81 PP~~i--L~~~I~vPI~lt~~m~~LVVWrSK 109 (115) T pfam04901 81 PPEDI--LLPFIIVPICLTLLMTALVVWRSK 109 (115) T ss_pred CCHHH--HHHHHHHHHHHHHHHHHEEEEECC T ss_conf 96888--899999999999998711566426 No 199 >pfam07300 consensus Probab=27.71 E-value=18 Score=10.34 Aligned_cols=22 Identities=14% Similarity=0.202 Sum_probs=15.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH Q ss_conf 8732279999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACEL 116 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~ 116 (1828) -+|+..|.++.++-++|++.|. T Consensus 12 ~g~p~~F~~a~~~v~~W~~sgp 33 (111) T pfam07300 12 AGRPVAFVLAVALVVVWAVSGP 33 (111) T ss_pred HCCCHHHHHHHHHHHHHHHHHH T ss_conf 6891899999999999999986 No 200 >pfam08627 CRT-like CRT-like. This region is found in proteins related to Plasmodium falciparum chloroquine resistance transporter (CRT). Probab=27.48 E-value=19 Score=10.31 Aligned_cols=68 Identities=12% Similarity=0.096 Sum_probs=39.1 Q ss_pred CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 7798989998541873227999999999999999999998316214555765640648899999999999999 Q gi|254781029|r 82 NGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILL 154 (1828) Q Consensus 82 r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 154 (1828) .+++.+-+++.-+.+...-.....+.+++++.-|+--..++------...-+||.+...-.+ .+||++ T Consensus 36 k~s~~~kfksf~k~s~~ket~tiliyvvlyiisgvinsvllkk~mn~f~nygFFL~QLtnyg-----yvPIf~ 103 (130) T pfam08627 36 KESLSNKFKSFLKNSMSKETLTILIYVVLYIISGVINSVLLKKVMNKFTNYGFFLSQLTNYG-----YVPIFG 103 (130) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-----HHHHHH T ss_conf 78899999999997734547789999999999878999999999999865899999988427-----368999 No 201 >pfam11897 DUF3417 Protein of unknown function (DUF3417). This family of proteins are functionally uncharacterized. This protein is found in bacteria and archaea. Proteins in this family are typically between 145 to 860 amino acids in length. This protein is found associated with pfam00343. This protein has a conserved AYF sequence motif. Probab=26.94 E-value=18 Score=10.46 Aligned_cols=82 Identities=22% Similarity=0.349 Sum_probs=55.6 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH---H Q ss_conf 1579999987180138999999750745620677412525998999999850788789978999999999999998---2 Q gi|254781029|r 1721 DSLVENISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSALRNAIDRYISNFEEMLSEIA---Q 1797 (1828) Q Consensus 1721 ~s~s~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~~---~ 1797 (1828) |.-+.++=|.||+ ++|.... +| --++....+|..++++. .|++|...+++-+.+|..-|..-. + T Consensus 30 ~~~a~~LF~~iDp----~lW~~~~---hN--Pv~~L~~v~~~rL~~la----~D~~f~~~~~~v~~~~~~Yl~~~~w~~~ 96 (119) T pfam11897 30 NPEARALFRRLDP----ELWEETG---HN--PVRLLGEVSQERLEELA----ADPEFLARYDRVAADFDEYLSAPTWFQR 96 (119) T ss_pred CHHHHHHHHHHCH----HHHHHCC---CC--HHHHHHHCCHHHHHHHH----CCHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 8659999999699----9998768---99--79999868999999995----6999999999999999999868864323 Q ss_pred C-CCCHHHHHHHHHHHHHH Q ss_conf 5-87337899998766439 Q gi|254781029|r 1798 S-NDDSPLVQEHIMSNYGK 1815 (1828) Q Consensus 1798 ~-~~~~~~~~~~l~s~~g~ 1815 (1828) + +..+.-.++|.+.+-|. T Consensus 97 ~~~~~~~~~VAYFS~EfGl 115 (119) T pfam11897 97 NPGDEGPRPIAYFSMEFGL 115 (119) T ss_pred CCCCCCCCCEEEECCCCCC T ss_conf 5777899956997103044 No 202 >KOG1925 consensus Probab=26.76 E-value=19 Score=10.21 Aligned_cols=19 Identities=5% Similarity=-0.069 Sum_probs=9.2 Q ss_pred CCCHHHHHHHHHHHCCCCC Q ss_conf 8013899999975074562 Q gi|254781029|r 1732 DYDAFVQLWKSYTLGEDDI 1750 (1828) Q Consensus 1732 ~~~~~~~~w~~y~~g~~~~ 1750 (1828) ..++++|+-++.-+.-..+ T Consensus 753 ~~d~~DEImd~lVKsAT~~ 771 (817) T KOG1925 753 PSDTSDEIMDLLVKSATKS 771 (817) T ss_pred CCCHHHHHHHHHHHHHHCC T ss_conf 8885789999999876238 No 203 >COG0609 FepD ABC-type Fe3+-siderophore transport system, permease component [Inorganic ion transport and metabolism] Probab=26.73 E-value=19 Score=10.21 Aligned_cols=36 Identities=19% Similarity=0.121 Sum_probs=19.8 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 898999854187322799999999999999999999831 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFY 123 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~ 123 (1828) .|.++|.+-++|-..||.+- +-| +++.|+.++..++ T Consensus 71 sG~~lQ~v~rNpLAdP~ilG-Iss--GA~~~~~l~i~~~ 106 (334) T COG0609 71 AGALLQGVTRNPLADPGILG-ISS--GASLGAVLAILLF 106 (334) T ss_pred HHHHHHHHHCCCCCCCCHHC-HHH--HHHHHHHHHHHHH T ss_conf 99999999548789932305-655--6999999999998 No 204 >PRK12671 putative monovalent cation/H+ antiporter subunit G; Reviewed Probab=26.70 E-value=19 Score=10.20 Aligned_cols=27 Identities=19% Similarity=0.087 Sum_probs=20.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 227999999999999999999998316 Q gi|254781029|r 98 NIALRNAFMSSLIWLACELGFLIKFYT 124 (1828) Q Consensus 98 ~~~~~~~~~~s~~w~~~~~~~~~~~~~ 124 (1828) .-|.|.+-+++++-+.|+.+...+-+| T Consensus 6 ~lp~wa~i~v~~ll~~G~~f~liga~G 32 (120) T PRK12671 6 DIPLWAAILIAFFLVLGAGLTLIGTIG 32 (120) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 160999999999999999999999888 No 205 >COG5232 SEC62 Preprotein translocase subunit Sec62 [Intracellular trafficking and secretion] Probab=26.65 E-value=19 Score=10.20 Aligned_cols=20 Identities=30% Similarity=0.615 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 88999999999999999999 Q gi|254781029|r 139 EVITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 139 ~~~~~~~~~~~~p~~~~~~~ 158 (1828) |.|++|++.+|+-.+||-.. T Consensus 188 G~i~~ffvlaIlRliLf~it 207 (259) T COG5232 188 GFITFFFVLAILRLILFSIT 207 (259) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 29999999999999999845 No 206 >pfam02508 Rnf-Nqr Rnf-Nqr subunit, membrane protein. This is a family of integral membrane proteins including Rhodobacter-specific nitrogen fixation (rnf) proteins RnfA and RnfE and Na+-translocating NADH:ubiquinone oxidoreductase (Na+-NQR) subunits NqrD and NqrE. Probab=26.46 E-value=19 Score=10.17 Aligned_cols=16 Identities=6% Similarity=-0.047 Sum_probs=8.1 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELG 117 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~ 117 (1828) +.-.+.+.+|.--.+. T Consensus 4 ~~~i~~~~i~~nN~vl 19 (188) T pfam02508 4 YLSIFLSGLFVNNPVL 19 (188) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8999999999967999 No 207 >PRK13109 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=26.42 E-value=19 Score=10.16 Aligned_cols=30 Identities=23% Similarity=0.192 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999999999973 Q gi|254781029|r 143 LFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRL 182 (1828) Q Consensus 143 ~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl 182 (1828) ..++.+++|++++.+++.++ -++.++.+.| T Consensus 89 ~~~~~~l~P~~~~~~v~~i~----------~~~~Q~G~~f 118 (358) T PRK13109 89 LAAGRFLLPILLVLAVAGLA----------ASLLQNLPRF 118 (358) T ss_pred HHHHHHHHHHHHHHHHHHHH----------HHHHHCCCEE T ss_conf 99999999999999999999----------9986014030 No 208 >PRK00816 rnfD electron transport complex protein RnfD; Reviewed Probab=26.30 E-value=19 Score=10.15 Aligned_cols=49 Identities=10% Similarity=0.161 Sum_probs=23.9 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHH Q ss_conf 89899985418732279999999999999999999983162145557656406488999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITL 143 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (1828) +..+|-+|-- |...|+|...+.+++ ++.+....||+ ++.-+.+|.++|- T Consensus 79 vTglLlal~l-Pp~~P~W~~~iG~~f----aIvigK~vfGG-----lG~N~fNPAlvgr 127 (351) T PRK00816 79 LTALLLAVSI-PPLAPWWMIVLGTVF----AVIIAKQLYGG-----LGQNPFNPAMIGY 127 (351) T ss_pred HHHHHHHHHC-CCCCCHHHHHHHHHH----HHHHHHHHCCC-----CCCCCCCHHHHHH T ss_conf 9999999628-888737999999999----99999884277-----7878468899999 No 209 >pfam11139 DUF2910 Protein of unknown function (DUF2910). Some members in this bacterial family annotate the proteins as cytochrome C biogenesis proteins however this cannot be confirmed. Currently no function for this family is known. Probab=26.16 E-value=19 Score=10.13 Aligned_cols=35 Identities=11% Similarity=0.054 Sum_probs=15.2 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89899985418732279999999999999999999983 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) |+..+.-|.+ +.|.+...+|-+-|+.++++....+ T Consensus 15 i~~~vllL~~---prp~~~~laFl~g~~~~~~~~g~~~ 49 (212) T pfam11139 15 IGVAVLLLLR---PRPVRRLLAFLAGWLLGYVAVGVVL 49 (212) T ss_pred HHHHHHHHCC---CCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999848---9802666999999999999999999 No 210 >KOG2257 consensus Probab=26.10 E-value=19 Score=10.12 Aligned_cols=106 Identities=16% Similarity=0.217 Sum_probs=59.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 18732279999999999999999999983162145557656406488999999999999999999999999999999999 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) ..|.+-.||++.-. +.|.+-++.+.|++-|...+-+++ +---|.-.=.++ +|+.|+ .|++.-- T Consensus 3 ~~~t~e~YgF~~yi-v~~~~~ViylIWai~P~~~l~~~g-i~y~Psk~WalA----ip~~l~--v~~l~~~--------- 65 (135) T KOG2257 3 PHPTREYYGFVLYI-VSWTLFVIYLIWAITPVPILESLG-ITYYPSKYWALA----IPTYLL--VAVLLAY--------- 65 (135) T ss_pred CCCCCEEHHHHHHH-HHHHHHHHHHHHHCCCHHHHHHCC-CEEEHHHHHHHH----HHHHHH--HHHHHHH--------- T ss_conf 99742221468899-999999998874206378886567-446507889999----789999--9999999--------- Q ss_pred HHHHHHHHHCCHHHHH--HHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH Q ss_conf 9999999730925556--999999999999999----9723689999999999 Q gi|254781029|r 174 SIAGIALRLIDPEEYS--SEKMQSISSAVRKEI----VLMTEEIDRAISRASE 220 (1828) Q Consensus 174 ~m~~~a~rl~~p~~~a--~~~~~~~~qa~r~ev----~a~~~~~er~~~ra~e 220 (1828) |-+=--+|+... =++.-|+-.+-+|++ .-|-.|-+||+.+-.- T Consensus 66 ----v~ll~~N~e~tt~pl~Sl~ti~D~y~r~~~e~~v~~kk~~~~a~~~i~D 114 (135) T KOG2257 66 ----VFLLGYNFESTTKPLDSLNTITDSYAREIQEFAVKMKKGEDRAIDGISD 114 (135) T ss_pred ----HHHHHCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCC T ss_conf ----9998357100479875200203433120102478776215576787100 No 211 >pfam05982 DUF897 Domain of unknown function (DUF897). Family of bacterial proteins with unknown function Probab=26.10 E-value=19 Score=10.12 Aligned_cols=51 Identities=18% Similarity=0.307 Sum_probs=31.5 Q ss_pred HHCHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 40648899999999999999999999999--9999999999999999973092555699999999999 Q gi|254781029|r 135 AMRPEVITLFLVVSVVPILLFFSFFIMIS--RARDMHDASQSIAGIALRLIDPEEYSSEKMQSISSAV 200 (1828) Q Consensus 135 ~~~~~~~~~~~~~~~~p~~~~~~~a~~~~--r~~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~qa~ 200 (1828) ...|.+.++ +.++++|++-||.+-.+.+ |..---++ -+.-+-+++|..-|+ T Consensus 53 ~~~~~~~~v-~lg~liPl~ay~iLr~~~~l~~~dAaAiA--------------AhYGSVSaVTF~aa~ 105 (327) T pfam05982 53 VVGPLVAGV-ALGFLIPLIAYFLLRKLTKLDTVDAAATA--------------AHYGSVSAVTFAAAV 105 (327) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCHHHHHHHH--------------HHCCCHHHHHHHHHH T ss_conf 999999999-99999999999999983489746589999--------------863752699999999 No 212 >TIGR02587 TIGR02587 putative integral membrane protein TIGR02587; InterPro: IPR013416 This entry is found in Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus in a conserved two-gene neighbourhood. Proteins containing this entry appear to span the membrane seven times.. Probab=26.08 E-value=20 Score=10.12 Aligned_cols=84 Identities=14% Similarity=0.196 Sum_probs=52.7 Q ss_pred CCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHH----HHHHHHHHH-----CCHH-HHH---HHHHHHHCHHHHHH Q ss_conf 7755677989899985418732279999999999999----999999983-----1621-455---57656406488999 Q gi|254781029|r 77 ANDLDNGGITHILQNIGKNSLNIALRNAFMSSLIWLA----CELGFLIKF-----YTNS-PMD---SFYDFAMRPEVITL 143 (1828) Q Consensus 77 a~~d~r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~----~~~~~~~~~-----~~~~-~~~---~~~~~~~~~~~~~~ 143 (1828) +.|==--=+|.|.=|+..-|.-=+..+|..++=-|+. +.++++|.+ |.++ -.. .+-+.+-+=+.+| T Consensus 157 l~eLf~~~vGAl~ls~NiAPTeEv~liAyk~~P~hi~~L~l~sl~i~h~fVy~l~Frg~~~~~~g~g~~q~f~r~T~~g- 235 (284) T TIGR02587 157 LAELFLMLVGALFLSLNIAPTEEVVLIAYKISPYHILALILASLVIMHAFVYELEFRGQHKRREGKGIFQRFLRYTLIG- 235 (284) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHH- T ss_conf 9999999999999984279831189998504878999999999999988987312577667543563111107752899- Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254781029|r 144 FLVVSVVPILLFFSFFIM 161 (1828) Q Consensus 144 ~~~~~~~p~~~~~~~a~~ 161 (1828) +++++++-+.+.|+|--+ T Consensus 236 Y~v~~l~s~y~LwtF~r~ 253 (284) T TIGR02587 236 YLVSLLVSIYILWTFERV 253 (284) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 999999999999863220 No 213 >pfam09716 ETRAMP Malarial early transcribed membrane protein (ETRAMP). These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii. A homologue from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane. Members have an initial hydrophobic, Phe/Tyr-rich, stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes. Probab=25.97 E-value=20 Score=10.10 Aligned_cols=36 Identities=14% Similarity=0.125 Sum_probs=19.4 Q ss_pred HHHHHHCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99985418-7322799999999999999999999831 Q gi|254781029|r 88 ILQNIGKN-SLNIALRNAFMSSLIWLACELGFLIKFY 123 (1828) Q Consensus 88 ~~~~l~~~-~s~~~~~~~~~~s~~w~~~~~~~~~~~~ 123 (1828) +...+++. -...+..+.+++|++-++.|.++.+++| T Consensus 40 ~~~~~~kKkk~kK~~iiSsias~lalli~~~~G~g~y 76 (84) T pfam09716 40 IDDDLEKKKKNKKKIIISSIASGLALLIGTALGLGYY 76 (84) T ss_pred CCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 0138999886341200899999999999988887654 No 214 >pfam06757 Ins_allergen_rp Insect allergen related repeat, nitrile-specifier detoxification. This family exemplifies a case of novel gene evolution. The case in point is the arms-race between plants and their infective insective herbivores in the area of the glucosinolate-myrosinase system. Brassicas have developed the glucosinolate-myrosinase system as chemical defence mechanism against the insects, and consequently the insects have adapted to produce a detoxifying molecule, nitrile-specifier protein (NSP). NSP is present in the small white butterfly Pieris rapae. NSP is structurally different from and has no amino acid homology to any known detoxifying enzymes, and it appears to have arisen by a process of domain and gene duplication of a sequence of unknown function that is widespread in insect species and referred to as insect-allergen-repeat protein. Thus this family is found either as a single domain or as a multiple repeat-domain. Probab=25.97 E-value=20 Score=10.10 Aligned_cols=65 Identities=12% Similarity=0.213 Sum_probs=46.5 Q ss_pred CCCHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 8421579999987180138999999750745620677412525998999999850788789978999 Q gi|254781029|r 1718 SSIDSLVENISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSALRNAIDRY 1784 (1828) Q Consensus 1718 ~~l~s~s~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry 1784 (1828) ..++++--||-.++..+....+.+.-... +.--+.+|...+-..|..+-.++...|+|+.-+++. T Consensus 103 ~gl~gf~~dv~~~lP~~~l~aL~~~K~~~--s~~F~~~~~~l~S~ef~~l~~~l~~~~~~~~~~~~L 167 (181) T pfam06757 103 RGLNGFVDDVLALLPLDELKALFEEKLAT--SPEFKAFYEALRSPEFKKLLEALWNSPEVQRLLQEL 167 (181) T ss_pred CCHHHHHHHHHHHCCHHHHHHHHHHHHHC--CHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 87688999999979999999999999876--899999999974899999999998296899999999 No 215 >pfam07214 DUF1418 Protein of unknown function (DUF1418). This family consists of several hypothetical Enterobacterial proteins of around 100 residues in length. Members of this family are often described as YbjC. In E. coli the ybjC gene is located downstream of nfsA (which encodes the major oxygen-insensitive nitroreductase). It is thought that nfsA and ybjC form an operon an its promoter is a class I SoxS-dependent promoter. The function of this family is unknown. Probab=25.95 E-value=20 Score=10.10 Aligned_cols=34 Identities=15% Similarity=0.122 Sum_probs=19.3 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 576564064889999999999999999999999999 Q gi|254781029|r 130 SFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) ++.+.++.|++.-+ -.+++=..|+=..+.++||+ T Consensus 34 ~lP~~l~tp~a~i~--MIflGi~lmiPAAv~ivWR~ 67 (96) T pfam07214 34 SLPGPIASPTAAIL--MIFLGVGLMIPAAVVVVWRV 67 (96) T ss_pred ECCHHHCCCHHHHH--HHHHHHHHHHHHHHHHHHHH T ss_conf 37522158058999--99999999889999999999 No 216 >PRK11562 nitrite transporter NirC; Provisional Probab=25.74 E-value=20 Score=10.07 Aligned_cols=28 Identities=0% Similarity=0.010 Sum_probs=21.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8732279999999999999999999983 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~ 122 (1828) ++|+.-|.+-++++++.++.|..+++.. T Consensus 21 ~~~~~~yfv~a~lAGafI~~G~~~~~~v 48 (268) T PRK11562 21 ANNPLGFWVSSAMAGAYVGLGIILIFTL 48 (268) T ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4585999999999999999999999997 No 217 >TIGR00957 MRP_assoc_pro multi drug resistance-associated protein (MRP); InterPro: IPR005292 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to the ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux reduced its function. The C-terminal domain seems to be highly conserved. ; GO: 0005215 transporter activity, 0006810 transport, 0016021 integral to membrane. Probab=25.71 E-value=20 Score=10.07 Aligned_cols=39 Identities=18% Similarity=0.440 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCHH-HHHHHHHHHHCHHHHHH Q ss_conf 9999999999999999831621-45557656406488999 Q gi|254781029|r 105 FMSSLIWLACELGFLIKFYTNS-PMDSFYDFAMRPEVITL 143 (1828) Q Consensus 105 ~~~s~~w~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 143 (1828) .+.-++|+..=+=|.+.|++.+ ++....=|+..|.++|+ T Consensus 71 ~LG~lLW~v~WADLFYSF~~~~~G~~~APVflV~P~~~G~ 110 (1542) T TIGR00957 71 ALGFLLWIVCWADLFYSFWERSRGVLPAPVFLVSPLLLGI 110 (1542) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEECCHHEECHHHHHH T ss_conf 9999999999988777778642762117501002079999 No 218 >PRK05349 Na(+)-translocating NADH-quinone reductase subunit B; Provisional Probab=25.65 E-value=20 Score=10.06 Aligned_cols=14 Identities=7% Similarity=0.183 Sum_probs=7.3 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 79999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLA 113 (1828) Q Consensus 100 ~~~~~~~~s~~w~~ 113 (1828) ..|+|++-..+|.. T Consensus 59 ~V~~al~P~~~fg~ 72 (407) T PRK05349 59 TVWLALFPAMFFGM 72 (407) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 219 >pfam05478 Prominin Prominin. The prominins are an emerging family of proteins that among the multispan membrane proteins display a novel topology. Mouse prominin and human prominin (mouse)-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration. Probab=25.50 E-value=20 Score=10.04 Aligned_cols=15 Identities=7% Similarity=0.202 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254781029|r 244 DNITQNLKQEREAII 258 (1828) Q Consensus 244 r~L~~el~~ere~i~ 258 (1828) +.++.+...+-+.|+ T Consensus 187 ~~fl~d~~~qI~~ll 201 (809) T pfam05478 187 RTYLNDTPQQIDHVL 201 (809) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999980499999999 No 220 >TIGR02916 PEP_his_kin putative PEP-CTERM system histidine kinase; InterPro: IPR014265 Proteins in this entry are putative periplasmic sensor signal transduction histidine kinases. They all contain a GAF domain that is present in phytochromes and cGMP-specific phosphodiesterases, and which has been experimentally proven to be involved in protein:protein interactions. They also contain a C-terminal histidine kinase domain, which is composed of a dimerisation sub-domain and an ATP/ADP-binding phosphotransfer, or catalytic, sub-domain. The proteins in this entry are found strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria, where members of IPR014264 from INTERPRO and IPR014266 from INTERPRO also occur.. Probab=25.09 E-value=20 Score=9.97 Aligned_cols=33 Identities=15% Similarity=0.106 Sum_probs=26.5 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 541873227999999999999999999998316 Q gi|254781029|r 92 IGKNSLNIALRNAFMSSLIWLACELGFLIKFYT 124 (1828) Q Consensus 92 l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~ 124 (1828) =.+++......+|+++|++|+...+++....+| T Consensus 24 wR~~~~G~~ll~A~~~sa~wA~~~~~l~~~~~~ 56 (696) T TIGR02916 24 WRQRLAGGLLLLAAALSAVWALASAALVLLDYP 56 (696) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 227864169999999999999999998740575 No 221 >COG3162 Predicted membrane protein [Function unknown] Probab=25.09 E-value=20 Score=9.97 Aligned_cols=17 Identities=24% Similarity=0.405 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHHH-HH Q ss_conf 999999999999999-99 Q gi|254781029|r 152 ILLFFSFFIMISRAR-DM 168 (1828) Q Consensus 152 ~~~~~~~a~~~~r~~-~~ 168 (1828) ++-|-..++.+|||+ +. T Consensus 72 v~tfVlt~IYv~rAn~~f 89 (102) T COG3162 72 VMTFVLTGIYVRRANGEF 89 (102) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 999999999865741321 No 222 >pfam04120 Iron_permease Low affinity iron permease. Probab=25.07 E-value=20 Score=9.96 Aligned_cols=25 Identities=20% Similarity=0.267 Sum_probs=16.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8732279999999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFL 119 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~ 119 (1828) -.++..|..+.++-++|++.|..+. T Consensus 12 aG~p~~fv~a~~~vv~W~~~Gp~f~ 36 (133) T pfam04120 12 AGRPITFVLAVLLVVVWAASGPVFH 36 (133) T ss_pred HCCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 7780999999999999998612478 No 223 >TIGR02163 napH_ ferredoxin-type protein, NapH/MauN family; InterPro: IPR011886 Most members of this family are the NapH protein, found next to NapG,in operons that encode the periplasmic nitrate reductase. Some species with this reductase lack NapC but accomplish electron transfer to NapAB in some other manner, likely to involve NapH, NapG, and/or some other protein. A few members of this protein are designated MauN and are found in methylamine utilization operons in species that appear to lack a periplasmic nitrate reductase.; GO: 0005506 iron ion binding, 0006118 electron transport. Probab=24.99 E-value=20 Score=9.94 Aligned_cols=30 Identities=20% Similarity=0.467 Sum_probs=12.3 Q ss_pred HHHHHHHHHH--CHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4555765640--64889999999999999999999 Q gi|254781029|r 127 PMDSFYDFAM--RPEVITLFLVVSVVPILLFFSFF 159 (1828) Q Consensus 127 ~~~~~~~~~~--~~~~~~~~~~~~~~p~~~~~~~a 159 (1828) |+-.+.-+.+ .|...++ .|+ +=|++||+++ T Consensus 45 Pli~LQ~l~A~~~p~~~aL-iGA--~iI~~FYal~ 76 (263) T TIGR02163 45 PLITLQILLAGHLPDLNAL-IGA--LIIVAFYALF 76 (263) T ss_pred HHHHHHHHHHHHHHHHHHH-HHH--HHHHHHHHHH T ss_conf 6899999986107899999-999--9999999998 No 224 >TIGR02800 propeller_TolB Tol-Pal system beta propeller repeat protein TolB; InterPro: IPR014167 Members of this protein family are the TolB periplasmic protein of Gram-negative bacteria. TolB is part of the Tol-Pal (peptidoglycan-associated lipoprotein) multiprotein complex, comprising five envelope proteins, TolQ, TolR, TolA, TolB and Pal, which form two complexes. The TolQ, TolR and TolA inner-membrane proteins interact via their transmembrane domains. The {beta}-propeller domain of the periplasmic protein TolB is responsible for its interaction with Pal. TolB also interacts with the outer-membrane peptidoglycan-associated proteins Lpp and OmpA. TolA undergoes a conformational change in response to changes in the proton-motive force, and interacts with Pal in an energy-dependent manner. The C-terminal periplasmic domain of TolA also interacts with the N-terminal domain of TolB. The Tol-PAL system is required for bacterial outer membrane integrity. Escherichia coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.. Probab=24.94 E-value=21 Score=9.93 Aligned_cols=34 Identities=15% Similarity=0.208 Sum_probs=27.6 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 57999998718013899999975074562067741 Q gi|254781029|r 1722 SLVENISKFIDYDAFVQLWKSYTLGEDDIFSKRLY 1756 (1828) Q Consensus 1722 s~s~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~ 1756 (1828) +...+=-|.+-|..+++.-+ .+.|+|++|+=||+ T Consensus 137 ~~~~~~~R~~AH~iaD~iye-~LTG~~G~F~tRIa 170 (439) T TIGR02800 137 TVTASQLRRVAHRIADAIYE-KLTGERGAFSTRIA 170 (439) T ss_pred EECHHHHHHHHHHHHHHHHH-HHCCCCCCCCCCEE T ss_conf 53478988785588898999-85689985656479 No 225 >KOG1962 consensus Probab=24.88 E-value=21 Score=9.91 Aligned_cols=20 Identities=5% Similarity=0.218 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999983 Q gi|254781029|r 103 NAFMSSLIWLACELGFLIKF 122 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~~~~~ 122 (1828) |+.+|+++.+=+++.+...+ T Consensus 2 ~tlvf~iL~~Eial~~iL~L 21 (216) T KOG1962 2 WTLVFTILYAEIALFLILLL 21 (216) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 77999999999999999981 No 226 >PRK05470 fumarate reductase subunit D; Provisional Probab=24.78 E-value=21 Score=9.89 Aligned_cols=39 Identities=10% Similarity=0.175 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHH--HHHHHHHHHCHHH Q ss_conf 99999999999999999998316214--5557656406488 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSP--MDSFYDFAMRPEV 140 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 140 (1828) |.|.++-++=+..|+.+-++.++... -..+..|..+|-+ T Consensus 23 ~~Al~~Pvlill~GillPlG~~~~~a~~y~~i~aFa~~~iG 63 (118) T PRK05470 23 WSAIIAPVLILLVGILLPLGLFPGDALSYERVLAFAQSFIG 63 (118) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 59999889999999986024577763369999999970689 No 227 >pfam02313 Fumarate_red_D Fumarate reductase subunit D. Fumarate reductase is a membrane-bound flavoenzyme consisting of four subunits, A-B. A and B comprise the membrane-extrinsic catalytic domain and C and D link the catalytic centres to the electron-transport chain. This family consists of the 13kD hydrophobic subunit D. Probab=24.69 E-value=21 Score=9.87 Aligned_cols=38 Identities=13% Similarity=0.128 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCHHH Q ss_conf 999999999999999999831621--45557656406488 Q gi|254781029|r 103 NAFMSSLIWLACELGFLIKFYTNS--PMDSFYDFAMRPEV 140 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 140 (1828) .|.++-++=+..|+.+-.+.+++. .-..+..|..++-+ T Consensus 24 ~A~~~PvlIll~GillPlG~~~~~a~~y~~i~aFa~s~iG 63 (118) T pfam02313 24 SAIVTPVLILILGILLPLGLIDAEALSYDRIIAFAQSWIG 63 (118) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 9999889999999998535678542589999999970899 No 228 >PRK02889 tolB translocation protein TolB; Provisional Probab=24.63 E-value=21 Score=9.85 Aligned_cols=25 Identities=12% Similarity=0.235 Sum_probs=11.2 Q ss_pred HHHCCCHHHHHHHHHHHCCCCCHHHH Q ss_conf 87180138999999750745620677 Q gi|254781029|r 1729 KFIDYDAFVQLWKSYTLGEDDIFSKR 1754 (1828) Q Consensus 1729 ~~~~~~~~~~~w~~y~~g~~~~f~~~ 1754 (1828) |.+-|..++++..+. -|++++|.=| T Consensus 141 R~~aH~iaD~Iy~~l-TG~~G~F~tr 165 (430) T PRK02889 141 RATAHKIADYIYEKL-TGVRGVFATR 165 (430) T ss_pred HHHHHHHHHHHHHHH-HCCCCCCCCE T ss_conf 999999999999986-1898542026 No 229 >PRK04897 heat shock protein HtpX; Provisional Probab=24.55 E-value=21 Score=9.83 Aligned_cols=29 Identities=28% Similarity=0.383 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 997236899999999999999999999999 Q gi|254781029|r 204 IVLMTEEIDRAISRASELEKTVRSEIEVLE 233 (1828) Q Consensus 204 v~a~~~~~er~~~ra~ele~lv~~ev~~le 233 (1828) +-+..+|+=+.+.+ .|||.+++-|++.+. T Consensus 121 ~V~vt~GLL~~L~~-~ELeaVlAHElgHik 149 (298) T PRK04897 121 AVAVTTGLLEVMNR-EELEGVIGHEISHIR 149 (298) T ss_pred EEEECHHHHHHCCH-HHHHHHHHHHHHHHH T ss_conf 89961798976899-999999999999997 No 230 >pfam10755 DUF2585 Protein of unknown function (DUF2585). This family is conserved in Proteobacteria. The function is not known. Probab=24.48 E-value=21 Score=9.81 Aligned_cols=59 Identities=19% Similarity=0.270 Sum_probs=39.9 Q ss_pred HCCCCCHHHHHHHHHHHHHHHH-----------HHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 4187322799999999999999-----------9999998316214555765640648899999999999999 Q gi|254781029|r 93 GKNSLNIALRNAFMSSLIWLAC-----------ELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILL 154 (1828) Q Consensus 93 ~~~~s~~~~~~~~~~s~~w~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 154 (1828) .+.|-+-...+|++....|=.. .+-++..+||+|-+-|..|...+-.+ |..+.-.|+-. T Consensus 56 ~r~p~~~rL~~A~liE~aWEivENsp~II~rYR~aTis~~Y~GDSvlNSv~D~~~m~~G---F~~A~rlPv~v 125 (165) T pfam10755 56 PRLPLGWRLALAVLIEAAWEILENSPFIIDRYRAATISLDYYGDSVLNSVSDTLAMLLG---FLLAARLPVWV 125 (165) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHHH T ss_conf 58872289999999976677660787899999876220244532888878899999999---99997513999 No 231 >CHL00036 ycf4 photosystem I assembly protein Ycf4 Probab=24.47 E-value=21 Score=9.81 Aligned_cols=42 Identities=21% Similarity=0.244 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----CCHH--HHHHHHHHHHCHHHHHHHH Q ss_conf 79999999999999999999983----1621--4555765640648899999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKF----YTNS--PMDSFYDFAMRPEVITLFL 145 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~ 145 (1828) .||||.++.+ ||+|+...- ++.. +.....++.--|+++.+.+ T Consensus 21 N~~wA~il~l----G~lGFll~g~SSYlg~nllpf~~~~~i~FiPQGivM~F 68 (184) T CHL00036 21 NFFWAFILFL----GSLGFLLVGISSYLGKNLISFLPSQQIVFFPQGIVMCF 68 (184) T ss_pred HHHHHHHHHH----HHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCHHHHH T ss_conf 7789999987----55999987768976767102478401735064176657 No 232 >PRK10973 glycerol-3-phosphate transporter membrane protein; Provisional Probab=24.45 E-value=21 Score=9.80 Aligned_cols=29 Identities=7% Similarity=0.080 Sum_probs=11.2 Q ss_pred CCCHHHHHHHHHC--CCCCHHHHHHHHHHHH Q ss_conf 7798989998541--8732279999999999 Q gi|254781029|r 82 NGGITHILQNIGK--NSLNIALRNAFMSSLI 110 (1828) Q Consensus 82 r~~~~~~~~~l~~--~~s~~~~~~~~~~s~~ 110 (1828) ..+...-...++. .....+||....-|++ T Consensus 52 ~~~~~ny~~~~~~~~~~~~~~f~~~~~NSli 82 (281) T PRK10973 52 THLLENIANIWVNGVGGNSAPFGRMLLNSFV 82 (281) T ss_pred CCHHHHHHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 5316899999962553320159999999999 No 233 >COG3924 Predicted membrane protein [Function unknown] Probab=24.44 E-value=21 Score=9.80 Aligned_cols=55 Identities=18% Similarity=0.281 Sum_probs=31.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998316214555765640648899999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) -++|.-++++.++| ++..++.+ +...++++.|.=.- ++ -+..|++|+-.--.|++ T Consensus 12 A~WAlgLtllYl~g--W~v~AYlp----~~t~G~~gfP~WFE-~a-Ci~lPllFi~l~~~mvk 66 (80) T COG3924 12 ARWALGLTLLYLAG--WLVAAYLP----GNTPGFTGFPLWFE-MA-CILLPLLFIVLCWAMVK 66 (80) T ss_pred HHHHHHHHHHHHHH--HHHHHCCC----CCCCCCCCCHHHHH-HH-HHHHHHHHHHHHHHHHH T ss_conf 99999899999999--99987078----88888777719999-99-99999999999999999 No 234 >TIGR00917 2A060601 Niemann-Pick C type protein family; InterPro: IPR004765 The Niemann-Pick C type protein is an integral membrane protein, which indicates that it is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. The defective protein has been associated with Niemann-Pick disease which is described in humans as autosomal recessive lipidosis. It is characterised by the lysosomal accumulation of unestrified cholesterol.; GO: 0030301 cholesterol transport, 0016021 integral to membrane. Probab=24.37 E-value=21 Score=9.78 Aligned_cols=24 Identities=17% Similarity=0.293 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 9999999999999999999983162 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTN 125 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~ 125 (1828) .|+--.|.++ +.-|..+...+||. T Consensus 235 iWtfimftlv-ltyGtsmGlvlfGn 258 (1471) T TIGR00917 235 IWTFIMFTLV-LTYGTSMGLVLFGN 258 (1471) T ss_pred HHHHHHHHHH-HHHHHHHCEEEEEE T ss_conf 9999999999-97300210002401 No 235 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=24.23 E-value=22 Score=9.75 Aligned_cols=16 Identities=6% Similarity=0.285 Sum_probs=7.2 Q ss_pred CCCCCCCCHHHHHHHH Q ss_conf 4444433526787531 Q gi|254781029|r 1682 KNNHAIKEWFNKILSS 1697 (1828) Q Consensus 1682 ~~~~~~~~w~sdll~~ 1697 (1828) ...+++..|...++.. T Consensus 313 ~~~~gg~~wkG~vi~a 328 (420) T COG4942 313 QADGGGLRWKGMVIGA 328 (420) T ss_pred CCCCCCCCCCCEEEEC T ss_conf 5678876106758735 No 236 >KOG4327 consensus Probab=24.06 E-value=22 Score=9.70 Aligned_cols=13 Identities=15% Similarity=0.156 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q gi|254781029|r 1633 IEEQINTLKDFQK 1645 (1828) Q Consensus 1633 v~eqi~al~el~~ 1645 (1828) ++.+++++.+.+. T Consensus 124 ia~~~e~~~~~~~ 136 (218) T KOG4327 124 IANNIEQNAQENE 136 (218) T ss_pred HHHHHHHHHHCCC T ss_conf 9989998762031 No 237 >COG1174 OpuBB ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism] Probab=23.74 E-value=22 Score=9.62 Aligned_cols=18 Identities=17% Similarity=0.150 Sum_probs=9.6 Q ss_pred HHHHHHCCHHHHHHHHHH Q ss_conf 999973092555699999 Q gi|254781029|r 177 GIALRLIDPEEYSSEKMQ 194 (1828) Q Consensus 177 ~~a~rl~~p~~~a~~~~~ 194 (1828) .|=+-||-|--++.=|+. T Consensus 137 ~VelPlAlPvIlaGIR~a 154 (221) T COG1174 137 KVELPLALPVILAGIRTA 154 (221) T ss_pred HHHCCCCHHHHHHHHHHH T ss_conf 741200189998648999 No 238 >PRK08665 ribonucleotide-diphosphate reductase subunit alpha; Validated Probab=23.73 E-value=22 Score=9.62 Aligned_cols=10 Identities=30% Similarity=0.504 Sum_probs=3.2 Q ss_pred HHHHHHHHHH Q ss_conf 5799999871 Q gi|254781029|r 1722 SLVENISKFI 1731 (1828) Q Consensus 1722 s~s~di~~~~ 1731 (1828) +++.-++|+| T Consensus 632 ~~~ea~~rli 641 (733) T PRK08665 632 SQSEAIGRLV 641 (733) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 239 >COG4720 Predicted membrane protein [Function unknown] Probab=23.64 E-value=23 Score=9.59 Aligned_cols=23 Identities=4% Similarity=-0.172 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 27999999999999999999998 Q gi|254781029|r 99 IALRNAFMSSLIWLACELGFLIK 121 (1828) Q Consensus 99 ~~~~~~~~~s~~w~~~~~~~~~~ 121 (1828) +..|+.++..++|+.++-.++-. T Consensus 106 ~i~~~~~v~~~~~mv~~y~l~~~ 128 (177) T COG4720 106 TIIWLGIVLGLAIMVGWYLLAPY 128 (177) T ss_pred EEEHHHHHHHHHHHHHHHHHHHH T ss_conf 11026999999999999999988 No 240 >pfam05664 DUF810 Protein of unknown function (DUF810). This family consists of several plant proteins of unknown function. Probab=23.61 E-value=23 Score=9.59 Aligned_cols=15 Identities=13% Similarity=0.270 Sum_probs=7.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254781029|r 190 SEKMQSISSAVRKEI 204 (1828) Q Consensus 190 ~~~~~~~~qa~r~ev 204 (1828) ..+.+|+++.+|.+. T Consensus 52 ~~~~~t~~E~mR~QM 66 (674) T pfam05664 52 SKRPVTVLELMRIQM 66 (674) T ss_pred CCCCCCHHHHHHHHH T ss_conf 899865999999987 No 241 >KOG0072 consensus Probab=23.57 E-value=23 Score=9.57 Aligned_cols=33 Identities=9% Similarity=0.067 Sum_probs=15.5 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHH Q ss_conf 8878887755677989899985418732279999 Q gi|254781029|r 71 YSGFYPANDLDNGGITHILQNIGKNSLNIALRNA 104 (1828) Q Consensus 71 ~~~~~pa~~d~r~~~~~~~~~l~~~~s~~~~~~~ 104 (1828) +|+|.|+|= .-++...-..-|.+..|-.|||-- T Consensus 49 tigfnve~v-~yKNLk~~vwdLggqtSirPyWRc 81 (182) T KOG0072 49 TIGFNVETV-PYKNLKFQVWDLGGQTSIRPYWRC 81 (182) T ss_pred CCCCCCCCC-CCCCCCCEEEECCCCCCCCHHHHH T ss_conf 988680215-536655124671586433277888 No 242 >cd01209 SHC SHC phosphotyrosine-binding (PTB) domain. SHC is a substrate for receptor tyrosine kinases, which can interact with phosphoproteins at NPXY motifs. SHC contains an PTB domain followed by an SH2 domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether. Probab=23.41 E-value=23 Score=9.53 Aligned_cols=12 Identities=17% Similarity=0.340 Sum_probs=4.2 Q ss_pred CCCCCCHHHHHH Q ss_conf 778842157999 Q gi|254781029|r 1715 DSLSSIDSLVEN 1726 (1828) Q Consensus 1715 ~~~~~l~s~s~d 1726 (1828) |+.+--+.|+-| T Consensus 133 hv~~c~~~lA~d 144 (160) T cd01209 133 HVLECPEGLAQD 144 (160) T ss_pred EEEECCCCHHHH T ss_conf 999867734899 No 243 >PRK09881 ATP-dependent peptide transporter membrane subunit; Provisional Probab=23.35 E-value=23 Score=9.52 Aligned_cols=51 Identities=10% Similarity=-0.044 Sum_probs=30.4 Q ss_pred HHHH-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH---HHH-HHHHHHHHCHHHH Q ss_conf 9854-18732279999999999999999999983162---145-5576564064889 Q gi|254781029|r 90 QNIG-KNSLNIALRNAFMSSLIWLACELGFLIKFYTN---SPM-DSFYDFAMRPEVI 141 (1828) Q Consensus 90 ~~l~-~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~---~~~-~~~~~~~~~~~~~ 141 (1828) +-+. .|.|=..-.++++++.++. ..+|+..++|++ ..+ .-..-+...|..+ T Consensus 88 rll~G~r~sL~i~l~~~~i~~~iG-~~lG~~aGy~gG~~D~il~ri~Dv~~aiP~ll 143 (296) T PRK09881 88 RVLVGSQQSILAGLVVVAIAGMIG-SLLGCLSGVLGGRADAIIMRIMDIMLSIPSLV 143 (296) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHH T ss_conf 988879999999999999999999-99999888817607665567776301586057 No 244 >PRK00944 hypothetical protein; Provisional Probab=23.34 E-value=23 Score=9.52 Aligned_cols=17 Identities=24% Similarity=0.309 Sum_probs=7.0 Q ss_pred CCCCHHHHHHHHHHHHH Q ss_conf 87322799999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIW 111 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w 111 (1828) .|-+....+|++....| T Consensus 85 ~~~~~rL~~A~liE~aW 101 (193) T PRK00944 85 KPLGARLLLAVLIEGAW 101 (193) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 86468999999997678 No 245 >pfam05569 Peptidase_M56 BlaR1 peptidase M56. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. Homologues to this peptidase domain, which corresponds to Merops family M56, are also found in a number of other bacterial genome sequences. Probab=23.34 E-value=23 Score=9.51 Aligned_cols=56 Identities=13% Similarity=0.201 Sum_probs=25.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999--99999999999999-------9999999999999999999 Q gi|254781029|r 231 VLENNYTKSEMRIDNITQNLKQER--EAIINHGTQLCTSI-------AEVHESLKEELSLTSEEIS 287 (1828) Q Consensus 231 ~ler~y~dnE~rir~L~~el~~er--e~i~~~ae~~r~~i-------~~~~e~l~~el~~~s~~i~ 287 (1828) .+-..|++.|.+. -|.+||.|-| |.+.+-.-.+-..| -=.--.++.|.+.+.|+.. T Consensus 186 ~~~~~~~~~~l~~-il~HEl~H~kr~D~~~~~l~~~~~~i~WfNPlvw~~~~~~~~d~E~aCD~~V 250 (299) T pfam05569 186 DFDTRLSDEELDY-ILLHELAHLKRGDLIINLLVAVLQCIHWFNPLVHLAFRKIRIDQELACDAAV 250 (299) T ss_pred HHHHHCCHHHHHH-HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6641089999999-9999999997818999999999999998769999999999999999879999 No 246 >COG1495 DsbB Disulfide bond formation protein DsbB [Posttranslational modification, protein turnover, chaperones] Probab=23.25 E-value=23 Score=9.49 Aligned_cols=24 Identities=13% Similarity=-0.009 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 799999999999999999999831 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFY 123 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~ 123 (1828) -+|+..++.++=+++++++.+.+. T Consensus 13 ~~~ll~~~~~~~~~~~~al~fq~i 36 (170) T COG1495 13 LLWLLLALLGLALALLAALYFQYI 36 (170) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999998 No 247 >KOG0206 consensus Probab=23.21 E-value=23 Score=9.48 Aligned_cols=12 Identities=25% Similarity=0.678 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254781029|r 148 SVVPILLFFSFF 159 (1828) Q Consensus 148 ~~~p~~~~~~~a 159 (1828) .++|++++|+.- T Consensus 83 ~~~pl~~vl~~t 94 (1151) T KOG0206 83 TLVPLLFVLGIT 94 (1151) T ss_pred EEECCCEEHHHH T ss_conf 020100001488 No 248 >TIGR00897 2A0118 polyol permease; InterPro: IPR004748 Proteins in this entry are mostly known or predicted polyol transporters. The best characterised of these proteins are DalT and RbtT from Klebsiella pneumoniae, which tranpsort D-arabinatol and ribotol respectively. Like other members of the Major Facilitator Superfamily (MFS), DalT and RbtT appear to be secondary transporters capable only of transporting solutes in response to chemiosmotic ion gradients . They contain a total of twelve predicted transmembrane helices; six in the N-terminal portion of the protein, and six in the C-terminal portion. Residues defining substrate specificity lie in the N-terminal region of these proteins.; GO: 0005215 transporter activity, 0006810 transport, 0016021 integral to membrane. Probab=23.20 E-value=23 Score=9.48 Aligned_cols=65 Identities=9% Similarity=0.163 Sum_probs=32.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH--CC----HH-----H---HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32279999999999999999999983--16----21-----4---55576564064889999999999999999999999 Q gi|254781029|r 97 LNIALRNAFMSSLIWLACELGFLIKF--YT----NS-----P---MDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~~~~~~~~~--~~----~~-----~---~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) +..-||.+.+.+..|.+.-+|+--.- +| .. + ++..=.-|..|+..-+ ....+.-+-..|.||.|- T Consensus 321 ~Ghs~av~~iia~~~G~~lAgyVPl~A~~Ptlapk~KGAA~SVlNL~aGlsaF~aPaia~l-fig~~ga~GVv~ifaALY 399 (411) T TIGR00897 321 IGHSFAVVLIIAIIYGILLAGYVPLAAVLPTLAPKDKGAAMSVLNLSAGLSAFVAPAIAVL-FIGLLGAIGVVWIFAALY 399 (411) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHCCCHHHHHHHHHHH T ss_conf 2258999999999999999866348874223489987315648888862887888999999-985002027999999999 No 249 >CHL00016 ndhG NADH dehydrogenase subunit 6 Probab=23.16 E-value=23 Score=9.47 Aligned_cols=27 Identities=15% Similarity=0.391 Sum_probs=13.3 Q ss_pred HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65640648899999999999999999999999 Q gi|254781029|r 132 YDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) ++|.+.-+.+ +++|++ +..|.|++|.- T Consensus 55 A~Fla~~QIl-VYaGAI----~VLflFvIMll 81 (177) T CHL00016 55 SDFVAAAQLL-IYVGAI----NVLIIFAVMFM 81 (177) T ss_pred CHHHHHHHHH-HHHHHH----HHHHHHHHHHC T ss_conf 5899999999-999999----99999999982 No 250 >COG1808 Predicted membrane protein [Function unknown] Probab=23.13 E-value=23 Score=9.46 Aligned_cols=30 Identities=13% Similarity=0.174 Sum_probs=14.3 Q ss_pred HHHHHHHHHCCCCCHHHHHHHH--HHHHHHHH Q ss_conf 8989998541873227999999--99999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFM--SSLIWLAC 114 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~--~s~~w~~~ 114 (1828) +--..+.++--|...+||.-.. ....|++. T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~gl 41 (334) T COG1808 10 IEFRNKLLSELELSISKYIFRLIAEIIAWIGL 41 (334) T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999898760144315688999989998743 No 251 >pfam07926 TPR_MLP1_2 TPR/MLP1/MLP2-like protein. The sequences featured in this family are similar to a region of human TPR protein and to yeast myosin-like proteins 1 (MLP1) and 2 (MLP2). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores. TPR is thought to be a component of nuclear pore complex- attached intra-nuclear filaments, and is implicated in nuclear protein import. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerization of kinase domains or by targeting these kinases to the nuclear pore complex. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity. Probab=23.12 E-value=23 Score=9.46 Aligned_cols=23 Identities=22% Similarity=0.416 Sum_probs=7.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999 Q gi|254781029|r 228 EIEVLENNYTKSEMRIDNITQNL 250 (1828) Q Consensus 228 ev~~ler~y~dnE~rir~L~~el 250 (1828) |+..+..-..+.+..++.+.++| T Consensus 11 e~~~~~~~~~~~~~~~~~~~~dl 33 (132) T pfam07926 11 ELKRLLEKKEDAEEKIQKLQEDL 33 (132) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998998999999999988889 No 252 >KOG2265 consensus Probab=23.11 E-value=23 Score=9.45 Aligned_cols=63 Identities=19% Similarity=0.214 Sum_probs=39.8 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 99999975074562067741252599899999985078878997899999999999999825873378999987664399 Q gi|254781029|r 1737 VQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSALRNAIDRYISNFEEMLSEIAQSNDDSPLVQEHIMSNYGKV 1816 (1828) Q Consensus 1737 ~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~~~~~~~~~~~~~~l~s~~g~~ 1816 (1828) .+-|++...||.-+=+..+- -|=+.+-+-|.+||++|+++..|--.-... .=+||-+|. T Consensus 99 ~eWW~~ll~gep~ID~~ki~--------~e~skl~dldeEtra~vekmmfdq~qk~~~-------------~p~sde~~~ 157 (179) T KOG2265 99 MEWWDSLLEGEPEIDTKKIE--------PEESKLSDLDEETRATVEKMMFDQRQKSMG-------------LPTSDELKK 157 (179) T ss_pred HHHHHHHHCCCCCCCCCCCC--------HHHHHHHHCCHHHHHHHHCCCHHHHHHHCC-------------CCCCCHHHH T ss_conf 77888887389877811058--------213124312698997655003557886369-------------999851367 Q ss_pred HHHH Q ss_conf 9999 Q gi|254781029|r 1817 YTML 1820 (1828) Q Consensus 1817 y~~l 1820 (1828) .-|| T Consensus 158 ~d~L 161 (179) T KOG2265 158 HDML 161 (179) T ss_pred HHHH T ss_conf 8899 No 253 >pfam06210 DUF1003 Protein of unknown function (DUF1003). This family consists of several hypothetical bacterial proteins of unknown function. Probab=23.08 E-value=23 Score=9.44 Aligned_cols=50 Identities=18% Similarity=0.318 Sum_probs=23.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH Q ss_conf 2799999999999999999999-83162145557656406488999999999999999 Q gi|254781029|r 99 IALRNAFMSSLIWLACELGFLI-KFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLF 155 (1828) Q Consensus 99 ~~~~~~~~~s~~w~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 155 (1828) .-+++-+++=++|++.-+++.. ..|.| .-|+.+-..+-+ .+++.+|++|+ T Consensus 5 ~Fi~~~~~~~~~Wi~~N~~~~~~~~fDp------yPFilLnl~lS~-~Aa~~aplIlm 55 (108) T pfam06210 5 TFIILFTVVLVLWILLNLGAPPGLAFDP------YPFILLNLVLSL-QAAYQAPLILM 55 (108) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCC------CCHHHHHHHHHH-HHHHHHHHHHH T ss_conf 9999999999999999970323367896------529999999999-99999999999 No 254 >COG4425 Predicted membrane protein [Function unknown] Probab=23.06 E-value=23 Score=9.44 Aligned_cols=96 Identities=7% Similarity=-0.092 Sum_probs=44.2 Q ss_pred HCCCCC--HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 418732--279999999999999999999983162145557656406------488999999999999999999999999 Q gi|254781029|r 93 GKNSLN--IALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMR------PEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 93 ~~~~s~--~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) .--||= .|+|+.++.|++-.+.|-++...+. +|-..-.+-.. -.-++..++++++-+.|.|.++..- T Consensus 64 sLTPSLLPr~~l~qgv~sgf~~A~Gy~~gv~~~---wl~~y~elp~~s~~~~R~~~~~~ai~~~~~a~~fl~qa~~wq-- 138 (588) T COG4425 64 SLTPSLLPRPWLFQGVLSGFSLAAGYGAGVFLH---WLWRYLELPESSPRPPRWAKPAAAIVGAAGAVGFLVQAAVWQ-- 138 (588) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-- T ss_conf 518201685089999999999875667778899---999986188888888512446899999999999999999875-- Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 999999999999999973092555699999999 Q gi|254781029|r 165 ARDMHDASQSIAGIALRLIDPEEYSSEKMQSIS 197 (1828) Q Consensus 165 ~~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~ 197 (1828) +.+-+-|..--++=+||--...=.+++.+ T Consensus 139 ----ntvr~Lmgl~~~~sa~p~k~g~la~l~~~ 167 (588) T COG4425 139 ----NTVRDLMGLEPLESAEPPKAGLLALLVFA 167 (588) T ss_pred ----HHHHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf ----77999857777876564036489999999 No 255 >COG4149 ModC ABC-type molybdate transport system, permease component [Inorganic ion transport and metabolism] Probab=22.92 E-value=24 Score=9.40 Aligned_cols=41 Identities=15% Similarity=0.250 Sum_probs=26.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCH Q ss_conf 22799999999999999999999831621--455576564064 Q gi|254781029|r 98 NIALRNAFMSSLIWLACELGFLIKFYTNS--PMDSFYDFAMRP 138 (1828) Q Consensus 98 ~~~~~~~~~~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 138 (1828) ..-+++|++..++-...|+.++|.+.... .-+-+..++.+| T Consensus 10 ~LSlkvA~ist~~~l~lgi~~a~~Lar~~~~~k~ll~~lv~LP 52 (225) T COG4149 10 LLSLKVALISTLLLLPLGIALAYLLARRRFRGKSLLESLVLLP 52 (225) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCC T ss_conf 9999999999999999999999999725577630999999732 No 256 >KOG0243 consensus Probab=22.91 E-value=24 Score=9.40 Aligned_cols=17 Identities=18% Similarity=0.231 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781029|r 149 VVPILLFFSFFIMISRA 165 (1828) Q Consensus 149 ~~p~~~~~~~a~~~~r~ 165 (1828) |+|=++.-+|-.+-.-+ T Consensus 163 IIPRal~~IFd~Le~~~ 179 (1041) T KOG0243 163 IIPRALRQIFDTLEAQG 179 (1041) T ss_pred CCHHHHHHHHHHHHHCC T ss_conf 34299999999998448 No 257 >pfam12089 DUF3566 Transmembrane domain of unknown function (DUF3566). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 136 to 304 amino acids in length. This region represents a transmembrane region found at the C-terminus of the proteins. Probab=22.91 E-value=24 Score=9.40 Aligned_cols=65 Identities=11% Similarity=0.254 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC----------------HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2279999999999999999999983162145557656406----------------488999999999999999999999 Q gi|254781029|r 98 NIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMR----------------PEVITLFLVVSVVPILLFFSFFIM 161 (1828) Q Consensus 98 ~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~p~~~~~~~a~~ 161 (1828) +..+.+..++.++|+.+..-+...+.+-+-..++.+++.. +..++..+...++-++++-.+|.+ T Consensus 19 Kvsfl~s~a~~iv~vvA~~lLw~vL~~~Gv~~~v~~~~~~l~~~~~~~~l~~~~s~~~V~~~~~lig~i~vvl~TaLa~l 98 (120) T pfam12089 19 KVSFLLSVALGIVWVVAVAVLWVVLDAMGVFDQVNDLVTDLGGSEGGFDLRSVLSFGRVLGFAALVGAINVVLFTALATL 98 (120) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999997647688899999986467776526776169999999999999999999999999 Q ss_pred H Q ss_conf 9 Q gi|254781029|r 162 I 162 (1828) Q Consensus 162 ~ 162 (1828) . T Consensus 99 ~ 99 (120) T pfam12089 99 G 99 (120) T ss_pred H T ss_conf 9 No 258 >COG1295 Rbn Ribonuclease BN family enzyme [Replication, recombination, and repair] Probab=22.89 E-value=24 Score=9.40 Aligned_cols=60 Identities=10% Similarity=0.029 Sum_probs=0.0 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4187322799999999999999999999831621455576564064889999999999999999999999 Q gi|254781029|r 93 GKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 93 ~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) ..+.-+-.++.|.+.+++|..+..++++.. ..+..=...++.+|++++=.++||+.++++ T Consensus 215 ~~~~~~~~~~Ga~~aai~~~i~~~~f~~Yv----------~~~~~y~~~YGalgsvi~lmlw~y~~~~I~ 274 (303) T COG1295 215 RVLKWRDVLPGALLAAILFELGKYLFGYYL----------SNFANYSSTYGALGSVIILLLWLYISALII 274 (303) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 653017878999999999999999999999----------986033602667999999999999999999 No 259 >pfam10164 DUF2367 Uncharacterized conserved protein (DUF2367). This is a highly conserved family of proteins which contains three pairs of cysteine residues within a length of 42 amino acids and is rich in proline residues towards the N-terminus. The function is unknown. Several members are putatively assigned as brain protein i3 but this was not validated. Probab=22.88 E-value=24 Score=9.39 Aligned_cols=26 Identities=12% Similarity=0.297 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88999999999999999999999999 Q gi|254781029|r 139 EVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 139 ~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ..+|++.+.++-|+-++.-|||.-+| T Consensus 65 t~~Gi~cAI~fFP~GilCClamr~kr 90 (98) T pfam10164 65 TCLGIFCAIFFFPLGILCCLALRNRR 90 (98) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 28889999998758999989862065 No 260 >PRK12342 putative electron transfer flavoprotein YdiQ; Provisional Probab=22.85 E-value=10 Score=12.43 Aligned_cols=30 Identities=20% Similarity=0.203 Sum_probs=0.0 Q ss_pred HHHHHHCCHHHHHHHHHHHHH------HHHHHHHHHHH Q ss_conf 999973092555699999999------99999999723 Q gi|254781029|r 177 GIALRLIDPEEYSSEKMQSIS------SAVRKEIVLMT 208 (1828) Q Consensus 177 ~~a~rl~~p~~~a~~~~~~~~------qa~r~ev~a~~ 208 (1828) |.|+||. +.-.+--|+|+| ..+||+..||| T Consensus 42 E~Al~lk--~~gg~v~vvs~G~~~~~~~~~~r~alAmG 77 (254) T PRK12342 42 EAASQLA--TDGDEIAALTVGGSLLQNSKVRKDVLSRG 77 (254) T ss_pred HHHHHHH--HCCCCEEEEEECCCCHHHHHHHHHHHHHC T ss_conf 9999988--61996999995688025599999999737 No 261 >TIGR02415 23BDH acetoin reductases; InterPro: IPR014007 One member of this family, as characterised in Klebsiella terrigena , is able to interconvert acetoin + NADH with meso-2,3-butanediol + NAD(+). It is also capable of irreversible reduction of diacetyl with NADH to acetoin. There has been a reuctance to classify the enzyme as either 1.1.1.4 from EC, which is (R,R)-butanediol dehydrogenase, or 1.1.1.5 from EC, which is acetoin dehydrogenase without a specified stereochemistry . Another member of this family, from Corynebacterium glutamicum (Brevibacterium flavum), is called L-2,3-butanediol dehydrogenase . This enzyme is a homotetramer in the family of short chain dehydrogenases (IPR002198 from INTERPRO). . Probab=22.84 E-value=24 Score=9.38 Aligned_cols=10 Identities=40% Similarity=0.797 Sum_probs=0.0 Q ss_pred HHCCHHHHHH Q ss_conf 7309255569 Q gi|254781029|r 181 RLIDPEEYSS 190 (1828) Q Consensus 181 rl~~p~~~a~ 190 (1828) |..+||++|. T Consensus 222 R~~~PeDVA~ 231 (258) T TIGR02415 222 RLSEPEDVAG 231 (258) T ss_pred CCCCCCHHHH T ss_conf 8999713887 No 262 >pfam11297 DUF3098 Protein of unknown function (DUF3098). This bacterial family of proteins has no known function. Probab=22.83 E-value=24 Score=9.38 Aligned_cols=49 Identities=22% Similarity=0.335 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999998316214555765640648899999999999999999999999 Q gi|254781029|r 110 IWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 110 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) .|++.|+++...-| +--.++.+..|...+ +-..-++|++.+-+|+.+++ T Consensus 9 ~ll~iG~~vI~~Gf----ilMsgggs~dp~if~-f~ri~lAP~lv~~Gf~~~i~ 57 (64) T pfam11297 9 ILMAIGIAVIALGF----ILMSGEGSVDPDIFG-FRRLRVAPILVLGGFAIIIY 57 (64) T ss_pred HHHHHHHHHHHHHH----HHEECCCCCCCCCCC-EEEEEHHHHHHHHHHHHHHH T ss_conf 99999999999999----944468988976435-45664147999999999967 No 263 >pfam08510 PIG-P PIG-P. PIG-P (phosphatidylinositol N-acetylglucosaminyltransferase subunit P) is an enzyme involved in GPI anchor biosynthesis. Probab=22.83 E-value=24 Score=9.38 Aligned_cols=66 Identities=9% Similarity=0.048 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 73227999999999999999999998316214555765640648899999999999999999999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) |+|..|+++.-++ .|++.++.+.|++.|...+.+ -++.--|.--=.++.-+.+.+.++|++..+.. T Consensus 1 ~~~e~yGF~~~i~-s~~~~~lyl~Wa~~P~~~L~~-~gi~y~P~kyWAlaiP~~ll~~~~~~~~~y~~ 66 (123) T pfam08510 1 PTREYYGFVLYIL-STLLFVLYLLWAFLPDPVLHS-LGITYYPSRYWALAIPIYLLMAMLYAYVGYAL 66 (123) T ss_pred CCCHHHHHHHHHH-HHHHHHHHHHHHHCCHHHHHH-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9502676999999-999999999999577999987-07753661589999999999999999999999 No 264 >TIGR01912 TatC-Arch Sec-independent protein translocase TatC; InterPro: IPR011532 This entry represents the TatC translocase component of the Sec-independent protein translocation system in archaeal species. This system is responsible for translocation of folded proteins, often with bound cofactors across the periplasmic membrane . . Probab=22.76 E-value=24 Score=9.36 Aligned_cols=25 Identities=20% Similarity=0.789 Sum_probs=0.0 Q ss_pred HCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 064889999999999999999999999 Q gi|254781029|r 136 MRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 136 ~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) ..|+.+..+..+ +|.+++|.+++++ T Consensus 225 ~~~dvvs~~lla--~P~~~ly~l~~~~ 249 (255) T TIGR01912 225 DAPDVVSMILLA--IPLILLYELALVV 249 (255) T ss_pred CCCCHHHHHHHH--HHHHHHHHHHEEE T ss_conf 988478999999--9999997524132 No 265 >PRK12837 3-ketosteroid-delta-1-dehydrogenase; Provisional Probab=22.75 E-value=24 Score=9.36 Aligned_cols=22 Identities=18% Similarity=0.353 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCH-HHHHHHHHH Q ss_conf 8999999850788-789978999 Q gi|254781029|r 1763 VFLNLQEQYKADS-ALRNAIDRY 1784 (1828) Q Consensus 1763 ~~~~i~~~y~~d~-~f~~~v~ry 1784 (1828) |.+|++.++.-|+ .|..||+|| T Consensus 379 TleeLA~~~gi~~~~L~~Tv~~y 401 (515) T PRK12837 379 TLEELAAKIGVPADALTATVARF 401 (515) T ss_pred CHHHHHHHHCCCHHHHHHHHHHH T ss_conf 89999998695999999999999 No 266 >pfam05758 Ycf1 Ycf1. The chloroplast genomes of most higher plants contain two giant open reading frames designated ycf1 and ycf2. Although the function of Ycf1 is unknown, it is known to be an essential gene. Probab=22.68 E-value=24 Score=9.34 Aligned_cols=63 Identities=24% Similarity=0.311 Sum_probs=0.0 Q ss_pred HHHHHHHCCCCCHHHHHHHHH-----------HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH Q ss_conf 899985418732279999999-----------999999999999983162145557656406488999999999999999 Q gi|254781029|r 87 HILQNIGKNSLNIALRNAFMS-----------SLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLF 155 (1828) Q Consensus 87 ~~~~~l~~~~s~~~~~~~~~~-----------s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 155 (1828) ..|-+|--+||-....=|.++ ++.=+|=.+-+...+|+| +--..+.|-.|-++| +|.+|| T Consensus 10 GFLTTfsIGPSylfllRa~vmeegtek~vsAttGfItGQL~mfiSIYYaP-----LhlaL~rPHtITvL~----lPYllf 80 (832) T pfam05758 10 GFLTTFSIGPSYLFLLRARVLEEGTEKKVSATTGFITGQLIMFISIYYAP-----LYLALGRPHTITVLA----LPYLLF 80 (832) T ss_pred HHHHHHCCCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHHHHH-----HHHHHCCCHHHHHHH----HHHHHH T ss_conf 79887265458999999998625776626887307888999999998899-----999974553308999----999999 Q ss_pred HHH Q ss_conf 999 Q gi|254781029|r 156 FSF 158 (1828) Q Consensus 156 ~~~ 158 (1828) ++| T Consensus 81 ~ff 83 (832) T pfam05758 81 HFF 83 (832) T ss_pred HHH T ss_conf 997 No 267 >PRK12844 3-ketosteroid-delta-1-dehydrogenase; Reviewed Probab=22.67 E-value=24 Score=9.33 Aligned_cols=22 Identities=18% Similarity=0.397 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCH-HHHHHHHHH Q ss_conf 8999999850788-789978999 Q gi|254781029|r 1763 VFLNLQEQYKADS-ALRNAIDRY 1784 (1828) Q Consensus 1763 ~~~~i~~~y~~d~-~f~~~v~ry 1784 (1828) |.+|++.+..-|+ .|..||+|| T Consensus 410 TleELA~~~gid~~~L~~Tv~~y 432 (552) T PRK12844 410 TIEELAGKCGIDPAGLAATVERF 432 (552) T ss_pred CHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999988596999999999999 No 268 >pfam07461 NADase_NGA Nicotine adenine dinucleotide glycohydrolase (NADase). This family consists of several bacterial nicotine adenine dinucleotide glycohydrolase (NGA) proteins which appear to be specific to Streptococcus pyogenes. NAD glycohydrolase (NADase) is a potential virulence factor. Streptococcal NADase may contribute to virulence by its ability to cleave beta-NAD at the ribose-nicotinamide bond, depleting intracellular NAD pools and producing the potent vasoactive compound nicotinamide. Probab=22.66 E-value=24 Score=9.33 Aligned_cols=29 Identities=31% Similarity=0.721 Sum_probs=0.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCH---HHHHHHHHH Q ss_conf 45620677412525998999999850788---789978999 Q gi|254781029|r 1747 EDDIFSKRLYTIKGQKVFLNLQEQYKADS---ALRNAIDRY 1784 (1828) Q Consensus 1747 ~~~~f~~~~~~~~~~~~~~~i~~~y~~d~---~f~~~v~ry 1784 (1828) ||++|-+. |.||+.|+-+|. ||-.+.|.| T Consensus 196 erdifekk---------fkeikdkWVs~kQADEFiET~D~Y 227 (446) T pfam07461 196 ERDIFEKK---------FKEIKDKWVTDKQADEFIETADKY 227 (446) T ss_pred HHHHHHHH---------HHHHHHHHEEHHHHHHHHHHHHHH T ss_conf 47888988---------887665401055578899888899 No 269 >PRK09174 F0F1 ATP synthase subunit B'; Validated Probab=22.58 E-value=24 Score=9.31 Aligned_cols=108 Identities=10% Similarity=0.147 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 242 RIDNITQNLKQEREAIINHGTQLCTSIAEVHESLKEELSLTSEEISVHLSRAIDSFQSIVDVRIAKVTEKTTRIVQESAQ 321 (1828) Q Consensus 242 rir~L~~el~~ere~i~~~ae~~r~~i~~~~e~l~~el~~~s~~i~~~~~~~~e~~~~~le~r~~~l~~~~~~~~~~~~~ 321 (1828) ||...+++=..--..=+..+++++..-..........|.....+-..-+....+.....++.+.+.+...+......--. T Consensus 77 rI~~vLe~R~~~I~~DLe~Ae~lk~EAe~~~a~Ye~~LaeAR~eA~~Ii~~Ar~~a~~~~e~~~a~~ea~L~~kia~AE~ 156 (204) T PRK09174 77 RIGGIIETRRDRIAQDLDQAARLKQEADAAVAAYEQELAQARSKAASIAQAAREAAKAKAEAERAEIEASLEKKLKEAEE 156 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 37779999999998799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999 Q gi|254781029|r 322 TISSKIDQLLEVLHSTSIVITKDFDNRI 349 (1828) Q Consensus 322 ~l~~~~~~l~~~L~~~~~~~~~~~~~~~ 349 (1828) .+....+.-+..+.....++..++-..+ T Consensus 157 ~I~~~r~~Al~~V~~IA~d~A~~iV~~L 184 (204) T PRK09174 157 RIAAIKAKAMADVGSIAEETAAAIVEQL 184 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999879999999999999999998 No 270 >TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation . Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase. . Probab=22.55 E-value=24 Score=9.30 Aligned_cols=18 Identities=17% Similarity=0.407 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254781029|r 145 LVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~~ 162 (1828) ..++++||.+|.++.-++ T Consensus 3 ~L~~L~Pisl~lG~~GL~ 20 (53) T TIGR00847 3 ILTILIPISLLLGGVGLV 20 (53) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 257788999999999999 No 271 >TIGR00903 2A0129 major facilitator 4 family protein; InterPro: IPR005277 The MFS is a very old, large and diverse superfamily that includes several hundred sequenced members. They catalyze uniport, solute:cation (H+ or Na+) symport and/or solute:H+ or solute:solute antiport. Most are of 400-600 amino acyl residues in length and possess either 12 or 14 putative transmembrane a-helical spanners. They exhibit specificity for sugars, polyols, drugs, neurotransmitters, Krebs cycle metabolites, phosphorylated glycolytic intermediates, amino acids, peptides, osmolites, nucleosides, organic anions, inorganic anions, etc. They are found ubiquitously in all three kingdoms of living organisms. The generalized transport reactions catalyzed by MFS porters are: (1)Uniport: S (out) S (in). (2)Symport: S (out) + [H+ or Na+] (out) ---> S (in) + [H+ or Na+] (in). (3)Antiport: S1 (out) + S2 (in) S1 (in) + S2 (out), (S1 may be H+ or a solute) This family of proteins are uncharacterised proteins from archaea, which may be major facilitators. This family includes proteins from Archaeoglobus fulgidus and Aeropyrum pernix. . Probab=22.53 E-value=24 Score=9.30 Aligned_cols=42 Identities=17% Similarity=0.395 Sum_probs=0.0 Q ss_pred HHHHHHH-HHHHHHC-----------CHHHHHHHHHHHHCHHHHHHHHHHHHHH Q ss_conf 9999999-9999831-----------6214555765640648899999999999 Q gi|254781029|r 110 IWLACEL-GFLIKFY-----------TNSPMDSFYDFAMRPEVITLFLVVSVVP 151 (1828) Q Consensus 110 ~w~~~~~-~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~p 151 (1828) +|+.|.+ |+..+.| .+.+++.++|-+..-..+.+.+|++|+| T Consensus 194 LW~~G~ILGlGVA~FDNl~~WL~~AL~~~GL~~~AG~~VALA~~~GL~GVAVIP 247 (368) T TIGR00903 194 LWLLGLILGLGVALFDNLAIWLEAALRSKGLEDVAGDAVALALLLGLVGVAVIP 247 (368) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCEEECC T ss_conf 898989997689998789999876207146112102499999998240131448 No 272 >PRK12835 3-ketosteroid-delta-1-dehydrogenase; Reviewed Probab=22.49 E-value=24 Score=9.29 Aligned_cols=22 Identities=18% Similarity=0.379 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCH-HHHHHHHHH Q ss_conf 8999999850788-789978999 Q gi|254781029|r 1763 VFLNLQEQYKADS-ALRNAIDRY 1784 (1828) Q Consensus 1763 ~~~~i~~~y~~d~-~f~~~v~ry 1784 (1828) |.+|++.+..-|+ .|+.||+|| T Consensus 431 TleELA~~~gvd~~~L~~TV~rY 453 (584) T PRK12835 431 TWDELAAKIGVPAENLRATAERF 453 (584) T ss_pred CHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999988298989999999999 No 273 >PRK09414 glutamate dehydrogenase; Provisional Probab=22.46 E-value=25 Score=9.28 Aligned_cols=34 Identities=6% Similarity=0.232 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9989999998507887899789999999999999 Q gi|254781029|r 1761 QKVFLNLQEQYKADSALRNAIDRYISNFEEMLSE 1794 (1828) Q Consensus 1761 ~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~ 1794 (1828) +..++.|.+|+-..|||.++|.-.++..+-++.. T Consensus 5 ~~~~~~~~~~~p~~~ef~qa~~ev~~sl~p~~~~ 38 (446) T PRK09414 5 ESFLEQVKKRNPGQPEFHQAVREVLESLWPVLEK 38 (446) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999999864889816999999999988999865 No 274 >TIGR02908 CoxD_Bacillus cytochrome c oxidase, subunit IV; InterPro: IPR014257 This entry represents a small clade of cytochrome oxidase subunit IV sequences found in the Bacillaceae.. Probab=22.42 E-value=25 Score=9.27 Aligned_cols=24 Identities=17% Similarity=0.456 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 48899999999999999999999999 Q gi|254781029|r 138 PEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 138 ~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) |..+ ++ ||++|-.++.|.|...+| T Consensus 86 Pa~f-iY-gGvfV~~l~vlaFtti~W 109 (110) T TIGR02908 86 PALF-IY-GGVFVALLVVLAFTTIVW 109 (110) T ss_pred HHHH-HH-HHHHHHHHHHHHHHHHCC T ss_conf 0799-99-999999999988720004 No 275 >KOG0720 consensus Probab=22.40 E-value=8.1 Score=13.35 Aligned_cols=85 Identities=4% Similarity=0.006 Sum_probs=0.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 18732279999999999999999999983162145557656406488999999999999999999999999999999999 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) +.++-....+.+.+|+.|.+.-.+....-+.-.......+....+--++ ..|..+.-+.+||.+..+-|+ .+- T Consensus 62 ~~a~~~s~~~s~~~s~~s~~ql~~~~~~~~a~~~~~~~~g~~~~~~~l~-~~g~~~l~l~~~w~~~~~~~~------~~~ 134 (490) T KOG0720 62 GTAPFFSIMWSTLVSANSMGQLTKFILIMVATVSVALYIGRVVGSVTLA-LFGLLLLWLYSFWGTVLFSFN------LAF 134 (490) T ss_pred CCCCHHHEEEEEEEECCCCCCCCCCCCCHHHHHHHHEECCCCCCCEEEC-CCCHHHHHHHHHHCCHHHHHH------HHH T ss_conf 7761332012101211355431210101030110001036567612320-120889999875133256687------887 Q ss_pred HHHHHHHHHCCH Q ss_conf 999999973092 Q gi|254781029|r 174 SIAGIALRLIDP 185 (1828) Q Consensus 174 ~m~~~a~rl~~p 185 (1828) +|...-++.-.| T Consensus 135 ~~~~~~~~~~~~ 146 (490) T KOG0720 135 LSKDELITVYSV 146 (490) T ss_pred HHHHHEECCCCC T ss_conf 631101212453 No 276 >TIGR00955 3a01204 Pigment precourser permease; InterPro: IPR005284 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family includes different parts of a membrane-spanning permease system necessary for the transport of pigment precursor into pigment cells responsible for eye color. White protein dimerises with brown protein for the transport of guanine and with scarlet protein for the transport of tryptophan.; GO: 0006810 transport. Probab=22.40 E-value=25 Score=9.26 Aligned_cols=58 Identities=17% Similarity=0.132 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 799999999999999999999831621--4555765640648899999999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFYTNS--PMDSFYDFAMRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) .||....-|.+|--+..++...+-... +++-+..-+-.-.-+|+.++..++=++|.|+ T Consensus 484 ~Ywm~GL~~~~~~f~~~~~~~~l~~~vA~S~G~~~s~~~~~~~lA~~~~p~~~~pfL~Fg 543 (671) T TIGR00955 484 VYWMIGLRSGATHFLTFLFLVTLVANVATSFGYFISCAFESTSLALTVGPPFVIPFLLFG 543 (671) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC T ss_conf 987515640289999999999999999999999999774452134355789999999835 No 277 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=22.36 E-value=25 Score=9.25 Aligned_cols=11 Identities=55% Similarity=0.936 Sum_probs=0.0 Q ss_pred HHHHHHHHHHH Q ss_conf 89999999999 Q gi|254781029|r 1781 IDRYISNFEEM 1791 (1828) Q Consensus 1781 v~ry~~~fe~~ 1791 (1828) ||+||.+|.+. T Consensus 70 pd~~~~~Fa~a 80 (216) T TIGR01163 70 PDRYIEDFAEA 80 (216) T ss_pred HHHHHHHHHHH T ss_conf 57778899970 No 278 >PRK11922 RNA polymerase sigma factor; Provisional Probab=22.25 E-value=25 Score=9.22 Aligned_cols=41 Identities=15% Similarity=-0.014 Sum_probs=0.0 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 44677867777788421579999987180138999999750 Q gi|254781029|r 1705 SSSHIDISDKDSLSSIDSLVENISKFIDYDAFVQLWKSYTL 1745 (1828) Q Consensus 1705 ~~~~~~~~~~~~~~~l~s~s~di~~~~~~~~~~~~w~~y~~ 1745 (1828) ...|..+.|.+..+.-|.-=+.=++.=|.+++..+++||+. T Consensus 1 ~~~~~~~~p~~~~~~~D~eLi~~~~~GD~~Af~~L~~ry~~ 41 (231) T PRK11922 1 MTTPVASRPTPLSAADDAELVARVLAGDEAAFEQLMRRHNR 41 (231) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99999999999999759999999995659999999999999 No 279 >PRK01024 Na(+)-translocating NADH-quinone reductase subunit B; Provisional Probab=22.24 E-value=25 Score=9.22 Aligned_cols=52 Identities=10% Similarity=0.048 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH---H Q ss_conf 732279999999999999999999983162145557656406488999999999999999---9 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLF---F 156 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~---~ 156 (1828) |...|+|...+. +.-++.++...||+ ++.-+.+|+++|=++..+-=|.-|. | T Consensus 163 Pp~~PlW~valG----~~FaIVigKevFGG-----lG~N~fNPALvgRAfL~~SfP~~MTgdvW 217 (503) T PRK01024 163 PPTIPYWMAALG----IAFGVVVGKELFGG-----TGMNILNPALSGRAFLFFTFPAKMSGDVW 217 (503) T ss_pred CCCCCHHHHHHH----HHHHHHHHHHHCCC-----CCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 898709999999----99999988762375-----68687788999999998753675256633 No 280 >pfam06965 Na_H_antiport_1 Na+/H+ antiporter 1. This family contains a number of bacterial Na+/H+ antiporter 1 proteins. These are integral membrane proteins that catalyse the exchange of H+ for Na+ in a manner that is highly dependent on the pH. Probab=22.23 E-value=25 Score=9.22 Aligned_cols=54 Identities=19% Similarity=0.201 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999831621--45557656406488999999999999999999999999 Q gi|254781029|r 111 WLACELGFLIKFYTNS--PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 111 w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) |+..++-=.++++.-+ .-+..-+.+.+|-.+|+++|.+++-++-+|+|+.+..| T Consensus 253 ~v~~~IlPlFAfaNAGV~l~~~~~~~~~~pv~lGI~lGL~vGK~lGI~~~s~lavk 308 (374) T pfam06965 253 WVAFAILPLFAFANAGVSLAGMTLDDLTSPLTLGIALGLVVGKPIGIFGFSWLAVK 308 (374) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHCHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 37989999999850686546775353117699999999980643889999999999 No 281 >TIGR01713 typeII_sec_gspC general secretion pathway protein C; InterPro: IPR001639 The general (type II) secretion pathway (GSP) within Gram-negative bacteria is a signal sequence-dependent process responsible for protein export , , . The process has two stages: exoproteins are first translocated across the inner membrane by the general signal-dependent export pathway (GEP), and then across the outer membrane by a species-specific accessory mechanism. A number of molecules are involved in the GSP; one of these is known as the 'C' protein, the most probable location of which is the inner membrane . This suggests that protein C is part of the GEP apparatus, aiding trans-location of exoproteins from the cytoplasm to the periplasm, prior to transport across the outer membrane. The size of the 'C' protein is around 270 to 300 amino acids. It apparently contains a single transmembrane domain located in the N-terminal section. The short N-terminal domain is predicted to be cytoplasmic and the large C-terminal domain periplasmic. The gene encoding the 'C' protein has been sequenced in a variety of bacteria such as Aeromonas (exeC); Erwinia (outC); Escherichia coli (yheE or gspC); Klebsiella pneumoniae (pulC); or Vibrio cholerae (epsC).; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=22.20 E-value=25 Score=9.21 Aligned_cols=20 Identities=0% Similarity=0.049 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781029|r 145 LVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 145 ~~~~~~p~~~~~~~a~~~~r 164 (1828) .++.++=|++-|.+++|+|+ T Consensus 2 ~l~~~L~~f~~~~l~~~~w~ 21 (281) T TIGR01713 2 ILTFLLLVFISQQLGKILWN 21 (281) T ss_pred HHHHHHHHHHHHHHHHEEEE T ss_conf 47889999999987431243 No 282 >KOG3738 consensus Probab=22.18 E-value=25 Score=9.20 Aligned_cols=31 Identities=16% Similarity=0.216 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 99999999999999999998316214555765 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYD 133 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~ 133 (1828) ..-+++.++|+ .|+++.+..-+.+.+...++ T Consensus 7 ~vl~v~~vlWi-~gl~f~~~q~~~s~l~~~~~ 37 (559) T KOG3738 7 TVLTVLAVLWI-LGLAFYYVQSTSSSLAPPGG 37 (559) T ss_pred HHHHHHHHHHH-HHHHHEEEECCCCCCCCCCC T ss_conf 78999999999-87643168467623278999 No 283 >COG2311 Predicted membrane protein [Function unknown] Probab=22.18 E-value=25 Score=9.20 Aligned_cols=96 Identities=9% Similarity=-0.096 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH------------------HHHHHCCHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 898999854187322799999999999999999------------------99983162145557656406488999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELG------------------FLIKFYTNSPMDSFYDFAMRPEVITLFLV 146 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (1828) +|-++-.+-++.-++..++++.+.++|++.++. ..+.+....-+.........-..-...-. T Consensus 129 ~g~ill~~~~~~~k~l~~~~~~l~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 208 (394) T COG2311 129 TGLILLLFRRRKPKTLLIWATALLLLPVLLGVLLLLVLESGSEASAIYAEQLLHVYGSGEYLEIRTQRLHDQLGALGANI 208 (394) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99999999860621899999999999999999999865137531777999999874558999999712520226899989 Q ss_pred HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999---9999999999999999999 Q gi|254781029|r 147 VSVVPILLFFSFFIM---ISRARDMHDASQSIAGIAL 180 (1828) Q Consensus 147 ~~~~p~~~~~~~a~~---~~r~~~~~~~a~~m~~~a~ 180 (1828) .-..+++++|+.|++ .-+...-+--.|-++.+++ T Consensus 209 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~r~~~~~~~ 245 (394) T COG2311 209 GWRLAGLMLFGAALRRTGLFLGGASRAKYRRLVFFAI 245 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH T ss_conf 9999999999999977766212176655788888898 No 284 >COG1377 FlhB Flagellar biosynthesis pathway, component FlhB [Cell motility and secretion / Intracellular trafficking and secretion] Probab=22.14 E-value=25 Score=9.19 Aligned_cols=77 Identities=13% Similarity=0.152 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999831621455576564064889999999999999999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAG 177 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~ 177 (1828) ..++.+++.+-+-...+....+.+..+..++..+...-..+.+..+.++++|.+|=++..-..-..+|||.=+-+-+ T Consensus 153 i~~v~~v~~~~l~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~liia~~D~~~qr~~~~k~lkMtKqEVKd 229 (363) T COG1377 153 IVLVGLVAYFVLKNHLSELLSLAGQSPVAALSIFMELLGKLLLAVLLLLLIVAAFDYFYQRFQYIKKLKMTKQEVKD 229 (363) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 99999999999994199999770489899999999999999999999999999999999999999966686899999 No 285 >KOG0015 consensus Probab=22.12 E-value=12 Score=11.97 Aligned_cols=8 Identities=63% Similarity=0.920 Sum_probs=0.0 Q ss_pred HHHHHHHH Q ss_conf 66439999 Q gi|254781029|r 1811 SNYGKVYT 1818 (1828) Q Consensus 1811 s~~g~~y~ 1818 (1828) |+||.||| T Consensus 110 SEtGhVyT 117 (338) T KOG0015 110 SETGHVYT 117 (338) T ss_pred ECCCCEEE T ss_conf 44760587 No 286 >KOG1010 consensus Probab=22.08 E-value=25 Score=9.17 Aligned_cols=23 Identities=13% Similarity=0.313 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 89999998507887899789999 Q gi|254781029|r 1763 VFLNLQEQYKADSALRNAIDRYI 1785 (1828) Q Consensus 1763 ~~~~i~~~y~~d~~f~~~v~ry~ 1785 (1828) +|.+|-+-||+-|.++..|-|-| T Consensus 741 tF~eIm~~YR~QPqa~~~vyRsV 763 (920) T KOG1010 741 TFSEIMRAYRRQPQAVSLVYRSV 763 (920) T ss_pred HHHHHHHHHHCCCHHHHHHHHHE T ss_conf 19999999851723444455620 No 287 >pfam07798 DUF1640 Protein of unknown function (DUF1640). This family consists of sequences derived from hypothetical eukaryotic proteins. A region approximately 100 residues in length is featured. Probab=21.92 E-value=26 Score=9.13 Aligned_cols=160 Identities=13% Similarity=0.133 Sum_probs=0.0 Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 97309255569999999999999999723689999999999999999999999999999999999999999999999999 Q gi|254781029|r 180 LRLIDPEEYSSEKMQSISSAVRKEIVLMTEEIDRAISRASELEKTVRSEIEVLENNYTKSEMRIDNITQNLKQEREAIIN 259 (1828) Q Consensus 180 ~rl~~p~~~a~~~~~~~~qa~r~ev~a~~~~~er~~~ra~ele~lv~~ev~~ler~y~dnE~rir~L~~el~~ere~i~~ 259 (1828) .+.-+..++..++.-++--+++.=+....+.+.+.++-=.++|....--=+++-.--+|....=+.-...|.++++.+.. T Consensus 8 Vk~Le~~Gft~~QAe~i~~~l~~~l~~~l~~~~~~lvsk~~~e~~~y~~~~~~selr~el~~~~~~e~~~lr~e~e~L~~ 87 (177) T pfam07798 8 VQLLEENGFTEEQAEAIVSALTEVINDSIENVYQDMVTKEQQEKTEYQQKVDFAKLKSELQSLEKSEFSNLQAENEKLKN 87 (177) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998879999999999999999999899999998711999999999999999999999998777589999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 260 HGTQLCTSIAEVHESLKEELSLTSEEISVHLSRAIDSFQSIVDVRIAKVTEKTTRIVQESAQTISSKIDQLLEVLHSTSI 339 (1828) Q Consensus 260 ~ae~~r~~i~~~~e~l~~el~~~s~~i~~~~~~~~e~~~~~le~r~~~l~~~~~~~~~~~~~~l~~~~~~l~~~L~~~~~ 339 (1828) ..+.++..+......++.++..--..=.+++.+.....-..+.+-...|...+......+...-..-+.-+....-.... T Consensus 88 dve~L~~~L~~ei~k~~a~~klDlNleK~riree~~~~e~ki~e~~~kId~ei~~lrt~iEs~K~~~ir~l~g~~~~~~a 167 (177) T pfam07798 88 DLEKLKNRLRDEIDKVTAGVRLDLNLEKGRIREELAEQELKIKETDTKIDTEIAGLKTQLESVKLDTIKYLIGVVTGCLA 167 (177) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999861525765455520778999879999999998999999999999999999999999999999999 No 288 >COG1298 FlhA Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion] Probab=21.89 E-value=26 Score=9.12 Aligned_cols=86 Identities=21% Similarity=0.264 Sum_probs=0.0 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHC--CHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999854187322799999999999-999999999831--62145557656406488999999999999999999999999 Q gi|254781029|r 88 ILQNIGKNSLNIALRNAFMSSLIW-LACELGFLIKFY--TNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 88 ~~~~l~~~~s~~~~~~~~~~s~~w-~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) +.+.|+-.+=++.+.++|++-+-- ++-.=++...-+ ++.-|.+++.|+.. |=|+.++|+=++||.+=.+++-. T Consensus 64 i~~pldfs~FPTlLL~~TL~RL~LNvASTR~IL~~G~~aaG~vI~aFG~fvvg----Gn~viG~iVFlILiivnFiVITK 139 (696) T COG1298 64 IQRPLDFSSFPTLLLITTLLRLGLNVASTRVILSHGHEAAGKVIEAFGNFVVG----GNFVIGLIVFLILIIVNFIVITK 139 (696) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCEEEC----CCEEHHHHHHHHHHHHHHEEECC T ss_conf 18833122266899999999987748878889856864552899974021564----84316999999998864069826 Q ss_pred HHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999730 Q gi|254781029|r 165 ARDMHDASQSIAGIALRLI 183 (1828) Q Consensus 165 ~~~~~~~a~~m~~~a~rl~ 183 (1828) | |-..+||+=||+ T Consensus 140 G------A~RIAEV~ARFt 152 (696) T COG1298 140 G------AERIAEVAARFT 152 (696) T ss_pred C------CHHHHHHHHHHE T ss_conf 8------317666655310 No 289 >COG4171 SapC ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms] Probab=21.88 E-value=26 Score=9.12 Aligned_cols=66 Identities=12% Similarity=-0.027 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCH-HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 322799999999999999999999831621455576564064-889999999999999999999999 Q gi|254781029|r 97 LNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRP-EVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) +.....++++.-+.-++.-+|+.+++-..-.=-.+.+|...| .++|++++++..|-++--+||.+. T Consensus 101 ~allvt~~a~l~g~~lGi~AG~t~gl~s~~lnHilDt~lSiPsLLlAiivvaf~gpsl~namfA~~L 167 (296) T COG4171 101 GALLVTLAATICGGVLGIFAGATHGLRSAVLNHILDTLLSIPSLLLAIIVVAFAGPSLSNAMFAVWL 167 (296) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 3999999999999999999977777899999999999973589999999999608328889999999 No 290 >PRK09824 beta-glucoside-specific PTS system components IIABC; Provisional Probab=21.88 E-value=26 Score=9.12 Aligned_cols=63 Identities=14% Similarity=-0.028 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHH---------HHHHHHHHHHHHHHHHHHHH Q ss_conf 7999999999999999999998316214555765640648899---------99999999999999999999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVIT---------LFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~p~~~~~~~a~~~ 162 (1828) ||.-.++++.+..+.|++++-.+....-...+.++++.|..+. .++.++++.+++-|++..+. T Consensus 378 ~~kkPfi~a~IGgaiGG~~~g~~~v~~~a~g~~gi~~~~~~i~~~~~~~~~~~~~i~~~ia~~~~fv~t~~~ 449 (625) T PRK09824 378 PRKYPFVIACISGALGATIIGYAQTKVYSFGLPSIFTFMQTIPSTGIDFTVWASVIGGVIAIGCAFVGTVML 449 (625) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHEEEEEECCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 455287999998999999999852012454002465556655876644118999999999999999999987 No 291 >COG1272 Predicted membrane protein, hemolysin III homolog [General function prediction only] Probab=21.88 E-value=26 Score=9.12 Aligned_cols=60 Identities=12% Similarity=0.233 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999998316214555765640648899999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) +.-++++++|..+.+|+....+.+..-..+..++-. .+|-.....+-|++....+..+.| T Consensus 116 ~~~~~~~iiW~lal~Gi~~kl~~~~~~r~ls~~~yl--~mGw~~v~~~~~l~~~l~~~~~~~ 175 (226) T COG1272 116 LGWILLGLIWGLALAGILFKLFFKKRFRKLSLVLYL--AMGWLGLIVIKPLIAKLGLIGLVL 175 (226) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHH--HHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 478999999999999874523125767614201569--999999999999998686189999 No 292 >pfam05568 ASFV_J13L African swine fever virus J13L protein. This family consists of several African swine fever virus J13L proteins. Probab=21.87 E-value=26 Score=9.12 Aligned_cols=18 Identities=28% Similarity=0.718 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999999999 Q gi|254781029|r 140 VITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 140 ~~~~~~~~~~~p~~~~~~~ 158 (1828) +|++ ++.+|+-|+|+|.| T Consensus 35 liai-vvliiiiivliylf 52 (189) T pfam05568 35 LIAI-VVLIIIIIVLIYLF 52 (189) T ss_pred HHHH-HHHHHHHHHHHHHH T ss_conf 9999-99999999999997 No 293 >TIGR00892 2A0113 monocarboxylate transporter; InterPro: IPR004743 Proton-linked monocarboxylate transporters catalyse the rapid transport across the plasma membrane of many monocarboxylates such as lactate, pyruvate, branched-chain oxo acids derived from leucine, valine and isoleucine, and the ketone bodies acetoacetate, beta-hydroxybutyrate and acetate.; GO: 0015355 secondary active monocarboxylate transmembrane transporter activity, 0015711 organic anion transport, 0016021 integral to membrane. Probab=21.86 E-value=26 Score=9.11 Aligned_cols=60 Identities=20% Similarity=0.305 Sum_probs=0.0 Q ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH--CHHHHHHHHHHHHHHHHH Q ss_conf 98989998541873227999999999999999999998316214555765640--648899999999999999 Q gi|254781029|r 84 GITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAM--RPEVITLFLVVSVVPILL 154 (1828) Q Consensus 84 ~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~p~~~ 154 (1828) +.|+++.|.-.+=+-. |.++..+ +|+|+++.+.| .+.-++.-|- .|..=|++.++ .||.| T Consensus 102 s~GmilASFs~~~~eL-YlT~G~i------tGLGlAfnlqP--sLt~lG~YF~rrRPLANGLA~aG--sPvFL 163 (475) T TIGR00892 102 SLGMILASFSSNVLEL-YLTAGFI------TGLGLAFNLQP--SLTMLGKYFYRRRPLANGLAMAG--SPVFL 163 (475) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHH------HHHHHHHHHHH--HHHHHCCCCCCCCHHHHHHHHCC--CCHHH T ss_conf 6689999986479999-9998888------87889873204--68773232454453578888617--62378 No 294 >PRK09776 putative sensor protein; Provisional Probab=21.79 E-value=26 Score=9.10 Aligned_cols=79 Identities=19% Similarity=0.189 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999831621455576564064889999999999999999999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIAL 180 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~ 180 (1828) .|++.+..++=...|+.+++...+.......-..--....||+++ ++|++|.|. ....+|..-.+.....|.=+++ T Consensus 127 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~~~d~iG~l~---~~Pl~L~~~-~~~~~~~~~~~~~~~~~~~l~~ 202 (1116) T PRK09776 127 LRLALGSAIVPPLLGGALVVLLTPGDDFLRAWLIWVLSEAIGMLA---LVPLGLLFK-PHYLLRHRNPRLLFESLLTLAI 202 (1116) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH---HHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHH T ss_conf 999999999988999999999746742788999999998999999---874878405-4234323261337999999999 Q ss_pred HHC Q ss_conf 730 Q gi|254781029|r 181 RLI 183 (1828) Q Consensus 181 rl~ 183 (1828) =|+ T Consensus 203 ~l~ 205 (1116) T PRK09776 203 TLT 205 (1116) T ss_pred HHH T ss_conf 999 No 295 >KOG0559 consensus Probab=21.79 E-value=26 Score=9.09 Aligned_cols=75 Identities=4% Similarity=-0.023 Sum_probs=0.0 Q ss_pred CCHHHHHHHCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4323005623012015767666663320012368887663102346788886555567788888788877556779 Q gi|254781029|r 9 ATYRDSLKEFEEQSTSDSHENKKRHHSRKKHKYSRRDDSIESKDRQQTTLSKNFKQASNKQTYSGFYPANDLDNGG 84 (1828) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pa~~d~r~~ 84 (1828) +|.++. +..|-+.....+.++.++..+...++....++..+.|+..+.+.|..+.+.-....+.-.|+.|-.|.+ T Consensus 151 Apa~~~-~~apa~~~pk~~~a~~a~p~~~s~~~p~~~apv~e~p~~p~~~~P~~~~a~k~~v~~~~~~p~~~~~~~ 225 (457) T KOG0559 151 APAKGG-ASAPAKAEPKTAPAAAAPPKPSSKPPPKEAAPVAESPPAPSSPEPVPASAKKPSVAQPKPPPSEGATPS 225 (457) T ss_pred CCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 884234-568776688888888899975678985335887889999999999986545755568999931146877 No 296 >pfam04290 DctQ Tripartite ATP-independent periplasmic transporters, DctQ component. The function of the members of this family is unknown, but DctQ homologues are invariably found in the tripartite ATP-independent periplasmic transporters. Probab=21.78 E-value=26 Score=9.09 Aligned_cols=69 Identities=12% Similarity=-0.025 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 732279999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) |.+.-.|...+..++.+..++.+++..+.........+-...+..+-....-+++|+.++...-..+.+ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~t~~l~ip~~~~~~~i~i~~~l~~l~~i~~ 126 (133) T pfam04290 58 PPRVRRVLDVLAALLALAFAAVLAWAGWDLVLASLGSGETSPALGIPLWWVYLALPVGFALLALRALLR 126 (133) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999999999999998088687755579999999999999999999999 No 297 >KOG0832 consensus Probab=21.74 E-value=26 Score=9.08 Aligned_cols=19 Identities=32% Similarity=0.492 Sum_probs=0.0 Q ss_pred CCCCCCCCCHHHHHHHHHC Q ss_conf 8775567798989998541 Q gi|254781029|r 76 PANDLDNGGITHILQNIGK 94 (1828) Q Consensus 76 pa~~d~r~~~~~~~~~l~~ 94 (1828) |||||.-.++--++.-|.. T Consensus 217 PaNDDs~~sv~f~~~l~k~ 235 (251) T KOG0832 217 PANDDSPASVEFILNLLKR 235 (251) T ss_pred CCCCCCHHHHHHHHHHHHH T ss_conf 8888759999999999999 No 298 >PRK09500 potC spermidine/putrescine ABC transporter membrane protein; Reviewed Probab=21.72 E-value=26 Score=9.07 Aligned_cols=68 Identities=15% Similarity=0.117 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCHH------HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 322799999999999999999999831621------45557656406488999999999999999999999999 Q gi|254781029|r 97 LNIALRNAFMSSLIWLACELGFLIKFYTNS------PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++-....+++++.++..+-......+.+++ .+.+.-.....|..-|+.+..++++++++|..-.+.|| T Consensus 181 ~~P~i~~~~il~F~~s~~ef~~~~~l~g~~~~tlp~~i~~~~~~~~~~~~~Ala~il~~i~~~~~~l~~~l~r~ 254 (257) T PRK09500 181 AMPAVAAGWLLSFTLSMDDVVVSSFVTGPSYEILPLKIYSMVKVGVSPEVNALATILLVLSLVLVILSQLLARD 254 (257) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 79999999999999999999999975289960289999999873787599999999999999999999998665 No 299 >TIGR00844 c_cpa1 sodium/hydrogen antiporter; InterPro: IPR004712 Na^+/H^+ exchange proteins eject protons from cells, effectively eliminating excess acid from actively metabolising cells. Na^+/H^+ exchange activity is also crucial for the regulation of cell volume, and for the reabsorption of NaCl across renal, intestinal, and other epithelia. These antiports exchange Na^+ for H^+ in an electroneutral manner, and this activity is carried out by a family of Na^+/H^+ exchangers, or NHEs, which are known to be present in both prokaryotic and eukaryotic cells. In mammalian cells, Na^+/H^+ exchange activity is found in both the plasma membrane and inner mitochondrial membrane. To date, six mammalian isoforms have been identified (designated NHE1-NHE6) , . These exchangers are highly-regulated (glyco)phosphoproteins, which, based on their primary structure, appear to contain 10-12 membrane-spanning regions (M) at the N-terminus and a large cytoplasmic region at the C-terminus. The transmembrane regions M3-M12 share identity with other members of the family. The M6 and M7 regions are highly conserved. Thus, this is thought to be the region that is involved in the transport of sodium and hydrogen ions. The cytoplasmic region has little similarity throughout the family. There is some evidence that the exchangers may exist in the cell membrane as homodimers, but little is currently known about the mechanism of their antiport . This group is specific for the fungal members of this family. ; GO: 0015385 sodium:hydrogen antiporter activity, 0006814 sodium ion transport, 0016021 integral to membrane. Probab=21.69 E-value=26 Score=9.07 Aligned_cols=44 Identities=9% Similarity=0.095 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999998316214555765640648899999999999999999 Q gi|254781029|r 114 CELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) .|+|+....|--.++.-=..+.-.--.+|.++|.||+|+-|=|| T Consensus 17 ~CvGiFSSIFSLVSLfVKE~LYIGES~VA~iFGLIVGP~CLNWF 60 (923) T TIGR00844 17 SCVGIFSSIFSLVSLFVKEKLYIGESMVASIFGLIVGPHCLNWF 60 (923) T ss_pred HHHHHHHHHHHHHHHHHHHHHEECHHHHHHHHHHEECCCCCCCC T ss_conf 77768999999988777521010302412331010257101546 No 300 >pfam10701 consensus Probab=21.68 E-value=26 Score=9.07 Aligned_cols=68 Identities=12% Similarity=0.025 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999998316214555765640648899999999999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMH 169 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~ 169 (1828) |.|++|+.+|+.---.-..++.--.....+.-....++.-..+..++.+=+.++++|-.=-||-+-|+ T Consensus 28 w~AflF~~lW~l~~rlWl~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~l~~~l~~Gleg~~lr~~~L~ 95 (118) T pfam10701 28 WLAFLFGPLWLLYHRLWLAAIAVLALSIALAAAGTALGLGLPAYFLVNLLLSLLFGLEAPDLRRWKLS 95 (118) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 79999989999999999999999999999999999844136799999999999999733199999999 No 301 >PRK06945 flgK flagellar hook-associated protein FlgK; Validated Probab=21.66 E-value=26 Score=9.06 Aligned_cols=84 Identities=7% Similarity=0.163 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 251 KQEREAIINHGTQLCTSIAEVHESLKEELSLTSEEISVHLSRAIDSFQSIVDVRIAKVTEKTTRIVQESAQTISSKIDQL 330 (1828) Q Consensus 251 ~~ere~i~~~ae~~r~~i~~~~e~l~~el~~~s~~i~~~~~~~~e~~~~~le~r~~~l~~~~~~~~~~~~~~l~~~~~~l 330 (1828) ...|..++..++.+-..|......|..--..+...|...|+++ -.+...+......|......-++...++++.| +.+ T Consensus 127 ~a~R~~~~~~a~~L~~~~~~~~~~l~~~~~~~~~~i~~~V~~i-N~l~~~Ia~LN~qI~~~~~~~g~~~NdL~DqR-D~l 204 (649) T PRK06945 127 PSARQTALSNAQALASQFNAAGQQYDQLRQGVNTQLTSTVTQI-NSYTKQIAQLNQQIAKAESSQGQPPNDLLDQR-DLA 204 (649) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH-HHH T ss_conf 8999999999999999999999999999999999999999999-99999999999999997556899926899999-999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781029|r 331 LEVLHS 336 (1828) Q Consensus 331 ~~~L~~ 336 (1828) ++.|.. T Consensus 205 l~eLS~ 210 (649) T PRK06945 205 VSNLSK 210 (649) T ss_pred HHHHHH T ss_conf 999986 No 302 >pfam04403 PqiA Paraquat-inducible protein A. Paraquat is a superoxide radical-generating agent. The promoter for the pqiA gene is also inducible by other known superoxide generators. This is predicted to be a family of integral membrane proteins, possibly located in the inner membrane. This family is related to NADH dehydrogenase subunit 2 (pfam00361). Probab=21.66 E-value=26 Score=9.06 Aligned_cols=79 Identities=16% Similarity=0.169 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCH-------HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH---------- Q ss_conf 79999999999999999999983162-------1455576564064889999999999999999999999---------- Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFYTN-------SPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI---------- 162 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~---------- 162 (1828) ++.+|+++=.+.+..-=.+....+|. +.+..+-+.-.-+..+-+++..+++|++.+..+..+. T Consensus 7 Al~iaalil~~~An~~P~l~~~~~G~~~~~tl~~~i~~L~~~g~~~la~lv~~~si~~P~~~l~~l~~l~~~~~~~~~~~ 86 (162) T pfam04403 7 ALLIAALILYIPANLFPILETEVLGLTSEDTILGGVLALWHQGSWPLALLVFLASILVPLLKLLALLYLLLSLQRGRRPP 86 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999999746999885578776719999999997898189999999999999999999999999986376888 Q ss_pred --HHHHHHHHHHH----HHHHH Q ss_conf --99999999999----99999 Q gi|254781029|r 163 --SRARDMHDASQ----SIAGI 178 (1828) Q Consensus 163 --~r~~~~~~~a~----~m~~~ 178 (1828) ++..=+|.+-+ ||.|| T Consensus 87 ~~~~~~l~r~~~~l~~WsM~DV 108 (162) T pfam04403 87 GRRRMRLLRILERIGPWSMLDV 108 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 5789899999999865789999 No 303 >PRK12883 ubiA prenyltransferase UbiA-like protein; Reviewed Probab=21.66 E-value=26 Score=9.06 Aligned_cols=67 Identities=10% Similarity=0.001 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHH--HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999831621455--576564064889999999999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMD--SFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDM 168 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~ 168 (1828) |++.+++++.......+++.+-+-..++ ..+-+.+.+-..|.++.+-+.+..++.++|.+.+=++|. T Consensus 104 ~~~~~l~~~~~~l~~~Ys~~lK~~~l~gNi~va~~~g~~~l~g~~a~g~~~~~~~l~~~afl~~l~rei 172 (275) T PRK12883 104 IWAFLFALIAYVLTFIYAWKLKPLPFIGNVVVALLTAATPIYGAVGVGRIGLAGYLAICAFLVNVSREI 172 (275) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999989986658855999999999999999999957540999999999999999999 No 304 >PRK02391 heat shock protein HtpX; Provisional Probab=21.65 E-value=26 Score=9.06 Aligned_cols=67 Identities=18% Similarity=0.113 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998316214555765640648899999999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMH 169 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~ 169 (1828) .++-++++.+...++....+.++ .+..+........+..+.++++.+++|++.++.. +.+.|.+|.. T Consensus 150 il~~tia~~l~~i~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ll~-~aiSR~REy~ 216 (297) T PRK02391 150 VAVMTIASFLSTIAGMIVRNGFY-FGGFGGRGRGDGGGGLLLVWLVSLVVWALSFLLI-RALSRYREFA 216 (297) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH T ss_conf 99999999999999999999999-8440567775422899999999999999999999-9988889999 No 305 >PRK01265 heat shock protein HtpX; Provisional Probab=21.65 E-value=26 Score=9.06 Aligned_cols=73 Identities=12% Similarity=0.135 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH----------HHHHHH Q ss_conf 999999999999998316214555765640648899999999999999999---9999999999----------999999 Q gi|254781029|r 107 SSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFS---FFIMISRARD----------MHDASQ 173 (1828) Q Consensus 107 ~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~---~a~~~~r~~~----------~~~~a~ 173 (1828) .+.+-++.|+++.....+-..+..+-+.++.-..++.++.+++.-++-||+ ++..+-|+++ |..+-+ T Consensus 10 ~~~~l~~~~~~~l~~~i~~~~~~~~~g~~~~g~i~~ali~~~~~~~~~y~~s~~ivl~~~~a~~v~~~~~~~p~L~~iV~ 89 (326) T PRK01265 10 LSMILVGIGIAALGFAIAYGVLYYLFGIYTIGIITGALIFVLFFDIIQWLFGPYIINAVYRAKEVSPTDPQYGWLYNIVQ 89 (326) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCEECCCCCCHHHHHHHHHH T ss_conf 99999999999999999999999998630889999999999999999999349999997297679988802679999999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781029|r 174 SIAGIA 179 (1828) Q Consensus 174 ~m~~~a 179 (1828) .|+..| T Consensus 90 ~la~~a 95 (326) T PRK01265 90 EVAEYN 95 (326) T ss_pred HHHHHC T ss_conf 999975 No 306 >KOG3145 consensus Probab=21.65 E-value=26 Score=9.06 Aligned_cols=44 Identities=20% Similarity=0.208 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999831621455576564064889999999999999999 Q gi|254781029|r 113 ACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFF 156 (1828) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 156 (1828) ||.+.+........--.++..|++.|+-.|+....++--|+|++ T Consensus 306 GG~~slLQMilQ~~N~~sw~~f~gnp~KfGLg~vSi~FdiiFm~ 349 (372) T KOG3145 306 GGTASLLQMILQSSNDNSWDTFYGNPGKFGLGLVSIFFDIIFMM 349 (372) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 64799999999870633177771681120113478899999986 No 307 >PRK00220 putative glycerol-3-phosphate acyltransferase PlsY; Provisional Probab=21.62 E-value=26 Score=9.05 Aligned_cols=62 Identities=15% Similarity=0.168 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999983162145557656406488999999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARD 167 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~ 167 (1828) .|.+.++-++| .+......-.+++|+...+..|....++-.-....+..+.....++||-.+ T Consensus 120 p~~~l~~~~~~-----~~~~~~tr~vSl~Si~~~~~~p~~~~~~~~~~~~~~~~~~~~~lii~rHr~ 181 (198) T PRK00220 120 PLLALILAATW-----LLVALLTRYSSLAALVAALIAPIYVWWYSYDPQFTIPVALLSVLIIYRHRD 181 (198) T ss_pred HHHHHHHHHHH-----HHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999-----999998333119999999999999999925578999999999999999999 No 308 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=21.61 E-value=26 Score=9.05 Aligned_cols=59 Identities=10% Similarity=0.095 Sum_probs=0.0 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH--HHCCCHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 753102565554467786777778842157999998--718013899999975074562067741 Q gi|254781029|r 1694 ILSSSTHSKGKSSSHIDISDKDSLSSIDSLVENISK--FIDYDAFVQLWKSYTLGEDDIFSKRLY 1756 (1828) Q Consensus 1694 ll~~~~~~~~~~~~~~~~~~~~~~~~l~s~s~di~~--~~~~~~~~~~w~~y~~g~~~~f~~~~~ 1756 (1828) +|.+++. ++.+-....+.|...++|+++..+-... |||.-..+-..+.|..+ ||--|++ T Consensus 16 ~l~GCs~-~~a~~~sv~~a~~~tt~~v~~td~~~~a~~midSm~~~p~~~~ik~~---i~v~~vv 76 (200) T COG3417 16 FLSGCSS-EPAPKESVKDAPENTTEPVDPTDLELIAQGMIDSMPSDPNFEHIKPN---IAVEPVV 76 (200) T ss_pred HHHHCCC-CCCCCHHCCCCCCCCCCCCCCCCHHHCCHHHHHCCCCCCCHHHCCCC---CCEEEHH T ss_conf 9841136-77860102768665437889623222013444315689516544887---4221143 No 309 >KOG3626 consensus Probab=21.61 E-value=26 Score=9.04 Aligned_cols=68 Identities=12% Similarity=0.020 Sum_probs=0.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 873227999999999999999999998316214555765640648899-999999999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVIT-LFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~~a~~~~ 163 (1828) +.++.-+|+-.+++-+-.+.|.-..+.+- -+-+.....---+|.-|| +++..+++|++-|.+-..|.+ T Consensus 230 ~~~~~~~~llff~~q~l~GIG~Tpi~tlG-isYiDDnvk~~~SplYlgi~~~~~~lGPaiGfllgS~~l~ 298 (735) T KOG3626 230 KESTSYPFLLFFLGQLLLGIGATPIFTLG-ISYIDDNVKKKNSPLYLGILYSMAILGPAIGFLLGSFCLK 298 (735) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCCCC-CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 66775157899999997006778874687-7421457564567089999999998604999999999877 No 310 >PRK06743 flagellar motor protein MotP; Reviewed Probab=21.60 E-value=26 Score=9.04 Aligned_cols=87 Identities=13% Similarity=0.179 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 99999999999983162145557656406488999999999999999999999999999999999999999973092555 Q gi|254781029|r 109 LIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIALRLIDPEEY 188 (1828) Q Consensus 109 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~rl~~p~~~ 188 (1828) ++-++.|+.+.-.+.+++.+.++..|+-.|..+ +++|+.++-++.-|-+. ++..+-+.+..+-..=.....- T Consensus 2 Gl~~g~~~v~~~~~l~gg~~~~l~~f~~~~s~l-IV~GGt~~a~li~~~~~-------~i~~~~~~~~~~~~~~~~~~~~ 73 (254) T PRK06743 2 GIIVGFAIVIAAIMLGGGGIKAFKNFLDVSSIL-IVIGGTTATIVVAYRFG-------EIKKYTKSIFTVLHRREEDLEQ 73 (254) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHCCCCHHH-HHHHHHHHHHHHHCCHH-------HHHHHHHHHHHHHCCCCCCHHH T ss_conf 247999999999998187477887414705668-45779999999977899-------9999999999996677789999 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 699999999999999 Q gi|254781029|r 189 SSEKMQSISSAVRKE 203 (1828) Q Consensus 189 a~~~~~~~~qa~r~e 203 (1828) --+.++.++.-.||+ T Consensus 74 ~i~~lv~la~~aRk~ 88 (254) T PRK06743 74 LTDLFVDFSKKSKKH 88 (254) T ss_pred HHHHHHHHHHHHHHC T ss_conf 999999999998543 No 311 >PRK13852 type IV secretion system protein VirB6; Provisional Probab=21.55 E-value=26 Score=9.03 Aligned_cols=56 Identities=16% Similarity=0.067 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 732279999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) |-....+++....+.+..+.+++.+..| .-..+++.+..+++|+.....+-.-.|| T Consensus 137 ~~~~~~~l~~~g~~~~~~~~l~i~f~i~-------------l~aki~lailLaIGPiFI~~~LF~~TRr 192 (295) T PRK13852 137 PMNDQDILAFQGAQWVFYGTLWSAFGVY-------------DAVGILTKVLLAIGPLILVGYLFDRTRD 192 (295) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7886899999999999999999999999-------------9999999999997199999998888789 No 312 >PRK09561 nhaA pH-dependent sodium/proton antiporter; Reviewed Probab=21.51 E-value=26 Score=9.02 Aligned_cols=54 Identities=15% Similarity=0.037 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999831621--45557656406488999999999999999999999999 Q gi|254781029|r 111 WLACELGFLIKFYTNS--PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 111 w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) |++..+--.++++.-+ .-+..-+.+..|-.+|+++|.+++-.+-+|+|+.+..| T Consensus 258 ~v~~~ilPlFAfaNAGV~l~~~~~~~l~~~v~lGI~~GL~iGK~lGI~~~~~la~k 313 (388) T PRK09561 258 WVAYLILPLFAFANAGVSLQGVTLDGLTSILPLGIIAGLLIGKPLGISLFCWLALR 313 (388) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHCHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 47889889999850475646663776417489999999981642879999999998 No 313 >PRK13850 type IV secretion system protein VirD4; Provisional Probab=21.49 E-value=26 Score=9.01 Aligned_cols=78 Identities=17% Similarity=0.179 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH----HHHHHHHHHHCHHHHHHH--------HHHHHHHHHHHHHHHHHHH Q ss_conf 7322799999999999999999999831621----455576564064889999--------9999999999999999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLACELGFLIKFYTNS----PMDSFYDFAMRPEVITLF--------LVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~--------~~~~~~p~~~~~~~a~~~~ 163 (1828) |-+.++-++.-..+-|.+++.+.+|.+--.. ...-+.-....|-.+|-. ++++..-|+.+-+.+++.+ T Consensus 8 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (670) T PRK13850 8 PQRLAVSIVCSLAAGFCAASLYVTFRHGFNGEAMMTFDVFAFWYETPLYLGYATPVFYRGLAIVVSTSVSVLLSQLIVSL 87 (670) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHCC T ss_conf 86520345578899999989999999605844468762001102152200237446551115655647999999999701 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254781029|r 164 RARDMHDASQ 173 (1828) Q Consensus 164 r~~~~~~~a~ 173 (1828) |.++.+--|| T Consensus 88 r~~~~hGsAr 97 (670) T PRK13850 88 RNREHHGTAR 97 (670) T ss_pred CCCCCCCCCC T ss_conf 0212457603 No 314 >COG4789 EscV Type III secretory pathway, component EscV [Intracellular trafficking and secretion] Probab=21.46 E-value=26 Score=9.00 Aligned_cols=38 Identities=16% Similarity=0.008 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999831621455576564064889999999999999999 Q gi|254781029|r 109 LIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFF 156 (1828) Q Consensus 109 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 156 (1828) ++|+++.+.+.++++| +|-.|..+.+ ++.+.+|..+|| T Consensus 281 al~vaa~lll~~gliP---------GFP~~~Fl~L-A~~~~~~~~~~~ 318 (689) T COG4789 281 ALLVAAALLLLFGLIP---------GFPLLVFLFL-AVLLAAIGFLLR 318 (689) T ss_pred HHHHHHHHHHHHHHCC---------CCHHHHHHHH-HHHHHHHHHHHH T ss_conf 7899999999987258---------9728999999-999987888732 No 315 >cd01042 DMQH Demethoxyubiquinone hydroxylase, ferritin-like diiron-binding domain. Demethoxyubiquinone hydroxylases (DMQH) are members of the ferritin-like, diiron-carboxylate family which are present in eukaryotes (the CLK-1/CAT5 family) and prokaryotes (the Coq7 family). DMQH participates in one of the last steps of ubiquinone biosysnthesis and is responsible for DMQ hydroxylation, resulting in the formation of hydroxyubiquinone, a precursor of ubiquinone. CLK-1 is a mitochondrial inner membrane protein and Coq7 is a proposed interfacial integral membrane protein. Mutations in the Caenorhabditis elegans gene clk-1 affect biological timing and extend longevity. The conserved residues of a diiron center are present in this domain. Probab=21.45 E-value=26 Score=9.00 Aligned_cols=33 Identities=21% Similarity=0.180 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCH-HHHHHHHHHHHHHHHHHH Q ss_conf 898999854187322-799999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNI-ALRNAFMSSLIWLACELG 117 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~-~~~~~~~~s~~w~~~~~~ 117 (1828) ...+|..+..|||.. |+|..+.|.+=|+.+-+| T Consensus 49 f~~~l~~~~~r~s~l~PlW~~~gf~lG~~~al~G 82 (165) T cd01042 49 FEELLPELGVRPSLLLPLWYVAGFALGALTALLG 82 (165) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 9998886599986078889996889999999827 No 316 >PRK10941 putative transcriptional regulator; Provisional Probab=21.40 E-value=27 Score=8.99 Aligned_cols=100 Identities=11% Similarity=0.004 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 42157999998718013899999975074562067741252599-89999998507887899789999999999999982 Q gi|254781029|r 1719 SIDSLVENISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQK-VFLNLQEQYKADSALRNAIDRYISNFEEMLSEIAQ 1797 (1828) Q Consensus 1719 ~l~s~s~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~-~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~~~ 1797 (1828) ||---++-|++.+.+|..+.--..-...=-.-.-.++-....+. -..-+-+-.-.+-.|+.+.+-|-.-=-..|+.|.. T Consensus 13 ~L~e~al~ia~~~~pd~~~~~~~~~Ld~L~~~~~~~l~~~~~~~~~l~~L~~~~y~e~gF~G~~~~Y~~~~ns~l~~Vl~ 92 (269) T PRK10941 13 PLCDGMILASEAIRRDFPSQDVYDELERLVSLAREEISQLLSQDEQLEKLIALFYGEWGFGGARGVYRLSDALWLDKVLK 92 (269) T ss_pred CHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 59999999999757899979999999999999997667668989999999999998628888824437821346999998 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 5873378999987664399999998324 Q gi|254781029|r 1798 SNDDSPLVQEHIMSNYGKVYTMLVHASG 1825 (1828) Q Consensus 1798 ~~~~~~~~~~~l~s~~g~~y~~la~a~g 1825 (1828) +++|..++ .|-||.-+|+..| T Consensus 93 ~r~G~Pis-------L~~i~l~iA~~l~ 113 (269) T PRK10941 93 KRQGSAVS-------LGAILLWIANRLD 113 (269) T ss_pred HCCCCCHH-------HHHHHHHHHHHCC T ss_conf 27887389-------9999999999869 No 317 >COG3416 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.39 E-value=27 Score=8.98 Aligned_cols=66 Identities=8% Similarity=-0.023 Sum_probs=0.0 Q ss_pred CCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 888-7663102346788886555567788888788877556779898999854187322799999999999999999999 Q gi|254781029|r 42 SRR-DDSIESKDRQQTTLSKNFKQASNKQTYSGFYPANDLDNGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLI 120 (1828) Q Consensus 42 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pa~~d~r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~ 120 (1828) ++. .+++.-.|++..+..|+--+.++-...+...|+.. +++.+-|...++-.+.=|+||..|+. T Consensus 93 t~~e~app~~a~~p~~~aap~S~rs~~~g~t~~p~paa~---------------~~r~ssFLG~AlqTAAGVAGGMlL~n 157 (233) T COG3416 93 TPQEPAPPANAPPPKEPAAPPSWRSSPAGPTTQPSPAAA---------------NTRSSSFLGGALQTAAGVAGGMLLAN 157 (233) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---------------CCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 888899976789988777998765454678888873224---------------66540168888888740102679998 Q ss_pred HH Q ss_conf 83 Q gi|254781029|r 121 KF 122 (1828) Q Consensus 121 ~~ 122 (1828) .+ T Consensus 158 ~L 159 (233) T COG3416 158 GL 159 (233) T ss_pred HH T ss_conf 99 No 318 >pfam10081 DUF2319 Predicted membrane protein (DUF2319). Members of this family of hypothetical bacterial proteins have no known function. Probab=21.39 E-value=27 Score=8.98 Aligned_cols=111 Identities=9% Similarity=0.019 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89899985418732279999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ++.+.+.|..+..+-...-..-..++++++.+.+....++-.....+..+.++|.. .....+++.++-.-+|+.++-= T Consensus 52 ~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~wQ~~~r~lmg~~~~--~~~~~~~~~~~a~~~~~~ll~~ 129 (534) T pfam10081 52 LRWLWRYLELPELRPRPRRGARLVAVALGAAVTVLMLVQAARWQNSVRALMGMPPL--ETAAYLRGVLVALLVFGALVAL 129 (534) T ss_pred HHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--CCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999984288878775248999999999999999999998855999998289987--7058899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 999999999999999973092555699999999 Q gi|254781029|r 165 ARDMHDASQSIAGIALRLIDPEEYSSEKMQSIS 197 (1828) Q Consensus 165 ~~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~ 197 (1828) ++-++...|-+.-...|+.-|....-=.++.++ T Consensus 130 gr~~~~~~r~l~~~l~R~~P~~va~~v~~~~~~ 162 (534) T pfam10081 130 GRLLRRLIRFLARRLQRVLPRRVANVVGVLVVV 162 (534) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 999999999999861132768999999999999 No 319 >TIGR00680 kdpA K+-transporting ATPase, A subunit; InterPro: IPR004623 Kdp is a high affinity ATP-driven K+ transport system in Escherichia coli. It is composed of three membrane-bound subunits, KdpA, KdpB and KdpC and one small peptide, KdpF. KdpA is the K+-transporting subunit of this complex. During assembly of the complex, KdpA and KdpC bind to each other. This interaction is thought to stabilize the complex. Data indicates that KdpC might connect the KdpA, the K+-transporting subunit, to KdpB, the ATP-hydrolyzing (energy providing) subunit .; GO: 0008556 potassium-transporting ATPase activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=21.37 E-value=27 Score=8.98 Aligned_cols=39 Identities=3% Similarity=0.245 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCHH--HHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 8989998541873227--99999999999999999999831 Q gi|254781029|r 85 ITHILQNIGKNSLNIA--LRNAFMSSLIWLACELGFLIKFY 123 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~--~~~~~~~s~~w~~~~~~~~~~~~ 123 (1828) +-.++|.|.|++.++. +|.=.+.+++|+...+++..+++ T Consensus 153 ~~AlIRgltrq~~StlGNfwvDL~r~~lril~P~slv~Ail 193 (571) T TIGR00680 153 LIALIRGLTRQRASTLGNFWVDLVRSVLRILLPISLVVAIL 193 (571) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998766400256643489999999999999999999999 No 320 >COG0586 DedA Uncharacterized membrane-associated protein [Function unknown] Probab=21.35 E-value=27 Score=8.97 Aligned_cols=74 Identities=12% Similarity=0.168 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 79898999854187322799999999999999999999831621455576564064889999999999999999999999 Q gi|254781029|r 83 GGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 83 ~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) |+...+..=+-+=|-+.=.+...+.+++|+.+-..+.+.+ ...........-.....++++++++...+... T Consensus 125 Rt~ip~~AG~~~m~~~~F~~~n~~ga~iW~~~~~~lGy~~--------G~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 196 (208) T COG0586 125 RTLVPIVAGMSKMPLRRFLLYNILGALLWALVLTLLGYLL--------GEVIDVLVHLLLVLLGILVLAVLVLVLLLALL 196 (208) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--------HCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7687798786059859999999999999999999999998--------21489888789999999999999999999999 Q ss_pred HH Q ss_conf 99 Q gi|254781029|r 163 SR 164 (1828) Q Consensus 163 ~r 164 (1828) |+ T Consensus 197 ~~ 198 (208) T COG0586 197 WW 198 (208) T ss_pred HH T ss_conf 99 No 321 >cd00378 SHMT Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites. Probab=21.33 E-value=27 Score=8.97 Aligned_cols=43 Identities=23% Similarity=0.292 Sum_probs=0.0 Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999---999999999999999730925556999999999999999972 Q gi|254781029|r 156 FSFFIMISRA---RDMHDASQSIAGIALRLIDPEEYSSEKMQSISSAVRKEIVLM 207 (1828) Q Consensus 156 ~~~a~~~~r~---~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev~a~ 207 (1828) .+...|++|| .||+.+|.-|..|...--.| .+-..||.||.+| T Consensus 357 iGT~a~T~rG~~e~em~~ia~lI~~~l~~~~d~---------~~~~~v~~~V~~L 402 (402) T cd00378 357 IGTPAMTTRGMGEEEMEEIADFIARALKDAEDV---------AVAEEVRKEVAEL 402 (402) T ss_pred ECCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCH---------HHHHHHHHHHHHC T ss_conf 477899858998899999999999998178988---------9999999999759 No 322 >pfam09726 Macoilin Transmembrane protein. This entry is a highly conserved protein present in eukaryotes. Probab=21.33 E-value=27 Score=8.97 Aligned_cols=270 Identities=11% Similarity=0.071 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 1400 EIVSKFDKNSQILIKSHDSLMKAQSETKLSLDKDANNLVDLTSRLVSKSSEAQKFVMSILVDVKKIVEQADFLSDTVVKN 1479 (1828) Q Consensus 1400 ~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~l~~~~~~l~~l~~~l~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~ 1479 (1828) +...+++...+.|............+|...+......-..+...+.+...+.+.....+....... +.+...-+.+.. T Consensus 406 ~~~~rLE~dlKkLraELq~~Rq~E~ELR~qi~~l~s~er~~kseL~ql~~e~e~lq~K~~~l~~~r--q~Dkq~l~~lEk 483 (680) T pfam09726 406 SAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAK--QKDKQSMQSMEK 483 (680) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHH T ss_conf 289999999999999999878779999999987650553106789999999999999999999899--988999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999999---99999999999999999999999999999999999999 Q gi|254781029|r 1480 MTDSIQSSFIKIDGTLSNIETRSRDTVRLIDHNLADIGN---KTVKTIDSNFVTLKEKSYDLSNHMRQKICSTIPNIENI 1556 (1828) Q Consensus 1480 ~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 1556 (1828) ...........++..+.+...+............+.... +..+........++...+.+...+... +.....++.. T Consensus 484 rl~~E~~~r~~~E~qL~~Erk~kk~ee~~aar~~a~~~~~r~Ec~e~ck~Rr~qLE~E~kkLr~eLk~k-Ee~~~~~E~E 562 (680) T pfam09726 484 RLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLK-EEECRMLEKE 562 (680) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH T ss_conf 999999989889999999999887778887443345552350058999999999999999999999877-9999999999 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999-9999999999999999999999999999999999999999999999999984999999999999999999 Q gi|254781029|r 1557 FSTLEEKSD-QSMQVFLDSLNNKVDSFTQKLSKTSDDIALTSRRIAEDLNNSRDILKRDSVSLAKEAKESADTIRSAIEE 1635 (1828) Q Consensus 1557 ~~~~~~~~~-~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eLe~~r~eLkr~~~eL~~Ea~e~~~~~rk~v~e 1635 (1828) +..++...+ +.-....-..-..+++.+..++.....-.+..-++=.-|-.+|-.| +=+.-..-.--+-|.+ T Consensus 563 ~~~lr~~kese~d~E~LmsAL~amQdKn~hLEnsLSAETRiKLDLFSALGdAKRQl--------EI~~~~i~~ke~Ei~~ 634 (680) T pfam09726 563 AQELRKYQESEKETEVLMSALQAMQDKNLMLENSLSAETRLKLDLFSALGDVKRQL--------EIAHGAIYKREAEIVD 634 (680) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHH T ss_conf 99998666034539999999999888899876411367888889999877888899--------9998777515689999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999999999985320353344444567876567788787888776 Q gi|254781029|r 1636 QINTLKDFQKLITDSVKNNAASYNKGLHSDEYNISQVDKRPSGKK 1680 (1828) Q Consensus 1636 qi~al~el~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 1680 (1828) ----++++-.+.-.+.-.+..|..++.-...+|...-++.|.-|+ T Consensus 635 LK~kiaq~mAVmP~~~~~~~~p~~~~~~~~~~p~~~~~~~~~~~~ 679 (680) T pfam09726 635 LKSKIADILAVMPESRIRSVTPHYSANFLEKPPLVRLSPTSEYPN 679 (680) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 999999998428745678999988764336886546899988999 No 323 >COG1972 NupC Nucleoside permease [Nucleotide transport and metabolism] Probab=21.29 E-value=27 Score=8.96 Aligned_cols=108 Identities=16% Similarity=0.074 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999831621455576564064889999999999999999999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRARDMHDASQSIAGIAL 180 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a~ 180 (1828) |+-+.--.+-|++.|+.-...+...+.-.-++++...-+.. .|+.-++.||+||=.+-...+-=-=|..+-+-..-+-. T Consensus 48 ~~p~G~~~l~~~~~~v~~li~~a~~Gi~FvFG~l~~~~~gf-~Fa~~VL~~IiF~saLi~iLyy~gIlp~vI~~iG~~l~ 126 (404) T COG1972 48 YTPAGQWALRGLANGVEKLIAYANAGINFVFGSLADPKAGF-IFAFRVLPPIIFISALISILYYIGILPLVIRIIGGGLQ 126 (404) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCE-EEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 57578999999989999999840257214426556888761-55298643899899999999998299999999999999 Q ss_pred HH---CCHHHHHHHHHHHHHHH-----HHHHHHHHHH Q ss_conf 73---09255569999999999-----9999997236 Q gi|254781029|r 181 RL---IDPEEYSSEKMQSISSA-----VRKEIVLMTE 209 (1828) Q Consensus 181 rl---~~p~~~a~~~~~~~~qa-----~r~ev~a~~~ 209 (1828) +. ..+|.+..-..+-+||. ||..+.-|++ T Consensus 127 K~lg~sk~ES~~a~a~iflGqtE~pl~vk~~l~~mt~ 163 (404) T COG1972 127 KALGTSKLESFSAVANIFLGQTEAPLVVKPYLGKMSR 163 (404) T ss_pred HHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHCCH T ss_conf 9857985026999999992785431013776623348 No 324 >pfam07355 GRDB Glycine/sarcosine/betaine reductase selenoprotein B (GRDB). This family represents a conserved region approximately 350 residues long within the selenoprotein B component of the bacterial glycine, sarcosine and betaine reductase complexes. Probab=21.26 E-value=27 Score=8.95 Aligned_cols=51 Identities=27% Similarity=0.329 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999973092555-699999999999999997236899999 Q gi|254781029|r 165 ARDMHDASQSIAGIALRLIDPEEY-SSEKMQSISSAVRKEIVLMTEEIDRAI 215 (1828) Q Consensus 165 ~~~~~~~a~~m~~~a~rl~~p~~~-a~~~~~~~~qa~r~ev~a~~~~~er~~ 215 (1828) |..||.....|+-.|.+|+.-|.+ ..+-=.-+.+-||..+-.=--|.+||+ T Consensus 141 Aa~Mr~a~~~ma~la~Kl~~Ge~ig~p~eEGY~~rGiR~n~f~e~~ga~RaV 192 (349) T pfam07355 141 AAGMRKALPAMAKLALKLAKGEEIGPPEEEGYIPRGIRVNYFHEERGAKRAV 192 (349) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCEEEECCCCCHHHHHH T ss_conf 7678888999999999986689889965517565660664125531579999 No 325 >TIGR01165 cbiN cobalt transport protein; InterPro: IPR003705 The cobalt transport protein CbiN is part of the active cobalt transport system involved in uptake of cobalt in to the cell involved with cobalamin biosynthesis (vitamin B12). It has been suggested that CbiN may function as the periplasmic binding protein component of the active cobalt transport system .; GO: 0015087 cobalt ion transmembrane transporter activity, 0006824 cobalt ion transport, 0009236 cobalamin biosynthetic process, 0016020 membrane. Probab=21.25 E-value=27 Score=8.94 Aligned_cols=16 Identities=19% Similarity=0.652 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8999999999999999 Q gi|254781029|r 140 VITLFLVVSVVPILLF 155 (1828) Q Consensus 140 ~~~~~~~~~~~p~~~~ 155 (1828) +|+.+++.+|+|.++| T Consensus 11 LL~~va~L~~lPL~i~ 26 (94) T TIGR01165 11 LLAAVAALVVLPLLIY 26 (94) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999871 No 326 >KOG0075 consensus Probab=21.24 E-value=27 Score=8.94 Aligned_cols=12 Identities=33% Similarity=0.952 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHC Q ss_conf 389999997507 Q gi|254781029|r 1735 AFVQLWKSYTLG 1746 (1828) Q Consensus 1735 ~~~~~w~~y~~g 1746 (1828) .+-.||+||-|| T Consensus 77 rfrsmWerycR~ 88 (186) T KOG0075 77 RFRSMWERYCRG 88 (186) T ss_pred CHHHHHHHHHCC T ss_conf 088899998603 No 327 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=21.24 E-value=27 Score=8.94 Aligned_cols=21 Identities=10% Similarity=0.126 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999 Q gi|254781029|r 143 LFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 143 ~~~~~~~~p~~~~~~~a~~~~ 163 (1828) +|++++++=++++|++-.|+| T Consensus 89 ~~~~G~~~f~~~l~~lS~fVW 109 (383) T pfam06898 89 GIVLGILLFVILLYFLSNFIW 109 (383) T ss_pred HHHHHHHHHHHHHHHHHHEEE T ss_conf 999999999999997840699 No 328 >pfam03264 Cytochrom_NNT NapC/NirT cytochrome c family, N-terminal region. Within the NapC/NirT family of cytochrome c proteins, some members, such as NapC and NirT, bind four haem groups, while others, such as TorC, bind five haems. This family aligns the common N-terminal region that contains four haem-binding C-X(2)-CH motifs. Probab=21.22 E-value=27 Score=8.94 Aligned_cols=24 Identities=17% Similarity=0.250 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 889999999999999999999999 Q gi|254781029|r 139 EVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 139 ~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) ..+.++++++++-|++|++|..++ T Consensus 7 ~~~~~~~~g~~~G~~~~~~~~~~~ 30 (174) T pfam03264 7 SLGVLLLVGFLLGVVFWGGFNTAL 30 (174) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999 No 329 >PHA01399 membrane protein P6 Probab=21.21 E-value=27 Score=8.93 Aligned_cols=47 Identities=15% Similarity=0.048 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH---------CHHHHHHHHHHH Q ss_conf 99999999999999999998316214555765640---------648899999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAM---------RPEVITLFLVVS 148 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~ 148 (1828) |+|.++..+|-++|.+.--..-..--+++..+... .|.++++.+++. T Consensus 128 wiagav~~iwkg~~~~a~a~weaakelgsaaggvi~giwdfikdnpeliagg~a~~ 183 (242) T PHA01399 128 WIAGAVAAIWKGAGAAADAAWEAAKELGSAAGGVIGGIWDFIKDNPELIAGGAAAA 183 (242) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99989999983331032679999998622015605548988625988874358898 No 330 >PRK06649 V-type ATP synthase subunit K; Validated Probab=21.19 E-value=27 Score=8.93 Aligned_cols=79 Identities=14% Similarity=0.063 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHH--HHHHHHHHHHHHHHHHHHHH Q ss_conf 8989998541873227999999999999999999998316214555765640648899--99999999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVIT--LFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~p~~~~~~~a~~~ 162 (1828) +|.+|--+=.+-+...-|+..+.-.+-.+.+++++--+-+........-+...|...+ +++.+++=-|.||-+.-.|+ T Consensus 61 YG~iLM~~m~~~~~~~~~~~~lgiG~~~Glai~~SA~~QGka~A~a~dA~~etgkgfg~yl~~lGivEtvALFv~VF~ll 140 (144) T PRK06649 61 YGYILMNTLAEVMSQTNPWLLLGAGFGGGLAIAASAFAQGRAAAGACDAFAETGKGFATYLLVLGLIESVALFVMVFLMI 140 (144) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999862688189999999989999999998116678888988766436999999999999999999999999 Q ss_pred H Q ss_conf 9 Q gi|254781029|r 163 S 163 (1828) Q Consensus 163 ~ 163 (1828) . T Consensus 141 ~ 141 (144) T PRK06649 141 F 141 (144) T ss_pred H T ss_conf 8 No 331 >PRK09649 RNA polymerase sigma factor SigC; Reviewed Probab=21.14 E-value=27 Score=8.91 Aligned_cols=42 Identities=21% Similarity=0.207 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 801389999997507456206774125259989999998507887 Q gi|254781029|r 1732 DYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSA 1776 (1828) Q Consensus 1732 ~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~ 1776 (1828) |-++..+|=.++++||+.. ...||..-.+..|.=+ .|..|++ T Consensus 7 dd~~l~aLv~~a~~GD~~A-f~~L~~~~~~~v~r~~--~~l~d~~ 48 (185) T PRK09649 7 DDEAVTALALSAAKGNGRA-LEAFIKATQQDVWRFV--AYLSDVG 48 (185) T ss_pred CHHHHHHHHHHHHHCCHHH-HHHHHHHHHHHHHHHH--HHHCCHH T ss_conf 6099999999999447999-9999999999999999--9938998 No 332 >COG1842 PspA Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms] Probab=21.12 E-value=27 Score=8.91 Aligned_cols=218 Identities=14% Similarity=0.150 Sum_probs=0.0 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 77764100444788640028899999999999999999999999999998999999999999999999999999999999 Q gi|254781029|r 1312 LKKLEALLISDVEKITNRITDSSQDVTTIISDATDSLNKVDERLHQTTNRITETTGHIDTVLAESSKLFEKKIKDLGEIS 1391 (1828) Q Consensus 1312 ~~~~~~~l~~~~~~~~~~l~~~~~~l~~~~~~~~~~l~~~~~~l~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~~l~~~~ 1391 (1828) +......+...+....+.+.+...-+...+......+..+...+............++.........+-......|..-. T Consensus 4 ~~r~~~~~~a~~~~~~dk~EDp~~~l~Q~ird~~~~l~~ar~~~A~~~a~~k~~e~~~~~~~~~~~k~e~~A~~Al~~g~ 83 (225) T COG1842 4 FSRLKDLVKANINELLDKAEDPEKMLEQAIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGN 83 (225) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 89999999988878888641899999999999999999999999999999999999999999999999999999998787 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 1392 RVSLLQMSEIVSKFDKNSQILIKSHDSLMKAQSETKLSLDKDANNLVDLTSRLVSKSSEAQKFVMSILVDVKKIVEQADF 1471 (1828) Q Consensus 1392 ~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~l~~~~~~l~~l~~~l~~~~~~~~~~~~~~~~~~~~~~e~~~~ 1471 (1828) ..+....-.-...++.....+......+......+...+......+..+............. ......+...... T Consensus 84 E~LAr~al~~~~~le~~~~~~~~~~~~~~~~~~~l~~~~~~Le~Ki~e~~~~~~~l~ar~~~-----akA~~~v~~~~~~ 158 (225) T COG1842 84 EDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARKAA-----AKAQEKVNRSLGG 158 (225) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHCC T ss_conf 99999999999989999999999999999999999999999999999999999999979878-----9999999987137 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999999999999999999 Q gi|254781029|r 1472 LSDTVVKNMTDSIQSSFIKIDGTLSNIETRSRDTVRLIDHNLADIGNKTVKTIDSNFVTLKEKSY 1536 (1828) Q Consensus 1472 ~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~e~~~~ 1536 (1828) .........-..++...........-...........++..+..... ...++..+..+..+.. T Consensus 159 ~s~~sa~~~fer~e~kiee~ea~a~~~~el~~~~~~dl~~e~a~~~~--~~~v~~~La~lka~~~ 221 (225) T COG1842 159 GSSSSAMAAFERMEEKIEEREARAEAAAELAEGSGDDLDKEFAQAGA--QSAVDSRLAALKARMK 221 (225) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCC--CCCHHHHHHHHHHHHC T ss_conf 89333699999999989999999977567640685438999988461--2317899999997605 No 333 >PRK10755 sensor protein BasS/PmrB; Provisional Probab=21.09 E-value=27 Score=8.90 Aligned_cols=169 Identities=17% Similarity=0.186 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH-----HHHHHHH-HHHHHHHH Q ss_conf 79999999999999999999983162145557656406488999999999999999999-----9999999-99999999 Q gi|254781029|r 100 ALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSF-----FIMISRA-RDMHDASQ 173 (1828) Q Consensus 100 ~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-----a~~~~r~-~~~~~~a~ 173 (1828) ++.++.++|++|+---..=-..++-...+......--.-..+.-+++.+++|-+++.++ -+++||. +-++-... T Consensus 21 ~~~~~~~i~~~~l~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~i~rPl~~l~~ 100 (355) T PRK10755 21 ILLVFQLISVFWLWHESTEQIQLLVQALLDNRNNDRHIMREIREAVASLIVPSLFMVSLTLLICFQAVRWITRPLAELQK 100 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999876788899999998724675225567899999999999999999999999999998547999999 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999997309255569999999999999999723689999999999999999999999999999999999999999999 Q gi|254781029|r 174 SIAGIALRLIDPEEYSSEKMQSISSAVRKEIVLMTEEIDRAISRASELEKTVRSEIEVLENNYTKSEMRIDNITQNLKQE 253 (1828) Q Consensus 174 ~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev~a~~~~~er~~~ra~ele~lv~~ev~~ler~y~dnE~rir~L~~el~~e 253 (1828) ++...+--=-+|-...+. ..|+..+.+....-..| |+.+|...-.=+...-+||..- T Consensus 101 ~~~~~~~~~l~~~~~~~~---------~dEl~~L~~~fN~M~~r--------------L~~~~~~qr~Fia~~SHELRTP 157 (355) T PRK10755 101 ELEARTADNLTPIAIHSA---------TLEIEAVTSALNQLVSR--------------LTSTLDNERLFTADVAHELRTP 157 (355) T ss_pred HHHHHCCCCCCCCCCCCC---------CCHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHCCH T ss_conf 999736689886688999---------71899999999999999--------------9999999999999757774038 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999 Q gi|254781029|r 254 REAIINHGTQLCTSIAEVHESLKEELSLTSEEISVHLS 291 (1828) Q Consensus 254 re~i~~~ae~~r~~i~~~~e~l~~el~~~s~~i~~~~~ 291 (1828) --.|.++.|-+...-......+...+......+..-+. T Consensus 158 Lt~I~~~lEll~~~~~~~~~~~~~~i~rl~~li~~LL~ 195 (355) T PRK10755 158 LAGVRLHLELLAKTHHIDVAPLIARLDQMMHSVSQLLQ 195 (355) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999997375753199999999999999999999 No 334 >pfam07074 TRAP-gamma Translocon-associated protein, gamma subunit (TRAP-gamma). This family consists of several eukaryotic translocon-associated protein, gamma subunit (TRAP-gamma) sequences. The translocation site (translocon), at which nascent polypeptides pass through the endoplasmic reticulum membrane, contains a component previously called 'signal sequence receptor' that is now renamed as 'translocon-associated protein' (TRAP). The TRAP complex is comprised of four membrane proteins alpha, beta, gamma and delta which are present in a stoichiometric relation, and are genuine neighbours in intact microsomes. The gamma subunit is predicted to span the membrane four times. Probab=21.08 E-value=27 Score=8.90 Aligned_cols=20 Identities=35% Similarity=0.587 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781029|r 142 TLFLVVSVVPILLFFSFFIM 161 (1828) Q Consensus 142 ~~~~~~~~~p~~~~~~~a~~ 161 (1828) +-++.+.++||-|||..-.| T Consensus 23 ~NAlIvS~iPiwLFw~Ih~M 42 (170) T pfam07074 23 GNAFIVSAIPIWLFWRVHQM 42 (170) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 57999987499999999953 No 335 >pfam06930 DUF1282 Protein of unknown function (DUF1282). This family consists of several hypothetical proteins of around 200 residues in length. The function of this family is unknown although a number of family members are thought to be putative membrane proteins. Probab=21.06 E-value=27 Score=8.89 Aligned_cols=60 Identities=13% Similarity=0.141 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999983162-1455576564064889999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTN-SPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) |.+++.+.+.++.--.+-.+.--+. ...-.+...++-|..+++++ .+.|+++|.+++.++ T Consensus 70 yl~~gv~~~a~~i~wm~~~~g~~~s~~~~~~~aaY~~tPlfl~giv--~l~p~lw~~~~v~~~ 130 (170) T pfam06930 70 YLAGGVFVLAYLIKLMGRTFGGDGSYKEVLAFAAYTATPLFLGGIA--ALIPPLWLGAFVSLV 130 (170) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH--HHCCHHHHHHHHHHH T ss_conf 9999999999999999887289998888899999871699999999--975189999999999 No 336 >TIGR01828 pyru_phos_dikin pyruvate, phosphate dikinase; InterPro: IPR010121 Pyruvate phosphate dikinase (PPDK, or pyruvate orthophosphate dikinase) is found in plants, bacteria and archaea. The amino acid sequence identity between bacterial and plant enzymes is high, and they are similar in sequence to other PEP-utilizing enzymes. PPDK catalyses the reversible conversion of ATP and pyruvate to AMP and PEP (phosphoenolpyruvate). In bacteria such as Clostridium symbiosum, PPDK uses Mg2+ and NH4+ ions as cofactors . The enzyme has three domains: the N- and C-terminal domains each have an active site centre that catalyses a different step in the reaction, and the middle domain has a carrier histidine residue that moves between the two active centres. In plants, PPDK is localised predominantly in chloroplast stroma where it catalyses the rate-limiting step in the C4 photosynthetic pathway, namely the synthesis of PEP, which acts as the primary CO2 acceptor in C4 photosynthesis . PPDK activity in C4 plants is strictly regulated by light, its activity decreasing in darkness. This response is regulated by phosphorylation and dephosphorylation of the enzyme using ADP; such regulation is not seen in the bacterial form of the enzyme. PPDK is also found in C3 plants, but it is not known to have a photosynthetic role .; GO: 0050242 pyruvate phosphate dikinase activity. Probab=20.93 E-value=28 Score=8.86 Aligned_cols=7 Identities=29% Similarity=0.524 Sum_probs=0.0 Q ss_pred HHHHHHH Q ss_conf 9999750 Q gi|254781029|r 1739 LWKSYTL 1745 (1828) Q Consensus 1739 ~w~~y~~ 1745 (1828) +||+||| T Consensus 135 AyDSYRR 141 (920) T TIGR01828 135 AYDSYRR 141 (920) T ss_pred HHHHHHH T ss_conf 5687788 No 337 >pfam08802 CytB6-F_Fe-S Cytochrome B6-F complex Fe-S subunit. The cytochrome B6-F complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI), cyclic electron flow around PSI, and state transitions. This domain corresponds to the alpha helical transmembrane domain of the cytochrome B6-F complex iron-sulphur subunit. Probab=20.88 E-value=26 Score=9.04 Aligned_cols=19 Identities=21% Similarity=0.371 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8899999999999999999 Q gi|254781029|r 139 EVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 139 ~~~~~~~~~~~~p~~~~~~ 157 (1828) +.+++-+++.+.|++-||+ T Consensus 16 g~v~~~~~~~l~p~~~ffi 34 (36) T pfam08802 16 GAVALPALGALYPYVKFFI 34 (36) T ss_pred HHHHHHHHHHHHHHEEEEC T ss_conf 4454566766412126317 No 338 >COG1175 UgpA ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism] Probab=20.87 E-value=28 Score=8.84 Aligned_cols=64 Identities=9% Similarity=0.096 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32279999999999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 97 LNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 97 s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) +...||.+..-+++|+...+.+...+ +-.++-+-+-. .++. +.+-...+.|.++-.+.+.++|+ T Consensus 62 ~d~~f~~al~nT~~~~~~~v~~~~~l--gl~lAlll~~~-~~g~-~~~r~~~~lP~~~~~vv~~~iw~ 125 (295) T COG1175 62 SDPTFWNALKNTLVFTVISVPLQLVL--GLLLALLLNRK-LRGR-GFFRTLFFLPWAISSVVAGLIWK 125 (295) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHCCC-HHHH-HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 38479999999999999999999999--99999998470-4479-99999999998999999999999 No 339 >PRK11383 hypothetical protein; Provisional Probab=20.85 E-value=28 Score=8.83 Aligned_cols=42 Identities=24% Similarity=0.126 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHH Q ss_conf 999999999999999999983162145557656406488999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSV 149 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (1828) |+++++|++-++.|++=+ ...-+=.+|.++.=.+++|.+++| T Consensus 80 w~~~~~~i~LL~iGL~NA------tL~lsEKGfy~maf~lsLF~aiaV 121 (145) T PRK11383 80 LTVFIISVALLMVGLWNA------TLLLSEKGFYGLAFFLSLFGAVAV 121 (145) T ss_pred HHHHHHHHHHHHHHHHHC------CHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 999999999999999724------042654239999999999988987 No 340 >pfam02554 CstA Carbon starvation protein CstA. This family consists of Carbon starvation protein CstA a predicted membrane protein. It has been suggested that CstA is involved in peptide utilisation. Probab=20.83 E-value=28 Score=8.83 Aligned_cols=69 Identities=14% Similarity=0.215 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 79898999854187322799999999999999999999831621455576564064889999999999999999999999 Q gi|254781029|r 83 GGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 83 ~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) +|||.+-...=++..+..+.+-..+.++-+.+.-+..... .|...|.. ++.....+||.+++++. |- T Consensus 114 ~Si~~i~~~~lG~~~~~lf~if~~~~lilV~AvF~~v~a~----------~~~~~P~~--~~~~~~~i~iA~~~G~~-~y 180 (382) T pfam02554 114 KSLGEIAREELGPVAGVLFLIFILFILILVLAVFALVVVN----------ALAASPWG--VFTVAMTIPIAILMGIY-RY 180 (382) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHCCCHH--HHHHHHHHHHHHHHHHH-HH T ss_conf 4699999998648899999999999999999999999999----------98518099--99999999999998788-52 Q ss_pred HH Q ss_conf 99 Q gi|254781029|r 163 SR 164 (1828) Q Consensus 163 ~r 164 (1828) +| T Consensus 181 ~~ 182 (382) T pfam02554 181 KR 182 (382) T ss_pred CC T ss_conf 14 No 341 >PRK00665 petG cytochrome b6-f complex subunit PetG; Reviewed Probab=20.83 E-value=28 Score=8.83 Aligned_cols=27 Identities=19% Similarity=0.101 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 488999999999999999999999999 Q gi|254781029|r 138 PEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 138 ~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) |.+.|++.|.+-+.++-.|+.|.+-.| T Consensus 4 plL~GiVLGlipvTl~GLfvaAy~QYr 30 (38) T PRK00665 4 PLLCGIVLGLVPVTLAGLFVAAWNQYR 30 (38) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 699778997389999999999999985 No 342 >PRK10369 heme lyase subunit NrfE; Provisional Probab=20.80 E-value=28 Score=8.82 Aligned_cols=84 Identities=18% Similarity=0.011 Sum_probs=0.0 Q ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98989998541873227999999999999999999998316214555765640648899999999999999999999999 Q gi|254781029|r 84 GITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 84 ~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) +..+++|++-..--|..+..-+++-++=...-.-+..-+-+.+.+.|.-.|+..|.. |++...++.-++++ .+...++ T Consensus 237 ~~TAllHS~~v~~kr~~l~~w~~~L~i~tF~lsl~GTFlVRSGVL~SVHAFA~Dp~r-G~~iL~~l~~v~~~-sl~l~~~ 314 (552) T PRK10369 237 SATALLHSLSLTRQRGIFRHWSLLLAIVTLMLSLLGTLIVRSGILVSVHAFALDNVR-AVPLFSLFALISLA-SLALYGW 314 (552) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCC-HHHHHHHHHHHHHH-HHHHHHH T ss_conf 999999999999986058879999999999999987672213544453003669960-69999999999999-9999999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781029|r 164 RARDMH 169 (1828) Q Consensus 164 r~~~~~ 169 (1828) |+..++ T Consensus 315 r~~~~~ 320 (552) T PRK10369 315 RARDGG 320 (552) T ss_pred HCCCCC T ss_conf 701236 No 343 >TIGR02772 Ku_bact Ku protein; InterPro: IPR009187 This superfamily consists of prokaryotic Ku domain containing proteins. In the eukaryotes it has been shown that the Ku protein is involved in repairing DNA double-strand breaks by non-homologous end-joining , . The Ku protein is a heterodimer of approximately 70 kDa and 80 kDa subunits . Both these subunits have strong sequence similarity and it has been suggested that they may have evolved by gene duplication from a homodimeric ancestor in eukaryotes . The prokaryotic Ku members are homodimers and they have been predicted to be involved in the DNA repair system, which is mechanistically similar to eukaryotic non-homologous end joining , . Recent findings have implicated yeast Ku in telomeric structure maintenance in addition to on-homologous end-joining. Some of the phenotypes of the Ku-knockout mice may indicate a similar role for Ku at mammalian telomeres . Evolutionary notes: With current available phyletic information it is difficult to determine the correct evolutionary trajectory of the Ku domain. It is possible that the core Ku domain was present in bacteria and archaea even before the presence of the eukaryotes. Eukaryotes might have vertically inherited the Ku-core protein, from a common ancestor shared with a certain archaeal lineage or through horizontal transfer from bacteria. Alternatively, the core Ku domain could have evolved in the eukaryotic lineage and then horizontally transferred to the prokaryotes. Sequencing of additional archaeal genomes and those of early-branching eukaryotes may help resolving the evolutionary history of the Ku domain. Structure notes: The eukaryotic Ku heterodimer comprises an alpha/beta N-terminal, a central beta-barrel domain and a helical C-terminal arm . Structural analysis of the Ku70/80 heterodimer bound to DNA indicate that subunit contacts lead to the formation of a highly charged channel through which the DNA passes without making any contacts with the DNA bases . For additional information please see .. Probab=20.79 E-value=28 Score=8.81 Aligned_cols=14 Identities=36% Similarity=0.425 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 39999999832472 Q gi|254781029|r 1814 GKVYTMLVHASGRT 1827 (1828) Q Consensus 1814 g~~y~~la~a~gr~ 1827 (1828) +|+|++|.+|.-++ T Consensus 121 ~kay~LLR~Al~~~ 134 (271) T TIGR02772 121 EKAYALLREALEDT 134 (271) T ss_pred CCHHHHHHHHHHHC T ss_conf 61578999999860 No 344 >pfam09852 DUF2079 Predicted membrane protein (DUF2079). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=20.77 E-value=28 Score=8.81 Aligned_cols=79 Identities=13% Similarity=0.176 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC---------------CHHHHHHHHHHHHCHHH--------- Q ss_conf 898999854187322799999999999999999999831---------------62145557656406488--------- Q gi|254781029|r 85 ITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFY---------------TNSPMDSFYDFAMRPEV--------- 140 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~--------- 140 (1828) +|-.+---++.+-+..-+.+.+++++|......+..-.| +++....+.+++..|.. T Consensus 159 lgl~l~l~~~~~~~~~g~~~~~~~~~~~~l~~~vi~P~f~~~~~~~~~~~~~~~g~s~~~il~~~~~~P~~~l~~l~~~~ 238 (451) T pfam09852 159 LGLWLLLRRRWKKRRWGLALLLLGLAWFLLVTNVLIPLFSSGESGRFMVEYSYLGGSTLEILLSLLLQPALLLQELFTPP 238 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCHHHHHHHHHHCHHHHHHHHCCCC T ss_conf 99999997330023699999999999999999999986267665057776435678789999998769899998814534 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999 Q gi|254781029|r 141 ITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 141 ~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) --.++...++|.+|++.+....| T Consensus 239 kl~yll~l~lp~~~l~~~~~~~~ 261 (451) T pfam09852 239 KLLYLLLLLLPLGFLPLLSSPTW 261 (451) T ss_pred HHHHHHHHHHHHHHHHHHCCHHH T ss_conf 39999999999999998605899 No 345 >pfam12127 YdfA_immunity SigmaW regulon antibacterial. This protein is found in bacteria. Proteins in this family are about 330 amino acids in length. The operon from which this protein is derived confers immunity for the host species to a broad range of antibacterial compounds, unlike the specific immunity proteins that are linked to and co-regulated with their antibiotic-synthesis proteins. Probab=20.76 E-value=28 Score=8.80 Aligned_cols=27 Identities=4% Similarity=0.040 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 648899999999999999999999999 Q gi|254781029|r 137 RPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 137 ~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) ..+.+-++++++++-++||||+-...| T Consensus 3 ~i~~~~~~~~~~i~~~v~~~fvp~~lW 29 (321) T pfam12127 3 LIGILVIAALILIFLAVFFSFVPVGLW 29 (321) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 066899999999999999999568798 No 346 >pfam06040 Adeno_E3 Adenovirus E3 protein. This family consists of several Adenovirus E3 proteins. The E3 protein does not seem to be essential for virus replication in cultured cells suggesting that the protein may function in virus-host interactions. Probab=20.75 E-value=28 Score=8.80 Aligned_cols=34 Identities=21% Similarity=0.023 Sum_probs=0.0 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHH-HHHHHHCCH Q ss_conf 5418732279999999999999999-999983162 Q gi|254781029|r 92 IGKNSLNIALRNAFMSSLIWLACEL-GFLIKFYTN 125 (1828) Q Consensus 92 l~~~~s~~~~~~~~~~s~~w~~~~~-~~~~~~~~~ 125 (1828) |..-|+..||-.+..+.+=-++||+ .+...+|.| T Consensus 74 Lr~~pt~~Pw~v~G~v~LGvvaGGlv~ilcyLYtP 108 (126) T pfam06040 74 LRGLPTSGPWVVAGLVTLGLVAGGLVLILCYLYTP 108 (126) T ss_pred ECCCCCCCCEEEEEHHHHHHHHCCHHHHHHHHHCC T ss_conf 44799899815410477888852499999998432 No 347 >pfam09793 AD Anticodon-binding domain. This domain of approximately 100 residues is conserved from plants to humans. It is frequently found in association with Lsm domain-containing proteins. It is an anticodon-binding domain of a prolyl-tRNA synthetase, whose PDB structure is available under the identifier 1h4q. Probab=20.75 E-value=28 Score=8.80 Aligned_cols=47 Identities=11% Similarity=-0.043 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 42157999998718013899999975074562067741252599899999985 Q gi|254781029|r 1719 SIDSLVENISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQY 1771 (1828) Q Consensus 1719 ~l~s~s~di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y 1771 (1828) |+.+|-++--+.--.++..+....-.+--.|| |+-||..||.|.+.| T Consensus 1 p~~~ld~~~l~~R~~~ai~~~~~~~~~~g~gV------S~egQ~lFdal~Kt~ 47 (90) T pfam09793 1 PPPPVDLERLEKRLRKAIEEAKEKEARIGKGV------SPEGQELFDALAKTI 47 (90) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC------CHHHHHHHHHHHHHC T ss_conf 97734899999999999999999999708998------999999999999768 No 348 >PRK05880 F0F1 ATP synthase subunit C; Validated Probab=20.74 E-value=28 Score=8.80 Aligned_cols=61 Identities=15% Similarity=0.127 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHH------HHHHHHHHHHCHHHHH-----HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999831621------4555765640648899-----999999999999999999999 Q gi|254781029|r 103 NAFMSSLIWLACELGFLIKFYTNS------PMDSFYDFAMRPEVIT-----LFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 103 ~~~~~s~~w~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~-----~~~~~~~~p~~~~~~~a~~~~ 163 (1828) ++.+.++.-+++|+.+..+-.+.+ .-..+.+..-+|..-+ .|..+.++--+.|+.+|.|.| T Consensus 2 n~ii~A~a~ig~Gl~~gl~aIGagIG~Gl~~~~~iEGiARQPEa~~~L~t~~fIs~gL~Ea~~~I~lafal~ 73 (81) T PRK05880 2 DPTIAAGALIGGGLIMAGGAIGAGIGDGIAGNALISGVARQPEAQGRLFTPFFITVGLVEAAYFINLAFMAL 73 (81) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 438999999998999875454002225689889999986182343247899999999999999999999999 No 349 >PRK12911 bifunctional preprotein translocase subunit SecD/SecF; Reviewed Probab=20.74 E-value=28 Score=8.80 Aligned_cols=64 Identities=16% Similarity=0.191 Sum_probs=0.0 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99985418732279999999999999999999983162145557656406488999999999999999999 Q gi|254781029|r 88 ILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 88 ~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 158 (1828) +-.|+..-=||++++-.|.+-++ ++..+||++.|..++=-+...-++|.+--.+|++++++|+. T Consensus 1334 VN~SINQTLSRTInTSLTTLLVL-------LaLfIFGGeSIrdFAFALLIGIVVGTYSSIFIASPLLLfL~ 1397 (1405) T PRK12911 1334 INDALQKTLGRTVMTTATTLSVL-------LILLFVGGGSIFNFAFIMTIGILLGTLSSLYIAPPLLLFMV 1397 (1405) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999854235678889999999-------99999601789999999999999877648997999999981 No 350 >COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism] Probab=20.72 E-value=28 Score=8.80 Aligned_cols=93 Identities=11% Similarity=0.172 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCCHHHHH---------HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH Q ss_conf 8989998541873227999---------9999999999999999983162145557656406488999999999999999 Q gi|254781029|r 85 ITHILQNIGKNSLNIALRN---------AFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLF 155 (1828) Q Consensus 85 ~~~~~~~l~~~~s~~~~~~---------~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 155 (1828) ++-+-.-+-+||...+|.+ |++-+++=+++++++.|-.. ..+......-. .+.+.+.+++.++-++-+ T Consensus 66 lal~A~~~a~r~~~~~~TfGy~R~eiLaa~~nav~Li~~s~~I~~EAi--~R~~~P~~i~~-~~ml~va~~GL~vN~~~a 142 (296) T COG1230 66 LALIAIKLARRPATKRFTFGYKRLEILAAFLNALLLIVVSLLILWEAI--QRLLAPPPIHY-SGMLVVAIIGLVVNLVSA 142 (296) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCCCCC-CCHHHHHHHHHHHHHHHH T ss_conf 999999984177778777307689999999999999999999999999--99648888886-145999999999999999 Q ss_pred HHHHH-----HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999-----99999999999999999999 Q gi|254781029|r 156 FSFFI-----MISRARDMHDASQSIAGIAL 180 (1828) Q Consensus 156 ~~~a~-----~~~r~~~~~~~a~~m~~~a~ 180 (1828) |.|.- +--|+-=|+..+-.++-|+. T Consensus 143 ~ll~~~~~~~lN~r~a~LHvl~D~Lgsv~v 172 (296) T COG1230 143 LLLHKGHEENLNMRGAYLHVLGDALGSVGV 172 (296) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 996078743403778999999999977999 No 351 >PRK05137 tolB translocation protein TolB; Provisional Probab=20.72 E-value=28 Score=8.80 Aligned_cols=27 Identities=15% Similarity=0.247 Sum_probs=0.0 Q ss_pred HHHCCCHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 8718013899999975074562067741 Q gi|254781029|r 1729 KFIDYDAFVQLWKSYTLGEDDIFSKRLY 1756 (1828) Q Consensus 1729 ~~~~~~~~~~~w~~y~~g~~~~f~~~~~ 1756 (1828) |.|-|..++.+..+ .-|++++|.=||. T Consensus 146 R~~aH~iaD~Iye~-LTG~~G~F~TrIa 172 (437) T PRK05137 146 RRAAHKIADAIYER-LTGEKGYFDTRIV 172 (437) T ss_pred HHHHHHHHHHHHHH-HCCCCCCCCCEEE T ss_conf 99999999999998-6099865556089 No 352 >PRK05572 sporulation sigma factor SigF; Validated Probab=20.70 E-value=28 Score=8.79 Aligned_cols=41 Identities=10% Similarity=0.194 Sum_probs=0.0 Q ss_pred HCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 1801389999997507456206774125259989999998507 Q gi|254781029|r 1731 IDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKA 1773 (1828) Q Consensus 1731 ~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~ 1773 (1828) ++++--.++|.+|+.||+. .|.-....--.-.--|++||.. T Consensus 12 Ls~eee~~L~~~~~~Gd~~--Ar~~Li~~nlrLV~siA~ry~~ 52 (251) T PRK05572 12 LKDEENKELIKKAQDGDKE--ARDKLVEKNLRLVWSVVQRFLN 52 (251) T ss_pred CCHHHHHHHHHHHHCCCHH--HHHHHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999807999--9999999859999999987758 No 353 >PRK00011 glyA serine hydroxymethyltransferase; Reviewed Probab=20.70 E-value=28 Score=8.79 Aligned_cols=42 Identities=17% Similarity=0.209 Sum_probs=0.0 Q ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999---999999999999999730925556999999999999999972 Q gi|254781029|r 157 SFFIMISRA---RDMHDASQSIAGIALRLIDPEEYSSEKMQSISSAVRKEIVLM 207 (1828) Q Consensus 157 ~~a~~~~r~---~~~~~~a~~m~~~a~rl~~p~~~a~~~~~~~~qa~r~ev~a~ 207 (1828) +...|++|| .||+-||.-|.+|...--.|.... .||.||..+ T Consensus 361 Gt~a~TtrG~~e~em~~ia~~I~~~l~~~~d~~~~~---------~ir~eV~~L 405 (415) T PRK00011 361 GTPAITTRGFKEAEMEEIAELIADVLDNPEDEAVLE---------EVKEEVKEL 405 (415) T ss_pred CCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHH---------HHHHHHHHH T ss_conf 678897489988999999999999983789889999---------999999999 No 354 >PRK12523 RNA polymerase sigma factor; Reviewed Probab=20.69 E-value=28 Score=8.79 Aligned_cols=81 Identities=15% Similarity=0.179 Sum_probs=0.0 Q ss_pred HCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH-- Q ss_conf 180138999999750745620677412525998999999850788789978999999999999998258733789999-- Q gi|254781029|r 1731 IDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSALRNAIDRYISNFEEMLSEIAQSNDDSPLVQEH-- 1808 (1828) Q Consensus 1731 ~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l~~~~~~~~~~~~~~~~-- 1808 (1828) |-+|+++-+|.+..-....-|--=|||..-+-.+|.++|.... ..|+.++-..-.....+..+..+...- T Consensus 40 ivQd~Flrl~~~~~~~~~~~~~a~Lf~iarNl~~D~~Rr~~~~--------~~~~~~~~~~~~~~~~~~e~~~~~~e~~~ 111 (172) T PRK12523 40 LSQDTFVRLLGRPELPTPREPRAFLAAVAKGLMFDHFRRAALE--------QAYLAELALVPEAEQPSPEEQHLILEDLK 111 (172) T ss_pred HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999998277656734599999999999999999989988--------50088886266546899799999999999 Q ss_pred --------HHHHHHHHHHH Q ss_conf --------87664399999 Q gi|254781029|r 1809 --------IMSNYGKVYTM 1819 (1828) Q Consensus 1809 --------l~s~~g~~y~~ 1819 (1828) |....=.||+| T Consensus 112 ~l~~al~~Lp~~~R~v~~L 130 (172) T PRK12523 112 AIDRLLGKLSSKARAAFLY 130 (172) T ss_pred HHHHHHHCCCHHHHHHHHH T ss_conf 9999986499999999999 No 355 >pfam11658 DUF3260 Protein of unknown function (DUF3260). Some members in this family of proteins are annotated as YhjU however this cannot be confirmed. Currently this family has no known function. Probab=20.68 E-value=28 Score=8.78 Aligned_cols=52 Identities=8% Similarity=0.147 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHH-----------------HHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999998316214555765640648899-----------------999999999999999999999 Q gi|254781029|r 111 WLACELGFLIKFYTNSPMDSFYDFAMRPEVIT-----------------LFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 111 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~p~~~~~~~a~~~~ 163 (1828) |++..+|++..++ ++++-.+..+....+-+. -+++++++=++.+|++..++| T Consensus 47 ~iAi~igi~Lly~-Ds~LPp~~rl~~q~~ql~~Fs~~Yl~EL~~Rfi~~~ml~~~~vllv~y~~ls~wiR 115 (516) T pfam11658 47 LIAIPIAIALLYH-DSWLPPLDRLLSQAGQLLGFSLSYLLELLGRFINWQMLLALFVLLVAYLLLSRWLR 115 (516) T ss_pred HHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999987-51599799999987777256799999999740799999999999999998653355 No 356 >KOG2677 consensus Probab=20.68 E-value=28 Score=8.78 Aligned_cols=63 Identities=8% Similarity=-0.031 Sum_probs=0.0 Q ss_pred HHCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC------CCCCCCCCCCCCCCCCCCCC Q ss_conf 6230120157676666633200123688876631023467888------86555567788888788877 Q gi|254781029|r 16 KEFEEQSTSDSHENKKRHHSRKKHKYSRRDDSIESKDRQQTTL------SKNFKQASNKQTYSGFYPAN 78 (1828) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~pa~ 78 (1828) +..+++...+...-..+.+....-..++..+++...+++.... .+..+.+.-++.+|-.+||| T Consensus 55 ~~~~~~~~~~~~~~~~~~~~~~~~t~a~~~p~~a~~~~p~~~~~~~d~a~~~~~~p~a~~~pp~~~~a~ 123 (922) T KOG2677 55 GDAPTASEPVQELSPTPEASSSTGTSAPARPPPARPEAPKKDETAVDAAAPPRPPPAARPPPPRPAPAN 123 (922) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 778876676645689940014665457889998789999745565332358999855579999988777 No 357 >COG4987 CydC ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=20.67 E-value=28 Score=8.78 Aligned_cols=113 Identities=18% Similarity=0.244 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999831621455576564064889999--99999999999999999999999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLF--LVVSVVPILLFFSFFIMISRARDMHDASQSIAGIA 179 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~p~~~~~~~a~~~~r~~~~~~~a~~m~~~a 179 (1828) |+...++++-...+++.+.+++. ++|.|-+...++.. +-.+++|++..=+||++ |+.+-.- T Consensus 15 ~~~l~Lgi~l~~~t~lasigLl~------lSGwfisasAiag~~~~f~~~~p~a~VR~~aI~-----------Rt~~RY~ 77 (573) T COG4987 15 KFGLLLGIVLAILTLLASIGLLT------LSGWFISASAIAGLAYIFNVMLPSAGVRGLAIL-----------RTAARYV 77 (573) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH-----------HHHHHHH T ss_conf 87799999999999999999998------768999887642678999999799999999998-----------8899999 Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 973092555699999999999999997236899999---999999999999999999999 Q gi|254781029|r 180 LRLIDPEEYSSEKMQSISSAVRKEIVLMTEEIDRAI---SRASELEKTVRSEIEVLENNY 236 (1828) Q Consensus 180 ~rl~~p~~~a~~~~~~~~qa~r~ev~a~~~~~er~~---~ra~ele~lv~~ev~~ler~y 236 (1828) -|| ++.+.+.+|-.++|-.+=-+=+-.--++ -|-++|=..+-..|.+|..-| T Consensus 78 ERl-----vsH~AtfrvL~~lRv~~f~kl~p~sp~~~~r~r~gdLL~RLvaDVd~Ld~ly 132 (573) T COG4987 78 ERL-----VSHDATFRVLSALRVRLFEKLEPLSPALLLRYRSGDLLNRLVADVDALDNLY 132 (573) T ss_pred HHH-----HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 998-----6348999999999999998514677378874472758999987588886678 No 358 >PRK08156 surface presentation of antigens protein SpaS; Validated Probab=20.63 E-value=28 Score=8.77 Aligned_cols=70 Identities=11% Similarity=0.031 Sum_probs=0.0 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4187322799999999999999999999831621455576564064889999999999999999999999 Q gi|254781029|r 93 GKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 93 ~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) |.--|+=....++++.++|...+.+-...+...-......++...+..+....+..++.+++.+++..++ T Consensus 21 qVarSkEl~~a~~ll~g~l~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~~~~l 90 (367) T PRK08156 21 QSFKSKDLTTAVVILGGIAYLVSFGSLNEFMGIYRRILADNFDISMGDYFFAVAILFLKILIPFLLLCLL 90 (367) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9987487999999999999999999999999999999817432469999999999999999999999999 No 359 >TIGR00785 dass transporter, divalent anion:Na+ symporter (DASS) family; InterPro: IPR001898 Integral membrane proteins that mediate the intake of a wide variety of molecules with the concomitant uptake of sodium ions (sodium symporters) can be grouped, on the basis of sequence and functional similarities into a number of distinct families. One of these families currently consists of the following proteins: Mammalian sodium/sulphate cotransporter . Mammalian renal sodium/dicarboxylate cotransporter , which transports succinate and citrate. Mammalian intestinal sodium/dicarboxylate cotransporter. Chlamydomonas reinhardtii putative sulphur deprivation response regulator SAC1 . Caenorhabditis elegans hypothetical proteins B0285.6, F31F6.6, K08E5.2 and R107.1. Escherichia coli hypothetical protein yfbS. Haemophilus influenzae hypothetical protein HI0608. Synechocystis sp. (strain PCC 6803) hypothetical protein sll0640. Methanococcus jannaschii hypothetical protein MJ0672. These transporters are proteins of from 430 to 620 amino acids which are highly hydrophobic and which probably contain about 12 transmembrane regions.; GO: 0005215 transporter activity, 0006814 sodium ion transport, 0016020 membrane. Probab=20.61 E-value=28 Score=8.76 Aligned_cols=53 Identities=13% Similarity=0.146 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHCCHH----HHHHHHHHH---HCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999831621----455576564---064889999999999999999999999 Q gi|254781029|r 109 LIWLACELGFLIKFYTNS----PMDSFYDFA---MRPEVITLFLVVSVVPILLFFSFFIMI 162 (1828) Q Consensus 109 ~~w~~~~~~~~~~~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~p~~~~~~~a~~~ 162 (1828) ++|.+|+++++-.+..-+ ....+.+.. +.|..+.+ ...+++=+++.|.||+-+ T Consensus 329 L~lfgg~i~l~~~L~k~G~~~w~~~~l~~~L~~~g~~~~~~~-~~lv~~~~~~~y~FaS~t 388 (473) T TIGR00785 329 LVLFGGLITLAGGLKKSGFIKWLSEKLVGILGFEGLSPTIAV-LVLVVLFYILLYFFASNT 388 (473) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH-HHHHHHHHHHHHHHHHHH T ss_conf 999999999999988622379999999998521688548899-999999999999988899 No 360 >cd06480 ACD_HspB8_like Alpha-crystallin domain (ACD) found in mammalian 21.6 KDa small heat shock protein (sHsp) HspB8, also denoted as Hsp22 in humans, and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. A chaperone complex formed of HspB8 and Bag3 stimulates degradation of protein complexes by macroautophagy. HspB8 also forms complexes with Hsp27 (HspB1), MKBP (HspB2), HspB3, alphaB-crystallin (HspB5), Hsp20 (HspB6), and cvHsp (HspB7). These latter interactions may depend on phosphorylation of the respective partner sHsp. HspB8 may participate in the regulation of cell proliferation, cardiac hypertrophy, apoptosis, and carcinogenesis. Point mutations in HspB8 have been correlated with the development of several congenital neurological diseases, including Charcot Marie tooth disease and distal motor neuropathy type II. Probab=20.59 E-value=28 Score=8.76 Aligned_cols=16 Identities=13% Similarity=0.036 Sum_probs=0.0 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 7888887888775567 Q gi|254781029|r 67 NKQTYSGFYPANDLDN 82 (1828) Q Consensus 67 ~~~~~~~~~pa~~d~r 82 (1828) +++..|+..|+||+|+ T Consensus 1 g~~~~~~~~~~n~~d~ 16 (91) T cd06480 1 GGPPEPRNPPPNSSEP 16 (91) T ss_pred CCCCCCCCCCCCCCCC T ss_conf 9888756899758978 No 361 >PRK12527 RNA polymerase sigma factor; Reviewed Probab=20.56 E-value=28 Score=8.75 Aligned_cols=45 Identities=11% Similarity=0.080 Sum_probs=0.0 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 999871801389999997507456206774125259989999998 Q gi|254781029|r 1726 NISKFIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQ 1770 (1828) Q Consensus 1726 di~~~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~ 1770 (1828) +.|.=|=+|+++-+|.++...+-.-|.-=|||..-+..+|..+++ T Consensus 20 ~~AeDivQd~F~k~~~~~~~~~~~~~~a~L~~ia~N~~~d~~R~~ 64 (159) T PRK12527 20 QAAEDVAHDAYLRVLERSSSAQIEHPRAFLYRTALNLVVDRHRRH 64 (159) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999866635545699999999999999999998 No 362 >PRK13672 hypothetical protein; Provisional Probab=20.55 E-value=28 Score=8.75 Aligned_cols=10 Identities=30% Similarity=0.863 Sum_probs=0.0 Q ss_pred HHHHHHHHHH Q ss_conf 3899999975 Q gi|254781029|r 1735 AFVQLWKSYT 1744 (1828) Q Consensus 1735 ~~~~~w~~y~ 1744 (1828) +++++|+.|. T Consensus 60 iFD~aWe~Yl 69 (72) T PRK13672 60 IFDEAWEEYL 69 (72) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 363 >pfam10943 DUF2632 Protein of unknown function (DUF2632). This is a family of membrane proteins with unknown function. Probab=20.53 E-value=28 Score=8.74 Aligned_cols=56 Identities=18% Similarity=0.288 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999983162145557656406488999999999999999999999999 Q gi|254781029|r 107 SSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 107 ~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) +|-+| -|+--.|++..-.++....=-..+-+.-|-.+.++|+.|+.|-.+-||... T Consensus 60 fspiw--rglenfwlflslaslaiaywwlpsmtftgywaltiiatilvfillimmfvk 115 (233) T pfam10943 60 FSPIW--RGLENFWLFLSLASLAIAYWWLPSMTFTGYWALTIIATILVFILLIMMFVK 115 (233) T ss_pred CCHHH--HHHHHHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 66676--507889999999999999986442443016699999999999999999999 No 364 >PRK13021 secF preprotein translocase subunit SecF; Reviewed Probab=20.49 E-value=29 Score=8.73 Aligned_cols=55 Identities=2% Similarity=-0.074 Sum_probs=0.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8732279999999999999999999983162145557656406488999999999999999999 Q gi|254781029|r 95 NSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 95 ~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 158 (1828) |+-.+....-..+-++|+.||..+-..-+ +-.+|-+.|-.-+ +|++ .|++++|++ T Consensus 229 Rti~Ts~Ttll~~~~l~~~Gg~~l~~Fa~-~l~iGii~GtySS-----IfiA---spl~~~l~~ 283 (297) T PRK13021 229 RTMVTSGTTLMTVGALWIMGGGPLEGFSI-AMFIGILTGTFSS-----ISVG---TSLPELLGL 283 (297) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHH-HHHHHHHHHHHHH-----HHHH---HHHHHHHCC T ss_conf 99999899999999999954567898999-9999887644529-----9999---999999699 No 365 >COG0733 Na+-dependent transporters of the SNF family [General function prediction only] Probab=20.47 E-value=29 Score=8.72 Aligned_cols=58 Identities=16% Similarity=0.191 Sum_probs=0.0 Q ss_pred HHHHHHHHHH---HHHHHHHHHHHCCH---HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999---99999999983162---1455576564064889999999999999999999 Q gi|254781029|r 102 RNAFMSSLIW---LACELGFLIKFYTN---SPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFF 159 (1828) Q Consensus 102 ~~~~~~s~~w---~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a 159 (1828) |+++..=+.| ++.|+-+.++.|-+ +...+.-.....-..+++.+|.+|-|.+++++.. T Consensus 216 ~~~AlGQ~FFsLSlG~g~mitYsSYL~k~~~l~~sa~~v~~~n~~~s~lAGl~Ifpa~f~~g~~ 279 (439) T COG0733 216 WLAALGQAFFSLSLGFGIMITYSSYLSKKSDLVSSALSIVLLNTLISLLAGLVIFPALFSFGAD 279 (439) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9999878999998758889876830686235012048999888899999999999999983378 No 366 >TIGR01148 mtrC tetrahydromethanopterin S-methyltransferase, subunit C; InterPro: IPR005865 This model describes the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit C in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive a sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of a methyl group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016021 integral to membrane. Probab=20.46 E-value=29 Score=8.72 Aligned_cols=52 Identities=21% Similarity=0.289 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC----CHHHHHHHHHHHHCHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999831----621455576564064889999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFY----TNSPMDSFYDFAMRPEVITLFLVVSVVPIL 153 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~ 153 (1828) ......+++|+.+|.++.|=|+ |+....=.=.|...-+.|++|...++.-+. T Consensus 173 ia~G~iAl~Fi~~gm~ilHPFNACLGP~E~q~RTL~LAv~~GaiaL~i~~~v~sL~ 228 (276) T TIGR01148 173 IANGLIALLFIIGGMAILHPFNACLGPNESQDRTLTLAVECGAIALFITGFVSSLL 228 (276) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 86229999999988775123125784178821899999989999999999999986 No 367 >KOG1025 consensus Probab=20.46 E-value=29 Score=8.72 Aligned_cols=50 Identities=12% Similarity=0.215 Sum_probs=0.0 Q ss_pred HCH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH-HHCCH Q ss_conf 064--889999999999999999999999999-999999999999999-73092 Q gi|254781029|r 136 MRP--EVITLFLVVSVVPILLFFSFFIMISRA-RDMHDASQSIAGIAL-RLIDP 185 (1828) Q Consensus 136 ~~~--~~~~~~~~~~~~p~~~~~~~a~~~~r~-~~~~~~a~~m~~~a~-rl~~p 185 (1828) +.| ..+.++.|++++++++|+.|..+++|- |.-+--+.-|.=+-. -|.|| T Consensus 624 a~p~~~~~~~viG~~Ll~~~~~~~~~~~~~~r~q~~~k~~~rr~lq~~~el~ep 677 (1177) T KOG1025 624 ATPYLATAIAVIGGLLLAFFVFLGFSLYMCRRMQIIIKRTMRRLLQLERELVEP 677 (1177) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 784134579999999999999988787422268767389999998750001576 No 368 >TIGR01197 nramp metal ion transporter, metal ion (Mn2+/Fe2+) transporter (Nramp) family; InterPro: IPR001046 The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functionally related proteins defined by a conserved hydrophobic core of ten transmembrane domains . Nramp1 is an integral membrane protein expressed exclusively in cells of the immune system and is recruited to the membrane of a phagosome upon phagocytosis. Nramp2 is a multiple divalent cation transporter for Fe2+, Mn2+ and Zn2+ amongst others. It is expressed at high levels in the intestine; and is major transferrin-independent iron uptake system in mammals . The yeast proteins Smf1 and Smf2 may also transport divalent cations . The natural resistance of mice to infection with intracellular parasites is controlled by the Bcg locus, which modulates the cytostatic/cytocidal activity of phagocytes. Nramp1, the gene responsible, is expressed exclusively in macrophages and poly-morphonuclear leukocytes, and encodes a polypeptide (natural resistance-associated macrophage protein) with features typical of integral membrane proteins. Other transporter proteins from a variety of sources also belong to this family.; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane. Probab=20.45 E-value=29 Score=8.72 Aligned_cols=80 Identities=6% Similarity=0.009 Sum_probs=0.0 Q ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98989998541873227999999999999999999998316214555765640648899999999999999999999999 Q gi|254781029|r 84 GITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 84 ~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) .||+.=-.++..+... +.+||+.. .+=..++|-+..-.-+|-..+.-..=..----=-.|=..+|+++ T Consensus 19 DPGN~~td~~AGa~~~-------ykLLwv~L-~sn~~all~Q~L~arLG~VTGldLa~~C~~~yp~~~~i~Lw~~~---- 86 (429) T TIGR01197 19 DPGNYSTDLQAGAAAG-------YKLLWVIL-LSNILALLLQRLCARLGVVTGLDLAEVCREHYPKVLRITLWILA---- 86 (429) T ss_pred CCCCHHHHHHHHHHCC-------HHHHHHHH-HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHH---- T ss_conf 7776567875555415-------05789989-99999999998888720201753579999877984156899999---- Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254781029|r 164 RARDMHDASQSIAGI 178 (1828) Q Consensus 164 r~~~~~~~a~~m~~~ 178 (1828) ||-+||--|+|| T Consensus 87 ---E~AIIatD~aEV 98 (429) T TIGR01197 87 ---ELAIIATDLAEV 98 (429) T ss_pred ---HHHHHHHHHHHH T ss_conf ---999987437699 No 369 >pfam01271 Granin Granin (chromogranin or secretogranin). Probab=20.43 E-value=29 Score=8.71 Aligned_cols=37 Identities=19% Similarity=0.251 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHHHHHHHHHHHH Q ss_conf 99999999999999831621--455576564064889999999999999 Q gi|254781029|r 107 SSLIWLACELGFLIKFYTNS--PMDSFYDFAMRPEVITLFLVVSVVPIL 153 (1828) Q Consensus 107 ~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~ 153 (1828) +.++|++|+++++...+.++ ....+..+++. ++ ++|+ T Consensus 4 a~~~~l~g~~~~~~v~~~pv~~~~~h~eemvtr-----CI-----iEVL 42 (610) T pfam01271 4 AALLALLGCAGLAAVISLPVDSPANSFQTNVTK-----CI-----IEVL 42 (610) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH-----HH-----HHHH T ss_conf 699998767888753014668865520679989-----99-----9997 No 370 >KOG4517 consensus Probab=20.36 E-value=29 Score=8.69 Aligned_cols=26 Identities=19% Similarity=0.430 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88999999999999999999999999 Q gi|254781029|r 139 EVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 139 ~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) ..+|+++++++-|.+|.--||+-.+| T Consensus 83 talgi~~ai~~fP~g~ic~~alr~rr 108 (117) T KOG4517 83 TALGIFLAIILFPFGFICCFALRKRR 108 (117) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 77658999997316877667643033 No 371 >TIGR02718 sider_RhtX_FptX siderophore transporter, RhtX/FptX family; InterPro: IPR014090 RhtX from Sinorhizobium meliloti 1021 and FptX from Pseudomonas aeruginosa appear to be single polypeptide transporters, from the major facilitator family for import of siderophores as a means to import iron. This function was suggested by proximity to siderophore biosynthesis genes and then confirmed by study of knockout and heterologous expression phenotypes.. Probab=20.36 E-value=29 Score=8.69 Aligned_cols=85 Identities=13% Similarity=0.149 Sum_probs=0.0 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHCHHHHHH Q ss_conf 7788888788877556779898999854187322799999999999999999999831621--45557656406488999 Q gi|254781029|r 66 SNKQTYSGFYPANDLDNGGITHILQNIGKNSLNIALRNAFMSSLIWLACELGFLIKFYTNS--PMDSFYDFAMRPEVITL 143 (1828) Q Consensus 66 ~~~~~~~~~~pa~~d~r~~~~~~~~~l~~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 143 (1828) .+....|..+|+.+ .=|..+-+||-.-...-=++.|..=.+.|-|+...|+.++ ....++-+.+..+..++ T Consensus 186 ~r~~~~~~ap~~~~-------ASL~~F~rRP~a~~lLAlAL~~~~~~~~~~Gl~~~yL~Da~~~~~~IG~l~~a~~~~~~ 258 (391) T TIGR02718 186 DRAVASPEAPLARR-------ASLLRFLRRPLALSLLALALLSAMTAVSGFGLSKLYLVDAGWPLEAIGRLGMAGGLVTV 258 (391) T ss_pred CCCCCCCCCCHHHH-------HHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHCCHHHHHHHHHH T ss_conf 77767788742465-------56888752564589999999998775541141022340024327763102115899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254781029|r 144 FLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 144 ~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) +.|..+ -|.+|||. T Consensus 259 ~LG~~~--------~A~lvR~~ 272 (391) T TIGR02718 259 LLGCGG--------GALLVRRL 272 (391) T ss_pred HHHHHH--------HHHHHHHH T ss_conf 997679--------99999999 No 372 >pfam02011 Glyco_hydro_48 Glycosyl hydrolase family 48. Members of this family are endoglucanase EC:3.2.1.4 and exoglucanase EC:3.2.1.91 enzymes that cleave cellulose or related substrate. Probab=20.36 E-value=27 Score=8.88 Aligned_cols=57 Identities=19% Similarity=0.420 Sum_probs=0.0 Q ss_pred HHHHCCCHHHHHHHHH----------HHCCCCCHHHH-HHHHHH-------------HHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 9871801389999997----------50745620677-412525-------------99899999985078878997899 Q gi|254781029|r 1728 SKFIDYDAFVQLWKSY----------TLGEDDIFSKR-LYTIKG-------------QKVFLNLQEQYKADSALRNAIDR 1783 (1828) Q Consensus 1728 ~~~~~~~~~~~~w~~y----------~~g~~~~f~~~-~~~~~~-------------~~~~~~i~~~y~~d~~f~~~v~r 1783 (1828) ++-+-..+.+-||..| .|||=+-|.-. +|-|.| -.+|-.||-+|..||.+-.-..- T Consensus 506 a~~~Ak~LLD~~w~~~~D~~Gv~~~E~r~dY~Rf~d~~VYvP~gwtG~mpnGd~I~~g~tF~diRs~Yk~DP~w~~v~a~ 585 (615) T pfam02011 506 ARNTAKELLDRMWANYQDEKGISAPETRADYKRFFDQEVYVPSGWTGTMPNGDVIKPGVTFLDIRSKYKQDPDWPKVEAA 585 (615) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCEECCCCCCCCCCCCCCCCCCCEEEEHHHHHHCCCCHHHHHHH T ss_conf 99999999999997368876636654323365536884564798647689977258987010515465349856999999 Q ss_pred H Q ss_conf 9 Q gi|254781029|r 1784 Y 1784 (1828) Q Consensus 1784 y 1784 (1828) | T Consensus 586 l 586 (615) T pfam02011 586 L 586 (615) T ss_pred H T ss_conf 8 No 373 >pfam11693 DUF2990 Protein of unknown function (DUF2990). This family of proteins represents a fungal protein with unknown function. Probab=20.33 E-value=29 Score=8.68 Aligned_cols=38 Identities=11% Similarity=0.269 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 41252599899999985078878997899999999999 Q gi|254781029|r 1755 LYTIKGQKVFLNLQEQYKADSALRNAIDRYISNFEEML 1792 (1828) Q Consensus 1755 ~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~fe~~l 1792 (1828) .+....+.-||.|=.--+.=+||-.-|..||.++..++ T Consensus 6 a~s~AAPtffdn~Yd~SddLaef~grVSk~I~~~kd~~ 43 (64) T pfam11693 6 ATSMAAPTFFDNIYDYSDDLAEFLGRVSKYIESAKDLF 43 (64) T ss_pred HHHHCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 31220328887462121789999999999987457761 No 374 >TIGR03580 EF_0832 conserved hypothetical protein EF_0832/AHA_3913. Members of this family of relatively rare proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown. Probab=20.32 E-value=29 Score=8.68 Aligned_cols=64 Identities=14% Similarity=0.041 Sum_probs=0.0 Q ss_pred HHHHHHH----HHHHHHHHHHHHHHHHCCHH----HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7999999----99999999999999831621----4555765640648899999999999999999999999 Q gi|254781029|r 100 ALRNAFM----SSLIWLACELGFLIKFYTNS----PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMIS 163 (1828) Q Consensus 100 ~~~~~~~----~s~~w~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~ 163 (1828) +||+|++ -|++|.....|+++...+.. -++.+.+-...-.+---..-..++-|++.|++--..| T Consensus 147 iFWlAA~DAGkrtG~w~Tv~GGla~liMGNAvPG~VLGIlIGkgvdd~Gwnk~tk~m~~~vilLFils~ffR 218 (246) T TIGR03580 147 IFWLAALDAGKRSGFWGTIFGGLAQLIMGNAVPGVVLGILIGKGVDDSGWTKVTKGMLGAIILLFILSGFFR 218 (246) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999998604652143999874489998437875413576642664434416899999999999999999971 No 375 >PRK06030 hypothetical protein; Provisional Probab=20.32 E-value=29 Score=8.68 Aligned_cols=21 Identities=19% Similarity=0.425 Sum_probs=0.0 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5078878997899999999999999 Q gi|254781029|r 1771 YKADSALRNAIDRYISNFEEMLSEI 1795 (1828) Q Consensus 1771 y~~d~~f~~~v~ry~~~fe~~l~~~ 1795 (1828) .+.||+| |.||.++|+++... T Consensus 108 ~RdD~~F----D~~v~~LEr~~~~a 128 (136) T PRK06030 108 LRDDVEF----DALVSVLEKIVNSA 128 (136) T ss_pred HCCCHHH----HHHHHHHHHHHHHH T ss_conf 5128878----99999999999999 No 376 >TIGR02476 BluB cob(II)yrinic acid a,c-diamide reductase; InterPro: IPR012825 The BluB protein is related to nitroreductase, and is found to substitute for the FldA cobalt reductase in aerobic species (by phylogenetic profiling). The bluB gene is also found to be preceded by B12 promoter elements . The gene in R. capsulatus has been shown to be essential for cobalamin biosynthesis .. Probab=20.30 E-value=29 Score=8.67 Aligned_cols=25 Identities=24% Similarity=0.294 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHH---HHHHHH Q ss_conf 7322799999999999999---999999 Q gi|254781029|r 96 SLNIALRNAFMSSLIWLAC---ELGFLI 120 (1828) Q Consensus 96 ~s~~~~~~~~~~s~~w~~~---~~~~~~ 120 (1828) |+..+|=.|.++-.+|+++ |+|+.| T Consensus 124 P~m~~yS~a~Ai~nlWLAARAeGlGvGW 151 (206) T TIGR02476 124 PEMLRYSVALAIQNLWLAARAEGLGVGW 151 (206) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 4289999999999999886240766550 No 377 >PRK02101 hypothetical protein; Validated Probab=20.29 E-value=29 Score=8.67 Aligned_cols=46 Identities=11% Similarity=0.097 Sum_probs=0.0 Q ss_pred HHHHH--HCCCHHHHHHHHHHHCCCC-----------CHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99987--1801389999997507456-----------2067741252599899999985 Q gi|254781029|r 1726 NISKF--IDYDAFVQLWKSYTLGEDD-----------IFSKRLYTIKGQKVFLNLQEQY 1771 (1828) Q Consensus 1726 di~~~--~~~~~~~~~w~~y~~g~~~-----------~f~~~~~~~~~~~~~~~i~~~y 1771 (1828) +|.++ |+...+.+-|.||+.=..+ .|+--+|+..+..+|+.-...| T Consensus 45 el~~lm~is~~lA~~n~~r~q~~~~~~~~~~~~pAi~~y~G~~Y~~L~~~~l~~~~~~~ 103 (257) T PRK02101 45 QLASLMKISDKLAGLNAQRFQDWQPDFTTANARPAILAFTGDVYDGLDADSLSEADLDY 103 (257) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHCCCCCCCCCHHHHHH T ss_conf 99999787999999999999983545676335669998276876247930288999999 No 378 >PRK07079 hypothetical protein; Provisional Probab=20.23 E-value=29 Score=8.65 Aligned_cols=49 Identities=12% Similarity=0.149 Sum_probs=0.0 Q ss_pred HCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 1801389999997507456206774125259989999998507887899789999999 Q gi|254781029|r 1731 IDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKADSALRNAIDRYISNF 1788 (1828) Q Consensus 1731 ~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d~~f~~~v~ry~~~f 1788 (1828) |-++.+...-..|. |++-|-.+|..++.... .-.||+++..|.||++|| T Consensus 1 ~~~~~~~~~~~~~~--~~~~fl~~L~~~v~ipS-------vS~~p~~~~~~~~~~a~~ 49 (468) T PRK07079 1 MTRTAAIARAAAYF--DSGAFFADLARRVALRT-------ESQNPGAAPALRAYLTDE 49 (468) T ss_pred CCHHHHHHHHHHHH--CCHHHHHHHHHHHCCCC-------CCCCCCCHHHHHHHHHHH T ss_conf 98899999999870--73189999999846788-------689975138999999999 No 379 >COG3748 Predicted membrane protein [Function unknown] Probab=20.22 E-value=29 Score=8.65 Aligned_cols=51 Identities=20% Similarity=0.234 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCHH---HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999831621---455576564064889999999999999999999999999 Q gi|254781029|r 111 WLACELGFLIKFYTNS---PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 111 w~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) |+-|.+.|...+|.+. .|.-...-+..|..|++.. +++.++|++=-++-|+ T Consensus 87 WlSGfalL~ivYY~~a~lyLIdp~v~d~s~~~AI~~~i----Asl~~GW~vYd~lC~s 140 (407) T COG3748 87 WLSGFALLCIVYYFNADLYLIDPQVLDASLKPAIAISI----ASLALGWFVYDALCRS 140 (407) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHH----HHHHHHHHHHHHHHHH T ss_conf 99999999999871775799748876532347899999----9999999999998740 No 380 >COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only] Probab=20.15 E-value=29 Score=8.63 Aligned_cols=49 Identities=16% Similarity=0.151 Sum_probs=0.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 187322799999999999999999999831621455576564064889999999999999999999999999 Q gi|254781029|r 94 KNSLNIALRNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISRA 165 (1828) Q Consensus 94 ~~~s~~~~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r~ 165 (1828) .|..+..|++.+++.+.| +...|...+.++...+ |+++++.+++++.++ T Consensus 1 ~r~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~ 49 (416) T COG1721 1 MRALRRRYAGLTLLLVAL----------------------ALSYPLGAGLGLALGL-ALLLLLLVAILFARS 49 (416) T ss_pred CCHHHHHHHHHHHHHHHH----------------------HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHCC T ss_conf 950255588899999999----------------------9987777899999999-999999987764225 No 381 >PRK08301 sporulation sigma factor SigE; Reviewed Probab=20.14 E-value=29 Score=8.63 Aligned_cols=42 Identities=5% Similarity=0.198 Sum_probs=0.0 Q ss_pred HHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 71801389999997507456206774125259989999998507 Q gi|254781029|r 1730 FIDYDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKA 1773 (1828) Q Consensus 1730 ~~~~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~ 1773 (1828) +++++=-.+++.+++.||.. +|....-..-.-+--|++||.. T Consensus 38 ~Ls~eeE~~l~~~~~~GD~~--Ar~~LI~~NLRLVv~IAkky~~ 79 (239) T PRK08301 38 PLSKEEEEVLLNKLPKGDEA--VRSILIERNLRLVVYIARKFEN 79 (239) T ss_pred CCCHHHHHHHHHHHHHCCHH--HHHHHHHHHHHHHHHHHHHCCC T ss_conf 98999999999999834699--9999999878999999975768 No 382 >COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism] Probab=20.14 E-value=29 Score=8.63 Aligned_cols=36 Identities=31% Similarity=0.229 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999998316214555765640648899999999999999999 Q gi|254781029|r 102 RNAFMSSLIWLACELGFLIKFYTNSPMDSFYDFAMRPEVITLFLVVSVVPILLFFS 157 (1828) Q Consensus 102 ~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 157 (1828) +.+-+.|++-..+-.|++.++ +|+++.++|+++.+| T Consensus 38 Rd~rL~sGl~st~i~GlaiGf--------------------vfA~vLv~illllaf 73 (73) T COG4218 38 RDARLYSGLNSTRIAGLAIGF--------------------VFAGVLVGILLLLAF 73 (73) T ss_pred CCCHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHCC T ss_conf 232145423442057899999--------------------999999998888649 No 383 >TIGR02251 HIF-SF_euk Dullard-like phosphatase domain; InterPro: IPR011948 This entry represents the putative phosphatase domain of a family of eukaryotic proteins including "Dullard" , and the NLI interacting factor (NIF)-like phosphatases .; GO: 0016791 phosphoric monoester hydrolase activity. Probab=20.13 E-value=23 Score=9.57 Aligned_cols=11 Identities=36% Similarity=0.800 Sum_probs=0.0 Q ss_pred CCCC-HHHHHHH Q ss_conf 4562-0677412 Q gi|254781029|r 1747 EDDI-FSKRLYT 1757 (1828) Q Consensus 1747 ~~~~-f~~~~~~ 1757 (1828) .+++ |+||+|+ T Consensus 103 ~~~~v~~~RlYR 114 (193) T TIGR02251 103 KRKLVISRRLYR 114 (193) T ss_pred CCCEEEEEEECC T ss_conf 996788653126 No 384 >PRK09304 arginine exporter protein; Provisional Probab=20.12 E-value=29 Score=8.62 Aligned_cols=55 Identities=11% Similarity=0.092 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHCCHH-------HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999831621-------45557656406488999999999999999999999999 Q gi|254781029|r 110 IWLACELGFLIKFYTNS-------PMDSFYDFAMRPEVITLFLVVSVVPILLFFSFFIMISR 164 (1828) Q Consensus 110 ~w~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~a~~~~r 164 (1828) +|-.....++..+++|. .+++++.-+..+.-+.-.+|++.+-.++||.++.-.+. T Consensus 94 ~~~~i~~~lavT~LNPHvyLDTv~llGs~s~~~~~~~k~~F~~Ga~~AS~iWF~~Lg~ga~~ 155 (197) T PRK09304 94 RWKIIATMLAVTWLNPHVYLDTFVVLGSLGGQLDVEPKRWFALGTISASFLWFFGLALLAAW 155 (197) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999971815888899997430034780378999999999999999999999999 No 385 >PRK07037 extracytoplasmic-function sigma-70 factor; Validated Probab=20.07 E-value=29 Score=8.61 Aligned_cols=42 Identities=17% Similarity=0.103 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 013899999975074562067741252599899999985078 Q gi|254781029|r 1733 YDAFVQLWKSYTLGEDDIFSKRLYTIKGQKVFLNLQEQYKAD 1774 (1828) Q Consensus 1733 ~~~~~~~w~~y~~g~~~~f~~~~~~~~~~~~~~~i~~~y~~d 1774 (1828) +|+++-+|.++...+-+-..-=|||..-+..+|-+++..... T Consensus 44 Qe~Flkl~~~~~~~~~~~~~awL~~iarN~~id~lRr~~~~~ 85 (183) T PRK07037 44 QDAFVKLVEAPNQDAVKQPVAYLFQMVRNLAIDHCRRQSLEN 85 (183) T ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999997454344136999999999999999999777665 No 386 >PRK05886 yajC preprotein translocase subunit YajC; Validated Probab=20.03 E-value=30 Score=8.59 Aligned_cols=30 Identities=23% Similarity=0.484 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999--999999999999999999 Q gi|254781029|r 146 VVSVVPILLFFS--FFIMISRARDMHDASQSI 175 (1828) Q Consensus 146 ~~~~~p~~~~~~--~a~~~~r~~~~~~~a~~m 175 (1828) .+.|+|++++|+ +-.|+|+-+.-.-..+.| T Consensus 4 l~~flPlili~~vf~f~~iRpQkKrqk~~~~m 35 (108) T PRK05886 4 LVLFLPFLLIMGGFMYFASRRQRKAMQATIDL 35 (108) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 89999999999999999316899999999999 No 387 >COG2928 Uncharacterized conserved protein [Function unknown] Probab=20.02 E-value=30 Score=8.59 Aligned_cols=71 Identities=14% Similarity=0.077 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH-HHHCHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998316214555765-6406488999999------999999999999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFYD-FAMRPEVITLFLV------VSVVPILLFFSFFIMISRARDMHDASQ 173 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~------~~~~p~~~~~~~a~~~~r~~~~~~~a~ 173 (1828) ||++.++-++=++..+++.+.+| +.+.++-. +...-.-.+.++. ++++-|++++....|.+++.-=++++. T Consensus 9 ~fltGLlvllPlaiT~~vv~~i~--~~l~~~~~~~lp~~~~~~~~~~~~i~~lg~il~iili~l~G~l~~~~ig~~l~~~ 86 (222) T COG2928 9 YFLTGLLVLLPLAITLWVVSWIF--GLLDQFVGPLLPDRLRPAVYFPFNIPGLGVILAIILIFLLGFLARNMIGRSLLSL 86 (222) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHCCCHHCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99988999999999999999999--9998764430421212221476034788999999999999999998755489999 No 388 >TIGR00056 TIGR00056 conserved hypothetical protein; InterPro: IPR003453 This domain has no known function nor do any of the proteins that possess it. The aligned region is approximately 150 amino acids long.. Probab=20.00 E-value=30 Score=8.59 Aligned_cols=56 Identities=11% Similarity=0.135 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH-HHHHCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999831621455576-56406488999999999999999999 Q gi|254781029|r 101 LRNAFMSSLIWLACELGFLIKFYTNSPMDSFY-DFAMRPEVITLFLVVSVVPILLFFSF 158 (1828) Q Consensus 101 ~~~~~~~s~~w~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~p~~~~~~~ 158 (1828) .|.-.+++--|.+.+-...|.-. +..-..+. |++..+-- ++|.|.+|+=|-.||+| T Consensus 179 i~GG~~~~~~~~gi~~~~f~~~~-~~~v~~w~~D~~~G~~K-a~fFGv~Ia~igc~~Gf 235 (275) T TIGR00056 179 ILGGAFIASSLLGIGSESFWSKM-QNAVEKWNYDIFAGLVK-AVFFGVAIAIIGCYYGF 235 (275) T ss_pred HHHHHHHHHHHHHCCCHHHHHHH-HHHHHHCCCCEEEECHH-HHHHHHHHHHHHHHCCC T ss_conf 86149878877514404678876-22211104421441135-78999999999973265 Done!