Query gi|255764485|ref|YP_003065139.2| hypothetical protein CLIBASIA_03065 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 243 No_of_seqs 50 out of 52 Neff 3.1 Searched_HMMs 39220 Date Sun May 29 21:12:24 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764485.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam09476 Pilus_CpaD Pilus bio 100.0 0 0 438.0 19.7 178 65-243 22-199 (200) 2 COG5461 Type IV pili component 100.0 0 0 429.5 13.9 179 64-243 34-212 (224) 3 TIGR02522 pilus_cpaD pilus (Ca 100.0 0 0 414.1 16.5 200 44-243 4-210 (211) 4 COG2885 OmpA Outer membrane pr 83.8 3.8 9.7E-05 21.6 9.2 88 77-164 77-169 (190) 5 cd07185 OmpA_C-like Peptidogly 82.4 4.3 0.00011 21.2 9.2 83 83-165 2-89 (106) 6 PRK10095 ribonuclease I; Provi 80.8 0.51 1.3E-05 27.5 0.4 26 182-207 78-106 (268) 7 PRK10802 peptidoglycan-associa 78.8 5.7 0.00014 20.4 7.3 83 82-164 68-155 (173) 8 TIGR02802 Pal_lipo peptidoglyc 73.7 7.7 0.0002 19.5 8.2 79 85-163 2-85 (104) 9 PRK09967 putative outer membra 71.5 8.7 0.00022 19.2 9.3 81 84-164 53-139 (160) 10 PRK10510 putative outer membra 70.7 9 0.00023 19.1 8.3 81 84-164 113-198 (219) 11 PRK10919 ATP-dependent DNA hel 69.0 9.7 0.00025 18.8 5.7 22 132-153 325-346 (672) 12 PRK08944 motB flagellar motor 66.4 11 0.00028 18.5 14.4 78 86-164 185-272 (305) 13 TIGR03350 type_VI_ompA type VI 62.6 13 0.00033 18.0 11.0 91 73-164 14-119 (137) 14 pfam00691 OmpA OmpA family. Th 60.1 14 0.00036 17.7 8.0 78 87-164 2-85 (97) 15 PRK10808 outer membrane protei 59.8 14 0.00036 17.7 9.2 82 83-164 224-312 (347) 16 pfam04891 NifQ NifQ. NifQ is i 56.8 8.3 0.00021 19.3 2.2 46 99-145 29-76 (167) 17 PRK06667 motB flagellar motor 55.3 17 0.00043 17.2 7.2 92 72-163 110-221 (254) 18 PRK09041 motB flagellar motor 51.9 19 0.00049 16.9 9.9 106 73-179 147-278 (317) 19 COG4470 Uncharacterized protei 47.9 5.5 0.00014 20.5 0.2 34 198-231 86-122 (126) 20 PRK11773 uvrD DNA-dependent he 47.5 22 0.00057 16.4 3.8 13 137-149 362-374 (722) 21 pfam03281 Mab-21 Mab-21 protei 47.0 5.7 0.00015 20.4 0.1 21 74-94 66-86 (360) 22 KOG3022 consensus 46.0 22 0.00056 16.4 3.0 59 82-155 161-219 (300) 23 COG0616 SppA Periplasmic serin 45.7 24 0.0006 16.2 12.2 127 21-162 11-143 (317) 24 PRK07193 fliF flagellar MS-rin 44.7 25 0.00063 16.1 7.5 18 22-40 6-23 (550) 25 TIGR02159 PA_CoA_Oxy4 phenylac 44.6 25 0.00063 16.1 5.1 45 108-155 19-67 (152) 26 PRK10954 periplasmic protein d 43.9 21 0.00054 16.5 2.6 64 54-125 15-82 (207) 27 TIGR03044 PS_II_psb27 photosys 43.4 13 0.00034 17.9 1.5 79 33-114 3-84 (135) 28 TIGR03025 EPS_sugtrans exopoly 41.6 13 0.00032 18.1 1.2 25 218-242 360-384 (445) 29 TIGR03023 WcaJ_sugtrans Undeca 41.4 14 0.00034 17.9 1.3 47 97-149 175-221 (451) 30 TIGR03013 EpsB_2 sugar transfe 41.2 14 0.00037 17.7 1.4 24 219-242 358-381 (442) 31 KOG4310 consensus 40.5 28 0.00072 15.7 4.6 35 17-51 199-236 (373) 32 KOG0431 consensus 40.0 16 0.00042 17.3 1.6 12 177-188 229-240 (453) 33 TIGR01005 eps_transp_fam exopo 39.9 27 0.00068 15.9 2.6 110 63-172 52-186 (778) 34 PRK06742 flagellar motor prote 39.5 29 0.00075 15.6 7.2 78 85-163 117-204 (225) 35 pfam01514 YscJ_FliF Secretory 39.0 30 0.00076 15.6 4.3 52 92-153 32-89 (206) 36 PRK13863 type IV secretion sys 37.0 32 0.00082 15.3 6.2 76 73-159 35-127 (447) 37 pfam00580 UvrD-helicase UvrD/R 36.5 33 0.00083 15.3 5.2 51 98-156 329-383 (494) 38 KOG4754 consensus 36.1 9.3 0.00024 19.0 -0.2 95 91-203 45-143 (248) 39 COG0210 UvrD Superfamily I DNA 33.6 36 0.00092 15.0 5.2 19 74-92 244-262 (655) 40 KOG2334 consensus 33.5 21 0.00053 16.6 1.3 68 46-113 167-238 (477) 41 TIGR01502 B_methylAsp_ase meth 31.3 39 0.001 14.7 4.9 79 65-154 228-308 (414) 42 pfam04954 SIP Siderophore-inte 30.9 40 0.001 14.7 2.7 28 130-157 86-114 (118) 43 KOG4632 consensus 30.0 34 0.00088 15.1 1.9 52 46-97 73-131 (187) 44 PRK10124 putative UDP-glucose 29.5 30 0.00077 15.5 1.5 46 98-149 189-234 (464) 45 pfam11658 DUF3260 Protein of u 29.5 42 0.0011 14.5 4.0 76 73-156 351-431 (516) 46 PRK07734 motB flagellar motor 29.4 42 0.0011 14.5 9.2 79 85-163 144-232 (259) 47 PRK06925 flagellar motor prote 29.2 43 0.0011 14.5 8.6 78 85-163 120-207 (230) 48 PRK05198 2-dehydro-3-deoxyphos 28.6 44 0.0011 14.4 4.3 42 78-121 189-230 (264) 49 pfam09500 YiiD_Cterm Putative 27.4 46 0.0012 14.3 3.3 50 73-122 15-65 (84) 50 TIGR03368 cellulose_yhjU cellu 27.3 46 0.0012 14.3 4.0 75 73-156 351-430 (518) 51 COG3389 Uncharacterized protei 27.3 1.7 4.3E-05 24.0 -5.2 44 58-101 145-189 (277) 52 pfam10070 DUF2309 Uncharacteri 26.7 47 0.0012 14.2 9.7 123 38-163 419-569 (783) 53 PRK13760 putative RNA-associat 26.3 42 0.0011 14.6 1.8 51 72-127 157-211 (233) 54 TIGR02447 yiiD_Cterm thioester 26.0 48 0.0012 14.2 2.1 50 73-122 70-120 (141) 55 KOG3485 consensus 26.0 24 0.00061 16.2 0.5 20 169-188 63-82 (86) 56 pfam02397 Bac_transf Bacterial 24.8 31 0.00078 15.5 0.9 15 114-128 101-115 (188) 57 cd07127 ALDH_PAD-PaaZ Phenylac 24.7 51 0.0013 14.0 4.6 44 112-156 218-261 (549) 58 PRK10953 cysJ sulfite reductas 23.8 53 0.0014 13.9 5.0 16 152-167 383-398 (599) 59 PRK10949 protease 4; Provision 23.3 54 0.0014 13.8 4.7 84 100-187 350-440 (618) 60 TIGR02857 CydD ABC transporter 23.1 55 0.0014 13.8 5.2 68 76-144 478-548 (570) 61 PRK11546 zraP zinc resistance 22.5 56 0.0014 13.7 2.3 83 48-150 14-120 (139) 62 TIGR03022 WbaP_sugtrans Undeca 22.4 32 0.00081 15.3 0.6 23 219-241 376-398 (456) 63 COG4277 Predicted DNA-binding 22.4 56 0.0014 13.7 2.2 35 115-149 162-204 (404) 64 PRK00484 lysS lysyl-tRNA synth 22.1 45 0.0012 14.3 1.3 10 138-147 337-346 (491) 65 PRK09040 hypothetical protein; 22.0 57 0.0015 13.6 10.8 80 84-163 90-183 (215) 66 cd01844 SGNH_hydrolase_like_6 21.7 58 0.0015 13.6 3.9 69 81-149 59-138 (177) 67 pfam03853 YjeF_N YjeF-related 21.3 59 0.0015 13.5 4.0 47 76-125 98-144 (170) 68 pfam10609 ParA ParA/MinD ATPas 21.2 60 0.0015 13.5 3.7 58 82-155 5-62 (81) 69 pfam06837 Fijivirus_P9-2 Fijiv 21.2 60 0.0015 13.5 4.4 59 5-70 98-156 (214) 70 PRK10697 DNA-binding transcrip 21.0 60 0.0015 13.5 2.8 14 10-24 18-31 (119) 71 PRK10251 phosphopantetheinyltr 20.5 61 0.0016 13.4 2.9 41 180-220 78-133 (207) 72 KOG3133 consensus 20.2 34 0.00087 15.2 0.3 84 57-156 157-240 (267) 73 cd06193 siderophore_interactin 20.2 62 0.0016 13.4 2.6 53 104-156 90-155 (235) No 1 >pfam09476 Pilus_CpaD Pilus biogenesis CpaD protein (pilus_cpaD). Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known. Probab=100.00 E-value=0 Score=438.02 Aligned_cols=178 Identities=35% Similarity=0.646 Sum_probs=175.1 Q ss_pred CCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 88711138577733772698851566768898888999999999974188729999249898878999999999999997 Q gi|255764485|r 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIIS 144 (243) Q Consensus 65 ~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~ 144 (243) -.|||+||||+|+|+++++||||+.++++|+++||++|++|+++|+++|+|.|+|++|+||+|++|+.++.+|||++|.+ T Consensus 22 p~dyr~~hpI~V~~~~~~~di~v~~~~~~Lt~~qr~~l~~f~~~~~~~g~g~i~I~~Psgs~n~~aa~~~~~~ir~~l~~ 101 (200) T pfam09476 22 PDDYRTRHPIVVEEADRTLDLPVGPGRGGLTASQRARLAGFAARYGRRGAGPLVIDVPSGSPNARAASALSAEVRALLAA 101 (200) T ss_pred CCCHHHCCCCEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 87475528966874354888732688898899999999999999845688618997589986357899999999999998 Q ss_pred CCCCCCCEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHCCCC Q ss_conf 59882340661036888543684667754445417888998133146755788623336256887898626968850688 Q gi|255764485|r 145 SGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPR 224 (243) Q Consensus 145 ~GVp~~~I~v~~Y~a~~p~~aAPIRLSY~ri~A~ag~CG~WPeDL~~s~~~Nk~y~NFGCAtQ~NLAAqVaNP~DLv~PR 224 (243) +||++++|++++|+++++.+++||||||.+++|++++||.||+|++ .+.+|++|+|||||+|+||||||+||.||++|| T Consensus 102 ~Gv~~~~I~~~~y~~~~~~~~a~IRl~~~r~~A~~p~Cg~w~~dl~-~~~~N~~~~N~GCa~~~nlAamva~P~DL~~~r 180 (200) T pfam09476 102 AGVPPSNISVRAYDAAGARAPGPIRVSYTRYTAKVPPCGDWPEDLG-ATFENKPYPNFGCATQNNLAAMVANPRDLVQPR 180 (200) T ss_pred CCCCHHHEEECCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCC-CCCCCCCCCCCCHHHHHHHHHHHCCHHHHCCCC T ss_conf 5998431554225777878887789999998976899999822255-543458788845274778998856988872778 Q ss_pred CCCCCCHHHHHHHHHHHCC Q ss_conf 8884436678889987519 Q gi|255764485|r 225 MVTPPDAEQRDKSIQRYRQ 243 (243) Q Consensus 225 ~~tP~Da~RR~~viekYR~ 243 (243) +++|+|+.||+++|||||+ T Consensus 181 ~~~~~d~~~~~~~~~~yr~ 199 (200) T pfam09476 181 AMTPADGARRTTVIEKYRK 199 (200) T ss_pred CCCCCCHHHHHHHHHHHHC T ss_conf 7897508788879988518 No 2 >COG5461 Type IV pili component [Cell motility and secretion] Probab=100.00 E-value=0 Score=429.53 Aligned_cols=179 Identities=44% Similarity=0.742 Sum_probs=175.4 Q ss_pred CCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH Q ss_conf 78871113857773377269885156676889888899999999997418872999924989887899999999999999 Q gi|255764485|r 64 EGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIII 143 (243) Q Consensus 64 ~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~ 143 (243) ---|||+||||++.|+++.+||||..|+.+||.+||+.++||+++|....++.++|++|+||+|+.+|.++.+|||++|+ T Consensus 34 ipddyrt~hpI~i~e~~~~~dipi~~gds~Lt~sqrd~lrgf~~~y~s~~a~~l~i~ip~gs~n~~tA~~m~~eir~~l~ 113 (224) T COG5461 34 IPDDYRTRHPIVIREVEPEIDIPILKGDSGLTASQRDRLRGFLDRYSSASADALHIQIPSGSANEVTASRMAKEIRRLLA 113 (224) T ss_pred CCCCCCCCCCEEEECCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHH T ss_conf 65323578876864257776602324765443558899999998764325771699815898425778999999999998 Q ss_pred HCCCCCCCEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHCCC Q ss_conf 75988234066103688854368466775444541788899813314675578862333625688789862696885068 Q gi|255764485|r 144 SSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 (243) Q Consensus 144 ~~GVp~~~I~v~~Y~a~~p~~aAPIRLSY~ri~A~ag~CG~WPeDL~~s~~~Nk~y~NFGCAtQ~NLAAqVaNP~DLv~P 223 (243) ++||.+.+|.+..|++++.++.|||||||.++.|.+++||+||||| .++.+|++|+|||||+|+||||||+||.||++| T Consensus 114 ~~Gv~~~ri~~~~y~a~~~~d~apIRvsyVa~~A~t~~Cg~wpedl-~~T~dN~~y~nfGCa~q~NLAAqvaNP~DLlgp 192 (224) T COG5461 114 GSGVDRARIRVVNYDASSQEDGAPIRVSYVAYTARTPKCGQWPEDL-LNTRDNKNYYNFGCATQSNLAAQVANPRDLLGP 192 (224) T ss_pred HCCCCCCEEEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCCHHHH-HHHHHCCCCCCCCCHHHHHHHHHHCCHHHHCCC T ss_conf 6488846058997334666788635899999970188766567777-643411443355505566699874798774585 Q ss_pred CCCCCCCHHHHHHHHHHHCC Q ss_conf 88884436678889987519 Q gi|255764485|r 224 RMVTPPDAEQRDKSIQRYRQ 243 (243) Q Consensus 224 R~~tP~Da~RR~~viekYR~ 243 (243) |.|+|||++||.+||++||. T Consensus 193 R~mtPiDaerr~~vidtYR~ 212 (224) T COG5461 193 RAMTPIDAERRAIVIDTYRR 212 (224) T ss_pred CCCCCCCCCCCCCEEEEEEC T ss_conf 45885641024522222322 No 3 >TIGR02522 pilus_cpaD pilus (Caulobacter type) biogenesis lipoprotein CpaD; InterPro: IPR013361 Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known.. Probab=100.00 E-value=0 Score=414.13 Aligned_cols=200 Identities=36% Similarity=0.600 Sum_probs=184.5 Q ss_pred HHHHHHHHHHHHCCC--EEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 999999999960220--011027887111385777337726988515667688988889999999999741887299992 Q gi|255764485|r 44 LGQLFFLLLFYGTSA--LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLI 121 (243) Q Consensus 44 ~~~~~~~~~~~~~~~--~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~v 121 (243) +.-+.++|-=-++.. ..--.--.|||++|||+|+|.+++++|||+.+++||++.|++++.+|+.+|.|.++..|+|.+ T Consensus 4 ~~a~~~~LagC~~~~P~~~~~~~p~d~~~~~PI~v~~~~~~~~l~v~~~~~Gl~a~~~~~l~~~l~~~~~~~~~~l~~~~ 83 (211) T TIGR02522 4 LLALTALLAGCASTDPEIDVGALPDDYRQRHPIAVQEESKSIDLLVAPGDRGLSASQQDQLLGFLKDWSRASAQTLVVVI 83 (211) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 98899997530588875553432345345887355117701345544888767988899999999998630585378975 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCCHHHCCCCCC-----C Q ss_conf 49898878999999999999997598823406610368885436846677544454178889981331467557-----8 Q gi|255764485|r 122 PSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----N 196 (243) Q Consensus 122 PsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~~p~~aAPIRLSY~ri~A~ag~CG~WPeDL~~s~~~-----N 196 (243) |+||.|+.|+..+.+|||++|.++||+++.|+...|+..++++.++|||||.+|.|++++||.|||||+.+..+ | T Consensus 84 p~GSaN~~aa~~~~~e~~~~l~~~Gv~a~~iV~~~~~~~~~~~~g~iRv~~~~~~A~t~~Cg~~~e~l~~~~~~~GyneN 163 (211) T TIGR02522 84 PSGSANEAAAEAMAAEIRRVLAASGVGARNIVKVVYRAEASGDEGPIRVSYVAIEAQTTKCGSWPEDLLLETLEDGYNEN 163 (211) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCCHHEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCCHHHHCCCCCCCCCCC T ss_conf 88875468899999999999996599720417744557887766655799999887668888885454035678762245 Q ss_pred CCCCCCCHHHHHHHHHHHCCHHHHCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 86233362568878986269688506888884436678889987519 Q gi|255764485|r 197 RNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 (243) Q Consensus 197 k~y~NFGCAtQ~NLAAqVaNP~DLv~PR~~tP~Da~RR~~viekYR~ 243 (243) ++|.|||||+|+||||||+||+||++||.|+|+|++||.++||+||+ T Consensus 164 ~~Y~~~GCa~q~NLAa~ianP~DL~~pR~~~p~D~~~r~~~~~~YR~ 210 (211) T TIGR02522 164 EAYQNLGCATQNNLAAQIANPRDLLGPRAMTPADGERRDIVIDRYRK 210 (211) T ss_pred CCCCCCCHHHHHHHHHHCCCCHHHHCCCCCCCCCHHHCCCCCCCCCC T ss_conf 44455013678889984038433312667785365452760112358 No 4 >COG2885 OmpA Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane] Probab=83.83 E-value=3.8 Score=21.59 Aligned_cols=88 Identities=13% Similarity=0.126 Sum_probs=72.0 Q ss_pred EECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC---CCCC--CHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 3377269885156676889888899999999997418872999924---9898--8789999999999999975988234 Q gi|255764485|r 77 RKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---SPTV--SSASIRRAVKDIRKIIISSGIPVSS 151 (243) Q Consensus 77 ~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vP---sGS~--Na~AA~~~~~eIR~~L~~~GVp~~~ 151 (243) .......+|.++.+.-.|.+.-...+..++.--+....-.|+|+=- +||+ |..--++=...|+..|...||++.. T Consensus 77 ~~~~~~~~v~F~~ds~~l~p~~~~~L~~~a~~L~~~p~~~i~V~GHTD~~Gs~~yN~~LS~rRA~aV~~~L~~~Gv~~~~ 156 (190) T COG2885 77 IILNLPNDVLFDFDSSVLKPKAQATLDELAKYLKKNPITRILVEGHTDSTGSDEYNQALSERRAEAVADYLVSQGVVADR 156 (190) T ss_pred HHCCCCCEEEECCCCHHHCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 00014540683588344398999999999999987899779997557887986787688899999999999984997011 Q ss_pred EEEECCCCCCCCC Q ss_conf 0661036888543 Q gi|255764485|r 152 ISERIYDADYGMD 164 (243) Q Consensus 152 I~v~~Y~a~~p~~ 164 (243) |.+..|-.+.|-. T Consensus 157 i~~~G~G~~~Pia 169 (190) T COG2885 157 ISTVGYGEEKPIA 169 (190) T ss_pred EEEEECCCCCCCC T ss_conf 7898616457778 No 5 >cd07185 OmpA_C-like Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA. OmpA-like domains (named after the C-terminal domain of Escherichia coli OmpA protein) have been shown to non-covalently associate with peptidoglycan, a network of glycan chains composed of disaccharides, which are crosslinked via short peptide bridges. Well-studied members of this family include the Escherichia coli outer membrane protein OmpA, the Escherichia coli lipoprotein PAL, Neisseria meningitdis RmpM, which interact with the outer membrane, as well as the Escherichia coli motor protein MotB, and the Vibrio flagellar motor proteins PomB and MotY, which interact with the inner membrane. Probab=82.43 E-value=4.3 Score=21.22 Aligned_cols=83 Identities=13% Similarity=0.152 Sum_probs=63.3 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC---CCCC--CHHHHHHHHHHHHHHHHHCCCCCCCEEEECC Q ss_conf 9885156676889888899999999997418872999924---9898--8789999999999999975988234066103 Q gi|255764485|r 83 VDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---SPTV--SSASIRRAVKDIRKIIISSGIPVSSISERIY 157 (243) Q Consensus 83 lDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vP---sGS~--Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y 157 (243) +.|++..+.-.|++..+..+..++........-.|.|.-= .|+. |-.-...=+..|++.|...|+++..|.+..| T Consensus 2 ~~v~F~~~s~~l~~~~~~~L~~ia~~l~~~~~~~v~v~GhtD~~G~~~~N~~LS~~RA~~V~~~L~~~Gi~~~ri~~~g~ 81 (106) T cd07185 2 ITIYFDFGSAELTPEAKPLLDKLAEVLKKNPDAKIRIEGHTDSRGSDAYNQELSERRAEAVADYLVSKGVDASRITAVGY 81 (106) T ss_pred EEEEECCCCCEECHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEE T ss_conf 89996899776198899999999999985899379999984899998998999999999999999984999899899997 Q ss_pred CCCCCCCC Q ss_conf 68885436 Q gi|255764485|r 158 DADYGMDV 165 (243) Q Consensus 158 ~a~~p~~a 165 (243) -...|-.. T Consensus 82 G~~~p~~~ 89 (106) T cd07185 82 GESRPIAS 89 (106) T ss_pred CCCCCCCC T ss_conf 55576899 No 6 >PRK10095 ribonuclease I; Provisional Probab=80.76 E-value=0.51 Score=27.52 Aligned_cols=26 Identities=31% Similarity=0.947 Sum_probs=17.1 Q ss_pred CCCCCH---HHCCCCCCCCCCCCCCHHHH Q ss_conf 899813---31467557886233362568 Q gi|255764485|r 182 CGFWPE---DMLGNAKGNRNWTNYGCAYQ 207 (243) Q Consensus 182 CG~WPe---DL~~s~~~Nk~y~NFGCAtQ 207 (243) -|+||. ++...--+.+.|+.|||+++ T Consensus 78 HGLWP~~pk~~~~~gvd~r~w~r~GC~~~ 106 (268) T PRK10095 78 HGLWPGLPKSVAARGVDERRWMRFGCATR 106 (268) T ss_pred ECCCCCCCCCCCCCCCCCHHHHCCCCCCC T ss_conf 31488985443334743000010365667 No 7 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=78.82 E-value=5.7 Score=20.43 Aligned_cols=83 Identities=8% Similarity=0.028 Sum_probs=66.2 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC---CCCC--CHHHHHHHHHHHHHHHHHCCCCCCCEEEEC Q ss_conf 69885156676889888899999999997418872999924---9898--878999999999999997598823406610 Q gi|255764485|r 82 IVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---SPTV--SSASIRRAVKDIRKIIISSGIPVSSISERI 156 (243) Q Consensus 82 tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vP---sGS~--Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~ 156 (243) .-.|+++...-.|++.-+..+...+.--.....-.|+|+-- .|++ |-.--.+=...++..|...||+++.|.+.+ T Consensus 68 ~~~vyFdfd~~~l~~~~~~~L~~~a~~L~~~p~~~v~i~GhtD~~Gt~eYN~~Ls~rRA~aV~~yL~~~GV~~~ri~~vs 147 (173) T PRK10802 68 NNIVYFDLDKYDIRSDFAQMLDAHANFLRSNPSYKVTVEGHADERGTPEYNISLGERRANAVKMYLQGKGVSADQISIVS 147 (173) T ss_pred CCEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEE T ss_conf 99799158852169989999999999999889917999874699884999999999999999999998499989968987 Q ss_pred CCCCCCCC Q ss_conf 36888543 Q gi|255764485|r 157 YDADYGMD 164 (243) Q Consensus 157 Y~a~~p~~ 164 (243) |--..|.. T Consensus 148 yGee~P~~ 155 (173) T PRK10802 148 YGKEKPAV 155 (173) T ss_pred ECCCCCCC T ss_conf 66657689 No 8 >TIGR02802 Pal_lipo peptidoglycan-associated lipoprotein; InterPro: IPR014169 Members in this entry represent the peptidoglycan-associated lipoprotein (Pal or OprL) protein of the Tol-Pal system. The system appears to be involved both in the maintenance of outer membrane integrity and in the import of certain organic molecules as nutrients. Members of this family contain a hydrodrophobic lipoprotein signal sequence, a conserved N-terminal cleavage and modification site, a poorly conserved low-complexity region, together comprising about 65 amino acids, and a well-conserved C-terminal domain. The entry represents only the conserved C-terminal domain.. Probab=73.75 E-value=7.7 Score=19.53 Aligned_cols=79 Identities=11% Similarity=0.116 Sum_probs=67.4 Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC---CCCC--CHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCC Q ss_conf 85156676889888899999999997418872999924---9898--878999999999999997598823406610368 Q gi|255764485|r 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---SPTV--SSASIRRAVKDIRKIIISSGIPVSSISERIYDA 159 (243) Q Consensus 85 IpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vP---sGS~--Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a 159 (243) |++......|++.-+..+..=++=-+..+.=.|+|+== .||+ |=+.=++=..-++..|...||++..|.+-+|== T Consensus 2 vyF~~D~~~l~~e~~~~l~~hA~~L~~~p~~~V~iEGH~DERGTrEYNlALGeRRA~AV~~yL~~~GV~~~q~~~vSYGk 81 (104) T TIGR02802 2 VYFDFDSSDLKSEAQAILDAHAQYLKKNPSVRVTIEGHTDERGTREYNLALGERRANAVKDYLQAKGVSASQIETVSYGK 81 (104) T ss_pred EECCCCHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCC T ss_conf 43235814317799999999899998679937888630788888777777889999999999997389611004630266 Q ss_pred CCCC Q ss_conf 8854 Q gi|255764485|r 160 DYGM 163 (243) Q Consensus 160 ~~p~ 163 (243) .-|. T Consensus 82 ErP~ 85 (104) T TIGR02802 82 ERPA 85 (104) T ss_pred CCCC T ss_conf 5687 No 9 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=71.46 E-value=8.7 Score=19.19 Aligned_cols=81 Identities=9% Similarity=0.044 Sum_probs=61.8 Q ss_pred EEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC-----CHHHHHHHHHHHHHHHH-HCCCCCCCEEEECC Q ss_conf 8851566768898888999999999974188729999249898-----87899999999999999-75988234066103 Q gi|255764485|r 84 DIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV-----SSASIRRAVKDIRKIII-SSGIPVSSISERIY 157 (243) Q Consensus 84 DIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~-----Na~AA~~~~~eIR~~L~-~~GVp~~~I~v~~Y 157 (243) .|++..+.-.|++..+..++..+.--+..+.-.|.|+=-+.+. |-.--.+=...++..|. .+||+...|.+..| T Consensus 53 ~V~Fd~d~~~L~~~s~~~L~~~A~~L~~~p~~~v~I~GHTD~~G~~~yN~~LS~rRA~aV~~~l~~~g~i~~~ri~~~g~ 132 (160) T PRK09967 53 AILFAKNDYKLLPESQQQIQTMAAKLASTGLTHARMDGHTDNYGEDSYNEGLSLKRANVVADAWAMGGQIPRSNLTTQGL 132 (160) T ss_pred CEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEC T ss_conf 37926995011998999999999999978996299987479989989999999999999999998537854435899964 Q ss_pred CCCCCCC Q ss_conf 6888543 Q gi|255764485|r 158 DADYGMD 164 (243) Q Consensus 158 ~a~~p~~ 164 (243) --..|-. T Consensus 133 Ge~~Pv~ 139 (160) T PRK09967 133 GKKYPIA 139 (160) T ss_pred CCCCCCC T ss_conf 6567388 No 10 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=70.67 E-value=9 Score=19.07 Aligned_cols=81 Identities=10% Similarity=0.133 Sum_probs=67.4 Q ss_pred EEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC-----CHHHHHHHHHHHHHHHHHCCCCCCCEEEECCC Q ss_conf 8851566768898888999999999974188729999249898-----87899999999999999759882340661036 Q gi|255764485|r 84 DIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV-----SSASIRRAVKDIRKIIISSGIPVSSISERIYD 158 (243) Q Consensus 84 DIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~-----Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~ 158 (243) ||.|.++.-.|.+.-...+..++.-...+..-.|.|.=-+.+. |..--.+=...+++.|...||++++|....|- T Consensus 113 ~I~F~~dsa~l~p~~~~~L~~vA~~L~~yp~~~V~I~GHTD~~G~~~yN~~LS~~RA~aV~~yL~~~GV~~~Ri~~~G~G 192 (219) T PRK10510 113 NVTFDSSSATLKPAGANTLTGVAMVLKEYPKTAVNVVGYTDSTGSHDLNMRLSQQRADSVASALITQGVDASRIRTQGLG 192 (219) T ss_pred CCEEECCCHHCCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEC T ss_conf 85661885212967899999999999988992799998768967199999999999999999999859995648999876 Q ss_pred CCCCCC Q ss_conf 888543 Q gi|255764485|r 159 ADYGMD 164 (243) Q Consensus 159 a~~p~~ 164 (243) -+.|-. T Consensus 193 e~~Pia 198 (219) T PRK10510 193 PANPIA 198 (219) T ss_pred CCCCCC T ss_conf 657789 No 11 >PRK10919 ATP-dependent DNA helicase Rep; Provisional Probab=69.03 E-value=9.7 Score=18.84 Aligned_cols=22 Identities=5% Similarity=0.022 Sum_probs=8.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCEE Q ss_conf 9999999999997598823406 Q gi|255764485|r 132 RRAVKDIRKIIISSGIPVSSIS 153 (243) Q Consensus 132 ~~~~~eIR~~L~~~GVp~~~I~ 153 (243) ...+.+|.......|++.+.|. T Consensus 325 ~~i~~~ii~~~~~~~~~~~diA 346 (672) T PRK10919 325 ERVTGELIAHHFVNKTQYKDYA 346 (672) T ss_pred HHHHHHHHHHHHHCCCCCCCEE T ss_conf 9999999999873488835579 No 12 >PRK08944 motB flagellar motor protein MotB; Reviewed Probab=66.39 E-value=11 Score=18.49 Aligned_cols=78 Identities=14% Similarity=0.144 Sum_probs=54.0 Q ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEE-----ECCCC----CC-HHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 51566768898888999999999974188729999-----24989----88-7899999999999999759882340661 Q gi|255764485|r 86 PLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL-----IPSPT----VS-SASIRRAVKDIRKIIISSGIPVSSISER 155 (243) Q Consensus 86 pVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~-----vPsGS----~N-a~AA~~~~~eIR~~L~~~GVp~~~I~v~ 155 (243) .+..|.-.|++.-+..+...+.--.+.. +.|.|. +|-.+ .| +-.+.|+..-.|.++..+||+++++.+. T Consensus 185 lF~~GSa~l~p~~~~lL~~ia~~l~~~~-n~I~I~GHTD~~p~~~~~~~sNWeLSaaRA~~v~r~L~~~~gi~~~rl~v~ 263 (305) T PRK08944 185 SFPSGSAFLQPKFKPLVRKIGELLKDVP-GIITVSGHTDNMPISSELYRSNWDLSSARAVAVAHELLKVKGFDPSRLKVV 263 (305) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 7578864248768999999999997389-627997415777665556787489999999999999998069894348999 Q ss_pred CCCCCCCCC Q ss_conf 036888543 Q gi|255764485|r 156 IYDADYGMD 164 (243) Q Consensus 156 ~Y~a~~p~~ 164 (243) .|.-..|-. T Consensus 264 G~ad~~Pl~ 272 (305) T PRK08944 264 GMADTQPLV 272 (305) T ss_pred EECCCCCCC T ss_conf 865787489 No 13 >TIGR03350 type_VI_ompA type VI secretion system OmpA/MotB family protein. The flagellar motor protein MotB and the Gram-negative bacterial outer membrane protein OmpA share a region of sequence homology. This model describes a domain found fused to type VI secretion system homologs of the type IV system protein DotU (see model TIGR03349), with OmpA/MotB homology. Probab=62.65 E-value=13 Score=18.03 Aligned_cols=91 Identities=7% Similarity=0.085 Sum_probs=62.9 Q ss_pred CEEEEECCEEE------EEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCC-----C----CHHHHHHHHHH Q ss_conf 57773377269------885156676889888899999999997418872999924989-----8----87899999999 Q gi|255764485|r 73 PILMRKVEQIV------DIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT-----V----SSASIRRAVKD 137 (243) Q Consensus 73 PIvi~Ea~~tl------DIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS-----~----Na~AA~~~~~e 137 (243) -|.|.+.++.+ |+.+..|.-.|++.-+..+..++..-.... +.|.|.--+.+ . |-.....=+.. T Consensus 14 ~v~V~~~~~~v~l~l~~~~lF~~gsa~L~~~~~~~L~~va~~L~~~~-~~I~I~GHTD~~~~~~~~~~~N~~LS~~RA~a 92 (137) T TIGR03350 14 LLTVDEDGDRSVVRLRGDELFASGSAEVRADFEPLLDRIAKALAAVP-GRITVVGHTDNVPIRTSRFPSNWHLSEARAKA 92 (137) T ss_pred CEEEEEECCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 64999979999999579871889963209879999999999997099-85999998389877787751046799999999 Q ss_pred HHHHHHHCCCCCCCEEEECCCCCCCCC Q ss_conf 999999759882340661036888543 Q gi|255764485|r 138 IRKIIISSGIPVSSISERIYDADYGMD 164 (243) Q Consensus 138 IR~~L~~~GVp~~~I~v~~Y~a~~p~~ 164 (243) +.+.|...||++..|.+..|--..|.. T Consensus 93 V~~~L~~~Gi~~~rl~~~g~G~~~P~~ 119 (137) T TIGR03350 93 VADVLAQGGVPAGRVRAEGRGDSEPIA 119 (137) T ss_pred HHHHHHHCCCCHHEEEEEECCCCCCCC T ss_conf 999999729972408997012778679 No 14 >pfam00691 OmpA OmpA family. The Pfam entry also includes MotB and related proteins which are not included in the Prosite family. Probab=60.05 E-value=14 Score=17.73 Aligned_cols=78 Identities=17% Similarity=0.195 Sum_probs=54.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHC-CCCEEEEEECCCC-----CCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCC Q ss_conf 15667688988889999999999741-8872999924989-----88789999999999999975988234066103688 Q gi|255764485|r 87 LLAGRGEIKYPIHDTIRGFLEKYKND-SASVLFLLIPSPT-----VSSASIRRAVKDIRKIIISSGIPVSSISERIYDAD 160 (243) Q Consensus 87 Vg~g~g~Lt~~qr~~V~gFa~~yrr~-G~G~i~I~vPsGS-----~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~ 160 (243) +..+.-.|++..+..+..++..-... ....|+|.-=+.+ .|-.-..+=+..|++.|...||++..|.+..|-.. T Consensus 2 F~~~s~~L~~~~~~~L~~~a~~l~~~~~~~~v~I~GhtD~~g~~~~N~~LS~~RA~~V~~~L~~~Gv~~~ri~~~g~G~~ 81 (97) T pfam00691 2 FDPGSAELTAEARETLDRLAEVLKAPELKIAIKIEGHTDSRGSAKYNWELSARRAQAVANYLVNHGIPPSRISVEGYGES 81 (97) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCC T ss_conf 99981213989999999999999718997289999998999988899999999999999999985999789899887576 Q ss_pred CCCC Q ss_conf 8543 Q gi|255764485|r 161 YGMD 164 (243) Q Consensus 161 ~p~~ 164 (243) .|-. T Consensus 82 ~Pi~ 85 (97) T pfam00691 82 QPLA 85 (97) T ss_pred CCCC T ss_conf 8889 No 15 >PRK10808 outer membrane protein A; Reviewed Probab=59.85 E-value=14 Score=17.71 Aligned_cols=82 Identities=17% Similarity=0.222 Sum_probs=64.1 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHCC--CCEEEEEECCCCC-----CHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 9885156676889888899999999997418--8729999249898-----87899999999999999759882340661 Q gi|255764485|r 83 VDIPLLAGRGEIKYPIHDTIRGFLEKYKNDS--ASVLFLLIPSPTV-----SSASIRRAVKDIRKIIISSGIPVSSISER 155 (243) Q Consensus 83 lDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G--~G~i~I~vPsGS~-----Na~AA~~~~~eIR~~L~~~GVp~~~I~v~ 155 (243) .||.+....-.|++.-...+..+..+-.... .+.|+|.=-+.+. |...-.+-...++..|.+.||++.+|.+. T Consensus 224 s~v~F~fdss~l~p~~~~~L~~~~~~l~~~~~~~~~i~v~GhtD~~G~~~yN~~LS~~RA~~V~~~l~~~Gi~~~~i~~~ 303 (347) T PRK10808 224 SDVLFNFNKATLKPEGQQALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQALSERRAQSVVDYLISKGIPADKISAR 303 (347) T ss_pred CCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEE T ss_conf 42255157020586689999999999974587765189998369988878879999999999999999869984518999 Q ss_pred CCCCCCCCC Q ss_conf 036888543 Q gi|255764485|r 156 IYDADYGMD 164 (243) Q Consensus 156 ~Y~a~~p~~ 164 (243) .|--++|-. T Consensus 304 g~Ge~~pva 312 (347) T PRK10808 304 GMGESNPVT 312 (347) T ss_pred EECCCCCCC T ss_conf 864668888 No 16 >pfam04891 NifQ NifQ. NifQ is involved in early stages of the biosynthesis of the iron-molybdenum cofactor (FeMo-co), which is an integral part of the active site of dinitrogenase. The conserved C-terminal cysteine residues may be involved in metal binding. Probab=56.79 E-value=8.3 Score=19.32 Aligned_cols=46 Identities=11% Similarity=0.273 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECC--CCCCHHHHHHHHHHHHHHHHHC Q ss_conf 8999999999974188729999249--8988789999999999999975 Q gi|255764485|r 99 HDTIRGFLEKYKNDSASVLFLLIPS--PTVSSASIRRAVKDIRKIIISS 145 (243) Q Consensus 99 r~~V~gFa~~yrr~G~G~i~I~vPs--GS~Na~AA~~~~~eIR~~L~~~ 145 (243) .++.+..+..|--.....+.. .|. ..+....-..-..+++.+|.+. T Consensus 29 ~~~~~~Ll~r~FP~a~~~~~~-~~~~~~~~~~~~~~~E~~~L~~lL~~h 76 (167) T pfam04891 29 AADLAALLARHFPGALAALAA-LALALPGLLPPPLDEEEDDLRDLLLAH 76 (167) T ss_pred HHHHHHHHHHHCCCCCHHHCC-CCCCCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 999999999878784101002-343455566778844899999999974 No 17 >PRK06667 motB flagellar motor protein MotB; Validated Probab=55.31 E-value=17 Score=17.21 Aligned_cols=92 Identities=13% Similarity=-0.002 Sum_probs=55.7 Q ss_pred CCEEEEECCEEEEE------ECCCCCCCCC-HHHHHHHHHHHHHHHHCC--CCEEEEEECCCC---------C--CHHHH Q ss_conf 85777337726988------5156676889-888899999999997418--872999924989---------8--87899 Q gi|255764485|r 72 YPILMRKVEQIVDI------PLLAGRGEIK-YPIHDTIRGFLEKYKNDS--ASVLFLLIPSPT---------V--SSASI 131 (243) Q Consensus 72 HPIvi~Ea~~tlDI------pVg~g~g~Lt-~~qr~~V~gFa~~yrr~G--~G~i~I~vPsGS---------~--Na~AA 131 (243) +-|.+.+.++-|.| .+..|.-.|+ ...+.-+..++.--.... ...|.|+--+.+ + =+-.+ T Consensus 110 ~~v~i~~~~~gl~i~l~~~~lF~~gsa~l~~~~~~~~l~~l~~il~~~~~~~~~I~VeGHTD~~pi~~~~~~~sNWeLSs 189 (254) T PRK06667 110 NKVAVTEDERGLVISLVSDAFFYPGSADVKLEENRDLIQKLASFIGFLDLAGYNFRIEGHTDNVDVNPEGAWKSNWELSG 189 (254) T ss_pred CCEEEEECCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHH T ss_conf 87489974997999960685617997312647489999999999984567574289994378776687888886288999 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEECCCCCCCC Q ss_conf 99999999999975988234066103688854 Q gi|255764485|r 132 RRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 (243) Q Consensus 132 ~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~~p~ 163 (243) .|+..-+|.++..+||+++.+.+..|....|- T Consensus 190 aRA~~V~~~l~~~g~i~~~rl~v~Gygd~~P~ 221 (254) T PRK06667 190 ARAVNMLEYILNYEDQSESWFEVSGFGGSRPL 221 (254) T ss_pred HHHHHHHHHHHHCCCCCCCEEEEEEECCCCCC T ss_conf 99999999999817999545999854588768 No 18 >PRK09041 motB flagellar motor protein MotB; Validated Probab=51.87 E-value=19 Score=16.85 Aligned_cols=106 Identities=13% Similarity=0.173 Sum_probs=69.8 Q ss_pred CEEEEECCEEEEE---------ECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEE-----EC-CCC----CCHHHHHH Q ss_conf 5777337726988---------51566768898888999999999974188729999-----24-989----88789999 Q gi|255764485|r 73 PILMRKVEQIVDI---------PLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL-----IP-SPT----VSSASIRR 133 (243) Q Consensus 73 PIvi~Ea~~tlDI---------pVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~-----vP-sGS----~Na~AA~~ 133 (243) -|.+..-++-|.| .+..|.-.|++.-+.-+...+.--... .+.|.|. +| .++ .|=.--.+ T Consensus 147 ~v~v~~t~~GL~I~I~D~~~~~mF~sGSA~l~p~~~~lL~~Ia~vL~~~-~n~I~I~GHTD~~P~~~~~~~~SNWeLSaa 225 (317) T PRK09041 147 QLLIDLTQEGLRIQIVDSQNRPMFATGSAEVEPYMRDILREIAPVLNDV-PNRISLSGHTDATPYAGGEKGYSNWELSAD 225 (317) T ss_pred CEEEEEECCCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCCCCCHHHHHH T ss_conf 4479980694599998667786767886645888999999999999725-785699875587775678888995155899 Q ss_pred HHHHHHHHHHHCCCCCCCEE-EE------CCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 99999999997598823406-61------036888543684667754445417 Q gi|255764485|r 134 AVKDIRKIIISSGIPVSSIS-ER------IYDADYGMDVDTIRLSYFASKPSA 179 (243) Q Consensus 134 ~~~eIR~~L~~~GVp~~~I~-v~------~Y~a~~p~~aAPIRLSY~ri~A~a 179 (243) =...+|++|.++||++.+|. |. ++.+++|..+.==|+|-.-++-.+ T Consensus 226 RA~aarr~L~~~Gl~~~Ri~~V~G~AD~~Pl~~~dP~~a~NRRI~I~vLt~~a 278 (317) T PRK09041 226 RANASRRELVAGGMDEGKVLRVVGLASTMLLDKKDPLNPVNRRISIVVLNKKA 278 (317) T ss_pred HHHHHHHHHHHCCCCHHHEEEEEEECCCCCCCCCCCCCCCCCCEEEEEECHHH T ss_conf 99999999998499976878999835677589999876546857999978599 No 19 >COG4470 Uncharacterized protein conserved in bacteria [Function unknown] Probab=47.88 E-value=5.5 Score=20.50 Aligned_cols=34 Identities=24% Similarity=0.475 Sum_probs=17.0 Q ss_pred CCCCCCHHHH--HHHHHHHCC-HHHHCCCCCCCCCCH Q ss_conf 6233362568--878986269-688506888884436 Q gi|255764485|r 198 NWTNYGCAYQ--NNLAAQVVN-PLDLFSPRMVTPPDA 231 (243) Q Consensus 198 ~y~NFGCAtQ--~NLAAqVaN-P~DLv~PR~~tP~Da 231 (243) .|-|||||+= .|+.-|-+- -..-+-|+.++|.+- T Consensus 86 EyCnfGCaYFVl~~~~~~~~kf~~e~~~~e~e~~~~~ 122 (126) T COG4470 86 EYCNFGCAYFVLKRLRPQDIKFEEERQPTEKESPKSK 122 (126) T ss_pred HHHCCCCHHHHHHHCCHHHHHHHHHHCCHHHCCCCCC T ss_conf 9846774325533216466677887347042096322 No 20 >PRK11773 uvrD DNA-dependent helicase II; Provisional Probab=47.48 E-value=22 Score=16.40 Aligned_cols=13 Identities=23% Similarity=0.634 Sum_probs=5.7 Q ss_pred HHHHHHHHCCCCC Q ss_conf 9999999759882 Q gi|255764485|r 137 DIRKIIISSGIPV 149 (243) Q Consensus 137 eIR~~L~~~GVp~ 149 (243) .+...|...|||. T Consensus 362 ~le~aL~~~gIPy 374 (722) T PRK11773 362 VLEEALLQAGIPY 374 (722) T ss_pred HHHHHHHHCCCCE T ss_conf 8999998659984 No 21 >pfam03281 Mab-21 Mab-21 protein. Probab=47.00 E-value=5.7 Score=20.41 Aligned_cols=21 Identities=10% Similarity=0.033 Sum_probs=10.5 Q ss_pred EEEEECCEEEEEECCCCCCCC Q ss_conf 777337726988515667688 Q gi|255764485|r 74 ILMRKVEQIVDIPLLAGRGEI 94 (243) Q Consensus 74 Ivi~Ea~~tlDIpVg~g~g~L 94 (243) |.|..-+|...+.--.+.|.. T Consensus 66 ikV~sP~eF~V~l~L~~~g~~ 86 (360) T pfam03281 66 LEVISPTEFEIVLYLNQMGVF 86 (360) T ss_pred EEEECCCCEEEEEEECCCHHH T ss_conf 388468706999995000000 No 22 >KOG3022 consensus Probab=45.95 E-value=22 Score=16.44 Aligned_cols=59 Identities=17% Similarity=0.337 Sum_probs=45.8 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 69885156676889888899999999997418872999924989887899999999999999759882340661 Q gi|255764485|r 82 IVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 (243) Q Consensus 82 tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~ 155 (243) .+|-|=|+++.||+-.|-.. ..+|+|++-.|+ +.|..+..++|. .....|||-=+|++. T Consensus 161 viDtPPGtsDehls~~~~~~----------~~~gAviVTTPQ----~vAl~Dv~K~i~-fc~K~~I~ilGvVEN 219 (300) T KOG3022 161 VIDTPPGTSDEHLSLVQFLR----------ESDGAVIVTTPQ----EVALQDVRKEID-FCRKAGIPILGVVEN 219 (300) T ss_pred EEECCCCCCHHHHHEEECCC----------CCCCEEEEECCH----HHHHHHHHHHHH-HHHHCCCCEEEEEEC T ss_conf 99589998701431000044----------557349981720----556688886502-666459856888962 No 23 >COG0616 SppA Periplasmic serine proteases (ClpP class) [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion] Probab=45.71 E-value=24 Score=16.23 Aligned_cols=127 Identities=20% Similarity=0.168 Sum_probs=67.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHCCCEEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHH Q ss_conf 4899999999987567999999---9999999999960220011027887111385777337726988515667688988 Q gi|255764485|r 21 NMRSLISCLKTIFWKNFFLRTL---MLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYP 97 (243) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~ 97 (243) -+..+..|.+..+|.-+|.... +.+..+....+........ ..+ .+-|++-..+.+|.-. +.+.+.. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-----~~~Iavi~~~G~I~~~-~~~~~~~--- 80 (317) T COG0616 11 LVEDLAERLKILFWLLSLILLLAPTLKAVGLLAKRLGKLEALLR-KRG-----SKVIAVIHVEGAIVAG-GGPLRFI--- 80 (317) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCC-----CCEEEEEEEEEEEECC-CCCCCCC--- T ss_conf 88876677888888888876553778877889886344443320-136-----8569999745465348-7763446--- Q ss_pred HHHHHHHHHHHHHHCC--CC-EEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCC Q ss_conf 8899999999997418--87-29999249898878999999999999997598823406610368885 Q gi|255764485|r 98 IHDTIRGFLEKYKNDS--AS-VLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 (243) Q Consensus 98 qr~~V~gFa~~yrr~G--~G-~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~~p 162 (243) ..+++......-+... .| .+.|.-|.|++ .|+..+.++|+++-... |=-+++..|.+|+. T Consensus 81 ~~~~~~~~l~~~~~~~~vk~vvL~inSPGG~v--~as~~i~~~l~~l~~~~---PV~v~v~~~AASGG 143 (317) T COG0616 81 GGDDIEEILRAARADPSVKAVVLRINSPGGSV--VASELIARALKRLRAKK---PVVVSVGGYAASGG 143 (317) T ss_pred CHHHHHHHHHHHHCCCCCCEEEEEEECCCCCH--HHHHHHHHHHHHHHHCC---CEEEEECCCCCCHH T ss_conf 67779999999850899873899998949846--89999999999875329---89999887320289 No 24 >PRK07193 fliF flagellar MS-ring protein; Reviewed Probab=44.69 E-value=25 Score=16.12 Aligned_cols=18 Identities=44% Similarity=0.739 Sum_probs=9.6 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999875679999 Q gi|255764485|r 22 MRSLISCLKTIFWKNFFLR 40 (243) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~ 40 (243) |..+++-||. .||||-++ T Consensus 6 ~~~~~~~l~~-~w~~~~~~ 23 (550) T PRK07193 6 MDAMLSKLKQ-KWSPFRLR 23 (550) T ss_pred HHHHHHHHHH-HHHHCCCC T ss_conf 9999999999-99850425 No 25 >TIGR02159 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subunit; InterPro: IPR011883 Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation , . Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA . . Probab=44.60 E-value=25 Score=16.11 Aligned_cols=45 Identities=18% Similarity=0.283 Sum_probs=35.8 Q ss_pred HHHHCCCCEEEE-EEC--CCCCCHHHHHHHHHHHHHHHHHCC-CCCCCEEEE Q ss_conf 997418872999-924--989887899999999999999759-882340661 Q gi|255764485|r 108 KYKNDSASVLFL-LIP--SPTVSSASIRRAVKDIRKIIISSG-IPVSSISER 155 (243) Q Consensus 108 ~yrr~G~G~i~I-~vP--sGS~Na~AA~~~~~eIR~~L~~~G-Vp~~~I~v~ 155 (243) +=..+|-|..+. +.| ||=| |-+.+..+|++.+.+.| ++.=+|..+ T Consensus 19 ~V~v~g~G~~v~s~tPTySGCP---A~~~i~~~i~~A~~~~GW~~~VeV~~~ 67 (152) T TIGR02159 19 EVEVDGEGVVVKSFTPTYSGCP---ALEVIRQDIRDALRALGWVEEVEVKTS 67 (152) T ss_pred EEEECCCCEEEEEEECCCCCCH---HHHHHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 5786479479888745877723---689999999999974789751158877 No 26 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=43.86 E-value=21 Score=16.55 Aligned_cols=64 Identities=11% Similarity=0.072 Sum_probs=39.5 Q ss_pred HHCCCEEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHH---HHHHHHC-CCCEEEEEECCCC Q ss_conf 6022001102788711138577733772698851566768898888999999---9999741-8872999924989 Q gi|255764485|r 54 YGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGF---LEKYKND-SASVLFLLIPSPT 125 (243) Q Consensus 54 ~~~~~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gF---a~~yrr~-G~G~i~I~vPsGS 125 (243) ...+| +-|.||.||..=.+ -++..++++++| .-|-.|--. .+.. ...|... ..++...-+|..- T Consensus 15 ~~a~A-a~~~eG~~Y~~L~~-p~~~~~eV~EfF-sy~CpHCy~-----fEp~l~i~~~~~k~lp~~v~f~r~pv~f 82 (207) T PRK10954 15 FSASA-AQFTDGKQYTTLDK-PVAGAPQVLEFF-SFYCPHCYQ-----FEEVLHVSDNVKKKLPEGTKMTKYHVNF 82 (207) T ss_pred HHHHH-HHCCCCCCCEECCC-CCCCCCCEEEEE-EECCCCHHH-----CCHHCCCHHHHHHHCCCCCEEEEECCCC T ss_conf 77225-03548997558888-889997089999-746972551-----0700151699997489997899964452 No 27 >TIGR03044 PS_II_psb27 photosystem II protein Psb27. Members of this family are the Psb27 protein of the cyanobacterial photosynthetic supracomplex, photosystem II. Although most protein components of both cyanobacterial and chloroplast versions of photosystem II are closely related and described together by single HMM families, this family is strictly bacterial. Some uncharacterized proteins with highly divergent sequences, from Arabidopsis, score between trusted and noise cutoffs for this model but are not at this time assigned as functionally equivalent photosystem II proteins. Probab=43.39 E-value=13 Score=17.93 Aligned_cols=79 Identities=15% Similarity=0.258 Sum_probs=43.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHH---HHHHHHHHHHHH Q ss_conf 75679999999999999999960220011027887111385777337726988515667688988---889999999999 Q gi|255764485|r 33 FWKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYP---IHDTIRGFLEKY 109 (243) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~---qr~~V~gFa~~y 109 (243) ||+.++-.+|-|.--+-| ++-|.|.-+--.-.-||.+----||..-..+|++| ..+-+.... -|..|.+|+.+| T Consensus 3 l~~rl~~~alal~L~~~l-~l~~cs~a~~s~ltG~Y~~DT~~Vi~tlr~~i~lp--~d~pn~~ea~aear~~indyisrY 79 (135) T TIGR03044 3 LFSRLSRAALALVLGLCL-LLTACSGAAKTRLTGDYVEDTLAVIQTLREAIDLP--DDDPNKSEAQAEARQLINDYISRY 79 (135) T ss_pred HHHHHHHHHHHHHHHHHH-HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC--CCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 179999999999999999-97345677765678752998999999999998089--899888999999999999999987 Q ss_pred HHCCC Q ss_conf 74188 Q gi|255764485|r 110 KNDSA 114 (243) Q Consensus 110 rr~G~ 114 (243) ||++. T Consensus 80 Rr~~~ 84 (135) T TIGR03044 80 RRRPR 84 (135) T ss_pred CCCCC T ss_conf 47776 No 28 >TIGR03025 EPS_sugtrans exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase. Certain closely related transferase enzymes such as Sinorhizobium ExoY and Lactococcus EpsD lack the N-terminal domain and are not found by this model. Probab=41.62 E-value=13 Score=18.06 Aligned_cols=25 Identities=28% Similarity=0.573 Sum_probs=19.7 Q ss_pred HHHCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 8850688888443667888998751 Q gi|255764485|r 218 LDLFSPRMVTPPDAEQRDKSIQRYR 242 (243) Q Consensus 218 ~DLv~PR~~tP~Da~RR~~viekYR 242 (243) .-||+|||+.|...++=...+..|+ T Consensus 360 MSlVGPRP~~~~~~~~~~~~~~~y~ 384 (445) T TIGR03025 360 MSLVGPRPERPAEVEKYEQEIPGYM 384 (445) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 4687089997689999986557778 No 29 >TIGR03023 WcaJ_sugtrans Undecaprenyl-phosphate glucose phosphotransferase. Colanic acid biosynthesis utilizes a glucose-undecaprenyl carrier, knockout of EpsB abolishes incorporation of UDP-glucose into the lipid phase and the C-terminal portion of GumD has been shown to be responsible for the glucosyl-1-transferase activity. Probab=41.37 E-value=14 Score=17.88 Aligned_cols=47 Identities=9% Similarity=0.280 Sum_probs=27.2 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 88899999999997418872999924989887899999999999999759882 Q gi|255764485|r 97 PIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPV 149 (243) Q Consensus 97 ~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~ 149 (243) +.-++++.+.+--++.+-..|.+.+|.... ....++-..+...||.- T Consensus 175 pvlg~~~~l~~~i~~~~ideViia~~~~~~------~~~~~li~~~~~~~v~v 221 (451) T TIGR03023 175 PVLGKLDDLEELIREGEVDEVYIALPLAAE------KRILELLDALEDLTVDV 221 (451) T ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCH------HHHHHHHHHHHHCCCEE T ss_conf 736989999999996799889995483556------89999999986459879 No 30 >TIGR03013 EpsB_2 sugar transferase, PEP-CTERM system associated. Members of this protein family belong to the family of bacterial sugar transferases (pfam02397). Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria (notable exceptions appear to include Magnetococcus sp. MC-1 and Myxococcus xanthus DK 1622 ). These genes are generally found near one or more of the PrsK, PrsR or PrsT genes that have been related to the PEP-CTERM system by phylogenetic profiling methods. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species. These proteins are homologs of the EpsB protien found in Methylobacillus sp. strain 12S, which is also associated with a PEP-CTERM system, but of a distinct type. A name which appears attached to a number of genes (by transitive annotation) in this family is "undecapre Probab=41.15 E-value=14 Score=17.70 Aligned_cols=24 Identities=25% Similarity=0.395 Sum_probs=18.4 Q ss_pred HHCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 850688888443667888998751 Q gi|255764485|r 219 DLFSPRMVTPPDAEQRDKSIQRYR 242 (243) Q Consensus 219 DLv~PR~~tP~Da~RR~~viekYR 242 (243) -||+|||+.|..-+.=.+.++.|+ T Consensus 358 SlVGPRP~~p~~ve~~~~~i~~y~ 381 (442) T TIGR03013 358 SFVGPRPERPEFVEKLSEEIPYYN 381 (442) T ss_pred EEECCCCCCCHHHHHHHHHHHHHH T ss_conf 787189986148999987545466 No 31 >KOG4310 consensus Probab=40.52 E-value=28 Score=15.70 Aligned_cols=35 Identities=29% Similarity=0.447 Sum_probs=24.1 Q ss_pred HHHHHHHHHH-HH--HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7676489999-99--99987567999999999999999 Q gi|255764485|r 17 KGLANMRSLI-SC--LKTIFWKNFFLRTLMLGQLFFLL 51 (243) Q Consensus 17 ~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~ 51 (243) .-|+|||--+ .- -...||.|+|.|.-.+-|-..+- T Consensus 199 ~~LakmRFaLVPK~VKEeqFWqNYFYRigLIkQS~laq 236 (373) T KOG4310 199 ELLAKMRFALVPKLVKEEQFWQNYFYRIGLIKQSALAQ 236 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 57765588860798888899998889999999999988 No 32 >KOG0431 consensus Probab=39.96 E-value=16 Score=17.32 Aligned_cols=12 Identities=25% Similarity=0.196 Sum_probs=5.1 Q ss_pred EECCCCCCCCHH Q ss_conf 417888998133 Q gi|255764485|r 177 PSAGKCGFWPED 188 (243) Q Consensus 177 A~ag~CG~WPeD 188 (243) +.++.|+.|..- T Consensus 229 ~~~~~~~~~~~~ 240 (453) T KOG0431 229 SNASKSGSSSDL 240 (453) T ss_pred CCCCCCCCCCCC T ss_conf 266777776644 No 33 >TIGR01005 eps_transp_fam exopolysaccharide transport protein family; InterPro: IPR005700 The bacterial exopolysaccharide transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed at least seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.; GO: 0015542 sugar efflux transmembrane transporter activity, 0006859 extracellular carbohydrate transport. Probab=39.85 E-value=27 Score=15.87 Aligned_cols=110 Identities=12% Similarity=0.130 Sum_probs=68.4 Q ss_pred CCCCCHHHCCCEEEE-ECCEEEEEECCCCCCCCCHH-------------HHHHH-------HHHHHHHHHCCCCEEE--- Q ss_conf 278871113857773-37726988515667688988-------------88999-------9999999741887299--- Q gi|255764485|r 63 DEGSDYRDRYPILMR-KVEQIVDIPLLAGRGEIKYP-------------IHDTI-------RGFLEKYKNDSASVLF--- 118 (243) Q Consensus 63 ~~~~DYR~RHPIvi~-Ea~~tlDIpVg~g~g~Lt~~-------------qr~~V-------~gFa~~yrr~G~G~i~--- 118 (243) |.-|-|..-.|=+.. |+..-+|--|.+...=|... ..+.+ .+...+|.+.-++.+- T Consensus 52 ~n~s~y~~~~g~~~~Gdk~~~~~t~~a~~veIl~S~~~~~~vvdkL~L~~~~~~~nPP~~pvd~~~~wls~~~~llsePG 131 (778) T TIGR01005 52 ENESKYKEEEGSVSDGDKSILVETDVAAEVEILKSREVVKKVVDKLKLAELDKIRNPPREPVDLVKEWLSSALILLSEPG 131 (778) T ss_pred CCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 66540120375877774220023211324334646789999997632220565567788628899999988898616888 Q ss_pred -EEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCEEEEE Q ss_conf -992498988789999999999999975988234066103688854368466775 Q gi|255764485|r 119 -LLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSY 172 (243) Q Consensus 119 -I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~~p~~aAPIRLSY 172 (243) -.+-.......+-.....+|++-|...+|....|+.-.|+-+||.++|||-=.| T Consensus 132 ~~~l~~~~~~~~~~~~~~~~~~e~L~v~~v~~~~~~a~~frsedp~LaA~I~~~~ 186 (778) T TIGR01005 132 LFDLKEEAADVEAREKAAKKVREKLEVERVEKISVVAVEFRSEDPKLAAAIEDAV 186 (778) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEEEEEECCCCCCCCCCHHHH T ss_conf 8887653236754678888879974587614678998886237852001230343 No 34 >PRK06742 flagellar motor protein MotS; Reviewed Probab=39.51 E-value=29 Score=15.60 Aligned_cols=78 Identities=12% Similarity=0.117 Sum_probs=52.9 Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCC-----C----C-HHHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 85156676889888899999999997418872999924989-----8----8-789999999999999975988234066 Q gi|255764485|r 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT-----V----S-SASIRRAVKDIRKIIISSGIPVSSISE 154 (243) Q Consensus 85 IpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS-----~----N-a~AA~~~~~eIR~~L~~~GVp~~~I~v 154 (243) +++..|.-.|++.-+..+...+.--.... ..|.|.=-+.+ . | +-.+.|+..-+|.++...||+++++++ T Consensus 117 ~lF~~Gsa~l~p~~~~~L~~la~~l~~~~-~~I~I~GHTD~~pi~~~~~~sNWeLSsaRA~~V~r~l~~~~gi~~~rl~~ 195 (225) T PRK06742 117 LIFDTGDANVKPEAKEIISQLVGFFQSVP-NPIVVEGHTDSRPIHNDKFPSNWELSSARAANMIHHLIEVYNVDDKRLAA 195 (225) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 77689843239669999999999996599-70899996788766677888649999999999999999836999314899 Q ss_pred ECCCCCCCC Q ss_conf 103688854 Q gi|255764485|r 155 RIYDADYGM 163 (243) Q Consensus 155 ~~Y~a~~p~ 163 (243) ..|....|- T Consensus 196 ~Gygd~~P~ 204 (225) T PRK06742 196 VGYADTKPV 204 (225) T ss_pred EEECCCCCC T ss_conf 865577617 No 35 >pfam01514 YscJ_FliF Secretory protein of YscJ/FliF family. This family includes proteins that are related to the YscJ lipoprotein, and the amino terminus of FliF, the flageller M-ring protein. The members of the YscJ family are thought to be involved in secretion of several proteins. The FliF protein ring is thought to be part of the export apparatus for flageller proteins, based on the similarity to YscJ proteins. Probab=39.04 E-value=30 Score=15.55 Aligned_cols=52 Identities=13% Similarity=0.229 Sum_probs=37.9 Q ss_pred CCCCHHHHHHHHHHHH----HHHHC--CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 6889888899999999----99741--88729999249898878999999999999997598823406 Q gi|255764485|r 92 GEIKYPIHDTIRGFLE----KYKND--SASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 (243) Q Consensus 92 g~Lt~~qr~~V~gFa~----~yrr~--G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~ 153 (243) .+|+..+...|...++ .|+-. |.| -+|+||.... .+.|-.|++.|+|.++-. T Consensus 32 ~~L~~~da~~i~~~L~~~gI~y~~~~~g~g-~tI~Vp~~~~---------~~ar~~La~~GlP~~~~~ 89 (206) T pfam01514 32 TGLSEKDANEVLAALLQAGIPAEKSPDGDG-GTILVPEDDV---------ARARELLAAKGLPREGFV 89 (206) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEECCCC-CEEEECCCHH---------HHHHHHHHHCCCCCCCCC T ss_conf 599999999999999987997598548987-2899160148---------999999997599999998 No 36 >PRK13863 type IV secretion system T-DNA border endonuclease VirD2; Provisional Probab=37.01 E-value=32 Score=15.34 Aligned_cols=76 Identities=14% Similarity=0.269 Sum_probs=54.5 Q ss_pred CEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCC----------------CCEEEEEECCCCCCHHHHHHHHH Q ss_conf 57773377269885156676889888899999999997418----------------87299992498988789999999 Q gi|255764485|r 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDS----------------ASVLFLLIPSPTVSSASIRRAVK 136 (243) Q Consensus 73 PIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G----------------~G~i~I~vPsGS~Na~AA~~~~~ 136 (243) -.+++--.+.|||||.. |.|..+++.|.++- +--|++..|.|+..+ ||.++++ T Consensus 35 ~l~lqrs~~~~~~~v~~----------dq~~~la~sw~~etg~y~~~qpd~~~~~~ltthiivsfp~gt~~~-aa~~a~r 103 (447) T PRK13863 35 KLELQRSARHLDIPVPP----------DQIRELARSWVQETGTYDESQPDEERQQELTTHIIVSFPAGTSQV-AAYAASR 103 (447) T ss_pred CEEEEECCCCCCCCCCH----------HHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCEEEEEECCCCCCHH-HHHHHHH T ss_conf 08875213446888985----------899999999998617877678635545644145899615898778-8888878 Q ss_pred HHHHHHHHCCCCCCCE-EEECCCC Q ss_conf 9999999759882340-6610368 Q gi|255764485|r 137 DIRKIIISSGIPVSSI-SERIYDA 159 (243) Q Consensus 137 eIR~~L~~~GVp~~~I-~v~~Y~a 159 (243) +-....-.+|--.... ..+.|+. T Consensus 104 ~wa~~mfgsg~~gg~~~y~ta~h~ 127 (447) T PRK13863 104 EWAAEMFGSGAGGGRYNYLTAFHI 127 (447) T ss_pred HHHHHHCCCCCCCCCCCEEEEEEC T ss_conf 999985057677872211331001 No 37 >pfam00580 UvrD-helicase UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Bacillus subtilis addA and Escherichia coli exodeoxyribonuclease V beta have large insertions near to the carboxy-terminus relative to other members of the family. Probab=36.50 E-value=33 Score=15.29 Aligned_cols=51 Identities=16% Similarity=0.265 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHHHHCCC----CEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEC Q ss_conf 88999999999974188----729999249898878999999999999997598823406610 Q gi|255764485|r 98 IHDTIRGFLEKYKNDSA----SVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERI 156 (243) Q Consensus 98 qr~~V~gFa~~yrr~G~----G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~ 156 (243) +...|..-.++.+..|. +-+.|++.+.+ -++.+.+.|...|||...+.... T Consensus 329 Ea~~ia~~I~~l~~~~~~~~~~DiAIL~R~~~--------~~~~l~~~L~~~gIP~~~~~~~~ 383 (494) T pfam00580 329 EAEFIADEIKALHKSGAAIPFGDIAVLVRSNS--------QSRAIEEALLKAGIPYKILGGLS 383 (494) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCEEEEEECCH--------HHHHHHHHHHHCCCCEEEECCCC T ss_conf 99999999999987799988565689982523--------18999999997799989989842 No 38 >KOG4754 consensus Probab=36.13 E-value=9.3 Score=18.98 Aligned_cols=95 Identities=11% Similarity=0.020 Sum_probs=60.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCEEE Q ss_conf 76889888899999999997418872999924989887899999999999999759882340661036888543684667 Q gi|255764485|r 91 RGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRL 170 (243) Q Consensus 91 ~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~~p~~aAPIRL 170 (243) +-+||+--+.-|..+...-..++-+... ++=--|| .|+.|..+ .|.-.+|-+.+..+.+|+-+ T Consensus 45 D~~LTplG~~Qv~~l~~~~~A~qL~~~i-eliv~SP-----------MrRtLqT~-----v~~f~~~~~e~g~~~~p~~v 107 (248) T KOG4754 45 DPHLTPLGWKQVDNLRKHLMAKQLPNKI-ELIVVSP-----------MRRTLQTM-----VIAFGGYLAEDGEDPAPVKV 107 (248) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCE-EEEEECH-----------HHHHHHHH-----HHHHCCEECCCCCCCCCEEE T ss_conf 6324877799999986103245457761-4999466-----------99999999-----98714511157876776364 Q ss_pred EEEEEEEE-CCCCCCCCHHHCCCCCC---CCCCCCCC Q ss_conf 75444541-78889981331467557---88623336 Q gi|255764485|r 171 SYFASKPS-AGKCGFWPEDMLGNAKG---NRNWTNYG 203 (243) Q Consensus 171 SY~ri~A~-ag~CG~WPeDL~~s~~~---Nk~y~NFG 203 (243) | +.+.|. -..||-||=|.+.++.. --++++|- T Consensus 108 s-p~~i~~~rE~lG~hpCD~r~~v~~~~~lfp~~DFs 143 (248) T KOG4754 108 S-PPFIAVCRETLGDHPCDRRSSVTDLMKLFPAYDFS 143 (248) T ss_pred C-CHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCE T ss_conf 3-33888889870887664422067888636566622 No 39 >COG0210 UvrD Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair] Probab=33.58 E-value=36 Score=14.98 Aligned_cols=19 Identities=11% Similarity=-0.158 Sum_probs=9.6 Q ss_pred EEEEECCEEEEEECCCCCC Q ss_conf 7773377269885156676 Q gi|255764485|r 74 ILMRKVEQIVDIPLLAGRG 92 (243) Q Consensus 74 Ivi~Ea~~tlDIpVg~g~g 92 (243) ++|-|.+|+|--|=|+.-. T Consensus 244 ~~VGD~dQsIY~frGA~~~ 262 (655) T COG0210 244 FVVGDDDQSIYGFRGADPE 262 (655) T ss_pred EEECCCCCCEEEECCCCCH T ss_conf 9985898736767787836 No 40 >KOG2334 consensus Probab=33.46 E-value=21 Score=16.58 Aligned_cols=68 Identities=24% Similarity=0.346 Sum_probs=53.6 Q ss_pred HHHHHHHHHHCCCEEEECCCCCHHHCCCEE---EEECCEEEE-EECCCCCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 999999996022001102788711138577---733772698-85156676889888899999999997418 Q gi|255764485|r 46 QLFFLLLFYGTSALAYYDEGSDYRDRYPIL---MRKVEQIVD-IPLLAGRGEIKYPIHDTIRGFLEKYKNDS 113 (243) Q Consensus 46 ~~~~~~~~~~~~~~~~~~~~~DYR~RHPIv---i~Ea~~tlD-IpVg~g~g~Lt~~qr~~V~gFa~~yrr~G 113 (243) +|.-.+---|-+++...-.-+|-|++||-- ++|.-+... |||.+.-+.++..|-+|++.|-+.+-..| T Consensus 167 ~lv~ri~~tgi~ai~vh~rt~d~r~~~~~~~~~i~~i~~~~~~V~vi~ng~~~~~e~y~Di~~~~~~~~~~~ 238 (477) T KOG2334 167 KLVKRICATGIAAITVHCRTRDERNQEPATKDYIREIAQACQMVPVIVNGGSMDIEQYSDIEDFQEKTGADS 238 (477) T ss_pred HHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHHHHHCCCH T ss_conf 999999962875699986426667778897799999998716633761554125776312888899853404 No 41 >TIGR01502 B_methylAsp_ase methylaspartate ammonia-lyase; InterPro: IPR006395 These sequences describe methylaspartate ammonia-lyase, also called beta-methylaspartase. It follows methylaspartate mutase (composed of S and E subunits) in one of several possible pathways of glutamate fermentation. . Probab=31.27 E-value=39 Score=14.73 Aligned_cols=79 Identities=15% Similarity=0.196 Sum_probs=59.6 Q ss_pred CCCHHHCCCEEEEECCEEEEEECCCCCCCCC-HHHHHHHHHHHHHHHHCCCCE-EEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 8871113857773377269885156676889-888899999999997418872-99992498988789999999999999 Q gi|255764485|r 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIK-YPIHDTIRGFLEKYKNDSASV-LFLLIPSPTVSSASIRRAVKDIRKII 142 (243) Q Consensus 65 ~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt-~~qr~~V~gFa~~yrr~G~G~-i~I~vPsGS~Na~AA~~~~~eIR~~L 142 (243) +.|| |||. -+||. |.==..+. |-.+..|.+|.++-+...... |.|+-|...-...+=-..+++||..| T Consensus 228 ~~gY---~P~f------H~DVY-G~iGe~fgPP~d~~a~adyi~~la~aa~pf~L~IEgPmD~gsrqaQieama~lr~~L 297 (414) T TIGR01502 228 SEGY---RPVF------HVDVY-GTIGEEFGPPLDRKAVADYIRELAEAAKPFTLRIEGPMDVGSRQAQIEAMAELRSEL 297 (414) T ss_pred CCCC---CCEE------EEEEC-CHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 8898---4068------87503-233432388846788888999999846898544367788452789999999999987 Q ss_pred HHCCCCCCCEEE Q ss_conf 975988234066 Q gi|255764485|r 143 ISSGIPVSSISE 154 (243) Q Consensus 143 ~~~GVp~~~I~v 154 (243) ..+||..+ |++ T Consensus 298 d~rGv~a~-iVa 308 (414) T TIGR01502 298 DDRGVSAE-IVA 308 (414) T ss_pred HCCCCEEE-EEE T ss_conf 07896389-985 No 42 >pfam04954 SIP Siderophore-interacting protein. Probab=30.88 E-value=40 Score=14.69 Aligned_cols=28 Identities=21% Similarity=0.395 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHH-CCCCCCCEEEECC Q ss_conf 999999999999997-5988234066103 Q gi|255764485|r 130 SIRRAVKDIRKIIIS-SGIPVSSISERIY 157 (243) Q Consensus 130 AA~~~~~eIR~~L~~-~GVp~~~I~v~~Y 157 (243) .-+..++++|+.|.. .|++...|....| T Consensus 86 gE~~~~r~lR~~l~~~~g~~~~~i~~~gY 114 (118) T pfam04954 86 GEAGAVRALRRHLRNERGLPREQVYASGY 114 (118) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHEEEEEE T ss_conf 25789999999999853999799389985 No 43 >KOG4632 consensus Probab=29.95 E-value=34 Score=15.13 Aligned_cols=52 Identities=27% Similarity=0.302 Sum_probs=30.2 Q ss_pred HHHHHH---HHHHCCCEEEECCCCCHH----HCCCEEEEECCEEEEEECCCCCCCCCHH Q ss_conf 999999---996022001102788711----1385777337726988515667688988 Q gi|255764485|r 46 QLFFLL---LFYGTSALAYYDEGSDYR----DRYPILMRKVEQIVDIPLLAGRGEIKYP 97 (243) Q Consensus 46 ~~~~~~---~~~~~~~~~~~~~~~DYR----~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~ 97 (243) -+||++ .||||..||-+-||--.. .+|||--==+.-..|-++-...+.|..- T Consensus 73 V~~~~~y~nif~G~aeLaeiPEGy~P~HWeyekhPIsrwiar~f~~Sd~~~yEr~layl 131 (187) T KOG4632 73 VLFCLAYNNIFYGTAELAEIPEGYAPHHWEYEKHPISRWIARWFGVSDVEHYERNLAYL 131 (187) T ss_pred HHHHHHHHHHEECHHHHHHCCCCCCCCCCEEECCCHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999987833307887338788898641210483999999984578689999889999 No 44 >PRK10124 putative UDP-glucose lipid carrier transferase; Provisional Probab=29.52 E-value=30 Score=15.52 Aligned_cols=46 Identities=11% Similarity=0.174 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 8899999999997418872999924989887899999999999999759882 Q gi|255764485|r 98 IHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPV 149 (243) Q Consensus 98 qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~ 149 (243) .-++++.+...-++..-..|.+-.|... +.-.+++-..+...+|.. T Consensus 189 ~lG~~~~l~~~~~~~~id~V~ialp~~~------~~~i~~lv~~l~~~~v~v 234 (464) T PRK10124 189 WAGNLQQLVEDAKAGKIHNVYIAMSMCD------GARVKKLVHQLADTTCSV 234 (464) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCH------HHHHHHHHHHHHHCCCEE T ss_conf 8789999999998689998999668236------889999999996189559 No 45 >pfam11658 DUF3260 Protein of unknown function (DUF3260). Some members in this family of proteins are annotated as YhjU however this cannot be confirmed. Currently this family has no known function. Probab=29.52 E-value=42 Score=14.54 Aligned_cols=76 Identities=17% Similarity=0.250 Sum_probs=47.1 Q ss_pred CEEEEECCEEEEEECCCCCCCCCHHHH-----HHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 577733772698851566768898888-----999999999974188729999249898878999999999999997598 Q gi|255764485|r 73 PILMRKVEQIVDIPLLAGRGEIKYPIH-----DTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGI 147 (243) Q Consensus 73 PIvi~Ea~~tlDIpVg~g~g~Lt~~qr-----~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GV 147 (243) -|-+-|+.|.++-- ....--++..| +|+..|.++-.++|+-++++.||.+.+ | ...|-.++.-=+-+ T Consensus 351 tIsLHDGNr~~g~~--~~~s~~sY~~Ra~kLlddld~F~~~le~SgR~vvVv~VPEHGA---A---lrGDk~QisGlREI 422 (516) T pfam11658 351 TISLHDGNRLPGSN--RLNSLASYKLRAQKLLDDLDRFIDELEKSGRKVVVVLVPEHGA---A---LRGDKMQISGLREI 422 (516) T ss_pred EEECCCCCCCCCCC--CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCH---H---CCCCHHHHCCCCCC T ss_conf 26415677567976--6542442699999999999999999997599589999668521---0---04640544256668 Q ss_pred CCCCEEEEC Q ss_conf 823406610 Q gi|255764485|r 148 PVSSISERI 156 (243) Q Consensus 148 p~~~I~v~~ 156 (243) |.-+|+-.| T Consensus 423 PsP~IthvP 431 (516) T pfam11658 423 PSPSITHVP 431 (516) T ss_pred CCCCCEEEC T ss_conf 999735313 No 46 >PRK07734 motB flagellar motor protein MotB; Reviewed Probab=29.40 E-value=42 Score=14.52 Aligned_cols=79 Identities=11% Similarity=0.119 Sum_probs=48.0 Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC----------CHHHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 851566768898888999999999974188729999249898----------8789999999999999975988234066 Q gi|255764485|r 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV----------SSASIRRAVKDIRKIIISSGIPVSSISE 154 (243) Q Consensus 85 IpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~----------Na~AA~~~~~eIR~~L~~~GVp~~~I~v 154 (243) +++..|.-.|++.-+.-+...+.--....--.|+|.=-+.+. =+-.+.|+..-+|.++..+|++++.+++ T Consensus 144 ~lF~~GsA~l~~~~~~lL~~ia~~l~~~~~~~I~V~GHTD~~Pi~~~~~~sNWeLSaaRA~sV~~~L~~~~gi~~~rl~v 223 (259) T PRK07734 144 ILFDSGKADVRLEDLPLAKEMSNLLVSNPPRNITISGHTDNVPIANAQFASNWELSVMRAVNFMQVLLENKELDPEKFSA 223 (259) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 66688987569779999999999998579963899985577666677676448999999999999999816999567999 Q ss_pred ECCCCCCCC Q ss_conf 103688854 Q gi|255764485|r 155 RIYDADYGM 163 (243) Q Consensus 155 ~~Y~a~~p~ 163 (243) ..|....|- T Consensus 224 ~G~gd~~Pi 232 (259) T PRK07734 224 KGYGEYKPI 232 (259) T ss_pred EEECCCCCC T ss_conf 985578748 No 47 >PRK06925 flagellar motor protein MotS; Reviewed Probab=29.16 E-value=43 Score=14.50 Aligned_cols=78 Identities=13% Similarity=0.032 Sum_probs=56.0 Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCC-----C----CC-HHHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 8515667688988889999999999741887299992498-----9----88-789999999999999975988234066 Q gi|255764485|r 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP-----T----VS-SASIRRAVKDIRKIIISSGIPVSSISE 154 (243) Q Consensus 85 IpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsG-----S----~N-a~AA~~~~~eIR~~L~~~GVp~~~I~v 154 (243) +.+..|...|++.-+..+...+.-..... ..|.|+--+. + .| +-.+.|+..-++.++...|+++.++.+ T Consensus 120 ~lF~~Gsa~l~~~~~~~L~~ia~~l~~~~-~~I~V~GHTD~~p~~~~~~~sNWeLS~aRA~~V~~~li~~~gi~~~rl~~ 198 (230) T PRK06925 120 VLFDTGEADVLKNAEPLLHKIAVLLKTIP-NDIQVEGHTDSRPISTYRYPSNWELSAARASSVIRYFTSKEKLDSSRFIA 198 (230) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHEEE T ss_conf 78788965348779999999999997399-63899874487655556788538999999999999999835998467899 Q ss_pred ECCCCCCCC Q ss_conf 103688854 Q gi|255764485|r 155 RIYDADYGM 163 (243) Q Consensus 155 ~~Y~a~~p~ 163 (243) ..|.-.-|- T Consensus 199 ~Gygd~~P~ 207 (230) T PRK06925 199 VGYADTKPV 207 (230) T ss_pred EEECCCCCC T ss_conf 876578748 No 48 >PRK05198 2-dehydro-3-deoxyphosphooctonate aldolase; Provisional Probab=28.57 E-value=44 Score=14.43 Aligned_cols=42 Identities=14% Similarity=0.178 Sum_probs=15.6 Q ss_pred ECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 37726988515667688988889999999999741887299992 Q gi|255764485|r 78 KVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLI 121 (243) Q Consensus 78 Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~v 121 (243) |..+++-.| .+.++-+..+|+-|..+++.-...|..-+.|++ T Consensus 189 DpSHs~q~p--~~~~~~sgG~r~~v~~la~aaia~G~dGl~iE~ 230 (264) T PRK05198 189 DATHSVQLP--GGQGGSSGGQREFVPVLARAAVAVGVAGLFIET 230 (264) T ss_pred CCCCHHCCC--CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 573200167--887787655177799999999982999899983 No 49 >pfam09500 YiiD_Cterm Putative thioesterase (yiiD_Cterm). This entry consists of a broadly distributed uncharacterized domain found often as a standalone protein. The member from Shewanella oneidensis is described from crystallography work as a putative thioesterase. About half of the members of this family are fused to an Acetyltransf_1 domain pfam00583. The function of this protein is unknown. Probab=27.36 E-value=46 Score=14.29 Aligned_cols=50 Identities=14% Similarity=0.165 Sum_probs=35.2 Q ss_pred CEEEEECCEEEEEECCCC-CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 577733772698851566-76889888899999999997418872999924 Q gi|255764485|r 73 PILMRKVEQIVDIPLLAG-RGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP 122 (243) Q Consensus 73 PIvi~Ea~~tlDIpVg~g-~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vP 122 (243) -|||+|.+-.---||... ...-+.+..+..++|.+.+.+.|.+.+.+.+- T Consensus 15 ~IVi~~~~I~Y~~PV~~d~~A~~~~p~~~~~~~f~~~L~~~gkaRi~L~~~ 65 (84) T pfam09500 15 DIVLADSQIRYLAPVTGDPVAVCSLPDAAVWSGFLATLARGGKARITLEVE 65 (84) T ss_pred CEEEEECEEEECCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 399995417897757999489997797678899999998379779999999 No 50 >TIGR03368 cellulose_yhjU cellulose synthase operon protein YhjU. This protein was identified by the partial phylogenetic profiling algorithm (PMID:16930487) as part of the system for cellulose biosynthesis in bacteria, and in fact is found in cellulose biosynthesis gene regions. The protein was designated YhjU in Salmonella enteritidis, where disruption of its gene disrupts cellulose biosynthesis and biofilm formation (PMID:11929533). Probab=27.28 E-value=46 Score=14.28 Aligned_cols=75 Identities=20% Similarity=0.310 Sum_probs=48.5 Q ss_pred CEEEEECCEEEEEECCCCCCCCCHHHH-----HHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 577733772698851566768898888-----999999999974188729999249898878999999999999997598 Q gi|255764485|r 73 PILMRKVEQIVDIPLLAGRGEIKYPIH-----DTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGI 147 (243) Q Consensus 73 PIvi~Ea~~tlDIpVg~g~g~Lt~~qr-----~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GV 147 (243) -|-+-|+.|.++ ..-..--+++.| +|+..|.++-.++|+-++++.||.+.+ | ...|-.++.-=+-+ T Consensus 351 tIsLHDGNr~~~---~~~~s~~sY~~Ra~kLlddld~F~~~le~SgR~vvVv~VPEHGA---A---lrGDk~QiaGLREI 421 (518) T TIGR03368 351 TISLHDGNRIPN---SGMTSLASYPLRAKKLLDDLDRFFDELEKSGRKVVVVLVPEHGA---A---LRGDKMQISGLREI 421 (518) T ss_pred EEECCCCCCCCC---CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCH---H---CCCCHHHHCCCCCC T ss_conf 364246875788---88551542699999999999999999997699589999668520---0---04640554256668 Q ss_pred CCCCEEEEC Q ss_conf 823406610 Q gi|255764485|r 148 PVSSISERI 156 (243) Q Consensus 148 p~~~I~v~~ 156 (243) |.-+|+-.| T Consensus 422 PsP~IthvP 430 (518) T TIGR03368 422 PSPSITHVP 430 (518) T ss_pred CCCCCEEEC T ss_conf 999735313 No 51 >COG3389 Uncharacterized protein conserved in archaea [Function unknown] Probab=27.26 E-value=1.7 Score=23.96 Aligned_cols=44 Identities=30% Similarity=0.384 Sum_probs=30.8 Q ss_pred CEEEECCCCCHHHCCCEEEEECCEEEEEECCCC-CCCCCHHHHHH Q ss_conf 001102788711138577733772698851566-76889888899 Q gi|255764485|r 58 ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAG-RGEIKYPIHDT 101 (243) Q Consensus 58 ~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g-~g~Lt~~qr~~ 101 (243) +++-||-=+-|||||=|.++|.---+|+|.-.= -.++..+.+.+ T Consensus 145 ~lavYDaIsVYkT~HMIslA~~v~d~~lPmlfviP~~l~ysf~~~ 189 (277) T COG3389 145 ALAVYDAISVYKTRHMISLAEGVMDLDLPMLFVIPENLAYSFVED 189 (277) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEEEH T ss_conf 999977898874788999998777528966999606556310002 No 52 >pfam10070 DUF2309 Uncharacterized protein conserved in bacteria (DUF2309). Members of this family of hypothetical bacterial proteins have no known function. Probab=26.72 E-value=47 Score=14.21 Aligned_cols=123 Identities=16% Similarity=0.108 Sum_probs=70.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHC--CCC Q ss_conf 999999999999999960220011027887111385777337726988515667688988889999999999741--887 Q gi|255764485|r 38 FLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKND--SAS 115 (243) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~--G~G 115 (243) |+-.-.+|-+|-+-|+-.|-...--.-+..-+.-+|.+..+.+. +. ......++|-.++.++...+.+-.-- +-. T Consensus 419 f~~ve~~G~~~~~~L~~~tl~p~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~g~t~~e~~~~a~~~Lr~mGLt~~FA 495 (783) T pfam10070 419 FAFVEATGPLYALKLLADTLGPSRRAAGSIAPSLDPAVLKKPLT--DL-EDGLPIGLTLEEQVALAEGALRAMGLTDNFA 495 (783) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCCCCCC--CC-CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 99999889999999999861877655565433235121267654--43-3445679998999999999998667765777 Q ss_pred EEEEEE---------C-------------CCCCCHHHHHHHH--HHHHHHHHHCCC--CCCCEEEECCCCCCCC Q ss_conf 299992---------4-------------9898878999999--999999997598--8234066103688854 Q gi|255764485|r 116 VLFLLI---------P-------------SPTVSSASIRRAV--KDIRKIIISSGI--PVSSISERIYDADYGM 163 (243) Q Consensus 116 ~i~I~v---------P-------------sGS~Na~AA~~~~--~eIR~~L~~~GV--p~~~I~v~~Y~a~~p~ 163 (243) .+++++ | +|.+|+++.+.+. ++||+.|.+.|| |..-..+..+|-.-.+ T Consensus 496 ~lV~l~GHGS~s~NNP~~aaldCGACgG~~G~~NAR~~A~~lN~p~VR~~L~~~GI~IP~dT~F~aa~H~TttD 569 (783) T pfam10070 496 PLVVLVGHGSQSTNNPHAAALDCGACGGQSGGVNARVLAALLNDPEVRAGLAERGIDIPDDTVFVAALHDTTTD 569 (783) T ss_pred CEEEEECCCCCCCCCCHHHHCCCCCCCCCCCCCCHHHHHHHHCCHHHHHHHHHCCCCCCCCCEEEEEECCCCCC T ss_conf 75899516767678804665466565799887149999998579999999997699789988787533167643 No 53 >PRK13760 putative RNA-associated protein; Provisional Probab=26.27 E-value=42 Score=14.56 Aligned_cols=51 Identities=25% Similarity=0.269 Sum_probs=24.0 Q ss_pred CCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHH----HHHHHCCCCEEEEEECCCCCC Q ss_conf 85777337726988515667688988889999999----999741887299992498988 Q gi|255764485|r 72 YPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFL----EKYKNDSASVLFLLIPSPTVS 127 (243) Q Consensus 72 HPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa----~~yrr~G~G~i~I~vPsGS~N 127 (243) -||-.++..=.+.|| +... ..-...+..|+ .+|...|+=..++++|.|.-+ T Consensus 157 iPik~~~~~l~v~iP-~~~a----~~a~~~l~~~g~v~kEeW~~dGs~~~vv~ipaG~~~ 211 (233) T PRK13760 157 IPIKFEKARIAVKIP-GEYA----GKAYGELAKFGEIKKEEWQSDGSWVAVLEIPAGLQD 211 (233) T ss_pred CCEEHHHEEEEEEEC-HHHH----HHHHHHHHHHCCCCCEECCCCCCEEEEEEECCCCHH T ss_conf 663133279999967-8998----899999997472010101589848999998975489 No 54 >TIGR02447 yiiD_Cterm thioesterase domain, putative; InterPro: IPR012660 This entry consists of a broadly distributed uncharacterised domain found often as a standalone protein. The member from Shewanella oneidensis is described from crystallography work as a putative thioesterase. About half of the members of this family are fused to an Acetyltransf_1 domain (PF00583). The function of this protein is unknown. . Probab=26.04 E-value=48 Score=14.15 Aligned_cols=50 Identities=14% Similarity=0.161 Sum_probs=40.4 Q ss_pred CEEEEECCEEEEEECCCC-CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 577733772698851566-76889888899999999997418872999924 Q gi|255764485|r 73 PILMRKVEQIVDIPLLAG-RGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP 122 (243) Q Consensus 73 PIvi~Ea~~tlDIpVg~g-~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vP 122 (243) -|||+|.+=.=--||-.. .-+=..+.-..-++|...+.|.|.+-|.+++- T Consensus 70 ~IVi~~~~i~Y~~PV~~~~~A~c~~p~~~~~~~fl~~l~~~gKAR~~l~a~ 120 (141) T TIGR02447 70 DIVIADSQIRYLAPVTGDPVAECEVPDLESWEAFLATLQRGGKARVKLEAQ 120 (141) T ss_pred EEEEECCCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHCCCEEEEEEEE T ss_conf 499974710215785767235556565336044456885058704899999 No 55 >KOG3485 consensus Probab=26.02 E-value=24 Score=16.18 Aligned_cols=20 Identities=25% Similarity=0.662 Sum_probs=13.8 Q ss_pred EEEEEEEEEECCCCCCCCHH Q ss_conf 67754445417888998133 Q gi|255764485|r 169 RLSYFASKPSAGKCGFWPED 188 (243) Q Consensus 169 RLSY~ri~A~ag~CG~WPeD 188 (243) |+.|--++-.--|||.||-+ T Consensus 63 rvr~N~lekml~pcg~wpp~ 82 (86) T KOG3485 63 RVRFNLLEKMLQPCGPWPPE 82 (86) T ss_pred HHHHCHHHHHHCCCCCCCCC T ss_conf 66406898853356899976 No 56 >pfam02397 Bac_transf Bacterial sugar transferase. This Pfam family represents a conserved region from a number of different bacterial sugar transferases, involved in diverse biosynthesis pathways. Probab=24.82 E-value=31 Score=15.46 Aligned_cols=15 Identities=20% Similarity=0.350 Sum_probs=8.5 Q ss_pred CCEEEEEECCCCCCH Q ss_conf 872999924989887 Q gi|255764485|r 114 ASVLFLLIPSPTVSS 128 (243) Q Consensus 114 ~G~i~I~vPsGS~Na 128 (243) .|-+-+-=|..-... T Consensus 101 ~GdMSlVGPRP~~~~ 115 (188) T pfam02397 101 KGDMSLVGPRPELPE 115 (188) T ss_pred CCEEEEEECCCCCHH T ss_conf 282789818988767 No 57 >cd07127 ALDH_PAD-PaaZ Phenylacetic acid degradation proteins PaaZ (Escherichia coli) and PaaN (Pseudomonas putida)-like. Phenylacetic acid degradation (PAD) proteins PaaZ (Escherichia coli) and PaaN (Pseudomonas putida) are putative aromatic ring cleavage enzymes of the aerobic PA catabolic pathway. PaaZ mutants were defective for growth with PA as a sole carbon source due to interruption of the putative ring opening system. This CD is limited to bacterial monofunctional enzymes. Probab=24.72 E-value=51 Score=13.97 Aligned_cols=44 Identities=11% Similarity=0.203 Sum_probs=33.6 Q ss_pred CCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEC Q ss_conf 188729999249898878999999999999997598823406610 Q gi|255764485|r 112 DSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERI 156 (243) Q Consensus 112 ~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~ 156 (243) --+|.-+|.-|+....- .+..++.-++++|.+.|+|++.+..-. T Consensus 218 LaaGN~VV~KPse~t~l-~~a~~v~~~~evl~eAG~pp~~v~~~~ 261 (549) T cd07127 218 LATGNPVIVKPHPAAIL-PLAITVQVAREVLAEAGFDPNLVTLAA 261 (549) T ss_pred HHCCCEEEEECCCCCCH-HHHHHHHHHHHHHHHHCCCHHHEEEEC T ss_conf 97298699954887644-599999999999998489966857727 No 58 >PRK10953 cysJ sulfite reductase subunit alpha; Provisional Probab=23.75 E-value=53 Score=13.85 Aligned_cols=16 Identities=13% Similarity=0.227 Sum_probs=8.5 Q ss_pred EEEECCCCCCCCCCCC Q ss_conf 0661036888543684 Q gi|255764485|r 152 ISERIYDADYGMDVDT 167 (243) Q Consensus 152 I~v~~Y~a~~p~~aAP 167 (243) |.-|.|+.+......| T Consensus 383 l~PR~YSIsSS~~~~p 398 (599) T PRK10953 383 LTPRLYSIASSQAEVE 398 (599) T ss_pred CCCCCEEECCCCCCCC T ss_conf 6874101225777789 No 59 >PRK10949 protease 4; Provisional Probab=23.26 E-value=54 Score=13.79 Aligned_cols=84 Identities=18% Similarity=0.140 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHCCC-CEE--EEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCC----CCCCCEEEEE Q ss_conf 999999999974188-729--999249898878999999999999997598823406610368885----4368466775 Q gi|255764485|r 100 DTIRGFLEKYKNDSA-SVL--FLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG----MDVDTIRLSY 172 (243) Q Consensus 100 ~~V~gFa~~yrr~G~-G~i--~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~~Y~a~~p----~~aAPIRLSY 172 (243) +++..-.++-|.... -.| -|.-|.||+ -|...+.+||.+ +.+.|.|. -+++..+.||+. ..+..|--+= T Consensus 350 dt~~~~lr~a~~D~~VKAvVLRVnSPGGSa--~ASE~I~~el~~-lk~~gKPV-VvSMG~vAASGGYwIa~~Ad~I~A~p 425 (618) T PRK10949 350 DTTAAQIRDARLDPKVKAIVLRVNSPGGSV--TASEVIRAELAA-ARAAGKPV-VVSMGGMAASGGYWISTPANYIVANP 425 (618) T ss_pred HHHHHHHHHHHCCCCCEEEEEEEECCCCCH--HHHHHHHHHHHH-HHHCCCCE-EEEECCCCCCCCEEEECCCCEEEECC T ss_conf 899999999730867228999985899866--789999999999-98449979-99970101566258851687699889 Q ss_pred EEEEEECCCCCCCCH Q ss_conf 444541788899813 Q gi|255764485|r 173 FASKPSAGKCGFWPE 187 (243) Q Consensus 173 ~ri~A~ag~CG~WPe 187 (243) ..++..-|=-|..|. T Consensus 426 ~TITGSIGVfg~~p~ 440 (618) T PRK10949 426 STLTGSIGIFGVINT 440 (618) T ss_pred CCEEEECEEEEEHHH T ss_conf 867886404776104 No 60 >TIGR02857 CydD ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydD; InterPro: IPR014216 The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain and a C-terminal ATP-binding domain. In Escherichia coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in the export of redox-active thiol compounds such as cysteine and glutathione , . The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this entry are all either found in the gammaproteobacterial context, or the CydABCD context. All members of this entry scoring above the trusted cut off at the time of its creation were from genomes which encode a cytochrome bd complex. Unfortunately, the gene symbol nomenclature, based on this operon in Bacillus subtilis, assigns cydC to the third gene in the operon, where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologues in this family in accordance with the precedence of publication of the E. coli name, CydD.. Probab=23.09 E-value=55 Score=13.77 Aligned_cols=68 Identities=15% Similarity=0.272 Sum_probs=59.2 Q ss_pred EEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCC--CCEEEEEECCCCCCHHHHHHHHHHHHHHH-HH Q ss_conf 73377269885156676889888899999999997418--87299992498988789999999999999-97 Q gi|255764485|r 76 MRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDS--ASVLFLLIPSPTVSSASIRRAVKDIRKII-IS 144 (243) Q Consensus 76 i~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G--~G~i~I~vPsGS~Na~AA~~~~~eIR~~L-~~ 144 (243) +++-|+=+|-++|.+-.||+..||=+| +.++-..+.- ++.+..+=|+-.--+.......+.+|++. .. T Consensus 478 ~~~LP~Gl~t~~Ge~G~GLSGGq~QRl-ALARafl~~~~a~~llLLDEPTAhLD~~tEa~v~~~~~~l~A~g 548 (570) T TIGR02857 478 VAALPQGLDTPIGEGGAGLSGGQAQRL-ALARAFLRDATAAPLLLLDEPTAHLDAETEAEVLEALRALAAQG 548 (570) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHHH-HHHHHHHCHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 997111313435888884779999999-99999606111176035407764321899999999999997379 No 61 >PRK11546 zraP zinc resistance protein; Provisional Probab=22.47 E-value=56 Score=13.69 Aligned_cols=83 Identities=16% Similarity=0.207 Sum_probs=59.1 Q ss_pred HHHHHHHHCCCEEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHC----------CCCEE Q ss_conf 99999960220011027887111385777337726988515667688988889999999999741----------88729 Q gi|255764485|r 48 FFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKND----------SASVL 117 (243) Q Consensus 48 ~~~~~~~~~~~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~----------G~G~i 117 (243) |-+|.|.+.+|+|--..| --+.+.+.||+.|+..++..-++|-.+ -++-+ T Consensus 14 ~~~~~~~~~~a~a~~~~~--------------------~~~~~~~~LT~EQQa~~qkL~~eF~~eTa~LRqqL~aKr~El 73 (139) T PRK11546 14 LSALAMGSGSAFAHNGHG--------------------MWQQGAAPLTTEQQAAAQKIHNDFYAQTSALRQQLVSKRYEY 73 (139) T ss_pred HHHHHHHCCHHHCCCCCC--------------------CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999998035231257877--------------------778888988999999999999999999999999999989999 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHH--------------HHHCCCCCC Q ss_conf 999249898878999999999999--------------997598823 Q gi|255764485|r 118 FLLIPSPTVSSASIRRAVKDIRKI--------------IISSGIPVS 150 (243) Q Consensus 118 ~I~vPsGS~Na~AA~~~~~eIR~~--------------L~~~GVp~~ 150 (243) ..++-+..|...++...++||-++ ++..|||.. T Consensus 74 NALL~at~PD~~kI~avakEis~LR~kl~e~rv~~dv~~a~agip~~ 120 (139) T PRK11546 74 NALLTANPPDSSKINAVAKEMETLRQSLDEQRVKRDIAMAEAGIPRG 120 (139) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 99980699779999999999999999999999999999998079865 No 62 >TIGR03022 WbaP_sugtrans Undecaprenyl-phosphate galactose phosphotransferase, WbaP. This model includes the enterobacterial enzymes, where the function is presumed to be identical to the S. typhimurium enzyme as well as a somewhat broader group which are likely to catalyze the same or highly similar reactions based on a phylogenetic tree-building analysis of the broader sugar transferase family. Most of these genes are found within large operons dedicated to the production of complex exopolysaccharides such as the enterobacterial O-antigen. The most likely heterogeneity would be in the precise nature of the sugar molecule transferred. Probab=22.43 E-value=32 Score=15.35 Aligned_cols=23 Identities=17% Similarity=0.391 Sum_probs=16.5 Q ss_pred HHCCCCCCCCCCHHHHHHHHHHH Q ss_conf 85068888844366788899875 Q gi|255764485|r 219 DLFSPRMVTPPDAEQRDKSIQRY 241 (243) Q Consensus 219 DLv~PR~~tP~Da~RR~~viekY 241 (243) -||+|||+.|...++-...+..| T Consensus 376 SlVGPRP~~~~~~~~~~~~~~~~ 398 (456) T TIGR03022 376 SLVGPRPYLTSELSRYGEALELY 398 (456) T ss_pred EEECCCCCCHHHHHHHHHHHCCE T ss_conf 78628988546899866742377 No 63 >COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only] Probab=22.37 E-value=56 Score=13.67 Aligned_cols=35 Identities=17% Similarity=0.176 Sum_probs=16.3 Q ss_pred CEEEEEECCCCC--------CHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 729999249898--------87899999999999999759882 Q gi|255764485|r 115 SVLFLLIPSPTV--------SSASIRRAVKDIRKIIISSGIPV 149 (243) Q Consensus 115 G~i~I~vPsGS~--------Na~AA~~~~~eIR~~L~~~GVp~ 149 (243) =+|.|+.|+... ...-..+.+..||.-..+.+-+. T Consensus 162 vSiNIElp~~~~lk~lap~K~p~dI~r~Mg~ir~~i~e~~e~~ 204 (404) T COG4277 162 VSINIELPTDDGLKLLAPEKDPTDILRSMGWIRLKILENAEDK 204 (404) T ss_pred EEEEEECCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCH T ss_conf 5776744886446661888883788889899998776515502 No 64 >PRK00484 lysS lysyl-tRNA synthetase; Reviewed Probab=22.06 E-value=45 Score=14.33 Aligned_cols=10 Identities=30% Similarity=0.414 Sum_probs=3.9 Q ss_pred HHHHHHHCCC Q ss_conf 9999997598 Q gi|255764485|r 138 IRKIIISSGI 147 (243) Q Consensus 138 IR~~L~~~GV 147 (243) ++.++...|+ T Consensus 337 l~~~~~~~g~ 346 (491) T PRK00484 337 ARAIAKELGV 346 (491) T ss_pred HHHHHHHCCC T ss_conf 9999998699 No 65 >PRK09040 hypothetical protein; Provisional Probab=22.02 E-value=57 Score=13.63 Aligned_cols=80 Identities=9% Similarity=0.181 Sum_probs=54.2 Q ss_pred EEECCCCCCCCCHHHHHHHHHHHHHHHHCC---CCEEEEE-----ECC--CC----CCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 885156676889888899999999997418---8729999-----249--89----887899999999999999759882 Q gi|255764485|r 84 DIPLLAGRGEIKYPIHDTIRGFLEKYKNDS---ASVLFLL-----IPS--PT----VSSASIRRAVKDIRKIIISSGIPV 149 (243) Q Consensus 84 DIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G---~G~i~I~-----vPs--GS----~Na~AA~~~~~eIR~~L~~~GVp~ 149 (243) ++.+..|.-.|++.-+..+..+++.....- .-.|.|+ +|- ++ .|-.--.+=...|.+.|.+.||++ T Consensus 90 ~vlF~~gsa~L~p~~~~~L~~ia~~L~~~~~~~~~~i~V~GhTD~~pi~~~~~~f~sNw~LS~~RA~~V~~~L~~~gv~~ 169 (215) T PRK09040 90 SVLFALNSDQLQPEGRELLKSLAGPLAAYLRARDEILMVSGFTDDQPVRAGNRRFADNWELSAQRALTVTRALIDAGVPA 169 (215) T ss_pred CEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 43676896400832699999999999987447986599873489976767788889789999999999999999869997 Q ss_pred CCEEEECCCCCCCC Q ss_conf 34066103688854 Q gi|255764485|r 150 SSISERIYDADYGM 163 (243) Q Consensus 150 ~~I~v~~Y~a~~p~ 163 (243) .+|....|-...|- T Consensus 170 ~rl~~~g~G~~~P~ 183 (215) T PRK09040 170 SSVFAAAFGSEQPV 183 (215) T ss_pred HHEEEEEECCCCCC T ss_conf 89589886576878 No 66 >cd01844 SGNH_hydrolase_like_6 SGNH_hydrolase subfamily. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Probab=21.68 E-value=58 Score=13.58 Aligned_cols=69 Identities=13% Similarity=0.237 Sum_probs=33.7 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEEEECCC------CCCHHH----HHHHHHHHHHHHHHCCCCC Q ss_conf 2698851566768898888999999999974188-7299992498------988789----9999999999999759882 Q gi|255764485|r 81 QIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSA-SVLFLLIPSP------TVSSAS----IRRAVKDIRKIIISSGIPV 149 (243) Q Consensus 81 ~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~-G~i~I~vPsG------S~Na~A----A~~~~~eIR~~L~~~GVp~ 149 (243) ..+.|-++.....-....+.+...|...-|..-. -+|++..|.. +++... ..+..+++-..+.+.|++. T Consensus 59 ~~~vi~~g~N~~~~~~~~~~~~~~~V~~ir~~~p~tPIvlv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~ 138 (177) T cd01844 59 DLYIIDCGPNIVGAEAMVRERLGPLVKGLRETHPDTPILLVSPRYCPDAELTPGRGKLTLAVRRALREAFEKLRADGVPN 138 (177) T ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 88999634678874789999999999999987889978998336776444687527889999999999999999769998 No 67 >pfam03853 YjeF_N YjeF-related protein N-terminus. Probab=21.29 E-value=59 Score=13.53 Aligned_cols=47 Identities=23% Similarity=0.303 Sum_probs=27.6 Q ss_pred EEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 73377269885156676889888899999999997418872999924989 Q gi|255764485|r 76 MRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 (243) Q Consensus 76 i~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS 125 (243) +++.+=.+|=-+|+| ++.+.+..+....+.--+...-.+-|++|||- T Consensus 98 ~~~~d~iVDal~G~G---l~~~l~~~~~~~i~~iN~~~~~visiDiPSGl 144 (170) T pfam03853 98 KALADLIIDALFGTG---LSRPLRGEYAELIEWINASKAPVLAVDIPSGL 144 (170) T ss_pred CCCCCEEEEEEEECC---CCCCCCHHHHHHHHHHHCCCCCEEEEECCCCC T ss_conf 566628997435236---79889889999999997389967997378875 No 68 >pfam10609 ParA ParA/MinD ATPase like. This family contains ATPases involved in plasmid partitioning. It also contains the cytosolic Fe-S cluster assembling factor NBP35 which is required for biogenesis and export of both ribosomal subunits. Probab=21.20 E-value=60 Score=13.52 Aligned_cols=58 Identities=16% Similarity=0.272 Sum_probs=40.1 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 69885156676889888899999999997418872999924989887899999999999999759882340661 Q gi|255764485|r 82 IVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 (243) Q Consensus 82 tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~eIR~~L~~~GVp~~~I~v~ 155 (243) .+|.|=|+|+-.|+-.|.-. -+|.|++-.|+.- | ...++..-.++...++|--+|++. T Consensus 5 iiD~PPGTgD~~ltl~~~~~-----------~~~~ivVTTP~~l----s-~~dv~r~i~~~~~~~i~ilGiVEN 62 (81) T pfam10609 5 IIDMPPGTGDEHLTLAQSLP-----------VDGAVIVTTPQDV----A-LLDVRKAIDMFKKLNVPILGIVEN 62 (81) T ss_pred EEECCCCCCHHHHHHHHHCC-----------CCCEEEEECCHHH----H-HHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 99489996789999997488-----------5776998098599----9-999999999999849961899977 No 69 >pfam06837 Fijivirus_P9-2 Fijivirus P9-2 protein. This family consists of several Fijivirus specific P9-2 proteins from Rice black streaked dwarf virus (RBSDV) and Fiji disease virus. The function of this family is unknown. Probab=21.18 E-value=60 Score=13.52 Aligned_cols=59 Identities=31% Similarity=0.464 Sum_probs=39.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHH Q ss_conf 999988635778767648999999999875679999999999999999960220011027887111 Q gi|255764485|r 5 YMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRD 70 (243) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DYR~ 70 (243) -|---||+-||-.-| .|.++|-+-||. +-+|..--++++..|||.+-+..|.+-+.-|. T Consensus 98 K~fYhLf~cV~CNpl--~R~~~SiifTi~-----fy~ll~V~iyLl~~ffgD~Ii~~~n~ln~~~s 156 (214) T pfam06837 98 KLFYHLFGCVICNPL--SRLLISIIFTIC-----FYTLLFVLIYLLIIFFGDSIIHCVNYLNNNRS 156 (214) T ss_pred HHHHHHHHHHCCCHH--HHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 999998631104818--999999999999-----99999999999999962079999998850343 No 70 >PRK10697 DNA-binding transcriptional activator PspC; Provisional Probab=21.01 E-value=60 Score=13.49 Aligned_cols=14 Identities=36% Similarity=0.788 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 863577876764899 Q gi|255764485|r 10 LFGGVCFKGLANMRS 24 (243) Q Consensus 10 ~~~~~~~~~~~~~~~ 24 (243) .++||| .|+|+--. T Consensus 18 ~i~GVC-aGlA~Yf~ 31 (119) T PRK10697 18 MVRGVC-AGIAHYFD 31 (119) T ss_pred EEEEEH-HHHHHHHC T ss_conf 873311-44998829 No 71 >PRK10251 phosphopantetheinyltransferase component of enterobactin synthase multienzyme complex; Provisional Probab=20.52 E-value=61 Score=13.42 Aligned_cols=41 Identities=12% Similarity=0.298 Sum_probs=30.9 Q ss_pred CCCCCCCHHHCCCCCCCCCCC---------------CCCHHHHHHHHHHHCCHHHH Q ss_conf 888998133146755788623---------------33625688789862696885 Q gi|255764485|r 180 GKCGFWPEDMLGNAKGNRNWT---------------NYGCAYQNNLAAQVVNPLDL 220 (243) Q Consensus 180 g~CG~WPeDL~~s~~~Nk~y~---------------NFGCAtQ~NLAAqVaNP~DL 220 (243) .....||+.+.+|+.-+.+|. .+-.....+|+.+|.+|.+. T Consensus 78 ~RaP~WP~g~~GSIsH~~~~A~avva~~~lGIDiE~~l~~~~A~eL~~~Ilt~~E~ 133 (207) T PRK10251 78 LRQPVWPAGVYGSISHCGTTALAVVSRQPIGIDIEEIFSQQTATELTDNIITPAEH 133 (207) T ss_pred CCCCCCCCCCEEEEECCCCEEEEEEECCCCCCCHHHHCCHHHHHHHHHHCCCHHHH T ss_conf 99988999874866505784899995045651567746999999999872999999 No 72 >KOG3133 consensus Probab=20.18 E-value=34 Score=15.15 Aligned_cols=84 Identities=17% Similarity=0.235 Sum_probs=51.9 Q ss_pred CCEEEECCCCCHHHCCCEEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 20011027887111385777337726988515667688988889999999999741887299992498988789999999 Q gi|255764485|r 57 SALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVK 136 (243) Q Consensus 57 ~~~~~~~~~~DYR~RHPIvi~Ea~~tlDIpVg~g~g~Lt~~qr~~V~gFa~~yrr~G~G~i~I~vPsGS~Na~AA~~~~~ 136 (243) |.-..|.-=-+-+.+.|--++|..+++. ...-+=-..|-.-|.-|-..|-.+- -++.+.-+.+- T Consensus 157 SKEILyeplKEl~~~YPkwLeen~e~l~----~E~~erYqkQ~~~i~~i~~~~e~e~------------~Ds~~~k~~v~ 220 (267) T KOG3133 157 SKEILYEPLKELGANYPKWLEENGESLS----KEDKERYQKQFELIKEIESVYENEP------------NDSATHKDKVL 220 (267) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCC----HHHHHHHHHHHHHHHHHHHHHHCCC------------CCHHHHHHHHH T ss_conf 8877401699999873599985535568----7889999999999999998985377------------42567778999 Q ss_pred HHHHHHHHCCCCCCCEEEEC Q ss_conf 99999997598823406610 Q gi|255764485|r 137 DIRKIIISSGIPVSSISERI 156 (243) Q Consensus 137 eIR~~L~~~GVp~~~I~v~~ 156 (243) ++-+-|..-|-|+..|+... T Consensus 221 elm~eLQe~G~PP~ei~~em 240 (267) T KOG3133 221 ELMQELQELGQPPPEIVNEM 240 (267) T ss_pred HHHHHHHHCCCCCHHHHCCC T ss_conf 99999998399957886178 No 73 >cd06193 siderophore_interacting Siderophore interacting proteins share the domain structure of the ferredoxin reductase like family. Siderophores are produced in various bacteria (and some plants) to extract iron from hosts. Binding constants are high, so iron can be pilfered from transferrin and lactoferrin for bacterial uptake, contributing to pathogen virulence. Ferredoxin reductase (FNR), an FAD and NAD(P) binding protein, was intially identified as a chloroplast reductase activity, catalyzing the electron transfer from reduced iron-sulfur protein ferredoxin to NADP+ as the final step in the electron transport mechanism of photosystem I. FNR transfers electrons from reduced ferredoxin to FAD (forming FADH2 via a semiquinone intermediate) and then transfers a hydride ion to convert NADP+ to NADPH. FNR has since been shown to utilize a variety of electron acceptors and donors and has a variety of physiological functions including nitrogen assimilation, dinitrogen fixation, steroid hy Probab=20.18 E-value=62 Score=13.38 Aligned_cols=53 Identities=13% Similarity=0.098 Sum_probs=31.6 Q ss_pred HHHHHHHHCCC--CEEEEEECCCCCCHHH---------HHHHHHHHHHHHHHC--CCCCCCEEEEC Q ss_conf 99999974188--7299992498988789---------999999999999975--98823406610 Q gi|255764485|r 104 GFLEKYKNDSA--SVLFLLIPSPTVSSAS---------IRRAVKDIRKIIISS--GIPVSSISERI 156 (243) Q Consensus 104 gFa~~yrr~G~--G~i~I~vPsGS~Na~A---------A~~~~~eIR~~L~~~--GVp~~~I~v~~ 156 (243) |-+..|-+... ..|.+.-|.|.-.... =+.+...|.++|... +.+...+...+ T Consensus 90 Gpas~WA~~a~~GD~i~i~gP~g~~~~~~~~~~~ll~gDeTAlPAia~iLe~lp~~~~~~~~iev~ 155 (235) T cd06193 90 GPASRWAASAQPGDTLGIAGPGGSFLPPPDADWYLLAGDETALPAIAAILEELPADARGTALIEVP 155 (235) T ss_pred CCHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEEEEEC T ss_conf 857999985899997788868988789998777999824424899999998688998499999979 Done!