BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|255764485|ref|YP_003065139.2| hypothetical protein CLIBASIA_03065 [Candidatus Liberibacter asiaticus str. psy62] (243 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|255764485|ref|YP_003065139.2| hypothetical protein CLIBASIA_03065 [Candidatus Liberibacter asiaticus str. psy62] gi|254547836|gb|ACT57199.2| hypothetical protein CLIBASIA_03065 [Candidatus Liberibacter asiaticus str. psy62] Length = 243 Score = 502 bits (1292), Expect = e-140, Method: Compositional matrix adjust. Identities = 243/243 (100%), Positives = 243/243 (100%) Query: 1 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA 60 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA Sbjct: 1 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA 60 Query: 61 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL Sbjct: 61 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 Query: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG 180 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG Sbjct: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG 180 Query: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR Sbjct: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 Query: 241 YRQ 243 YRQ Sbjct: 241 YRQ 243 >gi|315121890|ref|YP_004062379.1| hypothetical protein CKC_00700 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495292|gb|ADR51891.1| hypothetical protein CKC_00700 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 224 Score = 293 bits (750), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 141/243 (58%), Positives = 188/243 (77%), Gaps = 23/243 (9%) Query: 1 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA 60 MM + MI +FGG+CFK L + FFL Q+ FLLLF G + LA Sbjct: 1 MMRDIMIKEIFGGMCFKRLVS---------------FFLM-----QISFLLLFCGNNVLA 40 Query: 61 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 +DYRDRYPI+M+KVE+ +DIPLL+GRG++ ++DTI+GF+++YK +S SV+F+L Sbjct: 41 ---NQNDYRDRYPIVMKKVEKSLDIPLLSGRGKLPSDMYDTIKGFIDRYKQNSTSVIFIL 97 Query: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG 180 IP+PT+SS +I+ A+K+IR+ IIS+GIP SS+SER YDADY +D+DTIRLSYFAS+PSAG Sbjct: 98 IPTPTISSHAIQDALKNIRRFIISNGIPSSSLSERSYDADYELDIDTIRLSYFASRPSAG 157 Query: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 KCGFWPED+LG++ N NW+NYGC+YQNNLAAQ+VNP+DLF+PR +TPPDA RD+SI R Sbjct: 158 KCGFWPEDILGSSLENSNWSNYGCSYQNNLAAQIVNPMDLFAPRSMTPPDAVHRDRSIHR 217 Query: 241 YRQ 243 Y++ Sbjct: 218 YQE 220 >gi|15963895|ref|NP_384248.1| hypothetical protein SMc04110 [Sinorhizobium meliloti 1021] gi|307315792|ref|ZP_07595306.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti BL225C] gi|15073070|emb|CAC41529.1| Pilus assembly protein cpaD [Sinorhizobium meliloti 1021] gi|306898560|gb|EFN29233.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti BL225C] Length = 226 Score = 157 bits (397), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 74/176 (42%), Positives = 109/176 (61%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + E+ +DIP+ +G + D IRGF +Y+N S+SV+ +++P +V Sbjct: 37 DYRTRHPIVLTEGERTIDIPIASGDTRLTQGTRDVIRGFAAEYRNASSSVIQIMLPRGSV 96 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + + KDIR+++ +SG+ + E YDA D IRLSY A CG WP Sbjct: 97 NGHAAQIVRKDIRRLLAASGVSPKKMIETTYDASVTGDAAPIRLSYVAITAQTAPCGAWP 156 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ N NRN+ N+GCA Q+NLAAQ+ NP DL PR ++P DAEQR + I +R Sbjct: 157 EDLALNTLENRNYYNFGCATQSNLAAQIANPTDLVGPRQMSPIDAEQRGQVIDSWR 212 >gi|307320427|ref|ZP_07599844.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti AK83] gi|306893993|gb|EFN24762.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti AK83] Length = 226 Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 74/176 (42%), Positives = 109/176 (61%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + E+ +DIP+ +G + D IRGF +Y+N S+SV+ +++P +V Sbjct: 37 DYRTRHPIVLTEGERTIDIPIASGDTRLTQGTRDVIRGFAAEYRNASSSVIQIMLPRGSV 96 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + + KDIR+++ +SG+ + E YDA D IRLSY A CG WP Sbjct: 97 NGHAAQIVRKDIRRLLAASGVSPKKMIETTYDASVTGDAAPIRLSYVAITAQTAPCGAWP 156 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ N NRN+ N+GCA Q+NLAAQ+ NP DL PR ++P DAEQR + I +R Sbjct: 157 EDLALNTLVNRNYYNFGCATQSNLAAQIANPTDLVGPRQMSPIDAEQRGQVIDSWR 212 >gi|222147181|ref|YP_002548138.1| component of type IV pilus [Agrobacterium vitis S4] gi|221734171|gb|ACM35134.1| component of type IV pilus [Agrobacterium vitis S4] Length = 196 Score = 152 bits (383), Expect = 5e-35, Method: Compositional matrix adjust. Identities = 71/176 (40%), Positives = 103/176 (58%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ VE +D+P+ G + + D + GF + Y+N S + +L+P + Sbjct: 9 DYRTRHPIIVTDVEHSLDLPVAQGSSRLTIGMSDAVTGFAQDYRNASTGYVQILVPQGSP 68 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + + +R +++S GI I ER Y A D IRLSY A+ AG CG WP Sbjct: 69 NTMAASSIARQVRNLLVSKGIAAPKIVERPYRAGATGDAAPIRLSYVATTAVAGPCGQWP 128 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ + N+NW N+GCA Q NLAAQV +P DL +PR +TP DAE+R I YR Sbjct: 129 EDLSNDTAQNKNWQNFGCASQANLAAQVASPTDLIAPRGMTPIDAERRSTVIDNYR 184 >gi|150398542|ref|YP_001329009.1| pilus biogenesis lipoprotein CpaD [Sinorhizobium medicae WSM419] gi|150030057|gb|ABR62174.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium medicae WSM419] Length = 242 Score = 152 bits (383), Expect = 5e-35, Method: Compositional matrix adjust. Identities = 73/176 (41%), Positives = 107/176 (60%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + E+ +DIP+ +G + D IRGF +Y+N S+ V+ +++P +V Sbjct: 53 DYRTRHPIVLTEGERTIDIPVASGDTRLTQGTRDVIRGFAAEYRNASSGVVQIMLPRGSV 112 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + + K+IR+++ SG+ I E YDA D IRLSY A CG WP Sbjct: 113 NGRAAQILRKEIRRLLAGSGVSPKKIIETSYDASVTGDAAPIRLSYVAITAQTAPCGAWP 172 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ N NRN+ N+GCA Q+NLAAQ+ NP DL PR ++P DAEQR + I +R Sbjct: 173 EDLALNTLENRNYYNFGCATQSNLAAQIANPTDLVGPRRMSPIDAEQRGQVIDSWR 228 >gi|159184217|ref|NP_353253.2| components of type IV pilus [Agrobacterium tumefaciens str. C58] gi|159139546|gb|AAK86038.2| components of type IV pilus [Agrobacterium tumefaciens str. C58] Length = 193 Score = 151 bits (382), Expect = 6e-35, Method: Compositional matrix adjust. Identities = 73/176 (41%), Positives = 103/176 (58%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI + + E +DIP+ AG + + D +RGF + Y + S ++ + +PS + Sbjct: 9 DYRTRHPITLSEAEHSLDIPVSAGDSRLTTAMADNVRGFAQNYASMSTGIVNIQMPSGSP 68 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ R K IR + +G+ I E Y A D IRLSY A G+CG WP Sbjct: 69 NSATAARMAKQIRSTLSGAGVAQGKIMETRYAASPNGDSAPIRLSYVAVTAMTGQCGQWP 128 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ N N+NW N+GCA Q+NLAAQ+ NP+DL PR ++P DAE+R I YR Sbjct: 129 EDLSDNTFANKNWYNFGCASQSNLAAQIANPMDLVGPRGMSPIDAERRAVVIDTYR 184 >gi|227823972|ref|YP_002827945.1| pilus assembly protein CpaD [Sinorhizobium fredii NGR234] gi|227342974|gb|ACP27192.1| pilus assembly protein CpaD [Sinorhizobium fredii NGR234] Length = 234 Score = 149 bits (377), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 72/176 (40%), Positives = 108/176 (61%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + E+++DIP+ +G + D IRGF +Y+N S V+ +++P + Sbjct: 53 DYRTRHPIVIAEGERVIDIPVASGDRRLTAGTRDVIRGFATEYRNASGGVIQIMLPRGSA 112 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +S + + KDIR+++ +SG+P + E Y+A D IRLSY A CG WP Sbjct: 113 NSHAAQIVRKDIRRLLAASGVPPKRMIETGYEAVSPGDAAPIRLSYVAITAQTAPCGEWP 172 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ N NRN+ N+GCA Q+NLAAQ+ NP DL PR ++P DA QR + I +R Sbjct: 173 EDLTLNTLQNRNYYNFGCASQSNLAAQIANPTDLIGPRQMSPVDAAQRGEVIDAWR 228 >gi|325291658|ref|YP_004277522.1| components of type IV pilus [Agrobacterium sp. H13-3] gi|325059511|gb|ADY63202.1| components of type IV pilus [Agrobacterium sp. H13-3] Length = 252 Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 73/176 (41%), Positives = 104/176 (59%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI + + E +DIP+ AG + + D +RGF + Y + S ++ + +PS + Sbjct: 68 DYRTRHPITLSEAEHSLDIPVSAGDSRLTTAMADNVRGFAQNYASMSTGIVNIQMPSGSA 127 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + + IR + +G+P I E Y A D IRLSY A G+CG WP Sbjct: 128 NSAAASKMARQIRSALSGAGVPSGKIMETRYAASPNGDAAPIRLSYVAVTAMTGQCGQWP 187 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+ N N+NW N+GCA Q+NLAAQV NP+DL PR ++P DAE+R I YR Sbjct: 188 EDLSDNTFANKNWYNFGCASQSNLAAQVANPMDLVGPRGMSPIDAERRAVVIDAYR 243 >gi|222084470|ref|YP_002542999.1| pilus assembly protein [Agrobacterium radiobacter K84] gi|221721918|gb|ACM25074.1| pilus assembly protein [Agrobacterium radiobacter K84] Length = 235 Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 71/176 (40%), Positives = 105/176 (59%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ E VDIP+ + + DT+RGF++ Y+ + + ++ P + Sbjct: 46 DYRQRHPIVLTDKEHRVDIPVSVSDRRLTSGMRDTVRGFVQDYRAHATGTVEIMTPRESA 105 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + IR+ +++SGIP + I++ Y A D IRL + A+ CG WP Sbjct: 106 NSAAASALRRQIRQELMASGIPSARITDNYYPAGGPGDAAPIRLRFMATAAVTNACGQWP 165 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 D+ NA N+N+ N+GCA QNNLAAQV NP DL +PR +TP DA+QR K I YR Sbjct: 166 ADLADNAFDNQNYYNFGCATQNNLAAQVANPTDLIAPRAMTPIDADQRSKVIDNYR 221 >gi|304392383|ref|ZP_07374324.1| pilus assembly protein [Ahrensia sp. R2A130] gi|303295487|gb|EFL89846.1| pilus assembly protein [Ahrensia sp. R2A130] Length = 227 Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 66/178 (37%), Positives = 104/178 (58%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 S+Y+ R+PI++ + EQ +DIP+ + + GF KY+ + + ++IP + Sbjct: 35 SNYKTRHPIVIDEKEQTLDIPVGSDTVRLPRAQESATEGFASKYRRSPSGTMTIMIPRHS 94 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++++ R + +I+ G+P SSI YDA IR+SY A + S +CG W Sbjct: 95 PNASAARSMSHQVAEILRREGVPPSSIVTTSYDASRHGSAAPIRVSYHAVQASVERCGKW 154 Query: 186 PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 PED+ G N+NW N+GCA QNN+AAQ+ NP DL +PR +T DAE+R+ I+ YR+ Sbjct: 155 PEDLAGPNLDNQNWHNFGCANQNNMAAQIANPSDLVAPRGMTQADAERRNNVIEDYRE 212 >gi|110636322|ref|YP_676530.1| pilus biogenesis lipoprotein CpaD [Mesorhizobium sp. BNC1] gi|110287306|gb|ABG65365.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Chelativorans sp. BNC1] Length = 226 Score = 136 bits (342), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 65/176 (36%), Positives = 103/176 (58%), Gaps = 2/176 (1%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + EQ++D+P+ + ++ GF+E Y +V+ +L+PS + Sbjct: 37 DYRTNHPIVLSEKEQVLDLPVGVFSYRMTPQQKMSLEGFMEHYGESGKAVVTVLVPSGSP 96 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + R +DI + + G+P + YDA +R+SY + G+CG WP Sbjct: 97 NERAASRLSEDIAQFLYRRGVPKGHLQVLSYDAP-AEQASPVRVSYSVVAATTGQCGRWP 155 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+L + N+++ N+GCAYQNNLAAQ+ NP+DL PR TP DAE RD +I RY+ Sbjct: 156 EDLLDTTE-NKHYANFGCAYQNNLAAQIANPMDLLGPRKTTPIDAENRDTAIGRYK 210 >gi|163757628|ref|ZP_02164717.1| components of type IV pilus [Hoeflea phototrophica DFL-43] gi|162285130|gb|EDQ35412.1| components of type IV pilus [Hoeflea phototrophica DFL-43] Length = 233 Score = 134 bits (337), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 61/177 (34%), Positives = 101/177 (57%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + E+ VDIP+ G E+ + + +RG Y++ ++ + +++P + Sbjct: 49 DYRTNHPIIVAEQERTVDIPVGTGDRELTTSMREIVRGAAHSYRSSASGAVRIMVPVGSA 108 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + + KI+ G+P I Y D IR++Y A S KCG WP Sbjct: 109 NAGAASILSGQVAKILQKEGVPRDRILSSPYSVSSPDDAAPIRIAYLAITASTEKCGRWP 168 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 ED+ + N++W N+GCA Q+NLAAQ+ NP DL +PR ++P DAE+R I+ YR+ Sbjct: 169 EDLAADTTENKHWANFGCASQSNLAAQIANPGDLIAPRGMSPIDAERRSTIIETYRE 225 >gi|116249982|ref|YP_765820.1| pilus assembly protein [Rhizobium leguminosarum bv. viciae 3841] gi|115254630|emb|CAK05704.1| putative pilus assembly protein [Rhizobium leguminosarum bv. viciae 3841] Length = 250 Score = 132 bits (332), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 69/176 (39%), Positives = 103/176 (58%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y + ++ +++L P + Sbjct: 62 DYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLIRGFAANYISRASGPVYVLSPQGSP 121 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + +R + S GI S I Y A D IRLS+ + +CG WP Sbjct: 122 NSAAAYQLRNQVRAELTSRGIASSKIVNTSYAAVGPGDAAPIRLSFTGTTAVTTQCGQWP 181 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +D + N N+N+ N+GCA QNNLAAQ+ NP DL +PR +TP DA++R+ +IQ YR Sbjct: 182 KD-ISNDLTNQNYYNFGCASQNNLAAQIANPEDLVAPRGMTPIDAQRRNNAIQEYR 236 >gi|190889882|ref|YP_001976424.1| pilus assembly protein [Rhizobium etli CIAT 652] gi|190695161|gb|ACE89246.1| pilus assembly protein [Rhizobium etli CIAT 652] Length = 252 Score = 131 bits (330), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 70/176 (39%), Positives = 103/176 (58%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y + ++ +++L P + Sbjct: 62 DYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLIRGFAANYVSRASGPVYVLSPEGSP 121 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + +R + S GI S I Y A D IRLS+ + +CG WP Sbjct: 122 NSAAAHQLRNQVRAELASRGIASSKIINTSYAAAGAGDAAPIRLSFTGTTAITTQCGQWP 181 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +D + N N+N+ N+GCA QNNLAAQV NP DL +PR +TP DA++R+ +IQ YR Sbjct: 182 KD-ISNDFANQNYYNFGCATQNNLAAQVANPEDLVAPRGMTPIDAQRRNNAIQEYR 236 >gi|241207158|ref|YP_002978254.1| pilus biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861048|gb|ACS58715.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 235 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 69/176 (39%), Positives = 103/176 (58%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y + ++ +++L P + Sbjct: 46 DYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLIRGFATNYISRASGPVYVLSPQGSP 105 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + +R + S GI S I Y A D IRLS+ + +CG WP Sbjct: 106 NSAAAYQLRNQVRAELTSRGIASSKIVNTSYAAAGPGDAAPIRLSFTGTTAVTTQCGQWP 165 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +D + N N+N+ N+GCA QNNLAAQ+ NP DL +PR +TP DA++R+ +IQ YR Sbjct: 166 KD-ISNDLTNQNYYNFGCASQNNLAAQIANPEDLVAPRGMTPIDAQRRNNAIQEYR 220 >gi|13474661|ref|NP_106230.1| pilus assembly protein cpaD [Mesorhizobium loti MAFF303099] gi|14025416|dbj|BAB52016.1| pilus assembly protein; CpaD [Mesorhizobium loti MAFF303099] Length = 251 Score = 130 bits (327), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 66/176 (37%), Positives = 101/176 (57%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + Q +D+P+ AG + DT+ GFL+ Y +A L + IPS + Sbjct: 61 DYRTNHPIVIAEKNQKIDLPVGAGDRGMTGSQRDTLLGFLDGYDKSAAPTLTIQIPSGSA 120 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + A +D ++ ++SG+ + I Y A +R+SY A + KCG WP Sbjct: 121 NEVAATAAGRDFARLAVASGVKRNRIVVVSYQAGSSETSAPVRVSYIAVRAQTDKCGRWP 180 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED+L ++ N+++ ++GC+YQNNLAAQ+ NP DL PR T DAE R K I YR Sbjct: 181 EDLLETSE-NKHYADFGCSYQNNLAAQMANPADLLGPRKQTTIDAENRGKVIDVYR 235 >gi|327194693|gb|EGE61539.1| pilus assembly protein [Rhizobium etli CNPAF512] Length = 252 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 69/176 (39%), Positives = 102/176 (57%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y + ++ +++L P + Sbjct: 62 DYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLIRGFAANYVSRASGPVYVLSPEDSP 121 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +S + + +R + S GI S I Y A D IRLS+ + +CG WP Sbjct: 122 NSTAAHQLRNQVRAELASRGIASSKIINTSYAAAGAGDAAPIRLSFTGTTAITTQCGQWP 181 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +D + N N+N+ N+GCA QNNLAAQV NP DL +PR +TP DA++R+ +IQ YR Sbjct: 182 KD-ISNDFANQNYYNFGCATQNNLAAQVANPEDLVAPRGMTPIDAQRRNNAIQEYR 236 >gi|209551760|ref|YP_002283677.1| pilus biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537516|gb|ACI57451.1| pilus biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 244 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 69/176 (39%), Positives = 102/176 (57%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y ++ +++L P + Sbjct: 54 DYRARHPIIVTEAEQTVDIPVASTDRRLTNAQRDLIRGFAANYIARASGPVYVLSPQGSP 113 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + +R + S GI S I Y A D IRLS+ + +CG WP Sbjct: 114 NSAAAYQLRNQVRAELASRGIASSKIVNTSYAAVGPGDAAPIRLSFTGTTAITTQCGQWP 173 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +D + N N+N+ N+GCA QNNLAAQ+ NP DL +PR +TP DA++R+ +IQ YR Sbjct: 174 KD-ISNDFTNQNYYNFGCASQNNLAAQIANPEDLVAPRGMTPIDAQRRNNAIQEYR 228 >gi|260461948|ref|ZP_05810193.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Mesorhizobium opportunistum WSM2075] gi|259032195|gb|EEW33461.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Mesorhizobium opportunistum WSM2075] Length = 243 Score = 129 bits (323), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 65/176 (36%), Positives = 103/176 (58%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + Q +D+P+ AG + DT+ GFL+ Y +A L + IPS + Sbjct: 53 DYRTNHPIVIAEKNQKIDLPVGAGDRGMTGSQRDTLLGFLDGYDKSAAPALTIQIPSGSA 112 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + R A +D ++ ++SGI + I+ Y A +R+S+ A + KCG WP Sbjct: 113 NEVAARAAGRDFARLAVASGIKRNRIAVVSYQAGSSEASAPVRVSFIAVRAQTDKCGRWP 172 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 ED++ +++ N+++ ++GC+YQNNLAAQ+ NP DL PR T DAE R I YR Sbjct: 173 EDLVESSE-NKHYADFGCSYQNNLAAQMANPADLLGPRKQTTIDAENRGAVIDVYR 227 >gi|86355865|ref|YP_467757.1| pilus assembly protein [Rhizobium etli CFN 42] gi|86279967|gb|ABC89030.1| pilus assembly protein [Rhizobium etli CFN 42] Length = 233 Score = 128 bits (322), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 68/176 (38%), Positives = 103/176 (58%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y + ++ +++L P + Sbjct: 43 DYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLIRGFAANYISRASGPVYVLSPEGSP 102 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 +SA+ + +R + + GI S I Y A D IRLS+ + +CG WP Sbjct: 103 NSAAADQLRNQVRAELTTRGIASSKIINTSYAAAGAGDAAPIRLSFTGTTAITTQCGQWP 162 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +D + N N+N+ N+GCA QNNLAAQ+ NP DL +PR +TP DA++R+ +IQ YR Sbjct: 163 KD-ISNDLANQNYYNFGCASQNNLAAQIANPEDLVAPRGMTPIDAQRRNNAIQEYR 217 >gi|254503403|ref|ZP_05115554.1| pilus biogenesis lipoprotein CpaD [Labrenzia alexandrii DFL-11] gi|222439474|gb|EEE46153.1| pilus biogenesis lipoprotein CpaD [Labrenzia alexandrii DFL-11] Length = 221 Score = 121 bits (304), Expect = 8e-26, Method: Compositional matrix adjust. Identities = 58/176 (32%), Positives = 98/176 (55%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR ++PI++ + + +D+P+ ++ P+ DTI F + + + +L+P+ Sbjct: 27 DYRYQHPIVVSEAPETLDLPVGKNTRNLRSPVTDTITSFAMDSRRHGSGNVEILVPTGAA 86 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + +++ V DIR + G+ ++ R Y + IRLSY K + G+CG WP Sbjct: 87 NESAVHAVVHDIRGALSRGGVNGKHVTTRTYRSTDSSADAPIRLSYARMKATTGECGAWP 146 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +++ G N N+ N+GCA Q+NLAA V NP DL +PR +TP D +R I++YR Sbjct: 147 KNIGGGIGENTNYYNFGCATQSNLAAIVENPSDLITPRAMTPSDQNRRAVVIEKYR 202 >gi|319785607|ref|YP_004145083.1| pilus biogenesis lipoprotein CpaD [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171495|gb|ADV15033.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 245 Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 59/177 (33%), Positives = 100/177 (56%), Gaps = 1/177 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + Q +D+P+ AG + DT+ GFL+ Y +A L + +PS + Sbjct: 55 DYRTNHPIVIAEKNQKIDLPVGAGDRGMTGSQRDTLLGFLDGYDRSAAPTLTIQVPSGSA 114 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + A +D ++ ++SGI + I Y + IR++Y + K KCG WP Sbjct: 115 NEVAATTAARDFARLAVASGIKRNRIVVTSYQSASAEASAPIRVAYISVKAQTDKCGRWP 174 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 ED++ ++ N+++ ++GC+YQNNLAAQ+ NP DL PR D R ++I Y++ Sbjct: 175 EDLMETSE-NKHYADFGCSYQNNLAAQMANPADLLGPRKSANIDPANRSQAIDVYQK 230 >gi|114705455|ref|ZP_01438363.1| pilus assembly protein [Fulvimarina pelagi HTCC2506] gi|114540240|gb|EAU43360.1| pilus assembly protein [Fulvimarina pelagi HTCC2506] Length = 230 Score = 119 bits (297), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 63/209 (30%), Positives = 109/209 (52%), Gaps = 5/209 (2%) Query: 38 FLRTLMLGQLFFLLLFYGTSALAYYDE-GS---DYRDRYPILMRKVEQIVDIPLLAGRGE 93 +R L L + L G ++ E GS DYR R+PI++ + ++ +DIP++ + Sbjct: 5 IIRPLATASLVGIALALGACGNVHHIEVGSVPDDYRTRHPIVVSEADEAIDIPVVTSDQK 64 Query: 94 IKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 + I F +++ A + +L+P + ++ + + + ++ +G+ I Sbjct: 65 LAMSDSGRIEDFAHRFRRSGADTMTVLVPYGSRNAVAASSISHEAIRTLMKAGVRREQIV 124 Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQ 213 + Y A + IRL++ G CG WPED L + N+N+ N+GCA Q NLAAQ Sbjct: 125 MQSYAAHDALGPTPIRLTFSTLVAQTGPCGRWPED-LNSTHENKNYANFGCATQQNLAAQ 183 Query: 214 VVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 + +P DL SPR + P D E+RD+ I++YR Sbjct: 184 IADPRDLLSPRGMGPVDGERRDQVIEKYR 212 >gi|118589704|ref|ZP_01547109.1| components of type IV pilus [Stappia aggregata IAM 12614] gi|118437790|gb|EAV44426.1| components of type IV pilus [Stappia aggregata IAM 12614] Length = 238 Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 63/178 (35%), Positives = 96/178 (53%), Gaps = 2/178 (1%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + + +D+P+ I PI TI F ++ + + +L+PS Sbjct: 43 DYRLMHPIVITEEPETLDLPVGRNTRNINGPIESTIAAFGQQSRQKGNGSVEILVPSGGA 102 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYD-ADYGMDVDTIRLSYFASKPSAGKCGFW 185 + A++ IR+ + G+ + IS R Y D G D IRLSY + +AG+CG W Sbjct: 103 NEAAVHSITPKIRQALQQGGVSRNRISTRTYSVGDPGADA-PIRLSYARMQATAGECGAW 161 Query: 186 PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 P ++ G N N+ N+GCA Q NLAA V NP DL +PR P D +R I++YR+ Sbjct: 162 PRNIGGGFGENINYENFGCASQANLAAMVDNPSDLITPRASAPSDQGRRAVVIEKYRK 219 >gi|153008060|ref|YP_001369275.1| pilus biogenesis lipoprotein CpaD [Ochrobactrum anthropi ATCC 49188] gi|151559948|gb|ABS13446.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Ochrobactrum anthropi ATCC 49188] Length = 239 Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 57/174 (32%), Positives = 98/174 (56%), Gaps = 4/174 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI + + EQ+ DIP+ ++ ++G + Y+ + +L++L+PS T Sbjct: 42 DYRTNHPITIAEREQVTDIPVAQADQKLSPMQRGIVQGAIANYRRGGSGMLYVLVPSGTS 101 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + A+ R ++ ++ GI ++I+ Y + IR+SY+A CG WP Sbjct: 102 NQAAAYRLSTEVSAMLRRGGIKANNIAIENYPVENPEAAAPIRISYYAMTAGTTPCGRWP 161 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 +D L + N+++ N+GCA QNNLAAQV NP DL PR+++P D+ D++ +R Sbjct: 162 DD-LASTPENKHYANFGCASQNNLAAQVANPADLLGPRVMSPIDS---DRTTER 211 >gi|328545278|ref|YP_004305387.1| Pilus biogenesis lipoprotein CpaD [polymorphum gilvum SL003B-26A1] gi|326415020|gb|ADZ72083.1| Pilus biogenesis lipoprotein CpaD [Polymorphum gilvum SL003B-26A1] Length = 239 Score = 112 bits (280), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 57/177 (32%), Positives = 93/177 (52%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI++ + + +D+P+ + + + F + + + + +L+PS Sbjct: 42 NDYRLRHPIVITEQAETLDLPVGQSTRNLNRDFAERVTEFGQASRRNGNGHVEILVPSGA 101 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 + A++ IR + G+ + +S R Y D IRL+Y K SAG CG W Sbjct: 102 ANEAAVHAVTPRIRSALALGGVSGTHVSTRSYPVDDATAQAPIRLAYTRIKASAGPCGEW 161 Query: 186 PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 P ++ G+ N+++ N+GCA Q NLAA V NP DL PR +TP D +R Q+YR Sbjct: 162 PANIGGSLNANQDYYNFGCATQANLAAMVDNPADLLGPRAMTPADQMRRATVFQKYR 218 >gi|299132287|ref|ZP_07025482.1| pilus biogenesis lipoprotein CpaD [Afipia sp. 1NLS2] gi|298592424|gb|EFI52624.1| pilus biogenesis lipoprotein CpaD [Afipia sp. 1NLS2] Length = 246 Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 57/181 (31%), Positives = 97/181 (53%), Gaps = 5/181 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++++ + +I + RG + I G + + ++ + + +P+ T Sbjct: 44 DYRLRHPIVIQEASKTTEIFVGHARGGLTTAQRADIVGLSQAWLSEGTGAITIDVPTGTP 103 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + V+DI+ I+ ++GIP I Y + +R+ Y AG CG WP Sbjct: 104 NAQAASVTVRDIQNILAAAGIPPKGIRVMPYHPNDPRQFAPVRVRYARIIADAGPCGLWP 163 Query: 187 EDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 ED+ + K NR++ N+GCAYQ N+AA V NP DL PR TPP++ +R ++ +Y Sbjct: 164 EDLGPSVKNKSYFENRSYQNFGCAYQRNMAAMVANPADLVQPRAETPPNSARRTEAFAKY 223 Query: 242 R 242 R Sbjct: 224 R 224 >gi|239833236|ref|ZP_04681565.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Ochrobactrum intermedium LMG 3301] gi|239825503|gb|EEQ97071.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Ochrobactrum intermedium LMG 3301] Length = 239 Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 55/167 (32%), Positives = 92/167 (55%), Gaps = 1/167 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI + + EQ+ DIP+ ++ ++G + Y+ + +L++L+PS Sbjct: 42 DYRTNHPITIAEREQVTDIPIAQADQKLSPMQRGIVQGAIANYRRSGSGMLYVLVPSGAS 101 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + A+ R ++ + SGI ++I+ Y + IR+SY+A CG WP Sbjct: 102 NQAAAYRLSTEVAATLRRSGIKANNIAIENYPVESPDAAAPIRISYYAITAGTTPCGRWP 161 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQ 233 +D L + N+++ N+GC QNNLAAQV NP DL PR +TP D+++ Sbjct: 162 DD-LASTPENKHYANFGCVSQNNLAAQVANPADLLGPRTMTPIDSDR 207 >gi|307943146|ref|ZP_07658491.1| pilus biogenesis lipoprotein CpaD [Roseibium sp. TrichSKD4] gi|307773942|gb|EFO33158.1| pilus biogenesis lipoprotein CpaD [Roseibium sp. TrichSKD4] Length = 237 Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 64/208 (30%), Positives = 104/208 (50%), Gaps = 5/208 (2%) Query: 40 RTLMLGQLFFLLLFYGTSALAYYDEGSD-----YRDRYPILMRKVEQIVDIPLLAGRGEI 94 RT + F LL G ++ S+ Y+ R+PI++ + +++D+P+ A + Sbjct: 10 RTAIGATFFVSLLLAGCNSQTVGQNNSNLAATNYQLRHPIVVTEQPEVLDLPIGAHMRNL 69 Query: 95 KYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISE 154 P+ T++ F + + +L+PS + A++ V IR + + G+ +IS Sbjct: 70 NGPLRGTVKAFGADSRKKGNGRVEILVPSGGRNEAAVHALVPQIRSSLKAGGLSGGAIST 129 Query: 155 RIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQV 214 R Y D IRLSY + +AG CG W D+ N ++ NYGCA Q+NLAA V Sbjct: 130 RSYAVDNPSADAPIRLSYPRIQATAGPCGTWNGDIGRTFDRNVDYENYGCATQSNLAAMV 189 Query: 215 VNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR P D +R +++YR Sbjct: 190 ENPSDLLTPRASAPADRMRRANVVEKYR 217 >gi|209886531|ref|YP_002290388.1| pilus assembly protein CpaD [Oligotropha carboxidovorans OM5] gi|209874727|gb|ACI94523.1| pilus assembly protein CpaD [Oligotropha carboxidovorans OM5] Length = 247 Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 53/181 (29%), Positives = 96/181 (53%), Gaps = 5/181 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++++ + +I + RG + I G + + ++ + + P+ T Sbjct: 45 DYRARHPIVIQEAAKTTEIFVGHARGGLTTAQRTDIAGLAQAWLSEGTGAITIDTPTGTP 104 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + V+DI+ +++++GIP + + Y + +R+ Y AG CG WP Sbjct: 105 NAQAASVTVRDIQNMLVAAGIPARGVKVQPYHPNDPRQFAPVRVRYARIIADAGPCGLWP 164 Query: 187 EDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 ED+ + K NR + N+GCAYQ N+AA V NP DL PR +P ++ +R ++ +Y Sbjct: 165 EDLGPSVKNKSYFENRPYQNFGCAYQRNMAAMVANPADLVQPRAESPSNSARRSQAFTKY 224 Query: 242 R 242 R Sbjct: 225 R 225 >gi|146338122|ref|YP_001203170.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. ORS278] gi|146190928|emb|CAL74933.1| Putative pilus assembly protein cpaD; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 246 Score = 101 bits (252), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 56/183 (30%), Positives = 94/183 (51%), Gaps = 7/183 (3%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI +++ + + + GRG + + G + + + +PS T Sbjct: 44 DYRLRHPIAVQEAPDSLVVFVGQGRGGLTAEQRAEVMGLAQSWMRQGTGAIVADVPSGTP 103 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + ++++I+ + ++G+P ++ R Y + IRLSY +AG CG WP Sbjct: 104 NARAAADSMREIQSLFSAAGVPPHGVTVRNYQPKDPRQMAAIRLSYPKLSATAGPCGLWP 163 Query: 187 EDMLGNAKGNRNW------TNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 +D LG + N+NW N+GCAYQ N+AA V NP DL PR TP +R ++ Sbjct: 164 DD-LGPSVKNKNWFDNKPDWNFGCAYQRNMAAMVDNPADLVQPRPETPSYTTRRTALFEK 222 Query: 241 YRQ 243 YR+ Sbjct: 223 YRK 225 >gi|115525750|ref|YP_782661.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris BisA53] gi|115519697|gb|ABJ07681.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris BisA53] Length = 249 Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 55/182 (30%), Positives = 98/182 (53%), Gaps = 5/182 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI +++ +Q ++I + GRG + P I + + +++ + + P+ T Sbjct: 47 DYRQRHPIAIQEADQTLNIFVGTGRGGLTGPQRAAIAAVAQSWLSEATGRIVIDQPAQTP 106 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + +V++IR ++ ++GIP ++++ R Y IR++Y +AG CG WP Sbjct: 107 NARAAADSVREIRALLAAAGIPTNAVAVREYQPSDPRLFAAIRVNYPRLVATAGPCGLWP 166 Query: 187 EDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 +D+ + N+ N+GCA Q NLAA V NP DL PR TP +R + +Y Sbjct: 167 DDLGPSVNNPGYIENKPSYNHGCAVQRNLAAMVENPADLVQPRAETPAYTARRTIAFDKY 226 Query: 242 RQ 243 R+ Sbjct: 227 RK 228 >gi|39936741|ref|NP_949017.1| pilus assembly protein cpaD [Rhodopseudomonas palustris CGA009] gi|192292567|ref|YP_001993172.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris TIE-1] gi|39650597|emb|CAE29120.1| possible pilus assembly protein cpaD [Rhodopseudomonas palustris CGA009] gi|192286316|gb|ACF02697.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Rhodopseudomonas palustris TIE-1] Length = 242 Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 56/182 (30%), Positives = 92/182 (50%), Gaps = 5/182 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +R+ ++ V++ + GRG + + + + + + +PS T Sbjct: 44 TDYRQRHPIAIREADRTVEVFVGNGRGGLTPVQRAEVAELGQTWLREGTGAIIAEVPSDT 103 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++ + +++I+ ++ ++G+P ++ + Y TIRL Y AG CG W Sbjct: 104 PNARAASDTIREIQSVLAANGVPARGVTVKHYRPADPRTFATIRLIYPKVTAVAGPCGLW 163 Query: 186 PEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 PED+ + K NR + N GCA Q NLAA V NP DL PR TPP +R + Sbjct: 164 PEDLGPSIKNKGYYDNRPYWNLGCANQRNLAAMVENPSDLVQPRPETPPYTARRAVTYDT 223 Query: 241 YR 242 YR Sbjct: 224 YR 225 >gi|90425198|ref|YP_533568.1| pilus assembly protein CpaD [Rhodopseudomonas palustris BisB18] gi|90107212|gb|ABD89249.1| pilus assembly protein CpaD [Rhodopseudomonas palustris BisB18] Length = 246 Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 62/217 (28%), Positives = 108/217 (49%), Gaps = 11/217 (5%) Query: 38 FLRTLMLGQLFF--LLLFYGTSALAYYDEGS----DYRDRYPILMRKVEQIVDIPLLAGR 91 LR L LG L+ G + + GS DYR R+PI +++ +Q + I GR Sbjct: 9 HLRRLRLGGALVAVCLVLGGCNHTSDEVTGSIVPDDYRQRHPIAIQEADQTLIIFAGTGR 68 Query: 92 GEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSS 151 G + + + + + + + +P+ T ++ + ++++IR ++ + G+P ++ Sbjct: 69 GGLTGAQRADVASLAQTWLREGTGPIVIDLPTHTPNARAAADSLREIRALLAAQGLPPNA 128 Query: 152 ISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAY 206 ++ R Y IR++Y +AG CG WP+D+ ++K N+ + N GCA Sbjct: 129 VTVRDYQPRDTRQFAAIRVNYPRLTATAGPCGLWPDDLGSSSKNHDYFENKPYWNLGCAS 188 Query: 207 QNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 Q NLAA V NP DL PR TP +R S ++YR+ Sbjct: 189 QRNLAAMVDNPADLVQPRGETPAYTARRTNSFEKYRK 225 >gi|90419765|ref|ZP_01227674.1| putative pilus assembly protein cpaD [Aurantimonas manganoxydans SI85-9A1] gi|90335806|gb|EAS49554.1| putative pilus assembly protein cpaD [Aurantimonas manganoxydans SI85-9A1] Length = 244 Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 54/177 (30%), Positives = 93/177 (52%), Gaps = 1/177 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + ++ +DIP++ + Y + F ++++ A + +++P+ + Sbjct: 54 DYRTRHPIVVSEDQEAIDIPIVMSDARLSYANRGRVEHFGDRFRASGADSIQVMLPTGSA 113 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + + R +I + + I I + Y A IRL+Y G CG WP Sbjct: 114 NQYAAERVSHEIVEALRGRYISRDRIFVQPYSAVGAEGPTPIRLTYATLVAKTGPCGRWP 173 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 +DM ++ N+N+ N+GCA Q NLAAQ+ +P DL SPR V DA +R + YR+ Sbjct: 174 DDMTDTSE-NKNYFNFGCASQQNLAAQIADPRDLLSPRGVDSIDAGRRTTVLDNYRR 229 >gi|91977986|ref|YP_570645.1| Type IV pili component-like [Rhodopseudomonas palustris BisB5] gi|91684442|gb|ABE40744.1| Type IV pili component-like [Rhodopseudomonas palustris BisB5] Length = 243 Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 55/182 (30%), Positives = 92/182 (50%), Gaps = 5/182 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +++ + V + + GRG + + F + + + + +P+ T Sbjct: 45 TDYRQRHPIAIQEGDHTVVVFVGNGRGGLTTTQRADVAAFGQGWLREGTGSIIAEVPADT 104 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++ + +++I+ ++ + G+P + + Y IRL Y AG CG W Sbjct: 105 PNARAAGETLREIQSLLAAGGVPQRGVIVKPYRPTDPRAFAAIRLIYPKVSAVAGPCGLW 164 Query: 186 PEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 PED+ + K N+ + N+GCAYQ NLAA V NP DL PR TPP A +R + + Sbjct: 165 PEDIGPSIKNKGYLDNKPYWNFGCAYQRNLAAMVENPSDLVQPRPETPPYAARRATTFEA 224 Query: 241 YR 242 YR Sbjct: 225 YR 226 >gi|27376549|ref|NP_768078.1| pilus assembly protein [Bradyrhizobium japonicum USDA 110] gi|27349690|dbj|BAC46703.1| pilus assembly protein [Bradyrhizobium japonicum USDA 110] Length = 244 Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 54/182 (29%), Positives = 93/182 (51%), Gaps = 7/182 (3%) Query: 68 YRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVS 127 Y+ R+PI + + + + + + RG + + G + ++ + + PS T + Sbjct: 43 YKQRHPIAIEEQNRSIVVFVGHARGGLTAAQRADVMGLASAWLHEGTGAIHIDAPSGTPN 102 Query: 128 SASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPE 187 + + ++++I+ ++ ++G+P I R Y + + IRL+Y AG CG WPE Sbjct: 103 ARPVAESMREIQAMLAAAGVPPRGIIARPYQPEDKRFLPPIRLTYSKIAAVAGPCGLWPE 162 Query: 188 DMLGNAKGNRNW------TNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 D +G + N+ W NYGCAYQ NLAA V NP DL PR TP +R + ++Y Sbjct: 163 D-IGPSMKNKGWFENKEYYNYGCAYQRNLAAMVDNPSDLEQPRPETPSYTTRRTAAFEKY 221 Query: 242 RQ 243 R+ Sbjct: 222 RK 223 >gi|148256958|ref|YP_001241543.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. BTAi1] gi|146409131|gb|ABQ37637.1| Putative pilus assembly protein cpaD [Bradyrhizobium sp. BTAi1] Length = 247 Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 53/182 (29%), Positives = 91/182 (50%), Gaps = 5/182 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DY+ R+PI + + Q + + + +GRG + Y + G ++ + + +P+ T Sbjct: 45 DYKIRHPIAVEEGRQSIVVFVGSGRGGLTYQQRADVAGLARSWQREGTGAIVAEVPADTP 104 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + ++I ++ S G+P SI+ R Y D + +RLSY AG CG WP Sbjct: 105 NARAAADTYREIHAMLTSGGVPSRSITLRHYTPDDPRLLAAVRLSYPKIAAVAGPCGLWP 164 Query: 187 EDMLGNA-----KGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 +D+ N N+++ N+GCA Q NLAA + NP DL PR +R ++Y Sbjct: 165 DDLGPNIDNPSYSNNQHYHNFGCATQRNLAAMIDNPADLEQPRAEVAAYTPRRSALFEKY 224 Query: 242 RQ 243 R+ Sbjct: 225 RK 226 >gi|85713506|ref|ZP_01044496.1| pilus assembly protein CpaD [Nitrobacter sp. Nb-311A] gi|85699410|gb|EAQ37277.1| pilus assembly protein CpaD [Nitrobacter sp. Nb-311A] Length = 245 Score = 97.4 bits (241), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 54/183 (29%), Positives = 96/183 (52%), Gaps = 5/183 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +++ ++ V+I + RG + + G + + + +P+ T Sbjct: 42 NDYRLRHPIAIQEADRTVNIFVGNTRGGLTAAQRADVIGLASVWLREGTGAIIAEVPAET 101 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++ + + K++R ++ ++G+P I R Y+ TIRL+Y AG CG W Sbjct: 102 RNARAAASSFKEVRSLLTAAGVPPRGIIVRHYNPADPRLFATIRLTYPRIAAVAGPCGVW 161 Query: 186 PEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 P+D+ + K N+ + N+GCA Q NLA+ + NP DL PR TPP +R + ++ Sbjct: 162 PDDLGPSIKNRGYLDNKPYWNFGCATQRNLASMIDNPSDLVQPRPETPPYTARRTEGFEK 221 Query: 241 YRQ 243 YR+ Sbjct: 222 YRK 224 >gi|75674498|ref|YP_316919.1| pilus assembly protein CpaD [Nitrobacter winogradskyi Nb-255] gi|74419368|gb|ABA03567.1| pilus assembly protein CpaD [Nitrobacter winogradskyi Nb-255] Length = 246 Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 54/182 (29%), Positives = 95/182 (52%), Gaps = 5/182 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI +++ ++ V+I + RG + P + G + + + +P+ T Sbjct: 44 DYRLRHPIAIQEADRTVNIFIGNTRGGLTAPQRADVVGLASVWLREGTGAIVAEVPTGTG 103 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + + +++R ++ ++G+P I R Y TIRL+Y AG CG WP Sbjct: 104 NARAAADSFREVRSLLAAAGVPPRGIIVRHYHPADPRLFATIRLTYPRIAAVAGPCGVWP 163 Query: 187 EDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 +D+ + K N+ + N+GCA Q NLA+ + NP DL PR TP +R +S ++Y Sbjct: 164 DDIGPSVKNRGYLDNKPYWNFGCATQRNLASMIDNPSDLVQPRPETPAYTARRTQSFEKY 223 Query: 242 RQ 243 R+ Sbjct: 224 RK 225 >gi|148258236|ref|YP_001242821.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. BTAi1] gi|146410409|gb|ABQ38915.1| Putative pilus assembly protein cpaD [Bradyrhizobium sp. BTAi1] Length = 244 Score = 95.5 bits (236), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 54/183 (29%), Positives = 91/183 (49%), Gaps = 7/183 (3%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI +++ + + + GRG + + + + + +P+ T Sbjct: 42 DYRLRHPIAVQEAPDSLVVFVGQGRGGLTAEQRAEVMALAQSWLRQGTGAISADVPTGTP 101 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + ++++I+ + ++G+P ++ R Y + IRLSY +AG CG WP Sbjct: 102 NARAAGDSMREIQSLFAAAGVPPHGLTVRNYQPKDPRQMAAIRLSYPKMSATAGPCGVWP 161 Query: 187 EDMLGNAKGNRNW------TNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 +D LG N+NW N+GCAYQ N+AA V NP DL PR TP +R + Sbjct: 162 DD-LGPTIKNKNWFENKPDWNFGCAYQRNMAAMVDNPADLVQPRAETPSYTTRRTALFDK 220 Query: 241 YRQ 243 YR+ Sbjct: 221 YRK 223 >gi|27375776|ref|NP_767305.1| pilus assembly protein [Bradyrhizobium japonicum USDA 110] gi|27348914|dbj|BAC45930.1| pilV [Bradyrhizobium japonicum USDA 110] Length = 244 Score = 94.7 bits (234), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 56/183 (30%), Positives = 91/183 (49%), Gaps = 5/183 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +++ ++ + I + RG + + G + + + + +P + Sbjct: 41 TDYRQRHPIAVQEAKKSIVIFVGKARGGLSAAQQSDVAGTARDWVREGTGSVVVDVPIGS 100 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 +S + +IR ++ S G+P +I + Y + + TIRLSY AG CG W Sbjct: 101 ANSRAAATTYHEIRSVLASGGVPSRAIVQHPYRPEDPGLLPTIRLSYSRIAAVAGPCGLW 160 Query: 186 PEDMLGNA-----KGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 PED+ + N+ + N GCA Q NLAA + NP DL PR TP +RD + R Sbjct: 161 PEDVGPSILDPGYNENQPYFNLGCASQRNLAAMIDNPADLEQPRAETPVYTARRDIAFDR 220 Query: 241 YRQ 243 YR+ Sbjct: 221 YRK 223 >gi|316933038|ref|YP_004108020.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris DX-1] gi|315600752|gb|ADU43287.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Rhodopseudomonas palustris DX-1] Length = 242 Score = 94.7 bits (234), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 52/168 (30%), Positives = 86/168 (51%), Gaps = 5/168 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +R+ ++ V++ + GRG + + + + + + +PS T Sbjct: 44 TDYRQRHPIAIREADRTVEVFVGNGRGGLTALQRAEVAELGQAWLREGTGAIIAEVPSDT 103 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++ + +++I+ ++ ++G+P ++ + Y TIRL Y AG CG W Sbjct: 104 PNARAASDTIREIQSVLSANGVPPRGVTVKHYRPADPRTFATIRLIYPKITAVAGPCGLW 163 Query: 186 PEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTP 228 PED+ + K NR + N GCA Q NLAA V NP DL PR TP Sbjct: 164 PEDIGPSIKNKGYYDNRPYWNLGCANQRNLAAMVENPADLVQPRPETP 211 >gi|86748906|ref|YP_485402.1| Type IV pili component-like [Rhodopseudomonas palustris HaA2] gi|86571934|gb|ABD06491.1| Type IV pili component-like [Rhodopseudomonas palustris HaA2] Length = 242 Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 54/182 (29%), Positives = 91/182 (50%), Gaps = 5/182 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +++ ++ V + + GRG + + F +++ + + +PS T Sbjct: 44 NDYRQRHPIAIQEADRSVVVFVGNGRGGLTATQRADVAAFGKEWLREGTGSIIAEVPSGT 103 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++ + +++I+ ++ S G+P + + Y IRL Y AG CG W Sbjct: 104 PNARAASDTMREIQSLLTSGGVPARGVIVKPYQPADPRSFAAIRLLYPKVAAVAGPCGLW 163 Query: 186 PEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 PED+ + K N+ + N+GCA Q NLAA V NP DL PR TP +R + + Sbjct: 164 PEDIGPSIKNKGYLDNKPYWNFGCANQRNLAAMVENPSDLVQPRPETPAYTARRGVTFET 223 Query: 241 YR 242 YR Sbjct: 224 YR 225 >gi|146342079|ref|YP_001207127.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. ORS278] gi|146194885|emb|CAL78910.1| Putative pilus assembly protein cpaD [Bradyrhizobium sp. ORS278] Length = 256 Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 52/182 (28%), Positives = 91/182 (50%), Gaps = 5/182 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DY+ R+PI + + Q + + + +GRG + P + G ++ + + +P+ T Sbjct: 54 DYKMRHPIAIEEGRQSIVVFIGSGRGGLTMPQRADVAGLARSWRREGTGAIVADVPAGTP 113 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 ++ + + A ++I ++ G+P +I+ R D + IRLSY AG CG W Sbjct: 114 NARAAQDAYREIHAMLTEGGVPSRAITMRHPTPDDPRQLAVIRLSYPKIAAVAGPCGLWQ 173 Query: 187 EDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 +D+ N N+++ N+GCA Q NLAA + NP DL PR T +R ++Y Sbjct: 174 DDLGPNINNPGYSSNQHYQNFGCATQRNLAAMIDNPADLEQPRSETAAYTPRRSALFEKY 233 Query: 242 RQ 243 R+ Sbjct: 234 RK 235 >gi|92116012|ref|YP_575741.1| pilus assembly protein CpaD [Nitrobacter hamburgensis X14] gi|91798906|gb|ABE61281.1| pilus assembly protein CpaD [Nitrobacter hamburgensis X14] Length = 246 Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 56/183 (30%), Positives = 92/183 (50%), Gaps = 5/183 (2%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 +DYR R+PI +++ + V+I + RG + + G + + + P T Sbjct: 43 NDYRLRHPIAIQEANRTVNIFVGNTRGGLSASQRADVVGLASVWLREGTGAIVAEAPMGT 102 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++ + +++++R ++ ++G+P I R Y TIRL+Y AG CG W Sbjct: 103 SNARAAADSLREVRSLLTAAGVPPRGIIVRHYHPADPRLFATIRLTYPQIAAVAGPCGVW 162 Query: 186 PEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 PED+ + K N+ + N+GCA Q NLAA V NP DL PR TP +R ++ + Sbjct: 163 PEDLGPSIKNKGYLDNKPYWNFGCASQRNLAAMVDNPSDLVQPRPETPTYTARRTYALDK 222 Query: 241 YRQ 243 YRQ Sbjct: 223 YRQ 225 >gi|254473746|ref|ZP_05087141.1| components of type IV pilus [Pseudovibrio sp. JE062] gi|211957132|gb|EEA92337.1| components of type IV pilus [Pseudovibrio sp. JE062] Length = 218 Score = 92.8 bits (229), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 53/176 (30%), Positives = 87/176 (49%), Gaps = 1/176 (0%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ +V + DIP+ + + + F ++ D + +L+PS + Sbjct: 21 DYRKRHPIVITEVPENFDIPVSGEARNLNRSLKTAVAAFGQQAIVDGNGFVEVLVPSGSA 80 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 + A++R IR + G+ + I R Y +RLS+ K + CG WP Sbjct: 81 NEAAVRAISPQIRTALKQGGMEANKIVMRSYSVSDMAASAPVRLSFMRIKGAVRDCGNWP 140 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 M+ N + N ++ N+GCA Q NLAA V NP DL PR + P D + + + + R Sbjct: 141 TGMVVNHQ-NLDYHNFGCASQANLAAVVDNPTDLLRPRTLGPNDPSRTNVVLTKNR 195 >gi|323137426|ref|ZP_08072504.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylocystis sp. ATCC 49242] gi|322397413|gb|EFX99936.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylocystis sp. ATCC 49242] Length = 254 Score = 87.4 bits (215), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 65/220 (29%), Positives = 103/220 (46%), Gaps = 23/220 (10%) Query: 39 LRTLMLGQLFFLLLFYGTSA----------LAYYDEGSDYRDRYPILMRKVEQIVDIPLL 88 LR L G++ LLL A +A YD Y DR+P+++ + ++D+ Sbjct: 22 LRALRAGKVLALLLTAPLGACGVNRVLPPPVAPYD----YHDRHPVVLAEAPHVIDLFPS 77 Query: 89 AGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIP 148 G + Y I+ F+E Y+ + LL P SA+ V +R+ + ++G+ Sbjct: 78 VVHGGVDYTTEGRIKEFVEHYREFGHGQVTLLTPVGAPYSAA---GVSAVRRALAAAGL- 133 Query: 149 VSSISERIYDADYGMDVDTIRLSYFASKPS-AGKCGFWPEDMLGNAK----GNRNWTNYG 203 +I Y IRLS+ + K +G+CG WP D+ N+++ N+G Sbjct: 134 RGNILVGTYSVTDPRLAAPIRLSFQSLKAKVSGRCGEWPTDLASGTSLQGWENQSYWNFG 193 Query: 204 CAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 CA Q L+AQV +P DL PR T D E R ++I R R+ Sbjct: 194 CASQQTLSAQVADPRDLAVPRGETASDIEMRMRAINRVRR 233 >gi|300021855|ref|YP_003754466.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Hyphomicrobium denitrificans ATCC 51888] gi|299523676|gb|ADJ22145.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Hyphomicrobium denitrificans ATCC 51888] Length = 245 Score = 87.4 bits (215), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 58/178 (32%), Positives = 91/178 (51%), Gaps = 6/178 (3%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 D R+PIL+ + ++++ + AG + + F+++++ A +I +P Sbjct: 50 DPEQRHPILVSQQPAVLNLHVAAGSEGLTPSQRSRVIDFIDRHRASDAGNSRFVISAPAG 109 Query: 127 SS--ASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 SS A+ A D R++I+ G SSI+ Y A G D +R+SY +CG Sbjct: 110 SSNEAAAMDAASDTRRLILGGGYADSSIANEAYHAS-GRDA-PLRISYLRYVAEGPECGR 167 Query: 185 -WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 W E+ L A N + N+GC+ Q NLAA V NP DL PR +TP DA +R K ++Y Sbjct: 168 DWSEN-LARAYQNTPYPNFGCSSQRNLAAMVSNPADLLGPRTMTPSDANRRFKMYEKY 224 >gi|296444394|ref|ZP_06886359.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylosinus trichosporium OB3b] gi|296258041|gb|EFH05103.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylosinus trichosporium OB3b] Length = 245 Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 61/185 (32%), Positives = 96/185 (51%), Gaps = 18/185 (9%) Query: 67 DYRDRYPILMRKVEQIVDI-PLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 DYRDR+P+++ +D+ P + D I+ F+++Y+ + LL P+ + Sbjct: 48 DYRDRHPVVLADATTAIDVFP----EQRLDQATVDRIQSFVQRYRRLGHGQITLLAPTGS 103 Query: 126 VSSASIRRAVKDIRKIIISSGIP--VSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 ++A+ R V +R+ + SG+ V + + DAD V RLS+ K A +C Sbjct: 104 RNTAT-RAGVDAVRRQLADSGVAGAVYVGTYPVSDADLAAPV---RLSFQGIKAKVADRC 159 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WPED L +A + W N +GCA Q LAAQ+ +P DL SPR TP D E R ++ Sbjct: 160 GQWPED-LASASSLKGWNNDTHWNFGCANQATLAAQIDDPRDLASPRGETPADIESRMRA 218 Query: 238 IQRYR 242 + + R Sbjct: 219 LNKVR 223 >gi|218516618|ref|ZP_03513458.1| pilus assembly protein [Rhizobium etli 8C-3] Length = 102 Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 39/75 (52%), Positives = 53/75 (70%), Gaps = 1/75 (1%) Query: 168 IRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVT 227 IRLS+ + +CG WP+D + N N+N+ N+GCA QNNLAAQV NP DL +PR +T Sbjct: 13 IRLSFTGTTAITTQCGQWPKD-ISNDFANQNYYNFGCATQNNLAAQVANPEDLVAPRGMT 71 Query: 228 PPDAEQRDKSIQRYR 242 P DA++R+ +IQ YR Sbjct: 72 PIDAQRRNNAIQEYR 86 >gi|218508708|ref|ZP_03506586.1| pilus assembly protein [Rhizobium etli Brasil 5] Length = 150 Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 50/113 (44%), Positives = 65/113 (57%), Gaps = 12/113 (10%) Query: 130 SIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDM 189 S RR RKII +S Y A D IRLS+ + +CG WP+D Sbjct: 34 SWRREGSRARKIINTS-----------YAAAGAGDAAPIRLSFTGTTAITTQCGQWPKD- 81 Query: 190 LGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 + N N+N+ N+GCA QNNLAAQV NP DL +PR +TP DA++R+ +IQ YR Sbjct: 82 ISNDFANQNYYNFGCATQNNLAAQVANPEDLVAPRGMTPIDAQRRNNAIQEYR 134 >gi|218662338|ref|ZP_03518268.1| pilus assembly protein [Rhizobium etli IE4771] Length = 79 Score = 81.6 bits (200), Expect = 8e-14, Method: Composition-based stats. Identities = 35/62 (56%), Positives = 47/62 (75%), Gaps = 1/62 (1%) Query: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 +CG WP+D + N N+N+ N+GCA QNNLAAQV NP DL +PR +TP DA++R+ +IQ Sbjct: 3 QCGQWPKD-ISNDFANQNYYNFGCASQNNLAAQVANPEDLVAPRGMTPIDAQRRNNAIQE 61 Query: 241 YR 242 YR Sbjct: 62 YR 63 >gi|154250690|ref|YP_001411514.1| pilus biogenesis lipoprotein CpaD [Parvibaculum lavamentivorans DS-1] gi|154154640|gb|ABS61857.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Parvibaculum lavamentivorans DS-1] Length = 234 Score = 79.0 bits (193), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 56/208 (26%), Positives = 100/208 (48%), Gaps = 23/208 (11%) Query: 48 FFLLLFYGTSALAYYD--EGSDYRDRY--PILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 F L L +G + ++ E + Y Y PI + ++I ++ G+ + + I Sbjct: 16 FALALAFGVAGCGGFNGAEQAHYDANYTHPISVEADVATLNIDVVPGQPGVTSTDRNAIA 75 Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 GF Y+ L + PS + ++ + A+ D+R+++ G+ + +S Y A Sbjct: 76 GFAAGYRQRGHGPLTISTPSGSPNAGAAAVALSDVREVLSEHGVGGNDLSYTPYRASGTD 135 Query: 164 DVDTIRLSY--FASKPSAGKCGFW-------PEDMLGNAKGNRNWTNYGCAYQNNLAAQV 214 + + LS+ + +KP+A CG W P + L N+GC+ QNNLAA V Sbjct: 136 NSAPLILSFKRYVAKPTA--CGDWSGSYSYDPSNGL--------LPNHGCSTQNNLAAMV 185 Query: 215 VNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +P DL +PR ++P DA +R +++YR Sbjct: 186 ADPGDLIAPRNMSPADAARRGTVLEKYR 213 >gi|188581661|ref|YP_001925106.1| pilus biogenesis lipoprotein CpaD [Methylobacterium populi BJ001] gi|179345159|gb|ACB80571.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium populi BJ001] Length = 247 Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 58/186 (31%), Positives = 88/186 (47%), Gaps = 12/186 (6%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---S 123 D R R+PI++ ++ +D+ G G I I FL +Y+ +L + +P S Sbjct: 42 DVRTRHPIVLADADRSLDV-FPTGIGHIDPRQRADIEAFLVEYRRYGRGILVVELPRGVS 100 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ S+ R +IR++ G+P + I Y +RLS+ + A KC Sbjct: 101 PGLA-GSVERTGAEIRRLAAEMGVPAAGIRVANYPVANPTLASPLRLSFQRMQAKVADKC 159 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP D LG + NW+N GCA Q NLAAQV +P+DL R D R K Sbjct: 160 GLWPRD-LGVSDLRANWSNEPTWNLGCATQANLAAQVADPIDLVRGRPEGRIDTVLRTKD 218 Query: 238 IQRYRQ 243 + + R+ Sbjct: 219 LGQLRE 224 >gi|163851904|ref|YP_001639947.1| pilus biogenesis lipoprotein CpaD [Methylobacterium extorquens PA1] gi|163663509|gb|ABY30876.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium extorquens PA1] Length = 248 Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 87/186 (46%), Gaps = 12/186 (6%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---S 123 D R R+PI++ ++ +D+ G G I I FL +Y+ +L + +P S Sbjct: 43 DVRTRHPIVLADADRTLDV-FPTGIGHIDPRQRADIEAFLGEYRRYGRGILLVELPRGVS 101 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ + R IR++ G+P + + Y +RLS+ + A KC Sbjct: 102 PALA-GPVERTGASIRRLAAEMGVPAAGVRVAAYPIANPTLASPLRLSFQRMQAKVADKC 160 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP D LG + NW+N GCA Q+N+AAQV +P+DL R D R K Sbjct: 161 GLWPRD-LGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDTVLRTKD 219 Query: 238 IQRYRQ 243 + + R+ Sbjct: 220 LGQLRE 225 >gi|218530655|ref|YP_002421471.1| pilus biogenesis lipoprotein CpaD [Methylobacterium chloromethanicum CM4] gi|218522958|gb|ACK83543.1| pilus biogenesis lipoprotein CpaD [Methylobacterium chloromethanicum CM4] Length = 248 Score = 75.5 bits (184), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 87/186 (46%), Gaps = 12/186 (6%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---S 123 D R R+PI++ ++ +D+ G G I I FL +Y+ +L + +P S Sbjct: 43 DVRTRHPIVLADADRTLDV-FPTGIGHIDPRQRADIEAFLVEYRRYGRGILLVELPRGVS 101 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ + R IR++ G+P + + Y +RLS+ + A KC Sbjct: 102 PALA-GPVERTGASIRRLATEMGVPAAGVRVAAYPIANPTLASPLRLSFQRMQAKVADKC 160 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP D LG + NW+N GCA Q+N+AAQV +P+DL R D R K Sbjct: 161 GLWPRD-LGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDTVLRTKD 219 Query: 238 IQRYRQ 243 + + R+ Sbjct: 220 LGQLRE 225 >gi|254561622|ref|YP_003068717.1| pilus assembly protein cpaD [Methylobacterium extorquens DM4] gi|254268900|emb|CAX24861.1| pilus assembly protein cpaD [Methylobacterium extorquens DM4] Length = 248 Score = 75.5 bits (184), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 87/186 (46%), Gaps = 12/186 (6%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---S 123 D R R+PI++ ++ +D+ G G I I FL +Y+ +L + +P S Sbjct: 43 DVRTRHPIVLADADRTLDV-FPTGIGHIDPRQRADIEAFLVEYRRYGRGILLVELPRGVS 101 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ + R IR++ G+P + + Y +RLS+ + A KC Sbjct: 102 PALA-GPVERTGASIRRLAAEMGVPAAGVRVAAYPIANPTLASPLRLSFQRMQAKVADKC 160 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP D LG + NW+N GCA Q+N+AAQV +P+DL R D R K Sbjct: 161 GLWPRD-LGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDTVLRTKD 219 Query: 238 IQRYRQ 243 + + R+ Sbjct: 220 LGQLRE 225 >gi|240139027|ref|YP_002963502.1| pilus assembly protein cpaD [Methylobacterium extorquens AM1] gi|240008999|gb|ACS40225.1| pilus assembly protein cpaD [Methylobacterium extorquens AM1] Length = 248 Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 87/186 (46%), Gaps = 12/186 (6%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---S 123 D R R+PI++ ++ +D+ G G I I FL +Y+ +L + +P S Sbjct: 43 DVRTRHPIVLADADRTLDV-FPTGIGHIDPRQRADIEAFLVEYRRYGRGILLVELPRGVS 101 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ + R IR++ G+P + + Y +RLS+ + A KC Sbjct: 102 PALA-GPVERTGASIRRLAAEMGVPAAGVRVAAYPIANLTLASPLRLSFQRMQAKVADKC 160 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP D LG + NW+N GCA Q+N+AAQV +P+DL R D R K Sbjct: 161 GLWPRD-LGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDTVLRTKD 219 Query: 238 IQRYRQ 243 + + R+ Sbjct: 220 LGQLRE 225 >gi|167648151|ref|YP_001685814.1| pilus biogenesis lipoprotein CpaD [Caulobacter sp. K31] gi|167350581|gb|ABZ73316.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Caulobacter sp. K31] Length = 234 Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 36/106 (33%), Positives = 59/106 (55%), Gaps = 2/106 (1%) Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNR 197 +R+ +I+ G P +S+ YDA G+ +++ + CG W + + + N+ Sbjct: 110 VRQRLIAMGAPPASVRVVGYDAGAGLAAAPLKVGFLRYHAQVPTCGGW--ENIAATRDNK 167 Query: 198 NWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 + N+GCA N+AAQV NP DL SPR TP D+ +RD + +YR+ Sbjct: 168 PYDNFGCAVTANMAAQVANPEDLLSPRATTPVDSARRDTVLGKYRK 213 >gi|220922780|ref|YP_002498082.1| pilus biogenesis lipoprotein CpaD [Methylobacterium nodulans ORS 2060] gi|219947387|gb|ACL57779.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium nodulans ORS 2060] Length = 246 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 45/178 (25%), Positives = 82/178 (46%), Gaps = 7/178 (3%) Query: 71 RYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP-SPTVSSA 129 R+PI++ + +D+ + G G + D I FL +Y+ VL + +P V Sbjct: 47 RHPIVLADAPRSLDV-FVTGIGHVDPRQSDDIDAFLLEYRRYGRGVLVIEVPRGAQVPGP 105 Query: 130 SIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKCGFWPED 188 ++ R +R+ + G+P + Y +R+S+ + +G CG WP+D Sbjct: 106 AVARTAALLRERAVGRGVPARELVVAPYAVVNPAVAAPVRMSFQRMQARVSGACGLWPQD 165 Query: 189 MLGNAKG----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 + + G N ++ NYGC+ + N A+Q+ +P+DL R D R + I+ R Sbjct: 166 LGVSEPGFELRNESFWNYGCSTRANFASQIADPVDLVRGRQEGRIDTVSRTQDIESLR 223 >gi|295690797|ref|YP_003594490.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Caulobacter segnis ATCC 21756] gi|295432700|gb|ADG11872.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Caulobacter segnis ATCC 21756] Length = 229 Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 6/108 (5%) Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTI-RLSYFASKPSAGKCG-FWPEDMLGNAKG 195 +R+ +I G P + + RI AD + + + R+ + + KCG W + L + Sbjct: 105 VRERLIFLGAPAAHV--RIVGADPSLPPEPVLRVGFVRYEAEVPKCGQAW--ESLTATRD 160 Query: 196 NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 N+ + N+GCA N+AAQV NP DL PR +TP DA +RD + +YR+ Sbjct: 161 NKAYENFGCAVAANMAAQVANPEDLVRPRDMTPADAGRRDTVMGKYRR 208 >gi|23011548|ref|ZP_00051876.1| hypothetical protein Magn03006165 [Magnetospirillum magnetotacticum MS-1] Length = 248 Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 50/186 (26%), Positives = 85/186 (45%), Gaps = 12/186 (6%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP---S 123 D R R+PI++ ++ +D+ G G + + FL +Y+ +L + +P S Sbjct: 43 DVRTRHPIVLADADRTLDV-FPTGVGHLDPRQRADLEAFLVEYRRYGRGLLLVEMPRGVS 101 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ + R IR++ G+P Y +RLS+ + A +C Sbjct: 102 PALA-GPVERTGAAIRRLAAEMGVPAGGFRIGDYPIANPALAAPLRLSFQRMQAKVADQC 160 Query: 183 GFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP D LG + +W+N +GCA + N AAQV +P+DL R D +R + Sbjct: 161 GLWPRD-LGASDLRADWSNEPTWNFGCATRANFAAQVADPVDLVRGRPEGRIDTIRRTQD 219 Query: 238 IQRYRQ 243 I + R+ Sbjct: 220 IGQLRE 225 >gi|83859358|ref|ZP_00952879.1| pilus assembly protein CpaD [Oceanicaulis alexandrii HTCC2633] gi|83852805|gb|EAP90658.1| pilus assembly protein CpaD [Oceanicaulis alexandrii HTCC2633] Length = 218 Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 2/144 (1%) Query: 100 DTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDA 159 D I +YK L + P ++ + A+ + R + +G+ IS Y+A Sbjct: 55 DVIEAVAAEYKARGHGPLVISYPQNAGNADAAIGAIAEARTRLYEAGLDWRQISGGAYEA 114 Query: 160 DYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLD 219 G + S+ + A +C DM + + N++W +GCA NNLA V +P D Sbjct: 115 G-GQASAPVIFSFTRYQAVAPECSTAWNDM-AHMRANQDWPRFGCATANNLANMVADPRD 172 Query: 220 LFSPRMVTPPDAEQRDKSIQRYRQ 243 L +PR V PD+ +R + RYRQ Sbjct: 173 LVAPRGVDAPDSARRQTVLDRYRQ 196 >gi|170750190|ref|YP_001756450.1| pilus biogenesis lipoprotein CpaD [Methylobacterium radiotolerans JCM 2831] gi|170656712|gb|ACB25767.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium radiotolerans JCM 2831] Length = 247 Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 50/185 (27%), Positives = 83/185 (44%), Gaps = 16/185 (8%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPS-- 123 +DYR R+PI++ + +D+ G G + + F+ +Y+ L + +P Sbjct: 42 TDYRARHPIVLTDGTRSLDV-FPTGTGHLDPRQATDVDAFMLEYRRYGRGSLLMQVPQGV 100 Query: 124 PTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGKC 182 P ++ R + ++ +G+ I+ Y IRLS+ + A C Sbjct: 101 PADQVVAVERTASVLGRLGTQNGVNGREIAVTGYAVAAPTLASPIRLSFQRMQAKVADAC 160 Query: 183 GFWPEDMLGNAK-----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 G WP+D LG + NR N GCA Q+N+AAQV +P+DL R E R + Sbjct: 161 GLWPQD-LGTSNFAIDYNNRPSWNLGCATQSNVAAQVADPVDLVRGR------PEGRIDT 213 Query: 238 IQRYR 242 ++R R Sbjct: 214 VKRVR 218 >gi|218670939|ref|ZP_03520610.1| pilus assembly protein [Rhizobium etli GR56] Length = 170 Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 37/117 (31%), Positives = 59/117 (50%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR R+PI++ + EQ VDIP+ + + D IRGF Y + ++ +++L P + Sbjct: 54 DYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLIRGFAANYISRASGPVYVLSPEGSP 113 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCG 183 +SA+ + +R + S GI S I Y A D IRLS+ + +CG Sbjct: 114 NSAAAHQLRNHVRAELASRGIASSKIINTSYAAAGAGDAAPIRLSFTGTTAVTTQCG 170 >gi|170740620|ref|YP_001769275.1| pilus biogenesis lipoprotein CpaD [Methylobacterium sp. 4-46] gi|168194894|gb|ACA16841.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium sp. 4-46] Length = 249 Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 46/186 (24%), Positives = 80/186 (43%), Gaps = 9/186 (4%) Query: 64 EGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP- 122 E + + R+PI + + +D+ + G G + D + FL +++ VL + +P Sbjct: 43 EPASVQARHPIALADAPRNLDV-FVTGMGHVDPRQADDVDAFLLEFRRYGRGVLVIEVPR 101 Query: 123 SPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS-AGK 181 ++ R +R+ ++ G+ + Y +RLS+ + Sbjct: 102 GGQAPGPAVARTAALLRERALARGVSARELVVAPYPVANASVAAPVRLSFQRMQAKVTST 161 Query: 182 CGFWPEDMLGNAK-----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDK 236 CG WP D LG + NR N+GCAYQ+N A Q+ +P+DL R D R + Sbjct: 162 CGLWPND-LGVSDPAVDVSNRTHWNHGCAYQSNFARQIADPVDLVRGRQEGRIDTISRTQ 220 Query: 237 SIQRYR 242 I+ R Sbjct: 221 DIESLR 226 >gi|254420868|ref|ZP_05034592.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas sp. BAL3] gi|196187045|gb|EDX82021.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas sp. BAL3] Length = 205 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 44/160 (27%), Positives = 69/160 (43%), Gaps = 35/160 (21%) Query: 99 HDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYD 158 H +R +++ A V+ + P+ S A+ + D R + Sbjct: 44 HAALRDLAQRFAAAGAGVIVIEAPAGEDSVAA--KTAFDTRAAL---------------- 85 Query: 159 ADYGMDVDTIRL-SYFASKPSAG-------------KCGF-WPEDMLGNAKGNRNWTNYG 203 A G+D + +R+ SY P A +CG W L N + +N+G Sbjct: 86 AQIGLDPNRLRVVSYAGPDPRAPVLVGFETVQAAVPRCGAAW--GNLSRTGDNMSGSNFG 143 Query: 204 CAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 CA NLAAQ+ +P D+ +PR +TPP+A +R RYRQ Sbjct: 144 CAVTANLAAQIADPRDIAAPRALTPPEAGRRSVVFDRYRQ 183 >gi|16127174|ref|NP_421738.1| pilus assembly protein CpaD [Caulobacter crescentus CB15] gi|221235975|ref|YP_002518412.1| pilus assembly protein CpaD [Caulobacter crescentus NA1000] gi|7208426|gb|AAF40193.1|AF229646_5 CpaD [Caulobacter crescentus CB15] gi|13424570|gb|AAK24906.1| pilus assembly protein CpaD [Caulobacter crescentus CB15] gi|220965148|gb|ACL96504.1| pilus assembly protein CpaD [Caulobacter crescentus NA1000] Length = 225 Score = 55.5 bits (132), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 28/64 (43%), Positives = 38/64 (59%), Gaps = 3/64 (4%) Query: 181 KCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 KCG W + L + N + N+GCA N+AAQV NP DL PR +TP D +RD + Sbjct: 143 KCGQRW--ENLAATRDNTVYDNFGCAMAANIAAQVANPEDLMRPRDMTPADTGRRDTVLG 200 Query: 240 RYRQ 243 +YR+ Sbjct: 201 KYRR 204 >gi|329847254|ref|ZP_08262282.1| pilus Caulobacter type biogenesis lipoprotein CpaD family protein [Asticcacaulis biprosthecum C19] gi|328842317|gb|EGF91886.1| pilus Caulobacter type biogenesis lipoprotein CpaD family protein [Asticcacaulis biprosthecum C19] Length = 219 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 4/74 (5%) Query: 170 LSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPP 229 +SY AS P G+ W + L + N N+GCA NLAAQV +P DL P TP Sbjct: 129 VSYRASVPGCGQT--W--ENLAATRKNTPHANFGCAITANLAAQVADPRDLVDPATATPS 184 Query: 230 DAEQRDKSIQRYRQ 243 DA ++ + +YR+ Sbjct: 185 DAGRKSVVLDKYRR 198 >gi|302381755|ref|YP_003817578.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas subvibrioides ATCC 15264] gi|302192383|gb|ADK99954.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas subvibrioides ATCC 15264] Length = 222 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 45/146 (30%), Positives = 62/146 (42%), Gaps = 32/146 (21%) Query: 128 SASIRRAVKDIRKIIISSGIPV-------------SSISERIYDADYGMDVDTIRL---S 171 SA+ RA++DI + G PV S ++ RI A V + ++ + Sbjct: 57 SANQTRALEDIAGRFYAEGAPVLRIEAPSGNDPVASEMAWRIKGALEASGVSSYQVQVVT 116 Query: 172 YFASKPSAG-------------KCGF-WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNP 217 Y A P A +CG W L N + N+GCA NLAAQ+ NP Sbjct: 117 YVAPDPRAPVLVGFDTVRAVVPQCGTGWTN--LTRTGSNAGYGNFGCAVNANLAAQIANP 174 Query: 218 LDLFSPRMVTPPDAEQRDKSIQRYRQ 243 D+ PR +TP DA +R YRQ Sbjct: 175 RDIVQPRTMTPVDAGRRAVVFDNYRQ 200 >gi|114568971|ref|YP_755651.1| pilus biogenesis lipoprotein CpaD [Maricaulis maris MCS10] gi|114339433|gb|ABI64713.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Maricaulis maris MCS10] Length = 227 Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 7/139 (5%) Query: 108 KYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDT 167 +YK L + P + + A+ + R GI I+ YDA + + Sbjct: 70 EYKARGHGPLVISYPQGAGNEDAAIGAIAEARSFFYEQGIDWRVIAGGAYDARGRQNGEL 129 Query: 168 IR--LSYFASKPSAGKC-GFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPR 224 I Y A P+ +C G W D + N++ TN+GCA NLAA V +P DL +PR Sbjct: 130 IFSFTRYEAVAPA--ECDGSW--DQMALEFDNQHHTNFGCALAVNLAAMVADPRDLVAPR 185 Query: 225 MVTPPDAEQRDKSIQRYRQ 243 + D +R I+ YR+ Sbjct: 186 DMEAGDTGRRQTVIEGYRE 204 >gi|315497464|ref|YP_004086268.1| pilus (caulobacter type) biogenesis lipoprotein cpad [Asticcacaulis excentricus CB 48] gi|315415476|gb|ADU12117.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Asticcacaulis excentricus CB 48] Length = 228 Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust. Identities = 34/119 (28%), Positives = 54/119 (45%), Gaps = 4/119 (3%) Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ RA IR + + V ++S+ + G D + L + C Sbjct: 93 TANTPDALRAGAAIRAYLNDHQVSVHAVSQTTAE---GQPADVVSLITREYRAVVNDCNL 149 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 E+ L + N N GCA NLAAQ+ +P D+ +P+ TP DA +R I +YR+ Sbjct: 150 EWEN-LAATRHNAAPQNLGCAINANLAAQIDDPRDIAAPQPATPGDAGRRTVIIDKYRK 207 >gi|304320648|ref|YP_003854291.1| hypothetical protein PB2503_05377 [Parvularcula bermudensis HTCC2503] gi|303299550|gb|ADM09149.1| hypothetical protein PB2503_05377 [Parvularcula bermudensis HTCC2503] Length = 212 Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust. Identities = 36/169 (21%), Positives = 68/169 (40%), Gaps = 4/169 (2%) Query: 70 DRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSA 129 +R+PI + + + IP+ + R + + F+ Y+ + + P+ T Sbjct: 32 ERHPITVDQQAVTLTIPIDSTRSGLSRGDLQQLDRFVSAYRTKGYGPITVTAPAGTGRDL 91 Query: 130 SIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDM 189 +R + SG+ + I + +V + Y A P +CG + + Sbjct: 92 EANETAAAVRAALNDSGVAYADIQGASVTSSTAKEVMVSFVRYVAQGP---QCGVFDNER 148 Query: 190 LGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSI 238 + N + N+GC+ Q+NLAA + +P DL + T D K I Sbjct: 149 AARFR-NLSHPNFGCSSQHNLAAMIADPRDLTRAQSTTTRDGHLASKPI 196 >gi|254293215|ref|YP_003059238.1| type IV pili component-like protein [Hirschia baltica ATCC 49814] gi|254041746|gb|ACT58541.1| Type IV pili component-like protein [Hirschia baltica ATCC 49814] Length = 218 Score = 42.0 bits (97), Expect = 0.067, Method: Compositional matrix adjust. Identities = 32/138 (23%), Positives = 60/138 (43%), Gaps = 4/138 (2%) Query: 105 FLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMD 164 F+E + +D + L P S RA+ IR I+ +G+ +I+ Y G + Sbjct: 67 FVENFADDGYGPIVLSAPD---GSREAVRAITSIRSILSRAGVLPDNITVGGYQPAAG-N 122 Query: 165 VDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPR 224 + L+Y + + C E + + N + ++GCA N+A + P DL R Sbjct: 123 AAPLVLAYKSYQAHVPGCSTVNEHDWTDLRSNSSVGSFGCAVNENIAMMIAKPGDLLGER 182 Query: 225 MVTPPDAEQRDKSIQRYR 242 + D+ ++ ++YR Sbjct: 183 KIGDGDSSRQLTVYEKYR 200 >gi|103486616|ref|YP_616177.1| hypothetical protein Sala_1128 [Sphingopyxis alaskensis RB2256] gi|98976693|gb|ABF52844.1| hypothetical protein Sala_1128 [Sphingopyxis alaskensis RB2256] Length = 213 Score = 40.8 bits (94), Expect = 0.14, Method: Compositional matrix adjust. Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 10/116 (8%) Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWP 186 S+ + RA+ + R +++S +PV++ + + D IR+ + S C W Sbjct: 88 SAQATVRAMVERRGLLLSKDVPVTTGA--VPDGH-------IRVVVTRASASVPGCPDWN 138 Query: 187 EDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 N+ N +NYGCA +NLAA V +P DL T D ++IQ YR Sbjct: 139 SKSSLNSL-NATSSNYGCATNSNLAAMVADPNDLIKGTRDTGHDPVAATRAIQTYR 193 >gi|149186256|ref|ZP_01864570.1| hypothetical protein ED21_31004 [Erythrobacter sp. SD-21] gi|148830287|gb|EDL48724.1| hypothetical protein ED21_31004 [Erythrobacter sp. SD-21] Length = 217 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 14/120 (11%) Query: 127 SSASIRRAVKDIRK---IIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCG 183 +S ++R V +I I+++ G PV++ + + R+ S S C Sbjct: 88 NSLAVRDDVAEIASRYGILVAEGAPVTAGN---------LGPGQARVVITRSTASVPGCP 138 Query: 184 FWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPD-AEQRDKSIQRYR 242 W + N GN NYGCA +NLA+ V NP DL + T K+I+ YR Sbjct: 139 DWSHTVEAN-DGNATNPNYGCATYSNLASMVANPEDLVQGQQGTGETIVTTSTKAIEAYR 197 >gi|84386796|ref|ZP_00989821.1| putative lipoprotein [Vibrio splendidus 12B01] gi|84378324|gb|EAP95182.1| putative lipoprotein [Vibrio splendidus 12B01] Length = 207 Score = 37.0 bits (84), Expect = 2.1, Method: Compositional matrix adjust. Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 7/103 (6%) Query: 136 KDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG 195 + +R +I SG+ S IS A D+ TI + + +K +A G P L NA Sbjct: 83 EKVRLRLIESGLYPSQISVSDTAAQGKGDI-TIFVESYRAKVTACDAGKTPRTTL-NAY- 139 Query: 196 NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSI 238 R N+GCA N LA V NP DL + P D+ Q K++ Sbjct: 140 -RTQRNFGCANANALAQMVANPKDLI---VGQPIDSAQGQKAV 178 >gi|114798148|ref|YP_761690.1| pilus assembly protein CpaD [Hyphomonas neptunium ATCC 15444] gi|114738322|gb|ABI76447.1| pilus assembly protein CpaD [Hyphomonas neptunium ATCC 15444] Length = 236 Score = 37.0 bits (84), Expect = 2.3, Method: Compositional matrix adjust. Identities = 45/187 (24%), Positives = 78/187 (41%), Gaps = 2/187 (1%) Query: 56 TSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSAS 115 T+ A Y E S D PI + K + +++ + A GE+ I+ F+ Y Sbjct: 30 TAVPAAYLETSPL-DLNPIKVEKRTEFLEVSIDAYAGELSSSDRARIQDFMRGYVRRGHG 88 Query: 116 VLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFAS 175 L L +P + + AV + R I G+ IS + + + I L+Y + Sbjct: 89 PLVLSMPQVSSNPQLAVAAVAEARAIAWDMGVEYQEISGTAHGSGSSVSEPMI-LAYQSY 147 Query: 176 KPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRD 235 + A C N N GC+ + NLAA +++P DL R + D +R+ Sbjct: 148 EAIAPNCPPKSTVDFSNIDSNNQMETLGCSVRTNLAAMIIDPADLLGNRPLDRSDLARRE 207 Query: 236 KSIQRYR 242 ++++R Sbjct: 208 VILEKFR 214 >gi|285018613|ref|YP_003376324.1| hypothetical protein XALc_1843 [Xanthomonas albilineans GPE PC73] gi|283473831|emb|CBA16333.1| hypothetical protein XALc_1843 [Xanthomonas albilineans] Length = 147 Score = 37.0 bits (84), Expect = 2.5, Method: Compositional matrix adjust. Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 8/98 (8%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVL--------F 118 D+R + + R ++IPL + P+ T++ LE++ DS L Sbjct: 29 DFRGSWKPVNRFSASTMEIPLYSSYVYQAVPVDGTLKTMLERWSKDSNMELSYGIQSDYT 88 Query: 119 LLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERI 156 L P +++ SI++AV ++ I S G+ VS+ RI Sbjct: 89 LYAPVAKINTVSIQQAVAELSVIYESEGVTVSAAGNRI 126 >gi|148557760|ref|YP_001265342.1| hypothetical protein Swit_4867 [Sphingomonas wittichii RW1] gi|148502950|gb|ABQ71204.1| hypothetical protein Swit_4867 [Sphingomonas wittichii RW1] Length = 252 Score = 36.6 bits (83), Expect = 3.4, Method: Compositional matrix adjust. Identities = 17/44 (38%), Positives = 25/44 (56%) Query: 200 TNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 +NYGCA + LAA V NP DL + +A+ K+I+ +R Sbjct: 191 SNYGCAINSTLAAMVANPEDLVKGQAARGSNADTATKAIRIWRN 234 >gi|296284553|ref|ZP_06862551.1| hypothetical protein CbatJ_13041 [Citromicrobium bathyomarinum JL354] Length = 214 Score = 36.2 bits (82), Expect = 3.6, Method: Compositional matrix adjust. Identities = 19/40 (47%), Positives = 21/40 (52%), Gaps = 1/40 (2%) Query: 182 CGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLF 221 C W +D GN N NYGCA +N AA V NP DL Sbjct: 135 CPNWTDDGDGNFA-NATSRNYGCATNSNYAAMVANPEDLV 173 >gi|150398913|ref|YP_001322680.1| succinyl-CoA synthetase subunit alpha [Methanococcus vannielii SB] gi|150011616|gb|ABR54068.1| succinyl-CoA synthetase, alpha subunit [Methanococcus vannielii SB] Length = 287 Score = 36.2 bits (82), Expect = 4.1, Method: Compositional matrix adjust. Identities = 32/107 (29%), Positives = 54/107 (50%), Gaps = 6/107 (5%) Query: 86 PLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKI-IIS 144 P AG+ P++DT+ +EKY + +ASV+F IP+P V A+ I + II+ Sbjct: 40 PGKAGQDVYGIPVYDTVLETVEKY-DVNASVIF--IPAPFVKDAAYEAIDAGIELVTIIT 96 Query: 145 SGIPVSSISERI-YDADYGMDVDTIRLSYFASKPSAGKCGFWPEDML 190 +P+ + + Y +G+++ AS P GK G P ++L Sbjct: 97 EHVPIQDSMDIVSYGKKHGVNIIGPNTPGLAS-PKVGKLGIIPMNIL 142 >gi|251789647|ref|YP_003004368.1| pilus biogenesis lipoprotein CpaD [Dickeya zeae Ech1591] gi|247538268|gb|ACT06889.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Dickeya zeae Ech1591] Length = 223 Score = 36.2 bits (82), Expect = 4.2, Method: Compositional matrix adjust. Identities = 18/41 (43%), Positives = 22/41 (53%) Query: 203 GCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 GCA Q+NLA V P DL R + D SI+RY+Q Sbjct: 169 GCANQSNLAQMVAEPRDLIQARSLDAADGVNMVNSIERYQQ 209 >gi|148976306|ref|ZP_01813030.1| putative lipoprotein [Vibrionales bacterium SWAT-3] gi|145964400|gb|EDK29655.1| putative lipoprotein [Vibrionales bacterium SWAT-3] Length = 195 Score = 36.2 bits (82), Expect = 4.2, Method: Compositional matrix adjust. Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 10/110 (9%) Query: 135 VKDIRKIIISSGIPVSSISERIYDADYGMDVD---TIRLSYFASKPSAGKCGFWPEDMLG 191 ++ +R +I SG+ S +I+ AD + TI + + +K +A G P + Sbjct: 70 IEKVRLHLIESGLYPS----QIWVADEATEGKGDITILVESYRAKVTACDAGKTPRTTVN 125 Query: 192 NAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 + RN +GCA N LA V NP DL + ++ ++ SI+ Y Sbjct: 126 AYRTQRN---FGCANANALAQMVANPKDLIVGQPISGTQGQKAVSSIENY 172 >gi|85373130|ref|YP_457192.1| hypothetical protein ELI_01515 [Erythrobacter litoralis HTCC2594] gi|84786213|gb|ABC62395.1| hypothetical protein ELI_01515 [Erythrobacter litoralis HTCC2594] Length = 188 Score = 36.2 bits (82), Expect = 4.4, Method: Compositional matrix adjust. Identities = 42/165 (25%), Positives = 66/165 (40%), Gaps = 35/165 (21%) Query: 68 YRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYK---NDSASVLFLLIPSP 124 Y + P++ R +D+ G G + P + G+ E D S+ ++ Sbjct: 8 YSTKQPVVER-TNYTLDV--RTGPGGLSIPEQQRLSGWFEAMNLRYGDRVSIEDPML--- 61 Query: 125 TVSSASIRRAVKDI---RKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGK 181 S S + A+ + I++S G PV+S Y ++ + R+ S S Sbjct: 62 ---SGSTKDAISQLAGRHGILVSDGAPVTS--------GY-VEPGSARVVITRSSASVPG 109 Query: 182 CGFWPEDMLGNAKGNRNWTN-----YGCAYQNNLAAQVVNPLDLF 221 C W + K N+TN YGCA NLAA V +P DL Sbjct: 110 CPDW------SVKSEMNYTNGTHPGYGCAINGNLAAMVADPEDLV 148 >gi|227326303|ref|ZP_03830327.1| putative lipoprotein [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 228 Score = 35.0 bits (79), Expect = 8.3, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 21/41 (51%) Query: 203 GCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 GCA QNNLA V P DL + + D SI+RY + Sbjct: 174 GCATQNNLAMMVAEPRDLIQAKALDSADGVAAVNSIERYHK 214 >gi|227114877|ref|ZP_03828533.1| putative lipoprotein [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 228 Score = 35.0 bits (79), Expect = 8.9, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 21/41 (51%) Query: 203 GCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 GCA QNNLA V P DL + + D SI+RY + Sbjct: 174 GCATQNNLAMMVAEPRDLIQAKALDSADGVAAVNSIERYHK 214 >gi|261820215|ref|YP_003258321.1| pilus biogenesis lipoprotein CpaD [Pectobacterium wasabiae WPP163] gi|261604228|gb|ACX86714.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Pectobacterium wasabiae WPP163] Length = 228 Score = 35.0 bits (79), Expect = 9.2, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 21/41 (51%) Query: 203 GCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 GCA QNNLA V P DL + + D SI+RY + Sbjct: 174 GCATQNNLAMMVAEPRDLIQAKALDNADGVAAVNSIERYHK 214 Searching..................................................done Results from round 2 >gi|255764485|ref|YP_003065139.2| hypothetical protein CLIBASIA_03065 [Candidatus Liberibacter asiaticus str. psy62] gi|254547836|gb|ACT57199.2| hypothetical protein CLIBASIA_03065 [Candidatus Liberibacter asiaticus str. psy62] Length = 243 Score = 343 bits (881), Expect = 9e-93, Method: Composition-based stats. Identities = 243/243 (100%), Positives = 243/243 (100%) Query: 1 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA 60 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA Sbjct: 1 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA 60 Query: 61 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL Sbjct: 61 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 Query: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG 180 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG Sbjct: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG 180 Query: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR Sbjct: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 Query: 241 YRQ 243 YRQ Sbjct: 241 YRQ 243 >gi|315121890|ref|YP_004062379.1| hypothetical protein CKC_00700 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495292|gb|ADR51891.1| hypothetical protein CKC_00700 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 224 Score = 283 bits (725), Expect = 1e-74, Method: Composition-based stats. Identities = 141/243 (58%), Positives = 188/243 (77%), Gaps = 23/243 (9%) Query: 1 MMVEYMITILFGGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALA 60 MM + MI +FGG+CFK L + FFL Q+ FLLLF G + LA Sbjct: 1 MMRDIMIKEIFGGMCFKRLVS---------------FFLM-----QISFLLLFCGNNVLA 40 Query: 61 YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 +DYRDRYPI+M+KVE+ +DIPLL+GRG++ ++DTI+GF+++YK +S SV+F+L Sbjct: 41 ---NQNDYRDRYPIVMKKVEKSLDIPLLSGRGKLPSDMYDTIKGFIDRYKQNSTSVIFIL 97 Query: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAG 180 IP+PT+SS +I+ A+K+IR+ IIS+GIP SS+SER YDADY +D+DTIRLSYFAS+PSAG Sbjct: 98 IPTPTISSHAIQDALKNIRRFIISNGIPSSSLSERSYDADYELDIDTIRLSYFASRPSAG 157 Query: 181 KCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQR 240 KCGFWPED+LG++ N NW+NYGC+YQNNLAAQ+VNP+DLF+PR +TPPDA RD+SI R Sbjct: 158 KCGFWPEDILGSSLENSNWSNYGCSYQNNLAAQIVNPMDLFAPRSMTPPDAVHRDRSIHR 217 Query: 241 YRQ 243 Y++ Sbjct: 218 YQE 220 >gi|90425198|ref|YP_533568.1| pilus assembly protein CpaD [Rhodopseudomonas palustris BisB18] gi|90107212|gb|ABD89249.1| pilus assembly protein CpaD [Rhodopseudomonas palustris BisB18] Length = 246 Score = 278 bits (710), Expect = 6e-73, Method: Composition-based stats. Identities = 62/217 (28%), Positives = 108/217 (49%), Gaps = 11/217 (5%) Query: 38 FLRTLMLGQLFF--LLLFYGTSALAYYDEGS----DYRDRYPILMRKVEQIVDIPLLAGR 91 LR L LG L+ G + + GS DYR R+PI +++ +Q + I GR Sbjct: 9 HLRRLRLGGALVAVCLVLGGCNHTSDEVTGSIVPDDYRQRHPIAIQEADQTLIIFAGTGR 68 Query: 92 GEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSS 151 G + + + + + + + +P+ T ++ + ++++IR ++ + G+P ++ Sbjct: 69 GGLTGAQRADVASLAQTWLREGTGPIVIDLPTHTPNARAAADSLREIRALLAAQGLPPNA 128 Query: 152 ISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAY 206 ++ R Y IR++Y +AG CG WP+D+ ++K N+ + N GCA Sbjct: 129 VTVRDYQPRDTRQFAAIRVNYPRLTATAGPCGLWPDDLGSSSKNHDYFENKPYWNLGCAS 188 Query: 207 QNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 Q NLAA V NP DL PR TP +R S ++YR+ Sbjct: 189 QRNLAAMVDNPADLVQPRGETPAYTARRTNSFEKYRK 225 >gi|114705455|ref|ZP_01438363.1| pilus assembly protein [Fulvimarina pelagi HTCC2506] gi|114540240|gb|EAU43360.1| pilus assembly protein [Fulvimarina pelagi HTCC2506] Length = 230 Score = 262 bits (669), Expect = 3e-68, Method: Composition-based stats. Identities = 61/209 (29%), Positives = 107/209 (51%), Gaps = 5/209 (2%) Query: 38 FLRTLMLGQLFFLLLFYGTSALAYYDE----GSDYRDRYPILMRKVEQIVDIPLLAGRGE 93 +R L L + L G ++ E DYR R+PI++ + ++ +DIP++ + Sbjct: 5 IIRPLATASLVGIALALGACGNVHHIEVGSVPDDYRTRHPIVVSEADEAIDIPVVTSDQK 64 Query: 94 IKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 + I F +++ A + +L+P + ++ + + + ++ +G+ I Sbjct: 65 LAMSDSGRIEDFAHRFRRSGADTMTVLVPYGSRNAVAASSISHEAIRTLMKAGVRREQIV 124 Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQ 213 + Y A + IRL++ G CG WPED L + N+N+ N+GCA Q NLAAQ Sbjct: 125 MQSYAAHDALGPTPIRLTFSTLVAQTGPCGRWPED-LNSTHENKNYANFGCATQQNLAAQ 183 Query: 214 VVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 + +P DL SPR + P D E+RD+ I++YR Sbjct: 184 IADPRDLLSPRGMGPVDGERRDQVIEKYR 212 >gi|86748906|ref|YP_485402.1| Type IV pili component-like [Rhodopseudomonas palustris HaA2] gi|86571934|gb|ABD06491.1| Type IV pili component-like [Rhodopseudomonas palustris HaA2] Length = 242 Score = 261 bits (667), Expect = 6e-68, Method: Composition-based stats. Identities = 60/216 (27%), Positives = 100/216 (46%), Gaps = 12/216 (5%) Query: 39 LRTLMLGQLFF--LLLFYGTSALAYYDE-----GSDYRDRYPILMRKVEQIVDIPLLAGR 91 +R L G L + + DE +DYR R+PI +++ ++ V + + GR Sbjct: 10 IRGLTCGAALVAVSLSMGACTHTSANDEVTASVPNDYRQRHPIAIQEADRSVVVFVGNGR 69 Query: 92 GEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSS 151 G + + F +++ + + +PS T ++ + +++I+ ++ S G+P Sbjct: 70 GGLTATQRADVAAFGKEWLREGTGSIIAEVPSGTPNARAASDTMREIQSLLTSGGVPARG 129 Query: 152 ISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAY 206 + + Y IRL Y AG CG WPED+ + K N+ + N+GCA Sbjct: 130 VIVKPYQPADPRSFAAIRLLYPKVAAVAGPCGLWPEDIGPSIKNKGYLDNKPYWNFGCAN 189 Query: 207 QNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 Q NLAA V NP DL PR TP +R + + YR Sbjct: 190 QRNLAAMVENPSDLVQPRPETPAYTARRGVTFETYR 225 >gi|75674498|ref|YP_316919.1| pilus assembly protein CpaD [Nitrobacter winogradskyi Nb-255] gi|74419368|gb|ABA03567.1| pilus assembly protein CpaD [Nitrobacter winogradskyi Nb-255] Length = 246 Score = 259 bits (661), Expect = 3e-67, Method: Composition-based stats. Identities = 54/184 (29%), Positives = 95/184 (51%), Gaps = 5/184 (2%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI +++ ++ V+I + RG + P + G + + + +P+ Sbjct: 42 PDDYRLRHPIAIQEADRTVNIFIGNTRGGLTAPQRADVVGLASVWLREGTGAIVAEVPTG 101 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ + + +++R ++ ++G+P I R Y TIRL+Y AG CG Sbjct: 102 TGNARAAADSFREVRSLLAAAGVPPRGIIVRHYHPADPRLFATIRLTYPRIAAVAGPCGV 161 Query: 185 WPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 WP+D+ + K N+ + N+GCA Q NLA+ + NP DL PR TP +R +S + Sbjct: 162 WPDDIGPSVKNRGYLDNKPYWNFGCATQRNLASMIDNPSDLVQPRPETPAYTARRTQSFE 221 Query: 240 RYRQ 243 +YR+ Sbjct: 222 KYRK 225 >gi|91977986|ref|YP_570645.1| Type IV pili component-like [Rhodopseudomonas palustris BisB5] gi|91684442|gb|ABE40744.1| Type IV pili component-like [Rhodopseudomonas palustris BisB5] Length = 243 Score = 257 bits (656), Expect = 1e-66, Method: Composition-based stats. Identities = 61/229 (26%), Positives = 104/229 (45%), Gaps = 8/229 (3%) Query: 19 LANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRK 78 + +M+ + +F +L LG G +DYR R+PI +++ Sbjct: 1 MTSMKPARATRGLVFGAALTCVSLSLG---ACTHTRGAGDDVTASISTDYRQRHPIAIQE 57 Query: 79 VEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDI 138 + V + + GRG + + F + + + + +P+ T ++ + +++I Sbjct: 58 GDHTVVVFVGNGRGGLTTTQRADVAAFGQGWLREGTGSIIAEVPADTPNARAAGETLREI 117 Query: 139 RKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG--- 195 + ++ + G+P + + Y IRL Y AG CG WPED+ + K Sbjct: 118 QSLLAAGGVPQRGVIVKPYRPTDPRAFAAIRLIYPKVSAVAGPCGLWPEDIGPSIKNKGY 177 Query: 196 --NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 N+ + N+GCAYQ NLAA V NP DL PR TPP A +R + + YR Sbjct: 178 LDNKPYWNFGCAYQRNLAAMVENPSDLVQPRPETPPYAARRATTFEAYR 226 >gi|150398542|ref|YP_001329009.1| pilus biogenesis lipoprotein CpaD [Sinorhizobium medicae WSM419] gi|150030057|gb|ABR62174.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium medicae WSM419] Length = 242 Score = 256 bits (655), Expect = 2e-66, Method: Composition-based stats. Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 3/199 (1%) Query: 47 LFFLLLFYGTSA---LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 L G + LA DYR R+PI++ + E+ +DIP+ +G + D IR Sbjct: 30 AIAFALLAGCANKDRLATGALPDDYRTRHPIVLTEGERTIDIPVASGDTRLTQGTRDVIR 89 Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 GF +Y+N S+ V+ +++P +V+ + + K+IR+++ SG+ I E YDA Sbjct: 90 GFAAEYRNASSGVVQIMLPRGSVNGRAAQILRKEIRRLLAGSGVSPKKIIETSYDASVTG 149 Query: 164 DVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 D IRLSY A CG WPED+ N NRN+ N+GCA Q+NLAAQ+ NP DL P Sbjct: 150 DAAPIRLSYVAITAQTAPCGAWPEDLALNTLENRNYYNFGCATQSNLAAQIANPTDLVGP 209 Query: 224 RMVTPPDAEQRDKSIQRYR 242 R ++P DAEQR + I +R Sbjct: 210 RRMSPIDAEQRGQVIDSWR 228 >gi|39936741|ref|NP_949017.1| pilus assembly protein cpaD [Rhodopseudomonas palustris CGA009] gi|192292567|ref|YP_001993172.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris TIE-1] gi|39650597|emb|CAE29120.1| possible pilus assembly protein cpaD [Rhodopseudomonas palustris CGA009] gi|192286316|gb|ACF02697.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Rhodopseudomonas palustris TIE-1] Length = 242 Score = 256 bits (655), Expect = 2e-66, Method: Composition-based stats. Identities = 63/215 (29%), Positives = 101/215 (46%), Gaps = 12/215 (5%) Query: 40 RTLMLGQLF--FLLLFYGTSALAYYDE-----GSDYRDRYPILMRKVEQIVDIPLLAGRG 92 R L LG F L + + E +DYR R+PI +R+ ++ V++ + GRG Sbjct: 11 RGLGLGAALIGFSLSLGACTHTSREVEVTQSIPTDYRQRHPIAIREADRTVEVFVGNGRG 70 Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + + + + + + +PS T ++ + +++I+ ++ ++G+P + Sbjct: 71 GLTPVQRAEVAELGQTWLREGTGAIIAEVPSDTPNARAASDTIREIQSVLAANGVPARGV 130 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAYQ 207 + + Y TIRL Y AG CG WPED+ + K NR + N GCA Q Sbjct: 131 TVKHYRPADPRTFATIRLIYPKVTAVAGPCGLWPEDLGPSIKNKGYYDNRPYWNLGCANQ 190 Query: 208 NNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NLAA V NP DL PR TPP +R + YR Sbjct: 191 RNLAAMVENPSDLVQPRPETPPYTARRAVTYDTYR 225 >gi|85713506|ref|ZP_01044496.1| pilus assembly protein CpaD [Nitrobacter sp. Nb-311A] gi|85699410|gb|EAQ37277.1| pilus assembly protein CpaD [Nitrobacter sp. Nb-311A] Length = 245 Score = 256 bits (655), Expect = 2e-66, Method: Composition-based stats. Identities = 54/184 (29%), Positives = 96/184 (52%), Gaps = 5/184 (2%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 +DYR R+PI +++ ++ V+I + RG + + G + + + +P+ Sbjct: 41 PNDYRLRHPIAIQEADRTVNIFVGNTRGGLTAAQRADVIGLASVWLREGTGAIIAEVPAE 100 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ + + K++R ++ ++G+P I R Y+ TIRL+Y AG CG Sbjct: 101 TRNARAAASSFKEVRSLLTAAGVPPRGIIVRHYNPADPRLFATIRLTYPRIAAVAGPCGV 160 Query: 185 WPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 WP+D+ + K N+ + N+GCA Q NLA+ + NP DL PR TPP +R + + Sbjct: 161 WPDDLGPSIKNRGYLDNKPYWNFGCATQRNLASMIDNPSDLVQPRPETPPYTARRTEGFE 220 Query: 240 RYRQ 243 +YR+ Sbjct: 221 KYRK 224 >gi|92116012|ref|YP_575741.1| pilus assembly protein CpaD [Nitrobacter hamburgensis X14] gi|91798906|gb|ABE61281.1| pilus assembly protein CpaD [Nitrobacter hamburgensis X14] Length = 246 Score = 256 bits (654), Expect = 2e-66, Method: Composition-based stats. Identities = 56/184 (30%), Positives = 92/184 (50%), Gaps = 5/184 (2%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 +DYR R+PI +++ + V+I + RG + + G + + + P Sbjct: 42 PNDYRLRHPIAIQEANRTVNIFVGNTRGGLSASQRADVVGLASVWLREGTGAIVAEAPMG 101 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ + +++++R ++ ++G+P I R Y TIRL+Y AG CG Sbjct: 102 TSNARAAADSLREVRSLLTAAGVPPRGIIVRHYHPADPRLFATIRLTYPQIAAVAGPCGV 161 Query: 185 WPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 WPED+ + K N+ + N+GCA Q NLAA V NP DL PR TP +R ++ Sbjct: 162 WPEDLGPSIKNKGYLDNKPYWNFGCASQRNLAAMVDNPSDLVQPRPETPTYTARRTYALD 221 Query: 240 RYRQ 243 +YRQ Sbjct: 222 KYRQ 225 >gi|15963895|ref|NP_384248.1| hypothetical protein SMc04110 [Sinorhizobium meliloti 1021] gi|307315792|ref|ZP_07595306.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti BL225C] gi|15073070|emb|CAC41529.1| Pilus assembly protein cpaD [Sinorhizobium meliloti 1021] gi|306898560|gb|EFN29233.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti BL225C] Length = 226 Score = 255 bits (651), Expect = 4e-66, Method: Composition-based stats. Identities = 78/199 (39%), Positives = 115/199 (57%), Gaps = 3/199 (1%) Query: 47 LFFLLLFYGTSA---LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 L G ++ LA DYR R+PI++ + E+ +DIP+ +G + D IR Sbjct: 14 AIVAALLAGCASKDKLATGALPDDYRTRHPIVLTEGERTIDIPIASGDTRLTQGTRDVIR 73 Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 GF +Y+N S+SV+ +++P +V+ + + KDIR+++ +SG+ + E YDA Sbjct: 74 GFAAEYRNASSSVIQIMLPRGSVNGHAAQIVRKDIRRLLAASGVSPKKMIETTYDASVTG 133 Query: 164 DVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 D IRLSY A CG WPED+ N NRN+ N+GCA Q+NLAAQ+ NP DL P Sbjct: 134 DAAPIRLSYVAITAQTAPCGAWPEDLALNTLENRNYYNFGCATQSNLAAQIANPTDLVGP 193 Query: 224 RMVTPPDAEQRDKSIQRYR 242 R ++P DAEQR + I +R Sbjct: 194 RQMSPIDAEQRGQVIDSWR 212 >gi|209551760|ref|YP_002283677.1| pilus biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537516|gb|ACI57451.1| pilus biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 244 Score = 254 bits (649), Expect = 7e-66, Method: Composition-based stats. Identities = 73/207 (35%), Positives = 109/207 (52%), Gaps = 5/207 (2%) Query: 40 RTLMLGQLFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 + L L + G + L DYR R+PI++ + EQ VDIP+ + + Sbjct: 23 KGLFAAVAMVLAVLSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVASTDRRLT 82 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 D IRGF Y ++ +++L P + +SA+ + +R + S GI S I Sbjct: 83 NAQRDLIRGFAANYIARASGPVYVLSPQGSPNSAAAYQLRNQVRAELASRGIASSKIVNT 142 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNLAAQ+ Sbjct: 143 SYAAVGPGDAAPIRLSFTGTTAITTQCGQWPKDI-SNDFTNQNYYNFGCASQNNLAAQIA 201 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA++R+ +IQ YR Sbjct: 202 NPEDLVAPRGMTPIDAQRRNNAIQEYR 228 >gi|299132287|ref|ZP_07025482.1| pilus biogenesis lipoprotein CpaD [Afipia sp. 1NLS2] gi|298592424|gb|EFI52624.1| pilus biogenesis lipoprotein CpaD [Afipia sp. 1NLS2] Length = 246 Score = 254 bits (648), Expect = 9e-66, Method: Composition-based stats. Identities = 59/212 (27%), Positives = 104/212 (49%), Gaps = 10/212 (4%) Query: 41 TLMLGQLFFLLLFYGT-----SALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 + L + + G + + DYR R+PI++++ + +I + RG + Sbjct: 13 AMAALALVSVTMLGGCMHSQEAGIVTGSLPDDYRLRHPIVIQEASKTTEIFVGHARGGLT 72 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 I G + + ++ + + +P+ T ++ + V+DI+ I+ ++GIP I Sbjct: 73 TAQRADIVGLSQAWLSEGTGAITIDVPTGTPNAQAASVTVRDIQNILAAAGIPPKGIRVM 132 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAYQNNL 210 Y + +R+ Y AG CG WPED+ + K NR++ N+GCAYQ N+ Sbjct: 133 PYHPNDPRQFAPVRVRYARIIADAGPCGLWPEDLGPSVKNKSYFENRSYQNFGCAYQRNM 192 Query: 211 AAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 AA V NP DL PR TPP++ +R ++ +YR Sbjct: 193 AAMVANPADLVQPRAETPPNSARRTEAFAKYR 224 >gi|159184217|ref|NP_353253.2| components of type IV pilus [Agrobacterium tumefaciens str. C58] gi|159139546|gb|AAK86038.2| components of type IV pilus [Agrobacterium tumefaciens str. C58] Length = 193 Score = 254 bits (648), Expect = 9e-66, Method: Composition-based stats. Identities = 73/178 (41%), Positives = 103/178 (57%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI + + E +DIP+ AG + + D +RGF + Y + S ++ + +PS Sbjct: 7 PDDYRTRHPITLSEAEHSLDIPVSAGDSRLTTAMADNVRGFAQNYASMSTGIVNIQMPSG 66 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 + +SA+ R K IR + +G+ I E Y A D IRLSY A G+CG Sbjct: 67 SPNSATAARMAKQIRSTLSGAGVAQGKIMETRYAASPNGDSAPIRLSYVAVTAMTGQCGQ 126 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 WPED+ N N+NW N+GCA Q+NLAAQ+ NP+DL PR ++P DAE+R I YR Sbjct: 127 WPEDLSDNTFANKNWYNFGCASQSNLAAQIANPMDLVGPRGMSPIDAERRAVVIDTYR 184 >gi|307320427|ref|ZP_07599844.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti AK83] gi|306893993|gb|EFN24762.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Sinorhizobium meliloti AK83] Length = 226 Score = 253 bits (647), Expect = 1e-65, Method: Composition-based stats. Identities = 78/199 (39%), Positives = 115/199 (57%), Gaps = 3/199 (1%) Query: 47 LFFLLLFYGTSA---LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 L G ++ LA DYR R+PI++ + E+ +DIP+ +G + D IR Sbjct: 14 AIVAALLAGCASKDKLATGALPDDYRTRHPIVLTEGERTIDIPIASGDTRLTQGTRDVIR 73 Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 GF +Y+N S+SV+ +++P +V+ + + KDIR+++ +SG+ + E YDA Sbjct: 74 GFAAEYRNASSSVIQIMLPRGSVNGHAAQIVRKDIRRLLAASGVSPKKMIETTYDASVTG 133 Query: 164 DVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 D IRLSY A CG WPED+ N NRN+ N+GCA Q+NLAAQ+ NP DL P Sbjct: 134 DAAPIRLSYVAITAQTAPCGAWPEDLALNTLVNRNYYNFGCATQSNLAAQIANPTDLVGP 193 Query: 224 RMVTPPDAEQRDKSIQRYR 242 R ++P DAEQR + I +R Sbjct: 194 RQMSPIDAEQRGQVIDSWR 212 >gi|209886531|ref|YP_002290388.1| pilus assembly protein CpaD [Oligotropha carboxidovorans OM5] gi|209874727|gb|ACI94523.1| pilus assembly protein CpaD [Oligotropha carboxidovorans OM5] Length = 247 Score = 253 bits (646), Expect = 2e-65, Method: Composition-based stats. Identities = 57/213 (26%), Positives = 102/213 (47%), Gaps = 11/213 (5%) Query: 41 TLMLGQLFFLLLFYGTSALAYYDE------GSDYRDRYPILMRKVEQIVDIPLLAGRGEI 94 L L G + + E DYR R+PI++++ + +I + RG + Sbjct: 13 ALAALTLIGAAALGGCTHRSQEAEIVTGSLPVDYRARHPIVIQEAAKTTEIFVGHARGGL 72 Query: 95 KYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISE 154 I G + + ++ + + P+ T ++ + V+DI+ +++++GIP + Sbjct: 73 TTAQRTDIAGLAQAWLSEGTGAITIDTPTGTPNAQAASVTVRDIQNMLVAAGIPARGVKV 132 Query: 155 RIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAYQNN 209 + Y + +R+ Y AG CG WPED+ + K NR + N+GCAYQ N Sbjct: 133 QPYHPNDPRQFAPVRVRYARIIADAGPCGLWPEDLGPSVKNKSYFENRPYQNFGCAYQRN 192 Query: 210 LAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +AA V NP DL PR +P ++ +R ++ +YR Sbjct: 193 MAAMVANPADLVQPRAESPSNSARRSQAFTKYR 225 >gi|118589704|ref|ZP_01547109.1| components of type IV pilus [Stappia aggregata IAM 12614] gi|118437790|gb|EAV44426.1| components of type IV pilus [Stappia aggregata IAM 12614] Length = 238 Score = 252 bits (643), Expect = 4e-65, Method: Composition-based stats. Identities = 63/201 (31%), Positives = 99/201 (49%), Gaps = 4/201 (1%) Query: 47 LFFLLLFYGTSALAYYDEG----SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTI 102 +F L+ G + DYR +PI++ + + +D+P+ I PI TI Sbjct: 19 VFGCLIAAGCQNQSQSTSQMLASHDYRLMHPIVITEEPETLDLPVGRNTRNINGPIESTI 78 Query: 103 RGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 F ++ + + +L+PS + A++ IR+ + G+ + IS R Y Sbjct: 79 AAFGQQSRQKGNGSVEILVPSGGANEAAVHSITPKIRQALQQGGVSRNRISTRTYSVGDP 138 Query: 163 MDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFS 222 IRLSY + +AG+CG WP ++ G N N+ N+GCA Q NLAA V NP DL + Sbjct: 139 GADAPIRLSYARMQATAGECGAWPRNIGGGFGENINYENFGCASQANLAAMVDNPSDLIT 198 Query: 223 PRMVTPPDAEQRDKSIQRYRQ 243 PR P D +R I++YR+ Sbjct: 199 PRASAPSDQGRRAVVIEKYRK 219 >gi|146338122|ref|YP_001203170.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. ORS278] gi|146190928|emb|CAL74933.1| Putative pilus assembly protein cpaD; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 246 Score = 252 bits (643), Expect = 4e-65, Method: Composition-based stats. Identities = 53/184 (28%), Positives = 92/184 (50%), Gaps = 5/184 (2%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI +++ + + + GRG + + G + + + +PS Sbjct: 42 PDDYRLRHPIAVQEAPDSLVVFVGQGRGGLTAEQRAEVMGLAQSWMRQGTGAIVADVPSG 101 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ + ++++I+ + ++G+P ++ R Y + IRLSY +AG CG Sbjct: 102 TPNARAAADSMREIQSLFSAAGVPPHGVTVRNYQPKDPRQMAAIRLSYPKLSATAGPCGL 161 Query: 185 WPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 WP+D+ + K N+ N+GCAYQ N+AA V NP DL PR TP +R + Sbjct: 162 WPDDLGPSVKNKNWFDNKPDWNFGCAYQRNMAAMVDNPADLVQPRPETPSYTTRRTALFE 221 Query: 240 RYRQ 243 +YR+ Sbjct: 222 KYRK 225 >gi|222147181|ref|YP_002548138.1| component of type IV pilus [Agrobacterium vitis S4] gi|221734171|gb|ACM35134.1| component of type IV pilus [Agrobacterium vitis S4] Length = 196 Score = 252 bits (643), Expect = 4e-65, Method: Composition-based stats. Identities = 71/178 (39%), Positives = 103/178 (57%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI++ VE +D+P+ G + + D + GF + Y+N S + +L+P Sbjct: 7 PDDYRTRHPIIVTDVEHSLDLPVAQGSSRLTIGMSDAVTGFAQDYRNASTGYVQILVPQG 66 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 + ++ + + +R +++S GI I ER Y A D IRLSY A+ AG CG Sbjct: 67 SPNTMAASSIARQVRNLLVSKGIAAPKIVERPYRAGATGDAAPIRLSYVATTAVAGPCGQ 126 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 WPED+ + N+NW N+GCA Q NLAAQV +P DL +PR +TP DAE+R I YR Sbjct: 127 WPEDLSNDTAQNKNWQNFGCASQANLAAQVASPTDLIAPRGMTPIDAERRSTVIDNYR 184 >gi|163757628|ref|ZP_02164717.1| components of type IV pilus [Hoeflea phototrophica DFL-43] gi|162285130|gb|EDQ35412.1| components of type IV pilus [Hoeflea phototrophica DFL-43] Length = 233 Score = 251 bits (640), Expect = 9e-65, Method: Composition-based stats. Identities = 65/210 (30%), Positives = 110/210 (52%), Gaps = 5/210 (2%) Query: 39 LRTLMLGQLFFLLLFYGTSA-----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGE 93 +R L + L + G + + DYR +PI++ + E+ VDIP+ G E Sbjct: 16 VRYLAISMLAAASVLTGCAGWGGKSVVVGAVPDDYRTNHPIIVAEQERTVDIPVGTGDRE 75 Query: 94 IKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 + + + +RG Y++ ++ + +++P + ++ + + KI+ G+P I Sbjct: 76 LTTSMREIVRGAAHSYRSSASGAVRIMVPVGSANAGAASILSGQVAKILQKEGVPRDRIL 135 Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQ 213 Y D IR++Y A S KCG WPED+ + N++W N+GCA Q+NLAAQ Sbjct: 136 SSPYSVSSPDDAAPIRIAYLAITASTEKCGRWPEDLAADTTENKHWANFGCASQSNLAAQ 195 Query: 214 VVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 + NP DL +PR ++P DAE+R I+ YR+ Sbjct: 196 IANPGDLIAPRGMSPIDAERRSTIIETYRE 225 >gi|148258236|ref|YP_001242821.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. BTAi1] gi|146410409|gb|ABQ38915.1| Putative pilus assembly protein cpaD [Bradyrhizobium sp. BTAi1] Length = 244 Score = 249 bits (637), Expect = 2e-64, Method: Composition-based stats. Identities = 51/184 (27%), Positives = 89/184 (48%), Gaps = 5/184 (2%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI +++ + + + GRG + + + + + +P+ Sbjct: 40 PDDYRLRHPIAVQEAPDSLVVFVGQGRGGLTAEQRAEVMALAQSWLRQGTGAISADVPTG 99 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ + ++++I+ + ++G+P ++ R Y + IRLSY +AG CG Sbjct: 100 TPNARAAGDSMREIQSLFAAAGVPPHGLTVRNYQPKDPRQMAAIRLSYPKMSATAGPCGV 159 Query: 185 WPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 WP+D+ K N+ N+GCAYQ N+AA V NP DL PR TP +R Sbjct: 160 WPDDLGPTIKNKNWFENKPDWNFGCAYQRNMAAMVDNPADLVQPRAETPSYTTRRTALFD 219 Query: 240 RYRQ 243 +YR+ Sbjct: 220 KYRK 223 >gi|227823972|ref|YP_002827945.1| pilus assembly protein CpaD [Sinorhizobium fredii NGR234] gi|227342974|gb|ACP27192.1| pilus assembly protein CpaD [Sinorhizobium fredii NGR234] Length = 234 Score = 249 bits (636), Expect = 2e-64, Method: Composition-based stats. Identities = 74/184 (40%), Positives = 110/184 (59%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLF 118 LA DYR R+PI++ + E+++DIP+ +G + D IRGF +Y+N S V+ Sbjct: 45 LATGSIPDDYRTRHPIVIAEGERVIDIPVASGDRRLTAGTRDVIRGFATEYRNASGGVIQ 104 Query: 119 LLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS 178 +++P + +S + + KDIR+++ +SG+P + E Y+A D IRLSY A Sbjct: 105 IMLPRGSANSHAAQIVRKDIRRLLAASGVPPKRMIETGYEAVSPGDAAPIRLSYVAITAQ 164 Query: 179 AGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSI 238 CG WPED+ N NRN+ N+GCA Q+NLAAQ+ NP DL PR ++P DA QR + I Sbjct: 165 TAPCGEWPEDLTLNTLQNRNYYNFGCASQSNLAAQIANPTDLIGPRQMSPVDAAQRGEVI 224 Query: 239 QRYR 242 +R Sbjct: 225 DAWR 228 >gi|148256958|ref|YP_001241543.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. BTAi1] gi|146409131|gb|ABQ37637.1| Putative pilus assembly protein cpaD [Bradyrhizobium sp. BTAi1] Length = 247 Score = 249 bits (635), Expect = 4e-64, Method: Composition-based stats. Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 9/210 (4%) Query: 43 MLGQLFFLLLFYG-TSALAYYDE---GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPI 98 G + L L G +A DY+ R+PI + + Q + + + +GRG + Y Sbjct: 17 RCGAIIGLALTLGACNATTGEVVATIPDDYKIRHPIAVEEGRQSIVVFVGSGRGGLTYQQ 76 Query: 99 HDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYD 158 + G ++ + + +P+ T ++ + ++I ++ S G+P SI+ R Y Sbjct: 77 RADVAGLARSWQREGTGAIVAEVPADTPNARAAADTYREIHAMLTSGGVPSRSITLRHYT 136 Query: 159 ADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAK-----GNRNWTNYGCAYQNNLAAQ 213 D + +RLSY AG CG WP+D+ N N+++ N+GCA Q NLAA Sbjct: 137 PDDPRLLAAVRLSYPKIAAVAGPCGLWPDDLGPNIDNPSYSNNQHYHNFGCATQRNLAAM 196 Query: 214 VVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 + NP DL PR +R ++YR+ Sbjct: 197 IDNPADLEQPRAEVAAYTPRRSALFEKYRK 226 >gi|316933038|ref|YP_004108020.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris DX-1] gi|315600752|gb|ADU43287.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Rhodopseudomonas palustris DX-1] Length = 242 Score = 249 bits (635), Expect = 4e-64, Method: Composition-based stats. Identities = 62/216 (28%), Positives = 100/216 (46%), Gaps = 12/216 (5%) Query: 39 LRTLMLGQLF--FLLLFYGTSALAYYDE-----GSDYRDRYPILMRKVEQIVDIPLLAGR 91 +R L LG F L + E +DYR R+PI +R+ ++ V++ + GR Sbjct: 10 MRGLGLGAALIGFSLSLGACTHTKREVEVTQSIPTDYRQRHPIAIREADRTVEVFVGNGR 69 Query: 92 GEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSS 151 G + + + + + + +PS T ++ + +++I+ ++ ++G+P Sbjct: 70 GGLTALQRAEVAELGQAWLREGTGAIIAEVPSDTPNARAASDTIREIQSVLSANGVPPRG 129 Query: 152 ISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAY 206 ++ + Y TIRL Y AG CG WPED+ + K NR + N GCA Sbjct: 130 VTVKHYRPADPRTFATIRLIYPKITAVAGPCGLWPEDIGPSIKNKGYYDNRPYWNLGCAN 189 Query: 207 QNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 Q NLAA V NP DL PR TP +R + YR Sbjct: 190 QRNLAAMVENPADLVQPRPETPAYTARRMVTNDVYR 225 >gi|116249982|ref|YP_765820.1| pilus assembly protein [Rhizobium leguminosarum bv. viciae 3841] gi|115254630|emb|CAK05704.1| putative pilus assembly protein [Rhizobium leguminosarum bv. viciae 3841] Length = 250 Score = 248 bits (633), Expect = 6e-64, Method: Composition-based stats. Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 5/207 (2%) Query: 40 RTLMLGQLFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 + L + + G + L DYR R+PI++ + EQ VDIP+ + + Sbjct: 31 KALFATVAMSVAILSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVASTDRRLT 90 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 D IRGF Y + ++ +++L P + +SA+ + +R + S GI S I Sbjct: 91 IAQRDLIRGFAANYISRASGPVYVLSPQGSPNSAAAYQLRNQVRAELTSRGIASSKIVNT 150 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNLAAQ+ Sbjct: 151 SYAAVGPGDAAPIRLSFTGTTAVTTQCGQWPKDI-SNDLTNQNYYNFGCASQNNLAAQIA 209 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA++R+ +IQ YR Sbjct: 210 NPEDLVAPRGMTPIDAQRRNNAIQEYR 236 >gi|254503403|ref|ZP_05115554.1| pilus biogenesis lipoprotein CpaD [Labrenzia alexandrii DFL-11] gi|222439474|gb|EEE46153.1| pilus biogenesis lipoprotein CpaD [Labrenzia alexandrii DFL-11] Length = 221 Score = 248 bits (632), Expect = 8e-64, Method: Composition-based stats. Identities = 61/200 (30%), Positives = 104/200 (52%), Gaps = 4/200 (2%) Query: 47 LFFLLLFYGTS----ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTI 102 + L+ G + A DYR ++PI++ + + +D+P+ ++ P+ DTI Sbjct: 3 VLAALMLSGCQTEPKSNAELLATHDYRYQHPIVVSEAPETLDLPVGKNTRNLRSPVTDTI 62 Query: 103 RGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 F + + + +L+P+ + +++ V DIR + G+ ++ R Y + Sbjct: 63 TSFAMDSRRHGSGNVEILVPTGAANESAVHAVVHDIRGALSRGGVNGKHVTTRTYRSTDS 122 Query: 163 MDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFS 222 IRLSY K + G+CG WP+++ G N N+ N+GCA Q+NLAA V NP DL + Sbjct: 123 SADAPIRLSYARMKATTGECGAWPKNIGGGIGENTNYYNFGCATQSNLAAIVENPSDLIT 182 Query: 223 PRMVTPPDAEQRDKSIQRYR 242 PR +TP D +R I++YR Sbjct: 183 PRAMTPSDQNRRAVVIEKYR 202 >gi|304392383|ref|ZP_07374324.1| pilus assembly protein [Ahrensia sp. R2A130] gi|303295487|gb|EFL89846.1| pilus assembly protein [Ahrensia sp. R2A130] Length = 227 Score = 246 bits (627), Expect = 3e-63, Method: Composition-based stats. Identities = 66/178 (37%), Positives = 104/178 (58%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 S+Y+ R+PI++ + EQ +DIP+ + + GF KY+ + + ++IP + Sbjct: 35 SNYKTRHPIVIDEKEQTLDIPVGSDTVRLPRAQESATEGFASKYRRSPSGTMTIMIPRHS 94 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 ++++ R + +I+ G+P SSI YDA IR+SY A + S +CG W Sbjct: 95 PNASAARSMSHQVAEILRREGVPPSSIVTTSYDASRHGSAAPIRVSYHAVQASVERCGKW 154 Query: 186 PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 PED+ G N+NW N+GCA QNN+AAQ+ NP DL +PR +T DAE+R+ I+ YR+ Sbjct: 155 PEDLAGPNLDNQNWHNFGCANQNNMAAQIANPSDLVAPRGMTQADAERRNNVIEDYRE 212 >gi|307943146|ref|ZP_07658491.1| pilus biogenesis lipoprotein CpaD [Roseibium sp. TrichSKD4] gi|307773942|gb|EFO33158.1| pilus biogenesis lipoprotein CpaD [Roseibium sp. TrichSKD4] Length = 237 Score = 245 bits (625), Expect = 4e-63, Method: Composition-based stats. Identities = 64/208 (30%), Positives = 104/208 (50%), Gaps = 5/208 (2%) Query: 40 RTLMLGQLFFLLLFYGTSALAYYDEGSD-----YRDRYPILMRKVEQIVDIPLLAGRGEI 94 RT + F LL G ++ S+ Y+ R+PI++ + +++D+P+ A + Sbjct: 10 RTAIGATFFVSLLLAGCNSQTVGQNNSNLAATNYQLRHPIVVTEQPEVLDLPIGAHMRNL 69 Query: 95 KYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISE 154 P+ T++ F + + +L+PS + A++ V IR + + G+ +IS Sbjct: 70 NGPLRGTVKAFGADSRKKGNGRVEILVPSGGRNEAAVHALVPQIRSSLKAGGLSGGAIST 129 Query: 155 RIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQV 214 R Y D IRLSY + +AG CG W D+ N ++ NYGCA Q+NLAA V Sbjct: 130 RSYAVDNPSADAPIRLSYPRIQATAGPCGTWNGDIGRTFDRNVDYENYGCATQSNLAAMV 189 Query: 215 VNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR P D +R +++YR Sbjct: 190 ENPSDLLTPRASAPADRMRRANVVEKYR 217 >gi|110636322|ref|YP_676530.1| pilus biogenesis lipoprotein CpaD [Mesorhizobium sp. BNC1] gi|110287306|gb|ABG65365.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Chelativorans sp. BNC1] Length = 226 Score = 244 bits (624), Expect = 6e-63, Method: Composition-based stats. Identities = 65/185 (35%), Positives = 104/185 (56%), Gaps = 2/185 (1%) Query: 58 ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVL 117 ++ DYR +PI++ + EQ++D+P+ + ++ GF+E Y +V+ Sbjct: 28 SITVGALPDDYRTNHPIVLSEKEQVLDLPVGVFSYRMTPQQKMSLEGFMEHYGESGKAVV 87 Query: 118 FLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 +L+PS + + + R +DI + + G+P + YDA +R+SY Sbjct: 88 TVLVPSGSPNERAASRLSEDIAQFLYRRGVPKGHLQVLSYDA-PAEQASPVRVSYSVVAA 146 Query: 178 SAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 + G+CG WPED+L N+++ N+GCAYQNNLAAQ+ NP+DL PR TP DAE RD + Sbjct: 147 TTGQCGRWPEDLLDTT-ENKHYANFGCAYQNNLAAQIANPMDLLGPRKTTPIDAENRDTA 205 Query: 238 IQRYR 242 I RY+ Sbjct: 206 IGRYK 210 >gi|27375776|ref|NP_767305.1| pilus assembly protein [Bradyrhizobium japonicum USDA 110] gi|27348914|dbj|BAC45930.1| pilV [Bradyrhizobium japonicum USDA 110] Length = 244 Score = 244 bits (622), Expect = 1e-62, Method: Composition-based stats. Identities = 61/209 (29%), Positives = 99/209 (47%), Gaps = 6/209 (2%) Query: 41 TLMLGQLFFLLLFYGTSALAYYDE-GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIH 99 L+L L +L T+ +DYR R+PI +++ ++ + I + RG + Sbjct: 15 ALVLTGLSVMLGACNTTGEIVTQTVPTDYRQRHPIAVQEAKKSIVIFVGKARGGLSAAQQ 74 Query: 100 DTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDA 159 + G + + + + +P + +S + +IR ++ S G+P +I + Y Sbjct: 75 SDVAGTARDWVREGTGSVVVDVPIGSANSRAAATTYHEIRSVLASGGVPSRAIVQHPYRP 134 Query: 160 DYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNA-----KGNRNWTNYGCAYQNNLAAQV 214 + + TIRLSY AG CG WPED+ + N+ + N GCA Q NLAA + Sbjct: 135 EDPGLLPTIRLSYSRIAAVAGPCGLWPEDVGPSILDPGYNENQPYFNLGCASQRNLAAMI 194 Query: 215 VNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 NP DL PR TP +RD + RYR+ Sbjct: 195 DNPADLEQPRAETPVYTARRDIAFDRYRK 223 >gi|27376549|ref|NP_768078.1| pilus assembly protein [Bradyrhizobium japonicum USDA 110] gi|27349690|dbj|BAC46703.1| pilus assembly protein [Bradyrhizobium japonicum USDA 110] Length = 244 Score = 244 bits (622), Expect = 1e-62, Method: Composition-based stats. Identities = 58/208 (27%), Positives = 99/208 (47%), Gaps = 8/208 (3%) Query: 44 LGQLFFLLLFYGTSALAYYDEGS---DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHD 100 G L + L G S Y+ R+PI + + + + + + RG + Sbjct: 16 GGALVGIGLALGGCQHDEAVTASIPDSYKQRHPIAIEEQNRSIVVFVGHARGGLTAAQRA 75 Query: 101 TIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDAD 160 + G + ++ + + PS T ++ + ++++I+ ++ ++G+P I R Y + Sbjct: 76 DVMGLASAWLHEGTGAIHIDAPSGTPNARPVAESMREIQAMLAAAGVPPRGIIARPYQPE 135 Query: 161 YGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVV 215 + IRL+Y AG CG WPED+ + K N+ + NYGCAYQ NLAA V Sbjct: 136 DKRFLPPIRLTYSKIAAVAGPCGLWPEDIGPSMKNKGWFENKEYYNYGCAYQRNLAAMVD 195 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 NP DL PR TP +R + ++YR+ Sbjct: 196 NPSDLEQPRPETPSYTTRRTAAFEKYRK 223 >gi|190889882|ref|YP_001976424.1| pilus assembly protein [Rhizobium etli CIAT 652] gi|190695161|gb|ACE89246.1| pilus assembly protein [Rhizobium etli CIAT 652] Length = 252 Score = 243 bits (621), Expect = 1e-62, Method: Composition-based stats. Identities = 73/207 (35%), Positives = 111/207 (53%), Gaps = 5/207 (2%) Query: 40 RTLMLGQLFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 + L + + + G + L DYR R+PI++ + EQ VDIP+ + + Sbjct: 31 KALFVTAAVSVAVLSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVASTDRRLT 90 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 D IRGF Y + ++ +++L P + +SA+ + +R + S GI S I Sbjct: 91 IAQRDLIRGFAANYVSRASGPVYVLSPEGSPNSAAAHQLRNQVRAELASRGIASSKIINT 150 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNLAAQV Sbjct: 151 SYAAAGAGDAAPIRLSFTGTTAITTQCGQWPKDI-SNDFANQNYYNFGCATQNNLAAQVA 209 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA++R+ +IQ YR Sbjct: 210 NPEDLVAPRGMTPIDAQRRNNAIQEYR 236 >gi|86355865|ref|YP_467757.1| pilus assembly protein [Rhizobium etli CFN 42] gi|86279967|gb|ABC89030.1| pilus assembly protein [Rhizobium etli CFN 42] Length = 233 Score = 243 bits (621), Expect = 2e-62, Method: Composition-based stats. Identities = 72/207 (34%), Positives = 112/207 (54%), Gaps = 5/207 (2%) Query: 40 RTLMLGQLFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 R L+ + + + G + L DYR R+PI++ + EQ VDIP+ + + Sbjct: 12 RALVTTVVISVAILSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVASTDRRLT 71 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 D IRGF Y + ++ +++L P + +SA+ + +R + + GI S I Sbjct: 72 IAQRDLIRGFAANYISRASGPVYVLSPEGSPNSAAADQLRNQVRAELTTRGIASSKIINT 131 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNLAAQ+ Sbjct: 132 SYAAAGAGDAAPIRLSFTGTTAITTQCGQWPKDI-SNDLANQNYYNFGCASQNNLAAQIA 190 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA++R+ +IQ YR Sbjct: 191 NPEDLVAPRGMTPIDAQRRNNAIQEYR 217 >gi|146342079|ref|YP_001207127.1| putative pilus assembly protein CpaD [Bradyrhizobium sp. ORS278] gi|146194885|emb|CAL78910.1| Putative pilus assembly protein cpaD [Bradyrhizobium sp. ORS278] Length = 256 Score = 242 bits (619), Expect = 2e-62, Method: Composition-based stats. Identities = 52/196 (26%), Positives = 94/196 (47%), Gaps = 8/196 (4%) Query: 56 TSALAYYDE---GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKND 112 ++ + DY+ R+PI + + Q + + + +GRG + P + G ++ + Sbjct: 40 CNSTSGEVVATIPDDYKMRHPIAIEEGRQSIVVFIGSGRGGLTMPQRADVAGLARSWRRE 99 Query: 113 SASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSY 172 + +P+ T ++ + + A ++I ++ G+P +I+ R D + IRLSY Sbjct: 100 GTGAIVADVPAGTPNARAAQDAYREIHAMLTEGGVPSRAITMRHPTPDDPRQLAVIRLSY 159 Query: 173 FASKPSAGKCGFWPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVT 227 AG CG W +D+ N N+++ N+GCA Q NLAA + NP DL PR T Sbjct: 160 PKIAAVAGPCGLWQDDLGPNINNPGYSSNQHYQNFGCATQRNLAAMIDNPADLEQPRSET 219 Query: 228 PPDAEQRDKSIQRYRQ 243 +R ++YR+ Sbjct: 220 AAYTPRRSALFEKYRK 235 >gi|115525750|ref|YP_782661.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris BisA53] gi|115519697|gb|ABJ07681.1| pilus biogenesis lipoprotein CpaD [Rhodopseudomonas palustris BisA53] Length = 249 Score = 241 bits (615), Expect = 6e-62, Method: Composition-based stats. Identities = 55/184 (29%), Positives = 98/184 (53%), Gaps = 5/184 (2%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI +++ +Q ++I + GRG + P I + + +++ + + P+ Sbjct: 45 PDDYRQRHPIAIQEADQTLNIFVGTGRGGLTGPQRAAIAAVAQSWLSEATGRIVIDQPAQ 104 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T ++ + +V++IR ++ ++GIP ++++ R Y IR++Y +AG CG Sbjct: 105 TPNARAAADSVREIRALLAAAGIPTNAVAVREYQPSDPRLFAAIRVNYPRLVATAGPCGL 164 Query: 185 WPEDMLGNAKG-----NRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 WP+D+ + N+ N+GCA Q NLAA V NP DL PR TP +R + Sbjct: 165 WPDDLGPSVNNPGYIENKPSYNHGCAVQRNLAAMVENPADLVQPRAETPAYTARRTIAFD 224 Query: 240 RYRQ 243 +YR+ Sbjct: 225 KYRK 228 >gi|241207158|ref|YP_002978254.1| pilus biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861048|gb|ACS58715.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 235 Score = 240 bits (613), Expect = 1e-61, Method: Composition-based stats. Identities = 75/212 (35%), Positives = 112/212 (52%), Gaps = 10/212 (4%) Query: 35 KNFFLRTLMLGQLFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAG 90 K FF M + + G + L DYR R+PI++ + EQ VDIP+ + Sbjct: 15 KAFFAMAAMS-----MAILSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVAST 69 Query: 91 RGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVS 150 + D IRGF Y + ++ +++L P + +SA+ + +R + S GI S Sbjct: 70 DRRLTIAQRDLIRGFATNYISRASGPVYVLSPQGSPNSAAAYQLRNQVRAELTSRGIASS 129 Query: 151 SISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNL 210 I Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNL Sbjct: 130 KIVNTSYAAAGPGDAAPIRLSFTGTTAVTTQCGQWPKDI-SNDLTNQNYYNFGCASQNNL 188 Query: 211 AAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 AAQ+ NP DL +PR +TP DA++R+ +IQ YR Sbjct: 189 AAQIANPEDLVAPRGMTPIDAQRRNNAIQEYR 220 >gi|325291658|ref|YP_004277522.1| components of type IV pilus [Agrobacterium sp. H13-3] gi|325059511|gb|ADY63202.1| components of type IV pilus [Agrobacterium sp. H13-3] Length = 252 Score = 240 bits (613), Expect = 1e-61, Method: Composition-based stats. Identities = 73/178 (41%), Positives = 104/178 (58%) Query: 65 GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP 124 DYR R+PI + + E +DIP+ AG + + D +RGF + Y + S ++ + +PS Sbjct: 66 PDDYRTRHPITLSEAEHSLDIPVSAGDSRLTTAMADNVRGFAQNYASMSTGIVNIQMPSG 125 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 + +SA+ + + IR + +G+P I E Y A D IRLSY A G+CG Sbjct: 126 SANSAAASKMARQIRSALSGAGVPSGKIMETRYAASPNGDAAPIRLSYVAVTAMTGQCGQ 185 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 WPED+ N N+NW N+GCA Q+NLAAQV NP+DL PR ++P DAE+R I YR Sbjct: 186 WPEDLSDNTFANKNWYNFGCASQSNLAAQVANPMDLVGPRGMSPIDAERRAVVIDAYR 243 >gi|327194693|gb|EGE61539.1| pilus assembly protein [Rhizobium etli CNPAF512] Length = 252 Score = 239 bits (611), Expect = 2e-61, Method: Composition-based stats. Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 5/207 (2%) Query: 40 RTLMLGQLFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 + L + + + G + L DYR R+PI++ + EQ VDIP+ + + Sbjct: 31 KALFVTAAVSVAVLSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVASTDRRLT 90 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 D IRGF Y + ++ +++L P + +S + + +R + S GI S I Sbjct: 91 IAQRDLIRGFAANYVSRASGPVYVLSPEDSPNSTAAHQLRNQVRAELASRGIASSKIINT 150 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNLAAQV Sbjct: 151 SYAAAGAGDAAPIRLSFTGTTAITTQCGQWPKDI-SNDFANQNYYNFGCATQNNLAAQVA 209 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA++R+ +IQ YR Sbjct: 210 NPEDLVAPRGMTPIDAQRRNNAIQEYR 236 >gi|328545278|ref|YP_004305387.1| Pilus biogenesis lipoprotein CpaD [polymorphum gilvum SL003B-26A1] gi|326415020|gb|ADZ72083.1| Pilus biogenesis lipoprotein CpaD [Polymorphum gilvum SL003B-26A1] Length = 239 Score = 239 bits (609), Expect = 3e-61, Method: Composition-based stats. Identities = 58/183 (31%), Positives = 94/183 (51%) Query: 60 AYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFL 119 A +DYR R+PI++ + + +D+P+ + + + F + + + + + Sbjct: 36 APLAATNDYRLRHPIVITEQAETLDLPVGQSTRNLNRDFAERVTEFGQASRRNGNGHVEI 95 Query: 120 LIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSA 179 L+PS + A++ IR + G+ + +S R Y D IRL+Y K SA Sbjct: 96 LVPSGAANEAAVHAVTPRIRSALALGGVSGTHVSTRSYPVDDATAQAPIRLAYTRIKASA 155 Query: 180 GKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQ 239 G CG WP ++ G+ N+++ N+GCA Q NLAA V NP DL PR +TP D +R Q Sbjct: 156 GPCGEWPANIGGSLNANQDYYNFGCATQANLAAMVDNPADLLGPRAMTPADQMRRATVFQ 215 Query: 240 RYR 242 +YR Sbjct: 216 KYR 218 >gi|13474661|ref|NP_106230.1| pilus assembly protein cpaD [Mesorhizobium loti MAFF303099] gi|14025416|dbj|BAB52016.1| pilus assembly protein; CpaD [Mesorhizobium loti MAFF303099] Length = 251 Score = 234 bits (598), Expect = 7e-60, Method: Composition-based stats. Identities = 68/195 (34%), Positives = 105/195 (53%), Gaps = 4/195 (2%) Query: 51 LLFYGTS---ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLE 107 L G + ++ DYR +PI++ + Q +D+P+ AG + DT+ GFL+ Sbjct: 42 ALLVGCAQRDSITVGAIPDDYRTNHPIVIAEKNQKIDLPVGAGDRGMTGSQRDTLLGFLD 101 Query: 108 KYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDT 167 Y +A L + IPS + + + A +D ++ ++SG+ + I Y A Sbjct: 102 GYDKSAAPTLTIQIPSGSANEVAATAAGRDFARLAVASGVKRNRIVVVSYQAGSSETSAP 161 Query: 168 IRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVT 227 +R+SY A + KCG WPED+L + N+++ ++GC+YQNNLAAQ+ NP DL PR T Sbjct: 162 VRVSYIAVRAQTDKCGRWPEDLLETS-ENKHYADFGCSYQNNLAAQMANPADLLGPRKQT 220 Query: 228 PPDAEQRDKSIQRYR 242 DAE R K I YR Sbjct: 221 TIDAENRGKVIDVYR 235 >gi|222084470|ref|YP_002542999.1| pilus assembly protein [Agrobacterium radiobacter K84] gi|221721918|gb|ACM25074.1| pilus assembly protein [Agrobacterium radiobacter K84] Length = 235 Score = 234 bits (597), Expect = 9e-60, Method: Composition-based stats. Identities = 75/207 (36%), Positives = 111/207 (53%), Gaps = 4/207 (1%) Query: 40 RTLMLGQLFFLLLFYGTSALAYYDEGS----DYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 R L + G + S DYR R+PI++ E VDIP+ + Sbjct: 15 RFAALAMIVMATAVSGCAGSRDGMTTSAITDDYRQRHPIVLTDKEHRVDIPVSVSDRRLT 74 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 + DT+RGF++ Y+ + + ++ P + +SA+ + IR+ +++SGIP + I++ Sbjct: 75 SGMRDTVRGFVQDYRAHATGTVEIMTPRESANSAAASALRRQIRQELMASGIPSARITDN 134 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRL + A+ CG WP D+ NA N+N+ N+GCA QNNLAAQV Sbjct: 135 YYPAGGPGDAAPIRLRFMATAAVTNACGQWPADLADNAFDNQNYYNFGCATQNNLAAQVA 194 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA+QR K I YR Sbjct: 195 NPTDLIAPRAMTPIDADQRSKVIDNYR 221 >gi|260461948|ref|ZP_05810193.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Mesorhizobium opportunistum WSM2075] gi|259032195|gb|EEW33461.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Mesorhizobium opportunistum WSM2075] Length = 243 Score = 230 bits (586), Expect = 1e-58, Method: Composition-based stats. Identities = 70/207 (33%), Positives = 112/207 (54%), Gaps = 4/207 (1%) Query: 39 LRTLMLGQLFFLLLFYGTS---ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 LR L + + L G + ++ DYR +PI++ + Q +D+P+ AG + Sbjct: 22 LRALPVLAVAATALLAGCAQRDSVTVGAIPDDYRTNHPIVIAEKNQKIDLPVGAGDRGMT 81 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 DT+ GFL+ Y +A L + IPS + + + R A +D ++ ++SGI + I+ Sbjct: 82 GSQRDTLLGFLDGYDKSAAPALTIQIPSGSANEVAARAAGRDFARLAVASGIKRNRIAVV 141 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A +R+S+ A + KCG WPED++ ++ N+++ ++GC+YQNNLAAQ+ Sbjct: 142 SYQAGSSEASAPVRVSFIAVRAQTDKCGRWPEDLVESS-ENKHYADFGCSYQNNLAAQMA 200 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL PR T DAE R I YR Sbjct: 201 NPADLLGPRKQTTIDAENRGAVIDVYR 227 >gi|153008060|ref|YP_001369275.1| pilus biogenesis lipoprotein CpaD [Ochrobactrum anthropi ATCC 49188] gi|151559948|gb|ABS13446.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Ochrobactrum anthropi ATCC 49188] Length = 239 Score = 229 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 55/182 (30%), Positives = 98/182 (53%), Gaps = 1/182 (0%) Query: 58 ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVL 117 + DYR +PI + + EQ+ DIP+ ++ ++G + Y+ + +L Sbjct: 33 HVTVGALPDDYRTNHPITIAEREQVTDIPVAQADQKLSPMQRGIVQGAIANYRRGGSGML 92 Query: 118 FLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 ++L+PS T + A+ R ++ ++ GI ++I+ Y + IR+SY+A Sbjct: 93 YVLVPSGTSNQAAAYRLSTEVSAMLRRGGIKANNIAIENYPVENPEAAAPIRISYYAMTA 152 Query: 178 SAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 CG WP+D L + N+++ N+GCA QNNLAAQV NP DL PR+++P D+++ + Sbjct: 153 GTTPCGRWPDD-LASTPENKHYANFGCASQNNLAAQVANPADLLGPRVMSPIDSDRTTER 211 Query: 238 IQ 239 + Sbjct: 212 LN 213 >gi|90419765|ref|ZP_01227674.1| putative pilus assembly protein cpaD [Aurantimonas manganoxydans SI85-9A1] gi|90335806|gb|EAS49554.1| putative pilus assembly protein cpaD [Aurantimonas manganoxydans SI85-9A1] Length = 244 Score = 229 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 57/194 (29%), Positives = 96/194 (49%), Gaps = 5/194 (2%) Query: 54 YGTSALAYYDE----GSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKY 109 G A + E DYR R+PI++ + ++ +DIP++ + Y + F +++ Sbjct: 37 AGGCANVHNVEVGSIPDDYRTRHPIVVSEDQEAIDIPIVMSDARLSYANRGRVEHFGDRF 96 Query: 110 KNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIR 169 + A + +++P+ + + + R +I + + I I + Y A IR Sbjct: 97 RASGADSIQVMLPTGSANQYAAERVSHEIVEALRGRYISRDRIFVQPYSAVGAEGPTPIR 156 Query: 170 LSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPP 229 L+Y G CG WP+DM + N+N+ N+GCA Q NLAAQ+ +P DL SPR V Sbjct: 157 LTYATLVAKTGPCGRWPDDM-TDTSENKNYFNFGCASQQNLAAQIADPRDLLSPRGVDSI 215 Query: 230 DAEQRDKSIQRYRQ 243 DA +R + YR+ Sbjct: 216 DAGRRTTVLDNYRR 229 >gi|319785607|ref|YP_004145083.1| pilus biogenesis lipoprotein CpaD [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171495|gb|ADV15033.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 245 Score = 227 bits (579), Expect = 1e-57, Method: Composition-based stats. Identities = 59/186 (31%), Positives = 101/186 (54%), Gaps = 1/186 (0%) Query: 58 ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVL 117 ++ DYR +PI++ + Q +D+P+ AG + DT+ GFL+ Y +A L Sbjct: 46 SITVGAIPDDYRTNHPIVIAEKNQKIDLPVGAGDRGMTGSQRDTLLGFLDGYDRSAAPTL 105 Query: 118 FLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 + +PS + + + A +D ++ ++SGI + I Y + IR++Y + K Sbjct: 106 TIQVPSGSANEVAATTAARDFARLAVASGIKRNRIVVTSYQSASAEASAPIRVAYISVKA 165 Query: 178 SAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKS 237 KCG WPED++ + N+++ ++GC+YQNNLAAQ+ NP DL PR D R ++ Sbjct: 166 QTDKCGRWPEDLMETS-ENKHYADFGCSYQNNLAAQMANPADLLGPRKSANIDPANRSQA 224 Query: 238 IQRYRQ 243 I Y++ Sbjct: 225 IDVYQK 230 >gi|239833236|ref|ZP_04681565.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Ochrobactrum intermedium LMG 3301] gi|239825503|gb|EEQ97071.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Ochrobactrum intermedium LMG 3301] Length = 239 Score = 224 bits (570), Expect = 1e-56, Method: Composition-based stats. Identities = 55/176 (31%), Positives = 93/176 (52%), Gaps = 1/176 (0%) Query: 58 ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVL 117 + DYR +PI + + EQ+ DIP+ ++ ++G + Y+ + +L Sbjct: 33 HVTVGALPDDYRTNHPITIAEREQVTDIPIAQADQKLSPMQRGIVQGAIANYRRSGSGML 92 Query: 118 FLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 ++L+PS + A+ R ++ + SGI ++I+ Y + IR+SY+A Sbjct: 93 YVLVPSGASNQAAAYRLSTEVAATLRRSGIKANNIAIENYPVESPDAAAPIRISYYAITA 152 Query: 178 SAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQ 233 CG WP+D L + N+++ N+GC QNNLAAQV NP DL PR +TP D+++ Sbjct: 153 GTTPCGRWPDD-LASTPENKHYANFGCVSQNNLAAQVANPADLLGPRTMTPIDSDR 207 >gi|254473746|ref|ZP_05087141.1| components of type IV pilus [Pseudovibrio sp. JE062] gi|211957132|gb|EEA92337.1| components of type IV pilus [Pseudovibrio sp. JE062] Length = 218 Score = 216 bits (550), Expect = 3e-54, Method: Composition-based stats. Identities = 55/196 (28%), Positives = 92/196 (46%), Gaps = 6/196 (3%) Query: 52 LFYGTSALAYYDEGS-----DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFL 106 + ++ + Y E DYR R+PI++ +V + DIP+ + + + F Sbjct: 1 MLAACNSTSSYVEQHTVADIDYRKRHPIVITEVPENFDIPVSGEARNLNRSLKTAVAAFG 60 Query: 107 EKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVD 166 ++ D + +L+PS + + A++R IR + G+ + I R Y Sbjct: 61 QQAIVDGNGFVEVLVPSGSANEAAVRAISPQIRTALKQGGMEANKIVMRSYSVSDMAASA 120 Query: 167 TIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMV 226 +RLS+ K + CG WP M+ N N ++ N+GCA Q NLAA V NP DL PR + Sbjct: 121 PVRLSFMRIKGAVRDCGNWPTGMVVNH-QNLDYHNFGCASQANLAAVVDNPTDLLRPRTL 179 Query: 227 TPPDAEQRDKSIQRYR 242 P D + + + + R Sbjct: 180 GPNDPSRTNVVLTKNR 195 >gi|323137426|ref|ZP_08072504.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylocystis sp. ATCC 49242] gi|322397413|gb|EFX99936.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylocystis sp. ATCC 49242] Length = 254 Score = 215 bits (548), Expect = 3e-54, Method: Composition-based stats. Identities = 63/216 (29%), Positives = 99/216 (45%), Gaps = 15/216 (6%) Query: 39 LRTLMLGQLFFLLLFYGTSALA------YYDEGSDYRDRYPILMRKVEQIVDIPLLAGRG 92 LR L G++ LLL A DY DR+P+++ + ++D+ G Sbjct: 22 LRALRAGKVLALLLTAPLGACGVNRVLPPPVAPYDYHDRHPVVLAEAPHVIDLFPSVVHG 81 Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + Y I+ F+E Y+ + LL P SA+ V +R+ + ++G+ +I Sbjct: 82 GVDYTTEGRIKEFVEHYREFGHGQVTLLTPVGAPYSAA---GVSAVRRALAAAGL-RGNI 137 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSA-GKCGFWPEDMLGNAK----GNRNWTNYGCAYQ 207 Y IRLS+ + K G+CG WP D+ N+++ N+GCA Q Sbjct: 138 LVGTYSVTDPRLAAPIRLSFQSLKAKVSGRCGEWPTDLASGTSLQGWENQSYWNFGCASQ 197 Query: 208 NNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 L+AQV +P DL PR T D E R ++I R R+ Sbjct: 198 QTLSAQVADPRDLAVPRGETASDIEMRMRAINRVRR 233 >gi|154250690|ref|YP_001411514.1| pilus biogenesis lipoprotein CpaD [Parvibaculum lavamentivorans DS-1] gi|154154640|gb|ABS61857.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Parvibaculum lavamentivorans DS-1] Length = 234 Score = 213 bits (542), Expect = 2e-53, Method: Composition-based stats. Identities = 51/203 (25%), Positives = 93/203 (45%), Gaps = 5/203 (2%) Query: 44 LGQLFFLLLFYGTSALAYYD--EGSDYRDR--YPILMRKVEQIVDIPLLAGRGEIKYPIH 99 + F L L +G + ++ E + Y +PI + ++I ++ G+ + Sbjct: 12 TAKGFALALAFGVAGCGGFNGAEQAHYDANYTHPISVEADVATLNIDVVPGQPGVTSTDR 71 Query: 100 DTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDA 159 + I GF Y+ L + PS + ++ + A+ D+R+++ G+ + +S Y A Sbjct: 72 NAIAGFAAGYRQRGHGPLTISTPSGSPNAGAAAVALSDVREVLSEHGVGGNDLSYTPYRA 131 Query: 160 DYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLD 219 + + LS+ CG W N N+GC+ QNNLAA V +P D Sbjct: 132 SGTDNSAPLILSFKRYVAKPTACGDW-SGSYSYDPSNGLLPNHGCSTQNNLAAMVADPGD 190 Query: 220 LFSPRMVTPPDAEQRDKSIQRYR 242 L +PR ++P DA +R +++YR Sbjct: 191 LIAPRNMSPADAARRGTVLEKYR 213 >gi|188581661|ref|YP_001925106.1| pilus biogenesis lipoprotein CpaD [Methylobacterium populi BJ001] gi|179345159|gb|ACB80571.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium populi BJ001] Length = 247 Score = 206 bits (524), Expect = 2e-51, Method: Composition-based stats. Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 8/192 (4%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLF 118 D R R+PI++ ++ +D+ G G I I FL +Y+ +L Sbjct: 34 TTGSTYPIDVRTRHPIVLADADRSLDVF-PTGIGHIDPRQRADIEAFLVEYRRYGRGILV 92 Query: 119 LLIPSPTVS--SASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASK 176 + +P + S+ R +IR++ G+P + I Y +RLS+ + Sbjct: 93 VELPRGVSPGLAGSVERTGAEIRRLAAEMGVPAAGIRVANYPVANPTLASPLRLSFQRMQ 152 Query: 177 P-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 A KCG WP D+ + N N GCA Q NLAAQV +P+DL R D Sbjct: 153 AKVADKCGLWPRDLGVSDLRANWSNEPTWNLGCATQANLAAQVADPIDLVRGRPEGRIDT 212 Query: 232 EQRDKSIQRYRQ 243 R K + + R+ Sbjct: 213 VLRTKDLGQLRE 224 >gi|23011548|ref|ZP_00051876.1| hypothetical protein Magn03006165 [Magnetospirillum magnetotacticum MS-1] Length = 248 Score = 204 bits (520), Expect = 7e-51, Method: Composition-based stats. Identities = 47/194 (24%), Positives = 82/194 (42%), Gaps = 8/194 (4%) Query: 57 SALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASV 116 +A D R R+PI++ ++ +D+ G G + + FL +Y+ + Sbjct: 33 AATTGSTYPIDVRTRHPIVLADADRTLDVF-PTGVGHLDPRQRADLEAFLVEYRRYGRGL 91 Query: 117 LFLLIPSPTVSSAS--IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFA 174 L + +P + + + R IR++ G+P Y +RLS+ Sbjct: 92 LLVEMPRGVSPALAGPVERTGAAIRRLAAEMGVPAGGFRIGDYPIANPALAAPLRLSFQR 151 Query: 175 SKP-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPP 229 + A +CG WP D+ + N N+GCA + N AAQV +P+DL R Sbjct: 152 MQAKVADQCGLWPRDLGASDLRADWSNEPTWNFGCATRANFAAQVADPVDLVRGRPEGRI 211 Query: 230 DAEQRDKSIQRYRQ 243 D +R + I + R+ Sbjct: 212 DTIRRTQDIGQLRE 225 >gi|218530655|ref|YP_002421471.1| pilus biogenesis lipoprotein CpaD [Methylobacterium chloromethanicum CM4] gi|218522958|gb|ACK83543.1| pilus biogenesis lipoprotein CpaD [Methylobacterium chloromethanicum CM4] Length = 248 Score = 201 bits (512), Expect = 6e-50, Method: Composition-based stats. Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 8/192 (4%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLF 118 D R R+PI++ ++ +D+ G G I I FL +Y+ +L Sbjct: 35 TTGSTYPIDVRTRHPIVLADADRTLDVF-PTGIGHIDPRQRADIEAFLVEYRRYGRGILL 93 Query: 119 LLIPSPTVSSAS--IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASK 176 + +P + + + R IR++ G+P + + Y +RLS+ + Sbjct: 94 VELPRGVSPALAGPVERTGASIRRLATEMGVPAAGVRVAAYPIANPTLASPLRLSFQRMQ 153 Query: 177 P-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 A KCG WP D+ + N N GCA Q+N+AAQV +P+DL R D Sbjct: 154 AKVADKCGLWPRDLGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDT 213 Query: 232 EQRDKSIQRYRQ 243 R K + + R+ Sbjct: 214 VLRTKDLGQLRE 225 >gi|163851904|ref|YP_001639947.1| pilus biogenesis lipoprotein CpaD [Methylobacterium extorquens PA1] gi|163663509|gb|ABY30876.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium extorquens PA1] Length = 248 Score = 201 bits (511), Expect = 7e-50, Method: Composition-based stats. Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 8/192 (4%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLF 118 D R R+PI++ ++ +D+ G G I I FL +Y+ +L Sbjct: 35 TTGSTYPIDVRTRHPIVLADADRTLDVF-PTGIGHIDPRQRADIEAFLGEYRRYGRGILL 93 Query: 119 LLIPSPTVSSAS--IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASK 176 + +P + + + R IR++ G+P + + Y +RLS+ + Sbjct: 94 VELPRGVSPALAGPVERTGASIRRLAAEMGVPAAGVRVAAYPIANPTLASPLRLSFQRMQ 153 Query: 177 P-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 A KCG WP D+ + N N GCA Q+N+AAQV +P+DL R D Sbjct: 154 AKVADKCGLWPRDLGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDT 213 Query: 232 EQRDKSIQRYRQ 243 R K + + R+ Sbjct: 214 VLRTKDLGQLRE 225 >gi|254561622|ref|YP_003068717.1| pilus assembly protein cpaD [Methylobacterium extorquens DM4] gi|254268900|emb|CAX24861.1| pilus assembly protein cpaD [Methylobacterium extorquens DM4] Length = 248 Score = 201 bits (510), Expect = 9e-50, Method: Composition-based stats. Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 8/192 (4%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLF 118 D R R+PI++ ++ +D+ G G I I FL +Y+ +L Sbjct: 35 TTGSTYPIDVRTRHPIVLADADRTLDVF-PTGIGHIDPRQRADIEAFLVEYRRYGRGILL 93 Query: 119 LLIPSPTVSSAS--IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASK 176 + +P + + + R IR++ G+P + + Y +RLS+ + Sbjct: 94 VELPRGVSPALAGPVERTGASIRRLAAEMGVPAAGVRVAAYPIANPTLASPLRLSFQRMQ 153 Query: 177 P-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 A KCG WP D+ + N N GCA Q+N+AAQV +P+DL R D Sbjct: 154 AKVADKCGLWPRDLGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDT 213 Query: 232 EQRDKSIQRYRQ 243 R K + + R+ Sbjct: 214 VLRTKDLGQLRE 225 >gi|300021855|ref|YP_003754466.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Hyphomicrobium denitrificans ATCC 51888] gi|299523676|gb|ADJ22145.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Hyphomicrobium denitrificans ATCC 51888] Length = 245 Score = 200 bits (509), Expect = 1e-49, Method: Composition-based stats. Identities = 51/177 (28%), Positives = 86/177 (48%), Gaps = 4/177 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASV--LFLLIPSP 124 D R+PIL+ + ++++ + AG + + F+++++ A + P+ Sbjct: 50 DPEQRHPILVSQQPAVLNLHVAAGSEGLTPSQRSRVIDFIDRHRASDAGNSRFVISAPAG 109 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 + + A+ A D R++I+ G SSI+ Y A +R+SY +CG Sbjct: 110 SSNEAAAMDAASDTRRLILGGGYADSSIANEAYHASGRD--APLRISYLRYVAEGPECGR 167 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 + L A N + N+GC+ Q NLAA V NP DL PR +TP DA +R K ++Y Sbjct: 168 DWSENLARAYQNTPYPNFGCSSQRNLAAMVSNPADLLGPRTMTPSDANRRFKMYEKY 224 >gi|170750190|ref|YP_001756450.1| pilus biogenesis lipoprotein CpaD [Methylobacterium radiotolerans JCM 2831] gi|170656712|gb|ACB25767.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium radiotolerans JCM 2831] Length = 247 Score = 199 bits (507), Expect = 2e-49, Method: Composition-based stats. Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 8/192 (4%) Query: 58 ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVL 117 A + +DYR R+PI++ + +D+ G G + + F+ +Y+ L Sbjct: 34 ATTGGIDVTDYRARHPIVLTDGTRSLDVF-PTGTGHLDPRQATDVDAFMLEYRRYGRGSL 92 Query: 118 FLLIPSP--TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFAS 175 + +P ++ R + ++ +G+ I+ Y IRLS+ Sbjct: 93 LMQVPQGVPADQVVAVERTASVLGRLGTQNGVNGREIAVTGYAVAAPTLASPIRLSFQRM 152 Query: 176 KP-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPD 230 + A CG WP+D+ + NR N GCA Q+N+AAQV +P+DL R D Sbjct: 153 QAKVADACGLWPQDLGTSNFAIDYNNRPSWNLGCATQSNVAAQVADPVDLVRGRPEGRID 212 Query: 231 AEQRDKSIQRYR 242 +R + I + R Sbjct: 213 TVKRVRDIGQLR 224 >gi|220922780|ref|YP_002498082.1| pilus biogenesis lipoprotein CpaD [Methylobacterium nodulans ORS 2060] gi|219947387|gb|ACL57779.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium nodulans ORS 2060] Length = 246 Score = 199 bits (506), Expect = 3e-49, Method: Composition-based stats. Identities = 51/213 (23%), Positives = 89/213 (41%), Gaps = 12/213 (5%) Query: 41 TLMLGQLFFLLLFYGTS-----ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIK 95 L + L L G A S R+PI++ + +D+ + G G + Sbjct: 12 ALRAVSIGALALSLGACVANRAATTGSVYPSSVAARHPIVLADAPRSLDVFV-TGIGHVD 70 Query: 96 YPIHDTIRGFLEKYKNDSASVLFLLIPSPT-VSSASIRRAVKDIRKIIISSGIPVSSISE 154 D I FL +Y+ VL + +P V ++ R +R+ + G+P + Sbjct: 71 PRQSDDIDAFLLEYRRYGRGVLVIEVPRGAQVPGPAVARTAALLRERAVGRGVPARELVV 130 Query: 155 RIYDADYGMDVDTIRLSYFASKP-SAGKCGFWPEDMLGNAKG----NRNWTNYGCAYQNN 209 Y +R+S+ + +G CG WP+D+ + G N ++ NYGC+ + N Sbjct: 131 APYAVVNPAVAAPVRMSFQRMQARVSGACGLWPQDLGVSEPGFELRNESFWNYGCSTRAN 190 Query: 210 LAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 A+Q+ +P+DL R D R + I+ R Sbjct: 191 FASQIADPVDLVRGRQEGRIDTVSRTQDIESLR 223 >gi|170740620|ref|YP_001769275.1| pilus biogenesis lipoprotein CpaD [Methylobacterium sp. 4-46] gi|168194894|gb|ACA16841.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylobacterium sp. 4-46] Length = 249 Score = 199 bits (505), Expect = 4e-49, Method: Composition-based stats. Identities = 48/205 (23%), Positives = 83/205 (40%), Gaps = 12/205 (5%) Query: 49 FLLLFYGTS-----ALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 L L G A E + + R+PI + + +D+ + G G + D + Sbjct: 23 ALALSLGACVSNRAATTGSIEPASVQARHPIALADAPRNLDVFV-TGMGHVDPRQADDVD 81 Query: 104 GFLEKYKNDSASVLFLLIPSP-TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 FL +++ VL + +P ++ R +R+ ++ G+ + Y Sbjct: 82 AFLLEFRRYGRGVLVIEVPRGGQAPGPAVARTAALLRERALARGVSARELVVAPYPVANA 141 Query: 163 MDVDTIRLSYFASKP-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNP 217 +RLS+ + CG WP D+ + NR N+GCAYQ+N A Q+ +P Sbjct: 142 SVAAPVRLSFQRMQAKVTSTCGLWPNDLGVSDPAVDVSNRTHWNHGCAYQSNFARQIADP 201 Query: 218 LDLFSPRMVTPPDAEQRDKSIQRYR 242 +DL R D R + I+ R Sbjct: 202 VDLVRGRQEGRIDTISRTQDIESLR 226 >gi|240139027|ref|YP_002963502.1| pilus assembly protein cpaD [Methylobacterium extorquens AM1] gi|240008999|gb|ACS40225.1| pilus assembly protein cpaD [Methylobacterium extorquens AM1] Length = 248 Score = 197 bits (502), Expect = 7e-49, Method: Composition-based stats. Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 8/192 (4%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLF 118 D R R+PI++ ++ +D+ G G I I FL +Y+ +L Sbjct: 35 TTGSTYPIDVRTRHPIVLADADRTLDVF-PTGIGHIDPRQRADIEAFLVEYRRYGRGILL 93 Query: 119 LLIPSPTVSSAS--IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASK 176 + +P + + + R IR++ G+P + + Y +RLS+ + Sbjct: 94 VELPRGVSPALAGPVERTGASIRRLAAEMGVPAAGVRVAAYPIANLTLASPLRLSFQRMQ 153 Query: 177 P-SAGKCGFWPEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 A KCG WP D+ + N N GCA Q+N+AAQV +P+DL R D Sbjct: 154 AKVADKCGLWPRDLGASDLRANWSNEPTWNLGCAMQSNVAAQVADPIDLVRGRPEGRIDT 213 Query: 232 EQRDKSIQRYRQ 243 R K + + R+ Sbjct: 214 VLRTKDLGQLRE 225 >gi|296444394|ref|ZP_06886359.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylosinus trichosporium OB3b] gi|296258041|gb|EFH05103.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Methylosinus trichosporium OB3b] Length = 245 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 53/181 (29%), Positives = 89/181 (49%), Gaps = 10/181 (5%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYRDR+P+++ +D+ + D I+ F+++Y+ + LL P+ + Sbjct: 48 DYRDRHPVVLADATTAIDVFP---EQRLDQATVDRIQSFVQRYRRLGHGQITLLAPTGSR 104 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP-SAGKCGFW 185 ++ + R V +R+ + SG+ ++ Y +RLS+ K A +CG W Sbjct: 105 NT-ATRAGVDAVRRQLADSGV-AGAVYVGTYPVSDADLAAPVRLSFQGIKAKVADRCGQW 162 Query: 186 PEDMLGNAK----GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 PED+ + N N+GCA Q LAAQ+ +P DL SPR TP D E R +++ + Sbjct: 163 PEDLASASSLKGWNNDTHWNFGCANQATLAAQIDDPRDLASPRGETPADIESRMRALNKV 222 Query: 242 R 242 R Sbjct: 223 R 223 >gi|83859358|ref|ZP_00952879.1| pilus assembly protein CpaD [Oceanicaulis alexandrii HTCC2633] gi|83852805|gb|EAP90658.1| pilus assembly protein CpaD [Oceanicaulis alexandrii HTCC2633] Length = 218 Score = 173 bits (439), Expect = 2e-41, Method: Composition-based stats. Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 4/169 (2%) Query: 77 RKVEQIVDI--PLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRA 134 + V + + A + + D I +YK L + P ++ + A Sbjct: 30 TAAPETVRVSMDVSALDNGLTWNQIDVIEAVAAEYKARGHGPLVISYPQNAGNADAAIGA 89 Query: 135 VKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAK 194 + + R + +G+ IS Y+A G + S+ + A +C DM + + Sbjct: 90 IAEARTRLYEAGLDWRQISGGAYEA-GGQASAPVIFSFTRYQAVAPECSTAWNDM-AHMR 147 Query: 195 GNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 N++W +GCA NNLA V +P DL +PR V PD+ +R + RYRQ Sbjct: 148 ANQDWPRFGCATANNLANMVADPRDLVAPRGVDAPDSARRQTVLDRYRQ 196 >gi|114568971|ref|YP_755651.1| pilus biogenesis lipoprotein CpaD [Maricaulis maris MCS10] gi|114339433|gb|ABI64713.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Maricaulis maris MCS10] Length = 227 Score = 161 bits (408), Expect = 6e-38, Method: Composition-based stats. Identities = 40/159 (25%), Positives = 66/159 (41%), Gaps = 5/159 (3%) Query: 87 LLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSG 146 + + + + +YK L + P + + A+ + R G Sbjct: 49 VNPQDNGLTWAQQGMLAAVAAEYKARGHGPLVISYPQGAGNEDAAIGAIAEARSFFYEQG 108 Query: 147 IPVSSISERIYDADYGMDVDTIRLSYFASKPSAG-KC-GFWPEDMLGNAKGNRNWTNYGC 204 I I+ YDA + + I S+ + A +C G W D + N++ TN+GC Sbjct: 109 IDWRVIAGGAYDARGRQNGELI-FSFTRYEAVAPAECDGSW--DQMALEFDNQHHTNFGC 165 Query: 205 AYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 A NLAA V +P DL +PR + D +R I+ YR+ Sbjct: 166 ALAVNLAAMVADPRDLVAPRDMEAGDTGRRQTVIEGYRE 204 >gi|167648151|ref|YP_001685814.1| pilus biogenesis lipoprotein CpaD [Caulobacter sp. K31] gi|167350581|gb|ABZ73316.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Caulobacter sp. K31] Length = 234 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 41/151 (27%), Positives = 71/151 (47%), Gaps = 2/151 (1%) Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + + G L ++ A + + P + R +R+ +I+ G P +S+ Sbjct: 65 GLSANQAQALDGLLNRWLAAEAREILVSAPIGGKDADVAGRMAFAVRQRLIAMGAPPASV 124 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 YDA G+ +++ + CG W + + + N+ + N+GCA N+AA Sbjct: 125 RVVGYDAGAGLAAAPLKVGFLRYHAQVPTCGGW--ENIAATRDNKPYDNFGCAVTANMAA 182 Query: 213 QVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 QV NP DL SPR TP D+ +RD + +YR+ Sbjct: 183 QVANPEDLLSPRATTPVDSARRDTVLGKYRK 213 >gi|254420868|ref|ZP_05034592.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas sp. BAL3] gi|196187045|gb|EDX82021.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas sp. BAL3] Length = 205 Score = 145 bits (365), Expect = 7e-33, Method: Composition-based stats. Identities = 39/159 (24%), Positives = 70/159 (44%), Gaps = 5/159 (3%) Query: 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIIS 144 I L + H +R +++ A V+ + P+ S A+ + D R + Sbjct: 30 IALAVHEQGLSANQHAALRDLAQRFAAAGAGVIVIEAPAGEDSVAA--KTAFDTRAALAQ 87 Query: 145 SGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGC 204 G+ + + Y + + + + + +CG + L N + +N+GC Sbjct: 88 IGLDPNRLRVVSYAGPDPR--APVLVGFETVQAAVPRCGAAWGN-LSRTGDNMSGSNFGC 144 Query: 205 AYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 A NLAAQ+ +P D+ +PR +TPP+A +R RYRQ Sbjct: 145 AVTANLAAQIADPRDIAAPRALTPPEAGRRSVVFDRYRQ 183 >gi|218670939|ref|ZP_03520610.1| pilus assembly protein [Rhizobium etli GR56] Length = 170 Score = 143 bits (360), Expect = 2e-32, Method: Composition-based stats. Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 4/141 (2%) Query: 47 LFFLLLFYGTSA----LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTI 102 + + G + L DYR R+PI++ + EQ VDIP+ + + D I Sbjct: 30 AVSVAILSGCAGPHDQLTTGGIPDDYRARHPIIVTEAEQTVDIPVASTDRRLTIAQRDLI 89 Query: 103 RGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 RGF Y + ++ +++L P + +SA+ + +R + S GI S I Y A Sbjct: 90 RGFAANYISRASGPVYVLSPEGSPNSAAAHQLRNHVRAELASRGIASSKIINTSYAAAGA 149 Query: 163 MDVDTIRLSYFASKPSAGKCG 183 D IRLS+ + +CG Sbjct: 150 GDAAPIRLSFTGTTAVTTQCG 170 >gi|302381755|ref|YP_003817578.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas subvibrioides ATCC 15264] gi|302192383|gb|ADK99954.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Brevundimonas subvibrioides ATCC 15264] Length = 222 Score = 140 bits (354), Expect = 1e-31, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 74/202 (36%), Gaps = 7/202 (3%) Query: 42 LMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDT 101 L+ L G A + RY + + + V + + + Sbjct: 6 LVASGALVLTGCVGLPAEGLDPAPLNPNSRYSLQVEPGIERVALAV--HDTGLSANQTRA 63 Query: 102 IRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADY 161 + ++ + A VL + PS + I+ + +SG+ + Y A Sbjct: 64 LEDIAGRFYAEGAPVLRIEAPSG--NDPVASEMAWRIKGALEASGVSSYQVQVVTYVAPD 121 Query: 162 GMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLF 221 + + + + +CG + L N + N+GCA NLAAQ+ NP D+ Sbjct: 122 PR--APVLVGFDTVRAVVPQCGTGWTN-LTRTGSNAGYGNFGCAVNANLAAQIANPRDIV 178 Query: 222 SPRMVTPPDAEQRDKSIQRYRQ 243 PR +TP DA +R YRQ Sbjct: 179 QPRTMTPVDAGRRAVVFDNYRQ 200 >gi|295690797|ref|YP_003594490.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Caulobacter segnis ATCC 21756] gi|295432700|gb|ADG11872.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Caulobacter segnis ATCC 21756] Length = 229 Score = 136 bits (343), Expect = 2e-30, Method: Composition-based stats. Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 10/152 (6%) Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + ++ + ++ L + PS +R+ +I G P + Sbjct: 66 GLSDNQRVALQALVGRWLQAEGRELVVTAPSGAG------AMAVQVRERLIFLGAPAAH- 118 Query: 153 SERIYDADYGMDVDTI-RLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLA 211 RI AD + + + R+ + + KCG E L + N+ + N+GCA N+A Sbjct: 119 -VRIVGADPSLPPEPVLRVGFVRYEAEVPKCGQAWE-SLTATRDNKAYENFGCAVAANMA 176 Query: 212 AQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 AQV NP DL PR +TP DA +RD + +YR+ Sbjct: 177 AQVANPEDLVRPRDMTPADAGRRDTVMGKYRR 208 >gi|218508708|ref|ZP_03506586.1| pilus assembly protein [Rhizobium etli Brasil 5] Length = 150 Score = 135 bits (340), Expect = 5e-30, Method: Composition-based stats. Identities = 44/100 (44%), Positives = 58/100 (58%), Gaps = 1/100 (1%) Query: 143 ISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNY 202 G I Y A D IRLS+ + +CG WP+D+ N N+N+ N+ Sbjct: 36 RREGSRARKIINTSYAAAGAGDAAPIRLSFTGTTAITTQCGQWPKDI-SNDFANQNYYNF 94 Query: 203 GCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 GCA QNNLAAQV NP DL +PR +TP DA++R+ +IQ YR Sbjct: 95 GCATQNNLAAQVANPEDLVAPRGMTPIDAQRRNNAIQEYR 134 >gi|218516618|ref|ZP_03513458.1| pilus assembly protein [Rhizobium etli 8C-3] Length = 102 Score = 132 bits (332), Expect = 4e-29, Method: Composition-based stats. Identities = 42/87 (48%), Positives = 56/87 (64%), Gaps = 1/87 (1%) Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 Y A D IRLS+ + +CG WP+D+ N N+N+ N+GCA QNNLAAQV Sbjct: 1 SYAAAGAGDAAPIRLSFTGTTAITTQCGQWPKDI-SNDFANQNYYNFGCATQNNLAAQVA 59 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 NP DL +PR +TP DA++R+ +IQ YR Sbjct: 60 NPEDLVAPRGMTPIDAQRRNNAIQEYR 86 >gi|254293215|ref|YP_003059238.1| type IV pili component-like protein [Hirschia baltica ATCC 49814] gi|254041746|gb|ACT58541.1| Type IV pili component-like protein [Hirschia baltica ATCC 49814] Length = 218 Score = 126 bits (316), Expect = 3e-27, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 76/189 (40%), Gaps = 7/189 (3%) Query: 56 TSALAYYDEGSDYRD-RY-PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDS 113 SA + E D + I +++ +++ G G + + F+E + +D Sbjct: 17 VSACSQSQENIDIATGNHREIEVKEQTHYLEL-TELGVGHLTPADRRRVELFVENFADDG 75 Query: 114 ASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYF 173 + L P + RA+ IR I+ +G+ +I+ Y + + L+Y Sbjct: 76 YGPIVLSAPDGS---REAVRAITSIRSILSRAGVLPDNITVGGYQPA-AGNAAPLVLAYK 131 Query: 174 ASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQ 233 + + C E + + N + ++GCA N+A + P DL R + D+ + Sbjct: 132 SYQAHVPGCSTVNEHDWTDLRSNSSVGSFGCAVNENIAMMIAKPGDLLGERKIGDGDSSR 191 Query: 234 RDKSIQRYR 242 + ++YR Sbjct: 192 QLTVYEKYR 200 >gi|16127174|ref|NP_421738.1| pilus assembly protein CpaD [Caulobacter crescentus CB15] gi|221235975|ref|YP_002518412.1| pilus assembly protein CpaD [Caulobacter crescentus NA1000] gi|7208426|gb|AAF40193.1|AF229646_5 CpaD [Caulobacter crescentus CB15] gi|13424570|gb|AAK24906.1| pilus assembly protein CpaD [Caulobacter crescentus CB15] gi|220965148|gb|ACL96504.1| pilus assembly protein CpaD [Caulobacter crescentus NA1000] Length = 225 Score = 125 bits (315), Expect = 4e-27, Method: Composition-based stats. Identities = 40/160 (25%), Positives = 65/160 (40%), Gaps = 9/160 (5%) Query: 84 DIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIII 143 +I L + + + ++ A L + P ++A +IR + Sbjct: 54 EILLKPHASGLSANQSAALEALVSRWLAAEARELVVTAP----NTAGAMAI--EIRDRLA 107 Query: 144 SSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYG 203 G + A IR+ + + KCG E+ L + N + N+G Sbjct: 108 GLGAGARVRVVGVDPASAEEGA--IRVGFVRYEARPIKCGQRWEN-LAATRDNTVYDNFG 164 Query: 204 CAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 CA N+AAQV NP DL PR +TP D +RD + +YR+ Sbjct: 165 CAMAANIAAQVANPEDLMRPRDMTPADTGRRDTVLGKYRR 204 >gi|304320648|ref|YP_003854291.1| hypothetical protein PB2503_05377 [Parvularcula bermudensis HTCC2503] gi|303299550|gb|ADM09149.1| hypothetical protein PB2503_05377 [Parvularcula bermudensis HTCC2503] Length = 212 Score = 122 bits (306), Expect = 4e-26, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 64/160 (40%), Gaps = 4/160 (2%) Query: 71 RYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSAS 130 R+PI + + + IP+ + R + + F+ Y+ + + P+ T Sbjct: 33 RHPITVDQQAVTLTIPIDSTRSGLSRGDLQQLDRFVSAYRTKGYGPITVTAPAGTGRDLE 92 Query: 131 IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDML 190 +R + SG+ + I + + + +S+ +CG + + Sbjct: 93 ANETAAAVRAALNDSGVAYADIQGASVTSSTAKE---VMVSFVRYVAQGPQCGVFDNERA 149 Query: 191 GNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPD 230 + N + N+GC+ Q+NLAA + +P DL + T D Sbjct: 150 ARFR-NLSHPNFGCSSQHNLAAMIADPRDLTRAQSTTTRD 188 >gi|114798148|ref|YP_761690.1| pilus assembly protein CpaD [Hyphomonas neptunium ATCC 15444] gi|114738322|gb|ABI76447.1| pilus assembly protein CpaD [Hyphomonas neptunium ATCC 15444] Length = 236 Score = 115 bits (287), Expect = 7e-24, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 69/170 (40%), Gaps = 1/170 (0%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIR 132 PI + K + +++ + A GE+ I+ F+ Y L L +P + + Sbjct: 46 PIKVEKRTEFLEVSIDAYAGELSSSDRARIQDFMRGYVRRGHGPLVLSMPQVSSNPQLAV 105 Query: 133 RAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGN 192 AV + R I G+ IS + + L+Y + + A C N Sbjct: 106 AAVAEARAIAWDMGVEYQEISGTA-HGSGSSVSEPMILAYQSYEAIAPNCPPKSTVDFSN 164 Query: 193 AKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 N GC+ + NLAA +++P DL R + D +R+ ++++R Sbjct: 165 IDSNNQMETLGCSVRTNLAAMIIDPADLLGNRPLDRSDLARREVILEKFR 214 >gi|218662338|ref|ZP_03518268.1| pilus assembly protein [Rhizobium etli IE4771] Length = 79 Score = 113 bits (284), Expect = 2e-23, Method: Composition-based stats. Identities = 35/64 (54%), Positives = 47/64 (73%), Gaps = 1/64 (1%) Query: 179 AGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSI 238 +CG WP+D+ N N+N+ N+GCA QNNLAAQV NP DL +PR +TP DA++R+ +I Sbjct: 1 TTQCGQWPKDI-SNDFANQNYYNFGCASQNNLAAQVANPEDLVAPRGMTPIDAQRRNNAI 59 Query: 239 QRYR 242 Q YR Sbjct: 60 QEYR 63 >gi|83312841|ref|YP_423105.1| Type IV pili component [Magnetospirillum magneticum AMB-1] gi|82947682|dbj|BAE52546.1| Type IV pili component [Magnetospirillum magneticum AMB-1] Length = 236 Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 68/176 (38%), Gaps = 4/176 (2%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DYR +PI++ + + A + D + E+ A + + + + Sbjct: 30 DYRLSHPIVVEEKAAVALFARPAEGAALSDADRDRLGRLAEESARRGAGPIQISVGALPG 89 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISE-RIYDADYGMDVDTIRLSYFASKPSAGKCGFW 185 A + + + + G+ S+S DA + +R+ + A +CG + Sbjct: 90 EEAGALAFAQTLADTLRAWGVGPVSVSVAGGADAVPQPGIAQVRV--PVWEAKAPECGNF 147 Query: 186 PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 + N N +N+GC+ Q N A V NP DL R + D + +++Y Sbjct: 148 ERGLNPN-YSNAPHSNWGCSIQRNKALMVQNPADLVRARETSGRDGARATTVLEKY 202 >gi|329847254|ref|ZP_08262282.1| pilus Caulobacter type biogenesis lipoprotein CpaD family protein [Asticcacaulis biprosthecum C19] gi|328842317|gb|EGF91886.1| pilus Caulobacter type biogenesis lipoprotein CpaD family protein [Asticcacaulis biprosthecum C19] Length = 219 Score = 101 bits (252), Expect = 8e-20, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 52/119 (43%), Gaps = 4/119 (3%) Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 T + A I + + ++S+ + V +SY AS P G+ Sbjct: 84 TSPDPASMAAGNAIAGYLAGRDVSRDAVSQFSVQSQPVEIVTVNVVSYRASVPGCGQT-- 141 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 W + L + N N+GCA NLAAQV +P DL P TP DA ++ + +YR+ Sbjct: 142 W--ENLAATRKNTPHANFGCAITANLAAQVADPRDLVDPATATPSDAGRKSVVLDKYRR 198 >gi|46201038|ref|ZP_00207940.1| hypothetical protein Magn03010629 [Magnetospirillum magnetotacticum MS-1] Length = 227 Score = 96.2 bits (238), Expect = 3e-18, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 74/202 (36%), Gaps = 7/202 (3%) Query: 41 TLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHD 100 +L L + + G A DYR +PI + + + A + D Sbjct: 5 RAILATLVLMPVLAGCEADLAEH---DYRLSHPIAVEEKAAVALFARPAPGAPLSDIDRD 61 Query: 101 TIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDAD 160 + E+ A + + + + T A + + + + + G+ ++ Sbjct: 62 RLGRLAEESLRRGAGPIQITVGARTGEEADAQAFAQTLSDTMRAWGVGPVVVAVAGGADA 121 Query: 161 YGMDV-DTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLD 219 +R+ + A +CG + + L N +N+GC+ Q N A V NP D Sbjct: 122 VPQPGLAQVRV--PVWEAKAPECGTF-DRGLNPDYANAPHSNWGCSIQRNKALMVQNPAD 178 Query: 220 LFSPRMVTPPDAEQRDKSIQRY 241 L R + DA + + +++Y Sbjct: 179 LVRARDTSGRDANRANDVLEKY 200 >gi|315497464|ref|YP_004086268.1| pilus (caulobacter type) biogenesis lipoprotein cpad [Asticcacaulis excentricus CB 48] gi|315415476|gb|ADU12117.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Asticcacaulis excentricus CB 48] Length = 228 Score = 93.5 bits (231), Expect = 2e-17, Method: Composition-based stats. Identities = 36/151 (23%), Positives = 58/151 (38%), Gaps = 6/151 (3%) Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + + K L + T ++ RA IR + + V ++ Sbjct: 63 GLSDNQRRALDQVAR--KASWNGGEALDMTILTANTPDALRAGAAIRAYLNDHQVSVHAV 120 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 S+ G D + L + C E+ L + N N GCA NLAA Sbjct: 121 SQTT---AEGQPADVVSLITREYRAVVNDCNLEWEN-LAATRHNAAPQNLGCAINANLAA 176 Query: 213 QVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 Q+ +P D+ +P+ TP DA +R I +YR+ Sbjct: 177 QIDDPRDIAAPQPATPGDAGRRTVIIDKYRK 207 >gi|103486616|ref|YP_616177.1| hypothetical protein Sala_1128 [Sphingopyxis alaskensis RB2256] gi|98976693|gb|ABF52844.1| hypothetical protein Sala_1128 [Sphingopyxis alaskensis RB2256] Length = 213 Score = 84.7 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 46/207 (22%), Positives = 78/207 (37%), Gaps = 17/207 (8%) Query: 39 LRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPI 98 ++ + + L A + Y S P++ I + A GE+ Sbjct: 1 MKNIATWTVLALATTLAGCAGSAYSNRSLESVHQPVV---RNSIYQFDVAAKDGELPPSE 57 Query: 99 HDTIRG-FLEKYKNDSASVLFLLIPS--PTVSSASIRRAVKDIRKIIISSGIPVSSISER 155 ++G F + + PS S+ + RA+ + R +++S +PV++ + Sbjct: 58 QGRLQGWFDAMGIRYG-DRVAIEDPSLYGASSAQATVRAMVERRGLLLSKDVPVTTGAVP 116 Query: 156 IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 IR+ + S C W N+ N +NYGCA +NLAA V Sbjct: 117 DGH---------IRVVVTRASASVPGCPDWNSKSSLNSL-NATSSNYGCATNSNLAAMVA 166 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +P DL T D ++IQ YR Sbjct: 167 DPNDLIKGTRDTGHDPVAATRAIQTYR 193 >gi|149186256|ref|ZP_01864570.1| hypothetical protein ED21_31004 [Erythrobacter sp. SD-21] gi|148830287|gb|EDL48724.1| hypothetical protein ED21_31004 [Erythrobacter sp. SD-21] Length = 217 Score = 83.9 bits (206), Expect = 2e-14, Method: Composition-based stats. Identities = 44/215 (20%), Positives = 75/215 (34%), Gaps = 26/215 (12%) Query: 35 KNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEI 94 +N R L + L L G + Y + P ++ + ++D+ + Sbjct: 2 RNLNTRKLAGPLVVSLALALGACGGTNMSNRTLYSVKQP-VVERSNYVLDL--NTTAEGL 58 Query: 95 KYPIHDTIRG-FLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKI---IISSGIPVS 150 + G F + + +S ++R V +I +++ G PV+ Sbjct: 59 PVSEQQRLTGWFESMGLRYG-DRVAID---DGSNSLAVRDDVAEIASRYGILVAEGAPVT 114 Query: 151 SISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNL 210 + + A R+ S S C W + GN NYGCA +NL Sbjct: 115 AGNLGPGQA---------RVVITRSTASVPGCPDWSH-TVEANDGNATNPNYGCATYSNL 164 Query: 211 AAQVVNPLDLFSPR---MVTPPDAEQRDKSIQRYR 242 A+ V NP DL + T K+I+ YR Sbjct: 165 ASMVANPEDLVQGQQGTGETIVTTS--TKAIEAYR 197 >gi|296284553|ref|ZP_06862551.1| hypothetical protein CbatJ_13041 [Citromicrobium bathyomarinum JL354] Length = 214 Score = 81.6 bits (200), Expect = 9e-14, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 72/203 (35%), Gaps = 19/203 (9%) Query: 44 LGQLFFLLLFYGTSALAYYD-EGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTI 102 + L G A D S Y P ++ + +D + AG G + P + Sbjct: 8 AAAMLTSALVLGLGGCAAPDFNRSLYSLNQP-VVERTHYTLD--VAAGPGGLPIPEQRRL 64 Query: 103 RGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 G+ E D + + + + ++ + + + +++S PV+ Sbjct: 65 AGWFEALDLDYGDRVAVEADTASPATIAAVAGIVERYGLLLSDQAPVTEGFVEP------ 118 Query: 163 MDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFS 222 R+ S + C W +D N NYGCA +N AA V NP DL Sbjct: 119 ---GRTRVVVTRSVATVPTCPNWTDD-GDGNFANATSRNYGCATNSNYAAMVANPEDLVR 174 Query: 223 P---RMVTPPDAEQRDKSIQRYR 242 R + + R +I+ YR Sbjct: 175 GQSSRGGSSVNGSNR--AIEAYR 195 >gi|326388863|ref|ZP_08210445.1| hypothetical protein Y88_3607 [Novosphingobium nitrogenifigens DSM 19370] gi|326206463|gb|EGD57298.1| hypothetical protein Y88_3607 [Novosphingobium nitrogenifigens DSM 19370] Length = 233 Score = 75.8 bits (185), Expect = 5e-12, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 66/205 (32%), Gaps = 13/205 (6%) Query: 39 LRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPI 98 LR L + L + + ++ + D+P L G I+ Sbjct: 22 LRRLASATIVG-GLALALAGCGGMPTNRSLESAHQPVIERTNYTFDVPTLP-DGGIEPAQ 79 Query: 99 HDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYD 158 + + + + P+ + + + AV R ++ +G + Sbjct: 80 LRRLSDWFSALGLKFGDRVSIDDPASSAVTRAAVEAVMS-RFGLMLNGAAPVT------- 131 Query: 159 ADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPL 218 + + T+R+ + S C W + N N+GCA N+AA V N Sbjct: 132 -EGALAGGTVRIVVSRTYASVPGCPDW-KARSDANFNNATSRNFGCATNANMAAMVANKE 189 Query: 219 DLFSPR-MVTPPDAEQRDKSIQRYR 242 DL + V K+I YR Sbjct: 190 DLVHGQTGVGDTVVMSNTKAIDTYR 214 >gi|332188354|ref|ZP_08390079.1| pilus biogenesis CpaD family protein [Sphingomonas sp. S17] gi|332011583|gb|EGI53663.1| pilus biogenesis CpaD family protein [Sphingomonas sp. S17] Length = 230 Score = 75.1 bits (183), Expect = 7e-12, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 69/177 (38%), Gaps = 20/177 (11%) Query: 72 YPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASI 131 + ++ + + +D+ L R + G+ + + + P+ Sbjct: 32 HQPVVSQADYALDLALSGNR--LAGDEAQRFEGWARNLQLGYGDRVTIEDPAGD-----A 84 Query: 132 RRAVKDIRKIIISSGI---PVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPED 188 A + I +++ G+ P +SI A + TIR+ ++ + C W D Sbjct: 85 PGAYRQIAEMVGRYGLLVGPAASI------ARAPLAPATIRVVVTRARATVPGCPDWRSD 138 Query: 189 MLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTP--PDAEQRDKSIQRYRQ 243 + + GN + +N+GCA NLAA V +P L P D ++I +RQ Sbjct: 139 VAPDWVGNTS-SNHGCAINRNLAAMVADPTHLVHG-AEGPASADPAAATRAINIFRQ 193 >gi|85373130|ref|YP_457192.1| hypothetical protein ELI_01515 [Erythrobacter litoralis HTCC2594] gi|84786213|gb|ABC62395.1| hypothetical protein ELI_01515 [Erythrobacter litoralis HTCC2594] Length = 188 Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 61/182 (33%), Gaps = 16/182 (8%) Query: 63 DEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRG-FLEKYKNDSASVLFLLI 121 + Y + P ++ + +D + G G + P + G F + + Sbjct: 3 QNRTMYSTKQP-VVERTNYTLD--VRTGPGGLSIPEQQRLSGWFEAMNLRYG-DRVSIED 58 Query: 122 PSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGK 181 P + S+ + I++S G PV+S A R+ S S Sbjct: 59 PMLSGSTKDAISQLAGRHGILVSDGAPVTSGYVEPGSA---------RVVITRSSASVPG 109 Query: 182 CGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP-RMVTPPDAEQRDKSIQR 240 C W N YGCA NLAA V +P DL + K+++ Sbjct: 110 CPDWSV-KSEMNYTNGTHPGYGCAINGNLAAMVADPEDLVKGDEGSGETYVRRGSKAVEA 168 Query: 241 YR 242 YR Sbjct: 169 YR 170 >gi|288956970|ref|YP_003447311.1| hypothetical protein AZL_001290 [Azospirillum sp. B510] gi|288909278|dbj|BAI70767.1| hypothetical protein AZL_001290 [Azospirillum sp. B510] Length = 234 Score = 72.7 bits (177), Expect = 4e-11, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 72/211 (34%), Gaps = 21/211 (9%) Query: 42 LMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDT 101 L L ++ G + Y E + + + + + E + L+G + + Sbjct: 12 ASLAGLLASVVLAGCTPTPLYQENTAVQQQLTVDVATAETTLS--PLSGARPLDGASVER 69 Query: 102 IRGFLEKYKNDSAS-VLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDAD 160 +R F+ + + + + + T VK + ++ ++G+P IS Sbjct: 70 LRRFVME---EGNPYAVRIRL---TPGDRVADGVVKRVIAVLTAAGVPGRGISV---SMV 120 Query: 161 YGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNW---------TNYGCAYQNNLA 211 + +RL + C D L + + + GCA NL Sbjct: 121 NRPVGNELRLRRETATAVLPDCPPLNRDTLLDRENDTPLDLGRAPPPPPRLGCATTANLG 180 Query: 212 AQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 + +P DL P+ P + ++ RYR Sbjct: 181 LMLADPRDLIEPQRSGPASGALAEDTVGRYR 211 >gi|85707701|ref|ZP_01038767.1| hypothetical protein NAP1_00660 [Erythrobacter sp. NAP1] gi|85689235|gb|EAQ29238.1| hypothetical protein NAP1_00660 [Erythrobacter sp. NAP1] Length = 214 Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 61/207 (29%), Gaps = 23/207 (11%) Query: 43 MLGQLFFLLLFYGTSALAYYDEGSD-YRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDT 101 L L L + + Y + P++ + + R + Sbjct: 8 KLMGAVALSLGLAVAGCGGMATNTSLYSLKQPVV---ERTNFAMDVNTNRSGLSISEQQR 64 Query: 102 IRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADY 161 + G+ E + + PS ++ AV ++ + Sbjct: 65 LNGWFETMDLRYGDRVAIEDPSSNP---AVAEAVNELAGR--------YGLIVTEVAPTT 113 Query: 162 GMDVDT--IRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLD 219 + R+ S S C W N + YGCA +N+AA + NP D Sbjct: 114 SGTLAPGQARVVITRSDASVPGCPDWST-KSDMNYNNASSPGYGCAINSNMAAMIANPED 172 Query: 220 LFSPR---MVTPPDAEQRDKSIQRYRQ 243 L + T R +I+ YR+ Sbjct: 173 LLEGQKGSGETVIATSNR--AIRTYRE 197 >gi|307293451|ref|ZP_07573297.1| Pilus biogenesis CpaD-related protein [Sphingobium chlorophenolicum L-1] gi|306881517|gb|EFN12733.1| Pilus biogenesis CpaD-related protein [Sphingobium chlorophenolicum L-1] Length = 218 Score = 68.1 bits (165), Expect = 9e-10, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 52/176 (29%), Gaps = 16/176 (9%) Query: 72 YPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASI 131 + ++ D+ G + + + L + + ++ Sbjct: 39 HQPVVSHTAFTYDVQAGP-DGGLTPLEARRLDDWFVS-IGLGYGDQVALATDASYYAPAL 96 Query: 132 RRAVKDIRKIIISSGIPVSSISER--IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDM 189 R + DI + + A +RL + C W D Sbjct: 97 REGIADI--------VARHGMLVGEDSSAAAGSAPQGAVRLIVRRATARVPGCPDW-SDK 147 Query: 190 LGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPD--AEQRDKSIQRYRQ 243 + N+GC NLAA + NP DL + T D +++I YR+ Sbjct: 148 PETDQQLGASANFGCGVNGNLAAMIANPEDLVRGQT-TDSDLRTATSNRAISTYRE 202 >gi|294012434|ref|YP_003545894.1| Flp pilus assembly protein CpaD [Sphingobium japonicum UT26S] gi|292675764|dbj|BAI97282.1| Flp pilus assembly protein CpaD [Sphingobium japonicum UT26S] Length = 222 Score = 67.7 bits (164), Expect = 1e-09, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 61/213 (28%), Gaps = 17/213 (7%) Query: 35 KNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEI 94 K L + LG L + L + + ++ D+ G + Sbjct: 7 KTLLLFGVRLGTLALMALPLSAC-QTDQAANRGVQSVHQPVVSNAAFTYDVQAGP-DGGL 64 Query: 95 KYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISE 154 + + + S ++R + D+ + + Sbjct: 65 TASEARRLDDWFVS-IGLGYGDQVAIATDAGYYSPALREGIADV--------VARHGMLV 115 Query: 155 R--IYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 +RL + S C W D + TN+GC NLAA Sbjct: 116 GEDGTAVAGAAPQGAVRLIVRRAIASVPGCPDW-SDKPETDQQLGTSTNFGCGVNGNLAA 174 Query: 213 QVVNPLDLFSPRMVTPPD--AEQRDKSIQRYRQ 243 + NP DL + T D +++I YR+ Sbjct: 175 MIANPEDLVRGQT-TDSDLRTATSNRAISTYRE 206 >gi|87199100|ref|YP_496357.1| hypothetical protein Saro_1078 [Novosphingobium aromaticivorans DSM 12444] gi|87134781|gb|ABD25523.1| hypothetical protein Saro_1078 [Novosphingobium aromaticivorans DSM 12444] Length = 223 Score = 67.3 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 66/200 (33%), Gaps = 11/200 (5%) Query: 44 LGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 F L L SA + E + ++ + + D+ L G G + + Sbjct: 9 TASAFVLSLGIALSACSSVPENRMLTSVHQPVVERNHFVFDLETLPG-GGLSITEQRRLA 67 Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 G+ E + + P + ++ S AV ++++ G Y A Sbjct: 68 GWFESLGLKYGDKIAVDDPLQSKATLSAVDAVAGRWGLMLADGAAP---VTPGYVAPGA- 123 Query: 164 DVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 +R+ + S C W E N TN+GCA +NLAA V + L Sbjct: 124 ----VRVVVTRATASVPGCPSW-EYNSDMTLNNHTSTNFGCAVNSNLAAMVADKEHLIQG 178 Query: 224 R-MVTPPDAEQRDKSIQRYR 242 K+I YR Sbjct: 179 ASGTGETVVMSSTKAIDSYR 198 >gi|149912020|ref|ZP_01900614.1| putative lipoprotein [Moritella sp. PE36] gi|149804919|gb|EDM64953.1| putative lipoprotein [Moritella sp. PE36] Length = 215 Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats. Identities = 40/197 (20%), Positives = 77/197 (39%), Gaps = 16/197 (8%) Query: 46 QLFFLLLFYGT-SALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRG 104 ++F +L+ G SA A D R P + + I L + + Sbjct: 10 RIFSILIMCGLLSACAI-----DSVQRQPQVKVEAV-THKIALQLDTEALTKSDKTALNE 63 Query: 105 FLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMD 164 F+ ++ S L + I S + + V + ++ ++G+ S ++ ++ ++ D Sbjct: 64 FI--FQRGELSALRIRIDSYSDKGTNA---VPALIALLKNAGVYPSQVTSQLSESSSTAD 118 Query: 165 VDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPR 224 + I SY + P C E + N + N+GCA + LA V P DL R Sbjct: 119 IALIVESYRSIVP---NCHAGKESHTVLNEFNSS-PNFGCANASALAQMVATPRDLIVGR 174 Query: 225 MVTPPDAEQRDKSIQRY 241 + D + +++ Y Sbjct: 175 TLDATDGRKAVATVEAY 191 >gi|227326303|ref|ZP_03830327.1| putative lipoprotein [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 228 Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats. Identities = 49/233 (21%), Positives = 79/233 (33%), Gaps = 37/233 (15%) Query: 29 LKTIFWKNFFLRTL--MLGQLFFLLLFYGTSALAYYDEGSDYRDR-------YPILMRKV 79 +KTI LR L + L ++L G +D R + PI ++ Sbjct: 1 MKTINNSYPLLRPLHMRVAVLTAVVLLAGCGWNKPI---NDVRMQRFDQPGLQPIAVQ-- 55 Query: 80 EQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKND------SASVLFLLIPSPTVSSASIRR 133 V +PLL RGFL + L + SAS + Sbjct: 56 PSSVSVPLLVAPNG---------RGFLPESLRQLNIMLKDQGRLSAQTLTLIPHSASGEQ 106 Query: 134 AVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNA 193 + ++ ++G ++ + G + D +S A +C N Sbjct: 107 MAGRLVTVLKNAGANPQNVKQMRRSTASGQNGDLEVIS-EALVVKTTRCTI----NDPNQ 161 Query: 194 KGNRNWT---NYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 + + + GCA QNNLA V P DL + + D SI+RY + Sbjct: 162 LMVKPYEAMGSLGCATQNNLAMMVAEPRDLIQAKALDSADGVAAVNSIERYHK 214 >gi|94497255|ref|ZP_01303827.1| hypothetical protein SKA58_13673 [Sphingomonas sp. SKA58] gi|94423360|gb|EAT08389.1| hypothetical protein SKA58_13673 [Sphingomonas sp. SKA58] Length = 215 Score = 65.8 bits (159), Expect = 5e-09, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 60/178 (33%), Gaps = 21/178 (11%) Query: 72 YPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRG-FLEKYKNDSASVLFLLIPSPTVSSAS 130 + ++ D+ +G + + F+ V + + Sbjct: 35 HQPVVSHAAYTFDVMAGSGDT-LPPAEAARLNDWFVSIGLGYGDQVAIV-------NDGY 86 Query: 131 IRRAVKD-IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDM 189 A+++ I ++ G+ + E +IRL + S C W Sbjct: 87 YGPALREGIANVVARHGL---LVGEDSSAIAGAAPQGSIRLIVRRATASVPGCPDWSAKQ 143 Query: 190 LGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRD----KSIQRYRQ 243 + +N+GC +NLAA V NP DL + D++ R ++I YR+ Sbjct: 144 ESEMTLGTS-SNFGCGVNSNLAAMVANPEDLVRGQS---SDSDLRTATSNRAISTYRE 197 >gi|84386796|ref|ZP_00989821.1| putative lipoprotein [Vibrio splendidus 12B01] gi|84378324|gb|EAP95182.1| putative lipoprotein [Vibrio splendidus 12B01] Length = 207 Score = 65.8 bits (159), Expect = 6e-09, Method: Composition-based stats. Identities = 36/148 (24%), Positives = 57/148 (38%), Gaps = 9/148 (6%) Query: 94 IKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 + I F+ + L L+ ++ + +R +I SG+ S IS Sbjct: 46 LSAQEKADISDFIAR-----RGTLSNLMVKIENTTQKGESQSEKVRLRLIESGLYPSQIS 100 Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQ 213 A D+ TI + + +K +A G P L + R N+GCA N LA Sbjct: 101 VSDTAAQGKGDI-TIFVESYRAKVTACDAGKTPRTTLNAYRTQR---NFGCANANALAQM 156 Query: 214 VVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 V NP DL + + ++ SI Y Sbjct: 157 VANPKDLIVGQPIDSAQGQKAVSSIDNY 184 >gi|253687069|ref|YP_003016259.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753647|gb|ACT11723.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 228 Score = 65.4 bits (158), Expect = 6e-09, Method: Composition-based stats. Identities = 46/226 (20%), Positives = 74/226 (32%), Gaps = 35/226 (15%) Query: 34 WKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDR-------YPILMRKVEQIVDIP 86 + F L L + L G +D R + PI ++ V +P Sbjct: 8 YPPFRPLPLRAAVLTAVFLLAGCGWNKPI---NDVRMQRFDQPALQPIAVQ--PSSVSVP 62 Query: 87 LLAGRGEIKYPIHDTIRGFLEKYKND------SASVLFLLIPSPTVSSASIRRAVKDIRK 140 LLA RGFL + L + SAS + + Sbjct: 63 LLAAPNG---------RGFLPESLKQLNIMLKDQGRLSAQTITLIPHSASGEQMAGRLAT 113 Query: 141 IIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWT 200 ++ ++G ++ + G + D +S A +C N + + Sbjct: 114 VLKNAGANPQNVKQMRRSTASGQNGDLEVIS-EALVVKTTRCTI----NDPNQLMVKPYE 168 Query: 201 ---NYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 + GCA QNNLA V P DL + + D SI+RY + Sbjct: 169 AIGSLGCATQNNLAMMVAEPRDLIQAKALDDADGVAAVNSIERYHK 214 >gi|227114877|ref|ZP_03828533.1| putative lipoprotein [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 228 Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 70/214 (32%), Gaps = 23/214 (10%) Query: 40 RTLMLGQLFFLLLFYGTSALAYYDEGSDYRDR-------YPILMRKVEQIVDIPLLAGRG 92 R + L ++L G +D R + PI ++ V + + Sbjct: 14 RHMRAAVLTAVVLLAGCGWNKPI---NDVRMQRFDQPGLQPIAVQPSSVSVPLLVAPNGR 70 Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + L+ SA L LIP T R ++ ++G ++ Sbjct: 71 GFLPESLKQLNIMLKDQGRLSAQTLT-LIPHSTSGEQMAGRLA----TVLKNAGANPQNV 125 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWT---NYGCAYQNN 209 + G + D +S A +C N + + + GCA QNN Sbjct: 126 KQMRRSTASGQNGDLEVIS-EALVVKTTRCTI----NDPNQLMVKPYEAIGSLGCATQNN 180 Query: 210 LAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 LA V P DL + + D SI+RY + Sbjct: 181 LAMMVAEPRDLIQAKALDSADGVAAVNSIERYHK 214 >gi|261820215|ref|YP_003258321.1| pilus biogenesis lipoprotein CpaD [Pectobacterium wasabiae WPP163] gi|261604228|gb|ACX86714.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Pectobacterium wasabiae WPP163] Length = 228 Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 71/218 (32%), Gaps = 27/218 (12%) Query: 39 LRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDR-------YPILMRKVEQIVDIPLLAGR 91 LR L + +F S + +D R + P+ ++ V +PLL Sbjct: 11 LRRLHISAAVLTAVFL-LSGCGWNKPINDVRMQRFDQPALQPVAVQ--PSSVSVPLLVAP 67 Query: 92 GEIKYPIHDTIRGFLEKYKND------SASVLFLLIPSPTVSSASIRRAVKDIRKIIISS 145 RGFL + L + SAS + + ++ ++ Sbjct: 68 NG---------RGFLPESLRQLNIMLKDQGRLSAQTLTLIPHSASGEQMAGRLATVLKNA 118 Query: 146 GIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCA 205 G ++ + G D +S + P ++ + GCA Sbjct: 119 GADAQNVKQMRRSTASGQTGDLEVISEALVVKTTRCTINDPNQLMVKPFDGIGY--LGCA 176 Query: 206 YQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 QNNLA V P DL + + D SI+RY + Sbjct: 177 TQNNLAMMVAEPRDLIQAKALDNADGVAAVNSIERYHK 214 >gi|148557760|ref|YP_001265342.1| hypothetical protein Swit_4867 [Sphingomonas wittichii RW1] gi|148502950|gb|ABQ71204.1| hypothetical protein Swit_4867 [Sphingomonas wittichii RW1] Length = 252 Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 74/203 (36%), Gaps = 22/203 (10%) Query: 43 MLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTI 102 +LG L + L + Y+ G + ++++ + ++D+P + D + Sbjct: 50 ILGTLGIIALAATPALADRYNRG--VESVHQPVVQRSDYVLDVP----ADGLDPAARDRV 103 Query: 103 RGFLEKYKNDSAS---VLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDA 159 + + + + + + +DI +I G+ V S + A Sbjct: 104 ---GQWFDAIGLGYGDRIAIDTSAGG------TGSNRDIAEIAGRYGLFVGSAAPMTEGA 154 Query: 160 DYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLD 219 V + AS P W + A +NYGCA + LAA V NP D Sbjct: 155 IAPGHVRIVVSRSTASVPGCPDYSQWSQPNFTAAAS----SNYGCAINSTLAAMVANPED 210 Query: 220 LFSPRMVTPPDAEQRDKSIQRYR 242 L + +A+ K+I+ +R Sbjct: 211 LVKGQAARGSNADTATKAIRIWR 233 >gi|260778161|ref|ZP_05887054.1| putative lipoprotein [Vibrio coralliilyticus ATCC BAA-450] gi|260606174|gb|EEX32459.1| putative lipoprotein [Vibrio coralliilyticus ATCC BAA-450] Length = 208 Score = 62.7 bits (151), Expect = 4e-08, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 57/177 (32%), Gaps = 12/177 (6%) Query: 66 SDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSAS-VLFLLIPSP 124 SD R P + V + L + + IR F+ L + + S Sbjct: 19 SDQIARQP-AIDVVSVTNKLTLAVEKQSLTPQQQQDIRSFI---VQRGNPYSLRVKLVSY 74 Query: 125 TVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 + S +K I +++ G+ I DV I S+ A P G Sbjct: 75 SPKGQSQ---IKPISNLLLGQGLAKHQIMTERATGTQSGDVQVIVESFRAKVPGCGTDKS 131 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 P + YGC+ LA V NP DL + P + + +I Y Sbjct: 132 QP----VIFNQYKTHQAYGCSNAAALAQMVANPKDLVVGEKLGPTNGAKAVAAIDAY 184 >gi|261251597|ref|ZP_05944171.1| putative lipoprotein [Vibrio orientalis CIP 102891] gi|260938470|gb|EEX94458.1| putative lipoprotein [Vibrio orientalis CIP 102891] Length = 206 Score = 62.0 bits (149), Expect = 7e-08, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 66/200 (33%), Gaps = 32/200 (16%) Query: 49 FLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDI-------PLLAGRGEIKYPIHDT 101 L+L G S+ PI + +D+ L + Sbjct: 8 ALVLLGGCSSS-------------PI---EKAPALDVVSVTNKLTLAMQSSSLSAKQSKA 51 Query: 102 IRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADY 161 + +++ K + L+ S S +R + I ++I GI +I + Sbjct: 52 VEQLVQR-KGSPYGLNVKLVSL----SKSGQRGLNQIEALLIDQGIAAKNIHREHLAENG 106 Query: 162 GMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLF 221 DV + S+ A P + + + + K ++GCA LA V NP DL Sbjct: 107 KGDVQILIESFKAKVPKC-QAQKYSNNFINRYKE---HPSFGCATSVALAQMVANPKDLV 162 Query: 222 SPRMVTPPDAEQRDKSIQRY 241 + + + +I Y Sbjct: 163 VGEQLGATNGAKAVATIDGY 182 >gi|19749314|gb|AAL98684.1| Y4xK [Sinorhizobium fredii] Length = 188 Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 68/191 (35%), Gaps = 27/191 (14%) Query: 53 FYGTSALAY-YDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKN 111 G ++ A Y E S PI +R+ ++ + + + FL K Sbjct: 16 LAGCTSTAPIYVEQST-----PIFVRQESTVLKL------ESLHASEQQRLLAFLWKASR 64 Query: 112 DSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLS 171 L L+I + SS AV R++ GI S+I D +R+ Sbjct: 65 GRRDALHLVI---SGSSRLSAEAVHQARQM----GIGASNI-----HLLDQNDRGQLRVE 112 Query: 172 YFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 C +L + ++ GC+ +NLA + +P DL R V P D Sbjct: 113 AVVYHALPPICRSLSSQLLNDEFFDQP---IGCSTSHNLAVMINDPRDLLGNRFVKPSDG 169 Query: 232 EQRDKSIQRYR 242 ++ + YR Sbjct: 170 DRAAIPVTTYR 180 >gi|251789647|ref|YP_003004368.1| pilus biogenesis lipoprotein CpaD [Dickeya zeae Ech1591] gi|247538268|gb|ACT06889.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Dickeya zeae Ech1591] Length = 223 Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 49/151 (32%), Gaps = 7/151 (4%) Query: 93 EIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + ++ L + S L + T +A + + + ++G S I Sbjct: 66 GLDADSLASLNQLLNQQGRVSKQTLTI-----TPWTARGEQIASRLANALENAGADKSRI 120 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 D LS A C +L + GCA Q+NLA Sbjct: 121 RVMTRVPAVNQSGDLQVLS-QALAARVPACQVNDAGLLMVKPFDAVGY-LGCANQSNLAQ 178 Query: 213 QVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 V P DL R + D SI+RY+Q Sbjct: 179 MVAEPRDLIQARSLDAADGVNMVNSIERYQQ 209 >gi|312883755|ref|ZP_07743474.1| hypothetical protein VIBC2010_14179 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368504|gb|EFP96037.1| hypothetical protein VIBC2010_14179 [Vibrio caribbenthicus ATCC BAA-2122] Length = 206 Score = 60.0 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 72/201 (35%), Gaps = 34/201 (16%) Query: 49 FLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDI--------PLLAGRGEIKYPIHD 100 L+L G S+ PI + +D+ ++ G + Sbjct: 8 ALVLLGGCSSS-------------PI---ERAPALDVVSVTNKLTFVMQGSF-LSAEQSK 50 Query: 101 TIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDAD 160 + F+++ K + L+ S S +R++ + ++I GI V +I + Sbjct: 51 AVEQFVQR-KGSPYGLNVKLVSL----SKSGQRSLDQVEALLIEQGIAVKNIHREHLTKE 105 Query: 161 YGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDL 220 DV + S+ A P P + N+ ++ ++GCA L V NP DL Sbjct: 106 GKGDVQILIESFKAKIPKCRVKK--PSNTFINSY--KSHPSFGCATSVALGQMVANPKDL 161 Query: 221 FSPRMVTPPDAEQRDKSIQRY 241 + + + +I+ Y Sbjct: 162 VVGEQLGATNGAKAVATIEGY 182 >gi|294139871|ref|YP_003555849.1| hypothetical protein SVI_1100 [Shewanella violacea DSS12] gi|293326340|dbj|BAJ01071.1| hypothetical protein [Shewanella violacea DSS12] Length = 208 Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 68/177 (38%), Gaps = 13/177 (7%) Query: 67 DYRDRYP-ILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT 125 D R P I + V + + LLA + + FL + + L + I + + Sbjct: 19 DPIQRQPNIAIEAVTHVFSLRLLAET--LDGEDRAALEEFLL--RRGDPANLRVRIETHS 74 Query: 126 VSSASIRRAVKDIRKIIISSGIPVSSI-SERIYDADYGMDVDTIRLSYFASKPSAGKCGF 184 + + + +R ++ I S I + + A D+ + SY G Sbjct: 75 LRG---EKVLTSVRDLMHLRNIYPSQIRTLKRESASTSEDLTLVVESYRTLVRHCDA-GK 130 Query: 185 WPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 P+ +L + K + N+GCA + LA V NP DL ++ + + ++ Y Sbjct: 131 EPKTILNSFKRS---ANFGCANASALAQMVANPRDLVVGETLSATEGRKAVSTLDAY 184 >gi|50119735|ref|YP_048902.1| putative lipoprotein [Pectobacterium atrosepticum SCRI1043] gi|49610261|emb|CAG73704.1| putative lipoprotein [Pectobacterium atrosepticum SCRI1043] Length = 228 Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 74/227 (32%), Gaps = 25/227 (11%) Query: 29 LKTIFWKNFFLRTL--MLGQLFFLLLFYGTSALAYYDEGSDYRDR-------YPILMRKV 79 +KTI + R L L + L G +D R + PI ++ Sbjct: 1 MKTINSNHPPFRPLHVRAAVLTAVFLLAGCGWNKPI---NDVRMQRFDQPALQPIAVQPS 57 Query: 80 EQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIR 139 V + + + L+ SA L L+ SAS + + Sbjct: 58 SVSVPLLVAPNGRGFLPESLKQLNIMLKDQGRLSAQTLTLI-----PHSASGEQMAGRLA 112 Query: 140 KIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNW 199 ++ ++G ++ + G D +S A +C N + + Sbjct: 113 TVLKNAGANAQNVKQMRRSTASGQTGDLEVIS-EALVVKTTRCTI----NDPNQLMVKPY 167 Query: 200 T---NYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 GCA QNNLA V P DL + + D SI+RY + Sbjct: 168 EAIGTLGCATQNNLAMIVAEPRDLIQAKALDGADGVAAVNSIERYHK 214 >gi|13475295|ref|NP_106859.1| hypothetical protein mlr8762 [Mesorhizobium loti MAFF303099] gi|14026046|dbj|BAB52645.1| mlr8762 [Mesorhizobium loti MAFF303099] Length = 186 Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats. Identities = 34/170 (20%), Positives = 61/170 (35%), Gaps = 23/170 (13%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIR 132 PIL+R ++ + ++ +R FL++ + L LLI + Sbjct: 32 PILIRPETTVLIL------KSLRASERQRLRVFLDRTSSGRRDALHLLI-WGS------S 78 Query: 133 RAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGN 192 R ++ GI +I D +++ C + +L + Sbjct: 79 RLSAEVVHQARQMGIDTYNI-----HLLDQHDGGAVQVEAIVYHARPPACPSYS--LLSD 131 Query: 193 AKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 + GC+ + NLAA V +P DL + V P D E+ + YR Sbjct: 132 KSFEKP---LGCSTRRNLAAMVNDPRDLLDNQAVEPSDGERAAIPVATYR 178 >gi|237678813|emb|CAQ57540.1| hypothetical protein [Bradyrhizobium elkanii] Length = 221 Score = 57.7 bits (138), Expect = 1e-06, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 15/153 (9%) Query: 92 GEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSS 151 ++ +R F+ L + + + + R + + + GI + Sbjct: 46 QSLRGSERHRLRNFIAHASGGRRDALHIDV-TGSP------RLIAQVAHEARAMGIAPYN 98 Query: 152 ISERIYDADYGMDVDTIRL---SYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQN 208 I D +R+ ++ A P + N GC+ +N Sbjct: 99 IRLAASPIDLPARFG-VRIEAITFEAHPPVCPSLSI----VGPAVNDNSFDPTLGCSTRN 153 Query: 209 NLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 NLA V +P+DL R V ++ + Y Sbjct: 154 NLAVMVNDPIDLLDNRSVMTSSGDRAASPLASY 186 >gi|148976306|ref|ZP_01813030.1| putative lipoprotein [Vibrionales bacterium SWAT-3] gi|145964400|gb|EDK29655.1| putative lipoprotein [Vibrionales bacterium SWAT-3] Length = 195 Score = 57.7 bits (138), Expect = 1e-06, Method: Composition-based stats. Identities = 33/148 (22%), Positives = 57/148 (38%), Gaps = 9/148 (6%) Query: 94 IKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 + I F+ + L L+ T ++ ++ +R +I SG+ S I Sbjct: 34 LSDQEKADISDFMSR-----RGALNNLMVKITKTTLKGESQIEKVRLHLIESGLYPSQIW 88 Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQ 213 + D+ + SY A + G P + + R N+GCA N LA Sbjct: 89 VADEATEGKGDITILVESYRAKVTACDA-GKTPRTTVNAYRTQR---NFGCANANALAQM 144 Query: 214 VVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 V NP DL + ++ ++ SI+ Y Sbjct: 145 VANPKDLIVGQPISGTQGQKAVSSIENY 172 >gi|323493502|ref|ZP_08098624.1| hypothetical protein VIBR0546_14315 [Vibrio brasiliensis LMG 20546] gi|323312325|gb|EGA65467.1| hypothetical protein VIBR0546_14315 [Vibrio brasiliensis LMG 20546] Length = 206 Score = 57.3 bits (137), Expect = 2e-06, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 74/200 (37%), Gaps = 32/200 (16%) Query: 49 FLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGR-------GEIKYPIHDT 101 L+L G ++ PI + +D+ + + + + Sbjct: 8 ALVLLGGCTST-------------PI---EKAPALDVVSVTNKLTLKLNGATLSSKQSEG 51 Query: 102 IRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADY 161 I+ F E+ K + V L+ SA + +++ I +++I+ GI ++I Sbjct: 52 IQQFFER-KGLAYGVKVKLV----SYSAIGKSSLEQIEQLLIAQGIAANNIQRVDSKEQS 106 Query: 162 GMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLF 221 DV + S+ +P + + L + ++GCA LA V NP DL Sbjct: 107 QGDVQILAESFKVKQPKC-HVQKYSNNFLNRY---KQHPSFGCANSVALAQMVANPKDLV 162 Query: 222 SPRMVTPPDAEQRDKSIQRY 241 + P + + SI+ Y Sbjct: 163 VGEKLGPTNGAKAVSSIEGY 182 >gi|319940441|ref|ZP_08014786.1| hypothetical protein HMPREF9464_00005 [Sutterella wadsworthensis 3_1_45B] gi|319806067|gb|EFW02816.1| hypothetical protein HMPREF9464_00005 [Sutterella wadsworthensis 3_1_45B] Length = 274 Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 48/139 (34%), Gaps = 17/139 (12%) Query: 115 SVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSY-- 172 + + + S + + + + + +G ++ + + D R+S+ Sbjct: 126 GPIRRQVLTVVPLSERGEKLARRLAEALEEAGAQEPKLAVYVNEKTGKRDFPDRRVSWDL 185 Query: 173 ----FASKPSAGKC-----GFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 A A +C W + GCA N+A V +P DL P Sbjct: 186 ELISEAYVVHAPECEVADPTRWTIEPYEAVGT------LGCANNANIAMMVSDPKDLLRP 239 Query: 224 RMVTPPDAEQRDKSIQRYR 242 R + D + ++Q+Y+ Sbjct: 240 RALEGADGTASNLAVQKYQ 258 >gi|16520027|ref|NP_444147.1| conserved putative lipoprotein of 20.5 kDa [Sinorhizobium fredii NGR234] gi|2496775|sp|P55703|Y4XK_RHISN RecName: Full=Uncharacterized lipoprotein y4xK; Flags: Precursor gi|2182719|gb|AAB91934.1| conserved putative lipoprotein of 20.5 kDa [Sinorhizobium fredii NGR234] Length = 188 Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 66/191 (34%), Gaps = 27/191 (14%) Query: 53 FYGTSALAY-YDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKN 111 G ++ A Y E PI +R+ ++ + + FL K Sbjct: 16 LAGCTSTAPIYVEQPT-----PIFVRQESTVLKL------ESFHASEQQRLLAFLWKASR 64 Query: 112 DSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLS 171 L L+I + SS AV R++ GI S+I D +R+ Sbjct: 65 GRRDALHLVI---SGSSRLSAEAVHQARQM----GIGASNI-----HLLDQNDRGHLRIE 112 Query: 172 YFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDA 231 C +L + ++ GC+ +NLA + +P DL R V P D Sbjct: 113 AVVYHALPPICRSLSSQLLNDEFFDQP---IGCSTSHNLAVMINDPRDLLGNRFVKPSDG 169 Query: 232 EQRDKSIQRYR 242 ++ + YR Sbjct: 170 DRAAIPVTTYR 180 >gi|326797328|ref|YP_004315148.1| pilus biogenesis CpaD-related protein [Marinomonas mediterranea MMB-1] gi|326548092|gb|ADZ93312.1| Pilus biogenesis CpaD-related protein [Marinomonas mediterranea MMB-1] Length = 224 Score = 54.6 bits (130), Expect = 1e-05, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 69/226 (30%), Gaps = 38/226 (16%) Query: 40 RTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLL----------- 88 + + FLL G + + E+ ++ Sbjct: 1 MRIFSISIVFLLFLSGCDHTVHRLRNGS---------SEAEKGTPAFVVKPTVSSISLQL 51 Query: 89 AGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIP 148 G +K D + L S+ V+ L + + + ++ ++S G+ Sbjct: 52 QENGSLKKSALDGLNALLRNQGRLSSQVIRLQPYTDKGNVFASH-----LKDSLLSLGVQ 106 Query: 149 VSSISERI--YDA--------DYGMDVDTIRLSYFASKPSAGKCGFWPEDMLG-NAKGNR 197 S + Y A + D + L+ A C ED + Sbjct: 107 ESKLKILPIQYQATTIKPDLENDKQDKWDLSLTSEAMVVVTKDCSI--EDSQAWSVHSYE 164 Query: 198 NWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 + GCA + N+A V +P DL R + D +++ RY + Sbjct: 165 SIGTLGCANRANIAQMVSDPRDLIRGRTLDDADGVHAVEAMTRYHE 210 >gi|27376952|ref|NP_768481.1| hypothetical protein bll1841 [Bradyrhizobium japonicum USDA 110] gi|12620549|gb|AAG60825.1|AF322012_130 ID279 [Bradyrhizobium japonicum] gi|27350094|dbj|BAC47106.1| bll1841 [Bradyrhizobium japonicum USDA 110] Length = 220 Score = 49.2 bits (116), Expect = 5e-04, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 37/104 (35%), Gaps = 6/104 (5%) Query: 132 RRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDM-- 189 + + + + G+ S+I + +R+ + C P + Sbjct: 79 HKLIAQVAHEARAMGVVPSNIRLSASPLNLSGRSA-VRIEAITFEAHLPNC---PSLLIA 134 Query: 190 LGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQ 233 N GC+ +NNL V +PLDL R V+ + ++ Sbjct: 135 GPAVDDNSFEPTLGCSTKNNLGVMVNDPLDLVDNRSVSTVNGDR 178 >gi|78062905|ref|YP_372813.1| hypothetical protein Bcep18194_B2056 [Burkholderia sp. 383] gi|77970790|gb|ABB12169.1| hypothetical protein Bcep18194_B2056 [Burkholderia sp. 383] Length = 126 Score = 48.9 bits (115), Expect = 7e-04, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 3/74 (4%) Query: 170 LSYFASKPSAGKCGFW--PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVT 227 + + + A C P ++ R +GCA +NLAA + P DL +P Sbjct: 33 IGFDGAHAVAPDCAKLMQPSHLVDAGFA-RPGVPFGCATYSNLAAMLARPEDLVAPVPYG 91 Query: 228 PPDAEQRDKSIQRY 241 DA+ +++RY Sbjct: 92 GADAQTAADAVRRY 105 >gi|152994348|ref|YP_001339183.1| pilus biogenesis lipoprotein CpaD [Marinomonas sp. MWYL1] gi|150835272|gb|ABR69248.1| pilus (Caulobacter type) biogenesis lipoprotein CpaD [Marinomonas sp. MWYL1] Length = 214 Score = 48.5 bits (114), Expect = 8e-04, Method: Composition-based stats. Identities = 23/112 (20%), Positives = 40/112 (35%), Gaps = 17/112 (15%) Query: 140 KIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKC-----GFW---PEDMLG 191 + ++ G+ S+ + + + A C W P D +G Sbjct: 98 QSLLELGVQGESLLIQPLVYQKAETTWDLSVVSEAIIVVTPDCVIEDSTTWSVKPFDAVG 157 Query: 192 NAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 GCA ++N+A VVNP DL R + D +++RY + Sbjct: 158 T---------LGCANRSNIARMVVNPRDLIRARTLDSADGINAVGAVKRYHE 200 >gi|298294453|gb|ADI75483.1| putative lipoprotein [Sinorhizobium fredii] Length = 54 Score = 47.7 bits (112), Expect = 0.001, Method: Composition-based stats. Identities = 14/40 (35%), Positives = 21/40 (52%) Query: 203 GCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 GC+ +NLA + +P DL R V P D ++ + YR Sbjct: 7 GCSTSHNLAVMINDPRDLLGNRFVKPSDGDRAAIPVTTYR 46 >gi|90406749|ref|ZP_01214942.1| hypothetical protein PCNPT3_01915 [Psychromonas sp. CNPT3] gi|90312202|gb|EAS40294.1| hypothetical protein PCNPT3_01915 [Psychromonas sp. CNPT3] Length = 211 Score = 47.3 bits (111), Expect = 0.002, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 29/86 (33%), Gaps = 3/86 (3%) Query: 158 DADYGMDVDTIRLSYF--ASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVV 215 ++ + + A C +D + N+GCA N LA + Sbjct: 103 GPQEKQQASAADFTFVVESYRALARHC-QAQKDASIILNNFKRNPNFGCANSNALALMIA 161 Query: 216 NPLDLFSPRMVTPPDAEQRDKSIQRY 241 NP +L + P + + I+ Y Sbjct: 162 NPRELLRSATLAPMEGRKAVSIIESY 187 >gi|313813485|gb|EFS51199.1| type III restriction enzyme, res subunit [Propionibacterium acnes HL025PA1] Length = 862 Score = 46.2 bits (108), Expect = 0.004, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 43/111 (38%), Gaps = 14/111 (12%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DY P +M Q++++ G+ + + F+E + + ++ P TV Sbjct: 41 DYD---PTVM----QVLNLATGVGKTYL-------MAAFVEYLRRQGVGNVVIVTPGKTV 86 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 + +++ + I + +P ++ + Y A RL++ P Sbjct: 87 QAKTVQNFTPGTPRYITGAAVPPEVVTPQDYSAWIARQNGPARLAFGREVP 137 >gi|167841782|ref|ZP_02468466.1| hypothetical lipoprotein [Burkholderia thailandensis MSMB43] Length = 139 Score = 45.4 bits (106), Expect = 0.006, Method: Composition-based stats. Identities = 19/66 (28%), Positives = 28/66 (42%), Gaps = 1/66 (1%) Query: 177 PSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRD 235 A C + +A R ++GCA NLAA + P DL +P DA Sbjct: 55 ARAPDCAALEQRSQMIDAGRARPGVSFGCATYGNLAAMLARPADLVAPLPYAGADAALGA 114 Query: 236 KSIQRY 241 +++RY Sbjct: 115 SAVRRY 120 >gi|83720007|ref|YP_442979.1| lipoprotein [Burkholderia thailandensis E264] gi|167581960|ref|ZP_02374834.1| hypothetical lipoprotein [Burkholderia thailandensis TXDOH] gi|167620124|ref|ZP_02388755.1| hypothetical lipoprotein [Burkholderia thailandensis Bt4] gi|257139203|ref|ZP_05587465.1| lipoprotein [Burkholderia thailandensis E264] gi|83653832|gb|ABC37895.1| hypothetical lipoprotein [Burkholderia thailandensis E264] Length = 139 Score = 45.4 bits (106), Expect = 0.006, Method: Composition-based stats. Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 1/66 (1%) Query: 177 PSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRD 235 A C + +A R ++GCA NLAA + P DL +P + DA Sbjct: 55 ARAPDCAALEQRSQMIDAGRARPGVSFGCATYGNLAAMLARPADLVAPLPYSGADAALGA 114 Query: 236 KSIQRY 241 +++RY Sbjct: 115 SAVRRY 120 >gi|126452011|ref|YP_001066151.1| putative lipoprotein [Burkholderia pseudomallei 1106a] gi|167911032|ref|ZP_02498123.1| putative lipoprotein [Burkholderia pseudomallei 112] gi|226196379|ref|ZP_03791961.1| putative lipoprotein [Burkholderia pseudomallei Pakistan 9] gi|242317370|ref|ZP_04816386.1| putative lipoprotein [Burkholderia pseudomallei 1106b] gi|254297712|ref|ZP_04965165.1| putative lipoprotein [Burkholderia pseudomallei 406e] gi|126225653|gb|ABN89193.1| putative lipoprotein [Burkholderia pseudomallei 1106a] gi|157807160|gb|EDO84330.1| putative lipoprotein [Burkholderia pseudomallei 406e] gi|225931596|gb|EEH27601.1| putative lipoprotein [Burkholderia pseudomallei Pakistan 9] gi|242140609|gb|EES27011.1| putative lipoprotein [Burkholderia pseudomallei 1106b] Length = 137 Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 32/89 (35%), Gaps = 1/89 (1%) Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 + D TI A C + +A R ++GCA NLAA Sbjct: 30 MSGHPPYGMPDASTIGYDARTGLARAPDCAALEQRSQMIDAGRARPGVSFGCATYGNLAA 89 Query: 213 QVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 + P DL +P DA +++RY Sbjct: 90 MLARPADLVAPLPYAGADAALGASAVRRY 118 >gi|53719430|ref|YP_108416.1| putative lipoprotein [Burkholderia pseudomallei K96243] gi|76808779|ref|YP_333436.1| putative lipoprotein [Burkholderia pseudomallei 1710b] gi|121601162|ref|YP_993005.1| putative lipoprotein [Burkholderia mallei SAVP1] gi|124386034|ref|YP_001026337.1| putative lipoprotein [Burkholderia mallei NCTC 10229] gi|126439464|ref|YP_001058909.1| putative lipoprotein [Burkholderia pseudomallei 668] gi|126448960|ref|YP_001080389.1| putative lipoprotein [Burkholderia mallei NCTC 10247] gi|134282244|ref|ZP_01768949.1| putative lipoprotein [Burkholderia pseudomallei 305] gi|167919048|ref|ZP_02506139.1| lipoprotein, putative [Burkholderia pseudomallei BCC215] gi|217421405|ref|ZP_03452909.1| putative lipoprotein [Burkholderia pseudomallei 576] gi|237812166|ref|YP_002896617.1| putative lipoprotein [Burkholderia pseudomallei MSHR346] gi|254178529|ref|ZP_04885184.1| putative lipoprotein [Burkholderia mallei ATCC 10399] gi|254179871|ref|ZP_04886470.1| putative lipoprotein [Burkholderia pseudomallei 1655] gi|254188723|ref|ZP_04895234.1| putative lipoprotein [Burkholderia pseudomallei Pasteur 52237] gi|254197664|ref|ZP_04904086.1| putative lipoprotein [Burkholderia pseudomallei S13] gi|254261709|ref|ZP_04952763.1| putative lipoprotein [Burkholderia pseudomallei 1710a] gi|254358539|ref|ZP_04974812.1| putative lipoprotein [Burkholderia mallei 2002721280] gi|52209844|emb|CAH35816.1| putative lipoprotein [Burkholderia pseudomallei K96243] gi|76578232|gb|ABA47707.1| putative lipoprotein [Burkholderia pseudomallei 1710b] gi|121229972|gb|ABM52490.1| lipoprotein, putative [Burkholderia mallei SAVP1] gi|124294054|gb|ABN03323.1| putative liporotein [Burkholderia mallei NCTC 10229] gi|126218957|gb|ABN82463.1| putative lipoprotein [Burkholderia pseudomallei 668] gi|126241830|gb|ABO04923.1| putative liporotein [Burkholderia mallei NCTC 10247] gi|134246282|gb|EBA46371.1| putative lipoprotein [Burkholderia pseudomallei 305] gi|148027666|gb|EDK85687.1| putative lipoprotein [Burkholderia mallei 2002721280] gi|157936402|gb|EDO92072.1| putative lipoprotein [Burkholderia pseudomallei Pasteur 52237] gi|160699568|gb|EDP89538.1| putative lipoprotein [Burkholderia mallei ATCC 10399] gi|169654405|gb|EDS87098.1| putative lipoprotein [Burkholderia pseudomallei S13] gi|184210411|gb|EDU07454.1| putative lipoprotein [Burkholderia pseudomallei 1655] gi|217395147|gb|EEC35165.1| putative lipoprotein [Burkholderia pseudomallei 576] gi|237504103|gb|ACQ96421.1| putative lipoprotein [Burkholderia pseudomallei MSHR346] gi|254220398|gb|EET09782.1| putative lipoprotein [Burkholderia pseudomallei 1710a] Length = 137 Score = 45.4 bits (106), Expect = 0.008, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 32/89 (35%), Gaps = 1/89 (1%) Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 + D TI A C + +A R ++GCA NLAA Sbjct: 30 MSGHPPYGMPDASTIGYDARTGLARAPDCAALEQRSQMIDAGRARPGVSFGCATYGNLAA 89 Query: 213 QVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 + P DL +P DA +++RY Sbjct: 90 MLARPADLVAPLPYAGADAALGASAVRRY 118 >gi|167902786|ref|ZP_02489991.1| putative lipoprotein [Burkholderia pseudomallei NCTC 13177] Length = 115 Score = 44.6 bits (104), Expect = 0.011, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 32/89 (35%), Gaps = 1/89 (1%) Query: 154 ERIYDADYGMDVDTIRLSYFASKPSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAA 212 + D TI A C + +A R ++GCA NLAA Sbjct: 8 MSGHPPYGMPDASTIGYDARTGLARAPDCAALEQRSQMIDAGRARPGVSFGCATYGNLAA 67 Query: 213 QVVNPLDLFSPRMVTPPDAEQRDKSIQRY 241 + P DL +P DA +++RY Sbjct: 68 MLARPADLVAPLPYAGADAALGASAVRRY 96 >gi|50843080|ref|YP_056307.1| putative type III restriction enzyme [Propionibacterium acnes KPA171202] gi|50840682|gb|AAT83349.1| putative type III restriction enzyme [Propionibacterium acnes KPA171202] gi|315106920|gb|EFT78896.1| type III restriction enzyme, res subunit [Propionibacterium acnes HL030PA1] Length = 862 Score = 44.6 bits (104), Expect = 0.011, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 44/111 (39%), Gaps = 14/111 (12%) Query: 67 DYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV 126 DY P +M Q++++ G+ + + F+E + + ++ P TV Sbjct: 41 DYD---PTVM----QVLNLATGVGKTYL-------MAAFVEYLRRQGVGNVVIVTPGKTV 86 Query: 127 SSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 + +++ + I S +P ++ + Y A ++L++ P Sbjct: 87 QAKTVQNFTPGSSRYITGSALPPEVVTPQDYSAWIARQNGPVQLAFGRETP 137 >gi|167590423|ref|ZP_02382811.1| hypothetical protein BuboB_34117 [Burkholderia ubonensis Bu] Length = 117 Score = 42.3 bits (98), Expect = 0.052, Method: Composition-based stats. Identities = 11/22 (50%), Positives = 14/22 (63%) Query: 202 YGCAYQNNLAAQVVNPLDLFSP 223 +GCA NLAA + P DL +P Sbjct: 56 FGCATYRNLAAMLARPEDLVAP 77 >gi|297627293|ref|YP_003689056.1| type III restriction enzyme [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296923058|emb|CBL57642.1| Putative type III restriction enzyme [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 857 Score = 41.9 bits (97), Expect = 0.068, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 39/100 (39%), Gaps = 7/100 (7%) Query: 78 KVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKD 137 V+Q++++ G+ + + F+E + + ++ P TV + +++ Sbjct: 45 DVQQVLNLATGVGKTYL-------MTAFVEYLRRQGVGNVVIVTPGKTVQAKTVQNFTPG 97 Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKP 177 + I S +P ++ + Y A LS+ P Sbjct: 98 SSRFITGSPVPPEVVTPQDYSAWIARTNGAAMLSFGREVP 137 >gi|319941901|ref|ZP_08016222.1| hypothetical protein HMPREF9464_01441 [Sutterella wadsworthensis 3_1_45B] gi|319804554|gb|EFW01424.1| hypothetical protein HMPREF9464_01441 [Sutterella wadsworthensis 3_1_45B] Length = 207 Score = 41.5 bits (96), Expect = 0.096, Method: Composition-based stats. Identities = 14/43 (32%), Positives = 23/43 (53%) Query: 201 NYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYRQ 243 N+ A NLA Q P DL +PR + P+ + ++ RY++ Sbjct: 146 NFAHATARNLAQQAAVPSDLETPRALGDPNPQAAIGAVDRYQR 188 >gi|116620655|ref|YP_822811.1| peptidase M23B [Candidatus Solibacter usitatus Ellin6076] gi|116223817|gb|ABJ82526.1| peptidase M23B [Candidatus Solibacter usitatus Ellin6076] Length = 456 Score = 41.5 bits (96), Expect = 0.10, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 53/180 (29%), Gaps = 18/180 (10%) Query: 44 LGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR 103 G++ L G + + D+R R ++ ++ + Sbjct: 93 AGKIKAAGLKEGDARIVVEAVSDDFRGR------TDSASANVKVVLAPPRVTPDD----- 141 Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGM 163 + Y N L ++ P + + A ++ R G P + Y D Sbjct: 142 --AQHYINQGGMELAVMTPGGSWNEAGVKVGKYSFRS-FALPGHPEQRFAMFAYPWDLPD 198 Query: 164 DVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 +V + + + + + R++ A + L Q+ +P +P Sbjct: 199 NVTP--MVFARNAAGTEATAQFWFKLFPKKFRVRDFP-IDDALISKLVNQI-DPGGTLAP 254 >gi|146338603|ref|YP_001203651.1| amidase [Bradyrhizobium sp. ORS278] gi|146191409|emb|CAL75414.1| amidase [Bradyrhizobium sp. ORS278] Length = 501 Score = 41.5 bits (96), Expect = 0.11, Method: Composition-based stats. Identities = 13/102 (12%), Positives = 35/102 (34%), Gaps = 5/102 (4%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFL-LIPSPTVSSAS- 130 PI + + + ++AG + + + A+ L + ++P ++ + Sbjct: 232 PITRTVADNALMLEVMAGPDGLDPRQRGAAAQPYTQALSQGAAGLRIGIVPEGFGTAGAE 291 Query: 131 --IRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRL 170 + V++ + + G + +S A I L Sbjct: 292 SEVDDRVREAAGRLQAKGADIREVSV-PLHAAGAAIWTPIFL 332 >gi|162147492|ref|YP_001601953.1| pilus assembly protein [Gluconacetobacter diazotrophicus PAl 5] gi|161786069|emb|CAP55651.1| putative pilus assembly protein [Gluconacetobacter diazotrophicus PAl 5] Length = 500 Score = 40.8 bits (94), Expect = 0.17, Method: Composition-based stats. Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 1/45 (2%) Query: 198 NWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPPDAEQRDKSIQRYR 242 +W G Q+N+ AQV P DL R + DA + ++QR+R Sbjct: 9 HWRPLG-TNQSNIEAQVERPQDLVHGRPLGDADAHEAAVAVQRWR 52 >gi|167836688|ref|ZP_02463571.1| putative lipoprotein [Burkholderia thailandensis MSMB43] Length = 130 Score = 40.8 bits (94), Expect = 0.18, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 29/80 (36%), Gaps = 14/80 (17%) Query: 146 GIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCA 205 G+P +S+ YD G V + +A R +GCA Sbjct: 30 GMPDASVIG--YDPRDGRAVPP------------DCTTLEQPSHMVDAGFGRPGVQFGCA 75 Query: 206 YQNNLAAQVVNPLDLFSPRM 225 +NLA + P DL +P+ Sbjct: 76 TYSNLAVMLARPADLIAPQP 95 >gi|258652061|ref|YP_003201217.1| hypothetical protein Namu_1838 [Nakamurella multipartita DSM 44233] gi|258555286|gb|ACV78228.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 893 Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 36/110 (32%), Gaps = 18/110 (16%) Query: 72 YP--ILMRKVEQIVDIPLLAGRGEIKYPIHD-----------TIRGFLEKYKNDSASVLF 118 +P I + E VD + + + L Y+ + L Sbjct: 742 HPFSIRIGDQEASVDYEPAESTTNLFGGNSNWRGPIWMPLNVLLVEALRDYERLTPGALT 801 Query: 119 LLIPSPTVSSASIRRAVKDIRKIIIS---SGIPVSSISERIYD--ADYGM 163 + P+ + S+A++ +A DI + ++S G YD A Sbjct: 802 VEYPTGSGSTATVGQAADDIARRLVSIFLPGPDGRRPVHGWYDLLATDPR 851 >gi|239928195|ref|ZP_04685148.1| cyclase [Streptomyces ghanaensis ATCC 14672] gi|291436525|ref|ZP_06575915.1| germacradienol/germacrene D synthase [Streptomyces ghanaensis ATCC 14672] gi|291339420|gb|EFE66376.1| germacradienol/germacrene D synthase [Streptomyces ghanaensis ATCC 14672] Length = 729 Score = 40.4 bits (93), Expect = 0.23, Method: Composition-based stats. Identities = 16/159 (10%), Positives = 44/159 (27%), Gaps = 39/159 (24%) Query: 46 QLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQI--VDIPLLAGRGEIKYPIHDTIR 103 + L F+G S D +D + + E ++P +A + + Sbjct: 246 GVLVLETFFGCSTQEAADTVNDVLTSR---LHQFEHTAFTEVPAVALEKGLTPDQVAAVA 302 Query: 104 GFL---EKYKNDSA-----------------SVLFLLIPSPTVSSASIRRAVKDIRKIII 143 + + +++ + +++ D+R ++ Sbjct: 303 AYAKGLQDWQSGGHEWHLRSSRYMNQGARTTGPWAAPVGPGGPGTSAA-----DVRALLA 357 Query: 144 SSGIPV---------SSISERIYDADYGMDVDTIRLSYF 173 + G P + + Y + IR+ + Sbjct: 358 APGAPGAPSAPWRRTRAHTHVPYQKVGPSLIPDIRMPFP 396 >gi|172041474|ref|YP_001801188.1| putative DNA restriction-modification system, restriction enzyme [Corynebacterium urealyticum DSM 7109] gi|171852778|emb|CAQ05754.1| putative DNA restriction-modification system, restriction enzyme [Corynebacterium urealyticum DSM 7109] Length = 876 Score = 40.4 bits (93), Expect = 0.24, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 37/98 (37%), Gaps = 7/98 (7%) Query: 81 QIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRK 140 Q++++ G+ + + F+E + + ++ P TV + +++ + Sbjct: 48 QVLNLATGVGKTYL-------MAAFVEYLRRQGVGNVVIVTPGKTVQAKTVQNFTPGAPR 100 Query: 141 IIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS 178 I S +P ++ + Y A L++ P Sbjct: 101 YITGSAVPPEVVTPQDYSAWVARQNGAAELAFGREVPV 138 >gi|171320619|ref|ZP_02909639.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5] gi|171094132|gb|EDT39219.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5] Length = 126 Score = 40.0 bits (92), Expect = 0.26, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 3/56 (5%) Query: 170 LSYFASKPSAGKCGFW--PEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 + + + +C P ++ G R +GCA NLA + P DL +P Sbjct: 33 IGFDGVRAVPPECATLMQPSHLVDAGFG-RPGVPFGCATYTNLATMLARPEDLVAP 87 >gi|120603344|ref|YP_967744.1| hypothetical protein Dvul_2301 [Desulfovibrio vulgaris DP4] gi|120563573|gb|ABM29317.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 256 Score = 39.6 bits (91), Expect = 0.39, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 32/103 (31%), Gaps = 5/103 (4%) Query: 94 IKYPIHDTIRGF-LEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + +RGF E + + L + SP ++ V DI +SG + + Sbjct: 19 LVMTRRPLLRGFSGESFISTGRPPLVV---SPASGLHAVGGGVTDI-SPATASGTASARV 74 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG 195 +Y T AG WP D+ + Sbjct: 75 WYALYAPSDNGQPTTDGRRLSVILAEAGDQWQWPHDVSSGLRE 117 >gi|209515952|ref|ZP_03264813.1| conserved hypothetical protein [Burkholderia sp. H160] gi|209503610|gb|EEA03605.1| conserved hypothetical protein [Burkholderia sp. H160] Length = 124 Score = 39.6 bits (91), Expect = 0.41, Method: Composition-based stats. Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 6/39 (15%) Query: 202 YGCAYQNNLAAQVVNPLDLFSPRM------VTPPDAEQR 234 +GCA NLA + P DL +P T A +R Sbjct: 66 FGCATLTNLAVMLARPEDLIAPLPYAGSDVTTAAGAVRR 104 >gi|38234464|ref|NP_940231.1| hypothetical protein DIP1894 [Corynebacterium diphtheriae NCTC 13129] gi|38200727|emb|CAE50428.1| Hypothetical protein DIP1894 [Corynebacterium diphtheriae] Length = 865 Score = 39.2 bits (90), Expect = 0.55, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 37/92 (40%), Gaps = 7/92 (7%) Query: 81 QIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRK 140 Q++++ G+ + + F+E + + ++ P TV + +++ + Sbjct: 48 QVLNLATGVGKTYL-------MAAFVEYLRRQGVGNVVIVTPGKTVQAKTVQNFTPGSPR 100 Query: 141 IIISSGIPVSSISERIYDADYGMDVDTIRLSY 172 I S +P ++ + Y A +LS+ Sbjct: 101 YITGSVVPPEVVTPQDYSAWVARQNGAAQLSF 132 >gi|115358172|ref|YP_775310.1| hypothetical protein Bamb_3422 [Burkholderia ambifaria AMMD] gi|115283460|gb|ABI88976.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 127 Score = 39.2 bits (90), Expect = 0.56, Method: Composition-based stats. Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 1/55 (1%) Query: 170 LSYFASKPSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 + + + C L +A R +GCA NLA + P DL +P Sbjct: 33 IGFDGVRAVPPDCAALMQPSHLVDAGFGRPGVPFGCATYTNLATMLARPEDLVAP 87 >gi|170703888|ref|ZP_02894571.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|172062960|ref|YP_001810611.1| hypothetical protein BamMC406_3929 [Burkholderia ambifaria MC40-6] gi|170131203|gb|EDS99847.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|171995477|gb|ACB66395.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 127 Score = 38.8 bits (89), Expect = 0.57, Method: Composition-based stats. Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 1/55 (1%) Query: 170 LSYFASKPSAGKCG-FWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSP 223 + + + C L +A R +GCA NLA + P DL +P Sbjct: 33 IGFDGVRAVPPDCAALMQPSHLVDAGFGRPGVPFGCATYTNLATMLARPEDLVAP 87 >gi|116250165|ref|YP_766003.1| acyltransferase [Rhizobium leguminosarum bv. viciae 3841] gi|115254813|emb|CAK05887.1| putative acyltransferase [Rhizobium leguminosarum bv. viciae 3841] Length = 265 Score = 38.8 bits (89), Expect = 0.66, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 34/110 (30%), Gaps = 19/110 (17%) Query: 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIIS 144 + ++ + I G + + +L P T S + R ++ + + Sbjct: 112 VFVVREERRKTGHQANEIAG------RMADGEIVVLFPEGTTSDGN--RLLEVKSSLFGA 163 Query: 145 SGIPVSSISERIYDADYGMDVDTIRLSYFASKPSA-----GKCGFWPEDM 189 + + V Y + V + ++Y A WP D+ Sbjct: 164 AAMAV------PYSPTGTVVVQPVAVAYTRVHGIAMGRYHRPLAAWPGDI 207 >gi|317153228|ref|YP_004121276.1| HhH-GPD family protein [Desulfovibrio aespoeensis Aspo-2] gi|316943479|gb|ADU62530.1| HhH-GPD family protein [Desulfovibrio aespoeensis Aspo-2] Length = 219 Score = 38.8 bits (89), Expect = 0.69, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 25/74 (33%), Gaps = 12/74 (16%) Query: 165 VDTIRLSYFASKPSAGKCGFWPED------MLGNAKGNRNWTNYGCAYQNNLAAQVVNPL 218 TI Y A + G +WP D + N NW N A NL A Sbjct: 4 AATITGMYHAMLATLGPSRWWPGDTPFEIAVGAILTQNTNWRNVEKAI-ANLKA-----R 57 Query: 219 DLFSPRMVTPPDAE 232 DL S R + D Sbjct: 58 DLLSARAMHALDTG 71 >gi|84514322|ref|ZP_01001686.1| histidyl-tRNA synthetase [Loktanella vestfoldensis SKA53] gi|84511373|gb|EAQ07826.1| histidyl-tRNA synthetase [Loktanella vestfoldensis SKA53] Length = 492 Score = 38.8 bits (89), Expect = 0.71, Method: Composition-based stats. Identities = 8/66 (12%), Positives = 24/66 (36%), Gaps = 2/66 (3%) Query: 93 EIKYPIHDTIRGFLEKYKNDSASVL--FLLIPSPTVSSASIRRAVKDIRKIIISSGIPVS 150 + D + GF+E ++ A+ + + + + + + +I ++ + G Sbjct: 233 GLSQAQADVVIGFMEAKRDTGAATVARLAELVAGSPIGVAGVAELDEIAGLLSAQGYGPD 292 Query: 151 SISERI 156 I Sbjct: 293 RIVIDP 298 >gi|46579073|ref|YP_009881.1| hypothetical protein DVU0659 [Desulfovibrio vulgaris str. Hildenborough] gi|46448486|gb|AAS95140.1| hypothetical protein DVU_0659 [Desulfovibrio vulgaris str. Hildenborough] gi|311232918|gb|ADP85772.1| hypothetical protein Deval_0604 [Desulfovibrio vulgaris RCH1] Length = 256 Score = 38.8 bits (89), Expect = 0.74, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 31/103 (30%), Gaps = 5/103 (4%) Query: 94 IKYPIHDTIRGF-LEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 + +RGF E + + L + SP ++ V DI +SG + + Sbjct: 19 LVMTRRPLLRGFSGESFISTGRPPLVV---SPASGLHAVGGGVTDI-SPATASGTASARV 74 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGKCGFWPEDMLGNAKG 195 +Y T AG WP D+ Sbjct: 75 WYALYAPSGNGQPTTDGRRLSVILAEAGDQWQWPHDVSSGLHE 117 >gi|157373221|ref|YP_001471821.1| hypothetical protein Ssed_0080 [Shewanella sediminis HAW-EB3] gi|157315595|gb|ABV34693.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 304 Score = 38.5 bits (88), Expect = 0.81, Method: Composition-based stats. Identities = 13/89 (14%), Positives = 29/89 (32%), Gaps = 8/89 (8%) Query: 81 QIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT-------VSSASIRR 133 + VDI +G ++ + + Y A+V +L+ + + + Sbjct: 177 RRVDIQFESGTSKLTAAHEVDLEAIAQ-YVQADATVKEILVDAHADASGEHLANLVLSKE 235 Query: 134 AVKDIRKIIISSGIPVSSISERIYDADYG 162 ++ + GI I R + A Sbjct: 236 RADEVASRLFELGIQKQKIQVRHHGARSP 264 >gi|320165665|gb|EFW42564.1| tRNA dihydrouridine synthase Dus2 [Capsaspora owczarzaki ATCC 30864] Length = 503 Score = 38.5 bits (88), Expect = 0.94, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 46/105 (43%), Gaps = 13/105 (12%) Query: 55 GTSALAYYDEGSDYRDR---YPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKN 111 G SALA + + R R +P +++ + + IP++A G + HD I F ++ Sbjct: 153 GVSALAVHCRLTHERPREPGHPDMLKPIVDALSIPVIANGGSLDIVSHDDIEAFRQR--- 209 Query: 112 DSASVLFLLIPSPTVSSASIRRAVK-----DIRKIIISSGIPVSS 151 +++ ++ S+ R ++ + ++ +GI + Sbjct: 210 --TGCASVMVARAAQNNPSVFRPEPLVPRLEMARQLLRTGIEWDN 252 >gi|299134224|ref|ZP_07027417.1| DNA mismatch repair protein MutL [Afipia sp. 1NLS2] gi|298590971|gb|EFI51173.1| DNA mismatch repair protein MutL [Afipia sp. 1NLS2] Length = 598 Score = 38.5 bits (88), Expect = 0.94, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 44/125 (35%), Gaps = 10/125 (8%) Query: 66 SDY--RDRYPIL---MRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL 120 +DY RDR+PI+ + Q VD + + E+++ +R + + L Sbjct: 275 ADYLPRDRHPIVALFVTLDPQEVDANVHPAKTEVRFRNAGLVRALIVHALKEG---LARE 331 Query: 121 IPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIY--DADYGMDVDTIRLSYFASKPS 178 ++ R+ + SG S + A + + ++ A PS Sbjct: 332 GRRTAANTNGAAITTAFHREDLPRSGYDWRSSPAAPFAPRASAMAFAEAPQAAFDAYTPS 391 Query: 179 AGKCG 183 A G Sbjct: 392 ADARG 396 >gi|315505262|ref|YP_004084149.1| beta-ketoacyl synthase [Micromonospora sp. L5] gi|315411881|gb|ADU09998.1| Beta-ketoacyl synthase [Micromonospora sp. L5] Length = 6765 Score = 38.5 bits (88), Expect = 0.94, Method: Composition-based stats. Identities = 12/87 (13%), Positives = 32/87 (36%), Gaps = 11/87 (12%) Query: 86 PLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIR----RAVKDIRKI 141 P+ G G + +TI F + + + ++ ++A A++++R++ Sbjct: 6186 PVAVGIGRLDP---ETIEAFAAELTRRAPGSVSFVVVEAASNAAGGVPVEPAALRELRRV 6242 Query: 142 IISSGIP----VSSISERIYDADYGMD 164 + G+P S + + Sbjct: 6243 TAAHGVPLVLDASRVVDNAVQLAGPGG 6269 >gi|254409849|ref|ZP_05023630.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] gi|196183846|gb|EDX78829.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] Length = 325 Score = 38.1 bits (87), Expect = 1.2, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 23/60 (38%), Gaps = 3/60 (5%) Query: 100 DTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRR---AVKDIRKIIISSGIPVSSISERI 156 D I + Y+ + L ++ P + + DIR +++ G+P S I Sbjct: 157 DRILYAAQLYRQSGNNPLVVVSAGPRPNLQGNQDQIVEANDIRSLLVQFGVPQSRIVLEP 216 >gi|91789977|ref|YP_550929.1| UvrA family protein [Polaromonas sp. JS666] gi|91699202|gb|ABE46031.1| UvrA family protein [Polaromonas sp. JS666] Length = 2024 Score = 38.1 bits (87), Expect = 1.2, Method: Composition-based stats. Identities = 13/108 (12%), Positives = 28/108 (25%), Gaps = 14/108 (12%) Query: 79 VEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP----SPTVSSASIRRA 134 E V++P+ + +R L V+ +L P + + + + Sbjct: 1258 KEHTVELPVA--DIHVTPENEAALRVALATALEHGKGVVHVLAPLDGLRGAMMAGASTKG 1315 Query: 135 VKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKC 182 + + P + Y SY + C Sbjct: 1316 LGRLEAFSTKRACP---VCSTSYPELDPRLF-----SYNSKHGWCPDC 1355 >gi|330990198|ref|ZP_08314176.1| Error-prone DNA polymerase 2 [Gluconacetobacter sp. SXCC-1] gi|329762744|gb|EGG79210.1| Error-prone DNA polymerase 2 [Gluconacetobacter sp. SXCC-1] Length = 632 Score = 38.1 bits (87), Expect = 1.3, Method: Composition-based stats. Identities = 14/89 (15%), Positives = 28/89 (31%), Gaps = 7/89 (7%) Query: 96 YPIHDTIRGFLEKYK---NDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSI 152 GF ++ + A +L +L+P ++ S A + + G+P Sbjct: 96 GRKRAGKGGFSLTWRDLEREGAGLLLILLPDAPDATLSAALARMEAHAR--TGGLPGYVA 153 Query: 153 SERIYDADYGMDVDTIRLSYFASKPSAGK 181 R Y + R++ A Sbjct: 154 LVRRYRPGDAARLA--RIADMAMTAGLRP 180 >gi|326319020|ref|YP_004236692.1| excinuclease ABC subunit A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323375856|gb|ADX48125.1| excinuclease ABC, A subunit [Acidovorax avenae subsp. avenae ATCC 19860] Length = 1953 Score = 37.7 bits (86), Expect = 1.4, Method: Composition-based stats. Identities = 12/105 (11%), Positives = 30/105 (28%), Gaps = 8/105 (7%) Query: 79 VEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP-SPTVSSASIRRAVKD 137 E +++P+ + + +R L + V+ +L + + + ++ Sbjct: 1207 KEHTIELPVASLD--VAPAQESELRAALARALELGKGVVHVLSGLAGLKDAMAAGQSTAR 1264 Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKC 182 I + S + Y SY + C Sbjct: 1265 IGTLQAFSTKRACPVCATSYAELDPRLF-----SYNSKHGWCPDC 1304 >gi|114327747|ref|YP_744904.1| hypothetical protein GbCGDNIH1_1083 [Granulibacter bethesdensis CGDNIH1] gi|114315921|gb|ABI61981.1| hypothetical secreted protein [Granulibacter bethesdensis CGDNIH1] Length = 275 Score = 37.7 bits (86), Expect = 1.5, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 36/92 (39%), Gaps = 6/92 (6%) Query: 81 QIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLI--PSPTVSSASIRRAVKD- 137 + + + GR +I +D +R + S + +L P ++ RR Sbjct: 148 EGLRVMFGPGRSDISPATNDALREAAREVSKLSNESVTILAYAPGTADDPSTARRLSLSR 207 Query: 138 ---IRKIIISSGIPVSSISERIYDADYGMDVD 166 IR +I++G+P + I R ++ G Sbjct: 208 ALTIRSALIAAGMPSTKIFVRALGSNIGKGPA 239 >gi|37521513|ref|NP_924890.1| polyketide synthase [Gloeobacter violaceus PCC 7421] gi|35212510|dbj|BAC89885.1| gll1944 [Gloeobacter violaceus PCC 7421] Length = 619 Score = 37.3 bits (85), Expect = 2.1, Method: Composition-based stats. Identities = 23/104 (22%), Positives = 37/104 (35%), Gaps = 13/104 (12%) Query: 97 PIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERI 156 PI IRG + S + ++A+ IR+ + +G+ S +S Sbjct: 264 PIRAIIRGTAVNHNGRSNGLT-------APNTAAQEEV---IRRALAQAGVQPSQVSYVE 313 Query: 157 YDADYGMDVDTIRLSYFASKPSAGKCGFW-PEDMLGNAKGNRNW 199 A D I + A K G+ P +LG+ K N Sbjct: 314 THATGTALGDLI--EFKALKAVLGQRQQGDPRCLLGSLKTNIGH 355 >gi|58616590|ref|YP_195721.1| FO synthase [Azoarcus sp. EbN1] gi|56316054|emb|CAI10697.1| hypothetical protein, similar to ThiH,4-methyl-5(beta-hydroxyethyl)thiazole phosphate synthesis [Aromatoleum aromaticum EbN1] Length = 806 Score = 36.9 bits (84), Expect = 2.4, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 37/112 (33%), Gaps = 15/112 (13%) Query: 106 LEKYKNDSA---SVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYG 162 Y D A + P+P + + R IR+ + G+ I ++ A Sbjct: 377 ACGYARDDAWAPGTTAAVPPTPAANLPADARIESIIRRAMDGQGLDEDEI-VTLFHARD- 434 Query: 163 MDVDTIRLSYFASKPSAGKCGFWPEDMLGN--AKGNRNWTNYGCAYQNNLAA 212 + + +A + W + N N+TN C+Y+ A Sbjct: 435 -------VDFQRVCSAADELRQWVNGDIVTYAVNRNINYTNI-CSYKCGFCA 478 >gi|145558866|sp|Q9BZQ2|SHP1L_HUMAN RecName: Full=SHC SH2 domain-binding protein 1-like protein Length = 725 Score = 36.9 bits (84), Expect = 2.7, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 37/107 (34%), Gaps = 10/107 (9%) Query: 86 PLL-AGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL-----IP--SPTVSSASIRRAVKD 137 P+ +GRG K+P + G L +++ S + + P S++++ Sbjct: 49 PIGQSGRGREKWPTAASALGLLRRWRRASKASVPADSFRTISPDRRGEKSASAVSGDTA- 107 Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFAS-KPSAGKCG 183 + + IPV S+ +T RL A G Sbjct: 108 AATTLKGTAIPVRSVVASPRPVKGKAGRETARLRLQRLPAAQAEDTG 154 >gi|221134123|ref|ZP_03560428.1| 2-deoxy-D-gluconate 3-dehydrogenase [Glaciecola sp. HTCC2999] Length = 257 Score = 36.9 bits (84), Expect = 2.7, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 3/77 (3%) Query: 119 LLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS 178 + +P+ S + + K + SSGI V++I+ ++ D ++ + Y + Sbjct: 157 ITVPAYAASKGGVAQITKALANEWASSGIQVNAIAPGYFETDNTTNIRNDKDRYESITAR 216 Query: 179 AGKCGFW--PEDMLGNA 193 CG W PED+ G Sbjct: 217 IP-CGEWGKPEDLAGAT 232 >gi|12620205|gb|AAG60617.1|AF288398_1 C1orf14 [Homo sapiens] Length = 725 Score = 36.9 bits (84), Expect = 2.7, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 37/107 (34%), Gaps = 10/107 (9%) Query: 86 PLL-AGRGEIKYPIHDTIRGFLEKYKNDSASVLFLL-----IP--SPTVSSASIRRAVKD 137 P+ +GRG K+P + G L +++ S + + P S++++ Sbjct: 49 PIGQSGRGREKWPTAASALGLLRRWRRASKASVPADSFRTISPDRRGEKSASAVSGDTA- 107 Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFAS-KPSAGKCG 183 + + IPV S+ +T RL A G Sbjct: 108 AATTLKGTAIPVRSVVASPRPVKGKAGRETARLRLQRLPAAQAEDTG 154 >gi|218682220|ref|ZP_03529821.1| 1-acyl-sn-glycerol-3-phosphate acyltransferase (phospholipid/glycerol acyltransferase) protein [Rhizobium etli CIAT 894] Length = 265 Score = 36.5 bits (83), Expect = 2.9, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 34/110 (30%), Gaps = 19/110 (17%) Query: 85 IPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIIS 144 + ++ + I G + + +L P T S + R ++ + + Sbjct: 112 VFVVREEKRRTGHQANEIAG------RMADGEIVVLFPEGTTSDGN--RLLEVKSSLFGA 163 Query: 145 SGIPVSSISERIYDADYGMDVDTIRLSYFASKPSA-----GKCGFWPEDM 189 + + V Y + V + ++Y A WP D+ Sbjct: 164 AAMAV------PYSPTGTVVVQPLAIAYTRVHGIAMGRYHRPLAAWPGDI 207 >gi|153003035|ref|YP_001377360.1| phosphoenolpyruvate-protein phosphotransferase [Anaeromyxobacter sp. Fw109-5] gi|152026608|gb|ABS24376.1| phosphoenolpyruvate-protein phosphotransferase [Anaeromyxobacter sp. Fw109-5] Length = 599 Score = 36.5 bits (83), Expect = 2.9, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 21/55 (38%), Gaps = 5/55 (9%) Query: 99 HDTIRGFLEKYKNDSASVLFLLIP--SPTVSSASIRRAVKDIRKIIISSGIPVSS 151 +R L + L ++ P S + +R + ++R+ + G PV Sbjct: 382 RAQLRALL---RASVHGNLRIMFPMISGVSELRAAKRLLAEVREELRREGAPVRE 433 >gi|50954943|ref|YP_062231.1| NAD synthetase [Leifsonia xyli subsp. xyli str. CTCB07] gi|71648719|sp|Q6AER9|NADE_LEIXX RecName: Full=NH(3)-dependent NAD(+) synthetase gi|50951425|gb|AAT89126.1| NH3-dependent NAD+ synthetase [Leifsonia xyli subsp. xyli str. CTCB07] Length = 279 Score = 36.5 bits (83), Expect = 3.0, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 1/72 (1%) Query: 104 GFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISER-IYDADYG 162 FL +Y + + F+L S S+ R + + + G+ I+ R Y Sbjct: 29 DFLVRYVRAAGASGFVLGVSGGQDSSLAGRLCQLAVERLAEQGVAAEFIAVRLPYAVQND 88 Query: 163 MDVDTIRLSYFA 174 D + LS+ Sbjct: 89 EDDAQLALSFIR 100 >gi|320529880|ref|ZP_08030957.1| thiol reductant ABC exporter, CydC subunit [Selenomonas artemidis F0399] gi|320137898|gb|EFW29803.1| thiol reductant ABC exporter, CydC subunit [Selenomonas artemidis F0399] Length = 549 Score = 36.5 bits (83), Expect = 3.2, Method: Composition-based stats. Identities = 13/72 (18%), Positives = 28/72 (38%), Gaps = 2/72 (2%) Query: 75 LMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRA 134 ++R + + +D PL + + L A VL L P+ +++ R Sbjct: 458 VIRALPRGLDEPLGENAARLSGGQRSRLLTALA--LASDAPVLLLDEPTAGLNAELGERL 515 Query: 135 VKDIRKIIISSG 146 ++ I + + G Sbjct: 516 IRAILTELSAEG 527 >gi|302818540|ref|XP_002990943.1| hypothetical protein SELMODRAFT_448213 [Selaginella moellendorffii] gi|300141274|gb|EFJ07987.1| hypothetical protein SELMODRAFT_448213 [Selaginella moellendorffii] Length = 230 Score = 36.5 bits (83), Expect = 3.2, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 24/51 (47%) Query: 34 WKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVD 84 W F L+ L + ++ G + Y +DYR R P + V++I+D Sbjct: 129 WLGFILQYLWAIGIVVAVITCGILFVTYKPREADYRFREPPSLEDVKKILD 179 >gi|317133493|ref|YP_004092807.1| transcriptional regulator, LysR family [Ethanoligenens harbinense YUAN-3] gi|315471472|gb|ADU28076.1| transcriptional regulator, LysR family [Ethanoligenens harbinense YUAN-3] Length = 291 Score = 36.5 bits (83), Expect = 3.2, Method: Composition-based stats. Identities = 12/103 (11%), Positives = 30/103 (29%), Gaps = 9/103 (8%) Query: 75 LMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPT---VSSASI 131 + K + + + I + ++ L P+ + + Sbjct: 17 TVTKAAHAL----GYVQSNVTAHIRALEKEVGTPLFQRQHGMV--LTPAGEKLLPYAEQV 70 Query: 132 RRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFA 174 R + + R ++ G P ++ Y +D+ I Y Sbjct: 71 LRMLDEARYVLQDDGKPQGRLAIGTYHPVSAVDLPQIFARYHR 113 >gi|220917838|ref|YP_002493142.1| protein of unknown function DUF1355 [Anaeromyxobacter dehalogenans 2CP-1] gi|219955692|gb|ACL66076.1| protein of unknown function DUF1355 [Anaeromyxobacter dehalogenans 2CP-1] Length = 761 Score = 36.5 bits (83), Expect = 3.3, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 25/74 (33%), Gaps = 3/74 (4%) Query: 88 LAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGI 147 AGR ++ + G + A L + ++A R + + G+ Sbjct: 153 AAGRTDLLGALEAVTSGAGASSRRL-AGALVVS--DGADNAALADGLSGATRARLRALGV 209 Query: 148 PVSSISERIYDADY 161 PVS+++ Sbjct: 210 PVSAVAVGRSAPRD 223 >gi|110679881|ref|YP_682888.1| acyltransferase [Roseobacter denitrificans OCh 114] gi|109455997|gb|ABG32202.1| acyltransferase [Roseobacter denitrificans OCh 114] Length = 273 Score = 36.5 bits (83), Expect = 3.3, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 33/122 (27%), Gaps = 15/122 (12%) Query: 83 VDIPLLAGRGEIKYPIHDTIRGF-LEKYKNDSASVLFLLIPSPTVSSAS---IRRAVKDI 138 +DI +L + + + + + +F+ + + +R Sbjct: 101 LDIFVLNATTRAYFVSKAEVAAWPGIGWLARATGTVFIERSRAKAAQQALVFAQRLGAGH 160 Query: 139 RKIIISSGIPVSSISERIYDADYGMDV-----------DTIRLSYFASKPSAGKCGFWPE 187 R + G + + + + L Y A + + + W Sbjct: 161 RLLFFPEGTSSDGLRVLPFKSSLFAAFFVPELHHVIQVQPVSLRYTAPEGADPRFYGWWG 220 Query: 188 DM 189 DM Sbjct: 221 DM 222 >gi|313680394|ref|YP_004058133.1| carbohydrate kinase, yjef related protein [Oceanithermus profundus DSM 14977] gi|313153109|gb|ADR36960.1| carbohydrate kinase, YjeF related protein [Oceanithermus profundus DSM 14977] Length = 486 Score = 36.5 bits (83), Expect = 3.4, Method: Composition-based stats. Identities = 13/93 (13%), Positives = 32/93 (34%), Gaps = 8/93 (8%) Query: 76 MRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAV 135 +R+ + +VD G + P+ E+ V+ + +PS + +R + Sbjct: 108 LREADVVVDALFGTG---LTRPLEGAWAELAERINAADRPVVAVDVPSGLPYAPHVRADL 164 Query: 136 KDIRKIIISSGIPVSSISERIYDADYGMDVDTI 168 + +G+ + A + + I Sbjct: 165 -----TVALAGLKPDHVFYPGRSACGRIQLAPI 192 >gi|86157647|ref|YP_464432.1| hypothetical protein Adeh_1221 [Anaeromyxobacter dehalogenans 2CP-C] gi|85774158|gb|ABC80995.1| protein of unknown function DUF1355 [Anaeromyxobacter dehalogenans 2CP-C] Length = 761 Score = 36.5 bits (83), Expect = 3.4, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 25/74 (33%), Gaps = 3/74 (4%) Query: 88 LAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGI 147 AGR ++ + G + A L + ++A R + + G+ Sbjct: 153 AAGRTDLLGALEAVTSGAGASSRRL-AGALVVS--DGADNAALADGLSGATRARLRALGV 209 Query: 148 PVSSISERIYDADY 161 PVS+++ Sbjct: 210 PVSAVAVGRSAPRD 223 >gi|326776423|ref|ZP_08235688.1| NADP-dependent oxidoreductase domain protein [Streptomyces cf. griseus XylebKG-1] gi|326656756|gb|EGE41602.1| NADP-dependent oxidoreductase domain protein [Streptomyces cf. griseus XylebKG-1] Length = 318 Score = 36.5 bits (83), Expect = 3.4, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 38/119 (31%), Gaps = 9/119 (7%) Query: 42 LMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDT 101 L+ GT E D R R+P +V + P++AG + T Sbjct: 202 LLAAMPLGSGYLTGTLKPGQGFEPEDLRARHPRFTAEVMAA-NQPVVAGLRRVAERRGAT 260 Query: 102 IRGFLEKYKNDSASVLFLLIPSPTV------SSASIRRAVKDIRKIIISSGIPVSSISE 154 + + + +P ++ + R + D R + G+P + S Sbjct: 261 VAQVALAWVLR-QGPHVVPVPGAKRERWAVENAGAARVVLDD-RDLAEIDGLPAARESW 317 >gi|197123048|ref|YP_002134999.1| hypothetical protein AnaeK_2645 [Anaeromyxobacter sp. K] gi|196172897|gb|ACG73870.1| protein of unknown function DUF1355 [Anaeromyxobacter sp. K] Length = 761 Score = 36.5 bits (83), Expect = 3.5, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 25/74 (33%), Gaps = 3/74 (4%) Query: 88 LAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGI 147 AGR ++ + G + A L + ++A R + + G+ Sbjct: 153 AAGRTDLLGALEAVTSGAGASSRRL-AGALVVS--DGADNAALADGLSGATRARLRALGV 209 Query: 148 PVSSISERIYDADY 161 PVS+++ Sbjct: 210 PVSAVAVGRSAPRD 223 >gi|332820870|ref|XP_001175202.2| PREDICTED: sodium-dependent dopamine transporter [Pan troglodytes] Length = 641 Score = 36.5 bits (83), Expect = 3.6, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 41/99 (41%), Gaps = 12/99 (12%) Query: 71 RYPILMRKVEQIVDIPLLAGRG-----EIKYPI--HDTIRGFLEKYKNDSASVLFLLIP- 122 R+P L+R+ E+ + G + H +RG + + ++S+L +P Sbjct: 509 RHP-LLRRGER---LGTGPFDGISEVLSLDGSQKTHTVVRGVHSGWASSASSMLEAPVPS 564 Query: 123 SPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADY 161 SP++ + + ++ D ++ + G + Y A Sbjct: 565 SPSLLTRAPEQSGPDDQRAALERGAAPGPFASSAYLASS 603 >gi|302802177|ref|XP_002982844.1| hypothetical protein SELMODRAFT_445280 [Selaginella moellendorffii] gi|300149434|gb|EFJ16089.1| hypothetical protein SELMODRAFT_445280 [Selaginella moellendorffii] Length = 171 Score = 36.5 bits (83), Expect = 3.6, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 24/51 (47%) Query: 34 WKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVD 84 W F L+ L + ++ G + Y +DYR R P + V++I+D Sbjct: 70 WLGFILQYLWAIGIVVAVITCGILFVTYKPREADYRFREPPSLEDVKKILD 120 >gi|317507714|ref|ZP_07965419.1| carbohydrate kinase [Segniliparus rugosus ATCC BAA-974] gi|316253967|gb|EFV13332.1| carbohydrate kinase [Segniliparus rugosus ATCC BAA-974] Length = 512 Score = 36.1 bits (82), Expect = 3.7, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 47/147 (31%), Gaps = 17/147 (11%) Query: 34 WKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGE 93 ++ F R + ++ L + +G + ++ + + V + + G Sbjct: 368 FEAFAGRPVGADRVAAALELAVECHVTVLLKG------HVTIIAEPSRRVLVNIAQGSWA 421 Query: 94 IKYPIHDTIRGFLE-----------KYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKII 142 D + G + ++ + + + P+ + +A + A+ + Sbjct: 422 ATAGSGDVLSGMIGALLATGAPGPCSVRSAGSGHVVIRAPATSDYAAPLAAAIHSRAADL 481 Query: 143 ISSGIPVSSISERIYDADYGMDVDTIR 169 + G + I+ A + +R Sbjct: 482 AAGGERGAPITASRLAAHIPEAIRAVR 508 >gi|313895426|ref|ZP_07828983.1| thiol reductant ABC exporter, CydC subunit [Selenomonas sp. oral taxon 137 str. F0430] gi|312976321|gb|EFR41779.1| thiol reductant ABC exporter, CydC subunit [Selenomonas sp. oral taxon 137 str. F0430] Length = 548 Score = 36.1 bits (82), Expect = 3.8, Method: Composition-based stats. Identities = 13/72 (18%), Positives = 29/72 (40%), Gaps = 2/72 (2%) Query: 75 LMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRA 134 ++R + + +D PL + + + L A VL L P+ +++ R Sbjct: 457 VVRALPRGLDEPLGENAARLSGGQRNRLLTALA--LASDAPVLLLDEPTAGLNAELGERL 514 Query: 135 VKDIRKIIISSG 146 ++ I + + G Sbjct: 515 IRAILTALSAEG 526 >gi|222081029|ref|YP_002540392.1| 4-hydroxy-2-oxovalerate aldolase protein [Agrobacterium radiobacter K84] gi|263430668|sp|B9JML6|HOA_AGRRK RecName: Full=4-hydroxy-2-oxovalerate aldolase; Short=HOA; AltName: Full=4-hydroxy-2-keto-pentanoic acid aldolase; AltName: Full=4-hydroxy-2-oxopentanoate aldolase gi|221725708|gb|ACM28797.1| 4-hydroxy-2-oxovalerate aldolase protein [Agrobacterium radiobacter K84] Length = 338 Score = 36.1 bits (82), Expect = 3.9, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 63/190 (33%), Gaps = 24/190 (12%) Query: 59 LAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYK---NDSAS 115 +A + +D ++ + + +D+ + G + ++ G ++ + A Sbjct: 107 VATHCTEADVAKQH----IEAARALDLDVA---GFLMMAHMNSPEGLAKQAQMMEAYGAH 159 Query: 116 VLFLLIPSPTVSSASIRRAVKDIRKII---ISSGIPVSS---ISERIYDADYGMDVDTIR 169 +++ + ++ +R V+ +R+ + GI V + + Sbjct: 160 CVYVTDSAGALTMDGVRERVRALRQALKPETQVGIHVHHNLSLGVANAVVGVEEGAYRVD 219 Query: 170 LSYFASKPSAGKCGFWPEDMLGNAKGNRNWTNYGCAYQNNLAAQVVNPLDLFSPRMVTPP 229 S AG P ++ W +GC NL A + DL P P Sbjct: 220 ASLAGMGAGAGNT---PIEVFAAVADRLGWQ-HGC----NLFALMDAADDLVRPLQDRPV 271 Query: 230 DAEQRDKSIQ 239 ++ SI Sbjct: 272 RVDRETLSIG 281 >gi|56461549|ref|YP_156830.1| acetyltransferase domain-containing protein [Idiomarina loihiensis L2TR] gi|56180559|gb|AAV83281.1| Acetyltransferase, GNAT family fused to PaaI related uncharacterized conserved domain [Idiomarina loihiensis L2TR] Length = 300 Score = 36.1 bits (82), Expect = 4.5, Method: Composition-based stats. Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 8/62 (12%) Query: 62 YDEGSDYRD--RYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIR--GFLEKYKNDSASVL 117 E +Y + +++ + E+ P+ GR P IR +Y+ + V Sbjct: 33 GSEQDEYDQVGHHRMVVNEAEE----PVAVGRLHFNSPEEAQIRFMAVAPEYRGEGHGVA 88 Query: 118 FL 119 + Sbjct: 89 II 90 >gi|227502928|ref|ZP_03932977.1| siderophore-interacting protein [Corynebacterium accolens ATCC 49725] gi|227076350|gb|EEI14313.1| siderophore-interacting protein [Corynebacterium accolens ATCC 49725] Length = 622 Score = 36.1 bits (82), Expect = 4.6, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 30/89 (33%), Gaps = 10/89 (11%) Query: 79 VEQIVDIPLLAGRGEIKYPI--------HDTIRGFLEKYKNDSASVLFLLIPSPTVSSAS 130 +V++P A ++ P D F+EK + +P + + Sbjct: 182 AVAVVEVPTSADIQDLDIPDSVQIHWAVRDQGEDFVEKSRALFEGTSQSALPGGEAYAWA 241 Query: 131 IRRAVKD--IRKIIISSGIPVSSISERIY 157 A + +R++ +SGI Y Sbjct: 242 AGEASRLKPLRRLFKASGISPEHREITGY 270 >gi|87311447|ref|ZP_01093567.1| endo-1,4-beta-xylanase B [Blastopirellula marina DSM 3645] gi|87285859|gb|EAQ77773.1| endo-1,4-beta-xylanase B [Blastopirellula marina DSM 3645] Length = 284 Score = 36.1 bits (82), Expect = 4.8, Method: Composition-based stats. Identities = 11/95 (11%), Positives = 26/95 (27%), Gaps = 16/95 (16%) Query: 83 VDIPLLAGRGEIKYPIHDT----------IRGFLEKYKNDSASVLFLLIPSPTVSSASIR 132 V + G G + + F+ +Y++ + + Sbjct: 47 VVVLPGGGYGHLATGHEGVDIAKWYNSFGVSAFVVEYRHRGRGY------AHPAPIQDAQ 100 Query: 133 RAVKDIRKIIISSGIPVSSISERIYDADYGMDVDT 167 RA++ +R G+ I + A + Sbjct: 101 RAIRTVRARAEEFGVSPDKIGVMGFSAGGHLASTA 135 >gi|313837570|gb|EFS75284.1| conserved domain protein [Propionibacterium acnes HL037PA2] gi|314927551|gb|EFS91382.1| conserved domain protein [Propionibacterium acnes HL044PA1] gi|314972509|gb|EFT16606.1| conserved domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 35.8 bits (81), Expect = 4.8, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 37/98 (37%), Gaps = 7/98 (7%) Query: 81 QIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRK 140 Q++++ G+ + + F+E + + ++ P TV + +++ + Sbjct: 2 QVLNLATGVGKTYL-------MAAFIEYLRRQGVGNVVIVTPGKTVQAKTVQNFALGEPR 54 Query: 141 IIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPS 178 I S +P ++ + Y A LS KP Sbjct: 55 YIAGSSVPPEVVTPQDYSAWIARQNGAEILSSGREKPV 92 >gi|310821456|ref|YP_003953814.1| beta-ketoacyl synthase [Stigmatella aurantiaca DW4/3-1] gi|309394528|gb|ADO71987.1| Beta-ketoacyl synthase [Stigmatella aurantiaca DW4/3-1] Length = 1526 Score = 35.8 bits (81), Expect = 4.8, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 36/108 (33%), Gaps = 16/108 (14%) Query: 97 PIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERI 156 D+I + ++ + +P+ + + I + + +G+P SI Sbjct: 263 ASRDSIHAVILGTAINNDGSTKVGYTAPSPEGQA-----EVIARALAVAGVPARSIGYVE 317 Query: 157 YDADYGMDVDTIRLS-----YFASKPSAGKCGFWPEDMLGNAKGNRNW 199 + D + +S + A G CG LG+ K N Sbjct: 318 AHGTGTLLGDPVEVSALTRVFRAETADTGFCG------LGSVKSNIGH 359 >gi|115379305|ref|ZP_01466416.1| oxidoreductase, short chain dehydrogenase/reductase family [Stigmatella aurantiaca DW4/3-1] gi|115363687|gb|EAU62811.1| oxidoreductase, short chain dehydrogenase/reductase family [Stigmatella aurantiaca DW4/3-1] Length = 1519 Score = 35.8 bits (81), Expect = 4.9, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 36/108 (33%), Gaps = 16/108 (14%) Query: 97 PIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERI 156 D+I + ++ + +P+ + + I + + +G+P SI Sbjct: 256 ASRDSIHAVILGTAINNDGSTKVGYTAPSPEGQA-----EVIARALAVAGVPARSIGYVE 310 Query: 157 YDADYGMDVDTIRLS-----YFASKPSAGKCGFWPEDMLGNAKGNRNW 199 + D + +S + A G CG LG+ K N Sbjct: 311 AHGTGTLLGDPVEVSALTRVFRAETADTGFCG------LGSVKSNIGH 352 >gi|124263034|ref|YP_001023504.1| malate dehydrogenase (NAD) [Methylibium petroleiphilum PM1] gi|124262280|gb|ABM97269.1| malate dehydrogenase (NAD) [Methylibium petroleiphilum PM1] Length = 432 Score = 35.8 bits (81), Expect = 5.1, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 45/137 (32%), Gaps = 10/137 (7%) Query: 40 RTLMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLA---GRGEIKY 96 R L + + F L + R + + + IPL G I+ Sbjct: 261 RVLGMAGVLDSARFCALVGLTGKARPQEVRA---VALGSHGPEMVIPLSQAFVGDRPIES 317 Query: 97 P-IHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSG--IPVSSIS 153 +T++ +E+ + + L+ + + AV +R ++ SG I S Sbjct: 318 MFDAETLKALVERARESG-GEVVKLLQKGSAYFSPAESAVTMVRAMVRDSGEVIAACVRS 376 Query: 154 ERIYDADYGMDVDTIRL 170 Y A +RL Sbjct: 377 RGAYGAVDTRVGLPVRL 393 >gi|108763538|ref|YP_629555.1| putative lipoprotein [Myxococcus xanthus DK 1622] gi|108467418|gb|ABF92603.1| putative lipoprotein [Myxococcus xanthus DK 1622] Length = 160 Score = 35.8 bits (81), Expect = 5.1, Method: Composition-based stats. Identities = 11/83 (13%), Positives = 25/83 (30%), Gaps = 8/83 (9%) Query: 101 TIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDAD 160 + GF ++ D VL L + +S + A D + ++ Sbjct: 37 RVVGFQVDFRQDGTGVLDLDLAVTNPASDAATLAAVDFTLR-----VDGRRVAVGTQQVA 91 Query: 161 YGMDV---DTIRLSYFASKPSAG 180 + +R+ + + A Sbjct: 92 APLAADGSAPLRVLFPLASARAT 114 >gi|124268530|ref|YP_001022534.1| UvrA family protein [Methylibium petroleiphilum PM1] gi|124261305|gb|ABM96299.1| UvrA family protein [Methylibium petroleiphilum PM1] Length = 1929 Score = 35.8 bits (81), Expect = 5.3, Method: Composition-based stats. Identities = 16/105 (15%), Positives = 30/105 (28%), Gaps = 8/105 (7%) Query: 79 VEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIP-SPTVSSASIRRAVKD 137 E +++P+ G + +R L K V+ LL P ++ + Sbjct: 1227 KEHTLELPV--GDIVVTPDNEAELRALLAKTLELGKGVVHLLGPLDGLKAAMAAGAPTHR 1284 Query: 138 IRKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKC 182 I ++ + S Y SY + C Sbjct: 1285 IGRVKVFSTKRACPACGTSYPELDPRMF-----SYNSKHGWCPDC 1324 >gi|299116017|emb|CBN76017.1| inversin protein alternative isoform [Ectocarpus siliculosus] Length = 748 Score = 35.8 bits (81), Expect = 5.7, Method: Composition-based stats. Identities = 11/76 (14%), Positives = 27/76 (35%), Gaps = 11/76 (14%) Query: 95 KYPIHDTIRGFLE---KYKNDSASVLFLLIPSPTVSSASI--------RRAVKDIRKIII 143 I D G +Y+ S + + S ++ + R + ++R+++ Sbjct: 560 TTTIEDRTAGSSAVDTRYRRSSTPQVVSQVVSGGATTKAWGPLHMAIHRGSSSEVRELLT 619 Query: 144 SSGIPVSSISERIYDA 159 G+ + ER + Sbjct: 620 RPGVDKEELLERSFSP 635 >gi|89897655|ref|YP_521142.1| hypothetical protein DSY4909 [Desulfitobacterium hafniense Y51] gi|219670784|ref|YP_002461219.1| UDP-N-acetylglucosamine 1-carboxyvinyltransferase [Desulfitobacterium hafniense DCB-2] gi|89337103|dbj|BAE86698.1| hypothetical protein [Desulfitobacterium hafniense Y51] gi|219541044|gb|ACL22783.1| UDP-N-acetylglucosamine 1-carboxyvinyltransferase [Desulfitobacterium hafniense DCB-2] Length = 416 Score = 35.8 bits (81), Expect = 5.7, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 32/90 (35%), Gaps = 13/90 (14%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIR 132 PI + ++D+ + + I F + + + L++ P T S + Sbjct: 38 PIRIDDAPHLLDVDV----------MCQVIGAFGASVRREGS-QLYINTPEIT-SLEAPH 85 Query: 133 RAVKDIRKIIISSG-IPVSSISERIYDADY 161 V +R I++ G + + RI Sbjct: 86 DLVSQMRASIVTMGPVLARTGRVRISHPGG 115 >gi|221633490|ref|YP_002522715.1| valyl-tRNA synthetase [Thermomicrobium roseum DSM 5159] gi|221157072|gb|ACM06199.1| valyl-tRNA synthetase [Thermomicrobium roseum DSM 5159] Length = 892 Score = 35.4 bits (80), Expect = 6.4, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 49/154 (31%), Gaps = 29/154 (18%) Query: 15 CFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYG-----TSALAYYDEGSDYR 69 C + + + L + I ++LR + G L++ +D R Sbjct: 184 CPRCMTALSDLEVDHEEIEGTLYYLRYPIDGSDESLVVATTRPETMLGDTGVAVHPNDER 243 Query: 70 DRY--------P-------ILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSA 114 R+ P I+ ++ VD G ++ F E + Sbjct: 244 YRHLVGKTAILPLLGRRLTIV---ADEAVDPAFGTGAVKVTPAH--DFTDF-EIAQRHGL 297 Query: 115 SVLFLLIPSPTVSSAS---IRRAVKDIRKIIISS 145 + +L P T++ + +++ R+ ++ Sbjct: 298 PPVNILNPDGTLNEQAGPFAGLTIQEARRRVVEE 331 >gi|325965245|ref|YP_004243151.1| hypothetical protein Asphe3_39290 [Arthrobacter phenanthrenivorans Sphe3] gi|323471332|gb|ADX75017.1| uncharacterized conserved protein [Arthrobacter phenanthrenivorans Sphe3] Length = 331 Score = 35.4 bits (80), Expect = 6.7, Method: Composition-based stats. Identities = 10/62 (16%), Positives = 22/62 (35%) Query: 94 IKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSIS 153 + + + + Y+ + LIPS + + + + + GIP +SI Sbjct: 170 VPPLLAARLDAAVSLYRGKHGGTIRALIPSGGRGADELTTEGAAMARYLQEQGIPAASIL 229 Query: 154 ER 155 Sbjct: 230 IE 231 >gi|75676093|ref|YP_318514.1| hypothetical protein Nwi_1902 [Nitrobacter winogradskyi Nb-255] gi|74420963|gb|ABA05162.1| conserved hypothetical protein [Nitrobacter winogradskyi Nb-255] Length = 499 Score = 35.4 bits (80), Expect = 6.8, Method: Composition-based stats. Identities = 9/87 (10%), Positives = 29/87 (33%), Gaps = 16/87 (18%) Query: 60 AYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFL 119 A + +G Y+ + +++ G + + A ++ + Sbjct: 226 APHIDGHKYKRGHAVVVS--------------GHLTSTGAARLS--ARGALRAGAGLVTV 269 Query: 120 LIPSPTVSSASIRRAVKDIRKIIISSG 146 L P + + + +R++ ++G Sbjct: 270 LSPDDALGANAAALTAVMVRRMNGAAG 296 >gi|330812292|ref|YP_004356754.1| alginate lyase [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327380400|gb|AEA71750.1| putative alginate lyase [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 221 Score = 35.4 bits (80), Expect = 6.9, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 34/90 (37%), Gaps = 5/90 (5%) Query: 82 IVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKI 141 ++ + G ++GF ++Y + + +F P ++ + ++R+ Sbjct: 6 TWNLSIPEGSPPATIETSQLVQGFQDQYFHSDSGTVFFWAPVTGATTTNAIYPRSELRET 65 Query: 142 IISSGIPVSSISERIYDADYGMDVDTIRLS 171 S+G + + Y A T+ +S Sbjct: 66 -YSNGTLRNWL----YPAADNKLAATLAVS 90 >gi|163731849|ref|ZP_02139296.1| acyltransferase [Roseobacter litoralis Och 149] gi|161395303|gb|EDQ19625.1| acyltransferase [Roseobacter litoralis Och 149] Length = 260 Score = 35.4 bits (80), Expect = 7.4, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 37/122 (30%), Gaps = 15/122 (12%) Query: 83 VDIPLLAGRGEIKYPIHDTIRGF-LEKYKNDSASVLFLLIPSPTVSSAS---IRRAVKDI 138 +DI +L + + + + + + +F+ V+ S +R Sbjct: 88 LDIFVLNATTRLYFVSKSEVAAWPGIGWLARATGTVFIERNRSKVAQQSELFAQRLGAGH 147 Query: 139 RKIIISSGIPVSSISERIYDAD-----------YGMDVDTIRLSYFASKPSAGKCGFWPE 187 R + G + + + + V + L Y A + + + W Sbjct: 148 RLLFFPEGTSSDGLRVLPFKSSLFAAFFAPDLRGSIQVQPVSLRYTAPEHADPRFYGWWG 207 Query: 188 DM 189 DM Sbjct: 208 DM 209 >gi|292669583|ref|ZP_06603009.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292648792|gb|EFF66764.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 538 Score = 35.4 bits (80), Expect = 7.9, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 27/73 (36%), Gaps = 2/73 (2%) Query: 75 LMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRA 134 ++R + +D PL + + L A +L L P+ + +A R Sbjct: 447 VVRALPAGLDEPLGKNASRLSGGQRSRLLTALA--LASGAPILLLDEPTAGLDAARGARL 504 Query: 135 VKDIRKIIISSGI 147 ++ + + + G Sbjct: 505 IERVLAALDARGA 517 >gi|83589123|ref|YP_429132.1| UvrD/REP helicase [Moorella thermoacetica ATCC 39073] gi|83572037|gb|ABC18589.1| UvrD/REP helicase [Moorella thermoacetica ATCC 39073] Length = 1232 Score = 35.0 bits (79), Expect = 8.6, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 34/125 (27%), Gaps = 27/125 (21%) Query: 54 YGTSALAYYDEGSDYRDRYPILMRKVEQIVDI---PLLAGRGE------IKYPIHDTIRG 104 G + + R R+P + I P+ AG G + + Sbjct: 491 AGQGGIFAPQVPAGRRTRHP----ETAASAVIRPAPVPAGSGGPGLTGALNPEQQSAVTA 546 Query: 105 FLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMD 164 + + ++ T + + V + +I G+ I+ + Sbjct: 547 --------ATGPVVVIAGPGTGKTRT---LVYRLAYLIKERGVAPGEIAAVTFT---NKA 592 Query: 165 VDTIR 169 IR Sbjct: 593 AAEIR 597 >gi|304311070|ref|YP_003810668.1| Putative SanA protein [gamma proteobacterium HdN1] gi|301796803|emb|CBL45015.1| Putative SanA protein [gamma proteobacterium HdN1] Length = 247 Score = 35.0 bits (79), Expect = 9.1, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 41/125 (32%), Gaps = 16/125 (12%) Query: 38 FLRTLMLGQL---FFLLLFYGTSALAYYDEGSDYRDRYPILMRKVE-QIVDIPLLAG--- 90 R G F ++ + S + + + +V V + L Sbjct: 16 IFRIARWGSFVAGFVSIVLFLCDFSVSASTQSAIKTQ----IGEVPEHSVVVVLGTSWRL 71 Query: 91 -RGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIRKIIISSGIPV 149 G++ + + Y+ A + ++ S ++ + +RK +I+ GIP Sbjct: 72 PSGKVNPYYKARVDAVDQLYR---AGKVQAVVVSG-DNATPQYNEPRKLRKDLIARGIPA 127 Query: 150 SSISE 154 I+ Sbjct: 128 GMITM 132 >gi|254236387|ref|ZP_04929710.1| hypothetical protein PACG_02367 [Pseudomonas aeruginosa C3719] gi|126168318|gb|EAZ53829.1| hypothetical protein PACG_02367 [Pseudomonas aeruginosa C3719] Length = 297 Score = 35.0 bits (79), Expect = 9.3, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 54/132 (40%), Gaps = 10/132 (7%) Query: 12 GGVCFKGLANMRSLISCLKTIFWKNFFLRTLMLGQLFFLLLFYGTSALAYYDEGSDYRDR 71 G V F+ M L L + L L G+L L G AL + ++YR R Sbjct: 61 GAVVFRRAEEMLRLRRDLLSELDD---LSQLNRGELRLGLPLLGADAL-FAQRFAEYRRR 116 Query: 72 YP-ILM---RKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVS 127 YP I + + ++ +LAG E+ + GF Y+ L +L+P+ Sbjct: 117 YPNIAVHLVEGGSKTMEQAVLAGELELAGSLTPADDGF--DYQPFCNEPLDVLLPAGHPK 174 Query: 128 SASIRRAVKDIR 139 +A+ A+ ++ Sbjct: 175 AAAASVALGELA 186 >gi|300693968|ref|YP_003749941.1| excinuclease ABC, a subunit [Ralstonia solanacearum PSI07] gi|299076005|emb|CBJ35316.1| Excinuclease ABC, A subunit [Ralstonia solanacearum PSI07] Length = 1945 Score = 35.0 bits (79), Expect = 9.4, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 28/103 (27%), Gaps = 7/103 (6%) Query: 80 EQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDIR 139 E +++P+ G + +R LE+ V +L P + A + Sbjct: 1212 EHTLELPV--GDIVVTPENEAALRALLEQALEHGKGVAHVLAPLDGLQHAMQNGGSAQVG 1269 Query: 140 KIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKC 182 + + S Y SY + C Sbjct: 1270 SVKVFSTKRACPTCGTSYPELDPRMF-----SYNSKHGWCPGC 1307 >gi|227494172|ref|ZP_03924488.1| MutT/nudix family protein [Actinomyces coleocanis DSM 15436] gi|226831906|gb|EEH64289.1| MutT/nudix family protein [Actinomyces coleocanis DSM 15436] Length = 189 Score = 35.0 bits (79), Expect = 9.6, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 28/74 (37%), Gaps = 4/74 (5%) Query: 71 RYPILMRKVEQIVDIPLLAGR--GEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP--TV 126 R+P ++ Q P++ R G + I + + A + +P Sbjct: 18 RHPAVLSARTQTRQYPVVDERSAGGLVLKIEGGRPLVAVIARRNRAGKIEWCLPKGHIEP 77 Query: 127 SSASIRRAVKDIRK 140 + ++ AV++I + Sbjct: 78 NESAQTAAVREIAE 91 >gi|114319729|ref|YP_741412.1| carbohydrate kinase, YjeF related protein [Alkalilimnicola ehrlichii MLHE-1] gi|114226123|gb|ABI55922.1| carbohydrate kinase, YjeF related protein [Alkalilimnicola ehrlichii MLHE-1] Length = 492 Score = 35.0 bits (79), Expect = 9.6, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 7/114 (6%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSP-TVSSASI 131 P + + +VD L G + P+ R L+ K VL + +PS + ++ Sbjct: 115 PTGLADEDVVVDALLGTG---LDRPVEGRYREALQALKAAGVPVLAIDVPSGLNAGTGAV 171 Query: 132 RRAVKDIRKIIISSGIPVSSISERIYDADYGMDVDTIRL---SYFASKPSAGKC 182 + + G+ ++ + D + + Y P AG C Sbjct: 172 MGEAVEAHCTVTFIGLKPGLLTGAGPQCAGTLYFDDLGVPPEIYQDMAPVAGLC 225 >gi|330818051|ref|YP_004361756.1| NAD dependent epimerase/dehydratase family protein [Burkholderia gladioli BSR3] gi|327370444|gb|AEA61800.1| NAD dependent epimerase/dehydratase family protein [Burkholderia gladioli BSR3] Length = 353 Score = 35.0 bits (79), Expect = 9.8, Method: Composition-based stats. Identities = 11/105 (10%), Positives = 22/105 (20%), Gaps = 10/105 (9%) Query: 79 VEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDI 138 Q++ + G I +++ S + Sbjct: 75 ARQVLLLAPPQPSGPDDRRTRALIAALGARHRARG----------GPARSGAAPVGRMRA 124 Query: 139 RKIIISSGIPVSSISERIYDADYGMDVDTIRLSYFASKPSAGKCG 183 G + +RL Y ++ G CG Sbjct: 125 LAGAWRRGTDRGQAAPGAARIVPEGPSAPLRLVYASTSGVYGDCG 169 >gi|224824178|ref|ZP_03697286.1| Carboxymethylenebutenolidase [Lutiella nitroferrum 2002] gi|224603597|gb|EEG09772.1| Carboxymethylenebutenolidase [Lutiella nitroferrum 2002] Length = 287 Score = 35.0 bits (79), Expect = 9.8, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 31/90 (34%), Gaps = 14/90 (15%) Query: 71 RYPILMRKVEQIVDIPLLAGRGEIKYPIH-DTIRGFLEKYKNDSASVLFLLIP------- 122 R+P+ + + P+L G + I DT++ +K + ++ P Sbjct: 201 RHPLDLADQ---LKAPVLGLYGGLDSGISLDTVKAMQDKLAKAGSPSRLIVYPDAGHGFN 257 Query: 123 ---SPTVSSASIRRAVKDIRKIIISSGIPV 149 P+ ++A+ + G+ Sbjct: 258 ADYRPSYNAAAAHDGWAKMLAWFADHGVKP 287 >gi|182435799|ref|YP_001823518.1| putative oxidoreductase [Streptomyces griseus subsp. griseus NBRC 13350] gi|178464315|dbj|BAG18835.1| putative oxidoreductase [Streptomyces griseus subsp. griseus NBRC 13350] Length = 308 Score = 35.0 bits (79), Expect = 9.8, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 37/115 (32%), Gaps = 9/115 (7%) Query: 42 LMLGQLFFLLLFYGTSALAYYDEGSDYRDRYPILMRKVEQIVDIPLLAGRGEIKYPIHDT 101 L+ GT E D R R+P +V + P++AG + T Sbjct: 192 LLAAMPLGSGYLTGTLKPGQGFEPEDLRARHPRFTAEVMAA-NQPVVAGLRRVAERRGAT 250 Query: 102 IRGFLEKYKNDSASVLFLLIPSPTV------SSASIRRAVKDIRKIIISSGIPVS 150 + + + +P ++ + R + D R + G+P + Sbjct: 251 VAQVALAWVLR-QGPHVVPVPGAKRERWAVENAGAARVVLDD-RDLTEIDGLPAA 303 >gi|153005408|ref|YP_001379733.1| hypothetical protein Anae109_2548 [Anaeromyxobacter sp. Fw109-5] gi|152028981|gb|ABS26749.1| protein of unknown function DUF1355 [Anaeromyxobacter sp. Fw109-5] Length = 759 Score = 35.0 bits (79), Expect = 9.8, Method: Composition-based stats. Identities = 13/83 (15%), Positives = 28/83 (33%), Gaps = 3/83 (3%) Query: 79 VEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTVSSASIRRAVKDI 138 E +P AGR ++ + G + A L + ++A + Sbjct: 143 AEAARGVPGRAGRTDVLGALEAVASGAGGSTRRL-AGALVVS--DGADNAALAEGLGPEA 199 Query: 139 RKIIISSGIPVSSISERIYDADY 161 R + G+PV++++ Sbjct: 200 RAKARALGVPVNAVAVGRSAPRD 222 >gi|325921222|ref|ZP_08183083.1| hypothetical protein XGA_2074 [Xanthomonas gardneri ATCC 19865] gi|325548285|gb|EGD19278.1| hypothetical protein XGA_2074 [Xanthomonas gardneri ATCC 19865] Length = 433 Score = 35.0 bits (79), Expect = 9.9, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 38/99 (38%), Gaps = 8/99 (8%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIRGFLEKYKNDSASVLFLLIPSPTV-----S 127 P+L+ +Q DI + G + + I +R +++ + +L ++PS + Sbjct: 289 PVLVELGQQAPDIAVAVG---LTHDIRADVRTYVDSNLPSAGRILNCMLPSGAGPASVRN 345 Query: 128 SASIRRAVKDIRKIIISSGIPVSSISERIYDADYGMDVD 166 A + + + + +P + + ++ A Sbjct: 346 RAHAALLAHQLAQAVSETRVPGVARTVHLFMAAPNGFFS 384 >gi|239907226|ref|YP_002953967.1| phosphoribosylformylglycinamidine synthase [Desulfovibrio magneticus RS-1] gi|239797092|dbj|BAH76081.1| phosphoribosylformylglycinamidine synthase [Desulfovibrio magneticus RS-1] Length = 270 Score = 35.0 bits (79), Expect = 9.9, Method: Composition-based stats. Identities = 8/67 (11%), Positives = 25/67 (37%), Gaps = 9/67 (13%) Query: 73 PILMRKVEQIVDIPLLAGRGEIKYPIHDTIR------GFLEKYKNDSASVLFLLIPS--- 123 P + K ++D+P+ G G++ + +Y + + + + P+ Sbjct: 152 PCVFTKGLSVIDLPVRHGEGKLVPMEPAVLDELMASGAVALQYADPATGAVTMDYPANPN 211 Query: 124 PTVSSAS 130 + + + Sbjct: 212 GSPQAIA 218 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.310 0.139 0.421 Lambda K H 0.267 0.0428 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 4,588,318,938 Number of Sequences: 14124377 Number of extensions: 191500441 Number of successful extensions: 789178 Number of sequences better than 10.0: 478 Number of HSP's better than 10.0 without gapping: 229 Number of HSP's successfully gapped in prelim test: 348 Number of HSP's that attempted gapping in prelim test: 788573 Number of HSP's gapped (non-prelim): 708 length of query: 243 length of database: 4,842,793,630 effective HSP length: 135 effective length of query: 108 effective length of database: 2,936,002,735 effective search space: 317088295380 effective search space used: 317088295380 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.7 bits) S2: 79 (35.0 bits)