BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780984|ref|YP_003065397.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter asiaticus str. psy62] (125 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780984|ref|YP_003065397.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter asiaticus str. psy62] gi|254040661|gb|ACT57457.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter asiaticus str. psy62] Length = 125 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 125/125 (100%), Positives = 125/125 (100%) Query: 1 MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60 MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK Sbjct: 1 MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60 Query: 61 EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYRYGNGYYR 120 EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYRYGNGYYR Sbjct: 61 EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYRYGNGYYR 120 Query: 121 NYRSQ 125 NYRSQ Sbjct: 121 NYRSQ 125 >gi|254781126|ref|YP_003065539.1| hypothetical protein CLIBASIA_05140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040803|gb|ACT57599.1| hypothetical protein CLIBASIA_05140 [Candidatus Liberibacter asiaticus str. psy62] Length = 85 Score = 86.2 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 30/49 (61%), Positives = 38/49 (77%) Query: 65 ITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYR 113 ITR + L +++KKAFF HEKKV +LNYNA DRK NI++ Y+AR KYR Sbjct: 18 ITRGVARLPKDQKKAFFRHEKKVADHLNYNAGDRKSNIDKLYKARCKYR 66 >gi|254780981|ref|YP_003065394.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter asiaticus str. psy62] gi|254040658|gb|ACT57454.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter asiaticus str. psy62] Length = 122 Score = 42.3 bits (98), Expect = 0.023, Method: Composition-based stats. Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 16/102 (15%) Query: 1 MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60 MKKY T+LT+L +S+ +N P E+ + + P + L L AENEK K Sbjct: 1 MKKYFTILTMLFVSSAIN---------PCGIEEDNLKSSPLPHIA--LESLSAENEK--K 47 Query: 61 EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNI 102 E + +++ EN KK F EHEKK T + RK NI Sbjct: 48 ELSEHEKKVIESQENPKKQFSEHEKKETDD---PKSARKENI 86 >gi|325914595|ref|ZP_08176936.1| Conjugative relaxosome accessory transposon protein [Xanthomonas vesicatoria ATCC 35937] gi|325539097|gb|EGD10752.1| Conjugative relaxosome accessory transposon protein [Xanthomonas vesicatoria ATCC 35937] Length = 476 Score = 38.5 bits (88), Expect = 0.31, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 37/81 (45%), Gaps = 4/81 (4%) Query: 32 SEERIATCAKPGYS----SRLAQLCAENEKRLKEFDKITRELNTLSENEKKAFFEHEKKV 87 SEE AT S S L L +E + +EF + RE T E K FFE K Sbjct: 391 SEEMAATVVSQVMSQYLQSILISLSSEEGEGTQEFREKAREATTRLREESKPFFERTKSR 450 Query: 88 TSNLNYNARDRKHNINQFYEA 108 T++L+Y R R + + Y A Sbjct: 451 TAHLDYYMRVRDTSSEKPYLA 471 >gi|32492529|gb|AAP85347.1| aminotransferase [Streptomyces griseoruber] Length = 372 Score = 37.3 bits (85), Expect = 0.65, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 1/49 (2%) Query: 21 DAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKR-LKEFDKITRE 68 DA+ RR YG ER PG++SRL ++ AE +R L+ D R Sbjct: 205 DARLRRLRYYGMAERYYVVQTPGHNSRLDEVQAEILRRKLRRLDDYVRG 253 >gi|237724550|ref|ZP_04555031.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. D4] gi|265754991|ref|ZP_06089905.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_33FAA] gi|229437109|gb|EEO47186.1| RNA polymerase ECF-type sigma factor [Bacteroides dorei 5_1_36/D4] gi|263234602|gb|EEZ20181.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_33FAA] Length = 188 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query: 11 LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66 L+I + LN L+ RR + + E++ T A+ + R A LC + R +E D KI Sbjct: 64 LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYARFEELDRKIK 123 Query: 67 RELNTLSENEKKAFFEHEKK 86 + L ++ ++AF H K Sbjct: 124 EAMAALPDSYREAFVMHRFK 143 >gi|237709314|ref|ZP_04539795.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 9_1_42FAA] gi|229456699|gb|EEO62420.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 9_1_42FAA] Length = 188 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query: 11 LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66 L+I + LN L+ RR + + E++ T A+ + R A LC + R +E D KI Sbjct: 64 LIIESSLNKYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYARFEELDRKIK 123 Query: 67 RELNTLSENEKKAFFEHEKK 86 + L ++ ++AF H K Sbjct: 124 EAMAALPDSYREAFVMHRFK 143 >gi|150004329|ref|YP_001299073.1| RNA polymerase ECF-type sigma factor [Bacteroides vulgatus ATCC 8482] gi|254883392|ref|ZP_05256102.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 4_3_47FAA] gi|149932753|gb|ABR39451.1| RNA polymerase ECF-type sigma factor [Bacteroides vulgatus ATCC 8482] gi|254836185|gb|EET16494.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 4_3_47FAA] Length = 188 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query: 11 LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66 L+I + LN L+ RR + + E++ T A+ + R A LC + R +E D KI Sbjct: 64 LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYGRFEELDRKIK 123 Query: 67 RELNTLSENEKKAFFEHEKK 86 + L ++ ++AF H K Sbjct: 124 EAMAALPDSYREAFVMHRFK 143 >gi|294776855|ref|ZP_06742318.1| RNA polymerase sigma-70 factor [Bacteroides vulgatus PC510] gi|294449331|gb|EFG17868.1| RNA polymerase sigma-70 factor [Bacteroides vulgatus PC510] Length = 188 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query: 11 LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66 L+I + LN L+ RR + + E++ T A+ + R A LC + R +E D KI Sbjct: 64 LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYGRFEELDRKIK 123 Query: 67 RELNTLSENEKKAFFEHEKK 86 + L ++ ++AF H K Sbjct: 124 EAMAALPDSYREAFVMHRFK 143 >gi|226357900|ref|YP_002787640.1| dipeptide ABC transporter, periplasmic component [Deinococcus deserti VCD115] gi|226320143|gb|ACO48136.1| putative dipeptide ABC transporter, periplasmic component [Deinococcus deserti VCD115] Length = 588 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 3/93 (3%) Query: 22 AKARRFPTYGSEERIATCAKPGYSSRLAQ-LCAENEKRLKEFDKITRELNTLSENEKKAF 80 AKA ++P Y E A+ GY+ + AQ + ++ EF+ T NT+ E + F Sbjct: 377 AKAPKYP-YNPREAARLLAQLGYTKKNAQGYLVDKAGKVLEFNLSTNAGNTVREQLGRVF 435 Query: 81 FEHEKKVTSNLNYNARDRKHNINQFYEARGKYR 113 + KKV +N+ D + +NQ A+G+ R Sbjct: 436 ADEAKKVGVKVNFTPIDFNNLVNQL-TAKGESR 467 >gi|120436997|ref|YP_862683.1| hypothetical protein GFO_2660 [Gramella forsetii KT0803] gi|117579147|emb|CAL67616.1| conserved hypothetical protein, membrane [Gramella forsetii KT0803] Length = 376 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 20/52 (38%), Positives = 29/52 (55%) Query: 46 SRLAQLCAENEKRLKEFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARD 97 +RL +L NE L + EL+TL EN+K +F + +K+V NL Y D Sbjct: 263 ARLERLELSNESHLDTIKSMKTELSTLKENQKVSFSDRQKEVLGNLAYFGVD 314 >gi|212693788|ref|ZP_03301916.1| hypothetical protein BACDOR_03309 [Bacteroides dorei DSM 17855] gi|212663677|gb|EEB24251.1| hypothetical protein BACDOR_03309 [Bacteroides dorei DSM 17855] Length = 240 Score = 35.4 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query: 11 LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66 L+I + LN L+ RR + + E++ T A+ + R A LC + R +E D KI Sbjct: 116 LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYARFEELDRKIK 175 Query: 67 RELNTLSENEKKAFFEHEKK 86 + L ++ ++AF H K Sbjct: 176 EAMAALPDSYREAFVMHRFK 195 >gi|319643304|ref|ZP_07997931.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_40A] gi|317385051|gb|EFV66003.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_40A] Length = 197 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Query: 11 LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66 L+I + LN L+ RR + + E++ T A+ + R A LC + R +E D KI Sbjct: 73 LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYGRFEELDRKIK 132 Query: 67 RELNTLSENEKKAFFEHEKK 86 + L ++ ++AF H K Sbjct: 133 EAMAALPDSYREAFVMHRFK 152 >gi|313667776|ref|YP_004048060.1| MafB2 adhesin [Neisseria lactamica ST-640] gi|313005238|emb|CBN86671.1| MafB2 adhesin [Neisseria lactamica 020-06] Length = 507 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 11/98 (11%) Query: 9 TVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAE-NEKRLKEFDKITR 67 TV ++N+L AKA + G ++ AKPG ++ A N + ++ T Sbjct: 309 TVEALANILPF--AKAEKLAKLGKAAKLGRAAKPGKAAVSGDFSAAYNTRTTRKVTTETE 366 Query: 68 ELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQF 105 LN + +N+K + HEK NY RD ++IN Sbjct: 367 GLNRIRQNQKNSNI-HEK------NY-GRDNPNHINVL 396 >gi|261410100|ref|YP_003246341.1| parB-like partition protein [Paenibacillus sp. Y412MC10] gi|329925118|ref|ZP_08280062.1| stage 0 sporulation protein J [Paenibacillus sp. HGF5] gi|261286563|gb|ACX68534.1| parB-like partition protein [Paenibacillus sp. Y412MC10] gi|328940237|gb|EGG36569.1| stage 0 sporulation protein J [Paenibacillus sp. HGF5] Length = 279 Score = 35.0 bits (79), Expect = 4.0, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 35/83 (42%), Gaps = 7/83 (8%) Query: 29 TYGSEERIATCAKPGYSSRLAQLCAENEKRLKEFDKITRELNTLSENEKKAFFEHEKKVT 88 + G I P +LA+ C E E ++E ++ + L+ ++ K ++ Sbjct: 176 SMGHARAIVGIKDPILVKQLAKQCVEQEWSVRELEEAVKNLDRKPADKAKPKVKNRDPYI 235 Query: 89 SNLNYNARDR-------KHNINQ 104 +L + R+R KHN ++ Sbjct: 236 DHLEEDLRERFKTTVKIKHNKDK 258 >gi|260948520|ref|XP_002618557.1| hypothetical protein CLUG_02016 [Clavispora lusitaniae ATCC 42720] gi|238848429|gb|EEQ37893.1| hypothetical protein CLUG_02016 [Clavispora lusitaniae ATCC 42720] Length = 689 Score = 34.6 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 11/69 (15%) Query: 54 ENEKRLKEFDKITRE--------LNTLSENEKKAFFEHEKK--VTSNLNYNARDRKHNIN 103 E E KE K+ ++ L E++++A FE E V L + DRK+ + Sbjct: 517 EAEPEYKEVKKLVKKNDLTIEMTSAALPEDKRQALFEKESALVVADKLVADTEDRKNALE 576 Query: 104 QF-YEARGK 111 ++ YE RGK Sbjct: 577 EYIYELRGK 585 >gi|315122459|ref|YP_004062948.1| hypothetical protein CKC_03555 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495861|gb|ADR52460.1| hypothetical protein CKC_03555 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 103 Score = 34.2 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 10/75 (13%) Query: 50 QLCAENEKRLKEF-DKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEA 108 +LC ++++ K F I E+ L + ++K FFE +++ NL R +IN++YE Sbjct: 38 KLCKISDEQTKSFYQNIAPEVIQLPQEKQKEFFERRERLKRNL------RNKHINKYYEL 91 Query: 109 RGKYRYGNGYYRNYR 123 +Y YYR +R Sbjct: 92 EKQY---TPYYRYHR 103 >gi|323968869|gb|EGB64200.1| hypothetical protein ERHG_05028 [Escherichia coli TA007] Length = 238 Score = 33.5 bits (75), Expect = 9.8, Method: Composition-based stats. Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 3/69 (4%) Query: 44 YSSRLAQLCAENEKRLKEFDKITRELNTLSENEK--KAFFEHEKKVTSNLNYNARDRKHN 101 +S ++A L A NEK++KE ++I LN + E E KV S ++Y+ RD + Sbjct: 6 HSRKIAALVAYNEKQIKEINRIITLLNERVNDLNTFTDTLEREGKVVSGISYSLRDSVSH 65 Query: 102 INQFYEARG 110 ++ FY +G Sbjct: 66 VD-FYLNKG 73 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.316 0.136 0.382 Lambda K H 0.267 0.0419 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,134,796,919 Number of Sequences: 14124377 Number of extensions: 40082461 Number of successful extensions: 124548 Number of sequences better than 10.0: 40 Number of HSP's better than 10.0 without gapping: 11 Number of HSP's successfully gapped in prelim test: 29 Number of HSP's that attempted gapping in prelim test: 124531 Number of HSP's gapped (non-prelim): 43 length of query: 125 length of database: 4,842,793,630 effective HSP length: 91 effective length of query: 34 effective length of database: 3,557,475,323 effective search space: 120954160982 effective search space used: 120954160982 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 76 (33.9 bits)