BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254780984|ref|YP_003065397.1| hypothetical protein
CLIBASIA_04425 [Candidatus Liberibacter asiaticus str. psy62]
         (125 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done



>gi|254780984|ref|YP_003065397.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040661|gb|ACT57457.1| hypothetical protein CLIBASIA_04425 [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 125

 Score =  211 bits (537), Expect = 2e-53,   Method: Composition-based stats.
 Identities = 125/125 (100%), Positives = 125/125 (100%)

Query: 1   MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60
           MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK
Sbjct: 1   MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60

Query: 61  EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYRYGNGYYR 120
           EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYRYGNGYYR
Sbjct: 61  EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYRYGNGYYR 120

Query: 121 NYRSQ 125
           NYRSQ
Sbjct: 121 NYRSQ 125


>gi|254781126|ref|YP_003065539.1| hypothetical protein CLIBASIA_05140 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040803|gb|ACT57599.1| hypothetical protein CLIBASIA_05140 [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 85

 Score = 86.2 bits (212), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 30/49 (61%), Positives = 38/49 (77%)

Query: 65  ITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEARGKYR 113
           ITR +  L +++KKAFF HEKKV  +LNYNA DRK NI++ Y+AR KYR
Sbjct: 18  ITRGVARLPKDQKKAFFRHEKKVADHLNYNAGDRKSNIDKLYKARCKYR 66


>gi|254780981|ref|YP_003065394.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040658|gb|ACT57454.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 122

 Score = 42.3 bits (98), Expect = 0.023,   Method: Composition-based stats.
 Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 16/102 (15%)

Query: 1   MKKYITLLTVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKRLK 60
           MKKY T+LT+L +S+ +N         P    E+ + +   P  +  L  L AENEK  K
Sbjct: 1   MKKYFTILTMLFVSSAIN---------PCGIEEDNLKSSPLPHIA--LESLSAENEK--K 47

Query: 61  EFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNI 102
           E  +  +++    EN KK F EHEKK T +       RK NI
Sbjct: 48  ELSEHEKKVIESQENPKKQFSEHEKKETDD---PKSARKENI 86


>gi|325914595|ref|ZP_08176936.1| Conjugative relaxosome accessory transposon protein [Xanthomonas
           vesicatoria ATCC 35937]
 gi|325539097|gb|EGD10752.1| Conjugative relaxosome accessory transposon protein [Xanthomonas
           vesicatoria ATCC 35937]
          Length = 476

 Score = 38.5 bits (88), Expect = 0.31,   Method: Composition-based stats.
 Identities = 28/81 (34%), Positives = 37/81 (45%), Gaps = 4/81 (4%)

Query: 32  SEERIATCAKPGYS----SRLAQLCAENEKRLKEFDKITRELNTLSENEKKAFFEHEKKV 87
           SEE  AT      S    S L  L +E  +  +EF +  RE  T    E K FFE  K  
Sbjct: 391 SEEMAATVVSQVMSQYLQSILISLSSEEGEGTQEFREKAREATTRLREESKPFFERTKSR 450

Query: 88  TSNLNYNARDRKHNINQFYEA 108
           T++L+Y  R R  +  + Y A
Sbjct: 451 TAHLDYYMRVRDTSSEKPYLA 471


>gi|32492529|gb|AAP85347.1| aminotransferase [Streptomyces griseoruber]
          Length = 372

 Score = 37.3 bits (85), Expect = 0.65,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 21  DAKARRFPTYGSEERIATCAKPGYSSRLAQLCAENEKR-LKEFDKITRE 68
           DA+ RR   YG  ER      PG++SRL ++ AE  +R L+  D   R 
Sbjct: 205 DARLRRLRYYGMAERYYVVQTPGHNSRLDEVQAEILRRKLRRLDDYVRG 253


>gi|237724550|ref|ZP_04555031.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. D4]
 gi|265754991|ref|ZP_06089905.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_33FAA]
 gi|229437109|gb|EEO47186.1| RNA polymerase ECF-type sigma factor [Bacteroides dorei 5_1_36/D4]
 gi|263234602|gb|EEZ20181.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_33FAA]
          Length = 188

 Score = 36.2 bits (82), Expect = 1.4,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 11  LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66
           L+I + LN  L+    RR   + + E++ T A+  +  R  A LC  +  R +E D KI 
Sbjct: 64  LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYARFEELDRKIK 123

Query: 67  RELNTLSENEKKAFFEHEKK 86
             +  L ++ ++AF  H  K
Sbjct: 124 EAMAALPDSYREAFVMHRFK 143


>gi|237709314|ref|ZP_04539795.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 9_1_42FAA]
 gi|229456699|gb|EEO62420.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 9_1_42FAA]
          Length = 188

 Score = 36.2 bits (82), Expect = 1.5,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 11  LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66
           L+I + LN  L+    RR   + + E++ T A+  +  R  A LC  +  R +E D KI 
Sbjct: 64  LIIESSLNKYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYARFEELDRKIK 123

Query: 67  RELNTLSENEKKAFFEHEKK 86
             +  L ++ ++AF  H  K
Sbjct: 124 EAMAALPDSYREAFVMHRFK 143


>gi|150004329|ref|YP_001299073.1| RNA polymerase ECF-type sigma factor [Bacteroides vulgatus ATCC
           8482]
 gi|254883392|ref|ZP_05256102.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 4_3_47FAA]
 gi|149932753|gb|ABR39451.1| RNA polymerase ECF-type sigma factor [Bacteroides vulgatus ATCC
           8482]
 gi|254836185|gb|EET16494.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 4_3_47FAA]
          Length = 188

 Score = 36.2 bits (82), Expect = 1.6,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 11  LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66
           L+I + LN  L+    RR   + + E++ T A+  +  R  A LC  +  R +E D KI 
Sbjct: 64  LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYGRFEELDRKIK 123

Query: 67  RELNTLSENEKKAFFEHEKK 86
             +  L ++ ++AF  H  K
Sbjct: 124 EAMAALPDSYREAFVMHRFK 143


>gi|294776855|ref|ZP_06742318.1| RNA polymerase sigma-70 factor [Bacteroides vulgatus PC510]
 gi|294449331|gb|EFG17868.1| RNA polymerase sigma-70 factor [Bacteroides vulgatus PC510]
          Length = 188

 Score = 36.2 bits (82), Expect = 1.7,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 11  LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66
           L+I + LN  L+    RR   + + E++ T A+  +  R  A LC  +  R +E D KI 
Sbjct: 64  LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYGRFEELDRKIK 123

Query: 67  RELNTLSENEKKAFFEHEKK 86
             +  L ++ ++AF  H  K
Sbjct: 124 EAMAALPDSYREAFVMHRFK 143


>gi|226357900|ref|YP_002787640.1| dipeptide ABC transporter, periplasmic component [Deinococcus
           deserti VCD115]
 gi|226320143|gb|ACO48136.1| putative dipeptide ABC transporter, periplasmic component
           [Deinococcus deserti VCD115]
          Length = 588

 Score = 35.8 bits (81), Expect = 2.1,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 3/93 (3%)

Query: 22  AKARRFPTYGSEERIATCAKPGYSSRLAQ-LCAENEKRLKEFDKITRELNTLSENEKKAF 80
           AKA ++P Y   E     A+ GY+ + AQ    +   ++ EF+  T   NT+ E   + F
Sbjct: 377 AKAPKYP-YNPREAARLLAQLGYTKKNAQGYLVDKAGKVLEFNLSTNAGNTVREQLGRVF 435

Query: 81  FEHEKKVTSNLNYNARDRKHNINQFYEARGKYR 113
            +  KKV   +N+   D  + +NQ   A+G+ R
Sbjct: 436 ADEAKKVGVKVNFTPIDFNNLVNQL-TAKGESR 467


>gi|120436997|ref|YP_862683.1| hypothetical protein GFO_2660 [Gramella forsetii KT0803]
 gi|117579147|emb|CAL67616.1| conserved hypothetical protein, membrane [Gramella forsetii KT0803]
          Length = 376

 Score = 35.8 bits (81), Expect = 2.2,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 29/52 (55%)

Query: 46  SRLAQLCAENEKRLKEFDKITRELNTLSENEKKAFFEHEKKVTSNLNYNARD 97
           +RL +L   NE  L     +  EL+TL EN+K +F + +K+V  NL Y   D
Sbjct: 263 ARLERLELSNESHLDTIKSMKTELSTLKENQKVSFSDRQKEVLGNLAYFGVD 314


>gi|212693788|ref|ZP_03301916.1| hypothetical protein BACDOR_03309 [Bacteroides dorei DSM 17855]
 gi|212663677|gb|EEB24251.1| hypothetical protein BACDOR_03309 [Bacteroides dorei DSM 17855]
          Length = 240

 Score = 35.4 bits (80), Expect = 2.4,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 11  LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66
           L+I + LN  L+    RR   + + E++ T A+  +  R  A LC  +  R +E D KI 
Sbjct: 116 LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYARFEELDRKIK 175

Query: 67  RELNTLSENEKKAFFEHEKK 86
             +  L ++ ++AF  H  K
Sbjct: 176 EAMAALPDSYREAFVMHRFK 195


>gi|319643304|ref|ZP_07997931.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_40A]
 gi|317385051|gb|EFV66003.1| RNA polymerase ECF-type sigma factor [Bacteroides sp. 3_1_40A]
          Length = 197

 Score = 35.4 bits (80), Expect = 2.6,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 11  LLISNVLN--LYDAKARRFPTYGSEERIATCAKPGYSSRL-AQLCAENEKRLKEFD-KIT 66
           L+I + LN  L+    RR   + + E++ T A+  +  R  A LC  +  R +E D KI 
Sbjct: 73  LIIESSLNQYLFKMTYRRVLNHLTREQVKTKAEAAFYERTQAALCEVDYGRFEELDRKIK 132

Query: 67  RELNTLSENEKKAFFEHEKK 86
             +  L ++ ++AF  H  K
Sbjct: 133 EAMAALPDSYREAFVMHRFK 152


>gi|313667776|ref|YP_004048060.1| MafB2 adhesin [Neisseria lactamica ST-640]
 gi|313005238|emb|CBN86671.1| MafB2 adhesin [Neisseria lactamica 020-06]
          Length = 507

 Score = 35.4 bits (80), Expect = 2.9,   Method: Composition-based stats.
 Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 11/98 (11%)

Query: 9   TVLLISNVLNLYDAKARRFPTYGSEERIATCAKPGYSSRLAQLCAE-NEKRLKEFDKITR 67
           TV  ++N+L    AKA +    G   ++   AKPG ++      A  N +  ++    T 
Sbjct: 309 TVEALANILPF--AKAEKLAKLGKAAKLGRAAKPGKAAVSGDFSAAYNTRTTRKVTTETE 366

Query: 68  ELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQF 105
            LN + +N+K +   HEK      NY  RD  ++IN  
Sbjct: 367 GLNRIRQNQKNSNI-HEK------NY-GRDNPNHINVL 396


>gi|261410100|ref|YP_003246341.1| parB-like partition protein [Paenibacillus sp. Y412MC10]
 gi|329925118|ref|ZP_08280062.1| stage 0 sporulation protein J [Paenibacillus sp. HGF5]
 gi|261286563|gb|ACX68534.1| parB-like partition protein [Paenibacillus sp. Y412MC10]
 gi|328940237|gb|EGG36569.1| stage 0 sporulation protein J [Paenibacillus sp. HGF5]
          Length = 279

 Score = 35.0 bits (79), Expect = 4.0,   Method: Composition-based stats.
 Identities = 17/83 (20%), Positives = 35/83 (42%), Gaps = 7/83 (8%)

Query: 29  TYGSEERIATCAKPGYSSRLAQLCAENEKRLKEFDKITRELNTLSENEKKAFFEHEKKVT 88
           + G    I     P    +LA+ C E E  ++E ++  + L+    ++ K   ++     
Sbjct: 176 SMGHARAIVGIKDPILVKQLAKQCVEQEWSVRELEEAVKNLDRKPADKAKPKVKNRDPYI 235

Query: 89  SNLNYNARDR-------KHNINQ 104
            +L  + R+R       KHN ++
Sbjct: 236 DHLEEDLRERFKTTVKIKHNKDK 258


>gi|260948520|ref|XP_002618557.1| hypothetical protein CLUG_02016 [Clavispora lusitaniae ATCC 42720]
 gi|238848429|gb|EEQ37893.1| hypothetical protein CLUG_02016 [Clavispora lusitaniae ATCC 42720]
          Length = 689

 Score = 34.6 bits (78), Expect = 5.0,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 11/69 (15%)

Query: 54  ENEKRLKEFDKITRE--------LNTLSENEKKAFFEHEKK--VTSNLNYNARDRKHNIN 103
           E E   KE  K+ ++           L E++++A FE E    V   L  +  DRK+ + 
Sbjct: 517 EAEPEYKEVKKLVKKNDLTIEMTSAALPEDKRQALFEKESALVVADKLVADTEDRKNALE 576

Query: 104 QF-YEARGK 111
           ++ YE RGK
Sbjct: 577 EYIYELRGK 585


>gi|315122459|ref|YP_004062948.1| hypothetical protein CKC_03555 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313495861|gb|ADR52460.1| hypothetical protein CKC_03555 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 103

 Score = 34.2 bits (77), Expect = 6.5,   Method: Composition-based stats.
 Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 10/75 (13%)

Query: 50  QLCAENEKRLKEF-DKITRELNTLSENEKKAFFEHEKKVTSNLNYNARDRKHNINQFYEA 108
           +LC  ++++ K F   I  E+  L + ++K FFE  +++  NL      R  +IN++YE 
Sbjct: 38  KLCKISDEQTKSFYQNIAPEVIQLPQEKQKEFFERRERLKRNL------RNKHINKYYEL 91

Query: 109 RGKYRYGNGYYRNYR 123
             +Y     YYR +R
Sbjct: 92  EKQY---TPYYRYHR 103


>gi|323968869|gb|EGB64200.1| hypothetical protein ERHG_05028 [Escherichia coli TA007]
          Length = 238

 Score = 33.5 bits (75), Expect = 9.8,   Method: Composition-based stats.
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 3/69 (4%)

Query: 44  YSSRLAQLCAENEKRLKEFDKITRELNTLSENEK--KAFFEHEKKVTSNLNYNARDRKHN 101
           +S ++A L A NEK++KE ++I   LN    +        E E KV S ++Y+ RD   +
Sbjct: 6   HSRKIAALVAYNEKQIKEINRIITLLNERVNDLNTFTDTLEREGKVVSGISYSLRDSVSH 65

Query: 102 INQFYEARG 110
           ++ FY  +G
Sbjct: 66  VD-FYLNKG 73


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.316    0.136    0.382 

Lambda     K      H
   0.267   0.0419    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,134,796,919
Number of Sequences: 14124377
Number of extensions: 40082461
Number of successful extensions: 124548
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 124531
Number of HSP's gapped (non-prelim): 43
length of query: 125
length of database: 4,842,793,630
effective HSP length: 91
effective length of query: 34
effective length of database: 3,557,475,323
effective search space: 120954160982
effective search space used: 120954160982
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)