BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254781081|ref|YP_003065494.1| hypothetical protein
CLIBASIA_04915 [Candidatus Liberibacter asiaticus str. psy62]
         (93 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done


Results from round 1


>gi|254781081|ref|YP_003065494.1| hypothetical protein CLIBASIA_04915 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040758|gb|ACT57554.1| hypothetical protein CLIBASIA_04915 [Candidatus Liberibacter
          asiaticus str. psy62]
          Length = 93

 Score =  183 bits (464), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 93/93 (100%), Positives = 93/93 (100%)

Query: 1  MSEIMCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLD 60
          MSEIMCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLD
Sbjct: 1  MSEIMCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLD 60

Query: 61 KFGSLISDFHRNSESELVRCSDSESEEAKEGTE 93
          KFGSLISDFHRNSESELVRCSDSESEEAKEGTE
Sbjct: 61 KFGSLISDFHRNSESELVRCSDSESEEAKEGTE 93


>gi|194226990|ref|XP_001489883.2| PREDICTED: similar to hCG2042154 [Equus caballus]
          Length = 672

 Score = 35.8 bits (81), Expect = 1.8,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 37/64 (57%), Gaps = 4/64 (6%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQ-ITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFG 63
           +C+AY  +R     D+D + D+ Q +T DE  DLTQ+Y Y L+     + N +S+  +  
Sbjct: 260 VCQAYLGQREHE--DIDISEDAVQDLTEDEWKDLTQQY-YALVQGDAFISNSRSHFSQCQ 316

Query: 64  SLIS 67
           +L++
Sbjct: 317 ALLN 320


Searching..................................................done


Results from round 2





CONVERGED!
>gi|254781081|ref|YP_003065494.1| hypothetical protein CLIBASIA_04915 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040758|gb|ACT57554.1| hypothetical protein CLIBASIA_04915 [Candidatus Liberibacter
          asiaticus str. psy62]
          Length = 93

 Score =  154 bits (389), Expect = 3e-36,   Method: Composition-based stats.
 Identities = 93/93 (100%), Positives = 93/93 (100%)

Query: 1  MSEIMCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLD 60
          MSEIMCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLD
Sbjct: 1  MSEIMCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLD 60

Query: 61 KFGSLISDFHRNSESELVRCSDSESEEAKEGTE 93
          KFGSLISDFHRNSESELVRCSDSESEEAKEGTE
Sbjct: 61 KFGSLISDFHRNSESELVRCSDSESEEAKEGTE 93


>gi|45201038|ref|NP_986608.1| AGL058Cp [Ashbya gossypii ATCC 10895]
 gi|44985808|gb|AAS54432.1| AGL058Cp [Ashbya gossypii ATCC 10895]
          Length = 969

 Score = 37.8 bits (86), Expect = 0.54,   Method: Composition-based stats.
 Identities = 20/78 (25%), Positives = 42/78 (53%)

Query: 9   YAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGSLISD 68
           Y EE +S+ L L+ A D   I+++E+S+L ++  ++   K+ ++D  K  ++   S  ++
Sbjct: 615 YQEEIHSLELQLEEAHDKLLISAEEISELHEKIAHIESEKNAIIDEKKDIVESLMSAKAE 674

Query: 69  FHRNSESELVRCSDSESE 86
           F +  +  L      E+E
Sbjct: 675 FQQREQEHLTATEQLETE 692


>gi|194226990|ref|XP_001489883.2| PREDICTED: similar to hCG2042154 [Equus caballus]
          Length = 672

 Score = 37.0 bits (84), Expect = 0.93,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 37/64 (57%), Gaps = 4/64 (6%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQ-ITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFG 63
           +C+AY  +R     D+D + D+ Q +T DE  DLTQ+Y Y L+     + N +S+  +  
Sbjct: 260 VCQAYLGQREHE--DIDISEDAVQDLTEDEWKDLTQQY-YALVQGDAFISNSRSHFSQCQ 316

Query: 64  SLIS 67
           +L++
Sbjct: 317 ALLN 320


>gi|296224434|ref|XP_002758058.1| PREDICTED: cytosolic 5'-nucleotidase 1B isoform 3 [Callithrix
           jacchus]
          Length = 592

 Score = 35.1 bits (79), Expect = 3.3,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 3/73 (4%)

Query: 8   AYAEERYSVLLDLD---FASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           AY + +  V  D D   F+ +S+  T +   D   +YD L  +K L    LK +L+  G 
Sbjct: 438 AYCDTQLRVAFDGDAVLFSDESEHFTKEHGMDKFFQYDTLCESKPLAQGPLKGFLEDLGR 497

Query: 65  LISDFHRNSESEL 77
           L   FH  +E  L
Sbjct: 498 LQKKFHAKNERLL 510


>gi|296224432|ref|XP_002758057.1| PREDICTED: cytosolic 5'-nucleotidase 1B isoform 2 [Callithrix
           jacchus]
          Length = 609

 Score = 35.1 bits (79), Expect = 3.3,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 3/73 (4%)

Query: 8   AYAEERYSVLLDLD---FASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           AY + +  V  D D   F+ +S+  T +   D   +YD L  +K L    LK +L+  G 
Sbjct: 455 AYCDTQLRVAFDGDAVLFSDESEHFTKEHGMDKFFQYDTLCESKPLAQGPLKGFLEDLGR 514

Query: 65  LISDFHRNSESEL 77
           L   FH  +E  L
Sbjct: 515 LQKKFHAKNERLL 527


>gi|296224430|ref|XP_002758056.1| PREDICTED: cytosolic 5'-nucleotidase 1B isoform 1 [Callithrix
           jacchus]
          Length = 626

 Score = 35.1 bits (79), Expect = 3.3,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 3/73 (4%)

Query: 8   AYAEERYSVLLDLD---FASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           AY + +  V  D D   F+ +S+  T +   D   +YD L  +K L    LK +L+  G 
Sbjct: 472 AYCDTQLRVAFDGDAVLFSDESEHFTKEHGMDKFFQYDTLCESKPLAQGPLKGFLEDLGR 531

Query: 65  LISDFHRNSESEL 77
           L   FH  +E  L
Sbjct: 532 LQKKFHAKNERLL 544


>gi|149017521|gb|EDL76525.1| rCG59308, isoform CRA_b [Rattus norvegicus]
          Length = 475

 Score = 35.1 bits (79), Expect = 3.6,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +DL  AS ++ ++ +E +DLTQ+Y YLL++ +  + + +SY  +  +
Sbjct: 75  VCQAYLGQLEHEDIDLSDAS-AEALSEEEWNDLTQQY-YLLVHGNASITDSRSYFAQCQA 132

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 133 LLSKISSVNPQTEI 146


>gi|149017522|gb|EDL76526.1| rCG59308, isoform CRA_c [Rattus norvegicus]
          Length = 588

 Score = 35.1 bits (79), Expect = 3.6,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +DL  AS ++ ++ +E +DLTQ+Y YLL++ +  + + +SY  +  +
Sbjct: 75  VCQAYLGQLEHEDIDLSDAS-AEALSEEEWNDLTQQY-YLLVHGNASITDSRSYFAQCQA 132

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 133 LLSKISSVNPQTEI 146


>gi|66911735|gb|AAH97453.1| RGD1306462 protein [Rattus norvegicus]
          Length = 375

 Score = 35.1 bits (79), Expect = 3.6,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +DL  AS ++ ++ +E +DLTQ+Y YLL++ +  + + +SY  +  +
Sbjct: 263 VCQAYLGQLEHEDIDLSDAS-AEALSEEEWNDLTQQY-YLLVHGNASITDSRSYFAQCQA 320

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 321 LLSKISSVNPQTEI 334


>gi|295293121|ref|NP_766406.2| protein monoglycylase TTLL8 [Mus musculus]
 gi|172044389|sp|A4Q9F1|TTLL8_MOUSE RecName: Full=Protein monoglycylase TTLL8; AltName:
           Full=Tubulin--tyrosine ligase-like protein 8
 gi|145369184|emb|CAM84329.1| glycylase [Mus musculus]
 gi|148672443|gb|EDL04390.1| RIKEN cDNA 1700019P01, isoform CRA_b [Mus musculus]
          Length = 832

 Score = 34.7 bits (78), Expect = 3.8,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +D+  AS ++ ++ +E +DLTQ+Y YLL++ +  + + KSY  +  +
Sbjct: 316 VCQAYLGQLEHEDIDVSEAS-TEALSEEEWNDLTQQY-YLLVHGNASITDSKSYFAQCQA 373

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 374 LLSKISSVNPQTEI 387


>gi|109732579|gb|AAI16295.1| Ttll8 protein [Mus musculus]
          Length = 634

 Score = 34.7 bits (78), Expect = 3.8,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +D+  AS ++ ++ +E +DLTQ+Y YLL++ +  + + KSY  +  +
Sbjct: 118 VCQAYLGQLEHEDIDVSEAS-TEALSEEEWNDLTQQY-YLLVHGNASITDSKSYFAQCQA 175

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 176 LLSKISSVNPQTEI 189


>gi|26325682|dbj|BAC26595.1| unnamed protein product [Mus musculus]
 gi|148672442|gb|EDL04389.1| RIKEN cDNA 1700019P01, isoform CRA_a [Mus musculus]
          Length = 518

 Score = 34.7 bits (78), Expect = 3.8,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +D+  AS ++ ++ +E +DLTQ+Y YLL++ +  + + KSY  +  +
Sbjct: 118 VCQAYLGQLEHEDIDVSEAS-TEALSEEEWNDLTQQY-YLLVHGNASITDSKSYFAQCQA 175

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 176 LLSKISSVNPQTEI 189


>gi|26326135|dbj|BAC26811.1| unnamed protein product [Mus musculus]
 gi|109730773|gb|AAI16294.1| Tubulin tyrosine ligase-like family, member 8 [Mus musculus]
          Length = 781

 Score = 34.7 bits (78), Expect = 3.8,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 5   MCEAYAEERYSVLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGS 64
           +C+AY  +     +D+  AS ++ ++ +E +DLTQ+Y YLL++ +  + + KSY  +  +
Sbjct: 265 VCQAYLGQLEHEDIDVSEAS-TEALSEEEWNDLTQQY-YLLVHGNASITDSKSYFAQCQA 322

Query: 65  LISDFHR-NSESEL 77
           L+S     N ++E+
Sbjct: 323 LLSKISSVNPQTEI 336


>gi|221057748|ref|XP_002261382.1| SICA antigen (fragment) [Plasmodium knowlesi strain H]
 gi|194247387|emb|CAQ40787.1| SICA antigen (fragment) [Plasmodium knowlesi strain H]
          Length = 1074

 Score = 34.7 bits (78), Expect = 4.2,   Method: Composition-based stats.
 Identities = 16/64 (25%), Positives = 32/64 (50%)

Query: 27  DQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGSLISDFHRNSESELVRCSDSESE 86
           D ITSD ++D    + + + +   ++D  +    +  +L  +  +  +S    C+D +SE
Sbjct: 627 DNITSDNITDYIDNHHHHITHPSYIIDPWEEVQTQLKNLSGEIQKKKDSVGSHCTDLQSE 686

Query: 87  EAKE 90
           E KE
Sbjct: 687 EEKE 690


>gi|168065993|ref|XP_001784929.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663516|gb|EDQ50276.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 515

 Score = 34.3 bits (77), Expect = 5.9,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 40/78 (51%)

Query: 16  VLLDLDFASDSDQITSDELSDLTQRYDYLLLNKHLLVDNLKSYLDKFGSLISDFHRNSES 75
           +L DL  A+     T+   SDL + YD+L+  +  L++ + + +D+     SD  R   +
Sbjct: 66  LLNDLGLANHLRPQTAGFPSDLDETYDWLVALQDDLMEGIDAAVDQLAKQKSDGKRGRPN 125

Query: 76  ELVRCSDSESEEAKEGTE 93
           E  R   +E  +A +G++
Sbjct: 126 EKFRTPTNEGAKASKGSD 143


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.311    0.129    0.346 

Lambda     K      H
   0.267   0.0398    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,533,242,334
Number of Sequences: 14124377
Number of extensions: 52068884
Number of successful extensions: 138622
Number of sequences better than 10.0: 23
Number of HSP's better than 10.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 40
Number of HSP's that attempted gapping in prelim test: 138590
Number of HSP's gapped (non-prelim): 58
length of query: 93
length of database: 4,842,793,630
effective HSP length: 63
effective length of query: 30
effective length of database: 3,952,957,879
effective search space: 118588736370
effective search space used: 118588736370
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 75 (33.5 bits)