BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 034694
         (87 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255543645|ref|XP_002512885.1| conserved hypothetical protein [Ricinus communis]
 gi|223547896|gb|EEF49388.1| conserved hypothetical protein [Ricinus communis]
          Length = 81

 Score =  131 bits (329), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 65/80 (81%), Positives = 69/80 (86%), Gaps = 1/80 (1%)

Query: 3  MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
          M GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+AC+EPPPS
Sbjct: 1  MLGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACAEPPPS 60

Query: 63 KTSEFS-ADGGGQGDSGGSQ 81
          KTS  S A G  + D   +Q
Sbjct: 61 KTSALSDATGVAEKDGLSTQ 80


>gi|225425170|ref|XP_002264458.1| PREDICTED: uncharacterized protein LOC100264285 [Vitis vinifera]
          Length = 75

 Score =  128 bits (321), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 61/73 (83%), Positives = 64/73 (87%)

Query: 3  MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
          M GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ CSEPPPS
Sbjct: 1  MAGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKVCSEPPPS 60

Query: 63 KTSEFSADGGGQG 75
          KTS    +  G+ 
Sbjct: 61 KTSAPGTNVTGEA 73


>gi|449445578|ref|XP_004140549.1| PREDICTED: uncharacterized protein LOC101220585 [Cucumis sativus]
 gi|449519856|ref|XP_004166950.1| PREDICTED: uncharacterized protein LOC101231766 [Cucumis sativus]
          Length = 74

 Score =  122 bits (306), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 57/63 (90%), Positives = 60/63 (95%)

Query: 3  MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
          M GAFWGTRV+EIVKKHDSGGLVWKRIKLTTTRKANAKKRL RVWQNEAVL+ACSEPPP 
Sbjct: 1  MAGAFWGTRVLEIVKKHDSGGLVWKRIKLTTTRKANAKKRLHRVWQNEAVLKACSEPPPP 60

Query: 63 KTS 65
          K+S
Sbjct: 61 KSS 63


>gi|224060305|ref|XP_002300133.1| predicted protein [Populus trichocarpa]
 gi|222847391|gb|EEE84938.1| predicted protein [Populus trichocarpa]
          Length = 82

 Score =  120 bits (302), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 58/68 (85%), Positives = 60/68 (88%)

Query: 4  GGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSK 63
           GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ACS+  PSK
Sbjct: 4  AGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACSQSQPSK 63

Query: 64 TSEFSADG 71
           S   A G
Sbjct: 64 MSAAGASG 71


>gi|296088720|emb|CBI38170.3| unnamed protein product [Vitis vinifera]
          Length = 65

 Score =  108 bits (271), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 52/63 (82%), Positives = 55/63 (87%)

Query: 13 MEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSEFSADGG 72
          MEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ CSEPPPSKTS    +  
Sbjct: 1  MEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKVCSEPPPSKTSAPGTNVT 60

Query: 73 GQG 75
          G+ 
Sbjct: 61 GEA 63


>gi|356569111|ref|XP_003552749.1| PREDICTED: uncharacterized protein LOC100819437 [Glycine max]
          Length = 70

 Score =  108 bits (269), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 51/54 (94%), Positives = 52/54 (96%)

Query: 7  FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPP 60
          FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ACS  P
Sbjct: 3  FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACSATP 56


>gi|356527453|ref|XP_003532325.1| PREDICTED: uncharacterized protein LOC100804262 [Glycine max]
          Length = 76

 Score =  107 bits (267), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 51/56 (91%), Positives = 52/56 (92%)

Query: 6  AFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPP 61
          AFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+AC    P
Sbjct: 2  AFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACVAQVP 57


>gi|388499974|gb|AFK38053.1| unknown [Lotus japonicus]
          Length = 78

 Score =  105 bits (261), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 59/68 (86%), Positives = 62/68 (91%)

Query: 3  MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
          M GAFWGTRVMEIVKK DSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+A S+ PPS
Sbjct: 1  MSGAFWGTRVMEIVKKRDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKATSQSPPS 60

Query: 63 KTSEFSAD 70
           +S FSAD
Sbjct: 61 MSSSFSAD 68


>gi|18396334|ref|NP_564283.1| uncharacterized protein [Arabidopsis thaliana]
 gi|14423400|gb|AAK62382.1|AF386937_1 Unknown protein [Arabidopsis thaliana]
 gi|24899683|gb|AAN65056.1| Unknown protein [Arabidopsis thaliana]
 gi|26452099|dbj|BAC43139.1| unknown protein [Arabidopsis thaliana]
 gi|332192708|gb|AEE30829.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 81

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 51/71 (71%), Positives = 58/71 (81%), Gaps = 5/71 (7%)

Query: 5  GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE-----P 59
          GAFWGTRVME+VKKHDSGGL+WKRIKLT TRKANAK RL RVWQNEAVLRAC E     P
Sbjct: 6  GAFWGTRVMELVKKHDSGGLLWKRIKLTPTRKANAKTRLRRVWQNEAVLRACGESDAPNP 65

Query: 60 PPSKTSEFSAD 70
          P +  ++ S++
Sbjct: 66 PGASNTKSSSN 76


>gi|297851172|ref|XP_002893467.1| hypothetical protein ARALYDRAFT_890267 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297339309|gb|EFH69726.1| hypothetical protein ARALYDRAFT_890267 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 81

 Score =  102 bits (254), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/54 (88%), Positives = 50/54 (92%)

Query: 5  GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE 58
          GAFWGTRVME+VKKHDSGGL+WKRIKLT TRKANAK RL RVWQNEAVLRAC E
Sbjct: 6  GAFWGTRVMELVKKHDSGGLLWKRIKLTPTRKANAKTRLRRVWQNEAVLRACGE 59


>gi|414877799|tpg|DAA54930.1| TPA: hypothetical protein ZEAMMB73_276313 [Zea mays]
          Length = 79

 Score = 94.7 bits (234), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 44/72 (61%), Positives = 54/72 (75%)

Query: 3  MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
          +GGAFW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E    
Sbjct: 7  LGGAFWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGESESR 66

Query: 63 KTSEFSADGGGQ 74
            S  ++    Q
Sbjct: 67 SASNVASTTEKQ 78


>gi|326524972|dbj|BAK04422.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 81

 Score = 94.4 bits (233), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 43/68 (63%), Positives = 53/68 (77%)

Query: 7  FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSE 66
          FW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC++   S TS 
Sbjct: 13 FWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACTQAESSSTSN 72

Query: 67 FSADGGGQ 74
           +A    Q
Sbjct: 73 TAATSEKQ 80


>gi|388521793|gb|AFK48958.1| unknown [Medicago truncatula]
          Length = 72

 Score = 93.6 bits (231), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/64 (73%), Positives = 51/64 (79%), Gaps = 1/64 (1%)

Query: 3  MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
          MGGAFWGTRVME+VKKHDS GL+WKRIKLTTTRKANAKKRL RVWQ +   R    P PS
Sbjct: 1  MGGAFWGTRVMEVVKKHDSSGLLWKRIKLTTTRKANAKKRLRRVWQMKLSSR-LFRPAPS 59

Query: 63 KTSE 66
           +S 
Sbjct: 60 ASSS 63


>gi|242083390|ref|XP_002442120.1| hypothetical protein SORBIDRAFT_08g014050 [Sorghum bicolor]
 gi|241942813|gb|EES15958.1| hypothetical protein SORBIDRAFT_08g014050 [Sorghum bicolor]
          Length = 79

 Score = 88.6 bits (218), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 41/68 (60%), Positives = 50/68 (73%)

Query: 7  FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSE 66
          FW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E      S 
Sbjct: 11 FWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGESESVSASN 70

Query: 67 FSADGGGQ 74
           ++    Q
Sbjct: 71 TASTAEKQ 78


>gi|116784377|gb|ABK23320.1| unknown [Picea sitchensis]
          Length = 80

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/78 (56%), Positives = 53/78 (67%)

Query: 7  FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSE 66
          F  TR ++IVK++DS GL+WK+IKLT TRKANAKKRL RVWQNEAVL+AC+ P P   + 
Sbjct: 3  FVPTRFLQIVKRNDSPGLLWKKIKLTPTRKANAKKRLRRVWQNEAVLKACAGPLPEGVTP 62

Query: 67 FSADGGGQGDSGGSQAKG 84
                  GD   S AK 
Sbjct: 63 PPESKKKSGDVKDSVAKS 80


>gi|224035127|gb|ACN36639.1| unknown [Zea mays]
 gi|413916390|gb|AFW56322.1| hypothetical protein ZEAMMB73_173232 [Zea mays]
          Length = 75

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 38/50 (76%), Positives = 45/50 (90%)

Query: 7  FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRAC 56
          FW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC
Sbjct: 11 FWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRAC 60


>gi|357501347|ref|XP_003620962.1| hypothetical protein MTR_7g005370 [Medicago truncatula]
 gi|355495977|gb|AES77180.1| hypothetical protein MTR_7g005370 [Medicago truncatula]
          Length = 254

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 39/46 (84%), Positives = 42/46 (91%)

Query: 3   MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQ 48
           MGGAFWGTRVM++VKKHDS  L+WKRIKLTTT KANAKKRL RVWQ
Sbjct: 168 MGGAFWGTRVMQVVKKHDSPRLLWKRIKLTTTCKANAKKRLHRVWQ 213


>gi|115488542|ref|NP_001066758.1| Os12g0477600 [Oryza sativa Japonica Group]
 gi|113649265|dbj|BAF29777.1| Os12g0477600 [Oryza sativa Japonica Group]
 gi|222617065|gb|EEE53197.1| hypothetical protein OsJ_36069 [Oryza sativa Japonica Group]
          Length = 83

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/51 (74%), Positives = 45/51 (88%)

Query: 8  WGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE 58
          W TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E
Sbjct: 14 WATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGE 64


>gi|218186839|gb|EEC69266.1| hypothetical protein OsI_38307 [Oryza sativa Indica Group]
          Length = 83

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/51 (74%), Positives = 45/51 (88%)

Query: 8  WGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE 58
          W TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E
Sbjct: 14 WATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGE 64


>gi|357150639|ref|XP_003575527.1| PREDICTED: uncharacterized protein LOC100837101 [Brachypodium
          distachyon]
          Length = 68

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 29/41 (70%), Positives = 36/41 (87%)

Query: 8  WGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQ 48
          W TR +E+VK++D+ GL+WKRIKLTTTRK NAKKRL R+WQ
Sbjct: 14 WATRALEVVKRNDAPGLLWKRIKLTTTRKNNAKKRLKRLWQ 54


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.312    0.129    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,527,968,358
Number of Sequences: 23463169
Number of extensions: 49534394
Number of successful extensions: 152956
Number of sequences better than 100.0: 25
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 152930
Number of HSP's gapped (non-prelim): 28
length of query: 87
length of database: 8,064,228,071
effective HSP length: 57
effective length of query: 30
effective length of database: 6,726,827,438
effective search space: 201804823140
effective search space used: 201804823140
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 69 (31.2 bits)