BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033372
         (120 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297742396|emb|CBI34545.3| unnamed protein product [Vitis vinifera]
          Length = 147

 Score =  142 bits (357), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 70/112 (62%), Positives = 89/112 (79%)

Query: 9   ILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILM 68
           +L  NR+EDV+WLCSLS+SE+DMLISLK+++L+RAKVIGHE+LA KFDLK LRALGFILM
Sbjct: 6   VLGSNRLEDVNWLCSLSDSELDMLISLKMMVLRRAKVIGHEDLAEKFDLKMLRALGFILM 65

Query: 69  EHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
           E+L+ +VKDLS  PG A    F+  CNLLKC   D ++ E+LK C+  + KR
Sbjct: 66  EYLRGQVKDLSAIPGLAGLDKFLNECNLLKCSLKDTISTEELKACICTNSKR 117


>gi|356556924|ref|XP_003546770.1| PREDICTED: uncharacterized protein LOC100818832 [Glycine max]
          Length = 151

 Score =  125 bits (315), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 61/106 (57%), Positives = 82/106 (77%)

Query: 13  NRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLK 72
            RVEDV+WLCSLSESE+DMLISLKLLI+QRAK++G +ELA+KF+LK +RA+  +LM HLK
Sbjct: 25  TRVEDVAWLCSLSESEIDMLISLKLLIIQRAKMMGCKELASKFNLKMIRAIALVLMGHLK 84

Query: 73  EKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
           E++KD SL P   +  +F+  CNLLKC N+    +++L T L  D+
Sbjct: 85  EEIKDSSLIPNMVKSTSFLDACNLLKCSNEVDANIDELSTSLGADI 130


>gi|356525841|ref|XP_003531530.1| PREDICTED: uncharacterized protein LOC100778207 isoform 1 [Glycine
           max]
 gi|356525843|ref|XP_003531531.1| PREDICTED: uncharacterized protein LOC100778207 isoform 2 [Glycine
           max]
          Length = 154

 Score =  124 bits (312), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 60/106 (56%), Positives = 80/106 (75%)

Query: 13  NRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLK 72
            RVEDV+WLCSLSESE+DMLISLKLLI+QRAK++G +ELA+KF+LK +RA+  +LMEHLK
Sbjct: 25  TRVEDVAWLCSLSESEIDMLISLKLLIIQRAKMMGCKELASKFNLKMIRAIALVLMEHLK 84

Query: 73  EKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
            ++K  SL P + E  +F+  CNLLKC N+    ++ L   L  D+
Sbjct: 85  AEIKGSSLIPNTVESTSFLDACNLLKCSNEVDANIDDLSASLGADI 130


>gi|357513171|ref|XP_003626874.1| hypothetical protein MTR_8g011490 [Medicago truncatula]
 gi|355520896|gb|AET01350.1| hypothetical protein MTR_8g011490 [Medicago truncatula]
          Length = 168

 Score =  123 bits (308), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 61/110 (55%), Positives = 85/110 (77%)

Query: 10  LSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILME 69
           +++ RV+DV+WLCSLSESE+DMLISLKLLI++RA+ IG + LANKFDLK LRA+  +LME
Sbjct: 20  ITETRVQDVAWLCSLSESEIDMLISLKLLIIKRAEWIGCKNLANKFDLKMLRAIALVLME 79

Query: 70  HLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLK 119
           +LK +VKD SL P   +  AF+  CNLL C+++   T+E+L T +  D++
Sbjct: 80  NLKAEVKDASLVPDMVKSTAFLDVCNLLNCNSEVSATIEELSTSVGADIQ 129


>gi|255647835|gb|ACU24377.1| unknown [Glycine max]
          Length = 154

 Score =  123 bits (308), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 59/106 (55%), Positives = 79/106 (74%)

Query: 13  NRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLK 72
            RVEDV+WLCSLSESE+DMLISLKLLI+QRAK++G +ELA+KF+LK +RA+  +LMEHLK
Sbjct: 25  TRVEDVAWLCSLSESEIDMLISLKLLIIQRAKMMGCKELASKFNLKMIRAIALVLMEHLK 84

Query: 73  EKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
            ++K  SL P + E  +F+  CNLLKC N+    ++ L      D+
Sbjct: 85  AEIKGSSLIPNTVESTSFLDACNLLKCSNEVDANIDDLSASFGADI 130


>gi|449506107|ref|XP_004162655.1| PREDICTED: uncharacterized LOC101210446 [Cucumis sativus]
          Length = 182

 Score =  121 bits (304), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 63/122 (51%), Positives = 89/122 (72%), Gaps = 7/122 (5%)

Query: 1   MEVKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTL 60
           + ++E NEIL  NR+ED+SWLCSLSESE+D+LIS+K+L+LQRAK IGHE LA KFDL+TL
Sbjct: 46  ISMEEANEILKNNRIEDISWLCSLSESELDLLISIKMLVLQRAKAIGHENLAEKFDLRTL 105

Query: 61  RALGFILMEHLKEKVK--DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
           RA+GF+LMEHLK K++  D+S    S         CNLL  + + IL+++++   +  D 
Sbjct: 106 RAIGFVLMEHLKGKLRTSDVSDLSQST-----FNACNLLDSNLEKILSIDEIMASICSDR 160

Query: 119 KR 120
           ++
Sbjct: 161 RK 162


>gi|388510664|gb|AFK43398.1| unknown [Medicago truncatula]
          Length = 168

 Score =  121 bits (303), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 60/110 (54%), Positives = 84/110 (76%)

Query: 10  LSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILME 69
           +++ RV+DV+W CSLSESE+DMLISLKLLI++RA+ IG + LANKFDLK LRA+  +LME
Sbjct: 20  ITETRVQDVAWPCSLSESEIDMLISLKLLIIKRAEWIGCKNLANKFDLKMLRAIALVLME 79

Query: 70  HLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLK 119
           +LK +VKD SL P   +  AF+  CNLL C+++   T+E+L T +  D++
Sbjct: 80  NLKAEVKDASLVPDMVKSTAFLDVCNLLNCNSEVSATIEELSTSVGADIQ 129


>gi|224075272|ref|XP_002304585.1| predicted protein [Populus trichocarpa]
 gi|222842017|gb|EEE79564.1| predicted protein [Populus trichocarpa]
          Length = 136

 Score =  120 bits (302), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 65/114 (57%), Positives = 80/114 (70%)

Query: 7   NEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFI 66
            E L  N  ED+ WLC+LSESE+DMLI+LK LIL RAKV+GH+ELA KFD  TLRA+G  
Sbjct: 5   GETLVDNSPEDIRWLCNLSESELDMLITLKSLILHRAKVLGHDELAKKFDSPTLRAVGLF 64

Query: 67  LMEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
           LME+LK KVKDLS   G  +  AF   CNLLK +  D  ++E+LK  + ID +R
Sbjct: 65  LMEYLKGKVKDLSHVQGLTKLAAFSDCCNLLKGNPGDDSSIEELKASIDIDERR 118


>gi|62320079|dbj|BAD94245.1| hypothetical protein [Arabidopsis thaliana]
          Length = 127

 Score =  119 bits (297), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 59/111 (53%), Positives = 82/111 (73%), Gaps = 1/111 (0%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
           ++E+ +IL  NR +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LR+
Sbjct: 1   MEELTQILKNNRTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRS 60

Query: 63  LGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
           LG +LMEH K++V+ D SL P     L  +  CNLLK   DD + +E++ T
Sbjct: 61  LGLVLMEHAKKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEILT 111


>gi|18412725|ref|NP_565234.1| Spc97 / Spc98 family of spindle pole body (SBP) component
           [Arabidopsis thaliana]
 gi|42572197|ref|NP_974189.1| Spc97 / Spc98 family of spindle pole body (SBP) component
           [Arabidopsis thaliana]
 gi|145327753|ref|NP_001077852.1| Spc97 / Spc98 family of spindle pole body (SBP) component
           [Arabidopsis thaliana]
 gi|111074472|gb|ABH04609.1| At1g80245 [Arabidopsis thaliana]
 gi|332198254|gb|AEE36375.1| Spc97 / Spc98 family of spindle pole body (SBP) component
           [Arabidopsis thaliana]
 gi|332198255|gb|AEE36376.1| Spc97 / Spc98 family of spindle pole body (SBP) component
           [Arabidopsis thaliana]
 gi|332198256|gb|AEE36377.1| Spc97 / Spc98 family of spindle pole body (SBP) component
           [Arabidopsis thaliana]
          Length = 127

 Score =  118 bits (295), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 58/111 (52%), Positives = 82/111 (73%), Gaps = 1/111 (0%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
           ++E+ +IL  NR +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LR+
Sbjct: 1   MEELTQILKNNRTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRS 60

Query: 63  LGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
           LG +LMEH +++V+ D SL P     L  +  CNLLK   DD + +E++ T
Sbjct: 61  LGLVLMEHARKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEILT 111


>gi|449453423|ref|XP_004144457.1| PREDICTED: uncharacterized protein LOC101210446 [Cucumis sativus]
          Length = 234

 Score =  115 bits (288), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 59/112 (52%), Positives = 83/112 (74%), Gaps = 7/112 (6%)

Query: 1   MEVKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTL 60
           + ++E NEIL   R+E +SWLCSLSESE+D+LIS+K+L+LQRAK +GHE LA KFDL+TL
Sbjct: 98  ISMEEANEILKNKRIEGISWLCSLSESELDLLISIKMLVLQRAKAVGHENLAEKFDLRTL 157

Query: 61  RALGFILMEHLKEKVK--DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKL 110
           RA+GF+LMEHLK K++  D+S    S         CNLL  + + IL+++++
Sbjct: 158 RAIGFVLMEHLKGKLRTSDVSDLSQST-----FNACNLLDSNLEKILSIDEI 204


>gi|21537144|gb|AAM61485.1| unknown [Arabidopsis thaliana]
          Length = 127

 Score =  115 bits (288), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 57/111 (51%), Positives = 81/111 (72%), Gaps = 1/111 (0%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
           ++E+ +IL  N  +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LR+
Sbjct: 1   MEELTQILKNNSTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRS 60

Query: 63  LGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
           LG +LMEH +++V+ D SL P     L  +  CNLLK   DD + +E++ T
Sbjct: 61  LGLVLMEHARKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEILT 111


>gi|297842843|ref|XP_002889303.1| hypothetical protein ARALYDRAFT_316935 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335144|gb|EFH65562.1| hypothetical protein ARALYDRAFT_316935 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 137

 Score =  115 bits (288), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 56/111 (50%), Positives = 84/111 (75%), Gaps = 1/111 (0%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
           ++E++++L  NR +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LRA
Sbjct: 1   MEELSQLLKNNRTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRA 60

Query: 63  LGFILMEHLKEKVKDLS-LFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
           LG +LME+++++V+D + L P     L  +  CNLLK   DD + +E++ T
Sbjct: 61  LGLVLMEYVRKRVQDDTCLAPSVVHQLMLLDNCNLLKTHEDDTVDMEEILT 111


>gi|145361194|ref|NP_680548.2| uncharacterized protein [Arabidopsis thaliana]
 gi|186511402|ref|NP_001118906.1| uncharacterized protein [Arabidopsis thaliana]
 gi|61742725|gb|AAX55183.1| hypothetical protein At4g00695 [Arabidopsis thaliana]
 gi|332656520|gb|AEE81920.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332656521|gb|AEE81921.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 128

 Score =  109 bits (273), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/112 (50%), Positives = 86/112 (76%), Gaps = 2/112 (1%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEEL-ANKFDLKTLR 61
           ++E+++IL  NR +D++W CSLSESE+D+LISLK L+++RAKV G E+L A+KFDLK LR
Sbjct: 1   MEELSQILKNNRTDDLTWFCSLSESELDLLISLKKLVIRRAKVSGLEDLVADKFDLKMLR 60

Query: 62  ALGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
           +LG +LME+++++V+ D SL P   + L+ +  CNLLK   DD + +E++ T
Sbjct: 61  SLGLVLMEYVRKRVEDDTSLAPSVVQELSLLDSCNLLKTHVDDTVDIEEILT 112


>gi|334186256|ref|NP_001190646.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332656522|gb|AEE81922.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 153

 Score =  109 bits (272), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/112 (50%), Positives = 86/112 (76%), Gaps = 2/112 (1%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEEL-ANKFDLKTLR 61
           ++E+++IL  NR +D++W CSLSESE+D+LISLK L+++RAKV G E+L A+KFDLK LR
Sbjct: 1   MEELSQILKNNRTDDLTWFCSLSESELDLLISLKKLVIRRAKVSGLEDLVADKFDLKMLR 60

Query: 62  ALGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
           +LG +LME+++++V+ D SL P   + L+ +  CNLLK   DD + +E++ T
Sbjct: 61  SLGLVLMEYVRKRVEDDTSLAPSVVQELSLLDSCNLLKTHVDDTVDIEEILT 112


>gi|224053697|ref|XP_002297934.1| predicted protein [Populus trichocarpa]
 gi|222845192|gb|EEE82739.1| predicted protein [Populus trichocarpa]
          Length = 136

 Score =  106 bits (265), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/114 (50%), Positives = 77/114 (67%)

Query: 7   NEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFI 66
            +IL  N  E + WLC+LSESE+DMLI LK LIL RAKV+GH+ELA  FDL TLRA+  +
Sbjct: 5   QDILGNNSPEVIRWLCNLSESELDMLIRLKSLILHRAKVLGHDELAKNFDLPTLRAIALL 64

Query: 67  LMEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
           LME+LK K K  S   G  + + F   CNLL+ +  +  ++E+LK C+ ID ++
Sbjct: 65  LMEYLKGKFKHSSQVQGLTKLVVFPECCNLLEGNPGEDSSMEELKACIGIDERK 118


>gi|302143257|emb|CBI20552.3| unnamed protein product [Vitis vinifera]
          Length = 122

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/92 (58%), Positives = 67/92 (72%), Gaps = 2/92 (2%)

Query: 31  MLISLKLLILQRAKVIGHEELANKFDLKTLRAL--GFILMEHLKEKVKDLSLFPGSAEPL 88
           MLISLK+++L+RAKVIGHE+LA KFDLK LRAL  GFILME+L+ +VKDLS  PG A   
Sbjct: 1   MLISLKMMVLRRAKVIGHEDLAEKFDLKMLRALGGGFILMEYLRGQVKDLSTIPGLAGLD 60

Query: 89  AFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
            F   CNLLKC   D ++ E+ K C+  + KR
Sbjct: 61  RFFNECNLLKCSLKDPMSTEEQKACISTNSKR 92


>gi|147804743|emb|CAN69358.1| hypothetical protein VITISV_024910 [Vitis vinifera]
          Length = 583

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 40/59 (67%), Positives = 53/59 (89%)

Query: 5  EMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRAL 63
          ++ ++L  NR+EDV+WLCSL+ESE+DMLISLK+++L+RAKVIGHE+LA KFDLK L AL
Sbjct: 2  DIQQVLGSNRLEDVNWLCSLAESELDMLISLKMMVLRRAKVIGHEDLAEKFDLKMLXAL 60


>gi|357117060|ref|XP_003560293.1| PREDICTED: uncharacterized protein LOC100841286 [Brachypodium
           distachyon]
          Length = 197

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/67 (55%), Positives = 48/67 (71%)

Query: 16  EDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLKEKV 75
           +D  WL +LSE E+D LI LK L   RAK  GH  LA++FDL+TLRALG +L+  LKE++
Sbjct: 40  DDERWLAALSEPELDFLICLKKLAATRAKTAGHPHLADQFDLRTLRALGVVLLGGLKERL 99

Query: 76  KDLSLFP 82
           K+ SL P
Sbjct: 100 KETSLDP 106


>gi|242094314|ref|XP_002437647.1| hypothetical protein SORBIDRAFT_10g031150 [Sorghum bicolor]
 gi|241915870|gb|EER89014.1| hypothetical protein SORBIDRAFT_10g031150 [Sorghum bicolor]
          Length = 156

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/64 (48%), Positives = 44/64 (68%), Gaps = 1/64 (1%)

Query: 17 DVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANK-FDLKTLRALGFILMEHLKEKV 75
          D  WL +LSE E+D+L+SL+ L + RA   GH  LA+  F L+ LRALG +L+E LKE++
Sbjct: 31 DAMWLQTLSEPELDVLVSLRELAVTRATNAGHPALADTVFHLRALRALGIVLVEELKERL 90

Query: 76 KDLS 79
          +  S
Sbjct: 91 RQSS 94


>gi|413935094|gb|AFW69645.1| hypothetical protein ZEAMMB73_989907 [Zea mays]
          Length = 153

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 44/64 (68%), Gaps = 1/64 (1%)

Query: 17 DVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANK-FDLKTLRALGFILMEHLKEKV 75
          D  WL +LSE E+D+L+SL+ + + RA   GH  LA+  F L+ LRALG +L+E LKE++
Sbjct: 35 DARWLETLSEPELDVLVSLREVAVTRASDAGHPGLADTVFHLRALRALGVVLLEELKERL 94

Query: 76 KDLS 79
          +  S
Sbjct: 95 RQSS 98


>gi|115470114|ref|NP_001058656.1| Os06g0730500 [Oryza sativa Japonica Group]
 gi|54291357|dbj|BAD62123.1| unknown protein [Oryza sativa Japonica Group]
 gi|54291558|dbj|BAD62482.1| unknown protein [Oryza sativa Japonica Group]
 gi|113596696|dbj|BAF20570.1| Os06g0730500 [Oryza sativa Japonica Group]
 gi|218198928|gb|EEC81355.1| hypothetical protein OsI_24548 [Oryza sativa Indica Group]
 gi|222636269|gb|EEE66401.1| hypothetical protein OsJ_22743 [Oryza sativa Japonica Group]
          Length = 173

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 52/85 (61%), Gaps = 10/85 (11%)

Query: 3   VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLIL---QRAK--VIGHEELANKFDL 57
            +E +      + +D   L +LS+ E+D+LI+LK L +   + A   V+GH+     +DL
Sbjct: 32  AEEPSSPTRGGQQQDSRRLRTLSDPELDLLITLKDLAMVCTENASLAVLGHD-----YDL 86

Query: 58  KTLRALGFILMEHLKEKVKDLSLFP 82
            TLRALG +L+E LKE++K+ S+ P
Sbjct: 87  PTLRALGIVLLETLKERLKETSIDP 111


>gi|359473955|ref|XP_003631381.1| PREDICTED: uncharacterized protein LOC100854404 [Vitis vinifera]
          Length = 83

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/53 (47%), Positives = 34/53 (64%)

Query: 68  MEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
           ME+L+ +VKDLS  PG A    F+  CNLLKC   D ++ E+LK C+  + KR
Sbjct: 1   MEYLRGQVKDLSAIPGLAGLDKFLNECNLLKCSLKDTISTEELKACICTNSKR 53


>gi|5902376|gb|AAD55478.1|AC009322_18 Hypothetical protein [Arabidopsis thaliana]
          Length = 898

 Score = 38.9 bits (89), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 3/61 (4%)

Query: 55  FDL--KTLRALGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLK 111
           FDL      + G +LMEH +++V+ D SL P     L  +  CNLLK   DD + +E++ 
Sbjct: 92  FDLMNPVCYSAGLVLMEHARKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEIL 151

Query: 112 T 112
           T
Sbjct: 152 T 152


>gi|359726029|ref|ZP_09264725.1| hypothetical protein Lwei2_02949 [Leptospira weilii str.
           2006001855]
          Length = 340

 Score = 37.4 bits (85), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 28/44 (63%)

Query: 38  LILQRAKVIGHEELANKFDLKTLRALGFILMEHLKEKVKDLSLF 81
           LIL RAK + + EL N F+ +++ +   I  E+ +EK+  LSLF
Sbjct: 289 LILSRAKYLQYNELNNSFNFRSIISSDLIKYEYTQEKLDSLSLF 332


>gi|417779477|ref|ZP_12427263.1| hypothetical protein LEP1GSC036_2287 [Leptospira weilii str.
           2006001853]
 gi|410780380|gb|EKR64973.1| hypothetical protein LEP1GSC036_2287 [Leptospira weilii str.
           2006001853]
          Length = 340

 Score = 37.0 bits (84), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 28/44 (63%)

Query: 38  LILQRAKVIGHEELANKFDLKTLRALGFILMEHLKEKVKDLSLF 81
           LIL RAK + + EL N F+ +++ +   I  E+ +EK+  LSLF
Sbjct: 289 LILSRAKYLQYNELNNSFNFRSIISSDLIKYEYTQEKLDSLSLF 332


>gi|170729045|ref|YP_001763071.1| general secretion pathway protein L [Shewanella woodyi ATCC 51908]
 gi|169814392|gb|ACA88976.1| general secretion pathway protein L [Shewanella woodyi ATCC 51908]
          Length = 399

 Score = 36.2 bits (82), Expect = 2.3,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 4   KEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRAL 63
           ++  EI++   ++D S L SL+E   +  + +  L+   A  +   EL  K   + ++AL
Sbjct: 26  EQEQEIIASGELKDASALSSLTERAGNRPVDV--LVPSSAITLTSVELPEKGQRQAIQAL 83

Query: 64  GFILMEHLKEKVKDLSLFPG 83
            F+L E L E V +L   PG
Sbjct: 84  PFMLEESLAENVDELHFVPG 103


>gi|428165039|gb|EKX34045.1| hypothetical protein GUITHDRAFT_119786 [Guillardia theta CCMP2712]
          Length = 930

 Score = 35.8 bits (81), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 12  KNRVEDVSWL-------CSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALG 64
           + R E V W+       CS+S S +   +    ++  R +V G+  +AN   +K  R LG
Sbjct: 407 RRREERVKWVRRCIDLSCSISSSRLQFRLFKCAVVAHREQVDGNRRVANSMRIKKRRELG 466

Query: 65  FILMEHLKEKVKDLSL 80
              ++  KE+VK  SL
Sbjct: 467 RSCLQTWKEEVKSSSL 482


>gi|386824537|ref|ZP_10111670.1| diguanylate cyclase [Serratia plymuthica PRI-2C]
 gi|386378494|gb|EIJ19298.1| diguanylate cyclase [Serratia plymuthica PRI-2C]
          Length = 486

 Score = 35.4 bits (80), Expect = 4.2,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 1/75 (1%)

Query: 46  IGHEELANKFDLKTLRALGFILMEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDIL 105
           +G EE A     + L A G  L EHL++ V+   L+PG   PL       +     +++ 
Sbjct: 387 LGGEEFAIMLP-RALAAQGVALAEHLRQLVQKTDLYPGGKTPLKITISVGVASLAMNEVK 445

Query: 106 TVEKLKTCLHIDLKR 120
           ++E+L     I L R
Sbjct: 446 SLEQLMNMADIALYR 460


>gi|157377480|ref|YP_001476080.1| general secretion pathway protein L [Shewanella sediminis HAW-EB3]
 gi|157319854|gb|ABV38952.1| general secretion pathway protein L [Shewanella sediminis HAW-EB3]
          Length = 399

 Score = 35.4 bits (80), Expect = 4.4,   Method: Composition-based stats.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 12/92 (13%)

Query: 4   KEMNEILSKNRVEDVSWLCSLSESE----VDMLISLKLLILQRAKVIGHEELANKFDLKT 59
           ++  EI++   V+D + L SL+E      VD+L+    + L +       +L  K   + 
Sbjct: 26  EQEQEIIASGEVKDAAALSSLTERAGNRPVDVLVPSSSITLTKV------DLPEKGQRQA 79

Query: 60  LRALGFILMEHLKEKVKDLSLFPG--SAEPLA 89
           ++AL F+L E L E V+ L   PG  S E L+
Sbjct: 80  IQALPFMLEESLAENVEQLHFVPGPRSGESLS 111


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.138    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,687,604,591
Number of Sequences: 23463169
Number of extensions: 56519305
Number of successful extensions: 129280
Number of sequences better than 100.0: 30
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 129241
Number of HSP's gapped (non-prelim): 30
length of query: 120
length of database: 8,064,228,071
effective HSP length: 87
effective length of query: 33
effective length of database: 6,022,932,368
effective search space: 198756768144
effective search space used: 198756768144
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 69 (31.2 bits)