BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033372
(120 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297742396|emb|CBI34545.3| unnamed protein product [Vitis vinifera]
Length = 147
Score = 142 bits (357), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 70/112 (62%), Positives = 89/112 (79%)
Query: 9 ILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILM 68
+L NR+EDV+WLCSLS+SE+DMLISLK+++L+RAKVIGHE+LA KFDLK LRALGFILM
Sbjct: 6 VLGSNRLEDVNWLCSLSDSELDMLISLKMMVLRRAKVIGHEDLAEKFDLKMLRALGFILM 65
Query: 69 EHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
E+L+ +VKDLS PG A F+ CNLLKC D ++ E+LK C+ + KR
Sbjct: 66 EYLRGQVKDLSAIPGLAGLDKFLNECNLLKCSLKDTISTEELKACICTNSKR 117
>gi|356556924|ref|XP_003546770.1| PREDICTED: uncharacterized protein LOC100818832 [Glycine max]
Length = 151
Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 61/106 (57%), Positives = 82/106 (77%)
Query: 13 NRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLK 72
RVEDV+WLCSLSESE+DMLISLKLLI+QRAK++G +ELA+KF+LK +RA+ +LM HLK
Sbjct: 25 TRVEDVAWLCSLSESEIDMLISLKLLIIQRAKMMGCKELASKFNLKMIRAIALVLMGHLK 84
Query: 73 EKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
E++KD SL P + +F+ CNLLKC N+ +++L T L D+
Sbjct: 85 EEIKDSSLIPNMVKSTSFLDACNLLKCSNEVDANIDELSTSLGADI 130
>gi|356525841|ref|XP_003531530.1| PREDICTED: uncharacterized protein LOC100778207 isoform 1 [Glycine
max]
gi|356525843|ref|XP_003531531.1| PREDICTED: uncharacterized protein LOC100778207 isoform 2 [Glycine
max]
Length = 154
Score = 124 bits (312), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 60/106 (56%), Positives = 80/106 (75%)
Query: 13 NRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLK 72
RVEDV+WLCSLSESE+DMLISLKLLI+QRAK++G +ELA+KF+LK +RA+ +LMEHLK
Sbjct: 25 TRVEDVAWLCSLSESEIDMLISLKLLIIQRAKMMGCKELASKFNLKMIRAIALVLMEHLK 84
Query: 73 EKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
++K SL P + E +F+ CNLLKC N+ ++ L L D+
Sbjct: 85 AEIKGSSLIPNTVESTSFLDACNLLKCSNEVDANIDDLSASLGADI 130
>gi|357513171|ref|XP_003626874.1| hypothetical protein MTR_8g011490 [Medicago truncatula]
gi|355520896|gb|AET01350.1| hypothetical protein MTR_8g011490 [Medicago truncatula]
Length = 168
Score = 123 bits (308), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 61/110 (55%), Positives = 85/110 (77%)
Query: 10 LSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILME 69
+++ RV+DV+WLCSLSESE+DMLISLKLLI++RA+ IG + LANKFDLK LRA+ +LME
Sbjct: 20 ITETRVQDVAWLCSLSESEIDMLISLKLLIIKRAEWIGCKNLANKFDLKMLRAIALVLME 79
Query: 70 HLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLK 119
+LK +VKD SL P + AF+ CNLL C+++ T+E+L T + D++
Sbjct: 80 NLKAEVKDASLVPDMVKSTAFLDVCNLLNCNSEVSATIEELSTSVGADIQ 129
>gi|255647835|gb|ACU24377.1| unknown [Glycine max]
Length = 154
Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/106 (55%), Positives = 79/106 (74%)
Query: 13 NRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLK 72
RVEDV+WLCSLSESE+DMLISLKLLI+QRAK++G +ELA+KF+LK +RA+ +LMEHLK
Sbjct: 25 TRVEDVAWLCSLSESEIDMLISLKLLIIQRAKMMGCKELASKFNLKMIRAIALVLMEHLK 84
Query: 73 EKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
++K SL P + E +F+ CNLLKC N+ ++ L D+
Sbjct: 85 AEIKGSSLIPNTVESTSFLDACNLLKCSNEVDANIDDLSASFGADI 130
>gi|449506107|ref|XP_004162655.1| PREDICTED: uncharacterized LOC101210446 [Cucumis sativus]
Length = 182
Score = 121 bits (304), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/122 (51%), Positives = 89/122 (72%), Gaps = 7/122 (5%)
Query: 1 MEVKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTL 60
+ ++E NEIL NR+ED+SWLCSLSESE+D+LIS+K+L+LQRAK IGHE LA KFDL+TL
Sbjct: 46 ISMEEANEILKNNRIEDISWLCSLSESELDLLISIKMLVLQRAKAIGHENLAEKFDLRTL 105
Query: 61 RALGFILMEHLKEKVK--DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDL 118
RA+GF+LMEHLK K++ D+S S CNLL + + IL+++++ + D
Sbjct: 106 RAIGFVLMEHLKGKLRTSDVSDLSQST-----FNACNLLDSNLEKILSIDEIMASICSDR 160
Query: 119 KR 120
++
Sbjct: 161 RK 162
>gi|388510664|gb|AFK43398.1| unknown [Medicago truncatula]
Length = 168
Score = 121 bits (303), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 60/110 (54%), Positives = 84/110 (76%)
Query: 10 LSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILME 69
+++ RV+DV+W CSLSESE+DMLISLKLLI++RA+ IG + LANKFDLK LRA+ +LME
Sbjct: 20 ITETRVQDVAWPCSLSESEIDMLISLKLLIIKRAEWIGCKNLANKFDLKMLRAIALVLME 79
Query: 70 HLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLK 119
+LK +VKD SL P + AF+ CNLL C+++ T+E+L T + D++
Sbjct: 80 NLKAEVKDASLVPDMVKSTAFLDVCNLLNCNSEVSATIEELSTSVGADIQ 129
>gi|224075272|ref|XP_002304585.1| predicted protein [Populus trichocarpa]
gi|222842017|gb|EEE79564.1| predicted protein [Populus trichocarpa]
Length = 136
Score = 120 bits (302), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 65/114 (57%), Positives = 80/114 (70%)
Query: 7 NEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFI 66
E L N ED+ WLC+LSESE+DMLI+LK LIL RAKV+GH+ELA KFD TLRA+G
Sbjct: 5 GETLVDNSPEDIRWLCNLSESELDMLITLKSLILHRAKVLGHDELAKKFDSPTLRAVGLF 64
Query: 67 LMEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
LME+LK KVKDLS G + AF CNLLK + D ++E+LK + ID +R
Sbjct: 65 LMEYLKGKVKDLSHVQGLTKLAAFSDCCNLLKGNPGDDSSIEELKASIDIDERR 118
>gi|62320079|dbj|BAD94245.1| hypothetical protein [Arabidopsis thaliana]
Length = 127
Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/111 (53%), Positives = 82/111 (73%), Gaps = 1/111 (0%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
++E+ +IL NR +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LR+
Sbjct: 1 MEELTQILKNNRTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRS 60
Query: 63 LGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
LG +LMEH K++V+ D SL P L + CNLLK DD + +E++ T
Sbjct: 61 LGLVLMEHAKKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEILT 111
>gi|18412725|ref|NP_565234.1| Spc97 / Spc98 family of spindle pole body (SBP) component
[Arabidopsis thaliana]
gi|42572197|ref|NP_974189.1| Spc97 / Spc98 family of spindle pole body (SBP) component
[Arabidopsis thaliana]
gi|145327753|ref|NP_001077852.1| Spc97 / Spc98 family of spindle pole body (SBP) component
[Arabidopsis thaliana]
gi|111074472|gb|ABH04609.1| At1g80245 [Arabidopsis thaliana]
gi|332198254|gb|AEE36375.1| Spc97 / Spc98 family of spindle pole body (SBP) component
[Arabidopsis thaliana]
gi|332198255|gb|AEE36376.1| Spc97 / Spc98 family of spindle pole body (SBP) component
[Arabidopsis thaliana]
gi|332198256|gb|AEE36377.1| Spc97 / Spc98 family of spindle pole body (SBP) component
[Arabidopsis thaliana]
Length = 127
Score = 118 bits (295), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 58/111 (52%), Positives = 82/111 (73%), Gaps = 1/111 (0%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
++E+ +IL NR +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LR+
Sbjct: 1 MEELTQILKNNRTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRS 60
Query: 63 LGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
LG +LMEH +++V+ D SL P L + CNLLK DD + +E++ T
Sbjct: 61 LGLVLMEHARKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEILT 111
>gi|449453423|ref|XP_004144457.1| PREDICTED: uncharacterized protein LOC101210446 [Cucumis sativus]
Length = 234
Score = 115 bits (288), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 59/112 (52%), Positives = 83/112 (74%), Gaps = 7/112 (6%)
Query: 1 MEVKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTL 60
+ ++E NEIL R+E +SWLCSLSESE+D+LIS+K+L+LQRAK +GHE LA KFDL+TL
Sbjct: 98 ISMEEANEILKNKRIEGISWLCSLSESELDLLISIKMLVLQRAKAVGHENLAEKFDLRTL 157
Query: 61 RALGFILMEHLKEKVK--DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKL 110
RA+GF+LMEHLK K++ D+S S CNLL + + IL+++++
Sbjct: 158 RAIGFVLMEHLKGKLRTSDVSDLSQST-----FNACNLLDSNLEKILSIDEI 204
>gi|21537144|gb|AAM61485.1| unknown [Arabidopsis thaliana]
Length = 127
Score = 115 bits (288), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 81/111 (72%), Gaps = 1/111 (0%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
++E+ +IL N +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LR+
Sbjct: 1 MEELTQILKNNSTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRS 60
Query: 63 LGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
LG +LMEH +++V+ D SL P L + CNLLK DD + +E++ T
Sbjct: 61 LGLVLMEHARKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEILT 111
>gi|297842843|ref|XP_002889303.1| hypothetical protein ARALYDRAFT_316935 [Arabidopsis lyrata subsp.
lyrata]
gi|297335144|gb|EFH65562.1| hypothetical protein ARALYDRAFT_316935 [Arabidopsis lyrata subsp.
lyrata]
Length = 137
Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/111 (50%), Positives = 84/111 (75%), Gaps = 1/111 (0%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRA 62
++E++++L NR +D++W CSLSESE+D+LISLK L +QRAK+ GH+ELA+KFDLK LRA
Sbjct: 1 MEELSQLLKNNRTDDLTWFCSLSESELDLLISLKKLAIQRAKISGHQELADKFDLKLLRA 60
Query: 63 LGFILMEHLKEKVKDLS-LFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
LG +LME+++++V+D + L P L + CNLLK DD + +E++ T
Sbjct: 61 LGLVLMEYVRKRVQDDTCLAPSVVHQLMLLDNCNLLKTHEDDTVDMEEILT 111
>gi|145361194|ref|NP_680548.2| uncharacterized protein [Arabidopsis thaliana]
gi|186511402|ref|NP_001118906.1| uncharacterized protein [Arabidopsis thaliana]
gi|61742725|gb|AAX55183.1| hypothetical protein At4g00695 [Arabidopsis thaliana]
gi|332656520|gb|AEE81920.1| uncharacterized protein [Arabidopsis thaliana]
gi|332656521|gb|AEE81921.1| uncharacterized protein [Arabidopsis thaliana]
Length = 128
Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/112 (50%), Positives = 86/112 (76%), Gaps = 2/112 (1%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEEL-ANKFDLKTLR 61
++E+++IL NR +D++W CSLSESE+D+LISLK L+++RAKV G E+L A+KFDLK LR
Sbjct: 1 MEELSQILKNNRTDDLTWFCSLSESELDLLISLKKLVIRRAKVSGLEDLVADKFDLKMLR 60
Query: 62 ALGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
+LG +LME+++++V+ D SL P + L+ + CNLLK DD + +E++ T
Sbjct: 61 SLGLVLMEYVRKRVEDDTSLAPSVVQELSLLDSCNLLKTHVDDTVDIEEILT 112
>gi|334186256|ref|NP_001190646.1| uncharacterized protein [Arabidopsis thaliana]
gi|332656522|gb|AEE81922.1| uncharacterized protein [Arabidopsis thaliana]
Length = 153
Score = 109 bits (272), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/112 (50%), Positives = 86/112 (76%), Gaps = 2/112 (1%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEEL-ANKFDLKTLR 61
++E+++IL NR +D++W CSLSESE+D+LISLK L+++RAKV G E+L A+KFDLK LR
Sbjct: 1 MEELSQILKNNRTDDLTWFCSLSESELDLLISLKKLVIRRAKVSGLEDLVADKFDLKMLR 60
Query: 62 ALGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKT 112
+LG +LME+++++V+ D SL P + L+ + CNLLK DD + +E++ T
Sbjct: 61 SLGLVLMEYVRKRVEDDTSLAPSVVQELSLLDSCNLLKTHVDDTVDIEEILT 112
>gi|224053697|ref|XP_002297934.1| predicted protein [Populus trichocarpa]
gi|222845192|gb|EEE82739.1| predicted protein [Populus trichocarpa]
Length = 136
Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 77/114 (67%)
Query: 7 NEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFI 66
+IL N E + WLC+LSESE+DMLI LK LIL RAKV+GH+ELA FDL TLRA+ +
Sbjct: 5 QDILGNNSPEVIRWLCNLSESELDMLIRLKSLILHRAKVLGHDELAKNFDLPTLRAIALL 64
Query: 67 LMEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
LME+LK K K S G + + F CNLL+ + + ++E+LK C+ ID ++
Sbjct: 65 LMEYLKGKFKHSSQVQGLTKLVVFPECCNLLEGNPGEDSSMEELKACIGIDERK 118
>gi|302143257|emb|CBI20552.3| unnamed protein product [Vitis vinifera]
Length = 122
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/92 (58%), Positives = 67/92 (72%), Gaps = 2/92 (2%)
Query: 31 MLISLKLLILQRAKVIGHEELANKFDLKTLRAL--GFILMEHLKEKVKDLSLFPGSAEPL 88
MLISLK+++L+RAKVIGHE+LA KFDLK LRAL GFILME+L+ +VKDLS PG A
Sbjct: 1 MLISLKMMVLRRAKVIGHEDLAEKFDLKMLRALGGGFILMEYLRGQVKDLSTIPGLAGLD 60
Query: 89 AFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
F CNLLKC D ++ E+ K C+ + KR
Sbjct: 61 RFFNECNLLKCSLKDPMSTEEQKACISTNSKR 92
>gi|147804743|emb|CAN69358.1| hypothetical protein VITISV_024910 [Vitis vinifera]
Length = 583
Score = 87.4 bits (215), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 40/59 (67%), Positives = 53/59 (89%)
Query: 5 EMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRAL 63
++ ++L NR+EDV+WLCSL+ESE+DMLISLK+++L+RAKVIGHE+LA KFDLK L AL
Sbjct: 2 DIQQVLGSNRLEDVNWLCSLAESELDMLISLKMMVLRRAKVIGHEDLAEKFDLKMLXAL 60
>gi|357117060|ref|XP_003560293.1| PREDICTED: uncharacterized protein LOC100841286 [Brachypodium
distachyon]
Length = 197
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/67 (55%), Positives = 48/67 (71%)
Query: 16 EDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALGFILMEHLKEKV 75
+D WL +LSE E+D LI LK L RAK GH LA++FDL+TLRALG +L+ LKE++
Sbjct: 40 DDERWLAALSEPELDFLICLKKLAATRAKTAGHPHLADQFDLRTLRALGVVLLGGLKERL 99
Query: 76 KDLSLFP 82
K+ SL P
Sbjct: 100 KETSLDP 106
>gi|242094314|ref|XP_002437647.1| hypothetical protein SORBIDRAFT_10g031150 [Sorghum bicolor]
gi|241915870|gb|EER89014.1| hypothetical protein SORBIDRAFT_10g031150 [Sorghum bicolor]
Length = 156
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 44/64 (68%), Gaps = 1/64 (1%)
Query: 17 DVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANK-FDLKTLRALGFILMEHLKEKV 75
D WL +LSE E+D+L+SL+ L + RA GH LA+ F L+ LRALG +L+E LKE++
Sbjct: 31 DAMWLQTLSEPELDVLVSLRELAVTRATNAGHPALADTVFHLRALRALGIVLVEELKERL 90
Query: 76 KDLS 79
+ S
Sbjct: 91 RQSS 94
>gi|413935094|gb|AFW69645.1| hypothetical protein ZEAMMB73_989907 [Zea mays]
Length = 153
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 44/64 (68%), Gaps = 1/64 (1%)
Query: 17 DVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANK-FDLKTLRALGFILMEHLKEKV 75
D WL +LSE E+D+L+SL+ + + RA GH LA+ F L+ LRALG +L+E LKE++
Sbjct: 35 DARWLETLSEPELDVLVSLREVAVTRASDAGHPGLADTVFHLRALRALGVVLLEELKERL 94
Query: 76 KDLS 79
+ S
Sbjct: 95 RQSS 98
>gi|115470114|ref|NP_001058656.1| Os06g0730500 [Oryza sativa Japonica Group]
gi|54291357|dbj|BAD62123.1| unknown protein [Oryza sativa Japonica Group]
gi|54291558|dbj|BAD62482.1| unknown protein [Oryza sativa Japonica Group]
gi|113596696|dbj|BAF20570.1| Os06g0730500 [Oryza sativa Japonica Group]
gi|218198928|gb|EEC81355.1| hypothetical protein OsI_24548 [Oryza sativa Indica Group]
gi|222636269|gb|EEE66401.1| hypothetical protein OsJ_22743 [Oryza sativa Japonica Group]
Length = 173
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 52/85 (61%), Gaps = 10/85 (11%)
Query: 3 VKEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLIL---QRAK--VIGHEELANKFDL 57
+E + + +D L +LS+ E+D+LI+LK L + + A V+GH+ +DL
Sbjct: 32 AEEPSSPTRGGQQQDSRRLRTLSDPELDLLITLKDLAMVCTENASLAVLGHD-----YDL 86
Query: 58 KTLRALGFILMEHLKEKVKDLSLFP 82
TLRALG +L+E LKE++K+ S+ P
Sbjct: 87 PTLRALGIVLLETLKERLKETSIDP 111
>gi|359473955|ref|XP_003631381.1| PREDICTED: uncharacterized protein LOC100854404 [Vitis vinifera]
Length = 83
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 34/53 (64%)
Query: 68 MEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLKTCLHIDLKR 120
ME+L+ +VKDLS PG A F+ CNLLKC D ++ E+LK C+ + KR
Sbjct: 1 MEYLRGQVKDLSAIPGLAGLDKFLNECNLLKCSLKDTISTEELKACICTNSKR 53
>gi|5902376|gb|AAD55478.1|AC009322_18 Hypothetical protein [Arabidopsis thaliana]
Length = 898
Score = 38.9 bits (89), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 3/61 (4%)
Query: 55 FDL--KTLRALGFILMEHLKEKVK-DLSLFPGSAEPLAFVAGCNLLKCDNDDILTVEKLK 111
FDL + G +LMEH +++V+ D SL P L + CNLLK DD + +E++
Sbjct: 92 FDLMNPVCYSAGLVLMEHARKRVQNDTSLAPSVVHQLRLLDNCNLLKTHVDDAVDIEEIL 151
Query: 112 T 112
T
Sbjct: 152 T 152
>gi|359726029|ref|ZP_09264725.1| hypothetical protein Lwei2_02949 [Leptospira weilii str.
2006001855]
Length = 340
Score = 37.4 bits (85), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 28/44 (63%)
Query: 38 LILQRAKVIGHEELANKFDLKTLRALGFILMEHLKEKVKDLSLF 81
LIL RAK + + EL N F+ +++ + I E+ +EK+ LSLF
Sbjct: 289 LILSRAKYLQYNELNNSFNFRSIISSDLIKYEYTQEKLDSLSLF 332
>gi|417779477|ref|ZP_12427263.1| hypothetical protein LEP1GSC036_2287 [Leptospira weilii str.
2006001853]
gi|410780380|gb|EKR64973.1| hypothetical protein LEP1GSC036_2287 [Leptospira weilii str.
2006001853]
Length = 340
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 28/44 (63%)
Query: 38 LILQRAKVIGHEELANKFDLKTLRALGFILMEHLKEKVKDLSLF 81
LIL RAK + + EL N F+ +++ + I E+ +EK+ LSLF
Sbjct: 289 LILSRAKYLQYNELNNSFNFRSIISSDLIKYEYTQEKLDSLSLF 332
>gi|170729045|ref|YP_001763071.1| general secretion pathway protein L [Shewanella woodyi ATCC 51908]
gi|169814392|gb|ACA88976.1| general secretion pathway protein L [Shewanella woodyi ATCC 51908]
Length = 399
Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 4 KEMNEILSKNRVEDVSWLCSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRAL 63
++ EI++ ++D S L SL+E + + + L+ A + EL K + ++AL
Sbjct: 26 EQEQEIIASGELKDASALSSLTERAGNRPVDV--LVPSSAITLTSVELPEKGQRQAIQAL 83
Query: 64 GFILMEHLKEKVKDLSLFPG 83
F+L E L E V +L PG
Sbjct: 84 PFMLEESLAENVDELHFVPG 103
>gi|428165039|gb|EKX34045.1| hypothetical protein GUITHDRAFT_119786 [Guillardia theta CCMP2712]
Length = 930
Score = 35.8 bits (81), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
Query: 12 KNRVEDVSWL-------CSLSESEVDMLISLKLLILQRAKVIGHEELANKFDLKTLRALG 64
+ R E V W+ CS+S S + + ++ R +V G+ +AN +K R LG
Sbjct: 407 RRREERVKWVRRCIDLSCSISSSRLQFRLFKCAVVAHREQVDGNRRVANSMRIKKRRELG 466
Query: 65 FILMEHLKEKVKDLSL 80
++ KE+VK SL
Sbjct: 467 RSCLQTWKEEVKSSSL 482
>gi|386824537|ref|ZP_10111670.1| diguanylate cyclase [Serratia plymuthica PRI-2C]
gi|386378494|gb|EIJ19298.1| diguanylate cyclase [Serratia plymuthica PRI-2C]
Length = 486
Score = 35.4 bits (80), Expect = 4.2, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 46 IGHEELANKFDLKTLRALGFILMEHLKEKVKDLSLFPGSAEPLAFVAGCNLLKCDNDDIL 105
+G EE A + L A G L EHL++ V+ L+PG PL + +++
Sbjct: 387 LGGEEFAIMLP-RALAAQGVALAEHLRQLVQKTDLYPGGKTPLKITISVGVASLAMNEVK 445
Query: 106 TVEKLKTCLHIDLKR 120
++E+L I L R
Sbjct: 446 SLEQLMNMADIALYR 460
>gi|157377480|ref|YP_001476080.1| general secretion pathway protein L [Shewanella sediminis HAW-EB3]
gi|157319854|gb|ABV38952.1| general secretion pathway protein L [Shewanella sediminis HAW-EB3]
Length = 399
Score = 35.4 bits (80), Expect = 4.4, Method: Composition-based stats.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 12/92 (13%)
Query: 4 KEMNEILSKNRVEDVSWLCSLSESE----VDMLISLKLLILQRAKVIGHEELANKFDLKT 59
++ EI++ V+D + L SL+E VD+L+ + L + +L K +
Sbjct: 26 EQEQEIIASGEVKDAAALSSLTERAGNRPVDVLVPSSSITLTKV------DLPEKGQRQA 79
Query: 60 LRALGFILMEHLKEKVKDLSLFPG--SAEPLA 89
++AL F+L E L E V+ L PG S E L+
Sbjct: 80 IQALPFMLEESLAENVEQLHFVPGPRSGESLS 111
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,687,604,591
Number of Sequences: 23463169
Number of extensions: 56519305
Number of successful extensions: 129280
Number of sequences better than 100.0: 30
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 129241
Number of HSP's gapped (non-prelim): 30
length of query: 120
length of database: 8,064,228,071
effective HSP length: 87
effective length of query: 33
effective length of database: 6,022,932,368
effective search space: 198756768144
effective search space used: 198756768144
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 69 (31.2 bits)