BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 030190
(181 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225424758|ref|XP_002268027.1| PREDICTED: uncharacterized protein LOC100260319 [Vitis vinifera]
gi|296086507|emb|CBI32096.3| unnamed protein product [Vitis vinifera]
Length = 192
Score = 226 bits (577), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 109/138 (78%), Positives = 120/138 (86%), Gaps = 2/138 (1%)
Query: 43 LHDSSTISKKAQLSPLSFSAS-FRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAK 101
LH S T + QL L FS+S F NHICRAAEYKFPDPIPEFA ET+KFRTHLLNK +K
Sbjct: 53 LHSSLTTKLQPQLHSLVFSSSSFANHICRAAEYKFPDPIPEFAQVETEKFRTHLLNKFSK 112
Query: 102 KDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAA 161
KD++G+S+E+VVGICTEIFSTFLH+EYGGPGTLLV+PFIDMADTLNERGLPG QAARAA
Sbjct: 113 KDIYGESVEEVVGICTEIFSTFLHTEYGGPGTLLVIPFIDMADTLNERGLPGS-QAARAA 171
Query: 162 VKWAQRHVDKDWKEWTGD 179
VKWAQ HVDKDW EWTGD
Sbjct: 172 VKWAQDHVDKDWNEWTGD 189
>gi|255588186|ref|XP_002534527.1| conserved hypothetical protein [Ricinus communis]
gi|223525108|gb|EEF27857.1| conserved hypothetical protein [Ricinus communis]
Length = 188
Score = 217 bits (552), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 101/125 (80%), Positives = 108/125 (86%)
Query: 54 QLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVV 113
L P NHICRAAEYKFPDPIPEFAD ET+KFRTHL KL+KK + GDS+E+VV
Sbjct: 60 HLFPYVTKNKGNNHICRAAEYKFPDPIPEFADVETEKFRTHLQKKLSKKGVCGDSIEEVV 119
Query: 114 GICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDW 173
GICTEIFSTFLHSEYGGPGTLLV+PF DMADT+NERGLPGGPQAARAAVKWAQ HVDKDW
Sbjct: 120 GICTEIFSTFLHSEYGGPGTLLVIPFCDMADTINERGLPGGPQAARAAVKWAQDHVDKDW 179
Query: 174 KEWTG 178
KEWTG
Sbjct: 180 KEWTG 184
>gi|449434881|ref|XP_004135224.1| PREDICTED: uncharacterized protein LOC101210245 [Cucumis sativus]
gi|449527396|ref|XP_004170697.1| PREDICTED: uncharacterized LOC101210245 [Cucumis sativus]
Length = 169
Score = 216 bits (550), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 95/119 (79%), Positives = 109/119 (91%)
Query: 61 SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIF 120
S+SF HIC AAEYKFPDPIPEFA +ET+KFR HLL KL+K+D++GDSL+ VVGICTEIF
Sbjct: 48 SSSFPTHICTAAEYKFPDPIPEFAVAETEKFRAHLLQKLSKRDIYGDSLQQVVGICTEIF 107
Query: 121 STFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
+TF+H+EYGGPGTLLVLPFIDM+D +NERGLPGGPQAARAAVKWAQ HVDKDW +WTGD
Sbjct: 108 NTFMHTEYGGPGTLLVLPFIDMSDAINERGLPGGPQAARAAVKWAQDHVDKDWNQWTGD 166
>gi|351734434|ref|NP_001237024.1| uncharacterized protein LOC100306067 precursor [Glycine max]
gi|255627431|gb|ACU14060.1| unknown [Glycine max]
Length = 173
Score = 215 bits (547), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 100/135 (74%), Positives = 120/135 (88%), Gaps = 2/135 (1%)
Query: 48 TISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGD 107
++ + + S L FS++ R + RAAEYKFPDPIPEFAD+ETQKF++HLL KL+KKDM+G+
Sbjct: 38 SVKGRRKFSTLPFSSTKR-LVTRAAEYKFPDPIPEFADAETQKFKSHLLKKLSKKDMYGE 96
Query: 108 SLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQR 167
S+E+VVGICTEIFSTFLHSEYGGPGTLLVLPF+DMAD LNE+GLPGGPQAARAA+KWAQ
Sbjct: 97 SIEEVVGICTEIFSTFLHSEYGGPGTLLVLPFVDMADALNEQGLPGGPQAARAAIKWAQD 156
Query: 168 HVDKDWKEWT-GDSD 181
HVDKDWK WT GD++
Sbjct: 157 HVDKDWKGWTNGDNN 171
>gi|18391169|ref|NP_563872.1| uncharacterized protein [Arabidopsis thaliana]
gi|145323826|ref|NP_001077502.1| uncharacterized protein [Arabidopsis thaliana]
gi|5091545|gb|AAD39574.1|AC007067_14 T10O24.14 [Arabidopsis thaliana]
gi|21554792|gb|AAM63692.1| unknown [Arabidopsis thaliana]
gi|149944329|gb|ABR46207.1| At1g10522 [Arabidopsis thaliana]
gi|332190468|gb|AEE28589.1| uncharacterized protein [Arabidopsis thaliana]
gi|332190469|gb|AEE28590.1| uncharacterized protein [Arabidopsis thaliana]
Length = 179
Score = 213 bits (541), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 93/118 (78%), Positives = 108/118 (91%)
Query: 61 SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIF 120
S S R +CRAAEYKFPDPIPEFA++ET+KFR H+LNKL+K+D+F DS++++VG+CTEIF
Sbjct: 60 SLSRRGFVCRAAEYKFPDPIPEFAEAETEKFRDHMLNKLSKRDLFEDSVDEIVGVCTEIF 119
Query: 121 STFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
TFL SEYGGPGTLLV+PFIDMADTLNER LPGGPQAARAA+KWAQ HVDKDWKEWTG
Sbjct: 120 ETFLRSEYGGPGTLLVIPFIDMADTLNERELPGGPQAARAAIKWAQDHVDKDWKEWTG 177
>gi|388507390|gb|AFK41761.1| unknown [Lotus japonicus]
Length = 182
Score = 208 bits (529), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 93/134 (69%), Positives = 116/134 (86%), Gaps = 1/134 (0%)
Query: 48 TISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGD 107
+++ + +L+ L FS++ R + RAAEYKFPDPIPEFAD+ET+KF++HLLNKL+K+D++GD
Sbjct: 48 SVNGRRKLATLPFSSTKR-FVIRAAEYKFPDPIPEFADAETEKFKSHLLNKLSKRDIYGD 106
Query: 108 SLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQR 167
S+ +VVG+CTEIF +FLHSEYGGPGTLLV PF DMAD ++ERGLPGGPQAAR AVKWAQ
Sbjct: 107 SVGEVVGVCTEIFGSFLHSEYGGPGTLLVNPFADMADAVSERGLPGGPQAARVAVKWAQN 166
Query: 168 HVDKDWKEWTGDSD 181
HVDKDW EWTG D
Sbjct: 167 HVDKDWNEWTGGGD 180
>gi|297843868|ref|XP_002889815.1| hypothetical protein ARALYDRAFT_471166 [Arabidopsis lyrata subsp.
lyrata]
gi|297335657|gb|EFH66074.1| hypothetical protein ARALYDRAFT_471166 [Arabidopsis lyrata subsp.
lyrata]
Length = 180
Score = 206 bits (523), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 91/118 (77%), Positives = 105/118 (88%)
Query: 61 SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIF 120
S S R +CRAAEYKFPDPIPEFA++ET+KFR H+ KL+K+D+F DS++++VGICTEIF
Sbjct: 62 SLSRRGFVCRAAEYKFPDPIPEFAEAETEKFRDHMSKKLSKRDLFEDSVDEIVGICTEIF 121
Query: 121 STFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
TFL SEYGGPGTLLV+PFIDMADTL ER LPGGPQAARAA+KWAQ HVDKDWKEWTG
Sbjct: 122 GTFLRSEYGGPGTLLVIPFIDMADTLKERELPGGPQAARAAIKWAQDHVDKDWKEWTG 179
>gi|357516739|ref|XP_003628658.1| hypothetical protein MTR_8g063150 [Medicago truncatula]
gi|355522680|gb|AET03134.1| hypothetical protein MTR_8g063150 [Medicago truncatula]
gi|388491344|gb|AFK33738.1| unknown [Medicago truncatula]
Length = 172
Score = 191 bits (486), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 98/131 (74%), Positives = 112/131 (85%), Gaps = 3/131 (2%)
Query: 48 TISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGD 107
+I++K P S S + I RAAEYKFPDPIPEFADSET+KF+ HLLNKL+KKD+F +
Sbjct: 40 SINRKVSTLPFS---SIKRSITRAAEYKFPDPIPEFADSETEKFQNHLLNKLSKKDVFEE 96
Query: 108 SLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQR 167
S+E+VVG+CTEIFSTFLHSEYGGPGTLLV PFIDMAD +NERGLPGGPQAARAA+ WAQ
Sbjct: 97 SVEEVVGVCTEIFSTFLHSEYGGPGTLLVDPFIDMADIVNERGLPGGPQAARAAINWAQA 156
Query: 168 HVDKDWKEWTG 178
HVDKDW EWTG
Sbjct: 157 HVDKDWNEWTG 167
>gi|224108113|ref|XP_002314726.1| predicted protein [Populus trichocarpa]
gi|222863766|gb|EEF00897.1| predicted protein [Populus trichocarpa]
Length = 96
Score = 171 bits (433), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 76/92 (82%), Positives = 85/92 (92%)
Query: 87 ETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTL 146
ET+KF+THL KL KKD++GDSLE+VVGICTEIFSTFLH+EYGGPGTLLV PFIDMADT+
Sbjct: 1 ETEKFKTHLSKKLTKKDIYGDSLEEVVGICTEIFSTFLHAEYGGPGTLLVTPFIDMADTV 60
Query: 147 NERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
NE+GLPGGP+AAR AVKWAQ HVD DWKEWTG
Sbjct: 61 NEQGLPGGPEAARVAVKWAQAHVDNDWKEWTG 92
>gi|147777968|emb|CAN65133.1| hypothetical protein VITISV_018708 [Vitis vinifera]
Length = 170
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 79/93 (84%), Positives = 87/93 (93%), Gaps = 1/93 (1%)
Query: 87 ETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTL 146
ET+KFRTHLLNK +KKD++G S+E+VVGICTEIFSTFLH+EYGGPGTLLV+PFIDMADTL
Sbjct: 76 ETEKFRTHLLNKFSKKDIYGXSVEEVVGICTEIFSTFLHTEYGGPGTLLVIPFIDMADTL 135
Query: 147 NERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
NERGLP G QAARAAVKWAQ HVDKDW EWTGD
Sbjct: 136 NERGLP-GSQAARAAVKWAQDHVDKDWNEWTGD 167
>gi|225424768|ref|XP_002266853.1| PREDICTED: uncharacterized protein LOC100251854 [Vitis vinifera]
gi|296086501|emb|CBI32090.3| unnamed protein product [Vitis vinifera]
Length = 188
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/185 (45%), Positives = 108/185 (58%), Gaps = 25/185 (13%)
Query: 3 CYCNRSLISSKLDPCSPFSSSSLRLAFHRNSS-----VLLGGGIKLHDSST----ISKKA 53
C +L +S LD S LR+A +N +L + +H SST +S+K
Sbjct: 12 CQVPMALQTSLLD-------SVLRIAPAKNWPSPNILILRTPCVNVHPSSTSFPILSRKI 64
Query: 54 QLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDV 112
PL S +Y +PDPIPEFA++ET+KFR LL KL+K KD F D L V
Sbjct: 65 FPKPLRASTP--------QKYVYPDPIPEFAEAETEKFRVELLKKLSKEKDTFRDDLHAV 116
Query: 113 VGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKD 172
V +C +IFS FLH EYGGPGTLLV PF DM L E+ LPG P AARA++ WAQ +VD+D
Sbjct: 117 VAVCAQIFSEFLHKEYGGPGTLLVEPFTDMLVALKEKKLPGAPLAARASLLWAQNYVDQD 176
Query: 173 WKEWT 177
W+ W
Sbjct: 177 WEVWN 181
>gi|255581190|ref|XP_002531408.1| conserved hypothetical protein [Ricinus communis]
gi|223529001|gb|EEF30992.1| conserved hypothetical protein [Ricinus communis]
Length = 162
Score = 139 bits (350), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 65/120 (54%), Positives = 83/120 (69%), Gaps = 8/120 (6%)
Query: 61 SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEI 119
+ SF+N++ + DPIPEFA SET+KFR +L KL+K K FG L+ VV +C+EI
Sbjct: 47 TPSFQNYV-------YSDPIPEFAQSETEKFRAEILKKLSKDKQTFGGDLDKVVDVCSEI 99
Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
FS FLH EYGGPGTLLV PF DM L E+ LPG P AAR ++ WAQ +VD+DW+ W +
Sbjct: 100 FSDFLHKEYGGPGTLLVEPFTDMLIALKEKNLPGAPLAARESLLWAQNYVDQDWEAWNSN 159
>gi|357516745|ref|XP_003628661.1| hypothetical protein MTR_8g063190 [Medicago truncatula]
gi|355522683|gb|AET03137.1| hypothetical protein MTR_8g063190 [Medicago truncatula]
Length = 182
Score = 135 bits (339), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 62/109 (56%), Positives = 79/109 (72%), Gaps = 3/109 (2%)
Query: 71 AAEYKFPDPIPEFADSETQKFRTHLLNKLAKK---DMFGDSLEDVVGICTEIFSTFLHSE 127
+ ++ +PDPIPEFA+ ETQKF+ LL KL+ D FGD L+ VV +C +IFS FLH E
Sbjct: 69 SQKHVYPDPIPEFAEFETQKFKVQLLQKLSDSEDVDEFGDELDAVVHVCGQIFSEFLHKE 128
Query: 128 YGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
YGGPGTLLV PF DM L ++ LPG P AARA++ WAQ +VD+DW+ W
Sbjct: 129 YGGPGTLLVTPFTDMFVALKKKKLPGAPLAARASLLWAQNYVDEDWEIW 177
>gi|326509753|dbj|BAJ87092.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 171
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/109 (63%), Positives = 86/109 (78%), Gaps = 2/109 (1%)
Query: 73 EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEIFSTFLHSEYGG 130
+YKFPDPIPEFA+ ET KFR H+ +L +K D FG+ +ED+V +CTE+ TFL EY G
Sbjct: 62 DYKFPDPIPEFAEQETSKFREHMTWRLEQKKEDYFGEHVEDIVDVCTEVLGTFLQHEYCG 121
Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
PGTLLV PF+DM + ERGLPG PQAARAA+ WA++++DKDWKEWTGD
Sbjct: 122 PGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKEWTGD 170
>gi|449478547|ref|XP_004155348.1| PREDICTED: uncharacterized LOC101211725 [Cucumis sativus]
Length = 170
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 59/106 (55%), Positives = 76/106 (71%), Gaps = 1/106 (0%)
Query: 72 AEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEIFSTFLHSEYGG 130
A Y +P+PIPEFA+ ETQKF+ L KLAK ++ FG+ + VV +C++IF +LH EYGG
Sbjct: 58 ASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYLHVEYGG 117
Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
PGTL+V PF +M LNER LPG P AAR ++ WAQ H+D DW W
Sbjct: 118 PGTLIVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDNDWNIW 163
>gi|326523125|dbj|BAJ88603.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 168
Score = 133 bits (334), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/109 (63%), Positives = 86/109 (78%), Gaps = 2/109 (1%)
Query: 73 EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEIFSTFLHSEYGG 130
+YKFPDPIPEFA+ ET KFR H+ +L +K D FG+ +ED+V +CTE+ TFL EY G
Sbjct: 59 DYKFPDPIPEFAEQETSKFREHMTWRLEQKKEDYFGEHVEDIVDVCTEVLGTFLQHEYCG 118
Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
PGTLLV PF+DM + ERGLPG PQAARAA+ WA++++DKDWKEWTGD
Sbjct: 119 PGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKEWTGD 167
>gi|226501364|ref|NP_001145504.1| hypothetical protein [Zea mays]
gi|195657185|gb|ACG48060.1| hypothetical protein [Zea mays]
gi|414589435|tpg|DAA40006.1| TPA: hypothetical protein ZEAMMB73_406956 [Zea mays]
Length = 173
Score = 131 bits (330), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 71/120 (59%), Positives = 90/120 (75%), Gaps = 8/120 (6%)
Query: 68 ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
+CR A +YKFPDPIPEFA+ ET KFR H+ +L +K D FGD +E++V +CTEI
Sbjct: 53 VCRRAKNAGYEDYKFPDPIPEFAEQETSKFREHMAWRLEQKKEDYFGDHVEEIVDVCTEI 112
Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
TFL ++Y GPGTLLV PF+DM + ERGLPG PQAARAA+ WA+++VDKDWK WTG+
Sbjct: 113 MGTFLENDYRGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNVDKDWKAWTGE 172
>gi|195637804|gb|ACG38370.1| hypothetical protein [Zea mays]
Length = 173
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/120 (59%), Positives = 90/120 (75%), Gaps = 8/120 (6%)
Query: 68 ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
+CR A +YKFPDPIPEFA+ ET KFR H+ +L +K D FGD +E++V +CTEI
Sbjct: 53 VCRRAKNAGYEDYKFPDPIPEFAEQETSKFREHMAWRLEQKKEDYFGDHVEEIVDVCTEI 112
Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
TFL ++Y GPGTLLV PF+DM + ERGLPG PQAARAA+ WA+++VDKDWK WTG+
Sbjct: 113 LGTFLENDYRGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNVDKDWKAWTGE 172
>gi|388513625|gb|AFK44874.1| unknown [Lotus japonicus]
Length = 172
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/148 (45%), Positives = 91/148 (61%), Gaps = 7/148 (4%)
Query: 28 AFHRNSSVLLGGGIKLHDSSTISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSE 87
F N+ +L +L SS K L S + R+ + ++ +PDP PEFA+SE
Sbjct: 24 TFPSNTLILPHHCFRLSSSSLFVK------LRPSFALRDSLTTHKKHVYPDPSPEFAESE 77
Query: 88 TQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTL 146
TQKFR L KL++ +D FGD L V+ +C +IF+ FL EYGGPGTLLV PF DM L
Sbjct: 78 TQKFRIELFQKLSQDEDEFGDHLHAVIDVCAQIFNEFLQKEYGGPGTLLVEPFTDMLVAL 137
Query: 147 NERGLPGGPQAARAAVKWAQRHVDKDWK 174
++ LPG P AARA++ WAQ +VD DW+
Sbjct: 138 KKKKLPGAPLAARASLLWAQNYVDHDWE 165
>gi|242049158|ref|XP_002462323.1| hypothetical protein SORBIDRAFT_02g023810 [Sorghum bicolor]
gi|241925700|gb|EER98844.1| hypothetical protein SORBIDRAFT_02g023810 [Sorghum bicolor]
Length = 173
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/120 (59%), Positives = 90/120 (75%), Gaps = 8/120 (6%)
Query: 68 ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
+CR A +YKFPDPIPEFA+ ET KFR H+ +L +K D FGD +E++V ICTEI
Sbjct: 53 VCRRAKNAGYEDYKFPDPIPEFAEQETSKFREHMAWRLEQKKEDYFGDHVEEIVDICTEI 112
Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
TFL ++Y GPGTLLV PF+DM + ERGLPG PQAARAA+ WA++++DKDWK WTG+
Sbjct: 113 LGTFLENDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKAWTGE 172
>gi|449434893|ref|XP_004135230.1| PREDICTED: uncharacterized protein LOC101211725 [Cucumis sativus]
Length = 170
Score = 130 bits (326), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 58/106 (54%), Positives = 75/106 (70%), Gaps = 1/106 (0%)
Query: 72 AEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEIFSTFLHSEYGG 130
A Y +P+PIPEFA+ ETQKF+ L KLAK ++ FG+ + VV +C++IF +LH EYGG
Sbjct: 58 ASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYLHVEYGG 117
Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
PGTL+V PF +M LNER L G P AAR ++ WAQ H+D DW W
Sbjct: 118 PGTLIVEPFTNMFIALNERKLSGAPLAARTSLLWAQNHLDNDWNIW 163
>gi|357153554|ref|XP_003576489.1| PREDICTED: uncharacterized protein LOC100837297 [Brachypodium
distachyon]
Length = 169
Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/120 (55%), Positives = 89/120 (74%), Gaps = 8/120 (6%)
Query: 68 ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
+CR A +YKFPDPIPEFA+ ET KFR H+ +L +K + FG+ +E++V +CTE+
Sbjct: 49 VCRRAKNAAFDDYKFPDPIPEFAEQETSKFREHMTLRLEQKKEEYFGEHVEEIVDVCTEV 108
Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
TFL +Y GPGTLLV PF+DM + ERGLPG PQAARAA+ WA++++DKDWK WTG+
Sbjct: 109 LGTFLQHDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKAWTGE 168
>gi|115478941|ref|NP_001063064.1| Os09g0382400 [Oryza sativa Japonica Group]
gi|49387643|dbj|BAD25837.1| unknown protein [Oryza sativa Japonica Group]
gi|113631297|dbj|BAF24978.1| Os09g0382400 [Oryza sativa Japonica Group]
gi|215693061|dbj|BAG88481.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641488|gb|EEE69620.1| hypothetical protein OsJ_29199 [Oryza sativa Japonica Group]
Length = 172
Score = 125 bits (314), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 68/122 (55%), Positives = 90/122 (73%), Gaps = 8/122 (6%)
Query: 66 NHICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICT 117
+ +CR A +YKFPDPIPEFA ET KF+ H++ +L +K D FG+ +E++V +CT
Sbjct: 50 DFVCRRAKNAAYDDYKFPDPIPEFAAQETSKFKEHMMWRLEQKKDDYFGEHVEEIVDVCT 109
Query: 118 EIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWT 177
EI TFL +Y GPGTLLV PF+DM + ERGLPG PQAARAA+ WA++++DKDWK WT
Sbjct: 110 EILGTFLEHDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKAWT 169
Query: 178 GD 179
G+
Sbjct: 170 GE 171
>gi|218202081|gb|EEC84508.1| hypothetical protein OsI_31201 [Oryza sativa Indica Group]
Length = 188
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 90/137 (65%), Gaps = 23/137 (16%)
Query: 66 NHICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICT 117
+ +CR A +YKFPDPIPEFA ET KF+ H++ +L +K D FG+ +E++V +CT
Sbjct: 51 DFVCRRAKNAAYDDYKFPDPIPEFAAQETSKFKEHMMWRLEQKKDDYFGEHVEEIVDVCT 110
Query: 118 E---------------IFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAV 162
E I TFL +Y GPGTLLV PF+DM + ERGLPG PQAARAA+
Sbjct: 111 EASSCNFCTIACIGFLILGTFLEHDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAI 170
Query: 163 KWAQRHVDKDWKEWTGD 179
WA++++DKDWK WTG+
Sbjct: 171 AWAEKNIDKDWKAWTGE 187
>gi|356526856|ref|XP_003532032.1| PREDICTED: uncharacterized protein LOC100778043 [Glycine max]
Length = 163
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/105 (56%), Positives = 75/105 (71%), Gaps = 2/105 (1%)
Query: 74 YKFPDPIPEFADSETQKFRTHLLNKLAKKDM--FGDSLEDVVGICTEIFSTFLHSEYGGP 131
+ +PDP+PEFA+ ET KF+ L KL++ D+ FGD L+ VV +C +IFS FLH EYGGP
Sbjct: 55 HVYPDPVPEFAEHETHKFKVELFRKLSEDDVDEFGDDLDAVVAVCAQIFSEFLHKEYGGP 114
Query: 132 GTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
GTLLV PF DM L ++ LPG AARA++ WAQ VDKDW+ W
Sbjct: 115 GTLLVEPFTDMMVALKKKKLPGAALAARASLLWAQNFVDKDWEVW 159
>gi|357454523|ref|XP_003597542.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
gi|355486590|gb|AES67793.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
Length = 280
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 45/54 (83%), Positives = 48/54 (88%)
Query: 125 HSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
HSEYGGPGTLLV PFIDMAD +N+RGLPGGPQAAR A+ WAQ HVDKDW EWTG
Sbjct: 222 HSEYGGPGTLLVDPFIDMADIVNDRGLPGGPQAARTAINWAQAHVDKDWNEWTG 275
>gi|224108121|ref|XP_002314729.1| predicted protein [Populus trichocarpa]
gi|222863769|gb|EEF00900.1| predicted protein [Populus trichocarpa]
Length = 97
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 51/98 (52%), Gaps = 2/98 (2%)
Query: 76 FPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGIC-TEIFSTFLHSEYGGPGTL 134
PD + E + + + + +N G S E+ C +IFS FLH +YGG TL
Sbjct: 1 MPDSVIE-NEQDVRLWEMMQMNNCMLLQWHGCSGEEKTTFCWNQIFSDFLHKDYGGSETL 59
Query: 135 LVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKD 172
+V PF D +L E PG P AA A++ WAQ + D+D
Sbjct: 60 MVEPFTDALVSLEENFFPGAPSAASASLLWAQNYDDQD 97
>gi|125563560|gb|EAZ08940.1| hypothetical protein OsI_31203 [Oryza sativa Indica Group]
Length = 118
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 44/66 (66%), Gaps = 8/66 (12%)
Query: 66 NHICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICT 117
+ +CR A +YKFPDPIPEFA ET KF+ H++ +L +K D FG+ +E++V +CT
Sbjct: 50 DFVCRRAKNAAYDDYKFPDPIPEFAAQETSKFKEHMMWRLEQKKDDYFGEHVEEIVDVCT 109
Query: 118 EIFSTF 123
E S F
Sbjct: 110 EASSWF 115
>gi|116792287|gb|ABK26304.1| unknown [Picea sitchensis]
Length = 177
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 56/112 (50%), Gaps = 6/112 (5%)
Query: 69 CRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIFSTFLHSEY 128
CRA +FAD E +FR LL LA GD +V CT+IF F S
Sbjct: 66 CRAVATHKIVAQSDFADHEATRFRDGLLKMLADDQNIGDLRVRMVDTCTKIFREFAMSY- 124
Query: 129 GGPGTLLVLPFIDMADTLNE-RGLPGG--PQAARAAVKWAQRHVDKDWKEWT 177
G L + PF++M L E +G+P + R A+ W++ ++ +DWK+W+
Sbjct: 125 --SGELPMEPFVEMRRVLEEGQGIPLSIVSKVTRGAIGWSRYNLYRDWKKWS 174
>gi|386721713|ref|YP_006188038.1| hypothetical protein B2K_05970 [Paenibacillus mucilaginosus K02]
gi|384088837|gb|AFH60273.1| hypothetical protein B2K_05970 [Paenibacillus mucilaginosus K02]
Length = 764
Score = 39.3 bits (90), Expect = 0.68, Method: Composition-based stats.
Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 14/87 (16%)
Query: 98 KLAKKDMFGDSLEDVVGICTEI--FSTFLHSEYGGPGTLLVLPFIDMAD----------- 144
+L + D + D D +G CT + +S + +YGGPGT L ID D
Sbjct: 427 ELGRTD-YIDYAADHIGQCTRLGEYSEWDAQQYGGPGTNHQLALIDSLDDCGSFGATMLR 485
Query: 145 TLNERGLPGGPQAARAAVKWAQRHVDK 171
L ER LPGG AA ++ R D+
Sbjct: 486 VLRERELPGGRWAADRIAEYITREQDR 512
>gi|337748029|ref|YP_004642191.1| hypothetical protein KNP414_03782 [Paenibacillus mucilaginosus
KNP414]
gi|379719123|ref|YP_005311254.1| hypothetical protein PM3016_1175 [Paenibacillus mucilaginosus 3016]
gi|336299218|gb|AEI42321.1| YesR2 [Paenibacillus mucilaginosus KNP414]
gi|378567795|gb|AFC28105.1| YesR2 [Paenibacillus mucilaginosus 3016]
Length = 764
Score = 38.9 bits (89), Expect = 0.78, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 35/78 (44%), Gaps = 13/78 (16%)
Query: 107 DSLEDVVGICTEI--FSTFLHSEYGGPGTLLVLPFIDMAD-----------TLNERGLPG 153
D D +G CT + +S + +YGGPGT L ID D L ER LPG
Sbjct: 435 DYAADHIGQCTRLGEYSEWDAQQYGGPGTNHQLALIDSLDDCGSFGATMLRVLRERELPG 494
Query: 154 GPQAARAAVKWAQRHVDK 171
G AA ++ R D+
Sbjct: 495 GRWAADRIAEYITREQDR 512
>gi|189238626|ref|XP_001810169.1| PREDICTED: similar to CG12581 CG12581-PA [Tribolium castaneum]
gi|270008388|gb|EFA04836.1| hypothetical protein TcasGA2_TC014886 [Tribolium castaneum]
Length = 567
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 48/113 (42%), Gaps = 17/113 (15%)
Query: 45 DSSTISKKAQLSPLSFSASFRN-HICRA----AEYKFPDPIP--EFADSETQKFRTHLLN 97
++ KK L+ +FS S R H R+ A KF PIP + D E K ++
Sbjct: 462 NTGIYRKKGHLNERAFSYSIRQEHRSRSYGSLANLKFATPIPNGDVEDREDMKKEREIMQ 521
Query: 98 KLAKKDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERG 150
+ D+ GD LE +E+ T + G PG P I A +N RG
Sbjct: 522 MVQDLDLSGDELER-----SEVPRTMYEARGGPPG-----PVIVAAPQMNGRG 564
>gi|424864330|ref|ZP_18288234.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
gi|400759759|gb|EJP73940.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
Length = 224
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/111 (24%), Positives = 46/111 (41%), Gaps = 3/111 (2%)
Query: 2 ACYCNRSLISSKLDPCSPFSSSSLRLAFHRNSSVLLGGGIKLHDSSTISKKAQLSPLSFS 61
A CNR L+ C ++ FHR SSVL G +H S + + + + S
Sbjct: 115 AIICNRQLLHGSFPNCGYEQRVTVNFGFHRRSSVLGTQGGGIHSESQVFTEEVIQRRARS 174
Query: 62 ASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDV 112
+ A + K+ P + + + N+ A+KD+ +LED+
Sbjct: 175 IGY---AIDARKQKYTSETPYSYEPFESTNTSFVWNEEARKDLKDYNLEDL 222
>gi|147801414|emb|CAN65778.1| hypothetical protein VITISV_014488 [Vitis vinifera]
Length = 155
Score = 37.0 bits (84), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 86 SETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTE 118
+ET+KFR LL KL+K KD F D L VV +C +
Sbjct: 34 NETEKFRVELLKKLSKEKDTFRDDLHAVVAVCAQ 67
>gi|424864172|ref|ZP_18288076.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
gi|400759601|gb|EJP73782.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
Length = 389
Score = 36.6 bits (83), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 27/111 (24%), Positives = 46/111 (41%), Gaps = 3/111 (2%)
Query: 2 ACYCNRSLISSKLDPCSPFSSSSLRLAFHRNSSVLLGGGIKLHDSSTISKKAQLSPLSFS 61
A CNR L+ C ++ FHR SSVL G +H S + + + + S
Sbjct: 280 AIICNRQLLHGSFPNCGYEQRVTVNFGFHRRSSVLGTQGGGIHSESQVFTEEIIQRRARS 339
Query: 62 ASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDV 112
+ A + K+ P + + + N+ A+KD+ +LED+
Sbjct: 340 IGY---AIDARKQKYTSETPYSYEPFESTNTSFVWNEEARKDLKDYNLEDL 387
>gi|226947191|ref|YP_002802264.1| thiol oxidoreductase [Azotobacter vinelandii DJ]
gi|226722118|gb|ACO81289.1| thiol oxidoreductase [Azotobacter vinelandii DJ]
Length = 452
Score = 36.2 bits (82), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 5/42 (11%)
Query: 125 HSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQ 166
+++GGP LP D D LN+ G+PG P RAA+ W +
Sbjct: 120 ENDHGGP-----LPHPDYGDQLNDTGIPGVPPEGRAAIAWEE 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,975,847,763
Number of Sequences: 23463169
Number of extensions: 118300207
Number of successful extensions: 241547
Number of sequences better than 100.0: 39
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 241495
Number of HSP's gapped (non-prelim): 39
length of query: 181
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 48
effective length of database: 9,238,593,890
effective search space: 443452506720
effective search space used: 443452506720
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)