BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023901
(275 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449458726|ref|XP_004147098.1| PREDICTED: uncharacterized protein LOC101212890 [Cucumis sativus]
gi|449526688|ref|XP_004170345.1| PREDICTED: uncharacterized protein LOC101227242 [Cucumis sativus]
Length = 340
Score = 327 bits (837), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 172/271 (63%), Positives = 205/271 (75%), Gaps = 23/271 (8%)
Query: 13 QFLPRP-----KIPQPPFSSPSLLLRQRRTTYQFSSLVVSAQNDKKSTTTKEVKKKAEEV 67
Q L RP K P FS +L R S+L+++A K S + V K+++
Sbjct: 16 QLLHRPNSLFSKFPSSTFSPFTLSNR--------STLLLAAAKKKDSDSVPAVAKESKTS 67
Query: 68 EVEV----------EEELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYV 117
+ EEELPW QEKALDLVEF+GSVTQAIPGPRVGQS LPWILAVPLAY+
Sbjct: 68 KSNTVGDEEEFVEVEEELPWYQEKALDLVEFSGSVTQAIPGPRVGQSSLPWILAVPLAYL 127
Query: 118 GVSFVIAFVKTVKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTG 177
GV+FVIAFVKTV+KFNSPK KR++ V KNA +C ++DEL +KG D V P AL +VQKTG
Sbjct: 128 GVTFVIAFVKTVRKFNSPKEKRRRQVTKNAFLCISVDELLEKGRDEVKPEALAEIVQKTG 187
Query: 178 FSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPV 237
FS++ +LRKYIRYALNEKPFNP+LV NLIQLRKAS L+D+QVA+ILNE+SRR R+KGPV
Sbjct: 188 FSVDQILRKYIRYALNEKPFNPELVANLIQLRKASALEDTQVAQILNEVSRRIERDKGPV 247
Query: 238 VMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
VMNMSGY+EKGFKRKLAVQALFGK+FYLSE+
Sbjct: 248 VMNMSGYTEKGFKRKLAVQALFGKIFYLSEL 278
>gi|225463793|ref|XP_002268157.1| PREDICTED: uncharacterized protein LOC100250766 [Vitis vinifera]
gi|297742717|emb|CBI35351.3| unnamed protein product [Vitis vinifera]
Length = 345
Score = 327 bits (837), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 153/194 (78%), Positives = 174/194 (89%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWI+EKALDLVEF+GSV QAIPGPRVG+S PWILA+PLAY G++FVIAFV+TV+KFNS
Sbjct: 88 LPWIEEKALDLVEFSGSVAQAIPGPRVGRSSFPWILAIPLAYAGITFVIAFVRTVQKFNS 147
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
PK KR+KLVNKNAM+CK+IDE+F G D AL GL+QKTGFS E++ RKYIRYALNE
Sbjct: 148 PKQKRRKLVNKNAMLCKSIDEVFLNGRDEELQSALNGLMQKTGFSREEIFRKYIRYALNE 207
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
KPFNP++V LIQ RKAS+LDDSQVAEILNEISRR VR+KGPVVM+MSGYSEKGFKRKLA
Sbjct: 208 KPFNPEMVATLIQFRKASLLDDSQVAEILNEISRRIVRDKGPVVMDMSGYSEKGFKRKLA 267
Query: 255 VQALFGKVFYLSEV 268
VQALFGKVFYLSE+
Sbjct: 268 VQALFGKVFYLSEL 281
>gi|356577710|ref|XP_003556967.1| PREDICTED: uncharacterized protein LOC100804019 [Glycine max]
Length = 348
Score = 322 bits (824), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 149/194 (76%), Positives = 175/194 (90%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWIQEKALDLVEFTGSVTQAIPGPRVG + LPWILA+PL Y G++FVIAFVKTV+KF+S
Sbjct: 95 LPWIQEKALDLVEFTGSVTQAIPGPRVGPTSLPWILAIPLTYAGLTFVIAFVKTVRKFSS 154
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
PK KR++ V+KNA +CK++D+LFQKG D V ALK + KTGF +E++LRKYIRYALNE
Sbjct: 155 PKAKRRRQVSKNATLCKSLDDLFQKGRDEVKLDALKQIENKTGFDLEEILRKYIRYALNE 214
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
KPFNPD+V +LIQLRKASML+DSQVAEILNEISRR VR+KGP+VM+ SGY+EKGFKRK+A
Sbjct: 215 KPFNPDMVADLIQLRKASMLNDSQVAEILNEISRRIVRDKGPIVMDKSGYTEKGFKRKIA 274
Query: 255 VQALFGKVFYLSEV 268
VQALFGKVFYLSE+
Sbjct: 275 VQALFGKVFYLSEL 288
>gi|357439057|ref|XP_003589805.1| hypothetical protein MTR_1g039490 [Medicago truncatula]
gi|355478853|gb|AES60056.1| hypothetical protein MTR_1g039490 [Medicago truncatula]
Length = 422
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 149/193 (77%), Positives = 170/193 (88%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWIQEKALDLVEFTGSVTQAIPGPRVG + LPWILAVPL Y+G++FVIAFVKTVKKF S
Sbjct: 77 LPWIQEKALDLVEFTGSVTQAIPGPRVGPTSLPWILAVPLGYLGLTFVIAFVKTVKKFTS 136
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
PK +R+KLV KNAM+CK++DEL Q+G D + LK + KTGF +E++LRKYIRYALNE
Sbjct: 137 PKAQRRKLVGKNAMLCKSVDELLQRGRDEIKVDDLKAIENKTGFGLEEILRKYIRYALNE 196
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
KPFNPD+V +LIQLR+AS L DSQ AEILNEISRR VR+KGP+VMN SGY+EKGFKRKLA
Sbjct: 197 KPFNPDVVADLIQLRRASSLSDSQAAEILNEISRRIVRDKGPIVMNKSGYTEKGFKRKLA 256
Query: 255 VQALFGKVFYLSE 267
VQALFGKVFYLSE
Sbjct: 257 VQALFGKVFYLSE 269
>gi|21592521|gb|AAM64471.1| unknown [Arabidopsis thaliana]
Length = 346
Score = 318 bits (815), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/253 (64%), Positives = 198/253 (78%), Gaps = 5/253 (1%)
Query: 20 IPQPPFSSPSLLLRQRRTTY-QFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEE---EL 75
+P+ P +P LRQ R + S+ Q++ + T KK + E EE ++
Sbjct: 25 LPRTPLFAPLPSLRQLRPKHISISAAAPKKQSETVTAPTPAAKKNSSVEEETEEEVEEDM 84
Query: 76 PWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNSP 135
PWIQEKALDLVEFTGSV+QAIPGPRVG SKLPW+LAVPLAY GV+FV AFVKTV+KF+SP
Sbjct: 85 PWIQEKALDLVEFTGSVSQAIPGPRVGSSKLPWMLAVPLAYAGVTFVTAFVKTVQKFSSP 144
Query: 136 KFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEK 195
K +RKKLVN+NAM+C++IDEL +K G V+ LK L QKT F+ME++LRKYIRYALNEK
Sbjct: 145 KAQRKKLVNQNAMLCRSIDELLRKDG-TVHSSELKALEQKTEFNMEEILRKYIRYALNEK 203
Query: 196 PFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAV 255
PFNPDLV +LI LRKAS L+DSQ+ EILNEISRR V+EKGPVVM M G++EKGFKRKLAV
Sbjct: 204 PFNPDLVADLIHLRKASGLNDSQIPEILNEISRRIVKEKGPVVMKMQGFTEKGFKRKLAV 263
Query: 256 QALFGKVFYLSEV 268
QALFGK++YLSE+
Sbjct: 264 QALFGKIYYLSEL 276
>gi|297806925|ref|XP_002871346.1| hypothetical protein ARALYDRAFT_487695 [Arabidopsis lyrata subsp.
lyrata]
gi|297317183|gb|EFH47605.1| hypothetical protein ARALYDRAFT_487695 [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 317 bits (812), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 167/255 (65%), Positives = 199/255 (78%), Gaps = 9/255 (3%)
Query: 20 IPQPPF--SSPSLLLRQRRTTY-QFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEE--- 73
+P+ P S PSL RQ R + S+ Q+D +T T KK + E EE
Sbjct: 25 LPRTPLFVSLPSL--RQLRPKHPSISAAAPKKQSDTVTTPTPTAKKNSSVEEETEEEVEE 82
Query: 74 ELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFN 133
++ WIQEKALDLVEFTGSVTQAIPGPRVG SKLPW+LAVPLAY GV+FV AFVKTV+KF+
Sbjct: 83 DMLWIQEKALDLVEFTGSVTQAIPGPRVGSSKLPWMLAVPLAYAGVTFVTAFVKTVQKFS 142
Query: 134 SPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALN 193
SPK +RKKLVN+NAM+C++IDEL +K G V+ LK L QKT F+ME++LRKYIRYALN
Sbjct: 143 SPKAQRKKLVNQNAMLCRSIDELLRKDG-TVHSSELKALEQKTEFNMEEILRKYIRYALN 201
Query: 194 EKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKL 253
EKPFNPDLV +LI LRKAS L+DSQ+ EILNEISRR V+EKGPVVM M G++EKGFKRKL
Sbjct: 202 EKPFNPDLVADLIHLRKASGLNDSQIPEILNEISRRIVKEKGPVVMKMQGFTEKGFKRKL 261
Query: 254 AVQALFGKVFYLSEV 268
AVQALFGK++YLSE+
Sbjct: 262 AVQALFGKIYYLSEL 276
>gi|18415850|ref|NP_568200.1| uncharacterized protein [Arabidopsis thaliana]
gi|14334822|gb|AAK59589.1| unknown protein [Arabidopsis thaliana]
gi|15293203|gb|AAK93712.1| unknown protein [Arabidopsis thaliana]
gi|332003935|gb|AED91318.1| uncharacterized protein [Arabidopsis thaliana]
Length = 346
Score = 317 bits (812), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 168/257 (65%), Positives = 201/257 (78%), Gaps = 13/257 (5%)
Query: 20 IPQPPFSSPSLLLRQRRTTYQFSSLVVSAQNDKKSTTT--------KEVKKKAEEVEVEV 71
+P P +P LRQ R + + +SA KK + T K+ EE E EV
Sbjct: 25 LPLTPLFAPLPSLRQLRPKH----ISISAAAPKKKSETVTAPTPAAKKNSSVEEETEEEV 80
Query: 72 EEELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKK 131
EE++PWIQEKALDLVEFTGSV+QAIPGPRVG SKLPW+LAVPLAY GV+FV AFVKTV+K
Sbjct: 81 EEDMPWIQEKALDLVEFTGSVSQAIPGPRVGSSKLPWMLAVPLAYAGVTFVTAFVKTVQK 140
Query: 132 FNSPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYA 191
F+SPK +RKKLVN+NAM+C++IDEL +K G V+ LK L QKT F+ME++LRKYIRYA
Sbjct: 141 FSSPKAQRKKLVNQNAMLCRSIDELLRKAG-TVHSSELKALEQKTEFNMEEILRKYIRYA 199
Query: 192 LNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKR 251
LNEKPFNPDLV +LI LRKAS L+DSQ+ EILNEISRR V+EKGPVVM M G++EKGFKR
Sbjct: 200 LNEKPFNPDLVADLIHLRKASGLNDSQIPEILNEISRRIVKEKGPVVMKMQGFTEKGFKR 259
Query: 252 KLAVQALFGKVFYLSEV 268
KLAVQALFGK++YLSE+
Sbjct: 260 KLAVQALFGKIYYLSEL 276
>gi|9759348|dbj|BAB10003.1| unnamed protein product [Arabidopsis thaliana]
Length = 471
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 148/195 (75%), Positives = 174/195 (89%), Gaps = 1/195 (0%)
Query: 74 ELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFN 133
++PWIQEKALDLVEFTGSV+QAIPGPRVG SKLPW+LAVPLAY GV+FV AFVKTV+KF+
Sbjct: 185 DMPWIQEKALDLVEFTGSVSQAIPGPRVGSSKLPWMLAVPLAYAGVTFVTAFVKTVQKFS 244
Query: 134 SPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALN 193
SPK +RKKLVN+NAM+C++IDEL +K G V+ LK L QKT F+ME++LRKYIRYALN
Sbjct: 245 SPKAQRKKLVNQNAMLCRSIDELLRKAG-TVHSSELKALEQKTEFNMEEILRKYIRYALN 303
Query: 194 EKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKL 253
EKPFNPDLV +LI LRKAS L+DSQ+ EILNEISRR V+EKGPVVM M G++EKGFKRKL
Sbjct: 304 EKPFNPDLVADLIHLRKASGLNDSQIPEILNEISRRIVKEKGPVVMKMQGFTEKGFKRKL 363
Query: 254 AVQALFGKVFYLSEV 268
AVQALFGK++YLSE+
Sbjct: 364 AVQALFGKIYYLSEL 378
>gi|147766314|emb|CAN72276.1| hypothetical protein VITISV_030897 [Vitis vinifera]
Length = 371
Score = 313 bits (802), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 153/215 (71%), Positives = 174/215 (80%), Gaps = 21/215 (9%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWI+EKALDLVEF+GSV QAIPGPRVG+S PWILA+PLAY G++FVIAFV+TV+KFNS
Sbjct: 88 LPWIEEKALDLVEFSGSVAQAIPGPRVGRSSFPWILAIPLAYAGITFVIAFVRTVQKFNS 147
Query: 135 PKFKRKKLV---------------------NKNAMVCKTIDELFQKGGDAVNPPALKGLV 173
PK KR+KLV NKNAM+CK+IDE+F G D AL GL+
Sbjct: 148 PKQKRRKLVPVLLICFVYFLIFEDINSNDVNKNAMLCKSIDEVFLNGRDEELQSALNGLM 207
Query: 174 QKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVRE 233
QKTGFS E++ RKYIRYALNEKPFNP++V LIQ RKAS+LDDSQVAEILNEISRR VR+
Sbjct: 208 QKTGFSREEIFRKYIRYALNEKPFNPEMVATLIQFRKASLLDDSQVAEILNEISRRIVRD 267
Query: 234 KGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
KGPVVM+MSGYSEKGFKRKLAVQALFGKVFYLSE+
Sbjct: 268 KGPVVMDMSGYSEKGFKRKLAVQALFGKVFYLSEL 302
>gi|363807650|ref|NP_001242416.1| uncharacterized protein LOC100809862 [Glycine max]
gi|255639857|gb|ACU20221.1| unknown [Glycine max]
Length = 349
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 141/194 (72%), Positives = 173/194 (89%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWIQEKALDLVEFTGSVTQAIPGPRVG + +PWILA+PL Y G++FVIAFVKT++KF+S
Sbjct: 96 LPWIQEKALDLVEFTGSVTQAIPGPRVGPTSMPWILAIPLTYAGLTFVIAFVKTIRKFSS 155
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
PK KR++ V+KNA +CK++D+LF+KG D V ALK + KTGF +E++LRKYIRY LNE
Sbjct: 156 PKAKRRRQVSKNATLCKSLDDLFEKGRDQVKLDALKQIENKTGFDLEEILRKYIRYTLNE 215
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
KPFNPD+V +LI LRKAS+L+DSQVAEILN+ISRR VR+KGP+VM+ SGY++KGFKRK+A
Sbjct: 216 KPFNPDMVADLIHLRKASILNDSQVAEILNDISRRIVRDKGPIVMDKSGYTDKGFKRKIA 275
Query: 255 VQALFGKVFYLSEV 268
VQALFGKVFYLSE+
Sbjct: 276 VQALFGKVFYLSEL 289
>gi|293335391|ref|NP_001168376.1| uncharacterized protein LOC100382145 [Zea mays]
gi|223947855|gb|ACN28011.1| unknown [Zea mays]
gi|414586266|tpg|DAA36837.1| TPA: hypothetical protein ZEAMMB73_234499 [Zea mays]
Length = 335
Score = 309 bits (792), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 158/242 (65%), Positives = 198/242 (81%), Gaps = 3/242 (1%)
Query: 27 SPSLLLRQRRTTYQFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEEELPWIQEKALDLV 86
+P L++R +R Q + S Q S K + EE E EVEEE+PWIQ+KALDLV
Sbjct: 40 APLLVVRAKRAGSQPPAAAASRQPANPSAVPK---RDVEEEEEEVEEEMPWIQDKALDLV 96
Query: 87 EFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKN 146
EFTG+VTQAIPGPRVG S +PW+LAVPLAYVGVSFV+A V+TV+KF SP K+K+ V+KN
Sbjct: 97 EFTGTVTQAIPGPRVGSSPVPWLLAVPLAYVGVSFVLAVVRTVRKFTSPHTKKKRRVSKN 156
Query: 147 AMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLI 206
+ K++D+LFQKG +A+N PAL+ L+QKTGF M+DV+RKYIRY LNEK F+PD+VV+LI
Sbjct: 157 IFLLKSLDDLFQKGREAINYPALQDLMQKTGFDMDDVVRKYIRYTLNEKQFSPDVVVDLI 216
Query: 207 QLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLS 266
LRKASML+D++VAEILNEISRR VREKGPVVM++SG++E+GFKRKLAVQALFGK+ YLS
Sbjct: 217 HLRKASMLEDAEVAEILNEISRRIVREKGPVVMDLSGFTEQGFKRKLAVQALFGKILYLS 276
Query: 267 EV 268
E+
Sbjct: 277 EL 278
>gi|242073772|ref|XP_002446822.1| hypothetical protein SORBIDRAFT_06g023220 [Sorghum bicolor]
gi|241938005|gb|EES11150.1| hypothetical protein SORBIDRAFT_06g023220 [Sorghum bicolor]
Length = 335
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 141/194 (72%), Positives = 175/194 (90%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
+PWIQ+KALDLVEFTG+VTQAIPGPRVG S +PW+LAVPLAYVGVSFV+A V+TV++F S
Sbjct: 85 MPWIQDKALDLVEFTGTVTQAIPGPRVGSSPVPWLLAVPLAYVGVSFVLAVVRTVRRFTS 144
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
P+ K+K+ V KN + K++DELFQKG +A++ PAL+ L+QKTGF M+DV+RKYIRY LNE
Sbjct: 145 PRTKKKRRVGKNIFLLKSLDELFQKGREAIDYPALQDLMQKTGFDMDDVVRKYIRYTLNE 204
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
K FNPD+VV+LI LRKASML+D++VAEILNEISRR VREKGPVVM++SG++E+GFKRKLA
Sbjct: 205 KQFNPDVVVDLIHLRKASMLEDAEVAEILNEISRRIVREKGPVVMDLSGFTEQGFKRKLA 264
Query: 255 VQALFGKVFYLSEV 268
VQALFGK+ YLSE+
Sbjct: 265 VQALFGKILYLSEL 278
>gi|115459540|ref|NP_001053370.1| Os04g0527800 [Oryza sativa Japonica Group]
gi|38344450|emb|CAE05656.2| OSJNBa0038O10.22 [Oryza sativa Japonica Group]
gi|113564941|dbj|BAF15284.1| Os04g0527800 [Oryza sativa Japonica Group]
gi|116310971|emb|CAH67907.1| OSIGBa0115K01-H0319F09.13 [Oryza sativa Indica Group]
gi|215701494|dbj|BAG92918.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195250|gb|EEC77677.1| hypothetical protein OsI_16722 [Oryza sativa Indica Group]
Length = 338
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/242 (63%), Positives = 200/242 (82%), Gaps = 6/242 (2%)
Query: 27 SPSLLLRQRRTTYQFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEEELPWIQEKALDLV 86
+P L+ R +R + +A ++ +V K+ + EVEVEEE+PWIQ+KALDLV
Sbjct: 46 TPLLVARAKRPGSR------TAAASRQPANPSDVPKREADEEVEVEEEMPWIQDKALDLV 99
Query: 87 EFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKN 146
EFTG+VTQAIPGPRVG S +PW+LAVPLAYVGVSFV+A V+TV++F SP+ ++K+ V+KN
Sbjct: 100 EFTGTVTQAIPGPRVGSSPVPWLLAVPLAYVGVSFVLAVVRTVRRFTSPRTQKKRRVSKN 159
Query: 147 AMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLI 206
+ K++DELFQKG +AV+ PAL+ L++KTGF M+DV+RKYIRY LNEKPFNPD+VV+LI
Sbjct: 160 IFLLKSLDELFQKGREAVDFPALQELMEKTGFDMDDVVRKYIRYTLNEKPFNPDVVVDLI 219
Query: 207 QLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLS 266
LRKASML+D++VAEILNEISRR VREKGPVVM+++G++E+GFKRKLAVQ LFGK+ YLS
Sbjct: 220 HLRKASMLEDAEVAEILNEISRRIVREKGPVVMDLAGFTEQGFKRKLAVQTLFGKILYLS 279
Query: 267 EV 268
E+
Sbjct: 280 EL 281
>gi|222629244|gb|EEE61376.1| hypothetical protein OsJ_15539 [Oryza sativa Japonica Group]
Length = 338
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 149/222 (67%), Positives = 192/222 (86%)
Query: 47 SAQNDKKSTTTKEVKKKAEEVEVEVEEELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKL 106
+A ++ +V K+ + EVEVEEE+PWIQ+KALDLVEFTG+VTQAIPGPRVG S +
Sbjct: 60 TAAASRQPANPSDVPKREADEEVEVEEEMPWIQDKALDLVEFTGTVTQAIPGPRVGSSPV 119
Query: 107 PWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNP 166
PW+LAVPLAYVGVSFV+A V+TV++F SP+ ++K+ V+KN + K++DELFQKG +AV+
Sbjct: 120 PWLLAVPLAYVGVSFVLAVVRTVRRFTSPRTQKKRRVSKNIFLLKSLDELFQKGREAVDF 179
Query: 167 PALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEI 226
PAL+ L++KTGF M+DV+RKYIRY LNEKPFNPD+VV+LI LRKASML+D++VAEILNEI
Sbjct: 180 PALQELMEKTGFDMDDVVRKYIRYTLNEKPFNPDVVVDLIHLRKASMLEDAEVAEILNEI 239
Query: 227 SRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
SRR VREKGPVVM+++G++E+GFKRKLAVQ LFGK+ YLSE+
Sbjct: 240 SRRIVREKGPVVMDLAGFTEQGFKRKLAVQTLFGKILYLSEL 281
>gi|357164825|ref|XP_003580179.1| PREDICTED: uncharacterized protein LOC100830071 [Brachypodium
distachyon]
Length = 331
Score = 306 bits (784), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 141/195 (72%), Positives = 174/195 (89%)
Query: 74 ELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFN 133
ELPWIQ+KALDLVEFTG+VTQAIPGPRVG S +PW+LAVPLAYVG +FV++ V+TV+KF
Sbjct: 80 ELPWIQDKALDLVEFTGTVTQAIPGPRVGSSPVPWLLAVPLAYVGATFVLSVVRTVRKFT 139
Query: 134 SPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALN 193
SP+ ++KK V KN + K++DELFQKG +AV PAL+ L+QKTGF M+DV+RKYIRY LN
Sbjct: 140 SPRTQKKKRVTKNIFLLKSLDELFQKGREAVGFPALQELMQKTGFDMDDVVRKYIRYTLN 199
Query: 194 EKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKL 253
EKPFNPD+VV+LI LRKASML+D++VAEILNEISRR VREKGP+VM++SG++E+GFKRKL
Sbjct: 200 EKPFNPDVVVDLIHLRKASMLEDAEVAEILNEISRRIVREKGPIVMDLSGFTEQGFKRKL 259
Query: 254 AVQALFGKVFYLSEV 268
AVQ LFGK+ YLSE+
Sbjct: 260 AVQTLFGKIMYLSEL 274
>gi|326534176|dbj|BAJ89438.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 319
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 155/254 (61%), Positives = 197/254 (77%), Gaps = 17/254 (6%)
Query: 15 LPRPKIPQPPFSSPSLLLRQRRTTYQFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEEE 74
LP P P S+P L+ R +RT + K+ + EVEVEEE
Sbjct: 26 LPAPTRATPRRSTPLLVARAKRTN-----------------NSSAAPKREADEEVEVEEE 68
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWIQ+KALDLVEFTG+VTQAIPGPRVG S +PW+LAVPLAYVG+SF ++ V+TV++F S
Sbjct: 69 LPWIQDKALDLVEFTGTVTQAIPGPRVGSSPVPWLLAVPLAYVGISFALSVVRTVRRFTS 128
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
P+ ++KK V KN + K++DELFQKG +AV+ PA++ L+QKTGF M+DV+RKYIRY LNE
Sbjct: 129 PRTQKKKRVTKNIFLLKSLDELFQKGREAVDFPAIQELMQKTGFDMDDVVRKYIRYTLNE 188
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
K FNPD+VV+LI LRKASML+D++VAEILNEISRR VREKGP+VM++SG++E+GFKRKLA
Sbjct: 189 KQFNPDVVVDLIHLRKASMLEDNEVAEILNEISRRIVREKGPIVMDLSGFTEQGFKRKLA 248
Query: 255 VQALFGKVFYLSEV 268
VQ LFGK+ YLSE+
Sbjct: 249 VQTLFGKIMYLSEL 262
>gi|260447006|emb|CBG76419.1| OO_Ba0013J05-OO_Ba0033A15.6 [Oryza officinalis]
Length = 521
Score = 303 bits (775), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 151/242 (62%), Positives = 197/242 (81%), Gaps = 6/242 (2%)
Query: 27 SPSLLLRQRRTTYQFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEEELPWIQEKALDLV 86
+P L+ R +R+ + +A ++ K+ + EVEVEEE+PWIQ+KALDLV
Sbjct: 41 TPLLVARAKRSGSR------TAAASRQPANPSAAPKREADEEVEVEEEMPWIQDKALDLV 94
Query: 87 EFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKN 146
EFTG+VTQAIPGPRVG S +PW+LAVPLAYVGVSFV+A V+TV++F SP+ ++K+ V+KN
Sbjct: 95 EFTGTVTQAIPGPRVGSSPVPWLLAVPLAYVGVSFVLAVVRTVRRFTSPRTQKKRRVSKN 154
Query: 147 AMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLI 206
+ K++DELFQKG +AV+ PAL+ L++KTGF M+DV+RKYIRY LNEKPFNPD+VV LI
Sbjct: 155 IFLLKSLDELFQKGREAVDFPALQELMEKTGFDMDDVVRKYIRYTLNEKPFNPDVVVELI 214
Query: 207 QLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLS 266
LRK SML+D++VAEILNEISRR VREKGPVVM+++G++E+GFKRKLAVQ LFGK+ YLS
Sbjct: 215 HLRKVSMLEDAEVAEILNEISRRIVREKGPVVMDLAGFTEQGFKRKLAVQTLFGKILYLS 274
Query: 267 EV 268
E+
Sbjct: 275 EL 276
>gi|168046888|ref|XP_001775904.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672736|gb|EDQ59269.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 249
Score = 224 bits (572), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 102/194 (52%), Positives = 144/194 (74%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
+ WIQEKA DLV TG +PGPRV ++++PW++A+PLAY+G++FVIA V+T KK+ S
Sbjct: 10 ISWIQEKAEDLVIATGQAIDRVPGPRVAETRMPWLVALPLAYLGITFVIACVRTYKKYTS 69
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNE 194
PK +RK+ V KNA + +++ E F D ++ L+ L K FS+ +VLRKY+RYALNE
Sbjct: 70 PKGQRKRQVGKNAFLVESLGEYFPTKRDELDANKLQKLANKCNFSLGEVLRKYVRYALNE 129
Query: 195 KPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLA 254
+PF P+ V +L+ LRK S L +S+VA++LN++++R V+ KGPVVMN G +EKG KRK A
Sbjct: 130 RPFTPETVADLLHLRKVSGLSESEVADVLNDVAKRLVKSKGPVVMNTEGMTEKGIKRKAA 189
Query: 255 VQALFGKVFYLSEV 268
VQALF K+ YLSE+
Sbjct: 190 VQALFSKLLYLSEL 203
>gi|302804470|ref|XP_002983987.1| hypothetical protein SELMODRAFT_46272 [Selaginella moellendorffii]
gi|300148339|gb|EFJ14999.1| hypothetical protein SELMODRAFT_46272 [Selaginella moellendorffii]
Length = 253
Score = 219 bits (559), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 149/201 (74%), Gaps = 1/201 (0%)
Query: 70 EVEE-ELPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKT 128
EVEE E+ W++EK +LV+ TG+ QAIPGPRVGQS +PW+L +P+AY V+FVIA +T
Sbjct: 3 EVEEVEMSWVEEKTGELVQMTGNAIQAIPGPRVGQSSVPWLLVLPVAYFSVTFVIAVYRT 62
Query: 129 VKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYI 188
VKK++SPK K+++++ KNA + ++D+ F + + + LK L +K F ++VLRKYI
Sbjct: 63 VKKYSSPKAKKRRMIGKNAFLVTSLDKYFPQRREEFDSKVLKELERKCSFDSKEVLRKYI 122
Query: 189 RYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKG 248
RYA+NE+ F P+ V +LI LR+ + L D+++AE+LNE SRR V E G V+M++ G +E+G
Sbjct: 123 RYAMNERAFTPETVADLIHLRRTTKLTDNEIAEVLNETSRRVVNENGTVMMDLRGLTERG 182
Query: 249 FKRKLAVQALFGKVFYLSEVN 269
KRK AV++LF K+ YLSE++
Sbjct: 183 VKRKAAVRSLFSKLLYLSELD 203
>gi|302753452|ref|XP_002960150.1| hypothetical protein SELMODRAFT_437286 [Selaginella moellendorffii]
gi|300171089|gb|EFJ37689.1| hypothetical protein SELMODRAFT_437286 [Selaginella moellendorffii]
Length = 357
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 92/189 (48%), Positives = 139/189 (73%)
Query: 81 KALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRK 140
K +LV+ TG+ QAIPGPRVGQS +PW+L +P+AY V+FVIA +TVKK++SPK K++
Sbjct: 108 KTGELVQMTGNAIQAIPGPRVGQSSVPWLLVLPVAYFSVTFVIAVYRTVKKYSSPKAKKR 167
Query: 141 KLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPD 200
+++ KNA + ++D+ F + + + LK L +K F ++VLRKYIRYA+NE+ F P+
Sbjct: 168 RMIGKNAFLVTSLDKYFPQRREEFDSKVLKELERKCSFDSKEVLRKYIRYAMNERAFTPE 227
Query: 201 LVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFG 260
V +LI LR+ + L D+++A++LNE SRR V E G V+M++ G +E+G KRK AV++LF
Sbjct: 228 TVADLIHLRRITKLTDNEIADVLNETSRRVVNENGTVMMDLRGLTERGVKRKAAVRSLFS 287
Query: 261 KVFYLSEVN 269
K+ YLSE++
Sbjct: 288 KLLYLSELD 296
>gi|255544167|ref|XP_002513146.1| conserved hypothetical protein [Ricinus communis]
gi|223548157|gb|EEF49649.1| conserved hypothetical protein [Ricinus communis]
Length = 198
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/161 (61%), Positives = 122/161 (75%)
Query: 15 LPRPKIPQPPFSSPSLLLRQRRTTYQFSSLVVSAQNDKKSTTTKEVKKKAEEVEVEVEEE 74
+P P +P P S LL ++R + V S + + ++ KE K EE E EVEEE
Sbjct: 24 VPFPFMPTPISSRNFFLLYKQRGRRIHVAAVKSNSSSGEKSSDKEKKIVEEEEEEEVEEE 83
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
L WIQEKALDLVEFTGSVTQAIPGPRVGQS LPWILA+PL Y+G++FVIAFVKTVKK++S
Sbjct: 84 LGWIQEKALDLVEFTGSVTQAIPGPRVGQSSLPWILALPLGYLGITFVIAFVKTVKKYSS 143
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQK 175
P+ KRK+LVNKNAM+CK+IDELF +GGDA++ ALK L +K
Sbjct: 144 PRDKRKRLVNKNAMLCKSIDELFHQGGDALHHSALKELEKK 184
>gi|224113227|ref|XP_002316428.1| predicted protein [Populus trichocarpa]
gi|222865468|gb|EEF02599.1| predicted protein [Populus trichocarpa]
Length = 186
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 77/89 (86%), Positives = 84/89 (94%)
Query: 75 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNS 134
LPWIQEKALDLVEFTGSVTQAIPGPRVGQS LPWILA+PLAY G++FVIAFVKTVKKF S
Sbjct: 96 LPWIQEKALDLVEFTGSVTQAIPGPRVGQSSLPWILALPLAYAGITFVIAFVKTVKKFGS 155
Query: 135 PKFKRKKLVNKNAMVCKTIDELFQKGGDA 163
P++KRKKLVNKNAM+CK+IDELFQKGG
Sbjct: 156 PRYKRKKLVNKNAMLCKSIDELFQKGGGG 184
>gi|224113223|ref|XP_002316427.1| predicted protein [Populus trichocarpa]
gi|222865467|gb|EEF02598.1| predicted protein [Populus trichocarpa]
Length = 149
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 80/89 (89%), Positives = 86/89 (96%)
Query: 180 MEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVM 239
M D++RKYIRYALNEKPFNP+LV NLIQLR+ASMLDDSQVAEILN+ISRR VREKGPVVM
Sbjct: 1 MVDIVRKYIRYALNEKPFNPELVANLIQLRQASMLDDSQVAEILNDISRRIVREKGPVVM 60
Query: 240 NMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
NMSGYSEKGFKRKLAVQALFGKVFYLSE+
Sbjct: 61 NMSGYSEKGFKRKLAVQALFGKVFYLSEL 89
>gi|255590156|ref|XP_002535189.1| hypothetical protein RCOM_1973670 [Ricinus communis]
gi|223523800|gb|EEF27195.1| hypothetical protein RCOM_1973670 [Ricinus communis]
Length = 204
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 78/93 (83%), Positives = 88/93 (94%)
Query: 176 TGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKG 235
TGFSMED+ RKYIRYALNEKPFNPDLV NLIQLRKAS+L+DS+VAEILN+ISRR V+EKG
Sbjct: 55 TGFSMEDIFRKYIRYALNEKPFNPDLVANLIQLRKASLLEDSRVAEILNDISRRIVKEKG 114
Query: 236 PVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
PVVM M+GY+EKGFKRKLAVQ LFGKV+YLSE+
Sbjct: 115 PVVMEMAGYTEKGFKRKLAVQTLFGKVYYLSEL 147
>gi|255638286|gb|ACU19456.1| unknown [Glycine max]
Length = 104
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 56/68 (82%), Positives = 65/68 (95%)
Query: 201 LVVNLIQLRKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFG 260
+V +LIQLRKASML+DSQVAEILNEISRR VR+KGP+VM+ SGY+EKGFKRK+AVQALFG
Sbjct: 1 MVADLIQLRKASMLNDSQVAEILNEISRRIVRDKGPIVMDKSGYTEKGFKRKIAVQALFG 60
Query: 261 KVFYLSEV 268
KVFYLSE+
Sbjct: 61 KVFYLSEL 68
>gi|384250101|gb|EIE23581.1| hypothetical protein COCSUDRAFT_47354 [Coccomyxa subellipsoidea
C-169]
Length = 371
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 98/161 (60%), Gaps = 1/161 (0%)
Query: 107 PWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNP 166
P + A+ L +VG +F+++ V+ +++NSP+ KR + VN N + +++D A+
Sbjct: 157 PLLGALGLFFVG-TFLLSAVRVFRRYNSPRSKRTRTVNLNKAIVESLDAYLPANRAALTT 215
Query: 167 PALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEI 226
++GL K+GFS ++ RKY+ Y L E+ F+ D V +L LR A + D +VAE L E
Sbjct: 216 GVMRGLKMKSGFSSTEIFRKYLWYLLRERKFDEDAVADLAALRTALGMTDEEVAEALRER 275
Query: 227 SRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSE 267
++R + G V++ ++G ++ G +RK +ALF K+ +L+E
Sbjct: 276 AQRIYEKYGNVMLEVAGMTKAGIERKATCRALFSKILFLAE 316
>gi|308811640|ref|XP_003083128.1| RNA-binding protein RBM5 and related proteins, contain G-patch and
RRM domains (ISS) [Ostreococcus tauri]
gi|116055006|emb|CAL57083.1| RNA-binding protein RBM5 and related proteins, contain G-patch and
RRM domains (ISS) [Ostreococcus tauri]
Length = 349
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 102/181 (56%), Gaps = 3/181 (1%)
Query: 90 GSVTQAIPGPRVGQS-KLPWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKNAM 148
G VT+ + + G++ L + A+ L ++ + + K +K S + KRK+ VNKN
Sbjct: 104 GRVTEGVR--KAGENPGLRNLGALALFFLASTLTYSCYKVYRKATSGRAKRKRTVNKNVE 161
Query: 149 VCKTIDELFQKGGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQL 208
V + + F D++N LKG+ KTG+S +++RKY+RY L E+ F D V +++ L
Sbjct: 162 VVERLKNFFPNERDSLNKGVLKGISLKTGYSQSEIVRKYLRYKLTEEAFTLDFVADMLAL 221
Query: 209 RKASMLDDSQVAEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
+KAS L + IL E R ++ G ++ N++G ++ G +RK+ F K+ YL+++
Sbjct: 222 KKASGLTSGDIKGILLETGERMFKKYGTLMTNLAGLTQSGMERKIDGAGKFAKLMYLADL 281
Query: 269 N 269
+
Sbjct: 282 D 282
>gi|145357130|ref|XP_001422775.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583018|gb|ABP01092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 202
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 88/154 (57%)
Query: 116 YVGVSFVIAFVKTVKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGGDAVNPPALKGLVQK 175
++ +F + K +K S + +RK+ VNKN V + + F +VN ++GL K
Sbjct: 1 FLASTFAYSCYKVFRKATSGRMRRKRTVNKNVEVVERLKNFFPNERSSVNKGVVRGLALK 60
Query: 176 TGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFVREKG 235
TG+S ++ RKY+RY L E+ F D V +++ L+ A LD ++ EIL E R ++ G
Sbjct: 61 TGYSSAEIFRKYLRYKLTEEAFTLDFVADVLALKGACGLDSEEMKEILLETGERMFKKYG 120
Query: 236 PVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEVN 269
++ N++G ++ G +RK+ F K+ YL++++
Sbjct: 121 TLMTNLAGLTQSGMERKIDGAGKFAKLMYLADLD 154
>gi|307108828|gb|EFN57067.1| hypothetical protein CHLNCDRAFT_143820 [Chlorella variabilis]
Length = 341
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 90/168 (53%), Gaps = 6/168 (3%)
Query: 106 LPWILAVPLAYVGVSFVIAFVKTV------KKFNSPKFKRKKLVNKNAMVCKTIDELFQK 159
L IL P A +G AF+ T + P+ KR K++NKN MV TI +
Sbjct: 122 LGLILQHPAARIGGMAAAAFLGTTLLITLFRMSRDPQHKRSKVINKNKMVVDTIGKYLPG 181
Query: 160 GGDAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQV 219
+A++ + + L +TGF+ +V RKY+ + L E+ F+ + +L+ L+ A L D +V
Sbjct: 182 NREAMSAGSFRLLKLQTGFTSVEVFRKYLWFLLRERQFDEGALDDLVALKAALGLSDEEV 241
Query: 220 AEILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSE 267
A L E + R + G V++N+ G S+ G +RK + + LF K+ L+E
Sbjct: 242 AAALRERAERVYEKYGTVMVNLEGMSQAGIERKASARNLFMKLLSLTE 289
>gi|412985680|emb|CCO19126.1| predicted protein [Bathycoccus prasinos]
Length = 399
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 96/167 (57%), Gaps = 3/167 (1%)
Query: 102 GQSKLPWILAVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGG 161
G L +++ V LA + + K ++ S + RKK VNKN +V + + F
Sbjct: 158 GIRNLSYVVGVFLAG---TLGWSIYKVYRRSTSRRAVRKKTVNKNVLVIERLKPFFPNER 214
Query: 162 DAVNPPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAE 221
+++ KG+ + TGF+ ++V RKY+RY + E+PF V +++ L+ A L Q++E
Sbjct: 215 ESMTRNVAKGIARSTGFTTQEVFRKYLRYKMVEEPFTGAFVEDILALKNACELTPKQMSE 274
Query: 222 ILNEISRRFVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
IL+E + R V++ G +++++S ++ G +RK+ +F K+ YL+++
Sbjct: 275 ILSESAARMVKKYGTLILDVSELTKSGAERKMIAAQMFSKLCYLADL 321
>gi|302831041|ref|XP_002947086.1| hypothetical protein VOLCADRAFT_116322 [Volvox carteri f.
nagariensis]
gi|300267493|gb|EFJ51676.1| hypothetical protein VOLCADRAFT_116322 [Volvox carteri f.
nagariensis]
Length = 406
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 89/160 (55%), Gaps = 2/160 (1%)
Query: 111 AVPLAYVGVSFVIAFVKTVKKFNSPKFKRKKLVNKNAMVCKTIDELFQKGGDA-VNPPAL 169
+ +A++G + ++A + +K N+ + KR + +++N + + +++ G A + P L
Sbjct: 172 GLAVAFLG-TLLLATYRAWQKSNTAQAKRMRQIDRNRDLVEGLNKYLLNGNRAGLTPGVL 230
Query: 170 KGLVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRR 229
+ L + +GFS +V RKY+ Y L E+ F+ V +L+ ++ L D+ V E L E + R
Sbjct: 231 RKLQRASGFSAVEVFRKYLWYLLRERKFDQGAVEDLVAMKVGLELSDADVGEALRERATR 290
Query: 230 FVREKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEVN 269
+ G +++N G + G +RK +LF KV YL+E +
Sbjct: 291 IYDKYGTLMLNTEGLTLSGAQRKATCTSLFRKVLYLAECD 330
>gi|388518605|gb|AFK47364.1| unknown [Lotus japonicus]
Length = 104
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/36 (83%), Positives = 34/36 (94%)
Query: 233 EKGPVVMNMSGYSEKGFKRKLAVQALFGKVFYLSEV 268
+KGPVVM+ SGY+EKGFKRKLAVQ LFGKVFYLSE+
Sbjct: 9 DKGPVVMDKSGYTEKGFKRKLAVQTLFGKVFYLSEL 44
>gi|159474004|ref|XP_001695119.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276053|gb|EDP01827.1| predicted protein [Chlamydomonas reinhardtii]
Length = 285
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 49/92 (53%)
Query: 172 LVQKTGFSMEDVLRKYIRYALNEKPFNPDLVVNLIQLRKASMLDDSQVAEILNEISRRFV 231
L + +GF+ +V RKY+ Y L E+ F+ V +L+ L+ L D E L E S R
Sbjct: 131 LQRASGFTAVEVFRKYLWYLLRERKFDQGAVEDLVALKTGLGLTDGDAGEALRERSARVY 190
Query: 232 REKGPVVMNMSGYSEKGFKRKLAVQALFGKVF 263
+ G +++N G + G +RK ALF K+F
Sbjct: 191 DKYGTLMLNTEGLTLSGAQRKATCMALFRKIF 222
>gi|255544177|ref|XP_002513151.1| conserved hypothetical protein [Ricinus communis]
gi|223548162|gb|EEF49654.1| conserved hypothetical protein [Ricinus communis]
Length = 57
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 22/33 (66%), Positives = 27/33 (81%)
Query: 143 VNKNAMVCKTIDELFQKGGDAVNPPALKGLVQK 175
VNK AM+CK+IDELF KGGDA++ ALK L +K
Sbjct: 11 VNKKAMLCKSIDELFHKGGDALHHSALKELQKK 43
>gi|432634967|ref|ZP_19870861.1| HsdR family type I site-specific deoxyribonuclease [Escherichia
coli KTE81]
gi|431175611|gb|ELE75613.1| HsdR family type I site-specific deoxyribonuclease [Escherichia
coli KTE81]
Length = 1030
Score = 37.0 bits (84), Expect = 8.6, Method: Composition-based stats.
Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 12/89 (13%)
Query: 153 IDELFQKGGDAVN--PPALKGLVQKTGFSMEDVLRKYIRYALNEKPFNPD-------LVV 203
I+ + +KG DAV P +++ + ++E+ +RK I ++E P NP L+
Sbjct: 860 IELIVEKGADAVEALPESIRKNQEAMAETIENNVRKTI---VDENPVNPKYYEQMSVLLD 916
Query: 204 NLIQLRKASMLDDSQVAEILNEISRRFVR 232
LI+LR+ ++ + E + E+SR+ +R
Sbjct: 917 ELIELRRQKAIEYQEYLEKIRELSRKVIR 945
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.371
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,045,033,737
Number of Sequences: 23463169
Number of extensions: 160520044
Number of successful extensions: 616264
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 34
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 616180
Number of HSP's gapped (non-prelim): 102
length of query: 275
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 135
effective length of database: 9,074,351,707
effective search space: 1225037480445
effective search space used: 1225037480445
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)