BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 041622
(205 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224083266|ref|XP_002306974.1| predicted protein [Populus trichocarpa]
gi|222856423|gb|EEE93970.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 284 bits (726), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 147/208 (70%), Positives = 169/208 (81%), Gaps = 3/208 (1%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
MS+MRTDRKPPLAKSPIR RPRRVLRS STTLQTPPGSLTKSQKP R D E++D R EY
Sbjct: 1 MSRMRTDRKPPLAKSPIRIRPRRVLRSESTTLQTPPGSLTKSQKPNRKLDTEDSDLRPEY 60
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
SISCELRALA+MV+DEFG ++ GG+GKSL ANSSPLFERGRFYEEY+ARRN+RLKRK
Sbjct: 61 HSISCELRALAKMVRDEFGNGESTNGGVGKSLSANSSPLFERGRFYEEYSARRNDRLKRK 120
Query: 121 KSEACSGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENK 179
K + KTPYNLGVTVE +KRR ++KLES RKSV A VERNE PRY+LRSM+KENK
Sbjct: 121 KGDTGDDMKTPYNLGVTVESSKRRYTRKLESARKSVSDACLVERNETPRYLLRSMNKENK 180
Query: 180 KPPLPLNS--SAMATTEKKTVSRRAGKI 205
KPPLP+ S ++ E+K +R+ KI
Sbjct: 181 KPPLPVYSFEKSVLAGERKVAARKVRKI 208
>gi|224065691|ref|XP_002301923.1| predicted protein [Populus trichocarpa]
gi|222843649|gb|EEE81196.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 272 bits (695), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/208 (67%), Positives = 168/208 (80%), Gaps = 3/208 (1%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
MS+M+T KPPLAKSP R RPRR +RS STT QTPPGSL KSQ+P R WDMEE+D R EY
Sbjct: 1 MSRMKTVGKPPLAKSPTRLRPRRSIRSESTTSQTPPGSLAKSQRPNRKWDMEESDLRPEY 60
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
+SIS ELRALA+M + EFG +++ GG+G+SL ANSSPLFERGRFYEEY+ RRNERLKRK
Sbjct: 61 QSISWELRALAKMARGEFGNGESSNGGVGRSLSANSSPLFERGRFYEEYSVRRNERLKRK 120
Query: 121 KSEACSGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENK 179
K + + KTP NLGVT+E +KRRDSKKLESLRKSV AAY VERNE RYMLRSM+KENK
Sbjct: 121 KGDTGNEVKTPCNLGVTIESSKRRDSKKLESLRKSVSAAYCVERNENSRYMLRSMNKENK 180
Query: 180 KPPLPLNS--SAMATTEKKTVSRRAGKI 205
KPPLP+ + ++ +E+K +R+A KI
Sbjct: 181 KPPLPVYNYEKSVLASERKVTARKARKI 208
>gi|255539210|ref|XP_002510670.1| conserved hypothetical protein [Ricinus communis]
gi|223551371|gb|EEF52857.1| conserved hypothetical protein [Ricinus communis]
Length = 206
Score = 252 bits (643), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 139/206 (67%), Positives = 158/206 (76%), Gaps = 4/206 (1%)
Query: 4 MRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAEYRS 62
M+ DRKPPLAKSPIR RP RVLRS +TT LQ+PPGSL KSQK DMEE D R EYRS
Sbjct: 1 MKYDRKPPLAKSPIRVRPGRVLRSTTTTSLQSPPGSLIKSQKLNPGPDMEELDLRPEYRS 60
Query: 63 ISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS 122
IS ELRALARMVQDEFG + G KSL ANSSPLFERGRFY+ Y+ARRNERLKRK
Sbjct: 61 ISWELRALARMVQDEFGKENATNAGERKSLSANSSPLFERGRFYDVYSARRNERLKRKTG 120
Query: 123 EAC-SGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENKK 180
E G KTP+NLGV +E +K+RD+KKL+SLRKSV AAYSVE +E PRY LRS+SKENKK
Sbjct: 121 ETGEEGVKTPHNLGVNIESSKKRDTKKLQSLRKSVSAAYSVESSEIPRYSLRSLSKENKK 180
Query: 181 PPLPLN-SSAMATTEKKTVSRRAGKI 205
PPLP+N ++ EKK +RR K+
Sbjct: 181 PPLPVNYEKSVLANEKKVGARRVRKV 206
>gi|307136147|gb|ADN33991.1| hypothetical protein [Cucumis melo subsp. melo]
Length = 212
Score = 237 bits (604), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 139/212 (65%), Positives = 167/212 (78%), Gaps = 7/212 (3%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAE 59
MSKMR+DR+PPLA+SP+R R RRV +SNSTT +QTPPGSLTKSQK IR D++E++ R E
Sbjct: 1 MSKMRSDRRPPLARSPLRLRSRRVPQSNSTTSIQTPPGSLTKSQKMIRPSDLQESELRPE 60
Query: 60 YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERL-K 118
YR+ISCELRALA MV+ E G ++ G+G L NSS LFERGRFY+EY+ARRNERL K
Sbjct: 61 YRTISCELRALASMVRMELGCIESEGNGVGDKLSVNSSSLFERGRFYDEYSARRNERLKK 120
Query: 119 RKKSEACSGEKTPYNLGVTVEPAKRRDSKKL-ESLRKSVIAAYSVERNE--PRYMLRSMS 175
RKK+E + KTPYNLGVTVE +KR+ SKKL SLRKSV AAYS ER+E PRY+LRSM
Sbjct: 121 RKKAETVAEVKTPYNLGVTVESSKRQSSKKLVNSLRKSVSAAYSAERSETAPRYLLRSMM 180
Query: 176 KENKKPPLPLN--SSAMATTEKKTVSRRAGKI 205
KENKKPPLP+N S + E+K V+RRA +I
Sbjct: 181 KENKKPPLPVNLEKSMIGGGERKMVTRRARRI 212
>gi|225459872|ref|XP_002285934.1| PREDICTED: uncharacterized protein LOC100264304 [Vitis vinifera]
gi|302141661|emb|CBI18864.3| unnamed protein product [Vitis vinifera]
Length = 205
Score = 236 bits (601), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 125/207 (60%), Positives = 156/207 (75%), Gaps = 4/207 (1%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
MSKMR++ KPPL SPI+ R RRVLRS++TT+QTPPGSLTK+ R+ +EE+ R EY
Sbjct: 1 MSKMRSNGKPPLGNSPIQLRSRRVLRSSATTIQTPPGSLTKTLN--RTQVIEESGLRPEY 58
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
+ISCEL+ALA++V++E G + G + ANSSPLFERGRFY+EY+ARRNERLKRK
Sbjct: 59 LTISCELQALAKIVRNELGNGSSENVGFPNTFGANSSPLFERGRFYDEYSARRNERLKRK 118
Query: 121 KSEACSGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENK 179
K E KTPY LGVTVEP+KRRD+KK ESLRKSV AA SV+R+E PRY+LRS +KENK
Sbjct: 119 KGETGDVGKTPYKLGVTVEPSKRRDTKKFESLRKSVPAALSVDRSENPRYLLRSSTKENK 178
Query: 180 KPPLPLNSSAMATT-EKKTVSRRAGKI 205
KPP+P N A ++K +RR K+
Sbjct: 179 KPPMPFNVEKSAVAGDRKIGARRVRKV 205
>gi|449519444|ref|XP_004166745.1| PREDICTED: uncharacterized LOC101215930 [Cucumis sativus]
Length = 212
Score = 231 bits (588), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 137/212 (64%), Positives = 164/212 (77%), Gaps = 7/212 (3%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAE 59
MSKMR+DR+PPLA+SP+R R RRV +SNSTT +QTPPGSLTKSQK IR D++E + R E
Sbjct: 1 MSKMRSDRRPPLARSPLRLRSRRVPQSNSTTSMQTPPGSLTKSQKTIRPSDLQEMELRPE 60
Query: 60 YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERL-K 118
YR+ISCELRALA MV+ E G + G+G NSS LFERGR Y+EY+ARRNERL K
Sbjct: 61 YRTISCELRALASMVRMELGCEEYESNGVGDKSSVNSSSLFERGRLYDEYSARRNERLKK 120
Query: 119 RKKSEACSGEKTPYNLGVTVEPAKRRDSKKL-ESLRKSVIAAYSVERNE--PRYMLRSMS 175
RKK+E+ + KTPYNLGVTVE +KR+ SKKL SLRKSV AAYS ER+E PRY+LRSM
Sbjct: 121 RKKAESVAEVKTPYNLGVTVESSKRQSSKKLVNSLRKSVSAAYSEERSETGPRYLLRSMM 180
Query: 176 KENKKPPLPLN--SSAMATTEKKTVSRRAGKI 205
KENKKPPLP+N S + E+K V+RRA +I
Sbjct: 181 KENKKPPLPVNLEKSTIGGGERKMVTRRARRI 212
>gi|449452981|ref|XP_004144237.1| PREDICTED: uncharacterized protein LOC101215930 [Cucumis sativus]
Length = 212
Score = 231 bits (588), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/212 (64%), Positives = 165/212 (77%), Gaps = 7/212 (3%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAE 59
MSKMR+DR+PP+A+SP+R R RRV +SNSTT +QTPPGSLTKSQK IR D++E++ R E
Sbjct: 1 MSKMRSDRRPPIARSPLRLRSRRVPQSNSTTSMQTPPGSLTKSQKTIRPSDLQESELRPE 60
Query: 60 YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERL-K 118
YR+ISCELRALA MV+ E G + G+G NSS LFERGR Y+EY+ARRNERL K
Sbjct: 61 YRTISCELRALASMVRMELGCEEYESNGVGDKSSVNSSSLFERGRLYDEYSARRNERLKK 120
Query: 119 RKKSEACSGEKTPYNLGVTVEPAKRRDSKKL-ESLRKSVIAAYSVERNE--PRYMLRSMS 175
RKK+E+ + KTPYNLGVTVE +KR+ SKKL SLRKSV AAYS ER+E PRY+LRSM
Sbjct: 121 RKKAESVAEVKTPYNLGVTVESSKRQSSKKLVNSLRKSVSAAYSEERSETGPRYLLRSMM 180
Query: 176 KENKKPPLPLN--SSAMATTEKKTVSRRAGKI 205
KENKKPPLP+N S + E+K V+RRA +I
Sbjct: 181 KENKKPPLPVNLEKSTIGGGERKMVTRRARRI 212
>gi|255637797|gb|ACU19220.1| unknown [Glycine max]
Length = 204
Score = 186 bits (473), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/210 (53%), Positives = 137/210 (65%), Gaps = 16/210 (7%)
Query: 2 SKMRTDRKPPLAKSPIRTRPRRVLRS-NSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
S M++ KPPL +SPIR RP + S NS LQTPPGSLTKSQKP+ S +M R EY
Sbjct: 5 STMKSMYKPPLPRSPIRLRPSHAVHSSNSIALQTPPGSLTKSQKPVISPNM-----RPEY 59
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
R+ISCELRAL +MV+DEF ++ K ANSS LFERGRFY+EY+ARRNERL RK
Sbjct: 60 RTISCELRALTKMVRDEFSKANSEKAA-----SANSSVLFERGRFYDEYSARRNERLMRK 114
Query: 121 KSEACSGE---KTPYNLGVTVEPAKRRDS-KKLESLRKSVIAAYSVERNE-PRYMLRSMS 175
K E K ++LGVT+E K+ S +K+ +LRKS+ AAYS +E PRYMLRSM+
Sbjct: 115 KGVTAVDEGKAKPVHSLGVTMESGKKSSSVRKMGTLRKSIFAAYSSGVSETPRYMLRSMT 174
Query: 176 KENKKPPLPLNSSAMATTEKKTVSRRAGKI 205
KENKKPPL K + RA KI
Sbjct: 175 KENKKPPLAAKFDKSVVAGDKKMGTRARKI 204
>gi|356517354|ref|XP_003527352.1| PREDICTED: uncharacterized protein LOC100810616 [Glycine max]
Length = 204
Score = 183 bits (465), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 108/188 (57%), Positives = 131/188 (69%), Gaps = 16/188 (8%)
Query: 2 SKMRTDRKPPLAKSPIRTRPRRVLRS-NSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
S M++ KPPL +SPIR RP + S NS LQTPPGSLTKSQKP+ S +M R EY
Sbjct: 5 STMKSMYKPPLPRSPIRLRPSHAVHSSNSIALQTPPGSLTKSQKPVISPNM-----RPEY 59
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
R+ISCELRAL +MV+DEF ++ K ANSS LFERGRFY+EY+ARRNERL RK
Sbjct: 60 RTISCELRALTKMVRDEFSKANSEKAA-----SANSSVLFERGRFYDEYSARRNERLMRK 114
Query: 121 KSEACSGE---KTPYNLGVTVEPAKRRDS-KKLESLRKSVIAAYSVERNE-PRYMLRSMS 175
K E K ++LGVT+E K+ S +K+ +LRKS+ AAYS +E PRYMLRSM+
Sbjct: 115 KGVTAVDEGKAKPVHSLGVTMESGKKSSSVRKMGTLRKSISAAYSSGVSETPRYMLRSMT 174
Query: 176 KENKKPPL 183
KENKKPPL
Sbjct: 175 KENKKPPL 182
>gi|351726746|ref|NP_001236881.1| uncharacterized protein LOC100306057 [Glycine max]
gi|255627407|gb|ACU14048.1| unknown [Glycine max]
Length = 202
Score = 173 bits (439), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/209 (50%), Positives = 130/209 (62%), Gaps = 16/209 (7%)
Query: 2 SKMRTDRKPPLAKSPIRTRPRRV-LRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
S +++ KPPL +SPIR RP SNS LQTPPGSLTKSQKP+ S ++ R EY
Sbjct: 5 STIKSVYKPPLPRSPIRLRPSHAAYSSNSIALQTPPGSLTKSQKPVLSLNI-----RPEY 59
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
R+ISCEL AL +MV+DEF ++ K NSS LFERGRFY+EY+ARRNERL RK
Sbjct: 60 RTISCELWALTKMVRDEFSKVNSEKA------ATNSSVLFERGRFYDEYSARRNERLMRK 113
Query: 121 KSEACSGE---KTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVE-RNEPRYMLRSMSK 176
K E K +LGVT+E K+ +K+ +LRKS+ AAYS PRYMLRSM+K
Sbjct: 114 KGVTAVDEGKAKPVRSLGVTMESGKKTSVRKMGTLRKSISAAYSSGVSGTPRYMLRSMTK 173
Query: 177 ENKKPPLPLNSSAMATTEKKTVSRRAGKI 205
ENKKPPL +K + R KI
Sbjct: 174 ENKKPPLAAKFDKSVVAGEKKMGTRGRKI 202
>gi|297795311|ref|XP_002865540.1| hypothetical protein ARALYDRAFT_494800 [Arabidopsis lyrata subsp.
lyrata]
gi|297311375|gb|EFH41799.1| hypothetical protein ARALYDRAFT_494800 [Arabidopsis lyrata subsp.
lyrata]
Length = 204
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 109/211 (51%), Positives = 139/211 (65%), Gaps = 16/211 (7%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
MSKMR+ KPPL KSPIR R R+VL S +LQTP K + + D+ + EY
Sbjct: 1 MSKMRSYLKPPLPKSPIRLRSRQVLHS---SLQTPSPVFLKPGRRLSDVDLNPKP-QFEY 56
Query: 61 RSISCELRALARMVQDEFGTNDTNKGGIG-KSLCANSSPLFERGRFYEEYAARRNERLKR 119
SIS E+ ALA+MV+DEF + K +SL NS P+FERGRFYEEY+A+RNERL+R
Sbjct: 57 SSISSEIHALAKMVKDEFAEEENRKKSSNLESLATNSVPVFERGRFYEEYSAKRNERLRR 116
Query: 120 KKS-EACSG---EKTPYNLGVTVEPA--KRRDSKKLESLRKSVIAAYSVERNEPRYMLRS 173
KK EA G + TPYNLGV EP KRR + K ES++K+V++ VE + PRY LRS
Sbjct: 117 KKGEEAVEGGVVKGTPYNLGVNHEPMTNKRRGTAKKESIKKTVVSM--VETSIPRYSLRS 174
Query: 174 MSKENKKPPLPLNSSAMATTEKKTVSRRAGK 204
MSKEN+KPP+PLN +A + KTV+ R G+
Sbjct: 175 MSKENRKPPIPLN---VAVSAMKTVTTRRGR 202
>gi|388499978|gb|AFK38055.1| unknown [Medicago truncatula]
Length = 222
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/221 (50%), Positives = 142/221 (64%), Gaps = 24/221 (10%)
Query: 2 SKMRTDRKPP--LAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAE 59
S M++ +PP + KSP+R R R VL +S +LQTPPGSLTKS+K ++S ++ R E
Sbjct: 6 STMKSHLRPPRPIPKSPMRLRTRTVL--DSPSLQTPPGSLTKSRKLVQSPEL-----RPE 58
Query: 60 YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKR 119
YR+I+ E RAL+RMV+DEFG D + S S LF+RG+FY+EY+ARRNERLKR
Sbjct: 59 YRTIASEFRALSRMVRDEFGKPDPEEIAGTNSCNPKSGVLFQRGKFYDEYSARRNERLKR 118
Query: 120 KKSEAC-----SGEKTP-----YN-LGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-P 167
KK + K P +N LGVTVE K+ ++KL SLRKSV AAYS E +E P
Sbjct: 119 KKGVTVDEINTTSIKQPKVNNHHNVLGVTVESGKKNTARKLGSLRKSVSAAYSAEVSETP 178
Query: 168 RYMLRSMSKENKKPPLP---LNSSAMATTEKKTVSRRAGKI 205
RYMLRSM+KENKKPPL + S +K SRR GKI
Sbjct: 179 RYMLRSMTKENKKPPLAGSRFDKSVTVGEKKVGASRRVGKI 219
>gi|42568267|ref|NP_199048.2| uncharacterized protein [Arabidopsis thaliana]
gi|48958483|gb|AAT47794.1| At5g42330 [Arabidopsis thaliana]
gi|51536556|gb|AAU05516.1| At5g42330 [Arabidopsis thaliana]
gi|51970798|dbj|BAD44091.1| unknown protein [Arabidopsis thaliana]
gi|332007414|gb|AED94797.1| uncharacterized protein [Arabidopsis thaliana]
Length = 205
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/212 (51%), Positives = 139/212 (65%), Gaps = 17/212 (8%)
Query: 1 MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
MSKMR+ KPPL KSPIR R R+VL S +LQTP K + + D+ + +Y
Sbjct: 1 MSKMRSYLKPPLPKSPIRLRSRQVLHS---SLQTPSPGFLKPGRRLSDVDLNPKP-QIKY 56
Query: 61 RSISCELRALARMVQDEFGTN-DTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKR 119
SIS E+ ALA+MV+DEF D K +SL NS PLFERGRFYEEY+A+RNERL+R
Sbjct: 57 SSISSEIHALAKMVKDEFAEEEDRKKSSNLESLAKNSVPLFERGRFYEEYSAKRNERLRR 116
Query: 120 KKS-EACSG---EKTPYNLGVTVEPA--KRRDSKKLESLRKSVIAAYSVERNE-PRYMLR 172
KK EA G + TPYNLGV EP KRR + K ES++K+V++ VE + PRY LR
Sbjct: 117 KKGEEAVEGGVVKGTPYNLGVNHEPMTNKRRGTAKKESIKKTVVSM--VESSSIPRYSLR 174
Query: 173 SMSKENKKPPLPLNSSAMATTEKKTVSRRAGK 204
SM+KEN+KPPLPLN +A + K+V+ R G+
Sbjct: 175 SMAKENRKPPLPLN---VAVSAMKSVTTRKGR 203
>gi|9759469|dbj|BAB10474.1| unnamed protein product [Arabidopsis thaliana]
Length = 177
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/137 (54%), Positives = 94/137 (68%), Gaps = 13/137 (9%)
Query: 73 MVQDEFGTN-DTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS-EACSG--- 127
MV+DEF D K +SL NS PLFERGRFYEEY+A+RNERL+RKK EA G
Sbjct: 1 MVKDEFAEEEDRKKSSNLESLAKNSVPLFERGRFYEEYSAKRNERLRRKKGEEAVEGGVV 60
Query: 128 EKTPYNLGVTVEPA--KRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENKKPPLP 184
+ TPYNLGV EP KRR + K ES++K+V++ VE + PRY LRSM+KEN+KPPLP
Sbjct: 61 KGTPYNLGVNHEPMTNKRRGTAKKESIKKTVVSM--VESSSIPRYSLRSMAKENRKPPLP 118
Query: 185 LNSSAMATTEKKTVSRR 201
LN +A + K+V+ R
Sbjct: 119 LN---VAVSAMKSVTTR 132
>gi|357122872|ref|XP_003563138.1| PREDICTED: uncharacterized protein LOC100831929 [Brachypodium
distachyon]
Length = 239
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 89/181 (49%), Gaps = 28/181 (15%)
Query: 7 DRKPPLAKSPIRTRPRRVLRSN------STTLQTPPGSLTKSQKPIRSW-------DMEE 53
DR+PPLA SP R RPR + S+ TPPG K+Q P+RS E
Sbjct: 2 DRRPPLAVSPRRLRPRPGRAATGRPPVASSVHLTPPGLSKKAQTPMRSSFCALPPSSYLE 61
Query: 54 TDFRA--EYRSISCELRALAR----MVQDEFGTNDTNKGGI--GKSLCANSSPLFERGRF 105
RA ++ RA + + D+ + + G+ + +SPLFERGR
Sbjct: 62 PSPRAKLDFNCAPTPTRAAGKENLHIPDDDAVAFNLSDAGMEDWTATVEPTSPLFERGRL 121
Query: 106 YEEYAARRNERLKRKKSEACSGEKTPYNLG----VTVEPAKRRDSKKLES---LRKSVIA 158
Y+ Y+ARRNERLKRK +GE+ + V VE +KRR +KK + +R+S+ A
Sbjct: 122 YDLYSARRNERLKRKHGWYTAGEEEAGAMAQDPCVAVELSKRRGAKKTGAESVVRRSMPA 181
Query: 159 A 159
A
Sbjct: 182 A 182
>gi|226500280|ref|NP_001144393.1| uncharacterized protein LOC100277325 [Zea mays]
gi|195641510|gb|ACG40223.1| hypothetical protein [Zea mays]
Length = 288
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 72/150 (48%), Gaps = 38/150 (25%)
Query: 63 ISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKK- 121
++ L A+AR T T GG PLF RGR Y+ Y+ARRNERLKRK+
Sbjct: 143 VALNLAAIAR-------TAGTPSGG----------PLFVRGRLYDLYSARRNERLKRKQG 185
Query: 122 ---------SEACSGEKTPYNLGVTVEPAKRRDSKK------LESLRKSVIAA-YSVERN 165
S A E N V VE +KRR +KK ES+R+S+ A ++ R
Sbjct: 186 FPFYSEEFASAAVDPETMAENPCVAVELSKRRVAKKAYAASGAESVRRSMPAVDFAACRG 245
Query: 166 ---EPRYMLRSMSKENKKPPLPLNSSAMAT 192
PR LRS SKE KK + +MAT
Sbjct: 246 GDLGPRSSLRS-SKEMKKASAASGAVSMAT 274
>gi|293336431|ref|NP_001169778.1| uncharacterized protein LOC100383663 [Zea mays]
gi|224031613|gb|ACN34882.1| unknown [Zea mays]
gi|414590223|tpg|DAA40794.1| TPA: hypothetical protein ZEAMMB73_046692 [Zea mays]
Length = 289
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 72/150 (48%), Gaps = 38/150 (25%)
Query: 63 ISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKK- 121
++ L A+AR T T GG PLF RGR Y+ Y+ARRNERLKRK+
Sbjct: 144 VALNLAAIAR-------TAGTPSGG----------PLFVRGRLYDLYSARRNERLKRKQD 186
Query: 122 ---------SEACSGEKTPYNLGVTVEPAKRRDSKK------LESLRKSVIAA-YSVERN 165
S A E N V VE +KRR +KK ES+R+S+ A ++ R
Sbjct: 187 FPFYSEEFASAAVDPETMAENPCVAVELSKRRVAKKAYAASGAESVRRSMPAVDFAACRG 246
Query: 166 ---EPRYMLRSMSKENKKPPLPLNSSAMAT 192
PR LRS SKE KK + +MAT
Sbjct: 247 GDLGPRSSLRS-SKEMKKASAASGAVSMAT 275
>gi|242045624|ref|XP_002460683.1| hypothetical protein SORBIDRAFT_02g033050 [Sorghum bicolor]
gi|241924060|gb|EER97204.1| hypothetical protein SORBIDRAFT_02g033050 [Sorghum bicolor]
Length = 298
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 63/121 (52%), Gaps = 22/121 (18%)
Query: 98 PLFERGRFYEEYAARRNERLKRKK-----------SEACSGEKTPYNLGVTVEPAKRRDS 146
PLF RGR Y+ Y+ARRNERLKRK+ + + E + V V+ +KRR +
Sbjct: 170 PLFVRGRLYDAYSARRNERLKRKQGFPYYSEEFAPAASVDPEAIAEDPFVAVQLSKRRVA 229
Query: 147 KK------LESLRKSVIAA-YSVERN---EPRYMLRSMSKENKKPPLPLNSSAMATTEKK 196
KK ES+R+S A ++ R PR LRS SKE KKP + ++A E++
Sbjct: 230 KKAYTASGAESVRRSAPAIDFAAGRGGGLGPRSSLRS-SKEMKKPSAASGTVSVAIKERR 288
Query: 197 T 197
Sbjct: 289 V 289
>gi|125558294|gb|EAZ03830.1| hypothetical protein OsI_25959 [Oryza sativa Indica Group]
Length = 248
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 78/161 (48%), Gaps = 29/161 (18%)
Query: 31 TLQTPPGSLTKSQKPIRSW-------DMEET-----DFRAEYRSISCELRALA--RMVQD 76
++QTPPGS+ K+ P+RS +E T DF A + A R V D
Sbjct: 35 SVQTPPGSIKKATTPMRSSFCALPTSRLEPTPRAKLDFAAAPSPARAAVAAGKENRHVDD 94
Query: 77 EFGTNDTNKG-------GIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS---EACS 126
E + T L +SPLFERGR Y+ Y+ARRNERLKRK
Sbjct: 95 EVSLDLTAMAMPTPMPTWTASPLPPPTSPLFERGRLYDLYSARRNERLKRKHGFPAGEEE 154
Query: 127 GEKTPYNLGVTVEPAKRRDSKKL---ESLRKSVIAA--YSV 162
E + V VE +KRR +KK+ ES+R+S+ AA +SV
Sbjct: 155 AEAMAADPCVAVELSKRRGAKKMTGAESVRRSMPAAADFSV 195
>gi|115472035|ref|NP_001059616.1| Os07g0474300 [Oryza sativa Japonica Group]
gi|34394631|dbj|BAC83932.1| unknown protein [Oryza sativa Japonica Group]
gi|113611152|dbj|BAF21530.1| Os07g0474300 [Oryza sativa Japonica Group]
gi|125600196|gb|EAZ39772.1| hypothetical protein OsJ_24209 [Oryza sativa Japonica Group]
Length = 248
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 75/156 (48%), Gaps = 27/156 (17%)
Query: 31 TLQTPPGSLTKSQKPIRSW-------DMEET-----DFRAEYRSISCELRALA--RMVQD 76
++QTPPGS+ K+ P+RS +E T DF A + A R V D
Sbjct: 35 SVQTPPGSIKKATTPMRSSFCALPTSRLEPTPRAKLDFAAAPSPARAAVAAGKENRHVDD 94
Query: 77 EFGTNDTNKG-------GIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS---EACS 126
E + T L +SPLFERGR Y+ Y+ARRNERLKRK
Sbjct: 95 EVSLDLTAMAMPTPMPTWTASPLPPPTSPLFERGRLYDLYSARRNERLKRKHGFPAGEEE 154
Query: 127 GEKTPYNLGVTVEPAKRRDSKKL---ESLRKSVIAA 159
E + V VE +KRR +KK+ ES+R+S+ AA
Sbjct: 155 AEAMAADPCVAVELSKRRGAKKMTGAESVRRSMPAA 190
>gi|414886509|tpg|DAA62523.1| TPA: hypothetical protein ZEAMMB73_371595 [Zea mays]
Length = 280
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 57/111 (51%), Gaps = 26/111 (23%)
Query: 95 NSSPLFERGRFYEEYAARRNERLKRKK-------------SEACSGEKTPYNLGVTVEPA 141
+ PLF RGR Y+ Y+ARRNERLKRK+ + + E + V VE +
Sbjct: 150 DGGPLFVRGRLYDAYSARRNERLKRKQGFPYYYYSDEFASTYSVEPEAMAKDPCVAVELS 209
Query: 142 KRRDSKKL--------ESLRKSVIAA-YSVERN---EPRYMLRSMSKENKK 180
KRR +KK ES+R+S+ A ++ R PR LRS SKE KK
Sbjct: 210 KRRVAKKAYTATSGGGESVRRSMPAVDFAARRGGGLGPRSSLRS-SKEMKK 259
>gi|159118288|ref|XP_001709363.1| Protein 21.1 [Giardia lamblia ATCC 50803]
gi|157437479|gb|EDO81689.1| Protein 21.1 [Giardia lamblia ATCC 50803]
Length = 1207
Score = 37.7 bits (86), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 31/60 (51%)
Query: 46 IRSWDMEETDFRAEYRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRF 105
IR + T F+AE+ + + LAR + +E GTND NK S C+ +P+ GR
Sbjct: 617 IREIKQQMTAFKAEFSGYTAQTELLARTLFNESGTNDLNKSLKRSSSCSKINPMSGTGRI 676
>gi|308159758|gb|EFO62278.1| Protein 21.1 [Giardia lamblia P15]
Length = 622
Score = 35.8 bits (81), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 30/60 (50%)
Query: 46 IRSWDMEETDFRAEYRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRF 105
IR + F+AE+ + + LAR + +E GTND NK S C+ +P+ GR
Sbjct: 32 IREIKQQMIAFKAEFSGYTAQTELLARTLFNESGTNDLNKSLKRSSSCSRINPMSGTGRL 91
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.126 0.352
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,069,477,590
Number of Sequences: 23463169
Number of extensions: 120225773
Number of successful extensions: 253353
Number of sequences better than 100.0: 41
Number of HSP's better than 100.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 253249
Number of HSP's gapped (non-prelim): 57
length of query: 205
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 69
effective length of database: 9,168,204,383
effective search space: 632606102427
effective search space used: 632606102427
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 73 (32.7 bits)