BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 041622
         (205 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224083266|ref|XP_002306974.1| predicted protein [Populus trichocarpa]
 gi|222856423|gb|EEE93970.1| predicted protein [Populus trichocarpa]
          Length = 208

 Score =  284 bits (726), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 147/208 (70%), Positives = 169/208 (81%), Gaps = 3/208 (1%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           MS+MRTDRKPPLAKSPIR RPRRVLRS STTLQTPPGSLTKSQKP R  D E++D R EY
Sbjct: 1   MSRMRTDRKPPLAKSPIRIRPRRVLRSESTTLQTPPGSLTKSQKPNRKLDTEDSDLRPEY 60

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
            SISCELRALA+MV+DEFG  ++  GG+GKSL ANSSPLFERGRFYEEY+ARRN+RLKRK
Sbjct: 61  HSISCELRALAKMVRDEFGNGESTNGGVGKSLSANSSPLFERGRFYEEYSARRNDRLKRK 120

Query: 121 KSEACSGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENK 179
           K +     KTPYNLGVTVE +KRR ++KLES RKSV  A  VERNE PRY+LRSM+KENK
Sbjct: 121 KGDTGDDMKTPYNLGVTVESSKRRYTRKLESARKSVSDACLVERNETPRYLLRSMNKENK 180

Query: 180 KPPLPLNS--SAMATTEKKTVSRRAGKI 205
           KPPLP+ S   ++   E+K  +R+  KI
Sbjct: 181 KPPLPVYSFEKSVLAGERKVAARKVRKI 208


>gi|224065691|ref|XP_002301923.1| predicted protein [Populus trichocarpa]
 gi|222843649|gb|EEE81196.1| predicted protein [Populus trichocarpa]
          Length = 208

 Score =  272 bits (695), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/208 (67%), Positives = 168/208 (80%), Gaps = 3/208 (1%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           MS+M+T  KPPLAKSP R RPRR +RS STT QTPPGSL KSQ+P R WDMEE+D R EY
Sbjct: 1   MSRMKTVGKPPLAKSPTRLRPRRSIRSESTTSQTPPGSLAKSQRPNRKWDMEESDLRPEY 60

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
           +SIS ELRALA+M + EFG  +++ GG+G+SL ANSSPLFERGRFYEEY+ RRNERLKRK
Sbjct: 61  QSISWELRALAKMARGEFGNGESSNGGVGRSLSANSSPLFERGRFYEEYSVRRNERLKRK 120

Query: 121 KSEACSGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENK 179
           K +  +  KTP NLGVT+E +KRRDSKKLESLRKSV AAY VERNE  RYMLRSM+KENK
Sbjct: 121 KGDTGNEVKTPCNLGVTIESSKRRDSKKLESLRKSVSAAYCVERNENSRYMLRSMNKENK 180

Query: 180 KPPLPLNS--SAMATTEKKTVSRRAGKI 205
           KPPLP+ +   ++  +E+K  +R+A KI
Sbjct: 181 KPPLPVYNYEKSVLASERKVTARKARKI 208


>gi|255539210|ref|XP_002510670.1| conserved hypothetical protein [Ricinus communis]
 gi|223551371|gb|EEF52857.1| conserved hypothetical protein [Ricinus communis]
          Length = 206

 Score =  252 bits (643), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 139/206 (67%), Positives = 158/206 (76%), Gaps = 4/206 (1%)

Query: 4   MRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAEYRS 62
           M+ DRKPPLAKSPIR RP RVLRS +TT LQ+PPGSL KSQK     DMEE D R EYRS
Sbjct: 1   MKYDRKPPLAKSPIRVRPGRVLRSTTTTSLQSPPGSLIKSQKLNPGPDMEELDLRPEYRS 60

Query: 63  ISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS 122
           IS ELRALARMVQDEFG  +    G  KSL ANSSPLFERGRFY+ Y+ARRNERLKRK  
Sbjct: 61  ISWELRALARMVQDEFGKENATNAGERKSLSANSSPLFERGRFYDVYSARRNERLKRKTG 120

Query: 123 EAC-SGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENKK 180
           E    G KTP+NLGV +E +K+RD+KKL+SLRKSV AAYSVE +E PRY LRS+SKENKK
Sbjct: 121 ETGEEGVKTPHNLGVNIESSKKRDTKKLQSLRKSVSAAYSVESSEIPRYSLRSLSKENKK 180

Query: 181 PPLPLN-SSAMATTEKKTVSRRAGKI 205
           PPLP+N   ++   EKK  +RR  K+
Sbjct: 181 PPLPVNYEKSVLANEKKVGARRVRKV 206


>gi|307136147|gb|ADN33991.1| hypothetical protein [Cucumis melo subsp. melo]
          Length = 212

 Score =  237 bits (604), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 139/212 (65%), Positives = 167/212 (78%), Gaps = 7/212 (3%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAE 59
           MSKMR+DR+PPLA+SP+R R RRV +SNSTT +QTPPGSLTKSQK IR  D++E++ R E
Sbjct: 1   MSKMRSDRRPPLARSPLRLRSRRVPQSNSTTSIQTPPGSLTKSQKMIRPSDLQESELRPE 60

Query: 60  YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERL-K 118
           YR+ISCELRALA MV+ E G  ++   G+G  L  NSS LFERGRFY+EY+ARRNERL K
Sbjct: 61  YRTISCELRALASMVRMELGCIESEGNGVGDKLSVNSSSLFERGRFYDEYSARRNERLKK 120

Query: 119 RKKSEACSGEKTPYNLGVTVEPAKRRDSKKL-ESLRKSVIAAYSVERNE--PRYMLRSMS 175
           RKK+E  +  KTPYNLGVTVE +KR+ SKKL  SLRKSV AAYS ER+E  PRY+LRSM 
Sbjct: 121 RKKAETVAEVKTPYNLGVTVESSKRQSSKKLVNSLRKSVSAAYSAERSETAPRYLLRSMM 180

Query: 176 KENKKPPLPLN--SSAMATTEKKTVSRRAGKI 205
           KENKKPPLP+N   S +   E+K V+RRA +I
Sbjct: 181 KENKKPPLPVNLEKSMIGGGERKMVTRRARRI 212


>gi|225459872|ref|XP_002285934.1| PREDICTED: uncharacterized protein LOC100264304 [Vitis vinifera]
 gi|302141661|emb|CBI18864.3| unnamed protein product [Vitis vinifera]
          Length = 205

 Score =  236 bits (601), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 125/207 (60%), Positives = 156/207 (75%), Gaps = 4/207 (1%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           MSKMR++ KPPL  SPI+ R RRVLRS++TT+QTPPGSLTK+    R+  +EE+  R EY
Sbjct: 1   MSKMRSNGKPPLGNSPIQLRSRRVLRSSATTIQTPPGSLTKTLN--RTQVIEESGLRPEY 58

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
            +ISCEL+ALA++V++E G   +   G   +  ANSSPLFERGRFY+EY+ARRNERLKRK
Sbjct: 59  LTISCELQALAKIVRNELGNGSSENVGFPNTFGANSSPLFERGRFYDEYSARRNERLKRK 118

Query: 121 KSEACSGEKTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENK 179
           K E     KTPY LGVTVEP+KRRD+KK ESLRKSV AA SV+R+E PRY+LRS +KENK
Sbjct: 119 KGETGDVGKTPYKLGVTVEPSKRRDTKKFESLRKSVPAALSVDRSENPRYLLRSSTKENK 178

Query: 180 KPPLPLNSSAMATT-EKKTVSRRAGKI 205
           KPP+P N    A   ++K  +RR  K+
Sbjct: 179 KPPMPFNVEKSAVAGDRKIGARRVRKV 205


>gi|449519444|ref|XP_004166745.1| PREDICTED: uncharacterized LOC101215930 [Cucumis sativus]
          Length = 212

 Score =  231 bits (588), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 137/212 (64%), Positives = 164/212 (77%), Gaps = 7/212 (3%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAE 59
           MSKMR+DR+PPLA+SP+R R RRV +SNSTT +QTPPGSLTKSQK IR  D++E + R E
Sbjct: 1   MSKMRSDRRPPLARSPLRLRSRRVPQSNSTTSMQTPPGSLTKSQKTIRPSDLQEMELRPE 60

Query: 60  YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERL-K 118
           YR+ISCELRALA MV+ E G  +    G+G     NSS LFERGR Y+EY+ARRNERL K
Sbjct: 61  YRTISCELRALASMVRMELGCEEYESNGVGDKSSVNSSSLFERGRLYDEYSARRNERLKK 120

Query: 119 RKKSEACSGEKTPYNLGVTVEPAKRRDSKKL-ESLRKSVIAAYSVERNE--PRYMLRSMS 175
           RKK+E+ +  KTPYNLGVTVE +KR+ SKKL  SLRKSV AAYS ER+E  PRY+LRSM 
Sbjct: 121 RKKAESVAEVKTPYNLGVTVESSKRQSSKKLVNSLRKSVSAAYSEERSETGPRYLLRSMM 180

Query: 176 KENKKPPLPLN--SSAMATTEKKTVSRRAGKI 205
           KENKKPPLP+N   S +   E+K V+RRA +I
Sbjct: 181 KENKKPPLPVNLEKSTIGGGERKMVTRRARRI 212


>gi|449452981|ref|XP_004144237.1| PREDICTED: uncharacterized protein LOC101215930 [Cucumis sativus]
          Length = 212

 Score =  231 bits (588), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 136/212 (64%), Positives = 165/212 (77%), Gaps = 7/212 (3%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTT-LQTPPGSLTKSQKPIRSWDMEETDFRAE 59
           MSKMR+DR+PP+A+SP+R R RRV +SNSTT +QTPPGSLTKSQK IR  D++E++ R E
Sbjct: 1   MSKMRSDRRPPIARSPLRLRSRRVPQSNSTTSMQTPPGSLTKSQKTIRPSDLQESELRPE 60

Query: 60  YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERL-K 118
           YR+ISCELRALA MV+ E G  +    G+G     NSS LFERGR Y+EY+ARRNERL K
Sbjct: 61  YRTISCELRALASMVRMELGCEEYESNGVGDKSSVNSSSLFERGRLYDEYSARRNERLKK 120

Query: 119 RKKSEACSGEKTPYNLGVTVEPAKRRDSKKL-ESLRKSVIAAYSVERNE--PRYMLRSMS 175
           RKK+E+ +  KTPYNLGVTVE +KR+ SKKL  SLRKSV AAYS ER+E  PRY+LRSM 
Sbjct: 121 RKKAESVAEVKTPYNLGVTVESSKRQSSKKLVNSLRKSVSAAYSEERSETGPRYLLRSMM 180

Query: 176 KENKKPPLPLN--SSAMATTEKKTVSRRAGKI 205
           KENKKPPLP+N   S +   E+K V+RRA +I
Sbjct: 181 KENKKPPLPVNLEKSTIGGGERKMVTRRARRI 212


>gi|255637797|gb|ACU19220.1| unknown [Glycine max]
          Length = 204

 Score =  186 bits (473), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 113/210 (53%), Positives = 137/210 (65%), Gaps = 16/210 (7%)

Query: 2   SKMRTDRKPPLAKSPIRTRPRRVLRS-NSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           S M++  KPPL +SPIR RP   + S NS  LQTPPGSLTKSQKP+ S +M     R EY
Sbjct: 5   STMKSMYKPPLPRSPIRLRPSHAVHSSNSIALQTPPGSLTKSQKPVISPNM-----RPEY 59

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
           R+ISCELRAL +MV+DEF   ++ K        ANSS LFERGRFY+EY+ARRNERL RK
Sbjct: 60  RTISCELRALTKMVRDEFSKANSEKAA-----SANSSVLFERGRFYDEYSARRNERLMRK 114

Query: 121 KSEACSGE---KTPYNLGVTVEPAKRRDS-KKLESLRKSVIAAYSVERNE-PRYMLRSMS 175
           K      E   K  ++LGVT+E  K+  S +K+ +LRKS+ AAYS   +E PRYMLRSM+
Sbjct: 115 KGVTAVDEGKAKPVHSLGVTMESGKKSSSVRKMGTLRKSIFAAYSSGVSETPRYMLRSMT 174

Query: 176 KENKKPPLPLNSSAMATTEKKTVSRRAGKI 205
           KENKKPPL            K +  RA KI
Sbjct: 175 KENKKPPLAAKFDKSVVAGDKKMGTRARKI 204


>gi|356517354|ref|XP_003527352.1| PREDICTED: uncharacterized protein LOC100810616 [Glycine max]
          Length = 204

 Score =  183 bits (465), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 108/188 (57%), Positives = 131/188 (69%), Gaps = 16/188 (8%)

Query: 2   SKMRTDRKPPLAKSPIRTRPRRVLRS-NSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           S M++  KPPL +SPIR RP   + S NS  LQTPPGSLTKSQKP+ S +M     R EY
Sbjct: 5   STMKSMYKPPLPRSPIRLRPSHAVHSSNSIALQTPPGSLTKSQKPVISPNM-----RPEY 59

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
           R+ISCELRAL +MV+DEF   ++ K        ANSS LFERGRFY+EY+ARRNERL RK
Sbjct: 60  RTISCELRALTKMVRDEFSKANSEKAA-----SANSSVLFERGRFYDEYSARRNERLMRK 114

Query: 121 KSEACSGE---KTPYNLGVTVEPAKRRDS-KKLESLRKSVIAAYSVERNE-PRYMLRSMS 175
           K      E   K  ++LGVT+E  K+  S +K+ +LRKS+ AAYS   +E PRYMLRSM+
Sbjct: 115 KGVTAVDEGKAKPVHSLGVTMESGKKSSSVRKMGTLRKSISAAYSSGVSETPRYMLRSMT 174

Query: 176 KENKKPPL 183
           KENKKPPL
Sbjct: 175 KENKKPPL 182


>gi|351726746|ref|NP_001236881.1| uncharacterized protein LOC100306057 [Glycine max]
 gi|255627407|gb|ACU14048.1| unknown [Glycine max]
          Length = 202

 Score =  173 bits (439), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/209 (50%), Positives = 130/209 (62%), Gaps = 16/209 (7%)

Query: 2   SKMRTDRKPPLAKSPIRTRPRRV-LRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           S +++  KPPL +SPIR RP      SNS  LQTPPGSLTKSQKP+ S ++     R EY
Sbjct: 5   STIKSVYKPPLPRSPIRLRPSHAAYSSNSIALQTPPGSLTKSQKPVLSLNI-----RPEY 59

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRK 120
           R+ISCEL AL +MV+DEF   ++ K         NSS LFERGRFY+EY+ARRNERL RK
Sbjct: 60  RTISCELWALTKMVRDEFSKVNSEKA------ATNSSVLFERGRFYDEYSARRNERLMRK 113

Query: 121 KSEACSGE---KTPYNLGVTVEPAKRRDSKKLESLRKSVIAAYSVE-RNEPRYMLRSMSK 176
           K      E   K   +LGVT+E  K+   +K+ +LRKS+ AAYS      PRYMLRSM+K
Sbjct: 114 KGVTAVDEGKAKPVRSLGVTMESGKKTSVRKMGTLRKSISAAYSSGVSGTPRYMLRSMTK 173

Query: 177 ENKKPPLPLNSSAMATTEKKTVSRRAGKI 205
           ENKKPPL           +K +  R  KI
Sbjct: 174 ENKKPPLAAKFDKSVVAGEKKMGTRGRKI 202


>gi|297795311|ref|XP_002865540.1| hypothetical protein ARALYDRAFT_494800 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311375|gb|EFH41799.1| hypothetical protein ARALYDRAFT_494800 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 204

 Score =  173 bits (438), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 109/211 (51%), Positives = 139/211 (65%), Gaps = 16/211 (7%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           MSKMR+  KPPL KSPIR R R+VL S   +LQTP     K  + +   D+     + EY
Sbjct: 1   MSKMRSYLKPPLPKSPIRLRSRQVLHS---SLQTPSPVFLKPGRRLSDVDLNPKP-QFEY 56

Query: 61  RSISCELRALARMVQDEFGTNDTNKGGIG-KSLCANSSPLFERGRFYEEYAARRNERLKR 119
            SIS E+ ALA+MV+DEF   +  K     +SL  NS P+FERGRFYEEY+A+RNERL+R
Sbjct: 57  SSISSEIHALAKMVKDEFAEEENRKKSSNLESLATNSVPVFERGRFYEEYSAKRNERLRR 116

Query: 120 KKS-EACSG---EKTPYNLGVTVEPA--KRRDSKKLESLRKSVIAAYSVERNEPRYMLRS 173
           KK  EA  G   + TPYNLGV  EP   KRR + K ES++K+V++   VE + PRY LRS
Sbjct: 117 KKGEEAVEGGVVKGTPYNLGVNHEPMTNKRRGTAKKESIKKTVVSM--VETSIPRYSLRS 174

Query: 174 MSKENKKPPLPLNSSAMATTEKKTVSRRAGK 204
           MSKEN+KPP+PLN   +A +  KTV+ R G+
Sbjct: 175 MSKENRKPPIPLN---VAVSAMKTVTTRRGR 202


>gi|388499978|gb|AFK38055.1| unknown [Medicago truncatula]
          Length = 222

 Score =  170 bits (431), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 112/221 (50%), Positives = 142/221 (64%), Gaps = 24/221 (10%)

Query: 2   SKMRTDRKPP--LAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAE 59
           S M++  +PP  + KSP+R R R VL  +S +LQTPPGSLTKS+K ++S ++     R E
Sbjct: 6   STMKSHLRPPRPIPKSPMRLRTRTVL--DSPSLQTPPGSLTKSRKLVQSPEL-----RPE 58

Query: 60  YRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKR 119
           YR+I+ E RAL+RMV+DEFG  D  +     S    S  LF+RG+FY+EY+ARRNERLKR
Sbjct: 59  YRTIASEFRALSRMVRDEFGKPDPEEIAGTNSCNPKSGVLFQRGKFYDEYSARRNERLKR 118

Query: 120 KKSEAC-----SGEKTP-----YN-LGVTVEPAKRRDSKKLESLRKSVIAAYSVERNE-P 167
           KK         +  K P     +N LGVTVE  K+  ++KL SLRKSV AAYS E +E P
Sbjct: 119 KKGVTVDEINTTSIKQPKVNNHHNVLGVTVESGKKNTARKLGSLRKSVSAAYSAEVSETP 178

Query: 168 RYMLRSMSKENKKPPLP---LNSSAMATTEKKTVSRRAGKI 205
           RYMLRSM+KENKKPPL     + S     +K   SRR GKI
Sbjct: 179 RYMLRSMTKENKKPPLAGSRFDKSVTVGEKKVGASRRVGKI 219


>gi|42568267|ref|NP_199048.2| uncharacterized protein [Arabidopsis thaliana]
 gi|48958483|gb|AAT47794.1| At5g42330 [Arabidopsis thaliana]
 gi|51536556|gb|AAU05516.1| At5g42330 [Arabidopsis thaliana]
 gi|51970798|dbj|BAD44091.1| unknown protein [Arabidopsis thaliana]
 gi|332007414|gb|AED94797.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 205

 Score =  168 bits (425), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 109/212 (51%), Positives = 139/212 (65%), Gaps = 17/212 (8%)

Query: 1   MSKMRTDRKPPLAKSPIRTRPRRVLRSNSTTLQTPPGSLTKSQKPIRSWDMEETDFRAEY 60
           MSKMR+  KPPL KSPIR R R+VL S   +LQTP     K  + +   D+     + +Y
Sbjct: 1   MSKMRSYLKPPLPKSPIRLRSRQVLHS---SLQTPSPGFLKPGRRLSDVDLNPKP-QIKY 56

Query: 61  RSISCELRALARMVQDEFGTN-DTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKR 119
            SIS E+ ALA+MV+DEF    D  K    +SL  NS PLFERGRFYEEY+A+RNERL+R
Sbjct: 57  SSISSEIHALAKMVKDEFAEEEDRKKSSNLESLAKNSVPLFERGRFYEEYSAKRNERLRR 116

Query: 120 KKS-EACSG---EKTPYNLGVTVEPA--KRRDSKKLESLRKSVIAAYSVERNE-PRYMLR 172
           KK  EA  G   + TPYNLGV  EP   KRR + K ES++K+V++   VE +  PRY LR
Sbjct: 117 KKGEEAVEGGVVKGTPYNLGVNHEPMTNKRRGTAKKESIKKTVVSM--VESSSIPRYSLR 174

Query: 173 SMSKENKKPPLPLNSSAMATTEKKTVSRRAGK 204
           SM+KEN+KPPLPLN   +A +  K+V+ R G+
Sbjct: 175 SMAKENRKPPLPLN---VAVSAMKSVTTRKGR 203


>gi|9759469|dbj|BAB10474.1| unnamed protein product [Arabidopsis thaliana]
          Length = 177

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/137 (54%), Positives = 94/137 (68%), Gaps = 13/137 (9%)

Query: 73  MVQDEFGTN-DTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS-EACSG--- 127
           MV+DEF    D  K    +SL  NS PLFERGRFYEEY+A+RNERL+RKK  EA  G   
Sbjct: 1   MVKDEFAEEEDRKKSSNLESLAKNSVPLFERGRFYEEYSAKRNERLRRKKGEEAVEGGVV 60

Query: 128 EKTPYNLGVTVEPA--KRRDSKKLESLRKSVIAAYSVERNE-PRYMLRSMSKENKKPPLP 184
           + TPYNLGV  EP   KRR + K ES++K+V++   VE +  PRY LRSM+KEN+KPPLP
Sbjct: 61  KGTPYNLGVNHEPMTNKRRGTAKKESIKKTVVSM--VESSSIPRYSLRSMAKENRKPPLP 118

Query: 185 LNSSAMATTEKKTVSRR 201
           LN   +A +  K+V+ R
Sbjct: 119 LN---VAVSAMKSVTTR 132


>gi|357122872|ref|XP_003563138.1| PREDICTED: uncharacterized protein LOC100831929 [Brachypodium
           distachyon]
          Length = 239

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 89/181 (49%), Gaps = 28/181 (15%)

Query: 7   DRKPPLAKSPIRTRPRRVLRSN------STTLQTPPGSLTKSQKPIRSW-------DMEE 53
           DR+PPLA SP R RPR    +       S+   TPPG   K+Q P+RS           E
Sbjct: 2   DRRPPLAVSPRRLRPRPGRAATGRPPVASSVHLTPPGLSKKAQTPMRSSFCALPPSSYLE 61

Query: 54  TDFRA--EYRSISCELRALAR----MVQDEFGTNDTNKGGI--GKSLCANSSPLFERGRF 105
              RA  ++       RA  +    +  D+    + +  G+    +    +SPLFERGR 
Sbjct: 62  PSPRAKLDFNCAPTPTRAAGKENLHIPDDDAVAFNLSDAGMEDWTATVEPTSPLFERGRL 121

Query: 106 YEEYAARRNERLKRKKSEACSGEKTPYNLG----VTVEPAKRRDSKKLES---LRKSVIA 158
           Y+ Y+ARRNERLKRK     +GE+    +     V VE +KRR +KK  +   +R+S+ A
Sbjct: 122 YDLYSARRNERLKRKHGWYTAGEEEAGAMAQDPCVAVELSKRRGAKKTGAESVVRRSMPA 181

Query: 159 A 159
           A
Sbjct: 182 A 182


>gi|226500280|ref|NP_001144393.1| uncharacterized protein LOC100277325 [Zea mays]
 gi|195641510|gb|ACG40223.1| hypothetical protein [Zea mays]
          Length = 288

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/150 (38%), Positives = 72/150 (48%), Gaps = 38/150 (25%)

Query: 63  ISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKK- 121
           ++  L A+AR       T  T  GG          PLF RGR Y+ Y+ARRNERLKRK+ 
Sbjct: 143 VALNLAAIAR-------TAGTPSGG----------PLFVRGRLYDLYSARRNERLKRKQG 185

Query: 122 ---------SEACSGEKTPYNLGVTVEPAKRRDSKK------LESLRKSVIAA-YSVERN 165
                    S A   E    N  V VE +KRR +KK       ES+R+S+ A  ++  R 
Sbjct: 186 FPFYSEEFASAAVDPETMAENPCVAVELSKRRVAKKAYAASGAESVRRSMPAVDFAACRG 245

Query: 166 ---EPRYMLRSMSKENKKPPLPLNSSAMAT 192
               PR  LRS SKE KK      + +MAT
Sbjct: 246 GDLGPRSSLRS-SKEMKKASAASGAVSMAT 274


>gi|293336431|ref|NP_001169778.1| uncharacterized protein LOC100383663 [Zea mays]
 gi|224031613|gb|ACN34882.1| unknown [Zea mays]
 gi|414590223|tpg|DAA40794.1| TPA: hypothetical protein ZEAMMB73_046692 [Zea mays]
          Length = 289

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/150 (38%), Positives = 72/150 (48%), Gaps = 38/150 (25%)

Query: 63  ISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRFYEEYAARRNERLKRKK- 121
           ++  L A+AR       T  T  GG          PLF RGR Y+ Y+ARRNERLKRK+ 
Sbjct: 144 VALNLAAIAR-------TAGTPSGG----------PLFVRGRLYDLYSARRNERLKRKQD 186

Query: 122 ---------SEACSGEKTPYNLGVTVEPAKRRDSKK------LESLRKSVIAA-YSVERN 165
                    S A   E    N  V VE +KRR +KK       ES+R+S+ A  ++  R 
Sbjct: 187 FPFYSEEFASAAVDPETMAENPCVAVELSKRRVAKKAYAASGAESVRRSMPAVDFAACRG 246

Query: 166 ---EPRYMLRSMSKENKKPPLPLNSSAMAT 192
               PR  LRS SKE KK      + +MAT
Sbjct: 247 GDLGPRSSLRS-SKEMKKASAASGAVSMAT 275


>gi|242045624|ref|XP_002460683.1| hypothetical protein SORBIDRAFT_02g033050 [Sorghum bicolor]
 gi|241924060|gb|EER97204.1| hypothetical protein SORBIDRAFT_02g033050 [Sorghum bicolor]
          Length = 298

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 63/121 (52%), Gaps = 22/121 (18%)

Query: 98  PLFERGRFYEEYAARRNERLKRKK-----------SEACSGEKTPYNLGVTVEPAKRRDS 146
           PLF RGR Y+ Y+ARRNERLKRK+           + +   E    +  V V+ +KRR +
Sbjct: 170 PLFVRGRLYDAYSARRNERLKRKQGFPYYSEEFAPAASVDPEAIAEDPFVAVQLSKRRVA 229

Query: 147 KK------LESLRKSVIAA-YSVERN---EPRYMLRSMSKENKKPPLPLNSSAMATTEKK 196
           KK       ES+R+S  A  ++  R     PR  LRS SKE KKP     + ++A  E++
Sbjct: 230 KKAYTASGAESVRRSAPAIDFAAGRGGGLGPRSSLRS-SKEMKKPSAASGTVSVAIKERR 288

Query: 197 T 197
            
Sbjct: 289 V 289


>gi|125558294|gb|EAZ03830.1| hypothetical protein OsI_25959 [Oryza sativa Indica Group]
          Length = 248

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/161 (36%), Positives = 78/161 (48%), Gaps = 29/161 (18%)

Query: 31  TLQTPPGSLTKSQKPIRSW-------DMEET-----DFRAEYRSISCELRALA--RMVQD 76
           ++QTPPGS+ K+  P+RS         +E T     DF A        + A    R V D
Sbjct: 35  SVQTPPGSIKKATTPMRSSFCALPTSRLEPTPRAKLDFAAAPSPARAAVAAGKENRHVDD 94

Query: 77  EFGTNDTNKG-------GIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS---EACS 126
           E   + T               L   +SPLFERGR Y+ Y+ARRNERLKRK         
Sbjct: 95  EVSLDLTAMAMPTPMPTWTASPLPPPTSPLFERGRLYDLYSARRNERLKRKHGFPAGEEE 154

Query: 127 GEKTPYNLGVTVEPAKRRDSKKL---ESLRKSVIAA--YSV 162
            E    +  V VE +KRR +KK+   ES+R+S+ AA  +SV
Sbjct: 155 AEAMAADPCVAVELSKRRGAKKMTGAESVRRSMPAAADFSV 195


>gi|115472035|ref|NP_001059616.1| Os07g0474300 [Oryza sativa Japonica Group]
 gi|34394631|dbj|BAC83932.1| unknown protein [Oryza sativa Japonica Group]
 gi|113611152|dbj|BAF21530.1| Os07g0474300 [Oryza sativa Japonica Group]
 gi|125600196|gb|EAZ39772.1| hypothetical protein OsJ_24209 [Oryza sativa Japonica Group]
          Length = 248

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 75/156 (48%), Gaps = 27/156 (17%)

Query: 31  TLQTPPGSLTKSQKPIRSW-------DMEET-----DFRAEYRSISCELRALA--RMVQD 76
           ++QTPPGS+ K+  P+RS         +E T     DF A        + A    R V D
Sbjct: 35  SVQTPPGSIKKATTPMRSSFCALPTSRLEPTPRAKLDFAAAPSPARAAVAAGKENRHVDD 94

Query: 77  EFGTNDTNKG-------GIGKSLCANSSPLFERGRFYEEYAARRNERLKRKKS---EACS 126
           E   + T               L   +SPLFERGR Y+ Y+ARRNERLKRK         
Sbjct: 95  EVSLDLTAMAMPTPMPTWTASPLPPPTSPLFERGRLYDLYSARRNERLKRKHGFPAGEEE 154

Query: 127 GEKTPYNLGVTVEPAKRRDSKKL---ESLRKSVIAA 159
            E    +  V VE +KRR +KK+   ES+R+S+ AA
Sbjct: 155 AEAMAADPCVAVELSKRRGAKKMTGAESVRRSMPAA 190


>gi|414886509|tpg|DAA62523.1| TPA: hypothetical protein ZEAMMB73_371595 [Zea mays]
          Length = 280

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 43/111 (38%), Positives = 57/111 (51%), Gaps = 26/111 (23%)

Query: 95  NSSPLFERGRFYEEYAARRNERLKRKK-------------SEACSGEKTPYNLGVTVEPA 141
           +  PLF RGR Y+ Y+ARRNERLKRK+             + +   E    +  V VE +
Sbjct: 150 DGGPLFVRGRLYDAYSARRNERLKRKQGFPYYYYSDEFASTYSVEPEAMAKDPCVAVELS 209

Query: 142 KRRDSKKL--------ESLRKSVIAA-YSVERN---EPRYMLRSMSKENKK 180
           KRR +KK         ES+R+S+ A  ++  R     PR  LRS SKE KK
Sbjct: 210 KRRVAKKAYTATSGGGESVRRSMPAVDFAARRGGGLGPRSSLRS-SKEMKK 259


>gi|159118288|ref|XP_001709363.1| Protein 21.1 [Giardia lamblia ATCC 50803]
 gi|157437479|gb|EDO81689.1| Protein 21.1 [Giardia lamblia ATCC 50803]
          Length = 1207

 Score = 37.7 bits (86), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 31/60 (51%)

Query: 46  IRSWDMEETDFRAEYRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRF 105
           IR    + T F+AE+   + +   LAR + +E GTND NK     S C+  +P+   GR 
Sbjct: 617 IREIKQQMTAFKAEFSGYTAQTELLARTLFNESGTNDLNKSLKRSSSCSKINPMSGTGRI 676


>gi|308159758|gb|EFO62278.1| Protein 21.1 [Giardia lamblia P15]
          Length = 622

 Score = 35.8 bits (81), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 30/60 (50%)

Query: 46  IRSWDMEETDFRAEYRSISCELRALARMVQDEFGTNDTNKGGIGKSLCANSSPLFERGRF 105
           IR    +   F+AE+   + +   LAR + +E GTND NK     S C+  +P+   GR 
Sbjct: 32  IREIKQQMIAFKAEFSGYTAQTELLARTLFNESGTNDLNKSLKRSSSCSRINPMSGTGRL 91


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.312    0.126    0.352 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,069,477,590
Number of Sequences: 23463169
Number of extensions: 120225773
Number of successful extensions: 253353
Number of sequences better than 100.0: 41
Number of HSP's better than 100.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 253249
Number of HSP's gapped (non-prelim): 57
length of query: 205
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 69
effective length of database: 9,168,204,383
effective search space: 632606102427
effective search space used: 632606102427
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 73 (32.7 bits)