BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025573
         (250 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255561651|ref|XP_002521835.1| conserved hypothetical protein [Ricinus communis]
 gi|223538873|gb|EEF40471.1| conserved hypothetical protein [Ricinus communis]
          Length = 190

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 142/189 (75%), Positives = 163/189 (86%)

Query: 61  MNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKY 120
           MN+AAGQSGDPEKLN+D ++++ R LWD SPQPVK FPWNRALENFIQLILDL +AVVKY
Sbjct: 1   MNVAAGQSGDPEKLNVDHIIDKARNLWDTSPQPVKRFPWNRALENFIQLILDLVVAVVKY 60

Query: 121 LSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHL 180
           L VPLLAVSSLSEMSYCAH KKL  VP+PLLIG+ ++ +L+ET L+LSPLLKDAEVPWHL
Sbjct: 61  LCVPLLAVSSLSEMSYCAHHKKLSFVPLPLLIGIVVAGILKETALELSPLLKDAEVPWHL 120

Query: 181 IAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATPSNDSV 240
           IAI IFF++IKLPGPYYPYWGR+FIPHFANG LWRT+W AFLWYRKP+KAS++    DSV
Sbjct: 121 IAIAIFFSLIKLPGPYYPYWGRIFIPHFANGVLWRTIWSAFLWYRKPRKASQSMLKQDSV 180

Query: 241 NTGLSEPNK 249
               S  NK
Sbjct: 181 IGSNSGTNK 189


>gi|225428693|ref|XP_002284957.1| PREDICTED: uncharacterized protein LOC100250293 [Vitis vinifera]
          Length = 253

 Score =  290 bits (743), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 147/253 (58%), Positives = 179/253 (70%), Gaps = 10/253 (3%)

Query: 3   TNPYQSHLVSASTHLHRSSILPLHGLKQVLP-------YFRASKLRLGANRNGPLWNAKG 55
           ++ Y +H  +AST    SS + L      L        Y + +   L    NG   NA+ 
Sbjct: 2   SSTYPTHFFTASTPRRPSSEMSLGTHSPFLSKFEGLSWYTKTNGCILSFKPNGQFQNARS 61

Query: 56  NICCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFM 115
           N+   + + AG SGDPEKLNLD ++ + R LWD+SPQPVKSFPWNR LENFIQLILDL +
Sbjct: 62  NLLSKIKVTAGHSGDPEKLNLDAIIEKARTLWDSSPQPVKSFPWNRTLENFIQLILDLTL 121

Query: 116 AVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAE 175
           AV+KYL  PLLAVSSLSEMSYCA+EKKLFLVP+PLLIG+ ++ LL++  L+LSP+LKDAE
Sbjct: 122 AVIKYLCAPLLAVSSLSEMSYCANEKKLFLVPIPLLIGIVVAGLLKDAALELSPVLKDAE 181

Query: 176 VPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATP 235
           V WHLIAI IFFT++KLPGPYYPYWGR+FIPHFANG L RTLW  FLWYR+P+   E   
Sbjct: 182 VAWHLIAIAIFFTLLKLPGPYYPYWGRIFIPHFANGGLLRTLWSVFLWYRRPR---ETLL 238

Query: 236 SNDSVNTGLSEPN 248
              S+N   S PN
Sbjct: 239 QRKSLNGSGSVPN 251


>gi|297741337|emb|CBI32468.3| unnamed protein product [Vitis vinifera]
          Length = 232

 Score =  287 bits (734), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 138/215 (64%), Positives = 165/215 (76%), Gaps = 3/215 (1%)

Query: 34  YFRASKLRLGANRNGPLWNAKGNICCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQP 93
           Y + +   L    NG   NA+ N+   + + AG SGDPEKLNLD ++ + R LWD+SPQP
Sbjct: 19  YTKTNGCILSFKPNGQFQNARSNLLSKIKVTAGHSGDPEKLNLDAIIEKARTLWDSSPQP 78

Query: 94  VKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIG 153
           VKSFPWNR LENFIQLILDL +AV+KYL  PLLAVSSLSEMSYCA+EKKLFLVP+PLLIG
Sbjct: 79  VKSFPWNRTLENFIQLILDLTLAVIKYLCAPLLAVSSLSEMSYCANEKKLFLVPIPLLIG 138

Query: 154 MAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGAL 213
           + ++ LL++  L+LSP+LKDAEV WHLIAI IFFT++KLPGPYYPYWGR+FIPHFANG L
Sbjct: 139 IVVAGLLKDAALELSPVLKDAEVAWHLIAIAIFFTLLKLPGPYYPYWGRIFIPHFANGGL 198

Query: 214 WRTLWFAFLWYRKPKKASEATPSNDSVNTGLSEPN 248
            RTLW  FLWYR+P+   E      S+N   S PN
Sbjct: 199 LRTLWSVFLWYRRPR---ETLLQRKSLNGSGSVPN 230


>gi|224123264|ref|XP_002330273.1| predicted protein [Populus trichocarpa]
 gi|222871308|gb|EEF08439.1| predicted protein [Populus trichocarpa]
          Length = 231

 Score =  283 bits (724), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 149/228 (65%), Positives = 174/228 (76%), Gaps = 8/228 (3%)

Query: 2   GTNPYQSHLVSASTHLHRSSI---LPLH-----GLKQVLPYFRASKLRLGANRNGPLWNA 53
            T  Y SHL+SAS HL  ++      LH     G ++   Y +A    LG   +G     
Sbjct: 3   ATITYPSHLLSASFHLRTTTTKNAFRLHSPSTQGPERSPHYLKAGSSHLGVKLSGVFQKT 62

Query: 54  KGNICCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDL 113
           + NI CA+NMAAGQSGDPEK+N D L+N+ RK+WD SPQPVKSFPW RALENFIQLILDL
Sbjct: 63  RLNISCAINMAAGQSGDPEKINFDHLMNKARKIWDRSPQPVKSFPWKRALENFIQLILDL 122

Query: 114 FMAVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKD 173
           F+ VVKYL VPLLAVSSLSEMSYCAH+KKL LVP+PLLIG+ ++ +++ET L+LSPLLKD
Sbjct: 123 FVMVVKYLCVPLLAVSSLSEMSYCAHQKKLLLVPIPLLIGIIVAGVVKETALELSPLLKD 182

Query: 174 AEVPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAF 221
           AEVPWHLI   I FT+IKLPGPYYPYWGR+FIPHFANG LWRTL FAF
Sbjct: 183 AEVPWHLIVTAIVFTLIKLPGPYYPYWGRIFIPHFANGVLWRTLLFAF 230


>gi|356575152|ref|XP_003555706.1| PREDICTED: uncharacterized protein LOC100803646 [Glycine max]
          Length = 240

 Score =  272 bits (696), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 133/231 (57%), Positives = 165/231 (71%), Gaps = 10/231 (4%)

Query: 8   SHLVSASTHLHRSSIL----------PLHGLKQVLPYFRASKLRLGANRNGPLWNAKGNI 57
           SHL+SA+T   R  +L          P  G K    +F      L             NI
Sbjct: 2   SHLLSAATTFSRFQLLTLNNTFTTQPPFLGFKGTSCFFEKHSCCLSVKPFCASQKRGANI 61

Query: 58  CCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAV 117
            C+MNM+A QS D  K+ LDQL+++V+KLWD+SP+PVK FP N+AL+NFIQLILDL + V
Sbjct: 62  FCSMNMSAQQSDDHGKMKLDQLIDKVQKLWDSSPEPVKKFPHNKALDNFIQLILDLILVV 121

Query: 118 VKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVP 177
           VKYL+VP+ AV+SLSEMSYCAHE+KL  VP+P+L G+A++ +L+ET L LSP L+D+EVP
Sbjct: 122 VKYLAVPVFAVTSLSEMSYCAHERKLVFVPLPVLFGVAVAGILKETALQLSPRLRDSEVP 181

Query: 178 WHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPK 228
           WHLIAI IFFT+IKLPGPYYPYW R+ IPHFANG L RTL FA LWYR+PK
Sbjct: 182 WHLIAIAIFFTLIKLPGPYYPYWVRIIIPHFANGVLLRTLLFAILWYRRPK 232


>gi|449495587|ref|XP_004159887.1| PREDICTED: uncharacterized protein LOC101228120 [Cucumis sativus]
          Length = 243

 Score =  267 bits (683), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 129/234 (55%), Positives = 166/234 (70%), Gaps = 2/234 (0%)

Query: 8   SHLVSASTHLHRSSILPLHGLKQVLPYFRASKLR--LGANRNGPLWNAKGNICCAMNMAA 65
           +HL+S  TH   S  L LH    + P  R+ +    +    +    NA   + CAMNM A
Sbjct: 3   THLLSPPTHFQTSMKLALHAPFLLAPSRRSPECSHVVSLKPHKRFQNAGSVVSCAMNMTA 62

Query: 66  GQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPL 125
           GQS D  ++N D L NRV++LW++SP+PVKSFPWN AL+NFIQLI DL + V+KYL VPL
Sbjct: 63  GQSDDSGRVNWDSLKNRVKQLWESSPEPVKSFPWNEALDNFIQLIADLILTVIKYLFVPL 122

Query: 126 LAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGI 185
           L V+SLSEMSYCAHEKKL +VP   +IG ++++++R+T L LSP+LKD EVPWHLI I I
Sbjct: 123 LLVTSLSEMSYCAHEKKLLIVPFLFIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAI 182

Query: 186 FFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATPSNDS 239
           FFT+IKLPGP YPYWGR+F PH ANG L RT+W  FLW+R+P+K S     N++
Sbjct: 183 FFTLIKLPGPSYPYWGRIFFPHLANGGLVRTIWSMFLWFRRPRKTSMLLKHNEN 236


>gi|449438691|ref|XP_004137121.1| PREDICTED: uncharacterized protein LOC101221519 [Cucumis sativus]
          Length = 251

 Score =  264 bits (675), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 128/226 (56%), Positives = 161/226 (71%), Gaps = 2/226 (0%)

Query: 8   SHLVSASTHLHRSSILPLHGLKQVLPYFRASKLR--LGANRNGPLWNAKGNICCAMNMAA 65
           +HL+S  TH   S  L LH    + P  R+ +    +    +    NA   + CA NM A
Sbjct: 3   THLLSPPTHFQTSMKLALHAPFLLAPSRRSPECSHVVSLKPHKRFQNAGSVVSCATNMTA 62

Query: 66  GQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPL 125
           GQS D  ++N D L NRV++LW++SP+PVKSFPWN AL+NFIQLI DL + V+KYL VPL
Sbjct: 63  GQSDDSGRVNWDSLKNRVKQLWESSPEPVKSFPWNEALDNFIQLIADLILTVIKYLFVPL 122

Query: 126 LAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGI 185
           L V+SLSEMSYCAHEKKL +VP   +IG + ++++R+T L LSP+LKD EVPWHLI I I
Sbjct: 123 LLVTSLSEMSYCAHEKKLLIVPFLFIIGFSAAEVMRQTALSLSPILKDLEVPWHLITIAI 182

Query: 186 FFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKAS 231
           FFT+IKLPGP YPYWGR+FIPH ANG L RT W  FLW+R+P+K S
Sbjct: 183 FFTLIKLPGPSYPYWGRIFIPHLANGGLVRTTWSMFLWFRRPQKTS 228


>gi|297847266|ref|XP_002891514.1| EMB1273 [Arabidopsis lyrata subsp. lyrata]
 gi|297337356|gb|EFH67773.1| EMB1273 [Arabidopsis lyrata subsp. lyrata]
          Length = 237

 Score =  263 bits (673), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 120/187 (64%), Positives = 152/187 (81%), Gaps = 1/187 (0%)

Query: 57  ICCAMNMAAGQSGD-PEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFM 115
           +C A++M +G SG+ P+KLN D L+ + + +WDNSPQPVK FPWNRA ENFIQL+LDL +
Sbjct: 50  LCRAIHMESGHSGEQPKKLNFDNLLRKTKHVWDNSPQPVKEFPWNRAFENFIQLVLDLAI 109

Query: 116 AVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAE 175
           +VVK+L VP+LAVSS+SEMSYCAHE+KL LVP PL+IGM ++ +L+ET L +SP LK+AE
Sbjct: 110 SVVKFLFVPILAVSSISEMSYCAHERKLALVPFPLVIGMVVAGILQETALKISPRLKEAE 169

Query: 176 VPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATP 235
           VPWHLIA+ +FFT+IKLPGPYYPYWGRLF+PHFANG L+R LW  F WY+K +  S  T 
Sbjct: 170 VPWHLIAMMMFFTLIKLPGPYYPYWGRLFVPHFANGVLFRALWSMFFWYKKTRNTSGNTL 229

Query: 236 SNDSVNT 242
            N S+ T
Sbjct: 230 QNHSLET 236


>gi|356521213|ref|XP_003529251.1| PREDICTED: uncharacterized protein LOC100787804 [Glycine max]
          Length = 245

 Score =  258 bits (660), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 118/173 (68%), Positives = 146/173 (84%)

Query: 56  NICCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFM 115
            I C+MNM+A QS D  K+ LDQL+++V+KLWD SP+PVK FP N+AL+NFIQLILDL +
Sbjct: 61  KIFCSMNMSAHQSDDHGKMKLDQLIDKVQKLWDISPEPVKKFPHNKALDNFIQLILDLIL 120

Query: 116 AVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAE 175
            VVKYL+VP+ AV+SLSEMSYCAHE+KL  VP+P+L G+A++ +L+ET L LSP L+DAE
Sbjct: 121 VVVKYLAVPVFAVTSLSEMSYCAHERKLVFVPLPVLFGVAVAGILKETALQLSPRLRDAE 180

Query: 176 VPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPK 228
           +PWHLIAI IFFT+IKLPGPYYPYW R+ IPHFANG L RTL FA LW+R+PK
Sbjct: 181 IPWHLIAIAIFFTLIKLPGPYYPYWVRIIIPHFANGVLLRTLLFAILWHRRPK 233


>gi|30694575|ref|NP_175374.2| protein embryo defective 1273 [Arabidopsis thaliana]
 gi|5430750|gb|AAD43150.1|AC007504_5 Unknown Protein [Arabidopsis thaliana]
 gi|29028826|gb|AAO64792.1| At1g49510 [Arabidopsis thaliana]
 gi|110742970|dbj|BAE99379.1| hypothetical protein [Arabidopsis thaliana]
 gi|332194316|gb|AEE32437.1| protein embryo defective 1273 [Arabidopsis thaliana]
          Length = 240

 Score =  258 bits (658), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 131/237 (55%), Positives = 168/237 (70%), Gaps = 3/237 (1%)

Query: 8   SHLVSASTHLHRSSILPLHGLKQ-VLPYFRASKLRLGANRNGPLWNAKGNICCAMNMAAG 66
           SHL  +S+    S  L LH   Q  L Y + +K     + N    + +  +C A++M +G
Sbjct: 4   SHLFLSSSP-QSSLALRLHSTTQFTLSYSKNNKDCSFQSANEAKVSKRSLLCRAIHMESG 62

Query: 67  QSGD-PEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPL 125
            SG+ P+KLN D L+ R + +WDNSPQPVK FPWNRA  NFIQL+LDL ++VVK+L VP+
Sbjct: 63  HSGEQPKKLNFDNLLRRTKHVWDNSPQPVKEFPWNRAFGNFIQLVLDLAISVVKFLFVPI 122

Query: 126 LAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGI 185
           LAVSS+SEMSYCAHE+KL LVP PL+IGM ++ +L+ET L +SP LK+AEVPWHLIA+ +
Sbjct: 123 LAVSSISEMSYCAHERKLALVPFPLVIGMVVAGVLQETALKISPRLKEAEVPWHLIAMMM 182

Query: 186 FFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATPSNDSVNT 242
           FFT+IKLPGPYYPYWGRL +PHFANG L R LW  F WY+K +  S     N S+ T
Sbjct: 183 FFTLIKLPGPYYPYWGRLLVPHFANGVLLRALWSMFFWYKKTRNTSGNPLQNHSLET 239


>gi|255637826|gb|ACU19233.1| unknown [Glycine max]
          Length = 245

 Score =  256 bits (653), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 117/173 (67%), Positives = 145/173 (83%)

Query: 56  NICCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFM 115
            I C+MNM+A QS D  K+ LDQL+++V+KLWD SP+PVK FP N+AL+NFIQLILDL +
Sbjct: 61  KIFCSMNMSAHQSDDHGKMKLDQLIDKVQKLWDISPEPVKKFPHNKALDNFIQLILDLIL 120

Query: 116 AVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAE 175
            VVKYL+VP+ AV+SLSEMSYCAHE+KL  VP+P+L G+A++ +L+ET L L P L+DAE
Sbjct: 121 VVVKYLAVPVFAVTSLSEMSYCAHERKLVFVPLPVLFGVAVAGILKETALQLFPRLRDAE 180

Query: 176 VPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPK 228
           +PWHLIAI IFFT+IKLPGPYYPYW R+ IPHFANG L RTL FA LW+R+PK
Sbjct: 181 IPWHLIAIAIFFTLIKLPGPYYPYWVRIIIPHFANGVLLRTLLFAILWHRRPK 233


>gi|148907624|gb|ABR16941.1| unknown [Picea sitchensis]
          Length = 247

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 93/212 (43%), Positives = 131/212 (61%), Gaps = 6/212 (2%)

Query: 32  LPYFRASKLRLGANRNGPLWNAKGNICCAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSP 91
           L YFR +      ++     + K  +   MNM  GQSG P+K+     + +VR LW+  P
Sbjct: 38  LAYFRKASSVSSPSQTVVYRSFKRVVPHVMNMTRGQSGGPDKIR-HVYLEKVRNLWERCP 96

Query: 92  QPVKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLL 151
           +P+K FPW+R    F   +  L   V K+L +P+L VSSLSEM YC  + K  L+P+ +L
Sbjct: 97  EPIKVFPWDRVYNKFFHHLFRLAFEVAKWLFIPVLFVSSLSEMVYCGWQNKELLIPIGML 156

Query: 152 IGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANG 211
            G+ ++ +L+ET ++LS  L++   PWHL  IG FF +IK PGPYYPYWGR+F+PHFANG
Sbjct: 157 AGIYLAGILKETAVELSENLQEGGFPWHLAIIGAFFALIKFPGPYYPYWGRIFLPHFANG 216

Query: 212 ALWRTLWFAFLWYRKPKKASEATPSNDSVNTG 243
            LWRTLWFA +WY+   K +E     + +N G
Sbjct: 217 GLWRTLWFAHIWYKSQFKTAE-----EEINLG 243


>gi|116781889|gb|ABK22286.1| unknown [Picea sitchensis]
          Length = 259

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 68/165 (41%), Positives = 101/165 (61%), Gaps = 6/165 (3%)

Query: 85  KLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSEMSYCAHEKKLF 144
           KLW+  P+PV+ FPW +A E  I  +    + V K L +P+LAV+ L E SYC  + K  
Sbjct: 93  KLWEVCPEPVRKFPWQKAGERVIHHLFQPILGVAKRLLIPILAVTFLMEASYCIAQNKEL 152

Query: 145 LVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLPGPYYPYWGRLF 204
            +P+ LL G   +++L+ET L+LS  LK+   PWHL+A+ + F++IK  GP+YPYWG + 
Sbjct: 153 FIPIGLLAGSVFAEILKETALELSQNLKEGGFPWHLLAMALIFSLIKFLGPFYPYWGGIL 212

Query: 205 IPHFANGALWRTLWFAFLWYRKPKKASEA------TPSNDSVNTG 243
           +PHFANG LW+T+W    W++   + +E       T S   + TG
Sbjct: 213 LPHFANGGLWQTIWLTRNWHKNQSQTTEVERNGENTISESDLKTG 257


>gi|346465215|gb|AEO32452.1| hypothetical protein [Amblyomma maculatum]
          Length = 194

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/181 (41%), Positives = 113/181 (62%), Gaps = 1/181 (0%)

Query: 15  THLHRSSILPLHGLKQVLPYFRASKLRLGANRNGPLWNAKGNICCAMNMAAGQSGDP-EK 73
           +H    S  P   L +V P    S L        P    +  +  A+N+ +    DP  K
Sbjct: 14  SHASLQSTSPSSFLHKVGPAIPFSTLSQPKPTTNPHRRRRLLLPLAINLTSSNPDDPLGK 73

Query: 74  LNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSE 133
               ++++  R L DN PQPVKSFPW++ L++F+ L++DL + VVK+LSVP+LAVSSLSE
Sbjct: 74  FLPSRVLDHARDLLDNLPQPVKSFPWDKTLQSFLGLVIDLAVTVVKWLSVPVLAVSSLSE 133

Query: 134 MSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLP 193
           MSYCAHE+K+ ++P+P L G A++ ++++T +DLS   K+   PWHL+ I  FF ++KLP
Sbjct: 134 MSYCAHERKMRVIPIPFLTGFALAGVMKDTAVDLSEEFKEGGFPWHLLLIASFFMLLKLP 193

Query: 194 G 194
           G
Sbjct: 194 G 194


>gi|168001974|ref|XP_001753689.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695096|gb|EDQ81441.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 244

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 67/213 (31%), Positives = 120/213 (56%), Gaps = 4/213 (1%)

Query: 27  GLKQVLPYFRASKLRLGA-NRNGPLWNAKGNICCAMNMAAGQSGDPEKLNLDQLVNRVRK 85
           G+ Q   + R  +++L   ++   L N +  + C  +     S + E+    Q  N +  
Sbjct: 34  GVNQPSVFLRGQRVQLQRLSKPLHLLNLRTPVQCVSSTFGSSSNNGEE---TQENNLLAN 90

Query: 86  LWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSEMSYCAHEKKLFL 145
           +   +P+P++ FPW +A   F+Q +L  F  V KYL++P+LA S LSE+SY   ++K+ +
Sbjct: 91  IMAKAPEPIRVFPWGKAGNLFVQRLLHQFWTVGKYLAIPVLAASMLSELSYTLLQEKVLI 150

Query: 146 VPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLPGPYYPYWGRLFI 205
           +P  ++ G A + +++ET ++L+  ++  + PWHL+A+G  F  +K  GPY P WGR+ +
Sbjct: 151 IPAGMIGGFAFAGMMKETAVELAADIEVGQFPWHLVALGFMFGALKFIGPYLPAWGRVSV 210

Query: 206 PHFANGALWRTLWFAFLWYRKPKKASEATPSND 238
           PHFANG LW  +     W R+  + ++   S +
Sbjct: 211 PHFANGGLWHVIKLVIDWRRQQAEKTKLKSSRN 243


>gi|168004986|ref|XP_001755192.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693785|gb|EDQ80136.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 147

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 90/129 (69%)

Query: 88  DNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSEMSYCAHEKKLFLVP 147
           D + +P++SFPW +A   F + +LD    V K+L++PLLA+S ++E +Y   ++K+ ++P
Sbjct: 3   DKASEPIRSFPWAKASNLFFKRLLDQLWIVGKWLAIPLLALSVVNEFTYTLVQEKVLIIP 62

Query: 148 VPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLPGPYYPYWGRLFIPH 207
           V ++ G+  S + RE +L+LSP ++  ++PWHLIA+GI F V+KL GP  P W ++ +PH
Sbjct: 63  VAMICGIVFSGITREVVLELSPSIEGRQLPWHLIALGILFVVLKLIGPVLPIWAQVSVPH 122

Query: 208 FANGALWRT 216
           FANG LW+ 
Sbjct: 123 FANGGLWQV 131


>gi|255537942|ref|XP_002510036.1| conserved hypothetical protein [Ricinus communis]
 gi|223550737|gb|EEF52223.1| conserved hypothetical protein [Ricinus communis]
          Length = 218

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 105/193 (54%), Gaps = 9/193 (4%)

Query: 59  CAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVV 118
           C M +   Q G+P K+ L QL +   KLW+ +P PVK FPW  A +  ++ +L +   V+
Sbjct: 29  CFMKLTKAQFGEPNKVRL-QLSSAKEKLWEATPNPVKKFPWREAGDMLLKRLLFIGQKVL 87

Query: 119 KYLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPW 178
           K+    +  +S +S++ +     +  ++P+ LL+G  ++D L+ET+ +L    +D E+ +
Sbjct: 88  KWSLAVIFVLSCISDIIFSISRNQELMIPIGLLVGCLMTDFLKETLQELFQASEDKELNF 147

Query: 179 HLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASE-----A 233
            L+ I  FF  +K+   Y+    ++ + H ANG L + +W   LW    K+  E     +
Sbjct: 148 PLVIICCFFVFVKVMSAYFISRPQMLLSHVANGGLLQAVW---LWRYLLKENGEHNIENS 204

Query: 234 TPSNDSVNTGLSE 246
           +P+ D+ +T ++E
Sbjct: 205 SPTKDARSTLVAE 217


>gi|356530798|ref|XP_003533967.1| PREDICTED: uncharacterized protein LOC100805526 [Glycine max]
          Length = 66

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 44/66 (66%), Positives = 56/66 (84%)

Query: 134 MSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLP 193
           MSYCAHE+KL  VP+ +  G+A++ +L+ET L LSP L+D+EVPW+LIAI IFFT+IKLP
Sbjct: 1   MSYCAHERKLVFVPLTVPFGVAVARILKETALQLSPRLRDSEVPWNLIAIAIFFTLIKLP 60

Query: 194 GPYYPY 199
           GPYYPY
Sbjct: 61  GPYYPY 66


>gi|359493007|ref|XP_002283527.2| PREDICTED: uncharacterized protein LOC100262671 [Vitis vinifera]
 gi|302142146|emb|CBI19349.3| unnamed protein product [Vitis vinifera]
          Length = 232

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 81/153 (52%), Gaps = 1/153 (0%)

Query: 66  GQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPL 125
            Q  +  K+ +   + R ++LW+ +P  +K FPW +A +  +Q +L L    +K+  + L
Sbjct: 50  AQFSEQNKVKMQMSILR-KRLWEAAPDSLKDFPWKKAEDTVLQQLLLLGQKALKWSFITL 108

Query: 126 LAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGI 185
             +SS+S++ +C    K  ++P  L +G  ++D L+ET  +L P  +   +   L+ IG 
Sbjct: 109 FILSSVSDVIFCISRNKELMIPFGLFVGCMMADFLKETSRELFPNSEKTGLKQPLLGIGC 168

Query: 186 FFTVIKLPGPYYPYWGRLFIPHFANGALWRTLW 218
           FF  +K    +    G+ F+ H ANG L + LW
Sbjct: 169 FFVFVKFMSTFLTSQGQAFLLHVANGGLMQILW 201


>gi|356552300|ref|XP_003544506.1| PREDICTED: uncharacterized protein LOC100806012 [Glycine max]
          Length = 227

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 84/174 (48%), Gaps = 1/174 (0%)

Query: 60  AMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVK 119
           ++ ++     +P ++ +   + R  KL +  P  V+ FPW +A +  +  +L+L    VK
Sbjct: 46  SIKVSMADHNEPSEVKMQIGIMR-EKLKEAMPVSVQEFPWKKAEQILLDRLLNLAQETVK 104

Query: 120 YLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWH 179
           +  V     SS+S++ Y     +  ++P+ L +G+ ++  L+E   DL    ++ ++ WH
Sbjct: 105 WSLVLFFIFSSVSDVVYTFSINRELVIPIGLFVGLLVAGFLKEISQDLFHRPEEKDLKWH 164

Query: 180 LIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEA 233
           L+ +   F   K    ++    R+F+ H ANG L + LW +  +    K   E 
Sbjct: 165 LLGMYCIFVFAKFMSTWFATLPRVFLLHVANGGLIQALWHSTNFMEDEKNKQET 218


>gi|255637608|gb|ACU19129.1| unknown [Glycine max]
          Length = 225

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/177 (23%), Positives = 83/177 (46%), Gaps = 1/177 (0%)

Query: 60  AMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVK 119
           ++ ++     +P ++N+   + R  KL +  P  V+ FPW +A    +  +L+L    VK
Sbjct: 44  SIKVSMADQNEPSEVNMQIGIMR-EKLKETLPVSVQEFPWKKAEHILLDRLLNLAQETVK 102

Query: 120 YLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWH 179
           +  V     SS+S++ Y     +  ++PV L +G   +D L+E   +L    ++ ++  H
Sbjct: 103 WSLVLFFIFSSVSDVVYTFSINRELVIPVGLFVGCLAADFLKEISQELFHKFEEKDLKRH 162

Query: 180 LIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATPS 236
            + +   F  +K    ++    ++F  H ANG L + LW   ++    K   E + S
Sbjct: 163 FLGVYCLFVFVKFMSTWFAALPQVFFLHVANGGLMQVLWHLRIFMEDGKNKQETSSS 219


>gi|356577694|ref|XP_003556959.1| PREDICTED: uncharacterized protein LOC100797104 [Glycine max]
          Length = 227

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 83/174 (47%), Gaps = 1/174 (0%)

Query: 60  AMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVK 119
           ++ ++     +P ++N+   + R  KL +  P  V+ FPW +A    +  +L+L    VK
Sbjct: 46  SIKVSMADQNEPSEVNMQIGIMR-EKLKETLPVSVQEFPWKKAEHILLDRLLNLAQETVK 104

Query: 120 YLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWH 179
           +  V     SS+S++ Y     +  ++PV L +G   +D L+E   +L    ++ ++  H
Sbjct: 105 WSLVLFFIFSSVSDVVYTFSINRELVIPVGLFVGCLAADFLKEISQELFHKSEEKDLKRH 164

Query: 180 LIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEA 233
           L+ +   F  +K    ++    ++F+ H ANG L + LW   ++    K   E 
Sbjct: 165 LLGMYCLFVFVKFMSTWFAALPQVFLLHVANGGLMQVLWHLRIFMEDGKNKQET 218


>gi|388513167|gb|AFK44645.1| unknown [Medicago truncatula]
          Length = 226

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 80/160 (50%), Gaps = 1/160 (0%)

Query: 60  AMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVK 119
           ++  +     +P ++ + Q+     KL +  P PV+ FPW +A    +  +  L    +K
Sbjct: 46  SIKASVADHNEPNEVKI-QIGIMKEKLKEALPSPVQEFPWRKAQHTLLDRLHLLAQEALK 104

Query: 120 YLSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWH 179
           + SV     SSLS++ Y     +  ++PV L +G+ ++D L+E  L+L    ++ ++ W 
Sbjct: 105 WSSVVYFISSSLSDVIYTFSINRELIIPVGLFVGVLVADFLKEISLELFHQSEENDLKWC 164

Query: 180 LIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWF 219
            + +   F ++K    ++     +F+ H ANG L + LW+
Sbjct: 165 RLGLCGIFVLVKFMSTWFATLPGVFLLHVANGGLMQLLWY 204


>gi|449447093|ref|XP_004141304.1| PREDICTED: uncharacterized protein LOC101205191 [Cucumis sativus]
 gi|449509249|ref|XP_004163535.1| PREDICTED: uncharacterized LOC101205191 [Cucumis sativus]
          Length = 225

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 82/166 (49%), Gaps = 5/166 (3%)

Query: 78  QLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKYLSVPLLAVSSLSEMSYC 137
           ++ N   K+ +  P+PVK FPW  A +  ++ +  +    +K+  +    +SS S++   
Sbjct: 61  EIENLKEKMQEVVPEPVKIFPWKEAEKIVVERLAFMGKETLKWSLLLFFVLSSCSDIVAS 120

Query: 138 AHEKKLFLVPVPLLIGMAISDLLRETILDLSPLLKDAEVPWHLIAIGIFFTVIKLPGPYY 197
               +  L+P+ L IG+ ++DLL+E   ++    +++     L   G FF ++KL    +
Sbjct: 121 IVRNQELLIPIGLFIGVLLTDLLKEISQEVFGNSEESSFKKQLYGFGSFFILVKLIVYGF 180

Query: 198 PYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATPSNDSVNTG 243
                +F  H ANG L + LW   LW   P++ ++  P+  S+  G
Sbjct: 181 AIQAPVFPLHVANGGLMQVLW---LWRNLPRERNQ--PNEQSLFVG 221


>gi|224084516|ref|XP_002307322.1| predicted protein [Populus trichocarpa]
 gi|222856771|gb|EEE94318.1| predicted protein [Populus trichocarpa]
          Length = 176

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 50/181 (27%), Positives = 85/181 (46%), Gaps = 12/181 (6%)

Query: 61  MNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLFMAVVKY 120
           M +     G+P K+ L QL     +LW   P+  K FPW +A +  ++ +L +     K+
Sbjct: 1   MKVTKAHFGEPSKVKL-QLNIVKERLWHAIPESAKEFPWRKAEDLLLKRLLLVGQKAFKW 59

Query: 121 LSVPLLAVSSLSEMSYCAHEKKLFLVPVPLLIGMAISDLLRETILDL--SPLLKDAEVPW 178
             V L  +S L+++ Y        ++P  LL G  + D L+E  +++  +P  +   +P 
Sbjct: 60  SLVILFLLSFLADVMYSISRNGELMIPFGLLAGCLMMDFLKEISMEVFQAPEKRGLNLP- 118

Query: 179 HLIAIGIFFTVIKLPGPYYPYWGRLFIPHFANGALWRTLWFAFLWYRKPKKASEATPSND 238
            L+ I   F ++K+   Y+      F+ H ANG L + LW   LW    K+ S+    N 
Sbjct: 119 -LLGISCCFILVKVMSTYFTP----FLLHVANGGLMQVLW---LWGDSLKENSQCNEENI 170

Query: 239 S 239
           S
Sbjct: 171 S 171


>gi|119388359|gb|ABL74120.1| envelope glycoprotein [Human immunodeficiency virus 1]
          Length = 217

 Score = 36.6 bits (83), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 30/116 (25%), Positives = 50/116 (43%), Gaps = 19/116 (16%)

Query: 2   GTNPYQSHL---VSASTHLHRSSILPLHGLKQVLPYFRASKLRLGANRNGPLWNAKGNIC 58
           G    QS+L   + A  HL +   L + G+KQ+     A +  L   +   +W   G + 
Sbjct: 36  GIVQQQSNLLRAIEAQQHLLK---LTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLI 92

Query: 59  CAMNMAAGQSGDPEKLNLDQLVNRVRKLWDNSPQPVKSFPWNRALENFIQLILDLF 114
           C  N+    S   + LN         ++WDN    +    W++ +EN+ Q+I DL 
Sbjct: 93  CPTNVPWNSSWSNKSLN---------EIWDN----MTWLQWDKEIENYTQIIYDLI 135


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.138    0.441 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,186,677,727
Number of Sequences: 23463169
Number of extensions: 174118145
Number of successful extensions: 334810
Number of sequences better than 100.0: 29
Number of HSP's better than 100.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 334779
Number of HSP's gapped (non-prelim): 29
length of query: 250
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 111
effective length of database: 9,097,814,876
effective search space: 1009857451236
effective search space used: 1009857451236
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)