BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024174
         (271 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255557311|ref|XP_002519686.1| conserved hypothetical protein [Ricinus communis]
 gi|223541103|gb|EEF42659.1| conserved hypothetical protein [Ricinus communis]
          Length = 284

 Score =  226 bits (577), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 134/249 (53%), Positives = 175/249 (70%), Gaps = 12/249 (4%)

Query: 25  SGHSSIWISLSKGSTRRPFGRVFSPVICAHDGSST-PSEND---NKIVKGAVGASLVLTC 80
           S H+S   SL+  +T  P  ++ +P I A DGSS+ PS ND   NKI K    AS+ L C
Sbjct: 36  SNHTSP--SLTSRNTCHPGRKLLNPTIRALDGSSSVPSGNDRKANKIAKNTARASIALAC 93

Query: 81  ALGIMSFSSRMNPKAIAAGPMEMYQKAPRM-SVLP-HPIGGRYALNSFLDVSVRLASSKA 138
            LGI+  S +MNP+AIA GP E +QKAP++ S  P +P+GGRYAL S LDV+V LAS+K 
Sbjct: 94  VLGIIGASLKMNPEAIA-GPREKFQKAPQIVSNSPAYPLGGRYALKSLLDVNVYLASTKI 152

Query: 139 EPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVE 198
           EP   P   +P  PS E+V  IK++AV+ +K GKPE AV+LL+  Y+  +++ EPAYNV 
Sbjct: 153 EP-PGPPSRLPARPSMENVYDIKSDAVRLIKCGKPEEAVSLLENSYKFYEHDTEPAYNVG 211

Query: 199 MALVEILIYQGKYREALE-CNCLKDEQRIPSDGRFPFYKAIIYTMLNM-EEAKKWWEEFA 256
           MAL+EILI QGKY+EAL+ CN    EQ +PSDGR P YKAIIYTML+  +EAKKWWE++ 
Sbjct: 212 MALIEILICQGKYKEALQYCNRFTGEQFLPSDGRIPLYKAIIYTMLDQRDEAKKWWEKYV 271

Query: 257 ETIDDEEFD 265
           ET+D ++FD
Sbjct: 272 ETVDADQFD 280


>gi|224078980|ref|XP_002305705.1| predicted protein [Populus trichocarpa]
 gi|222848669|gb|EEE86216.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 131/253 (51%), Positives = 158/253 (62%), Gaps = 16/253 (6%)

Query: 16  RVKPVTKLGSGHSSIWISLSKGSTRRPFGRVFSPVICAH-DGSSTPSEN----DNKIVKG 70
           R KP   L S   S    L+   TR   G +  PVICA  D S TPS N    DNKIVK 
Sbjct: 29  RTKPHLSLVSSTPS----LTSCYTRHKLGHLIRPVICASSDRSPTPSRNENNCDNKIVKA 84

Query: 71  AVGASLVLTCALGIMSFSSRMNPKAIAAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVS 130
           AVGAS+ L CALGI+  + RM PKAIA GP E+YQKAP++   P  +  + AL SFLDV+
Sbjct: 85  AVGASVALACALGIIGGNFRMYPKAIA-GPRELYQKAPQVEEHPSSLA-KLALESFLDVT 142

Query: 131 VRLASSKA-EPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKN 189
             LAS+ A  P     +  PP PS E V  IK  AV+ M YG+ E AV  L++ +E  KN
Sbjct: 143 SDLASTDAVSPI--ATFDPPPNPSIEQVKEIKKHAVRLMIYGEAEEAVWYLQEAFEKYKN 200

Query: 190 EPEPAYNVEMALVEILIYQGKYREALECNCLK-DEQRIPSDGRFPFYKAIIYTMLNM-EE 247
           +PEPAYNVEMALVEILIYQ +Y  AL C+CL  D+Q  PSD R   YKAIIYTML+  EE
Sbjct: 201 DPEPAYNVEMALVEILIYQHEYERALNCDCLNHDDQLGPSDARVFLYKAIIYTMLDFNEE 260

Query: 248 AKKWWEEFAETID 260
           A+ WWE +   ++
Sbjct: 261 ARIWWERYMYAVE 273


>gi|297823237|ref|XP_002879501.1| binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325340|gb|EFH55760.1| binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 99/240 (41%), Positives = 132/240 (55%), Gaps = 24/240 (10%)

Query: 46  VFSPVICAHDGSSTPSENDNKI---VKGAVGASLVLTCALGIMSFSSR------------ 90
           + SP+  A   S +   +D K+   +K A GASL L CAL I  F  +            
Sbjct: 51  IRSPITVAKSNSKSDYHDDKKLFTPLKMAAGASLALACALSIFGFKIKNMSYTAAAAANP 110

Query: 91  -MNPKAIAAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYWP--RYT 147
                 I+  P         M    +P+  ++AL S  +VS  LAS+K  P   P   + 
Sbjct: 111 SAADMIISGKPTSAVSATSGM----YPLPAKFALRSLFEVSSMLASAKPIPSQRPFNLHK 166

Query: 148 VPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIY 207
           +P  PS ED+++IK EAV++MK GK E AV LL+      KNEPE  +NV+MALVEILI 
Sbjct: 167 LPSLPSKEDIDSIKMEAVRKMKEGKCEEAVQLLRDANMRYKNEPEADFNVQMALVEILIL 226

Query: 208 QGKYREALECNCLKDEQRIPSDGRFPFYKAIIYTMLNME-EAKKWWEEFAETIDDEEFDP 266
             +Y+EA E +CL DE    SD R P YKAIIYTML+ + EAK+ W+EF ++I  E FDP
Sbjct: 227 LERYQEAAEYSCLNDENAQISDVRIPLYKAIIYTMLDKDTEAKQCWKEFRKSI-GEGFDP 285


>gi|79324323|ref|NP_001031482.1| uncharacterized protein [Arabidopsis thaliana]
 gi|3128221|gb|AAC26701.1| hypothetical protein [Arabidopsis thaliana]
 gi|330253894|gb|AEC08988.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 288

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/273 (39%), Positives = 150/273 (54%), Gaps = 30/273 (10%)

Query: 19  PVTKLGSGH------SSIWISL--SKGSTRRPFGRVFSPVICAHDGSSTPSENDNKIV-- 68
           P++   S H       +I+I    + G +RR   R  SP+  A   S   S +D K +  
Sbjct: 16  PLSHFTSTHKPTLRFQNIYIGFPSNGGQSRRLIIR--SPITVAKSNSKNDSHDDKKQLRP 73

Query: 69  -KGAVGASLVLTCALGIMSFSSRMNPKAI--------AAGPMEMYQK---APRMSVLPHP 116
            K A GASL L CA+ I  F  ++N  +         ++  M +  K   A   +   +P
Sbjct: 74  LKMAAGASLALACAISIFGF--KINNMSYTAAAAAYPSSADMIISGKPTSAASATSGIYP 131

Query: 117 IGGRYALNSFLDVSVRLASSKAEPFYWP--RYTVPPGPSAEDVNAIKAEAVKQMKYGKPE 174
              ++AL S  +VS  LAS+K  P   P   + +P  PS +D+++IK EAV++MK GK E
Sbjct: 132 FPAKFALRSLFEVSSMLASAKPIPSQKPFNLHKLPSLPSKDDIDSIKMEAVRKMKEGKCE 191

Query: 175 FAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPF 234
            AV LL+      +NEPE  +NV+MALVEILI   +Y+EA E +CL DE    SD R P 
Sbjct: 192 EAVQLLRDANMRYRNEPEANFNVQMALVEILILLERYQEAAEYSCLNDENAQISDVRIPL 251

Query: 235 YKAIIYTMLNME-EAKKWWEEFAETIDDEEFDP 266
           YKAIIYTML+ + EAK+ W+EF ++I  E FDP
Sbjct: 252 YKAIIYTMLDKDTEAKQCWKEFRKSI-GEGFDP 283


>gi|41059781|gb|AAR99365.1| hypothetical protein At2g34540 [Arabidopsis thaliana]
 gi|55740593|gb|AAV63889.1| hypothetical protein [Arabidopsis thaliana]
          Length = 213

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 124/210 (59%), Gaps = 17/210 (8%)

Query: 71  AVGASLVLTCALGIMSFSSRMNPKAI--------AAGPMEMYQK---APRMSVLPHPIGG 119
           A GASL L CA+ I  F  ++N  +         ++  M +  K   A   +   +P   
Sbjct: 2   AAGASLALACAISIFGF--KINNMSYTAAAAAYPSSADMIISGKPTSAASATSGIYPFPA 59

Query: 120 RYALNSFLDVSVRLASSKAEPFYWP--RYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAV 177
           ++AL S  +VS  LAS+K  P   P   + +P  PS +D+++IK EAV++MK GK E AV
Sbjct: 60  KFALRSLFEVSSMLASAKPIPSQKPFNLHKLPSLPSKDDIDSIKMEAVRKMKEGKCEEAV 119

Query: 178 TLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPFYKA 237
            LL+      +NEPE  +NV+MALVEILI   +Y+EA E +CL DE    SD R P YKA
Sbjct: 120 QLLRDANMRYRNEPEANFNVQMALVEILILLERYQEAAEYSCLNDENAQISDVRIPLYKA 179

Query: 238 IIYTMLNME-EAKKWWEEFAETIDDEEFDP 266
           IIYTML+ + EAK+ W+EF ++I  E FDP
Sbjct: 180 IIYTMLDKDTEAKQCWKEFRKSI-GEGFDP 208


>gi|30686104|ref|NP_181001.2| uncharacterized protein [Arabidopsis thaliana]
 gi|27754469|gb|AAO22682.1| unknown protein [Arabidopsis thaliana]
 gi|330253893|gb|AEC08987.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 271

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/242 (36%), Positives = 126/242 (52%), Gaps = 28/242 (11%)

Query: 19  PVTKLGSGH------SSIWISL--SKGSTRRPFGRVFSPVICAHDGSSTPSENDNKIV-- 68
           P++   S H       +I+I    + G +RR   R  SP+  A   S   S +D K +  
Sbjct: 16  PLSHFTSTHKPTLRFQNIYIGFPSNGGQSRRLIIR--SPITVAKSNSKNDSHDDKKQLRP 73

Query: 69  -KGAVGASLVLTCALGIMSFSSRMNPKAI--------AAGPMEMYQK---APRMSVLPHP 116
            K A GASL L CA+ I  F  ++N  +         ++  M +  K   A   +   +P
Sbjct: 74  LKMAAGASLALACAISIFGF--KINNMSYTAAAAAYPSSADMIISGKPTSAASATSGIYP 131

Query: 117 IGGRYALNSFLDVSVRLASSKAEPFYWP--RYTVPPGPSAEDVNAIKAEAVKQMKYGKPE 174
              ++AL S  +VS  LAS+K  P   P   + +P  PS +D+++IK EAV++MK GK E
Sbjct: 132 FPAKFALRSLFEVSSMLASAKPIPSQKPFNLHKLPSLPSKDDIDSIKMEAVRKMKEGKCE 191

Query: 175 FAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPF 234
            AV LL+      +NEPE  +NV+MALVEILI   +Y+EA E +CL DE    SD R P 
Sbjct: 192 EAVQLLRDANMRYRNEPEANFNVQMALVEILILLERYQEAAEYSCLNDENAQISDVRIPL 251

Query: 235 YK 236
           YK
Sbjct: 252 YK 253


>gi|356512185|ref|XP_003524801.1| PREDICTED: uncharacterized protein LOC100793582 [Glycine max]
          Length = 290

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 135/302 (44%), Gaps = 59/302 (19%)

Query: 1   MESALCLNFCPS-------VAMRVKPVTKLGSGHSSIWISLSKGSTRRPFGRVFSPVICA 53
           MESA+CL  CP           R      L   H++        +TR    R F  V C 
Sbjct: 1   MESAICLRHCPFPYGSKGFQGARPSRFHHLKINHATPLSLPQISTTRSSVTRSFGHVNCM 60

Query: 54  HDG-SSTPSEND---NKIVKGAVGASLVLTCALGIMSFSSR-MNPKAIAAGPMEMYQKAP 108
            +   S PSEN+   N+I++G   ASLVLTC LG+ SFS + MNPK   A     Y    
Sbjct: 61  LERFISMPSENEGSGNRILRGVAAASLVLTCVLGLFSFSGKNMNPKLTTA-----YGNG- 114

Query: 109 RMSVLPHPIG---GRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAV 165
               L  PI    GR+AL S L++      +  +      ++V  GPS ++V  +K  A+
Sbjct: 115 ----LEFPISKPQGRFALKSLLEIPKDEELADTQNKMRSNFSV-RGPSQQEVKNLKWLAI 169

Query: 166 KQMKYGKPEFAVTLLKKVYEDCKNEP-EPAYNVEMALVEILIYQGK-------------- 210
             +    P+      K + E    E  EP  N+E+A+VE+L++QG+              
Sbjct: 170 G-LSSQSPDRRREAQKTLEEQYNCESGEPEQNLELAMVELLMFQGEFDKARDRLDDIIRK 228

Query: 211 ------------YREALECNCLKDEQRIPSDGRFPFYKAIIYTMLNME--EAKKWWEEFA 256
                       Y+ A++ N + ++ R+    R   YKAI++T+L  E  E ++WW  F 
Sbjct: 229 ISKVQPSGMSKLYKTAVQENNMYEKLRL---ARLILYKAIVHTVLGSEQKEREQWWNAFM 285

Query: 257 ET 258
           ET
Sbjct: 286 ET 287


>gi|225444553|ref|XP_002275959.1| PREDICTED: uncharacterized protein LOC100253775 [Vitis vinifera]
          Length = 244

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 70/251 (27%), Positives = 111/251 (44%), Gaps = 60/251 (23%)

Query: 27  HSSIWISLSKGSTRRPFGRVFSPVICAHDGSSTPSEN---DNKIVKGAVGASLVLTCALG 83
           HSS  + +S G   R  GR+ +P++CA DG STP  N   D+KIV+  +GA++ LTC LG
Sbjct: 38  HSSTSLPMSSG---RHIGRLLTPIVCAADGVSTPPTNNKGDDKIVRTGIGAAIALTCVLG 94

Query: 84  IMSFSSRMNPKAIAAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYW 143
           ++  S  +N KAI          +P++ V                      SS+ +  + 
Sbjct: 95  VIGCSYMINQKAIT---------SPKIFV--------------------TGSSRHKNIHQ 125

Query: 144 PRYTVPPGPSAEDVNAIKA-EAVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVEMALV 202
           P              A KA E ++ M  GK       LK ++++    P+ AYNV M L+
Sbjct: 126 P--------------AAKANELMELMDRGKHNTVSEKLKGMFQE---NPDEAYNVGMELI 168

Query: 203 EILIYQGKYREA------LECNCLKDEQRIPSDGRFPFYKAIIYTMLN-MEEAKKWWEEF 255
           +ILI + ++R+       LE    K        G    YK +++ M+  + EA+  W +F
Sbjct: 169 KILISRERFRDGVTISGQLEGLYGKSTSAPLKKGNLFLYKVLMHGMVGEIGEAELSWNKF 228

Query: 256 AETIDDEEFDP 266
              +      P
Sbjct: 229 KNLLGATSVVP 239


>gi|356528124|ref|XP_003532655.1| PREDICTED: uncharacterized protein LOC100775548 [Glycine max]
          Length = 278

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 80/240 (33%), Positives = 113/240 (47%), Gaps = 38/240 (15%)

Query: 1   MESALCLNFCPSVAMRVKPVTKLGSGHSSIWISLSKGS---------TRRPFGRVFSPVI 51
           MESA+CL +CP   +   P    G+  S   + ++  +         TR P  R F    
Sbjct: 1   MESAICLRYCP---IPYGPKGFHGANPSRFHLKINHATPLSLPQISTTRSPVTRSFGHAN 57

Query: 52  CAHDG-SSTPSEND---NKIVKGAVGASLVLTCALGIMSFSSR-MNPKAIA--AGPMEMY 104
           C  +  SS PSEN+   NKI++G   +SLVLTC LG+ SF  + MNPK     A P   Y
Sbjct: 58  CMLERFSSVPSENEGSGNKILRGVTLSSLVLTCVLGLFSFGGKNMNPKLTTAYASP---Y 114

Query: 105 QKAPRMSVLPHPIG--GRYALNSFLDV--SVRLASS--KAEPFYWPRYTVPPGPSAEDVN 158
            K    S+ P   G  G  AL S L++     LA +  K    +  R     GPS  +VN
Sbjct: 115 DKIA--SLFPRLDGAQGSVALKSLLEIPSDAELADTQKKRSSNFGER-----GPSQLEVN 167

Query: 159 AIKAEAVKQMKYGKPEFAVTLLKKVYEDCKNEP---EPAYNVEMALVEILIYQGKYREAL 215
            +K  A+   K      A+  L   Y+ CK      E    +E+A+VE+ +++GK+ EAL
Sbjct: 168 NLKWLAIGLSKSPDRSKALKTLTDQYKYCKAHKITGESEQYLELAMVELSMFKGKFEEAL 227


>gi|297740694|emb|CBI30876.3| unnamed protein product [Vitis vinifera]
          Length = 200

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 106/240 (44%), Gaps = 57/240 (23%)

Query: 38  STRRPFGRVFSPVICAHDGSSTPSEN---DNKIVKGAVGASLVLTCALGIMSFSSRMNPK 94
           S+ R  GR+ +P++CA DG STP  N   D+KIV+  +GA++ LTC LG++  S  +N K
Sbjct: 2   SSGRHIGRLLTPIVCAADGVSTPPTNNKGDDKIVRTGIGAAIALTCVLGVIGCSYMINQK 61

Query: 95  AIAAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSA 154
           AI          +P++ V                      SS+ +  + P          
Sbjct: 62  AIT---------SPKIFV--------------------TGSSRHKNIHQP---------- 82

Query: 155 EDVNAIKA-EAVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYRE 213
               A KA E ++ M  GK       LK ++++    P+ AYNV M L++ILI + ++R+
Sbjct: 83  ----AAKANELMELMDRGKHNTVSEKLKGMFQE---NPDEAYNVGMELIKILISRERFRD 135

Query: 214 A------LECNCLKDEQRIPSDGRFPFYKAIIYTMLN-MEEAKKWWEEFAETIDDEEFDP 266
                  LE    K        G    YK +++ M+  + EA+  W +F   +      P
Sbjct: 136 GVTISGQLEGLYGKSTSAPLKKGNLFLYKVLMHGMVGEIGEAELSWNKFKNLLGATSVVP 195


>gi|147792738|emb|CAN68816.1| hypothetical protein VITISV_028771 [Vitis vinifera]
          Length = 255

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 69/251 (27%), Positives = 110/251 (43%), Gaps = 60/251 (23%)

Query: 27  HSSIWISLSKGSTRRPFGRVFSPVICAHDGSSTPSEN---DNKIVKGAVGASLVLTCALG 83
           HSS  + +S G   R  GR+ +P++ A DG STP  N   D+KIV+  +GA++ LTC LG
Sbjct: 38  HSSTSLPMSSG---RHIGRLLTPIVFAADGVSTPPTNNKGDDKIVRTGIGAAIALTCVLG 94

Query: 84  IMSFSSRMNPKAIAAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYW 143
           ++  S  +N KAI          +P++ V                      SS+ +  + 
Sbjct: 95  VIGCSYMINQKAIT---------SPKIFV--------------------TGSSRHKNIHQ 125

Query: 144 PRYTVPPGPSAEDVNAIKA-EAVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVEMALV 202
           P              A KA E ++ M  GK       LK ++++    P+ AYNV M L+
Sbjct: 126 P--------------AAKANELMELMDRGKHNTVSEKLKGMFQE---NPDEAYNVGMELI 168

Query: 203 EILIYQGKYREA------LECNCLKDEQRIPSDGRFPFYKAIIYTMLN-MEEAKKWWEEF 255
           +ILI + ++R+       LE    K        G    YK +++ M+  + EA+  W +F
Sbjct: 169 KILISRERFRDGVTISGQLEGLYGKSTSAPLKKGNLFLYKVLMHGMVGEIGEAELSWNKF 228

Query: 256 AETIDDEEFDP 266
              +      P
Sbjct: 229 KNLLGATSVVP 239


>gi|356557811|ref|XP_003547204.1| PREDICTED: uncharacterized protein LOC100809217 [Glycine max]
          Length = 246

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 56/88 (63%), Gaps = 6/88 (6%)

Query: 172 KPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYRE---ALECNCLKDEQRIPS 228
           K   A  +L +  +   ++P  AY +EM LVE+LIY+G  ++   AL+C CLKDE     
Sbjct: 125 KEGVAENILAQALKKASDKPHEAYEIEMLLVEVLIYKGGKQDLETALKCKCLKDES--LK 182

Query: 229 DGRFPFYKAIIYTML-NMEEAKKWWEEF 255
           D R P +KAIIY ML +++EAK+ W+EF
Sbjct: 183 DARRPLFKAIIYQMLGSIKEAKQHWDEF 210


>gi|224076910|ref|XP_002305045.1| predicted protein [Populus trichocarpa]
 gi|222848009|gb|EEE85556.1| predicted protein [Populus trichocarpa]
          Length = 232

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 73/145 (50%), Gaps = 12/145 (8%)

Query: 118 GGRYALNSFLD---VSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPE 174
           GG+ AL+  +D        AS K+         +   P   D   +++   +    GK  
Sbjct: 45  GGKLALDRVIDYHRARADEASLKSAENELQDLLIEEHP---DFVKLQSTVARLEMSGKEA 101

Query: 175 FAVTLLKKVYEDCKNEPEP--AYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRF 232
            AV +L+   +  + E +   AY +EM LVE+ IYQG++++AL C CL  E+   SD R 
Sbjct: 102 VAVGILETQLKSARKEGKSHVAYEIEMLLVEMHIYQGEFKKALACECLSHEE--ISDARR 159

Query: 233 PFYKAIIYTMLN--MEEAKKWWEEF 255
           P YKAIIY ML    +EA   WE+F
Sbjct: 160 PLYKAIIYIMLEDPGKEAMNCWEKF 184


>gi|357447301|ref|XP_003593926.1| hypothetical protein MTR_2g019410 [Medicago truncatula]
 gi|355482974|gb|AES64177.1| hypothetical protein MTR_2g019410 [Medicago truncatula]
          Length = 261

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/118 (40%), Positives = 64/118 (54%), Gaps = 9/118 (7%)

Query: 156 DVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKN--EPEPAYNVEMALVEILIYQGKYRE 213
           D+  I+  A K    G  + AV +L++  E  +N  +P   Y +EM LVE+LIY+G+   
Sbjct: 125 DLIKIQQAAEKLEMSGSEDSAVEVLERAVEKAENANKPHEVYEIEMFLVEMLIYKGELDR 184

Query: 214 ALECNCLKDEQRIPSDGRFPFYKAIIYTML-NMEEAKKWWEEFAETIDDEEFDPTKGF 270
           AL   CLKD+     D R P YKAIIY M  N E+A + W EF  T+ D    P  GF
Sbjct: 185 ALNRTCLKDDS--LKDARRPLYKAIIYQMKGNTEKADECWNEFL-TVQDP---PCTGF 236


>gi|225444557|ref|XP_002270013.1| PREDICTED: uncharacterized protein LOC100250392 [Vitis vinifera]
          Length = 240

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/253 (27%), Positives = 106/253 (41%), Gaps = 64/253 (25%)

Query: 20  VTKLGSGHSSIWISLSKGSTRRPFGRVFSPVICAHDGSSTPS---ENDNKIVKGAVGASL 76
           +TK   G     +S  KGS  R    + SP+ C  DG S P+   E  N   +  VGASL
Sbjct: 29  LTKTHLGFPPSSLSFPKGSVGRTVQHIPSPITCTLDGFSFPTRKNEGGNNNSRSMVGASL 88

Query: 77  VLTCALGIMSFSSRMNPKAI------AAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVS 130
            L CALGI+S S  M+ KA       ++  MEM  +                     ++ 
Sbjct: 89  ALACALGIISCSGMMHHKAFAASSTASSTTMEMAART--------------------NIE 128

Query: 131 VRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKNE 190
           V+ A             + P   A+       EA K +K           KK+    +N 
Sbjct: 129 VKTAMD----------LLMPSRGAD-------EAEKYLK-----------KKIKAHNENN 160

Query: 191 PEPAYNVEMALVEILIYQGKYREAL-ECNCLKDEQR----IPSDGRFPFYKAIIYTMLNM 245
            E  Y   +AL E+L+++GKY+EA  +C  L+        +  D +F   +  IY MLN 
Sbjct: 161 -EAGYYARVALAEVLMHRGKYKEAAQQCQILEQNPNYHYLVKKDKKFHLMQMFIYGMLNE 219

Query: 246 E-EAKKWWEEFAE 257
           + + K++W+ F +
Sbjct: 220 DKKVKQYWKNFKQ 232


>gi|297826919|ref|XP_002881342.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327181|gb|EFH57601.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 277

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 62/127 (48%), Gaps = 4/127 (3%)

Query: 118 GGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAV 177
           GG+ AL   LD+      + +       +         D   ++ + VK    GK    V
Sbjct: 82  GGKLALQRLLDLHSYRVDTSSLANAEIEFEALLSKENPDFGLLQRDIVKMEMSGKEANGV 141

Query: 178 TLLKKVYEDCKNEPE--PAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPFY 235
            LLKK  E  + E +   AY +EM LVE+LIY G   EA +C CL+DE  + +D R P Y
Sbjct: 142 ELLKKALEKARKEGKGHEAYEIEMLLVEMLIYLGNIEEASKCKCLEDE--VITDARRPLY 199

Query: 236 KAIIYTM 242
           ++II+ +
Sbjct: 200 QSIIHYL 206


>gi|42569613|ref|NP_181000.2| uncharacterized protein [Arabidopsis thaliana]
 gi|28207102|gb|AAO37191.1| hypothetical protein [Arabidopsis thaliana]
 gi|50058951|gb|AAT69220.1| hypothetical protein At2g34530 [Arabidopsis thaliana]
 gi|330253891|gb|AEC08985.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 282

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 60/127 (47%), Gaps = 4/127 (3%)

Query: 118 GGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAV 177
           GG+ AL   LD+      + +       +         D   ++ + VK    GK     
Sbjct: 85  GGKLALQRLLDLHSYRGDTSSLANAELEFEALLSKEKPDFGLLQRDIVKMEMSGKEAKGA 144

Query: 178 TLLKKVYEDCKNEPE--PAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPFY 235
            LLKK  E  + E +   AY +EM LVE+LIY G   EA +C CL+DE  + +D R P Y
Sbjct: 145 ELLKKALEKARKEGKGHEAYEIEMLLVEMLIYLGNMEEASKCKCLEDE--VITDARRPLY 202

Query: 236 KAIIYTM 242
           + II+ +
Sbjct: 203 QTIIHYL 209


>gi|147836573|emb|CAN64229.1| hypothetical protein VITISV_011837 [Vitis vinifera]
          Length = 240

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/200 (32%), Positives = 94/200 (47%), Gaps = 29/200 (14%)

Query: 73  GASLVLTCALGIMSFSSRMNPKAIAAG---PMEMYQK-APRMSVLPHPIGGRYALNSFLD 128
           GA +VL   LGI        P A++AG    M+  +K + R+S L    GG+  L   + 
Sbjct: 12  GALMVL---LGI--------PLAVSAGIYFAMKYTEKKSDRLSPLVALHGGQQVLKRLIG 60

Query: 129 VSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVT--------LL 180
               LA          R  +    S  + +  K   +K+++    E  +T        +L
Sbjct: 61  YHGALAKMDLAEATMKRILIED-KSVLNQDKRKRPNIKKLERAIAELEMTSRAVKAADML 119

Query: 181 KKVYEDC--KNEPEPAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPFYKAI 238
           K  +ED   K     AY  +M  VE+LIY G++  AL C CL+D QRI +D R P +KA+
Sbjct: 120 KCAFEDAEAKGNLHEAYEFQMLYVEMLIYMGQWDAALTCKCLED-QRI-TDARRPLFKAV 177

Query: 239 IYTML-NMEEAKKWWEEFAE 257
           I  +   + E KK+WEEF E
Sbjct: 178 INILSEKVSEGKKYWEEFRE 197


>gi|147765651|emb|CAN64742.1| hypothetical protein VITISV_006458 [Vitis vinifera]
          Length = 403

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 48/83 (57%), Gaps = 5/83 (6%)

Query: 176 AVTLLKKVYEDCKNE--PEPAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFP 233
           AV  LK   E  + E      Y + M LVE+LIY+G + EAL   CLKDE+   SD R P
Sbjct: 286 AVKALKSAMEKAQREGKAHEEYEIGMLLVEMLIYKGDWNEALSYKCLKDEK--ISDARRP 343

Query: 234 FYKAIIYTMLNM-EEAKKWWEEF 255
            YKAIIY +    EEA+  WEEF
Sbjct: 344 LYKAIIYILXXXCEEAESCWEEF 366


>gi|147781676|emb|CAN75956.1| hypothetical protein VITISV_012561 [Vitis vinifera]
          Length = 240

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 70/254 (27%), Positives = 109/254 (42%), Gaps = 66/254 (25%)

Query: 20  VTKLGSGHSSIWISLSKGSTRRPFGRVFSPVICAHDGSSTPS---ENDNKIVKGAVGASL 76
           +TK   G     +S  KGS  R    + SP+ C  DG S P+   E  N   +  VGASL
Sbjct: 29  LTKTHLGFPPSSLSFPKGSVGRTVQHIPSPITCTLDGFSFPTRKNEGGNNNSRSMVGASL 88

Query: 77  VLTCALGIMSFSSRMNPKAI------AAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDVS 130
            L CALGI+S S  M+ KA       ++  MEM   A R ++          + + +D+ 
Sbjct: 89  ALACALGIISCSGMMHHKAFAAXSTASSTTMEM---AARTNI---------EVKTAMDL- 135

Query: 131 VRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKNE 190
                            + P   A+       EA K +K           KK+     NE
Sbjct: 136 -----------------LMPSRGAD-------EAEKYLK-----------KKI--KAHNE 158

Query: 191 PEPA-YNVEMALVEILIYQGKYREALECNCLKDEQ-----RIPSDGRFPFYKAIIYTMLN 244
              A Y   +AL E+L+++GKY+EA +   + ++       +  D +F   +  IY MLN
Sbjct: 159 DNEAGYYARVALAEVLMHRGKYKEAAQQXQILEQNPNYHYLVKKDKKFHLMQMFIYGMLN 218

Query: 245 ME-EAKKWWEEFAE 257
            + + K++W+ F +
Sbjct: 219 EDKKVKQYWKNFKQ 232


>gi|297740692|emb|CBI30874.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 78/164 (47%), Gaps = 14/164 (8%)

Query: 105 QKAPRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEA 164
           +K+ R+S L    GG+  L   +     LA          R  +    S  + +  K   
Sbjct: 6   KKSDRLSPLVALHGGQQVLKRLIGYHGALAKMDLAEATMKRILIE-DKSVLNQDKRKRPN 64

Query: 165 VKQMKYGKPEFAVT--------LLKKVYEDC--KNEPEPAYNVEMALVEILIYQGKYREA 214
           +K+++    E  +T        +LK  +ED   K     AY  +M  VE+LIY G++  A
Sbjct: 65  IKKLERAIAELEMTSRAVKAADMLKCAFEDAEAKGNLHEAYEFQMLYVEMLIYMGQWDAA 124

Query: 215 LECNCLKDEQRIPSDGRFPFYKAIIYTML-NMEEAKKWWEEFAE 257
           L C CL+D QRI +D R P +KA+I  +   + E KK+WEEF E
Sbjct: 125 LTCKCLED-QRI-TDARRPLFKAVINILSEKVSEGKKYWEEFRE 166


>gi|388510690|gb|AFK43411.1| unknown [Medicago truncatula]
          Length = 219

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 4/89 (4%)

Query: 156 DVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKN--EPEPAYNVEMALVEILIYQGKYRE 213
           D+  I+  A K    G  + AV +L++  E  +N  +P   Y +EM LVE+LIY+G+   
Sbjct: 125 DLVKIQQAAEKLEMSGSEDSAVEVLERAVEKAENANKPHEVYEIEMFLVEMLIYKGELDR 184

Query: 214 ALECNCLKDEQRIPSDGRFPFYKAIIYTM 242
           AL   CLKD+     D R P YKAIIY M
Sbjct: 185 ALNRTCLKDDS--LKDARRPLYKAIIYQM 211


>gi|42571051|ref|NP_973599.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330253892|gb|AEC08986.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 241

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 57/122 (46%), Gaps = 4/122 (3%)

Query: 118 GGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAV 177
           GG+ AL   LD+      + +       +         D   ++ + VK    GK     
Sbjct: 85  GGKLALQRLLDLHSYRGDTSSLANAELEFEALLSKEKPDFGLLQRDIVKMEMSGKEAKGA 144

Query: 178 TLLKKVYEDCKNEPE--PAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSDGRFPFY 235
            LLKK  E  + E +   AY +EM LVE+LIY G   EA +C CL+DE  + +D R P Y
Sbjct: 145 ELLKKALEKARKEGKGHEAYEIEMLLVEMLIYLGNMEEASKCKCLEDE--VITDARRPLY 202

Query: 236 KA 237
           +A
Sbjct: 203 QA 204


>gi|255646887|gb|ACU23913.1| unknown [Glycine max]
          Length = 170

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 80/176 (45%), Gaps = 26/176 (14%)

Query: 1   MESALCLNFCPS-------VAMRVKPVTKLGSGHSSIWISLSKGSTRRPFGRVFSPVICA 53
           MESA+CL  CP           R      L   H++        +TR    R F  V C 
Sbjct: 1   MESAICLRHCPFPYGSKGFQGARPSRFHHLKINHATPLSLPQISTTRSSVTRSFGHVNCM 60

Query: 54  HDG-SSTPSEND---NKIVKGAVGASLVLTCALGIMSFSSR-MNPKAIAAGPMEMYQKAP 108
            +   S PSEN+   N+I++G   ASLVLTC LG++SFS + MNPK   A     Y    
Sbjct: 61  LERFISMPSENEGSGNRILRGVAAASLVLTCVLGLLSFSGKNMNPKLTTA-----YGNG- 114

Query: 109 RMSVLPHPIG---GRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIK 161
               L  PI    GR+AL S L++      +  +      ++V  GPS ++V  +K
Sbjct: 115 ----LEFPISKPQGRFALKSLLEIPKDEELADTQNKMRSNFSV-RGPSQQEVKNLK 165


>gi|297740696|emb|CBI30878.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 103/257 (40%), Gaps = 71/257 (27%)

Query: 1   MESALCLNFCP--SVAMRVKPVTKLGSGHSSIW------ISLSKGSTRRPFGRVFSPVIC 52
           M+S+ CL +C   S+ +RV         +   +      +S+SK S   P   + SP+ C
Sbjct: 1   MDSSFCLRYCSLSSLPLRVPSTVNFSRANKPHFSFRPSSLSVSKSSRSGPIRHIPSPITC 60

Query: 53  AHDGS---STPSEND---NKIVKGAVGASLVLTCALGIMSFSSRMNPKAIAAGPMEMYQK 106
           A +GS   S P+ ND   N I +G V  SL + C LGI+  S  M  KA A         
Sbjct: 61  ATNGSNGFSRPATNDKGDNNIARGTVRVSLAMACVLGIIGCSYTMKHKAHA--------- 111

Query: 107 APRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVK 166
                  P PIG  +   +        A++KA  F                       + 
Sbjct: 112 ------FP-PIGSPWGAET--------ATNKANRF-----------------------ID 133

Query: 167 QMKYG-KPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREA-LECNCLKDEQ 224
           QM  G  P      LK  + +    PE AYNV+M L+ I I +GKY EA   C  L  ++
Sbjct: 134 QMDRGVAPSKIDEQLKNTFME---NPEEAYNVQMELIRIHINRGKYDEASRHCEDLSKKE 190

Query: 225 -----RIPSDGRFPFYK 236
                   +D R+  YK
Sbjct: 191 IYYSRYFRTDQRYHLYK 207


>gi|147799100|emb|CAN68158.1| hypothetical protein VITISV_038906 [Vitis vinifera]
          Length = 233

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 105/264 (39%), Gaps = 71/264 (26%)

Query: 1   MESALCLNFCP--SVAMRVKPVTKLGSGHSSIW------ISLSKGSTRRPFGRVFSPVIC 52
           M+S+ CL +C   S+ +RV         +   +      +S+SK S   P   + SP+ C
Sbjct: 1   MDSSFCLRYCSLSSLPLRVPSTVNFSRANKPHFSFRPSSLSVSKSSRSGPIRHIPSPITC 60

Query: 53  AHDGS---STPSEND---NKIVKGAVGASLVLTCALGIMSFSSRMNPKAIAAGPMEMYQK 106
           A +GS   S P+ ND   N I +G V   L + C LGI+  S  M  KA A         
Sbjct: 61  ATNGSNGFSRPATNDKGDNNIARGTVRVXLAMACVLGIIGCSYTMKHKAHA--------- 111

Query: 107 APRMSVLPHPIGGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVK 166
                  P PIG  +   +        A++KA  F                       + 
Sbjct: 112 ------FP-PIGSPWGAET--------ATNKANRF-----------------------ID 133

Query: 167 QMKYG-KPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREA-LECNCLKDEQ 224
           QM  G  P      LK  + +    PE AYNV+M L+ I I +GKY EA   C  L  ++
Sbjct: 134 QMDRGVAPSKIDEQLKNTFME---NPEEAYNVQMELIRIHINRGKYDEASRHCEDLSKKE 190

Query: 225 -----RIPSDGRFPFYKAIIYTML 243
                   +D R+  YK  +  ++
Sbjct: 191 IYYSRYFRTDQRYHLYKCXVLQII 214


>gi|449466991|ref|XP_004151209.1| PREDICTED: uncharacterized protein LOC101223225 [Cucumis sativus]
 gi|449517186|ref|XP_004165627.1| PREDICTED: uncharacterized LOC101223225 [Cucumis sativus]
          Length = 247

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 62/108 (57%), Gaps = 10/108 (9%)

Query: 157 VNAIKAEAVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREALE 216
           +   K+EA+K +  GK   A + ++ +    K + + AY   +A ++ILI+  KY +AL 
Sbjct: 82  ITRFKSEALKLVMDGKYNEAESHMEAL---LKGDTDVAYEARLAHLQILIHLDKYEKAL- 137

Query: 217 CNCLKDEQRIPS----DGRFPFYKAIIYTMLNM-EEAKKWWEEFAETI 259
            N L+ E   P     + R   YKA++YTML+  ++A+KWW ++ +T+
Sbjct: 138 -NFLEKEGDFPRSKLWEERLFLYKAVVYTMLDKDDDAEKWWNKYVDTL 184


>gi|357476437|ref|XP_003608504.1| hypothetical protein MTR_4g096830 [Medicago truncatula]
 gi|355509559|gb|AES90701.1| hypothetical protein MTR_4g096830 [Medicago truncatula]
          Length = 372

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 35/198 (17%)

Query: 38  STRRPFGRVFSPVICAHDGS-STPSE-----NDNKIVKGAVGASLVLTCALGIMSFSSRM 91
           S    F R F    C  + S S P+      ++NK ++G  G SLVL C LGI++FS  M
Sbjct: 52  SASHQFARSFGHANCMINNSFSVPNSKNIEGSENKNLRGMSGVSLVLGCILGIINFSGMM 111

Query: 92  NPKAIAAGPMEMYQKAPRMSVLPHPIGGRYALNSFLDV--SVRLASSKAEPFYWPRYTVP 149
           NPK   A P +           P  IG    +N+F  +  ++     +  P   P  T+ 
Sbjct: 112 NPKISMALPFD-----------PTNIGR--GVNTFDSLWNTINAEGVELNPKLDPNETL- 157

Query: 150 PGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYEDC--KNEPEPAYN--VEMALVEIL 205
                  V+  K  A+     GK    V +++K+ ++     E +PA    +  A+ E+L
Sbjct: 158 -------VDKKKMHALYLRNRGKKREEVEMVEKLKDEYTKSKEDDPAREPYLRKAVFELL 210

Query: 206 IYQGKYREALECNCLKDE 223
           + QG++ EA  C  L  +
Sbjct: 211 LIQGRFDEA--CKLLDSD 226


>gi|449466989|ref|XP_004151208.1| PREDICTED: uncharacterized protein LOC101222991 [Cucumis sativus]
 gi|449517184|ref|XP_004165626.1| PREDICTED: uncharacterized LOC101222991 [Cucumis sativus]
          Length = 106

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 55/97 (56%), Gaps = 2/97 (2%)

Query: 164 AVKQMKYGKPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEILIYQGKYREALECNCLKDE 223
           A+K++  G+ EFA +L+K   E  K+  E     ++  + ILIY  +Y EAL+     + 
Sbjct: 2   AMKKVVNGEYEFAKSLMKSKVELIKDSQERL-EAQITHIHILIYLDEYEEALDILSEIEY 60

Query: 224 QRIPSDGRFPFYKAIIYTML-NMEEAKKWWEEFAETI 259
           Q  PSD R   YKAI  TML N ++AK  W+ F +TI
Sbjct: 61  QFSPSDFRPWLYKAIGLTMLGNHKDAKTCWKAFMKTI 97


>gi|297740698|emb|CBI30880.3| unnamed protein product [Vitis vinifera]
          Length = 129

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/67 (44%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 172 KPEFAVTLLKKVYEDCKNE--PEPAYNVEMALVEILIYQGKYREALECNCLKDEQRIPSD 229
           K   AV  LK   E  + E      Y + M LVE+LIY+G + EAL   CLKDE+   SD
Sbjct: 59  KEAAAVKALKSAMEKAQREGKAHEEYEIGMLLVEMLIYKGDWNEALSYKCLKDEK--ISD 116

Query: 230 GRFPFYK 236
            R P YK
Sbjct: 117 ARRPLYK 123


>gi|297740697|emb|CBI30879.3| unnamed protein product [Vitis vinifera]
          Length = 158

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 82/199 (41%), Gaps = 61/199 (30%)

Query: 72  VGASLVLTCALGIMSFSSRMNPKAIAAGP------MEMYQKAPRMSVLPHPIGGRYALNS 125
           VGASL L CALGI+S S  M+ KA AA        MEM   A R ++          + +
Sbjct: 2   VGASLALACALGIISCSGMMHHKAFAASSTASSTTMEM---AARTNI---------EVKT 49

Query: 126 FLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKKVYE 185
            +D+                  + P   A++                   A   LKK  +
Sbjct: 50  AMDL------------------LMPSRGADE-------------------AEKYLKKKIK 72

Query: 186 DCKNEPEPAYNVEMALVEILIYQGKYREAL-ECNCLKD----EQRIPSDGRFPFYKAIIY 240
                 E  Y   +AL E+L+++GKY+EA  +C  L+        +  D +F   +  IY
Sbjct: 73  AHNENNEAGYYARVALAEVLMHRGKYKEAAQQCQILEQNPNYHYLVKKDKKFHLMQMFIY 132

Query: 241 TMLNME-EAKKWWEEFAET 258
            MLN + + K++W+ F + 
Sbjct: 133 GMLNEDKKVKQYWKNFKQA 151


>gi|449433724|ref|XP_004134647.1| PREDICTED: uncharacterized protein LOC101211314 [Cucumis sativus]
 gi|449518593|ref|XP_004166321.1| PREDICTED: uncharacterized LOC101211314 [Cucumis sativus]
          Length = 103

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/60 (50%), Positives = 37/60 (61%), Gaps = 3/60 (5%)

Query: 199 MALVEILIYQGKYREALECNCLKDEQRIPSDGRFPFYKAIIYTMLN-MEEAKKWWEEFAE 257
           M LVE+ IY+G   +A E  CLK++    SD R P YKAII  + N  +EA K WEEF E
Sbjct: 1   MLLVELYIYKGDLVKAEELPCLKNDA--TSDVRRPLYKAIIKVLQNETQEAIKEWEEFRE 58


>gi|3128226|gb|AAC26706.1| hypothetical protein [Arabidopsis thaliana]
          Length = 266

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 118 GGRYALNSFLDVSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAV 177
           GG+ AL   LD+      + +       +         D   ++ + VK    GK     
Sbjct: 95  GGKLALQRLLDLHSYRGDTSSLANAELEFEALLSKEKPDFGLLQRDIVKMEMSGKEAKGA 154

Query: 178 TLLKKVYEDCKNEPE--PAYNVEMALVEILIY 207
            LLKK  E  + E +   AY +EM LVE+LIY
Sbjct: 155 ELLKKALEKARKEGKGHEAYEIEMLLVEMLIY 186


>gi|380025673|ref|XP_003696593.1| PREDICTED: LOW QUALITY PROTEIN: ubiquitin carboxyl-terminal
           hydrolase 5-like [Apis florea]
          Length = 805

 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 7/93 (7%)

Query: 129 VSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLK---KVYE 185
           V+++ A+SK  P Y P YT      A   N+    +V QM +  P+F    +    ++++
Sbjct: 300 VALQEATSKLTPLYGPGYT----GLANLGNSCYLNSVMQMLFVIPDFIKRFIDGASQIFQ 355

Query: 186 DCKNEPEPAYNVEMALVEILIYQGKYREALECN 218
               +P   +NV+MA + I +  GKY +  + N
Sbjct: 356 QNYTDPANDFNVQMAKLGIGLLSGKYSDPSDTN 388


>gi|328788537|ref|XP_624702.2| PREDICTED: ubiquitin carboxyl-terminal hydrolase 5-like [Apis
           mellifera]
          Length = 802

 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 7/93 (7%)

Query: 129 VSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLK---KVYE 185
           V+++ A+SK  P Y P YT      A   N+    +V QM +  P+F    +    ++++
Sbjct: 300 VALQEATSKLTPLYGPGYT----GLANLGNSCYLNSVMQMLFVIPDFIKRFIDGASQIFQ 355

Query: 186 DCKNEPEPAYNVEMALVEILIYQGKYREALECN 218
               +P   +NV+MA + I +  GKY +  + N
Sbjct: 356 QNYTDPANDFNVQMAKLGIGLLSGKYSDPSDTN 388


>gi|346318673|gb|EGX88275.1| xenobiotic compound monooxygenase, DszA family, putative [Cordyceps
           militaris CM01]
          Length = 489

 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 55/127 (43%), Gaps = 8/127 (6%)

Query: 29  SIWISLSKGSTRRPFGRVF-SPVICAHDGSSTPSENDNKIVKGA---VGASLVLTCALGI 84
           S W++L+K      F  +F + V+ A+D    P   D  I+ GA   V   L +  A+  
Sbjct: 39  SHWVNLAKKLEEGKFHGIFIADVLGAYDVYKGPRNPDPAIISGAQWPVNEPLAVVSAMAA 98

Query: 85  MSFSSRMNPKAIAAGPMEMYQKAPRMSVLPHPIGGRYALN---SFLDVSVRLASSKAEPF 141
            + +       +A    + Y  A R+S + H  GGR   N    +LD + R    + +P 
Sbjct: 99  ATRNLGFG-VTVATTYEQPYHLARRLSTVDHLTGGRLGWNVVTGYLDSAARNLGFQEQPK 157

Query: 142 YWPRYTV 148
           +  RY V
Sbjct: 158 HDERYAV 164


>gi|167522819|ref|XP_001745747.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776096|gb|EDQ89718.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1619

 Score = 37.7 bits (86), Expect = 5.2,   Method: Composition-based stats.
 Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 18/101 (17%)

Query: 122  ALNSFLDVSVRLASSKAEPFYWPRYT-------VPPGPSAEDVNAIKA------EAVKQM 168
            +L   L   V LAS+ +E  Y    T       +PP    + ++ I++      EA+ Q+
Sbjct: 1196 SLTQLLVQRVTLASNMSEEAYERFITSKLRDAQIPPSQVPQVLDVIRSRKPASPEALHQI 1255

Query: 169  KY-----GKPEFAVTLLKKVYEDCKNEPEPAYNVEMALVEI 204
             Y     G+P+   T L+ V E C N+PE    V M L+++
Sbjct: 1256 FYFVSEHGEPQEVQTYLQMVKETCNNDPETMVRVAMGLIDV 1296


>gi|297740742|emb|CBI30924.3| unnamed protein product [Vitis vinifera]
          Length = 1323

 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 36  KGSTRRPFGRVFSPVICAHDGSSTPSE----NDNKIVKGAVGASLVLTCALGIMSFSSRM 91
           KG    P  ++  P+ CA  G  TPS     ND+ + K    AS  L C +GI+  S  +
Sbjct: 577 KGFRSYPARQILGPITCAASGFRTPSRRNKANDHNMAKRVGRASFTLACVIGIIGCSFWV 636

Query: 92  NPKAIAA 98
             + +AA
Sbjct: 637 KQEVMAA 643


>gi|322801464|gb|EFZ22125.1| hypothetical protein SINV_08392 [Solenopsis invicta]
          Length = 818

 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 7/100 (7%)

Query: 129 VSVRLASSKAEPFYWPRYTVPPGPSAEDVNAIKAEAVKQMKYGKPEFAVTLLKK---VYE 185
           VS++ A+SK  P Y   YT          N+    +V QM +  P+F    ++K   ++ 
Sbjct: 317 VSLQEAASKLTPLYGSGYTGLTNLG----NSCYLNSVMQMLFIIPDFVERYVEKAPRIFA 372

Query: 186 DCKNEPEPAYNVEMALVEILIYQGKYREALECNCLKDEQR 225
           +  N+P    NV+MA + I +  GKY +    +   DE R
Sbjct: 373 NNFNDPASDINVQMAKLGIGLLSGKYSDQPSKSSTDDESR 412


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,397,678,596
Number of Sequences: 23463169
Number of extensions: 180640397
Number of successful extensions: 450991
Number of sequences better than 100.0: 48
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 450896
Number of HSP's gapped (non-prelim): 61
length of query: 271
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 131
effective length of database: 9,074,351,707
effective search space: 1188740073617
effective search space used: 1188740073617
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)