BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017151
         (376 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255583327|ref|XP_002532426.1| conserved hypothetical protein [Ricinus communis]
 gi|223527875|gb|EEF29967.1| conserved hypothetical protein [Ricinus communis]
          Length = 658

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 249/355 (70%), Positives = 286/355 (80%), Gaps = 2/355 (0%)

Query: 12  SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKI 71
           +E+L+ARIQQLEHERDEL KDIEQLCMQQAGPSYLAVAT+MHFQRTAGLEQEIE LK+K+
Sbjct: 8   NESLIARIQQLEHERDELHKDIEQLCMQQAGPSYLAVATKMHFQRTAGLEQEIENLKKKL 67

Query: 72  AACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSV 131
           AAC R+N NLQEELSEAYRIK QLADLH AEV KN EAEKQ+KFFQGC+A+AFAERDNS+
Sbjct: 68  AACTRDNLNLQEELSEAYRIKSQLADLHNAEVAKNKEAEKQLKFFQGCVASAFAERDNSI 127

Query: 132 MEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVIN 191
           MEAEKAKEKEELMSQKFNEFQ RLEEL+S+ +E K+QN  L+ +L KQEE NES K+V N
Sbjct: 128 MEAEKAKEKEELMSQKFNEFQKRLEELTSDCLEQKRQNEALQINLAKQEEQNESLKKVAN 187

Query: 192 KFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVEN 250
           KFYEIRQ SLE  E  SWEDKC  LL DS EMWS+ND STS YISALE+ELE+ R S +N
Sbjct: 188 KFYEIRQHSLEGFEDASWEDKCTWLLHDSKEMWSYNDASTSNYISALEEELEQVRKSADN 247

Query: 251 LQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRS 310
           LQSKLR+GLEIENHLKK VRELEKK I  DK + N IA LR  HS+ R H++N L EG+ 
Sbjct: 248 LQSKLRVGLEIENHLKKQVRELEKKQIQLDKMVMNGIAGLRHYHSEHRGHIMNLLNEGKL 307

Query: 311 HIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNLVSQ 365
           H+KS  D++EEK        + QN    QR   L+E ECRDVHI+ND  + L+S+
Sbjct: 308 HMKSTMDMLEEKIGETYGS-KEQNLRPSQRVIDLEENECRDVHISNDIGSALISE 361


>gi|296083035|emb|CBI22439.3| unnamed protein product [Vitis vinifera]
          Length = 673

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 244/363 (67%), Positives = 291/363 (80%), Gaps = 5/363 (1%)

Query: 7   EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
           EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE 
Sbjct: 3   EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62

Query: 67  LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
           LK+K+AA  REN NLQEELSEAYRIK QLADLH AEV KN++AEKQ+KFFQGC+AAAFAE
Sbjct: 63  LKKKVAASTRENLNLQEELSEAYRIKSQLADLHNAEVSKNLDAEKQIKFFQGCVAAAFAE 122

Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
           RD+S+MEAEKAKEKEEL  Q+FN FQ R++EL S+N++ K+ N  L+ DL  QE+ NE+F
Sbjct: 123 RDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQLNVALQIDLANQEDQNETF 182

Query: 187 KEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
           K+VINKF+EIRQ SLE  E  +W+DKC CLL D  EMWSFN   +TSTSKYI ALE+ELE
Sbjct: 183 KKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNTNEETSTSKYIDALEEELE 242

Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
             R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S +   N I+ L   HSQ R+HVV
Sbjct: 243 MVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFKNRISGLLHYHSQHRLHVV 302

Query: 303 NSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
           N L+EG+SH+KSI DV+EEK +   D  R QN    QR+ KL E ECRDVH++   D N 
Sbjct: 303 NLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKLYENECRDVHVSIVGDHNS 361

Query: 363 VSQ 365
           V++
Sbjct: 362 VAE 364


>gi|359475478|ref|XP_002264967.2| PREDICTED: uncharacterized protein LOC100251573 [Vitis vinifera]
          Length = 721

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 244/363 (67%), Positives = 291/363 (80%), Gaps = 5/363 (1%)

Query: 7   EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
           EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE 
Sbjct: 3   EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62

Query: 67  LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
           LK+K+AA  REN NLQEELSEAYRIK QLADLH AEV KN++AEKQ+KFFQGC+AAAFAE
Sbjct: 63  LKKKVAASTRENLNLQEELSEAYRIKSQLADLHNAEVSKNLDAEKQIKFFQGCVAAAFAE 122

Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
           RD+S+MEAEKAKEKEEL  Q+FN FQ R++EL S+N++ K+ N  L+ DL  QE+ NE+F
Sbjct: 123 RDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQLNVALQIDLANQEDQNETF 182

Query: 187 KEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
           K+VINKF+EIRQ SLE  E  +W+DKC CLL D  EMWSFN   +TSTSKYI ALE+ELE
Sbjct: 183 KKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNTNEETSTSKYIDALEEELE 242

Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
             R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S +   N I+ L   HSQ R+HVV
Sbjct: 243 MVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFKNRISGLLHYHSQHRLHVV 302

Query: 303 NSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
           N L+EG+SH+KSI DV+EEK +   D  R QN    QR+ KL E ECRDVH++   D N 
Sbjct: 303 NLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKLYENECRDVHVSIVGDHNS 361

Query: 363 VSQ 365
           V++
Sbjct: 362 VAE 364


>gi|356495819|ref|XP_003516769.1| PREDICTED: uncharacterized protein LOC100775191 [Glycine max]
          Length = 653

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 221/349 (63%), Positives = 271/349 (77%), Gaps = 3/349 (0%)

Query: 11  ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQK 70
           + E+L+ARIQQLE ERDELRKDIEQLCMQQAGP YLAVATRMHFQRTAGLEQEIE LK+K
Sbjct: 6   DDESLLARIQQLEQERDELRKDIEQLCMQQAGPGYLAVATRMHFQRTAGLEQEIESLKKK 65

Query: 71  IAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130
           +AAC REN NLQEELSEAYRIKGQLADLH+AEV KNMEAEKQVKFFQGC+AAAF+ERD +
Sbjct: 66  LAACTRENLNLQEELSEAYRIKGQLADLHSAEVSKNMEAEKQVKFFQGCVAAAFSERDQA 125

Query: 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190
           ++EAEKAKEKEE M Q+ N    R+EEL+S+ ++LK+ N  L+ D     + NE+  +VI
Sbjct: 126 IIEAEKAKEKEETMLQQINGIHKRVEELTSDCLKLKEFNDALQIDQAVHMKQNENCMKVI 185

Query: 191 NKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
           NKF++IRQ SL+  E  SW +KCACLL DS E+WSFND STSKYISALE++LE+ R+S++
Sbjct: 186 NKFFQIRQHSLKEFEDMSWNEKCACLLGDSEEVWSFNDASTSKYISALEEQLERLRNSMD 245

Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
            LQ+KLR+GLEIENHLKK V  LE K I  DK I N+IA+L+  HS+ R  ++N L +G 
Sbjct: 246 YLQNKLRVGLEIENHLKKRVNALENKQISMDKVIENSIADLKHYHSKCRDEIMNLLGDGE 305

Query: 310 SHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDA 358
           S IKSI + I+EK    D  +      T QR+ + +E EC D+HI+  A
Sbjct: 306 SSIKSIINAIDEKVWSFD--LSTVPNLTPQRDAEPEESECADLHISPQA 352


>gi|15225987|ref|NP_179075.1| myosin heavy chain-related protein [Arabidopsis thaliana]
 gi|334184229|ref|NP_001189526.1| myosin heavy chain-related protein [Arabidopsis thaliana]
 gi|3810592|gb|AAC69374.1| unknown protein [Arabidopsis thaliana]
 gi|330251225|gb|AEC06319.1| myosin heavy chain-related protein [Arabidopsis thaliana]
 gi|330251227|gb|AEC06321.1| myosin heavy chain-related protein [Arabidopsis thaliana]
          Length = 629

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 220/369 (59%), Positives = 268/369 (72%), Gaps = 9/369 (2%)

Query: 1   MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
           MD N  E EN    L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VA RMHFQRTA L
Sbjct: 1   MDLNLDENEN----LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVAARMHFQRTASL 56

Query: 61  EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
           EQEIE LK K+AAC RE  NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +
Sbjct: 57  EQEIESLKLKLAACTREKHNLQEELAEAYRVKAQLADLHAGEVAKNLEAEKQVRFFQGSV 116

Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
           AAAF+ERD SVMEAEKA+E  E+MSQK +E + RLEELSS+ +  K+ N TL+ DL K E
Sbjct: 117 AAAFSERDKSVMEAEKAEENAEMMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLE 176

Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
           E   ++  VI KFY+IR+ SL E LE S  +KCA LL D  E W+FND STS Y++ALE 
Sbjct: 177 EQTRTYAGVIEKFYDIRKASLCESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEG 236

Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
           EL K +++V+NLQSKLR+GLEIENHLKK VR LEKK I +D  I N I ++R  HSQLR 
Sbjct: 237 ELGKVKNTVDNLQSKLRVGLEIENHLKKRVRALEKKNIVADGLIVNGITDIRHHHSQLRA 296

Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDA 358
           +++  L E   +IKSIS  +EEK + H  +V   QN    Q + K DE ECRDVH+    
Sbjct: 297 YIIALLNEEGLYIKSISKNVEEKLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTTVV 353

Query: 359 DTNLVSQVC 367
           ++  V+++ 
Sbjct: 354 ESCQVTKLA 362


>gi|449438615|ref|XP_004137083.1| PREDICTED: uncharacterized protein LOC101211967 [Cucumis sativus]
          Length = 641

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 206/313 (65%), Positives = 254/313 (81%), Gaps = 1/313 (0%)

Query: 11  ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQK 70
           ++E L+ RIQQLE+ERDELRKDIEQLCMQQAGP YL+VATRMHFQRTAGLEQEIE LK+K
Sbjct: 7   DNEHLLTRIQQLEYERDELRKDIEQLCMQQAGPGYLSVATRMHFQRTAGLEQEIENLKKK 66

Query: 71  IAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130
            AAC REN NLQEEL+EAYRIK QLADLH++EV KN+E+EKQ+KFFQGC+A AFAERD+S
Sbjct: 67  FAACTRENHNLQEELAEAYRIKSQLADLHSSEVAKNLESEKQLKFFQGCVATAFAERDHS 126

Query: 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190
           +MEAEKAKEKEE MSQK  E + RLEELSS+ + L++ N TLR +L KQEE NE  K+VI
Sbjct: 127 IMEAEKAKEKEETMSQKLKEVENRLEELSSDCLNLRETNDTLRINLAKQEEWNEVSKQVI 186

Query: 191 NKFYEIRQQSLE-VLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
           NKFYEIRQ SLE  ++TSW++KC CLL DS+EMWSFND+ST +YI++LE+ LE  + +V+
Sbjct: 187 NKFYEIRQNSLEGFMDTSWDEKCQCLLHDSSEMWSFNDSSTFRYINSLEEGLETMKKTVD 246

Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
           NLQ+KLRMG+EIE HLK  V +LE K+IH D  + + I+     +SQ R H++N LE+  
Sbjct: 247 NLQNKLRMGVEIEKHLKGKVHDLELKLIHMDDMVKSKISGFCQNYSQYRDHILNLLEKEM 306

Query: 310 SHIKSISDVIEEK 322
           S++ S    IEEK
Sbjct: 307 SNMHSTIGEIEEK 319


>gi|297831932|ref|XP_002883848.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329688|gb|EFH60107.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 629

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 220/368 (59%), Positives = 269/368 (73%), Gaps = 7/368 (1%)

Query: 1   MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
           MD N  E EN    L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VATRMHFQRTA L
Sbjct: 1   MDLNLDENEN----LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVATRMHFQRTASL 56

Query: 61  EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
           EQEIE LK K+AAC RE  NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +
Sbjct: 57  EQEIESLKLKLAACTREKHNLQEELAEAYRVKTQLADLHAGEVAKNLEAEKQVRFFQGSV 116

Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
           AAAF+ERD SVMEAEKA+E  E+MSQK  E + RLEELSS+ +  K+ N TL+ DL K E
Sbjct: 117 AAAFSERDKSVMEAEKAEENAEMMSQKLGEIEMRLEELSSDCLVQKRLNDTLQADLAKLE 176

Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
           E   ++ +VI KF +IR+ SL E LE S  +KCA LL D  E W+FND STS+Y++ALE 
Sbjct: 177 EQTRTYAQVIEKFDDIRKASLCESLEMSLHNKCASLLDDPIESWTFNDPSTSEYVAALEG 236

Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
           EL K +++V+NLQSKLR+GLEIENHLKK VR LEKK I +D+ I N I ++R  HSQLR 
Sbjct: 237 ELGKVKNTVDNLQSKLRVGLEIENHLKKRVRALEKKNIVADRLIVNGITDIRHHHSQLRA 296

Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDAD 359
           ++++ L E   +IKSIS  +EEK +     IR  N    Q + K DE ECRDVHI    +
Sbjct: 297 YIIDLLTEEGLYIKSISKNVEEKLKLHSSEIR--NVVPLQHDFKPDESECRDVHITTVVE 354

Query: 360 TNLVSQVC 367
           +  V+++ 
Sbjct: 355 SCQVTKLA 362


>gi|238479244|ref|NP_001154517.1| myosin heavy chain-related protein [Arabidopsis thaliana]
 gi|330251226|gb|AEC06320.1| myosin heavy chain-related protein [Arabidopsis thaliana]
          Length = 652

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 220/369 (59%), Positives = 268/369 (72%), Gaps = 9/369 (2%)

Query: 1   MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
           MD N  E EN    L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VA RMHFQRTA L
Sbjct: 1   MDLNLDENEN----LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVAARMHFQRTASL 56

Query: 61  EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
           EQEIE LK K+AAC RE  NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +
Sbjct: 57  EQEIESLKLKLAACTREKHNLQEELAEAYRVKAQLADLHAGEVAKNLEAEKQVRFFQGSV 116

Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
           AAAF+ERD SVMEAEKA+E  E+MSQK +E + RLEELSS+ +  K+ N TL+ DL K E
Sbjct: 117 AAAFSERDKSVMEAEKAEENAEMMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLE 176

Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
           E   ++  VI KFY+IR+ SL E LE S  +KCA LL D  E W+FND STS Y++ALE 
Sbjct: 177 EQTRTYAGVIEKFYDIRKASLCESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEG 236

Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
           EL K +++V+NLQSKLR+GLEIENHLKK VR LEKK I +D  I N I ++R  HSQLR 
Sbjct: 237 ELGKVKNTVDNLQSKLRVGLEIENHLKKRVRALEKKNIVADGLIVNGITDIRHHHSQLRA 296

Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDA 358
           +++  L E   +IKSIS  +EEK + H  +V   QN    Q + K DE ECRDVH+    
Sbjct: 297 YIIALLNEEGLYIKSISKNVEEKLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTTVV 353

Query: 359 DTNLVSQVC 367
           ++  V+++ 
Sbjct: 354 ESCQVTKLA 362


>gi|449479020|ref|XP_004155483.1| PREDICTED: uncharacterized LOC101211967 [Cucumis sativus]
          Length = 641

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 204/313 (65%), Positives = 253/313 (80%), Gaps = 1/313 (0%)

Query: 11  ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQK 70
           ++E L+ RIQQLE+ERDELRKDIEQLCMQQAGP YL+VATRMHFQRTAGLEQEIE LK+K
Sbjct: 7   DNEHLLTRIQQLEYERDELRKDIEQLCMQQAGPGYLSVATRMHFQRTAGLEQEIENLKKK 66

Query: 71  IAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130
            AAC REN NLQEEL+EAYRIK QLADLH++EV KN+E+EKQ+KFFQGC+A AFAERD+S
Sbjct: 67  FAACTRENHNLQEELAEAYRIKSQLADLHSSEVAKNLESEKQLKFFQGCVATAFAERDHS 126

Query: 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190
           +MEAEKAKEKEE MSQK  E + RLEELSS+ + L++ N TLR +L KQEE NE  K+VI
Sbjct: 127 IMEAEKAKEKEETMSQKLKEVENRLEELSSDCLNLRETNDTLRINLAKQEEWNEVSKQVI 186

Query: 191 NKFYEIRQQSLE-VLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
           NKFYEIRQ SL+  ++TSW++KC CLL DS+EMWSFND+ST +YI++LE+ LE  + +V+
Sbjct: 187 NKFYEIRQNSLDGFMDTSWDEKCQCLLHDSSEMWSFNDSSTFRYINSLEEGLETMKKTVD 246

Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
           NLQ+KLRMG+EIE HLK  V +LE K+IH D  + + I+     +SQ R H++N LE+  
Sbjct: 247 NLQNKLRMGVEIEKHLKGKVHDLELKLIHMDDMVKSKISGFCQNYSQYRDHILNLLEKEM 306

Query: 310 SHIKSISDVIEEK 322
           S++ S     EEK
Sbjct: 307 SNMHSTIGEFEEK 319


>gi|357480651|ref|XP_003610611.1| hypothetical protein MTR_5g005010 [Medicago truncatula]
 gi|355511946|gb|AES93569.1| hypothetical protein MTR_5g005010 [Medicago truncatula]
          Length = 586

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 204/356 (57%), Positives = 268/356 (75%), Gaps = 3/356 (0%)

Query: 1   MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
           M E   E  N  E+L+ RI+QLEHERDELRKDIEQLCMQQAGP+YL+VATRMHFQRTAGL
Sbjct: 1   MAEPEAEDMNHHESLLGRIRQLEHERDELRKDIEQLCMQQAGPAYLSVATRMHFQRTAGL 60

Query: 61  EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
           EQEI+ L+ K+AA   +  NLQ++LS+AYRIK QLADLHAAE+ KN+EAEKQ+KFFQGC+
Sbjct: 61  EQEIQTLQNKLAASTTDTLNLQDQLSQAYRIKAQLADLHAAELSKNIEAEKQLKFFQGCV 120

Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
           A AF+ERD++++EAEKAKE E+ MSQ+ + F  R+EEL+S   + K+ N  L+ D     
Sbjct: 121 ATAFSERDHAIIEAEKAKEMEDAMSQQIHGFLKRIEELTSVCCKQKELNYALQSDQAMYI 180

Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
           E NE FK+VINKF++IRQ S  E  +TSW+ K  CLL DS E+WSFND STSKYISALE+
Sbjct: 181 EQNEKFKKVINKFFQIRQYSQKECDDTSWDVKSTCLLDDSEELWSFNDASTSKYISALEE 240

Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
           +L+K  +SV+ LQSKLR+GLEIENHLKK +  +E+  I+ +K I N IA+L+  H + R 
Sbjct: 241 QLDKVNNSVDCLQSKLRVGLEIENHLKKRINLMEQNQIYMNKVIENGIADLKHHHFKYRD 300

Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHIN 355
           H++N L +G S IKS  +VI+E+ +  ++ I  +    +QR+ + +E ECRD HI+
Sbjct: 301 HIMNLLRDGESTIKSTINVIDERIRRFNENI--EPNLVHQRDKEREENECRDAHIS 354


>gi|218193111|gb|EEC75538.1| hypothetical protein OsI_12160 [Oryza sativa Indica Group]
          Length = 677

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 164/343 (47%), Positives = 227/343 (66%), Gaps = 7/343 (2%)

Query: 25  ERDELRKDIEQLCMQQ-AGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQE 83
           ERDELRKDIEQLCMQQ AGP Y++VATRM  QRTA LEQ+IE L++K+  C REN NLQE
Sbjct: 56  ERDELRKDIEQLCMQQQAGPGYVSVATRMLSQRTAALEQDIENLQKKLGGCLRENQNLQE 115

Query: 84  ELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEEL 143
           EL+EAYRIK QLA+LH AE+ KN E EKQV+FFQ  +A AFAERDNS+M+ EKAKE+EE 
Sbjct: 116 ELAEAYRIKSQLAELHGAELSKNKELEKQVRFFQSSVAQAFAERDNSLMKCEKAKEREEA 175

Query: 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV 203
           + QKF +F+ R  E  S   + K+ N  L+ +L + +E  +S  +VI KFY++R +  E 
Sbjct: 176 ILQKFAKFEERTREYQSSIDDQKRSNDALQMELMRLKEQTQSSLKVIQKFYDVRCRDSEC 235

Query: 204 -LETSWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELEKTRSSVENLQSKLRMGL 259
            +  + E+KC+ LL DSA+ WSF     TSTSKYI++LE+E +  R+ +  LQ+ LRMGL
Sbjct: 236 SVNITLEEKCSVLLDDSADNWSFSSDGGTSTSKYIASLEEEKDSLRAKIAKLQNNLRMGL 295

Query: 260 EIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVI 319
           EIE HL+++ R LEK+    D+F+ N ++ L+  H Q R  ++  LEE  S + ++ + I
Sbjct: 296 EIEQHLQRNARVLEKRQALYDEFMRNGLSTLQKLHIQQRDDIMKILEEESSQLSTVVNEI 355

Query: 320 EEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
           ++K       I  +       E +  +  C+DVH+  D    +
Sbjct: 356 QDKLSKIR--INPETNENPVGEMQCCDSSCKDVHVTTDVSPGI 396


>gi|40538897|gb|AAR87154.1| expressed protein [Oryza sativa Japonica Group]
 gi|108708972|gb|ABF96767.1| expressed protein [Oryza sativa Japonica Group]
 gi|222625180|gb|EEE59312.1| hypothetical protein OsJ_11369 [Oryza sativa Japonica Group]
          Length = 677

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 165/345 (47%), Positives = 228/345 (66%), Gaps = 11/345 (3%)

Query: 25  ERDELRKDIEQLCMQQ-AGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQE 83
           ERDELRKDIEQLCMQQ AGP Y++VATRM  QRTA LEQ+IE L++K+  C REN NLQE
Sbjct: 56  ERDELRKDIEQLCMQQQAGPGYVSVATRMLSQRTAALEQDIENLQKKLGGCLRENQNLQE 115

Query: 84  ELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEEL 143
           EL+EAYRIK QLA+LH AE+ KN E EKQV+FFQ  +A AFAERDNS+M+ EKAKE+EE 
Sbjct: 116 ELAEAYRIKSQLAELHGAELSKNKELEKQVRFFQSSVAQAFAERDNSLMKCEKAKEREEA 175

Query: 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV 203
           + QKF +F+ R  E  S   + K+ N  L+ +L + +E  +S  +VI KFY++R +  E 
Sbjct: 176 ILQKFAKFEERTREYQSSIDDQKRSNDALQMELMRLKEQTQSSLKVIQKFYDVRCRDSEC 235

Query: 204 -LETSWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELEKTRSSVENLQSKLRMGL 259
            +  + E+KC+ LL DSA+ WSF     TSTSKYI++LE+E +  R+ +  LQ+ LRMGL
Sbjct: 236 SVNITLEEKCSVLLDDSADNWSFSSDGGTSTSKYIASLEEEKDSLRAKIAKLQNNLRMGL 295

Query: 260 EIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVI 319
           EIE HL+++ R LEK+    D+F+ N ++ L+  H Q R  ++  LEE  S + ++ + I
Sbjct: 296 EIEQHLQRNARVLEKRQALYDEFMRNGLSTLQKLHIQQRDDIMKILEEESSQLSTVVNEI 355

Query: 320 EEKTQ--HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
           ++K      +  I     G    E +  +  C+DVH+  D    +
Sbjct: 356 QDKLSKIRINPEINENPVG----EMQCCDSSCKDVHVTTDVSPGI 396


>gi|357155720|ref|XP_003577215.1| PREDICTED: uncharacterized protein LOC100842072 [Brachypodium
           distachyon]
          Length = 641

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 170/376 (45%), Positives = 237/376 (63%), Gaps = 10/376 (2%)

Query: 7   EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
           EKE   E L  R+ +L+ ERDELRKDIEQLCMQQAGP Y++VATRM  QR A LEQ+IE 
Sbjct: 3   EKERTIEVLTLRVGELQRERDELRKDIEQLCMQQAGPGYVSVATRMLSQRNAALEQDIED 62

Query: 67  LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
           L++K+  C REN NLQEEL+E YR+K +LADL+ AE+ KN E E+QV+FFQ  +A AFAE
Sbjct: 63  LQKKLGGCLRENQNLQEELAEVYRVKSKLADLYGAELSKNKELEQQVRFFQSSVAQAFAE 122

Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
           RD S++E EK KE+EE + +KF EF+ R  E  S   + K+ N  L+ +L + +   ES 
Sbjct: 123 RDTSLLECEKVKEREEAVLKKFAEFEERTREYQSAMEDQKQLNDALQMELMELKAHTESS 182

Query: 187 KEVINKFYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
             VI KFYE+R +  E     S+E KC+ LL DSA+ WSFN   +TSTSKYI++LE+E  
Sbjct: 183 LNVILKFYEVRCRECECPSNISFEQKCSILLDDSADNWSFNSDGETSTSKYIASLEEENG 242

Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
             ++ +  LQS LRMG E+E HL+++ R L+KK    D F+ N ++ L+  ++  R  ++
Sbjct: 243 SLKAKITKLQSNLRMGFEVEQHLQRNARALQKKQALIDDFMRNGLSALQKFYTHQRAEIM 302

Query: 303 NSLEEGRSHIKSISDVIEEK-TQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADT 360
             LEE  S + +    I++K TQ H +  +     G    E +  +  C+DVH+N D   
Sbjct: 303 RILEEESSQLSTAVIEIQDKLTQIHINTEVIENPIG----EMQCCDSSCKDVHVNMDTGP 358

Query: 361 NLVSQVCLLVASAVNF 376
               +  + VA +  F
Sbjct: 359 GTSPKDDVPVAYSATF 374


>gi|147771163|emb|CAN60964.1| hypothetical protein VITISV_013873 [Vitis vinifera]
          Length = 655

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 160/261 (61%), Positives = 198/261 (75%), Gaps = 5/261 (1%)

Query: 109 AEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQ 168
           AEKQ+KFFQGC+AAAFAERD+S+MEAEKAKEKEEL  Q+FN FQ R++EL S+N++ K+ 
Sbjct: 27  AEKQIKFFQGCVAAAFAERDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQL 86

Query: 169 NATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN- 226
           N  L+ DL  QE+ NE+FK+VINKF+EIRQ SLE  E  +W+DKC CLL D  EMWSFN 
Sbjct: 87  NVALQIDLANQEDQNETFKKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNT 146

Query: 227 --DTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFIS 284
             +TSTSKYI ALE+ELE  R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S +   
Sbjct: 147 NEETSTSKYIDALEEELEMVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFK 206

Query: 285 NAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKL 344
           N I+ L   HSQ R+HVVN L+EG+SH+KSI DV+EEK +   D  R QN    QR+ KL
Sbjct: 207 NRISGLLHYHSQHRLHVVNLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKL 265

Query: 345 DEFECRDVHINNDADTNLVSQ 365
            E ECRDVH++   D N V++
Sbjct: 266 YENECRDVHVSIVGDHNSVAE 286


>gi|242035353|ref|XP_002465071.1| hypothetical protein SORBIDRAFT_01g031580 [Sorghum bicolor]
 gi|241918925|gb|EER92069.1| hypothetical protein SORBIDRAFT_01g031580 [Sorghum bicolor]
          Length = 661

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 162/356 (45%), Positives = 234/356 (65%), Gaps = 8/356 (2%)

Query: 7   EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
           EK +  E L  +++ L+ ERD LRKDIEQLCMQQAGP Y++VATRM  QRT  LEQ+IE 
Sbjct: 23  EKVDLVEVLTRQVEDLQRERDALRKDIEQLCMQQAGPGYVSVATRMLTQRTTALEQDIEN 82

Query: 67  LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
           L++K+  C REN NLQEEL+EAYRIK QLA+LH A + KN + EKQV+FFQ  +A AFAE
Sbjct: 83  LQKKLGGCLRENQNLQEELAEAYRIKSQLAELHGAALSKNKDLEKQVRFFQSSVAQAFAE 142

Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
           RD+S+ME EKAKE+EE +S+KF +F+ R+ E  S   E K  N  L+ +L + +   E  
Sbjct: 143 RDSSLMECEKAKEREEAVSRKFADFEERMREYQSAIDEQKYINDDLQRELTELKAHTELS 202

Query: 187 KEVINKFYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELE 242
            +VI KFY++R +  E   + ++E+KC+ LL DSA+ WSF     T+T KYI++LE+E E
Sbjct: 203 LKVILKFYDLRCRDSECSSSVTFEEKCSILLDDSADNWSFCSDGGTATLKYIASLEEEKE 262

Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
             ++ +  LQ+ LRMGLEIE HL+++ R LEK+    D F+ N ++EL+  ++  +  ++
Sbjct: 263 SLKAKISKLQNNLRMGLEIEQHLQRNARILEKRQALYDGFLRNGLSELQKFYTYQKAEIM 322

Query: 303 NSLEEGRSHIKSISDVIEEK-TQHCDDVIRGQNTGTYQRETKLDEFECRDVHINND 357
             LEE    +  +   I++K T+ C +    ++      ET+  +  C+DVH+  D
Sbjct: 323 KILEEESLQLSKVVAEIQDKLTEMCINTEVNEHPAD---ETQCCDSSCKDVHVTTD 375


>gi|326496427|dbj|BAJ94675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 627

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 160/376 (42%), Positives = 235/376 (62%), Gaps = 15/376 (3%)

Query: 7   EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
           EK  + E L  R+++L+ ERDELRKDIEQLCMQQAGP Y++VAT+M  QRT  LEQ+IEI
Sbjct: 3   EKPYDGEVLTRRVEELQRERDELRKDIEQLCMQQAGPGYVSVATQMLSQRTTALEQDIEI 62

Query: 67  LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
           L++K++ C REN NLQEEL+EAYR+K QLADL+ AE+ K  E E+QV+FFQ  +A AFAE
Sbjct: 63  LQKKLSGCLRENQNLQEELAEAYRVKSQLADLYGAELSKTKELEQQVRFFQSSVAQAFAE 122

Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
           RD S++E     E+EE   +    F+ R+ E  +   + K+ N  L+ +L + +   ES 
Sbjct: 123 RDGSLLE-----EREEAGLKVCATFEERMREYQTAMEDQKRLNDALQMELTELKAHTESS 177

Query: 187 KEVINKFYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
             VI KFYE+R +  E     ++E+KC+ LL DSA+ WSFN   +TSTSKYI++LE E E
Sbjct: 178 LNVIKKFYEVRYRECECPSNITFEEKCSILLDDSADNWSFNLDGETSTSKYIASLEQENE 237

Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
             ++ +  LQ+ LRM  EIE HL+++ R LEKK   +D  + N ++ L+  ++  R  ++
Sbjct: 238 SLKAKISKLQNNLRMSFEIEQHLQRNARTLEKKQALNDDLMRNGLSALQKVYTHQRAEIM 297

Query: 303 NSLEEGRSHIKSISDVIEEK-TQHCDDV-IRGQNTGTYQRETKLDEFECRDVHINNDADT 360
             LEE    + +  + I++K TQ C +  I G   G    E +  +  C+DVH+  D   
Sbjct: 298 KILEEESLLLSTAVNEIQDKLTQICFNAEIIGNPVG----EMQCCDSSCKDVHVTMDIGP 353

Query: 361 NLVSQVCLLVASAVNF 376
           + + +  +  + +  F
Sbjct: 354 DTIPKGDVPTSYSTTF 369


>gi|224060835|ref|XP_002300276.1| predicted protein [Populus trichocarpa]
 gi|222847534|gb|EEE85081.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 174/357 (48%), Positives = 220/357 (61%), Gaps = 38/357 (10%)

Query: 10  NESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQ 69
           N+ E+++ARIQQLEHER EL KDIEQLCMQQAGP YLA ATRMHFQRTAGLEQEIE LK+
Sbjct: 6   NDKESMIARIQQLEHERGELHKDIEQLCMQQAGPGYLAAATRMHFQRTAGLEQEIENLKK 65

Query: 70  KIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129
           ++AAC+R+N NLQEELSEAYRIK QLA LH AE  KNMEAEKQVKFFQGC+AAAFAERDN
Sbjct: 66  QLAACSRDNLNLQEELSEAYRIKTQLAKLHQAEAAKNMEAEKQVKFFQGCVAAAFAERDN 125

Query: 130 SVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEV 189
           S+MEAEKAKEKEE MSQKF E Q R+   S  +     +  +  F              +
Sbjct: 126 SIMEAEKAKEKEESMSQKFKEIQQRVIYQSKMSRLRLSRRLSTSFTRS---------GSI 176

Query: 190 INKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
           + +   I   ++ V       K   +++    + +          SALE+E+E  R+S++
Sbjct: 177 LLRDLRILVGTINVHGCYMTQKSCGVIMIPRLLNTLQ--------SALEEEVETLRNSLD 228

Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
            LQSK+R                   ++  DK +   I EL   HS  RV + + L + R
Sbjct: 229 KLQSKIR-------------------VVLWDKMVMEGITELHHYHSHHRVQITSLLSKER 269

Query: 310 SHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNLVSQV 366
           SHIKSI D++EEK +   DVI  Q+     R  +L E E RDVH++ DAD +L  +V
Sbjct: 270 SHIKSIIDMVEEKIKQF-DVISEQDLVPC-RVVRLQENEFRDVHMSTDADPDLAPKV 324


>gi|224105731|ref|XP_002313915.1| predicted protein [Populus trichocarpa]
 gi|222850323|gb|EEE87870.1| predicted protein [Populus trichocarpa]
          Length = 246

 Score =  253 bits (647), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 158/264 (59%), Positives = 185/264 (70%), Gaps = 32/264 (12%)

Query: 10  NESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQ 69
           N+ E+++ARIQQLEHE DEL KDIEQLCMQQAGP YLA ATRMHFQRTAGL QEIE LK+
Sbjct: 6   NDKESVVARIQQLEHECDELHKDIEQLCMQQAGPGYLAAATRMHFQRTAGLVQEIEKLKK 65

Query: 70  KIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129
           ++ AC R+N NLQEELSEAYRIK QLA+LH AE  KNMEAEKQVKFFQGC+AAAFAERD 
Sbjct: 66  QLVACTRDNLNLQEELSEAYRIKTQLAELHQAEAAKNMEAEKQVKFFQGCVAAAFAERDT 125

Query: 130 SVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEV 189
           S+MEAEKAKEKEE MSQKFNE Q RL      +    +  + L  DL  Q  +       
Sbjct: 126 SIMEAEKAKEKEESMSQKFNEIQQRL------STNFMRSGSILSRDLRIQVGM------- 172

Query: 190 INKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
           IN         + V  T+ + KC  ++     M    +T      SALE+E+E  R+S+E
Sbjct: 173 IN---------VRVCYTT-QKKCGVIM-----MLPLLNTLQ----SALEEEVEALRNSLE 213

Query: 250 NLQSKLRMGLEIENHLKKSVRELE 273
            LQSKL++GLEIENHLKK VRELE
Sbjct: 214 KLQSKLQVGLEIENHLKKKVRELE 237


>gi|39104599|dbj|BAC43422.2| unknown protein [Arabidopsis thaliana]
          Length = 491

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 113/216 (52%), Positives = 146/216 (67%), Gaps = 5/216 (2%)

Query: 143 LMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSL- 201
           +MSQK +E + RLEELSS+ +  K+ N TL+ DL K EE   ++  VI KFY+IR+ SL 
Sbjct: 1   MMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLEEQTRTYAGVIEKFYDIRKASLC 60

Query: 202 EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEI 261
           E LE S  +KCA LL D  E W+FND STS Y++ALE EL K +++V+NLQSKLR+GLEI
Sbjct: 61  ESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEGELGKVKNTVDNLQSKLRVGLEI 120

Query: 262 ENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVIEE 321
           ENHLKK VR LEK+ I +D  I N I ++R  HSQLR +++  L E   +IKSIS  +EE
Sbjct: 121 ENHLKKRVRALEKENIVADGLIVNGITDIRHHHSQLRAYIIALLNEEGLYIKSISKNVEE 180

Query: 322 KTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINN 356
           K + H  +V   QN    Q + K DE ECRDVH+  
Sbjct: 181 KLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTT 213


>gi|147771162|emb|CAN60963.1| hypothetical protein VITISV_013872 [Vitis vinifera]
          Length = 108

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 73/86 (84%), Positives = 80/86 (93%)

Query: 7  EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
          EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE 
Sbjct: 3  EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62

Query: 67 LKQKIAACARENSNLQEELSEAYRIK 92
          LK+K+AA  REN NLQEELSEAYRIK
Sbjct: 63 LKKKVAASTRENLNLQEELSEAYRIK 88


>gi|242073350|ref|XP_002446611.1| hypothetical protein SORBIDRAFT_06g018840 [Sorghum bicolor]
 gi|241937794|gb|EES10939.1| hypothetical protein SORBIDRAFT_06g018840 [Sorghum bicolor]
          Length = 711

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 64/99 (64%), Positives = 77/99 (77%)

Query: 7   EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
           EK +  E L  R++ L+ ERD LRKDIEQLCMQQAGP Y++VATRM  QRTA LEQ+IE 
Sbjct: 3   EKVDLVEVLTRRVEDLQRERDALRKDIEQLCMQQAGPGYVSVATRMLTQRTAALEQDIEN 62

Query: 67  LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIK 105
           L++K+  C REN NLQEEL+EAYRIK QLA+LH A + K
Sbjct: 63  LQKKLGGCLRENQNLQEELAEAYRIKSQLAELHGAALSK 101



 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 134/230 (58%), Gaps = 8/230 (3%)

Query: 133 EAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINK 192
           + EKAKE+EE +S+KF +F+ R+ E      E K  N  L+ +L + +   E   +VI K
Sbjct: 199 QCEKAKEREEAVSRKFADFEERMREYQLAIDEQKHLNDDLQRELTELKAHTELSLKVILK 258

Query: 193 FYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELEKTRSSV 248
           FY++R +  E   + ++E+KC+ LL D A+ WSF     TST KYI++LE E E  ++ +
Sbjct: 259 FYDLRCRDSECSSSVTFEEKCSILLDDCADNWSFCSDGGTSTLKYIASLEAEKESLKAKI 318

Query: 249 ENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEG 308
             LQS LRMGLEIE HL+++ R LEK+    D F+ N ++EL+  ++  +  ++  LEE 
Sbjct: 319 SKLQSNLRMGLEIEQHLQRNARILEKRQALYDGFLRNGLSELQKFYTYQKAEIMKILEEE 378

Query: 309 RSH-IKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINND 357
            S  IK ++ + ++ T+ C +    ++      E +  +  C+DVH+  D
Sbjct: 379 SSQLIKVVAGIQDKLTEICINTEVNEHPAN---EIQCCDSSCKDVHVTTD 425


>gi|167998604|ref|XP_001752008.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697106|gb|EDQ83443.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 933

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 98/274 (35%), Positives = 141/274 (51%), Gaps = 39/274 (14%)

Query: 14  ALMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIA 72
           ALM RI QLE ERDEL+KDIE +C+QQ+G    + V TRM  +R A LEQE+E  K+K +
Sbjct: 9   ALMRRIVQLEKERDELQKDIETICLQQSGVAGSIDVNTRMQARRAANLEQELETYKEKTS 68

Query: 73  ACARENSNLQEELSEAYRIKGQLADLHAA------------------------------- 101
              REN  LQ+ELSEAYR+K  L    AA                               
Sbjct: 69  LLTRENHGLQQELSEAYRLKYYLIIGTAAYDLQREFDCDFPQYLMLQTQITEAFKRAVEK 128

Query: 102 EVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSE 161
              +N + EK VKF+Q  +A AFAERD +++E E+ KE E+ M  +  +  +R EE   +
Sbjct: 129 AFFQNNQVEKDVKFYQSKVATAFAERDKALVEVERVKEVEKEMIAEVQQLNSRAEEAEKQ 188

Query: 162 NIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWE------DKCACL 215
             E ++Q   L+ D E + E  +   +V++KF+E+R      L+          DK   L
Sbjct: 189 LKEEEEQKLKLQHDYEVRTEQLDILHKVVDKFWELRGNVPVALDDGQPQDLQPLDKAQAL 248

Query: 216 LLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
           +L+S   W++   S    + AL+ EL+  R+  E
Sbjct: 249 ILESDVQWTYGG-SIQAAVEALQQELKVARTLAE 281


>gi|326496659|dbj|BAJ98356.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 314

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 75/144 (52%), Gaps = 6/144 (4%)

Query: 235 SALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCH 294
           ++LE E E  ++ +  LQ+ LRM  EIE HL+++ R LEKK   +D  + N ++ L+  +
Sbjct: 15  ASLEQENESLKAKISKLQNNLRMSFEIEQHLQRNARTLEKKQALNDDLMRNGLSALQKVY 74

Query: 295 SQLRVHVVNSLEEGRSHIKSISDVIEEK-TQHCDDV-IRGQNTGTYQRETKLDEFECRDV 352
           +  R  ++  LEE    + +  + I++K TQ C +  I G   G    E +  +  C+DV
Sbjct: 75  THQRAEIMKILEEESLLLSTAVNEIQDKLTQICFNAEIIGNPVG----EMQCCDSSCKDV 130

Query: 353 HINNDADTNLVSQVCLLVASAVNF 376
           H+  D   + + +  +  + +  F
Sbjct: 131 HVTMDIGPDTIPKGDVPTSYSTTF 154


>gi|414874086|tpg|DAA52643.1| TPA: hypothetical protein ZEAMMB73_102228 [Zea mays]
          Length = 418

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 77/142 (54%), Gaps = 6/142 (4%)

Query: 217 LDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKI 276
           +D +  W     +  +  ++LE+E E  ++ +  LQS LRMGLEIE HL+++ R LEK+ 
Sbjct: 79  VDVSSGWEGASGTVGQ--ASLEEEKESLKAKISKLQSNLRMGLEIEQHLQRNARILEKRQ 136

Query: 277 IHSDKFISNAIAELRLCHSQLRVHVVNSL-EEGRSHIKSISDVIEEKTQHCDDVIRGQNT 335
              D F+ N +++L+  ++  +  ++  L EE    +K ++++ ++ T+ C   I  + +
Sbjct: 137 ALYDGFLRNGLSKLQKFYTCQKAEIMKILEEESLRLMKVVAEIQDKLTEFC---INTEVS 193

Query: 336 GTYQRETKLDEFECRDVHINND 357
                + +  +  C+ VH+  D
Sbjct: 194 DHPADKIQCCDSSCKHVHVTKD 215


>gi|307106173|gb|EFN54420.1| hypothetical protein CHLNCDRAFT_135784 [Chlorella variabilis]
          Length = 643

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 43/177 (24%), Positives = 81/177 (45%), Gaps = 24/177 (13%)

Query: 24  HERDELRKDIEQLCMQQA------GPSYLA--VATRMHFQRTAGL--------------- 60
            ERD+L +D+E +C+  +      G S L   + +   F +T  +               
Sbjct: 25  RERDDLMRDLENMCLADSRGATFNGSSVLQERIYSTGTFDQTGAVCCAPGPALWPGGGRA 84

Query: 61  -EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGC 119
            ++E+   +  ++    E  NL+E+L E    K +    H A++ +    +K++ F+Q  
Sbjct: 85  QQKELSATRAALSGVTLERDNLREDLGEVKEAKRRAEAGHRAQLERAAALDKELAFYQAQ 144

Query: 120 MAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDL 176
            A   A+RD +  E E+ K +   + Q+  E  +R E  SS+  +L++Q A +R  L
Sbjct: 145 SARVMADRDRAAWEGEELKAQNLRLDQQLREASSRAETESSQRAQLERQLAEVRQQL 201


>gi|384253382|gb|EIE26857.1| hypothetical protein COCSUDRAFT_39833 [Coccomyxa subellipsoidea
           C-169]
          Length = 465

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 73/149 (48%), Gaps = 16/149 (10%)

Query: 17  ARIQQLEH-------ERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQ 69
           A+I+ LEH       ER++L  D+E LCMQ +G  +   A+ +  +R   +++E   L+ 
Sbjct: 19  AKIKHLEHQLSVLTAERNDLAADVESLCMQSSGDIF--SASSVLSERICNMQKETNKLRA 76

Query: 70  KIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129
           ++ A   E   L E+L      K  +      E  K+   E ++ F+Q   A   ++RD 
Sbjct: 77  QLEAVNAERDGLYEDLINHRTSKRTVDKQWRIEREKSERLESELAFYQSHSARTLSDRDK 136

Query: 130 SVMEAEKAKEKEELMSQKFNEFQTRLEEL 158
           +V EA      EEL +Q   E ++RL EL
Sbjct: 137 AVWEA------EELKAQLL-ETESRLSEL 158


>gi|346978055|gb|EGY21507.1| hypothetical protein VDAG_10489 [Verticillium dahliae VdLs.17]
          Length = 1476

 Score = 37.7 bits (86), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 88/174 (50%), Gaps = 11/174 (6%)

Query: 57   TAGLEQEIEILKQKIAACARENSNLQEELSEAYR--IKGQLADLHAAEVIKNMEAEKQVK 114
            TAG+E+EI+ L+ KI     E   +Q+   +A R  +  Q  D+  AEV +  +AEKQ  
Sbjct: 1001 TAGVEEEIQALQDKIMQVGGEKLRIQKANVDALRADVTSQSEDISNAEV-RKAKAEKQRV 1059

Query: 115  FFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQN-ATLR 173
              +   A A  E + +  + EK   + E   +K     +R+EE + E + +KK+  A L+
Sbjct: 1060 KLEKDHAKATKEIEAAARDLEKLDNEIENQGEKAESLTSRVEE-AQEALAVKKEELAALK 1118

Query: 174  FDLE-KQEELNESFK---EVINKFYEIRQQ--SLEVLETSWEDKCACLLLDSAE 221
             +L  K  ELNE+     E+ NK  E ++Q    E     W+DK + L+L + E
Sbjct: 1119 TELNGKTAELNETRAVEIEMRNKLEENQKQLNENEKRLRYWDDKLSKLVLQNVE 1172


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.313    0.127    0.339 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,028,631,077
Number of Sequences: 23463169
Number of extensions: 196926317
Number of successful extensions: 1162989
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 184
Number of HSP's successfully gapped in prelim test: 34421
Number of HSP's that attempted gapping in prelim test: 1026386
Number of HSP's gapped (non-prelim): 124682
length of query: 376
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 232
effective length of database: 8,980,499,031
effective search space: 2083475775192
effective search space used: 2083475775192
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 78 (34.7 bits)