BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017151
(376 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255583327|ref|XP_002532426.1| conserved hypothetical protein [Ricinus communis]
gi|223527875|gb|EEF29967.1| conserved hypothetical protein [Ricinus communis]
Length = 658
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 249/355 (70%), Positives = 286/355 (80%), Gaps = 2/355 (0%)
Query: 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKI 71
+E+L+ARIQQLEHERDEL KDIEQLCMQQAGPSYLAVAT+MHFQRTAGLEQEIE LK+K+
Sbjct: 8 NESLIARIQQLEHERDELHKDIEQLCMQQAGPSYLAVATKMHFQRTAGLEQEIENLKKKL 67
Query: 72 AACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSV 131
AAC R+N NLQEELSEAYRIK QLADLH AEV KN EAEKQ+KFFQGC+A+AFAERDNS+
Sbjct: 68 AACTRDNLNLQEELSEAYRIKSQLADLHNAEVAKNKEAEKQLKFFQGCVASAFAERDNSI 127
Query: 132 MEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVIN 191
MEAEKAKEKEELMSQKFNEFQ RLEEL+S+ +E K+QN L+ +L KQEE NES K+V N
Sbjct: 128 MEAEKAKEKEELMSQKFNEFQKRLEELTSDCLEQKRQNEALQINLAKQEEQNESLKKVAN 187
Query: 192 KFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVEN 250
KFYEIRQ SLE E SWEDKC LL DS EMWS+ND STS YISALE+ELE+ R S +N
Sbjct: 188 KFYEIRQHSLEGFEDASWEDKCTWLLHDSKEMWSYNDASTSNYISALEEELEQVRKSADN 247
Query: 251 LQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRS 310
LQSKLR+GLEIENHLKK VRELEKK I DK + N IA LR HS+ R H++N L EG+
Sbjct: 248 LQSKLRVGLEIENHLKKQVRELEKKQIQLDKMVMNGIAGLRHYHSEHRGHIMNLLNEGKL 307
Query: 311 HIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNLVSQ 365
H+KS D++EEK + QN QR L+E ECRDVHI+ND + L+S+
Sbjct: 308 HMKSTMDMLEEKIGETYGS-KEQNLRPSQRVIDLEENECRDVHISNDIGSALISE 361
>gi|296083035|emb|CBI22439.3| unnamed protein product [Vitis vinifera]
Length = 673
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 244/363 (67%), Positives = 291/363 (80%), Gaps = 5/363 (1%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE
Sbjct: 3 EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
LK+K+AA REN NLQEELSEAYRIK QLADLH AEV KN++AEKQ+KFFQGC+AAAFAE
Sbjct: 63 LKKKVAASTRENLNLQEELSEAYRIKSQLADLHNAEVSKNLDAEKQIKFFQGCVAAAFAE 122
Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
RD+S+MEAEKAKEKEEL Q+FN FQ R++EL S+N++ K+ N L+ DL QE+ NE+F
Sbjct: 123 RDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQLNVALQIDLANQEDQNETF 182
Query: 187 KEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
K+VINKF+EIRQ SLE E +W+DKC CLL D EMWSFN +TSTSKYI ALE+ELE
Sbjct: 183 KKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNTNEETSTSKYIDALEEELE 242
Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S + N I+ L HSQ R+HVV
Sbjct: 243 MVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFKNRISGLLHYHSQHRLHVV 302
Query: 303 NSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
N L+EG+SH+KSI DV+EEK + D R QN QR+ KL E ECRDVH++ D N
Sbjct: 303 NLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKLYENECRDVHVSIVGDHNS 361
Query: 363 VSQ 365
V++
Sbjct: 362 VAE 364
>gi|359475478|ref|XP_002264967.2| PREDICTED: uncharacterized protein LOC100251573 [Vitis vinifera]
Length = 721
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 244/363 (67%), Positives = 291/363 (80%), Gaps = 5/363 (1%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE
Sbjct: 3 EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
LK+K+AA REN NLQEELSEAYRIK QLADLH AEV KN++AEKQ+KFFQGC+AAAFAE
Sbjct: 63 LKKKVAASTRENLNLQEELSEAYRIKSQLADLHNAEVSKNLDAEKQIKFFQGCVAAAFAE 122
Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
RD+S+MEAEKAKEKEEL Q+FN FQ R++EL S+N++ K+ N L+ DL QE+ NE+F
Sbjct: 123 RDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQLNVALQIDLANQEDQNETF 182
Query: 187 KEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
K+VINKF+EIRQ SLE E +W+DKC CLL D EMWSFN +TSTSKYI ALE+ELE
Sbjct: 183 KKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNTNEETSTSKYIDALEEELE 242
Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S + N I+ L HSQ R+HVV
Sbjct: 243 MVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFKNRISGLLHYHSQHRLHVV 302
Query: 303 NSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
N L+EG+SH+KSI DV+EEK + D R QN QR+ KL E ECRDVH++ D N
Sbjct: 303 NLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKLYENECRDVHVSIVGDHNS 361
Query: 363 VSQ 365
V++
Sbjct: 362 VAE 364
>gi|356495819|ref|XP_003516769.1| PREDICTED: uncharacterized protein LOC100775191 [Glycine max]
Length = 653
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 221/349 (63%), Positives = 271/349 (77%), Gaps = 3/349 (0%)
Query: 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQK 70
+ E+L+ARIQQLE ERDELRKDIEQLCMQQAGP YLAVATRMHFQRTAGLEQEIE LK+K
Sbjct: 6 DDESLLARIQQLEQERDELRKDIEQLCMQQAGPGYLAVATRMHFQRTAGLEQEIESLKKK 65
Query: 71 IAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130
+AAC REN NLQEELSEAYRIKGQLADLH+AEV KNMEAEKQVKFFQGC+AAAF+ERD +
Sbjct: 66 LAACTRENLNLQEELSEAYRIKGQLADLHSAEVSKNMEAEKQVKFFQGCVAAAFSERDQA 125
Query: 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190
++EAEKAKEKEE M Q+ N R+EEL+S+ ++LK+ N L+ D + NE+ +VI
Sbjct: 126 IIEAEKAKEKEETMLQQINGIHKRVEELTSDCLKLKEFNDALQIDQAVHMKQNENCMKVI 185
Query: 191 NKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
NKF++IRQ SL+ E SW +KCACLL DS E+WSFND STSKYISALE++LE+ R+S++
Sbjct: 186 NKFFQIRQHSLKEFEDMSWNEKCACLLGDSEEVWSFNDASTSKYISALEEQLERLRNSMD 245
Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
LQ+KLR+GLEIENHLKK V LE K I DK I N+IA+L+ HS+ R ++N L +G
Sbjct: 246 YLQNKLRVGLEIENHLKKRVNALENKQISMDKVIENSIADLKHYHSKCRDEIMNLLGDGE 305
Query: 310 SHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDA 358
S IKSI + I+EK D + T QR+ + +E EC D+HI+ A
Sbjct: 306 SSIKSIINAIDEKVWSFD--LSTVPNLTPQRDAEPEESECADLHISPQA 352
>gi|15225987|ref|NP_179075.1| myosin heavy chain-related protein [Arabidopsis thaliana]
gi|334184229|ref|NP_001189526.1| myosin heavy chain-related protein [Arabidopsis thaliana]
gi|3810592|gb|AAC69374.1| unknown protein [Arabidopsis thaliana]
gi|330251225|gb|AEC06319.1| myosin heavy chain-related protein [Arabidopsis thaliana]
gi|330251227|gb|AEC06321.1| myosin heavy chain-related protein [Arabidopsis thaliana]
Length = 629
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 220/369 (59%), Positives = 268/369 (72%), Gaps = 9/369 (2%)
Query: 1 MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
MD N E EN L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VA RMHFQRTA L
Sbjct: 1 MDLNLDENEN----LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVAARMHFQRTASL 56
Query: 61 EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
EQEIE LK K+AAC RE NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +
Sbjct: 57 EQEIESLKLKLAACTREKHNLQEELAEAYRVKAQLADLHAGEVAKNLEAEKQVRFFQGSV 116
Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
AAAF+ERD SVMEAEKA+E E+MSQK +E + RLEELSS+ + K+ N TL+ DL K E
Sbjct: 117 AAAFSERDKSVMEAEKAEENAEMMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLE 176
Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
E ++ VI KFY+IR+ SL E LE S +KCA LL D E W+FND STS Y++ALE
Sbjct: 177 EQTRTYAGVIEKFYDIRKASLCESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEG 236
Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
EL K +++V+NLQSKLR+GLEIENHLKK VR LEKK I +D I N I ++R HSQLR
Sbjct: 237 ELGKVKNTVDNLQSKLRVGLEIENHLKKRVRALEKKNIVADGLIVNGITDIRHHHSQLRA 296
Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDA 358
+++ L E +IKSIS +EEK + H +V QN Q + K DE ECRDVH+
Sbjct: 297 YIIALLNEEGLYIKSISKNVEEKLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTTVV 353
Query: 359 DTNLVSQVC 367
++ V+++
Sbjct: 354 ESCQVTKLA 362
>gi|449438615|ref|XP_004137083.1| PREDICTED: uncharacterized protein LOC101211967 [Cucumis sativus]
Length = 641
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 206/313 (65%), Positives = 254/313 (81%), Gaps = 1/313 (0%)
Query: 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQK 70
++E L+ RIQQLE+ERDELRKDIEQLCMQQAGP YL+VATRMHFQRTAGLEQEIE LK+K
Sbjct: 7 DNEHLLTRIQQLEYERDELRKDIEQLCMQQAGPGYLSVATRMHFQRTAGLEQEIENLKKK 66
Query: 71 IAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130
AAC REN NLQEEL+EAYRIK QLADLH++EV KN+E+EKQ+KFFQGC+A AFAERD+S
Sbjct: 67 FAACTRENHNLQEELAEAYRIKSQLADLHSSEVAKNLESEKQLKFFQGCVATAFAERDHS 126
Query: 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190
+MEAEKAKEKEE MSQK E + RLEELSS+ + L++ N TLR +L KQEE NE K+VI
Sbjct: 127 IMEAEKAKEKEETMSQKLKEVENRLEELSSDCLNLRETNDTLRINLAKQEEWNEVSKQVI 186
Query: 191 NKFYEIRQQSLE-VLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
NKFYEIRQ SLE ++TSW++KC CLL DS+EMWSFND+ST +YI++LE+ LE + +V+
Sbjct: 187 NKFYEIRQNSLEGFMDTSWDEKCQCLLHDSSEMWSFNDSSTFRYINSLEEGLETMKKTVD 246
Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
NLQ+KLRMG+EIE HLK V +LE K+IH D + + I+ +SQ R H++N LE+
Sbjct: 247 NLQNKLRMGVEIEKHLKGKVHDLELKLIHMDDMVKSKISGFCQNYSQYRDHILNLLEKEM 306
Query: 310 SHIKSISDVIEEK 322
S++ S IEEK
Sbjct: 307 SNMHSTIGEIEEK 319
>gi|297831932|ref|XP_002883848.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297329688|gb|EFH60107.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 629
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 220/368 (59%), Positives = 269/368 (73%), Gaps = 7/368 (1%)
Query: 1 MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
MD N E EN L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VATRMHFQRTA L
Sbjct: 1 MDLNLDENEN----LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVATRMHFQRTASL 56
Query: 61 EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
EQEIE LK K+AAC RE NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +
Sbjct: 57 EQEIESLKLKLAACTREKHNLQEELAEAYRVKTQLADLHAGEVAKNLEAEKQVRFFQGSV 116
Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
AAAF+ERD SVMEAEKA+E E+MSQK E + RLEELSS+ + K+ N TL+ DL K E
Sbjct: 117 AAAFSERDKSVMEAEKAEENAEMMSQKLGEIEMRLEELSSDCLVQKRLNDTLQADLAKLE 176
Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
E ++ +VI KF +IR+ SL E LE S +KCA LL D E W+FND STS+Y++ALE
Sbjct: 177 EQTRTYAQVIEKFDDIRKASLCESLEMSLHNKCASLLDDPIESWTFNDPSTSEYVAALEG 236
Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
EL K +++V+NLQSKLR+GLEIENHLKK VR LEKK I +D+ I N I ++R HSQLR
Sbjct: 237 ELGKVKNTVDNLQSKLRVGLEIENHLKKRVRALEKKNIVADRLIVNGITDIRHHHSQLRA 296
Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDAD 359
++++ L E +IKSIS +EEK + IR N Q + K DE ECRDVHI +
Sbjct: 297 YIIDLLTEEGLYIKSISKNVEEKLKLHSSEIR--NVVPLQHDFKPDESECRDVHITTVVE 354
Query: 360 TNLVSQVC 367
+ V+++
Sbjct: 355 SCQVTKLA 362
>gi|238479244|ref|NP_001154517.1| myosin heavy chain-related protein [Arabidopsis thaliana]
gi|330251226|gb|AEC06320.1| myosin heavy chain-related protein [Arabidopsis thaliana]
Length = 652
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 220/369 (59%), Positives = 268/369 (72%), Gaps = 9/369 (2%)
Query: 1 MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
MD N E EN L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VA RMHFQRTA L
Sbjct: 1 MDLNLDENEN----LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVAARMHFQRTASL 56
Query: 61 EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
EQEIE LK K+AAC RE NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +
Sbjct: 57 EQEIESLKLKLAACTREKHNLQEELAEAYRVKAQLADLHAGEVAKNLEAEKQVRFFQGSV 116
Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
AAAF+ERD SVMEAEKA+E E+MSQK +E + RLEELSS+ + K+ N TL+ DL K E
Sbjct: 117 AAAFSERDKSVMEAEKAEENAEMMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLE 176
Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
E ++ VI KFY+IR+ SL E LE S +KCA LL D E W+FND STS Y++ALE
Sbjct: 177 EQTRTYAGVIEKFYDIRKASLCESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEG 236
Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
EL K +++V+NLQSKLR+GLEIENHLKK VR LEKK I +D I N I ++R HSQLR
Sbjct: 237 ELGKVKNTVDNLQSKLRVGLEIENHLKKRVRALEKKNIVADGLIVNGITDIRHHHSQLRA 296
Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDA 358
+++ L E +IKSIS +EEK + H +V QN Q + K DE ECRDVH+
Sbjct: 297 YIIALLNEEGLYIKSISKNVEEKLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTTVV 353
Query: 359 DTNLVSQVC 367
++ V+++
Sbjct: 354 ESCQVTKLA 362
>gi|449479020|ref|XP_004155483.1| PREDICTED: uncharacterized LOC101211967 [Cucumis sativus]
Length = 641
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 204/313 (65%), Positives = 253/313 (80%), Gaps = 1/313 (0%)
Query: 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQK 70
++E L+ RIQQLE+ERDELRKDIEQLCMQQAGP YL+VATRMHFQRTAGLEQEIE LK+K
Sbjct: 7 DNEHLLTRIQQLEYERDELRKDIEQLCMQQAGPGYLSVATRMHFQRTAGLEQEIENLKKK 66
Query: 71 IAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130
AAC REN NLQEEL+EAYRIK QLADLH++EV KN+E+EKQ+KFFQGC+A AFAERD+S
Sbjct: 67 FAACTRENHNLQEELAEAYRIKSQLADLHSSEVAKNLESEKQLKFFQGCVATAFAERDHS 126
Query: 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190
+MEAEKAKEKEE MSQK E + RLEELSS+ + L++ N TLR +L KQEE NE K+VI
Sbjct: 127 IMEAEKAKEKEETMSQKLKEVENRLEELSSDCLNLRETNDTLRINLAKQEEWNEVSKQVI 186
Query: 191 NKFYEIRQQSLE-VLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
NKFYEIRQ SL+ ++TSW++KC CLL DS+EMWSFND+ST +YI++LE+ LE + +V+
Sbjct: 187 NKFYEIRQNSLDGFMDTSWDEKCQCLLHDSSEMWSFNDSSTFRYINSLEEGLETMKKTVD 246
Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
NLQ+KLRMG+EIE HLK V +LE K+IH D + + I+ +SQ R H++N LE+
Sbjct: 247 NLQNKLRMGVEIEKHLKGKVHDLELKLIHMDDMVKSKISGFCQNYSQYRDHILNLLEKEM 306
Query: 310 SHIKSISDVIEEK 322
S++ S EEK
Sbjct: 307 SNMHSTIGEFEEK 319
>gi|357480651|ref|XP_003610611.1| hypothetical protein MTR_5g005010 [Medicago truncatula]
gi|355511946|gb|AES93569.1| hypothetical protein MTR_5g005010 [Medicago truncatula]
Length = 586
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 204/356 (57%), Positives = 268/356 (75%), Gaps = 3/356 (0%)
Query: 1 MDENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGL 60
M E E N E+L+ RI+QLEHERDELRKDIEQLCMQQAGP+YL+VATRMHFQRTAGL
Sbjct: 1 MAEPEAEDMNHHESLLGRIRQLEHERDELRKDIEQLCMQQAGPAYLSVATRMHFQRTAGL 60
Query: 61 EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCM 120
EQEI+ L+ K+AA + NLQ++LS+AYRIK QLADLHAAE+ KN+EAEKQ+KFFQGC+
Sbjct: 61 EQEIQTLQNKLAASTTDTLNLQDQLSQAYRIKAQLADLHAAELSKNIEAEKQLKFFQGCV 120
Query: 121 AAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180
A AF+ERD++++EAEKAKE E+ MSQ+ + F R+EEL+S + K+ N L+ D
Sbjct: 121 ATAFSERDHAIIEAEKAKEMEDAMSQQIHGFLKRIEELTSVCCKQKELNYALQSDQAMYI 180
Query: 181 ELNESFKEVINKFYEIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALED 239
E NE FK+VINKF++IRQ S E +TSW+ K CLL DS E+WSFND STSKYISALE+
Sbjct: 181 EQNEKFKKVINKFFQIRQYSQKECDDTSWDVKSTCLLDDSEELWSFNDASTSKYISALEE 240
Query: 240 ELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRV 299
+L+K +SV+ LQSKLR+GLEIENHLKK + +E+ I+ +K I N IA+L+ H + R
Sbjct: 241 QLDKVNNSVDCLQSKLRVGLEIENHLKKRINLMEQNQIYMNKVIENGIADLKHHHFKYRD 300
Query: 300 HVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHIN 355
H++N L +G S IKS +VI+E+ + ++ I + +QR+ + +E ECRD HI+
Sbjct: 301 HIMNLLRDGESTIKSTINVIDERIRRFNENI--EPNLVHQRDKEREENECRDAHIS 354
>gi|218193111|gb|EEC75538.1| hypothetical protein OsI_12160 [Oryza sativa Indica Group]
Length = 677
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 227/343 (66%), Gaps = 7/343 (2%)
Query: 25 ERDELRKDIEQLCMQQ-AGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQE 83
ERDELRKDIEQLCMQQ AGP Y++VATRM QRTA LEQ+IE L++K+ C REN NLQE
Sbjct: 56 ERDELRKDIEQLCMQQQAGPGYVSVATRMLSQRTAALEQDIENLQKKLGGCLRENQNLQE 115
Query: 84 ELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEEL 143
EL+EAYRIK QLA+LH AE+ KN E EKQV+FFQ +A AFAERDNS+M+ EKAKE+EE
Sbjct: 116 ELAEAYRIKSQLAELHGAELSKNKELEKQVRFFQSSVAQAFAERDNSLMKCEKAKEREEA 175
Query: 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV 203
+ QKF +F+ R E S + K+ N L+ +L + +E +S +VI KFY++R + E
Sbjct: 176 ILQKFAKFEERTREYQSSIDDQKRSNDALQMELMRLKEQTQSSLKVIQKFYDVRCRDSEC 235
Query: 204 -LETSWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELEKTRSSVENLQSKLRMGL 259
+ + E+KC+ LL DSA+ WSF TSTSKYI++LE+E + R+ + LQ+ LRMGL
Sbjct: 236 SVNITLEEKCSVLLDDSADNWSFSSDGGTSTSKYIASLEEEKDSLRAKIAKLQNNLRMGL 295
Query: 260 EIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVI 319
EIE HL+++ R LEK+ D+F+ N ++ L+ H Q R ++ LEE S + ++ + I
Sbjct: 296 EIEQHLQRNARVLEKRQALYDEFMRNGLSTLQKLHIQQRDDIMKILEEESSQLSTVVNEI 355
Query: 320 EEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
++K I + E + + C+DVH+ D +
Sbjct: 356 QDKLSKIR--INPETNENPVGEMQCCDSSCKDVHVTTDVSPGI 396
>gi|40538897|gb|AAR87154.1| expressed protein [Oryza sativa Japonica Group]
gi|108708972|gb|ABF96767.1| expressed protein [Oryza sativa Japonica Group]
gi|222625180|gb|EEE59312.1| hypothetical protein OsJ_11369 [Oryza sativa Japonica Group]
Length = 677
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 165/345 (47%), Positives = 228/345 (66%), Gaps = 11/345 (3%)
Query: 25 ERDELRKDIEQLCMQQ-AGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQE 83
ERDELRKDIEQLCMQQ AGP Y++VATRM QRTA LEQ+IE L++K+ C REN NLQE
Sbjct: 56 ERDELRKDIEQLCMQQQAGPGYVSVATRMLSQRTAALEQDIENLQKKLGGCLRENQNLQE 115
Query: 84 ELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEEL 143
EL+EAYRIK QLA+LH AE+ KN E EKQV+FFQ +A AFAERDNS+M+ EKAKE+EE
Sbjct: 116 ELAEAYRIKSQLAELHGAELSKNKELEKQVRFFQSSVAQAFAERDNSLMKCEKAKEREEA 175
Query: 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV 203
+ QKF +F+ R E S + K+ N L+ +L + +E +S +VI KFY++R + E
Sbjct: 176 ILQKFAKFEERTREYQSSIDDQKRSNDALQMELMRLKEQTQSSLKVIQKFYDVRCRDSEC 235
Query: 204 -LETSWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELEKTRSSVENLQSKLRMGL 259
+ + E+KC+ LL DSA+ WSF TSTSKYI++LE+E + R+ + LQ+ LRMGL
Sbjct: 236 SVNITLEEKCSVLLDDSADNWSFSSDGGTSTSKYIASLEEEKDSLRAKIAKLQNNLRMGL 295
Query: 260 EIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVI 319
EIE HL+++ R LEK+ D+F+ N ++ L+ H Q R ++ LEE S + ++ + I
Sbjct: 296 EIEQHLQRNARVLEKRQALYDEFMRNGLSTLQKLHIQQRDDIMKILEEESSQLSTVVNEI 355
Query: 320 EEKTQ--HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
++K + I G E + + C+DVH+ D +
Sbjct: 356 QDKLSKIRINPEINENPVG----EMQCCDSSCKDVHVTTDVSPGI 396
>gi|357155720|ref|XP_003577215.1| PREDICTED: uncharacterized protein LOC100842072 [Brachypodium
distachyon]
Length = 641
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 170/376 (45%), Positives = 237/376 (63%), Gaps = 10/376 (2%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EKE E L R+ +L+ ERDELRKDIEQLCMQQAGP Y++VATRM QR A LEQ+IE
Sbjct: 3 EKERTIEVLTLRVGELQRERDELRKDIEQLCMQQAGPGYVSVATRMLSQRNAALEQDIED 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
L++K+ C REN NLQEEL+E YR+K +LADL+ AE+ KN E E+QV+FFQ +A AFAE
Sbjct: 63 LQKKLGGCLRENQNLQEELAEVYRVKSKLADLYGAELSKNKELEQQVRFFQSSVAQAFAE 122
Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
RD S++E EK KE+EE + +KF EF+ R E S + K+ N L+ +L + + ES
Sbjct: 123 RDTSLLECEKVKEREEAVLKKFAEFEERTREYQSAMEDQKQLNDALQMELMELKAHTESS 182
Query: 187 KEVINKFYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
VI KFYE+R + E S+E KC+ LL DSA+ WSFN +TSTSKYI++LE+E
Sbjct: 183 LNVILKFYEVRCRECECPSNISFEQKCSILLDDSADNWSFNSDGETSTSKYIASLEEENG 242
Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
++ + LQS LRMG E+E HL+++ R L+KK D F+ N ++ L+ ++ R ++
Sbjct: 243 SLKAKITKLQSNLRMGFEVEQHLQRNARALQKKQALIDDFMRNGLSALQKFYTHQRAEIM 302
Query: 303 NSLEEGRSHIKSISDVIEEK-TQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADT 360
LEE S + + I++K TQ H + + G E + + C+DVH+N D
Sbjct: 303 RILEEESSQLSTAVIEIQDKLTQIHINTEVIENPIG----EMQCCDSSCKDVHVNMDTGP 358
Query: 361 NLVSQVCLLVASAVNF 376
+ + VA + F
Sbjct: 359 GTSPKDDVPVAYSATF 374
>gi|147771163|emb|CAN60964.1| hypothetical protein VITISV_013873 [Vitis vinifera]
Length = 655
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 160/261 (61%), Positives = 198/261 (75%), Gaps = 5/261 (1%)
Query: 109 AEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQ 168
AEKQ+KFFQGC+AAAFAERD+S+MEAEKAKEKEEL Q+FN FQ R++EL S+N++ K+
Sbjct: 27 AEKQIKFFQGCVAAAFAERDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQL 86
Query: 169 NATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN- 226
N L+ DL QE+ NE+FK+VINKF+EIRQ SLE E +W+DKC CLL D EMWSFN
Sbjct: 87 NVALQIDLANQEDQNETFKKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNT 146
Query: 227 --DTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFIS 284
+TSTSKYI ALE+ELE R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S +
Sbjct: 147 NEETSTSKYIDALEEELEMVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFK 206
Query: 285 NAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKL 344
N I+ L HSQ R+HVVN L+EG+SH+KSI DV+EEK + D R QN QR+ KL
Sbjct: 207 NRISGLLHYHSQHRLHVVNLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKL 265
Query: 345 DEFECRDVHINNDADTNLVSQ 365
E ECRDVH++ D N V++
Sbjct: 266 YENECRDVHVSIVGDHNSVAE 286
>gi|242035353|ref|XP_002465071.1| hypothetical protein SORBIDRAFT_01g031580 [Sorghum bicolor]
gi|241918925|gb|EER92069.1| hypothetical protein SORBIDRAFT_01g031580 [Sorghum bicolor]
Length = 661
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 162/356 (45%), Positives = 234/356 (65%), Gaps = 8/356 (2%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK + E L +++ L+ ERD LRKDIEQLCMQQAGP Y++VATRM QRT LEQ+IE
Sbjct: 23 EKVDLVEVLTRQVEDLQRERDALRKDIEQLCMQQAGPGYVSVATRMLTQRTTALEQDIEN 82
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
L++K+ C REN NLQEEL+EAYRIK QLA+LH A + KN + EKQV+FFQ +A AFAE
Sbjct: 83 LQKKLGGCLRENQNLQEELAEAYRIKSQLAELHGAALSKNKDLEKQVRFFQSSVAQAFAE 142
Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
RD+S+ME EKAKE+EE +S+KF +F+ R+ E S E K N L+ +L + + E
Sbjct: 143 RDSSLMECEKAKEREEAVSRKFADFEERMREYQSAIDEQKYINDDLQRELTELKAHTELS 202
Query: 187 KEVINKFYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELE 242
+VI KFY++R + E + ++E+KC+ LL DSA+ WSF T+T KYI++LE+E E
Sbjct: 203 LKVILKFYDLRCRDSECSSSVTFEEKCSILLDDSADNWSFCSDGGTATLKYIASLEEEKE 262
Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
++ + LQ+ LRMGLEIE HL+++ R LEK+ D F+ N ++EL+ ++ + ++
Sbjct: 263 SLKAKISKLQNNLRMGLEIEQHLQRNARILEKRQALYDGFLRNGLSELQKFYTYQKAEIM 322
Query: 303 NSLEEGRSHIKSISDVIEEK-TQHCDDVIRGQNTGTYQRETKLDEFECRDVHINND 357
LEE + + I++K T+ C + ++ ET+ + C+DVH+ D
Sbjct: 323 KILEEESLQLSKVVAEIQDKLTEMCINTEVNEHPAD---ETQCCDSSCKDVHVTTD 375
>gi|326496427|dbj|BAJ94675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 627
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 160/376 (42%), Positives = 235/376 (62%), Gaps = 15/376 (3%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK + E L R+++L+ ERDELRKDIEQLCMQQAGP Y++VAT+M QRT LEQ+IEI
Sbjct: 3 EKPYDGEVLTRRVEELQRERDELRKDIEQLCMQQAGPGYVSVATQMLSQRTTALEQDIEI 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
L++K++ C REN NLQEEL+EAYR+K QLADL+ AE+ K E E+QV+FFQ +A AFAE
Sbjct: 63 LQKKLSGCLRENQNLQEELAEAYRVKSQLADLYGAELSKTKELEQQVRFFQSSVAQAFAE 122
Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
RD S++E E+EE + F+ R+ E + + K+ N L+ +L + + ES
Sbjct: 123 RDGSLLE-----EREEAGLKVCATFEERMREYQTAMEDQKRLNDALQMELTELKAHTESS 177
Query: 187 KEVINKFYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
VI KFYE+R + E ++E+KC+ LL DSA+ WSFN +TSTSKYI++LE E E
Sbjct: 178 LNVIKKFYEVRYRECECPSNITFEEKCSILLDDSADNWSFNLDGETSTSKYIASLEQENE 237
Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
++ + LQ+ LRM EIE HL+++ R LEKK +D + N ++ L+ ++ R ++
Sbjct: 238 SLKAKISKLQNNLRMSFEIEQHLQRNARTLEKKQALNDDLMRNGLSALQKVYTHQRAEIM 297
Query: 303 NSLEEGRSHIKSISDVIEEK-TQHCDDV-IRGQNTGTYQRETKLDEFECRDVHINNDADT 360
LEE + + + I++K TQ C + I G G E + + C+DVH+ D
Sbjct: 298 KILEEESLLLSTAVNEIQDKLTQICFNAEIIGNPVG----EMQCCDSSCKDVHVTMDIGP 353
Query: 361 NLVSQVCLLVASAVNF 376
+ + + + + + F
Sbjct: 354 DTIPKGDVPTSYSTTF 369
>gi|224060835|ref|XP_002300276.1| predicted protein [Populus trichocarpa]
gi|222847534|gb|EEE85081.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 174/357 (48%), Positives = 220/357 (61%), Gaps = 38/357 (10%)
Query: 10 NESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQ 69
N+ E+++ARIQQLEHER EL KDIEQLCMQQAGP YLA ATRMHFQRTAGLEQEIE LK+
Sbjct: 6 NDKESMIARIQQLEHERGELHKDIEQLCMQQAGPGYLAAATRMHFQRTAGLEQEIENLKK 65
Query: 70 KIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129
++AAC+R+N NLQEELSEAYRIK QLA LH AE KNMEAEKQVKFFQGC+AAAFAERDN
Sbjct: 66 QLAACSRDNLNLQEELSEAYRIKTQLAKLHQAEAAKNMEAEKQVKFFQGCVAAAFAERDN 125
Query: 130 SVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEV 189
S+MEAEKAKEKEE MSQKF E Q R+ S + + + F +
Sbjct: 126 SIMEAEKAKEKEESMSQKFKEIQQRVIYQSKMSRLRLSRRLSTSFTRS---------GSI 176
Query: 190 INKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
+ + I ++ V K +++ + + SALE+E+E R+S++
Sbjct: 177 LLRDLRILVGTINVHGCYMTQKSCGVIMIPRLLNTLQ--------SALEEEVETLRNSLD 228
Query: 250 NLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309
LQSK+R ++ DK + I EL HS RV + + L + R
Sbjct: 229 KLQSKIR-------------------VVLWDKMVMEGITELHHYHSHHRVQITSLLSKER 269
Query: 310 SHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNLVSQV 366
SHIKSI D++EEK + DVI Q+ R +L E E RDVH++ DAD +L +V
Sbjct: 270 SHIKSIIDMVEEKIKQF-DVISEQDLVPC-RVVRLQENEFRDVHMSTDADPDLAPKV 324
>gi|224105731|ref|XP_002313915.1| predicted protein [Populus trichocarpa]
gi|222850323|gb|EEE87870.1| predicted protein [Populus trichocarpa]
Length = 246
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 158/264 (59%), Positives = 185/264 (70%), Gaps = 32/264 (12%)
Query: 10 NESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQ 69
N+ E+++ARIQQLEHE DEL KDIEQLCMQQAGP YLA ATRMHFQRTAGL QEIE LK+
Sbjct: 6 NDKESVVARIQQLEHECDELHKDIEQLCMQQAGPGYLAAATRMHFQRTAGLVQEIEKLKK 65
Query: 70 KIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129
++ AC R+N NLQEELSEAYRIK QLA+LH AE KNMEAEKQVKFFQGC+AAAFAERD
Sbjct: 66 QLVACTRDNLNLQEELSEAYRIKTQLAELHQAEAAKNMEAEKQVKFFQGCVAAAFAERDT 125
Query: 130 SVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEV 189
S+MEAEKAKEKEE MSQKFNE Q RL + + + L DL Q +
Sbjct: 126 SIMEAEKAKEKEESMSQKFNEIQQRL------STNFMRSGSILSRDLRIQVGM------- 172
Query: 190 INKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
IN + V T+ + KC ++ M +T SALE+E+E R+S+E
Sbjct: 173 IN---------VRVCYTT-QKKCGVIM-----MLPLLNTLQ----SALEEEVEALRNSLE 213
Query: 250 NLQSKLRMGLEIENHLKKSVRELE 273
LQSKL++GLEIENHLKK VRELE
Sbjct: 214 KLQSKLQVGLEIENHLKKKVRELE 237
>gi|39104599|dbj|BAC43422.2| unknown protein [Arabidopsis thaliana]
Length = 491
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/216 (52%), Positives = 146/216 (67%), Gaps = 5/216 (2%)
Query: 143 LMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSL- 201
+MSQK +E + RLEELSS+ + K+ N TL+ DL K EE ++ VI KFY+IR+ SL
Sbjct: 1 MMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLEEQTRTYAGVIEKFYDIRKASLC 60
Query: 202 EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEI 261
E LE S +KCA LL D E W+FND STS Y++ALE EL K +++V+NLQSKLR+GLEI
Sbjct: 61 ESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEGELGKVKNTVDNLQSKLRVGLEI 120
Query: 262 ENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVIEE 321
ENHLKK VR LEK+ I +D I N I ++R HSQLR +++ L E +IKSIS +EE
Sbjct: 121 ENHLKKRVRALEKENIVADGLIVNGITDIRHHHSQLRAYIIALLNEEGLYIKSISKNVEE 180
Query: 322 KTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINN 356
K + H +V QN Q + K DE ECRDVH+
Sbjct: 181 KLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTT 213
>gi|147771162|emb|CAN60963.1| hypothetical protein VITISV_013872 [Vitis vinifera]
Length = 108
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 73/86 (84%), Positives = 80/86 (93%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE
Sbjct: 3 EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIK 92
LK+K+AA REN NLQEELSEAYRIK
Sbjct: 63 LKKKVAASTRENLNLQEELSEAYRIK 88
>gi|242073350|ref|XP_002446611.1| hypothetical protein SORBIDRAFT_06g018840 [Sorghum bicolor]
gi|241937794|gb|EES10939.1| hypothetical protein SORBIDRAFT_06g018840 [Sorghum bicolor]
Length = 711
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 64/99 (64%), Positives = 77/99 (77%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK + E L R++ L+ ERD LRKDIEQLCMQQAGP Y++VATRM QRTA LEQ+IE
Sbjct: 3 EKVDLVEVLTRRVEDLQRERDALRKDIEQLCMQQAGPGYVSVATRMLTQRTAALEQDIEN 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIK 105
L++K+ C REN NLQEEL+EAYRIK QLA+LH A + K
Sbjct: 63 LQKKLGGCLRENQNLQEELAEAYRIKSQLAELHGAALSK 101
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 134/230 (58%), Gaps = 8/230 (3%)
Query: 133 EAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINK 192
+ EKAKE+EE +S+KF +F+ R+ E E K N L+ +L + + E +VI K
Sbjct: 199 QCEKAKEREEAVSRKFADFEERMREYQLAIDEQKHLNDDLQRELTELKAHTELSLKVILK 258
Query: 193 FYEIRQQSLEVLET-SWEDKCACLLLDSAEMWSF---NDTSTSKYISALEDELEKTRSSV 248
FY++R + E + ++E+KC+ LL D A+ WSF TST KYI++LE E E ++ +
Sbjct: 259 FYDLRCRDSECSSSVTFEEKCSILLDDCADNWSFCSDGGTSTLKYIASLEAEKESLKAKI 318
Query: 249 ENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEG 308
LQS LRMGLEIE HL+++ R LEK+ D F+ N ++EL+ ++ + ++ LEE
Sbjct: 319 SKLQSNLRMGLEIEQHLQRNARILEKRQALYDGFLRNGLSELQKFYTYQKAEIMKILEEE 378
Query: 309 RSH-IKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINND 357
S IK ++ + ++ T+ C + ++ E + + C+DVH+ D
Sbjct: 379 SSQLIKVVAGIQDKLTEICINTEVNEHPAN---EIQCCDSSCKDVHVTTD 425
>gi|167998604|ref|XP_001752008.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697106|gb|EDQ83443.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 933
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/274 (35%), Positives = 141/274 (51%), Gaps = 39/274 (14%)
Query: 14 ALMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIA 72
ALM RI QLE ERDEL+KDIE +C+QQ+G + V TRM +R A LEQE+E K+K +
Sbjct: 9 ALMRRIVQLEKERDELQKDIETICLQQSGVAGSIDVNTRMQARRAANLEQELETYKEKTS 68
Query: 73 ACARENSNLQEELSEAYRIKGQLADLHAA------------------------------- 101
REN LQ+ELSEAYR+K L AA
Sbjct: 69 LLTRENHGLQQELSEAYRLKYYLIIGTAAYDLQREFDCDFPQYLMLQTQITEAFKRAVEK 128
Query: 102 EVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSE 161
+N + EK VKF+Q +A AFAERD +++E E+ KE E+ M + + +R EE +
Sbjct: 129 AFFQNNQVEKDVKFYQSKVATAFAERDKALVEVERVKEVEKEMIAEVQQLNSRAEEAEKQ 188
Query: 162 NIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWE------DKCACL 215
E ++Q L+ D E + E + +V++KF+E+R L+ DK L
Sbjct: 189 LKEEEEQKLKLQHDYEVRTEQLDILHKVVDKFWELRGNVPVALDDGQPQDLQPLDKAQAL 248
Query: 216 LLDSAEMWSFNDTSTSKYISALEDELEKTRSSVE 249
+L+S W++ S + AL+ EL+ R+ E
Sbjct: 249 ILESDVQWTYGG-SIQAAVEALQQELKVARTLAE 281
>gi|326496659|dbj|BAJ98356.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 314
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 75/144 (52%), Gaps = 6/144 (4%)
Query: 235 SALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCH 294
++LE E E ++ + LQ+ LRM EIE HL+++ R LEKK +D + N ++ L+ +
Sbjct: 15 ASLEQENESLKAKISKLQNNLRMSFEIEQHLQRNARTLEKKQALNDDLMRNGLSALQKVY 74
Query: 295 SQLRVHVVNSLEEGRSHIKSISDVIEEK-TQHCDDV-IRGQNTGTYQRETKLDEFECRDV 352
+ R ++ LEE + + + I++K TQ C + I G G E + + C+DV
Sbjct: 75 THQRAEIMKILEEESLLLSTAVNEIQDKLTQICFNAEIIGNPVG----EMQCCDSSCKDV 130
Query: 353 HINNDADTNLVSQVCLLVASAVNF 376
H+ D + + + + + + F
Sbjct: 131 HVTMDIGPDTIPKGDVPTSYSTTF 154
>gi|414874086|tpg|DAA52643.1| TPA: hypothetical protein ZEAMMB73_102228 [Zea mays]
Length = 418
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 77/142 (54%), Gaps = 6/142 (4%)
Query: 217 LDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKI 276
+D + W + + ++LE+E E ++ + LQS LRMGLEIE HL+++ R LEK+
Sbjct: 79 VDVSSGWEGASGTVGQ--ASLEEEKESLKAKISKLQSNLRMGLEIEQHLQRNARILEKRQ 136
Query: 277 IHSDKFISNAIAELRLCHSQLRVHVVNSL-EEGRSHIKSISDVIEEKTQHCDDVIRGQNT 335
D F+ N +++L+ ++ + ++ L EE +K ++++ ++ T+ C I + +
Sbjct: 137 ALYDGFLRNGLSKLQKFYTCQKAEIMKILEEESLRLMKVVAEIQDKLTEFC---INTEVS 193
Query: 336 GTYQRETKLDEFECRDVHINND 357
+ + + C+ VH+ D
Sbjct: 194 DHPADKIQCCDSSCKHVHVTKD 215
>gi|307106173|gb|EFN54420.1| hypothetical protein CHLNCDRAFT_135784 [Chlorella variabilis]
Length = 643
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 81/177 (45%), Gaps = 24/177 (13%)
Query: 24 HERDELRKDIEQLCMQQA------GPSYLA--VATRMHFQRTAGL--------------- 60
ERD+L +D+E +C+ + G S L + + F +T +
Sbjct: 25 RERDDLMRDLENMCLADSRGATFNGSSVLQERIYSTGTFDQTGAVCCAPGPALWPGGGRA 84
Query: 61 -EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGC 119
++E+ + ++ E NL+E+L E K + H A++ + +K++ F+Q
Sbjct: 85 QQKELSATRAALSGVTLERDNLREDLGEVKEAKRRAEAGHRAQLERAAALDKELAFYQAQ 144
Query: 120 MAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDL 176
A A+RD + E E+ K + + Q+ E +R E SS+ +L++Q A +R L
Sbjct: 145 SARVMADRDRAAWEGEELKAQNLRLDQQLREASSRAETESSQRAQLERQLAEVRQQL 201
>gi|384253382|gb|EIE26857.1| hypothetical protein COCSUDRAFT_39833 [Coccomyxa subellipsoidea
C-169]
Length = 465
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 73/149 (48%), Gaps = 16/149 (10%)
Query: 17 ARIQQLEH-------ERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQ 69
A+I+ LEH ER++L D+E LCMQ +G + A+ + +R +++E L+
Sbjct: 19 AKIKHLEHQLSVLTAERNDLAADVESLCMQSSGDIF--SASSVLSERICNMQKETNKLRA 76
Query: 70 KIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129
++ A E L E+L K + E K+ E ++ F+Q A ++RD
Sbjct: 77 QLEAVNAERDGLYEDLINHRTSKRTVDKQWRIEREKSERLESELAFYQSHSARTLSDRDK 136
Query: 130 SVMEAEKAKEKEELMSQKFNEFQTRLEEL 158
+V EA EEL +Q E ++RL EL
Sbjct: 137 AVWEA------EELKAQLL-ETESRLSEL 158
>gi|346978055|gb|EGY21507.1| hypothetical protein VDAG_10489 [Verticillium dahliae VdLs.17]
Length = 1476
Score = 37.7 bits (86), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 88/174 (50%), Gaps = 11/174 (6%)
Query: 57 TAGLEQEIEILKQKIAACARENSNLQEELSEAYR--IKGQLADLHAAEVIKNMEAEKQVK 114
TAG+E+EI+ L+ KI E +Q+ +A R + Q D+ AEV + +AEKQ
Sbjct: 1001 TAGVEEEIQALQDKIMQVGGEKLRIQKANVDALRADVTSQSEDISNAEV-RKAKAEKQRV 1059
Query: 115 FFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQN-ATLR 173
+ A A E + + + EK + E +K +R+EE + E + +KK+ A L+
Sbjct: 1060 KLEKDHAKATKEIEAAARDLEKLDNEIENQGEKAESLTSRVEE-AQEALAVKKEELAALK 1118
Query: 174 FDLE-KQEELNESFK---EVINKFYEIRQQ--SLEVLETSWEDKCACLLLDSAE 221
+L K ELNE+ E+ NK E ++Q E W+DK + L+L + E
Sbjct: 1119 TELNGKTAELNETRAVEIEMRNKLEENQKQLNENEKRLRYWDDKLSKLVLQNVE 1172
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.313 0.127 0.339
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,028,631,077
Number of Sequences: 23463169
Number of extensions: 196926317
Number of successful extensions: 1162989
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 184
Number of HSP's successfully gapped in prelim test: 34421
Number of HSP's that attempted gapping in prelim test: 1026386
Number of HSP's gapped (non-prelim): 124682
length of query: 376
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 232
effective length of database: 8,980,499,031
effective search space: 2083475775192
effective search space used: 2083475775192
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 78 (34.7 bits)