BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029140
(198 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449450401|ref|XP_004142951.1| PREDICTED: uncharacterized protein LOC101203734 isoform 1 [Cucumis
sativus]
gi|449494502|ref|XP_004159563.1| PREDICTED: uncharacterized protein LOC101226204 isoform 1 [Cucumis
sativus]
Length = 231
Score = 299 bits (766), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 154/227 (67%), Positives = 175/227 (77%), Gaps = 31/227 (13%)
Query: 1 MSLNALVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQF 60
MSLNALVRLPLS+S+ ED +V+HS FS+RT KP+ RR L+V+AKGK+G+Q+R
Sbjct: 5 MSLNALVRLPLSNSKLLEDGVVRHSLFSARTMHKPH--HTYRRPLLVEAKGKKGMQSRLS 62
Query: 61 QRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYK 120
QRP PP LPKIEDDGNP+FVIFIRMANVYLWYPLS+I GGTTAKIMVAAKDNF+GKYIYK
Sbjct: 63 QRPGPPPLPKIEDDGNPKFVIFIRMANVYLWYPLSLINGGTTAKIMVAAKDNFVGKYIYK 122
Query: 121 DTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIELP 151
DTLARNLAAVIYR ENGNLR+ALST+DVIELP
Sbjct: 123 DTLARNLAAVIYRDEKEIQKTAFKQFRVLRTATEFRYGYKLVENGNLRSALSTTDVIELP 182
Query: 152 TPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAKI 198
T D+LKTVLDKVKDFFGDAK+SFGK+TAL S EES+ENS EKAK+
Sbjct: 183 TQDQLKTVLDKVKDFFGDAKDSFGKLTALGSPEGEESDENSTEKAKV 229
>gi|449450403|ref|XP_004142952.1| PREDICTED: uncharacterized protein LOC101203734 isoform 2 [Cucumis
sativus]
gi|449494506|ref|XP_004159564.1| PREDICTED: uncharacterized protein LOC101226204 isoform 2 [Cucumis
sativus]
Length = 228
Score = 298 bits (763), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 154/226 (68%), Positives = 174/226 (76%), Gaps = 31/226 (13%)
Query: 1 MSLNALVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQF 60
MSLNALVRLPLS+S+ ED +V+HS FS+RT KP+ RR L+V+AKGK+G+Q+R
Sbjct: 5 MSLNALVRLPLSNSKLLEDGVVRHSLFSARTMHKPH--HTYRRPLLVEAKGKKGMQSRLS 62
Query: 61 QRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYK 120
QRP PP LPKIEDDGNP+FVIFIRMANVYLWYPLS+I GGTTAKIMVAAKDNF+GKYIYK
Sbjct: 63 QRPGPPPLPKIEDDGNPKFVIFIRMANVYLWYPLSLINGGTTAKIMVAAKDNFVGKYIYK 122
Query: 121 DTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIELP 151
DTLARNLAAVIYR ENGNLR+ALST+DVIELP
Sbjct: 123 DTLARNLAAVIYRDEKEIQKTAFKQFRVLRTATEFRYGYKLVENGNLRSALSTTDVIELP 182
Query: 152 TPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAK 197
T D+LKTVLDKVKDFFGDAK+SFGK+TAL S EES+ENS EKAK
Sbjct: 183 TQDQLKTVLDKVKDFFGDAKDSFGKLTALGSPEGEESDENSTEKAK 228
>gi|255547908|ref|XP_002515011.1| oligopeptidase, putative [Ricinus communis]
gi|223546062|gb|EEF47565.1| oligopeptidase, putative [Ricinus communis]
Length = 233
Score = 296 bits (759), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 154/229 (67%), Positives = 179/229 (78%), Gaps = 33/229 (14%)
Query: 1 MSLNALVRLPLSSSR--THEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQAR 58
+SLNALVRLPLS+S HED + KH+ FS+RT ++Q +LVV+AKGKRG+Q+R
Sbjct: 5 LSLNALVRLPLSNSSRIHHEDGVAKHTLFSTRTVTLQKSLKKQT-LLVVKAKGKRGMQSR 63
Query: 59 QFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYI 118
QFQRPP PSLPKIEDDGNP+F+IFIRMANVYLWYPLS++TGGTTAKIMVAAKDNFLGKYI
Sbjct: 64 QFQRPPAPSLPKIEDDGNPKFLIFIRMANVYLWYPLSLVTGGTTAKIMVAAKDNFLGKYI 123
Query: 119 YKDTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIE 149
YK+TL RNLAAVIYR ENGN+RAALST+DVIE
Sbjct: 124 YKNTLDRNLAAVIYRDEKEIQKTAFKQFRVLRSATEFRYGYKLVENGNVRAALSTTDVIE 183
Query: 150 LPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAKI 198
LPTPD+LKTV+DKVKDFFGDAKESFGK+TAL+ST ++E EE SKEKAK+
Sbjct: 184 LPTPDQLKTVVDKVKDFFGDAKESFGKLTALDST-TDEPEEVSKEKAKV 231
>gi|334183724|ref|NP_001185346.1| uncharacterized protein [Arabidopsis thaliana]
gi|332196564|gb|AEE34685.1| uncharacterized protein [Arabidopsis thaliana]
Length = 229
Score = 288 bits (737), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 147/226 (65%), Positives = 170/226 (75%), Gaps = 30/226 (13%)
Query: 1 MSLNALVRLPLSSSRTHEDV-LVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQ 59
MSLNAL RLPL ++ E+V L +HS FSSRT + +++R V VV+AKGK+G+ ARQ
Sbjct: 5 MSLNALTRLPLKNTGRFEEVGLARHSLFSSRTACRETAVQQRRMVFVVEAKGKKGMAARQ 64
Query: 60 FQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIY 119
+QR PPP +PKIEDDGNPRFVIFIRMANVYLWYPLSII GGTTAKIMVAAKDN LGKYIY
Sbjct: 65 YQRTPPP-MPKIEDDGNPRFVIFIRMANVYLWYPLSIIAGGTTAKIMVAAKDNLLGKYIY 123
Query: 120 KDTLARNLAAVIYR----------------------------ENGNLRAALSTSDVIELP 151
KDT+ARN+AAVIYR ENGN+RAALSTSDVIELP
Sbjct: 124 KDTIARNIAAVIYRDEKEIQKTAIKQHRVLRTATEFRYGYKLENGNMRAALSTSDVIELP 183
Query: 152 TPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAK 197
T D+LKTV DKVKD+FGDAKESFGK+++LN S E++EE S EKAK
Sbjct: 184 TQDQLKTVFDKVKDYFGDAKESFGKLSSLNPGSDEKTEETSDEKAK 229
>gi|30697554|ref|NP_850972.1| uncharacterized protein [Arabidopsis thaliana]
gi|15215606|gb|AAK91348.1| At1g67700/F12A21_30 [Arabidopsis thaliana]
gi|21435985|gb|AAM51570.1| At1g67700/F12A21_30 [Arabidopsis thaliana]
gi|332196562|gb|AEE34683.1| uncharacterized protein [Arabidopsis thaliana]
Length = 230
Score = 288 bits (737), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 147/227 (64%), Positives = 170/227 (74%), Gaps = 31/227 (13%)
Query: 1 MSLNALVRLPLSSSRTHEDV-LVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQ 59
MSLNAL RLPL ++ E+V L +HS FSSRT + +++R V VV+AKGK+G+ ARQ
Sbjct: 5 MSLNALTRLPLKNTGRFEEVGLARHSLFSSRTACRETAVQQRRMVFVVEAKGKKGMAARQ 64
Query: 60 FQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIY 119
+QR PPP +PKIEDDGNPRFVIFIRMANVYLWYPLSII GGTTAKIMVAAKDN LGKYIY
Sbjct: 65 YQRTPPP-MPKIEDDGNPRFVIFIRMANVYLWYPLSIIAGGTTAKIMVAAKDNLLGKYIY 123
Query: 120 KDTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIEL 150
KDT+ARN+AAVIYR ENGN+RAALSTSDVIEL
Sbjct: 124 KDTIARNIAAVIYRDEKEIQKTAIKQHRVLRTATEFRYGYKLVENGNMRAALSTSDVIEL 183
Query: 151 PTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAK 197
PT D+LKTV DKVKD+FGDAKESFGK+++LN S E++EE S EKAK
Sbjct: 184 PTQDQLKTVFDKVKDYFGDAKESFGKLSSLNPGSDEKTEETSDEKAK 230
>gi|30697556|ref|NP_564903.3| uncharacterized protein [Arabidopsis thaliana]
gi|21554073|gb|AAM63154.1| unknown [Arabidopsis thaliana]
gi|222423377|dbj|BAH19661.1| AT1G67700 [Arabidopsis thaliana]
gi|332196563|gb|AEE34684.1| uncharacterized protein [Arabidopsis thaliana]
Length = 231
Score = 288 bits (737), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 147/227 (64%), Positives = 170/227 (74%), Gaps = 31/227 (13%)
Query: 1 MSLNALVRLPLSSSRTHEDV-LVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQ 59
MSLNAL RLPL ++ E+V L +HS FSSRT + +++R V VV+AKGK+G+ ARQ
Sbjct: 5 MSLNALTRLPLKNTGRFEEVGLARHSLFSSRTACRETAVQQRRMVFVVEAKGKKGMAARQ 64
Query: 60 FQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIY 119
+QR PPP +PKIEDDGNPRFVIFIRMANVYLWYPLSII GGTTAKIMVAAKDN LGKYIY
Sbjct: 65 YQRTPPP-MPKIEDDGNPRFVIFIRMANVYLWYPLSIIAGGTTAKIMVAAKDNLLGKYIY 123
Query: 120 KDTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIEL 150
KDT+ARN+AAVIYR ENGN+RAALSTSDVIEL
Sbjct: 124 KDTIARNIAAVIYRDEKEIQKTAIKQHRVLRTATEFRYGYKLVENGNMRAALSTSDVIEL 183
Query: 151 PTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAK 197
PT D+LKTV DKVKD+FGDAKESFGK+++LN S E++EE S EKAK
Sbjct: 184 PTQDQLKTVFDKVKDYFGDAKESFGKLSSLNPGSDEKTEETSDEKAK 230
>gi|359473697|ref|XP_002272215.2| PREDICTED: uncharacterized protein LOC100256260 [Vitis vinifera]
Length = 229
Score = 288 bits (736), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/228 (66%), Positives = 175/228 (76%), Gaps = 36/228 (15%)
Query: 1 MSLNA-LVRLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQ 59
+SLNA LVR PLS+SR ED +KHS FS+RT K +++R LVV+AKGKRG+Q RQ
Sbjct: 6 LSLNAALVRTPLSNSRASEDGFIKHSIFSTRTVPK----SQKKRALVVEAKGKRGMQGRQ 61
Query: 60 FQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIY 119
FQR PPP LPKIEDDGNP+FVIFIRMANVYLWYPLS+ITGGTTAKIMVAAKDNFLGKYIY
Sbjct: 62 FQRQPPP-LPKIEDDGNPKFVIFIRMANVYLWYPLSLITGGTTAKIMVAAKDNFLGKYIY 120
Query: 120 KDTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIEL 150
+DT+ARNLAAVIY+ EN N+R+ALST+DVIEL
Sbjct: 121 RDTIARNLAAVIYKDEKEIQKTAFRQYRVLRSAKEFRYGYKLVENNNVRSALSTTDVIEL 180
Query: 151 PTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAKI 198
PT +LKTVLDKV+DFFGDAKESFGK+TALNST +EESE+ S EK+K+
Sbjct: 181 PTQGELKTVLDKVRDFFGDAKESFGKLTALNST-TEESEQMSNEKSKV 227
>gi|224107689|ref|XP_002314565.1| predicted protein [Populus trichocarpa]
gi|118487840|gb|ABK95743.1| unknown [Populus trichocarpa]
gi|222863605|gb|EEF00736.1| predicted protein [Populus trichocarpa]
Length = 237
Score = 288 bits (736), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/234 (68%), Positives = 178/234 (76%), Gaps = 41/234 (17%)
Query: 1 MSLNALVRLPLSSSRT---HEDV--LVKHSPFSSR--TTQ-KPNGRERQRRVLVVQAKGK 52
MSLNALVRLP SSRT HED L+KH+ FS+R T Q P + + +LVV+AKGK
Sbjct: 5 MSLNALVRLP--SSRTMLLHEDGGGLLKHTLFSTRKSTAQTSPKKQGQGHMLLVVKAKGK 62
Query: 53 RGLQARQFQRPPPP-SLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKD 111
+G+Q RQFQRPPP SLPKIEDDGNP+F+IFIR+ANVYLWYPLS+ITGGTTAKIMVAAKD
Sbjct: 63 KGMQTRQFQRPPPTLSLPKIEDDGNPKFLIFIRVANVYLWYPLSLITGGTTAKIMVAAKD 122
Query: 112 NFLGKYIYKDTLARNLAAVIYR-----------------------------ENGNLRAAL 142
NFLGKYIYKDTLARNLAAVIY+ ENGNLRAAL
Sbjct: 123 NFLGKYIYKDTLARNLAAVIYKDEKEIQKTAFKQHRVLRSATEFRYGYKLVENGNLRAAL 182
Query: 143 STSDVIELPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKA 196
S +DVIELPT DKLKTVLDKVKDFFGDAKESFGK+TALNST++EE EE SKEKA
Sbjct: 183 SITDVIELPTQDKLKTVLDKVKDFFGDAKESFGKLTALNSTATEE-EEVSKEKA 235
>gi|297838515|ref|XP_002887139.1| F12A21.16 [Arabidopsis lyrata subsp. lyrata]
gi|297332980|gb|EFH63398.1| F12A21.16 [Arabidopsis lyrata subsp. lyrata]
Length = 985
Score = 279 bits (714), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 144/230 (62%), Positives = 168/230 (73%), Gaps = 35/230 (15%)
Query: 1 MSLNALVRLPLSSSRTHEDV-LVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQ 59
MSLNAL RLPL ++ E+V LV+ S FSSRT + +++R V VV+AKGK+G+ ARQ
Sbjct: 689 MSLNALTRLPLKNTGRFEEVGLVRQSLFSSRTACRETAVQQRRMVFVVEAKGKKGMAARQ 748
Query: 60 FQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIY 119
+QR PPP +PKIEDDGNPRFVIFIRMANVYLWYPLSII GGTTAKIMVAAKDN LGKYIY
Sbjct: 749 YQRTPPP-MPKIEDDGNPRFVIFIRMANVYLWYPLSIIAGGTTAKIMVAAKDNLLGKYIY 807
Query: 120 KDTLARNLAAVIYR---------------------------------ENGNLRAALSTSD 146
KDT+ARN+AAVIYR ENGN+RAALSTSD
Sbjct: 808 KDTIARNIAAVIYRVNYTSCFLFGLIPLFNMMRKRFKRQQLSSTVSCENGNMRAALSTSD 867
Query: 147 VIELPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKA 196
VIELPT D+LKTV DKVKD+FGDAKESFGK+T+LN + +++EE EKA
Sbjct: 868 VIELPTQDQLKTVFDKVKDYFGDAKESFGKLTSLNPGTDQKTEETPDEKA 917
>gi|351723561|ref|NP_001237795.1| uncharacterized protein LOC100306628 [Glycine max]
gi|255629119|gb|ACU14904.1| unknown [Glycine max]
Length = 239
Score = 277 bits (708), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 154/233 (66%), Positives = 179/233 (76%), Gaps = 35/233 (15%)
Query: 1 MSLNALVRLPLSSSRTHED---VLVKHSPFSSRTTQKPNGRER---QRRVLVVQAKGKRG 54
MSLNALVRLPLS+SR H+D ++++HS FSSR Q+ + + +V VV+AKGK+G
Sbjct: 5 MSLNALVRLPLSNSRFHDDAAPMIIRHSLFSSRKQQQQQQSYKVPQRHQVFVVEAKGKKG 64
Query: 55 LQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFL 114
+ +RQFQR PP LPKIEDDGNP+FVIFIRMANVYLWYPLSI++GGTTAKIMVAAKDNFL
Sbjct: 65 MMSRQFQRNAPPPLPKIEDDGNPKFVIFIRMANVYLWYPLSIVSGGTTAKIMVAAKDNFL 124
Query: 115 GKYIYKDTLARNLAAVIYR-----------------------------ENGNLRAALSTS 145
GKYIYKDTL RNLAAVIYR ENGN+RAALST+
Sbjct: 125 GKYIYKDTLDRNLAAVIYRDEKEVQKSAFKQYRVLRTATDFRYGYKLVENGNIRAALSTT 184
Query: 146 DVIELPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAKI 198
DVIELPT DKLKTVLDKVKDFFGDAKESFGKIT+L +T +EESEE++KEKAK+
Sbjct: 185 DVIELPTQDKLKTVLDKVKDFFGDAKESFGKITSLGTTETEESEEDTKEKAKV 237
>gi|11072026|gb|AAG28905.1|AC008113_21 F12A21.16 [Arabidopsis thaliana]
Length = 998
Score = 267 bits (682), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/249 (57%), Positives = 165/249 (66%), Gaps = 54/249 (21%)
Query: 3 LNALV--RLPLSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQF 60
LN LV L++ R E L +HS FSSRT + +++R V VV+AKGK+G+ ARQ+
Sbjct: 690 LNKLVVSESSLNTGRFEEVGLARHSLFSSRTACRETAVQQRRMVFVVEAKGKKGMAARQY 749
Query: 61 QRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYK 120
QR PPP +PKIEDDGNPRFVIFIRMANVYLWYPLSII GGTTAKIMVAAKDN LGKYIYK
Sbjct: 750 QRTPPP-MPKIEDDGNPRFVIFIRMANVYLWYPLSIIAGGTTAKIMVAAKDNLLGKYIYK 808
Query: 121 DTLARNLAAVIYR----------------------------------------------- 133
DT+ARN+AAVIYR
Sbjct: 809 DTIARNIAAVIYRVNYTSFCLFLCDVISNCFYLQQDEKEIQKTAIKQHRVLRTATEFRYG 868
Query: 134 ----ENGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESE 189
ENGN+RAALSTSDVIELPT D+LKTV DKVKD+FGDAKESFGK+++LN S E++E
Sbjct: 869 YKLVENGNMRAALSTSDVIELPTQDQLKTVFDKVKDYFGDAKESFGKLSSLNPGSDEKTE 928
Query: 190 ENSKEKAKI 198
E S EKAKI
Sbjct: 929 ETSDEKAKI 937
>gi|357463123|ref|XP_003601843.1| hypothetical protein MTR_3g086040 [Medicago truncatula]
gi|355490891|gb|AES72094.1| hypothetical protein MTR_3g086040 [Medicago truncatula]
gi|388521481|gb|AFK48802.1| unknown [Medicago truncatula]
Length = 227
Score = 260 bits (664), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/228 (61%), Positives = 165/228 (72%), Gaps = 36/228 (15%)
Query: 1 MSLNALVRLPLSSS--RTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQAR 58
MSLNAL+ PLSSS R H D L RT + + + VLVV+AK K+G+ +R
Sbjct: 5 MSLNALLHPPLSSSSSRFHNDSLFSMPRQFPRTQR----QHQHHHVLVVEAKSKKGMMSR 60
Query: 59 QFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYI 118
Q+QR PPP LPKIEDDGNP+FV+FIRMANVYLWYPLSI++GGTTAKIMVAAKDNFLGK+I
Sbjct: 61 QYQRQPPPPLPKIEDDGNPKFVVFIRMANVYLWYPLSIVSGGTTAKIMVAAKDNFLGKFI 120
Query: 119 YKDTLARNLAAVIYR-----------------------------ENGNLRAALSTSDVIE 149
YKDTL RNLAAVIYR ENGN++AALSTSDVI+
Sbjct: 121 YKDTLDRNLAAVIYRDEKEIQKSAFKQHRVLKSATEFRYGYKLVENGNVKAALSTSDVIQ 180
Query: 150 LPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAK 197
LPTPDKLKTV DKVKDFFGD KESFG+IT+L T+++ESE++SKEK K
Sbjct: 181 LPTPDKLKTVADKVKDFFGDVKESFGEITSL-VTATDESEDDSKEKTK 227
>gi|226505412|ref|NP_001141155.1| uncharacterized protein LOC100273241 [Zea mays]
gi|194690160|gb|ACF79164.1| unknown [Zea mays]
gi|194702952|gb|ACF85560.1| unknown [Zea mays]
gi|413937632|gb|AFW72183.1| hypothetical protein ZEAMMB73_079373 [Zea mays]
Length = 249
Score = 205 bits (521), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 110/177 (62%), Positives = 124/177 (70%), Gaps = 34/177 (19%)
Query: 37 GRERQRRVLVVQAKGKRGLQARQFQRPP-PPSLPKIEDDGNPRFVIFIRMANVYLWYPLS 95
GR+ RR LVV+A+G RG RQ Q+ P LPKIEDDGNPRFVIFIR ANVY WYPL+
Sbjct: 38 GRQPARRALVVEARGGRGWSERQMQQQRRAPLLPKIEDDGNPRFVIFIRTANVYFWYPLN 97
Query: 96 IITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYR---------------------- 133
IITGGTTAKIM+AAKDNFLGKYIYKDTLARNLAAVIY+
Sbjct: 98 IITGGTTAKIMLAAKDNFLGKYIYKDTLARNLAAVIYKDEDVIIDTAKEQYRVLKTENQF 157
Query: 134 -------ENGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGD----AKESFGKITA 179
E GN+R+AL+TS+VIELP D+LKTV+DKVKDFFGD AKESF +IT
Sbjct: 158 RYGYKVVEKGNIRSALTTSNVIELPKKDELKTVVDKVKDFFGDVTAGAKESFAQITG 214
>gi|115447129|ref|NP_001047344.1| Os02g0600200 [Oryza sativa Japonica Group]
gi|47848283|dbj|BAD22147.1| unknown protein [Oryza sativa Japonica Group]
gi|113536875|dbj|BAF09258.1| Os02g0600200 [Oryza sativa Japonica Group]
gi|215695016|dbj|BAG90207.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 266
Score = 198 bits (504), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 105/168 (62%), Positives = 120/168 (71%), Gaps = 34/168 (20%)
Query: 45 LVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAK 104
LVV+A+G RG R+ Q+ P LPKIEDDGNPRFVIFIR ANVY WYPL+I+TGGTTAK
Sbjct: 59 LVVEARG-RGWSDRRSQQQRMPQLPKIEDDGNPRFVIFIRTANVYFWYPLNIVTGGTTAK 117
Query: 105 IMVAAKDNFLGKYIYKDTLARNLAAVIYR-----------------------------EN 135
IM+AAKDNFLGKYIYKDTLARNLAAVIY+ EN
Sbjct: 118 IMLAAKDNFLGKYIYKDTLARNLAAVIYKDEDDIIDTAKEQYRVLKTDNEFRYGYKVVEN 177
Query: 136 GNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGD----AKESFGKITA 179
GNLR+AL+TS+VIELP ++LKTV+DKVKDFFGD AKESF +IT
Sbjct: 178 GNLRSALTTSNVIELPKKEELKTVVDKVKDFFGDVTSGAKESFAQITG 225
>gi|218191110|gb|EEC73537.1| hypothetical protein OsI_07935 [Oryza sativa Indica Group]
gi|222639846|gb|EEE67978.1| hypothetical protein OsJ_25898 [Oryza sativa Japonica Group]
Length = 300
Score = 197 bits (501), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/168 (62%), Positives = 120/168 (71%), Gaps = 34/168 (20%)
Query: 45 LVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAK 104
LVV+A+G RG R+ Q+ P LPKIEDDGNPRFVIFIR ANVY WYPL+I+TGGTTAK
Sbjct: 59 LVVEARG-RGWSDRRSQQQRMPQLPKIEDDGNPRFVIFIRTANVYFWYPLNIVTGGTTAK 117
Query: 105 IMVAAKDNFLGKYIYKDTLARNLAAVIYR-----------------------------EN 135
IM+AAKDNFLGKYIYKDTLARNLAAVIY+ EN
Sbjct: 118 IMLAAKDNFLGKYIYKDTLARNLAAVIYKDEDDIIDTAKEQYRVLKTDNEFRYGYKVVEN 177
Query: 136 GNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGD----AKESFGKITA 179
GNLR+AL+TS+VIELP ++LKTV+DKVKDFFGD AKESF +IT
Sbjct: 178 GNLRSALTTSNVIELPKKEELKTVVDKVKDFFGDVTSGAKESFAQITG 225
>gi|326494034|dbj|BAJ85479.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326494614|dbj|BAJ94426.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 256
Score = 189 bits (481), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/169 (59%), Positives = 118/169 (69%), Gaps = 34/169 (20%)
Query: 45 LVVQAKGKRGLQARQFQRPPP-PSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTA 103
LVV+A+G RG RQ Q+ P LPKIEDDGNPRFVIFIR ANVY WYPL+I+TGGTTA
Sbjct: 46 LVVEARGGRGWSDRQSQQQRRMPQLPKIEDDGNPRFVIFIRTANVYFWYPLNIVTGGTTA 105
Query: 104 KIMVAAKDNFLGKYIYKDTLARNLAAVIYR-----------------------------E 134
KIM+AAKDNFLGKYIYKDTLARNLA VIY+ E
Sbjct: 106 KIMLAAKDNFLGKYIYKDTLARNLATVIYKDEDDIIDLAKEQFRVLKGETEFRYGYKIVE 165
Query: 135 NGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGD----AKESFGKITA 179
GNLR+AL+TS+V+ELP ++LK+V+DKV+DFFG+ AKESF +IT
Sbjct: 166 KGNLRSALATSNVLELPKKEELKSVVDKVRDFFGEVTSGAKESFAQITG 214
>gi|297738330|emb|CBI27531.3| unnamed protein product [Vitis vinifera]
Length = 144
Score = 182 bits (462), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/143 (65%), Positives = 108/143 (75%), Gaps = 30/143 (20%)
Query: 85 MANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYR----------- 133
MANVYLWYPLS+ITGGTTAKIMVAAKDNFLGKYIY+DT+ARNLAAVIY+
Sbjct: 1 MANVYLWYPLSLITGGTTAKIMVAAKDNFLGKYIYRDTIARNLAAVIYKDEKEIQKTAFR 60
Query: 134 ------------------ENGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGDAKESFG 175
EN N+R+ALST+DVIELPT +LKTVLDKV+DFFGDAKESFG
Sbjct: 61 QYRVLRSAKEFRYGYKLVENNNVRSALSTTDVIELPTQGELKTVLDKVRDFFGDAKESFG 120
Query: 176 KITALNSTSSEESEENSKEKAKI 198
K+TALNST +EESE+ S EK+K+
Sbjct: 121 KLTALNST-TEESEQMSNEKSKV 142
>gi|116783902|gb|ABK23133.1| unknown [Picea sitchensis]
Length = 232
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 83/180 (46%), Positives = 119/180 (66%), Gaps = 35/180 (19%)
Query: 44 VLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTA 103
V++V+AKG++ +Q + +Q+ PPPS+PK EDD NPRFV+FIR NV WYPL+II+GGTTA
Sbjct: 51 VVIVEAKGRKSMQEKMYQQRPPPSVPKAEDD-NPRFVVFIRSKNVPRWYPLNIISGGTTA 109
Query: 104 KIMVAAKDNFLGKYIYKDTLARNLAAVIYRENGNLR------------------------ 139
KIMVA KD +GK++Y+ L RN+AAV+Y++ +R
Sbjct: 110 KIMVAGKDTPVGKFLYEGALTRNIAAVVYKDEKEIRKIALKQYPVLKAASELQYGYKLID 169
Query: 140 -----AALSTSDVIELPTPDKLKTVLDKVKDFFGDA----KESFGKITALNSTSSEESEE 190
+AL +SD+I++P ++LK V++KVKDFFG+A KESFG I+ L+ T +EE E+
Sbjct: 170 PKNPNSALYSSDIIKIPPQEELKPVVEKVKDFFGNAMTGVKESFGSISNLD-TGAEEDEK 228
>gi|413937631|gb|AFW72182.1| hypothetical protein ZEAMMB73_079373 [Zea mays]
Length = 165
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 76/98 (77%), Positives = 83/98 (84%), Gaps = 1/98 (1%)
Query: 37 GRERQRRVLVVQAKGKRGLQARQFQRPP-PPSLPKIEDDGNPRFVIFIRMANVYLWYPLS 95
GR+ RR LVV+A+G RG RQ Q+ P LPKIEDDGNPRFVIFIR ANVY WYPL+
Sbjct: 38 GRQPARRALVVEARGGRGWSERQMQQQRRAPLLPKIEDDGNPRFVIFIRTANVYFWYPLN 97
Query: 96 IITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYR 133
IITGGTTAKIM+AAKDNFLGKYIYKDTLARNLAAVIY+
Sbjct: 98 IITGGTTAKIMLAAKDNFLGKYIYKDTLARNLAAVIYK 135
>gi|125540180|gb|EAY86575.1| hypothetical protein OsI_07955 [Oryza sativa Indica Group]
Length = 477
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 70/94 (74%), Positives = 81/94 (86%), Gaps = 1/94 (1%)
Query: 45 LVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAK 104
LVV+A+G RG R+ Q+ P LPKIEDDGNPRFVIFIR ANVY WYPL+I+TGGTTAK
Sbjct: 59 LVVEARG-RGWSDRRSQQQRMPQLPKIEDDGNPRFVIFIRTANVYFWYPLNIVTGGTTAK 117
Query: 105 IMVAAKDNFLGKYIYKDTLARNLAAVIYRENGNL 138
IM+AAKDNFLGKYIYKDTLARNLAAVIY++ ++
Sbjct: 118 IMLAAKDNFLGKYIYKDTLARNLAAVIYKDEDDI 151
>gi|168026443|ref|XP_001765741.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682918|gb|EDQ69332.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 209
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/180 (40%), Positives = 106/180 (58%), Gaps = 31/180 (17%)
Query: 47 VQAKGKR--GLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAK 104
V+AKG+R G+ RQ R PS+P +EDDGNP+FV+FIR NV WYPLS++TGGTTAK
Sbjct: 30 VRAKGRRNAGIPGRQPNRQQMPSMPAMEDDGNPKFVLFIRTLNVPRWYPLSVVTGGTTAK 89
Query: 105 IMVAAKDNFLGKYIYKDTLARNLAAVIYRENGNL-------------------------- 138
+MV A N GK +Y+ TL RN+A VIY++ +
Sbjct: 90 MMVGAMKNDWGKKLYEGTLTRNIAGVIYKDEKKIIQTAIKQYPVLKSATGFQYGYKIMDP 149
Query: 139 ---RAALSTSDVIELPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEK 195
++A+ +SDV+ +P ++L +V+DKVK FF ESFG I +L + S +++ ++ K
Sbjct: 150 EKPQSAMFSSDVVTIPPQEELNSVVDKVKGFFSKVGESFGSIGSLPAGSQDQAGQSKDNK 209
>gi|376340006|gb|AFB34514.1| hypothetical protein CL4511Contig1_02, partial [Pinus mugo]
gi|376340008|gb|AFB34515.1| hypothetical protein CL4511Contig1_02, partial [Pinus mugo]
gi|376340010|gb|AFB34516.1| hypothetical protein CL4511Contig1_02, partial [Pinus mugo]
gi|376340012|gb|AFB34517.1| hypothetical protein CL4511Contig1_02, partial [Pinus mugo]
Length = 85
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/86 (60%), Positives = 70/86 (81%), Gaps = 1/86 (1%)
Query: 44 VLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTA 103
+++V+AKGK+ +Q + +++ PPPSLPK EDD NPRFV+FIR NV WYPL+II+GGTTA
Sbjct: 1 LVIVEAKGKKSMQEKLYKQRPPPSLPKAEDD-NPRFVVFIRTKNVPRWYPLNIISGGTTA 59
Query: 104 KIMVAAKDNFLGKYIYKDTLARNLAA 129
KIMVA KD +GK++Y+ L RN+AA
Sbjct: 60 KIMVAGKDTPVGKFLYEGALTRNIAA 85
>gi|361069731|gb|AEW09177.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144713|gb|AFG53863.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144715|gb|AFG53864.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144717|gb|AFG53865.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144719|gb|AFG53866.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144721|gb|AFG53867.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144723|gb|AFG53868.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144725|gb|AFG53869.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144727|gb|AFG53870.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144729|gb|AFG53871.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144731|gb|AFG53872.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144733|gb|AFG53873.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144735|gb|AFG53874.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144737|gb|AFG53875.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144739|gb|AFG53876.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144741|gb|AFG53877.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144743|gb|AFG53878.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
gi|383144745|gb|AFG53879.1| Pinus taeda anonymous locus CL4511Contig1_02 genomic sequence
Length = 85
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/86 (60%), Positives = 70/86 (81%), Gaps = 1/86 (1%)
Query: 44 VLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTA 103
+++V+AKGK+ +Q + +++ PPPSLPK EDD NPRFV+FIR NV WYPL+II+GGTTA
Sbjct: 1 LVIVEAKGKKNMQEKLYKQRPPPSLPKAEDD-NPRFVVFIRTKNVPRWYPLNIISGGTTA 59
Query: 104 KIMVAAKDNFLGKYIYKDTLARNLAA 129
KIMVA KD +GK++Y+ L RN+AA
Sbjct: 60 KIMVAGKDTPVGKFLYEGALTRNIAA 85
>gi|376340004|gb|AFB34513.1| hypothetical protein CL4511Contig1_02, partial [Larix decidua]
Length = 85
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 69/86 (80%), Gaps = 1/86 (1%)
Query: 44 VLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTA 103
+++V+AKG++ +Q + +++ PPPS+P EDD NPRFV+FIR NV WYPL+II+GGTTA
Sbjct: 1 LVIVEAKGRKSMQEKMYKQRPPPSVPPAEDD-NPRFVVFIRSKNVPRWYPLNIISGGTTA 59
Query: 104 KIMVAAKDNFLGKYIYKDTLARNLAA 129
KIMVA KD +GK++Y+ L RN+AA
Sbjct: 60 KIMVAGKDTPVGKFLYEGALTRNIAA 85
>gi|302802935|ref|XP_002983221.1| hypothetical protein SELMODRAFT_422572 [Selaginella moellendorffii]
gi|300148906|gb|EFJ15563.1| hypothetical protein SELMODRAFT_422572 [Selaginella moellendorffii]
Length = 184
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 84/170 (49%), Gaps = 35/170 (20%)
Query: 39 ERQRRVLVVQAKGKRG-----LQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYP 93
ER +VL ++A ++G Q +Q Q+ P +P EDD NPRFVIFIR NV LWYP
Sbjct: 10 ERSLQVLTIEASKRKGPMRRMQQQQQMQQRSLPKVPAAEDD-NPRFVIFIRSKNVPLWYP 68
Query: 94 LSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRENGNLRA------------- 140
L+I++GG AK MV N GK IY ++L N+ A +Y++ + A
Sbjct: 69 LNIVSGGNAAKFMVGVTKNEWGKKIYGNSLTNNIGAAVYKDEEKIIASVVKTYPTLKTAK 128
Query: 141 ----------------ALSTSDVIELPTPDKLKTVLDKVKDFFGDAKESF 174
AL DV +P ++LK + +KV DF G ++
Sbjct: 129 EFQFGYKLVDEEKANEALRPVDVTLIPPKEELKPITEKVTDFVGKGMDNL 178
>gi|302755836|ref|XP_002961342.1| hypothetical protein SELMODRAFT_403146 [Selaginella moellendorffii]
gi|300172281|gb|EFJ38881.1| hypothetical protein SELMODRAFT_403146 [Selaginella moellendorffii]
Length = 179
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 81/164 (49%), Gaps = 35/164 (21%)
Query: 39 ERQRRVLVVQAKGKRG-----LQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYP 93
ER +VL ++A ++G Q +Q Q+ P +P EDD NPRFVIFIR NV LWYP
Sbjct: 5 ERSLQVLTIEASKRKGPMRRMQQQQQMQQRSLPKVPAAEDD-NPRFVIFIRSKNVPLWYP 63
Query: 94 LSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRENGNLRA------------- 140
L+I++GG AK MV N GK IY ++L N+ A +Y++ + A
Sbjct: 64 LNIVSGGNAAKFMVGVTKNEWGKKIYGNSLTNNIGAAVYKDEEKIIASVVKTYPTLKTAK 123
Query: 141 ----------------ALSTSDVIELPTPDKLKTVLDKVKDFFG 168
AL DV +P + LK + +KV DF G
Sbjct: 124 EFQFGYKLVDEEKANEALRPVDVTLIPPKEDLKPITEKVTDFVG 167
>gi|315420437|gb|ADU15868.1| CLCuV responsive protein [Gossypium hirsutum]
Length = 77
Score = 69.7 bits (169), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 35/51 (68%), Positives = 46/51 (90%), Gaps = 1/51 (1%)
Query: 148 IELPTPDKLKTVLDKVKDFFGDAKESFGKITALNSTSSEESEENSKEKAKI 198
IELPT D+LKTV+DKVKDFFGDAKESFGKIT+L S+++ ESEEN +++ ++
Sbjct: 7 IELPTQDQLKTVVDKVKDFFGDAKESFGKITSL-SSATPESEENLQKRPRL 56
>gi|357142775|ref|XP_003572689.1| PREDICTED: putative cyclic nucleotide-gated ion channel 15-like
[Brachypodium distachyon]
Length = 766
Score = 69.3 bits (168), Expect = 8e-10, Method: Composition-based stats.
Identities = 36/69 (52%), Positives = 49/69 (71%), Gaps = 4/69 (5%)
Query: 133 RENGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGD----AKESFGKITALNSTSSEES 188
+E GNLR+AL+TS+VIELP ++LKTV+DKVKDFFG+ AKESF +IT S +
Sbjct: 676 QEKGNLRSALATSNVIELPKKEELKTVVDKVKDFFGEVTSGAKESFAQITGSASDEAGAG 735
Query: 189 EENSKEKAK 197
+ +EK +
Sbjct: 736 AQEEEEKPR 744
>gi|224138776|ref|XP_002326687.1| predicted protein [Populus trichocarpa]
gi|222834009|gb|EEE72486.1| predicted protein [Populus trichocarpa]
Length = 65
Score = 67.0 bits (162), Expect = 4e-09, Method: Composition-based stats.
Identities = 35/49 (71%), Positives = 38/49 (77%), Gaps = 3/49 (6%)
Query: 88 VYLWY-PLSIITGGT--TAKIMVAAKDNFLGKYIYKDTLARNLAAVIYR 133
VY WY P II GGT AKIMV+A D+ L KYIYKDTLARNLAA+IYR
Sbjct: 11 VYQWYSPSHIIGGGTGTIAKIMVSATDDVLKKYIYKDTLARNLAAIIYR 59
>gi|384250711|gb|EIE24190.1| hypothetical protein COCSUDRAFT_53257 [Coccomyxa subellipsoidea
C-169]
Length = 213
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 56/108 (51%), Gaps = 8/108 (7%)
Query: 31 TTQKPNGRERQRRVLVVQAKGKRGLQARQFQR----PPPPSLPKIEDDGNPRFVIFIRMA 86
T + +GR + V V+ GK+ L PP P + D N FV+FIR
Sbjct: 39 TCRPGSGRSLRLEVSAVKGAGKKKLAQSAQAGAAQLPPTPPV----DPDNAEFVLFIRAV 94
Query: 87 NVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIYRE 134
WYPLS++ GG+ A I+V A ++ G+ +Y TL RN+ V+Y++
Sbjct: 95 KFPQWYPLSVVKGGSAANIIVRAMESEFGRLLYGKTLIRNIGTVVYQD 142
>gi|413937630|gb|AFW72181.1| hypothetical protein ZEAMMB73_079373 [Zea mays]
Length = 92
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 39/53 (73%), Gaps = 1/53 (1%)
Query: 37 GRERQRRVLVVQAKGKRGLQARQFQRPP-PPSLPKIEDDGNPRFVIFIRMANV 88
GR+ RR LVV+A+G RG RQ Q+ P LPKIEDDGNPRFVIFIR ANV
Sbjct: 38 GRQPARRALVVEARGGRGWSERQMQQQRRAPLLPKIEDDGNPRFVIFIRTANV 90
>gi|449016854|dbj|BAM80256.1| hypothetical protein CYME_CMJ126C [Cyanidioschyzon merolae strain
10D]
Length = 224
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 73/163 (44%), Gaps = 13/163 (7%)
Query: 11 LSSSRTHEDVLVKHSPFSSRTTQKPNGRERQRRVLVVQAKGKRGLQARQFQRP----PPP 66
L+++ TH + SR P R + +G GLQA++ +P PP
Sbjct: 26 LTTAHTHGVFCPR-----SRVQSAPASAAGVHRSTALPHRGVIGLQAKK-GKPNAPLAPP 79
Query: 67 SLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARN 126
S ++ +DG P F I R LWYPL + G AK++V A G+ Y DTL +
Sbjct: 80 SELEVPEDGTPVFAILARSPVSGLWYPLGTMRGDGRAKMLVNAMRTSWGRKFYGDTLNKG 139
Query: 127 LAAVIYRENGN--LRAALSTSDVIELPTPDKLKTVLDKV-KDF 166
+A ++ E G + AAL + T D L+ + KDF
Sbjct: 140 VARSVFSEQGRRMIHAALRKYSALRKYTADDLEYGYRVIAKDF 182
>gi|303273350|ref|XP_003056036.1| hypothetical protein MICPUCDRAFT_70697 [Micromonas pusilla
CCMP1545]
gi|226462120|gb|EEH59412.1| hypothetical protein MICPUCDRAFT_70697 [Micromonas pusilla
CCMP1545]
Length = 214
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 2/72 (2%)
Query: 68 LPKIEDDGNPRFVIFIRMANVY-LWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARN 126
LP I+ D N +FVI++R + WYPL+++TGG+TA +V D+ L + + +L N
Sbjct: 71 LPPIDPD-NEQFVIYVRSKKGFKAWYPLNVVTGGSTANTLVKGLDSDLSRELAVKSLTTN 129
Query: 127 LAAVIYRENGNL 138
+ IY+E L
Sbjct: 130 IGQAIYKETEQL 141
>gi|308799329|ref|XP_003074445.1| Metalloendopeptidase family-saccharolysin & thimet oligopeptidase
(ISS) [Ostreococcus tauri]
gi|116000616|emb|CAL50296.1| Metalloendopeptidase family-saccharolysin & thimet oligopeptidase
(ISS) [Ostreococcus tauri]
Length = 205
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 43/68 (63%), Gaps = 2/68 (2%)
Query: 68 LPKIEDDGNPRFVIFIRMA-NVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARN 126
+P I+ D N +FVI++R + WYPL+++TGG+ A +V DN + + + + +L +N
Sbjct: 66 MPPIDPD-NEQFVIYVRSKRGLKAWYPLNVVTGGSAANTLVKGLDNDMSREMAQKSLQQN 124
Query: 127 LAAVIYRE 134
+ IY++
Sbjct: 125 IGKAIYKD 132
>gi|145340935|ref|XP_001415572.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575795|gb|ABO93864.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 147
Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 43/68 (63%), Gaps = 2/68 (2%)
Query: 68 LPKIEDDGNPRFVIFIRMA-NVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARN 126
+P I+ D N +FVI +R + WYPL+++TGG+ A +V DN + K + + +L++N
Sbjct: 9 MPPIDPD-NEQFVIHVRSKRGIKAWYPLNVVTGGSAANTLVKGLDNDMSKEMAQKSLSQN 67
Query: 127 LAAVIYRE 134
+ IY++
Sbjct: 68 IGKAIYKD 75
>gi|255070831|ref|XP_002507497.1| hypothetical protein MICPUN_113391 [Micromonas sp. RCC299]
gi|226522772|gb|ACO68755.1| hypothetical protein MICPUN_113391 [Micromonas sp. RCC299]
Length = 215
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 45/73 (61%), Gaps = 2/73 (2%)
Query: 68 LPKIEDDGNPRFVIFIRMANVY-LWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARN 126
+P ++ D N +FVI++R + W+PL+++TGG+TA +V D+ L + + K +L N
Sbjct: 68 MPPVDPD-NEQFVIYVRSKKGFKAWFPLNVVTGGSTANTLVKGLDSNLSRDMAKKSLTNN 126
Query: 127 LAAVIYRENGNLR 139
+ IY++ L+
Sbjct: 127 IGQAIYKDADQLQ 139
>gi|307111738|gb|EFN59972.1| hypothetical protein CHLNCDRAFT_133084 [Chlorella variabilis]
Length = 206
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 55/133 (41%), Gaps = 30/133 (22%)
Query: 73 DDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKDTLARNLAAVIY 132
D N FVIF+R + W PLS++ GGT A ++V D+ K TL +N+ +Y
Sbjct: 70 DPDNEEFVIFVRSKKLPQWVPLSVVKGGTAANMLVKGLDSDWMKDTTAKTLVQNIGKAVY 129
Query: 133 -----------------------------RENGNLRAALSTSDVIELPTPDKL-KTVLDK 162
RE + +A ++++P +L KT +
Sbjct: 130 KDKEQIIASLRKAYPPFKETKEFEFAFKIREKADPKAWYVPKGLMDIPPEAELEKTPAEN 189
Query: 163 VKDFFGDAKESFG 175
VKDFF A FG
Sbjct: 190 VKDFFSTAFSGFG 202
>gi|412990384|emb|CCO19702.1| predicted protein [Bathycoccus prasinos]
Length = 171
Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 18/89 (20%)
Query: 68 LPKIEDDGNPRFVIFIR-MANVYLWYPLSIITGGTTAKIMVAAKDNFLGK------YIYK 120
LP I+ D N +FVI++R + W+PL+++TGG+TA +V D+ L K + Y
Sbjct: 11 LPPIDPD-NEQFVIYVRSKKGLKTWFPLNVVTGGSTANTLVKGLDSNLSKDVCLSLFYYH 69
Query: 121 ----------DTLARNLAAVIYRENGNLR 139
+L N+ +Y+E L
Sbjct: 70 FSHYFLQMALKSLTNNIGQALYKEQVQLE 98
>gi|428166950|gb|EKX35917.1| hypothetical protein GUITHDRAFT_117950 [Guillardia theta CCMP2712]
Length = 149
Score = 40.0 bits (92), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 4/64 (6%)
Query: 72 EDDGNPRFVIFIRMANVYLWYPLSIITGGTTAKIMVAA-KDN--FLGKYIYKDTLARNLA 128
+ DG P F +F+R +WYP + G +K +V A +DN FL K YK TL + +A
Sbjct: 32 DSDGLPVFNVFVRSKRAGIWYPAGTLKGDARSKSLVEAWRDNSLFL-KDQYKSTLDKGMA 90
Query: 129 AVIY 132
++
Sbjct: 91 KSLF 94
>gi|159474984|ref|XP_001695603.1| hypothetical protein CHLREDRAFT_205992 [Chlamydomonas reinhardtii]
gi|158275614|gb|EDP01390.1| predicted protein [Chlamydomonas reinhardtii]
Length = 223
Score = 39.3 bits (90), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 11/81 (13%)
Query: 73 DDGNPRFVIFIR-----------MANVYLWYPLSIITGGTTAKIMVAAKDNFLGKYIYKD 121
D N FVIF R M LW PLSI+ G + + A + G +Y
Sbjct: 73 DPENAEFVIFFRCLKYKDPQLNAMVGPSLWVPLSIVKGNQVSNFLANAIKSPWGMRLYGR 132
Query: 122 TLARNLAAVIYRENGNLRAAL 142
TL +A+ +Y++ L L
Sbjct: 133 TLIWQMASGLYQDKAKLEKEL 153
>gi|219129507|ref|XP_002184929.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403714|gb|EEC43665.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 206
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 35/73 (47%), Gaps = 1/73 (1%)
Query: 74 DGNPRFVIFIRMANVYLWYPLSIITGGT-TAKIMVAAKDNFLGKYIYKDTLARNLAAVIY 132
DG P F +F+R +WYP G +A + + D L + K + +A +Y
Sbjct: 82 DGLPVFNLFVRTKKANMWYPCGSFKGDERSAALAKSYADGGLLAGVSKRQIDGGIAGSLY 141
Query: 133 RENGNLRAALSTS 145
R+ G L+ +++ S
Sbjct: 142 RDQGKLKESVARS 154
>gi|154147932|ref|YP_001406499.1| preprotein translocase subunit SecA [Campylobacter hominis ATCC
BAA-381]
gi|171769704|sp|A7I1V8.1|SECA_CAMHC RecName: Full=Protein translocase subunit SecA
gi|153803941|gb|ABS50948.1| preprotein translocase, SecA subunit [Campylobacter hominis ATCC
BAA-381]
Length = 884
Score = 36.2 bits (82), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 116 KYIYK-DTLARNLAAVIYR-ENGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGDAKES 173
KYI + D +A VIYR N L A D I D + VLD+++ F GD+KE+
Sbjct: 611 KYILEYDDVANEQRKVIYRYRNELLEADFDLHDKIISNREDYIANVLDRLEIFDGDSKEN 670
Query: 174 FG--KITA-LNSTSSEESEENSKEKAK 197
F KIT+ + S ++E +EN KA+
Sbjct: 671 FDIQKITSIIKSETTEILDENKLSKAE 697
>gi|428226288|ref|YP_007110385.1| hypothetical protein GEI7407_2862 [Geitlerinema sp. PCC 7407]
gi|427986189|gb|AFY67333.1| hypothetical protein GEI7407_2862 [Geitlerinema sp. PCC 7407]
Length = 145
Score = 35.8 bits (81), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 11/77 (14%)
Query: 41 QRRVLVVQAKGKRGLQARQFQRPPPPSLPKIEDDGNPRFVIFIRMANVYLWYPLSIITGG 100
Q++ V ++ KR A Q+ K++ DG P FV+F+R+ W+P+ I G
Sbjct: 14 QKKEKVAKSAEKRQAAASQYD--------KMKSDGMPEFVVFVRVRGKKNWFPVGSIAVG 65
Query: 101 TTAKI---MVAAKDNFL 114
+ +I + A +D L
Sbjct: 66 RSNQINQAIFANEDELL 82
>gi|17541710|ref|NP_502413.1| Protein PGP-1 [Caenorhabditis elegans]
gi|29429182|sp|P34712.2|PGP1_CAEEL RecName: Full=Multidrug resistance protein pgp-1; AltName:
Full=P-glycoprotein A; AltName:
Full=P-glycoprotein-related protein 1
gi|406855735|pdb|4F4C|A Chain A, The Crystal Structure Of The Multi-Drug Transporter
gi|3878418|emb|CAB01232.1| Protein PGP-1 [Caenorhabditis elegans]
Length = 1321
Score = 35.8 bits (81), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 135 NGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGDAKESFGKI 177
NG+LR +L T D L D LKT + V+D+ GD +S G+I
Sbjct: 4 NGSLRQSLRTLDSFSLAPEDVLKTAIKTVEDYEGDNIDSNGEI 46
>gi|6809|emb|CAA46190.1| P-glycoprotein A [Caenorhabditis elegans]
Length = 1321
Score = 35.8 bits (81), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 135 NGNLRAALSTSDVIELPTPDKLKTVLDKVKDFFGDAKESFGKI 177
NG+LR +L T D L D LKT + V+D+ GD +S G+I
Sbjct: 4 NGSLRQSLRTLDSFSLAPEDVLKTAIKTVEDYEGDNIDSNGEI 46
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.367
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,017,255,010
Number of Sequences: 23463169
Number of extensions: 121994340
Number of successful extensions: 391320
Number of sequences better than 100.0: 59
Number of HSP's better than 100.0 without gapping: 49
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 391202
Number of HSP's gapped (non-prelim): 81
length of query: 198
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 63
effective length of database: 9,191,667,552
effective search space: 579075055776
effective search space used: 579075055776
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 73 (32.7 bits)