BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019341
(342 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255575726|ref|XP_002528762.1| conserved hypothetical protein [Ricinus communis]
gi|223531765|gb|EEF33584.1| conserved hypothetical protein [Ricinus communis]
Length = 373
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 196/318 (61%), Positives = 243/318 (76%), Gaps = 13/318 (4%)
Query: 1 MSSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVAL 60
MSSS+ KR KP + S SSS + +T PP +LFPSK++ +RLI V+AIASSVA
Sbjct: 1 MSSSSTNKRRKP--------NLSPSSSPTLLTGPPNNLFPSKEEFVRLIAVLAIASSVAF 52
Query: 61 TCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLC 120
TCN +A ++N ++KPFCDSN S ++ C PCP NGEC QGKLEC GYRKH +C
Sbjct: 53 TCNLIATYINPSTKPFCDSNT-----DSFSEFCVPCPENGECTQGKLECAEGYRKHRNIC 107
Query: 121 VEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDN 180
+EDGDINE A +LS WVEN LC AYAQ+LCDG G+IW ++NDIW DL+GH+LM+ F+ DN
Sbjct: 108 IEDGDINERAKKLSEWVENHLCEAYAQYLCDGIGTIWFQDNDIWYDLDGHQLMENFQPDN 167
Query: 181 PVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVP 240
Y+Y K++ ME + R LE RTNS+G KELKCP+L+AEHYKP +CR QW+S HA +I
Sbjct: 168 ATYIYAKRKAMEMIVRLLEIRTNSHGNKELKCPDLVAEHYKPFTCRFRQWISNHAFVIAS 227
Query: 241 VCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLR 300
+CSL+VG +LLL K+ RR Y + R EELYHQVCE+LEENALMSK NGEC+ WVVAS+LR
Sbjct: 228 LCSLVVGAVLLLRKLQRRWYLSARGEELYHQVCEVLEENALMSKQSNGECDSWVVASQLR 287
Query: 301 DHLLLPKERKDPVIWKKV 318
DHLLLPKERKDPV+WK+V
Sbjct: 288 DHLLLPKERKDPVLWKRV 305
>gi|224138028|ref|XP_002326500.1| predicted protein [Populus trichocarpa]
gi|222833822|gb|EEE72299.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 198/316 (62%), Positives = 244/316 (77%), Gaps = 6/316 (1%)
Query: 3 SSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTC 62
SST K+RPK +SSSS + SS EPP +LFPSKQ+ LRLI V+AIASSVALTC
Sbjct: 2 SSTPKRRPKRDLSSSSSHPYTISSK----IEPPHNLFPSKQEFLRLIAVLAIASSVALTC 57
Query: 63 NYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVE 122
N++AN+++ ++KPFCD++L S S ++SCEPCP NGEC+QGKLEC GYRKH C+E
Sbjct: 58 NFIANYIDHSTKPFCDTSLD--SSDSLSNSCEPCPRNGECNQGKLECARGYRKHRNTCIE 115
Query: 123 DGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPV 182
DGD+ E A +L VEN LC AYA FLC GTG +WV+E+DI NDL+GH+L++ + DNPV
Sbjct: 116 DGDVYERAKKLLEGVENHLCEAYADFLCYGTGIMWVQEDDILNDLDGHQLLENYSSDNPV 175
Query: 183 YLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVC 242
Y+YTK + MET+ L++RTN G KE KCP+LL EHYKP +C + QW+S HAL+IVPVC
Sbjct: 176 YVYTKMKAMETISEELQTRTNPNGKKEFKCPDLLVEHYKPFTCHLRQWISEHALVIVPVC 235
Query: 243 SLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDH 302
+L+VG L+WK+ RR Y + R EELYHQVC+ILEE ALMSK VN ECEPWVVASRLRDH
Sbjct: 236 ALVVGFAFLVWKIRRRWYLSTRGEELYHQVCDILEERALMSKRVNAECEPWVVASRLRDH 295
Query: 303 LLLPKERKDPVIWKKV 318
LL PKERKD V+WKKV
Sbjct: 296 LLSPKERKDFVLWKKV 311
>gi|225427312|ref|XP_002282079.1| PREDICTED: uncharacterized protein LOC100243743 [Vitis vinifera]
gi|297742158|emb|CBI33945.3| unnamed protein product [Vitis vinifera]
Length = 383
Score = 366 bits (940), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 182/316 (57%), Positives = 245/316 (77%), Gaps = 5/316 (1%)
Query: 3 SSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTC 62
+ST +KRPKP +S S+ S SSSSS + + EPP++ FPSK +L +L+ V+AIA+SVA C
Sbjct: 2 ASTPRKRPKP--HSKSTHSPSSSSSLNALMEPPENFFPSKPELFKLLAVIAIATSVAALC 59
Query: 63 NYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVE 122
NY+ L+ SKPFCD+N DS P+D CEPCPSN EC+QG +EC GYRKHGKLC+E
Sbjct: 60 NYVVTILSRHSKPFCDTNA--DSQYLPSDLCEPCPSNAECYQGMMECVRGYRKHGKLCIE 117
Query: 123 DGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPV 182
DGDINETA +L+ +E +C YAQFLC GTGS+WV+E+++WND++ ++M+ L+N +
Sbjct: 118 DGDINETAKKLANRIETHVCEGYAQFLC-GTGSVWVQEDEVWNDVDELKMMENLGLENAI 176
Query: 183 YLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVC 242
++TK+R ME + LE++ N G+KELKCP LLAEHYKP SCR+ QW+S HAL+++P+C
Sbjct: 177 DMHTKQRAMEMIDGLLETKINHRGIKELKCPNLLAEHYKPFSCRVQQWISNHALVLMPIC 236
Query: 243 SLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDH 302
LLVG +LLL ++ +RR + R EELY+Q+C+ILEENA+M+K +GE EPWVV S LRDH
Sbjct: 237 GLLVGSILLLRRIRQRRNLSARAEELYNQICDILEENAMMTKGGDGEGEPWVVVSWLRDH 296
Query: 303 LLLPKERKDPVIWKKV 318
LLLPKERKDP++W+KV
Sbjct: 297 LLLPKERKDPLLWRKV 312
>gi|356495558|ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779650 [Glycine max]
Length = 377
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 180/302 (59%), Positives = 228/302 (75%), Gaps = 11/302 (3%)
Query: 19 SSSSSSSSSWSWM-TEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFC 77
S+S+S S S+S M EPPQ+L PSK D RL+ VVA+AS VA TCN FL + SKPFC
Sbjct: 12 SNSNSESCSFSLMGREPPQNLLPSKHDFPRLVLVVALASLVAWTCN----FLFTPSKPFC 67
Query: 78 DSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWV 137
D NL SP +D CEPCPSNGEC+ GKL+C GY++HG LCVEDGDINE+A +L V
Sbjct: 68 DPNL--HSPDYFSDICEPCPSNGECNDGKLKCLQGYQRHGNLCVEDGDINESARKLLERV 125
Query: 138 ENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRY 197
E+ LC YAQFLC GTG+IWV E+D+WN E +K+ DN +Y YTK++ ET+G+
Sbjct: 126 EHHLCEEYAQFLCTGTGTIWVREDDLWNYFEPVGNVKV---DNALYKYTKQKAFETMGKL 182
Query: 198 LESRTNS-YGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVH 256
L++R NS +GMKE KCP+ LAEHYK +C I QW+S H L+++P+C++LVGC L W V
Sbjct: 183 LDTRLNSSHGMKEFKCPDQLAEHYKSYACCIRQWISQHILVVLPICAMLVGCTALFWSVR 242
Query: 257 RRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIWK 316
++ + R+EELY++VCEILEENAL SKS NGECEPWVV+SRLRDHLLLP+ERK+P++WK
Sbjct: 243 QKLCMSRRIEELYNKVCEILEENALTSKSANGECEPWVVSSRLRDHLLLPRERKNPLLWK 302
Query: 317 KV 318
KV
Sbjct: 303 KV 304
>gi|356540585|ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784375 [Glycine max]
Length = 381
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 179/303 (59%), Positives = 226/303 (74%), Gaps = 8/303 (2%)
Query: 19 SSSSSSSSSWSWMT-EPPQSLFPSKQDLLRLITVVAIASSVALTCNYL-ANFLNSTSKPF 76
S+S+S S S M EPPQ+L PSK D RL+ V+A+AS VA TCN+L + + SKPF
Sbjct: 12 SNSNSEGCSLSLMGREPPQNLLPSKHDFPRLVLVIALASLVAWTCNFLFTSLFHPPSKPF 71
Query: 77 CDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRW 136
CD+NL SP D C+PCPSNGEC+ GKLEC GY++HG LC EDGDINE+A +L
Sbjct: 72 CDTNL--HSPDYFLDICQPCPSNGECNDGKLECHQGYQRHGNLCAEDGDINESARKLLER 129
Query: 137 VENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGR 196
VE+ LC YAQFLC GTG IWV E+D+WN E +K+ DN +Y YTK+R +ET+G+
Sbjct: 130 VEHHLCEKYAQFLCTGTGIIWVHEDDLWNYFEPVGNVKV---DNALYNYTKQRAVETMGK 186
Query: 197 YLESRTNS-YGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKV 255
LE+R NS +GMKE KCP+ LAEHYKP +C I QW+S H L+++P+C++LVGC L W V
Sbjct: 187 LLETRLNSSHGMKEFKCPDQLAEHYKPYTCCIRQWISQHILVVLPICAMLVGCTALCWNV 246
Query: 256 HRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIW 315
++ + RVEELY +VCEILE+NAL SKS NGECEPWVVASRLRDHLLLP+ERK+P++W
Sbjct: 247 RQKLSMSRRVEELYDKVCEILEDNALTSKSANGECEPWVVASRLRDHLLLPRERKNPLLW 306
Query: 316 KKV 318
KK+
Sbjct: 307 KKL 309
>gi|357482303|ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago truncatula]
gi|355512772|gb|AES94395.1| hypothetical protein MTR_5g014010 [Medicago truncatula]
Length = 374
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 165/287 (57%), Positives = 218/287 (75%), Gaps = 7/287 (2%)
Query: 33 EPPQSLFPSKQDLLRLITVVAIASSVALTCNYL-ANFLNSTSKPFCDSNLLLDSPQSPTD 91
EPP +L PSK + +L+ V+ +AS VA + N L +FL+ ++KPFCD+N L + P D
Sbjct: 22 EPPPNLLPSKHEFPKLLLVLTVASLVAWSSNLLFTSFLHPSTKPFCDTNSLHN--HFP-D 78
Query: 92 SCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCD 151
SCEPCPSNGEC+ GKLEC GY+KHG LCVEDGDIN++A +++ VE LC YAQFLC
Sbjct: 79 SCEPCPSNGECNDGKLECLRGYQKHGNLCVEDGDINDSARKIADTVERHLCGEYAQFLCS 138
Query: 152 GTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELK 211
GTGSIWV ++D+WN +E E +K N +Y YTK++ + + + LE R ++GMKE K
Sbjct: 139 GTGSIWVHDDDLWNYIEPVENVK---EGNALYNYTKQKAFDMMDKLLEMRLTTHGMKEFK 195
Query: 212 CPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQ 271
CP+ L E YKP +CR+ QW++ H L+++P+C++LVGC++L W V R+ + RVEELY++
Sbjct: 196 CPDSLVEQYKPYACRLRQWITQHILVVLPICAMLVGCMILFWNVRRKLRVSRRVEELYNK 255
Query: 272 VCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIWKKV 318
VCEILEENAL SKSVNGECEPWVVASRLRDHLLLP+ERKDP++WKKV
Sbjct: 256 VCEILEENALTSKSVNGECEPWVVASRLRDHLLLPRERKDPLLWKKV 302
>gi|449461577|ref|XP_004148518.1| PREDICTED: uncharacterized protein LOC101208017 [Cucumis sativus]
Length = 404
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 180/328 (54%), Positives = 229/328 (69%), Gaps = 14/328 (4%)
Query: 3 SSTKKKRPKPKSNSSSSSSSSSSSSWSW-----------MTEPPQSLFPSKQDLLRLITV 51
SST KKR K K N +S S S S + EPP+ FPSK DL LITV
Sbjct: 2 SSTPKKRTKVKRNPNSDVGSGSGSGVDSSVSSSSLLLKSIKEPPRDFFPSKDDLAALITV 61
Query: 52 VAIASSVALTCNYLANFLNSTSK-PFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECF 110
+ IA V ++CN+ + L+S PFCD++ DS +D CEPCP +GEC GKLEC
Sbjct: 62 LIIACFVFVSCNFFVSRLSSRHPIPFCDTDA--DSSDFISDVCEPCPRHGECRDGKLECL 119
Query: 111 HGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGH 170
HGYRKHG+LC+EDG INE +LS W+E+ LC A A+FLCDG G +WV+ENDIW+DL+G
Sbjct: 120 HGYRKHGRLCIEDGVINEAVNKLSEWLESHLCEANAKFLCDGIGIVWVKENDIWDDLDGK 179
Query: 171 ELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQW 230
EL++ DN +Y K + +ET+G L++R NS G+KELKCP+LLAE YKP +CRI W
Sbjct: 180 ELVESIGSDNTTLMYAKSKALETIGGLLQTRQNSLGIKELKCPDLLAESYKPFTCRIRHW 239
Query: 231 VSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGEC 290
V HA +++PV LLVGC LLWK++RR+Y R E+LY+QVCEILEENAL S +G+C
Sbjct: 240 VLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALTSTRNSGQC 299
Query: 291 EPWVVASRLRDHLLLPKERKDPVIWKKV 318
E WVVASRLRDHLLLP+ER++P++WKKV
Sbjct: 300 ESWVVASRLRDHLLLPRERRNPLLWKKV 327
>gi|240256408|ref|NP_199468.5| uncharacterized protein [Arabidopsis thaliana]
gi|332008015|gb|AED95398.1| uncharacterized protein [Arabidopsis thaliana]
Length = 387
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 161/318 (50%), Positives = 215/318 (67%), Gaps = 7/318 (2%)
Query: 4 STKKKRPKPKSNSSSSSSSSSSSS-WSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTC 62
S +KRPK ++ + + SSSSSS M EPPQSLFPSK + L+ V+ +A +VA TC
Sbjct: 3 SIPRKRPKSETRTGRTPKSSSSSSPIRSMLEPPQSLFPSKGEFFTLLKVLLVACAVAFTC 62
Query: 63 NYLANFLNST-SKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCV 121
N+L+ L+S SK FCDSN + S D CEPCP NGEC+QGKL+C GY+ LCV
Sbjct: 63 NFLSKSLSSNPSKSFCDSN--FNPIDSDLDICEPCPINGECYQGKLQCNLGYKNQRNLCV 120
Query: 122 EDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNP 181
EDG+INE+ +L + E ++C +YA C GTG+IWV END+W +L + + LD
Sbjct: 121 EDGEINESTKKLVGYFERKVCESYAHNECYGTGTIWVPENDVWTELRSNSFLS--NLDES 178
Query: 182 VYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPV 241
Y + K + +E V LE RTNS G+ ELKCPE +A+ YKPL+CR+HQW+ H LII
Sbjct: 179 AYNFLKGKAVEGVTELLEKRTNSNGIDELKCPESVAKSYKPLTCRLHQWILRHILIISSS 238
Query: 242 CSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSV-NGECEPWVVASRLR 300
C++LVG +L ++ R++ F+ RVEELY QVC+ LEENA+ S S CEPWV+AS LR
Sbjct: 239 CAMLVGSAMLRRRIQRKQCFSRRVEELYDQVCDFLEENAVASNSAETSNCEPWVIASWLR 298
Query: 301 DHLLLPKERKDPVIWKKV 318
D+LLLP+ER+DP++W KV
Sbjct: 299 DYLLLPRERRDPLLWTKV 316
>gi|297794583|ref|XP_002865176.1| hypothetical protein ARALYDRAFT_494322 [Arabidopsis lyrata subsp.
lyrata]
gi|297311011|gb|EFH41435.1| hypothetical protein ARALYDRAFT_494322 [Arabidopsis lyrata subsp.
lyrata]
Length = 389
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 151/320 (47%), Positives = 199/320 (62%), Gaps = 21/320 (6%)
Query: 4 STKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTCN 63
S +KRPK ++ + + SSSS M EPPQS FPSK++ RL+ V+ +A VA TCN
Sbjct: 3 SIPRKRPKSETRTPKRNPKSSSSPIRSMLEPPQSFFPSKEEFFRLLKVLLVACVVAFTCN 62
Query: 64 YLANFLNST-SKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVE 122
+LA L+S +K FCDSN DS +S D CEPCP NGEC QGKL+C GY+K LCVE
Sbjct: 63 FLAKSLSSNPTKSFCDSNY--DSIESDLDFCEPCPINGECCQGKLKCNLGYKKQRNLCVE 120
Query: 123 DGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPV 182
DG+INE+ +L + E ++C AYA C GTG+IWV END+W +L + + LD
Sbjct: 121 DGEINESTKKLVGYFERKVCEAYAHNECYGTGAIWVPENDVWEELRSNSFLN--NLDESA 178
Query: 183 YLYTKKRTMETVGRYLESRTNSYGMK--ELKCPELLAEHYKPLSCRIHQWVSTHALIIVP 240
Y + K + +E V LE RTNS G K L P + H +V
Sbjct: 179 YNFLKGKAVEAVTELLEKRTNSNGRKLQTLDLPH------------ASMALEAHLNYLVR 226
Query: 241 VCS-LLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVN-GECEPWVVASR 298
+C+ +LVG +L K+ R+RYF+ RVEELY QVC+ LEENA+ S S + CEPWV+AS
Sbjct: 227 MCNGILVGSAILRRKIQRKRYFSRRVEELYDQVCDFLEENAVASNSADTSNCEPWVIASW 286
Query: 299 LRDHLLLPKERKDPVIWKKV 318
LRD+LLLP+ER+DP++W KV
Sbjct: 287 LRDYLLLPRERRDPLLWTKV 306
>gi|449510808|ref|XP_004163763.1| PREDICTED: uncharacterized LOC101208017, partial [Cucumis sativus]
Length = 280
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 145/281 (51%), Positives = 187/281 (66%), Gaps = 14/281 (4%)
Query: 3 SSTKKKRPKPKSNSSSSSSSSSSSSWSW-----------MTEPPQSLFPSKQDLLRLITV 51
SST KKR K K N +S S S S + EPP+ FPSK DL LITV
Sbjct: 2 SSTPKKRTKVKRNPNSDVGSGSGSGVDSSVSSSSLLLKSIKEPPRDFFPSKDDLAALITV 61
Query: 52 VAIASSVALTCNYLANFLNSTSK-PFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECF 110
+ IA V ++CN+ + L+S PFCD++ DS +D CEPCP +GEC GKLEC
Sbjct: 62 LIIACFVFVSCNFFVSRLSSRHPIPFCDTDA--DSSDFISDVCEPCPRHGECRDGKLECL 119
Query: 111 HGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGH 170
HGYRKHG+LC+EDG INE +LS W+E+ LC A A+FLCDG G +WV+ENDIW+DL+G
Sbjct: 120 HGYRKHGRLCIEDGVINEAVNKLSEWLESHLCEANAKFLCDGIGIVWVKENDIWDDLDGK 179
Query: 171 ELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQW 230
EL++ DN +Y K + +ET+G L++R NS G+KELKCP+LLAE YKP +CRI W
Sbjct: 180 ELVESIGSDNTTLMYAKSKALETIGGLLQTRQNSLGIKELKCPDLLAESYKPFTCRIRHW 239
Query: 231 VSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQ 271
V HA +++PV LLVGC LLWK++RR+Y R E+LY+Q
Sbjct: 240 VLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQ 280
>gi|8885597|dbj|BAA97527.1| unnamed protein product [Arabidopsis thaliana]
Length = 375
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 197/318 (61%), Gaps = 16/318 (5%)
Query: 4 STKKKRPKPKSNSSSSSSSSSSSS-WSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTC 62
S +KRPK ++ + + SSSSSS M EPPQSLFPSK + L+ V+ +A +VA TC
Sbjct: 3 SIPRKRPKSETRTGRTPKSSSSSSPIRSMLEPPQSLFPSKGEFFTLLKVLLVACAVAFTC 62
Query: 63 NYLANFLNST-SKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCV 121
N+L+ L+S SK FCDSN + S D CEPCP NGEC+QGKL+C GY+ LCV
Sbjct: 63 NFLSKSLSSNPSKSFCDSN--FNPIDSDLDICEPCPINGECYQGKLQCNLGYKNQRNLCV 120
Query: 122 EDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNP 181
EDG+INE+ +L + E ++C +YA C GTG+IWV END+W +L + + LD
Sbjct: 121 EDGEINESTKKLVGYFERKVCESYAHNECYGTGTIWVPENDVWTELRSNSFLS--NLDES 178
Query: 182 VYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPV 241
Y + K + +E V LE RTNS G K L+ +L + H ++V
Sbjct: 179 AYNFLKGKAVEGVTELLEKRTNSNGQK-LQTFDLSLASMDLAAYLNH--------LVVLC 229
Query: 242 CSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSV-NGECEPWVVASRLR 300
+LVG +L ++ R++ F+ RVEELY QVC+ LEENA+ S S CEPWV+AS LR
Sbjct: 230 NGILVGSAMLRRRIQRKQCFSRRVEELYDQVCDFLEENAVASNSAETSNCEPWVIASWLR 289
Query: 301 DHLLLPKERKDPVIWKKV 318
D+LLLP+ER+DP++W KV
Sbjct: 290 DYLLLPRERRDPLLWTKV 307
>gi|108705664|gb|ABF93459.1| expressed protein [Oryza sativa Japonica Group]
Length = 296
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 117/287 (40%), Positives = 177/287 (61%), Gaps = 7/287 (2%)
Query: 31 MTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPT 90
+ EPP LFP++ DLLRL+ VV+IA++ A C+ LN KP CDS + +
Sbjct: 17 LGEPPPGLFPARDDLLRLLAVVSIAAAAAAACS----LLNRRPKPLCDSG---GAAYTHH 69
Query: 91 DSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC 150
DSC+PCP +G C G LEC G+ K+G LC+EDG +++TA ++S+ +E R+C YA+ LC
Sbjct: 70 DSCQPCPPHGRCVDGNLECVQGFNKYGNLCIEDGLVSQTATKISQLLERRICDQYARALC 129
Query: 151 DGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKEL 210
I +E DI N + L K + +++ + E +S G++E
Sbjct: 130 GQPAKILFQELDILNMADELLSKGFVGLSQDGAKVAKIKVLDSARAFFEKTFSSDGVEEF 189
Query: 211 KCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYH 270
KCP+L+AE Y+PL+C+I QW+S + + + L L +LW +++R+ + R E++Y
Sbjct: 190 KCPDLVAELYRPLTCQIRQWISRNIMSVTAFGVLFSALLWILWTIYKRQALSKRAEQIYA 249
Query: 271 QVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIWKK 317
QVCE+LE+NA+ +K N ECEPWVV S LRDHLL+P+ER++ +WKK
Sbjct: 250 QVCEVLEDNAIDAKIGNSECEPWVVTSWLRDHLLVPQERRNAFLWKK 296
>gi|326526825|dbj|BAK00801.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 387
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 186/319 (58%), Gaps = 23/319 (7%)
Query: 1 MSSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSK-QDLLRLITVVAIASSVA 59
M+ T + RP+P+ W EPP LFP+ D LRL++V+AIA++VA
Sbjct: 1 MARRTVRSRPEPR--------------WPLAAEPPPGLFPAGIHDALRLLSVLAIAAAVA 46
Query: 60 LTCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKL 119
C+ LN PFCDS D P + DSCEPCP NG C G+L C G+++ G++
Sbjct: 47 AACS----VLNRRPAPFCDS----DDPYAAYDSCEPCPENGRCVDGELRCVEGFKRRGRV 98
Query: 120 CVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELD 179
CVEDG + TA ++S ++ R+C +A+ LC G I +++D + + L
Sbjct: 99 CVEDGLLTHTANKISELLQLRICDEHARALCGQPGKILFQQHDTSSMADELLSKDAARLS 158
Query: 180 NPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIV 239
+ K+R +++ +LE+ + ++ KCPEL AE +PLSC+ QW+S++ + ++
Sbjct: 159 DDGIKVVKERVLQSAHGFLETTSTYDKVQAFKCPELAAELRRPLSCQARQWISSNIVFVI 218
Query: 240 PVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRL 299
C L L +LW +++R + R E++Y QVCEILE+NA+ +K N ECEPWVV S L
Sbjct: 219 TFCVLSAALLRILWSIYQRWALSKRAEQIYEQVCEILEDNAISAKVGNSECEPWVVTSWL 278
Query: 300 RDHLLLPKERKDPVIWKKV 318
RDHLL +ER++ ++WKKV
Sbjct: 279 RDHLLSTRERRNTLLWKKV 297
>gi|242037307|ref|XP_002466048.1| hypothetical protein SORBIDRAFT_01g050680 [Sorghum bicolor]
gi|241919902|gb|EER93046.1| hypothetical protein SORBIDRAFT_01g050680 [Sorghum bicolor]
Length = 387
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 122/306 (39%), Positives = 184/306 (60%), Gaps = 14/306 (4%)
Query: 17 SSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPF 76
S S S+ + + EPP LFP+++DL+RL+ V+ IA++ A C+ LN +PF
Sbjct: 3 SRSPRRSARAPFPSAAEPPPGLFPAREDLVRLLAVICIAAAAAAACS----VLNRRPEPF 58
Query: 77 CDSNLLLDSPQSPTD----SCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGR 132
CDS PQSP D SC+PCP NG C G+LEC G+++ K C+EDG +++TA +
Sbjct: 59 CDS------PQSPDDYADDSCQPCPLNGRCVDGELECVQGFKRQDKACIEDGLLSQTANK 112
Query: 133 LSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTME 192
+S ++ +C +A+ LC I +++D+ N ++ L K R +E
Sbjct: 113 ISELLQLWICNQHARALCGQPAEILFQQHDVSNAIDELLSKTPAGLTEAGIQLVKTRVLE 172
Query: 193 TVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLL 252
+ + ++ S +K KCPEL+AE + PL+CR+ QW+S + + + C LL L +L
Sbjct: 173 SSQDFFDTTFTSNKVKVFKCPELVAELHMPLACRVRQWISRNTICVATFCILLAALLWIL 232
Query: 253 WKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDP 312
W ++RRR + R E++Y QVCEILE+NA+ +K N CEPWVV S LRDHLL+P+ERK+
Sbjct: 233 WIIYRRRALSNRAEQIYEQVCEILEDNAINAKIDNSNCEPWVVTSWLRDHLLVPRERKNA 292
Query: 313 VIWKKV 318
+WKKV
Sbjct: 293 FLWKKV 298
>gi|226493918|ref|NP_001143585.1| uncharacterized protein LOC100276289 [Zea mays]
gi|195622902|gb|ACG33281.1| hypothetical protein [Zea mays]
gi|414864219|tpg|DAA42776.1| TPA: hypothetical protein ZEAMMB73_816127 [Zea mays]
Length = 383
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 178/307 (57%), Gaps = 21/307 (6%)
Query: 17 SSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPF 76
S S S+ EPP LFP+++DL+RL+ V+ IA++ A C+ LN +PF
Sbjct: 3 SRSPRQSARPPLPTAAEPPPGLFPAREDLVRLLAVIFIAAAAAAACS----VLNRRPEPF 58
Query: 77 CDSNLLLDSPQSPTD----SCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGR 132
CDS PQ P D SC+PCP NG C G+LEC G+++HGK C+EDG + + A +
Sbjct: 59 CDS------PQWPDDYADDSCQPCPPNGRCVDGELECVQGFKRHGKACIEDGLLIQRANK 112
Query: 133 LSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKI-FELDNPVYLYTKKRTM 191
+ R +C +A+ LC + I +E+++ N ++ L KI L K R +
Sbjct: 113 ILR-----ICDQHARALCGHSAEILFQEHEVSNAID-ELLSKIPAGLTEDGLQLVKTRVL 166
Query: 192 ETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLL 251
E+ G + ++ S +K KCPEL+AE + P+ CR+ QW+ + + C L L +
Sbjct: 167 ESAGGFFDTTFTSSKVKVYKCPELVAELHMPMGCRVRQWIFRNTTSVAIFCILFAALLWI 226
Query: 252 LWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKD 311
LW +++RR + R E++Y QVCEILE+NA+ ++ N E WVV S LRDHLL+P+ERK+
Sbjct: 227 LWIIYKRRTLSNRAEQIYEQVCEILEDNAVNAEIDNSNSEVWVVTSWLRDHLLVPQERKN 286
Query: 312 PVIWKKV 318
+WKKV
Sbjct: 287 AFLWKKV 293
>gi|218191894|gb|EEC74321.1| hypothetical protein OsI_09603 [Oryza sativa Indica Group]
gi|222624004|gb|EEE58136.1| hypothetical protein OsJ_09045 [Oryza sativa Japonica Group]
Length = 339
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 162/288 (56%), Gaps = 34/288 (11%)
Query: 31 MTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPT 90
+ EPP LFP++ DLLRL+ VV+IA++ A C+ LN KP CDS + +
Sbjct: 17 LGEPPPGLFPARDDLLRLLAVVSIAAAAAAACS----LLNRRPKPLCDSG---GAAYTHH 69
Query: 91 DSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC 150
DSC+PCP +G C G LEC G+ K+G LC+EDG +++TA ++S+ +E R+C YA+ LC
Sbjct: 70 DSCQPCPPHGRCVDGNLECVQGFNKYGNLCIEDGLVSQTATKISQLLERRICDQYARALC 129
Query: 151 DGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKEL 210
I +E DI N + L K + +++ + E +S G++E
Sbjct: 130 GQPAKILFQELDILNMADELLSKGFVGLSQDGAKVAKIKVLDSARAFFEKTFSSDGVEEF 189
Query: 211 KCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYH 270
KCP+L+AE Y+PL+C+I QW+S + + + L
Sbjct: 190 KCPDLVAELYRPLTCQIRQWISRNIMSVTAFGVL-------------------------- 223
Query: 271 QVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIWKKV 318
VCE+LE+NA+ +K N ECEPWVV S LRDHLL+P+ER++ +WKKV
Sbjct: 224 -VCEVLEDNAIDAKIGNSECEPWVVTSWLRDHLLVPQERRNAFLWKKV 270
>gi|108705666|gb|ABF93461.1| expressed protein [Oryza sativa Japonica Group]
Length = 251
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 140/242 (57%), Gaps = 7/242 (2%)
Query: 31 MTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPT 90
+ EPP LFP++ DLLRL+ VV+IA++ A C+ LN KP CDS + +
Sbjct: 17 LGEPPPGLFPARDDLLRLLAVVSIAAAAAAACS----LLNRRPKPLCDSG---GAAYTHH 69
Query: 91 DSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC 150
DSC+PCP +G C G LEC G+ K+G LC+EDG +++TA ++S+ +E R+C YA+ LC
Sbjct: 70 DSCQPCPPHGRCVDGNLECVQGFNKYGNLCIEDGLVSQTATKISQLLERRICDQYARALC 129
Query: 151 DGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKEL 210
I +E DI N + L K + +++ + E +S G++E
Sbjct: 130 GQPAKILFQELDILNMADELLSKGFVGLSQDGAKVAKIKVLDSARAFFEKTFSSDGVEEF 189
Query: 211 KCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYH 270
KCP+L+AE Y+PL+C+I QW+S + + + L L +LW +++R+ + R E++Y
Sbjct: 190 KCPDLVAELYRPLTCQIRQWISRNIMSVTAFGVLFSALLWILWTIYKRQALSKRAEQIYA 249
Query: 271 QV 272
QV
Sbjct: 250 QV 251
>gi|108705665|gb|ABF93460.1| expressed protein [Oryza sativa Japonica Group]
Length = 249
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 138/242 (57%), Gaps = 9/242 (3%)
Query: 31 MTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPT 90
+ EPP LFP++ DLLRL+ VV+IA++ A C+ LN KP CDS + +
Sbjct: 17 LGEPPPGLFPARDDLLRLLAVVSIAAAAAAACS----LLNRRPKPLCDSG---GAAYTHH 69
Query: 91 DSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC 150
DSC+PCP +G C G LEC G+ K+G LC+EDG +++TA ++ +E R+C YA+ LC
Sbjct: 70 DSCQPCPPHGRCVDGNLECVQGFNKYGNLCIEDGLVSQTATKI--LLERRICDQYARALC 127
Query: 151 DGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKEL 210
I +E DI N + L K + +++ + E +S G++E
Sbjct: 128 GQPAKILFQELDILNMADELLSKGFVGLSQDGAKVAKIKVLDSARAFFEKTFSSDGVEEF 187
Query: 211 KCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYH 270
KCP+L+AE Y+PL+C+I QW+S + + + L L +LW +++R+ + R E++Y
Sbjct: 188 KCPDLVAELYRPLTCQIRQWISRNIMSVTAFGVLFSALLWILWTIYKRQALSKRAEQIYA 247
Query: 271 QV 272
QV
Sbjct: 248 QV 249
>gi|168000368|ref|XP_001752888.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696051|gb|EDQ82392.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 350
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 149/315 (47%), Gaps = 53/315 (16%)
Query: 6 KKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYL 65
K +R K K + SS S + P SL K+ +++LI ++ A+ V
Sbjct: 41 KDRRHKKKQHHSSGKKRSGT---------PLSL---KEQIVQLIAILLTAAVVYNVTLTA 88
Query: 66 ANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGD 125
+ +PFCD D D+C PCP NG C G+L C G+R+ G LCV D
Sbjct: 89 IGWFEEARRPFCD-----DQQDPLRDNCRPCPENGICRGGELRCIPGFRRQGALCVPDKQ 143
Query: 126 INETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLY 185
I+ A L + E+ IW+ E ++ LD L
Sbjct: 144 IDRNAQNL------------------------IPEHQIWDASEEEKM----GLDAENVLL 175
Query: 186 TKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWV--STHALIIVPVCS 243
++ ++ V L R + +G++E +CP LA Y+P+ CRI + + + + + I
Sbjct: 176 VHEKALQLVRDKLNIRIDDFGLREFRCPIALALTYQPIGCRIRKLIKANIYGIFIASFIV 235
Query: 244 LLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHL 303
+ V +L + RR R EELY QVCE+LEE + S +GE + WVVASRLRDHL
Sbjct: 236 IFVSVYILRYLQSRR--LVTRAEELYMQVCEVLEERSTGS---SGETK-WVVASRLRDHL 289
Query: 304 LLPKERKDPVIWKKV 318
LLP ERK +WK+V
Sbjct: 290 LLPSERKVATLWKEV 304
>gi|297721721|ref|NP_001173224.1| Os03g0100050 [Oryza sativa Japonica Group]
gi|255674135|dbj|BAH91952.1| Os03g0100050 [Oryza sativa Japonica Group]
Length = 243
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/291 (32%), Positives = 140/291 (48%), Gaps = 71/291 (24%)
Query: 31 MTEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPT 90
+ EPP LFP++ DLLRL+ VV+IA++ A C+ LN KP CDS
Sbjct: 17 LGEPPPGLFPARDDLLRLLAVVSIAAAAAAACS----LLNRRPKPLCDSG---------- 62
Query: 91 DSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC 150
+ H G LEC G+ K+G LC+EDG +++TA ++
Sbjct: 63 ------GAAYTHHDGNLECVQGFNKYGNLCIEDGLVSQTATKI----------------- 99
Query: 151 DGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKEL 210
+E DI N + L K + +++ + E +S G++E
Sbjct: 100 -------FQELDILNMADELLSKGFVGLSQDGAKVAKIKVLDSARAFFEKTFSSDGVEEF 152
Query: 211 KCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYH 270
KCP+L+AE Y+PL+C+I QW+S + + + L
Sbjct: 153 KCPDLVAELYRPLTCQIRQWISRNIMSVTAFGVL-------------------------- 186
Query: 271 QVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIWKKVFFL 321
VCE+LE+NA+ +K N ECEPWVV S LRDHLL+P+ER++ +WKKV L
Sbjct: 187 -VCEVLEDNAIDAKIGNSECEPWVVTSWLRDHLLVPQERRNAFLWKKVITL 236
>gi|326509395|dbj|BAJ91614.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 257
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 85/245 (34%), Positives = 134/245 (54%), Gaps = 23/245 (9%)
Query: 1 MSSSTKKKRPKPKSNSSSSSSSSSSSSWSWMTEPPQSLFPSK-QDLLRLITVVAIASSVA 59
M+ T + RP+P+ W EPP LFP+ D LRL++V+AIA++VA
Sbjct: 1 MARRTVRSRPEPR--------------WPLAAEPPPGLFPAGIHDALRLLSVLAIAAAVA 46
Query: 60 LTCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKL 119
C+ LN PFCDS D P + DSCEPCP NG C G+L C G+++ G++
Sbjct: 47 AACS----VLNRRPAPFCDS----DDPYAAYDSCEPCPENGRCVDGELRCVEGFKRRGRV 98
Query: 120 CVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELD 179
CVEDG + TA ++S ++ R+C +A+ LC G I +++DI + + L
Sbjct: 99 CVEDGLLTHTANKISELLQLRICDEHARALCGQPGKILFQQHDISSMADELLSKDAARLS 158
Query: 180 NPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIV 239
+ K+R +++ +LE+ + ++ KCPEL AE +PLSC+ QW+S++ + ++
Sbjct: 159 DDGIKVVKERVLQSAHGFLETTSTYDKVQAFKCPELAAELRRPLSCQARQWISSNIVFVI 218
Query: 240 PVCSL 244
C L
Sbjct: 219 TFCVL 223
>gi|357121072|ref|XP_003562246.1| PREDICTED: uncharacterized protein LOC100842207 [Brachypodium
distachyon]
Length = 330
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 147/309 (47%), Gaps = 92/309 (29%)
Query: 20 SSSSSSSSWSWMTEPPQSLFPS-KQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCD 78
S S S WS EPP LFP+ D LRL+ V+AIA++ A C+ L+ N PFCD
Sbjct: 13 SKSRSKPLWS---EPPPGLFPTGAHDFLRLLAVLAIAAAAAAACSLLSRRPN----PFCD 65
Query: 79 SNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRL----- 133
S + + GKLEC G++K+GK+CVEDG + +TA ++
Sbjct: 66 SQ--------------------QHYDGKLECLQGFKKYGKICVEDGLLTQTANKIFQQLD 105
Query: 134 -SRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTME 192
S ++ L + +A DG + K R +E
Sbjct: 106 ISNLADDLLSKDHAHLSDDGIKLV------------------------------KDRVLE 135
Query: 193 TVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLL 252
+ +LE +Y E KCPEL+AE ++PLSC++ QW+ + + +V C L
Sbjct: 136 SAHGFLE-MVPTYNKVEAKCPELVAELHRPLSCQVRQWIFRNIISVVTSCIL-------- 186
Query: 253 WKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDP 312
VCEILE+NA+ +K N ECEPWVV S LRDHLL+ +ER++
Sbjct: 187 -------------------VCEILEDNAMNAKIGNSECEPWVVTSWLRDHLLVTRERRNA 227
Query: 313 VIWKKVFFL 321
V+WKKV L
Sbjct: 228 VLWKKVHSL 236
>gi|414864218|tpg|DAA42775.1| TPA: hypothetical protein ZEAMMB73_816127 [Zea mays]
Length = 169
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 52/74 (70%)
Query: 245 LVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLL 304
L +LW +++RR + R E++Y QVCEILE+NA+ ++ N E WVV S LRDHLL
Sbjct: 6 FAALLWILWIIYKRRTLSNRAEQIYEQVCEILEDNAVNAEIDNSNSEVWVVTSWLRDHLL 65
Query: 305 LPKERKDPVIWKKV 318
+P+ERK+ +WKKV
Sbjct: 66 VPQERKNAFLWKKV 79
>gi|449526940|ref|XP_004170471.1| PREDICTED: uncharacterized LOC101208017, partial [Cucumis sativus]
Length = 124
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/47 (72%), Positives = 41/47 (87%)
Query: 272 VCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKERKDPVIWKKV 318
VCEILEENAL S + +CE WVVASRLRDHLLLP+ER++P++WKKV
Sbjct: 1 VCEILEENALTSTRNSSQCESWVVASRLRDHLLLPRERRNPLLWKKV 47
>gi|452820092|gb|EME27139.1| hypothetical protein Gasu_52420 [Galdieria sulphuraria]
Length = 428
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/298 (24%), Positives = 127/298 (42%), Gaps = 29/298 (9%)
Query: 32 TEPPQSLFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPTD 91
T P+S P ++L+++ +A S+ Y + + PFCDSN +
Sbjct: 117 TPQPKSKAPEHFSGIKLLSITILAVSIFFAFTYYSPPIQL---PFCDSN-----NSKVLE 168
Query: 92 SCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCD 151
C PCPS+G C G+L C HG+ + CVED + +E L + LC
Sbjct: 169 DCRPCPSHGTCSAGELTCDHGFIESAGKCVEDKTFSVVTKDTKNRIEAILRKRAGMSLC- 227
Query: 152 GTGSIWVEENDIWNDLEGHELMK--IFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKE 209
GTG ++ +++ +L K + + L + + +S E
Sbjct: 228 GTG-------EVQKEMDSAQLRKAVLLSFGTKLPLEEIRFNRAFTEAFYTIVNDSVSSDE 280
Query: 210 LKCPE----LLAEHYKP---LSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYFA 262
+C E + + P LSCR+ +V ++ S+L+G L + RR+
Sbjct: 281 FECREEGERIWCQALHPSLSLSCRLRFFVLQEWWKLLITMSILIGISRLYFWFRLRRWID 340
Query: 263 IRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLLLPKE--RKDPVIWKKV 318
+V+ + +V L ++ ++ ++ + VV +RLRD LL + + VIW+KV
Sbjct: 341 KQVKTVTEEVYRQLTASSQLTSPLSERKK--VVVTRLRDDLLHDQSLLKWKDVIWRKV 396
>gi|123388850|ref|XP_001299627.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121880519|gb|EAX86697.1| hypothetical protein TVAG_376660 [Trichomonas vaginalis G3]
Length = 337
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 96/223 (43%), Gaps = 30/223 (13%)
Query: 48 LITVVAIASSVALTCNYLA---NFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQ 104
++ + IA VAL+ L N+L + K FCDS ++PT C CPSN C +
Sbjct: 65 IVNKLFIALFVALSAIVLIIAFNYLKPSKKLFCDS-------ENPTGLCIKCPSNAICSE 117
Query: 105 GKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIW 164
GK +C G++ + C+ D D + L + L + +F C + ++ ND+
Sbjct: 118 GKAKCHEGFKLMHRRCIYDDDDSIYISSLYEYEIKLLEKQAGRFDCRLDKTKYITRNDLL 177
Query: 165 NDLEGHELMKIFELDNPVYLYTKKRTM----ETVGRYLESRTNSYGMKELKCPELLAEHY 220
L H K +++ YL K T+G E Y ++
Sbjct: 178 IHLIRHSKFKNVDIE---YLSNKAVDHLVYENTIGNVTEDNQELYYSTNMQ--------- 225
Query: 221 KPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLW----KVHRRR 259
KP+SC + Q + +H + + + +L+ + ++ K +RR+
Sbjct: 226 KPISCIVRQEIESHIFLTISISIILLSTAIYIYVQSVKNNRRK 268
>gi|209879267|ref|XP_002141074.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209556680|gb|EEA06725.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 600
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 73/149 (48%), Gaps = 18/149 (12%)
Query: 38 LFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCP 97
+FPS + ++ L ++ + S + N + F+ FCDS+L S + SC+ CP
Sbjct: 216 IFPSLKKIIILSSITLLFSIGIILYNKYSIFI--LEPKFCDSDL--KSKDLKSSSCKICP 271
Query: 98 SNGECHQGKLECFHGYRK-----HGK---LCVEDGDINETAGRLSRWVENRLCRAYAQFL 149
NG C G L C Y+K HGK LC+ D + ++ A + ++ +RL + L
Sbjct: 272 VNGHCKYGILNCNFQYKKALRYIHGKWTVLCIYDDEAHDLAEEMLLYITSRLRKLKGYNL 331
Query: 150 CDGTGSIWVEENDIWNDLEGHELMKIFEL 178
C ++ +N+ + L HE+ ++ +L
Sbjct: 332 C------YIRDNNNLSKLSEHEINEMIKL 354
>gi|449018457|dbj|BAM81859.1| hypothetical protein CYME_CMP246C [Cyanidioschyzon merolae strain
10D]
Length = 1117
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 35/108 (32%), Positives = 47/108 (43%), Gaps = 4/108 (3%)
Query: 68 FLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDIN 127
+L S PFCDS+ +S P C PCP +G C G L C GY G C D D+N
Sbjct: 742 YLVLPSIPFCDSS---ESGSGPVLGCRPCPEHGICVGGVLTCRDGYVVLGSTCAPDRDLN 798
Query: 128 ETAGRLSRWVENRLCRAYAQFLCDGTGSIWV-EENDIWNDLEGHELMK 174
A + V L + ++ C W E ++ LE H M+
Sbjct: 799 RYAHIIQGQVHRLLGESAGRYRCGERAIRWKWTERELREALEQHPYME 846
>gi|328876933|gb|EGG25296.1| hypothetical protein DFA_03545 [Dictyostelium fasciculatum]
Length = 2022
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/256 (22%), Positives = 99/256 (38%), Gaps = 45/256 (17%)
Query: 92 SCEPCPSNGECHQGK-LECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLC 150
SC PCP G C +G + CF GY K G C+ D + ET + ++ L +F C
Sbjct: 1707 SCMPCPERGVCFKGSFVSCFDGYIKQGDECIVDPNKVETMSTVLNKIDYVLRVHKGKFEC 1766
Query: 151 DGTGS---------------IWVEENDIWND-------LEGHELMKIFELDNPVYLYTKK 188
T S W+++ DI + L+G + I EL +
Sbjct: 1767 GVTESFEMPYERLEFEFDRLFWIKKADISDSFKVLTSYLDGRQGAPIKEL-----TVNDQ 1821
Query: 189 RTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGC 248
+ + + ES + Y ++ +PL C+ + + + +P+ LV
Sbjct: 1822 KLADQILSTYESDSQQYYY------HTDSDSIRPLWCQTFITIKSIIMGYLPIIISLVAS 1875
Query: 249 LLLL----WKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWVVASRLRDHLL 304
+ + R++ + V+E Y+ + IL+ A N E PW+ L+D ++
Sbjct: 1876 YVFYRYYKFSQKRKQQELLIVDECYNDLIRILQNRA-----TNYEV-PWISEILLKDDII 1929
Query: 305 LPKERKDPV-IWKKVF 319
K + IW V
Sbjct: 1930 GEDRSKKALKIWDMVL 1945
>gi|290994250|ref|XP_002679745.1| hypothetical protein NAEGRDRAFT_57439 [Naegleria gruberi]
gi|284093363|gb|EFC47001.1| hypothetical protein NAEGRDRAFT_57439 [Naegleria gruberi]
Length = 511
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 74/189 (39%), Gaps = 16/189 (8%)
Query: 89 PTDSCEPCPSNGECHQGKLE-CFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQ 147
P D+C PCP NG C G+L C Y +CVED I A ++N L +
Sbjct: 263 PRDNCLPCPENGYCSDGRLVLCQKPYVIKNAVCVEDSRITLNALEFITELQNELSSKSGK 322
Query: 148 FLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRTNSYGM 207
F+C S DI L +++ +E V + ++S ++ M
Sbjct: 323 FICGRAESNSTSLEDIKAKLTVKLAGSGVAVED---------VLEQVTKLVKSSPQNFQM 373
Query: 208 K----ELKCPELL--AEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYF 261
E E++ P CRI + + +I+ + +L V L+ + ++
Sbjct: 374 NIAPVEGSDKEMIFSLNPTLPFDCRIQKATQENKGVIITILTLAVAILIANFYKNKVERE 433
Query: 262 AIRVEELYH 270
VE+L+
Sbjct: 434 QNDVEDLFR 442
>gi|66363172|ref|XP_628552.1| cysteine rich protein with two potential transmembrane domain
regions [Cryptosporidium parvum Iowa II]
gi|46229565|gb|EAK90383.1| cysteine rich protein with two potential transmembrane domain
regions [Cryptosporidium parvum Iowa II]
Length = 565
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 12/115 (10%)
Query: 48 LITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKL 107
+I +VA+ AL +Y F N + FC+SN+ + SP++ C CP NG C G L
Sbjct: 231 IIILVALFIFFALYNHYYY-FFNEPN--FCNSNIYGEKTLSPSN-CIKCPPNGHCKDGNL 286
Query: 108 ECFHGYRKHGK--------LCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTG 154
+C Y+K K +C+ D + + A + ++ N+L + C G
Sbjct: 287 KCNTQYKKAMKYLNNRWQIICIYDNEAFDLAEEMLTFITNKLRKLRGNRACSGNS 341
>gi|123439914|ref|XP_001310723.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121892505|gb|EAX97793.1| hypothetical protein TVAG_157780 [Trichomonas vaginalis G3]
Length = 275
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/179 (20%), Positives = 85/179 (47%), Gaps = 22/179 (12%)
Query: 76 FCDSNLLLDSPQSPTDSCEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSR 135
F ++ +SP CP NG C+ G+L C +G+R+ G C +D + + R+ +
Sbjct: 38 FNNNRTRFESPDRYFKGAVKCPENGYCYSGQLFCKNGFRRRGYKCEKDPNAIDVDDRIKK 97
Query: 136 WVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMETVG 195
+ + R++ C E++ I+N + +++K ++ N + + G
Sbjct: 98 II--KYIRSHPDTHC--------EKSIIFNFEDFKDMLK--DISNDL--------TDNFG 137
Query: 196 RYLESRTNSYGMKELKCPELLAEH-YKPLSCRIHQWVS-THALIIVPVCSLLVGCLLLL 252
+ N+ +K E+ + + + L CRI +++ ++L+I+ + ++++ C ++L
Sbjct: 138 AFRGMLYNATDIKINSRSEIYSTNPDQTLQCRIEKYIDENYSLVILFLINVILICFIIL 196
>gi|67623503|ref|XP_668034.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54659206|gb|EAL37794.1| hypothetical protein Chro.70437 [Cryptosporidium hominis]
Length = 565
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 12/115 (10%)
Query: 48 LITVVAIASSVALTCNYLANFLNSTSKPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGKL 107
+I +VA+ AL +Y F FC+SN+L + +P++ C CP NG C G L
Sbjct: 231 IIILVALFIFFALYNHYYYFFHEPN---FCNSNILSEKTLNPSN-CIKCPPNGHCKDGNL 286
Query: 108 ECFHGYRKHGK--------LCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTG 154
+C Y+K K +C+ D + + A + ++ N+L + C G
Sbjct: 287 KCNTQYKKAIKYLNNRWQIVCIYDNEAFDLAEEMLTFITNKLRKLRGNRACSGNS 341
>gi|301113508|ref|XP_002998524.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111825|gb|EEY69877.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 384
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 4/85 (4%)
Query: 74 KPFCDSNLLLDSPQSPTDSCEPCPSNGECHQGK-LECFHGYRKHGKLCVEDGDINETAGR 132
+PF + + + Q+ T C+PCP G C G + C Y H LC E+ +I + +
Sbjct: 100 QPFIGATAM--ATQTSTPLCQPCPVYGNCLNGSVISCAPPYLLHSGLCKENPEIQQNLDQ 157
Query: 133 LSRWVENRLCRAYAQFLCDGTGSIW 157
L+ ++ + A+ +CD S+W
Sbjct: 158 LAVAIQKFVVEKAAKIVCDNV-SLW 181
>gi|146163292|ref|XP_001011171.2| hypothetical protein TTHERM_00145310 [Tetrahymena thermophila]
gi|146146105|gb|EAR90926.2| hypothetical protein TTHERM_00145310 [Tetrahymena thermophila
SB210]
Length = 727
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 61/264 (23%), Positives = 98/264 (37%), Gaps = 30/264 (11%)
Query: 38 LFPSKQDLLRLITVVAIASSVALTCNYLANFLNSTSK-PFCDSNLLLDSPQSPTDSCEPC 96
+F SK ++ ++ + + + +FL T + +CDS L + Q SC PC
Sbjct: 374 IFFSKIEISHIVMFIMVGLFCIFFLQNIYDFLTHTPELEYCDSTLRQE--QIEGSSCIPC 431
Query: 97 PSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSI 156
P N C ++C H + K CV + I R V L AQ+ C + S
Sbjct: 432 PQNAICLNNNIKCKHSFVLIDKKCVTNPQIEPNTIRYLSTVVQNLQYLQAQYHCYDSNSS 491
Query: 157 WVEEN-----------DIWNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESRT--- 202
N I N L H + D T+ V YL+S++
Sbjct: 492 GYGNNLSRPYKEHKWESIVNILTIHHSKDLLFYD----------TLTKVKDYLKSKSIRE 541
Query: 203 -NSYGMKELKCPELLAEHYKPLSCRIHQWVSTHALIIVPVCSLLVGCLLLLWKVHRRRYF 261
L+ + CRI W + II+ V + + LL++V R F
Sbjct: 542 QFDINYDSLRDTYYVTTFNPSYYCRIRLWSIQNVNIIIAVLCIFF-LIFLLYQVAAVRRF 600
Query: 262 AIRVEELYHQVCEILEENALMSKS 285
I +LY +I+ EN + ++
Sbjct: 601 -IYFFKLYRIQQKIVNENKIQQQN 623
>gi|403348215|gb|EJY73542.1| hypothetical protein OXYTRI_05327 [Oxytricha trifallax]
Length = 932
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 92 SCEPCPSNGECHQGK-LECFHGYRKHGKLCVEDGDINETAGRLSRWVENRL 141
+C CP++G C +GK + C GY K G CVE+ +IN+ A + R + RL
Sbjct: 706 NCIDCPTHGICQEGKFIACSPGYLKEGNRCVENYEINQQANYILREFQYRL 756
>gi|348682419|gb|EGZ22235.1| hypothetical protein PHYSODRAFT_542892 [Phytophthora sojae]
Length = 425
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 57/143 (39%), Gaps = 29/143 (20%)
Query: 42 KQDLLRLITVVAIASSVALTCNYLANFLNSTSKPFCDS--------NLLLDSPQSPTD-- 91
+Q LLRLI+ + + + ++ L S P+CDS + +L P D
Sbjct: 93 RQFLLRLISGALLLCVMVVAAPFVKKLLES-PLPYCDSEWVDANDGSFVLADPADHFDRS 151
Query: 92 ----------------SCEPCPSNGECHQGK-LECFHGYRKHGKLCVEDGDINETAGRLS 134
+C+PCP G C G + C Y LC ED ++ E+ +L+
Sbjct: 152 KALQPFISSTAVAARPTCQPCPVYGNCLNGSVISCAPPYVLQYGLCKEDPEVQESLDQLA 211
Query: 135 RWVENRLCRAYAQFLCDGTGSIW 157
++ + A CD S+W
Sbjct: 212 LSIQKFVVEKAAANACDSV-SLW 233
>gi|440794379|gb|ELR15540.1| hypothetical protein ACA1_163890 [Acanthamoeba castellanii str.
Neff]
Length = 563
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 6/84 (7%)
Query: 49 ITVVAIASSVAL-TCNYLANFLNSTSKPFCDSNLLLDS---PQSPTDSCEPCPSNGECHQ 104
+ V+A+A L L + L + FCD+ + D+ P+ P C CP+ G C
Sbjct: 234 VLVLAVAVGAGLFMYGQLPSLLQAEKPYFCDTTGVSDNRQNPREPVVPCVDCPNFGVCKD 293
Query: 105 GKL-ECFHGYRKHGKL-CVEDGDI 126
G+L +C GY G + C D +I
Sbjct: 294 GRLIKCMRGYEVSGGVHCARDKEI 317
>gi|210075905|ref|XP_503824.2| YALI0E11539p [Yarrowia lipolytica]
gi|199426902|emb|CAG79417.2| YALI0E11539p [Yarrowia lipolytica CLIB122]
Length = 921
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 62/271 (22%), Positives = 112/271 (41%), Gaps = 49/271 (18%)
Query: 93 CEPCPSNGECHQG-KLEC------------FHGYRKHGKLCVEDGDINETAGRLSRWVEN 139
C PCP N EC++G +LEC F+G+ CV D I A ++ ++
Sbjct: 623 CTPCPENAECYRGFELECHEGFVAEAHPLSFNGFLPLPPKCVYDTRIVSVAQQVLDYL-- 680
Query: 140 RLCRAYAQ---FLCDGTG-SIWVEENDIWNDLEGHELMKIFELDNPVYLYTKKRTMET-- 193
RL +YA+ + TG + V+ +++L + + D+ V+ K E
Sbjct: 681 RLQNSYAECGSPESEATGFDVQVKTEFTYDELYDYGRKNVHAFDDKVWAQAMKAVGEQED 740
Query: 194 --VG-----------RYLESRTNSYGMKELKCPEL---LAEHYKPLSCR----IHQWVST 233
+G R +E Y + E +++ Y L C+ + W++
Sbjct: 741 IIIGSIRISESQRATRLVEGEPEVYYEDVVVAQETYKSISDAYFTLRCKCDKFVRGWLAQ 800
Query: 234 HALIIVPVCSLLVGCLLLLWK--VHRRRYFAIRVEELYHQVCEILEENAL-MSKSVNGEC 290
+ I+ V L+ ++ K ++R+ V EL +QV E++ A G
Sbjct: 801 NQFYILGV--LIAASAAIVGKEWYNQRKREQQHVSELVNQVNEMMMTQAREHDADSTGLK 858
Query: 291 EP-WVVASRLRDHLLL--PKERKDPVIWKKV 318
+P ++ +LRD LL+ ++ +WKKV
Sbjct: 859 QPRFLPVPQLRDELLVDTATSKRKASLWKKV 889
>gi|123396386|ref|XP_001300898.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121882007|gb|EAX87968.1| hypothetical protein TVAG_033600 [Trichomonas vaginalis G3]
Length = 196
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 24/47 (51%)
Query: 93 CEPCPSNGECHQGKLECFHGYRKHGKLCVEDGDINETAGRLSRWVEN 139
C PCP +G+C + + EC Y+K K C +T LS VEN
Sbjct: 77 CTPCPKHGKCTRKEFECKEPYKKISKGCYNTTSTVDTIRELSIDVEN 123
>gi|344289570|ref|XP_003416515.1| PREDICTED: butyrophilin subfamily 1 member A1 [Loxodonta africana]
Length = 526
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 49/126 (38%), Gaps = 16/126 (12%)
Query: 143 RAYAQFLCDGTGSIWVEENDI-WNDLEGHELMKIFELDNPVYLYTKKRTMETVGRYLESR 201
R Q C G W E + W EG + E NP ++ + TV L R
Sbjct: 157 RGEIQLECSSVG--WYPEPQVQWRTPEGEKFPSTSESSNP-----DEKGLFTVAASLIIR 209
Query: 202 TNSYGMKELKCPELLAEHYK--------PLSCRIHQWVSTHALIIVPVCSLLVGCLLLLW 253
NS G LL K P R+ W+ A+I++ + L +G + W
Sbjct: 210 DNSLGNVSCCIRNLLLGQEKEVGISIPAPFFPRLTPWMVAMAVILMVLGLLTIGSIFFTW 269
Query: 254 KVHRRR 259
+V+R+R
Sbjct: 270 RVYRQR 275
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.135 0.432
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,834,742,720
Number of Sequences: 23463169
Number of extensions: 247574575
Number of successful extensions: 1378341
Number of sequences better than 100.0: 101
Number of HSP's better than 100.0 without gapping: 45
Number of HSP's successfully gapped in prelim test: 56
Number of HSP's that attempted gapping in prelim test: 1375832
Number of HSP's gapped (non-prelim): 1556
length of query: 342
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 199
effective length of database: 9,003,962,200
effective search space: 1791788477800
effective search space used: 1791788477800
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)