BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017655
(368 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa]
gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa]
Length = 659
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 175/391 (44%), Positives = 222/391 (56%), Gaps = 67/391 (17%)
Query: 11 NFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQKPK 69
N +NGQ S KS +K SGKN NR + SQ G QKS F+ M NGKRK F N+ K
Sbjct: 303 NTARNGQSSTPKSKWEKPSGKNLKNNRNRGRSQSGHQKSDFNCMDNGKRKLEFSNEHGRK 362
Query: 70 GHGNGKAIKFGLANQMNQPQEKKRK-SLALTYTEQEVKQWCEERRKNYPTKANINKKLTE 128
G+GN + KFG + +Q E+K K S YTEQE+KQW E RRK+YPTK +I KK E
Sbjct: 363 GNGNERVAKFGRTDLTDQATEEKSKPSRTFFYTEQEIKQWRESRRKHYPTKTSIEKKQME 422
Query: 129 KQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS---------------- 172
VID+EA R +QLKEILAKQAELGVEVAEIP YLLDS
Sbjct: 423 ------VIDREANFRHKQLKEILAKQAELGVEVAEIPPDYLLDSEKLGVEVAEIPPPQVL 476
Query: 173 ---------------KKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 217
+KQ R +N L K+G+F N +DKRGR+ +K + +K+ A++
Sbjct: 477 SSEKLGVEVAEIPPPQKQEHGREDNRRSLTKKGKFWNKHDKRGRFNKKGRSAKQVGSANE 536
Query: 218 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVK 277
+ +KPTLL+KLLS D+KRDK LLQ FRFM NSFFKDWPEKP+ FP V+VK
Sbjct: 537 E---------RKPTLLEKLLSTDIKRDKRQLLQVFRFMVANSFFKDWPEKPLKFPSVVVK 587
Query: 278 DGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFEDENDDEEEDND 337
+ G E+VE+KS L+GEE SE + + E+F + DD E +
Sbjct: 588 EDGYEDEIVEKKSSLVGEEVSEDRNNTIAENF-----------------GDRDDNIEHDA 630
Query: 338 NGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
E ++ GK + DE R+ EEGEI+D
Sbjct: 631 QVELGNCFVRGKCDIVDEVDRV--EEGEIVD 659
>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
Length = 605
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 160/315 (50%), Positives = 199/315 (63%), Gaps = 16/315 (5%)
Query: 11 NFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPK 69
NF+++ + S N K S KNF+ N + K G +KS H M NG+ K G N+ + K
Sbjct: 252 NFVKDAGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENRGK 311
Query: 70 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
G N A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK EK
Sbjct: 312 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 371
Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 187
+NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN
Sbjct: 372 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 431
Query: 188 KRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQ 234
K+ RFQN +KR R+ RK D F+K+Q+LA DS SN NKKK TLLQ
Sbjct: 432 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTLLQ 491
Query: 235 KLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLG 294
KLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G GEVV+ KS
Sbjct: 492 KLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSPTS 551
Query: 295 EEDSEGKGRRMVEHF 309
+ +G + E F
Sbjct: 552 KGVPQGGRKTXAEEF 566
>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa]
gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa]
Length = 674
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 184/408 (45%), Positives = 222/408 (54%), Gaps = 81/408 (19%)
Query: 11 NFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQKPK 69
N +NGQGS S SGKNF N +E SQ G QKS FH+M NGKRK GF N K
Sbjct: 298 NPAKNGQGSTPNSKQGIFSGKNFKNNPKREQSQSGHQKSEFHRMDNGKRKLGFSNKHGGK 357
Query: 70 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
G GN +A KFG ++ NQ E+KR + YTEQE+KQW E RRK++PTK NI KK TEK
Sbjct: 358 GKGNERAAKFGRSDPSNQVMEQKRTHI---YTEQEIKQWRESRRKHFPTKTNIEKKQTEK 414
Query: 130 QSNSDVIDKEAKMRRE-------------QLKEILAKQAELGVEVAEIPSYY-------- 168
+S VIDKEA RR+ QLKEILAKQAELGVEVAEIP Y
Sbjct: 415 LIDSGVIDKEANFRRKVLNALPRKLDDKLQLKEILAKQAELGVEVAEIPPEYMLDSEKLG 474
Query: 169 ----------------------------LLDSKKQVRARVENTMPLNKRGRFQNNYDKRG 200
LLDS+KQ R +N L K+GRF N +D+RG
Sbjct: 475 VEVAETPLSYLLDSEKLGVEAAEIPPHHLLDSEKQEHGREDNRRSLTKKGRFWNKHDRRG 534
Query: 201 RYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSF 260
RYKRK + + L +++ +KPTLL+KLL AD+KRDK LLQ FRFM NSF
Sbjct: 535 RYKRKGRSDMQLGLENEE---------RKPTLLEKLLCADIKRDKHRLLQVFRFMVANSF 585
Query: 261 FKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHH 320
FKDW +KP+ FP V+VK+ G E EEK L+GEE+SE VE F
Sbjct: 586 FKDWSDKPLKFPSVVVKEDGCKDEPQEEKPSLVGEEESEVPNNTTVEDF----------- 634
Query: 321 NIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
+ DD +E + E + GK + DE R+ EEGEIID
Sbjct: 635 ------GDRDDGDEHDAQVEPGNGSVMGKCDIVDEVYRV--EEGEIID 674
>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis]
gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis]
Length = 273
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 128/218 (58%), Positives = 166/218 (76%), Gaps = 4/218 (1%)
Query: 94 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAK 153
+S AL YTEQE+K W EERRKN+P+KANI K +E+ +NS IDKEAK+RRE+LKEILAK
Sbjct: 9 RSPALIYTEQEIKLWREERRKNFPSKANIEKMCSERVTNSGGIDKEAKLRRERLKEILAK 68
Query: 154 QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQK 213
QAELGVEVAEIPS+YL DS+KQV+ + ++ K+GR ++ +D+RGRY +KD +++
Sbjct: 69 QAELGVEVAEIPSHYLSDSEKQVKVKKDSRRSAPKKGRSRHKHDRRGRYNKKDGLTQQNT 128
Query: 214 LADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPL 273
LA KDSSN SSF++ KPTLLQKLLSA+V++DK HLLQ FRFM +NSFF D PEKP+ FP
Sbjct: 129 LAKKDSSNGSSFSRTKPTLLQKLLSAEVRKDKHHLLQVFRFMVMNSFFDDGPEKPLKFPS 188
Query: 274 VMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEH 311
V+VK+ EVV EK+ + G+ SE + VE+ H
Sbjct: 189 VVVKE----DEVVAEKTSITGKGISEVSNKTAVENSGH 222
>gi|359487059|ref|XP_002271061.2| PREDICTED: uncharacterized protein LOC100252331 [Vitis vinifera]
Length = 1031
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 136/236 (57%), Positives = 163/236 (69%), Gaps = 15/236 (6%)
Query: 70 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
G N A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK EK
Sbjct: 738 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 797
Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 187
+NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN
Sbjct: 798 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 857
Query: 188 KRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQ 234
K+ RFQN +KR R+ RK D F+K+Q+LA KDS SN NKKK TLLQ
Sbjct: 858 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGKDSGDMNASNQPPLNKKKQTLLQ 917
Query: 235 KLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 290
KLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV+VK+ G GEVV+ KS
Sbjct: 918 KLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVVVKETGCQGEVVDRKS 973
>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
Length = 599
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 156/352 (44%), Positives = 214/352 (60%), Gaps = 45/352 (12%)
Query: 31 KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 90
+NF++N K GFQK+ H + N K+KFGF QK KG N + KF N +Q +E
Sbjct: 279 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 334
Query: 91 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 150
+KR SL+L YT+QE++QW E RRKNYP+ NI KKLT KQ+N ++DKEAK+ R++LKEI
Sbjct: 335 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 393
Query: 151 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV-------ENTMPLNKRGRFQNNYDK 198
LAKQAELGVEVAEIP YL D++KQ R E ++ ++ QN +K
Sbjct: 394 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASI---EKENSQNRLNK 450
Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 258
RGR K+K++ K+ K +K SN K++PTLLQKLL ADV++DKS LLQ RF +N
Sbjct: 451 RGRCKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMN 509
Query: 259 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRE 316
SFFK+WP KP+ FP V VK+ VV+E SL G + E +VE+ HD
Sbjct: 510 SFFKEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD---- 565
Query: 317 RMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
+ +N+++ +D++ E++K G G +++ EEEGEIID
Sbjct: 566 --------IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 599
>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
Length = 552
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 156/352 (44%), Positives = 214/352 (60%), Gaps = 45/352 (12%)
Query: 31 KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 90
+NF++N K GFQK+ H + N K+KFGF QK KG N + KF N +Q +E
Sbjct: 232 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 287
Query: 91 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 150
+KR SL+L YT+QE++QW E RRKNYP+ NI KKLT KQ+N ++DKEAK+ R++LKEI
Sbjct: 288 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 346
Query: 151 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV-------ENTMPLNKRGRFQNNYDK 198
LAKQAELGVEVAEIP YL D++KQ R E ++ ++ QN +K
Sbjct: 347 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASI---EKENSQNRLNK 403
Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 258
RGR K+K++ K+ K +K SN K++PTLLQKLL ADV++DKS LLQ RF +N
Sbjct: 404 RGRCKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMN 462
Query: 259 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRE 316
SFFK+WP KP+ FP V VK+ VV+E SL G + E +VE+ HD
Sbjct: 463 SFFKEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD---- 518
Query: 317 RMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
+ +N+++ +D++ E++K G G +++ EEEGEIID
Sbjct: 519 --------IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 552
>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana]
gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana]
gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana]
Length = 470
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 177/252 (70%), Gaps = 10/252 (3%)
Query: 43 QGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL--ANQMNQPQEKKRKSLALTY 100
QGFQ+ HQ NGKRK GF D + G GN +K GL ++ N +EKKR S AL Y
Sbjct: 202 QGFQRPQLHQADNGKRKSGFNKDHR--GKGNNNKMKTGLDGSDTGNIAKEKKR-SYALMY 258
Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVE 160
T +EV+QW E RRKNYPTK + KK+ +K ++ ++D+EAKMRR+QL+E+LAKQAELGVE
Sbjct: 259 TPREVQQWREARRKNYPTKFLVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGVE 317
Query: 161 VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQF-SKEQKLADKDS 219
VAE+PS+YL ++ +QV N ++GRFQNN + R+ RKD+F +K+ +L DK S
Sbjct: 318 VAEVPSHYLSNNDEQVNGDRGNNN--GRKGRFQNNRRNKRRHDRKDKFDNKKPRLEDKKS 375
Query: 220 SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDG 279
S SS +KPTLL+KLLSAD+KRDKS LLQ FRFM +NS K++PE+P+ PL+ VK+
Sbjct: 376 SQDSSITTRKPTLLEKLLSADIKRDKSQLLQVFRFMVMNSLLKEFPEQPLKLPLITVKET 435
Query: 280 GAAGEVVEEKSL 291
G + +E+ S+
Sbjct: 436 GCE-DAMEDPSI 446
>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
lyrata]
gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 119/239 (49%), Positives = 170/239 (71%), Gaps = 6/239 (2%)
Query: 43 QGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL-ANQMNQPQEKKRKSLALTYT 101
QGFQ+ HQ NGKRK GF D + G GN +K GL + + ++KR+S AL YT
Sbjct: 203 QGFQRPQLHQADNGKRKSGFNKDHR--GKGNYNKMKNGLDGSDADNIAKEKRRSYALMYT 260
Query: 102 EQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEV 161
++V QW E RRKN+PT+ N+ KK+ +K ++ ++D+EAKMRR+QL+E+LAKQAELG+EV
Sbjct: 261 PKDVNQWREARRKNFPTRLNVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGIEV 319
Query: 162 AEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSN 221
A++PS+YL ++ ++V +N ++ +FQNN K+ R+ RKD+F K +L DK+SS
Sbjct: 320 ADVPSHYLSNTDERVHG--DNGANDGQKRKFQNNRHKQRRHGRKDKFDKTPRLDDKNSSQ 377
Query: 222 TSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGG 280
S KKPTLL+KLLSA++KRDK HLLQ FRFM +NSF K++PE+P+ PL+ V++ G
Sbjct: 378 ESPMTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFLKEFPEQPLKLPLITVEETG 436
>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
Length = 822
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 149/359 (41%), Positives = 215/359 (59%), Gaps = 17/359 (4%)
Query: 15 NGQGSISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHG 72
N + ++ SN K KNF KN+ F GFQKS FH ++NGK+ GF + KG
Sbjct: 476 NIKTNVPNSNWKGSPSKNF-KNKPNRGGFQAGFQKSKFHDVNNGKKGSGFPIEHNGKGPN 534
Query: 73 NGKAIKFGLANQMNQPQEKKR---KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
+G+ +GL +P+E K+ +SL++TYT QE++QW E R+KN+P NI KK +E
Sbjct: 535 SGRGGHYGL-----KPKEHKQQPERSLSVTYTVQEIQQWREARKKNHPFNNNIQKKHSEH 589
Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKR 189
+ I++E R +LKE+LAKQAELGVEVAEIPSYYL +S Q + E +
Sbjct: 590 PKDRKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQA-LQSEGKNKFTDK 646
Query: 190 GRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 249
+FQN ++K+ RK +F+K QK DKD S + S K+KPTLLQKLLS+DVKRDKSHL+
Sbjct: 647 RKFQNKFNKKS--DRKGRFAKRQKFDDKDFSESPSLKKRKPTLLQKLLSSDVKRDKSHLI 704
Query: 250 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHF 309
Q FRFM +NSFFK +KP+ +PLV+VK+ G+ + EEK L G++ +G V+
Sbjct: 705 QVFRFMVMNSFFKHCLDKPLRYPLVVVKEKGSEVD-GEEKYLHTGKDVLKGGNEETVQKI 763
Query: 310 EHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
N + +D+ +D DN+ + + + ++ + + +EEEGEII+
Sbjct: 764 VTFNNDNSHDCEDEDSDDDENDSIVDNNLHKDPSSLVKRQCDSGEGIGKFDEEEGEIIE 822
>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
Length = 553
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 152/355 (42%), Positives = 209/355 (58%), Gaps = 49/355 (13%)
Query: 19 SISKSNSKKVSGKNFIKNRTKEFSQG-FQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 77
+ + SN K KNF + + SQG FQKS F+ + GKR F + K KG N +A
Sbjct: 243 NFANSNWKGSPNKNFKNKQNRGGSQGGFQKSKFNDTNKGKR---FSKEHKGKGPNNERAG 299
Query: 78 KFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVID 137
FGL N QE KR S +LTY+EQE++QW E RRKN+P++ I K+ +E+ +S VID
Sbjct: 300 HFGL----NSKQEPKR-SFSLTYSEQEIQQWREARRKNHPSREKIEKQQSEQSKDSKVID 354
Query: 138 KEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--- 194
+E R +LKE+LAKQAELG+EVAEIPSYYL++ Q + + +F+N
Sbjct: 355 RELLQR--ELKEVLAKQAELGIEVAEIPSYYLMNGTNQGLQSEDKKNSFTDKRKFKNKLM 412
Query: 195 -NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFR 253
N D+R R +K QK AD+D NKKKPTLLQKLLSAD+KRDKSHL Q FR
Sbjct: 413 RNPDRRSRNNKK------QKFADRDLLE----NKKKPTLLQKLLSADIKRDKSHLFQVFR 462
Query: 254 FMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDE 313
F+ NSFFKD+P+KP+ +P V VK+ G+ EV K L G G ++EH
Sbjct: 463 FITANSFFKDYPDKPLVYPPVSVKEMGS--EVYGGKKHLQG-------GEAVLEH----G 509
Query: 314 NRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 368
+E + ++G ++ +D+E+E++D +G+ D +EEEGEII+
Sbjct: 510 TKEIVQKFVKGSDNGHDNEDEESD-----------EGDNEDGINEFDEEEGEIIE 553
>gi|115440647|ref|NP_001044603.1| Os01g0814000 [Oryza sativa Japonica Group]
gi|56785048|dbj|BAD82687.1| unknown protein [Oryza sativa Japonica Group]
gi|113534134|dbj|BAF06517.1| Os01g0814000 [Oryza sativa Japonica Group]
Length = 533
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/186 (48%), Positives = 121/186 (65%), Gaps = 2/186 (1%)
Query: 93 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
RK + Y EVKQW E R+KNYPT NINKKL+E + + + D+EA+MRR++LKE+L
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKKLSESRPDGEKKDEEAQMRRQELKEVLE 340
Query: 153 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-KRGRYKRKDQFSKE 211
KQ ELG E+ E+P YL + + Q R N + RF N D KR RY R D SK
Sbjct: 341 KQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNKRSRYDRNDFQSKR 400
Query: 212 QKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVN 270
K+ ++ + + K ++PTLLQKLLS+DVKRD+ LL TF+FM +N+FF D+P+KP+
Sbjct: 401 PKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVLNNFFSDYPDKPLE 460
Query: 271 FPLVMV 276
FP V V
Sbjct: 461 FPSVKV 466
>gi|242054757|ref|XP_002456524.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
gi|241928499|gb|EES01644.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
Length = 568
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 3/188 (1%)
Query: 92 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 151
+RK + Y EVK+W E R+KNYPT N++KKL+E S+++ DK+A++RR++LKE+L
Sbjct: 295 RRKPPRIIYDANEVKRWVEARKKNYPTSVNVHKKLSEINSDNENKDKDAQLRRQELKEVL 354
Query: 152 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 209
AKQ ELG ++ E+P YL ++ Q N + F N N +KR RY+R S
Sbjct: 355 AKQQELGFDLPELPPGYLSETGDQCIENKNNRKAQCRDSHFGNRSNNNKRPRYERGGFQS 414
Query: 210 KEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 268
K K+ ++ + K ++PTLLQKLLS+D+KRD+ LL F+FM +N+FFKDWP+KP
Sbjct: 415 KRSKVWNRTPCADDAMAKSREPTLLQKLLSSDIKRDRHRLLHVFKFMTLNNFFKDWPDKP 474
Query: 269 VNFPLVMV 276
+ FP V V
Sbjct: 475 LQFPSVKV 482
>gi|223943225|gb|ACN25696.1| unknown [Zea mays]
gi|414880010|tpg|DAA57141.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
Length = 562
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 125/191 (65%), Gaps = 13/191 (6%)
Query: 92 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 151
+RK + Y EVKQW E R+KNYPT N+NKKL+E +S+++ DK+A++RR++LKE++
Sbjct: 290 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQELKEVI 349
Query: 152 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN-------NYDKRGRYKR 204
AKQ ELG + E+P YL + Q +EN N++ +F++ N +KR RY+R
Sbjct: 350 AKQQELGFDPPELPPGYLSEIGDQC---IENKN--NRKTQFRDSHFGNRFNNNKRSRYER 404
Query: 205 KDQFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 263
SK K+ + ++ ++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKD
Sbjct: 405 GGFQSKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKD 464
Query: 264 WPEKPVNFPLV 274
WP++P+ FP V
Sbjct: 465 WPDRPLQFPSV 475
>gi|357125563|ref|XP_003564462.1| PREDICTED: uncharacterized protein LOC100829078 [Brachypodium
distachyon]
Length = 609
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/195 (44%), Positives = 120/195 (61%), Gaps = 15/195 (7%)
Query: 93 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
RK + Y EVKQW E R+KNYPT N+NKKL++ QS+ D+EA+MRR +LKE+LA
Sbjct: 333 RKRPPICYDTNEVKQWVEARKKNYPTSVNVNKKLSQSQSDDQKKDEEAQMRRLELKEVLA 392
Query: 153 KQAELGVEVAEIPSYYLLDSKKQVRARVEN-----------TMPLNKRGRFQNNYDKRGR 201
KQ ELG E+ E+P YL +++ Q + N RGR +KR R
Sbjct: 393 KQKELGFELPELPPGYLSETEGQPKGPQGNDKESKWKTQQRDCRFGNRGR----GNKRQR 448
Query: 202 YKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
Y R D SK + + + + K++PTLLQKLLS+D+KRD+ LL TF+FMA+N+FF
Sbjct: 449 YDRADFQSKRPREWNNSRHDGGAVAKREPTLLQKLLSSDIKRDRHRLLHTFKFMALNNFF 508
Query: 262 KDWPEKPVNFPLVMV 276
++ P+KP+ FP V V
Sbjct: 509 RNLPDKPLEFPSVKV 523
>gi|218189267|gb|EEC71694.1| hypothetical protein OsI_04190 [Oryza sativa Indica Group]
Length = 500
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 122/199 (61%), Gaps = 15/199 (7%)
Query: 93 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 139
RK + Y EVKQW E R+KNYPT NINK KL+E + + + D+E
Sbjct: 237 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 296
Query: 140 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-K 198
A+MRR++LKE+LAKQ ELG E+ E+P YL + + Q R N + RF N D K
Sbjct: 297 AQMRRQELKEVLAKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 356
Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAI 257
R RY R D SK K+ ++ + + K ++PTLLQKLLS+DVKRD+ LL TF+FM +
Sbjct: 357 RSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVL 416
Query: 258 NSFFKDWPEKPVNFPLVMV 276
N+FF D+P+KP+ FP V V
Sbjct: 417 NNFFSDYPDKPLEFPSVKV 435
>gi|296085358|emb|CBI29090.3| unnamed protein product [Vitis vinifera]
Length = 467
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/214 (47%), Positives = 132/214 (61%), Gaps = 11/214 (5%)
Query: 11 NFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPK 69
NF+++ + S N K S KNF+KN + K G +KS H M N + K G N+ + K
Sbjct: 246 NFVKDAGVNNSNPNWKNSSRKNFMKNPKGKNSHWGSRKSQLHHMQNARGKAGISNENRGK 305
Query: 70 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 129
G N A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK EK
Sbjct: 306 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 365
Query: 130 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 187
+NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN
Sbjct: 366 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 425
Query: 188 KRGRFQNNYDKRGRYKRK--------DQFSKEQK 213
K+ RFQN +KR R+ RK D F+K+Q+
Sbjct: 426 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQR 459
>gi|293335934|ref|NP_001167960.1| uncharacterized protein LOC100381676 [Zea mays]
gi|223945137|gb|ACN26652.1| unknown [Zea mays]
gi|414880008|tpg|DAA57139.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
Length = 295
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 92 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRRE-QLKEI 150
+RK + Y EVKQW E R+KNYPT N+NKKL+E +S+++ DK+A++RR+ +LKE+
Sbjct: 22 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQVELKEV 81
Query: 151 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQF 208
+AKQ ELG + E+P YL + Q N + F N N +KR RY+R
Sbjct: 82 IAKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQ 141
Query: 209 SKEQKLADKDS-SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEK 267
SK K+ + ++ ++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++
Sbjct: 142 SKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDR 201
Query: 268 PVNFPLV 274
P+ FP V
Sbjct: 202 PLQFPSV 208
>gi|20161477|dbj|BAB90401.1| P0432B10.22 [Oryza sativa Japonica Group]
Length = 541
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 124/220 (56%), Gaps = 24/220 (10%)
Query: 93 RKSLALTYTEQEVKQWCEERRKNYPTKANINK----------------------KLTEKQ 130
RK + Y EVKQW E R+KNYPT NINK KL+E +
Sbjct: 267 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKASATLCPHQTIREASSTDHLMQKLSESR 326
Query: 131 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 190
+ + D+EA+MRR++LKE+L KQ ELG E+ E+P YL + + Q R N +
Sbjct: 327 PDGEKKDEEAQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDC 386
Query: 191 RFQNNYD-KRGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHL 248
RF N D KR RY R D SK K+ ++ + + K ++PTLLQKLLS+DVKRD+ L
Sbjct: 387 RFGNRADNKRSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRL 446
Query: 249 LQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEE 288
L TF+FM +N+FF D+P+KP+ FP V V + EE
Sbjct: 447 LHTFKFMVLNNFFSDYPDKPLEFPSVKVNQIELESNIAEE 486
>gi|326528619|dbj|BAJ97331.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 625
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 128/218 (58%), Gaps = 20/218 (9%)
Query: 93 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
RK + Y + EVKQW + R+KNYPT+AN+NKKL + Q + D+EA+MRR++LKE++A
Sbjct: 335 RKRPPIIYDKNEVKQWVQARKKNYPTRANVNKKLCQNQLDEQKKDEEAQMRRQELKEVIA 394
Query: 153 KQAELGVEVAEIPSYYLLDSKKQVR----------------ARVENTMPLNKRGRFQNNY 196
KQ ELG+E+ E+P YL D++ Q R RGR
Sbjct: 395 KQKELGLELPELPPGYLSDNEGQPRDPQKESNWKTRQGGGRFGNRGRGRGQGRGRGSGRD 454
Query: 197 DKRGRY-KRKDQFSKEQKLADKDS---SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTF 252
+KR RY R+D SK + + +S + ++PTLLQKLL++D+KRD+ LL TF
Sbjct: 455 NKRQRYDSREDSPSKRPREWNNNSRCHDGGAVAKIREPTLLQKLLNSDIKRDRHRLLHTF 514
Query: 253 RFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 290
+FMA+N+FFKDWP KP+ FP V + ++ EE S
Sbjct: 515 KFMALNNFFKDWPAKPLEFPSVKLNQIELESDIAEEGS 552
>gi|356562267|ref|XP_003549393.1| PREDICTED: uncharacterized protein LOC100789443 [Glycine max]
Length = 684
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 104/250 (41%), Positives = 155/250 (62%), Gaps = 21/250 (8%)
Query: 124 KKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQ-VRARVEN 182
+K +E +S I++E ++RE LKE+LAKQAELGVEVAEIPSYYL +S Q +++ +N
Sbjct: 451 QKHSECPKDSKAINREV-LQRE-LKEVLAKQAELGVEVAEIPSYYLKNSDNQGLQSEAKN 508
Query: 183 TMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK 242
+KR +FQN ++K+ RK +F+++QK ADKD S + S K+KPTLLQKLLS+DVK
Sbjct: 509 KY-TDKR-KFQNKFNKKS--DRKGRFAEKQKFADKDFSESPSLKKRKPTLLQKLLSSDVK 564
Query: 243 RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKG 302
RDKSHLLQ RFM +NSFF+ P+K + +P V VK+ G+ EV EK L +D +G
Sbjct: 565 RDKSHLLQVLRFMVMNSFFRHIPDKALRYPSVEVKEKGS--EVSGEKKHLHTGKDVLNRG 622
Query: 303 R----RMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVR 358
+ + F +D + +D+ +D DN+ + + + + + +
Sbjct: 623 SEETVQKIVIFNNDNGHD--------CDDDENDSIVDNNLHKDPSSLVKRQCDGGEGIKK 674
Query: 359 LEEEEGEIID 368
+EEEGEI++
Sbjct: 675 SDEEEGEILE 684
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 20 ISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 77
+ SN K KN +KN+ F GFQKS FH ++NGKR GF + KG +G+A
Sbjct: 282 VPNSNWKGSPSKN-LKNKPNRGRFQGGFQKSKFHDVNNGKRGSGFPKEHNGKGPNSGRAG 340
Query: 78 KFGL 81
+GL
Sbjct: 341 HYGL 344
>gi|222619440|gb|EEE55572.1| hypothetical protein OsJ_03849 [Oryza sativa Japonica Group]
Length = 484
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 65/118 (55%), Gaps = 13/118 (11%)
Query: 93 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 139
RK + Y EVKQW E R+KNYPT NINK KL+E + + + D+E
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 340
Query: 140 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD 197
A+MRR++LKE+L KQ ELG E+ E+P YL + + Q R N + RF N D
Sbjct: 341 AQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRAD 398
>gi|168019512|ref|XP_001762288.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686366|gb|EDQ72755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 614
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 65/90 (72%), Gaps = 6/90 (6%)
Query: 96 LALTYTEQ---EVKQWCEERRKNYPTKANINKK---LTEKQSNSDVIDKEAKMRREQLKE 149
+AL +E+ E++ W EER +NYPT++NI +K L K + ++ID++ + R++++KE
Sbjct: 278 VALKASEEDDDEIRAWREERMRNYPTRSNIQRKKEALEGKVARGELIDEDTRKRQQRMKE 337
Query: 150 ILAKQAELGVEVAEIPSYYLLDSKKQVRAR 179
ILAKQAELGV VAE+P++YL S K+ R R
Sbjct: 338 ILAKQAELGVPVAEVPAHYLSQSYKEDRGR 367
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 18/129 (13%)
Query: 185 PLNKRGRFQNNYDKRGRYKR--KDQFSKEQKLADKDSSNTSSFN---------KKKPTLL 233
P +K G+ Q + +RGR K+ K F+ K N + K+ PTLL
Sbjct: 461 PSDKPGKTQCYFYQRGRCKKGFKCHFAHTGPRGKKGGDNEAGRGGPKGVGVEKKRPPTLL 520
Query: 234 QKLLSADVKRDKSHLLQTFRFMAINSFFKDWP-------EKPVNFPLVMVKDGGAAGEVV 286
KLL ++ RDKS++LQ R+ N+F +WP E P + GEVV
Sbjct: 521 SKLLQGEINRDKSYILQCLRYFVNNNFLLNWPNQLSESIELPEQSDDAHESSDYSEGEVV 580
Query: 287 EEKSLLLGE 295
E +LL +
Sbjct: 581 EAPDVLLAD 589
>gi|219363611|ref|NP_001136910.1| uncharacterized protein LOC100217067 [Zea mays]
gi|194697570|gb|ACF82869.1| unknown [Zea mays]
Length = 138
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 40/47 (85%)
Query: 228 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV 274
++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++P+ FP V
Sbjct: 5 REPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRPLQFPSV 51
>gi|255546771|ref|XP_002514444.1| conserved hypothetical protein [Ricinus communis]
gi|223546440|gb|EEF47940.1| conserved hypothetical protein [Ricinus communis]
Length = 123
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 62/129 (48%), Gaps = 39/129 (30%)
Query: 146 QLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRK 205
+LKE LAKQAELGVEVAEIPS YL DS+KQV+
Sbjct: 28 RLKETLAKQAELGVEVAEIPSRYLSDSEKQVK---------------------------- 59
Query: 206 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK-RDKSHLLQTFRFMAINSFF--- 261
L +KDS+N SSF+ KPTL QKLLSA+ K S LQ FF
Sbjct: 60 ------DTLTNKDSTNGSSFSNTKPTLRQKLLSAEGKGAFLSECLQASSTFTCEQFFSSN 113
Query: 262 KDWP-EKPV 269
K W KPV
Sbjct: 114 KLWNFTKPV 122
>gi|384254158|gb|EIE27632.1| hypothetical protein COCSUDRAFT_55624 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 33/50 (66%)
Query: 223 SSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFP 272
S +++PTLLQKLL+ D++ + S LLQ RF+ N+F D+ P+ FP
Sbjct: 313 SPVQQRQPTLLQKLLAPDIRAEHSRLLQCLRFLVTNNFLLDFGRAPLIFP 362
>gi|328766403|gb|EGF76457.1| hypothetical protein BATDEDRAFT_28378 [Batrachochytrium
dendrobatidis JAM81]
Length = 411
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 25/185 (13%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKKL---TEKQSNSDVID---KEAKMRREQLKEILAK- 153
T ++++QW +ER++ +PT++NI +K+ ++Q ++ID K+ K R ++++ K
Sbjct: 217 TPKDIEQWIQERKRKFPTESNIKQKMEDEKQRQKQGNIIDVNGKKLKGRNADKRQLIRKN 276
Query: 154 ---QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGRYKRKDQF 208
Q + G + + YL D+ +T+ L+ G+ + R+ +
Sbjct: 277 DTVQEKHGTSEQKDSTIYLDDASNAKLEASHDTLALDMTGKLNQHKGASSCCRHGSNCRH 336
Query: 209 SKE-QKLADKDSSNTSSFNKKKPT-----------LLQKLLSADVKRDKSHLLQTFRFMA 256
S + KLA+ S+ + PT LL LL D K+DK+ LLQ RF+
Sbjct: 337 SHDLSKLAESRSA-PKTHTPLNPTAAPIKGTNRRPLLNMLLETDFKKDKNALLQCIRFII 395
Query: 257 INSFF 261
N FF
Sbjct: 396 ENKFF 400
>gi|308800704|ref|XP_003075133.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
(ISS) [Ostreococcus tauri]
gi|116061687|emb|CAL52405.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
(ISS) [Ostreococcus tauri]
Length = 401
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 199 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 258
RGR K + + DK +S SS N + TLL+KLL ++KRD+ LLQ FRF+ N
Sbjct: 261 RGRCKAGKRCPFRHDIVDKKTSGVSSGNPQ--TLLKKLLHKEIKRDEERLLQLFRFLVNN 318
Query: 259 SFF 261
FF
Sbjct: 319 DFF 321
>gi|145343958|ref|XP_001416510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576735|gb|ABO94803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 291
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 100/223 (44%), Gaps = 44/223 (19%)
Query: 103 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG---- 158
++V+++ EER K +P+ N+ K + + RRE+L+EILAKQ E+G
Sbjct: 28 EDVRRYREERAKYWPSDRNVRAKTEAGEEEE----RARAARRERLREILAKQREMGHFEA 83
Query: 159 -VEVAE----IPSYYLLDSKKQVRARVENTMPLNKRGRF--------------------- 192
E+ E + + LD K V AR + +K RF
Sbjct: 84 SQEIGEEEGGVAAPAALDGKG-VEARAADG---SKVCRFWLQGGCRKGSACDFKHESAPN 139
Query: 193 ---QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 249
+ + RGR K + + ++ ++ SS + TLL+KLL ++KRD+ LL
Sbjct: 140 KDQKCRFFARGRCKAGARCPFKHEVTERKSSAADGGGNPQ-TLLKKLLDKEIKRDEQRLL 198
Query: 250 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 292
Q FRF N FF K V+ PL M + +V++ ++ L
Sbjct: 199 QLFRFFVNNDFFTG--TKGVDEPLWMFPWADSGKKVIDRRAAL 239
>gi|118358218|ref|XP_001012358.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila]
gi|89294124|gb|EAR92112.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila
SB210]
Length = 762
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 41/63 (65%)
Query: 204 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 263
++D++S+E L+ SN+S+F K TLL L+ ++ R++++LLQ R++ N+FF +
Sbjct: 574 KRDKYSQELDLSFNYKSNSSNFRYKANTLLTNLVLDEIYRERNYLLQAIRYIVSNNFFDE 633
Query: 264 WPE 266
E
Sbjct: 634 QAE 636
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 104 EVKQWCEERRKNYPTKANINKKLTE 128
E+++W E+R+KNYPT NI KK TE
Sbjct: 289 EIRKWIEQRKKNYPTSENIKKKQTE 313
>gi|68073975|ref|XP_678902.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56499510|emb|CAH94055.1| conserved hypothetical protein [Plasmodium berghei]
Length = 611
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 90/182 (49%), Gaps = 18/182 (9%)
Query: 89 QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 148
EK K+ + +++E+++W EER+KNYPTK KK+ + ++N++ I+ E K ++ + K
Sbjct: 426 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMDIKNNNN-INSETKEKKNKEK 480
Query: 149 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGR 201
+I + +L VE + Y +K + + N + N N Y +
Sbjct: 481 KITSLIEKLLVEKYSSAIGRNIYYQTQSQRKSIFVPILNKLIENNHKNVYENTYYSISDK 540
Query: 202 YKRKD-QFSKEQKLADK-DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 259
Y +K+ +F+K K + DS N K L+ +L+ D+ + L+Q F+ N+
Sbjct: 541 YNKKNIKFNKRVKKHNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEFIVNNN 596
Query: 260 FF 261
FF
Sbjct: 597 FF 598
>gi|126337727|ref|XP_001369929.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Monodelphis domestica]
Length = 469
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)
Query: 159 VEVAEIPSYYLLDSKKQVR-ARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 217
V ++++P D KKQ R + V T L K +N KRG Y++K
Sbjct: 358 VILSDVPQNQSEDIKKQERNSPVAKTKSLRKNWMRKNCRYKRGLYRKKP----------- 406
Query: 218 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
S K++P LL+ LL+ D++ +++ +LQ R++ N FF
Sbjct: 407 ----VSEPQKRRPHLLEMLLAQDIRHERNVILQCVRYIIQNDFF 446
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
T +E+++W EERRKN+PT+ANI KK
Sbjct: 178 TPEEIEKWREERRKNFPTRANIEKK 202
>gi|303286377|ref|XP_003062478.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455995|gb|EEH53297.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 552
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 26/35 (74%)
Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 263
+PTLL+KL + +V+ +S LLQ FRF+ N+F +D
Sbjct: 460 QPTLLKKLFAKEVRVSRSRLLQVFRFLVNNNFLRD 494
>gi|255088323|ref|XP_002506084.1| predicted protein [Micromonas sp. RCC299]
gi|226521355|gb|ACO67342.1| predicted protein [Micromonas sp. RCC299]
Length = 567
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
TLL+KLL+ +V+ D+S LLQ FRF+ N F +
Sbjct: 458 TLLRKLLAREVRADRSRLLQVFRFLVNNDFLR 489
>gi|392567649|gb|EIW60824.1| hypothetical protein TRAVEDRAFT_71098 [Trametes versicolor
FP-101664 SS1]
Length = 590
Score = 41.6 bits (96), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 15/121 (12%)
Query: 102 EQEVKQWCEERRKNYPTKANINKKL----TEKQSNSDVIDKEAKMRREQLKEILAKQAEL 157
EQ ++++ EE++ + + N + + TE +S D+ A R E+ +I+AK EL
Sbjct: 211 EQALQRFIEEQKTSALQRFNFHMAVLIWETEARSAKATGDEAA--RSERHADIIAKLVEL 268
Query: 158 GVEVAEIPSY-----YLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQ 212
G + +EIP+Y LLD +++ R+ NT+ R + ++R R + +D F K+
Sbjct: 269 GYQSSEIPAYDHAFQRLLDQPRKLTPRIWNTI----RPKLVEMLEERRRAREQDAFKKKW 324
Query: 213 K 213
+
Sbjct: 325 R 325
>gi|307107626|gb|EFN55868.1| hypothetical protein CHLNCDRAFT_52119 [Chlorella variabilis]
Length = 684
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 2/70 (2%)
Query: 104 EVKQWCEERRKNYPTKANINKKLTEKQS--NSDVIDKEAKMRREQLKEILAKQAELGVEV 161
EV+QW +ER++++PT + +K E ++ +D +R+++L ILA+Q+ +G+
Sbjct: 388 EVEQWIKERKRSFPTSEAVARKEAEAEARRQRGELDAARTLRQQRLAAILAQQSAMGLSR 447
Query: 162 AEIPSYYLLD 171
A LLD
Sbjct: 448 AAGTLDMLLD 457
>gi|82594577|ref|XP_725484.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480507|gb|EAA17049.1| Zinc finger, C2H2 type, putative [Plasmodium yoelii yoelii]
Length = 601
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 89/187 (47%), Gaps = 29/187 (15%)
Query: 89 QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 148
EK K+ + +++E+++W EER+KNYPTK KK+ E ++N+ I+ E K ++ K
Sbjct: 419 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMEMKNNN--INSEEKEKKNXQK 472
Query: 149 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNK-RGRFQNNYDKRGRY 202
+I + +L VE + Y KK + + N + N + ++N Y Y
Sbjct: 473 KITSLIEKLLVEKYSSAIGRNIYYQTQSQKKSIFVPLLNKLIENNYKNVYENTY-----Y 527
Query: 203 KRKDQFSKEQKLADK--------DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRF 254
D+++++ +K DS N K L+ +L+ D+ + L+Q F
Sbjct: 528 SISDKYNQKNIKFNKKIKKNNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEF 583
Query: 255 MAINSFF 261
+ N+FF
Sbjct: 584 IVNNNFF 590
>gi|47218661|emb|CAG04990.1| unnamed protein product [Tetraodon nigroviridis]
Length = 350
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 228 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
++PTLL+ LL+ D++ +++ LLQ RF+ N FF
Sbjct: 306 RRPTLLEMLLAPDIRHERNVLLQCVRFVVRNHFF 339
Score = 37.7 bits (86), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 17/25 (68%), Positives = 19/25 (76%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
T +EV +W EERRKNYPT NI KK
Sbjct: 69 TPEEVAKWREERRKNYPTLQNIEKK 93
>gi|158297378|ref|XP_001237955.2| AGAP007875-PA [Anopheles gambiae str. PEST]
gi|157015166|gb|EAU76388.2| AGAP007875-PA [Anopheles gambiae str. PEST]
Length = 474
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
T+L+KLL AD++ +++ LLQ R++ +N FF
Sbjct: 416 TMLEKLLDADIRHERNVLLQCVRYVLVNKFF 446
>gi|17540726|ref|NP_499991.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
gi|351063723|emb|CCD71947.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
Length = 213
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 248 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 306
L+ F ++ FF +D P+ N L +V+D G E+ GEE S G+ RR V
Sbjct: 93 LIGAFIYVLAGEFFAQDSPQTIFNKALALVRDDGRCQEIFGASIAGFGEETSRGR-RRHV 151
Query: 307 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 360
H +++++ R R+ +++G DE E E D Q + +Y+ K HV ++
Sbjct: 152 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 211
>gi|291235450|ref|XP_002737657.1| PREDICTED: nuclear fragile X mental retardation protein interacting
protein-like [Saccoglossus kowalevskii]
Length = 639
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 26/35 (74%)
Query: 227 KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
KKKPTLL+ LL+ D++ +++ LLQ R++ +FF
Sbjct: 458 KKKPTLLEMLLAGDIRHERNVLLQCVRYVKKKNFF 492
>gi|345325138|ref|XP_001514551.2| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Ornithorhynchus anatinus]
Length = 446
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 12/52 (23%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 152
T +E+ +W EERRKN+PT ANI KK K ++ REQ E+L
Sbjct: 125 TPEEIAKWREERRKNFPTMANIEKK------------KALQLDREQRGEVLT 164
>gi|313219540|emb|CBY30463.1| unnamed protein product [Oikopleura dioica]
gi|313226347|emb|CBY21491.1| unnamed protein product [Oikopleura dioica]
Length = 310
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 35/49 (71%), Gaps = 1/49 (2%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 149
TE+++++W EERRKN+PTKA I +K +++ + + K+ +REQ +E
Sbjct: 96 TEEDIRKWREERRKNFPTKARIAEKEAARKAGTSIETKKFS-KREQERE 143
>gi|270007929|gb|EFA04377.1| hypothetical protein TcasGA2_TC014675 [Tribolium castaneum]
Length = 327
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 3/63 (4%)
Query: 203 KRKDQFSKEQKLADKDSSNTSS---FNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 259
+R+ K+QKL +K S F K+K TLL+KLL ++++ +++ LLQ + + N+
Sbjct: 265 RRRLSSGKKQKLNNKPEPGGFSRMKFVKRKVTLLEKLLDSEIRHERNVLLQCVQHVVTNN 324
Query: 260 FFK 262
FF+
Sbjct: 325 FFR 327
>gi|156353142|ref|XP_001622934.1| predicted protein [Nematostella vectensis]
gi|156209570|gb|EDO30834.1| predicted protein [Nematostella vectensis]
Length = 401
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 215 ADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
A++ NT S + PTLL+ LL+ +++ +++ +LQ R++ NSFF
Sbjct: 355 ANRTKQNTKS--NRNPTLLEMLLAPEIRHERNAILQCLRYIVRNSFF 399
>gi|156099300|ref|XP_001615652.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804526|gb|EDL45925.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 589
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 81/192 (42%), Gaps = 28/192 (14%)
Query: 103 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVA 162
+E+++W +ER+KNYPT+ I L ++ K+ E+L + + +G +
Sbjct: 418 KEIEKWIQERKKNYPTREKIADSLKNNENGKKKKKKKPNCLIEELL-FESYCSAVGRNI- 475
Query: 163 EIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQF-----SKEQKLADK 217
Y+ +K + + M N F N Y+K + +K+Q+ K
Sbjct: 476 ----YFKSKWEKSIFIPLLTKMEQNN---FHNIYEKSYYPLFNNSMGKFFPTKQQRRPFK 528
Query: 218 DSSNTSSFN-KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMV 276
+ S N +KP LL +L+ ++ + L++ F+ + FF D
Sbjct: 529 ERQGIDSLNIHRKPPLLYQLMKKEIYLYEKRLIKCIEFITRSGFFDD------------- 575
Query: 277 KDGGAAGEVVEE 288
+GGAAG + E
Sbjct: 576 SEGGAAGTDILE 587
>gi|312375641|gb|EFR22971.1| hypothetical protein AND_13906 [Anopheles darlingi]
Length = 575
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
T+L+KLL AD++ +++ LLQ R++ N+FF
Sbjct: 459 TMLEKLLDADIRHERNVLLQCVRYVVSNNFF 489
>gi|341888654|gb|EGT44589.1| hypothetical protein CAEBREN_21252 [Caenorhabditis brenneri]
Length = 219
Score = 38.9 bits (89), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 248 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 306
L+ F ++ FF +D P+ N L +V+D G ++ GEE S G+ RR V
Sbjct: 99 LIGAFIYVLCGEFFAQDSPQTIFNKALAIVRDDGRCQDIFGATIAGFGEETSRGR-RRHV 157
Query: 307 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 360
H +++++ R R+ +++G DE E E D Q + +Y+ K HV ++
Sbjct: 158 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 217
>gi|256053309|ref|XP_002570140.1| hypothetical protein [Schistosoma mansoni]
Length = 356
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 323 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 356
>gi|350646629|emb|CCD58749.1| hypothetical protein Smp_099560.1 [Schistosoma mansoni]
Length = 336
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 303 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 336
>gi|256053307|ref|XP_002570139.1| hypothetical protein [Schistosoma mansoni]
Length = 371
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 338 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 371
>gi|7305335|ref|NP_038773.1| nuclear fragile X mental retardation-interacting protein 1 [Mus
musculus]
gi|81917719|sp|Q9QXX8.1|NUFP1_MOUSE RecName: Full=Nuclear fragile X mental retardation-interacting
protein 1; AltName: Full=Nuclear FMRP-interacting
protein 1
gi|6525073|gb|AAF15316.1| nuclear FMRP interacting protein 1 [Mus musculus]
gi|26328149|dbj|BAC27815.1| unnamed protein product [Mus musculus]
gi|33525186|gb|AAH56192.1| Nuclear fragile X mental retardation protein interacting protein 1
[Mus musculus]
gi|74220737|dbj|BAE33625.1| unnamed protein product [Mus musculus]
gi|148703863|gb|EDL35810.1| nuclear fragile X mental retardation protein interacting protein 1
[Mus musculus]
Length = 484
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)
Query: 75 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 125
K ++F N M+ P KK K T +E+ +W EERRKNYPT ANI +K
Sbjct: 202 KIVQFHWRN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIERK 247
>gi|350646628|emb|CCD58748.1| hypothetical protein Smp_099560.2 [Schistosoma mansoni]
Length = 351
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 229 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 262
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 318 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 351
>gi|327283872|ref|XP_003226664.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Anolis carolinensis]
Length = 451
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 8/50 (16%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 150
T +E+ +W EERRKN+PT ANI KK + KE + R E LK +
Sbjct: 92 TPEEIAKWREERRKNFPTLANIEKK--------KALQKEKEQRGEVLKTL 133
>gi|67474753|ref|XP_653120.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56470045|gb|EAL47734.1| hypothetical protein EHI_019110 [Entamoeba histolytica HM-1:IMSS]
gi|449703311|gb|EMD43787.1| Hypothetical protein EHI5A_022200 [Entamoeba histolytica KU27]
Length = 129
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 29/39 (74%)
Query: 100 YTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 138
TE+++++W EER+K +PTKA K L+E++ +D I+K
Sbjct: 1 MTEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39
>gi|395527571|ref|XP_003765917.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1 [Sarcophilus harrisii]
Length = 480
Score = 38.1 bits (87), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
T +E+++W EERRKN+PT+ANI KK
Sbjct: 187 TPEEIEKWREERRKNFPTRANIEKK 211
>gi|407040593|gb|EKE40213.1| hypothetical protein ENU1_097960 [Entamoeba nuttalli P19]
Length = 129
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 29/39 (74%)
Query: 100 YTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 138
TE+++++W EER+K +PTKA K L+E++ +D I+K
Sbjct: 1 MTEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39
>gi|157108497|ref|XP_001650253.1| hypothetical protein AaeL_AAEL000699 [Aedes aegypti]
gi|108884013|gb|EAT48238.1| AAEL000699-PA [Aedes aegypti]
Length = 492
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 231 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 261
TLL+KLL +++ +++ LLQ RF+ NSFF
Sbjct: 444 TLLEKLLEPEIRHERNVLLQCVRFVVQNSFF 474
>gi|448516074|ref|XP_003867484.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis Co 90-125]
gi|380351823|emb|CCG22046.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis]
Length = 243
Score = 37.7 bits (86), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 69 TEEDIAKWIEERKKNWPTRANVERK 93
>gi|12843948|dbj|BAB26177.1| unnamed protein product [Mus musculus]
Length = 249
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
T +E+ +W EERRKNYPT ANI +K
Sbjct: 223 TPEEIARWREERRKNYPTLANIERK 247
>gi|354543450|emb|CCE40169.1| hypothetical protein CPAR2_102070 [Candida parapsilosis]
Length = 253
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 77 TEEDIAKWIEERKKNWPTRANVERK 101
>gi|167387090|ref|XP_001738022.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165898944|gb|EDR25671.1| hypothetical protein EDI_290620 [Entamoeba dispar SAW760]
Length = 137
Score = 37.7 bits (86), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 29/40 (72%)
Query: 99 TYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 138
+ TE+++++W EER+K +PTKA K L E++ +D I+K
Sbjct: 8 SMTEEQLQKWIEERKKRFPTKAKREKCLFEEKKKTDEIEK 47
>gi|348545180|ref|XP_003460058.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Oreochromis niloticus]
Length = 544
Score = 37.7 bits (86), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 101 TEQEVKQWCEERRKNYPTKANINKK 125
T +E+ +W EERRKNYPT NI KK
Sbjct: 152 TPEEITKWREERRKNYPTLVNIEKK 176
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.130 0.363
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,072,966,266
Number of Sequences: 23463169
Number of extensions: 278780249
Number of successful extensions: 1279746
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 202
Number of HSP's successfully gapped in prelim test: 1964
Number of HSP's that attempted gapping in prelim test: 1252546
Number of HSP's gapped (non-prelim): 18829
length of query: 368
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 224
effective length of database: 8,980,499,031
effective search space: 2011631782944
effective search space used: 2011631782944
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 77 (34.3 bits)