BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008530
(562 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
Length = 605
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 224/550 (40%), Positives = 282/550 (51%), Gaps = 103/550 (18%)
Query: 34 SVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMF 93
++P Q G++ PQ P NMP NQ FM PL Q+NHLG+P LG
Sbjct: 40 ALPNPLQMGMLXPQTSIPFI-PNMP------NQFGFM----PL---QNNHLGMPHLGSAQ 85
Query: 94 PVHGNFG--NMPLNVSQ-----------GQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVN 140
G+F N N+ Q L G N + LP N Q CLQ+ N
Sbjct: 86 SYQGSFNPQNCVPNLGQSAXRSNPSQPPANLFGHNSAILP---------GNLQFCLQNXN 136
Query: 141 QLLSMQMSNPCLFG------------------NPQFGLVNQNQQNFVQPIKDVS------ 176
Q M M N G PQ NQQ + V
Sbjct: 137 QNFPMPMPNLNQVGPYNIPQASHHMFQLAQALGPQSSAFIPNQQFVPMCVSGVGENVHQD 196
Query: 177 ---------GNNTLK-----------------SCSSEPRQ-GQNLQPSAFMRSQRNFIQN 209
G+N LK + S +P+Q N QPS FM+SQ NF+++
Sbjct: 197 NHKLGPPAMGSNALKQPPIANPQLQGNWPTMATSSVQPQQTWNNQQPSNFMKSQGNFVKD 256
Query: 210 GQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNG 268
+ S N K S KNF+ N + K G +KS H M NG+ K G N+ + KG N
Sbjct: 257 AGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENRGKGLSNN 316
Query: 269 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSD 328
A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK EK +NS+
Sbjct: 317 MAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEKLTNSE 376
Query: 329 VIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLNKRGRF 386
VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN K+ RF
Sbjct: 377 VIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFGKKERF 436
Query: 387 QNNYDKRGRYKRK--------DQFSKEQKLADKDSSNTSS-----FNKKKPTLLQKLLSA 433
QN +KR R+ RK D F+K+Q+LA DS +T++ NKKK TLLQKLLS
Sbjct: 437 QNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTLLQKLLST 496
Query: 434 DVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSE 493
D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G GEVV+ KS + +
Sbjct: 497 DIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSPTSKGVPQ 556
Query: 494 GKGRRMVEHF 503
G + E F
Sbjct: 557 GGRKTXAEEF 566
>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
Length = 599
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 233/587 (39%), Positives = 313/587 (53%), Gaps = 113/587 (19%)
Query: 57 MPATAPLMNQNNFM-NASNPLLIAQSNHLGLPQL--GHM--------FPVHGNFGNMPLN 105
+P P M Q M N NPL+ +N LG GHM FP G F MP N
Sbjct: 45 VPTMPPPMFQPGLMMNLQNPLMGLPNNSLGASPFAPGHMGFANSAANFPAQGQFNLMP-N 103
Query: 106 VSQ---------GQLLGPNLSNLPQQFNQNMGFSNGQMCL--QSVNQ-LLSMQMSN---- 149
V+Q Q G N+ NL QQ QNMG +NGQ CL Q++NQ ++ QM N
Sbjct: 104 VNQMNMNSCLPLAQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGQMMNMPQV 163
Query: 150 --------------PCLFGNPQFGL------------VNQNQQNFV-------------- 169
P F NP F VNQN QNF+
Sbjct: 164 PSYTSYGGPNQQAVPMPFQNPGFSTAQSFGVNQGMQPVNQNPQNFIPQAMGGAGSNQFPA 223
Query: 170 --QPIKDVSGNNTLK-SCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKN 226
QP++ GN+T+ + S++P+Q +NLQ AF +Q N + G+ S S S + +N
Sbjct: 224 SAQPLQ---GNSTMPINSSTQPQQARNLQSPAFAGTQGNSSISDGGNGSNSISNNSAHRN 280
Query: 227 FIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKK 286
F++N K GFQK+ H + N K+KFGF QK KG N + KF N +Q +E+K
Sbjct: 281 FMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKEQK 336
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 346
R SL+L YT+QE++QW E RRKNYP+ NI KKLT KQ+N ++DKEAK+ R++LKEILA
Sbjct: 337 R-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEILA 395
Query: 347 KQAELGVEVAEIPSYYLL-----DSKKQVRARV----ENTMPLNKRGRFQNNYDKRGRYK 397
KQAELGVEVAEIP YL D++KQ R E ++ QN +KRGR K
Sbjct: 396 KQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASIEKENSQNRLNKRGRCK 455
Query: 398 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 457
+K++ K+ K +K SN K++PTLLQKLL ADV++DKS LLQ RF +NSFFK+
Sbjct: 456 KKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMNSFFKE 514
Query: 458 WPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRERMHHN 515
WP KP+ FP V VK+ VV+E SL G + E +VE+ HD
Sbjct: 515 WPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD--------- 565
Query: 516 IQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 562
+ +N+++ +D++ E++K G G +++ EEEGEIID
Sbjct: 566 ---IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 599
>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
Length = 552
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 230/581 (39%), Positives = 310/581 (53%), Gaps = 112/581 (19%)
Query: 62 PLMNQNNFMNASNPLLIAQSNHLGLPQL--GHM--------FPVHGNFGNMPLNVSQ--- 108
P+ MN NPL+ +N LG GHM FP G F MP NV+Q
Sbjct: 4 PMFQPGLMMNMQNPLMGLPNNSLGASPFAPGHMGFANSAANFPAQGQFNLMP-NVNQMNM 62
Query: 109 ------GQLLGPNLSNLPQQFNQNMGFSNGQMCL--QSVNQ-LLSMQMSN---------- 149
Q G N+ NL QQ QNMG +NGQ CL Q++NQ ++ QM N
Sbjct: 63 NSCLPLAQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGQMMNMPQVPSYTSY 122
Query: 150 --------PCLFGNPQFGL------------VNQNQQNFV----------------QPIK 173
P F NP F VNQN QNF+ QP++
Sbjct: 123 GGPNQQAVPMPFQNPGFSTAQSFGVNQGMQPVNQNPQNFIPQAMGGAGSNQFPASAQPLQ 182
Query: 174 DVSGNNTLK-SCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRT 232
GN+T+ + S++P+Q +NLQ AF +Q N + G+ S S S + +NF++N
Sbjct: 183 ---GNSTMPINSSTQPQQARNLQSPAFAGTQGNSSISDGGNGSNSISNNSAHRNFMRNSK 239
Query: 233 KEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLAL 292
K GFQK+ H + N K+KFGF QK KG N + KF N +Q +E+KR SL+L
Sbjct: 240 K----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKEQKR-SLSL 294
Query: 293 TYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG 352
YT+QE++QW E RRKNYP+ NI KKLT KQ+N ++DKEAK+ R++LKEILAKQAELG
Sbjct: 295 VYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEILAKQAELG 354
Query: 353 VEVAEIPSYYLL-----DSKKQVRARV----ENTMPLNKRGRFQNNYDKRGRYKRKDQFS 403
VEVAEIP YL D++KQ R E ++ QN +KRGR K+K++
Sbjct: 355 VEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASIEKENSQNRLNKRGRCKKKNRPR 414
Query: 404 KEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPV 463
K+ K +K SN K++PTLLQKLL ADV++DKS LLQ RF +NSFFK+WP KP+
Sbjct: 415 KKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMNSFFKEWPNKPL 473
Query: 464 NFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRERMHHNIQGFED 521
FP V VK+ VV+E SL G + E +VE+ HD +
Sbjct: 474 KFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD------------IDS 521
Query: 522 ENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 562
+N+++ +D++ E++K G G +++ EEEGEIID
Sbjct: 522 DNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 552
>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa]
gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa]
Length = 659
Score = 291 bits (745), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 184/411 (44%), Positives = 233/411 (56%), Gaps = 67/411 (16%)
Query: 185 SSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSH 243
SS +Q N QPS + R Q N +NGQ S KS +K SGKN NR + SQ G QKS
Sbjct: 283 SSASQQSWNSQPSTYNRWQGNTARNGQSSTPKSKWEKPSGKNLKNNRNRGRSQSGHQKSD 342
Query: 244 FHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRK-SLALTYTEQEVKQW 302
F+ M NGKRK F N+ KG+GN + KFG + +Q E+K K S YTEQE+KQW
Sbjct: 343 FNCMDNGKRKLEFSNEHGRKGNGNERVAKFGRTDLTDQATEEKSKPSRTFFYTEQEIKQW 402
Query: 303 CEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYY 362
E RRK+YPTK +I KK E VID+EA R +QLKEILAKQAELGVEVAEIP Y
Sbjct: 403 RESRRKHYPTKTSIEKKQME------VIDREANFRHKQLKEILAKQAELGVEVAEIPPDY 456
Query: 363 LLDS-------------------------------KKQVRARVENTMPLNKRGRFQNNYD 391
LLDS +KQ R +N L K+G+F N +D
Sbjct: 457 LLDSEKLGVEVAEIPPPQVLSSEKLGVEVAEIPPPQKQEHGREDNRRSLTKKGKFWNKHD 516
Query: 392 KRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAI 451
KRGR+ +K + +K+ A+++ +KPTLL+KLLS D+KRDK LLQ FRFM
Sbjct: 517 KRGRFNKKGRSAKQVGSANEE---------RKPTLLEKLLSTDIKRDKRQLLQVFRFMVA 567
Query: 452 NSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRER 511
NSFFKDWPEKP+ FP V+VK+ G E+VE+KS L+GEE SE + + E+F
Sbjct: 568 NSFFKDWPEKPLKFPSVVVKEDGYEDEIVEKKSSLVGEEVSEDRNNTIAENF-------- 619
Query: 512 MHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 562
+ DD E + E ++ GK + DE R+ EEGEI+D
Sbjct: 620 ---------GDRDDNIEHDAQVELGNCFVRGKCDIVDEVDRV--EEGEIVD 659
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 95/216 (43%), Gaps = 42/216 (19%)
Query: 1 MQPLFNAF-ANQTQSNSSTSTLPQMAFRPNM-LQNSVPMQPQTGIMNPQVPFPLSNSNMP 58
M+P FN+ + +T+ L Q F N + NS+PMQPQ G++NPQ+P +NSN
Sbjct: 1 MKPFFNSRRPKHHKQKGNTAPLNQQVFSTNQSVANSMPMQPQLGLVNPQIPITFNNSNT- 59
Query: 59 ATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNM--------PLNVSQGQ 110
L++ M PL+ Q L P + + + +N QG+
Sbjct: 60 ----LLSNGQAMANMPPLIAQQPGILSGPNDLAILQLQNQVNKLNALKMLMDQVNQLQGE 115
Query: 111 LLGPNLSNLPQQFNQNMGF--------SNGQMCLQ--------------SVNQLLSMQMS 148
L GP+ SNLPQQ NQNMG N M LQ S +Q++ Q
Sbjct: 116 LFGPSFSNLPQQLNQNMGLLQNPMQNMMNPVMPLQMPINSQVGSFNVPSSNHQVVGAQSQ 175
Query: 149 NPCLFGNPQFGL---VNQNQQNFVQPIKDVSGNNTL 181
N F NP FG VN NQ N+V P G+ L
Sbjct: 176 N--FFVNPPFGAEQRVNPNQPNYVMPTTSAYGSKLL 209
>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa]
gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa]
Length = 674
Score = 285 bits (730), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 262/718 (36%), Positives = 332/718 (46%), Gaps = 200/718 (27%)
Query: 1 MQPLFNAF-ANQTQSNSSTSTLPQMAFRPN-MLQNSVPMQPQTGIMNPQVPF-------P 51
M+P FN+ + +T+ Q F N L NS MQPQ G++NPQ+P P
Sbjct: 1 MEPFFNSRRPRHHKQKGNTAPYNQQGFSANPSLTNS--MQPQLGLVNPQIPIPFNNSNTP 58
Query: 52 LSNS----NMP----------------ATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGH 91
L N NMP A L NQ N +NA +L+ Q N L G
Sbjct: 59 LRNGQAMPNMPPLINQQHGLVSGPNDLAILQLQNQVNKLNALK-MLMNQVNQL----QGE 113
Query: 92 MFPVHGNFGNMP--LNVSQGQLLGP--NLSN-----------------LPQQFNQNMG-- 128
+F F N+P +N + G L P N+ N +P + +Q +G
Sbjct: 114 LF--GPGFSNLPQQINQNMGLLQNPMQNMMNPVMPMQMPMTSQVGSFNVPSRSHQVVGAQ 171
Query: 129 ----FSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQ----------NFVQPIKD 174
F N Q+ N ++ +N G+ Q NQ Q NFV P
Sbjct: 172 SPNFFVNQQVKQNQPNFVMPTTGAN----GSKQLSFENQQMQGNLSATQKNQNFVMPAVG 227
Query: 175 VSGNNTLKSCSSE----------------------------------------PRQGQNL 194
+G+N+L + + +Q QNL
Sbjct: 228 TNGSNSLHVATQQVQGISPASQQSEKFVMPTMAANGPKPLPAATQQVQGNPFTSQQSQNL 287
Query: 195 QPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRK 253
QPSA+ R Q N +NGQGS S SGKNF N +E SQ G QKS FH+M NGKRK
Sbjct: 288 QPSAYNRWQGNPAKNGQGSTPNSKQGIFSGKNFKNNPKREQSQSGHQKSEFHRMDNGKRK 347
Query: 254 FGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTK 313
GF N KG GN +A KFG ++ NQ E+KR + YTEQE+KQW E RRK++PTK
Sbjct: 348 LGFSNKHGGKGKGNERAAKFGRSDPSNQVMEQKRTHI---YTEQEIKQWRESRRKHFPTK 404
Query: 314 ANINKKLTEKQSNSDVIDKEAKMRRE-------------QLKEILAKQAELGVEVAEIPS 360
NI KK TEK +S VIDKEA RR+ QLKEILAKQAELGVEVAEIP
Sbjct: 405 TNIEKKQTEKLIDSGVIDKEANFRRKVLNALPRKLDDKLQLKEILAKQAELGVEVAEIPP 464
Query: 361 YY------------------------------------LLDSKKQVRARVENTMPLNKRG 384
Y LLDS+KQ R +N L K+G
Sbjct: 465 EYMLDSEKLGVEVAETPLSYLLDSEKLGVEAAEIPPHHLLDSEKQEHGREDNRRSLTKKG 524
Query: 385 RFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQ 444
RF N +D+RGRYKRK + + L +++ +KPTLL+KLL AD+KRDK LLQ
Sbjct: 525 RFWNKHDRRGRYKRKGRSDMQLGLENEE---------RKPTLLEKLLCADIKRDKHRLLQ 575
Query: 445 TFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFE 504
FRFM NSFFKDW +KP+ FP V+VK+ G E EEK L+GEE+SE VE F
Sbjct: 576 VFRFMVANSFFKDWSDKPLKFPSVVVKEDGCKDEPQEEKPSLVGEEESEVPNNTTVEDF- 634
Query: 505 HDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 562
+ DD +E + E + GK + DE R+ EEGEIID
Sbjct: 635 ----------------GDRDDGDEHDAQVEPGNGSVMGKCDIVDEVYRV--EEGEIID 674
>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
Length = 822
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 219/607 (36%), Positives = 301/607 (49%), Gaps = 122/607 (20%)
Query: 58 PATAPLMNQNNFMNASN----PLLIAQSNHLGLPQLGH---------------------- 91
P P MN FMNA+N PL + H+GLP GH
Sbjct: 236 PFMQPPMNAAPFMNAANHNHFPLHMP---HMGLP--GHQQGQPHVGGLGPQNSVGNANYN 290
Query: 92 --MFP-----VHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMC----LQSVN 140
MFP V N L+ QGQ L ++ N+ QQ N NM NGQ C +Q++N
Sbjct: 291 NPMFPPVQGQVMQNQAQFNLSPLQGQFLAQSILNMLQQPNMNMSMPNGQFCGPYLMQNMN 350
Query: 141 QLLSMQMSNP------------C-LFG------------------NPQFGLVNQNQQNFV 169
L MQM NP C +FG NPQ G V NQ V
Sbjct: 351 PQLPMQMPNPSQGVPYGMHPGSCPMFGFPNQVPQAMVPQNSMFSTNPQLGFVPGNQ---V 407
Query: 170 QPIKD----------VSGN---------------NTLKSCSSEPRQGQNLQPSAFMR--- 201
+P D VS N NT S + N QP AFM+
Sbjct: 408 RPQIDPNEKNLNPPNVSANSFVSSSPFSSQQLQGNTSGSLNPNLAHTNNSQPPAFMKQVL 467
Query: 202 -SQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYN 258
++ N + ++ SN K KNF KN+ F GFQKS FH ++NGK+ GF
Sbjct: 468 KTEEKPNSNIKTNVPNSNWKGSPSKNF-KNKPNRGGFQAGFQKSKFHDVNNGKKGSGFPI 526
Query: 259 DQKPKGHGNGKAIKFGLANQMNQPQEKKR---KSLALTYTEQEVKQWCEERRKNYPTKAN 315
+ KG +G+ +GL +P+E K+ +SL++TYT QE++QW E R+KN+P N
Sbjct: 527 EHNGKGPNSGRGGHYGL-----KPKEHKQQPERSLSVTYTVQEIQQWREARKKNHPFNNN 581
Query: 316 INKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVE 375
I KK +E + I++E R +LKE+LAKQAELGVEVAEIPSYYL +S Q + E
Sbjct: 582 IQKKHSEHPKDRKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQA-LQSE 638
Query: 376 NTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADV 435
+ +FQN ++K+ RK +F+K QK DKD S + S K+KPTLLQKLLS+DV
Sbjct: 639 GKNKFTDKRKFQNKFNKKS--DRKGRFAKRQKFDDKDFSESPSLKKRKPTLLQKLLSSDV 696
Query: 436 KRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGK 495
KRDKSHL+Q FRFM +NSFFK +KP+ +PLV+VK+ G+ + EEK L G++ +G
Sbjct: 697 KRDKSHLIQVFRFMVMNSFFKHCLDKPLRYPLVVVKEKGSEVD-GEEKYLHTGKDVLKGG 755
Query: 496 GRRMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEE 555
V+ N + +D+ +D DN+ + + + ++ + + +E
Sbjct: 756 NEETVQKIVTFNNDNSHDCEDEDSDDDENDSIVDNNLHKDPSSLVKRQCDSGEGIGKFDE 815
Query: 556 EEGEIID 562
EEGEII+
Sbjct: 816 EEGEIIE 822
>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis]
gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis]
Length = 273
Score = 249 bits (635), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 128/218 (58%), Positives = 166/218 (76%), Gaps = 4/218 (1%)
Query: 288 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAK 347
+S AL YTEQE+K W EERRKN+P+KANI K +E+ +NS IDKEAK+RRE+LKEILAK
Sbjct: 9 RSPALIYTEQEIKLWREERRKNFPSKANIEKMCSERVTNSGGIDKEAKLRRERLKEILAK 68
Query: 348 QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQK 407
QAELGVEVAEIPS+YL DS+KQV+ + ++ K+GR ++ +D+RGRY +KD +++
Sbjct: 69 QAELGVEVAEIPSHYLSDSEKQVKVKKDSRRSAPKKGRSRHKHDRRGRYNKKDGLTQQNT 128
Query: 408 LADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPL 467
LA KDSSN SSF++ KPTLLQKLLSA+V++DK HLLQ FRFM +NSFF D PEKP+ FP
Sbjct: 129 LAKKDSSNGSSFSRTKPTLLQKLLSAEVRKDKHHLLQVFRFMVMNSFFDDGPEKPLKFPS 188
Query: 468 VMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEH 505
V+VK+ EVV EK+ + G+ SE + VE+ H
Sbjct: 189 VVVKE----DEVVAEKTSITGKGISEVSNKTAVENSGH 222
>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana]
gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana]
gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana]
Length = 470
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 185/440 (42%), Positives = 246/440 (55%), Gaps = 58/440 (13%)
Query: 68 NFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNM 127
N MNA N L + + P +GH NMP+ PN+ PQ FN NM
Sbjct: 43 NPMNA-NQLGMMNPQMMNNPMMGH---------NMPM---------PNMPIHPQFFN-NM 82
Query: 128 GFSNGQMCLQS-VNQLLSMQMSNPCLFGNPQFGLVNQNQQ-----NFVQPIKD---VSGN 178
Q + + +NQLL P L GN QF + N N NF QP + S
Sbjct: 83 PQQLPQFAMPNHINQLL------PNLLGNLQFAVANSNLMGHSLPNFFQPSLEPHAFSSR 136
Query: 179 NTLKSCSS-------EPRQGQNLQPSAFM--RSQRNFIQNGQGSISKSNSKKVSGKNFIK 229
L S +S P Q P F R Q + N GS N + F K
Sbjct: 137 PQLNSFNSLPYPPVPNPHQNHQSGPPGFSEPRPQGQSVDNTNGSGPNGNDFR---NKFPK 193
Query: 230 NRT-KEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL--ANQMNQPQEKK 286
++ K QGFQ+ HQ NGKRK GF D + G GN +K GL ++ N +EKK
Sbjct: 194 HQNFKGPGQGFQRPQLHQADNGKRKSGFNKDHR--GKGNNNKMKTGLDGSDTGNIAKEKK 251
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 346
R S AL YT +EV+QW E RRKNYPTK + KK+ +K ++ ++D+EAKMRR+QL+E+LA
Sbjct: 252 R-SYALMYTPREVQQWREARRKNYPTKFLVEKKV-KKNVSASILDEEAKMRRQQLREVLA 309
Query: 347 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQF-SKE 405
KQAELGVEVAE+PS+YL ++ +QV N ++GRFQNN + R+ RKD+F +K+
Sbjct: 310 KQAELGVEVAEVPSHYLSNNDEQVNGDRGNNN--GRKGRFQNNRRNKRRHDRKDKFDNKK 367
Query: 406 QKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNF 465
+L DK SS SS +KPTLL+KLLSAD+KRDKS LLQ FRFM +NS K++PE+P+
Sbjct: 368 PRLEDKKSSQDSSITTRKPTLLEKLLSADIKRDKSQLLQVFRFMVMNSLLKEFPEQPLKL 427
Query: 466 PLVMVKDGGAAGEVVEEKSL 485
PL+ VK+ G + +E+ S+
Sbjct: 428 PLITVKETGCE-DAMEDPSI 446
>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
Length = 553
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 203/543 (37%), Positives = 273/543 (50%), Gaps = 121/543 (22%)
Query: 92 MFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNM---GFSNGQMC------LQSV--N 140
MFPVHG G P Q+L N+ NL QQ N NM N Q C +Q++
Sbjct: 60 MFPVHGQGG--PQITHNAQILAQNILNLLQQPNMNMPNMNVPNAQFCAPGPYPMQNMMNQ 117
Query: 141 QLLSMQMS--------------------------------NPCLFGNPQFGLVNQNQQNF 168
QL MQM NP GNPQFG+V NQ
Sbjct: 118 QLPPMQMQRQNPTQQPYGMQPVQQPMFGFPNQVPHAMVPQNPMFNGNPQFGMVPGNQ--- 174
Query: 169 VQPIKDVSGNNTLKSCSSE----------------------PRQGQNLQPSAFMRS--QR 204
++P D++ N + + + P N Q SAF S Q
Sbjct: 175 IRPQIDLNEKNRVPAGNGNANGFVPGGPFPPQQLQGNNGPVPHNANNAQGSAFRNSHPQV 234
Query: 205 NFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQG-FQKSHFHQMHNGKRKFGFYNDQKPK 263
N N + + SN K KNF + + SQG FQKS F+ + GKR F + K K
Sbjct: 235 NPNSNMNTNFANSNWKGSPNKNFKNKQNRGGSQGGFQKSKFNDTNKGKR---FSKEHKGK 291
Query: 264 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 323
G N +A FGL N QE KR S +LTY+EQE++QW E RRKN+P++ I K+ +E+
Sbjct: 292 GPNNERAGHFGL----NSKQEPKR-SFSLTYSEQEIQQWREARRKNHPSREKIEKQQSEQ 346
Query: 324 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKR 383
+S VID+E R +LKE+LAKQAELG+EVAEIPSYYL++ Q + +
Sbjct: 347 SKDSKVIDRELLQR--ELKEVLAKQAELGIEVAEIPSYYLMNGTNQGLQSEDKKNSFTDK 404
Query: 384 GRFQN----NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDK 439
+F+N N D+R R +K QK AD+D NKKKPTLLQKLLSAD+KRDK
Sbjct: 405 RKFKNKLMRNPDRRSRNNKK------QKFADRDLLE----NKKKPTLLQKLLSADIKRDK 454
Query: 440 SHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRM 499
SHL Q FRF+ NSFFKD+P+KP+ +P V VK+ G+ EV K L G G +
Sbjct: 455 SHLFQVFRFITANSFFKDYPDKPLVYPPVSVKEMGS--EVYGGKKHLQG-------GEAV 505
Query: 500 VEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGE 559
+EH +E + ++G ++ +D+E+E++D +G+ D +EEEGE
Sbjct: 506 LEH----GTKEIVQKFVKGSDNGHDNEDEESD-----------EGDNEDGINEFDEEEGE 550
Query: 560 IID 562
II+
Sbjct: 551 IIE 553
>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
lyrata]
gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 244 bits (624), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 163/416 (39%), Positives = 233/416 (56%), Gaps = 52/416 (12%)
Query: 73 SNPLLIAQSNHLGLPQLGHMFPVHGNF-GNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSN 131
SNP++ +N + +P + P+H F NMP Q PN
Sbjct: 59 SNPMMGHMNNPIPMPNM----PIHPQFFNNMPQQQQLHQFAMPN---------------- 98
Query: 132 GQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQ-----NFVQPIKDVSGN------NT 180
+NQLL P L GN QF + N N NF QP + S N+
Sbjct: 99 ------HINQLL------PNLLGNLQFAVANSNLMGHSLPNFFQPNLEPSAFTSRPQLNS 146
Query: 181 LKSCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRT-KEFSQGF 239
S P +L+P F + + + + S SN K F K++ K QGF
Sbjct: 147 FNSLPYPPVPNHHLRPPGFSEPRPQVGIDDRTNGSGSNGNDFRNK-FTKHQNFKGPGQGF 205
Query: 240 QKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL-ANQMNQPQEKKRKSLALTYTEQE 298
Q+ HQ NGKRK GF D + G GN +K GL + + ++KR+S AL YT ++
Sbjct: 206 QRPQLHQADNGKRKSGFNKDHR--GKGNYNKMKNGLDGSDADNIAKEKRRSYALMYTPKD 263
Query: 299 VKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEI 358
V QW E RRKN+PT+ N+ KK+ +K ++ ++D+EAKMRR+QL+E+LAKQAELG+EVA++
Sbjct: 264 VNQWREARRKNFPTRLNVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGIEVADV 322
Query: 359 PSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSS 418
PS+YL ++ ++V +N ++ +FQNN K+ R+ RKD+F K +L DK+SS S
Sbjct: 323 PSHYLSNTDERVHG--DNGANDGQKRKFQNNRHKQRRHGRKDKFDKTPRLDDKNSSQESP 380
Query: 419 FNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGG 474
KKPTLL+KLLSA++KRDK HLLQ FRFM +NSF K++PE+P+ PL+ V++ G
Sbjct: 381 MTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFLKEFPEQPLKLPLITVEETG 436
>gi|359487059|ref|XP_002271061.2| PREDICTED: uncharacterized protein LOC100252331 [Vitis vinifera]
Length = 1031
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 133/223 (59%), Positives = 160/223 (71%), Gaps = 15/223 (6%)
Query: 277 NQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKM 336
N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK EK +NS+VI+ E K
Sbjct: 751 NFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEKLTNSEVIEAEVKS 810
Query: 337 RREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLNKRGRFQNNYDKRG 394
RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN K+ RFQN +KR
Sbjct: 811 RRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFGKKERFQNRGNKRR 870
Query: 395 RYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQKLLSADVKRDKSH 441
R+ RK D F+K+Q+LA KDS SN NKKK TLLQKLLS D+KRDK H
Sbjct: 871 RHDRKQWQRHDQEDGFTKKQRLAGKDSGDMNASNQPPLNKKKQTLLQKLLSTDIKRDKRH 930
Query: 442 LLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 484
LLQ FRFMA+NSFFKDWPEKP+ FPLV+VK+ G GEVV+ KS
Sbjct: 931 LLQVFRFMAMNSFFKDWPEKPLKFPLVVVKETGCQGEVVDRKS 973
>gi|296085358|emb|CBI29090.3| unnamed protein product [Vitis vinifera]
Length = 467
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 113/240 (47%), Positives = 149/240 (62%), Gaps = 12/240 (5%)
Query: 180 TLKSCSSEPRQ-GQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQ 237
T+ + S +P+Q N QPS FM+SQ NF+++ + S N K S KNF+KN + K
Sbjct: 220 TMATSSVQPQQTWNNQQPSNFMKSQGNFVKDAGVNNSNPNWKNSSRKNFMKNPKGKNSHW 279
Query: 238 GFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQ 297
G +KS H M N + K G N+ + KG N A N Q + +K++ L L YTEQ
Sbjct: 280 GSRKSQLHHMQNARGKAGISNENRGKGLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQ 339
Query: 298 EVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAE 357
E++ W EER+KNYP+K N+ KK EK +NS+VI+ E K RR+QLKEILAKQAELGVEVAE
Sbjct: 340 EIQNWREERKKNYPSKINLEKKSAEKLTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAE 399
Query: 358 IPSYYLLDS-KKQVRARVENT-MPLNKRGRFQNNYDKRGRYKRK--------DQFSKEQK 407
IP +YL DS K+QV R EN K+ RFQN +KR R+ RK D F+K+Q+
Sbjct: 400 IPPHYLSDSEKQQVHGREENNKKAFGKKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQR 459
>gi|115440647|ref|NP_001044603.1| Os01g0814000 [Oryza sativa Japonica Group]
gi|56785048|dbj|BAD82687.1| unknown protein [Oryza sativa Japonica Group]
gi|113534134|dbj|BAF06517.1| Os01g0814000 [Oryza sativa Japonica Group]
Length = 533
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 90/186 (48%), Positives = 121/186 (65%), Gaps = 2/186 (1%)
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 346
RK + Y EVKQW E R+KNYPT NINKKL+E + + + D+EA+MRR++LKE+L
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKKLSESRPDGEKKDEEAQMRRQELKEVLE 340
Query: 347 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-KRGRYKRKDQFSKE 405
KQ ELG E+ E+P YL + + Q R N + RF N D KR RY R D SK
Sbjct: 341 KQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNKRSRYDRNDFQSKR 400
Query: 406 QKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVN 464
K+ ++ + + K ++PTLLQKLLS+DVKRD+ LL TF+FM +N+FF D+P+KP+
Sbjct: 401 PKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVLNNFFSDYPDKPLE 460
Query: 465 FPLVMV 470
FP V V
Sbjct: 461 FPSVKV 466
>gi|242054757|ref|XP_002456524.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
gi|241928499|gb|EES01644.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
Length = 568
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 3/188 (1%)
Query: 286 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 345
+RK + Y EVK+W E R+KNYPT N++KKL+E S+++ DK+A++RR++LKE+L
Sbjct: 295 RRKPPRIIYDANEVKRWVEARKKNYPTSVNVHKKLSEINSDNENKDKDAQLRRQELKEVL 354
Query: 346 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 403
AKQ ELG ++ E+P YL ++ Q N + F N N +KR RY+R S
Sbjct: 355 AKQQELGFDLPELPPGYLSETGDQCIENKNNRKAQCRDSHFGNRSNNNKRPRYERGGFQS 414
Query: 404 KEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 462
K K+ ++ + K ++PTLLQKLLS+D+KRD+ LL F+FM +N+FFKDWP+KP
Sbjct: 415 KRSKVWNRTPCADDAMAKSREPTLLQKLLSSDIKRDRHRLLHVFKFMTLNNFFKDWPDKP 474
Query: 463 VNFPLVMV 470
+ FP V V
Sbjct: 475 LQFPSVKV 482
>gi|357125563|ref|XP_003564462.1| PREDICTED: uncharacterized protein LOC100829078 [Brachypodium
distachyon]
Length = 609
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 123/191 (64%), Gaps = 7/191 (3%)
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 346
RK + Y EVKQW E R+KNYPT N+NKKL++ QS+ D+EA+MRR +LKE+LA
Sbjct: 333 RKRPPICYDTNEVKQWVEARKKNYPTSVNVNKKLSQSQSDDQKKDEEAQMRRLELKEVLA 392
Query: 347 KQAELGVEVAEIPSYYLLDSKKQVRA----RVENTMPLNKRG-RFQN--NYDKRGRYKRK 399
KQ ELG E+ E+P YL +++ Q + E+ +R RF N +KR RY R
Sbjct: 393 KQKELGFELPELPPGYLSETEGQPKGPQGNDKESKWKTQQRDCRFGNRGRGNKRQRYDRA 452
Query: 400 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWP 459
D SK + + + + K++PTLLQKLLS+D+KRD+ LL TF+FMA+N+FF++ P
Sbjct: 453 DFQSKRPREWNNSRHDGGAVAKREPTLLQKLLSSDIKRDRHRLLHTFKFMALNNFFRNLP 512
Query: 460 EKPVNFPLVMV 470
+KP+ FP V V
Sbjct: 513 DKPLEFPSVKV 523
>gi|218189267|gb|EEC71694.1| hypothetical protein OsI_04190 [Oryza sativa Indica Group]
Length = 500
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 122/199 (61%), Gaps = 15/199 (7%)
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 333
RK + Y EVKQW E R+KNYPT NINK KL+E + + + D+E
Sbjct: 237 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 296
Query: 334 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-K 392
A+MRR++LKE+LAKQ ELG E+ E+P YL + + Q R N + RF N D K
Sbjct: 297 AQMRRQELKEVLAKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 356
Query: 393 RGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAI 451
R RY R D SK K+ ++ + + K ++PTLLQKLLS+DVKRD+ LL TF+FM +
Sbjct: 357 RSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVL 416
Query: 452 NSFFKDWPEKPVNFPLVMV 470
N+FF D+P+KP+ FP V V
Sbjct: 417 NNFFSDYPDKPLEFPSVKV 435
>gi|223943225|gb|ACN25696.1| unknown [Zea mays]
gi|414880010|tpg|DAA57141.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
Length = 562
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 83/186 (44%), Positives = 119/186 (63%), Gaps = 3/186 (1%)
Query: 286 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 345
+RK + Y EVKQW E R+KNYPT N+NKKL+E +S+++ DK+A++RR++LKE++
Sbjct: 290 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQELKEVI 349
Query: 346 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 403
AKQ ELG + E+P YL + Q N + F N N +KR RY+R S
Sbjct: 350 AKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQS 409
Query: 404 KEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 462
K K+ + ++ ++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++P
Sbjct: 410 KRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRP 469
Query: 463 VNFPLV 468
+ FP V
Sbjct: 470 LQFPSV 475
>gi|20161477|dbj|BAB90401.1| P0432B10.22 [Oryza sativa Japonica Group]
Length = 541
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 124/220 (56%), Gaps = 24/220 (10%)
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINK----------------------KLTEKQ 324
RK + Y EVKQW E R+KNYPT NINK KL+E +
Sbjct: 267 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKASATLCPHQTIREASSTDHLMQKLSESR 326
Query: 325 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 384
+ + D+EA+MRR++LKE+L KQ ELG E+ E+P YL + + Q R N +
Sbjct: 327 PDGEKKDEEAQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDC 386
Query: 385 RFQNNYD-KRGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHL 442
RF N D KR RY R D SK K+ ++ + + K ++PTLLQKLLS+DVKRD+ L
Sbjct: 387 RFGNRADNKRSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRL 446
Query: 443 LQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEE 482
L TF+FM +N+FF D+P+KP+ FP V V + EE
Sbjct: 447 LHTFKFMVLNNFFSDYPDKPLEFPSVKVNQIELESNIAEE 486
>gi|356562267|ref|XP_003549393.1| PREDICTED: uncharacterized protein LOC100789443 [Glycine max]
Length = 684
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/250 (41%), Positives = 153/250 (61%), Gaps = 21/250 (8%)
Query: 318 KKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQ-VRARVEN 376
+K +E +S I++E R +LKE+LAKQAELGVEVAEIPSYYL +S Q +++ +N
Sbjct: 451 QKHSECPKDSKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQGLQSEAKN 508
Query: 377 TMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK 436
+KR +FQN ++K+ RK +F+++QK ADKD S + S K+KPTLLQKLLS+DVK
Sbjct: 509 KY-TDKR-KFQNKFNKKS--DRKGRFAEKQKFADKDFSESPSLKKRKPTLLQKLLSSDVK 564
Query: 437 RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKG 496
RDKSHLLQ RFM +NSFF+ P+K + +P V VK+ G+ EV EK L +D +G
Sbjct: 565 RDKSHLLQVLRFMVMNSFFRHIPDKALRYPSVEVKEKGS--EVSGEKKHLHTGKDVLNRG 622
Query: 497 R----RMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVR 552
+ + F +D N +D+ +D DN+ + + + + + +
Sbjct: 623 SEETVQKIVIFNND--------NGHDCDDDENDSIVDNNLHKDPSSLVKRQCDGGEGIKK 674
Query: 553 LEEEEGEIID 562
+EEEGEI++
Sbjct: 675 SDEEEGEILE 684
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 103/335 (30%), Positives = 131/335 (39%), Gaps = 111/335 (33%)
Query: 35 VPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGL--PQLGH- 91
+P QPQ G +P P P MN FMNA+N NH L P +GH
Sbjct: 27 MPPQPQLGAGQNLMP--------PFMQPHMNAAPFMNAAN------HNHFPLHRPHMGHQ 72
Query: 92 -------------------------MFPVHG----NFGNMPLNVSQGQLLGPNLSNLPQQ 122
MFPV G N + L+ QGQ L ++ N+ QQ
Sbjct: 73 QGQPHVVGGLGPQNSVVGNANYNNPMFPVQGQVMQNQAQLNLSPLQGQFLAQSILNMLQQ 132
Query: 123 FNQNMGFS--NGQMC----LQSVNQLLSMQMSNP-------------------------- 150
N NM S NGQ C +Q++NQ L MQMSNP
Sbjct: 133 PNMNMSMSMPNGQFCGPYPMQNMNQQLPMQMSNPPQGVPYGMHPSSRPVFRFPNQVPQAM 192
Query: 151 ----CLFG-NPQFGLV-------------------NQNQQNFVQ----PIKDVSGNNTLK 182
+F NPQ G V N N FV P + + GN T
Sbjct: 193 VPQNSMFSTNPQLGFVPGNQVRPQIDPNEKILAPPNANANAFVSSSPFPSQQLQGN-TSG 251
Query: 183 SCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKE--FSQGFQ 240
S + N QP AFM+ Q N + ++ SN K KN +KN+ F GFQ
Sbjct: 252 SVNPNLAHTNNSQPPAFMK-QETPNSNIKTNVPNSNWKGSPSKN-LKNKPNRGRFQGGFQ 309
Query: 241 KSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL 275
KS FH ++NGKR GF + KG +G+A +GL
Sbjct: 310 KSKFHDVNNGKRGSGFPKEHNGKGPNSGRAGHYGL 344
>gi|326528619|dbj|BAJ97331.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 625
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 128/218 (58%), Gaps = 20/218 (9%)
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 346
RK + Y + EVKQW + R+KNYPT+AN+NKKL + Q + D+EA+MRR++LKE++A
Sbjct: 335 RKRPPIIYDKNEVKQWVQARKKNYPTRANVNKKLCQNQLDEQKKDEEAQMRRQELKEVIA 394
Query: 347 KQAELGVEVAEIPSYYLLDSKKQVR----------------ARVENTMPLNKRGRFQNNY 390
KQ ELG+E+ E+P YL D++ Q R RGR
Sbjct: 395 KQKELGLELPELPPGYLSDNEGQPRDPQKESNWKTRQGGGRFGNRGRGRGQGRGRGSGRD 454
Query: 391 DKRGRY-KRKDQFSKEQKLADKDS---SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTF 446
+KR RY R+D SK + + +S + ++PTLLQKLL++D+KRD+ LL TF
Sbjct: 455 NKRQRYDSREDSPSKRPREWNNNSRCHDGGAVAKIREPTLLQKLLNSDIKRDRHRLLHTF 514
Query: 447 RFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 484
+FMA+N+FFKDWP KP+ FP V + ++ EE S
Sbjct: 515 KFMALNNFFKDWPAKPLEFPSVKLNQIELESDIAEEGS 552
>gi|293335934|ref|NP_001167960.1| uncharacterized protein LOC100381676 [Zea mays]
gi|223945137|gb|ACN26652.1| unknown [Zea mays]
gi|414880008|tpg|DAA57139.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
Length = 295
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 286 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRRE-QLKEI 344
+RK + Y EVKQW E R+KNYPT N+NKKL+E +S+++ DK+A++RR+ +LKE+
Sbjct: 22 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQVELKEV 81
Query: 345 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD--KRGRYKRKDQF 402
+AKQ ELG + E+P YL + Q N + F N ++ KR RY+R
Sbjct: 82 IAKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQ 141
Query: 403 SKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEK 461
SK K+ + ++ ++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++
Sbjct: 142 SKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDR 201
Query: 462 PVNFPLV 468
P+ FP V
Sbjct: 202 PLQFPSV 208
>gi|222619440|gb|EEE55572.1| hypothetical protein OsJ_03849 [Oryza sativa Japonica Group]
Length = 484
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 65/118 (55%), Gaps = 13/118 (11%)
Query: 287 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 333
RK + Y EVKQW E R+KNYPT NINK KL+E + + + D+E
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 340
Query: 334 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD 391
A+MRR++LKE+L KQ ELG E+ E+P YL + + Q R N + RF N D
Sbjct: 341 AQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRAD 398
>gi|168019512|ref|XP_001762288.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686366|gb|EDQ72755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 614
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 67/95 (70%), Gaps = 6/95 (6%)
Query: 290 LALTYTEQ---EVKQWCEERRKNYPTKANINKK---LTEKQSNSDVIDKEAKMRREQLKE 343
+AL +E+ E++ W EER +NYPT++NI +K L K + ++ID++ + R++++KE
Sbjct: 278 VALKASEEDDDEIRAWREERMRNYPTRSNIQRKKEALEGKVARGELIDEDTRKRQQRMKE 337
Query: 344 ILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTM 378
ILAKQAELGV VAE+P++YL S K+ R R ++
Sbjct: 338 ILAKQAELGVPVAEVPAHYLSQSYKEDRGRGRKSL 372
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 18/129 (13%)
Query: 379 PLNKRGRFQNNYDKRGRYKR--KDQFSKEQKLADKDSSNTSSFN---------KKKPTLL 427
P +K G+ Q + +RGR K+ K F+ K N + K+ PTLL
Sbjct: 461 PSDKPGKTQCYFYQRGRCKKGFKCHFAHTGPRGKKGGDNEAGRGGPKGVGVEKKRPPTLL 520
Query: 428 QKLLSADVKRDKSHLLQTFRFMAINSFFKDWP-------EKPVNFPLVMVKDGGAAGEVV 480
KLL ++ RDKS++LQ R+ N+F +WP E P + GEVV
Sbjct: 521 SKLLQGEINRDKSYILQCLRYFVNNNFLLNWPNQLSESIELPEQSDDAHESSDYSEGEVV 580
Query: 481 EEKSLLLGE 489
E +LL +
Sbjct: 581 EAPDVLLAD 589
>gi|219363611|ref|NP_001136910.1| uncharacterized protein LOC100217067 [Zea mays]
gi|194697570|gb|ACF82869.1| unknown [Zea mays]
Length = 138
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 40/47 (85%)
Query: 422 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV 468
++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++P+ FP V
Sbjct: 5 REPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRPLQFPSV 51
>gi|255546771|ref|XP_002514444.1| conserved hypothetical protein [Ricinus communis]
gi|223546440|gb|EEF47940.1| conserved hypothetical protein [Ricinus communis]
Length = 123
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 62/129 (48%), Gaps = 39/129 (30%)
Query: 340 QLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRK 399
+LKE LAKQAELGVEVAEIPS YL DS+KQV+
Sbjct: 28 RLKETLAKQAELGVEVAEIPSRYLSDSEKQVK---------------------------- 59
Query: 400 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRD-KSHLLQTFRFMAINSFF--- 455
L +KDS+N SSF+ KPTL QKLLSA+ K S LQ FF
Sbjct: 60 ------DTLTNKDSTNGSSFSNTKPTLRQKLLSAEGKGAFLSECLQASSTFTCEQFFSSN 113
Query: 456 KDWP-EKPV 463
K W KPV
Sbjct: 114 KLWNFTKPV 122
>gi|328766403|gb|EGF76457.1| hypothetical protein BATDEDRAFT_28378 [Batrachochytrium
dendrobatidis JAM81]
Length = 411
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 25/185 (13%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTE---KQSNSDVID---KEAKMRREQLKEILAK- 347
T ++++QW +ER++ +PT++NI +K+ + +Q ++ID K+ K R ++++ K
Sbjct: 217 TPKDIEQWIQERKRKFPTESNIKQKMEDEKQRQKQGNIIDVNGKKLKGRNADKRQLIRKN 276
Query: 348 ---QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRG--RYKRKDQF 402
Q + G + + YL D+ +T+ L+ G+ + R+ +
Sbjct: 277 DTVQEKHGTSEQKDSTIYLDDASNAKLEASHDTLALDMTGKLNQHKGASSCCRHGSNCRH 336
Query: 403 SKE-QKLADKDSSNTSSFNKKKPT-----------LLQKLLSADVKRDKSHLLQTFRFMA 450
S + KLA+ S+ + PT LL LL D K+DK+ LLQ RF+
Sbjct: 337 SHDLSKLAESRSA-PKTHTPLNPTAAPIKGTNRRPLLNMLLETDFKKDKNALLQCIRFII 395
Query: 451 INSFF 455
N FF
Sbjct: 396 ENKFF 400
>gi|384254158|gb|EIE27632.1| hypothetical protein COCSUDRAFT_55624 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 33/50 (66%)
Query: 417 SSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFP 466
S +++PTLLQKLL+ D++ + S LLQ RF+ N+F D+ P+ FP
Sbjct: 313 SPVQQRQPTLLQKLLAPDIRAEHSRLLQCLRFLVTNNFLLDFGRAPLIFP 362
>gi|308800704|ref|XP_003075133.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
(ISS) [Ostreococcus tauri]
gi|116061687|emb|CAL52405.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
(ISS) [Ostreococcus tauri]
Length = 401
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 84/204 (41%), Gaps = 60/204 (29%)
Query: 288 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAK 347
K + + ++V+++ EER K++P+ +++ Q KEILAK
Sbjct: 144 KPVDVASLPEDVRRYREERAKHWPSDSHV-----------------------QAKEILAK 180
Query: 348 QAELG--------VEVAEIPSYYLLDSKKQVRARVENTMPLN---KRGRF---------- 386
Q E+G E + + +K++ R + P K RF
Sbjct: 181 QREMGHFEASQEIGEEGDGERTATMTTKREQRVGEQEKKPSGDSVKVCRFWLQGGCRKGD 240
Query: 387 --------------QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLS 432
+ + RGR K + + DK +S SS N + TLL+KLL
Sbjct: 241 ACDFKHEAGPNSDQRCRFFARGRCKAGKRCPFRHDIVDKKTSGVSSGNPQ--TLLKKLLH 298
Query: 433 ADVKRDKSHLLQTFRFMAINSFFK 456
++KRD+ LLQ FRF+ N FF+
Sbjct: 299 KEIKRDEERLLQLFRFLVNNDFFE 322
>gi|145343958|ref|XP_001416510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576735|gb|ABO94803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 291
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 100/223 (44%), Gaps = 44/223 (19%)
Query: 297 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG---- 352
++V+++ EER K +P+ N+ K + + RRE+L+EILAKQ E+G
Sbjct: 28 EDVRRYREERAKYWPSDRNVRAKTEAGEEEE----RARAARRERLREILAKQREMGHFEA 83
Query: 353 -VEVAE----IPSYYLLDSKKQVRARVENTMPLNKRGRF--------------------- 386
E+ E + + LD K V AR + +K RF
Sbjct: 84 SQEIGEEEGGVAAPAALDGK-GVEARAADG---SKVCRFWLQGGCRKGSACDFKHESAPN 139
Query: 387 ---QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 443
+ + RGR K + + ++ ++ SS + TLL+KLL ++KRD+ LL
Sbjct: 140 KDQKCRFFARGRCKAGARCPFKHEVTERKSSAADGGGNPQ-TLLKKLLDKEIKRDEQRLL 198
Query: 444 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 486
Q FRF N FF K V+ PL M + +V++ ++ L
Sbjct: 199 QLFRFFVNNDFFTG--TKGVDEPLWMFPWADSGKKVIDRRAAL 239
>gi|118358218|ref|XP_001012358.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila]
gi|89294124|gb|EAR92112.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila
SB210]
Length = 762
Score = 45.8 bits (107), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 41/63 (65%)
Query: 398 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 457
++D++S+E L+ SN+S+F K TLL L+ ++ R++++LLQ R++ N+FF +
Sbjct: 574 KRDKYSQELDLSFNYKSNSSNFRYKANTLLTNLVLDEIYRERNYLLQAIRYIVSNNFFDE 633
Query: 458 WPE 460
E
Sbjct: 634 QAE 636
>gi|307107626|gb|EFN55868.1| hypothetical protein CHLNCDRAFT_52119 [Chlorella variabilis]
Length = 684
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 2/70 (2%)
Query: 298 EVKQWCEERRKNYPTKANINKKLTEKQSN--SDVIDKEAKMRREQLKEILAKQAELGVEV 355
EV+QW +ER++++PT + +K E ++ +D +R+++L ILA+Q+ +G+
Sbjct: 388 EVEQWIKERKRSFPTSEAVARKEAEAEARRQRGELDAARTLRQQRLAAILAQQSAMGLSR 447
Query: 356 AEIPSYYLLD 365
A LLD
Sbjct: 448 AAGTLDMLLD 457
>gi|68073975|ref|XP_678902.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56499510|emb|CAH94055.1| conserved hypothetical protein [Plasmodium berghei]
Length = 611
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 90/183 (49%), Gaps = 18/183 (9%)
Query: 282 PQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQL 341
EK K+ + +++E+++W EER+KNYPTK KK+ + ++N++ I+ E K ++ +
Sbjct: 425 SHEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMDIKNNNN-INSETKEKKNKE 479
Query: 342 KEILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRG 394
K+I + +L VE + Y +K + + N + N N Y
Sbjct: 480 KKITSLIEKLLVEKYSSAIGRNIYYQTQSQRKSIFVPILNKLIENNHKNVYENTYYSISD 539
Query: 395 RYKRKD-QFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 452
+Y +K+ +F+K K + DS N K L+ +L+ D+ + L+Q F+ N
Sbjct: 540 KYNKKNIKFNKRVKKHNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEFIVNN 595
Query: 453 SFF 455
+FF
Sbjct: 596 NFF 598
>gi|392567649|gb|EIW60824.1| hypothetical protein TRAVEDRAFT_71098 [Trametes versicolor
FP-101664 SS1]
Length = 590
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 15/121 (12%)
Query: 296 EQEVKQWCEERRKNYPTKANINKKL----TEKQSNSDVIDKEAKMRREQLKEILAKQAEL 351
EQ ++++ EE++ + + N + + TE +S D+ A R E+ +I+AK EL
Sbjct: 211 EQALQRFIEEQKTSALQRFNFHMAVLIWETEARSAKATGDEAA--RSERHADIIAKLVEL 268
Query: 352 GVEVAEIPSY-----YLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQ 406
G + +EIP+Y LLD +++ R+ NT+ R + ++R R + +D F K+
Sbjct: 269 GYQSSEIPAYDHAFQRLLDQPRKLTPRIWNTI----RPKLVEMLEERRRAREQDAFKKKW 324
Query: 407 K 407
+
Sbjct: 325 R 325
>gi|126337727|ref|XP_001369929.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Monodelphis domestica]
Length = 469
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)
Query: 353 VEVAEIPSYYLLDSKKQVR-ARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 411
V ++++P D KKQ R + V T L K +N KRG Y++K
Sbjct: 358 VILSDVPQNQSEDIKKQERNSPVAKTKSLRKNWMRKNCRYKRGLYRKKP----------- 406
Query: 412 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 455
S K++P LL+ LL+ D++ +++ +LQ R++ N FF
Sbjct: 407 ----VSEPQKRRPHLLEMLLAQDIRHERNVILQCVRYIIQNDFF 446
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKK 319
T +E+++W EERRKN+PT+ANI KK
Sbjct: 178 TPEEIEKWREERRKNFPTRANIEKK 202
>gi|255088323|ref|XP_002506084.1| predicted protein [Micromonas sp. RCC299]
gi|226521355|gb|ACO67342.1| predicted protein [Micromonas sp. RCC299]
Length = 567
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 425 TLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 456
TLL+KLL+ +V+ D+S LLQ FRF+ N F +
Sbjct: 458 TLLRKLLAREVRADRSRLLQVFRFLVNNDFLR 489
>gi|17540726|ref|NP_499991.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
gi|351063723|emb|CCD71947.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
Length = 213
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 442 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 500
L+ F ++ FF +D P+ N L +V+D G E+ GEE S G+ RR V
Sbjct: 93 LIGAFIYVLAGEFFAQDSPQTIFNKALALVRDDGRCQEIFGASIAGFGEETSRGR-RRHV 151
Query: 501 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 554
H +++++ R R+ +++G DE E E D Q + +Y+ K HV ++
Sbjct: 152 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 211
>gi|303286377|ref|XP_003062478.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455995|gb|EEH53297.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 552
Score = 42.0 bits (97), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 26/35 (74%)
Query: 423 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 457
+PTLL+KL + +V+ +S LLQ FRF+ N+F +D
Sbjct: 460 QPTLLKKLFAKEVRVSRSRLLQVFRFLVNNNFLRD 494
>gi|313219540|emb|CBY30463.1| unnamed protein product [Oikopleura dioica]
gi|313226347|emb|CBY21491.1| unnamed protein product [Oikopleura dioica]
Length = 310
Score = 42.0 bits (97), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 35/49 (71%), Gaps = 1/49 (2%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 343
TE+++++W EERRKN+PTKA I +K +++ + + K+ +REQ +E
Sbjct: 96 TEEDIRKWREERRKNFPTKARIAEKEAARKAGTSIETKKFS-KREQERE 143
>gi|341888654|gb|EGT44589.1| hypothetical protein CAEBREN_21252 [Caenorhabditis brenneri]
Length = 219
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 442 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 500
L+ F ++ FF +D P+ N L +V+D G ++ GEE S G+ RR V
Sbjct: 99 LIGAFIYVLCGEFFAQDSPQTIFNKALAIVRDDGRCQDIFGATIAGFGEETSRGR-RRHV 157
Query: 501 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 554
H +++++ R R+ +++G DE E E D Q + +Y+ K HV ++
Sbjct: 158 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 217
>gi|295665248|ref|XP_002793175.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278089|gb|EEH33655.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 764
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 298 EVKQWCEERRKNYPTKANINKKL--TEKQSNSDVIDKEAKMRRE 339
++ W EER+K YPTKA I ++L EKQ + K+AK RE
Sbjct: 356 DIAAWIEERKKRYPTKARIEERLKEAEKQKKASREAKDAKRARE 399
>gi|82594577|ref|XP_725484.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480507|gb|EAA17049.1| Zinc finger, C2H2 type, putative [Plasmodium yoelii yoelii]
Length = 601
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 91/192 (47%), Gaps = 37/192 (19%)
Query: 282 PQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQL 341
EK K+ + +++E+++W EER+KNYPTK KK+ E ++N+ I+ E K ++
Sbjct: 418 SHEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMEMKNNN--INSEEKEKKNXQ 471
Query: 342 KEILAKQAELGVEVAEIP----SYYLLDSKKQ------VRARVENTMPLNKRGRFQNNYD 391
K+I + +L VE YY S+K+ + +EN N + ++N Y
Sbjct: 472 KKITSLIEKLLVEKYSSAIGRNIYYQTQSQKKSIFVPLLNKLIEN----NYKNVYENTY- 526
Query: 392 KRGRYKRKDQFSKEQKLADK--------DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 443
Y D+++++ +K DS N K L+ +L+ D+ + L+
Sbjct: 527 ----YSISDKYNQKNIKFNKKIKKNNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLM 578
Query: 444 QTFRFMAINSFF 455
Q F+ N+FF
Sbjct: 579 QCIEFIVNNNFF 590
>gi|345325138|ref|XP_001514551.2| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Ornithorhynchus anatinus]
Length = 446
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 12/51 (23%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 345
T +E+ +W EERRKN+PT ANI KK K ++ REQ E+L
Sbjct: 125 TPEEIAKWREERRKNFPTMANIEKK------------KALQLDREQRGEVL 163
>gi|291235450|ref|XP_002737657.1| PREDICTED: nuclear fragile X mental retardation protein interacting
protein-like [Saccoglossus kowalevskii]
Length = 639
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 26/35 (74%)
Query: 421 KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 455
KKKPTLL+ LL+ D++ +++ LLQ R++ +FF
Sbjct: 458 KKKPTLLEMLLAGDIRHERNVLLQCVRYVKKKNFF 492
>gi|158297378|ref|XP_001237955.2| AGAP007875-PA [Anopheles gambiae str. PEST]
gi|157015166|gb|EAU76388.2| AGAP007875-PA [Anopheles gambiae str. PEST]
Length = 474
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 425 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 455
T+L+KLL AD++ +++ LLQ R++ +N FF
Sbjct: 416 TMLEKLLDADIRHERNVLLQCVRYVLVNKFF 446
>gi|67474753|ref|XP_653120.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56470045|gb|EAL47734.1| hypothetical protein EHI_019110 [Entamoeba histolytica HM-1:IMSS]
gi|449703311|gb|EMD43787.1| Hypothetical protein EHI5A_022200 [Entamoeba histolytica KU27]
Length = 129
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 29/38 (76%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 332
TE+++++W EER+K +PTKA K L+E++ +D I+K
Sbjct: 2 TEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39
>gi|344301187|gb|EGW31499.1| hypothetical protein SPAPADRAFT_154739 [Spathaspora passalidarum
NRRL Y-27907]
Length = 212
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 35/55 (63%), Gaps = 2/55 (3%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQA 349
TE+++ +W EER+K +P+K + KKL +KQ S++ KE + R + KE K+
Sbjct: 59 TEEDIAKWIEERKKKWPSKQVVAKKLEQKQ--SEIQQKEDRKRPAESKEAPGKRT 111
>gi|407040593|gb|EKE40213.1| hypothetical protein ENU1_097960 [Entamoeba nuttalli P19]
Length = 129
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 29/38 (76%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 332
TE+++++W EER+K +PTKA K L+E++ +D I+K
Sbjct: 2 TEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39
>gi|7305335|ref|NP_038773.1| nuclear fragile X mental retardation-interacting protein 1 [Mus
musculus]
gi|81917719|sp|Q9QXX8.1|NUFP1_MOUSE RecName: Full=Nuclear fragile X mental retardation-interacting
protein 1; AltName: Full=Nuclear FMRP-interacting
protein 1
gi|6525073|gb|AAF15316.1| nuclear FMRP interacting protein 1 [Mus musculus]
gi|26328149|dbj|BAC27815.1| unnamed protein product [Mus musculus]
gi|33525186|gb|AAH56192.1| Nuclear fragile X mental retardation protein interacting protein 1
[Mus musculus]
gi|74220737|dbj|BAE33625.1| unnamed protein product [Mus musculus]
gi|148703863|gb|EDL35810.1| nuclear fragile X mental retardation protein interacting protein 1
[Mus musculus]
Length = 484
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)
Query: 269 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 319
K ++F N M+ P KK K T +E+ +W EERRKNYPT ANI +K
Sbjct: 202 KIVQFHWRN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIERK 247
>gi|327283872|ref|XP_003226664.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Anolis carolinensis]
Length = 451
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 8/50 (16%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 344
T +E+ +W EERRKN+PT ANI KK + KE + R E LK +
Sbjct: 92 TPEEIAKWREERRKNFPTLANIEKK--------KALQKEKEQRGEVLKTL 133
>gi|225679220|gb|EEH17504.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 766
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 298 EVKQWCEERRKNYPTKANINKKL--TEKQSNSDVIDKEAKMRRE 339
++ W EER+K YPT+A I ++L EKQ + K+AK RE
Sbjct: 358 DIAAWIEERKKRYPTRARIEERLKEAEKQKKASREAKDAKRARE 401
>gi|255554857|ref|XP_002518466.1| conserved hypothetical protein [Ricinus communis]
gi|223542311|gb|EEF43853.1| conserved hypothetical protein [Ricinus communis]
Length = 228
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 56/112 (50%), Gaps = 26/112 (23%)
Query: 43 IMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGN- 101
IMNPQ+ FP +N++ + +P + QN F GLPQLG H N N
Sbjct: 65 IMNPQIQFPFNNNS--SVSPFITQNQF---------------GLPQLGFSPLNHLNHLNT 107
Query: 102 MPLNVSQ---GQLLGPNLS-NLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSN 149
+P+ V+Q QL N PQQ N NMGF N LQ++N L +QM N
Sbjct: 108 VPMFVNQFNPSQLQPQQFPFNFPQQLNHNMGFPN----LQNMNPPLPIQMPN 155
>gi|156099300|ref|XP_001615652.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804526|gb|EDL45925.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 589
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 47/208 (22%), Positives = 86/208 (41%), Gaps = 36/208 (17%)
Query: 285 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 344
K K + +E+++W +ER+KNYPT+ I L ++ K+ E+L
Sbjct: 406 KNDKDEVILSDSKEIEKWIQERKKNYPTREKIADSLKNNENGKKKKKKKPNCLIEELL-F 464
Query: 345 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKR---------GR 395
+ + +G + Y+ +K + + M N F N Y+K G+
Sbjct: 465 ESYCSAVGRNI-----YFKSKWEKSIFIPLLTKMEQNN---FHNIYEKSYYPLFNNSMGK 516
Query: 396 YKRKDQFSKEQKLADKDSSNTSSFN-KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSF 454
+ +K+Q+ K+ S N +KP LL +L+ ++ + L++ F+ + F
Sbjct: 517 FFP----TKQQRRPFKERQGIDSLNIHRKPPLLYQLMKKEIYLYEKRLIKCIEFITRSGF 572
Query: 455 FKDWPEKPVNFPLVMVKDGGAAGEVVEE 482
F D +GGAAG + E
Sbjct: 573 FDD-------------SEGGAAGTDILE 587
>gi|354543450|emb|CCE40169.1| hypothetical protein CPAR2_102070 [Candida parapsilosis]
Length = 253
Score = 39.3 bits (90), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKK 319
TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 77 TEEDIAKWIEERKKNWPTRANVERK 101
>gi|395527571|ref|XP_003765917.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1 [Sarcophilus harrisii]
Length = 480
Score = 39.3 bits (90), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKK 319
T +E+++W EERRKN+PT+ANI KK
Sbjct: 187 TPEEIEKWREERRKNFPTRANIEKK 211
>gi|448516074|ref|XP_003867484.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis Co 90-125]
gi|380351823|emb|CCG22046.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis]
Length = 243
Score = 39.3 bits (90), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKK 319
TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 69 TEEDIAKWIEERKKNWPTRANVERK 93
>gi|71028524|ref|XP_763905.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350859|gb|EAN31622.1| hypothetical protein TP04_0270 [Theileria parva]
Length = 561
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 64/150 (42%), Gaps = 19/150 (12%)
Query: 22 PQMAFRPNMLQNSVPMQPQTGIMNPQVPFP-LSNSNMPATAPLMNQNNFMNASNPLLIAQ 80
PQM NM Q PM P + + P VP P P P+MNQ N S + Q
Sbjct: 109 PQMNHSVNMNQTQQPMIPHSNMPIPHVPHPGFPVMGFPVFQPMMNQPNMGQPS----MGQ 164
Query: 81 SNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFN-QNMGFSNGQMCLQSV 139
N +G P +G N G +N+S+ + P P +F ++MG S M Q+V
Sbjct: 165 PN-MGQPSMG-----QPNMGQANMNMSEFAYVPP-----PMRFTGKDMG-SVDHMTPQAV 212
Query: 140 NQLLSMQMSNP-CLFGNPQFGLVNQNQQNF 168
LS Q P FG+ Q G+ N F
Sbjct: 213 LGRLSSQNLPPMSAFGHHQSGIPPTNYNQF 242
>gi|167387090|ref|XP_001738022.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165898944|gb|EDR25671.1| hypothetical protein EDI_290620 [Entamoeba dispar SAW760]
Length = 137
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 31/45 (68%)
Query: 288 KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 332
K + + TE+++++W EER+K +PTKA K L E++ +D I+K
Sbjct: 3 KFIKSSMTEEQLQKWIEERKKRFPTKAKREKCLFEEKKKTDEIEK 47
>gi|12843948|dbj|BAB26177.1| unnamed protein product [Mus musculus]
Length = 249
Score = 38.9 bits (89), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)
Query: 269 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 319
K ++F N M+ P KK K T +E+ +W EERRKNYPT ANI +K
Sbjct: 202 KIVQFHWRN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIERK 247
>gi|448079221|ref|XP_004194343.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
gi|359375765|emb|CCE86347.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
Length = 234
Score = 38.9 bits (89), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 14/42 (33%), Positives = 29/42 (69%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKM 336
TE+++K+W EERR+N+P++ N+ K ++ + + D++ K
Sbjct: 72 TEEDIKKWIEERRRNWPSRKNVEAKKEKESAGAPANDEQIKT 113
>gi|195995467|ref|XP_002107602.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
gi|190588378|gb|EDV28400.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
Length = 444
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 26/38 (68%), Gaps = 3/38 (7%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 332
+ +++K W E+RRKNYPT AN+ KK KQ D I++
Sbjct: 205 SPEDIKAWREQRRKNYPTAANVEKK---KQEALDRIER 239
>gi|260817126|ref|XP_002603438.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
gi|229288757|gb|EEN59449.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
Length = 408
Score = 38.5 bits (88), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 288 KSLALTYTEQEVKQWCEERRKNYPTKANINKK 319
K+L+ T + + +W E+RRKNYPT AN+ KK
Sbjct: 58 KALSKLDTPEAIAKWREDRRKNYPTVANVAKK 89
>gi|395834849|ref|XP_003790401.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1 [Otolemur garnettii]
Length = 514
Score = 38.5 bits (88), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 5/50 (10%)
Query: 269 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINK 318
K ++F N M+ P KK K T +E+ +W EERRKNYPT ANI +
Sbjct: 211 KIVQFHWKN-MHAPGMKKIK----LDTPEEIARWREERRKNYPTLANIER 255
>gi|260945323|ref|XP_002616959.1| hypothetical protein CLUG_02403 [Clavispora lusitaniae ATCC 42720]
gi|238848813|gb|EEQ38277.1| hypothetical protein CLUG_02403 [Clavispora lusitaniae ATCC 42720]
Length = 221
Score = 38.5 bits (88), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 295 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNS 327
TE+++ +W EERRKN+PT+ NI K + +S +
Sbjct: 76 TEEDIAKWIEERRKNWPTRKNIENKKAQLESKA 108
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.129 0.368
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,077,724,232
Number of Sequences: 23463169
Number of extensions: 418827682
Number of successful extensions: 1532800
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 161
Number of HSP's successfully gapped in prelim test: 2240
Number of HSP's that attempted gapping in prelim test: 1509835
Number of HSP's gapped (non-prelim): 16519
length of query: 562
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 414
effective length of database: 8,886,646,355
effective search space: 3679071590970
effective search space used: 3679071590970
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 80 (35.4 bits)