BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015654
(403 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa]
gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa]
Length = 659
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 176/393 (44%), Positives = 223/393 (56%), Gaps = 67/393 (17%)
Query: 44 QRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQK 102
Q N +NGQ S KS +K SGKN NR + SQ G QKS F+ M NGKRK F N+
Sbjct: 301 QGNTARNGQSSTPKSKWEKPSGKNLKNNRNRGRSQSGHQKSDFNCMDNGKRKLEFSNEHG 360
Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRK-SLALTYTEQEVKQWCEERRKNYPTKANINKKL 161
KG+GN + KFG + +Q E+K K S YTEQE+KQW E RRK+YPTK +I KK
Sbjct: 361 RKGNGNERVAKFGRTDLTDQATEEKSKPSRTFFYTEQEIKQWRESRRKHYPTKTSIEKKQ 420
Query: 162 TEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-------------- 207
E VID+EA R +QLKEILAKQAELGVEVAEIP YLLDS
Sbjct: 421 ME------VIDREANFRHKQLKEILAKQAELGVEVAEIPPDYLLDSEKLGVEVAEIPPPQ 474
Query: 208 -----------------KKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLA 250
+KQ R +N L K+G+F N +DKRGR+ +K + +K+ A
Sbjct: 475 VLSSEKLGVEVAEIPPPQKQEHGREDNRRSLTKKGKFWNKHDKRGRFNKKGRSAKQVGSA 534
Query: 251 DKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVM 310
+++ +KPTLL+KLLS D+KRDK LLQ FRFM NSFFKDWPEKP+ FP V+
Sbjct: 535 NEE---------RKPTLLEKLLSTDIKRDKRQLLQVFRFMVANSFFKDWPEKPLKFPSVV 585
Query: 311 VKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFEDENDDEEED 370
VK+ G E+VE+KS L+GEE SE + + E+F + DD E
Sbjct: 586 VKEDGYEDEIVEKKSSLVGEEVSEDRNNTIAENF-----------------GDRDDNIEH 628
Query: 371 NDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
+ E ++ GK + DE R+ EEGEI+D
Sbjct: 629 DAQVELGNCFVRGKCDIVDEVDRV--EEGEIVD 659
>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
Length = 605
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 161/317 (50%), Positives = 200/317 (63%), Gaps = 16/317 (5%)
Query: 44 QRNFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQK 102
Q NF+++ + S N K S KNF+ N + K G +KS H M NG+ K G N+ +
Sbjct: 250 QGNFVKDAGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENR 309
Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT 162
KG N A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK
Sbjct: 310 GKGLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSA 369
Query: 163 EKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MP 220
EK +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN
Sbjct: 370 EKLTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKA 429
Query: 221 LNKRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTL 267
K+ RFQN +KR R+ RK D F+K+Q+LA DS SN NKKK TL
Sbjct: 430 FGKKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTL 489
Query: 268 LQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 327
LQKLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G GEVV+ KS
Sbjct: 490 LQKLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSP 549
Query: 328 LGEEDSEGKGRRMVEHF 344
+ +G + E F
Sbjct: 550 TSKGVPQGGRKTXAEEF 566
>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa]
gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa]
Length = 674
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 185/410 (45%), Positives = 223/410 (54%), Gaps = 81/410 (19%)
Query: 44 QRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQ-GFQKSHFHQMHNGKRKFGFYNDQK 102
Q N +NGQGS S SGKNF N +E SQ G QKS FH+M NGKRK GF N
Sbjct: 296 QGNPAKNGQGSTPNSKQGIFSGKNFKNNPKREQSQSGHQKSEFHRMDNGKRKLGFSNKHG 355
Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT 162
KG GN +A KFG ++ NQ E+KR + YTEQE+KQW E RRK++PTK NI KK T
Sbjct: 356 GKGKGNERAAKFGRSDPSNQVMEQKRTHI---YTEQEIKQWRESRRKHFPTKTNIEKKQT 412
Query: 163 EKQSNSDVIDKEAKMRRE-------------QLKEILAKQAELGVEVAEIPSYY------ 203
EK +S VIDKEA RR+ QLKEILAKQAELGVEVAEIP Y
Sbjct: 413 EKLIDSGVIDKEANFRRKVLNALPRKLDDKLQLKEILAKQAELGVEVAEIPPEYMLDSEK 472
Query: 204 ------------------------------LLDSKKQVRARVENTMPLNKRGRFQNNYDK 233
LLDS+KQ R +N L K+GRF N +D+
Sbjct: 473 LGVEVAETPLSYLLDSEKLGVEAAEIPPHHLLDSEKQEHGREDNRRSLTKKGRFWNKHDR 532
Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 293
RGRYKRK + + L +++ +KPTLL+KLL AD+KRDK LLQ FRFM N
Sbjct: 533 RGRYKRKGRSDMQLGLENEE---------RKPTLLEKLLCADIKRDKHRLLQVFRFMVAN 583
Query: 294 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERM 353
SFFKDW +KP+ FP V+VK+ G E EEK L+GEE+SE VE F
Sbjct: 584 SFFKDWSDKPLKFPSVVVKEDGCKDEPQEEKPSLVGEEESEVPNNTTVEDF--------- 634
Query: 354 HHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
+ DD +E + E + GK + DE R+ EEGEIID
Sbjct: 635 --------GDRDDGDEHDAQVEPGNGSVMGKCDIVDEVYRV--EEGEIID 674
>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis]
gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis]
Length = 273
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 128/222 (57%), Positives = 169/222 (76%), Gaps = 4/222 (1%)
Query: 125 EKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 184
+++ +S AL YTEQE+K W EERRKN+P+KANI K +E+ +NS IDKEAK+RRE+LKE
Sbjct: 5 QRENRSPALIYTEQEIKLWREERRKNFPSKANIEKMCSERVTNSGGIDKEAKLRRERLKE 64
Query: 185 ILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFS 244
ILAKQAELGVEVAEIPS+YL DS+KQV+ + ++ K+GR ++ +D+RGRY +KD +
Sbjct: 65 ILAKQAELGVEVAEIPSHYLSDSEKQVKVKKDSRRSAPKKGRSRHKHDRRGRYNKKDGLT 124
Query: 245 KEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPV 304
++ LA KDSSN SSF++ KPTLLQKLLSA+V++DK HLLQ FRFM +NSFF D PEKP+
Sbjct: 125 QQNTLAKKDSSNGSSFSRTKPTLLQKLLSAEVRKDKHHLLQVFRFMVMNSFFDDGPEKPL 184
Query: 305 NFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEH 346
FP V+VK+ EVV EK+ + G+ SE + VE+ H
Sbjct: 185 KFPSVVVKE----DEVVAEKTSITGKGISEVSNKTAVENSGH 222
>gi|359487059|ref|XP_002271061.2| PREDICTED: uncharacterized protein LOC100252331 [Vitis vinifera]
Length = 1031
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 139/257 (54%), Positives = 170/257 (66%), Gaps = 15/257 (5%)
Query: 105 GHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEK 164
G N A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK EK
Sbjct: 738 GLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEK 797
Query: 165 QSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLN 222
+NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN
Sbjct: 798 LTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFG 857
Query: 223 KRGRFQNNYDKRGRYKRK--------DQFSKEQKLADKDS-----SNTSSFNKKKPTLLQ 269
K+ RFQN +KR R+ RK D F+K+Q+LA KDS SN NKKK TLLQ
Sbjct: 858 KKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGKDSGDMNASNQPPLNKKKQTLLQ 917
Query: 270 KLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLG 329
KLLS D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV+VK+ G GEVV+ KS
Sbjct: 918 KLLSTDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVVVKETGCQGEVVDRKSSPTS 977
Query: 330 EEDSEGKGRRMVEHFEH 346
+ +G + E F +
Sbjct: 978 KGVPQGGRKTAAEEFSN 994
>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
Length = 599
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 156/352 (44%), Positives = 214/352 (60%), Gaps = 45/352 (12%)
Query: 66 KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 125
+NF++N K GFQK+ H + N K+KFGF QK KG N + KF N +Q +E
Sbjct: 279 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 334
Query: 126 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 185
+KR SL+L YT+QE++QW E RRKNYP+ NI KKLT KQ+N ++DKEAK+ R++LKEI
Sbjct: 335 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 393
Query: 186 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV-------ENTMPLNKRGRFQNNYDK 233
LAKQAELGVEVAEIP YL D++KQ R E ++ ++ QN +K
Sbjct: 394 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASI---EKENSQNRLNK 450
Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 293
RGR K+K++ K+ K +K SN K++PTLLQKLL ADV++DKS LLQ RF +N
Sbjct: 451 RGRCKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMN 509
Query: 294 SFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRE 351
SFFK+WP KP+ FP V VK+ VV+E SL G + E +VE+ HD
Sbjct: 510 SFFKEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD---- 565
Query: 352 RMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
+ +N+++ +D++ E++K G G +++ EEEGEIID
Sbjct: 566 --------IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 599
>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
Length = 552
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 156/349 (44%), Positives = 212/349 (60%), Gaps = 39/349 (11%)
Query: 66 KNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQE 125
+NF++N K GFQK+ H + N K+KFGF QK KG N + KF N +Q +E
Sbjct: 232 RNFMRNSKK----GFQKNQTHHLKNEKKKFGFPGGQKEKGFHNERRNKFCGTNPTDQVKE 287
Query: 126 KKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 185
+KR SL+L YT+QE++QW E RRKNYP+ NI KKLT KQ+N ++DKEAK+ R++LKEI
Sbjct: 288 QKR-SLSLVYTDQEIRQWREARRKNYPSSTNIQKKLTGKQTNCTLVDKEAKLLRQELKEI 346
Query: 186 LAKQAELGVEVAEIPSYYLL-----DSKKQVRARV----ENTMPLNKRGRFQNNYDKRGR 236
LAKQAELGVEVAEIP YL D++KQ R E ++ QN +KRGR
Sbjct: 347 LAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGGRSTLGEEAEEASIEKENSQNRLNKRGR 406
Query: 237 YKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
K+K++ K+ K +K SN K++PTLLQKLL ADV++DKS LLQ RF +NSFF
Sbjct: 407 CKKKNRPRKKGKF-EKHLSNKPPLKKREPTLLQKLLKADVRKDKSQLLQALRFTVMNSFF 465
Query: 297 KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEED-SEGKGRRMVEH-FEHDENRERMH 354
K+WP KP+ FP V VK+ VV+E SL G + E +VE+ HD
Sbjct: 466 KEWPNKPLKFPSVTVKENEGETNVVDETSLSTGNFNLQETNNNSLVENDGSHD------- 518
Query: 355 HNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
+ +N+++ +D++ E++K G G +++ EEEGEIID
Sbjct: 519 -----IDSDNENDIKDSNKDEKLK----GDG------IQVLEEEGEIID 552
>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana]
gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana]
gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana]
gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana]
Length = 470
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 150/347 (43%), Positives = 207/347 (59%), Gaps = 30/347 (8%)
Query: 2 LTIFHDSLANFAVKTFFEIVI-VHEFSIDLQLPLVNLCLLLLLQRNFIQNGQG------- 53
+ + +L ++ FF+ + H FS QL N L N QN Q
Sbjct: 108 FAVANSNLMGHSLPNFFQPSLEPHAFSSRPQLNSFN-SLPYPPVPNPHQNHQSGPPGFSE 166
Query: 54 ------SISKSNSKKVSGKNFIKNRTKE-----FSQGFQKSHFHQMHNGKRKFGFYNDQK 102
S+ +N +G +F K QGFQ+ HQ NGKRK GF D +
Sbjct: 167 PRPQGQSVDNTNGSGPNGNDFRNKFPKHQNFKGPGQGFQRPQLHQADNGKRKSGFNKDHR 226
Query: 103 PKGHGNGKAIKFGL--ANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKK 160
G GN +K GL ++ N +EKKR S AL YT +EV+QW E RRKNYPTK + KK
Sbjct: 227 --GKGNNNKMKTGLDGSDTGNIAKEKKR-SYALMYTPREVQQWREARRKNYPTKFLVEKK 283
Query: 161 LTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMP 220
+ +K ++ ++D+EAKMRR+QL+E+LAKQAELGVEVAE+PS+YL ++ +QV N
Sbjct: 284 V-KKNVSASILDEEAKMRRQQLREVLAKQAELGVEVAEVPSHYLSNNDEQVNGDRGNNN- 341
Query: 221 LNKRGRFQNNYDKRGRYKRKDQF-SKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRD 279
++GRFQNN + R+ RKD+F +K+ +L DK SS SS +KPTLL+KLLSAD+KRD
Sbjct: 342 -GRKGRFQNNRRNKRRHDRKDKFDNKKPRLEDKKSSQDSSITTRKPTLLEKLLSADIKRD 400
Query: 280 KSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSL 326
KS LLQ FRFM +NS K++PE+P+ PL+ VK+ G + +E+ S+
Sbjct: 401 KSQLLQVFRFMVMNSLLKEFPEQPLKLPLITVKETGCE-DAMEDPSI 446
>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
lyrata]
gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/239 (49%), Positives = 170/239 (71%), Gaps = 6/239 (2%)
Query: 78 QGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGL-ANQMNQPQEKKRKSLALTYT 136
QGFQ+ HQ NGKRK GF D + G GN +K GL + + ++KR+S AL YT
Sbjct: 203 QGFQRPQLHQADNGKRKSGFNKDHR--GKGNYNKMKNGLDGSDADNIAKEKRRSYALMYT 260
Query: 137 EQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEV 196
++V QW E RRKN+PT+ N+ KK+ +K ++ ++D+EAKMRR+QL+E+LAKQAELG+EV
Sbjct: 261 PKDVNQWREARRKNFPTRLNVEKKV-KKNVSASILDEEAKMRRQQLREVLAKQAELGIEV 319
Query: 197 AEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSN 256
A++PS+YL ++ ++V +N ++ +FQNN K+ R+ RKD+F K +L DK+SS
Sbjct: 320 ADVPSHYLSNTDERVHG--DNGANDGQKRKFQNNRHKQRRHGRKDKFDKTPRLDDKNSSQ 377
Query: 257 TSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGG 315
S KKPTLL+KLLSA++KRDK HLLQ FRFM +NSF K++PE+P+ PL+ V++ G
Sbjct: 378 ESPMTTKKPTLLEKLLSANIKRDKIHLLQVFRFMVMNSFLKEFPEQPLKLPLITVEETG 436
>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
Length = 822
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 212/354 (59%), Gaps = 17/354 (4%)
Query: 55 ISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 112
+ SN K KNF KN+ F GFQKS FH ++NGK+ GF + KG +G+
Sbjct: 481 VPNSNWKGSPSKNF-KNKPNRGGFQAGFQKSKFHDVNNGKKGSGFPIEHNGKGPNSGRGG 539
Query: 113 KFGLANQMNQPQEKKR---KSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSD 169
+GL +P+E K+ +SL++TYT QE++QW E R+KN+P NI KK +E +
Sbjct: 540 HYGL-----KPKEHKQQPERSLSVTYTVQEIQQWREARKKNHPFNNNIQKKHSEHPKDRK 594
Query: 170 VIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN 229
I++E R +LKE+LAKQAELGVEVAEIPSYYL +S Q + E + +FQN
Sbjct: 595 AINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQA-LQSEGKNKFTDKRKFQN 651
Query: 230 NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRF 289
++K+ RK +F+K QK DKD S + S K+KPTLLQKLLS+DVKRDKSHL+Q FRF
Sbjct: 652 KFNKKS--DRKGRFAKRQKFDDKDFSESPSLKKRKPTLLQKLLSSDVKRDKSHLIQVFRF 709
Query: 290 MAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDEN 349
M +NSFFK +KP+ +PLV+VK+ G+ + EEK L G++ +G V+ N
Sbjct: 710 MVMNSFFKHCLDKPLRYPLVVVKEKGSEVD-GEEKYLHTGKDVLKGGNEETVQKIVTFNN 768
Query: 350 RERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
+ +D+ +D DN+ + + + ++ + + +EEEGEII+
Sbjct: 769 DNSHDCEDEDSDDDENDSIVDNNLHKDPSSLVKRQCDSGEGIGKFDEEEGEIIE 822
>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula]
Length = 553
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 152/355 (42%), Positives = 209/355 (58%), Gaps = 49/355 (13%)
Query: 54 SISKSNSKKVSGKNFIKNRTKEFSQG-FQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 112
+ + SN K KNF + + SQG FQKS F+ + GKR F + K KG N +A
Sbjct: 243 NFANSNWKGSPNKNFKNKQNRGGSQGGFQKSKFNDTNKGKR---FSKEHKGKGPNNERAG 299
Query: 113 KFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVID 172
FGL N QE KR S +LTY+EQE++QW E RRKN+P++ I K+ +E+ +S VID
Sbjct: 300 HFGL----NSKQEPKR-SFSLTYSEQEIQQWREARRKNHPSREKIEKQQSEQSKDSKVID 354
Query: 173 KEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--- 229
+E R +LKE+LAKQAELG+EVAEIPSYYL++ Q + + +F+N
Sbjct: 355 RELLQR--ELKEVLAKQAELGIEVAEIPSYYLMNGTNQGLQSEDKKNSFTDKRKFKNKLM 412
Query: 230 -NYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFR 288
N D+R R +K QK AD+D NKKKPTLLQKLLSAD+KRDKSHL Q FR
Sbjct: 413 RNPDRRSRNNKK------QKFADRDLLE----NKKKPTLLQKLLSADIKRDKSHLFQVFR 462
Query: 289 FMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDE 348
F+ NSFFKD+P+KP+ +P V VK+ G+ EV K L G G ++EH
Sbjct: 463 FITANSFFKDYPDKPLVYPPVSVKEMGS--EVYGGKKHLQG-------GEAVLEH----G 509
Query: 349 NRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID 403
+E + ++G ++ +D+E+E++D +G+ D +EEEGEII+
Sbjct: 510 TKEIVQKFVKGSDNGHDNEDEESD-----------EGDNEDGINEFDEEEGEIIE 553
>gi|115440647|ref|NP_001044603.1| Os01g0814000 [Oryza sativa Japonica Group]
gi|56785048|dbj|BAD82687.1| unknown protein [Oryza sativa Japonica Group]
gi|113534134|dbj|BAF06517.1| Os01g0814000 [Oryza sativa Japonica Group]
Length = 533
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/186 (48%), Positives = 121/186 (65%), Gaps = 2/186 (1%)
Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 187
RK + Y EVKQW E R+KNYPT NINKKL+E + + + D+EA+MRR++LKE+L
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKKLSESRPDGEKKDEEAQMRRQELKEVLE 340
Query: 188 KQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-KRGRYKRKDQFSKE 246
KQ ELG E+ E+P YL + + Q R N + RF N D KR RY R D SK
Sbjct: 341 KQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNKRSRYDRNDFQSKR 400
Query: 247 QKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVN 305
K+ ++ + + K ++PTLLQKLLS+DVKRD+ LL TF+FM +N+FF D+P+KP+
Sbjct: 401 PKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVLNNFFSDYPDKPLE 460
Query: 306 FPLVMV 311
FP V V
Sbjct: 461 FPSVKV 466
>gi|242054757|ref|XP_002456524.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
gi|241928499|gb|EES01644.1| hypothetical protein SORBIDRAFT_03g037800 [Sorghum bicolor]
Length = 568
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 3/188 (1%)
Query: 127 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 186
+RK + Y EVK+W E R+KNYPT N++KKL+E S+++ DK+A++RR++LKE+L
Sbjct: 295 RRKPPRIIYDANEVKRWVEARKKNYPTSVNVHKKLSEINSDNENKDKDAQLRRQELKEVL 354
Query: 187 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQFS 244
AKQ ELG ++ E+P YL ++ Q N + F N N +KR RY+R S
Sbjct: 355 AKQQELGFDLPELPPGYLSETGDQCIENKNNRKAQCRDSHFGNRSNNNKRPRYERGGFQS 414
Query: 245 KEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKP 303
K K+ ++ + K ++PTLLQKLLS+D+KRD+ LL F+FM +N+FFKDWP+KP
Sbjct: 415 KRSKVWNRTPCADDAMAKSREPTLLQKLLSSDIKRDRHRLLHVFKFMTLNNFFKDWPDKP 474
Query: 304 VNFPLVMV 311
+ FP V V
Sbjct: 475 LQFPSVKV 482
>gi|357125563|ref|XP_003564462.1| PREDICTED: uncharacterized protein LOC100829078 [Brachypodium
distachyon]
Length = 609
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/195 (44%), Positives = 120/195 (61%), Gaps = 15/195 (7%)
Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 187
RK + Y EVKQW E R+KNYPT N+NKKL++ QS+ D+EA+MRR +LKE+LA
Sbjct: 333 RKRPPICYDTNEVKQWVEARKKNYPTSVNVNKKLSQSQSDDQKKDEEAQMRRLELKEVLA 392
Query: 188 KQAELGVEVAEIPSYYLLDSKKQVRARVEN-----------TMPLNKRGRFQNNYDKRGR 236
KQ ELG E+ E+P YL +++ Q + N RGR +KR R
Sbjct: 393 KQKELGFELPELPPGYLSETEGQPKGPQGNDKESKWKTQQRDCRFGNRGR----GNKRQR 448
Query: 237 YKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
Y R D SK + + + + K++PTLLQKLLS+D+KRD+ LL TF+FMA+N+FF
Sbjct: 449 YDRADFQSKRPREWNNSRHDGGAVAKREPTLLQKLLSSDIKRDRHRLLHTFKFMALNNFF 508
Query: 297 KDWPEKPVNFPLVMV 311
++ P+KP+ FP V V
Sbjct: 509 RNLPDKPLEFPSVKV 523
>gi|223943225|gb|ACN25696.1| unknown [Zea mays]
gi|414880010|tpg|DAA57141.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
Length = 562
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 125/191 (65%), Gaps = 13/191 (6%)
Query: 127 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 186
+RK + Y EVKQW E R+KNYPT N+NKKL+E +S+++ DK+A++RR++LKE++
Sbjct: 290 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQELKEVI 349
Query: 187 AKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN-------NYDKRGRYKR 239
AKQ ELG + E+P YL + Q +EN N++ +F++ N +KR RY+R
Sbjct: 350 AKQQELGFDPPELPPGYLSEIGDQC---IENKN--NRKTQFRDSHFGNRFNNNKRSRYER 404
Query: 240 KDQFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 298
SK K+ + ++ ++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKD
Sbjct: 405 GGFQSKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKD 464
Query: 299 WPEKPVNFPLV 309
WP++P+ FP V
Sbjct: 465 WPDRPLQFPSV 475
>gi|218189267|gb|EEC71694.1| hypothetical protein OsI_04190 [Oryza sativa Indica Group]
Length = 500
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 122/199 (61%), Gaps = 15/199 (7%)
Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 174
RK + Y EVKQW E R+KNYPT NINK KL+E + + + D+E
Sbjct: 237 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 296
Query: 175 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYD-K 233
A+MRR++LKE+LAKQ ELG E+ E+P YL + + Q R N + RF N D K
Sbjct: 297 AQMRRQELKEVLAKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 356
Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHLLQTFRFMAI 292
R RY R D SK K+ ++ + + K ++PTLLQKLLS+DVKRD+ LL TF+FM +
Sbjct: 357 RSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRLLHTFKFMVL 416
Query: 293 NSFFKDWPEKPVNFPLVMV 311
N+FF D+P+KP+ FP V V
Sbjct: 417 NNFFSDYPDKPLEFPSVKV 435
>gi|296085358|emb|CBI29090.3| unnamed protein product [Vitis vinifera]
Length = 467
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 102/216 (47%), Positives = 133/216 (61%), Gaps = 11/216 (5%)
Query: 44 QRNFIQNGQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQK 102
Q NF+++ + S N K S KNF+KN + K G +KS H M N + K G N+ +
Sbjct: 244 QGNFVKDAGVNNSNPNWKNSSRKNFMKNPKGKNSHWGSRKSQLHHMQNARGKAGISNENR 303
Query: 103 PKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT 162
KG N A N Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK
Sbjct: 304 GKGLSNNMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSA 363
Query: 163 EKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MP 220
EK +NS+VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV R EN
Sbjct: 364 EKLTNSEVIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKA 423
Query: 221 LNKRGRFQNNYDKRGRYKRK--------DQFSKEQK 248
K+ RFQN +KR R+ RK D F+K+Q+
Sbjct: 424 FGKKERFQNRGNKRRRHDRKQWQRHDQEDGFTKKQR 459
>gi|293335934|ref|NP_001167960.1| uncharacterized protein LOC100381676 [Zea mays]
gi|223945137|gb|ACN26652.1| unknown [Zea mays]
gi|414880008|tpg|DAA57139.1| TPA: hypothetical protein ZEAMMB73_684772 [Zea mays]
Length = 295
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 127 KRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRRE-QLKEI 185
+RK + Y EVKQW E R+KNYPT N+NKKL+E +S+++ DK+A++RR+ +LKE+
Sbjct: 22 RRKPPPIIYDANEVKQWVEARKKNYPTSVNVNKKLSEMKSDNENKDKDAQLRRQVELKEV 81
Query: 186 LAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQN--NYDKRGRYKRKDQF 243
+AKQ ELG + E+P YL + Q N + F N N +KR RY+R
Sbjct: 82 IAKQQELGFDPPELPPGYLSEIGDQCIENKNNRKTQFRDSHFGNRFNNNKRSRYERGGFQ 141
Query: 244 SKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEK 302
SK K+ + ++ ++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++
Sbjct: 142 SKRSKVWNCTPHADHGMVKSREPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDR 201
Query: 303 PVNFPLV 309
P+ FP V
Sbjct: 202 PLQFPSV 208
>gi|20161477|dbj|BAB90401.1| P0432B10.22 [Oryza sativa Japonica Group]
Length = 541
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 121/208 (58%), Gaps = 24/208 (11%)
Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINK----------------------KLTEKQ 165
RK + Y EVKQW E R+KNYPT NINK KL+E +
Sbjct: 267 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKASATLCPHQTIREASSTDHLMQKLSESR 326
Query: 166 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 225
+ + D+EA+MRR++LKE+L KQ ELG E+ E+P YL + + Q R N +
Sbjct: 327 PDGEKKDEEAQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDC 386
Query: 226 RFQNNYD-KRGRYKRKDQFSKEQKLADKDSSNTSSFNK-KKPTLLQKLLSADVKRDKSHL 283
RF N D KR RY R D SK K+ ++ + + K ++PTLLQKLLS+DVKRD+ L
Sbjct: 387 RFGNRADNKRSRYDRNDFQSKRPKVRNQTHCDDGAVLKSREPTLLQKLLSSDVKRDRHRL 446
Query: 284 LQTFRFMAINSFFKDWPEKPVNFPLVMV 311
L TF+FM +N+FF D+P+KP+ FP V V
Sbjct: 447 LHTFKFMVLNNFFSDYPDKPLEFPSVKV 474
>gi|326528619|dbj|BAJ97331.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 625
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 128/218 (58%), Gaps = 20/218 (9%)
Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILA 187
RK + Y + EVKQW + R+KNYPT+AN+NKKL + Q + D+EA+MRR++LKE++A
Sbjct: 335 RKRPPIIYDKNEVKQWVQARKKNYPTRANVNKKLCQNQLDEQKKDEEAQMRRQELKEVIA 394
Query: 188 KQAELGVEVAEIPSYYLLDSKKQVR----------------ARVENTMPLNKRGRFQNNY 231
KQ ELG+E+ E+P YL D++ Q R RGR
Sbjct: 395 KQKELGLELPELPPGYLSDNEGQPRDPQKESNWKTRQGGGRFGNRGRGRGQGRGRGSGRD 454
Query: 232 DKRGRY-KRKDQFSKEQKLADKDS---SNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTF 287
+KR RY R+D SK + + +S + ++PTLLQKLL++D+KRD+ LL TF
Sbjct: 455 NKRQRYDSREDSPSKRPREWNNNSRCHDGGAVAKIREPTLLQKLLNSDIKRDRHRLLHTF 514
Query: 288 RFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKS 325
+FMA+N+FFKDWP KP+ FP V + ++ EE S
Sbjct: 515 KFMALNNFFKDWPAKPLEFPSVKLNQIELESDIAEEGS 552
>gi|356562267|ref|XP_003549393.1| PREDICTED: uncharacterized protein LOC100789443 [Glycine max]
Length = 684
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/250 (41%), Positives = 153/250 (61%), Gaps = 21/250 (8%)
Query: 159 KKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQ-VRARVEN 217
+K +E +S I++E R +LKE+LAKQAELGVEVAEIPSYYL +S Q +++ +N
Sbjct: 451 QKHSECPKDSKAINREVLQR--ELKEVLAKQAELGVEVAEIPSYYLKNSDNQGLQSEAKN 508
Query: 218 TMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVK 277
+KR +FQN ++K+ RK +F+++QK ADKD S + S K+KPTLLQKLLS+DVK
Sbjct: 509 KY-TDKR-KFQNKFNKKS--DRKGRFAEKQKFADKDFSESPSLKKRKPTLLQKLLSSDVK 564
Query: 278 RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKG 337
RDKSHLLQ RFM +NSFF+ P+K + +P V VK+ G+ EV EK L +D +G
Sbjct: 565 RDKSHLLQVLRFMVMNSFFRHIPDKALRYPSVEVKEKGS--EVSGEKKHLHTGKDVLNRG 622
Query: 338 R----RMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVR 393
+ + F +D + +D+ +D DN+ + + + + + +
Sbjct: 623 SEETVQKIVIFNNDNGHD--------CDDDENDSIVDNNLHKDPSSLVKRQCDGGEGIKK 674
Query: 394 LEEEEGEIID 403
+EEEGEI++
Sbjct: 675 SDEEEGEILE 684
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 55 ISKSNSKKVSGKNFIKNRTKE--FSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAI 112
+ SN K KN +KN+ F GFQKS FH ++NGKR GF + KG +G+A
Sbjct: 282 VPNSNWKGSPSKN-LKNKPNRGRFQGGFQKSKFHDVNNGKRGSGFPKEHNGKGPNSGRAG 340
Query: 113 KFGL 116
+GL
Sbjct: 341 HYGL 344
>gi|222619440|gb|EEE55572.1| hypothetical protein OsJ_03849 [Oryza sativa Japonica Group]
Length = 484
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 66/120 (55%), Gaps = 13/120 (10%)
Query: 128 RKSLALTYTEQEVKQWCEERRKNYPTKANINK-------------KLTEKQSNSDVIDKE 174
RK + Y EVKQW E R+KNYPT NINK KL+E + + + D+E
Sbjct: 281 RKRPRIIYDANEVKQWLEARKKNYPTSVNINKTIREASSTDHLMQKLSESRPDGEKKDEE 340
Query: 175 AKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKR 234
A+MRR++LKE+L KQ ELG E+ E+P YL + + Q R N + RF N D +
Sbjct: 341 AQMRRQELKEVLEKQKELGFELPELPPGYLSEHEDQGNGRRSNWKTQRRDCRFGNRADNK 400
>gi|168019512|ref|XP_001762288.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686366|gb|EDQ72755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 614
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 65/90 (72%), Gaps = 6/90 (6%)
Query: 131 LALTYTEQ---EVKQWCEERRKNYPTKANINKK---LTEKQSNSDVIDKEAKMRREQLKE 184
+AL +E+ E++ W EER +NYPT++NI +K L K + ++ID++ + R++++KE
Sbjct: 278 VALKASEEDDDEIRAWREERMRNYPTRSNIQRKKEALEGKVARGELIDEDTRKRQQRMKE 337
Query: 185 ILAKQAELGVEVAEIPSYYLLDSKKQVRAR 214
ILAKQAELGV VAE+P++YL S K+ R R
Sbjct: 338 ILAKQAELGVPVAEVPAHYLSQSYKEDRGR 367
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 18/129 (13%)
Query: 220 PLNKRGRFQNNYDKRGRYKR--KDQFSKEQKLADKDSSNTSSFN---------KKKPTLL 268
P +K G+ Q + +RGR K+ K F+ K N + K+ PTLL
Sbjct: 461 PSDKPGKTQCYFYQRGRCKKGFKCHFAHTGPRGKKGGDNEAGRGGPKGVGVEKKRPPTLL 520
Query: 269 QKLLSADVKRDKSHLLQTFRFMAINSFFKDWP-------EKPVNFPLVMVKDGGAAGEVV 321
KLL ++ RDKS++LQ R+ N+F +WP E P + GEVV
Sbjct: 521 SKLLQGEINRDKSYILQCLRYFVNNNFLLNWPNQLSESIELPEQSDDAHESSDYSEGEVV 580
Query: 322 EEKSLLLGE 330
E +LL +
Sbjct: 581 EAPDVLLAD 589
>gi|219363611|ref|NP_001136910.1| uncharacterized protein LOC100217067 [Zea mays]
gi|194697570|gb|ACF82869.1| unknown [Zea mays]
Length = 138
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 40/47 (85%)
Query: 263 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV 309
++PTLLQKLLS+D+KRD+ LL FRFM +N+FFKDWP++P+ FP V
Sbjct: 5 REPTLLQKLLSSDIKRDRHRLLHVFRFMTLNNFFKDWPDRPLQFPSV 51
>gi|255546771|ref|XP_002514444.1| conserved hypothetical protein [Ricinus communis]
gi|223546440|gb|EEF47940.1| conserved hypothetical protein [Ricinus communis]
Length = 123
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 62/129 (48%), Gaps = 39/129 (30%)
Query: 181 QLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRK 240
+LKE LAKQAELGVEVAEIPS YL DS+KQV+
Sbjct: 28 RLKETLAKQAELGVEVAEIPSRYLSDSEKQVK---------------------------- 59
Query: 241 DQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRD-KSHLLQTFRFMAINSFF--- 296
L +KDS+N SSF+ KPTL QKLLSA+ K S LQ FF
Sbjct: 60 ------DTLTNKDSTNGSSFSNTKPTLRQKLLSAEGKGAFLSECLQASSTFTCEQFFSSN 113
Query: 297 KDWP-EKPV 304
K W KPV
Sbjct: 114 KLWNFTKPV 122
>gi|384254158|gb|EIE27632.1| hypothetical protein COCSUDRAFT_55624 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 33/50 (66%)
Query: 258 SSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFP 307
S +++PTLLQKLL+ D++ + S LLQ RF+ N+F D+ P+ FP
Sbjct: 313 SPVQQRQPTLLQKLLAPDIRAEHSRLLQCLRFLVTNNFLLDFGRAPLIFP 362
>gi|328766403|gb|EGF76457.1| hypothetical protein BATDEDRAFT_28378 [Batrachochytrium
dendrobatidis JAM81]
Length = 411
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 25/185 (13%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKL---TEKQSNSDVID---KEAKMRREQLKEILAK- 188
T ++++QW +ER++ +PT++NI +K+ ++Q ++ID K+ K R ++++ K
Sbjct: 217 TPKDIEQWIQERKRKFPTESNIKQKMEDEKQRQKQGNIIDVNGKKLKGRNADKRQLIRKN 276
Query: 189 ---QAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGRYKRKDQF 243
Q + G + + YL D+ +T+ L+ G+ + R+ +
Sbjct: 277 DTVQEKHGTSEQKDSTIYLDDASNAKLEASHDTLALDMTGKLNQHKGASSCCRHGSNCRH 336
Query: 244 SKE-QKLADKDSSNTSSFNKKKPT-----------LLQKLLSADVKRDKSHLLQTFRFMA 291
S + KLA+ S+ + PT LL LL D K+DK+ LLQ RF+
Sbjct: 337 SHDLSKLAESRSA-PKTHTPLNPTAAPIKGTNRRPLLNMLLETDFKKDKNALLQCIRFII 395
Query: 292 INSFF 296
N FF
Sbjct: 396 ENKFF 400
>gi|308800704|ref|XP_003075133.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
(ISS) [Ostreococcus tauri]
gi|116061687|emb|CAL52405.1| Polyadenylation factor I complex, subunit, Yth1 (CPSF subunit)
(ISS) [Ostreococcus tauri]
Length = 401
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 38/64 (59%), Gaps = 2/64 (3%)
Query: 234 RGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAIN 293
RGR K + + DK +S SS N + TLL+KLL ++KRD+ LLQ FRF+ N
Sbjct: 261 RGRCKAGKRCPFRHDIVDKKTSGVSSGNPQ--TLLKKLLHKEIKRDEERLLQLFRFLVNN 318
Query: 294 SFFK 297
FF+
Sbjct: 319 DFFE 322
>gi|145343958|ref|XP_001416510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576735|gb|ABO94803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 291
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 100/223 (44%), Gaps = 44/223 (19%)
Query: 138 QEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELG---- 193
++V+++ EER K +P+ N+ K + + RRE+L+EILAKQ E+G
Sbjct: 28 EDVRRYREERAKYWPSDRNVRAKTEAGEEEE----RARAARRERLREILAKQREMGHFEA 83
Query: 194 -VEVAE----IPSYYLLDSKKQVRARVENTMPLNKRGRF--------------------- 227
E+ E + + LD K V AR + +K RF
Sbjct: 84 SQEIGEEEGGVAAPAALDGKG-VEARAADG---SKVCRFWLQGGCRKGSACDFKHESAPN 139
Query: 228 ---QNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 284
+ + RGR K + + ++ ++ SS + TLL+KLL ++KRD+ LL
Sbjct: 140 KDQKCRFFARGRCKAGARCPFKHEVTERKSSAADGGGNPQ-TLLKKLLDKEIKRDEQRLL 198
Query: 285 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLL 327
Q FRF N FF K V+ PL M + +V++ ++ L
Sbjct: 199 QLFRFFVNNDFFTG--TKGVDEPLWMFPWADSGKKVIDRRAAL 239
>gi|118358218|ref|XP_001012358.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila]
gi|89294124|gb|EAR92112.1| hypothetical protein TTHERM_00107150 [Tetrahymena thermophila
SB210]
Length = 762
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 41/63 (65%)
Query: 239 RKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 298
++D++S+E L+ SN+S+F K TLL L+ ++ R++++LLQ R++ N+FF +
Sbjct: 574 KRDKYSQELDLSFNYKSNSSNFRYKANTLLTNLVLDEIYRERNYLLQAIRYIVSNNFFDE 633
Query: 299 WPE 301
E
Sbjct: 634 QAE 636
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 63/123 (51%), Gaps = 21/123 (17%)
Query: 139 EVKQWCEERRKNYPTKANINKKLTEK---QSNSDVIDKE-AKMRREQLKEILAKQAELGV 194
E+++W E+R+KNYPT NI KK TE+ Q N V +E +K+ ++ K+I+ + L
Sbjct: 289 EIRKWIEQRKKNYPTSENIKKKQTEEYRNQQNGQVNREELSKLEQKLRKKIMISNSSLKK 348
Query: 195 EVAEIPSYYLL----------DSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFS 244
+ Y +L K++R +E NK+ + QN D R+K ++ F
Sbjct: 349 LEIQQKHYEMLRKLYTEGDKAPKTKKMREEIE-----NKKDKRQNRKD--DRHKNQENFK 401
Query: 245 KEQ 247
K++
Sbjct: 402 KQE 404
>gi|392567649|gb|EIW60824.1| hypothetical protein TRAVEDRAFT_71098 [Trametes versicolor
FP-101664 SS1]
Length = 590
Score = 42.0 bits (97), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 15/121 (12%)
Query: 137 EQEVKQWCEERRKNYPTKANINKKL----TEKQSNSDVIDKEAKMRREQLKEILAKQAEL 192
EQ ++++ EE++ + + N + + TE +S D+ A R E+ +I+AK EL
Sbjct: 211 EQALQRFIEEQKTSALQRFNFHMAVLIWETEARSAKATGDEAA--RSERHADIIAKLVEL 268
Query: 193 GVEVAEIPSY-----YLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQ 247
G + +EIP+Y LLD +++ R+ NT+ R + ++R R + +D F K+
Sbjct: 269 GYQSSEIPAYDHAFQRLLDQPRKLTPRIWNTI----RPKLVEMLEERRRAREQDAFKKKW 324
Query: 248 K 248
+
Sbjct: 325 R 325
>gi|126337727|ref|XP_001369929.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Monodelphis domestica]
Length = 469
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)
Query: 194 VEVAEIPSYYLLDSKKQVR-ARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADK 252
V ++++P D KKQ R + V T L K +N KRG Y++K
Sbjct: 358 VILSDVPQNQSEDIKKQERNSPVAKTKSLRKNWMRKNCRYKRGLYRKKP----------- 406
Query: 253 DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
S K++P LL+ LL+ D++ +++ +LQ R++ N FF
Sbjct: 407 ----VSEPQKRRPHLLEMLLAQDIRHERNVILQCVRYIIQNDFF 446
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
T +E+++W EERRKN+PT+ANI KK
Sbjct: 178 TPEEIEKWREERRKNFPTRANIEKK 202
>gi|68073975|ref|XP_678902.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56499510|emb|CAH94055.1| conserved hypothetical protein [Plasmodium berghei]
Length = 611
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 90/182 (49%), Gaps = 18/182 (9%)
Query: 124 QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 183
EK K+ + +++E+++W EER+KNYPTK KK+ + ++N++ I+ E K ++ + K
Sbjct: 426 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMDIKNNNN-INSETKEKKNKEK 480
Query: 184 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNN--YDKRGR 236
+I + +L VE + Y +K + + N + N N Y +
Sbjct: 481 KITSLIEKLLVEKYSSAIGRNIYYQTQSQRKSIFVPILNKLIENNHKNVYENTYYSISDK 540
Query: 237 YKRKD-QFSKEQKLAD-KDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 294
Y +K+ +F+K K + DS N K L+ +L+ D+ + L+Q F+ N+
Sbjct: 541 YNKKNIKFNKRVKKHNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEFIVNNN 596
Query: 295 FF 296
FF
Sbjct: 597 FF 598
>gi|303286377|ref|XP_003062478.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455995|gb|EEH53297.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 552
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 26/35 (74%)
Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKD 298
+PTLL+KL + +V+ +S LLQ FRF+ N+F +D
Sbjct: 460 QPTLLKKLFAKEVRVSRSRLLQVFRFLVNNNFLRD 494
>gi|307107626|gb|EFN55868.1| hypothetical protein CHLNCDRAFT_52119 [Chlorella variabilis]
Length = 684
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 38/61 (62%), Gaps = 2/61 (3%)
Query: 139 EVKQWCEERRKNYPTKANINKKLTEKQS--NSDVIDKEAKMRREQLKEILAKQAELGVEV 196
EV+QW +ER++++PT + +K E ++ +D +R+++L ILA+Q+ +G+
Sbjct: 388 EVEQWIKERKRSFPTSEAVARKEAEAEARRQRGELDAARTLRQQRLAAILAQQSAMGLSR 447
Query: 197 A 197
A
Sbjct: 448 A 448
>gi|255088323|ref|XP_002506084.1| predicted protein [Micromonas sp. RCC299]
gi|226521355|gb|ACO67342.1| predicted protein [Micromonas sp. RCC299]
Length = 567
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 266 TLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
TLL+KLL+ +V+ D+S LLQ FRF+ N F +
Sbjct: 458 TLLRKLLAREVRADRSRLLQVFRFLVNNDFLR 489
>gi|17540726|ref|NP_499991.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
gi|351063723|emb|CCD71947.1| Protein F56B3.11, isoform a [Caenorhabditis elegans]
Length = 213
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 283 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 341
L+ F ++ FF +D P+ N L +V+D G E+ GEE S G+ RR V
Sbjct: 93 LIGAFIYVLAGEFFAQDSPQTIFNKALALVRDDGRCQEIFGASIAGFGEETSRGR-RRHV 151
Query: 342 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 395
H +++++ R R+ +++G DE E E D Q + +Y+ K HV ++
Sbjct: 152 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 211
>gi|313219540|emb|CBY30463.1| unnamed protein product [Oikopleura dioica]
gi|313226347|emb|CBY21491.1| unnamed protein product [Oikopleura dioica]
Length = 310
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 35/49 (71%), Gaps = 1/49 (2%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKE 184
TE+++++W EERRKN+PTKA I +K +++ + + K+ +REQ +E
Sbjct: 96 TEEDIRKWREERRKNFPTKARIAEKEAARKAGTSIETKKFS-KREQERE 143
>gi|345325138|ref|XP_001514551.2| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Ornithorhynchus anatinus]
Length = 446
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 12/51 (23%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEIL 186
T +E+ +W EERRKN+PT ANI KK K ++ REQ E+L
Sbjct: 125 TPEEIAKWREERRKNFPTMANIEKK------------KALQLDREQRGEVL 163
>gi|82594577|ref|XP_725484.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480507|gb|EAA17049.1| Zinc finger, C2H2 type, putative [Plasmodium yoelii yoelii]
Length = 601
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 89/187 (47%), Gaps = 29/187 (15%)
Query: 124 QEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLK 183
EK K+ + +++E+++W EER+KNYPTK KK+ E ++N+ I+ E K ++ K
Sbjct: 419 HEKNEKNENILESKEEIQKWIEERKKNYPTK----KKIMEMKNNN--INSEEKEKKNXQK 472
Query: 184 EILAKQAELGVE-----VAEIPSYYLLDSKKQVRARVENTMPLNK-RGRFQNNYDKRGRY 237
+I + +L VE + Y KK + + N + N + ++N Y Y
Sbjct: 473 KITSLIEKLLVEKYSSAIGRNIYYQTQSQKKSIFVPLLNKLIENNYKNVYENTY-----Y 527
Query: 238 KRKDQFSKEQKLADK--------DSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRF 289
D+++++ +K DS N K L+ +L+ D+ + L+Q F
Sbjct: 528 SISDKYNQKNIKFNKKIKKNNLIDSLNVH----KNAPLIYQLMKNDIYVYEQKLMQCIEF 583
Query: 290 MAINSFF 296
+ N+FF
Sbjct: 584 IVNNNFF 590
>gi|341888654|gb|EGT44589.1| hypothetical protein CAEBREN_21252 [Caenorhabditis brenneri]
Length = 219
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 8/120 (6%)
Query: 283 LLQTFRFMAINSFF-KDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMV 341
L+ F ++ FF +D P+ N L +V+D G ++ GEE S G+ RR V
Sbjct: 99 LIGAFIYVLCGEFFAQDSPQTIFNKALAIVRDDGRCQDIFGATIAGFGEETSRGR-RRHV 157
Query: 342 EHFEHDEN---RERMHHNIQGFEDEN--DDEEEDNDNGEQMK-VYIAGKGNTNDEHVRLE 395
H +++++ R R+ +++G DE E E D Q + +Y+ K HV ++
Sbjct: 158 AHHKYEKDGMQRIRVLFHVKGDRDEGIAQAEMEQRDGDWQWRFLYVENKRRPKTTHVLID 217
>gi|47218661|emb|CAG04990.1| unnamed protein product [Tetraodon nigroviridis]
Length = 350
Score = 39.3 bits (90), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 263 KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
++PTLL+ LL+ D++ +++ LLQ RF+ N FF
Sbjct: 306 RRPTLLEMLLAPDIRHERNVLLQCVRFVVRNHFF 339
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 17/25 (68%), Positives = 19/25 (76%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
T +EV +W EERRKNYPT NI KK
Sbjct: 69 TPEEVAKWREERRKNYPTLQNIEKK 93
>gi|291235450|ref|XP_002737657.1| PREDICTED: nuclear fragile X mental retardation protein interacting
protein-like [Saccoglossus kowalevskii]
Length = 639
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 26/35 (74%)
Query: 262 KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
KKKPTLL+ LL+ D++ +++ LLQ R++ +FF
Sbjct: 458 KKKPTLLEMLLAGDIRHERNVLLQCVRYVKKKNFF 492
>gi|67474753|ref|XP_653120.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56470045|gb|EAL47734.1| hypothetical protein EHI_019110 [Entamoeba histolytica HM-1:IMSS]
gi|449703311|gb|EMD43787.1| Hypothetical protein EHI5A_022200 [Entamoeba histolytica KU27]
Length = 129
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 29/38 (76%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 173
TE+++++W EER+K +PTKA K L+E++ +D I+K
Sbjct: 2 TEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39
>gi|158297378|ref|XP_001237955.2| AGAP007875-PA [Anopheles gambiae str. PEST]
gi|157015166|gb|EAU76388.2| AGAP007875-PA [Anopheles gambiae str. PEST]
Length = 474
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 266 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
T+L+KLL AD++ +++ LLQ R++ +N FF
Sbjct: 416 TMLEKLLDADIRHERNVLLQCVRYVLVNKFF 446
>gi|407040593|gb|EKE40213.1| hypothetical protein ENU1_097960 [Entamoeba nuttalli P19]
Length = 129
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 29/39 (74%)
Query: 135 YTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 173
TE+++++W EER+K +PTKA K L+E++ +D I+K
Sbjct: 1 MTEEQLQKWIEERKKRFPTKAKREKCLSEEKKKTDEIEK 39
>gi|327283872|ref|XP_003226664.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Anolis carolinensis]
Length = 451
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 8/50 (16%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEI 185
T +E+ +W EERRKN+PT ANI KK + KE + R E LK +
Sbjct: 92 TPEEIAKWREERRKNFPTLANIEKK--------KALQKEKEQRGEVLKTL 133
>gi|395527571|ref|XP_003765917.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1 [Sarcophilus harrisii]
Length = 480
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
T +E+++W EERRKN+PT+ANI KK
Sbjct: 187 TPEEIEKWREERRKNFPTRANIEKK 211
>gi|156353142|ref|XP_001622934.1| predicted protein [Nematostella vectensis]
gi|156209570|gb|EDO30834.1| predicted protein [Nematostella vectensis]
Length = 401
Score = 38.5 bits (88), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 250 ADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
A++ NT S + PTLL+ LL+ +++ +++ +LQ R++ NSFF
Sbjct: 355 ANRTKQNTKS--NRNPTLLEMLLAPEIRHERNAILQCLRYIVRNSFF 399
>gi|344301187|gb|EGW31499.1| hypothetical protein SPAPADRAFT_154739 [Spathaspora passalidarum
NRRL Y-27907]
Length = 212
Score = 38.5 bits (88), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 35/55 (63%), Gaps = 2/55 (3%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQA 190
TE+++ +W EER+K +P+K + KKL +KQ S++ KE + R + KE K+
Sbjct: 59 TEEDIAKWIEERKKKWPSKQVVAKKLEQKQ--SEIQQKEDRKRPAESKEAPGKRT 111
>gi|7305335|ref|NP_038773.1| nuclear fragile X mental retardation-interacting protein 1 [Mus
musculus]
gi|81917719|sp|Q9QXX8.1|NUFP1_MOUSE RecName: Full=Nuclear fragile X mental retardation-interacting
protein 1; AltName: Full=Nuclear FMRP-interacting
protein 1
gi|6525073|gb|AAF15316.1| nuclear FMRP interacting protein 1 [Mus musculus]
gi|26328149|dbj|BAC27815.1| unnamed protein product [Mus musculus]
gi|33525186|gb|AAH56192.1| Nuclear fragile X mental retardation protein interacting protein 1
[Mus musculus]
gi|74220737|dbj|BAE33625.1| unnamed protein product [Mus musculus]
gi|148703863|gb|EDL35810.1| nuclear fragile X mental retardation protein interacting protein 1
[Mus musculus]
Length = 484
Score = 38.5 bits (88), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
T +E+ +W EERRKNYPT ANI +K
Sbjct: 223 TPEEIARWREERRKNYPTLANIERK 247
>gi|270007929|gb|EFA04377.1| hypothetical protein TcasGA2_TC014675 [Tribolium castaneum]
Length = 327
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 3/63 (4%)
Query: 238 KRKDQFSKEQKLADKDSSNTSS---FNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINS 294
+R+ K+QKL +K S F K+K TLL+KLL ++++ +++ LLQ + + N+
Sbjct: 265 RRRLSSGKKQKLNNKPEPGGFSRMKFVKRKVTLLEKLLDSEIRHERNVLLQCVQHVVTNN 324
Query: 295 FFK 297
FF+
Sbjct: 325 FFR 327
>gi|350646629|emb|CCD58749.1| hypothetical protein Smp_099560.1 [Schistosoma mansoni]
Length = 336
Score = 38.1 bits (87), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 303 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 336
>gi|256053307|ref|XP_002570139.1| hypothetical protein [Schistosoma mansoni]
Length = 371
Score = 38.1 bits (87), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 338 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 371
>gi|256053309|ref|XP_002570140.1| hypothetical protein [Schistosoma mansoni]
Length = 356
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 323 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 356
>gi|350646628|emb|CCD58748.1| hypothetical protein Smp_099560.2 [Schistosoma mansoni]
Length = 351
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 264 KPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFK 297
+PTLLQ LL+ D++++++ L+Q R++ +FF+
Sbjct: 318 RPTLLQMLLAEDIRKERNQLMQCIRYIVKENFFQ 351
>gi|448516074|ref|XP_003867484.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis Co 90-125]
gi|380351823|emb|CCG22046.1| hypothetical protein CORT_0B03380 [Candida orthopsilosis]
Length = 243
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 69 TEEDIAKWIEERKKNWPTRANVERK 93
>gi|167387090|ref|XP_001738022.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165898944|gb|EDR25671.1| hypothetical protein EDI_290620 [Entamoeba dispar SAW760]
Length = 137
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 29/40 (72%)
Query: 134 TYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDK 173
+ TE+++++W EER+K +PTKA K L E++ +D I+K
Sbjct: 8 SMTEEQLQKWIEERKKRFPTKAKREKCLFEEKKKTDEIEK 47
>gi|195995467|ref|XP_002107602.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
gi|190588378|gb|EDV28400.1| hypothetical protein TRIADDRAFT_51288 [Trichoplax adhaerens]
Length = 444
Score = 38.1 bits (87), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTE 163
+ +++K W E+RRKNYPT AN+ KK E
Sbjct: 205 SPEDIKAWREQRRKNYPTAANVEKKKQE 232
>gi|354543450|emb|CCE40169.1| hypothetical protein CPAR2_102070 [Candida parapsilosis]
Length = 253
Score = 38.1 bits (87), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
TE+++ +W EER+KN+PT+AN+ +K
Sbjct: 77 TEEDIAKWIEERKKNWPTRANVERK 101
>gi|448079221|ref|XP_004194343.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
gi|359375765|emb|CCE86347.1| Piso0_004831 [Millerozyma farinosa CBS 7064]
Length = 234
Score = 38.1 bits (87), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 14/42 (33%), Positives = 29/42 (69%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKM 177
TE+++K+W EERR+N+P++ N+ K ++ + + D++ K
Sbjct: 72 TEEDIKKWIEERRRNWPSRKNVEAKKEKESAGAPANDEQIKT 113
>gi|12843948|dbj|BAB26177.1| unnamed protein product [Mus musculus]
Length = 249
Score = 38.1 bits (87), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
T +E+ +W EERRKNYPT ANI +K
Sbjct: 223 TPEEIARWREERRKNYPTLANIERK 247
>gi|348545180|ref|XP_003460058.1| PREDICTED: nuclear fragile X mental retardation-interacting protein
1-like [Oreochromis niloticus]
Length = 544
Score = 37.7 bits (86), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKK 160
T +E+ +W EERRKNYPT NI KK
Sbjct: 152 TPEEITKWREERRKNYPTLVNIEKK 176
>gi|295665248|ref|XP_002793175.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278089|gb|EEH33655.1| CCCH zinc finger protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 764
Score = 37.7 bits (86), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 136 TEQEVKQWCEERRKNYPTKANINKKL--TEKQSNSDVIDKEAKMRRE 180
+ ++ W EER+K YPTKA I ++L EKQ + K+AK RE
Sbjct: 353 SPSDIAAWIEERKKRYPTKARIEERLKEAEKQKKASREAKDAKRARE 399
>gi|312375641|gb|EFR22971.1| hypothetical protein AND_13906 [Anopheles darlingi]
Length = 575
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 266 TLLQKLLSADVKRDKSHLLQTFRFMAINSFF 296
T+L+KLL AD++ +++ LLQ R++ N+FF
Sbjct: 459 TMLEKLLDADIRHERNVLLQCVRYVVSNNFF 489
>gi|260817126|ref|XP_002603438.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
gi|229288757|gb|EEN59449.1| hypothetical protein BRAFLDRAFT_80413 [Branchiostoma floridae]
Length = 408
Score = 37.7 bits (86), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 129 KSLALTYTEQEVKQWCEERRKNYPTKANINKK 160
K+L+ T + + +W E+RRKNYPT AN+ KK
Sbjct: 58 KALSKLDTPEAIAKWREDRRKNYPTVANVAKK 89
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.132 0.369
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,407,488,096
Number of Sequences: 23463169
Number of extensions: 292132639
Number of successful extensions: 1295094
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 171
Number of HSP's successfully gapped in prelim test: 1716
Number of HSP's that attempted gapping in prelim test: 1271493
Number of HSP's gapped (non-prelim): 16023
length of query: 403
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 258
effective length of database: 8,957,035,862
effective search space: 2310915252396
effective search space used: 2310915252396
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 78 (34.7 bits)