BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005351
(701 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225426838|ref|XP_002276704.1| PREDICTED: uncharacterized protein LOC100266763 [Vitis vinifera]
Length = 747
Score = 1133 bits (2930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 544/714 (76%), Positives = 616/714 (86%), Gaps = 16/714 (2%)
Query: 1 MRNQTKSLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVL 60
M++Q + + LI+ I ++ V GVQILSKSKLEKCEK ++SDNLNCT KI+L
Sbjct: 1 MKDQKPRTRRRPLALIITIIFLSINGGSVYGVQILSKSKLEKCEKVSESDNLNCTKKIIL 60
Query: 61 NMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPY 120
+MAVPSGSSGGEASIVAEVVEVEENST KM+T+R+PP +TVNK+++YAVYE+TYIRDVPY
Sbjct: 61 DMAVPSGSSGGEASIVAEVVEVEENSTHKMQTLRVPPTITVNKSSAYAVYEITYIRDVPY 120
Query: 121 KPQEFYMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFD 168
KPQE+++KTRKCEPDA A VVKICER QPICCPCG RR+PSSCGN FD
Sbjct: 121 KPQEYFVKTRKCEPDASAKVVKICERLQDENGHIIEHTQPICCPCGTHRRVPSSCGNFFD 180
Query: 169 KLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSAD 228
KL+KGKANTAHCLRFPGDWFHVFGIGQRS+GFSV IEVKTGSK+SEV VGPEN+T S D
Sbjct: 181 KLMKGKANTAHCLRFPGDWFHVFGIGQRSLGFSVHIEVKTGSKISEVIVGPENRTVMSND 240
Query: 229 NFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLE 288
NFLKVNLIGDF GYTNIPSFE+FYLV PRQGGPGQPQ+LG NFSMWMLLER RFTLDGLE
Sbjct: 241 NFLKVNLIGDFAGYTNIPSFEDFYLVTPRQGGPGQPQNLGVNFSMWMLLERVRFTLDGLE 300
Query: 289 CNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQ 348
CNKIGVSYEAFNGQP+FCSSPFW+CLHNQLWN+READQNRI+R+QLPLYGVEGRFER+NQ
Sbjct: 301 CNKIGVSYEAFNGQPNFCSSPFWNCLHNQLWNFREADQNRIDRHQLPLYGVEGRFERINQ 360
Query: 349 HPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATI 408
HPNAG+ SFSIG+TEVLN+NLLIEL ADDIEYVYQRSPGKI+SV IPTFEALTQFG ATI
Sbjct: 361 HPNAGTRSFSIGITEVLNTNLLIELSADDIEYVYQRSPGKILSVTIPTFEALTQFGTATI 420
Query: 409 TTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAIL 468
TT+N G+VEASYSLTFDCS GVTLMEEQ+FI+KP E IRSFK+YPTT+QAAKY CSAIL
Sbjct: 421 TTKNVGKVEASYSLTFDCSRGVTLMEEQFFIMKPNENIIRSFKLYPTTDQAAKYVCSAIL 480
Query: 469 KDSDFSEVDRAECQFSTMATVLDNGSQI---TPFQPPKSSINDFFESIESIGKKLWEGLR 525
KDSD+SEVDRAECQF+T ATV DNGSQ+ TPFQPPK+SIN FFESIESI K W+G
Sbjct: 481 KDSDYSEVDRAECQFTTTATVFDNGSQLLQTTPFQPPKTSINGFFESIESIWNKFWDGFV 540
Query: 526 DFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLY 585
DFITGK CRRKCS FFDFSCHIQYIC+SW+V+FGL+LAIFPTVLVLLWLLHQKGLFDPLY
Sbjct: 541 DFITGKTCRRKCSRFFDFSCHIQYICMSWMVMFGLLLAIFPTVLVLLWLLHQKGLFDPLY 600
Query: 586 DWWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHK 645
DWW+D F +DNQ I D R RIDVD+PH+H+ KHHKQE RH++ +A+ +R IH +HK
Sbjct: 601 DWWEDRFWADNQSIGDTRRHRIDVDNPHIHL-KHHKQEARHYRHDAQSKRRSIHDKRRHK 659
Query: 646 HSDRDTDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
HS +D+DYYYYLHHV K+KHK GRSKNSS+M Q+Y D ++D IG R RK RE
Sbjct: 660 HSLQDSDYYYYLHHVHKNKHKQGRSKNSSIMHQVYSDRREDDGIGQRRCRKERE 713
>gi|357481707|ref|XP_003611139.1| HAP2 [Medicago truncatula]
gi|355512474|gb|AES94097.1| HAP2 [Medicago truncatula]
Length = 739
Score = 1053 bits (2724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/710 (70%), Positives = 593/710 (83%), Gaps = 21/710 (2%)
Query: 7 SLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVP 65
S ++ + I F + + L+ V GVQI+SKSKLEKCEK ++SD NLNCTTKIVL+MAVP
Sbjct: 5 SPRITLIIFIFFTVSSFLTCH-VTGVQIISKSKLEKCEKNSNSDDNLNCTTKIVLSMAVP 63
Query: 66 SGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEF 125
SGSSGGEASIVAE+VEVEENST KM+T+R+PPV+TVNKT++YAVYELTYIRDVPYKP+EF
Sbjct: 64 SGSSGGEASIVAELVEVEENSTTKMQTLRVPPVITVNKTSAYAVYELTYIRDVPYKPEEF 123
Query: 126 YMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKG 173
Y++TRKCEPDAGA+VVKICER QP CCPCGPQRR+PSSCGN FDKL KG
Sbjct: 124 YVQTRKCEPDAGANVVKICERLRDEDGHIIENTQPTCCPCGPQRRMPSSCGNFFDKLTKG 183
Query: 174 KANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKV 233
KANTAHC+RFPGDWFHVFGIG+R++GFSVRI++K+G+KVSEV VGPEN+T TS D FL+V
Sbjct: 184 KANTAHCVRFPGDWFHVFGIGRRTLGFSVRIQIKSGTKVSEVVVGPENRTVTSDDKFLRV 243
Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
NLIGDFVGYTNIPSFE+FYLV+PRQG PGQP DLG N SMWMLLER RFTLDG+ECNKIG
Sbjct: 244 NLIGDFVGYTNIPSFEDFYLVVPRQGDPGQPHDLGRNISMWMLLERVRFTLDGIECNKIG 303
Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAG 353
VSYEAFNGQP+FC+SPFWSCLHNQLWN+ EAD NRI+RNQ+PLYG+EGRFER+NQHPNAG
Sbjct: 304 VSYEAFNGQPNFCASPFWSCLHNQLWNFHEADLNRISRNQVPLYGLEGRFERINQHPNAG 363
Query: 354 SHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNT 413
S SFSIG+TEVLN+N++IEL A+D++YVYQRSPGKIISV +PTFEALTQFGVATITT+NT
Sbjct: 364 SFSFSIGITEVLNTNIVIELSANDVDYVYQRSPGKIISVSVPTFEALTQFGVATITTKNT 423
Query: 414 GEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDF 473
GEVEASYSLTFDCS +TLMEEQ+ I+KP E + RSFKIYP+T+QA+KY+C+AILKDSD+
Sbjct: 424 GEVEASYSLTFDCSKEITLMEEQFLIMKPNEITTRSFKIYPSTDQASKYSCAAILKDSDY 483
Query: 474 SEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKAC 533
EVDRAECQF+T TVLDNG+Q PFQPP++ IN FF+SIES+ KLW G +FITGK C
Sbjct: 484 GEVDRAECQFTTTGTVLDNGTQGMPFQPPETGINGFFDSIESMWNKLWTGFIEFITGKNC 543
Query: 534 RRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQ 593
R+KC+ FFDF CHIQY+CLSW+++FGL LAIFPTVLVLLWLLHQKGLFDPLYDWW+D
Sbjct: 544 RQKCAGFFDFKCHIQYVCLSWIMMFGLFLAIFPTVLVLLWLLHQKGLFDPLYDWWEDICG 603
Query: 594 SD-NQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEA-RRRRCGIHSDHKHKHSDRDT 651
+D Q I D +I+ H H+H KH KQE RH A RR+ +HKHKHS+ ++
Sbjct: 604 ADEKQFIMDRHRVKINQTHHHIHDNKHRKQEVRHLNHRAPNRRKTSYEHNHKHKHSEGNS 663
Query: 652 DYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
DY+ +LHHVQK+ HKH K+ +Q + ++H HH+ RK ++ S
Sbjct: 664 DYFNHLHHVQKETHKHRHRKHVDNLQNI-----DDNHPAHHKHRKEQDPS 708
>gi|356532878|ref|XP_003534996.1| PREDICTED: uncharacterized protein LOC100818339 [Glycine max]
Length = 711
Score = 1030 bits (2662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/706 (70%), Positives = 592/706 (83%), Gaps = 27/706 (3%)
Query: 16 ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDS-DNLNCTTKIVLNMAVPSGSSGGEAS 74
I I+ +LS VVG+QI+SKSKLEKCEK ++S DNLNCTTKIVLNMAVPSGSSGGEAS
Sbjct: 7 ITLIIIFILSSFHVVGIQIISKSKLEKCEKNSNSEDNLNCTTKIVLNMAVPSGSSGGEAS 66
Query: 75 IVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEP 134
IVAE+VEVEENS++KM+T+RIPPV+TVNKT++YA+Y+LTYIRDVPYKP+E+Y+KTRKCEP
Sbjct: 67 IVAELVEVEENSSRKMQTLRIPPVITVNKTSAYALYQLTYIRDVPYKPEEYYVKTRKCEP 126
Query: 135 DAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLR 182
DAGA+VVKICER QPICCPCGPQRR+PSSCGN FDKL KGKANTAHC+R
Sbjct: 127 DAGANVVKICERLRDEEGHIIEYTQPICCPCGPQRRMPSSCGNFFDKLTKGKANTAHCVR 186
Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGY 242
FPGDWFHVFGIG+R++GFSVRI+VK+G+KVSEV VGPEN+T S D FL+VNLIGDFVGY
Sbjct: 187 FPGDWFHVFGIGRRTLGFSVRIQVKSGTKVSEVFVGPENRTVISDDKFLRVNLIGDFVGY 246
Query: 243 TNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
TNIPSFE+FYLV+PRQ PGQPQDLG N SMWMLLER RFTLDG+ECNKIGVSYEAFN
Sbjct: 247 TNIPSFEDFYLVVPRQVCFPGQPQDLGRNISMWMLLERVRFTLDGIECNKIGVSYEAFNQ 306
Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGV 361
QP+FCSSPFW+CLHNQLWN+READ NRI+RNQ+PLYG+EGRFER+NQHP+AGS+SFSIG+
Sbjct: 307 QPNFCSSPFWTCLHNQLWNFREADLNRISRNQVPLYGLEGRFERINQHPSAGSYSFSIGI 366
Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
TEVL++NL++EL A+D+EYVYQRSPGKIISV +PTFEALTQFGVATITT+NTGEVEASYS
Sbjct: 367 TEVLSTNLVLELSANDVEYVYQRSPGKIISVSVPTFEALTQFGVATITTKNTGEVEASYS 426
Query: 422 LTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAEC 481
LTF+CS +TLMEEQ+ I+KP E + +S KIYP+T+QA+KY C+ +LKDSD++EVDRAEC
Sbjct: 427 LTFNCSKDITLMEEQFLIMKPNEVTTQSCKIYPSTDQASKYFCAVVLKDSDYNEVDRAEC 486
Query: 482 QFSTMATVLDNGSQI-----TPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
QF+T ATVLDN + + PFQPP++SIN FF+SIESI K+W L +FITGK CR K
Sbjct: 487 QFATTATVLDNDTHVCSFLGMPFQPPEASINSFFDSIESIWNKIWRSLTEFITGKTCREK 546
Query: 537 CSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDN 596
CS FFDF CHIQY+CLSW+++FGL L IFPTVLVLLWLLHQKGLFDPLYDWW+D +D
Sbjct: 547 CSGFFDFKCHIQYVCLSWVMMFGLFLTIFPTVLVLLWLLHQKGLFDPLYDWWEDILGADE 606
Query: 597 QRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYY 656
Q I D R +ID H H+H KHHKQE RH A+ RR + +H HKHS+R++DY+
Sbjct: 607 QIIMDKRRFKIDKGHHHIHDNKHHKQELRHSNYSAQNRRRTTY-EHMHKHSERNSDYFDD 665
Query: 657 LHHVQKDKHKHGRSK-NSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
LHHV K+ HK+G K N ++Q + DH HH+ RK R+SS
Sbjct: 666 LHHVHKEMHKYGHKKQNMDIVQHIV------DHPAHHKHRKKRDSS 705
>gi|356555070|ref|XP_003545862.1| PREDICTED: uncharacterized protein LOC100780334 [Glycine max]
Length = 711
Score = 1026 bits (2653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 497/706 (70%), Positives = 589/706 (83%), Gaps = 27/706 (3%)
Query: 16 ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVPSGSSGGEAS 74
I I+ +LS VVG+QI+SKSKLEKCEK ++SD NLNCTTKIVLNMAVPSGSSGGEAS
Sbjct: 7 ITLIIIFILSSFYVVGIQIISKSKLEKCEKNSNSDDNLNCTTKIVLNMAVPSGSSGGEAS 66
Query: 75 IVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEP 134
IVAE+VEVEENS++KM+T+RIPPV+TVNKT++YA+Y+LTYIRDVPYKP+E+Y+KTRKCEP
Sbjct: 67 IVAELVEVEENSSRKMQTLRIPPVITVNKTSAYALYQLTYIRDVPYKPEEYYVKTRKCEP 126
Query: 135 DAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLR 182
DAGA+VVK CER QPICCPCGPQRR+PSSCGN FDKL KGKANTAHC+R
Sbjct: 127 DAGANVVKTCERLRDEEGHIIEYTQPICCPCGPQRRMPSSCGNFFDKLTKGKANTAHCVR 186
Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGY 242
FPGDWFHVFGIG+R++GFSVRI+VK+G+KVSEV VGPEN+T S D FL+VNLIGDFVGY
Sbjct: 187 FPGDWFHVFGIGRRTLGFSVRIQVKSGTKVSEVVVGPENRTVISDDKFLRVNLIGDFVGY 246
Query: 243 TNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
TNIPSFE+FYLV+PRQ P QPQDLG N SMWMLLER RFTLDG+ECNKIGVSYEAFN
Sbjct: 247 TNIPSFEDFYLVVPRQVCFPAQPQDLGRNISMWMLLERVRFTLDGIECNKIGVSYEAFNQ 306
Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGV 361
QP+FC+SPFW+CLHNQLWN+READ NRI+RNQ+PLYG+EGRFER+NQHP+AGS+SFSIG+
Sbjct: 307 QPNFCASPFWTCLHNQLWNFREADLNRISRNQVPLYGLEGRFERINQHPSAGSYSFSIGI 366
Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
TEVLN+NL++EL A+D+EYVYQRSPGKIISV +PTFEALTQFGVATITT+NTGEVEASYS
Sbjct: 367 TEVLNTNLVLELSANDVEYVYQRSPGKIISVSVPTFEALTQFGVATITTKNTGEVEASYS 426
Query: 422 LTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAEC 481
LTF+CS +TLMEEQ+ I+KP E + +S KIYPTT+QA+KY C+A+LKDSD++EVDRAEC
Sbjct: 427 LTFNCSRDITLMEEQFLIMKPNEVTTQSCKIYPTTDQASKYFCAAVLKDSDYNEVDRAEC 486
Query: 482 QFSTMATVLDNGSQI-----TPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
QF+T ATVLDN +Q+ PFQP ++SIN FF+SIESI K+W L +FITGK CR K
Sbjct: 487 QFATTATVLDNDTQVCSFLGMPFQPQETSINSFFDSIESIWNKIWTSLTEFITGKTCREK 546
Query: 537 CSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDN 596
CS FFDF CHIQY+CLSW+++FGL L IFPTVLV+LWLLHQKGLFDPLYDWW+D +D
Sbjct: 547 CSGFFDFKCHIQYVCLSWVMMFGLFLTIFPTVLVVLWLLHQKGLFDPLYDWWEDILGADE 606
Query: 597 QRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYY 656
Q I D R +ID H H+H KHHKQE RH A RR + +H HKHS+R++DY+
Sbjct: 607 QIIMDKRKFKIDKGHHHIHDNKHHKQEHRHSNYSAENRRRTTY-EHMHKHSERNSDYFDD 665
Query: 657 LHHVQKDKHKHGRSK-NSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
LHHV K+ HK+G K N +Q + DH HH+ RK R+SS
Sbjct: 666 LHHVHKEMHKYGHKKQNMDNVQHIV------DHPVHHKHRKKRDSS 705
>gi|449452486|ref|XP_004143990.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
Length = 667
Score = 1003 bits (2593), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/655 (72%), Positives = 545/655 (83%), Gaps = 20/655 (3%)
Query: 29 VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
+ GVQILSKSKLEKCE+ + SD LNCT KIVLNMAVPSGSSGGEASI+AE+VEVEENST
Sbjct: 20 ISGVQILSKSKLEKCERNSGSDTLNCTKKIVLNMAVPSGSSGGEASIIAEIVEVEENSTN 79
Query: 89 KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
KM+T+R PPVLTV+K+ +Y +YELTYIRDVPYKP+EFY+ TRKCEPDA A VV+ICER
Sbjct: 80 KMQTLRTPPVLTVSKSPAYVLYELTYIRDVPYKPEEFYVPTRKCEPDASARVVQICERLR 139
Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
QPICCPCG +RR+P+SCGN FDK++KGKANTAHCLRFPGDWFHVF IGQ
Sbjct: 140 DESGHIILSTQPICCPCGAKRRMPTSCGNFFDKMIKGKANTAHCLRFPGDWFHVFSIGQW 199
Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
++GFSV+I VK+GSKVSEV+VGPEN+T S DNFL+ NLIGD VGYTNIPSFE+FYLVIP
Sbjct: 200 TLGFSVQIHVKSGSKVSEVSVGPENRTVVSNDNFLRANLIGDLVGYTNIPSFEDFYLVIP 259
Query: 257 RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHN 316
RQGGPGQPQ+LG NFSMWMLLER RFTLDGLECNKIGV YE FNGQP FC+SPFWSCLHN
Sbjct: 260 RQGGPGQPQNLGTNFSMWMLLERVRFTLDGLECNKIGVGYETFNGQPDFCTSPFWSCLHN 319
Query: 317 QLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRAD 376
QLWN+READ +RI R QLPLYGVEGRFER+NQHPNAG+HSFSIGVTEVLN+NL+IELRAD
Sbjct: 320 QLWNFREADLSRIGRKQLPLYGVEGRFERINQHPNAGTHSFSIGVTEVLNTNLVIELRAD 379
Query: 377 DIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQ 436
D+EYVYQRSPGKI+S+ IPTFEALTQFGVAT+ T+NTGEVEASYSLTF CS V+LMEEQ
Sbjct: 380 DVEYVYQRSPGKIMSISIPTFEALTQFGVATVATKNTGEVEASYSLTFTCSKEVSLMEEQ 439
Query: 437 YFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQI 496
Y+I+KP E + RSFK+YPTT+QAAKY C+AILKD+DFSEVDRAECQF+T ATVLDNGSQI
Sbjct: 440 YYIMKPNEIASRSFKLYPTTDQAAKYVCAAILKDADFSEVDRAECQFATTATVLDNGSQI 499
Query: 497 TPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLV 556
TPF+ PK N F SI+ K+ W + DF+TGK+CR+ CS FFDFSCHIQYICLSWLV
Sbjct: 500 TPFELPKKKENGFIHSIKLAWKQFWGSVIDFVTGKSCRKVCSGFFDFSCHIQYICLSWLV 559
Query: 557 LFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHVHV 616
LFGL LA FP VLV+LW+LHQKGLFDPLYDWW+D F ++ R R + H H H
Sbjct: 560 LFGLFLATFPAVLVILWVLHQKGLFDPLYDWWEDMFCHKSEPTRSTWKYRGERKHYHRHG 619
Query: 617 RKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQKDKHKHGRSK 671
+HH+ G +K RR +H KHKHS+RDTD Y+LHHV + K K G ++
Sbjct: 620 SRHHQNHGSGYK----RRSHELHK--KHKHSERDTD--YFLHHVHRKKGKRGHNR 666
>gi|449495900|ref|XP_004159979.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
Length = 833
Score = 1003 bits (2592), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/650 (72%), Positives = 542/650 (83%), Gaps = 20/650 (3%)
Query: 27 RCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENS 86
+ + GVQILSKSKLEKCE+ + SD LNCT KIVLNMAVPSGSSGGEASI+AE+VEVEENS
Sbjct: 18 QTISGVQILSKSKLEKCERNSGSDTLNCTKKIVLNMAVPSGSSGGEASIIAEIVEVEENS 77
Query: 87 TQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
T KM+T+R PPVLTV+K+ +Y +YELTYIRDVPYKP+EFY+ TRKCEPDA A VV+ICER
Sbjct: 78 TNKMQTLRTPPVLTVSKSPAYVLYELTYIRDVPYKPEEFYVPTRKCEPDASARVVQICER 137
Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
QPICCPCG +RR+P+SCGN FDK++KGKANTAHCLRFPGDWFHVF IG
Sbjct: 138 LRDESGHIILSTQPICCPCGAKRRMPTSCGNFFDKMIKGKANTAHCLRFPGDWFHVFSIG 197
Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
Q ++GFSV+I VK+GSKVSEV+VGPEN+T S DNFL+ NLIGD VGYTNIPSFE+FYLV
Sbjct: 198 QWTLGFSVQIHVKSGSKVSEVSVGPENRTVVSNDNFLRANLIGDLVGYTNIPSFEDFYLV 257
Query: 255 IPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
IPRQGGPGQPQ+LG NFSMWMLLER RFTLDGLECNKIGV YE FNGQP FC+SPFWSCL
Sbjct: 258 IPRQGGPGQPQNLGTNFSMWMLLERVRFTLDGLECNKIGVGYETFNGQPDFCTSPFWSCL 317
Query: 315 HNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
HNQLWN+READ +RI R QLPLYGVEGRFER+NQHPNAG+HSFSIGVTEVLN+NL+IELR
Sbjct: 318 HNQLWNFREADLSRIGRKQLPLYGVEGRFERINQHPNAGTHSFSIGVTEVLNTNLVIELR 377
Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLME 434
ADD+EYVYQRSPGKI+S+ IPTFEALTQFGVAT+ T+NTGEVEASYSLTF CS V+LME
Sbjct: 378 ADDVEYVYQRSPGKIMSISIPTFEALTQFGVATVATKNTGEVEASYSLTFTCSKEVSLME 437
Query: 435 EQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGS 494
EQY+I+KP E + RSFK+YPTT+QAAKY C+AILKD+DFSEVDRAECQF+T ATVLDNGS
Sbjct: 438 EQYYIMKPNEIASRSFKLYPTTDQAAKYVCAAILKDADFSEVDRAECQFATTATVLDNGS 497
Query: 495 QITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
QITPF+ PK N F SI+ K+ W + DF+TGK+CR+ CS FFDFSCHIQYICLSW
Sbjct: 498 QITPFELPKKKENGFIHSIKLAWKQFWGSVIDFVTGKSCRKVCSGFFDFSCHIQYICLSW 557
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
LVLFGL LA FP VLV+LW+LHQKGLFDPLYDWW+D F ++ R R + H H
Sbjct: 558 LVLFGLFLATFPAVLVILWVLHQKGLFDPLYDWWEDMFCHKSEPTRSTWKYRGERKHYHR 617
Query: 615 HVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQKDK 664
H +HH+ G +K RR +H KHKHS+RDTD Y+LHHV + K
Sbjct: 618 HGSRHHQNHGSGYK----RRSHELHK--KHKHSERDTD--YFLHHVHRKK 659
>gi|297809457|ref|XP_002872612.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297318449|gb|EFH48871.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 710
Score = 971 bits (2510), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/703 (67%), Positives = 553/703 (78%), Gaps = 45/703 (6%)
Query: 29 VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+
Sbjct: 22 VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81
Query: 89 KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+ + TRKCEPDAG D+V+ICER
Sbjct: 82 NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYSVTTRKCEPDAGPDIVQICERLR 141
Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVF IGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFSIGQR 201
Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYTNIPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFAGYTNIPSFEDFYLVIP 261
Query: 257 RQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
R+ GQP +LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAVAGQPGNLGANYSMWMLLERLRFTLDGLECNKIGVGYEAFNSQPNFCSSPYWSCLH 321
Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
NQLWN+READ NRINRNQLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFREADINRINRNQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381
Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
DDIEYV+QRSPGKII++ IPTFEALTQFGVA +TT+NTGEVEASYSLTFDCS GV +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVTTKNTGEVEASYSLTFDCSKGVAFVEE 441
Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
Q+FIIKPK + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRSFKLYPTKDQAAKYICTAILKDSLFSEVDRAECQFSTTATVLDNGTQ 501
Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
+T PFQ P++ FF+SI + KL GL DFITG CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETRPKGFFDSIRIMWTKLINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
+V+FGL+LA+ PT VLLWLLHQKGLFDP YDWW+DHF D+ R R R D +
Sbjct: 562 MVMFGLLLALIPTTCVLLWLLHQKGLFDPFYDWWEDHFDLDHHR-RLLPPTREDAINRRH 620
Query: 615 HVRKHHKQEGRHHKLEARRRRCG---------------IHSDHKHKHSDRDTDYYYYLHH 659
H + G RR + DH H YY+ LH
Sbjct: 621 HHHHRQHRHGVKTHNHHRRTHKRHKHHHNQDDDVLQNMLERDHNESH------YYHQLHR 674
Query: 660 VQKD--KHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRES 700
V KD + + R+K+ V+ ++ H+ +R++ RES
Sbjct: 675 VHKDSKQKQRRRAKHGIVLP-------RDVHVDRRKRQRLRES 710
>gi|297809471|ref|XP_002872619.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297318456|gb|EFH48878.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 708
Score = 969 bits (2504), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/668 (69%), Positives = 543/668 (81%), Gaps = 30/668 (4%)
Query: 29 VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+
Sbjct: 22 VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81
Query: 89 KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+++ TRKCEPDAG D+V+ICER
Sbjct: 82 NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYHVTTRKCEPDAGPDIVQICERLR 141
Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVF IGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFSIGQR 201
Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIP 261
Query: 257 RQGGP-GQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
R+ GQP LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAAAGQPGSLGANYSMWMLLERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLH 321
Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
NQLWN+READ NRI+R+QLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFREADINRISRHQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381
Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
DDIEYV+QRSPGKII++ IPTFEALTQFGVA +T +NTGEVEASYSLTFDCS GV +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVTIKNTGEVEASYSLTFDCSKGVAFVEE 441
Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
Q+FIIKPK + R+FK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRAFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQ 501
Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
+T PFQ P++ FF+SI +G K+ GL DFITG CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETHPKGFFDSIRILGTKIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQR----IRDFRSRRIDVD 610
+V+FGL+LA+FPT +LLWLLHQKGLFDP Y+WW+DHF D+ R R+ + R
Sbjct: 562 MVMFGLLLALFPTTCLLLWLLHQKGLFDPCYNWWEDHFDLDHHRRLLPTRENIANRHHHH 621
Query: 611 -------HPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQKD 663
H H R+ H Q +HH E + D H D YY+ LH V KD
Sbjct: 622 HKHHHGVKTHNHHRRTH-QRHKHHHGENHDVLQKMMLDRDHS----DAHYYHQLHRVHKD 676
Query: 664 KHKHGRSK 671
+ R +
Sbjct: 677 SKQKQRRR 684
>gi|66731629|gb|AAY51998.1| HAP2 [Arabidopsis thaliana]
gi|66731631|gb|AAY51999.1| HAP2 [Arabidopsis thaliana]
gi|154425503|dbj|BAE71143.2| generative cell specific-1 [Arabidopsis thaliana]
Length = 705
Score = 966 bits (2496), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/692 (67%), Positives = 555/692 (80%), Gaps = 28/692 (4%)
Query: 29 VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+
Sbjct: 22 VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81
Query: 89 KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+++ TRKCEPDAG D+V+ICER
Sbjct: 82 NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYHVTTRKCEPDAGPDIVQICERLR 141
Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVFGIGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFGIGQR 201
Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIP 261
Query: 257 RQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
R+ GQP LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAEAGQPGSLGANYSMWMLLERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLH 321
Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
NQLWN+RE+D NRI+R+QLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFRESDINRIDRHQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381
Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
DDIEYV+QRSPGKII++ IPTFEALTQFGVA + +NTGEVEASYSLTFDCS GV +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVIIKNTGEVEASYSLTFDCSKGVAFVEE 441
Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
Q+FIIKPK + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRSFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQ 501
Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
+T PFQ P++ FF+SI + K+ GL DFITG CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETQPKGFFDSIRILWTKIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
+V+FGL+LA+FP +LLWLLHQKGLFDP YDWW+DHF D+ R R SR V+ H
Sbjct: 562 MVMFGLLLALFPITCLLLWLLHQKGLFDPCYDWWEDHFDLDHHR-RLLPSRADVVNRHHH 620
Query: 615 HVRKHHKQEGRHHKLEARRRRCGIHSDHKHK----HSDRDTDYYYYLHHVQKD--KHKHG 668
H + H + + G D K D+ YY+ LH V KD + +
Sbjct: 621 HHKHRHHHNHHRRTHQRHKHHHGQDDDVLQKMMLERDHSDSHYYHQLHRVHKDSKQKQRR 680
Query: 669 RSKNSSVMQQLYLDTGKNDHIGHHRRRKFRES 700
R+K+ V+ ++ H+ R+++ RES
Sbjct: 681 RAKHGIVLP-------RDVHVERQRKQRLRES 705
>gi|145340119|ref|NP_192909.2| protein hapless 2 [Arabidopsis thaliana]
gi|385178638|sp|F4JP36.1|HAP2_ARATH RecName: Full=Protein HAPLESS 2; AltName: Full=GENERATIVE CELL
SPECIFIC 1; Flags: Precursor
gi|332657641|gb|AEE83041.1| protein hapless 2 [Arabidopsis thaliana]
Length = 705
Score = 961 bits (2485), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/692 (67%), Positives = 554/692 (80%), Gaps = 28/692 (4%)
Query: 29 VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ 88
V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGSSGGEASIVAE+VEVE+NS+
Sbjct: 22 VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSSGGEASIVAEIVEVEDNSSS 81
Query: 89 KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER-- 146
M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKPQE+++ TRKCE DAG D+V+ICER
Sbjct: 82 NMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKPQEYHVTTRKCEHDAGPDIVQICERLR 141
Query: 147 ----------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQR 196
QPICCPCGPQRR+PSSCG++FDK++KGKANTAHCLRFPGDWFHVFGIGQR
Sbjct: 142 DEKGNVLEQTQPICCPCGPQRRMPSSCGDIFDKMIKGKANTAHCLRFPGDWFHVFGIGQR 201
Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
S+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIP
Sbjct: 202 SLGFSVRVELKTGTRVSEVIIGPENRTATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIP 261
Query: 257 RQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
R+ GQP LG N+SMWMLLER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLH
Sbjct: 262 REAAEAGQPGSLGANYSMWMLLERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLH 321
Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
NQLWN+RE+D NRI+R+QLPLYG+EGRFER+NQHPNAG HSFSIGVTE LN+NL+IELRA
Sbjct: 322 NQLWNFRESDINRIDRHQLPLYGLEGRFERINQHPNAGPHSFSIGVTETLNTNLMIELRA 381
Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
DDIEYV+QRSPGKII++ IPTFEALTQFGVA + +NTGEVEASYSLTFDCS GV +EE
Sbjct: 382 DDIEYVFQRSPGKIINIAIPTFEALTQFGVAAVIIKNTGEVEASYSLTFDCSKGVAFVEE 441
Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
Q+FIIKPK + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q
Sbjct: 442 QFFIIKPKAVTTRSFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQ 501
Query: 496 IT-PFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
+T PFQ P++ FF+SI + K+ GL DFITG CR KCSSFFDFSCHIQY+CLSW
Sbjct: 502 VTNPFQIPETQPKGFFDSIRILWTKIINGLVDFITGDTCRNKCSSFFDFSCHIQYVCLSW 561
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
+V+FGL+LA+FP +LLWLLHQKGLFDP YDWW+DHF D+ R R SR V+ H
Sbjct: 562 MVMFGLLLALFPITCLLLWLLHQKGLFDPCYDWWEDHFDLDHHR-RLLPSRADVVNRHHH 620
Query: 615 HVRKHHKQEGRHHKLEARRRRCGIHSDHKHK----HSDRDTDYYYYLHHVQKD--KHKHG 668
H + H + + G D K D+ YY+ LH V KD + +
Sbjct: 621 HHKHRHHHNHHRRTHQRHKHHHGQDDDVLQKMMLERDHSDSHYYHQLHRVHKDSKQKQRR 680
Query: 669 RSKNSSVMQQLYLDTGKNDHIGHHRRRKFRES 700
R+K+ V+ ++ H+ R+++ RES
Sbjct: 681 RAKHGIVLP-------RDVHVERQRKQRLRES 705
>gi|297742571|emb|CBI34720.3| unnamed protein product [Vitis vinifera]
Length = 818
Score = 953 bits (2463), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/653 (70%), Positives = 523/653 (80%), Gaps = 26/653 (3%)
Query: 72 EASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTAS-----------YAVYE-LTYIRDVP 119
E A+VV++ E + ++ VNK S + VYE +T +
Sbjct: 133 EPDASAKVVKICERHVSLLSAIKCQFSGFVNKDISKEVMQKPCCLTWMVYEGMTLPSQLT 192
Query: 120 YKPQEFYMKTRKCEPDAGADVVKICER-QPICCPCGPQRRIPSSCGNVFDKLLKGKANTA 178
K F RK + + + I E QPICCPCG RR+PSSCGN FDKL+KGKANTA
Sbjct: 193 QKMLVFRRSKRKWKEERKYENGHIIEHTQPICCPCGTHRRVPSSCGNFFDKLMKGKANTA 252
Query: 179 HCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGD 238
HCLRFPGDWFHVFGIGQRS+GFSV IEVKTGSK+SEV VGPEN+T S DNFLKVNLIGD
Sbjct: 253 HCLRFPGDWFHVFGIGQRSLGFSVHIEVKTGSKISEVIVGPENRTVMSNDNFLKVNLIGD 312
Query: 239 FVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEA 298
F GYTNIPSFE+FYLV PRQGGPGQPQ+LG NFSMWMLLER RFTLDGLECNKIGVSYEA
Sbjct: 313 FAGYTNIPSFEDFYLVTPRQGGPGQPQNLGVNFSMWMLLERVRFTLDGLECNKIGVSYEA 372
Query: 299 FNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFS 358
FNGQP+FCSSPFW+CLHNQLWN+READQNRI+R+QLPLYGVEGRFER+NQHPNAG+ SFS
Sbjct: 373 FNGQPNFCSSPFWNCLHNQLWNFREADQNRIDRHQLPLYGVEGRFERINQHPNAGTRSFS 432
Query: 359 IGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEA 418
IG+TEVLN+NLLIEL ADDIEYVYQRSPGKI+SV IPTFEALTQFG ATITT+N G+VEA
Sbjct: 433 IGITEVLNTNLLIELSADDIEYVYQRSPGKILSVTIPTFEALTQFGTATITTKNVGKVEA 492
Query: 419 SYSLT------------FDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSA 466
SYSLT FDCS GVTLMEEQ+FI+KP E IRSFK+YPTT+QAAKY CSA
Sbjct: 493 SYSLTALYVREDSVLYFFDCSRGVTLMEEQFFIMKPNENIIRSFKLYPTTDQAAKYVCSA 552
Query: 467 ILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRD 526
ILKDSD+SEVDRAECQF+T ATV DNGSQ TPFQPPK+SIN FFESIESI K W+G D
Sbjct: 553 ILKDSDYSEVDRAECQFTTTATVFDNGSQTTPFQPPKTSINGFFESIESIWNKFWDGFVD 612
Query: 527 FITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYD 586
FITGK CRRKCS FFDFSCHIQYIC+SW+V+FGL+LAIFPTVLVLLWLLHQKGLFDPLYD
Sbjct: 613 FITGKTCRRKCSRFFDFSCHIQYICMSWMVMFGLLLAIFPTVLVLLWLLHQKGLFDPLYD 672
Query: 587 WWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKH 646
WW+D F +DNQ I D R RIDVD+PH+H+ KHHKQE RH++ +A+ +R IH +HKH
Sbjct: 673 WWEDRFWADNQSIGDTRRHRIDVDNPHIHL-KHHKQEARHYRHDAQSKRRSIHDKRRHKH 731
Query: 647 SDRDTDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
S +D+DYYYYLHHV K+KHK GRSKNSS+M Q+Y D ++D IG R RK RE
Sbjct: 732 SLQDSDYYYYLHHVHKNKHKQGRSKNSSIMHQVYSDRREDDGIGQRRCRKERE 784
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/147 (68%), Positives = 123/147 (83%)
Query: 1 MRNQTKSLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVL 60
M++Q + + LI+ I ++ V GVQILSKSKLEKCEK ++SDNLNCT KI+L
Sbjct: 1 MKDQKPRTRRRPLALIITIIFLSINGGSVYGVQILSKSKLEKCEKVSESDNLNCTKKIIL 60
Query: 61 NMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPY 120
+MAVPSGSSGGEASIVAEVVEVEENST KM+T+R+PP +TVNK+++YAVYE+TYIRDVPY
Sbjct: 61 DMAVPSGSSGGEASIVAEVVEVEENSTHKMQTLRVPPTITVNKSSAYAVYEITYIRDVPY 120
Query: 121 KPQEFYMKTRKCEPDAGADVVKICERQ 147
KPQE+++KTRKCEPDA A VVKICER
Sbjct: 121 KPQEYFVKTRKCEPDASAKVVKICERH 147
>gi|255537305|ref|XP_002509719.1| conserved hypothetical protein [Ricinus communis]
gi|223549618|gb|EEF51106.1| conserved hypothetical protein [Ricinus communis]
Length = 661
Score = 891 bits (2303), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/715 (64%), Positives = 546/715 (76%), Gaps = 82/715 (11%)
Query: 8 LKLKHFLLILFCILNL-LSPRCVVGVQILSKSKLEKCEKRTDSDN--LNCTTKIVLNMAV 64
++ + ++IL C+++ L V GV+ILSKSKLEKCEK +DSD+ LNCT KIVLNMAV
Sbjct: 1 MEKQAIVIILCCLVSYYLLVANVNGVEILSKSKLEKCEKASDSDSDSLNCTAKIVLNMAV 60
Query: 65 PSGSSGGEASIVAEVVEVEENST-QKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQ 123
PSGSSGGEASIVAE+VEVEENST M+T+RIPPV+TVNK+A+YA+YELTYIRDV YKPQ
Sbjct: 61 PSGSSGGEASIVAEIVEVEENSTSNNMQTLRIPPVITVNKSATYALYELTYIRDVAYKPQ 120
Query: 124 EFYMKTRKCEPDAGADVVKICER-------------------QPICCPCGPQRRIPSSCG 164
E+Y+KTRKCE DAG +VV+ICER +P CCPCGPQRR+PSSCG
Sbjct: 121 EYYVKTRKCERDAGTNVVQICERRVILLLLLRDEKGHIIEHTEPTCCPCGPQRRVPSSCG 180
Query: 165 NVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTA 224
N FDKL+KGKANTAHC+RFPGDWFHVFGIGQRSIGFS+RIEVKT KVSEV VGPEN+TA
Sbjct: 181 NFFDKLMKGKANTAHCVRFPGDWFHVFGIGQRSIGFSIRIEVKTRYKVSEVIVGPENRTA 240
Query: 225 TSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTL 284
TS DNFL+VNLIGDFVGY+++PSFE+FYLVIPRQ R RFTL
Sbjct: 241 TSKDNFLRVNLIGDFVGYSSLPSFEDFYLVIPRQ--------------------RVRFTL 280
Query: 285 DGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFE 344
DG+ECNKIGVSYEAFN QP+FC+SPFWSCLHNQLWNYR+
Sbjct: 281 DGIECNKIGVSYEAFNQQPNFCASPFWSCLHNQLWNYRD--------------------- 319
Query: 345 RMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFG 404
N G+HSFSIG+TEVLN+NLLIEL ADDIE+VYQRSPGKI++V IPTFEALTQFG
Sbjct: 320 ------NGGTHSFSIGITEVLNTNLLIELSADDIEFVYQRSPGKILNVTIPTFEALTQFG 373
Query: 405 VATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTC 464
V TITT NTG+VEASYSLT EQ+FI+KP E +IRSFK+YPTT+QAAKY C
Sbjct: 374 VGTITTMNTGKVEASYSLT-----------EQFFIMKPNEIAIRSFKLYPTTDQAAKYIC 422
Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
+AILKDS+F+EVDRAECQFST+AT+LDNGSQITPFQPPK+S N F +S+ESI LW+GL
Sbjct: 423 AAILKDSNFNEVDRAECQFSTIATILDNGSQITPFQPPKNSKNGFLDSVESIWNTLWKGL 482
Query: 525 RDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPL 584
DFITGK CRRKC+SFFDFSCHIQYIC+ W+V+FGL+LAI P VLVLLWLLHQKG FDPL
Sbjct: 483 VDFITGKTCRRKCTSFFDFSCHIQYICMGWMVMFGLLLAIIPLVLVLLWLLHQKGFFDPL 542
Query: 585 YDWWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKH 644
YDWW+DH +D QR + ID+ H H+HV++HH+ R K +A+ +R H +HKH
Sbjct: 543 YDWWEDHVCADKQRHGYIQRHNIDIHHHHIHVKQHHELGARRRKHDAQYKR-STHREHKH 601
Query: 645 KHSDRDTDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
HS DTDY YYLHHV KD+ K +K S + Q LD ++D+I HHR RK +E
Sbjct: 602 NHSGGDTDYNYYLHHVYKDRSKRRSAKKSRIKQHGLLDEMEDDNIKHHRHRKEKE 656
>gi|356540460|ref|XP_003538707.1| PREDICTED: uncharacterized protein LOC100794070 [Glycine max]
Length = 667
Score = 885 bits (2286), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/701 (65%), Positives = 545/701 (77%), Gaps = 61/701 (8%)
Query: 16 ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVPSGSSGGEAS 74
I I+ +LS VVG+QI+SKSKLEKCEK ++SD NLNCTTKIVLNMAVPSGSSGGEAS
Sbjct: 7 ITLMIIFILSSFHVVGIQIISKSKLEKCEKNSNSDDNLNCTTKIVLNMAVPSGSSGGEAS 66
Query: 75 IVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEP 134
IVAE+VEVEENS++KM+T+RI PV+TVNKT++YA+Y+LTYIRDVPYKP+E+Y+KTRKCEP
Sbjct: 67 IVAELVEVEENSSRKMQTLRITPVITVNKTSAYALYQLTYIRDVPYKPEEYYVKTRKCEP 126
Query: 135 DAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLR 182
DAGA+VVKICER QPICCPCGPQR +PSSCGN FDKL KGKANTAHC+
Sbjct: 127 DAGANVVKICERLRDEEGHIIEYTQPICCPCGPQRWMPSSCGNFFDKLTKGKANTAHCVH 186
Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGY 242
FPGDWFHVFGIGQR++GFSV+I+VK+G+KVSEV VGP+N+T S D F +VNLIGDFVGY
Sbjct: 187 FPGDWFHVFGIGQRTLGFSVQIQVKSGTKVSEVVVGPQNRTVISDDKFFRVNLIGDFVGY 246
Query: 243 TNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
TNIPSFE+FYLV+PRQ PGQPQDLG N SMWMLLER RFTLDG+ECNKIGV+YEAFN
Sbjct: 247 TNIPSFEDFYLVVPRQVCFPGQPQDLGRNISMWMLLERVRFTLDGIECNKIGVNYEAFNQ 306
Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGV 361
QP+FC SPFW+CLHNQLWN+READ NRI+RNQ+PLYG+EGRFER+NQHP+AGS+SFSIG+
Sbjct: 307 QPNFCPSPFWTCLHNQLWNFREADLNRISRNQVPLYGLEGRFERINQHPSAGSYSFSIGI 366
Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
TEVLN+NL++EL A+D+EYVYQRSPGKIISV +PTF ALTQFGVATITT+NTGEVEASYS
Sbjct: 367 TEVLNTNLVLELSANDVEYVYQRSPGKIISVSVPTFAALTQFGVATITTKNTGEVEASYS 426
Query: 422 LTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAEC 481
LTF+CS +TLME Y+ A Y ++LKDSD++EVDRAEC
Sbjct: 427 LTFNCSKDITLME--YY--------------------AKTYARVSVLKDSDYNEVDRAEC 464
Query: 482 QFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFF 541
QF+T ATVLDN +Q+ F P+ KL+ G + FI K R KCS FF
Sbjct: 465 QFATTATVLDNDTQVCSFLVPEF--------------KLFPGNK-FI--KKNREKCSGFF 507
Query: 542 DFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRD 601
DF CHIQY+CLSW+++FGL L IF TVLVLLWLLHQKGLFDPLYDWW+D +D Q I D
Sbjct: 508 DFKCHIQYVCLSWVMMFGLFLTIFLTVLVLLWLLHQKGLFDPLYDWWEDILGADEQIIMD 567
Query: 602 FRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHVQ 661
R +ID H H+H KHHKQE RH A RR + +H HKHS+R++DY+ LHHV
Sbjct: 568 KRKFKIDKGHHHIHENKHHKQEHRHSNYSAENRRRTTY-EHMHKHSERNSDYFDDLHHVH 626
Query: 662 KDKHKHGRSK-NSSVMQQLYLDTGKNDHIGHHRRRKFRESS 701
K+ HK+ K N +Q + DH HH+ RK R+SS
Sbjct: 627 KEMHKYEHKKQNMDNVQHIV------DHPAHHKHRKKRDSS 661
>gi|84453079|dbj|BAE71142.1| generative cell specific-1 [Lilium longiflorum]
Length = 698
Score = 840 bits (2171), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/670 (60%), Positives = 520/670 (77%), Gaps = 33/670 (4%)
Query: 30 VGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE--ENST 87
V+ILSKS++E+C K +DSD L+C KIV+++AVPSGSSGGEASIVA++VEVE EN+T
Sbjct: 23 TAVEILSKSRVERCTKTSDSDKLDCNNKIVVDLAVPSGSSGGEASIVAQLVEVEQRENAT 82
Query: 88 QKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICE-- 145
+KM T+R PPV+T+NK+A+YA+Y+L Y+RDV YKP+EF+++TR+CEPDA +++ C+
Sbjct: 83 RKMHTLREPPVITINKSAAYALYKLIYLRDVAYKPEEFHVETRRCEPDAPYEILGECQGL 142
Query: 146 ----------RQPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQ 195
QP+CCPCGP+ R P++CG++F ++ KGK NTAHCL+FPGDWFHVF IG+
Sbjct: 143 RDQNGNIIENTQPVCCPCGPEGRYPTTCGSIF-QVFKGKTNTAHCLKFPGDWFHVFAIGK 201
Query: 196 RSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVI 255
RS+GFSVR+EV+ GS SE VGP+N+ S DNFL+VNLIGDFVGYT+IPSFE+FYLV
Sbjct: 202 RSLGFSVRVEVRKGSSQSEAIVGPDNRAVLSEDNFLRVNLIGDFVGYTSIPSFEDFYLVT 261
Query: 256 PRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
PR G GQP DLGG++S WMLLER RFTLDGLECNKIGVSY+A+ QP+FCSSP WSCLH
Sbjct: 262 PRLGAAGQPTDLGGDYSKWMLLERERFTLDGLECNKIGVSYDAYRSQPNFCSSPLWSCLH 321
Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
NQLW++ EADQN+I RNQ P Y VEGRF+R+NQHPNAG+HSFS+G+TE LN+NLLIELRA
Sbjct: 322 NQLWHFWEADQNQIRRNQPPEYVVEGRFKRINQHPNAGTHSFSMGITEALNTNLLIELRA 381
Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
DDI+YVYQRSPGK++++ IPTFEALTQFG AT+TT+NTG++EASYSLTF C +GV+ +EE
Sbjct: 382 DDIDYVYQRSPGKVLAINIPTFEALTQFGTATVTTKNTGKLEASYSLTFRCRSGVSYLEE 441
Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
Q++I+KP+E RSF++Y T++ AA Y C+AILK SDFSEVDRA+CQF+T AT+LD+GSQ
Sbjct: 442 QFYIMKPEEEVSRSFRLYLTSDLAATYECAAILKASDFSEVDRADCQFTTTATILDDGSQ 501
Query: 496 ITPFQPPKS-SINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
I P K IN F+SI+SI +WEGL +F +GK CR KCSSFF+F CH+QYIC+SW
Sbjct: 502 IVPANELKEKGINGIFKSIKSIWGNIWEGLLEFFSGKTCRSKCSSFFNFRCHMQYICMSW 561
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQR------IRDFRSRRID 608
++L L+LA+FPT +VLLWLLHQ+GLFDP+YDWW D + QR +RD RS R
Sbjct: 562 ILLLSLLLAVFPTGVVLLWLLHQQGLFDPIYDWWYDRYGEGFQRSSSLFSLRDSRSARHR 621
Query: 609 VD-HPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSD-RDTDYYYYLH----HVQK 662
D + + RKH E + K R H+ HS+ D+Y++ H HV K
Sbjct: 622 GDNNARLRDRKHSFYEEKKRKRSHTSRML-----HERSHSEIAAGDHYHHRHESHLHVHK 676
Query: 663 DKHKHGRSKN 672
++HK+ SK+
Sbjct: 677 ERHKYKHSKD 686
>gi|224053957|ref|XP_002298057.1| predicted protein [Populus trichocarpa]
gi|222845315|gb|EEE82862.1| predicted protein [Populus trichocarpa]
Length = 622
Score = 830 bits (2144), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/709 (60%), Positives = 495/709 (69%), Gaps = 128/709 (18%)
Query: 14 LLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGE 72
+ ++FCI LS V ++ILSKSKLE+CEK +DSDN LNCT KIVLNMAVPSGSSGGE
Sbjct: 8 IFLIFCIF--LSYFTVQSIEILSKSKLERCEKASDSDNDLNCTRKIVLNMAVPSGSSGGE 65
Query: 73 ASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKC 132
ASIVAE+ EVEEN+T M TVR+PP DV YKP+E+Y+KTRKC
Sbjct: 66 ASIVAEIAEVEENATDLMETVRVPP-------------------DVAYKPEEYYVKTRKC 106
Query: 133 EPDAGADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFG 192
+ DAGA+VVKICE R+ G HC FHVFG
Sbjct: 107 DRDAGANVVKICE----------SRQTDEREGQ-------------HCTLRQISRFHVFG 143
Query: 193 IGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFY 252
IGQRS+GFSVRIEVKTGSKVSEVTVGPEN+T TS DNFL+VNLIGDFVGY+NIPSFE+FY
Sbjct: 144 IGQRSMGFSVRIEVKTGSKVSEVTVGPENRTVTSKDNFLRVNLIGDFVGYSNIPSFEDFY 203
Query: 253 LVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWS 312
LVIPRQG PGQPQDLG NFSMWMLLER RFTLDG+ECNKIGVSYEAF+GQP+FC+SPFWS
Sbjct: 204 LVIPRQGEPGQPQDLGRNFSMWMLLERVRFTLDGVECNKIGVSYEAFSGQPNFCASPFWS 263
Query: 313 CLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIE 372
CLHNQLWN+ H NAG+HSFSIG+TEVLN+NLLIE
Sbjct: 264 CLHNQLWNF---------------------------HDNAGTHSFSIGITEVLNTNLLIE 296
Query: 373 LRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLT--------- 423
L ADDIEYVYQRSPGK++S IPTFEALTQFGVAT++ +N GEVEASYSLT
Sbjct: 297 LTADDIEYVYQRSPGKLLSFTIPTFEALTQFGVATVSAENIGEVEASYSLTYGVVDVDSI 356
Query: 424 -------------FDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKD 470
FDCS GV+LMEEQ+FI+KP E +IRSFKIYPTT++AA+Y C+AILKD
Sbjct: 357 KICEKVEGQDLSHFDCSRGVSLMEEQFFILKPNEITIRSFKIYPTTDKAARYVCAAILKD 416
Query: 471 SDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITG 530
S F+E+DRAECQF T AT+LDNGSQI PF PPK+S+N FFESIE+I ++WEGL DFITG
Sbjct: 417 SGFNEIDRAECQFFTTATILDNGSQIAPFLPPKTSVNGFFESIENIWNRIWEGLVDFITG 476
Query: 531 KACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDD 590
K CR+KCSSFFDFSCHIQY KGLFDPLYDWW+D
Sbjct: 477 KTCRQKCSSFFDFSCHIQY----------------------------KGLFDPLYDWWED 508
Query: 591 HFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRD 650
H +D QRIRD R D +HV +HH+ R HK A ++R IH +H+H+HS RD
Sbjct: 509 HLWTDEQRIRDTRRHNKD-----IHVNRHHELGARQHKHNAHKKRT-IHQEHRHRHSGRD 562
Query: 651 TDYYYYLHHVQKDKHKHGRSKNSSVMQQLYLDTGKNDHIGHHRRRKFRE 699
T+YY+YLHHV KDK KH SK +SVMQQ+YLD N +GHH RK R+
Sbjct: 563 TEYYHYLHHVHKDKSKHRGSKKTSVMQQVYLDGVGNTKVGHHGHRKERD 611
>gi|357154351|ref|XP_003576754.1| PREDICTED: uncharacterized protein LOC100833308 [Brachypodium distachyon]
Length = 740
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/578 (65%), Positives = 472/578 (81%), Gaps = 16/578 (2%)
Query: 31 GVQILSKSKLEKCEKRTDSD-NLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK 89
GV+ILSKS++E+C + + + +L C KI+LN+AVP+GS+GGEAS+VA+VVEVEEN TQ
Sbjct: 29 GVEILSKSRVERCARDSGAGGHLACDRKIILNVAVPTGSTGGEASMVAQVVEVEENDTQA 88
Query: 90 MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER--- 146
M+T+R PPV+T+NK+A+YAVY L YIRDV Y+P+E +++TRKCE DAGA+VV+ CER
Sbjct: 89 MQTIRDPPVITINKSATYAVYALNYIRDVAYRPEEQFVRTRKCESDAGAEVVRECERLRD 148
Query: 147 ---------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGI-GQR 196
+P+CCPCG Q R+PSSCG FDK++KGKANTAHC+RFPGDWFHVFGI
Sbjct: 149 QNGHVIEHTEPVCCPCGSQHRVPSSCGTFFDKMVKGKANTAHCVRFPGDWFHVFGIETSY 208
Query: 197 SIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
S+GFS+R++VK GS V+E+ VGPENKT S DNFL+VNLIGDFVGY ++P+FE FYLV P
Sbjct: 209 SLGFSIRVQVKKGSSVTEIIVGPENKTVVSKDNFLRVNLIGDFVGYKSVPTFENFYLVTP 268
Query: 257 RQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLH 315
R+G G GQPQ LG FS WMLLER RFTLDGLECNKIGV YEA+ QPSFCS+PFWSCL+
Sbjct: 269 RKGDGGGQPQVLGDEFSRWMLLERVRFTLDGLECNKIGVGYEAYRNQPSFCSNPFWSCLY 328
Query: 316 NQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRA 375
NQLWN+ E+D NRINR Q P Y V+GRFER+NQHP+AG H+FS+G+TE +N+NLLIEL A
Sbjct: 329 NQLWNFWESDNNRINRKQQPQYVVQGRFERINQHPHAGVHTFSVGITESVNTNLLIELSA 388
Query: 376 DDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEE 435
DDI+YVYQRSPGKIIS+ +PTFEAL+Q G A +T +N G++EASYSLTFDC +G+T +EE
Sbjct: 389 DDIDYVYQRSPGKIISINVPTFEALSQVGTAQVTVRNIGKLEASYSLTFDCLSGITYVEE 448
Query: 436 QYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
QYFI+KP E IRSF ++ +T+QA+KY C+AILK SDFSE+DRAECQFST ATVLDNG+Q
Sbjct: 449 QYFILKPDEVLIRSFYLHSSTDQASKYRCAAILKASDFSELDRAECQFSTAATVLDNGTQ 508
Query: 496 ITPF-QPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
I P Q K I FFE+I+++ + W+ + DF TG++C +CSSFFD SCHIQYIC+ W
Sbjct: 509 IGPTNQHAKGGIRGFFEAIKALFRNTWDTVIDFFTGRSCSTRCSSFFDLSCHIQYICIGW 568
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHF 592
LV+FGL+LAI P V VLLWLLHQ GLFDPLYD W+D F
Sbjct: 569 LVMFGLLLAILPAVAVLLWLLHQNGLFDPLYDCWEDVF 606
>gi|291620044|gb|ADE20442.1| HAP2 [Sisymbrium irio]
Length = 504
Score = 821 bits (2120), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/504 (74%), Positives = 444/504 (88%), Gaps = 14/504 (2%)
Query: 63 AVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKP 122
AVPSGSSGGEASIVAE+VEVE+NS+ M+TVRIPPV+TVNK+A+YA+Y+LTYIRDVPYKP
Sbjct: 1 AVPSGSSGGEASIVAEIVEVEDNSSSNMQTVRIPPVITVNKSAAYALYDLTYIRDVPYKP 60
Query: 123 QEFYMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSSCGNVFDKL 170
QEF++ TRKCEPD+G D+V ICER QP+CCPCGP+RR+PSSCG++F+++
Sbjct: 61 QEFHVTTRKCEPDSGPDIVDICERLRDDTGNVLEQTQPVCCPCGPERRLPSSCGDIFERM 120
Query: 171 LKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNF 230
+KGKANTAHCLRFPGDW+HVF IGQRS+GFSVR+E+KTG++VSEV +GPEN+TAT+ DNF
Sbjct: 121 VKGKANTAHCLRFPGDWYHVFSIGQRSLGFSVRVELKTGTRVSEVIIGPENRTATANDNF 180
Query: 231 LKVNLIGDFVGYTNIPSFEEFYLVIPRQGG-PGQPQDLGGNFSMWMLLERTRFTLDGLEC 289
LKVNLIGDF GYTNIPSFE+FYLVIPR+ GQP +LGGN+SMWMLLER RFTLDG+EC
Sbjct: 181 LKVNLIGDFAGYTNIPSFEDFYLVIPREAAVEGQPGNLGGNYSMWMLLERVRFTLDGIEC 240
Query: 290 NKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQH 349
+KIGV YEAFN QP+FCS+P+WSCLHNQLWN+READ NR+NR+QLPLYG+EGRFER+NQH
Sbjct: 241 DKIGVGYEAFNNQPNFCSAPYWSCLHNQLWNFREADVNRMNRHQLPLYGLEGRFERINQH 300
Query: 350 PNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATIT 409
PN+G HSFSIGVTE LN+NL+IELRADDIEYV+Q+SPGKII++ IPTFEALTQFGVA +T
Sbjct: 301 PNSGPHSFSIGVTETLNTNLMIELRADDIEYVFQKSPGKIINIAIPTFEALTQFGVAAVT 360
Query: 410 TQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILK 469
T+NTGEVEASYSLTFDCS GV +EEQ+FIIKP E + RSFK+YPT +QAAKY C+AILK
Sbjct: 361 TKNTGEVEASYSLTFDCSKGVAFVEEQFFIIKPNEATTRSFKLYPTKDQAAKYICTAILK 420
Query: 470 DSDFSEVDRAECQFSTMATVLDNGSQIT-PFQPPKSSINDFFESIESIGKKLWEGLRDFI 528
DS FSEVDRAECQFST ATVLDNG+Q+T PFQ P++ FFESI + L GL DFI
Sbjct: 421 DSQFSEVDRAECQFSTTATVLDNGTQVTNPFQIPETRPKGFFESIRLMWTNLVNGLVDFI 480
Query: 529 TGKACRRKCSSFFDFSCHIQYICL 552
TG +CR KCSSFFDFSCHIQY+CL
Sbjct: 481 TGDSCRNKCSSFFDFSCHIQYVCL 504
>gi|4539463|emb|CAB39943.1| putative protein [Arabidopsis thaliana]
gi|7267872|emb|CAB78215.1| putative protein [Arabidopsis thaliana]
Length = 658
Score = 700 bits (1807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/746 (52%), Positives = 462/746 (61%), Gaps = 183/746 (24%)
Query: 29 VVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGS-------------------- 68
V G+QILSKSKLEKCEK +DS NLNC+TKIVLN+AVPSGS
Sbjct: 22 VDGIQILSKSKLEKCEKTSDSGNLNCSTKIVLNLAVPSGSVRFFFFSKTHIYTCFGFVFI 81
Query: 69 --------------SGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTY 114
SGGEASIVAE+VEVE+NS+ M+TVRIPPV+TVNK+A+YA+Y+LTY
Sbjct: 82 NFVFTCFGFVDETKSGGEASIVAEIVEVEDNSSSNMQTVRIPPVITVNKSAAYALYDLTY 141
Query: 115 IRDVPYKPQEFYMKTRKCEPDAGADVVKICER------------QPICCPCGPQRRIPSS 162
IRDVPYKPQE+++ TRKCE DAG D+V+ICER QPICCPCGPQRR+PSS
Sbjct: 142 IRDVPYKPQEYHVTTRKCEHDAGPDIVQICERLRDEKGNVLEQTQPICCPCGPQRRMPSS 201
Query: 163 CGNV-----------FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSK 211
CG++ FDK++KGKANTAHCLRFPGDW
Sbjct: 202 CGDICMCFSFVTFKEFDKMIKGKANTAHCLRFPGDW------------------------ 237
Query: 212 VSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNF 271
TAT+ DNFLKVNLIGDF GYT+IPSFE+FYLVIPR
Sbjct: 238 -----------TATANDNFLKVNLIGDFGGYTSIPSFEDFYLVIPR-------------- 272
Query: 272 SMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINR 331
ER RFTLDGLECNKIGV YEAFN QP+FCSSP+WSCLHNQLWN+RE
Sbjct: 273 ------ERVRFTLDGLECNKIGVGYEAFNTQPNFCSSPYWSCLHNQLWNFRE-------- 318
Query: 332 NQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIIS 391
NAG HSFSIGVTE LN+NL+IELRADDIEYV+QRSPGKII+
Sbjct: 319 -------------------NAGPHSFSIGVTETLNTNLMIELRADDIEYVFQRSPGKIIN 359
Query: 392 VIIPTFEALTQFGVATITTQNTGEVEASYSLT----------FDCSTGVTLMEEQYFIIK 441
+ IPTFEALTQFGVA + +NTGEVEASYSLT FDCS GV +EEQ+FIIK
Sbjct: 360 IAIPTFEALTQFGVAAVIIKNTGEVEASYSLTVISKTESYLIFDCSKGVAFVEEQFFIIK 419
Query: 442 PKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQIT-PFQ 500
PK + RSFK+YPT +QAAKY C+AILKDS FSEVDRAECQFST ATVLDNG+Q+T PFQ
Sbjct: 420 PKAVTTRSFKLYPTKDQAAKYICTAILKDSQFSEVDRAECQFSTTATVLDNGTQVTNPFQ 479
Query: 501 PPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGL 560
P++ FF+SI + K+ GL DFITG C SW+V+FGL
Sbjct: 480 IPETQPKGFFDSIRILWTKIINGLVDFITGDTC-------------------SWMVMFGL 520
Query: 561 VLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHVHVRKHH 620
+LA+FP +LLWLLHQKGLFDP YDWW+DHF D+ R R SR V+ H H + H
Sbjct: 521 LLALFPITCLLLWLLHQKGLFDPCYDWWEDHFDLDHHR-RLLPSRADVVNRHHHHHKHRH 579
Query: 621 KQEGRHHKLEARRRRCGIHSDHKHK----HSDRDTDYYYYLHHVQKD--KHKHGRSKNSS 674
+ + G D K D+ YY+ LH V KD + + R+K+
Sbjct: 580 HHNHHRRTHQRHKHHHGQDDDVLQKMMLERDHSDSHYYHQLHRVHKDSKQKQRRRAKHGI 639
Query: 675 VMQQLYLDTGKNDHIGHHRRRKFRES 700
V+ ++ H+ R+++ RES
Sbjct: 640 VLP-------RDVHVERQRKQRLRES 658
>gi|115462909|ref|NP_001055054.1| Os05g0269500 [Oryza sativa Japonica Group]
gi|75110629|sp|Q5W6B9.1|HAP2A_ORYSJ RecName: Full=Protein HAPLESS 2-A; Flags: Precursor
gi|55168095|gb|AAV43963.1| unknown protein [Oryza sativa Japonica Group]
gi|113578605|dbj|BAF16968.1| Os05g0269500 [Oryza sativa Japonica Group]
gi|215706325|dbj|BAG93181.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 722
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/585 (58%), Positives = 437/585 (74%), Gaps = 32/585 (5%)
Query: 31 GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE--NST 87
G +ILSKS+LE C +D+ L C K+V+++AVPSG+SGGEAS+VA V VEE ++
Sbjct: 24 GTEILSKSRLESCSHDSDAGGRLKCDRKLVVDLAVPSGASGGEASLVARVAGVEEENDTP 83
Query: 88 QKMRTVRIPPVLTVNKTASYAVYELTYI-RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+++R PPV+TV+K+A+YA+Y LTY+ RDV Y+P E Y+KT KCEP AGA VV CER
Sbjct: 84 SATKSIRDPPVITVSKSATYALYALTYLDRDVAYRPDEKYVKTHKCEPYAGAKVVGECER 143
Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
+PICCPCGP R + S CG+++ KL KGKANTAHC+RFPGDWFHVFGIG
Sbjct: 144 LWDEKGNVIKQTEPICCPCGPHR-VQSKCGDIWSKLTKGKANTAHCVRFPGDWFHVFGIG 202
Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
S+ FS+R++VK GS V +V VGPENKT S DNFL+V ++GD+ GYT+IPSFE+ YLV
Sbjct: 203 AWSLRFSIRVQVKKGSSVWDVVVGPENKTVVSGDNFLRVKVVGDYTGYTSIPSFEDNYLV 262
Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
PR+G G QPQDLG S WM+L+R RFTLDGLEC+KIGV YEA+ QP+FCS+P+ SC
Sbjct: 263 TPRKGTGSSQPQDLGNEHSKWMILDRVRFTLDGLECDKIGVGYEAYRNQPNFCSAPYGSC 322
Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
L NQLWN+ E D+ RI+ +QLPLY VEGRF+R+NQHPNAG+H+FS+GVTE LN+NLLIEL
Sbjct: 323 LGNQLWNFWEYDKRRIDNSQLPLYIVEGRFQRINQHPNAGAHTFSVGVTEDLNTNLLIEL 382
Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
ADDIEYVYQRSP KII + +PTFEAL+Q G+A +TT+N G++E+SYSLTF CS+G++ +
Sbjct: 383 MADDIEYVYQRSPAKIIDIRVPTFEALSQVGIANVTTKNIGKLESSYSLTFKCSSGISPV 442
Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
EEQ + +KP E RSF++ TT+QAA + C AILK SDFSE+DR +FST ATV +NG
Sbjct: 443 EEQLYTMKPDEVIARSFELRSTTDQAAMHQCEAILKASDFSELDREGYRFSTAATVYNNG 502
Query: 494 SQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLS 553
+QI P K F++SI K LW L DF+TG+ C KC FDF CHIQY+C+
Sbjct: 503 AQIGPTNDHKK--GGFWDSI----KALWRNLIDFLTGRLCWTKCPRLFDFGCHIQYVCIG 556
Query: 554 WLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWW----DDHFQS 594
W++ +L + P +V LWLLHQ+GLFDPLYDWW DD +++
Sbjct: 557 WIL----LLLLIPAAVVFLWLLHQEGLFDPLYDWWGLEPDDDYRA 597
>gi|222630910|gb|EEE63042.1| hypothetical protein OsJ_17850 [Oryza sativa Japonica Group]
Length = 722
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/585 (58%), Positives = 437/585 (74%), Gaps = 32/585 (5%)
Query: 31 GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE--NST 87
G +ILSKS+LE C +D+ L C K+V+++AVPSG+SGGEAS+VA V VEE ++
Sbjct: 24 GTEILSKSRLESCSHDSDAGGRLKCDRKLVVDLAVPSGASGGEASLVARVAGVEEENDTP 83
Query: 88 QKMRTVRIPPVLTVNKTASYAVYELTYI-RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+++R PPV+TV+K+A+YA+Y LTY+ RDV Y+P E Y+KT KCEP AGA VV CER
Sbjct: 84 SATKSIRDPPVITVSKSATYALYALTYLDRDVAYRPDEKYVKTHKCEPYAGAKVVGECER 143
Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
+PICCPCGP R + S CG+++ KL KGKANTAHC+RFPGDWFHVFGIG
Sbjct: 144 LWDEKGNVIKQTEPICCPCGPHR-VQSKCGDIWSKLTKGKANTAHCVRFPGDWFHVFGIG 202
Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
S+ FS+R++VK GS V +V VGPENKT S DNFL+V ++GD+ GYT+IPSFE+ YLV
Sbjct: 203 AWSLRFSIRVQVKKGSSVWDVVVGPENKTVVSGDNFLRVKVVGDYTGYTSIPSFEDNYLV 262
Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
PR+G G QPQDLG S WM+L+R RFTLDGLEC+KIGV YEA+ QP+FCS+P+ SC
Sbjct: 263 TPRKGTGSSQPQDLGNEHSKWMILDRVRFTLDGLECDKIGVGYEAYRNQPNFCSAPYGSC 322
Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
L NQLWN+ E D+ RI+ +QLPLY VEGRF+R+NQHPNAG+H+FS+GVTE LN+NLLIEL
Sbjct: 323 LGNQLWNFWEYDKRRIDNSQLPLYIVEGRFQRINQHPNAGAHTFSVGVTEDLNTNLLIEL 382
Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
ADDIEYVYQRSP KII + +PTFEAL+Q G+A +TT+N G++E+SYSLTF CS+G++ +
Sbjct: 383 MADDIEYVYQRSPAKIIDIRVPTFEALSQVGIANVTTKNIGKLESSYSLTFKCSSGISPV 442
Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
EEQ + +KP E RSF++ TT+QAA + C AILK SDFSE+DR +FST ATV +NG
Sbjct: 443 EEQLYTMKPDEVIARSFELRSTTDQAAMHQCEAILKASDFSELDREGYRFSTAATVYNNG 502
Query: 494 SQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLS 553
+QI P K F++SI K LW L DF+TG+ C KC FDF CHIQY+C+
Sbjct: 503 AQIGPTNDHKK--GGFWDSI----KALWRNLIDFLTGRLCWTKCPRLFDFGCHIQYVCIG 556
Query: 554 WLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWW----DDHFQS 594
W++ +L + P +V LWLLHQ+GLFDPLYDWW DD +++
Sbjct: 557 WIL----LLLLIPAAVVFLWLLHQEGLFDPLYDWWGLEPDDDYRA 597
>gi|385178637|sp|B9G4M9.1|HAP2B_ORYSJ RecName: Full=Protein HAPLESS 2-B; Flags: Precursor
gi|222641945|gb|EEE70077.1| hypothetical protein OsJ_30063 [Oryza sativa Japonica Group]
Length = 714
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/585 (56%), Positives = 420/585 (71%), Gaps = 40/585 (6%)
Query: 31 GVQILSKSKLEKCEKRTDSDN---LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENST 87
GV++L+KS+LE C + D L C +KIV+++AVPSGS AS+VA V EVEEN T
Sbjct: 33 GVEVLAKSRLESCARGGSDDGRDRLTCDSKIVVDLAVPSGS----ASLVARVAEVEENGT 88
Query: 88 QKMRT-VRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+ +R P ++T+NK+ YA+Y+LTY+RDV YKP+E ++KTRKCEP+AGA+VVK CER
Sbjct: 89 EAGEMPIRDPLIITINKSEVYALYDLTYLRDVAYKPEEKFVKTRKCEPEAGANVVKSCER 148
Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
+P+CCPCGP RR+PSSCGN+ DK+ KGKANTAHCLRFP DWFHVF IG
Sbjct: 149 LRDEKGSIIEHTEPVCCPCGPHRRVPSSCGNILDKVAKGKANTAHCLRFPDDWFHVFDIG 208
Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
+RS+ FS+R++VK GS SEV VGPEN+T S D+ L+VNL+GDF GYT++PS E FYLV
Sbjct: 209 RRSLWFSIRVQVKKGSSESEVIVGPENRTVVSEDSSLRVNLVGDFAGYTSLPSLENFYLV 268
Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
PR+G G GQ + LG +FS WMLLER FTLDGLECNKIGV YEAF QP+FCSSP SC
Sbjct: 269 TPRKGVGGGQLEVLGDDFSRWMLLERVLFTLDGLECNKIGVGYEAFRSQPNFCSSPLDSC 328
Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
L +QL + E D+NR+N +Q P Y V G+FER+NQ+PNAG H+FS+G+ EVLN+NL+IEL
Sbjct: 329 LGDQLSKFWEIDKNRVNNSQPPQYVVLGKFERINQYPNAGVHTFSVGIPEVLNTNLMIEL 388
Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
ADDIEYVYQRS GKIIS+ I +FEAL+Q G A + T+N G +EASYSLTFDC +G+ +
Sbjct: 389 SADDIEYVYQRSSGKIISINISSFEALSQVGSARVKTKNIGRLEASYSLTFDCLSGINPV 448
Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
EEQYFI+KP E IR+F + +T+QA+ YTC AILK SDFSE+DR E QFST ATVL+NG
Sbjct: 449 EEQYFIMKPDEKLIRTFDLRSSTDQASNYTCQAILKASDFSELDRKESQFSTTATVLNNG 508
Query: 494 SQITPFQP-PKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICL 552
+QI + K I FFE+I++ K+W L +F TG C +C SF F H
Sbjct: 509 TQIGSSENHTKGGIWGFFEAIKAWCAKMWHMLINFFTGTTCSTRCWSFLKFVIHGL---- 564
Query: 553 SWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQ 597
++ +LWLLH+KGLFDPLY WWD S+ Q
Sbjct: 565 --------------LLVAVLWLLHRKGLFDPLYYWWDGVVGSEAQ 595
>gi|218202482|gb|EEC84909.1| hypothetical protein OsI_32102 [Oryza sativa Indica Group]
Length = 718
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/585 (56%), Positives = 411/585 (70%), Gaps = 37/585 (6%)
Query: 31 GVQILSKSKLEKCEKRTDSDNLNC---TTKIVLNMAVPSGSSGGEASIVAEVVEVEENST 87
GV++L+KS+LE C + D T + P + GGEAS+VA V EVEEN T
Sbjct: 36 GVEVLAKSRLESCARGGSDDGATASPATARSSSTWPCPV-ARGGEASLVARVAEVEENGT 94
Query: 88 QKMRTVRIPP-VLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+ + P ++T+NK+ YA+Y+LTY+RDV Y P+E Y+KTRKCEP+AGA+VVK CER
Sbjct: 95 EAGEMPILDPLIITINKSEVYALYDLTYLRDVAYIPEEKYVKTRKCEPEAGANVVKSCER 154
Query: 147 ------------QPICCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIG 194
+P+CCPCGP RR+PSSCGN+FDK+ KGKANTAHCLRFP DWFHVF IG
Sbjct: 155 LRDEKGSIIEHTEPVCCPCGPHRRVPSSCGNIFDKVAKGKANTAHCLRFPDDWFHVFDIG 214
Query: 195 QRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLV 254
+RS+ FS+R++VK GS SEV VGPEN+T S D+ L+VNL+GDF GYT++PS E FYLV
Sbjct: 215 RRSLWFSIRVQVKKGSSESEVIVGPENRTVVSEDSSLRVNLVGDFAGYTSLPSLENFYLV 274
Query: 255 IPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSC 313
PR+G G GQ Q LG +FS WMLLER FTLDGLECNKIGV YEAF QP+FCSSP SC
Sbjct: 275 TPRKGVGGGQLQVLGDDFSRWMLLERVLFTLDGLECNKIGVGYEAFRSQPNFCSSPLDSC 334
Query: 314 LHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIEL 373
L +QL + E D+NR+N +Q P Y V G+FER+NQ+PNAG H+FS+G+ EVLN+NL+IEL
Sbjct: 335 LGDQLSKFWEIDKNRVNNSQPPQYVVLGKFERINQYPNAGVHTFSVGIPEVLNTNLMIEL 394
Query: 374 RADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLM 433
ADDIEYVYQRS GKIIS+ I +FEAL+Q G A + T+N G +EASYSLTFDC +G+ +
Sbjct: 395 SADDIEYVYQRSSGKIISINISSFEALSQVGSARVKTKNIGRLEASYSLTFDCLSGINPV 454
Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
EEQYFI+KP E IR+F + +T+QA+ YTC AILK SDFSE+DR E QFST ATVL+NG
Sbjct: 455 EEQYFIMKPDEKLIRTFDLRSSTDQASNYTCQAILKASDFSELDRKESQFSTTATVLNNG 514
Query: 494 SQITPFQP-PKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICL 552
+QI + K I FFE+I++ K+W L +F TG C +C SF F H
Sbjct: 515 TQIGSSENHTKGGIWGFFEAIKAWCAKMWHMLINFFTGTTCSTRCWSFLKFVIHGL---- 570
Query: 553 SWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQ 597
++ +LWLLH+KGLFDPLY WWD S+ Q
Sbjct: 571 --------------LLVAVLWLLHRKGLFDPLYYWWDGVVGSEAQ 601
>gi|218196451|gb|EEC78878.1| hypothetical protein OsI_19239 [Oryza sativa Indica Group]
Length = 532
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 247/412 (59%), Positives = 313/412 (75%), Gaps = 15/412 (3%)
Query: 188 FHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPS 247
FHVFGIG S+ FS+R++VK GS V +V VGPENKT S DNFL+V ++GD+ GYT+IPS
Sbjct: 6 FHVFGIGAWSLRFSIRVQVKKGSSVWDVVVGPENKTVVSGDNFLRVKVVGDYTGYTSIPS 65
Query: 248 FEEFYLVIPRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFC 306
FEE YLV PR+G G QPQDLG S WM+L+R RFTLDGLEC+KIGV YEA+ QP+FC
Sbjct: 66 FEENYLVTPRKGTGSSQPQDLGNEHSKWMILDRVRFTLDGLECDKIGVGYEAYRNQPNFC 125
Query: 307 SSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLN 366
S+P+ SCL NQLWN+ E D+ RI+ +QLPLY VEGRF+R+NQHPNAG+H+FS+GVTE LN
Sbjct: 126 SAPYGSCLGNQLWNFWEYDKRRIDNSQLPLYIVEGRFQRINQHPNAGAHTFSVGVTEDLN 185
Query: 367 SNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDC 426
+NLLIEL ADDIEYVYQRSP KII + +PTFEAL+Q G+A +TT+N G++E+SYSLTF C
Sbjct: 186 TNLLIELMADDIEYVYQRSPAKIIDIRVPTFEALSQVGIANVTTKNIGKLESSYSLTFKC 245
Query: 427 STGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTM 486
S+G++ +EEQ + +KP E RSF++ TT+QAA + C AILK SDFSE+DR +FST
Sbjct: 246 SSGISPVEEQLYTMKPDEVIARSFELRSTTDQAAMHQCEAILKASDFSELDREGYRFSTA 305
Query: 487 ATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCH 546
ATV +NG+QI P K F++SI K LW L DF+TG+ C KC FDF CH
Sbjct: 306 ATVYNNGAQIGPTNDHKKG--GFWDSI----KALWRNLIDFLTGRLCWTKCPRLFDFGCH 359
Query: 547 IQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWW----DDHFQS 594
IQY+C+ W++ +L + P +V LWLLHQ+GLFDPLYDWW DD +++
Sbjct: 360 IQYVCIGWIL----LLLLIPAAVVFLWLLHQEGLFDPLYDWWGLEPDDDYRA 407
>gi|414591383|tpg|DAA41954.1| TPA: hypothetical protein ZEAMMB73_607847 [Zea mays]
Length = 536
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 234/384 (60%), Positives = 295/384 (76%), Gaps = 6/384 (1%)
Query: 214 EVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQ-GGPGQPQDLGGNFS 272
EV VGPEN+T S DNFL+VNLIGDF GYT+IP+FE+FYLV PR+ G G+PQ+LG +
Sbjct: 21 EVVVGPENRTVVSKDNFLRVNLIGDFGGYTSIPAFEDFYLVTPRKSAGSGEPQNLGAEYR 80
Query: 273 MWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRN 332
WMLLER RFT DG+ECNKIGV YEAF QP+FC+SPF SCL+NQLW + E+D+NRI+ +
Sbjct: 81 KWMLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMS 139
Query: 333 QLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISV 392
+ P Y V+GRF+R+NQHP+A HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I +
Sbjct: 140 RQPQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDI 199
Query: 393 IIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKI 452
+P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F +
Sbjct: 200 SVPAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYL 259
Query: 453 YPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFES 512
+ +T+QAAKY C+AILK SD SE+DR EC FST ATVLDNG+QI K FF++
Sbjct: 260 HASTDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDNGTQIIGSNGYKLG---FFDT 316
Query: 513 IESIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVL 571
I+ W+ L D I+GK+CR KC SFFDFSCH QY C++WLV+ L+L + P ++
Sbjct: 317 IKGYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRCITWLVMLVLLLFMLPAGAIV 376
Query: 572 LWLLHQKGLFDPLYDWWDDHFQSD 595
L+LLHQKG FDP+YDWWDD +D
Sbjct: 377 LYLLHQKGFFDPVYDWWDDLLGAD 400
>gi|110430669|gb|ABG73459.1| histidine rich-like protein [Oryza brachyantha]
Length = 634
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 280/619 (45%), Positives = 366/619 (59%), Gaps = 119/619 (19%)
Query: 37 KSKLEKCEKRTDSDN--LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK--MRT 92
KS+LE C + TD L C +K+VL++AVPS SSGGEAS+VA+V +VEEN T+ MR
Sbjct: 42 KSRLESCVRDTDDGGRRLTCDSKLVLDVAVPSDSSGGEASLVAKVADVEENDTEATPMR- 100
Query: 93 VRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICERQPICCP 152
+R PPV+T+NK+ +A+Y LTY+RDV YKP+E ++KTRKCEPDAG++VVK CE P
Sbjct: 101 IRDPPVITINKSEVFALYALTYLRDVSYKPEEKFVKTRKCEPDAGSEVVKFCESL-FVVP 159
Query: 153 CGPQR------RIPSSCGNVFDKLLKGKANTAHCLRF---------PGDW--FHVFGIGQ 195
G S N +L G H L++ P + FHVF IG+
Sbjct: 160 VGLTAVHLHPVETYSCLENYSHDILFG--FYIHVLKYMWRKITLWRPSVFARFHVFEIGR 217
Query: 196 RSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVI 255
RS+GFS+ ++VK S VS+V VGP+N+T S DNFL+V L+GDFVGYT+IPSFE+FYLV
Sbjct: 218 RSLGFSISVQVKKASSVSKVIVGPDNRTVVSKDNFLRVKLVGDFVGYTSIPSFEDFYLVT 277
Query: 256 PRQG-GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
PR+G G G+PQ +G +FS WMLLER RFTLDGLECNKIGV YEA++ QP+FCSSP SCL
Sbjct: 278 PRKGVGGGEPQ-VGDDFSRWMLLERVRFTLDGLECNKIGVGYEAYSSQPNFCSSPLQSCL 336
Query: 315 HNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
+QLWN+ E+D+ R+N +Q P Y V+G
Sbjct: 337 GDQLWNFWESDKIRVNNSQPPQYLVQG--------------------------------- 363
Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLME 434
RSPGKIIS+ + TFEAL+Q G A + T+N G++EASYSLTF CS+G+ +E
Sbjct: 364 ---------RSPGKIISINVSTFEALSQVGTAQVKTKNIGKLEASYSLTFGCSSGINPVE 414
Query: 435 EQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGS 494
EQ FI+KP E IRSF ++ +T QA+ YTC AILK S+FSE+DR ECQFST ATVL+NG+
Sbjct: 415 EQSFIMKPDEEIIRSFDLHSSTVQASNYTCKAILKGSNFSELDRKECQFSTTATVLNNGT 474
Query: 495 QITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSW 554
Q F+ + +G +W + RR
Sbjct: 475 QYKMFK------------LFQVG-HVWHAI--------PRR------------------- 494
Query: 555 LVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV 614
IF +V +W LHQKGLFDP+YDWWDD F R R + + H
Sbjct: 495 -------YGIFSSV---MWFLHQKGLFDPIYDWWDDVFGLSEARSHQRHKRSHSLRNYHH 544
Query: 615 HVRKHHKQEGRHHKLEARR 633
H ++H + H+ + R
Sbjct: 545 HHKRHKSEPVSGHRHHSHR 563
>gi|242049910|ref|XP_002462699.1| hypothetical protein SORBIDRAFT_02g030440 [Sorghum bicolor]
gi|241926076|gb|EER99220.1| hypothetical protein SORBIDRAFT_02g030440 [Sorghum bicolor]
Length = 607
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 247/501 (49%), Positives = 333/501 (66%), Gaps = 58/501 (11%)
Query: 31 GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK 89
G ++L+KS LE C + + L+C K+V++MAVPS SSGGEAS+VA+V V N T++
Sbjct: 27 GAEVLAKSLLESCVDDSGAGGRLSCDRKVVVDMAVPSESSGGEASLVAQVAHV--NDTEQ 84
Query: 90 MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCEPDAGADVVKICERQPI 149
+T+R PPV+TVNK A YA+Y L YIRDV YKP+E +++TRKCEPDAGADVV CE
Sbjct: 85 TKTIRNPPVITVNKGAVYALYALNYIRDVAYKPEEQFVETRKCEPDAGADVVGACESL-- 142
Query: 150 CCPCGPQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK-- 207
+P V+ L++ ++F + + + S+ +E++
Sbjct: 143 -------FAVPVVLTAVYLHLVE-----TFLIKFSKEKLIQLTVYDFQVTGSMFLELEKD 190
Query: 208 -----TGSKV-SEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQG-G 260
+G K+ EV VGPEN+T S DNFL+VNLIGDF GYT+IP+FE FYLV PR+G G
Sbjct: 191 YLGSTSGYKLRKEVVVGPENRTVVSKDNFLRVNLIGDFSGYTSIPTFENFYLVTPRKGAG 250
Query: 261 PGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWN 320
G+PQ+LG +S WMLLER RFT +G+EC+KIGV Y+AF QP+FC+S F SCL+NQL
Sbjct: 251 SGEPQNLGAEYSKWMLLERVRFT-EGIECDKIGVGYQAFQNQPNFCASAFGSCLYNQLST 309
Query: 321 YREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEY 380
+ E NA H+FSIGVTEV NSNL IEL ADDIEY
Sbjct: 310 FLE---------------------------NATVHTFSIGVTEVRNSNLRIELSADDIEY 342
Query: 381 VYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFII 440
+YQRSPGKI ++ +PTFEAL+Q+G A +TT+N G++EASY+LTF+C +G++ +EEQY+++
Sbjct: 343 MYQRSPGKITNISVPTFEALSQYGTAKVTTKNIGKLEASYTLTFNCLSGISFVEEQYYVL 402
Query: 441 KPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQ 500
KP E S R F + +T++AAKY C+AILK SDFSE+DR EC FSTMATVLDNG+Q F
Sbjct: 403 KPDEASTRLFYLRASTDKAAKYQCTAILKASDFSELDRQECLFSTMATVLDNGTQKGFFD 462
Query: 501 PPKSSINDFFESIESIGKKLW 521
P + D++E + + + +
Sbjct: 463 P----VYDWWEDLLGLDDRTY 479
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 15/72 (20%)
Query: 529 TGKACRRKCSSFF---DFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLY 585
T KA + +C++ DFS + CL F T+ +L QKG FDP+Y
Sbjct: 418 TDKAAKYQCTAILKASDFSELDRQECL------------FSTMATVLDNGTQKGFFDPVY 465
Query: 586 DWWDDHFQSDNQ 597
DWW+D D++
Sbjct: 466 DWWEDLLGLDDR 477
>gi|226492062|ref|NP_001141873.1| hypothetical protein [Zea mays]
gi|223944697|gb|ACN26432.1| unknown [Zea mays]
gi|414591385|tpg|DAA41956.1| TPA: hypothetical protein ZEAMMB73_607847 [Zea mays]
Length = 454
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/322 (60%), Positives = 246/322 (76%), Gaps = 5/322 (1%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
MLLER RFT DG+ECNKIGV YEAF QP+FC+SPF SCL+NQLW + E+D+NRI+ ++
Sbjct: 1 MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59
Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
P Y V+GRF+R+NQHP+A HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I + +
Sbjct: 60 PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISV 119
Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F ++
Sbjct: 120 PAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYLHA 179
Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIE 514
+T+QAAKY C+AILK SD SE+DR EC FST ATVLDNG+QI K FF++I+
Sbjct: 180 STDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDNGTQIIGSNGYKLG---FFDTIK 236
Query: 515 SIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLW 573
W+ L D I+GK+CR KC SFFDFSCH QY C++WLV+ L+L + P ++L+
Sbjct: 237 GYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRCITWLVMLVLLLFMLPAGAIVLY 296
Query: 574 LLHQKGLFDPLYDWWDDHFQSD 595
LLHQKG FDP+YDWWDD +D
Sbjct: 297 LLHQKGFFDPVYDWWDDLLGAD 318
>gi|194706256|gb|ACF87212.1| unknown [Zea mays]
Length = 454
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/322 (59%), Positives = 245/322 (76%), Gaps = 5/322 (1%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
MLLER RFT DG+ECNKIGV YEAF QP+FC+SPF SCL+NQLW + E+D+NRI+ ++
Sbjct: 1 MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59
Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
P Y V+GRF+R+NQHP+A HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I + +
Sbjct: 60 PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISV 119
Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F ++
Sbjct: 120 PAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYLHA 179
Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIE 514
+T+QAAKY C+AILK SD SE+DR C FST ATVLDNG+QI K FF++I+
Sbjct: 180 STDQAAKYQCTAILKASDSSELDRQGCVFSTTATVLDNGTQIIGSNGYKLG---FFDTIK 236
Query: 515 SIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLW 573
W+ L D I+GK+CR KC SFFDFSCH QY C++WLV+ L+L + P ++L+
Sbjct: 237 GYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRCITWLVMLVLLLFMLPAGAIVLY 296
Query: 574 LLHQKGLFDPLYDWWDDHFQSD 595
LLHQKG FDP+YDWWDD +D
Sbjct: 297 LLHQKGFFDPVYDWWDDLLGAD 318
>gi|115480257|ref|NP_001063722.1| Os09g0525700 [Oryza sativa Japonica Group]
gi|52076043|dbj|BAD46496.1| unknown protein [Oryza sativa Japonica Group]
gi|52077311|dbj|BAD46352.1| unknown protein [Oryza sativa Japonica Group]
gi|113631955|dbj|BAF25636.1| Os09g0525700 [Oryza sativa Japonica Group]
Length = 425
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 182/324 (56%), Positives = 226/324 (69%), Gaps = 19/324 (5%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
MLLER FTLDGLECNKIGV YEAF QP+FCSSP SCL +QL + E D+NR+N +Q
Sbjct: 1 MLLERVLFTLDGLECNKIGVGYEAFRSQPNFCSSPLDSCLGDQLSKFWEIDKNRVNNSQP 60
Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
P Y V G+FER+NQ+PNAG H+FS+G+ EVLN+NL+IEL ADDIEYVYQRS GKIIS+ I
Sbjct: 61 PQYVVLGKFERINQYPNAGVHTFSVGIPEVLNTNLMIELSADDIEYVYQRSSGKIISINI 120
Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
+FEAL+Q G A + T+N G +EASYSLTFDC +G+ +EEQYFI+KP E IR+F +
Sbjct: 121 SSFEALSQVGSARVKTKNIGRLEASYSLTFDCLSGINPVEEQYFIMKPDEKLIRTFDLRS 180
Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQP-PKSSINDFFESI 513
+T+QA+ YTC AILK SDFSE+DR E QFST ATVL+NG+QI + K I FFE+I
Sbjct: 181 STDQASNYTCQAILKASDFSELDRKESQFSTTATVLNNGTQIGSSENHTKGGIWGFFEAI 240
Query: 514 ESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLW 573
++ K+W L +F TG C +C SF F H ++ +LW
Sbjct: 241 KAWCAKMWHMLINFFTGTTCSTRCWSFLKFVIHGL------------------LLVAVLW 282
Query: 574 LLHQKGLFDPLYDWWDDHFQSDNQ 597
LLH+KGLFDPLY WWD S+ Q
Sbjct: 283 LLHRKGLFDPLYYWWDGVVGSEAQ 306
>gi|414591386|tpg|DAA41957.1| TPA: hypothetical protein ZEAMMB73_607847, partial [Zea mays]
Length = 224
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 145/222 (65%), Positives = 182/222 (81%), Gaps = 1/222 (0%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
MLLER RFT DG+ECNKIGV YEAF QP+FC+SPF SCL+NQLW + E+D+NRI+ ++
Sbjct: 1 MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59
Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
P Y V+GRF+R+NQHP+A HSFSIGVTEV+NSNL IEL ADDIEY+YQRSPG I + +
Sbjct: 60 PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISV 119
Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP 454
P FE L+Q+G A +TT+N G +EASY+LTF CS+G++ MEEQY+I+KP E S R F ++
Sbjct: 120 PAFEVLSQYGTAKVTTKNIGTLEASYTLTFHCSSGISFMEEQYYILKPNEESTRLFYLHA 179
Query: 455 TTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQI 496
+T+QAAKY C+AILK SD SE+DR EC FST ATVLDNG+Q+
Sbjct: 180 STDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDNGTQV 221
>gi|414591381|tpg|DAA41952.1| TPA: hypothetical protein ZEAMMB73_607847, partial [Zea mays]
Length = 283
Score = 295 bits (754), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 137/199 (68%), Positives = 167/199 (83%), Gaps = 2/199 (1%)
Query: 187 WFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIP 246
WFHVFGIG RS+GF++R++VK GS VSEV VGPEN+T S DNFL+VNLIGDF GYT+IP
Sbjct: 78 WFHVFGIGTRSLGFNIRVQVKKGSSVSEVVVGPENRTVVSKDNFLRVNLIGDFGGYTSIP 137
Query: 247 SFEEFYLVIPRQ-GGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSF 305
+FE+FYLV PR+ G G+PQ+LG + WMLLER RFT DG+ECNKIGV YEAF QP+F
Sbjct: 138 AFEDFYLVTPRKSAGSGEPQNLGAEYRKWMLLERVRFT-DGVECNKIGVGYEAFQNQPNF 196
Query: 306 CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVL 365
C+SPF SCL+NQLW + E+D+NRI+ ++ P Y V+GRF+R+NQHP+A HSFSIGVTEV+
Sbjct: 197 CASPFESCLNNQLWTFLESDKNRISMSRQPQYVVQGRFQRINQHPDASVHSFSIGVTEVI 256
Query: 366 NSNLLIELRADDIEYVYQR 384
NSNL IEL ADDIEY+YQR
Sbjct: 257 NSNLRIELSADDIEYMYQR 275
>gi|224074881|ref|XP_002304473.1| predicted protein [Populus trichocarpa]
gi|222841905|gb|EEE79452.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 126/280 (45%), Positives = 159/280 (56%), Gaps = 51/280 (18%)
Query: 421 SLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAE 480
SL FDCS GV +ME E +IRSFKIYP T++AA+Y C+AILKDS F+E D AE
Sbjct: 5 SLQFDCSKGVAVMELS-------EVTIRSFKIYPATDKAARYVCAAILKDSSFNETDPAE 57
Query: 481 CQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSF 540
CQ T AT+L+NG++ PF+PPK SIN FFESIE I ++WEGL ITGK ++
Sbjct: 58 CQLFTTATILENGARFAPFRPPKISINGFFESIEDIWNRIWEGLVASITGKVGSACAATA 117
Query: 541 FDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIR 600
F + F PLYDWW+DH D Q IR
Sbjct: 118 FA----------------------------------SERTFPPLYDWWEDHLWDDEQGIR 143
Query: 601 DFRSRRIDVDHPHVHVRKHHKQEGRHHKLEARRRRCGIHSDHKHKHSDRDTDYYYYLHHV 660
D + DV+ ++ G + AR+RR I+ +H+ HS RD DYY+YLHHV
Sbjct: 144 DTLRHKKDVNGD--------RELGPRQQHNARKRRS-IYQEHRPGHSGRDADYYHYLHHV 194
Query: 661 QKDKHKHGRSKNSSVMQQLYLDTGKNDHI-GHHRRRKFRE 699
QKDK KH SK S+V QQ+YLD +N +I GHHR RK R+
Sbjct: 195 QKDKSKHRGSKKSNVPQQVYLDGPENSNIGGHHRHRKERD 234
>gi|66819323|ref|XP_643321.1| hypothetical protein DDB_G0276069 [Dictyostelium discoideum AX4]
gi|60471374|gb|EAL69334.1| hypothetical protein DDB_G0276069 [Dictyostelium discoideum AX4]
Length = 572
Score = 196 bits (497), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 137/473 (28%), Positives = 229/473 (48%), Gaps = 37/473 (7%)
Query: 48 DSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASY 107
D NL C K+V+++ + S E + +V E+++ + K++T+ IP + K+ ++
Sbjct: 44 DKTNLKCDKKLVVSLYIDSQKENSE-TFNFQVSEIKDENG-KLKTLVIPISVKFKKSETF 101
Query: 108 AVYELTYIRDVPYKPQEF------YMKTRKCEP-----------DAGADVVKICERQPIC 150
Y L Y+++V Y+P+E Y+ T C+ DA +++ + Q C
Sbjct: 102 INYPLVYVQNVAYQPKETVIYKTDYVLTSGCKDKPTDHTCPGAIDANGKLIR--DSQGFC 159
Query: 151 CPCGPQRRIPS---SCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK 207
C C + + S N+ LL K+++AHCL F + V+ + + + +++ +
Sbjct: 160 CSCSFSDYVGADQNSRANLGCSLLGSKSSSAHCLSFSSVKYDVYNVAKTQVEYTITATLT 219
Query: 208 TGSKVSEVT--VGPENKTATSADNFLK--VNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQ 263
+ +T + N D F + V +IGDF T I F + +V P Q
Sbjct: 220 YSYNQNPITQDIILSNSAPMGMDTFSQAIVRIIGDFQSSTQINQFTDKKVVFPY----NQ 275
Query: 264 PQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYRE 323
P + + M+L++ F L GL CNKIGVSY AF QP+ C++ F SCL NQ+ +Y
Sbjct: 276 PNSI----NTCMVLDQNFFDLSGLTCNKIGVSYSAFQNQPNSCAALFGSCLQNQIADYYN 331
Query: 324 ADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQ 383
AD I+ + Y ++ N S S I E + L I L+AD ++Y+
Sbjct: 332 ADVALISSGKKGNYIASQLGTKVQIAGNQDSRSLKIRFDESHRTMLTITLKADSLQYIVN 391
Query: 384 RSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLT-FDCSTGVTLMEEQYFIIKP 442
SPGKII+ I FE++++ GV + QNTG + A Y++T +C+ + + Q IK
Sbjct: 392 ISPGKIINYQIDRFESMSKNGVLRVNVQNTGTINADYTMTIINCTGDINPINNQQVTIKS 451
Query: 443 KETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
KE F++Y T+ + Y C L + +D F+T T +DNG+Q
Sbjct: 452 KEIYSFVFQVYTTSKLDSSYHCFGDLYNEVAQVIDSIRINFNTSDTEIDNGAQ 504
>gi|147794121|emb|CAN62356.1| hypothetical protein VITISV_001267 [Vitis vinifera]
Length = 933
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 105/150 (70%), Positives = 122/150 (81%), Gaps = 10/150 (6%)
Query: 8 LKLKHFLLILF----------CILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTK 57
L KH L+I F IL + R + GVQILSKSKLEKCEK ++SDNLNCT K
Sbjct: 480 LDSKHALVIGFDGSELINXYPLILTGFNTRRLYGVQILSKSKLEKCEKVSESDNLNCTKK 539
Query: 58 IVLNMAVPSGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRD 117
I+L+MAVPSGSSGGEASIVAEVVEVEENST KM+T+R+PP +TVNK+A+YAVYE+TYIRD
Sbjct: 540 IILDMAVPSGSSGGEASIVAEVVEVEENSTHKMQTLRVPPTITVNKSAAYAVYEITYIRD 599
Query: 118 VPYKPQEFYMKTRKCEPDAGADVVKICERQ 147
VPYKPQE+++KTRKCEPDA A VVKICER
Sbjct: 600 VPYKPQEYFVKTRKCEPDASAKVVKICERH 629
Score = 176 bits (445), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 80/93 (86%), Positives = 85/93 (91%)
Query: 168 DKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSA 227
DKL+KGKANTAHCLRFPGDWFHVFGIGQRS+GFSV IEVKTGSK+SEV VGPEN+T S
Sbjct: 805 DKLMKGKANTAHCLRFPGDWFHVFGIGQRSLGFSVHIEVKTGSKISEVIVGPENRTVMSN 864
Query: 228 DNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGG 260
DNFLKVNLIGDF GYTNIPSFE+FYLV PRQ G
Sbjct: 865 DNFLKVNLIGDFAGYTNIPSFEDFYLVTPRQBG 897
>gi|384253026|gb|EIE26501.1| hypothetical protein COCSUDRAFT_39583 [Coccomyxa subellipsoidea
C-169]
Length = 1085
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 139/520 (26%), Positives = 237/520 (45%), Gaps = 61/520 (11%)
Query: 34 ILSKSKLEKCEKRTDSDNL--NCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQ--- 88
+LS S+L+ C + ++ L C+ K++L +AV +G+S S+ V + S+
Sbjct: 27 VLSSSQLQTCIQDGSAEALLLQCSKKLILTLAVENGASLATQSLQFSVPCINSGSSGCPC 86
Query: 89 ----------KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKT------RKC 132
R +R +T+ K A YA Y L Y + +P E ++T C
Sbjct: 87 TCNYATDPGCTCRDLRDTLNVTITKGAVYASYPLIYQQAFNNRPTEAIIRTGANFPISSC 146
Query: 133 E-------PDAG----ADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKGKAN----- 176
P G A+ +I Q CC C ++ G+ D+ + +
Sbjct: 147 NDGPLSDTPTCGWATDANGARIPASQGFCCSCTSSALAAATLGSGTDQYTRASLDCDLFH 206
Query: 177 -------TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTG--SKVSEVTVGPENKTATSA 227
+AHCLR ++ + + + F+++I +++ S +++ P +
Sbjct: 207 TWLRTPGSAHCLRMDDLYYQGYQVDPARLDFNIQISIQSANTSVTQTLSLNPTQPFVVND 266
Query: 228 DNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGL 287
N + L+GD Y ++P F +YL+IP Q L N WM+++++ + DG
Sbjct: 267 ANTVAAKLLGDLATYQSMPDFSSYYLMIPSPADSSPQQVLSSNTDKWMMVDKSMVSTDGT 326
Query: 288 ECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVE---GRFE 344
CNK+G SY AF Q C P +CL NQL++ +AD RI++ P+ V G
Sbjct: 327 TCNKVGTSYFAFQYQSGSCQQPQGTCLGNQLYDLYQADVKRISQGTTPVNFVSRWGGGQP 386
Query: 345 RMNQ--HPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII-------- 394
NQ + ++GS F++ +T +LNS + + + AD + + SPGKI+S +
Sbjct: 387 GANQASYSSSGSLRFALPITNILNSVVTLTVNADAVMLIDNVSPGKILSAQVCQFNNATC 446
Query: 395 PTFEALTQFGVATITTQNTGEVEASYSLT-FDCSTGVTLMEEQYFIIKPKETSIRSFKIY 453
+F+ALTQ G T T QN G + A++ ++ +C+ VT + Q + K T +F I+
Sbjct: 447 GSFQALTQRGYLTATVQNAGSIAATFIVSVVNCTASVTPIVAQSATLASKATKALTFDIF 506
Query: 454 PTTNQA-AKYTCSAILKDSDFSEVDRAECQFSTMATVLDN 492
T+N+A A TC L DS + + S A + N
Sbjct: 507 LTSNKADAAITCDVGLTDSQVNGAGAPQTGPSDCAALCPN 546
>gi|302776592|ref|XP_002971451.1| hypothetical protein SELMODRAFT_451371 [Selaginella moellendorffii]
gi|300160583|gb|EFJ27200.1| hypothetical protein SELMODRAFT_451371 [Selaginella moellendorffii]
Length = 565
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 156/588 (26%), Positives = 255/588 (43%), Gaps = 93/588 (15%)
Query: 30 VGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQK 89
+ + +SKS L+ C D + + C KI++ +A+PSG G +++AEV + ++
Sbjct: 20 INMTTISKSDLDVCVNTGDPNAIQCKKKILVTVAIPSGDGGNGEALIAEVKDPTSRDGKQ 79
Query: 90 MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRK-CEP------------DA 136
+ I + + KT S Y L Y+++V E +K +K C D+
Sbjct: 80 VLEKSIS--VNIAKTDSIVKYALEYLKNVAGDLNERVIKKKKGCNTKLNDKATCGVLGDS 137
Query: 137 GADVVKICERQPICCPCGPQRRI--------PSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
+VV CC C P ++I P CG + D A A CLRF W
Sbjct: 138 KGNVVP--GSSGFCCTCKPLKQIKHFRGMPKPGHCG-ISD------AGYAFCLRFGQMWC 188
Query: 189 HVFGIGQRSIGFSVRIEV--KTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIP 246
+F I +I F + I + + G+K S +G L++ L+ +
Sbjct: 189 VMFRIRTGTISFEITITLTDQNGNKASSRIIG----------FVLRLTLLAE----KPDG 234
Query: 247 SFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFC 306
++++F PR G +WML++ R TL G C+KIG+S + QP C
Sbjct: 235 NWQQFTRGSPRADG-----------RLWMLVDEARVTLTGSACDKIGLSCLGYAQQPRTC 283
Query: 307 SSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLN 366
C+ QL ++ + D + + +L ++G R + + + I V+ N
Sbjct: 284 DGALGMCIGEQLIDFIKEDLAALGKGRLAIHG----LFRYGSYRSLVPDALQIAVSPT-N 338
Query: 367 SNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDC 426
S + IE+ AD++ + +S GKI+ V +P FEA++ G T+T N G +EASY + +C
Sbjct: 339 SLITIEIAADNVSFRRNKSTGKIVKVEVPPFEAMSTGGTLTLTVVNDGSLEASYGVYVEC 398
Query: 427 STGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTM 486
S + +E + + P + + + A K +C L+DS+ D E +FST
Sbjct: 399 SANINPLEGKRVSMIPNVPQTFLYTLITRSTDATKNSCIVTLRDSEGENCDVKEAKFSTT 458
Query: 487 ATVLDNGSQITPFQPPKSSINDFF--------------ESIESIGKKLWE---------- 522
ATV +NGSQ+ Q S ND F I+ I K +
Sbjct: 459 ATVFNNGSQVGGVQIAGSK-NDTFAKGLGGLGFFGKIGAGIKGIAKGAFNVVTSPFRKMF 517
Query: 523 GLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLF--GLVLAIFPTV 568
GL + + GK C FD C I + C+ ++ F G+V A V
Sbjct: 518 GLFNNLLGKC--DNCPGAFDIGCFIAHFCVKKILFFVGGIVAAALGKV 563
>gi|224031573|gb|ACN34862.1| unknown [Zea mays]
Length = 297
Score = 178 bits (451), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 91/164 (55%), Positives = 116/164 (70%), Gaps = 4/164 (2%)
Query: 433 MEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDN 492
MEEQY+I+KP E S R F ++ +T+QAAKY C+AILK SD SE+DR EC FST ATVLDN
Sbjct: 1 MEEQYYILKPNEESTRLFYLHASTDQAAKYQCTAILKASDSSELDRQECVFSTTATVLDN 60
Query: 493 GSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQYIC 551
G+QI K FF++I+ W+ L D I+GK+CR KC SFFDFSCH QY C
Sbjct: 61 GTQIIGSNGYKLG---FFDTIKGYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQYRC 117
Query: 552 LSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSD 595
++WLV+ L+L + P ++L+LLHQKG FDP+YDWWDD +D
Sbjct: 118 ITWLVMLVLLLFMLPAGAIVLYLLHQKGFFDPVYDWWDDLLGAD 161
>gi|84453083|dbj|BAE71144.1| generative cell specific-1 [Physarum polycephalum]
Length = 808
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 151/605 (24%), Positives = 257/605 (42%), Gaps = 77/605 (12%)
Query: 16 ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDN--LNCTTKIVLNMAVPSGSSGGEA 73
+ CIL L + +++ S++ C S++ LNC K V++++V +G + EA
Sbjct: 4 VFLCILFLFYLFSTLHADLIASSQITNCVLDGSSEDTILNCQKKFVVSLSVDNGQNKTEA 63
Query: 74 SIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTR--- 130
+ N+T + P +T++K+ Y ++Y++ V P E + TR
Sbjct: 64 VQFTISSATDGNTTLQFVN---PWTITLSKSPVAIYYPISYLQTVNADPSEAVIYTRDWI 120
Query: 131 ---KCEPDAGADVV-----------KICERQPICCPCG---------PQRRIPSSCGNVF 167
C+ A +D I + Q CC C Q R +C
Sbjct: 121 VVSSCQSGAYSDNPTCGWYKDSNGNNIPDSQGFCCSCNLAEYLGISDDQTRAGLTC---- 176
Query: 168 DKLLKGKANTAHCLRFPGD-WFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATS 226
G +++AHCLRF + W+ +F I +++ I++ G + TV T S
Sbjct: 177 -SFFSGSSSSAHCLRFDDNGWYDIFQIANAQDMYTIDIDISQGGG-TNTTVTLSPSTTIS 234
Query: 227 ADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDG 286
+ + + L+GDF + +P + YL +P G P + + WM+++ F L G
Sbjct: 235 SSSSVIARLLGDFSPFQQLPVYSTKYLAVPSSGNPRETDGM----DTWMMIDTDLFDLSG 290
Query: 287 LECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD-----QNRINRNQLPLYGVEG 341
CNKIGVS+ FN + S C SCL Q+ +Y ++D NRI L +G G
Sbjct: 291 TVCNKIGVSFAGFNSEASHCKLLVNSCLGYQIEDYYQSDLQLQKANRIGNYFLSFFG--G 348
Query: 342 RFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALT 401
+ + + + +T + +S + + ADDI +V SPG+I+S + FEAL+
Sbjct: 349 LYYAETYTSSLTNRFLAFDLTGLQSSVITLTFSADDIRFVTNESPGQIVSAYVEEFEALS 408
Query: 402 QFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAA 460
+ G + N G + A Y +T CSTG+ ++ Q + P++ + F I
Sbjct: 409 KDGRMHVVVVNNGTINAQYEITVTQCSTGIATIQAQEPTLVPRKQTEFIFNIQSENALQK 468
Query: 461 KYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKL 520
Y C L DS +D F+T AT F++ G
Sbjct: 469 SYQCKVSLLDSQAVLLDYRIVYFNTSATN--------------------FQTTAQGGDTS 508
Query: 521 WEGLRDFITGK--ACRRKCSSFFDFSCHIQYICLSWLVLF---GLVLAIFPTVLVLLWLL 575
+ D + K +C + CS+F+D C + + C W +F G ++ I + +L L
Sbjct: 509 GDSGDDLKSDKHSSCSQACSAFYDIICFLSHKC--WKNVFSFLGTIIGIAAGLFILYKLK 566
Query: 576 HQKGL 580
G+
Sbjct: 567 QHFGM 571
>gi|224034879|gb|ACN36515.1| unknown [Zea mays]
Length = 366
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/167 (53%), Positives = 116/167 (69%), Gaps = 4/167 (2%)
Query: 430 VTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATV 489
+ +EQY+I+KP E S R F ++ +T+QAAKY C+AILK SD SE+DR EC FST ATV
Sbjct: 67 IAGFQEQYYILKPNEESTRLFYLHASTDQAAKYQCTAILKASDSSELDRQECVFSTTATV 126
Query: 490 LDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACR-RKCSSFFDFSCHIQ 548
LDNG+QI K FF++I+ W+ L D I+GK+CR KC SFFDFSCH Q
Sbjct: 127 LDNGTQIIGSNGYKLG---FFDTIKGYLVSFWDFLIDLISGKSCRLNKCRSFFDFSCHAQ 183
Query: 549 YICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSD 595
Y C++WLV+ L+L + P ++L+LLHQKG FDP+YDWWDD +D
Sbjct: 184 YRCITWLVMLVLLLFMLPAGAIVLYLLHQKGFFDPVYDWWDDLLGAD 230
>gi|118396406|ref|XP_001030543.1| hypothetical protein TTHERM_01075640 [Tetrahymena thermophila]
gi|89284850|gb|EAR82880.1| hypothetical protein TTHERM_01075640 [Tetrahymena thermophila
SB210]
Length = 715
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 160/603 (26%), Positives = 255/603 (42%), Gaps = 86/603 (14%)
Query: 8 LKLKHFLLILF--CILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVP 65
+K F LI F CILN RC + ++ S ++KC ++ N NC+ K V+ +++
Sbjct: 1 MKFLAFGLIYFHFCILN----RC----EYITSSTIQKCYNSSNEPN-NCSQKAVIVLSLE 51
Query: 66 SGSSGGEASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQE- 124
+G +VA + ++ ++ K ++ + V K+ A++ L Y++D +P E
Sbjct: 52 NGQIANTEQVVATLNQLSDSGVNKQ--LQNSFIFEVTKSPVTALFPLIYLQDFNSQPLEQ 109
Query: 125 ------------FYMKTRKCEPDAGADVVKICERQPICCPCGPQRRIPS----SCGNVFD 168
FY + C+ + KI + Q CC C + S G V
Sbjct: 110 VIATTLFSCKDGFYDSSPTCKFQYDSKGQKILDSQGYCCYCSLSDILGMGNDLSRGKVCY 169
Query: 169 KL-LKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSE------VTVGPEN 221
L L + TAHCL+F W+ F I Q + F V I + T ++ + + N
Sbjct: 170 ALNLGAGSATAHCLKFSPLWYSAFKIQQYQLYFEVNINIYTVDSQNQKNLKQTLKLSTSN 229
Query: 222 KTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTR 281
T S+DN +IG F +YLV P P+ L G S WM +++T
Sbjct: 230 PTMKSSDNSTISKIIGTFTPTQPPADLSSYYLVKPSFPAT-DPRVLQG-ISSWMFVDKTM 287
Query: 282 FTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEG 341
FTLDG +CNKIGVSY F Q S CS P SCL NQL N ++D L +
Sbjct: 288 FTLDGTQCNKIGVSYSGFRQQSSSCSQPVGSCLQNQLENLYQSD----------LILLSQ 337
Query: 342 RFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALT 401
R +GS S I IE+ A I++V G I I FE+ +
Sbjct: 338 RL--------SGSASTLI----------TIEIDAAQIKFVTNLGIGCISQCSINNFESHS 379
Query: 402 QFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAK 461
G QN G A + L F+CS+ V ++ Q K++ T NQ
Sbjct: 380 GNGKLVALVQNQGNYSAEFVLGFNCSSNVQPIQGQ--------------KLFLTANQLYN 425
Query: 462 YTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLW 521
+ CS + +SD S ++ C + + G+Q+ ++ + S +
Sbjct: 426 FNCSVSV-NSDISAINN-NCTINLYDAI---GNQLDSKNILFNTTSTNHTSNQGNNTGQQ 480
Query: 522 EGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLF 581
+ +++ + ++C KCSSF+ F C+ C+ +A + L L+ L + G
Sbjct: 481 QSSQEYKSSQSCSDKCSSFWSFWCYFSAGCIKEAFKSIASIAGVASALALVIFLAKNGYL 540
Query: 582 DPL 584
P+
Sbjct: 541 VPI 543
>gi|440798371|gb|ELR19439.1| hypothetical protein ACA1_266960 [Acanthamoeba castellanii str.
Neff]
Length = 927
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 148/576 (25%), Positives = 244/576 (42%), Gaps = 83/576 (14%)
Query: 25 SPRCVVGVQ--ILSKSKLEKC--EKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVV 80
+P+ +V ++ +L+ S++E+C + TD ++C ++++ + V SG + E + V+
Sbjct: 23 APQLLVSIEGSLLASSRVERCVQDGATDVPTISCDRRMIVTLTVDSGQNNTEQ--LELVL 80
Query: 81 EVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYM------------- 127
+ ++ +RT+ P + KT +Y +TY+ KP E
Sbjct: 81 DSTQDEDGVLRTLEHPVQIQWAKTIPRLLYPITYVGRTNNKPYETITYKDDILFLFDECN 140
Query: 128 -KTRKCEPDAG----ADVVKICERQPICCPC---------GPQRRIPSSCGNVFDKLLKG 173
P G AD + + Q CC C Q R +C ++F + G
Sbjct: 141 DSPSSSSPTCGWFYNADGTVVRDSQGFCCSCDLSEVLWLSNEQTRAGLTC-SLFAFGVDG 199
Query: 174 KANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK------------TGSKVSE-VTVGPE 220
++AHCLRF W+ VF IG + F V + VK TG V+E + + P
Sbjct: 200 --SSAHCLRFDQLWYDVFSIGAAQVSFEVVLSVKKYQTMTDMYGNTTGGYVTETLRLSPS 257
Query: 221 NKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERT 280
T T+A + L GDF +++ P E YL +P P+ + G WMLL+R+
Sbjct: 258 QTTGTAAGGDIFAKLQGDFAPWSDNPVLSEKYLFVPSSPST-HPRVVAGT-DYWMLLDRS 315
Query: 281 RFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVE 340
GL CNKIGVS+ AF Q C + +CL NQL +Y D R + QL Y V
Sbjct: 316 SADFSGLTCNKIGVSFSAFRYQGGACGNWLQACLGNQLDHYHREDLARWEQGQLGRYFVR 375
Query: 341 --GRFERMNQHPNAGSHSF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTF 397
G F + S + ++ + + L ADDI Y RSPG+I+ I F
Sbjct: 376 FWGDFVGNQAVVQTNDQRYLSFALDQIRATVTTLTLNADDIIYTINRSPGRIVVANITGF 435
Query: 398 EALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTT 456
E L G + N G ++A Y++T +C + ++ + I +++ +F +Y
Sbjct: 436 EGLATQGELDVVVMNNGTIQADYTITVTECGDRIQAVQAKMRSISAYQSANLTFALY--- 492
Query: 457 NQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESI 516
+ Y ++ DS + V + + +GS
Sbjct: 493 --MSLYDSLGVIVDSVWVNVTVFATNITCLGGQCSDGS--------------------GG 530
Query: 517 GKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICL 552
GK G + IT +C C+S FD +C++ C+
Sbjct: 531 GKPADGGYKYAIT--SC-SACNSIFDIACYVDNSCM 563
>gi|159475573|ref|XP_001695893.1| gamete-specific protein [Chlamydomonas reinhardtii]
gi|158275453|gb|EDP01230.1| gamete-specific protein [Chlamydomonas reinhardtii]
Length = 813
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 158/641 (24%), Positives = 254/641 (39%), Gaps = 134/641 (20%)
Query: 32 VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE-------- 83
++++ +LEKC ++ L+C K+V+ + V +G S + E +E
Sbjct: 22 AEVIASGRLEKCVVDGVTEELDCQEKVVVTLTVGNGQS-----LQTEALEFSLSCLNSPD 76
Query: 84 ---------ENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKC 132
+ T R + P +++ K+ +A Y L Y+ +KP E ++ + C
Sbjct: 77 GRCPCSCSAADPTCACRDLAAPLRVSLTKSPLWASYPLQYLSSFNWKPLEVILRPSNKVC 136
Query: 133 E-------PDAG---ADVVKICERQPICCPCGPQRRIPSSCG----------------NV 166
+ P G V++ + Q CC C + + G +
Sbjct: 137 KDGDWEDSPTCGWFSQGGVRVADSQGFCCECSSSQVWDDTFGSSKERTRANLDCDFWSDP 196
Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGF--SVRIEVKTGSKVSEVT-------- 216
D L+ K +AHCL F W+ + +G S+ F ++ +EV T + T
Sbjct: 197 LDILIGRKPVSAHCLTFDPQWYSGYELGAASLQFEIAITVEVPTAPSPTPATTSATPRTT 256
Query: 217 -------------------------------VGPENKTATSADNFLKVNLIGDFVGYTNI 245
+GP A+SA L L+GD YT +
Sbjct: 257 NNSSANSTNSTNSPAPQFLSPPAPSTREVLHLGPSVPLASSASRLLSAKLLGDLAMYTQL 316
Query: 246 PSFEEFYLVIPRQGG----PGQPQD--LGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
P+ L++P+ G P D L N S WMLL++T ++DGL C+K+G + AF
Sbjct: 317 PAISNQVLMVPQPPAAAAATGSPLDATLATNRSAWMLLDKTMLSMDGLACDKVGTGFSAF 376
Query: 300 NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGV---EGRFERMNQHPNAGSHS 356
QPS C +CL QL + EAD RI ++PLY + G + Q + G S
Sbjct: 377 RYQPSGCGRAPQTCLSGQLKDLWEADLARIADGRVPLYMITRFTGGSDTTLQSFSGGPLS 436
Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--------PTFEALTQFGVATI 408
F++ VT S + + + AD + V RSPGKI + FEA+ G +
Sbjct: 437 FALPVTSQSQSLVTLSVAADGVRLVTNRSPGKITGAAVCRFAGTSCGGFEAVAARGYIYV 496
Query: 409 TTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSI--RSFKIYPTTNQAAKY-TC 464
NTG +++ Y+LT +CS+ V +E + ++ + ++Y AA TC
Sbjct: 497 NITNTGRLDSDYTLTVSNCSSNVRPIEARALAVRAGSAASLDPPMELYVEDQAAAAARTC 556
Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
+ L DS + D F T AT L P N + + +G K
Sbjct: 557 TVSLYDSVGAVTDSLTLSFYTNATQL--------VVKPSGGYNG---TGDGVGVKR---- 601
Query: 525 RDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
G C C++ D C + C S FG +L I
Sbjct: 602 ----NGTDCSTACTNPIDVLCFVTKKCWS---KFGRLLGII 635
>gi|414591384|tpg|DAA41955.1| TPA: hypothetical protein ZEAMMB73_607847, partial [Zea mays]
Length = 124
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 78/110 (70%), Positives = 94/110 (85%), Gaps = 1/110 (0%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
MLLER RFT DG+ECNKIGV YEAF QP+FC+SPF SCL+NQLW + E+D+NRI+ ++
Sbjct: 1 MLLERVRFT-DGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 59
Query: 335 PLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQR 384
P Y V+GRF+R+NQHP+A HSFSIGVTEV+NSNL IEL ADDIEY+YQR
Sbjct: 60 PQYVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQR 109
>gi|288563868|gb|ABO29824.2| fusion protein HAP2/GCS1 [Chlamydomonas reinhardtii]
Length = 1139
Score = 165 bits (418), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 158/641 (24%), Positives = 251/641 (39%), Gaps = 134/641 (20%)
Query: 32 VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE-------- 83
++++ +LEKC ++ L+C K+V+ + V +G S + E +E
Sbjct: 22 AEVIASGRLEKCVVDGVTEELDCQEKVVVTLTVGNGQS-----LQTEALEFSLSCLNSPD 76
Query: 84 ---------ENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKC 132
+ T R + P +++ K+ +A Y L Y+ +KP E ++ + C
Sbjct: 77 GRCPCSCSAADPTCACRDLAAPLRVSLTKSPLWASYPLQYLSSFNWKPLEVILRPSNKVC 136
Query: 133 E-------PDAG---ADVVKICERQPICCPCGPQRRIPSSCG----------------NV 166
+ P G V++ + Q CC C + + G +
Sbjct: 137 KDGDWEDSPTCGWFSQGGVRVADSQGFCCECSSSQVWDDTFGSSKERTRANLDCDFWSDP 196
Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT------------------ 208
D L+ K +AHCL F W+ + +G S+ F + I V+
Sbjct: 197 LDILIGRKPVSAHCLTFDPQWYSGYELGAASLQFEIAITVEVPTAPSPTTATTSATPRTN 256
Query: 209 ----------------------GSKVSEVT-VGPENKTATSADNFLKVNLIGDFVGYTNI 245
EV +GP A+SA L L+GD YT +
Sbjct: 257 NSSSANSTNSTNSPAPQFLSPPAPSTREVLHLGPSVPLASSASRLLSAKLLGDLAMYTQL 316
Query: 246 PSFEEFYLVIPRQGG----PGQPQD--LGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
P+ L++P+ G P D L N S WMLL++T ++DGL C+K+G + AF
Sbjct: 317 PAISNQVLMVPQPPAAAAATGSPLDATLATNRSAWMLLDKTMLSMDGLACDKVGTGFSAF 376
Query: 300 NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGV---EGRFERMNQHPNAGSHS 356
QPS C +CL QL + EAD RI ++PLY + G + Q + G S
Sbjct: 377 RYQPSGCGRAPQACLSGQLKDLWEADLARIADGRVPLYMITRFTGGSDTTLQSFSGGPLS 436
Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--------PTFEALTQFGVATI 408
F++ VT S + + + AD + V RSPGKI + FEA+ G +
Sbjct: 437 FALPVTSHSQSLVTLSVAADGVRLVTNRSPGKITGAAVCRFAGTSCGGFEAVAARGYIYV 496
Query: 409 TTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSI--RSFKIYPTTNQAAKY-TC 464
NTG +++ Y+LT +CS+ V +E + ++ + ++Y AA TC
Sbjct: 497 NITNTGRLDSDYTLTVSNCSSNVRPIEARTLAVRAGSAASLDPPMELYVEDQAAAAARTC 556
Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
+ L DS + D F T AT L P N G G+
Sbjct: 557 TVSLYDSVGAVTDSLTLSFYTNATQL--------VVKPSGGYN---------GTGDGAGV 599
Query: 525 RDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
+ G C C++ D C + C S FG +L I
Sbjct: 600 KR--NGTDCSTACTNPIDVLCFVTKKCWS---KFGRLLGII 635
>gi|261333213|emb|CBH16208.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 618
Score = 165 bits (418), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 136/524 (25%), Positives = 230/524 (43%), Gaps = 54/524 (10%)
Query: 13 FLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGE 72
F+L++ + L PR +++ S +E CE+ + + C K+V+ ++V G G
Sbjct: 10 FVLVVLLPTSGLFPR--TEAALVASSSIEYCERSSKLEPFPCEKKMVVTLSVGGGQKAGV 67
Query: 73 ASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAV-YELTYIRDVPYKPQEFYMKTRK 131
+V V++ +K + V PV V + Y + YIR+ KP E ++T
Sbjct: 68 EEVVLLREAVDKTGDEKGKRVEFEPVRMVTTESPVRYRYPIYYIRNFNAKPYEQRLRTSA 127
Query: 132 ---CEP-------------DAGADVVKICERQPICCPCG---------PQRRIPSSCGNV 166
C+ D DV+ Q CC CG P R +C
Sbjct: 128 SSWCDDSSNPGSATCGVARDRRGDVIPY--SQGFCCLCGACALSGICNPTSRSVGTCS-- 183
Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK----------TGSKVSEVT 216
+ G A CLRF W+ + IG+ + + +++++ TGSK ++
Sbjct: 184 ----VTGDTGMASCLRFSDLWYGGYTIGRGVVWYELQVKLSSGNNSTGGGSTGSKEFTMS 239
Query: 217 VGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWML 276
+GP+ TATS + LIGDF L IP + P + +G ++ W++
Sbjct: 240 LGPDKLTATSTEFGASARLIGDFAPPEMPLDLSGKMLFIPSE--PRGHERVGAGYNEWII 297
Query: 277 LERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPL 336
++ ++ G ECNK+GVSYE F Q S C + +CL NQL +YR+ D + +
Sbjct: 298 VDTHLVSIRGTECNKVGVSYEGFATQGSRCDAYPGACLANQLEDYRDRDLEAETKGERGK 357
Query: 337 YGVEGRFERMNQHP--NAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
Y + F + P NA + + S + L++ + I + AD + +V S G I+ +
Sbjct: 358 Y-MARFFAPLGFDPLANASAPAVSYQASGTLSTIVTITISADKLNFVLSVSSGVIVGATV 416
Query: 395 P--TFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFK 451
+ ++ T+T NTG++EA Y++ +C+ V M Q I PK ++ R F
Sbjct: 417 SGKVVHSYSRGSTITVTVLNTGDIEAQYTVVVGECTVNVQPMVAQTVYIPPKGSAQRRFT 476
Query: 452 IYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
+ + + C+A L+++ VD F A NGSQ
Sbjct: 477 LIVQDSIEGEAKCNATLRNARGDVVDTRAISFGVKALKPSNGSQ 520
>gi|145046216|dbj|BAE71145.2| generative cell specific-1 [Chlamydomonas reinhardtii]
Length = 748
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 143/566 (25%), Positives = 230/566 (40%), Gaps = 112/566 (19%)
Query: 32 VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVE-------- 83
++++ +LEKC ++ L+C K+V+ + V +G S + E +E
Sbjct: 22 AEVIASGRLEKCVVDGVTEELDCQEKVVVTLTVGNGQS-----LQTEALEFSLSCLNSPD 76
Query: 84 ---------ENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKC 132
+ T R + P +++ K+ +A Y L Y+ +KP E ++ + C
Sbjct: 77 GRCPCSCSAADPTCACRDLAAPLRVSLTKSPLWASYPLQYLSSFNWKPLEVILRPSNKVC 136
Query: 133 E-------PDAG---ADVVKICERQPICCPCGPQRRIPSSCG----------------NV 166
+ P G V++ + Q CC C + + G +
Sbjct: 137 KDGDWEDSPTCGWFSQGGVRVADSQGFCCECSSSQVWDDTFGSSKERTRANLDCDFWSDP 196
Query: 167 FDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT------------------ 208
D L+ K +AHCL F W+ + +G S+ F + I V+
Sbjct: 197 LDILIGRKPVSAHCLTFDPQWYSGYELGAASLQFEIAITVEVPTAPSPTTATTSATPRTN 256
Query: 209 ----------------------GSKVSEVT-VGPENKTATSADNFLKVNLIGDFVGYTNI 245
EV +GP A+SA L L+GD YT +
Sbjct: 257 NSSSANSTNSTNSPAPQFLSPPAPSTREVLHLGPSVPLASSASRLLSAKLLGDLAMYTQL 316
Query: 246 PSFEEFYLVIPRQGG----PGQPQD--LGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
P+ L++P+ G P D L N S WMLL++T ++DGL C+K+G + AF
Sbjct: 317 PAISNQVLMVPQPPAAAAATGSPLDATLATNRSAWMLLDKTMLSMDGLACDKVGTGFSAF 376
Query: 300 NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGV---EGRFERMNQHPNAGSHS 356
QPS C +CL QL + EAD RI ++PLY + G + Q + G S
Sbjct: 377 RYQPSGCGRAPQACLSGQLKDLWEADLARIADGRVPLYMITRFTGGSDTTLQSFSGGPLS 436
Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--------PTFEALTQFGVATI 408
F++ VT S + + + AD + V RSPGKI + FEA+ G +
Sbjct: 437 FALPVTSHSQSLVTLSVAADGVRLVTNRSPGKITGAAVCRFAGTSCGGFEAVAARGYIYV 496
Query: 409 TTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRS--FKIYPTTNQAAKY-TC 464
NTG +++ Y+LT +CS+ V +E + ++ + ++Y AA TC
Sbjct: 497 NITNTGRLDSDYTLTVSNCSSNVRPIEARTLAVRAGSAASLDPPMELYVEDQAAAAARTC 556
Query: 465 SAILKDSDFSEVDRAECQFSTMATVL 490
+ L DS + D F T AT L
Sbjct: 557 TVSLYDSVGAVTDSLTLSFYTNATQL 582
>gi|330819085|ref|XP_003291595.1| hypothetical protein DICPUDRAFT_156210 [Dictyostelium purpureum]
gi|325078197|gb|EGC31861.1| hypothetical protein DICPUDRAFT_156210 [Dictyostelium purpureum]
Length = 651
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 141/503 (28%), Positives = 229/503 (45%), Gaps = 61/503 (12%)
Query: 76 VAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEF------YMKT 129
++ VV+++ K +T+ P V+ +K+ ++ VY L Y++ V +KP E Y+
Sbjct: 11 ISNVVDID----GKNKTLLEPIVVRFSKSETFVVYPLEYLQTVAFKPVEKVIYKTDYLIG 66
Query: 130 RKC-----EPDAGADVVKIC-----ERQPICCPCGPQRRIPS---SCGNVFDKLLKGKAN 176
C + G V + + Q CC C + S GN+ L K++
Sbjct: 67 TGCKDLPTDSTCGYAVNSVTGEAIRDSQGFCCSCSMSDYFGADQNSRGNLGCSLFGSKSS 126
Query: 177 TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVT--VGPENKTATSADNFLKVN 234
+AHCL F + VF I + + + + V++ + V N T + + +
Sbjct: 127 SAHCLSFSELKYDVFDISETRVQYQINATVQSFYNQLPIVDVVKLSNDVTTGKTSQVIIR 186
Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
++GD T I + +V PR P + +LL+ F L G CNKIGV
Sbjct: 187 IVGDLSTSTQIKQYPNKKIVFPR--ASSDPISSLPIINTSLLLDDDFFDLSGAGCNKIGV 244
Query: 295 SYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF---------ER 345
Y AF Q + C++ F SCL NQ+ +Y D IN G +GR+ +
Sbjct: 245 GYSAFQNQANRCAAVFQSCLQNQISDYYANDLKLIND------GKKGRYIISQLGTSVKV 298
Query: 346 MNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGV 405
++ N S SF++ E+ + L + L AD +++V SP KIIS I TFE+++ GV
Sbjct: 299 ISSAANKNSRSFAVRFDEIQRTILTLTLSADSLQFVVNISPAKIISYNIETFESMSNNGV 358
Query: 406 ATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTC 464
I+ QNTG + A Y L +CS + M Q I+PKE + SF+IY TT + Y C
Sbjct: 359 LKISVQNTGALNADYLLQVHNCSGDIIQMPNQIATIQPKEIYVFSFQIYTTTMLQSYYYC 418
Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGL 524
A L + + + F+T T+++ G+Q + P S +S+ +IG +L
Sbjct: 419 FADLVNEQSTLLQSIRINFNTSKTIIEQGAQ-SGDNPNNQS-----DSL-NIGYEL---- 467
Query: 525 RDFITGKACRRKCSSFFDFSCHI 547
C C +FF+ C++
Sbjct: 468 -------TCDLVCPNFFNIICYL 483
>gi|307111056|gb|EFN59291.1| hypothetical protein CHLNCDRAFT_137637 [Chlorella variabilis]
Length = 1084
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 140/486 (28%), Positives = 217/486 (44%), Gaps = 64/486 (13%)
Query: 135 DAGADVVKICERQPICCPCGPQRRIPS-----SCGNVFDKLLKGKANTAHCLRFPGDWFH 189
D G DV + Q CC CG S GN+ ++AHCLRF W+H
Sbjct: 137 DGGQDVA---DSQGFCCDCGSLINFGGDDGQLSRGNLDCGGFIQTQDSAHCLRFDNTWWH 193
Query: 190 V-FGIGQRSIGFSVRIEVKTGSKVSE----------VTVGPENKTATSADNFLKVNLIGD 238
+ IG+ S+ F++ + + T + + VT+ P + L L+GD
Sbjct: 194 AGYVIGEYSLDFTINLNITTVTTNATTNATAAASELVTLTPSAPFRRDSSRRLSAKLLGD 253
Query: 239 FVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEA 298
Y P + +L+IP + G G + +F+ WM+++ T GLEC+KIGVSY
Sbjct: 254 LESYQQAPQLDGKWLLIPTKPGEGPQEWYTRHFNEWMVVDGNLVTTTGLECDKIGVSYSG 313
Query: 299 F-NGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMN---------- 347
F N QP+ C++P SCL NQ+ + AD NRIN PLY V GR+
Sbjct: 314 FRNSQPNKCTTPQGSCLRNQIVDLYAADLNRINTGVDPLYFV-GRYGGGTLNSDQLTGEL 372
Query: 348 QHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPT--------FEA 399
Q + + ++ ++ + S L + + ADD+ +V RSP +I SV + T FEA
Sbjct: 373 QEDGSFKLALNLPISAIKVSLLTLMVAADDVAFVVNRSPAQITSVQVCTYDGIICGGFEA 432
Query: 400 LTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQ 458
+T G +T +N+G V + Y++ +C+TGV + Q I P+ +++ F++ ++
Sbjct: 433 MTARGYLRVTVRNSGYVASDYTVQVTNCTTGVRNVLAQRAGIAPQSSTVFQFELQMESDA 492
Query: 459 AAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGK 518
A++ +C + DS V F T AT QPP+ S IG
Sbjct: 493 ASESSCMVSVVDSLGDTVATMGISFYTDATDYT--------QPPEQS---------DIGD 535
Query: 519 KLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPT----VLVLLWL 574
++ D T C++ C DF C I C L GL + P L ++W
Sbjct: 536 QVTGPNED--TPDWCQQVCPRLTDFKCAINKGCYGRLAK-GLSAIVAPVAGLGALFMIWK 592
Query: 575 LHQKGL 580
GL
Sbjct: 593 TGHLGL 598
>gi|71652476|ref|XP_814894.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70879906|gb|EAN93043.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 588
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 140/516 (27%), Positives = 237/516 (45%), Gaps = 31/516 (6%)
Query: 7 SLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPS 66
SL F L+LF ++ +P G+ +L+ S +E+C++ ++L C K+V+ ++V S
Sbjct: 4 SLSRMLFSLLLFALMVATTPFAAEGL-LLASSSIEQCDRVGTDNSLPCEKKLVVTLSVDS 62
Query: 67 GSSGGEASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEF 125
+ V V++ V P+ LT +K+ Y L Y R+ KP E
Sbjct: 63 DQAEDVEEFVILRDAVDKTKGTGEEHVEFQPIRLTTSKSRVQYSYPLFYERNFNAKPYEE 122
Query: 126 YMKTR--KCE----PDAGADVVKICERQPI------CCPCGPQRRIP----SSCGNVFDK 169
+ T C+ P A + +PI CC CGP + + S G
Sbjct: 123 EITTELVGCDDTFSPKATCGLAMDTAGRPIPYSQGFCCRCGPCQLLGLCPVGSRGLQVCD 182
Query: 170 LLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVT------VGPENKT 223
+ +G A A CLRF W+ + +G +I + + +++ T S+ + T +GP+ +
Sbjct: 183 IFRGAA-LASCLRFGELWYSGYSMGSATIWYRLSVKLTTDSQNNSKTKEAVFELGPDVLS 241
Query: 224 ATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFT 283
+SA+ V+LIGDFV L IP P + + W++L++ +
Sbjct: 242 GSSAEFGAWVSLIGDFVPAELPLVLSNKMLFIPSS--PRIHERVLAGQKEWLILDKHHVS 299
Query: 284 LDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
+ G +CNK+GVSYEAF+GQ S C SCL +QL +YR +D R Y
Sbjct: 300 MQGRDCNKVGVSYEAFSGQGSRCQLIRGSCLADQLEDYRSSDLAVEARGGRGKYLARFFG 359
Query: 344 ERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALT 401
+ + + N S + L + L + + AD ++Y+ SPG+I+S ++ T E +
Sbjct: 360 DFVVNNVNNSRTRLSYWMRGSLATMLTVVISADRLQYLVSVSPGEIVSAVMSKSTVEESS 419
Query: 402 QFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKI-YPTTNQA 459
+ G ++ +N G V A Y+L +CS V + Q ++P+ T IRSF + +
Sbjct: 420 RDGSVSVIVRNIGHVTAQYTLGVGNCSGNVFPIMAQTLSLRPRGTVIRSFDLNIQDVAEE 479
Query: 460 AKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
C L+D+ + D+ +F + VL N +Q
Sbjct: 480 RIVQCDVTLRDAKGAITDKKILKFRVTSKVLTNDTQ 515
>gi|407849348|gb|EKG04115.1| hypothetical protein TCSYLVIO_004826 [Trypanosoma cruzi]
Length = 588
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 138/516 (26%), Positives = 230/516 (44%), Gaps = 31/516 (6%)
Query: 7 SLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPS 66
SL F L+LF ++ +P G+ +L+ S +E+C++ ++L C K+V+ ++V S
Sbjct: 4 SLSRMLFSLLLFALMVATTPFAAEGL-LLASSSIEQCDRVGTDNSLPCDKKLVVTLSVDS 62
Query: 67 GSSGGEASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEF 125
+ V V++ V P+ LT +K+ Y L Y R+ KP E
Sbjct: 63 DQAEDVEEFVILRDAVDKTKGTGEERVEFQPIRLTTSKSRVQYTYPLFYERNFNAKPYEE 122
Query: 126 YMKTRKCEPDAGADVVKICE------------RQPICCPCGPQRRIP----SSCGNVFDK 169
+ T D C Q CC CGP + + S G
Sbjct: 123 EITTELVGCDDTFSSKATCGLATDTAGRPIPYSQGFCCRCGPCQLLGLCPVGSRGLQVCD 182
Query: 170 LLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT----GSKVSEVT--VGPENKT 223
+ +G A A CLRF W+ + +G +I + + +++ T SK E +GP+ +
Sbjct: 183 IFRGAA-LASCLRFGELWYSGYSMGSATIWYRLSVKLTTDSQNNSKAKEAVFELGPDVLS 241
Query: 224 ATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFT 283
+SA+ V+LIGDFV L IP P + + W++L++ +
Sbjct: 242 GSSAEFGAWVSLIGDFVPAELPLVLSNKMLFIPSS--PRIHERVLAGQKEWLILDKHHVS 299
Query: 284 LDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
+ G +CNK+GVSYEAF+GQ S C SCL +QL +YR +D R Y
Sbjct: 300 MQGRDCNKVGVSYEAFSGQGSRCQLIRGSCLADQLEDYRSSDLAVEARGGRGKYLARSFG 359
Query: 344 ERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALT 401
+ + N S + L + L + + AD ++Y+ S G+I+S ++ T E +
Sbjct: 360 DFVVNSVNNSRTRLSYWMRGSLATMLTVVISADRLQYLVSVSQGEIVSAVMSKSTIEESS 419
Query: 402 QFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKI-YPTTNQA 459
+ G ++ +N G V A Y+L +CS V + Q ++P+ET +RSF + +
Sbjct: 420 RDGSVSVIVRNIGHVTAKYTLGVGNCSGNVFPIMAQTLSLRPRETVVRSFDLNIQDVTEE 479
Query: 460 AKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
C L+D+ + D+ +F + VL N +Q
Sbjct: 480 RIVQCDVTLRDAKGAITDKKVLKFRVTSKVLTNDTQ 515
>gi|156370880|ref|XP_001628495.1| predicted protein [Nematostella vectensis]
gi|156215473|gb|EDO36432.1| predicted protein [Nematostella vectensis]
Length = 853
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 142/532 (26%), Positives = 226/532 (42%), Gaps = 65/532 (12%)
Query: 16 ILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDN-------LNCTTKIVLNMAVPSGS 68
I+ ++ LL +++KS L+ CE +SD+ C K+++ ++V SG
Sbjct: 6 IIMILVGLLCLANESYSDVIAKSSLQMCENTGNSDDPYNVVDQKACEKKLIVTLSVRSGQ 65
Query: 69 SGGE-ASIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYM 127
+G E V V +V + + ++M + P ++T+ KT Y Y+ V KP E +
Sbjct: 66 NGTEFLKAVTNVSKVYDQTEKEMARLYNPFIITLAKTPVKLTYPYYYLAMVNNKPTERVV 125
Query: 128 KT----------RKCEPDAGADVVKIC------ERQPI------CCPCGPQRRI------ 159
+ C DA D +C E +PI CC C Q +
Sbjct: 126 ISDSKWHASGSYHACS-DAWDDEDALCGFYTDAEGKPIWDSQGFCCRCTEQEKWRGSFND 184
Query: 160 --PSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV------KTGSK 211
P S + KL G AHC+ F W+ V +G + FS+ ++ K G+K
Sbjct: 185 KNPYSRAGINCKLF-GTQAAAHCMTFDDLWYTVNEVGLWQMDFSIHVKAYDLVVEKVGNK 243
Query: 212 V-------SEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP----RQGG 260
E+ +GP ++ L IG+F + P YL+IP +
Sbjct: 244 TQSKWVDGGEIVIGPTIRSGVGVHGRLHATFIGEFQSHKQFPVLTTKYLLIPYVSEKVDP 303
Query: 261 PGQPQDLGGNFSMWMLLERTRFTLDGL---ECNKIGVSYEAFNGQ-PSFCSSPFWSCLHN 316
PQ G +ML+++ EC+KIGVS+ AF Q P CS CLHN
Sbjct: 304 KTHPQFRNGPHD-YMLIDKHEVNYKSSGPHECDKIGVSFSAFRAQAPMGCSQKQGDCLHN 362
Query: 317 QLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
Q +Y E D R + P Y + G+ +NQ + + V EV+ S + +++
Sbjct: 363 QPKDYFEEDTKRRASGKTPYYFPQKFGKLLGVNQRKDNNHFVLTYEVDEVMTSMVTLQIS 422
Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEAS-YSLTFDCSTGVTLM 433
ADD+ +Y R+ GKI+ FEAL++ G + QN G V A Y + +CS G+ +
Sbjct: 423 ADDVILIYNRAEGKILRAYAQDFEALSRDGNLYVIVQNIGLVTADFYVVIKECSVGIGKL 482
Query: 434 EEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFST 485
E+ I P++T +F + + C L D+ VD + F T
Sbjct: 483 LEKAASINPQQTHSFTFSVKAQQWKGGDNFCIVQLYDARRKMVDSSNVTFRT 534
>gi|71748482|ref|XP_823296.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832964|gb|EAN78468.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 618
Score = 158 bits (400), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 149/588 (25%), Positives = 255/588 (43%), Gaps = 91/588 (15%)
Query: 35 LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVV-----EVEENSTQK 89
++ S +E CE+ ++ + C K+V+ ++V G E +I AE V V++ +K
Sbjct: 30 VASSSIEYCERSSNGEPFPCEKKMVVGLSV-----GSEQTIEAEEVVLLREAVDKTGDEK 84
Query: 90 MRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKTRK---CEP----------- 134
+ V P+ L K+ Y + YIR+ KP E ++T C+
Sbjct: 85 GKRVEFEPIRLVTTKSPVQYRYPIYYIRNFNAKPYEQRLRTSASSWCDDSSNPGSATCGV 144
Query: 135 --DAGADVVKICERQPICCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRF 183
D DV+ Q CC CG P R +C + G A CLRF
Sbjct: 145 ARDRRGDVIPY--SQGFCCLCGACALSGICNPTSRSVGTCS------VTGDTGMASCLRF 196
Query: 184 PGDWFHVFGIGQRSIGFSVRIEVK----------TGSKVSEVTVGPENKTATSADNFLKV 233
W+ + IG+ + + +++++ TGSK +++GP+ TATS +
Sbjct: 197 SDLWYGGYTIGRGVVWYELQVKLSSGNNSTGGGSTGSKEFTMSLGPDKLTATSTEFGASA 256
Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
LIGDF L IP + P + +G ++ W++++ ++ G ECNK+G
Sbjct: 257 RLIGDFAPPEMPLDLSGKMLFIPSE--PRGHERVGAGYNEWIIVDTHLVSIRGTECNKVG 314
Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHP--N 351
VSYE F Q S C + +CL NQL +YR+ D + Q Y + F P N
Sbjct: 315 VSYEGFATQGSRCDAYPGACLANQLEDYRDRDLEAETKGQQGKY-MARFFAPFGFDPLAN 373
Query: 352 AGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIP--TFEALTQFGVATIT 409
A + + + VT L++ + I + AD + +V S G I+ + + ++ T+T
Sbjct: 374 ASAPAVAYQVTGTLSTMVTITISADKLNFVLSVSSGVIVGATVSGKVVHSYSRGSTITVT 433
Query: 410 TQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAIL 468
NTG++EA Y++ +C+ V M Q I + ++ R F + + + C+A L
Sbjct: 434 VLNTGDIEAQYTVVVGECTVNVQPMVAQTVYIPLQGSAQRRFTLIVQDSIEGEAKCNATL 493
Query: 469 KDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLR--D 526
+++ VD F A NGSQ G +E R +
Sbjct: 494 RNARGDVVDTRAISFGVKALKPSNGSQ---------------------GGSTFENGRYSE 532
Query: 527 FITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWL 574
G++ ++C S+F+ C +++ C W L + + P+V +L+ L
Sbjct: 533 EAKGESQCQQC-SWFNLLCFLRHRCW-WQPL----VYVLPSVTLLMLL 574
>gi|340508314|gb|EGR34043.1| hypothetical protein IMG5_026080 [Ichthyophthirius multifiliis]
Length = 525
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 150/528 (28%), Positives = 220/528 (41%), Gaps = 79/528 (14%)
Query: 99 LTVNKTASYAVYELTYIRDVPYKPQE-------------FYMKTRKCEPDAGADVVKICE 145
+ V K+ AVY L Y+RD PQE F + C KI +
Sbjct: 14 IEVTKSPVVAVYPLKYMRDYESMPQEKVISKSVFTCQDGFNEDSPTCGFQRDEKGEKIFD 73
Query: 146 RQPICCPCGPQ------RRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIG 199
Q CC CG + + + L G A +AHCLRFPG W+ + I Q I
Sbjct: 74 SQGFCCKCGAADFFGLGKEVMRGVDCLPFNLNSGSA-SAHCLRFPGRWYSGYEILQYYIY 132
Query: 200 FSVRIEV--------KTGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEF 251
+ +++EV K ++T ++ S DN V +IGDF P +
Sbjct: 133 YEIKVEVYELEGNNNKKRKLKYKLTTSTTDRIKKSPDNKFLVKIIGDFFPTQPPPVYNNV 192
Query: 252 YLVIPRQGGPGQPQDLG----GNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCS 307
YLV P P +L S WML+E+ +FTLDG ECNKIGVSY AF + CS
Sbjct: 193 YLVRPTPNRPQANNELRVRVLEGISNWMLIEKNQFTLDGTECNKIGVSYAAFRRENGSCS 252
Query: 308 SPFWSCLHNQLWNYREADQNRINRNQLP--LYGVEGRF-ERMNQHPNAGSHSFSIGVTEV 364
SCL NQ+ ++ D RI + Q L +G F E ++ N F G
Sbjct: 253 KQIGSCLKNQIEHFYLRDIERIKKGQPTQNLLLPKGDFQESWDKQNNTQMILFIEGSMST 312
Query: 365 LNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF 424
L + IE+ + +I+++ GK I V I FE+ + G N A ++L F
Sbjct: 313 L---ITIEMDSAEIQFLTMLGQGKFILVKINNFESHSGSGKFEAHILNKSSFAAEFNLGF 369
Query: 425 DCSTGVT--------LMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEV 476
C V L ++Q FI K +S+ TN C+ L D+ + +
Sbjct: 370 SCDQNVLPISGQKLFLNQDQLFIFK---SSVNVVSDLGKTNNL----CNVTLSDAVNNVL 422
Query: 477 DRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
D A+ F+T V +I+ P+ + + E+ ++ K L E C +K
Sbjct: 423 DFAQITFNTTDVV-----RIS----PQGNGTYYNENNSTLKKPLIE--------VTCNQK 465
Query: 537 CSSFFDFSCHIQYICL--------SWLVLFGLVLAIFPTVLVLLWLLH 576
C F+D CH CL + L + + L +F V+ L +LH
Sbjct: 466 CPDFWDIFCHFSTKCLNNGFKTLGTGLGILVIFLELF-DVVALFVVLH 512
>gi|146100443|ref|XP_001468864.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073233|emb|CAM71954.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 917
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 153/609 (25%), Positives = 253/609 (41%), Gaps = 65/609 (10%)
Query: 15 LILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEAS 74
+ + C+ L+ C +S S + C D +N++CT K+V+ + V GE S
Sbjct: 111 IAVLCVSLLVRLACPARAAFVSSSLISYCSDSGD-ENISCTKKMVVTVTVEGEQLPGEES 169
Query: 75 IV----AEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK-- 128
++ A + V + + +RI T +++A Y L Y+++ KP E +K
Sbjct: 170 LLFLNSATDMTVNNGTAVQFSPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGS 225
Query: 129 -TRKCEPDAGADVV-----------KICERQPICCPCG---------PQRRIPSSCGNVF 167
+C D AD I Q CC C P R ++C N+F
Sbjct: 226 LLNQCNADFNADTATCGLAYDAAGKAIPYSQGFCCDCSMCQTLGLCQPDARANAAC-NIF 284
Query: 168 DKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK------TGSKVSEVTV---G 218
DK TA CLRF W+ + IG ++V + + G+ +E V
Sbjct: 285 DKY-----TTASCLRFAQRWYSGYTIGGYMTWYTVNLTLSRNVSGSGGAGAAEKVVMHLS 339
Query: 219 PENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQD---LGGNFSMWM 275
P N T+ + + +++ VG T P + L P P + + + W+
Sbjct: 340 PSNNGETAGEGW---DVMARIVG-TYAPVDQPLDLTSRMLFAPAIPPNDARVQAGAAEWL 395
Query: 276 LLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLP 335
LL TLDG EC+K+GVSYEAF Q + C+ SCL +QL +YR AD RI
Sbjct: 396 LLPTNLVTLDGRECDKVGVSYEAFASQGNKCNLRPGSCLSSQLEDYRTADLQRIAAGNKG 455
Query: 336 LYGVEGRFERMNQHPNAGSHSF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
Y + F N +A + + S + + I + ADD+EY + GKI+S +
Sbjct: 456 QY-MATSFGDFNLENDAATSPYISYLAASPAATMISITVSADDLEYTVGLASGKIVSADL 514
Query: 395 --PTFEALTQFGVATITTQNTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFK 451
PT +A T GV T+ +NT V + +CS GV M Q + ++ + +FK
Sbjct: 515 NKPTLQAGTADGVMTVMVRNTAAVTGRLVVGMLNCSDGVFPMTAQKLSLAAQQQAAVTFK 574
Query: 452 IYPTTNQAA-KYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFF 510
+Y + A+ K +C+ +++++ + D + +T NG+Q +
Sbjct: 575 VYVQNSYASGKASCTVVVRNAHEAITDLRVVSWKVSSTNFHNGTQGGSADDGSGGV---- 630
Query: 511 ESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLV 570
S E R A RR+C + + ++ ++ +F L
Sbjct: 631 -STEESSAASCLNCRTLDIACAVRRRCWQLILLDLFVYLLIIAVVLCVIFFWRVFCCCLY 689
Query: 571 LLWLLHQKG 579
LL H++G
Sbjct: 690 LLGRQHRRG 698
>gi|407409949|gb|EKF32579.1| hypothetical protein MOQ_003566 [Trypanosoma cruzi marinkellei]
Length = 589
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 135/511 (26%), Positives = 231/511 (45%), Gaps = 32/511 (6%)
Query: 13 FLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGE 72
F +LF ++ +P G+ +L+ S +E+C++ L C K+V+ ++V S +
Sbjct: 10 FSSLLFALVVATTPFAAEGL-LLASSSIEQCDRVETDKLLPCEKKLVVTLSVDSAQADNV 68
Query: 73 ASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKP--QEFYMKT 129
V V++ V P+ LT +K+ Y L Y R+ KP +E +
Sbjct: 69 EEFVILRDAVDKTKGTGEERVEFEPIRLTTSKSRVQYRYPLFYERNFNAKPYEEEITTEL 128
Query: 130 RKCE----PDAGADVVKICERQPI------CCPCGPQRRIP----SSCGNVFDKLLKGKA 175
C+ P A + K +PI CC CG + + S G + G A
Sbjct: 129 TGCDDTFSPTATCGLAKDTAGRPIPYSQGFCCRCGACQLLGLCPVGSRGLQVCDIFNGAA 188
Query: 176 NTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KTGSKVSEVTVGPENKTATSAD 228
A CLRF W+ + IG +I + + +++ T +K + +GPE + +S +
Sbjct: 189 -LAACLRFGKLWYSGYSIGPATIWYRLLVKLTADAENNSTKAKEAVFELGPEVLSGSSPE 247
Query: 229 NFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLE 288
V+LIGDFV + L IP P + + + W++L++ ++ G +
Sbjct: 248 FGAWVSLIGDFVPAELPLVLSDKMLFIPSS--PRKHERVLAGQKEWIILDKHHVSMQGRD 305
Query: 289 CNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQ 348
CNK+GVSYEAF+ Q S C SCL +QL +YR +D R Y E +
Sbjct: 306 CNKVGVSYEAFSAQGSRCQLIQGSCLADQLEDYRASDLAVEARGGKGKYMARFFGEFVVN 365
Query: 349 HPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVA 406
N+ S + L + + + + AD ++Y+ SPG+I+S ++ T E ++ G
Sbjct: 366 TANSSRTRVSYWMRGSLATMITVVISADRLQYLISVSPGEIVSAVMSKSTIEESSRDGSI 425
Query: 407 TITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKI-YPTTNQAAKYTC 464
++ +N G + A Y+L +CS V + Q ++P+ET IRSF + + C
Sbjct: 426 SVMVRNIGNLTAEYTLGVGNCSGNVFPIMAQTLSLRPQETLIRSFDVNIQDVTEERIVQC 485
Query: 465 SAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
L+D+ + D+ +F + VL N +Q
Sbjct: 486 DVTLRDAKDAITDKKVVKFRVIRKVLTNNTQ 516
>gi|398022953|ref|XP_003864638.1| hypothetical protein, conserved [Leishmania donovani]
gi|322502874|emb|CBZ37956.1| hypothetical protein, conserved [Leishmania donovani]
Length = 917
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 153/609 (25%), Positives = 252/609 (41%), Gaps = 65/609 (10%)
Query: 15 LILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEAS 74
+ + C+ L+ C +S S + C D +N++CT K+V+ + V GE S
Sbjct: 111 IAVLCVSLLVRLACPARAAFVSSSLISYCSDSGD-ENISCTKKMVVTVTVEGEQLPGEES 169
Query: 75 IV----AEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK-- 128
++ A + V + + +RI T +++A Y L Y+++ KP E +K
Sbjct: 170 LLFLNSATDMTVNNGTAVQFSPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGS 225
Query: 129 -TRKCEPDAGADVV-----------KICERQPICCPCG---------PQRRIPSSCGNVF 167
+C D AD I Q CC C P R ++C N+F
Sbjct: 226 LLNQCNADFNADTATCGLAYDAAGKAIPYSQGFCCDCSMCQTLGLCQPDARANAAC-NIF 284
Query: 168 DKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK------TGSKVSEVTV---G 218
DK TA CLRF W+ + IG ++V + + G+ +E V
Sbjct: 285 DKY-----TTASCLRFAQRWYSGYTIGGYMTWYTVNLTLSRNVSGSGGAGAAEKVVMHLS 339
Query: 219 PENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQD---LGGNFSMWM 275
P N T+ + + +++ VG T P + L P P + + + W+
Sbjct: 340 PSNNGETAGEGW---DVMARIVG-TYAPVDQPLDLTSRMLFAPAIPPNDARVQAGAAEWL 395
Query: 276 LLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLP 335
LL TLDG EC+K+GVSYEAF Q + C+ SCL +QL +YR AD RI
Sbjct: 396 LLPTNLVTLDGRECDKVGVSYEAFASQGNKCNLRPGSCLSSQLEDYRTADLQRIAAGNKG 455
Query: 336 LYGVEGRFERMNQHPNAGSHSF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII 394
Y + F N +A + + S + + I + ADD+EY GKI+S +
Sbjct: 456 QY-MATSFGDFNLENDAATSPYISYLAASPAATMISITVSADDLEYTVGLVSGKIVSADL 514
Query: 395 --PTFEALTQFGVATITTQNTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFK 451
PT +A T GV T+ +NT V + +CS GV M Q + ++ + +FK
Sbjct: 515 NKPTLQAGTADGVMTVMVRNTAAVTGRLVVGMLNCSDGVFPMTAQKLSLAAQQQAAVTFK 574
Query: 452 IYPTTNQAA-KYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFF 510
+Y + A+ K +C+ +++++ + D + +T NG+Q +
Sbjct: 575 VYVQNSYASGKASCTVVVRNAHEAITDLRVVSWKVSSTNFHNGTQGGSADDGSGGV---- 630
Query: 511 ESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLV 570
S E R A RR+C + + ++ ++ +F L
Sbjct: 631 -STEESSAASCLNCRTLDIACAVRRRCWQLILLDLFVYLLIIAVVLCVIFFWRVFCCCLY 689
Query: 571 LLWLLHQKG 579
LL H++G
Sbjct: 690 LLGRQHRRG 698
>gi|342184647|emb|CCC94129.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 622
Score = 152 bits (383), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 134/529 (25%), Positives = 231/529 (43%), Gaps = 57/529 (10%)
Query: 10 LKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSS 69
L FL + + SP + I++ S +E CE+ ++ C K+V+ ++V S +
Sbjct: 6 LVPFLTVAALAVVYYSP--ITEGAIVASSSVEHCERDGRTETFPCERKLVVTLSVDSEQT 63
Query: 70 GGEASIVAEVVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMK 128
G ++ +++ +K + V + P+ L K+A + Y + Y+++ KP E +
Sbjct: 64 AGAEEVIFLREALDKTGNRKEKRVFVEPIRLVTIKSAVHYRYPVYYVQNFNAKPYEQQLT 123
Query: 129 TRKCE----------PDAG----ADVVKICERQPICCPCG---------PQRRIPSSCGN 165
T E P G + I Q CC CG P+ R S C
Sbjct: 124 TTAMEWCKDYNESASPTCGLARDSSGRVIPYSQGFCCSCGACELSGICRPKSRGASKCSI 183
Query: 166 VFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSE----------V 215
+ G A CLRF W+ + IG+ ++ + +++ + T VS +
Sbjct: 184 I------GNTGKASCLRFGNMWYSGYNIGRGTVWYRLQVGLTTQGAVSGDGVVKPNQHML 237
Query: 216 TVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQP---QDLGGNFS 272
++GP+ TA+SA+ + LIGDF PS L P P + + +
Sbjct: 238 SLGPDTITASSAEFGVSARLIGDFA-----PSEMPLDLTNKMLFAPAVPRTHERVRAGHN 292
Query: 273 MWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRN 332
W+ L++ ++ G ECN++GVSYE F Q CS+ +CL NQL +YR D +
Sbjct: 293 EWIFLDKHLVSVHGRECNRVGVSYEGFATQGGRCSALPGACLANQLDDYRGLDLKSESEG 352
Query: 333 QLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKII 390
+ Y G F + + H N+ + + L + + I + AD +++V SPG I+
Sbjct: 353 RKGHYMARFFGEF-KTDSHSNSSAPRITYQTRNSLATMVTITILADKLKFVLSVSPGTIV 411
Query: 391 SVIIP--TFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSI 447
+V + + ++ TI NTG+VEA Y++ +C+ M Q I P +
Sbjct: 412 NVTVSGTNVASYSRGNTVTINVLNTGDVEAQYTVGVGNCTIDAHPMVAQVAFIPPLHSVQ 471
Query: 448 RSFKIYPTTNQ-AAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
R+F + ++ K +C+A L+++ VD F NGSQ
Sbjct: 472 RNFSLVSQSDSLVEKASCTASLQNARGDVVDTYTFYFDVKPVGWTNGSQ 520
>gi|389594441|ref|XP_003722443.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|323363671|emb|CBZ12676.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 917
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 160/639 (25%), Positives = 262/639 (41%), Gaps = 71/639 (11%)
Query: 35 LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV----AEVVEVEENSTQKM 90
+S S + C D +N++CT K+V+ + V GE S++ A + V ++ +
Sbjct: 131 VSSSLISYCSDSGD-ENISCTKKMVVTVTVEGEQLPGEESLLFLNSATDMTVNNGTSVQF 189
Query: 91 RTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK---TRKCEPDAGADVV------ 141
+RI T +++A Y L Y+++ KP E +K +C D AD
Sbjct: 190 SPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGSLLNQCNADFNADTATCGLAY 245
Query: 142 -----KICERQPICCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDW 187
I Q CC C P R ++C NVF GK TA CLRF W
Sbjct: 246 DAAGKAIPYSQGFCCDCSMCQTLGLCQPDARANAAC-NVF-----GKYTTASCLRFAQRW 299
Query: 188 FHVFGIGQRSIGFSVRIEVK------TGSKVSEVTV---GPENKTATSADNFLKVNLIGD 238
+ + IG ++V + + G+ +E V P N + + + +++
Sbjct: 300 YSGYTIGGYMTWYTVNLTLSRNVSDSGGAGAAEKVVMRLSPSNNGEVAGEGW---DVMAR 356
Query: 239 FVGYTNIPSFEEFYLVIPRQGGPGQPQD---LGGNFSMWMLLERTRFTLDGLECNKIGVS 295
VG T P + L P P + + + W+LL TLDG EC+K+GVS
Sbjct: 357 IVG-TYAPVDQPLDLTSRMLFAPAIPPNDARVQAGAAEWLLLPTNLVTLDGRECDKVGVS 415
Query: 296 YEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSH 355
YEAF Q + C+ SCL +QL +YR AD RI Y + F N +A +
Sbjct: 416 YEAFASQGNKCNLRPGSCLSSQLEDYRTADLQRIAAGNKGQY-MATSFGDFNLENDAATS 474
Query: 356 SF-SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVATITTQN 412
+ S + + I + ADD+EY + GKIIS + PT +A T GV T+ +N
Sbjct: 475 PYISYLAASPAATMISITVSADDLEYTVGLASGKIISTDMNKPTLQAGTADGVMTVMVRN 534
Query: 413 TGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAA-KYTCSAILKD 470
T V + T +CS GV M Q + ++ S +FK+Y ++ A+ +C+ ++++
Sbjct: 535 TAAVTGRLVVGTLNCSDGVFPMTAQKLSLAAQQQSAVTFKVYVQSSHASGNASCTVVVRN 594
Query: 471 SDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITG 530
+ D + +T NG+Q ++ S E R
Sbjct: 595 AHEVITDLRVVSWKVSSTNFHNGTQGG-----SAADGSGGGSTEESSAASCLNCRTLDIA 649
Query: 531 KACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLYDWWDD 590
A RR+C + + ++ ++ +F L LL H++G +
Sbjct: 650 CAVRRRCWQLILLDLFVYLLIIAVILCVIFFWRVFCCCLYLLGRQHRRGSAG------EA 703
Query: 591 HFQSDNQRIRDFRSRRIDVDHPHVHVRKHHKQEGRHHKL 629
+++ R + RR + D + HK G L
Sbjct: 704 EPKNEASRWGAYWKRRGESDATSSSRQTDHKNSGSSDVL 742
>gi|145490447|ref|XP_001431224.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398327|emb|CAK63826.1| unnamed protein product [Paramecium tetraurelia]
Length = 685
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 144/564 (25%), Positives = 242/564 (42%), Gaps = 59/564 (10%)
Query: 32 VQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE----NST 87
+I+S+S++ KC + ++N C+ K+++++ V + + V E +++ E N T
Sbjct: 2 AEIISQSQINKCYSNS-TNNTECSEKMLISLTVENAQNT-----VTEYIKISETTIDNQT 55
Query: 88 QKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK--TRKCE-------PDAGA 138
+++T P ++++ KT YA Y L Y D +P E + C+ P G
Sbjct: 56 SQLKT---PIIISITKTPVYAFYPLKYTEDYNSQPYEVKIAGAILSCDDSWYSNSPTCGF 112
Query: 139 DVVK---ICERQPICCPCGPQRRIPSSC----GNVFDKLLKGKANTAHCLRFPGDWFHVF 191
K I + Q CC CG I S GN+ K A A CLR+ W+ +
Sbjct: 113 QYEKKEKIFDSQGFCCSCGILDLIGLSDEFARGNICHKAGLTTATMAFCLRYSTLWYSAY 172
Query: 192 GIGQRSIGFSVRIEVK-TGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEE 250
I SI +++ I + + + E+ +G E K L +IGDF PS E
Sbjct: 173 EISTYSIYYNITISITYSNQEQEELQLGSEVKVVQGKT--LIGRIIGDFTPLNPPPSLES 230
Query: 251 FYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPF 310
FY + P P + + +M++ + + + ECNKIGVSY AF + C
Sbjct: 231 FYFMRP--SSPNSHARVQAGSAAFMIVSKDQ--VGRGECNKIGVSYSAFRTEAERCKKQV 286
Query: 311 WSCLHNQLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSN 368
SCL NQL ++ DQ I N P Y + G+F+ + + N ++ V + +
Sbjct: 287 KSCLKNQLEDFYIEDQALIANNSQPKYLLSRYGKFKSI--YLNNETY-LQYSVEGSMQTM 343
Query: 369 LLIELRADD-IEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCS 427
+ +E+ I YV GKI I FEA + G+ N G++E+ ++ +CS
Sbjct: 344 ITLEITTTGLISYVVNLGKGKIDLAEIQDFEAKSGNGLLYAQITNVGDIESEFNTYLNCS 403
Query: 428 TGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMA 487
V + +KP E+ I + ++ C+ L ++ + +D+ + +F+T
Sbjct: 404 INVIPINSAALYLKPLESYIVKKDVNVLSDMNKSNICTFSLLNNKGTLLDQKQIEFNT-T 462
Query: 488 TVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHI 547
+ Q Q K N+ S ES C CS F D +C+I
Sbjct: 463 EIQHESEQNHEEQNIKD--NEVLASDES--------------QDNCYSDCSVFLDITCYI 506
Query: 548 QYICLSWLVLFGLVLAIFPTVLVL 571
C S ++ F VL I L++
Sbjct: 507 FNDCNSQIITFFTVLGITFIFLII 530
>gi|302842682|ref|XP_002952884.1| hypothetical protein VOLCADRAFT_105708 [Volvox carteri f.
nagariensis]
gi|300261924|gb|EFJ46134.1| hypothetical protein VOLCADRAFT_105708 [Volvox carteri f.
nagariensis]
Length = 1181
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 160/648 (24%), Positives = 256/648 (39%), Gaps = 146/648 (22%)
Query: 17 LFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV 76
L IL+LL V G ++L+ KLEKC + ++ + C+ K+V+ + V +G + +
Sbjct: 115 LCVILSLLWASKVYG-EVLAAGKLEKCVRDGVTEVVQCSDKLVITVTVANGQTLKTEELD 173
Query: 77 AEVVEVEENSTQ-------------KMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQ 123
V+ V + + R + P +T+ K+ +A Y LT+++ +KP
Sbjct: 174 LTVLCVNSPTGECPCPCNAAVDEDCSCRDLAAPMKVTITKSLLWASYPLTFVQQFNWKPV 233
Query: 124 EF--YMKTRKCEPDA------------GADVVKICERQPICCPCGP-------------Q 156
E Y ++KC G D K+ + Q CC C +
Sbjct: 234 EIIQYTNSKKCRDGDYEQYPTCPYYYDGKD--KVPDSQGFCCQCSSGEVWDDTFGDLKYR 291
Query: 157 RRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHV-------FGIGQRSIGFS--VRIEVK 207
R +C L+ AHC++ F V + +G S+ F V IE+
Sbjct: 292 TRANLNCDFRLGMLIGIYPAAAHCVQLDRFNFAVSTRVGLGYNVGPPSLNFEIYVNIEIP 351
Query: 208 T-----------GSKVS-----------------------EVTVGPENKTATSADNFLKV 233
T S VS +T+ P A S + V
Sbjct: 352 TIPAGWSPRVNGTSSVSVNATTLSNGTLNTSQNTFVMRYETLTLSPSIPLAVSKTKMVSV 411
Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
L+GD YT +P+F L++P D+G N S WML++++ +LDG C+KIG
Sbjct: 412 KLLGDLAMYTMLPTFGHQMLMLPLY-------DIG-NRSTWMLVDKSLISLDGRTCDKIG 463
Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAG 353
S+ AF QPS C +CL QL + + D +RI + + PLY V +Q+P
Sbjct: 464 TSFSAFRYQPSGCHRAVSTCLKGQLKDLYDEDMDRIKKGRAPLYMV-------SQYPGYE 516
Query: 354 SHSFSIG-----------VTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQ 402
SF+ G VT S L + + AD + + RSPGKI V + F +
Sbjct: 517 QASFTAGKFGNETVFLLPVTSQSQSVLTLTVSADKLRLITNRSPGKISDVQLCRFGNASH 576
Query: 403 FGV--------ATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETS--IRSFK 451
G + NTG ++A Y++ +CS+ + +E + + T+ +
Sbjct: 577 CGFFEAGNRGYIRLNVTNTGRLDADYTVAVTNCSSNIRPIEARMIAVSAGRTAPLWPPIE 636
Query: 452 IYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQP-PKSSINDFF 510
+Y + +CS +L DS D+ E FST T F P P N
Sbjct: 637 VYVEDTENKTRSCSVLLYDSTGGIADQTEMSFST---------NQTDFGPTPTGGFNGTG 687
Query: 511 ESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLF 558
+S+ + K L C C++ + C + C W LF
Sbjct: 688 DSLARLEKDL-----------TCDEACTNPINVWCIVVKRC--WSKLF 722
>gi|125505600|gb|ABN45755.1| gamete fusion-like protein [Hydra magnipapillata]
Length = 673
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 142/566 (25%), Positives = 241/566 (42%), Gaps = 92/566 (16%)
Query: 8 LKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLN----------CTTK 57
K+K LL F N+ VG ILSKS +E CE S++L C K
Sbjct: 5 FKMKKQLLSSF--FNITVNIIFVGGLILSKSSIEFCENTGSSNDLKDPTNVVTQSACEKK 62
Query: 58 IVLNMAVPSGSSGGEASIVAEVVEVEENS-TQKMRTVRIPPVLTVNKTASYAVYELTY-- 114
+V+ ++V G+ GE + VV V +NS T + + P ++TV+K+ Y + +
Sbjct: 63 MVVLLSV--GNKQGETEKLQAVVSVVQNSATNEFARLYNPFMITVSKSPVYLNFPFFFNG 120
Query: 115 --IRDVPYK-----PQEFYMK--TRKC--------------------------EPDAGAD 139
+ + PY+ +Y+ +R+C + D
Sbjct: 121 ITVNNQPYEEIILSKNRWYVSDSSRQCLDQWQVEEEDDEHPTCGYQYTNSTQKQTDGTWK 180
Query: 140 VVK--ICERQPICCPCGPQRR---------IPSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
VK I + Q CC C + + G + L +AHC+R W+
Sbjct: 181 TVKTRIWDSQGFCCYCTQDLKNYYIKKDIQDANRAGIICKPLTNSPQASAHCMRMSNLWY 240
Query: 189 HVFGIGQRSIGFSVRIE--------VKTGSKV-----SEVTVGPENKTATSADNFLKVNL 235
+ + FS+ ++ V+ S + E+ + P K+AT + N + N
Sbjct: 241 TLNEFTESYRDFSIYVKAFDQITKVVQNKSYIDYVNGGEILLSPSQKSATGSYNRITGNY 300
Query: 236 IGDFVGYTNIPSFEEFYLVIPRQG---GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKI 292
+GD + P Y +IP P + L S WM++ R + D +C+ I
Sbjct: 301 VGDLQPIKSYPVLTNNYFLIPFSSTNVDPKKEPQLKSGISKWMIIPRDLVSTDAKQCDMI 360
Query: 293 GVSYEAFNGQPSF-----CSSPFWSCLHNQLWNYREADQNRINRNQLPLY--GVEGRFER 345
GV Y AF Q ++ C + SCL NQ +N D++R+ + ++P Y G+
Sbjct: 361 GVGYSAFRNQAAYGTGYGCRAKKGSCLANQPYNKFMDDEDRLEKGKMPWYFPARYGKLAG 420
Query: 346 MNQHPNAGSHSFSIGVTEVLN---SNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQ 402
+ Q N G + + E+ + S + +++ ADD+ VY R+ G I I FEAL+
Sbjct: 421 VKQ--NIGDNDKYLLTYELDDEQISLVTLQISADDVVLVYNRATGIITRTAIQDFEALSL 478
Query: 403 FGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAK 461
G ++ NTG V + + ++ C++GV +EE+ I P+ T +FK+ +T++ +
Sbjct: 479 EGQLSVDVLNTGYVSSDFRISIPSCTSGVQPIEEKRITIDPQMTETITFKMMTSTDKKSA 538
Query: 462 YTCSAILKDSDFSEVDRAECQFSTMA 487
+ C+ L DS + FST A
Sbjct: 539 HDCTINLYDSKNILLQSRNFTFSTKA 564
>gi|290983267|ref|XP_002674350.1| predicted protein [Naegleria gruberi]
gi|284087940|gb|EFC41606.1| predicted protein [Naegleria gruberi]
Length = 615
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 132/537 (24%), Positives = 226/537 (42%), Gaps = 58/537 (10%)
Query: 79 VVEVEENSTQKMRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYM---------- 127
+VE+ + +K V + P+ + + K+A AVY L Y++ KP E +
Sbjct: 2 LVELSDTVNEKGEKVNLRPIKIVIQKSAPKAVYPLLYVKTFNGKPTESIIYKDDILVPTC 61
Query: 128 --KTRKCEPDAG----ADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKG---KANTA 178
++ P G + KI + Q CC C + S + L G ++A
Sbjct: 62 DDSSKSAAPTCGWVKDSQGNKIPDSQGFCCSCSVGQMFGDSSASNRGALNCGFMQMKSSA 121
Query: 179 HCLRFPGDWFHVFGIGQRSIGFSVRIEV-KTG-SKVSEVTVGPENKTATSADNFLKVNLI 236
HCLR ++ + I + F + + + +TG + +VTV P +K A +V L
Sbjct: 122 HCLRLGEVYWDAYEIEGYVMSFEISVFIGETGFDDIGKVTVSPSSKLAQLPKGG-RVELE 180
Query: 237 GDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSY 296
GDF Y ++P +E YL IP P + + WM ++++ TL G EC+KIGVSY
Sbjct: 181 GDFSAYKSVPLYESKYLFIPSS--PKTSPIVVNGQANWMFIDKSMVTLSGSECDKIGVSY 238
Query: 297 EAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHS 356
F QP+ CS P +CL NQ+ + R AD + + F + +
Sbjct: 239 AQFRNQPNACSRPALTCLANQIEDLRLADVELMKSGLKSGKYIVSNFGSFAVNKTNTGNV 298
Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEV 416
+ E NS + + + ++++++ +S +I + TF +L++ G ++ +N G
Sbjct: 299 LEKYLDEDTNSQINLYINGENVKFLITKSAAEISEAYVKTFTSLSKEGEMLVSVKNKGAN 358
Query: 417 EASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSE 475
SY +T +CS + + +Q + +F++ A C L SD +
Sbjct: 359 GCSYVVTVTECSDNILTIVQQTVFVDASNKKELTFQVRSEQKLATTNQCKVTLLFSDGEK 418
Query: 476 VDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRR 535
+ F + +N + S E GK EG D G+
Sbjct: 419 IQDITVTFDSKDYAYENAME---------------SSGEQTGKVETEG--DHSLGQC--- 458
Query: 536 KCSSFFDFSCHI--QYICLSWLVLFGLVLAIF-----PTVLVLLWLLHQKGLFDPLY 585
KC+S FD C + C S+++ G V +I P + V LW + GLF ++
Sbjct: 459 KCNSPFDVVCIVLNSSSCTSYII--GWVASIVGIIATPVIFVFLW---RCGLFGLMF 510
>gi|66823829|ref|XP_645269.1| hypothetical protein DDB_G0272452 [Dictyostelium discoideum AX4]
gi|60473432|gb|EAL71378.1| hypothetical protein DDB_G0272452 [Dictyostelium discoideum AX4]
Length = 327
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 148/300 (49%), Gaps = 15/300 (5%)
Query: 170 LLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVT--VGPENKTATSA 227
LL ++++AHCL F + V+ I + + + + + + +T + N
Sbjct: 19 LLGSQSSSAHCLSFSPMKYDVYNIAKTQVEYKITATLTYSYNQNPITQDIILSNSNPMGM 78
Query: 228 DNFLK--VNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLD 285
D+F + + ++GDF T I F + +V P QP + + MLL++ F L
Sbjct: 79 DSFSQAMIRIVGDFQSSTQINQFTDKKVVFPY----NQPNSI----NTAMLLDQNFFDLS 130
Query: 286 GLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFER 345
GL CNKIGVSY AF QP+ C++ F SCL NQ+ +Y AD I+ + Y +
Sbjct: 131 GLTCNKIGVSYSAFQNQPNKCAALFGSCLQNQIADYYNADVTLISNGKKGNYIASQFGTK 190
Query: 346 MNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGV 405
+ N S S I E + L I L+AD ++Y SPGKIIS I FE++++ G+
Sbjct: 191 VAGDQN--SRSLKIRFDESHRTMLTITLKADSLQYRVDISPGKIISYQIDRFESMSKNGI 248
Query: 406 ATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTC 464
+ QN G + + Y+L +CS + ++ + IK KE F+I+ T+ + Y C
Sbjct: 249 LRVKVQNIGTINSDYTLAIVNCSGDINPIDSKDVTIKSKEIYSFEFQIFTTSKLDSSYQC 308
>gi|226230652|gb|ACO39319.1| hypothetical protein [Populus balsamifera]
gi|226230656|gb|ACO39321.1| hypothetical protein [Populus balsamifera]
gi|226230698|gb|ACO39342.1| hypothetical protein [Populus balsamifera]
gi|226230712|gb|ACO39349.1| hypothetical protein [Populus balsamifera]
gi|226230720|gb|ACO39353.1| hypothetical protein [Populus balsamifera]
gi|226230724|gb|ACO39355.1| hypothetical protein [Populus balsamifera]
gi|226230756|gb|ACO39371.1| hypothetical protein [Populus balsamifera]
gi|226230764|gb|ACO39375.1| hypothetical protein [Populus balsamifera]
gi|226230772|gb|ACO39379.1| hypothetical protein [Populus balsamifera]
gi|226230774|gb|ACO39380.1| hypothetical protein [Populus balsamifera]
gi|226230780|gb|ACO39383.1| hypothetical protein [Populus balsamifera]
Length = 64
Score = 134 bits (336), Expect = 2e-28, Method: Composition-based stats.
Identities = 58/64 (90%), Positives = 61/64 (95%)
Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
LIGDFVGY+NIPSFE+FYLVIPRQG PGQPQDLG NFSMWMLLER RFTLDG+ECNKIGV
Sbjct: 1 LIGDFVGYSNIPSFEDFYLVIPRQGEPGQPQDLGRNFSMWMLLERVRFTLDGVECNKIGV 60
Query: 295 SYEA 298
SYEA
Sbjct: 61 SYEA 64
>gi|154344439|ref|XP_001568161.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065498|emb|CAM43265.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 905
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 129/506 (25%), Positives = 210/506 (41%), Gaps = 59/506 (11%)
Query: 36 SKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV----AEVVEVEENSTQKMR 91
S S + C D + +NC K+V+ + V G E S++ A + ++ + +
Sbjct: 129 SSSLISYCSDSGD-EKINCKKKMVVTVTVEGGQLPDEESLLFLNSATDMTIKNGTAVQFS 187
Query: 92 TVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK---TRKCEPD-----AGADVVKI 143
+RI T +++A Y L Y+++ KP E +K +C D A +
Sbjct: 188 PIRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVKGSLLNQCNADFDTNTATCGIAHD 243
Query: 144 CERQPI------CCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
+PI CC C P R + C N+FD+ TA CLRF W+
Sbjct: 244 AVGKPIPYSQGFCCDCSMCQTLGLCLPDARANAGC-NIFDRY-----TTASCLRFTKRWY 297
Query: 189 HVFGIGQRSIGFSVR------IEVKTGSKVSEVTV--------GPENKTATSADNF-LKV 233
+ IG ++V + V G+ +E V P + T+ + + +
Sbjct: 298 SGYTIGGYVTWYTVNLTLSRNVSVSGGAGSAEKVVTQKVVMHLSPSSNGETAGEEWDVMA 357
Query: 234 NLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIG 293
++G++ L P P + + + WMLL TLDG EC+K+G
Sbjct: 358 RVLGNYAPIVQPLDLTSRMLFAP--AIPPNDERVQAGAAEWMLLPTNLVTLDGRECDKVG 415
Query: 294 VSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAG 353
VSYEAF Q + C+ SCL +QL +YR D RI Y + +
Sbjct: 416 VSYEAFASQGNKCNLRPGSCLSSQLEDYRTTDLERIASGNKGQYMATSFGDFHLERDAVA 475
Query: 354 SHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVATITTQ 411
S S T + L I + ADD+EY + GKI+S + P EA T+ GV T+ +
Sbjct: 476 SPYISYRATSPAATMLSITISADDLEYTVGLASGKIVSAELNKPVLEASTKDGVMTVVVR 535
Query: 412 NTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAK-YTCSAILK 469
N V + T CS GV + Q + ++ S +F +Y + A++ +C +L+
Sbjct: 536 NAASVTGRVVVGTSSCSDGVFPITAQTLSLAAQQQSTVAFNVYMQDSYASENASCMVVLR 595
Query: 470 DSDFSEVDRAECQFSTMATVLDNGSQ 495
++ D + +T NG+Q
Sbjct: 596 NAQEVITDLRTVSWKVSSTSFHNGTQ 621
>gi|125551606|gb|EAY97315.1| hypothetical protein OsI_19236 [Oryza sativa Indica Group]
Length = 143
Score = 132 bits (331), Expect = 9e-28, Method: Composition-based stats.
Identities = 66/120 (55%), Positives = 89/120 (74%), Gaps = 4/120 (3%)
Query: 31 GVQILSKSKLEKCEKRTDSDN-LNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE--NST 87
G +ILSKS+LE C +D+ L C K+V+++AVPSG+SGGEAS+VA V VEE ++
Sbjct: 24 GTEILSKSRLESCSHDSDAGGRLKCDRKLVVDLAVPSGASGGEASLVARVAGVEEENDTP 83
Query: 88 QKMRTVRIPPVLTVNKTASYAVYELTYI-RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+++R PPV+TV+K+A+YA+Y LTY+ RDV Y+P E Y+KT KCEP AGA VV CER
Sbjct: 84 SATKSIRDPPVITVSKSATYALYALTYLDRDVAYRPDEKYVKTHKCEPYAGAKVVGECER 143
>gi|401429134|ref|XP_003879049.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322495299|emb|CBZ30602.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 917
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 126/479 (26%), Positives = 209/479 (43%), Gaps = 56/479 (11%)
Query: 35 LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIV----AEVVEVEENSTQKM 90
+S S + C D +++ C K+V+ + V GE S++ A + +++ + +
Sbjct: 131 VSSSLISYCSDSGD-ESIRCEKKMVVTVTVEGEQLPGEESLLFLNSATDMTIDDGTVVQF 189
Query: 91 RTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMK---TRKCEPDAGADVVK----- 142
+RI T +++A Y L Y+++ KP E ++ +C D AD
Sbjct: 190 SPLRI----TTSRSAVRYRYPLFYVQNYNAKPYEATVRGNLLNQCNADFNADKATCGLAY 245
Query: 143 ------ICERQPICCPCG---------PQRRIPSSCGNVFDKLLKGKANTAHCLRFPGDW 187
I Q CC C P R ++C N+FDK A CLRF W
Sbjct: 246 DAAGKPIPYSQGFCCDCSMCQTLGLCKPDARANAAC-NIFDKY-----TAASCLRFGQRW 299
Query: 188 FHVFGIGQRSIGFSVRIEVKTGSKVS---------EVTVGPENKTATSADNF-LKVNLIG 237
+ + IG ++V + + VS E+ + P N T+ + + + ++G
Sbjct: 300 YSGYTIGGYMTWYTVNLTLSRSVSVSGGADAVEKVEMHLSPSNNGETAGEGWDVMARIVG 359
Query: 238 DFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYE 297
++ L P P + + W+LL TLDG EC+K+GVSYE
Sbjct: 360 NYAPVDQPLDLTSRMLFAPAI--PPNDVRVQAGAAEWLLLPTNLVTLDGRECDKVGVSYE 417
Query: 298 AFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSF 357
AF Q + C+ SCL +QL +YR AD RI Y + F N +A + +
Sbjct: 418 AFASQGNKCNLRPGSCLSSQLEDYRTADLERIAAGNKGQY-MATSFGDFNLENDAATSPY 476
Query: 358 -SIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVII--PTFEALTQFGVATITTQNTG 414
S + + I + ADD+EY + GKI+S + PT EA T GV T+ +NT
Sbjct: 477 ISYLAASPAATMISITVSADDLEYTVGVASGKIVSADLNKPTLEAGTTDGVMTVMVRNTA 536
Query: 415 EVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAA-KYTCSAILKDS 471
V + T +CS GV M Q + ++ S +FK+Y + A+ +C+ +++++
Sbjct: 537 AVTGRLVVGTLNCSDGVFPMTAQQLSLAAQQQSAVTFKVYMQNSYASGDASCTVVVRNA 595
>gi|340057663|emb|CCC52009.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 605
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 141/528 (26%), Positives = 214/528 (40%), Gaps = 62/528 (11%)
Query: 14 LLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEA 73
++ F ++ P + ++ S ++ CE+ D + C K+V+ ++V +G G
Sbjct: 27 MVTAFVLIGTHLPHHMAEGVFIASSSIDYCERNNKVDPVPCEKKMVVTLSVDAGQDAG-- 84
Query: 74 SIVAEVVEVEENSTQK----MRTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMK 128
V EVV V E S + R V P+ LT KT Y L Y R+ KP E +
Sbjct: 85 --VEEVVLVREASDKTRDDDKRVVEFEPIYLTTKKTRVRYHYPLFYERNFNAKPYEEQIP 142
Query: 129 TRKCEP--------DAGADVVKICERQPI------CCPCGP---------QRRIPSSCGN 165
T +P A + ++PI CC CG R SC N
Sbjct: 143 TSLFDPCVDKPGSSKATCGIAHDNYQKPIPFSEGFCCNCGACQLAGICPSDSRGLGSC-N 201
Query: 166 VFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGF----SVRIEVKTGSKVSE-----VT 216
+F +A CLR W+ + IGQ + + ++R EV S S ++
Sbjct: 202 IFQT-----TGSASCLRLGELWYSGYNIGQGTAWYRLHVTLRDEVDNNSAASTRGSATMS 256
Query: 217 VGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWML 276
+GP+ S L+GDFV L P P + + + WM
Sbjct: 257 LGPDQPADFSEKFGAWARLVGDFVPPEMPLDLTGKMLFTP--ATPRRHERVIAGSREWMF 314
Query: 277 LERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD-----QNRINR 331
L++ +L G ECNKIGVSYE F Q S C S +CL +QL +YR+ D R +
Sbjct: 315 LDKHLVSLQGRECNKIGVSYEGFVTQGSRCVSRPGTCLADQLEDYRQRDVVAEAHGRRGK 374
Query: 332 NQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKI-- 389
L+G N S + + L++ + I + AD + YV SPG I
Sbjct: 375 YMARLFG----DMYTGGTRNTSSPYIAFWLRGSLSTMVTITINADSLRYVQSVSPGTILR 430
Query: 390 ISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIR 448
I ++ T + T+ GV ++T NTG E+ Y L +CS GV + Q I +
Sbjct: 431 IKLMNKTVFSYTRSGVVSVTVLNTGRAESQYFLAVRNCSVGVHPIAAQTINIPSGHNATC 490
Query: 449 SFKIYPTTN-QAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
F +Y + C L+D+ + D + M +GSQ
Sbjct: 491 LFDLYVQEDVMTPNVKCHVELRDARGNVTDTSLFYLRLMPVNRTSGSQ 538
>gi|226230642|gb|ACO39314.1| hypothetical protein [Populus balsamifera]
gi|226230644|gb|ACO39315.1| hypothetical protein [Populus balsamifera]
gi|226230646|gb|ACO39316.1| hypothetical protein [Populus balsamifera]
gi|226230648|gb|ACO39317.1| hypothetical protein [Populus balsamifera]
gi|226230650|gb|ACO39318.1| hypothetical protein [Populus balsamifera]
gi|226230654|gb|ACO39320.1| hypothetical protein [Populus balsamifera]
gi|226230658|gb|ACO39322.1| hypothetical protein [Populus balsamifera]
gi|226230660|gb|ACO39323.1| hypothetical protein [Populus balsamifera]
gi|226230662|gb|ACO39324.1| hypothetical protein [Populus balsamifera]
gi|226230664|gb|ACO39325.1| hypothetical protein [Populus balsamifera]
gi|226230666|gb|ACO39326.1| hypothetical protein [Populus balsamifera]
gi|226230668|gb|ACO39327.1| hypothetical protein [Populus balsamifera]
gi|226230670|gb|ACO39328.1| hypothetical protein [Populus balsamifera]
gi|226230672|gb|ACO39329.1| hypothetical protein [Populus balsamifera]
gi|226230674|gb|ACO39330.1| hypothetical protein [Populus balsamifera]
gi|226230676|gb|ACO39331.1| hypothetical protein [Populus balsamifera]
gi|226230678|gb|ACO39332.1| hypothetical protein [Populus balsamifera]
gi|226230680|gb|ACO39333.1| hypothetical protein [Populus balsamifera]
gi|226230682|gb|ACO39334.1| hypothetical protein [Populus balsamifera]
gi|226230684|gb|ACO39335.1| hypothetical protein [Populus balsamifera]
gi|226230686|gb|ACO39336.1| hypothetical protein [Populus balsamifera]
gi|226230688|gb|ACO39337.1| hypothetical protein [Populus balsamifera]
gi|226230690|gb|ACO39338.1| hypothetical protein [Populus balsamifera]
gi|226230694|gb|ACO39340.1| hypothetical protein [Populus balsamifera]
gi|226230696|gb|ACO39341.1| hypothetical protein [Populus balsamifera]
gi|226230700|gb|ACO39343.1| hypothetical protein [Populus balsamifera]
gi|226230702|gb|ACO39344.1| hypothetical protein [Populus balsamifera]
gi|226230704|gb|ACO39345.1| hypothetical protein [Populus balsamifera]
gi|226230706|gb|ACO39346.1| hypothetical protein [Populus balsamifera]
gi|226230708|gb|ACO39347.1| hypothetical protein [Populus balsamifera]
gi|226230710|gb|ACO39348.1| hypothetical protein [Populus balsamifera]
gi|226230714|gb|ACO39350.1| hypothetical protein [Populus balsamifera]
gi|226230716|gb|ACO39351.1| hypothetical protein [Populus balsamifera]
gi|226230718|gb|ACO39352.1| hypothetical protein [Populus balsamifera]
gi|226230722|gb|ACO39354.1| hypothetical protein [Populus balsamifera]
gi|226230726|gb|ACO39356.1| hypothetical protein [Populus balsamifera]
gi|226230728|gb|ACO39357.1| hypothetical protein [Populus balsamifera]
gi|226230730|gb|ACO39358.1| hypothetical protein [Populus balsamifera]
gi|226230732|gb|ACO39359.1| hypothetical protein [Populus balsamifera]
gi|226230734|gb|ACO39360.1| hypothetical protein [Populus balsamifera]
gi|226230736|gb|ACO39361.1| hypothetical protein [Populus balsamifera]
gi|226230738|gb|ACO39362.1| hypothetical protein [Populus balsamifera]
gi|226230740|gb|ACO39363.1| hypothetical protein [Populus balsamifera]
gi|226230742|gb|ACO39364.1| hypothetical protein [Populus balsamifera]
gi|226230744|gb|ACO39365.1| hypothetical protein [Populus balsamifera]
gi|226230746|gb|ACO39366.1| hypothetical protein [Populus balsamifera]
gi|226230748|gb|ACO39367.1| hypothetical protein [Populus balsamifera]
gi|226230750|gb|ACO39368.1| hypothetical protein [Populus balsamifera]
gi|226230752|gb|ACO39369.1| hypothetical protein [Populus balsamifera]
gi|226230754|gb|ACO39370.1| hypothetical protein [Populus balsamifera]
gi|226230758|gb|ACO39372.1| hypothetical protein [Populus balsamifera]
gi|226230762|gb|ACO39374.1| hypothetical protein [Populus balsamifera]
gi|226230766|gb|ACO39376.1| hypothetical protein [Populus balsamifera]
gi|226230768|gb|ACO39377.1| hypothetical protein [Populus balsamifera]
gi|226230770|gb|ACO39378.1| hypothetical protein [Populus balsamifera]
gi|226230778|gb|ACO39382.1| hypothetical protein [Populus balsamifera]
gi|226230782|gb|ACO39384.1| hypothetical protein [Populus balsamifera]
gi|226230784|gb|ACO39385.1| hypothetical protein [Populus balsamifera]
gi|226230786|gb|ACO39386.1| hypothetical protein [Populus balsamifera]
gi|226230788|gb|ACO39387.1| hypothetical protein [Populus balsamifera]
gi|226230790|gb|ACO39388.1| hypothetical protein [Populus balsamifera]
gi|226230792|gb|ACO39389.1| hypothetical protein [Populus balsamifera]
gi|226230794|gb|ACO39390.1| hypothetical protein [Populus balsamifera]
gi|226230796|gb|ACO39391.1| hypothetical protein [Populus balsamifera]
gi|226230798|gb|ACO39392.1| hypothetical protein [Populus balsamifera]
gi|226230800|gb|ACO39393.1| hypothetical protein [Populus balsamifera]
gi|226230802|gb|ACO39394.1| hypothetical protein [Populus balsamifera]
gi|226230804|gb|ACO39395.1| hypothetical protein [Populus balsamifera]
gi|226230806|gb|ACO39396.1| hypothetical protein [Populus balsamifera]
gi|226230808|gb|ACO39397.1| hypothetical protein [Populus balsamifera]
gi|226230810|gb|ACO39398.1| hypothetical protein [Populus balsamifera]
gi|226230812|gb|ACO39399.1| hypothetical protein [Populus balsamifera]
gi|226230814|gb|ACO39400.1| hypothetical protein [Populus balsamifera]
gi|226230816|gb|ACO39401.1| hypothetical protein [Populus balsamifera]
gi|226230818|gb|ACO39402.1| hypothetical protein [Populus balsamifera]
gi|226230820|gb|ACO39403.1| hypothetical protein [Populus balsamifera]
Length = 64
Score = 130 bits (327), Expect = 2e-27, Method: Composition-based stats.
Identities = 57/64 (89%), Positives = 60/64 (93%)
Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
LIGDFVGY+NIPSFE+FYLVIPRQG GQPQDLG NFSMWMLLER RFTLDG+ECNKIGV
Sbjct: 1 LIGDFVGYSNIPSFEDFYLVIPRQGESGQPQDLGRNFSMWMLLERVRFTLDGVECNKIGV 60
Query: 295 SYEA 298
SYEA
Sbjct: 61 SYEA 64
>gi|226230692|gb|ACO39339.1| hypothetical protein [Populus balsamifera]
gi|226230760|gb|ACO39373.1| hypothetical protein [Populus balsamifera]
gi|226230776|gb|ACO39381.1| hypothetical protein [Populus balsamifera]
Length = 64
Score = 130 bits (326), Expect = 3e-27, Method: Composition-based stats.
Identities = 57/64 (89%), Positives = 60/64 (93%)
Query: 235 LIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGV 294
LIGDFVGY+NIPSFE+FYLVIPRQG PGQ QDLG NFSMWMLLER RFTLDG+ECNKIGV
Sbjct: 1 LIGDFVGYSNIPSFEDFYLVIPRQGEPGQLQDLGRNFSMWMLLERVRFTLDGVECNKIGV 60
Query: 295 SYEA 298
SYEA
Sbjct: 61 SYEA 64
>gi|168036567|ref|XP_001770778.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677996|gb|EDQ64460.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 346
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 133/290 (45%), Gaps = 23/290 (7%)
Query: 215 VTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGNFSMW 274
+TV P AT+ D V+L GDF+ Y + P + YL P P + W
Sbjct: 15 LTVSPTQMEATNKDRNCIVHLAGDFLNYRSFPQLNDVYLFTPNADDPHD--NPFQKRVKW 72
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINR-NQ 333
+L+ + T DGLECNKIGV + F Q C P +CL +QLW + +A+
Sbjct: 73 LLIPKGHVTDDGLECNKIGVGFTPFRIQERGCYEPVGTCLASQLWTFAQAEAAACALVPP 132
Query: 334 LPLYGV--EG----RFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPG 387
P++ V EG R P+ S +I + +++ S + +E+ A +E+ RSPG
Sbjct: 133 RPIFSVLKEGVVDLRIHNFEGDPD--SRVLTITLDQIMTSVVTLEVEASGMEFFVNRSPG 190
Query: 388 KIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETS 446
KIIS +PTFEA T++G + QNTG + + Y + CS+G II P+
Sbjct: 191 KIISASVPTFEAYTRYGQMEVVVQNTGTIVSLYFIQVHACSSG---------IIDPEGPL 241
Query: 447 IRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQI 496
+ +Y AA L DS V CQF T T G Q+
Sbjct: 242 SNLYVLY--HKNAAVCVIRVDLLDSFAVNVFSQICQFQTTQTENSKGDQV 289
>gi|145492867|ref|XP_001432430.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399542|emb|CAK65033.1| unnamed protein product [Paramecium tetraurelia]
Length = 685
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 138/566 (24%), Positives = 242/566 (42%), Gaps = 89/566 (15%)
Query: 35 LSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENST-QKMRTV 93
L+ S+++ C+ ++D C+ +++++ + E S +++ NST +TV
Sbjct: 17 LTTSQIKVCDSNKNAD---CSENMLISLTI-------ENSFSTSTEQIQINSTILNNQTV 66
Query: 94 RI--PPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKT--RKCE-------PDAG---AD 139
++ P LT+ KT YA Y L Y ++ +P E + + C+ P G +
Sbjct: 67 QLSTPFTLTITKTPVYAYYPLKYFQNYNSQPYELQIPSAVNPCDDNWTSNSPTCGFQYSS 126
Query: 140 VVKICERQPICCPCGPQRRIPSSCGNVFDKLLKGK--ANTAHCLRFPGDWFHVFGIGQRS 197
K+ + Q CC CG + +V + K A A CLR+ W+ + I +
Sbjct: 127 TNKVQDSQGFCCSCGSSEYSGQNDQSVRINICKNASVATMAFCLRYSPLWYSSYNISKFV 186
Query: 198 IGFSVRIEVK-TGSKVSEVTVGPENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIP 256
I +++ I +K + +V + T+G E K + K+ I D++ PS E F L+ P
Sbjct: 187 IHYNITISIKYSNDEVEQYTLGSEVKEVKGESSIAKI--ISDYIPSNQPPSLESFMLMKP 244
Query: 257 RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHN 316
P + + +M + + F G ECNKIGVSY +F + + C SCL N
Sbjct: 245 --SSPTSHNRVQAGSAAYMFVPK-EFLGQG-ECNKIGVSYTSFKNERNSCKKLIRSCLQN 300
Query: 317 QLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELR 374
QL + + D ++N N P Y ++ G F+++N + N FSI + + + + +E+
Sbjct: 301 QLEDLYQNDIAQLNNNSQPTYLIQKYGEFKQININ-NDQYLQFSID--QQMFTTITLEIN 357
Query: 375 ADD-IEYVYQRS---PGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGV 430
I Y+ + G+I V I F + G+ NTG + + F+CST
Sbjct: 358 TTGRISYIGNKQESVKGQIDLVEIHNFSIASGSGLLYAQITNTGGSLSEFKSFFNCSTNT 417
Query: 431 TLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVD---RAECQFSTMA 487
+I S ++ P S I++ +D C FS ++
Sbjct: 418 --------------ITINSTELEPLQ--------SIIIQQDINVSIDIKKSTSCNFSLLS 455
Query: 488 ---TVLD-NGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDF 543
+LD + F + N++ ++I S GK C KCS F D
Sbjct: 456 NEGALLDWKIVYLNQFDNNTNQSNNYNQTITS-------------EGKVCEIKCSQFIDI 502
Query: 544 SCHIQYIC----LSWLVLFGLVLAIF 565
SC++Q C +++ + G +L F
Sbjct: 503 SCYLQNNCEKDAITFFTVLGGILLTF 528
>gi|320165667|gb|EFW42566.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 696
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 154/348 (44%), Gaps = 14/348 (4%)
Query: 161 SSCGNVFDKLLKG--KANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVG 218
S+CG D K + +AHC+RF W++V + + + TGS+ +TV
Sbjct: 40 STCGVYMDANSKPIRDSQSAHCMRFDQLWYNVVALDPPQMAVKFTFTIFTGSENKTITVS 99
Query: 219 PENKTATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQG-GPGQPQDLGGNFSMWMLL 277
P +TA ++D + V LIG F + YL+IP+ P PQ G S WML+
Sbjct: 100 PSQRTAKNSDGSVIVRLIGSFQSFVADYDLTTNYLLIPQPATSPTSPQVALGR-SDWMLV 158
Query: 278 ERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLY 337
++ L G C+KIGV + F Q C+ P SCL+NQ ++ AD + ++ Y
Sbjct: 159 PKSTVDLTGATCDKIGVGFTPFRYQEGQCTRPSGSCLNNQPKDFWTADTTLRRQGKMVNY 218
Query: 338 GVEGRFERMNQHPNAGS-HSFSIGVTEVLNSN--------LLIELRADDIEYVYQRSPGK 388
+E + + + A S S G VL + +E+ AD + +
Sbjct: 219 FLERYGDILGLYAGASDIVSMSPGQQYVLATRPRDPASILFTMEIAADSLTFYNNLGQAS 278
Query: 389 IISVIIPTFEALTQFGVATITTQNTGEVEASYSL-TFDCSTGVTLMEEQYFIIKPKETSI 447
I + FE+L+Q G + + +EA Y++ DC + + EQ I +
Sbjct: 279 IKVFSVTDFESLSQRGTLRVLVASETPLEALYAVRVVDCVPAIFPIAEQSQSIGSYQAKW 338
Query: 448 RSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQ 495
+F + T C+ +L +S+ ++D+ F T +T + G+Q
Sbjct: 339 YTFTLQTMTPIGGNTNCTILLVNSNSDQLDKKFVSFRTNSTTIYKGNQ 386
>gi|452824579|gb|EME31581.1| hypothetical protein Gasu_12520 [Galdieria sulphuraria]
Length = 422
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 133/296 (44%), Gaps = 19/296 (6%)
Query: 266 DLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD 325
D+ + WML++ + TL G ECNK+GVSY AF + S C SCL NQL N+ ++D
Sbjct: 19 DIPSEIAKWMLVDTDQVTLSGDECNKVGVSYSAFQDESSRCLRAVNSCLGNQLENFYKSD 78
Query: 326 QNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRS 385
+ Y V+ + + + + +S++L++ A+ + +V S
Sbjct: 79 LKALQEGTSGNYFVQFFGDFDGNEVSGANPKMRFWTDRIQSSDILLQFAAESLFHVVDVS 138
Query: 386 PGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKET 445
GKII + +A ++ G T+T QNTG+VEASY + C + + Q I P +T
Sbjct: 139 EGKIIGANVNLVQAYSKNGKMTVTLQNTGKVEASYEVAVSCPNNILPILAQQVYILPNQT 198
Query: 446 SIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSS 505
+F++ + C+ L++S + R + + + + G Q QP S
Sbjct: 199 KNVTFQVDVENTHGGHFVCNVSLQNSIGQSISRYQVKVESSGINVSTGPQAG--QPSGSD 256
Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLV 561
++ + AC++ C SFF+ C ++ C WL + ++
Sbjct: 257 ---------------GTSSTNYGSSSACQKSCGSFFNIICFFEHSC--WLNILYVI 295
>gi|328872922|gb|EGG21289.1| hypothetical protein DFA_01170 [Dictyostelium fasciculatum]
Length = 749
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 128/558 (22%), Positives = 231/558 (41%), Gaps = 79/558 (14%)
Query: 29 VVGVQILSKSKLEKCEK----RTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEE 84
+V + S ++KC + S NL+C+ K+ +++ + + E + V E+
Sbjct: 19 IVEPTFIGSSTIKKCIRDGTTTETSANLDCSEKLFVSLTLNNNQLETEQ---IQAVVYED 75
Query: 85 NSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTR------KCE----- 133
++ + P ++ +K+ + + + Y V KP E + R +CE
Sbjct: 76 GTSGNLS---YPIEVSFSKSQVFIQHPVIYETTVSNKPYETVIYKRDDIILTECEDKPTQ 132
Query: 134 PDAGADVVK---ICERQPICCPCGPQRRIP---SSCGNVFDKLLKGKANTAHCLRFPGDW 187
G VV + + Q CC C +S N+ LL ++++AHCL F
Sbjct: 133 STCGYAVVNGSAVRDSQGFCCTCIFSDYFTQDHNSRANLKCTLLNDQSSSAHCLGFDKVL 192
Query: 188 FHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLK-VNLIGDFVGYTNIP 246
++V+ I +I + + +K + PE NF + + L+ V T +
Sbjct: 193 YNVYAIQPGTILYQINATIKY--------LDPEF-------NFTRSIPLVVSPVSPTAVD 237
Query: 247 SFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFC 306
+ + L+ P P+ ++L+R F G EC+KIGVSY F QP+ C
Sbjct: 238 AKKNMILI----SDPTNPRTKMSPIQSSLILDRRLF---GDECDKIGVSYSKFQNQPNRC 290
Query: 307 SSPFWSCLHNQLWNYREADQNRINRNQ-----LPLYGVEGRFERMNQHPNAGSHSFSIGV 361
+ F +CL+NQ+ +Y + D +++++ L +G + F ++ + + I +
Sbjct: 291 GAQFGTCLNNQIDDYFKEDTDKMSKGLKGNYILSNFGSQ-MFASLDSSSSQANRFIKIQI 349
Query: 362 TEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYS 421
++ + + +EL+AD + + SPGKII + TFEA++ GV + +NTG + A Y
Sbjct: 350 DQIHQTQISLELKADQLRVIMNTSPGKIIEAYVKTFEAMSNNGVLVASIKNTGVIVAEYD 409
Query: 422 LTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAE 480
+ +CS + + Q I + F I + Y C L + + +D
Sbjct: 410 VQVKNCSQEINPIPAQRSSIAGGQYKTLQFDITTQSELKDTYYCYVDLLNGNAELLDSKL 469
Query: 481 CQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRKCSSF 540
F+ TV+ N P + D G L G C C F
Sbjct: 470 VYFNVSETVIKN--------PQGTGTRD--------GDNLNIGFE-----LTCDDYCPDF 508
Query: 541 FDFSCHI-QYICLSWLVL 557
F C + Q C S L++
Sbjct: 509 FQLLCFVSQPKCTSRLII 526
>gi|281205105|gb|EFA79298.1| hypothetical protein PPL_07716 [Polysphondylium pallidum PN500]
Length = 698
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 126/517 (24%), Positives = 212/517 (41%), Gaps = 63/517 (12%)
Query: 32 VQILSKSKLEKC------EKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEEN 85
+ ++S S+++ C E + D+ L C V+++ + S E S V +N
Sbjct: 13 IDVISSSQIQICKDDGTLESKKDNQYLKCQKMFVVSLTIDSNQDHTELSQFT--VNDVKN 70
Query: 86 STQKMRTVRIPPVLTVNKTASYAVYELTYIR--------DVPYKPQEFYMKTRKCEP--- 134
++ P ++ +K+ Y L Y + +V Y + K P
Sbjct: 71 ENGDTFSLVYPVEISFSKSKQYGKSSLIYRKSYSEEKYENVHYTNDYLLFSSCKDSPSDH 130
Query: 135 ------DAGADVVKICERQPICCPC------GPQR--RIPSSCGNVFDKLLKGKANTAHC 180
DA + V Q CC C G R R SC L G++++A C
Sbjct: 131 TCPTVRDASGNQVPY--SQGFCCSCDLGSYVGIDRDSRSHLSC-----TLFGGRSSSASC 183
Query: 181 LRFPGDWFHVFGIGQRSIGFSV-----RIEVKTG-SKVSEVTVGPENKTATSADNFLKVN 234
+ + + I + + + V TG S+ +G +N + N + +
Sbjct: 184 MAQRPLLYDSYSIEPPVTTYDIVVNITQFNVTTGASQTQTYRLGNDNLILNA--NGIVIK 241
Query: 235 LIGDFVGYTNIPSFEEFYLVIP-RQGGPGQP-QDLGGNFSMWMLLERTRFTLDGLECNKI 292
L+GDF + +FE+ L + Q P P L F M ++ ECNKI
Sbjct: 242 LVGDFASPQALRTFEDSMLFVNNEQSDPNNPIHKLP--FEMRAMIFSKSDVGSPNECNKI 299
Query: 293 GVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNA 352
Y AF QP+ CS+P SCL NQ+ +R+ D + + Y ++ + N
Sbjct: 300 ATDYVAFQNQPNQCSAPLNSCLDNQIKKFRDQDMALYAKGKKGQYLIKNYGATAEIYKNT 359
Query: 353 GSHS--FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITT 410
G+++ + ++ S + I+++AD+ +VY+ SPG I+S + TFE+++ G I T
Sbjct: 360 GNNNLFLQVELSGKQTSLVTIKIKADNFAFVYKESPGVIVSNRLETFESMSSDGNLYIQT 419
Query: 411 QNTGEVEASYSL-TFDCSTGVTLME-EQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAIL 468
+N G Y L +C+ +T+ Q F +KP E F+I T C A L
Sbjct: 420 KNIGATNTQYVLNVLNCTEAITVNNPSQVFTMKPNEIVETKFEIRTVTKFGGNQHCYADL 479
Query: 469 KDSDFSEV-DRAECQFSTMATVL------DNGSQITP 498
K + D +F+T TV+ + GS TP
Sbjct: 480 KGFAMGTLFDSILIKFNTTDTVIKVPGYNETGSGDTP 516
>gi|302842680|ref|XP_002952883.1| hypothetical protein VOLCADRAFT_105707 [Volvox carteri f.
nagariensis]
gi|300261923|gb|EFJ46133.1| hypothetical protein VOLCADRAFT_105707 [Volvox carteri f.
nagariensis]
Length = 980
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 117/469 (24%), Positives = 185/469 (39%), Gaps = 83/469 (17%)
Query: 165 NVFDKLLKGKANTAHCLRFPGDWFHVFGI-GQRSIGFSVRIEV------KTGS------- 210
N+F +L + + +AHCLR W+ + + ++ F VR+EV K S
Sbjct: 166 NMFPELER-EPVSAHCLRLDDQWYRGYNLRPGGTLQFGVRLEVHIPTAGKPASMAYVNGT 224
Query: 211 -KVSEVTVGPENKT---------ATSADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQG- 259
++S+ TV +++ T+++ L+GD Y +P L+IPR
Sbjct: 225 LRISKSTVITRSESLDLTLAGPLVTTSNKMTSARLLGDLSSYAQVPDLGTRALMIPRADF 284
Query: 260 -----GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
PG + G WM++ RT T DGL C+KIG SY AF Q + C +CL
Sbjct: 285 NETELYPGDSVPVNGR--TWMMVNRTMVTYDGLSCDKIGTSYTAFRNQQNACFRTESTCL 342
Query: 315 HNQLWNYREADQNRINRNQLPLY------GVEGRFERMNQHPNAGSHSFSIGVTEVLNSN 368
NQL + E DQ RI PLY GV G M G+ + + S
Sbjct: 343 RNQLKDLFEGDQKRIGSGMTPLYLLSQFNGVNG----MAVDAIDGAIYLRLPIASQPPS- 397
Query: 369 LLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQ--------NTGEVEASY 420
+++ + AD ++ SPGK+ ++ + F + T G T+ N G + A Y
Sbjct: 398 VMLTVSADTVQMTTALSPGKLSNLRMCEFGSTTSCGSLGFITRGHIFLSVANMGLLPADY 457
Query: 421 SLTF-DCS-TGVTLMEEQYFIIKPKETSIRS--FKIYPTTNQAAKYTCSAILKDSDFSEV 476
+ DCS V +E + + +T S IY + +C+ L D+ V
Sbjct: 458 IIVVSDCSLNNVWPIEARMITVNAGQTVNLSPPIPIYMNDTTTKERSCAVQLYDATGKAV 517
Query: 477 DRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIESIGKKLWEGLRDFITGKACRRK 536
DR + F A+ Q P ++N C
Sbjct: 518 DRQKLTFDATAS--------KGLQKPSRNVN-------------------ATNATNCDEV 550
Query: 537 CSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLFDPLY 585
C++ SC I+ C + + F +LA + L+ L + F LY
Sbjct: 551 CANPAGVSCFIENDCPAKMGRFLGILAAILAGITLMVLACKYSWFSKLY 599
>gi|449017067|dbj|BAM80469.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 708
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 119/566 (21%), Positives = 220/566 (38%), Gaps = 87/566 (15%)
Query: 13 FLLILFCI-LNLLSP--RCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSS 69
F+L+ F + L L++P VG + + C ++ C+ K VL +AV +G++
Sbjct: 10 FVLLFFQVPLWLVAPFFSASVGGSLTGVGSIVTCLDSGRPGSIPCSKKWVLTLAVENGAT 69
Query: 70 GGEASIVA-EVVEVEENSTQKMRTVRIPPV---------LTVNKTASYAVYELTYIRDVP 119
+S+ A + V ++ +R+ P +T+ K+ Y L Y D
Sbjct: 70 AASSSVSATQAVYGSSSANATVRSADNPNTVYAFKYQVHITLTKSRIRLDYPLYYQSDFN 129
Query: 120 YKPQEFYMKTRK-----------------CEPDAG-----------ADVVKICERQPICC 151
KP E K + +P G AD +I Q CC
Sbjct: 130 NKPYEIVYKYNQKGPLNWLDNQCVATWGSSDPTCGYAYNPSWSTKPAD--RILYSQGFCC 187
Query: 152 PCGPQRRIPSSCGNVFDKL------LKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIE 205
C + S + L +AHCLRF W+ F IG+ + F + +
Sbjct: 188 DCNAGDLLGLSPNRIRGGLDCSLLNFDNPTESAHCLRFDSLWYSAFQIGEPDVNFVILVN 247
Query: 206 V-------KTGSKVSE----------------VTVGPENKTATSADNFLKVNLIGDFVGY 242
V T +S +++ P + +++ + IGDF +
Sbjct: 248 VTKCPLANSTIKSISGLVGNQDQAIQNCSTEIISLSPSSPIGYASNGKISAQAIGDFAPW 307
Query: 243 TNIPSFEEFYLVIP-------RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVS 295
PS+ E +P + + + WML++ T+ G C+KIGVS
Sbjct: 308 EGTPSYSEKLFFVPSVCTDTSEAWCVDRISYIPTEINRWMLIDNDLVTITGDTCDKIGVS 367
Query: 296 YEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVE--GRFERMNQHPNAG 353
Y AF + C P SCLH+QL +Y ++D ++ Y V+ G F+ P
Sbjct: 368 YSAFTNEGQRCERPTQSCLHDQLQDYYDSDLALEQTGKVGSYFVQFFGDFDVSGLTPRNP 427
Query: 354 SHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVI--IPTFEALTQFGVAT--IT 409
F T+ + ++++ A+++ Y +P + + + I F + ++ G+ I
Sbjct: 428 LLRFFTNRTQA--TEVVLQFAAEELFYTIYLAPARFLRHLSKINPFTSQSKGGLIDLWIV 485
Query: 410 TQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILK 469
++ TG+ A ++++ C V ++ Q + P + S I T C+ L+
Sbjct: 486 SEGTGQNAAQFTVSASCEPNVEPIQAQIVTLAPGQLVSISLPIIETKATGGAGVCNCTLR 545
Query: 470 DSDFSEVDRAECQFSTMATVLDNGSQ 495
++ +D +F+ + +G+Q
Sbjct: 546 NALGQVLDVLVLEFNASSVRTTDGAQ 571
>gi|403356130|gb|EJY77656.1| HAP2-GCS1 domain containing protein [Oxytricha trifallax]
Length = 751
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 123/537 (22%), Positives = 217/537 (40%), Gaps = 73/537 (13%)
Query: 79 VVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRD--VPYKPQEFYMKTRKCE--- 133
+ +VE ++T +M+ + + K+A+ VY+LTY++D Q Y ++ C+
Sbjct: 5 ITKVENSTTGEMQQLNQWIQIGFKKSAAQIVYDLTYVQDFYASVTEQVVYTQSIFCDDSY 64
Query: 134 ----PDAG----ADVVKICERQPICCPCGPQRRIPSSCGNVFDKLLKG----KANT---- 177
P G + KI Q CC C S + DK +G ANT
Sbjct: 65 NSNDPTCGWQYDKNGNKIQYSQGFCCDCPL-----SDIFTIADKETRGFSCILANTFYAT 119
Query: 178 AHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSK------VSEVTVGPENKTATSADNFL 231
AHCL+F + F + I + +++ +++ K ++ + P+ + D+ +
Sbjct: 120 AHCLKFSSERFSAYKISPPRVEYTITAQIQIFDKNYNFYRQYDINLRPDRREKVIDDS-I 178
Query: 232 KVNLIGDFVGYTNIPSF-EEFYLVIPRQGGPGQPQDLGGNF-------SMWMLLERTRFT 283
+++IGDF+ T P + L++P +D G NF +L++R T
Sbjct: 179 YISIIGDFMP-TQFPVYYNNEILLVP-------SKDYGYNFYSTFDNCEYCLLVDREMIT 230
Query: 284 LDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
+ GLEC+KIG SY AF C P ++CL Q+ + D RI + P Y +
Sbjct: 231 MTGLECDKIGTSYYAFQTAGDKCDQPVYTCLKLQIQDLIIDDFKRIQDKKTPNYLLNHVG 290
Query: 344 ERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVYQRSPGKI----ISVIIPTFEA 399
+ + + V ++ L E+ D + ++ R+ K+ + ++ FE
Sbjct: 291 SKTGLKFVQETGQLAASCPYVQSTQLKFEVNTDRMSFI--RAVVKMSISQVKLLNNGFEG 348
Query: 400 LTQFGVATITTQNTGEVEASYSLTF-DCSTGVTLMEEQYFIIKPKETSIRSFKIYPTTNQ 458
T FG+ I +N ++ L+ +CS VT +I S RS+ +N
Sbjct: 349 YTNFGLIMINIRNDEDLAGEGQLSISNCSDFVTFFGNSVQVISVPAYSERSYNFTVGSNS 408
Query: 459 ---AAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNGSQITPFQPPKSSINDFFESIES 515
TCS L +S S ++ QF T AT ++ + ++ S E
Sbjct: 409 NLPIENNTCSIQLTNSIGSILEDRSVQFQTTATNYQTVVEVAQGILSQQKEAEYLLSKEY 468
Query: 516 IGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLL 572
+ K L DF FF C+ +C ++ V + P + +L
Sbjct: 469 LCGKC--DLEDF------------FFALFCYFYEVCTDQILRIIFVYILMPFLFAVL 511
>gi|443715870|gb|ELU07639.1| hypothetical protein CAPTEDRAFT_211680 [Capitella teleta]
Length = 914
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 146/351 (41%), Gaps = 21/351 (5%)
Query: 150 CCPCGPQRRI-PSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKT 208
CC C P+++ S+C N+AHCL+F WF V +GQ + S+R+E+
Sbjct: 160 CCSCDPKKKNRDSACA--------PHQNSAHCLQFHPLWFTVSEVGQLHMKHSIRVELME 211
Query: 209 GSKVSEVTVGPENKTATS----ADNFLKVNLIGDFVGYTNIPSFEEFYLVIPRQGGPGQP 264
++ +E + + TS ++ + + D V ++ E+ L+ P + PG P
Sbjct: 212 PTESNEWSSVADLNIGTSQPVDMNDKVTIQYKMDTVNISDPLKVEDKLLLTP-ELDPGMP 270
Query: 265 --QDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYR 322
Q+L ++L+++ L G C+ +G SY F Q CSS SC+ NQ
Sbjct: 271 LSQELMKEMDKFLLVKKDLVDLSGGSCDSVGTSYPGFLNQKEACSSSLESCMKNQPLELL 330
Query: 323 EADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVLNSNLLIELRADDIEYVY 382
ADQ R++R + + ++ + GSH + EV S + + + AD+++
Sbjct: 331 MADQMRLSRGEAAQLMIGHLGLALSPTYSGGSH-LAFLSNEVHLSQVTVLIDADNLDLKT 389
Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTFDCSTGVTLMEEQYFIIKP 442
S II V+ + + T +T N G ++ C V + Q + P
Sbjct: 390 ASSDVAIIDVVSTSADHKT---TVKMTLFNAGILDVEVHAQMTCPWLVDIPASQKNRLMP 446
Query: 443 KETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTMATVLDNG 493
++ F N + C+ + D EV R E + L G
Sbjct: 447 YHSARVLFHFDADLND-TEVVCNVQVMDIQDEEVARREVSLKQSTSCLCMG 496
>gi|449664645|ref|XP_002159421.2| PREDICTED: protein HAPLESS 2-A [Hydra magnipapillata]
Length = 385
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 146/373 (39%), Gaps = 79/373 (21%)
Query: 8 LKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLN----------CTTK 57
K+K LL F N+ VG ILSKS +E CE S++L C K
Sbjct: 5 FKMKKQLLSSF--FNITVNIIFVGGLILSKSSIEFCENTGSSNDLKDPTNVVTQSACEKK 62
Query: 58 IVLNMAVPSGSSGGEASIVAEVVEVEENS-TQKMRTVRIPPVLTVNKTASYAVYELTY-- 114
+V+ ++V G+ GE + VV V +NS T + + P ++TV+K+ Y + +
Sbjct: 63 MVVLLSV--GNKQGETEKLQAVVSVVQNSATNEFARLYNPFMITVSKSPVYLNFPFFFNG 120
Query: 115 --IRDVPYK-----PQEFYMK--TRKC--------------------------EPDAGAD 139
+ + PY+ +Y+ +R+C + D
Sbjct: 121 ITVNNQPYEEIILSKNRWYVSDSSRQCLDQWQVEEEDDEHPTCGYQYTNSTQKQTDGTWK 180
Query: 140 VVK--ICERQPICCPCGPQRR---------IPSSCGNVFDKLLKGKANTAHCLRFPGDWF 188
VK I + Q CC C + + G + L +AHC+R W+
Sbjct: 181 TVKTRIWDSQGFCCYCTQDLKNYYIKKDIQDANRAGIICKPLTNSPQASAHCMRMSNLWY 240
Query: 189 HVFGIGQRSIGFSVRIE--------VKTGSKV-----SEVTVGPENKTATSADNFLKVNL 235
+ + FS+ ++ V+ S + E+ + P K+AT + N + N
Sbjct: 241 TLNEFTESYRDFSIYVKAFDQITKVVQNKSYIDYVNGGEILLSPSQKSATGSYNRITGNY 300
Query: 236 IGDFVGYTNIPSFEEFYLVIPRQG---GPGQPQDLGGNFSMWMLLERTRFTLDGLECNKI 292
+GD + P Y +IP P + L S WM++ R + D +C+ I
Sbjct: 301 VGDLQPIKSYPVLTNNYFLIPFSSTNVDPKKESQLKSGISKWMIIPRDLVSTDAKQCDMI 360
Query: 293 GVSYEAFNGQPSF 305
GV Y AF Q ++
Sbjct: 361 GVGYSAFRNQAAY 373
>gi|449686992|ref|XP_004211319.1| PREDICTED: protein HAPLESS 2-A-like, partial [Hydra magnipapillata]
Length = 331
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 92/181 (50%), Gaps = 8/181 (4%)
Query: 313 CLHNQLWNYREADQNRINRNQLPLY--GVEGRFERMNQHPNAGSHSFSIGVTEVLN---S 367
CL NQ +N D++R+ + ++P Y G+ + Q N G + + E+ + S
Sbjct: 1 CLANQPYNKFMDDEDRLEKGKMPWYFPARYGKLAGVKQ--NIGDNDKYLLTYELDDEQIS 58
Query: 368 NLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVEASYSLTF-DC 426
+ +++ ADD+ VY R+ G I I FEAL+ G ++ NTG V + + ++ C
Sbjct: 59 LVTLQISADDVVLVYNRATGIITRTAIQDFEALSLEGQLSVDVLNTGYVSSDFRISIPSC 118
Query: 427 STGVTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQFSTM 486
++GV +EE+ I P+ T +FK+ +T++ + + C+ L DS + FST
Sbjct: 119 TSGVQPIEEKRITIDPQMTETITFKMMTSTDKKSAHDCTINLYDSKNILLQSRNFTFSTK 178
Query: 487 A 487
A
Sbjct: 179 A 179
>gi|242019036|ref|XP_002429972.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515027|gb|EEB17234.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1342
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 76/270 (28%), Positives = 122/270 (45%), Gaps = 52/270 (19%)
Query: 172 KGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KTGSKVSEVTVGPENKTA 224
K N+AHCLRF G W+ ++ I + + +V ++V S+ ++T G +
Sbjct: 897 KCSGNSAHCLRFGGLWYGMYQIKKPVVAQTVGVQVFEKNVFFNGNSEWRDLTKGKMVRVG 956
Query: 225 T---SADNFL---KVNLIGDFVGY--TNIPSFEEFYLVIPR--QGGPGQPQDLGGNFSMW 274
T A++ L + GD T+I S + + L+IP +GGP +
Sbjct: 957 TFLPKAEDELPTFSMTYAGDNTKKISTSIDS-DNYMLLIPSHMEGGPDE----------- 1004
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQ---LWNYREADQNRINR 331
L+ R++ G ECN IG S+EAF+ QP C+ P SCL Q LW RE + RI
Sbjct: 1005 YLVVRSKDVSSGSECNMIGTSFEAFSQQPDRCARPKGSCLTRQPVDLW--REDMEARIR- 1061
Query: 332 NQLPLYGVEGRF--ERMNQHPNAGSHSFSIGVTEVL--------NSNLLIELRADDIEYV 381
G G++ E PN+ ++S E L S + IE++AD +
Sbjct: 1062 ------GKRGKYFVENFGTVPNSPVKTYSNLSGEYLALEYYGDYTSTIEIEMKADFNVMI 1115
Query: 382 YQRSPGKIISVII-PTFEALTQFGVATITT 410
+ S +I SV + T+ T+ ++ + T
Sbjct: 1116 RKGSSAQIPSVYVDSTYPDKTRIVLSVLNT 1145
>gi|326433608|gb|EGD79178.1| hypothetical protein PTSG_09908 [Salpingoeca sp. ATCC 50818]
Length = 1226
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 119/551 (21%), Positives = 204/551 (37%), Gaps = 126/551 (22%)
Query: 34 ILSKSKLEKCEKRTDSDNL------NCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENST 87
+++ S+LE CE + ++ C K+V+ +++ +G E V EV EN
Sbjct: 30 VIANSRLEVCESTSATNQPLSSLGGTCKKKLVMTLSIGNGQQLPE---VVEVTNYIENGQ 86
Query: 88 QK--MRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEFYMKTRKCE------------ 133
+K + RI P K+ ++ Y L Y + P E + K ++
Sbjct: 87 EKPLGQRYRIYPT----KSDAFVNYPLVYQSSINSLPYEHWFKVKRSNTQIAFFFDPCRD 142
Query: 134 ------PDAGADVVK---ICERQPICCPCGPQRRIPSSC--GNVFDKLLKGKANTAHCLR 182
P G V K I Q CC C ++ GN+ +L ++AHCLR
Sbjct: 143 APTAEHPTCGYFVRKGERIPNSQGFCCKCKFFSGGGNNAYRGNLKCRLYSPHYSSAHCLR 202
Query: 183 FPGDWFHVFGIGQRSIGFSVRIEVKTGSK------------------------------V 212
F +W+ + + + + ++R+ V + V
Sbjct: 203 FWPNWYDAWELDRAQVSHTIRLAVYKETSFNGTIPEPTQACKDSNARPTRRVNGFAFYCV 262
Query: 213 SEVTVGPENKTATSADNFLKVNLIGDFVG-------YTNI---PSFEEFY----LVIPRQ 258
+ +G +NK A S + + +GD T + P+F +F + P +
Sbjct: 263 DVLLIGVQNKVARSTNGTVIATYLGDLAPSIFPHDLTTRVLLTPNFYQFENSTDPMFPFR 322
Query: 259 GGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF--NGQPSFCSSPFWSCLHN 316
P W+LL++T L G CNK GVS+ AF +G+ C+ +SCL N
Sbjct: 323 RSPAH----------WLLLDKTDVDLSGETCNKPGVSFTAFYQHGKSGGCNMDPFSCLDN 372
Query: 317 QLWNYREADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIGVTEVL--------NSN 368
Q + D +R+ + Y + GR P + V +L S
Sbjct: 373 QPAHLVAQDLDRLAAGRAAQY-MAGRL----GPPLVKRQAVGERVQNMLAFKADIPQESL 427
Query: 369 LLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVE--------ASY 420
+E+ ADD ++ V FE L +G+A+++ +E ++
Sbjct: 428 YTLEIEADDSRIF-------VLDVSFSIFE-LGVYGLASVSRDIFLWIELARLDKRPSTV 479
Query: 421 SLTFDCSTGVTLMEEQYFIIKPKETSIRSFKIYP---TTNQAAKYTCSAILKDSDFSEVD 477
L+ +CS V + K +T I P ++ Q +TC K++ D
Sbjct: 480 YLSVECSDLVLPVPTTAISWKQTDTGHLRDAIIPLRTSSTQEVVHTCRVEAKNARGRITD 539
Query: 478 RAECQFSTMAT 488
F T AT
Sbjct: 540 TGFVAFKTTAT 550
>gi|281203885|gb|EFA78081.1| hypothetical protein PPL_08729 [Polysphondylium pallidum PN500]
Length = 371
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 91/196 (46%), Gaps = 21/196 (10%)
Query: 308 SPFWSCLHNQLWNYREADQNRINRNQLPLY--GVEGRFERMNQHPNAGSHSFSIGVTEVL 365
+P +CL NQ+ +R+ D + LY G +G+++ +N +A + ++ L
Sbjct: 179 APMNACLDNQIKKFRDQD--------MALYAKGKKGQYQIINYGASADIYRYTGNKNLFL 230
Query: 366 --------NSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTGEVE 417
S L +++AD+ Y Y+ SP +S + TFE+++ G+ I T+N G +
Sbjct: 231 QVELGGKQTSVLTNKIKADNFAYAYKESPAVFVSNRLETFESMSTDGILYIQTKNIGATK 290
Query: 418 ASYSL-TFDCSTGVTL-MEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSE 475
Y L +C+ G+T+ Q F +KP E F+I T C A LK
Sbjct: 291 EQYDLNVLNCTNGITVNTPSQVFTMKPNEIFESKFEIRTVTKLGGIQHCYADLKGFAMGT 350
Query: 476 V-DRAECQFSTMATVL 490
+ D +F+T TV+
Sbjct: 351 LFDSILIKFNTTDTVI 366
>gi|167524322|ref|XP_001746497.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775259|gb|EDQ88884.1| predicted protein [Monosiga brevicollis MX1]
Length = 1058
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 126/558 (22%), Positives = 200/558 (35%), Gaps = 106/558 (18%)
Query: 14 LLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEA 73
LL F I+ L SP V QI + ++E+C R S NC K+VLN+ V + GGE
Sbjct: 7 LLACFTIMGL-SP--AVQAQIKAAGQIERC-LRDGSLEPNCERKMVLNLGVLGSTGGGEY 62
Query: 74 SIVAEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELTYIRDVPYKPQEF-------- 125
+ + E+ + I + V KT+ Y L Y V KP E
Sbjct: 63 YHLTQS-RTEDGAISDSENEFI---IVVQKTSVSIEYPLRYRGVVNNKPYEIAQPVQTLL 118
Query: 126 ----------------YMKTRKCEPDAGADVVKICERQPICCPCGPQRRI----PSSC-- 163
Y + C K+ Q CC C ++ P+
Sbjct: 119 GGLFGSGDKTGCKDSPYHSSPSCGWLIDGGGKKVEGSQGFCCRCSTADQLGIGMPTDSYR 178
Query: 164 GNVFDKLLKGKANTAHCLRFPGD-WFHVFGIGQRSIGFSVRIEV----KTGSKVSEVT-- 216
N+ L +AHC R+ + W+ V+ I F V + + G S T
Sbjct: 179 ANLDCGLFGKGQQSAHCFRYSDELWYGVYDFDPGHIRFKVYVSIYRKYAVGPGYSHATQD 238
Query: 217 ---------------------------VGPENKTATSADNFLKVNLIGDFVGYTNIPSFE 249
VGP + ++D + V GDF +
Sbjct: 239 DVPAASPDCTVELPDEYADFRCQGVLEVGPHLRGGLTSDGAVSVVFGGDFATPVQLRDMS 298
Query: 250 EFYLVIP------RQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF--NG 301
L+ P Q +D +++++++ + G C+KIGV AF +G
Sbjct: 299 SKTLLAPILETVDNWHTHEQTKD---GMDTYLVVDKSDIDMTGSTCDKIGVEPLAFYLHG 355
Query: 302 QPSFCSSPFWSCLHNQLWNYREADQNRINRNQLPL-----YGVEGRFERMNQHPNAGSHS 356
+ C +CL+NQ ++ ADQ +N + P Y G + + +
Sbjct: 356 KGGGCGVSEGTCLNNQPRDFFLADQALVNAGKRPFNLLSAYNPYGFAQVEDDEESRYGIR 415
Query: 357 FSIGVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEALTQFG--VATITTQNTG 414
F + E S++ I + ADD+ +++ P FEAL+ G +A I + N
Sbjct: 416 FPV---EQHISSITISVNADDLTVYTAVCRDMTLALGSPNFEALSGNGFILANIYS-NCP 471
Query: 415 EVEASYSLTFDCSTGVTLMEEQYFIIKPKETSIRSF-----KIYPTTNQAAKYTCSAILK 469
+ A S++ C G Y E I SF IY + +A + C ++
Sbjct: 472 NMTALVSVSVICH-GNARGSASY------EREITSFIQLQLPIYVVSEEAGDHFCDVLVF 524
Query: 470 DSDFSEVDRAECQFSTMA 487
D+ V FST A
Sbjct: 525 DAVGYLVVNKSVTFSTSA 542
>gi|221504805|gb|EEE30470.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 972
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 80/164 (48%), Gaps = 16/164 (9%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
++L++ ++ G EC+K+G + + + FC+ +C+ QL ++E D+ RI +N
Sbjct: 269 IILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCNLLPGTCITGQLRKFKEVDKLRIEQNL 328
Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
PLY ++ G F R +P G+ + G L S++ E+ A D+ ++
Sbjct: 329 APLYALKREFGGFPRYAPNPMNGTGFSTTGTRHYLGYDFGEQHYSDIRFEMDATDVTWLR 388
Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQ----NTGEVEASYSL 422
SPG I + +P +A + + + N+G +A++++
Sbjct: 389 ATSPGHITFIEVPQLDACSSSTIGGCPLKAYVWNSGNEDAAFAV 432
>gi|237839869|ref|XP_002369232.1| hypothetical protein TGME49_085940 [Toxoplasma gondii ME49]
gi|211966896|gb|EEB02092.1| hypothetical protein TGME49_085940 [Toxoplasma gondii ME49]
Length = 972
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 80/164 (48%), Gaps = 16/164 (9%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
++L++ ++ G EC+K+G + + + FC+ +C+ QL ++E D+ RI +N
Sbjct: 269 IILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCNLLPGTCITGQLRKFKEVDKLRIEQNL 328
Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
PLY ++ G F R +P G+ + G L S++ E+ A D+ ++
Sbjct: 329 APLYALKREFGGFPRYAPNPMNGTGFSTTGTRHYLGYDFGEQHYSDIRFEMDATDVTWLR 388
Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQ----NTGEVEASYSL 422
SPG I + +P +A + + + N+G +A++++
Sbjct: 389 ATSPGHITFIEVPQLDACSSSTIGGCPLKAYVWNSGNEDAAFAV 432
>gi|401404273|ref|XP_003881687.1| hypothetical protein NCLIV_014480 [Neospora caninum Liverpool]
gi|325116100|emb|CBZ51654.1| hypothetical protein NCLIV_014480 [Neospora caninum Liverpool]
Length = 1133
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 66/137 (48%), Gaps = 12/137 (8%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
++L++ ++ G EC+K+G + + + FC SC+ QL ++E D+ RI +N
Sbjct: 292 VILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCYLLPGSCITGQLRKFKEVDRLRIEQNL 351
Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
PLY ++ G F R P + S G L S++ E+ A+D+ ++
Sbjct: 352 APLYALKREFGGFPRYAPDPMNATSLSSAGTRHYLGYDFGEQHYSDIRFEMDANDVTWLR 411
Query: 383 QRSPGKIISVIIPTFEA 399
SPG I + +P +A
Sbjct: 412 ATSPGHITFIEVPQLDA 428
>gi|221484611|gb|EEE22905.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 972
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 79/164 (48%), Gaps = 16/164 (9%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNG-QPSFCSSPFWSCLHNQLWNYREADQNRINRNQ 333
++L++ ++ G EC+K+G + + + FC+ +C+ QL ++E D+ RI +N
Sbjct: 269 IILDKDYVSVTGYECDKVGTGLDRWGDMRGEFCNLLPGTCITGQLRKFKEVDKLRIEQNL 328
Query: 334 LPLYGVE---GRFERMNQHPNAGSHSFSIGVTEVLN--------SNLLIELRADDIEYVY 382
PLY ++ G F R +P + + G L S++ E+ A D+ ++
Sbjct: 329 APLYALKREFGGFPRYAPNPMNRTGFSTTGTRHYLGYDFGEQHYSDIRFEMDATDVTWLR 388
Query: 383 QRSPGKIISVIIPTFEALTQFGVATITTQ----NTGEVEASYSL 422
SPG I + +P +A + + + N+G +A++++
Sbjct: 389 ATSPGHITFIEVPQLDACSSSTIGGCPLKAYVWNSGNEDAAFAV 432
>gi|270010014|gb|EFA06462.1| hypothetical protein TcasGA2_TC009345 [Tribolium castaneum]
Length = 964
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 83/203 (40%), Gaps = 30/203 (14%)
Query: 156 QRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KT 208
QRR +C + + + + HCL F W+ V+ I + I +RI++
Sbjct: 286 QRRGGQTCDDADLNIPESFRESTHCLTFSNMWYSVYQISKPEIIHKLRIQIFQKYEDCHG 345
Query: 209 GSKVSEVTVGPENKTATSADNFLKVNLIGDFVG-----YTNIPSFEEFYLVIP--RQGGP 261
+ ++T G + T +++ ++I + ++ L+IP R P
Sbjct: 346 NTHWMDITQGKTIELGTQTPLYVEKDIIAKYCSEDIDFQDQALDYKNLKLLIPERRVVDP 405
Query: 262 GQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNY 321
Q +MLL + + DG C+ GV YEAF Q C+ P SCL NQ
Sbjct: 406 EQ----------FMLLPKNSVS-DGRTCDTAGVGYEAFFKQRKRCAQPQGSCLGNQPNQL 454
Query: 322 READ-----QNRINRNQLPLYGV 339
E+D Q R+ + L YG
Sbjct: 455 HESDAEAVKQGRVGQYFLKFYGT 477
>gi|91085727|ref|XP_973371.1| PREDICTED: similar to synaptic vesicle protein 2 [Tribolium
castaneum]
Length = 1537
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 83/203 (40%), Gaps = 30/203 (14%)
Query: 156 QRRIPSSCGNVFDKLLKGKANTAHCLRFPGDWFHVFGIGQRSIGFSVRIEV-------KT 208
QRR +C + + + + HCL F W+ V+ I + I +RI++
Sbjct: 286 QRRGGQTCDDADLNIPESFRESTHCLTFSNMWYSVYQISKPEIIHKLRIQIFQKYEDCHG 345
Query: 209 GSKVSEVTVGPENKTATSADNFLKVNLIGDFVG-----YTNIPSFEEFYLVIP--RQGGP 261
+ ++T G + T +++ ++I + ++ L+IP R P
Sbjct: 346 NTHWMDITQGKTIELGTQTPLYVEKDIIAKYCSEDIDFQDQALDYKNLKLLIPERRVVDP 405
Query: 262 GQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNY 321
Q +MLL + + DG C+ GV YEAF Q C+ P SCL NQ
Sbjct: 406 EQ----------FMLLPKNSVS-DGRTCDTAGVGYEAFFKQRKRCAQPQGSCLGNQPNQL 454
Query: 322 READ-----QNRINRNQLPLYGV 339
E+D Q R+ + L YG
Sbjct: 455 HESDAEAVKQGRVGQYFLKFYGT 477
>gi|71029132|ref|XP_764209.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351163|gb|EAN31926.1| hypothetical protein TP04_0574 [Theileria parva]
Length = 759
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 21/143 (14%)
Query: 285 DGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNY--READQNRINRNQLPLYGVEGR 342
DGL C+KIG+S + + Q C+S SCL NQL +Y +E D+ ++ + LYGVE
Sbjct: 390 DGLMCDKIGLSMKRWANQEEICNSSPGSCLKNQLKHYFDQEKDEAKLPK----LYGVEPT 445
Query: 343 FERMNQHPNAGSHSFSI-GVTEVLNSNLLIELRADDIEYVYQRSPGKIISVIIPTFEA-- 399
F A S+ V E + L R + Y++ + + + I TF+A
Sbjct: 446 F-------TAVKKDLSLPAVKEANKTTLDDPNRIHTLTYIHSKD--DVTRLKIDTFDATV 496
Query: 400 ---LTQFGVATITTQNTGEVEAS 419
++ F ++ + GE E S
Sbjct: 497 TEIISDFPGFIVSAKMDGECEVS 519
>gi|224084920|ref|XP_002307449.1| predicted protein [Populus trichocarpa]
gi|222856898|gb|EEE94445.1| predicted protein [Populus trichocarpa]
Length = 228
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%)
Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
I F + L+ DF+ GK+C C+S +DF C+I+ C++ L+ VLA+
Sbjct: 5 IISFLSGFTKVIGDLFGSPLDFLAGKSCSSVCASPWDFFCYIENFCVASLLKMVAVLALL 64
Query: 566 PTVLVLLWLLHQKGL 580
VL+ +LL++ G+
Sbjct: 65 YIVLLFFYLLYKTGI 79
>gi|209877042|ref|XP_002139963.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555569|gb|EEA05614.1| hypothetical protein CMU_026210 [Cryptosporidium muris RN66]
Length = 696
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 51/245 (20%), Positives = 97/245 (39%), Gaps = 33/245 (13%)
Query: 275 MLLERTRFTLDGLECNKIGVS---YEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINR 331
++L G+ CNKI S + + NG+ FC P +C + Q+ +Y +
Sbjct: 329 LILPPDTVDFTGVSCNKIASSIYTWSSVNGR--FCYHPPLTCQNVQIADYYKKLIKDQTS 386
Query: 332 NQLPLYGVEGR---------------FERMNQHPNAGSHSFSIGVT--EVLNSNLLIELR 374
++ + VE + M N + F +G + ++ ++ +
Sbjct: 387 GKISEFSVEAQNSGEPQLIITPSNYNSSNMISDDNNSTLQFYLGYVFDSIFDTEIMFSVE 446
Query: 375 ADDIEYVYQRSPGKIISVIIPTFEALTQFGVA----TITTQNTGEVEASYSLTFDCSTG- 429
A + +V +PG I + P E+ G I +N+G E+ + + T
Sbjct: 447 ASSVSWVASAAPGIITYIEPPPIESCFAMGYTGCPIKIYVRNSGTFESGFVVQIPYCTKD 506
Query: 430 ------VTLMEEQYFIIKPKETSIRSFKIYPTTNQAAKYTCSAILKDSDFSEVDRAECQF 483
V + Q IK + T + +F I + +KY C+A+L +S +D+ F
Sbjct: 507 SKPTNEVNPIMAQSRSIKAQSTGVFTFIIGVSVTSGSKYECTAVLYNSFSIHLDQHLFTF 566
Query: 484 STMAT 488
ST ++
Sbjct: 567 STQSS 571
>gi|388557120|dbj|BAM16295.1| generative cell specific-1 [Eimeria tenella]
Length = 834
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 71/170 (41%), Gaps = 21/170 (12%)
Query: 275 MLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREADQNRINRNQL 334
++L+ T+DG C+ GVS + + G+ FC +C L + E ++ +
Sbjct: 374 IVLDEQHVTVDGSTCDLPGVSLQQW-GRDGFCDYAQGTCFAKNLKWFHEYNEQAAELGRT 432
Query: 335 PLYGVE---GRFERMN----------QHPNAGS---HSFSIGVTEVLNSNLLIELRADDI 378
PLY +E G + R + AG H + + S + IE+ A I
Sbjct: 433 PLYALEYPPGNYPRYHVGLDNVDDAIDTSKAGPFELHRLAFAYPDSHKSKVRIEMNAGLI 492
Query: 379 EYVYQRSPGKIISVIIPT---FEALTQFGVA-TITTQNTGEVEASYSLTF 424
++ SPG+I S+ P + FG + N+G ++A+Y L
Sbjct: 493 RWIQSTSPGQITSIAPPAPRECDNAQTFGCPLKVYVLNSGTIDATYYLEL 542
>gi|452824580|gb|EME31582.1| hypothetical protein Gasu_12530 [Galdieria sulphuraria]
Length = 265
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 96/247 (38%), Gaps = 49/247 (19%)
Query: 1 MRNQTKSLKLKHFLLILFCILNLLSPRCVVGVQILSKSKLEKCEKRTDSDNLNCTTKIVL 60
MR S L +++I + L + R ++S +++ C S +L C V+
Sbjct: 1 MRRTKPSCNL--YIIICVVLFYLKATR----ATLISAGEIQSC-TNNGSSSLQCDKMWVV 53
Query: 61 NMAVPSGSSGGEASIV-------AEVVEVEENSTQKMRTVRIPPVLTVNKTASYAVYELT 113
+AV +G G ++++ +E V+ + N K +T++K+ Y L
Sbjct: 54 TLAVANGQQGVDSTVAKVFGSNQSEYVK-DPNDPNKAYLFNYTLHITLSKSKIAIEYPLV 112
Query: 114 YIRDVPYKPQEFYMK----------TRKC---------------EPDAGADVV-KICERQ 147
Y++D +P E + T C +P D +I Q
Sbjct: 113 YLQDFNNQPYEIVYEANSNGPLEEYTNPCVDSWGSSNPTCGYAYDPPNEIDAANRIYNSQ 172
Query: 148 PICCPCGPQRRIPS------SCGNVFDKLLKGK-ANTAHCLRFPGDWFHVFGIGQRSIGF 200
CC CG + SC N+ L + + + AHCLRF W+ F IG +G
Sbjct: 173 GFCCQCGVSDYLAEGTREGLSC-NLLGSLFQIEPSQAAHCLRFDPLWYSGFQIGTYEVGV 231
Query: 201 SVRIEVK 207
RI +K
Sbjct: 232 QQRIYLK 238
>gi|449466348|ref|XP_004150888.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
Length = 85
Score = 47.0 bits (110), Expect = 0.038, Method: Composition-based stats.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 116 RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+DV KP+E+Y+ TRKCE +A A VV+ICER
Sbjct: 39 KDVSCKPEEYYVTTRKCESNASARVVQICER 69
>gi|449520257|ref|XP_004167150.1| PREDICTED: protein HAPLESS 2-like [Cucumis sativus]
Length = 85
Score = 46.2 bits (108), Expect = 0.057, Method: Composition-based stats.
Identities = 19/31 (61%), Positives = 24/31 (77%)
Query: 116 RDVPYKPQEFYMKTRKCEPDAGADVVKICER 146
+D KP+EFY+ TRKCE +A A VV+ICER
Sbjct: 39 KDFSCKPEEFYVTTRKCESNASARVVQICER 69
>gi|389603325|ref|XP_001569028.2| similar to leishmania major. l411.4-like protein [Leishmania
braziliensis MHOM/BR/75/M2904]
gi|322505809|emb|CAM44161.2| similar to leishmania major. l411.4-like protein [Leishmania
braziliensis MHOM/BR/75/M2904]
Length = 570
Score = 45.8 bits (107), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 66/349 (18%), Positives = 139/349 (39%), Gaps = 66/349 (18%)
Query: 36 SKSKLEKCEKRTDSDNLNCTTKIVLNMAVPSGSSGGEASIVAEVVEVEENSTQKM----- 90
+ + + C+ + + C K+V+++ + + G +I+ V VE Q +
Sbjct: 7 ATAYVRHCDATSSATPPGCVRKLVVDLTLDDRTLAG--AILETEVTVEHALHQSLFPHDA 64
Query: 91 -----------RTVRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKT--------- 129
V +PP+ + + ++A Y LTY+R P ++ Y+K
Sbjct: 65 ASDVAGAAATSLQVSLPPIRVALRRSAVQVRYMLTYLRTFPAALRD-YVKVLHTAMSCDD 123
Query: 130 --RKCEPDAGADVVKICERQPICCPC-GPQRRIPSSCGNVFDK---LLKGKANTAHCLRF 183
+C + +CC C G + + + NV + + A + C++
Sbjct: 124 GVTRCPSYTSMTGALVSAPLGVCCLCIGIECALTNEFCNVSMRGHFCFRTGAAGSICVQN 183
Query: 184 PGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGD 238
G +H + +G +++R+ +G +++ T+ P + S + ++ + +
Sbjct: 184 EGIVYHGWSVGSPLPYYTLRLS-ASGQGIAQTTLQLTTDAPSAQAGASFLHLVQASGVSP 242
Query: 239 FVGYTNIPSFEEFYLVIP------------RQGGPGQPQDLGGNFSMWMLLERTRFTLDG 286
G T + L +P R P + W+LL + + G
Sbjct: 243 GEGGTTV-DIAGRVLFVPSAESSSGSTSHVRDDDPAE----------WLLLPASLVSNSG 291
Query: 287 LECNKIGVSYEAFNGQPSF--CSSPFWSCLHNQLWNYREADQNRINRNQ 333
EC+K+G+S + F Q S C++ +C+ +QL +YRE D +I + +
Sbjct: 292 NECDKVGISPDYFYSQSSTTQCNAQKGTCVRHQLADYREEDLAQIAQGK 340
>gi|157877293|ref|XP_001686969.1| similar to leishmania major. l411.4-like protein [Leishmania major
strain Friedlin]
gi|68130044|emb|CAJ09352.1| similar to leishmania major. l411.4-like protein [Leishmania major
strain Friedlin]
Length = 576
Score = 45.8 bits (107), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 63/284 (22%), Positives = 115/284 (40%), Gaps = 46/284 (16%)
Query: 93 VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFY--MKTR--------KCEPDAGADVV 141
V +PP+ + + + A Y LTY+R P ++ +KT +C
Sbjct: 78 VSLPPITVAMRRGAVQMRYGLTYLRTFPAALRDSVRVLKTAMSCDDGVTRCPSYMSMTGT 137
Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDK---LLKGKANTAHCLRFPGDWFHVFGIGQRS 197
+ +CC C + + S N + + A C++ G +H + +G S
Sbjct: 138 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQSEGITYHGWAVGSSS 197
Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTN-------- 244
+ + + +G ++ T+ PE + SA L+ + G G +N
Sbjct: 198 PYYMMHLS-ASGRGIAPTTLQLTTDAPEVQKGASALQILRAS--GVLPGESNPTVDISGR 254
Query: 245 ---IPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
+PS E GP + D + W+LL ++ G +C+K+G+S + F
Sbjct: 255 VLFVPSAEHSSASRSISTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYFYS 310
Query: 302 QPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
S C++ +C+ +QL +YR AD +I + GV GR+
Sbjct: 311 LSSTKQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 348
>gi|65335255|gb|AAY42350.1| excreted/secreted protein 37 [Leishmania major]
Length = 611
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 63/284 (22%), Positives = 115/284 (40%), Gaps = 46/284 (16%)
Query: 93 VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFY--MKTR--------KCEPDAGADVV 141
V +PP+ + + + A Y LTY+R P ++ +KT +C
Sbjct: 113 VSLPPITVAMRRGAVQMRYGLTYLRTFPAALRDSVRVLKTAMSCDDGVTRCPSYMSMTGT 172
Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDK---LLKGKANTAHCLRFPGDWFHVFGIGQRS 197
+ +CC C + + S N + + A C++ G +H + +G S
Sbjct: 173 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQSEGITYHGWAVGSSS 232
Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTN-------- 244
+ + + +G ++ T+ PE + SA L+ + G G +N
Sbjct: 233 PYYMMHLS-ASGRGIAPTTLQLTTDAPEVQKGASALQILRAS--GVLPGESNPTVDISGR 289
Query: 245 ---IPSFEEFYLVIPRQGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAFNG 301
+PS E GP + D + W+LL ++ G +C+K+G+S + F
Sbjct: 290 VLFVPSAEHSSASRSISTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYFYS 345
Query: 302 QPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
S C++ +C+ +QL +YR AD +I + GV GR+
Sbjct: 346 LSSTKQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 383
>gi|359492377|ref|XP_003634404.1| PREDICTED: uncharacterized protein LOC100854126 [Vitis vinifera]
Length = 234
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 39/75 (52%)
Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
I FF + L+ DF++GK+C C +DF C+I+ C++ L+ +V +
Sbjct: 5 IGSFFSGFARVIGDLFGSPLDFLSGKSCSSVCGITWDFICYIENFCVANLLKIAMVSFLL 64
Query: 566 PTVLVLLWLLHQKGL 580
VL+ +LL + G+
Sbjct: 65 YIVLLFFYLLCKLGI 79
>gi|307176762|gb|EFN66162.1| hypothetical protein EAG_13618 [Camponotus floridanus]
Length = 1820
Score = 43.9 bits (102), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 42/177 (23%), Positives = 69/177 (38%), Gaps = 32/177 (18%)
Query: 177 TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVKTGSKVSEVTVGPENKTATSADNFLKVNLI 236
+AHCLRF W+ V+ + + V ++V + ++ E+ T S + +
Sbjct: 1265 SAHCLRFSDLWYSVYQLEDPIVEHIVYLQVYEKRTLRNGSIYWEDLTEDSV--IVYAIRL 1322
Query: 237 GDF------------VGYTNIPSFEEFY-------------LVIPRQ---GGPGQPQDLG 268
G F Y IP + E L++P G P +
Sbjct: 1323 GTFNRHHRGSQGTIVFTYKKIPGWREEEEEEAPNLDVVRDRLLVPSSVTSKDSGYP--VK 1380
Query: 269 GNFSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQLWNYREAD 325
G + ++++ + +G EC+K GV + AF QP C +CL NQ YR D
Sbjct: 1381 GEANEYLVVPASSINENGNECDKAGVGFAAFAKQPDRCGHVSGTCLKNQPLAYRRHD 1437
>gi|328705538|ref|XP_003242840.1| PREDICTED: hypothetical protein LOC100573999 [Acyrthosiphon pisum]
Length = 754
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 274 WMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQ---LW 319
+++L + G ECNK GVSYEAF Q + C SCL+NQ LW
Sbjct: 409 YLILNANNISTKGDECNKAGVSYEAFFKQSNRCGVKRSSCLNNQPSHLW 457
>gi|398024716|ref|XP_003865519.1| similar to leishmania major. l411.4-like protein [Leishmania
donovani]
gi|322503756|emb|CBZ38842.1| similar to leishmania major. l411.4-like protein [Leishmania
donovani]
Length = 577
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 114/286 (39%), Gaps = 49/286 (17%)
Query: 93 VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKTR----------KCEPDAGADVV 141
V +PP+ + + + A Y LTY+R P ++ R +C
Sbjct: 78 VSLPPITVAIQRGAVQMRYGLTYLRTFPAALRDSVRVLRTAMSCDDGVTRCPSYMSMTGA 137
Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDKL---LKGKANTAHCLRFPGDWFHVFGIGQRS 197
+ +CC C + + S N + + A C++ G +H + +G S
Sbjct: 138 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQGEGITYHGWSVGSSS 197
Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTNIPSFE--E 250
+++ + +G ++ T+ PE + SA L+ + D + + P +
Sbjct: 198 PYYTMNLSA-SGRGIAPTTLQLTTDAPEAQNGASALQLLRAS---DVLPEESNPKVDISG 253
Query: 251 FYLVIPR-----------QGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
L +P GP + D + W+LL ++ G +C+K+G+S + F
Sbjct: 254 RVLFVPSAEHSRASRGTTSTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYF 309
Query: 300 NGQPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
S C++ +C+ +QL +YR AD +I + GV GR+
Sbjct: 310 YSLSSTTQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 349
>gi|146103721|ref|XP_001469629.1| similar to leishmania major. l411.4-like protein [Leishmania
infantum JPCM5]
gi|134073999|emb|CAM72739.1| similar to leishmania major. l411.4-like protein [Leishmania
infantum JPCM5]
Length = 577
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 114/286 (39%), Gaps = 49/286 (17%)
Query: 93 VRIPPV-LTVNKTASYAVYELTYIRDVPYKPQEFYMKTR----------KCEPDAGADVV 141
V +PP+ + + + A Y LTY+R P ++ R +C
Sbjct: 78 VSLPPITVAIQRGAVQMRYGLTYLRTFPAALRDSVRVLRTAMSCDDGVTRCPSYMSMTGT 137
Query: 142 KICERQPICCPCGP-QRRIPSSCGNVFDKL---LKGKANTAHCLRFPGDWFHVFGIGQRS 197
+ +CC C + + S N + + A C++ G +H + +G S
Sbjct: 138 LVSAPLGLCCLCTSVECALTSDLCNASMRAHFCFRTGAAGITCVQGEGITYHGWSVGSSS 197
Query: 198 IGFSVRIEVKTGSKVSEVTV-----GPENKTATSADNFLKVNLIGDFVGYTNIPSFE--E 250
+++ + +G ++ T+ PE + SA L+ + D + + P +
Sbjct: 198 PYYTMHLSA-SGRGIAPTTLQLTTDAPEAQNGASALQLLRAS---DVLPEESNPKVDISG 253
Query: 251 FYLVIPR-----------QGGPGQPQDLGGNFSMWMLLERTRFTLDGLECNKIGVSYEAF 299
L +P GP + D + W+LL ++ G +C+K+G+S + F
Sbjct: 254 RVLFVPSAEHSRASRGTTSTGPVRDDDP----AEWLLLPAPLVSVSGNDCDKVGISPDYF 309
Query: 300 NGQPSF--CSSPFWSCLHNQLWNYREADQNRINRNQLPLYGVEGRF 343
S C++ +C+ +QL +YR AD +I + GV GR+
Sbjct: 310 YSLSSTTQCNAQKGTCVRHQLADYRAADLEQIAQ------GVGGRY 349
>gi|302141790|emb|CBI18993.3| unnamed protein product [Vitis vinifera]
Length = 176
Score = 42.4 bits (98), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 39/75 (52%)
Query: 506 INDFFESIESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIF 565
I FF + L+ DF++GK+C C +DF C+I+ C++ L+ +V +
Sbjct: 5 IGSFFSGFARVIGDLFGSPLDFLSGKSCSSVCGITWDFICYIENFCVANLLKIAMVSFLL 64
Query: 566 PTVLVLLWLLHQKGL 580
VL+ +LL + G+
Sbjct: 65 YIVLLFFYLLCKLGI 79
>gi|301614936|ref|XP_002936934.1| PREDICTED: sodium/calcium exchanger 3-like isoform 3 [Xenopus
(Silurana) tropicalis]
Length = 915
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|301614944|ref|XP_002936938.1| PREDICTED: sodium/calcium exchanger 3-like isoform 7 [Xenopus
(Silurana) tropicalis]
Length = 923
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|301614932|ref|XP_002936932.1| PREDICTED: sodium/calcium exchanger 3-like isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 922
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|449500727|ref|XP_004161179.1| PREDICTED: uncharacterized protein LOC101227573 [Cucumis sativus]
Length = 238
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 502 PKSSINDFFESIESI-GKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGL 560
S +D F +I I G L DF++G++C C S +DF C+I+ C++ L+ G+
Sbjct: 5 ASSLASDVFSAIGKIFGSPL-----DFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGM 59
Query: 561 VLAIFPTVLVLLWLLHQKGLF 581
V + VL+LL+LLH+ G+F
Sbjct: 60 VFILSYFVLLLLYLLHKIGIF 80
>gi|449449908|ref|XP_004142706.1| PREDICTED: uncharacterized protein LOC101218855 [Cucumis sativus]
Length = 238
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 502 PKSSINDFFESIESI-GKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGL 560
S +D F +I I G L DF++G++C C S +DF C+I+ C++ L+ G+
Sbjct: 5 ASSLASDVFSAIGKIFGSPL-----DFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGM 59
Query: 561 VLAIFPTVLVLLWLLHQKGLF 581
V + VL+LL+LLH+ G+F
Sbjct: 60 VFILSYFVLLLLYLLHKIGIF 80
>gi|301614934|ref|XP_002936933.1| PREDICTED: sodium/calcium exchanger 3-like isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 919
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|301614938|ref|XP_002936935.1| PREDICTED: sodium/calcium exchanger 3-like isoform 4 [Xenopus
(Silurana) tropicalis]
Length = 925
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|301614942|ref|XP_002936937.1| PREDICTED: sodium/calcium exchanger 3-like isoform 6 [Xenopus
(Silurana) tropicalis]
Length = 919
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|301614940|ref|XP_002936936.1| PREDICTED: sodium/calcium exchanger 3-like isoform 5 [Xenopus
(Silurana) tropicalis]
Length = 916
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 235 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 288
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 289 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 348
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 349 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 388
>gi|392967161|ref|ZP_10332579.1| TonB-dependent receptor plug [Fibrisoma limi BUZ 3]
gi|387843958|emb|CCH54627.1| TonB-dependent receptor plug [Fibrisoma limi BUZ 3]
Length = 1078
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 27/184 (14%)
Query: 262 GQPQDLGGNFSMWMLLERT-------RFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCL 314
G DL N+ W++L T RF N SY + SF ++ + +
Sbjct: 611 GYYADLTSNYKNWLILNGTFRYDQTSRFYKSTRPTNSW--SYPYYGAAVSFIATDAFPAI 668
Query: 315 HNQLWNYRE--ADQNRINRNQLPLYGVEGRFERMNQHPNAGSHSFSIG------------ 360
+ NY + A+ N+ + +PLYG++ + P + ++G
Sbjct: 669 KSSFLNYAKIRANYNKNANDNIPLYGLDLAYGNGGGFPYGNTVGLTVGNRLPDANLRPEV 728
Query: 361 --VTEVLNSNLLIELRAD-DIEYVYQRSPGKIISVIIPTFEALTQFGVATITTQNTG-EV 416
TE+ L+ R + D+ QRS G++I+V +P + + T+N G E
Sbjct: 729 VYSTEIGGEFQLLNDRINVDVSAYSQRSEGQVITVRVPNTTGFSSLLINVGETKNWGYEA 788
Query: 417 EASY 420
E Y
Sbjct: 789 EVKY 792
>gi|332027092|gb|EGI67188.1| hypothetical protein G5I_04344 [Acromyrmex echinatior]
Length = 1545
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 65/167 (38%), Gaps = 26/167 (15%)
Query: 177 TAHCLRFPGDWFHVFGIGQRSIGFSVRIEVK---------------TGSKVSE-----VT 216
+AHCLRF W+ V+ + + +V ++V TG V +
Sbjct: 967 SAHCLRFSDLWYSVYQLEDPIVDHAVYLQVYEKRVLANGSTYWKDLTGDSVVRQVVYAIR 1026
Query: 217 VGPENK-----TATSADNFLKVNLIG-DFVGYTNIPSFEEFYLVIPRQGGPGQPQDLGGN 270
+G N+ T A + +V ++G + N+ + LV G G
Sbjct: 1027 LGTFNRHHRGNQDTIAFAYKEVKMLGREEDEIPNLDVVRDRLLVPSSVTSKGFEYPAEGE 1086
Query: 271 FSMWMLLERTRFTLDGLECNKIGVSYEAFNGQPSFCSSPFWSCLHNQ 317
++++ + G EC+K GV + AF QP C +CL NQ
Sbjct: 1087 SGEYLVIPASSINESGNECDKAGVGFAAFAKQPDRCERVRGTCLKNQ 1133
>gi|3970809|emb|CAA10220.1| sodium-calcium exchanger III [Oncorhynchus mykiss]
Length = 263
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 55/128 (42%), Gaps = 26/128 (20%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQR-----IRDFRSRRID-VDHPH 613
L LA FP ++L WL ++ LF Y + +++DN R RS+ I+ +D
Sbjct: 138 LTLAFFPICVILAWLADRRLLF---YKFMHKKYRADNHRGVIIETEHERSKGIEMMDGGG 194
Query: 614 VHVRKHHKQE-GRHHKL----------EARRRRCGIHSDHKHKHSDRDTDY------YYY 656
V H + G H L E+RR I D K KH +++ D YY
Sbjct: 195 KMVNSHFAHDGGAAHNLISLIEGKEVDESRRDMIRILKDLKQKHPEKEMDQLVEMANYYA 254
Query: 657 LHHVQKDK 664
L H QK +
Sbjct: 255 LSHQQKSR 262
>gi|63099310|gb|AAY32773.1| solute carrier family 8 member 3 [Mixophyes balbus]
Length = 377
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
L L FP +VL W+ ++ LF Y + +++D R + + +HP
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I D K KH ++D D YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q L LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDLCLD 289
>gi|63099306|gb|AAY32771.1| solute carrier family 8 member 3 [Rheobatrachus silus]
Length = 377
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
L L FP +VL W+ ++ LF Y + +++D R + + +HP
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
+ H +G + E+RR I D K KH ++D D YY L H QK
Sbjct: 190 KIMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS MQ + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSMQDICLD 289
>gi|225443423|ref|XP_002267790.1| PREDICTED: uncharacterized protein LOC100249538 [Vitis vinifera]
Length = 230
Score = 40.8 bits (94), Expect = 2.9, Method: Composition-based stats.
Identities = 18/54 (33%), Positives = 32/54 (59%)
Query: 528 ITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLF 581
I G +C C+ +D +C I+++C+S LV LVL + L+ +L+ + G+F
Sbjct: 27 IFGDSCEGVCAGTWDITCFIEHLCVSNLVKLFLVLGLCYITLLFFYLMFKLGIF 80
>gi|297735741|emb|CBI18428.3| unnamed protein product [Vitis vinifera]
Length = 808
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 32/54 (59%)
Query: 528 ITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAIFPTVLVLLWLLHQKGLF 581
I G +C C+ +D +C I+++C+S LV LVL + L+ +L+ + G+F
Sbjct: 605 IFGDSCEGVCAGTWDITCFIEHLCVSNLVKLFLVLGLCYITLLFFYLMFKLGIF 658
>gi|238478576|ref|NP_001154356.1| uncharacterized protein [Arabidopsis thaliana]
gi|5263325|gb|AAD41427.1|AC007727_16 F8K7.16 [Arabidopsis thaliana]
gi|332192026|gb|AEE30147.1| uncharacterized protein [Arabidopsis thaliana]
Length = 233
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 506 INDFFESI-ESIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAI 564
++ FF SIG L DF++GK+C C S +DF C+++ C++ L L+L +
Sbjct: 5 MDSFFTGFSHSIGNFFGSPL-DFLSGKSCSSVCPSPWDFICYVENFCVANLAKTALILIL 63
Query: 565 FPTVLVLLWLLHQKGLF 581
L +++L++ G +
Sbjct: 64 SYFFLFFIYMLYKVGFW 80
>gi|63099280|gb|AAY32758.1| solute carrier family 8 member 3 [Xenopus (Silurana) tropicalis]
Length = 377
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPH-----V 614
L L FP +VL W+ ++ LF Y + +++D R + + DHP
Sbjct: 136 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EADHPKGIEMDG 189
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I + K KH ++D D YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKELKQKHPEKDLDQLVEMANYYALSHQQK 249
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289
>gi|63099318|gb|AAY32777.1| solute carrier family 8 member 3 [Myobatrachus gouldii]
Length = 377
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
L L FP +VL W+ ++ LF Y + +++D R + + +HP
Sbjct: 136 LTLFFFPVCVVLAWVADKRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I D K KH ++D D YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289
>gi|63099282|gb|AAY32759.1| solute carrier family 8 member 3 [Heleophryne purcelli]
Length = 377
Score = 39.7 bits (91), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
L L FP +VL W+ ++ LF Y + +++D R + + +HP
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I D K KH ++D D YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289
>gi|63099276|gb|AAY32756.1| solute carrier family 8 member 3 [Limnodynastes salmini]
Length = 377
Score = 39.7 bits (91), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 61/160 (38%), Gaps = 43/160 (26%)
Query: 560 LVLAIFPTVLVLLWLLHQKGLFDPLYDWWDDHFQSDNQRIRDFRSRRIDVDHPHV----- 614
L L FP +VL W+ ++ LF Y + +++D R + + +HP
Sbjct: 136 LTLFFFPVCVVLAWVADRRLLF---YKYMHKKYRTDKHRAIMIET---EAEHPKGIEMDG 189
Query: 615 HVRKHHKQEGRHHKL------EARRRRCGIHSDHKHKHSDRDTDY------YYYLHHVQK 662
V H +G + E+RR I D K KH ++D D YY L H QK
Sbjct: 190 KVMNSHFLDGNLLNMDGKEVDESRRDMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQK 249
Query: 663 D--------------------KHKHGRSKNSSVMQQLYLD 682
KH +SK SS +Q + LD
Sbjct: 250 SRAFYRIQATRMMTGAGNILKKHAAEQSKRSSSLQDICLD 289
>gi|297845152|ref|XP_002890457.1| hypothetical protein ARALYDRAFT_313065 [Arabidopsis lyrata subsp.
lyrata]
gi|297336299|gb|EFH66716.1| hypothetical protein ARALYDRAFT_313065 [Arabidopsis lyrata subsp.
lyrata]
Length = 232
Score = 38.9 bits (89), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 506 INDFFESIE-SIGKKLWEGLRDFITGKACRRKCSSFFDFSCHIQYICLSWLVLFGLVLAI 564
++ FF SIG L DF++GK+C C S +DF C ++ C++ L L+L +
Sbjct: 5 LDSFFTGFSHSIGNFFGSPL-DFLSGKSCSSVCPSPWDFICFVENFCVANLAKAALILIL 63
Query: 565 FPTVLVLLWLLHQKGLF 581
L +++L++ G +
Sbjct: 64 SYFFLFFIYMLYKVGFW 80
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.137 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,193,821,965
Number of Sequences: 23463169
Number of extensions: 482742170
Number of successful extensions: 1092119
Number of sequences better than 100.0: 407
Number of HSP's better than 100.0 without gapping: 116
Number of HSP's successfully gapped in prelim test: 291
Number of HSP's that attempted gapping in prelim test: 1087273
Number of HSP's gapped (non-prelim): 2882
length of query: 701
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 551
effective length of database: 8,839,720,017
effective search space: 4870685729367
effective search space used: 4870685729367
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 81 (35.8 bits)