BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005149
(712 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359481238|ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
Length = 884
Score = 803 bits (2073), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/747 (58%), Positives = 514/747 (68%), Gaps = 54/747 (7%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
MGDLR SPEP G +R S N AIGA W RAE Q II +VQPT VSE
Sbjct: 1 MGDLRACSPEPRGLFTDDRLLPLPSLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSE 60
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120
ERRK V+DYVQ LIR +GCEVFPFGSVPLKTYLPDGDIDLTAFGG VE+ LA +V SV
Sbjct: 61 ERRKEVVDYVQGLIRVRVGCEVFPFGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSV 120
Query: 121 LEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIG 180
LE EDQN+AAEFVVKD QLI AEVKLVKCLVQNIVVDISFNQLGGL TLCFLEQ+DRLIG
Sbjct: 121 LEAEDQNRAAEFVVKDVQLIHAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIG 180
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKF 223
KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLYKF
Sbjct: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKF 240
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LDYFSKFDWD+YC+SLNGPVRISSLPE++ ETPEN G D LL+++ L++C+++FSVPSRG
Sbjct: 241 LDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRG 300
Query: 284 FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLT 343
+TNSR+F KH NIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG IL QPE+ ++
Sbjct: 301 LETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKIS 360
Query: 344 DELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPNSS 403
+EL KFF+NTL+RHG GQRPDV D +P+S +GFG +S+ S E E + + + +S
Sbjct: 361 EELCKFFTNTLERHGRGQRPDV-DLIPVSCSDGFGFASSISDLEFQEEKRILEVNYTDSR 419
Query: 404 GITENCRIDDEAELCGGVGKIKVSGMESSYCR------------TINEPHNSGNGTAVSE 451
IT +D E +C GV +K+SG E ++E NS N AVS
Sbjct: 420 SITGESELDAERSMCDGVNCVKISGTELGMSNPQRGSKQVVPTSMLSEADNSSNAPAVSG 479
Query: 452 TRLSGDAKDLATSKNLNLVISNETSKCSSLSGEES------KARHAPHLYFSSSTMGNGE 505
R+SGDAKDLA+ + ISN+TSK S SGEES KA APHLYFS S NG+
Sbjct: 480 FRISGDAKDLASPRIRGPKISNDTSKSSPPSGEESVSVLSKKAHFAPHLYFSRSAQ-NGK 538
Query: 506 IRNGNSEWKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNH 565
RN N + K NSG +E +E+ ++ +G + NQ NH + SN
Sbjct: 539 ERNENLDKKLAGNSGLSE------------EESSFVVHHGLNGNQSVNNHELLNSFVSND 586
Query: 566 HPSLMSTIPWSTEEFNFSYSGYHASPRT--VGSPRAANSLSDLSGDYESHQISLNHVWWW 623
P +S S+E + ++G P + G+P A NSL+DLSGDY+SH SL + WW
Sbjct: 587 VPPGLSPTACSSE---YLHTGNWDRPSSGNSGNPEAPNSLADLSGDYDSHFNSLQYGWWC 643
Query: 624 YEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPP 683
Y++ + M L SQFQS NSWD +Q+S RRNI PQ++ANG +PRP FYP+ PP
Sbjct: 644 YDYIFGAPALSMPVALPSQFQSNNSWDAIQQSAHIRRNIFPQITANGIIPRPPFYPLNPP 703
Query: 684 MLPGASFGMEEMPKHRGTGTYFPNTVY 710
M+ G FG+EEMPK RGTGTYFPNT +
Sbjct: 704 MISGTGFGVEEMPKPRGTGTYFPNTSH 730
>gi|449449962|ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
Length = 898
Score = 781 bits (2017), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/748 (59%), Positives = 518/748 (69%), Gaps = 49/748 (6%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPS-----NQTAIGAEYWQRAEEATQGIIAQVQP 55
MGDLR WS E NGAV ++PSSSS S S N T IG +YW+RAEEATQ II+QVQP
Sbjct: 1 MGDLRSWSLEQNGAVAEDKPSSSSFSSFSSLLPSNPTPIGVDYWRRAEEATQAIISQVQP 60
Query: 56 TVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALAN 115
TVVSE RRKAVIDYVQRLIR L CEVFPFGSVPLKTYLPDGDIDLTA GG NVEEALA+
Sbjct: 61 TVVSERRRKAVIDYVQRLIRGRLRCEVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALAS 120
Query: 116 DVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQV 175
DVCSVL EDQN AAEFVVKD QLIRAEVKLVKCLVQNIVVDISFNQLGGL TLCFLE++
Sbjct: 121 DVCSVLNSEDQNGAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKI 180
Query: 176 DRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL----------------- 218
DR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL
Sbjct: 181 DRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQ 240
Query: 219 VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFS 278
VLYKFLDYFSKFDWD+YCISLNGPVRISSLPE+V ETP+N GGDLLLS++FL+ C+E FS
Sbjct: 241 VLYKFLDYFSKFDWDNYCISLNGPVRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFS 300
Query: 279 VPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQP 338
VP+RG++ NSR+FP KHLNIVDPLKENNNLGRSVSKGNFYRIRSAF+YGARKLG ILS P
Sbjct: 301 VPARGYEANSRAFPIKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHP 360
Query: 339 EESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYES 398
E+++ DE+RKFFSNTLDRHG GQRPDVQDP P+S + SGTE E
Sbjct: 361 EDNVVDEVRKFFSNTLDRHGGGQRPDVQDPAPVSGGYESCAALLVSGTETQEETNNRDSG 420
Query: 399 EPNSSGITENCRIDDEAELCGGVGKIK-------VSGM--ESSYCRTINEPHNS---GNG 446
+S +C E + GG K V G+ ESS R ++ P N
Sbjct: 421 SVCASDTIGDCSWSQEVSIHGGNANDKEFGEYDHVGGIMNESSQGRPLSVPSGVDGLANA 480
Query: 447 TAVSETRLSGDAKDLATSKNLNLVISNETSKCSSLSGEE--SKARHA---PHLYFSSSTM 501
+S+ RLSGDA DLA+ + L IS++ K S S EE S H PH YFS
Sbjct: 481 IGISDYRLSGDANDLASLRIEGLSISHDAHKSSPSSFEEGISPLGHESLRPHHYFSRPIT 540
Query: 502 GNGEIRNGNSEWKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPV 561
NGE+ + N+ N + E + PT K TG QDEN ++ + ++
Sbjct: 541 ENGELIDENT------NKCTPENSYQHLQSPT--KATGSSAKGKQDENHVNNDDEVANQS 592
Query: 562 ESNHHPSLMSTIPWSTEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVW 621
E+ + ++ S+E+F S GY VG P A N+LSDL+GDYESH SL
Sbjct: 593 ETKQSSPPLHSVSLSSEDFYPSSRGYRFLTSNVGPPEAFNALSDLNGDYESHCNSLQIGR 652
Query: 622 WWYEHALN-SSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPM 680
W+YE+AL+ ++ SP+ P L SQ+ +KN WD+++RS+ ++N Q+++NG + RP FYPM
Sbjct: 653 WYYEYALSAAALSPIPPPLPSQYPNKNPWDIIRRSVQVKQNAFAQINSNGLLARPAFYPM 712
Query: 681 TPPMLP-GASFGMEEMPKHRGTGTYFPN 707
P+LP GA+ MEEMPK RGTGTYFPN
Sbjct: 713 PSPILPGGATLAMEEMPKPRGTGTYFPN 740
>gi|297735556|emb|CBI18050.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/735 (57%), Positives = 491/735 (66%), Gaps = 90/735 (12%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
MGDLR SPEP G +R S N AIGA W RAE Q II +VQPT VSE
Sbjct: 1 MGDLRACSPEPRGLFTDDRLLPLPSLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSE 60
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120
ERRK V+DYVQ LIR +GCEVFPFGSVPLKTYLPDGDIDLTAFGG VE+ LA +V SV
Sbjct: 61 ERRKEVVDYVQGLIRVRVGCEVFPFGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSV 120
Query: 121 LEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIG 180
LE EDQN+AAEFVVKD QLI AEVKLVKCLVQNIVVDISFNQLGGL TLCFLEQ+DRLIG
Sbjct: 121 LEAEDQNRAAEFVVKDVQLIHAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIG 180
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKF 223
KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLYKF
Sbjct: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKF 240
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LDYFSKFDWD+YC+SLNGPVRISSLPE++ ETPEN G D LL+++ L++C+++FSVPSRG
Sbjct: 241 LDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRG 300
Query: 284 FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLT 343
+TNSR+F KH NIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG IL QPE+ ++
Sbjct: 301 LETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKIS 360
Query: 344 DELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPNSS 403
+EL KFF+NTL+RHG GQRPDV D +PL
Sbjct: 361 EELCKFFTNTLERHGRGQRPDV-DLIPL-------------------------------- 387
Query: 404 GITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSGDAKDLAT 463
D E +C GV + S ++E NS N AVS R+SGDAKDLA+
Sbjct: 388 --------DAERSMCDGVNLVPTS--------MLSEADNSSNAPAVSGFRISGDAKDLAS 431
Query: 464 SKNLNLVISNETSKCSSLSGEES------KARHAPHLYFSSSTMGNGEIRNGNSEWKQQL 517
+ ISN+TSK S SGEES KA APHLYFS S NG+ RN N + K
Sbjct: 432 PRIRGPKISNDTSKSSPPSGEESVSVLSKKAHFAPHLYFSRSAQ-NGKERNENLDKKLAG 490
Query: 518 NSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIPWST 577
NSG +E +E+ ++ +G + NQ NH + SN P +S S+
Sbjct: 491 NSGLSE------------EESSFVVHHGLNGNQSVNNHELLNSFVSNDVPPGLSPTACSS 538
Query: 578 EEFNFSYSGYHASPRT--VGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSPM 635
E + ++G P + G+P A NSL+DLSGDY+SH SL + WW Y++ + M
Sbjct: 539 E---YLHTGNWDRPSSGNSGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSM 595
Query: 636 SPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEEM 695
L SQFQS NSWD +Q+S RRNI PQ++ANG +PRP FYP+ PPM+ G FG+EEM
Sbjct: 596 PVALPSQFQSNNSWDAIQQSAHIRRNIFPQITANGIIPRPPFYPLNPPMISGTGFGVEEM 655
Query: 696 PKHRGTGTYFPNTVY 710
PK RGTGTYFPNT +
Sbjct: 656 PKPRGTGTYFPNTSH 670
>gi|297816424|ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
lyrata]
gi|297321933|gb|EFH52354.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
lyrata]
Length = 829
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/691 (57%), Positives = 460/691 (66%), Gaps = 59/691 (8%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E+W R EEAT+ II QV PT+VSE+RR+ VI YVQ+LIR LGCEV FGSVPLKTYLPD
Sbjct: 32 EFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLPD 91
Query: 97 GDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVV 156
GDIDLTAFGGL EE LA V SVLERE+ N ++ FVVKD QLIRAEVKLVKCLVQNIVV
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 157 DISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 216
DISFNQ+GG+ TLCFLE++D LIGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALE
Sbjct: 152 DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211
Query: 217 TL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENS 259
TL VLYKFLDYFSKFDWD+YCISLNGPV +SSLPE+VVETPEN
Sbjct: 212 TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPENG 271
Query: 260 GGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYR 319
G D LL+SEFLKEC+E +SVPSRGF+TN R F KHLNIVDPLKE NNLGRSVSKGNFYR
Sbjct: 272 GEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331
Query: 320 IRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGV 379
IRSAFTYGARKLG I Q +E++ ELRKFFSN L RHGSGQRPDV D VP RYN +
Sbjct: 332 IRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRHGSGQRPDVLDAVPFVRYNRYNA 391
Query: 380 SSTFSGTELCREDQTIY-ESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTIN 438
S S +E Q +Y +SSG T N R D E L GV +G E S
Sbjct: 392 LSPASNH--FQEGQVVYESESSSSSGATGNGRHDQEGSLDAGVSISSTTGHELSGSPGET 449
Query: 439 EPHNSGNGTAVSETRLSGDAKDLATSKNLNLVISNETSKCSSLSGEESKA-RHAPHLYFS 497
P +VSE R SGDAKDLAT + L IS++ K LS +ES + + H F
Sbjct: 450 AP-------SVSEERFSGDAKDLATLRIQKLEISDDAMKSPCLSDKESVSPLNGKHHSFH 502
Query: 498 SSTMGNGEIRNGNSEWKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGA 557
M NGE+ NGN KQQ NS A+ I + ++EN+ V H
Sbjct: 503 Q--MRNGEVLNGNGVGKQQENSCLADSRRVKDI------------HSNENENE-HVGHED 547
Query: 558 SSPVESNHHPSLMSTIPWSTEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISL 617
+PW E+ + YSG+ S G+P N LSDLSGDYES SL
Sbjct: 548 L---------PFTGAVPWPQEDMHLHYSGHCVS----GTP---NMLSDLSGDYESQLNSL 591
Query: 618 NHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLF 677
WW+++ N SP+SP L Q + NSW++++ +LPFRRN ++ANG VPR +F
Sbjct: 592 RFGRWWFDYVQNGPMSPLSPPGLPQLPNNNSWEVIRHALPFRRNAPTPVNANGVVPRQVF 651
Query: 678 YPMTPPMLPGASFGMEEMPKHRGTGTYFPNT 708
+ + P M+PG F +EE+PK RGTGTYFPN
Sbjct: 652 FHVNPQMIPGPGFAIEELPKPRGTGTYFPNA 682
>gi|79597803|ref|NP_850678.2| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
gi|332645293|gb|AEE78814.1| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
Length = 829
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/691 (56%), Positives = 460/691 (66%), Gaps = 59/691 (8%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E W R EEAT+ II QV PT+VSE+RR+ VI YVQ+LIR LGCEV FGSVPLKTYLPD
Sbjct: 32 ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91
Query: 97 GDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVV 156
GDIDLTAFGGL EE LA V +VLERE+ N +++FVVKD QLIRAEVKLVKCLVQNIVV
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 157 DISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 216
DISFNQ+GG+ TLCFLE++D LIGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALE
Sbjct: 152 DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211
Query: 217 TLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENS 259
TLVLY KFLDYFSKFDWDSYCISLNGPV +SSLP++VVETPEN
Sbjct: 212 TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENG 271
Query: 260 GGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYR 319
G DLLL+SEFLKEC+E +SVPSRGF+TN R F KHLNIVDPLKE NNLGRSVSKGNFYR
Sbjct: 272 GEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331
Query: 320 IRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGV 379
IRSAFTYGARKLG + Q +E+++ ELRKFFSN L RHGSGQRPDV D +P RYN +
Sbjct: 332 IRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRY-- 389
Query: 380 SSTFSGTELCREDQTI-YESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTIN 438
++ + +E Q + +SSG T N R D E L GV +G + S
Sbjct: 390 NAILPASNHFQEGQVVNESESSSSSGATGNGRHDQEDSLDAGVSIPSTTGPDLSGSPGET 449
Query: 439 EPHNSGNGTAVSETRLSGDAKDLATSKNLNLVISNETSKCSSLSGEESKA-RHAPHLYFS 497
P +VSE R SGDAKDLAT + L IS++ K LS +ES + + H F+
Sbjct: 450 VP-------SVSEERFSGDAKDLATLRIQKLEISDDAMKSPCLSDKESDSPLNGKHHSFN 502
Query: 498 SSTMGNGEIRNGNSEWKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGA 557
M NGE+ NGN KQQ NS I +N
Sbjct: 503 Q--MRNGEVLNGNGVGKQQENSWHTGSRRVKDI---------------------HINENE 539
Query: 558 SSPVESNHHPSLMSTIPWSTEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISL 617
+ V P S +PW E+ + YSG+ S G+P N LSDLSGDYES SL
Sbjct: 540 NEHVGYEDLP-FASAVPWPQEDMHLHYSGHCVS----GTP---NMLSDLSGDYESQLNSL 591
Query: 618 NHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLF 677
WW+++ N SP+SP L Q + NSW++M+ +LPFRRN ++ANG VPR +F
Sbjct: 592 RFGRWWFDYVQNGPMSPLSPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPRQVF 651
Query: 678 YPMTPPMLPGASFGMEEMPKHRGTGTYFPNT 708
+ + P M+PG FG+EE+PK RGTGTYFPN
Sbjct: 652 FHVNPQMIPGPGFGIEELPKPRGTGTYFPNA 682
>gi|449526634|ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
Length = 816
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/663 (57%), Positives = 454/663 (68%), Gaps = 44/663 (6%)
Query: 81 EVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLI 140
+VFPFGSVPLKTYLPDGDIDLTA GG NVEEALA+DVCSVL EDQN AAEFVVKD QLI
Sbjct: 4 QVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQNGAAEFVVKDVQLI 63
Query: 141 RAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYES 200
RAEVKLVKCLVQNIVVDISFNQLGGL TLCFLE++DR IGKDHLFKRSIILIKAWCYYES
Sbjct: 64 RAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYES 123
Query: 201 RILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPV 243
RILGAHHGLISTYALETLVLY KFLDYFSKFDWD+YCISLNGPV
Sbjct: 124 RILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGPV 183
Query: 244 RISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLK 303
RISSLPE+V ETP+N GGDLLLS++FL+ C+E FSVP+RG++ NSR+FP KHLNIVDPLK
Sbjct: 184 RISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPLK 243
Query: 304 ENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRP 363
ENNNLGRSVSKGNFYRIRSAF+YGARKLG ILS PE+++ DE+RKFFSNTLDRHG GQRP
Sbjct: 244 ENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRHGGGQRP 303
Query: 364 DVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPNSSGITENCRIDDEAELCGGVGK 423
DVQDP P+S + SGTE E +S +C E + GG
Sbjct: 304 DVQDPAPVSGGYESCAALLVSGTETQEETNNRDSGSVCASDTIGDCSWSQEVSIHGGNAN 363
Query: 424 IK-------VSGM--ESSYCRTINEPHNS---GNGTAVSETRLSGDAKDLATSKNLNLVI 471
K V G+ ESS R ++ P N +S+ RLSGDA DLA+ + L I
Sbjct: 364 DKEFGEYDHVGGIMNESSQGRPLSVPSGVDGLANAIGISDYRLSGDANDLASLRIEGLSI 423
Query: 472 SNETSKCSSLSGEE--SKARHA---PHLYFSSSTMGNGEIRNGNSEWKQQLNSGSAEKNV 526
S++ K S S EE S H PH YFS NGE+ + N+ N + E +
Sbjct: 424 SHDAHKSSPSSFEEGISPLGHESLRPHHYFSRPITENGELIDENT------NKCTPENSY 477
Query: 527 TSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIPWSTEEFNFSYSG 586
PT K TG QDEN ++ + ++ E+ + ++ S+E+F S G
Sbjct: 478 QHLQSPT--KATGSSAKGKQDENHVNNDDEVANQSETKQSSPPLHSVSLSSEDFYPSSRG 535
Query: 587 YHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALN-SSYSPMSPQLLSQFQS 645
Y VG P A N+LSDL+GDYESH SL W+YE+AL+ ++ SP+ P L SQ+ +
Sbjct: 536 YRFLTSNVGPPEAFNALSDLNGDYESHCNSLQIGRWYYEYALSAAALSPIPPPLPSQYPN 595
Query: 646 KNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLP-GASFGMEEMPKHRGTGTY 704
KN WD+++RS+ ++N Q+++NG + RP FYPM P+LP GA+ MEEMPK RGTGTY
Sbjct: 596 KNPWDIIRRSVQVKQNAFAQINSNGLLARPAFYPMPSPILPGGATLAMEEMPKPRGTGTY 655
Query: 705 FPN 707
FPN
Sbjct: 656 FPN 658
>gi|30693508|ref|NP_190730.2| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
gi|332645292|gb|AEE78813.1| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
Length = 755
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 350/674 (51%), Positives = 418/674 (62%), Gaps = 86/674 (12%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E W R EEAT+ II QV PT+VSE+RR+ VI YVQ+LIR LGCEV FGSVPLKTYLPD
Sbjct: 32 ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91
Query: 97 GDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVV 156
GDIDLTAFGGL EE LA V +VLERE+ N +++FVVKD QLIRAEVKLVKCLVQNIVV
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 157 DISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 216
DISFNQ+GG+ TLCFLE+
Sbjct: 152 DISFNQIGGICTLCFLEK------------------------------------------ 169
Query: 217 TLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ 276
VLYKFLDYFSKFDWDSYCISLNGPV +SSLP++VVETPEN G DLLL+SEFLKEC+E
Sbjct: 170 --VLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEM 227
Query: 277 FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
+SVPSRGF+TN R F KHLNIVDPLKE NNLGRSVSKGNFYRIRSAFTYGARKLG +
Sbjct: 228 YSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFL 287
Query: 337 QPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTI- 395
Q +E+++ ELRKFFSN L RHGSGQRPDV D +P RYN + ++ + +E Q +
Sbjct: 288 QSDEAISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRY--NAILPASNHFQEGQVVN 345
Query: 396 YESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLS 455
+SSG T N R D E L GV +G + S P +VSE R S
Sbjct: 346 ESESSSSSGATGNGRHDQEDSLDAGVSIPSTTGPDLSGSPGETVP-------SVSEERFS 398
Query: 456 GDAKDLATSKNLNLVISNETSKCSSLSGEESKAR-HAPHLYFSSSTMGNGEIRNGNSEWK 514
GDAKDLAT + L IS++ K LS +ES + + H F+ M NGE+ NGN K
Sbjct: 399 GDAKDLATLRIQKLEISDDAMKSPCLSDKESDSPLNGKHHSFNQ--MRNGEVLNGNGVGK 456
Query: 515 QQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIP 574
QQ NS I +N + V P S +P
Sbjct: 457 QQENSWHTGSRRVKDI---------------------HINENENEHVGYEDLP-FASAVP 494
Query: 575 WSTEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSP 634
W E+ + YSG+ S G+P N LSDLSGDYES SL WW+++ N SP
Sbjct: 495 WPQEDMHLHYSGHCVS----GTP---NMLSDLSGDYESQLNSLRFGRWWFDYVQNGPMSP 547
Query: 635 MSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEE 694
+SP L Q + NSW++M+ +LPFRRN ++ANG VPR +F+ + P M+PG FG+EE
Sbjct: 548 LSPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPRQVFFHVNPQMIPGPGFGIEE 607
Query: 695 MPKHRGTGTYFPNT 708
+PK RGTGTYFPN
Sbjct: 608 LPKPRGTGTYFPNA 621
>gi|6572083|emb|CAB63026.1| putative protein [Arabidopsis thaliana]
Length = 764
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 350/674 (51%), Positives = 418/674 (62%), Gaps = 86/674 (12%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E W R EEAT+ II QV PT+VSE+RR+ VI YVQ+LIR LGCEV FGSVPLKTYLPD
Sbjct: 32 ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91
Query: 97 GDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVV 156
GDIDLTAFGGL EE LA V +VLERE+ N +++FVVKD QLIRAEVKLVKCLVQNIVV
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 157 DISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 216
DISFNQ+GG+ TLCFLE+
Sbjct: 152 DISFNQIGGICTLCFLEK------------------------------------------ 169
Query: 217 TLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ 276
VLYKFLDYFSKFDWDSYCISLNGPV +SSLP++VVETPEN G DLLL+SEFLKEC+E
Sbjct: 170 --VLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEM 227
Query: 277 FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
+SVPSRGF+TN R F KHLNIVDPLKE NNLGRSVSKGNFYRIRSAFTYGARKLG +
Sbjct: 228 YSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFL 287
Query: 337 QPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTI- 395
Q +E+++ ELRKFFSN L RHGSGQRPDV D +P RYN + ++ + +E Q +
Sbjct: 288 QSDEAISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRY--NAILPASNHFQEGQVVN 345
Query: 396 YESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLS 455
+SSG T N R D E L GV +G + S P +VSE R S
Sbjct: 346 ESESSSSSGATGNGRHDQEDSLDAGVSIPSTTGPDLSGSPGETVP-------SVSEERFS 398
Query: 456 GDAKDLATSKNLNLVISNETSKCSSLSGEESKAR-HAPHLYFSSSTMGNGEIRNGNSEWK 514
GDAKDLAT + L IS++ K LS +ES + + H F+ M NGE+ NGN K
Sbjct: 399 GDAKDLATLRIQKLEISDDAMKSPCLSDKESDSPLNGKHHSFNQ--MRNGEVLNGNGVGK 456
Query: 515 QQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIP 574
QQ NS I +N + V P S +P
Sbjct: 457 QQENSWHTGSRRVKDI---------------------HINENENEHVGYEDLP-FASAVP 494
Query: 575 WSTEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSP 634
W E+ + YSG+ S G+P N LSDLSGDYES SL WW+++ N SP
Sbjct: 495 WPQEDMHLHYSGHCVS----GTP---NMLSDLSGDYESQLNSLRFGRWWFDYVQNGPMSP 547
Query: 635 MSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEE 694
+SP L Q + NSW++M+ +LPFRRN ++ANG VPR +F+ + P M+PG FG+EE
Sbjct: 548 LSPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPRQVFFHVNPQMIPGPGFGIEE 607
Query: 695 MPKHRGTGTYFPNT 708
+PK RGTGTYFPN
Sbjct: 608 LPKPRGTGTYFPNA 621
>gi|359478494|ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
Length = 854
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 370/738 (50%), Positives = 451/738 (61%), Gaps = 63/738 (8%)
Query: 1 MGDLRDWSP-EPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MGDL+ SP PNG V S S SS P +I + W AE ATQ I+A++QPT+ S
Sbjct: 1 MGDLKLPSPFLPNGVVSYRGASRSLSSSPPLPASIAGDSWAAAERATQEIVAKMQPTLGS 60
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCS 119
R+ VIDYVQRLI LGCEVFP+GSVPLKTYL DGDIDLTA NVEEALA+DV +
Sbjct: 61 MRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGDIDLTALCSSNVEEALASDVHA 120
Query: 120 VLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLI 179
VL+ E+QN+ AEF VKD Q I AEVKLVKCLV++IV+DISFNQLGGLSTLCFLEQVDRLI
Sbjct: 121 VLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVIDISFNQLGGLSTLCFLEQVDRLI 180
Query: 180 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYK 222
GKDHLFKRSIILIK+WCYYESRILGAHHGLISTYALE L VLY+
Sbjct: 181 GKDHLFKRSIILIKSWCYYESRILGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYR 240
Query: 223 FLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSR 282
FLDYFSKFDWD+YCISLNGPV SSLP++V E PEN DLLLS EFL+ CV+ FSVP R
Sbjct: 241 FLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFR 300
Query: 283 GFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESL 342
G +TNSR+FP KHLNI+DPL+ENNNLGRSV+KGNFYRIRSAF YG+ KLG ILS P E +
Sbjct: 301 GLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVI 360
Query: 343 TDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPNS 402
DEL+ FF++TL+RH S ++Q+ G SS+ SGTE+C ED+ I+ + +S
Sbjct: 361 QDELKNFFASTLERHRSKYMAEIQNSALTFGSRGSSSSSSSSGTEICSEDE-IFLTSLDS 419
Query: 403 SGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSGDAKDLA 462
IT RIDDE G + +S M+SS +G AVS LSGD+K+ A
Sbjct: 420 DKIT---RIDDETSSMGVLSSPSLSEMDSSI-----------DGNAVSGYCLSGDSKESA 465
Query: 463 TSKNLNLVISNETSKCSSLSGE-----ESKARHAPHLYFSSSTMGNGEIRNGNSEWKQQL 517
+ +L I+ + S +G K+ H LY SS + NG L
Sbjct: 466 SCGFHDLRITEDMSDSLPPTGNLGRSLSVKSHHGHRLYISSLFIENG-----------SL 514
Query: 518 NSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIPWST 577
AE +V + ++L EN N SS H S+ S I T
Sbjct: 515 CPKMAESSVID--------DASIVLQQESKENHFVANTSFSSHSYHEGHNSIGSIISRPT 566
Query: 578 ----EEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYS 633
E ++ G + GS + +L DLSGDY+SH SL + Y HAL
Sbjct: 567 ANISENTALAFRGRDFAC-NAGSLGSLETLLDLSGDYDSHIRSLQYGQCCYGHALPPPLL 625
Query: 634 PMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGME 693
P P SQ Q WD +++ L F +N+ QM +NG + F P+ P +FG+E
Sbjct: 626 PSPPLSPSQLQINTPWDKVRQHLQFTQNLHSQMDSNGVILGNHF-PVKHPARSITAFGLE 684
Query: 694 EMPKHRGTGTYFPNTVYL 711
+ K RGTGTYFPN +L
Sbjct: 685 DKQKPRGTGTYFPNMSHL 702
>gi|147817122|emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]
Length = 1147
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 308/574 (53%), Positives = 380/574 (66%), Gaps = 54/574 (9%)
Query: 174 QVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY------------ 221
++DRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY
Sbjct: 405 KIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSLLNGP 464
Query: 222 -----KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ 276
KFLDYFSKFDWD+YC+SLNGPVRISSLPE++ ETPEN G D LL ++ L++C+++
Sbjct: 465 LAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLGNDXLRDCLDR 524
Query: 277 FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
FSVPSRG +TNSR+F KH NIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG IL
Sbjct: 525 FSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILL 584
Query: 337 QPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIY 396
QPE+ +++EL KFF+NTL+RHG GQRPDV D +P+S +GFG +S+ S E E + +
Sbjct: 585 QPEDKISEELCKFFTNTLERHGRGQRPDV-DLIPVSCSDGFGFASSISDLEFQEEKRILE 643
Query: 397 ESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCR------------TINEPHNSG 444
+ +S IT +D E +C GV +K+SG E ++E NS
Sbjct: 644 VNYTDSRSITGESELDAERSMCDGVNCVKISGTELGMSNPQRGSKQVVPTSMLSEADNSS 703
Query: 445 NGTAVSETRLSGDAKDLATSKNLNLVISNETSKCSSLSGEES------KARHAPHLYFSS 498
N AVS R+SGDAKDLA+ + ISN+TSK S SGEES KA APHLYFS
Sbjct: 704 NAPAVSGFRISGDAKDLASPRIRGPKISNDTSKSSPPSGEESVSVLSKKAHFAPHLYFSR 763
Query: 499 STMGNGEIRNGNSEWKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGAS 558
S NG+ R+ N + K NSG +E +E+ ++ +G + NQ NH
Sbjct: 764 SAQ-NGKERHENLDKKLAGNSGLSE------------EESSFVVHHGLNGNQSVNNHELL 810
Query: 559 SPVESNHHPSLMSTIPWSTEEFNFSYSGYHASPRT--VGSPRAANSLSDLSGDYESHQIS 616
+ SN P +S S+E + ++G P + G+P A NSL+DLSGDY+SH S
Sbjct: 811 NSFVSNDVPPGLSPTACSSE---YLHTGNWDRPSSGNSGNPEAPNSLADLSGDYDSHFNS 867
Query: 617 LNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPL 676
L + WW Y++ + M L SQFQS NSWD +Q+S RRNI PQ++ANG +PRP
Sbjct: 868 LQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQQSAHIRRNIFPQITANGIIPRPP 927
Query: 677 FYPMTPPMLPGASFGMEEMPKHRGTGTYFPNTVY 710
FYPM PPM+ G FG+EEMPK RGTGTYFPNT +
Sbjct: 928 FYPMNPPMISGTGFGVEEMPKPRGTGTYFPNTSH 961
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 86/117 (73%), Positives = 93/117 (79%), Gaps = 8/117 (6%)
Query: 82 VFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIR 141
VFPFGSVPLKTYLPDGDIDLTAFGG VE+ LA +V SVLE EDQN+AAEFVVKD QLI
Sbjct: 186 VFPFGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIH 245
Query: 142 AEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQV--------DRLIGKDHLFKRSII 190
AEVKLVKCLVQNIVVDISFNQLGGL TLCFLEQ +R + + L+KR I
Sbjct: 246 AEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQQKAIWDGVEERFLKRLSLWKRQYI 302
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 50/81 (61%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
MGDLR SPEP G +R S N AIGA W RAE Q II +VQPT VSE
Sbjct: 1 MGDLRACSPEPRGLFTDDRLLPLPSLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSE 60
Query: 61 ERRKAVIDYVQRLIRNYLGCE 81
ERRK V+DYVQ LIR +GCE
Sbjct: 61 ERRKEVVDYVQGLIRVRVGCE 81
>gi|356553166|ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816328 [Glycine max]
Length = 779
Score = 510 bits (1314), Expect = e-141, Method: Compositional matrix adjust.
Identities = 326/734 (44%), Positives = 417/734 (56%), Gaps = 147/734 (20%)
Query: 1 MGDLRDWSPEPNGAVFGE-------RPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQV 53
MGDL NG VFGE PS + +++ A+ W AE+ T I++++
Sbjct: 1 MGDLL-----VNGVVFGEDRPCASSPPSPPLPPSNPDPSSVAADAWAAAEKTTAEILSRI 55
Query: 54 QPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEAL 113
+PT+ ++ RR+ V+DYVQRLIR CEVFP+GSVPLKTYLPDGDIDLTA N+E+ L
Sbjct: 56 RPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLTALSCQNIEDGL 115
Query: 114 ANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLE 173
+DV +VL E+ N+A+E+ VKD + I AEVKLVKC+VQ+IVVDISFNQLGGLSTLCFLE
Sbjct: 116 VSDVRAVLHGEEINEASEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQLGGLSTLCFLE 175
Query: 174 QVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL--------------- 218
+VDRL+ KDHLFKRSIILIKAWCYYESR+LGAHHGLISTYALETL
Sbjct: 176 KVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYIFHQFHVSLDGP 235
Query: 219 --VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ 276
VLY+FLDYFSKFDWD+YC+SL GPV SS P +V E PEN GG+ LL+ EF++ CVE
Sbjct: 236 LAVLYRFLDYFSKFDWDNYCVSLKGPVGKSSPPNIVAEVPEN-GGNTLLTEEFIRSCVES 294
Query: 277 FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
FS+PSRG D N R+FP KHLNI+DPLKENNNLGRSV+KGNFYRIRSAF YGARKLG IL
Sbjct: 295 FSLPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFKYGARKLGWILM 354
Query: 337 QPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIY 396
PE+ +T+EL +FF+NTL+RHGS P + F ST S + E+Q Y
Sbjct: 355 LPEDRITEELIRFFTNTLERHGS---------TPGNVNKSFLSLSTASRKDRKPENQHNY 405
Query: 397 ESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSG 456
+ CR + E + G+ S S G AV +L
Sbjct: 406 D-----------CRDERERYVVQDAGEFFDS---------------SRYGNAVGSLKLCE 439
Query: 457 DAKDLATSKNLNLVISNETSKCSSLSGEESKARHAPHLYFSSSTMGNGEIRNGNSEWKQQ 516
D+KD+ATS L+ +N S CS NG+ N S+ +
Sbjct: 440 DSKDVATSGVLDSASTNGWSYCS-----------------------NGQFENNISDSEPA 476
Query: 517 LNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIPWS 576
LNS +++ + E Q G + +H
Sbjct: 477 LNS----------------------VIDDEKEKQ-----GVAGNSPRSH----------- 498
Query: 577 TEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALN---SSYS 633
T+E N + S A+ SL DL+GDY+SH +L Y H N S
Sbjct: 499 TDEKNMAVS------------EASKSLLDLTGDYDSHIGNLQ-----YGHMCNGYPVSPV 541
Query: 634 PMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGME 693
SP +F ++N W+ +++ + +I Q ++N + + + Y + P LP SFG E
Sbjct: 542 VPSPPRSPKFPNRNPWETVRQCVQINHSIRSQANSNSVMGQQV-YVINHPSLPMTSFGSE 600
Query: 694 EMPKHRGTGTYFPN 707
E K RGTG YFPN
Sbjct: 601 EKRKVRGTGAYFPN 614
>gi|110738268|dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]
Length = 660
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 288/553 (52%), Positives = 344/553 (62%), Gaps = 59/553 (10%)
Query: 175 VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY------------- 221
+D LIGKDHLFKRSIILIKAWCYYESRILGA HGLISTYALETLVLY
Sbjct: 1 IDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPL 60
Query: 222 ----KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQF 277
KFLDYFSKFDWDSYCISLNGPV +SSLP++VVETPEN G DLLL+SEFLKEC+E +
Sbjct: 61 AVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEMY 120
Query: 278 SVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQ 337
SVPSRGF+TN R F KHLNIVDPLKE NNLGRSVSKGNFYRIRSAFTYGARKLG + Q
Sbjct: 121 SVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFLQ 180
Query: 338 PEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTI-Y 396
+E+++ ELRKFFSN L RHGSGQRPDV D +P RYN + ++ + +E Q +
Sbjct: 181 SDEAISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRY--NAILPASNHFQEGQVVNE 238
Query: 397 ESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSG 456
+SSG T N R D E L GV +G + S P +VSE R SG
Sbjct: 239 SESSSSSGATGNGRHDQEDSLDAGVSIPSTTGPDLSGSPGETVP-------SVSEERFSG 291
Query: 457 DAKDLATSKNLNLVISNETSKCSSLSGEESKAR-HAPHLYFSSSTMGNGEIRNGNSEWKQ 515
DAKDLAT + L IS++ K LS +ES + + H F+ M NGE+ NGN KQ
Sbjct: 292 DAKDLATLRIQKLEISDDAMKSPCLSDKESDSPLNGKHHSFNQ--MRNGEVLNGNGVGKQ 349
Query: 516 QLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIPW 575
Q NS I +N + V P S +PW
Sbjct: 350 QENSWHTGSRRVKDI---------------------HINENENEHVGYEDLP-FASAVPW 387
Query: 576 STEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSPM 635
E+ + YSG+ S G+P N LSDLSGDYES SL WW+++ N SP+
Sbjct: 388 PQEDMHLHYSGHCVS----GTP---NMLSDLSGDYESQLNSLRFGRWWFDYVQNGPMSPL 440
Query: 636 SPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEEM 695
SP L Q + NSW++M+ +LPFRRN ++ANG VPR +F+ + P M+PG FG+EE+
Sbjct: 441 SPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPRQVFFHVNPQMIPGPGFGIEEL 500
Query: 696 PKHRGTGTYFPNT 708
PK RGTGTYFPN
Sbjct: 501 PKPRGTGTYFPNA 513
>gi|356500940|ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814626 [Glycine max]
Length = 780
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 276/535 (51%), Positives = 341/535 (63%), Gaps = 65/535 (12%)
Query: 1 MGDLRDWSPEPNGAVFGE-------RPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQV 53
MGDL NG VFGE PS + +++ A+ W AE T I+ ++
Sbjct: 1 MGDL-----HVNGVVFGEDRPCASSPPSPPLPPWNPDPSSVAADAWAAAERNTAEILRRI 55
Query: 54 QPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEAL 113
+PT+ ++ RR+ V+DYVQRLIR CEVFP+GSVPLKTYLPDGDIDLTA N+E+ L
Sbjct: 56 RPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLTALSCENIEDGL 115
Query: 114 ANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLE 173
+DV +VL E+ N+AAE+ VKD + I AEVKLVKC+VQ+IVVDISFNQLGGLSTLCFLE
Sbjct: 116 VSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQLGGLSTLCFLE 175
Query: 174 QVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL--------------- 218
+VDRL+ KDHLFKRSIILIKAWCYYESR+LGAHHGLISTYALETL
Sbjct: 176 KVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYIFHQFHVSLDGP 235
Query: 219 --VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ 276
VLY+FLDYFSKFDWD+YC+SL GPV +SLP +V E PEN GG+ LL+ EF++ CVE
Sbjct: 236 LAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVPEN-GGNTLLTEEFIRSCVES 294
Query: 277 FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
FSVPSRG D N R+FP KHLNI+DPLKENNNLGRSV+KGNFYRIRSAF YGARKLG IL
Sbjct: 295 FSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFKYGARKLGWILR 354
Query: 337 QPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIY 396
PE+ + +EL +FF+NTL+RHGS Q + F ST SG + +Q +
Sbjct: 355 LPEDRIAEELIRFFANTLERHGSTQG---------NVDKSFLSLSTASGKDRKPGNQHNF 405
Query: 397 ESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSG 456
ES E + D E +S +G AV+ +L
Sbjct: 406 ESRDER----ERYVVQDAGEFFD----------------------SSRDGNAVTSFKLGE 439
Query: 457 DAKDLATSKNLNLVISNETSKCSSLSGEESKARHAPHLYFSSSTMGNGEIRNGNS 511
D+KD+ATS L+ +N S CS+ E + + P L + ++ GNS
Sbjct: 440 DSKDIATSGVLDRTSTNGWSYCSNEQFENNISDSEPALNSVINDEKEKQVMAGNS 494
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 64/122 (52%), Gaps = 7/122 (5%)
Query: 588 HASPRTVGSPRAANSLSDLSGDYESH--QISLNHVWWWYEHALNSSYSPMSPQLLSQFQS 645
H + + A+ SL DL+GDY+SH + H+ Y +L P SP+ F +
Sbjct: 498 HTDEKHMAVSEASKSLLDLTGDYDSHIGNLQYGHMCNGYPVSLVVPSPPRSPK----FPN 553
Query: 646 KNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYF 705
+N W+ + + +P +I Q ++N + + + Y + P LP SFG EE K RGTG YF
Sbjct: 554 RNPWETVHQCVPINHSIRSQANSNCVMGQQV-YVINHPTLPMTSFGSEEKRKVRGTGAYF 612
Query: 706 PN 707
PN
Sbjct: 613 PN 614
>gi|224124740|ref|XP_002319410.1| predicted protein [Populus trichocarpa]
gi|222857786|gb|EEE95333.1| predicted protein [Populus trichocarpa]
Length = 681
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 237/346 (68%), Positives = 270/346 (78%), Gaps = 17/346 (4%)
Query: 39 WQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGD 98
W+RAEE I+ ++ PTV S +RK VIDYVQRLIR LG EVFP+GSVPLKTYLPDGD
Sbjct: 58 WERAEEVATEIVYRIHPTVESSFKRKQVIDYVQRLIRYSLGFEVFPYGSVPLKTYLPDGD 117
Query: 99 IDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDI 158
IDLTA +EEAL +DV +VL E+ N+ A + VKD I AEVKL+KC+VQN VVDI
Sbjct: 118 IDLTAISSPAIEEALVSDVYTVLRGEELNEDALYEVKDVHCIDAEVKLIKCIVQNTVVDI 177
Query: 159 SFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 218
SFNQLGGL TLCFLE+VDRL+GK+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL
Sbjct: 178 SFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 237
Query: 219 -----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGG 261
VLYKFLDYFSKFDW++YCISLNGPV SSLP +V + PEN G
Sbjct: 238 ILYIFHLFHSSLNGPLAVLYKFLDYFSKFDWENYCISLNGPVCKSSLPNIVAKPPENVSG 297
Query: 262 DLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIR 321
+LLLS EFLK+CV++F VPSR + NSR FP KHLNIVDPLKENNNLGRSV++GNF+RIR
Sbjct: 298 ELLLSDEFLKDCVDRFYVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRIR 357
Query: 322 SAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
SAF YG RKLG IL P E + DEL+ FF+NTLDRHGS DVQ+
Sbjct: 358 SAFKYGGRKLGRILLLPREKIADELKTFFANTLDRHGSDYWSDVQN 403
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 2/103 (1%)
Query: 608 GDYESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMS 667
GD++ H SL + + + +A+++ P P + Q ++ N W+ +++SL +RN QM+
Sbjct: 420 GDHDDHLQSLAYSQYCHMYAVSAPIPP-CPSMSPQSENNNRWETVRQSLQLKRNGHSQMN 478
Query: 668 ANGAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPNTVY 710
N V FY + P A+ EE + RGTGTY PN Y
Sbjct: 479 TN-HVYGTQFYCVNPVAPFRAATNSEEKKERRGTGTYIPNMSY 520
>gi|224145449|ref|XP_002325647.1| predicted protein [Populus trichocarpa]
gi|222862522|gb|EEF00029.1| predicted protein [Populus trichocarpa]
Length = 533
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 237/360 (65%), Positives = 280/360 (77%), Gaps = 17/360 (4%)
Query: 17 GERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN 76
G+ P S S S + +I E W+RAEE T+ I+ ++ PTV S +RK +I YVQRLI++
Sbjct: 36 GQDPVSPSFSSNPDPWSIVEENWERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKS 95
Query: 77 YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKD 136
LG EVFP+GSVPLKTYLPDGDIDLT+ +EEAL +D+ +VL RE+ N+ + F VKD
Sbjct: 96 SLGFEVFPYGSVPLKTYLPDGDIDLTSISSPAIEEALVSDIHAVLRREELNEDSTFEVKD 155
Query: 137 AQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWC 196
I AEVKL+KC+VQN VVDISFNQLGGL TLCFLE+VDRL+GK+HLFKRSIILIKAWC
Sbjct: 156 VHCIDAEVKLIKCIVQNTVVDISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWC 215
Query: 197 YYESRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISL 239
YYESRILGAHHGLISTYALETL VLY+FL+YFSKFDW++YCISL
Sbjct: 216 YYESRILGAHHGLISTYALETLILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISL 275
Query: 240 NGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIV 299
NGPV SSLP +V E EN G+LLLS EFLK+C ++FSVPSR + NSR FP KHLNIV
Sbjct: 276 NGPVCKSSLPNIVAEPLENGQGELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIV 335
Query: 300 DPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGS 359
DPLKENNNLGRSV++GNF+RIRSAF YGARKLG IL P+E + DEL+ FF+NTLDRHGS
Sbjct: 336 DPLKENNNLGRSVNRGNFFRIRSAFKYGARKLGQILLLPKERIADELKIFFANTLDRHGS 395
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 61/116 (52%), Gaps = 4/116 (3%)
Query: 595 GSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQR 654
G+ + NS+S GD+ H SL + + + HA+++ P P +L ++KN W+ +Q+
Sbjct: 409 GARSSDNSVS--RGDHNGHLQSLAYSQYCHMHAVSAPIPP-CPSMLPLSENKNRWETVQQ 465
Query: 655 SLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPNTVY 710
SL ++N QM+ N L Y + P A+ EE RGTGTY PN V+
Sbjct: 466 SLQLKQNGHSQMNTNHIFGTQL-YCVNPGGPFRAATDSEEKKIRRGTGTYIPNMVF 520
>gi|255554485|ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223542501|gb|EEF44041.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 821
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 264/506 (52%), Positives = 327/506 (64%), Gaps = 48/506 (9%)
Query: 17 GERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN 76
G+ P+SS + I E W+RAE+AT I+ ++ PTV ++ RK V++YVQ LI++
Sbjct: 34 GQIPASSP-----DPALISEENWERAEQATLQIVYRIHPTVEADCNRKHVVEYVQSLIQS 88
Query: 77 YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKD 136
LG +VFP+GSVPLKTYLPDGDIDLTA +A +DV +VL RE+QN+ A + VKD
Sbjct: 89 SLGFQVFPYGSVPLKTYLPDGDIDLTAIINPAGVDASVSDVHAVLRREEQNRDAPYKVKD 148
Query: 137 AQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWC 196
I AEVKL+KC+V +IVVDISFNQLGGLSTLCFLEQVD+LIGK HLFKRSIILIKAWC
Sbjct: 149 VHFIDAEVKLIKCIVHDIVVDISFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWC 208
Query: 197 YYESRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISL 239
YYESRILGAHHGLISTYALETL VLY+FLDYFSKFDWD+YCISL
Sbjct: 209 YYESRILGAHHGLISTYALETLILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISL 268
Query: 240 NGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIV 299
NGPV SSLP++V E PE G+LLL EFL+ V+ SVPSR + NSR F KHLNIV
Sbjct: 269 NGPVCKSSLPKIVAEPPETGRGNLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIV 328
Query: 300 DPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGS 359
DPL+ENNNLGRSV++GNFYRIRSAF YGARKLGHILS + + +EL KFF+NTLDRHGS
Sbjct: 329 DPLRENNNLGRSVNRGNFYRIRSAFKYGARKLGHILSLQSDRMINELDKFFANTLDRHGS 388
Query: 360 GQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYES---------EPNSSGITENCR 410
V+ +S F S+ S ++ ED + +S E + SG + N
Sbjct: 389 NSLTHVKSSCLVSPTGNFDNLSSSSLSDTSSEDSIVQKSTAGCSVRPFETSCSGNSHNAS 448
Query: 411 IDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSGD--AKDLATSKNLN 468
+ L G GK + SG+ +GT ++ + G + + SK +
Sbjct: 449 HFYLSSLHGEDGKFE-SGI--------------SDGTTLANFVIDGQISCTEWSESKENH 493
Query: 469 LVISNETSKCSSLSGEESKARHAPHL 494
VI+N CS+ G+ S P L
Sbjct: 494 FVINNSACSCSNHEGKTSLCSTIPSL 519
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 110/255 (43%), Gaps = 37/255 (14%)
Query: 470 VISNETSKCSSLSGEES---KARHAPHLYFSSSTMGNGEIRNGNSEWKQQLNSGSAEKN- 525
++ T+ CS E S + +A H Y SS +G+ +G S+ G+ N
Sbjct: 423 IVQKSTAGCSVRPFETSCSGNSHNASHFYLSSLHGEDGKFESGISD-------GTTLANF 475
Query: 526 -VTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMSTIPWSTEEFNFSY 584
+ I T + E+ EN +N+ A S SL STIP + +
Sbjct: 476 VIDGQISCTEWSES--------KENHFVINNSACSCSNHEGKTSLCSTIPSLVNNISENL 527
Query: 585 SGYHASPRTVGS----PRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSPMSPQLL 640
+ A R S PR+ SL DL+GDY+SH S+ A+++ P SP
Sbjct: 528 APTTAE-RDFASISQIPRSFKSLLDLTGDYDSHLKSVKFGQGCCFFAVSAPVLPCSPTA- 585
Query: 641 SQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFY-----PMTPPMLPGASFGMEEM 695
++KN W+ +++SL +RN+ Q++ NG + P T +F EE
Sbjct: 586 PHSKNKNPWETVRQSLQLKRNVHSQINTNGIFGHQQHFLNHLVPFT------TAFSSEEK 639
Query: 696 PKHRGTGTYFPNTVY 710
K RGTGTY PN Y
Sbjct: 640 RKQRGTGTYIPNMSY 654
>gi|357153090|ref|XP_003576335.1| PREDICTED: uncharacterized protein LOC100826374, partial
[Brachypodium distachyon]
Length = 769
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 226/353 (64%), Positives = 266/353 (75%), Gaps = 17/353 (4%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
+I AE W+ EEA G++ ++QP+ SE RR AV+ YVQRL+R+ +GCEVFPFGSVPLK
Sbjct: 4 ASISAERWRAFEEAALGVVGRIQPSAPSEGRRAAVVHYVQRLVRHAVGCEVFPFGSVPLK 63
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLTAFG ++ +E LAN+V +VLE E+ K AEF VKD Q I AEVKLVKCLV
Sbjct: 64 TYLPDGDIDLTAFGSISSDENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCLV 123
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNIVVDISFNQ+GGL TLCFLEQVD+ GK+HLFK+SI+LIKAWCYYESRILGAHHGLIS
Sbjct: 124 QNIVVDISFNQIGGLCTLCFLEQVDQRFGKEHLFKKSIMLIKAWCYYESRILGAHHGLIS 183
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALE L VLY+FLDY+SKFDWD+ ISL GPV +SSLPE+V +
Sbjct: 184 TYALEILVLCIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVLLSSLPELVSD 243
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
P GD L EFL+EC + F+VP R + N+R F K LNIVDPLK+NNNLGRSVSK
Sbjct: 244 APVTHDGDFLKREEFLRECAQTFTVPPRNSEKNTRLFSRKFLNIVDPLKQNNNLGRSVSK 303
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
GNF+RIRSAF GARKLG IL + S E+ +FF NTL R+ + RPDVQD
Sbjct: 304 GNFFRIRSAFDLGARKLGKILKEASSSAVPEVNQFFRNTLKRNRTMVRPDVQD 356
>gi|326531888|dbj|BAK01320.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 702
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 216/352 (61%), Positives = 261/352 (74%), Gaps = 17/352 (4%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
++I + W A G++ ++QPTV SE RR AV+DYVQRL++ +GC VFPFGSVPLK
Sbjct: 25 SSISPDAWAPFGAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVFPFGSVPLK 84
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDL AFG +E++AN+V ++LE E++ K AEF +KD Q I AEVKLVKC V
Sbjct: 85 TYLPDGDIDLAAFGSTCSDESIANEVRAILESEERRKDAEFEIKDVQYINAEVKLVKCFV 144
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNIVVDISFNQ+GGL TLCFLEQVD+ K+HLFKRSI+LIKAWCYYESRILGAHHGLIS
Sbjct: 145 QNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGAHHGLIS 204
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALETL VLY+FLDY+SKFDWD+ ISL+GP+ +SSLP++V +
Sbjct: 205 TYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSLPDLVTD 264
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
P L EFL+EC + F+VP R ++ +R FP K LNIVDPLK +NNLGRSVSK
Sbjct: 265 PPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNLGRSVSK 324
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQ 366
GNFYRIRSAF GARKLG IL P S+ DE+ +FF +TL R+ S RPDVQ
Sbjct: 325 GNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRNRSRVRPDVQ 376
>gi|255559667|ref|XP_002520853.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223539984|gb|EEF41562.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 655
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/355 (58%), Positives = 266/355 (74%), Gaps = 19/355 (5%)
Query: 33 AIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKT 92
+I +E W AE+ Q I+ +QP++ SE++RK VIDY+QRLI+++ EV PFGSVPLKT
Sbjct: 27 SIDSELWLMAEKRAQEILWILQPSLASEQKRKVVIDYIQRLIKHHFATEVLPFGSVPLKT 86
Query: 93 YLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
YLPDGDIDLTA N+EE L ++C++L E+QN +E VKD + I+A+VK+VKC V+
Sbjct: 87 YLPDGDIDLTALSHQNMEEDLVREICNILTYEEQNSESE--VKDVRYIQAQVKIVKCSVK 144
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NI VDISFNQ+ GL LCFLEQVD+LIGKDHL K SIILIKAWC+YESRILGAHHGL+ST
Sbjct: 145 NISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKCSIILIKAWCFYESRILGAHHGLLST 204
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALE LVLY +FL+Y+S FDWD+YC+++NGPV +SSLPE++ E+
Sbjct: 205 YALEILVLYIINAFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAVSSLPEIMTES 264
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P N+G +LLL EFLK C E+FSVP + + F KHLNI+DPLK+NNNLGRSVSKG
Sbjct: 265 PYNNGNELLLCPEFLKRCKEKFSVPIKAVENGGHEFSIKHLNILDPLKDNNNLGRSVSKG 324
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVP 370
NF+RI+ A +YGA++LG IL+ P E++ L FF NTLDR+G G+RPD PVP
Sbjct: 325 NFHRIKCALSYGAQRLGEILALPGENMGAGLEIFFINTLDRNGRGERPDTLVPVP 379
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 2/105 (1%)
Query: 605 DLSGDYESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIP 664
DLSGDY++H L W+++++L S P SQ Q +WD + + L ++N +
Sbjct: 390 DLSGDYDNHYSGLLQGQWYHKYSLPVSPEMTPPSSPSQIQQSFTWDRLSQLLRCKQNFLS 449
Query: 665 QMSANGAVPR-PLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPNT 708
Q N VPR P +P P + AS +E K +GTGTY PN
Sbjct: 450 QRGTNVFVPRVPHRHPYAPKVYATASTS-DEKGKSQGTGTYIPNV 493
>gi|449443945|ref|XP_004139736.1| PREDICTED: uncharacterized protein LOC101209112 [Cucumis sativus]
Length = 1341
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 220/384 (57%), Positives = 274/384 (71%), Gaps = 23/384 (5%)
Query: 1 MGDLRDWSPEPNGAV-FGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG+ W+ P+G + G P +++ + + +E W +AEE T +IA +QP S
Sbjct: 1 MGEHEGWAQPPSGLLPNGLLPDEAATVM----RMLDSERWSKAEERTAELIACIQPNPPS 56
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVC 118
EERR AV DYVQRLI C+VF FGSVPLKTYLPDGDIDLTAF N++E A+ V
Sbjct: 57 EERRNAVADYVQRLIMKCFPCQVFTFGSVPLKTYLPDGDIDLTAFSKNQNLKETWAHQVR 116
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
+LE E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISF+QLGGL TLCFLE+VD L
Sbjct: 117 DMLESEEKNENAEFRVKEVQYIKAEVKIIKCLVENIVVDISFDQLGGLCTLCFLEEVDHL 176
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY----------------- 221
I ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY
Sbjct: 177 INQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFAGPLEVLY 236
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FL++FSKFDWD++C+SL GPV ISSLP+V E P GG+LLLS FL+ C ++V
Sbjct: 237 RFLEFFSKFDWDNFCVSLWGPVPISSLPDVTAEPPRKDGGELLLSKLFLEACSAVYAVFP 296
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
G + + F KH N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L + P E
Sbjct: 297 GGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLFECPRED 356
Query: 342 LTDELRKFFSNTLDRHGSGQRPDV 365
+ EL +FF NT +RHGSGQRPDV
Sbjct: 357 ILAELNQFFLNTWERHGSGQRPDV 380
>gi|357155485|ref|XP_003577136.1| PREDICTED: uncharacterized protein LOC100840351 [Brachypodium
distachyon]
Length = 739
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 230/386 (59%), Positives = 272/386 (70%), Gaps = 30/386 (7%)
Query: 1 MGDLRDWS--PEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVV 58
M D+R+ S PEP A PSS + E W+ E A +I ++QPT+
Sbjct: 1 MVDIREVSLAPEPKHAPANPDPSS-----------VSPEVWEPLEAAALAVIGRIQPTIP 49
Query: 59 SEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVC 118
SE R +V+DY+QRL+R +GC+VFPFGSVPLKTYLPDGDIDLTAFG +E+LAN+V
Sbjct: 50 SEGLRASVVDYIQRLVRCSVGCQVFPFGSVPLKTYLPDGDIDLTAFGSTYSDESLANEVR 109
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
++LE E++ + AEF VKD Q I AEVKLVKC VQNIVVDISFNQ+GGL TLCFLEQVD+
Sbjct: 110 AILEAEERREDAEFEVKDVQYIHAEVKLVKCFVQNIVVDISFNQMGGLCTLCFLEQVDQR 169
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLY 221
K+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLY
Sbjct: 170 FEKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHESLDGPLAVLY 229
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FLDY+SKFDWD+ ISL GPV +SSLPE+V E L EFLKEC + F+VP
Sbjct: 230 RFLDYYSKFDWDNKGISLYGPVSLSSLPELVTEPTGTHDDSFLQREEFLKECAKMFTVPP 289
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
R + N+R F K+ NIVDPLK++NNLGRSVSKGNFYRIRSAF GARKLG IL P S
Sbjct: 290 RLNEKNTRPFYQKYFNIVDPLKQSNNLGRSVSKGNFYRIRSAFDLGARKLGKILQMPANS 349
Query: 342 LTDELRKFFSNTLDRHGSGQRPDVQD 367
DE+ +FF +TL R+ S RPD+QD
Sbjct: 350 TVDEVNQFFKSTLKRNHSMVRPDIQD 375
>gi|326492351|dbj|BAK01959.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 724
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 217/361 (60%), Positives = 263/361 (72%), Gaps = 24/361 (6%)
Query: 30 NQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEV------- 82
+ ++I + W E A G++ ++QPTV SE RR AV+DYVQRL++ +GC V
Sbjct: 23 DPSSISPDAWAPFEAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVPVTPFPV 82
Query: 83 FPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRA 142
FPFGSVPLKTYLPDGDIDL AFG +E++AN+V ++LE E++ K AEF +KD Q I A
Sbjct: 83 FPFGSVPLKTYLPDGDIDLAAFGSTCSDESIANEVRAILESEERRKDAEFEIKDVQYINA 142
Query: 143 EVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRI 202
EVKLVKC VQNIVVDISFNQ+GGL TLCFLEQVD+ K+HLFKRSI+LIKAWCYYESRI
Sbjct: 143 EVKLVKCFVQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRI 202
Query: 203 LGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRI 245
LGAHHGLISTYALETL VLY+FLDY+SKFDWD+ ISL+GP+ +
Sbjct: 203 LGAHHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISL 262
Query: 246 SSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKEN 305
SSLP++V + P L EFL+EC + F+VP R ++ +R FP K LNIVDPLK +
Sbjct: 263 SSLPDLVTDPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPS 322
Query: 306 NNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
NNLGRSVSKGNFYRIRSAF GARKLG IL P S+ DE+ +FF +TL R+ S RPDV
Sbjct: 323 NNLGRSVSKGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRNRSRVRPDV 382
Query: 366 Q 366
Q
Sbjct: 383 Q 383
>gi|222616508|gb|EEE52640.1| hypothetical protein OsJ_34991 [Oryza sativa Japonica Group]
Length = 801
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 220/370 (59%), Positives = 264/370 (71%), Gaps = 20/370 (5%)
Query: 17 GERPSSSSSSVPSNQ--TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI 74
G P+ PSN ++I E W E A ++A++QP SE+RR AVI YVQ L+
Sbjct: 6 GCSPALEPVPTPSNPDPSSISQEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLL 65
Query: 75 RNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVV 134
R +GC+VFPFGSVPLKTYLPDGDIDLTAFG + +E LA V +VLE E+ K AEF V
Sbjct: 66 RFNVGCQVFPFGSVPLKTYLPDGDIDLTAFGH-SSDEILAKQVQAVLESEEARKDAEFEV 124
Query: 135 KDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKA 194
KD Q I AEVKLVKC+VQNI+VDISFNQ GGL TLCFLE+VD+ K+HLFKRSI+LIKA
Sbjct: 125 KDVQYIHAEVKLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKA 184
Query: 195 WCYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDSYCI 237
WCYYESRILGAHHGLISTYALE LVLY +FLDY+SKFDWD+ I
Sbjct: 185 WCYYESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGI 244
Query: 238 SLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLN 297
SL GP+ +SSLPE+V ++P+ D + +FLKEC + F+V R + N++ FP K N
Sbjct: 245 SLYGPISLSSLPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFN 304
Query: 298 IVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRH 357
IVDPLK++NNLGRSVSKGNF RIRSAF +GARKLG IL P+ DE+ +FF NTL RH
Sbjct: 305 IVDPLKQSNNLGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRNTLKRH 364
Query: 358 GSGQRPDVQD 367
S RPDVQ+
Sbjct: 365 CSRVRPDVQE 374
>gi|255564100|ref|XP_002523048.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223537731|gb|EEF39352.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 644
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 210/355 (59%), Positives = 264/355 (74%), Gaps = 19/355 (5%)
Query: 33 AIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKT 92
+I +E W AE+ TQ I+ +QP+ SE++RK VIDY+QRLI+++ EVFPFGSVPLKT
Sbjct: 27 SIDSELWLMAEKRTQEILWVLQPSSSSEQKRKEVIDYIQRLIKHHYATEVFPFGSVPLKT 86
Query: 93 YLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
YLPDGDIDLTA N+EE LA +VC +L +QN +E VKD + I+A+VK+VKC V+
Sbjct: 87 YLPDGDIDLTALSHQNMEEDLAREVCDILTYAEQNLESE--VKDVRYIQAQVKVVKCSVK 144
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NI VDISFNQ+ GL LCFLEQVD+LIGKDHL K SIILIKAWC+YESRILGAHHGL+ST
Sbjct: 145 NISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKHSIILIKAWCFYESRILGAHHGLLST 204
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALE LVLY +FL+Y+S FDWD+YC+++NGPV ISSLPE++ E
Sbjct: 205 YALEILVLYIVNVFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAISSLPEIMTEA 264
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P ++ +LLL+ EFLK C E+FSVP + + F KHLNI+DPLK++NNLGRSVSKG
Sbjct: 265 PYSNRNELLLTPEFLKRCKERFSVPIKAVENGGHEFSIKHLNILDPLKDSNNLGRSVSKG 324
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVP 370
NF+RI+ A +YGA++LG IL P E++ L FF NTLDR+G G+RPD PVP
Sbjct: 325 NFHRIKCALSYGAQRLGEILMLPGENMGAGLENFFINTLDRNGRGERPDALVPVP 379
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 60/111 (54%), Gaps = 12/111 (10%)
Query: 604 SDLSGDYESHQISLNHVWWWYEHALNSSYSPMSPQLL-----SQFQSKNSWDLMQRSLPF 658
SDLSGDY+++ + W++ ++L P+SPQL SQ Q +WD + + L +
Sbjct: 389 SDLSGDYDNYYNGILQGQWYHSYSL-----PVSPQLTPPSSPSQIQQGCTWDTLSQLLWW 443
Query: 659 RRNIIPQMSANGAVPR-PLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPNT 708
++N + Q N VPR P +P + AS +EM K RGTGTY P+
Sbjct: 444 KQNFLSQRGTNVFVPRVPYHHPYAAKVYATAS-STDEMGKSRGTGTYIPHV 493
>gi|413924673|gb|AFW64605.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
Length = 815
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 222/382 (58%), Positives = 264/382 (69%), Gaps = 28/382 (7%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
M D+ + SP P P+S + + W+R E A ++ ++QPT SE
Sbjct: 1 MVDISECSPVPESVPAHPDPAS-----------VSPDAWRRFETAALAVVNKIQPTAASE 49
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120
R AV+DYVQRL +VFPFGSVPLKTYLPDGDIDLT FG +E LAN+VC++
Sbjct: 50 HLRAAVVDYVQRLFWFQARYQVFPFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCTI 109
Query: 121 LEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIG 180
L+ E++ K +EF VKD Q + AEVKLVKCLVQNIVVDIS NQ+GGL TLCFLE+VD+ G
Sbjct: 110 LKSEERRKDSEFEVKDVQYVPAEVKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQHFG 169
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKF 223
KDHLFK+SIILIK WCYYESRILGAHHGLISTYALETL VLY+F
Sbjct: 170 KDHLFKKSIILIKDWCYYESRILGAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRF 229
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LDY+SKFDWD+ ISL GPV +SSLPE+V + P+ D L EFLKEC+E FSV R
Sbjct: 230 LDYYSKFDWDNKGISLFGPVSLSSLPELVTDPPDIQDDDFLQREEFLKECIESFSVLPRN 289
Query: 284 FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLT 343
+TN R F + LNIVDPLK++NNLGRSVSKGNFYRIRSAF +GARKLG IL P
Sbjct: 290 SETNPRLFSRRFLNIVDPLKQSNNLGRSVSKGNFYRIRSAFDFGARKLGKILQVPSCLTV 349
Query: 344 DELRKFFSNTLDRHGSGQRPDV 365
E+ +FF NTL R+ +G RPDV
Sbjct: 350 GEVNQFFRNTLKRNRTGLRPDV 371
>gi|293332253|ref|NP_001168029.1| uncharacterized protein LOC100381756 [Zea mays]
gi|223945595|gb|ACN26881.1| unknown [Zea mays]
gi|413924674|gb|AFW64606.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
Length = 833
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 222/382 (58%), Positives = 264/382 (69%), Gaps = 28/382 (7%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
M D+ + SP P P+S + + W+R E A ++ ++QPT SE
Sbjct: 1 MVDISECSPVPESVPAHPDPAS-----------VSPDAWRRFETAALAVVNKIQPTAASE 49
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120
R AV+DYVQRL +VFPFGSVPLKTYLPDGDIDLT FG +E LAN+VC++
Sbjct: 50 HLRAAVVDYVQRLFWFQARYQVFPFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCTI 109
Query: 121 LEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIG 180
L+ E++ K +EF VKD Q + AEVKLVKCLVQNIVVDIS NQ+GGL TLCFLE+VD+ G
Sbjct: 110 LKSEERRKDSEFEVKDVQYVPAEVKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQHFG 169
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKF 223
KDHLFK+SIILIK WCYYESRILGAHHGLISTYALETL VLY+F
Sbjct: 170 KDHLFKKSIILIKDWCYYESRILGAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRF 229
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LDY+SKFDWD+ ISL GPV +SSLPE+V + P+ D L EFLKEC+E FSV R
Sbjct: 230 LDYYSKFDWDNKGISLFGPVSLSSLPELVTDPPDIQDDDFLQREEFLKECIESFSVLPRN 289
Query: 284 FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLT 343
+TN R F + LNIVDPLK++NNLGRSVSKGNFYRIRSAF +GARKLG IL P
Sbjct: 290 SETNPRLFSRRFLNIVDPLKQSNNLGRSVSKGNFYRIRSAFDFGARKLGKILQVPSCLTV 349
Query: 344 DELRKFFSNTLDRHGSGQRPDV 365
E+ +FF NTL R+ +G RPDV
Sbjct: 350 GEVNQFFRNTLKRNRTGLRPDV 371
>gi|77548394|gb|ABA91191.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
Length = 783
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/353 (60%), Positives = 257/353 (72%), Gaps = 18/353 (5%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
++I E W E A ++A++QP SE+RR AVI YVQ L+R +GC+VFPFGSVPLK
Sbjct: 23 SSISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLK 82
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLTAFG + +E LA V +VLE E+ K AEF VKD Q I AEVKLVKC+V
Sbjct: 83 TYLPDGDIDLTAFGH-SSDEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIV 141
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNI+VDISFNQ GGL TLCFLE+VD+ K HLFKRSI+LIKAWCYYESRILGAHHGLIS
Sbjct: 142 QNIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLIS 201
Query: 212 TYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALE LVLY +FLDY+SKFDWD+ ISL GP+ +SSLPE+V +
Sbjct: 202 TYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTD 261
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
+P+ D + +FLKEC + F+V R + N++ FP K NIVDPLK++NNLGRSVSK
Sbjct: 262 SPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSK 321
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
GNF RIRSAF +GARKLG I+ P+ DE+ +FF NTL RH S RPDVQ+
Sbjct: 322 GNFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRHCSRVRPDVQE 374
>gi|115483835|ref|NP_001065579.1| Os11g0114700 [Oryza sativa Japonica Group]
gi|77548393|gb|ABA91190.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113644283|dbj|BAF27424.1| Os11g0114700 [Oryza sativa Japonica Group]
gi|215694848|dbj|BAG90039.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218185112|gb|EEC67539.1| hypothetical protein OsI_34858 [Oryza sativa Indica Group]
gi|222615390|gb|EEE51522.1| hypothetical protein OsJ_32709 [Oryza sativa Japonica Group]
Length = 801
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/353 (60%), Positives = 257/353 (72%), Gaps = 18/353 (5%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
++I E W E A ++A++QP SE+RR AVI YVQ L+R +GC+VFPFGSVPLK
Sbjct: 23 SSISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLK 82
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLTAFG + +E LA V +VLE E+ K AEF VKD Q I AEVKLVKC+V
Sbjct: 83 TYLPDGDIDLTAFGH-SSDEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIV 141
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNI+VDISFNQ GGL TLCFLE+VD+ K HLFKRSI+LIKAWCYYESRILGAHHGLIS
Sbjct: 142 QNIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLIS 201
Query: 212 TYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALE LVLY +FLDY+SKFDWD+ ISL GP+ +SSLPE+V +
Sbjct: 202 TYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTD 261
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
+P+ D + +FLKEC + F+V R + N++ FP K NIVDPLK++NNLGRSVSK
Sbjct: 262 SPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSK 321
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
GNF RIRSAF +GARKLG I+ P+ DE+ +FF NTL RH S RPDVQ+
Sbjct: 322 GNFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRHCSRVRPDVQE 374
>gi|242036527|ref|XP_002465658.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
gi|241919512|gb|EER92656.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
Length = 1333
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 209/353 (59%), Positives = 255/353 (72%), Gaps = 18/353 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE+ T +IA++QP SE RR AV YVQRLI N L C+VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEDRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 94 LPDGDIDLTAFGGLN-VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF ++E AN V LERE++N+ AEF VK+ Q I+AEVK++KCLV+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE++D LI ++HLFKRSIILIKAWC+YESRILGAHHGLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISRNHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETLVLY +FL++FS FDW+ +C+SL GPV ISSLP++ E
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P G+LLL+ FL C + V R + + F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRMDSGELLLNKSFLDTCSSAYGVVPRTQENQGQPFVSKHFNVIDPLRANNNLGRSVSKG 319
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
NF+RIRSAF YGA++LG +L P+E L EL +FF+NT RHGSG RPDV P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIAELNQFFTNTWIRHGSGSRPDVPTP 372
>gi|356520288|ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max]
Length = 1331
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 222/386 (57%), Positives = 277/386 (71%), Gaps = 25/386 (6%)
Query: 1 MGDLRDWSPEPNGAV-FGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG+ W+ P+G + G P+ ++S + + +E W +AE+ T +IA +QP S
Sbjct: 1 MGEHEGWAQPPSGLLPNGLLPNEAASVI----QVLDSERWLKAEQRTAELIACIQPNPPS 56
Query: 60 EERRKAVIDYVQRLIRNYLGCEV--FPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALAND 116
EERR AV DYVQRLI C+V F FGSVPLKTYLPDGDIDLTAF N++++ A+
Sbjct: 57 EERRNAVADYVQRLIMKCFPCQVGVFTFGSVPLKTYLPDGDIDLTAFSKNQNLKDSWAHQ 116
Query: 117 VCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVD 176
V +LE E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+VD
Sbjct: 117 VRDMLENEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVD 176
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY--------------- 221
LI ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY
Sbjct: 177 NLINQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFAGPLEV 236
Query: 222 --KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSV 279
+FL++FSKFDW+++C+SL GPV ISSLP+V E P GGDLLLS FL C ++V
Sbjct: 237 LYRFLEFFSKFDWENFCVSLWGPVPISSLPDVTAEPPRKDGGDLLLSKLFLDACSSVYAV 296
Query: 280 PSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE 339
G + + F KH N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA+KL +L PE
Sbjct: 297 FPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKKLARLLDCPE 356
Query: 340 ESLTDELRKFFSNTLDRHGSGQRPDV 365
E L E+ +FF NT +RHGSG+RPDV
Sbjct: 357 EELFSEVNQFFFNTWERHGSGERPDV 382
>gi|225462743|ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera]
Length = 1353
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/387 (55%), Positives = 272/387 (70%), Gaps = 24/387 (6%)
Query: 1 MGDLRDWSPEPNG-AVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG W+ +PNG + G P+ ++S A+ E AEE T+ +IA +QP S
Sbjct: 1 MGGHEGWA-QPNGFSPNGLLPNEAASVT----RALDQERLSLAEERTKQLIACIQPNQPS 55
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVC 118
EERR+AV YV+ LI C+VFPFGSVPLKTYLPDGDIDLTAF N+++ AN+V
Sbjct: 56 EERREAVASYVKSLIMKCFSCKVFPFGSVPLKTYLPDGDIDLTAFSKSPNLKDTWANEVR 115
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
+LERE+++ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+VD L
Sbjct: 116 DILEREEKSGDAEFRVKEVQYIQAEVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHL 175
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLY 221
I + HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLY
Sbjct: 176 ISQKHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRVFNNSFAGPLEVLY 235
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FL++FSKFDW++YC+SL GPV ISSLP+V + P G+LLLS FL C ++V
Sbjct: 236 RFLEFFSKFDWENYCVSLWGPVPISSLPDVTADPPRKDSGELLLSKLFLDACSSVYAVLP 295
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
G + + F K+ N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L +L P+++
Sbjct: 296 VGQENPEQPFISKYFNVIDPLRTNNNLGRSVSKGNFFRIRSAFAFGAQRLARLLDCPKDN 355
Query: 342 LTDELRKFFSNTLDRHGSGQRPDVQDP 368
+ E+ +FF NT +RHG G RPD P
Sbjct: 356 VIAEVNQFFMNTWERHGKGDRPDAPSP 382
>gi|302143676|emb|CBI22537.3| unnamed protein product [Vitis vinifera]
Length = 1359
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/387 (55%), Positives = 272/387 (70%), Gaps = 24/387 (6%)
Query: 1 MGDLRDWSPEPNG-AVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG W+ +PNG + G P+ ++S A+ E AEE T+ +IA +QP S
Sbjct: 1 MGGHEGWA-QPNGFSPNGLLPNEAASVT----RALDQERLSLAEERTKQLIACIQPNQPS 55
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVC 118
EERR+AV YV+ LI C+VFPFGSVPLKTYLPDGDIDLTAF N+++ AN+V
Sbjct: 56 EERREAVASYVKSLIMKCFSCKVFPFGSVPLKTYLPDGDIDLTAFSKSPNLKDTWANEVR 115
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
+LERE+++ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+VD L
Sbjct: 116 DILEREEKSGDAEFRVKEVQYIQAEVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHL 175
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLY 221
I + HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLY
Sbjct: 176 ISQKHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRVFNNSFAGPLEVLY 235
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FL++FSKFDW++YC+SL GPV ISSLP+V + P G+LLLS FL C ++V
Sbjct: 236 RFLEFFSKFDWENYCVSLWGPVPISSLPDVTADPPRKDSGELLLSKLFLDACSSVYAVLP 295
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
G + + F K+ N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L +L P+++
Sbjct: 296 VGQENPEQPFISKYFNVIDPLRTNNNLGRSVSKGNFFRIRSAFAFGAQRLARLLDCPKDN 355
Query: 342 LTDELRKFFSNTLDRHGSGQRPDVQDP 368
+ E+ +FF NT +RHG G RPD P
Sbjct: 356 VIAEVNQFFMNTWERHGKGDRPDAPSP 382
>gi|242041009|ref|XP_002467899.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
gi|241921753|gb|EER94897.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
Length = 1046
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 201/334 (60%), Positives = 246/334 (73%), Gaps = 17/334 (5%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ +V+PT SE RR V+DY +RL+ + LGCEVF FGSVPLKTYLPDGDIDLT G +
Sbjct: 36 VVRRVRPTEASERRRADVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTS 95
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
+ L NDV +LE E+QN AEF+VK+ + I AEV+L+KC + NI++DISFNQ GG+
Sbjct: 96 YDSTLVNDVYCILESEEQNSDAEFIVKNLERIDAEVRLIKCTIGNIIIDISFNQTGGICA 155
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL---------- 218
LCFLE VDR +GK+HLFKRSIILIKAWCYYESR+LGAHHGLISTYALE L
Sbjct: 156 LCFLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYIFNLFHK 215
Query: 219 -------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
VLY+FL+YFSKFDWD+YCISLNGPV +SSLP + VE DLL EFLK
Sbjct: 216 SLHSPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLTVEATITHTSDLLFDKEFLK 275
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
+++ +VP + D+ F PKHLNIVDPLKE+NNLGRSV++ +F RIR+AF YGARKL
Sbjct: 276 SSMDKATVPPKNSDSCYTRFRPKHLNIVDPLKEHNNLGRSVNRASFNRIRTAFLYGARKL 335
Query: 332 GHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
GHIL P E + DE+ FF NTL+R+G G RPD+
Sbjct: 336 GHILMLPSEVIPDEIYGFFKNTLERNGIGVRPDI 369
>gi|414865287|tpg|DAA43844.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
gi|414865288|tpg|DAA43845.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
Length = 1332
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 209/353 (59%), Positives = 255/353 (72%), Gaps = 18/353 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE T +IA++QP SE RR AV YVQRLI N L C+VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 94 LPDGDIDLTAFGGLN-VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF ++E AN V LERE++N+ AEF VK+ Q I+AEVK++KCLV+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE++D LI ++HLFKRSIILIKAWC+YESRILGAHHGLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETLVLY +FL++FS FDW+ +C+SL GPV ISSLP++ E
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P G+LLL+ FL C + V + +S+ F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRSVSKG 319
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
NF+RIRSAF YGA++LG +L P+E L EL +FF+NT RHGSG RPDV P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRHGSGSRPDVPTP 372
>gi|414865289|tpg|DAA43846.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
Length = 1348
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 209/353 (59%), Positives = 255/353 (72%), Gaps = 18/353 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE T +IA++QP SE RR AV YVQRLI N L C+VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 94 LPDGDIDLTAFGGLN-VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF ++E AN V LERE++N+ AEF VK+ Q I+AEVK++KCLV+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE++D LI ++HLFKRSIILIKAWC+YESRILGAHHGLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETLVLY +FL++FS FDW+ +C+SL GPV ISSLP++ E
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P G+LLL+ FL C + V + +S+ F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRSVSKG 319
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
NF+RIRSAF YGA++LG +L P+E L EL +FF+NT RHGSG RPDV P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRHGSGSRPDVPTP 372
>gi|414866687|tpg|DAA45244.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
Length = 1050
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/358 (57%), Positives = 252/358 (70%), Gaps = 25/358 (6%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+V+PT SE RR V+DY +RL+ + LGCEVF FGSVPLKTYLPDGDIDLT G + +
Sbjct: 37 RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDS 96
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
L NDV +LE E+QN AEFVVKD + I AEV+L+KC + NI+VDISFNQ GG+ LCF
Sbjct: 97 TLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCF 156
Query: 172 LEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL------------- 218
LE VDR +GK+HLFKRSIILIKAWCYYESR+LGAHHGLISTYALE L
Sbjct: 157 LELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSLH 216
Query: 219 ----VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECV 274
VLY+FL+YFSKFDWD+YCISLNGPV +SSLP ++VE DLL EFLK +
Sbjct: 217 SPVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSSM 276
Query: 275 EQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHI 334
++ +VP + D+ F PKHLNIVDPLKE NNLGRSV++ +F RIR+AF YGARKLGHI
Sbjct: 277 DKATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGHI 336
Query: 335 LSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCRED 392
++ P E + DE+ +FF NTL R+ G RPD+ + + S+F E ED
Sbjct: 337 VTLPSEVIPDEIYEFFKNTLGRNELGARPDID--------SNYAFHSSFGTAETILED 386
>gi|414866686|tpg|DAA45243.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
Length = 1056
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/358 (57%), Positives = 252/358 (70%), Gaps = 25/358 (6%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+V+PT SE RR V+DY +RL+ + LGCEVF FGSVPLKTYLPDGDIDLT G + +
Sbjct: 37 RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDS 96
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
L NDV +LE E+QN AEFVVKD + I AEV+L+KC + NI+VDISFNQ GG+ LCF
Sbjct: 97 TLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCF 156
Query: 172 LEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL------------- 218
LE VDR +GK+HLFKRSIILIKAWCYYESR+LGAHHGLISTYALE L
Sbjct: 157 LELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSLH 216
Query: 219 ----VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECV 274
VLY+FL+YFSKFDWD+YCISLNGPV +SSLP ++VE DLL EFLK +
Sbjct: 217 SPVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSSM 276
Query: 275 EQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHI 334
++ +VP + D+ F PKHLNIVDPLKE NNLGRSV++ +F RIR+AF YGARKLGHI
Sbjct: 277 DKATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGHI 336
Query: 335 LSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCRED 392
++ P E + DE+ +FF NTL R+ G RPD+ + + S+F E ED
Sbjct: 337 VTLPSEVIPDEIYEFFKNTLGRNELGARPDID--------SNYAFHSSFGTAETILED 386
>gi|224146203|ref|XP_002325920.1| predicted protein [Populus trichocarpa]
gi|222862795|gb|EEF00302.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/354 (58%), Positives = 255/354 (72%), Gaps = 21/354 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
I E W AE+ TQ I+ +QPT SE +R VI+Y+Q LI+ Y EVF FGSVPLKTY
Sbjct: 28 IDPELWLMAEKRTQEILYTIQPTFASEHKRMEVINYIQSLIKYYFTVEVFAFGSVPLKTY 87
Query: 94 LPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN 153
LPDGDIDL N+EE LA VC++L+RE+ + EF V D Q I A+VKLVKC V+N
Sbjct: 88 LPDGDIDLMVLSHQNMEEELARGVCTLLQREELD--PEFQVNDVQYIHAQVKLVKCSVKN 145
Query: 154 IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 213
I VDISFNQ+ G S LCFLEQVD+LIG+DHLFKRSIILIKAWC+YESRILGAHHGLISTY
Sbjct: 146 ISVDISFNQMAGPSALCFLEQVDQLIGQDHLFKRSIILIKAWCFYESRILGAHHGLISTY 205
Query: 214 ALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETP 256
AL+ L VLYKFLDY+S FDWD+YC+S+NGP+ ISS P+ ++
Sbjct: 206 ALQILVLNIINVFHSSLPDPLAVLYKFLDYYSAFDWDNYCVSINGPIPISSFPQ--TDST 263
Query: 257 ENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGN 316
N+G + L+S EFL+ E+F+ P + + + FP KHLNIVDPLK +NNLGRSV+KGN
Sbjct: 264 HNNGNESLISQEFLRNFREKFAFPMKELENGAHEFPIKHLNIVDPLKSSNNLGRSVNKGN 323
Query: 317 FYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVP 370
F+RIR A +YGA++LG I++ P E++ L KFF NTLDR+G GQRPD PVP
Sbjct: 324 FHRIRGALSYGAQRLGEIIALPGEAMGGRLEKFFMNTLDRNGRGQRPDADVPVP 377
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 1/104 (0%)
Query: 604 SDLSGDYESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNII 663
SDL+GDY+ + L H W++ +AL P SP SQ + K++ D++ + L +++I+
Sbjct: 387 SDLNGDYDKYYSGLLHGQWYHSYALPVPPQPSSPSSPSQIKQKSARDVLPQLLQSKQDIL 446
Query: 664 PQMSANGAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPN 707
Q PR +P + S G++ M K RGTGTY P+
Sbjct: 447 SQRGTEVFFPRQKCHPYASQVHVAIS-GIDTMRKSRGTGTYIPD 489
>gi|225454502|ref|XP_002277075.1| PREDICTED: uncharacterized protein LOC100241322 [Vitis vinifera]
Length = 1295
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/384 (55%), Positives = 269/384 (70%), Gaps = 25/384 (6%)
Query: 1 MGDLRDWSPEPNGAV-FGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG W+ +P G + G P+ SS++ + E W AEE T +IA +QP S
Sbjct: 1 MGQHEGWA-QPTGLLPNGLLPNEGSSAI----RVLDTERWLIAEERTAELIACIQPNQPS 55
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVEEALANDVC 118
EE R AV DYVQR++ C+VF FGSVPLKTYLPDGDIDLTAF N+++ AN V
Sbjct: 56 EELRNAVADYVQRIVVQCFPCQVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWANQVR 115
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
+L+ E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+VD L
Sbjct: 116 DMLQSEEKNENAEFRVKEVQYIQAEVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHL 175
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLY 221
I ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLY
Sbjct: 176 INQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLY 235
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FL++FS FDWD++C+SL GPV ISSLP+V E P G+LLLS FL C ++V
Sbjct: 236 RFLEFFSSFDWDNFCVSLWGPVPISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFP 295
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
G + +SF KH N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L +L P+E+
Sbjct: 296 HGQEKQGQSFISKHFNVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLL-DPKEN 354
Query: 342 LTDELRKFFSNTLDRHGSGQRPDV 365
+ E+ + F NT +RHGSG RPD
Sbjct: 355 IIFEVNQLFMNTWERHGSGHRPDT 378
>gi|297745424|emb|CBI40504.3| unnamed protein product [Vitis vinifera]
Length = 1229
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/384 (55%), Positives = 269/384 (70%), Gaps = 25/384 (6%)
Query: 1 MGDLRDWSPEPNGAV-FGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG W+ +P G + G P+ SS++ + E W AEE T +IA +QP S
Sbjct: 1 MGQHEGWA-QPTGLLPNGLLPNEGSSAI----RVLDTERWLIAEERTAELIACIQPNQPS 55
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVEEALANDVC 118
EE R AV DYVQR++ C+VF FGSVPLKTYLPDGDIDLTAF N+++ AN V
Sbjct: 56 EELRNAVADYVQRIVVQCFPCQVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWANQVR 115
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
+L+ E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+VD L
Sbjct: 116 DMLQSEEKNENAEFRVKEVQYIQAEVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHL 175
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLY 221
I ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL VLY
Sbjct: 176 INQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLY 235
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FL++FS FDWD++C+SL GPV ISSLP+V E P G+LLLS FL C ++V
Sbjct: 236 RFLEFFSSFDWDNFCVSLWGPVPISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFP 295
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
G + +SF KH N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L +L P+E+
Sbjct: 296 HGQEKQGQSFISKHFNVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLL-DPKEN 354
Query: 342 LTDELRKFFSNTLDRHGSGQRPDV 365
+ E+ + F NT +RHGSG RPD
Sbjct: 355 IIFEVNQLFMNTWERHGSGHRPDT 378
>gi|414882102|tpg|DAA59233.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
Length = 875
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/351 (58%), Positives = 255/351 (72%), Gaps = 17/351 (4%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
++I + W+R E A GI+ +QP+ SE R A+IDYVQRL+ ++ G +VFPFGSVPLK
Sbjct: 22 SSIPRDAWRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFPFGSVPLK 81
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLT FG +E LAN+VC++L+ E+ K +EF VKD Q I AEVKLVKC+V
Sbjct: 82 TYLPDGDIDLTTFGPAISDEKLANEVCAILKSEEHRKDSEFDVKDVQYIHAEVKLVKCVV 141
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNI+VDIS NQ+GGL TLCFLE+VD GK HLFKRS++LIK WCYYE+RILGAHHGLIS
Sbjct: 142 QNIIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILGAHHGLIS 201
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALE L VLY+FLDY+S+FDWD+ ISL GPV +SSLP++V +
Sbjct: 202 TYALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSSLPDLVTD 261
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
P LL +FL+EC + FSVP R + +++ F K LNIVDPLK++NNLGRSVS+
Sbjct: 262 PPVIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNNLGRSVSR 321
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
GNFYRIRSAF +GARKLG IL +P DE+ +FF NTL R+ G R DV
Sbjct: 322 GNFYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRNRIGFRQDV 372
>gi|302755776|ref|XP_002961312.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
gi|300172251|gb|EFJ38851.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
Length = 351
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 216/351 (61%), Positives = 260/351 (74%), Gaps = 29/351 (8%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E W +AE T +I ++QPT SE+RR+AV DYV+RLIR CEVF FGSVPL+TYLPD
Sbjct: 4 ERWVQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTYLPD 63
Query: 97 GDIDLTAFGG-LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEV-KLVKCLVQNI 154
GDIDLTAF G +++E+ ANDV +VLE E+++K AEF VK+ Q I+AEV K++KCLV+NI
Sbjct: 64 GDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLVENI 123
Query: 155 VVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 214
VVDISFNQLGGL TLCFLE+VDRLIG+DHLFKRSIIL+KAWCYYESRILGAHHGLISTYA
Sbjct: 124 VVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLISTYA 183
Query: 215 LETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPE 257
LETL VLY+FL++FS FDWD YC+SL GP+ +S+LP++
Sbjct: 184 LETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM------ 237
Query: 258 NSGGDLLLSSEFLKECVEQFSVPS----RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVS 313
GG LLL+ FL C ++V G SR F K+LN+VDPLK NNLGRSV+
Sbjct: 238 QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGRSVN 297
Query: 314 KGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPD 364
KGNFYRIR+AF +GARKL IL P E + DE+ KFF NT DRHGSG+RPD
Sbjct: 298 KGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRHGSGRRPD 348
>gi|302802985|ref|XP_002983246.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
gi|300148931|gb|EFJ15588.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
Length = 351
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 216/351 (61%), Positives = 260/351 (74%), Gaps = 29/351 (8%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E W +AE T +I ++QPT SE+RR+AV DYV+RLIR CEVF FGSVPL+TYLPD
Sbjct: 4 ERWLQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTYLPD 63
Query: 97 GDIDLTAFGG-LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEV-KLVKCLVQNI 154
GDIDLTAF G +++E+ ANDV +VLE E+++K AEF VK+ Q I+AEV K++KCLV+NI
Sbjct: 64 GDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLVENI 123
Query: 155 VVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 214
VVDISFNQLGGL TLCFLE+VDRLIG+DHLFKRSIIL+KAWCYYESRILGAHHGLISTYA
Sbjct: 124 VVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLISTYA 183
Query: 215 LETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPE 257
LETL VLY+FL++FS FDWD YC+SL GP+ +S+LP++
Sbjct: 184 LETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM------ 237
Query: 258 NSGGDLLLSSEFLKECVEQFSVPS----RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVS 313
GG LLL+ FL C ++V G SR F K+LN+VDPLK NNLGRSV+
Sbjct: 238 QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGRSVN 297
Query: 314 KGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPD 364
KGNFYRIR+AF +GARKL IL P E + DE+ KFF NT DRHGSG+RPD
Sbjct: 298 KGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRHGSGRRPD 348
>gi|414882101|tpg|DAA59232.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
Length = 906
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/351 (58%), Positives = 255/351 (72%), Gaps = 17/351 (4%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
++I + W+R E A GI+ +QP+ SE R A+IDYVQRL+ ++ G +VFPFGSVPLK
Sbjct: 22 SSIPRDAWRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFPFGSVPLK 81
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLT FG +E LAN+VC++L+ E+ K +EF VKD Q I AEVKLVKC+V
Sbjct: 82 TYLPDGDIDLTTFGPAISDEKLANEVCAILKSEEHRKDSEFDVKDVQYIHAEVKLVKCVV 141
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNI+VDIS NQ+GGL TLCFLE+VD GK HLFKRS++LIK WCYYE+RILGAHHGLIS
Sbjct: 142 QNIIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILGAHHGLIS 201
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALE L VLY+FLDY+S+FDWD+ ISL GPV +SSLP++V +
Sbjct: 202 TYALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSSLPDLVTD 261
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
P LL +FL+EC + FSVP R + +++ F K LNIVDPLK++NNLGRSVS+
Sbjct: 262 PPVIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNNLGRSVSR 321
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
GNFYRIRSAF +GARKLG IL +P DE+ +FF NTL R+ G R DV
Sbjct: 322 GNFYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRNRIGFRQDV 372
>gi|255564741|ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis]
gi|223537453|gb|EEF39081.1| hypothetical protein RCOM_0719270 [Ricinus communis]
Length = 1334
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/350 (60%), Positives = 259/350 (74%), Gaps = 18/350 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ +E W +AEE T +I ++P SE RR AV DYV+RLI C VF FGSVPLKTY
Sbjct: 24 LDSERWAKAEERTAELIDCIKPNEPSERRRNAVADYVERLITKCFPCRVFTFGSVPLKTY 83
Query: 94 LPDGDIDLTAFG-GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDIDLTAF G +++E A+ V VLE E++N+ AEF VK+ Q I+AEVK++KCLV+
Sbjct: 84 LPDGDIDLTAFSEGQSMKETWAHQVRDVLENEEKNENAEFRVKEVQYIQAEVKIIKCLVE 143
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISF+QLGGL TLCFLE+VD LI +DHLFK+SIILIKAWCYYESRILGAHHGLIST
Sbjct: 144 NIVVDISFDQLGGLCTLCFLEEVDHLINQDHLFKKSIILIKAWCYYESRILGAHHGLIST 203
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETLVLY +FL++FSKFDWD++C+SL GPV ISSLP+V E
Sbjct: 204 YALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPDVTAEP 263
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P GG+LLLS FLK C ++V G ++ ++F KH N++DPL+ NNNLGRSVSKG
Sbjct: 264 PRKDGGELLLSKLFLKACGAVYAVSPGGPESQGQTFTSKHFNVIDPLRVNNNLGRSVSKG 323
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
NF+RIRSAF +GA++L +L P+E + E+ +FF NT DRHGSG RPD
Sbjct: 324 NFFRIRSAFAFGAKRLARLLDCPKEDIHFEVNQFFMNTWDRHGSGLRPDA 373
>gi|356560284|ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max]
Length = 1337
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 219/386 (56%), Positives = 276/386 (71%), Gaps = 25/386 (6%)
Query: 1 MGDLRDWSPEPNGAV-FGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG+ W+ P+G + G P+ ++S + + +E W +AE+ T +IA +QP S
Sbjct: 1 MGEHEGWAQAPSGLLPNGLLPNEAASVI----QVLDSERWLKAEQRTAELIACIQPNPPS 56
Query: 60 EERRKAVIDYVQRLIRNYLGCEV--FPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALAND 116
EERR AV DYVQRLI C+V F FGSVPLKTYLPDGDIDLTAF N++++ A+
Sbjct: 57 EERRNAVADYVQRLIMKCFPCQVRVFTFGSVPLKTYLPDGDIDLTAFSKNQNLKDSWAHQ 116
Query: 117 VCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVD 176
V +LE E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+VD
Sbjct: 117 VRDMLENEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVD 176
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY--------------- 221
LI ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY
Sbjct: 177 NLINQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNSFAGPLEV 236
Query: 222 --KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSV 279
+FL++FSKFDW+++C+SL GPV ISSLP+V E P GGDLLLS FL C ++V
Sbjct: 237 LYRFLEFFSKFDWENFCVSLWGPVPISSLPDVTAEPPRKDGGDLLLSKLFLDACSSVYAV 296
Query: 280 PSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE 339
G + + F KH N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L +L E
Sbjct: 297 FPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLLDCSE 356
Query: 340 ESLTDELRKFFSNTLDRHGSGQRPDV 365
+ L E+ +FF NT +RHGSG+RPDV
Sbjct: 357 DELFSEVNQFFFNTWERHGSGERPDV 382
>gi|413956606|gb|AFW89255.1| hypothetical protein ZEAMMB73_893455 [Zea mays]
Length = 1316
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/355 (58%), Positives = 256/355 (72%), Gaps = 20/355 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE+ T +IA +QP V SE RR AV YVQRLI N L C+VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEDRTAELIACIQPNVYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 94 LPDGDIDLTAFGGLN-VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF ++E AN V LERE++++ AEF VK+ Q I+AEVK++KCLV+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKDENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE++D LI ++HLFKRSIILIKAWC+YESRILGAHHGLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETLVLY +FL++FS FDW+ +C+SL GPV ISSLP++
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAIP 259
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P G+LLL+ FL C + V + + F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRMDSGELLLNKSFLDTCSSAYGVVPHTQENQGQPFVSKHFNVIDPLRTNNNLGRSVSKG 319
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVP 370
NF+RIRSAF YGA++LG +L P+E+L EL +FF+NT RHGSG RPDV P+P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEALIPELNQFFTNTWIRHGSGSRPDV--PIP 372
>gi|242069725|ref|XP_002450139.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
gi|241935982|gb|EES09127.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
Length = 835
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 219/382 (57%), Positives = 263/382 (68%), Gaps = 28/382 (7%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
M D+ + SP P P+S + + W+R E A ++ ++QPT SE
Sbjct: 1 MVDISECSPVPESVPAHPDPAS-----------VSPDAWRRFETAALAVVNKIQPTAASE 49
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120
+ R AVI+YVQRL +VFPFGSVPLKTYLPDGDIDLT FG +E LAN+VC++
Sbjct: 50 QLRAAVIEYVQRLFWFQARYQVFPFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCAI 109
Query: 121 LEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIG 180
L+ E++ K +EF VKD + AEVKLVKCLVQNIVVDIS NQ+GGL TLCFLE+VD+ G
Sbjct: 110 LKSEERRKDSEFEVKDVHYVPAEVKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQNFG 169
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKF 223
K+HLFKRSI+L+K WCYYESRILGAHHGLISTYALETL VLY+F
Sbjct: 170 KNHLFKRSIMLVKDWCYYESRILGAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRF 229
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LDY+SKFDWD+ ISL GPV +SSLPE+V + P+ D L EFLKEC E FSV R
Sbjct: 230 LDYYSKFDWDNKGISLFGPVSLSSLPELVTDPPDTQDDDFLQREEFLKECTESFSVLPRN 289
Query: 284 FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLT 343
+TN R F + LNIVDPLK++NNLGRSVSKGNFYRIRSAF +GARKLG IL P
Sbjct: 290 SETNPRVFSRRFLNIVDPLKQSNNLGRSVSKGNFYRIRSAFDFGARKLGKILQVPSCLTV 349
Query: 344 DELRKFFSNTLDRHGSGQRPDV 365
E+ +FF NTL R+ +G RPDV
Sbjct: 350 SEVNQFFRNTLKRNRTGLRPDV 371
>gi|108706800|gb|ABF94595.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 1316
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/351 (57%), Positives = 255/351 (72%), Gaps = 19/351 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE T +IA++QP SE RR+AV DYV+RLI N L C+VF FGSVPLKTY
Sbjct: 18 LDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVPLKTY 77
Query: 94 LPDGDIDLTAFG-GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF +++ AN V LE E++++ AEF VK+ Q I+AEVK++KCLV
Sbjct: 78 LPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIKCLVD 137
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE+VD LI ++HLFKRSIILIKAWC+YESRILGAHHGLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 197
Query: 213 YALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETL VLY+FL++FS FDW+ +C+SL+GPV ISSLP++ E
Sbjct: 198 YALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDMTAEP 257
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS-FPPKHLNIVDPLKENNNLGRSVSK 314
P +LLLS FL +C ++V R ++ + F KH N++DPL+ NNNLGRSVSK
Sbjct: 258 PRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRSVSK 317
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
GNF+RIRSAF++GA++L +L P+E L E+ +FF+NT RHGSG RPD
Sbjct: 318 GNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 368
>gi|218192316|gb|EEC74743.1| hypothetical protein OsI_10487 [Oryza sativa Indica Group]
Length = 1316
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/351 (57%), Positives = 255/351 (72%), Gaps = 19/351 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE T +IA++QP SE RR+AV DYV+RLI N L C+VF FGSVPLKTY
Sbjct: 18 LDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVPLKTY 77
Query: 94 LPDGDIDLTAFG-GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF +++ AN V LE E++++ AEF VK+ Q I+AEVK++KCLV
Sbjct: 78 LPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIKCLVD 137
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE+VD LI ++HLFKRSIILIKAWC+YESRILGAHHGLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 197
Query: 213 YALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETL VLY+FL++FS FDW+ +C+SL+GPV ISSLP++ E
Sbjct: 198 YALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDMTAEP 257
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS-FPPKHLNIVDPLKENNNLGRSVSK 314
P +LLLS FL +C ++V R ++ + F KH N++DPL+ NNNLGRSVSK
Sbjct: 258 PRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRSVSK 317
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
GNF+RIRSAF++GA++L +L P+E L E+ +FF+NT RHGSG RPD
Sbjct: 318 GNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 368
>gi|357113459|ref|XP_003558520.1| PREDICTED: uncharacterized protein LOC100841269 [Brachypodium
distachyon]
Length = 1305
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 201/353 (56%), Positives = 249/353 (70%), Gaps = 18/353 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
+ E W AE T +IA++QP SE RR AV +YV+RLI N L CEVF FGSVPLKTY
Sbjct: 18 LDPERWAVAESRTAELIARIQPNAHSEGRRLAVYNYVRRLIMNCLSCEVFTFGSVPLKTY 77
Query: 94 LPDGDIDLTAFGGLN-VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
LPDGDID+TAF +++ AN V LE E++++ AEF VK+ Q I+AEVK++KCLV
Sbjct: 78 LPDGDIDVTAFSNSEELKDTWANLVRDALEHEEKSENAEFCVKEVQYIQAEVKIIKCLVD 137
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQ+GGL TLCFLE+VD LI HLFKRSIIL+KAWC+YESRILGAHHGLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDNLINHSHLFKRSIILVKAWCFYESRILGAHHGLIST 197
Query: 213 YALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALETLVLY +FL++F FDW+ +C+SL GPV ISSLP++ E
Sbjct: 198 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFGNFDWEKFCLSLWGPVPISSLPDMTAEP 257
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
P G+LLL FL C + + V R +T + F KH N++DPL+ NNNLGRSV KG
Sbjct: 258 PRMDTGELLLGKPFLDNCNQAYGVMPRTQETQGQPFVSKHFNVIDPLRTNNNLGRSVGKG 317
Query: 316 NFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
N++RIRSAF +GA+KL +L P+E + E+ +FF+NTL RHGSG RPD P
Sbjct: 318 NYFRIRSAFCFGAKKLAKLLECPKEDIITEVNQFFTNTLTRHGSGNRPDAPTP 370
>gi|357112328|ref|XP_003557961.1| PREDICTED: uncharacterized protein LOC100823912 [Brachypodium
distachyon]
Length = 1051
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 212/366 (57%), Positives = 250/366 (68%), Gaps = 18/366 (4%)
Query: 17 GERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN 76
G P+ + +VP AI AE A ++ +VQPT SE RR VIDY +R++
Sbjct: 7 GTLPAVMARAVPG-PAAIPTGAMAAAEAAAAEVVRRVQPTEASERRRAEVIDYARRIVGT 65
Query: 77 YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKD 136
LGCEVF FGSVPLKTYLPDGDIDLT G + + L +DV +L +QN AEF VKD
Sbjct: 66 ALGCEVFAFGSVPLKTYLPDGDIDLTVLGNASCDSTLIDDVYCILGSGEQNSDAEFEVKD 125
Query: 137 AQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWC 196
+ I AEVKL+KC ++NI+VDISFNQ GG+ LCFLE VDR IGK+HLFKRSIILIKAWC
Sbjct: 126 LEHIDAEVKLIKCTIENIIVDISFNQTGGICALCFLELVDRKIGKNHLFKRSIILIKAWC 185
Query: 197 YYESRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISL 239
YYESR+LGAHHGLISTYALETL VLY+FL+YFSKFDWD+YCISL
Sbjct: 186 YYESRLLGAHHGLISTYALETLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISL 245
Query: 240 NGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIV 299
NGPV +SSLP ++VE DLL EFL VE+ SVP R D F KHLNI+
Sbjct: 246 NGPVALSSLPNLIVEGTNIPVDDLLFDKEFLHSSVEKASVPPRDSDARCTKFRVKHLNII 305
Query: 300 DPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGS 359
DPLKE NNLGRSV+K NF RIR+AF+YGARKLG L P E ++ E+ FF NTL R+G
Sbjct: 306 DPLKECNNLGRSVNKANFSRIRTAFSYGARKLGQYLMLPSERISGEIFGFFKNTLKRNGR 365
Query: 360 GQRPDV 365
G R D+
Sbjct: 366 GVRADI 371
>gi|224118186|ref|XP_002317752.1| predicted protein [Populus trichocarpa]
gi|222858425|gb|EEE95972.1| predicted protein [Populus trichocarpa]
Length = 353
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/350 (58%), Positives = 253/350 (72%), Gaps = 18/350 (5%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E W AEE T +IA +QP SEERR AV+ YVQRLI C+VF FGSVPLKTYLPD
Sbjct: 4 ERWAIAEERTAELIACIQPNQPSEERRTAVLGYVQRLIMKCFPCQVFTFGSVPLKTYLPD 63
Query: 97 GDIDLTAFG-GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIV 155
GDID+T F ++++ A++V +L+ E++++ AEF VK+ Q I+AEVK++KCLV+NIV
Sbjct: 64 GDIDITVFTESQDLKKTWADEVKDILQHEEKSENAEFHVKEVQYIQAEVKIIKCLVENIV 123
Query: 156 VDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
VDISFNQLGGL TLCFLE+VD+LI ++HLFKRSIILIKAWCYYESRILGAHHGLISTYAL
Sbjct: 124 VDISFNQLGGLCTLCFLEEVDQLISQNHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 183
Query: 216 ETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPEN 258
ETL VLY+FL++FSKFDW+ +CISL GPV ISSLP V +P
Sbjct: 184 ETLVLYIFHVFNNRFAGPLEVLYRFLEFFSKFDWEHFCISLWGPVPISSLPNVTALSPRE 243
Query: 259 SGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFY 318
GG +LLS FL+ C ++V + +SF K+ N++DPL+ NNNLGRSVSKGNFY
Sbjct: 244 DGGQILLSQLFLEVCSSVYAVFPSQQENQEQSFVSKYFNVIDPLRTNNNLGRSVSKGNFY 303
Query: 319 RIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
RIRSAF +GA++L +L P+E+L E +FF NT DRH G RPD P
Sbjct: 304 RIRSAFAFGAQRLARLLDCPKENLLAEFNQFFMNTWDRHCKGHRPDAPSP 353
>gi|115452887|ref|NP_001050044.1| Os03g0336700 [Oryza sativa Japonica Group]
gi|108708028|gb|ABF95823.1| expressed protein [Oryza sativa Japonica Group]
gi|113548515|dbj|BAF11958.1| Os03g0336700 [Oryza sativa Japonica Group]
Length = 1035
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/344 (58%), Positives = 246/344 (71%), Gaps = 22/344 (6%)
Query: 41 RAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDID 100
RAEEA ++ +V+PT SE RR AV+ Y +RL+ LGCEVF +GSVPLKTYLPDGD+D
Sbjct: 35 RAEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVD 94
Query: 101 LTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISF 160
LT G + L +D+ +L+ E+QN AEF VKD QLI AEV+L+KC ++NIVVDISF
Sbjct: 95 LTVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVVDISF 154
Query: 161 NQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-- 218
NQ GG+ LCFLE VDR +GK+HL K SIILIKAWCYYESR+LGAHHGLISTYALETL
Sbjct: 155 NQTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALETLIL 214
Query: 219 ---------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDL 263
VLY+FL+YFSKFDWD+YCISLNGPV +SSLP +VE G DL
Sbjct: 215 YIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTPGSDL 274
Query: 264 LLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSA 323
L EFL V++ S +T RS K+LNI+DPLKE+NNLGRSV+K +F RIR+A
Sbjct: 275 LFDKEFLNNSVQK--TDSNACNTEFRS---KYLNIIDPLKEHNNLGRSVNKASFNRIRTA 329
Query: 324 FTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
F+YGA+KLG +L E + DE+ FF NTL+R GSG RPD+ D
Sbjct: 330 FSYGAQKLGQVLLLQPELIPDEIYGFFKNTLNRIGSGVRPDIGD 373
>gi|108708029|gb|ABF95824.1| expressed protein [Oryza sativa Japonica Group]
Length = 1004
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/344 (58%), Positives = 246/344 (71%), Gaps = 22/344 (6%)
Query: 41 RAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDID 100
RAEEA ++ +V+PT SE RR AV+ Y +RL+ LGCEVF +GSVPLKTYLPDGD+D
Sbjct: 35 RAEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVD 94
Query: 101 LTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISF 160
LT G + L +D+ +L+ E+QN AEF VKD QLI AEV+L+KC ++NIVVDISF
Sbjct: 95 LTVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVVDISF 154
Query: 161 NQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-- 218
NQ GG+ LCFLE VDR +GK+HL K SIILIKAWCYYESR+LGAHHGLISTYALETL
Sbjct: 155 NQTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALETLIL 214
Query: 219 ---------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDL 263
VLY+FL+YFSKFDWD+YCISLNGPV +SSLP +VE G DL
Sbjct: 215 YIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTPGSDL 274
Query: 264 LLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSA 323
L EFL V++ S +T RS K+LNI+DPLKE+NNLGRSV+K +F RIR+A
Sbjct: 275 LFDKEFLNNSVQK--TDSNACNTEFRS---KYLNIIDPLKEHNNLGRSVNKASFNRIRTA 329
Query: 324 FTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
F+YGA+KLG +L E + DE+ FF NTL+R GSG RPD+ D
Sbjct: 330 FSYGAQKLGQVLLLQPELIPDEIYGFFKNTLNRIGSGVRPDIGD 373
>gi|297820390|ref|XP_002878078.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323916|gb|EFH54337.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 602
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/353 (56%), Positives = 250/353 (70%), Gaps = 20/353 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
I E W AEE I+ +QP +VS++ R +IDYV+ LI+++ G EVF FGSVPLKTY
Sbjct: 35 IEEESWMIAEERAHEILCTIQPALVSDKSRNEIIDYVRTLIKSHDGIEVFSFGSVPLKTY 94
Query: 94 LPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN 153
LPDGDIDLT N+++ +CS L+ E+ + +EF D Q I A+VK++KC ++N
Sbjct: 95 LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEE--RESEFHATDVQFIPAQVKVIKCNIRN 152
Query: 154 IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 213
I VDISFNQ GL LCFLEQVD+L G+DHLFKRSIIL+KAWCYYESRILGA+ GLISTY
Sbjct: 153 IAVDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLISTY 212
Query: 214 ALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETP 256
AL L VLYKFLDY+ FDW++YCIS+NGPV ISSLPE+ +P
Sbjct: 213 ALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAASP 272
Query: 257 ENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGN 316
EN G +LLL +FL+ CVE FS P++ D+N FP KHLNIVDPLK +NNLG+SV++GN
Sbjct: 273 EN-GHELLLDEKFLRNCVELFSAPTKAVDSNGLDFPIKHLNIVDPLKYSNNLGKSVTQGN 331
Query: 317 FYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
RIR AFT GARKL +LS P +++ L KFF N+L+R+G GQR DV DPV
Sbjct: 332 VQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVNDPV 384
>gi|42565972|ref|NP_191191.2| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
thaliana]
gi|30725328|gb|AAP37686.1| At3g56320 [Arabidopsis thaliana]
gi|110736147|dbj|BAF00045.1| hypothetical protein [Arabidopsis thaliana]
gi|332645988|gb|AEE79509.1| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/353 (56%), Positives = 249/353 (70%), Gaps = 20/353 (5%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
I A+ W AEE I+ +QP +VS+ R +IDYV+ LI ++ G EVF FGSVPLKTY
Sbjct: 35 IDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKTY 94
Query: 94 LPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN 153
LPDGDIDLT N+++ +CS L+ E+ + +EF D Q I A+VK++KC ++N
Sbjct: 95 LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEE--RESEFHATDVQFIPAQVKVIKCNIRN 152
Query: 154 IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 213
I VDISFNQ GL LCFLEQVD+L G+DHLFKRSIIL+KAWCYYESRILGA+ GLISTY
Sbjct: 153 IAVDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLISTY 212
Query: 214 ALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETP 256
AL L VLYKFLDY+ FDW++YCIS+NGPV ISSLPE+ +P
Sbjct: 213 ALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAASP 272
Query: 257 ENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGN 316
EN G +LLL +FL+ CVE +S P++ D+N FP KHLNIVDPLK +NNLG+SV++GN
Sbjct: 273 EN-GHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNNLGKSVTQGN 331
Query: 317 FYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
RIR AFT GARKL +LS P +++ L KFF N+L+R+G GQR DV DPV
Sbjct: 332 VQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVNDPV 384
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 17/109 (15%)
Query: 603 LSDLSGDYESHQISLNHVWWWYEHAL----NSSYSPMSPQLLSQFQSKNSWDLMQRSLPF 658
LS+LSGD++S L + ++ H+L Y P+S L + WD+++ + +
Sbjct: 394 LSELSGDFDSSFGRLVYGQMYHGHSLPGTFQHGYIPVSSHL-------SGWDIVRHLVTY 446
Query: 659 RRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPN 707
R+N S N + M P L G + M K RGTGTY P+
Sbjct: 447 RKNEFHLRSLNVSTS------MQPFPLHSLPNGCQNMRKTRGTGTYIPD 489
>gi|42566126|ref|NP_191728.2| nucleotidyltransferase [Arabidopsis thaliana]
gi|332646720|gb|AEE80241.1| nucleotidyltransferase [Arabidopsis thaliana]
Length = 1303
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 211/392 (53%), Positives = 273/392 (69%), Gaps = 35/392 (8%)
Query: 1 MGDLRDWSP--------EPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQ 52
MG+ W+ PNG + G+ ++S + P + AE W +AE+ T +IA
Sbjct: 1 MGEHESWAASPPSPSGLHPNGLLPGK---AASVTRP-----LDAERWAKAEDRTAKLIAC 52
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVE 110
+QP SE+RR AV YV+RLI + ++F FGSVPLKTYLPDGDIDLTAF N++
Sbjct: 53 IQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLK 112
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLC 170
++ AN V +LE+E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQ+GGL TLC
Sbjct: 113 DSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLC 172
Query: 171 FLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL------------ 218
FLE+VD I ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL
Sbjct: 173 FLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSF 232
Query: 219 -----VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKEC 273
VLY+FL++FSKFDW ++C+SL GPV +SSLP+V E P G+L +S F + C
Sbjct: 233 SGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLPDVTAEPPRRDVGELRVSEAFYRAC 292
Query: 274 VEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGH 333
++V + + F KH N++DPL+ENNNLGRSVSKGNF+RIRSAFT GA+KL
Sbjct: 293 SRVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLTR 352
Query: 334 ILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
+L P+E+L E+ +FF NT +RHGSG+RPD
Sbjct: 353 LLECPKENLIHEVNQFFMNTWERHGSGRRPDA 384
>gi|6850860|emb|CAB71099.1| putative protein [Arabidopsis thaliana]
Length = 1388
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 211/392 (53%), Positives = 273/392 (69%), Gaps = 35/392 (8%)
Query: 1 MGDLRDWSP--------EPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQ 52
MG+ W+ PNG + G+ ++S + P + AE W +AE+ T +IA
Sbjct: 1 MGEHESWAASPPSPSGLHPNGLLPGK---AASVTRP-----LDAERWAKAEDRTAKLIAC 52
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVE 110
+QP SE+RR AV YV+RLI + ++F FGSVPLKTYLPDGDIDLTAF N++
Sbjct: 53 IQPNPPSEDRRNAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLK 112
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLC 170
++ AN V +LE+E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQ+GGL TLC
Sbjct: 113 DSWANLVRDMLEKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLC 172
Query: 171 FLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL------------ 218
FLE+VD I ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL
Sbjct: 173 FLEEVDHYINQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSF 232
Query: 219 -----VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKEC 273
VLY+FL++FSKFDW ++C+SL GPV +SSLP+V E P G+L +S F + C
Sbjct: 233 SGPLEVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLPDVTAEPPRRDVGELRVSEAFYRAC 292
Query: 274 VEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGH 333
++V + + F KH N++DPL+ENNNLGRSVSKGNF+RIRSAFT GA+KL
Sbjct: 293 SRVYAVNIAPQEIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLTR 352
Query: 334 ILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
+L P+E+L E+ +FF NT +RHGSG+RPD
Sbjct: 353 LLECPKENLIHEVNQFFMNTWERHGSGRRPDA 384
>gi|297817502|ref|XP_002876634.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
gi|297322472|gb|EFH52893.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
Length = 1302
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 208/388 (53%), Positives = 268/388 (69%), Gaps = 27/388 (6%)
Query: 1 MGDLRDWSPEPNGAVF----GERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPT 56
MG+ W+ P G P ++S + AE W +AE+ T +IA +QP
Sbjct: 1 MGEHESWAASPPSPTLLYPNGLLPGKAASVT----RQLDAERWAKAEDRTAKLIACIQPN 56
Query: 57 VVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVEEALA 114
SE+RR AV YV+RLI + ++F FGSVPLKTYLPDGDIDLTAF N++++ A
Sbjct: 57 PPSEDRRNAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWA 116
Query: 115 NDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQ 174
N V +LE+E++N+ AEF VK+ Q I+AEVK++KCLV+NIVVDISFNQ+GGL TLCFLE+
Sbjct: 117 NLVRDMLEKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEE 176
Query: 175 VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL---------------- 218
VD I ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETL
Sbjct: 177 VDHYINQNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPL 236
Query: 219 -VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQF 277
VLY+FL++FSKFDW ++C+SL GPV +SSLP+V P G+L +S F + C + +
Sbjct: 237 EVLYRFLEFFSKFDWQNFCLSLWGPVPVSSLPDVTAAPPRKDVGELRVSEAFYRACSKVY 296
Query: 278 SVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQ 337
+V + + F KH N++DPL+ENNNLGRSVSKGNF+RIRSAFT GA+KL +L
Sbjct: 297 AVNIAPQEIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLARLLEC 356
Query: 338 PEESLTDELRKFFSNTLDRHGSGQRPDV 365
P+E+L E+ +FF NT +RHGSG+RPD
Sbjct: 357 PKENLIHEVNQFFMNTWERHGSGRRPDA 384
>gi|7572930|emb|CAB87431.1| putative protein [Arabidopsis thaliana]
Length = 614
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/364 (54%), Positives = 249/364 (68%), Gaps = 31/364 (8%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
I A+ W AEE I+ +QP +VS+ R +IDYV+ LI ++ G EVF FGSVPLKTY
Sbjct: 35 IDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKTY 94
Query: 94 LPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN 153
LPDGDIDLT N+++ +CS L+ E+ + +EF D Q I A+VK++KC ++N
Sbjct: 95 LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEE--RESEFHATDVQFIPAQVKVIKCNIRN 152
Query: 154 IVVDISFNQLGGLSTLCFLEQV-----------DRLIGKDHLFKRSIILIKAWCYYESRI 202
I VDISFNQ GL LCFLEQV D+L G+DHLFKRSIIL+KAWCYYESRI
Sbjct: 153 IAVDISFNQTAGLCALCFLEQVLSAIQNQAPEVDQLFGRDHLFKRSIILVKAWCYYESRI 212
Query: 203 LGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRI 245
LGA+ GLISTYAL LVLY KFLDY+ FDW++YCIS+NGPV I
Sbjct: 213 LGANTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPI 272
Query: 246 SSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKEN 305
SSLPE+ +PEN G +LLL +FL+ CVE +S P++ D+N FP KHLNIVDPLK +
Sbjct: 273 SSLPELTAASPEN-GHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYS 331
Query: 306 NNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
NNLG+SV++GN RIR AFT GARKL +LS P +++ L KFF N+L+R+G GQR DV
Sbjct: 332 NNLGKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDV 391
Query: 366 QDPV 369
DPV
Sbjct: 392 NDPV 395
Score = 42.7 bits (99), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 17/109 (15%)
Query: 603 LSDLSGDYESHQISLNHVWWWYEHAL----NSSYSPMSPQLLSQFQSKNSWDLMQRSLPF 658
LS+LSGD++S L + ++ H+L Y P+S L + WD+++ + +
Sbjct: 405 LSELSGDFDSSFGRLVYGQMYHGHSLPGTFQHGYIPVSSHL-------SGWDIVRHLVTY 457
Query: 659 RRNIIPQMSANGAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPN 707
R+N S N + M P L G + M K RGTGTY P+
Sbjct: 458 RKNEFHLRSLNVSTS------MQPFPLHSLPNGCQNMRKTRGTGTYIPD 500
>gi|356561857|ref|XP_003549193.1| PREDICTED: uncharacterized protein LOC100787145 [Glycine max]
Length = 684
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/361 (55%), Positives = 248/361 (68%), Gaps = 21/361 (5%)
Query: 26 SVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPF 85
S+PS +I E W+ AEE Q I+ ++P V+SE RK VIDYVQRLIR Y G EV PF
Sbjct: 11 SMPSQLLSIDEELWRMAEERAQEILWTIEPIVLSEVNRKDVIDYVQRLIRGYYGAEVLPF 70
Query: 86 GSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVK 145
GSVPLKTYLPDGDIDLTA + EE LA VC++L+ D E+ VKD Q IRA+V+
Sbjct: 71 GSVPLKTYLPDGDIDLTALSHEDAEEDLAQAVCNILQSGDD---PEYQVKDIQYIRAQVR 127
Query: 146 LVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGA 205
LVKC V+NI VDISFNQ+ G+ TL FLEQVD+L+GK+H+FK SIILIKAWCYYESR+LG
Sbjct: 128 LVKCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFKHSIILIKAWCYYESRLLGG 187
Query: 206 HHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSL 248
HHGL+STYA+E L VLY FLDY+ FDWD +S+ GP +SSL
Sbjct: 188 HHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYVSIWGPKPLSSL 247
Query: 249 PEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNL 308
PE + ETPE G+ LL EFL+ S PSR +T + FP K +NI+DPL+ +NNL
Sbjct: 248 PE-IAETPECDQGEFLLQKEFLRNYRNMCSFPSRASETMTHEFPVKFMNILDPLRNDNNL 306
Query: 309 GRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
GRSV+ N +R+R A +YGAR+L IL+ P E++ L KFF +TLDR+G G+R DV P
Sbjct: 307 GRSVNIANLHRVRFALSYGARRLKQILTLPGENMGAALEKFFFSTLDRNGKGERADVAVP 366
Query: 369 V 369
V
Sbjct: 367 V 367
>gi|414591190|tpg|DAA41761.1| TPA: hypothetical protein ZEAMMB73_453733 [Zea mays]
Length = 918
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/351 (56%), Positives = 246/351 (70%), Gaps = 18/351 (5%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
+AI + + AE A ++ +V PT +E RR+ VI Y+ RLI + LGCEVF FGSVPL+
Sbjct: 65 SAIRRDAVRVAEAAAGEVLLRVHPTREAERRRQDVIAYLTRLIGSSLGCEVFAFGSVPLR 124
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGD+D+T G + L +DV S+L+ E +N AE + I AEVKL+KC++
Sbjct: 125 TYLPDGDVDITVLGNTWLNSTLIDDVRSMLQSEQENCDAELKLTGLHFIDAEVKLIKCVI 184
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
+NI+VD+SFNQ+GG+ST CFLE VDR +GK+HLFKRSI+L KAWCY+ESRILGAHHGLIS
Sbjct: 185 ENIIVDVSFNQIGGVSTFCFLELVDRQVGKNHLFKRSIMLTKAWCYHESRILGAHHGLIS 244
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALETL VLYKFL+YFSKFDWD Y ISLNGPV +SSLP + VE
Sbjct: 245 TYALETLVLYIFNMFHKSLHGPLEVLYKFLEYFSKFDWDRYGISLNGPVDLSSLPSLTVE 304
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
P G+LLL +F + +++ V FD F K LNIVDPLK NNNLGRSVSK
Sbjct: 305 -PTEVQGELLLGKDFHQGSLDRLVVIPNEFDGCDTQFRQKFLNIVDPLKANNNLGRSVSK 363
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
NFYRIRSAF++GA+KLG IL P E + DE+ FFSNTL RHG G+R D+
Sbjct: 364 ANFYRIRSAFSFGAQKLGQILLLPSEYICDEIYGFFSNTLKRHGKGERLDI 414
>gi|326517667|dbj|BAK03752.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 334
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/303 (65%), Positives = 229/303 (75%), Gaps = 17/303 (5%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
++I A W+ E+A ++ ++QP+V SE+RR AV+ YVQRLIR +GCEVFPFGSVPLK
Sbjct: 28 SSISAGAWRPFEDAAAAVVGRIQPSVSSEDRRAAVVHYVQRLIRCSVGCEVFPFGSVPLK 87
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLTAFG + +E LAN+V +VLE E+ K AEF VKD Q I AEVKLVKCLV
Sbjct: 88 TYLPDGDIDLTAFGSASSDENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCLV 147
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
QNIVVDISFNQ+GGL TLCFLEQVD GK HLFK+SI+LIKAWCYYESRILGAHHGLIS
Sbjct: 148 QNIVVDISFNQIGGLCTLCFLEQVDERFGKKHLFKKSIMLIKAWCYYESRILGAHHGLIS 207
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYALE L VLY+FLDY+SKFDWD+ ISL GPV +SSLPE+V +
Sbjct: 208 TYALEILVLYIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVPLSSLPELVSD 267
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
TP+ D L EFLKE + F+VP R F+ N+R F K LNIVDPLK+NNNLGRSVSK
Sbjct: 268 TPDTHDVDFLKREEFLKEFAQMFTVPPRSFERNNRLFLRKFLNIVDPLKQNNNLGRSVSK 327
Query: 315 GNF 317
G F
Sbjct: 328 GFF 330
>gi|356570171|ref|XP_003553264.1| PREDICTED: uncharacterized protein LOC100797780 [Glycine max]
Length = 644
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 201/362 (55%), Positives = 245/362 (67%), Gaps = 21/362 (5%)
Query: 25 SSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFP 84
SS+PS +I E WQ AEE Q I+ +QP V+SE RK VIDYVQRLIR Y G EV P
Sbjct: 10 SSMPSQLLSIDKELWQMAEERAQEILWTIQPNVLSEVNRKDVIDYVQRLIRGYYGAEVLP 69
Query: 85 FGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEV 144
FGSVPLKTYLPDGDIDLTA + EE LA VC VL+ D E+ VKD + IRA+V
Sbjct: 70 FGSVPLKTYLPDGDIDLTALSHEDAEEDLAQAVCYVLQSGDD---PEYQVKDIKYIRAQV 126
Query: 145 KLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG 204
+LVKC V+NI VDISFNQ+ G+ TL FLEQVD+L+GK+H+FKRSIILIKAWCYYESR+LG
Sbjct: 127 RLVKCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFKRSIILIKAWCYYESRLLG 186
Query: 205 AHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISS 247
HHGL+STYA+E L VLY FLDY+ FDWD +S+ GP +SS
Sbjct: 187 GHHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYVSIWGPKPLSS 246
Query: 248 LPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNN 307
PE + ET E G+ LL EFL+ S PSR T + FP K +NI+DPL+ +NN
Sbjct: 247 FPE-IAETLECDHGEFLLQKEFLRNYRNMCSFPSRATKTMTHEFPVKFMNILDPLRNDNN 305
Query: 308 LGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
LGRSV+ + +R R A +YGAR+L IL+ P E++ L KFF +TLDR+G G+R DV
Sbjct: 306 LGRSVNIASLHRFRFALSYGARRLKQILTLPGETMGAALEKFFFSTLDRNGKGERADVDV 365
Query: 368 PV 369
PV
Sbjct: 366 PV 367
>gi|326490774|dbj|BAJ90054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1030
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/341 (57%), Positives = 236/341 (69%), Gaps = 18/341 (5%)
Query: 54 QPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEAL 113
QPT S+ RR V+D+ +R++ LGCEVF FGSVPLKTYLPDGDIDLT G + L
Sbjct: 43 QPTQASDRRRAEVVDHARRIVGTALGCEVFVFGSVPLKTYLPDGDIDLTVIGNTSCGSTL 102
Query: 114 ANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLE 173
+DV +LE ++N AEF VKD + I AEV+L+KC + NI+VDISFNQ GG+ + FLE
Sbjct: 103 IDDVYHILESGEENGDAEFEVKDLEHIDAEVRLIKCTIGNIIVDISFNQTGGICAVSFLE 162
Query: 174 QVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL--------------- 218
VDR +GK+HLFKRSIILIK WCYYESR+LGAHHGLISTYALETL
Sbjct: 163 LVDRKVGKNHLFKRSIILIKGWCYYESRLLGAHHGLISTYALETLILYVFNLFHKSLHGP 222
Query: 219 --VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ 276
VLY+FL+YFSKFDWD YCISLNGPV +SSLP ++VE G DLL EFL VE+
Sbjct: 223 LEVLYRFLEYFSKFDWDKYCISLNGPVALSSLPNLIVEGLNVPGDDLLFDREFLDNSVEK 282
Query: 277 FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
S P R D F K LNI+DPLKE NNLGRSV++ NF+RIR+AF++GARKLG IL
Sbjct: 283 ASAPPRNSDARCSKFRVKCLNIIDPLKECNNLGRSVNRANFHRIRTAFSFGARKLGQILM 342
Query: 337 QPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGF 377
P E + D++ FF NTL+R+ +G R D+ D V Y F
Sbjct: 343 LPPELIPDDIFAFFKNTLERNENGVRSDI-DHVGAFHYQPF 382
>gi|297745772|emb|CBI15828.3| unnamed protein product [Vitis vinifera]
Length = 929
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 256/563 (45%), Positives = 320/563 (56%), Gaps = 62/563 (11%)
Query: 175 VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY------------- 221
VDRLIGKDHLFKRSIILIK+WCYYESRILGAHHGLISTYALE LVLY
Sbjct: 251 VDRLIGKDHLFKRSIILIKSWCYYESRILGAHHGLISTYALEILVLYIFHLFHLSLDGPL 310
Query: 222 ----KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQF 277
+FLDYFSKFDWD+YCISLNGPV SSLP++V E PEN DLLLS EFL+ CV+ F
Sbjct: 311 AVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELPENGQDDLLLSEEFLRNCVDMF 370
Query: 278 SVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQ 337
SVP RG +TNSR+FP KHLNI+DPL+ENNNLGRSV+KGNFYRIRSAF YG+ KLG ILS
Sbjct: 371 SVPFRGLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGNFYRIRSAFKYGSHKLGQILSL 430
Query: 338 PEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYE 397
P E + DEL+ FF++TL+RH S ++Q+ G SS+ SGTE+C ED+ I+
Sbjct: 431 PREVIQDELKNFFASTLERHRSKYMAEIQNSALTFGSRGSSSSSSSSGTEICSEDE-IFL 489
Query: 398 SEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSGD 457
+ +S IT RIDDE G + +S M+SS +G AVS LSGD
Sbjct: 490 TSLDSDKIT---RIDDETSSMGVLSSPSLSEMDSSI-----------DGNAVSGYCLSGD 535
Query: 458 AKDLATSKNLNLVISNETSKCSSLSGE-----ESKARHAPHLYFSSSTMGNGEIRNGNSE 512
+K+ A+ +L I+ + S +G K+ H LY SS + NG
Sbjct: 536 SKESASCGFHDLRITEDMSDSLPPTGNLGRSLSVKSHHGHRLYISSLFIENG-------- 587
Query: 513 WKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDVNHGASSPVESNHHPSLMST 572
L AE +V + ++L EN N SS H S+ S
Sbjct: 588 ---SLCPKMAESSVID--------DASIVLQQESKENHFVANTSFSSHSYHEGHNSIGSI 636
Query: 573 IPWST----EEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHAL 628
I T E ++ G + GS + +L DLSGDY+SH SL + Y HAL
Sbjct: 637 ISRPTANISENTALAFRGRDFAC-NAGSLGSLETLLDLSGDYDSHIRSLQYGQCCYGHAL 695
Query: 629 NSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSANGAVPRPLFYPMTPPMLPGA 688
P P SQ Q WD +++ L F +N+ QM +NG + F P+ P
Sbjct: 696 PPPLLPSPPLSPSQLQINTPWDKVRQHLQFTQNLHSQMDSNGVILGNHF-PVKHPARSIT 754
Query: 689 SFGMEEMPKHRGTGTYFPNTVYL 711
+FG+E+ K RGTGTYFPN +L
Sbjct: 755 AFGLEDKQKPRGTGTYFPNMSHL 777
Score = 206 bits (524), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 114/175 (65%), Positives = 131/175 (74%), Gaps = 1/175 (0%)
Query: 1 MGDLRDWSP-EPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MGDL+ SP PNG V S S SS P +I + W AE ATQ I+A++QPT+ S
Sbjct: 1 MGDLKLPSPFLPNGVVSYRGASRSLSSSPPLPASIAGDSWAAAERATQEIVAKMQPTLGS 60
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCS 119
R+ VIDYVQRLI LGCEVFP+GSVPLKTYL DGDIDLTA NVEEALA+DV +
Sbjct: 61 MRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGDIDLTALCSSNVEEALASDVHA 120
Query: 120 VLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQ 174
VL+ E+QN+ AEF VKD Q I AEVKLVKCLV++IV+DISFNQLGGLSTLCFLEQ
Sbjct: 121 VLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVIDISFNQLGGLSTLCFLEQ 175
>gi|356507300|ref|XP_003522406.1| PREDICTED: uncharacterized protein LOC100813790 [Glycine max]
Length = 692
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/370 (53%), Positives = 251/370 (67%), Gaps = 21/370 (5%)
Query: 17 GERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN 76
G+R + SS+PS +I E WQ AE+ Q I+ +QP V+SE RK VIDYVQRLIR+
Sbjct: 3 GKRENLLPSSLPSQLLSIDEELWQMAEDRVQEILWTIQPNVLSEVNRKDVIDYVQRLIRD 62
Query: 77 YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKD 136
Y G EV PFGSVPLKTYLPDGD+DLT + E+ LA +C+VL+ D +E+ VKD
Sbjct: 63 YYGAEVLPFGSVPLKTYLPDGDVDLTTLIHEDAEDDLAQAICNVLKSGDD---SEYQVKD 119
Query: 137 AQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWC 196
Q IRA+V+LVKC V+NI VDISFNQ+ G+ TL FLEQVD+L+GK+H+FKRSIILIK WC
Sbjct: 120 IQYIRAQVRLVKCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKGWC 179
Query: 197 YYESRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISL 239
YY+SR+LG HHGL+STYA+E L VLY FLDY+ FDWD IS+
Sbjct: 180 YYDSRLLGGHHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYISI 239
Query: 240 NGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIV 299
GP +SSLPE + E PE G+ LL EFL S P+ +T + FP K +NI+
Sbjct: 240 WGPKSLSSLPE-IAEAPECDQGEFLLQKEFLGNYKNMCSYPAGASETLTHEFPVKFMNIL 298
Query: 300 DPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGS 359
DPL+ +NNLGRSVS + +R+R AF+YG +KL I + P E++ L KFFS+TL+R+G
Sbjct: 299 DPLRNDNNLGRSVSIASLHRLRFAFSYGVQKLKQIFTLPGENMGAALEKFFSSTLNRNGK 358
Query: 360 GQRPDVQDPV 369
G+R DV PV
Sbjct: 359 GERADVSVPV 368
>gi|359486339|ref|XP_002274554.2| PREDICTED: uncharacterized protein LOC100253615 [Vitis vinifera]
Length = 755
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 199/357 (55%), Positives = 247/357 (69%), Gaps = 22/357 (6%)
Query: 33 AIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKT 92
++ AE W + Q I+ +QPT+VSE+RRK +IDYVQRLIR+ G EV PFGS+PLKT
Sbjct: 26 SVTAECWSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEVLPFGSMPLKT 85
Query: 93 YLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
YLPDGDIDLTA N EE A DVC++LE E Q +EF V+D IRA+VK+VKC+VQ
Sbjct: 86 YLPDGDIDLTALCPENDEEDFARDVCTLLEGERQ-MGSEFRVEDISYIRAKVKIVKCMVQ 144
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
+I VDISFNQ GGLSTLCFLEQ+D LIGKDHLFKRS+ILIKAWCYYE RILG+H GL+ST
Sbjct: 145 DISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCGLLST 204
Query: 213 YALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
YALE L VLY+FLDY+S FDW+ + +S+ GPV ISSL +
Sbjct: 205 YALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSL---LTGA 261
Query: 256 PENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
PE + LL++ EFL C E F+V R + + F KH+NI DPL++ NNLGRS+S G
Sbjct: 262 PETADKPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLG 321
Query: 316 NFYRIRSAFTYGARKLGHILSQ-PEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPL 371
N YR R A + GA++L IL PE + + L++FF+NTLDR+G GQ D D VP
Sbjct: 322 NSYRFRYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGADEGDLVPF 378
>gi|414888115|tpg|DAA64129.1| TPA: hypothetical protein ZEAMMB73_121752 [Zea mays]
Length = 942
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 192/334 (57%), Positives = 239/334 (71%), Gaps = 21/334 (6%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ +V PT +E RR+ VI Y++RLI + LGCEVF FGSVPL+TYLPDGD+D+T G
Sbjct: 70 VVLRVHPTREAERRRQDVIAYLRRLIGSCLGCEVFAFGSVPLRTYLPDGDVDITVLGNTW 129
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
+ +DV S+L+ E +N AEF + Q I AEVKL+KC+++NI+VD+SFNQ+GG+ST
Sbjct: 130 LNSTFIDDVRSMLQSEQENCDAEFKLTGLQFINAEVKLIKCVIENIIVDVSFNQIGGVST 189
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV--------- 219
CFLE VDR IG++HLFKRSI+LIKAWCY+ESRILGAHHGLISTYALETLV
Sbjct: 190 FCFLELVDRQIGQNHLFKRSIMLIKAWCYHESRILGAHHGLISTYALETLVLYIFNMFHK 249
Query: 220 --------LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
LY+FL+YFSKFDWD Y ISLNG V +SSL VE P + G+ LL E +
Sbjct: 250 SLHGPLEALYRFLEYFSKFDWDRYGISLNGQVDLSSL---TVE-PTDVQGESLLGKELQQ 305
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
+++ V FD F K LNI+DPLK NNNLGRSVSK NFYRIRSAF++GA+KL
Sbjct: 306 GYLDRLVVIPNEFDGCGTQFRQKFLNIIDPLKANNNLGRSVSKANFYRIRSAFSFGAQKL 365
Query: 332 GHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
G IL P E + DE+ FF+NTL RHG+G+RPDV
Sbjct: 366 GQILLLPSEYIRDEIYGFFANTLKRHGNGERPDV 399
>gi|356570173|ref|XP_003553265.1| PREDICTED: uncharacterized protein LOC100798838 [Glycine max]
Length = 626
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 196/362 (54%), Positives = 250/362 (69%), Gaps = 21/362 (5%)
Query: 25 SSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFP 84
SS+PS +I E W+ EE Q I+ +QP V+SE RK +IDYVQRLI Y G +VFP
Sbjct: 11 SSLPSQLLSIDEELWRMIEERAQEILWTIQPNVLSEVNRKNIIDYVQRLIGEYCGAQVFP 70
Query: 85 FGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEV 144
FGS PLKTYLPDGDIDLTA + EE L VC++L+ ED +E+ VKD + IRA+V
Sbjct: 71 FGSFPLKTYLPDGDIDLTALSHEDEEEDLVRAVCNILKSEDD---SEYQVKDIEHIRAQV 127
Query: 145 KLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG 204
++VKC V+NI VDISFNQ+ GL TL FLEQVD+L+GK+H+FKRS+ILIK+WCYYESRILG
Sbjct: 128 QVVKCTVKNIPVDISFNQMAGLYTLFFLEQVDQLVGKNHIFKRSVILIKSWCYYESRILG 187
Query: 205 AHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISS 247
AH GL+STYA E L VLY FLDY+S FDW+ IS+ GP +SS
Sbjct: 188 AHCGLLSTYATEILVLYIINRFHSSVRGPLAVLYVFLDYYSSFDWEHNYISIWGPKVLSS 247
Query: 248 LPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNN 307
LPE +V+TPE G+ LL EFLK + S ++ +T + +FP KH+NI+DPL+ NNN
Sbjct: 248 LPE-IVDTPEYDQGEFLLQKEFLKNYRDMCSSKAKASETMTNAFPVKHMNILDPLRNNNN 306
Query: 308 LGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
LGRSV+ GN RIR AF+ G+++L IL+ E++ L KFF NTL+ +G G+R DV
Sbjct: 307 LGRSVNIGNLSRIRLAFSLGSQRLKQILTLAGENMGAALEKFFFNTLENNGKGERADVGV 366
Query: 368 PV 369
PV
Sbjct: 367 PV 368
>gi|356518940|ref|XP_003528133.1| PREDICTED: uncharacterized protein LOC100815787 [Glycine max]
Length = 680
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/361 (52%), Positives = 246/361 (68%), Gaps = 23/361 (6%)
Query: 26 SVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPF 85
S+PS +I E W+ AE+ Q I+ ++P V+SE RK VIDYVQRLI+ Y G +V PF
Sbjct: 12 SLPSQLVSIDEELWRMAEDRVQEILWTIEPNVLSEVNRKDVIDYVQRLIKGYYGAKVLPF 71
Query: 86 GSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVK 145
GSVPLKTYLPDGD+DLT + EE LA +C++L+ D +E+ VKD Q IRA+V+
Sbjct: 72 GSVPLKTYLPDGDVDLTTLIHEDAEEDLAQAICNILKSGDD---SEYQVKDIQYIRAQVR 128
Query: 146 LVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGA 205
LVKC V+NI VDISFNQ+ G+ TL FLEQVD+L+GK+H+FKRSIILIKAWCYY+SR+LG
Sbjct: 129 LVKCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKAWCYYDSRLLGG 188
Query: 206 HHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSL 248
H+GL+STYA+E L VLY FLDY+S FDWD +S+ GP +SSL
Sbjct: 189 HYGLLSTYAVEILVLYIINRFHSVVRGPLEVLYIFLDYYSSFDWDHNYVSIWGPKSLSSL 248
Query: 249 PEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNL 308
PE+ TPE G+ LL EFL S P+R +T + FP K +NI+DPL+ +NNL
Sbjct: 249 PEI---TPECDQGEFLLQKEFLTNYKNMCSYPTRASETLTHEFPVKFMNILDPLRNDNNL 305
Query: 309 GRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDP 368
GRSVS + +R+R AF Y A+KL I + P E++ L KFF +TL+R+G G+R DV P
Sbjct: 306 GRSVSIASLHRLRFAFAYSAQKLKQIFTLPGENMGAALEKFFFSTLERNGKGERADVGVP 365
Query: 369 V 369
V
Sbjct: 366 V 366
>gi|242082774|ref|XP_002441812.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
gi|241942505|gb|EES15650.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
Length = 546
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 205/399 (51%), Positives = 251/399 (62%), Gaps = 64/399 (16%)
Query: 30 NQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRL---------------- 73
+ ++I + W+R E A G++ ++QPTV SE R AVIDY++RL
Sbjct: 25 DPSSIPPDAWRRFESAALGVVNKIQPTVASENFRSAVIDYLKRLLGSRAGVQSWLLPFLP 84
Query: 74 -----------IRNY-----------LGCE-------VFPFGSVPLKTYLPDGDIDLTAF 104
+R+Y +GC VFPFGSVPLKTYLPDGDIDLTAF
Sbjct: 85 FHFYVFFGAKPVRDYEYKCVTVWIYFVGCALESLCDLVFPFGSVPLKTYLPDGDIDLTAF 144
Query: 105 GGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLG 164
+E LAN V ++L E K +EF VKD Q I AEVKLVKCLVQNIVVDIS NQ+G
Sbjct: 145 SPAISDENLANQVYAILSSEQHRKDSEFDVKDVQYIHAEVKLVKCLVQNIVVDISVNQIG 204
Query: 165 GLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL------ 218
GLSTLCFLE+VD GK HL KRSI+LIK WCYYESRILGA +GL+STYALE L
Sbjct: 205 GLSTLCFLEKVDENFGKKHLLKRSIVLIKDWCYYESRILGAQNGLLSTYALEVLVLYVFL 264
Query: 219 -----------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE--TPENSGGDLLL 265
VLY+FLD++SKFDWDS ISL GPV +SSLP +V + P +
Sbjct: 265 IFHRSLGGPLAVLYRFLDFYSKFDWDSKGISLFGPVSLSSLPNLVTDPHLPAIDDDFFVP 324
Query: 266 SSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT 325
+ L++ E FS P R + +++ F K LNIVDPLK++NNLGRSV+KGNFYRIRSAF
Sbjct: 325 REKILRKYAEDFSAPPRNSERDAQVFSRKFLNIVDPLKQSNNLGRSVNKGNFYRIRSAFD 384
Query: 326 YGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPD 364
+GARKLG IL P +E+ +FFSNTL R+ +G RPD
Sbjct: 385 FGARKLGKILQMPVCYTVNEVNQFFSNTLKRNHTGFRPD 423
>gi|115488182|ref|NP_001066578.1| Os12g0283100 [Oryza sativa Japonica Group]
gi|77554657|gb|ABA97453.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649085|dbj|BAF29597.1| Os12g0283100 [Oryza sativa Japonica Group]
gi|222616913|gb|EEE53045.1| hypothetical protein OsJ_35772 [Oryza sativa Japonica Group]
Length = 989
Score = 370 bits (951), Expect = e-99, Method: Compositional matrix adjust.
Identities = 188/334 (56%), Positives = 236/334 (70%), Gaps = 28/334 (8%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ +V PT +E RR+ V+ Y++RL+ LGCEV FGSVPLK+YLPDGD+D+T G
Sbjct: 56 VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGNTA 115
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
++ A +DV S+LE E+Q+ AE +K I AEVKL+KC+++NIVVDISFNQ+GG+ST
Sbjct: 116 LDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVST 175
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV--------- 219
LCFLE DR +GK+HLFKRSI+LIKAWCY+ESRILGAHHGL+STYALETLV
Sbjct: 176 LCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFHK 235
Query: 220 --------LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
LYKFL+YFSKFDWD YCISLNGPV +SSLP VE P + +LL + L
Sbjct: 236 SLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVE-PSSIQDELLFGKKTLP 294
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
E S G + N F KHLNI+DPLK +NNLGRSVS+G+FYRIR A ++GA+KL
Sbjct: 295 EV-------SDGSNIN---FCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQKL 344
Query: 332 GHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
G IL + + E+ FF+NTL RHG G+R DV
Sbjct: 345 GQILMLHSDLIPTEIFGFFANTLKRHGRGERSDV 378
>gi|218186672|gb|EEC69099.1| hypothetical protein OsI_37998 [Oryza sativa Indica Group]
Length = 989
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 188/334 (56%), Positives = 236/334 (70%), Gaps = 28/334 (8%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ +V PT +E RR+ V+ Y++RL+ LGCEV FGSVPLK+YLPDGD+D+T G
Sbjct: 56 VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGNTA 115
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
++ A +DV S+LE E+Q+ AE +K I AEVKL+KC+++NIVVDISFNQ+GG+ST
Sbjct: 116 LDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVST 175
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV--------- 219
LCFLE DR +GK+HLFKRSI+LIKAWCY+ESRILGAHHGL+STYALETLV
Sbjct: 176 LCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFHK 235
Query: 220 --------LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
LYKFL+YFSKFDWD YCISLNGPV +SSLP VE P + +LL + L
Sbjct: 236 SLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVE-PSSIQDELLFGKKTLP 294
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
E S G + N F KHLNI+DPLK +NNLGRSVS+G+FYRIR A ++GA+KL
Sbjct: 295 EV-------SDGSNIN---FCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQKL 344
Query: 332 GHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
G IL + + E+ FF+NTL RHG G+R DV
Sbjct: 345 GQILMLHSDLIPTEIFGFFANTLKRHGRGERSDV 378
>gi|297823987|ref|XP_002879876.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
lyrata]
gi|297325715|gb|EFH56135.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
lyrata]
Length = 516
Score = 365 bits (936), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 181/353 (51%), Positives = 239/353 (67%), Gaps = 23/353 (6%)
Query: 34 IGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
IG E W AEE Q I+ +QP +SE R +I+++Q L+R LG EVF FGSVPLKTY
Sbjct: 28 IGEELWLIAEERAQEILFAIQPMYLSERSRNEIINHLQTLMRERLGIEVFLFGSVPLKTY 87
Query: 94 LPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN 153
LPDGDIDLT +EE A + ++LE E ++F V D Q I A+VK++KC ++N
Sbjct: 88 LPDGDIDLTVLTPYGMEENCAKALRNILEAE--RGESDFQVTDVQYIHAQVKVIKCTIRN 145
Query: 154 IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 213
+ +DISFNQ+ GLS LCFLEQVDR G+DHLFKRSIILIKAWC+YESRILGA++GLISTY
Sbjct: 146 VALDISFNQMAGLSALCFLEQVDRAFGRDHLFKRSIILIKAWCFYESRILGANNGLISTY 205
Query: 214 ALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETP 256
AL L VLYKF+D++ FDW++YCI++ G V ISS P++
Sbjct: 206 ALAILVLNIVNMSYSSVSGPLAVLYKFMDFYGSFDWENYCITVTGLVPISSFPDIT---- 261
Query: 257 ENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGN 316
E ++ L +F +EC+E +S P+ + N + FP KH NI+DPLK +NNLGRSVS+GN
Sbjct: 262 ETRNHEVFLDEKFFRECIESYSGPANVVEANRKYFPVKHYNILDPLKHSNNLGRSVSEGN 321
Query: 317 FYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
R+R F GA+KL +L+ P E++ +L FF N+LDR+G GQR DV++PV
Sbjct: 322 AIRLRHCFRRGAQKLRDVLTFPGETVGWKLEDFFGNSLDRNGKGQRQDVEEPV 374
>gi|297736507|emb|CBI25378.3| unnamed protein product [Vitis vinifera]
Length = 893
Score = 363 bits (931), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 199/408 (48%), Positives = 246/408 (60%), Gaps = 70/408 (17%)
Query: 33 AIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE----------- 81
++ AE W + Q I+ +QPT+VSE+RRK +IDYVQRLIR+ G E
Sbjct: 110 SVTAECWSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEREIEESSRAST 169
Query: 82 ----------------------------------------VFPFGSVPLKTYLPDGDIDL 101
V PFGS+PLKTYLPDGDIDL
Sbjct: 170 FQSGDERPLFKQAMLERRNSRSIEVLLQVYFEEPKNIKLYVLPFGSMPLKTYLPDGDIDL 229
Query: 102 TAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFN 161
TA N EE A DVC++LE E Q +EF V+D IRA+VK+VKC+VQ+I VDISFN
Sbjct: 230 TALCPENDEEDFARDVCTLLEGERQ-MGSEFRVEDISYIRAKVKIVKCMVQDISVDISFN 288
Query: 162 QLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL--- 218
Q GGLSTLCFLEQ+D LIGKDHLFKRS+ILIKAWCYYE RILG+H GL+STYALE L
Sbjct: 289 QTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCGLLSTYALEILVLY 348
Query: 219 --------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLL 264
VLY+FLDY+S FDW+ + +S+ GPV ISSL E E + LL
Sbjct: 349 VINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSLLTGAPEAAETADKPLL 408
Query: 265 LSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAF 324
++ EFL C E F+V R + + F KH+NI DPL++ NNLGRS+S GN YR R A
Sbjct: 409 INEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRFRYAI 468
Query: 325 TYGARKLGHILSQ-PEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPL 371
+ GA++L IL PE + + L++FF+NTLDR+G GQ D D VP
Sbjct: 469 SVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGADEGDLVPF 516
>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2035
Score = 361 bits (926), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 189/392 (48%), Positives = 245/392 (62%), Gaps = 60/392 (15%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEV-------------- 82
++W RAE T +I ++PT +SEERR AV +V+RLIR+ CEV
Sbjct: 585 DWWTRAEGQTAELIDSLKPTRLSEERRTAVTGFVERLIRDRFECEVSALPHELNGFIVRS 644
Query: 83 ----------FPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEF 132
FGSVPLKTYLPDGDIDL F +++E A DV L++ + + AEF
Sbjct: 645 SAGAVRYSAVIRFGSVPLKTYLPDGDIDLYIFARNDLKETWAQDVLKALKQAEDDADAEF 704
Query: 133 VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILI 192
VK+ Q I+AEVKL+KCLV+NIVVDISFNQ+GGLSTLCFLE+VD +G +HLFKRS+IL+
Sbjct: 705 RVKEVQYIQAEVKLIKCLVENIVVDISFNQIGGLSTLCFLERVDEEVGLNHLFKRSVILV 764
Query: 193 KAWCYYESRILGAHHGLISTYALETL-------------------VLYKFLDYFSKFDWD 233
KAWCYYESRILGAHHGLIST+ALETL VLY FL YF FDWD
Sbjct: 765 KAWCYYESRILGAHHGLISTFALETLVLYIFHVFHSMRSLHGPLEVLYLFLTYFCNFDWD 824
Query: 234 SYCISLNGPVRISSLPE-------------VVVETPENSGGDLLLSSEFLKECVEQFSVP 280
YC+S+ GPV + +P+ V +P GG L S EF++EC+ ++S
Sbjct: 825 QYCLSIWGPVPLDHIPKNSSELSQKDGGWRTVARSPWEVGGKLYFSEEFIEECINRYSDV 884
Query: 281 SRGFDTNS-RSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE 339
G +++ R F PK+LN++DP++ NNLGRSV+ G+F RIRSAF GAR LG + P+
Sbjct: 885 RAGSESSQGRIFNPKYLNVLDPIRHTNNLGRSVNVGSFKRIRSAFGLGARTLGEVFECPK 944
Query: 340 ESLTDELRKFFSNT---LDRHGSGQRPDVQDP 368
+ +T++ + FFS T L R+ S RPD P
Sbjct: 945 DQITEKFKSFFSCTFKSLGRYRSAGRPDSGIP 976
>gi|357491471|ref|XP_003616023.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
gi|355517358|gb|AES98981.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
Length = 387
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 203/389 (52%), Positives = 238/389 (61%), Gaps = 89/389 (22%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPS-------NQTAIGAEYWQRAEEATQGIIAQV 53
MGDL NG VF E SSS + +++ E AE+ T I+ ++
Sbjct: 1 MGDL-----HLNGVVFAEDRPYSSSPPSPPLPVFNPDPSSVTDEASSAAEQTTAEILRRI 55
Query: 54 QPTVVSEERRKAVIDYVQRLIRNYLGCE---------------------VFPFGSVPLKT 92
QPT+ ++ RR+ V+DYVQRLIR CE VFP+GSVPLKT
Sbjct: 56 QPTLAADRRRREVVDYVQRLIRYGARCEKLLPNVWRKLDFEVRIFRIGKVFPYGSVPLKT 115
Query: 93 YLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ 152
YLPDGDIDLTA N+E+ L +DV +VL E+ N AAE+ VKD + I AE
Sbjct: 116 YLPDGDIDLTALSPQNIEDGLVSDVHAVLRGEENNDAAEYEVKDVRFIDAE--------- 166
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
NIVVDISFNQLGGLSTLCFLE+VDRL+ KDH+FKRSIILIKAWCYYESRILGAHHGLIST
Sbjct: 167 NIVVDISFNQLGGLSTLCFLEKVDRLVAKDHIFKRSIILIKAWCYYESRILGAHHGLIST 226
Query: 213 YALETL----------------------------------------------VLYKFLDY 226
YALETL VLY+FLDY
Sbjct: 227 YALETLVLYIFHRFHVSLDGPLAEKERKRNLNHIMLVMHPFNKHFMHPALFQVLYRFLDY 286
Query: 227 FSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDT 286
FSKFDWD+YC+SL GPV SS P+VV E EN GG+ LL+ EF++ CVE FSVP RG D
Sbjct: 287 FSKFDWDNYCVSLKGPVAKSSPPDVVAEALEN-GGNTLLTDEFIRSCVESFSVPPRGLDL 345
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
N R+FP KHLNI+DPLKENNNLGRSV+KG
Sbjct: 346 NLRAFPHKHLNIIDPLKENNNLGRSVNKG 374
>gi|356518706|ref|XP_003528019.1| PREDICTED: uncharacterized protein LOC100788864 [Glycine max]
Length = 721
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 179/362 (49%), Positives = 243/362 (67%), Gaps = 21/362 (5%)
Query: 25 SSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFP 84
SS+P +I E W+ EE Q I+ +QP V+SE RK V++YVQ+LI +Y +VFP
Sbjct: 11 SSLPRQLLSIDEELWRMTEERIQEILWTIQPNVLSEMNRKNVLNYVQKLIGDYYDTKVFP 70
Query: 85 FGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEV 144
FGS PLKTYLPDGDIDLT + EE LA ++C++LE + + VKD + IRA+V
Sbjct: 71 FGSFPLKTYLPDGDIDLTVINHEDEEENLAKEICTILECAND---LIYQVKDIEHIRAQV 127
Query: 145 KLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG 204
++VKC V+NI +DI+FNQ+ GL TLCFLEQVD+L GK+H+FKRSIILIKAWC Y+SR+LG
Sbjct: 128 QVVKCTVKNIPIDITFNQMTGLCTLCFLEQVDQLAGKNHIFKRSIILIKAWCCYDSRLLG 187
Query: 205 AHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISS 247
+ HGL+STYA E L VLY F DY+ FDW+ +S+ GP +SS
Sbjct: 188 SQHGLLSTYATEVLVLYIINRFHASVRDPLEVLYIFFDYYGTFDWEHNYMSIWGPKALSS 247
Query: 248 LPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNN 307
LPE +V+ PE + LL EFL + FS ++ +T + +FP KH+NI+DPL+ +NN
Sbjct: 248 LPE-IVDRPECDQDEFLLHKEFLINYRDIFSSKAKSSETTTNTFPVKHINILDPLRNDNN 306
Query: 308 LGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
LGRSV++ +F+RIR A +YGA+K I + E++ + L KFF +TL R+G G+R DV
Sbjct: 307 LGRSVNEASFHRIRFALSYGAKKFKQIFTLAGENMGEALEKFFFDTLQRNGKGERADVDV 366
Query: 368 PV 369
PV
Sbjct: 367 PV 368
>gi|218200261|gb|EEC82688.1| hypothetical protein OsI_27344 [Oryza sativa Indica Group]
Length = 1001
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 178/334 (53%), Positives = 225/334 (67%), Gaps = 21/334 (6%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ +VQPT +E R+ +I Y++ L LGCEVF FGSVPLKTYLPDGDID+T G
Sbjct: 55 VVLRVQPTEEAERTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNTA 114
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
+ ++V +LE E+Q A+ + Q I AEVKL+KC++ NIVVDISFNQ+GG++T
Sbjct: 115 PDSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGVTT 174
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL---------- 218
LC LE VD +G DHLFKRSI+LIKAWCY+ES ILGAH GLISTYALE L
Sbjct: 175 LCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIFHK 234
Query: 219 -------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
VLYKFL+YFSKFDWD YCISLNGPV +SSLP + VE P +LL
Sbjct: 235 SLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNLTVE-PSGIHDELLFGP---N 290
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
++ V + D ++ +F PK+LNI+DP+K +NNLGRSVSKG+FYRIR AF++GA+ L
Sbjct: 291 GSCDRLIVLKKDSDGSNMNFRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGAQNL 350
Query: 332 GHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
IL P + + E+ FF NTL HG G+R DV
Sbjct: 351 SQILMLPTDLIPTEIFGFFVNTLKSHGRGKRSDV 384
>gi|222637691|gb|EEE67823.1| hypothetical protein OsJ_25591 [Oryza sativa Japonica Group]
Length = 1001
Score = 355 bits (911), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 177/334 (52%), Positives = 225/334 (67%), Gaps = 21/334 (6%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ +VQPT ++ R+ +I Y++ L LGCEVF FGSVPLKTYLPDGDID+T G
Sbjct: 55 VVLRVQPTEEADRTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNTA 114
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
+ ++V +LE E+Q A+ + Q I AEVKL+KC++ NIVVDISFNQ+GG++T
Sbjct: 115 PDSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGVTT 174
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL---------- 218
LC LE VD +G DHLFKRSI+LIKAWCY+ES ILGAH GLISTYALE L
Sbjct: 175 LCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIFHK 234
Query: 219 -------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
VLYKFL+YFSKFDWD YCISLNGPV +SSLP + VE P +LL
Sbjct: 235 SLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNLTVE-PSGIHDELLFGP---N 290
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
++ V + D ++ +F PK+LNI+DP+K +NNLGRSVSKG+FYRIR AF++GA+ L
Sbjct: 291 GSCDRLIVLKKDSDGSNMNFRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGAQNL 350
Query: 332 GHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
IL P + + E+ FF NTL HG G+R DV
Sbjct: 351 SQILMLPTDLIPTEIFGFFVNTLKSHGRGKRSDV 384
>gi|413924678|gb|AFW64610.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 474
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 191/374 (51%), Positives = 242/374 (64%), Gaps = 28/374 (7%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
M ++ + SP P A P SSS P + W+R E AT ++ ++ PTV S+
Sbjct: 44 MVNIHERSPVP--ACVPAHPDPSSSISPDD--------WRRLEGATFSVMCKIHPTVSSQ 93
Query: 61 ERRKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCS 119
R VIDYVQRL R ++ G +V FGSVPLKTYLPDGDIDLT +E L N+VC+
Sbjct: 94 HLRARVIDYVQRLFRLHHDGYQVISFGSVPLKTYLPDGDIDLTLLCAAISDENLENEVCA 153
Query: 120 VLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLI 179
+L+ E+Q K +EF VKD + + AEVKLVKC VQNI VDIS NQ+GG + + FLE+VD+ +
Sbjct: 154 ILKSEEQRKDSEFEVKDVKYVPAEVKLVKCKVQNIAVDISVNQIGGPNKVYFLEKVDQNL 213
Query: 180 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYK 222
GK++L +RSI+LIK WCYYES ILGA GL+STYALETL VLY+
Sbjct: 214 GKNNLLRRSIMLIKHWCYYESCILGAQRGLVSTYALETLVLYIFHVFHKSLDGPLAVLYR 273
Query: 223 FLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSR 282
FLDY+SKFDWD+ ISL GP+ +SSLPE+V E P L FLK+C + FSVP
Sbjct: 274 FLDYYSKFDWDNKGISLFGPISLSSLPELVTEPPYTRDDGFLSREAFLKDCAKAFSVPPI 333
Query: 283 GFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESL 342
+ N + F K +NIVDPLK++NNLGRS+SKGN RIR F +GA KLG IL P
Sbjct: 334 NSEENPQVFSKKFVNIVDPLKQSNNLGRSISKGNLGRIRKEFYFGACKLGKILQAPACFS 393
Query: 343 TDELRKFFSNTLDR 356
+E+ +FF NTL R
Sbjct: 394 ANEINRFFRNTLSR 407
>gi|147867191|emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]
Length = 1388
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 189/387 (48%), Positives = 242/387 (62%), Gaps = 55/387 (14%)
Query: 1 MGDLRDWSPEPNG-AVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG W+ +PNG + G P+ ++S A+ E AEE T+ +IA +QP S
Sbjct: 1 MGGHEGWA-QPNGFSPNGLLPNEAASVT----RALDQERLSLAEERTKQLIACIQPNQPS 55
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVC 118
EERR+AV YV+ LI C+VFPFGSVPLKTYLPDGDIDLTAF N+++ AN+V
Sbjct: 56 EERREAVASYVKSLIMKCFSCKVFPFGSVPLKTYLPDGDIDLTAFSKSPNLKDTWANEVR 115
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
+LERE+++ AEF VK+ Q I+AEV D L
Sbjct: 116 DILEREEKSGDAEFRVKEVQYIQAEV-------------------------------DHL 144
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY----------------- 221
I + HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY
Sbjct: 145 ISQKHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRVFNNSFAGPLEVLY 204
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+FL++FSKFDW++YC+SL GPV ISSLP+V + P G+LLLS FL C ++V
Sbjct: 205 RFLEFFSKFDWENYCVSLWGPVPISSLPDVTADPPRKDSGELLLSKLFLDACSSVYAVLP 264
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 341
G + + F K+ N++DPL+ NNNLGRSVSKGNF+RIRSAF +GA++L +L P+++
Sbjct: 265 VGQENPEQPFISKYFNVIDPLRTNNNLGRSVSKGNFFRIRSAFAFGAQRLARLLDCPKDN 324
Query: 342 LTDELRKFFSNTLDRHGSGQRPDVQDP 368
+ E+ +FF NT +RHG G RPD P
Sbjct: 325 VIAEVNQFFMNTWERHGKGDRPDAPSP 351
>gi|21805733|gb|AAM76764.1| hypothetical protein [Arabidopsis thaliana]
Length = 502
Score = 342 bits (877), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 233/355 (65%), Gaps = 23/355 (6%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
T I AE W AE Q I+ VQP ++E R +I +Q L+ LG EV+ FGS+PLK
Sbjct: 26 TPIEAEVWLIAEARAQEILCAVQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLK 85
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLT EE A VC VLE E N ++ V Q ++A+VK++KC +
Sbjct: 86 TYLPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGN--SDLQVTGVQYVQAKVKVIKCSI 143
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
+++ DISFNQL GL LCFLEQVD+ G+DHLFK+SIIL+KAWC+YESRILGA+ GLIS
Sbjct: 144 RDVAFDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLIS 203
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYAL L VLYKF++Y+ FDW +YC+++ GPV ISSLP++
Sbjct: 204 TYALAILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDIT-- 261
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
E ++ L +F +EC+E +S + + + + FP K+ NI+DPLK +NNLGRSV+K
Sbjct: 262 --ETGNHEVFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTK 319
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
GN R+R+ F G +KL +L+ P E++ +L KFF+ +L+R+G GQR DV++PV
Sbjct: 320 GNMVRLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPV 374
>gi|30688308|ref|NP_850331.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|145330711|ref|NP_001078031.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|186506897|ref|NP_001118485.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|186506900|ref|NP_001118486.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|60547743|gb|AAX23835.1| hypothetical protein At2g40520 [Arabidopsis thaliana]
gi|330254746|gb|AEC09840.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254747|gb|AEC09841.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254748|gb|AEC09842.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254749|gb|AEC09843.1| nucleotidyltransferase protein [Arabidopsis thaliana]
Length = 502
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 170/355 (47%), Positives = 233/355 (65%), Gaps = 23/355 (6%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLK 91
T I AE W AE Q I+ +QP ++E R +I +Q L+ LG EV+ FGS+PLK
Sbjct: 26 TPIEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLK 85
Query: 92 TYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
TYLPDGDIDLT EE A VC VLE E N ++ V Q ++A+VK++KC +
Sbjct: 86 TYLPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGN--SDLQVTGVQYVQAKVKVIKCSI 143
Query: 152 QNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
+++ DISFNQL GL LCFLEQVD+ G+DHLFK+SIIL+KAWC+YESRILGA+ GLIS
Sbjct: 144 RDVAFDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLIS 203
Query: 212 TYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
TYAL L VLYKF++Y+ FDW +YC+++ GPV ISSLP++
Sbjct: 204 TYALAILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDIT-- 261
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
E ++ L +F +EC+E +S + + + + FP K+ NI+DPLK +NNLGRSV+K
Sbjct: 262 --ETGNHEVFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTK 319
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
GN R+R+ F G +KL +L+ P E++ +L KFF+ +L+R+G GQR DV++PV
Sbjct: 320 GNMVRLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPV 374
>gi|326521958|dbj|BAK04107.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1031
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 170/339 (50%), Positives = 229/339 (67%), Gaps = 20/339 (5%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
++ ++ PT +E RR +IDY + LI GCEVF FGSVPLKTYLPDGD+D+T +N
Sbjct: 157 VLLRLHPTEEAERRRHKIIDYAKNLIGTTFGCEVFAFGSVPLKTYLPDGDVDITILTNVN 216
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLST 168
++ DVC +L E N+AAEF +K+ Q+I A+VK++KC++ N+V+DISFNQ+GG+ST
Sbjct: 217 LDNNFVQDVCCLLAAEQSNEAAEFALKEIQVINAKVKIIKCVIDNLVMDISFNQVGGVST 276
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV--------- 219
LCFLE ++ IGKDHLFKRSIILIKAWCY+E I G++H L+STYALE L+
Sbjct: 277 LCFLEMANKEIGKDHLFKRSIILIKAWCYHEGSIHGSNHWLMSTYALEVLILYIFNLFHT 336
Query: 220 --------LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
LYKFL+Y+SKFDWD+ C++LNGPV +SSL P S +LLLS E L+
Sbjct: 337 VLHGPLQALYKFLEYYSKFDWDNQCLTLNGPVPLSSL-RNYTAGPTGSNEELLLSKEPLE 395
Query: 272 ECVEQ-FSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
+ + F +P+ G D F K+LNI+DPLK NNLG S+S+ N IR AF GA K
Sbjct: 396 PSLRRLFDLPA-GSDGRGPEFRLKYLNIIDPLKGGNNLGTSISEANSRVIRDAFAAGAEK 454
Query: 331 LGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
LG IL P E + +++ FF++TL +HG G+R D+ + V
Sbjct: 455 LGQILKLPCELIAEQVYVFFTHTLGKHGRGERQDLGESV 493
>gi|147820621|emb|CAN67650.1| hypothetical protein VITISV_005081 [Vitis vinifera]
Length = 1572
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 171/303 (56%), Positives = 207/303 (68%), Gaps = 46/303 (15%)
Query: 81 EVFPFGSVPLKTYLPDGDIDLTAF-GGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQL 139
VF FGSVPLKTYLPDGDIDLTAF N+++ AN
Sbjct: 227 RVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWAN------------------------ 262
Query: 140 IRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYE 199
+VK++KCLV+NIVVDISFNQLGGL TLCFLE+VD LI ++HLFKRSIILIKAWCYYE
Sbjct: 263 ---QVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYE 319
Query: 200 SRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGP 242
SRILGAHHGLISTYALETL VLY+FL++FS FDWD++C+SL GP
Sbjct: 320 SRILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGP 379
Query: 243 VRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPL 302
V ISSLP+V E P G+LLLS FL C ++V G + +SF KH N++DPL
Sbjct: 380 VPISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPL 439
Query: 303 KENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQR 362
+ NNNLGRSVSKGNF+RIRSAF +GA++L +L P+E++ E+ + F NT +RHGSG R
Sbjct: 440 RVNNNLGRSVSKGNFFRIRSAFAFGAKRLARLL-DPKENIIFEVNQLFMNTWERHGSGHR 498
Query: 363 PDV 365
PD
Sbjct: 499 PDT 501
>gi|357463851|ref|XP_003602207.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
gi|355491255|gb|AES72458.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
Length = 768
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 198/492 (40%), Positives = 276/492 (56%), Gaps = 69/492 (14%)
Query: 39 WQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGD 98
W + E+ T ++ ++P SE R ++ Y++ LI +++ +VF FGSVPLKTYL DGD
Sbjct: 27 WSQVEDRTIELLQFLEPNPKSETLRNNIVSYIKGLIISHVPVKVFEFGSVPLKTYLRDGD 86
Query: 99 IDLTAFGGLNV-EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVD 157
IDLT FG + E + +LE E N+ ++F VK+ QL+ AEVK++KCLV+ V+D
Sbjct: 87 IDLTIFGNNELFPEIFIPHIQQILESEMNNEFSKFRVKEVQLVNAEVKIIKCLVEKFVID 146
Query: 158 ISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 217
ISFNQL GL +LCFL++VD LI ++H+FKRS+ILIKAWCY+ESR+LG+ GL STYALE
Sbjct: 147 ISFNQLSGLCSLCFLDEVDYLISRNHIFKRSVILIKAWCYHESRLLGSKSGLFSTYALEI 206
Query: 218 LVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSG 260
LVLY +FL++FSKFDW +YCISL+GPV + SLP + + P
Sbjct: 207 LVLYLFNLYNNEFVGPLEVLFRFLEFFSKFDWGNYCISLSGPVPLDSLPNMTADCPRKDR 266
Query: 261 GDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRI 320
DLLL+ FL +F R + F KH+NI+DPL+ENNNLG S+S+GNF+RI
Sbjct: 267 QDLLLTESFL--IASKFCYGWRNQKNREKHFVSKHINIIDPLQENNNLGHSISRGNFFRI 324
Query: 321 RSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVS 380
+SA YGA ++ IL +E L E FF NT +RHG+G + +S YN +
Sbjct: 325 KSAIAYGAEQMMRILDCTDEYLISEFDHFFENTWNRHGNGSW------IRVSIYN-LDIR 377
Query: 381 STFSGTELCREDQTIYESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEP 440
G C+E +DE +L +K G+ Y ++ N+
Sbjct: 378 DKKVGKLTCQE-------------------FEDELDL----ASLKSQGI---YQKSDNQL 411
Query: 441 HNSGNGTAVSETR-------LSGDAKDLATSKNLNLVISNETSKCSSLSGEESKARHAPH 493
+ + VS TR LS D K +SK L N++ CSS HA H
Sbjct: 412 EELKDASVVSHTRSSRTDDMLSCDRKH-TSSKKKALTDKNKSPLCSS--------SHAMH 462
Query: 494 LYFSSSTMGNGE 505
+F+S + E
Sbjct: 463 HHFASDCCSSPE 474
>gi|147780178|emb|CAN75522.1| hypothetical protein VITISV_043595 [Vitis vinifera]
Length = 733
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 179/349 (51%), Positives = 219/349 (62%), Gaps = 53/349 (15%)
Query: 75 RNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVV 134
RN EV PFGS+PLKTYLPDGDIDLTA N EE A DVC++LE E Q +EF V
Sbjct: 5 RNSRSIEVLPFGSMPLKTYLPDGDIDLTALCPENDEEDFARDVCTLLEGERQ-MGSEFRV 63
Query: 135 KDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKA 194
+D IRA+VK+VKC+VQ+I VDISFNQ GGLSTLCFLEQ+D LIGKDHLFKRS+ILIKA
Sbjct: 64 EDISYIRAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKA 123
Query: 195 WCYYESRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCI 237
WCYYE RILG+H GL+STYALE L VLY+FLDY+S FDW+ + +
Sbjct: 124 WCYYEGRILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGV 183
Query: 238 SLNGPVRISSL-------------------------------PEVVV---ETPENSGGDL 263
S+ GPV ISSL P+ V+ E E + L
Sbjct: 184 SVLGPVSISSLLTGARESCLIMWLCLMVCFFRLIGLPFYLIFPDFVLFVAEAAETADKPL 243
Query: 264 LLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSA 323
L++ EFL C E F+V R + + F KH+NI DPL++ NNLGRS+S GN YR R A
Sbjct: 244 LINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRFRYA 303
Query: 324 FTYGARKLGHILSQ-PEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPL 371
+ GA++L IL PE + + L++FF+NTLDR+G GQ D D VP
Sbjct: 304 ISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGADEGDLVPF 352
>gi|168035287|ref|XP_001770142.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678668|gb|EDQ65124.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1504
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 165/314 (52%), Positives = 206/314 (65%), Gaps = 28/314 (8%)
Query: 85 FGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAE 143
FGSVPLKTYLPDGDIDL+AF +V+ D + L++ N +EF VK+ QLI AE
Sbjct: 144 FGSVPLKTYLPDGDIDLSAFTPSPDVKRTWIQDTYNALQKAKDNPNSEFRVKEVQLIHAE 203
Query: 144 VKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRIL 203
VK+VKC V+NI+VD+SF+QLGGL TLCFL +VD+LIG+DHLFKRSIIL+KAWCYYESRIL
Sbjct: 204 VKIVKCFVENILVDVSFDQLGGLGTLCFLVEVDKLIGEDHLFKRSIILVKAWCYYESRIL 263
Query: 204 GAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRIS 246
GAH GL+STYA+E L VLY FL++FS FDWD+YC+SL+ P+ +
Sbjct: 264 GAHCGLMSTYAVEALVLYIFDKFHASLRGPLQVLYLFLEFFSSFDWDNYCVSLSSPIPLK 323
Query: 247 SLP---------EVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLN 297
SL + + + GG+L + EFL C ++ V T S F K LN
Sbjct: 324 SLSKDSEKLEDLQKLALSTRRDGGELFFTKEFLVACETEYGVVPVSQITKSNKFTVKCLN 383
Query: 298 IVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNT-LDR 356
I DPL+ +NNLGRSV++GNF RIR AF +GAR L +LS EE + EL +FF N L
Sbjct: 384 ISDPLRSSNNLGRSVNQGNFARIRRAFDFGARTLRRVLSCTEEDVPAELEQFFKNCNLRL 443
Query: 357 HGSGQRPDVQDPVP 370
HG QRPDV P P
Sbjct: 444 HGGYQRPDVASPRP 457
Score = 42.7 bits (99), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 10/82 (12%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAI--GAEYWQRAEEATQGIIAQVQPTVV 58
MGD W PE +G P + S + T + G +W +AE +I +QP
Sbjct: 1 MGDPERW-PEASGLT----PEGAQSFDAAFATFLSKGDAWWAKAELRAAELITSLQPNEA 55
Query: 59 SEERRKAVIDYVQRLIRNYLGC 80
SE+RR+ VIDYV+ L++ GC
Sbjct: 56 SEQRRQDVIDYVRGLVK---GC 74
>gi|358347363|ref|XP_003637727.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
gi|355503662|gb|AES84865.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
Length = 827
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 151/239 (63%), Positives = 177/239 (74%), Gaps = 17/239 (7%)
Query: 144 VKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRIL 203
VKLVKCLV+NIVVDISFNQLGGL TLCFLE+VD LI +HLFKRSIILIKAWCYYESRIL
Sbjct: 109 VKLVKCLVENIVVDISFNQLGGLCTLCFLEEVDGLINHNHLFKRSIILIKAWCYYESRIL 168
Query: 204 GAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRIS 246
GAHHGLISTYALETLVLY +FL++FSKFDWD++C+SL GPV IS
Sbjct: 169 GAHHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPIS 228
Query: 247 SLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENN 306
SLP+V E P G+LLL FL C ++V G + + F KH N++DPL+ NN
Sbjct: 229 SLPDVTAEPPRKDAGELLLHKSFLDACSTVYAVFPGGPENQGQPFVSKHFNVIDPLRVNN 288
Query: 307 NLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
NLGRSVSKGNF+RIRSAF +GA+KL +L P++ L E+ +FF NT DRHGSGQRPD
Sbjct: 289 NLGRSVSKGNFFRIRSAFAFGAKKLARLLDCPKDELFLEVNQFFLNTWDRHGSGQRPDA 347
>gi|302835555|ref|XP_002949339.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
nagariensis]
gi|300265641|gb|EFJ49832.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
nagariensis]
Length = 3433
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 166/368 (45%), Positives = 218/368 (59%), Gaps = 34/368 (9%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLG---CEVFPFGSVPLKTYLPDGDIDLT 102
T +I++++PT +S +RR + ++V +++ PFGSVPLKTYLPDGDIDL+
Sbjct: 34 TDTLISRIRPTGLSLQRRWVITEHVTSIVKRCFAPHDVTAIPFGSVPLKTYLPDGDIDLS 93
Query: 103 AFG----GLNVEEAL----ANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNI 154
+ ++EAL A + LE E N A F V + Q+I AEVKL+KCLV NI
Sbjct: 94 IYSESPRAQALKEALRDTWATQLQVCLEEEANNPTAVFRVANVQVIHAEVKLLKCLVDNI 153
Query: 155 VVDISFNQLGGLSTLCFLEQVDRLIG-----KDHLFKRSIILIKAWCYYESRILGAHHGL 209
VVDISF Q+GGL+T FLE VDR + + HLFK SIIL+K WCYYESR+LGAHHGL
Sbjct: 154 VVDISFFQVGGLNTYNFLEDVDRFVDQCIPVRKHLFKDSIILVKGWCYYESRVLGAHHGL 213
Query: 210 ISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVV 252
ISTYALETL VLYKFL S FDW++YC+SL GP+ +SS P+ V
Sbjct: 214 ISTYALETLVLYVINLYHRELTNPLQVLYKFLVECSCFDWENYCLSLEGPIPLSSFPKPV 273
Query: 253 VETPENSGGDLLLSSEFLKECVEQFSVPS-RGFDTNSRSFPPKHLNIVDPLKENNNLGRS 311
VETPE D LL+ +F+ +++ P R + F K LN++DP+ NNLGRS
Sbjct: 274 VETPEALQRDALLTKDFMARAYFKYTEPQLRAQGGEPKPFAIKQLNVMDPILPGNNLGRS 333
Query: 312 VSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPL 371
VSK ++ RIR AF +GAR L I Q +E K F N + + QRP + P
Sbjct: 334 VSKASYLRIRRAFEHGARMLADIAEQGKELGPFIGAKRFDNFFGKAWNAQRPRNRPAGPA 393
Query: 372 SRYNGFGV 379
+G G+
Sbjct: 394 VAEHGNGM 401
>gi|159471748|ref|XP_001694018.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277185|gb|EDP02954.1| predicted protein [Chlamydomonas reinhardtii]
Length = 633
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 161/373 (43%), Positives = 215/373 (57%), Gaps = 34/373 (9%)
Query: 26 SVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLG---CEV 82
S P+ + E+ T +I++++PT +S +RR + ++V +L++
Sbjct: 11 SAPAALRKLSPEFGNDLLSRTDTLISRIRPTTLSLQRRFVITEHVTQLVKRCFAPHDVTA 70
Query: 83 FPFGSVPLKTYLPDGDIDLTAFGGLNVEEAL--------ANDVCSVLEREDQNKAAEFVV 134
PFGSVPLKTYLPDGDIDL+ + + ++L A + LE E N A F V
Sbjct: 71 VPFGSVPLKTYLPDGDIDLSIYSYSSRAQSLKDQLRDTWATTLQLCLEDEANNPHAAFKV 130
Query: 135 KDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGK-----DHLFKRSI 189
+ Q+I AEVKL+KCLV NIVVDISF Q+GGL+T FLE VD + K HLFK SI
Sbjct: 131 ANVQVIHAEVKLLKCLVDNIVVDISFFQIGGLNTYNFLEDVDAFVDKAITARKHLFKDSI 190
Query: 190 ILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDW 232
IL+K WCYYESR+LGAHHGLISTYALETL VLYKFL S FDW
Sbjct: 191 ILVKGWCYYESRVLGAHHGLISTYALETLVLYVINLYHRELSNPLQVLYKFLVECSGFDW 250
Query: 233 DSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS-RGFDTNSRSF 291
+ YC++L GP+ ++S P VVETPE + LL+ F+ +++ P + F
Sbjct: 251 ERYCLTLQGPIPLASFPNPVVETPEPLQREPLLTEHFMTRAYNKYTAPQVAAMGGEVKPF 310
Query: 292 PPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFS 351
K LN++DP+ NNNLGRSVSK ++ RIR AF +GAR L I Q +E + F
Sbjct: 311 AIKQLNVMDPILPNNNLGRSVSKASYLRIRRAFEHGARMLAAIAEQTKELGAVVASRNFD 370
Query: 352 NTLDRHGSGQRPD 364
N + + QRP+
Sbjct: 371 NFFGKVWNAQRPN 383
>gi|384253068|gb|EIE26543.1| hypothetical protein COCSUDRAFT_39611 [Coccomyxa subellipsoidea
C-169]
Length = 1155
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 148/295 (50%), Positives = 188/295 (63%), Gaps = 22/295 (7%)
Query: 78 LGCEVFPFGSVPLKTYLPDGDIDLTAF--GGLNVEEALANDVCSVLEREDQNKAAEFVVK 135
L E + FGSVPLKTYLPDGDIDL F G + + ++ ++LE E +N VK
Sbjct: 2 LQVEAYMFGSVPLKTYLPDGDIDLAVFQGKGPRLRDVWTYELSALLEAEGRNALNPHRVK 61
Query: 136 DAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAW 195
D Q+I AEVKL+KCLV NIVVDISF+ LGGL T+ FLE +DR IGK HLFKRS+IL+KAW
Sbjct: 62 DVQIINAEVKLLKCLVDNIVVDISFDTLGGLCTVAFLESIDRHIGKQHLFKRSVILVKAW 121
Query: 196 CYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDSYCIS 238
CYYESR+LGAHHGL+STYALET+VLY KFL FSKFDWD + +S
Sbjct: 122 CYYESRLLGAHHGLLSTYALETMVLYIFNMYHHELQSPLKVLRKFLVVFSKFDWDGHALS 181
Query: 239 LNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNI 298
L GP+ +SS P+ VE + G LL + LK +E +S +G ++F K++NI
Sbjct: 182 LQGPIPLSSFPDPQVEPVAGAEGGALLRGDVLKTMLEMYSPVQQG---PGKAFTIKNMNI 238
Query: 299 VDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNT 353
+DPL NNLGRSV+K + RIR A +G L I + + T+ + FF NT
Sbjct: 239 MDPLLPTNNLGRSVNKASKARIRKALAHGCHMLDSIFDKVGQEATEAVDGFFRNT 293
>gi|357116041|ref|XP_003559793.1| PREDICTED: uncharacterized protein LOC100830879 [Brachypodium
distachyon]
Length = 899
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 147/303 (48%), Positives = 197/303 (65%), Gaps = 19/303 (6%)
Query: 29 SNQTAIGA-EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGS 87
+ +TAI E + AE A G++ + PT +E RR+ V D+ +RLI GC+V +GS
Sbjct: 25 TEKTAIQVPEQMRVAEAAAAGVLRCLLPTEEAERRRRQVTDHARRLIGTNFGCQVLTYGS 84
Query: 88 VPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLV 147
VPLKTYLPDGDID+T ++ + +DV ++L E++N AEFV++ + + A+VK+
Sbjct: 85 VPLKTYLPDGDIDVTILTHKPLDSTIIDDVRNLLNAEEKNTDAEFVLESRRYVDAQVKVF 144
Query: 148 KCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHH 207
KC + NI VDISFNQ+GG+STLCFLE VD +GKDHLFKRSIILIKAWCY E+RI G+
Sbjct: 145 KCNIANIDVDISFNQIGGVSTLCFLELVDTEVGKDHLFKRSIILIKAWCYNEARIQGSDQ 204
Query: 208 GLISTYALETLV-----------------LYKFLDYFSKFDWDSYCISLNGPVRISSLPE 250
L+STYALE L+ LY FL+Y+SKFDW YC++L+GPV +SSL
Sbjct: 205 WLLSTYALEILILYIFNMFHNSLHGPFEALYMFLEYYSKFDWGKYCVTLDGPVPLSSLAN 264
Query: 251 VVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGR 310
E P + +LLL E L ++ V +G D + F PK LNI+DPLK +NNLGR
Sbjct: 265 FTAE-PAVANDELLLGKESLSASSDRLLVLPKGSDRHDPEFRPKILNIIDPLKGDNNLGR 323
Query: 311 SVS 313
S+S
Sbjct: 324 SIS 326
>gi|302125450|emb|CBI35537.3| unnamed protein product [Vitis vinifera]
Length = 398
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 164/313 (52%), Positives = 188/313 (60%), Gaps = 81/313 (25%)
Query: 73 LIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEF 132
LIR LGCEVFP+GSVPLK YL DGDIDLT NVEEALA+DV +VL+ E QN+ AEF
Sbjct: 55 LIRCCLGCEVFPYGSVPLKIYLLDGDIDLTVLCSSNVEEALASDVHAVLKGERQNENAEF 114
Query: 133 VVKDAQL-IRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIIL 191
VK+ Q I EVK VKCLV++IV+DISFNQLGGLSTLCFL+QVDRLIGKDHLFKRSIIL
Sbjct: 115 EVKNVQFNIIVEVKPVKCLVKDIVIDISFNQLGGLSTLCFLKQVDRLIGKDHLFKRSIIL 174
Query: 192 IKAWCYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDS 234
IK+ CYYESRILGA+HGLISTYALE LVLY +FLDYFSKFDWD+
Sbjct: 175 IKSRCYYESRILGAYHGLISTYALEILVLYIFHLFHSSLDGPLAVGYRFLDYFSKFDWDN 234
Query: 235 YCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPK 294
YCISLNG V SSLP
Sbjct: 235 YCISLNGSVCKSSLP--------------------------------------------- 249
Query: 295 HLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTL 354
+IV L EN F YG+ KLG L P E + DEL+ FF++TL
Sbjct: 250 --DIVAELPEN----------------GGFKYGSHKLGQNLLLPREVIQDELKNFFASTL 291
Query: 355 DRHGSGQRPDVQD 367
+RH S ++Q+
Sbjct: 292 ERHRSKYMAEIQN 304
Score = 46.2 bits (108), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 593 TVGSPRAANSLSDLSGDYESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLM 652
T GS + L DL+GDY+SH SL + Y HAL P P SQ Q WD +
Sbjct: 308 TFGSRGSLEILLDLNGDYDSHIRSLQYGQCCYGHALPPPLLPSPPLSPSQLQINTPWDKV 367
Query: 653 QRSLPFRRNIIPQMSANGAVPRPLFYPMTPP 683
+ L F+RN+ QM +NG + F P+ P
Sbjct: 368 HQHLQFKRNLYSQMDSNGVILGNHF-PVKQP 397
>gi|414866688|tpg|DAA45245.1| TPA: hypothetical protein ZEAMMB73_273182, partial [Zea mays]
Length = 260
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 124/195 (63%), Positives = 147/195 (75%), Gaps = 9/195 (4%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+V+PT SE RR V+DY +RL+ + LGCEVF FGSVPLKTYLPDGDIDLT G + +
Sbjct: 37 RVRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDS 96
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
L NDV +LE E+QN AEFVVKD + I AEV+L+KC + NI+VDISFNQ GG+ LCF
Sbjct: 97 TLVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCF 156
Query: 172 LEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFD 231
LE VDR +GK+HLFKRSIILIKAWCYYESR+LGAHHGLISTYALE L+LY F + F K
Sbjct: 157 LELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVF-NLFHK-- 213
Query: 232 WDSYCISLNGPVRIS 246
SL+ PV +
Sbjct: 214 ------SLHSPVEVC 222
>gi|255083767|ref|XP_002508458.1| predicted protein [Micromonas sp. RCC299]
gi|226523735|gb|ACO69716.1| predicted protein [Micromonas sp. RCC299]
Length = 1269
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 206/360 (57%), Gaps = 58/360 (16%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE---VFPFGSVPLKTYLPDGDIDLTAFG 105
+I ++PT S+ RR+ V ++ L+ E V FGSVPL+TYLPDGDID+ G
Sbjct: 31 LIDVLRPTEQSDRRRRGVFRHIASLVDGCFAGENVLVTAFGSVPLRTYLPDGDIDVCLLG 90
Query: 106 GLNVEEALANDVCSV-----LER----------EDQNKAAEFVVKDAQLIRAEVKLVKCL 150
E L+ D +V +ER E + AEF V + +I AEVKL+K +
Sbjct: 91 ---PHELLSRDDWTVRLRAHVERAEAAAAEASIELGSPVAEFAVSEIHIIHAEVKLMKLI 147
Query: 151 VQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210
+VVD+S NQ GGL+ L FLE+V+ IGK +FKRSI+LIKAW +YE R+LGAHH LI
Sbjct: 148 CDGVVVDVSANQFGGLAALGFLEEVNAFIGKGEIFKRSIVLIKAWGFYEGRLLGAHHALI 207
Query: 211 STYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEV-- 251
STYALETLVLY KFL +F+ FDWD + +S++GPV + L +V
Sbjct: 208 STYALETLVLYILNRFHKELSTPLEVLHKFLVFFADFDWDKFAVSVHGPVPLEDLHKVTG 267
Query: 252 -VVETPENSGGDLLLSSEFLKECVEQF---SVPSR---GFDTNSRSFPPKHLNIVDPLKE 304
+ + PE LL+ +F+ ++++ SV ++ G D+ R K+LN+VDPL
Sbjct: 268 PIGKRPEVHAEGALLTPDFMWRMMDKYGNESVSAKLGGGADSTPRPMARKYLNVVDPLLS 327
Query: 305 NNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDE-------LRKFFSNTLDRH 357
+NNLGRSVS+GN RIR A GA++L + E S E L +FF NT+ RH
Sbjct: 328 SNNLGRSVSQGNAKRIRKALALGAQRLTALR---ESSTGGECFGAVRMLEQFFGNTM-RH 383
>gi|145341816|ref|XP_001415999.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576222|gb|ABO94291.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 904
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 192/318 (60%), Gaps = 28/318 (8%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE---VFPFGSVPLKTYLPDGDI 99
E T ++A ++PT +SE RR+AV +++++L + G V +GSVPL+ YLPDGDI
Sbjct: 36 ETLTNELVASLRPTEMSEIRRRAVFEHIKQLAQECFGTAHTLVSAYGSVPLRAYLPDGDI 95
Query: 100 DLTAFGGLNV--EEALANDVCSVLEREDQ--NKAAEFVVKDAQLIRAEVKLVKCLVQNIV 155
D+ G V + +E+ + + EF V + +I AEV+L+KC+V ++
Sbjct: 96 DVCLLGDHRVIDKAQWTTKFRKHIEKAEAEADPPHEFAVSEVSVINAEVRLMKCIVDGMM 155
Query: 156 VDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
VD+S NQ GGL++L FLE+++ IG+D LF RSIIL+KAW +YE RILGAHH LISTYAL
Sbjct: 156 VDVSANQFGGLASLGFLEEMNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALISTYAL 215
Query: 216 ETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPEN 258
ETLVLY K L F++FDW+ Y ++++GPV I E + P+
Sbjct: 216 ETLVLYIINKYHADLTCPLSVLHKLLSVFAEFDWEGYALTIHGPVAI----EGIATPPDE 271
Query: 259 SGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFY 318
L++ EF++ + +S +S K++NI+DPL NNNLGRSVS GN+
Sbjct: 272 CLEGGLITEEFMRTMLSTYSCEFMRAAASSAPVTVKYMNIIDPLLPNNNLGRSVSCGNYR 331
Query: 319 RIRSAFTYGARKLGHILS 336
R+R+A GA++L +++
Sbjct: 332 RVRAALKLGAQRLDALMA 349
>gi|308799699|ref|XP_003074630.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
gi|116000801|emb|CAL50481.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
Length = 875
Score = 238 bits (607), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 192/322 (59%), Gaps = 35/322 (10%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE---VFPFGSVPLKTYLPDGDI 99
E T ++ ++PT SE RR+AV ++++ L + G V +GSVPL+ YLPDGDI
Sbjct: 32 ETLTNELVESLRPTAKSEMRRRAVFEHIKELAQGCFGTAHTLVSVYGSVPLRAYLPDGDI 91
Query: 100 DLTAFGGLNV--EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVD 157
D+ G V + + +E+ + EF V + +I AEV+L+KC+V ++VD
Sbjct: 92 DVCLLGDHRVIDKASWTTKFQKHIEKVEAESDFEFAVSEVSVINAEVRLMKCIVDGMMVD 151
Query: 158 ISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 217
+S NQ GGL++L FLE+ + IG+D LF RSIIL+KAW +YE RILGAHH LI+TYALET
Sbjct: 152 VSANQFGGLASLGFLEETNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALIATYALET 211
Query: 218 LVLY-----------------KFLDYFSKFDWDSYCISLNGPVRI---SSLPEVVVETPE 257
LVLY K L F FDW+ Y ++++GPV + +++P +E
Sbjct: 212 LVLYIINKYYAELTCPLSVLHKLLRVFGDFDWEGYVLTIHGPVALEDANNIPPGCLE--- 268
Query: 258 NSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNF 317
GG LL+ EF++ + Q+ + +NS K++NI+DPL NNNLGRSVS GN+
Sbjct: 269 --GG--LLTEEFMQSMLCQY---GQIETSNSAPVVVKYMNIIDPLVPNNNLGRSVSCGNY 321
Query: 318 YRIRSAFTYGARKLGHILSQPE 339
R+R+A GAR L ++ + E
Sbjct: 322 RRVRAALRLGARHLDKLMERSE 343
>gi|77553482|gb|ABA96278.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
gi|215769169|dbj|BAH01398.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 622
Score = 232 bits (591), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 114/195 (58%), Positives = 138/195 (70%), Gaps = 17/195 (8%)
Query: 190 ILIKAWCYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDW 232
+LIKAWCYYESRILGAHHGLISTYALE LVLY +FLDY+SKFDW
Sbjct: 1 MLIKAWCYYESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDW 60
Query: 233 DSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFP 292
D+ ISL GP+ +SSLPE+V ++P+ D + +FLKEC + F+V R + N++ FP
Sbjct: 61 DNKGISLYGPISLSSLPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFP 120
Query: 293 PKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSN 352
K NIVDPLK++NNLGRSVSKGNF RIRSAF +GARKLG IL P+ DE+ +FF N
Sbjct: 121 RKFFNIVDPLKQSNNLGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRN 180
Query: 353 TLDRHGSGQRPDVQD 367
TL RH S RPDVQ+
Sbjct: 181 TLKRHCSRVRPDVQE 195
>gi|290976573|ref|XP_002671014.1| predicted protein [Naegleria gruberi]
gi|284084579|gb|EFC38270.1| predicted protein [Naegleria gruberi]
Length = 763
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 142/389 (36%), Positives = 210/389 (53%), Gaps = 81/389 (20%)
Query: 39 WQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGD 98
++R Q ++ ++QP+ SE+ RK V D + ++ + + +GSV KTYLPDGD
Sbjct: 162 FRRCNSLIQQLLYRIQPSSESEKHRKEVFDIIAAVLE-LANLKTYLYGSVAFKTYLPDGD 220
Query: 99 IDLTAFGGLNVEEAL---ANDVCSVLEREDQ----------------------------- 126
IDL+ F ++ EE L + +V ++L + Q
Sbjct: 221 IDLSVF--VSNEEYLELSSQNVNNLLSHQPQVNDSTISYVHNVLLKNMHIGLKQQLADPS 278
Query: 127 ----NKAAEFV----------VKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFL 172
NKA ++D I AEVKL+KC V NI +D+S Q+GGLSTLCFL
Sbjct: 279 IPWYNKARSLFSEIQRNNLAYIEDMTFINAEVKLIKCTVNNIPIDMSSGQIGGLSTLCFL 338
Query: 173 EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV------------- 219
+VD I +HLFKRSIIL+K+W YYESRILG+HHGL+STY L L+
Sbjct: 339 HEVDDKIADNHLFKRSIILMKSWSYYESRILGSHHGLVSTYGLTVLLMYMFRLYKIETPL 398
Query: 220 --LYKFLDYFSKFDWDSYCISLNGPVRI------SSLPEVVVET--PENSGGDLLLSSEF 269
LY+FL+Y+S FDW ++ IS+ GP+ + S+ + E PE L+S F
Sbjct: 399 QALYRFLNYYSTFDWTNFGISIYGPIPLGAINDHKSIEDFYYENLPPERHDS---LTSSF 455
Query: 270 LKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGAR 329
L+ C ++ G +S++F K+LNIVDPL++ NNLGRSV+ NF RIR A G++
Sbjct: 456 LQSCKSKY-----GTVDSSKTFTIKNLNIVDPLRDFNNLGRSVNYNNFLRIRRAIKKGSK 510
Query: 330 KLGHILSQPEESLTDELRK-FFSNTLDRH 357
+ IL + ++++ K FF N ++++
Sbjct: 511 TITDILISNDLMESEKILKLFFKNVVEKY 539
>gi|307104056|gb|EFN52312.1| hypothetical protein CHLNCDRAFT_58914 [Chlorella variabilis]
Length = 740
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 146/353 (41%), Positives = 204/353 (57%), Gaps = 62/353 (17%)
Query: 40 QRAEEAT----QGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLP 95
QR EE T + + A ++ + E+R+ AV L++ L E F FGSVPL+ LP
Sbjct: 352 QRQEEGTGQSQEAVRASLEVEQLLEQRQAAVA-----LVQECLQVEAFMFGSVPLRAVLP 406
Query: 96 DGDIDLTAF-------------GGL------------NVEEALANDVCSVLEREDQNKAA 130
DGDID++ F GG ++ + A+ + LERE A
Sbjct: 407 DGDIDISFFATAATTPSSPSGNGGEQPGHRAGASPPGDLRDTWASQLLRALEREAVRPDA 466
Query: 131 EFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSII 190
F ++D Q+I+AEVKLVKC+V ++VVD+SF+ +GGL T+ FLE DR IG+ HLFKRSI+
Sbjct: 467 PFKIRDVQIIQAEVKLVKCVVHDVVVDVSFDTVGGLCTVAFLEAADRRIGRQHLFKRSIL 526
Query: 191 LIKAWCYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWD 233
L+KAWCYYESR+LGAHHGLIS+YALE LVLY +FL FDW+
Sbjct: 527 LLKAWCYYESRLLGAHHGLISSYALEVLVLYIFNLHHAELHTPLDVLRRFLAVLGSFDWE 586
Query: 234 SYCISLNGPVRISSLPEVVVETPE--NSGGDLLLSSEFLKECVEQFSV---------PSR 282
YC++L GP+ I+ L ++ V+ +SG + LL ++F++ ++ +SV +
Sbjct: 587 RYCLALQGPLPIADLHKLHVDRTALVSSGTEPLLDADFMRGVLQHYSVQHLSQQQQQEAA 646
Query: 283 GFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL 335
G + FP KHLNIVDPL +NNLGRSVSK ++ R++ A G R L L
Sbjct: 647 GMQLVAPRFPLKHLNIVDPLLPSNNLGRSVSKASYARVKKALALGNRMLEEAL 699
>gi|428171015|gb|EKX39935.1| hypothetical protein GUITHDRAFT_113927 [Guillardia theta CCMP2712]
Length = 632
Score = 229 bits (584), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 150/389 (38%), Positives = 220/389 (56%), Gaps = 53/389 (13%)
Query: 27 VPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE----- 81
+P + + ++ Q +E+A + I+ Q+QP +E R V +YV++LI++ E
Sbjct: 1 MPCSSSEPARKHGQLSEQADE-IVRQLQPHRRAERHRLTVFEYVKKLIKHVADEENKTEI 59
Query: 82 -VFPFGSVPLKTYLPDGDIDLTAFGGLNV----------EEALANDVCSV-----LERED 125
V FGSVPLKTYLP GD+D+TAF ++ +EA ND+ V + R+
Sbjct: 60 YVHRFGSVPLKTYLPHGDLDVTAFAANDLWLERLKAKLEDEAKKNDMYVVSGVHSVPRDL 119
Query: 126 QNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLF 185
+ ++ E + K Q VK+VKC V I VDI+ N LGG+ LCFLE+VD ++ +DHLF
Sbjct: 120 RAQSREELGKKDQGPVEIVKVVKCQVNGISVDITANALGGMCNLCFLEKVDTMLKRDHLF 179
Query: 186 KRSIILIKAWCYYESRILGAHHGLISTYALETL-----------------VLYKFLDYFS 228
KR+ IL+K+WCY+ES IL + +GL+STYALETL VL +FL+Y++
Sbjct: 180 KRATILVKSWCYFESHILSSQNGLLSTYALETLVLCIVNIFHEELQTPLDVLKRFLEYYA 239
Query: 229 KFDWDSYCISLNGPVRISSLPEVVVETPE-NSGGDLLLSSEFLKECVE-QFSVPSRGFDT 286
FDW ++C+++ GPV S++P E P ++ LL+ L+E QF + G
Sbjct: 240 NFDWRNHCLTMRGPVNRSNIPP-GGEVPHLDNEPSYLLNDAILQEDSHLQFLMS--GLQD 296
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL-----SQPEES 341
++R F K++NI DPL NN+GRSVS+ + YRI SAF +G + L +L S
Sbjct: 297 DNRGFQWKYMNICDPLSTRNNIGRSVSRSSAYRIASAFRHGWQSLSGLLYCSLHHNSSIS 356
Query: 342 LT---DELRKFFSNTLDRHGSGQRPDVQD 367
LT R FF T + +G RPDV D
Sbjct: 357 LTKSEKSARAFFHFT-GKTLTGHRPDVGD 384
>gi|297612542|ref|NP_001065982.2| Os12g0114200 [Oryza sativa Japonica Group]
gi|255669984|dbj|BAF29001.2| Os12g0114200, partial [Oryza sativa Japonica Group]
Length = 178
Score = 229 bits (583), Expect = 4e-57, Method: Composition-based stats.
Identities = 115/167 (68%), Positives = 131/167 (78%), Gaps = 10/167 (5%)
Query: 81 EVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLI 140
+VFPFGSVPLKTYLPDGDIDLTAFG + +E LA V +VLE E+ K AEF VKD Q I
Sbjct: 1 QVFPFGSVPLKTYLPDGDIDLTAFGH-SSDEILAKQVQAVLESEEARKDAEFEVKDVQYI 59
Query: 141 RAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYES 200
AEVKLVKC+VQNI+VDISFNQ GGL TLCFLE+VD+ K+HLFKRSI+LIKAWCYYES
Sbjct: 60 HAEVKLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYES 119
Query: 201 RILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISS 247
RILGAHHGLISTYALE LVLY F + +L+GP+ +SS
Sbjct: 120 RILGAHHGLISTYALEILVLYIFHLFHG---------TLDGPLAVSS 157
>gi|168035607|ref|XP_001770301.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678518|gb|EDQ64976.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1631
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/263 (43%), Positives = 156/263 (59%), Gaps = 49/263 (18%)
Query: 150 LVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKD-------HLFKRSIILIKAWCYYESRI 202
LV++++ +S LS L E VDR I ++ HLFKRS+IL+KAWCYYESRI
Sbjct: 232 LVKDLLFHLS------LSNLATRELVDREIDRNDFELKQNHLFKRSVILVKAWCYYESRI 285
Query: 203 LGAHHGLISTYALETL-------------------VLYKFLDYFSKFDWDSYCISLNGPV 243
LGAHHGLISTYALETL VLY FL YF FDWD YC+++ GPV
Sbjct: 286 LGAHHGLISTYALETLVLYIFHVFHPKRRLRGPLEVLYLFLVYFCNFDWDKYCVTMWGPV 345
Query: 244 RISSLPEV-------------VVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNS-R 289
++ + E+ E P G LLLS EFL+ C++ +S G +++ R
Sbjct: 346 PLARITEISSGSARKTFRISDFAEAPRKDRGKLLLSKEFLERCIDSYSDAKGGQESSQRR 405
Query: 290 SFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKF 349
+F K LN++DP+++ NNLGRSV+ G+F RIRSAF GAR LG +L P + + ++ + F
Sbjct: 406 NFITKFLNVLDPIRDTNNLGRSVNVGSFKRIRSAFGLGARTLGEVLECPTDQINEKFKSF 465
Query: 350 FSNT---LDRHGSGQRPDVQDPV 369
FS T L+R+ G RPD +P+
Sbjct: 466 FSCTFKSLERYRIGGRPDTGNPL 488
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/166 (52%), Positives = 113/166 (68%), Gaps = 2/166 (1%)
Query: 9 PEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVID 68
P+PNG +S ++ S+ + +W R E T +I ++PT SEERR AV
Sbjct: 8 PQPNG--LTTEGASQFATAFSSAAKLEDGWWSRVEGHTAELIDSIKPTRSSEERRTAVTA 65
Query: 69 YVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNK 128
+VQRLIR+ C+V FGSVPLKTYLPDGDIDLT F +++E A DV L++ +++
Sbjct: 66 FVQRLIRDRFDCKVVKFGSVPLKTYLPDGDIDLTIFARNDLKETWAQDVVKALKQAEEDT 125
Query: 129 AAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQ 174
AEF VK+ Q I+AEVKL+KCLV+NIVVDISFNQ GGLST CFLE+
Sbjct: 126 NAEFRVKEVQYIQAEVKLIKCLVENIVVDISFNQTGGLSTFCFLEE 171
>gi|303287038|ref|XP_003062808.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455444|gb|EEH52747.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 781
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 114/257 (44%), Positives = 152/257 (59%), Gaps = 41/257 (15%)
Query: 147 VKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAH 206
+KC+ +VVDIS NQ GGL+TL FLE+VD I +D +FKRSIILIKAW +YE R+LGAH
Sbjct: 1 MKCIADGVVVDISANQFGGLATLGFLEEVDAFIARDGIFKRSIILIKAWGFYEGRVLGAH 60
Query: 207 HGLISTYALETLVLY-----------------KFLDYFSKFDWDSYCISLNGPVRISSLP 249
H LISTYALETLVLY KFL YF+ F+WD+Y +S++GPVR+ +L
Sbjct: 61 HALISTYALETLVLYVLNAYHEELSTPLEVLHKFLTYFADFEWDAYAVSIHGPVRLDALE 120
Query: 250 EVVVETPENSGGDLLLSSEFLKECVEQFS----------VPSRGFDTNSRSFPPKHLNIV 299
+ V + + G LL+ F K ++++ G N R+ PKHLN++
Sbjct: 121 KGVRDADAPARGP-LLTPAFTKRVLDKYGNDAIINAEKGQAGPGGGGNRRAMQPKHLNVI 179
Query: 300 DPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDE------LRKFFSNT 353
DPL +NNLGRSVS+GN RI+ A GA KL + + + ++ E L FF T
Sbjct: 180 DPLLPSNNLGRSVSQGNAKRIQKALRLGAAKLTSLRNAMRDGVSCELNAVRVLEHFFGCT 239
Query: 354 LDRHGSGQRPDVQDPVP 370
+ RH R DV P+P
Sbjct: 240 I-RH----RRDV--PLP 249
>gi|218192781|gb|EEC75208.1| hypothetical protein OsI_11468 [Oryza sativa Indica Group]
Length = 860
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/189 (52%), Positives = 116/189 (61%), Gaps = 48/189 (25%)
Query: 82 VFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIR 141
VF +GSVPLKTYLPDGD+DLT G + L +D+ +L+ E+QN AEF VKD QLI
Sbjct: 20 VFAYGSVPLKTYLPDGDVDLTVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLIN 79
Query: 142 AEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESR 201
AE VDR +GK+HL K SIILIKAWCYYESR
Sbjct: 80 AE-------------------------------VDRKVGKNHLVKNSIILIKAWCYYESR 108
Query: 202 ILGAHHGLISTYALETL-----------------VLYKFLDYFSKFDWDSYCISLNGPVR 244
+LGAHHGLISTYALETL VLY+FL+YFSKFDWD+YCISLNGPV
Sbjct: 109 LLGAHHGLISTYALETLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVA 168
Query: 245 ISSLPEVVV 253
+SSLP +
Sbjct: 169 LSSLPNQIA 177
>gi|226506494|ref|NP_001141604.1| uncharacterized protein LOC100273722 [Zea mays]
gi|194705246|gb|ACF86707.1| unknown [Zea mays]
gi|413924676|gb|AFW64608.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 251
Score = 185 bits (470), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 97/184 (52%), Positives = 117/184 (63%), Gaps = 17/184 (9%)
Query: 190 ILIKAWCYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDW 232
+LIK WCYYES ILGA GL+STYALETLVLY +FLDY+SKFDW
Sbjct: 1 MLIKHWCYYESCILGAQRGLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDW 60
Query: 233 DSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFP 292
D+ ISL GP+ +SSLPE+V E P L FLK+C + FSVP + N + F
Sbjct: 61 DNKGISLFGPISLSSLPELVTEPPYTRDDGFLSREAFLKDCAKAFSVPPINSEENPQVFS 120
Query: 293 PKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSN 352
K +NIVDPLK++NNLGRS+SKGN RIR F +GA KLG IL P +E+ +FF N
Sbjct: 121 KKFVNIVDPLKQSNNLGRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRN 180
Query: 353 TLDR 356
TL R
Sbjct: 181 TLSR 184
>gi|412992209|emb|CCO19922.1| predicted protein [Bathycoccus prasinos]
Length = 1318
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/293 (37%), Positives = 150/293 (51%), Gaps = 69/293 (23%)
Query: 134 VKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIK 193
VKD +I A+V+L+KC+V IVVD+S NQ GGL+TL FL++V+ IGK+ LFKRS+IL+K
Sbjct: 291 VKDIVVIHADVRLLKCVVDGIVVDVSANQFGGLATLAFLKEVNSKIGKNDLFKRSVILVK 350
Query: 194 AWCYYESRILGAHHGLISTYALETL------------------------------VLYKF 223
AW +YESRILGA + L+STYAL+TL VL F
Sbjct: 351 AWAFYESRILGAPYALLSTYALKTLIICALRRFNKKESKSDATKTKKREIATPLDVLRIF 410
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVE-----------------------TPENSG 260
+Y S F W+++ +++ G V + L +V V E +
Sbjct: 411 FEYVSDFPWETHAVTIFGDVPVEKLDKVSVREFSSSSKSEKNKNKNNDDEREEKDDEEAE 470
Query: 261 GDLLLSSEFLKECVEQFSVPSRGFDTN--------------SRSFPPKHLNIVDPLKENN 306
D LL F+ ++ + SR D N R+ KHL+I+DPL E N
Sbjct: 471 EDPLLDDTFVDTILKSYGPDSRP-DANVLLNIGNGKKAPFRRRAIGAKHLHILDPLSETN 529
Query: 307 NLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE-ESLTDELRKFFSNTLDRHG 358
NLGRSVS GNF R+R+AF GA +L + + E E++T FF L G
Sbjct: 530 NLGRSVSLGNFARVRAAFRLGAERLKRLEMESEPENITRGFEYFFKVALANRG 582
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 41/81 (50%), Gaps = 14/81 (17%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE--------------VFPFGSV 88
E T+ +IA ++P+ SE+RR+ V ++ LIR E V FGSV
Sbjct: 129 EALTEELIASLRPSKQSEKRRRMVFRKMESLIRECFEKEFEGEGVNEKKNTIVVSAFGSV 188
Query: 89 PLKTYLPDGDIDLTAFGGLNV 109
P TYLPDGDID+ G V
Sbjct: 189 PFGTYLPDGDIDVCILGDHEV 209
>gi|401410712|ref|XP_003884804.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
gi|325119222|emb|CBZ54776.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
Length = 3449
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 80/212 (37%), Positives = 118/212 (55%), Gaps = 33/212 (15%)
Query: 72 RLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG---------GLNVEEALANDVCSVLE 122
R + + V+ +GS PL+T+LPDGD+D+ G +AL + +
Sbjct: 348 RAEEDEINITVYRYGSFPLRTFLPDGDLDIGIISYNRRTGVVEGEEESDALLAVLLDKFQ 407
Query: 123 REDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKD 182
RED F +++A L+ AEV+++KC+V I VD+S N++GG +L FLE DR IG+
Sbjct: 408 REDVKTHKTFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRH 467
Query: 183 HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV------------------LYKFL 224
HLFKRS++LIK+W YES +LG+ GL++TY +E LV LY F
Sbjct: 468 HLFKRSVLLIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVLPASLLPTPLHLLYHFF 527
Query: 225 DYFSKFDWDSYCISLNGPV------RISSLPE 250
Y+S F WD Y ++ GP+ R SS+P+
Sbjct: 528 SYYSSFHWDRYAVTACGPLPLTFITRASSVPD 559
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 8/79 (10%)
Query: 294 KHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNT 353
+ +N+VDPL NNL RSVS+ FYR+ A G + L HIL+ + + L F N+
Sbjct: 893 RSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTHILATGDAARFRTL--FLPNS 950
Query: 354 ---LDRHGSGQRPDVQDPV 369
LDR S PDV PV
Sbjct: 951 YQLLDRIKS---PDVAYPV 966
>gi|221502484|gb|EEE28211.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3297
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/187 (40%), Positives = 110/187 (58%), Gaps = 27/187 (14%)
Query: 82 VFPFGSVPLKTYLPDGDIDLTAF------GGLNVEE---ALANDVCSVLEREDQNKAAEF 132
V+ +GS PL+T+LPDGD+D+ G L EE AL + +R + F
Sbjct: 224 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 283
Query: 133 VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILI 192
+++A L+ AEV+++KC+V I VD+S N++GG +L FLE DR IG++HLFKRS++LI
Sbjct: 284 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 343
Query: 193 KAWCYYESRILGAHHGLISTYALETLV------------------LYKFLDYFSKFDWDS 234
K+W YES +LG+ GL++TY +E LV LY+F Y+S F WD
Sbjct: 344 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 403
Query: 235 YCISLNG 241
Y ++ G
Sbjct: 404 YAVTACG 410
Score = 46.2 bits (108), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 294 KHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNT 353
+ +N+VDPL NNL RSVS+ FYR+ A G + L +L+ + + L F N+
Sbjct: 742 RSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVLASGDAARFRRL--FLPNS 799
Query: 354 ---LDRHGSGQRPDVQDPV 369
LDR S PDV PV
Sbjct: 800 YQLLDRIKS---PDVAYPV 815
>gi|237843045|ref|XP_002370820.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
gi|211968484|gb|EEB03680.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
Length = 3436
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/187 (40%), Positives = 110/187 (58%), Gaps = 27/187 (14%)
Query: 82 VFPFGSVPLKTYLPDGDIDLTAF------GGLNVEE---ALANDVCSVLEREDQNKAAEF 132
V+ +GS PL+T+LPDGD+D+ G L EE AL + +R + F
Sbjct: 363 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 422
Query: 133 VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILI 192
+++A L+ AEV+++KC+V I VD+S N++GG +L FLE DR IG++HLFKRS++LI
Sbjct: 423 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 482
Query: 193 KAWCYYESRILGAHHGLISTYALETLV------------------LYKFLDYFSKFDWDS 234
K+W YES +LG+ GL++TY +E LV LY+F Y+S F WD
Sbjct: 483 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 542
Query: 235 YCISLNG 241
Y ++ G
Sbjct: 543 YAVTACG 549
Score = 46.2 bits (108), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 294 KHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNT 353
+ +N+VDPL NNL RSVS+ FYR+ A G + L +L+ + + L F N+
Sbjct: 881 RSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVLASGDAARFRRL--FLPNS 938
Query: 354 ---LDRHGSGQRPDVQDPV 369
LDR S PDV PV
Sbjct: 939 YQLLDRIKS---PDVAYPV 954
>gi|221482136|gb|EEE20497.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3441
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/187 (40%), Positives = 110/187 (58%), Gaps = 27/187 (14%)
Query: 82 VFPFGSVPLKTYLPDGDIDLTAF------GGLNVEE---ALANDVCSVLEREDQNKAAEF 132
V+ +GS PL+T+LPDGD+D+ G L EE AL + +R + F
Sbjct: 369 VYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHKTF 428
Query: 133 VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILI 192
+++A L+ AEV+++KC+V I VD+S N++GG +L FLE DR IG++HLFKRS++LI
Sbjct: 429 PLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVLLI 488
Query: 193 KAWCYYESRILGAHHGLISTYALETLV------------------LYKFLDYFSKFDWDS 234
K+W YES +LG+ GL++TY +E LV LY+F Y+S F WD
Sbjct: 489 KSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHWDR 548
Query: 235 YCISLNG 241
Y ++ G
Sbjct: 549 YAVTACG 555
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 294 KHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNT 353
+ +N+VDPL NNL RSVS+ FYR+ A G + L +L+ + + L F N+
Sbjct: 887 RSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVLASGDAARFRRL--FLPNS 944
Query: 354 ---LDRHGSGQRPDVQDPV 369
LDR S PDV PV
Sbjct: 945 YQLLDRIKS---PDVAYPV 960
>gi|403357215|gb|EJY78230.1| hypothetical protein OXYTRI_24618 [Oxytricha trifallax]
Length = 831
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 172/360 (47%), Gaps = 58/360 (16%)
Query: 29 SNQTAIGAEYWQRAEEA-TQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLG----CEVF 83
SNQ ++ E AE+A + ++ PT SE +R + + V+ LI LG V
Sbjct: 11 SNQNSLQKE----AEDAFVNYFLNKIGPTQESERKRVKIFEQVKFLIEKALGGKSQVMVI 66
Query: 84 PFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFV---------V 134
+GS PLKTYLPD DID+T ++ N + ++ + + K E V
Sbjct: 67 RYGSDPLKTYLPDSDIDITVIRRDYLQGNQTNQLTALTQLKLIKKEIEIFGETQNGKNFV 126
Query: 135 KDAQLI-RAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIK 193
K LI +A+V+++K QN VDIS Q+GG+ TL F+ + + IGK L K+SIIL+K
Sbjct: 127 KSMVLIDQADVEIIKLNFQNTFVDISIKQVGGICTLYFMNYMAKRIGKQQLLKKSIILLK 186
Query: 194 AWCYYESRILGAHHGLISTYALETLVLY-----------------KFLDYFSKFDWDSYC 236
AW Y++ ILG+ ++TYAL +VL+ F +S FDW++
Sbjct: 187 AWFTYDASILGSQAACMATYALYVMVLFILNNFYDELNSPMDVIMMFFKVWSHFDWENNI 246
Query: 237 ISLNGPVRISSLPEVVVET---------------PENSGGDLLLSSEFLKECVEQFS--- 278
+++ GP++ S E + E E LL++ + L QFS
Sbjct: 247 VTIFGPIKSSGFYERLKECQFDIDRLTMLDRSLHQEYQYRKLLVTPDELSFLNLQFSGVR 306
Query: 279 ---VPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL 335
V S N +SF K+ NI+DP NNLG+S+SK N RI+ K+ I
Sbjct: 307 LSDVSSYNL-ANKKSFNTKYFNIIDPTFSKNNLGKSISKLNSSRIKQVLRLQNMKMRQIF 365
>gi|301093296|ref|XP_002997496.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110638|gb|EEY68690.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 782
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 120/420 (28%), Positives = 188/420 (44%), Gaps = 106/420 (25%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPF----GSVPLKTYLPDGDIDLTAF 104
+I + P+ V++ R+ V+ +VQR+I + P GS P+KTYLP D+D+
Sbjct: 251 LIEWMGPSDVADRVRQQVLSFVQRVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLL 310
Query: 105 GGLNVEEA----LANDVC---------SVLEREDQNKAAEF------------------- 132
+E + + +C +VL+ + +
Sbjct: 311 VPQELESSWYYIVTQALCVAGGSGGAGTVLDLGNSASSDVSGSSSPSGPAAASGGGPLLL 370
Query: 133 --VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSII 190
V++ I A+V++VKC V NI VD + N++G L + L+ + +G+ HLFK+S+I
Sbjct: 371 TNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAARVGRQHLFKKSLI 430
Query: 191 LIKAWCYYESR---------------------ILGAHHGLISTYALETLV---------- 219
LIKAWC +ESR ++GA HG +STYA+ T+V
Sbjct: 431 LIKAWCTHESRPFMQRASNEAGGSVPGSTPASVMGASHGALSTYAVNTIVMALFNQHGDA 490
Query: 220 -------LYKFLDYFSKFDWDSYCISLNGPV---RISSLP-------EVVVETPENSGGD 262
LY FLD ++F W ++L+GPV R++S P ++T + D
Sbjct: 491 LTHPLQALYLFLDRLAEFPWHECALTLHGPVPLSRLASTPLNGTTSYRSKLKTAKLDASD 550
Query: 263 LLLSSEFLKECVEQFSVP---SRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYR 319
+ + L + F S+G T FP + NIVDPL + NNL RSVS F
Sbjct: 551 VEAIRDTLADQFGAFDAALKSSKGTPTG--LFPIRACNIVDPLDDKNNLARSVSAEGFPV 608
Query: 320 IRSAFTYGARKLGHILSQPEESLTDE---LRK---------FFSN--TLDRHGSGQRPDV 365
++ AF +L +L+ P ++ D+ LR+ FFS L G G RPD+
Sbjct: 609 MKRAFRLARDQLAAMLA-PRSAIDDDAELLREESSKDVGVTFFSRCWQLYARGDGWRPDL 667
>gi|325189429|emb|CCA23919.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1193
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 188/389 (48%), Gaps = 71/389 (18%)
Query: 41 RAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL----GCEVFPFGSVPLKTYLPD 96
R E + + +I + PT +++ R ++ Y++ L+ FP GS P KTYLPD
Sbjct: 721 RVETSVKKLIHALSPTHEADQARCNILAYLRHLLELQFPRSSSILFFPTGSFPCKTYLPD 780
Query: 97 GDIDLTAFGGLNVEE------------ALANDVCSVLEREDQNKAAEF----------VV 134
D+D+ ++E A NDV + + ++ A V
Sbjct: 781 ADLDVCLLVPRSMEPTWFFSVVQMLCFAATNDVHAEPKHSLESVQAPSWMNSTSSTGNTV 840
Query: 135 KDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKA 194
++ I A+V++VKC + N+ VDI+ N++G L L L+ D +G+ HLFK+S++LIKA
Sbjct: 841 RNVTFINADVRVVKCTIDNVAVDITVNRVGALGALVLLDTFDLRVGRHHLFKQSLVLIKA 900
Query: 195 WCYYESR-------ILGAHHGLISTYALETLV-----------------LYKFLDYFSKF 230
WC + +LG+ +G STYA+ T+V L+ FLD ++F
Sbjct: 901 WCALDCLEGGQGCGVLGSKNGAFSTYAVNTMVMTLFNRWGYRIQHPLEALHLFLDIMTQF 960
Query: 231 DWDSYCISLNGPVRISSLPE-----VVVETPENSGGDLLLSSEFLKE---CVEQ----FS 278
W ++ GPV + L + +V E + + L++ E +++ C+ + F
Sbjct: 961 PWQECAWTIFGPVLFTQLYQNLSSRIVPPGWETASANCLITREDIEQIRVCLNEYFGSFD 1020
Query: 279 VPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQP 338
V S G +TN+ FP + N++DPL+ NNL RSV F ++ F G +L +LS+
Sbjct: 1021 V-SLGTETNA-VFPLRSFNMIDPLQLGNNLARSVLPEIFPSLQVMFRDGRDRLDRVLSEE 1078
Query: 339 EESLTDELRKFFSNTLDRHGSGQ--RPDV 365
+ + +FF ++ +G G RPD+
Sbjct: 1079 KTVM-----EFFKHSWKLYGRGDGWRPDL 1102
>gi|242051292|ref|XP_002463390.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
gi|241926767|gb|EER99911.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
Length = 208
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/125 (54%), Positives = 92/125 (73%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+V PT +E RR+ VI Y++RLI + LGCEVF FGSVPL+TYLPDGD+D+T G +
Sbjct: 83 RVHPTQEAERRRQDVISYLRRLIGSSLGCEVFAFGSVPLRTYLPDGDVDITVLGNTWLNS 142
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
+DV S+LE E +N AEF + I AEVKL+KC+++NI+VD+SFNQ+GG+ST CF
Sbjct: 143 TFIDDVRSMLESEQENCDAEFKLTGLHFINAEVKLIKCIIENIIVDVSFNQIGGVSTFCF 202
Query: 172 LEQVD 176
LE ++
Sbjct: 203 LELIN 207
>gi|348683529|gb|EGZ23344.1| hypothetical protein PHYSODRAFT_485178 [Phytophthora sojae]
Length = 793
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 119/439 (27%), Positives = 187/439 (42%), Gaps = 123/439 (28%)
Query: 42 AEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPF----GSVPLKTYLPDG 97
A +I + P+ ++ R+ V+ +VQ++I + P GS P+KTYLP
Sbjct: 248 AARQADALIEWMGPSDAADRVRQQVLSFVQQVITAHFPLAAAPLFFATGSYPMKTYLPGS 307
Query: 98 DIDLTAFGGLNVEEA----LANDVC---------SVLEREDQNKAAEF------------ 132
D+D+ +E + + +C +VL+ + + +
Sbjct: 308 DLDICLLVPQELESSWYFIVTQALCIAGGSGGAGTVLDVGNPGGSVDGSGSSSPSGPAVG 367
Query: 133 -----------VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGK 181
V++ I A+V++VKC V NI VD + N++G L + L+ + +G+
Sbjct: 368 SGSSGALLLTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAVRVGR 427
Query: 182 DHLFKRSIILIKAWCYYES-------------------------RILGAHHGLISTYALE 216
HLFK+S+ILIKAWC +ES ++GA HG +STYA+
Sbjct: 428 QHLFKKSLILIKAWCTHESSPFMQAASVECGGLGPSVVPGSTPTSVMGASHGALSTYAVN 487
Query: 217 TLV-----------------LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE--TPE 257
T+V LY FLD ++F W ++L+G V +S L + TP
Sbjct: 488 TIVMALFNQHGDALTHPLQALYLFLDRLAEFPWHEAALTLHGAVPLSRLATTPLNGTTPS 547
Query: 258 NS--------GGDLLLSSEFLKECV-EQFSVPSRGFDTNSRS--------FPPKHLNIVD 300
S GD+ E +++ + +QF FD RS FP + NIVD
Sbjct: 548 KSKLKAAKLDAGDV----EAIRDTLSDQFG----AFDAGLRSGKSAPTGLFPIRACNIVD 599
Query: 301 PLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQ------------PEESLTDELRK 348
PL + NNL RSVS F ++ AF +L +L+ EE+ +D
Sbjct: 600 PLDDKNNLARSVSAEGFPVMKRAFRLARDQLAAMLAPRTSHRDDDAELLSEETGSDVGMA 659
Query: 349 FFSNTLDRHGSGQ--RPDV 365
FFS +G G RPD+
Sbjct: 660 FFSRCWQLYGRGDGWRPDL 678
>gi|2651305|gb|AAB87585.1| hypothetical protein [Arabidopsis thaliana]
Length = 384
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/254 (37%), Positives = 122/254 (48%), Gaps = 66/254 (25%)
Query: 32 TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEV--------- 82
T I AE W AE Q I+ +QP ++E R +I +Q L+ LG EV
Sbjct: 26 TPIEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVRTFLLLLDE 85
Query: 83 -------------FPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKA 129
+ FGS+PLKTYLPDGDIDLT EE A VC VLE E N
Sbjct: 86 LSFSLQRIRNAKVYLFGSMPLKTYLPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGN-- 143
Query: 130 AEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSI 189
++ V Q ++A +VD+ G+DHLFK+SI
Sbjct: 144 SDLQVTGVQYVQA-------------------------------KVDKAFGRDHLFKKSI 172
Query: 190 ILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLP 249
IL+KAWC+YESRILGA+ GLISTYAL LVL +S SL+GP ++ L
Sbjct: 173 ILVKAWCFYESRILGANSGLISTYALAILVLNIVNMSYS---------SLSGP--LAKLR 221
Query: 250 EVVVETPENSGGDL 263
+V+ EN G L
Sbjct: 222 DVLTLPGENVGWKL 235
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 30/40 (75%)
Query: 330 KLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPV 369
KL +L+ P E++ +L KFF+ +L+R+G GQR DV++PV
Sbjct: 219 KLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPV 258
>gi|357491469|ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
gi|355517357|gb|AES98980.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
Length = 490
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 126/398 (31%), Positives = 177/398 (44%), Gaps = 89/398 (22%)
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRY 374
GNFYRIRSAF YGARKLG IL PE+ + DEL +FF+NTLDRHGS Q +
Sbjct: 11 GNFYRIRSAFKYGARKLGWILMLPEDRIADELNRFFANTLDRHGSNQG---------NED 61
Query: 375 NGFGVSSTFSGTELCREDQTIYESEPNSSGITENCRIDDEAELCGGVGKIKVSGMESSYC 434
N F ST S ++ +Q YE R + E + + ++ S +S
Sbjct: 62 NSFLCLSTGS-KDMITGNQHNYE-----------IRSERERYVVKDISSLEGSSFDS--- 106
Query: 435 RTINEPHNSGNGTAVSETRLSGDAKDLATSKNLNLVISNETSKCSSLSGE-ESKARHAPH 493
SG+G V+ +L D+K +ATS L + +N S CS+ E S + +
Sbjct: 107 --------SGDGNTVAIYKLGEDSKHVATSGVLGIASTNGFSHCSNGKAESRSCSETDVN 158
Query: 494 LYFSSSTMGNGEIRNGNSEWKQQLNSGSAEKNVTSGILPTHYKETGLILLNGQDENQLDV 553
F +G + N S + EKN+ S G +L N
Sbjct: 159 SIFDDEKEKHGMVSN-------SPRSHTDEKNMAS---------NGSTVLRDAANN---- 198
Query: 554 NHGASSPVESNHHPSLMSTIPWSTEEFNFSYSGYHASPRTVGSPRAANSLSDLSGDYESH 613
+ + ++ +N S SG A+ SL DL+GDY+SH
Sbjct: 199 ----------------LENGFFHSDRYNNSVSG---------GTEASKSLLDLAGDYDSH 233
Query: 614 QISLNHVWWWYEHALNS----SYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSAN 669
+L+ Y H N SP +F ++NSW+ +++ L +I PQ ++N
Sbjct: 234 IANLH-----YGHMCNGYPVSPVVVPSPPRSPKFHNRNSWETVRQCLQMNHSIHPQTNSN 288
Query: 670 GAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPN 707
G V PL Y + P +P ASFG EE K RGTG YFPN
Sbjct: 289 GVVG-PL-YLVNHPTIPMASFGAEEKRKPRGTGAYFPN 324
>gi|308163112|gb|EFO65472.1| Hypothetical protein GLP15_5146 [Giardia lamblia P15]
Length = 719
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 124/256 (48%), Gaps = 49/256 (19%)
Query: 31 QTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGSVP 89
+ A A+ + + T I++ V P SEE R + ++ R+I L + P+GS
Sbjct: 35 EAARVAQTLEALSQRTDYIVSLVSPDKASEEFRLKIFTFISRVIEAVLPNTLIVPYGSFI 94
Query: 90 LKTYLPDGDIDLTAFG---------------------------GLNVEEALANDVCSVLE 122
K YLP D+D+ + G+ V A++ + S +
Sbjct: 95 SKIYLPSSDLDICCYNHGLDEIPLLQKILEALTIFSDPSLRPTGVRVSPAVSQLINSRIS 154
Query: 123 REDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKD 182
E++ +++ + I A+V L+KC V + VDIS Q G L T +E++ + IG++
Sbjct: 155 AEER-----LELENIEFIMAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRN 209
Query: 183 HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV----------------LYKFLDY 226
+L KRS +LI++WC YE+RI+G H ++S+YAL +V LY FL Y
Sbjct: 210 NLLKRSFLLIQSWCLYEARIVGGHSQMLSSYALRVMVINILLNCKDIYTPFQALYVFLAY 269
Query: 227 FSKFDWDSYCISLNGP 242
+S FD+D + +GP
Sbjct: 270 YSTFDYDKNIVHPSGP 285
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 6/79 (7%)
Query: 286 TNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQP---EESL 342
++SR F P +++IVDPL+ NNLGRSVS+ NF RI +F L I+ ++
Sbjct: 582 SDSRVFLPSYISIVDPLQVINNLGRSVSEPNFMRITRSFQTAHIVLSDIVQMCITGSMTI 641
Query: 343 TDELRK---FFSNTLDRHG 358
T+ L + FF +TL G
Sbjct: 642 TEALAEYDCFFISTLSIFG 660
>gi|297600524|ref|NP_001049344.2| Os03g0210800 [Oryza sativa Japonica Group]
gi|255674303|dbj|BAF11258.2| Os03g0210800 [Oryza sativa Japonica Group]
Length = 871
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 78/116 (67%), Gaps = 1/116 (0%)
Query: 251 VVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS-FPPKHLNIVDPLKENNNLG 309
+ E P +LLLS FL +C ++V R ++ + F KH N++DPL+ NNNLG
Sbjct: 1 MTAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60
Query: 310 RSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
RSVSKGNF+RIRSAF++GA++L +L P+E L E+ +FF+NT RHGSG RPD
Sbjct: 61 RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 116
>gi|159115240|ref|XP_001707843.1| Hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
gi|157435951|gb|EDO80169.1| hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
Length = 731
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 123/252 (48%), Gaps = 49/252 (19%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGSVPLKTYL 94
A+ + + T I++ V P SEE R + ++ ++I L + P+GS K YL
Sbjct: 55 AQALEALSQRTDYIVSLVSPDKASEEFRLKIFTFISKVIEAVLPNTLIVPYGSFISKIYL 114
Query: 95 PDGDIDLTAFG---------------------------GLNVEEALANDVCSVLEREDQN 127
P D+D+ F G+ V A++ + S + E++
Sbjct: 115 PSSDLDICCFNHGLDEIPLLQKILEALTVFSDPSLRPTGVRVPPAVSQLINSRIPTEER- 173
Query: 128 KAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKR 187
+++ + I A+V L+KC V + VDIS Q G L T +E++ + IG+++L KR
Sbjct: 174 ----LELENIEFIMAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKR 229
Query: 188 SIILIKAWCYYESRILGAHHGLISTYALETLV----------------LYKFLDYFSKFD 231
S +LI++WC YE+RI+G H ++S+YAL +V LY FL Y+S FD
Sbjct: 230 SFLLIQSWCLYEARIVGGHSQMLSSYALRVMVINILLNCRDIYTPFQALYVFLAYYSSFD 289
Query: 232 WDSYCISLNGPV 243
+D + +GP+
Sbjct: 290 YDRDIVHPSGPL 301
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 6/79 (7%)
Query: 286 TNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL---SQPEESL 342
++SR+F P +++IVDPL+ NNLGRSVS+ NF RI +F L I+ + ++
Sbjct: 594 SDSRAFLPSYISIVDPLQVINNLGRSVSEPNFMRITRSFQTAHIVLSDIVQMCTTGNMTI 653
Query: 343 TDELRK---FFSNTLDRHG 358
T+ L + FF +TL G
Sbjct: 654 TEALAEYDCFFISTLSIFG 672
>gi|222624434|gb|EEE58566.1| hypothetical protein OsJ_09878 [Oryza sativa Japonica Group]
Length = 1064
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 77/114 (67%), Gaps = 1/114 (0%)
Query: 253 VETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS-FPPKHLNIVDPLKENNNLGRS 311
E P +LLLS FL +C ++V R ++ + F KH N++DPL+ NNNLGRS
Sbjct: 3 AEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRS 62
Query: 312 VSKGNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
VSKGNF+RIRSAF++GA++L +L P+E L E+ +FF+NT RHGSG RPD
Sbjct: 63 VSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 116
>gi|253742434|gb|EES99267.1| Hypothetical protein GL50581_3482 [Giardia intestinalis ATCC 50581]
Length = 711
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 120/247 (48%), Gaps = 39/247 (15%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGSVPLKTYL 94
A+ + + T II+ V P SEE R + ++ ++I L + P+GS K YL
Sbjct: 40 AQALEALSQRTDYIISLVSPDRASEEFRLKIFTFISKVIDVVLPNTLIVPYGSFISKIYL 99
Query: 95 PDGDIDLTAFGG--------LNVEEAL--------------ANDVCSVLEREDQNKAAEF 132
P D+D+ + + EAL + V S L +
Sbjct: 100 PSSDLDICCYNHSIDEIPLLQKILEALMVFSDPNLQSTGTRVSPVVSQLINSHISADERL 159
Query: 133 VVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILI 192
+++ + I A+V L+KC V + VDIS Q G L T +E++ + IG+++L KRS +LI
Sbjct: 160 ELENIEFIMAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLI 219
Query: 193 KAWCYYESRILGAHHGLISTYALETL----------------VLYKFLDYFSKFDWDSYC 236
++WC YE+RI+G H ++S+YAL + VLY FL Y+S FD+D
Sbjct: 220 QSWCLYEARIVGGHSQMLSSYALRVMIINILINCKDIYTPFQVLYVFLAYYSNFDYDRNI 279
Query: 237 ISLNGPV 243
I +GP+
Sbjct: 280 IHPSGPL 286
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 6/91 (6%)
Query: 286 TNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQP---EESL 342
++S+ F P +++IVDPL+ NNLGRSVS+ NF RI +F L I+ ++
Sbjct: 572 SDSQVFLPSYISIVDPLQVTNNLGRSVSEPNFMRITRSFQTAHTVLSDIVQMCITGGTTI 631
Query: 343 TDELRK---FFSNTLDRHGSGQRPDVQDPVP 370
T+ L + FF +TL G + VP
Sbjct: 632 TEALAEYDCFFISTLSIFGDAHAEKLHGDVP 662
>gi|159108047|ref|XP_001704297.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
gi|157432356|gb|EDO76623.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
Length = 512
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 113/207 (54%), Gaps = 18/207 (8%)
Query: 55 PTVVSEERRKAVIDYVQ-RLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN-VEEA 112
PT S R +I Y++ +L + ++ P+GS + +LPDGDIDL G + +
Sbjct: 47 PTEDSITCRYQIIKYIRDKLHSLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAADV 106
Query: 113 LANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFL 172
LA + E ++ F + + I+AEV +++ ++ + +DIS + GL T +L
Sbjct: 107 LAQFYIYLKEVAASHEDTPFKLTNLSKIQAEVPIIRLVINGVFIDISSARPVGLVTSLYL 166
Query: 173 EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-------------- 218
+ ++ IG+++L KRS+ILI+AWC YE+ ILG+H ++++YAL +
Sbjct: 167 QLLNDAIGRNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTTFILTNSPELVHP 226
Query: 219 --VLYKFLDYFSKFDWDSYCISLNGPV 243
VL+KF ++S FD+ + I+ G V
Sbjct: 227 LQVLFKFFAFYSAFDFTNNTITAFGVV 253
Score = 43.1 bits (100), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 39/70 (55%), Gaps = 6/70 (8%)
Query: 291 FPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL------SQPEESLTD 344
+ P ++I+DP++ +NNL ++ S +F R+++A G +LG ++ + P +
Sbjct: 393 YHPTTISILDPVQPSNNLAKATSAASFARLKAALRTGYFQLGSLMMMLCDKTAPLSVIQT 452
Query: 345 ELRKFFSNTL 354
E FSNTL
Sbjct: 453 EFDLLFSNTL 462
>gi|308159127|gb|EFO61675.1| Topoisomerase I-related protein [Giardia lamblia P15]
Length = 512
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 113/213 (53%), Gaps = 26/213 (12%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+ PT S R +I Y++ + + ++ P+GS + +LPDGDIDL G E
Sbjct: 45 LAPTEDSITYRYQIIKYIRDKLHDLFPELQLIPYGSFVTRIFLPDGDIDLAIIVG----E 100
Query: 112 ALANDVCSVLEREDQNKAAE-----FVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGL 166
A DV + ++ A F V + I+AEV +++ ++ I +DIS + GL
Sbjct: 101 DDAADVLTQFYIHLKDIVASQEDTPFRVTNLSKIQAEVPIIRLVINGIFIDISSARPVGL 160
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL-------- 218
T +L+ ++ IG+++L KRS+ILI+AWC YE+ ILG+H ++++YAL +
Sbjct: 161 VTSLYLQLLNDAIGRNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTIFILTNS 220
Query: 219 --------VLYKFLDYFSKFDWDSYCISLNGPV 243
VL+KF ++S FD+ + I+ G +
Sbjct: 221 PELVHPLQVLFKFFAFYSAFDFTNNTITAFGVI 253
>gi|298707565|emb|CBJ30149.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
Length = 1301
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 126/291 (43%), Gaps = 75/291 (25%)
Query: 27 VPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFG 86
+PSN+ + + +R ++ ++ ++P +E R++V +V R ++ LG + FP G
Sbjct: 606 LPSNKQPLAID--RRVDD----LLRLLRPAPRAEGYRRSVFRFVTRQVKRALGAQCFPVG 659
Query: 87 SVPLKTYLPDGDI----------------------------------------------- 99
++ YLPD ++
Sbjct: 660 GYAIQAYLPDEEVGISAFLCHGQEKSWFVRVNETLCKVSSEASEEAAEEEGSGTSVGGCP 719
Query: 100 -DLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIR-AEVKLVKCLVQN-IVV 156
+TA G + D R+++ + + + I V+ +KC+V N + V
Sbjct: 720 EKITAVGEGATPKQEGGDGGGAAIRKEEGSSYRHRLSNVNFINMGRVQKIKCVVDNQVAV 779
Query: 157 DISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGL--ISTYA 214
DI NQ+G ++T+ LE+ D+L+GKDHLFKRS++LIK+W YESR + L I+ A
Sbjct: 780 DIGANQVGDIATVALLEETDQLLGKDHLFKRSLLLIKSWWVYESRAYTGSNMLSRITESA 839
Query: 215 LETLVLYK-----------------FLDYFSKFDWDSYCISLNGPVRISSL 248
L T+VL F S FDW YC + GP R+ +L
Sbjct: 840 LATMVLAVVNQHHARLHTPLQVMALFFQMHSHFDWSRYCWCIEGPRRLDTL 890
>gi|224064218|ref|XP_002301405.1| predicted protein [Populus trichocarpa]
gi|222843131|gb|EEE80678.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 91/145 (62%), Gaps = 6/145 (4%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSS-SSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG+ W+ P+G P+ + PS + +E W +AEE T +IA +QP S
Sbjct: 1 MGEHEGWAQPPSGL----SPNGLLAIEAPSVIRVLDSERWSKAEERTAELIACIQPNQPS 56
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVC 118
EE R AV DYVQRLI C+VF FGSVPLKTYLPDGDIDLTAF N+++ A+ V
Sbjct: 57 EELRNAVADYVQRLIAKCFPCQVFTFGSVPLKTYLPDGDIDLTAFSKNPNLKDTWAHQVR 116
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAE 143
+LE E++N+ AEF VK+ Q I+AE
Sbjct: 117 DMLENEEKNENAEFRVKEVQYIQAE 141
>gi|224127915|ref|XP_002320195.1| predicted protein [Populus trichocarpa]
gi|222860968|gb|EEE98510.1| predicted protein [Populus trichocarpa]
Length = 145
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 91/145 (62%), Gaps = 6/145 (4%)
Query: 1 MGDLRDWSPEPNGAV-FGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVS 59
MG+ W+ P+G + G P ++S + + + W +AEE T +I +QP S
Sbjct: 1 MGEHEGWAQPPSGLIPNGLLPEEAASVI----RVLDLDRWSKAEERTAELIDCIQPNQPS 56
Query: 60 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVC 118
EE R AV DYVQRLI C+VF FGSVPLKTYLPDGDIDLTAF N+++ A+ V
Sbjct: 57 EELRNAVADYVQRLILKCFPCQVFTFGSVPLKTYLPDGDIDLTAFSKNPNLKDTWAHQVR 116
Query: 119 SVLEREDQNKAAEFVVKDAQLIRAE 143
+LE E++N+ AEF VK+ Q I+AE
Sbjct: 117 DMLENEEKNENAEFRVKEVQYIQAE 141
>gi|253744327|gb|EET00549.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
Length = 511
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 111/203 (54%), Gaps = 26/203 (12%)
Query: 63 RKAVIDYVQRLIRNYL-GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCS-- 119
R +I Y++ + + ++ P+GS + +LPDGD+DL+ V E ANDV S
Sbjct: 55 RYQIIKYIRDELHSIFPELQLIPYGSFVTRIFLPDGDVDLSII----VAEDDANDVFSQF 110
Query: 120 ---VLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVD 176
+ E + A F + + I+AE+ +++ ++ I +DIS + GL T +++ ++
Sbjct: 111 YTHLKEIASSQEHATFKITNLSKIQAEMSIIRLVINGIFIDISAARPTGLVTSLYIQLLN 170
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL----------------VL 220
IG+++L KRS+IL++AW YE+ ILG+H ++++YAL + VL
Sbjct: 171 DSIGRNNLLKRSVILVQAWSLYEAHILGSHSQMLNSYALRVMTAFILTNSPELVHPLQVL 230
Query: 221 YKFLDYFSKFDWDSYCISLNGPV 243
+KF ++S FD+ + I+ G +
Sbjct: 231 FKFFAFYSTFDFTNNTITAFGVI 253
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 283 GFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL------S 336
F ++S + P ++I+DP+ NNL ++ S +F R+R+A G +LG ++ +
Sbjct: 384 SFPSSSSVYHPTIISILDPVHPCNNLAKATSAASFARLRAALRMGYLQLGALMMLLCDKT 443
Query: 337 QPEESLTDELRKFFSNTL 354
P ++ E FSNT
Sbjct: 444 APLSAIQAEFDLLFSNTF 461
>gi|261333426|emb|CBH16421.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1120
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 127/291 (43%), Gaps = 90/291 (30%)
Query: 76 NYLGCEVFPFGSVPLKTYLPDGDIDLT--AFGGLN------VEEALAN------------ 115
NY + FGS+ T LPDGD D+T G LN EA A+
Sbjct: 386 NYGEKRYYVFGSLASGTVLPDGDNDMTIEVDGLLNPTKIETQSEAHADSSDGAAGSSCSS 445
Query: 116 -------DVCSVLEREDQNKAAEFVVKDAQ------LIRAEVKLVKCLVQNIVVDISFNQ 162
+V E + A+++ ++ + ++ AEV+++K ++ DI+ Q
Sbjct: 446 SISGTSSQATTVAGGELLSSVADYLRENNKSVYVDTVVVAEVRVLKLVMDGSSYDITVGQ 505
Query: 163 LGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL-- 220
LGG+S + FL ++D IG +HL KR+++L+KAWC YE+ +L G IS+YA +++
Sbjct: 506 LGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYEAHVLSGQGGYISSYAATVMIISM 565
Query: 221 -----------------------------------------YKFLDYFSKFDWDSYCISL 239
+FL YFS FD++SYC++L
Sbjct: 566 INTVEFLEDVEREERGGEGDGKHLDERQRGEYQHISPLQLFARFLKYFSYFDFESYCLTL 625
Query: 240 NGPV---RISSLP------EVVVETPENSGGDLLL-----SSEFLKECVEQ 276
GPV +I+++P E VE + GG + E L C+ +
Sbjct: 626 FGPVPCDKINNVPLDLDLVESQVEHFQQPGGSAVFGLTAEGQEALGHCLRR 676
>gi|71748824|ref|XP_823467.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70833135|gb|EAN78639.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1120
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 127/291 (43%), Gaps = 90/291 (30%)
Query: 76 NYLGCEVFPFGSVPLKTYLPDGDIDLT--AFGGLN------VEEALAN------------ 115
NY + FGS+ T LPDGD D+T G LN EA A+
Sbjct: 386 NYGEKRYYVFGSLASGTVLPDGDNDMTIEVDGLLNPTKIETQSEAHADSSDGAAGSSCSS 445
Query: 116 -------DVCSVLEREDQNKAAEFVVKDAQ------LIRAEVKLVKCLVQNIVVDISFNQ 162
+V E + A+++ ++ + ++ AEV+++K ++ DI+ Q
Sbjct: 446 SISGTSSQATTVAGGELLSSVADYLRENNKSVYVDTVVVAEVRVLKLVMDGSSYDITVGQ 505
Query: 163 LGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL-- 220
LGG+S + FL ++D IG +HL KR+++L+KAWC YE+ +L G IS+YA +++
Sbjct: 506 LGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYEAHVLSGQGGYISSYAATVMIISM 565
Query: 221 -----------------------------------------YKFLDYFSKFDWDSYCISL 239
+FL YFS FD++SYC++L
Sbjct: 566 INTVEFLEDVEREERGGEGDGKHLEERQRGEYQHISPLQLFARFLKYFSYFDFESYCLTL 625
Query: 240 NGPV---RISSLP------EVVVETPENSGGDLLL-----SSEFLKECVEQ 276
GPV +I+++P E VE + GG + E L C+ +
Sbjct: 626 FGPVPCDKINNVPLDLDLVESQVEHFQQPGGSAVFGLTAEGQEALGHCLRR 676
>gi|224135259|ref|XP_002322023.1| predicted protein [Populus trichocarpa]
gi|222869019|gb|EEF06150.1| predicted protein [Populus trichocarpa]
Length = 85
Score = 93.2 bits (230), Expect = 4e-16, Method: Composition-based stats.
Identities = 45/67 (67%), Positives = 52/67 (77%), Gaps = 9/67 (13%)
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLN 240
++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY F + +KF
Sbjct: 1 QNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNKF---------A 51
Query: 241 GPVRISS 247
GP+ +S+
Sbjct: 52 GPLEVST 58
>gi|298710234|emb|CBJ26309.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1317
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 82/140 (58%), Gaps = 25/140 (17%)
Query: 134 VKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIK 193
+++ LI A +V +V N+VVD++ NQ G ++ LE+ D LI ++HLFKRS++L+K
Sbjct: 117 IRNVSLINARTPIVTMVVGNVVVDLTENQGGSVAASALLEEADNLIQRNHLFKRSLLLLK 176
Query: 194 AWCYYES------RILGAHHGLISTYALETLVLY-------------------KFLDYFS 228
AW + E+ R+LGA G +++Y L +VL+ +F + +S
Sbjct: 177 AWAWCETPRLVGNRVLGARKGGLTSYGLSVMVLHLFAASASADALVHPLDVLIRFFEVYS 236
Query: 229 KFDWDSYCISLNGPVRISSL 248
+FDW YC++L+GPV + S+
Sbjct: 237 EFDWARYCLTLDGPVPLESV 256
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 30/53 (56%), Gaps = 2/53 (3%)
Query: 291 FPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG--HILSQPEES 341
FP + NI DPL NNLG SV+K N ++ A G +KL H++S P S
Sbjct: 345 FPRRDCNIQDPLNALNNLGHSVTKNNLKALKRALQQGQKKLEAWHLVSHPSPS 397
>gi|342184813|emb|CCC94295.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 1108
Score = 92.4 bits (228), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 104/245 (42%), Gaps = 80/245 (32%)
Query: 85 FGSVPLKTYLPDGDIDLT--AFGGLN-----------------VEEALANDVCSVLERED 125
FGS+ T LPDGD D+T G LN + ND S
Sbjct: 383 FGSLATGTVLPDGDNDITIEVDGLLNPAKIEIQGEAQNSFPNGAATSSCNDSISATSSSH 442
Query: 126 QNKAA--EFVVKDAQLIR-------------AEVKLVKCLVQNIVVDISFNQLGGLSTLC 170
A E + + A +R AEV+++K ++ D++ QLGG+S +
Sbjct: 443 ATAIAGGELLSEIADYLRENNASVYVDTVVVAEVRVLKLVMDGSSYDVTVGQLGGVSCIR 502
Query: 171 FLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL---------- 220
FL ++D +G +HL KR+++L+KAWC YE+ +L G +S+YA +++
Sbjct: 503 FLHEMDMRVGCEHLLKRTLLLMKAWCCYEAHVLSGQGGYMSSYAATVMLITMINTVEFLE 562
Query: 221 ---------------------------------YKFLDYFSKFDWDSYCISLNGPV---R 244
+FL Y+S FD+D YC++L GPV R
Sbjct: 563 DVEAEGSDGKTCSNCPEGHKSEGHVQISPLQLFARFLKYYSYFDFDRYCLTLFGPVPCDR 622
Query: 245 ISSLP 249
++ +P
Sbjct: 623 VNQIP 627
>gi|224114896|ref|XP_002316887.1| predicted protein [Populus trichocarpa]
gi|222859952|gb|EEE97499.1| predicted protein [Populus trichocarpa]
Length = 199
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/79 (63%), Positives = 63/79 (79%), Gaps = 1/79 (1%)
Query: 97 GDIDLTAFG-GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIV 155
GDIDLTAF N+++ A VC +LE E+ N+ AEF VK+ + I+AEVK++KCLV+NIV
Sbjct: 18 GDIDLTAFSENPNLKDTWAPQVCDMLENEENNENAEFGVKEVEYIQAEVKIIKCLVENIV 77
Query: 156 VDISFNQLGGLSTLCFLEQ 174
VDISFNQLGGL TLCFLE+
Sbjct: 78 VDISFNQLGGLFTLCFLEK 96
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 31/50 (62%)
Query: 265 LSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSK 314
LS FL+ C ++V G D + F KH N++DPL+ NNNLG SV+K
Sbjct: 106 LSKLFLEACSAIYAVLPAGQDNQGQPFLSKHFNVIDPLRINNNLGHSVNK 155
>gi|452823525|gb|EME30535.1| nucleotidyltransferase [Galdieria sulphuraria]
Length = 1412
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/388 (23%), Positives = 146/388 (37%), Gaps = 105/388 (27%)
Query: 41 RAEEATQGIIAQVQ-------PTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTY 93
RAE I +V PT SE RR+AV V +I+ +G + F +GS KTY
Sbjct: 689 RAERTEDAICVRVNRFLDVCVPTSFSELRREAVFRVVASIIKRSIGAQAFCYGSFATKTY 748
Query: 94 LPDGDIDLTAF----------------GGLNVEEALANDVCS------VLEREDQNKAAE 131
D +++ AF L + LA+D S L Q
Sbjct: 749 HADSILEIGAFLVGKNDTAAEWSAKLMAALCEDATLASDHSSSSLEFSYLSLIQQKHPVP 808
Query: 132 FVVKDAQLIRAEVKLVKC--------------------------------LVQNIVVDIS 159
V++ R + C + N+ V ++
Sbjct: 809 LPVRNISYFRPKPTPSGCQPPPAVTFTVNWPIEDPRSGLVALDTNSTERDIAPNVRVSVT 868
Query: 160 FNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV 219
N + G+ T C LE+ D +G++HLFKRS++L++ W Y ++ ++ + A+E LV
Sbjct: 869 LNHVAGIHTACVLEEFDHAMGRNHLFKRSLLLVRTWVDYGVKLT----DILPSRAVEVLV 924
Query: 220 -----------------LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPEN---- 258
LY+FL YF FDW + + G + +++ + EN
Sbjct: 925 VFVANCFHSSIETPFDLLYRFLTYFVHFDWRKFGLCETGIIDLATGQRKQPISSENYLFP 984
Query: 259 SGGDLLL----SSEFLKECV--EQFSVPSRGFDTNSRSFPPKHLNIVDPL--KENNNLGR 310
++LL ++E +CV EQF V P + LNI D + K N+
Sbjct: 985 PNAEVLLYHRTTTEAETQCVTEEQFEV-----------VPLEALNIFDHVSWKHYRNICV 1033
Query: 311 SVSKGNFYRIRSAFTYGARKLGHILSQP 338
++ + + A G R S P
Sbjct: 1034 DSTEMDIVAFQKAVNKGLRDAELFRSAP 1061
>gi|398013931|ref|XP_003860157.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498376|emb|CBZ33450.1| hypothetical protein, conserved [Leishmania donovani]
Length = 2047
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 64/104 (61%), Gaps = 2/104 (1%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K ++ DI+ Q GG++ + FL ++D +IG H+ KR+++L+KAWC Y
Sbjct: 1057 LVMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCY 1116
Query: 199 ESRILGAHHGLISTYALETLVLYKFLDYFSKF-DWDSYCISLNG 241
E+ ILG G I +YA T++L L+ D D+ I +G
Sbjct: 1117 EAHILGGQAGYIGSYA-ATVMLISMLNTVEFLEDADAGQIDSDG 1159
Score = 39.7 bits (91), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
N+ FP + +N++DPL+ ++N+ R V + + RI+ AF G R L
Sbjct: 1437 NTAVFPVRDMNVLDPLRWSSNMVRGVCRNHLQRIQRAFLEGLRLL 1481
>gi|339897903|ref|XP_001464956.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399300|emb|CAM67197.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 2047
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 64/104 (61%), Gaps = 2/104 (1%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K ++ DI+ Q GG++ + FL ++D +IG H+ KR+++L+KAWC Y
Sbjct: 1057 LVMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCY 1116
Query: 199 ESRILGAHHGLISTYALETLVLYKFLDYFSKF-DWDSYCISLNG 241
E+ ILG G I +YA T++L L+ D D+ I +G
Sbjct: 1117 EAHILGGQAGYIGSYA-ATVMLISMLNTVEFLEDADAGQIDSDG 1159
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
N+ FP + +N++DPL+ ++N+ R V + + RI+ AF G R L
Sbjct: 1437 NTAVFPVRDMNVLDPLRWSSNMVRGVCRNHLQRIQRAFLEGLRLL 1481
>gi|389601018|ref|XP_001564070.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504611|emb|CAM38122.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 2016
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 67/107 (62%), Gaps = 2/107 (1%)
Query: 120 VLER-EDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
VL R D ++A+ V L+ AEV+++K ++ DI+ Q GG++ + FL ++D +
Sbjct: 1034 VLGRVRDYLRSAKTPVFVDSLVMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAV 1093
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLD 225
IG H+ KR+++L+KAWC YE+ ILG G I +YA T++L L+
Sbjct: 1094 IGDQHVLKRTLLLLKAWCCYEAHILGGQAGYIGSYA-ATVMLISMLN 1139
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 29/45 (64%)
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
N+ FP + +N++DPL+ ++N+ R V + + RIR AF G R L
Sbjct: 1433 NTAVFPVRDMNVLDPLRWSSNMVRGVCRNHLQRIRRAFLEGLRLL 1477
>gi|157868001|ref|XP_001682554.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126008|emb|CAJ04245.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 1964
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 58/87 (66%), Gaps = 1/87 (1%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K ++ DI+ Q GG++ + FL ++D +IG H+ KR+++L+KAWC Y
Sbjct: 970 LVMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCY 1029
Query: 199 ESRILGAHHGLISTYALETLVLYKFLD 225
E+ ILG G I +YA T++L L+
Sbjct: 1030 EAHILGGQAGYIGSYA-ATVMLISMLN 1055
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
N+ FP + +N++DPL+ ++N+ R V + + RI+ AF G R L
Sbjct: 1352 NTAVFPVRDMNVLDPLRWSSNMVRGVCRNHLQRIQRAFLEGLRLL 1396
>gi|401419332|ref|XP_003874156.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490390|emb|CBZ25650.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 2020
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 58/87 (66%), Gaps = 1/87 (1%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K ++ DI+ Q GG++ + FL ++D +IG H+ KR+++L+KAWC Y
Sbjct: 1050 LVMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCY 1109
Query: 199 ESRILGAHHGLISTYALETLVLYKFLD 225
E+ ILG G I +YA T++L L+
Sbjct: 1110 EAHILGGQAGYIGSYA-ATVMLISMLN 1135
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
N+ FP + +N++DPL+ ++N+ R V + + RI+ AF G R L
Sbjct: 1433 NTAVFPVRDMNVLDPLRWSSNMVRGVCRNHLQRIQRAFLEGLRLL 1477
>gi|340057832|emb|CCC52183.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 1145
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 124/316 (39%), Gaps = 93/316 (29%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLP 95
A Q A EATQ AQ++ ++ +A+ +R Y + FGS+ +T LP
Sbjct: 372 APLSQNALEATQHRRAQLR--RLAGYLNEAINHVARRKGVKYGKVRYYVFGSLATRTVLP 429
Query: 96 DGD----------IDLTAFGGLNVEEALANDV-----CS------VLEREDQNKAAEFVV 134
DGD +D G ++ + D CS + E +
Sbjct: 430 DGDNDITIDIDGLLDPVKVGPQGDTQSTSQDGGEASGCSSEFSGASPAQAAAIAGGELLS 489
Query: 135 KDAQLIR-------------AEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGK 181
A +R AEV++ K ++ D++ QLGG+S + FL QVD IG
Sbjct: 490 NIADYLRENNDSVFVDAVVVAEVRVAKLIMDGNSYDVTVGQLGGVSCIRFLHQVDTKIGC 549
Query: 182 DHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL--------------------- 220
HL KR+++L+KAWC YE+ +L G +S+YA +++
Sbjct: 550 GHLLKRTLLLMKAWCCYEAHVLSGQGGYMSSYAATVMLIAMINTIEFLEDAESEACTELE 609
Query: 221 ------------------------YKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETP 256
+FL YFS FD++ YC++L GPV P
Sbjct: 610 EPARTHALEGRLGALNGVSPLQLFARFLKYFSCFDFERYCVTLFGPV------------P 657
Query: 257 ENSGGDLLLSSEFLKE 272
D L ++ LKE
Sbjct: 658 CEKINDAFLDADVLKE 673
>gi|224064842|ref|XP_002301578.1| predicted protein [Populus trichocarpa]
gi|222843304|gb|EEE80851.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 73.2 bits (178), Expect = 5e-10, Method: Composition-based stats.
Identities = 34/47 (72%), Positives = 40/47 (85%)
Query: 132 FVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178
F VK + I+AEVK++KCLV+NIVVDISFNQLGGL TLCFLE+V L
Sbjct: 13 FRVKKVEYIQAEVKIIKCLVKNIVVDISFNQLGGLFTLCFLEKVSAL 59
>gi|224135265|ref|XP_002322024.1| predicted protein [Populus trichocarpa]
gi|222869020|gb|EEF06151.1| predicted protein [Populus trichocarpa]
Length = 122
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 67/148 (45%), Gaps = 41/148 (27%)
Query: 1 MGDLRDWSP----EPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPT 56
MG L W PNG + E V S A+ E W AEE T +IA +QP
Sbjct: 1 MGGLEGWVQPSGFSPNGLLPNE--------VASVTQALEPERWATAEERTAELIACIQPN 52
Query: 57 VVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALAND 116
SEERR AV+ YVQRLI N C+ E AN+
Sbjct: 53 QPSEERRNAVLCYVQRLIMNCFPCQ-----------------------------ETWANE 83
Query: 117 VCSVLEREDQNKAAEFVVKDAQLIRAEV 144
V +LE E++N+ AEF VK+ Q I+AEV
Sbjct: 84 VRDILEHEEKNENAEFHVKEVQYIQAEV 111
>gi|449533401|ref|XP_004173664.1| PREDICTED: uncharacterized LOC101209112 [Cucumis sativus]
Length = 831
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 38/51 (74%)
Query: 315 GNFYRIRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRHGSGQRPDV 365
GNF+RIRSAF +GA++L + P E + EL +FF NT +RHGSGQRPDV
Sbjct: 5 GNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERHGSGQRPDV 55
>gi|2642156|gb|AAB87123.1| hypothetical protein [Arabidopsis thaliana]
Length = 474
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 131/310 (42%), Gaps = 52/310 (16%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDIDLTA 103
Q I+ ++PT + R VID ++ ++++ G V PFGS + GD+D++
Sbjct: 12 QEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDISV 71
Query: 104 ---------FGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--Q 152
F G ++ L + L +A+ K +I A V ++K + Q
Sbjct: 72 DLFSGSSILFTGKKQKQTLLGHLLRAL------RASGLWYKLQFVIHARVPILKVVSGHQ 125
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
I DIS + L GL FL + + G+ F+ ++L+K W + I + G ++
Sbjct: 126 RISCDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKTGTFNS 181
Query: 213 YALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKE 272
Y+L LV++ F C+ + LP + V P+++ DL + +E
Sbjct: 182 YSLSLLVIFHF----------QTCVP-------AILPPLRVIYPKSAVDDLTGVRKTAEE 224
Query: 273 CVEQFSVPS-------RGFDTNSRSFPPKHLN----IVDPLKENNNLGRSVSKGNFYRIR 321
+ Q + + R N S ++ + DP ++ N RSVS+ N RI
Sbjct: 225 SIAQVTAANIARFKSERAKSVNRSSLSELLVSFFAKVEDPFEQPVNAARSVSRRNLDRIA 284
Query: 322 SAFTYGARKL 331
F +R+L
Sbjct: 285 QVFQITSRRL 294
>gi|407407321|gb|EKF31173.1| hypothetical protein MOQ_004991 [Trypanosoma cruzi marinkellei]
Length = 1349
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 57/82 (69%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K +++ DI+ QLGG+ + FL+++D LIG HL KR+++L+KAWC Y
Sbjct: 593 LVFAEVRVLKLVMEGSCFDITVGQLGGVECVRFLQEMDMLIGCQHLLKRTLLLLKAWCCY 652
Query: 199 ESRILGAHHGLISTYALETLVL 220
E+ IL G +S+YA +++
Sbjct: 653 EAHILSGQGGYLSSYAATIMLI 674
>gi|71652853|ref|XP_815075.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70880102|gb|EAN93224.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1276
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 57/82 (69%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K +++ DI+ QLGG+ + FL+++D LIG HL KR+++L+KAWC Y
Sbjct: 564 LVVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCY 623
Query: 199 ESRILGAHHGLISTYALETLVL 220
E+ IL G +S+YA +++
Sbjct: 624 EAHILSGQGGYLSSYAATIMLI 645
>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
Length = 730
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 100/221 (45%), Gaps = 43/221 (19%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT----AFGGL 107
+ PT E R VI+ + R I++ + EV+PFGS K YLP GD+DL + L
Sbjct: 110 MTPTAAEHETRCMVIELISRAIKSQFRDAEVYPFGSQETKLYLPQGDLDLVVVSNSMANL 169
Query: 108 NVEEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLV--QNIVVDISFNQLG 164
V+ AL + + L R + + D Q+I +A+V ++K + + VDIS N
Sbjct: 170 RVQSALRT-MAACLRRHN-------LATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTN 221
Query: 165 GLSTLCFLEQVDR---------LIGKDHLFKRS--------------IILIKAWCYYESR 201
GL+T ++ R L+ K L +R II++ ++ +
Sbjct: 222 GLTTASYVNSWLRKWPHIRPLILVVKYLLMQRGMSEVFSGGLGSYSVIIMVISFLQLHPK 281
Query: 202 ILGAHHGLISTYALETLVLYKFLDYFSK-FDWDSYCISLNG 241
+ G I ++L +FL+ + K F +D+ IS+ G
Sbjct: 282 V---QRGEIDADRSLGVLLLEFLELYGKNFGYDNCGISIRG 319
>gi|343427054|emb|CBQ70582.1| related to TRF4-topoisomerase I-related protein [Sporisorium
reilianum SRZ2]
Length = 697
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 90/202 (44%), Gaps = 37/202 (18%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT----AFGGL 107
+ PTV E R V++ + R I++ + EV PFGS K YLP GD+DL + L
Sbjct: 112 MAPTVAEHETRCMVVELISRAIKSQFRDAEVHPFGSQETKLYLPQGDLDLVVVSQSMANL 171
Query: 108 NVEEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLV--QNIVVDISFNQLG 164
+ AL + + L R + + D Q+I +A+V ++K + + VDIS N
Sbjct: 172 RTQSAL-RTMAACLRRHN-------LATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTN 223
Query: 165 GLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAH----HGLISTYALETLVL 220
GL+T ++ ++ W + L H G I ++L
Sbjct: 224 GLTTASYVNS----------------WLRKWPHIRISFLQLHPKVQRGEIEADRSLGVLL 267
Query: 221 YKFLDYFSK-FDWDSYCISLNG 241
+FL+ + K F +D+ IS+ G
Sbjct: 268 LEFLELYGKNFGYDNCGISIRG 289
>gi|71408844|ref|XP_806800.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870651|gb|EAN84949.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1239
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 57/82 (69%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K +++ DI+ QLGG+ + FL+++D LIG HL KR+++L+KAWC Y
Sbjct: 567 LVVAEVRVLKLVMEGGCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCY 626
Query: 199 ESRILGAHHGLISTYALETLVL 220
E+ IL G +S+YA +++
Sbjct: 627 EAHILSGQGGYLSSYAATIMLI 648
>gi|407846652|gb|EKG02680.1| hypothetical protein TCSYLVIO_006286 [Trypanosoma cruzi]
Length = 893
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 57/82 (69%)
Query: 139 LIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYY 198
L+ AEV+++K +++ DI+ QLGG+ + FL+++D LIG HL KR+++L+KAWC Y
Sbjct: 221 LVVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCY 280
Query: 199 ESRILGAHHGLISTYALETLVL 220
E+ IL G +S+YA +++
Sbjct: 281 EAHILSGQGGYLSSYAATIMLI 302
>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
commune H4-8]
Length = 671
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 130/301 (43%), Gaps = 55/301 (18%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+ PT +E R ++ + R+I++ + EV PFGS K YLP GDIDL +E+
Sbjct: 170 ISPTPAEDEVRSMIVLLIARIIQDKFPDAEVRPFGSYGTKLYLPHGDIDLVVQSN-TLEQ 228
Query: 112 ALANDVCSVLER-EDQNKAAEFVVKDAQLIRAEVKLVKCLVQ----NIVVDISFNQLGGL 166
N+ +VL+R D ++A Q+I A V ++K + +DIS NQ GL
Sbjct: 229 ---NNKKTVLQRLADLIRSARLSSGKVQVIGARVPIIKFITAAEYGRFQIDISVNQFSGL 285
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDY 226
+ + R + + + RS++LI + + + G + +Y++ LVL FL
Sbjct: 286 VSSDIINGFQRGM-QCPIAIRSLVLILKLYLSQRGMNEVYTGGLGSYSIVCLVL-SFLQM 343
Query: 227 FSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEF----------------L 270
K NG + PE + G LLL EF L
Sbjct: 344 HPKI--------RNGEI-----------DPERNLGVLLL--EFFELYGKYHNYEEVGVSL 382
Query: 271 KECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
+ + FS RG+ +R P+ L+I DP N++ + N++++R GA
Sbjct: 383 RHGGQYFSKRVRGWYNYTR---PRSLSIEDPSDPENDV--ASGSYNYFKVRQTMA-GAHD 436
Query: 331 L 331
L
Sbjct: 437 L 437
>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
Length = 689
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 98/221 (44%), Gaps = 43/221 (19%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT----AFGGL 107
+ PT E R VI+ + R I++ + EV PFGS K YLP GD+DL + L
Sbjct: 109 MAPTAAEHETRCMVIELISRAIKSQFRDAEVHPFGSQETKLYLPQGDLDLVVVSRSMANL 168
Query: 108 NVEEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLV--QNIVVDISFNQLG 164
+ AL + + L R + + D Q+I +A+V ++K + + VDIS N
Sbjct: 169 RTQSALRT-MAACLRRHN-------LATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTN 220
Query: 165 GLSTLCFLEQVDR---------LIGKDHLFKRS--------------IILIKAWCYYESR 201
GL+T F+ R ++ K L +R II++ ++ +
Sbjct: 221 GLTTASFVNSWLRKWPHIRPLIIVVKHLLMQRGMSEVFSGGLGSYSIIIMVISFLQLHPK 280
Query: 202 ILGAHHGLISTYALETLVLYKFLDYFSK-FDWDSYCISLNG 241
+ G I ++L +FL+ + K F +D+ IS+ G
Sbjct: 281 V---QRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 318
>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
Length = 1487
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 92/205 (44%), Gaps = 28/205 (13%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYL 94
AE R EA + + PT E R V+ + R + + + EV PFGS K YL
Sbjct: 153 AEMLHRDVEA---FVNYISPTPEENEVRSLVVALITRAVTQAFPDAEVHPFGSYDTKLYL 209
Query: 95 PDGDIDLTAFG---GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVK--C 149
P GDIDL + +EA+ + + + ++R + K A+V +VK
Sbjct: 210 PVGDIDLVVHSQSMAYSKKEAVLHSIANTMKRAGITDRVRIISK------AKVPIVKFVT 263
Query: 150 LVQNIVVDISFNQLGGLS--TLC--FLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGA 205
L NI VDIS NQ G++ T+ FL ++ L RS++LI + +
Sbjct: 264 LHGNIPVDISINQGNGVTAGTMIKHFLAELPAL--------RSLVLIVKSFLSQRSMNEV 315
Query: 206 HHGLISTYALETLVLYKFLDYFSKF 230
+ G + +Y++ LV+ FL K
Sbjct: 316 YTGGLGSYSIVCLVI-SFLQMHPKI 339
>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Metaseiulus occidentalis]
Length = 491
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 120/309 (38%), Gaps = 67/309 (21%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLT 102
E QP + RR+ VI+ V+ IR + C V FGS YLP GDID+
Sbjct: 88 EEIHDFFMYAQPNAADQSRREQVIEKVRAAIREKWPDCVVEVFGSYKTGLYLPTGDIDMV 147
Query: 103 AFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISF 160
G + L + ++E++ K F V D +A V L+K + I VD+SF
Sbjct: 148 IQGNWEIIPPLFDLERQLIEKKVGEKNT-FKVLD----KASVPLIKFKDADTEIRVDLSF 202
Query: 161 NQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV- 219
NQ F++Q R + I ++K + + HG IS+Y+L ++
Sbjct: 203 NQANCTEAAAFVKQCCRTFPP---LAKLIFVLKQYLSLHG-LNEVFHGGISSYSLTLMIL 258
Query: 220 ----------------------LYKFLDYF-SKFDWDSYCISLNGPVRISSLPEVVVETP 256
L +FL+++ +F++D I +
Sbjct: 259 SFLQLHPEQEMVRSDKPETGKLLVEFLEFYGDRFEYDKMGIRIR---------------- 302
Query: 257 ENSGGDLLLSSEFLKEC-VEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKG 315
G L+ L+EC + PS G + L I DPL N++ RS
Sbjct: 303 ---DGGALVDKNQLRECLIAAGGPPSSGSNL---------LCIEDPLTPGNDVARSSYAM 350
Query: 316 NFYRIRSAF 324
+ R+R AF
Sbjct: 351 S--RVRDAF 357
>gi|79571331|ref|NP_181504.2| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|53850481|gb|AAU95417.1| At2g39740 [Arabidopsis thaliana]
gi|55733735|gb|AAV59264.1| At2g39740 [Arabidopsis thaliana]
gi|330254623|gb|AEC09717.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 511
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 142/340 (41%), Gaps = 75/340 (22%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDIDLTA 103
Q I+ ++PT + R VID ++ ++++ G V PFGS + GD+D++
Sbjct: 12 QEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDISV 71
Query: 104 ---------FGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--Q 152
F G ++ L + L +A+ K +I A V ++K + Q
Sbjct: 72 DLFSGSSILFTGKKQKQTLLGHLLRAL------RASGLWYKLQFVIHARVPILKVVSGHQ 125
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
I DIS + L GL FL + + G+ F+ ++L+K W + I + G ++
Sbjct: 126 RISCDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKTGTFNS 181
Query: 213 YALETLVLYKFLDYFSKFDWDSYCI-SLNGPVRI----SSLPEV--VVETPENSGGDLLL 265
Y+L LV++ F C+ ++ P+R+ S++ ++ V +T E S +
Sbjct: 182 YSLSLLVIFHF----------QTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTA 231
Query: 266 S------------------SEFLKECVEQFS---VPSRGF------------DTNSRSFP 292
+ SE L +FS V ++ F +N+ P
Sbjct: 232 ANIARFKSERAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLP 291
Query: 293 PKH-LNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
+ L + DP ++ N RSVS+ N RI F +R+L
Sbjct: 292 KTYSLFVEDPFEQPVNAARSVSRRNLDRIAQVFQITSRRL 331
>gi|110735731|dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana]
Length = 511
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 142/340 (41%), Gaps = 75/340 (22%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDIDLTA 103
Q I+ ++PT + R VID ++ ++++ G V PFGS + GD+D++
Sbjct: 12 QEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDISV 71
Query: 104 ---------FGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--Q 152
F G ++ L + L +A+ K +I A V ++K + Q
Sbjct: 72 DLFSGSSILFTGKKQKQILLGHLLRAL------RASGLWYKLQFVIHARVPILKVVSGHQ 125
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
I DIS + L GL FL + + G+ F+ ++L+K W + I + G ++
Sbjct: 126 RISCDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKTGTFNS 181
Query: 213 YALETLVLYKFLDYFSKFDWDSYCI-SLNGPVRI----SSLPEV--VVETPENSGGDLLL 265
Y+L LV++ F C+ ++ P+R+ S++ ++ V +T E S +
Sbjct: 182 YSLSLLVIFHF----------QTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTA 231
Query: 266 S------------------SEFLKECVEQFS---VPSRGF------------DTNSRSFP 292
+ SE L +FS V ++ F +N+ P
Sbjct: 232 ANIARFKSERAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLP 291
Query: 293 PKH-LNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
+ L + DP ++ N RSVS+ N RI F +R+L
Sbjct: 292 KTYSLFVEDPFEQPVNAARSVSRRNLDRIAQVFQITSRRL 331
>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens
LYAD-421 SS1]
Length = 647
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 127/317 (40%), Gaps = 38/317 (11%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDIDLTAF 104
+ + + PT + +E R + + R I ++Y +V PFGS K YLP GDIDL +
Sbjct: 162 VEAFVDYMSPTPIEDEVRSLSVQLIARAISKSYPDAKVLPFGSYETKLYLPSGDIDLVIY 221
Query: 105 GGLNVEEALANDVCSVLER-EDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFN 161
+ D SVL + K A + + +A+V ++K + + VDIS N
Sbjct: 222 S----HSMMRMDKVSVLHSLANIMKRAGITDRVTIIAKAKVPIIKFVTAHGRFSVDISVN 277
Query: 162 QLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY 221
Q G+ T ++Q R + RS++LI + + G + +Y++ L +
Sbjct: 278 QGNGVDTGKMVKQFLRELPA----LRSLVLIIKNFLSQRSMNEVFTGGLGSYSIVCLAI- 332
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEF---LKECVEQFS 278
FL K S N V + E+ G E L++ F+
Sbjct: 333 SFLQMHPKIRRGEIDPSKNLGVLVMEFFELY--------GSYFNYQEVGISLRDGGSYFN 384
Query: 279 VPSRG-FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAF-------TYGARK 330
RG FD P+ L+I DP N++ R NF R+R+ T A
Sbjct: 385 KRQRGWFDYRE----PRLLSIEDPGDPTNDISRGSY--NFARVRTTLAGAHGIMTAAAYA 438
Query: 331 LGHILSQPEESLTDELR 347
I+S E T LR
Sbjct: 439 QASIISARREGRTVRLR 455
>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
Length = 701
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 115/267 (43%), Gaps = 46/267 (17%)
Query: 10 EPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQ---PTVVSEERRKAV 66
E +G ER + S P + ++ AE + +IA Q PT E R V
Sbjct: 64 EDDGRTKKEREMARSRHTPWSADVEWSKCQNGAEALHRELIAFDQWMAPTGAEHETRCMV 123
Query: 67 IDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT----AFGGLNVEEALANDVCSVL 121
I+ + R I++ + EV PFGS K YLP GD+DL + L + AL + + L
Sbjct: 124 IELIARAIKSQFRDAEVRPFGSQETKLYLPQGDLDLVVVSRSMANLRTQSALRT-MAACL 182
Query: 122 EREDQNKAAEFVVKDAQLI-RAEVKLVKCLV--QNIVVDISFNQLGGLSTLCFLEQVDR- 177
R + + D Q+I +A+V ++K + + VDIS N GL+T ++ R
Sbjct: 183 RRHN-------LATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLTTASYVNGWLRK 235
Query: 178 --------LIGKDHLFKRS--------------IILIKAWCYYESRILGAHHGLISTYAL 215
L+ K L +R II++ ++ ++ G I
Sbjct: 236 WPHIRPLILVIKHLLMQRGMSEVFSGGLGSYSVIIMVISFLQLHPKL---QRGEIEPGRS 292
Query: 216 ETLVLYKFLDYFSK-FDWDSYCISLNG 241
++L +FL+ + K F +D+ IS+ G
Sbjct: 293 LGVLLLEFLELYGKNFGYDNCGISIRG 319
>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
98AG31]
Length = 987
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 72/135 (53%), Gaps = 13/135 (9%)
Query: 50 IAQVQPTVVSEERRKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
+A ++PT +E R +I+ +++ + + +V PFGS K YLP GDIDL
Sbjct: 242 VAYIRPTREEDELRLMIIEMIRKAVTMQWPDADVVPFGSFGTKLYLPGGDIDLVILSTRM 301
Query: 109 VEEALANDVCSV--LEREDQNKAAEFVVKDAQLIRAEVKLV--KCLVQNIVVDISFNQLG 164
+++A + + + L RE QN + VV + +A+V ++ K + N VDIS NQ
Sbjct: 302 MKDAKSKILYRLAPLLRE-QNIGQDVVV----IAKAKVPIIKFKTIFGNFQVDISINQSN 356
Query: 165 GLSTLCFLEQVDRLI 179
G L LE+V+ L+
Sbjct: 357 G---LVALEKVNELL 368
>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor
FP-101664 SS1]
Length = 660
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 79/171 (46%), Gaps = 34/171 (19%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYL 94
AE + R E + + + PT + +E R V+ V R + R Y +V PFGS K YL
Sbjct: 164 AEMYARIE--VEAFVKYISPTPIEDEVRSLVVALVSRAVTRTYTDAQVLPFGSYETKLYL 221
Query: 95 PDGDIDLTAFGG-------LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLV 147
P GDIDL + ++V +LAN V K A + + +A+V ++
Sbjct: 222 PLGDIDLVIYSQSMARMDRVSVLHSLANIV----------KRAGITDRVTIIAKAKVPII 271
Query: 148 KCLVQN--IVVDISFNQLGGLS----TLCFLEQVDRLIGKDHLFKRSIILI 192
K + + VDIS NQ G++ FLE++ L RS++LI
Sbjct: 272 KFVTTHGRFSVDISINQGNGVTAGKMVKQFLEELPAL--------RSLVLI 314
>gi|440291374|gb|ELP84643.1| PAP-associated domain containing protein, putative [Entamoeba
invadens IP1]
Length = 475
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 108/248 (43%), Gaps = 47/248 (18%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRNYLGCE--VFPFGSVPLKTYLPDGDIDLTAFGGLNVE 110
V+P + E R+ V++ R+I N E V PFGS K +LP DID T
Sbjct: 27 VEPNPIEYEIRRYVLEKYTRVIENDKKSEIKVVPFGSTQSKLFLPSSDIDFTVVTKGGKT 86
Query: 111 EALANDVCSVLE---REDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGG 165
+ N V +L ED+ +A +RA V ++K + IV+DIS N G
Sbjct: 87 NMVLNSVARILSLYTMEDEKRA----------LRATVPVIKLTDRETGIVLDISHNNESG 136
Query: 166 LSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV------ 219
+ T+ ++E + + + L + + +IK L A GL TY+L +V
Sbjct: 137 VDTVRWME---KEMKSNALIRPLLFIIKTVLSSYELNLPALGGL-GTYSLFMMVFCFFRE 192
Query: 220 -------------LYKFLDYF-SKFDWDSYCISLNGPVRI------SSLPEVVVETPENS 259
L +FL Y+ ++FD + +S+ G +SL + +E P ++
Sbjct: 193 KGSDLKDKRGGAILLRFLKYYATEFDSRKFGLSVTGNFSREERHWDASLQNLSIEDPCDT 252
Query: 260 GGDLLLSS 267
D+ +SS
Sbjct: 253 SNDVSISS 260
>gi|343425896|emb|CBQ69429.1| related to caffeine-induced death protein 1 Cid1 [Sporisorium
reilianum SRZ2]
Length = 1181
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 101/407 (24%), Positives = 169/407 (41%), Gaps = 74/407 (18%)
Query: 12 NGAVFGERPSSSSSSVPSN-QTAIGAEYWQRAEEATQGIIAQVQPTVVSEER---RKAVI 67
NG+V G S+++ PS+ QT G ++ + E T I+A + P + +EE ++A
Sbjct: 304 NGSVNGASAPISAATSPSHTQTHSGPQWERHTTELTNCIVAFLSPILPTEEEYRIKEATR 363
Query: 68 DYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQ 126
++RL R G ++ FGS+ L + D+DL G + + Q
Sbjct: 364 RQLERLANRVSPGAKLLAFGSMANGFALRNSDMDLCCLIGKGPDG----------QPTTQ 413
Query: 127 NKAAEFVVKDAQLIR------------AEVKLVKC-------LVQNIVVDISF-NQLGGL 166
+ A+E V QLIR A + ++K L I DI F N+L
Sbjct: 414 HTASELVEILGQLIREETDFTVMPLPKARIPIIKINRSPTADLPYEIACDIGFENRLALE 473
Query: 167 STLCFLEQVDRLIGKDHLFKRSIIL-IKAWCYYESRILGAHHGLISTYALETLVLYKFLD 225
+T L ++ L R+++L +K W ++ + G +S+Y +VL+ FL
Sbjct: 474 NTRLLLSYA--MVDPPRL--RTLVLFLKVWAKRR-KLNSPYMGTLSSYGYTLMVLF-FLA 527
Query: 226 YFSK-------------FDWDSYCISLNGP-----VRISSLPEVVVETPENSGGDLLLSS 267
Y K + LNG +++L + ++ G+LL+
Sbjct: 528 YVKKPAVLPNLQRVPPTRTMKPDEMELNGNNIYFYDDVAALRKAWTSHNTDNVGELLI-- 585
Query: 268 EFLKECVEQFS----VPSRGFDTNSRSFPPKHLN----IVDPLKENNNLGRSVSKGNFYR 319
+F + ++FS V S +T S K N I DP + N+ R+V+K Y
Sbjct: 586 DFFRYFSKEFSYARDVISLKSETGLLSKDSKSWNAELCIEDPFQMGYNVSRTVTKDGLYT 645
Query: 320 IRSAFTYGARKLGHILSQPEESLTDELRKFFSNTLDRH----GSGQR 362
IR F +R L + Q +L +L + ++L R GS QR
Sbjct: 646 IRGEFMRASRILANARGQKISALIADLCEEREDSLSRAPDGPGSMQR 692
>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
MF3/22]
Length = 732
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 80/185 (43%), Gaps = 24/185 (12%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDID---- 100
+ + V PT V E R V+ + I R Y EV PFGS K YLP GDID
Sbjct: 153 VEAYLKYVSPTPVEHEVRWMVVQLISSSIKRVYSDSEVLPFGSFGTKLYLPQGDIDLVVQ 212
Query: 101 ---LTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVK--CLVQNIV 155
L +F + ++LAN V K K + +A V ++K L
Sbjct: 213 SRTLASFEKVTALKSLANIV----------KRTGLADKVTIISQARVPIIKFTTLYGRFA 262
Query: 156 VDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
VDIS NQ G+ T + ++R + + R+I+LI + + + G + +YA+
Sbjct: 263 VDISMNQSNGVKT---GDMINRFLNEFPAL-RAIVLIVKSFLKQRNLNEVYSGGLGSYAI 318
Query: 216 ETLVL 220
L +
Sbjct: 319 VCLAV 323
>gi|336367333|gb|EGN95678.1| hypothetical protein SERLA73DRAFT_60289 [Serpula lacrymans var.
lacrymans S7.3]
Length = 538
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 85/188 (45%), Gaps = 7/188 (3%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF 104
+ + + P+ V +E R VI V + + + + +V PFGS K YLPDGDIDL
Sbjct: 189 VEAFVNYMSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDLVI- 247
Query: 105 GGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQ 162
+ A +N V + + K A+ K + +A+V +VK + + + VDIS NQ
Sbjct: 248 --QSESMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQ 305
Query: 163 LGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYK 222
G+ + + + RS+++I + + + G + +Y++ L +
Sbjct: 306 GNGVIAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTGGLGSYSIVCLAI-S 364
Query: 223 FLDYFSKF 230
FL K
Sbjct: 365 FLQMHPKI 372
>gi|336380050|gb|EGO21204.1| hypothetical protein SERLADRAFT_476100 [Serpula lacrymans var.
lacrymans S7.9]
Length = 592
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 6/124 (4%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF 104
+ + + P+ V +E R VI V + + + + +V PFGS K YLPDGDIDL
Sbjct: 189 VEAFVNYMSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDLVI- 247
Query: 105 GGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQ 162
+ A +N V + + K A+ K + +A+V +VK + + + VDIS NQ
Sbjct: 248 --QSESMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQ 305
Query: 163 LGGL 166
G+
Sbjct: 306 GNGV 309
>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora
B]
Length = 547
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 60/133 (45%), Gaps = 20/133 (15%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDIDLTAF 104
+G + + PT +E R V++ ++R I R + +V PFGS K YLP GDIDL
Sbjct: 176 VEGFVRYISPTPQEDEVRSLVVELIRRAITRQFPDAQVLPFGSYETKLYLPLGDIDLVIH 235
Query: 105 GGL-------NVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--QNIV 155
NV ALAN L R + + K A+V +VK +
Sbjct: 236 SNTMAYSDKENVLRALAN----TLRRAGITDNVKIIAK------AKVPIVKFVTIHGRFS 285
Query: 156 VDISFNQLGGLST 168
VDIS NQ G++
Sbjct: 286 VDISINQGNGVAA 298
>gi|449707156|gb|EMD46861.1| PAPassociated domain containing protein [Entamoeba histolytica
KU27]
Length = 400
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 116/271 (42%), Gaps = 42/271 (15%)
Query: 79 GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQ 138
G V FGS K +LP DID + + N V S+L + +V++D +
Sbjct: 52 GYNVMAFGSTQSKLFLPTSDIDFSVLTNEYNTRKVLNSVSSIL--------SSYVLEDQK 103
Query: 139 L-IRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKA- 194
+A + ++K + IV+DIS N G T+ F+E+V I KD ++ ++LIK+
Sbjct: 104 RNFKASIPVLKLTDKKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSI 160
Query: 195 WCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
C Y+ +G + TY++ +V YC N + E++
Sbjct: 161 LCCYDFH--QPANGGLGTYSVFVMV---------------YCYINNNNITTHDYGELLKG 203
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPK--HLNIVDPLKENNNLGRSV 312
+ G D + +E SV F+ + R++ K +L+I DP +N++ +
Sbjct: 204 FLKYYGID-------FRSDIEGLSVFEGKFNRSERNWDSKISNLSIEDPCDLSNDVSVTS 256
Query: 313 SKGNFYRIRSAFTYGARKLGHILS-QPEESL 342
+ + + +Y A H+ PE SL
Sbjct: 257 FRWQYIKYLFKMSYNALHYTHLKKYDPEHSL 287
>gi|409081996|gb|EKM82354.1| hypothetical protein AGABI1DRAFT_52475, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 559
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 80/174 (45%), Gaps = 12/174 (6%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG---GLN 108
+ PT + +E R+ + + R I + G +VFPFGS K YLP GDIDL +
Sbjct: 153 MAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDIDLVIVSDSMAYS 212
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGL 166
+ ++ + + SVL R A + +A+V +VK + + VDIS NQ G+
Sbjct: 213 NKSSVLHSLASVLRR------AGIASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGI 266
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
++ + + L RS++LI + + G + +Y++ L +
Sbjct: 267 VGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSIVCLAI 320
>gi|297823863|ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
lyrata]
gi|297325653|gb|EFH56073.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
lyrata]
Length = 500
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 132/333 (39%), Gaps = 61/333 (18%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDIDLTA 103
Q I+ ++PT + R VID ++ +++ G V PFGS + GD+DL+
Sbjct: 12 QEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNLFTRWGDLDLSV 71
Query: 104 ---------FGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--Q 152
F G ++ L + L +FV I A V ++K + Q
Sbjct: 72 DLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFV------IHARVPILKVVSGHQ 125
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
I DIS + L GL FL + + G+ F+ ++L+K W + I + +G ++
Sbjct: 126 RIACDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKNGTFNS 181
Query: 213 YALETLVLYKF---------------------------------LDYFSKFDWDSYCISL 239
Y+L LV++ + + + + ++
Sbjct: 182 YSLSLLVIFHLQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKLNT 241
Query: 240 NGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKH-LNI 298
V SSL E++V D+ L ++ L C F+ +N+ P + L +
Sbjct: 242 AKSVNRSSLSELLVSF-YAKFSDINLKAQELGVC--PFTGRWENISSNTTWLPKTYSLFV 298
Query: 299 VDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
DP ++ N RSVS+ N RI F +R+L
Sbjct: 299 EDPFEQPVNAARSVSRRNLDRIAQVFQITSRRL 331
>gi|67465021|ref|XP_648697.1| topoisomerase [Entamoeba histolytica HM-1:IMSS]
gi|56464936|gb|EAL43308.1| topoisomerase, putative [Entamoeba histolytica HM-1:IMSS]
Length = 400
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 116/271 (42%), Gaps = 42/271 (15%)
Query: 79 GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQ 138
G V FGS K +LP DID + + N V S+L + +V++D +
Sbjct: 52 GYNVMAFGSTQSKLFLPTSDIDFSVLTNEYNTRKVLNSVSSIL--------SSYVLEDQK 103
Query: 139 L-IRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKA- 194
+A + ++K + IV+DIS N G T+ F+E+V I KD ++ ++LIK+
Sbjct: 104 RNFKASIPVLKLTDKKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSI 160
Query: 195 WCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
C Y+ +G + TY++ +V YC N + E++
Sbjct: 161 LCCYDFH--QPANGGLGTYSVFVMV---------------YCYINNNIITTHDYGELLKG 203
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPK--HLNIVDPLKENNNLGRSV 312
+ G D + +E SV F+ + R++ K +L+I DP +N++ +
Sbjct: 204 FLKYYGID-------FRSDIEGLSVFEGKFNRSERNWDSKISNLSIEDPCDLSNDVSVTS 256
Query: 313 SKGNFYRIRSAFTYGARKLGHILS-QPEESL 342
+ + + +Y A H+ PE SL
Sbjct: 257 FRWQYIKYLFKMSYNALHYTHLKKYDPEHSL 287
>gi|426199822|gb|EKV49746.1| hypothetical protein AGABI2DRAFT_63272 [Agaricus bisporus var.
bisporus H97]
Length = 481
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 79/174 (45%), Gaps = 12/174 (6%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG---GLN 108
+ PT + +E R+ + + R I + G +VFPFGS K YLP GDIDL +
Sbjct: 154 MAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDIDLVIVSDSMAYS 213
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGL 166
+ ++ + + SVL R + K A+V +VK + + VDIS NQ G+
Sbjct: 214 NKSSVLHSLASVLRRAGIASNVTVIAK------AKVPIVKFVTIHGRFNVDISINQTNGI 267
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
++ + + L RS++LI + + G + +Y++ L +
Sbjct: 268 VGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSIVCLAI 321
>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 876
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 62/123 (50%), Gaps = 6/123 (4%)
Query: 50 IAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
+A +QPT + R+ +I +++ + + + +V PFGS K YLP GDIDL
Sbjct: 80 VAYIQPTHEEHQLRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLPAGDIDLVIISTQM 139
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLV--KCLVQNIVVDISFNQLGGL 166
+ E + + + +N + VV + +A+V ++ K + NI VDIS NQ G+
Sbjct: 140 MNEQKSRILYKLAPLIRENNIGQDVVV---IAKAKVPIIKFKTIFGNINVDISINQTNGI 196
Query: 167 STL 169
+
Sbjct: 197 VAM 199
>gi|238609344|ref|XP_002397464.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
gi|215471952|gb|EEB98394.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
Length = 174
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 55/119 (46%), Gaps = 24/119 (20%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+ P+ V +E R ++ + I+ Y EV PFGS K YLP GDID+
Sbjct: 27 ISPSPVEDEIRSLLVQLISSAIKTRYPDAEVHPFGSYATKLYLPTGDIDIV--------- 77
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ----NIVVDISFNQLGGL 166
VL R FV A+L +A V +VK + + I VDISFNQ GG+
Sbjct: 78 --------VLSRTHTIAFRCFVT--AKLAKARVPIVKFVTRVELGGIPVDISFNQPGGV 126
>gi|410911160|ref|XP_003969058.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
Length = 803
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 117/295 (39%), Gaps = 60/295 (20%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
RK V++ ++ +I+ + +V FGS YLP DIDL FG E ++ L
Sbjct: 275 RKEVVNRIETIIKELWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQELEQAL 332
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQVDRLI 179
+ N A F +K L +A V ++K Q + VDISFN G+ F++ ++
Sbjct: 333 RK--HNVAEPFSIK--VLDKATVPIIKLTDQETEVKVDISFNVETGVKAASFIKDYVKMY 388
Query: 180 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFD-------- 231
+ I ++K + + + G IS+Y+L L++ FL + D
Sbjct: 389 P---VLPYLIFVLKQFL-LQRDLNEVFTGGISSYSL-ILMVISFLQLHPRIDARNPNENL 443
Query: 232 ----------WDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
+ + L +RI +GG + E +KE
Sbjct: 444 GVLLIEFFELYGRHFNYLKTGIRIK------------NGGSYMAKEEIMKEM-------- 483
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
+ + P L I DPL N++GR S G + ++ F Y L H +S
Sbjct: 484 ------NNGYRPSMLCIEDPLLPGNDVGRG-SYGAMH-VKQVFDYAYTVLSHAVS 530
>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
HHB-10118-sp]
Length = 478
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 74/163 (45%), Gaps = 29/163 (17%)
Query: 46 TQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDIDLT-- 102
+ + + PT +E R +++ + R + + + V PFGS K YLP GDIDL
Sbjct: 157 VEAFVKYISPTQEEDEIRSLIVESISRAVTKAFPDARVLPFGSYETKLYLPLGDIDLVIE 216
Query: 103 ----AFGG-LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IV 155
A+ +NV +ALA + K A K + +A+V ++K + ++
Sbjct: 217 SDSMAYNNKVNVLQALATTM----------KRAGITDKVTIIAKAKVPIIKFVTRHGRFS 266
Query: 156 VDISFNQLGGLSTLCFLE---------QVDRLIGKDHLFKRSI 189
VDIS NQ+ G+ ++ Q LI K L +RS+
Sbjct: 267 VDISLNQMNGVKAGTMIKRFLDHIPALQALVLITKSFLSQRSM 309
>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Oreochromis niloticus]
Length = 672
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 124/301 (41%), Gaps = 52/301 (17%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGG----- 106
+ P E+ R V+D ++ +I + + EV FGS YLP DIDL FG
Sbjct: 191 ISPRPEEEKMRLEVVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDIDLVVFGKWESLP 250
Query: 107 -LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQL 163
+EEAL +N A E +K L +A V ++K + VDISFN +
Sbjct: 251 LWTLEEAL----------RKKNVADENSIK--VLDKATVPIIKLTDSYTEVKVDISFNVM 298
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
G+ +++ K + ++++K + + + G I +Y+L L+ F
Sbjct: 299 SGVKAARLIKEFKE---KYPVLPYLVLVLKQFL-LQRDLNEVFTGGIGSYSL-FLMAVSF 353
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPE--------NSGGDLLLSSEFLKECVE 275
L + D + +++N V + E+ GG + E K ++
Sbjct: 354 LQL--HYREDVFGLNINIGVLLIEFFELYGRNFNYLKTGIRIKDGGCYVAKDEVQKNMLD 411
Query: 276 QFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL 335
+ P L I DPL+ +N++GRS S G +++ AF Y L H +
Sbjct: 412 --------------GYRPSMLYIEDPLQPDNDVGRS-SYGAM-QVKQAFDYAYVVLSHAV 455
Query: 336 S 336
S
Sbjct: 456 S 456
>gi|407039791|gb|EKE39813.1| topoisomerase, putative [Entamoeba nuttalli P19]
Length = 400
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 112/262 (42%), Gaps = 41/262 (15%)
Query: 79 GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQ 138
G V FGS K +LP DID + + N V S+L + +V++D +
Sbjct: 52 GYNVMAFGSTQSKLFLPTSDIDFSVITNEYNTRKVLNSVSSIL--------SSYVLEDQK 103
Query: 139 L-IRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKA- 194
+A + ++K + IV+DIS N G T+ F+E+V I KD ++ ++LIK+
Sbjct: 104 RNFKASIPVLKLTDKKTLIVLDISHNNTNGTKTVNFIEEV---IKKDDRIRKLVLLIKSL 160
Query: 195 WCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVE 254
C Y+ +G + TY++ +V YC N + E++
Sbjct: 161 LCCYDFH--QPANGGLGTYSVFVMV---------------YCYINNNIITTHDYGELLKG 203
Query: 255 TPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPK--HLNIVDPLKENNNLGRSV 312
+ G D + +E SV F+ + R++ K +L+I DP +N++ +
Sbjct: 204 FLKYYGID-------FRSDIEGLSVFEGKFNRSERNWDSKISNLSIEDPCDLSNDVSITS 256
Query: 313 SKGNFYRIRSAFTYGARKLGHI 334
+ + + +Y A H+
Sbjct: 257 FRWQYIKYLFKMSYNALHYTHL 278
>gi|358055188|dbj|GAA98957.1| hypothetical protein E5Q_05645 [Mixia osmundae IAM 14324]
Length = 813
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 62/133 (46%), Gaps = 8/133 (6%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
Q A V P+ R VID V+ +R + ++ PFGS + YLP GDIDL
Sbjct: 288 QAFKAYVTPSRAEHAFRGHVIDQVRNALRQIWADTDLQPFGSYLTQLYLPGGDIDLVMLS 347
Query: 106 GLNVEEALANDVCSVLE-REDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQ 162
+ + + + + D N +FVV + RA+V +VK + +DIS NQ
Sbjct: 348 ATAASQTPSRVLHRIAQIMRDANIGYDFVV----ISRAKVPIVKFISTTGGFNIDISLNQ 403
Query: 163 LGGLSTLCFLEQV 175
GG+ ++++
Sbjct: 404 PGGIRAGTVVQRM 416
>gi|406701338|gb|EKD04487.1| hypothetical protein A1Q2_01263 [Trichosporon asahii var. asahii
CBS 8904]
Length = 624
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 12/162 (7%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
RK +ID + +IR + V PFGS + YLP GDIDL E+ + +
Sbjct: 126 RKTMIDLITHIIRKEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLSEKNKVTMLHQLA 185
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQ-VDRL 178
N E V A + RA+V ++K + I VDIS NQ G+S + + + L
Sbjct: 186 RMMRGNHITETV---AVITRAKVPIIKFVTAEGGINVDISLNQTNGVSAVKIVNHYLKAL 242
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
G L I++IKA+ S + + G + +Y++ L L
Sbjct: 243 PGAREL----ILVIKAFLSQRS-MNEVYTGGLGSYSVICLAL 279
>gi|326676716|ref|XP_686065.4| PREDICTED: DNA polymerase sigma [Danio rerio]
Length = 706
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/292 (24%), Positives = 115/292 (39%), Gaps = 54/292 (18%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R+ V+D ++ +I+ + +V FGS +LP DIDL FG E+ + L
Sbjct: 231 RQEVVDRIESVIKELWPTADVQIFGSFSTGLFLPTSDIDLVVFGKW--EKPPLQQLEQAL 288
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQVDRLI 179
+ + V D +A V ++K Q + VDISFN G+ F+++ +
Sbjct: 289 RKHSVAEPYSIKVLD----KATVPIIKLTDQETEVKVDISFNVETGIKAASFIKE---YV 341
Query: 180 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISL 239
K + I ++K + + + G IS+Y+L L++ FL + D
Sbjct: 342 KKYTVLPYLIFVLKQFL-LQRDLNEVFTGGISSYSL-ILMVISFLQLHPRID-------- 391
Query: 240 NGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTN------------ 287
P + G LL+ EF + F+ G
Sbjct: 392 -------------TRNPNMNLGILLI--EFFELYGRHFNYLKTGIRIKNGGAYMAKEDIM 436
Query: 288 ---SRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
S + P L I DPL N++GRS S G +++ AF Y L H +S
Sbjct: 437 KAMSNGYRPSMLCIEDPLLPGNDVGRS-SYGAM-QVKEAFDYAYIILSHAVS 486
>gi|218186296|gb|EEC68723.1| hypothetical protein OsI_37216 [Oryza sativa Indica Group]
Length = 112
Score = 51.6 bits (122), Expect = 0.001, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 17 GERPSSSSSSVPSNQ--TAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI 74
G P+ PSN ++I E W E A ++A++QP SE+RR AVI YVQ L+
Sbjct: 6 GCSPALEPVPTPSNPDPSSISQEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLL 65
Query: 75 RNYLGCEV 82
R +GC++
Sbjct: 66 RFNVGCQM 73
>gi|401882466|gb|EJT46724.1| hypothetical protein A1Q1_04689 [Trichosporon asahii var. asahii
CBS 2479]
Length = 631
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 12/162 (7%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
RK +ID + +IR + V PFGS + YLP GDIDL E+ + +
Sbjct: 126 RKTMIDLITHIIRKEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLSEKNKVTMLHQLA 185
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQ-VDRL 178
N E V A + RA+V ++K + I VDIS NQ G+S + + + L
Sbjct: 186 RMMRGNHITETV---AVITRAKVPIIKFVTAEGGINVDISLNQTNGVSAVKIVNHYLKAL 242
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
G L I++IKA+ S + + G + +Y++ L L
Sbjct: 243 PGAREL----ILVIKAFLSQRS-MNEVYTGGLGSYSVICLAL 279
>gi|346972692|gb|EGY16144.1| Poly(A) RNA polymerase cid14 [Verticillium dahliae VdLs.17]
Length = 726
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 94/214 (43%), Gaps = 32/214 (14%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIR---NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
V+P + R +I+ ++ +R Y GCEV PFGS YLP D+D+ +
Sbjct: 420 HVRPRAFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSGLYLPTADMDIV----IC 475
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKD--------AQLIRAEVKLVKCL--VQNIVVDI 158
+E L+ + + K F+ ++ + +A V LVK + V + VDI
Sbjct: 476 SKEWLSGRMTAFPGGSSLYKFRAFLTQNRLADPSSVEVIAKARVPLVKYIDAVTGLRVDI 535
Query: 159 SFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAH-HGLISTYALET 217
SF+++ G + + L K+ I++ + R L +G I +++ +
Sbjct: 536 SFDRMDGPAAIKTF-----LNWKEQYPALPILVTIIKHFLAMRGLNEPVNGGIGSFSSKN 590
Query: 218 LV--------LYKFLD-YFSKFDWDSYCISLNGP 242
LV L +F D Y ++FD+ + I +N P
Sbjct: 591 LVPEHHLGEMLMEFFDLYGNRFDYKTTAIRINPP 624
>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
Length = 653
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 129/307 (42%), Gaps = 46/307 (14%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + + P E+ R V+ +QR+I++ + EV FGS YLP DIDL
Sbjct: 170 EEIKDFYEYISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLV 229
Query: 103 AFGG------LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--I 154
FG +EEAL + A E +K L +A V ++K + + +
Sbjct: 230 VFGNWETLPLWTLEEAL----------RKRKVADENSIK--VLDKATVPIIKLMDSHTEV 277
Query: 155 VVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 214
VDISFN G+ ++ + + ++L+ + + G I +Y+
Sbjct: 278 KVDISFNVQSGVKAANLIKDYK----QQYPVLPYLVLVLKQFLLQRELNEVFTGGIGSYS 333
Query: 215 LETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGG--DLLLSSEFLKE 272
L L+ FL + D +S + P +L +++E E G + L + +K+
Sbjct: 334 L-FLMAVSFLQLHCRED-----VSSSNP----NLGVLLIEFFELYGRHFNYLKTGIRIKD 383
Query: 273 ---CVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGAR 329
V + V D + P L I DPL+ N++GRS S G +++ AF Y
Sbjct: 384 GGSYVAKDEVQKSMLD----GYRPSMLYIEDPLQPGNDVGRS-SYGAM-QVKEAFDYAYV 437
Query: 330 KLGHILS 336
L H +S
Sbjct: 438 ILSHAVS 444
>gi|348512463|ref|XP_003443762.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
Length = 789
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 119/292 (40%), Gaps = 34/292 (11%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+ P E R+ V++ ++ +I++ + +V FGS YLP DIDL FG N +
Sbjct: 242 MSPKPEEESMRRDVVNRIEGIIKDLWPTVQVEIFGSFSTGLYLPTSDIDLVVFG--NWDH 299
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
++ L++ + + + + D + +KL C + + VDISFN + T
Sbjct: 300 PPLQELEQALKKHNVSGSHPIKLLDKATVPI-IKLTDCETR-VKVDISFN----IETAVK 353
Query: 172 LEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFD 231
Q + K + +I + + + G IS+Y+L L+ FL + D
Sbjct: 354 AAQFIKSYLKKYPVLPPLIFVLKQFLLQRELNEVFTGGISSYSL-ILMAISFLQLHPRID 412
Query: 232 WDSYCISLNGPVRIS-------SLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGF 284
I+L G + I + +GG L E LKE
Sbjct: 413 TSRPNINL-GILLIEFFELYGRDFDYIKTAIRVKNGGAYLCKEEMLKE-----------M 460
Query: 285 DTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILS 336
+R P L I DP++ N++GRS S G +++ F + L H +S
Sbjct: 461 GNGNR---PSMLCIEDPVQPGNDVGRS-SYG-VLQVKQVFDFAYMVLSHSVS 507
>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
Full=Caffeine-induced death protein 14; AltName:
Full=Polynucleotide adenylyltransferase cid14
gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
Length = 684
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 129/296 (43%), Gaps = 49/296 (16%)
Query: 49 IIAQVQPTVVSEERRKAVIDYV-QRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGL 107
I + PT RK ++ + Q +++ + ++ FGS K YLP D+DL
Sbjct: 248 FIDYITPTPEEHAVRKTLVSRINQAVLQKWPDVSLYVFGSFETKLYLPTSDLDLVIISPE 307
Query: 108 NVEEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCL--VQNIVVDISFNQLG 164
+ D+ + + K A + Q+I A V ++K + + + VDISFNQ G
Sbjct: 308 HHYRGTKKDMFVLAHHLKKLKLAS----EVQVITTANVPIIKFVDPLTKVHVDISFNQPG 363
Query: 165 GLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFL 224
GL T C + V+ + K + +I+IK + + + G +S+YA+ LV+
Sbjct: 364 GLKT-CLV--VNGFMKKYPALRPLVIIIKHFLNMRA-LNEVFLGGLSSYAIVCLVV---- 415
Query: 225 DYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGF 284
+ L+ + S+ E E++ G LLL EFL+ +QF + G
Sbjct: 416 ----------SFLQLHPRLSTGSMRE------EDNFGVLLL--EFLELYGKQFYYDAVGI 457
Query: 285 DTNSRSF-------------PPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYG 327
++ F P L+I DP+ N++ +S S+G R+++ F G
Sbjct: 458 AVHNGGFYFSKKKMGWLKPNQPYLLSIQDPVDFQNDVSKS-SRG-LLRVKATFANG 511
>gi|116235017|dbj|BAF34948.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 578
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 85/390 (21%), Positives = 158/390 (40%), Gaps = 58/390 (14%)
Query: 5 RDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEY-WQRAEEATQGIIAQ----------- 52
R SP+P + S S P+ Q A G+E W R + + Q
Sbjct: 98 RALSPQP-----APEATLSGSLAPAEQPAQGSERAWLRGGRRFRSPMLQLHKEILDFCDF 152
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGG-LNVE 110
+ P+ + R A + V +I++ + C+V FGS +LP DID+ F +
Sbjct: 153 ISPSAEEQSSRTAAVKAVSNVIKHIWPQCKVEVFGSFRTGLFLPTSDIDVVIFDSRVKTP 212
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLST 168
+ + L ++ K + + K A V +VK + + I DISF+ GG
Sbjct: 213 QVGLYALAKALSQKGVAKKIQVIAK------ARVPIVKFVERKSEIAFDISFDMDGGPQA 266
Query: 169 LCFLEQVDRLIGKDHLFK----RSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFL 224
F+ KD++ K R + +I ++ + + G I +YAL T+++
Sbjct: 267 ADFI--------KDYVKKFPALRHLCMILKVFLHQRELNEVYTGGIGSYALLTMLITHLQ 318
Query: 225 DYFSKFDWDSY-CISLNGPVRISSLPEVVVETPENSGGDLLLSSE--FLKECVEQFSVPS 281
+ D Y N + + +L + N + +S F + + F+ P
Sbjct: 319 LIWGGKDILGYRKKEHNLGILLIALFDFYGRKLNNWDVGISCNSARTFFLKTDKNFANPD 378
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT--YGARKLGHILSQ-- 337
R + L I DP+ +N++G+ + N+++++SAF+ Y ++++
Sbjct: 379 RAY----------LLAIQDPMVPDNDIGK--NSFNYFKVKSAFSKAYSVLTDANLITSLG 426
Query: 338 PEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
P S+ + + S LDR G + + D
Sbjct: 427 PNRSILGTIVRPDSVLLDRKGWNKDATIPD 456
>gi|310799736|gb|EFQ34629.1| hypothetical protein GLRG_09773 [Glomerella graminicola M1.001]
Length = 756
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 114/275 (41%), Gaps = 56/275 (20%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNV 109
V+P E R +++ ++R ++ Y C+V PFGS YLP D+DL +
Sbjct: 439 VRPRDFEHEMRTQLVERLRRSLKTSHFYKDCDVRPFGSYMSGLYLPTADMDLVVCARSWL 498
Query: 110 EEALANDVCSVLERE-----DQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQ 162
+ A +N R QNK + + + A+V LVK + + + VDISF++
Sbjct: 499 DGAHSNFFGMKALRNFGKFLAQNKVTHYNTMEF-IASAKVPLVKYIDNITGLRVDISFDR 557
Query: 163 LGG-------------------LSTLC--FL------EQVDRLIGKDHLFKRSIILIKAW 195
L G L T+ FL E V+ IG + + +++
Sbjct: 558 LDGPQAVKTFAEWKEQYPAMPILVTMIKHFLAMRGLNEPVNGGIGSFTVTCMVVSMLQLM 617
Query: 196 CYYESRILGAHHGLISTYALETLVLYKFLD-YFSKFDWDSYCISLNGP-----VRI---- 245
+SR L H L ++ +FLD Y ++FD+ + I +N P VR+
Sbjct: 618 PQVQSRNLIPEHHLGE-------MMMEFLDLYGNRFDYVNTAIRMNPPGYVHKVRVREVV 670
Query: 246 -SSLPEVVVETPENSGGDLLLSSEFLKECVEQFSV 279
++ + V P N D+ S +E+FSV
Sbjct: 671 YKNMDRISVIDPNNPANDISGGSSNAGRILEEFSV 705
>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
latipes]
Length = 679
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 124/302 (41%), Gaps = 52/302 (17%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGG----- 106
+ P E+ R V+D ++ +I + + EV FGS YLP DIDL FG
Sbjct: 197 ISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVVFGKWETLP 256
Query: 107 -LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQL 163
+EEAL +R +K+A V L +A V ++K V + VDISFN
Sbjct: 257 LWTLEEALR-------KRNVADKSAIKV-----LDKATVPIIKLTDSVTEVKVDISFNVE 304
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
G+ +++ K + ++++K + + + G I +Y+L L+ F
Sbjct: 305 SGVKAARLIKEFKE---KYPVLPYLVLVLKQFL-LQRDLNEVFTGGIGSYSL-FLMAVSF 359
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVV--------VETPENSGGDLLLSSEFLKECVE 275
L F D ++N V + E+ GG + E K ++
Sbjct: 360 LQLH--FREDVCSPNINIGVLLIEFFELYGRHFNYLKTGIRIKDGGSYVAKDEVQKNMMD 417
Query: 276 QFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHIL 335
+ P L I DPL+ +N++GRS S G +++ AF Y L H +
Sbjct: 418 --------------GYRPSMLYIEDPLQPDNDVGRS-SYGAM-QVKQAFEYAFVVLHHAV 461
Query: 336 SQ 337
SQ
Sbjct: 462 SQ 463
>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
Length = 887
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/302 (21%), Positives = 113/302 (37%), Gaps = 61/302 (20%)
Query: 62 RRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R V+ + + + + G +V FGS YLP DID+ G +E N
Sbjct: 165 RHDVVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPTSDIDMVVLGE-KIEPRYGNPQNGPH 223
Query: 122 ER-ED----QNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVD 176
R +D Q A + +K + ++ ++ +I VDISFN G++ + ++
Sbjct: 224 YRLQDRLLKQGIAERYSIKVIDSAAVPIIKMRDMITDIKVDISFNMKTGVTAIGLVKGYI 283
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV----------------- 219
R R ++L+ + + G IS+Y L +V
Sbjct: 284 RQFPA----LRYLVLVLKQFLLQRDMNEVWTGGISSYGLILMVVSFLQHQGADNTGDDVN 339
Query: 220 ----LYKFLDYFS-KFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECV 274
L KFL ++ +F++ CI + +GG + E +
Sbjct: 340 LGVLLIKFLRFYGMEFEYSKCCIRV------------------KNGGQFIKKEEMATQMK 381
Query: 275 EQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHI 334
E + P + P L+I DPL +N++GR+ ++ AF + R L
Sbjct: 382 EAPTGP---------KYVPNFLSIEDPLTPSNDVGRASHGAE--NVKDAFLFAYRVLDRG 430
Query: 335 LS 336
+S
Sbjct: 431 VS 432
>gi|297597347|ref|NP_001043830.2| Os01g0672700 [Oryza sativa Japonica Group]
gi|56201854|dbj|BAD73304.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
gi|56201907|dbj|BAD73357.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
gi|255673541|dbj|BAF05744.2| Os01g0672700 [Oryza sativa Japonica Group]
Length = 578
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 85/390 (21%), Positives = 158/390 (40%), Gaps = 58/390 (14%)
Query: 5 RDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEY-WQRAEEATQGIIAQ----------- 52
R SP+P + S S P+ Q A G+E W R + + Q
Sbjct: 98 RALSPQP-----APEATLSGSLAPAEQPAQGSERAWFRGGRRFRSPMLQLHKEILDFCDF 152
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGG-LNVE 110
+ P+ + R A + V +I++ + C+V FGS +LP DID+ F +
Sbjct: 153 ISPSAEEQSSRTAAVKAVSNVIKHIWPQCKVEVFGSFRTGLFLPTSDIDVVIFDSRVKTP 212
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLST 168
+ + L ++ K + + K A V +VK + + I DISF+ GG
Sbjct: 213 QVGLYALAKALSQKGVAKKIQVIAK------ARVPIVKFVERKSEIAFDISFDMDGGPQA 266
Query: 169 LCFLEQVDRLIGKDHLFK----RSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFL 224
F+ KD++ K R + +I ++ + + G I +YAL T+++
Sbjct: 267 ADFI--------KDYVKKFPALRHLCMILKVFLHQRELNEVYTGGIGSYALLTMLITHLQ 318
Query: 225 DYFSKFDWDSY-CISLNGPVRISSLPEVVVETPENSGGDLLLSSE--FLKECVEQFSVPS 281
+ D Y N + + +L + N + +S F + + F+ P
Sbjct: 319 LIWGGKDILGYRKKEHNLGILLIALFDFYGRKLNNWDVGISCNSARTFFLKTDKNFANPD 378
Query: 282 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT--YGARKLGHILSQ-- 337
R + L I DP+ +N++G+ + N+++++SAF+ Y ++++
Sbjct: 379 RAY----------LLAIQDPMVPDNDIGK--NSFNYFKVKSAFSKAYSVLTDANLITSLG 426
Query: 338 PEESLTDELRKFFSNTLDRHGSGQRPDVQD 367
P S+ + + S LDR G + + D
Sbjct: 427 PNRSILGTIVRPDSVLLDRKGWNKDATIPD 456
>gi|170109615|ref|XP_001886014.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164638944|gb|EDR03218.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 397
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 137/329 (41%), Gaps = 82/329 (24%)
Query: 39 WQRAEEATQGIIAQVQ-------PTVVSEERRKAVIDYVQRLIR-NYLGCEVFPFGSVPL 90
W + + + + A+V+ P+ V +E R ++ + ++ ++ V PFGS
Sbjct: 92 WDKHKNVAEMLHAEVKAFVHWISPSPVEDEVRGLIVTQISNTVKASFPDARVLPFGSYET 151
Query: 91 KTYLPDGDIDLTAFGG-------LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAE 143
K YLP GDIDL +NV ALAN L+R + K A+
Sbjct: 152 KLYLPLGDIDLVILSDSMAYSNKVNVLHALAN----TLKRSGVTSHVTIIAK------AK 201
Query: 144 VKLVKCLVQN--IVVDISFNQLGGLSTL----CFLEQV--DRLIGKDHLFKRSIILIKAW 195
V +VK + + VDIS NQ GL + FL+ + + GK + RS++++
Sbjct: 202 VPIVKFVTTHGRFHVDISLNQSNGLLSGKIINGFLKDMHGNGAEGKGSMALRSLVMVTKA 261
Query: 196 CYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVET 255
+ + + G + +Y++ L + FL K NG +
Sbjct: 262 FLTQRSMNEVYTGGLGSYSIVCLAV-SFLQMHPKI--------RNGEI-----------D 301
Query: 256 PENSGGDLLLSSEF----------------LKECVEQFSVPSRG-FDTNSRSFPPKHLNI 298
PE + G +L+ EF L++ FS RG +D + R L++
Sbjct: 302 PEKNLG--VLAMEFFELYGCYFNYDEVGISLRDGGMYFSKRKRGWYDYDRRGI----LSL 355
Query: 299 VDPLKENNNLGRSVSKGN--FYRIRSAFT 325
DP +N+ +SKG+ F+++R+AF
Sbjct: 356 EDPADPSND----ISKGSYGFHKVRTAFA 380
>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 316
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 88/187 (47%), Gaps = 17/187 (9%)
Query: 39 WQ--RAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGSVPLKTYLP 95
WQ E +T + V P+ +E R V ++ +I+ C VF FGS LP
Sbjct: 10 WQLSMTETSTHDFVNFVTPSKEDKEIRNKVATSIEEVIKGVFPDCHVFVFGSCATGLNLP 69
Query: 96 DGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN-- 153
+ DIDL + E + V + R+ + K + V+K+ + V L+K
Sbjct: 70 NSDIDLIVYQPDVSESRMITKVADAIVRQKKCKTID-VLKNTK-----VPLIKITDSEFG 123
Query: 154 IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGA-HHGLIST 212
+ VDISFN+ G+ + ++Q+ ++ + K ++++K C+ +SR L + G + +
Sbjct: 124 VNVDISFNRTNGVYCVKLVKQLLQMFPE---LKPLMMVLK--CFLKSRQLNEPYSGGVGS 178
Query: 213 YALETLV 219
+ L +V
Sbjct: 179 FLLTMMV 185
>gi|299752783|ref|XP_002911796.1| Trf5 [Coprinopsis cinerea okayama7#130]
gi|298409998|gb|EFI28302.1| Trf5 [Coprinopsis cinerea okayama7#130]
Length = 816
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 62/134 (46%), Gaps = 9/134 (6%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYL 94
AE + EA + + PT V +E R ++ + +++ + V PFGS K YL
Sbjct: 275 AEMMHKEVEA---FVKWISPTPVEDEIRGLIVKQIAVTVQSKFPDASVLPFGSYETKLYL 331
Query: 95 PDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN- 153
P GDIDL L+ A +N V + + K A + + +A V +VK + +
Sbjct: 332 PMGDIDLVI---LSESMAYSNKVSVLHTLANTLKRAGITSRVTVIAKARVPIVKFVTTHG 388
Query: 154 -IVVDISFNQLGGL 166
VDIS NQ GL
Sbjct: 389 RFNVDISINQENGL 402
>gi|84468450|dbj|BAE71308.1| hypothetical protein [Trifolium pratense]
Length = 518
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 125/280 (44%), Gaps = 36/280 (12%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVE 110
+ PT + +R A I+ V +I++ + C+V FGS YLP DID+ GL
Sbjct: 122 LSPTPEEKAKRDAAIESVFEVIKHIWPHCQVEIFGSFRTGLYLPTSDIDVVILKSGLPNP 181
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLST 168
+ N + L + K + + K A V ++K + + + DISF+ G
Sbjct: 182 QIGLNAISRSLSQRSMAKKIQVIGK------ARVPIIKFVEKKSGLSFDISFDIDNGPKA 235
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG-AHHGLISTYALETLVLYKFLDYF 227
++++ + K + +++K + + R L + G I +YAL T+++ +
Sbjct: 236 AEYIQEA---VAKWPQLRPLCLILK--VFLQQRELNEVYSGGIGSYALLTMLMAMLRN-- 288
Query: 228 SKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQ---FSVPSRGF 284
+ + P +L ++V + G L +S+ C+ + F SRGF
Sbjct: 289 ---------VRQSQPTAEHNLGVLLVHFFDFYGRK-LNTSDVGVSCIGEGTFFRKSSRGF 338
Query: 285 DTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAF 324
+R F L I DP +N++G+ + N++++RSAF
Sbjct: 339 YNKTRPF---LLGIQDPQTPDNDIGK--NSFNYFQVRSAF 373
>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
Length = 833
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 67/311 (21%), Positives = 115/311 (36%), Gaps = 79/311 (25%)
Query: 62 RRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R V+ + + + + G +V FGS YLP DID+ G +E N
Sbjct: 111 RHDVVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPTSDIDMVVLGE-KIEPRYGNPQNGPH 169
Query: 122 ER-ED----QNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVD 176
R +D Q A + +K + ++ ++ +I VDISFN G++ + ++
Sbjct: 170 YRLQDRLLKQGIAERYSIKVIDSAAVPIIKMRDMITDIKVDISFNMKTGVTAIGLVKGYI 229
Query: 177 R---------LIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV-------- 219
R L+ K L +R + + W G IS+Y L +V
Sbjct: 230 RQFPALRYLVLVLKQFLLQRD--MNEVWT-----------GGISSYGLILMVVSFLQHQG 276
Query: 220 -------------LYKFLDYFS-KFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLL 265
L KFL ++ +F++ CI + +GG +
Sbjct: 277 ADNTADDVNLGVLLIKFLRFYGMEFEYSKCCIRV------------------KNGGQFIK 318
Query: 266 SSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT 325
E + E + P + P L+I DPL +N++GR+ ++ AF
Sbjct: 319 KEEMATQMKESPTGP---------KYVPNFLSIEDPLTPSNDVGRASHGAE--NVKDAFL 367
Query: 326 YGARKLGHILS 336
+ R L +S
Sbjct: 368 FAYRVLDRGVS 378
>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
Length = 535
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 129/307 (42%), Gaps = 46/307 (14%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + + P E+ R V+ +QR+I++ + EV FGS YLP DIDL
Sbjct: 52 EEIKDFYEYISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLV 111
Query: 103 AFGG------LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--I 154
FG +EEAL + A E +K L +A V ++K + + +
Sbjct: 112 VFGNWETLPLWTLEEAL----------RKRKVADENSIK--VLDKATVPIIKLMDSHTEV 159
Query: 155 VVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA 214
VDISFN G+ ++ + + ++L+ + + G I +Y+
Sbjct: 160 KVDISFNVQSGVKAANLIKDYK----QQYPVLPYLVLVLKQFLLQRELNEVFTGGIGSYS 215
Query: 215 LETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGG--DLLLSSEFLKE 272
L L+ FL + D +S + P +L +++E E G + L + +K+
Sbjct: 216 L-FLMAVSFLQLHCRED-----VSSSNP----NLGVLLIEFFELYGRHFNYLKTGIRIKD 265
Query: 273 ---CVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGAR 329
V + V D + P L I DPL+ N++GRS S G +++ AF Y
Sbjct: 266 GGSYVAKDEVQKSMLD----GYRPSMLYIEDPLQPGNDVGRS-SYGAM-QVKEAFDYAYV 319
Query: 330 KLGHILS 336
L H +S
Sbjct: 320 ILSHAVS 326
>gi|302405651|ref|XP_003000662.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
gi|261360619|gb|EEY23047.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
Length = 723
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 97/218 (44%), Gaps = 40/218 (18%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIR---NYLGCEVFPFGSVPLKTYLPDGDID-------- 100
V+P + R +I+ ++ +R Y GCEV PFGS YLP D+D
Sbjct: 417 HVRPRDFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSGLYLPTADMDIVICSKEW 476
Query: 101 ----LTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNI 154
+TAF G + + QN+ A+ + + +A V LVK + V +
Sbjct: 477 LSGRMTAFPGGSSLYKFRGFLT-------QNRLADPSSVEV-IAKARVPLVKYIDAVTGL 528
Query: 155 VVDISFNQLGGLSTL-CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 213
VDISF+++ G + + FL+ ++ L + +IK + + +G I ++
Sbjct: 529 RVDISFDRMDGPAAIKTFLDWKEQYPALPIL----VTIIKHFLAMRG-LNEPVNGGIGSF 583
Query: 214 ALETLV--------LYKFLD-YFSKFDWDSYCISLNGP 242
+ + LV L +F D Y ++FD+ + I +N P
Sbjct: 584 SSKNLVPEHHLGEMLMEFFDLYGNRFDYKTTAIRINPP 621
>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
Length = 464
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 88/195 (45%), Gaps = 27/195 (13%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYL 94
AE QR EA I + PT +E R ++ + R + + + +V PFGS K YL
Sbjct: 138 AEMLQRDVEA---FIDYISPTPAEDEIRGLIVQLISRAVTQAFPDAQVLPFGSYETKLYL 194
Query: 95 PDGDIDLT------AFGG-LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLV 147
P GDIDL A+ + V ALAN + R V K A+V ++
Sbjct: 195 PLGDIDLVIQSPSMAYSDKVTVLHALAN----TMRRAGITDRVTIVAK------AKVPII 244
Query: 148 KCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGA 205
K + + VDIS NQ G++ + ++R + + + +++ KA+ S +
Sbjct: 245 KFITTHGRFAVDISLNQTNGVAA---GKMINRYLRELPALRGLVMITKAFLSQRS-MNEV 300
Query: 206 HHGLISTYALETLVL 220
+ G + +Y++ L +
Sbjct: 301 YTGGLGSYSIVCLAI 315
>gi|325181595|emb|CCA16045.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
gi|325191995|emb|CCA26462.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
Length = 494
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 81/170 (47%), Gaps = 15/170 (8%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R+ +I ++ L+ N + V FGS + +LP DIDL FG +E+L + + L
Sbjct: 147 RENLIAQMKNLVSNLWPRAAVETFGSHETQMFLPQSDIDLVIFGAPTGKESLFV-LAAEL 205
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLI 179
E D E + K A + +VK + +N I VDISFN GL+T ++Q R+
Sbjct: 206 EARDMVSYLEVIDK------ARIPIVKFVDKNSAIQVDISFNISSGLATADLIKQYMRIF 259
Query: 180 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSK 229
R ++L+ + + + G I ++ L+ +V+ FL + +
Sbjct: 260 PS----FRPLVLVLKYFLAQRELNETFQGGIGSFLLQLMVV-SFLQQYRR 304
>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
Length = 527
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 79/169 (46%), Gaps = 13/169 (7%)
Query: 66 VIDYVQR-LIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLERE 124
VI +QR L + V+ FGS + YLP GDIDL NV + ++ ++ E
Sbjct: 2 VISLLQRALCSKWPDARVYSFGSQDTQLYLPQGDIDLVVLS--NVMNDMPREI-TLSEMA 58
Query: 125 DQNKAAEFVVKDAQLIRAEVKLVK--CLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKD 182
++ + + L RA+V ++K C VDIS NQ GL F V+ + K
Sbjct: 59 ACLRSYQLAIHVQVLARAKVPIIKFVCPYGQFNVDISINQANGLQASKF---VNGWLKKQ 115
Query: 183 HLFKRSIILIKAWCYYESRILG-AHHGLISTYALETLVLYKFLDYFSKF 230
+ +++IK + + R L + G + +Y++ TL++ FL K
Sbjct: 116 PAIRPLVMVIKQ--FLQQRALSEVYTGGLGSYSV-TLMVLSFLQLHPKL 161
>gi|350415058|ref|XP_003490519.1| PREDICTED: PAP-associated domain-containing protein 5-like [Bombus
impatiens]
Length = 572
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 130/319 (40%), Gaps = 45/319 (14%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + A + P+ R V+ ++++I + + +V FGS YLP DIDL
Sbjct: 123 EEIEDFFAYMCPSNEEHSLRMRVVKRIEQVIYDLWQDSKVEVFGSFRTGLYLPTSDIDLV 182
Query: 103 AFGGLNVEEALANDVCSVLERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDI 158
G N LER DQN A +K L +A V +VK + I VDI
Sbjct: 183 VIG------MWTNLPLRTLERALLDQNIAEPSSIK--VLDKASVPIVKLTDKETEIKVDI 234
Query: 159 SFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 218
SFN G+ + E ++ + + ++ ++++K + + + G IS+Y+L L
Sbjct: 235 SFNMNNGVKS---AELINSFKKRFPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSL-IL 289
Query: 219 VLYKFLDYFSKFDWDSYCISLNGPVRISSLPE--------VVVETPENSGGDLLLSSEFL 270
+ FL + ++YC S N V + E V GG + E
Sbjct: 290 MTISFLQLHPR--QNAYCSSANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEEVQ 347
Query: 271 KECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
++ ++ P L I DPL N++GRS S G Y ++ AF +
Sbjct: 348 RDMID--------------GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKDAFDWAYYV 391
Query: 331 LGHILSQPEESLTDELRKF 349
L +S P L ++ K
Sbjct: 392 LSQAVS-PLNILVNDANKI 409
>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium
dendrobatidis JAM81]
Length = 752
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 57/125 (45%), Gaps = 9/125 (7%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V+PT RK I V+++++ + EV FGS K YLP D+D+ G V
Sbjct: 189 VRPTEAEHSLRKLTIARVRKIVKQIWADAEVHVFGSFQTKLYLPSSDVDIVVVGDSCVLP 248
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGLSTL 169
+ E+ D E + K +V ++K + + + +DISFN + G+ +
Sbjct: 249 KCLRQLAKAFEKADTLSRMEVIEK------TKVPIIKGVDKLTHFSLDISFNMVNGIKSA 302
Query: 170 CFLEQ 174
+++
Sbjct: 303 NIVKR 307
>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
Length = 667
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 80/174 (45%), Gaps = 10/174 (5%)
Query: 50 IAQVQPTVVSEERRKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
I ++PT RK++I + R IR + V+ FGS + YLP DID+
Sbjct: 240 INYLEPTPQEHAVRKSLITKLDRAIRAKWPEVTVYVFGSFETRLYLPTSDIDMVVMSSDT 299
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGL 166
V + S L R +N + + + A V ++K + I VD+SFNQ GGL
Sbjct: 300 VHRGTKKHMYS-LARHLKN--CKLATEIQVITTANVPIIKFVDPFTRIHVDVSFNQPGGL 356
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
T C + V+ + K + +L+K + + + G +S+YA+ LV+
Sbjct: 357 KT-CLV--VNGFLKKFPAVRPLTMLVKHFLNMRA-LNEVFLGGLSSYAIVCLVV 406
>gi|392595411|gb|EIW84734.1| hypothetical protein CONPUDRAFT_47123 [Coniophora puteana
RWD-64-598 SS2]
Length = 663
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 68/137 (49%), Gaps = 15/137 (10%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYL 94
AE + R EA + + PT + +E R V+ V + + + + +V PFGS K YL
Sbjct: 148 AEMFHREVEA---FVDYMSPTSIEDEIRGLVVKLVGKAVTSAFPDAKVLPFGSYGTKLYL 204
Query: 95 PDGDIDLTAFGG---LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV 151
P GDIDL + ++ + + +VL+R A K + +A+V +VK +
Sbjct: 205 PSGDIDLVIESDSMQYVPKNSVLHSLANVLKR------AGIADKVTIIAKAKVPIVKFIT 258
Query: 152 QN--IVVDISFNQLGGL 166
++ + VDIS NQ GL
Sbjct: 259 RHGRLNVDISINQSNGL 275
>gi|226504162|ref|NP_001145652.1| uncharacterized protein LOC100279144 [Zea mays]
gi|195659215|gb|ACG49075.1| hypothetical protein [Zea mays]
Length = 103
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 10/75 (13%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
M ++ + SP P A P SSS P + W+R E AT ++ ++ PTV S+
Sbjct: 1 MVNIHERSPVP--ACVPAHPDPSSSISPDD--------WRRLEGATFSVMCKIHPTVSSQ 50
Query: 61 ERRKAVIDYVQRLIR 75
R VIDYVQRL R
Sbjct: 51 HLRARVIDYVQRLFR 65
>gi|242212981|ref|XP_002472321.1| predicted protein [Postia placenta Mad-698-R]
gi|220728598|gb|EED82489.1| predicted protein [Postia placenta Mad-698-R]
Length = 1512
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 87/198 (43%), Gaps = 28/198 (14%)
Query: 36 AEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYL 94
AE R EA + + PT +E R V+ + R + R + +V PFGS K YL
Sbjct: 157 AEMLHRDVEA---FVKYISPTPEEDEVRSLVVTLISRAVTRAFPDAQVLPFGSYETKLYL 213
Query: 95 PDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ-- 152
P G+ + +V ALAN V K A + + +A+V +VK +
Sbjct: 214 PIGNKE-------SVLHALANTV----------KRAGITDRVKIIAKAKVPIVKFVTTHG 256
Query: 153 NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
+ VDIS NQ G++ ++ + + + I++IK++ S + + G + +
Sbjct: 257 HFSVDISVNQGNGVTAGKMIKH---YLAELPALRSLILVIKSFLSQRS-MNEVYTGGLGS 312
Query: 213 YALETLVLYKFLDYFSKF 230
Y++ L + FL K
Sbjct: 313 YSIVCLAI-SFLQMHPKI 329
>gi|357130698|ref|XP_003566984.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Brachypodium distachyon]
Length = 619
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 81/370 (21%), Positives = 150/370 (40%), Gaps = 66/370 (17%)
Query: 23 SSSSVPSNQTAIGAEY-WQRAEEATQGIIAQ-----------VQPTVVSEERRKAVIDYV 70
SSS P+ + A G+E W R + + Q + P+ + R A + V
Sbjct: 145 SSSLAPAEKPAQGSERAWFRGGRRFRSPMLQLHKEILDFCDFISPSAEEQSSRTAAVQAV 204
Query: 71 QRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVEEALANDVCSVLEREDQNK 128
++++ + C+V FGS YLP DID+ F + + + L ++ K
Sbjct: 205 SDVVKHIWPHCKVEVFGSFRTGLYLPTSDIDVVIFESRVKTPQVGLYALAKALSQKGVAK 264
Query: 129 AAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFK 186
+ + K A V +VK + V I DISF+ GG F++ R +
Sbjct: 265 KIQVIAK------ARVPIVKFVERVSGIPFDISFDIDGGPQAADFIKDAIRKMPA----L 314
Query: 187 RSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRIS 246
R + +I ++ + + G + +YAL T+++ + D Y
Sbjct: 315 RPLCMILKVFLHQRELNEVYTGGVGSYALLTMLITHLQLIWGVKDMLGY----------- 363
Query: 247 SLPEVVVETPENSGGDLLLS-SEFLKECVEQFSVPSRGFDTNS-RSF------------P 292
++ E++ G LL+ +F + + V G NS R+F
Sbjct: 364 ------RQSKEHNLGILLVKFFDFYGRKLNNWDV---GISCNSARTFFLKSDKDFVNLDR 414
Query: 293 PKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT--YGARKLGHILSQ--PEESLTDELRK 348
P + I DP+ +N++G+ + N+++++SAF+ Y +++ P S+ + +
Sbjct: 415 PHLIAIQDPMVPDNDIGK--NSFNYFKVKSAFSKAYSVLTDAKLITSLGPNRSILGAIVR 472
Query: 349 FFSNTLDRHG 358
S LDR G
Sbjct: 473 PDSVLLDRKG 482
>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Metaseiulus occidentalis]
Length = 512
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 54 QPTVVSEERRKAVIDYVQRLIRNY---LGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVE 110
+PT + R+ V++ V+ ++R CEVF GS YLP DIDL G + E
Sbjct: 106 KPTRTEHQVRQEVVNRVKEVVRQLWPQAQCEVF--GSFCTGLYLPTSDIDLVILG--DWE 161
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLST 168
+ L +E A+ V D RA V +VK Q N+ VDISFNQ G+ +
Sbjct: 162 TLPMFTLHKALIQEKIASASTIKVLD----RASVPIVKFTEQSTNVKVDISFNQKNGVKS 217
Query: 169 LCFLE 173
++
Sbjct: 218 AKLIK 222
>gi|389642869|ref|XP_003719067.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|351641620|gb|EHA49483.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|440474598|gb|ELQ43333.1| DNA polymerase sigma [Magnaporthe oryzae Y34]
gi|440486580|gb|ELQ66430.1| DNA polymerase sigma [Magnaporthe oryzae P131]
Length = 703
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 119/298 (39%), Gaps = 58/298 (19%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN--YLGCEVFPFGSVPLKTYLPDGDIDL-----TAFG 105
+P E+ R+ +ID + +LIRN + V+PFGS YLP GD+DL +
Sbjct: 385 AKPRDFEEKLRQGLIDELAKLIRNSQFRDATVYPFGSFKSNLYLPTGDMDLVFCSDSYMS 444
Query: 106 GLNVEEALANDVC---SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISF 160
G + N V + +ER+ Q V K + +A V LVK + + VD+SF
Sbjct: 445 GRAARYSSKNHVFKFGAFIERK-QLAVDNHVEK---ISKARVPLVKYVDSRTGLKVDVSF 500
Query: 161 NQLGGLSTL-CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV 219
+ G+ + FL ++ L + IK + A+ G+ T + +V
Sbjct: 501 ENITGIRAIETFLAWREQFPDMPVL----VTCIKHFLAMRGLNEPANGGIGGTTVICLVV 556
Query: 220 LYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSV 279
+ L+ V+ S+ TPE+ G LLL F +FS
Sbjct: 557 ---------------SMLQLSPDVQSRSM------TPESHLGQLLL--RFFDLYGNRFSY 593
Query: 280 PSRGFDTNSRSFPPK------------HLNIVDPLKENNNLGRSVSKGNFYRIRSAFT 325
N + PK L+I+DP N++ S GN I++AF+
Sbjct: 594 DRVAISMNPPRYIPKSQVTNIVYRNTDRLSIIDPNNPENDI--SGGSGNIRTIKAAFS 649
>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
Length = 360
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 87/176 (49%), Gaps = 20/176 (11%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNV- 109
+ P++ + R+ I+ ++R I + + EVF FGS + YLPDGDIDL +N
Sbjct: 86 ISPSLTEHKTREYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGDIDLVVMRKSVNQY 145
Query: 110 -EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGL 166
++++ + + S+L + + ++ + + K A V ++K +DIS NQ G+
Sbjct: 146 NKQSMLHTMASMLRQANLAQSIQVISK------ARVPIIKFTSSFGGYPIDISLNQTNGV 199
Query: 167 STLCFLEQV-DRLIGKDHLFKRSIILIKAWCYYESRILG-AHHGLISTYALETLVL 220
+ ++ DR L +L+K C+ R + + G +S+Y++ LV+
Sbjct: 200 DAGRMVNEILDRYPAARPLS----MLLK--CFLSQRSMNEVYTGGVSSYSVICLVV 249
>gi|383852647|ref|XP_003701838.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Megachile rotundata]
Length = 573
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 125/306 (40%), Gaps = 44/306 (14%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + A + P+ R V+ ++++I + + +V FGS YLP DIDL
Sbjct: 123 EEIEDFFAYMCPSNEEHSLRIRVVKRIEQVIYDLWPDSKVEVFGSFRTGLYLPTSDIDLV 182
Query: 103 AFGGLNVEEALANDVCSVLERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDI 158
G N LER DQN A +K L +A V +VK + I VDI
Sbjct: 183 VIG------MWTNLPLRTLERALLDQNIAEPSSIK--VLDKASVPIVKLTDKETEIKVDI 234
Query: 159 SFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 218
SFN G+ + E ++ + + ++ ++++K + + + G IS+Y+L L
Sbjct: 235 SFNMSNGVKS---AELINSFKKRYPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSL-IL 289
Query: 219 VLYKFLDYFSKFDWDSYCISLNGPVRISSLPE--------VVVETPENSGGDLLLSSEFL 270
+ FL + ++YC + N V + E V GG + E
Sbjct: 290 MTISFLQLHPR--QNAYCSNANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEEVQ 347
Query: 271 KECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
++ ++ P L I DPL N++GRS S G Y ++ AF +
Sbjct: 348 RDMID--------------GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKDAFDWAYYV 391
Query: 331 LGHILS 336
L +S
Sbjct: 392 LSQAVS 397
>gi|430813412|emb|CCJ29233.1| unnamed protein product [Pneumocystis jirovecii]
Length = 398
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 93/206 (45%), Gaps = 33/206 (16%)
Query: 53 VQPTVVSEERRKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVE 110
+ PT R+ V+ + L+ +++ ++ FGS YLP DIDL G +
Sbjct: 97 ISPTKEEHFVRELVVQRINALVQKHWKNVQLCAFGSFDTMLYLPTSDIDLVILSLGPRIY 156
Query: 111 EALANDVCSVLEREDQNKAAEF-----VVKDAQLIRAE----VKLVKCLVQNIVVDISFN 161
E R+D +K + + V KD Q+I +K + L Q I VDISFN
Sbjct: 157 ET----------RKDLHKLSRYLRCSNVAKDIQVITGASVPIIKFIDTLTQ-IHVDISFN 205
Query: 162 QLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHH-GLISTYALET--- 217
+ GGL + ++Q + + + K ++ IK + R L H L+S E
Sbjct: 206 KPGGLVSANIIKQ---YMKEHYALKPLVMFIKH--FLNMRGLNERHPKLLSKEIKEQDNL 260
Query: 218 -LVLYKFLDYFSK-FDWDSYCISLNG 241
++L +F + + K F+++ IS+N
Sbjct: 261 GVLLMEFFELYGKLFNYNEVGISINN 286
>gi|66557991|ref|XP_625041.1| PREDICTED: PAP-associated domain-containing protein 5-like [Apis
mellifera]
Length = 539
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 125/306 (40%), Gaps = 44/306 (14%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + A + P+ R V+ ++++I + + +V FGS YLP DIDL
Sbjct: 123 EEIEDFFAYMCPSNEEHSLRIRVVKRIEQVIYDLWPDSKVEVFGSFRTGLYLPTSDIDLV 182
Query: 103 AFGGLNVEEALANDVCSVLERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDI 158
G N LER DQN A +K L +A V +VK + I VDI
Sbjct: 183 VIG------MWTNLPLRTLERALLDQNIAEPSSIK--VLDKASVPIVKLTDKETEIKVDI 234
Query: 159 SFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 218
SFN G+ + E ++ + + ++ ++++K + + + G IS+Y+L L
Sbjct: 235 SFNMNNGVKS---AELINSFKKRFPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSL-IL 289
Query: 219 VLYKFLDYFSKFDWDSYCISLNGPVRISSLPE--------VVVETPENSGGDLLLSSEFL 270
+ FL + ++YC + N V + E V GG + E
Sbjct: 290 MTISFLQLHPR--QNAYCSNANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEEVQ 347
Query: 271 KECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
++ ++ P L I DPL N++GRS S G Y ++ AF +
Sbjct: 348 RDMID--------------GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKDAFDWAYYV 391
Query: 331 LGHILS 336
L +S
Sbjct: 392 LSQAVS 397
>gi|167384281|ref|XP_001736885.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
gi|165900593|gb|EDR26889.1| PAP-associated domain-containing protein, putative [Entamoeba
dispar SAW760]
Length = 400
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 111/267 (41%), Gaps = 51/267 (19%)
Query: 79 GCEVFPFGSVPLKTYLPDGDIDLTAFGG-LNVEEALANDVCS----VLEREDQN-KAAEF 132
G + PFGS K +LP DID + N + L + VLE + +N KA+
Sbjct: 52 GYNIMPFGSTQSKLFLPTSDIDFSVITNEYNTRKVLNSISSILSSYVLEDQKRNFKASVP 111
Query: 133 VVK--DAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSII 190
V+K D Q + IV+DIS N G T+ F+E++ I KD +R ++
Sbjct: 112 VLKLTDKQTL-------------IVLDISHNNTSGTKTVDFIEEI---IKKDDRIRRLVL 155
Query: 191 LIKA-WCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLP 249
LIK+ C Y+ +G + TY++ +V YC N +
Sbjct: 156 LIKSILCCYDFH--QPANGGLGTYSVFVMV---------------YCYVNNNNITTHDYG 198
Query: 250 EVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPK--HLNIVDPLKENNN 307
E++ + G D + +E SV F+ R++ K +L+I DP +N+
Sbjct: 199 ELLKGFLKYYGID-------FRSDIEGLSVFEGKFNRGERNWDSKISNLSIEDPCDLSND 251
Query: 308 LGRSVSKGNFYRIRSAFTYGARKLGHI 334
+ + + + + +Y A H+
Sbjct: 252 VSITSFRWQYIKYLFKMSYNALHYTHL 278
>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
Length = 422
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 132/322 (40%), Gaps = 62/322 (19%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + I +QPT + R V+ ++++++ + ++ FGS YLPDGDID+
Sbjct: 90 EEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKELWPSAKLETFGSYNTGLYLPDGDIDMV 149
Query: 103 AFG---GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIV--VD 157
G L + + V + RE+ E +A V ++K + N + VD
Sbjct: 150 IQGQWEQLPMWQLRNKLVERRIAREENITVIE---------KAVVPIIKLIESNTLVHVD 200
Query: 158 ISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 217
ISFN G V + + + K+ ++L+K + + G + +YAL T
Sbjct: 201 ISFNTSNGREAAAL---VKKYMAEYPNLKQLVVLLK-YILNHRGLNEVWKGGLGSYAL-T 255
Query: 218 LVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQF 277
L++ FL S+ + E EN G LL EF + QF
Sbjct: 256 LLVVNFLQQHSRKN--------------------AKEDGENLGVLLL---EFFELYGRQF 292
Query: 278 SVPSRGFDTNSRS-FPP-----KHLN----------IVDPLKENNNLGRSVSKGNFYRIR 321
+ + G + + P K +N I DPL N++GRS + + ++
Sbjct: 293 NYETCGIRIRDEAGYIPIDTLRKQMNAHGTKYGPLCIEDPLNTTNDVGRSTFQ--WKHVQ 350
Query: 322 SAFTYGARKLGHIL-SQPEESL 342
+ F + RKL L QP+ ++
Sbjct: 351 ACFDHCCRKLKKALEEQPDPAM 372
>gi|413924677|gb|AFW64609.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 146
Score = 47.4 bits (111), Expect = 0.030, Method: Composition-based stats.
Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 10/75 (13%)
Query: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSE 60
M ++ + SP P A P SSS P + W+R E AT ++ ++ PTV S+
Sbjct: 44 MVNIHERSPVP--ACVPAHPDPSSSISPDD--------WRRLEGATFSVMCKIHPTVSSQ 93
Query: 61 ERRKAVIDYVQRLIR 75
R VIDYVQRL R
Sbjct: 94 HLRARVIDYVQRLFR 108
>gi|402083045|gb|EJT78063.1| DNA polymerase sigma [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 732
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 93/229 (40%), Gaps = 50/229 (21%)
Query: 53 VQPTVVSEERRKAVID-YVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V+PT E+ R++++D V + R + V+PFGS YLP GD+DL ++
Sbjct: 413 VKPTHFEEKLRQSLVDELVTHVRRTWNDASVYPFGSFKSGLYLPTGDMDLV----FCSDK 468
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDA--------QLIRAEVKLVKCL--VQNIVVDISFN 161
L+ + ++ A FV K ++ +A V LVK + + VDISF
Sbjct: 469 YLSRHIAQYTPKKQVFHFARFVEKRGLAHQHRVERIHKARVPLVKYVDARTGLKVDISFE 528
Query: 162 QLGGLSTL-CFL--------------------------EQVDRLIGKDHLFKRSIILIKA 194
G++ + FL E V+ +G + + +++
Sbjct: 529 NSTGITAVNTFLAWKEEFPAMPILVTVIKHFLAMRGLNEPVNGGLGGFSVICLVVSMLQM 588
Query: 195 WCYYESR-ILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISLNGP 242
+SR IL H L L+L+ F Y +KF + ISL P
Sbjct: 589 MPEVQSRAILPGQH-------LGELLLHFFDLYGNKFQYQKMAISLKPP 630
>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
Length = 759
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 89/221 (40%), Gaps = 28/221 (12%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT----AFGGL 107
++P E R ++++++ R + EV+PFGS P YLP D+DL ++
Sbjct: 404 IKPRDFEERLRGELVEHLKTFCRKTFKDAEVYPFGSFPSGLYLPTADMDLAFISDSYAKG 463
Query: 108 NVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGG 165
V + +N + + ++ A+V LVK + + VDISF G
Sbjct: 464 GVPRYGTKSFLYRFRSQLKNHRIAWEDEIELIVGAKVPLVKFIEHRTGLKVDISFENRTG 523
Query: 166 LSTL----CFLEQVDRLIGKDHLFKRSIIL-------------IKAWCYYESRILGAHH- 207
L+ + + EQ + L K +++ C S +
Sbjct: 524 LTAIETFKAWREQYPGMPALVTLIKHFLLMRGLNEPVNGGIGGFSVICLVVSMLQMMPEV 583
Query: 208 ---GLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRI 245
L + + L L+L+ F Y +KF++ + IS+N P I
Sbjct: 584 QSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISMNPPRYI 624
>gi|147787660|emb|CAN69576.1| hypothetical protein VITISV_028613 [Vitis vinifera]
Length = 192
Score = 47.0 bits (110), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 25/38 (65%)
Query: 230 FDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSS 267
DWDS+C+SL GPV ISSLP+ E P +LLL S
Sbjct: 147 IDWDSFCVSLWGPVPISSLPDATTEPPRQGSRELLLDS 184
>gi|452839453|gb|EME41392.1| hypothetical protein DOTSEDRAFT_46399 [Dothistroma septosporum
NZE10]
Length = 754
Score = 47.0 bits (110), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 79/185 (42%), Gaps = 24/185 (12%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIR-------NYLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
V+P EE R +ID +QR ++ N EV FGS P YLP D+DL A
Sbjct: 333 VRPHRHEEELRAGIIDRLQRDLQYFRQIGPNVNKIEVRSFGSFPAGLYLPTADMDLVALS 392
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKD--------AQLIRAEVKLVKCLVQN--IV 155
++ L +C + R+ K ++ + A +I A+V LVK + + I
Sbjct: 393 SDYLDHGLKR-LCQI--RKHMWKMSDHFNRSRLPAPGTVAPVIGAKVPLVKFVDGHTGIK 449
Query: 156 VDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
VD+SF GL+ +Q KD+ +++I + H G I + +
Sbjct: 450 VDLSFENDSGLTANQTFQQWK----KDYPEMPVLVMIIKQMLAMRGLNEVHTGGIGGFTI 505
Query: 216 ETLVL 220
LV+
Sbjct: 506 ICLVV 510
>gi|384485719|gb|EIE77899.1| hypothetical protein RO3G_02603 [Rhizopus delemar RA 99-880]
Length = 494
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 21/117 (17%)
Query: 62 RRKAVIDYVQRLIRNY--------------LGCEVFPFGSVPLKTYLPDGDIDLTAFGGL 107
+RK VID +Q ++ N+ + C + PFGS L Y+ D DIDL +
Sbjct: 31 KRKNVIDLLQHILVNFQRAVTKDLDWKRGDIECFLSPFGSYALGGYIRDADIDLVLVCPI 90
Query: 108 NVEEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLVQNIVVDISFNQL 163
V L ++ + +V + + I +A V ++KC + NI +DISF +L
Sbjct: 91 QVLRKYFFKFFPQLLKQ------QTLVSNVESIQKANVPIIKCTIDNISIDISFVRL 141
>gi|58260578|ref|XP_567699.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57229780|gb|AAW46182.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 779
Score = 46.6 bits (109), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 81/199 (40%), Gaps = 30/199 (15%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V PT E R +I+ + R I + EV PFGS + YLP GDIDL +
Sbjct: 154 VSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHKYLSD- 212
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL-------------------VQ 152
AN + E + A A + RA V ++K + V
Sbjct: 213 --ANKQRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGKSHVSSLEYFSKQEGVG 270
Query: 153 NIVVDISFNQLGGLSTLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
I VDIS NQ G++ + Q +D L G R +ILI + + + + G +
Sbjct: 271 KINVDISLNQANGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQRSMNEVYTGGLG 325
Query: 212 TYALETLVLYKFLDYFSKF 230
+Y++ +V+ FL K
Sbjct: 326 SYSVICMVI-SFLQLHPKL 343
>gi|440296452|gb|ELP89279.1| PAP-associated domain containing protein, putative [Entamoeba
invadens IP1]
Length = 344
Score = 46.6 bits (109), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 118/293 (40%), Gaps = 54/293 (18%)
Query: 60 EERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT-AFGGLNVEEALANDV 117
+E R+ V +L+ N Y GCEV +GS LP DIDL +F EE N V
Sbjct: 32 QELRQISYQKVSQLLTNRYPGCEVTIYGSYVSGFSLPSSDIDLVLSFS----EEVSKNQV 87
Query: 118 CSVLER-EDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQ 174
+L + ++++F+ + + A+V ++K L + I +D+S N GG+ +
Sbjct: 88 KKLLFKISTICRSSKFLRVEDVITNAKVPIIKLLDLDTTISIDLSINCEGGIDS----SA 143
Query: 175 VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDS 234
+ + F + I L + +++ + +HG I +YA+ L+ FL ++
Sbjct: 144 LTHSLLTSSQFTQEIALFVKYLVFQNNLNEPYHGGIGSYAI-VLLTATFLKFY------- 195
Query: 235 YCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFD--------T 286
P++S G L+ EFL F + G
Sbjct: 196 ---------------------PQHSLGRALV--EFLNFYGNIFKMGKTGVSYQHGFFSLV 232
Query: 287 NSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPE 339
F L I DP E NN+GRS K N ++ F + I+ PE
Sbjct: 233 EKNLFEEDSLVIEDPCDEGNNVGRSSFKFN--AVQFLFKKTLMGINLIIKNPE 283
>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
Length = 361
Score = 46.6 bits (109), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 76/172 (44%), Gaps = 15/172 (8%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRNYLG-CEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
+ PT RR+ I V+ I+ + ++ FGS K YLP+ DID+ +
Sbjct: 75 IIPTSEEHRRREQAIMRVETFIKEFASEVDIQAFGSFKTKLYLPNADIDVVMIDKSMSAK 134
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKC--LVQNIVVDISFNQLGGLSTL 169
L V L + D+ + + A+V ++K + DISFNQ+ GL
Sbjct: 135 ELYKKVAQSLMKSDKFENVNLIA------NAKVPIIKFFEVESQYQFDISFNQMDGLKQ- 187
Query: 170 CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG-AHHGLISTYALETLVL 220
++++ + FK I+++K C + R L + G I ++ L ++L
Sbjct: 188 --IDEIRKAFTIYPEFKYLIMILK--CMLKQRELNETYSGGIGSFLLFQMIL 235
>gi|134117055|ref|XP_772754.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255372|gb|EAL18107.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 779
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 81/199 (40%), Gaps = 30/199 (15%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V PT E R +I+ + R I + EV PFGS + YLP GDIDL +
Sbjct: 154 VSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHKYLSD- 212
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL-------------------VQ 152
AN + E + A A + RA V ++K + +
Sbjct: 213 --ANKQRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGKSHVSSLEYFSKQEGIG 270
Query: 153 NIVVDISFNQLGGLSTLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
I VDIS NQ G++ + Q +D L G R +ILI + + + + G +
Sbjct: 271 KINVDISLNQANGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQRSMNEVYTGGLG 325
Query: 212 TYALETLVLYKFLDYFSKF 230
+Y++ +V+ FL K
Sbjct: 326 SYSVICMVI-SFLQLHPKL 343
>gi|307168873|gb|EFN61797.1| PAP-associated domain-containing protein 5 [Camponotus floridanus]
Length = 643
Score = 45.8 bits (107), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 75/307 (24%), Positives = 124/307 (40%), Gaps = 44/307 (14%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDL 101
E + A + P+ R V+ ++++I + + +V FGS YLP DIDL
Sbjct: 188 HEEIEDFFAYMCPSHEEHVLRIRVVKRIEQVIYDLWPNSKVEVFGSFRTGLYLPTSDIDL 247
Query: 102 TAFGGLNVEEALANDVCSVLERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVD 157
G N LER DQN A +K L +A V +VK + I VD
Sbjct: 248 VVIG------MWTNLPLRTLERALLDQNIAEPSSIK--VLDKASVPIVKLTDKESEIKVD 299
Query: 158 ISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALET 217
ISFN G+ + + + + + ++ ++++K + + + G IS+Y+L
Sbjct: 300 ISFNMNNGVKSADLINSFKK---RYPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSL-I 354
Query: 218 LVLYKFLDYFSKFDWDSYCISLNGPVRISSLPE--------VVVETPENSGGDLLLSSEF 269
L+ FL + ++YC + N V + E V GG + E
Sbjct: 355 LMTISFLQLHPR--QNAYCSNANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEEV 412
Query: 270 LKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGAR 329
++ ++ P L I DPL N++GRS S G Y ++ AF +
Sbjct: 413 QRDMID--------------GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKDAFDWAYF 456
Query: 330 KLGHILS 336
L +S
Sbjct: 457 VLSQAVS 463
>gi|224128147|ref|XP_002329093.1| predicted protein [Populus trichocarpa]
gi|222869762|gb|EEF06893.1| predicted protein [Populus trichocarpa]
Length = 543
Score = 45.8 bits (107), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 147/383 (38%), Gaps = 93/383 (24%)
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFG-GLNVEEALANDVCS 119
E + V D ++ + N C+V FGS YLP DID+ G GL + N +
Sbjct: 148 EAVRCVFDVIKYIWPN---CKVEVFGSFRTGLYLPTSDIDVVILGSGLKSPQIGLNALSR 204
Query: 120 VLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVD 176
L ++ V K Q+I RA V +VK + + + DISF+ GG F++
Sbjct: 205 ALSQKG-------VAKKIQVIARARVPIVKFVEKRSGVSFDISFDVNGGPIAAEFIKNA- 256
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILG-AHHGLISTY------------------ALET 217
I K + +++K + + R L + G IS+Y +LE
Sbjct: 257 --ISKWPELRPLCLILKV--FLQQRELNEVYSGGISSYALLAMLMAMLQNHRECQASLER 312
Query: 218 ---LVLYKFLDYFS-KFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKEC 273
L+L F D++ K + + +S G + F
Sbjct: 313 NLGLLLIHFFDFYGRKLNTTNVGVSCKG------------------------TGTF---- 344
Query: 274 VEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGH 333
FS ++GF N R F + I DP N++G+ + N+++IRSAF
Sbjct: 345 ---FSKRTKGFMNNGRPF---LIAIEDPQAPENDIGK--NSFNYFQIRSAFAMAF----T 392
Query: 334 ILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQ 393
L+ P+ L+ + T+ R DPV L R G TFS L
Sbjct: 393 TLTNPKTILSLGPNRSILGTIIR---------PDPVLLERKGGKNGEVTFSS--LLPGAG 441
Query: 394 TIYESEPNSSGITENCRIDDEAE 416
+S I N ++DDE E
Sbjct: 442 EPLQSNYGQQEILCNWQLDDEEE 464
>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC
30864]
Length = 558
Score = 45.8 bits (107), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 71/302 (23%), Positives = 120/302 (39%), Gaps = 31/302 (10%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDL 101
E+ + ++PT + + R+ ++ ++ +I + V FGS YLP DID+
Sbjct: 213 EQEMYDFVEFIKPTPLEHQMREEIVQRIREVITGAWKHARVEVFGSFATGLYLPMSDIDI 272
Query: 102 TAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFN 161
FG N ++ + +LE K + + K + I +KL L + VDISFN
Sbjct: 273 VVFG--NWDQIPLFTLGKLLEESRIAKNVKVIDKTSVPI---IKLADAL-SGVFVDISFN 326
Query: 162 QLGGLSTLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
GL T+ F+ VD HL +IK + + ++ + G + +Y++ LV+
Sbjct: 327 LESGLRTVEFIRACVDEYRMLYHL----TFVIKQFL-AQRQLNEPYSGGLGSYSVVLLVV 381
Query: 221 YKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVP 280
FL + D D + ++VE E G D + P
Sbjct: 382 -SFLQRHPRQDRD------------PNFGVLLVEFFELYGKDFNYRKVGIAVTEGGRYFP 428
Query: 281 SRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEE 340
NS P I DP++ N++G N +R AF + L + +
Sbjct: 429 KSDASANSNEVRP---FIQDPMEPGNDVGYKTY--NMIAVRDAFRHAYDTLTRVTTDQHS 483
Query: 341 SL 342
SL
Sbjct: 484 SL 485
>gi|432884542|ref|XP_004074488.1| PREDICTED: uncharacterized protein LOC101158959 [Oryzias latipes]
Length = 421
Score = 45.8 bits (107), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 56/119 (47%), Gaps = 9/119 (7%)
Query: 63 RKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
RK V+ ++ +I+ + +V FGS YLP DIDL FG E ++ L
Sbjct: 263 RKEVVKRIETIIKEQWPSADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQELEQAL 320
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQVDRL 178
+ N A F +K L +A V ++K Q + VDISFN G+ F++ +L
Sbjct: 321 RK--HNVAEPFSIK--VLDKATVPIIKLTDQETEVKVDISFNVETGVKAASFIKDYVKL 375
>gi|440796505|gb|ELR17614.1| nucleotidyltransferase domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 911
Score = 45.8 bits (107), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V P+ ++ R+ VI + +++ + ++ FGS YLP DIDL G
Sbjct: 272 VSPSAEEKQMREDVIARISKVVETLWPSVQLRVFGSCATDIYLPTSDIDLCIMGANACSP 331
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
+ +++ S L R + +I+ CL VDISF+ G + +
Sbjct: 332 SPIDELASALRRRSMGRVQAIATARVPIIKLVDAATGCL-----VDISFDVPTGPAHINL 386
Query: 172 LEQVDRLIGKDHLFKRSIILIKAWCYYESR--ILGAHHGLISTYALETLVL 220
++ R + ++ K +LIK YY + + + G + +YAL +++
Sbjct: 387 IK---RYLDEEPSVKPLALLIK---YYLKQFGMNEPYTGGLGSYALIIMII 431
>gi|405123317|gb|AFR98082.1| DNA polymerase sigma [Cryptococcus neoformans var. grubii H99]
Length = 649
Score = 45.4 bits (106), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 137/337 (40%), Gaps = 65/337 (19%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V PT E R +I+ + R I + EV PFGS + YLP GDIDL +
Sbjct: 19 VSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHKYLSD- 77
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL-------------------VQ 152
AN + E + A A + RA V ++K + +
Sbjct: 78 --ANKQRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGESHVTSLADSSKQGAIG 135
Query: 153 NIVVDISFNQLGGLSTLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
I VDIS NQ G++ + Q +D L G R +ILI + + + + G +
Sbjct: 136 KINVDISLNQGNGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQRSMNEVYTGGLG 190
Query: 212 TYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSG-----GDLLLS 266
+Y++ +V+ FL K LN L +++E E G D+ +S
Sbjct: 191 SYSVICMVI-SFLQLHPKLRRSEINPELN-------LGTLLIEFFELFGRNFNYNDVGIS 242
Query: 267 SEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTY 326
++ FS SRG+ +SF L+I DP ++N++ S ++
Sbjct: 243 ---IRRGGFYFSKASRGW-MKGQSF---LLSIEDPQDKDNDI-------------SGGSF 282
Query: 327 GARKLGHILSQPEESLTDELRKF-FSNTLDRHGSGQR 362
G R++ + L E L+ +R F + + R+G G R
Sbjct: 283 GIRQVRNTLGGAYELLS--MRLFERAEEMSRNGRGGR 317
>gi|301093772|ref|XP_002997731.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
gi|262109980|gb|EEY68032.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
Length = 489
Score = 45.4 bits (106), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 78/180 (43%), Gaps = 32/180 (17%)
Query: 85 FGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEV 144
FGS + +LP DID+ FG +E L + LE +D+ E + K A +
Sbjct: 165 FGSHYTQMFLPQSDIDMVLFGVPEGKEPLYK-LAQCLEEKDRVSYLEVIDK------ARI 217
Query: 145 KLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQVDR---------LIGKDHLFKRSI---- 189
+VK + + +I VD+SFN GGL+T ++ R L+ K + +R +
Sbjct: 218 PIVKMVHKGSDIHVDVSFNVAGGLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETY 277
Query: 190 ----------ILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISL 239
+++ ++ + R LGA H L L++ F Y F++ +S+
Sbjct: 278 SGGVGSFLLQMMVVSFLQHHGRALGAEHDDPKFNNLGQLLMGFFTLYGRDFNYTDLAVSV 337
>gi|307200518|gb|EFN80680.1| PAP-associated domain-containing protein 5 [Harpegnathos saltator]
Length = 636
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 123/306 (40%), Gaps = 44/306 (14%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E + A + P+ R V+ ++ +I + + +V FGS YLP DIDL
Sbjct: 181 EEIEDFFAYMCPSHEEHVLRMRVVKRIEYVIYDLWPDSKVEVFGSFRTGLYLPTSDIDLV 240
Query: 103 AFGGLNVEEALANDVCSVLERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDI 158
G N LER DQN A +K L +A V +VK + I VDI
Sbjct: 241 VIG------MWKNLPLRTLERALLDQNIAEPSSIK--VLDKASVPIVKLTDKESEIKVDI 292
Query: 159 SFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 218
SFN G+ + + R + + ++ ++++K + + + G IS+Y+L L
Sbjct: 293 SFNMNNGVKSAELINSFKR---QYPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSL-IL 347
Query: 219 VLYKFLDYFSKFDWDSYCISLNGPVRISSLPE--------VVVETPENSGGDLLLSSEFL 270
+ FL + ++YC + N V + E V GG + E
Sbjct: 348 MTISFLQLHPR--QNAYCSNANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEEVQ 405
Query: 271 KECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
++ ++ P L I DPL N++GRS S G Y ++ AF +
Sbjct: 406 RDMID--------------GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKDAFDWAYFV 449
Query: 331 LGHILS 336
L +S
Sbjct: 450 LSQAVS 455
>gi|356569346|ref|XP_003552863.1| PREDICTED: poly(A) RNA polymerase cid11-like [Glycine max]
Length = 415
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 43/187 (22%), Positives = 79/187 (42%), Gaps = 20/187 (10%)
Query: 49 IIAQVQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDIDLT--- 102
I+ V P E R A+I+ ++ ++ + G V PFGS + GD+D++
Sbjct: 14 ILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNLFTRWGDLDISIEL 73
Query: 103 ------AFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVV 156
+ G ++ DV L + +F+ R + K Q +
Sbjct: 74 SNGLHISSAGKKQKQTFLGDVLKALRMKGGGSNLQFISNA----RVPILKFKSYRQGVSC 129
Query: 157 DISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 216
DIS N L G L ++++ G+ F+ ++L+K W +I + G ++Y+L
Sbjct: 130 DISINNLPGQMKSKILLWINKIDGR---FRHMVLLVKEWAKAH-KINNSKAGTFNSYSLS 185
Query: 217 TLVLYKF 223
LV++ F
Sbjct: 186 LLVIFYF 192
>gi|321263807|ref|XP_003196621.1| hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
gi|317463098|gb|ADV24834.1| Hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
Length = 784
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 137/337 (40%), Gaps = 65/337 (19%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V PT E R +I+ + R I + EV PFGS + YLP GDIDL +
Sbjct: 154 VSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHKYLSD- 212
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL-------------------VQ 152
AN + E + A A + RA V ++K + +
Sbjct: 213 --ANKQRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGKSHVFSLAYLTKQEGIG 270
Query: 153 NIVVDISFNQLGGLSTLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
I VDIS NQ G++ + Q +D L G R +ILI + + + + G +
Sbjct: 271 KINVDISLNQGNGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQRSMNEVYTGGLG 325
Query: 212 TYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSG-----GDLLLS 266
+Y++ +V+ FL K LN L +++E E G D+ +S
Sbjct: 326 SYSVICMVI-SFLQLHPKLRRSEINPELN-------LGTLLIEFFELFGRNFNYNDVGIS 377
Query: 267 SEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTY 326
++ FS SRG+ +SF L+I DP ++N++ S ++
Sbjct: 378 ---IRRGGFYFSKASRGW-MKGQSF---LLSIEDPQDKDNDI-------------SGGSF 417
Query: 327 GARKLGHILSQPEESLTDELRKF-FSNTLDRHGSGQR 362
G R++ + L E L+ +R F + ++R+ G R
Sbjct: 418 GIRQVRNTLGGAYELLS--MRLFEIAEEMNRNARGGR 452
>gi|406604992|emb|CCH43591.1| Poly(A) RNA polymerase protein 1 [Wickerhamomyces ciferrii]
Length = 624
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 79/185 (42%), Gaps = 20/185 (10%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF- 104
+ IA + P+ E R + ++ I + CEV FGS YLP DID+
Sbjct: 217 KDFIAYISPSKEEIELRNNTVRKLREAIMELWPDCEVHVFGSYATDLYLPGSDIDMVIVS 276
Query: 105 --GGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISF 160
GG +L + + S L+R++ K E + K A+V ++K NI +D+SF
Sbjct: 277 EHGGYESRNSLYS-LSSFLKRKNLAKNVEVIAK------AKVPIIKFTESTSNIHIDVSF 329
Query: 161 NQLGGLSTLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV 219
+ G+ + + G R I+LI ++ H G + Y++ LV
Sbjct: 330 ERTNGIDAAKTIRSWITETPG-----LREIVLIVKQFLSSRKLNNVHVGGLGGYSIICLV 384
Query: 220 LYKFL 224
Y FL
Sbjct: 385 -YSFL 388
>gi|330792667|ref|XP_003284409.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
gi|325085656|gb|EGC39059.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
Length = 1460
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 83/169 (49%), Gaps = 16/169 (9%)
Query: 66 VIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGG--LNVEEALANDVCSVLE 122
VI +++ +++ + ++ FGS +LP DID+ G ++++ A + VLE
Sbjct: 1092 VIGWIRAVVKKLWSHADLDLFGSFMTGLWLPSSDIDIVVNYGNNMSIKPKNAQFLLKVLE 1151
Query: 123 REDQNKAAEFVVKDAQLIRAEVKLVKCL-VQNIVVDISFNQ------LGGLSTLCFLEQV 175
++ +N F++ + A++ ++K + +NI VDISF + G + + V
Sbjct: 1152 KQIRNDLDGFILSMVCIPSAKIPVIKLVTTENISVDISFRESPTSIHTGIAARDLIADCV 1211
Query: 176 DRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFL 224
++G L+ +I+L W E + + G +S+Y L L+L FL
Sbjct: 1212 KDVVG---LYPLAIVL--KWFLRERGLNNTYTGGLSSYCL-VLMLISFL 1254
>gi|449017212|dbj|BAM80614.1| hypothetical protein CYME_CMK272C [Cyanidioschyzon merolae strain
10D]
Length = 1647
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 25/50 (50%)
Query: 146 LVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAW 195
+V+C + N L CFL + D LIG+ HL R +IL+K W
Sbjct: 1160 VVRCRTNGLTTQFLLNPAVALCRSCFLVECDELIGRRHLLIRCLILLKVW 1209
>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
[Sarcophilus harrisii]
Length = 615
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 116/308 (37%), Gaps = 66/308 (21%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGL---- 107
+ P E+ R V++ ++ +I+ + +V FGS YLP DIDL FG
Sbjct: 127 MSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP 186
Query: 108 --NVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQL 163
+EEAL + ED K L +A V ++K + VDISFN
Sbjct: 187 LWTLEEALRKHKVA---DEDSVKV---------LDKATVPIIKLTDSFTEVKVDISFNVQ 234
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
G+ Q+ + K + ++L+ + + G I +Y+L L+ F
Sbjct: 235 NGVKA----AQLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSL-FLMAVSF 289
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
L + D CI P + G LL+ EF + F+ G
Sbjct: 290 LQLHPRED---ACI------------------PNTNYGVLLI--EFFELYGRHFNYLKTG 326
Query: 284 FDTNS---------------RSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGA 328
+ P L I DPL+ N++GRS S G +++ AF Y
Sbjct: 327 IRIKDGGSYVAKDEVQKNMLDGYRPSMLYIEDPLQPGNDVGRS-SYGAM-QVKQAFDYAY 384
Query: 329 RKLGHILS 336
L H +S
Sbjct: 385 VVLSHAVS 392
>gi|389748468|gb|EIM89645.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
Length = 479
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 60/133 (45%), Gaps = 6/133 (4%)
Query: 40 QRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGD 98
Q + + PT V +E R V+ +QR I + + +V FGS K YLP GD
Sbjct: 101 QLLHREVDAFVRYISPTPVEDEIRSLVVLQIQRCISSKFPDAKVRSFGSYETKLYLPLGD 160
Query: 99 IDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVV 156
IDL ++ A ++ V + + + A + + + +A+V +VK + V
Sbjct: 161 IDLVI---ISKSMAYSDRVTVLHAVANTLRTAGITDRVSVIAKAKVPIVKFVTTFGRFAV 217
Query: 157 DISFNQLGGLSTL 169
DIS N G+ +
Sbjct: 218 DISINMSNGVEAI 230
>gi|345493399|ref|XP_001604785.2| PREDICTED: PAP-associated domain-containing protein 5-like [Nasonia
vitripennis]
Length = 462
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 124/309 (40%), Gaps = 46/309 (14%)
Query: 41 RAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDI 99
R E + + PT R VI ++ +I + + +V FGS YLP DI
Sbjct: 45 RLHEEIEDFFTYMCPTNEEHLLRVKVIKRIENVIYDLWPDSKVEIFGSFRTGLYLPTSDI 104
Query: 100 DLTAFGGLNVEEALANDVCSVLERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIV 155
DL G N LER DQN VK L RA V +VK + I
Sbjct: 105 DLVVIG------MWTNLPLHTLERALIDQNIVEPSSVK--VLDRASVPIVKLTDRETEIK 156
Query: 156 VDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
VDISFN G+ + ++ R + ++ ++++K + + + G IS+Y+L
Sbjct: 157 VDISFNMNNGVKSAELIKTFKR---QYPALEKLVMVLKQFL-LQRDLNEVFTGGISSYSL 212
Query: 216 ETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLP-------EVVVETPENSGGDLLLSSE 268
L+ FL + + S ++L G + I L V GG + E
Sbjct: 213 -ILMTISFLQLHPRNNISSPDVNL-GVLLIEFLELYGRKFNYVKTGIRVKGGGTYISKEE 270
Query: 269 FLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGA 328
+E ++ P L I DPL N++GRS S G Y ++ AF +
Sbjct: 271 VQREMID--------------GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKDAFDWAY 314
Query: 329 RKLGHILSQ 337
++LSQ
Sbjct: 315 ----YVLSQ 319
>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
CCMP526]
Length = 419
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 85/174 (48%), Gaps = 17/174 (9%)
Query: 53 VQPTVVSEERRKAVI----DYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
+ PT E R+ V D V++L ++ +V FGS K +LPD DID+ +
Sbjct: 78 LAPTRAELEARQKVTRISADTVKKLWPSF---DVHVFGSEATKVFLPDSDIDMVVLPPTD 134
Query: 109 VE-EALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLVQNIVVDISFNQLGGL 166
+ + ++ ++ E Q ++ V ++I +A V +VK QN+ VDISF+ GL
Sbjct: 135 LPLHQIRKNLFTLAEAFKQEES----VSGMEIISQARVPIVKLRFQNLQVDISFSSDSGL 190
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
+ ++ ++++ L R +IL+ + + + + G ++ L+ +V+
Sbjct: 191 KSARYM--LEKMEAMPPL--RPLILVLKYFLAQRELNQTYMGGCGSFLLQLMVI 240
>gi|340371638|ref|XP_003384352.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Amphimedon queenslandica]
Length = 462
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 18/110 (16%)
Query: 61 ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120
E+ KA+I ++ + +V+PFGS YLP DID+ G E LA + S+
Sbjct: 119 EKVKAII------LKLWPRAQVYPFGSFCTNLYLPTSDIDIVVLG-----EWLALPLFSL 167
Query: 121 LEREDQNKAAEFVVKDAQLI--RAEVKLVKCLVQ--NIVVDISFNQLGGL 166
ED A+ ++D+ ++ + V ++K + + VDISFNQ G+
Sbjct: 168 ---EDAFLKAQIAIEDSIMVLDKTTVPIIKFTDRETEVKVDISFNQETGI 214
>gi|401826816|ref|XP_003887501.1| DNA polymerase sigma [Encephalitozoon hellem ATCC 50504]
gi|395460019|gb|AFM98520.1| DNA polymerase sigma [Encephalitozoon hellem ATCC 50504]
Length = 354
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 46/199 (23%), Positives = 89/199 (44%), Gaps = 12/199 (6%)
Query: 17 GERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-R 75
G SS S + +N +++ ++ + + ++ PT + R + + +++LI R
Sbjct: 17 GHMLSSIESLLDTNMSSVSLGNLEKLDLELLQLYQEIAPTQIEINSRMYIFERIKKLIVR 76
Query: 76 NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVK 135
V PFGS +P DID+ G++ ++ AN S + ++ A+FV K
Sbjct: 77 ELPSANVVPFGSHTTGLIVPSSDIDVNVQLGIDTDKEYANRYLSKI--KNLMMGADFVKK 134
Query: 136 DA--QLIRAEVKLVKC--LVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSI-I 190
+ + + + ++K + +DIS NQ G+ F+ R K+H + R I
Sbjct: 135 ETLFHIRKCRIPILKLRDRIFGFRIDISVNQENGVEAAKFI----RYTLKEHPYIRVFAI 190
Query: 191 LIKAWCYYESRILGAHHGL 209
L+K + ++ A GL
Sbjct: 191 LLKHFLTIRNQSDAATGGL 209
>gi|348528609|ref|XP_003451809.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Oreochromis niloticus]
Length = 481
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 99/220 (45%), Gaps = 30/220 (13%)
Query: 134 VKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIIL 191
V+ QLIRA+V +++ + ++ D++ N G+ L D + I++
Sbjct: 247 VERNQLIRAKVPILRFREKGSDLEFDLNVNNTVGIRNTFLLRSY---AYADLRVRPMILV 303
Query: 192 IKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDS----YCISLNGPVRISS 247
IK W Y + I A G +S+Y L +VL+ +L S+ S Y S N + +
Sbjct: 304 IKKWARYNN-INDASKGTLSSYTLVLMVLH-YLQTLSEPVLPSLQRDYPESFNPLMDLDM 361
Query: 248 LPEVVVETP------ENSGGDLLLSSEFLKECVEQFS--------VPSRGF-DTNSRSFP 292
+PE P ++S G+LLL FLK +FS +R F NS+ +
Sbjct: 362 VPEGPKHIPPYISRNKSSLGELLLG--FLKYYATEFSWDKQVISVREARAFPKNNSKEWN 419
Query: 293 PKHLNIVDPLKENNNLGRSV-SKGNFYRIRSAFTYGARKL 331
K + + +P E NN+ R+V K F I++ F R L
Sbjct: 420 NKFICVEEPF-ERNNVARAVHEKLKFDAIKAKFAESCRLL 458
>gi|312077329|ref|XP_003141256.1| PAP/25A associated domain-containing protein [Loa loa]
gi|307763579|gb|EFO22813.1| PAP/25A associated domain-containing protein [Loa loa]
Length = 419
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 74/333 (22%), Positives = 134/333 (40%), Gaps = 48/333 (14%)
Query: 28 PSNQTAIGAEYWQR--------AEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYL 78
P TA+ A + +R E + A ++P+ + + R V + V+ ++ R +
Sbjct: 43 PRENTALIAPWCRRRYALSLRGLHEELLDLYAWLKPSPLEKALRLRVFERVRGVLQRIWP 102
Query: 79 GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQ 138
++ FGS+ +LP DID+ L EE L+ ++
Sbjct: 103 TAKIDVFGSLYTSLFLPTSDIDVVVESDLVSEEPPLWKTAVALKESGITESINV------ 156
Query: 139 LIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWC 196
L +A V +VK + ++ I +DISFN + G+ + F+E + + ++L+
Sbjct: 157 LDKAFVPIVKMVDKDTKIYLDISFNTVQGVRSARFIEDMK----MRYPVLEPLVLVLKQF 212
Query: 197 YYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCIS-LNGPVRISSLPEVV--- 252
+ ++ G +S+Y L L+L FL +D+ I+ +N V + S ++
Sbjct: 213 LMQRQLNQVFTGGLSSYGL-ILMLISFLQLHPSYDYSYKGITEVNMGVLLLSFLQLYGQE 271
Query: 253 -----VETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNN 307
+SGG + E L Q + PS L I DPL+ N+
Sbjct: 272 FNYMKTALRIHSGGAYVCKDEILV----QMNRPSNSM-----------LCIEDPLQPGND 316
Query: 308 LGRSVSKGNFYRIRSAFTYGARKLGHILSQPEE 340
+GR N +R AF + L + + E
Sbjct: 317 IGR--CSHNIQLVRQAFEHAFATLCAVFVRSRE 347
>gi|348687890|gb|EGZ27704.1| hypothetical protein PHYSODRAFT_343641 [Phytophthora sojae]
Length = 501
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 93/224 (41%), Gaps = 37/224 (16%)
Query: 44 EATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLT 102
E ++ + PT R +I+ ++ +++ + V FGS + +LP DID+
Sbjct: 134 EEIMDFVSFISPTEQELSSRAELIEEMREIVKGLWPEATVETFGSHYTQMFLPQSDIDMV 193
Query: 103 AFGGLNVEEALAN--DVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDI 158
FG V E A + LE ++ E + K A + +VK + +I VD+
Sbjct: 194 LFG---VPEGKAPLFKLAQCLEEKELVSYLEVIDK------ARIPIVKMVHKASDIHVDV 244
Query: 159 SFNQLGGLSTLCFLEQVDR---------LIGKDHLFKRSI--------------ILIKAW 195
SFN GGL+T ++ R L+ K + +R + +++ ++
Sbjct: 245 SFNVAGGLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETYTGGVGSFLLQMMVVSF 304
Query: 196 CYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCISL 239
+ R LGA H L L++ F Y F++ IS+
Sbjct: 305 LQHHGRALGAEHDDPKFNNLGQLLMGFFTLYGRDFNYTDLAISV 348
>gi|303323645|ref|XP_003071814.1| PAP/25A associated domain containing protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240111516|gb|EER29669.1| PAP/25A associated domain containing protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 680
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 60/289 (20%), Positives = 118/289 (40%), Gaps = 47/289 (16%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V+P + R +I +RL++N + G ++ FGS YLP D+DL + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317
Query: 112 A---LANDVCSVLEREDQNKAAEFVVKDA--QLIRAEVKLVKCL--VQNIVVDISFNQLG 164
L + + + E V + + A V ++K + + + VD+SF+
Sbjct: 318 GRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDNNS 377
Query: 165 GLSTLCFLEQVDRLIGKDHL--------FKRSIILIKAWCYYESRILGAHHGL-ISTYAL 215
GL+ +Q K+H + +L++ + LG + + T L
Sbjct: 378 GLAANRTFQQ-----WKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLL 432
Query: 216 ETL-----------VLYKFLDYF-SKFDWDSYCISLNGP----------VRISSLPEVVV 253
+ L VL F D++ +KFD+ + I LN P + +S + +
Sbjct: 433 QHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPPGYFHKHTRNIYQANSRDRLSI 492
Query: 254 ETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS---FPPKHLNIV 299
P N D+ +S++ ++ + F+ N RS PP++++++
Sbjct: 493 IDPNNPDNDISVSTKEIRRVFKAFAEAFHTLSQNIRSASFLPPQNISLL 541
>gi|168031583|ref|XP_001768300.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680478|gb|EDQ66914.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 787
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 141/341 (41%), Gaps = 32/341 (9%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF--GGLNV 109
V PT ++ R+ ++ V ++++ + +V FGS YLP D+D+ G +
Sbjct: 214 VAPTEEEQQMRETAVERVSGVVQSIWPHSQVKVFGSFATGLYLPTSDVDVVVLDSGCTAL 273
Query: 110 EEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCL--VQNIVVDISFNQLGGL 166
++ L + L R V K+ Q+I +A V ++K + V NI DISF+ G
Sbjct: 274 QDGL-KALAKALTR-------GHVGKNIQVIGKARVPIIKFVETVSNIPFDISFDVANGP 325
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDY 226
F++ I R + L+ + + + G I +YAL ++L +
Sbjct: 326 EAADFIKAAMGAIPP----LRPLCLVLKIFLQQRELNEVYQGGIGSYALLVMLLTHLQMH 381
Query: 227 FSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFL--KECVEQFSVPSRGF 284
SK S GP ++L ++V+ + G L + + + F RGF
Sbjct: 382 PSKRRVSSRG---QGPPLETNLGILLVDFLDLYGRTLNMKDVGISCRGGGRFFPKRDRGF 438
Query: 285 DTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTD 344
+ + R F L + DP +N++G++ ++RSAF R L ++ + E +
Sbjct: 439 NDSKRPF---LLCVEDPQSPDNDIGKNSYA--IQKVRSAFMMAHRLLTNLSANNEVGILS 493
Query: 345 ELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSG 385
+ + + R + VP R +TF+G
Sbjct: 494 RIVRVDEKLVGRKVPAALTPMAQSVPAKRSR----PATFAG 530
>gi|320035002|gb|EFW16944.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
Length = 680
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 60/289 (20%), Positives = 118/289 (40%), Gaps = 47/289 (16%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V+P + R +I +RL++N + G ++ FGS YLP D+DL + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317
Query: 112 A---LANDVCSVLEREDQNKAAEFVVKDA--QLIRAEVKLVKCL--VQNIVVDISFNQLG 164
L + + + E V + + A V ++K + + + VD+SF+
Sbjct: 318 GRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDNNS 377
Query: 165 GLSTLCFLEQVDRLIGKDHL--------FKRSIILIKAWCYYESRILGAHHGL-ISTYAL 215
GL+ +Q K+H + +L++ + LG + + T L
Sbjct: 378 GLAANRTFQQ-----WKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLL 432
Query: 216 ETL-----------VLYKFLDYF-SKFDWDSYCISLNGP----------VRISSLPEVVV 253
+ L VL F D++ +KFD+ + I LN P + +S + +
Sbjct: 433 QHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPPGYFHKHTRNIYQANSRDRLSI 492
Query: 254 ETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS---FPPKHLNIV 299
P N D+ +S++ ++ + F+ N RS PP++++++
Sbjct: 493 IDPNNPDNDISVSTKEIRRVFKAFAEAFHTLSQNIRSASFLPPQNISLL 541
>gi|242053947|ref|XP_002456119.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
gi|241928094|gb|EES01239.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
Length = 568
Score = 43.9 bits (102), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 120/291 (41%), Gaps = 50/291 (17%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF-GGLNVE 110
+ P+ + R A + V ++++ + C+V FGS YLP DID+ F +
Sbjct: 152 ISPSTEEQSSRTAAVQDVSDVVKHIWPQCKVEVFGSFRTGLYLPTSDIDVVVFESRVKTP 211
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLST 168
+ + L ++ K + + K A V +VK + + I DISF+ GG
Sbjct: 212 QVGLYALAKALSQKGVAKKIQVIAK------ARVPIVKFVERKSGIAFDISFDMDGGPQA 265
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFS 228
F++ + + R + +I ++ + + G I +YAL T+++ +
Sbjct: 266 ADFIKDAVKKLPA----LRPLCMILKVFLHQRELNEVYTGGIGSYALLTMLITHLQLVWG 321
Query: 229 KFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLS-SEFLKECVEQFSVPSRGFDTN 287
D Y ++ E++ G LL+ +F + + V G N
Sbjct: 322 GKDILGYH-----------------QSKEHNLGILLVRFFDFYGRKLNHWDV---GISCN 361
Query: 288 -SRSF------------PPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT 325
SR+F P L I DP+ N++G+ + N+++++SAF+
Sbjct: 362 SSRTFFLKSDKDFMNHDRPHLLAIQDPMVPENDIGK--NSFNYFKVKSAFS 410
>gi|222619531|gb|EEE55663.1| hypothetical protein OsJ_04061 [Oryza sativa Japonica Group]
Length = 461
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 69/320 (21%), Positives = 119/320 (37%), Gaps = 61/320 (19%)
Query: 39 WQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRL------IRNYLGCEVFPFGSVPLKT 92
+ E+ + I++ ++P V ++RRK + +Q L + G PFGS
Sbjct: 30 YDVVEQCVKNILSLIKP--VEDDRRKR-LSAIQELSNSIPKVAALRGAVFKPFGSFVSNL 86
Query: 93 YLPDGDIDLTAFGGLN--VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL 150
Y GD+D++ N + + V L R QN+ V+ R V
Sbjct: 87 YSNSGDLDISVHLPNNSIISKKKKQYVLRELMRVLQNRGVAGYVQFVPFARVPVLQYVSN 146
Query: 151 VQNIVVDISFNQLGGL---STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHH 207
I DIS N G C++ +D G ++LIK W ++ I
Sbjct: 147 TFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------MVLLIKEWAKAQN-INDPKT 199
Query: 208 GLISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSS 267
G +++Y+L LVL+ F + P ++ E G+
Sbjct: 200 GTLNSYSLCLLVLFHFQ---------------------TCEPAILPPLKEIYEGN----- 233
Query: 268 EFLKECVEQFSVPSRGFDTNSRSFPPKHLN-IVDPLKENNNLGRSVSKGNFYRIRSAFTY 326
++E + + +V + +HL+ + DP++ +N R+V RI AFT
Sbjct: 234 --IEEGIAEMTV-----------YDEEHLDEVKDPIERPDNAARAVDLKGLERIAGAFTA 280
Query: 327 GARKLGHILSQPEESLTDEL 346
RK + L + L
Sbjct: 281 ANRKFASLQHAKRNDLLEML 300
>gi|392580130|gb|EIW73257.1| hypothetical protein TREMEDRAFT_22292, partial [Tremella
mesenterica DSM 1558]
Length = 303
Score = 43.9 bits (102), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 83/175 (47%), Gaps = 18/175 (10%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVE- 110
+ PT E R +I+ + R ++ + V PFGS + YLP GDIDL E
Sbjct: 32 MSPTREEYEVRLLIIESITRAVKYKWPEATVTPFGSWQTQLYLPQGDIDLVVTHPTLTEH 91
Query: 111 --EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGL 166
+ L ND+ + + A + +A V ++K + ++ + VDIS NQ+ G+
Sbjct: 92 NKKNLLNDLARTM------RYAMITDNVVVISKARVPIIKFVTKHGKLNVDISLNQVNGI 145
Query: 167 STLCFLEQ-VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
S + Q +D + G L I+++KA+ S + + G + +Y++ LV+
Sbjct: 146 SAGKIINQYLDVIPGARQL----ILVVKAFLSQRS-MNEVYTGGLGSYSVICLVI 195
>gi|221061551|ref|XP_002262345.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193811495|emb|CAQ42223.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 870
Score = 43.5 bits (101), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 12/165 (7%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R+ ++ VQ I+ Y + FGS + L + DID+ + + + + + +
Sbjct: 304 RQKLLKEVQIFIKAVYPQVYLLIFGSCNTELDLYNSDIDICIYNNVENDRTNIRKLYNEM 363
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLI 179
+R + A Q+I A+V ++KC + I +D SFNQ+ + + + +
Sbjct: 364 KRNKLFQNATI----KQIIGAKVPIIKCFFTHIQISIDFSFNQVSAIVSTV---ETQSFL 416
Query: 180 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFL 224
K+ L K +I K E + A G IS++ L L+L KFL
Sbjct: 417 KKNPLIKYVVIFFKI-VLSEYNLNDAFQGGISSFKL-FLILVKFL 459
>gi|359493669|ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing protein 5-like [Vitis
vinifera]
gi|302143015|emb|CBI20310.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 79/361 (21%), Positives = 136/361 (37%), Gaps = 90/361 (24%)
Query: 3 DLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQ-------- 54
D R EP+ A F P +S P+ ++ W R + + ++
Sbjct: 60 DARADVEEPSPARFRTPPPASEEEAPAVESG-----WFRGNSRLRSPMLKLHKEILDFSD 114
Query: 55 ---PTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVE 110
PT + R A I+ V +IR + C+V FGS YLP DID+ G
Sbjct: 115 FLSPTPKEQSARNAAIESVFNVIRYIWPNCKVEVFGSFKTGLYLPTSDIDVVILGSDIKT 174
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLVQ--NIVVDISFNQLGGLS 167
+ L R K + K Q+I +A V ++K + + ++ DISF+ G
Sbjct: 175 PQIG---LYALSRALSQKG---IAKKIQVIAKARVPIIKFIEKRSSVAFDISFDVENGPK 228
Query: 168 TLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG-AHHGLISTYALETL-------- 218
+++ I K + +++K + + R L + G I +YAL +
Sbjct: 229 AAEYIQDA---ISKWPPLRPLCLILK--VFLQQRELNEVYSGGIGSYALLAMLIAMLQNL 283
Query: 219 -------------VLYKFLDYFS-KFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLL 264
+L F D++ K + ++ NGP G
Sbjct: 284 QEWNASVEHNLGVLLVNFFDFYGRKLNTVDIGVTCNGP------------------GTFF 325
Query: 265 LSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAF 324
L S ++GF + F ++I DP N++G+ + N+++IRSAF
Sbjct: 326 LKS-------------TKGFVNKGQKF---LISIEDPQLPGNDIGK--NSFNYFQIRSAF 367
Query: 325 T 325
+
Sbjct: 368 S 368
>gi|359486610|ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera]
gi|296086183|emb|CBI31624.3| unnamed protein product [Vitis vinifera]
Length = 453
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 75/342 (21%), Positives = 119/342 (34%), Gaps = 91/342 (26%)
Query: 64 KAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLT---------AFGGLNVEEALA 114
+ +D V+ L G V PFGS Y GD+D++ + G ++ L
Sbjct: 36 RTAVDSVESL----RGATVEPFGSFLSNLYTQWGDLDISIELPNGAYISSAGKRHKQTLL 91
Query: 115 NDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVK--CLVQNIVVDISFNQLGGLSTLCFL 172
V + L + + +F+ A V ++K NI D+S N L G FL
Sbjct: 92 GHVLNALRSKGGWRKLQFIPN------ARVPIIKFESYHPNISCDVSINNLKGQMKSKFL 145
Query: 173 EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDW 232
+ G D F+ ++L+K W I + G +++Y+L LV++
Sbjct: 146 FWIS---GIDGRFRDLVLLVKEWARAHD-INNSKTGTLNSYSLSLLVVFHL--------- 192
Query: 233 DSYCISLNGPVRISSLPEVVVETPENSGGDLL---------------------------- 264
C R + LP + P N DL+
Sbjct: 193 -QTC-------RPAILPPLKEIYPGNVADDLIGVRAVVEGQIEETSAANINRFKRDRSRA 244
Query: 265 -----LSSEFLKECVEQFSVPSRG--------------FDTNSRSFPPKH-LNIVDPLKE 304
LS F+ + + SR D+N R P + L + DP ++
Sbjct: 245 PNRSSLSELFISFLAKFVDITSRASEQGICPYTGQWVDIDSNMRWMPRTYELFVEDPFEQ 304
Query: 305 NNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLTDEL 346
N R V RI AF ++L +Q + SL D L
Sbjct: 305 PENTARGVRSRQLQRISEAFQTTHQRLTSA-NQDQHSLIDTL 345
>gi|123449289|ref|XP_001313365.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
G3]
gi|121895246|gb|EAY00436.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
G3]
Length = 346
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 90/222 (40%), Gaps = 42/222 (18%)
Query: 13 GAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQR 72
G + ERP S + + E I Q+ PT R+ ++D +
Sbjct: 23 GLAYLERPPPSDPVL------------SKFHEQLVKFIKQLIPTKADVNVRQYIVDQICD 70
Query: 73 LIRNYLGCE------VFPFGSVPLKTYLPDGDIDLTAF------GGLNVEEALANDVCSV 120
I+ L C V P GS T+LP+ DID F +N+ + L
Sbjct: 71 KIKKSLPCPKDNKLIVLPCGSCMSGTFLPNADIDFAIFYYPIPCNPVNIMQQL------- 123
Query: 121 LEREDQNKAAEFVVKDAQ-LIRAEVKLVKCLVQ-NIVVDISFNQLGGLSTLCFLEQVDRL 178
Q AEF + L +A+V ++K + I +DISF++L G LC ++ + +
Sbjct: 124 -----QTSLAEFALDGFNPLPQAKVPVLKFMTNPGISIDISFDELHG--PLC-VQTIREI 175
Query: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
+ I +KA +++ G IS+Y L+ ++L
Sbjct: 176 FRTIPCILPAQIFLKAM-LRRNKLDQPFLGGISSYTLQLMIL 216
>gi|332030078|gb|EGI69903.1| PAP-associated domain-containing protein 5 [Acromyrmex echinatior]
Length = 662
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 114/278 (41%), Gaps = 44/278 (15%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R V+ ++ +I + + +V FGS YLP DIDL G N L
Sbjct: 228 RLRVVKRIENVIYDLWPDSKVEVFGSFRTGLYLPTSDIDLVVIG------MWTNLPLRTL 281
Query: 122 ERE--DQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLEQVDR 177
ER D+N A +K L +A V +VK + I VDISFN G+ + + R
Sbjct: 282 ERALLDRNIAEPSSIK--VLDKASVPIVKLTDKESEIKVDISFNMNNGVKSAELINSYKR 339
Query: 178 LIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCI 237
+ + ++ ++++K + + + G IS+Y+L L+ FL + D +C
Sbjct: 340 ---QYPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSL-ILMTISFLQLHPRKDI--HCP 392
Query: 238 SLNGPVRISSLPE--------VVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSR 289
+ N V + E V GG + E ++ ++
Sbjct: 393 NTNLGVLLIEFLELYGRKFNYVKTGIRIKDGGQYISKEEIQRDMID-------------- 438
Query: 290 SFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYG 327
P L I DPL N++GRS S G Y ++SAF +
Sbjct: 439 GHRPSLLCIEDPLTPGNDIGRS-SYGALY-VKSAFNWA 474
>gi|400597598|gb|EJP65328.1| Poly(A) RNA polymerase cid14 [Beauveria bassiana ARSEF 2860]
Length = 649
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 60/129 (46%), Gaps = 13/129 (10%)
Query: 53 VQPTVVSEERRKAVIDYVQRLI----RNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLN 108
V+P + R ++D +++ + RN+ VFPFGS YLP D+D+
Sbjct: 381 VRPRQFEQRIRDNLVDNLKQAMKREGRNFASAHVFPFGSFMSGLYLPTADMDIVVCSASF 440
Query: 109 VEEALAN--DVCSVLEREDQNKAAEFVVKDAQLIR----AEVKLVKCL--VQNIVVDISF 160
+ A S L + + A+ V DA I+ A + LVK + + + VDISF
Sbjct: 441 MRGGPATYLGAKSWLYKFQKFLVAQRVA-DADAIQVIAHARIPLVKYVDKMTGLRVDISF 499
Query: 161 NQLGGLSTL 169
LGG++ +
Sbjct: 500 ENLGGVNAI 508
>gi|255566462|ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis]
gi|223536493|gb|EEF38140.1| zinc finger protein, putative [Ricinus communis]
Length = 493
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 115/305 (37%), Gaps = 62/305 (20%)
Query: 74 IRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALAND--VCSVLEREDQNKAAE 131
I + G V PFGS + GD+D++ LAN + S ++ QN E
Sbjct: 42 IESLRGATVEPFGSFVSNLFTRWGDLDISIM--------LANGSYISSAAKKRKQNVLRE 93
Query: 132 FVV--------KDAQLI-RAEVKLVKCLV--QNIVVDISFNQLGGLSTLCFLEQVDRLIG 180
F + Q + A V L+K QNI D+S + L G FL ++++ G
Sbjct: 94 FHKALRQKGGWRRLQFVPNARVPLLKFESGRQNISCDVSIDNLQGQIKSNFLFWLNQIDG 153
Query: 181 KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF----------------- 223
+ F+ ++L+K W + I G +++Y+L LV++ F
Sbjct: 154 R---FRDMVLLVKEWAKAHN-INNPKTGTLNSYSLSLLVIFHFQTCVPAILPPLKEIYPR 209
Query: 224 ----------------LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSS 267
+ + Y V SSL E+ + G L ++
Sbjct: 210 NVVDDLTGVRTVAEERIKETCNANIARYMSDKYRAVNRSSLSELFISFFAKFSGISLKAA 269
Query: 268 EFLKECVEQFSVPSRGFDTNSRSFPPKH-LNIVDPLKENNNLGRSVSKGNFYRIRSAFTY 326
+ L C F+ + R P + L I DP ++ N R+VS GN +I AF
Sbjct: 270 D-LGICT--FTGQWLDIRSTMRWLPKTYALFIEDPFEQPENAARAVSAGNLVKIAEAFQT 326
Query: 327 GARKL 331
KL
Sbjct: 327 TYHKL 331
>gi|115673160|ref|XP_796681.2| PREDICTED: uncharacterized protein LOC592046 [Strongylocentrotus
purpuratus]
Length = 830
Score = 42.7 bits (99), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 9/115 (7%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R+ V+ +Q ++R+ + +V +GS YLP DIDL FG ++ E+ + + L
Sbjct: 156 RREVVQRIQGIVRSIWPKAKVEIYGSTRTMLYLPTSDIDLVLFG--DIGESPFFRLGNEL 213
Query: 122 EREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGLSTLCFLEQ 174
E+ + V D +A V +VK V + VDISFN G +E+
Sbjct: 214 EKSGIAEQGSIKVLD----KASVPIVKLTDNVTKVRVDISFNMQTGTDCAKLIEE 264
>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 781
Score = 42.7 bits (99), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 7/124 (5%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
++P + R+ V+D + R +R+ + V+PFGS P YLP GD+D+
Sbjct: 460 IKPRAFEKRIRQEVLDEINRFVRSTFPDAGVYPFGSFPSGLYLPTGDMDMVLCSDQYKRN 519
Query: 112 ALAN-DVCSVLER-EDQNKAAEFVVK-DAQLIR-AEVKLVKCLVQN--IVVDISFNQLGG 165
A D + R D K + + + ++I A+V LVK + + +D+SF G
Sbjct: 520 YRAKYDTRRTMYRLSDALKQQKLAFQNEVEIIAFAKVPLVKWVDSRTGLKIDVSFENDTG 579
Query: 166 LSTL 169
L +
Sbjct: 580 LQAI 583
>gi|384246771|gb|EIE20260.1| Nucleotidyltransferase [Coccomyxa subellipsoidea C-169]
Length = 454
Score = 42.7 bits (99), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 105/253 (41%), Gaps = 46/253 (18%)
Query: 37 EYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPD 96
E R ++ +A V PT R A + VQ +V+PFGS L +
Sbjct: 190 ECTARLQQEIVDFVANVAPTWEESNLRDAALGRVQGACAMMHLYDVYPFGSKASGLELWN 249
Query: 97 GDIDLTAFG---------GLNVEEAL-ANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKL 146
DID+ G G EE + NDV + + Q + + V K + ++ V +
Sbjct: 250 SDIDVVVLGIVEPSKDNLGYTTEEKVPVNDVLGKIVQ--QLRRSNSVRKTFHIRQSRVPI 307
Query: 147 VKC-LVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILG- 204
+KC V+ + VD+S N G+ FL +D+ + L + I+LIKA ++LG
Sbjct: 308 IKCTTVEGVDVDVSVNGDRGICAAQFL--IDQQARRPAL-RPLILLIKAVL----KVLGL 360
Query: 205 --AHHGLISTYALETLVLYK---------------------FLDYFSKFDWDSYCISL-- 239
G + +++L +V+ L Y + F++D + +++
Sbjct: 361 GDVSQGGLGSFSLANMVIAHLQEEEKVGRGQENLGVSLLAFLLRYGTYFNYDQHVVAIGR 420
Query: 240 NGPVRISSLPEVV 252
G V +++P V
Sbjct: 421 GGIVSRTAVPGAV 433
>gi|218189365|gb|EEC71792.1| hypothetical protein OsI_04417 [Oryza sativa Indica Group]
Length = 557
Score = 42.7 bits (99), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 70/316 (22%), Positives = 118/316 (37%), Gaps = 61/316 (19%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRL------IRNYLGCEVFPFGSVPLKTYLPD 96
E+ + I++ ++P V ++RRK + +Q L + G PFGS Y
Sbjct: 11 EQCVKNILSLIKP--VEDDRRKR-LSAIQELSNSIPKVAALRGAVFKPFGSFVSNLYSNS 67
Query: 97 GDIDLTAFGGLN--VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNI 154
GD+D++ N + + V L R QN+ V+ R V I
Sbjct: 68 GDLDISVQLPNNSIISKKKKQYVLRELMRVLQNRGVAGYVQFIPFARVPVLQYVSNTFGI 127
Query: 155 VVDISFNQLGGL---STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIS 211
DIS N G C++ +D G ++LIK W ++ I G ++
Sbjct: 128 SCDISVNNYPGRIKSKIFCWISSLDVRFGD------MVLLIKEWAKAQN-INDPKTGTLN 180
Query: 212 TYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK 271
+Y+L LVL+ F + P ++ E G+ ++
Sbjct: 181 SYSLCLLVLFHFQ---------------------TCEPAILPPLKEIYEGN-------IE 212
Query: 272 ECVEQFSVPSRGFDTNSRSFPPKHLNIV-DPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
E + + +V + +HL+ V DP++ +N R+V RI AFT RK
Sbjct: 213 EGIAEMTV-----------YDEEHLDEVEDPIERPDNAARAVGLKGLERIAGAFTAANRK 261
Query: 331 LGHILSQPEESLTDEL 346
+ L + L
Sbjct: 262 FASLQHAKRNDLLEML 277
>gi|448097882|ref|XP_004198786.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359380208|emb|CCE82449.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 650
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 111/272 (40%), Gaps = 32/272 (11%)
Query: 63 RKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDL--TAFGGLNVEEALANDVCS 119
R V+ ++R I N + E FGS YLP DID+ T+ G + + S
Sbjct: 210 RNRVVKDLKREINNLWPDTEAHVFGSSATDLYLPGSDIDMVVTSNTGDYENRSKLYQLSS 269
Query: 120 VLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGLSTLCFLEQ-VD 176
L K E + K A+V +VK + NI +DISF + G+ + + +D
Sbjct: 270 YLRNRKLAKDIEVIAK------AKVPIVKFVDPSSNIHIDISFERRNGIEAAKRIRRWLD 323
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL-YKFLDYFSKFDWDSY 235
R G R ++LI R+ H G + Y+ T++L Y FL +
Sbjct: 324 RTPG-----LRELVLIVKQFLRSRRLNNVHVGGLGGYS--TIILCYHFLRLHPR------ 370
Query: 236 CISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK---ECVEQFSVPSRGFDTNSRSFP 292
IS N + +L +++E E G + + + E E +P + + S
Sbjct: 371 -ISTNNISILDNLGSLLIEFFELYGRNFSYDNLIIAIDPETDEVKYLPKKDHAYLNPSKN 429
Query: 293 PKHLNIVDPLKENNNLGRSVSKGNFYRIRSAF 324
P + I DP NN+ RS N ++ AF
Sbjct: 430 PFSIVIQDPADSTNNISRSSY--NLRDVKKAF 459
>gi|147825319|emb|CAN73261.1| hypothetical protein VITISV_003724 [Vitis vinifera]
Length = 106
Score = 42.4 bits (98), Expect = 0.90, Method: Composition-based stats.
Identities = 16/26 (61%), Positives = 19/26 (73%)
Query: 231 DWDSYCISLNGPVRISSLPEVVVETP 256
DWD +C+SL GPV ISSLP+ E P
Sbjct: 47 DWDGFCVSLGGPVPISSLPDATTEPP 72
>gi|403338429|gb|EJY68454.1| hypothetical protein OXYTRI_10932 [Oxytricha trifallax]
Length = 1545
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 66/319 (20%), Positives = 114/319 (35%), Gaps = 103/319 (32%)
Query: 62 RRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVL 121
R + +Q+ + N EV +GS K LP DIDL + V S++
Sbjct: 1179 RESNITSVIQKALPN---SEVKVYGSHATKLCLPWSDIDLVIKTNSTDHYSTPKHVLSII 1235
Query: 122 EREDQNKAAEFVVKDAQLIR---AEVKLVKCLVQNIVVDISFNQLGGLSTL------CFL 172
RE Q+ +++ + + V VKC + +I+ Q GL++ FL
Sbjct: 1236 TRELQSDHTTKWIQEVKFVENASVPVVKVKCQIDHIM------QTSGLASQNISKYQTFL 1289
Query: 173 EQ---------------------VDRLIGKDHLFKRSIILIKAW---CYYESRILGAHHG 208
EQ V + ++ + + I+++K + C Y G
Sbjct: 1290 EQPFSIDITQLTDNHNGLECVKLVQEFLSENEVIEPLILVLKQYLKVCQYNDPYFGG--- 1346
Query: 209 LISTYALETLV---------------------LYKFLDYFSKFDWDSYCISLNGPVRISS 247
IS+YAL ++ L F ++ F + SY I + P +IS
Sbjct: 1347 -ISSYALFLMIVSYLQSIQAPKLISQVNLGHILISFFQFYGDFQYQSYGIYTHLPGKIS- 1404
Query: 248 LPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNN 307
E +++ + F + + I DPL +NN
Sbjct: 1405 ------------------------EKTNHYAIVN---------FLTQTVQIDDPLHVHNN 1431
Query: 308 LGRSVSKGNFYRIRSAFTY 326
+G+S K FY I+ +F +
Sbjct: 1432 VGKSSFK--FYEIKDSFKF 1448
>gi|402220735|gb|EJU00806.1| Nucleotidyltransferase, partial [Dacryopinax sp. DJM-731 SS1]
Length = 266
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 69/142 (48%), Gaps = 18/142 (12%)
Query: 63 RKAVIDYVQRLI-RNYLGCEVFPFGSVPLKTYLPDGDIDLTA-FGGLNVEE-----ALAN 115
R VI+ ++ I R + V FGS + Y P+GDIDL + G++VE + +
Sbjct: 22 RLMVIECIRSSITRKWPSARVLAFGSQETQLYFPNGDIDLVVHYDGISVERKDQIVSFLS 81
Query: 116 DVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQ--NIVVDISFNQLGGLSTLCFLE 173
++ +L++ ++ + K A V ++K + + + VDIS NQ GL +
Sbjct: 82 EISCLLQQAKVSRRVNLIGK------ARVPIIKFVTELGHFAVDISVNQTNGLRAVTV-- 133
Query: 174 QVDRLIGKDHLFKRSIILIKAW 195
V+R + + +++IKA+
Sbjct: 134 -VNRFLWYLPAVRPLVMVIKAF 154
>gi|452823485|gb|EME30495.1| DNA polymerase sigma subunit [Galdieria sulphuraria]
Length = 417
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 114/302 (37%), Gaps = 84/302 (27%)
Query: 61 ERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAF----GGLNVEEALAN 115
++RK +I+ V +IR + V FGS YLP DIDL G E L
Sbjct: 116 KQRKQLIERVTEIIRQIWPNSSVHVFGSFATNLYLPTSDIDLCILSSPENGSKRELHLLA 175
Query: 116 DVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLE 173
DV R NK + D +A V ++K + I DISF + G+ +
Sbjct: 176 DVL----RRKTNKMRRVMAID----KARVPIIKVTDRETGIQCDISFGRTNGIENV---R 224
Query: 174 QVDRLIGKDHLFKRSIILIKAWCYYESRILG-AHHGLISTYA------------------ 214
+ + + + + +++IK C+ R L H G I +Y
Sbjct: 225 HIQKYLKRYPSLRPLMMVIK--CFLHQRALNEVHEGGIGSYLLLLSIISHLQMIPVNFPD 282
Query: 215 ---------LETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLL 265
L +L+L F Y F++ IS+ +GG
Sbjct: 283 MRKEGFISNLGSLLLSYFQLYGRLFNYMKTGISVK------------------NGG---- 320
Query: 266 SSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFT 325
+ E VE+F F+ N P L++ DP E N LGR+ + R+R+AF+
Sbjct: 321 ---YYYEKVERFP-----FEINR----PNLLSLEDPRDEENELGRNSFAVS--RVRTAFS 366
Query: 326 YG 327
G
Sbjct: 367 QG 368
>gi|303389764|ref|XP_003073114.1| DNA polymerase sigma [Encephalitozoon intestinalis ATCC 50506]
gi|303302258|gb|ADM11754.1| DNA polymerase sigma [Encephalitozoon intestinalis ATCC 50506]
Length = 378
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 71/159 (44%), Gaps = 7/159 (4%)
Query: 19 RPSSSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNY 77
+ S S + +N +++ ++ + + ++ PT RK + + ++RLI R
Sbjct: 43 KAVSIESLLDTNMSSVSLGNLEKLDLELHQLYQKLAPTTTEINSRKYIFEKIKRLIVREI 102
Query: 78 LGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDA 137
EV PFGS +P DID+ G N ++ +N S + + A+F+ K+
Sbjct: 103 PNAEVEPFGSYTTGLIIPSSDIDINIQLGNNHDKEYSNRYLSKI--KSLMLKADFIRKET 160
Query: 138 --QLIRAEVKLVKC--LVQNIVVDISFNQLGGLSTLCFL 172
+ + + ++K V +DIS NQ G+ F+
Sbjct: 161 LFHIRKCRIPILKFSDKVFGFKIDISVNQTNGIEAAKFV 199
>gi|299473006|emb|CBN77407.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
Length = 434
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 38/168 (22%), Positives = 75/168 (44%), Gaps = 14/168 (8%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRNYLGCE--VFPFGSVPLKTYLPDGDIDLTAFGGLNVE 110
+ PT + + I Y++R++ LG E V FGS LP DID GG
Sbjct: 276 LMPTPEEKAKTAIAITYIKRVVEETLGSEARVEIFGSQLTGLVLPSSDIDSVVLGGPR-- 333
Query: 111 EALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKC--LVQNIVVDISFNQLGGLST 168
+L + ++ R+++ + E V + A V LVK + + VD+ F+Q G+ +
Sbjct: 334 GSLGSLGAAMYRRQNKGEVREVTV----IKSARVPLVKFVHVGSGVQVDVCFDQESGMKS 389
Query: 169 LCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALE 216
+ R + + R ++++ + ++ +HG + ++ L+
Sbjct: 390 ----GRAARAMMRQMQPVRPLVMVLKAYMGQRKLNETYHGGVGSFLLQ 433
>gi|47208265|emb|CAF92498.1| unnamed protein product [Tetraodon nigroviridis]
Length = 297
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 115/306 (37%), Gaps = 68/306 (22%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFGG----- 106
+ P E+ R V+D ++ +I + + EV FGS YLP DIDL FG
Sbjct: 1 ISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVVFGKWETLP 60
Query: 107 -LNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGG 165
+EEAL + K A D I+ K L I VDISFN G
Sbjct: 61 LWTLEEALR-----------KRKVA-----DENSIKVLDKATVSLFSLIFVDISFNMKSG 104
Query: 166 LSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLD 225
+ +++ K + ++++K + + + G I +Y+L L+ FL
Sbjct: 105 VKAAQLIKEFKE---KYPVLPYLVLVLKQFL-LQRDLNEVFTGGIGSYSL-FLMAVSFLQ 159
Query: 226 YFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFD 285
+ D V P + G LL+ EF + F+ G
Sbjct: 160 LHYRED---------------------VCNPNINIGVLLI--EFFELYGRHFNYLKTGIR 196
Query: 286 TNS---------------RSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARK 330
+ P L I DPL+ +N++GRS S G +++ AF Y
Sbjct: 197 IKDGGCYVAKDEVQKNLMDGYRPSMLYIEDPLQPDNDVGRS-SYGAM-QVKQAFDYAYVV 254
Query: 331 LGHILS 336
L H +S
Sbjct: 255 LSHAVS 260
>gi|170592851|ref|XP_001901178.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158591245|gb|EDP29858.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 421
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 133/333 (39%), Gaps = 48/333 (14%)
Query: 28 PSNQTAIGAEYWQR--------AEEATQGIIAQVQPTVVSEERRKAVIDYVQRLI-RNYL 78
P TA+ A + +R E + A ++P+ + R V + V+ ++ R +
Sbjct: 45 PREGTALIAPWCRRRYALSLRGLHEELLDLYAWLKPSPLERALRLRVFERVRGVLQRIWP 104
Query: 79 GCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQ 138
++ FGS+ +LP DID+ L EE L+ ++
Sbjct: 105 TAKIDVFGSLYTSLFLPTSDIDVVVESDLVSEEPPLWKTAIALKESGITESINV------ 158
Query: 139 LIRAEVKLVKCLVQN--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWC 196
L +A V +VK + ++ I +DISFN + G+ + F+E + + ++L+
Sbjct: 159 LDKAFVPIVKMVDKDTKIYLDISFNTVQGVRSAKFIED----MKMRYPVLEPLVLVLKQF 214
Query: 197 YYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDSYCIS-LNGPVRISSLPEVV--- 252
+ ++ G +S+Y L L+L FL +D+ I+ +N V + + ++
Sbjct: 215 LMQRQLNQVFTGGLSSYGL-ILMLISFLQLHPSYDYSYKRITEVNMGVLLLNFLQLYGQE 273
Query: 253 -----VETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNN 307
+SGG + E L Q + PS L I DPL+ N+
Sbjct: 274 FNYMKTALRIHSGGAYVCKDEILV----QMNRPSNSM-----------LCIEDPLQPGND 318
Query: 308 LGRSVSKGNFYRIRSAFTYGARKLGHILSQPEE 340
+GR N +R AF + L + + E
Sbjct: 319 IGR--CSHNIQLVRQAFEHAFATLCAVFVRSRE 349
>gi|281211597|gb|EFA85759.1| PAP/25A-associated domain-containing protein [Polysphondylium
pallidum PN500]
Length = 918
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 74/332 (22%), Positives = 135/332 (40%), Gaps = 46/332 (13%)
Query: 55 PTVVSEERRKAVIDYVQRLIR-NYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEAL 113
PT R+ +++ ++ +++ N+ +V FGS +P DID+ G V
Sbjct: 473 PTQYENRMRQKIVNDIEAIVKQNWPKAKVIVFGSFSTDLCIPSSDIDIQISGITEVASGN 532
Query: 114 A-------NDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLVQNI--VVDISFNQL 163
ND+ + L + Q EF + +LI A+V ++K ++ VDI F+
Sbjct: 533 GRTYSNPINDLYNTLSKHHQ---REFT--NIRLIAAAKVPIIKMAHKSTWYNVDICFDTP 587
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
G+ ++Q R + + ++LI + +++ + + G I +YAL +V+
Sbjct: 588 NGIENTEIVKQFLR----KYKSMKILLLILKYFMFQNNMNETYSGGIGSYALALMVVSYI 643
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
++ D + N + +T G ++L +F K + F G
Sbjct: 644 QLRYASMDQRVHHKRSNYQHDSENHRHAGNDT---DYGKMIL--DFFKLYGQLFQYTRHG 698
Query: 284 FDTNSRSFPPK-------HLNIVDPLKENNNLGR-----SVSKGNFYRIRSAFT------ 325
N+ F K +L I DP NN++G+ S +G F+ T
Sbjct: 699 ICLNNGGFYFKKGEQYGIYLTIRDPHDANNDVGKNSFNISFIRGVFFNAMLKMTSDELLK 758
Query: 326 --YGARKLGHILSQ-PEESLTDELRKFFSNTL 354
Y A K ILS+ EE L ++ K +N +
Sbjct: 759 DKYSALKFPTILSRLIEERLVEQQAKDRNNVI 790
>gi|449465848|ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
Length = 464
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 80/193 (41%), Gaps = 20/193 (10%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRL---IRNYLGCEVFPFGSVPLKTYLPDGDI 99
+ + I+ V+P R VI+ ++ + I + G + PFGS + GD+
Sbjct: 7 DRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRWGDL 66
Query: 100 DL---------TAFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL 150
DL T+ G ++ L D+ + + + + + R + ++ +
Sbjct: 67 DLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYKLQLIPHA----RVPILKIEHI 122
Query: 151 VQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210
NI DIS + L G L V+ + G+ F ++L+K W I + G
Sbjct: 123 QHNISCDISIDNLVGQIKSKILLWVNEIDGR---FHDMVLLVKEWAKAHD-INNSKQGTF 178
Query: 211 STYALETLVLYKF 223
++Y+L LV++ F
Sbjct: 179 NSYSLSLLVIFHF 191
>gi|190407236|gb|EDV10503.1| DNA polymerase sigma [Saccharomyces cerevisiae RM11-1a]
gi|259149371|emb|CAY86175.1| Pap2p [Saccharomyces cerevisiae EC1118]
Length = 584
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 77/182 (42%), Gaps = 14/182 (7%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ ++ + ++ FGS YLP DID
Sbjct: 184 KDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVTS 243
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--QNIVVDISFNQL 163
L +E+ N + +N A E V + +A V ++K + I +D+SF +
Sbjct: 244 KLGGKESRNNLYSLASHLKKKNLATEVEV----VAKARVPIIKFVEPHSGIHIDVSFERT 299
Query: 164 GGLSTLCFL-EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYK 222
G+ + E +D G R ++LI + R+ H G + +++ LV +
Sbjct: 300 NGIEAAKLIREWLDDTPG-----LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV-FS 353
Query: 223 FL 224
FL
Sbjct: 354 FL 355
>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
Length = 573
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 43/181 (23%), Positives = 76/181 (41%), Gaps = 12/181 (6%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ ++ + ++ FGS YLP DID
Sbjct: 176 KDFVAYISPSREEIEVRNQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVITS 235
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQL 163
L +E+ N + +N A E V + +A V ++K + N I +D+SF +
Sbjct: 236 ELGGKESRNNLFSLASHLKKKNLATEIEV----VAKARVPIIKFVEPNSGIHIDVSFERT 291
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
GL + R D R ++LI + R+ H G + +++ LV + F
Sbjct: 292 NGLEAAKLI----REWLNDTPGLRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLV-FSF 346
Query: 224 L 224
L
Sbjct: 347 L 347
>gi|147799779|emb|CAN72745.1| hypothetical protein VITISV_018734 [Vitis vinifera]
Length = 258
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 230 FDWDSYCISLNGPVRISSLPEVVVETPEN 258
DWDS+C+SL GPV ISSLP+ + P
Sbjct: 114 IDWDSFCVSLWGPVPISSLPDATTKPPRQ 142
>gi|254573058|ref|XP_002493638.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
[Komagataella pastoris GS115]
gi|238033437|emb|CAY71459.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
[Komagataella pastoris GS115]
gi|328354535|emb|CCA40932.1| DNA polymerase sigma subunit [Komagataella pastoris CBS 7435]
Length = 601
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 80/186 (43%), Gaps = 21/186 (11%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL--GCEVFPFGSVPLKTYLPDGDIDL--T 102
+ I + P++ E R + +++ I L C V FGS YLP DID+ T
Sbjct: 140 KDFINYISPSIAEIEARNNAVKRLRKEITTNLWPDCYVNVFGSFATDLYLPGSDIDMVIT 199
Query: 103 AFGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISF 160
+ G ++ + S L ++ V + RA+V ++K + I +D+SF
Sbjct: 200 SDSGKYCAKSYLYQLSSFL------RSKNLGVNIETIARAKVPIIKFIEPRSKIHIDVSF 253
Query: 161 NQLGGLSTLCFLEQVDRLIG--KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 218
+ GL +R+ G ++ R ++LI R+ HHG + +++ L
Sbjct: 254 EKTNGLRA------AERIQGWLRETPGLRELVLIVKQFLAVRRMNNVHHGGLGGFSIICL 307
Query: 219 VLYKFL 224
V + FL
Sbjct: 308 V-HSFL 312
>gi|151945519|gb|EDN63760.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
Length = 584
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 77/182 (42%), Gaps = 14/182 (7%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ ++ + ++ FGS YLP DID
Sbjct: 184 KDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVTS 243
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--QNIVVDISFNQL 163
L +E+ N + +N A E V + +A V ++K + I +D+SF +
Sbjct: 244 KLGGKESRNNLYSLASHLKKKNLATEVEV----VAKARVPIIKFVEPHSGIHIDVSFERT 299
Query: 164 GGLSTLCFL-EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYK 222
G+ + E +D G R ++LI + R+ H G + +++ LV +
Sbjct: 300 NGIEAAKLIREWLDDTPG-----LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV-FS 353
Query: 223 FL 224
FL
Sbjct: 354 FL 355
>gi|6324457|ref|NP_014526.1| non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
S288c]
gi|1717744|sp|P53632.1|PAP2_YEAST RecName: Full=Poly(A) RNA polymerase protein 2; AltName: Full=DNA
polymerase kappa; AltName: Full=DNA polymerase sigma;
AltName: Full=Topoisomerase 1-related protein TRF4
gi|663237|emb|CAA88145.1| ORF [Saccharomyces cerevisiae]
gi|950226|gb|AAC49091.1| Trf4p [Saccharomyces cerevisiae]
gi|1419987|emb|CAA99134.1| TRF4 [Saccharomyces cerevisiae]
gi|51830518|gb|AAU09782.1| YOL115W [Saccharomyces cerevisiae]
gi|285814775|tpg|DAA10668.1| TPA: non-canonical poly(A) polymerase PAP2 [Saccharomyces
cerevisiae S288c]
gi|392296670|gb|EIW07772.1| Pap2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 584
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 78/182 (42%), Gaps = 14/182 (7%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ ++ + ++ FGS YLP DID
Sbjct: 184 KDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVTS 243
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQN--IVVDISFNQL 163
L +E+ N + +N A E V + +A V ++K + + I +D+SF +
Sbjct: 244 ELGGKESRNNLYSLASHLKKKNLATEVEV----VAKARVPIIKFVEPHSGIHIDVSFERT 299
Query: 164 GGLSTLCFL-EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYK 222
G+ + E +D G R ++LI + R+ H G + +++ LV +
Sbjct: 300 NGIEAAKLIREWLDDTPG-----LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV-FS 353
Query: 223 FL 224
FL
Sbjct: 354 FL 355
>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
Length = 604
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 116/300 (38%), Gaps = 56/300 (18%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ ++ + P+ R VI+ ++R + ++ G E FGS YLP DID+
Sbjct: 170 KDFVSYISPSRAEIVTRNNVINTLKREVSSFWPGTEAHVFGSCATDLYLPGSDIDMVVIS 229
Query: 106 GLNVEEALAN--DVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFN 161
E + + S L ++ K E + A+V ++K + N+ +DISF
Sbjct: 230 STGDYENRSRLYQLSSFLRAKNLAKNVEVIAS------AKVPIIKFVDPESNLPIDISFE 283
Query: 162 QLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY 221
+ GL + + L+ L R ++L+ ++ H G + YA ++ Y
Sbjct: 284 RTNGLDAARRIRRW--LLATPGL--RELVLVVKQFLRSRKLNNVHVGGLGGYAT-IIMCY 338
Query: 222 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 281
F+ K IS N + P+N G +L EF + FS +
Sbjct: 339 HFMQLHPK-------ISTN-----------TMNAPDNLG---VLLIEFFELYGRNFSYDN 377
Query: 282 RGFDTNSRSFPPKHLN-----------------IVDPLKENNNLGRSVSKGNFYRIRSAF 324
+S + P++L+ I DP +NN+ RS N ++ AF
Sbjct: 378 LIISIDSETQLPRYLHKGRHPSLNTARNTFSIVIQDPADPSNNITRSSY--NLRDLKKAF 435
>gi|357131279|ref|XP_003567266.1| PREDICTED: poly(A) RNA polymerase protein cid1-like [Brachypodium
distachyon]
Length = 595
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 74/188 (39%), Gaps = 11/188 (5%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL---GCEVFPFGSVPLKTYLPDGDI 99
E+ + I++Q++P V +R + I + I++ G PFGS Y GD+
Sbjct: 9 EKCIKEILSQIKPAEVDRNKRLSAIKELDISIQSVAALKGAAAKPFGSFLSNLYSKSGDL 68
Query: 100 DLTA----FGGLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIV 155
DL+ L V + + VL + Q ++ R V I
Sbjct: 69 DLSVQLMNSSNLPVSKKKKQSILRVLRKALQRNGVAGYMEFIPHARVPVLQYVSNSFGIS 128
Query: 156 VDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYAL 215
D+S + G + L D F ++LIK W ++ I G +++Y+L
Sbjct: 129 CDLSIDNYPGRIKSRIFYWISTL---DERFGDMVLLIKEWAKCQN-INDPKTGTLNSYSL 184
Query: 216 ETLVLYKF 223
LVL+ F
Sbjct: 185 CLLVLFHF 192
>gi|349581056|dbj|GAA26214.1| K7_Pap2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 584
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 77/182 (42%), Gaps = 14/182 (7%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ ++ + ++ FGS YLP DID
Sbjct: 184 KDFVAYISPSREEIEIRNQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVTS 243
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--QNIVVDISFNQL 163
L +E+ N + +N A E V + +A V ++K + I +D+SF +
Sbjct: 244 ELGGKESRNNLYSLASHLKKKNLATEVEV----VAKARVPIIKFVEPHSGIHIDVSFERT 299
Query: 164 GGLSTLCFL-EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYK 222
G+ + E +D G R ++LI + R+ H G + +++ LV +
Sbjct: 300 NGIEAAKLIREWLDDTPG-----LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV-FS 353
Query: 223 FL 224
FL
Sbjct: 354 FL 355
>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
Length = 531
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 78/186 (41%), Gaps = 20/186 (10%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ +R + G ++ FGS YLP DID
Sbjct: 106 RDFVAYISPSRQEIELRNKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGSDIDCV--- 162
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQL 163
+N + + S+ E K + + + +A V ++K + I VD+SF +
Sbjct: 163 -INSKTGDKENRSSLYELAHFLKNRKLATQVEVIAKARVPIIKFVEPTSQIHVDVSFERT 221
Query: 164 GGLSTL----CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLV 219
GL +L+Q L R ++LI + R+ H G + +++ LV
Sbjct: 222 NGLEAAKLIRSWLQQTPGL--------RELVLIVKQFLHARRLNNVHTGGLGGFSIICLV 273
Query: 220 LYKFLD 225
Y FL+
Sbjct: 274 -YAFLN 278
>gi|302791355|ref|XP_002977444.1| hypothetical protein SELMODRAFT_417492 [Selaginella moellendorffii]
gi|300154814|gb|EFJ21448.1| hypothetical protein SELMODRAFT_417492 [Selaginella moellendorffii]
Length = 479
Score = 40.0 bits (92), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 73/175 (41%), Gaps = 9/175 (5%)
Query: 52 QVQPTVVSEERRKAVIDYVQRLIRNYLGCE---VFPFGSVPLKTYLPDGDIDLTAFGGLN 108
Q+QPT E R ++ ++ LIR C+ + PFGS Y P GD+D+T +
Sbjct: 41 QLQPTQQDFEARVDILRRLEYLIREIDVCKGLAIKPFGSFLSNLYTPWGDLDITLMPLES 100
Query: 109 VEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKC--LVQNIVVDISFNQLGGL 166
+ + + D A ++ L R V L+ I DIS + +
Sbjct: 101 APLSRSKKTKILKSIHDALLQAGGAIRVQVLFRPRVPLLMFEDAWWRISCDISVSNTDAV 160
Query: 167 STLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLY 221
+ ++G D ++ I L+K W + I G +++YAL LV++
Sbjct: 161 FK---SHALGLIVGMDLRCRQLIFLVKCWAKAQC-INDPKMGTLNSYALSLLVIF 211
>gi|19075773|ref|NP_588273.1| poly(A) polymerase Cid12 [Schizosaccharomyces pombe 972h-]
gi|74582471|sp|O74518.1|CID12_SCHPO RecName: Full=Poly(A) RNA polymerase cid12; Short=PAP; AltName:
Full=Caffeine-induced death protein 12; AltName:
Full=Polynucleotide adenylyltransferase cid12
gi|3426138|emb|CAA20372.1| poly(A) polymerase Cid12 [Schizosaccharomyces pombe]
Length = 336
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 127/281 (45%), Gaps = 27/281 (9%)
Query: 53 VQPTVVSEERRKAVIDYVQRLIRNY-LGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEE 111
V P + + RK +++ +Q IR L E+ +GS+ + T L D+D++ V E
Sbjct: 31 VSPKIEELKYRKLLLEKLQTHIREVVLDAELQVYGSMYIGTTLSISDVDVS-LKSPRVGE 89
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCF 171
V VL R+ + A+F A++ R + LV V I VD++F G C
Sbjct: 90 LEKRRVTMVL-RKYLDADADFH-SSARVPR--INLVD--VSGIGVDLTF----GNDKACR 139
Query: 172 LEQVDRLIGKDH-LFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKF 230
++ + ++H +F R ++L+K W + E + HHG I++ AL +++ F K
Sbjct: 140 TAELQKAYNEEHPIFGRLLMLLKHWLF-ERDLENVHHGGIASCALSYMLIGWLEMRFHKK 198
Query: 231 DWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRGFDTNSRS 290
DS P+R + L + +L + + V + +G+ +
Sbjct: 199 GIDSEV----QPIR-ALLQKFFYFWGVEWTYELFVLRPLTGQIVPKL---QKGWLNEVQ- 249
Query: 291 FPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
P L+I DP+ NN++G+ + + I++AF A +L
Sbjct: 250 --PNLLSIEDPIDRNNDIGKQSFQISM--IKAAFVASANEL 286
>gi|448101749|ref|XP_004199636.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359381058|emb|CCE81517.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 646
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 66/272 (24%), Positives = 109/272 (40%), Gaps = 31/272 (11%)
Query: 62 RRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDL--TAFGGLNVEEALANDVCS 119
R + V D + + + E FGS YLP DID+ T+ G + + S
Sbjct: 209 RNRVVKDLKREINSLWPDTETHVFGSSATDLYLPGSDIDMVVTSKTGDYENRSKLYQLSS 268
Query: 120 VLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQLGGLSTLCFLEQ-VD 176
L K E + K A+V ++K + NI +DISF + G+ + + +D
Sbjct: 269 YLRNRKLAKDIEVIAK------AKVPIIKFVDPSSNIHIDISFERRNGIEAAKRIRKWLD 322
Query: 177 RLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL-YKFLDYFSKFDWDSY 235
+ G R ++LI R+ H G + Y+ T++L Y FL +
Sbjct: 323 KTPG-----LRELVLIIKQFLRSRRLNNVHVGGLGGYS--TIILCYHFLRLHPR------ 369
Query: 236 CISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLK---ECVEQFSVPSRGFDTNSRSFP 292
IS N + +L +++E E G + + + E E +P + + S
Sbjct: 370 -ISTNNMSILDNLGSLLIEFFELYGRNFSYDNLIIAIDPETDEPKYLPKKDHAYLNSSKN 428
Query: 293 PKHLNIVDPLKENNNLGRSVSKGNFYRIRSAF 324
P + I DP NN+ RS N ++ AF
Sbjct: 429 PFSIVIQDPADSTNNISRSSY--NLRDVKKAF 458
>gi|340506956|gb|EGR32991.1| hypothetical protein IMG5_064460 [Ichthyophthirius multifiliis]
Length = 347
Score = 39.7 bits (91), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 10/145 (6%)
Query: 29 SNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYL-GCEVFPFGS 87
S+QT + R + + PT E R + + ++I++ + CEV FGS
Sbjct: 35 SDQTLLIKNPLYRLHNEIIELTEYLAPTKEEHELRIKSFENLTQIIKSVIPDCEVKTFGS 94
Query: 88 VPLKTYLPDGDIDLTAFGGLNVEEALANDVCS-VLEREDQNKAAEFVVKDAQLIRAEVKL 146
K YLP+ DID+ + L V VL ED + F+ A+V L
Sbjct: 95 FSSKLYLPNSDIDIVIVKEGESNKYLYKKVADVVLTCEDIYENISFIT------NAKVPL 148
Query: 147 VKCLVQNIVV--DISFNQLGGLSTL 169
+K + ++ DISFN+ G+ L
Sbjct: 149 IKFVEKSTQTNFDISFNKEDGVKQL 173
>gi|256271045|gb|EEU06149.1| Pap2p [Saccharomyces cerevisiae JAY291]
Length = 584
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 43/184 (23%), Positives = 78/184 (42%), Gaps = 18/184 (9%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ +A + P+ E R I ++ ++ + ++ FGS YLP DID
Sbjct: 184 KDFVAYISPSREEIEIRNKTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVTS 243
Query: 106 GLNVEEALAN--DVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLV--QNIVVDISFN 161
L +E+ N + S L+++ E V K A V ++K + I +D+SF
Sbjct: 244 KLGGKESRNNLYSLASHLKKKKLATEVEVVAK------ARVPIIKFVEPHSGIHIDVSFE 297
Query: 162 QLGGLSTLCFL-EQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL 220
+ G+ + E +D G R ++LI + R+ H G + +++ LV
Sbjct: 298 RTNGIEAAKLIREWLDDTPG-----LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 351
Query: 221 YKFL 224
+ FL
Sbjct: 352 FSFL 355
>gi|154337346|ref|XP_001564906.1| RNA polymerase II [Leishmania braziliensis MHOM/BR/75/M2904]
gi|134061944|emb|CAM38985.1| RNA polymerase II [Leishmania braziliensis MHOM/BR/75/M2904]
Length = 374
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 65/155 (41%), Gaps = 20/155 (12%)
Query: 82 VFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKAAEFVV------- 134
V P+GS+ T L DGD D L E + C V+ERE Q K +
Sbjct: 68 VLPYGSIVSGTSLRDGDADYIVSFPLASESTCQSIAC-VIERERQEKLLSDIFVHIRKNN 126
Query: 135 KDAQL-----IRAEVKLVKCLVQNIVVDISFN---QLGGLSTLCFLEQVDRLIGKDHLFK 186
+D +L RA V +V+ + ++ F+ LGGL L Q + D +
Sbjct: 127 RDDELYPQRIFRARVPIVQYVRKSACEISKFDICLSLGGLKNSLLLRQ---YMAGDPRLR 183
Query: 187 RSIILIKAWCYYESRILGAHHGLISTYALETLVLY 221
++ K W + +IL G IS YAL + ++
Sbjct: 184 LGVLGAKQWG-RDHQILNTRRGWISPYALSIMYIH 217
>gi|146086153|ref|XP_001465471.1| RNA polymerase II [Leishmania infantum JPCM5]
gi|134069569|emb|CAM67892.1| RNA polymerase II [Leishmania infantum JPCM5]
Length = 382
Score = 39.3 bits (90), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 61/269 (22%), Positives = 107/269 (39%), Gaps = 44/269 (16%)
Query: 22 SSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNY---L 78
+ +S+ N +G + E + I++Q Q + + E+RR ++Q L++ +
Sbjct: 9 TRASTYRRNGGELGTSMSSKIAELSGRILSQQQYSQILEQRR-----WLQGLLKGVSLDM 63
Query: 79 GCE---------VFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKA 129
G + V P+GS+ T DGD D + E + C V+ RE Q K
Sbjct: 64 GAKEEEKSASPVVLPYGSIVSGTSFTDGDADYIVSFPIVSESTRQSGAC-VIARERQEKC 122
Query: 130 ------------AEFVVKDAQLIRAEVKLVKCLVQNIVVDISFN---QLGGLSTLCFLEQ 174
++ + ++ RA V +V+ + ++ F+ L GL L
Sbjct: 123 LSDIFSHIRKCNSDVELHPQRIFRARVPIVQYVRKSAQESTKFDLSLSLDGLKNSLLLRH 182
Query: 175 VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLDYFSKFDWDS 234
+ D + ++ K W E +IL A G IS YAL + + +F K D
Sbjct: 183 Y---MAGDPRLRLGVLGAKQWG-REQQILNARRGWISPYALSIMYI-----HFMK-DTGR 232
Query: 235 YCISLNGPVRISSLPEVVVETPENSGGDL 263
+S + +S +V T S GD+
Sbjct: 233 TALSFDEEA-VSQRVNAIVSTAAESEGDI 260
>gi|313231448|emb|CBY08562.1| unnamed protein product [Oikopleura dioica]
Length = 587
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 65/299 (21%), Positives = 116/299 (38%), Gaps = 76/299 (25%)
Query: 64 KAVIDYVQRLIRNYLGCEVFP------FGSVPLKTYLPDGDIDLTAFGGLNVE------E 111
++ +++ + L R L +FP +GS + LP DID+ N E
Sbjct: 22 RSYLEFNRSLPRKGLSTCLFPGSKVEIYGSFQTRLNLPTSDIDMVICDFENSSRDQQPYE 81
Query: 112 ALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKC--LVQNIVVDISFNQLGGLSTL 169
AL + + K AE + L+ A V +VK L+ ++ VDISFN G+ ++
Sbjct: 82 ALRHALVEA-------KIAEPLTLKV-LMGASVPIVKMRDLLSDVKVDISFNVKTGIKSV 133
Query: 170 CFLEQVDR---------LIGKDHLFKRSI------------ILIKAWCYYESRILGAHHG 208
++ R L+ K L ++++ +++ C+ ++ GA
Sbjct: 134 SLIKAFIREYQVLPVLVLVLKQFLLQQNLNEVWTGGISSYGLILMVLCFLQNHPRGARAT 193
Query: 209 LISTYALETLVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSE 268
L++ F Y F ++ N +R+ S G +S +
Sbjct: 194 SPENANFGLLLIEFFELYGKNFSYE------NCSIRVKS-------------GGAFISKQ 234
Query: 269 FLKECVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYG 327
+ + +E SR P +L+I DPL E N++GRS K ++ F Y
Sbjct: 235 VMAKQME------------SRGQVPAYLSIEDPLTEWNDIGRSSYKA--LEMKEKFNYA 279
>gi|115441021|ref|NP_001044790.1| Os01g0846500 [Oryza sativa Japonica Group]
gi|56784029|dbj|BAD82657.1| unknown protein [Oryza sativa Japonica Group]
gi|56784702|dbj|BAD81828.1| unknown protein [Oryza sativa Japonica Group]
gi|113534321|dbj|BAF06704.1| Os01g0846500 [Oryza sativa Japonica Group]
gi|222619532|gb|EEE55664.1| hypothetical protein OsJ_04062 [Oryza sativa Japonica Group]
Length = 381
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 45/191 (23%), Positives = 83/191 (43%), Gaps = 17/191 (8%)
Query: 43 EEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRN---YLGCEVFPFGSVPLKTYLPDGDI 99
E+ T+ I++ ++P +R I + I + G V PFGS + Y GD+
Sbjct: 13 EKCTEDILSLIKPVEGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKSGDL 72
Query: 100 DLTA--FGGLN--VEEALANDVCSVLEREDQNKAAEFVVKDAQLI-RAEVKLVKCLVQN- 153
D++ F LN + + D + R Q + + + + I A V +++ +
Sbjct: 73 DVSVELFNALNLPISKRKKQDTLREVRRALQKRG---IARHMEFIPNARVPVLQYVSNQY 129
Query: 154 -IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLIST 212
I DIS + G ++ L D F ++L+K W ++ I +G +++
Sbjct: 130 GISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDPKNGTLNS 185
Query: 213 YALETLVLYKF 223
Y+L LVL+ F
Sbjct: 186 YSLCLLVLFHF 196
>gi|195375624|ref|XP_002046600.1| GJ12395 [Drosophila virilis]
gi|194153758|gb|EDW68942.1| GJ12395 [Drosophila virilis]
Length = 726
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 71/150 (47%), Gaps = 23/150 (15%)
Query: 81 EVFPFGSVPLKTYLPDGDIDL------TAFGGLNVEEALANDVCSVLEREDQNKAAEFVV 134
+V+PFGS+ L D DIDL T+ ++ L N + + L R D
Sbjct: 117 KVYPFGSLVTGLALKDSDIDLFLEQTDTSSNSMS-HRQLFNKIYNFLRRTD-------CF 168
Query: 135 KDAQLIR-AEVKLVKC--LVQNIVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIIL 191
+D IR A V +++C + + +DI+ + F V L+G+D + +
Sbjct: 169 QDVFAIRHARVPIIRCKHVYSGLSLDINMSSPNSTYNSRF---VAELLGRDVRMRELFLF 225
Query: 192 IKAWCYYESRILGAHHGLISTYALETLVLY 221
+K W + +I+G+ G +++Y L TL+++
Sbjct: 226 LKLWA-KKLKIIGS--GSMTSYCLITLIIF 252
>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 642
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 112/300 (37%), Gaps = 45/300 (15%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ + + P+ + R ID +++ ++ + ++ FGS YLP DID
Sbjct: 181 KDFVHYISPSKSEIKCRNRTIDKLRQAVKQLWSDADLHVFGSFATDLYLPGSDIDCV--- 237
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQL 163
+N D + E K ++ ++R V ++K + + + +D+SF +
Sbjct: 238 -INSRHHDKEDRNYIYELARYLKNEGLAIRMEVIVRTRVPIIKFIEPLSQLHIDVSFERT 296
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
GL + + R D R ++L+ + R+ H G + + + LV Y F
Sbjct: 297 NGLEAARLIREWLR----DSPGLRELVLVIKQFLHSRRLNNVHTGGLGGFTVICLV-YSF 351
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LN RI S ++TP+N G L+ E + V
Sbjct: 352 ---------------LNMHPRIKSND---IDTPDNLGVLLIDFFELYGKNFGYDDVAISI 393
Query: 284 FDTNSRSFPPKH------------LNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
D + P H L I DP NNN+ R N I+ AF GA +L
Sbjct: 394 SDDHPSYIPKSHWKTLELSRSKFSLAIQDPGDPNNNISR--GSFNMKDIKKAFA-GAFEL 450
>gi|365758850|gb|EHN00675.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 112/300 (37%), Gaps = 45/300 (15%)
Query: 47 QGIIAQVQPTVVSEERRKAVIDYVQRLIRN-YLGCEVFPFGSVPLKTYLPDGDIDLTAFG 105
+ + + P+ + R ID +++ ++ + ++ FGS YLP DID
Sbjct: 181 KDFVHYISPSKSEIKCRNRTIDKLRQAVKKLWSDADLHVFGSFATDLYLPGSDIDCV--- 237
Query: 106 GLNVEEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCL--VQNIVVDISFNQL 163
+N D + E K ++ ++R V ++K + + + +D+SF +
Sbjct: 238 -INSRHHDKEDRNYIYELARYLKNEGLAIRMEVIVRTRVPIIKFIEPLSQLHIDVSFERT 296
Query: 164 GGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKF 223
GL + + R D R ++L+ + R+ H G + + + LV Y F
Sbjct: 297 NGLEAARLIREWLR----DSPGLRELVLVIKQFLHSRRLNNVHTGGLGGFTVICLV-YSF 351
Query: 224 LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 283
LN RI S ++TP+N G L+ E + V
Sbjct: 352 ---------------LNMHPRIKSND---IDTPDNLGVLLIDFFELYGKNFGYDDVAISI 393
Query: 284 FDTNSRSFPPKH------------LNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKL 331
D + P H L I DP NNN+ R N I+ AF GA +L
Sbjct: 394 SDDHPSYIPKSHWKTLELSRSKFSLAIQDPGDPNNNISR--GSFNMKDIKKAFA-GAFEL 450
>gi|302823109|ref|XP_002993209.1| hypothetical protein SELMODRAFT_431345 [Selaginella moellendorffii]
gi|300138979|gb|EFJ05729.1| hypothetical protein SELMODRAFT_431345 [Selaginella moellendorffii]
Length = 420
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 80/191 (41%), Gaps = 19/191 (9%)
Query: 41 RAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNYLGCE--VFPFGSVPLKTYLPDGD 98
R A + I+ ++P+ + R A++ ++ L V PFGS T+ D D
Sbjct: 44 RFSRAIEEILGDLEPSQEDRDARAAIVASFDSFVKQTLSGSSVVAPFGSYVTNTFTCDSD 103
Query: 99 IDLTAF----GGLNVEEALANDVCSVLEREDQNKAAEFVVKD--AQLIRAEVKLVKCLVQ 152
+DL+ + L+ EE L L+R + A D + +A V +VK + +
Sbjct: 104 LDLSLYVNRMNPLSREEKL-----YFLKRVTTSLQAMHARYDQIQPIYKATVPVVKFVDR 158
Query: 153 N--IVVDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210
I D+S + G S L + + D F+ +L+K W I A G +
Sbjct: 159 KTGIQCDLSVDNKDGASKSLVLAALSSI---DKRFRPLCLLLKKWA-KSHEINDASAGTL 214
Query: 211 STYALETLVLY 221
S+Y + L ++
Sbjct: 215 SSYVITLLAIF 225
>gi|398015076|ref|XP_003860728.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498950|emb|CBZ34023.1| hypothetical protein, conserved [Leishmania donovani]
Length = 382
Score = 38.9 bits (89), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 93/231 (40%), Gaps = 37/231 (16%)
Query: 22 SSSSSVPSNQTAIGAEYWQRAEEATQGIIAQVQPTVVSEERRKAVIDYVQRLIRNY---L 78
+ +S+ N +G + E + I++Q Q + + E+RR ++Q L++ +
Sbjct: 9 TRASTYRRNGGELGTSMSSKIAELSGRILSQQQYSQILEQRR-----WLQGLLKGVSLDM 63
Query: 79 GCE---------VFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSVLEREDQNKA 129
G + V P+GS+ T DGD D + E + C V+ RE Q K
Sbjct: 64 GAKEEEKSASPVVLPYGSIVSGTSFTDGDADYIVSFPIVSESTRQSGAC-VIARERQEKC 122
Query: 130 ------------AEFVVKDAQLIRAEVKLVKCLVQNIVVDISFN---QLGGLSTLCFLEQ 174
++ + ++ RA V +V+ + ++ F+ L GL L
Sbjct: 123 LSDIFSHIRKCNSDVELHPQRIFRARVPIVQYVRKSAQESTKFDLSLSLDGLKNSLLLRH 182
Query: 175 VDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYKFLD 225
+ D + ++ K W E +IL A G IS YAL + ++ D
Sbjct: 183 Y---MAGDPRLRLGVLGAKQWG-REQQILNARRGWISPYALSIMYIHFMKD 229
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.394
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,787,006,644
Number of Sequences: 23463169
Number of extensions: 516516732
Number of successful extensions: 901194
Number of sequences better than 100.0: 380
Number of HSP's better than 100.0 without gapping: 162
Number of HSP's successfully gapped in prelim test: 218
Number of HSP's that attempted gapping in prelim test: 900092
Number of HSP's gapped (non-prelim): 871
length of query: 712
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 562
effective length of database: 8,839,720,017
effective search space: 4967922649554
effective search space used: 4967922649554
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 81 (35.8 bits)