BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001512
(1063 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|296083100|emb|CBI22504.3| unnamed protein product [Vitis vinifera]
Length = 977
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 394/747 (52%), Positives = 496/747 (66%), Gaps = 51/747 (6%)
Query: 275 HEPSVVNYKLGSQL--EQTELGETSAGE--LGASLELVVKSSI-EQLKQPEVPITIPSTK 329
HEP+ ++ L + + EQ+EL G+ LG V K+S+ E L P + K
Sbjct: 49 HEPAEISPVLSNCIVTEQSELPPEDVGDTILGLPPADVTKNSLTEHLGLPPED----AIK 104
Query: 330 TSATKHLQSSSDLMEKKSCLE---QSETPPNYVANNSACLGRKG--------KRATKSLK 378
T+ L +++ K S +E QSE PP VA S L + G KR K +K
Sbjct: 105 NDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVARYSG-LDQSGSAPKDLANKRTAKLVK 163
Query: 379 NNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYS 438
Y +RS + RVLRSRS E+P + S N VN+ R++K R K R K ADE++
Sbjct: 164 RKYKLRSSVSGSRVLRSRSQEKPKASQPSDNF--VNASASRERKGRKKKRMNKTTADEFA 221
Query: 439 RIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLF 498
RIR HLRYLLNR++YEQNLIDAYS+EGWKG SVEKLKPEKELQRA+SEI RRKL+IRDLF
Sbjct: 222 RIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLF 281
Query: 499 QRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCL 557
Q LDSLCA G FP+SLFDSEGQIDSEDI+CAKC SKD+SADNDIILCDGACDRGFHQ+CL
Sbjct: 282 QHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCL 341
Query: 558 EPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-AAAGHNQD 616
EPPLLKE+IPPDDEGWLCP CDCKVDC+DL+N+ QGT+L + D+WEKVFPE AAAG+NQD
Sbjct: 342 EPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQD 401
Query: 617 PNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG-------SSSDDSDFTSTSDEVEAP 669
N G +SDDS+DN+YDPD DE+ +GD+SSSD SD+SDFTS SD++
Sbjct: 402 NNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVS 461
Query: 670 ADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729
+++ LGL S+DSEDD+++PDAPE+D++V Q SS SDFTSDSED A L+ S N
Sbjct: 462 PNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQ--GSSSSDFTSDSEDFTATLDRRNFSDN 519
Query: 730 DEGAASPLGHSNGQRYKDGGNNESLNNELLSIIK--PGQDGAAPVYGKRSSERLDYKKLY 787
++G + QR ++L +ELLS+++ GQD AP+ KR ERLDYKKL+
Sbjct: 520 EDGL-------DEQRRFGRKKKDTLKDELLSVLESNSGQDN-APLSAKRHVERLDYKKLH 571
Query: 788 DETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNE 847
DE YGNV DSSDDE W+++ PRKR + +S SP+G T + +TK K L
Sbjct: 572 DEAYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNTKDIKHDLEA 631
Query: 848 TENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRH-RTSYRKLGEEVTQKLYNSFK 906
TPKRR R KLN E +N S A+SH+ +PGS G + ++SY+KLGE VT++LY SF+
Sbjct: 632 AGCTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQ 691
Query: 907 ENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANS--EKGTCTPQSN 964
ENQYP+R KE LA+ELG+T QV KWFEN RWSF H K A S +K T Q++
Sbjct: 692 ENQYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQTD 751
Query: 965 KN----TVGRVSNCNGAENVQSSKTGV 987
+ V R S+ NG +S K G
Sbjct: 752 QKPEQEVVLRESSHNGVGKKESPKAGA 778
>gi|225429315|ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera]
Length = 968
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 394/747 (52%), Positives = 496/747 (66%), Gaps = 51/747 (6%)
Query: 275 HEPSVVNYKLGSQL--EQTELGETSAGE--LGASLELVVKSSI-EQLKQPEVPITIPSTK 329
HEP+ ++ L + + EQ+EL G+ LG V K+S+ E L P + K
Sbjct: 49 HEPAEISPVLSNCIVTEQSELPPEDVGDTILGLPPADVTKNSLTEHLGLPPED----AIK 104
Query: 330 TSATKHLQSSSDLMEKKSCLE---QSETPPNYVANNSACLGRKG--------KRATKSLK 378
T+ L +++ K S +E QSE PP VA S L + G KR K +K
Sbjct: 105 NDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVARYSG-LDQSGSAPKDLANKRTAKLVK 163
Query: 379 NNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYS 438
Y +RS + RVLRSRS E+P + S N VN+ R++K R K R K ADE++
Sbjct: 164 RKYKLRSSVSGSRVLRSRSQEKPKASQPSDNF--VNASASRERKGRKKKRMNKTTADEFA 221
Query: 439 RIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLF 498
RIR HLRYLLNR++YEQNLIDAYS+EGWKG SVEKLKPEKELQRA+SEI RRKL+IRDLF
Sbjct: 222 RIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLF 281
Query: 499 QRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCL 557
Q LDSLCA G FP+SLFDSEGQIDSEDI+CAKC SKD+SADNDIILCDGACDRGFHQ+CL
Sbjct: 282 QHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCL 341
Query: 558 EPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-AAAGHNQD 616
EPPLLKE+IPPDDEGWLCP CDCKVDC+DL+N+ QGT+L + D+WEKVFPE AAAG+NQD
Sbjct: 342 EPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQD 401
Query: 617 PNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG-------SSSDDSDFTSTSDEVEAP 669
N G +SDDS+DN+YDPD DE+ +GD+SSSD SD+SDFTS SD++
Sbjct: 402 NNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVS 461
Query: 670 ADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729
+++ LGL S+DSEDD+++PDAPE+D++V Q SS SDFTSDSED A L+ S N
Sbjct: 462 PNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQ--GSSSSDFTSDSEDFTATLDRRNFSDN 519
Query: 730 DEGAASPLGHSNGQRYKDGGNNESLNNELLSIIK--PGQDGAAPVYGKRSSERLDYKKLY 787
++G + QR ++L +ELLS+++ GQD AP+ KR ERLDYKKL+
Sbjct: 520 EDGL-------DEQRRFGRKKKDTLKDELLSVLESNSGQDN-APLSAKRHVERLDYKKLH 571
Query: 788 DETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNE 847
DE YGNV DSSDDE W+++ PRKR + +S SP+G T + +TK K L
Sbjct: 572 DEAYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNTKDIKHDLEA 631
Query: 848 TENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRH-RTSYRKLGEEVTQKLYNSFK 906
TPKRR R KLN E +N S A+SH+ +PGS G + ++SY+KLGE VT++LY SF+
Sbjct: 632 AGCTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQ 691
Query: 907 ENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANS--EKGTCTPQSN 964
ENQYP+R KE LA+ELG+T QV KWFEN RWSF H K A S +K T Q++
Sbjct: 692 ENQYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQTD 751
Query: 965 KN----TVGRVSNCNGAENVQSSKTGV 987
+ V R S+ NG +S K G
Sbjct: 752 QKPEQEVVLRESSHNGVGKKESPKAGA 778
>gi|224060647|ref|XP_002300247.1| predicted protein [Populus trichocarpa]
gi|222847505|gb|EEE85052.1| predicted protein [Populus trichocarpa]
Length = 930
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 449/987 (45%), Positives = 574/987 (58%), Gaps = 111/987 (11%)
Query: 1 MSKAEHMGVSPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVENEP 60
MS+AEHMGVSPS+V S T+ +SC QTT E + AE E + L E ++ E
Sbjct: 1 MSEAEHMGVSPSKV-SHTKSYSCPAQTTLENTHEPSAEYKFGGYPEERHKLECEIIQTEA 59
Query: 61 RETSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKASCSQQNTSEQKHGTELM 120
+ + LQ+ G+V + P ++D+ KS
Sbjct: 60 GDN---RAAVLQSCSGEV--------VQPSTDDLTKSPL--------------------- 87
Query: 121 HNEQSEQKHQLCYQIVFDKP---QATSLVDNATLQPVSKDVSKSSQTGNRQALDFLSGNR 177
I D P ++L DN+ +P+S + + + G
Sbjct: 88 --------------IDLDPPPDDARSALFDNSP-RPISTAMDQKLEPG------------ 120
Query: 178 CNELDVDCVHSEPLNQKHQLGSEIIQNEPAVNIARLPSDGVEENLQTISEDLTKVCPVEP 237
++ CVHSE + S I+ +EP + L S E Q E L E
Sbjct: 121 ATSVNTACVHSE---SSKAIDSSILLDEPRNSNTELSSCIANETSQASLEGLANDSRAED 177
Query: 238 SQSPPRDANKSCQAGEISCLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETS 297
+ +A+ S E S QQ++S QT EF + +P K GS+L + E E
Sbjct: 178 AGLSLVEASNSDLIDESSYSQQTTSGQTREFHSDRACCKPLEERQKPGSELAENESMEIG 237
Query: 298 AG--------ELGASLELVVKSSIEQLKQPEVP----ITIPSTKTSATKHLQSSSDLMEK 345
G L ELV KS +K +P I+IP A + ++ + D K
Sbjct: 238 IGLPSGIAIENLEPLTELVTKSC--PIKHIGLPPGDDISIP-----ANEQIRPTHDKESK 290
Query: 346 KSCLEQSETPPNYVANNSACLGRKGKRATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPE 405
E E V ++ KR +K YT S SDRVLRS S E+P PE
Sbjct: 291 YPDCEHLEKLSGIVIGITSQGVPSVKRTSKLSGKKYTSSSR-KSDRVLRSNSQEKPKAPE 349
Query: 406 SSINLADVNSIGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEG 465
S N +VNS GE K K+R K R K IVADEYSRIR LRYLLNR++YEQ+LI AYS EG
Sbjct: 350 PSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQSLITAYSGEG 409
Query: 466 WKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSED 524
WKGLS+EKLKPEKELQRATSEI+RRK+KIRDLFQ +DSLC G FP SLFDSEGQIDSED
Sbjct: 410 WKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLFDSEGQIDSED 469
Query: 525 IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDC 584
I+CAKCGSKDL+ADNDIILCDGACDRGFHQ+CL PPLL+EDIPP DEGWLCPGCDCKVDC
Sbjct: 470 IFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWLCPGCDCKVDC 529
Query: 585 IDLVNELQGTRLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPDGSATDEQ 641
IDL+N+ QGT + I+D W+ VFPEAAA G D NFGL+SDDSDDN+YDPDG DE+
Sbjct: 530 IDLLNDSQGTNISISDRWDNVFPEAAAVASGQKLDYNFGLSSDDSDDNDYDPDGPDIDEK 589
Query: 642 DEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQ 701
+ + SS + +S SDE EAP DDK YLGL S+DSEDD+Y+PDAP L++K+ Q
Sbjct: 590 SQEESSSDESDF------SSASDEFEAPPDDKQYLGLPSDDSEDDDYDPDAPVLEEKLKQ 643
Query: 702 ESSSSGSDFTSDSEDLAAVLEDNRSSGND-EGAASPLGHSNGQRYKDGG-NNESLNNELL 759
ESSSS S+ D A + D S G++ P SNG+R + GG N SLN++LL
Sbjct: 644 ESSSSDFTSDSEDLD-ATLNGDGLSLGDEYHMPIEPHEDSNGRRSRFGGKKNHSLNSKLL 702
Query: 760 SIIKP--GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKST 817
S+++P Q+ +APV GKR+ ERLDYKKLYDETYGN+ S DD ++D PRKR K+T
Sbjct: 703 SMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGNICTSSDDD--FTDTVAPRKRRKNT 760
Query: 818 KEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNT--EDSNISPAKSHEG 875
+ + +G V ++K ++L + E+T GR N+ +D+N+SPAK+H G
Sbjct: 761 GDVAMGIANGDASVTENGLNSKNMNQELKKNEHT---SGRTHQNSSFQDTNVSPAKTHVG 817
Query: 876 CSTPGSRGRRHR-TSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWF 934
S GS +R R ++Y+KLGE VTQKLY+ FKEN+YP++ K SLA+ELG+TF QV KWF
Sbjct: 818 ESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQAAKASLAEELGITFEQVNKWF 877
Query: 935 ENTRWSFNHPSSKNAKLANSE--KGTC 959
N RWSFNH S + A S KG+C
Sbjct: 878 MNARWSFNHSSPEGTSKAESASGKGSC 904
>gi|255573075|ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis]
gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1, putative [Ricinus communis]
Length = 896
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 414/758 (54%), Positives = 500/758 (65%), Gaps = 44/758 (5%)
Query: 329 KTSATKHLQSSSDLMEKKSCLEQSETPPNYVANNSACLGRKGKRATKSLKNNYTVRSLIG 388
K A++ L D +K QSE +NS+ LGR+ K KS K Y +R L
Sbjct: 159 KVPASEKLGPPHDAEDKHWNGTQSEILSKDAVSNSSRLGRRVKTTAKSRKK-YMLRCLRR 217
Query: 389 SDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYSRIRTHLRYLL 448
SDRV++ RS E+P PESS NL +V+S E+ +KK+ K RK + ADEYS IR +LRYLL
Sbjct: 218 SDRVMQYRSQEKPKAPESSTNLPNVSSNVEKTRKKKKKRERKSVEADEYSIIRKNLRYLL 277
Query: 449 NRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-G 507
NRI YEQ+LI AYS+EGWKGLS+EKLKPEKELQRATSEILRRK KIRDLFQR+DSLC G
Sbjct: 278 NRIGYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKSKIRDLFQRIDSLCGEG 337
Query: 508 GFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIP 567
FP+SLFDS+GQI SEDI+CAKCGSKDL+ADNDIILCDGACDRGFHQYCL PPLLKEDIP
Sbjct: 338 RFPESLFDSDGQISSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQYCLVPPLLKEDIP 397
Query: 568 PDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA-GHNQDPNFGLASDDS 626
PDD+GWLCPGCDCKVDCIDL+NE QGT + I+D+WEKVFPEAAA G N D NFG SDDS
Sbjct: 398 PDDQGWLCPGCDCKVDCIDLLNESQGTNISISDSWEKVFPEAAAPGQNPDQNFGPPSDDS 457
Query: 627 DDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDD 686
DDN+YDPD DE+ + SS D D TSDE+EAP DK LGLSSEDS DD
Sbjct: 458 DDNDYDPDIPEIDEK----SQGDESSSDDSDDSDFTSDELEAPPGDKQQLGLSSEDSGDD 513
Query: 687 EYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDE--------GAASPLG 738
+Y+PDAP+LDD V + SS SDFTSDSEDLAA L++N SG DE G ++ G
Sbjct: 514 DYDPDAPDLDDIV--KEESSSSDFTSDSEDLAATLDNNELSGEDERRISVGTRGDSTKEG 571
Query: 739 HSNGQRYKDGGNNESLNNELLSIIK--PGQDGAAPVYGKRSSERLDYKKLYDETYGNVPY 796
G++ K +SL +ELLSI + P QDG+AP+ GKR+ ERLDYKKLYDETYGNV
Sbjct: 572 SKRGRKKK-----QSLQSELLSIEEPNPSQDGSAPISGKRNVERLDYKKLYDETYGNVSS 626
Query: 797 DSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRG 856
DSSDDE ++DD G KR KST + + S +G V T K+ L ETE PK R
Sbjct: 627 DSSDDEDFTDDVGAVKRRKST-QAALGSANGNASV------TDTGKQDLKETEYVPK-RS 678
Query: 857 RPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTS-YRKLGEEVTQKLYNSFKENQYPNRTT 915
R +L +E+++I+P K+HEG S S G+ R S YR+LGE VT+ LY SFKENQYP+R
Sbjct: 679 RQRLISENTSITPTKAHEGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQYPDRDR 738
Query: 916 KESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLAN---------SEKGTCTPQSNKN 966
KE LA+ELG+T+ QV KWFEN RWSFNH SS +A S+ T +S
Sbjct: 739 KEHLAEELGITYQQVTKWFENARWSFNHSSSMDANRIGKTPENNSPVSKTTTILLESAPE 798
Query: 967 TVGRVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQEC-NSIKPTSQTSRKRDRDGKSGD 1025
TV + + A+ +S K G D + S+T + R R SGD
Sbjct: 799 TVSGAAIDSAAQREESPKIGDAMVEIYVEDARETVLGIPKCCAQNSKTPKSRKRKHNSGD 858
Query: 1026 QASDPSSKMEVIQGLSANSPKV-EVQANGRTRRRRNSA 1062
+ SD SK E + AN PK E + GR R +++A
Sbjct: 859 RLSDLESKKEEAKIAPANLPKAQETRVGGRVTRSKSAA 896
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 101/231 (43%), Gaps = 53/231 (22%)
Query: 1 MSKAEHMGVSPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVENEP 60
M KA+HMGVS SQ +S T+ +SC KQ+TPE+ EC + + ++S L SE V
Sbjct: 1 MFKADHMGVSSSQASSHTKSYSCPKQSTPEET----PECGDTSTVATQSQLSSEGVNK-- 54
Query: 61 RETSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKASCSQQNTSEQ------- 113
S T++L P SE+ KS+ + ++ + ++
Sbjct: 55 --------------------GSLTENLVPTSEEACKSSLIDTSTSPKTAIDQKLGFVSDD 94
Query: 114 ---KHGTELMHNEQSEQKHQLCYQIVFDKPQATSLVDNATLQPVSKDVSKSSQTGNRQAL 170
K GT +HN QS++ L IV ++ N TL P+ +D SKS AL
Sbjct: 95 THIKCGTVSVHNGQSKRNGSLGSGIVQHDSAISTFAVNETLHPLHQDASKS-------AL 147
Query: 171 DFLSGNRCNELDVDCVHSEPLN------QKHQLG--SEIIQNEPAVNIARL 213
+ NE+ V SE L KH G SEI+ + N +RL
Sbjct: 148 GHMEPPPNNEMKVPA--SEKLGPPHDAEDKHWNGTQSEILSKDAVSNSSRL 196
>gi|356574291|ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Glycine max]
Length = 820
Score = 569 bits (1467), Expect = e-159, Method: Compositional matrix adjust.
Identities = 395/879 (44%), Positives = 530/879 (60%), Gaps = 105/879 (11%)
Query: 196 QLGSEIIQNEPAVNIARLPSDGVEENLQTISEDLTKVCPVEPSQSPPRDANKSCQAGEIS 255
Q+GSE ++ E + L S ++E +S D+T+ ++ P D K+CQ E S
Sbjct: 32 QIGSEGLEREQKELLTELTSFVIDEKSNQVSADVTENSVIQLPAPPQHDFEKNCQTVEGS 91
Query: 256 CLQQSSSEQTP-EFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGASLELVVKSSI- 313
CL+QS+ EQ + + S ++ ++ + S+ ++ G++ +S SS+
Sbjct: 92 CLEQSTVEQVSVDLSNDKSENKCKPLSENVQSEPVESIPAFVVDGQMQSSPAQANMSSVN 151
Query: 314 EQLKQPEVPITIPSTKTSATKHLQSSSDLMEK-KSCLEQSETPPNYVANNSACLGRKGKR 372
E L QP S D++ +C E+ P++ + RKGKR
Sbjct: 152 ELLDQP-------------------SGDVVNNITNCSEKMSNSPSHSQSR-----RKGKR 187
Query: 373 ATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIG--ERKQKKRNKIRRK 430
+K LK Y +RSL S R LRSR+ E+P PE + NL D NS +RK ++ K RR+
Sbjct: 188 NSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNLVDGNSNDGVKRKSGRKKKKRRE 247
Query: 431 KIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRR 490
+ + D++SRIR+HLRYLLNRI+YE +LIDAYS EGWKG S+EKLKPEKELQRA SEILRR
Sbjct: 248 EGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLKPEKELQRAKSEILRR 307
Query: 491 KLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACD 549
KLKIRDLF+ LDSLCA G FP+SLFDS G+IDSEDI+CAKC SK+LS +NDIILCDG CD
Sbjct: 308 KLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCD 367
Query: 550 RGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEA 609
RGFHQ CL+PPLL EDIPP DEGWLCPGCDCK DC+DLVN+ GT L I+D WE+VFPEA
Sbjct: 368 RGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEA 427
Query: 610 A--AGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVE 667
A AG+N D N GL SDDSDD++Y+P+GS D + EGDE SSSD+S++ S S+++E
Sbjct: 428 ASFAGNNMDNNLGLPSDDSDDDDYNPNGS-DDVKIEGDE-----SSSDESEYASASEKLE 481
Query: 668 APADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSS 727
+ + YLGL SEDS+D +Y+PDAP++D KV +ESSSS TSDSEDLAA EDN S
Sbjct: 482 GGSHEDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDF--TSDSEDLAAAFEDNTSP 539
Query: 728 GNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKP--GQDGAAPVYGKRSSERLDYKK 785
G D G N + K S+ +EL S+++P GQ G PV GKR ERLDYKK
Sbjct: 540 GQDGGI-------NSSKKKGKVGKLSMADELSSLLEPDSGQGGPTPVSGKRHVERLDYKK 592
Query: 786 LYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKL 845
LY+ETY + D+SDDE W+D P ++ K T + SP+
Sbjct: 593 LYEETYHS---DTSDDEDWNDAAAPSRKKKLTGNVTPVSPNANAS--------------- 634
Query: 846 NETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTS-YRKLGEEVTQKLYNS 904
N + +T KR E++N SP KS +G S GSR +R +S +++LGE V Q+L+ S
Sbjct: 635 NNSIHTLKRNAHQN-KVENTNSSPTKSLDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKS 693
Query: 905 FKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKGTCTPQSN 964
FKENQYP+R+TKESLA+ELGLT+ QV KWF+NTRWSF H S ++ + +P++
Sbjct: 694 FKENQYPDRSTKESLAQELGLTYQQVAKWFDNTRWSFRHSS----QMETNSGRNASPEA- 748
Query: 965 KNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQECNSIKPTSQTSRKRDRDGKSG 1024
T GR N ++C S+ P + +
Sbjct: 749 --TDGRAEN-------------------------EGEKQCESMSPEVSGKNSKTTSSRKR 781
Query: 1025 DQASDPSSKMEV-IQGLSANSPKV---EVQANGRTRRRR 1059
S+P S+ ++ I GL+ +SP V +V +TR+R+
Sbjct: 782 KHLSEPLSEAQLDINGLATSSPNVHQTQVGNKMKTRKRK 820
Score = 39.7 bits (91), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 51 LGSEAVENEPRE-----TSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKASC 105
+GSE +E E +E TS EK DV ++S APP D K+ QT + SC
Sbjct: 33 IGSEGLEREQKELLTELTSFVIDEKSNQVSADVTENSVIQLPAPPQHDFEKNCQTVEGSC 92
Query: 106 SQQNTSEQ 113
+Q+T EQ
Sbjct: 93 LEQSTVEQ 100
>gi|356534303|ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306715 [Glycine max]
Length = 963
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 395/875 (45%), Positives = 527/875 (60%), Gaps = 97/875 (11%)
Query: 196 QLGSEIIQNEPAVNIARLPSDGVEENLQTISEDLTKVCPVEPSQSPPRDANKSCQAGEIS 255
Q+GSE ++NE L S +EE +S +T+ ++ + D K+CQ E S
Sbjct: 175 QIGSEGLENEQKELGTELTSSVIEEKSNQVSAIVTENAVIQLPEPLQHDLQKNCQTVEGS 234
Query: 256 CLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGASLELVVKSSIEQ 315
CL+QS+ EQ T +S+ +P L ++ + A +E ++S+ Q
Sbjct: 235 CLEQSTVEQV---TVDLSNDKPENKCKPLSENVQSEPVESIPA----VVVEGQMQSNPSQ 287
Query: 316 LKQPEVPITIPSTKTSATKHLQSSSDLMEKKSCLEQSETPPNYVANNSACLGRKGKRATK 375
V + A ++ S+ C E+ P + + RKGK+ +K
Sbjct: 288 ANMSSVNELLDQPSGDAVNNISSN--------CSEKMSNSPTHSQSR-----RKGKKNSK 334
Query: 376 SLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIG-ERKQKKRNKIRRKKIVA 434
LK Y +RSL SDR LRSR+ E+P PE + NL D N+ G +RK ++ K R+++ +
Sbjct: 335 LLKK-YMLRSLGSSDRALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGIT 393
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
+++SRIR+HLRYLLNRI+YE +LIDAYS EGWKG S+EKLKPEKELQRA SEILRRKLKI
Sbjct: 394 NQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKI 453
Query: 495 RDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFH 553
RDLFQ LDSLCA G FP+SLFDS G+IDSEDI+CAKC SK+LS +NDIILCDG CDRGFH
Sbjct: 454 RDLFQNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFH 513
Query: 554 QYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA--A 611
Q CL+PP+L EDIPP DEGWLCPGCDCK DC+DLVN+ GT L I+D WE+VFPEAA A
Sbjct: 514 QLCLDPPMLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEAASFA 573
Query: 612 GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPAD 671
G+N D N G+ SDDSDD++Y+P+G D + EGDE SSSD+S++ S S+++E +
Sbjct: 574 GNNMDNNSGVPSDDSDDDDYNPNG-PDDVKVEGDE-----SSSDESEYASASEKLEGGSH 627
Query: 672 DKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDE 731
+ YLGL SEDS+D +Y+PDAP+++ KV +ESSSS TSDSEDLAA +EDN S G D
Sbjct: 628 EDQYLGLPSEDSDDGDYDPDAPDVECKVNEESSSSDF--TSDSEDLAAAIEDNTSPGQDG 685
Query: 732 GAASPLGHSNGQRYKDGGNNESLNNELLSIIKP--GQDGAAPVYGKRSSERLDYKKLYDE 789
G +S ++ G SL +EL S+++P GQ+ PV GKR ERLDYKKLY+E
Sbjct: 686 GISS------SKKKGKVGKKLSLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEE 739
Query: 790 TYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETE 849
TY + D+SDDE W+D P + K T + SP+G N +
Sbjct: 740 TYHS---DTSDDEDWNDTAAPSGKKKLTGNVTPVSPNGNAS---------------NNSI 781
Query: 850 NTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTS-YRKLGEEVTQKLYNSFKEN 908
+TPKR N E++N SP KS EGCS GSR ++ +S +++LGE V Q+L+ SFKEN
Sbjct: 782 HTPKRNAHQN-NVENTNNSPTKSLEGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKEN 840
Query: 909 QYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKGTCTPQSNKNTV 968
QYP+RTTKESLA+ELGLT+ QV KWF NTRWSF H S E + S + T
Sbjct: 841 QYPDRTTKESLAQELGLTYQQVAKWFGNTRWSFRHSSQM-------ETNSGINASQQVTD 893
Query: 969 GRVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQECNSIKPTSQTSRKRDRDGKSGDQAS 1028
GR N +EC I + + + + S
Sbjct: 894 GRAEN-------------------------EGEKECELISLEFSGEKSKTPNSRKRKHLS 928
Query: 1029 DPSSKMEV-IQGLSANSPKVEVQANG---RTRRRR 1059
+P S+ ++ I G +A+SP V + G +TR+R+
Sbjct: 929 EPLSEAQLDINGSAASSPNVHLTQIGNKMKTRKRK 963
>gi|449501740|ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cucumis sativus]
Length = 749
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 371/754 (49%), Positives = 459/754 (60%), Gaps = 79/754 (10%)
Query: 359 VANNSACLGRKGKRATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNS--I 416
+ +NS RK K KS K NY +RS + SDRVLRSR+ E+ PE S +L + +
Sbjct: 20 IPSNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEED 79
Query: 417 GERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKP 476
G+RK+KK+ I+ K DEYS IR HLRYLLNRI YEQ+LI+AYSSEGWKG S +KLKP
Sbjct: 80 GKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKP 139
Query: 477 EKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDL 535
EKELQRA++EI+RRKLKIRDLFQR+D+LCA G +SLFDSEGQIDSEDI+CAKCGSK+L
Sbjct: 140 EKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKEL 199
Query: 536 SADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTR 595
S +NDIILCDG CDRGFHQ+CLEPPLL DIPPDDEGWLCPGCDCK DC+DL+NE QG+
Sbjct: 200 SLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSN 259
Query: 596 LFITDNWEKVFPE---AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQD--------EG 644
L ITD WEKV+PE AAAG N D GL SDDS+D +YDPD T +QD
Sbjct: 260 LSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSS 319
Query: 645 DESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESS 704
D+S+SD S+SD S + S S+ +E ++D YLGL S+DSED++Y+P PELD+ V QESS
Sbjct: 320 DQSNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESS 379
Query: 705 SSGSDFTSDSEDLAAVLEDNRSSGNDEGAAS------PLGHSNGQRYKDGGNNESLNNEL 758
SS TSDSEDLAA+ DN S D S P+ +SNGQ G N +L+NEL
Sbjct: 380 SSDF--TSDSEDLAAL--DNNCSSKDGDLVSSLNNTLPVKNSNGQ--SSGPNKSALHNEL 433
Query: 759 LSIIK--PGQDGAAPVYGKRSSERLDYKKLYDETYGNVP-----------YDSSDDESWS 805
S++ P +DG PV G+R ERLDYKKL+DETYGNVP DSSDD W
Sbjct: 434 SSLLDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGW- 492
Query: 806 DDGGPRKRTKST---KEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNT 862
D G RKR T ++ S D T V +T+ + KRR R K
Sbjct: 493 -DSGTRKRGPKTLVLALSNNGSNDDLTNV---------------KTKRSYKRRTRQKPGA 536
Query: 863 EDSNISPAKS-HEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAK 921
+ N S ++ + + S + +S R+L + ++L SF+EN+YP R TK+SLA+
Sbjct: 537 INVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQ 596
Query: 922 ELGLTFSQVRKWFENTRWSFNHPSSKNAKL------------ANSEKGTCTPQSNKNTVG 969
ELGL QV KWFENTRWS HPSS K A+ E P+S T
Sbjct: 597 ELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSKNEPES--ATCF 654
Query: 970 RVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQECNSIK-PTSQTSRKRDRDGKSGDQAS 1028
R ++ NGA + C +GD + K S ++ R R G+S + AS
Sbjct: 655 RDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTAS 714
Query: 1029 DPSSKMEVIQGLSANSPKV-EVQANGR--TRRRR 1059
S E A SPKV E+Q R TRRRR
Sbjct: 715 H-SKDREGSPRPPAKSPKVNEMQTADRFKTRRRR 747
>gi|449446105|ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204775 [Cucumis sativus]
Length = 1061
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 366/747 (48%), Positives = 454/747 (60%), Gaps = 77/747 (10%)
Query: 359 VANNSACLGRKGKRATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNS--I 416
+ +NS RK K KS K NY +RS + SDRVLRSR+ E+ PE S +L + +
Sbjct: 252 IPSNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEED 311
Query: 417 GERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKP 476
G+RK+KK+ I+ K DEYS IR HLRYLLNRI YEQ+LI+AYSSEGWKG S +KLKP
Sbjct: 312 GKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKP 371
Query: 477 EKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDL 535
EKELQRA++EI+RRKLKIRDLFQR+D+LCA G +SLFDSEGQIDSEDI+CAKCGSK+L
Sbjct: 372 EKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKEL 431
Query: 536 SADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTR 595
S +NDIILCDG CDRGFHQ+CLEPPLL DIPPDDEGWLCPGCDCK DC+DL+NE QG+
Sbjct: 432 SLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSN 491
Query: 596 LFITDNWEKVFPE---AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQD--------EG 644
L ITD WEKV+PE AAAG N D GL SDDS+D +YDPD T +QD
Sbjct: 492 LSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSS 551
Query: 645 DESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESS 704
D+S+SD S+SD S + S S+ +E ++D YLGL S+DSED++Y+P PELD+ V QESS
Sbjct: 552 DQSNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESS 611
Query: 705 SSGSDFTSDSEDLAAVLEDNRSSGNDEGAAS------PLGHSNGQRYKDGGNNESLNNEL 758
SS TSDSEDLAA+ DN S D S P+ +SNGQ G N +L+NEL
Sbjct: 612 SSDF--TSDSEDLAAL--DNNCSSKDGDLVSSLNNTLPVKNSNGQ--SSGPNKSALHNEL 665
Query: 759 LSIIK--PGQDGAAPVYGKRSSERLDYKKLYDETYGNVP-----------YDSSDDESWS 805
S++ P +DG PV G+R ERLDYKKL+DETYGNVP DSSDD W
Sbjct: 666 SSLLDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGW- 724
Query: 806 DDGGPRKRTKST---KEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNT 862
D G RKR T ++ S D T V +T+ + KRR R K
Sbjct: 725 -DSGTRKRGPKTLVLALSNNGSNDDLTNV---------------KTKRSYKRRTRQKPGA 768
Query: 863 EDSNISPAKS-HEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAK 921
+ N S ++ + + S + +S R+L + ++L SF+EN+YP R TK+SLA+
Sbjct: 769 INVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQ 828
Query: 922 ELGLTFSQVRKWFENTRWSFNHPSSKNAKL------------ANSEKGTCTPQSNKNTVG 969
ELGL QV KWFENTRWS HPSS K A+ E P+S T
Sbjct: 829 ELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSKNEPES--ATCF 886
Query: 970 RVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQECNSIK-PTSQTSRKRDRDGKSGDQAS 1028
R ++ NGA + C +GD + K S ++ R R G+S + AS
Sbjct: 887 RDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTAS 946
Query: 1029 DPSSKMEVIQGLSANSPKV-EVQANGR 1054
S E A SPKV E+Q R
Sbjct: 947 H-SKDREGSPRPPAKSPKVNEMQTADR 972
>gi|147822101|emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera]
Length = 611
Score = 467 bits (1201), Expect = e-128, Method: Compositional matrix adjust.
Identities = 299/551 (54%), Positives = 375/551 (68%), Gaps = 44/551 (7%)
Query: 275 HEPSVVNYKLGSQL--EQTELGETSAGE--LGASLELVVKSSI-EQLKQPEVPITIPSTK 329
HEP+ ++ L + + EQ+EL G+ LG V K+S+ E L P + K
Sbjct: 49 HEPAEISPVLSNCIVTEQSELPPEDVGDTILGLPPADVTKNSLXEHLGLPPED----AIK 104
Query: 330 TSATKHLQSSSDLMEKKSCLE---QSETPPNYVANNSACLGRKG--------KRATKSLK 378
T+ L +++ K S +E QSE PP VA S L + G KR K +K
Sbjct: 105 NDGTEQLGXFPEVVTKSSIIEKLGQSEPPPENVARYSG-LDQSGSAPKDLANKRTAKLVK 163
Query: 379 NNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYS 438
Y +RS + RVLRSRS E+P + S N VN+ R++K R K R K ADE++
Sbjct: 164 RKYKLRSSVSGSRVLRSRSQEKPKASQPSDNF--VNASASRERKGRKKKRMNKTTADEFA 221
Query: 439 RIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLF 498
RIR HLRYLLNR++YEQNLIDAYS+EGWKG SVEKLKPEKELQRA+SEI RRKL IRDLF
Sbjct: 222 RIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLXIRDLF 281
Query: 499 QRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCL 557
Q LDSLCA G FP+SLFDSEGQIDSEDI+CAKC SKD+SADNDIILCDGACDRGFHQ+CL
Sbjct: 282 QHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCL 341
Query: 558 EPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-AAAGHNQD 616
EPPLLKE+IPPDDEGWLCP CDCKVDC+DL+N+ QGT+L + D+WEKVFPE AAAG+NQD
Sbjct: 342 EPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQD 401
Query: 617 PNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG-------SSSDDSDFTSTSDEVEAP 669
N G +SDDS+DN+YDPD DE+ +GD+SSSD SD+SDFTS SD++
Sbjct: 402 NNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVS 461
Query: 670 ADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729
+++ LGL S+DSEDD+++PDAPE+D++V Q SSSS TSDSED A L+ S N
Sbjct: 462 PNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQGSSSSDF--TSDSEDFTATLDRRNFSDN 519
Query: 730 DEGAASPLGHSNGQRYKDGGNNESLNNELLSIIK--PGQDGAAPVYGKRSSERLDYKKLY 787
++G + QR ++L +ELLS+++ GQD AP+ KR ERLDYKKL+
Sbjct: 520 EDGL-------DEQRRFGRKKKDTLKDELLSVLESNSGQDN-APLSAKRHVERLDYKKLH 571
Query: 788 DETYGNVPYDS 798
D + + P+ S
Sbjct: 572 DVSAHDSPFSS 582
>gi|16326|emb|CAA49263.1| HAT 3.1 [Arabidopsis thaliana]
Length = 660
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 294/629 (46%), Positives = 369/629 (58%), Gaps = 80/629 (12%)
Query: 413 VNSIGERKQKKRNKIRRKKIVA--DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLS 470
NS + KK+NK K V DEY+RI+ LRY LNRINYEQ+LIDAYS EGWKG S
Sbjct: 89 ANSTPVGRPKKKNKTMNKGQVREDDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSS 148
Query: 471 VEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAK 529
+EK++PEKEL+RAT EILR LK+ DLFQ LD+LCA G P+SLFD++G+I SEDI+CAK
Sbjct: 149 LEKIRPEKELERATKEILRASLKL-DLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAK 207
Query: 530 CGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVN 589
CGSKDLS DNDIILCDG CDRGFHQYCLEPPL KEDIPPDDEGWLCPGCDCK D +DL+N
Sbjct: 208 CGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLN 267
Query: 590 ELQGTRLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPDGSATDEQDE--- 643
+ GT+ ++D+WEK+FPEAAA G Q+ + L S DSDD EYDPD +E DE
Sbjct: 268 DSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSADSDDEEYDPDCLNDNENDEDGS 327
Query: 644 --GDESSSDGSSSDDSDFTSTSDEV-----EAPADDKTYLGLSSEDSEDDEYNPDAPELD 696
+ES ++ SSD+++FTS SDE+ E K + L S+DSEDD+Y+PDAP D
Sbjct: 328 DDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPSDDSEDDDYDPDAPTCD 387
Query: 697 DKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNN 756
D + SS SD TSD+EDL + + + N + +PL Q + G+
Sbjct: 388 D----DKESSNSDCTSDTEDLETSFKGDET--NQQAEDTPLEDPGRQTSQLQGD------ 435
Query: 757 ELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKS 816
+L DG A V +R+ ERLDYKKLYDE Y NVP SSDD+ K +
Sbjct: 436 AILESDVGLDDGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSDDDD------WDKTARM 489
Query: 817 TKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGC 876
KE S + +G T + K+ N ++T K+ R K D + EG
Sbjct: 490 GKEDSESEDEGDTVPL---------KQSSNAEDHTSKKLIR-KSKRADKKDTLEMPQEGP 539
Query: 877 STPGSRGRRHRTSYR--KLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWF 934
G G ++S K + TQ+LY SF+ENQYP++ TKESLAKEL +T QV WF
Sbjct: 540 GENGGSGEIEKSSSSACKQTDPKTQRLYISFQENQYPDKATKESLAKELQMTVKQVNNWF 599
Query: 935 ENTRWSFNHPSSKNAKLANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMT 994
++ RWS N +K SE ENV+ KTG + C T
Sbjct: 600 KHRRWSIN------SKPLVSE----------------------ENVEKLKTGKEGE-CET 630
Query: 995 --GDVKNNTQECNSI--KPTSQTSRKRDR 1019
T E S+ KPT+ SRKR R
Sbjct: 631 SVAGSSKQTMETESVAEKPTNTGSRKRRR 659
>gi|15230408|ref|NP_188582.1| homeobox protein HAT3.1 [Arabidopsis thaliana]
gi|148886602|sp|Q04996.3|HAT31_ARATH RecName: Full=Homeobox protein HAT3.1
gi|26449313|dbj|BAC41784.1| putative homeobox protein HAT3.1 [Arabidopsis thaliana]
gi|29029042|gb|AAO64900.1| At3g19510 [Arabidopsis thaliana]
gi|332642729|gb|AEE76250.1| homeobox protein HAT3.1 [Arabidopsis thaliana]
Length = 723
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 295/637 (46%), Positives = 367/637 (57%), Gaps = 95/637 (14%)
Query: 413 VNSIGERKQKKRNKIRRKKIVA--DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLS 470
NS + KK+NK K V DEY+RI+ LRY LNRINYEQ+LIDAYS EGWKG S
Sbjct: 151 ANSTPVGRPKKKNKTMNKGQVREDDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSS 210
Query: 471 VEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAK 529
+EK++PEKEL+RAT EILRRKLKIRDLFQ LD+LCA G P+SLFD++G+I SEDI+CAK
Sbjct: 211 LEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAK 270
Query: 530 CGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVN 589
CGSKDLS DNDIILCDG CDRGFHQYCLEPPL KEDIPPDDEGWLCPGCDCK D +DL+N
Sbjct: 271 CGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLN 330
Query: 590 ELQGTRLFITDNWEKVFPEAAA-----GHNQDPNFGLASDDS--------DDNEYDPDGS 636
+ GT+ ++D+WEK+FPEAAA G N D + D +DNE D DGS
Sbjct: 331 DSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSDDSDDEEYDPDCLNDNENDEDGS 390
Query: 637 ATDEQDEGDESSSDGSSSDDSDFTSTSDEV-----EAPADDKTYLGLSSEDSEDDEYNPD 691
D+ +ES ++ SSD+++FTS SDE+ E K + L S+DSEDD+Y+PD
Sbjct: 391 -----DDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPSDDSEDDDYDPD 445
Query: 692 APELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPL---GHSNGQRYKDG 748
AP DD + SS SD TSD+EDL + + + N + +PL G Q D
Sbjct: 446 APTCDD----DKESSNSDCTSDTEDLETSFKGDET--NQQAEDTPLEDPGRQTSQLQGDA 499
Query: 749 GNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDG 808
+L DG A V +R+ ERLDYKKLYDE Y NVP SSDD+
Sbjct: 500 ---------ILESDVGLDDGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSDDDD----- 545
Query: 809 GPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNIS 868
K + KE S + +G T + K+ N ++T K+ R K D +
Sbjct: 546 -WDKTARMGKEDSESEDEGDTVPL---------KQSSNAEDHTSKKLIR-KSKRADKKDT 594
Query: 869 PAKSHEGCSTPGSRGRRHRTSYR--KLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLT 926
EG G G ++S K + TQ+LY SF+ENQYP++ TKESLAKEL +T
Sbjct: 595 LEMPQEGPGENGGSGEIEKSSSSACKQTDPKTQRLYISFQENQYPDKATKESLAKELQMT 654
Query: 927 FSQVRKWFENTRWSFNHPSSKNAKLANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTG 986
QV WF++ RWS N +K SE ENV+ KTG
Sbjct: 655 VKQVNNWFKHRRWSIN------SKPLVSE----------------------ENVEKLKTG 686
Query: 987 VDDTGCMT--GDVKNNTQECNSI--KPTSQTSRKRDR 1019
+ C T T E S+ KPT+ SRKR R
Sbjct: 687 KEGE-CETSVAGSSKQTMETESVAEKPTNTGSRKRRR 722
>gi|11994474|dbj|BAB02476.1| homeotic protein HAT 3.1 [Arabidopsis thaliana]
Length = 661
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 294/634 (46%), Positives = 368/634 (58%), Gaps = 89/634 (14%)
Query: 413 VNSIGERKQKKRNKIRRKKIVA--DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLS 470
NS + KK+NK K V DEY+RI+ LRY LNRINYEQ+LIDAYS EGWKG S
Sbjct: 89 ANSTPVGRPKKKNKTMNKGQVREDDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSS 148
Query: 471 VEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAK 529
+EK++PEKEL+RAT EILRRKLKIRDLFQ LD+LCA G P+SLFD++G+I SEDI+CAK
Sbjct: 149 LEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAK 208
Query: 530 CGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVN 589
CGSKDLS DNDIILCDG CDRGFHQYCLEPPL KEDIPPDDEGWLCPGCDCK D +DL+N
Sbjct: 209 CGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLN 268
Query: 590 ELQGTRLFITDNWEKVFPEAAA-----GHNQDPNFGLASDDS--------DDNEYDPDGS 636
+ GT+ ++D+WEK+FPEAAA G N D + D +DNE D DGS
Sbjct: 269 DSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSDDSDDEEYDPDCLNDNENDEDGS 328
Query: 637 ATDEQDEGDESSSDGSSSDDSDFTSTSDEV-----EAPADDKTYLGLSSEDSEDDEYNPD 691
D+ +ES ++ SSD+++FTS SDE+ E K + L S+DSEDD+Y+PD
Sbjct: 329 -----DDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPSDDSEDDDYDPD 383
Query: 692 APELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNN 751
AP DD + SS SD TSD+EDL + + + N + +PL Q + G+
Sbjct: 384 APTCDD----DKESSNSDCTSDTEDLETSFKGDET--NQQAEDTPLEDPGRQTSQLQGD- 436
Query: 752 ESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPR 811
+L DG A V +R+ ERLDYKKLYDE Y NVP SSDD+
Sbjct: 437 -----AILESDVGLDDGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSDDDD------WD 485
Query: 812 KRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAK 871
K + KE S + +G T + K+ N ++T K+ R K D +
Sbjct: 486 KTARMGKEDSESEDEGDTVPL---------KQSSNAEDHTSKKLIR-KSKRADKKDTLEM 535
Query: 872 SHEGCSTPGSRGRRHRTSYR--KLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQ 929
EG G G ++S K + TQ+LY SF+ENQYP++ TKESLAKEL +T Q
Sbjct: 536 PQEGPGENGGSGEIEKSSSSACKQTDPKTQRLYISFQENQYPDKATKESLAKELQMTVKQ 595
Query: 930 VRKWFENTRWSFNHPSSKNAKLANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDD 989
V WF++ RWS N +K SE ENV+ KTG +
Sbjct: 596 VNNWFKHRRWSIN------SKPLVSE----------------------ENVEKLKTGKEG 627
Query: 990 TGCMT--GDVKNNTQECNSI--KPTSQTSRKRDR 1019
C T T E S+ KPT+ SRKR R
Sbjct: 628 E-CETSVAGSSKQTMETESVAEKPTNTGSRKRRR 660
>gi|115467230|ref|NP_001057214.1| Os06g0229300 [Oryza sativa Japonica Group]
gi|22830607|dbj|BAC15621.1| PHD-finger family homeodomain protein [Oryza sativa Japonica Group]
gi|51535397|dbj|BAD37267.1| PHD-finger family homeodomain protein [Oryza sativa Japonica Group]
gi|113595254|dbj|BAF19128.1| Os06g0229300 [Oryza sativa Japonica Group]
gi|215768196|dbj|BAH00425.1| unnamed protein product [Oryza sativa Japonica Group]
gi|284431756|gb|ADB84619.1| PHD-finger family homeodomain protein [Oryza sativa Japonica Group]
Length = 792
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 263/591 (44%), Positives = 341/591 (57%), Gaps = 65/591 (10%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKI-----------RRKKI------- 432
RV + R +P+ P S L S E+K K N++ +++K+
Sbjct: 94 RVAKKRKRSKPLRPAPSRVL---RSTSEKKNKAHNELLNDGAGVQPAEKKRKVGRPPKGG 150
Query: 433 -VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
D+Y IR +RY+LNR+NYEQ+LI AY+SEGWKG S+EK++PEKEL+RA EILR K
Sbjct: 151 TPKDDYLMIRKRVRYVLNRMNYEQSLIQAYASEGWKGQSLEKIRPEKELERAKVEILRCK 210
Query: 492 LKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
+IR+ F+ LDSL + G +S+FDS G+I SEDI+CA CGSKD++ NDIILCDG CDR
Sbjct: 211 SRIREAFRNLDSLLSEGKLDESMFDSAGEISSEDIFCAACGSKDVTLKNDIILCDGICDR 270
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
GFHQYCL PPLL EDIP DEGWLCP CDCK+DCID++NELQG +L I D+WEKVFPEAA
Sbjct: 271 GFHQYCLNPPLLAEDIPQGDEGWLCPACDCKIDCIDVLNELQGVKLSIHDSWEKVFPEAA 330
Query: 611 A---GHNQDPNFGLASDDSDDNEYDP---DGSATDEQ----DEGDESSSDGSSSDDSDFT 660
+ G Q L SDDS DN+YDP G DE+ ++G E SS + +
Sbjct: 331 SFLNGSKQIDASDLPSDDSADNDYDPTLAQGHKVDEEKSSGEDGGEGLDSDDSSSEDSES 390
Query: 661 STSDEVEAPADDKTY--LGLSSEDSEDDEYNPDAPELDDKVTQESSSSG---SDFTSDSE 715
S ++ + + +T LGL SEDSED +++P P+ D + ES+S SDFTSDS+
Sbjct: 391 SEKEKSKTSQNGRTVDDLGLPSEDSEDGDFDPAGPDSDKEQNDESNSDQSDESDFTSDSD 450
Query: 716 DLAAVLEDNRSSGNDE--GAAS----PLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGA 769
D A E +S G DE G +S + ++G + DG N +N + QD
Sbjct: 451 DFCA--EIAKSCGQDEISGPSSSQIRTVDRTDGSGF-DGEPNAENSNLAFMETELEQDMV 507
Query: 770 APVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSA--SPDG 827
P+ KR ERLDYKKLY+E YG DSSDDE W + P K E S SP G
Sbjct: 508 LPISSKRQVERLDYKKLYNEAYGKASSDSSDDEEWYGNSTPEKGNLEDSETDSLAESPQG 567
Query: 828 KTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHR 887
RR A + + E+TP+ RP + D S+ ST
Sbjct: 568 GKGFSRR------APVRYHNNEHTPQNV-RPGGSVSDQQTEVLCSNSNGST--------- 611
Query: 888 TSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
R G + QKL FKE+ YP+R TKE+LA+ELGLTF+QV KWF +TR
Sbjct: 612 AKNRHFGPAINQKLKAHFKEDPYPSRATKENLAQELGLTFNQVTKWFSSTR 662
>gi|218197844|gb|EEC80271.1| hypothetical protein OsI_22252 [Oryza sativa Indica Group]
Length = 830
Score = 411 bits (1057), Expect = e-111, Method: Compositional matrix adjust.
Identities = 263/591 (44%), Positives = 341/591 (57%), Gaps = 65/591 (10%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKI-----------RRKKI------- 432
RV + R +P+ P S L S E+K K N++ +++K+
Sbjct: 132 RVAKKRKRSKPLRPAPSRVL---RSTSEKKNKAHNELLNDGAGVQPAEKKRKVGRPPKGG 188
Query: 433 -VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
D+Y IR +RY+LNR+NYEQ+LI AY+SEGWKG S+EK++PEKEL+RA EILR K
Sbjct: 189 TPKDDYLMIRKRVRYVLNRMNYEQSLIQAYASEGWKGQSLEKIRPEKELERAKVEILRCK 248
Query: 492 LKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
+IR+ F+ LDSL + G +S+FDS G+I SEDI+CA CGSKD++ NDIILCDG CDR
Sbjct: 249 SRIREAFRNLDSLLSEGKLDESMFDSAGEISSEDIFCAACGSKDVTLKNDIILCDGICDR 308
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
GFHQYCL PPLL EDIP DEGWLCP CDCK+DCID++NELQG +L I D+WEKVFPEAA
Sbjct: 309 GFHQYCLNPPLLAEDIPQGDEGWLCPACDCKIDCIDVLNELQGVKLSIHDSWEKVFPEAA 368
Query: 611 A---GHNQDPNFGLASDDSDDNEYDP---DGSATDEQ----DEGDESSSDGSSSDDSDFT 660
+ G Q L SDDS DN+YDP G DE+ ++G E SS + +
Sbjct: 369 SFLNGSKQIDASDLPSDDSADNDYDPTLAQGHKVDEEKSSGEDGGEGLDSDDSSSEDSES 428
Query: 661 STSDEVEAPADDKTY--LGLSSEDSEDDEYNPDAPELDDKVTQESSSSG---SDFTSDSE 715
S ++ + + +T LGL SEDSED +++P P+ D + ES+S SDFTSDS+
Sbjct: 429 SEKEKSKTSQNGRTVDDLGLPSEDSEDGDFDPAGPDSDKEQNDESNSDQSDESDFTSDSD 488
Query: 716 DLAAVLEDNRSSGNDE--GAAS----PLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGA 769
D A E +S G DE G +S + ++G + DG N +N + QD
Sbjct: 489 DFCA--EIAKSCGQDEISGPSSSQIRTVDRTDGSGF-DGEPNAENSNLAFMETELEQDMV 545
Query: 770 APVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSA--SPDG 827
P+ KR ERLDYKKLY+E YG DSSDDE W + P K E S SP G
Sbjct: 546 LPISSKRQVERLDYKKLYNEAYGKASSDSSDDEEWYGNSTPEKGNLEDSETDSLAESPQG 605
Query: 828 KTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHR 887
RR A + + E+TP+ RP + D S+ ST
Sbjct: 606 GKGFSRR------APVRYHNNEHTPQNV-RPGGSVSDQQTEVLCSNSNGST--------- 649
Query: 888 TSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
R G + QKL FKE+ YP+R TKE+LA+ELGLTF+QV KWF +TR
Sbjct: 650 AKNRHFGPAINQKLKAHFKEDPYPSRATKENLAQELGLTFNQVTKWFSSTR 700
>gi|22830609|dbj|BAC15622.1| PHD-finger family homeodomain protein [Oryza sativa Japonica Group]
Length = 698
Score = 411 bits (1056), Expect = e-111, Method: Compositional matrix adjust.
Identities = 263/591 (44%), Positives = 341/591 (57%), Gaps = 65/591 (10%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKI-----------RRKKI------- 432
RV + R +P+ P S L S E+K K N++ +++K+
Sbjct: 94 RVAKKRKRSKPLRPAPSRVL---RSTSEKKNKAHNELLNDGAGVQPAEKKRKVGRPPKGG 150
Query: 433 -VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
D+Y IR +RY+LNR+NYEQ+LI AY+SEGWKG S+EK++PEKEL+RA EILR K
Sbjct: 151 TPKDDYLMIRKRVRYVLNRMNYEQSLIQAYASEGWKGQSLEKIRPEKELERAKVEILRCK 210
Query: 492 LKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
+IR+ F+ LDSL + G +S+FDS G+I SEDI+CA CGSKD++ NDIILCDG CDR
Sbjct: 211 SRIREAFRNLDSLLSEGKLDESMFDSAGEISSEDIFCAACGSKDVTLKNDIILCDGICDR 270
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
GFHQYCL PPLL EDIP DEGWLCP CDCK+DCID++NELQG +L I D+WEKVFPEAA
Sbjct: 271 GFHQYCLNPPLLAEDIPQGDEGWLCPACDCKIDCIDVLNELQGVKLSIHDSWEKVFPEAA 330
Query: 611 A---GHNQDPNFGLASDDSDDNEYDP---DGSATDEQ----DEGDESSSDGSSSDDSDFT 660
+ G Q L SDDS DN+YDP G DE+ ++G E SS + +
Sbjct: 331 SFLNGSKQIDASDLPSDDSADNDYDPTLAQGHKVDEEKSSGEDGGEGLDSDDSSSEDSES 390
Query: 661 STSDEVEAPADDKTY--LGLSSEDSEDDEYNPDAPELDDKVTQESSSSG---SDFTSDSE 715
S ++ + + +T LGL SEDSED +++P P+ D + ES+S SDFTSDS+
Sbjct: 391 SEKEKSKTSQNGRTVDDLGLPSEDSEDGDFDPAGPDSDKEQNDESNSDQSDESDFTSDSD 450
Query: 716 DLAAVLEDNRSSGNDE--GAAS----PLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGA 769
D A E +S G DE G +S + ++G + DG N +N + QD
Sbjct: 451 DFCA--EIAKSCGQDEISGPSSSQIRTVDRTDGSGF-DGEPNAENSNLAFMETELEQDMV 507
Query: 770 APVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSA--SPDG 827
P+ KR ERLDYKKLY+E YG DSSDDE W + P K E S SP G
Sbjct: 508 LPISSKRQVERLDYKKLYNEAYGKASSDSSDDEEWYGNSTPEKGNLEDSETDSLAESPQG 567
Query: 828 KTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHR 887
RR A + + E+TP+ RP + D S+ ST
Sbjct: 568 GKGFSRR------APVRYHNNEHTPQNV-RPGGSVSDQQTEVLCSNSNGST--------- 611
Query: 888 TSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
R G + QKL FKE+ YP+R TKE+LA+ELGLTF+QV KWF +TR
Sbjct: 612 AKNRHFGPAINQKLKAHFKEDPYPSRATKENLAQELGLTFNQVTKWFSSTR 662
>gi|162462826|ref|NP_001105918.1| HOX1B protein [Zea mays]
gi|1648931|emb|CAA63156.1| HOX1B protein [Zea mays]
Length = 692
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 293/639 (45%), Positives = 375/639 (58%), Gaps = 65/639 (10%)
Query: 343 MEKKSCLEQSETPPNY--VANNSACLGRKGKRATKSLKNN-YTVRSLIGSDRVLRSRSGE 399
+E L S+T PN + N +GK+ ++ L YT+RS RVLRS SG
Sbjct: 29 LEHPVLLSASQTVPNNLGIRKNYKRAANRGKKGSQGLTGQAYTLRSSDIDVRVLRSTSGS 88
Query: 400 RPIPPESSINLADVNSIGERKQKKRNKIR-RKKIVADEYSRIRTHLRYLLNRINYEQNLI 458
+ E V G+ K+R + R K DE+S+IR +RY+LNR+NYEQ+LI
Sbjct: 89 KTTSTEH------VQPPGQPAAKRRKRSRASNKNSTDEFSQIRKRVRYILNRMNYEQSLI 142
Query: 459 DAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG-FPKSLFDSE 517
+AY+SEGWK S++K++PEKEL+RA SEILR KL+IR++FQ +DSL + G +SLFDSE
Sbjct: 143 EAYASEGWKNQSLDKIRPEKELERAKSEILRCKLRIREVFQNIDSLLSKGKIDESLFDSE 202
Query: 518 GQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPG 577
G+I EDI+CA CGSK+ + NDIILCDGACDRGFHQ CL PPL EDIP DEGWLCP
Sbjct: 203 GEISCEDIFCATCGSKNATLGNDIILCDGACDRGFHQNCLNPPLRTEDIPMGDEGWLCPA 262
Query: 578 CDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPD 634
CDCK+DCIDL+NELQG+ + I D WEKVFPEAAA G QD F L SDDSDDN++DP+
Sbjct: 263 CDCKLDCIDLINELQGSDISIEDPWEKVFPEAAAMTNGSKQDERFDLPSDDSDDNDFDPN 322
Query: 635 ------GSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTY--LGLSSEDSEDD 686
S + E +E GS SDDSDF + SD++E D K LGLSSEDSEDD
Sbjct: 323 MPEEHVASKEEGSSEEEEDDDGGSDSDDSDFLTCSDDLEPLIDKKKVDDLGLSSEDSEDD 382
Query: 687 EYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYK 746
+Y+P P+ D V ++S+S SDFTSDS+D E +S G+DE ++ PL +
Sbjct: 383 DYDPAGPDSDKDVEKKSNSDESDFTSDSDDFCK--EIKKSGGHDEVSSPPLPDV---KVG 437
Query: 747 DGGNNESLNNELLSIIKP-----GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDD 801
D N + +N S P Q PV +R +ERLDYK+LYDE YG DSSD+
Sbjct: 438 DMEKNTAQSNTTSSADDPMETEIDQSVVLPVSRRRQAERLDYKRLYDEAYGEASSDSSDE 497
Query: 802 ESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK--STKAAKEKLNETENTPKRRGRPK 859
E WS P KS +EG SP GK + +T+ KE L+ + +
Sbjct: 498 EEWSGKNTP---IKSNEEGEVGSPAGKGSRVAHHNELTTQNTKESLHSLHGSVDEK-HGD 553
Query: 860 LNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919
L + SNI K H G ++QKL+ FK YP+R+ KESL
Sbjct: 554 LTSNGSNIKDRKGH-------------------FGPVISQKLHEHFKTQPYPSRSLKESL 594
Query: 920 AKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKGT 958
A+ELGLTF QV +WFEN R A+LA+S KG
Sbjct: 595 AEELGLTFHQVNRWFENRR--------HFARLASSRKGI 625
>gi|1346791|sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodomain protein; Short=PRHP
gi|666128|gb|AAA62237.1| homeodomain protein [Petroselinum crispum]
Length = 1088
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 266/553 (48%), Positives = 332/553 (60%), Gaps = 72/553 (13%)
Query: 434 ADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLK 493
DE+ RIRTHLRYLL+RI YE+N +DAYS EGWKG S++K+KPEKEL+RA +EI RKLK
Sbjct: 487 VDEFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLK 546
Query: 494 IRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGF 552
IRDLFQRLD + G P+ LFDS G+IDSEDI+CAKCGSKD++ NDIILCDGACDRGF
Sbjct: 547 IRDLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGF 606
Query: 553 HQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVF----PE 608
HQ+CL+PPLLKE IPPDDEGWLCPGC+CK+DCI L+N+ Q T + + D+WEKVF
Sbjct: 607 HQFCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSWEKVFAEEAAA 666
Query: 609 AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEA 668
AA+G N D N GL SDDS+D++YDP G DE+ +GD+ SS+D+SD+ S SD+++
Sbjct: 667 AASGKNLDDNSGLPSDDSEDDDYDPGGPDLDEKVQGDD-----SSTDESDYQSESDDMQV 721
Query: 669 PADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSG 728
K GL S+DSEDDEY+P + D++ ++SS SDFTSDSED V +D + +G
Sbjct: 722 -IRQKNSRGLPSDDSEDDEYDPSGL-VTDQMYKDSSC--SDFTSDSEDFTGVFDDYKDTG 777
Query: 729 NDEG-AASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLY 787
+G AS H NNE P Q AP+Y +R E LDYKKL
Sbjct: 778 KAQGPLASTPDHVR--------NNEE------GCGHPEQGDTAPLYPRRQVESLDYKKLN 823
Query: 788 D--------------------------ETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGS 821
D E YGN DSS DE + P K S KE
Sbjct: 824 DIEFSKMCDILDILSSQLDVIICTGNQEEYGNTSSDSS-DEDYMVTSSPDK-NNSDKEA- 880
Query: 822 SASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTP-- 879
T + R R+S ++ E+T RR K E ++ ++S E + P
Sbjct: 881 -------TAMERGRESGDLELDQ-KARESTHNRRYIKKFAVEGTDSFLSRSCEDSAAPVA 932
Query: 880 GSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
GS+ TS GE TQ+L SFKENQYP R KESLA EL L+ QV WF N RW
Sbjct: 933 GSKS----TSKTLHGEHATQRLLQSFKENQYPQRAVKESLAAELALSVRQVSNWFNNRRW 988
Query: 940 SFNHPSSKNAKLA 952
SF H S + +A
Sbjct: 989 SFRHSSRIGSDVA 1001
>gi|413946205|gb|AFW78854.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 693
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 293/639 (45%), Positives = 375/639 (58%), Gaps = 65/639 (10%)
Query: 343 MEKKSCLEQSETPPNY--VANNSACLGRKGKRATKSLKNN-YTVRSLIGSDRVLRSRSGE 399
+E L S+T PN + N +GK+ ++ L YT+RS RVLRS SG
Sbjct: 30 LEHPVLLSASQTVPNNLGIRKNYKRAANRGKKGSEGLTGQAYTLRSSDIDVRVLRSTSGS 89
Query: 400 RPIPPESSINLADVNSIGERKQKKRNKIR-RKKIVADEYSRIRTHLRYLLNRINYEQNLI 458
+ E V G+ K+R + R K DE+S+IR +RY+LNR+NYEQ+LI
Sbjct: 90 KTTSTEH------VQPPGQPAAKRRKRSRASNKNSTDEFSQIRKRVRYILNRMNYEQSLI 143
Query: 459 DAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG-FPKSLFDSE 517
+AY+SEGWK S++K +PEKEL+RA SEILR KL+IR++FQ +DSL + G +SLFDSE
Sbjct: 144 EAYASEGWKNQSLDKTRPEKELERAKSEILRCKLRIREVFQNIDSLLSKGKIDESLFDSE 203
Query: 518 GQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPG 577
G+I EDI+CA CGSK+ + NDIILCDGACDRGFHQ CL PPL EDIP DEGWLCP
Sbjct: 204 GEISCEDIFCATCGSKNATLGNDIILCDGACDRGFHQNCLNPPLRTEDIPMGDEGWLCPA 263
Query: 578 CDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPD 634
CDCK+DCIDL+NELQG+ + I D WEKVFPEAAA G QD F L SDDSDDN++DP+
Sbjct: 264 CDCKLDCIDLINELQGSDISIEDPWEKVFPEAAAMTNGSKQDEAFDLPSDDSDDNDFDPN 323
Query: 635 ------GSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTY--LGLSSEDSEDD 686
S +E E +E GS SDDSDF + SD++E D K LGL SEDSEDD
Sbjct: 324 MPEEHVASKEEESSEEEEDDDGGSDSDDSDFLTCSDDLEPLIDKKKVDDLGLPSEDSEDD 383
Query: 687 EYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYK 746
+Y+P P+ D V ++S+S SDFTSDS+D E +S G+DE ++ PL +
Sbjct: 384 DYDPAGPDSDKDVEKKSNSDESDFTSDSDDFCK--EIKKSGGHDEVSSPPLPDV---KVG 438
Query: 747 DGGNNESLNNELLSIIKP-----GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDD 801
D N + +N + S P Q PV +R +ERLDYK+LYDE YG DSSD+
Sbjct: 439 DMEKNTAQSNTISSADDPMETEIDQSVVLPVSRRRQAERLDYKRLYDEAYGEASSDSSDE 498
Query: 802 ESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK--STKAAKEKLNETENTPKRRGRPK 859
E WS P KS +EG SP GK + +T+ KE L+ + +
Sbjct: 499 EEWSGKNTP---IKSNEEGEVGSPAGKGSRVAHHNELTTQNTKESLHSLHGSVDEK-HGD 554
Query: 860 LNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919
L + SNI K H G ++QKL+ FK YP+R+ KESL
Sbjct: 555 LTSNGSNIKDRKGH-------------------FGPVISQKLHEHFKTQPYPSRSLKESL 595
Query: 920 AKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKGT 958
A+ELGLTF QV +WFEN R A+LA+S KG
Sbjct: 596 AEELGLTFHQVNRWFENRR--------HFARLASSRKGI 626
>gi|357136423|ref|XP_003569804.1| PREDICTED: homeobox protein HOX1A-like [Brachypodium distachyon]
Length = 861
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 248/570 (43%), Positives = 332/570 (58%), Gaps = 39/570 (6%)
Query: 389 SDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYSRIRTHLRYLL 448
S RVLRS S + P +N RK +KR + +++ +IR +RY+L
Sbjct: 106 STRVLRSTSKKTAKAPIEPVNDNTTFQPAARK-RKRGRPSNAASPKNDHIKIRQRVRYIL 164
Query: 449 NRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-G 507
NR+NYEQ+LI AY+SEGWKG S+EK++PEKELQRA +EILR KL+IR+ FQ +DSL + G
Sbjct: 165 NRMNYEQSLIQAYASEGWKGQSLEKIRPEKELQRAKAEILRCKLRIREAFQNMDSLLSEG 224
Query: 508 GFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIP 567
+SLFDSEG+I SEDI+CA CGSK ++ NDIILCDGAC RGFHQ CL PPLL +DIP
Sbjct: 225 KLEESLFDSEGEISSEDIFCAICGSKHVTLRNDIILCDGACVRGFHQKCLNPPLLAQDIP 284
Query: 568 PDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA---GHNQDPNFGLASD 624
P DEGWLCP CDCK+DCID++NELQG+ L I D+WEKVFPE+A G NQ + L SD
Sbjct: 285 PGDEGWLCPACDCKLDCIDVLNELQGSTLSIHDSWEKVFPESALLANGSNQIGSSDLPSD 344
Query: 625 DSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEA----------PADDKT 674
DS+DN+Y P + DE S+ D DSD + +A + KT
Sbjct: 345 DSEDNDYSPALAEGQMVDENKSSAEDEDEVSDSDDSDFMTSSDASELSMKKRSESKNKKT 404
Query: 675 Y--LGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEG 732
LGL SEDS DD+++P P + +++S SDFTSDS+D A E +S G DE
Sbjct: 405 VDDLGLPSEDSGDDDFDPAGPVSSEDQKAKTNSEESDFTSDSDDFCA--EIAKSCGQDEV 462
Query: 733 A-ASPLGHSNGQRYK--DGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDE 789
+ S H++G D + NN ++ + QD PV +R ERLDY++LYDE
Sbjct: 463 SPPSSSDHADGIEISAVDNQLGQEDNNRIIMETELDQDVVLPVSSRRQVERLDYRRLYDE 522
Query: 790 TYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK-STKAAKEKLNET 848
YG DSSDDE WS D T S + D ++ K ++ +
Sbjct: 523 AYGKEMSDSSDDEEWSGDS-----TSSNGNLDDSGTDSLAGSLQFAKIHSRRPQAGHQNN 577
Query: 849 ENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKEN 908
++TP+ T ++S + E + GS H+ + G + QKL +++
Sbjct: 578 KHTPQ--------TGRCSVSTGQCSEVLYSNGSSSTSHK---KHFGPIINQKLKVHLEKD 626
Query: 909 QYPNRTTKESLAKELGLTFSQVRKWFENTR 938
YP+R TKESLA+ELGLTF+QV +WF + R
Sbjct: 627 PYPSRPTKESLAQELGLTFNQVSRWFSSAR 656
>gi|413946213|gb|AFW78862.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 692
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 291/639 (45%), Positives = 373/639 (58%), Gaps = 65/639 (10%)
Query: 343 MEKKSCLEQSETPPNY--VANNSACLGRKGKRATKSLKNN-YTVRSLIGSDRVLRSRSGE 399
+E L S+T PN + N +GK+ ++ L YT+RS RVLRS SG
Sbjct: 29 LEHPVLLSASQTVPNNLGIRKNYKRAANRGKKGSQGLTGQAYTLRSSDIDVRVLRSTSGS 88
Query: 400 RPIPPESSINLADVNSIGERKQKKRNKIR-RKKIVADEYSRIRTHLRYLLNRINYEQNLI 458
+ E V G+ K+R + R K DE+S+IR +RY+LNR+NYEQ+LI
Sbjct: 89 KTTSTEH------VQPPGQPAAKRRKRSRASNKNSTDEFSQIRKRVRYILNRMNYEQSLI 142
Query: 459 DAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG-FPKSLFDSE 517
+AY+SEGWK S++K++PEKEL+RA SEILR KL+IR++FQ +DSL + G +SLFDSE
Sbjct: 143 EAYASEGWKNQSLDKIRPEKELERAKSEILRCKLRIREVFQNIDSLLSKGKIDESLFDSE 202
Query: 518 GQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPG 577
G+I EDI+CA CGSK+ + NDIILCDGACDRGFHQ CL PPL EDIP DEGWLCP
Sbjct: 203 GEISCEDIFCATCGSKNATLGNDIILCDGACDRGFHQNCLNPPLRTEDIPMGDEGWLCPA 262
Query: 578 CDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHN---QDPNFGLASDDSDDNEYD-- 632
CDCK+DCIDL+NELQG+ + I D WEKVFPEAAA N QD F L SDDSDDN++D
Sbjct: 263 CDCKLDCIDLINELQGSDISIEDPWEKVFPEAAAMANSSKQDEAFDLPSDDSDDNDFDPN 322
Query: 633 -PD---GSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTY--LGLSSEDSEDD 686
P+ S + E +E GS SDDSDF + SD++E D K LGL SEDSEDD
Sbjct: 323 MPEEHVASKEEGSSEEEEDDDGGSDSDDSDFLTCSDDLEPLIDKKNVDDLGLPSEDSEDD 382
Query: 687 EYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYK 746
+Y+P P+ D V ++S+S SDF SDS+D E +S G+DE ++ PL +
Sbjct: 383 DYDPAGPDSDKDVEKKSNSDESDFMSDSDDFCK--EIKKSGGHDEVSSPPLPDV---KVG 437
Query: 747 DGGNNESLNNELLSIIKP-----GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDD 801
D N + +N S P Q PV +R +ERLDYK+LYDE YG DSSD+
Sbjct: 438 DMEKNTAQSNTTSSADDPMETEIDQSVVLPVSRRRQAERLDYKRLYDEAYGEASSDSSDE 497
Query: 802 ESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK--STKAAKEKLNETENTPKRRGRPK 859
E WS P KS +EG SP GK + +T+ KE L+ + +
Sbjct: 498 EEWSGKNTP---IKSNEEGEVGSPAGKGSRVAHHNELTTQNTKESLHSLHGSVDEK-HGD 553
Query: 860 LNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919
L + SNI K H G ++QKL+ FK YP+R+ KESL
Sbjct: 554 LTSNGSNIKDRKGH-------------------FGPVISQKLHEHFKTQPYPSRSLKESL 594
Query: 920 AKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKGT 958
A+ELGLTF QV +WFEN R A+LA+S KG
Sbjct: 595 AEELGLTFHQVNRWFENRR--------HFARLASSRKGI 625
>gi|162464040|ref|NP_001105549.1| LOC542536 [Zea mays]
gi|1143707|emb|CAA61910.1| Hox2b [Zea mays]
Length = 1539
Score = 388 bits (996), Expect = e-104, Method: Compositional matrix adjust.
Identities = 241/557 (43%), Positives = 330/557 (59%), Gaps = 53/557 (9%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRN-KIRRKKIVADEYSRIRTHLRYLLN 449
RVLRS SG++ E+ I + ++ E KRN K + +Y +I +RY+LN
Sbjct: 341 RVLRSASGKKN---EACIGHVNDSTSAEPTVTKRNRKPSMDRSPKKDYLKICQRVRYILN 397
Query: 450 RINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG- 508
R+NY+Q I AY+SEGWKG S+EK++PEKEL+RA +EIL+ KL+IR+ F+ +DSL + G
Sbjct: 398 RMNYQQTFIQAYASEGWKGQSLEKIRPEKELERAKAEILQCKLRIREAFRNMDSLLSKGK 457
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
+SLFDS G+I SEDI+CA CGSKD++ NDIILCDGACDRGFHQ CL PPLL EDIPP
Sbjct: 458 LEESLFDSAGEISSEDIFCAVCGSKDVTLQNDIILCDGACDRGFHQNCLNPPLLTEDIPP 517
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
D+ WLCP C CK D ID +NELQG++L I D+WEKVFPEAA+ N S D
Sbjct: 518 GDQRWLCPACVCKADSIDALNELQGSKLSIHDSWEKVFPEAASIAN-------GSKQVDT 570
Query: 629 NEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEY 688
++ PD + + + +G D+ ++ D + ADD L LSSEDS D ++
Sbjct: 571 SDLLPD----HIKHSDNPALVEGLMVDEVRLSAEDD---SKADD---LRLSSEDSGDGDF 620
Query: 689 NPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYK-- 746
+P P+ + +S SDFTSDS+D A E +S G DE +ASPL + Y+
Sbjct: 621 DPSGPDSSEDQKDGLNSEESDFTSDSDDFCA--EIAKSCGQDEVSASPLSNVINHTYRMK 678
Query: 747 ----DGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDE 802
+ +NE ++ + ++ GQD PV +R ERLDYKKLYDE YG +SSDD+
Sbjct: 679 LRASNNRSNEENHDHVFMDMELGQDMVLPVSSRRQVERLDYKKLYDEAYGKESSNSSDDK 738
Query: 803 SWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLN- 861
WS K EGS + P +R S +A E+ N E+TP+R +L+
Sbjct: 739 EWSG--------KELLEGSETDSLSERPHPVKRCSRRAQAEQQN-NEHTPQRE---RLHG 786
Query: 862 TEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAK 921
+E + G S+ G RK G TQKL F+++ YP+R TKE++++
Sbjct: 787 SESEQKTGILRSNGSSSTG----------RKFGPVATQKLKVHFEKDPYPSRETKENISE 836
Query: 922 ELGLTFSQVRKWFENTR 938
ELGLTF+QV +WF +TR
Sbjct: 837 ELGLTFNQVSRWFSSTR 853
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 4/76 (5%)
Query: 893 LGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLA 952
G VTQKL F+++ YP R TKE LA+ELGLTF+Q+ KWF T +KN K
Sbjct: 1046 FGPIVTQKLKEHFEKDPYPCRATKEGLAQELGLTFNQISKWFSATHHYSRDAVAKNQKYP 1105
Query: 953 NSEKGTCTPQSNKNTV 968
G T ++N +T+
Sbjct: 1106 ----GENTTENNSSTI 1117
>gi|297834862|ref|XP_002885313.1| hypothetical protein ARALYDRAFT_479466 [Arabidopsis lyrata subsp.
lyrata]
gi|297331153|gb|EFH61572.1| hypothetical protein ARALYDRAFT_479466 [Arabidopsis lyrata subsp.
lyrata]
Length = 724
Score = 384 bits (987), Expect = e-103, Method: Compositional matrix adjust.
Identities = 277/599 (46%), Positives = 362/599 (60%), Gaps = 70/599 (11%)
Query: 413 VNSIGERKQKKRNKIRRKKIVA--DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLS 470
NS + KKRNK +K V DEY+RI+ LRY LNRI+YEQNLIDAYS EGWKG S
Sbjct: 146 ANSTPVGRPKKRNKTMKKGEVREDDEYTRIKKKLRYFLNRISYEQNLIDAYSLEGWKGSS 205
Query: 471 VEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAK 529
+EK++PEKEL+RAT EILRRKLKIRDLFQ LD+LCA G P+SLFDS+G+I SEDI+CAK
Sbjct: 206 LEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESLFDSDGEISSEDIFCAK 265
Query: 530 CGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVN 589
CGSKDLS DNDIILCDG CDRGFHQYCLEPPL KEDIPPDDE WLCPGCDCK D +DL+N
Sbjct: 266 CGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDESWLCPGCDCKDDSLDLLN 325
Query: 590 ELQGTRLFITDNWEKVFPE-----AAAGHNQDPNFGLASDDS--------DDNEYDPDGS 636
+ GT+L ++D+WEK+FPE A G N D + D +DNE D DGS
Sbjct: 326 DSLGTKLLVSDSWEKIFPEAAAALAGGGQNLDCDLPSDDSDDEEYDPDGLNDNENDEDGS 385
Query: 637 ATDEQDEGDESSSDGSSSDDSDFTSTSDEV-----EAPADDKTYLGLSSEDSEDDEYNPD 691
+E+ S ++ SSD+S+ TS SD++ E K + L S+DSEDD+Y+PD
Sbjct: 386 DDNEE-----SENEDDSSDESESTSASDKMIESFKEGKDIMKDIMALPSDDSEDDDYDPD 440
Query: 692 APELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNN 751
AP D ++ SS SD TSD EDL + + S N + +P ++D G
Sbjct: 441 APTCD----EDKESSNSDCTSDYEDLETSFKGDES--NQQAEDTP--------FEDPGRQ 486
Query: 752 ESL--NNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGG 809
S + +L DG A V +R+ ERLDYKKLY+E Y NVP SSDD+ W
Sbjct: 487 TSQLPCDAILESDVGLDDGPAGVSRRRNVERLDYKKLYNEEYDNVPTSSSDDDDWD---- 542
Query: 810 PRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISP 869
K + KE S + +G ++ ++S+ A + T P+R+ + + + +D+ P
Sbjct: 543 --KTARMGKEDSESEDEGD--IVPLKQSSNAE----DHTSKKPRRKSK-RTDKKDTLEVP 593
Query: 870 AKSHEGCSTPGSRG-----RRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELG 924
+ PG G + +S K + TQ+LY SF+EN+YP++ TKESLAKEL
Sbjct: 594 QE------CPGENGGSGEIEKSSSSANKQTDPKTQRLYISFQENRYPDKATKESLAKELQ 647
Query: 925 LTFSQVRKWFENTRWSFNHP---SSKNA-KLANSEKGTCTPQSNKNTVGRVSNCNGAEN 979
+T +QV WF+N R S N S +N KL ++G C ++V + + AEN
Sbjct: 648 MTVTQVNNWFKNRRSSINSKPLVSEENVEKLKTGKEGECETSVAGSSVQTMETESVAEN 706
>gi|5690096|emb|CAB51951.1| homeodomain transcription factor [Zea mays]
Length = 719
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 269/639 (42%), Positives = 355/639 (55%), Gaps = 69/639 (10%)
Query: 343 MEKKSCLEQSETPPNY--VANNSACLGRKGKRATKSLKNN-YTVRSLIGSDRVLRSRSGE 399
+E L S+T PN + N +GK+ ++ L YT+ S RVLRS S
Sbjct: 29 LEHSVLLSTSQTMPNNLGIRKNYKRAANRGKKGSQGLTGQAYTLMSSNSDVRVLRSTSSS 88
Query: 400 RPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLID 459
+ E A V +R++ R K DE+S+IR +RY+LNR+NYEQ+LI+
Sbjct: 89 KTTSTEHV--QAPVQPAAKRRKMSRAS---NKSSTDEFSQIRKQVRYILNRMNYEQSLIE 143
Query: 460 AYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG-FPKSLFDSEG 518
AY+SEGWK S++K++PEKEL+RA SEILR KL+IR++F+ +DSL + G ++LFDSEG
Sbjct: 144 AYASEGWKNQSLDKIRPEKELERAKSEILRCKLRIREVFRNIDSLLSKGKIDETLFDSEG 203
Query: 519 QIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+I EDI+C+ CGS D + NDIILCDGACDRGFHQ CL PPL EDIP DEGWLCP C
Sbjct: 204 EISCEDIFCSTCGSNDATLGNDIILCDGACDRGFHQNCLNPPLRTEDIPMGDEGWLCPAC 263
Query: 579 DCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDDNEYDPDGSAT 638
DCK+DCIDL+NEL G+ + I D+WEKVFP+AAA N SDD++ +
Sbjct: 264 DCKIDCIDLINELHGSNISIEDSWEKVFPDAAAMANDSKQDDAFDLPSDDSDDNDFDPNM 323
Query: 639 DEQ-------DEGDESSSDGSSSDDSDFTSTSDEVEAPADDKT-YLGLSSEDSEDDEYNP 690
E+ + +E GS SDDSDF + SD+ E D K L L SEDSEDD+Y+P
Sbjct: 324 PEEHVVGKDEESSEEDEDGGSDSDDSDFLTCSDDSEPLIDKKVDDLRLPSEDSEDDDYDP 383
Query: 691 DAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPL---------GHSN 741
P+ D V ++SSS SDFTSDS+D + SG+DE +SPL
Sbjct: 384 AGPDSDKDVEKKSSSDESDFTSDSDDFCKEIS---KSGHDE-VSSPLLPDAKVGDMEKIT 439
Query: 742 GQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDD 801
Q ++ + E+ Q P +R +ERLDYKKLYDE YG DSSDD
Sbjct: 440 AQAKTTSSADDPMETEI------DQGVVLPDSRRRQAERLDYKKLYDEAYGEASSDSSDD 493
Query: 802 ESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK---STKAAKEKLNETENTPKRRGRP 858
E WS P KS +EG + SP GK + +T++ K+ L+ + +
Sbjct: 494 EEWSGKNTP--IIKSNEEGEANSPAGKGSRVVHHNDELTTQSTKKSLHSIHGSVDEKP-G 550
Query: 859 KLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKES 918
L + SN + K H G + QKL+ FK YP+R+ KES
Sbjct: 551 DLTSNGSNSTARKGH-------------------FGPVINQKLHEHFKTQPYPSRSVKES 591
Query: 919 LAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKG 957
LA+ELGLTF QV KWFE R S A++A+S KG
Sbjct: 592 LAEELGLTFRQVNKWFETRRHS--------ARVASSRKG 622
>gi|162460529|ref|NP_001105447.1| homeobox protein HOX1A [Zea mays]
gi|1170434|sp|P46605.1|HOX1A_MAIZE RecName: Full=Homeobox protein HOX1A
gi|22331|emb|CAA47859.1| Zmhox1a homeobox protein [Zea mays]
Length = 719
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 269/639 (42%), Positives = 355/639 (55%), Gaps = 69/639 (10%)
Query: 343 MEKKSCLEQSETPPNY--VANNSACLGRKGKRATKSLKNN-YTVRSLIGSDRVLRSRSGE 399
+E L S+T PN + N +GK+ ++ L YT+ S RVLRS S
Sbjct: 29 LEHSVLLSTSQTMPNNLGIRKNYKRAANRGKKGSQGLTGQAYTLMSSNSDVRVLRSTSSS 88
Query: 400 RPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLID 459
+ E A V +R++ R K DE+S+IR +RY+LNR+NYEQ+LI+
Sbjct: 89 KTTSTEHV--QAPVQPAAKRRKMSRAS---NKSSTDEFSQIRKRVRYILNRMNYEQSLIE 143
Query: 460 AYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG-FPKSLFDSEG 518
AY+SEGWK S++K++PEKEL+RA SEILR KL+IR++F+ +DSL + G ++LFDSEG
Sbjct: 144 AYASEGWKNQSLDKIRPEKELERAKSEILRCKLRIREVFRNIDSLLSKGKIDETLFDSEG 203
Query: 519 QIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+I EDI+C+ CGS D + NDIILCDGACDRGFHQ CL PPL EDIP DEGWLCP C
Sbjct: 204 EISCEDIFCSTCGSNDATLGNDIILCDGACDRGFHQNCLNPPLRTEDIPMGDEGWLCPAC 263
Query: 579 DCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDDNEYDPDGSAT 638
DCK+DCIDL+NEL G+ + I D+WEKVFP+AAA N SDD++ +
Sbjct: 264 DCKIDCIDLINELHGSNISIEDSWEKVFPDAAAMANDSKQDDAFDLPSDDSDDNDFDPNM 323
Query: 639 DEQ-------DEGDESSSDGSSSDDSDFTSTSDEVEAPADDKT-YLGLSSEDSEDDEYNP 690
E+ + +E GS SDDSDF + SD+ E D K L L SEDSEDD+Y+P
Sbjct: 324 PEEHVVGKDEESSEEDEDGGSDSDDSDFLTCSDDSEPLIDKKVDDLRLPSEDSEDDDYDP 383
Query: 691 DAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPL---------GHSN 741
P+ D V ++SSS SDFTSDS+D + SG+DE +SPL
Sbjct: 384 AGPDSDKDVEKKSSSDESDFTSDSDDFCKEIS---KSGHDE-VSSPLLPDAKVGDMEKIT 439
Query: 742 GQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDD 801
Q ++ + E+ Q P +R +ERLDYKKLYDE YG DSSDD
Sbjct: 440 AQAKTTSSADDPMETEI------DQGVVLPDSRRRQAERLDYKKLYDEAYGEASSDSSDD 493
Query: 802 ESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK---STKAAKEKLNETENTPKRRGRP 858
E WS P KS +EG + SP GK + +T++ K+ L+ + +
Sbjct: 494 EEWSGKNTP--IIKSNEEGEANSPAGKGSRVVHHNDELTTQSTKKSLHSIHGSVDEKP-G 550
Query: 859 KLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKES 918
L + SN + K H G + QKL+ FK YP+R+ KES
Sbjct: 551 DLTSNGSNSTARKGH-------------------FGPVINQKLHEHFKTQPYPSRSVKES 591
Query: 919 LAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSEKG 957
LA+ELGLTF QV KWFE R S A++A+S KG
Sbjct: 592 LAEELGLTFRQVNKWFETRRHS--------ARVASSRKG 622
>gi|242058675|ref|XP_002458483.1| hypothetical protein SORBIDRAFT_03g034510 [Sorghum bicolor]
gi|241930458|gb|EES03603.1| hypothetical protein SORBIDRAFT_03g034510 [Sorghum bicolor]
Length = 1582
Score = 355 bits (911), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 227/511 (44%), Positives = 299/511 (58%), Gaps = 58/511 (11%)
Query: 436 EYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIR 495
+Y +I +RY+LNR+NY+Q I AY+SEGWKG SVEK++PEKEL+RA EIL+ KL+IR
Sbjct: 386 DYLKICQRVRYILNRMNYQQTFIQAYASEGWKGQSVEKIRPEKELERAKEEILQCKLRIR 445
Query: 496 DLFQRLDSLCAGG-FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQ 554
+ F+ +DSL + G +SLFDS G+I SEDI+CA CGSKD++ NDIILCDGACDRGFHQ
Sbjct: 446 EAFRNMDSLLSKGKLEESLFDSAGEISSEDIFCAVCGSKDVTLQNDIILCDGACDRGFHQ 505
Query: 555 YCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHN 614
CL PPLL EDIPP DEGWLCP C CK D ID +NELQG++L I D+WEKVFPEAA+ N
Sbjct: 506 NCLNPPLLTEDIPPGDEGWLCPACVCKADSIDALNELQGSKLSIHDSWEKVFPEAASIAN 565
Query: 615 QDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKT 674
S D ++ PD +D + + ++G ++ F++ D ADD
Sbjct: 566 -------GSKQVDASDLLPD----HIKDSDNLALAEGHMVNEVRFSAEDD---TKADD-- 609
Query: 675 YLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAA 734
LGL SEDS D +++P P+ + +S SDFTSDS+D A E +S G DE A
Sbjct: 610 -LGLPSEDSGDGDFDPAGPDSSEDQKDGLNSEESDFTSDSDDFCA--EIAKSCGQDEVLA 666
Query: 735 SPLGH----SNGQRYKDGGNN---ESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLY 787
SPL + ++ + + GN E+ NN + D PV +R E LDYKKLY
Sbjct: 667 SPLSNVINRTDRMKLRAAGNRSNEENHNNAFM-------DMELPVSSRRQVEPLDYKKLY 719
Query: 788 DETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNE 847
DE Y +SSD+E K EGS G+ +R S KA + N
Sbjct: 720 DEAYEEESSNSSDEEE--------WSGKELLEGSETDSLGERLRPVKRCSRKAPAGQQN- 770
Query: 848 TENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKE 907
E+TP+ K E + G S+ G RK G VTQKL F++
Sbjct: 771 IEHTPQ-----KFGPESEQKTEILRSNGSSSTG----------RKFGPVVTQKLKVHFEK 815
Query: 908 NQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ YP+R TK +L++ELGLTF+QV +WF +TR
Sbjct: 816 DPYPSRETKVNLSEELGLTFNQVSRWFSSTR 846
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 39/64 (60%)
Query: 887 RTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSS 946
+ S G V+QKL F+++ YP R TKESLA+ELGLT +Q+ KWF TR +
Sbjct: 1038 KVSKYHFGPIVSQKLKEHFEKDPYPCRATKESLARELGLTCNQISKWFSATRHYSRDAVA 1097
Query: 947 KNAK 950
KN K
Sbjct: 1098 KNQK 1101
>gi|326514940|dbj|BAJ99831.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1240
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 252/607 (41%), Positives = 340/607 (56%), Gaps = 75/607 (12%)
Query: 363 SACLGRKGKRATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQK 422
+A GRKG R S Y +RS + RVLRSRS P +D I ER +
Sbjct: 51 AANRGRKGSRVLSS--RTYPLRSSESTVRVLRSRSVADKSP-------SDAVQIAERAAE 101
Query: 423 K-------------RNKIRRKKIVA----DEYSRIRTHLRYLLNRINYEQNLIDAYSSEG 465
K +I+R + DE S+IR +RY+LNR+NY+Q+ ++AY++EG
Sbjct: 102 KPPSDSVDAVVKPPAKRIKRDRPAKGGPDDELSKIRKRIRYVLNRMNYQQSFLEAYANEG 161
Query: 466 WKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSED 524
WK S+EK++PEKEL+RA +EI+R KL+IR+ FQ LD L G +SLFDSEG+I S+D
Sbjct: 162 WKNQSLEKIRPEKELERAKAEIMRCKLRIREAFQNLDHLLTLGKLEESLFDSEGEISSDD 221
Query: 525 IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDC 584
I CA C +D++ +NDIILCDGACDRGFHQ CL PPLL +DIP +EGWLCP CDCK+DC
Sbjct: 222 IVCATCSLQDVTLNNDIILCDGACDRGFHQNCLNPPLLTKDIPEGEEGWLCPACDCKIDC 281
Query: 585 IDLVNELQGTRLFITDNWEKVFPEAAA-GHNQDPNFGLASDDSDDNEYDPDGSATDEQDE 643
I+L+NELQGT L I D+WEKVFPEAAA H N D + D D + ++E
Sbjct: 282 IELINELQGTDLDINDSWEKVFPEAAAVAHGSMQNDIPDLPSDDSEDDDFDPNISEEHVS 341
Query: 644 G------DESSSDGSSSDDSDFTSTSDEVEAPADDKTY--LGLSSEDSEDDEYNPDAPEL 695
G +E +GS SDDS+F ++S+ E + + LGL SEDSEDD+Y+P+ P+
Sbjct: 342 GHVEGSSEEEGDEGSDSDDSNFITSSENSEHVKEKEKVDDLGLPSEDSEDDDYDPEGPDS 401
Query: 696 DDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPL--GHSNGQRYKDGGNNES 753
D + E SDFTSDS+D A E +S G DE ++ P G +N N S
Sbjct: 402 DKDI--EEKQDESDFTSDSDDFCA--EITKSCGKDEVSSGPKVGGRTNDLEGAPVRPNTS 457
Query: 754 LNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKR 813
+++ ++ QD +R ++LDYKKLYD+ YG P DSSD E WS
Sbjct: 458 MSHLTTKDLEIDQD-VILSSRRRQVQQLDYKKLYDDAYGEAPSDSSDAEEWS-------- 508
Query: 814 TKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKR--RGRPKLNTEDSNISPAK 871
GK+ + K E+ NE ++ + RG ++ + +
Sbjct: 509 -------------GKS-------TLKEDNEERNEVDSFACKSSRGTRAVHPDSFHGLVND 548
Query: 872 SHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVR 931
H G T S G + R+ G + ++L F QYP+R KESLA+ELGLTF QV
Sbjct: 549 QHAGGLT--SDGSNSKAKRRQFGPVINERLNQHFTTEQYPSRALKESLAQELGLTFRQVN 606
Query: 932 KWFENTR 938
KWFE+TR
Sbjct: 607 KWFESTR 613
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 85/185 (45%), Gaps = 31/185 (16%)
Query: 765 GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSAS 824
GQ PV ++ + D KKLY +G P DSS E S G P K + KE + S
Sbjct: 962 GQYVVLPVSARQQVQSPDSKKLY-AAHGKAPSDSSYSEDLSRKGTPEK--DNGKESEAGS 1018
Query: 825 PDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGR 884
+ + R ++S + + E ++P G ST G +
Sbjct: 1019 FVNEFTLQRSQQSLHGSVNGQHAEEL----------------LTP----NGSSTTGQK-- 1056
Query: 885 RHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHP 944
R+ G + Q+L+ F+ N YP+R KESLA+ELGLTF QV +WFE+ R
Sbjct: 1057 ------RQYGPIINQRLHEQFETNPYPSRAVKESLARELGLTFRQVERWFESKRRRLRVA 1110
Query: 945 SSKNA 949
SSKN+
Sbjct: 1111 SSKNS 1115
>gi|414880502|tpg|DAA57633.1| TPA: putative homeodomain-like transcription factor superfamily
protein [Zea mays]
Length = 1577
Score = 349 bits (895), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 239/557 (42%), Positives = 320/557 (57%), Gaps = 53/557 (9%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRN-KIRRKKIVADEYSRIRTHLRYLLN 449
RVLRS SG++ +N + S KKRN K + + +Y +I ++Y+LN
Sbjct: 337 RVLRSASGKKNETCTEPVNGS--TSAEPTVTKKRNCKPSKDRSPKKDYLKICQRIKYILN 394
Query: 450 RINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG- 508
R+ YEQ I AY+SEGWKG S+EK++PEKEL+RA +EIL+ KL+IR+ F+ +DSL + G
Sbjct: 395 RMKYEQAFIQAYASEGWKGQSLEKIRPEKELERAKAEILQCKLRIREAFRNMDSLLSKGK 454
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
+SLFDS GQI SEDI+CA CGSKD+++ NDIILCDGACDRGFHQ CL PPLL E+IPP
Sbjct: 455 LEESLFDSAGQISSEDIFCAICGSKDVTSQNDIILCDGACDRGFHQNCLSPPLLTEEIPP 514
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
DEGWLCP C CK D ID +NELQG++L I D+W KVFPEAA+ N S D
Sbjct: 515 GDEGWLCPACVCKADYIDALNELQGSKLSIHDSWVKVFPEAASTAN-------GSKQVDA 567
Query: 629 NEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEY 688
++ PD +D + + ++ F S+E ++ ADD LGL SEDS D ++
Sbjct: 568 SDLLPD----HIKDSANLALVGTHMVNEIRF---SEEDDSKADD---LGLPSEDSGDGDF 617
Query: 689 NPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQ----- 743
+P P+ + +S SDFTSDS+ A E +S G DE +ASPL + +
Sbjct: 618 DPAGPDSSEDQNDGLNSEESDFTSDSDYFCA--EIAKSCGQDEVSASPLSNVINRTDRMK 675
Query: 744 -RYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDE 802
R +NE +N ++ QD PV +R ERLDYKKLYDE Y +SSD+E
Sbjct: 676 LRAYSRRSNEENHNHAFMDMELEQDIVLPVPSRRQVERLDYKKLYDEAYEEESSNSSDEE 735
Query: 803 SWSDDGGPRKRTKSTKEGSSA-SPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLN 861
K EGS S D + ++R A ++ NE TP+ R
Sbjct: 736 E--------WSGKELLEGSGTDSLDERLRPVKRCSRRAPAGQQNNEL--TPQSR------ 779
Query: 862 TEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAK 921
P S+ T R ++ RK EVTQKL F+++ YP+R TKE+L++
Sbjct: 780 -------PHGSNSEQKTEVLRSNGSSSTGRKYDPEVTQKLKVHFEKDPYPSRETKENLSE 832
Query: 922 ELGLTFSQVRKWFENTR 938
ELGLTF+QV KWF +TR
Sbjct: 833 ELGLTFNQVSKWFSSTR 849
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 40/64 (62%)
Query: 887 RTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSS 946
+ S G V+QKL F+++ YP R TKESLA+ELGLTF+Q+ KWF TR +
Sbjct: 1036 KVSKYHFGPIVSQKLKEHFEKDPYPCRATKESLAQELGLTFNQISKWFSGTRHYSRDAVA 1095
Query: 947 KNAK 950
KN K
Sbjct: 1096 KNQK 1099
>gi|162458957|ref|NP_001105471.1| homeobox3 [Zea mays]
gi|1143705|emb|CAA61909.1| Hox2a [Zea mays]
Length = 1576
Score = 347 bits (890), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 238/559 (42%), Positives = 318/559 (56%), Gaps = 58/559 (10%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERK---QKKRN-KIRRKKIVADEYSRIRTHLRY 446
RVLRS SG++ + VN R+ KKRN K + + +Y +I ++Y
Sbjct: 337 RVLRSASGKK-----NETCTEPVNRSTSREPTVTKKRNCKPSKDRGPKKDYLKICQRIKY 391
Query: 447 LLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA 506
+LNR+ YEQ I AY+SEGWKG S+EK++PEKEL+RA +EIL+ KL+IR+ F+ +DSL +
Sbjct: 392 ILNRMKYEQAFIQAYASEGWKGQSLEKIRPEKELERAKAEILQCKLRIREAFRNMDSLLS 451
Query: 507 GG-FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKED 565
G +SLFDS GQI SEDI+CA CGSKD+++ NDIILCDGACDRGFHQ CL PPLL E+
Sbjct: 452 KGKLEESLFDSAGQISSEDIFCAICGSKDVTSQNDIILCDGACDRGFHQNCLSPPLLTEE 511
Query: 566 IPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDD 625
IPP DEGWLCP C CK D ID +NELQG++L I D+W KVFPEAA+ N S
Sbjct: 512 IPPGDEGWLCPACVCKADYIDALNELQGSKLSIHDSWVKVFPEAASTAN-------GSKQ 564
Query: 626 SDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSED 685
D ++ PD +D + + ++ F S+E ++ ADD LGL SEDS D
Sbjct: 565 VDASDLLPD----HIKDSANLALVGTHMVNEIRF---SEEDDSKADD---LGLPSEDSGD 614
Query: 686 DEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQ-- 743
+++P P+ + +S SDFTSDS+ A E +S G DE +ASPL + +
Sbjct: 615 GDFDPAGPDSSEDQNDGLNSEESDFTSDSDYFCA--EIAKSCGQDEVSASPLSNVINRTD 672
Query: 744 ----RYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSS 799
R +NE +N ++ QD PV +R ERLDYKKLYDE Y +SS
Sbjct: 673 RMKLRAYSRRSNEENHNHAFMDMELEQDIVLPVPSRRQVERLDYKKLYDEAYEEESSNSS 732
Query: 800 DDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPK 859
D+E K EGS + +R A ++ NE TP+ R
Sbjct: 733 DEEE--------WSGKELLEGSGTDSLDEPSSCKRLSRRAPAGQQNNEL--TPQSR---- 778
Query: 860 LNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919
P S+ T R ++ RK EVTQKL F+++ YP+R TKE+L
Sbjct: 779 ---------PHGSNSEQKTEVLRSNGSSSTGRKYDPEVTQKLKVHFEKDPYPSRETKENL 829
Query: 920 AKELGLTFSQVRKWFENTR 938
++ELGLTF+QV KWF +TR
Sbjct: 830 SEELGLTFNQVSKWFSSTR 848
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 40/64 (62%)
Query: 887 RTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSS 946
+ S G V+QKL F+++ YP R TKESLA+ELGLTF+Q+ KWF TR +
Sbjct: 1035 KVSKYHFGPIVSQKLKEHFEKDPYPCRATKESLAQELGLTFNQISKWFSGTRHYSRDAVA 1094
Query: 947 KNAK 950
KN K
Sbjct: 1095 KNQK 1098
>gi|357128751|ref|XP_003566033.1| PREDICTED: homeobox protein HOX1A-like [Brachypodium distachyon]
Length = 963
Score = 342 bits (877), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 223/473 (47%), Positives = 297/473 (62%), Gaps = 41/473 (8%)
Query: 363 SACLGRKGKRATKSLKNNYTVRSLIGSD-----RVLRSRSGERPIPPESSINLADV---- 413
+A GRKG R NY +RS SD RVLRSRS P +S+ L
Sbjct: 54 TANRGRKGYRVLAD--RNYPLRS---SDCDTTVRVLRSRSAANKPPSDSAHTLVHSAANK 108
Query: 414 ---NSIGERKQKKRNKIRR----KKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGW 466
+S+ Q +I+R KK +E+S+IR +RY+LNR+NYEQ+L++AY++EGW
Sbjct: 109 FPSDSVHTLVQPAAKRIKRGRPTKKGPNNEFSKIRQRVRYILNRMNYEQSLLEAYANEGW 168
Query: 467 KGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDI 525
K S+EK++PEKEL+RA EI+R KL+IR+ FQ LDSL + G +SLFD+EG+I EDI
Sbjct: 169 KRQSLEKIRPEKELERAKEEIVRCKLRIREAFQNLDSLLSMGKLDESLFDNEGKISCEDI 228
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
CA C S+ ++ +NDIILCDG CDRGFHQ CL PPLL +DIP DEGWLCP CDCK+DCI
Sbjct: 229 ICATCSSQYVTLNNDIILCDGVCDRGFHQKCLNPPLLTKDIPAGDEGWLCPACDCKIDCI 288
Query: 586 DLVNELQGTRLFITDNWEKVFPEAAA-GHNQDPN--FGLASDDSDDNEYDPDGSATDEQD 642
D++NELQGT L I+D+WEKVFPE AA H P F L S+DS+D+++DP+ +E
Sbjct: 289 DVINELQGTNLSISDSWEKVFPETAALAHGTIPKDAFDLPSEDSEDSDFDPN--IAEEHV 346
Query: 643 EG------DESSSDGSSSDDSDFTSTSDEVEAPADDKTY--LGLSSEDSEDDEYNPDAPE 694
G +E +GS SDDS+F +T D E + + LGL SEDSEDD+Y+P P+
Sbjct: 347 TGHEEGSSEEDEDEGSDSDDSNFVTTCDNSEHLKEKEKVDDLGLPSEDSEDDDYDPAGPD 406
Query: 695 LDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDE-GAASPLGH-SNGQRYKDGGNNE 752
D + ++ SDFTSDS+D A E +S G DE + + +G +N +N
Sbjct: 407 SDKDIKEKQDE--SDFTSDSDDFCA--EIAKSCGRDEVSSGAKVGDPTNDLEGATACSNT 462
Query: 753 SLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWS 805
++ N + ++ QD PV G+R +RLDYKKLYD+ YG P DSSD E WS
Sbjct: 463 AIPNLTSNDLEIDQDEVLPVLGRRQGQRLDYKKLYDDAYGEAPSDSSDGEEWS 515
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 86/193 (44%), Gaps = 46/193 (23%)
Query: 766 QDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRK----------RTK 815
QD +PV G+R + LD KKLYD DSSD E WS P K RT
Sbjct: 675 QDVVSPVSGRRQDQSLDCKKLYDACR-EASSDSSDGEDWSGKCTPEKDSEKQSDTCRRTG 733
Query: 816 STKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEG 875
+ ++ + ++PD A++ L+ T + E
Sbjct: 734 AVRQSNGSTPDN-------------ARQSLHRTNGQ-------------------HTEEL 761
Query: 876 CSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFE 935
++ GS HR R+ G + Q+L+ FK +QYP R KESLA ELGLTF QV KWFE
Sbjct: 762 LTSDGSSSTVHR---RQFGPIIYQRLHEHFKTDQYPKRAVKESLANELGLTFQQVSKWFE 818
Query: 936 NTRWSFNHPSSKN 948
R+ + KN
Sbjct: 819 TRRYKTRIAAKKN 831
>gi|125603298|gb|EAZ42623.1| hypothetical protein OsJ_27188 [Oryza sativa Japonica Group]
Length = 634
Score = 333 bits (853), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 205/442 (46%), Positives = 263/442 (59%), Gaps = 53/442 (11%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKI-----------RRKKI------- 432
RV + R +P+ P S L S E+K K N++ +++K+
Sbjct: 170 RVAKKRKRSKPLRPAPSRVL---RSTSEKKNKAHNELLNDGAGVQPAEKKRKVGRPPKGG 226
Query: 433 -VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
D+Y IR +RY+LNR+NYEQ+LI AY+SEGWKG S+EK++PEKEL+RA EILR K
Sbjct: 227 TPKDDYLMIRKRVRYVLNRMNYEQSLIQAYASEGWKGQSLEKIRPEKELERAKVEILRCK 286
Query: 492 LKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
+IR+ F+ LDSL + G +S+FDS G+I SEDI+CA CGSKD++ NDIILCDG CDR
Sbjct: 287 SRIREAFRNLDSLLSEGKLDESMFDSAGEISSEDIFCAACGSKDVTLKNDIILCDGICDR 346
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
GFHQYCL PPLL EDIP DEGWLCP CDCK+DCID++NELQG +L I D+WEKVFPEAA
Sbjct: 347 GFHQYCLNPPLLAEDIPQGDEGWLCPACDCKIDCIDVLNELQGVKLSIHDSWEKVFPEAA 406
Query: 611 A---GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGS------------SSD 655
+ G Q L SDDS DN+YDP + + DE S DG S
Sbjct: 407 SFLNGSKQIDASDLPSDDSADNDYDPTLAQGHKVDEEKSSGEDGGEGLDSDDSSSEDSES 466
Query: 656 DSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSG---SDFTS 712
S + + DD LGL SEDSED +++P P+ D + ES+S SDFTS
Sbjct: 467 SEKEKSKTSQNGRTVDD---LGLPSEDSEDGDFDPAGPDSDKEQNDESNSDQSDESDFTS 523
Query: 713 DSEDLAAVLEDNRSSGNDE--GAAS----PLGHSNGQRYKDGGNNESLNNELLSIIKPGQ 766
DS+D A E +S G DE G +S + ++G + DG N +N + Q
Sbjct: 524 DSDDFCA--EIAKSCGQDEISGPSSSQIRTVDRTDGSGF-DGEPNAENSNLAFMETELEQ 580
Query: 767 DGAAPVYGKRSSERLDYKKLYD 788
D P+ KR ERLDYKKLY+
Sbjct: 581 DMVLPISSKRQVERLDYKKLYN 602
>gi|413952454|gb|AFW85103.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 1513
Score = 291 bits (744), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 207/557 (37%), Positives = 296/557 (53%), Gaps = 92/557 (16%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRN-KIRRKKIVADEYSRIRTHLRYLLN 449
RVLRS SG++ E+ I + ++ E KRN K + +Y +I +RY+LN
Sbjct: 341 RVLRSASGKKN---EACIGHVNDSTSAEPTVTKRNRKPSMDRSPKKDYLKICQRVRYILN 397
Query: 450 RINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG- 508
R+NY+Q I AY+SEGWKG S+EK++PEKEL+RA +EIL+ KL+IR+ F+ +DSL + G
Sbjct: 398 RMNYQQTFIQAYASEGWKGQSLEKIRPEKELERAKAEILQCKLRIREAFRNMDSLLSKGK 457
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
+SLFDS G+I SED+ +PP
Sbjct: 458 LEESLFDSAGEISSEDV---------------------------------------GVPP 478
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
D+ WLCP C CK D ID +NELQG++L I D+WEKVFPEAA+ N S D
Sbjct: 479 GDQRWLCPACVCKADSIDALNELQGSKLSIHDSWEKVFPEAASIAN-------GSKQVDT 531
Query: 629 NEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEY 688
++ PD + + + +G ++ ++ D + ADD LGL SEDS D ++
Sbjct: 532 SDLLPD----HIKHSDNPALVEGLMVNEVRLSAEDD---SKADD---LGLPSEDSGDGDF 581
Query: 689 NPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYK-- 746
+P P+ + +S SDFTSDS+D A E +S G DE +ASPL + Y+
Sbjct: 582 DPSGPDSSEDQKDGLNSEESDFTSDSDDFCA--EIAKSCGQDEVSASPLSNVINHTYRMK 639
Query: 747 ----DGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDE 802
+ +NE ++ + ++ GQD PV +R ERLDYKKLYDE YG +SSDD+
Sbjct: 640 LRASNNRSNEENHDHVFMDMELGQDMVLPVSSRRQVERLDYKKLYDEAYGKESSNSSDDK 699
Query: 803 SWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLN- 861
WS K EGS + P +R S +A E+ N E+TP+R +L+
Sbjct: 700 EWSG--------KELLEGSETDSLSERPHPVKRCSRRAQAEQQN-NEHTPQRE---RLHG 747
Query: 862 TEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAK 921
+E + G S+ G RK G TQKL F+++ YP+R TKE++++
Sbjct: 748 SESEQKTGILRSNGSSSTG----------RKFGPVATQKLKVHFEKDPYPSRETKENISE 797
Query: 922 ELGLTFSQVRKWFENTR 938
ELGLTF+QV +WF +TR
Sbjct: 798 ELGLTFNQVSRWFSSTR 814
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 4/76 (5%)
Query: 893 LGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLA 952
G VTQKL F+++ YP R TKE LA+ELGLTF+Q+ KWF T +KN K
Sbjct: 1020 FGPIVTQKLKEHFEKDPYPCRATKEGLAQELGLTFNQISKWFSATHHYSRDAVAKNQKYP 1079
Query: 953 NSEKGTCTPQSNKNTV 968
G T ++N +T+
Sbjct: 1080 ----GENTTENNSSTI 1091
>gi|414880503|tpg|DAA57634.1| TPA: putative homeodomain-like transcription factor superfamily
protein, partial [Zea mays]
Length = 683
Score = 290 bits (741), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 168/351 (47%), Positives = 223/351 (63%), Gaps = 23/351 (6%)
Query: 391 RVLRSRSGERPIPPESSINLADVNSIGERKQKKRN-KIRRKKIVADEYSRIRTHLRYLLN 449
RVLRS SG++ +N + S KKRN K + + +Y +I ++Y+LN
Sbjct: 337 RVLRSASGKKNETCTEPVNGS--TSAEPTVTKKRNCKPSKDRSPKKDYLKICQRIKYILN 394
Query: 450 RINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGG- 508
R+ YEQ I AY+SEGWKG S+EK++PEKEL+RA +EIL+ KL+IR+ F+ +DSL + G
Sbjct: 395 RMKYEQAFIQAYASEGWKGQSLEKIRPEKELERAKAEILQCKLRIREAFRNMDSLLSKGK 454
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
+SLFDS GQI SEDI+CA CGSKD+++ NDIILCDGACDRGFHQ CL PPLL E+IPP
Sbjct: 455 LEESLFDSAGQISSEDIFCAICGSKDVTSQNDIILCDGACDRGFHQNCLSPPLLTEEIPP 514
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
DEGWLCP C CK D ID +NELQG++L I D+W KVFPEAA+ N S D
Sbjct: 515 GDEGWLCPACVCKADYIDALNELQGSKLSIHDSWVKVFPEAASTAN-------GSKQVDA 567
Query: 629 NEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEY 688
++ PD +D + + ++ F S+E ++ ADD LGL SEDS D ++
Sbjct: 568 SDLLPD----HIKDSANLALVGTHMVNEIRF---SEEDDSKADD---LGLPSEDSGDGDF 617
Query: 689 NPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGH 739
+P P+ + +S SDFTSDS+ A E +S G DE +ASPL +
Sbjct: 618 DPAGPDSSEDQNDGLNSEESDFTSDSDYFCA--EIAKSCGQDEVSASPLSN 666
>gi|168020007|ref|XP_001762535.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686268|gb|EDQ72658.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 882
Score = 268 bits (684), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 183/523 (34%), Positives = 269/523 (51%), Gaps = 57/523 (10%)
Query: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGL--------SVEKLKPEKELQRA 483
+ D +RI+ ++YLL ++ +QNL+DAYS EGWKG S EK++PE+ELQRA
Sbjct: 245 VPLDNRTRIKRRVKYLLMKMRVDQNLLDAYSGEGWKGQRNSLFGISSREKIRPEQELQRA 304
Query: 484 TSEILRRKLKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDII 542
++IL KL IR+ LD+L G ++ FD+EG++ E+I+CAKC S+D DNDII
Sbjct: 305 EAQILHSKLAIREAIHELDNLGLEGSLDENAFDAEGRVYHEEIFCAKCRSQDALPDNDII 364
Query: 543 LCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNW 602
LCDGAC+RGFHQYCL+PPL +IPP DEGWLCP C+CK++CI+++N GT + ++W
Sbjct: 365 LCDGACNRGFHQYCLDPPLATINIPPGDEGWLCPVCECKMECIEVINAYLGTHFEVENSW 424
Query: 603 EKVFPE----AAAGHNQDPNFG-LASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDS 657
E++F E AA G Q G S DS+D++Y+P+G+ +++ + + DS
Sbjct: 425 EELFSEEALVAAGGRTQGVTDGDFPSSDSEDDDYNPEGA--EKRTDSESEGESDEGGGDS 482
Query: 658 DFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDL 717
D S E EA DK D +D + L + S S D
Sbjct: 483 DSGSDFRENEASGSDK--------DEQDPMQATNLERLARTAKRTSKEGSGGPESSGSDD 534
Query: 718 AAV-LEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKR 776
+ V L D + G G G R +D ES ++E +I GKR
Sbjct: 535 SEVFLVDGFTDG---------GVKLGLRRRDSQAEESASDEETMVIA----------GKR 575
Query: 777 SSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRK 836
+ +DYK+L+DE +GN+ D D S +D GP++R + T PD + R
Sbjct: 576 HRKAVDYKRLHDEMFGNMEIDVDDIISEDEDWGPKRRRRRT------IPDDPSRSRPPRV 629
Query: 837 STKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEE 896
+ A+ + G P S++ EG + R R +RKL +
Sbjct: 630 RRRKARTSTGDALKEVDADGVP------SDLPSGSQGEGAADTLEAVRDKRM-WRKLPDS 682
Query: 897 VTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+ L + P+++ KE LA +LGL+FSQV WF+N R+
Sbjct: 683 AVETLRLAAAVTTLPSKSRKEELATQLGLSFSQVHGWFKNQRY 725
>gi|225470662|ref|XP_002269263.1| PREDICTED: pathogenesis-related homeodomain protein [Vitis
vinifera]
gi|296090281|emb|CBI40100.3| unnamed protein product [Vitis vinifera]
Length = 727
Score = 236 bits (603), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 165/517 (31%), Positives = 245/517 (47%), Gaps = 131/517 (25%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
DE SR++ RYLL ++ EQNLIDAYS EGWKG S EK++PEKELQRAT +IL+ KL I
Sbjct: 147 DEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRPEKELQRATKQILKCKLGI 206
Query: 495 RDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFH 553
RD ++L+SL + G + S+G + E I CAKC ++ DNDIILCDG C+ FH
Sbjct: 207 RDAIRQLESLSSIGCIEDTAIASDGSVYHEHIICAKCKLREAFPDNDIILCDGTCNCAFH 266
Query: 554 QYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGH 613
Q CL+PPL E+IPP D+GW C C+CK++ ++ +N GTR + W+ +F E AA
Sbjct: 267 QKCLDPPLETENIPPGDQGWFCKFCECKMEILEAMNAHLGTRFSVDSTWQDIFKEEAAL- 325
Query: 614 NQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDK 673
PDG + + P +D
Sbjct: 326 -------------------PDGGSA---------------------------LPYPEED- 338
Query: 674 TYLGLSSEDSEDDEYNPDAPELDDKV-TQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEG 732
S+DS+D +Y+P+ E + T + + SD T+ S L+ ED SG+
Sbjct: 339 ----WPSDDSQDHDYDPERNENSCSISTAGTEGNASDDTNSSLSLSWSFEDEILSGSKRS 394
Query: 733 AASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYG 792
++E+ + E++S G+R +DY+KLYDE +G
Sbjct: 395 GII-----------SADSDETSDCEIIS-------------GRRQRRAVDYRKLYDEMFG 430
Query: 793 NVPYDS---SDDESWSDDGGPRKRTKSTKEGSSASP-------DGKTPVIRRRKSTKAAK 842
+ + S+DE W GP + + KE +AS + K P + T AK
Sbjct: 431 KDAHANEQVSEDEDW----GPANKRRREKESDAASTLITLYEGEKKLPNV----ETMEAK 482
Query: 843 EKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLY 902
+K++ T RP + ++ + +KL
Sbjct: 483 QKISSDPQTK----RP-------------------------------FSRIPLDAVEKLR 507
Query: 903 NSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+F EN+ P+R +E+LAK+LGL + +V KWF+N R+
Sbjct: 508 QAFGENELPSRDVRENLAKQLGLDYEKVNKWFKNARY 544
>gi|357449593|ref|XP_003595073.1| Pathogenesis-related homeodomain protein [Medicago truncatula]
gi|355484121|gb|AES65324.1| Pathogenesis-related homeodomain protein [Medicago truncatula]
Length = 707
Score = 234 bits (596), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 171/512 (33%), Positives = 246/512 (48%), Gaps = 120/512 (23%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
D+ R++ R +L R+ EQNLIDAY+ EGWKG S EK++PE ELQRA +IL+ KL I
Sbjct: 135 DDTVRLQRRTRNILIRMKQEQNLIDAYAGEGWKGQSREKIRPEMELQRAKKQILKCKLSI 194
Query: 495 RDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFH 553
RD +LDSL + G S+ ++G + E I+CA C ++S DNDIILCDG C+R FH
Sbjct: 195 RDAIHQLDSLSSVGSIEGSVIATDGSVSHEHIFCANCKINEVSPDNDIILCDGTCNRAFH 254
Query: 554 QYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGH 613
Q CL+PPL EDIPP+D+GW C CDCK++ ++ N GTR + W+ VF E AA
Sbjct: 255 QRCLDPPLETEDIPPEDQGWFCKYCDCKIEILEATNAHLGTRFPLDSTWQDVFKEEAAI- 313
Query: 614 NQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDK 673
PDG D + E E P+D
Sbjct: 314 -------------------PDG----------------------DAALLNQEEEWPSD-- 330
Query: 674 TYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGA 733
D EDD+YNP+ +E S G F ++ D +++ +D +
Sbjct: 331 --------DPEDDDYNPE--------RKEESHGG--FNTEGND--------KNASDDSSS 364
Query: 734 ASPLGHSNGQ--RYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETY 791
+S + NG+ +G N E +N+ + + G+ G+R +DYKKLYDE +
Sbjct: 365 SSSMWSLNGECSLLDEGINLEYYSNDHIDSDESGEIAC----GRRQRRAVDYKKLYDEMF 420
Query: 792 G-NVP--YDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNET 848
G + P S+DE W GPRKR + KE + + L+E+
Sbjct: 421 GKDAPPCEQVSEDEDW----GPRKRRRREKESEAVN----------------TLMTLHES 460
Query: 849 ENT-PKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKE 907
EN P + ++ S I R +R E +KL F E
Sbjct: 461 ENKYPNNKNNDRIRGNSSGIK------------------RPCFR-FSHEAVEKLRQVFAE 501
Query: 908 NQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
N+ P ++ K++L+KELGL ++V KWF+N R+
Sbjct: 502 NELPPKSVKDALSKELGLDAAKVNKWFKNARY 533
>gi|224090517|ref|XP_002309010.1| predicted protein [Populus trichocarpa]
gi|222854986|gb|EEE92533.1| predicted protein [Populus trichocarpa]
Length = 725
Score = 232 bits (592), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 163/514 (31%), Positives = 238/514 (46%), Gaps = 112/514 (21%)
Query: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
+ DE SR++ RYL+ ++ +QNLIDAYS EGWKG S EK++PEKEL RA +IL+ K
Sbjct: 144 VKLDEPSRLQRRARYLMIKMKLDQNLIDAYSGEGWKGQSREKIRPEKELLRARKQILKCK 203
Query: 492 LKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
L +RD+ +++DSL G +++ +G + E I+CAKC ++S DNDI+LCDG C+
Sbjct: 204 LGLRDIIRQVDSLSTVGCIEETVMAPDGSVSHEHIFCAKCKLNEVSPDNDIVLCDGTCNC 263
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
FHQ CLEPPL E IPP D+GW C C+C++D I+ +N GT +W+ +F E A
Sbjct: 264 AFHQKCLEPPLDTESIPPGDQGWFCKFCECRMDIIEAMNAHLGTHFSEDSSWQDIFTEEA 323
Query: 611 AGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPA 670
A D+ + +P E E P+
Sbjct: 324 -----------AIPDAGNVLLNP-------------------------------EEEWPS 341
Query: 671 DDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSS--G 728
DD SEDD Y+P E D + E+ + S + + G
Sbjct: 342 DD----------SEDDNYDP---ERRDNIMSEAGTDDDASDDISSSTSLGWSSDGEVFLG 388
Query: 729 NDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYD 788
+ L N Y ++E+ + E+ V G+R +DYKKLYD
Sbjct: 389 SRRWEMHGLDFRNNSIYSSLDSDETSDGEI-------------VCGRRQRRAIDYKKLYD 435
Query: 789 ETYG-NVPY--DSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKL 845
E +G + P +S+DE W GP KR + KE ++AS + K + E +
Sbjct: 436 EVFGKDAPAHEQASEDEDW----GPGKRKRREKESNAAST---LMTLCESKKKSKSDETI 488
Query: 846 NETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSF 905
N P + RP +L + +KL F
Sbjct: 489 EGMMNLPPQTRRPIF-------------------------------RLPPDAVEKLRQVF 517
Query: 906 KENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
EN+ P+RT KE+L+KELGL +V KWF+N+R+
Sbjct: 518 VENELPSRTVKENLSKELGLEPGKVSKWFKNSRY 551
>gi|168055725|ref|XP_001779874.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668687|gb|EDQ55289.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 792
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 122/270 (45%), Positives = 166/270 (61%), Gaps = 22/270 (8%)
Query: 428 RRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEI 487
++ + D +RI H++YLL ++ QNL+DAY+ EGWKG S EK++PE+ELQRA ++I
Sbjct: 88 KKSDVPLDNPTRILKHVKYLLIKMRVHQNLLDAYTGEGWKGQSREKIRPEQELQRAKAQI 147
Query: 488 LRRKLKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDG 546
LR KL IR+ LD + G K+ FDSEG+I E+I+CAKC S++ DNDIILCDG
Sbjct: 148 LRSKLAIREAIHELDDVGLEGSLDKNAFDSEGRIYHEEIFCAKCKSQEALPDNDIILCDG 207
Query: 547 ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVF 606
AC+RGFHQYCL+PPL EDIPP DEGWLCP CDCK++CI+ +N GT + ++WE F
Sbjct: 208 ACNRGFHQYCLDPPLATEDIPPGDEGWLCPVCDCKMECIEAINSYFGTSFEVENSWESFF 267
Query: 607 PEAAAGHNQDPNFGLASDDSDDNEYDPDG-SATDEQDEGDESSSDG-SSSDDSDFTSTSD 664
A G+A+ D S+ DE DE D +++G SS+ +S
Sbjct: 268 SNEA---------GIAAGGGTQEGAGGDWPSSGDEDDEYDPETAEGPSSASESGAQVIGL 318
Query: 665 EVEAPADDKTYLGLSSEDSEDDEYNPDAPE 694
V+ P S D ED +Y+P+A E
Sbjct: 319 AVDWP----------SSDDEDHDYDPEAAE 338
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 82/176 (46%), Gaps = 10/176 (5%)
Query: 769 AAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDE-SWSDDGGPRKRTKSTKEGSSASPDG 827
A + GKR + +DYK+L+DE YG D+ DD S +D GP +R + T SP
Sbjct: 493 AVIIAGKRHRKAVDYKRLHDEMYGKAEADNDDDVISEDEDWGPERRRRRTIPDDPNSPRS 552
Query: 828 KTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRR-- 885
+ IR RK + +++ + G P + + S + P G
Sbjct: 553 R---IRGRK-VRTPSGDISKGADADLPSGLPDMPSHFPTGSQGEVGSPQDIPALDGAADS 608
Query: 886 ---HRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ +R+L + + L N+ P+++ KE LA +LGL+FSQV WF+N R
Sbjct: 609 PGGEKRMWRRLPDSAVEALRCILDVNRLPSKSRKEELAIKLGLSFSQVHGWFKNQR 664
>gi|356530336|ref|XP_003533738.1| PREDICTED: pathogenesis-related homeodomain protein-like [Glycine
max]
Length = 714
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 163/494 (32%), Positives = 223/494 (45%), Gaps = 112/494 (22%)
Query: 450 RINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GG 508
++ EQNLIDAYS EGWKG S EK++PEKEL RA +IL+ KL IRD ++LDSL + G
Sbjct: 157 KMKLEQNLIDAYSGEGWKGQSREKIRPEKELLRAKKQILKCKLNIRDAIRQLDSLSSVGS 216
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
S +G + E+I+CA C + DNDIILCDG C+R FHQ CL PPL E+IPP
Sbjct: 217 IEDSAIAPDGSVYHENIFCANCKLHEAFPDNDIILCDGTCNRAFHQRCLNPPLDTENIPP 276
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
D+GW C C+CK++ ++ N GT+ + W+ VF E A+
Sbjct: 277 GDQGWFCKFCECKIEILEATNAHLGTQFSLDSTWQDVFKEEASM---------------- 320
Query: 629 NEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEY 688
PDG D + E E P+DD EDD+Y
Sbjct: 321 ----PDG----------------------DIALLNPEEEWPSDD----------PEDDDY 344
Query: 689 NPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDG 748
NP+ E S D D + + + S + G P+ Y
Sbjct: 345 NPEKKE---------DSHSIDIEGDDGNASNDSTSSSSLWSLNGECPPVDEGVSHEYY-- 393
Query: 749 GNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYG-NVP--YDSSDDESWS 805
S+N S I + G G R + +DYKKLYDE +G + P S+DE W
Sbjct: 394 ----SVN----SCIDSNESGEIAC-GPRQRKAVDYKKLYDEMFGKDAPPCEQVSEDEDW- 443
Query: 806 DDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDS 865
GP KR RR K + A + E+ K K +T
Sbjct: 444 ---GPGKRK------------------RREKESDAVNTLMTLHESENKHSNNEKNDT--- 479
Query: 866 NISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGL 925
+ GS G + R S ++ + +KL +F EN+ P R+ K+SL+KELGL
Sbjct: 480 -----------TREGSSGIQIRRSCFRIPVDAVEKLRQAFAENELPPRSVKDSLSKELGL 528
Query: 926 TFSQVRKWFENTRW 939
+V KWF+N R+
Sbjct: 529 DPEKVSKWFKNARY 542
>gi|302813012|ref|XP_002988192.1| hypothetical protein SELMODRAFT_447253 [Selaginella moellendorffii]
gi|300143924|gb|EFJ10611.1| hypothetical protein SELMODRAFT_447253 [Selaginella moellendorffii]
Length = 582
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 98/178 (55%), Positives = 126/178 (70%), Gaps = 1/178 (0%)
Query: 438 SRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDL 497
++I ++Y LN + EQ+ IDAY+++GWKG S EK++PEKEL+RAT++IL KLKIR
Sbjct: 87 AKIMKQVKYYLNSMKLEQSFIDAYANDGWKGQSREKVRPEKELERATAKILGWKLKIRKA 146
Query: 498 FQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYC 556
Q LD +C G S FD++GQ+D ++I+CAKC S + DNDIILCDG C+RGFHQ C
Sbjct: 147 VQELDKMCLEGSILDSAFDADGQLDYQEIFCAKCHSHEAEVDNDIILCDGFCNRGFHQKC 206
Query: 557 LEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHN 614
L PPL EDIPP DEGW+CP CDCK DC +N+ GT + +WEK FPEA G N
Sbjct: 207 LNPPLATEDIPPGDEGWMCPICDCKSDCASFLNDHLGTDFDVEMSWEKFFPEAVEGSN 264
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 74/168 (44%), Gaps = 49/168 (29%)
Query: 772 VYGKRSSERLDYKKLYDETYGNVPYDSSDD-ESWSDDGGPRKRTKSTKEGSSASPDGKTP 830
V GKR +DY+KL+DE +G V Y+ ++ +S +D GP KR +
Sbjct: 379 VSGKRRRRDVDYRKLHDEMFG-VEYEWAEAFQSEDEDYGPLKRPR--------------- 422
Query: 831 VIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSY 890
K+R K + + N ++ +G S R
Sbjct: 423 ----------------------KQRSERKTDRNEQNQDHGENQDGFSDKSKR-------- 452
Query: 891 RKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ E Q+L +F +Q P+R +KE++AKELGL++ +V WF+N R
Sbjct: 453 --IPPEAVQRLREAFDRDQLPSRDSKEAIAKELGLSYRKVDVWFKNVR 498
>gi|302760121|ref|XP_002963483.1| hypothetical protein SELMODRAFT_438630 [Selaginella moellendorffii]
gi|300168751|gb|EFJ35354.1| hypothetical protein SELMODRAFT_438630 [Selaginella moellendorffii]
Length = 582
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 97/178 (54%), Positives = 125/178 (70%), Gaps = 1/178 (0%)
Query: 438 SRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDL 497
++I ++Y N + EQ+ IDAY+++GWKG S EK++PEKEL+RAT++IL KLKIR
Sbjct: 87 AKIMKQVKYYFNSMKLEQSFIDAYANDGWKGQSREKVRPEKELERATAKILGWKLKIRKA 146
Query: 498 FQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYC 556
Q LD +C G S FD++GQ+D ++I+CAKC S + DNDIILCDG C+RGFHQ C
Sbjct: 147 VQELDKMCLEGSILDSAFDADGQLDYQEIFCAKCHSHEAEVDNDIILCDGFCNRGFHQKC 206
Query: 557 LEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHN 614
L PPL EDIPP DEGW+CP CDCK DC +N+ GT + +WEK FPEA G N
Sbjct: 207 LNPPLATEDIPPGDEGWMCPICDCKSDCASFLNDHLGTDFDVEMSWEKFFPEAVEGSN 264
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 74/168 (44%), Gaps = 49/168 (29%)
Query: 772 VYGKRSSERLDYKKLYDETYGNVPYDSSDD-ESWSDDGGPRKRTKSTKEGSSASPDGKTP 830
V GKR +DY+KL+DE +G V Y+ ++ +S +D GP KR +
Sbjct: 379 VSGKRRRRDVDYRKLHDEMFG-VEYEWAEAFQSEDEDYGPLKRPR--------------- 422
Query: 831 VIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSY 890
K+R K + + N ++ +G S R
Sbjct: 423 ----------------------KQRSERKTDRNEQNQDHGENQDGFSDKSKR-------- 452
Query: 891 RKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ E Q+L +F +Q P+R +KE++AKELGL++ +V WF+N R
Sbjct: 453 --IPPEAVQRLRVAFDRDQLPSRDSKEAIAKELGLSYRKVDVWFKNVR 498
>gi|356558320|ref|XP_003547455.1| PREDICTED: pathogenesis-related homeodomain protein-like [Glycine
max]
Length = 600
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 166/497 (33%), Positives = 229/497 (46%), Gaps = 118/497 (23%)
Query: 450 RINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCA-GG 508
++ EQNLIDAYS EGWKG S EK++PEKEL RA +IL+ KL IRD +LDSL + G
Sbjct: 157 KMKLEQNLIDAYSGEGWKGQSREKIRPEKELLRAKKQILKCKLSIRDAIHQLDSLSSVGS 216
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
S +G + E+I+CA C + DNDIILCDG C+R FHQ CL PPL E+IPP
Sbjct: 217 IEDSAIAPDGSVYHENIFCANCKLHEAFPDNDIILCDGTCNRAFHQRCLNPPLDTENIPP 276
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
D+GW C C+CK++ ++ N GT+ + W+ VF E AA
Sbjct: 277 GDQGWFCKFCECKIEILEATNAHLGTQFSLDSTWQDVFKEEAAM---------------- 320
Query: 629 NEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEY 688
PDG D + E E P+DD EDD+Y
Sbjct: 321 ----PDG----------------------DIALLNPEEEWPSDD----------PEDDDY 344
Query: 689 NPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQ--RYK 746
NP+ E +F ++ D ++ ND + S L NG+
Sbjct: 345 NPERKE-----------DNHNFDTEGAD--------ENASNDSTSCSSLLSLNGECPPVD 385
Query: 747 DGGNNESLN-NELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETYG-NVP--YDSSDDE 802
+G +E + N L + G+ P R + +DYKKLYDE YG + P S+DE
Sbjct: 386 EGICHEYYSVNSCLDSDESGEIACGP----RQRKAVDYKKLYDEMYGKDAPPCEQMSEDE 441
Query: 803 SWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNT 862
W GP KR + KE + + L+E+EN
Sbjct: 442 DW----GPGKRKRREKESDAVN----------------TLMTLHESENK----------- 470
Query: 863 EDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKE 922
N ++ EG S G + R S ++ + +KL F EN+ P R+ K+SL+KE
Sbjct: 471 HSDNEKNDRTREGSS-----GIQIRRSCFRIPLDAVEKLRQVFAENELPPRSVKDSLSKE 525
Query: 923 LGLTFSQVRKWFENTRW 939
LGL +V KWF+N R+
Sbjct: 526 LGLDPEKVSKWFKNARY 542
>gi|449457269|ref|XP_004146371.1| PREDICTED: pathogenesis-related homeodomain protein-like [Cucumis
sativus]
Length = 714
Score = 202 bits (513), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 163/525 (31%), Positives = 237/525 (45%), Gaps = 133/525 (25%)
Query: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
+ DE SR++ RYL+ ++ EQNLIDAYS EGWKG S EK++PEKELQRA +IL+ K
Sbjct: 143 VELDEASRLQRRTRYLIIKMKLEQNLIDAYSGEGWKGQSREKIRPEKELQRAMKQILKCK 202
Query: 492 LKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
L IRD ++LD L G S+ +G + E I+CAKC ++ DNDIILCDG C+
Sbjct: 203 LGIRDAIRQLDLLGSVGCIEDSVIGPDGSVYHEHIFCAKCKLREAFPDNDIILCDGTCNC 262
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
FHQ CL+PPL + IPP D+GW C C+CK++ ++ +N GTR +
Sbjct: 263 AFHQKCLDPPLDTKSIPPGDQGWFCKFCECKMEILEGMNAHLGTRFSL------------ 310
Query: 611 AGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPA 670
N G ++ + G+A +E
Sbjct: 311 -------NIGWEDIFKEEAAFPDGGNALLNHEEDW------------------------- 338
Query: 671 DDKTYLGLSSEDS---------EDDEYNPDAPELDDK--VTQESSSSGSDFTSDSEDLAA 719
S+DS +++ ++ + E +DK + + SSS+ ++ D EDL
Sbjct: 339 --------PSDDSEDDDYDPDKKENCHDNASEEENDKEVLEESSSSTSLSWSLDGEDLV- 389
Query: 720 VLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSE 779
SGN G G SI+ G + G+R
Sbjct: 390 -------SGNGIGCEDHFGAGT------------------SIVSDGSNEEGITCGRRQRH 424
Query: 780 RLDYKKLYDETYG-NVPY---DSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRR 835
+DYKKLYDE +G + P + S+DE W GP KR RR
Sbjct: 425 AVDYKKLYDEMFGKDTPAHEQEVSEDEDW----GPAKRR------------------RRE 462
Query: 836 KSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGE 895
K AA ++ E+ K ++D ++ K S GR S+ ++
Sbjct: 463 KECDAASTLMSLCESEKK--------SQDIDMEAEK-----KLLNSHGR----SFFRIPR 505
Query: 896 EVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWS 940
+KL F +N+ P+R KE+L+KELGL +V KWF+N R+S
Sbjct: 506 HAVEKLRKVFADNELPSRDVKENLSKELGLDAEKVSKWFKNARYS 550
>gi|15233766|ref|NP_194723.1| Pathogenesis-related homeodomain protein [Arabidopsis thaliana]
gi|1346790|sp|P48785.1|PRH_ARATH RecName: Full=Pathogenesis-related homeodomain protein; Short=PRHA
gi|507220|gb|AAA32843.1| homeodomain protein [Arabidopsis thaliana]
gi|2501810|gb|AAC49836.1| PRHA [Arabidopsis thaliana]
gi|4914418|emb|CAB43669.1| pathogenesis related homeodomain protein (PRHA) [Arabidopsis
thaliana]
gi|7269893|emb|CAB79752.1| pathogenesis related homeodomain protein (PRHA) [Arabidopsis
thaliana]
gi|332660298|gb|AEE85698.1| Pathogenesis-related homeodomain protein [Arabidopsis thaliana]
Length = 796
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 88/182 (48%), Positives = 123/182 (67%), Gaps = 1/182 (0%)
Query: 431 KIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRR 490
K+ D+ R++ RYLL ++ +QNLIDAY++EGWKG S EK++P+KEL+RA EIL
Sbjct: 96 KVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKEILNC 155
Query: 491 KLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACD 549
KL +RD ++LD L + G + + S+G I + I+CA+C S++ DNDIILCDG C+
Sbjct: 156 KLGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILCDGTCN 215
Query: 550 RGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEA 609
R FHQ CL+PPL E IPP D+GW C CDCK++ ID +N GT + NW+ +F E
Sbjct: 216 RAFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFNEE 275
Query: 610 AA 611
A+
Sbjct: 276 AS 277
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 72/171 (42%), Gaps = 35/171 (20%)
Query: 772 VYGKRSSERLDYKKLYDETYGN---VPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGK 828
V G R +DY +LY E +G + S+DE W GP R K +E D
Sbjct: 368 VCGPRQRRTVDYTQLYYEMFGKDAVLQEQGSEDEDW----GPNDRRKRKRES-----DAG 418
Query: 829 TPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRT 888
+ ++ +S+K ++ + E + + + S GRR
Sbjct: 419 STLVTMCESSKKDQDVVETLEQSER--------------------DSVSVENKGGRRR-- 456
Query: 889 SYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+L +KL F E + P++ ++ LAKEL L +V KWF+NTR+
Sbjct: 457 -MFRLPRNAVEKLRQVFAETELPSKAVRDRLAKELSLDPEKVNKWFKNTRY 506
>gi|449503000|ref|XP_004161802.1| PREDICTED: pathogenesis-related homeodomain protein-like [Cucumis
sativus]
Length = 741
Score = 201 bits (510), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 163/522 (31%), Positives = 236/522 (45%), Gaps = 133/522 (25%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
DE SR++ RYL+ ++ EQNLIDAYS EGWKG S EK++PEKELQRA +IL+ KL I
Sbjct: 173 DEASRLQRRTRYLIIKMKLEQNLIDAYSGEGWKGQSREKIRPEKELQRAMKQILKCKLGI 232
Query: 495 RDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFH 553
RD ++LD L G S+ +G + E I+CAKC ++ DNDIILCDG C+ FH
Sbjct: 233 RDAIRQLDLLGSVGCIEDSVIGPDGSVYHEHIFCAKCKLREAFPDNDIILCDGTCNCAFH 292
Query: 554 QYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGH 613
Q CL+PPL + IPP D+GW C C+CK++ ++ +N GTR +
Sbjct: 293 QKCLDPPLDTKSIPPGDQGWFCKFCECKMEILEGMNAHLGTRFSL--------------- 337
Query: 614 NQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDK 673
N G ++ + G+A +E
Sbjct: 338 ----NIGWEDIFKEEAAFPDGGNALLNHEEDW---------------------------- 365
Query: 674 TYLGLSSEDS---------EDDEYNPDAPELDDK--VTQESSSSGSDFTSDSEDLAAVLE 722
S+DS +++ ++ + E +DK + + SSS+ ++ D EDL
Sbjct: 366 -----PSDDSEDDDYDPDKKENCHDNASEEENDKEVLEESSSSTSLSWSLDGEDLV---- 416
Query: 723 DNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLD 782
SGN G G SI+ G + G+R +D
Sbjct: 417 ----SGNGIGCEDHFGAGT------------------SIVSDGSNEEGITCGRRQRHAVD 454
Query: 783 YKKLYDETYG-NVPY---DSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838
YKKLYDE +G + P + S+DE W GP KR RR K
Sbjct: 455 YKKLYDEMFGKDTPAHEQEVSEDEDW----GPAKRR------------------RREKEC 492
Query: 839 KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVT 898
AA ++ E+ K ++D ++ K S GR S+ ++
Sbjct: 493 DAASTLMSLCESEKK--------SQDIDMEAEK-----KLLNSHGR----SFFRIPRHAV 535
Query: 899 QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWS 940
+KL F +N+ P+R KE+L+KELGL +V KWF+N R+S
Sbjct: 536 EKLRKVFADNELPSRDVKENLSKELGLDAEKVSKWFKNARYS 577
>gi|334187017|ref|NP_001190867.1| Pathogenesis-related homeodomain protein [Arabidopsis thaliana]
gi|332660299|gb|AEE85699.1| Pathogenesis-related homeodomain protein [Arabidopsis thaliana]
Length = 769
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 88/182 (48%), Positives = 123/182 (67%), Gaps = 1/182 (0%)
Query: 431 KIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRR 490
K+ D+ R++ RYLL ++ +QNLIDAY++EGWKG S EK++P+KEL+RA EIL
Sbjct: 96 KVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKEILNC 155
Query: 491 KLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACD 549
KL +RD ++LD L + G + + S+G I + I+CA+C S++ DNDIILCDG C+
Sbjct: 156 KLGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILCDGTCN 215
Query: 550 RGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEA 609
R FHQ CL+PPL E IPP D+GW C CDCK++ ID +N GT + NW+ +F E
Sbjct: 216 RAFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFNEE 275
Query: 610 AA 611
A+
Sbjct: 276 AS 277
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 72/171 (42%), Gaps = 35/171 (20%)
Query: 772 VYGKRSSERLDYKKLYDETYGN---VPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGK 828
V G R +DY +LY E +G + S+DE W GP R K +E D
Sbjct: 368 VCGPRQRRTVDYTQLYYEMFGKDAVLQEQGSEDEDW----GPNDRRKRKRES-----DAG 418
Query: 829 TPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRT 888
+ ++ +S+K ++ + E + + + S GRR
Sbjct: 419 STLVTMCESSKKDQDVVETLEQSER--------------------DSVSVENKGGRRR-- 456
Query: 889 SYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+L +KL F E + P++ ++ LAKEL L +V KWF+NTR+
Sbjct: 457 -MFRLPRNAVEKLRQVFAETELPSKAVRDRLAKELSLDPEKVNKWFKNTRY 506
>gi|255573765|ref|XP_002527803.1| conserved hypothetical protein [Ricinus communis]
gi|223532799|gb|EEF34575.1| conserved hypothetical protein [Ricinus communis]
Length = 732
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 90/184 (48%), Positives = 125/184 (67%), Gaps = 1/184 (0%)
Query: 428 RRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEI 487
+++K+ DE SR++ +YL+ ++ EQNLIDAYS EGWKG S EK+KPEKEL RA +I
Sbjct: 145 QKEKVKLDEPSRLQRRTKYLMIKMKLEQNLIDAYSGEGWKGQSREKIKPEKELVRAKKQI 204
Query: 488 LRRKLKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDG 546
L+ KL IRD +LDSL G S+ +G + E I+CAKC S ++S DNDI+LCDG
Sbjct: 205 LKCKLGIRDAIHQLDSLSTVGCIEDSVIAPDGSVSHEHIFCAKCKSNEVSPDNDIVLCDG 264
Query: 547 ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVF 606
C+ GFHQ CL+PPL E+IPP D+GW C C+C+++ I+ +N GT+ + W+ +F
Sbjct: 265 TCNCGFHQRCLDPPLDTENIPPGDQGWYCKFCECRMEIIEAMNAHLGTQFSVDSCWQDIF 324
Query: 607 PEAA 610
E A
Sbjct: 325 QEEA 328
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/222 (31%), Positives = 96/222 (43%), Gaps = 48/222 (21%)
Query: 723 DNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQ--DGAAPVYGKRSSER 780
DN SS G +S +G R D G+ N + S + + DG V G+R
Sbjct: 374 DNASSATSLGWSSDGEVLSGSRKWDMGDTYFRNQFIYSSLDSDETSDGEI-VCGRRQRRA 432
Query: 781 LDYKKLYDETYGNVPYDS---SDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKS 837
+DYKKLYDE +G + S+DE W GP KR + KE +AS
Sbjct: 433 VDYKKLYDEMFGKDAQEHEQVSEDEDW----GPGKRKRREKESDAAS------------- 475
Query: 838 TKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEV 897
L E+E T K+ K H SR + + ++
Sbjct: 476 ---TLMTLYESEKTSKK---------------VKKH-------SRDPQVKRPLFRIPPSA 510
Query: 898 TQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+KL F EN+ P+RT KE+L+KELGL +V KWF+N R+
Sbjct: 511 VEKLRQVFAENELPSRTIKENLSKELGLEPGKVSKWFKNARY 552
>gi|297798990|ref|XP_002867379.1| hypothetical protein ARALYDRAFT_913493 [Arabidopsis lyrata subsp.
lyrata]
gi|297313215|gb|EFH43638.1| hypothetical protein ARALYDRAFT_913493 [Arabidopsis lyrata subsp.
lyrata]
Length = 781
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 120/182 (65%), Gaps = 1/182 (0%)
Query: 431 KIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRR 490
K+ D+ R++ RYLL ++ +QNLIDAY++EGWKG S EK++P+KEL+RA +IL
Sbjct: 96 KVEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKQILNC 155
Query: 491 KLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACD 549
KL +RD ++LD L + G + + +G I + I+CA C S++ DNDIILCDG C+
Sbjct: 156 KLGLRDAIRQLDLLSSVGSMEEKVIAPDGSIHHDHIFCAVCNSREAFPDNDIILCDGTCN 215
Query: 550 RGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEA 609
R FHQ CL+PPL E IPP D GW C CDCK++ ID +N GT + NW+ +F E
Sbjct: 216 RAFHQKCLDPPLETESIPPGDLGWFCKFCDCKMEIIDTMNAQIGTHFPVDSNWQDIFNEE 275
Query: 610 AA 611
A+
Sbjct: 276 AS 277
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 72/171 (42%), Gaps = 35/171 (20%)
Query: 772 VYGKRSSERLDYKKLYDETYGN---VPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGK 828
V G R +DY KLY E +G + S+DE W GP R K +E D
Sbjct: 374 VCGPRQRRTVDYTKLYYEMFGKDAVLQEQGSEDEDW----GPNDRRKRKRES-----DAG 424
Query: 829 TPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRT 888
+ ++ +S+K ++ + E + + + S GRR
Sbjct: 425 STLVTMCESSKKDQDVVETLEQSER--------------------DSVSVENKGGRR--- 461
Query: 889 SYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+L + +KL F E + P + ++ L+KEL L +V KWF+NTR+
Sbjct: 462 PMFRLPKYAVEKLRQVFAETELPTKAVRDRLSKELSLDPEKVNKWFKNTRY 512
>gi|357138687|ref|XP_003570921.1| PREDICTED: pathogenesis-related homeodomain protein-like
[Brachypodium distachyon]
Length = 736
Score = 192 bits (488), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 147/509 (28%), Positives = 223/509 (43%), Gaps = 124/509 (24%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
DE +R+ RYLL +I EQNL+DAYS +GW G S EKLKPEKELQRA +I++ K+ I
Sbjct: 108 DEATRLERRARYLLIKIKSEQNLLDAYSGDGWNGHSREKLKPEKELQRAKRQIIKSKIAI 167
Query: 495 RDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFH 553
RD+ +LD +G S+ + ++ ++ C++C S + DN II C+G+C +H
Sbjct: 168 RDIIHQLDLYSSSGNMDDSVMPPDEPVNPDNTICSRCKSDESVPDNKIIFCEGSCKMSYH 227
Query: 554 QYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA--A 611
Q C EPP K +P GWLC C CK+ ++ VN GT L +T + +F EA
Sbjct: 228 QKCSEPPFDKI-LPTGGHGWLCKFCLCKMKILEAVNAHLGTSLTVTCPSKDIFKEATEHI 286
Query: 612 GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPAD 671
G + P L S+ S D +YDP E+++G SS
Sbjct: 287 GSDDGPGEDLLSEYSGDEDYDP------EENDGTSSS----------------------- 317
Query: 672 DKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDE 731
LG E ++ ES+ SGS S ++D+ + D
Sbjct: 318 ----LGRGEE----------------SISSESNCSGSPLYSPNDDIPGFI------SADF 351
Query: 732 GAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETY 791
A H+N D G E + E+++ +P +D +DY++L +E +
Sbjct: 352 TDAEGFCHANSHLEFDSG--EDVTAEMVNYQRPKRD-------------VDYRRLNEEMF 396
Query: 792 GNVPYD--SSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKSTKAAKEKLNETE 849
G + + SDDE W + R+R S G + S +G + V
Sbjct: 397 GKLAENEKQSDDEDWGVNRKKRRRVDSA--GGAKSVEGVSGV------------------ 436
Query: 850 NTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQ 909
T + N+ P HR ++ + L F ++
Sbjct: 437 ------------TSNENLQP----------------HRRKLFRMPPAAVEVLRKVFAVDE 468
Query: 910 YPNRTTKESLAKELGLTFSQVRKWFENTR 938
P R KE LA ELG+++ ++ KWF+NTR
Sbjct: 469 LPARDVKEKLATELGISYEKIDKWFKNTR 497
>gi|356564821|ref|XP_003550646.1| PREDICTED: pathogenesis-related homeodomain protein-like [Glycine
max]
Length = 617
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 120/181 (66%), Gaps = 7/181 (3%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
D+ SR++ RYLL ++ EQNLIDAYS EGWKG S EK+KPEKELQRA +IL+ +L I
Sbjct: 132 DDPSRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGQSREKIKPEKELQRARKQILKCRLGI 191
Query: 495 RDLFQRLDSLCAGGFPKSLFDS----EGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
RD+ ++LDSL G S+ DS +G + E I+C KC ++ DNDIILC+G C R
Sbjct: 192 RDVIRQLDSL---GSLSSIEDSAIAPDGSVYHEHIFCVKCTVREELPDNDIILCNGTCKR 248
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
FHQ CL+PPL E+IPP ++GW C C+CK++ ++ N GT + W+ VF E A
Sbjct: 249 AFHQKCLDPPLDTENIPPGEQGWFCKFCECKMEILEATNAHLGTHFSLHSTWQDVFKEEA 308
Query: 611 A 611
A
Sbjct: 309 A 309
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 76/169 (44%), Gaps = 39/169 (23%)
Query: 774 GKRSSERLDYKKLYDETYG-NVP-YDS-SDDESWSDDGGPRKRTKSTKEGSSASPDGKTP 830
G R + +DYKKLYDE +G + P Y+ S+DE W GP KR + KE +
Sbjct: 399 GCRQRKAVDYKKLYDEMFGKDAPAYEQLSEDEDW----GPGKRKRREKESDAVD------ 448
Query: 831 VIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSY 890
+ L+E+E N+ P H + S + +
Sbjct: 449 ----------SLMTLHESE----------------NMHPNNEHLDMTRKNSSSIKIKRHC 482
Query: 891 RKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
++ + +KL F EN+ P R+ +E L+KELGL +V KWF+N R+
Sbjct: 483 FRIPHDAVEKLRQVFAENELPPRSIREGLSKELGLDTEKVSKWFKNARY 531
>gi|224144330|ref|XP_002325264.1| predicted protein [Populus trichocarpa]
gi|222866698|gb|EEF03829.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 84/181 (46%), Positives = 119/181 (65%), Gaps = 1/181 (0%)
Query: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491
+ DE R++ RYL+ ++ EQNLIDAYS EGWKG S EK++PEKEL RA +IL+ K
Sbjct: 142 VKLDEPPRLQRRARYLMIKMKLEQNLIDAYSGEGWKGKSREKIRPEKELLRARKQILKCK 201
Query: 492 LKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
L +R++ +++DSL G ++ +G + E I+CAKC ++S DNDI+LCDG C+
Sbjct: 202 LGLREIIRQVDSLSTVGCIEDAVMAPDGSVSHEHIFCAKCKLNEVSQDNDIVLCDGTCNC 261
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
FHQ CL+PPL E+IPP D+GW C CDC+++ I+ +N GT W+ +F E A
Sbjct: 262 AFHQKCLDPPLDTENIPPGDQGWFCKFCDCRMEIIEAMNAHLGTHFSEDSGWQDIFKEEA 321
Query: 611 A 611
A
Sbjct: 322 A 322
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 95/232 (40%), Gaps = 69/232 (29%)
Query: 772 VYGKRSSERLDYKKLYDETYG-NVPYDS--SDDESWSDDGGPRKRTKSTKEGSSASPDGK 828
+ G+R +DYKKLY+E +G + P S+DE W GP KR + KE +AS
Sbjct: 417 ICGRRQRRAVDYKKLYNEMFGKDAPAHEQPSEDEDW----GPSKRKRREKESDAASTLMT 472
Query: 829 TPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRT 888
+RR A E + + P+ R RP
Sbjct: 473 LYESKRRCKNDATIEGMMKLPRDPQIR-RPIF---------------------------- 503
Query: 889 SYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW--------- 939
+L + +KL F EN+ P+RT KE+L+KELGL +V KWF+N+R+
Sbjct: 504 ---RLPPDAVEKLRQVFAENELPSRTVKENLSKELGLEPGKVSKWFKNSRYLALKSRKVE 560
Query: 940 ----------------SFNHPSSKNAKL-----ANSEKGTCTPQSNKNTVGR 970
+ N K A L + +E G CTP++ K + R
Sbjct: 561 KGEQVHYSSSKVSAEPTLNVMKDKTADLSLLKDSQAETGVCTPENLKRILQR 612
>gi|115444215|ref|NP_001045887.1| Os02g0147800 [Oryza sativa Japonica Group]
gi|45736026|dbj|BAD13053.1| putative pathogenesis related homeodomain protein PRHA [Oryza
sativa Japonica Group]
gi|113535418|dbj|BAF07801.1| Os02g0147800 [Oryza sativa Japonica Group]
gi|215694461|dbj|BAG89438.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215697796|dbj|BAG91989.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704341|dbj|BAG93775.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218190057|gb|EEC72484.1| hypothetical protein OsI_05848 [Oryza sativa Indica Group]
gi|222622167|gb|EEE56299.1| hypothetical protein OsJ_05372 [Oryza sativa Japonica Group]
gi|284431776|gb|ADB84629.1| homeodomain protein [Oryza sativa Japonica Group]
Length = 681
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 158/528 (29%), Positives = 236/528 (44%), Gaps = 137/528 (25%)
Query: 422 KKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQ 481
++++K +RK +DE SR+ RYLL +I EQNL+DAYS +GW G S EK+KPEKELQ
Sbjct: 89 RRKHKQKRKNDESDEVSRMEKRARYLLIKIKQEQNLLDAYSGDGWNGHSREKIKPEKELQ 148
Query: 482 RATSEILRRKLKIRDLFQRLDSLC--AGGFPKSLFDSEG---QIDSEDIYCAKCGSKDLS 536
RA +I++ K+ IRD+ +LD LC +G S+ +G ++ E C++C S +
Sbjct: 149 RAKKQIMKYKIAIRDVIHQLD-LCSSSGSKDDSVIPPDGCHESVNPEHTICSRCKSHESF 207
Query: 537 ADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRL 596
DN+II C+G C HQ CLEPP K +P G LC C K+ +D +N GT
Sbjct: 208 PDNNIIFCEGGCKLACHQKCLEPPFDK-ILPTTRHGRLCKHCSSKMKILDAINAHLGTSF 266
Query: 597 FITDNWEKVFPEAAAGHNQDPNFG--LASDDSDDNEYDPDGSATDEQDEGDESSSDGSSS 654
+ +F EAA N D G S+ S D +YDP E +E+SS G +
Sbjct: 267 TVKCPSSDIFKEAAEHFNSDDGLGQDWLSEYSGDEDYDP---------EENEASSSGEEN 317
Query: 655 DDSDFTSTSDEVEAPADD-KTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSD 713
+D + + +P DD ++ D+E +ESS+ G DF D
Sbjct: 318 KSADSNCSGSPLYSPNDDIPDFISADFNDAEG-------------FCRESSNLGIDFGED 364
Query: 714 SEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVY 773
G A L H +P +D
Sbjct: 365 ------------------GLAEILTHQ----------------------RPRRD------ 378
Query: 774 GKRSSERLDYKKLYDETYGNVPY---DSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTP 830
+DY +L ++ +G P + S+DE W + ++RT ST G++ S +G++
Sbjct: 379 -------VDYTQLNEQMFGE-PIGNDEQSEDEDWGLNKRKKRRTGSTGVGTN-SVEGRSD 429
Query: 831 VIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSY 890
V KS K A+ P+R+ I PA
Sbjct: 430 V----KSNKKAQ---------PRRKL--------FRIPPA-------------------- 448
Query: 891 RKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ L +F EN+ P R+ KE+L+ ELG++F ++ KWF+NTR
Sbjct: 449 ------AVEVLRKAFAENELPARSVKENLSTELGISFEKIDKWFKNTR 490
>gi|242060500|ref|XP_002451539.1| hypothetical protein SORBIDRAFT_04g003450 [Sorghum bicolor]
gi|241931370|gb|EES04515.1| hypothetical protein SORBIDRAFT_04g003450 [Sorghum bicolor]
Length = 706
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 163/594 (27%), Positives = 249/594 (41%), Gaps = 175/594 (29%)
Query: 368 RKGKRAT--KSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRN 425
RK + AT KS+KN Y L + R G S+ V + R++KKR
Sbjct: 30 RKVRNATLAKSIKNKYHCSPL-------KQRRGS------DSVAGKIVTGLTARRRKKR- 75
Query: 426 KIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATS 485
K DE +R+ RY L +I EQNL+DAYS +GW G S EK+KPEKELQRA
Sbjct: 76 ----KMQNTDEATRLERRARYFLIKIKLEQNLLDAYSGDGWNGQSREKIKPEKELQRARK 131
Query: 486 EILRRKLKIRDLFQRLDSLCAGGFPKSLFDSEGQIDS-----------EDIYCAKCGSKD 534
+I++ K+ IRD+ ++L L+ S G +D E C+ C S +
Sbjct: 132 QIIKCKIAIRDIIRQL----------CLYTSTGSVDDPAMPPDQFTNPEHTMCSTCKSHE 181
Query: 535 LSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGT 594
N I C+G C R +H+ CLEPPL K +P GWLC C CKV ++ +N GT
Sbjct: 182 SFPSNKFIFCEGPCKRAYHEKCLEPPLNKGVLPTSSHGWLCKFCLCKVKILETINAHLGT 241
Query: 595 RLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSS 654
+ ++E +F EA T++ D D
Sbjct: 242 SFTVMCSFEDIFKEA----------------------------TEQIDSED--------- 264
Query: 655 DDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDK--------VTQESSSS 706
A D+ +L SE S D++Y+PD E D ++ +S+ S
Sbjct: 265 ---------------ALDEDWL---SEYSGDEDYDPDENEDSDNCMDSGEEIMSDDSNGS 306
Query: 707 GSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQ 766
GS S ++D+ + D H+N D G ++ ++L+ +P +
Sbjct: 307 GSPLYSPNDDIPDFI------SADLNDVEGFCHANLDLGIDAGEDDLA--QILTYQRPRR 358
Query: 767 DGAAPVYGKRSSERLDYKKLYDETYGNVP--YDSSDDESWSDDGGPRKRTKSTKEGSSAS 824
D +DY++L +E +G + + S+DE W + ++RT+S G ++
Sbjct: 359 D-------------VDYRRLNEEMFGKIMGNEEQSEDEDWGHERRKKRRTRSGGAGDNSV 405
Query: 825 PDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGR 884
G + VI KS K ++ I PA
Sbjct: 406 --GFSNVISDEKSQKKGRKLFR--------------------IPPA-------------- 429
Query: 885 RHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ L +F EN+ P R KE+L++ELG++F ++ KWF+NTR
Sbjct: 430 ------------AVEVLRRAFAENELPPRDVKENLSRELGISFEKIDKWFKNTR 471
>gi|356550724|ref|XP_003543734.1| PREDICTED: pathogenesis-related homeodomain protein-like [Glycine
max]
Length = 699
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 88/181 (48%), Positives = 117/181 (64%), Gaps = 7/181 (3%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
D+ SR++ RYLL ++ EQNLIDAYS EGWKG S EK++PEKELQRA +IL+ +L I
Sbjct: 132 DDPSRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGQSREKIRPEKELQRARKQILKCRLGI 191
Query: 495 RDLFQRLDSLCAGGFPKSLFDS----EGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550
RD ++LDSL G S+ DS +G + E I C KC + DNDIILC+G C+R
Sbjct: 192 RDAIRQLDSL---GSLSSIEDSAIALDGSVCHEHILCVKCKVHEELPDNDIILCNGKCER 248
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
FHQ CL+PPL E+I P ++GW C C+CK++ ++ N GT + W+ VF E A
Sbjct: 249 AFHQKCLDPPLDTENISPGEQGWFCKFCECKMEILEATNAHLGTHFSLHSTWQDVFKEEA 308
Query: 611 A 611
A
Sbjct: 309 A 309
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 77/169 (45%), Gaps = 40/169 (23%)
Query: 774 GKRSSERLDYKKLYDETYG-NVP-YD-SSDDESWSDDGGPRKRTKSTKEGSSASPDGKTP 830
G+R + +DYKKLYDE +G + P Y+ S+DE W GP KR + KE +
Sbjct: 399 GRRQRKAVDYKKLYDEMFGKDAPAYELLSEDEDW----GPGKRKRREKESDAVD------ 448
Query: 831 VIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSY 890
+ L+E+EN P H S S + R +
Sbjct: 449 ----------SLMTLHESENR----------------HPNNEHNMTSKDSSSIKIKRHCF 482
Query: 891 RKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
R + + ++L F EN+ P R+ +E L+KELGL +V KWF+N R+
Sbjct: 483 R-IPRDAVERLRQVFAENELPPRSIREGLSKELGLDTEKVSKWFKNARY 530
>gi|413935604|gb|AFW70155.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 721
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 139/510 (27%), Positives = 216/510 (42%), Gaps = 122/510 (23%)
Query: 433 VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKL 492
+ DE +R+ RY L +I EQNL+DAYS +GW G S EK+KPEKELQRA +I++ K+
Sbjct: 93 ITDEATRLERRARYFLIKIKLEQNLLDAYSGDGWNGQSREKIKPEKELQRARKQIIKCKI 152
Query: 493 KIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRG 551
IRD+ ++LD + G L ++ + E C+ C S + N II C G C R
Sbjct: 153 AIRDIIRQLDLYTSTGSVDDPLMPTDQSTNPEHTMCSTCKSHESFPSNKIIFCKGPCKRA 212
Query: 552 FHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA 611
H+ CLEPPL K +P GWLC C CKV ++ +N GT + ++E +F E
Sbjct: 213 CHEKCLEPPLNKSVLPTSSHGWLCKFCLCKVRILETINAHLGTSFTVKCHFEDIFKETTE 272
Query: 612 GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPAD 671
+ S+D+ D ++ + S DE + DE+ + G D + + D
Sbjct: 273 L--------IDSEDALDEDWLSEYSG-DEDYDPDENEASGDCMDSGEKIMSDD------- 316
Query: 672 DKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDE 731
S+ SGS S ++D+ + D
Sbjct: 317 -------------------------------SNGSGSPLYSPNDDIPDFI------SADL 339
Query: 732 GAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETY 791
H+N D ++ ++L+ +P +D +DY++L +E +
Sbjct: 340 NVVEGFCHTNLDLGIDAVEDDFA--QILTYQRPRRD-------------VDYRRLNEEMF 384
Query: 792 GNVP--YDSSDDESWSDDGGPRKRTKSTKEGSSA-SPDGKTPVIRRRKSTKAAKEKLNET 848
G + + S+DE W G +R K T G + + G VI KS K ++
Sbjct: 385 GKITGNEEQSEDEDW----GHERRKKRTHSGVAGDNSVGFLNVISDEKSQKKGRKLFR-- 438
Query: 849 ENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKEN 908
I PA + L +F EN
Sbjct: 439 ------------------IPPA--------------------------AVEVLRRAFAEN 454
Query: 909 QYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ P R KE+L++ELG++F ++ KWF+NTR
Sbjct: 455 ELPPRDVKENLSRELGISFEKIDKWFKNTR 484
>gi|413935603|gb|AFW70154.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 527
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 140/510 (27%), Positives = 212/510 (41%), Gaps = 122/510 (23%)
Query: 433 VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKL 492
+ DE +R+ RY L +I EQNL+DAYS +GW G S EK+KPEKELQRA +I++ K+
Sbjct: 93 ITDEATRLERRARYFLIKIKLEQNLLDAYSGDGWNGQSREKIKPEKELQRARKQIIKCKI 152
Query: 493 KIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRG 551
IRD+ ++LD + G L ++ + E C+ C S + N II C G C R
Sbjct: 153 AIRDIIRQLDLYTSTGSVDDPLMPTDQSTNPEHTMCSTCKSHESFPSNKIIFCKGPCKRA 212
Query: 552 FHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA 611
H+ CLEPPL K +P GWLC C CKV ++ +N GT + ++E +F E
Sbjct: 213 CHEKCLEPPLNKSVLPTSSHGWLCKFCLCKVRILETINAHLGTSFTVKCHFEDIFKETTE 272
Query: 612 GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPAD 671
+ S+D+ D ++ + S DE + DE+ + G D + + D
Sbjct: 273 L--------IDSEDALDEDWLSEYSG-DEDYDPDENEASGDCMDSGEKIMSDD------- 316
Query: 672 DKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDE 731
S+ SGS S ++D+ + D
Sbjct: 317 -------------------------------SNGSGSPLYSPNDDIPDFI------SADL 339
Query: 732 GAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPVYGKRSSERLDYKKLYDETY 791
H+N L I D A + +R +DY++L +E +
Sbjct: 340 NVVEGFCHTNLD---------------LGIDAVEDDFAQILTYQRPRRDVDYRRLNEEMF 384
Query: 792 GNVP--YDSSDDESWSDDGGPRKRTKSTKEGSSA-SPDGKTPVIRRRKSTKAAKEKLNET 848
G + + S+DE W G +R K T G + + G VI KS K ++
Sbjct: 385 GKITGNEEQSEDEDW----GHERRKKRTHSGVAGDNSVGFLNVISDEKSQKKGRKLFR-- 438
Query: 849 ENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKEN 908
I PA + L +F EN
Sbjct: 439 ------------------IPPA--------------------------AVEVLRRAFAEN 454
Query: 909 QYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ P R KE+L++ELG++F ++ KWF+NTR
Sbjct: 455 ELPPRDVKENLSRELGISFEKIDKWFKNTR 484
>gi|303280896|ref|XP_003059740.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458395|gb|EEH55692.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 534
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 78/181 (43%), Positives = 109/181 (60%), Gaps = 10/181 (5%)
Query: 436 EYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIR 495
EY R R + L+ I +IDAY +GW+G S EKL+P +E+++A +I KL++R
Sbjct: 87 EYKRTRGKIHSQLSAIRQLNAMIDAYDMDGWRGASREKLRPAEEIRKAKIKIFNAKLRVR 146
Query: 496 DLFQRLDSLCAGGFP-KSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQ 554
+LF+ +D G ++ D +G++D+ED++C CG + + DNDI+LCDG CDRGFHQ
Sbjct: 147 ELFKHIDLAMDGARDLHAITDEDGELDAEDVFCCACGDGESTDDNDILLCDGFCDRGFHQ 206
Query: 555 YCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVN-----ELQGTRLFITDNWEKVFPEA 609
C PP+ EDIP DEGWLC CD +VDC +N EL + D VFPE
Sbjct: 207 RCCVPPVRTEDIPEGDEGWLCALCDARVDCFYTLNADFELELDAGKATFAD----VFPEE 262
Query: 610 A 610
A
Sbjct: 263 A 263
Score = 46.6 bits (109), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 75/171 (43%), Gaps = 22/171 (12%)
Query: 772 VYGKRSSERLDYKKLYDETYGNV-PYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTP 830
V GKR ++DY+KL DE +G+ ++ D+ GP S K G + +GK P
Sbjct: 368 VVGKRRRAKVDYRKLNDEMFGDGEAFEGEFDDERRGGWGP----ASPKSGMVTTGEGKRP 423
Query: 831 VIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISP--AKSHEGCSTPGSRGRRHRT 888
KA + K + + +RR +T + SP KS +TP + T
Sbjct: 424 -------KKAPRAKTPSSASKKRRRS----STSAAPTSPRSPKSRRLSNTPAG---KQST 469
Query: 889 SYRKLGEEVTQKLYNSFKENQY-PNRTTKESLAKELGLTFSQVRKWFENTR 938
+ G+E L +F + P +++ + +GLT QV+ WF N R
Sbjct: 470 KTSRFGDETRASLERTFSSGKTNPGADDCDAIGERVGLTSHQVKIWFMNRR 520
>gi|384249383|gb|EIE22865.1| hypothetical protein COCSUDRAFT_63988 [Coccomyxa subellipsoidea
C-169]
Length = 1086
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 112/195 (57%), Gaps = 6/195 (3%)
Query: 418 ERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPE 477
E +QK + R+++ V Y+++ +R L++RI E+ +DAY EGW+G + EKL+P
Sbjct: 696 EGEQKPKKAERKRERVVTPYTKLVNRVRGLISRIRVEEAYLDAYEGEGWRGANREKLRPT 755
Query: 478 KELQRATSEILRRKLKIRDLFQRLDSLCAGG---FPKSLFDSEGQIDSEDIYCAKCGSKD 534
EL++A +I + KL +R+ + + +GG P D +G+ID I+C C +
Sbjct: 756 AELEQARLQIEKCKLGMREAVKECEE--SGGDRAIPPEHIDEDGEIDEAHIFCGHCHDPE 813
Query: 535 LSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGT 594
DNDIILCDG C R +H+ CL PPL + P+DEGWLCP CD K D + ++NE G
Sbjct: 814 SYEDNDIILCDGNCHRAYHEKCLVPPLDSSTL-PEDEGWLCPACDAKADILTMINEEYGF 872
Query: 595 RLFITDNWEKVFPEA 609
W VF EA
Sbjct: 873 EYEQETPWHAVFAEA 887
>gi|308811298|ref|XP_003082957.1| homeodomain protein (ISS) [Ostreococcus tauri]
gi|116054835|emb|CAL56912.1| homeodomain protein (ISS) [Ostreococcus tauri]
Length = 682
Score = 154 bits (389), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 104/156 (66%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKI 494
+E++ R+ ++ I Q L++AY+S+GW+G + +K KP +E+++A +I KLKI
Sbjct: 2 NEFALARSRIQSQFATIRRHQALVEAYASDGWRGQAAQKPKPVREIEKAREKIFEGKLKI 61
Query: 495 RDLFQRLDSLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQ 554
R+ F+ L+ ++ D G+ D+ DI+C+KC D D+DI+LCDG CDR +HQ
Sbjct: 62 REYFKVLEFDEREREITTVADEFGECDAADIFCSKCTLADDRHDDDILLCDGFCDRAYHQ 121
Query: 555 YCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNE 590
C+ PP+L EDIPP+DEGWLCP CD +VD I ++N+
Sbjct: 122 SCVAPPVLAEDIPPEDEGWLCPRCDARVDVIYVLND 157
>gi|255087094|ref|XP_002505470.1| predicted protein [Micromonas sp. RCC299]
gi|226520740|gb|ACO66728.1| predicted protein [Micromonas sp. RCC299]
Length = 504
Score = 145 bits (367), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/166 (43%), Positives = 103/166 (62%), Gaps = 7/166 (4%)
Query: 448 LNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAG 507
L+ I EQ+ IDAY +GWKG S EKL+P E++++ +I KLK+RD+F+ +D + G
Sbjct: 81 LSTIRREQHHIDAYEMDGWKGASREKLRPADEIRKSQVKIFNCKLKVRDMFRDID-MDTG 139
Query: 508 --GFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKED 565
F + + G++D ED+ CA+C D + +NDI++CDG CDR FHQ C+ PP+ ++
Sbjct: 140 EVSFRDRIEEDTGEVDVEDVVCARCADGDATDENDILICDGYCDRAFHQRCVVPPVKADE 199
Query: 566 IPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITD-NWEKVFPEAA 610
IP DE WLCP CD +VDC +N L D +W VFP A
Sbjct: 200 IP--DE-WLCPLCDARVDCFYTLNADFDLELDAADASWRDVFPTEA 242
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 31/71 (43%), Gaps = 11/71 (15%)
Query: 868 SPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTF 927
SPAK G P +K + V L SF N +P+ S+ +GLT
Sbjct: 427 SPAKKRTGAPPP-----------KKFSDSVRAALEASFNANNFPSHHEMASIGGSIGLTD 475
Query: 928 SQVRKWFENTR 938
QV+ WF+N R
Sbjct: 476 HQVKVWFQNRR 486
>gi|145356963|ref|XP_001422692.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582935|gb|ABP01009.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 383
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 125/221 (56%), Gaps = 11/221 (4%)
Query: 440 IRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQ 499
+R+ ++ I Q+L+DAY+ +GW+G + +K KP EL A +I + K+KIR+LF+
Sbjct: 1 VRSRIQSQFAMIRRHQSLLDAYAGDGWRGAASQKPKPLGELALAREKIFKGKVKIRELFK 60
Query: 500 RLDSLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEP 559
L+ ++ D G+ D+ DI+C+KC D ++DI+LCDG CDR +HQ C+ P
Sbjct: 61 GLEFDPNEREITTVEDELGETDAADIFCSKCAMADDREEDDILLCDGFCDRAYHQSCVVP 120
Query: 560 PLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNF 619
+ EDIPPDDEGWLCP CD +VD I ++N+ L E VF A +
Sbjct: 121 AVKTEDIPPDDEGWLCPLCDARVDVIYVLNDEYEQDLGQKCVAEDVFKAEAEMQEKGIVP 180
Query: 620 GLA-----------SDDSDDNEYDPDGSATDEQDEGDESSS 649
G A SD+S+D ++D ++ D +D+ +E+ S
Sbjct: 181 GTAQFAHAHEEAWPSDESEDEDFDHGRASDDGRDDENEALS 221
>gi|302833301|ref|XP_002948214.1| hypothetical protein VOLCADRAFT_103835 [Volvox carteri f.
nagariensis]
gi|300266434|gb|EFJ50621.1| hypothetical protein VOLCADRAFT_103835 [Volvox carteri f.
nagariensis]
Length = 3230
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 104/175 (59%), Gaps = 2/175 (1%)
Query: 437 YSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRD 496
Y ++ ++ + I E++ + Y++EGW+G + +K++ E+QR+ + RR+ IR+
Sbjct: 2045 YRQLLRRVKRQVQHIKQEEHALAVYAAEGWRGGARKKVQLTYEMQRSRVALERRRAAIRE 2104
Query: 497 LFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQY 555
D+ P LFD EG+++ I+CA C S +++ D+D+ILCDG C+ FHQ
Sbjct: 2105 ALALCDAPPGLRSIPAELFDEEGELEEHHIFCAVCYSYEMADDDDVILCDGPCNCAFHQN 2164
Query: 556 CLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAA 610
CL+PP+ + P+DEGWLCP CDCK D +D++ E G I + W + P +A
Sbjct: 2165 CLDPPVDVSKL-PEDEGWLCPACDCKADILDILYEEFGIEYDINEPWMNILPPSA 2218
>gi|302398843|gb|ADL36716.1| HD domain class transcription factor [Malus x domestica]
Length = 404
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 113/202 (55%), Gaps = 12/202 (5%)
Query: 479 ELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSA 537
ELQRA +IL KL IRD +LDSL + G S +G + E I+CAKC +
Sbjct: 2 ELQRANKQILNCKLGIRDAIHQLDSLSSVGSIADSFISPDGSVSHEHIFCAKCKLNEAFP 61
Query: 538 DNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLF 597
DNDIILCDG C+ FHQ CL+PPL E+IP ++GW C C+CK++ ++LVN GT
Sbjct: 62 DNDIILCDGTCNCAFHQKCLDPPLDTENIPRGEQGWFCKFCECKMEILELVNAHLGTCFP 121
Query: 598 ITDNWEKVFPEAAA---GHNQ--DPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGS 652
+ W+ VF E A G N +P+ SDDS+D++Y+P E++E S S G
Sbjct: 122 MNSGWQDVFKEEATFPHGKNSLLNPDEEWPSDDSEDDDYNP------ERNENSCSISRGG 175
Query: 653 SSDDSDFTSTSDEVEAPADDKT 674
S D + S +V +D T
Sbjct: 176 SDDIASEEELSTDVSVGSDVST 197
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 76/171 (44%), Gaps = 39/171 (22%)
Query: 772 VYGKRSSERLDYKKLYDETYGN---VPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGK 828
V +R +DYKKLYDE +G + SDDE W GP KR + E +AS
Sbjct: 202 VSSRRQKRSVDYKKLYDEMFGKDGPLLEQISDDEDW----GPGKRKRRGNESDAAS---- 253
Query: 829 TPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRT 888
L E+E P D + S KS + S R R+
Sbjct: 254 ------------TLMTLYESERNP-----------DVDRSEKKSSQHSSDTQVR----RS 286
Query: 889 SYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRW 939
+R + +KL F EN+ P+R K++L+KELGL +V KWF+N R+
Sbjct: 287 CFR-IPRNAVEKLRQVFSENELPSRAVKDNLSKELGLNPEKVSKWFKNARY 336
>gi|412985441|emb|CCO18887.1| predicted protein [Bathycoccus prasinos]
Length = 464
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 126/234 (53%), Gaps = 19/234 (8%)
Query: 434 ADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLK 493
A+ + RT + L+ + Q +D Y+ +GW+ + EKLKP EL + +I RKLK
Sbjct: 60 AEGVQKARTQICVQLSALRRSQAYLDTYTLDGWRSSNQEKLKPIAELAKKREDIFFRKLK 119
Query: 494 IRDLFQRL--DSLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSAD-NDIILCDGACDR 550
+R+LF++L D F K D +G+I+ ED+ C +CGS + + + ND++ CDG CD
Sbjct: 120 VRELFEKLNYDEKKLEKF-KCKEDEDGEIECEDVVCCECGSGECNEEENDVVFCDGYCDL 178
Query: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFI------------ 598
+H C++PPL EDIP DEGWLCP CDC+VD I +N RL I
Sbjct: 179 AYHMKCVKPPLKPEDIPKGDEGWLCPLCDCRVDVIYYLNLDYDQRLDIETCTHLDVFKKE 238
Query: 599 TDNWEK-VFPEAAAGHNQDPNF--GLASDDSDDNEYDPDGSATDEQDEGDESSS 649
D ++K + P + H N SD+S+D ++ D +D+ DES S
Sbjct: 239 QDMFDKGIIPGTSRFHLHGENEEDVWPSDESEDEDFKEKNEKDDGKDDSDESLS 292
>gi|298706045|emb|CBJ29155.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 646
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 102/190 (53%), Gaps = 13/190 (6%)
Query: 437 YSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRD 496
+ RIR + LL+++ + + + Y +GWK EKLKP EL++A I + ++R+
Sbjct: 21 WDRIRAKFKLLLHKMRRDTHFLQVYQQDGWKSSGREKLKPTAELEKARERIRGIQAELRE 80
Query: 497 LFQRLDSLCAGGFPKSLF--DSEGQIDSEDIYCAKCGSKDLSADNDIILCDGA-CDRGFH 553
+ +RL + GG D EG I+ ++I C KCG D +DI+LCD A C R +H
Sbjct: 81 VTRRLTEMNPGGLRWGAIQEDGEGNIEVDEIMCTKCGQGDSDDHDDILLCDYAGCFRAYH 140
Query: 554 QYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLV-NELQGTRLFITDNWEKVFP----- 607
Q CL PP+ E P ++E W C C+C DC +++ NE QG + +W+ VFP
Sbjct: 141 QNCLSPPIKPEVFPEEEEDWFCWQCECLTDCFEMLENEFQGEKF---TSWKDVFPVVETG 197
Query: 608 -EAAAGHNQD 616
E A QD
Sbjct: 198 AEGGAAGQQD 207
>gi|428184321|gb|EKX53176.1| hypothetical protein GUITHDRAFT_132940 [Guillardia theta CCMP2712]
Length = 371
Score = 128 bits (322), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 101/177 (57%), Gaps = 13/177 (7%)
Query: 439 RIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLF 498
RIR ++ L Y Q ++DAY +GW S KLKP +EL++++ ++ K +I++L
Sbjct: 134 RIRNRIKTRLTNARYFQTMLDAYEQDGWGPSSHRKLKPTEELKKSSESLIHAKKEIKELL 193
Query: 499 QRLDSLCAGGFPKSL------FDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGA-CDRG 551
+++D K +++EG + E I C+ CG+ D NDI+LCD + C+R
Sbjct: 194 KQMDPASNELAQKETRIDEREYEAEG-LAFEKIVCSICGTGDARRGNDILLCDYSECNRA 252
Query: 552 FHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE 608
FHQ C P + ++P DDE W C C C DCI+ VNEL GT + T WEK+FPE
Sbjct: 253 FHQKCHHPQVT--ELPHDDEDWFCTHCLCYTDCIEAVNELFGTH-YTT--WEKMFPE 304
>gi|159485012|ref|XP_001700543.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272183|gb|EDO97987.1| predicted protein [Chlamydomonas reinhardtii]
Length = 738
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 87/141 (61%), Gaps = 10/141 (7%)
Query: 455 QNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAG-----GF 509
++ + +EGW+G + +K++ ELQR+ + +R+ IR+ SLC
Sbjct: 512 EHALAVLEAEGWRGGARKKVQLTYELQRSREALEKRRAAIREAV----SLCDAPPGLKAI 567
Query: 510 PKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPD 569
P LFD EG+++ I+CA C S +++ D+D++LCDG C+ FHQ CL+PP+ + P+
Sbjct: 568 PAELFDEEGELEEHHIFCAVCYSYEMADDDDVVLCDGPCNCAFHQNCLDPPVDVTKL-PE 626
Query: 570 DEGWLCPGCDCKVDCIDLVNE 590
DEGWLCP CDCK D +DL+ E
Sbjct: 627 DEGWLCPACDCKADILDLLYE 647
>gi|301122605|ref|XP_002909029.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099791|gb|EEY57843.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 540
Score = 119 bits (299), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 94/180 (52%), Gaps = 15/180 (8%)
Query: 433 VADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKL 492
V E+ R+ ++ +R+ Y IDAY EGW S++KLKP KEL +I + K
Sbjct: 77 VRPEHKRLENKIKSQAHRLRYPLAFIDAYEGEGWNKSSLDKLKPSKELDAERRKIEKGKR 136
Query: 493 KIRDLFQRLDSLCAG---GFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGA-C 548
+ D Q L +L A P ++F ED++C++CGS D+ DNDI+LCD C
Sbjct: 137 ALIDGLQELTALYANEPQVPPMAVF--------EDVHCSRCGSTDVELDNDILLCDSVGC 188
Query: 549 DRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE 608
R +HQ C P +L IP +E W C C +C+ +N + GT +N + +FPE
Sbjct: 189 HRAYHQKCQTPVVLTAKIPAGEEPWFCEVCLAVFECLKSINSVFGT---TYENVDDLFPE 245
>gi|413949845|gb|AFW82494.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 442
Score = 119 bits (298), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 161/324 (49%), Gaps = 46/324 (14%)
Query: 659 FTSTSDEVEAPADDKTY-LGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDL 717
F + SD+ E D K L L SEDSEDD+Y+P P+ D V ++SSS SDFTSDS+D
Sbjct: 74 FLTCSDDSEPLIDKKVDDLRLPSEDSEDDDYDPAGPDSDKDVEKKSSSDESDFTSDSDDF 133
Query: 718 AAVLEDNRSSGNDEGAASPL--GHSNGQRYK---DGGNNESLNNELLSIIKPGQDGAAPV 772
+ SG+DE +SPL G K S ++ + + I G P
Sbjct: 134 CKEIS---KSGHDE-VSSPLLPDAKVGDMEKITAQAKTTSSADDPMETEIDQGV--VLPD 187
Query: 773 YGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVI 832
+R +ERLDYKKLYDE YG DSSDDE WS P KS +EG + SP GK +
Sbjct: 188 SRRRQAERLDYKKLYDEAYGEASSDSSDDEEWSGKNTPI--IKSNEEGEANSPAGKGSRV 245
Query: 833 RRRK---STKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTS 889
+T++ K+ L+ + + L + SN + K H
Sbjct: 246 VHHNDELTTQSTKKSLHSIHGSVDEKPG-DLTSNGSNSTARKGH---------------- 288
Query: 890 YRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNA 949
G + QKL+ FK YP+R+ KESLA+ELGLTF QV KWFE R S A
Sbjct: 289 ---FGPVINQKLHEHFKTQPYPSRSVKESLAEELGLTFRQVNKWFETRRHS--------A 337
Query: 950 KLANSEKG-TCTPQSNKNTVGRVS 972
++A+S KG + S +NT +V+
Sbjct: 338 RVASSRKGISLDKHSPQNTNSQVT 361
>gi|224029065|gb|ACN33608.1| unknown [Zea mays]
Length = 268
Score = 106 bits (264), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 76/202 (37%), Positives = 101/202 (50%), Gaps = 39/202 (19%)
Query: 775 KRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRR 834
+R +ERLDYKKLYDE YG DSSDDE WS P KS +EG + SP GK +
Sbjct: 16 RRQAERLDYKKLYDEAYGEASSDSSDDEEWSGKNTPI--IKSNEEGEANSPAGKGSRVVH 73
Query: 835 RK---STKAAKEKLNETENTPKRRGRP-KLNTEDSNISPAKSHEGCSTPGSRGRRHRTSY 890
+T++ K+ L+ + + P L + SN + K H
Sbjct: 74 HNDELTTQSTKKSLHSIHGSVDEK--PGDLTSNGSNSTARKGH----------------- 114
Query: 891 RKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAK 950
G + QKL+ FK YP+R+ KESLA+ELGLTF QV KWFE R S A+
Sbjct: 115 --FGPVINQKLHEHFKTQPYPSRSVKESLAEELGLTFRQVNKWFETRRHS--------AR 164
Query: 951 LANSEKGTC----TPQSNKNTV 968
+A+S KG +PQ+ + V
Sbjct: 165 VASSRKGISLDKHSPQNTNSQV 186
>gi|325191695|emb|CCA25730.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 819
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 136/338 (40%), Gaps = 63/338 (18%)
Query: 373 ATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLAD---------------VNSIG 417
AT +K N ++R ++GS R R +P + LA+ V+S
Sbjct: 153 ATSDMKENKSIRKVVGSRR--NKRETIKPKQLNKATKLAEGKANLRRAEKKKQDTVSSTK 210
Query: 418 ERKQKKRNKIRRKKIVADE--YSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLK 475
K+ KR + +KI + I +R + ++ Y + + AY SEGW +VE+LK
Sbjct: 211 STKKGKRGPKKHEKISTSPPPHKLILNKVRAQVRQLTYHLHFVQAYESEGWNRKNVERLK 270
Query: 476 PEKELQRATSEILRRKLKIRDLFQRLDSL---------CAGGFPKSL----FDSEGQIDS 522
P + L A ++L K I L L L A F L D + +
Sbjct: 271 PYESLLEAKQKVLTGKRSIVQLLGALQDLHKHDRQFDYLAPKFSHLLKSSSIDVQAPAHA 330
Query: 523 ED-----IYCAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
E+ IYC++C S + +NDII+CD C+R +HQ C PP+ DIP W C
Sbjct: 331 ENLQQYVIYCSRCNSTNTIPENDIIICDSEGCNRAYHQKCQNPPVATSDIPLGAALWYCE 390
Query: 577 GCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPN------------------ 618
C+ C+ +N T D +VFPE G D
Sbjct: 391 ICEALFKCLKCINAAFETAYETVD---EVFPELK-GEEVDARDKPSTIPQKSKTACSWLD 446
Query: 619 -FGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSD 655
+D DD D D T + D+GD S ++SD
Sbjct: 447 VLRGTNDSRDDESADED--FTPKSDKGDHSGGSVTNSD 482
>gi|413935606|gb|AFW70157.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 535
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/415 (24%), Positives = 154/415 (37%), Gaps = 121/415 (29%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C+ C S + N II C G C R H+ CLEPPL K +P GWLC C CKV ++
Sbjct: 2 CSTCKSHESFPSNKIIFCKGPCKRACHEKCLEPPLNKSVLPTSSHGWLCKFCLCKVRILE 61
Query: 587 LVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDE 646
+N GT + ++E +F E + S+D+ D ++ + S DE + DE
Sbjct: 62 TINAHLGTSFTVKCHFEDIFKETTEL--------IDSEDALDEDWLSEYSG-DEDYDPDE 112
Query: 647 SSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSS 706
+ + G D + + D S+ S
Sbjct: 113 NEASGDCMDSGEKIMSDD--------------------------------------SNGS 134
Query: 707 GSDFTSDSEDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQ 766
GS S ++D+ + D H+N L I
Sbjct: 135 GSPLYSPNDDIPDFI------SADLNVVEGFCHTNLD---------------LGIDAVED 173
Query: 767 DGAAPVYGKRSSERLDYKKLYDETYGNVP--YDSSDDESWSDDGGPRKRTKSTKEGSSA- 823
D A + +R +DY++L +E +G + + S+DE W G +R K T G +
Sbjct: 174 DFAQILTYQRPRRDVDYRRLNEEMFGKITGNEEQSEDEDW----GHERRKKRTHSGVAGD 229
Query: 824 SPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRG 883
+ G VI KS K ++ I PA
Sbjct: 230 NSVGFLNVISDEKSQKKGRKLFR--------------------IPPA------------- 256
Query: 884 RRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+ L +F EN+ P R KE+L++ELG++F ++ KWF+NTR
Sbjct: 257 -------------AVEVLRRAFAENELPPRDVKENLSRELGISFEKIDKWFKNTR 298
>gi|307102157|gb|EFN50542.1| hypothetical protein CHLNCDRAFT_137229 [Chlorella variabilis]
Length = 231
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/74 (52%), Positives = 53/74 (71%), Gaps = 1/74 (1%)
Query: 509 FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPP 568
P LFDS+G++D + I+C+KC + +NDIILCDG C+R +H CL PP+ E++ P
Sbjct: 24 IPSELFDSDGELDLDHIFCSKCRGNESDEENDIILCDGMCNRAYHVRCLVPPVNPEEL-P 82
Query: 569 DDEGWLCPGCDCKV 582
+DEGWLCP CD KV
Sbjct: 83 EDEGWLCPACDRKV 96
>gi|242088511|ref|XP_002440088.1| hypothetical protein SORBIDRAFT_09g025810 [Sorghum bicolor]
gi|241945373|gb|EES18518.1| hypothetical protein SORBIDRAFT_09g025810 [Sorghum bicolor]
Length = 477
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 34/45 (75%), Positives = 39/45 (86%)
Query: 570 DEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHN 614
DEGWLCP CDCK+DCIDL+N+LQG+ L I D+WEKVFPEAAA N
Sbjct: 3 DEGWLCPACDCKIDCIDLLNDLQGSDLSIEDSWEKVFPEAAAMAN 47
>gi|224105643|ref|XP_002313885.1| predicted protein [Populus trichocarpa]
gi|222850293|gb|EEE87840.1| predicted protein [Populus trichocarpa]
Length = 88
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 38/65 (58%), Positives = 45/65 (69%), Gaps = 2/65 (3%)
Query: 900 KLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKLANSE--KG 957
+LY+ F+ENQYP+R K SLA+ELG+TF QV KWF N RWSFNH SS A S KG
Sbjct: 1 RLYSYFRENQYPDRAAKASLAEELGITFEQVNKWFVNARWSFNHSSSTGTSKAESASGKG 60
Query: 958 TCTPQ 962
+C Q
Sbjct: 61 SCDGQ 65
>gi|357505081|ref|XP_003622829.1| Cell division cycle protein-like protein [Medicago truncatula]
gi|355497844|gb|AES79047.1| Cell division cycle protein-like protein [Medicago truncatula]
Length = 899
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/107 (42%), Positives = 61/107 (57%), Gaps = 13/107 (12%)
Query: 381 YTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIVADEYSRI 440
Y +R L SDR LRSR +P PPE +N++ E ++K+ K R + I A ++SRI
Sbjct: 672 YILRLLGSSDRALRSRD-NKPKPPEP------INNVAETIERKKKKTRNEGINA-QFSRI 723
Query: 441 RTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEI 487
R LRY LNRI+YEQ+LIDAY L V E EL A ++
Sbjct: 724 RAQLRYYLNRISYEQSLIDAYCV-----LLVNCWTTEMELHLAFYKL 765
>gi|224007168|ref|XP_002292544.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972186|gb|EED90519.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1660
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 47/98 (47%), Gaps = 8/98 (8%)
Query: 520 IDSEDIYCAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
I +++ C C S +ND++LCDG C R FH C+EP L ED+ +DE W CP C
Sbjct: 1269 ISLDNLRCVCCFKGTASNENDLLLCDGMGCYRAFHMCCVEPKLTLEDVEDEDESWFCPLC 1328
Query: 579 DCKVDCIDLVN-ELQGTRLFIT---DNWE---KVFPEA 609
+ E G I + WE VFPEA
Sbjct: 1329 TAHATLVHHAQKESLGDEFHINPPPEEWEVATDVFPEA 1366
>gi|242006500|ref|XP_002424088.1| arginine/serine-rich protein, putative [Pediculus humanus corporis]
gi|212507394|gb|EEB11350.1| arginine/serine-rich protein, putative [Pediculus humanus corporis]
Length = 2376
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 36/58 (62%), Gaps = 6/58 (10%)
Query: 524 DIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCK 581
D C CGS D ++ ++LCD CD+GFH CL PPL EDI PDD W CP CD +
Sbjct: 152 DTVCEICGSGD--REDILLLCDN-CDKGFHTTCLTPPL--EDI-PDDNEWFCPDCDFR 203
>gi|219115495|ref|XP_002178543.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410278|gb|EEC50208.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 866
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 49/96 (51%), Gaps = 6/96 (6%)
Query: 510 PKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPP 568
P + D + I +++ C C + + +ND+I+CDG C R +H CL+P + E++
Sbjct: 401 PIAKHDGDSGISLDNLRCCVCHQSEATDENDMIMCDGCGCYRAYHMRCLQPHVKPEEVEN 460
Query: 569 DDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEK 604
+++ W CP C D + L+ + D WE+
Sbjct: 461 EEDDWFCPLCSTLADMMLLIQTNH-----MGDEWEQ 491
>gi|397577372|gb|EJK50559.1| hypothetical protein THAOC_30416 [Thalassiosira oceanica]
Length = 881
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 47/107 (43%), Gaps = 16/107 (14%)
Query: 515 DSEGQIDSEDIYCAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPP--DDE 571
DS I +++ C C S ND++LCDG C R FH CL P + E++ DD+
Sbjct: 550 DSSSGISLDNLRCHSCMGNHASESNDMLLCDGMGCHRAFHMECLHPKVTPEEVAKSGDDD 609
Query: 572 GWLCPGCDCKVDCIDLVNELQGTRLF------ITDNWEK---VFPEA 609
W CP C I + F D+WE VFPEA
Sbjct: 610 DWFCPLCTAHATLIHYAQ----SEYFGHDESQDVDDWENALDVFPEA 652
>gi|412986027|emb|CCO17227.1| unnamed protein product [Bathycoccus prasinos]
Length = 1015
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 37/61 (60%), Gaps = 5/61 (8%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKV 582
++I C C SKD D ++LCDG CD GFH +CL+PP+ K IP D+ W C C V
Sbjct: 888 DEIPCKVCKSKD--DDEKMLLCDG-CDCGFHIFCLKPPMKK--IPEGDDDWFCKPCKAGV 942
Query: 583 D 583
+
Sbjct: 943 E 943
>gi|427792405|gb|JAA61654.1| Putative remodeling and spacing factor 1, partial [Rhipicephalus
pulchellus]
Length = 2760
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 38/66 (57%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
CAKCG D I+LCD CD G+H CL+P L+ I PD + W CP CD + C
Sbjct: 1859 CAKCGKGDHP--EWILLCD-VCDAGYHTSCLKPALM---IIPDGD-WFCPPCDHRKLCEK 1911
Query: 587 LVNELQ 592
L+ EL+
Sbjct: 1912 LMEELK 1917
>gi|270002669|gb|EEZ99116.1| hypothetical protein TcasGA2_TC005009 [Tribolium castaneum]
Length = 1732
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 40/75 (53%), Gaps = 8/75 (10%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
+C CGS D + ++LCDG CD GFH CL PPL DIPP W C C + D +
Sbjct: 221 FCEICGSSD--REERMLLCDG-CDLGFHMECLTPPLA--DIPPG--AWFCNDCTPE-DLV 272
Query: 586 DLVNELQGTRLFITD 600
D EL +L + D
Sbjct: 273 DAEIELYELQLLLDD 287
>gi|255079372|ref|XP_002503266.1| JmjN/JmjC protein [Micromonas sp. RCC299]
gi|226518532|gb|ACO64524.1| JmjN/JmjC protein [Micromonas sp. RCC299]
Length = 2663
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 36/60 (60%), Gaps = 7/60 (11%)
Query: 519 QIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
Q++ + C CG S + +ILCDG CDRG+H YCL PPL +++P D W CP C
Sbjct: 249 QLEEQPAGCVNCGGT--SHEESMILCDG-CDRGYHMYCLSPPL--DELPQGD--WFCPDC 301
>gi|301102963|ref|XP_002900568.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101831|gb|EEY59883.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 3127
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 48/89 (53%), Gaps = 10/89 (11%)
Query: 493 KIRDLFQRLDSLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGF 552
++R +F RL P + + SE++ C +CG ++A ++LCD +CD +
Sbjct: 3045 RLRSIFNRLFEQWVTNVPPN--TPVTHLASEEL-CRQCG--QMNAQESMLLCD-SCDAAY 3098
Query: 553 HQYCLEPPLLKEDIPPDDEGWLCPGCDCK 581
H +CL+PPL IPPD+ W CP C K
Sbjct: 3099 HAFCLQPPL--SSIPPDN--WFCPRCPVK 3123
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 7/60 (11%)
Query: 519 QIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
I +E+ C C + + A ++LCD CD +H +CLE PLL +IP + W CP C
Sbjct: 941 MIAAEEDNCRNC--QTIHAKGRLLLCD-RCDAPYHTFCLESPLL--EIPKSE--WFCPTC 993
>gi|91093064|ref|XP_968013.1| PREDICTED: similar to CG2926 CG2926-PA [Tribolium castaneum]
Length = 1599
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 40/75 (53%), Gaps = 8/75 (10%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
+C CGS D + ++LCDG CD GFH CL PPL DIPP W C C + D +
Sbjct: 190 FCEICGSSD--REERMLLCDG-CDLGFHMECLTPPLA--DIPPG--AWFCNDCTPE-DLV 241
Query: 586 DLVNELQGTRLFITD 600
D EL +L + D
Sbjct: 242 DAEIELYELQLLLDD 256
>gi|413936027|gb|AFW70578.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 793
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 37/58 (63%)
Query: 893 LGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAK 950
G VTQKL F+++ YP TK+ LA+ELGLTF+Q+ KWF TR+ +KN K
Sbjct: 309 FGPIVTQKLKEHFEKDPYPCHATKDGLAQELGLTFNQISKWFSATRYYSRVAVAKNQK 366
>gi|300122638|emb|CBK23206.2| unnamed protein product [Blastocystis hominis]
Length = 258
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 33/53 (62%), Gaps = 7/53 (13%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+C CG D + ++LCDG CD+GFH +CL PPL + IP D W CPGC
Sbjct: 64 FCEVCGRSD--KEESLLLCDG-CDKGFHLFCLNPPL--KQIPSGD--WYCPGC 109
>gi|145207283|gb|AAH46124.2| RSF1 protein [Homo sapiens]
Length = 805
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 695 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 747
Query: 587 LVNELQ 592
L +LQ
Sbjct: 748 LEEQLQ 753
>gi|10803733|gb|AAF61709.2|AF227948_1 HBV pX associated protein-8 [Homo sapiens]
Length = 1189
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 642 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 694
Query: 587 LVNELQ 592
L +LQ
Sbjct: 695 LEEQLQ 700
>gi|449456407|ref|XP_004145941.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Cucumis
sativus]
gi|449526858|ref|XP_004170430.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Cucumis
sativus]
Length = 342
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 36/59 (61%), Gaps = 9/59 (15%)
Query: 521 DSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEG-WLCPGC 578
D +DI+C KCGS D A D++LCD CDRG+H +CL P L P +G W CP C
Sbjct: 24 DCDDIFCQKCGSGDSPA--DLLLCD-KCDRGYHLFCLTPIL-----PSVPKGTWFCPTC 74
>gi|426369892|ref|XP_004051915.1| PREDICTED: remodeling and spacing factor 1 [Gorilla gorilla
gorilla]
Length = 1189
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 642 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 694
Query: 587 LVNELQ 592
L +LQ
Sbjct: 695 LEEQLQ 700
>gi|449547717|gb|EMD38685.1| hypothetical protein CERSUDRAFT_113863 [Ceriporiopsis subvermispora
B]
Length = 906
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 30/53 (56%), Gaps = 6/53 (11%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+C+ C S ++ CDG C R FH +CL+PP+ D+P DE W CP C
Sbjct: 217 HCSSCRSL-----GSLVYCDG-CPRAFHLWCLDPPMAASDLPEGDERWYCPAC 263
>gi|119595439|gb|EAW75033.1| hepatitis B virus x associated protein, isoform CRA_a [Homo
sapiens]
Length = 1008
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 645 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 697
Query: 587 LVNELQ 592
L +LQ
Sbjct: 698 LEEQLQ 703
>gi|395814781|ref|XP_003780919.1| PREDICTED: remodeling and spacing factor 1 [Otolemur garnettii]
Length = 1186
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 639 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 691
Query: 587 LVNELQ 592
L +LQ
Sbjct: 692 LEEQLQ 697
>gi|449484819|ref|XP_002189691.2| PREDICTED: remodeling and spacing factor 1 [Taeniopygia guttata]
Length = 1351
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 38/72 (52%), Gaps = 7/72 (9%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 803 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 855
Query: 587 LVNELQGTRLFI 598
L +LQ + +
Sbjct: 856 LEEQLQDLDVVL 867
>gi|349602979|gb|AEP98951.1| Remodeling and spacing factor 1-like protein, partial [Equus
caballus]
Length = 827
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 653 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 705
Query: 587 LVNELQ 592
L +LQ
Sbjct: 706 LEEQLQ 711
>gi|301614223|ref|XP_002936596.1| PREDICTED: hypothetical protein LOC100485119 [Xenopus (Silurana)
tropicalis]
Length = 1043
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 7/72 (9%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ + PD E W CP C K+ C
Sbjct: 825 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---LIPDGE-WFCPPCQHKLLCEK 877
Query: 587 LVNELQGTRLFI 598
L +LQ + +
Sbjct: 878 LDEQLQNLDVVL 889
>gi|225431531|ref|XP_002281774.1| PREDICTED: histone-lysine N-methyltransferase ATXR5 [Vitis
vinifera]
gi|296088576|emb|CBI37567.3| unnamed protein product [Vitis vinifera]
Length = 374
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/58 (46%), Positives = 36/58 (62%), Gaps = 7/58 (12%)
Query: 521 DSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
D D+ C +CGS D D++++LCD CDRGFH CL P +++ IP WLCP C
Sbjct: 56 DYSDVRCVQCGSGD--HDDELLLCD-KCDRGFHMSCLRPIVVR--IPIGT--WLCPSC 106
>gi|426252240|ref|XP_004019822.1| PREDICTED: remodeling and spacing factor 1 [Ovis aries]
Length = 1458
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 913 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 965
Query: 587 LVNELQ 592
L +LQ
Sbjct: 966 LEEQLQ 971
>gi|393220180|gb|EJD05666.1| hypothetical protein FOMMEDRAFT_145119 [Fomitiporia mediterranea
MF3/22]
Length = 736
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 6/53 (11%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+C+ C S+ ++ CDG C R FH +CL+PP+ D P +E W CPGC
Sbjct: 209 HCSSC-----SSVGALVYCDG-CPRAFHLWCLDPPMDPSDFPDGEESWYCPGC 255
>gi|431838462|gb|ELK00394.1| Remodeling and spacing factor 1 [Pteropus alecto]
Length = 1401
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 855 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 907
Query: 587 LVNELQ 592
L +LQ
Sbjct: 908 LEEQLQ 913
>gi|410352579|gb|JAA42893.1| remodeling and spacing factor 1 [Pan troglodytes]
Length = 1440
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|296216988|ref|XP_002754838.1| PREDICTED: remodeling and spacing factor 1 [Callithrix jacchus]
Length = 1438
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 893 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 945
Query: 587 LVNELQ 592
L +LQ
Sbjct: 946 LEEQLQ 951
>gi|334327753|ref|XP_001377733.2| PREDICTED: remodeling and spacing factor 1-like [Monodelphis
domestica]
Length = 1659
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 1115 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 1167
Query: 587 LVNELQ 592
L +LQ
Sbjct: 1168 LEEQLQ 1173
>gi|410352577|gb|JAA42892.1| remodeling and spacing factor 1 [Pan troglodytes]
Length = 1437
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 891 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 943
Query: 587 LVNELQ 592
L +LQ
Sbjct: 944 LEEQLQ 949
>gi|14211816|gb|AAK57515.1|AF380176_1 HBV pX associated protein 8 large isoform [Homo sapiens]
Length = 1431
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 884 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 936
Query: 587 LVNELQ 592
L +LQ
Sbjct: 937 LEEQLQ 942
>gi|332211223|ref|XP_003254719.1| PREDICTED: remodeling and spacing factor 1 [Nomascus leucogenys]
Length = 1440
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|355566895|gb|EHH23274.1| hypothetical protein EGK_06710 [Macaca mulatta]
Length = 1441
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|109108037|ref|XP_001091378.1| PREDICTED: remodeling and spacing factor 1 isoform 2 [Macaca
mulatta]
Length = 1441
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|402897456|ref|XP_003919743.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase UHRF2
[Papio anubis]
Length = 767
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 85/191 (44%), Gaps = 33/191 (17%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C CG K N +LCD C+ +H YCL PPL D P++E W CP CK D +
Sbjct: 347 CRVCGGK--HEPNMQLLCD-ECNVAYHIYCLNPPL---DKVPEEEYWYCP--SCKTDSSE 398
Query: 587 LVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSD-------DNEYDPD----- 634
+V G RL ++ K+ P A+ +D G+A N Y P
Sbjct: 399 VVK--AGERLKMSKKKAKM-PSASTESRRDWGRGMACVGRTRECTIVPSNHYGPIPGIPV 455
Query: 635 GSAT--DEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDA 692
GS QD GDE + GS + + + + AP+ D+T ++ + N DA
Sbjct: 456 GSTWRFRVQDRGDEFTYTGSGGKN---LAGNKRIGAPSADQTLTNMNRALA----LNCDA 508
Query: 693 PELDDKVTQES 703
P LDDK+ ES
Sbjct: 509 P-LDDKIGAES 518
>gi|397473367|ref|XP_003808185.1| PREDICTED: remodeling and spacing factor 1 [Pan paniscus]
Length = 1440
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|119595440|gb|EAW75034.1| hepatitis B virus x associated protein, isoform CRA_b [Homo
sapiens]
Length = 1440
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 893 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 945
Query: 587 LVNELQ 592
L +LQ
Sbjct: 946 LEEQLQ 951
>gi|38788333|ref|NP_057662.3| remodeling and spacing factor 1 [Homo sapiens]
gi|251757329|sp|Q96T23.2|RSF1_HUMAN RecName: Full=Remodeling and spacing factor 1; Short=Rsf-1;
AltName: Full=HBV pX-associated protein 8; AltName:
Full=Hepatitis B virus X-associated protein; AltName:
Full=p325 subunit of RSF chromatin-remodeling complex
gi|151554975|gb|AAI48327.1| Remodeling and spacing factor 1 [synthetic construct]
gi|157169674|gb|AAI53211.1| Remodeling and spacing factor 1 [synthetic construct]
gi|261858180|dbj|BAI45612.1| remodeling and spacing factor 1 [synthetic construct]
Length = 1441
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|432090050|gb|ELK23650.1| Remodeling and spacing factor 1 [Myotis davidii]
Length = 1345
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 804 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 856
Query: 587 LVNELQ 592
L +LQ
Sbjct: 857 LEEQLQ 862
>gi|403287731|ref|XP_003935087.1| PREDICTED: remodeling and spacing factor 1 [Saimiri boliviensis
boliviensis]
Length = 1439
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|149719282|ref|XP_001493263.1| PREDICTED: remodeling and spacing factor 1 [Equus caballus]
Length = 1436
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 891 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 943
Query: 587 LVNELQ 592
L +LQ
Sbjct: 944 LEEQLQ 949
>gi|332837299|ref|XP_001175061.2| PREDICTED: remodeling and spacing factor 1 isoform 2 [Pan
troglodytes]
gi|410297798|gb|JAA27499.1| remodeling and spacing factor 1 [Pan troglodytes]
Length = 1440
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|380796877|gb|AFE70314.1| remodeling and spacing factor 1, partial [Macaca mulatta]
Length = 1437
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 890 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 942
Query: 587 LVNELQ 592
L +LQ
Sbjct: 943 LEEQLQ 948
>gi|355717434|gb|AES05931.1| remodeling and spacing factor 1 [Mustela putorius furo]
Length = 1419
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 876 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 928
Query: 587 LVNELQ 592
L +LQ
Sbjct: 929 LEEQLQ 934
>gi|119595441|gb|EAW75035.1| hepatitis B virus x associated protein, isoform CRA_c [Homo
sapiens]
Length = 1428
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 881 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 933
Query: 587 LVNELQ 592
L +LQ
Sbjct: 934 LEEQLQ 939
>gi|417413842|gb|JAA53231.1| Putative microtubule-associated protein futsch, partial [Desmodus
rotundus]
Length = 1433
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 889 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 941
Query: 587 LVNELQ 592
L +LQ
Sbjct: 942 LEEQLQ 947
>gi|417413831|gb|JAA53227.1| Putative microtubule-associated protein futsch, partial [Desmodus
rotundus]
Length = 1421
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 889 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 941
Query: 587 LVNELQ 592
L +LQ
Sbjct: 942 LEEQLQ 947
>gi|344244350|gb|EGW00454.1| Remodeling and spacing factor 1 [Cricetulus griseus]
Length = 1308
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 818 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 870
Query: 587 LVNELQ 592
L +LQ
Sbjct: 871 LEEQLQ 876
>gi|297689801|ref|XP_002822324.1| PREDICTED: remodeling and spacing factor 1 [Pongo abelii]
Length = 1441
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 894 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 946
Query: 587 LVNELQ 592
L +LQ
Sbjct: 947 LEEQLQ 952
>gi|218664475|ref|NP_001136306.1| remodeling and spacing factor 1 [Sus scrofa]
gi|213688843|gb|ACJ53937.1| remodeling and spacing factor 1 [Sus scrofa]
Length = 1431
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 885 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 937
Query: 587 LVNELQ 592
L +LQ
Sbjct: 938 LEEQLQ 943
>gi|344293713|ref|XP_003418565.1| PREDICTED: remodeling and spacing factor 1 [Loxodonta africana]
Length = 1439
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 893 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 945
Query: 587 LVNELQ 592
L +LQ
Sbjct: 946 LEEQLQ 951
>gi|354496667|ref|XP_003510447.1| PREDICTED: remodeling and spacing factor 1-like [Cricetulus
griseus]
Length = 1373
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 828 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 880
Query: 587 LVNELQ 592
L +LQ
Sbjct: 881 LEEQLQ 886
>gi|402894781|ref|XP_003910524.1| PREDICTED: remodeling and spacing factor 1 [Papio anubis]
Length = 1403
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 856 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 908
Query: 587 LVNELQ 592
L +LQ
Sbjct: 909 LEEQLQ 914
>gi|410972585|ref|XP_003992739.1| PREDICTED: remodeling and spacing factor 1 [Felis catus]
Length = 1435
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 892 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 944
Query: 587 LVNELQ 592
L +LQ
Sbjct: 945 LEEQLQ 950
>gi|301761792|ref|XP_002916323.1| PREDICTED: remodeling and spacing factor 1-like [Ailuropoda
melanoleuca]
Length = 1454
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 911 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 963
Query: 587 LVNELQ 592
L +LQ
Sbjct: 964 LEEQLQ 969
>gi|291384198|ref|XP_002708720.1| PREDICTED: remodeling and spacing factor 1 [Oryctolagus cuniculus]
Length = 1438
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 893 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 945
Query: 587 LVNELQ 592
L +LQ
Sbjct: 946 LEEQLQ 951
>gi|348565583|ref|XP_003468582.1| PREDICTED: remodeling and spacing factor 1-like [Cavia porcellus]
Length = 1433
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 890 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 942
Query: 587 LVNELQ 592
L +LQ
Sbjct: 943 LEEQLQ 948
>gi|281341060|gb|EFB16644.1| hypothetical protein PANDA_004375 [Ailuropoda melanoleuca]
Length = 1433
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 890 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 942
Query: 587 LVNELQ 592
L +LQ
Sbjct: 943 LEEQLQ 948
>gi|452824399|gb|EME31402.1| hypothetical protein Gasu_13660 [Galdieria sulphuraria]
Length = 382
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 36/56 (64%), Gaps = 7/56 (12%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
E +YCA CGS + + ++LCDG C+ G H YCL PPL +++PP + W CP C
Sbjct: 156 EAVYCAFCGSD--TNEQVLLLCDG-CNVGMHTYCLTPPL--DEVPPGE--WFCPEC 204
>gi|7022417|dbj|BAA91591.1| unnamed protein product [Homo sapiens]
Length = 572
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 398 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 450
Query: 587 LVNELQ 592
L +LQ
Sbjct: 451 LEEQLQ 456
>gi|351698363|gb|EHB01282.1| Remodeling and spacing factor 1 [Heterocephalus glaber]
Length = 1433
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 888 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 940
Query: 587 LVNELQ 592
L +LQ
Sbjct: 941 LEEQLQ 946
>gi|73987867|ref|XP_534002.2| PREDICTED: remodeling and spacing factor 1 isoform 1 [Canis lupus
familiaris]
Length = 1434
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 891 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 943
Query: 587 LVNELQ 592
L +LQ
Sbjct: 944 LEEQLQ 949
>gi|392337823|ref|XP_001064125.3| PREDICTED: remodeling and spacing factor 1 [Rattus norvegicus]
gi|392344459|ref|XP_218939.4| PREDICTED: remodeling and spacing factor 1 [Rattus norvegicus]
Length = 1448
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 898 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 950
Query: 587 LVNELQ 592
L +LQ
Sbjct: 951 LEEQLQ 956
>gi|76663102|ref|XP_869632.1| PREDICTED: remodeling and spacing factor 1 isoform 2 [Bos taurus]
gi|297491749|ref|XP_002699109.1| PREDICTED: remodeling and spacing factor 1 [Bos taurus]
gi|296471934|tpg|DAA14049.1| TPA: remodeling and spacing factor 1 [Bos taurus]
Length = 1429
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 884 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 936
Query: 587 LVNELQ 592
L +LQ
Sbjct: 937 LEEQLQ 942
>gi|303283986|ref|XP_003061284.1| JmjN/JmjC protein [Micromonas pusilla CCMP1545]
gi|226457635|gb|EEH54934.1| JmjN/JmjC protein [Micromonas pusilla CCMP1545]
Length = 2683
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 10/79 (12%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKV-DCI 585
C CG S + +ILCDG CD+G+H YCL PP+ E++P D W CP C D
Sbjct: 264 CLNCGGS--SHEESMILCDG-CDQGYHMYCLSPPM--EELPRGD--WFCPNCVAAANDAS 316
Query: 586 DLVNELQGTRLFITDNWEK 604
D + F D++EK
Sbjct: 317 DF--GFNSGKTFTMDDFEK 333
>gi|149068918|gb|EDM18470.1| rCG40672, isoform CRA_b [Rattus norvegicus]
Length = 1418
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 868 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 920
Query: 587 LVNELQ 592
L +LQ
Sbjct: 921 LEEQLQ 926
>gi|124486905|ref|NP_001074736.1| remodeling and spacing factor 1 [Mus musculus]
Length = 1441
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 897 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 949
Query: 587 LVNELQ 592
L +LQ
Sbjct: 950 LEEQLQ 955
>gi|403224324|dbj|BAM42454.1| Requim, req/dpf2 [Theileria orientalis strain Shintoku]
Length = 966
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 8/77 (10%)
Query: 519 QIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
Q + ED CA CG+ D ++LCD CD+G+H YCL+PPL ++PP++ W C C
Sbjct: 549 QGEEEDEGCAICGNDDNWPQ--LLLCDN-CDKGYHMYCLDPPL--TEVPPNN--WYCAQC 601
Query: 579 DCKVDCIDLVNELQGTR 595
+ + + + E G R
Sbjct: 602 NMEAG-VTIAGETYGLR 617
>gi|148684364|gb|EDL16311.1| mCG124268, isoform CRA_a [Mus musculus]
Length = 1396
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 852 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 904
Query: 587 LVNELQ 592
L +LQ
Sbjct: 905 LEEQLQ 910
>gi|327260131|ref|XP_003214889.1| PREDICTED: LOW QUALITY PROTEIN: PHD and RING finger
domain-containing protein 1-like [Anolis carolinensis]
Length = 1656
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 7/62 (11%)
Query: 517 EGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
E +++ + +C CG D ++ ++LCDG CD G+H CL PPL +IP D+ W CP
Sbjct: 279 EDEVEDDPTFCEVCGRSD--REDRLLLCDG-CDAGYHMECLNPPL--SEIPVDE--WFCP 331
Query: 577 GC 578
C
Sbjct: 332 PC 333
>gi|311247024|ref|XP_003122440.1| PREDICTED: PHD and RING finger domain-containing protein 1 [Sus
scrofa]
Length = 1642
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 8/62 (12%)
Query: 518 GQIDSED-IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
G+ + ED +C CG D ++ ++LCDG CD G+H CLEPPL +++P D+ W CP
Sbjct: 175 GEHEEEDPTFCEVCGRSDR--EDRLLLCDG-CDAGYHMECLEPPL--QEVPVDE--WFCP 227
Query: 577 GC 578
C
Sbjct: 228 EC 229
>gi|224120882|ref|XP_002318442.1| SET domain protein [Populus trichocarpa]
gi|222859115|gb|EEE96662.1| SET domain protein [Populus trichocarpa]
Length = 319
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 35/58 (60%), Gaps = 7/58 (12%)
Query: 521 DSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
D D+YC KCGS + + +++LCD CD+GFH +CL P L+ +P W CP C
Sbjct: 1 DYSDVYCEKCGSGE--SPGELLLCD-KCDKGFHLFCLRPILVA--VPKG--SWFCPSC 51
>gi|348688432|gb|EGZ28246.1| hypothetical protein PHYSODRAFT_470076 [Phytophthora sojae]
Length = 2182
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 37/72 (51%), Gaps = 10/72 (13%)
Query: 507 GGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDI 566
GG P + G I+ D C CG L I+LCDG C R FH C+ +K++
Sbjct: 1641 GGKPPTALFQAGAINDSDDLCTLCGDGGL-----ILLCDGPCHRSFHLECVG---MKDE- 1691
Query: 567 PPDDEGWLCPGC 578
P+DE WLCP C
Sbjct: 1692 -PNDEQWLCPDC 1702
>gi|74183063|dbj|BAE20484.1| unnamed protein product [Mus musculus]
Length = 980
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 882 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 934
Query: 587 LVNELQ 592
L +LQ
Sbjct: 935 LEEQLQ 940
>gi|148684366|gb|EDL16313.1| mCG124268, isoform CRA_c [Mus musculus]
Length = 1010
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 37/66 (56%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG + I+LCD +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 852 CKKCGLPN--HPELILLCD-SCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 904
Query: 587 LVNELQ 592
L +LQ
Sbjct: 905 LEEQLQ 910
>gi|74144745|dbj|BAE27351.1| unnamed protein product [Mus musculus]
Length = 998
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 900 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 952
Query: 587 LVNELQ 592
L +LQ
Sbjct: 953 LEEQLQ 958
>gi|412992642|emb|CCO18622.1| predicted protein [Bathycoccus prasinos]
Length = 666
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 5/61 (8%)
Query: 521 DSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDC 580
+++++ CA+CG+ + I+LCDG CD G H +CL P L +DIP D+ W C C+
Sbjct: 580 EADEVMCAECGAG--HSPEKILLCDG-CDAGLHCFCLTPKL--DDIPEGDDPWYCDKCES 634
Query: 581 K 581
K
Sbjct: 635 K 635
>gi|149068917|gb|EDM18469.1| rCG40672, isoform CRA_a [Rattus norvegicus]
Length = 979
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 37/66 (56%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG + I+LCD +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 868 CKKCGLPN--HPELILLCD-SCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 920
Query: 587 LVNELQ 592
L +LQ
Sbjct: 921 LEEQLQ 926
>gi|326671393|ref|XP_684139.4| PREDICTED: hypothetical protein LOC327618 [Danio rerio]
Length = 1899
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 31/51 (60%), Gaps = 5/51 (9%)
Query: 541 IILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNEL 591
I+LCD +CD G+H CL PPL+ I PD E W CP C K+ C L +L
Sbjct: 1383 ILLCD-SCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCDRLEEQL 1428
>gi|15232121|ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana]
gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 9;
Short=AtMBD9; Short=MBD09; AltName: Full=Histone acetyl
tranferase MBD9; AltName: Full=Methyl-CpG-binding protein
MBD9
gi|6692266|gb|AAF24616.1|AC010870_9 unknown protein [Arabidopsis thaliana]
gi|332640148|gb|AEE73669.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana]
Length = 2176
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 60/109 (55%), Gaps = 25/109 (22%)
Query: 471 VEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGGFPKSLFDSEGQIDSEDIYCAKC 530
V+KLK ++L+ ++E+ K +I+D+ ++ L PK+ +D EG C C
Sbjct: 1250 VQKLKDYRKLECLSAEM---KKEIKDIVVSVNKL-----PKAPWD-EG-------VCKVC 1293
Query: 531 GS-KDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
G KD D+ ++LCD CD +H YCL PPL++ IP D W CP C
Sbjct: 1294 GVDKD---DDSVLLCD-TCDAEYHTYCLNPPLIR--IP--DGNWYCPSC 1334
>gi|291416231|ref|XP_002724349.1| PREDICTED: PHD and ring finger domains 1 [Oryctolagus cuniculus]
Length = 1608
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 34/53 (64%), Gaps = 7/53 (13%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+C CG D ++ ++LCDG CD G+H CLEPPL +++P D+ W CP C
Sbjct: 187 FCEVCGRSDR--EDRLLLCDG-CDAGYHMECLEPPL--QEVPVDE--WFCPEC 232
>gi|328909281|gb|AEB61308.1| remodeling and spacing factor 1-like protein, partial [Equus
caballus]
Length = 324
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG L ++IL +CD G+H CL PPL+ I PD E W CP C K+ C
Sbjct: 152 CKKCG---LPNHPELILLCDSCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKLLCEK 204
Query: 587 LVNELQ 592
L +LQ
Sbjct: 205 LEEQLQ 210
>gi|255918306|gb|ACU33968.1| HOX12 protein isoform 3 [Crassostrea angulata]
Length = 380
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 49/111 (44%), Gaps = 10/111 (9%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K E+ + + S N YP+ KE LA++ G+T SQV WF N R +
Sbjct: 265 ARRKRRNFSKQATELLNEYFYSHLSNPYPSEEAKEELARKCGITVSQVSNWFGNKRIRYK 324
Query: 943 HPSSKNAKLANSE----------KGTCTPQSNKNTVGRVSNCNGAENVQSS 983
+K + AN +G G +N NG +N QSS
Sbjct: 325 KNITKVQEEANVYAAKSAAALEGQGAAGYNVGGGQAGMYTNLNGGDNYQSS 375
>gi|345783845|ref|XP_540525.3| PREDICTED: PHD and RING finger domain-containing protein 1 [Canis
lupus familiaris]
Length = 1635
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 8/62 (12%)
Query: 518 GQIDSED-IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
G+ + ED +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP
Sbjct: 182 GEEEEEDPTFCEVCGRSDR--EDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCP 234
Query: 577 GC 578
C
Sbjct: 235 EC 236
>gi|350409455|ref|XP_003488744.1| PREDICTED: hypothetical protein LOC100749759 [Bombus impatiens]
Length = 2491
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 60/128 (46%), Gaps = 21/128 (16%)
Query: 484 TSEILRRKLKIRDLFQRLDSLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIIL 543
T + R+ I ++ +L P + SE + + +C C + ++ ++L
Sbjct: 180 TCPVDRQTFTIINVRDKLGGQIIRCIPVEVASSEEEKLDDLTFCEVCHQSN--REDRMLL 237
Query: 544 CDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCD---------CKVDC---IDLVNEL 591
CDG CDRG+H CL PPL +++P E W CP C K+D IDLV+E
Sbjct: 238 CDG-CDRGYHLECLTPPL--DEVP--IEEWFCPECSQNNQTNTETVKIDVEEIIDLVDE- 291
Query: 592 QGTRLFIT 599
RL +T
Sbjct: 292 -ARRLGVT 298
>gi|426252674|ref|XP_004020029.1| PREDICTED: LOW QUALITY PROTEIN: PHD and RING finger
domain-containing protein 1 [Ovis aries]
Length = 1656
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 7/63 (11%)
Query: 521 DSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDC 580
D + +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP C
Sbjct: 148 DEDPTFCEVCGRSDR--EDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCPECAA 200
Query: 581 KVD 583
D
Sbjct: 201 PGD 203
>gi|281201499|gb|EFA75708.1| hypothetical protein PPL_10761 [Polysphondylium pallidum PN500]
Length = 785
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 44/81 (54%)
Query: 858 PKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKE 917
P L+ + N S + H G +P S G + + +L +E + L + F+++ YPN+ KE
Sbjct: 334 PPLSPNEHNNSGKEFHTGSRSPSSGGEKEKKRRTRLKKEQSSTLKSFFEKDNYPNKEEKE 393
Query: 918 SLAKELGLTFSQVRKWFENTR 938
LA LG+ ++ V WF N R
Sbjct: 394 KLASMLGMNYTAVTTWFSNKR 414
>gi|149759683|ref|XP_001489811.1| PREDICTED: PHD and RING finger domain-containing protein 1 [Equus
caballus]
Length = 1650
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 8/62 (12%)
Query: 518 GQIDSED-IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
G+ + ED +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP
Sbjct: 176 GEDEEEDPTFCEVCGRSD--REDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCP 228
Query: 577 GC 578
C
Sbjct: 229 EC 230
>gi|444519152|gb|ELV12614.1| Ras association domain-containing protein 7 [Tupaia chinensis]
Length = 719
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 39/62 (62%), Gaps = 7/62 (11%)
Query: 517 EGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
+G+ + + +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP
Sbjct: 499 DGEEEEDPTFCEVCGRSD--REDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCP 551
Query: 577 GC 578
C
Sbjct: 552 EC 553
>gi|351712204|gb|EHB15123.1| E3 ubiquitin-protein ligase UHRF1 [Heterocephalus glaber]
Length = 779
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 9/70 (12%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C +CG ++ A + ++CD CD +H YCL+PPL +PP +E W CP DC+ + +
Sbjct: 319 CHRCGGRE--APDKQLMCD-ECDMAYHLYCLDPPL--SSVPPPEEDWYCP--DCRNEPSE 371
Query: 587 LVNELQGTRL 596
+V L G RL
Sbjct: 372 VV--LAGQRL 379
>gi|440906583|gb|ELR56831.1| PHD and RING finger domain-containing protein 1 [Bos grunniens
mutus]
Length = 1601
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 40/62 (64%), Gaps = 8/62 (12%)
Query: 518 GQIDSED-IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
G+ ++ED +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP
Sbjct: 173 GENENEDPTFCEVCGRSDR--EDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCP 225
Query: 577 GC 578
C
Sbjct: 226 EC 227
>gi|355711326|gb|AES03976.1| PHD and ring finger domains 1 [Mustela putorius furo]
Length = 1647
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 8/62 (12%)
Query: 518 GQIDSED-IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
G+ + ED +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP
Sbjct: 185 GEDEEEDPTFCEVCGRSDR--EDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCP 237
Query: 577 GC 578
C
Sbjct: 238 EC 239
>gi|384254264|gb|EIE27738.1| hypothetical protein COCSUDRAFT_39323 [Coccomyxa subellipsoidea
C-169]
Length = 1967
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 33/54 (61%), Gaps = 7/54 (12%)
Query: 525 IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+ C +CG D +++LCDG CD G+H CL+PPL ++IP D W CP C
Sbjct: 794 VGCEECGKNDRG--EEMLLCDG-CDHGYHTDCLDPPL--KEIPEGD--WFCPSC 840
>gi|319411664|emb|CBQ73708.1| related to histone acetyltransferase 3 (myst) [Sporisorium
reilianum SRZ2]
Length = 1223
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 32/55 (58%), Gaps = 6/55 (10%)
Query: 525 IYCAKCG-SKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
I C KC +D D ++ CD CDRG+H YCL PPLLK PP + W CP C
Sbjct: 141 IECKKCEICRDKGDDAQLMFCD-RCDRGWHLYCLSPPLLK---PPKGQ-WHCPTC 190
>gi|356991154|ref|NP_001103433.2| PHD and RING finger domain-containing protein 1 [Bos taurus]
Length = 1613
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 40/62 (64%), Gaps = 8/62 (12%)
Query: 518 GQIDSED-IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCP 576
G+ ++ED +C CG D ++ ++LCDG CD G+H CL+PPL +++P D+ W CP
Sbjct: 172 GENENEDPTFCEVCGRSDR--EDRLLLCDG-CDAGYHMECLDPPL--QEVPVDE--WFCP 224
Query: 577 GC 578
C
Sbjct: 225 EC 226
>gi|296083939|emb|CBI24327.3| unnamed protein product [Vitis vinifera]
Length = 78
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/65 (55%), Positives = 45/65 (69%), Gaps = 3/65 (4%)
Query: 751 NESLNNELLSIIKP--GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDG 808
N L +ELLS+++ GQD A P+ KR ER DYKKL+DE YGNV DSSDDE +++
Sbjct: 13 NVQLKDELLSVLESNSGQDNA-PLSAKRRVERRDYKKLHDEAYGNVSSDSSDDEDLTENV 71
Query: 809 GPRKR 813
PRKR
Sbjct: 72 IPRKR 76
>gi|159488318|ref|XP_001702161.1| hypothetical protein CHLREDRAFT_122939 [Chlamydomonas reinhardtii]
gi|158271346|gb|EDO97167.1| predicted protein [Chlamydomonas reinhardtii]
Length = 61
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 29/56 (51%), Positives = 35/56 (62%), Gaps = 7/56 (12%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+D CA+CG D A I+LCD CD G+H CL+PPL E+IP DD W CP C
Sbjct: 4 DDTLCARCGGGDDPAS--ILLCD-TCDAGYHMACLDPPL--EEIPDDD--WHCPKC 52
>gi|432948522|ref|XP_004084087.1| PREDICTED: uncharacterized protein LOC101155159, partial [Oryzias
latipes]
Length = 1229
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 541 IILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNEL 591
I+LCD +CD G+H CL PPL+ I PD E W CP C K C L +L
Sbjct: 879 ILLCD-SCDSGYHTACLRPPLM---IIPDGE-WFCPPCQHKQLCDKLEEQL 924
>gi|242060274|ref|XP_002451426.1| hypothetical protein SORBIDRAFT_04g001900 [Sorghum bicolor]
gi|241931257|gb|EES04402.1| hypothetical protein SORBIDRAFT_04g001900 [Sorghum bicolor]
gi|300681316|emb|CAZ96029.1| histone-lysine N-methyltransferase [Sorghum bicolor]
Length = 365
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 33/54 (61%), Gaps = 7/54 (12%)
Query: 525 IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+ C CGS D A +++LCDG CDRGFH +CL P L + +P D W CP C
Sbjct: 32 VRCEACGSGD--AAPELMLCDG-CDRGFHIFCLRPILPR--VPAGD--WYCPSC 78
>gi|410909614|ref|XP_003968285.1| PREDICTED: uncharacterized protein LOC101062098 [Takifugu rubripes]
Length = 1314
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 103/255 (40%), Gaps = 43/255 (16%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C+ CG L ++IL +CD G+H CL PPL+ + PD E W CP C K+ C
Sbjct: 752 CSHCG---LPNHPELILLCDSCDSGYHTACLRPPLM---LIPDGE-WFCPPCQHKLLCEK 804
Query: 587 LVNELQG----------------TRLFITDNWEKVFPEAAAGHNQD---------PNFGL 621
L +L +++ + E + + A +D N GL
Sbjct: 805 LEEQLHNLDSALKKKERAERRRERLVYVGISEENIIRDGDADEEKDETSAKIKSKKNKGL 864
Query: 622 ASDDSDDNE-----YDPDGSATDEQDEGD-ESSSDGSSSDDSDFTSTSDEVEAPADDKTY 675
+ + +D A DE E D DGS D ++F+ + E + P +T
Sbjct: 865 GRRSTRTRKHISYRFDDFDDAIDEAIEEDIRELCDGSGKDLANFSEDNRESQRPIRSQTC 924
Query: 676 LGLSSEDSEDDEYNPDA--PELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGNDEGA 733
++ + ++ D+ E +D+ +SS DF + +A +++ +G+D G+
Sbjct: 925 PAINRKRRRFNDLEGDSTPAESEDEFLLSNSSEDEDFGA---SVADDDDEDEDAGSDIGS 981
Query: 734 ASPLGHSNGQRYKDG 748
HS + G
Sbjct: 982 VDSRAHSRRTAHSRG 996
>gi|307103473|gb|EFN51733.1| hypothetical protein CHLNCDRAFT_139929 [Chlorella variabilis]
Length = 1890
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 4/68 (5%)
Query: 437 YSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLKIRD 496
Y R+ LR L +EQ L++AY+++GW+G S EK+KP E++RA +I + + IR+
Sbjct: 1673 YDRMLARLRSQL----HEQALVEAYAADGWRGASREKVKPVAEIKRAKDQISKCREAIRE 1728
Query: 497 LFQRLDSL 504
+D +
Sbjct: 1729 CMDMVDLI 1736
>gi|327266582|ref|XP_003218083.1| PREDICTED: pre-B-cell leukemia transcription factor 2-like [Anolis
carolinensis]
Length = 421
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 73/173 (42%), Gaps = 18/173 (10%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 238 ARRKRRNFSKQATEVLNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRYK 297
Query: 943 HPSSKNAKLANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSK-TGVDDTGCMTGDVKNNT 1001
K + AN + K V + + N SS +G +GD+
Sbjct: 298 KNIGKFQEEANI-------YAVKTAVSAAQSGGDSPNTPSSAGSGGSFNLSSSGDMFMGL 350
Query: 1002 QECNS-IKPTSQTSRKRDRDGKSGDQASDPSSKMEVIQGLSANSPKVEVQANG 1053
Q N PTSQ R G G S +++M SP+ E++ANG
Sbjct: 351 QGLNGDSYPTSQVKSMRHSMGPGGYGDSMAATQM--------YSPR-EMRANG 394
>gi|147846402|emb|CAN83770.1| hypothetical protein VITISV_008470 [Vitis vinifera]
Length = 186
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 26/34 (76%)
Query: 435 DEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKG 468
DE SR++ RYLL ++ EQNLIDAYS EGWKG
Sbjct: 147 DEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKG 180
>gi|391334814|ref|XP_003741795.1| PREDICTED: uncharacterized protein LOC100898311 [Metaseiulus
occidentalis]
Length = 1347
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 11/78 (14%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KC D I+LCD +CD G+H CL+P L+ IP + W CP C+
Sbjct: 806 CRKCNKSDHP--EFILLCD-SCDAGYHMSCLKPALMV--IPLGN--WYCPPCEHDA---- 854
Query: 587 LVNELQGTRLFITDNWEK 604
L+ L G FI WEK
Sbjct: 855 LIEALNGKLSFIEAEWEK 872
>gi|443712743|gb|ELU05907.1| hypothetical protein CAPTEDRAFT_221977 [Capitella teleta]
Length = 789
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 10/63 (15%)
Query: 527 CAKCGSKDLSADNDI-ILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
C KCG K+ D ++ ++CD CD +H YCL PPL D P+++ W CP DCKVD
Sbjct: 327 CCKCGKKE---DPELQLMCDD-CDSAYHTYCLNPPL---DALPEEDEWYCP--DCKVDSS 377
Query: 586 DLV 588
++V
Sbjct: 378 EVV 380
>gi|147783861|emb|CAN65750.1| hypothetical protein VITISV_037273 [Vitis vinifera]
Length = 105
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 36/65 (55%), Positives = 45/65 (69%), Gaps = 3/65 (4%)
Query: 751 NESLNNELLSIIKP--GQDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDG 808
N L +ELLS+++ GQD A P+ KR ER DYKKL+DE YGNV DSSDDE +++
Sbjct: 40 NVQLKDELLSVLESNSGQDNA-PLSAKRRVERRDYKKLHDEAYGNVSSDSSDDEDLTENV 98
Query: 809 GPRKR 813
PRKR
Sbjct: 99 IPRKR 103
>gi|444726592|gb|ELW67116.1| Pre-B-cell leukemia transcription factor 4 [Tupaia chinensis]
Length = 445
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 42/86 (48%), Gaps = 1/86 (1%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV K + S N YP+ KE LA++ GLT SQV WF N R +
Sbjct: 184 ARRKRRNFSKQATEVLNKYFYSHLSNPYPSEEAKEELARKGGLTVSQVSNWFGNKRIRYK 243
Query: 943 HPSSKNAKLANSEKGTCTPQSNKNTV 968
K + A KG TP + + V
Sbjct: 244 KNMGKFQEEATIYKGK-TPDTTEARV 268
>gi|291227018|ref|XP_002733487.1| PREDICTED: topoisomerase (DNA) III beta-like [Saccoglossus
kowalevskii]
Length = 3134
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 7/56 (12%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
E YC CG D ++ ++LCDG CD G+H CL+PPL +IP E W CP C
Sbjct: 1101 EPTYCEVCGRCD--REDRLLLCDG-CDAGYHCECLDPPL--RNIPV--EEWFCPEC 1149
>gi|47220028|emb|CAG12176.1| unnamed protein product [Tetraodon nigroviridis]
Length = 509
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 1/130 (0%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 231 ARRKRRNFNKQATEVLNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRYK 290
Query: 943 HPSSKNAKLANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQ 1002
K + AN T N + S N S + TG + +
Sbjct: 291 KNIGKFQEEANL-YAVKTAVDAANVSAQASQANSPATPNSGSYSLSHTGDAYLGLHSLNG 349
Query: 1003 ECNSIKPTSQ 1012
E SI P+ Q
Sbjct: 350 EGLSIAPSLQ 359
>gi|119482239|ref|XP_001261148.1| PHD finger domain protein, putative [Neosartorya fischeri NRRL 181]
gi|119409302|gb|EAW19251.1| PHD finger domain protein, putative [Neosartorya fischeri NRRL 181]
Length = 939
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 14/115 (12%)
Query: 514 FDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGW 573
F EG +D+ D+ C +CG + ++ CDG C FH CL+PPL + P D W
Sbjct: 511 FSREGSLDNSDL-CRECGGR-----GQLLCCDG-CVNSFHFSCLDPPLDPANPPEGD--W 561
Query: 574 LCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDD 628
CP C +V++L RL N E + P + H G +D +D
Sbjct: 562 FCPKCSVSRPIRKMVDKLD--RLA---NKEFMLPASIRNHFAGVKTGSRPEDKND 611
>gi|157822083|ref|NP_001101869.1| pre-B-cell leukemia transcription factor 4 [Rattus norvegicus]
gi|149035959|gb|EDL90625.1| endothelial differentiation, lysophosphatidic acid
G-protein-coupled receptor 4 (predicted), isoform CRA_b
[Rattus norvegicus]
Length = 379
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 44/92 (47%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ TKE LA++ G+T SQV WF N R +
Sbjct: 216 ARRKRRNFSKQATEVLNEYFYSHLSNPYPSEETKEELARKGGITVSQVSNWFGNKRIRYK 275
Query: 943 HPSSKNAKLANSEKGTCTPQSNKNTVGRVSNC 974
+ K + A G + + G ++C
Sbjct: 276 KNTGKFQEEATMYTGKTSTVTKTRVSGGQTSC 307
>gi|403364586|gb|EJY82059.1| Protein Jade-3 [Oxytricha trifallax]
Length = 1141
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 13/123 (10%)
Query: 467 KGLSVEKLKPEKELQRATSEILRRKLKIRDLFQRLDSLCAGGFPKSLFDSEGQID----- 521
K + VE+++ EK + T ++K K + + + L A P +L++S+ Q+
Sbjct: 446 KLIGVEEIQKEKPCRIRTETNDKKKQKSKTQLSKDEQLFASIDPATLYNSDPQVKYSPIR 505
Query: 522 ------SEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLC 575
E I C C + D+ I+LCD C+ G HQ C +L + IP D+E W C
Sbjct: 506 EYQNYIEEQISCDICRDGENYDDDTIVLCD-LCNSGAHQSCYGNDIL-DQIPQDEESWYC 563
Query: 576 PGC 578
C
Sbjct: 564 QRC 566
>gi|322785937|gb|EFZ12556.1| hypothetical protein SINV_80415 [Solenopsis invicta]
Length = 833
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 49/116 (42%), Gaps = 12/116 (10%)
Query: 524 DIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVD 583
D+ C C DNDII+CD C RG+HQ C +P + K + P D W+C C D
Sbjct: 77 DVMCVLCKKSQPKTDNDIIICD-KCGRGYHQMCHQPEISKPE-PGKDVHWICKRC---TD 131
Query: 584 CIDLVNELQGTRLFITDNWEKVFPEAAAGHNQDPNFGLASDDSDDNEYDPDGSATD 639
E + + N KV +G +Q DD YDPD + D
Sbjct: 132 SQPRTRECEPKNSMVKANIRKV----CSGRDQP---QPPPDDMTKLPYDPDMLSWD 180
>gi|301091846|ref|XP_002896098.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262094976|gb|EEY53028.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 375
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCK 581
C C + D +N+IILCD C+ +H +CL+PPL K +P +DE W CP C K
Sbjct: 176 CEVCKNSD--RENEIILCDD-CNAEYHIFCLQPPLSK--VPNEDEMWYCPKCRVK 225
>gi|301090672|ref|XP_002895541.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262097826|gb|EEY55878.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 373
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCK 581
C C + D +N+IILCD C+ +H +CL+PPL K +P +DE W CP C K
Sbjct: 174 CEVCKNSD--RENEIILCDD-CNAEYHIFCLQPPLSK--VPGEDEMWYCPKCRVK 223
>gi|302773900|ref|XP_002970367.1| hypothetical protein SELMODRAFT_171548 [Selaginella moellendorffii]
gi|300161883|gb|EFJ28497.1| hypothetical protein SELMODRAFT_171548 [Selaginella moellendorffii]
Length = 431
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 31/52 (59%), Gaps = 6/52 (11%)
Query: 528 AKCGSKDLSAD-NDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
KCG+ D + D +LCDG CDRG+H YCL P L+ +P D W CP C
Sbjct: 120 VKCGNCDRANDPQRFLLCDG-CDRGYHMYCLSPILVA--VPKGD--WFCPHC 166
>gi|297811079|ref|XP_002873423.1| hypothetical protein ARALYDRAFT_487800 [Arabidopsis lyrata subsp.
lyrata]
gi|297319260|gb|EFH49682.1| hypothetical protein ARALYDRAFT_487800 [Arabidopsis lyrata subsp.
lyrata]
Length = 379
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 33/54 (61%), Gaps = 7/54 (12%)
Query: 525 IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+ C KCGS + D++++LCD CDRGFH CL P +++ I P WLC C
Sbjct: 65 VTCEKCGSGE--GDDELLLCD-KCDRGFHMKCLRPIVVRVPIGP----WLCVDC 111
>gi|189241962|ref|XP_968788.2| PREDICTED: similar to CG8677 CG8677-PA [Tribolium castaneum]
Length = 2314
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 30/55 (54%), Gaps = 7/55 (12%)
Query: 524 DIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
D C KCG D I+LCD +CD G+H CL PPLL IP D W CP C
Sbjct: 1457 DCPCQKCGKSD--HPEWILLCD-SCDNGWHCSCLRPPLLV--IPEGD--WFCPPC 1504
>gi|339895|gb|AAA36764.1| E2A/PRL fusion protein, partial [Homo sapiens]
Length = 550
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%)
Query: 884 RRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 354 RRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 411
>gi|270015458|gb|EFA11906.1| hypothetical protein TcasGA2_TC004063 [Tribolium castaneum]
Length = 2306
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 30/55 (54%), Gaps = 7/55 (12%)
Query: 524 DIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
D C KCG D I+LCD +CD G+H CL PPLL IP D W CP C
Sbjct: 1449 DCPCQKCGKSD--HPEWILLCD-SCDNGWHCSCLRPPLLV--IPEGD--WFCPPC 1496
>gi|157117643|ref|XP_001658866.1| hypothetical protein AaeL_AAEL008063 [Aedes aegypti]
gi|108875955|gb|EAT40180.1| AAEL008063-PA [Aedes aegypti]
Length = 1636
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 42/84 (50%), Gaps = 11/84 (13%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG D I+LCD +CD+G+H CL P L IP D W CP C +
Sbjct: 740 CQKCGKSD--HPEWILLCD-SCDKGYHCSCLVPVLFI--IPEGD--WFCPLCQHE----K 788
Query: 587 LVNELQGTRLFITDNWEKVFPEAA 610
L++ LQ L + +EK+ E A
Sbjct: 789 LISNLQSKLLQFDEYYEKLKAEEA 812
>gi|449268916|gb|EMC79745.1| Pre-B-cell leukemia transcription factor 4, partial [Columba livia]
Length = 332
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 26/124 (20%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 169 ARRKRRNFSKQATEVLNEYFYSHLSNPYPSEEAKEELAKKGGITVSQVSNWFGNKRIRY- 227
Query: 943 HPSSKNAKLANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDVKNNTQ 1002
K +G+ N+ ++KT VD T G+ N+
Sbjct: 228 ----------------------KKNMGKFQE---EANIYAAKTAVDATNAAQGNQANSPP 262
Query: 1003 ECNS 1006
NS
Sbjct: 263 TPNS 266
>gi|327284083|ref|XP_003226768.1| PREDICTED: pre-B-cell leukemia transcription factor 1-like [Anolis
carolinensis]
Length = 433
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 236 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 294
>gi|157117645|ref|XP_001658867.1| hypothetical protein AaeL_AAEL008063 [Aedes aegypti]
gi|108875956|gb|EAT40181.1| AAEL008063-PB [Aedes aegypti]
Length = 1400
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 42/84 (50%), Gaps = 11/84 (13%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG D I+LCD +CD+G+H CL P L IP D W CP C +
Sbjct: 740 CQKCGKSDHPEW--ILLCD-SCDKGYHCSCLVPVLFI--IPEGD--WFCPLCQHE----K 788
Query: 587 LVNELQGTRLFITDNWEKVFPEAA 610
L++ LQ L + +EK+ E A
Sbjct: 789 LISNLQSKLLQFDEYYEKLKAEEA 812
>gi|41053014|dbj|BAD07945.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125537874|gb|EAY84269.1| hypothetical protein OsI_05649 [Oryza sativa Indica Group]
Length = 361
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 33/54 (61%), Gaps = 7/54 (12%)
Query: 525 IYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+ C CGS + +A+ ++LCDG CDRG H +CL P L + +P D W CP C
Sbjct: 33 VRCEACGSGESAAE--LLLCDG-CDRGLHIFCLRPILPR--VPAGD--WFCPSC 79
>gi|402858031|ref|XP_003893534.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Papio
anubis]
Length = 430
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|334321871|ref|XP_001370326.2| PREDICTED: pre-B-cell leukemia transcription factor 1 [Monodelphis
domestica]
gi|395530694|ref|XP_003767423.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Sarcophilus harrisii]
Length = 430
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|46048897|ref|NP_990077.1| pre-B-cell leukemia transcription factor 1 [Gallus gallus]
gi|8096555|dbj|BAA96135.1| PBX1A [Gallus gallus]
Length = 430
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|395530696|ref|XP_003767424.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 2
[Sarcophilus harrisii]
Length = 422
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|224105647|ref|XP_002313886.1| predicted protein [Populus trichocarpa]
gi|222850294|gb|EEE87841.1| predicted protein [Populus trichocarpa]
Length = 134
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 53/97 (54%), Gaps = 6/97 (6%)
Query: 7 MGVSPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVENEPRE--TS 64
MGVSPSQV+S T+ +SC QT E F AE E K L SE ++ E + T+
Sbjct: 1 MGVSPSQVSSQTKSYSCPSQTKLENTHGFTAEYNCGGYSEEKHKLESEIIQTEAGDSGTA 60
Query: 65 IPNS---EKLQAFCGDVPDSSFTDHLAPPSEDMRKST 98
+ S E ++ DV ++SFTD L PP ED R +T
Sbjct: 61 VLQSGAGETVEPSTEDVTNNSFTD-LDPPPEDARGAT 96
>gi|158256266|dbj|BAF84104.1| unnamed protein product [Homo sapiens]
Length = 430
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|149755772|ref|XP_001493290.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Equus caballus]
Length = 430
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|4505623|ref|NP_002576.1| pre-B-cell leukemia transcription factor 1 isoform 1 [Homo sapiens]
gi|34365779|ref|NP_899198.1| pre-B-cell leukemia transcription factor 1 isoform a [Mus musculus]
gi|198442893|ref|NP_001128334.1| pre-B-cell leukemia transcription factor 1 isoform a [Rattus
norvegicus]
gi|300795559|ref|NP_001179697.1| pre-B-cell leukemia transcription factor 1 [Bos taurus]
gi|114561077|ref|XP_001174529.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 3
[Pan troglodytes]
gi|291397516|ref|XP_002715935.1| PREDICTED: pre-B-cell leukemia homeobox 1-like [Oryctolagus
cuniculus]
gi|296229942|ref|XP_002760490.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Callithrix
jacchus]
gi|297662854|ref|XP_002809903.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 3
[Pongo abelii]
gi|335286581|ref|XP_003355126.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 2
[Sus scrofa]
gi|344287029|ref|XP_003415258.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Loxodonta africana]
gi|348565789|ref|XP_003468685.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Cavia porcellus]
gi|395825139|ref|XP_003785799.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Otolemur
garnettii]
gi|397508362|ref|XP_003824627.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Pan
paniscus]
gi|403305809|ref|XP_003943445.1| PREDICTED: pre-B-cell leukemia transcription factor 1-like [Saimiri
boliviensis boliviensis]
gi|426332547|ref|XP_004027865.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Gorilla gorilla gorilla]
gi|730279|sp|P40424.1|PBX1_HUMAN RecName: Full=Pre-B-cell leukemia transcription factor 1; AltName:
Full=Homeobox protein PBX1; AltName: Full=Homeobox
protein PRL
gi|20141754|sp|P41778.2|PBX1_MOUSE RecName: Full=Pre-B-cell leukemia transcription factor 1; AltName:
Full=Homeobox protein PBX1
gi|189648|gb|AAA60031.1| PBX1a [Homo sapiens]
gi|2432009|gb|AAB71191.1| PBX1a [Mus musculus]
gi|11120612|gb|AAG30941.1| PBX1 [Homo sapiens]
gi|34849830|gb|AAH58390.1| Pre B-cell leukemia transcription factor 1 [Mus musculus]
gi|75516599|gb|AAI01579.1| Pre-B-cell leukemia homeobox 1 [Homo sapiens]
gi|119611145|gb|EAW90739.1| pre-B-cell leukemia transcription factor 1, isoform CRA_b [Homo
sapiens]
gi|119611146|gb|EAW90740.1| pre-B-cell leukemia transcription factor 1, isoform CRA_b [Homo
sapiens]
gi|168277526|dbj|BAG10741.1| pre-B-cell leukemia transcription factor 1 [synthetic construct]
gi|296489925|tpg|DAA32038.1| TPA: pre-B-cell leukemia homeobox 1 [Bos taurus]
gi|380813258|gb|AFE78503.1| pre-B-cell leukemia transcription factor 1 isoform 1 [Macaca
mulatta]
gi|410226274|gb|JAA10356.1| pre-B-cell leukemia homeobox 1 [Pan troglodytes]
gi|410255186|gb|JAA15560.1| pre-B-cell leukemia homeobox 1 [Pan troglodytes]
gi|410299148|gb|JAA28174.1| pre-B-cell leukemia homeobox 1 [Pan troglodytes]
gi|410338451|gb|JAA38172.1| pre-B-cell leukemia homeobox 1 [Pan troglodytes]
Length = 430
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|297710671|ref|XP_002831994.1| PREDICTED: homeobox protein ESX1 [Pongo abelii]
Length = 412
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 36/67 (53%), Gaps = 5/67 (7%)
Query: 879 PGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
P R RR RT++ + Q+L N F E QYP+ +E LA L LT +V+ WF+N R
Sbjct: 135 PPERKRRRRTAFTQFQ---LQELENFFDEAQYPDVVARERLAARLNLTEDRVQVWFQNRR 191
Query: 939 --WSFNH 943
W N
Sbjct: 192 AKWKRNQ 198
>gi|74006104|ref|XP_545786.2| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 3
[Canis lupus familiaris]
gi|301756733|ref|XP_002914215.1| PREDICTED: pre-B-cell leukemia transcription factor 1-like isoform
1 [Ailuropoda melanoleuca]
gi|410986577|ref|XP_003999586.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Felis catus]
Length = 430
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|355746153|gb|EHH50778.1| hypothetical protein EGM_01655, partial [Macaca fascicularis]
Length = 419
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 222 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 280
>gi|114561079|ref|XP_001174513.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Pan troglodytes]
Length = 422
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|432910572|ref|XP_004078420.1| PREDICTED: pre-B-cell leukemia transcription factor 2-like [Oryzias
latipes]
Length = 396
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 36/71 (50%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 247 ARRKRRNFSKQATEVLNEYFYSHLSNPYPSEEAKEELAKQCGITVSQVSNWFGNKRIRYK 306
Query: 943 HPSSKNAKLAN 953
K + AN
Sbjct: 307 KNIGKFQEEAN 317
>gi|23592228|ref|NP_703149.1| homeobox protein ESX1 [Homo sapiens]
gi|116241356|sp|Q8N693.3|ESX1_HUMAN RecName: Full=Homeobox protein ESX1; AltName: Full=Extraembryonic,
spermatogenesis, homeobox 1; Contains: RecName:
Full=Homeobox protein ESX1-N; Contains: RecName:
Full=Homeobox protein ESX1-C
gi|21629646|gb|AAM62141.1| ESX1L [Homo sapiens]
gi|119623160|gb|EAX02755.1| extraembryonic, spermatogenesis, homeobox 1 homolog (mouse) [Homo
sapiens]
Length = 406
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 5/66 (7%)
Query: 879 PGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
P R RR RT++ + Q+L N F E+QYP+ +E LA L LT +V+ WF+N R
Sbjct: 135 PPERKRRRRTAFTQFQ---LQELENFFDESQYPDVVARERLAARLNLTEDRVQVWFQNRR 191
Query: 939 --WSFN 942
W N
Sbjct: 192 AKWKRN 197
>gi|321475569|gb|EFX86531.1| hypothetical protein DAPPUDRAFT_312821 [Daphnia pulex]
Length = 1672
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 31/54 (57%), Gaps = 9/54 (16%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEG-WLCPGCD 579
C KCG D I+LCD CD G+H C++PPLL IP EG W CP CD
Sbjct: 685 CHKCGQSD--HPEWILLCD-RCDAGWHANCVKPPLLV--IP---EGNWFCPPCD 730
>gi|410986583|ref|XP_003999589.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 4
[Felis catus]
Length = 422
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|417410615|gb|JAA51777.1| Putative transcription factor pbx, partial [Desmodus rotundus]
Length = 427
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 230 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 288
>gi|71028128|ref|XP_763707.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350661|gb|EAN31424.1| hypothetical protein TP04_0072 [Theileria parva]
Length = 964
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 29/52 (55%), Gaps = 7/52 (13%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
C CG+ D N +LCD CD+GFH YCL PPL + IP + W C C
Sbjct: 487 CQICGNDD--NWNQQLLCDN-CDKGFHTYCLNPPLTR--IP--ESNWYCQHC 531
>gi|355558989|gb|EHH15769.1| hypothetical protein EGK_01905, partial [Macaca mulatta]
Length = 399
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 202 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 260
>gi|196010814|ref|XP_002115271.1| hypothetical protein TRIADDRAFT_29410 [Trichoplax adhaerens]
gi|190582042|gb|EDV22116.1| hypothetical protein TRIADDRAFT_29410 [Trichoplax adhaerens]
Length = 400
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 36/145 (24%)
Query: 862 TEDSNI--SPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919
T SN+ S H+ + RR+RT + +E +KL + FK QYP+ T+E L
Sbjct: 208 TNKSNLIYSSKMDHQAADLKLQKLRRNRTMFT---DEQIKKLEDIFKSTQYPDVYTREEL 264
Query: 920 AKELGLTFSQVRKWFENTR--WS-------------------FNHPSSKNA--------- 949
A ++GL+ ++V+ WF N R W FN S+ ++
Sbjct: 265 ASKIGLSEARVQVWFSNRRAKWRKEGKHRSPSKLSNDGSTTLFNSKSTNSSVSEQSDVLH 324
Query: 950 -KLANSEKGTCTPQSNKNTVGRVSN 973
+ANS+K TCT + + V ++N
Sbjct: 325 DAMANSKKNTCTIDMSSDEVDDITN 349
>gi|242011986|ref|XP_002426724.1| hepatitis B virus X associated protein, hbxa, putative [Pediculus
humanus corporis]
gi|212510895|gb|EEB13986.1| hepatitis B virus X associated protein, hbxa, putative [Pediculus
humanus corporis]
Length = 2351
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 13/78 (16%)
Query: 515 DSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWL 574
D G+ D D C CG+ D I+LCD CD+G+H CL P L+ IP D W
Sbjct: 1259 DEFGKYD--DTSCENCGNNDHPEW--ILLCD-KCDKGWHASCLRPTLMI--IPEGD--WF 1309
Query: 575 CPGCDCKVDCIDLVNELQ 592
CP C+ LVN+L+
Sbjct: 1310 CPPCEHSF----LVNKLE 1323
>gi|441634917|ref|XP_004089876.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 2
[Nomascus leucogenys]
Length = 358
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 161 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 219
>gi|224058884|ref|XP_002190535.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Taeniopygia
guttata]
Length = 374
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 177 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 235
>gi|107390|pir||B33061 homeotic protein prl - human
gi|190358|gb|AAA36484.1| homeobox-containing protein, partial [Homo sapiens]
Length = 342
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 145 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 203
>gi|281348857|gb|EFB24441.1| hypothetical protein PANDA_002083 [Ailuropoda melanoleuca]
Length = 342
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 145 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 203
>gi|449268314|gb|EMC79184.1| Pre-B-cell leukemia transcription factor 1, partial [Columba livia]
Length = 343
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 146 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 204
>gi|426219117|ref|XP_004003776.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Ovis aries]
Length = 488
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 291 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 349
>gi|440790082|gb|ELR11370.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 1415
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 35/58 (60%), Gaps = 7/58 (12%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCD 579
+ + C CG D N ++LCDG C++G+H +C+ PPL ++IP DD W C C+
Sbjct: 689 QHLKCECCGRGD--DGNKLLLCDGEGCNKGYHIFCIFPPL--DEIPEDD--WFCDQCE 740
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 5/38 (13%)
Query: 541 IILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
++LCD CD G+H +CL+PPL K IP D W CP C
Sbjct: 100 LLLCDD-CDDGYHTFCLDPPLKK--IPSGD--WFCPSC 132
>gi|297281401|ref|XP_001084131.2| PREDICTED: pre-B-cell leukemia transcription factor 1-like isoform
2 [Macaca mulatta]
gi|332811115|ref|XP_003308630.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Pan
troglodytes]
gi|335286583|ref|XP_003355127.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 3
[Sus scrofa]
gi|338724771|ref|XP_003365013.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 3
[Equus caballus]
gi|426332551|ref|XP_004027867.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 3
[Gorilla gorilla gorilla]
gi|441634920|ref|XP_003258842.2| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 1
[Nomascus leucogenys]
Length = 347
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 150 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 208
>gi|345797793|ref|XP_003434357.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Canis lupus
familiaris]
gi|410986579|ref|XP_003999587.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 2
[Felis catus]
Length = 347
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 150 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 208
>gi|332811117|ref|XP_003308631.1| PREDICTED: pre-B-cell leukemia transcription factor 1 [Pan
troglodytes]
gi|426332553|ref|XP_004027868.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 4
[Gorilla gorilla gorilla]
Length = 420
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|149035958|gb|EDL90624.1| endothelial differentiation, lysophosphatidic acid
G-protein-coupled receptor 4 (predicted), isoform CRA_a
[Rattus norvegicus]
Length = 197
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 44/92 (47%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ TKE LA++ G+T SQV WF N R +
Sbjct: 34 ARRKRRNFSKQATEVLNEYFYSHLSNPYPSEETKEELARKGGITVSQVSNWFGNKRIRYK 93
Query: 943 HPSSKNAKLANSEKGTCTPQSNKNTVGRVSNC 974
+ K + A G + + G ++C
Sbjct: 94 KNTGKFQEEATMYTGKTSTVTKTRVSGGQTSC 125
>gi|326320052|ref|NP_001191892.1| pre-B-cell leukemia transcription factor 1 isoform 3 [Homo sapiens]
Length = 420
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 291
>gi|431916084|gb|ELK16338.1| Pre-B-cell leukemia transcription factor 1 [Pteropus alecto]
Length = 356
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 31/56 (55%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R
Sbjct: 152 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKR 207
>gi|297281399|ref|XP_002802090.1| PREDICTED: pre-B-cell leukemia transcription factor 1-like [Macaca
mulatta]
gi|119611144|gb|EAW90738.1| pre-B-cell leukemia transcription factor 1, isoform CRA_a [Homo
sapiens]
gi|344235790|gb|EGV91893.1| Pre-B-cell leukemia transcription factor 1 [Cricetulus griseus]
Length = 325
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 128 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 186
>gi|18149007|dbj|BAB83538.1| pre-B-cell leukemia transcription factor 1 [Macaca fascicularis]
Length = 325
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 128 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 186
>gi|332020758|gb|EGI61162.1| E3 ubiquitin-protein ligase UHRF1 [Acromyrmex echinatior]
Length = 584
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 19/114 (16%)
Query: 521 DSEDIYCAKCGSKDLSA---DNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPG 577
D+ D C KCG K + ++ +++CD CD FH CL PPLLK P++ W CP
Sbjct: 134 DNPDEECRKCGCKICAGKEEEDTLLICD-ECDYMFHMKCLNPPLLK---LPEETDWYCP- 188
Query: 578 CDCKVDCIDLVNELQG------TRLFITDNWEKVFPEAAAGHNQDPNFGLASDD 625
+CK+D NE+ G + FI E+ + A + GL S D
Sbjct: 189 -ECKID----ENEIAGDKLKKTKKKFICSEKERKWGGGMACVGRQKVCGLVSSD 237
>gi|119611838|gb|EAW91432.1| Jumonji, AT rich interactive domain 1B (RBP2-like), isoform CRA_a
[Homo sapiens]
gi|193783724|dbj|BAG53706.1| unnamed protein product [Homo sapiens]
Length = 1275
Score = 48.9 bits (115), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 138 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 189
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 190 KGD--WRCPKC 198
>gi|147861294|emb|CAN79739.1| hypothetical protein VITISV_008917 [Vitis vinifera]
Length = 537
Score = 48.9 bits (115), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 45/68 (66%), Gaps = 10/68 (14%)
Query: 634 DGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAP 693
D S +DE DE SD+SDFTS S+++ A +++ +G S+DSED ++NP+AP
Sbjct: 126 DISQSDEFDE----------SDESDFTSASNDMVASPNNEQCMGPPSDDSEDGDHNPNAP 175
Query: 694 ELDDKVTQ 701
E+D++V Q
Sbjct: 176 EIDEQVNQ 183
>gi|114679326|ref|XP_524417.2| PREDICTED: double homeobox protein A [Pan troglodytes]
Length = 204
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 857 RPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTK 916
RP+ T +S+ S + G P RR RT+Y L +F +N YP ++
Sbjct: 75 RPEAETLESSQSQGQDQPGVEFPSREARRCRTTY---SASQLHTLIKAFMKNPYPGIDSR 131
Query: 917 ESLAKELGLTFSQVRKWFENTR 938
E LAKE+G+ S+V+ WF+N R
Sbjct: 132 EELAKEIGVPESRVQIWFQNRR 153
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 886 HRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
HR K EE + L N+F + YP TK+ LA E+ S+++ WF+N R
Sbjct: 15 HRRCRTKFTEEQLKILINTFNQKPYPGYATKQKLALEINTEESRIQIWFQNRR 67
>gi|426244326|ref|XP_004015974.1| PREDICTED: double homeobox protein A [Ovis aries]
Length = 229
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 62/117 (52%), Gaps = 10/117 (8%)
Query: 828 KTPVIRRRKSTKAAKEKLNETENTPK----RRGRPKLNTEDSNISPAKS--HEGCSTPGS 881
K P+ R + + A + +E+ P R+GR + + + SP+++ EG T +
Sbjct: 9 KPPLPERLRVPREAGGPYSRSEHLPPLLGDRKGRSRGSVQPFGTSPSRAGDREG-PTMVT 67
Query: 882 RGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
+GRR RTS+ EE + L +F +N YP T K+ LA E+ S+++ WF+N R
Sbjct: 68 KGRRSRTSF---TEEQLEILVQAFSQNPYPGYTAKQRLAVEINAEESRIQIWFQNRR 121
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
Query: 860 LNTEDSNISPAKSHEGCS--TPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKE 917
L + P+K H G + RR RTSY Q L N+F EN YP ++E
Sbjct: 127 LKRPKKKLDPSKDHASPKEKIQGKQDRRCRTSY---TSSQLQTLKNAFTENPYPGIDSRE 183
Query: 918 SLAKELGLTFSQVRKWFENTR 938
LA+++G+ S+V+ WF+N R
Sbjct: 184 QLAEKIGVPESRVQIWFQNRR 204
>gi|397477130|ref|XP_003809932.1| PREDICTED: double homeobox protein A [Pan paniscus]
Length = 195
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 857 RPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTK 916
RP+ T +S+ S + G P RR RT+Y L +F +N YP ++
Sbjct: 66 RPEAETLESSQSQGQDQPGVEFPSREARRCRTTY---SASQLHTLIKAFMKNPYPGIDSR 122
Query: 917 ESLAKELGLTFSQVRKWFENTR 938
E LAKE+G+ S+V+ WF+N R
Sbjct: 123 EELAKEIGVPESRVQIWFQNRR 144
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 886 HRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
HR K EE + L N+F + YP TK+ LA E+ S+++ WF+N R
Sbjct: 6 HRRCRTKFTEEQLKILINTFNQKPYPGYATKQKLALEINTEESRIQIWFQNRR 58
>gi|194760895|ref|XP_001962668.1| GF14316 [Drosophila ananassae]
gi|190616365|gb|EDV31889.1| GF14316 [Drosophila ananassae]
Length = 2673
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
Query: 515 DSEGQIDSEDIYCAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGW 573
++E Q D + C KCG D I+LCD AC++G+H CL P L IP D W
Sbjct: 1363 NAEDQSDDAEEACQKCGKSDHP--EWILLCDTPACNKGYHCSCLSPVLFY--IPEGD--W 1416
Query: 574 LCPGCDCKVDCIDLVNELQ 592
CP C + L ELQ
Sbjct: 1417 HCPPCQQEQLIAALERELQ 1435
>gi|350412090|ref|XP_003489540.1| PREDICTED: hypothetical protein LOC100746698 [Bombus impatiens]
Length = 3834
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 34/66 (51%), Gaps = 7/66 (10%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCID 586
C KCG D I+LCD +CD+G+H CL P L+ IP D W CP C + I
Sbjct: 2588 CQKCGKAD--HPEWILLCD-SCDKGWHCSCLRPALML--IPEGD--WFCPPCQHNLLVIK 2640
Query: 587 LVNELQ 592
L L+
Sbjct: 2641 LRESLK 2646
>gi|326924814|ref|XP_003208620.1| PREDICTED: pre-B-cell leukemia transcription factor 1-like
[Meleagris gallopavo]
Length = 415
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 218 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 276
>gi|338724773|ref|XP_003365014.1| PREDICTED: pre-B-cell leukemia transcription factor 1 isoform 4
[Equus caballus]
Length = 332
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 159 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 217
>gi|119611840|gb|EAW91434.1| Jumonji, AT rich interactive domain 1B (RBP2-like), isoform CRA_c
[Homo sapiens]
Length = 1614
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 393 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 444
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 445 KGD--WRCPKC 453
>gi|351700823|gb|EHB03742.1| Lysine-specific demethylase 5B [Heterocephalus glaber]
Length = 1483
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 235 PKS--RSKKTTNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 286
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 287 KGD--WRCPKC 295
>gi|6453448|emb|CAB61368.1| hypothetical protein [Homo sapiens]
Length = 1350
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 102 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 153
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 154 KGD--WRCPKC 162
>gi|332019010|gb|EGI59549.1| Remodeling and spacing factor 1 [Acromyrmex echinatior]
Length = 4018
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 34/59 (57%), Gaps = 8/59 (13%)
Query: 521 DSEDIY-CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
++ED Y C KCG D I+LCD +CD+G+H CL P L+ IP D W CP C
Sbjct: 2765 EAEDEYACQKCGKADHP--EWILLCD-SCDKGWHCSCLRPALML--IPEGD--WFCPPC 2816
>gi|427788397|gb|JAA59650.1| Putative bromodomain adjacent to zinc finger domain protein 1a
[Rhipicephalus pulchellus]
Length = 1568
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 30/53 (56%), Gaps = 7/53 (13%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
YC C + ++LCDG CDRG H YCL+PPL E+IP D W C C
Sbjct: 1159 YCRICRRR--RDPERMLLCDG-CDRGHHLYCLKPPL--EEIPKGD--WYCISC 1204
>gi|427788395|gb|JAA59649.1| Putative atp-dependent chromatin assembly factor large subunit
[Rhipicephalus pulchellus]
Length = 1568
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 30/53 (56%), Gaps = 7/53 (13%)
Query: 526 YCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
YC C + ++LCDG CDRG H YCL+PPL E+IP D W C C
Sbjct: 1159 YCRICRRR--RDPERMLLCDG-CDRGHHLYCLKPPL--EEIPKGD--WYCISC 1204
>gi|6572291|emb|CAB63108.1| RB-binding protein [Homo sapiens]
Length = 1681
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 433 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 484
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 485 KGD--WRCPKC 493
>gi|74041518|gb|AAZ95044.1| homeodomain transcription factor Pbx1 [Ambystoma mexicanum]
Length = 425
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 1/90 (1%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFN 942
RR R ++ K EV + + S N YP+ KE LAK G++ SQV WF N R +
Sbjct: 230 ARRKRRNFSKQATEVLNEYFYSHLSNPYPSEEAKEDLAKRGGISVSQVSNWFGNKRIRYK 289
Query: 943 HPSSKNAKLANSEKG-TCTPQSNKNTVGRV 971
K + AN T +N T G++
Sbjct: 290 KNMGKFQEEANIYAAKTAVDATNAVTQGKL 319
>gi|332811646|ref|XP_514104.3| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5B [Pan
troglodytes]
Length = 1681
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 433 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 484
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 485 KGD--WRCPKC 493
>gi|395729189|ref|XP_002809647.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5B
[Pongo abelii]
Length = 1433
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|441624639|ref|XP_004089006.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5B,
partial [Nomascus leucogenys]
Length = 1960
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 712 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 763
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 764 KGD--WRCPKC 772
>gi|221043666|dbj|BAH13510.1| unnamed protein product [Homo sapiens]
Length = 301
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 883 GRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
RR R ++ K E+ + + S N YP+ KE LAK+ G+T SQV WF N R +
Sbjct: 128 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRY 186
>gi|428174337|gb|EKX43233.1| hypothetical protein GUITHDRAFT_163873 [Guillardia theta CCMP2712]
Length = 692
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 33/59 (55%), Gaps = 9/59 (15%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDG---ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
++I C C D S DNDI+LC G C +G+H CL+ +KE +P D W CP C
Sbjct: 166 DEIGCEMCKGFDCSEDNDIVLCGGKDTGCGKGWHIRCLK---IKE-LPEGD--WFCPKC 218
>gi|119611841|gb|EAW91435.1| Jumonji, AT rich interactive domain 1B (RBP2-like), isoform CRA_d
[Homo sapiens]
Length = 1641
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 393 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 444
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 445 KGD--WRCPKC 453
>gi|410215984|gb|JAA05211.1| lysine (K)-specific demethylase 5B [Pan troglodytes]
gi|410266276|gb|JAA21104.1| lysine (K)-specific demethylase 5B [Pan troglodytes]
gi|410297570|gb|JAA27385.1| lysine (K)-specific demethylase 5B [Pan troglodytes]
gi|410332597|gb|JAA35245.1| lysine (K)-specific demethylase 5B [Pan troglodytes]
Length = 1544
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|397504984|ref|XP_003823056.1| PREDICTED: lysine-specific demethylase 5B [Pan paniscus]
Length = 1544
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|57242796|ref|NP_006609.3| lysine-specific demethylase 5B [Homo sapiens]
gi|296439317|sp|Q9UGL1.3|KDM5B_HUMAN RecName: Full=Lysine-specific demethylase 5B; AltName:
Full=Cancer/testis antigen 31; Short=CT31; AltName:
Full=Histone demethylase JARID1B; AltName:
Full=Jumonji/ARID domain-containing protein 1B; AltName:
Full=PLU-1; AltName: Full=Retinoblastoma-binding protein
2 homolog 1; Short=RBP2-H1
gi|4902724|emb|CAB43532.1| PLU-1 protein [Homo sapiens]
gi|162319252|gb|AAI56050.1| Jumonji, AT rich interactive domain 1B [synthetic construct]
gi|162319436|gb|AAI57032.1| Jumonji, AT rich interactive domain 1B [synthetic construct]
Length = 1544
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|380787677|gb|AFE65714.1| lysine-specific demethylase 5B [Macaca mulatta]
Length = 1544
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|390477405|ref|XP_002807770.2| PREDICTED: lysine-specific demethylase 5B isoform 1 [Callithrix
jacchus]
Length = 1544
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|402857599|ref|XP_003893337.1| PREDICTED: lysine-specific demethylase 5B [Papio anubis]
Length = 1544
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|383413189|gb|AFH29808.1| lysine-specific demethylase 5B [Macaca mulatta]
gi|384939318|gb|AFI33264.1| lysine-specific demethylase 5B [Macaca mulatta]
gi|387540224|gb|AFJ70739.1| lysine-specific demethylase 5B [Macaca mulatta]
Length = 1544
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 296 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 347
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 348 KGD--WRCPKC 356
>gi|297281074|ref|XP_002808303.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5B-like
[Macaca mulatta]
Length = 1578
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 41/71 (57%), Gaps = 12/71 (16%)
Query: 510 PKSLFDSEGQIDSEDIY-CAKCGSKDLSADND-IILCDGACDRGFHQYCLEPPLLKEDIP 567
PKS S+ ++ D+Y C CGS + D D ++LCDG CD +H +CL PPL D+P
Sbjct: 347 PKS--RSKKATNAVDLYVCLLCGSGN---DEDRLLLCDG-CDDSYHTFCLIPPL--HDVP 398
Query: 568 PDDEGWLCPGC 578
D W CP C
Sbjct: 399 KGD--WRCPKC 407
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 48.1 bits (113), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 17/109 (15%)
Query: 475 KPEKELQRATSEILRRKLKIRDLFQRLDSL-CAGGFPKSLFDSEGQIDSEDIYCAKCGSK 533
KPEK + E L I LF + +L + + KS+ + +I C + G
Sbjct: 734 KPEKTITERWQESLMSSSSIPQLFLHMATLDSSVMWSKSILHARCRI------CRRKGDA 787
Query: 534 DLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKV 582
+ ++LCDG CDRG H YCL+PP+ + IP D W C C K+
Sbjct: 788 E-----RMLLCDG-CDRGHHMYCLKPPV--KSIPSGD--WYCVDCRPKI 826
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 10/52 (19%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
CA+C ++ILCD +C FH C++PPLL +PPD WLC C
Sbjct: 1004 CARC-----RRGGELILCD-SCPLSFHLDCVDPPLL--GVPPDI--WLCQLC 1045
>gi|67527267|gb|AAY68376.1| PaxB, partial [Trichoplax adhaerens]
Length = 318
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 55/102 (53%), Gaps = 7/102 (6%)
Query: 873 HEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRK 932
H+ + RR+RT + +E +KL + FK QYP+ T+E LA ++GL+ ++V+
Sbjct: 211 HQAADLKLQKLRRNRTMF---TDEQIKKLEDIFKSTQYPDVYTREELASKIGLSEARVQV 267
Query: 933 WFENTR--WSFNHPSSKNAKLANSEKGTCTPQSNKNTVGRVS 972
WF N R W +KL+N G+ T ++K+T VS
Sbjct: 268 WFSNRRAKWRKEGKHRSPSKLSND--GSTTLFNSKSTNSSVS 307
>gi|198427843|ref|XP_002120103.1| PREDICTED: transcription factor protein [Ciona intestinalis]
Length = 510
Score = 48.1 bits (113), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 3/84 (3%)
Query: 858 PKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKE 917
P + + + A H G S P + RR RT++ +++ F+ YP+ +E
Sbjct: 305 PFMTSHGHHTDDASGHGGVSVPRKKQRRTRTTFNSGQLAALERV---FERTHYPDAFVRE 361
Query: 918 SLAKELGLTFSQVRKWFENTRWSF 941
LA+ +GL+ ++V+ WF+N R F
Sbjct: 362 ELARRVGLSEARVQVWFQNRRAKF 385
>gi|405973225|gb|EKC37949.1| Remodeling and spacing factor 1 [Crassostrea gigas]
Length = 2475
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 35/69 (50%), Gaps = 11/69 (15%)
Query: 524 DIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVD 583
D C KC + ++LCD CD FH CL PPL+ I PD + W CP C+ K
Sbjct: 1157 DTPCCKC--QKTHQPEWLLLCD-KCDAAFHTACLRPPLM---IIPDGD-WYCPPCEHK-- 1207
Query: 584 CIDLVNELQ 592
LV+ LQ
Sbjct: 1208 --SLVSRLQ 1214
>gi|443690042|gb|ELT92280.1| hypothetical protein CAPTEDRAFT_224752 [Capitella teleta]
Length = 1892
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 10/94 (10%)
Query: 509 FPKSLFDSEGQIDSEDIYCAKCG-SKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIP 567
F SL + G + + I C KC + +++++ CD ACDRG H C PPL
Sbjct: 246 FSDSLVERVGHMRWQCIECKKCSLCGETGKEDNMLFCD-ACDRGIHMECCIPPLTSA--- 301
Query: 568 PDDEG-WLCPGCDCKVDCIDLVNELQGTRLFITD 600
EG W+C C C D I+ L + ++D
Sbjct: 302 --PEGKWVC--CLCDEDTIEAFKALNQPTIKLSD 331
>gi|328780804|ref|XP_001121484.2| PREDICTED: hypothetical protein LOC725662 [Apis mellifera]
Length = 3827
Score = 47.4 bits (111), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 29/52 (55%), Gaps = 7/52 (13%)
Query: 527 CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
C KCG D I+LCD +CD+G+H CL P L+ IP D W CP C
Sbjct: 2589 CQKCGKAD--HPEWILLCD-SCDKGWHCSCLRPALML--IPEGD--WFCPPC 2633
>gi|348508478|ref|XP_003441781.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oreochromis niloticus]
Length = 1521
Score = 47.4 bits (111), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 17/105 (16%)
Query: 482 RATSEILRRKLKIRDL--FQRLDSLCAGGFPKSLFDS--EGQIDSEDIYCAKCGSKDLSA 537
R + + + K IR+ F R+ L + D+ + + +E+ C C K
Sbjct: 1159 RVATAVEKWKTAIREAQTFSRMHVLLG------MLDACIKWDMSAENARCKVCRRK--GD 1210
Query: 538 DNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKV 582
D +ILCD C++ FH +CL P L + P+ E WLCP C V
Sbjct: 1211 DEKLILCD-ECNKAFHLFCLRPALYR---VPNGE-WLCPACQPTV 1250
>gi|196014713|ref|XP_002117215.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
gi|190580180|gb|EDV20265.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
Length = 1478
Score = 47.4 bits (111), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 10/75 (13%)
Query: 504 LCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLK 563
+ KS+ S+ +++ C + G +L ++LCD CDRG H YCL PPL
Sbjct: 1121 IYLATLDKSVVWSKSILNARCRICRRKGDAEL-----MLLCD-ECDRGHHTYCLRPPL-- 1172
Query: 564 EDIPPDDEGWLCPGC 578
IP + W CP C
Sbjct: 1173 NSIPAGN--WYCPDC 1185
>gi|147781987|emb|CAN63298.1| hypothetical protein VITISV_044136 [Vitis vinifera]
Length = 270
Score = 47.0 bits (110), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 45/68 (66%), Gaps = 10/68 (14%)
Query: 634 DGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAP 693
D S +DE DE SD+SDFTS S+++ A +++ +G S+DSED ++NP+AP
Sbjct: 126 DISQSDEFDE----------SDESDFTSASNDMVASPNNEQCMGPLSDDSEDGDHNPNAP 175
Query: 694 ELDDKVTQ 701
E+D++V Q
Sbjct: 176 EIDEQVNQ 183
>gi|195437556|ref|XP_002066706.1| GK24629 [Drosophila willistoni]
gi|194162791|gb|EDW77692.1| GK24629 [Drosophila willistoni]
Length = 460
Score = 46.6 bits (109), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 36/67 (53%), Gaps = 3/67 (4%)
Query: 872 SHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVR 931
SH G P R RRHRT + EE ++L +F + YP+ +E LA ++ L +V
Sbjct: 299 SHLGHGPPPKRKRRHRTIF---TEEQLEQLEATFDKTHYPDVVLREQLALKVDLKEERVE 355
Query: 932 KWFENTR 938
WF+N R
Sbjct: 356 VWFKNRR 362
>gi|242019572|ref|XP_002430234.1| Paired mesoderm homeobox protein 2B, putative [Pediculus humanus
corporis]
gi|212515334|gb|EEB17496.1| Paired mesoderm homeobox protein 2B, putative [Pediculus humanus
corporis]
Length = 168
Score = 46.2 bits (108), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 873 HEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRK 932
H+G T + RR RT++ ++L +F+E YP+ T+E +A ++ LT ++V+
Sbjct: 17 HDGVLTEKRKQRRIRTTFTSAQ---LKELERAFQETHYPDIYTREEIAMKIDLTEARVQV 73
Query: 933 WFENTRWSF 941
WF+N R F
Sbjct: 74 WFQNRRAKF 82
>gi|307193181|gb|EFN76086.1| Remodeling and spacing factor 1 [Harpegnathos saltator]
Length = 4001
Score = 46.2 bits (108), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 40/75 (53%), Gaps = 12/75 (16%)
Query: 521 DSEDIY-CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCD 579
++ED Y C KC D I+LCD +CD+G+H CL P L+ IP D W CP C
Sbjct: 2748 EAEDEYACQKCNKADHP--EWILLCD-SCDKGWHCSCLRPALML--IPEGD--WFCPPCQ 2800
Query: 580 CKVDCIDLVNELQGT 594
+ LV +L+ T
Sbjct: 2801 HNL----LVTKLRET 2811
>gi|357138535|ref|XP_003570847.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like
[Brachypodium distachyon]
Length = 355
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 3/41 (7%)
Query: 521 DSEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPL 561
D +++ C CGS D +A+ ++LCDG CDRG H +CL P L
Sbjct: 22 DEDEVRCEACGSGDAAAE--LLLCDG-CDRGLHIFCLRPIL 59
>gi|195385426|ref|XP_002051407.1| GJ12419 [Drosophila virilis]
gi|194147864|gb|EDW63562.1| GJ12419 [Drosophila virilis]
Length = 2967
Score = 44.7 bits (104), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 40/87 (45%), Gaps = 8/87 (9%)
Query: 527 CAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
C KCG D I+LCD C++G+H CL P L IP D W CP C + +
Sbjct: 1583 CQKCGKSD--HPEWILLCDTPGCNKGYHCSCLSPVLFY--IPEGD--WHCPPCQQEQLIV 1636
Query: 586 DLVNELQGT-RLFITDNWEKVFPEAAA 611
L +LQ L + EK+ E A
Sbjct: 1637 ALEQKLQQFDDLVARKHQEKLLAEEAV 1663
>gi|195475926|ref|XP_002090234.1| GE12907 [Drosophila yakuba]
gi|194176335|gb|EDW89946.1| GE12907 [Drosophila yakuba]
Length = 2650
Score = 44.7 bits (104), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 33/67 (49%), Gaps = 7/67 (10%)
Query: 527 CAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
C KCG D I+LCD AC++G+H CL P L IP D W CP C +
Sbjct: 1353 CQKCGKSD--HPEWILLCDTLACNKGYHCSCLSPVLFY--IPEGD--WHCPPCQQEQLIT 1406
Query: 586 DLVNELQ 592
L +LQ
Sbjct: 1407 ALERQLQ 1413
>gi|414585168|tpg|DAA35739.1| TPA: putative homeodomain-like transcription factor superfamily
protein [Zea mays]
Length = 816
Score = 44.3 bits (103), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 26/37 (70%)
Query: 893 LGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQ 929
G VTQKL F+++ YP TKE LA+ELGLTF+Q
Sbjct: 131 FGPIVTQKLKEHFEKDPYPCHATKEGLAQELGLTFNQ 167
>gi|307179212|gb|EFN67624.1| Remodeling and spacing factor 1 [Camponotus floridanus]
Length = 3956
Score = 43.5 bits (101), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 32/112 (28%)
Query: 521 DSEDIY-CAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCD 579
++ED Y C KCG D I+LCD +CD+G+H CL P L+ IP D W CP C
Sbjct: 2708 EAEDEYACQKCGKAD--HPEWILLCD-SCDKGWHCSCLRPALML--IPEGD--WFCPPCQ 2760
Query: 580 CKVDCIDLVNELQGT-------------------RL-FITDNWEKVFPEAAA 611
+ LV +LQ T RL F+ + + V + A
Sbjct: 2761 HNL----LVTKLQETLKTLDQLTKRHENEVLRKKRLAFVGISLDNVLHKGEA 2808
>gi|195116391|ref|XP_002002739.1| GI11272 [Drosophila mojavensis]
gi|193913314|gb|EDW12181.1| GI11272 [Drosophila mojavensis]
Length = 2942
Score = 43.1 bits (100), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 32/67 (47%), Gaps = 7/67 (10%)
Query: 527 CAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCI 585
C KCG D I+LCD C++G+H CL P L IP D W CP C +
Sbjct: 1589 CQKCGKSDHP--EWILLCDTPGCNKGYHCSCLSPVLFY--IPEGD--WHCPPCQQEQLIA 1642
Query: 586 DLVNELQ 592
L +LQ
Sbjct: 1643 ALEQQLQ 1649
>gi|194892982|ref|XP_001977782.1| GG18050 [Drosophila erecta]
gi|190649431|gb|EDV46709.1| GG18050 [Drosophila erecta]
Length = 769
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 94/223 (42%), Gaps = 37/223 (16%)
Query: 748 GGNNESLNNELLSIIKPGQDGAAPV--YGKRSSERLDYKKL-------YDETYGNVP--- 795
GG++E N LLS+ QD +P+ K SS L++++ + Y +P
Sbjct: 376 GGSHE---NRLLSLASSVQDTRSPITTLEKSSSSSLNHQRKCSSTPEDFSALYSGLPTPG 432
Query: 796 YDSSDDESWSDDGGPRKRTKSTKEG--------SSASPDGKTPVIR--RRKSTKAAKEKL 845
DSS P + + T G A+ V+R RR + + A +KL
Sbjct: 433 MDSSSHHHTPAHTPPSRLSDHTISGRPQPQPPPPPAAFFDLARVLREWRRVAAEGAFKKL 492
Query: 846 NETENTPKRRGRPKLNTEDSNISPAKSHEG-------CSTPGSRGRRHRTSYRKLGEEVT 898
N+ + + S +S H G C TP R RHRT++ +E
Sbjct: 493 KPEPNSGLSTVSAGITSPGSGLSSLSQHAGHTPTTASCPTPARR--RHRTTFT---QEQL 547
Query: 899 QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSF 941
+L +F ++ YP+ +E LA+ L ++++ WF+N R +
Sbjct: 548 AELEAAFAKSHYPDIYCREELARTTKLNEARIQVWFQNRRAKY 590
>gi|442628745|ref|NP_610101.3| CG8677 [Drosophila melanogaster]
gi|440214036|gb|AAF53983.3| CG8677 [Drosophila melanogaster]
Length = 2663
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 27/53 (50%), Gaps = 7/53 (13%)
Query: 527 CAKCGSKDLSADNDIILCDG-ACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
C KCG D I+LCD C++G+H CL P L IP D W CP C
Sbjct: 1373 CQKCGKSD--HPEWILLCDTPTCNKGYHCSCLSPVLFY--IPEGD--WHCPPC 1419
>gi|157138655|ref|XP_001657329.1| zinc finger protein [Aedes aegypti]
gi|108869464|gb|EAT33689.1| AAEL014034-PA [Aedes aegypti]
Length = 1526
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 7/57 (12%)
Query: 522 SEDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGC 578
+E I C C K + +LCD C+R H YCL+P L + +P D W CP C
Sbjct: 1105 AERIACMICRRKGIPEQT--LLCDD-CNRACHMYCLKPKL--KQVPEGD--WYCPKC 1154
>gi|363730693|ref|XP_419142.3| PREDICTED: LOW QUALITY PROTEIN: erythrocyte membrane protein band
4.1-like 3 [Gallus gallus]
Length = 1098
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 40/178 (22%), Positives = 76/178 (42%), Gaps = 28/178 (15%)
Query: 621 LASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADD-------- 672
LA+D SD +E + D TD +G+ ++++ +D D + +++ +D
Sbjct: 686 LANDLSDSSEEETDSEQTDTAADGETTATESDQEEDGDLKA--QDLDKTQEDLMKHQTNI 743
Query: 673 ----KTYLGLSSEDSEDDEY--------------NPDAPELDDKVTQESSSSGSDFTSDS 714
+T+L S+E + +E+ +AP ++ V +E+ S + T D
Sbjct: 744 SELKRTFLETSTETAVSNEWEKRLSTSPVRLAARQEEAPMIEPLVPEETKQSTGEKTMDG 803
Query: 715 EDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPV 772
D+ +++E R G + HS QR + E ++EL +P QD V
Sbjct: 804 SDIFSLIESARKPTEFIGGVTTTSHSWAQRIDTTTSQEVTSSELKQAAQPHQDAVTKV 861
>gi|195059442|ref|XP_001995638.1| GH17865 [Drosophila grimshawi]
gi|193896424|gb|EDV95290.1| GH17865 [Drosophila grimshawi]
Length = 711
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 5/66 (7%)
Query: 876 CSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFE 935
C TP R RHRT++ +E +L +F ++ YP+ +E LA+ L ++++ WF+
Sbjct: 492 CPTPARR--RHRTTFT---QEQLAELEAAFAKSHYPDIYCREELARTTKLNEARIQVWFQ 546
Query: 936 NTRWSF 941
N R +
Sbjct: 547 NRRAKY 552
>gi|427796623|gb|JAA63763.1| Putative phosphatase binding protein, partial [Rhipicephalus
pulchellus]
Length = 628
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 58/267 (21%), Positives = 120/267 (44%), Gaps = 23/267 (8%)
Query: 244 DANKSCQAGEISCLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGA 303
DA ++ E+S S+ + T +S + P+ K GS+L++T L + +
Sbjct: 10 DAKAKLKSKEVSSKASSTKGSDGKATKTLSKNIPTKKATKKGSELKKTVLTKAKKAASKS 69
Query: 304 SLELVVKSSIEQLKQPEVPITIPSTKTSATKHLQSSSDLMEKKSCLEQSETPPNYVA--N 361
+LVV + K P + K + + +S++ + + K CL + + +
Sbjct: 70 KTKLVVAAK----KGPNTKSKLAPVKKTKAQSAKSAAIVAKYKKCLIKKASQKQKLGKPK 125
Query: 362 NSACLGRKGKRATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQ 421
N+ L +K + KS+K+ V + D+V+++ G+ +SS N ++ E K
Sbjct: 126 NTVTLKKKVQTDAKSVKHKQKV--VRKPDKVVKASKGK-----QSSKNEHNMAVAKEMKV 178
Query: 422 KKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSS------EGWKGLSVEKLK 475
KK N I+ +K V + L+ +N + + SS + K L V+ +K
Sbjct: 179 KKENNIKPEKKVMIIKKVVNKSLKKAKETVNRAKKPANKASSPSKQSVQNVKKLPVKNMK 238
Query: 476 PEKELQRATSEILRRKLKIRDLFQRLD 502
P +R +++ R + +I+++ +R++
Sbjct: 239 P----RRQPTKVFRTRYRIKEMHRRIE 261
>gi|195448076|ref|XP_002071499.1| GK25107 [Drosophila willistoni]
gi|194167584|gb|EDW82485.1| GK25107 [Drosophila willistoni]
Length = 638
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 5/66 (7%)
Query: 876 CSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFE 935
C TP R RHRT++ +E +L +F ++ YP+ +E LA+ L ++++ WF+
Sbjct: 459 CPTPARR--RHRTTFT---QEQLAELEAAFAKSHYPDIYCREELARTTKLNEARIQVWFQ 513
Query: 936 NTRWSF 941
N R +
Sbjct: 514 NRRAKY 519
>gi|440808048|gb|AGC24167.1| Phox2 [Lymnaea stagnalis]
Length = 343
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 861 NTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLA 920
++ DS++SP K + RR RT++ ++L +F E YP+ T+E +A
Sbjct: 124 HSHDSSVSPEKRKQ---------RRIRTTFT---SAQLKELEKAFAETHYPDIYTREEIA 171
Query: 921 KELGLTFSQVRKWFENTRWSF 941
+ LT ++V+ WF+N R F
Sbjct: 172 MKTDLTEARVQVWFQNRRAKF 192
>gi|195118212|ref|XP_002003634.1| GI18020 [Drosophila mojavensis]
gi|193914209|gb|EDW13076.1| GI18020 [Drosophila mojavensis]
Length = 538
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 882 RGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTR 938
R RRHRT + EE ++L +F + YP+ +E LA ++ L +V WF+N R
Sbjct: 392 RKRRHRTIF---TEEQLEQLEATFDKTHYPDVVLREQLALKVDLKEERVEVWFKNRR 445
>gi|312385739|gb|EFR30164.1| hypothetical protein AND_00394 [Anopheles darlingi]
Length = 820
Score = 40.4 bits (93), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 523 EDIYCAKCGSKDLSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCK 581
ED+ C +C D+SA N ++ C C +HQ C +P + + D ++ W C C K
Sbjct: 35 EDLMCVECRRMDVSARNRLVEC-ADCHALYHQECHKPNISEADANDEENAWYCTLCRTK 92
>gi|326917450|ref|XP_003205012.1| PREDICTED: LOW QUALITY PROTEIN: band 4.1-like protein 3-like
[Meleagris gallopavo]
Length = 1096
Score = 39.7 bits (91), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 40/178 (22%), Positives = 76/178 (42%), Gaps = 28/178 (15%)
Query: 621 LASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAPADD-------- 672
LA+D SD +E + D TD +G+ ++++ +D D + +++ +D
Sbjct: 677 LANDLSDSSEEETDSEQTDTAADGETTATESDQEEDGDLKA--QDLDKTQEDLMKHQTNI 734
Query: 673 ----KTYLGLSSEDSEDDEY--------------NPDAPELDDKVTQESSSSGSDFTSDS 714
+T+L S+E + +E+ +AP ++ V +E+ S + T D
Sbjct: 735 SELKRTFLETSTETTVSNEWEKRLSTSPVRLAARQEEAPMIEPLVPEETKQSTGEKTLDG 794
Query: 715 EDLAAVLEDNRSSGNDEGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPGQDGAAPV 772
D+ +++E R G + HS QR + E ++EL +P QD V
Sbjct: 795 SDIFSLIESARKPTEFIGGVTTTSHSWAQRIDTTTSQEITSSELKQEAQPHQDAVTKV 852
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.307 0.125 0.356
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,326,473,710
Number of Sequences: 23463169
Number of extensions: 780169532
Number of successful extensions: 3498533
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5032
Number of HSP's successfully gapped in prelim test: 22229
Number of HSP's that attempted gapping in prelim test: 2967957
Number of HSP's gapped (non-prelim): 221750
length of query: 1063
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 910
effective length of database: 8,769,330,510
effective search space: 7980090764100
effective search space used: 7980090764100
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 83 (36.6 bits)