BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006207
(657 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225445450|ref|XP_002285089.1| PREDICTED: uncharacterized protein LOC100266409 [Vitis vinifera]
gi|147821405|emb|CAN63500.1| hypothetical protein VITISV_011675 [Vitis vinifera]
gi|297738929|emb|CBI28174.3| unnamed protein product [Vitis vinifera]
Length = 660
Score = 950 bits (2455), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/666 (69%), Positives = 547/666 (82%), Gaps = 15/666 (2%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+G +EIS+DKLP+KRLDAIEE GAERFP DVGYD+KR SLIRRIDFAWAVEKD K+ K
Sbjct: 1 MDGKMEISLDKLPIKRLDAIEENGAERFPTDVGYDDKRVSLIRRIDFAWAVEKD-TKKQK 59
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
K+SKE + TPW WQ++VENL+ AHQELSVIIDLI+TVEANDAVTVA MTRPK LPN
Sbjct: 60 KASKEEA-----TPWPWQSLVENLRLAHQELSVIIDLISTVEANDAVTVASMTRPKPLPN 114
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
EVLSDL VSAATKLQC+RHL YFKQSAK+LEQQIA+EARFYGALIRLQQNWKVKRQRVA
Sbjct: 115 EVLSDLGVSAATKLQCFRHLSKYFKQSAKALEQQIAREARFYGALIRLQQNWKVKRQRVA 174
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
A A GNEGF+IDLF NSL+D V RPSSLST+R+DHDSAGMLA++LPPNSCR+ FGFL
Sbjct: 175 ASAPGNEGFSIDLFSNSLHDPVAVFRPSSLSTVRVDHDSAGMLAVHLPPNSCRALHFGFL 234
Query: 241 GVQSGDSSKQCSKVKNSCSPR-PSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIV 299
G G+ K+ SK K PSKE K+ ++D+ECV+E HS+LRE HQAIF EQVFD+V
Sbjct: 235 GGHLGNIPKEPSKTKTYGPDELPSKEIKKPLSDNECVKETHSVLREGHQAIFDEQVFDLV 294
Query: 300 NREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILPL 359
NREAF S GVNVTGIRENYLQL IG G S+F+SL+PS Q + + D QN+ES ILP+
Sbjct: 295 NREAFNSSSGVNVTGIRENYLQLNIGQGASVFMSLVPSGQDEKTADGMGMQNLESAILPM 354
Query: 360 DSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTG-TRVSGPP 418
D+ DGVKL++ K D +K G+PN ++ EIYL+Q+FHE+++ RAK+K IS G T+ S P
Sbjct: 355 DTFDGVKLSDGKHDNDKKKSGFPNRISSEIYLKQLFHEHVFVRAKDKHISAGRTQPSSQP 414
Query: 419 TKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVP 478
KDG GLLGHFC+SLAHRIFS+KV +ELEN V VPYLHL+SHPTWHSRTSSWT+ +KVP
Sbjct: 415 AKDGFGLLGHFCMSLAHRIFSSKVLMELENLVSRVPYLHLLSHPTWHSRTSSWTLLMKVP 474
Query: 479 QSILHAESNSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSS 532
+S H +RT++ K+HFRT VV+NDDCI+VEG+G PNVVGLFKG SED+ S
Sbjct: 475 ESSFHPGCQTRTSDIHNVKNIIKTHFRTKVVINDDCINVEGDGAPNVVGLFKGSSEDVCS 534
Query: 533 MNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAH 592
MN+YDCDLAD+PVI+LQQVASQVIRWLHEEAL VGIKANRDFL LSFE+DQGE +SLVAH
Sbjct: 535 MNRYDCDLADIPVILLQQVASQVIRWLHEEALKVGIKANRDFLCLSFEMDQGEMLSLVAH 594
Query: 593 VDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLC 652
V+P+D +GCISWWLVM+DGF+ + K D D S+Y+KFLGHLSL+VLYSTLMDLVSLC
Sbjct: 595 VNPDDPQGCISWWLVMDDGFSEDLKFHTDAPDGGSEYRKFLGHLSLEVLYSTLMDLVSLC 654
Query: 653 -GGGSH 657
GGG H
Sbjct: 655 NGGGRH 660
>gi|224130170|ref|XP_002328671.1| predicted protein [Populus trichocarpa]
gi|222838847|gb|EEE77198.1| predicted protein [Populus trichocarpa]
Length = 667
Score = 940 bits (2430), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/668 (70%), Positives = 551/668 (82%), Gaps = 12/668 (1%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+G LE+S+DKLPVKRL++IEE G ERFP D+GYDEK+ +LIRRIDFAWAVEK+D ++ +
Sbjct: 1 MDGKLEMSLDKLPVKRLESIEENGFERFPTDIGYDEKQVALIRRIDFAWAVEKEDKEKKQ 60
Query: 61 KSSKESSSSA---TTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKA 117
K ++ SS TTTPW WQNMVENL AHQELSVIIDLINTVEANDAVTVAGMTRPK
Sbjct: 61 KKKQKKSSRESSSTTTPWPWQNMVENLHLAHQELSVIIDLINTVEANDAVTVAGMTRPKP 120
Query: 118 LPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQ 177
LPNE+L+DL+VS ATKLQCYR+LG YFKQSAK+LEQQ+A+EARFYGALIRLQQNWKVKRQ
Sbjct: 121 LPNEILADLAVSTATKLQCYRNLGKYFKQSAKALEQQVAREARFYGALIRLQQNWKVKRQ 180
Query: 178 RVAAPASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRF 237
RVAA A GNEGF IDLFDNSLYDS V +PSSLSTIRIDHDS GMLAINLP SC S F
Sbjct: 181 RVAAIAPGNEGFMIDLFDNSLYDSVAVFQPSSLSTIRIDHDSDGMLAINLPSKSCHSLVF 240
Query: 238 GFLGVQSGDSSKQCSKVKNSCSPR-PSKEA-KESVNDDECVREKHSLLREVHQAIFYEQV 295
GFL S + K+ +K+K S + PSK KES++D+ECV++ H LLR+VH+ IF EQV
Sbjct: 241 GFLSGHS-NVPKKSNKIKTHGSLKNPSKNPEKESLSDNECVKDTHLLLRKVHRTIFDEQV 299
Query: 296 FDIVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESG 355
FD+VNR A QS G+NVTGI+ENYLQL IG GISIF+S++PS+QGD ++DS +N+ES
Sbjct: 300 FDMVNRGAVNQSSGLNVTGIQENYLQLCIGPGISIFISIVPSDQGDQAIDSEGPENLESA 359
Query: 356 ILPLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVS 415
++PLDS DGVKLAEEK + L K +PN +TYEIYL+Q+FHEY++ AK +P TGTR+
Sbjct: 360 VVPLDSFDGVKLAEEKHNSLTKKPRFPNCITYEIYLKQIFHEYVFVEAKGRPSFTGTRMP 419
Query: 416 GPPTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFL 475
G P DGSGLL HFCLSL+HRI SNKV +ELEN VC VPYLHL+SHPTWHSR+S+WT+F+
Sbjct: 420 GQPANDGSGLLSHFCLSLSHRIISNKVLMELENVVCRVPYLHLISHPTWHSRSSAWTIFM 479
Query: 476 KVPQSILHAESNSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSED 529
K+P SILHA S +RT + KS F T VVV+DDCI++E EG PNVVGLFK S+D
Sbjct: 480 KIPPSILHASSQTRTPDIQNMKNVVKSEFWTKVVVHDDCINIEAEGAPNVVGLFKDSSDD 539
Query: 530 MSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSL 589
S NKYDC+L DLPVIILQQVASQVIRWLHEEAL VGIKANRDFL LSFEL+QGE ++L
Sbjct: 540 KCSTNKYDCNLDDLPVIILQQVASQVIRWLHEEALAVGIKANRDFLCLSFELEQGEILNL 599
Query: 590 VAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLV 649
VAHVDPED +GCISWWL MEDGFA E+KL ++I+D AS+Y+KFLG+L LDVLYSTLMDLV
Sbjct: 600 VAHVDPEDTQGCISWWLTMEDGFAEEKKLHMNIADGASEYRKFLGYLPLDVLYSTLMDLV 659
Query: 650 SLCGGGSH 657
SLCGGGSH
Sbjct: 660 SLCGGGSH 667
>gi|356513925|ref|XP_003525658.1| PREDICTED: uncharacterized protein LOC100810597 [Glycine max]
Length = 659
Score = 867 bits (2239), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/658 (65%), Positives = 523/658 (79%), Gaps = 15/658 (2%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSSK 64
L+IS+DKLP+KRLD+IEE G ERFPPDV YDEKR SLIRRIDFAWAVEK +K K
Sbjct: 7 LQISLDKLPIKRLDSIEENGIERFPPDVDYDEKRLSLIRRIDFAWAVEK----DEEKKKK 62
Query: 65 ESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLS 124
+ SS T+TPWQWQ MVENLQ AHQELSVIIDLINTVEANDAVTVA MTRPK LPNE LS
Sbjct: 63 QKSSKETSTPWQWQGMVENLQLAHQELSVIIDLINTVEANDAVTVASMTRPKLLPNEALS 122
Query: 125 DLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPAS 184
DL+VSAATKLQCYRH+G YFKQSAK+ EQQ+A+EARFYGALIRLQQNWKVKRQR AA
Sbjct: 123 DLAVSAATKLQCYRHVGKYFKQSAKAFEQQVAREARFYGALIRLQQNWKVKRQRQAAIVP 182
Query: 185 GNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQS 244
GNEGFT DLFDNS YD A + R S+ST+R++HD+AG LAIN+ P+ C S +FGF+G QS
Sbjct: 183 GNEGFTFDLFDNS-YDQAAIIRSLSMSTVRVNHDAAGNLAINVSPDLCHSLQFGFVGAQS 241
Query: 245 GDSSKQCSKVKNSCSPRPS--KEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIVNRE 302
D+ ++ ++ K+ S + K +ES++D+ECV++ HSLLREVH+AIF EQVFD+VNRE
Sbjct: 242 DDTRRKSNQNKSHFSGELNLGKTGEESLSDEECVKKTHSLLREVHEAIFNEQVFDLVNRE 301
Query: 303 AFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILPLDSH 362
AF GV+VTGIRENYLQL +G S++L+L+ ++Q +V+ + N E+ ILPL+S
Sbjct: 302 AFNTVAGVSVTGIRENYLQLSLGQETSVYLTLVSNSQDHSTVEGELTDNAENAILPLESS 361
Query: 363 DGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGPPTKDG 422
DG+K E K + K G + N + YEIY+QQ+FHEY++G+ +KP S+G R+SG KDG
Sbjct: 362 DGMK-REAKQNTSNK-GQFSNSICYEIYIQQIFHEYIFGKGGDKPSSSGNRLSGIQAKDG 419
Query: 423 SGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQSIL 482
S LLGHF SLAHRIFS KV ELEN VC VPY+ L+S+PTWHSR SSWT++++VPQSIL
Sbjct: 420 SSLLGHFFKSLAHRIFSTKVLAELENVVCKVPYIQLISNPTWHSRASSWTLYMEVPQSIL 479
Query: 483 HAESNSRTAE-----TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSMNKYD 537
S ++T++ K F T VVVNDDCI+V+ EG+PNV GLFKG+ E+ S+NKY+
Sbjct: 480 RG-SQTKTSDYYEKNAVKRQFWTKVVVNDDCINVKAEGSPNVAGLFKGKIEETHSINKYN 538
Query: 538 CDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAHVDPED 597
C+LADLPVIILQQVASQ+I WL++EA+MVGIKANRDFL LS EL+QGET+ LVA VDPED
Sbjct: 539 CNLADLPVIILQQVASQIINWLYQEAMMVGIKANRDFLCLSLELEQGETLGLVASVDPED 598
Query: 598 MRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLCGGG 655
GCISWWLVMED FA E+KL + I+D AS+Y+KFLGHLSLD+LY+TL+DLVSLC GG
Sbjct: 599 SEGCISWWLVMEDSFAEEQKLHMSITDGASEYRKFLGHLSLDLLYATLIDLVSLCSGG 656
>gi|449447591|ref|XP_004141551.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like [Cucumis sativus]
gi|449481525|ref|XP_004156208.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like [Cucumis sativus]
Length = 662
Score = 860 bits (2221), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/666 (65%), Positives = 526/666 (78%), Gaps = 21/666 (3%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+ ++++S+DKLPVKRL+AIEE G ERFP DVGY+EKR SLIRRIDFAWA+EK
Sbjct: 1 MDEDMKVSLDKLPVKRLEAIEENGLERFPSDVGYEEKRLSLIRRIDFAWAIEK----DDD 56
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
K ++ SS ++TPWQWQ+M+ENLQ AHQELSVIIDLINTVEANDAVTVAGMTRPK LPN
Sbjct: 57 KKKQKKSSKESSTPWQWQSMIENLQLAHQELSVIIDLINTVEANDAVTVAGMTRPKPLPN 116
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
EVL+DL VS ATKLQC+RHLG YFKQSAK LE+Q+A+EARFYGALIRLQQNWKVKRQRVA
Sbjct: 117 EVLADLGVSVATKLQCFRHLGKYFKQSAKGLERQVAREARFYGALIRLQQNWKVKRQRVA 176
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
APA NEGFTIDLFDNS D + V RPSS STIR+DHDS+GMLAINLPPNSC S RFGFL
Sbjct: 177 APAPANEGFTIDLFDNSSCDPSSVFRPSS-STIRVDHDSSGMLAINLPPNSCHSLRFGFL 235
Query: 241 GVQSGDSSKQCSKVKNSCSPRPSKEA-----KESVNDDECVREKHSLLREVHQAIFYEQV 295
SG + + + + S PS ++ KE NDDE ++E +SLLR+VH AIF EQV
Sbjct: 236 ---SGCNVENPLERDKNESTNPSNQSSVNREKEFTNDDEYIKETNSLLRQVHHAIFSEQV 292
Query: 296 FDIVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESG 355
FD+VN+EAF S+G+NVTGIRENYLQL I G S+F+SL+PS Q +V+ ++ +E+
Sbjct: 293 FDLVNQEAFNPSVGINVTGIRENYLQLSIDQGTSVFISLVPSGQSSQTVEGANSEILENA 352
Query: 356 ILPLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVS 415
LP DS DG++L + + D L K PN +T+EIYLQQ+FHE ++ ++ ++PIS+ +R S
Sbjct: 353 SLPFDSLDGIELPD-RSDPLEKKLRNPNHITFEIYLQQIFHELVFVKSNDRPISSLSRQS 411
Query: 416 GPPTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFL 475
G + DGSGLLGHFCLSLAHR+F+N V +ELEN V VP+L L+SHPTW+SR SSWT F+
Sbjct: 412 GQVSNDGSGLLGHFCLSLAHRMFANNVLMELENVVWKVPFLQLISHPTWNSRKSSWTFFM 471
Query: 476 KVPQSILHAES-NSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSE 528
+VP SILH S +RT++ K+ F T VVVNDDCI +EGEG PNVVGLF G S+
Sbjct: 472 EVPWSILHPNSIKARTSDGYQMNNVTKTQFLTKVVVNDDCITIEGEGAPNVVGLFNGNSK 531
Query: 529 DMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVS 588
D+ S N+Y CDLADLPVIIL QVASQ+I WLHEEAL+ GIKANRDFLSLSFEL+QGET+
Sbjct: 532 DIYSTNRYSCDLADLPVIILLQVASQIILWLHEEALIWGIKANRDFLSLSFELEQGETLR 591
Query: 589 LVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDL 648
LVAHVDPED +GC+SWWLVM+DG +RKL+ + SD +Y+KFLGHLSL+VL+STLMDL
Sbjct: 592 LVAHVDPEDPQGCLSWWLVMDDGLMEDRKLNFETSDVVPEYRKFLGHLSLEVLHSTLMDL 651
Query: 649 VSLCGG 654
VSLC G
Sbjct: 652 VSLCSG 657
>gi|356563049|ref|XP_003549778.1| PREDICTED: uncharacterized protein LOC100778422 [Glycine max]
Length = 660
Score = 853 bits (2203), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/658 (64%), Positives = 519/658 (78%), Gaps = 14/658 (2%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSSK 64
L+IS+DKLP+KRLD+IEE G ERFP DV YDEKR SLIRRIDFAWAVEKD+ K+ KK
Sbjct: 7 LQISLDKLPIKRLDSIEENGIERFPLDVDYDEKRLSLIRRIDFAWAVEKDEEKKKKKQKS 66
Query: 65 ESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLS 124
+SA PWQWQ MVENLQ AHQELSVIIDLINTVEANDAVTVA MTRPK LPNE LS
Sbjct: 67 SKETSA---PWQWQGMVENLQLAHQELSVIIDLINTVEANDAVTVASMTRPKLLPNEALS 123
Query: 125 DLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPAS 184
DL+VSAATKLQCYRH+G YFKQSAK+ EQQ+A+EARFYGALIRLQQNWKVKRQR AA
Sbjct: 124 DLAVSAATKLQCYRHVGKYFKQSAKAFEQQVAREARFYGALIRLQQNWKVKRQRQAAIVP 183
Query: 185 GNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQS 244
GNEGFT DL DNS YD A + R S+ST+R++HD+AGMLAIN+ P+ C S +FGF+G QS
Sbjct: 184 GNEGFTFDLLDNS-YDQAAIIRSLSMSTVRVNHDAAGMLAINVSPDLCHSLQFGFVGAQS 242
Query: 245 GDSSKQCSKVKNSCSPRPS--KEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIVNRE 302
D+ ++ ++ K+ S + + +ES++D+ECV++ +SLLREVH+AIF EQVFD+VNRE
Sbjct: 243 DDTWRKSNENKSHFSGEHNLGEMGEESLSDEECVKKTNSLLREVHEAIFNEQVFDLVNRE 302
Query: 303 AFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILPLDSH 362
AF GV+VTGIRENYLQL +G G S++L+L+ ++Q +V+ +N NVE+ ILPL+S
Sbjct: 303 AFNTVAGVSVTGIRENYLQLSLGQGTSVYLTLVSNSQDHSTVEDELNDNVENAILPLESS 362
Query: 363 DGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGPPTKDG 422
DG+K E K G + N + YEIY+QQ+FHE+++G+ K IS+G R+SG +DG
Sbjct: 363 DGMK--REAKQNTSKKGQFSNSICYEIYIQQIFHEHIFGKGDEKAISSGNRLSGVQARDG 420
Query: 423 SGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQSIL 482
S LLGHF SLAHRIFS KV ELEN VC VPYL L+S+PTW+SR SSWT++++VPQSIL
Sbjct: 421 SSLLGHFFKSLAHRIFSTKVLAELENVVCKVPYLQLISNPTWNSRASSWTLYMEVPQSIL 480
Query: 483 HAESNSRTAE-----TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSMNKYD 537
S ++T++ AK F T VVVNDDCI+V+ EG+PNV GLFKG+ E+ S+NKY+
Sbjct: 481 RG-SQTKTSDYYEKNAAKRQFWTKVVVNDDCINVKAEGSPNVAGLFKGKIEETHSINKYN 539
Query: 538 CDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAHVDPED 597
C+LADLPVIILQQVASQ+I WL++EA+MVGIK NRDFL LS EL QGET+ LVA VDPED
Sbjct: 540 CNLADLPVIILQQVASQIINWLYQEAMMVGIKVNRDFLCLSLELKQGETLGLVASVDPED 599
Query: 598 MRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLCGGG 655
CISWWLVMED FA E+KL + I+D AS+Y+KFLGHLSLD+LY+TL+DLV LC GG
Sbjct: 600 SEECISWWLVMEDSFAEEQKLHMSITDGASEYRKFLGHLSLDLLYATLIDLVGLCSGG 657
>gi|357478013|ref|XP_003609292.1| hypothetical protein MTR_4g114100 [Medicago truncatula]
gi|355510347|gb|AES91489.1| hypothetical protein MTR_4g114100 [Medicago truncatula]
Length = 856
Score = 801 bits (2069), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/657 (61%), Positives = 499/657 (75%), Gaps = 13/657 (1%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSSK 64
L++S+DKLP+KRLD+IEE G ERFP DV YDEKR SLIRRIDFAWA+EKD ++ K
Sbjct: 7 LQLSLDKLPIKRLDSIEENGNERFPLDVDYDEKRVSLIRRIDFAWAIEKD-----EEKKK 61
Query: 65 ESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLS 124
+ SS TTPWQWQ MVENLQ AHQELSVIID INTVE NDAVTVA MTRPK+LPNE LS
Sbjct: 62 QKKSSKETTPWQWQGMVENLQLAHQELSVIIDFINTVETNDAVTVASMTRPKSLPNEALS 121
Query: 125 DLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPAS 184
DL+VSAATKLQCYR +G YFKQSAK+ EQQ+A+EARFYGALIRLQQNWKVKRQR +
Sbjct: 122 DLAVSAATKLQCYRQVGKYFKQSAKAFEQQVAREARFYGALIRLQQNWKVKRQRQTSLVP 181
Query: 185 GNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQS 244
GNEGFT DLFDNS YD + R SS+ST+R++HD+AGMLAIN+ P C S +FGF+ Q
Sbjct: 182 GNEGFTFDLFDNS-YDQGAIVRSSSMSTVRVNHDAAGMLAINVSPELCHSLQFGFVSAQP 240
Query: 245 GDSSKQCSKVKNSCSPRP--SKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIVNRE 302
D K+ ++ ++ + ES +D+ECV++ HSLLR+VHQAIF EQVFD+VNRE
Sbjct: 241 DDMQKKSNENQSHLLGEDCLGETGIESSSDEECVKKTHSLLRDVHQAIFNEQVFDLVNRE 300
Query: 303 AFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGIL-PLDS 361
AF S G +TGIRENYLQL +G G S++LSL+ + Q + +V+ + N + PL+S
Sbjct: 301 AFNTSTGFTLTGIRENYLQLSLGQGTSVYLSLVSTGQDNPTVEGELTNNADDNAFSPLES 360
Query: 362 HDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGPPTKD 421
D V + + + + L+K G + N YEIY+QQ++HE+++GR KPIS+G R+SG KD
Sbjct: 361 SD-VLMHDAQQNTLKKKGRHSNSTCYEIYIQQIYHEHIFGRGSEKPISSGNRLSGAQAKD 419
Query: 422 GSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQSI 481
GS LL HF +SLAHRIFS K+ ELEN V VPYL L+S+PTWHSR SSWT+F++VP SI
Sbjct: 420 GSYLLSHFFMSLAHRIFSTKILAELENVVFKVPYLQLISNPTWHSRGSSWTLFMEVPPSI 479
Query: 482 L---HAESNSRTAETAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSMNKYDC 538
L +++ K F T VVV DDCI V+ EG+PNV GLFKG+SED S+NKYDC
Sbjct: 480 LRGCQVKTSDFENNAIKRQFWTKVVVIDDCISVKAEGSPNVSGLFKGKSEDTHSINKYDC 539
Query: 539 DLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAHVDPEDM 598
+LADLPVIILQQVASQ+I WL+ EALMVGIKANRDFL LSFEL+QGET+ LVA+VDP+D
Sbjct: 540 NLADLPVIILQQVASQIINWLYHEALMVGIKANRDFLCLSFELEQGETLGLVANVDPKDS 599
Query: 599 RGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLCGGG 655
GCISW LVMED FA +KL +++D AS+Y+KFLG LSL++LY+TL+DL++ GG
Sbjct: 600 DGCISWSLVMEDSFAEVQKLHTNLTDGASEYRKFLGPLSLELLYATLIDLIAFVSGG 656
>gi|297812205|ref|XP_002873986.1| hypothetical protein ARALYDRAFT_351112 [Arabidopsis lyrata subsp.
lyrata]
gi|297319823|gb|EFH50245.1| hypothetical protein ARALYDRAFT_351112 [Arabidopsis lyrata subsp.
lyrata]
Length = 652
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/663 (59%), Positives = 497/663 (74%), Gaps = 25/663 (3%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+ ++EIS+D+LP+KRL++IEE GAERFP DVGYD+KR SLIRRIDFAWA+E++D + K
Sbjct: 1 MDSDMEISLDRLPIKRLESIEENGAERFPSDVGYDDKRVSLIRRIDFAWALEEEDELKKK 60
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
K K S SA W+W+ MVENLQ AHQELSVIIDLI+TV+ANDAVTVAGMTRPK LPN
Sbjct: 61 KQKKSSKDSAEQ--WKWKGMVENLQLAHQELSVIIDLIDTVQANDAVTVAGMTRPKPLPN 118
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
E+LSDL+VS ATKLQ YR+LG YFKQSAK+LEQ+I +EARFYGALIRLQ+NWKVKRQR+
Sbjct: 119 EILSDLAVSTATKLQGYRNLGNYFKQSAKALEQKINREARFYGALIRLQRNWKVKRQRML 178
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
A + NEGFTIDL D+SL+D RPS+LSTIR+DHDSAGMLAIN+P +S S RFGF+
Sbjct: 179 ASNASNEGFTIDLSDSSLHDPTSGFRPSTLSTIRVDHDSAGMLAINVPQDSWYSLRFGFV 238
Query: 241 GVQSGDSSKQCSKVKNSCSPR--PSKEAKESVNDDECVREKHSLLREVHQAIFYEQV-FD 297
G+ D++ + + +S P K+S +DDE V+E HSLLREVH++IF EQV FD
Sbjct: 239 GLNPIDNTNESDEHLDSTMGHDIPGTSEKQSASDDEYVKETHSLLREVHKSIFAEQVLFD 298
Query: 298 IVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGIL 357
++NREAF + +G N++GIREN++++ IG G S+F+SL PS + + S+ + ES L
Sbjct: 299 MLNREAFNEGVGFNISGIRENFMEMSIGQGASLFVSLHPSGK-NASI-----KKSESATL 352
Query: 358 PLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGP 417
++S ++ AE D K G+PN +YEIYLQQ+FHE+ +G+AK++P S R S
Sbjct: 353 LIESSGRIEPAE--GDYRLKKLGFPNRASYEIYLQQIFHEHAFGKAKDQPKSKSIRASNQ 410
Query: 418 PTKD-GSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLK 476
KD SGLL HFCLSL HRIFSN+V LE+ VC VPYLHL+SHPTW+SRTSSWTVF+
Sbjct: 411 TRKDSNSGLLDHFCLSLTHRIFSNRVLEHLESVVCKVPYLHLISHPTWNSRTSSWTVFMT 470
Query: 477 VPQSILHAESNSRTAETAKSH----FRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSS 532
VP SI+ S+ + K + FRT VVV D+CI VE E TPNVVGL K S ++ +
Sbjct: 471 VPPSIIPQGSSETQSPDGKRNLKMQFRTKVVVKDECISVEAECTPNVVGLLKSSSCNLFA 530
Query: 533 MNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAH 592
MNKY+C +ADLPVIILQQVASQ++ WL EEA VG KA+RDFLSLS E+ +GE VSLVA
Sbjct: 531 MNKYECGVADLPVIILQQVASQIVCWLLEEARTVGTKASRDFLSLSLEIVEGERVSLVAQ 590
Query: 593 VDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLC 652
V+PED +GCISWWLVME+G ER + S+ +K LGHLSLDVLYS LMDL++LC
Sbjct: 591 VNPEDAKGCISWWLVMENGSTEER-------EGVSESRKLLGHLSLDVLYSVLMDLINLC 643
Query: 653 GGG 655
G G
Sbjct: 644 GTG 646
>gi|145358255|ref|NP_197517.3| RNA polymerase II transcription mediator [Arabidopsis thaliana]
gi|395406781|sp|F4K460.1|MED17_ARATH RecName: Full=Mediator of RNA polymerase II transcription subunit
17
gi|332005425|gb|AED92808.1| RNA polymerase II transcription mediator [Arabidopsis thaliana]
Length = 653
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/662 (59%), Positives = 497/662 (75%), Gaps = 23/662 (3%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+ ++EIS+D+LP+KRL++IEE GAERFP DV YD+KR SLIRRIDFAWA+E+ LK
Sbjct: 1 MDSDMEISLDRLPIKRLESIEENGAERFPSDVDYDDKRVSLIRRIDFAWALEE--EDELK 58
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
K ++ SS + W+W+ MVENLQ AHQEL+VIIDLI+TV+ANDAVTVAGMTRPK +PN
Sbjct: 59 KKKQKKSSKDSVEQWKWKGMVENLQLAHQELTVIIDLIDTVQANDAVTVAGMTRPKPMPN 118
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
E+LSDL+VS ATKLQ YR+LG YFKQSAK+LEQ+I +EARFYGALIRLQ+NWKVKRQR+
Sbjct: 119 EILSDLAVSTATKLQGYRNLGNYFKQSAKALEQKINREARFYGALIRLQRNWKVKRQRML 178
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
A + NEGFTIDL D+SLYD RPS+LSTIR+DHDSAGMLAIN+P +S S RFG++
Sbjct: 179 ASNASNEGFTIDLSDSSLYDPTSGFRPSTLSTIRVDHDSAGMLAINVPQDSWYSLRFGYV 238
Query: 241 GVQSGDSSKQCSKVKNSCSPR--PSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDI 298
G+ +S + + +S + P K S +DD+ V+E HSLLREVH++IF EQ+FD+
Sbjct: 239 GLNPIGNSNESDEHIDSTTGHDIPGTSEKLSASDDKYVKETHSLLREVHKSIFAEQLFDM 298
Query: 299 VNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILP 358
+NREAF + +G N++G+REN++++ IG G S+F+SL PS + S ES L
Sbjct: 299 LNREAFNEGVGFNISGLRENFMEMSIGQGASLFVSLHPSGKNPSIKKS------ESATLL 352
Query: 359 LDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGPP 418
++S V+ AE D L+K G+PN +YEIYLQQ+FHE+ +G+AK++ S R S
Sbjct: 353 IESSGRVEPAEGGDYRLKKL-GFPNRTSYEIYLQQIFHEHAFGKAKDQLKSKSIRASNQT 411
Query: 419 TKD-GSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKV 477
KD SGLL HFCLSL HRIFSN+V V LE+ VC VPYLHL+SHPTW+SRTSSWTVF+ V
Sbjct: 412 EKDSNSGLLDHFCLSLTHRIFSNRVLVHLESVVCKVPYLHLISHPTWNSRTSSWTVFMTV 471
Query: 478 PQSIL---HAESNSRTAE-TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSM 533
P SI+ +E+ S + K+ FRT VVV DDCI VE E TPNVVGL K S ++ S+
Sbjct: 472 PPSIIPQGRSETQSPDGKRNLKTQFRTKVVVKDDCISVEAECTPNVVGLLKSSSCNLFSI 531
Query: 534 NKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAHV 593
NKY+CD+ADLPV+ILQQVASQ++ WL EEA VG KA+R+FLSLS E+ +GE VSLVAHV
Sbjct: 532 NKYECDVADLPVMILQQVASQIVCWLLEEARTVGTKASREFLSLSLEIVEGERVSLVAHV 591
Query: 594 DPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLCG 653
+PED +GCISWWLVME+G ER + S+ +K LGHLSLDVLYS LMDL++LCG
Sbjct: 592 NPEDAKGCISWWLVMENGCTEER-------EGVSESRKLLGHLSLDVLYSVLMDLINLCG 644
Query: 654 GG 655
G
Sbjct: 645 TG 646
>gi|115489788|ref|NP_001067381.1| Os12g0638600 [Oryza sativa Japonica Group]
gi|77556810|gb|ABA99606.1| expressed protein [Oryza sativa Japonica Group]
gi|113649888|dbj|BAF30400.1| Os12g0638600 [Oryza sativa Japonica Group]
gi|215694565|dbj|BAG89558.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218187319|gb|EEC69746.1| hypothetical protein OsI_39278 [Oryza sativa Indica Group]
Length = 655
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 341/671 (50%), Positives = 448/671 (66%), Gaps = 31/671 (4%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M + + +DKLP+KRL AI+E G E +PPD +E+R S IRRIDF+W ++KD K K
Sbjct: 1 MEEAVRVDLDKLPIKRLHAIDEAGNEHYPPDTSSEEQRLSAIRRIDFSWVIDKDAKKPKK 60
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
++++ A W WQ M+E+LQ A QELSV+IDLI+TVEANDAV VAGM +PK+LP
Sbjct: 61 DTAQQQQQQA----WPWQGMMESLQQAQQELSVVIDLISTVEANDAVAVAGMLKPKSLPT 116
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
E L D +VSAATKLQ RHL YFKQSAK++EQQ KE+RFYG+LIRLQQNWKVKRQR
Sbjct: 117 ETLVDTAVSAATKLQRVRHLSRYFKQSAKTMEQQFQKESRFYGSLIRLQQNWKVKRQRFG 176
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
G+EGF DL D S D+A + R SSL I ID DS+G L++ +P SCR F
Sbjct: 177 GSGPGSEGFMFDLIDTSQLDTAAMPRLSSLPLIPIDQDSSGTLSVQVPQKSCRFLSLNFR 236
Query: 241 GVQSGDSSKQCSKVKNSCSPRPSKEAKESVNDD--ECVREKHSLLREVHQAIFYEQVFDI 298
G + K+K+ S S + E+ NDD + ++ HS+LR +H++IF EQVFD+
Sbjct: 237 GDSANGVENYGHKLKDGIS---SITSSETDNDDVNKSIKHAHSILRNIHKSIFEEQVFDM 293
Query: 299 VNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLS--VDSWVNQNVE-SG 355
V RE F QS G+NVTG+RE++LQL IG S+ LSL S G S VD + N E +
Sbjct: 294 VIRETFVQSQGINVTGMREDFLQLAIGQECSLCLSLAHSGDGSDSEMVDHEDHANSEDAS 353
Query: 356 ILPLDSHDGVKLAEEKDDILRKS-GGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRV 414
L L + +G K D LRK G+PNP + EIYL Q+FHE + + + K ++ G R
Sbjct: 354 NLVLVTMNG------KLDPLRKDVTGFPNPRSLEIYLLQLFHENILRKVREKSLNIG-RY 406
Query: 415 SGPP--TKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWT 472
P D GLLGHFCL++AHRIFSNKV VELE+ V VPYLHL S PTWHSRTSSW+
Sbjct: 407 QSPAQVAGDDYGLLGHFCLTVAHRIFSNKVLVELESVVSRVPYLHLRSLPTWHSRTSSWS 466
Query: 473 VFLKVPQSILHAESNSRTAET------AKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGR 526
+ LKVPQ IL A+ ++ ++ ++S F T V+V D I + GEG+P++ G G+
Sbjct: 467 LCLKVPQPILAADRIAKPSDNHELKYKSRSQFNTKVIVKDSQISLMGEGSPSIAGSLTGK 526
Query: 527 SEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGET 586
D +N Y+CDL DLP ++LQQVASQVI WLHEEAL++G+ RDFL L F+L+QGET
Sbjct: 527 PSDGYLVNSYNCDLEDLPTMLLQQVASQVIHWLHEEALVLGMNVTRDFLCLYFDLEQGET 586
Query: 587 VSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLM 646
+ LVA+VDP+D GCISW+L + D + K+S D + + ++FLG++SL+VLYSTLM
Sbjct: 587 LGLVANVDPDDTCGCISWYLTI-DHPTEDGKMSADSQE--FEKRRFLGYVSLEVLYSTLM 643
Query: 647 DLVSLCGGGSH 657
DL++LC G+H
Sbjct: 644 DLINLCNAGAH 654
>gi|222617546|gb|EEE53678.1| hypothetical protein OsJ_37013 [Oryza sativa Japonica Group]
Length = 655
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 339/671 (50%), Positives = 448/671 (66%), Gaps = 31/671 (4%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M + + +DKLP+KRL AI+E G E +PPD +E+R S IRRIDF+W ++KD K
Sbjct: 1 MEEAVRVDLDKLPIKRLHAIDEAGNEHYPPDTSSEEQRLSAIRRIDFSWVIDKD----AK 56
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
K + SS+++ + + M+E+LQ A QELSV+IDLI+TVEANDAV VAGM +PK+LP
Sbjct: 57 KPKRTPPSSSSSRHGRGRGMMESLQQAQQELSVVIDLISTVEANDAVAVAGMLKPKSLPT 116
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
E L D +VSAATKLQ RHL YFKQSAK++EQQ KE+RFYG+LIRLQQNWKVKRQR
Sbjct: 117 ETLVDTAVSAATKLQRVRHLSRYFKQSAKTMEQQFQKESRFYGSLIRLQQNWKVKRQRFG 176
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
G+EGF DL D S D+A + R SSL I ID DS+G L++ +P SCR F
Sbjct: 177 GSGPGSEGFMFDLIDTSQLDTAAMPRLSSLPLIPIDQDSSGTLSVQVPQKSCRFLSLNFR 236
Query: 241 GVQSGDSSKQCSKVKNSCSPRPSKEAKESVNDD--ECVREKHSLLREVHQAIFYEQVFDI 298
G + K+K+ S S + E+ NDD + ++ HS+LR +H++IF EQVFD+
Sbjct: 237 GDSANGVENYGHKLKDGIS---SITSSETDNDDVNKSIKHAHSILRNIHKSIFEEQVFDM 293
Query: 299 VNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLS--VDSWVNQNVE-SG 355
V RE F QS G+NVTG+RE++LQL IG S+ LSL S G S VD + N E +
Sbjct: 294 VIRETFVQSQGINVTGMREDFLQLAIGQECSLCLSLAHSGDGSDSEMVDHEDHANSEDAS 353
Query: 356 ILPLDSHDGVKLAEEKDDILRKS-GGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRV 414
L L + +G K D LRK G+PNP + EIYL Q+FHE + + + K ++ G R
Sbjct: 354 NLVLVTMNG------KLDPLRKDVTGFPNPRSLEIYLLQLFHENILRKVREKSLNIG-RY 406
Query: 415 SGPP--TKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWT 472
P D GLLGHFCL++AHRIFSNKV VELE+ V VPYLHL S PTWHSRTSSW+
Sbjct: 407 QSPAQVAGDDYGLLGHFCLTVAHRIFSNKVLVELESVVSRVPYLHLRSLPTWHSRTSSWS 466
Query: 473 VFLKVPQSILHAESNSRTAET------AKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGR 526
+ LKVPQ IL A+ ++ ++ ++S F T V+V D I + GEG+P++ G G+
Sbjct: 467 LCLKVPQPILAADRIAKPSDNHELKYKSRSQFNTKVIVKDSQISLMGEGSPSIAGSLTGK 526
Query: 527 SEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGET 586
D +N Y+CDL DLP ++LQQVASQVI WLHEEAL++G+ RDFL L F+L+QGET
Sbjct: 527 PSDGYLVNSYNCDLEDLPTMLLQQVASQVIHWLHEEALVLGMNVTRDFLCLYFDLEQGET 586
Query: 587 VSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLM 646
+ LVA+VDP+D GCISW+L + D + K+S D + + ++FLG++SL+VLYSTLM
Sbjct: 587 LGLVANVDPDDTCGCISWYLTI-DHPTEDGKMSADSQE--FEKRRFLGYVSLEVLYSTLM 643
Query: 647 DLVSLCGGGSH 657
DL++LC G+H
Sbjct: 644 DLINLCNAGAH 654
>gi|357135663|ref|XP_003569428.1| PREDICTED: uncharacterized protein LOC100822959 [Brachypodium
distachyon]
Length = 656
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 322/671 (47%), Positives = 429/671 (63%), Gaps = 30/671 (4%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+ + + +DKLP+KRLDAI+E G E +PPD +E+R + IRR+DF+W +E+D K K
Sbjct: 1 MDEGVRLDLDKLPIKRLDAIDEAGNEHYPPDTSSEEQRLAAIRRVDFSWVIERDAKKAKK 60
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
+ + S WQWQ ++E+LQ A QEL+V++DLI+TVEANDAV V ++PK+LPN
Sbjct: 61 AAEDTAQQS-----WQWQGLMESLQQAQQELTVVLDLISTVEANDAVAVTTTSKPKSLPN 115
Query: 121 EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVA 180
EVL D++VSAATKLQ RHLG YFKQSAK++EQQ KEARFYG+LIRLQQNWKVKR R
Sbjct: 116 EVLDDMAVSAATKLQRLRHLGRYFKQSAKTMEQQFQKEARFYGSLIRLQQNWKVKRHRFG 175
Query: 181 APASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFL 240
A G++ F D+ D S D+A + R SS+S + ID DS+G L++ +P SCR
Sbjct: 176 ATGPGSDSFMFDVVDTSQLDTAAMPRMSSMSLVPIDQDSSGTLSVQVPQKSCRFLSLQLR 235
Query: 241 GVQSGDSSKQCSKVKNSCSPRPSKEAKESVNDD--ECVREKHSLLREVHQAIFYEQVFDI 298
G + + K+K S + E NDD + V+ HS+LR +H++IF EQVFD+
Sbjct: 236 GDNASSAESYACKMKGISSTTSAAENDALENDDVNKSVKHAHSILRNIHKSIFEEQVFDM 295
Query: 299 VNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILP 358
V RE F S G+NVTG+RE++LQL IG + LSL L DS +E P
Sbjct: 296 VIRETFVPSQGINVTGMREDFLQLAIGQENLLCLSL---EHSGLDTDS----EMEGLEEP 348
Query: 359 LDSHDGVKLA----EEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRV 414
DS D L K + + NP + EIYL +FHE + + + K R
Sbjct: 349 TDSEDASNLVLATINGKQEPSKMDASGLNPKSLEIYLLHMFHENILRKVREK-YRNIVRY 407
Query: 415 SGPPTK--DGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWT 472
P D LLGHFC+++AHRIFSNKVH+ELE+ V VPYLHL S PTWHSRTSSW+
Sbjct: 408 PSPAQAAGDDCNLLGHFCMTVAHRIFSNKVHLELESVVSRVPYLHLQSLPTWHSRTSSWS 467
Query: 473 VFLKVPQSILHAE------SNSRTAETAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGR 526
+ L+VPQ IL A+ N ++S F T V++ D I + GEG+P++ G +
Sbjct: 468 LCLRVPQPILAADRVTKPLDNDEPKYKSRSQFNTKVILKDGQISLMGEGSPSIAGSLTRK 527
Query: 527 SEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGET 586
D +N Y+CDL DLP ++LQQVASQVI WLHEEAL++G+ RDFL L F+LDQG+T
Sbjct: 528 PSDGYLINSYNCDLEDLPTMLLQQVASQVINWLHEEALVLGMNVTRDFLCLYFDLDQGDT 587
Query: 587 VSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLM 646
+ LVAHVDP+D GCISW+L + D A + K+S D + + ++FLG+LSL+VLYSTLM
Sbjct: 588 LGLVAHVDPDDAYGCISWYLTI-DHSAKDGKMSTD--NPEFEKRRFLGYLSLEVLYSTLM 644
Query: 647 DLVSLCGGGSH 657
DL++LC G H
Sbjct: 645 DLINLCSTGVH 655
>gi|226494373|ref|NP_001145828.1| uncharacterized protein LOC100279335 [Zea mays]
gi|194704804|gb|ACF86486.1| unknown [Zea mays]
gi|219884585|gb|ACL52667.1| unknown [Zea mays]
gi|413947117|gb|AFW79766.1| hypothetical protein ZEAMMB73_523859 [Zea mays]
Length = 661
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 328/665 (49%), Positives = 438/665 (65%), Gaps = 31/665 (4%)
Query: 3 GNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKS 62
GN+ + +DKLP+KRL+AIEETG E +PPD +E+R + IRRIDF+W +EKD K K +
Sbjct: 4 GNVRVDLDKLPIKRLEAIEETGNEHYPPDTSNEEQRLAAIRRIDFSWVIEKDAKKAKKAA 63
Query: 63 SKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEV 122
+++ A W WQ ++E+LQ AHQELSV+IDLI TVEANDAV VA T+PK+ PNE+
Sbjct: 64 EADTAQQA----WPWQGLMESLQQAHQELSVVIDLIGTVEANDAVAVASTTKPKSQPNEI 119
Query: 123 LSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAP 182
L D++VSAATKLQ RHL YFKQSAK++EQQ KE RFYG+LIRLQQNWKVKRQR A
Sbjct: 120 LVDMAVSAATKLQRLRHLSWYFKQSAKTMEQQFQKETRFYGSLIRLQQNWKVKRQR-AVG 178
Query: 183 ASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGV 242
+ G+EGF DL D+S D+ V R S LS I ID DS+G L++ +P S RS F G
Sbjct: 179 SPGSEGFMFDLVDSSQLDTTTVLRLSPLSLIPIDQDSSGTLSVQIPQKSLRSLSLKFHGD 238
Query: 243 QSGDSSKQCSKVKNSCSPRPSKEAKESV--NDD--ECVREKHSLLREVHQAIFYEQVFDI 298
++ K K + EA++ V +DD + +R HS+LR++H++IF EQVFD+
Sbjct: 239 IVNNAESSAIKKKEGTLTNTTAEAEKDVLADDDVNKSIRYAHSILRDIHKSIFEEQVFDM 298
Query: 299 VNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQ----GDLSVDSWVNQNVES 354
V RE F QS G+NVTG+ E++LQL IG S+ LSL S Q G + ++ N
Sbjct: 299 VIRETFNQSQGINVTGMCEDFLQLAIGQECSLCLSLELSGQNNNAGTVGQQDHMDTNYPG 358
Query: 355 GILPLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYG-RAKNKPISTGTR 413
+ V K+ + G+PNP + EIYL +FHE L R K++ + +
Sbjct: 359 NL-------AVATVNGKESSNKDVRGFPNPKSLEIYLLHMFHEILRKLREKSRHV-VRYQ 410
Query: 414 VSGPPTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTV 473
S D GLL HFC+++AHRIFSNKVH+ELE+ V VPYLHL S PTWHSRTSSW++
Sbjct: 411 SSAQAAPDDCGLLSHFCMTVAHRIFSNKVHLELESVVSRVPYLHLRSLPTWHSRTSSWSL 470
Query: 474 FLKVPQSILHAESNSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRS 527
LKVPQ IL ++ ++ ++S F T V++ D I + GEG+P++VG G+
Sbjct: 471 CLKVPQPILATGRVTKPSDYHELKYKSRSQFSTKVILKDGQISLMGEGSPSIVGSLTGKP 530
Query: 528 EDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETV 587
D +N Y+CDL DLP+++LQQVASQVI WLH+EA+++G+ RDFL L F+L QGET+
Sbjct: 531 SDSRLINSYNCDLEDLPMMLLQQVASQVIHWLHDEAMILGMNVTRDFLCLYFDLGQGETL 590
Query: 588 SLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMD 647
L+AHVDP+D GCISW+L +E E K+S S+ ++FLG+LSL+VLYSTLMD
Sbjct: 591 GLLAHVDPDDTYGCISWYLTVEHPM-EEGKMS--AGSPESEKRRFLGYLSLEVLYSTLMD 647
Query: 648 LVSLC 652
L+ LC
Sbjct: 648 LIKLC 652
>gi|242056755|ref|XP_002457523.1| hypothetical protein SORBIDRAFT_03g008700 [Sorghum bicolor]
gi|241929498|gb|EES02643.1| hypothetical protein SORBIDRAFT_03g008700 [Sorghum bicolor]
Length = 659
Score = 591 bits (1524), Expect = e-166, Method: Compositional matrix adjust.
Identities = 316/663 (47%), Positives = 433/663 (65%), Gaps = 22/663 (3%)
Query: 4 NLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSS 63
++ + +DKLP+KRL+AI+E G E +PPD +E+R + IRRIDF+W + K KK+
Sbjct: 5 DVRVDLDKLPIKRLEAIDEIGNEHYPPDTSNEEQRLAAIRRIDFSWVI----EKDAKKAK 60
Query: 64 KESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVL 123
K + + T W WQ ++E+LQ AH ELSV+IDLI TVEANDAV VA T+PK+ PNE+L
Sbjct: 61 KAAEADTTQQVWPWQGLMESLQQAHHELSVVIDLIGTVEANDAVAVASTTKPKSQPNEIL 120
Query: 124 SDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPA 183
D++VSAATKLQ RHL YFKQSA+++EQQ KE RFY +LIRLQQNWKVKRQR A +
Sbjct: 121 VDMAVSAATKLQRLRHLSRYFKQSARTMEQQFQKETRFYSSLIRLQQNWKVKRQR-AVGS 179
Query: 184 SGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQ 243
G+EGF DL D+S D+ + R S LS I ID DS+G L++ +P S RS F G
Sbjct: 180 PGSEGFMFDLVDSSQLDTTTMPRLSPLSLIPIDQDSSGTLSVQIPQKSFRSLTLQFRGDI 239
Query: 244 SGDSSKQCSKVKNSCSPRPSKEAKESV--NDD--ECVREKHSLLREVHQAIFYEQVFDIV 299
+ ++ + K K + EA++ V N D + ++ HS+LR++H++IF EQVFD+V
Sbjct: 240 ANNAERSAIKKKEGTLTNTTTEAEKDVLENGDVNKSIKHAHSILRDIHKSIFEEQVFDMV 299
Query: 300 NREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILPL 359
RE F QS G+NVTG+ E++LQL IG S+ LSL S Q +S ++++
Sbjct: 300 IRETFVQSQGINVTGMCEDFLQLAIGQECSLCLSLELSRQNSISETVGQEDHMDTD---Y 356
Query: 360 DSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPIS-TGTRVSGPP 418
+ V K+ + G+PNP EIYL +FHE + + + K + S
Sbjct: 357 TENLAVATVNGKESSNKDVRGFPNPKCLEIYLLHMFHENILRKLREKSRHMVRYQSSAQA 416
Query: 419 TKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVP 478
D GLL HFC++++HRIFSNKVH+ELE+ V VPYLHL S PTWHSRTSSW++ LKVP
Sbjct: 417 APDDCGLLSHFCMTVSHRIFSNKVHLELESVVSRVPYLHLCSLPTWHSRTSSWSLCLKVP 476
Query: 479 QSILHAESNSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSS 532
Q IL + + ++ ++S F T V++ D I + GEG+P++ G G+ D
Sbjct: 477 QPILATDRVRKPSDHHKLKYKSRSQFSTKVILKDGQISLMGEGSPSIAGSLTGKPSDGRL 536
Query: 533 MNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAH 592
+N Y+CDL DLP+++LQQVASQVI WLH+EA+++G+ RDFL L F+LDQGET+ LVAH
Sbjct: 537 INSYNCDLEDLPMMLLQQVASQVIHWLHDEAMVLGMNVTRDFLCLYFDLDQGETLGLVAH 596
Query: 593 VDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSLC 652
VDP+D GCISW+L + D E K+S D + + ++FLG+LSL+VLYSTLMDL+ LC
Sbjct: 597 VDPDDTYGCISWYLTV-DHPMEEGKMSADSPE--LEKRRFLGYLSLEVLYSTLMDLIKLC 653
Query: 653 GGG 655
G
Sbjct: 654 STG 656
>gi|223945737|gb|ACN26952.1| unknown [Zea mays]
gi|413926580|gb|AFW66512.1| hypothetical protein ZEAMMB73_010825 [Zea mays]
gi|413926581|gb|AFW66513.1| hypothetical protein ZEAMMB73_010825 [Zea mays]
Length = 660
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 318/675 (47%), Positives = 430/675 (63%), Gaps = 43/675 (6%)
Query: 3 GNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKS 62
G++ + +DKLP+KRL+AI+ETG E +PPD +E+R + IRRIDF+W +EKD K K +
Sbjct: 4 GDVLVDLDKLPMKRLEAIDETGNEHYPPDTSNEEQRLAAIRRIDFSWVIEKDAKKAKKAA 63
Query: 63 SKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEV 122
+++ A W WQ ++E+LQ AHQELSV+IDLI TVEANDA+ VA T+PK+ PNE+
Sbjct: 64 EADTTQQA----WPWQGLMESLQQAHQELSVVIDLIGTVEANDAMAVASTTKPKSQPNEI 119
Query: 123 LSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQR-VAA 181
L D++V AATKLQC RHL YFKQSA ++EQQ KE RFY +LIRLQQNWKVKRQR V +
Sbjct: 120 LVDMAVCAATKLQCLRHLSRYFKQSATTMEQQFQKETRFYSSLIRLQQNWKVKRQRSVGS 179
Query: 182 PASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLG 241
P G+EGF DL D S D+ + R S LS I ID DS+G L++ +P S F G
Sbjct: 180 P--GSEGFMFDLVDTSQLDTTTMPRLSPLSLIPIDQDSSGTLSVQIPQKCFHSLSLRFHG 237
Query: 242 VQSGDSSKQCSKVKNSCSPRPSKEAKESV--NDD--ECVREKHSLLREVHQAIFYEQVFD 297
+ ++ K K + A+ V NDD + HS+LR++H++IF EQVFD
Sbjct: 238 DIANNAESSAVKRKEGTLTNTTTGAENDVLENDDVNKSTEHAHSILRDIHKSIFEEQVFD 297
Query: 298 IVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGIL 357
+V +E F QS G+NVTG+ E++LQL IG S+ LS DLS QN SG +
Sbjct: 298 MVIQETFIQSQGINVTGMCEDFLQLAIGQECSLCLS------HDLS-----GQNNNSGTV 346
Query: 358 PLDSHD--------GVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKP-- 407
+ H V K+ + G+PNP + EIYL VFHE + + + K
Sbjct: 347 GQEDHMDIDHTGNLAVATVNGKESSNKDVRGFPNPKSLEIYLLHVFHEGILRKLREKSRH 406
Query: 408 -ISTGTRVSGPPTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHS 466
+ + G P D GLL HFC++++HRIFS+KVH+ELE+ V VPYLHL S PTWHS
Sbjct: 407 VVRYQSSAQGTP--DECGLLSHFCMTVSHRIFSSKVHLELESVVSRVPYLHLCSLPTWHS 464
Query: 467 RTSSWTVFLKVPQSILHAESNSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVV 520
RTSSW++ LKVPQ IL + R ++ ++S F T V++ D I + GEG+P++
Sbjct: 465 RTSSWSLCLKVPQPILATDRVRRPSDYSELKCKSRSQFGTKVILKDGQISLMGEGSPSIA 524
Query: 521 GLFKGRSEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFE 580
G G+ D +N Y+CDL DLP+++LQQVASQVI WLH+EA+++G+ A DFL L +
Sbjct: 525 GSLAGKPSDGRLVNSYNCDLEDLPMMLLQQVASQVIHWLHDEAVVLGMNATIDFLCLYLD 584
Query: 581 LDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDV 640
LDQGET+ LVAHVDP+D GC+SW+L ++ E K+S + ++FLG+LS ++
Sbjct: 585 LDQGETLGLVAHVDPDDTYGCVSWYLTLDHHPTEEGKMS--AGSPELEERRFLGYLSPEL 642
Query: 641 LYSTLMDLVSLCGGG 655
LYSTLMDLV LC G
Sbjct: 643 LYSTLMDLVKLCSTG 657
>gi|118481027|gb|ABK92467.1| unknown [Populus trichocarpa]
Length = 366
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 253/366 (69%), Positives = 301/366 (82%), Gaps = 6/366 (1%)
Query: 298 IVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGIL 357
+VNR A QS G+NVTGI+ENYLQL IG GISIF+S++PS+QGD ++DS +N+ES ++
Sbjct: 1 MVNRGAVNQSSGLNVTGIQENYLQLCIGPGISIFISIVPSDQGDQAIDSEGPENLESAVV 60
Query: 358 PLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGP 417
PLDS DGVKLAEEK + L K +PN +TYEIYL+Q+FHEY++ AK +P TGTR+ G
Sbjct: 61 PLDSFDGVKLAEEKHNSLTKKPRFPNCITYEIYLKQIFHEYVFVEAKGRPSFTGTRMPGQ 120
Query: 418 PTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKV 477
P DGSGLL HFCLSL+HRI SNKV +ELEN VC VPYLHL+SHPTWHSR+S+WT+F+K+
Sbjct: 121 PANDGSGLLSHFCLSLSHRIISNKVLMELENVVCRVPYLHLISHPTWHSRSSAWTIFMKI 180
Query: 478 PQSILHAESNSRTAE------TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMS 531
P SILHA S +RT + KS F T VVV+DDCI++E EG PNVVGLFK S+D
Sbjct: 181 PPSILHASSQTRTPDIQNMKNVVKSEFWTKVVVHDDCINIEAEGAPNVVGLFKDSSDDKC 240
Query: 532 SMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVA 591
S NKYDC+L DLPVIILQQVASQVIRWLHEEAL VGIKANRDFL LSFEL+QGE ++LVA
Sbjct: 241 STNKYDCNLDDLPVIILQQVASQVIRWLHEEALAVGIKANRDFLCLSFELEQGEILNLVA 300
Query: 592 HVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLVSL 651
HVDPED +GCISWWL MEDGFA E+KL ++I+D AS+Y+KFLG+L LDVLYSTLMDLVSL
Sbjct: 301 HVDPEDTQGCISWWLTMEDGFAEEKKLHMNIADGASEYRKFLGYLPLDVLYSTLMDLVSL 360
Query: 652 CGGGSH 657
CGGGSH
Sbjct: 361 CGGGSH 366
>gi|326527475|dbj|BAK08012.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 590
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 277/594 (46%), Positives = 383/594 (64%), Gaps = 25/594 (4%)
Query: 78 QNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLSDLSVSAATKLQCY 137
Q + E+LQ AHQEL+V++DLI+TVEAND VTVA ++PK ++VL +++VSAATKLQ
Sbjct: 1 QGLHESLQLAHQELTVVLDLISTVEANDTVTVATFSKPKKRLDKVLVNMAVSAATKLQRL 60
Query: 138 RHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPASGNEGFTIDLFDNS 197
RHLG YFKQSAK++EQQ KEARFYG+LIRLQQNWKV RQ AP G+ F D+ D S
Sbjct: 61 RHLGRYFKQSAKTMEQQFQKEARFYGSLIRLQQNWKVNRQCGNAP--GSNSFMFDVVDTS 118
Query: 198 LYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLG-VQSGDSSKQCSKVKN 256
D+A + R SSLS + ID DS+G L++++P SCR F G SG S C+
Sbjct: 119 HLDTAVMPRSSSLSLVPIDQDSSGTLSVHVPQKSCRFLSLQFCGDSTSGTESYACNTKGV 178
Query: 257 SCSPRPSKEAKESVNDD--ECVREKHSLLREVHQAIFYEQVFDIVNREAFKQSLGVNVTG 314
S + + E NDD + V++ HS+LR +H++IF EQVFD+V E F Q+ GVNVTG
Sbjct: 179 SSTTSSAVEDDVPENDDVNKSVKQAHSILRNIHRSIFEEQVFDMVTCETFVQTKGVNVTG 238
Query: 315 IRENYLQLGIGLGISIFLSLIPSNQ-GDLSVDSWVNQNVESGILPLDSHDGVKLAEEKDD 373
+ E++LQ+ I I + LSL+ S Q D + N L L + +G +++
Sbjct: 239 MWEDFLQIAIDQEILLCLSLVNSGQDSDSEMAGHEEHNNSEANLVLATTNG-----KQEP 293
Query: 374 ILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGP-----PTKDGSGLLGH 428
+ + G+ NP + EIYL +FH+ + + + K R P P D GLLGH
Sbjct: 294 LKSDASGFLNPKSLEIYLLHMFHDNILRKVREK-YRNIVRYQSPGQTAEPAGDECGLLGH 352
Query: 429 FCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQSILHAESNS 488
FC+++AH+IFSNKV +ELE+ + VPYLHL S PTWHSRTSSW++ L++P IL A+ S
Sbjct: 353 FCMTVAHKIFSNKVQLELESVLSRVPYLHLQSLPTWHSRTSSWSLCLRIPPPILAADKPS 412
Query: 489 RTAE----TAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSMNKYDCDLADLP 544
E ++++ F T +V+ D I + GEG+P++ G + D +N Y+CDL DLP
Sbjct: 413 DNGEPKYKSSRTQFNTKIVLKDVQISLFGEGSPSIAGSLTRKPSDGYLINNYNCDLEDLP 472
Query: 545 VIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGET-VSLVAHVDPEDMRGCIS 603
++LQQVASQVI WLHEE ++G+ RDFL L F+LD G+T + LVAHVDP+D GC+S
Sbjct: 473 TMVLQQVASQVINWLHEETQVLGMSVTRDFLGLYFDLDHGDTMLGLVAHVDPDDAYGCVS 532
Query: 604 WWLVMEDGFAAERKLSIDISDDASDYKK--FLGHLSLDVLYSTLMDLVSLCGGG 655
W+L + D A E ++ + ++ +K FLG+LSL+VLYSTLMDL++LCG G
Sbjct: 533 WYLTV-DHPAEEDGMTPAANGPWAEEEKCRFLGYLSLEVLYSTLMDLINLCGTG 585
>gi|326533946|dbj|BAJ93746.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 581
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 263/571 (46%), Positives = 363/571 (63%), Gaps = 25/571 (4%)
Query: 101 VEANDAVTVAGMTRPKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEAR 160
VEAND VTVA ++PK ++VL +++VSAATKLQ RHLG YFKQSAK++EQQ KEAR
Sbjct: 15 VEANDTVTVATFSKPKKRLDKVLVNMAVSAATKLQRLRHLGRYFKQSAKTMEQQFQKEAR 74
Query: 161 FYGALIRLQQNWKVKRQRVAAPASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSA 220
FYG+LIRLQQNWKV RQ AP G+ F D+ D S D+A + R SSLS + ID DS+
Sbjct: 75 FYGSLIRLQQNWKVNRQCGNAP--GSNSFMFDVVDTSHLDTAVMPRSSSLSLVPIDQDSS 132
Query: 221 GMLAINLPPNSCRSFRFGFLG-VQSGDSSKQCSKVKNSCSPRPSKEAKESVNDD--ECVR 277
G L++++P SCR F G SG S C+ S + + E NDD + V+
Sbjct: 133 GTLSVHVPQKSCRFLSLQFCGDSTSGTESYACNTKGVSSTTSSAVEDDVPENDDVNKSVK 192
Query: 278 EKHSLLREVHQAIFYEQVFDIVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPS 337
+ HS+LR +H++IF EQVFD+V E F Q+ GVNVTG+ E++LQ+ I I + LSL+ S
Sbjct: 193 QAHSILRNIHRSIFEEQVFDMVTCETFVQTKGVNVTGMWEDFLQIAIDQEILLCLSLVNS 252
Query: 338 NQ-GDLSVDSWVNQNVESGILPLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFH 396
Q D + N L L + +G +++ + + G+ NP + EIYL +FH
Sbjct: 253 GQDSDSEMAGHEEHNNSEANLVLATTNG-----KQEPLKSDASGFLNPKSLEIYLLHMFH 307
Query: 397 EYLYGRAKNKPISTGTRVSGP-----PTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVC 451
+ + + + K R P P D GLLGHFC+++AH+IFSNKV +ELE+ +
Sbjct: 308 DNILRKVREK-YRNIVRYQSPGQTAEPAGDECGLLGHFCMTVAHKIFSNKVQLELESVLS 366
Query: 452 GVPYLHLVSHPTWHSRTSSWTVFLKVPQSILHAESNSRTAE----TAKSHFRTNVVVNDD 507
VPYLHL S PTWHSRTSSW++ L++P IL A+ S E ++++ F T +V+ D
Sbjct: 367 RVPYLHLQSLPTWHSRTSSWSLCLRIPPPILAADKPSDNGEPKYKSSRTQFNTKIVLKDV 426
Query: 508 CIHVEGEGTPNVVGLFKGRSEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVG 567
I + GEG+P++ G + D +N Y+CDL DLP ++LQQVASQVI WLHEE ++G
Sbjct: 427 QISLFGEGSPSIAGSLTRKPSDGYLINNYNCDLEDLPTMVLQQVASQVINWLHEETQVLG 486
Query: 568 IKANRDFLSLSFELDQGET-VSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDA 626
+ RDFL L F+LD G+T + LVAHVDP+D GC+SW+L + D A E ++ +
Sbjct: 487 MSVTRDFLGLYFDLDHGDTMLGLVAHVDPDDAYGCVSWYLTV-DHPAEEDGMTPAANGPW 545
Query: 627 SDYKK--FLGHLSLDVLYSTLMDLVSLCGGG 655
++ +K FLG+LSL+VLYSTLMDL++LCG G
Sbjct: 546 AEEEKCRFLGYLSLEVLYSTLMDLINLCGTG 576
>gi|255566969|ref|XP_002524467.1| hypothetical protein RCOM_0221540 [Ricinus communis]
gi|223536255|gb|EEF37907.1| hypothetical protein RCOM_0221540 [Ricinus communis]
Length = 407
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 195/300 (65%), Positives = 220/300 (73%), Gaps = 44/300 (14%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M G +EIS+DKLPVKRL+AIEE G ERFP DVGYDEKR SLIRRIDF WAVEK D ++ K
Sbjct: 1 MEGKVEISLDKLPVKRLEAIEENGVERFPTDVGYDEKRVSLIRRIDFGWAVEKQDEEKNK 60
Query: 61 KSSKESSSSATTT----PWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPK 116
+ K+ + S+ + PW WQ+MVENLQ AHQELSVIIDLINT
Sbjct: 61 EKKKQKTKSSKESSSSTPWPWQSMVENLQLAHQELSVIIDLINT---------------- 104
Query: 117 ALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKR 176
HLG YFKQSAK+ EQQIA+EARFYGALIRLQQNWKVKR
Sbjct: 105 ----------------------HLGKYFKQSAKAFEQQIAREARFYGALIRLQQNWKVKR 142
Query: 177 QRVAAPASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFR 236
QR+AA A NEGFTIDLFDNSLYDSA + RP SLSTIRIDHDSAGMLAINLPP+SCRS
Sbjct: 143 QRMAATALSNEGFTIDLFDNSLYDSASLFRPPSLSTIRIDHDSAGMLAINLPPSSCRSLH 202
Query: 237 FGFLGVQSGDSSKQCSKVKNSCS-PRPSKEA-KESVNDDECVREKHSLLREVHQAIFYEQ 294
FGFLGV S D+ K+C+K+K+SCS SKEA KES++D+ECV+E HSLLREVHQAIF EQ
Sbjct: 203 FGFLGVHSNDNIKKCTKIKSSCSVEHSSKEAEKESLSDNECVKETHSLLREVHQAIFDEQ 262
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 120/147 (81%), Positives = 133/147 (90%), Gaps = 1/147 (0%)
Query: 512 EGEGTPNVVGLFKGRSEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKAN 571
E EG PNVVGLFKG S+D+ S+NKYDCDLADLPVIILQQVASQVIRWLHEEAL VGIKAN
Sbjct: 261 EQEGAPNVVGLFKGSSDDVCSVNKYDCDLADLPVIILQQVASQVIRWLHEEALSVGIKAN 320
Query: 572 RDFLSLSFELDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKK 631
RDFL LSFEL+QGE +SLVAHVDPED GCISWWLVMEDGFA ERKL +DI+D+AS+Y+K
Sbjct: 321 RDFLCLSFELEQGEVLSLVAHVDPEDTEGCISWWLVMEDGFAEERKLHMDIADEASEYRK 380
Query: 632 FLGHLSLDVLYSTLMDLVSLC-GGGSH 657
FLG+L LD+LYS LM+LVSLC GGGSH
Sbjct: 381 FLGYLPLDILYSILMNLVSLCSGGGSH 407
>gi|326518212|dbj|BAK07358.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 474
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 172/397 (43%), Positives = 255/397 (64%), Gaps = 20/397 (5%)
Query: 273 DECVREKHSLLREVHQAIFYEQVFDIVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFL 332
++ V++ HS+LR +H++IF EQV D+V RE F Q+ GVNVTG+RE++LQL IG + L
Sbjct: 81 NKSVKQAHSILRNIHKSIFEEQVLDMVIRETFVQTQGVNVTGMREDFLQLAIGEESLLCL 140
Query: 333 SLIPSNQ-GDLSVDSWVNQNVESGILPLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYL 391
SL+ S Q D + N L L + +G +++ + + G+ NP + EIYL
Sbjct: 141 SLVNSGQDSDSEIAGHEEHNNSEANLVLATTNG-----KQEPLKMDTSGFLNPKSLEIYL 195
Query: 392 QQVFHEYLYGRAKNKPISTGTRVSGPPTKDGSG----LLGHFCLSLAHRIFSNKVHVELE 447
+FHE + + + K + S T + +G LL HFC+++AH+ FS KV +ELE
Sbjct: 196 LHLFHENILRKVREKYRNIVRYQSPAQTAESAGEDCGLLSHFCMTVAHKTFSKKVQLELE 255
Query: 448 NAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQSILHAESNSRTAE-------TAKSHFRT 500
+ V VPYL L S PTWHSRTSSW++ L+VPQ IL A+ ++ ++ ++++ F T
Sbjct: 256 SVVSRVPYLQLRSLPTWHSRTSSWSLCLRVPQPILAADRPTKPSDNGEPKYKSSRTQFNT 315
Query: 501 NVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSMNKYDCDLADLPVIILQQVASQVIRWLH 560
+V+ D I + GEG+P++ G + D +N Y+CDL DLP ++LQQVASQ+I WLH
Sbjct: 316 KIVLKDGQISLLGEGSPSIAGSLTRKPSDGYLINSYNCDLEDLPTMVLQQVASQIINWLH 375
Query: 561 EEALMVGIKANRDFLSLSFELDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSI 620
EEAL++G+ RDFL L F+L+ G+T+ LVAHVDP+D GCISW+L ++ AE
Sbjct: 376 EEALVLGMSVTRDFLCLYFDLEHGDTLGLVAHVDPDDEYGCISWYLTVD--HPAEEDGKA 433
Query: 621 DISDDA-SDYKKFLGHLSLDVLYSTLMDLVSLCGGGS 656
DD ++ ++FLG+LSL+VLYSTL+DL++LCG G+
Sbjct: 434 PAGDDPWAEKRRFLGYLSLEVLYSTLLDLINLCGTGA 470
>gi|168061092|ref|XP_001782525.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666010|gb|EDQ52677.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 626
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 224/671 (33%), Positives = 351/671 (52%), Gaps = 91/671 (13%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSSK 64
+E+S+D LP+KR+ A+EE+G E PP+ +EK +L+ R+DF +K + +
Sbjct: 1 MEVSLDPLPLKRVLAMEESGTELLPPEPSQEEKSLALLGRLDFGQEFKKIKKEEDGNKKE 60
Query: 65 ESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLS 124
E W WQ +V++LQ AH EL++I+D+IN VEA +AV VA M RPK P E+ S
Sbjct: 61 EKEKEIVQV-WPWQGLVDHLQQAHHELAIILDMINHVEAGEAVAVARMERPKLQPQEICS 119
Query: 125 DLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQR-VAAPA 183
D+++ A++KL +R++G Y K+SA+SLEQQ+ +EA FYGAL+RLQ+NWKVKRQR VAA
Sbjct: 120 DVALRASSKLHHFRNVGKYLKRSAQSLEQQVEREAVFYGALMRLQRNWKVKRQRGVAAGP 179
Query: 184 SGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPP------------NS 231
G+ GF+ID+ + +SAP I ++ +S G+L +LP ++
Sbjct: 180 GGSAGFSIDVGTS---ESAPC-------LITLEQNSVGLLTAHLPAALTLSSLHVTLLST 229
Query: 232 CRSFRFGFLGVQSGDSSKQCSKVKNSCSPRPSKEAKESVND--DECVREKHSLLREVHQA 289
F+FG + +SK+ + + P K + V D ++LR++ A
Sbjct: 230 PAPFKFGKDTERLSATSKEGTS-DATVLPEAPKPGERGVGPGVDRGATRTQAMLRQIQTA 288
Query: 290 IFYEQVFDIVNREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVN 349
F QVF+ V R+A S +NVTG+ + LQL +G S+ + L P ++ +S +
Sbjct: 289 NFDAQVFEWVGRQALSLSPYINVTGLSDTCLQLSLGNAASLNVHLTP--MAPIADESQGS 346
Query: 350 QNVESGILPLDSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNK-PI 408
V + P H +L PN + + LQQ + +L+GR N
Sbjct: 347 DKVLAPRTP--DHQNPRL-------------LPNEGSLSVCLQQAYQRHLHGRDDNALSK 391
Query: 409 STGTRVSGPPTKDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRT 468
TG G + +GL+ H + HR+ S+K+ LE V GVP L +SHPTWH++
Sbjct: 392 ETGKHKDG--YAESAGLVKHVSAIMKHRVASDKIISVLEKQVHGVPELRFISHPTWHAQV 449
Query: 469 SSWTVFLKVPQSILHAESNSRTAETAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSE 528
S+W ++L VP S L ++ +VV+ +D + + GL G
Sbjct: 450 STWDLWLDVPDSAL------------LGRWQASVVLREDML--------TIAGLPSGNGN 489
Query: 529 DMSSMNKYDCDLADL-PVIILQ--------QVASQVIRWLHEEALMVGIKANRDFLSLSF 579
++S C L++L P ++ Q Q+A+ ++ WLH EA +G++A D LS++F
Sbjct: 490 RLTSTA---CTLSELSPFLLFQRLIADVCWQMAAHLVTWLHAEASGMGVEARLDSLSITF 546
Query: 580 ELDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLD 639
ELD E VSLVA P I+W L +LS ++D+ +FLG L L+
Sbjct: 547 ELDNFEDVSLVA--SPAVKSHTINWSL----------RLSGSRTEDSDATSRFLGPLPLE 594
Query: 640 VLYSTLMDLVS 650
L + ++DL++
Sbjct: 595 TLRAIVIDLMN 605
>gi|302796709|ref|XP_002980116.1| hypothetical protein SELMODRAFT_419660 [Selaginella moellendorffii]
gi|300152343|gb|EFJ18986.1| hypothetical protein SELMODRAFT_419660 [Selaginella moellendorffii]
Length = 697
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 206/656 (31%), Positives = 344/656 (52%), Gaps = 69/656 (10%)
Query: 6 EISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSSKE 65
++++D LP+KR+ A+EE+G E +PP+ DEK SL++ +DF E+ D K KKS
Sbjct: 3 KVTLDSLPLKRVLALEESGIELYPPESSQDEKPLSLLQSLDF----ERPDEKEAKKSK-- 56
Query: 66 SSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLSD 125
S + W WQ ++E+ A +EL ++D I VE+N+A+TV M +PK LPNE +D
Sbjct: 57 -SGKDGASQWPWQGLIEHFHQAREELMTLLDFIAHVESNEALTVTHMMKPKPLPNEAGAD 115
Query: 126 LSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQR-VAAPAS 184
L++ A+K + Y+ + Y K++AKSLE+Q+ +E+ FYGAL+RLQ+NWK+KRQR V+A
Sbjct: 116 LALRTASKSRSYKEIASYLKRNAKSLERQVERESVFYGALMRLQRNWKIKRQRGVSAGPG 175
Query: 185 GNEGFTIDL-----FDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGF 239
G GF +DL D SL S SR ++ + I+++ D+ G+L++ +P + +
Sbjct: 176 GKAGFLMDLSFPLSVDPSLISS---SRTAAWTLIKVNQDANGLLSVQIPAGKTITTLQVY 232
Query: 240 LGVQSGDSSKQCSKVKNSCSPRPSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIV 299
L S + + SK + PR +++ + DE + LR + +IF E +F+ +
Sbjct: 233 L---SRELEEYTSKQQLEQPPR----RRDNRDPDEGTSSANHRLRNIQLSIFDELIFECL 285
Query: 300 NREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILPL 359
R+ + S ++ I E+ L +G G I+ F L+ + D + D+ + + V+ LPL
Sbjct: 286 CRDVLQPSSATSLHEIGESSLDIGAGPSIASFFRLV---EDDAANDAALKE-VDVKRLPL 341
Query: 360 DSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGPPT 419
+ IYLQQ F+ L + + G R +G
Sbjct: 342 AG------------------------SLRIYLQQGFYSMLLASIVQRS-APGPR-AGTGK 375
Query: 420 KDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQ 479
++ + LL +F + HR +K+ L+ V VP L SHPTW S+W V+ +P+
Sbjct: 376 EEPANLLKNFSVLARHRSAGDKIVSLLDTLVLQVPQLWFRSHPTWRPFVSAWDVYFDIPE 435
Query: 480 SIL------HAESNSRTAETAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSM 533
+I+ H E N+ ET+ + F+ +V+ + + +EG +G ++ + M
Sbjct: 436 AIMNGGQMKHLEWNALVRETSAT-FQCTLVLRQEILSIEGLDGCMPLG---ASGQESTGM 491
Query: 534 NKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAHV 593
C +++LP +L Q+A +VI WL EEA+++G + RDFLS+ F + VSLVA
Sbjct: 492 KSCVCSVSELPFFLLTQIAGKVIGWLEEEAVLLGPRVKRDFLSIVFNIQNNGLVSLVAA- 550
Query: 594 DPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLMDLV 649
P CI+WWL F ++ + A ++FLG L L+ L + + ++
Sbjct: 551 -PSGY--CINWWLRFYS-FGSD-DTAAASGASAPASERFLGPLPLETLRAVVQRVI 601
>gi|302820478|ref|XP_002991906.1| hypothetical protein SELMODRAFT_430185 [Selaginella moellendorffii]
gi|300140292|gb|EFJ07017.1| hypothetical protein SELMODRAFT_430185 [Selaginella moellendorffii]
Length = 738
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 208/653 (31%), Positives = 342/653 (52%), Gaps = 69/653 (10%)
Query: 6 EISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSSKE 65
++++D LP+KR+ A+EE+G E +PP+ DEK SL++ +DF E+ D K KKS
Sbjct: 3 KVTLDSLPLKRVLALEESGIELYPPESSQDEKPLSLLQSLDF----ERPDEKEAKKSK-- 56
Query: 66 SSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLSD 125
S + W WQ ++E+ A +EL ++D I VE+N+A+TV M +PK LPNE +D
Sbjct: 57 -SGKDGASQWPWQGLIEHFHQAREELMTLLDFIAHVESNEALTVTHMMKPKPLPNEAGAD 115
Query: 126 LSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQR-VAAPAS 184
L++ A+K + Y+ + Y K++AKSLE+Q+ +E+ FYGAL+RLQ+NWK+KRQR V+A
Sbjct: 116 LALRTASKSRSYKEIASYLKRNAKSLERQVERESVFYGALMRLQRNWKIKRQRGVSAGPG 175
Query: 185 GNEGFTIDL-----FDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGF 239
G GF IDL D SL S SR ++ + I+++ D+ G+L++ +P + +
Sbjct: 176 GKAGFLIDLSFPLSVDPSLISS---SRTAAWTLIKVNQDANGLLSVQIPAGKTITTLQVY 232
Query: 240 LGVQSGDSSKQCSKVKNSCSPRPSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIV 299
L S + + SK + PR +++ + DE + LR + +IF E +F+ +
Sbjct: 233 L---SRELEEYTSKQQLEQPPR----RRDNRDPDEGTSSANHRLRNIQLSIFDELIFECL 285
Query: 300 NREAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVNQNVESGILPL 359
R+ + S ++ I E+ L +G G I+ F L+ + D + D+ + + V+ LPL
Sbjct: 286 CRDVLQPSSATSLHEIGESSLDIGAGPSIASFFRLV---EDDAAKDATLKE-VDVKRLPL 341
Query: 360 DSHDGVKLAEEKDDILRKSGGYPNPLTYEIYLQQVFHEYLYGRAKNKPISTGTRVSGPPT 419
+ IYLQQ F+ L + + G R +G
Sbjct: 342 AG------------------------SLRIYLQQGFYSMLLASIVQRS-APGPR-AGTGK 375
Query: 420 KDGSGLLGHFCLSLAHRIFSNKVHVELENAVCGVPYLHLVSHPTWHSRTSSWTVFLKVPQ 479
++ + LL +F + HR +K+ L+ V VP L SHPTW S+W V+ +P+
Sbjct: 376 EEPANLLKNFSVLARHRSAGDKIVSLLDTLVLQVPQLWFRSHPTWRPFVSAWDVYFDIPE 435
Query: 480 SIL------HAESNSRTAETAKSHFRTNVVVNDDCIHVEGEGTPNVVGLFKGRSEDMSSM 533
+I+ H E N ET+ + F+ +V+ + + +EG +G ++ + M
Sbjct: 436 AIMNGGQMKHLEWNVLVRETSAT-FQCTLVLRQEILSIEGLDGCMPLG---ASGQESTGM 491
Query: 534 NKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFELDQGETVSLVAHV 593
C +++LP +L Q+A +VI WL EEA+++G + RDFLS+ F + VSLVA
Sbjct: 492 KSCVCSVSELPFFLLTQIAGKVIGWLEEEAVLLGPRVKRDFLSIVFNIQNNGLVSLVAA- 550
Query: 594 DPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDVLYSTLM 646
P CI+WWL F ++ + A ++FLG L L+ L + L+
Sbjct: 551 -PSGY--CINWWLRFYS-FGSD-DTAAASGASAPASERFLGPLPLETLRAALV 598
>gi|413947118|gb|AFW79767.1| hypothetical protein ZEAMMB73_523859 [Zea mays]
Length = 300
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 135/269 (50%), Positives = 181/269 (67%), Gaps = 9/269 (3%)
Query: 390 YLQQVFHEYLYGRAKNKPISTGTRVSGPPTKDGSGLLGHFCLSLAHRIFSNKVHVELENA 449
YL+ FHE L + + S D GLL HFC+++AHRIFSNKVH+ELE+
Sbjct: 26 YLRSRFHEILRKLREKSRHVVRYQSSAQAAPDDCGLLSHFCMTVAHRIFSNKVHLELESV 85
Query: 450 VCGVPYLHLVSHPTWHSRTSSWTVFLKVPQSILHAESNSRTAE------TAKSHFRTNVV 503
V VPYLHL S PTWHSRTSSW++ LKVPQ IL ++ ++ ++S F T V+
Sbjct: 86 VSRVPYLHLRSLPTWHSRTSSWSLCLKVPQPILATGRVTKPSDYHELKYKSRSQFSTKVI 145
Query: 504 VNDDCIHVEGEGTPNVVGLFKGRSEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEA 563
+ D I + GEG+P++VG G+ D +N Y+CDL DLP+++LQQVASQVI WLH+EA
Sbjct: 146 LKDGQISLMGEGSPSIVGSLTGKPSDSRLINSYNCDLEDLPMMLLQQVASQVIHWLHDEA 205
Query: 564 LMVGIKANRDFLSLSFELDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDIS 623
+++G+ RDFL L F+L QGET+ L+AHVDP+D GCISW+L +E E K+S
Sbjct: 206 MILGMNVTRDFLCLYFDLGQGETLGLLAHVDPDDTYGCISWYLTVEHPM-EEGKMS--AG 262
Query: 624 DDASDYKKFLGHLSLDVLYSTLMDLVSLC 652
S+ ++FLG+LSL+VLYSTLMDL+ LC
Sbjct: 263 SPESEKRRFLGYLSLEVLYSTLMDLIKLC 291
>gi|296089212|emb|CBI38915.3| unnamed protein product [Vitis vinifera]
Length = 269
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/56 (76%), Positives = 48/56 (85%)
Query: 3 GNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKR 58
G +EIS+DKLP+KRLDAIEE G ERFP DVGYD+K SLIRRIDFAWAVEKD K+
Sbjct: 68 GKMEISLDKLPIKRLDAIEENGVERFPTDVGYDDKWVSLIRRIDFAWAVEKDTKKQ 123
>gi|442753715|gb|JAA69017.1| Putative rna polymerase ii transcription mediator rna polymerase ii
transcription mediator [Ixodes ricinus]
Length = 103
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 63/89 (70%), Gaps = 4/89 (4%)
Query: 210 LSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQSGD---SSKQCSKVKNSCSPRPSKEA 266
+ST+R+ HD+AGML IN+ P+ C S +F F+G QS D +SKQ +K ++S +
Sbjct: 1 MSTVRVSHDAAGMLTINMSPDLCHSLQFDFVGAQSDDILRNSKQ-NKSRSSIDHSLGETG 59
Query: 267 KESVNDDECVREKHSLLREVHQAIFYEQV 295
KES +D+ECV++ H+LLREVH AIF EQV
Sbjct: 60 KESSSDEECVKKTHTLLREVHGAIFNEQV 88
>gi|194691398|gb|ACF79783.1| unknown [Zea mays]
Length = 89
Score = 89.7 bits (221), Expect = 4e-15, Method: Composition-based stats.
Identities = 44/88 (50%), Positives = 59/88 (67%), Gaps = 2/88 (2%)
Query: 568 IKANRDFLSLSFELDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDAS 627
+ A DFL L +LDQGET+ LVAHVDP+D GC+SW+L ++ E K+S
Sbjct: 1 MNATIDFLCLYLDLDQGETLGLVAHVDPDDTYGCVSWYLTLDHHPTEEGKMS--AGSPEL 58
Query: 628 DYKKFLGHLSLDVLYSTLMDLVSLCGGG 655
+ ++FLG+LS ++LYSTLMDLV LC G
Sbjct: 59 EERRFLGYLSPELLYSTLMDLVKLCSTG 86
>gi|147834867|emb|CAN63371.1| hypothetical protein VITISV_031279 [Vitis vinifera]
Length = 348
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 48/70 (68%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M N + V KLP+KRLDAIEE G ERFP DV YD+K SLI RIDFAWAVE D K+
Sbjct: 168 MGINGLLPVYKLPIKRLDAIEENGVERFPTDVCYDDKWVSLIWRIDFAWAVENDAKKQKV 227
Query: 61 KSSKESSSSA 70
S +++SA
Sbjct: 228 MHSGMANASA 237
>gi|168043689|ref|XP_001774316.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674308|gb|EDQ60818.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 440
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 76/147 (51%), Gaps = 29/147 (19%)
Query: 521 GLFKGRSEDMSSMNKYDCDLADLPVIILQQVASQVIRWLHEEALMVGIKANRDFLSLSFE 580
GL G + ++S C L++L +L Q+A+ ++ WLH EA +G++A + SL+FE
Sbjct: 307 GLPSGNGKRLTSTA---CTLSELSPFLLFQMAAHLVAWLHAEASGMGVEARLN--SLTFE 361
Query: 581 LDQGETVSLVAHVDPEDMRGCISWWLVMEDGFAAERKLSIDISDDASDYKKFLGHLSLDV 640
LD E VSLVA P I+W L +LS ++D+ +FLG L+L
Sbjct: 362 LDNFEDVSLVA--SPAVKSHTINWSL----------RLSGSRTEDSDATSRFLGALALQT 409
Query: 641 LYSTLMDLV------------SLCGGG 655
L ++++DLV SLCG G
Sbjct: 410 LRASVIDLVNEKLDVKPAGDASLCGEG 436
>gi|198423211|ref|XP_002128356.1| PREDICTED: similar to mediator complex subunit 17 [Ciona
intestinalis]
Length = 653
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 85/191 (44%), Gaps = 28/191 (14%)
Query: 4 NLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSS 63
N++IS+ L ++ I G E F P + E L RI+F E D ++ +++
Sbjct: 6 NVKISLQPLSETKIQEIAYNGRETFIPPLSMSESLTDLANRINFLAEDESDVDQDVERK- 64
Query: 64 KESSSSATTTP---WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPN 120
TP W W+N+ +LQ A E++V++D++ + D+ + + PN
Sbjct: 65 --------LTPGKQWPWENIRSHLQSALIEMNVLVDVMAIAQNKDSTSKDK--QESTDPN 114
Query: 121 EVLSDLSVS-----------AATKLQCYRHLGIYFKQSAKSLEQ---QIAKEARFYGALI 166
L SV+ TK +C G A+ L + +++ + F+ L+
Sbjct: 115 RYLMFSSVAKEAETPKPMLQMITKKRCLGAAGKILLDGAERLNKARNEVSTQQNFHAQLL 174
Query: 167 RLQQNWKVKRQ 177
+L+Q W+V+R
Sbjct: 175 KLRQRWRVRRH 185
>gi|156357299|ref|XP_001624158.1| predicted protein [Nematostella vectensis]
gi|156210917|gb|EDO32058.1| predicted protein [Nematostella vectensis]
Length = 645
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 88/188 (46%), Gaps = 20/188 (10%)
Query: 4 NLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSS 63
++ I++++L ++ I G E++ P + E+ L +IDF + + +
Sbjct: 6 SVNIAIEQLLESEVEEIARDGQEKYVPQLSMSEQLSKLAHKIDFLSEAKSKADDNADDND 65
Query: 64 KESSSSATTT--P--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALP 119
ES + T P W W ++ L+ A E+ V+ DL+N V+ V + +++ +A+
Sbjct: 66 DESDNKTLVTFQPSLWPWDSVRTKLRNALTEVCVLSDLLNVVKEKKYVVLDPVSQSQAIS 125
Query: 120 NEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLE----QQIAKEAR-------FYGALIRL 168
L L+ K + R Q +K +E +++++++R F+ LI L
Sbjct: 126 KPTLQLLA-----KKKNLRDAATILLQGSKRMESAVQERMSQQSRPDQERSDFHSELINL 180
Query: 169 QQNWKVKR 176
+Q W++KR
Sbjct: 181 RQRWRLKR 188
>gi|281210443|gb|EFA84609.1| hypothetical protein PPL_01599 [Polysphondylium pallidum PN500]
Length = 577
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 52/275 (18%), Positives = 115/275 (41%), Gaps = 22/275 (8%)
Query: 80 MVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLSDLSVSAATKLQCYRH 139
+N+ A EL +I LI+ ++ ++++ + +P+A L+D + K +
Sbjct: 110 FTDNMNLARIELEQMIFLIDMIKNERHISLSSIDKPQASEQRELNDAVQQISLKQKALSD 169
Query: 140 LGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVK-RQRVAAPASGNEGFTIDL-FDNS 197
+G A ++ +A + ++ + L+ W++K + + N+ T L D
Sbjct: 170 MGNRLISGAARMKASVATASSWWSDVSTLRSKWRLKGKDSMTMMIPTNKASTKKLSIDYG 229
Query: 198 LYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQSGDSSKQCSKVKNS 257
Y S + + R + + NLP S + R ++ V+S DS +
Sbjct: 230 FYTSGSIVDQTEADLSRGNKGEMNIEFPNLPYPSALNKRL-YISVKSRDSKTFSNNSSGQ 288
Query: 258 CSPRPSKEAKESVNDDECVREK-----------------HSLLREVHQAIFYEQVFDIVN 300
P + +S + E + + +LL + H+ F+ ++FD +N
Sbjct: 289 IHRLPVTDHSKSYREFEQLLNRVDTLPLEDARKLSFLKCSNLLNKAHKYQFHSELFDSLN 348
Query: 301 REAFKQSLGVNVTGIRENYLQLGIGLGISIFLSLI 335
REA + G + + E +++ G G+++F+ ++
Sbjct: 349 REASIGNTG-EIVMLTEKEIRIECG-GLNLFIGMV 381
>gi|198426488|ref|XP_002123046.1| PREDICTED: hypothetical protein [Ciona intestinalis]
Length = 1189
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 75/177 (42%), Gaps = 27/177 (15%)
Query: 261 RPSKEAKESVNDDECVREKHSLLREVHQAIFYEQ----------VFDIVNREAFKQSLGV 310
R K+A VN C R LLRE + Y Q + +IV+RE KQ L
Sbjct: 522 RDVKQALRKVNS--CDRCGEVLLRECANVVMYAQQAAEVITSTALKEIVSRENLKQPLEE 579
Query: 311 NVTG--IRENYLQLGIGLGISIFLSLIPSNQGDLSVDSWVN-----QNVESGILPLDSHD 363
+ E Y I + +P N+ ++ + SWV ++V S + S++
Sbjct: 580 RAANASVLEAYRDTKIAEETKAYFENLPKNKENI-LTSWVEDKKCFRSVISVKVNGYSNE 638
Query: 364 GVKLAEEKDDILRKSGGYPNPLTYEIYLQQVF-------HEYLYGRAKNKPISTGTR 413
++ +EK DI ++SG NP T E Y++ ++ +L + P+ T R
Sbjct: 639 KPEILKEKPDIFQRSGLQMNPKTVECYIEDLYPVTRSLSSSFLNENTEEPPLQTRQR 695
>gi|390336615|ref|XP_783830.3| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like isoform 2 [Strongylocentrotus purpuratus]
gi|390336617|ref|XP_003724387.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like isoform 1 [Strongylocentrotus purpuratus]
Length = 638
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 88/187 (47%), Gaps = 16/187 (8%)
Query: 4 NLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWA---VEKDDNKRLK 60
++++S++ L ++ I G E + + E L +IDFA E + + +
Sbjct: 5 SVKLSLESLQEHKVQEISLDGLETYTKPLSMAENLAKLAHKIDFASQSTDAESPEKETPE 64
Query: 61 KSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVE-AN--------DAVTVAG 111
KE ++S W W ++ ++ A E+SV+ D+++ AN D V+V G
Sbjct: 65 TEDKEGTTSFQQPQWPWDSVRNQVRAALTEMSVLSDILHIARPANKEQPYMVLDPVSVEG 124
Query: 112 MTRPKALPNEVLSDLSV--SAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQ 169
PK +++ +AA+ L + + K+ S E++ AK+ F+G L RL+
Sbjct: 125 -NPPKQYALQIVEKKKSLETAASILTSGANRMLQSKRFGVSAEKKNAKD-DFFGELHRLR 182
Query: 170 QNWKVKR 176
+W++K+
Sbjct: 183 MHWRLKK 189
>gi|320169704|gb|EFW46603.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 642
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 52/226 (23%), Positives = 93/226 (41%), Gaps = 21/226 (9%)
Query: 75 WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLSDLSVSAATKL 134
W W+++ +++ +A E V++DL + + T +A + DL+ K
Sbjct: 119 WHWKDVQDHIFYAQSESEVLLDLFALTRNKKYLAINHTTAVEA----PVPDLAFRVVAKR 174
Query: 135 QCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPASGNEGFTIDLF 194
Q + +A LE + E FY L LQQ+W++ ++ G T+ F
Sbjct: 175 QLLANAAEALASAAARLENALETEQSFYSQLTSLQQDWRIVQR----------GQTLG-F 223
Query: 195 DNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQSGDSSKQCSKV 254
D L + S L +IR D+A + I PP+ + R F SG S +
Sbjct: 224 DVGLRSAGSRHPGSGLYSIR-RGDTAASVVIESPPDLQTATRLRF--SVSGPGLSATSTI 280
Query: 255 KNSCSPRPSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIVN 300
+ P +E + + E + + LRE+ A+ + + D+ N
Sbjct: 281 LH--PPFQGQELSQISSTLEAAQHSN-FLREIFSALTSDALEDVSN 323
>gi|195055229|ref|XP_001994522.1| GH15742 [Drosophila grimshawi]
gi|193892285|gb|EDV91151.1| GH15742 [Drosophila grimshawi]
Length = 648
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/182 (21%), Positives = 77/182 (42%), Gaps = 7/182 (3%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M+ ++ ISV+ ++ I G E + P E RIDF+ DD K+ +
Sbjct: 1 MSNSVNISVETTCENQIREIAYDGTELYQPPPTLSESLAKCAARIDFS-KTSLDDLKKEE 59
Query: 61 KSS------KESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTR 114
KS+ ++ + + W W + L+ A+ E+ V+ D+I+ + + + +
Sbjct: 60 KSAAAAEDKEDKDNQFQESLWPWDAVRNKLKDAYTEICVLSDVISIAKDKRYLVLDPLLE 119
Query: 115 PKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKV 174
++ S A LG + EQ+ + F+ L+RL+QNW++
Sbjct: 120 DGDDTKAIVQVYSRKKALSQAAQVLLGGAERLRTAHSEQRSRNVSDFHIELLRLRQNWRL 179
Query: 175 KR 176
K+
Sbjct: 180 KK 181
>gi|348522991|ref|XP_003449007.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like [Oreochromis niloticus]
Length = 642
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 38/189 (20%), Positives = 83/189 (43%), Gaps = 21/189 (11%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAV--------EKDDN 56
+ IS++ K++ + G E + P + + L +RIDF+ E D
Sbjct: 7 VRISIESSCEKQVQEVALDGTETYVPPLSMSQNLAKLAQRIDFSQGSDSEEDVEGEPKDR 66
Query: 57 KRLKKSSKESSSSATTTP--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTR 114
+ K+ +E + P W W ++ NL+ A E+ V+ D+++ V+ + + +++
Sbjct: 67 EWGKQDGEEEEGTVKFQPSLWPWDSVRNNLRSALTEMCVLYDVLSVVKEKKYMALDPVSQ 126
Query: 115 -PKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIA------KEARFYGALIR 167
P A + L +K + + ++ A+ L + +A ++ F L+R
Sbjct: 127 DPAAGKTPQVFQL----ISKKKSLATAAMILQKGAEKLSKSVAENQENRRQRDFNSELLR 182
Query: 168 LQQNWKVKR 176
L+ WK+++
Sbjct: 183 LRSQWKLRK 191
>gi|380011201|ref|XP_003689699.1| PREDICTED: LOW QUALITY PROTEIN: mediator of RNA polymerase II
transcription subunit 17-like [Apis florea]
Length = 644
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 41/185 (22%), Positives = 84/185 (45%), Gaps = 12/185 (6%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M ++ ISV+ ++ I G E + + E + ++IDF+ +D K L+
Sbjct: 1 MAYSVNISVEAPIENQIQEITYDGQEIYQAPLTLSENLAKIAQKIDFSKTNGEDVKKELE 60
Query: 61 KSSK-----ESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRP 115
K + S+S ++ W W ++ L+ A E+ V+ D++ + + + + +
Sbjct: 61 NGEKSEEDTKDSASFQSSLWPWDSVRNKLRNALTEVCVLADVLAIAKEKHYMVLDPVPQE 120
Query: 116 KALPNEVLSDLSVSAATK-LQCYRHLGIYFKQSAKSLEQQIAKEAR---FYGALIRLQQN 171
P EV + V A K L + + K+ + ++A+ F+ L+RL+QN
Sbjct: 121 ---PTEVKPMIQVYARKKALAGAASVIMMGADRLKNCQNELARNRSTPDFHIELLRLRQN 177
Query: 172 WKVKR 176
W++K+
Sbjct: 178 WRLKK 182
>gi|384249298|gb|EIE22780.1| hypothetical protein COCSUDRAFT_42399 [Coccomyxa subellipsoidea
C-169]
Length = 403
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 54/113 (47%), Gaps = 5/113 (4%)
Query: 81 VENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVLSDLSVSAATKLQCYRHL 140
ENL + E+ VI+DL++ VEA + VA + PK + + S ++ K +
Sbjct: 35 AENLA-SQGEVDVILDLVSAVEAGQHLNVANVPPPKNTLHGLTSQRALQVQAKRLQLQDA 93
Query: 141 GIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRVAAPASGNEGFTIDL 193
+ A L + ++ +Y +LQ++WK+K A+P+ T+D+
Sbjct: 94 AQRLSRGAGLLAASLDRDDTYYSQAAQLQRSWKLK----ASPSGAAAPLTVDV 142
>gi|66507375|ref|XP_394516.2| PREDICTED: mediator of RNA polymerase II transcription subunit 17
[Apis mellifera]
Length = 644
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 41/185 (22%), Positives = 84/185 (45%), Gaps = 12/185 (6%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLK 60
M ++ ISV+ ++ I G E + + E + ++IDF+ +D K L+
Sbjct: 1 MAYSVNISVEAPIENQIQEITYDGQEIYQAPLTLSENLAKIAQKIDFSKTNGEDVKKELE 60
Query: 61 KSSK-----ESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRP 115
K + S+S ++ W W ++ L+ A E+ V+ D++ + + + + +
Sbjct: 61 SGEKSEEDTKDSASFQSSLWPWDSVRNKLRNALTEVCVLADVLAIAKEKHYMVLDPVPQE 120
Query: 116 KALPNEVLSDLSVSAATK-LQCYRHLGIYFKQSAKSLEQQIAKEAR---FYGALIRLQQN 171
P EV + V A K L + + K+ + ++A+ F+ L+RL+QN
Sbjct: 121 ---PTEVKPMIQVYARKKALAGAASVIMMGADRLKNCQNELARNRSTPDFHIELLRLRQN 177
Query: 172 WKVKR 176
W++K+
Sbjct: 178 WRLKK 182
>gi|410909574|ref|XP_003968265.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like isoform 1 [Takifugu rubripes]
Length = 642
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 36/189 (19%), Positives = 83/189 (43%), Gaps = 21/189 (11%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKD---------D 55
++IS++ + K++ + G E + P + + L +RIDF + + D
Sbjct: 7 VKISIESVCEKQVQEVALDGTETYVPPLSMSQNLSKLAQRIDFNQGSDSEEVDGEGEFRD 66
Query: 56 NKRLKKSSKESSSSATTTP--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMT 113
+ K+ +E + P W W ++ NL+ A E+ V+ D+++ V+ + + ++
Sbjct: 67 REWSKQEQEEEDGTVKFQPSLWPWDSVRNNLRSALTEMCVLYDVLSVVKEKKYMALDPVS 126
Query: 114 RPKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIA------KEARFYGALIR 167
+ + + L +K + G + A+ L + +A ++ F L+R
Sbjct: 127 QDQTGKTPQVFQL----ISKKKSLATAGQLLLKGAEKLSKSVAENQENRRQRDFNSELLR 182
Query: 168 LQQNWKVKR 176
L+ WK+++
Sbjct: 183 LRSQWKLRK 191
>gi|47213573|emb|CAF95555.1| unnamed protein product [Tetraodon nigroviridis]
Length = 641
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/187 (20%), Positives = 84/187 (44%), Gaps = 17/187 (9%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRL----- 59
+ IS++ + K++ + G E + P + + L +RIDF+ + ++ +
Sbjct: 7 VRISIESVCEKQVQEVALDGTETYVPPLSMSQNLSKLAQRIDFSQGSDSEEVEGEGELRE 66
Query: 60 ----KKSSKESSSSATTTP--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMT 113
K+ +E + P W W ++ NL+ A E+ V+ D+++ V+ + + ++
Sbjct: 67 REWSKQEQEEEDGTVKFQPSLWPWDSVRNNLRSALTEMCVLYDVLSVVKEKKYMALDPVS 126
Query: 114 RPKA--LPN--EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQ 169
+ + P ++LS A + K A++LE Q ++ F L+RL+
Sbjct: 127 QDQTGKTPQVFQLLSKKKSLATAGQLLLKGAEKLSKSVAENLENQ--RQRDFNSELLRLR 184
Query: 170 QNWKVKR 176
WK+++
Sbjct: 185 SQWKLRK 191
>gi|307212331|gb|EFN88135.1| Mediator of RNA polymerase II transcription subunit 17
[Harpegnathos saltator]
Length = 646
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 41/186 (22%), Positives = 89/186 (47%), Gaps = 13/186 (6%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWA----VEKDDN 56
M ++ IS++ ++ I G E + + E + ++IDF+ + K+
Sbjct: 1 MAYSVNISIEAPIENQIQEISYDGQEIYQAPLTLSENLAKIAQKIDFSKTNGDEISKEQL 60
Query: 57 KRLKKSSKESSSSAT--TTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTR 114
+ +KS ++ SA+ ++ W W ++ L+ A E+ V+ D++ + + + + + +
Sbjct: 61 EGGEKSEEDPKDSASFQSSLWPWDSIRNKLRSALTEVCVLADVLAIAKEKNYMVLDPVPQ 120
Query: 115 PKALPNEVLSDLSVSAATK-LQCYRHLGIYFKQSAKSLEQQIAKEAR---FYGALIRLQQ 170
P EV + V A K L +L I+ Q ++ + ++ K F+ L+RL+Q
Sbjct: 121 D---PVEVKPMMQVYARKKALTGAANLIIHGAQRLQNSQAELTKNRSTPDFHIELLRLRQ 177
Query: 171 NWKVKR 176
NW++K+
Sbjct: 178 NWRLKK 183
>gi|291224957|ref|XP_002732468.1| PREDICTED: mediator complex subunit 17-like, partial [Saccoglossus
kowalevskii]
Length = 326
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 39/177 (22%), Positives = 79/177 (44%), Gaps = 8/177 (4%)
Query: 4 NLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRLKKSS 63
+++SV+ + ++ + G E + + E L +IDF + E D +K
Sbjct: 5 GVKVSVEPVLEHQIQEVALDGQETYVTPLSMSENLTKLAHKIDFGKSDE--DEVSGEKEK 62
Query: 64 KESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMTRPKALPNEVL 123
E + W W+++ L+ A E+ V+ D+++ + + + +++ +A+ L
Sbjct: 63 DEELTPFQPPQWPWESVRNKLRNAFTEVCVLADVLSIAKETRYMVLDEVSQEEAISKPAL 122
Query: 124 SDL----SVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKR 176
L S+S A ++ G FK S + K F+ L++L QNW++KR
Sbjct: 123 QLLAKKRSLSGAAQI-LVEGAGRLFKSQVVSGSLKSNKH-DFHVELLKLHQNWRLKR 177
>gi|290983196|ref|XP_002674315.1| predicted protein [Naegleria gruberi]
gi|284087904|gb|EFC41571.1| predicted protein [Naegleria gruberi]
Length = 587
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 82/194 (42%), Gaps = 32/194 (16%)
Query: 16 RLDAIEETGAERFPPDVGYDEKRESLIR-RIDFA--WAVEKDDNKRLKKSSKESSSSATT 72
R+ + G E P++G +E +I+ + DF + + ++ R S + +
Sbjct: 37 RIQFYDTDGKETIVPNLGAEENFARIIQNKFDFTPTYYQRQLESIRFDDSQQLPPPTIRK 96
Query: 73 TPW-QWQNMVENLQFAHQELSVIID------------LINTVEANDAVTVAGMTRPKALP 119
P +W+ V L+ A E+ I D LI TV A+ T+ KA
Sbjct: 97 VPEPEWEKTVNKLRQAIYEVQFIHDVSKMMKETSDDQLIETVNAHRTFNEDLRTKEKA-- 154
Query: 120 NEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNWKVKRQRV 179
+L S +L Y H GI K ++ L ++ + + FY + +LQ+ WKVK+
Sbjct: 155 ----GELFPSRIEQL-IYTH-GI-LKSGSEQLRKRSERTSDFYQTIFKLQKKWKVKQ--- 204
Query: 180 AAPASGNEGFTIDL 193
GN F +DL
Sbjct: 205 ----FGNNNFFVDL 214
>gi|239609191|gb|EEQ86178.1| RNA polymerase II mediator complex component SRB4 [Ajellomyces
dermatitidis ER-3]
Length = 692
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 90/210 (42%), Gaps = 29/210 (13%)
Query: 128 VSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQN-WKVKRQRVAAPASGN 186
+S KL+ + Q+A LEQ++A E +++ ++ +++ WK+ R G
Sbjct: 157 LSRGWKLETFDFAANKLLQAASRLEQEVAAETKYWADVLSIKEKGWKICRIPRERQTLGV 216
Query: 187 E-GFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQSG 245
+ GF ++ P R L+ +R +L L P+ RS R + VQ
Sbjct: 217 QCGF---------LEATPTLRDRGLAALRRGDGGDLILDRGLQPSQPRSLR---VRVQQH 264
Query: 246 DSSKQCSKVKNSCSPRPSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIVNREAFK 305
D CSK+ + E D+ + + L R+V ++ E++F +NREA
Sbjct: 265 DQIVGCSKLI----------SLELAPGDDTIEHRIRLARDV---LYEEELFHELNREA-- 309
Query: 306 QSLGVNVTGIRENYLQLGIGLGISIFLSLI 335
++L + +EN +Q I + ++
Sbjct: 310 RTLLQHGIQSKENLIQFQASDNQQILIDMV 339
>gi|117606149|ref|NP_001071042.1| mediator of RNA polymerase II transcription subunit 17 [Danio
rerio]
gi|123884354|sp|Q08BY1.1|MED17_DANRE RecName: Full=Mediator of RNA polymerase II transcription subunit
17; AltName: Full=Cofactor required for Sp1
transcriptional activation subunit 6; Short=CRSP complex
subunit 6; AltName: Full=Mediator complex subunit 17
gi|115313857|gb|AAI24508.1| Cofactor required for Sp1 transcriptional activation, subunit 6
[Danio rerio]
gi|182891978|gb|AAI65631.1| Crsp6 protein [Danio rerio]
Length = 643
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/186 (19%), Positives = 83/186 (44%), Gaps = 14/186 (7%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDN-------- 56
+ +S++ +++ + G E + P + + L++RIDF + + +++
Sbjct: 7 VRVSIESSCERQVQEVSLDGMETYVPPLSMSQNLAKLVQRIDFCQSSDSEEDGAERARAG 66
Query: 57 -KRLKKSSKESSSSATTTP--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMT 113
++ K+ +E P W W ++ NL+ A E+ V+ D+++ ++ +T+ ++
Sbjct: 67 REQWKQEPEEDEGQLKFQPSLWPWDSVRNNLRSALTEMCVLHDVLSVLKERKYMTLDPVS 126
Query: 114 RPKALPN--EVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEAR-FYGALIRLQQ 170
+ A+ +V +S + L K S E Q + R F L+RL+
Sbjct: 127 QDPAMAKTPQVFQLISKKKSLGTAAQLLLKGAEKLSKSVSENQEQRRQRDFNSELLRLRS 186
Query: 171 NWKVKR 176
WK+++
Sbjct: 187 QWKLRK 192
>gi|410909576|ref|XP_003968266.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like isoform 2 [Takifugu rubripes]
Length = 650
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 39/198 (19%), Positives = 85/198 (42%), Gaps = 31/198 (15%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKD---------D 55
++IS++ + K++ + G E + P + + L +RIDF + + D
Sbjct: 7 VKISIESVCEKQVQEVALDGTETYVPPLSMSQNLSKLAQRIDFNQGSDSEEVDGEGEFRD 66
Query: 56 NKRLKKSSKESSSSATTTP--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMT 113
+ K+ +E + P W W ++ NL+ A E+ V+ D+++ V+ + + ++
Sbjct: 67 REWSKQEQEEEDGTVKFQPSLWPWDSVRNNLRSALTEMCVLYDVLSVVKEKKYMALDPVS 126
Query: 114 R---------PKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIA------KE 158
+ P A +V +S K + G + A+ L + +A ++
Sbjct: 127 QDQTGKHHHHPFAQTPQVFQLIS-----KKKSLATAGQLLLKGAEKLSKSVAENQENRRQ 181
Query: 159 ARFYGALIRLQQNWKVKR 176
F L+RL+ WK+++
Sbjct: 182 RDFNSELLRLRSQWKLRK 199
>gi|261189001|ref|XP_002620913.1| RNA polymerase II mediator complex component SRB4 [Ajellomyces
dermatitidis SLH14081]
gi|239591917|gb|EEQ74498.1| RNA polymerase II mediator complex component SRB4 [Ajellomyces
dermatitidis SLH14081]
gi|327355907|gb|EGE84764.1| RNA polymerase II mediator complex component SRB4 [Ajellomyces
dermatitidis ATCC 18188]
Length = 692
Score = 39.7 bits (91), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 90/210 (42%), Gaps = 29/210 (13%)
Query: 128 VSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQN-WKVKRQRVAAPASGN 186
+S KL+ + Q+A LEQ++A E +++ ++ +++ WK+ R G
Sbjct: 157 LSRGWKLETFDFAANKLLQAASRLEQEVAAETKYWADVLSIKEKGWKICRIPRERQTLGV 216
Query: 187 E-GFTIDLFDNSLYDSAPVSRPSSLSTIRIDHDSAGMLAINLPPNSCRSFRFGFLGVQSG 245
+ GF ++ P R L+ +R +L L P+ RS R + VQ
Sbjct: 217 QCGF---------LEATPTLRDRGLAALRRGDGGDLILDRGLQPSQPRSLR---VRVQQH 264
Query: 246 DSSKQCSKVKNSCSPRPSKEAKESVNDDECVREKHSLLREVHQAIFYEQVFDIVNREAFK 305
D CSK+ + E D+ + + L R+V ++ E++F +NREA
Sbjct: 265 DQIVGCSKLI----------SLELAPGDDTIEHRIRLARDV---LYEEELFHELNREA-- 309
Query: 306 QSLGVNVTGIRENYLQLGIGLGISIFLSLI 335
++L + +EN +Q I + ++
Sbjct: 310 RTLLQHGIQSKENLIQFQASDNQQILIDMV 339
>gi|195444779|ref|XP_002070025.1| GK11248 [Drosophila willistoni]
gi|194166110|gb|EDW81011.1| GK11248 [Drosophila willistoni]
Length = 645
Score = 39.7 bits (91), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 43/184 (23%), Positives = 76/184 (41%), Gaps = 9/184 (4%)
Query: 1 MNGN-LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRL 59
M+ N + ISV+ ++ I G E + P E RIDF+ DD K+
Sbjct: 1 MSANSVNISVETTCENQIREIGYDGTELYQPPPTLSESLAKCAARIDFS-KTSLDDLKKE 59
Query: 60 KKSSKESSSSA-------TTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGM 112
+KS+ E+ S + W W + L+ A E+ V+ D+I+ + + + +
Sbjct: 60 EKSATEAEESRDKDGNQFQESLWPWDAVRNKLKDAFTEICVLSDVISIAKDKRYLVLDPL 119
Query: 113 TRPKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRLQQNW 172
++ S A LG + EQ+ + F+ L+RL+QNW
Sbjct: 120 LEDGDDTKPIVQVYSRKKAISQAAQVLLGGAERLRNAHSEQRSRNISDFHIELLRLRQNW 179
Query: 173 KVKR 176
++K+
Sbjct: 180 RLKK 183
>gi|18311664|ref|NP_558331.1| hypothetical protein PAE0030 [Pyrobaculum aerophilum str. IM2]
gi|74566312|sp|Q8ZZX6.1|AUBA_PYRAE RecName: Full=RNA-binding protein AU-1; AltName: Full=AU-binding
protein
gi|18159062|gb|AAL62513.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
Length = 443
Score = 39.3 bits (90), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 7/102 (6%)
Query: 118 LPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSL-EQQIAKEAR-FYGALIRLQQNWKVK 175
+P E L + A T+L+ Y +G+ FK SAK E+QI KEA Y L++L
Sbjct: 166 IPQEDRLRLGILAETRLKQYASIGLRFKSSAKYADEEQIIKEAEVLYRELLQLSHGGPPG 225
Query: 176 RQRVAAPASGNEGFTIDLFDNSLYDSAPVSRPSSLSTIRIDH 217
A GN F + LFD + +R S++ T+R H
Sbjct: 226 ----AVLRRGN-CFAVVLFDRRSKEVLDSARASAVPTVRGHH 262
>gi|91080451|ref|XP_969541.1| PREDICTED: similar to AGAP009141-PA [Tribolium castaneum]
gi|270005569|gb|EFA02017.1| hypothetical protein TcasGA2_TC007640 [Tribolium castaneum]
Length = 639
Score = 39.3 bits (90), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 40/188 (21%), Positives = 80/188 (42%), Gaps = 17/188 (9%)
Query: 1 MNGNLEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAVEKDDNKRL- 59
M+ ++ ISV+ ++ I G E + P + E ++IDF+ + D K
Sbjct: 1 MSYSVNISVEAPIENQIQEITYDGTEIYQPPLTLSESLTKYAQKIDFSKTSDIDFKKEPG 60
Query: 60 -----KKSSKESSSSATTTPWQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGM-- 112
KS +S + ++ W W + L+ A QE+ V+ D++ + + + +
Sbjct: 61 EAIEDNKSDSDSKDAFQSSLWPWDSARNKLRNAFQEVCVLADVLAIAKDKRYMVLDPVQQ 120
Query: 113 ----TRPKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIAKEARFYGALIRL 168
T+P L+ +A+ L L ++A++ F+ L+RL
Sbjct: 121 EPIETKPMVQIYARKKALAGAASVLLTGAERLKTSQNEAARN-----RTVPDFHIELLRL 175
Query: 169 QQNWKVKR 176
+QNW++K+
Sbjct: 176 RQNWRLKK 183
>gi|302696519|ref|XP_003037938.1| hypothetical protein SCHCODRAFT_230551 [Schizophyllum commune H4-8]
gi|300111635|gb|EFJ03036.1| hypothetical protein SCHCODRAFT_230551 [Schizophyllum commune H4-8]
Length = 495
Score = 38.9 bits (89), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 53/245 (21%), Positives = 104/245 (42%), Gaps = 30/245 (12%)
Query: 104 NDAVTVAGMTRPKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQ-QIAKEARFY 162
D +T +T+P +P+ + ++ +K + R FK +A+ +E+ +I +E F
Sbjct: 69 KDVLTSTIVTKPPPIPSVQAFNAQLALGSKDEALRKASKLFKDAAEDMERSRIKEEKYFL 128
Query: 163 GALIRLQQNWKVKRQRVAAP-------ASGNEGFTIDLFDN-SLYDSAPVSRPSSLSTI- 213
AL ++NW + V AP A G+E +DL L +S P R S++T+
Sbjct: 129 NALKIRRENWGM----VPAPLPLWMAQAKGSEKTAMDLLVCFGLEESPPAFRRQSIATMG 184
Query: 214 --RIDHDSAGMLAINLP-PNSCRSFRFGFLGVQSGDSSKQCSKVKNSCSPRPSKEAKESV 270
D D +N P R + SG + + + ++ P+ A+E
Sbjct: 185 SFEADTDP-----LNFPHRQRTRLRVTLTTTLPSGARVQSQNTITHAPVDAPALHAQEEA 239
Query: 271 NDDECVREKHSLLREVHQAIFYEQVFDIVNREAFKQSLGVNVTGIRENYLQLGIGLGISI 330
+ H L+++ + ++F ++ REA L +RE ++ + G+S+
Sbjct: 240 ST------LHGLIQQAQHEMVDREMFSVLVREA--GHLPTAAAHVREKFIVIEAAQGVSL 291
Query: 331 FLSLI 335
++
Sbjct: 292 RFDMV 296
>gi|432889346|ref|XP_004075231.1| PREDICTED: mediator of RNA polymerase II transcription subunit
17-like [Oryzias latipes]
Length = 643
Score = 38.9 bits (89), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 37/189 (19%), Positives = 80/189 (42%), Gaps = 20/189 (10%)
Query: 5 LEISVDKLPVKRLDAIEETGAERFPPDVGYDEKRESLIRRIDFAWAV---------EKDD 55
+ IS++ K++ + G E + P + + L +RIDF+ E D
Sbjct: 7 VRISIESSCEKQVQEVALDGTETYVPPLSMSQNLAKLAQRIDFSQGSDSEEDGAEGESRD 66
Query: 56 NKRLKKSSKESSSSATTTP--WQWQNMVENLQFAHQELSVIIDLINTVEANDAVTVAGMT 113
+ K+ +E + P W W ++ NL+ A E+ V+ D+++ V+ + + ++
Sbjct: 67 REWSKQEPEEEEGAVKFQPSLWPWDSVRNNLRSALTEMCVLHDVLSVVKEKKYMALDPVS 126
Query: 114 RPKALPNEVLSDLSVSAATKLQCYRHLGIYFKQSAKSLEQQIA------KEARFYGALIR 167
P+ V + +K + + A+ L + +A ++ F L+R
Sbjct: 127 HD---PSAVKTPQVFQLISKKKSLGTAAQILLKGAERLSKSVAENQENRRQRDFNSELLR 183
Query: 168 LQQNWKVKR 176
L+ WK+++
Sbjct: 184 LRSQWKLRK 192
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,415,203,574
Number of Sequences: 23463169
Number of extensions: 439292034
Number of successful extensions: 1053507
Number of sequences better than 100.0: 104
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 75
Number of HSP's that attempted gapping in prelim test: 1053245
Number of HSP's gapped (non-prelim): 118
length of query: 657
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 508
effective length of database: 8,863,183,186
effective search space: 4502497058488
effective search space used: 4502497058488
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)