BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009565
(532 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|317106644|dbj|BAJ53149.1| JHL23J11.4 [Jatropha curcas]
Length = 552
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/536 (68%), Positives = 421/536 (78%), Gaps = 44/536 (8%)
Query: 23 EEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQ 82
E GWYIL ENQQH+GPYA SEL EHFLNGYL E+TLVWS+GR WQPLSSIP+ +SGISQ
Sbjct: 35 EVGWYILGENQQHLGPYASSELREHFLNGYLSESTLVWSEGRVVWQPLSSIPELISGISQ 94
Query: 83 QVARGSTAVPCNDGIE-EVREQIEEA----AGMQSQSFSSAEQ------GVPSHVDD-EF 130
Q A S A N E E + +EE G +++S S+ ++ VP DD EF
Sbjct: 95 QKADSSIAARTNSDNELEKSDDVEETERGNVGFKNESHSTDKEQAKDSSSVPFSDDDAEF 154
Query: 131 EKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDR 190
EKWQREVR+AE EA++LKNGS S S+G +G + PEGE+EFTDDDGT YKWDR
Sbjct: 155 EKWQREVRDAEAEAQQLKNGSDSGSIGAGVG------AMSPPEGEEEFTDDDGTTYKWDR 208
Query: 191 GLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNS 250
GLRAWVPQ+DTSS YG++EMTFL+EEEVFPTVN+ D +++EK N
Sbjct: 209 GLRAWVPQDDTSSMGGQYGLDEMTFLQEEEVFPTVNIED------------AASKEKFNG 256
Query: 251 ADNVVEEKHNGKRK-----------QPDK---QVEKKEANKPPDSWFELKVNTHVYVTGL 296
+ +E KHNGKRK QPDK Q + KEANK PDSWFELKVNTH+YVTGL
Sbjct: 257 TSDTLEPKHNGKRKLMDMQTDNNEKQPDKKEKQADNKEANKAPDSWFELKVNTHIYVTGL 316
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
PDDVT EE+VEVFSKCGIIKEDPETKKPR+KIYVDKETG KGDAL+TYLKEPSV LA Q
Sbjct: 317 PDDVTAEEVVEVFSKCGIIKEDPETKKPRVKIYVDKETGRTKGDALITYLKEPSVDLAMQ 376
Query: 357 LLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKL 416
+LDGTP RP G IPMSV++AKFEQKG+RFI K+VD+KKKKKLK+VEEK+LGWGGRDDAK+
Sbjct: 377 ILDGTPLRPGGTIPMSVSRAKFEQKGDRFIPKKVDNKKKKKLKRVEEKILGWGGRDDAKV 436
Query: 417 TIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
IPATV+LR+MFTPAEMRADENLRSELE DV+EECVK+GPVDSVKVCENHPQGVVLVRFK
Sbjct: 437 LIPATVVLRYMFTPAEMRADENLRSELELDVKEECVKLGPVDSVKVCENHPQGVVLVRFK 496
Query: 477 DRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
DRKDAQKCIELMNGRWFGGRQ+HASEDDG VNHA +RD + +RLEQFGAELEAD
Sbjct: 497 DRKDAQKCIELMNGRWFGGRQVHASEDDGSVNHAIVRDYAEDTARLEQFGAELEAD 552
>gi|224112381|ref|XP_002316170.1| predicted protein [Populus trichocarpa]
gi|222865210|gb|EEF02341.1| predicted protein [Populus trichocarpa]
Length = 497
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/522 (67%), Positives = 396/522 (75%), Gaps = 48/522 (9%)
Query: 17 NYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQF 76
NY T E GWYIL E+QQ VGPY SEL EHFLNGYLLE+TLVWS+GRS+WQPLSSIP+
Sbjct: 18 NYNTVAEVGWYILGEDQQQVGPYVFSELREHFLNGYLLESTLVWSEGRSDWQPLSSIPEL 77
Query: 77 LSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQRE 136
+SG SQQ + S AV ND +DEFEKWQRE
Sbjct: 78 MSGTSQQGSDYSRAVSSNDD------------------------------EDEFEKWQRE 107
Query: 137 VREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWV 196
V+EAE EAERLKNGS + G G DD D + + P+GEDEFTDDDGT YKWDR LRAWV
Sbjct: 108 VKEAEAEAERLKNGSLPGNTGDDFGIDDSDRILSPPDGEDEFTDDDGTTYKWDRSLRAWV 167
Query: 197 PQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVE 256
PQ++ SS + YG+E+MTF ++EEVF VN D DE N VVE
Sbjct: 168 PQDNLSSVSGQYGVEQMTFHEQEEVFLNVNAADASLKDEA------------NGTGEVVE 215
Query: 257 EKHNGKRK------QPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFS 310
+ + KRK DKQ +KKEANK PDSWFELKVNTHVYVTGLPDDVT EE+VEVFS
Sbjct: 216 SQRSDKRKLQDEQADKDKQADKKEANKAPDSWFELKVNTHVYVTGLPDDVTAEEVVEVFS 275
Query: 311 KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIP 370
KCG+IKEDPE KKPR+KIYVDKETG KGDALVTYLKEPSV LA Q+LDGTP RP G IP
Sbjct: 276 KCGVIKEDPEKKKPRVKIYVDKETGRIKGDALVTYLKEPSVDLAMQILDGTPLRPGGTIP 335
Query: 371 MSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTP 430
MSVTQAKFEQKG+RFI KQVDSKKK+KLKKVE+++LGWGGRDDAK++IPATV+LR MFT
Sbjct: 336 MSVTQAKFEQKGDRFITKQVDSKKKRKLKKVEDRILGWGGRDDAKVSIPATVVLRQMFTL 395
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
+EMRADE+LRSELE DV+EEC K+GPVDSVKVCEN+P GVVLV+FKDRKDAQ CIELMNG
Sbjct: 396 SEMRADESLRSELEVDVREECAKLGPVDSVKVCENNPHGVVLVKFKDRKDAQSCIELMNG 455
Query: 491 RWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
RWFGGRQ+ ASEDDGL+NHA +RD D +A+RLEQFGAELEAD
Sbjct: 456 RWFGGRQVDASEDDGLINHALVRDHDEDAARLEQFGAELEAD 497
>gi|356535254|ref|XP_003536163.1| PREDICTED: HIV Tat-specific factor 1 homolog [Glycine max]
Length = 503
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/520 (64%), Positives = 389/520 (74%), Gaps = 41/520 (7%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
E E GWY+L E+QQ +GPYA SEL EHFLNGYL E T VWS+GRSEWQPLSS+ +
Sbjct: 19 EKITEVGWYVLGEDQQQIGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWA 78
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
I+QQ ST V D DEFE+WQ+E++
Sbjct: 79 QINQQGPDSSTTVSAPDV-------------------------------DEFERWQKEIQ 107
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQ 198
EAE + E + GS S + G +D + PEGE+EFTDDDGT YKWDR LRAWVPQ
Sbjct: 108 EAEAQVEGSEFGSLSGNAGSTGAGEDSERPSTPPEGEEEFTDDDGTVYKWDRNLRAWVPQ 167
Query: 199 EDTSSQNDGYGIEEMTFLKEEEVFPTVNVTD------DLANDEVGKEKLNSTEEKVNSAD 252
E + + YG++EMTFL+EEEVFPT+ ++D D V L + N+ +
Sbjct: 168 EHPTGSTEPYGVQEMTFLEEEEVFPTIPISDASEKFEDSPKLSVSVPPLKEETNEANNTN 227
Query: 253 NVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKC 312
V EK RK D+Q +KKEANKPPDSWFELK+NTHVYVTGLP+DVT +E+VEVFSKC
Sbjct: 228 VVSGEK----RKLSDQQTDKKEANKPPDSWFELKINTHVYVTGLPEDVTTDEIVEVFSKC 283
Query: 313 GIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMS 372
GIIKEDPETKKPR+K+YVDK TG KKGDALVTYLKEPSVALA Q+LDG P RP+GKIPMS
Sbjct: 284 GIIKEDPETKKPRVKLYVDKGTGRKKGDALVTYLKEPSVALAIQILDGAPLRPNGKIPMS 343
Query: 373 VTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAE 432
V+QAKFEQKG++F++KQVD+KKKKKLKKVE+KMLGWGGRDDAK++IPATVILR+MF PAE
Sbjct: 344 VSQAKFEQKGDKFVSKQVDNKKKKKLKKVEDKMLGWGGRDDAKVSIPATVILRYMFAPAE 403
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRW 492
MRADENLR ELE DV+EEC K+GP+DSVK+CENHPQGVVLVRFKDRKDAQKCIELMNGRW
Sbjct: 404 MRADENLRLELEEDVKEECTKLGPLDSVKICENHPQGVVLVRFKDRKDAQKCIELMNGRW 463
Query: 493 FGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
FGGRQIHASEDDG VNHA +RDL+ +A RLEQFGAELE D
Sbjct: 464 FGGRQIHASEDDGSVNHALVRDLEEDAIRLEQFGAELEGD 503
>gi|255543755|ref|XP_002512940.1| Splicing factor U2AF-associated protein, putative [Ricinus
communis]
gi|223547951|gb|EEF49443.1| Splicing factor U2AF-associated protein, putative [Ricinus
communis]
Length = 518
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/538 (62%), Positives = 400/538 (74%), Gaps = 62/538 (11%)
Query: 26 WYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQQVA 85
WYILD+NQQ GPYAI E+ EHFLNG+L E++ VW++GR +WQPL +IP LS ++ Q A
Sbjct: 12 WYILDDNQQSFGPYAIHEMREHFLNGFLSESSFVWTEGRVDWQPLFAIPDLLSQLTLQRA 71
Query: 86 RGSTAVPCNDGIE-----EVREQIEEAAGMQSQSFSSAEQGVPSHV-------DDEFEKW 133
S + N IE +VR G+Q S SS +Q + + +DEFEKW
Sbjct: 72 DTSLSASINSDIESEKWNDVRGAERGIVGLQDGSQSSNKQQINTSSAVRSNDNEDEFEKW 131
Query: 134 QREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLR 193
QRE+ EAE EA+R + + PEGE+EFTDDDGT YKWDRGLR
Sbjct: 132 QREISEAEAEADRPQ---------------------SPPEGEEEFTDDDGTTYKWDRGLR 170
Query: 194 AWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADN 253
AWVPQ++TSS YG+EEMTFL+EE+VFPTV++ N+ KE++N + E
Sbjct: 171 AWVPQDNTSSVTGQYGLEEMTFLQEEDVFPTVDI-----NNAAFKEEINGSGES-----E 220
Query: 254 VVEEKHNGKRK----------QPDKQVE---------KKEANKPPDSWFELKVNTHVYVT 294
+E KHNGKRK Q DK + KKEANK PDSWFELKVNTHVY+T
Sbjct: 221 TLESKHNGKRKLQGLQDDSKMQADKDTQPDKKEKEADKKEANKAPDSWFELKVNTHVYIT 280
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLPDDVT EE+VEVFSKCGIIKEDPETKKPR+KIYVDKETG KGDALVT+LKEPSV LA
Sbjct: 281 GLPDDVTSEEVVEVFSKCGIIKEDPETKKPRVKIYVDKETGRIKGDALVTFLKEPSVDLA 340
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDA 414
Q+LDGTP RP G +PMSV++AKF+QKG+RFI KQ D+KKKKKLK+VEE++LGWGGRDD
Sbjct: 341 LQILDGTPLRPGGAVPMSVSRAKFQQKGDRFIPKQADNKKKKKLKRVEERILGWGGRDDV 400
Query: 415 KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
K++IPATV+LR+MFTPAEMR DENLRSELE D++EECVK+GPVDSVKVCENHPQGVVLV+
Sbjct: 401 KVSIPATVVLRYMFTPAEMRTDENLRSELEVDIREECVKLGPVDSVKVCENHPQGVVLVK 460
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
FKDRKDAQ CIELMNGRWFGGRQ+HASEDDG VNHA +RDLD +A+RLEQFGAELE +
Sbjct: 461 FKDRKDAQNCIELMNGRWFGGRQVHASEDDGSVNHALVRDLDQDAARLEQFGAELEGE 518
>gi|356576634|ref|XP_003556435.1| PREDICTED: HIV Tat-specific factor 1 homolog [Glycine max]
Length = 500
Score = 626 bits (1615), Expect = e-177, Method: Compositional matrix adjust.
Identities = 331/520 (63%), Positives = 386/520 (74%), Gaps = 44/520 (8%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
E E GWY+L E+QQ +GPYA SELC+HFLNGYL E T VWS+G SEWQPLSS+ +
Sbjct: 19 EKVTEVGWYVLGEDQQQIGPYAFSELCQHFLNGYLSENTFVWSEGSSEWQPLSSVSDLWA 78
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
I++Q ST V D DEFE+WQ+E++
Sbjct: 79 QINRQGPDSSTTVSAPDV-------------------------------DEFERWQKEIQ 107
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQ 198
E E + E + GS S +VGG +D + PEGE+ FTDDDGT YKWDR LRAWVPQ
Sbjct: 108 EVEAQVEGSEFGSLSGNVGGTGAGEDSERPSTPPEGEEGFTDDDGTVYKWDRSLRAWVPQ 167
Query: 199 EDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEK 258
+ + YG+EEMTFL+EEEVFPT+ +D EK + + S + EE+
Sbjct: 168 DYPTGSTKPYGVEEMTFLEEEEVFPTIPNSD-------ASEKFEDSPKLSVSVPPLKEEE 220
Query: 259 HN------GKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKC 312
+N GKR D+Q +KKEANKPPDSWFELK+NTHVYVTGLP+DVT +E+VEVFSKC
Sbjct: 221 NNTNVISGGKRMLSDQQTDKKEANKPPDSWFELKINTHVYVTGLPEDVTTDEIVEVFSKC 280
Query: 313 GIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMS 372
GIIKEDPETK+PR+K+YVDKETG KKGDALVTYLKEPSVALA Q+LDG P RP GKIPMS
Sbjct: 281 GIIKEDPETKRPRVKLYVDKETGRKKGDALVTYLKEPSVALAIQILDGAPLRPGGKIPMS 340
Query: 373 VTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAE 432
V+QAKFEQKG++F++KQVD KKKKKLKKVE+KMLGWGGRDDAK++IPATVILR+MF PAE
Sbjct: 341 VSQAKFEQKGDKFVSKQVDGKKKKKLKKVEDKMLGWGGRDDAKVSIPATVILRYMFAPAE 400
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRW 492
MRADENL ELE DV+EEC K+GPVDSVK+CENHPQGVVLVRFKDRKDAQKCIELMNGRW
Sbjct: 401 MRADENLHLELEEDVKEECTKLGPVDSVKICENHPQGVVLVRFKDRKDAQKCIELMNGRW 460
Query: 493 FGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
FGGRQIHASEDDG VNHA +RDL+ + RLEQFGAELE D
Sbjct: 461 FGGRQIHASEDDGSVNHALVRDLEEDVIRLEQFGAELEGD 500
>gi|449518089|ref|XP_004166076.1| PREDICTED: HIV Tat-specific factor 1 homolog [Cucumis sativus]
Length = 498
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 331/523 (63%), Positives = 392/523 (74%), Gaps = 39/523 (7%)
Query: 17 NYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQF 76
N E E GWYIL ENQQHVGPYA SEL EHFLNGYLLE+TL WS+G+SEWQPLSSIP
Sbjct: 8 NLEMVTEAGWYILGENQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGL 67
Query: 77 LSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQRE 136
+ + Q + T VP N+ + DDE EK+Q+E
Sbjct: 68 TTEVYGQDSNLPTTVPANN-----------------------------NDDDELEKYQKE 98
Query: 137 VREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWV 196
V E E ++ + S + G G + P PEGE+EFTDDDGT YKWDR LRAWV
Sbjct: 99 VGETEATT-KVSSPSGGRNFGLVEGDLERPTTP--PEGEEEFTDDDGTPYKWDRVLRAWV 155
Query: 197 PQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTE-------EKVN 249
PQ+D +++ Y EEMTF++EEEVFP ++ + + + + ST ++ N
Sbjct: 156 PQDDAFFKHEQYRPEEMTFMQEEEVFPQLDADAPCTSIKEEGDSVPSTSIEADHITKETN 215
Query: 250 SADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVF 309
E K N KRK QVEKKEANK PD WFELK+NTHVYVTGLP+DVT++E+VEVF
Sbjct: 216 GKSEETETKKNVKRKLSGNQVEKKEANKGPDGWFELKINTHVYVTGLPEDVTIDEVVEVF 275
Query: 310 SKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKI 369
SKCGIIKEDPETKKPR+K+YVD+ETG KKGDALV+Y+KEPSVALA Q+LDGTP RP GK+
Sbjct: 276 SKCGIIKEDPETKKPRVKLYVDRETGKKKGDALVSYMKEPSVALAMQILDGTPLRPGGKM 335
Query: 370 PMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFT 429
MSVTQAKFEQKG++F++K+VD+KKKKKLKKVE+K+LGWGGRDDAK++IPATVILRFMFT
Sbjct: 336 LMSVTQAKFEQKGDKFVSKKVDNKKKKKLKKVEDKILGWGGRDDAKVSIPATVILRFMFT 395
Query: 430 PAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMN 489
PAEMRADENL SE+E DV+EE K GPVDSVKVCENHPQGVVL+RFKDRKDAQKCIELMN
Sbjct: 396 PAEMRADENLASEIETDVKEESTKFGPVDSVKVCENHPQGVVLIRFKDRKDAQKCIELMN 455
Query: 490 GRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
GRWFGG+QIHASEDDGLVNHA +RDL+A+A+RLEQFG+ELEAD
Sbjct: 456 GRWFGGKQIHASEDDGLVNHAMVRDLEADAARLEQFGSELEAD 498
>gi|449457363|ref|XP_004146418.1| PREDICTED: HIV Tat-specific factor 1 homolog [Cucumis sativus]
Length = 496
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 331/523 (63%), Positives = 389/523 (74%), Gaps = 41/523 (7%)
Query: 17 NYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQF 76
N E E GWYIL ENQQHVGPYA SEL EHFLNGYLLE+TL WS+G+SEWQPLSSIP
Sbjct: 8 NLEMVTEAGWYILGENQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGL 67
Query: 77 LSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQRE 136
+ + Q + T VP N+ + DDE EK+Q+E
Sbjct: 68 TTEVYGQDSNLPTTVPANN-----------------------------NDDDELEKYQKE 98
Query: 137 VREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWV 196
V E E + G V G + R PEGE+EFTDDDGT YKWDR LRAWV
Sbjct: 99 VGETEATTKVPSGGRNFGLVEGDLERPT-----TPPEGEEEFTDDDGTPYKWDRVLRAWV 153
Query: 197 PQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTE-------EKVN 249
PQ+D +++ Y EEMTF++EEEVFP ++ + + + + ST ++ N
Sbjct: 154 PQDDAFFKHEQYRPEEMTFMQEEEVFPQLDADAPCTSIKEEGDSVPSTSIEADHITKETN 213
Query: 250 SADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVF 309
E K N KRK QVEKKEANK PD WFELK+NTHVYVTGLP+DVT++E+VEVF
Sbjct: 214 GKSEETETKKNVKRKLSGNQVEKKEANKGPDGWFELKINTHVYVTGLPEDVTIDEVVEVF 273
Query: 310 SKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKI 369
SKCGIIKEDPETKKPR+K+YVD+ETG KKGDALV+Y+KEPSVALA Q+LDGTP RP GK+
Sbjct: 274 SKCGIIKEDPETKKPRVKLYVDRETGKKKGDALVSYMKEPSVALAMQILDGTPLRPGGKM 333
Query: 370 PMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFT 429
MSVTQAKFEQKG++F++K+VD+KKKKKLKKVE+K+LGWGGRDDAK++IPATVILRFMFT
Sbjct: 334 LMSVTQAKFEQKGDKFVSKKVDNKKKKKLKKVEDKILGWGGRDDAKVSIPATVILRFMFT 393
Query: 430 PAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMN 489
PAEMRADENL SE+E DV+EE K GPVDSVKVCENHPQGVVL+RFKDRKDAQKCIELMN
Sbjct: 394 PAEMRADENLASEIETDVKEESTKFGPVDSVKVCENHPQGVVLIRFKDRKDAQKCIELMN 453
Query: 490 GRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
GRWFGG+QIHASEDDGLVNHA +RDL+A+A+RLEQFG+ELEAD
Sbjct: 454 GRWFGGKQIHASEDDGLVNHAMVRDLEADAARLEQFGSELEAD 496
>gi|297807655|ref|XP_002871711.1| RNA recognition motif-containing protein [Arabidopsis lyrata subsp.
lyrata]
gi|297317548|gb|EFH47970.1| RNA recognition motif-containing protein [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 616 bits (1589), Expect = e-174, Method: Compositional matrix adjust.
Identities = 338/543 (62%), Positives = 402/543 (74%), Gaps = 38/543 (6%)
Query: 3 LDDVDSQQQLSGATNYETAGEE-GWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWS 61
+ D D+ Q AT A + GWYIL ENQQ++GPY +SELC+HF NGYLLETTLVW+
Sbjct: 1 MSDSDNLQLPPSATGATVAATDLGWYILGENQQNLGPYTVSELCDHFRNGYLLETTLVWA 60
Query: 62 QGRSEWQPLSSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQ- 120
+GRSEWQPLS+IP+ +S IS G E V + A+G+ + S + EQ
Sbjct: 61 EGRSEWQPLSAIPELMSRIS--------------GAEIVYPAVG-ASGLVNGSNAGTEQE 105
Query: 121 ----GVPSHVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGED 176
+ +DEFEKWQRE++EAE EAERLKNGS S G + DDH+ + PEGED
Sbjct: 106 KQDYTASASTEDEFEKWQREIKEAEEEAERLKNGSVS---GTELVEDDHERASSPPEGED 162
Query: 177 EFTDDDGTRYKWDRGLRAWVPQEDTS-SQNDGYGIEEMTFLKEEEVFPTVNVTD------ 229
EFTDDDGTRYKWDR R WVPQ+D D YG+EEMTF KE+EVFPT+N+ D
Sbjct: 163 EFTDDDGTRYKWDRARRVWVPQDDPPLGSVDPYGLEEMTFAKEDEVFPTINILDTSVDKK 222
Query: 230 DLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNT 289
D A D+V +K EE + +D E NGKRK P+ + EKKE NKPPDSWFELKVN
Sbjct: 223 DAAKDDVAGKK----EE--DGSDETAEINSNGKRKLPEPETEKKEPNKPPDSWFELKVNP 276
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
H+YVTGLPDDVT+EE+ EVFSKCGIIKED ET KPRIK+Y DK TG KGDAL+TY+KEP
Sbjct: 277 HIYVTGLPDDVTLEEVAEVFSKCGIIKED-ETGKPRIKLYSDKGTGKLKGDALITYMKEP 335
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
SV LA ++LDG P RP K+ MSV++AKFEQKGERFI KQ D+KKKKKLKKVE+K+LGWG
Sbjct: 336 SVDLAIKILDGAPLRPADKLLMSVSRAKFEQKGERFITKQTDNKKKKKLKKVEQKLLGWG 395
Query: 410 GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG 469
G DDAK++IPATV+LR+MF+PAE+R DE+L +ELE DV+EE +K GP DSVKVCE+HPQG
Sbjct: 396 GTDDAKVSIPATVVLRYMFSPAELRTDEDLVAELEEDVKEESLKHGPFDSVKVCEHHPQG 455
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAEL 529
VVLVRFKDR+DAQKCIE MNGRW+ RQIHAS DDG VNHA +RD D EA RL+QF AEL
Sbjct: 456 VVLVRFKDRRDAQKCIEAMNGRWYAKRQIHASLDDGSVNHATVRDFDLEAERLDQFSAEL 515
Query: 530 EAD 532
EAD
Sbjct: 516 EAD 518
>gi|15237311|ref|NP_197130.1| HIV Tat-specific factor 1 [Arabidopsis thaliana]
gi|9755836|emb|CAC01867.1| putative protein [Arabidopsis thaliana]
gi|63003814|gb|AAY25436.1| At5g16260 [Arabidopsis thaliana]
gi|110738266|dbj|BAF01062.1| hypothetical protein [Arabidopsis thaliana]
gi|332004883|gb|AED92266.1| HIV Tat-specific factor 1 [Arabidopsis thaliana]
Length = 519
Score = 610 bits (1572), Expect = e-172, Method: Compositional matrix adjust.
Identities = 327/541 (60%), Positives = 397/541 (73%), Gaps = 36/541 (6%)
Query: 3 LDDVDSQQQLSGATNYETAGEE-GWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWS 61
+ D D+ Q +T A + GWYIL ENQQ++GPY SELC HF NGYLLETTLVW+
Sbjct: 1 MSDSDNLQLPPSSTGATVAATDVGWYILGENQQNLGPYTFSELCNHFRNGYLLETTLVWA 60
Query: 62 QGRSEWQPLSSIPQFLSGIS----QQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSS 117
GRSEWQPLS+IP +S IS A GS+ + N R++ Q +++
Sbjct: 61 DGRSEWQPLSAIPDLMSRISGAEIAYPAVGSSGL-INGSNAGTRQE--------KQDYTA 111
Query: 118 AEQGVPSHVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDE 177
+ + +DEFEKWQRE+++AE EAERLKNGS S G + DDH+ + PEGEDE
Sbjct: 112 S-----ASTEDEFEKWQREIKDAEAEAERLKNGSVS---GTELVEDDHERASSPPEGEDE 163
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTS-SQNDGYGIEEMTFLKEEEVFPTVNVTD------D 230
FTDDDGT+YKWDR R WVPQ+D D YG+EEMTF KE+EVFPT+N+ D D
Sbjct: 164 FTDDDGTKYKWDRARRVWVPQDDPPLGSVDPYGLEEMTFAKEDEVFPTINILDTSVDKKD 223
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTH 290
+ D+V +K EE + +D E NGKRK P+ + EKKE NKPPDSWFELKVN H
Sbjct: 224 ASKDDVAGKK----EE--DGSDETAEINSNGKRKLPEPETEKKEPNKPPDSWFELKVNPH 277
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV GLPDDVT+EE+ EVFSKCGIIKED +T KPRIK+Y DK TG KGDAL++Y+KEPS
Sbjct: 278 IYVNGLPDDVTIEEVAEVFSKCGIIKED-DTGKPRIKLYSDKATGKLKGDALISYMKEPS 336
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
V LA ++LDG P RP K+ MSV++AKFEQKGERFI KQ D+KKKKKLKKVE+K+LGWGG
Sbjct: 337 VDLAIKILDGAPLRPADKLLMSVSRAKFEQKGERFITKQTDNKKKKKLKKVEQKLLGWGG 396
Query: 411 RDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
DD+K++IPATV+LR+MF+PAE+ ADE+L +ELE DV+EE +K GP DSVKVCE+HPQGV
Sbjct: 397 TDDSKVSIPATVVLRYMFSPAELMADEDLVAELEEDVKEESLKHGPFDSVKVCEHHPQGV 456
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
VLVRFKDR+DAQKCIE MNGRW+ RQIHAS DDG VNHA +RD D EA RL+QF AELE
Sbjct: 457 VLVRFKDRRDAQKCIEAMNGRWYAKRQIHASLDDGSVNHATVRDFDLEAERLDQFAAELE 516
Query: 531 A 531
A
Sbjct: 517 A 517
>gi|413924809|gb|AFW64741.1| hypothetical protein ZEAMMB73_003429 [Zea mays]
Length = 475
Score = 591 bits (1523), Expect = e-166, Method: Compositional matrix adjust.
Identities = 313/511 (61%), Positives = 365/511 (71%), Gaps = 47/511 (9%)
Query: 20 TAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSG 79
A E GWY+L NQ+ VGPYA++EL EHF NGY E+T++W++GR EW PLSSI + S
Sbjct: 9 AATEAGWYVLGPNQEGVGPYALAELREHFANGYFNESTMLWAEGRKEWMPLSSISELHSA 68
Query: 80 ISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVRE 139
+ AA QS EQ P VDD+FEK+Q+EV E
Sbjct: 69 V--------------------------AAKDQS------EQDAPD-VDDDFEKFQKEVIE 95
Query: 140 AEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQE 199
AE E + LK G + + D + P+GE+EFTDDDGT YKWDR LRAWVPQ
Sbjct: 96 AETEVKALK--------GNDVDQQDDERPATPPDGEEEFTDDDGTIYKWDRTLRAWVPQN 147
Query: 200 DTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKH 259
D S + Y +EEMTF EEE+F ++ E++N+ E N + E +
Sbjct: 148 DASGSKEDYAVEEMTFAVEEEIFQAPDILGP------ALEEINNLPESKNKGSDKAEMRG 201
Query: 260 NGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDP 319
+ KRK + EKKEANKPPDSWF+LKVNTHVYVTGLPDDVTVEE+VEVFSKCGIIKEDP
Sbjct: 202 DKKRKSYENPAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVEEIVEVFSKCGIIKEDP 261
Query: 320 ETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
ETKKPR+KIY D+ETG KKGDALVTY KEPSVALA QLLDGTPFRP GK MSV+ AKFE
Sbjct: 262 ETKKPRVKIYTDRETGRKKGDALVTYFKEPSVALAVQLLDGTPFRPGGKTHMSVSPAKFE 321
Query: 380 QKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENL 439
QKG+ F++K+ D +KK+K+KKVE+KMLGWGG DD KL IPATVILR MFTPAE+RADE L
Sbjct: 322 QKGDVFVSKKTDKQKKRKIKKVEDKMLGWGGHDDKKLMIPATVILRHMFTPAELRADEEL 381
Query: 440 RSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIH 499
ELEADV+EEC+K GPVD+VKVCENHPQGV+LVRFKDRKD KCIE MNGRWF GRQIH
Sbjct: 382 LPELEADVREECIKFGPVDNVKVCENHPQGVILVRFKDRKDGAKCIEKMNGRWFAGRQIH 441
Query: 500 ASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
ASEDDG VNHA IRD DAE SRL++FG ELE
Sbjct: 442 ASEDDGSVNHALIRDYDAEVSRLDRFGQELE 472
>gi|242069889|ref|XP_002450221.1| hypothetical protein SORBIDRAFT_05g002130 [Sorghum bicolor]
gi|241936064|gb|EES09209.1| hypothetical protein SORBIDRAFT_05g002130 [Sorghum bicolor]
Length = 469
Score = 590 bits (1521), Expect = e-166, Method: Compositional matrix adjust.
Identities = 313/521 (60%), Positives = 368/521 (70%), Gaps = 57/521 (10%)
Query: 11 QLSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPL 70
+ SGA A E GWY+L NQ+ VGPYA++EL EHF NGYL E+T++W++GR EW PL
Sbjct: 2 ETSGAAAV-AATEVGWYVLGPNQESVGPYALAELQEHFANGYLNESTMLWAEGRKEWMPL 60
Query: 71 SSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEF 130
SSIP+ S ++ + ++Q P VDD+F
Sbjct: 61 SSIPELQSAVTSK--------------------------------DQSKQDAPD-VDDDF 87
Query: 131 EKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASP-EGEDEFTDDDGTRYKWD 189
EK+Q+EV EAE + ++ D PA+P +GE+EFTDDDGT YKWD
Sbjct: 88 EKFQKEVTEAEADVDQ-----------------QDDERPATPPDGEEEFTDDDGTIYKWD 130
Query: 190 RGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVN 249
R LRAWVPQ D S + Y +EEMTF EEEVF ++ A +E+ N+ E N
Sbjct: 131 RTLRAWVPQNDASGSKENYAVEEMTFALEEEVFQAPDILGPSALEEI-----NTLSESKN 185
Query: 250 SADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVF 309
+ E + + KRK +K EKKEANKPP+SWF+LKVNTHVYVTGLPDDVT EE+VEVF
Sbjct: 186 KGSDKAETRGDKKRKSSEKPAEKKEANKPPESWFDLKVNTHVYVTGLPDDVTAEEIVEVF 245
Query: 310 SKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKI 369
SKCGIIKEDPETKKPR+KIY DKETG KKGDALVTY KEPSVALA QLLDGT FRP KI
Sbjct: 246 SKCGIIKEDPETKKPRVKIYTDKETGRKKGDALVTYFKEPSVALAVQLLDGTSFRPGVKI 305
Query: 370 PMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFT 429
PMSV+ AKFEQKG+ FI+K+ D +KK+K+KKVE+KMLGWGG DD KL IP TVILR MFT
Sbjct: 306 PMSVSPAKFEQKGDVFISKKTDKQKKRKIKKVEDKMLGWGGHDDKKLMIPTTVILRHMFT 365
Query: 430 PAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMN 489
PAE+RADE L SELE DV+EEC+K GPVD+VKVCENHPQGVVLV+FKDRKDA KCIE MN
Sbjct: 366 PAELRADEELLSELETDVREECIKFGPVDNVKVCENHPQGVVLVKFKDRKDAAKCIEKMN 425
Query: 490 GRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
GRWF GRQIHASEDDG VNH IRD DAE SRL++FG ELE
Sbjct: 426 GRWFAGRQIHASEDDGSVNHTLIRDYDAEVSRLDRFGEELE 466
>gi|242082602|ref|XP_002441726.1| hypothetical protein SORBIDRAFT_08g001370 [Sorghum bicolor]
gi|241942419|gb|EES15564.1| hypothetical protein SORBIDRAFT_08g001370 [Sorghum bicolor]
Length = 479
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 306/508 (60%), Positives = 361/508 (71%), Gaps = 43/508 (8%)
Query: 23 EEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQ 82
E GWY+L +QQ VGPYA+ EL EHF NGYL E+T++W+QGR EW PLSSIP+ S
Sbjct: 12 EVGWYVLGPDQQGVGPYALVELREHFANGYLNESTMLWAQGRKEWMPLSSIPELHS---- 67
Query: 83 QVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVREAEI 142
+ + S + Q VDD+F K+Q+EV EAE
Sbjct: 68 ---------------------------IATASKDQSTQAAAPDVDDDFAKFQKEVTEAEK 100
Query: 143 EAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQEDTS 202
E + GSA G + R D + P+GE+EFTDDDGT YKWD LRAWVPQ D S
Sbjct: 101 EVK----GSAE---VGDVDRQDDERPATPPDGEEEFTDDDGTIYKWDCTLRAWVPQNDVS 153
Query: 203 SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGK 262
+ DGY +EEMTF EEEVF ++ A +E+ N+ E N + E++ K
Sbjct: 154 GKKDGYAVEEMTFALEEEVFQAPDIPGPSALEEI-----NTPAEVKNKESDKAEKRGEKK 208
Query: 263 RKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETK 322
+K K EKKEANKPPDSWF+LKVNTHVYV GLPDDVTVEE+VEVFSKCGIIKEDPETK
Sbjct: 209 QKSSQKPAEKKEANKPPDSWFDLKVNTHVYVNGLPDDVTVEEIVEVFSKCGIIKEDPETK 268
Query: 323 KPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKG 382
KPR+KIY DKET KKGDALVTY KEPSVALA QLLDGT FRP GK MSV+ A+F+QKG
Sbjct: 269 KPRVKIYTDKETSRKKGDALVTYFKEPSVALAVQLLDGTSFRPGGKTLMSVSPARFQQKG 328
Query: 383 ERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSE 442
+ FIAK+ D +KK+K++KVE+KMLGWGG DD K+ IP TVILR MF+PAE+RADE+L SE
Sbjct: 329 DVFIAKKTDKQKKRKIRKVEDKMLGWGGHDDKKVMIPTTVILRHMFSPAELRADEDLLSE 388
Query: 443 LEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASE 502
LE DV+EEC K GPVD+VKVCENHPQGV+LV+FKDRKD KCIE MNGRWFGGRQIHASE
Sbjct: 389 LEVDVREECTKFGPVDNVKVCENHPQGVILVKFKDRKDGAKCIEKMNGRWFGGRQIHASE 448
Query: 503 DDGLVNHAAIRDLDAEASRLEQFGAELE 530
DDG +NH+ IRD DAE SRL++FG ELE
Sbjct: 449 DDGSINHSLIRDYDAEVSRLDRFGEELE 476
>gi|414588676|tpg|DAA39247.1| TPA: hypothetical protein ZEAMMB73_120405 [Zea mays]
Length = 544
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 309/520 (59%), Positives = 363/520 (69%), Gaps = 48/520 (9%)
Query: 11 QLSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPL 70
+ SGA A E GWY+L NQ+ VGPYA++EL EHF NGYL E+T++W++GR W L
Sbjct: 70 ETSGAAA--AATEVGWYVLGPNQESVGPYALAELREHFANGYLNESTMLWAEGRKAWMAL 127
Query: 71 SSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEF 130
SSIP S ++ + EQ P VD +F
Sbjct: 128 SSIPDLHSAVTAK--------------------------------DRFEQDAPD-VDGDF 154
Query: 131 EKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDR 190
EK+Q+EV EAE E + LK G + + D + P+GE EFTDDDGT YKWDR
Sbjct: 155 EKFQKEVTEAEAEVKALK--------GSDVDQQDDERPATPPDGESEFTDDDGTIYKWDR 206
Query: 191 GLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNS 250
LRAWVPQ D S +GY +EEMTF EEEVF ++ A +E+ ++ E N
Sbjct: 207 TLRAWVPQNDASGSKEGYAVEEMTFALEEEVFQAPDILGPSALEEI-----DTLSESKNK 261
Query: 251 ADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFS 310
+ E + + KRK +K EKKEANKPPDSWF+LKVNTHVY+TGLPDDVTVEE+VE FS
Sbjct: 262 GSDKAETRGDKKRKSSEKPSEKKEANKPPDSWFDLKVNTHVYITGLPDDVTVEEIVETFS 321
Query: 311 KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIP 370
KCGIIKEDPETKKPR+KIY DKETG KKGDALVTY KEPSVALA QLLDGT FR GK P
Sbjct: 322 KCGIIKEDPETKKPRVKIYTDKETGRKKGDALVTYFKEPSVALAVQLLDGTSFRSGGKTP 381
Query: 371 MSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTP 430
MSV+ AKFEQKG+ FI+K+ D +KK+K+KKVE+KMLGWGG DD KL IP TVILR MFTP
Sbjct: 382 MSVSPAKFEQKGDVFISKKSDKQKKRKIKKVEDKMLGWGGHDDKKLMIPTTVILRHMFTP 441
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
AE+RADE L SELE DV+EEC+K GPVD+VKVCENHPQGV+LV+FKDRKD KCIE MNG
Sbjct: 442 AELRADEELLSELETDVREECIKFGPVDNVKVCENHPQGVILVKFKDRKDGAKCIEKMNG 501
Query: 491 RWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
RWF G+QIHASEDDG VNH IRD DAE SRL++FG ELE
Sbjct: 502 RWFAGQQIHASEDDGSVNHTLIRDYDAEVSRLDRFGEELE 541
>gi|116310691|emb|CAH67490.1| H0306B06.5 [Oryza sativa Indica Group]
gi|116310709|emb|CAH67506.1| OSIGBa0092E01.1 [Oryza sativa Indica Group]
gi|125548880|gb|EAY94702.1| hypothetical protein OsI_16479 [Oryza sativa Indica Group]
Length = 476
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 309/513 (60%), Positives = 377/513 (73%), Gaps = 42/513 (8%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
E GE GWY+L NQ+HVGPYA+SEL EHF NGY+ E++++W++GRSEW PLSSIP L+
Sbjct: 2 EGGGEVGWYVLGPNQEHVGPYALSELREHFANGYISESSMLWAEGRSEWMPLSSIPDLLA 61
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
+++ ++Q +E ++D+F+K+Q+EV
Sbjct: 62 VVTK------------------KDQPDEG------------------IEDDFDKFQKEVI 85
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPASP-EGEDEFTDDDGTRYKWDRGLRAWVP 197
EAE E E + +A +++ G DD D PA+P +GEDEFTDDDGT YKWDR LRAWVP
Sbjct: 86 EAEAEVEASTDKAADNNINQEHGADDPDDRPATPPDGEDEFTDDDGTVYKWDRVLRAWVP 145
Query: 198 QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEE 257
Q+D +ND Y +E+MTF EEEVF D+A +E S E ++ V E+
Sbjct: 146 QDDLEGKNDNYEVEDMTFAHEEEVFQAP----DIAGSTTLEENNVSAEIEIKEPTKV-EK 200
Query: 258 KHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKE 317
+ + KRK +K +KKEANKPPDSWF+LKVNTHVYVTGLPDDVT EE+VEVFSKCGIIKE
Sbjct: 201 RADKKRKSSEKPADKKEANKPPDSWFDLKVNTHVYVTGLPDDVTAEEIVEVFSKCGIIKE 260
Query: 318 DPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
DPET+KPR+KIY D+ETG KKGDALVTYLKEPSVALA QLLDGT FRP GK MSV+ AK
Sbjct: 261 DPETRKPRVKIYTDRETGRKKGDALVTYLKEPSVALAIQLLDGTSFRPGGKTLMSVSPAK 320
Query: 378 FEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADE 437
FEQKG+ FI+K+ D +KK+K KKVE+K+LGWGG DD K+TI TVILR MFTPAE+RADE
Sbjct: 321 FEQKGDVFISKKTDKQKKRKTKKVEDKILGWGGHDDKKVTITTTVILRHMFTPAELRADE 380
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
L ELEADV+EEC+K+GPVD+VKVCENHP+GV+LV+FKDRKD KCIE MNGRWFGG Q
Sbjct: 381 TLLPELEADVREECMKLGPVDNVKVCENHPEGVILVKFKDRKDGIKCIEKMNGRWFGGNQ 440
Query: 498 IHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
I ASEDDG +NHA IRD DAE SRL++FG ELE
Sbjct: 441 IQASEDDGSINHALIRDYDAEVSRLDRFGEELE 473
>gi|224098794|ref|XP_002311268.1| predicted protein [Populus trichocarpa]
gi|222851088|gb|EEE88635.1| predicted protein [Populus trichocarpa]
Length = 624
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 344/626 (54%), Positives = 408/626 (65%), Gaps = 121/626 (19%)
Query: 9 QQQLSGATN-----YETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQG 63
QQ SGA N Y E GW+IL E+QQ VGPY SEL EHFLNGYL+E+TLVWS+G
Sbjct: 16 QQPYSGAGNGHDGSYNRVAEVGWFILGEDQQQVGPYTFSELSEHFLNGYLVESTLVWSEG 75
Query: 64 RSEWQPLSSIPQFLSGISQQVARGSTA-VPCNDG----IEEVREQIEEAAGMQSQSFSSA 118
RSEWQPLSS P+F SGISQQ + STA + ND ++E RE E G+++ S SS
Sbjct: 76 RSEWQPLSSFPEFTSGISQQGSDYSTAALAYNDKEVEKLQESREAELEFVGLRNGSHSSN 135
Query: 119 EQGV-------------------------------PSHVDDEFEKWQREVREAEIEAE-- 145
EQ P+ +DEFEKW+REV EAE EAE
Sbjct: 136 EQKAKHSTLGEDLLPPSFLILLVEAVLCYDLFAVSPNTDEDEFEKWKREVEEAEAEAEAE 195
Query: 146 RLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGT--------------------- 184
RLKNGS S + G +G DD D V + P+GEDEFTDDDGT
Sbjct: 196 RLKNGSLSGNTGDDLGIDDPDRVLSPPDGEDEFTDDDGTTYKWDGSLRAWVPQQLDTEDL 255
Query: 185 -RYKWDRG----------------LRA---------------------WVPQEDTSSQND 206
R+ RG LRA W + D S
Sbjct: 256 GRHPIVRGVAMNSVDQILMKFGLTLRAPAGFRAGVGKGRNEFMPRRSIWKGRADNPSSVS 315
Query: 207 G-YGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQ 265
G +G+EEMTF ++EEVF VN A D KE+ N T+E V S N + + + +
Sbjct: 316 GRFGVEEMTFHEQEEVFLNVN-----AADATLKEEFNVTDEVVGSQLNNKRKLRDKQADK 370
Query: 266 PDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPR 325
D+Q +KKEANK PDSWFELKVNTHVYVTGLPDDVT EE DPETKKPR
Sbjct: 371 KDEQADKKEANKAPDSWFELKVNTHVYVTGLPDDVTAEE-------------DPETKKPR 417
Query: 326 IKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF 385
+KIYVDKET KGDALVTYLKEPSV LA Q+LDGTP RP G IPMSV+QAKFEQ+G+RF
Sbjct: 418 VKIYVDKETRRVKGDALVTYLKEPSVDLAVQILDGTPLRPGGTIPMSVSQAKFEQRGDRF 477
Query: 386 IAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEA 445
I+KQ+DSKKK+KLKKVE+++LGWGGRDDAK++IPATV+LR +FT +EMRADE+L SELE
Sbjct: 478 ISKQIDSKKKRKLKKVEDRILGWGGRDDAKVSIPATVVLRHLFTLSEMRADESLGSELEV 537
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
DV+EECVK+GP+DS+KVCEN+P GVVLVRFKDR DA++CIELMNGRWFGGR+IHASEDDG
Sbjct: 538 DVREECVKLGPIDSIKVCENNPHGVVLVRFKDRNDARRCIELMNGRWFGGREIHASEDDG 597
Query: 506 LVNHAAIRDLDAEASRLEQFGAELEA 531
L+NHA++RDLD +A+RLEQFGAELEA
Sbjct: 598 LINHASVRDLDEDAARLEQFGAELEA 623
>gi|115459200|ref|NP_001053200.1| Os04g0496400 [Oryza sativa Japonica Group]
gi|38345380|emb|CAE03112.2| OSJNBa0067K08.9 [Oryza sativa Japonica Group]
gi|113564771|dbj|BAF15114.1| Os04g0496400 [Oryza sativa Japonica Group]
gi|125590877|gb|EAZ31227.1| hypothetical protein OsJ_15326 [Oryza sativa Japonica Group]
Length = 476
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 309/513 (60%), Positives = 376/513 (73%), Gaps = 42/513 (8%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
E GE GWY+L NQ+HVGPYA+SEL EHF NGY+ E++++W++GRSEW PLSSIP L+
Sbjct: 2 EGGGEVGWYVLGPNQEHVGPYALSELREHFANGYISESSMLWAEGRSEWMPLSSIPDLLA 61
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
+++ ++Q +E ++D+F+K+Q+EV
Sbjct: 62 VVTK------------------KDQPDEG------------------IEDDFDKFQKEVI 85
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPASP-EGEDEFTDDDGTRYKWDRGLRAWVP 197
EAE E E + +A + + G DD D PA+P +GEDEFTDDDGT YKWDR LRAWVP
Sbjct: 86 EAEAEVEASTDKAADNDINQEHGADDPDDRPATPPDGEDEFTDDDGTVYKWDRVLRAWVP 145
Query: 198 QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEE 257
Q+D +ND Y +E+MTF EEEVF D+A +E S E ++ V E+
Sbjct: 146 QDDLEGKNDNYEVEDMTFAHEEEVFQAP----DIAGSTTLEENNVSAEIEIKEPTKV-EK 200
Query: 258 KHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKE 317
+ + KRK +K +KKEANKPPDSWF+LKVNTHVYVTGLPDDVT EE+VEVFSKCGIIKE
Sbjct: 201 RADKKRKSSEKPADKKEANKPPDSWFDLKVNTHVYVTGLPDDVTAEEIVEVFSKCGIIKE 260
Query: 318 DPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
DPET+KPR+KIY D+ETG KKGDALVTYLKEPSVALA QLLDGT FRP GK MSV+ AK
Sbjct: 261 DPETRKPRVKIYTDRETGRKKGDALVTYLKEPSVALAIQLLDGTSFRPGGKTLMSVSPAK 320
Query: 378 FEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADE 437
FEQKG+ FI+K+ D +KK+K KKVE+K+LGWGG DD K+TI TVILR MFTPAE+RADE
Sbjct: 321 FEQKGDVFISKKTDKQKKRKTKKVEDKILGWGGHDDKKVTITTTVILRHMFTPAELRADE 380
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
L ELEADV+EEC+K+GPVD+VKVCENHP+GV+LV+FKDRKD KCIE MNGRWFGG Q
Sbjct: 381 TLLPELEADVREECMKLGPVDNVKVCENHPEGVILVKFKDRKDGIKCIEKMNGRWFGGNQ 440
Query: 498 IHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
I ASEDDG +NHA IRD DAE SRL++FG ELE
Sbjct: 441 IQASEDDGSINHALIRDYDAEVSRLDRFGEELE 473
>gi|226496095|ref|NP_001141229.1| uncharacterized protein LOC100273316 [Zea mays]
gi|194703396|gb|ACF85782.1| unknown [Zea mays]
gi|413924810|gb|AFW64742.1| hypothetical protein ZEAMMB73_003429 [Zea mays]
Length = 460
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 302/511 (59%), Positives = 353/511 (69%), Gaps = 62/511 (12%)
Query: 20 TAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSG 79
A E GWY+L NQ+ VGPYA++EL EHF NGY E+T++W++GR EW PLSSI + S
Sbjct: 9 AATEAGWYVLGPNQEGVGPYALAELREHFANGYFNESTMLWAEGRKEWMPLSSISELHSA 68
Query: 80 ISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVRE 139
+ AA QS EQ P VDD+FEK+Q+EV E
Sbjct: 69 V--------------------------AAKDQS------EQDAPD-VDDDFEKFQKEVIE 95
Query: 140 AEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQE 199
AE E + LK G + + D + P+GE+EFTDDDGT YKWDR LRAWVPQ
Sbjct: 96 AETEVKALK--------GNDVDQQDDERPATPPDGEEEFTDDDGTIYKWDRTLRAWVPQN 147
Query: 200 DTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKH 259
D S + Y +EEMTF EEE+F ++ E++N+ E N + E +
Sbjct: 148 DASGSKEDYAVEEMTFAVEEEIFQAPDILGP------ALEEINNLPESKNKGSDKAEMRG 201
Query: 260 NGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDP 319
+ KRK + EKKEANKPPDSWF+LKVNTHVYVTGLPDDVTVEE+VEVFSKCGIIKEDP
Sbjct: 202 DKKRKSYENPAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVEEIVEVFSKCGIIKEDP 261
Query: 320 ETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
ETKKPR+KIY D+ETG KKGDALVTY KEPSVALA QLLDGTPFRP G +
Sbjct: 262 ETKKPRVKIYTDRETGRKKGDALVTYFKEPSVALAVQLLDGTPFRPGGDV---------- 311
Query: 380 QKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENL 439
F++K+ D +KK+K+KKVE+KMLGWGG DD KL IPATVILR MFTPAE+RADE L
Sbjct: 312 -----FVSKKTDKQKKRKIKKVEDKMLGWGGHDDKKLMIPATVILRHMFTPAELRADEEL 366
Query: 440 RSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIH 499
ELEADV+EEC+K GPVD+VKVCENHPQGV+LVRFKDRKD KCIE MNGRWF GRQIH
Sbjct: 367 LPELEADVREECIKFGPVDNVKVCENHPQGVILVRFKDRKDGAKCIEKMNGRWFAGRQIH 426
Query: 500 ASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
ASEDDG VNHA IRD DAE SRL++FG ELE
Sbjct: 427 ASEDDGSVNHALIRDYDAEVSRLDRFGQELE 457
>gi|357164467|ref|XP_003580063.1| PREDICTED: HIV Tat-specific factor 1 homolog isoform 2
[Brachypodium distachyon]
Length = 477
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 298/509 (58%), Positives = 361/509 (70%), Gaps = 42/509 (8%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
+++G GWY+L N +HVGPYA+SEL EHF NGYL E+T++W+ GRSEW LSSIP+ S
Sbjct: 2 DSSGGVGWYVLGPNHEHVGPYALSELREHFANGYLNESTMLWADGRSEWMSLSSIPEIHS 61
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
G+++ ++Q E+ +D+FEK+Q+EV
Sbjct: 62 GVTK------------------KDQPEQ------------------ETEDDFEKFQKEVI 85
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPAS-PEGEDEFTDDDGTRYKWDRGLRAWVP 197
+AE E E LK A ++ G DD D P+S P+G +EF DDDGT YKWDR LRA VP
Sbjct: 86 DAESEVEALKPKDADGNLNKEHGIDDPDERPSSPPDGVEEFIDDDGTIYKWDRMLRAMVP 145
Query: 198 QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEE 257
Q D + D Y +E+MTF EEEVF ++ DE N++ EK VE+
Sbjct: 146 QNDEPGKKDDYRLEDMTFALEEEVFQPPDIPGSSTLDEN-----NASTEKEIKEPVKVEK 200
Query: 258 KHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKE 317
+ KRK +K VEKKEANKPP+SWF+LKVNTHVY+TGLPDDVT EE+VEVFSKCGIIKE
Sbjct: 201 RQEKKRKSSEKPVEKKEANKPPESWFDLKVNTHVYITGLPDDVTAEEIVEVFSKCGIIKE 260
Query: 318 DPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
DPET+KPR+KIY DKETG KKGDALVTYLKEPSV LA QLLDGT FRP GK MSV+ AK
Sbjct: 261 DPETRKPRVKIYTDKETGRKKGDALVTYLKEPSVPLAIQLLDGTSFRPGGKTLMSVSVAK 320
Query: 378 FEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADE 437
F+QKG+ F+AK+ D +KKKK KKVE+KMLGWGG DD K+ IP VILR MFTPAE+R DE
Sbjct: 321 FQQKGDVFMAKKADKQKKKKGKKVEDKMLGWGGHDDKKVMIPTQVILRNMFTPAELRTDE 380
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
L ELE DV++EC+K GPVD++KVCENHPQGVVLV+FKDRKD KCIE +N RWFGG+Q
Sbjct: 381 TLLPELEEDVRQECMKFGPVDNIKVCENHPQGVVLVKFKDRKDGIKCIEALNERWFGGKQ 440
Query: 498 IHASEDDGLVNHAAIRDLDAEASRLEQFG 526
IHASEDDG + H+ IRD DAE SRLE+FG
Sbjct: 441 IHASEDDGSIKHSLIRDYDAEVSRLERFG 469
>gi|357164465|ref|XP_003580062.1| PREDICTED: HIV Tat-specific factor 1 homolog isoform 1
[Brachypodium distachyon]
Length = 489
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 300/521 (57%), Positives = 363/521 (69%), Gaps = 54/521 (10%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
+++G GWY+L N +HVGPYA+SEL EHF NGYL E+T++W+ GRSEW LSSIP+ S
Sbjct: 2 DSSGGVGWYVLGPNHEHVGPYALSELREHFANGYLNESTMLWADGRSEWMSLSSIPEIHS 61
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
G+++ ++Q E+ +D+FEK+Q+EV
Sbjct: 62 GVTK------------------KDQPEQ------------------ETEDDFEKFQKEVI 85
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPAS-PEGEDEFTDDDGTRYKWDRGLRAWVP 197
+AE E E LK A ++ G DD D P+S P+G +EF DDDGT YKWDR LRA VP
Sbjct: 86 DAESEVEALKPKDADGNLNKEHGIDDPDERPSSPPDGVEEFIDDDGTIYKWDRMLRAMVP 145
Query: 198 QEDTSSQNDG------------YGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTE 245
Q D S + DG Y +E+MTF EEEVF ++ DE N++
Sbjct: 146 QNDVSGKTDGYEENDEPGKKDDYRLEDMTFALEEEVFQPPDIPGSSTLDEN-----NAST 200
Query: 246 EKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEM 305
EK VE++ KRK +K VEKKEANKPP+SWF+LKVNTHVY+TGLPDDVT EE+
Sbjct: 201 EKEIKEPVKVEKRQEKKRKSSEKPVEKKEANKPPESWFDLKVNTHVYITGLPDDVTAEEI 260
Query: 306 VEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRP 365
VEVFSKCGIIKEDPET+KPR+KIY DKETG KKGDALVTYLKEPSV LA QLLDGT FRP
Sbjct: 261 VEVFSKCGIIKEDPETRKPRVKIYTDKETGRKKGDALVTYLKEPSVPLAIQLLDGTSFRP 320
Query: 366 DGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILR 425
GK MSV+ AKF+QKG+ F+AK+ D +KKKK KKVE+KMLGWGG DD K+ IP VILR
Sbjct: 321 GGKTLMSVSVAKFQQKGDVFMAKKADKQKKKKGKKVEDKMLGWGGHDDKKVMIPTQVILR 380
Query: 426 FMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI 485
MFTPAE+R DE L ELE DV++EC+K GPVD++KVCENHPQGVVLV+FKDRKD KCI
Sbjct: 381 NMFTPAELRTDETLLPELEEDVRQECMKFGPVDNIKVCENHPQGVVLVKFKDRKDGIKCI 440
Query: 486 ELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFG 526
E +N RWFGG+QIHASEDDG + H+ IRD DAE SRLE+FG
Sbjct: 441 EALNERWFGGKQIHASEDDGSIKHSLIRDYDAEVSRLERFG 481
>gi|296080862|emb|CBI18792.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 533 bits (1373), Expect = e-149, Method: Compositional matrix adjust.
Identities = 302/522 (57%), Positives = 362/522 (69%), Gaps = 52/522 (9%)
Query: 9 QQQLSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQ 68
QQQ +G N + E GWYIL ENQQ++GPYA SEL EHFLNGYL E +L+WS+GRS+WQ
Sbjct: 14 QQQATGVDNCD---EVGWYILGENQQNLGPYAFSELREHFLNGYLSENSLLWSEGRSDWQ 70
Query: 69 PLSSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDD 128
PLSSIP+ + ISQ S+A P P + +D
Sbjct: 71 PLSSIPELTTAISQPGVDCSSAGP------------------------------PINDED 100
Query: 129 EFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKW 188
EFEKWQ+EVREAE LKNGSAS SVGG G +D++ P+GEDEFTDDDGT YKW
Sbjct: 101 EFEKWQKEVREAEA----LKNGSASGSVGGDFGDEDNERPSTPPDGEDEFTDDDGTTYKW 156
Query: 189 DRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKV 248
DRGLRAWVPQ + + Y TF ++T L+ ++ L +
Sbjct: 157 DRGLRAWVPQVTSWTCALVYYYSSRTF----------SMTVFLSWSKILSFFLRKKDFIR 206
Query: 249 NSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEV 308
S D + RK P++Q EKKEANKPPDSWF+LKVNTHVYVTGLPDDVTV+E+VEV
Sbjct: 207 QSPDK--KGAQPTYRKLPEQQAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVDEVVEV 264
Query: 309 FSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGK 368
FSKCG+IKED + K+ RI+IY + LK+ + L+ ++ P G
Sbjct: 265 FSKCGLIKEDSQGKQYRIQIY---GCAYVYCCIFIFLLKKYFIVLSCHVIPYISSMPVGT 321
Query: 369 IPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMF 428
IPMSVT AKFEQKGE+F+AKQ+D +KKKKLK+VE+K+LGWGG DDAKL+IPATV+LR+MF
Sbjct: 322 IPMSVTLAKFEQKGEKFVAKQIDKRKKKKLKRVEDKILGWGGHDDAKLSIPATVVLRYMF 381
Query: 429 TPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELM 488
TPAEMRAD NLRSELE DVQEEC+K+G VD VKVCE+HPQGVVLV++KDR+DAQKCIELM
Sbjct: 382 TPAEMRADPNLRSELEGDVQEECIKLGSVDLVKVCESHPQGVVLVKYKDRRDAQKCIELM 441
Query: 489 NGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
NGRWFGGRQIHASEDDG VNHA +RDLDA+A RLE FGAELE
Sbjct: 442 NGRWFGGRQIHASEDDGSVNHALVRDLDADAERLEAFGAELE 483
>gi|294462381|gb|ADE76739.1| unknown [Picea sitchensis]
Length = 525
Score = 526 bits (1355), Expect = e-146, Method: Compositional matrix adjust.
Identities = 290/540 (53%), Positives = 368/540 (68%), Gaps = 28/540 (5%)
Query: 6 VDSQQQLSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRS 65
+DS + + N E GWYIL +NQ++VGPY ++EL EH+ +GY E+TL+W++GR+
Sbjct: 1 MDSATESASDPNALAGQELGWYILGDNQEYVGPYTVAELQEHYASGYFTESTLLWAEGRN 60
Query: 66 EWQPLSSIPQFLSGI---SQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGV 122
W PLSSI + S + Q+ V ++ + Q G + S +S
Sbjct: 61 NWMPLSSIQELHSLVISDETQLKSYENDVSYSENQSHSQPQFN---GKTTASLTSDASLK 117
Query: 123 PSHVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMG---------RDDHDGVPASP- 172
S DD+F +W+ EVR+AE EAE LK+G + DD D P +P
Sbjct: 118 ASDSDDDFLRWKEEVRQAEAEAEALKSGKTKRAEVYVQSAKEEELETQTDDADERPRTPP 177
Query: 173 EGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLA 232
EGE EFTDDDGT Y+WD LRAWVPQ+D+ ++ YG+EEMTF++EEE+ P + + D
Sbjct: 178 EGEQEFTDDDGTVYRWDHNLRAWVPQDDSFHKDSLYGLEEMTFVEEEEIMPAIMIAD--- 234
Query: 233 NDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVY 292
S E++ VE KH KRK Q EKKEAN P DSWF+LKVNTHVY
Sbjct: 235 ---------ISDEKECKDDSLAVESKHGLKRKLEADQDEKKEANTPADSWFDLKVNTHVY 285
Query: 293 VTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVA 352
VTGLP+DVT EE+VE FSKCGIIKED +TKKPR+KIYVDKETG +KGDALVTYLKEPSV
Sbjct: 286 VTGLPEDVTTEEIVEAFSKCGIIKEDLDTKKPRVKIYVDKETGRQKGDALVTYLKEPSVD 345
Query: 353 LATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRD 412
LA Q+LDGTP RP G+ MSV+ AKFEQKG+ FI KQ +++KKKK+K+VE+K LGWGG D
Sbjct: 346 LAIQILDGTPLRPGGRQLMSVSIAKFEQKGDTFIKKQQNNQKKKKIKRVEQKALGWGGFD 405
Query: 413 DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
DAK +P +V+L+ MF PAE+R+D L S+LEAD+ EEC K+GP++ +KV ENHP G +L
Sbjct: 406 DAKRVLPLSVLLKNMFRPAELRSDTTLLSDLEADIAEECSKVGPIERIKVYENHPLGAIL 465
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
V+FKDR+D KCI+LMNGRWFGGRQI A EDDG +NHA +RD EA RLE+FGAELEAD
Sbjct: 466 VKFKDRRDGLKCIQLMNGRWFGGRQIQAVEDDGTINHALVRDESEEAERLERFGAELEAD 525
>gi|359495884|ref|XP_003635108.1| PREDICTED: HIV Tat-specific factor 1 homolog, partial [Vitis
vinifera]
Length = 322
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 242/332 (72%), Positives = 285/332 (85%), Gaps = 13/332 (3%)
Query: 199 EDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEK 258
++ S+++D Y EEMTF EEE+FPT+ V +D + ++VN D V E K
Sbjct: 1 DNPSTRSDEYKPEEMTFSVEEEIFPTIQVAED-------------SVKEVNGTDVVEETK 47
Query: 259 HNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKED 318
H+ KRK P++Q EKKEANKPPDSWF+LKVNTHVYVTGLPDDVTV+E+VEVFSKCG+IKED
Sbjct: 48 HDAKRKLPEQQAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVDEVVEVFSKCGLIKED 107
Query: 319 PETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF 378
PET++PR+K+Y+DK TG KKGDALV+YLKEPSVALA Q+LDGTP RP G IPMSVT AKF
Sbjct: 108 PETRRPRVKLYIDKNTGRKKGDALVSYLKEPSVALAIQILDGTPLRPVGTIPMSVTLAKF 167
Query: 379 EQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADEN 438
EQKGE+F+AKQ+D +KKKKLK+VE+K+LGWGG DDAKL+IPATV+LR+MFTPAEMRAD N
Sbjct: 168 EQKGEKFVAKQIDKRKKKKLKRVEDKILGWGGHDDAKLSIPATVVLRYMFTPAEMRADPN 227
Query: 439 LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
LRSELE DVQEEC+K+G VD VKVCE+HPQGVVLV++KDR+DAQKCIELMNGRWFGGRQI
Sbjct: 228 LRSELEGDVQEECIKLGSVDLVKVCESHPQGVVLVKYKDRRDAQKCIELMNGRWFGGRQI 287
Query: 499 HASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
HASEDDG VNHA +RDLDA+A RLE FGAELE
Sbjct: 288 HASEDDGSVNHALVRDLDADAERLEAFGAELE 319
>gi|296083389|emb|CBI23344.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 237/318 (74%), Positives = 275/318 (86%), Gaps = 13/318 (4%)
Query: 213 MTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEK 272
MTF EEE+FPT+ V +D + ++VN D V E KH+ KRK P++Q EK
Sbjct: 1 MTFSVEEEIFPTIQVAED-------------SVKEVNGTDVVEETKHDAKRKLPEQQAEK 47
Query: 273 KEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK 332
KEANKPPDSWF+LKVNTHVYVTGLPDDVTV+E+VEVFSKCG+IKEDPET++PR+K+Y+DK
Sbjct: 48 KEANKPPDSWFDLKVNTHVYVTGLPDDVTVDEVVEVFSKCGLIKEDPETRRPRVKLYIDK 107
Query: 333 ETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS 392
TG KKGDALV+YLKEPSVALA Q+LDGTP RP G IPMSVT AKFEQKGE+F+AKQ+D
Sbjct: 108 NTGRKKGDALVSYLKEPSVALAIQILDGTPLRPVGTIPMSVTLAKFEQKGEKFVAKQIDK 167
Query: 393 KKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECV 452
+KKKKLK+VE+K+LGWGG DDAKL+IPATV+LR+MFTPAEMRAD NLRSELE DVQEEC+
Sbjct: 168 RKKKKLKRVEDKILGWGGHDDAKLSIPATVVLRYMFTPAEMRADPNLRSELEGDVQEECI 227
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAI 512
K+G VD VKVCE+HPQGVVLV++KDR+DAQKCIELMNGRWFGGRQIHASEDDG VNHA +
Sbjct: 228 KLGSVDLVKVCESHPQGVVLVKYKDRRDAQKCIELMNGRWFGGRQIHASEDDGSVNHALV 287
Query: 513 RDLDAEASRLEQFGAELE 530
RDLDA+A RLE FGAELE
Sbjct: 288 RDLDADAERLEAFGAELE 305
>gi|223946691|gb|ACN27429.1| unknown [Zea mays]
Length = 316
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 224/318 (70%), Positives = 255/318 (80%), Gaps = 5/318 (1%)
Query: 213 MTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEK 272
MTF EEEVF ++ A +E+ ++ E N + E + + KRK +K EK
Sbjct: 1 MTFALEEEVFQAPDILGPSALEEI-----DTLSESKNKGSDKAETRGDKKRKSSEKPSEK 55
Query: 273 KEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK 332
KEANKPPDSWF+LKVNTHVY+TGLPDDVTVEE+VE FSKCGIIKEDPETKKPR+KIY DK
Sbjct: 56 KEANKPPDSWFDLKVNTHVYITGLPDDVTVEEIVETFSKCGIIKEDPETKKPRVKIYTDK 115
Query: 333 ETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS 392
ETG KKGDALVTY KEPSVALA QLLDGT FR GK PMSV+ AKFEQKG+ FI+K+ D
Sbjct: 116 ETGRKKGDALVTYFKEPSVALAVQLLDGTSFRSGGKTPMSVSPAKFEQKGDVFISKKSDK 175
Query: 393 KKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECV 452
+KK+K+KKVE+KMLGWGG DD KL IP TVILR MFTPAE+RADE L SELE DV+EEC+
Sbjct: 176 QKKRKIKKVEDKMLGWGGHDDKKLMIPTTVILRHMFTPAELRADEELLSELETDVREECI 235
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAI 512
K GPVD+VKVCENHPQGV+LV+FKDRKD KCIE MNGRWF G+QIHASEDDG VNH I
Sbjct: 236 KFGPVDNVKVCENHPQGVILVKFKDRKDGAKCIEKMNGRWFAGQQIHASEDDGSVNHTLI 295
Query: 513 RDLDAEASRLEQFGAELE 530
RD DAE SRL++FG ELE
Sbjct: 296 RDYDAEVSRLDRFGEELE 313
>gi|302762569|ref|XP_002964706.1| hypothetical protein SELMODRAFT_83723 [Selaginella moellendorffii]
gi|300166939|gb|EFJ33544.1| hypothetical protein SELMODRAFT_83723 [Selaginella moellendorffii]
Length = 489
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 238/511 (46%), Positives = 319/511 (62%), Gaps = 38/511 (7%)
Query: 23 EEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQ 82
E GW+ LD H GP+ I +L + + G L +++VW+ GRSEW PLS+IP + +
Sbjct: 13 EAGWFYLDSGSNHRGPFKIDQLRDLYSAGDLTGSSMVWADGRSEWSPLSAIPD----VYE 68
Query: 83 QVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVREAEI 142
+VA G+ N ++ + DD+F KWQ E+R+AE
Sbjct: 69 KVAGGAKPGTENGSVDTKTTATTTSETTAED-------------DDDFRKWQEEIRKAEE 115
Query: 143 EAERLKNGSASDSVGGYMGRD--DHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQED 200
EA LK+G A + + R D D PEGE+ FTDDDGT Y WD+ + W+P+ +
Sbjct: 116 EAAMLKSGKAPRASNPFKKRKEADKDRPEGFPEGEERFTDDDGTIYNWDKRNKVWIPENE 175
Query: 201 TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHN 260
+ + Y E + E+ P A++ K+ EEK S ++
Sbjct: 176 DAPASANYAAAEENATQGEQTMP--------ASNHASKQTAPPPEEKEKSVEHQEAAMPT 227
Query: 261 GKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPE 320
GK KEA P+SWFELKVNT+VYVTG+P+D TV+E+VEVFSKCG+IKED E
Sbjct: 228 GK----------KEAQPEPESWFELKVNTNVYVTGIPEDATVDEVVEVFSKCGLIKEDLE 277
Query: 321 TKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ 380
TKKPR+K+YVDK TGM+KGD LVTYLKEPSV LA ++LDGTP RP G M+V+ A+F+Q
Sbjct: 278 TKKPRVKLYVDKATGMQKGDGLVTYLKEPSVELAIKILDGTPLRPGGTKLMTVSLAQFQQ 337
Query: 381 KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLR 440
KGE F+ KQ + KKKK+K++EEK LGW G DAK P +V+L+ MF P E+ + L
Sbjct: 338 KGEVFVKKQTNRNKKKKMKQLEEKALGWAG-SDAKKARPISVVLKNMFLPKEITDEPTLL 396
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
SELE DV EC KIG V+ V++ E HP+GV++V+FK+R++ CI+LM+GRWFGGRQI A
Sbjct: 397 SELETDVGIECSKIGTVEKVRIFERHPEGVIVVKFKNREEGLSCIKLMDGRWFGGRQIEA 456
Query: 501 SEDDGLVNHAAIRDLDAEASRLEQFGAELEA 531
+ED GLV++AA+ DL EA RLE+FGAELEA
Sbjct: 457 AEDAGLVDYAAVVDLAEEAERLEKFGAELEA 487
>gi|302756421|ref|XP_002961634.1| hypothetical protein SELMODRAFT_77296 [Selaginella moellendorffii]
gi|300170293|gb|EFJ36894.1| hypothetical protein SELMODRAFT_77296 [Selaginella moellendorffii]
Length = 489
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 237/511 (46%), Positives = 318/511 (62%), Gaps = 38/511 (7%)
Query: 23 EEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQ 82
E GW+ LD H GP+ I +L + + G L ++++W+ GRSEW PLS+IP + +
Sbjct: 13 EAGWFYLDSGSNHRGPFKIDQLRDLYSAGDLTGSSMIWADGRSEWSPLSAIPD----VYE 68
Query: 83 QVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVREAEI 142
+VA G+ N ++ + DD+F KWQ E+R+AE
Sbjct: 69 KVAGGTKPGTENGSVDTKTTATTTSETTAED-------------DDDFRKWQEEIRKAEE 115
Query: 143 EAERLKNGSASDSVGGYMGRD--DHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQED 200
EA LK+G A + + R D D PEGE+ FTDDDGT Y WD+ + W+P+ +
Sbjct: 116 EAAMLKSGKAPRASNPFKKRKEADKDRPEGFPEGEERFTDDDGTIYNWDKRNKVWIPENE 175
Query: 201 TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHN 260
+ + Y E + E+ P A++ K+ EEK S ++
Sbjct: 176 DAPASANYAAAEENATQGEQTMP--------ASNHASKQTAPPPEEKEKSVEHQEAAMPT 227
Query: 261 GKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPE 320
GK KEA P+SWFELKVNT+VYVTG+P+D TV+E+VEVFSKCG+IKED E
Sbjct: 228 GK----------KEAQPEPESWFELKVNTNVYVTGIPEDATVDEVVEVFSKCGLIKEDLE 277
Query: 321 TKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ 380
TKKPR+K+YVDK TGM+KGD LVTYLKEPSV LA ++LDGTP RP G M+V+ A+F+Q
Sbjct: 278 TKKPRVKLYVDKATGMQKGDGLVTYLKEPSVELAIKILDGTPLRPGGTKLMTVSLAQFQQ 337
Query: 381 KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLR 440
KGE F+ KQ + KKKK+K++EEK LGW G DAK P +V+L+ MF P E+ + L
Sbjct: 338 KGEVFVKKQTNRNKKKKMKQLEEKALGWAG-SDAKKARPISVVLKNMFLPKEITDEPTLL 396
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
SELE DV EC KIG V+ V++ E HP+GV++V+FK+R++ CI LM+GRWFGGRQI A
Sbjct: 397 SELETDVGIECSKIGTVEKVRIFERHPEGVIVVKFKNREEGLSCIRLMDGRWFGGRQIEA 456
Query: 501 SEDDGLVNHAAIRDLDAEASRLEQFGAELEA 531
+ED GLV++AA+ DL EA RLE+FGAELEA
Sbjct: 457 AEDAGLVDYAAVVDLAEEAERLEKFGAELEA 487
>gi|414588677|tpg|DAA39248.1| TPA: hypothetical protein ZEAMMB73_120405 [Zea mays]
Length = 448
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 235/424 (55%), Positives = 281/424 (66%), Gaps = 48/424 (11%)
Query: 11 QLSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPL 70
+ SGA A E GWY+L NQ+ VGPYA++EL EHF NGYL E+T++W++GR W L
Sbjct: 70 ETSGAAA--AATEVGWYVLGPNQESVGPYALAELREHFANGYLNESTMLWAEGRKAWMAL 127
Query: 71 SSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEF 130
SSIP S ++ + EQ P VD +F
Sbjct: 128 SSIPDLHSAVTAK--------------------------------DRFEQDAPD-VDGDF 154
Query: 131 EKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDR 190
EK+Q+EV EAE E + LK G + + D + P+GE EFTDDDGT YKWDR
Sbjct: 155 EKFQKEVTEAEAEVKALK--------GSDVDQQDDERPATPPDGESEFTDDDGTIYKWDR 206
Query: 191 GLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNS 250
LRAWVPQ D S +GY +EEMTF EEEVF ++ A +E+ ++ E N
Sbjct: 207 TLRAWVPQNDASGSKEGYAVEEMTFALEEEVFQAPDILGPSALEEI-----DTLSESKNK 261
Query: 251 ADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFS 310
+ E + + KRK +K EKKEANKPPDSWF+LKVNTHVY+TGLPDDVTVEE+VE FS
Sbjct: 262 GSDKAETRGDKKRKSSEKPSEKKEANKPPDSWFDLKVNTHVYITGLPDDVTVEEIVETFS 321
Query: 311 KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIP 370
KCGIIKEDPETKKPR+KIY DKETG KKGDALVTY KEPSVALA QLLDGT FR GK P
Sbjct: 322 KCGIIKEDPETKKPRVKIYTDKETGRKKGDALVTYFKEPSVALAVQLLDGTSFRSGGKTP 381
Query: 371 MSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTP 430
MSV+ AKFEQKG+ FI+K+ D +KK+K+KKVE+KMLGWGG DD KL IP TVILR MFTP
Sbjct: 382 MSVSPAKFEQKGDVFISKKSDKQKKRKIKKVEDKMLGWGGHDDKKLMIPTTVILRHMFTP 441
Query: 431 AEMR 434
AE+R
Sbjct: 442 AELR 445
>gi|168001531|ref|XP_001753468.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695347|gb|EDQ81691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 358
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 213/369 (57%), Positives = 265/369 (71%), Gaps = 20/369 (5%)
Query: 170 ASPEGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFP------ 223
++PE E EFTDDDGT Y+WD G AW+PQE+ + YG ++MTF++EEEVFP
Sbjct: 2 SAPENEQEFTDDDGTVYRWDHGRHAWIPQENIH-ELPAYGADDMTFVQEEEVFPELPAVI 60
Query: 224 -TVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPD-KQVEKKEANKPPDS 281
++ TDD DEV N V+ K GKRK D K+ KKEA K P++
Sbjct: 61 PSILQTDDDVEDEVA-----------NIPAKTVDLKTGGKRKVDDTKEAVKKEAQKEPEA 109
Query: 282 WFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDA 341
WF+LKVNTHVY+ GLP+D T++E+VEVFSKCG+IKEDP+T+KPRIK+YVDK TG +KGD
Sbjct: 110 WFDLKVNTHVYIDGLPEDATMDEVVEVFSKCGVIKEDPDTRKPRIKLYVDKATGKQKGDG 169
Query: 342 LVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKV 401
LVTYLKEPSV LA +LDGT RP MSVT+AKFEQKGE F+ KQ + +KKKKLK+
Sbjct: 170 LVTYLKEPSVDLALSILDGTSLRPGVGPIMSVTRAKFEQKGEVFMKKQQNKQKKKKLKQQ 229
Query: 402 EEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
E+K LGWGG DD K P TVIL+ MFT +E+ AD +L E+E DV EC KIGP++ ++
Sbjct: 230 EQKALGWGGFDDKKKLEPMTVILKHMFTRSELLADPSLLPEVEEDVMTECTKIGPIERLR 289
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASR 521
V ENHP+GVV+V+FKD+ KCIE+MNGRWFGG+QI A ED G VN+A +RD+ E SR
Sbjct: 290 VYENHPEGVVMVKFKDKTAGLKCIEIMNGRWFGGKQIEAFEDPGTVNYALVRDVAEEESR 349
Query: 522 LEQFGAELE 530
LEQFG ELE
Sbjct: 350 LEQFGRELE 358
>gi|359496942|ref|XP_003635379.1| PREDICTED: HIV Tat-specific factor 1 homolog [Vitis vinifera]
Length = 408
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 194/269 (72%), Positives = 222/269 (82%), Gaps = 25/269 (9%)
Query: 262 KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPET 321
+RK P++Q EKKEANKPPDSWF+LKVNTHVYVTGLPDDVTV+E+VEVFSKCG+IKED +
Sbjct: 162 QRKLPEQQAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVDEVVEVFSKCGLIKEDSQG 221
Query: 322 KKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK 381
K+ RI+IY G A GTP RP G IPMSVT AKFEQK
Sbjct: 222 KQYRIQIY---------GCAYX----------------GTPLRPVGTIPMSVTLAKFEQK 256
Query: 382 GERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRS 441
GE+F+AKQ+D +KKKKLK+VE+K+LGWGG DDAKL+IPATV+LR+MFTPAEMRAD NLRS
Sbjct: 257 GEKFVAKQIDKRKKKKLKRVEDKILGWGGHDDAKLSIPATVVLRYMFTPAEMRADPNLRS 316
Query: 442 ELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
ELE DVQEEC+K+G VD VKVCE+HPQGVVLV++KDR+DAQKCIELMNGRWFGGRQIHAS
Sbjct: 317 ELEGDVQEECIKLGSVDLVKVCESHPQGVVLVKYKDRRDAQKCIELMNGRWFGGRQIHAS 376
Query: 502 EDDGLVNHAAIRDLDAEASRLEQFGAELE 530
EDDG VNHA +RDLDA+A RLE FGAELE
Sbjct: 377 EDDGSVNHALVRDLDADAERLEAFGAELE 405
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 104/196 (53%), Positives = 120/196 (61%), Gaps = 41/196 (20%)
Query: 9 QQQLSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQ 68
QQQ +G N + E GWYIL ENQQ++GPYA SEL EHFLNGYL E +L+WS+GRS+WQ
Sbjct: 14 QQQATGVDNCD---EVGWYILGENQQNLGPYAFSELREHFLNGYLSENSLLWSEGRSDWQ 70
Query: 69 PLSSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDD 128
PLSSIP+ + ISQ G P ND +D
Sbjct: 71 PLSSIPELTTAISQP---GVDWPPIND-------------------------------ED 96
Query: 129 EFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKW 188
EFEKWQ+EVREAE LKNGSAS SVGG G +D++ P+GEDEFTDDDGT YKW
Sbjct: 97 EFEKWQKEVREAEA----LKNGSASGSVGGDFGDEDNERPSTPPDGEDEFTDDDGTTYKW 152
Query: 189 DRGLRAWVPQEDTSSQ 204
DRGLRAWVPQ Q
Sbjct: 153 DRGLRAWVPQRKLPEQ 168
>gi|414886705|tpg|DAA62719.1| TPA: hypothetical protein ZEAMMB73_659569 [Zea mays]
Length = 492
Score = 347 bits (891), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 174/223 (78%), Positives = 195/223 (87%)
Query: 270 VEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
+ +KEANKPPDSWF+LKVNTHVYVTGLPDDVTVEE+VEVFSKC IIKEDPETKKPR+KIY
Sbjct: 65 LNQKEANKPPDSWFDLKVNTHVYVTGLPDDVTVEEIVEVFSKCEIIKEDPETKKPRVKIY 124
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
D+ETG KKGDALVTY KEPSVALA LLDGTPF P GK MSV+ AKFEQKG+ F++K+
Sbjct: 125 TDRETGRKKGDALVTYFKEPSVALAVLLLDGTPFCPGGKTHMSVSPAKFEQKGDVFVSKK 184
Query: 390 VDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQE 449
D +KK+K+KKVE+KMLGWGG DD KL IPATVILR MFTP E+RADE L ELEADV+E
Sbjct: 185 TDKQKKRKIKKVEDKMLGWGGHDDKKLMIPATVILRHMFTPVELRADEELLFELEADVRE 244
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRW 492
EC+K G VD+VKVCENHPQGV+LVRFKDRKD KCIE M+GRW
Sbjct: 245 ECIKFGLVDNVKVCENHPQGVILVRFKDRKDGAKCIEKMSGRW 287
>gi|384251973|gb|EIE25450.1| hypothetical protein COCSUDRAFT_65250 [Coccomyxa subellipsoidea
C-169]
Length = 542
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 215/563 (38%), Positives = 290/563 (51%), Gaps = 80/563 (14%)
Query: 20 TAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSG 79
T GWY L + Q+VGPY E GYL E TL+W+ GR+ W P +P+
Sbjct: 6 TGNGAGWYYLSADNQNVGPYDTKTFTELAAGGYLTEGTLIWATGRANWLPAKDVPEIW-- 63
Query: 80 ISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQG-----VPSH--------- 125
T P + E RE A+ + + S+ EQ PS
Sbjct: 64 ---------TVAPPSSSAEVAREGEGRASAAEGATTSAPEQANLLPDRPSQRSTVVKAAK 114
Query: 126 --------VDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDE 177
VD E +Q AE+ A + A + ++ P +
Sbjct: 115 AVKAIVAPVDRELAAFQ-----AEMSALGATDAPAPEDAA----LNEPSRAETPPPEDRR 165
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDG-------YGIEEMTFLKEEEVFPTV----- 225
F DDDGT + WD LR +V +E Y +EEMTF ++E P
Sbjct: 166 FEDDDGTWFVWDSTLRRFVEEEGAGQGAGTAAQQLPDYQVEEMTFEMDDEKIPEYKPPKS 225
Query: 226 -------NVTDDLANDEVG--------KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQV 270
+ D AND G KE + ++ + A++V+E KH K+K+ +
Sbjct: 226 DDEDEPPGTSQDDANDGAGTSNGQQAKKEGQSGAGQEKSKAESVME-KHLEKQKRARESA 284
Query: 271 EKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYV 330
E + A WF+LK NT VY+TGLPDD T E+ +VF+KCGI+KED + KPRIKIY
Sbjct: 285 EAQAA------WFDLKKNTSVYITGLPDDATEAEIAQVFTKCGILKED-DDGKPRIKIYR 337
Query: 331 DKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQV 390
DK TGM KGD LVTYLKEPSV LA +LDG P R D ++V AKFE G++++ K+
Sbjct: 338 DKATGMIKGDGLVTYLKEPSVDLACNILDGAPLRDDISKTLTVQPAKFEMHGDQYVPKKK 397
Query: 391 DSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEE 450
+K KKK EKMLGWGG DD K TVIL+ MF P E+ + +L+ +LE DV+ E
Sbjct: 398 QNKNKKKKVNKVEKMLGWGGFDDLKNPTLVTVILKQMFHPNELIEEPSLKDDLETDVKTE 457
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHA 510
C K+G VD V+V + HP+GVV V+FKD AQ CI+LM GR+FGGRQ+ A DG+ N+
Sbjct: 458 CTKLGAVDKVRVFQFHPEGVVSVKFKDEDAAQGCIKLMEGRFFGGRQVSAQLWDGIANYN 517
Query: 511 AIRDLDA---EASRLEQFGAELE 530
+ + EA+RLE+F AE+E
Sbjct: 518 VKKPQETAEEEAARLERFAAEIE 540
>gi|29367423|gb|AAO72584.1| putative RNA-binding domain protein [Oryza sativa Japonica Group]
Length = 286
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 168/326 (51%), Positives = 215/326 (65%), Gaps = 42/326 (12%)
Query: 19 ETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLS 78
E GE GWY+L NQ+HVGPYA+SEL EHF NGY+ E++++W++GRSEW PLSSIP L+
Sbjct: 2 EGGGEVGWYVLGPNQEHVGPYALSELREHFANGYISESSMLWAEGRSEWMPLSSIPDLLA 61
Query: 79 GISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVR 138
+++ ++Q +E ++D+F+K+Q+EV
Sbjct: 62 VVTK------------------KDQPDEG------------------IEDDFDKFQKEVI 85
Query: 139 EAEIEAERLKNGSASDSVGGYMGRDDHDGVPASP-EGEDEFTDDDGTRYKWDRGLRAWVP 197
EAE E E + +A + + G DD D PA+P +GEDEFTDDDGT YKWDR LRAWVP
Sbjct: 86 EAEAEVEASTDKAADNDINQEHGADDPDDRPATPPDGEDEFTDDDGTVYKWDRVLRAWVP 145
Query: 198 QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEE 257
Q+D +ND Y +E+MTF EEEVF D+A +E S E ++ V E+
Sbjct: 146 QDDLEGKNDNYEVEDMTFAHEEEVF----QAPDIAGSTTLEENNVSAEIEIKEPPKV-EK 200
Query: 258 KHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKE 317
+ + KRK +K +KKEA KPPDSW +LKVNTHVYVTGLPDDVT EE+VEVFSKCGIIKE
Sbjct: 201 RAHKKRKSSEKPADKKEAYKPPDSWVDLKVNTHVYVTGLPDDVTAEEIVEVFSKCGIIKE 260
Query: 318 DPETKKPRIKIYVDKETGMKKGDALV 343
DPET+KPR+KIY D+ETG KKGDALV
Sbjct: 261 DPETRKPRVKIYTDRETGRKKGDALV 286
>gi|414873030|tpg|DAA51587.1| TPA: hypothetical protein ZEAMMB73_095588 [Zea mays]
Length = 533
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/218 (68%), Positives = 163/218 (74%), Gaps = 32/218 (14%)
Query: 274 EANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
EANKPPDSWF+LKVNTHVYVTGLP+DVTVEE+VEVFSKCGIIKEDPETKKPR+KIY D+E
Sbjct: 217 EANKPPDSWFDLKVNTHVYVTGLPNDVTVEEIVEVFSKCGIIKEDPETKKPRVKIYTDRE 276
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSK 393
TG KKGDALVTY KEPSVALA QLLDGTPFRP GK MSV+ AKFEQK
Sbjct: 277 TGRKKGDALVTYFKEPSVALAVQLLDGTPFRPGGKTHMSVSPAKFEQK------------ 324
Query: 394 KKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVK 453
GG DD KL IPATVILR MFTPAE+RADE L SELEADV+EEC+K
Sbjct: 325 ---------------GGHDDKKLMIPATVILRHMFTPAELRADEELLSELEADVREECIK 369
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDR-----KDAQKCIE 486
G VD+VKVCEN+PQGV+L K R + KCIE
Sbjct: 370 FGLVDNVKVCENNPQGVILDDPKSRVLLALRAVIKCIE 407
>gi|413936600|gb|AFW71151.1| hypothetical protein ZEAMMB73_874757 [Zea mays]
Length = 544
Score = 290 bits (742), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 158/241 (65%), Positives = 178/241 (73%), Gaps = 31/241 (12%)
Query: 274 EANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
EANKPPDSWF+LKVNTHVYVTGLPDDVTVEE+VEVFSK GIIKEDPETKKPR+KIY D+E
Sbjct: 165 EANKPPDSWFDLKVNTHVYVTGLPDDVTVEEIVEVFSKRGIIKEDPETKKPRVKIYTDRE 224
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSK 393
TG KKGDALVTY KEPSVALA QLLDGTPFRP GK MSV+ AKFEQKG+ F++K+ D +
Sbjct: 225 TGRKKGDALVTYFKEPSVALAVQLLDGTPFRPGGKTHMSVSPAKFEQKGDVFVSKKTDKQ 284
Query: 394 KKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRAD----------------- 436
KK+K+KKVE+KMLGWGG DD KL IPATVILR MFTPAE+R
Sbjct: 285 KKRKIKKVEDKMLGWGGHDDKKLMIPATVILRHMFTPAELRVAHMRFMLHSSFIIVLTGF 344
Query: 437 ---------ENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL--VRFKDRKDAQKCI 485
++ L +EEC+K G VD+VKVCENHPQGV+L RF K KCI
Sbjct: 345 SPIISRRLMKSFFLSLRLIFREECIKFGLVDNVKVCENHPQGVILGAARF---KGLIKCI 401
Query: 486 E 486
E
Sbjct: 402 E 402
>gi|212721492|ref|NP_001131413.1| uncharacterized protein LOC100192742 [Zea mays]
gi|194691454|gb|ACF79811.1| unknown [Zea mays]
Length = 220
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 150/223 (67%), Positives = 173/223 (77%), Gaps = 5/223 (2%)
Query: 213 MTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEK 272
MTF EEEVF ++ A +E+ ++ E N + E + + KRK +K EK
Sbjct: 1 MTFALEEEVFQAPDILGPSALEEI-----DTLSESKNKGSDKAETRGDKKRKSSEKPSEK 55
Query: 273 KEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK 332
KEANKPPDSWF+LKVNTHVY+TGLPDDVTVEE+VE FSKCGIIKEDPETKKPR+KIY DK
Sbjct: 56 KEANKPPDSWFDLKVNTHVYITGLPDDVTVEEIVETFSKCGIIKEDPETKKPRVKIYTDK 115
Query: 333 ETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS 392
ETG KKGDALVTY KEPSVALA QLLDGT FR GK PMSV+ AKFEQKG+ FI+K+ D
Sbjct: 116 ETGRKKGDALVTYFKEPSVALAVQLLDGTSFRSGGKTPMSVSPAKFEQKGDVFISKKSDK 175
Query: 393 KKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRA 435
+KK+K+KKVE+KMLGWGG DD KL IP TVILR MFTPAE+R
Sbjct: 176 QKKRKIKKVEDKMLGWGGHDDKKLMIPTTVILRHMFTPAELRV 218
>gi|388517133|gb|AFK46628.1| unknown [Lotus japonicus]
Length = 162
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 136/162 (83%), Positives = 152/162 (93%)
Query: 371 MSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTP 430
MSVT AKFEQKG++F++KQVD+KKKKKLKKVE+KMLGWGGRDDAKL+IPATVILR+MF+P
Sbjct: 1 MSVTPAKFEQKGDKFVSKQVDNKKKKKLKKVEDKMLGWGGRDDAKLSIPATVILRYMFSP 60
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
AEMRADENLR ELE DV+EEC K+GP+DSVKVCENHPQGVVLVRFKDRKDAQ CI+ MNG
Sbjct: 61 AEMRADENLRLELEEDVKEECTKLGPIDSVKVCENHPQGVVLVRFKDRKDAQTCIDTMNG 120
Query: 491 RWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
RWFGGRQIHASEDDGLVNHA +RDLD +A RLEQFGAELE +
Sbjct: 121 RWFGGRQIHASEDDGLVNHALVRDLDEDAKRLEQFGAELEGE 162
>gi|40850584|gb|AAR96015.1| putative RNA-binding protein [Musa acuminata]
Length = 308
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 156/314 (49%), Positives = 189/314 (60%), Gaps = 54/314 (17%)
Query: 13 SGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSS 72
S +Y A E GWYIL ENQ+HVGPYA+SEL EHF NGY+ E TL+W +GRSEW PLSS
Sbjct: 4 SSDASYAVATEVGWYILGENQEHVGPYALSELQEHFANGYISENTLLWYEGRSEWAPLSS 63
Query: 73 IPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEK 132
IP+ +G + GS A +P DD+F K
Sbjct: 64 IPELYAGTPIKKLDGSAA------------------------------DLPDE-DDDFAK 92
Query: 133 WQREVREAEIEAERLKNGSASDSVGGYMGR---------DDHDGVPASPEGEDEFTDDDG 183
WQ+EV+EAE AE LKNG+ + + DD P P+GE+EFTDDDG
Sbjct: 93 WQKEVQEAEAAAEALKNGTTTSVLAEGGEGLQEGEGLGADDQPTTP--PDGEEEFTDDDG 150
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
+ YKWDR LR WVPQ++ S+ GYG+E+MTF +EEEV+ V + +
Sbjct: 151 STYKWDRALRKWVPQDNLLSEKSGYGVEDMTFAQEEEVYIAPEVPEP-----------SV 199
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVE 303
TE K +S N +E K KRK PDK EKKEANKPP+SWF+LKVNTHVYVTGLPDDVTV
Sbjct: 200 TEAKDSSTKN-MESKPETKRKLPDKPTEKKEANKPPESWFDLKVNTHVYVTGLPDDVTVA 258
Query: 304 EMVEVFSKCGIIKE 317
E+VE FSKCGIIKE
Sbjct: 259 EIVEAFSKCGIIKE 272
>gi|303283314|ref|XP_003060948.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457299|gb|EEH54598.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 299
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 136/261 (52%), Positives = 171/261 (65%), Gaps = 6/261 (2%)
Query: 276 NKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
N + WF LK NT VYV GLPDD T +E+ EVF KCG+IK DP+T P+IK+Y DK TG
Sbjct: 23 NVDENGWFSLKTNTSVYVDGLPDDATADEVKEVFQKCGVIKLDPDTSLPKIKLYADKATG 82
Query: 336 MKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS-KK 394
KGD LVTYLKEPSVALAT +LDGTPFR MSVT AKF+ KG+ ++ KQ S K
Sbjct: 83 AFKGDGLVTYLKEPSVALATTILDGTPFRVGMGTNMSVTAAKFQMKGDAYVKKQRGSKKA 142
Query: 395 KKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
K+ +E LGWGG DD K T IL+ MFT EM +D R EL+ DV+ E K
Sbjct: 143 KRAAIAKQESALGWGGFDDTKDRKKTTAILKRMFTLDEMFSDPKFREELQEDVEAEAAKF 202
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH--AAI 512
G V++VKV +P G V VRFKD A+ C++ M GRWFGG ++ A+ DG+ N+ A +
Sbjct: 203 GAVETVKVFTTNPDGAVSVRFKDPAAAEACVKAMRGRWFGGSRLEAALWDGVTNYATAGM 262
Query: 513 RDLDAEA---SRLEQFGAELE 530
R ++EA +RL++FGA LE
Sbjct: 263 RADESEAEMQARLDRFGASLE 283
>gi|307111213|gb|EFN59448.1| hypothetical protein CHLNCDRAFT_138015 [Chlorella variabilis]
Length = 567
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 135/261 (51%), Positives = 178/261 (68%), Gaps = 4/261 (1%)
Query: 272 KKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVD 331
KK P WF+LK+NT+VYVTGLP+DV+ E+VEVFSKCG+IKED E K PRIKIY D
Sbjct: 295 KKAKASQPQGWFDLKINTNVYVTGLPEDVSEAEIVEVFSKCGVIKEDLEGK-PRIKIYRD 353
Query: 332 KETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD 391
+E+G KGD L+TYLKEPSV LA Q+LDGTP R MSV++A+FEQKGE F+A+ +
Sbjct: 354 RESGRPKGDGLITYLKEPSVDLAVQILDGTPLRYGLPQVMSVSKAQFEQKGEAFVARASN 413
Query: 392 SKKKKKLKKVE-EKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEE 450
K KK + + LGWGG DD TVIL+ MF P E+ L+ ELE D++ E
Sbjct: 414 KKAAKKKLEKLERRALGWGGFDDTLKPQQVTVILKHMFEPGELVESPALKDELETDIRSE 473
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN-H 509
C K+G VD ++V HPQGVV V+F + A +C+ +MNGR+FGGRQ+ A++ DG N +
Sbjct: 474 CGKLGKVDKLRVFAAHPQGVVSVKFTTLEAADECVRVMNGRFFGGRQLEAAKWDGFTNFN 533
Query: 510 AAIRDLDAE-ASRLEQFGAEL 529
+++ + E +RLE+F EL
Sbjct: 534 VKVQESEEEQQARLERFAREL 554
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 10/65 (15%)
Query: 169 PASPEG-EDEFTDDDGTRYKWDRGLRAWVP---------QEDTSSQNDGYGIEEMTFLKE 218
PASP E +F DDDGT Y W GLR ++P ++ Y ++M F+ E
Sbjct: 139 PASPPADERQFEDDDGTVYVWHSGLRKFMPLGEGQAGGAAGPAAAPAAPYNEQDMVFVPE 198
Query: 219 EEVFP 223
EE P
Sbjct: 199 EEQLP 203
>gi|255078502|ref|XP_002502831.1| predicted protein [Micromonas sp. RCC299]
gi|226518097|gb|ACO64089.1| predicted protein [Micromonas sp. RCC299]
Length = 244
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 138/244 (56%), Positives = 161/244 (65%), Gaps = 6/244 (2%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVTGLPDD V+E+ EVFSKCG+IK D +T PRIK+Y KETG KGD LVTYLK
Sbjct: 1 NTSVYVTGLPDDAEVDEVKEVFSKCGVIKLDADTAAPRIKLYRHKETGELKGDGLVTYLK 60
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKV--EEKM 405
EPSV LA +LDG PFR M+VT AKFE KG+ F+AK+ KK+K + +E
Sbjct: 61 EPSVQLACTILDGAPFRHTMGTNMTVTPAKFEMKGD-FVAKKRSGGKKRKAAVIAKQEAE 119
Query: 406 LGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
LGWGG DD K TVI++ MFT EM +D N R ELE DV+ EC K G VD VKV
Sbjct: 120 LGWGGFDDTKDRKKTTVIIKHMFTLDEMFSDPNFRVELEEDVEAECGKFGAVDKVKVFTT 179
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAI---RDLDAEASRL 522
+P+GVV VRFKD DAQKC+ M GRWFGGRQ+ AS DG N A + EA RL
Sbjct: 180 NPEGVVSVRFKDGGDAQKCVTAMKGRWFGGRQLEASLWDGFTNFAKAGLESTEEDEAKRL 239
Query: 523 EQFG 526
+ FG
Sbjct: 240 KAFG 243
>gi|302839783|ref|XP_002951448.1| hypothetical protein VOLCADRAFT_91952 [Volvox carteri f.
nagariensis]
gi|300263423|gb|EFJ47624.1| hypothetical protein VOLCADRAFT_91952 [Volvox carteri f.
nagariensis]
Length = 630
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/267 (45%), Positives = 164/267 (61%), Gaps = 28/267 (10%)
Query: 281 SWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYV----DKETGM 336
+WF+LK+NT+VYVTGLP DVTV+E+ EVFSKCGI+K D E PRIK+Y DK TG+
Sbjct: 334 NWFDLKINTNVYVTGLPLDVTVQEINEVFSKCGIVKVD-EKGHPRIKLYKWVNGDKATGL 392
Query: 337 KKGDALVTYLKEPSVALATQLLDGTPFR-----------PDGKIPMSVTQAKFEQKGERF 385
KGDALV+YLKEPSV LA + L + FR P M+V AKFE KGE +
Sbjct: 393 LKGDALVSYLKEPSVELACRFLHQSQFRSWRVSLELCVRPGSGPLMTVEPAKFEMKGETY 452
Query: 386 IAKQVDSKKKK-KLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELE 444
K + K+KK +L ++E++ LGWGG DD T +L MF D+ L + L
Sbjct: 453 KPKVSNKKEKKKQLAQLEQRALGWGGFDDKAPPEKTTAVLSNMFA-----TDDFLENMLL 507
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
A C K+G ++ V++ +++PQG+V VRF+ + A +C+ LMNGR+FGGRQ+HA D
Sbjct: 508 AG----CTKLGTIEKVRIFKHNPQGIVTVRFRTPEAAHQCVALMNGRYFGGRQLHAFMWD 563
Query: 505 GLVNHAA--IRDLDAEASRLEQFGAEL 529
G N+ + E +RLE F EL
Sbjct: 564 GFTNYNVKPKETPEEEQARLEAFAREL 590
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 82/184 (44%), Gaps = 39/184 (21%)
Query: 50 NGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQQVARGSTAVPCNDGIEEVREQIEEAAG 109
+GYL + VW +GR+EW+ L D ++ + + +E G
Sbjct: 58 DGYLTGDSHVWCEGRTEWKKLRE---------------------TDDLKYLTDA-DEGDG 95
Query: 110 MQSQSFSSAEQGVPSHVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHD-GV 168
++ + G E E E+ A + AE+ +++ GG G
Sbjct: 96 RPKKARRAMAPGT------ELEALDAEL--AAMRAEKAAKVGVANADGGDEGAGSRPVDA 147
Query: 169 PASPEGEDEFTDDDGTRYKWDRGLRAWVPQED-------TSSQNDGYGIEEMTFLKEEEV 221
P +PE E EF DDDGT Y WD+ LR ++P+E + ++ GY I+ MT+ EEEV
Sbjct: 148 PDTPE-ELEFEDDDGTVYVWDKRLRKYMPKEGHLPDKAAPAPEDIGYDIDAMTYQPEEEV 206
Query: 222 FPTV 225
P++
Sbjct: 207 IPSL 210
>gi|325189076|emb|CCA23603.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325189678|emb|CCA24162.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 526
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 163/512 (31%), Positives = 249/512 (48%), Gaps = 76/512 (14%)
Query: 26 WYILD-ENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSGISQQV 84
W LD ++ GP S + + F G L LVW G SEW+PL S+ F
Sbjct: 18 WVYLDAQSGAQRGPLTDSMMKKLFRKGILKTDQLVWKPGLSEWKPLHSVEPFDIQCVTWG 77
Query: 85 ARGSTAVPCN--------------------DGIEEVREQIEEAAGMQSQSFSSAEQGVPS 124
R P N DG+ V EA+ + S +Q V +
Sbjct: 78 LRWFFLTPDNVEQGPVTTESLLELFMDGKVDGLTVVWTDKSEASRLPLCEVQSLKQ-VIN 136
Query: 125 HVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGT 184
+++ E+ + R+ E SASD V + F +DG
Sbjct: 137 EMNEALEQQEEFTRQQE-------TFSASDQV-------------FKESAGEAFVAEDGK 176
Query: 185 RYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS- 243
Y +D+ + ++ +D EE+ L +E+ +V DD K +NS
Sbjct: 177 EYVFDQETKQFITPQDKIE-------EELNAL--QEIVASVPERDD-------KRSINSQ 220
Query: 244 -------TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGL 296
+EEK ++ + E R+ K+ +K +AN W K T +YV GL
Sbjct: 221 PLLPDAESEEKDDTKRTISENGEEAIRELKRKRKKKSKANI---RWRRSKQKTWIYVNGL 277
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D T+EE+V+ FSKCG+I++D ++PRIK+Y K+ G GDA + Y+KE SV LATQ
Sbjct: 278 PLDTTIEEVVDYFSKCGVIQKDLLNEQPRIKLYQTKDEGGLNGDAAICYMKEASVELATQ 337
Query: 357 LLDGTPFRPDGKIPMSVTQAKFEQKGERFIA-KQVDSKKKKKLKKVE-EKMLGWG-GRDD 413
LLD + FRP+ +I V+ A F+QKG+ F+A K+V + K+K++E EK L W G DD
Sbjct: 338 LLDQSEFRPEWRI--DVSPAVFQQKGDEFVAKKKVKLDARTKVKRLEQEKALSWNEGEDD 395
Query: 414 AKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLV 473
K + V+L+ +F P E++ DE E+ D+++E VK+G ++ + + +H GVV+V
Sbjct: 396 EKYGL-RIVVLKHLFCPQEIK-DETYEKEILEDIEQELVKLGEINKITLFGHHQDGVVIV 453
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+F A KC+E+M+GR+F GR+I DG
Sbjct: 454 KFASSGSAAKCLEVMDGRFFAGRKIDCQYWDG 485
>gi|145341994|ref|XP_001416083.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576307|gb|ABO94375.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 365
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/264 (43%), Positives = 145/264 (54%), Gaps = 23/264 (8%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+T YVTGLP D T EE+ E F KCG++K D +T + R+K+Y D + G KGD LV +LK
Sbjct: 76 STGAYVTGLPSDATEEELGEAFKKCGVVKLDAKTGRARVKVYRDAD-GKVKGDGLVVFLK 134
Query: 348 EPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQK--------GERFIAK---------Q 389
PSV LA LLD T R D M+VT AKFE K G + AK +
Sbjct: 135 APSVDLAIALLDQTELRLGDATTRMTVTAAKFEAKARGDDEGGGAKVAAKASGGGARMTK 194
Query: 390 VDSKKKKK-LKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQ 448
D K+ LK+ E + LGW G DD V+LR MFT EM AD NLR ELE DV
Sbjct: 195 ADRKRAAALLKRQEAEALGWAGFDDDVDAKKLIVVLRRMFTLEEMYADANLRKELEEDVM 254
Query: 449 EECVK-IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV 507
EE + GPV SVK G + +RFK + + C++ NGRWF GRQI AS DG
Sbjct: 255 EEAQRTCGPVMSVKTYTTSQDGTMTIRFKSLEAVEACVKAWNGRWFDGRQIEASMWDGKS 314
Query: 508 NHAAIRDLD--AEASRLEQFGAEL 529
+ RD A+ +RL+ + AEL
Sbjct: 315 KFVSQRDESEAAQRARLDAYAAEL 338
>gi|10798630|emb|CAC12815.1| hypothetical protein [Nicotiana tabacum]
Length = 242
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 110/273 (40%), Positives = 150/273 (54%), Gaps = 55/273 (20%)
Query: 8 SQQQLSGATN---YETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGR 64
SQQ G N Y T+G GWY+L ++QQ +GPY I EL EH+ GYLLE TLVWS+GR
Sbjct: 12 SQQYNIGEGNSSLYTTSGL-GWYVLAQDQQQLGPYTIPELREHYSAGYLLENTLVWSEGR 70
Query: 65 SEWQPLSSIPQFLSGISQQVARGSTAVP--CNDGIEEVREQIEEAAGMQSQSFSSAEQGV 122
S+WQPL SIP L+ + +Q G+ +VP ND +
Sbjct: 71 SDWQPLCSIPGLLTDVPEQSIDGTNSVPLTSNDPL------------------------- 105
Query: 123 PSHVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDD 182
DE+EK+Q+EV+EAE E D+ P+GE+EFTDDD
Sbjct: 106 -----DEYEKFQKEVKEAEDE----------------QAVDEEQRPSTPPDGEEEFTDDD 144
Query: 183 GTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLN 242
GTRYKWD+ L+ WVPQED + +N+ YG+E+MT+ +EEE+FPT+ D N+ + +
Sbjct: 145 GTRYKWDKALKVWVPQEDPTEKNE-YGLEDMTYAEEEELFPTLPADDSSGNENKNMDNIE 203
Query: 243 STEEKVNSADNVVEEKHNGKRKQPDK-QVEKKE 274
+ + +SA+ NGKRK P+ EKKE
Sbjct: 204 ADKTARDSAETTA-ATENGKRKLPETDDSEKKE 235
>gi|59862144|gb|AAH90381.1| htatsf1-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 524
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 185/351 (52%), Gaps = 34/351 (9%)
Query: 173 EGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVN 226
+G+D +T DGT+Y+WD +AW P+ ED + + YG++
Sbjct: 100 DGQDPYTYRDPSDGTQYEWDIEKKAWFPKITEDFLAMYHANYGVD-------------AG 146
Query: 227 VTDDLANDEVGKEKLNSTEEKVNSADNVVEE-KHNGKRKQPDKQVEKKEANKPPDSWF-- 283
TD+ N + K K S + + EE K ++ +P + EK++A+ WF
Sbjct: 147 STDNTVNTDAAKAKETSVSSEAQAIKKPKEEQKPQTEQGEPKAKGEKRKADP---GWFNV 203
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
E + NT+VYVTGLP D+T EE E+ SKCGII DP+++ +IK+Y D+E G KGD L
Sbjct: 204 EAQRNTNVYVTGLPPDITNEEFAEIMSKCGIIMRDPQSEDYKIKLYKDRE-GNLKGDGLC 262
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVE 402
YLK SV LA +LLD R + V AKF+ KGE K+ K K K ++
Sbjct: 263 CYLKRESVDLALRLLDEYEIR---GYKLHVENAKFQLKGEYDATKKKKKCKDYKKKISIQ 319
Query: 403 EKMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
+K L W D +++ VI++ MF P + D + +E+ D++ EC K G V
Sbjct: 320 QKQLDWRPEKKANDTSRMRFERVVIIKNMFHPKDFEEDPLVLNEIREDLRSECEKFGQVK 379
Query: 459 SVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + + HP GV V FK+ ++ CI+ +NGRWFGGRQ+ DG+ ++
Sbjct: 380 KLLIFDQHPDGVASVAFKEAEEGDMCIQALNGRWFGGRQLIVESWDGVTDY 430
>gi|301606045|ref|XP_002932639.1| PREDICTED: HIV Tat-specific factor 1 homolog [Xenopus (Silurana)
tropicalis]
Length = 452
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 185/351 (52%), Gaps = 34/351 (9%)
Query: 173 EGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVN 226
+G+D +T DGT+Y+WD +AW P+ ED + + YG++
Sbjct: 28 DGQDPYTYRDPSDGTQYEWDIEKKAWFPKITEDFLAMYHANYGVD-------------AG 74
Query: 227 VTDDLANDEVGKEKLNSTEEKVNSADNVVEE-KHNGKRKQPDKQVEKKEANKPPDSWF-- 283
TD+ N + K K S + + EE K ++ +P + EK++A+ WF
Sbjct: 75 STDNTVNTDAAKAKETSVSSEAQAIKKPKEEQKPQTEQGEPKAKGEKRKADP---GWFNV 131
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
E + NT+VYVTGLP D+T EE E+ SKCGII DP+++ +IK+Y D+E G KGD L
Sbjct: 132 EAQRNTNVYVTGLPPDITNEEFAEIMSKCGIIMRDPQSEDYKIKLYKDRE-GNLKGDGLC 190
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVE 402
YLK SV LA +LLD R + V AKF+ KGE K+ K K K ++
Sbjct: 191 CYLKRESVDLALRLLDEYEIR---GYKLHVENAKFQLKGEYDATKKKKKCKDYKKKISIQ 247
Query: 403 EKMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
+K L W D +++ VI++ MF P + D + +E+ D++ EC K G V
Sbjct: 248 QKQLDWRPEKKANDTSRMRFERVVIIKNMFHPKDFEEDPLVLNEIREDLRSECEKFGQVK 307
Query: 459 SVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + + HP GV V FK+ ++ CI+ +NGRWFGGRQ+ DG+ ++
Sbjct: 308 KLLIFDQHPDGVASVAFKEAEEGDMCIQALNGRWFGGRQLIVESWDGVTDY 358
>gi|395546036|ref|XP_003774900.1| PREDICTED: HIV Tat-specific factor 1 [Sarcophilus harrisii]
Length = 444
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 187/355 (52%), Gaps = 42/355 (11%)
Query: 169 PASPEGEDEFT---DDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTV 225
P PEG D FT DGT Y+WD +AW P+ E+ T
Sbjct: 27 PRMPEGSDPFTYVDQADGTTYEWDLEKKAWFPK------------------ISEDFLATY 68
Query: 226 NVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGK--RKQPDKQVEKK---EANKPPD 280
++D +S +K AD V EK GK ++ + +EKK E KP
Sbjct: 69 QANYGFSDDSPS----SSHTKKETKAD--VSEKAKGKPTEQKSIEDIEKKQKGEKRKPEP 122
Query: 281 SWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
WF+++ NT+VYVTGLP D+T EE V++ SKCGII +DP+T++ +IK+Y DK+ G K
Sbjct: 123 GWFQVEEDRNTNVYVTGLPPDITKEEFVQLMSKCGIIMKDPQTEEYKIKLYKDKQ-GNLK 181
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL 398
GD L YLK+ SV LA +LLD R + V +AKF+ KGE +K+ K K
Sbjct: 182 GDGLCCYLKKESVPLALKLLDDDQIR---GYRLHVEKAKFQLKGEYDASKKKKKCKDYKK 238
Query: 399 K-KVEEKMLGWGGRDD---AKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
K +++K L W A++ VI++ MF P + D + +E+ D++ EC K
Sbjct: 239 KLSLQQKQLDWWPEKKTGTARMRHERVVIIKNMFHPKDFEEDPLVLNEIREDLRTECEKF 298
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
G V V V + HP GV V +K+ ++A CI+ +N RWFGGRQ++ DG+ ++
Sbjct: 299 GQVKKVLVFDRHPDGVASVSYKEPEEADLCIQALNERWFGGRQLNVQVWDGVTDY 353
>gi|334311921|ref|XP_001368201.2| PREDICTED: HIV Tat-specific factor 1-like [Monodelphis domestica]
Length = 555
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 134/355 (37%), Positives = 189/355 (53%), Gaps = 40/355 (11%)
Query: 167 GVPASPEGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEE 220
G P PEG+D +T DGT Y+WDR +AW P+ ED ++ YG
Sbjct: 137 GSPEGPEGDDPYTFVDPADGTAYEWDRDKKAWFPKITEDFLANYQANYG----------- 185
Query: 221 VFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPD 280
FP D ++ KEK E K + + N ++KQ K E KP
Sbjct: 186 -FPA----DSSSSSGGQKEKQADVEVKASQKPTDPKASQNPEKKQ------KGEKRKPEP 234
Query: 281 SWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
WF ++ NT+VYVTGLP D+T +E V++ SKCGII DP+T++ +IK+Y DK+ G K
Sbjct: 235 GWFHVEEDKNTNVYVTGLPPDITKDEFVQLMSKCGIIMRDPQTEEYKIKLYKDKQ-GNLK 293
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL 398
GD L YLK SV LA +LLD R + V AKF+ KGE + A + K K
Sbjct: 294 GDGLCCYLKRESVDLALRLLDDDEIR---GYKLHVEMAKFQLKGE-YDASKKKKKCKDYR 349
Query: 399 KKV--EEKMLGW---GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVK 453
KK+ ++K L W + A++ VI+R MF P + AD + +E+ D++ EC K
Sbjct: 350 KKMSQQQKQLDWRPEKKANTARMRHERVVIIRNMFHPKDFEADPLVLNEIREDLRSECEK 409
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN 508
G V V V + HP GV V +K+ ++A CI+ +NGRWFGGRQ++ DG+ +
Sbjct: 410 FGQVKKVLVFDRHPDGVASVSYKEPEEADLCIQALNGRWFGGRQLNVEVWDGVTD 464
>gi|212542527|ref|XP_002151418.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces marneffei ATCC 18224]
gi|210066325|gb|EEA20418.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces marneffei ATCC 18224]
Length = 390
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 128/382 (33%), Positives = 189/382 (49%), Gaps = 79/382 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG Y +D L+ WV D + Q Y +
Sbjct: 42 EDGQEYGYDTALKRWVQTIDDALLEQQRQAYKV--------------------------- 74
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
E ++ TEE S E KRKQPD + +K+ + +VNT VY+T +P
Sbjct: 75 -EGVDETEEMTRS------ELRKKKRKQPDDENGQKQKRQ--------RVNTAVYITSIP 119
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T++E+ +VF KCG+I E+ ++ +PRIK+Y+D ++G KG+ALV Y + SV LA Q+
Sbjct: 120 LDATLDEIRDVFCKCGVIAEEIDSGRPRIKMYMD-DSGNFKGEALVVYFRPESVNLAIQM 178
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + KKK+ +K+ K+ W
Sbjct: 179 LDESNFRIGQSGPLG--PMKVQAADFSYKSQQDAPTNTNMRDKKKIMKRTQKLNSKLADW 236
Query: 409 GGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
DD +P T V+L+ MFT E+ D +++ D+++EC K+G V +V
Sbjct: 237 D--DDEPSALPDTSSRFDKVVVLKHMFTLEEIEEDPAAILDIKQDIRDECSKLGDVTNVV 294
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG-------------LVN 508
+ + P GVV VRF D A++C++ M GR+FGG Q+ A DG L +
Sbjct: 295 LYDKEPAGVVTVRFSDPDAARQCVKTMGGRFFGGTQVVAYIADGSEKFKKTNEKRAALED 354
Query: 509 HAAIRDLDAEASRLEQFGAELE 530
A RD D EA RLE FG+ LE
Sbjct: 355 LAEGRDADEEADRLEGFGSWLE 376
>gi|301105717|ref|XP_002901942.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099280|gb|EEY57332.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 520
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 196/362 (54%), Gaps = 24/362 (6%)
Query: 173 EGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLA 232
E D +DG RY +D + +V ED EE+ L+E
Sbjct: 89 ESTDALVAEDGKRYVFDAESKTYVTPEDKIE-------EELASLQEAMQEENAEKQGRNR 141
Query: 233 NDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKV---NT 289
N+ + + ++ +E ++ D V ++++ ++ + D + KK K + K NT
Sbjct: 142 NENQSENQKHNDKENASTGDKVAAKENSAEQPEADAEAAKKRKKKKKKKSDKWKKSKNNT 201
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
VYV GLP DVTV+E+ + F+KCG+I+ D T +PRIK+Y +KE+G GD V Y+KE
Sbjct: 202 WVYVNGLPLDVTVQEVHDHFAKCGVIQPDIATGEPRIKLYQNKESGGLNGDGSVCYMKEA 261
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAK---QVDSKKKKKLKKVEEKML 406
SV LA QLLD + RPD P+ V+ A F+QK F+ + ++DS+ K K+ + +EK L
Sbjct: 262 SVELAVQLLDKSQIRPD--WPIDVSPAVFQQKEGDFVKRKKLKIDSRAKIKMFE-KEKAL 318
Query: 407 GWGGRDDAKLTIPA---TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
W ++ ++ PA V+++ MFTPAE+ DE EL+ D+ +EC KIG V + +
Sbjct: 319 SW---NEGEVNEPAGLRIVVIKHMFTPAEIE-DEAYEKELQEDIHDECSKIGDVTKITLF 374
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLE 523
H GVV+++F A +C+++M+GR+F GR++ DG ++ + E R E
Sbjct: 375 AKHVDGVVVIKFASSGSAARCVDVMSGRFFAGRKLECGFWDG-TDYTHHESKNEEKERAE 433
Query: 524 QF 525
+F
Sbjct: 434 KF 435
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 36 VGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIP---QFLSGISQQVAR 86
GP +L HF++G + TLVWSQ W+P+ +P +FL ++ + R
Sbjct: 15 TGPVTTQQLVHHFVDGDVDGMTLVWSQELDSWKPIGEVPSLKEFLQEANEDLDR 68
>gi|196005113|ref|XP_002112423.1| hypothetical protein TRIADDRAFT_63946 [Trichoplax adhaerens]
gi|190584464|gb|EDV24533.1| hypothetical protein TRIADDRAFT_63946 [Trichoplax adhaerens]
Length = 378
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 203/363 (55%), Gaps = 33/363 (9%)
Query: 180 DDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
D+DG Y++D RAW P+ D ++ YG T ++E PT V + +
Sbjct: 37 DNDGVVYEFDEEKRAWFPKIDDDFIAAYQYHYG----TNTNQQE--PTTGV---ITAANM 87
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVT 294
+ + VN+++ + ++K G+ E K+ P+ WF++ NT+VYV+
Sbjct: 88 APPTAEAAKPIVNASNAIPKQKQKGE--------ESKDRKSQPEGWFDVDETKNTNVYVS 139
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D T EE VE+ SKCGII +D +TK+ ++K+Y D + G KGD YLK SV LA
Sbjct: 140 GLPMDTTEEEFVELMSKCGIIMQDDDTKEFKVKLYRDTD-GQLKGDGRCCYLKVESVDLA 198
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERF--IAKQVDSKKKKKLKKVEEKMLGWG-GR 411
Q+LDG+ F+ D I +V +A F KG R+ ++ KK+ +K +EK+L W +
Sbjct: 199 LQILDGSLFK-DSTI--NVERAMFTLKG-RYDPSLRKKKKNTKKRKRKTQEKLLSWNESK 254
Query: 412 DDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGV 470
+ + + +I++ +F P E D +EL++ V+EEC K GP+ + + + HP+GV
Sbjct: 255 SEKRSKLDKVLIVKNIFHPDEFEKDPTYITELKSSVKEECEAKFGPIKKIIIFDRHPEGV 314
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA-SRLEQFGAEL 529
+ + F + +D KC+E+M+GR+F R++HA + DG N+ I + D E R++++ A +
Sbjct: 315 ISITFANPEDRDKCLEVMHGRYFAKRKLHAEKWDGFTNY-QIEETDMEREERIKKWEAYI 373
Query: 530 EAD 532
E D
Sbjct: 374 EED 376
>gi|452823570|gb|EME30579.1| RNA-binding protein [Galdieria sulphuraria]
Length = 400
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 113/287 (39%), Positives = 165/287 (57%), Gaps = 14/287 (4%)
Query: 229 DDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFE--LK 286
+D+ ND +G+ L + +S + E K ++ K+ E+K+ K + W E K
Sbjct: 85 EDVLNDNIGEPNLERS--NTSSKKKLTLEDLKKKERKRLKRAEQKK--KKRERWNEEVAK 140
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VY TG+P D T EE+VE FSKCGI+K D + K ++K+Y DK G KGD +VTY
Sbjct: 141 NNTSVYFTGIPSDATEEEVVEFFSKCGILKVDAYSGKAKVKLYRDK-LGYLKGDGVVTYA 199
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL--KKVEEK 404
+PSV A ++LD T FR + + A+FE KGE FI ++V + K K++ E+
Sbjct: 200 LQPSVENAFKVLDQTEFRFGTGTRIHLEPARFELKGEDFIPRKVPNTGKPLFSTKQLIEQ 259
Query: 405 MLGWG-GRDDAK-LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
W G DD + L I VIL+ +F P E D + ++ D+ EEC K+G ++ + V
Sbjct: 260 KTSWNDGVDDGRGLRI---VILKKVFEPKEALTDPHYYEDIRKDMLEECSKLGEIEKLTV 316
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
E +P+GVV VRF+ A+ CIELM GRW+GGRQ+ A DG ++
Sbjct: 317 FERNPEGVVAVRFRSPAAAESCIELMTGRWYGGRQLEAEFYDGKTDY 363
>gi|242768282|ref|XP_002341537.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces stipitatus ATCC 10500]
gi|218724733|gb|EED24150.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces stipitatus ATCC 10500]
Length = 389
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 110/296 (37%), Positives = 164/296 (55%), Gaps = 38/296 (12%)
Query: 262 KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPET 321
KRKQPD + +K + +VNT VY+T +P D T++E+ +VF KCG+I E+ ++
Sbjct: 91 KRKQPDDENGQKPKRQ--------RVNTAVYITSIPLDATLDEIRDVFCKCGVIAEEIDS 142
Query: 322 KKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQA 376
+PRIK+Y+D E G KG+ALV Y + SV LA Q+LD + FR P G PM V A
Sbjct: 143 GRPRIKMYMD-EAGNFKGEALVVYFRPESVNLAIQMLDESDFRIGQSGPSG--PMKVQAA 199
Query: 377 KFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGRD-----DAKLTIPATVILRFM 427
F K ++ + + + KKK+ +K+ K+ W + D V+L+ M
Sbjct: 200 DFSFKSQQDVPTSTNMRDKKKIMKRTQKLNSKLADWDDDEPSAVLDTSSRFDKVVVLKHM 259
Query: 428 FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIEL 487
FT E+ D +++ D+++EC K+G V +V + + P GVV VRF D + A++C++
Sbjct: 260 FTLEEIEKDPAAILDIKQDIRDECSKLGDVTNVVLYDQEPAGVVTVRFSDPEAARQCVKT 319
Query: 488 MNGRWFGGRQIHASEDDG-------------LVNHAAIRDLDAEASRLEQFGAELE 530
M GR+FGG Q+ A DG L + A RD D EA RLE+FG+ LE
Sbjct: 320 MGGRYFGGTQVVAYIADGSEKFKKTNEKRAALEDLAEGRDPDEEADRLEEFGSWLE 375
>gi|258568972|ref|XP_002585230.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237906676|gb|EEP81077.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 310
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 109/277 (39%), Positives = 157/277 (56%), Gaps = 38/277 (13%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+ NT VYVT +P D T+EE+ +VF KCG+I E+ ++ +PRIK+Y D E G KGDALV Y
Sbjct: 29 RTNTAVYVTSIPLDATIEEINDVFCKCGVIAEEIDSHRPRIKMYTD-ENGKFKGDALVVY 87
Query: 346 LKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKK----K 396
+ SV LA Q+LD + FR P GK M V QA F K ++ ++ +++ K K
Sbjct: 88 FRPESVNLAIQMLDDSDFRLGETGPQGK--MKVQQADFSFKAQQEAPQKQNTRDKAKIIK 145
Query: 397 KLKKVEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEEC 451
K ++++ K+ W DDA P VILR MFT AE+ D +++ D+++EC
Sbjct: 146 KTQRLKNKLADWD-EDDAATLQPTGRWEKVVILRHMFTLAELEDDPAAILDIKEDIRDEC 204
Query: 452 VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA----------- 500
K+G V +V + + GVV VRFKD + AQ C+E+MNGR+FGG ++ A
Sbjct: 205 SKLGEVTNVVLYDKEESGVVTVRFKDPEAAQACVEMMNGRFFGGTKVEAYIADGRERFRK 264
Query: 501 -------SEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
EDDG A+ D E+ RLE+F + +E
Sbjct: 265 SNDKSYDYEDDGAGWEASYD--DEESKRLEKFSSWIE 299
>gi|348684506|gb|EGZ24321.1| hypothetical protein PHYSODRAFT_311360 [Phytophthora sojae]
Length = 529
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 146/225 (64%), Gaps = 13/225 (5%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
VNT VYV GLP D+TV+E+ + F+KCG+I+ D T +PRIK+Y +KE+G GD V Y+
Sbjct: 205 VNTWVYVNGLPLDITVQEVHDHFAKCGVIQSDIATGEPRIKMYQNKESGGLNGDCSVCYM 264
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAK---QVDSKKKKKLKKVEE 403
KE SV LA QLLD + RP+ P+ V+ A+F+QKG+ F+ + ++D++ K K+ + +E
Sbjct: 265 KEASVELAVQLLDKSQIRPE--WPIDVSPAEFKQKGQDFVKRKKPKIDTRAKIKMFE-KE 321
Query: 404 KMLGWGGRDDAKLTIPA---TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV 460
K L W ++ +++ PA V+++ MFTPAE+ E EL+ D+ +EC KIG V +
Sbjct: 322 KALSW---NEGEVSEPAGLRIVVIKHMFTPAEIED-EEYEKELQEDIHDECSKIGEVSKI 377
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+ H GVV+++F A +C+E+MNGR+F GR++ DG
Sbjct: 378 TLFAKHVDGVVVIKFASSGSAARCVEVMNGRFFAGRKLECGFWDG 422
>gi|345306681|ref|XP_001514214.2| PREDICTED: HIV Tat-specific factor 1 homolog [Ornithorhynchus
anatinus]
Length = 439
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/366 (36%), Positives = 188/366 (51%), Gaps = 45/366 (12%)
Query: 179 TDDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDE 235
T DG+ Y+WDR +AW P+ ED ++ + YG FP + A+DE
Sbjct: 28 TQADGSAYEWDRDHKAWFPKITEDFLATYHANYG------------FPADGSS---ASDE 72
Query: 236 VGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYV 293
TE K + ++ + K Q + K E KP WF ++ NT+VYV
Sbjct: 73 -------KTETKADGDAKTSQKPVDKKTAQHTEPKPKGEKRKPEAGWFHVEEDRNTNVYV 125
Query: 294 TGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVAL 353
TGLP D+T +E V+V SKCGII DP+T++ +IK+Y DK+ G KGD L YLK SV L
Sbjct: 126 TGLPPDITKDEFVQVMSKCGIIMRDPQTEEYKIKLYKDKQ-GNLKGDGLCCYLKRESVDL 184
Query: 354 ATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKM------LG 407
A +LLD R + V AKF+ KGE + A SKK+KK K +KM L
Sbjct: 185 ALRLLDDDEIR---GYKLHVEVAKFQLKGE-YDA----SKKRKKCKDYRKKMSLQQKQLD 236
Query: 408 WGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
W A ++ +I+R MF P + D + +E+ D++ EC K G V V + +
Sbjct: 237 WRPEKKAGASRMRHERVIIIRNMFHPKDFEEDPLVLNEIREDLRTECEKFGQVKKVLLFD 296
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQ 524
HP GV V +K+ +A CI+ +NGRWFGGRQ+ DG+ ++ RL+
Sbjct: 297 RHPDGVASVSYKEADEADLCIQALNGRWFGGRQLSVEAWDGVTDYQVEETSREREERLKV 356
Query: 525 FGAELE 530
+G+ LE
Sbjct: 357 WGSFLE 362
>gi|223948613|gb|ACN28390.1| unknown [Zea mays]
gi|413924808|gb|AFW64740.1| hypothetical protein ZEAMMB73_003429 [Zea mays]
Length = 216
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/254 (40%), Positives = 136/254 (53%), Gaps = 47/254 (18%)
Query: 20 TAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLSSIPQFLSG 79
A E GWY+L NQ+ VGPYA++EL EHF NGY E+T++W++GR EW PLSSI + S
Sbjct: 9 AATEAGWYVLGPNQEGVGPYALAELREHFANGYFNESTMLWAEGRKEWMPLSSISELHSA 68
Query: 80 ISQQVARGSTAVPCNDGIEEVREQIEEAAGMQSQSFSSAEQGVPSHVDDEFEKWQREVRE 139
+ AA QS EQ P VDD+FEK+Q+EV E
Sbjct: 69 V--------------------------AAKDQS------EQDAPD-VDDDFEKFQKEVIE 95
Query: 140 AEIEAERLKNGSASDSVGGYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQE 199
AE E + LK G + + D + P+GE+EFTDDDGT YKWDR LRAWVPQ
Sbjct: 96 AETEVKALK--------GNDVDQQDDERPATPPDGEEEFTDDDGTIYKWDRTLRAWVPQN 147
Query: 200 DTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKH 259
D S + Y +EEMTF EEE+F ++ E++N+ E N + E +
Sbjct: 148 DASGSKEDYAVEEMTFAVEEEIFQAPDILG------PALEEINNLPESKNKGSDKAEMRG 201
Query: 260 NGKRKQPDKQVEKK 273
+ KRK + EKK
Sbjct: 202 DKKRKSYENPAEKK 215
>gi|156552643|ref|XP_001603250.1| PREDICTED: HIV Tat-specific factor 1 homolog [Nasonia vitripennis]
Length = 583
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 126/362 (34%), Positives = 186/362 (51%), Gaps = 39/362 (10%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
+TD DGT Y WDR AW P+ D +D +M++ T N D EV
Sbjct: 232 YTDPSDGTSYIWDREKNAWFPKVD----DDFMARYQMSY------GFTDNTKSDTTESEV 281
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVT 294
K N E K + +K+ E K N+ P +WF++ NT +Y++
Sbjct: 282 PKNPSNEPEPP-------------KKLTKEEKKAENKRKNQEPPTWFDIDEAHNTTIYIS 328
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
LP DVT+EE+ E+ +KCG++ D E K ++K+Y D + G KGDA TY+K SV LA
Sbjct: 329 NLPLDVTMEELKELVTKCGLLARD-EKGKDKLKLYTDAD-GEPKGDARCTYIKIESVDLA 386
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKG--ERFIAKQVDSKKKKKLKKVEEKMLGWGGRD 412
+LDG R +SV +AKF+ KG + + + K K K KK+ EK+L W R
Sbjct: 387 LNILDGWQMRDKT---LSVQRAKFQLKGAYDPSLKPKRKKKDKDKQKKIHEKLLDW--RP 441
Query: 413 DA----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
D L VI++ +F P + D L E + DV++EC+K G V V + + HP+
Sbjct: 442 DKLKGEPLKCERVVIIKNLFKPEDFDKDVTLLLEYQQDVRKECIKCGEVKKVTIYDRHPE 501
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAE 528
GV V FK+ ++AQ CI+L+NGRWF R+I A DG + + + +R+ ++
Sbjct: 502 GVAQVTFKEPEEAQACIQLLNGRWFDQRRITAEIWDGKTKYKILETEEQIEARINKWDKF 561
Query: 529 LE 530
LE
Sbjct: 562 LE 563
>gi|317030364|ref|XP_001392435.2| nuclear mRNA splicing factor-associated protein [Aspergillus niger
CBS 513.88]
Length = 398
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 194/381 (50%), Gaps = 69/381 (18%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
+DG Y++D L+ W+P ++E +++E + V ++ E+
Sbjct: 48 EDGQEYEYDTVLKRWIP-----------TVDEDLLQQQQEAYRVQGVNEE--------EQ 88
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
+ + + + E+ NG++K P KQ +VNT VYVT +P D
Sbjct: 89 ITAAQLRKKRKQQATTEEGNGQQK-PKKQ----------------RVNTAVYVTSIPLDA 131
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
+E+ +VFSKCG+I E+ ++ +PRIK+Y D E+G KG+ALV Y + SV LA Q+LD
Sbjct: 132 EFDEIRDVFSKCGVIAEEIDSGRPRIKMYTD-ESGNFKGEALVVYFRPESVNLAIQMLDD 190
Query: 361 TPFR---PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGRDD 413
+ FR P + PM V A F K ++ + + KKK+ +K+ K+ W DD
Sbjct: 191 SNFRIGVPGPQGPMRVQHADFSFKSQQEAPTKTSMRDKKKIMRRTQKLNSKLADWD--DD 248
Query: 414 AKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
+P T VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 249 DPSALPDTTSRFDKIVILKHMFTLKELDEDPAAILDIKEDIRDECSKLGDVTNVVLYDKE 308
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN-------HAAIRDL---- 515
P+GVV VRF D + A+ C+++M+GR+F G ++ A DG AA+ DL
Sbjct: 309 PEGVVSVRFSDPEAARSCVKMMDGRFFAGTRVDAYIADGSERFKKTNEKRAALEDLAEKG 368
Query: 516 -----DAEASRLEQFGAELEA 531
+ E RL++FG LE+
Sbjct: 369 LDASDEEEEQRLDEFGTWLES 389
>gi|391865430|gb|EIT74714.1| transcription elongation factor TAT-SF1 [Aspergillus oryzae 3.042]
Length = 466
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 192/384 (50%), Gaps = 75/384 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDG +++D L+ W+P ++E +++E + V D NDEV +
Sbjct: 116 DDGQEFEYDTALKRWIP-----------TVDEQLLRQQQEAYKVEGVDD---NDEVTASQ 161
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L ++ + D G ++P KQ +VNT VYVT +P D
Sbjct: 162 LKKKRKQQATKDE-------GNGQKPKKQ----------------RVNTAVYVTSIPLDA 198
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
EE+ VFSKCG+I E+ ++ +PRIK+Y+D + G KG+ALV + + SV LA Q+LD
Sbjct: 199 DFEEIRYVFSKCGVIAEEIDSGRPRIKMYMDDD-GKFKGEALVVFFRPESVNLAIQMLDD 257
Query: 361 TPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGR 411
+ FR P G PM V A F K ++ + +K K+K+ +++ K+ W
Sbjct: 258 SDFRLGVTGPQG--PMRVQPADFSYKSQQEAPTKTSAKDKRKIIQRTQRLNNKLADWD-- 313
Query: 412 DDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
DD +P T VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 314 DDEPSALPETNSKFEKVVILKHMFTLKELDDDPAAILDIKEDIRDECSKLGEVTNVVLYD 373
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG------------LVNHAAI 512
GVV V+F+D + A+ C++LM+GR+F G ++ A DG + A
Sbjct: 374 KETDGVVSVKFQDPEAARNCVKLMDGRYFAGTRVEAYISDGSERFKKSNEKRAALEDLAE 433
Query: 513 RDLDA-----EASRLEQFGAELEA 531
R LDA E RL++FG LE+
Sbjct: 434 RGLDADEDEEEKQRLDEFGTWLES 457
>gi|238498374|ref|XP_002380422.1| nuclear mRNA splicing factor-associated protein, putative
[Aspergillus flavus NRRL3357]
gi|220693696|gb|EED50041.1| nuclear mRNA splicing factor-associated protein, putative
[Aspergillus flavus NRRL3357]
Length = 390
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 192/384 (50%), Gaps = 75/384 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDG +++D L+ W+P ++E +++E + V D NDEV +
Sbjct: 40 DDGQEFEYDTALKRWIPT-----------VDEQLLRQQQEAYKVEGVDD---NDEVTASQ 85
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L ++ + D G ++P KQ +VNT VYVT +P D
Sbjct: 86 LKKKRKQQATKDE-------GNGQKPKKQ----------------RVNTAVYVTSIPLDA 122
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
EE+ VFSKCG+I E+ ++ +PRIK+Y+D + G KG+ALV + + SV LA Q+LD
Sbjct: 123 DFEEIRYVFSKCGVIAEEIDSGRPRIKMYMD-DDGKFKGEALVVFFRPESVNLAIQMLDD 181
Query: 361 TPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGR 411
+ FR P G PM V A F K ++ + +K K+K+ +++ K+ W
Sbjct: 182 SDFRLGVTGPQG--PMRVQPADFSYKSQQEAPTKTSAKDKRKIIQRTQRLNNKLADWD-- 237
Query: 412 DDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
DD +P T VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 238 DDEPSALPETNSKFEKVVILKHMFTLKELDDDPAAILDIKEDIRDECSKLGEVTNVVLYD 297
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG------------LVNHAAI 512
GVV V+F+D + A+ C++LM+GR+F G ++ A DG + A
Sbjct: 298 KETDGVVSVKFQDPEAARNCVKLMDGRYFAGTRVEAYISDGSERFKKSNEKRAALEDLAE 357
Query: 513 RDLDA-----EASRLEQFGAELEA 531
R LDA E RL++FG LE+
Sbjct: 358 RGLDADEDEEEKQRLDEFGTWLES 381
>gi|221119146|ref|XP_002158619.1| PREDICTED: HIV Tat-specific factor 1 homolog [Hydra magnipapillata]
Length = 409
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 183/347 (52%), Gaps = 35/347 (10%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDD-LANDEVGKEK 240
DGT Y+WD+G R W+P+ D Y F E P +N+ DD DE K
Sbjct: 81 DGTIYEWDQGKRGWIPKIDDDFI--AYYQANYGFTPSGEHDPNINLHDDENVKDETSK-- 136
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTGLPD 298
V +EK + + KQ ++ E++E WF++ K N +VYVTGLP+
Sbjct: 137 -------------VSKEKKSEEMKQKKRKNEEQE-------WFDIDQKTNNNVYVTGLPE 176
Query: 299 DVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLL 358
++T E+ V + SKCGII ED +T ++++Y + + G KGDA YLK SV LA +LL
Sbjct: 177 NLTEEDFVTLMSKCGIIMED-DTGAKKVRLYKNTD-GSFKGDARCCYLKHESVDLACKLL 234
Query: 359 DGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK--KVEEKMLGWGGRDDAKL 416
D + F + V QA F+ KG F KKKKK++ K + K+L W R +
Sbjct: 235 DESDFEGS---KIHVEQAVFQLKG-NFNPNLKPKKKKKKVREGKGQTKLLDWVDRPQKRS 290
Query: 417 TIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
VIL+ MF E D L +EL+AD++ EC K G + + V + +P+GV + FK
Sbjct: 291 KFDRIVILKNMFDNKEFENDPTLINELKADLRSECEKFGEIKKIIVFDRNPEGVASILFK 350
Query: 477 DRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLE 523
+ + A KCIE +NGR++ + + A+ DG+ N+ + EA RL+
Sbjct: 351 EPEYADKCIEALNGRYYAKKVLSATTYDGVTNYQVQETAEEEAKRLQ 397
>gi|334350282|ref|XP_001362589.2| PREDICTED: HIV Tat-specific factor 1-like [Monodelphis domestica]
Length = 444
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 126/352 (35%), Positives = 192/352 (54%), Gaps = 42/352 (11%)
Query: 172 PEGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTV 225
PEG D +T DGT Y+WD +AW P+ ED ++ YG FP
Sbjct: 29 PEGNDPYTYVDQADGTAYEWDLDKKAWFPKITEDFLATYQANYG------------FPP- 75
Query: 226 NVTDDLANDEVGKEKLNSTEEKVNSADNVVEEK--HNGKRKQPDKQVEKKEANKPPDSWF 283
D+ A+ K K +++ V + ++ ++K N ++KQ K E KP WF
Sbjct: 76 ---DEPASSHPQKAK--ESDKNVKAGEDPTDQKTAQNSEKKQ------KGEKRKPEPGWF 124
Query: 284 ELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDA 341
++ NT+VYVTGLP D+T +E +++ SKCGII +DP+T++ +IK+Y DK+ G KGD
Sbjct: 125 HVEEGRNTNVYVTGLPPDITKDEFIQLMSKCGIIMKDPQTEEYKIKLYKDKQ-GNLKGDG 183
Query: 342 LVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-K 400
L YLK+ SV LA +LLD R + V +AKF+ KGE +K+ K + K
Sbjct: 184 LCCYLKKESVDLALKLLDDDVIR---GYKLHVEKAKFQLKGEYDASKKKKKCKDYRKKLS 240
Query: 401 VEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPV 457
+++K L W A ++ V++R MF P + D + +E+ D++ EC K G V
Sbjct: 241 LQQKQLDWRPEKKAGSTRMRHERVVVIRNMFHPKDFEEDPLVLNEIREDLRTECEKFGQV 300
Query: 458 DSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V + + HP GV V +K+ ++A CI+ +NGRWFGGRQ+H DG+ ++
Sbjct: 301 KKVLLFDRHPDGVASVSYKEPEEADLCIQALNGRWFGGRQLHVEVWDGVTDY 352
>gi|291243989|ref|XP_002741882.1| PREDICTED: HIV-1 Tat specific factor 1-like [Saccoglossus
kowalevskii]
Length = 370
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 186/368 (50%), Gaps = 44/368 (11%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTF--LKEEEVFPTVNVTDDL 231
+TD +DGT Y+WD G +AW P+ D + YG E+ K+ PT
Sbjct: 27 YTDPNDGTVYEWDDGKQAWFPKIDDDFIAQYQANYGAHELVVGESKQSASNPT------- 79
Query: 232 ANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNT 289
K+ + ++D+ + E +++ D + WFE+ K N
Sbjct: 80 --------KVEQSPCFATTSDSKIMESQKAHKRKAD------------EGWFEMEEKKNR 119
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
+VYVTGLP D T++E ++ SKCGII D +T +P+IK+Y+D E G KGD YLK
Sbjct: 120 NVYVTGLPYDTTLQEFQDLMSKCGIIMVDEQTNEPKIKLYLD-EIGNLKGDGRCCYLKRE 178
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGW- 408
SV LA Q+LDG R ++V A+F+ KG +K+ +KKKKK K +EK+L W
Sbjct: 179 SVDLALQILDGYDMRGH---RITVELAQFQLKGRFDPSKKKKTKKKKKKKNSQEKLLEWR 235
Query: 409 ----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
+++ VILR MF P E D +E+ D++ EC K G V V + +
Sbjct: 236 PERTKMMGESRKRHECVVILRHMFEPKEFEEDPMAINEIIDDLKAECGKFGEVKKVLIFD 295
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQ 524
HP GV V+FK+ ++A CI +NGRWF RQI A+ DG ++ RL+
Sbjct: 296 RHPDGVASVKFKEPEEADVCIAALNGRWFAKRQIIANTWDGQTDYQVEETSREREERLKG 355
Query: 525 FGAELEAD 532
+ + LE D
Sbjct: 356 WQSFLEQD 363
>gi|327293814|ref|XP_003231603.1| nuclear mRNA splicing factor-associated protein [Trichophyton
rubrum CBS 118892]
gi|326466231|gb|EGD91684.1| nuclear mRNA splicing factor-associated protein [Trichophyton
rubrum CBS 118892]
Length = 388
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 128/383 (33%), Positives = 179/383 (46%), Gaps = 75/383 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
D+G Y +D L+ W+P D S Q Y I+ + DE
Sbjct: 37 DNGEEYTYDDALKRWIPSLDESLMEQQRQAYKIQGV--------------------DEEE 76
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
L + +EK KRK D+ ++ KP +VNT VYVT +P
Sbjct: 77 PANLKTLQEK------------KKKRKHNDESTTAQKPKKP-------RVNTAVYVTAIP 117
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D TV E+ +FSKCG+I E+ ++ KPRIK+Y D E G KGDALV Y + SV LA Q+
Sbjct: 118 LDATVSEISSLFSKCGVIAEEIDSGKPRIKMYTD-EQGAFKGDALVVYFRPESVNLAIQM 176
Query: 358 LDGTPFRPDGKIP---MSVTQAKFEQKGERFIAKQVDSKKK----KKLKKVEEKMLGW-- 408
LD T FR K M V A F K + + + + K +K +K+ K+ W
Sbjct: 177 LDDTDFRFGEKGAEGNMRVQPADFSFKAVQETPAKANLRDKMKIIRKTQKLNNKLTDWDD 236
Query: 409 ---GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
G R K V+L+ MFT E+ D +++ D++EEC K+G V +V + +
Sbjct: 237 DDVGPRHSGKAG--KVVVLKHMFTLQELEEDPAAILDIKEDIREECSKLGEVTNVVLYDK 294
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHA--------------- 510
G+ VRF D + AQ C+++MNGR+FGG Q+ A DG V
Sbjct: 295 EESGIATVRFSDPECAQACVQMMNGRFFGGTQVEAYVVDGKVRFKKSGASAAAALQDDGA 354
Query: 511 ---AIRDLDAEASRLEQFGAELE 530
A D EA RL++FGA LE
Sbjct: 355 GWEAEASKDDEAQRLDKFGAWLE 377
>gi|393245184|gb|EJD52695.1| hypothetical protein AURDEDRAFT_111297 [Auricularia delicata
TFB-10046 SS5]
Length = 387
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 133/377 (35%), Positives = 186/377 (49%), Gaps = 54/377 (14%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAN 233
G E+ DDDG +WD AWVP D LK+++ +V D+ A
Sbjct: 32 GRWEYEDDDGNTMEWDMVKSAWVPVVDDE------------LLKKQQAAYSVAGVDEEAP 79
Query: 234 DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYV 293
+ K EE + N+ K G P QV K NT VYV
Sbjct: 80 ALEPRAKKRKKEEDPDLPVNIKRPKKGGASAAPRPQVSK---------------NTAVYV 124
Query: 294 TGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVAL 353
TGLP D T +E+VE FSK G+I ED E P++K+Y ++ G G+ALV Y KE SV L
Sbjct: 125 TGLPPDATKDEVVERFSKFGLIMEDDEGA-PKVKMYA-RDDGSFNGEALVVYFKEESVTL 182
Query: 354 ATQLLDGTPFRPD-GKIPMSVTQAKFEQKGE-----RFIAKQVDSKKK--KKLKKVEEKM 405
A +LD R K MSV +A F K + A +V KKK K++ K+++K+
Sbjct: 183 AVTMLDDAELRVGIDKTRMSVQKADFTHKQPQQAEVKSSAPRVVDKKKATKRIGKMQKKL 242
Query: 406 LGWGGRDD-------AKLTIPAT----VILRFMFTPAEMRADENLRSELEADVQEECVKI 454
WG DD K PA+ V+L+ MF+ AE+ D +L +L+ DV+EEC +
Sbjct: 243 EEWGSDDDFGPSREAPKKAGPASTSRVVVLKHMFSLAELDEDASLLLDLKDDVREECSNL 302
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
G V +V + + P G++ V+F+D AQ C+ MNGR+FGGR+I A DG + D
Sbjct: 303 GDVTNVVLYDKEPDGIMTVKFRDPVSAQACVIKMNGRFFGGRRIEAWVYDGKQRYQRTGD 362
Query: 515 LDA------EASRLEQF 525
+A E +RL+ F
Sbjct: 363 RNAEDGDEDEKARLDAF 379
>gi|116182010|ref|XP_001220854.1| hypothetical protein CHGG_01633 [Chaetomium globosum CBS 148.51]
gi|88185930|gb|EAQ93398.1| hypothetical protein CHGG_01633 [Chaetomium globosum CBS 148.51]
Length = 411
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 178/370 (48%), Gaps = 74/370 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
DDGT +++D+GL+ W+P D + Q GY + P V+ D G
Sbjct: 37 DDGTEFEFDQGLKRWIPMIDEALIEEQQKGY------------MMPNVDEEDARQGSTQG 84
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
+++ K++S D + HN K + + K+ N+ P + K NT VYVTGLP
Sbjct: 85 RKR------KMDSNDREQDNNHNSKDRS--SKSAKRHGNRGPS---QPKQNTAVYVTGLP 133
Query: 298 DDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
D T EE+ E+FS KCG+I E+ ++ +PRIK+Y D + G KGDAL+ + K SV +A
Sbjct: 134 SDATAEEVAELFSRKCGVIAEEIDSGRPRIKMYTDAD-GNFKGDALIVFFKPQSVEMAIM 192
Query: 357 LLDGTPFR--PDGKIPMSVTQAKFEQKGERF-----------------IAKQVD------ 391
LLD T FR M V A K ++ I + D
Sbjct: 193 LLDDTDFRFGSSATPKMHVQAADMSYKKTKYDGGDAKPGEANNDNNNNITRSSDASAADK 252
Query: 392 ------SKKKKKLKKVEEKMLGWGGRD----DAKLTIPA-----------TVILRFMFTP 430
+K KK +K+ K+ W + D PA VILR MFT
Sbjct: 253 RSNQDKAKIIKKTQKLSAKLADWSDDEHADGDPMTGAPAHTGSGSSKHDRLVILRHMFTL 312
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
E+R D +++ D++EEC K+GPV +V + + +G+V VRF+ R+ A+ C+ LM+G
Sbjct: 313 EELREDPTAMLDIKEDIREECAKLGPVTNVVLYDEEEEGIVSVRFRTREAAEACLRLMHG 372
Query: 491 RWFGGRQIHA 500
R F GR + A
Sbjct: 373 RAFAGRIVEA 382
>gi|378729763|gb|EHY56222.1| hypothetical protein HMPREF1120_04312 [Exophiala dermatitidis
NIH/UT8656]
Length = 379
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/340 (35%), Positives = 164/340 (48%), Gaps = 63/340 (18%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
++G ++WD L+ W+P D S ++ E++ V +
Sbjct: 42 EEGNEFEWDTALKRWIPVLDQS-----------LLEQQSEIYKVPGVDE----------- 79
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
N K KRK + E + K P +VNT VYVT +P D
Sbjct: 80 ------------NATPAKEKKKRKHANNDDEAGQKPKKP------RVNTAVYVTSIPLDA 121
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
+E+ +VFSKCG+I ED +T KPRIK+Y D E G KGDALV Y + SV LA Q+LD
Sbjct: 122 DQDEIQQVFSKCGVIAEDLDTGKPRIKMYED-ENGKFKGDALVVYFRPESVQLAVQMLDD 180
Query: 361 TPFR-----PDGKIPMSVTQAKFEQKGERFIAKQV------DSKKKKKLKKVEEKMLG-- 407
T FR P GK M V A F K ++ Q K +KKL K +KM+G
Sbjct: 181 TDFRFGVEGPMGK--MKVQPADFSHKKQQQPEDQPAQPSKPSRKDQKKLMKRTQKMIGKL 238
Query: 408 --WGGRDDAKLTIPAT-----VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV 460
W D L ++ VIL+ MFT E+ D E++ D++EEC K+G V +V
Sbjct: 239 TDWDDDDPQALQATSSRWDKVVILKHMFTLEELEEDPAAMLEIKEDIREECSKLGQVTNV 298
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
+ + P+GV VRF + + A C+ LMNGRWF RQ+ A
Sbjct: 299 VLFDKEPEGVASVRFSNAESAAACVRLMNGRWFDERQLEA 338
>gi|317155641|ref|XP_001825256.2| nuclear mRNA splicing factor-associated protein [Aspergillus oryzae
RIB40]
Length = 390
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 191/384 (49%), Gaps = 75/384 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDG +++D L+ W+P ++E +++E + V D NDEV +
Sbjct: 40 DDGQEFEYDTALKRWIPT-----------VDEQLLRQQQEAYKVEGVDD---NDEVTASQ 85
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L ++ + D G ++P KQ +VNT VYVT +P D
Sbjct: 86 LKKKRKQQATKDE-------GNGQKPKKQ----------------RVNTAVYVTSIPLDA 122
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
EE+ VFSKCG+I E+ ++ +PRIK+Y+D + G KG+ALV + + SV LA Q+LD
Sbjct: 123 DFEEIRYVFSKCGVIAEEIDSGRPRIKMYMD-DDGKFKGEALVVFFRPESVNLAIQMLDD 181
Query: 361 TPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGR 411
+ FR P G PM V A F K ++ + +K K+K+ +++ K+ W
Sbjct: 182 SDFRLGVTGPQG--PMRVQPADFSYKSQQEAPTKTSAKDKRKIIQRTQRLNNKLADWD-- 237
Query: 412 DDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
DD +P T VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 238 DDEPSALPETNSKFEKVVILKHMFTLKELDDDPAAILDIKEDIRDECSKLGEVTNVVLYD 297
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG------------LVNHAAI 512
GVV V+F+D + A+ C++LM+GR+F G ++ A DG + A
Sbjct: 298 KETDGVVSVKFQDPEAARNCVKLMDGRYFAGTRVEAYISDGSERFKKSNEKRAALEDLAE 357
Query: 513 RDLDA-----EASRLEQFGAELEA 531
R LDA E RL +FG LE+
Sbjct: 358 RGLDADEDEEEKQRLVEFGTWLES 381
>gi|119184211|ref|XP_001243033.1| hypothetical protein CIMG_06929 [Coccidioides immitis RS]
gi|392865934|gb|EAS31786.2| nuclear mRNA splicing factor-associated protein [Coccidioides
immitis RS]
Length = 391
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/277 (38%), Positives = 156/277 (56%), Gaps = 39/277 (14%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+ NT VYVT +P D TV+E+ +VF KCG+I E+ ++ +PRIK+Y D E G KGDALV Y
Sbjct: 111 RTNTAVYVTSIPLDATVDEINDVFCKCGVIAEEIDSHRPRIKMYTD-ENGNFKGDALVVY 169
Query: 346 LKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL-- 398
+ SV LA Q+LD + FR P GK M V QA F K ++ ++ +++ K K+
Sbjct: 170 FRPESVNLAIQMLDDSDFRFGEPGPQGK--MKVQQADFSFKAQQEAPEKPNARDKAKIIR 227
Query: 399 --KKVEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEEC 451
+K++ K+ W D A T P+ VIL+ MFT E+ D +++ D+++EC
Sbjct: 228 KTQKLKNKLADWDEEDAA--TQPSGRWEKVVILKHMFTLQELEEDPAAILDIKEDIRDEC 285
Query: 452 VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI------------- 498
K+G V +V + + GVV VRFKD AQ C+ +M+GR+FGG Q+
Sbjct: 286 SKLGEVTNVVMYDKEESGVVTVRFKDPDAAQACVRMMHGRFFGGTQVEAYIADGRERFKK 345
Query: 499 -----HASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
H EDDG + + D EA RLE+FG+ +E
Sbjct: 346 SSTKGHDYEDDGA--GWEVGNDDEEAKRLEEFGSWIE 380
>gi|303320251|ref|XP_003070125.1| RNA recognition motif family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240109811|gb|EER27980.1| RNA recognition motif family protein [Coccidioides posadasii C735
delta SOWgp]
gi|320031977|gb|EFW13934.1| nuclear mRNA splicing factor-associated protein [Coccidioides
posadasii str. Silveira]
Length = 391
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/277 (38%), Positives = 156/277 (56%), Gaps = 39/277 (14%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+ NT VYVT +P D TV+E+ +VF KCG+I E+ ++ +PRIK+Y D E G KGDALV Y
Sbjct: 111 RTNTAVYVTSIPLDATVDEINDVFCKCGVIAEEIDSHRPRIKMYTD-ENGNFKGDALVVY 169
Query: 346 LKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL-- 398
+ SV LA Q+LD + FR P GK M V QA F K ++ ++ +++ K K+
Sbjct: 170 FRPESVNLAIQMLDDSDFRFGEPGPQGK--MKVQQADFSFKAQQEAPEKPNARDKAKIIR 227
Query: 399 --KKVEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEEC 451
+K++ K+ W D A T P+ VIL+ MFT E+ D +++ D+++EC
Sbjct: 228 KTQKLKNKLADWDEEDAA--TQPSGRWEKVVILKHMFTLQELEEDPAAILDIKEDIRDEC 285
Query: 452 VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI------------- 498
K+G V +V + + GVV VRFKD AQ C+ +M+GR+FGG Q+
Sbjct: 286 SKLGEVTNVVMYDKEESGVVTVRFKDPDAAQACVRMMHGRFFGGTQVEAYIADGRERFKK 345
Query: 499 -----HASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
H EDDG + + D EA RLE+FG+ +E
Sbjct: 346 SSTKGHDYEDDGA--GWEVGNDDEEARRLEEFGSWIE 380
>gi|358372866|dbj|GAA89467.1| nuclear mRNA splicing factor-associated protein [Aspergillus
kawachii IFO 4308]
Length = 386
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 168/314 (53%), Gaps = 37/314 (11%)
Query: 248 VNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVE 307
VN + + + KRKQ E KP + +VNT VYVT +P D +E+ +
Sbjct: 71 VNEEEQITAAQLRKKRKQQASTEEGNGQQKPK----KQRVNTAVYVTSIPLDAEFDEIRD 126
Query: 308 VFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR--- 364
VFSKCG+I E+ ++ +PRIK+Y D E+G KG+ALV Y + SV LA Q+LD + FR
Sbjct: 127 VFSKCGVIAEEIDSGRPRIKMYTD-ESGKFKGEALVVYFRPESVNLAIQMLDDSNFRIGV 185
Query: 365 PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGRDDAKLTIPA 420
P + PM V A F K ++ + + KKK+ +K+ K+ W DD +P
Sbjct: 186 PGPQGPMRVQHADFSFKSQQEAPTKTSMRDKKKIMRRTQKLNSKLADWD--DDDPSALPD 243
Query: 421 T-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLV 473
T VIL+ MFT E+ D +++ D++EEC K+G V +V + + P+GVV V
Sbjct: 244 TTSRFDKLVILKHMFTLKELEEDPAAILDIKEDIREECSKLGDVTNVVLYDKEPEGVVSV 303
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHASEDDGL-------VNHAAIRDL---------DA 517
RF D + A+ C+++M+GR+F G ++ A DG AA+ DL +
Sbjct: 304 RFSDPEAARSCVKMMDGRFFAGTRVDAYIADGSEKFKKTNEKRAALEDLAEKGLDASDEE 363
Query: 518 EASRLEQFGAELEA 531
E RL++FG LE+
Sbjct: 364 EEQRLDEFGTWLES 377
>gi|41055136|ref|NP_957497.1| HIV TAT specific factor 1 [Danio rerio]
gi|33417205|gb|AAH55565.1| HIV TAT specific factor 1 [Danio rerio]
Length = 450
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 185/343 (53%), Gaps = 30/343 (8%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFP----TVNVTDDLA 232
+TD +DGT Y WD RAW P+ +D + + EE P VN + L
Sbjct: 34 YTDPEDGTVYDWDHEKRAWFPK----INDDFIAAYQANYGFNEEGAPDPSAAVNSAESLP 89
Query: 233 NDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKV--NTH 290
E+ EE A E GK K+ +K+ EK++A+ +WFE++ NT+
Sbjct: 90 ---AKSEEPKKPEEPTKPA----EASDQGKNKEGEKKGEKRKADP---TWFEVEKEKNTN 139
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+VTGLP D+T +E V V SKCGI+ DP T++ +I+IY DK+ G +KGD L YLK+ S
Sbjct: 140 VHVTGLPPDITPDEFVGVMSKCGIVMRDPITEEYKIRIYKDKD-GNQKGDGLCCYLKKES 198
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
VALA QLLD T R + V +A+FE KG+ F A + K K+ KK++++
Sbjct: 199 VALAKQLLDETEIR---GYRLHVEEARFELKGQ-FDASKKKKKNKEYRKKLQQQQKQLDW 254
Query: 411 RDDAKLTI----PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
R + + +I++ MF P + D + +E D++ EC K G V V + + H
Sbjct: 255 RPEKSGEVRKRHEKVIIIQNMFHPTDFEEDPLVLNEYRDDLRTECEKFGQVKKVIIFDRH 314
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
P GV V FK+ ++A C +NGRWFGGR++ A DG+ ++
Sbjct: 315 PDGVASVAFKEPEEADACQVALNGRWFGGRKLSALLWDGVTDY 357
>gi|169642352|gb|AAI60527.1| htatsf1 protein [Xenopus (Silurana) tropicalis]
Length = 389
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 158/290 (54%), Gaps = 15/290 (5%)
Query: 228 TDDLANDEVGKEKLNSTEEKVNSADNVVEE-KHNGKRKQPDKQVEKKEANKPPDSWF--E 284
TD+ N + K K S + + EE K ++ +P + EK++A+ WF E
Sbjct: 13 TDNTVNTDAAKAKETSVSSEAQAIKKPKEEQKPQTEQGEPKAKGEKRKADP---GWFNVE 69
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
+ NT+VYVTGLP D+T EE E+ SKCGII DP+++ +IK+Y D+E G KGD L
Sbjct: 70 AQRNTNVYVTGLPPDITNEEFAEIMSKCGIIMRDPQSEDYKIKLYKDRE-GNLKGDGLCC 128
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEE 403
YLK SV LA +LLD R + V AKF+ KGE K+ K K K +++
Sbjct: 129 YLKRESVDLALRLLDEYEIRG---YKLHVENAKFQLKGEYDATKKKKKCKDYKKKISIQQ 185
Query: 404 KMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDS 459
K L W D +++ VI++ MF P + D + +E+ D++ EC K G V
Sbjct: 186 KQLDWRPEKKANDTSRMRFERVVIIKNMFHPKDFEEDPLVLNEIREDLRSECEKFGQVKK 245
Query: 460 VKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + + HP GV V FK+ ++ CI+ +NGRWFGGRQ+ DG+ ++
Sbjct: 246 LLIFDQHPDGVASVAFKEAEEGDMCIQALNGRWFGGRQLIVESWDGVTDY 295
>gi|380027330|ref|XP_003697380.1| PREDICTED: HIV Tat-specific factor 1 homolog [Apis florea]
Length = 482
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 121/352 (34%), Positives = 189/352 (53%), Gaps = 48/352 (13%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
+TD +DG+ Y WD+ AW P+ D D +M++
Sbjct: 78 YTDPNDGSVYFWDKEKNAWFPKVD----EDFMARYQMSY--------------------- 112
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVT 294
G N E K V + + GK + K++E K ++ P +WFE+ NT +YV+
Sbjct: 113 GFTDTNRIETKP-----VQQSQSQGKIDKEKKKMEAKRKSQEPPTWFEVDEAHNTAIYVS 167
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D+T++E+ E+F+KCG+I D E K +IK+Y D G KGDAL Y+K SV LA
Sbjct: 168 GLPLDITMDELTELFNKCGLIARD-EKGKDKIKLYKD-SNGQPKGDALCIYIKVESVDLA 225
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERF---IAKQVDSKKKKKLKKVEEKMLGWGGR 411
++LD + R GK +SV +AKF+ KG+ + + + K K++ KK++EK+ W R
Sbjct: 226 LKILDKSQIR--GKT-LSVQRAKFQMKGDAYDPALKPKRKKKDKERQKKLQEKLFDW--R 280
Query: 412 DDAKLTIPA----TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
+ P+ VI++ +F+P + + +L E + D++ EC+K G V V + + HP
Sbjct: 281 PERAPGEPSKHERVVIIKNLFSPEDFDKEVSLLLEYQQDIRSECLKCGDVRKVVIYDRHP 340
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA 519
+GV V F++ +AQ CI+L+NGRWF R+I A DG + I + DAE
Sbjct: 341 EGVAQVTFREPAEAQACIQLLNGRWFSQRKISAEIWDGKTKY-KILETDAET 391
>gi|259488493|tpe|CBF87970.1| TPA: nuclear mRNA splicing factor-associated protein, putative
(AFU_orthologue; AFUA_1G10920) [Aspergillus nidulans
FGSC A4]
Length = 396
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 122/379 (32%), Positives = 189/379 (49%), Gaps = 68/379 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDG Y +D L+ W+P I++ +++E + V +D ++ G ++
Sbjct: 49 DDGQEYSYDTILKRWIPT-----------IDDDLLRQQQEAYKVEGVDED---EQAGAQR 94
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L KRKQ E+ +A KP + +VNT V+VT +P D
Sbjct: 95 LKK------------------KRKQ---GSEEGDAQKPK----KQRVNTAVFVTSIPLDA 129
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
+E+ ++FS+CG+I E+ ++ +PRIK+Y D E G KG+ALV Y + SV LA Q+LD
Sbjct: 130 EFDEIRDIFSRCGVIAEEIDSGRPRIKMYTDDE-GKFKGEALVVYFRPESVNLAIQMLDD 188
Query: 361 TPFR---PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGRDD 413
+ FR P + PM V A F K ++ + + KKK+ +K+ K+ W +
Sbjct: 189 SDFRLGVPGPQGPMRVQPADFSFKSQQEAPTKTSMRDKKKIIKRTQKLNNKLADWDDDEP 248
Query: 414 AKLT-----IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
A L VIL+ MFT E+ D +++ D++EEC K+G V +V + +
Sbjct: 249 AALVDTNSKFEKIVILKHMFTLQELEEDPAAILDIKEDIREECSKLGEVTNVVLYDKEEA 308
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGL-------VNHAAIRDL------ 515
GVV VRF + + A+ C++LMNGR+FGG + A DG AA+ DL
Sbjct: 309 GVVSVRFSNPESARACVKLMNGRFFGGTAVEAYISDGSERFRKSNEKRAALEDLAEKGLD 368
Query: 516 ---DAEASRLEQFGAELEA 531
D E RL++FG LE+
Sbjct: 369 AEDDDENQRLDEFGTWLES 387
>gi|348518982|ref|XP_003447010.1| PREDICTED: HIV Tat-specific factor 1 homolog [Oreochromis
niloticus]
Length = 452
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 180/364 (49%), Gaps = 50/364 (13%)
Query: 173 EGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVN 226
+G D +T +DGT Y WD +AW P+ ED ++ YG + + L + T
Sbjct: 27 DGSDPYTYTDPEDGTVYDWDHEKKAWFPKITEDFMAAYQANYGFTQESALDGKNTSLTGT 86
Query: 227 VTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL- 285
D +K E+ + A N E+H K+ ++ EK++A + WF++
Sbjct: 87 NPAAPGTDSKPPQKPPEKEKPEDPAPNSASEQHETPGKETKQKGEKRKAEQ---GWFDID 143
Query: 286 -KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
NT+VYV+GLP D++ +E E+ SKCGI+ DP T++ ++K+Y DKE G KGD L
Sbjct: 144 ESKNTNVYVSGLPPDISSDEFAELMSKCGIVMRDPITEEYKVKLYKDKE-GNLKGDGLCC 202
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS------------ 392
YLK+ SV LA +L+D + R + V A+FE KG+ +K+
Sbjct: 203 YLKKESVDLAIRLIDESEVR---GYKLHVEAARFELKGQYDASKKKKKSKDYKKKLQQQQ 259
Query: 393 -------KKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEA 445
+KK L+K EK+ VI+R MF P++ D + +E
Sbjct: 260 KQLDWRPEKKGDLRKRHEKV----------------VIIRNMFHPSDFEEDPLVLNEYRE 303
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D++ EC K G V V + + HP GV V FK+ + A CI NGRWFGGRQ+ A DG
Sbjct: 304 DLRSECEKFGEVKKVILFDRHPDGVASVAFKEPEQADACILSFNGRWFGGRQLSAQLWDG 363
Query: 506 LVNH 509
++
Sbjct: 364 TTDY 367
>gi|37590291|gb|AAH59346.1| LOC398736 protein, partial [Xenopus laevis]
Length = 477
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 180/351 (51%), Gaps = 34/351 (9%)
Query: 173 EGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVN 226
+G+D +T DGT Y+WD +AW P+ ED + + YG++
Sbjct: 53 DGQDPYTFRDPSDGTEYEWDVEKKAWFPKITEDFLAMYHANYGVD-------------TG 99
Query: 227 VTDDLANDEVGKEK-LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL 285
TD+ N + K K + + E E+K ++ +P + EK++A+ WF +
Sbjct: 100 GTDNALNTDATKTKEMPVSSEAQEKKKPKEEQKLQTEQGEPKAKGEKRKADP---GWFNV 156
Query: 286 --KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
+ NT+VYVTGLP D T EE E+ SKCGII DP ++ +IK+Y D+E G KGD L
Sbjct: 157 EEQRNTNVYVTGLPPDTTNEEFAEIMSKCGIIMRDPHSEDYKIKLYKDRE-GNLKGDGLC 215
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER-FIAKQVDSKKKKKLKKVE 402
YLK SV LA +LLD R + V AKF+ KG K+ K +K ++
Sbjct: 216 CYLKRESVELALKLLDEYEIR---GYKLRVENAKFQLKGAYDATKKKKKCKDYRKKMSMQ 272
Query: 403 EKMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
+K L W D +++ VI++ MF P + D + +E+ D++ EC K G V
Sbjct: 273 QKQLDWRPEKKANDASRMRFERVVIIKNMFHPKDFEEDPLVLNEIREDLRSECEKFGQVK 332
Query: 459 SVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + + HP GV V FK+ + CI+ +NGRWFGGRQ+ DG+ ++
Sbjct: 333 KLLIFDQHPDGVASVAFKEANEGDLCIQALNGRWFGGRQLAVEPWDGVTDY 383
>gi|67518043|ref|XP_658792.1| hypothetical protein AN1188.2 [Aspergillus nidulans FGSC A4]
gi|40747150|gb|EAA66306.1| hypothetical protein AN1188.2 [Aspergillus nidulans FGSC A4]
Length = 502
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 122/379 (32%), Positives = 188/379 (49%), Gaps = 68/379 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDG Y +D L+ W+P I++ +++E + V +D ++ G ++
Sbjct: 49 DDGQEYSYDTILKRWIPT-----------IDDDLLRQQQEAYKVEGVDED---EQAGAQR 94
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L KRKQ E+ +A KP +VNT V+VT +P D
Sbjct: 95 LKK------------------KRKQ---GSEEGDAQKPKKQ----RVNTAVFVTSIPLDA 129
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
+E+ ++FS+CG+I E+ ++ +PRIK+Y D E G KG+ALV Y + SV LA Q+LD
Sbjct: 130 EFDEIRDIFSRCGVIAEEIDSGRPRIKMYTDDE-GKFKGEALVVYFRPESVNLAIQMLDD 188
Query: 361 TPFR---PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGRDD 413
+ FR P + PM V A F K ++ + + KKK+ +K+ K+ W +
Sbjct: 189 SDFRLGVPGPQGPMRVQPADFSFKSQQEAPTKTSMRDKKKIIKRTQKLNNKLADWDDDEP 248
Query: 414 AKLT-----IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
A L VIL+ MFT E+ D +++ D++EEC K+G V +V + +
Sbjct: 249 AALVDTNSKFEKIVILKHMFTLQELEEDPAAILDIKEDIREECSKLGEVTNVVLYDKEEA 308
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN-------HAAIRDL------ 515
GVV VRF + + A+ C++LMNGR+FGG + A DG AA+ DL
Sbjct: 309 GVVSVRFSNPESARACVKLMNGRFFGGTAVEAYISDGSERFRKSNEKRAALEDLAEKGLD 368
Query: 516 ---DAEASRLEQFGAELEA 531
D E RL++FG LE+
Sbjct: 369 AEDDDENQRLDEFGTWLES 387
>gi|239608186|gb|EEQ85173.1| nuclear mRNA splicing factor-associated protein [Ajellomyces
dermatitidis ER-3]
gi|327349369|gb|EGE78226.1| hypothetical protein BDDG_01163 [Ajellomyces dermatitidis ATCC
18188]
Length = 399
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 128/414 (30%), Positives = 198/414 (47%), Gaps = 91/414 (21%)
Query: 172 PEGEDEFTDD-----------------DGTRYKWDRGLRAWVPQEDTS---SQNDGYGIE 211
P+ DEF DD DG+ +++D L+ WVP D + Q + Y +E
Sbjct: 19 PQTPDEFDDDYRISYSKLDKKYHLEAEDGSEWEYDDALKRWVPLVDDALLEQQREAYKVE 78
Query: 212 EMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVE 271
+ DE L +++ N NG+ + +KQ +
Sbjct: 79 GV--------------------DEHEPADLKQQQQRRLKRKNYT----NGE--EANKQTQ 112
Query: 272 KKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVD 331
+A +P +VNT VYVT +P D ++EE+ +VFSKCG+I E+ + ++PRIK+Y D
Sbjct: 113 APKAKRP-------RVNTAVYVTSIPLDASLEEVNDVFSKCGVIAEEIDRRRPRIKLYTD 165
Query: 332 KETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFI 386
E G KGDAL+ Y + SV LA Q+LD T FR P GK+ + F+ + E
Sbjct: 166 DE-GKFKGDALIVYFRPESVNLAIQMLDDTDFRFGETGPQGKMKVQAADFSFKAQKEAPA 224
Query: 387 AKQVDSKKK--KKLKKVEEKMLGWGGRDDAKLTIPAT-------VILRFMFTPAEMRADE 437
+ K+K +K +++ K+ W DD T+P + VIL+ MFT E+ D
Sbjct: 225 KPNMSEKRKIMRKTQRLNSKLADWD--DDEPSTVPPSSSRWDRVVILKHMFTLQELEEDP 282
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+++ D+++EC K+G V +V + + +GV VRF D + A C++ MNGR+F G +
Sbjct: 283 AAILDIKEDIRQECSKLGEVTNVVLYDKEEEGVASVRFSDVESANICVQKMNGRFFSGTR 342
Query: 498 IHASEDDG-----------------LVNHAAIRDLDA----EASRLEQFGAELE 530
+ A DG L N A D + EA RL++FG+ LE
Sbjct: 343 VVAYIADGSERFKKSSSKQAAAGTTLANVEAEEDSEQVDEEEAERLDKFGSWLE 396
>gi|70995402|ref|XP_752457.1| nuclear mRNA splicing factor-associated protein [Aspergillus
fumigatus Af293]
gi|66850092|gb|EAL90419.1| nuclear mRNA splicing factor-associated protein, putative
[Aspergillus fumigatus Af293]
gi|159131212|gb|EDP56325.1| nuclear mRNA splicing factor-associated protein, putative
[Aspergillus fumigatus A1163]
Length = 395
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 193/384 (50%), Gaps = 76/384 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG +++D L+ W+P D Q + Y I+ + N++V
Sbjct: 46 EDGQEFEYDTVLKRWIPTVDDDLLRQQQEAYKIQGVD-----------------ENEQVT 88
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
++L ++ S D ++NG++ P KQ +VNT V+VT +P
Sbjct: 89 AQQLRKKRKQQTSGD-----ENNGQK--PKKQ----------------RVNTAVFVTSIP 125
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D +E+ +VFSKCG+I E+ ++ +PRIK+Y+D E G KG+AL+ Y + SV LA Q+
Sbjct: 126 LDAEFDEIRDVFSKCGVIAEEIDSGRPRIKMYMD-ENGKFKGEALIVYFRPESVNLAIQM 184
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + KKK+ +K+ K+ W
Sbjct: 185 LDESDFRLGVSGPQG--PMKVQAADFSFKSQQEAPAKSSMRDKKKIIKRTQKLNSKLADW 242
Query: 409 GGRDDAKLT-----IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
+ + LT VIL+ MFT E+ D +++ D++EEC K+G V +V +
Sbjct: 243 SDDEPSILTDTSSRFEKVVILKHMFTLKEIDEDPAAILDIKDDIREECSKLGEVTNVVLY 302
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG------------LVNHAA 511
+ P GVV VRF D + A++C+ +M+GR+F G ++ A DG + A
Sbjct: 303 DKEPDGVVSVRFTDPEAARQCVRVMDGRFFAGTRVEAYITDGKEKFKKSNERRAALEDMA 362
Query: 512 IRDLDA----EASRLEQFGAELEA 531
R LDA E +RL++FG LE+
Sbjct: 363 ERGLDAEDEEENARLDEFGTWLES 386
>gi|147898592|ref|NP_001083090.1| HIV Tat-specific factor 1 [Xenopus laevis]
gi|54038341|gb|AAH84628.1| LOC398736 protein [Xenopus laevis]
Length = 452
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 180/351 (51%), Gaps = 34/351 (9%)
Query: 173 EGEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVN 226
+G+D +T DGT Y+WD +AW P+ ED + + YG++
Sbjct: 28 DGQDPYTFRDPSDGTEYEWDVEKKAWFPKITEDFLAMYHANYGVD-------------TG 74
Query: 227 VTDDLANDEVGKEK-LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWF-- 283
TD+ N + K K + + E E+K ++ +P + EK++A+ WF
Sbjct: 75 GTDNALNTDATKTKEMPVSSEAQEKKKPKEEQKLQTEQGEPKAKGEKRKADP---GWFNV 131
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
E + NT+VYVTGLP D T EE E+ SKCGII DP ++ +IK+Y D+E G KGD L
Sbjct: 132 EEQRNTNVYVTGLPPDTTNEEFAEIMSKCGIIMRDPHSEDYKIKLYKDRE-GNLKGDGLC 190
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER-FIAKQVDSKKKKKLKKVE 402
YLK SV LA +LLD R + V AKF+ KG K+ K +K ++
Sbjct: 191 CYLKRESVELALKLLDEYEIR---GYKLRVENAKFQLKGAYDATKKKKKCKDYRKKMSMQ 247
Query: 403 EKMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
+K L W D +++ VI++ MF P + D + +E+ D++ EC K G V
Sbjct: 248 QKQLDWRPEKKANDASRMRFERVVIIKNMFHPKDFEEDPLVLNEIREDLRSECEKFGQVK 307
Query: 459 SVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + + HP GV V FK+ + CI+ +NGRWFGGRQ+ DG+ ++
Sbjct: 308 KLLIFDQHPDGVASVAFKEANEGDLCIQALNGRWFGGRQLAVEPWDGVTDY 358
>gi|134076946|emb|CAK45355.1| unnamed protein product [Aspergillus niger]
gi|350629574|gb|EHA17947.1| hypothetical protein ASPNIDRAFT_38446 [Aspergillus niger ATCC 1015]
Length = 389
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 192/381 (50%), Gaps = 78/381 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
+DG Y++D L+ W+P + + Y ++ + N+E E+
Sbjct: 48 EDGQEYEYDTVLKRWIPTQ------EAYRVQGV-------------------NEE---EQ 79
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
+ + + + E+ NG++K P KQ +VNT VYVT +P D
Sbjct: 80 ITAAQLRKKRKQQATTEEGNGQQK-PKKQ----------------RVNTAVYVTSIPLDA 122
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
+E+ +VFSKCG+I E+ ++ +PRIK+Y D E+G KG+ALV Y + SV LA Q+LD
Sbjct: 123 EFDEIRDVFSKCGVIAEEIDSGRPRIKMYTD-ESGNFKGEALVVYFRPESVNLAIQMLDD 181
Query: 361 TPFR---PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGWGGRDD 413
+ FR P + PM V A F K ++ + + KKK+ +K+ K+ W DD
Sbjct: 182 SNFRIGVPGPQGPMRVQHADFSFKSQQEAPTKTSMRDKKKIMRRTQKLNSKLADWD--DD 239
Query: 414 AKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
+P T VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 240 DPSALPDTTSRFDKIVILKHMFTLKELDEDPAAILDIKEDIRDECSKLGDVTNVVLYDKE 299
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN-------HAAIRDL---- 515
P+GVV VRF D + A+ C+++M+GR+F G ++ A DG AA+ DL
Sbjct: 300 PEGVVSVRFSDPEAARSCVKMMDGRFFAGTRVDAYIADGSERFKKTNEKRAALEDLAEKG 359
Query: 516 -----DAEASRLEQFGAELEA 531
+ E RL++FG LE+
Sbjct: 360 LDASDEEEEQRLDEFGTWLES 380
>gi|115491489|ref|XP_001210372.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114197232|gb|EAU38932.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 371
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/277 (37%), Positives = 153/277 (55%), Gaps = 36/277 (12%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+VNT VYVT +P D EE+ VFSKCG+I E+ ++ +PRIK+Y+D E G KG+AL+ Y
Sbjct: 91 RVNTAVYVTSIPLDADFEEIQHVFSKCGVIAEEIDSGRPRIKMYMDDE-GKFKGEALIVY 149
Query: 346 LKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL-- 398
+ SV LA Q+LD + FR P G PM V A F K ++ + + K+K+
Sbjct: 150 FRPESVNLAIQMLDDSDFRLGVTGPQG--PMKVQAADFSFKSQKEAPTKTSMRDKRKIMQ 207
Query: 399 --KKVEEKMLGWGGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQE 449
+K+ K+ W DD +P T VIL+ MFT E+ D +++ D++E
Sbjct: 208 RTQKLNSKLADWD--DDEPSALPDTNSRFEKIVILKHMFTLKELDEDPAAILDIKEDIRE 265
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGL--- 506
EC K+G + +V + + P+GVV VRF D + A+ C+ +M+GR+F G ++ A DG
Sbjct: 266 ECSKLGEITNVVLYDKEPEGVVSVRFADPEAARNCVRVMDGRYFAGTRVEAYISDGSERF 325
Query: 507 ----VNHAAIRDL--------DAEASRLEQFGAELEA 531
AA+ DL D E RL++FG LE+
Sbjct: 326 KKTNEKRAALEDLAEKGFDAEDEEEQRLDEFGTWLES 362
>gi|430812485|emb|CCJ30102.1| unnamed protein product [Pneumocystis jirovecii]
Length = 376
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 107/258 (41%), Positives = 156/258 (60%), Gaps = 15/258 (5%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VY++ LP DVT +E+ E FSKCG+I E+ +T KPRIKIY++ E G KGDA+V + +
Sbjct: 118 NTAVYISNLPPDVTEDEIRERFSKCGVISENIDTGKPRIKIYMN-ERGEPKGDAMVVFFR 176
Query: 348 EPSVALATQLLDGTPFRPDGKI--PMSVTQAKFEQKGERF--IAKQVD--SKKKKKLKKV 401
E SV LA QLLD T R D K M V QA K ++ + + ++ K ++KLKK+
Sbjct: 177 EESVKLAIQLLDDTCLRVDDKNGQKMRVQQADVSYKKSQYNVVKRSMNEKRKLQQKLKKL 236
Query: 402 EEKMLGWGGRDDAKLT--IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDS 459
+K+ W A + +IL+ MFT E+ D L +L+ D+ EEC KIG V +
Sbjct: 237 SDKVSEWDDNSSAASSGRWSKVIILKHMFTLKELEDDITLIMDLKEDIWEECSKIGNVTN 296
Query: 460 VKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH--AAIRDLDA 517
V + + P+GVV VRF D + A C+++MNGR+FGG+ + A DG V + ++ + +D
Sbjct: 297 VVLFDLEPEGVVSVRFSDEESALACVKMMNGRYFGGKVVEAQIYDGKVRYRKSSNKTIDE 356
Query: 518 ----EASRLEQFGAELEA 531
E +RLE+FG LE+
Sbjct: 357 NDIDEQARLEKFGNWLES 374
>gi|225560023|gb|EEH08305.1| splicing factor U2AF-associated protein [Ajellomyces capsulatus
G186AR]
Length = 415
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 131/424 (30%), Positives = 196/424 (46%), Gaps = 105/424 (24%)
Query: 167 GVPASPEGEDEFTDD-----------------DGTRYKWDRGLRAWVPQEDTS---SQND 206
G P SP DEF DD DG+ + +D L+ WVP D + Q +
Sbjct: 17 GFPQSP---DEFDDDYRISYSKLENKYILEAEDGSEWYYDDALKRWVPMVDNALLEQQGE 73
Query: 207 GYGIE-----EMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNG 261
Y +E E K++ N T N EE S + H
Sbjct: 74 AYKVEGIDEHEPADFKQQRRLKRKNYT-------------NCEEENKQS------QAHKA 114
Query: 262 KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPET 321
K+ +VNT VYVT +P D T+EE+ VFSKCG+I E+ +
Sbjct: 115 KKS---------------------RVNTAVYVTSIPLDATIEEVNNVFSKCGVIAEEIDR 153
Query: 322 KKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQA 376
++PRIK+Y D E G KGDAL+ Y + SV LA Q+LD T FR PDGK+ +
Sbjct: 154 RRPRIKMYTDDE-GKFKGDALIVYFRPESVNLAIQMLDDTDFRFGETGPDGKMKVQTADY 212
Query: 377 KFEQKGERFIAKQVDSKKK--KKLKKVEEKMLGWGGRDDAKLTIPAT------VILRFMF 428
F+ + + ++ K+K KK +++ K+ W D+ + P++ VIL+ MF
Sbjct: 213 SFKVQKDAPTKPNMNEKRKIMKKTQRLNSKLADWDD-DEPSMAPPSSSRWDKVVILKHMF 271
Query: 429 TPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELM 488
T E+ D +++ D+++EC K+G V +V + + GV VRF D + A+ C++ M
Sbjct: 272 TLQELEEDPAAILDIKEDIRQECSKLGDVTNVVLYDKEEDGVASVRFADVESAKICVQKM 331
Query: 489 NGRWFGGRQIHASEDDG-------LVNHAA-----IRDLDA----------EASRLEQFG 526
+GR+F G ++ A DG HAA + +DA EA RL++FG
Sbjct: 332 DGRFFSGTRVAAYIADGSERFRKSSSRHAATAGTTLAGVDAEDDSDQADEEEAERLDKFG 391
Query: 527 AELE 530
+ LE
Sbjct: 392 SWLE 395
>gi|328787588|ref|XP_391935.4| PREDICTED: HIV Tat-specific factor 1 homolog [Apis mellifera]
Length = 429
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 205/416 (49%), Gaps = 53/416 (12%)
Query: 111 QSQSFSSAEQGVPSHVDDEFEKWQREVREAEIEAERLKNGSASDSVGGYMGRDDHDGVPA 170
QS ++ V D E +W V+ E + N + S + G G ++
Sbjct: 22 QSYIYNDPTNNVTYKFDQEKNEWV--VKNTEEINVKNSNDNESKTQEGVFGFENDTHTYT 79
Query: 171 SPEGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
P +DG+ Y WD+ AW P+ D D +M++
Sbjct: 80 DP--------NDGSVYFWDKEKNAWFPKVD----EDFMARYQMSY--------------- 112
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVN 288
G N E K V + + GK + K++E K ++ P +WFE+ N
Sbjct: 113 ------GFTDTNRIETKP-----VQQFQSQGKVDKEKKKMEAKRKSQEPPTWFEVDEAHN 161
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T +YV+GLP D+T++E+ E+F+KCG+I D E K +IK+Y D G KGDAL Y+K
Sbjct: 162 TAIYVSGLPLDITMDELTELFNKCGLIARD-EKGKDKIKLYKD-SNGQPKGDALCIYIKV 219
Query: 349 PSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF---IAKQVDSKKKKKLKKVEEKM 405
SV LA ++LD + R GK +SV +AKF+ KG+ + + + K K++ KK++EK+
Sbjct: 220 ESVDLALKILDKSQIR--GKT-LSVQRAKFQMKGDAYDPALKPKRKKKDKERQKKLQEKL 276
Query: 406 LGWGGRDDA--KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
W L VI++ +F+P + + +L E + D++ EC+K G V V +
Sbjct: 277 FDWRPERAPGEPLKHERVVIIKNLFSPEDFDKEVSLLLEYQQDIRSECLKCGDVRKVVIY 336
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA 519
+ HP+GV V F++ +AQ CI+L+NGRWF R+I A DG + I + DAE
Sbjct: 337 DRHPEGVAQVTFREPAEAQACIQLLNGRWFSQRKISAEIWDGKTKY-KILEPDAET 391
>gi|118089467|ref|XP_001233032.1| PREDICTED: HIV Tat-specific factor 1 homolog [Gallus gallus]
Length = 450
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 127/361 (35%), Positives = 179/361 (49%), Gaps = 36/361 (9%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKL 241
DG Y+WDR +AW P+ +T E+ T + DE
Sbjct: 40 DGAAYEWDREKKAWFPK--------------IT----EDFLATYHANYGFNADETDSSSA 81
Query: 242 NST--EEKVNSADNVVEEKHNGKRKQPDKQVE---KKEANKPPDSWFELK--VNTHVYVT 294
+ T E K+ + E + + K P KQ E K E K WF ++ NT+VYVT
Sbjct: 82 SGTASENKLPLSSKTSETQPSANEKGP-KQTEPKQKTEKRKLEPGWFHVEEDRNTNVYVT 140
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D+T +E V+V SK GII DP+T++ +IK+Y DKE G KGD L YLK SV LA
Sbjct: 141 GLPPDITKDEFVQVMSKGGIIMRDPQTEEHKIKLYKDKE-GNLKGDGLCCYLKRESVQLA 199
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDD- 413
+LLD R + V AKF+ KGE + A + K K KK+ ++ R +
Sbjct: 200 LRLLDEAEIR---GYKLHVEVAKFQLKGE-YDASKKKKKCKDYKKKLSQQQKQLDWRPEK 255
Query: 414 ----AKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG 469
A++ VI+R MF P + D + +E+ D++ EC K G V V + + HP G
Sbjct: 256 KDGAARMRHERIVIIRNMFHPKDFEEDPLVLNEIREDLRTECEKFGQVKKVLIFDRHPDG 315
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAEL 529
V V FK+ +A C +NGRWFGGRQ+ A DG+ ++ RL+ +G+ L
Sbjct: 316 VASVSFKEATEADLCKLTLNGRWFGGRQLSAETWDGVTDYQVEETAREREERLKVWGSFL 375
Query: 530 E 530
E
Sbjct: 376 E 376
>gi|53131339|emb|CAG31810.1| hypothetical protein RCJMB04_11k6 [Gallus gallus]
Length = 450
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 127/361 (35%), Positives = 179/361 (49%), Gaps = 36/361 (9%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKL 241
DG Y+WDR +AW P+ +T E+ T + DE
Sbjct: 40 DGAAYEWDREKKAWFPK--------------IT----EDFLATYHANYGFNADETDSSSA 81
Query: 242 NST--EEKVNSADNVVEEKHNGKRKQPDKQVE---KKEANKPPDSWFELK--VNTHVYVT 294
+ T E K+ + E + + K P KQ E K E K WF ++ NT+VYVT
Sbjct: 82 SGTASENKLPLSSKTSETQPSANEKGP-KQTEPKQKTEKRKLEPGWFHVEEDRNTNVYVT 140
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D+T +E V+V SK GII DP+T++ +IK+Y DKE G KGD L YLK SV LA
Sbjct: 141 GLPPDITKDEFVQVMSKGGIIMRDPQTEEHKIKLYKDKE-GNLKGDGLCCYLKRESVQLA 199
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDD- 413
+LLD R + V AKF+ KGE + A + K K KK+ ++ R +
Sbjct: 200 LRLLDEAEIR---GYKLHVEVAKFQLKGE-YDASKKKKKCKDYKKKLSQQQKQLDWRPEK 255
Query: 414 ----AKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG 469
A++ VI+R MF P + D + +E+ D++ EC K G V V + + HP G
Sbjct: 256 KDGAARMRHERIVIIRNMFHPKDFEEDPLVLNEIREDLRTECEKFGQVKKVLIFDRHPDG 315
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAEL 529
V V FK+ +A C +NGRWFGGRQ+ A DG+ ++ RL+ +G+ L
Sbjct: 316 VASVSFKEATEADLCKLTLNGRWFGGRQLSAETWDGVTDYQVEETAREREERLKVWGSFL 375
Query: 530 E 530
E
Sbjct: 376 E 376
>gi|261203563|ref|XP_002628995.1| nuclear mRNA splicing factor-associated protein [Ajellomyces
dermatitidis SLH14081]
gi|239586780|gb|EEQ69423.1| nuclear mRNA splicing factor-associated protein [Ajellomyces
dermatitidis SLH14081]
Length = 400
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 123/414 (29%), Positives = 196/414 (47%), Gaps = 90/414 (21%)
Query: 172 PEGEDEFTDD-----------------DGTRYKWDRGLRAWVPQEDTS---SQNDGYGIE 211
P+ DEF DD DG+ +++D L+ WVP D + Q + Y +E
Sbjct: 19 PQTPDEFDDDYRISYSKLDKKYHLEAEDGSEWEYDDALKRWVPLVDDALLEQQREAYKVE 78
Query: 212 EMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVE 271
G ++ + K + + + ++ +KQ +
Sbjct: 79 -------------------------GVDEHEPADLKQQQQRRLKRKNYPMAKRSANKQTQ 113
Query: 272 KKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVD 331
+A +P +VNT VYVT +P D ++EE+ +VFSKCG+I E+ + ++PRIK+Y D
Sbjct: 114 APKAKRP-------RVNTAVYVTSIPLDASLEEVNDVFSKCGVIAEEIDRRRPRIKLYTD 166
Query: 332 KETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFI 386
E G KGDAL+ Y + SV LA Q+LD T FR P GK+ + F+ + E
Sbjct: 167 DE-GKFKGDALIVYFRPESVNLAIQMLDDTDFRFGETGPQGKMKVQAADFSFKAQKEAPA 225
Query: 387 AKQVDSKKK--KKLKKVEEKMLGWGGRDDAKLTIPAT-------VILRFMFTPAEMRADE 437
+ K+K +K +++ K+ W DD T+P + VIL+ MFT E+ D
Sbjct: 226 KPNMSEKRKIMRKTQRLNSKLADWD--DDEPSTVPPSSSRWDRVVILKHMFTLQELEEDP 283
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+++ D+++EC K+G V +V + + +GV VRF D + A C++ M+GR+F G +
Sbjct: 284 AAILDIKEDIRQECSKLGEVTNVVLYDKEEEGVASVRFSDVESANICVQKMDGRFFSGTR 343
Query: 498 IHASEDDG-----------------LVNHAAIRDLDA----EASRLEQFGAELE 530
+ A DG L N A D + EA RL++FG+ LE
Sbjct: 344 VVAYIADGSERFKKSSSKQAAAGTTLANVEAEEDSEQVDEEEAERLDKFGSWLE 397
>gi|296816623|ref|XP_002848648.1| splicing factor u2af-associated protein 2 [Arthroderma otae CBS
113480]
gi|238839101|gb|EEQ28763.1| splicing factor u2af-associated protein 2 [Arthroderma otae CBS
113480]
Length = 389
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 123/383 (32%), Positives = 181/383 (47%), Gaps = 74/383 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
D+G + +D L+ W+P D S Q Y I+ + DE
Sbjct: 37 DNGEEFMYDDALKRWIPSLDESLMEQQRQAYKIQGV--------------------DEEE 76
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
L + +EK + KHN P ++ +K +VNT VYVT +P
Sbjct: 77 PANLKALQEKKK------KRKHNSDESHPPQKPKKP------------RVNTAVYVTAIP 118
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T+ E+ +FSKCG+I E+ ++ +PRIK+Y D + G KGDALV Y + SV LA Q+
Sbjct: 119 LDATLSEISSLFSKCGVIAEEIDSGQPRIKMYTD-DQGQFKGDALVVYFRPESVNLAVQM 177
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKK----KKLKKVEEKMLGW 408
LD T FR DG M V A F K + +V+ K+K +K +K+ K+ W
Sbjct: 178 LDDTDFRFGEKGTDGN--MRVQPADFSFKAVQDAPAKVNMKEKMKIMRKTQKLNNKLTDW 235
Query: 409 GGRDDAKLTIPAT---VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
D + T V+L+ MFT E+ D +++ D++EEC K+G V +V + +
Sbjct: 236 DDDDIGPRQLAKTGKVVVLKHMFTLQELEEDPAAILDIKEDIREECAKLGEVTNVVLYDK 295
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHA--------------- 510
GV VRF + AQ C+++MNGR+FGG ++ A +G V
Sbjct: 296 EESGVATVRFSEPDCAQACVQMMNGRFFGGTKVEAYIAEGKVRFKKSGASAAAALQDDGA 355
Query: 511 ---AIRDLDAEASRLEQFGAELE 530
A D EA RL++FGA LE
Sbjct: 356 GWEADNGKDDEAQRLDKFGAWLE 378
>gi|121701809|ref|XP_001269169.1| nuclear mRNA splicing factor-associated protein, putative
[Aspergillus clavatus NRRL 1]
gi|119397312|gb|EAW07743.1| nuclear mRNA splicing factor-associated protein, putative
[Aspergillus clavatus NRRL 1]
Length = 395
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 123/386 (31%), Positives = 188/386 (48%), Gaps = 80/386 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG Y +D L+ W+P D Q + Y I+ V+ T+ L ++
Sbjct: 46 EDGQEYVYDSVLKRWIPTVDDDLLQQQQEAYKIQ------------GVDETEQLTAQQLR 93
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
K++ K G + + Q KK+ +VNT VYVT +P
Sbjct: 94 KKR-----------------KQQGGADENNSQKPKKQ-----------RVNTAVYVTSIP 125
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D EE+ +VFSKCG+I E+ ++ +PRIK+Y+D E G KG+ALV Y + SV LA Q+
Sbjct: 126 LDAEFEEIRDVFSKCGVIAEEIDSGRPRIKMYMD-EDGKFKGEALVVYFRPESVNLAIQM 184
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + KKK+ +K+ K+ W
Sbjct: 185 LDDSDFRLGVAGPQG--PMKVQLADFSFKSQQEAPAKTSMRDKKKIIRRTQKLNSKLADW 242
Query: 409 GGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
DD +P T VIL+ MFT E+ D +++ D+++EC K+G V +V
Sbjct: 243 D--DDDPSMLPDTSSRFEKVVILKHMFTLEELEEDPAAILDIKQDIRDECSKLGEVTNVV 300
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGL-------VNHAAIRD 514
+ + P GV VRF D + A++C+ +M+GR+F G ++ A DG AA+ D
Sbjct: 301 LYDKEPDGVASVRFVDPEAARQCVRVMDGRFFAGTRVEAYVSDGKEKFRKTNEKRAALED 360
Query: 515 L---------DAEASRLEQFGAELEA 531
+ D E RL++FG LE+
Sbjct: 361 MAERGLDAEDDEENERLDEFGTWLES 386
>gi|350418587|ref|XP_003491906.1| PREDICTED: HIV Tat-specific factor 1 homolog [Bombus impatiens]
Length = 488
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 123/365 (33%), Positives = 197/365 (53%), Gaps = 51/365 (13%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTF-LKEEEVFPTVNVTDDLANDE 235
+TD +DG+ Y WD+ AW P+ D D +M++ + T V A +
Sbjct: 78 YTDPNDGSVYFWDKEKNAWFPKVD----EDFMARYQMSYGFTDASKVETNTVQQSQAQVK 133
Query: 236 VGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYV 293
+ KEK +K KRK A++PP +WFE+ NT +YV
Sbjct: 134 IDKEK----------------KKMEAKRK----------AHEPP-TWFEVDEAHNTAIYV 166
Query: 294 TGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVAL 353
+GLP D+T++E+ ++F+KCG+I D E K +IK+Y D G KGDAL Y+K SV L
Sbjct: 167 SGLPLDITMDELTKLFNKCGLIARD-EKGKDKIKLYKDT-NGQPKGDALCIYIKVESVDL 224
Query: 354 ATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF---IAKQVDSKKKKKLKKVEEKMLGWGG 410
A ++LD + R GK +SV +AKF+ KG+ + + + K K++ KK+++K+ W
Sbjct: 225 ALKILDKSQIR--GKT-LSVQRAKFQMKGDAYDPALKPKRKKKDKERQKKLQDKLFDW-- 279
Query: 411 RDDAKLTIPA----TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
R + L P V+++ +F+P + + +L E + D++ EC+K G V V + + H
Sbjct: 280 RPERVLGEPLKHERVVVIKNLFSPEDFDQEVSLLLEYQQDIRSECLKCGDVRKVVIYDRH 339
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA-SRLEQF 525
P+GV V F++ +AQ CI+L+NGRWF R+I A DG + + + DAE +R+ ++
Sbjct: 340 PEGVAQVTFREPAEAQACIQLLNGRWFSQRKISAEIWDGKTKY-KVTETDAEVEARIAKW 398
Query: 526 GAELE 530
LE
Sbjct: 399 DKYLE 403
>gi|213403862|ref|XP_002172703.1| splicing factor U2AF-associated protein [Schizosaccharomyces
japonicus yFS275]
gi|212000750|gb|EEB06410.1| splicing factor U2AF-associated protein [Schizosaccharomyces
japonicus yFS275]
Length = 366
Score = 161 bits (408), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 119/370 (32%), Positives = 183/370 (49%), Gaps = 41/370 (11%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
F D +G WD G +AW Q+ +N + + +K E ++ + +E
Sbjct: 16 FKDPNGAELYWDDGEKAWKDQDGAVLKNT-----DPSEVKSFEKLEAEHIQEQRQKNETR 70
Query: 238 KEKLNSTEEKVN--SADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTG 295
K K +S+ + +N + N EE + K+ANKPP + +N VYV G
Sbjct: 71 KRKNSSSSDTINPPCSPNRTEEVN-------------KDANKPP-----VVINKAVYVQG 112
Query: 296 LPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
LP +V+V+E+ E FSKCGII E+ +T PRIK+Y + + G KGDAL+ +L+ SV +A
Sbjct: 113 LPKNVSVKEVQETFSKCGIIAENIDTGTPRIKLYTN-DAGELKGDALIVFLRSESVDMAV 171
Query: 356 QLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVE----EKMLGWGGR 411
QLLD T + M V A + K E+ + + KKKLK+ EK+ W
Sbjct: 172 QLLDDTELHYGSGLRMHVQPASIDYKKEKTVRNALPENVKKKLKRRRQQQLEKLAEWDDT 231
Query: 412 DDA-----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
+ + V+L+ MFT E+ A L +L+ D+ E K G V +V + +
Sbjct: 232 ESPEAERLRKKWGKFVVLKHMFTLEEIEAAPELLIDLKEDITAEAEKCGEVTNVVLYDKE 291
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV------NHAAIRDLDAEAS 520
P G+V+VRF+ + A +C+ LM GR+F GR + A DG H + + E+
Sbjct: 292 PDGIVMVRFRTTEAADECVRLMEGRFFDGRVVEAFVYDGKTRYQKTGRHEELNGDENESE 351
Query: 521 RLEQFGAELE 530
RLE+F +E
Sbjct: 352 RLERFSEWIE 361
>gi|224097256|ref|XP_002189273.1| PREDICTED: HIV Tat-specific factor 1 homolog [Taeniopygia guttata]
Length = 451
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 126/361 (34%), Positives = 184/361 (50%), Gaps = 40/361 (11%)
Query: 165 HDGVPASPEGE-DEFT---DDDGTRYKWDRGLRAWVPQ--ED---TSSQNDGYGIEEMTF 215
++ PA EGE D FT DG Y+WD +AW P+ ED T N G+ +E
Sbjct: 20 YEAKPAGGEGESDPFTYVDPADGAAYEWDLEKKAWFPKITEDFLATYHANYGFHPDE--- 76
Query: 216 LKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEA 275
+ + K+ ++S E + N + +K P +++EK++
Sbjct: 77 ---------TDTSSASGTATESKQPVSSKESGTQPSANEMGQKQT----DPKQKLEKRKL 123
Query: 276 NKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
WF ++ NT+VYVTGLP D+T +E V+V SKCGII DP+T++ +IK+Y DKE
Sbjct: 124 EP---GWFHVEEDRNTNVYVTGLPPDITKDEFVQVMSKCGIIMRDPQTEEHKIKLYKDKE 180
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSK 393
G KGD L YLK SV LA +LLD R + V AKF+ KGE + A + K
Sbjct: 181 -GNLKGDGLCCYLKRESVQLALRLLDEAEIR---GYKLHVEVAKFQLKGE-YDASKKKKK 235
Query: 394 KKKKLKKVEEKMLGWGGRDDAK-----LTIPATVILRFMFTPAEMRADENLRSELEADVQ 448
K KK+ ++ R + K + +I+R MF P + D + +E+ D++
Sbjct: 236 CKDYKKKLSQQQKQLDWRPEKKDGATRMRHERIIIIRNMFHPKDFEEDPLVLNEIREDLR 295
Query: 449 EECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN 508
EC K G V V + + HP GV V FK+ +A C +NGRWFGGRQ+ A DG+ +
Sbjct: 296 TECEKFGQVKKVLIFDRHPDGVASVSFKEATEADLCKLTLNGRWFGGRQLSAETWDGVTD 355
Query: 509 H 509
+
Sbjct: 356 Y 356
>gi|315049877|ref|XP_003174313.1| splicing factor U2AF-associated protein 2 [Arthroderma gypseum CBS
118893]
gi|311342280|gb|EFR01483.1| splicing factor U2AF-associated protein 2 [Arthroderma gypseum CBS
118893]
Length = 388
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 125/383 (32%), Positives = 179/383 (46%), Gaps = 75/383 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
D G Y +D L+ W+P D + Q Y I+ + DE
Sbjct: 37 DHGEEYTYDDALKRWIPSLDEALLEQQRQAYKIQGV--------------------DEEE 76
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
L + ++K + KHN + P K KP +VNT VYVT +P
Sbjct: 77 PANLKALQDKKK------KRKHNDESNPPQK------PKKP-------RVNTAVYVTAIP 117
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D TV E+ +FSKCG+I E+ ++ KPRIK+Y D + G KGDALV Y + SV LA Q+
Sbjct: 118 LDATVSEISALFSKCGVIAEEIDSGKPRIKMYTD-DQGAFKGDALVVYFRPESVNLAIQM 176
Query: 358 LDGTPFR---PDGKIPMSVTQAKFEQKGERFIAKQVDSKKK----KKLKKVEEKMLGW-- 408
LD T FR + M V A F K + +V+ + K +K +K+ K+ W
Sbjct: 177 LDDTDFRFGEKGAQGNMRVQPADFSFKAVQEAPAKVNMRDKMKIIRKTQKLNNKLTDWDD 236
Query: 409 ---GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
G R K V+L+ MFT E+ D +++ D++EEC K+G V +V + +
Sbjct: 237 DDAGPRQSTKAG--KVVVLKHMFTLQELEEDPAAILDIKEDIREECSKLGEVTNVVLYDK 294
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHA--------------- 510
G+ VRF D + AQ C+++MNGR+FGG Q+ A +G V
Sbjct: 295 EESGIATVRFADPECAQACVQMMNGRFFGGTQVEAYVVEGKVRFKKSGASAAAALQDDGA 354
Query: 511 ---AIRDLDAEASRLEQFGAELE 530
A D EA RL++FGA LE
Sbjct: 355 GWEAETGKDEEAQRLDKFGAWLE 377
>gi|119495800|ref|XP_001264677.1| nuclear mRNA splicing factor-associated protein, putative
[Neosartorya fischeri NRRL 181]
gi|119412839|gb|EAW22780.1| nuclear mRNA splicing factor-associated protein, putative
[Neosartorya fischeri NRRL 181]
Length = 395
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 192/384 (50%), Gaps = 76/384 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG +++D L+ W+P D Q + Y I+ + N++V
Sbjct: 46 EDGQEFEYDTVLKRWIPTVDDDLLQQQQEAYKIQGVD-----------------ENEQVT 88
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
++L ++ D ++NG++ P KQ +VNT V+VT +P
Sbjct: 89 AQQLKKKRKQQTGGD-----ENNGQK--PKKQ----------------RVNTAVFVTSIP 125
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D +E+ +VFSKCG+I E+ ++ +PRIK+Y+D E G KG+AL+ Y + SV LA Q+
Sbjct: 126 LDAEFDEIRDVFSKCGVIAEEIDSGRPRIKMYMD-ENGKFKGEALIVYFRPESVNLAIQM 184
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + KKK+ +K+ K+ W
Sbjct: 185 LDESDFRLGVTGPQG--PMKVQAADFSFKSQQEAPAKTSMRDKKKIIKRTQKLNSKLADW 242
Query: 409 GGRDDAKLT-----IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
+ + LT VIL+ MFT E+ D +++ D++EEC K+G V +V +
Sbjct: 243 SDDEPSNLTDTSSRFEKVVILKHMFTLKEIEEDPAAILDIKDDIREECSKLGEVTNVVLY 302
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG------------LVNHAA 511
+ P GVV VRF D + A++C+ +M+GR+F G ++ A DG + A
Sbjct: 303 DKEPDGVVSVRFTDPEAARQCVRVMDGRFFAGTRVEAYITDGKEKFKKSNERRAALEDMA 362
Query: 512 IRDLDA----EASRLEQFGAELEA 531
R LDA E +RL++FG LE+
Sbjct: 363 ERGLDAEDEEENARLDEFGTWLES 386
>gi|345570528|gb|EGX53349.1| hypothetical protein AOL_s00006g215 [Arthrobotrys oligospora ATCC
24927]
Length = 385
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 182/368 (49%), Gaps = 49/368 (13%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DGT Y++D L+ W D S Q+ YG V+ TD +
Sbjct: 44 EDGTEYEFDEKLKRWYEVLDNSLAEEQSKAYG--------------PVDATDSVTPRNKK 89
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
++ + E +NV K NGK Q A P +VN VYVT LP
Sbjct: 90 RKNEHDNEANQPEINNVKRPKKNGKADQ---------AAAP-------RVNKAVYVTNLP 133
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T +E+ E+FSK G++ E+ ++ K R+K+Y D E+G KGDAL+ Y + SV LA Q+
Sbjct: 134 LDATAQEVEELFSKYGVLAEEIDSGKKRVKLYTD-ESGNPKGDALIVYFRPESVKLAIQM 192
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKK---KLKKVEEKMLGWG 409
LD T FR G+I + ++ + + K++ KKK K +K+ K+ W
Sbjct: 193 LDDTDFRLGVGDAGGRIKVQAADYSYKAQQDAPQDKKMTRDKKKIIAKSQKLNNKLADWD 252
Query: 410 GRDDAKLTIPA----TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
D A ++ + IL+ MFT E+ D +++ D++EEC K+G V +V + ++
Sbjct: 253 DDDPATVSTSSKFDKVAILKHMFTLKELEEDPAAILDIKEDIREECSKLGEVTNVTLYDD 312
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG--LVNHAAIRDLDA-EASRL 522
GVV V+F D A C+ +M+GR F G ++ A DG +RD DA E+SRL
Sbjct: 313 EEAGVVSVKFGDEASALACVRMMDGRHFSGAKVEAYIYDGQERFRKKKVRDTDAEESSRL 372
Query: 523 EQFGAELE 530
++FG LE
Sbjct: 373 DKFGNWLE 380
>gi|154287880|ref|XP_001544735.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150408376|gb|EDN03917.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 410
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 129/419 (30%), Positives = 200/419 (47%), Gaps = 99/419 (23%)
Query: 167 GVPASPEGEDEFTDD-----------------DGTRYKWDRGLRAWVPQEDTS---SQND 206
G P SP DEF DD DG+ + +D L+ W+P D + Q +
Sbjct: 17 GFPQSP---DEFDDDYRISYSKLENKYILEAEDGSEWYYDDALKRWIPMVDNALLEQQGE 73
Query: 207 GYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQP 266
Y +E + ++ AD +++ KRK
Sbjct: 74 AYKVEGI--------------------------------DEHEPAD--FKQQRRLKRKNY 99
Query: 267 DKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRI 326
E+ +A+K S +VNT VYVT +P D TVEE+ +VFSKCG+I E+ + ++PRI
Sbjct: 100 -TNCEESQAHKAKKS----RVNTAVYVTSIPLDATVEEVNDVFSKCGVIAEEIDRRRPRI 154
Query: 327 KIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQK 381
K+Y D E G KGDAL+ Y + SV LA Q+LD T FR PDGK+ + F+ +
Sbjct: 155 KMYTDDE-GKFKGDALIVYFRPESVNLAIQMLDDTDFRFGETGPDGKMKVQTADYSFKVQ 213
Query: 382 GERFIAKQVDSKKK--KKLKKVEEKMLGWGGRDDAKLTIPAT------VILRFMFTPAEM 433
+ ++ K+K KK +++ K+ W D+ + P++ VIL+ MFT E+
Sbjct: 214 KDAPTKPNLNEKRKIMKKTQRLNSKLADWDD-DEPSMAPPSSSRWEKVVILKHMFTLQEL 272
Query: 434 RADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
D +++ D+++EC K+G V +V + + GV VRF D + A+ C++ M+GR+F
Sbjct: 273 EEDPAAILDIKEDIRQECSKLGDVTNVVLYDKEEDGVASVRFADVESAKICVQKMDGRFF 332
Query: 494 GGRQIHASEDDG-------LVNHA-----AIRDLDA----------EASRLEQFGAELE 530
G ++ A DG HA + +DA EA RL++FG+ LE
Sbjct: 333 SGTRVAAYIADGSERFRKSSSRHATTAGTTLAGVDAEDDSDQVDEEEAERLDKFGSWLE 391
>gi|348557907|ref|XP_003464760.1| PREDICTED: LOW QUALITY PROTEIN: HIV Tat-specific factor 1 homolog
[Cavia porcellus]
Length = 752
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 171/335 (51%), Gaps = 38/335 (11%)
Query: 184 TRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
T Y+WD RAW P+ ED ++ YG T D A+
Sbjct: 48 TPYEWDLDKRAWFPKITEDFIATYQANYGF-----------------TSDGAS------- 83
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPD 298
STE +++ EE K +P ++ E K WF ++ NT+VYV+GLP
Sbjct: 84 -GSTEHVSDTSMRTAEEPPQRKTPEPTDPKKRGEKRKGDSGWFHVEEDRNTNVYVSGLPP 142
Query: 299 DVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLL 358
D+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LL
Sbjct: 143 DITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLL 201
Query: 359 DGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA--- 414
D R + V AKF+ KGE +K+ K K K +++K L W A
Sbjct: 202 DEDEIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAGPS 258
Query: 415 KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
++ VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V
Sbjct: 259 RMRHERVVIIKNMFHPEDFEDDPLVLNEIREDLRVECAKFGQIKKLLLFDRHPDGVASVS 318
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
F+D ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 319 FRDAEEADHCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|291408297|ref|XP_002720373.1| PREDICTED: HIV-1 Tat specific factor 1 isoform 2 [Oryctolagus
cuniculus]
Length = 752
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 170/332 (51%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND G +
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSND--GASSSTA 87
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ VN+ + EE K +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 88 NVQDVNT--RIAEEPPQRKTPEPTDPRKKGEKRKSESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIKKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAEAWDGTTDY 353
>gi|291408295|ref|XP_002720372.1| PREDICTED: HIV-1 Tat specific factor 1 isoform 1 [Oryctolagus
cuniculus]
Length = 770
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 170/332 (51%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND G +
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSND--GASSSTA 87
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ VN+ + EE K +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 88 NVQDVNT--RIAEEPPQRKTPEPTDPRKKGEKRKSESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIKKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAEAWDGTTDY 353
>gi|325090026|gb|EGC43336.1| nuclear mRNA splicing factor-associated protein [Ajellomyces
capsulatus H88]
Length = 414
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 176/378 (46%), Gaps = 85/378 (22%)
Query: 167 GVPASPEGEDEFTDD-----------------DGTRYKWDRGLRAWVPQEDTS---SQND 206
G P SP DEF DD DG+ + +D L+ WVP D + Q +
Sbjct: 17 GFPQSP---DEFDDDYRISYSKLENKYILEAEDGSEWYYDDALKRWVPMVDNALLEQQGE 73
Query: 207 GYGIE-----EMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNG 261
Y +E E K++ N T N EE S + H
Sbjct: 74 AYKVEGIDEHEPADFKQQRRLKRKNYT-------------NCEEENKQS------QAHKA 114
Query: 262 KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPET 321
K+ +VNT VYVT +P D T+EE+ VFSKCG+I E+ +
Sbjct: 115 KKS---------------------RVNTAVYVTSIPLDATIEEVNNVFSKCGVIAEEIDR 153
Query: 322 KKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQA 376
++PRIK+Y D E G KGDAL+ Y + SV LA Q+LD T FR PDGK+ +
Sbjct: 154 RRPRIKMYTDDE-GKFKGDALIVYFRPESVNLAIQMLDDTDFRLGETGPDGKMKVQTADY 212
Query: 377 KFEQKGERFIAKQVDSKKK--KKLKKVEEKMLGWGGRDDAKLTIPAT-------VILRFM 427
F+ + + ++ K+K KK +++ K+ W DD P + VIL+ M
Sbjct: 213 SFKVQKDAPTKPNMNEKRKIMKKTQRLNSKLADWD--DDEPSMAPQSSSRWDKVVILKHM 270
Query: 428 FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIEL 487
FT E+ D +++ D+++EC K+G V +V + + GV VRF D + A+ C++
Sbjct: 271 FTLQELEEDPAAILDIKEDIRQECSKLGDVTNVVLYDKEEDGVASVRFADVESAKICVQK 330
Query: 488 MNGRWFGGRQIHASEDDG 505
M+GR+F G ++ A DG
Sbjct: 331 MDGRFFSGTRVAAYIADG 348
>gi|449546586|gb|EMD37555.1| hypothetical protein CERSUDRAFT_73425 [Ceriporiopsis subvermispora
B]
Length = 414
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 116/346 (33%), Positives = 168/346 (48%), Gaps = 44/346 (12%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
F DDDG+ +WD WVP I+E K++ + V ++ V
Sbjct: 49 FEDDDGSEMEWDAAKGTWVPV-----------IDEDLLKKQQAAYSVAGVDEETPAAPVL 97
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
K +E E + G P + K+ N P + NT VYVTGLP
Sbjct: 98 KRTTKKRKEP---------EDYTGFTAGPSEPEAKRGKNGPSPAPERKSKNTAVYVTGLP 148
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T +E+ FS+CG+I+ED +T P+IK+Y +E G GDAL+ Y KE SV LA +
Sbjct: 149 SDTTPDELAARFSRCGLIEED-DTGAPKIKMYA-REDGTFSGDALIVYFKEDSVDLALSI 206
Query: 358 LDGTPFR-PDGKIPMSVTQAKFEQK-------GERFIAKQVDSKKK--KKLKKVEEKMLG 407
LD R + M V +A+F K GE + +D K+K K++ K++ K+
Sbjct: 207 LDDDELRLGESGTRMHVQRAQFGHKHEAGQGGGEMKPRRTMDDKRKATKRIGKMQRKLGE 266
Query: 408 WGGRDDAKLTI------------PATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
W D +I V+L+ MFT E+ D +L +L+ DV+EEC +G
Sbjct: 267 WDDEDGFGPSITEEDKRQQEARNSRVVVLKHMFTLKELEEDPSLLLDLKEDVREECETLG 326
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
V +V + + P GV+ V+F+D AQ CI M+GR+F GRQI AS
Sbjct: 327 DVTNVVLYDQEPDGVMTVKFRDPLSAQACIIKMHGRFFAGRQIQAS 372
>gi|298715665|emb|CBJ28191.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 635
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 143/252 (56%), Gaps = 11/252 (4%)
Query: 282 WFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDA 341
W + N VYV GLP D+ VEE+ E FSKCGII DP T++P+IKIY D E G KGD
Sbjct: 370 WNKKATNLWVYVKGLPVDIEVEEVREHFSKCGIIATDPLTQQPKIKIYKDDE-GHPKGDG 428
Query: 342 LVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAK---QVDSKKKKKL 398
V Y K SV +A +L RP + + V++A F+QKG+ F +V+ + K
Sbjct: 429 SVCYAKAESVEMAINVLHEGQLRPG--VTIEVSKAVFQQKGQSFDNTKRLKVNDARVKVA 486
Query: 399 KKVEEKMLGWGGRDDAKLTIPA--TVILRFMFTPAEMRADENLRSELEADVQEECVKIGP 456
+ E+ L W DD T V++ MF P++ A+E ELEAD+ E K+GP
Sbjct: 487 RAAAEQALSWAENDDTGATKGGLKIVVVENMFHPSDFEANERFGEELEADLLAEGEKLGP 546
Query: 457 VDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLD 516
V+ + V + +G V+V+F A +C+++ +GR+FGGR++ DG+ N+ I++ D
Sbjct: 547 VEKITVFAKNNKGPVVVKFGTAYAASECVKVFDGRFFGGRKLSCHFWDGVTNY-TIKEED 605
Query: 517 --AEASRLEQFG 526
EA RL+ FG
Sbjct: 606 EKEEAQRLDAFG 617
>gi|355757742|gb|EHH61267.1| hypothetical protein EGM_19235 [Macaca fascicularis]
Length = 754
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 169/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK + +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHAKTAEEPPQEKAS----EPTDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|297304889|ref|XP_001082302.2| PREDICTED: HIV Tat-specific factor 1 homolog isoform 1 [Macaca
mulatta]
gi|355705198|gb|EHH31123.1| hypothetical protein EGK_20983 [Macaca mulatta]
Length = 754
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 169/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK + +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKAS----EPTDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|301769097|ref|XP_002919964.1| PREDICTED: HIV Tat-specific factor 1 homolog [Ailuropoda
melanoleuca]
Length = 767
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 167/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
E +A EE K +P ++ E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 QEGSARTA----EEPPQRKTPEPSDPKKRGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVDLALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKRKCKDYKKKLSLQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGATDY 353
>gi|1667611|gb|AAB18823.1| Tat-SF1 [Homo sapiens]
Length = 754
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 167/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A EE K +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTA----EEPPQEKAPEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|358379829|gb|EHK17508.1| hypothetical protein TRIVIDRAFT_43268 [Trichoderma virens Gv29-8]
Length = 357
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 122/354 (34%), Positives = 179/354 (50%), Gaps = 72/354 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT +++D + WVP E+ ++D + L+E P A+D+
Sbjct: 32 DDGTEFEFDADQKKWVPTEEEPLEDD------LDDLREYSGTP--------ADDDAS--- 74
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
N KRKQ VE +A KPP + K NT VYVTGLP D
Sbjct: 75 -------------------NKKRKQ---GVENGDAQKPPP---QPKQNTAVYVTGLPADA 109
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TV+E+ ++FS K G+I E+ ++ PRIK+Y D E G KGDAL+ + K SV +A LLD
Sbjct: 110 TVDEVHDLFSRKGGVIAEEIDSGAPRIKMYNDSE-GNFKGDALIVFFKPQSVDMAIMLLD 168
Query: 360 GTPFR------PDGKIPM-----SVTQAKFEQKGERFIAKQVDSKKK-----------KK 397
T FR +G+I + S + +++Q+G A + +K+ KK
Sbjct: 169 DTNFRFLPSGTTEGRIRVQAADSSYKKTQYDQEGGAGEASNGNGQKRQRNERDRQKIIKK 228
Query: 398 LKKVEEKMLGWGGRDDAKLTIPA------TVILRFMFTPAEMRADENLRSELEADVQEEC 451
+K++ K+ W D +P+ TVILR MFT E+ D E++ D+++EC
Sbjct: 229 TQKLDAKLADWDDDDPYASLVPSNSKRDKTVILRHMFTLQELEEDPAALLEIKEDIRDEC 288
Query: 452 VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
K+G V SV + + P GVV V+FK + A CI+LM+GR FGG + AS +G
Sbjct: 289 SKLGAVTSVVLYDLEPDGVVSVKFKSTESAAACIKLMHGRSFGGSTVEASLSEG 342
>gi|327286675|ref|XP_003228055.1| PREDICTED: HIV Tat-specific factor 1-like [Anolis carolinensis]
Length = 533
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 145/255 (56%), Gaps = 12/255 (4%)
Query: 262 KRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDP 319
K Q Q +K E KP WF + + NT+VYVTGLP D+T +E VEV SKCGII D
Sbjct: 190 KASQLTDQKQKGEKRKPDAGWFHVEEQKNTNVYVTGLPPDITKDEFVEVMSKCGIIMRDL 249
Query: 320 ETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
+T++P+IK+Y D+E G KGD L YLK SV LA +LLD R + V A+F+
Sbjct: 250 QTEEPKIKLYKDRE-GNLKGDGLCCYLKRESVELALKLLDENEIR---GYKLHVEVAQFQ 305
Query: 380 QKGERFIAKQVDSKKKKKLKKV--EEKMLGW---GGRDDAKLTIPATVILRFMFTPAEMR 434
KGE + A + K K KK+ ++K+L W ++ +I+R MF P +
Sbjct: 306 LKGE-YDASKKRKKCKDYKKKLSQQQKLLDWRPEKKEGSVRMRHERVIIIRNMFHPKDFE 364
Query: 435 ADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFG 494
D + +E+ D++ EC K G V V + + HP GV V FK+ ++A C + ++GRWFG
Sbjct: 365 EDPLVLNEIREDLRSECEKFGQVKKVIIFDRHPDGVASVSFKEAEEADVCKQALDGRWFG 424
Query: 495 GRQIHASEDDGLVNH 509
GRQ+ DG+ ++
Sbjct: 425 GRQLSVETWDGVTDY 439
>gi|194387946|dbj|BAG61386.1| unnamed protein product [Homo sapiens]
Length = 721
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKA----PEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|326431900|gb|EGD77470.1| hypothetical protein PTSG_08565 [Salpingoeca sp. ATCC 50818]
Length = 418
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/251 (41%), Positives = 141/251 (56%), Gaps = 15/251 (5%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
K N HVYVTGLP D E E KCG++ ED E P+I++Y D E G KGD TY
Sbjct: 165 KKNPHVYVTGLPLDTDEVEFYEFMKKCGVMAEDEEGA-PKIQLYRD-ENGNVKGDGKCTY 222
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKG-ERFIAKQ-----VDSK---KKK 396
L+ SV LA QLLDGT +R DG I + + +AKF ++ ER Q DSK KK
Sbjct: 223 LRVESVTLALQLLDGTDYR-DGNI-VHLERAKFRKRDIERKKKMQKEKDGSDSKGGGKKA 280
Query: 397 KLKK--VEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
K+ K + K+LGW D + +ILR MF P+E D L EL+ D+++EC K
Sbjct: 281 KVNKKTLSHKLLGWHDTDLKRKKAVGVLILRHMFHPSEFTEDPMLLVELKRDIEKECKKF 340
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
GP+ +++ + +P+GVV+VR++D I +NGRWFGG+QI A DG
Sbjct: 341 GPIKKLEIFDRNPEGVVMVRYQDEAALGPAIATLNGRWFGGQQIVAEPWDGREKFKVEET 400
Query: 515 LDAEASRLEQF 525
+ + RL+Q+
Sbjct: 401 EEEKEQRLKQW 411
>gi|397482309|ref|XP_003812372.1| PREDICTED: HIV Tat-specific factor 1 [Pan paniscus]
gi|410211494|gb|JAA02966.1| HIV-1 Tat specific factor 1 [Pan troglodytes]
gi|410261782|gb|JAA18857.1| HIV-1 Tat specific factor 1 [Pan troglodytes]
gi|410304122|gb|JAA30661.1| HIV-1 Tat specific factor 1 [Pan troglodytes]
gi|410333195|gb|JAA35544.1| HIV-1 Tat specific factor 1 [Pan troglodytes]
Length = 755
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKA----PEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|21361437|ref|NP_055315.2| HIV Tat-specific factor 1 [Homo sapiens]
gi|253970456|ref|NP_001156752.1| HIV Tat-specific factor 1 [Homo sapiens]
gi|74754346|sp|O43719.1|HTSF1_HUMAN RecName: Full=HIV Tat-specific factor 1; Short=Tat-SF1
gi|14602772|gb|AAH09896.1| HIV-1 Tat specific factor 1 [Homo sapiens]
gi|30582611|gb|AAP35532.1| HIV TAT specific factor 1 [Homo sapiens]
gi|61362248|gb|AAX42186.1| HIV TAT specific factor 1 [synthetic construct]
gi|119608874|gb|EAW88468.1| HIV-1 Tat specific factor 1, isoform CRA_a [Homo sapiens]
gi|119608875|gb|EAW88469.1| HIV-1 Tat specific factor 1, isoform CRA_a [Homo sapiens]
gi|157929022|gb|ABW03796.1| HIV-1 Tat specific factor 1 [synthetic construct]
Length = 755
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKA----PEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|410989518|ref|XP_004001008.1| PREDICTED: HIV Tat-specific factor 1 homolog [Felis catus]
Length = 780
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 171/332 (51%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N+
Sbjct: 62 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANA 103
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
E +A++ + K + P K+ EK++A WF ++ NT+VYV+GLP D+T
Sbjct: 104 QEATARTAEDPPQRK-TPEPSDPKKRGEKRKAES---GWFHVEEDRNTNVYVSGLPPDIT 159
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 160 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVDLALKLLDED 218
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V A F+ KGE +K+ K K K +++K L W A ++
Sbjct: 219 EIR---GYKLHVEVATFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERTAGPSRMR 275
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 276 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECAKFGQIRKLLLFDRHPDGVASVSFRD 335
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 336 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 367
>gi|367052485|ref|XP_003656621.1| hypothetical protein THITE_2121511 [Thielavia terrestris NRRL 8126]
gi|347003886|gb|AEO70285.1| hypothetical protein THITE_2121511 [Thielavia terrestris NRRL 8126]
Length = 442
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/384 (32%), Positives = 184/384 (47%), Gaps = 84/384 (21%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
DDGT Y++D LR W+P D + Q GY + P+ D ND
Sbjct: 37 DDGTEYEFDPELRRWIPMIDEALIEEQQKGY------------IMPSA----DADNDSDR 80
Query: 238 KEKLNSTEEKVNSADNVVEEKHNG---KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVT 294
+ + + K+++ DN+ + +N R + K ++ N+ P + K NT VYVT
Sbjct: 81 RPPPHGKKRKMDNTDNLEDNNNNNNHNSRDRSSKNARRQHGNRGPP---QPKQNTAVYVT 137
Query: 295 GLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVAL 353
G+P D TVEE+ E+FS KCG+I E+ ++ +PRIK+Y D E G KGDAL+ + K SV +
Sbjct: 138 GIPLDATVEEVAELFSRKCGVIAEEIDSGRPRIKMYTDAE-GNFKGDALIVFFKPQSVDM 196
Query: 354 ATQLLDGTPFR-PDGKIP----MSVTQAKFEQKGERFI---------------------- 386
A LLD T FR P P M V A K ++
Sbjct: 197 AIMLLDDTDFRFPPPGAPSQPKMRVMAADSSYKKTKYDSGGGGDSSSSSHQNNTPSATST 256
Query: 387 --------------AKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA------------ 420
+ Q +K +K +K+ K+ W + + L PA
Sbjct: 257 KPGESSRPPGDGRRSDQDKAKIIRKTQKLSAKLADWSDDEPSALHDPADRNSSSAAPKGG 316
Query: 421 ----TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
VILR MFTP E+RAD +++ DV+EEC K+GPV +V + + +G+V VRF+
Sbjct: 317 RWDRVVILRHMFTPEELRADPAALLDIKEDVREECEKLGPVTNVVLYDEEEEGIVSVRFR 376
Query: 477 DRKDAQKCIELMNGRWFGGRQIHA 500
R+ A+ C+ +M+GR FGGR I A
Sbjct: 377 TREAAEACLRVMHGRAFGGRIIEA 400
>gi|426397566|ref|XP_004064985.1| PREDICTED: HIV Tat-specific factor 1 [Gorilla gorilla gorilla]
Length = 755
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKA----PEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|30584025|gb|AAP36261.1| Homo sapiens HIV TAT specific factor 1 [synthetic construct]
gi|60653905|gb|AAX29645.1| HIV TAT specific factor 1 [synthetic construct]
gi|60653907|gb|AAX29646.1| HIV TAT specific factor 1 [synthetic construct]
Length = 756
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKA----PEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|390352112|ref|XP_003727819.1| PREDICTED: HIV Tat-specific factor 1 homolog isoform 2
[Strongylocentrotus purpuratus]
gi|390352114|ref|XP_001198057.2| PREDICTED: HIV Tat-specific factor 1 homolog isoform 1
[Strongylocentrotus purpuratus]
Length = 464
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/364 (34%), Positives = 193/364 (53%), Gaps = 32/364 (8%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQED---TSSQNDGYGI--EEMTFLKEEEVFPTVNVTDDL 231
+TD +DGT Y+WD +AW P+ D +S YG+ +E T + + P+
Sbjct: 28 YTDPNDGTMYEWDPEKKAWFPKVDENFIASYQASYGVTTDENTADSQPDPQPSAAAATSA 87
Query: 232 ANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVE----------KKEANKPPDS 281
+ ST ++ +E G + D+Q + K++A + S
Sbjct: 88 SASASSATPGTSTGVDKDTTPGGQDEAKKGGEVKIDEQPQMLYRYQKMQMKRKAEQK--S 145
Query: 282 WFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
WFE+ NT+VYV+GLP D+T+EE EV +K GII + ET+KP+IK+Y+D+E G KG
Sbjct: 146 WFEVDPTKNTNVYVSGLPTDITLEEFQEVMAKWGIIMLEEETEKPKIKLYMDEE-GRPKG 204
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE-RFIAKQVDSKKKKKL 398
D YLK SV LA QLLD + R KI + V A F KG+ + ++ KKK
Sbjct: 205 DGRCCYLKRESVDLALQLLDESEIR-GHKIHVEV--ATFTLKGDYKPDMRKKKKPNKKKK 261
Query: 399 KKVEEKMLGWGGRDDAKLTI----PATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
V++K+L W R + K +IL+ +F P+E D L ++++ DV++EC
Sbjct: 262 GNVQDKLLDW--RPEKKFQQRKRNEQVIILKHVFHPSEFEEDAMLINDIKDDVKDECSTY 319
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
G V V + + HP GVV V+FKD ++A KCI+ +NGRWF R++ + DG ++ I +
Sbjct: 320 GEVKKVLIFDRHPDGVVSVKFKDVENADKCIQSLNGRWFAKRKLEVTHWDGKTDY-KIEE 378
Query: 515 LDAE 518
D E
Sbjct: 379 TDKE 382
>gi|296236526|ref|XP_002763352.1| PREDICTED: HIV Tat-specific factor 1 homolog [Callithrix jacchus]
Length = 744
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 171/332 (51%), Gaps = 31/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N+
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANA 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ ++ + P K+ EK++A WF ++ NT+VYV+GLP D+T
Sbjct: 90 QDVHARTAEEQPPQEKAPEPTDPRKKGEKRKAES---GWFHVEEDRNTNVYVSGLPPDIT 146
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 147 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 205
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 206 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 262
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 263 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 322
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 323 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 354
>gi|403300078|ref|XP_003940786.1| PREDICTED: HIV Tat-specific factor 1 homolog [Saimiri boliviensis
boliviensis]
Length = 743
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 167/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTTNV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A EE K +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 QDVHARTA----EEPPQEKAPEPTDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|197102050|ref|NP_001125576.1| HIV Tat-specific factor 1 homolog [Pongo abelii]
gi|75070769|sp|Q5RB63.1|HTSF1_PONAB RecName: Full=HIV Tat-specific factor 1 homolog
gi|55728510|emb|CAH90997.1| hypothetical protein [Pongo abelii]
Length = 754
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAEEPPQEKA----PEPTDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|403264917|ref|XP_003924710.1| PREDICTED: HIV Tat-specific factor 1 homolog [Saimiri boliviensis
boliviensis]
Length = 792
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND V N
Sbjct: 96 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGVSSSTANV 137
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 138 QDVHARTAEEPTQEKA----PEPTDSGKKGEKRKAESGWFHVEEYRNTNVYVSGLPPDIT 193
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
++E +++ SK GII DP+T++ +IK+Y D + G KGD L YLK SV LA +LLD
Sbjct: 194 LDEFIKLMSKFGIIMRDPQTEEFKIKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 252
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDAK---LT 417
R + V AKF+ KGE +K+ K K K +++K L W A +
Sbjct: 253 EIR---GHKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSWMR 309
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 310 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFQD 369
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 370 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 401
>gi|332246954|ref|XP_003272619.1| PREDICTED: HIV Tat-specific factor 1 homolog isoform 1 [Nomascus
leucogenys]
Length = 754
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 167/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTTNV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 EDVHARTAAEPPQEKA----PEPTDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|225679638|gb|EEH17922.1| U2 snRNP-associated protein Uap2 [Paracoccidioides brasiliensis
Pb03]
Length = 428
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/278 (35%), Positives = 157/278 (56%), Gaps = 34/278 (12%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+VNT VYVT +P D +++E+ +VFSKCG+I E+ + +KPRIK+Y D E G KGDALV Y
Sbjct: 151 RVNTAVYVTSIPLDASMDEVNDVFSKCGVIAEEIDRRKPRIKMYTDDE-GKFKGDALVVY 209
Query: 346 LKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK- 399
+ SV LA Q+LD T FR P+GK+ + F+ + E + + + K+K +K
Sbjct: 210 FRPESVNLAIQMLDDTDFRFGETGPEGKMKVQAADFSFKAQKEAPVKQNMSEKRKIMMKT 269
Query: 400 -KVEEKMLGWGGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEEC 451
++ K+ W DD T+P + VIL+ MFT E+ D +++ D+++EC
Sbjct: 270 QRLNSKLADWD--DDEPSTVPTSSSRWDKVVILKHMFTLQELEEDPAAILDIKEDIRQEC 327
Query: 452 VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG------ 505
K+G V +V + + +GV VRF D + A C++ M+GR+F G ++ A DG
Sbjct: 328 SKLGDVTNVVLYDREEEGVASVRFSDAESANTCVQKMDGRFFSGTRVVAYIADGSERFRK 387
Query: 506 --LVNHAAIRDL---------DAEASRLEQFGAELEAD 532
N AA+ + + EA RL++FG+ LE++
Sbjct: 388 SSAKNPAALAAVDAEEAEDASEEEAERLDKFGSWLESE 425
>gi|320169913|gb|EFW46812.1| HIV TAT specific factor 1 [Capsaspora owczarzaki ATCC 30864]
Length = 570
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 129/422 (30%), Positives = 198/422 (46%), Gaps = 86/422 (20%)
Query: 182 DGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND-EVG 237
DGT Y+WD +AW P+ D + S YG E + T D E G
Sbjct: 106 DGTIYEWDGRAKAWFPRLDEAFIYSYQLSYGGNENAAAAAPSTTTEASTTRRTRADLESG 165
Query: 238 KEKLNSTEEKVNSADNVVEEKHN-GKRKQPDKQVEKKEANKPPDSWFELKV--------- 287
+++ + N ++ E H KR + ++ ++ E KPP + +
Sbjct: 166 GQQVATGNSAPNGSEMDAEAAHREYKRMRMEQHDDEDELVKPPAAAGPGRPGKGAPKAAA 225
Query: 288 -------------NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKET 334
N++VYVTGLP DV EE E S+CGII+ D T + ++K+Y D
Sbjct: 226 ASKSQEGDAPAFKNSNVYVTGLPYDVDAEEFAEYMSQCGIIRFDMNTNELKVKVYTDA-N 284
Query: 335 GMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK------------- 381
G+ KGDAL +Y + SV LA LDG+ FR GK P+ V AKF K
Sbjct: 285 GVPKGDALCSYNRAESVQLALDFLDGSLFR--GKHPIHVEAAKFSHKESSKEASKGGSAS 342
Query: 382 -----------------GERFIAKQV-----------DSKKKKKLK----KVEEKMLGWG 409
G + Q ++K+ K++K KV +K+ GW
Sbjct: 343 KGAKAAVAGAKGGAKEAGTSVQSDQAPPPQHHGPVINNAKQLKQVKARRQKVMQKLYGW- 401
Query: 410 GRDDAKLTI-----PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
D + + +TVI+ +F+P E + N +EL+ D++ EC K G V V + +
Sbjct: 402 ---DEFVPMENKRGASTVIISNVFSPEEFDDNPNELNELKEDMESECGKCGVVRKVTIFD 458
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA-SRLE 523
+P GVV V+F +DA KC+ELMNGRWFGGR++ A++ DG ++ + + DA+ +RL+
Sbjct: 459 RNPLGVVSVKFDLPEDAGKCVELMNGRWFGGRRLEAAKWDGRTSY-KVEETDAQREARLQ 517
Query: 524 QF 525
++
Sbjct: 518 KW 519
>gi|47221801|emb|CAG08855.1| unnamed protein product [Tetraodon nigroviridis]
Length = 383
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 119/342 (34%), Positives = 179/342 (52%), Gaps = 22/342 (6%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAN 233
+TD +DGT Y WD +AW P+ ED ++ YG + + + + D +
Sbjct: 34 YTDPEDGTVYDWDHEKKAWFPKITEDFLAAYQANYGFTQEGGSDSKSAAVSSSEPADPDS 93
Query: 234 DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKK-EANKPPDSWFELK--VNTH 290
++ EK N S + E P K++++K E K WF + NT+
Sbjct: 94 NKTPPEKQNPGRPTQTSNADPPE--------APSKEIKQKGEKRKAEPGWFSIDEDKNTN 145
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
VYV+GLP D++ EE E+ SKCGI+ DP T++ ++K+Y DKE G KGD L YLK+ S
Sbjct: 146 VYVSGLPPDISTEEFAELMSKCGIVMRDPMTEEYKVKLYKDKE-GNLKGDGLCCYLKKES 204
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER-FIAKQVDSKKKKKLKKVEEKMLGWG 409
VALA +L+D + R + V A+FE KG+ K+ SK KK + ++K L W
Sbjct: 205 VALAIRLIDESEVR---GYRLHVEAARFELKGQYDATKKKKKSKDYKKRLQQQQKQLDWR 261
Query: 410 --GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
+ D ++ VI+R MF P++ D + +E D++ EC K G V V + + HP
Sbjct: 262 PEKKGDNRMRHEKVVIIRNMFHPSDFEEDPLVLNEYREDLRVECEKFGVVKKVILFDRHP 321
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
GV V FK+ ++A C + NGRWFGGRQ+ DG ++
Sbjct: 322 DGVASVAFKEPEEADACFQSFNGRWFGGRQLSVEFWDGKTDY 363
>gi|345807273|ref|XP_538183.3| PREDICTED: HIV Tat-specific factor 1 homolog [Canis lupus
familiaris]
Length = 768
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 168/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +N+ N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNEGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
E +A EE K +P+ ++ E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 QEVSARTA----EEPPQRKTPEPNDPKKRGEKRKAEPGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVDLALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|340371085|ref|XP_003384076.1| PREDICTED: hypothetical protein LOC100639094 [Amphimedon
queenslandica]
Length = 692
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 131/366 (35%), Positives = 191/366 (52%), Gaps = 40/366 (10%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLAN 233
+TD DGT Y+WD RAW P+ D + + + YG F + EE T ++ A+
Sbjct: 27 YTDPTDGTVYEWDPEKRAWFPKIDNTFLANYHASYG-----FYQPEEEKETEKKVEETAS 81
Query: 234 DEVGKEKLNSTEEKVNSADNVVEEKHNGKRK-QPDKQVEKKEANKPPDSWFEL--KVNTH 290
S VEE+ GK + + +KK+ N+P SWF++ NT+
Sbjct: 82 ---------------ASKTETVEERGGGKTDGTSEPEPKKKKVNEP--SWFDIDESHNTN 124
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
VYV+GLP D+T+EE E +KCGII ED + P++K+Y D + G KGD L YLK S
Sbjct: 125 VYVSGLPLDITLEEFAEHMTKCGIIMED-DDGDPKVKLYHDSD-GQLKGDGLCCYLKIES 182
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA-KQVDSKKKKKLKKVEEKMLGWG 409
V LA LLD + R GK +SV +A+F+ KGE + ++ KKK +E++L W
Sbjct: 183 VQLALDLLDESEIR--GK-KLSVKRAQFQMKGEYNPSLRKKKQNKKKAKGSKQERLLDWK 239
Query: 410 ---GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
G K VI + +F P E D L +++ D++EEC K G V V V + H
Sbjct: 240 ERKGDGKPKFKHQRIVIFKHLFDPKEFEVDPTLITDIRDDLREECSKFGEVKKVLVFDRH 299
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS-RLEQF 525
GV V FK+ + A+ I MNGR++ GRQ+ DG+ N+ I + D E RL+Q+
Sbjct: 300 VDGVASVAFKEFEPAEAAITAMNGRYYAGRQLEVFLWDGVTNY-QIEETDKERELRLKQW 358
Query: 526 GAELEA 531
L++
Sbjct: 359 EEYLQS 364
>gi|340520994|gb|EGR51229.1| predicted protein [Trichoderma reesei QM6a]
Length = 355
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 173/351 (49%), Gaps = 76/351 (21%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT +++D L+ WVP E+ ++ E+ L+E P D + K++
Sbjct: 32 DDGTEFEFDAELKKWVPTEEEPLED------ELNDLREYSGTPA-------EGDALHKKR 78
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
+ E NG K P + K NT VYVTGLP D
Sbjct: 79 KHDEE--------------NGDEKPPPQP----------------KQNTAVYVTGLPADA 108
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TVEE+ ++FS K G+I E+ ++ KPRIK+Y D E G KGDAL+ + K SV +A LLD
Sbjct: 109 TVEEVHDLFSRKAGVIAEEIDSGKPRIKMYTDAE-GNFKGDALIVFFKPESVNMAIMLLD 167
Query: 360 GTPFR------PDGKIPMSVTQAKFEQ--------------KGERFIAKQVDSKKK-KKL 398
T FR +G+I + + +++ G++ + D +K KK
Sbjct: 168 DTEFRFLPNGSTEGRIRVQAADSSYKKTQYDQENTGEASNGNGQKRQRNEKDRQKIIKKT 227
Query: 399 KKVEEKMLGWGGRDDAKLTIPA--------TVILRFMFTPAEMRADENLRSELEADVQEE 450
+K+ K+ W DD P TVILR MFT E+ D E++ D+++E
Sbjct: 228 QKLAAKLADWD--DDEPYLAPVQPNSKRDRTVILRHMFTLQELEEDPAALLEIKEDIRDE 285
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
C K+GPV SV + + P+GVV VRFK + A CI+LM+GR FGG +HA+
Sbjct: 286 CSKLGPVTSVVLYDLEPEGVVSVRFKTAEAAAACIKLMHGRSFGGNIVHAT 336
>gi|431891353|gb|ELK02229.1| HIV Tat-specific factor 1 like protein [Pteropus alecto]
Length = 786
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 170/334 (50%), Gaps = 32/334 (9%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKL 241
D T Y+WD +AW P+ +T E+ T +ND
Sbjct: 45 DDTPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGAS---- 82
Query: 242 NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDD 299
+ST + + +EE K +P ++ E K WF ++ NT+VYV+GLP D
Sbjct: 83 SSTASVQDVSARPIEEPPQRKTPEPTDPKKRGEKRKADSGWFHVEEDRNTNVYVSGLPPD 142
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 143 ITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVDLALKLLD 201
Query: 360 GTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---K 415
R + V AKF+ KGE +K+ K K K +++K L W A +
Sbjct: 202 EDEIR---GYRLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAGPSR 258
Query: 416 LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
+ VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F
Sbjct: 259 MRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSF 318
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+D ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 319 RDPEEADYCIQTLDGRWFGGRQITAQVWDGTTDY 352
>gi|62088186|dbj|BAD92540.1| HIV TAT specific factor 1 variant [Homo sapiens]
Length = 758
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 166/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 51 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 92
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A EE K +P +K E K W ++ NT+VYV+GLP D+T
Sbjct: 93 EDVHARTA----EEPPQEKAPEPTDARKKGEKRKAESGWVHVEEDRNTNVYVSGLPPDIT 148
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 149 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 207
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 208 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 264
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 265 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRD 324
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 325 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 356
>gi|427785711|gb|JAA58307.1| Putative transcription elongation factor tat-sf1 [Rhipicephalus
pulchellus]
Length = 419
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 132/378 (34%), Positives = 194/378 (51%), Gaps = 50/378 (13%)
Query: 174 GEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNV 227
G D +T +DGT Y+WD AW P+ ED ++ YG E T P+
Sbjct: 23 GSDPYTYVDPNDGTVYEWDHDKHAWFPKLTEDFLAAYQANYGTAEGT--------PS--- 71
Query: 228 TDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL-- 285
+ A++ G ++ + EK + EE P K+ K + ++P WFE+
Sbjct: 72 --EGASNTSGSSQMAAIPEKTAPPVHNPEE-------NPSKKAAKGKQSEP--GWFEIDD 120
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
NT VYV+ LPDD+ EE E+ SKCG++ +D E + +IK+Y + G KGDAL Y
Sbjct: 121 AHNTRVYVSNLPDDIDEEEFFELMSKCGLVMKD-EKGQFKIKLYRTAD-GHLKGDALCCY 178
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKG-----ERFIAKQVDSKKKKKLKK 400
+K SV LA ++LDG R D +I V +A+F+ KG ++ K+ SK K+++KK
Sbjct: 179 IKVESVELALRILDGYRLR-DKEI--RVERARFQLKGAYDPTKKPKKKKQASKDKERIKK 235
Query: 401 VEEKMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
EK+ W G RD + TV+L+ MF E +D L E + D++EEC +
Sbjct: 236 KIEKLFDWRPEKLRGMRDKHE----CTVVLKNMFDTKEFESDPTLILEYQKDLREECGQF 291
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
G V V V + HP+GV V FK+ ++A CI MNGRWF RQ+ A DG +
Sbjct: 292 GEVKKVVVYDRHPEGVATVTFKEPEEADACISRMNGRWFAQRQLSAETWDGRTRYKIFET 351
Query: 515 LDAEASRLEQFGAELEAD 532
+ RL+++ LEAD
Sbjct: 352 EEELEERLKKWDDYLEAD 369
>gi|12847222|dbj|BAB27483.1| unnamed protein product [Mus musculus]
Length = 561
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 174/337 (51%), Gaps = 38/337 (11%)
Query: 182 DGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
D T Y+WD +AW P+ ED ++ YG F + T NV D AN + +
Sbjct: 47 DDTPYEWDLDKKAWFPKITEDFIATYQANYG-----FSSDGASSSTANVQD--ANTKAVE 99
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGL 296
E V E + KRK ++ E WF ++ NT+VYV+GL
Sbjct: 100 E----------PPQKEVPETPDSKRKGEKRKAES--------GWFHVEEDRNTNVYVSGL 141
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK+ SV LA +
Sbjct: 142 PPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKD-DQGNLKGDGLCCYLKKESVELALK 200
Query: 357 LLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA- 414
LLD R + V AKF+ KGE +K+ K K K +++K L W A
Sbjct: 201 LLDEDEIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAG 257
Query: 415 --KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
+L VIL+ MF P + D + +E+ D++ EC K G + + + + HP GV
Sbjct: 258 PNRLRHERVVILKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVAS 317
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V F++ ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 318 VSFREPEEADHCIQTLDGRWFGGRQITAQAWDGTTDY 354
>gi|351695595|gb|EHA98513.1| HIV Tat-specific factor 1-like protein [Heterocephalus glaber]
Length = 728
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 170/332 (51%), Gaps = 35/332 (10%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +D +
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFCSDGASSSTAHV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ + +A ++ + GK +P + +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 PDTSMRTAKDLPQ----GKTPEPTEPRKKGEKRK---GWFHVEEDRNTNVYVSGLPPDIT 142
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 143 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 201
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K V++K L W A ++
Sbjct: 202 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSVQQKQLDWRPERRAGPSRMR 258
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VIL+ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 259 HERVVILKNMFHPTDFEDDPLVLNEIRDDLRVECSKFGQIKKLLLFDRHPDGVASVSFRD 318
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 319 PEEADHCIQTLDGRWFGGRQITAQPWDGTTDY 350
>gi|326474220|gb|EGD98229.1| nuclear mRNA splicing factor-associated protein [Trichophyton
tonsurans CBS 112818]
Length = 298
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/275 (38%), Positives = 149/275 (54%), Gaps = 33/275 (12%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+VNT VYVT +P D TV E+ +FSKCG+I E+ ++ KPRIK+Y D E G KGDALV Y
Sbjct: 15 RVNTAVYVTTIPLDATVSEISSLFSKCGVIAEEIDSGKPRIKMYTD-EQGTFKGDALVVY 73
Query: 346 LKEPSVALATQLLDGTPFRPDGKIP---MSVTQAKFEQKGERFIAKQVDSKKK----KKL 398
+ SV LA Q+LD T FR K M V A F K + + + + K +K
Sbjct: 74 FRPESVNLAIQMLDDTDFRFGEKGAEGNMRVQPADFSFKAVQETPAKANMRDKMKIIRKT 133
Query: 399 KKVEEKMLGW-----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVK 453
+K+ K+ W G R K V+L+ MFT E+ D +++ D++EEC K
Sbjct: 134 QKLNIKLTDWDDDDVGPRHSGKAG--KVVVLKHMFTLQELEEDPAAILDIKEDIREECSK 191
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH---- 509
+G V +V + + G+ VRF D + AQ C+++MNGR+FGG Q+ A +G V
Sbjct: 192 LGEVTNVVLYDKEESGIATVRFSDPECAQACVQMMNGRFFGGTQVEAYVVEGKVRFKKSG 251
Query: 510 ----AAIRD----------LDAEASRLEQFGAELE 530
AA++D D EA RL++FGA LE
Sbjct: 252 ASAAAALQDDGAGWEAEAGKDDEAQRLDKFGAWLE 286
>gi|344297715|ref|XP_003420542.1| PREDICTED: HIV Tat-specific factor 1 homolog [Loxodonta africana]
Length = 743
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 167/332 (50%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 89
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ SA N E K +P + +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 90 QD---VSAGNSGEPPQR-KPPEPTEPKKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERKAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIKKLILFDRHPDGVASVSFRD 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIHTLDGRWFGGRQITAEAWDGTTDY 353
>gi|335306576|ref|XP_003360505.1| PREDICTED: HIV Tat-specific factor 1 homolog [Sus scrofa]
Length = 760
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 170/332 (51%), Gaps = 32/332 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND G +
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSND--GASSSAA 87
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ + V++ EE K +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 88 SVQDVSA--RTTEEPPQRKTPEPSDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVDLALKLLDED 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F++
Sbjct: 262 HERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRE 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 322 PEEADYCIQTLDGRWFGGRQITAEAWDGTTDY 353
>gi|23956212|ref|NP_082518.1| HIV Tat-specific factor 1 homolog [Mus musculus]
gi|321267528|ref|NP_083647.1| HIV Tat-specific factor 1 homolog [Mus musculus]
gi|81913100|sp|Q8BGC0.1|HTSF1_MOUSE RecName: Full=HIV Tat-specific factor 1 homolog
gi|22902415|gb|AAH37711.1| HIV TAT specific factor 1 [Mus musculus]
gi|26340228|dbj|BAC33777.1| unnamed protein product [Mus musculus]
gi|148710215|gb|EDL42161.1| HIV TAT specific factor 1, isoform CRA_a [Mus musculus]
Length = 757
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 174/337 (51%), Gaps = 38/337 (11%)
Query: 182 DGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
D T Y+WD +AW P+ ED ++ YG F + T NV D AN + +
Sbjct: 47 DDTPYEWDLDKKAWFPKITEDFIATYQANYG-----FSSDGASSSTANVQD--ANTKAVE 99
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGL 296
E V E + KRK ++ E WF ++ NT+VYV+GL
Sbjct: 100 EPPQKE----------VPETPDSKRKGEKRKAES--------GWFHVEEDRNTNVYVSGL 141
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK+ SV LA +
Sbjct: 142 PPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKD-DQGNLKGDGLCCYLKKESVELALK 200
Query: 357 LLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA- 414
LLD R + V AKF+ KGE +K+ K K K +++K L W A
Sbjct: 201 LLDEDEIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAG 257
Query: 415 --KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
+L VIL+ MF P + D + +E+ D++ EC K G + + + + HP GV
Sbjct: 258 PNRLRHERVVILKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVAS 317
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V F++ ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 318 VSFREPEEADHCIQTLDGRWFGGRQITAQAWDGTTDY 354
>gi|148710216|gb|EDL42162.1| HIV TAT specific factor 1, isoform CRA_b [Mus musculus]
Length = 793
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 174/337 (51%), Gaps = 38/337 (11%)
Query: 182 DGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
D T Y+WD +AW P+ ED ++ YG F + T NV D AN + +
Sbjct: 83 DDTPYEWDLDKKAWFPKITEDFIATYQANYG-----FSSDGASSSTANVQD--ANTKAVE 135
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGL 296
E V E + KRK ++ E WF ++ NT+VYV+GL
Sbjct: 136 EPPQKE----------VPETPDSKRKGEKRKAES--------GWFHVEEDRNTNVYVSGL 177
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK+ SV LA +
Sbjct: 178 PPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKD-DQGNLKGDGLCCYLKKESVELALK 236
Query: 357 LLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA- 414
LLD R + V AKF+ KGE +K+ K K K +++K L W A
Sbjct: 237 LLDEDEIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAG 293
Query: 415 --KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
+L VIL+ MF P + D + +E+ D++ EC K G + + + + HP GV
Sbjct: 294 PNRLRHERVVILKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVAS 353
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V F++ ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 354 VSFREPEEADHCIQTLDGRWFGGRQITAQAWDGTTDY 390
>gi|383851570|ref|XP_003701305.1| PREDICTED: HIV Tat-specific factor 1 homolog [Megachile rotundata]
Length = 459
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 193/366 (52%), Gaps = 49/366 (13%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
+TD +DG+ Y WD+ AW P+ D +D +M++
Sbjct: 78 YTDPNDGSVYFWDKEKNAWFPKVD----DDFMARYQMSY--------------------- 112
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVT 294
T+ E + K ++ K+VE K + P +WFE+ NT +YV+
Sbjct: 113 -----GFTDSSAGETKPAPEPEVKTKVEKEKKKVEAKRKAQDPPTWFEVDEAHNTAIYVS 167
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D+T++E+ E+F+KCG+I D E K +IK+Y D G KGDAL Y+K SV LA
Sbjct: 168 GLPLDITLDELTELFNKCGLIARD-EKGKDKIKLYRDS-NGQPKGDALCIYIKVESVDLA 225
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERF---IAKQVDSKKKKKLKKVEEKMLGWGGR 411
++LD + R GK +SV +AKF+ KG+ + + + K K++ KK+ EK+ W R
Sbjct: 226 LKILDKSQIR--GKT-LSVQRAKFQMKGDAYDPALKPKRKKKDKERQKKIHEKLFDW--R 280
Query: 412 DDAKLTIPA----TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
+ L PA VI++ +F+P + + L E + D++ EC+K G V V + + HP
Sbjct: 281 PERPLGEPAKHERVVIIKNLFSPEDFDKEVALLLEYQQDIRSECLKCGDVRKVIIYDRHP 340
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA-SRLEQFG 526
+GV V F++ +AQ C++L+NGRWF R+I A DG + + + DAE +R+ ++
Sbjct: 341 EGVAQVIFREPAEAQACVQLLNGRWFSQRKISAEIWDGKTKY-KVTETDAEIEARIAKWD 399
Query: 527 AELEAD 532
LE +
Sbjct: 400 KFLEGE 405
>gi|432895861|ref|XP_004076198.1| PREDICTED: HIV Tat-specific factor 1-like [Oryzias latipes]
Length = 454
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 181/370 (48%), Gaps = 22/370 (5%)
Query: 173 EGEDEFT---DDDGTRYKWDRGLRAWVPQ---EDTSSQNDGYGIEEMTFLKEEEVFPTVN 226
+G D FT DGT Y WD +AW P+ + ++ YG F KE + P N
Sbjct: 27 DGSDPFTYTDPADGTVYDWDPEKKAWFPKVTDDFIAAYQANYG-----FTKEGD--PDTN 79
Query: 227 VTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK 286
++D E + EK S + + + ++ +K E K WFE+
Sbjct: 80 AGAVSSSDPPAPEPNSKAPEKAKSETSAAKPAPDQTEATTNQAKQKGEKRKADPGWFEID 139
Query: 287 --VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
NT+VYV+GLP D+T EE V + SK GI+ DP T++ ++K+Y DK+ G KGD L
Sbjct: 140 EGKNTNVYVSGLPPDITPEEFVGLMSKYGIVMRDPITEEYKVKLYKDKD-GNLKGDGLCC 198
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEK 404
Y K+ S+ LA +LLD R + V A+FE KG+ +K+ K K K +++
Sbjct: 199 YFKKESLELAVRLLDEYEVR---GYKLHVEAAQFELKGQYDASKKKKKSKDYKKKLQQQQ 255
Query: 405 MLG-WG--GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
W + D + VI+R MF P++ D + +E D++ EC K G V V
Sbjct: 256 KQLDWRPEKKGDVRKRHEKVVIIRNMFHPSDFEEDPLVLNEYREDLRTECEKFGGVKKVV 315
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASR 521
+ + HP GV V FK+ ++A C+ NGRWFGGRQ+ A DG ++ R
Sbjct: 316 LFDRHPDGVASVAFKEPEEADACVLSFNGRWFGGRQLIAHLWDGTTDYQVEETTREREER 375
Query: 522 LEQFGAELEA 531
L+ + A LE
Sbjct: 376 LKGWSAFLEG 385
>gi|296481686|tpg|DAA23801.1| TPA: HIV-1 Tat specific factor 1-like [Bos taurus]
Length = 766
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 163/331 (49%), Gaps = 32/331 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND NS
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGAS----NS 85
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
T + + EE + P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 86 TASVQDVSARTAEEPPQRQPPDPSDLKKKGEKRKTESGWFHVEDDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 VDEFIQLMSKFGIIMRDPQTGDFKVKLYKD-DQGNLKGDGLCCYLKRESVDLALKLLDEG 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYRLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERQAGSSRMY 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+
Sbjct: 262 HERVVIIKNMFHPVDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRS 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVN 508
++A CI+ +NGRWFGGRQI A DG +
Sbjct: 322 PEEADYCIQTLNGRWFGGRQITAQAWDGTTD 352
>gi|426257498|ref|XP_004022363.1| PREDICTED: HIV Tat-specific factor 1 homolog [Ovis aries]
Length = 774
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 139/244 (56%), Gaps = 10/244 (4%)
Query: 272 KKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
K E K WF ++ NT+VYV+GLP D+TV+E +++ SK GII DP+T++ ++K+Y
Sbjct: 114 KGEKRKAESGWFHVEEGRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLY 173
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
D + G KGD L YLK SV LA +LLD R + V AKF+ KGE +K+
Sbjct: 174 KDNQ-GNLKGDGLCCYLKRESVDLALKLLDEDEIR---GYKLHVEVAKFQLKGEYDASKK 229
Query: 390 VDSKKKKKLK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEA 445
K K K +++K L W A ++ VI++ MF P + D + +E+
Sbjct: 230 KKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIRE 289
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D++ EC K G + + + + HP GV V FKD ++A CI+ +NGRWFGGRQI A DG
Sbjct: 290 DLRVECSKFGQIRKLLLFDRHPDGVASVSFKDPEEADYCIQTLNGRWFGGRQITAQAWDG 349
Query: 506 LVNH 509
++
Sbjct: 350 TTDY 353
>gi|115618688|ref|XP_793195.2| PREDICTED: HIV Tat-specific factor 1 homolog [Strongylocentrotus
purpuratus]
Length = 462
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 123/359 (34%), Positives = 187/359 (52%), Gaps = 41/359 (11%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQED---TSSQNDGYGI--EEMTFLKEEEVFP-------- 223
+TD +DGT Y+WD +AW P+ D +S YG+ +E T + + P
Sbjct: 28 YTDPNDGTMYEWDPEKKAWFPKVDENFIASYQASYGVTTDENTADGQTDPQPSAAAATSA 87
Query: 224 -----TVNVTDDLANDEV-GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANK 277
T + + D G + E +V D + + ++ Q ++ E
Sbjct: 88 SASSGTPGTSAGVGEDTTPGGKDGAKGEGEVKKLDEQPQMLYRYQKMQMKRKAEP----- 142
Query: 278 PPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
WFE+ NT+VYV+GLP D+T+EE EV +K GII + ET+KP+IK+Y+D+E G
Sbjct: 143 ---GWFEVDPTKNTNVYVSGLPTDITLEEFQEVMAKWGIIMLEEETEKPKIKLYMDEE-G 198
Query: 336 MKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE-RFIAKQVDSKK 394
KGD YLK SV LA QLLD + R KI + V A F KG+ + ++
Sbjct: 199 RPKGDGRCCYLKRESVDLALQLLDESEIRGH-KIHVEV--ATFTLKGDYKPDMRKKKKPN 255
Query: 395 KKKLKKVEEKMLGWGGRDDAKLTI----PATVILRFMFTPAEMRADENLRSELEADVQEE 450
KKK V++K+L W R + K +IL+ +F P+E D L ++++ DV++E
Sbjct: 256 KKKKGNVQDKLLDW--RPEKKFQQRKRNEQVIILKHVFHPSEFEEDAMLINDIKDDVKDE 313
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
C G V V + + HP GVV V+FKD ++A KCI+ +NGRWF R++ + DG ++
Sbjct: 314 CSTYGEVKKVLIFDRHPDGVVSVKFKDVENADKCIQSLNGRWFAKRKLEVTHWDGKTDY 372
>gi|328772225|gb|EGF82264.1| hypothetical protein BATDEDRAFT_23720 [Batrachochytrium
dendrobatidis JAM81]
Length = 389
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 178/347 (51%), Gaps = 49/347 (14%)
Query: 178 FTDDDGTRYKWDRGLRAWVP---QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
+T DDG+ Y +D R W P +E +Q Y V P+ + D A++
Sbjct: 35 YTGDDGS-YTYDPTTRQWFPFFDEELIKAQQSAYYTN---------VIPSCVM--DPADE 82
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRK---QPDKQVEKKEANKPPDSWFEL----KV 287
+ + N + ++E N KRK QP + +++K KP S + +V
Sbjct: 83 PESQPEFN----------HHLDEHANEKRKSGTQPTQNLKRK---KPKHSTLDAQPKKRV 129
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVT LP D TV E+ ++FSKCGII EDP T P+IK+YVD + G KGDALV + K
Sbjct: 130 NTAVYVTHLPHDTTVTELHDLFSKCGIILEDPSTNIPKIKLYVDDQ-GCFKGDALVIFFK 188
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK--GE-------RFIAKQVDSKKKKKL 398
E SV LA QL+D T FR I + V +A F++K GE + + + ++ +KK
Sbjct: 189 EESVELAVQLMDDTLFRAGDGIKIRVQKAVFQEKPAGEAHSDEKGKSGSGKPNTNEKKPT 248
Query: 399 KKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENL----RSELEADVQEECVKI 454
+ + +KM + T R RA +++ S L+ DV+EEC K+
Sbjct: 249 QTIRQKMERQVLTHSSPHQSIETNSNRVHIYQQGCRAKKDVYIKRTSNLKQDVREECEKL 308
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
G V +V + + G++ VRFKD + A C+ MNGR+FGG+ I AS
Sbjct: 309 GQVTNVVLYDLEEDGIMTVRFKDVESAAACVLKMNGRFFGGQTIEAS 355
>gi|330340363|ref|NP_001193343.1| HIV Tat-specific factor 1 [Bos taurus]
Length = 773
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 139/244 (56%), Gaps = 10/244 (4%)
Query: 272 KKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
K E K WF ++ NT+VYV+GLP D+TV+E +++ SK GII DP+T++ ++K+Y
Sbjct: 114 KGEKRKAESGWFHVEEGRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLY 173
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
D + G KGD L YLK SV LA +LLD R + V AKF+ KGE +K+
Sbjct: 174 KDNQ-GNLKGDGLCCYLKRESVDLALKLLDEDEIR---GYKLHVEVAKFQLKGEYDASKK 229
Query: 390 VDSKKKKKLK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEA 445
K K K +++K L W A ++ VI++ MF P + D + +E+
Sbjct: 230 KKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIRE 289
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D++ EC K G + + + + HP GV V F+D ++A CI+ +NGRWFGGRQI A DG
Sbjct: 290 DLRVECSKFGQIRKLLLFDRHPDGVASVSFRDPEEADYCIQTLNGRWFGGRQITAQAWDG 349
Query: 506 LVNH 509
++
Sbjct: 350 TTDY 353
>gi|156053139|ref|XP_001592496.1| hypothetical protein SS1G_06737 [Sclerotinia sclerotiorum 1980]
gi|154704515|gb|EDO04254.1| hypothetical protein SS1G_06737 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 381
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 180/354 (50%), Gaps = 65/354 (18%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT +++D ++ W+P D + L+E++ V+ D+ ++ V K
Sbjct: 41 DDGTEFEFDDAIKRWIPVLDEA------------LLEEQQKAYKVSGVDE--SEPVDAMK 86
Query: 241 LNSTEEKVNSADNVVEEKHNGKR--KQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPD 298
+E VN D G R K P K+A P L+ NT VYVTGLP
Sbjct: 87 RKRKKEYVNGEDE-------GGRIVKAP------KKAKAP----LPLRANTAVYVTGLPS 129
Query: 299 DVTVEEMVEVFSK-CGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
DVTVEE+ EVFSK CG+I E+ ++ KPRIK+Y D E G KGDAL+ + K PSV +A L
Sbjct: 130 DVTVEEVHEVFSKKCGVIAEEIDSGKPRIKLYTD-EKGEFKGDALIVFFKPPSVQMAIML 188
Query: 358 LDGTPFRPDG-----KIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK------------- 399
LD T FR + +I M V A+ K +Q D++ K+K K
Sbjct: 189 LDDTDFRIESGASKERIKMRVQAAEASYKK----VQQTDAEGKEKEKPKTSMKDKQKIIK 244
Query: 400 ---KVEEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADENLRSELEADVQEEC 451
K++ ++ W + + L ++ VIL+ MFT E+ D +++ D+++EC
Sbjct: 245 KTQKLDARLADWSDDEPSALVETSSRWDKVVILKHMFTLKELEEDPAAMLDIKEDIRDEC 304
Query: 452 VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
K+G V +V + + GV VRF + + A+ C+ LMNGR F G+++ A DG
Sbjct: 305 GKLGEVTNVVLYDLEEDGVASVRFANAESAKACVRLMNGRKFDGQEVEAYITDG 358
>gi|440907708|gb|ELR57819.1| HIV Tat-specific factor 1 [Bos grunniens mutus]
Length = 766
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 110/331 (33%), Positives = 163/331 (49%), Gaps = 32/331 (9%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND NS
Sbjct: 48 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGAS----NS 85
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
T + + EE + P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 86 TASVQDVSARTAEEPPQRQPPDPSDPKKKGEKRKTESGWFHVEDDRNTNVYVSGLPPDIT 145
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
+E +++ SK GII DP+T + ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 146 ADEFIQLMSKFGIIMRDPQTGEFKVKLYKD-DQGNLKGDGLCCYLKRESVDLALKLLDEG 204
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 205 EIR---GYRLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERQAGSSRMH 261
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+
Sbjct: 262 HERVVIIKNMFHPVDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRS 321
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVN 508
++A CI+ +NGRWFGGRQI A DG +
Sbjct: 322 PEEADYCIQTLNGRWFGGRQITAQAWDGTTD 352
>gi|410915234|ref|XP_003971092.1| PREDICTED: HIV Tat-specific factor 1 homolog [Takifugu rubripes]
Length = 439
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 110/339 (32%), Positives = 170/339 (50%), Gaps = 16/339 (4%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
+TD +DGT Y WD +AW P+ D + + +E N +++
Sbjct: 34 YTDPEDGTVYDWDHDKKAWFPK----ITEDFLAAYQANYGFRQEGGSDSNSAAVSSSEPT 89
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVT 294
+ EK D + + + +K E K WF + NT+VYV+
Sbjct: 90 VPNSHKPSPEKQKPGDPIRTPNADQPETAAKEVKQKAEKRKAEPGWFSIDEDKNTNVYVS 149
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D++ +E E+ SKCGI+ DP T++ ++K+Y DKE G KGD L YLK+ SV LA
Sbjct: 150 GLPPDISTDEFAELMSKCGIVMRDPMTEEYKVKLYKDKE-GNLKGDGLCCYLKKESVPLA 208
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDA 414
+L+D + R + V A+FE KG+ + A + K K+ K+++++ R +
Sbjct: 209 IRLIDESEVR---GYRLHVEAARFELKGQ-YDASKKKKKNKEYKKRLQQQQKQLDWRPEK 264
Query: 415 K----LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
K + VI+R MF P++ D + +E D++ EC K G V V + + HP GV
Sbjct: 265 KGENRMRHEKVVIIRNMFHPSDFEEDPLVLNEYREDLRVECEKFGAVKKVILFDRHPDGV 324
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V FK+ +A CI+ NGRWFGGRQ+ A DG ++
Sbjct: 325 ASVAFKEPDEADACIQSFNGRWFGGRQLSAQFWDGKTDY 363
>gi|86438143|gb|AAI12691.1| HTATSF1 protein [Bos taurus]
Length = 524
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 139/244 (56%), Gaps = 10/244 (4%)
Query: 272 KKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
K E K WF ++ NT+VYV+GLP D+TV+E +++ SK GII DP+T++ ++K+Y
Sbjct: 114 KGEKRKAESGWFHVEEGRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLY 173
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
D + G KGD L YLK SV LA +LLD R + V AKF+ KGE +K+
Sbjct: 174 KDNQ-GNLKGDGLCCYLKRESVDLALKLLDEDEIR---GYKLHVEVAKFQLKGEYDASKK 229
Query: 390 VDSKKKKKLK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEA 445
K K K +++K L W A ++ VI++ MF P + D + +E+
Sbjct: 230 KKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIRE 289
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D++ EC K G + + + + HP GV V F+D ++A CI+ +NGRWFGGRQI A DG
Sbjct: 290 DLRVECSKFGQIRKLLLFDRHPDGVASVSFRDPEEADYCIQTLNGRWFGGRQITAQAWDG 349
Query: 506 LVNH 509
++
Sbjct: 350 TTDY 353
>gi|395860822|ref|XP_003802703.1| PREDICTED: HIV Tat-specific factor 1 homolog [Otolemur garnettii]
Length = 767
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 138/244 (56%), Gaps = 10/244 (4%)
Query: 272 KKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
K E K WF+++ NT+VYV+GLP D+TV+E +++ SK GII DP+T++ ++K+Y
Sbjct: 114 KGEKRKAESGWFQVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLY 173
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
D + G KGD L YLK SV LA +LLD R + V AKF+ KG +K+
Sbjct: 174 KDNQ-GNLKGDGLCCYLKRESVELALKLLDEDEIR---GYKLHVEVAKFQLKGTYDASKK 229
Query: 390 VDSKKKKKLK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEA 445
K K K +++K L W A ++ VI++ MF P + D + +E+
Sbjct: 230 KKKCKDYKKKLSLQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIRE 289
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D++ EC K G + + + + HP GV V F+D ++A CI +NGRWFGGRQI A DG
Sbjct: 290 DLRVECSKFGQIRKLLLFDRHPDGVASVSFRDAEEADYCIHTLNGRWFGGRQITAEAWDG 349
Query: 506 LVNH 509
++
Sbjct: 350 TTDY 353
>gi|407926473|gb|EKG19440.1| hypothetical protein MPH_03303 [Macrophomina phaseolina MS6]
Length = 397
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 196/392 (50%), Gaps = 48/392 (12%)
Query: 166 DGVPAS-PEGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPT 224
D +PA+ P ++F DD R W R W+ +E T+ + + +++
Sbjct: 3 DSLPANFPTAVEDFETDD--RISWSRADNKWILEE-TNGNEWEFDVALKRWVQ------- 52
Query: 225 VNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFE 284
V DL + ++ +E +++ EE + KRK PD + + K A K E
Sbjct: 53 -TVDKDLLEQQSKAYRIPGVDE----SEDAQEEIKSKKRKHPDSEQDGKPAKKK-----E 102
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
L+ NT VYVT LP D T EE+ EVFS+ G+I E +T++ R+++Y DK+ G G+ALV
Sbjct: 103 LQ-NTAVYVTSLPLDATKEEVTEVFSRYGLIAESIDTEEKRVRLYTDKD-GNFTGEALVV 160
Query: 345 YLKEPSVALATQLLDGTPFR----PDGKIPMSVTQAKFEQ-KGERFIAKQVDSK-----K 394
Y + SV LA +LD T F PDG++ + F++ KG+ K+ K K
Sbjct: 161 YFRPESVGLAINMLDDTDFNFRQGPDGRMRVKEADMSFKKNKGDEDGVKKAQHKPSKADK 220
Query: 395 KKKL---KKVEEKMLGWG--GRD----DAKLTIPA-----TVILRFMFTPAEMRADENLR 440
KL KK +K+ WG G D D+ + PA TV+L+ MFT E+ D
Sbjct: 221 DSKLRYEKKQAQKLADWGDEGDDEPAHDSSTSGPAFRWARTVVLKHMFTLKELEEDPEAE 280
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
++ D+ EE K G V+ V V P GVV+VRF + AQ+C++ NGR F GR I A
Sbjct: 281 EDIRDDISEEASKFGEVEVVWVFNQEPDGVVVVRFSEPAQAQECVKAFNGRAFDGRIIEA 340
Query: 501 SEDDGLVNHAAIR-DLDAEASRLEQFGAELEA 531
+ + R D D E +RL +F ELEA
Sbjct: 341 TIPKSKERYRRKREDQDKENARLRKFANELEA 372
>gi|157818333|ref|NP_001101729.1| HIV-1 Tat specific factor 1 [Rattus norvegicus]
gi|149015808|gb|EDL75132.1| HIV TAT specific factor 1 (predicted) [Rattus norvegicus]
Length = 776
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/337 (33%), Positives = 170/337 (50%), Gaps = 38/337 (11%)
Query: 182 DGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
D T Y+WD +AW P+ ED +S YG F + T NV D
Sbjct: 46 DDTPYEWDLDKKAWFPKITEDFIASYQANYG-----FSGDGAPSSTGNVQD--------- 91
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGL 296
+ VEE + + ++ E K WF ++ NT+VYV+GL
Sbjct: 92 -----------ANTKTVEEPPQKEIPETTDSKKRGEKRKAESGWFHVEEDRNTNVYVSGL 140
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK+ SV LA +
Sbjct: 141 PPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKD-DQGNLKGDGLCCYLKKESVELALK 199
Query: 357 LLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA- 414
LLD R + V A+F+ KGE +K+ K K K +++K L W A
Sbjct: 200 LLDEDEIR---GYKLHVEVAQFQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAG 256
Query: 415 --KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
+L VIL+ MF P + D + +E+ D++ EC K G + + + + HP GV
Sbjct: 257 PNRLRHERVVILKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVAS 316
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V F++ ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 317 VSFREPEEADNCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|444722708|gb|ELW63388.1| HIV Tat-specific factor 1 [Tupaia chinensis]
Length = 449
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 138/244 (56%), Gaps = 14/244 (5%)
Query: 274 EANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVD 331
E K WF+++ NT+VYV+GLP D T EE +++ SK GII +P+T++ ++K+Y D
Sbjct: 116 ENRKAESGWFDVEEDKNTNVYVSGLPPDTTAEEFIQLMSKFGIIMRNPQTEELKVKLYKD 175
Query: 332 KETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD 391
+ G KGD L YLK SV LA +LLD R + V AKFE KGE +++
Sbjct: 176 NQ-GNLKGDGLCCYLKRESVELALKLLDEDEIR---GYKLRVEVAKFELKGEYDASRKKK 231
Query: 392 SKKKKKLK-KVEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEA 445
K K K +++K L W R + + +P VI++ MF P + D L +++
Sbjct: 232 KCKDYKKKLSLQQKQLDW--RPERRAGVPQKRHERVVIIKNMFHPMDFEDDPLLLNDIRK 289
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D+Q EC K G + + + + HP GV V F+D ++A CI ++GRWFGGRQI A DG
Sbjct: 290 DLQVECSKFGKIKKLLLFDRHPDGVASVSFRDPQEADYCIRTLDGRWFGGRQITAQVWDG 349
Query: 506 LVNH 509
++
Sbjct: 350 TTDY 353
>gi|19113618|ref|NP_596826.1| U2 snRNP-associated protein Uap2 [Schizosaccharomyces pombe 972h-]
gi|26401115|sp|O43120.1|UAP2_SCHPO RecName: Full=Splicing factor U2AF-associated protein 2; AltName:
Full=Cold sensitive U2 snRNA suppressor 2 homolog
gi|2911284|gb|AAC04326.1| putative splicing factor [Schizosaccharomyces pombe]
gi|4490657|emb|CAB38682.1| U2 snRNP-associated protein Uap2 [Schizosaccharomyces pombe]
Length = 367
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 97/258 (37%), Positives = 138/258 (53%), Gaps = 15/258 (5%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
+N VY+ GLP DVTV+E+ EVF KCG+I ++ + PRIKIY E G KGDAL+ +
Sbjct: 108 INKAVYIQGLPLDVTVDEIEEVFKKCGVIAKNIDNGTPRIKIY-RTEDGTPKGDALIVFF 166
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEK-- 404
+ SV LA QL D T FR M V +A + K E+ + K V KKK ++ ++
Sbjct: 167 RSESVELAEQLFDDTEFRYGSGQKMRVQKANIDYKKEKTVNKDVGGALKKKALRLRQQQM 226
Query: 405 --MLGWGGRDDA-----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPV 457
+ W D+ K V+L+ +FT E+ L +L+ D+ EE K G V
Sbjct: 227 QQISSWDDVDEEVDDKRKKRFNKIVVLKHIFTLEELDKTPELLIDLKDDITEEAEKCGRV 286
Query: 458 DSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHA-----AI 512
+V + + P GVV VRF + ++A+ C+ LM GR+F GR + AS DG V +
Sbjct: 287 TNVVLYDKEPDGVVTVRFSNNEEAEACVRLMQGRYFDGRVVEASIYDGKVRFQKSGKHTL 346
Query: 513 RDLDAEASRLEQFGAELE 530
D + E RLE+F LE
Sbjct: 347 DDEEDEEKRLEKFADWLE 364
>gi|425767555|gb|EKV06124.1| Nuclear mRNA splicing factor-associated protein, putative
[Penicillium digitatum PHI26]
gi|425780358|gb|EKV18366.1| Nuclear mRNA splicing factor-associated protein, putative
[Penicillium digitatum Pd1]
Length = 394
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 113/312 (36%), Positives = 163/312 (52%), Gaps = 45/312 (14%)
Query: 254 VVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCG 313
+V + KRK + + + KP +VNT V+VT +P D + E+ +VFSK G
Sbjct: 68 IVHPRDRKKRKSSHDETAEPKVKKP-------RVNTAVWVTKIPGDAELSEIQDVFSKYG 120
Query: 314 IIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGK 368
I+ E+ +T KPRIK+Y D E G G+AL+ Y + SV LA +LD T FR P G
Sbjct: 121 ILAEELDTGKPRIKMYTD-ENGNFNGEALIVYFRPESVNLAIDVLDETDFRMGSRNPAG- 178
Query: 369 IPMSVTQAKFEQKGERFIAKQVDS--KKKKKLK----KVEEKMLGWGGRDDAKLTIPAT- 421
PM V A F K E+ + +V K+KKKLK ++ +K+ WG D+A+ +
Sbjct: 179 -PMRVQAADFSYKREQDVQPKVTMTMKEKKKLKERAERLNKKLSDWGD-DEAEQALEVMK 236
Query: 422 --------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLV 473
VIL+ MFT E+ D E++ D++ EC KIG V V V + +GVV V
Sbjct: 237 AAEEAKRHVILKHMFTLKELEEDPLASIEIQDDIRSECSKIGEVTKVVVWDGEAEGVVTV 296
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHASEDDG---LVNHAAIRD--------LDA---EA 519
RF DA++C+++M+GR+F G + A DG + RD LDA E
Sbjct: 297 RFVASADARRCVQIMSGRFFAGNTVLAYIWDGEEKFNKYHPRRDAGGKISNPLDADDEEN 356
Query: 520 SRLEQFGAELEA 531
RLE++G LE+
Sbjct: 357 ERLERYGDWLES 368
>gi|85111692|ref|XP_964058.1| hypothetical protein NCU02633 [Neurospora crassa OR74A]
gi|28925820|gb|EAA34822.1| hypothetical protein NCU02633 [Neurospora crassa OR74A]
Length = 432
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 180/390 (46%), Gaps = 93/390 (23%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT Y++D GL+ W+P I+E + + + T DD ++GK +
Sbjct: 41 DDGTEYEFDEGLKRWLPI-----------IDEALIEEHQRAYITSLGDDDTETGQLGKGQ 89
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
+ K + ++ HN +P K ++++A + P K NT VYVTGLP D
Sbjct: 90 --GKKRKNDDREDSASSNHN-MNARPSKNRKREKAPREP------KQNTAVYVTGLPLDA 140
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
T +E+ E+FS KCG+I E+ ++ +PRIK+Y D G KGDAL+ + K SV +A LLD
Sbjct: 141 TADEVAELFSRKCGVIAEEIDSGRPRIKMYTDA-NGNFKGDALIVFFKPQSVEMAIMLLD 199
Query: 360 GTPFR----PDGK------IPMSVTQAKFEQ----------------------------- 380
T FR PDG MS + K++
Sbjct: 200 DTDFRFEPAPDGSRMRVQAADMSYKKTKYDAAAGAENGGGDKAGNGGAEGEGSSSSTPYN 259
Query: 381 --------------KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDD-----AKLTIPA- 420
G ++Q +K KK +K+ K+ W DD A+L PA
Sbjct: 260 NITNPNPVSSDGGGNGHNQRSQQDKAKIIKKTQKLSAKLADWD--DDEPSPSARLLGPAI 317
Query: 421 ----------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
VILR MFT E+ D E++ D++EEC K+GPV +V + + +G+
Sbjct: 318 KEAKGGKWDKVVILRHMFTLEELNEDPAALLEIKDDIREECAKLGPVTNVVLFDEEEEGI 377
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V V+F + A+ C+ LM+GR F GR + A
Sbjct: 378 VSVKFATVEAAEACVHLMHGRAFDGRIVEA 407
>gi|320582494|gb|EFW96711.1| nuclear mRNA splicing factor-associated protein, putative [Ogataea
parapolymorpha DL-1]
Length = 357
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 89/257 (34%), Positives = 143/257 (55%), Gaps = 9/257 (3%)
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
F NT +YV+ +P D+T EE+ VF K G++ +D T +IK+Y D E K GD L
Sbjct: 96 FRRNRNTCIYVSQIPQDITYEELEAVFGKYGVLAQDLRTGSSKIKMYKDDEDHFK-GDCL 154
Query: 343 VTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKK----KKKL 398
+ YLKE S LA +LLD T RP + P+ V++A+F K E+ ++ +K+ KKK+
Sbjct: 155 IEYLKEESCDLAIELLDETKLRPTDESPIRVSKAEFNNKAEQKPRVKMKTKERLQLKKKI 214
Query: 399 KKVEEKMLGWGGRDD----AKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
+++ +K+ W ++ +L T+I + FT E+ D +++ D++E C +I
Sbjct: 215 ERIHKKVNDWSDTEEDDETRRLREEKTLIFKHCFTLKELEDDPGAILDIKEDIREGCEEI 274
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
G V +V + + +G+V VRFK A CI MNGR+FGGR++ + DG+ +
Sbjct: 275 GEVTNVVLYDLEEEGIVSVRFKSAVAANNCISKMNGRYFGGRRLQVEKYDGITQYKKSSG 334
Query: 515 LDAEASRLEQFGAELEA 531
AE RL++F +E
Sbjct: 335 DAAEGDRLDRFAQWVEG 351
>gi|270016285|gb|EFA12731.1| hypothetical protein TcasGA2_TC002366 [Tribolium castaneum]
Length = 451
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 184/363 (50%), Gaps = 42/363 (11%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+TD DG ++ WD+ +AW P+ D ++ I ++ +
Sbjct: 107 YTDKDGVKFFWDKEKKAWFPKVD----DEFMAIYQLNY---------------------- 140
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTG 295
+ + + V EE+ + QP + EK++A++P +WFE+ NT VYV+
Sbjct: 141 --GFSENNSENSEEKKVDEEEPKVDKSQPKPKGEKRKASEP--TWFEVDETENTKVYVSN 196
Query: 296 LPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
LP D+ +E V++ KCG++ +DP ++K+Y + TG KGDAL TY+K SV LA
Sbjct: 197 LPTDIEEQEFVDLMQKCGLVMKDPTNGSFKVKLYKEPGTGYLKGDALCTYIKIESVELAL 256
Query: 356 QLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQVDSKKKKKLKKVEEKMLGWGGRD 412
LLDG ++ + V +AKF+ KG + K K+K KLKK++EK+ W
Sbjct: 257 SLLDGYDYKGHK---LKVERAKFQMKGAYDPKLKPKMKKRKEKLKLKKMQEKLFDWRPEK 313
Query: 413 --DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
D + VI++ +F P+ D L E + D++EE KIG V V + + HP+GV
Sbjct: 314 PIDERAKHEKVVIVKNLFDPSIFDTDVGLILEFQEDLREEVSKIGEVRKVMIYDRHPEGV 373
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS-RLEQFGAEL 529
+ ++A + ++L+NGRWF RQ+ A DG I + DA+ + RLE + L
Sbjct: 374 AQINMSSPEEADEVVKLLNGRWFMKRQLTAEIWDGKTKF-KIAETDAQINERLENWDKFL 432
Query: 530 EAD 532
E D
Sbjct: 433 EED 435
>gi|340924261|gb|EGS19164.1| hypothetical protein CTHT_0057890 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 409
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 125/355 (35%), Positives = 176/355 (49%), Gaps = 45/355 (12%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT Y++D + WVP + + GI E P T +D +E
Sbjct: 36 DDGTEYEFDEAQKRWVPIDYADDDLNEGGIPN---FDSEAQQPAPQGTKRKLDDWDDRED 92
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
NS N+ N +N R P K ++ PP + K NT VYVTGLP D
Sbjct: 93 NNSKNSNSNNGYNNNTTNNNKGR--PSKNPKRTNQRGPP----QPKQNTAVYVTGLPLDA 146
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TV+E+ E+FS KCG+I E+ ++ +PRIK+Y D E G KGDALV + K SV +A LLD
Sbjct: 147 TVDEVAELFSRKCGVIAEEIDSGRPRIKMYTDSE-GKFKGDALVVFFKPESVHMAIMLLD 205
Query: 360 GTPFRPDG--KIP-MSVTQAKFEQKGERF------------------------IAKQVDS 392
T FR G P M V +A K ++ ++Q +
Sbjct: 206 DTEFRLTGDPNAPRMRVMEADSSYKKVKYDTAGDGEGADGKPASQAQAQAPPRRSEQEKA 265
Query: 393 KKKKKLKKVEEKMLGWGGRDD--AKLTIPA-----TVILRFMFTPAEMRADENLRSELEA 445
K +K +K+ K+ W + AK P VILR MFT E+R D +++
Sbjct: 266 KIIRKTQKLASKLADWSDDESHAAKTLQPPKKWERVVILRNMFTLEELRDDPTAILDIKE 325
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
DV+EEC K+GPV +V + + +G+V V+FK R+ A++C+ LM+GR FGGR I A
Sbjct: 326 DVREECEKLGPVTNVVLYDLEEEGIVTVKFKTREAAEECLRLMHGRAFGGRIIEA 380
>gi|189242416|ref|XP_001811064.1| PREDICTED: similar to DEAD box ATP-dependent RNA helicase
[Tribolium castaneum]
Length = 1142
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 187/363 (51%), Gaps = 42/363 (11%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+TD DG ++ WD+ +AW P+ D E M + F + ++
Sbjct: 107 YTDKDGVKFFWDKEKKAWFPKVDD---------EFMAIYQLNYGF------SENNSENSE 151
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTG 295
++K++ E KV+ + QP + EK++A++P +WFE+ NT VYV+
Sbjct: 152 EKKVDEEEPKVD-------------KSQPKPKGEKRKASEP--TWFEVDETENTKVYVSN 196
Query: 296 LPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
LP D+ +E V++ KCG++ +DP ++K+Y + TG KGDAL TY+K SV LA
Sbjct: 197 LPTDIEEQEFVDLMQKCGLVMKDPTNGSFKVKLYKEPGTGYLKGDALCTYIKIESVELAL 256
Query: 356 QLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQVDSKKKKKLKKVEEKMLGWGGRD 412
LLDG ++ + V +AKF+ KG + K K+K KLKK++EK+ W
Sbjct: 257 SLLDGYDYKGHK---LKVERAKFQMKGAYDPKLKPKMKKRKEKLKLKKMQEKLFDWRPEK 313
Query: 413 --DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
D + VI++ +F P+ D L E + D++EE KIG V V + + HP+GV
Sbjct: 314 PIDERAKHEKVVIVKNLFDPSIFDTDVGLILEFQEDLREEVSKIGEVRKVMIYDRHPEGV 373
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS-RLEQFGAEL 529
+ ++A + ++L+NGRWF RQ+ A DG I + DA+ + RLE + L
Sbjct: 374 AQINMSSPEEADEVVKLLNGRWFMKRQLTAEIWDGKTKF-KIAETDAQINERLENWDKFL 432
Query: 530 EAD 532
E D
Sbjct: 433 EED 435
>gi|440801311|gb|ELR22331.1| RNA recognition motif domain containing protein, partial
[Acanthamoeba castellanii str. Neff]
Length = 375
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 143/249 (57%), Gaps = 14/249 (5%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
T VYV+GLP D+T EE++ +KCGI+++DP++ + ++K+Y D E G KGDALV Y K
Sbjct: 122 TTTVYVSGLPLDMTKEELLPFMAKCGIVRKDPDSGEYKVKLYED-EQGRLKGDALVVYFK 180
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKK------LKKV 401
SV LA QLLD + RP + VT+AKFE K + AK K K ++
Sbjct: 181 RESVDLALQLLDESEIRPGCVV--RVTEAKFENKPDSEQAKPKAKKPVGKKKQKKKVRIN 238
Query: 402 EEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
+E+ L W + VI++ MF+P E D+ +L+ DV+EE K G V+ +
Sbjct: 239 QEEELSW-----VEWETRRHVIIKNMFSPEEAIGDDEFYPDLKNDVREEVEKFGEVEVLT 293
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASR 521
V E +P+GVV ++F + + A KC+E+MNGR+F RQ+ A DG+ N+ + R
Sbjct: 294 VFERNPEGVVAIKFVEPEAAVKCLEVMNGRFFAKRQLVAEWYDGVTNYKVKETEADQKHR 353
Query: 522 LEQFGAELE 530
L+ FG LE
Sbjct: 354 LDAFGDWLE 362
>gi|440637909|gb|ELR07828.1| hypothetical protein GMDG_00449 [Geomyces destructans 20631-21]
Length = 385
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 115/357 (32%), Positives = 168/357 (47%), Gaps = 64/357 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
DDG+ Y++D L W+P D + Q Y ++ + EEE +
Sbjct: 41 DDGSEYEFDDALMRWIPVLDDELAAQQQAAYKVQG---VDEEETVEDM------------ 85
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
+ +E VN D+ NG R + K P NT VYVTGLP
Sbjct: 86 --RRKRKKEYVNGEDD------NGGRAVKAPKKAKAPLPPRP--------NTAVYVTGLP 129
Query: 298 DDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
DVT++E+ EVFS KCG+I E+ ++ +PRIK+Y D E+G KGDAL+ + K PSV +A
Sbjct: 130 RDVTIDEVAEVFSRKCGVIAEEIDSGRPRIKLYTD-ESGAFKGDALIVFFKAPSVNMAIT 188
Query: 357 LLDGTPFR-----PDGKIPM-----------------SVTQAKFEQKGERFIAKQVDSKK 394
LLD T FR GK+ + S + A E ER D K
Sbjct: 189 LLDDTDFRYSDLGGSGKMRVMEADSSYKKTKAEEGSGSASTADKEAARERAKTSARDKAK 248
Query: 395 K-KKLKKVEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQ 448
KK +K++ ++ W D L V+L+ MFT E+ D +++ D++
Sbjct: 249 VIKKTQKLDARLADWDDDDPQALYEAGPKPQKLVVLKHMFTLEELEEDPAAMLDIKEDIR 308
Query: 449 EECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
EEC K+G V +V + + GV VRF+D A +C+ +MNGR F G+ + A +G
Sbjct: 309 EECAKLGEVTNVVLFDLEEAGVASVRFRDEAAAAECVRVMNGRSFAGQTVEAYLSEG 365
>gi|260836212|ref|XP_002613100.1| hypothetical protein BRAFLDRAFT_125707 [Branchiostoma floridae]
gi|229298484|gb|EEN69109.1| hypothetical protein BRAFLDRAFT_125707 [Branchiostoma floridae]
Length = 400
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 116/342 (33%), Positives = 172/342 (50%), Gaps = 44/342 (12%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDL----- 231
+TD DDGT Y+WD+ AW P+ +TDD
Sbjct: 30 YTDPDDGTVYEWDQDKLAWFPK----------------------------ITDDFIAAYQ 61
Query: 232 ANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNT 289
AN V + T +++ K K+ K EK++AN P WF++ NT
Sbjct: 62 ANYGVAESTQQQTASTTQQQQTQQQQQQQKKAKKDLKPGEKRKANDP--EWFDVDEVKNT 119
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
+VYV+GLP D+T +E ++ KCGII ED ET +P+ K+Y D + G KGD YLK
Sbjct: 120 NVYVSGLPTDITDDEFKDLMQKCGIIMEDMETGQPKFKLYRD-QNGQLKGDGRCCYLKRE 178
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
SV LA Q+LD FR + V AKF+ KG A + KK+KK K ++E++L W
Sbjct: 179 SVDLALQILDNYEFRSH---TIHVEPAKFQLKGAYNPALKKKKKKQKKQKNMQERLLDWR 235
Query: 410 G--RDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
++ + +I+R MF P + D +E+ D++ EC K G V V + + HP
Sbjct: 236 PERKEPLRPKHERVMIIRNMFHPKDFEEDPLELNEISDDLRSECEKFGIVKKVLIFDRHP 295
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
GV V +K+ + A + +NGRWFGGRQ++ + DG+ ++
Sbjct: 296 DGVASVAYKEAEMADAAVAALNGRWFGGRQLNVALWDGVTDY 337
>gi|255937471|ref|XP_002559762.1| Pc13g13480 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584382|emb|CAP92417.1| Pc13g13480 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 405
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 162/311 (52%), Gaps = 44/311 (14%)
Query: 254 VVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCG 313
VV + KRK + ++ + KP +VNT V+VT +P D + E+ ++FSK G
Sbjct: 60 VVHPRDRKKRKNLHDETDEPKVKKP-------RVNTAVWVTNIPRDAELSEIRDLFSKYG 112
Query: 314 IIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGK 368
I+ E+ +T KPRIK+Y D + G GD LV Y K SV LA + LD T FR P+G
Sbjct: 113 ILAEEVDTGKPRIKMYTD-DDGNFNGDVLVVYFKPESVDLAIKWLDETDFRLGPRDPNG- 170
Query: 369 IPMSVTQAKFEQKGE-----RFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-- 421
PM V A F K E + + Q K K++ +++ +K+ WG D+A+ + A
Sbjct: 171 -PMRVQVADFSYKREQDSEPKVMTAQEKKKFKERAERLNKKLSDWGD-DEAENVMAAMKA 228
Query: 422 -------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
V+L+ MFT AE+ D E++ D++ EC KIG V V + + GVV VR
Sbjct: 229 EEEARRHVVLKNMFTLAELEEDPLASIEIQEDIERECSKIGDVAKVVIWDGEADGVVTVR 288
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHASEDDG---LVNHAAIRD--------LDA---EAS 520
F DA++C++ M+GR F G +HA DG + RD LDA E +
Sbjct: 289 FVASADARQCVKEMSGRQFSGNPVHAYIWDGEEKFNKYHPRRDAEGKKVNPLDADEDEEA 348
Query: 521 RLEQFGAELEA 531
RLE+FG LE+
Sbjct: 349 RLERFGDWLES 359
>gi|46108256|ref|XP_381186.1| hypothetical protein FG01010.1 [Gibberella zeae PH-1]
Length = 376
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 116/352 (32%), Positives = 172/352 (48%), Gaps = 61/352 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVF--PTVNVTDDLANDEVGK 238
DDG Y++D + W+ L +EE P +V DD A +
Sbjct: 32 DDGNEYEFDADSKRWI-------------------LADEEPLEPPVTDVYDDFAEFASNE 72
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPD 298
+ + ++ N + E K +P+K+ PP + K NT VYVTGLP
Sbjct: 73 AAEDGSRKRKNGSGQDSETPEPSKPSRPNKK-----QKAPP----QPKQNTAVYVTGLPP 123
Query: 299 DVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D TVEE+ ++FS K G+I E+ ++ PRIK+Y D + G KGDAL+ + K SV +A L
Sbjct: 124 DTTVEEVHDLFSRKGGVIAEEIDSGAPRIKLYNDAD-GNFKGDALIVFFKPQSVEMAIML 182
Query: 358 LDGTPFR------PDGKIPM-----SVTQAKFEQK-----------GERFIAKQVDSKKK 395
LD T FR +G+I + S + K++Q+ ER D +K
Sbjct: 183 LDDTDFRITASGTHEGRIKVQAADSSYKKVKYDQEPGAGGEKSNGAAERKPQNNRDRQKI 242
Query: 396 -KKLKKVEEKMLGWG------GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQ 448
KK +K++ K+ W G+ +A VILR MFT E+ D E++ D++
Sbjct: 243 IKKTQKLDAKLADWDDDLPYPGQPEAATKWDKLVILRHMFTLEELEEDPAALLEIKEDIR 302
Query: 449 EECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
EEC K+G V +V + + P+G+V V+FKD A CI LM+GR F GR + A
Sbjct: 303 EECAKLGTVTNVVLFDQEPEGIVSVKFKDTASAHACINLMHGRRFDGRTVEA 354
>gi|225710668|gb|ACO11180.1| HIV Tat-specific factor 1 homolog [Caligus rogercresseyi]
Length = 388
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 187/374 (50%), Gaps = 40/374 (10%)
Query: 171 SPEGEDEFTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMT--FLKEEEVFPTVNV 227
S G +TD DGT Y+WD +AW P+ DT D + ++ F + + P+V
Sbjct: 30 SESGTQTYTDPSDGTLYEWDASKKAWFPKIDT----DFMAVYQLNYGFTSDGKAQPSVPT 85
Query: 228 TDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKV 287
+N+ E N+ +++ N + K+P+ WF+++
Sbjct: 86 PSPALT------VVNNPEPGANNKTKAKQQQANSQSKKPE--------------WFQVEA 125
Query: 288 --NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETK-KPRIKIYVDKETGMKKGDALVT 344
NT VYV+GLP+ ++ E+ + + S+ G+I+ D + KP+IK+Y D G+ KGDAL +
Sbjct: 126 AKNTKVYVSGLPESISEEKFISMMSRFGVIETDVRNENKPKIKLYRD-SNGIPKGDALCS 184
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA----KQVDSKKKKKLKK 400
++ SV LA Q+LD + + DG+ +SV +A+F+ KGE + K++ K+ KL+K
Sbjct: 185 FVMVESVDLAIQILDDSLYE-DGRSRISVERARFQMKGEAYKPELKPKKLRKKELDKLRK 243
Query: 401 VEEKMLGWGGRD----DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGP 456
++K L W +++ +++ +FTP + + E++ V+ C K G
Sbjct: 244 TKDKKLAWDFDVPRGLESRPKCHKVLVISNLFTPLDFNEEPEKIFEVKGKVKSACEKFGS 303
Query: 457 VDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLD 516
+ V++ + HP+GV LV F+D ++ I+++NGR GR I DG + +
Sbjct: 304 IRKVEIFDQHPEGVGLVFFRDFEETDVAIDMLNGRLLNGRAIKTVHWDGKTKYENKETQE 363
Query: 517 AEASRLEQFGAELE 530
E RL + LE
Sbjct: 364 EEMKRLHVWNKFLE 377
>gi|388581353|gb|EIM21662.1| hypothetical protein WALSEDRAFT_38275 [Wallemia sebi CBS 633.66]
Length = 368
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 162/308 (52%), Gaps = 38/308 (12%)
Query: 253 NVVEEKHNGKRK---QPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVF 309
+V+ + KRK + E K+ PP T V+VTGLP DV+++E+ EVF
Sbjct: 70 DVIARRQEKKRKDYTSNNTNTENKKQKGPP-------PKTAVFVTGLPPDVSIDEIAEVF 122
Query: 310 SKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKI 369
KCG++ + E P++K+Y D + G KG+ALV Y KE SV+LA QLLD T R
Sbjct: 123 GKCGVLLPNDEGG-PKVKLYRD-DQGNFKGEALVVYYKEASVSLAIQLLDDTELRYGDGS 180
Query: 370 PMSVTQAKFE---------QKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLT--I 418
+ V+ A F+ Q + + + KK+ K +K++EK+ W D++ L +
Sbjct: 181 SIRVSVADFKADTANEQTKQSKHKPLTEDEKRKKQSKFRKLDEKLNDWDSEDESGLADKL 240
Query: 419 PA-----TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLV 473
P+ V+L+ MF+ E++ D L +L+ DV+E+ +IG V +V + + P+GV+ V
Sbjct: 241 PSKVTSRVVVLKHMFSLDELKEDPTLLLDLKEDVREDAEQIGQVTNVVLYDAEPEGVMTV 300
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHASEDDG---------LVNHAAIRDLDAEASRLEQ 524
+F D AQ C+ GR+FGGR++ + DG +A D D E RL +
Sbjct: 301 KFSDHIAAQACVLKYEGRFFGGRRVVSYLYDGKESFKSSQTANKYAETTDQD-EEQRLNE 359
Query: 525 FGAELEAD 532
FG LE D
Sbjct: 360 FGKWLEKD 367
>gi|154318273|ref|XP_001558455.1| hypothetical protein BC1G_03304 [Botryotinia fuckeliana B05.10]
gi|347837569|emb|CCD52141.1| similar to nuclear mRNA splicing factor-associated protein
[Botryotinia fuckeliana]
Length = 380
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/243 (39%), Positives = 137/243 (56%), Gaps = 28/243 (11%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYVTGLP DVTVEE+ EVFS KCG+I E+ ++ KPRIK+Y D E G KGDAL+ +
Sbjct: 119 NTAVYVTGLPSDVTVEEVHEVFSRKCGVIAEEIDSGKPRIKLYTD-EKGEFKGDALIVFF 177
Query: 347 KEPSVALATQLLDGTPFRPDG-----KIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-- 399
K PSV +A LLD T FR DG +I M V A+ K + D +K+K K
Sbjct: 178 KAPSVQMAIMLLDDTDFRIDGGTSNERIKMRVQAAEASYK--KVQQTDADGNEKEKPKTS 235
Query: 400 ------------KVEEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADENLRSE 442
K++ ++ W + + L ++ VIL+ MFT E+ D +
Sbjct: 236 MKDKQKIIKKTQKLDARLADWSDDEPSALVESSSKWDKVVILKHMFTLKELEEDPAAMLD 295
Query: 443 LEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASE 502
++ D+++EC K+G V +V + + GV VRF + + A+ C++LMNGR F G+++ A
Sbjct: 296 IKEDIRDECGKLGEVTNVVLYDLEKDGVASVRFANAESAKACVKLMNGRKFDGQEVEAYI 355
Query: 503 DDG 505
DG
Sbjct: 356 TDG 358
>gi|417404468|gb|JAA48986.1| Putative transcription elongation factor tat-sf1 [Desmodus
rotundus]
Length = 766
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 109/334 (32%), Positives = 169/334 (50%), Gaps = 32/334 (9%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKL 241
D T Y+WD +AW P+ +T E+ T +ND
Sbjct: 46 DDTPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGAS---- 83
Query: 242 NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDD 299
+ST + + + EE +P ++ E K WF ++ NT+VYV+GLP D
Sbjct: 84 SSTANVQDVSGSPAEEPPQRNAPEPTDPKKRGEKRKAESGWFHIEEDRNTNVYVSGLPPD 143
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 144 ITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVDLALKLLD 202
Query: 360 GTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLG-WGG-RDDA--K 415
R + V AKF+ KGE +K+ K K K +++ W R DA +
Sbjct: 203 EDEIR---GYKLHVELAKFQLKGEYDASKKKKKCKDYKKKLSQQQKQLDWRPERRDAPSR 259
Query: 416 LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
+ VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F
Sbjct: 260 MRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIKKLLLFDRHPDGVASVSF 319
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++ ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 320 RNPEEADHCIQTLDGRWFGGRQITAQAWDGTTDY 353
>gi|148710217|gb|EDL42163.1| HIV TAT specific factor 1, isoform CRA_c [Mus musculus]
Length = 464
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 137/234 (58%), Gaps = 10/234 (4%)
Query: 282 WFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
WF ++ NT+VYV+GLP D+TV+E +++ SK GII DP+T++ ++K+Y D + G KG
Sbjct: 28 WFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKD-DQGNLKG 86
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK 399
D L YLK+ SV LA +LLD R + V AKF+ KGE +K+ K K K
Sbjct: 87 DGLCCYLKKESVELALKLLDEDEIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKK 143
Query: 400 -KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
+++K L W A +L VIL+ MF P + D + +E+ D++ EC K G
Sbjct: 144 LSLQQKQLDWRPERRAGPNRLRHERVVILKNMFHPMDFEDDPLVLNEIREDLRVECSKFG 203
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + + + HP GV V F++ ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 204 QIRKLLLFDRHPDGVASVSFREPEEADHCIQTLDGRWFGGRQITAQAWDGTTDY 257
>gi|145508836|ref|XP_001440362.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407579|emb|CAK72965.1| unnamed protein product [Paramecium tetraurelia]
Length = 403
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/263 (39%), Positives = 149/263 (56%), Gaps = 23/263 (8%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
+ W+ K+NT+VYV GLP D+T+EEM FSK GII+ +PET +P IKIY D + G KG
Sbjct: 145 NQWYTPKINTNVYVEGLPQDITMEEMKVFFSKAGIIRINPETLQPTIKIYRD-QNGNCKG 203
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK 399
D L++Y SV A ++LDG RPD + + VT+A FEQKG+ ++ ++KK KL+
Sbjct: 204 DGLISYKMVESVQTAREMLDGLHIRPD--VIVKVTEAVFEQKGQ---YRKRENKKVDKLQ 258
Query: 400 KVEEK------MLGWGGRDDAK-LTIPATVILRFMFTPAEMRADE---NLRSELEADVQE 449
K + M G DD K L I +I + +++P + + E L EL +
Sbjct: 259 KALARQKEMTQMAEEGQEDDGKGLKI---LIFKNLYSPTQAQNPEFMNQLYGELLLKI-- 313
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
E ++I V ++ ++HPQGV VRF DA+ C+ ++G F GR+IH DG N
Sbjct: 314 ESLQIF-VQKLEFFKDHPQGVCKVRFHSSYDAEICLTSLSGIEFNGRKIHIQYWDGKENF 372
Query: 510 AA-IRDLDAEASRLEQFGAELEA 531
+ I + E RLE+FG LE
Sbjct: 373 KSNIESKEVEEQRLEEFGQWLEG 395
>gi|10798632|emb|CAC12816.1| hypothetical protein [Nicotiana tabacum]
Length = 86
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 66/85 (77%), Positives = 73/85 (85%)
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV 507
+EEC K GPVD VKVCENHPQGVVLV+FKDR+DA +C E MNGRWF GRQIHASEDDG V
Sbjct: 1 KEECSKFGPVDLVKVCENHPQGVVLVKFKDRRDAHRCTEAMNGRWFAGRQIHASEDDGSV 60
Query: 508 NHAAIRDLDAEASRLEQFGAELEAD 532
NHA +RD+D E RLE+FGAELEAD
Sbjct: 61 NHALVRDIDEETDRLEKFGAELEAD 85
>gi|83773998|dbj|BAE64123.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 446
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 105/298 (35%), Positives = 152/298 (51%), Gaps = 55/298 (18%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+VNT VYVT +P D EE+ VFSKCG+I E+ ++ +PRIK+Y+D + G KG+ALV +
Sbjct: 143 RVNTAVYVTSIPLDADFEEIRYVFSKCGVIAEEIDSGRPRIKMYMDDD-GKFKGEALVVF 201
Query: 346 LKEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKK--- 397
+ SV LA Q+LD + FR P G PM V A F K ++ + +K K+K
Sbjct: 202 FRPESVNLAIQMLDDSDFRLGVTGPQG--PMRVQPADFSYKSQQEAPTKTSAKDKRKIIQ 259
Query: 398 ----LKKVEEKMLGWG-GR---------------DDAKLTIPAT-------VILRFMFTP 430
L K E W GR DD +P T VIL+ MFT
Sbjct: 260 RTQRLNKYETYKYEWSFGRRDLLMWWNSKLADWDDDEPSALPETNSKFEKVVILKHMFTL 319
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
E+ D +++ D+++EC K+G V +V + + GVV V+F+D + A+ C++LM+G
Sbjct: 320 KELDDDPAAILDIKEDIRDECSKLGEVTNVVLYDKETDGVVSVKFQDPEAARNCVKLMDG 379
Query: 491 RWFGGRQIHASEDDG------------LVNHAAIRDLDA-----EASRLEQFGAELEA 531
R+F G ++ A DG + A R LDA E RL +FG LE+
Sbjct: 380 RYFAGTRVEAYISDGSERFKKSNEKRAALEDLAERGLDADEDEEEKQRLVEFGTWLES 437
>gi|336371953|gb|EGO00293.1| hypothetical protein SERLA73DRAFT_180800 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384704|gb|EGO25852.1| hypothetical protein SERLADRAFT_466558 [Serpula lacrymans var.
lacrymans S7.9]
Length = 400
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 115/348 (33%), Positives = 168/348 (48%), Gaps = 51/348 (14%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
+ DDDG+ ++D WVP D Q Y + + +E P V LA +
Sbjct: 39 YEDDDGSEMEYDAVKGTWVPLVDEDLLKKQQAAYSVAGV-----DEATPAAPV---LARE 90
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVT 294
+K E+ ++ GK + DK V+ K NT VYVT
Sbjct: 91 ---NKKRKEPEDYTSATTTAGPSSKRGKNDKKDKPVKSK--------------NTAVYVT 133
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D +E++E FSKCG+I+ED + P++K+Y +E G G+ALV Y KE SV LA
Sbjct: 134 GLPLDAEADEIIERFSKCGVIEED-DDGDPKVKLYA-REDGSFSGEALVVYFKEDSVILA 191
Query: 355 TQLLDGTPFR-PDGKIPMSVTQAKFEQKGERFIA-------KQVDSKK-KKKLKKVEEKM 405
LLD R D M V +A+F K +A K VD KK +++ K+++K+
Sbjct: 192 VNLLDDAELRFGDSSSVMKVQKAEFGHKNTSGVASGESQPRKTVDKKKASRRIGKMQKKL 251
Query: 406 LGWGGRD------------DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVK 453
L W D V+L+ MFT E+ D +L +L+ DV+EEC
Sbjct: 252 LEWDDEDGFGPSKMEEENAQVVNRNSRVVVLKHMFTLKELEKDASLLLDLKEDVREECST 311
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+G V +V + + GV+ V+F+D AQ CI M+GR+F GR++ AS
Sbjct: 312 LGEVTNVVLYDKEADGVMTVKFRDPLSAQACILKMSGRFFDGRRVEAS 359
>gi|241707741|ref|XP_002403221.1| conserved hypothetical protein [Ixodes scapularis]
gi|215505032|gb|EEC14526.1| conserved hypothetical protein [Ixodes scapularis]
Length = 401
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 191/378 (50%), Gaps = 46/378 (12%)
Query: 174 GEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNV 227
G D +T +DGT Y+WD AW P+ ED ++ + YG E+ + T++
Sbjct: 26 GSDPYTYVDPNDGTVYEWDVQKNAWFPKLTEDFLAAYHANYGSGEVQNAEGTSQVATISQ 85
Query: 228 TDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL-- 285
+ GK++ + K + N +++ + A WFE+
Sbjct: 86 NQQSNAQQSGKKQ--DPKAKPSDPGNALDDSVH--------------ACVSCAGWFEIDD 129
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
NT VYV+ LP D+ EE VE+ +KCG++ +D E +IK+Y E G KGDAL Y
Sbjct: 130 AHNTRVYVSNLPSDMDEEEFVEIMAKCGLVMKD-EKGDYKIKLYRTPE-GNFKGDALCCY 187
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVE--- 402
+K SV LA ++LDG +R G+ M V +A+F+ KG K+ KK K+ +K++
Sbjct: 188 IKVESVDLALRILDG--YRLRGQ-EMRVERARFQLKGAYDPTKKPKKKKGKEKEKLKKKI 244
Query: 403 --------EKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
EK+ G G++++ TV+L+ MF P E D L E + D++EEC K
Sbjct: 245 EKLFDWRPEKLRGMRGKNES------TVVLKNMFEPKEFEGDPTLILEYQKDLREECAKF 298
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
G V V V + +P+GV V FK+ ++A CIELMNGRWF RQ+ A DG + I
Sbjct: 299 GEVKKVVVYDRNPEGVATVTFKEPEEADACIELMNGRWFAQRQLSAETWDGKTRYKIIET 358
Query: 515 LDAEASRLEQFGAELEAD 532
+ RL+++ L+ +
Sbjct: 359 EEELEERLKKWDEYLDGE 376
>gi|380481347|emb|CCF41897.1| splicing factor u2af-associated protein 2 [Colletotrichum
higginsianum]
Length = 415
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/300 (37%), Positives = 152/300 (50%), Gaps = 52/300 (17%)
Query: 248 VNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVE 307
+N+A NGK + P K+ K A PP + + NT VYVTGLP D TV+E+ E
Sbjct: 110 INAAQGEAGGNGNGKVR-PAKKA--KPARAPP----QPRQNTAVYVTGLPADATVDEVHE 162
Query: 308 VFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPD 366
VFS K G+I E+ ++ +PRIK+Y D E G KGDALV + K SV +A LLD T FR +
Sbjct: 163 VFSRKAGVIAEEIDSGRPRIKMYTD-EMGNFKGDALVVFFKPQSVEMAIMLLDDTEFRFE 221
Query: 367 ---GKIP-MSVTQAKFEQKGERF---------------------IAKQVDSKKK------ 395
G+ P M V A K ++ + Q ++
Sbjct: 222 PGTGEAPKMRVQAADSSYKKVKYDENGAEGAGGTGGKGGDNGEGTSAQPKRERNDRDRQK 281
Query: 396 --KKLKKVEEKMLGWGGRDDAKLTIPA--------TVILRFMFTPAEMRADENLRSELEA 445
+K +K+ K+ W DD IP TVIL+ MFT AE+ D E++
Sbjct: 282 IIRKTQKMAAKLADWS--DDDTAAIPTETNSKWDKTVILKHMFTLAELEEDPAALLEIKE 339
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
D++EEC K+G V +V + + P GVV V+F + AQ CI+LMNGR F GR + AS G
Sbjct: 340 DIREECAKLGNVTNVVLYDEEPDGVVSVKFSQPQSAQACIQLMNGRSFDGRVVEASTATG 399
>gi|302497914|ref|XP_003010956.1| hypothetical protein ARB_02854 [Arthroderma benhamiae CBS 112371]
gi|291174502|gb|EFE30316.1| hypothetical protein ARB_02854 [Arthroderma benhamiae CBS 112371]
Length = 421
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 128/410 (31%), Positives = 185/410 (45%), Gaps = 97/410 (23%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
D+G Y +D L+ W+P D S +M + EEE AN
Sbjct: 37 DNGEEYTYDDALKRWIPSLDESLMEQQRQAYKMQGVDEEEP----------AN------- 79
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L + +EK KRK D+ ++ KP +VNT VYVT +P D
Sbjct: 80 LKTLQEK------------KKKRKHNDESTTAQKPKKP-------RVNTAVYVTAIPLDA 120
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
TV E+ +FSKCG+I E+ ++ KPRIK+Y D E G KGDALV Y + SV LA Q+LD
Sbjct: 121 TVSEISSLFSKCGVIAEEIDSGKPRIKMYTD-EQGAFKGDALVVYFRPESVNLAIQMLDD 179
Query: 361 TPFRPDGKIP---MSVTQAKFEQKGERFIAKQVDSKKKKKLKK------------VEEKM 405
T FR K M V A F K + + + + K K+ + +E +
Sbjct: 180 TDFRFGEKGTEGNMRVQPADFSFKAVQEAPAKANMRDKMKIIRKTQKLNKYAAVQLERLI 239
Query: 406 LGWGGRDDAKLT--------------IPATVILRFMFTPAEMRA-------------DEN 438
+ W +KLT V+L+ MFT E+ D
Sbjct: 240 VVWLTWAYSKLTDWDDDDVGPRHSGKAGKVVVLKHMFTLQELECINMFLTSGFVSQEDPA 299
Query: 439 LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
+++ D++EEC K+G V +V + + G+ VRF D + A+ C+++MNGR+FGG Q+
Sbjct: 300 AILDIKEDIREECSKLGEVTNVVLYDKEESGIATVRFSDPECARACVQMMNGRFFGGTQV 359
Query: 499 HASEDDGLVNH--------AAIRD----------LDAEASRLEQFGAELE 530
A +G V AA++D D EA RL++FGA LE
Sbjct: 360 EAYVVEGKVRFKKSGASAAAALQDDGAGWEAEAGKDDEAQRLDKFGAWLE 409
>gi|409080583|gb|EKM80943.1| hypothetical protein AGABI1DRAFT_119482 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 399
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/273 (38%), Positives = 146/273 (53%), Gaps = 32/273 (11%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVTGLP D +E+VE FS+CG+I+ED + +P++K+Y KE G G+ALV Y K
Sbjct: 129 NTAVYVTGLPLDAEQDEIVERFSRCGVIEED-DVGEPKVKMYA-KEDGSFIGEALVVYFK 186
Query: 348 EPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGERFIA------KQVDSKK-KKKLK 399
E SV LA LLD R D M V++A F K K VD K+ +++
Sbjct: 187 EDSVILAINLLDEAELRIGDSSTVMHVSKADFGHKNNTTTREGGQPRKVVDKKRATRRIG 246
Query: 400 KVEEKMLGWG--------------GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEA 445
K+++K+L W G+D +K V+LR MFT ++ D L EL+
Sbjct: 247 KMQKKLLEWDDEDGFGPAKEPEDPGKDISKHN--RVVVLRHMFTLEDLEKDATLLLELKE 304
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
DV+EEC +G V +V + + G++ V+F+D AQ CI MNGR+F GR+I AS G
Sbjct: 305 DVREECSTLGEVTNVVLYDKEKDGIMTVKFRDPISAQACILRMNGRFFDGRRIEASLFVG 364
Query: 506 ---LVNHAAIRDL---DAEASRLEQFGAELEAD 532
A DL D E+ RL+ F L A+
Sbjct: 365 RQRFKRSGAGEDLEGEDTESKRLDDFAQWLMAE 397
>gi|426197497|gb|EKV47424.1| hypothetical protein AGABI2DRAFT_185370 [Agaricus bisporus var.
bisporus H97]
Length = 397
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/273 (38%), Positives = 146/273 (53%), Gaps = 32/273 (11%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVTGLP D +E+VE FS+CG+I+ED + +P++K+Y KE G G+ALV Y K
Sbjct: 127 NTAVYVTGLPLDAEQDEIVERFSRCGVIEED-DVGEPKVKMYA-KEDGSFIGEALVVYFK 184
Query: 348 EPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGERFIA------KQVDSKK-KKKLK 399
E SV LA LLD R D M V++A F K K VD K+ +++
Sbjct: 185 EDSVILAINLLDEAELRIGDSSTVMHVSKADFGHKNNTTTREGGQPRKVVDKKRATRRIG 244
Query: 400 KVEEKMLGWG--------------GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEA 445
K+++K+L W G+D +K V+LR MFT ++ D L EL+
Sbjct: 245 KMQKKLLEWDDEDGFGPAKEPEDPGKDISKHN--RVVVLRHMFTLEDLEKDATLLLELKE 302
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
DV+EEC +G V +V + + G++ V+F+D AQ CI MNGR+F GR+I AS G
Sbjct: 303 DVREECSTLGEVTNVVLYDKEKDGIMTVKFRDPISAQACILRMNGRFFDGRRIEASLFVG 362
Query: 506 ---LVNHAAIRDL---DAEASRLEQFGAELEAD 532
A DL D E+ RL+ F L A+
Sbjct: 363 RQRFKRSGAGEDLEGEDTESKRLDDFAQWLMAE 395
>gi|169846907|ref|XP_001830167.1| splicing factor u2af-associated protein 2 [Coprinopsis cinerea
okayama7#130]
gi|116508750|gb|EAU91645.1| splicing factor u2af-associated protein 2 [Coprinopsis cinerea
okayama7#130]
Length = 389
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 131/233 (56%), Gaps = 25/233 (10%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVTGLP D +E+V FSKCG+I+ED + +P++K+Y KE G G+ALV Y K
Sbjct: 121 NTAVYVTGLPLDAEADEIVARFSKCGVIEED-DNGEPKVKMYA-KEDGTFSGEALVVYFK 178
Query: 348 EPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGE---------RFIAKQVDSKKKKK 397
E SV LA LLD R D M V QA F K E R + K+ +++
Sbjct: 179 EDSVLLAENLLDEAELRLGDASTVMRVRQADFTHKNENKDGVSQPRRVVDKKAATRR--- 235
Query: 398 LKKVEEKMLGWG-------GRDDAKLTIPAT---VILRFMFTPAEMRADENLRSELEADV 447
+ K+++K+ WG D +P T V+L+ MFT E+ D L +L+ DV
Sbjct: 236 IGKMQKKLNEWGFDDGFGPQPDPEDKPVPRTSRVVVLKHMFTIEELEKDATLLLDLKEDV 295
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
+EEC +G V +V + + P+G++ V+FKD AQ C+ MNGR+F GR+I A
Sbjct: 296 REECSTLGEVTNVTLYDKEPEGIMTVKFKDPLSAQACVIKMNGRFFDGRKIVA 348
>gi|70569833|dbj|BAE06485.1| Ci-HTATSF1 [Ciona intestinalis]
Length = 494
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 177/354 (50%), Gaps = 46/354 (12%)
Query: 170 ASPEGEDEFTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVT 228
S +G +TD +G +Y+WD + WV ++ + +E+ ++ E P N+
Sbjct: 125 VSKDGNYIYTDPSNGDKYEWDSNKQVWVKKKFKEN-----NLEDDNIIQPSEATP--NLE 177
Query: 229 DDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--K 286
D+ N E +KL V + K+K K +K+ A++P WF L
Sbjct: 178 DESWNSE---KKLWGE----------VPIEKQSKKKNHQKSGQKRAASRP--EWFSLDKS 222
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYV GLP D+T++E ++ KCGII + ++P++K+Y + G KGD L YL
Sbjct: 223 KNTSVYVEGLPSDITMDEFRDLMLKCGIIATNVVNQQPKLKLYTNA-GGNIKGDGLCCYL 281
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL-------- 398
K SV L+ QLLDG R + V +AKFE KGE D+ KK K+
Sbjct: 282 KRESVELSLQLLDGMEVRGHK---LKVQEAKFELKGE------YDASKKPKMLSKKEKRK 332
Query: 399 -KKVEEKMLGWGGRDDAKLTIPA--TVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK +EK+L W ++ VIL+ MF E D L +++ D++ EC K G
Sbjct: 333 IKKEKEKLLDWKLARSEEIVSKKDRVVILKKMFQVNEFEEDPVLITDVRDDLRAECEKFG 392
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V V V + HP GV V FK+ + A KC + +NGRWF ++I AS DG ++
Sbjct: 393 QVKKVIVFDRHPDGVCSVAFKEAESATKCQQALNGRWFACKEIEASIWDGHTDY 446
>gi|198414021|ref|XP_002128481.1| PREDICTED: HTATSF1 protein [Ciona intestinalis]
Length = 471
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 177/354 (50%), Gaps = 46/354 (12%)
Query: 170 ASPEGEDEFTD-DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVT 228
S +G +TD +G +Y+WD + WV ++ + +E+ ++ E P N+
Sbjct: 102 VSKDGNYIYTDPSNGDKYEWDSNKQVWVKKKFKEN-----NLEDDNIIQPSEATP--NLE 154
Query: 229 DDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--K 286
D+ N E +KL V + K+K K +K+ A++P WF L
Sbjct: 155 DESWNSE---KKLWGE----------VPIEKQSKKKNHQKSGQKRAASRP--EWFSLDKS 199
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYV GLP D+T++E ++ KCGII + ++P++K+Y + G KGD L YL
Sbjct: 200 KNTSVYVEGLPSDITMDEFRDLMLKCGIIATNVVNQQPKLKLYTNA-GGNIKGDGLCCYL 258
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL-------- 398
K SV L+ QLLDG R + V +AKFE KGE D+ KK K+
Sbjct: 259 KRESVELSLQLLDGMEVRGHK---LKVQEAKFELKGE------YDASKKPKMLSKKEKRK 309
Query: 399 -KKVEEKMLGWGGRDDAKLTIPA--TVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK +EK+L W ++ VIL+ MF E D L +++ D++ EC K G
Sbjct: 310 IKKEKEKLLDWKLARSEEIVSKKDRVVILKKMFQVNEFEEDPVLITDVRDDLRAECEKFG 369
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
V V V + HP GV V FK+ + A KC + +NGRWF ++I AS DG ++
Sbjct: 370 QVKKVIVFDRHPDGVCSVAFKEAESATKCQQALNGRWFACKEIEASIWDGHTDY 423
>gi|342887868|gb|EGU87296.1| hypothetical protein FOXB_02172 [Fusarium oxysporum Fo5176]
Length = 377
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 173/353 (49%), Gaps = 62/353 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVF--PTVNVTDDLANDEVGK 238
DDG Y++D + WV L +E P +D A + V
Sbjct: 32 DDGNEYEFDADSKRWV-------------------LADENALEPPAAGALNDFA-ESVSH 71
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPD 298
+ ++ +K + E H + +P K + + K P + K NT +YVTGLP
Sbjct: 72 DTVDGGAKKRKN-----ESSHGSETPEPSKPSRQNKKQKAPQ---QPKQNTAIYVTGLPA 123
Query: 299 DVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D TVEE+ ++FS K G+I E+ ++ PRIK+Y D E G KGDAL+ + K SV +A L
Sbjct: 124 DATVEEVHDLFSRKGGVIAEEIDSGAPRIKLYTDSE-GKFKGDALIVFFKPQSVEMAIML 182
Query: 358 LDGTPFR------PDGKIPM-----SVTQAKFEQK-----------GERFIAKQVDSKKK 395
LD T FR +G+I + S + K++Q+ ER + ++K
Sbjct: 183 LDDTDFRITPSGTREGRIRVQAADSSYKKVKYDQEGGAGGEGGNVTAERKPQNKDRDRQK 242
Query: 396 --KKLKKVEEKMLGWG------GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADV 447
KK +K++ K+ W G+ +A VILR MFT E+ D E++ D+
Sbjct: 243 IIKKTQKLDAKLADWDDDVPYPGQPEAATKWDKLVILRHMFTLEELEEDPAALLEIKEDI 302
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
+EEC K+G V +V + + P+G+V V+FKD A+ CI LM+GR F GR + A
Sbjct: 303 REECAKLGTVTNVVLFDQEPEGIVSVKFKDADSARACINLMHGRSFDGRTVEA 355
>gi|198466675|ref|XP_001354089.2| GA19321 [Drosophila pseudoobscura pseudoobscura]
gi|198150709|gb|EAL29828.2| GA19321 [Drosophila pseudoobscura pseudoobscura]
Length = 601
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 185/372 (49%), Gaps = 29/372 (7%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
GE +TD DGT Y WD AW P+ D + YG + T E+E
Sbjct: 229 GERTYTDKDGTVYLWDAEKSAWFPKIDDDFMAQYQMNYGFIDNTSAGEKEK--------- 279
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVN 288
A+ E+ + K E K +A+ + P + K++A +PP WFE+ N
Sbjct: 280 -ADKEIAEAKRKEEELKRMTAEAEAAMARDESGVGPAASLGKRKAPEPP-KWFEMDPSQN 337
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T VYV LP D+T EE E+ KCG++ DP+T++P++K+Y + + G KGD L Y+K
Sbjct: 338 TKVYVDNLPLDITKEEFAELMGKCGMVMRDPKTQQPKLKLYTEAD-GQFKGDGLCDYIKV 396
Query: 349 PSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGW 408
SV LA ++LD R + V +A+F+ +GE A + KKK K K + K +
Sbjct: 397 ESVHLALKILDDYDLRGHK---IRVQRAQFQMRGEYNPALKPKRKKKDKEKMQKMKEKLF 453
Query: 409 GGRDDA----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
R D + TVIL+ +FTP + L E + +++EEC K G V V + +
Sbjct: 454 DWRPDKMRGERSKHEKTVILKNLFTPELFEKEVELILEYQNNLREECGKCGMVRKVVIYD 513
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA----S 520
HP+G+ + ++A I++M GR+FG RQ+ A DG + I + EA S
Sbjct: 514 RHPEGIAQINMSSPEEADLVIQMMQGRFFGQRQLTADHWDGQTKY-KIDESAVEAHQRLS 572
Query: 521 RLEQFGAELEAD 532
+ +++ A EAD
Sbjct: 573 KWDEYLAAEEAD 584
>gi|408391869|gb|EKJ71236.1| hypothetical protein FPSE_08599 [Fusarium pseudograminearum CS3096]
Length = 378
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 171/354 (48%), Gaps = 63/354 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVF--PTVNVTDDLANDEVGK 238
DDG Y++D + W+ L +EE P + DD A +
Sbjct: 32 DDGNEYEFDADSKRWI-------------------LADEEPLEPPVTDAYDDFAEFASNE 72
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPD 298
+ + ++ N + E K +P+K+ PP + K NT VYVTGLP
Sbjct: 73 AAEDGSRKRKNGSGQDSETPEPSKPSRPNKK-----QKAPP----QPKQNTAVYVTGLPP 123
Query: 299 DVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D TVEE+ ++FS K G+I E+ ++ PRIK+Y D + G KGDAL+ + K SV +A L
Sbjct: 124 DTTVEEVHDLFSRKGGVIAEEIDSGAPRIKLYNDAD-GNFKGDALIVFFKPQSVEMAIML 182
Query: 358 LDGTPFR------PDGKIPM-----SVTQAKFEQK-------------GERFIAKQVDSK 393
LD T FR +G+I + S + K++Q+ ER D +
Sbjct: 183 LDDTDFRFTASGTHEGRIKVQAADSSYKKVKYDQEPGAGAGGEKSNGAAERKPQNNRDRQ 242
Query: 394 KK-KKLKKVEEKMLGWG------GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEAD 446
K KK +K++ K+ W G+ +A VILR MFT E+ D E++ D
Sbjct: 243 KIIKKTQKLDAKLADWDDDLPYPGQPEAATKWDKLVILRHMFTLEELEEDPAALLEIKED 302
Query: 447 VQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
++EEC K+G V +V + + P+G+V V+FKD A CI LM+GR F GR + A
Sbjct: 303 IREECAKLGTVTNVVLFDQEPEGIVSVKFKDTVSAHACINLMHGRRFDGRTVEA 356
>gi|290999681|ref|XP_002682408.1| RNA binding domain-containing protein [Naegleria gruberi]
gi|284096035|gb|EFC49664.1| RNA binding domain-containing protein [Naegleria gruberi]
Length = 418
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 182/361 (50%), Gaps = 31/361 (8%)
Query: 183 GTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLN 242
G Y + +AW+P +EE F ++ ++ TD++ + + KEK
Sbjct: 66 GLEYCYHGDQQAWLP-----------FVEEAQFENQQSIY-HFEETDEMRFERLEKEKAL 113
Query: 243 STEEKVNSADNVVEEKHNGKRKQPDKQ----VEKKEANKPPDSWFELKVNTHVYVTGLPD 298
E++ +EK + D++ +KK N W + NT +Y++GLP
Sbjct: 114 KLREQLEMERKEKKEKRGNIKSMMDQENGEPYQKKRKNDFAPGWHS-QFNTSIYISGLPL 172
Query: 299 D---VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
D V + +++ FSKCGIIK DP T++P++K+Y + + G KGDA VT+LKE S+ LA
Sbjct: 173 DEKKVHNQLLIDEFSKCGIIKTDPFTEQPKVKLYRN-DDGSLKGDARVTFLKEESIDLAI 231
Query: 356 QLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQV--DSKKKKKLKKVEEKMLGWGGRDD 413
L DG DG ++VT+A+F K E + A ++K++K+ E K L WG DD
Sbjct: 232 TLFDGASLFGDGST-ITVTRAEF-TKPENYDANISLEYHNRRKQIKEKENKKLHWGFADD 289
Query: 414 AKLTIPA------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
+T TVIL+ MF P + ++ +EL+ +++ E K G ++ +K+ +P
Sbjct: 290 PDVTTETAEPSDRTVILKHMFVPDDFSSNPLFGAELKDEIKPEMEKYGKIEKIKLYPENP 349
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGA 527
GVV +RFK A+ CI +GR F G ++ A DG + E RLE++
Sbjct: 350 DGVVEIRFKTLHAARTCISENHGRLFDGIKLIAYIWDGKERFNVKETKEQEQERLEKYTK 409
Query: 528 E 528
E
Sbjct: 410 E 410
>gi|412988900|emb|CCO15491.1| predicted protein [Bathycoccus prasinos]
Length = 457
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 124/413 (30%), Positives = 189/413 (45%), Gaps = 68/413 (16%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDG------YGIEEMTFLKE----EEVFPTVNV 227
F DDDGT Y W + + P+E N + + EMTF E E+V P +
Sbjct: 51 FVDDDGTTYDWSEEEKRYKPREMNEEGNGNGGAGDGFDVREMTFCPEDDSNEKVLPPPSF 110
Query: 228 TDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKV 287
+ S +K V+ K +++ K+ ++ +AN ++
Sbjct: 111 KKSFNEKKNNTNNTKSERKKPEIDIEAVKRKREQQQQNKSKKQKQTKANTKTNA------ 164
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
T VY LP D TVE + + FSKCG IK DP T P+IK+Y ++ + G+ALVTYL
Sbjct: 165 TTSVYCENLPRDATVERVEKFFSKCGQIKRDPATLMPKIKLYEEEGKNVFSGNALVTYLL 224
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKK---------- 397
PSV LA +LDG F G+ + VT+A F + R ++ D+ K +
Sbjct: 225 RPSVELALTVLDGAKFELVGE-EVKVTEADFSKSKGRGAGEENDANKNESNGNIGNVNAK 283
Query: 398 ------------------------LKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEM 433
LK+ E+ LGW G DD V+LR ++ ++
Sbjct: 284 KGTTSAIDKYAHVSKEEIRKNAALLKRKAERQLGWDGFDDEHDPTKTMVVLRNIYDENDL 343
Query: 434 ---RAD----ENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI 485
R D + EL+ DV EEC VK G V++ V N GVV VRFK+ + A C+
Sbjct: 344 EEARKDGLNAQTFSDELKEDVAEECRVKCGKVENAYVNAN---GVVTVRFKEPEGADACL 400
Query: 486 ELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS------RLEQFGAELEAD 532
+LM+ RWFGG+Q+ A +G+ +++++ + + RL+ + A L D
Sbjct: 401 QLMHNRWFGGKQLKAEMWNGVEKFIGLKNIEKKETEEEENRRLDAYAATLGQD 453
>gi|66361740|ref|XP_627392.1| hypothetical protein [Cryptosporidium parvum Iowa II]
gi|46228866|gb|EAK89736.1| conserved hypothetical protein [Cryptosporidium parvum Iowa II]
Length = 274
Score = 145 bits (365), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/236 (39%), Positives = 133/236 (56%), Gaps = 13/236 (5%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
VNT +YV GLP D+T EEM F + GIIK DP T P+IK+Y D+ T G+ALV Y
Sbjct: 38 VNTSIYVQGLPKDITFEEMKYFFGRGGIIKIDPTTLSPKIKLYNDELTKEFSGNALVVYK 97
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS---KKKKKLKKV-- 401
E S+ LA + L+ T RP K+ + +A F + + IA ++ KK+K+LK
Sbjct: 98 YEQSIELALKYLNETEIRPGFKV--KIEKAIFNKSSKSNIAPLSETEINKKRKQLKAAEM 155
Query: 402 -EEKMLGWGGRDDAKLTIPAT--VILRFMFT--PAEMRAD-ENLRSELEADVQEECVKIG 455
EE++L W + + A+ V+LR M++ AEM + ++ SELE +VQEE K
Sbjct: 156 EEERLLSWSNEQHSLNSNIASRIVVLRPMYSRKEAEMYPEGDDFYSELEIEVQEEVTKYC 215
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAA 511
V SV HPQG+V V+ K + DA+ I++ N R+F GRQI DG + A
Sbjct: 216 EVVSVTCIPRHPQGIVCVKLKSQHDAEIVIDIFNQRYFDGRQIEVHMYDGSTDFKA 271
>gi|224000860|ref|XP_002290102.1| RNA binding protein [Thalassiosira pseudonana CCMP1335]
gi|220973524|gb|EED91854.1| RNA binding protein [Thalassiosira pseudonana CCMP1335]
Length = 279
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 104/263 (39%), Positives = 148/263 (56%), Gaps = 27/263 (10%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE----TGMKKGDALVTYL 346
VYVTGLP D EE+ FSK GI+ DPE++KP++K+Y +K+ TG KGDA + Y
Sbjct: 5 VYVTGLPKDTDEEEVAAYFSKVGILDLDPESQKPKVKLYREKKDKKGTGPLKGDASICYA 64
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF------IAKQVDSKKKKKLKK 400
+ SV LA Q+LD FR DG +SV +AKFEQ+G F +++ S+ K+K+ +
Sbjct: 65 RAESVDLALQILDENLFR-DGAT-LSVQRAKFEQQGGDFENGSNNNGRRMVSEAKRKVAR 122
Query: 401 VEE-KMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADEN---LRSELEADVQEECV 452
+ + +GW GR L V+L MF P E+ DEN LR LE +V EC
Sbjct: 123 LAALQAVGWDEGENGRIAGGLKGLRIVVLMNMFDPKELEKDENDEKLRR-LEKEVHLECQ 181
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF-GGRQIHASEDDGLVNHAA 511
+IG V+ + V HP GV++V+F+ DA + NG GR++ A DG+ ++
Sbjct: 182 EIGVVEKITVFSKHPAGVIIVKFEKPNDASDAVSKFNGEIRPNGRKVEAHYWDGVTDY-T 240
Query: 512 IRDLDAEAS----RLEQFGAELE 530
+RD + EA RL+QFG LE
Sbjct: 241 VRDAEKEAKDAEQRLDQFGDWLE 263
>gi|226291373|gb|EEH46801.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 336
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 166/324 (51%), Gaps = 48/324 (14%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
+DGT +++D L WVP +++ ++ E + V +D D +++
Sbjct: 46 EDGTEWEYDNALNRWVPL-----------VDDALLEQQREAYKVEGVNEDEPVDLKQQQQ 94
Query: 241 LNSTEEKVNSAD-NVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDD 299
+ K + D N ++ K K+P +VNT VYVT +P D
Sbjct: 95 QRRLKRKNYTGDQNDIKPTQVSKPKKP-------------------RVNTAVYVTSIPLD 135
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
+++E+ +VFSKCG+I E+ + +KPRIK+Y D E G KGDALV Y + SV LA Q+LD
Sbjct: 136 ASMDEVNDVFSKCGVIAEEIDRRKPRIKMYTDDE-GKFKGDALVVYFRPESVNLAIQMLD 194
Query: 360 GTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK--KVEEKMLGWGGRD 412
T FR P+GK+ + F+ + E + + + K+K +K ++ K+ W D
Sbjct: 195 DTDFRFGETGPEGKMKVQAADFSFKAQKEAPVKQNMSEKRKIMIKTQRLNSKLADWD--D 252
Query: 413 DAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
D T+P + VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 253 DEPSTVPTSSSRWDKVVILKHMFTLQELEEDPAAILDIKEDIRQECSKLGDVTNVVLYDR 312
Query: 466 HPQGVVLVRFKDRKDAQKCIELMN 489
+GV VRF D + A C++L++
Sbjct: 313 EEEGVASVRFSDAESANTCVQLLS 336
>gi|164661703|ref|XP_001731974.1| hypothetical protein MGL_1242 [Malassezia globosa CBS 7966]
gi|159105875|gb|EDP44760.1| hypothetical protein MGL_1242 [Malassezia globosa CBS 7966]
Length = 329
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 90/238 (37%), Positives = 135/238 (56%), Gaps = 19/238 (7%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+ NT V+V+GLP D + E+ EVF++ G++ ED + PR+K+Y D ET + KG+ALV Y
Sbjct: 54 RPNTSVFVSGLPLDASAAEIAEVFARYGVLLED-DDGHPRVKMYQDPETRVFKGEALVVY 112
Query: 346 LKEPSVALATQLLDGTPFRPDGKI----PMSVTQAKF------EQKGERFIAKQVDSKK- 394
K SV LA QLLD T R I M V +A+F E R + D K
Sbjct: 113 FKPESVDLAVQLLDDTYLRASKGILTGPCMKVQRAEFHHDPGRENDERRAPLSEADKKSI 172
Query: 395 KKKLKKVEEKMLGWGGRDDAKLTIPA-------TVILRFMFTPAEMRADENLRSELEADV 447
++++ ++ K+ W D++ A TV+L+ MFT AE+ AD L +L+ DV
Sbjct: 173 RRRMNRMHNKINDWDSDSDSERRAAAGPGPSARTVVLKKMFTLAELDADPTLLLDLKQDV 232
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+EEC +G V +V + + P+G++ VRF++ A+ C + M+GR+F GRQI A DG
Sbjct: 233 REECETLGDVTNVVLWDKEPEGLISVRFREPHVARACEQRMHGRYFAGRQITAMRVDG 290
>gi|429848182|gb|ELA23696.1| nuclear mrna splicing factor-associated [Colletotrichum
gloeosporioides Nara gc5]
Length = 396
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 99/267 (37%), Positives = 142/267 (53%), Gaps = 41/267 (15%)
Query: 273 KEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVD 331
K A PP + + NT VYVTGLP D T +E+ E+FS K G+I E+ ++ +PRIK+Y+D
Sbjct: 121 KPARAPP----QPRQNTAVYVTGLPTDATADEVHELFSRKAGVIAEEIDSGRPRIKMYMD 176
Query: 332 KETGMKKGDALVTYLKEPSVALATQLLDGTPFR---PDGKIPMSVTQAKFEQKGERF--- 385
E+G KGDALV + K SV +A LLD T FR P G+ M V A K ++
Sbjct: 177 -ESGNFKGDALVVFFKPQSVEMAIMLLDDTDFRFDGPAGQPRMRVQAADSSYKKTQYEDG 235
Query: 386 ---------------IAKQVDSKKKKKL----KKVEEKMLGWGGRDDAKLTIPA------ 420
++ + K ++K+ +K+ K+ W DD +P
Sbjct: 236 AGAEGKENGGSSNGQQKRERNDKDRQKIIRKTQKMAAKLADWS--DDEPSMMPTETNSKW 293
Query: 421 --TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDR 478
TVIL+ MFT E+ D E++ D+++EC K+G V +V + + P GVV V+F
Sbjct: 294 DKTVILKHMFTLEELAEDPAALLEIKEDIRDECSKLGTVTNVVLYDEEPDGVVSVKFSKP 353
Query: 479 KDAQKCIELMNGRWFGGRQIHASEDDG 505
+ AQ CI+LMNGR F GR + AS G
Sbjct: 354 ESAQACIQLMNGRSFDGRVVEASVSTG 380
>gi|209882250|ref|XP_002142562.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
gi|209558168|gb|EEA08213.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
Length = 261
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 88/230 (38%), Positives = 125/230 (54%), Gaps = 12/230 (5%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT +YVTGLP D+T+EE+ F++CGIIK DP T +P+IK+Y DKET KGDALV+Y
Sbjct: 18 NTSIYVTGLPRDITIEEVRNFFTRCGIIKIDPSTLEPKIKLYKDKETNELKGDALVSYKF 77
Query: 348 EPSVALATQLLDGTPFRP--DGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL--KKVEE 403
+ SV LA + LD T R KI ++ Q + +D KK+ + K E+
Sbjct: 78 QESVELALKYLDQTEIRCGYSVKIQKAIFNVNHSQYNKPEDKNLLDIHKKQLIAAKLEEQ 137
Query: 404 KMLGWG-----GRDDAKLTIPATVILRFMFTPAE---MRADENLRSELEADVQEECVKIG 455
+++ W G K V+LR M++ + D+ ELE ++ EE K G
Sbjct: 138 RLMSWSNEEVIGISQGKSLNTRIVVLRHMYSKQDAEKFNEDDLFYKELEDEIYEEVSKFG 197
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V +V HP G+V V+FK +DA+ + +N R+F GRQI A DG
Sbjct: 198 TVINVTSIPRHPHGIVCVKFKKSEDAEIAVSYLNNRFFDGRQIEAFLYDG 247
>gi|67605399|ref|XP_666681.1| RNA recognition motif (RRM)-containing protein [Cryptosporidium
hominis TU502]
gi|54657719|gb|EAL36449.1| RNA recognition motif (RRM)-containing protein [Cryptosporidium
hominis]
Length = 255
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 130/236 (55%), Gaps = 13/236 (5%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
VNT +YV GLP D+T EEM F + GIIK DP T P+IK+Y D+ T G+ALV Y
Sbjct: 19 VNTSIYVQGLPKDITFEEMKYFFGRGGIIKIDPTTLSPKIKLYNDELTKEFSGNALVVYK 78
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDS---KKKKKLKKV-- 401
E S+ LA + L+ T RP K+ + +A F + + IA ++ KK+K+LK
Sbjct: 79 YEQSIELALKYLNETEIRPGFKV--KIEKAIFNKSSKPNIAPLSETEINKKRKQLKAAEM 136
Query: 402 -EEKMLGWGGRDDAKLTIPAT--VILRFMFTPAEMRA---DENLRSELEADVQEECVKIG 455
EE++L W + + A+ V+LR M++ E + SELE +VQEE K
Sbjct: 137 EEERLLSWSNEQHSLNSNIASRIVVLRPMYSRKEAETYPEGDEFYSELEIEVQEEVTKYC 196
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAA 511
V SV HPQG+V V+ K + DA+ I++ N R+F GRQI A DG + A
Sbjct: 197 EVVSVTCIPRHPQGIVCVKLKSQHDAEIVIDIFNQRYFDGRQIEAHMYDGSTDFKA 252
>gi|295667872|ref|XP_002794485.1| splicing factor U2AF-associated protein [Paracoccidioides sp.
'lutzii' Pb01]
gi|226285901|gb|EEH41467.1| splicing factor U2AF-associated protein [Paracoccidioides sp.
'lutzii' Pb01]
Length = 336
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 165/324 (50%), Gaps = 48/324 (14%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
+DGT +++D L WVP +++ ++ E + V +D D +++
Sbjct: 46 EDGTEWEYDNALNRWVPL-----------VDDALLEQQREAYKVEGVNEDEPVDLKQQQQ 94
Query: 241 LNSTEEKVNSAD-NVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDD 299
+ K + D N ++ K K+P +VNT VYVT +P D
Sbjct: 95 QRRLKRKNYTGDQNDIKPTQVSKPKKP-------------------RVNTAVYVTSIPLD 135
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
+++E+ +VFSKCG+I E+ + +KPRIK+Y D E G KGDALV Y + SV LA Q+LD
Sbjct: 136 ASIDEVNDVFSKCGVIAEEIDRRKPRIKMYTDDE-GKFKGDALVVYFRPESVNLAIQMLD 194
Query: 360 GTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK--KVEEKMLGWGGRD 412
T FR P+GK+ + F+ + E + + + K+K +K ++ K+ W D
Sbjct: 195 DTDFRFGETGPEGKMKVQAADFSFKAQKEAPVKQNMSEKRKIMMKTQRLNSKLADWD--D 252
Query: 413 DAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
D T P + VIL+ MFT E+ D +++ D+++EC K+G V +V + +
Sbjct: 253 DEPSTAPTSSSRWDKVVILKHMFTLQELEEDPAAILDIKEDIRQECSKLGDVTNVVLYDK 312
Query: 466 HPQGVVLVRFKDRKDAQKCIELMN 489
+GV VRF D + A C++L++
Sbjct: 313 EEEGVASVRFSDAESANTCVQLLS 336
>gi|310790944|gb|EFQ26477.1| splicing factor u2af-associated protein 2 [Glomerella graminicola
M1.001]
Length = 419
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/281 (36%), Positives = 147/281 (52%), Gaps = 51/281 (18%)
Query: 263 RKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPET 321
R +P K+ K A PP + + NT VYVTGLP D TV+E+ E+FS K G++ E+ ++
Sbjct: 128 RARPAKKA--KPARAPP----QPRQNTAVYVTGLPADATVDEVHELFSRKAGVVAEEIDS 181
Query: 322 KKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR---PDGKIPMSVTQA-- 376
+PRIK+Y D + G KGDALV + K SV +A LLD T FR G+ P QA
Sbjct: 182 GRPRIKMYTD-DKGNFKGDALVVFFKPQSVEMAIMLLDDTDFRFAPGTGETPKMRVQAAD 240
Query: 377 ------KFEQKGERFIA------------------KQVDSKKKKKL----KKVEEKMLGW 408
K+ + G + ++ + K ++K+ +K+ K+ W
Sbjct: 241 SSYKKVKYNEDGAGTASGKGGNGGDDGEGASAQPKRERNDKDRQKIIRKTQKMAAKLADW 300
Query: 409 GGRDDAKLTIPA--------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV 460
DD IP TVIL+ MFT AE+ D E++ D++EEC K+G V +V
Sbjct: 301 S--DDDTAAIPPETNSKWDKTVILKHMFTLAELEEDPAALLEIKEDIREECAKLGNVTNV 358
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + P GVV V+F + AQ CI+LMNGR F GR + AS
Sbjct: 359 VLYDEEPDGVVSVKFSQSQSAQACIQLMNGRSFDGRFVEAS 399
>gi|296426028|ref|XP_002842538.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638810|emb|CAZ80273.1| unnamed protein product [Tuber melanosporum]
Length = 399
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 171/342 (50%), Gaps = 36/342 (10%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
DDGT +++D G++ WV D + Q Y + + + E V P
Sbjct: 44 DDGTEFEYDEGIKRWVSVLDETLAEQQRAAYAVAGVD--ESEPVAP-------------- 87
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTG 295
+N +KV ++ E+ + K+ + K S F + NT VY+T
Sbjct: 88 ---VNKKRKKVYTSSEGNEDPTHPKKSKNTKDSSSSTDPTTASSSFPAPARKNTAVYITN 144
Query: 296 LPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
LP D T +E+ +VFS+ G+I E+ + KPRIK+Y + E G KGDALV Y + SV LA
Sbjct: 145 LPPDTTEDEVNKVFSRFGVIAEEIDRGKPRIKLYRN-EDGSVKGDALVVYFRPESVNLAV 203
Query: 356 QLLDGTPFR----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKK--KKLKKVEEKMLGWG 409
Q+LD + FR G+I + ++ + + + K KKK K +K+ K+ W
Sbjct: 204 QMLDDSDFRFGVSEGGRIKVQPADFSYKAQQDAPVRKNARDKKKVIAKTQKLNNKLADWD 263
Query: 410 GRDDAKL-----TIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
D +K+ VILR MFT ++ D +L+ D++EEC K+G V +V + +
Sbjct: 264 DDDPSKIPDITSKYDKVVILRHMFTLKQLEEDPTAILDLKEDIREECEKLGDVTNVVLYD 323
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGL 506
G+V VRF + + A+ C+ +M+GR+F G+++ A DGL
Sbjct: 324 KEEDGIVSVRFANAESAKACVGVMHGRFFDGQRVEAYIFDGL 365
>gi|346472937|gb|AEO36313.1| hypothetical protein [Amblyomma maculatum]
Length = 419
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/372 (34%), Positives = 184/372 (49%), Gaps = 52/372 (13%)
Query: 174 GEDEFT---DDDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNV 227
G D +T +DGT Y+WD AW P+ ED ++ YG T
Sbjct: 22 GSDPYTYVDPNDGTVYEWDHAKHAWFPKLTEDFLAAYQANYGA-------------TDEA 68
Query: 228 TDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPD--SWFEL 285
T A D + S EK + G P++ + K A P SWFEL
Sbjct: 69 TGQGAGDASSSSQAVSVTEKPAA----------GSHHPPEQSNQAKPAKPKPSEPSWFEL 118
Query: 286 --KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
NT VYV+ LPDD+ EE +E+ SKCG++ +D E +IK+Y +E G KGDAL
Sbjct: 119 DDAHNTWVYVSNLPDDIDEEEFLELMSKCGLLMKD-EKGNYKIKLYRTRE-GELKGDALC 176
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER----FIAKQVDSKKKKKLK 399
Y+K SV LA +++DG R D +I V +A+F+ KG K+ +K K+KLK
Sbjct: 177 CYIKVESVELALRIIDGYRLR-DKEI--RVERAQFQLKGSYDPTMKPKKKKQAKDKEKLK 233
Query: 400 KVEEKMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVK 453
K EK+ W G RD + TV+L+ MF P + +D L E + D++EEC +
Sbjct: 234 KKIEKLFDWRPEKLRGMRDKNE----CTVVLKNMFEPKDFESDPTLILEYQKDLREECSQ 289
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIR 513
G V V V + +P+GV + FK+ ++A CI MNGRWF RQ+ A DG +
Sbjct: 290 FGEVKKVVVYDRNPEGVATITFKEPEEADACISRMNGRWFAQRQLSAETWDGRTKYKIFE 349
Query: 514 DLDAEASRLEQF 525
+ +RL+++
Sbjct: 350 TEEELEARLKKW 361
>gi|358400651|gb|EHK49977.1| hypothetical protein TRIATDRAFT_297340 [Trichoderma atroviride IMI
206040]
Length = 373
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 138/248 (55%), Gaps = 37/248 (14%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSK-CGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYVTGLP + TV+E+ ++FS+ G+I E+ ++ PRIK+Y D E G KGDALV +
Sbjct: 110 NTAVYVTGLPTNATVDEIHDLFSRRAGVIAEEIDSGAPRIKMYTDSE-GNFKGDALVVFF 168
Query: 347 KEPSVALATQLLDGTPFR--PDG----KIPMSVTQAKFEQ-----------------KGE 383
K SV +A LLD T FR PDG +I + V + +++ G+
Sbjct: 169 KPQSVDMAVMLLDDTVFRYLPDGTAEGRIRVQVADSSYKKTQYEPAGGSGDASNGSGSGQ 228
Query: 384 RFIAKQVDSKKK-KKLKKVEEKMLGWGGRDD---AKLTIPA------TVILRFMFTPAEM 433
+ D +K KK +K+ K+ W DD L P+ TVIL+ MFT E+
Sbjct: 229 AKPRNERDRQKIIKKTQKLAAKLADWD--DDEPYPALAAPSNSKRDKTVILKHMFTLQEL 286
Query: 434 RADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
D E++ D+++EC K+GPV SV + + P GVV V+FK+ + A CI+LMNGR F
Sbjct: 287 EEDPAALLEIKEDIRDECSKLGPVTSVVLYDLEPDGVVSVKFKNVESAAACIKLMNGRSF 346
Query: 494 GGRQIHAS 501
GG + AS
Sbjct: 347 GGSTVQAS 354
>gi|322695060|gb|EFY86875.1| RNA recognition motif family protein [Metarhizium acridum CQMa 102]
Length = 374
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 168/352 (47%), Gaps = 62/352 (17%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT +++D L+ WVP E+ ++G+ D +E
Sbjct: 32 DDGTEFEFDADLKRWVPAEEEPLDHEGHS---------------------YGQDSPAQEL 70
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
++ +N E G + P + KK+ P + NT VYVTGLP D
Sbjct: 71 QGGARKRHFETENGSE----GSNQTPPARPNKKQKAPP-----RPRENTAVYVTGLPLDA 121
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TV E+ ++FS K G+I E+ ++ PRIK+Y D E+G KGDAL+ + K SV +A LLD
Sbjct: 122 TVSEVHDLFSRKGGVIAEEIDSGAPRIKLYSD-ESGNFKGDALIVFFKPQSVEMAIMLLD 180
Query: 360 GTPFR------PDGKIPMSVTQAKFE-----------------QKGERFIAKQVDSKKK- 395
T FR +G++ + + ++ Q+ +R D +K
Sbjct: 181 DTDFRVTATGTREGRMRVQAADSSYKKVQYEADGTPGGGSNGGQQDKRPQRSDRDRQKII 240
Query: 396 KKLKKVEEKMLGWGGRDDAKLTIPA------TVILRFMFTPAEMRADENLRSELEADVQE 449
KK +K++ K+ W D + TVILR MFT E+ D E++ D+++
Sbjct: 241 KKTQKLDAKLADWDDDDPYPAMTQSNSKKDRTVILRHMFTLEELDEDPAALLEIKEDIRD 300
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
EC K+G V SV + + GVV VRFKD + A CI+LM+GR F GR + AS
Sbjct: 301 ECAKLGTVTSVVLYDQEEDGVVSVRFKDTESAAACIKLMHGRSFDGRVVEAS 352
>gi|350296885|gb|EGZ77862.1| hypothetical protein NEUTE2DRAFT_154421 [Neurospora tetrasperma
FGSC 2509]
Length = 496
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 121/401 (30%), Positives = 181/401 (45%), Gaps = 102/401 (25%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK-- 238
DDGT Y++D GL+ W+P I+E + + + T DD ++GK
Sbjct: 92 DDGTEYEFDEGLKRWLPI-----------IDEALIEEHQRAYITSLGDDDAETGQLGKGQ 140
Query: 239 --EKLNSTEEKVNSADNV-------VEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNT 289
++ N E + ++ HN + P K ++++A + P K NT
Sbjct: 141 GKKRKNDDRETFSFFPSLRARLQDSASSNHNINNR-PSKNRKREKAPREP------KQNT 193
Query: 290 HVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
VYVTGLP D T +E+ E+FS KCG+I E+ ++ +PRIK+Y D G KGDAL+ + K
Sbjct: 194 AVYVTGLPLDATADEVAELFSRKCGVIAEEIDSGRPRIKMYTDA-NGNFKGDALIVFFKP 252
Query: 349 PSVALATQLLDGTPFR----PDGK------IPMSVTQAKFEQ------------------ 380
SV +A LLD T FR PDG MS + K++
Sbjct: 253 QSVEMAIMLLDDTDFRFEPAPDGSRMRVQAADMSYKKTKYDAAAGAENGGGDKAGNGGAE 312
Query: 381 -------------------------KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDD-- 413
G ++Q +K KK +K+ K+ W DD
Sbjct: 313 GEGSSSSTPYNNITNPNPVSSDGGGNGHNQRSQQDKAKIIKKTQKLSAKLADWD--DDEP 370
Query: 414 ---AKLTIPA-----------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDS 459
A+L PA VILR MFT E+ D E++ D++EEC K+GPV +
Sbjct: 371 SPSARLLGPAIKEAKGGKWDKVVILRHMFTLEELNEDPAALLEIKDDIREECAKLGPVTN 430
Query: 460 VKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V + + +G+V V+F + A+ C+ LM+GR F GR + A
Sbjct: 431 VVLFDEEEEGIVSVKFATVEAAEACVHLMHGRAFDGRIVEA 471
>gi|353235550|emb|CCA67561.1| related to Splicing factor U2AF-associated protein 2
[Piriformospora indica DSM 11827]
Length = 390
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 108/340 (31%), Positives = 166/340 (48%), Gaps = 38/340 (11%)
Query: 177 EFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
EF DDDG +WD AWVP ++E +++ V+ V ++ V
Sbjct: 33 EFEDDDGNEMEWDDVKNAWVPL-----------VDEELVRQQQAVYSVAGVDENTPAAPV 81
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGL 296
+ + ++ D K+ + + K A+K NT V+V+ L
Sbjct: 82 LARENKKRKAGQSNGDITTSAGPAKKKGKNEGSTNTKPASK----------NTAVFVSNL 131
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D TV+E+ E F K G+I ED + +P+IK+Y D + G G+ALV Y E SV LA
Sbjct: 132 PRDATVDEIAERFGKFGVIMED-DAGEPKIKMYAD-DKGNFNGEALVVYFMEGSVNLAVN 189
Query: 357 LLDGTPFR-PDGKIPMSVTQAKF--EQKGERFIAKQVDSKKK--KKLKKVEEKMLGWGGR 411
LLD R D M V +F +Q G + ++V KKK ++ K++ K+ W
Sbjct: 190 LLDEAELRLGDQSTIMRVQPGEFGHKQSGGVEVTRRVVDKKKATSRIAKLKNKVGDWDFD 249
Query: 412 D----------DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
D ++ V+L+ MFT E++ D +L +L+ DV+EE IG V +V
Sbjct: 250 DGFGPTEEPVQKEQIIESRVVVLKHMFTLEELKEDPSLLLDLKEDVREEAETIGDVTNVV 309
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + P+G++ ++F+D A+ CI MNGRWF GRQI AS
Sbjct: 310 LYDEEPEGIMTIKFRDPISAKACIIKMNGRWFSGRQIEAS 349
>gi|21357269|ref|NP_649313.1| barricade, isoform A [Drosophila melanogaster]
gi|24668057|ref|NP_730629.1| barricade, isoform C [Drosophila melanogaster]
gi|16769348|gb|AAL28893.1| LD27763p [Drosophila melanogaster]
gi|23094232|gb|AAN12166.1| barricade, isoform A [Drosophila melanogaster]
gi|23094233|gb|AAN12167.1| barricade, isoform C [Drosophila melanogaster]
gi|220946898|gb|ACL85992.1| CG6049-PA [synthetic construct]
gi|220956554|gb|ACL90820.1| CG6049-PA [synthetic construct]
Length = 556
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 125/392 (31%), Positives = 193/392 (49%), Gaps = 45/392 (11%)
Query: 158 GYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMT 214
G G D+H GE +TD DG + WD AW P+ D + YG + T
Sbjct: 167 GVCGVDEH--------GERTYTDKDGVEFFWDATKSAWFPKIDDDFMARYQMNYGFIDNT 218
Query: 215 FLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKH--NGKRKQPDKQVEK 272
E+E +E+ +++ + E S DN GKRK
Sbjct: 219 SAGEKEKAEKEAAEAKRKEEEL--KRMTAEAEAAMSRDNPASSAAVPTGKRK-------- 268
Query: 273 KEANKPPDSWFELKV--NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYV 330
A +PP WFE+ NT VYV+ LP D+T++E ++ KCG++ DP+T+K ++K+Y
Sbjct: 269 --AQEPP-KWFEMDPLQNTKVYVSNLPLDITMDEFADLMGKCGMVMRDPQTQKFKLKLYA 325
Query: 331 DKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAK 388
+K+ G KGD L Y+K SV LA ++LD R + V +A+F+ +GE +
Sbjct: 326 EKD-GQIKGDGLCDYIKVESVNLALKILDEYNLRGHK---IRVQRAQFQMRGEYNPALKP 381
Query: 389 QVDSKKKKKLKKVEEKMLGWGGRDDA----KLTIPATVILRFMFTPAEMRADENLRSELE 444
+ K K+KL+K++EK+ W R D + TVI++ +FTP + L E +
Sbjct: 382 KRKKKDKEKLQKMKEKLFDW--RPDKLRGERSKNEKTVIIKNLFTPELFEKEVELILEYQ 439
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
+++EEC K G V V + + HP GV + ++A I++M GR+FG RQ+ A D
Sbjct: 440 NNLREECSKCGMVRKVVIYDRHPDGVAQINMASPEEADLVIQMMQGRYFGQRQLSAEAWD 499
Query: 505 GLVNHAAIRDLDAEA----SRLEQFGAELEAD 532
G + I + EA S+ ++F AE E D
Sbjct: 500 GKTKY-KIEESAVEAHERLSKWDEFLAEEETD 530
>gi|194749471|ref|XP_001957162.1| GF24193 [Drosophila ananassae]
gi|190624444|gb|EDV39968.1| GF24193 [Drosophila ananassae]
Length = 555
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 174/348 (50%), Gaps = 28/348 (8%)
Query: 173 EGEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTD 229
+GE +TD DG+ + WD AW P+ D + YG + T E+E
Sbjct: 169 QGERTYTDKDGSVFFWDTAKGAWFPKIDDDFMARYQMNYGFIDNTSAGEKEKAEKEAAEA 228
Query: 230 DLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KV 287
+E+ + + + + GKRK A +PP WFE+
Sbjct: 229 KRKEEELKRMTAEAEAAMAKGSAESIAVAPTGKRK----------AQEPP-KWFEMDPTQ 277
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYV+ LP D+T++E E+ KCG++ DP+T+K ++K+Y +K+ G KGD L Y+K
Sbjct: 278 NTKVYVSNLPLDITIDEFAELMGKCGMVMRDPQTQKFKLKLYAEKD-GQIKGDGLCDYIK 336
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEEKM 405
SV LA ++LD +R + V +A+F+ +GE + + K K+KL+K++EK+
Sbjct: 337 VESVNLALKILDEYNYRGHK---IRVQRAQFQMRGEYNPALKPKRKKKDKEKLQKMKEKL 393
Query: 406 LGWGGRDDA----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
W R D + TVIL+ +FTP L E + +++EEC K G V V
Sbjct: 394 FDW--RPDKMRGERSKNEKTVILKNLFTPELFEKQVELILEYQNNLREECAKCGMVRKVV 451
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+ + HP+GV V ++A I++M GR+FG R++ A DG +
Sbjct: 452 IYDRHPEGVAQVNMASPEEADLVIQMMQGRFFGQRKLTAETWDGKTKY 499
>gi|212542529|ref|XP_002151419.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces marneffei ATCC 18224]
gi|210066326|gb|EEA20419.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces marneffei ATCC 18224]
Length = 363
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 162/326 (49%), Gaps = 66/326 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG Y +D L+ WV D + Q Y +
Sbjct: 42 EDGQEYGYDTALKRWVQTIDDALLEQQRQAYKV--------------------------- 74
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
E ++ TEE S E KRKQPD + +K+ + +VNT VY+T +P
Sbjct: 75 -EGVDETEEMTRS------ELRKKKRKQPDDENGQKQKRQ--------RVNTAVYITSIP 119
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T++E+ +VF KCG+I E+ ++ +PRIK+Y+D ++G KG+ALV Y + SV LA Q+
Sbjct: 120 LDATLDEIRDVFCKCGVIAEEIDSGRPRIKMYMD-DSGNFKGEALVVYFRPESVNLAIQM 178
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + KKK+ +K+ K+ W
Sbjct: 179 LDESNFRIGQSGPLG--PMKVQAADFSYKSQQDAPTNTNMRDKKKIMKRTQKLNSKLADW 236
Query: 409 GGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
DD +P T V+L+ MFT E+ D +++ D+++EC K+G V +V
Sbjct: 237 D--DDEPSALPDTSSRFDKVVVLKHMFTLEEIEEDPAAILDIKQDIRDECSKLGDVTNVV 294
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIEL 487
+ + P GVV VRF D A++C+++
Sbjct: 295 LYDKEPAGVVTVRFSDPDAARQCVKV 320
>gi|194875554|ref|XP_001973620.1| GG13238 [Drosophila erecta]
gi|190655403|gb|EDV52646.1| GG13238 [Drosophila erecta]
Length = 554
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 122/392 (31%), Positives = 196/392 (50%), Gaps = 45/392 (11%)
Query: 158 GYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMT 214
G G D+H GE +TD DG + WD AW P+ D + YG + T
Sbjct: 165 GVYGVDEH--------GERTYTDKDGVEFFWDATKSAWFPKIDDDFMARYQMNYGFIDNT 216
Query: 215 FLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKH--NGKRKQPDKQVEK 272
E+E +E+ +++ + E S DN GKRK
Sbjct: 217 SAGEKEKAEKEAAEAKRKEEEL--KRMTAEAEAAMSRDNPATSTAVPTGKRK-------- 266
Query: 273 KEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYV 330
A +PP WFE+ NT VYV+ LP D++++E ++ KCG++ DP+T+K ++K+Y
Sbjct: 267 --AQEPP-KWFEMDPSQNTKVYVSNLPLDISMDEFADLMGKCGMVMRDPQTQKFKLKLYA 323
Query: 331 DKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAK 388
+K+ G KGD L Y+K SV LA ++LD R + V +A+F+ +GE +
Sbjct: 324 EKD-GQIKGDGLCDYIKVESVNLALKILDEYNLRGHK---IRVQRAQFQMRGEYNPALKP 379
Query: 389 QVDSKKKKKLKKVEEKMLGWGGRDDA----KLTIPATVILRFMFTPAEMRADENLRSELE 444
+ K K+KL+K++EK+ W R D + TVI++ +FTP + L E +
Sbjct: 380 KRKKKDKEKLQKMKEKLFDW--RPDKMRGERSKNEKTVIIKNLFTPELFEKEVELILEYQ 437
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
+++EEC K G V V + + HP+GV + ++A I++M GR+FG RQ+ A D
Sbjct: 438 NNLREECSKCGMVRKVVIYDRHPEGVAQINMASPEEADIVIQMMQGRYFGQRQLTAESWD 497
Query: 505 G----LVNHAAIRDLDAEASRLEQFGAELEAD 532
G ++ +A+ + + S+ ++F AE E D
Sbjct: 498 GKTKYKIDESAV-EANERLSKWDEFLAEEETD 528
>gi|219112279|ref|XP_002177891.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410776|gb|EEC50705.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 689
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 93/267 (34%), Positives = 146/267 (54%), Gaps = 32/267 (11%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YVTGLP D T EE+ +F K GII DPET++P+IKIY+D+ +G+ KGDA + Y + S
Sbjct: 414 IYVTGLPPDCTEEEIASIFCKAGIIDLDPETQQPKIKIYLDQASGLPKGDASICYARAES 473
Query: 351 VALATQLLDGTPFRPDGK------IPMSVTQAKFEQKGERF--IAKQVDSKKKKKLKKVE 402
V LA LLD PFRP + + V +AKFEQ+G F ++V K+K K
Sbjct: 474 VDLAVTLLDEAPFRPSVRSDACVQYVLHVERAKFEQRGRVFDDGRQRVSLAKRKVAKLAA 533
Query: 403 EKMLGW----------GGRDDAKLTIPATVILRFMFTPAEMRADE--NLRSELEADVQEE 450
+ W GGR ++ V+L+ +F P+ + A+E + + LE D+++E
Sbjct: 534 VQATDWDEGEFNGRLTGGRKGLRI-----VVLKHLFDPSVLSANEEDGMLAVLERDLRKE 588
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG-RWFGG---RQIHASEDDGL 506
C + G V+ + + + QGVV+V+F A I+ ++G W G R++HA+ DG+
Sbjct: 589 CEQWGVVEKITIFSKNLQGVVVVKFAQPGSASDAIKHLDGLEWPTGSSKRRVHATFWDGV 648
Query: 507 VNHAA---IRDLDAEASRLEQFGAELE 530
+ I++ + R ++FG LE
Sbjct: 649 TDFTVRNEIKEQEEAEKRQKEFGNWLE 675
>gi|322708706|gb|EFZ00283.1| nuclear mRNA splicing factor-associated protein, putative
[Metarhizium anisopliae ARSEF 23]
Length = 375
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 115/357 (32%), Positives = 168/357 (47%), Gaps = 72/357 (20%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT +++D L+ WVP E+ L EE + D + +E
Sbjct: 32 DDGTEFEFDAHLKRWVPAEEEP-------------LDHEE--------HNYGQDSLAQEL 70
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
T ++ +N E G + P + KK+ P + NT VYVTGLP D
Sbjct: 71 QGRTRKRHLDTENGSE----GSNQTPPSRPNKKQKAPP-----RPRENTAVYVTGLPLDA 121
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TV E+ E+FS K G+I E+ ++ PRIK+Y D E G KGDAL+ + K SV +A LLD
Sbjct: 122 TVSEVHELFSRKGGVIAEEIDSGAPRIKLYSD-EAGNFKGDALIVFFKPQSVEMAIMLLD 180
Query: 360 GTPFRPD------GKIPMSVTQAKFE-----------------------QKGERFIAKQV 390
T FR + G++ + + ++ Q+ +R K +
Sbjct: 181 DTDFRMNATGTREGRMRVQAADSSYKKVQYETDGTPGGGSNGGREDKRPQRSDRDRQKII 240
Query: 391 DSKKKKKLKKVEEKMLGWGGRDDAKLTIPA------TVILRFMFTPAEMRADENLRSELE 444
KK +K++ K+ W D + TVILR MFT E+ D E++
Sbjct: 241 -----KKTQKLDAKLADWDDDDPYPAMTQSNSKRDKTVILRHMFTLEELDEDPAALLEIK 295
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
D+++EC K+G V SV + + GVV VRFKD + A CI+LM+GR F GR + AS
Sbjct: 296 EDIRDECAKLGTVTSVVLYDQEVDGVVSVRFKDTESAAACIKLMHGRSFDGRVVEAS 352
>gi|195376801|ref|XP_002047181.1| GJ12075 [Drosophila virilis]
gi|194154339|gb|EDW69523.1| GJ12075 [Drosophila virilis]
Length = 541
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 182/373 (48%), Gaps = 31/373 (8%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
G+ +TD DG + WD AW P+ D + YG F+ +
Sbjct: 164 GDRIYTDKDGIVFLWDASKSAWFPKIDDDFMARYQMNYG-----FIDNTSAGEREKEERE 218
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVN 288
A + E+L + +A N +G P V K + P WFE+ N
Sbjct: 219 AAEVKRKAEELKRMTAEAQAAMNAPILDKDGA---PTASVTGKRKAQEPPKWFEVDPTQN 275
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T VYV+ LP D+T++E E+ KCG+I DP+T+K ++K+Y + + G KGD L Y+K
Sbjct: 276 TKVYVSNLPLDITMDEFAELMGKCGLIMRDPQTQKYKLKLYTEAD-GQIKGDGLCDYIKV 334
Query: 349 PSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEEKML 406
SV LA ++LD R + V +A+F+ +GE + + K K+KL+K++EK+
Sbjct: 335 ESVNLALEILDDYILRGHK---IHVQRAQFQMRGEYNPALKPKRKKKDKEKLQKIKEKLF 391
Query: 407 GW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV 460
W G R + TVI++ +FTP + L E + ++EEC K G V V
Sbjct: 392 DWRPDKMRGERSKNE----KTVIIKNLFTPELFEKEVELILEYQTSLREECGKCGMVRKV 447
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS 520
+ + HP+GV + ++A I++M GR+FG RQ+ A DG + I + AEAS
Sbjct: 448 VIYDRHPEGVAQINMSTPEEADIVIQMMQGRYFGQRQLSAEHWDGKTKY-KIDESAAEAS 506
Query: 521 -RLEQFGAELEAD 532
RL ++ L A+
Sbjct: 507 ERLSKWDEFLAAE 519
>gi|302680382|ref|XP_003029873.1| hypothetical protein SCHCODRAFT_58953 [Schizophyllum commune H4-8]
gi|300103563|gb|EFI94970.1| hypothetical protein SCHCODRAFT_58953, partial [Schizophyllum
commune H4-8]
Length = 367
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 144/265 (54%), Gaps = 29/265 (10%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVTGLP D +E+V FSKCG+++ED E P++K+Y +E G G+ALV+Y K
Sbjct: 96 NTSVYVTGLPPDTDSDELVARFSKCGVLEEDDEGD-PKVKMYA-REDGTFNGEALVSYFK 153
Query: 348 EPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGERFIA-----KQVDSKK-KKKLKK 400
E SV LA +LD R D MSV++A F KG + + VD KK +++ K
Sbjct: 154 EDSVLLALNILDDAELRIGDPSTRMSVSKADFSAKGNAGQSGDKPRRTVDKKKATRRIGK 213
Query: 401 VEEKMLGWGGRD------DAKLTIPA------TVILRFMFTPAEMRADENLRSELEADVQ 448
+++K+ W D + + T P V+L+ MFT E+ D L +L+ +V+
Sbjct: 214 MQKKIDEWKDEDGFGPQLEVQDTQPVANRNSRVVVLKHMFTLEELEKDATLLLDLKEEVR 273
Query: 449 EECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG--- 505
EE +G V +V + + P+GV+ ++F+D AQ CI MNGR+F GR++ A DG
Sbjct: 274 EEAETLGEVTNVVLYDKEPEGVMTIKFRDPLSAQACIIKMNGRFFDGRRVEAYLYDGRQR 333
Query: 506 LVNHAAIRDLD-----AEASRLEQF 525
A D+D AE RL F
Sbjct: 334 FRRSNAADDIDGTGDEAEKRRLANF 358
>gi|302926132|ref|XP_003054233.1| hypothetical protein NECHADRAFT_74744 [Nectria haematococca mpVI
77-13-4]
gi|256735174|gb|EEU48520.1| hypothetical protein NECHADRAFT_74744 [Nectria haematococca mpVI
77-13-4]
Length = 377
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 171/351 (48%), Gaps = 58/351 (16%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDG Y++D + W+ +D + + + E+ T ++ DE +++
Sbjct: 32 DDGNEYEFDADQKRWILTDDEPLEPEHFDTHELH----------GGHTSQVSQDEGSRKR 81
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
N + + + N K+K P + + NT VYVTGLP D
Sbjct: 82 KNGSGPSSETTGTPKPTRPNKKQKAPPQP----------------RQNTAVYVTGLPADT 125
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TV+E+ E+FS K G+I E+ ++ PRIK+Y D + G KGDAL+ + K SV +A LLD
Sbjct: 126 TVQEVHELFSRKGGVIAEEIDSGAPRIKLYTDPD-GNFKGDALIVFFKPQSVEMAIMLLD 184
Query: 360 GTPFRPD------GKIPM-----SVTQAKFEQ-----------KGERFIAKQVDSKKK-- 395
T FR + G+I + S + K++Q E+ K ++K
Sbjct: 185 DTDFRINPSGTREGRIRVQAADSSYKKVKYDQDDAASGDKSNGAAEQKPQKNNRDRQKII 244
Query: 396 KKLKKVEEKMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQE 449
KK +K++ K+ W G+ +A VILR MFT E+ D E++ D++E
Sbjct: 245 KKTQKLDAKLADWDDDIPYAGQPEAATKWDKLVILRHMFTLEELEEDPAALLEIKEDIRE 304
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
EC K+G V +V + + P+G+V V+FKD A CI+LM+GR F GR + A
Sbjct: 305 ECSKLGTVTNVVLFDQEPEGIVSVKFKDADSALACIKLMHGRSFDGRTVEA 355
>gi|332376037|gb|AEE63159.1| unknown [Dendroctonus ponderosae]
Length = 466
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 115/361 (31%), Positives = 187/361 (51%), Gaps = 46/361 (12%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+TD DGT+ WD+ +AW P+ D ++ I +M++
Sbjct: 131 YTDTDGTKLFWDKAKKAWFPRID----DEFMAIYQMSY---------------------- 164
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTG 295
TE + + + +P + EK++A++P +WFE+ K NT+VYV+
Sbjct: 165 ----GFTEAPKSPEKPEPTPEETAETAKP--KGEKRKASEP--TWFEVDEKQNTNVYVSN 216
Query: 296 LPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
LP D+ +E V+ KCG++ DP T ++K+Y ++ TG KGDAL TY++ SVALA
Sbjct: 217 LPLDIEEQEFVDFMQKCGLVMRDPVTGNFKVKLYRERGTGQLKGDALCTYIRIESVALAL 276
Query: 356 QLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQVDSKKKKKLKKVEEKMLGWG--G 410
+LLDG ++ D K+ V +AKF+ KGE + K K K+KLKK ++K+ W
Sbjct: 277 KLLDGYIYK-DHKV--KVERAKFQLKGEFDPKLKPKMKKRKDKQKLKKQQDKLFDWRPEK 333
Query: 411 RDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
++ + VI++ +F P D +L E + D++EE KIG V V + + HP+GV
Sbjct: 334 KEGDRAKHERIVIIKNLFDPTIFDKDVSLILEFQEDLREEAGKIGEVRKVMLYDRHPEGV 393
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS-RLEQFGAEL 529
+ ++A + + +NGRWF RQ+ A DG + + + D++ S RLE + L
Sbjct: 394 AQINMASPEEADQVVARLNGRWFMKRQLKAEIHDGKTKY-KMSETDSQISQRLEGWDKFL 452
Query: 530 E 530
E
Sbjct: 453 E 453
>gi|449305014|gb|EMD01021.1| hypothetical protein BAUCODRAFT_20991 [Baudoinia compniacensis UAMH
10762]
Length = 520
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 194/392 (49%), Gaps = 74/392 (18%)
Query: 182 DGTRYKWDRGLRAWVP---QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
DGT ++W + WVP +E+ +Q Y +E + +E P +++ +A +
Sbjct: 157 DGTAWEWVGNRQKWVPAMNEEEIEAQRQAYKVEGV-----DENEPAMDL---MAKKRKAQ 208
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPD 298
++T+ +A V K+++P E+ +P NT +YVT LPD
Sbjct: 209 HDGDNTDALTTNARPV------AKKQKP----EETRKERP---------NTAIYVTSLPD 249
Query: 299 DVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV------------TYL 346
DV ++E+ + FS+ GII E+ ET +PRIK+Y D + G KG+AL+ Y
Sbjct: 250 DVDIDELHDKFSRYGIISENLETNEPRIKLYYDDD-GKFKGEALIGKSLCQHCFAYKFYF 308
Query: 347 KEPSVALATQLLDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKK------ 395
+ SV +A + D + FR P G + + A F+ + E+ +A ++KKK
Sbjct: 309 RPESVKMAIDMTDESDFRLGQALPTGPMRVHEADASFKSQKEKPLATD-EAKKKGTTANR 367
Query: 396 ------KKLKKVEEKMLGWGGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSE 442
KK + + ++ W DD +P T VIL+ MFT E++ D + S+
Sbjct: 368 DRERVIKKTEAMNSRLADWD--DDDPQALPDTSSRWDKVVILKGMFTLEELKDDPHATSD 425
Query: 443 LEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASE 502
++ADV EE K G V SV + + GV+ VRF D + A+ CI++ +GR + G ++ A
Sbjct: 426 IKADVTEEAEKFGTVSSVTLYDKEEAGVMTVRFTDARAARACIDVFDGRKYAGTKVQAYT 485
Query: 503 DDGL--VNHAAIRDLD--AEASRLEQFGAELE 530
DG ++ R++D EA RLE+F +LE
Sbjct: 486 ADGTERFKKSSQREVDEAGEAERLERFTRDLE 517
>gi|403419004|emb|CCM05704.1| predicted protein [Fibroporia radiculosa]
Length = 429
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 170/358 (47%), Gaps = 46/358 (12%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
F +DDG+ ++D AWVP + Q E P V +DL +
Sbjct: 44 FENDDGSELEYDAAKGAWVPLVSATIQ--------CMHTYRLEPSPIGKVDEDLMKAQQA 95
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQ------------VEKKEANKPPDSWFEL 285
+ +E+ +A V ++ N KRK+P+ +++ + D E
Sbjct: 96 AYSIAGVDEETPAAP--VLKRANKKRKEPEDYTSNNAGSSTGPVLKRGKKGAGADGQPER 153
Query: 286 KV-NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
K NT VYVTGLP D +E+V FSK G+I+ED E P+IK+Y ++ G GDALV
Sbjct: 154 KSKNTAVYVTGLPSDTDHDELVSRFSKFGLIEEDDEGA-PKIKLYA-RDDGTFSGDALVV 211
Query: 345 YLKEPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGE--------RFIAKQVDSKKK 395
Y KE SV LA LLD R D M V +A+F+ K E + VD KK
Sbjct: 212 YFKEESVDLAVTLLDDDELRVGDPNTRMRVRRAEFQHKHEGEGKSGEMKPRKTMVDKKKA 271
Query: 396 KKLKKVEEKMLGWGGRDDAKLTI------------PATVILRFMFTPAEMRADENLRSEL 443
+ V K+ W D +I V+L+ MFT E+ D +L +L
Sbjct: 272 TRRIGVTRKLGEWDDEDGFGPSITEEDKAPVANKNSRVVVLKHMFTLQELDEDASLLLDL 331
Query: 444 EADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ DV++EC +G V +V + + +GV+ V+F+D AQ C+ M+GR+F GR+I AS
Sbjct: 332 KEDVRDECSSLGEVTNVVLYDKEYEGVMTVKFRDPLSAQACVIKMDGRFFAGRRIEAS 389
>gi|195495582|ref|XP_002095329.1| GE22336 [Drosophila yakuba]
gi|194181430|gb|EDW95041.1| GE22336 [Drosophila yakuba]
Length = 554
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 121/390 (31%), Positives = 189/390 (48%), Gaps = 41/390 (10%)
Query: 158 GYMGRDDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMT 214
G G D+H GE +TD DG + WD AW P+ D + YG + T
Sbjct: 165 GVYGVDEH--------GERTYTDKDGVEFFWDATKSAWFPKIDDDFMARYQMNYGFIDNT 216
Query: 215 FLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKE 274
E+E +E+ + + N GKRK
Sbjct: 217 SAGEKEKAEKEAAEAKRKEEELKRMTAEAEAAMSNKNPATSNAVPTGKRK---------- 266
Query: 275 ANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK 332
+PP WFE+ NT VYV+ LP D+T++E ++ KCG++ DP+T+K ++K+Y +K
Sbjct: 267 VQEPP-KWFEMDPSQNTKVYVSNLPLDITMDEFADLMGKCGMVMRDPQTQKFKLKLYAEK 325
Query: 333 ETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQV 390
+ G KGD L Y+K SV LA ++LD R + V +A+F+ +GE + +
Sbjct: 326 D-GQIKGDGLCDYIKVESVNLALKILDEYNLRGHK---IRVQRAQFQMRGEYNPALKPKR 381
Query: 391 DSKKKKKLKKVEEKMLGWGGRDDA----KLTIPATVILRFMFTPAEMRADENLRSELEAD 446
K K+KL+K++EK+ W R D + TVI++ +FTP + L E + +
Sbjct: 382 KKKDKEKLQKMKEKLFDW--RPDKMRGERSKNEKTVIIKNLFTPELFEKEVELILEYQNN 439
Query: 447 VQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGL 506
++EEC K G V V + + HP+GV + ++A I++M GR+FG RQ+ A DG
Sbjct: 440 LREECSKCGMVRKVVIYDRHPEGVAQINMASPEEADVVIQMMQGRYFGQRQLTAESWDGK 499
Query: 507 VNHAAIRDLDAEA----SRLEQFGAELEAD 532
+ I + EA S+ ++F AE E+D
Sbjct: 500 TKY-KIDESAVEAHERLSKWDEFLAEEESD 528
>gi|390602588|gb|EIN11981.1| hypothetical protein PUNSTDRAFT_63042 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 408
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 123/401 (30%), Positives = 183/401 (45%), Gaps = 78/401 (19%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
F DDGT +WD G AW+P D DD+ +
Sbjct: 38 FEADDGTEMEWDVGKNAWLPVLD----------------------------DDVVKSQQA 69
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKV---------- 287
+ +E+ +A V ++ N KRK+P+ ++ P +
Sbjct: 70 AYSVAGVDEEAPAA--PVAKRLNKKRKEPEDYTSATQSTDPVNIKKAKVKPPKAAGERKS 127
Query: 288 -NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYVT LP D T +E++E F K G+++ED + +P++K+Y K+ G G+ALV Y
Sbjct: 128 KNTAVYVTQLPLDTTHDELIERFRKFGVLEED-DDGEPKVKMYA-KDDGTFSGEALVVYF 185
Query: 347 KEPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQK---------GE---RFIAKQVDSK 393
KE SV LA +LD R D M V +A F K GE K +D K
Sbjct: 186 KEESVTLAETMLDDAELRIGDPSTRMKVAKADFGHKQGDDHAKADGEAAGHAPRKTIDRK 245
Query: 394 K-KKKLKKVEEKMLGWGGRD---------DAKLTI---PATVILRFMFTPAEMRADENLR 440
K +++ +++K+ W D D TI V+L++MFT E+ D L
Sbjct: 246 KATRRINNMKKKLEEWDDEDAFGPAKTEEDESKTINKNSRVVVLKYMFTLQELEEDPALL 305
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
+L+ DV+EEC +G V +V + + P G++ V+F+D A+ C+ MNGR+F GR+I A
Sbjct: 306 LDLKEDVREECETLGEVTNVVLYDKEPDGIMTVKFRDSISARACVLRMNGRFFAGRRIEA 365
Query: 501 SEDDG---LVNHAAIRDL------DAEASRLEQFGAELEAD 532
S G A DL +AE RL+QF L A+
Sbjct: 366 SLYAGRQRFKRGNAGADLVEGDDEEAEKKRLDQFAQWLMAE 406
>gi|321479438|gb|EFX90394.1| hypothetical protein DAPPUDRAFT_299831 [Daphnia pulex]
Length = 460
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 120/398 (30%), Positives = 196/398 (49%), Gaps = 56/398 (14%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSS-------------QNDGYGIEEMTFLKEEEVFPT 224
+ D+ G +WD + W P+ +T + +D G E++ K+ +
Sbjct: 62 YKDESGRLLEWDLEAKEWKPKLETQAIRRIAKNPDEEFDSSDESGEEKLLEQKKSVINHH 121
Query: 225 VNVTDD-------------LANDEVGKEKLNSTEEKV-----------NSADNVVEEKHN 260
V+V D DE K +E+ S + V E
Sbjct: 122 VHVNSDGVKTYTDPSDGTIFEWDEEKKAWFPKLDEEFIARYQLSYGGEASTEPVAE--LE 179
Query: 261 GKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKED 318
K+ +P ++EKK+ ++P +WFE+ NT VYVT LP ++ EE+VE KCG+I++D
Sbjct: 180 KKKVEPPPKIEKKQDSEP--NWFEVDQTKNTKVYVTNLPTEINEEEIVEFMQKCGMIEKD 237
Query: 319 PETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF 378
ET K +IK+Y + E G KGDAL TY+K SV LA ++LDG+ + + V +A F
Sbjct: 238 LETGKHKIKLYRN-ENGQVKGDALCTYIKIESVELALKILDGSVLKDK---TVGVERATF 293
Query: 379 EQKGERFIAKQVDSKKK---KKLKKVEEKMLGWGGRDDA----KLTIPATVILRFMFTPA 431
KG K+ +KK +KL+K +EK+ W R D + V+L+ +F P
Sbjct: 294 TLKGNYDPTKKPRKRKKKDVEKLRKKQEKLFDW--RPDKLRGERARNENVVVLKRLFKPE 351
Query: 432 EMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGR 491
E + + E + D++EEC K G V V + + + +G V FK ++A +C+E+++GR
Sbjct: 352 EFDVNPAMLLEYQRDLREECAKFGHVKRVVIFDRNEEGAAQVFFKTPEEADRCVEVLHGR 411
Query: 492 WFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAEL 529
WF G++I A DG + + +A+RL+++ L
Sbjct: 412 WFAGQKITAETWDGQTKYRKDETEEEKAARLKKWSEFL 449
>gi|336464786|gb|EGO53026.1| hypothetical protein NEUTE1DRAFT_133528 [Neurospora tetrasperma
FGSC 2508]
Length = 496
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 120/401 (29%), Positives = 180/401 (44%), Gaps = 102/401 (25%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG--- 237
DDGT Y++D GL+ W+P I+E + + + T D ++G
Sbjct: 92 DDGTEYEFDEGLKRWLPI-----------IDEALIEEHQRAYITSLGDDGTETGQLGTGQ 140
Query: 238 -KEKLNSTEEKVNSADNV-------VEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNT 289
K++ N E + ++ HN + P K ++++A + P K NT
Sbjct: 141 GKKRKNDDRETFSFFPSLRARLQDSASSNHNINNR-PSKNRKREKAPREP------KQNT 193
Query: 290 HVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
VYVTGLP D T +E+ E+FS KCG+I E+ ++ +PRIK+Y D G KGDAL+ + K
Sbjct: 194 AVYVTGLPLDATADEVAELFSRKCGVIAEEIDSGRPRIKMYTDA-NGNFKGDALIVFFKP 252
Query: 349 PSVALATQLLDGTPFR----PDGK------IPMSVTQAKFEQ------------------ 380
SV +A LLD T FR PDG MS + K++
Sbjct: 253 QSVEMAIMLLDDTDFRFEPAPDGSRMRVQAADMSYKKTKYDAAAGAENGGGDKAGNGGAE 312
Query: 381 -------------------------KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDD-- 413
G ++Q +K KK +K+ K+ W DD
Sbjct: 313 GEGSSSSTPYNNITNPNPVSSDGGGNGHNQRSQQDKAKIIKKTQKLSAKLADWD--DDEP 370
Query: 414 ---AKLTIPA-----------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDS 459
A+L PA VILR MFT E+ D E++ D++EEC K+GPV +
Sbjct: 371 SPSARLLGPAIKEAKGGKWDKVVILRHMFTLEELNEDPAALLEIKDDIREECAKLGPVTN 430
Query: 460 VKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V + + +G+V V+F + A+ C+ LM+GR F GR + A
Sbjct: 431 VVLFDEEEEGIVSVKFATVEAAEACVHLMHGRAFDGRIVEA 471
>gi|242220456|ref|XP_002475994.1| predicted protein [Postia placenta Mad-698-R]
gi|220724783|gb|EED78803.1| predicted protein [Postia placenta Mad-698-R]
Length = 384
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 133/237 (56%), Gaps = 27/237 (11%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVTGLP D +E+V FSK G+I+ED E P+IK+Y ++ G GDAL+ Y K
Sbjct: 115 NTAVYVTGLPPDTEADEIVTRFSKFGLIEEDDEGA-PKIKLYA-RDDGTFSGDALIVYFK 172
Query: 348 EPSVALATQLLDGTPFR---PDGKIPMSVTQAKFEQK-------GERFIAKQVDSKKK-- 395
E SV LA LLD R P + M V +A+F K GE K V KKK
Sbjct: 173 EESVDLAVSLLDDDELRIGLPSTR--MRVQRAEFGHKHEKEGGQGEMKPRKTVSDKKKAT 230
Query: 396 KKLKKVEEKMLGWGGRD----------DAKLTIPA-TVILRFMFTPAEMRADENLRSELE 444
K++ K+++K+ W D A L A V+L+ MFT E+ D +L +L+
Sbjct: 231 KRIGKMQKKLGEWDDEDGFGPAITEEDKAPLVNKARVVVLKHMFTLQELEEDASLLLDLK 290
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
DV++EC +G V +V + + P+GV+ V+F+D AQ C+ MNGR+F GR+I AS
Sbjct: 291 EDVRDECSSLGEVTNVVLYDKEPEGVMTVKFRDLLGAQACVIKMNGRFFAGRRIEAS 347
>gi|240276109|gb|EER39621.1| nuclear mRNA splicing factor-associated protein [Ajellomyces
capsulatus H143]
Length = 368
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 165/360 (45%), Gaps = 88/360 (24%)
Query: 167 GVPASPEGEDEFTDD-----------------DGTRYKWDRGLRAWVPQEDTS---SQND 206
G P SP DEF DD DG+ + +D L+ WVP D + Q +
Sbjct: 17 GFPQSP---DEFDDDYRISYSKLENKYILEAEDGSEWYYDDALKRWVPMVDNALLEQQGE 73
Query: 207 GYGIE-----EMTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNG 261
Y +E E K++ N T N EE S + H
Sbjct: 74 AYKVEGIDEHEPADFKQQRRLKRKNYT-------------NCEEENKQS------QAHKA 114
Query: 262 KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPET 321
K+ +VNT VYVT +P D T+EE+ VFSKCG+I E+ +
Sbjct: 115 KKS---------------------RVNTAVYVTSIPLDATIEEVNNVFSKCGVIAEEIDR 153
Query: 322 KKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR-----PDGKIPMSVTQA 376
++PRIK+Y D E G KGDAL+ Y + SV LA Q+LD T FR PDGK M V A
Sbjct: 154 RRPRIKMYTDDE-GKFKGDALIVYFRPESVNLAIQMLDDTDFRLGETGPDGK--MKVQTA 210
Query: 377 KFEQKGERFIAKQVDSKKKKKLKKVEE---KMLGWGGRDDAKLTIPAT-------VILRF 426
+ K ++ K ++K+K +KK + K+ W DD P + VIL+
Sbjct: 211 DYSFKVQKMHDKPNMNEKRKIMKKTQRLNSKLADWD--DDEPSMAPQSSSRWDKVVILKH 268
Query: 427 MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIE 486
MFT E+ D +++ D+++EC K+G V +V + + GV VRF D + A+ C++
Sbjct: 269 MFTLQELEEDPAAILDIKEDIRQECSKLGDVTNVVLYDKEEDGVASVRFADVESAKICVQ 328
>gi|346979480|gb|EGY22932.1| hypothetical protein VDAG_04370 [Verticillium dahliae VdLs.17]
Length = 440
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 87/223 (39%), Positives = 127/223 (56%), Gaps = 14/223 (6%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYVTGLP D T +E+ E+FS K G++ E+ ++ +PRI +Y D++ G K GDALV +
Sbjct: 202 NTAVYVTGLPRDATADEVHELFSRKAGVVAEEIDSGRPRITMYTDEQGGFK-GDALVVFF 260
Query: 347 KEPS--VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEK 404
P+ V A D P+G P+ + + + +R +Q K KK +K++ K
Sbjct: 261 HAPTDNVPAAAAAADAKGKAPEGAAPLPPKHQQ-QSRADRDRDRQ---KIIKKTQKLDAK 316
Query: 405 MLGWGGRDDAKLTIPAT------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
+ W D A L A V+LR MFT AE+ D E++ DV+EEC K+G V
Sbjct: 317 LADWSDDDTAALPTAAASKWDRLVVLRHMFTLAELEEDPAALLEIKEDVREECAKLGAVT 376
Query: 459 SVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+V + + P GVV V+F+D + AQ CI +M+GR F GR + AS
Sbjct: 377 NVVLFDEEPDGVVSVKFRDPQAAQACIAMMHGRSFDGRVVEAS 419
>gi|340500658|gb|EGR27521.1| RNA binding domain protein [Ichthyophthirius multifiliis]
Length = 288
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 150/262 (57%), Gaps = 21/262 (8%)
Query: 277 KPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGM 336
K + W+ K NT++Y++GLP ++T+E++ + FS+ G+I++D + + G
Sbjct: 25 KKKEKWYNPKYNTYIYISGLPKNITIEKLDQFFSRAGVIRKD-HVSLEKKIKIYKDKDGS 83
Query: 337 KKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE----RFIAKQVDS 392
KGDA ++Y S+ LA +LD P G I + V +A+F Q GE R KQ +
Sbjct: 84 PKGDAAISYTMIESIDLAITMLDQREIEP-GYI-IKVEKAQFNQHGEQYKKRLGEKQNNC 141
Query: 393 KKKKKLKKVEE--KMLGWGGRD--DAKLTIPATVILRFMFTPAEMRAD---ENLRSELEA 445
K KKL+ E + LGW D D L I V ++ +F P + ++D E+L+SE+
Sbjct: 142 DKVKKLQNQAEIKQQLGWEDEDQIDTGLKI---VTIKNLFNPNQEKSDTFYEDLQSEVIQ 198
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
++Q +GPV K+ EN+P+GVV ++FK+ + AQKCIE+MNGR+F +Q+ DG
Sbjct: 199 EIQNS---VGPVQRFKIFENNPEGVVQIKFKNSQIAQKCIEIMNGRFFDEKQLICEYWDG 255
Query: 506 LVNHAAIRD-LDAEASRLEQFG 526
+ ++ ++ ++ E RL+ FG
Sbjct: 256 VTDYKTDKNEIEDEDQRLDDFG 277
>gi|406859070|gb|EKD12141.1| splicing factor U2AF-associated protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 382
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/349 (31%), Positives = 175/349 (50%), Gaps = 56/349 (16%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKL 241
DG +++D LR W P D + L+E++ V+ D+ + V K
Sbjct: 43 DGAEFEFDDALRKWRPVVDEA------------LLEEQQKAYMVSGVDEA--EPVDAMKR 88
Query: 242 NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVT 301
+E VN D G+ + K+ + +P NT VYVTGLP D T
Sbjct: 89 KRKKEYVNGED------EGGRASKAPKKAKAPLPPRP---------NTAVYVTGLPHDAT 133
Query: 302 VEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
V E+ +VFS KCG+I E+ ++ KPRIK+Y D G KGDALV + K PSV +A LLD
Sbjct: 134 VAEVHDVFSRKCGVIAEEIDSGKPRIKLYTDA-NGAFKGDALVVFFKAPSVQMAITLLDD 192
Query: 361 TPFR----PDGKIPMSVTQAKFEQ-----KGERFIAKQVDSKKKKKLK----------KV 401
T FR GK+ + ++ +++ GE A + + KKK +K K+
Sbjct: 193 TEFRFGDRSSGKMRVQAAESSYKKVQHTDGGEAGSAAETE-KKKTSMKDKQKIIKKTQKL 251
Query: 402 EEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADENLRSELEADVQEECVKIGP 456
+ ++ WG + + L ++ VIL+ MFT E+ D +++ D++EEC K+G
Sbjct: 252 DARLADWGDEEPSSLVETSSRWDKVVILKHMFTLEELAEDPAAILDIKEDIREECSKLGE 311
Query: 457 VDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V +V + + +G+ VR+ + + A+ C+ +M+GR F G+++ A DG
Sbjct: 312 VTNVVLFDLEEEGIASVRYANGEAAKACVRVMDGRSFAGQKVVAYISDG 360
>gi|392570333|gb|EIW63506.1| hypothetical protein TRAVEDRAFT_43795 [Trametes versicolor
FP-101664 SS1]
Length = 424
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 170/355 (47%), Gaps = 44/355 (12%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
F DDGT ++D AWVP D +Q Y I + +E P V L
Sbjct: 42 FEQDDGTELEYDDEKGAWVPLLDEDLVKAQQAAYSIAGV-----DEETPAAPVLKRLKKK 96
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVT 294
E +N T K N D+ +NG A P NT VYVT
Sbjct: 97 RKA-EDVNYTGAKRNKNDDA-STSNNGASSSSSSAPAAAPAPAAPPE--RKSKNTAVYVT 152
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D ++E+ F K G+I+ED E P++K+Y ++ G GDALV Y KE SV+LA
Sbjct: 153 GLPADADLDEVAARFGKFGLIEEDDEGS-PKVKLYA-RDDGSFSGDALVVYFKEESVSLA 210
Query: 355 TQLLDGTPFR---PDGKIPMSVTQAKFEQKGERFIA-----------KQVDSKKK--KKL 398
+LD R P+ + MSV +A+F K E+ A K V KK+ +++
Sbjct: 211 ITMLDDAELRVGEPNTR--MSVRRAEFGHKHEQGPAGGAGGGEARPRKTVHDKKRATRRI 268
Query: 399 KKVEEKMLGWGGRDDAKLTIP------------ATVILRFMFTPAEMRADENLRSELEAD 446
K+++K+ W D +I V+L++MFT E+ D L +L+ +
Sbjct: 269 GKMQKKLGEWDDEDGFGPSISEEDKLKEMNKHGRVVVLKYMFTLKELEEDSALLLDLKEE 328
Query: 447 VQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
V+EEC +G V +V + + P G++ V+FKD AQ C+ MNGR+F GR++ AS
Sbjct: 329 VREECETLGEVTNVVLYDEEPDGIMTVKFKDALSAQACVLKMNGRFFAGRRVEAS 383
>gi|170092249|ref|XP_001877346.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164647205|gb|EDR11449.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 377
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 125/385 (32%), Positives = 180/385 (46%), Gaps = 63/385 (16%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
+ DD T ++D WVP D +Q Y I + +E P V
Sbjct: 12 YEQDDDTELEYDPSNAVWVPLVDEDLIKNQQAAYSIAGV-----DEETPAAPV------- 59
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVT 294
+ +E E SA + + KR + DK+ +KPP NT VYVT
Sbjct: 60 -LRRESKKRKVEDYTSATPLTAAGPSIKRGKNDKR------DKPPAE--RKSKNTAVYVT 110
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
GLP D +E++E FS+CG+I+ED E +P+IK+Y ++ G G+ALV Y KE SV LA
Sbjct: 111 GLPLDTEQDELIERFSRCGVIEED-EQGEPKIKMYA-RDDGSFSGEALVVYFKEESVLLA 168
Query: 355 TQLLDGTPFRPDGKIP---MSVTQAKFEQKGERFIAKQVDSKKKK---------KLKKVE 402
+LD R +P M VT+A F K I SK +K ++ K++
Sbjct: 169 LNILDDAELRL--GVPSTVMRVTKADFAHKNNS-IGTGDQSKPRKTVNKKKTTKRIGKMQ 225
Query: 403 EKMLGWGGRDDAKLTIP-------------ATVILRFMFTPAEMRADENLRSELEADVQE 449
+K+ WG DD +P V+L+ MFT E++ D L +L+ DV+E
Sbjct: 226 KKLAEWGD-DDGFGPMPDPEDDMNVANKNSRVVVLKHMFTLQELQEDSALLLDLKEDVRE 284
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG---L 506
EC +G V +V + + GV+ V+F+D AQ C+ M+GR+F GR+I A G
Sbjct: 285 ECASLGEVTNVVLYDKEQDGVMTVKFRDTISAQACVLKMSGRFFAGRRIEADLYSGKQRF 344
Query: 507 VNHAAIRDL-----DAEASRLEQFG 526
A DL DAE RL+ F
Sbjct: 345 KRSGAGDDLGGEGEDAERKRLDDFA 369
>gi|391339544|ref|XP_003744108.1| PREDICTED: HIV Tat-specific factor 1-like [Metaseiulus
occidentalis]
Length = 393
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 173/351 (49%), Gaps = 50/351 (14%)
Query: 182 DGTRYKWDRGLRAWVPQ--ED---TSSQNDGYGI---EEMTFLKEEEVFPTVNVTDDLAN 233
DGT Y+WD RAW P+ ED N GY E + E E P + D++
Sbjct: 39 DGTAYEWDHEKRAWFPKINEDFIAMYQSNYGYAPAPEESTSKSTEPEKPPPKDNNDEV-- 96
Query: 234 DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHV 291
E EK +E++ +S+D +K+ K +K+ P +WFEL NT V
Sbjct: 97 -EASSEKEEDSEKQPSSSDAAEVAGSEILKKKGLKNAQKR-----PATWFELDDAHNTKV 150
Query: 292 YVTGLP-DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
YV+GLP +D E+ +E+ SK G++ +D + K R K+Y D G KGD + TYLK S
Sbjct: 151 YVSGLPTEDFDDEKFMEMMSKYGMLFKDA-SGKYRAKLYKDA-NGKLKGDGICTYLKVES 208
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKK---------LKKV 401
V LA QLLDG+ G + V +A+F+ KGE D KK K LKK
Sbjct: 209 VDLAIQLLDGSQM---GDQQVHVERARFQMKGE------FDPSKKPKPLMKKEKEKLKKK 259
Query: 402 EEKMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEEC-VKI 454
EK+ W G R+ + TV++R F P E D L E + D++EEC K
Sbjct: 260 IEKLFAWKPDKLRGERNKWE----NTVVIRNFFEPQEFDQDPRLVLEYQKDLREECESKF 315
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
G V + + +PQGV V FKD + A CI M GR+F GRQ+ A DG
Sbjct: 316 GETKKVVIYDRNPQGVATVTFKDVEHADACITTMKGRFFAGRQLEADHWDG 366
>gi|328856068|gb|EGG05191.1| hypothetical protein MELLADRAFT_48926 [Melampsora larici-populina
98AG31]
Length = 383
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 181/374 (48%), Gaps = 59/374 (15%)
Query: 177 EFTDDDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLAN 233
EF DDG Y+W ++WVP D S +Q Y +
Sbjct: 48 EFEADDGKEYEWIEAAQSWVPIIDQSLWTAQQAAYNLH---------------------- 85
Query: 234 DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYV 293
D+ + ++ N +RK+ K++ E + E NT VYV
Sbjct: 86 ---------------GHPDDQIRQQANAERKEKQKRMLGTEDGTGKKN--EAVRNTAVYV 128
Query: 294 TGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVAL 353
+ LP DV ++E+ E FSK G++ D E + P+IK+Y D++TG G AL+ YLK SV L
Sbjct: 129 SRLPLDVKIDEIHETFSKAGLVLVDEENR-PKIKLYEDEQTGRFNGTALIVYLKVESVEL 187
Query: 354 ATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQV--DSKKKKK-----LKKVEEKML 406
A +L D + R M V++A+F +K + + DSKKKK+ + +++ K+
Sbjct: 188 AIKLFDESSLRIGKSELMKVSRAQFSEKKSNLNSTIMMNDSKKKKQKLAKKVDQLKSKLE 247
Query: 407 GWGGRDDAKLTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
W D+ L + V+L++MFT E+ D +L +L+ DV+EEC +G V +V + +
Sbjct: 248 EWDSEDEESLKKDSRMVVLKYMFTVEELAQDPSLLIDLKEDVREECELLGNVTNVTLYDL 307
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD-----GLVNHAAIRDLDA--- 517
+G++ V+FK+ AQ CI MN R+F GRQ+ A DD NH + D
Sbjct: 308 EEEGIMTVKFKEPIAAQACILKMNNRFFAGRQVQAMIDDRSRRFKKSNHHQSNEKDGEED 367
Query: 518 EASRLEQFGAELEA 531
E RL+ F LE+
Sbjct: 368 EKKRLDGFAKYLES 381
>gi|346326798|gb|EGX96394.1| nuclear mRNA splicing factor-associated protein, putative
[Cordyceps militaris CM01]
Length = 381
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 89/254 (35%), Positives = 134/254 (52%), Gaps = 42/254 (16%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +YVTGLP D T+EE+ E+FS K G+I E+ ++ PRIK+Y D + G KGDAL+ +
Sbjct: 110 NTAIYVTGLPPDATIEEVHELFSRKGGVIAEEIDSGAPRIKLYNDAD-GNFKGDALIVFF 168
Query: 347 KEPSVALATQLLDGTPFR------PDGKIPMSVTQAKFEQ-------------------- 380
K SV +A LLD T FR +G+I + + +++
Sbjct: 169 KPQSVEMAIMLLDDTDFRLTASGTREGRIRVQEADSSYKKVQYDKDGPGKGEDGSAKPDT 228
Query: 381 ------KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-------VILRFM 427
GER + K KK +K++ K+ W D+A +P V+L+ M
Sbjct: 229 KSNGNGNGERRAPTKDRKKIIKKTQKMDAKLADWSD-DEAPANVPPISSKKDKMVVLQHM 287
Query: 428 FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIEL 487
F+ E+ AD E++ D+++EC K+G V SV + + G+V VRFKD + A CI L
Sbjct: 288 FSRQELEADPAALLEIKEDIRDECSKLGTVTSVVLYDEEEDGIVTVRFKDAESAMACINL 347
Query: 488 MNGRWFGGRQIHAS 501
M+GR FGG ++ A+
Sbjct: 348 MHGRNFGGMKVEAA 361
>gi|145535560|ref|XP_001453513.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421235|emb|CAK86116.1| unnamed protein product [Paramecium tetraurelia]
Length = 409
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/257 (36%), Positives = 145/257 (56%), Gaps = 13/257 (5%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
+ W+ K+NT+VYV GLP D+T+EEM FSK GII+ +PET +P IKIY D + G KG
Sbjct: 145 NQWYTPKINTNVYVEGLPQDITMEEMKVFFSKAGIIRINPETLQPTIKIYRD-QNGNCKG 203
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE--RFIAKQVDSKKKKK 397
D L++Y SV A ++LDG RP+ + + VT+A FEQKG+ + K+VD +K
Sbjct: 204 DGLISYKMVESVQTAKEMLDGLHIRPN--VIVKVTEAVFEQKGQYRKRENKKVDKLQKAL 261
Query: 398 LKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADE---NLRSELEADVQEECVKI 454
++ E L G++D + +I + +++P + + E L EL + E ++I
Sbjct: 262 ARQKEMTQLAEEGQEDDGKGL-KILIFKNLYSPTQAQNPEFMNQLYGELLLKI--ESLQI 318
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAA-IR 513
V ++ ++HPQGV V+F DA+ C+ ++G F R I+ DG N + I
Sbjct: 319 Y-VQKLEFFKDHPQGVAKVKFHSAYDAEICLSNLSGIEFNQRNINIQYWDGRQNFKSNIE 377
Query: 514 DLDAEASRLEQFGAELE 530
+ E RL++FG LE
Sbjct: 378 SKEVEEQRLDEFGQWLE 394
>gi|400601981|gb|EJP69606.1| splicing factor u2af-associated protein 2 [Beauveria bassiana ARSEF
2860]
Length = 373
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 134/249 (53%), Gaps = 36/249 (14%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYVTGLP D TV+E+ E+FS K G+I E+ ++ PRIK+Y D + G KGDAL+ +
Sbjct: 106 NTAVYVTGLPPDATVDEVHELFSRKGGVIAEEIDSGAPRIKMYNDAD-GTFKGDALIVFF 164
Query: 347 KEPSVALATQLLDGTPFR------PDGKIPM-----SVTQAKFEQ--------------- 380
K SV +A LLD T FR G+I + S + +++Q
Sbjct: 165 KPQSVEMAIMLLDDTDFRITANGTRQGRIRVQEADSSYKKVQYDQDGQNKGKSANHSAKS 224
Query: 381 ---KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIP-----ATVILRFMFTPAE 432
GER + K KK +K++ K+ W D + + TV+L+ MFT E
Sbjct: 225 DNGNGERRPPTKDRQKIIKKTQKMDAKLADWSDDDGPAIALANSKKDKTVVLQHMFTLQE 284
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRW 492
+ D E++ D++EEC K+G V SV + + G+V VRFK+ + A CI+LM+GR
Sbjct: 285 LVDDPAALLEIKEDIREECSKLGTVTSVVLYDEEEDGIVTVRFKEAESALACIKLMHGRS 344
Query: 493 FGGRQIHAS 501
FGG ++ A+
Sbjct: 345 FGGMRVEAA 353
>gi|302657258|ref|XP_003020354.1| hypothetical protein TRV_05547 [Trichophyton verrucosum HKI 0517]
gi|291184182|gb|EFE39736.1| hypothetical protein TRV_05547 [Trichophyton verrucosum HKI 0517]
Length = 452
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 125/441 (28%), Positives = 186/441 (42%), Gaps = 128/441 (29%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
D+G Y +D L+ W+P D S +M + EEE AN
Sbjct: 37 DNGEEYTYDDALKRWIPSLDESLMEQQRQAYKMQGVDEEEP----------AN------- 79
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
L + +EK KRK D+ ++ KP +VNT VYVT +P D
Sbjct: 80 LKTLQEK------------KKKRKHNDESTTAQKPKKP-------RVNTAVYVTAIPLDA 120
Query: 301 TVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
TV E+ +FSKCG+I E+ ++ KPRIK+Y D++ K GDALV Y + SV LA Q+LD
Sbjct: 121 TVSEVSSLFSKCGVIAEEIDSGKPRIKMYTDEQDAFK-GDALVVYFRPESVNLAIQMLDD 179
Query: 361 TPFRPDGKIP---MSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLT 417
T FR K M V A F K + + + + K K+ + +K+ + +L
Sbjct: 180 TDFRFGEKGAEGNMRVQPADFSFKAVQEAPAKANMRDKMKIIRKTQKLNKYAAVQSERLI 239
Query: 418 IP--------------------------ATVILRFMFTPAEMRADENLRS---------- 441
+ V+L+ MFT E+ + +RS
Sbjct: 240 VVWLTWAYSKLTDWDDDDVGPRHSGKAGKVVVLKHMFTLQELEVNPPIRSIQCISMFFLT 299
Query: 442 -------------ELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI--- 485
+++ D++EEC K+G V +V + + G+ VRF D + AQ C+
Sbjct: 300 SGFVSQEDPAAILDIKEDIREECSKLGEVTNVVLYDKEESGIATVRFSDPECAQACVQVW 359
Query: 486 ------------------ELMNGRWFGGRQIHASEDDGLVNH--------AAIRD----- 514
++MNGR+FGG Q+ A +G V AA++D
Sbjct: 360 LEYQHDFTDIESRANNNGQMMNGRFFGGTQVEAYVVEGKVRFKKSGASAAAALQDDGAGW 419
Query: 515 -----LDAEASRLEQFGAELE 530
D EA RL++FGA LE
Sbjct: 420 EAEAGKDDEAQRLDKFGAWLE 440
>gi|118388492|ref|XP_001027343.1| RNA binding domain protein [Tetrahymena thermophila]
gi|89309113|gb|EAS07101.1| RNA binding domain protein [Tetrahymena thermophila SB210]
Length = 563
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/274 (32%), Positives = 152/274 (55%), Gaps = 33/274 (12%)
Query: 274 EANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
+A K W++ + GLP +T E++ E FS+ G+I++DP T K +IKIY D E
Sbjct: 293 QAEKKKQKWYQSR--------GLPLSITEEKLDEFFSRAGVIRKDPITNKKKIKIYSD-E 343
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ---- 389
G+ KGDA++++ SV +A +LD P G I + V +A FEQ G+ + ++
Sbjct: 344 NGLPKGDAVISFQMMESVEIAITMLDEREIEP-GHI-IRVERANFEQHGDTYKKREGVVI 401
Query: 390 -----VDSKKKKKLKKVEEKMLGWGGRD--DAKLTIPATVILRFMFTPAEMRADEN---- 438
+D + ++K + + LGW D D L I VIL+ +FT ++ D N
Sbjct: 402 NKEKALDKIQLAQMKAAQRQALGWEDEDQIDTGLKI---VILKNVFTLKDIEEDPNFLEE 458
Query: 439 LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
LR E+ +++ C GP+ +K+ E +P+GV+ ++FK+ DA+ CI+ M+GR+F R++
Sbjct: 459 LREEMAKEIESTC---GPIQRLKIFEENPEGVIEIKFKNSADAKVCIQKMDGRYFDEREL 515
Query: 499 HASEDDGLVNHA-AIRDLDAEASRLEQFGAELEA 531
DG V++ + + ++ + RLE+FG +E
Sbjct: 516 ECFFWDGKVDYKRSQKQMEDDEQRLEEFGKWIEG 549
>gi|397642759|gb|EJK75438.1| hypothetical protein THAOC_02838, partial [Thalassiosira oceanica]
Length = 596
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 96/261 (36%), Positives = 144/261 (55%), Gaps = 24/261 (9%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK-----ETGMKKGDALVTY 345
VYVTGLP D +E+ + FSK GI+ DPET+KP++K+Y K + G+ KGDA V Y
Sbjct: 325 VYVTGLPQDTNEDEVAKYFSKVGILDLDPETQKPKVKLYRMKNGDGNKPGLLKGDASVCY 384
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKG----ERFIAKQVDSKKKKKLKKV 401
+ SV LA Q+LD +R DG ++V +AKFEQ+G +++ S+ K+K+ ++
Sbjct: 385 ARPESVELALQILDDNLYR-DGAT-LTVQRAKFEQQGSFDHRSTGGRRIVSEAKRKVARI 442
Query: 402 EE-KMLGW----GGRDDAKLTIPATVILRFMFTPAEMRADEN---LRSELEADVQEECVK 453
+ +GW GR L ++L MF P ++ D N LRS +E V+ EC +
Sbjct: 443 AALQAVGWDEGENGRIAGGLKGLRIIVLTNMFDPKDLEKDTNDEKLRS-IEKSVRAECDE 501
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGR-WFGGRQIHASEDDGLVN---H 509
+G V+ + V HP GV++V+F A ++ GR GR+I AS DG+ + H
Sbjct: 502 LGDVEKITVFSKHPAGVMVVKFSQPNAASAAVDRFKGRDGCCGRKIEASYWDGVTDYTAH 561
Query: 510 AAIRDLDAEASRLEQFGAELE 530
A ++ RL+QFG LE
Sbjct: 562 NAEQEERENEKRLDQFGDWLE 582
>gi|406604986|emb|CCH43585.1| Splicing factor U2AF-associated protein 2 [Wickerhamomyces
ciferrii]
Length = 354
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 97/262 (37%), Positives = 145/262 (55%), Gaps = 26/262 (9%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
KV+T VYV+ LP D V E+ E+FSK GII ED T K +IK+Y +++ KGDALV Y
Sbjct: 90 KVSTAVYVSNLPLDTDVTELKEIFSKYGIIAEDLLTGKSKIKLYTNEQNEF-KGDALVVY 148
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF--------EQKGERFIAKQVDSKKKKK 397
LK SV LA Q+L+ T R +G + +SV +A+F E+K +R + ++ S +K+
Sbjct: 149 LKPESVELAVQMLNETKLRVNGDV-ISVQKAEFNQSKTEDNEKKHKRPLTEEEKSIIRKR 207
Query: 398 LKKVEEKMLGWGGRDDAKLTIPA---TVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
LK + EK+ W G DD P TVI++ FT E+ D + E+E D+ E C ++
Sbjct: 208 LKTLNEKVDDWNGEDDQ--INPKWIRTVIIKRAFTLKELEEDPDALKEIEEDMMEGCEEL 265
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIEL---MNGRWFGGRQIHASE--DDGLVNH 509
G V+ V V + +GVV+VRF + + A KCI++ + + G + SE ++
Sbjct: 266 GQVEKVIVFDQEEEGVVMVRFYNSESAVKCIDVSQKLAVTRYNGEHYNKSEPKSKDKTDN 325
Query: 510 AAIR------DLDAEASRLEQF 525
IR D D E R++ F
Sbjct: 326 TPIRIPTGVEDNDEEEERIQSF 347
>gi|402590286|gb|EJW84217.1| Htatsf1 protein [Wuchereria bancrofti]
Length = 394
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/242 (34%), Positives = 136/242 (56%), Gaps = 11/242 (4%)
Query: 277 KPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKET 334
K W EL+ NT VYV+ LP +T E +E+ SKCG+I+ D T K +IK+Y + +
Sbjct: 157 KEKQGWVELEEERNTSVYVSNLPYSITEESFIELMSKCGVIQRDARTNKFKIKLYRN-DD 215
Query: 335 GMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQVD 391
G KGD Y+K+ SV +A +LDG + DGKI +SV +AKF+ KGE K++
Sbjct: 216 GTFKGDGRCCYIKKESVIMALDILDG--WNVDGKI-ISVEKAKFQMKGEFDPSKKKKKLT 272
Query: 392 SKKKKKLKKVEEKMLGWGGRD--DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQE 449
+ +KK+ + +E++ W + + T++++ MFT +M + L +LE ++++
Sbjct: 273 AAQKKRFIENQERIFEWKPEKPRNYRPISDCTIVMKNMFTLEQMMKNATLLLDLEEEIRK 332
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
C + G V V V +N+P+GV+ V FK+ + + + +NGR GRQI + DG +
Sbjct: 333 VCERFGAVKKVVVHDNNPEGVICVTFKNVEHSDTAVRSLNGRVVDGRQISVTLWDGKTKY 392
Query: 510 AA 511
Sbjct: 393 TV 394
>gi|341891028|gb|EGT46963.1| hypothetical protein CAEBREN_21533 [Caenorhabditis brenneri]
Length = 467
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/378 (30%), Positives = 188/378 (49%), Gaps = 35/378 (9%)
Query: 166 DGVPASPEGEDEFT--------DDDGT---RYKWDRGLRAWVPQEDTSSQNDGYGIEEMT 214
D VP PE E EF D + T Y WD R WVP+ ++++Q E++
Sbjct: 65 DWVPV-PEDEQEFLSQLWFEQEDQEKTARKHYDWDEEKREWVPKANSTAQ------EQLE 117
Query: 215 FLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKE 274
E+ + DE+ K K++ ++ + +E+ +++ K + +E
Sbjct: 118 V--NEDFIAEYQANYGVQYDEIYK-KMDDELQQKTAKIQKEQEEKKELKRKKKKGLHTEE 174
Query: 275 ANKPPDSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
K + W +L H VYV+ LP+D+T EE E SKCG+I+ D T KP+ K+Y D E
Sbjct: 175 EKK--EGWVDLGDKVHAVYVSNLPEDITDEEFQEFMSKCGVIQPDIRTNKPKCKLYRD-E 231
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQV 390
G KGD Y+K+ S+ LA +LDG+ F+ + + V +A FE KG+ K++
Sbjct: 232 NGELKGDGRCCYIKKESIDLACNILDGSQFKGN---TVKVEEAHFEMKGDFDPSRKRKKL 288
Query: 391 DSKKKKKLKKVEEKMLGWGGRDDAKLTIP---ATVILRFMFTPAEMRADENLRSELEADV 447
+KK+ + + K+ W D + P TVI++ +FT M + L +L+ ++
Sbjct: 289 TVAQKKRYMEQQNKIFEWTP-DKPRNYRPKSDCTVIVKNLFTLEMMAKNAALMLDLKEEM 347
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV 507
+ C K G V V V +NHP+GVV V F +++ ++ +NGR GR++ A DG
Sbjct: 348 TQSCQKYGTVKKVVVYDNHPEGVVSVTFVTTEESDMAVKYLNGRVVDGRKLTAELWDGKT 407
Query: 508 NHAAIRDLDAEASRLEQF 525
+ ++E RL++F
Sbjct: 408 KYKVQETEESEERRLKEF 425
>gi|332861778|ref|XP_003317777.1| PREDICTED: LOW QUALITY PROTEIN: HIV Tat-specific factor 1 homolog,
partial [Pan troglodytes]
Length = 725
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 153/332 (46%), Gaps = 63/332 (18%)
Query: 184 TRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKLNS 243
T Y+WD +AW P+ +T E+ T +ND N
Sbjct: 49 TPYEWDLDKKAWFPK--------------IT----EDFIATYQANYGFSNDGASSSTANV 90
Query: 244 TEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVT 301
+ +A+ +EK +P +K E K WF ++ NT+VYV+GLP D+T
Sbjct: 91 EDVHARTAEEPPQEKA----PEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDIT 146
Query: 302 VEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGT 361
V+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV LA +LLD
Sbjct: 147 VDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQ-GNLKGDGLCCYLKRESVELALKLLDED 205
Query: 362 PFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLT 417
R + V AKF+ KGE +K+ K K K +++K L W A ++
Sbjct: 206 EIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMR 262
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
VI++ MF P + R HP GV V F+D
Sbjct: 263 HERVVIIKNMFHPMDFR-------------------------------HPDGVASVSFRD 291
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 292 PEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 323
>gi|392593698|gb|EIW83023.1| hypothetical protein CONPUDRAFT_52939 [Coniophora puteana
RWD-64-598 SS2]
Length = 395
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 163/347 (46%), Gaps = 51/347 (14%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+ DDGT ++D AWVP ++E K++ + V ++ +
Sbjct: 36 YEQDDGTELEYDTAKAAWVPV-----------LDEDLIKKQQAAYSVAGVDEETPAAPIL 84
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
+E SA KR + DK+ + + N T VYVTGLP
Sbjct: 85 ARIKKRKQEDYTSASAPA---GPSKRGKGDKEQQARSKN------------TAVYVTGLP 129
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D V+E+VE FSKCG+I+ED E P+IK+Y + E G G+ALV + KE SV LA L
Sbjct: 130 LDTDVDEIVERFSKCGVIEEDDEGD-PKIKLYAN-EDGNFTGEALVVFFKEDSVILAINL 187
Query: 358 LDGTPFR-PDGKIPMSVTQAKF---------EQKGERFIAKQVDSKK-KKKLKKVEEKML 406
LD R D + V +A+F E G + VD KK +++ K+++K+
Sbjct: 188 LDEAELRIGDASTVIKVRKAEFGHKNSGGSGETNGAARPRRTVDKKKITRRIGKMQKKLQ 247
Query: 407 GWGGRD------------DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
W D A V+L+ MF ++ D L +L+ DV+EE +
Sbjct: 248 EWDDDDGFGPSKTEEDNSQALNKNSRVVVLKHMFAIEDLEKDATLLLDLKEDVREEASTL 307
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
G V +V + + P GV+ ++F++ AQ C+ MNGR+F G++I AS
Sbjct: 308 GEVTNVVLYDKEPDGVMTIKFREPLSAQACVIKMNGRFFDGKRIEAS 354
>gi|393217803|gb|EJD03292.1| hypothetical protein FOMMEDRAFT_85466 [Fomitiporia mediterranea
MF3/22]
Length = 474
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/250 (35%), Positives = 137/250 (54%), Gaps = 38/250 (15%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VYVT LP D T EE+ F++CG+++ED + P+IK+Y ++ G G+ALV Y K
Sbjct: 180 NTAVYVTRLPPDTTPEELAARFARCGVLEED-DDGDPKIKLYA-RDDGSFSGEALVVYFK 237
Query: 348 EPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGERFIA--------------KQVDS 392
E SVALA +LD R + MSV+QA+F KGE K +D
Sbjct: 238 EESVALAVAMLDEAELRIGEPSTTMSVSQAEFGHKGEVGGGGGAGAAGEGEHKPRKTIDK 297
Query: 393 KK-KKKLKKVEEKMLGWGGRDD-------AKLTIPAT-------------VILRFMFTPA 431
K+ K++ ++++K+ WG DD A +I A V+L+ MFT
Sbjct: 298 KRATKRIGRMQKKLEDWGSDDDFGPSPDAASSSIRAAGAAGGVNNKNARVVVLKHMFTLD 357
Query: 432 EMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGR 491
+++ D +L +L+ DV+EEC ++G V +V + + P GV+ V+F++ AQ C+ M R
Sbjct: 358 QLQEDASLILDLKEDVREECSQLGEVTNVVLYDEEPDGVMTVKFREPISAQACVLKMQNR 417
Query: 492 WFGGRQIHAS 501
+F GR++ AS
Sbjct: 418 FFDGRRVEAS 427
>gi|195020028|ref|XP_001985105.1| GH14692 [Drosophila grimshawi]
gi|193898587|gb|EDV97453.1| GH14692 [Drosophila grimshawi]
Length = 569
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 182/371 (49%), Gaps = 26/371 (7%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
G+ +TD DG + WD AW P+ D + YG + T E E
Sbjct: 191 GDRTYTDKDGVVFFWDASKSAWFPKIDDDFMARYQMNYGFIDNTSAGEREKAEREAAEAK 250
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKV--N 288
DE L + +A N +G + K++A PP WFE+ N
Sbjct: 251 RKEDE-----LKRMTAEAQAAMNAPLMDKDGD-PTATTALGKRKAQDPP-KWFEMDPLQN 303
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T VYV+ LP D+T++E E+ KCG++ DP+T+K ++K+Y + + G KGD L Y+K
Sbjct: 304 TKVYVSNLPLDITIDEFAELMGKCGLVMRDPQTQKYKLKLYTEAD-GQIKGDGLCDYIKV 362
Query: 349 PSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEEKML 406
SV LA ++LD R + V +A+F+ +GE + + K K+KL+K++EK+
Sbjct: 363 ESVNLALEILDEYDLRGHK---IRVQRAQFQMRGEYNPALKPKRKKKDKQKLQKIKEKLF 419
Query: 407 GWGGRDDA----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
W R D + TVI++ +F P + L E + ++++EC K G V V +
Sbjct: 420 DW--RPDKMRGERSKNEKTVIIKNLFVPELFENEVELILEYQNNLRDECGKCGMVRKVVI 477
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS-R 521
+ H +G+ + ++A I++M GR+FG RQ+ A DG + I + AEAS R
Sbjct: 478 YDRHQEGIAQINMSTPEEADVVIQMMQGRYFGQRQLSAEHWDGKTKY-KIDESAAEASER 536
Query: 522 LEQFGAELEAD 532
L ++ L A+
Sbjct: 537 LNKWDEYLAAE 547
>gi|256081369|ref|XP_002576943.1| hypothetical protein [Schistosoma mansoni]
gi|350645401|emb|CCD59930.1| hypothetical protein Smp_152440 [Schistosoma mansoni]
Length = 535
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 142/279 (50%), Gaps = 30/279 (10%)
Query: 267 DKQVEKKEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKP 324
D K++ P +W+E+ NTHVYV+GLP +T EE + SKCG+I +P T P
Sbjct: 225 DPNTRKRKQTAPLPAWYEIDESKNTHVYVSGLPPTITDEEFSALMSKCGVIMNEPFTNIP 284
Query: 325 RIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER 384
RIK+Y D + G+ KGD Y++ SV LA ++LDG + P I V +AKF+ KGE
Sbjct: 285 RIKLYKD-QAGIPKGDGRCCYVRVESVELALKILDGMLYTPGYTI--HVERAKFQPKGE- 340
Query: 385 FIAKQVDSKKKKKL--------KKVEEKMLGWGGRDDAKLT---IPATVILRFMFTPAEM 433
D KK+++L K+ +E + W D +K VIL+ F ++
Sbjct: 341 -----FDPKKRRRLTVKEKKKLKEQQENLFRW-SIDTSKFIRGKKERVVILKNAFIESDF 394
Query: 434 RADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
+ D L + ++ +C K G + + V + HP+GVV V F ++A I+ ++ F
Sbjct: 395 QRDVTLIPIVRERLRVQCAKCGIIKKIVVHDAHPEGVVSVTFSTPEEADTGIKFLSKALF 454
Query: 494 -------GGRQIHASEDDGLVNHAAIRDLDAEASRLEQF 525
G RQ+ A DG N++ D EASRL+ +
Sbjct: 455 VDYPGSGGPRQLEAERWDGKTNYSVSESKDKEASRLQSW 493
>gi|358060822|dbj|GAA93593.1| hypothetical protein E5Q_00237 [Mixia osmundae IAM 14324]
Length = 422
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/275 (38%), Positives = 145/275 (52%), Gaps = 43/275 (15%)
Query: 256 EEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGII 315
+E+ NGKR++ DK + NT VYVT LP D TVEEM VFSK G+I
Sbjct: 120 DEETNGKRRKDDKPRASR--------------NTAVYVTHLPPDATVEEMKGVFSKAGLI 165
Query: 316 KEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFR---PDGKIPMS 372
ED + +P+I+IY + E G KG+ALV YL E SV LA LLD T R +G+ M
Sbjct: 166 LEDQD-GEPKIRIYKN-EDGTPKGEALVVYLNEASVMLAETLLDDTELRLGSNEGR--MR 221
Query: 373 VTQAKFEQKGERFIA----------KQVDSKKKKKLKKVEEKMLGWGGRDD--------- 413
V +A F + A +Q +K+KK+ +++ K+ W DD
Sbjct: 222 VAKADFSKSANSNNANDQSEGSKKSEQDKAKQKKRAERLRSKLTDWSSDDDTLTAQTKAP 281
Query: 414 -AKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVK-IGPVDSVKVCENHPQGV 470
AK T V+L MFT E+ D L EL+ +V EEC + IG V+++ + + GV
Sbjct: 282 TAKATRFARIVVLYHMFTLTELEEDPTLLLELKEEVLEECEETIGKVNAITLYDKEADGV 341
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+ ++F D AQ C+ MNGR+F GRQI AS DG
Sbjct: 342 ITIKFADAIAAQACVLKMNGRFFAGRQIEASIFDG 376
>gi|402224210|gb|EJU04273.1| splicing factor u2af-associated protein 2 [Dacryopinax sp. DJM-731
SS1]
Length = 399
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 117/384 (30%), Positives = 187/384 (48%), Gaps = 59/384 (15%)
Query: 178 FTDDD--GTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLA 232
F DDD GT +WD R WVP D +Q Y ++ + +E P V
Sbjct: 45 FEDDDEEGTEMEWDAKKRNWVPVIDEDLVRAQQAAYSVQGV-----DESAPAAPV----- 94
Query: 233 NDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVY 292
+ +EK +E+V+ K+ +P+ + K NT VY
Sbjct: 95 ---LAREKKKRKKEQVDYTSGNTTPLPPAKKTKPEPKRGK---------------NTAVY 136
Query: 293 VTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVA 352
V+ +P D T +E+++ F K G+++E+ E + P++K+Y D+E G+ G ALV Y KE SV
Sbjct: 137 VSSIPLDATKDEIIDRFGKFGVLEEEDEGE-PKVKMYADEE-GLFNGTALVVYFKEESVT 194
Query: 353 LATQLLDGTPFRPDGKIPMSV-----TQAKFEQKGERFIAKQV--DSKKKKKLKKVEEKM 405
LA +LD R M V K++ K E + ++V SK ++LK++E ++
Sbjct: 195 LAITMLDEAELRIGEGGSMRVELPNYDNDKWKNKREAKVERKVVDKSKASRRLKRMEGRI 254
Query: 406 LGWGGRD------DAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
W D + K +PA V+L+ MFT ++ +D +L +L+ DV+EEC +
Sbjct: 255 AEWDDEDGFGPQLEKKEPLPAPLESRVVVLKHMFTLDQLASDPSLLLDLKEDVREECEAL 314
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD 514
G V +V + + P+GV+ V+F+D AQ C+ MNGR+F R++ A G D
Sbjct: 315 GQVTNVILYDKEPEGVMTVKFRDPVSAQACVLKMNGRYFDKRRVLAELYTGRQRFKRTGD 374
Query: 515 ------LDAEASRLEQFGAELEAD 532
+AE RL++F A L A+
Sbjct: 375 DITKEVDEAEKQRLDEFAAWLMAE 398
>gi|342318991|gb|EGU10943.1| Splicing factor u2af-associated protein 2 [Rhodotorula glutinis
ATCC 204091]
Length = 1453
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 164/351 (46%), Gaps = 64/351 (18%)
Query: 180 DDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKE 239
D DG +WD+ AWVP +TD++ +
Sbjct: 55 DRDGNELEWDQARNAWVPA----------------------------ITDEVLKAQQAAY 86
Query: 240 KLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDD 299
++ +E+ + EEK KRK D+ K++ KP + NT V+V+ LP
Sbjct: 87 SVDGVDEEAPAGPAAKEEKKGKKRKAADENGGKQK--KP-------RANTAVFVSSLPLS 137
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
TV+++V FSK G+I ED E P++K+Y D E G KG+ALV YL+E SV LA +L D
Sbjct: 138 TTVDQLVSTFSKAGLILEDVE-GNPKVKLYKD-ENGRFKGEALVVYLQEASVELACRLFD 195
Query: 360 GTPF-RPDGKIPMSVTQAKFEQ------------------KGERFIAKQVDSKKKKKLKK 400
T G +SV A+++ G + Q+ +++ KK
Sbjct: 196 ETELVLGSGDKVISVKAAEWDNSKKEKADGADKGEGGSQGNGAKGKPDQLKARQGKKAAA 255
Query: 401 VEEKMLGWGGRDDAKLTIP------ATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
+ +K+ W +D V+L MFT E+ D L +L+ DV+EEC +
Sbjct: 256 LRQKLEDWSDDEDPNAAAARARKYRGVVVLEGMFTLQELEDDPTLLLDLKEDVREECETL 315
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
G V +V + + +GV+ VRFK+ AQ CI MNGR+FGGRQI A DG
Sbjct: 316 GEVTNVTLYDKEDKGVMTVRFKEELAAQACIAKMNGRFFGGRQISAFPMDG 366
>gi|158295648|ref|XP_316330.4| AGAP006266-PA [Anopheles gambiae str. PEST]
gi|157016137|gb|EAA10753.4| AGAP006266-PA [Anopheles gambiae str. PEST]
Length = 665
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 181/364 (49%), Gaps = 28/364 (7%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
+ D DG Y WD +AW P+ D + YG + T + ++
Sbjct: 222 YKDKDGAVYFWDEERKAWFPKVDDEFMAIYQLNYGFIDNTSSGGGPAA-AAPAPARVVSE 280
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVY 292
G+E E+ + AD EE K ++K +PP WFE+ + NT VY
Sbjct: 281 PSGRELPGCDED--DDADEQGEEAQRALAAAAAKGKKRKAPPEPP-KWFEIAPEHNTKVY 337
Query: 293 VTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVA 352
V+ LP D++ EE E+ SKCG++ +DP+T K ++K+Y + + G KGD L Y+K SV
Sbjct: 338 VSNLPTDISEEEFGELMSKCGMVMKDPKTHKLKLKLYREPD-GTLKGDGLCHYIKIESVD 396
Query: 353 LATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEEKMLGW-- 408
LA ++LD R + V +A+F+ +GE + ++ K+K++LKK++E + W
Sbjct: 397 LALKILDNYDVRGHK---IKVQRAEFQMRGEYNPTLKPKMRKKEKERLKKIQESLFDWRP 453
Query: 409 ----GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
G R + VI++ +F P + +L E + D++EEC K G V V + +
Sbjct: 454 VKMRGERSKHERI----VIIKNLFEPELFDREVHLLLEYQNDLREECGKCGTVRRVLLYD 509
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIR---DLDAEASR 521
HP+GV V D ++A ++LMNGR+FG R++ A+ DG + D+D
Sbjct: 510 RHPEGVAQVTMGDPEEADLVVQLMNGRFFGQRKLTAAIWDGRTKYRIAETDADIDKRRGN 569
Query: 522 LEQF 525
EQ+
Sbjct: 570 WEQY 573
>gi|170590123|ref|XP_001899822.1| HIV TAT specific factor 1 [Brugia malayi]
gi|158592741|gb|EDP31338.1| HIV TAT specific factor 1, putative [Brugia malayi]
Length = 394
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/252 (34%), Positives = 140/252 (55%), Gaps = 13/252 (5%)
Query: 263 RKQPDKQVEKKEAN--KPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKED 318
RK D + K + + K W EL+ NT VYV+ LP +T E E+ KCG+I+ D
Sbjct: 141 RKSEDHKHTKHKLHDKKEQQGWVELEEERNTSVYVSNLPYSITEESFTELMGKCGVIQRD 200
Query: 319 PETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF 378
T K +IK+Y + + G KGD Y+K+ SV +A +LDG + DGKI +SV +AKF
Sbjct: 201 ARTNKLKIKLYRN-DDGTFKGDGRCCYIKKESVVMALDILDG--WNVDGKI-ISVEKAKF 256
Query: 379 EQKGE---RFIAKQVDSKKKKKLKKVEEKMLGWGGRD--DAKLTIPATVILRFMFTPAEM 433
+ KGE K++ + +KK+ + +E++ W + + T++++ MFT +M
Sbjct: 257 QMKGEFDPSKKKKKLTAAQKKRFIENQERIFEWKPEKPRNYRPISDCTIVMKNMFTLEQM 316
Query: 434 RADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
+ L +LE +V++ C + G V V V +N+P+GV+ V F++ + + + +NGR
Sbjct: 317 MKNATLLLDLEEEVRKVCERFGAVKKVVVHDNNPEGVICVTFQNVEHSDTAVRSLNGRVV 376
Query: 494 GGRQIHASEDDG 505
GRQI + DG
Sbjct: 377 DGRQISVTLWDG 388
>gi|116780381|gb|ABK21659.1| unknown [Picea sitchensis]
Length = 110
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/120 (57%), Positives = 81/120 (67%), Gaps = 12/120 (10%)
Query: 213 MTFLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEK 272
MTF++EEE+ P + + D S E++ VE KH KRK Q EK
Sbjct: 1 MTFVEEEEIMPAIMIAD------------ISDEKECKDDSLAVESKHGLKRKLEADQDEK 48
Query: 273 KEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK 332
KEAN P DSWF+LKVNTHVYVTGLP+DVT EE+VE FSKCGIIKED +TKKPR+KIYV K
Sbjct: 49 KEANTPADSWFDLKVNTHVYVTGLPEDVTTEEIVEAFSKCGIIKEDLDTKKPRVKIYVTK 108
>gi|389739977|gb|EIM81169.1| hypothetical protein STEHIDRAFT_172176 [Stereum hirsutum FP-91666
SS1]
Length = 454
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 121/381 (31%), Positives = 173/381 (45%), Gaps = 73/381 (19%)
Query: 177 EFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
+F DD+G ++D G AWVP ++E ++ + V ++ V
Sbjct: 46 KFEDDNGDELEYDAGKGAWVPV-----------VDEELLKRQAAAYSIAGVDEEAPAAPV 94
Query: 237 GKE-KLNSTEEKVNSA---DNVVEEKHNGKRKQPDKQVEKKE-ANKPPDSWFELKVNTHV 291
K K E SA + K +P A+ PP + NT V
Sbjct: 95 AKRAKKRKQPEDYTSATPDSGPISIKRGKNSSKPSASTSNGTGADAPPP---KKSKNTAV 151
Query: 292 YVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKET---------------GM 336
+VT LP D T EE+V FSKCG+I+ED + +P++K+Y +ET G
Sbjct: 152 FVTRLPLDATHEEIVSCFSKCGLIEED-DDGEPKVKMYA-RETDEGSNGNGNGAGGGGGE 209
Query: 337 KKGDALVTYLKEPSVALATQLLDGTPFR-PDGKIPMSVTQAKF----------------E 379
G+ALV Y KE SV LA +LD T R D M V +A+F
Sbjct: 210 FSGEALVVYFKEDSVELAVNILDDTELRLGDDTTRMRVQRAEFGHKGEEGGGSGAPGSAS 269
Query: 380 QKGERFIAKQVDSKKK-KKLKKVEEKMLGWGGRD-------DAKLTIPA----------- 420
KGER K VD +K ++L K+++K+ W D DA P
Sbjct: 270 GKGER-TRKVVDKRKTTRRLGKMQKKLEEWDDEDNFGPSLTDADRLGPGPQGAGANPNSR 328
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MFT E+ D L +L+ DV+EEC +G V +V + + P+G++ V+F+D
Sbjct: 329 VVVLRHMFTLEELEKDATLLLDLKEDVREECETLGEVTNVVLYDKEPEGIMTVKFRDPVS 388
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
AQ C+ MNGR+F GR I AS
Sbjct: 389 AQACVIKMNGRFFAGRAIEAS 409
>gi|388855188|emb|CCF51082.1| related to Splicing factor U2AF-associated protein 2 [Ustilago
hordei]
Length = 448
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 141/292 (48%), Gaps = 59/292 (20%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T +Y+TGLP D T EE+ VFS+ G++ ED E KPR+K+Y D+ TG+ +G+ALV Y K
Sbjct: 139 TSLYITGLPLDATQEEIARVFSRYGVLLEDDE-GKPRVKMYYDERTGVFRGEALVVYFKA 197
Query: 349 PSVALATQLLDGTPFR--------PDGKIPMSVTQAKF---------------------- 378
SV LA ++LD T R G M V +A+F
Sbjct: 198 ESVELAIRMLDETSLRGAIGGSNNAGGGPVMRVQRAQFPATSNNTTSSATAEKQEESTNG 257
Query: 379 ------EQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKL---------TIPA--- 420
+ R + Q K++ K+E K+ W RDD L T P+
Sbjct: 258 KDEGPSTTRQRRNLTDQERKNIAKRVAKLENKLSDW--RDDDPLSRSSVPLATTSPSTRA 315
Query: 421 ----TVILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRF 475
TV+L MFT E+ +D L +L+ DV++EC KIG V +V + + P G++ VRF
Sbjct: 316 AKGRTVVLTKMFTLHELDSDPTLLLDLKEDVRDECTAKIGGVTNVVLWDKEPAGIMTVRF 375
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRD---LDAEASRLEQ 524
+ + A+ C+++M R+F R I A D + R+ D EAS EQ
Sbjct: 376 QTEQQAEACVKMMKARYFAQRSIDAWLADATPSFKKSRNDGAADDEASDAEQ 427
>gi|341880660|gb|EGT36595.1| hypothetical protein CAEBREN_04351 [Caenorhabditis brenneri]
Length = 470
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 110/358 (30%), Positives = 179/358 (50%), Gaps = 35/358 (9%)
Query: 166 DGVPASPEGEDEFT--------DDDGT---RYKWDRGLRAWVPQEDTSSQNDGYGIEEMT 214
D VP PE E EF D + T Y WD R WVP+ ++++Q E++
Sbjct: 65 DWVPV-PEDEQEFLSQLWFEQEDQEKTARKHYDWDEEKREWVPKANSTAQ------EQLE 117
Query: 215 FLKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKE 274
E+ + DE+ K K++ ++ + +E+ +++ K + +E
Sbjct: 118 V--NEDFIAEYQANYGVQYDEIYK-KMDDELQQKTAKIQKEQEEKKELKRKKKKGLHTEE 174
Query: 275 ANKPPDSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
K + W +L H VYV+ LP+D+T EE E SKCG+I+ D T KP+ K+Y D E
Sbjct: 175 EKK--EGWVDLGDKVHAVYVSNLPEDITDEEFQEFMSKCGVIQPDIRTNKPKCKLYRD-E 231
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQV 390
G KGD Y+K+ S+ LA +LDG+ F+ + + V +A FE KG+ K++
Sbjct: 232 NGELKGDGRCCYIKKESIDLACNILDGSQFKGN---TVKVEEAHFEMKGDFDPSRKRKKL 288
Query: 391 DSKKKKKLKKVEEKMLGWGGRDDAKLTIP---ATVILRFMFTPAEMRADENLRSELEADV 447
+KK+ + + K+ W D + P TVI++ +FT M + L +L+ ++
Sbjct: 289 TVAQKKRYMEQQNKIFEWTP-DKPRNYRPKSDCTVIVKNLFTLEMMAKNAALMLDLKEEM 347
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+ C K G V V V +NHP+GVV V F +++ ++ +NGR GR++ A DG
Sbjct: 348 TQSCQKYGSVKKVVVYDNHPEGVVSVTFVTTEESDMAVKYLNGRVVDGRKLTAELWDG 405
>gi|156086920|ref|XP_001610867.1| hypothetical protein [Babesia bovis T2Bo]
gi|154798120|gb|EDO07299.1| conserved hypothetical protein [Babesia bovis]
Length = 361
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 151/317 (47%), Gaps = 13/317 (4%)
Query: 226 NVTDDLANDEVGKEKLNSTEEKVNSAD----NVVEEKHNGKRKQPDKQ-VEKKEANKPPD 280
+V D L ND V +L T D +V E+ ++++ KQ +++K A
Sbjct: 22 SVNDPLPNDNVSNTELTDTVTISPVGDGDKPSVDEDPERIRKREKKKQYLQRKRARIESG 81
Query: 281 SWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGD 340
W + N VY+TGLPDDVT EE+ VF + G+IK DP T P+I++Y D + G+ K D
Sbjct: 82 QWIDSNRNLSVYITGLPDDVTSEEIANVFRRAGLIKIDPITTLPKIRMYTDAQ-GVFKND 140
Query: 341 ALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL-- 398
A VT++ + SV A + LD FRPD I + +++ + +K+
Sbjct: 141 ARVTFVNKESVDFAIRYLDNYHFRPDCVIHVEKATYNPQKRASNSTVSLSGEELRKRYLA 200
Query: 399 KKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADEN---LRSELEADVQEECVKIG 455
K E+ L +D T VI + MF+ + A E +L +VQ E K
Sbjct: 201 AKYEQNRLQSWDQDIDDGTGRRIVICKPMFSTEDAWAHEAGDVFYDDLRDEVQYEITKFV 260
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAA--IR 513
PV+ V HPQGVV V+ K DA+ I R F GR++ DG + A +
Sbjct: 261 PVEKVTPIARHPQGVVCVKLKTSADAEIFISQFQDRLFDGRRLQVYFFDGKTDLQAQCLP 320
Query: 514 DLDAEASRLEQFGAELE 530
DA+ + L + AEL+
Sbjct: 321 SKDAKEAALRRATAELK 337
>gi|428673000|gb|EKX73913.1| hypothetical protein BEWA_039510 [Babesia equi]
Length = 360
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/237 (37%), Positives = 123/237 (51%), Gaps = 14/237 (5%)
Query: 282 WFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDA 341
W + VY++GLP D TVEE+ +VF + G+IK DP T P++K+Y D+E G K DA
Sbjct: 68 WVDSSKVYSVYISGLPKDTTVEEVSQVFKRAGVIKIDPITTLPKVKLYTDEE-GNLKSDA 126
Query: 342 LVTYLKEPSVALATQLLDGTPFRPDGKIPMSVT---------QAKFE-QKGERFIAKQVD 391
VT++ + SV LA + D + FR I + QAK+E QKG+R + +
Sbjct: 127 TVTFVNQESVDLALRYFDNSEFRTGYIIHVEKVCMDNKCIRIQAKYEPQKGKRILTTDPE 186
Query: 392 SKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTP--AEMR-ADENLRSELEADVQ 448
+KK K E++ L G D T VI + MF+ AEM AD + EL+ ++
Sbjct: 187 MRKKYLAAKYEQERLQSWGYDMDDGTDRRIVISKPMFSTKEAEMHEADSDFYKELQEEIH 246
Query: 449 EECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
E K V+ V HPQG+V ++FK DAQ I N R F GRQ+ DG
Sbjct: 247 TEVAKYVEVEKVTPIPRHPQGIVCIKFKTGLDAQTFISKFNNRMFDGRQLDVYFFDG 303
>gi|296471225|tpg|DAA13340.1| TPA: HIV-1 Tat specific factor 1 [Bos taurus]
Length = 743
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/244 (36%), Positives = 124/244 (50%), Gaps = 40/244 (16%)
Query: 272 KKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
K E K WF ++ NT+VYV+GLP D+TV+E +++ SK GII DP+T++ ++K+Y
Sbjct: 114 KGEKRKAESGWFHVEEGRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLY 173
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
D + G KGD L YLK SV LA +LLD R + V AKF+ KGE +K+
Sbjct: 174 KDNQ-GNLKGDGLCCYLKRESVDLALKLLDEDEIR---GYKLHVEVAKFQLKGEYDASKK 229
Query: 390 VDSKKKKKLK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEA 445
K K K +++K L W A ++ VI++ MF P +
Sbjct: 230 KKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDF------------ 277
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
E HP GV V F+D ++A CI+ +NGRWFGGRQI A DG
Sbjct: 278 ------------------ERHPDGVASVSFRDPEEADYCIQTLNGRWFGGRQITAQAWDG 319
Query: 506 LVNH 509
++
Sbjct: 320 TTDY 323
>gi|221482159|gb|EEE20520.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 728
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 153/301 (50%), Gaps = 29/301 (9%)
Query: 217 KEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEAN 276
+E+ VFP ++++ + G+ + T E + A+ +E + KR+ ++++KK
Sbjct: 349 REDMVFP--GLSEEEVDRAEGEGEQGRTRE-LTEAERQKKEANAEKRRMYRQRLKKK--- 402
Query: 277 KPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGM 336
K W + N ++YV LP D T EE+ E+F K G+ K DPET +P+I++Y D +TG
Sbjct: 403 KQEGRWVAGRKNPNIYVANLPADCTEEELAEIFKKAGVFKIDPETLRPKIRVYTD-DTGR 461
Query: 337 KKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQV------ 390
KGDAL++++ E SV +A + DG FR DG + V +A+F K + + +
Sbjct: 462 CKGDALISFVHENSVDIAIKYFDGFSFR-DGACTLKVQRAEFAPKAGQGTSPKTEGSEGG 520
Query: 391 ---------DSKKKKKLKKVEEKMLGWG-GRDDAKLTIPATVILRFMFTPAEMRADEN-- 438
D +K K +E++L WG DD + +IL+ ++ E E
Sbjct: 521 SAESRTGKRDRRKYLAAKYEQERLLSWGDAIDDG--SGRRIIILKPTYSSEEAELYEEGD 578
Query: 439 -LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
EL ++ +E + + V V H QGV V+ K +DA++ IE GR+F GRQ
Sbjct: 579 AFYEELRQELFDEIAQFAKPEKVTVIPRHVQGVACVKLKTAEDAERIIEQFRGRYFDGRQ 638
Query: 498 I 498
+
Sbjct: 639 L 639
>gi|312084737|ref|XP_003144397.1| HIV TAT specific factor 1 [Loa loa]
gi|307760441|gb|EFO19675.1| HIV TAT specific factor 1 [Loa loa]
Length = 394
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/341 (30%), Positives = 173/341 (50%), Gaps = 50/341 (14%)
Query: 182 DGTRYKWDRGLRAWVP----QEDTSSQ-NDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
DG +W++ + W+P ED +Q + YGI+ +
Sbjct: 91 DGVTMEWNQISQQWLPVVEINEDFLAQYHSNYGIKY----------------------DF 128
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVT 294
G+++ +S EEK ++N KH K DK K W EL+ NT VYV+
Sbjct: 129 GQKQGSSMEEKDEKSENPKPVKH----KLHDK--------KEQQGWVELEEERNTSVYVS 176
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
LP +T E +E+ SKCG+I+ D T K +IK+Y + + G KGD Y+K+ SV +A
Sbjct: 177 NLPFSITEESFIELMSKCGVIQRDARTNKLKIKLYKNDD-GSVKGDGRCCYIKKESVIMA 235
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQVDSKKKKKLKKVEEKMLGWGGR 411
+LDG + DGKI +SV +AKF+ KGE K++ + +KK+ + +E++ W
Sbjct: 236 LDILDG--WNVDGKI-ISVEKAKFQMKGEFDPLKKKKKLTAAQKKRFIESQERIFEWKPE 292
Query: 412 D--DAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG 469
+ + T++++ MFT +M + L +LE +V++ C + G V V + +N+P+G
Sbjct: 293 KPRNYRPISDCTIVMKNMFTLEQMMKNATLLLDLEEEVKKVCERFGTVKKVTIHDNNPEG 352
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHA 510
V+ V F++ + + + +N R GRQI + DG +A
Sbjct: 353 VICVTFQNVEHSDIAVRALNDRVVDGRQISVTLWDGKTKYA 393
>gi|221502462|gb|EEE28189.1| splicing factor u2af-associated protein, putative [Toxoplasma
gondii VEG]
Length = 728
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 153/301 (50%), Gaps = 29/301 (9%)
Query: 217 KEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEAN 276
+E+ VFP ++++ + G+ + T E + A+ +E + KR+ ++++KK
Sbjct: 349 REDMVFP--GLSEEEVDRAEGEGEQGRTRE-LTEAERQKKEANAEKRRMYRQRLKKK--- 402
Query: 277 KPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGM 336
K W + N ++YV LP D T EE+ E+F K G+ K DPET +P+I++Y D +TG
Sbjct: 403 KQEGRWVAGRKNPNIYVANLPADCTEEELAEIFKKAGVFKIDPETLRPKIRVYTD-DTGR 461
Query: 337 KKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQV------ 390
KGDAL++++ E SV +A + DG FR DG + V +A+F K + + +
Sbjct: 462 CKGDALISFVHENSVDIAIKYFDGFSFR-DGACTLKVQRAEFAPKAGQGTSPKTEGSEGG 520
Query: 391 ---------DSKKKKKLKKVEEKMLGWG-GRDDAKLTIPATVILRFMFTPAEMRADEN-- 438
D +K K +E++L WG DD + +IL+ ++ E E
Sbjct: 521 SAESRTGKRDRRKYLAAKYEQERLLSWGDAIDDG--SGRRIIILKPTYSSEEAELYEEGD 578
Query: 439 -LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
EL ++ +E + + V V H QGV V+ K +DA++ IE GR+F GRQ
Sbjct: 579 AFYEELRQELFDEIAQFAKPEKVTVIPRHVQGVACVKLKTAEDAERIIEQFRGRYFDGRQ 638
Query: 498 I 498
+
Sbjct: 639 L 639
>gi|428162656|gb|EKX31778.1| hypothetical protein GUITHDRAFT_149069, partial [Guillardia theta
CCMP2712]
Length = 447
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 130/228 (57%), Gaps = 37/228 (16%)
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFS--KCGIIKEDPE-TKKPRIKIYVDKETGMKKGDA 341
+K NT+VYV GLP D T+EE+ F + G++K E K P+IK+Y D E G KGDA
Sbjct: 239 IKDNTYVYVQGLPVDTTMEEVATFFRHLQFGVLKPSEEDGKTPKIKLYRD-EDGNLKGDA 297
Query: 342 LVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK-- 399
+ +LK S+ L + DG+ FR D K P+ V A+F+ KG++++ K+ S++ KK +
Sbjct: 298 RIAFLKAASIPL---IADGSNFR-DDKYPLKVQPAQFQMKGQQYVEKEKMSEEAKKERQK 353
Query: 400 -----KVEEKMLGWG---GRDDAK-LTIPATVILRFMFTPAEMRADENLRSELEADVQEE 450
K +E+ L WG G DD + L I VIL+ MFT EM+ D L+
Sbjct: 354 RIQAIKKQERALDWGADEGIDDGRGLRI---VILKNMFTVEEMKVDAEAAITLQ------ 404
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
V + + +P+GVV ++FK A++CIE+ NGR+FGGR++
Sbjct: 405 ---------VTIFDRNPEGVVAIKFKQATSAEECIEIFNGRFFGGRKL 443
>gi|237843089|ref|XP_002370842.1| splicing factor U2AF-associated protein, related [Toxoplasma gondii
ME49]
gi|211968506|gb|EEB03702.1| splicing factor U2AF-associated protein, related [Toxoplasma gondii
ME49]
Length = 728
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 153/301 (50%), Gaps = 29/301 (9%)
Query: 217 KEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEAN 276
+E+ VFP ++++ + G+ + T E + A+ +E + KR+ ++++KK
Sbjct: 349 REDMVFP--GLSEEEVDRAEGEGEQGRTRE-LTEAERQKKEANAEKRRMYRQRLKKK--- 402
Query: 277 KPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGM 336
K W + N ++YV LP D T EE+ E+F K G+ K DPET +P+I++Y D +TG
Sbjct: 403 KQEGRWVAGRKNPNIYVANLPADCTEEELAEIFKKAGVFKIDPETLRPKIRVYTD-DTGR 461
Query: 337 KKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQV------ 390
KGDAL++++ E SV +A + DG FR DG + V +A+F K + + +
Sbjct: 462 CKGDALISFVHENSVDIAIKYFDGFSFR-DGACTLKVQRAEFAPKAGQGTSPKTEGSEGG 520
Query: 391 ---------DSKKKKKLKKVEEKMLGWG-GRDDAKLTIPATVILRFMFTPAEMRADEN-- 438
D +K K +E++L WG DD + +IL+ ++ E E
Sbjct: 521 SAESRTGKRDRRKYLAAKYEQERLLSWGDAIDDG--SGRRIIILKPTYSSEEAELYEEGD 578
Query: 439 -LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
EL ++ +E + + V V H QGV V+ K +DA++ IE GR+F GRQ
Sbjct: 579 AFYEELRQELFDEIAQFAKPEKVTVIPRHVQGVACVKLKTAEDAERIIEQFRGRYFDGRQ 638
Query: 498 I 498
+
Sbjct: 639 L 639
>gi|50554841|ref|XP_504829.1| YALI0F00660p [Yarrowia lipolytica]
gi|49650699|emb|CAG77631.1| YALI0F00660p [Yarrowia lipolytica CLIB122]
Length = 364
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 141/269 (52%), Gaps = 35/269 (13%)
Query: 254 VVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCG 313
V+EE+ N K+ Q ++ +KPP +YV+GLP T +E+V++F K G
Sbjct: 80 VIEERRNAKKAQ-------QKGDKPP--------QKAIYVSGLPSTATNDELVDIFQKYG 124
Query: 314 IIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTP-FRPDGKIPMS 372
++ ED T K + ++YVD E G KGD LV + K SV LA +L P + D + ++
Sbjct: 125 VLAEDVYTGKKKARVYVD-EQGKGKGDGLVVFFKPESVKLAVDMLHNQPVYVGDTMVTLN 183
Query: 373 VTQAKFE-QKGERFIAKQ-----------VDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA 420
V A F+ +KG KQ +K K+K ++++++ W + ++ +
Sbjct: 184 VQPAVFDKEKGSSDNKKQETNSGPNYSEEAKAKAKRKYTQLQQRLNDWDEEEVKRVKTES 243
Query: 421 ------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
V L+ +FT E++ D + +++ D+ C IG V +V + + P GVV V+
Sbjct: 244 REKSSKVVTLKRVFTIQELQDDVDAEMDIKEDIYNGCGAIGTVTNVTLYDLEPDGVVTVK 303
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHASED 503
F+ DA +C+E MNGR+FGG+++ A D
Sbjct: 304 FERASDAAECVEKMNGRFFGGQKLEAYID 332
>gi|350014158|dbj|GAA37173.1| HIV Tat-specific factor 1 [Clonorchis sinensis]
Length = 552
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 139/273 (50%), Gaps = 29/273 (10%)
Query: 272 KKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
K++ P +W+E+ NTHVYVTGLP D+T + E+ SK G+I +P T +PRIK+Y
Sbjct: 230 KRKTTTPAPAWYEMSEDKNTHVYVTGLPVDITESDFFELMSKYGVIMNEPFTDRPRIKLY 289
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ 389
+D+E G KGD Y+K SV LA +LL+G + P G + + V +AKF KG Q
Sbjct: 290 LDEE-GKPKGDGRCCYVKVESVDLAMKLLEGMEYSP-GHL-LHVERAKFTPKG------Q 340
Query: 390 VDSKKKKKL--------KKVEEKMLGWGGRDDAKLTIPA---TVILRFMFTPAEMRADEN 438
D KK+++L K+ +E + W G D +K ++L+ F + D
Sbjct: 341 FDPKKRRRLTLKEKKKLKEQQESLFRW-GIDTSKFVRSKKERVLVLKNAFVETDFVTDVT 399
Query: 439 LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF----- 493
L + ++ +C K G V + V + +P GVV V F ++A + +N F
Sbjct: 400 LIPLVRDRLRAQCAKCGVVKKIVVHDTNPLGVVTVTFNTPEEADTALGFLNKALFNYPGP 459
Query: 494 -GGRQIHASEDDGLVNHAAIRDLDAEASRLEQF 525
G RQ+ DG N++ D E +R+ ++
Sbjct: 460 GGVRQLQVERWDGRTNYSLKEDESEEMNRIHKW 492
>gi|254573048|ref|XP_002493633.1| hypothetical protein [Komagataella pastoris GS115]
gi|238033432|emb|CAY71454.1| hypothetical protein PAS_chr4_0929 [Komagataella pastoris GS115]
gi|328354539|emb|CCA40936.1| Splicing factor U2AF-associated protein 2 [Komagataella pastoris
CBS 7435]
Length = 338
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 128/230 (55%), Gaps = 17/230 (7%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T VY+T LP D T+EE+ + F K G I E+ T + +IK+Y ++E K GDALV YLK
Sbjct: 89 TGVYITNLPSDTTLEEIEDAFGKLGTIGENLITGEKKIKLYRNEENQFK-GDALVVYLKP 147
Query: 349 PSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKK------KKKLKKVE 402
SV LA ++LDG +RP K + V +A F K A D + KK+L++++
Sbjct: 148 ESVDLAIEMLDGIQWRPLSKETIHVEKADFSHKESDSKAHNKDLTEEQKQVIKKRLQRLK 207
Query: 403 EKMLGWGG-------RDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
+ W R A+ V+L+ +F +++ E+ E++ D++E C +IG
Sbjct: 208 SRSDDWKDDIQVEYERRQAEERFKHFVVLKNVF---DLKTSEDDLFEIKQDIREGCEEIG 264
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V +V + + P+G+V VRF + DA +C + MNGR+F G+++ A DG
Sbjct: 265 SVTNVVLYDLEPEGIVSVRFSSKDDAARCAQEMNGRYFDGKKLEAFIYDG 314
>gi|392574970|gb|EIW68105.1| hypothetical protein TREMEDRAFT_32917 [Tremella mesenterica DSM
1558]
Length = 420
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 165/364 (45%), Gaps = 58/364 (15%)
Query: 180 DDDGTRYKWDRGLRAWVP---QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV 236
+D G Y+W +AW+P +E+ +Q Y I + +E P+ E
Sbjct: 31 EDTGKEYEWTG--KAWIPLIEEEEWKAQQAAYSISGV-----DETTPSNAALARQQRIEA 83
Query: 237 GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGL 296
++K N+ D PD + ++ P K T V+VT L
Sbjct: 84 QRKKKTG-----NTVDTATTLASISSSSTPDVGITNGKSTSKPVQASAPK-KTGVWVTNL 137
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P D T E + VFSK G++ + KPRIK+Y D E G KG+ALV Y KE SV LA
Sbjct: 138 PPDTTPELLANVFSKAGVLMIG-DDGKPRIKMYYDDE-GRFKGEALVLYFKEGSVGLAVT 195
Query: 357 LLDGTPFRPDGKIP-MSVTQAKFEQKGERF-------------------------IAKQV 390
LLD T G M V +A++E KG+ F ++ +
Sbjct: 196 LLDDTELELGGGFGNMRVREAEYE-KGDAFGKEGKGEGGNGNVEGEKTGGEKRKNLSHEE 254
Query: 391 DSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-------------VILRFMFTPAEMRADE 437
K K++K+++EK+ D P++ V+L+ MF+P ++ D
Sbjct: 255 KLKMSKRIKRMQEKITWHSDSDSDDPLAPSSGAPKPGASRFTRVVVLKGMFSPKDLDEDP 314
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
L EL+ DV+EE +G V +V + + +GV+ ++FKD AQ C+ MNGR+F GR+
Sbjct: 315 GLLLELKEDVREEAETLGEVTNVVLYDKEEEGVMTIKFKDTISAQACVNKMNGRFFDGRR 374
Query: 498 IHAS 501
I AS
Sbjct: 375 ISAS 378
>gi|443923369|gb|ELU42621.1| splicing factor u2af-associated protein 2 [Rhizoctonia solani AG-1
IA]
Length = 418
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 161/347 (46%), Gaps = 45/347 (12%)
Query: 178 FTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
F DDG +WD AWV D +Q Y +E + +E P + T N
Sbjct: 57 FEADDGREMQWDANKNAWVEVVDDELVKAQQAAYAVEGVD--EETPAAPVLART----NK 110
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVT 294
+ KE ++ T +A ++ GK + DK+ E +P S NT V+V+
Sbjct: 111 KRKKEPIDYTSNTDPAAPGPSIKR--GKNNKGDKKDESTAPAQPRKS-----KNTAVFVS 163
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
LP D +E+ F + G+I ED E K P++K+Y E G G+ALV Y KE SV LA
Sbjct: 164 SLPPDAEADEIAARFGRFGLIMEDDEGK-PKVKLYR-SEDGTFTGEALVVYYKEESVELA 221
Query: 355 TQLLDGTPFR-PDGKIPMSVTQAKFEQKGERFIA---------KQVDSKK-KKKLKKVEE 403
LLD R + M V +A++ KG A + VD KK K++ +E
Sbjct: 222 VTLLDDAELRLGEAGTRMKVQRAEYGHKGGDHEAVGNENGGQKRTVDKKKASKRIGNMER 281
Query: 404 KMLGWGGRDD-----AKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
K+ W D+ K+ V+L+ MFT E+ D L +L+ DV+EE +
Sbjct: 282 KLADWDSDDEFGPSGPKIVTSRVVVLKHMFTKEELAEDATLLLDLKEDVREESL------ 335
Query: 459 SVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+ GV+ V+FKD AQ CI MNGR+F GR++ A DG
Sbjct: 336 -----QEEEDGVMTVKFKDPVSAQACILKMNGRFFAGRKVVAEFFDG 377
>gi|225714366|gb|ACO13029.1| HIV Tat-specific factor 1 homolog [Lepeophtheirus salmonis]
Length = 481
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 168/362 (46%), Gaps = 53/362 (14%)
Query: 182 DGTRYKWDRGLRAWVPQEDT---SSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
DGT Y+WD AW P+ D+ + YG F + + PT+ D E K
Sbjct: 135 DGTTYEWDSDKNAWFPKMDSEFMAVYQLNYG-----FTSDGQAKPTI---PDAPTPEAPK 186
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTH--VYVTGL 296
P + ++K+ K WF+++ N + VYV+GL
Sbjct: 187 ---------------------------PTPETKQKQTKKA--EWFQVEKNKNAKVYVSGL 217
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPET-KKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
P+ +T E+ + + S+ G+I+ D KPRIKIY D G+ KGDAL +Y+ SV L+
Sbjct: 218 PESITEEKFISMMSRFGVIESDVRNDNKPRIKIYRDL-NGVPKGDALCSYVMVESVDLSI 276
Query: 356 QLLDGTPFRPDGKIPMSVTQAKFEQKGERF----IAKQVDSKKKKKLKKVEEKMLGW--- 408
Q+LD + + DGK + V +A FE KGE + K++ K+ + LKK ++K L W
Sbjct: 277 QILDDSLYE-DGKSKIRVERATFEMKGEAYNPDLKPKKLRKKELENLKKQKDKKLAWDLD 335
Query: 409 -GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
+ + ++L +F+P++ +L+ ++ C K G V ++V + H
Sbjct: 336 VARGMECRPKYHKVIVLFNLFSPSDFVEKPEKIIDLKVKIRNSCEKFGRVKRLEVYDQHV 395
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGA 527
G+ ++ F D +A I+++NGR F GR + DG + D + R E +
Sbjct: 396 DGIGMISFNDPIEADLAIQMLNGRLFNGRVLKVIVWDGKTKYQNSETDDEQQKRYEAWKK 455
Query: 528 EL 529
L
Sbjct: 456 FL 457
>gi|323508142|emb|CBQ68013.1| related to Splicing factor U2AF-associated protein 2 [Sporisorium
reilianum SRZ2]
Length = 439
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 132/265 (49%), Gaps = 56/265 (21%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T +YVTGLP D +V+E+ VFS+ G++ ED + +PRIK+Y D+ TGM KG+ALV Y K
Sbjct: 134 TSLYVTGLPLDASVDEIARVFSRYGVLLED-DQGQPRIKMYRDERTGMFKGEALVVYFKP 192
Query: 349 PSVALATQLLDGTPFRP-----DGKIP-MSVTQAKF------------------------ 378
SV LA +LD T R G P M V +A+F
Sbjct: 193 ESVELAIGMLDETGMRAAIGQSAGAGPVMRVQRAEFSSDASSRKAGVGASAKQNNGATSS 252
Query: 379 ---EQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDD-----AKLTIPA---------- 420
R + Q K K++ ++E K+ W RDD A T P+
Sbjct: 253 GQESSTSRRNLTDQDRKKIAKRVARLETKLSDW--RDDSPSPTASPTEPSTTTTTTTTAA 310
Query: 421 ----TVILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRF 475
TV+L MFT E+ +D L +L+ DV++EC +G V +V + + P+G++ VRF
Sbjct: 311 GGGRTVVLTKMFTLHELDSDPTLLLDLKQDVRDECSDTVGGVTNVVLWDKEPEGLMTVRF 370
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHA 500
+ + A C+ +M GR+F R+I A
Sbjct: 371 QTEQQAAACVRIMKGRFFAQRRIDA 395
>gi|84998460|ref|XP_953951.1| RNA binding protein [Theileria annulata]
gi|65304949|emb|CAI73274.1| RNA binding protein, putative [Theileria annulata]
Length = 373
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/230 (39%), Positives = 123/230 (53%), Gaps = 15/230 (6%)
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
LK +T VYV+GLP D T+ E+ EVF K G+IK DP T P+IK+Y D E G K D VT
Sbjct: 100 LKSHT-VYVSGLPKDTTISEVAEVFKKAGLIKIDPHTTLPKIKLYTD-EKGELKSDGTVT 157
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF--IAKQVDSKKKKKL--KK 400
++ + S+ LA + LD FR + I V QAKFE + + I + S+ +KK K
Sbjct: 158 FVNKESIDLAIRYLDNYHFRDNCVI--HVEQAKFEPRSTQIKPIPPSIKSELRKKYLAAK 215
Query: 401 VEEKML-GWGGR-DDAKLTIPATVILRFMFT---PAEMRADENLRSELEADVQEECVKIG 455
EEK L GW DD T VI + MF+ + A + EL+ ++ E K
Sbjct: 216 YEEKRLQGWSDTLDDG--TGRRIVISKPMFSMEDAMKYEAGDEFYEELKEEILSEIKKYV 273
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V+ V HPQGVV ++FK+ DA+ I NGR F GR++ DG
Sbjct: 274 EVEKVTPIARHPQGVVCIKFKNSSDAEVFISKFNGRMFDGRELEVYFFDG 323
>gi|401410748|ref|XP_003884822.1| putative splicing factor U2AF-associated protein, related [Neospora
caninum Liverpool]
gi|325119240|emb|CBZ54794.1| putative splicing factor U2AF-associated protein, related [Neospora
caninum Liverpool]
Length = 711
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 161/323 (49%), Gaps = 27/323 (8%)
Query: 196 VPQEDTSSQNDGYGIEEMTFLKEEE-VFPTVNVTDDLANDEVGKEKLNSTEEKVNSADNV 254
+P + + G+ ++EE+ VFP + ++ A E ++ ++ A+
Sbjct: 312 LPSASSPGEKPSSGVSPPLQVREEDMVFPGLAAVEEEAGTEGEGKEGAGDTRELTEAERQ 371
Query: 255 VEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGI 314
+E + KR+ ++++KK K W + N ++YV+ LP D T E+ E+F K G+
Sbjct: 372 KKEINAEKRRLYRQRLKKK---KQEGRWVVSRKNPNIYVSNLPPDCTEAELAEIFKKAGV 428
Query: 315 IKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVT 374
K DP+T +P+I++Y D + G KGDAL++++ E SV +A + DG FR DG + V
Sbjct: 429 FKIDPDTLQPKIRVYAD-DAGRCKGDALISFVHENSVDIAIKYFDGFAFR-DGACTLKVQ 486
Query: 375 QAKFEQKG-----------ERFIAKQVDSKKKKK----LKKVEEKMLGWG-GRDDAKLTI 418
+A+F K ER +++ K+ ++ K +E++L WG DD T
Sbjct: 487 RAEFAPKAGQGSSPTADGLERGVSESRPGKRDRRKYLAAKYEQERLLSWGDAVDDG--TG 544
Query: 419 PATVILRFMFTPAEMRADEN---LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
+IL+ ++ E E EL ++ +E + + V V H QGV V+
Sbjct: 545 RRIIILKPTYSSEEAELYEEGDAFYEELRQELFDEIAQFAKPEKVTVIPRHVQGVACVKL 604
Query: 476 KDRKDAQKCIELMNGRWFGGRQI 498
K +DA++ IE GR+F GRQ+
Sbjct: 605 KTAEDAERIIEQFRGRYFDGRQL 627
>gi|399218106|emb|CCF74993.1| unnamed protein product [Babesia microti strain RI]
Length = 507
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 140/285 (49%), Gaps = 17/285 (5%)
Query: 233 NDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVY 292
ND +E + + + N A + + K K+ +++E+ E + K N VY
Sbjct: 152 NDATTREMIKKIDTEENEAKVIEKFKKQKYLKRKRERIERGE-------FIASKKNYSVY 204
Query: 293 VTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVA 352
+TGLP DVT +E+ VF K G+IK DP T P++KIY D E + KGDA +T+L SV+
Sbjct: 205 ITGLPKDVTSDELQSVFRKAGVIKIDPITTLPKVKIYRD-ECDVPKGDATITFLNRESVS 263
Query: 353 LATQLLDGTPFRPDGKIPMSVTQAKFEQKGER-FIAKQVDSKKKKKLKKVEEKMLGWGGR 411
L + D FRP+ KI + Q K +R +I ++ KK K +++ W
Sbjct: 264 LCLKYFDDYYFRPNVKIHVEEAQFKANMDKKRPYIDPKLRKKKFLAAKYEAQRISSWKDE 323
Query: 412 -DDAKLTIPATVILRFMFTPAE----MRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
DD T VI + MF+ + + DE +L +V EE +K VD + H
Sbjct: 324 MDDG--TGRRVVIAKPMFSLEQAKKYTKGDE-FYEKLRHEVYEEVIKFAEVDKITPMSRH 380
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAA 511
PQG+V ++FK KDA+ I R F G ++ DG + AA
Sbjct: 381 PQGIVCLKFKTAKDAEIFIAAFKDRLFDGNKLDVYFYDGKTDLAA 425
>gi|324508726|gb|ADY43681.1| HIV Tat-specific factor 1 [Ascaris suum]
Length = 421
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 144/269 (53%), Gaps = 16/269 (5%)
Query: 249 NSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMV 306
+S+ NVV E + K+P + KE + W EL N VYV+ LP +T + +
Sbjct: 134 SSSKNVVAEGESTPPKKPKPHDKTKE-----EGWVELDETRNAAVYVSNLPKTITEDAFI 188
Query: 307 EVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPD 366
E+ SKCG+I+ D T K +IKIY +E G KGDA Y+K+ SV LA Q+LDG + D
Sbjct: 189 ELMSKCGVIQRDARTNKLKIKIYRGEE-GEPKGDARCGYVKKESVDLALQILDG--WNLD 245
Query: 367 GKIPMSVTQAKFEQKGERFIAKQ---VDSKKKKKLKKVEEKMLGWGGRD--DAKLTIPAT 421
G + + V +AKF+ KGE +K+ + + +KK+ + ++++ W + + T
Sbjct: 246 GNL-IHVEKAKFQMKGEFDPSKKRRKLTAAQKKRFIESQQRLFEWKPEKPRNYRPFSDCT 304
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
+++ MFT +M + L +L+ ++ C + G V V V +N+P G++ V F + + +
Sbjct: 305 AVMKNMFTMEQMMRNPTLLMDLQEQTKKACERFGTVKKVVVHDNNPDGIICVTFDNVEHS 364
Query: 482 QKCIELMNGRWFGGRQIHASEDDGLVNHA 510
+ MNG GR+I S DG +A
Sbjct: 365 DLAVRSMNGLVVDGRKIEVSTWDGKTKYA 393
>gi|66804879|ref|XP_636172.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
discoideum AX4]
gi|60464530|gb|EAL62669.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
discoideum AX4]
Length = 387
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 74/227 (32%), Positives = 125/227 (55%), Gaps = 16/227 (7%)
Query: 291 VYVTGLPDDVTVEEM--VEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
++++G+P D ++ E+ KCG IK++ E +P I + VD E G ALV++ ++
Sbjct: 147 LHISGIPMDGSIRELQISNYLKKCGYIKKN-EYGRPIISLLVDNENQFT-GCALVSFERK 204
Query: 349 PSVALATQLLDGTPFRPDGKIPM---SVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKM 405
SV +A D T P+ I + ++ ++K E+ KQ S K ++ + E
Sbjct: 205 ESVPIAILQYDETEISPNNIIRIRKATLEESKLHLVNEKSNKKQKKSHKDRRSEYENEAK 264
Query: 406 LGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEE---CVKIGPVDSVKV 462
GW + TVI++ +F+P E D N ++L+AD+++E C K GP+ S+ +
Sbjct: 265 YGWADSESK------TVIIKNLFSPEEAMVDPNFFNDLQADLEDETHGCGKCGPISSIHI 318
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
E +P GVV V+F + + A+KC+ LMN R+FGGR++ A DG N+
Sbjct: 319 FEYNPDGVVSVKFSEFESAKKCVALMNDRFFGGRKLSADFYDGYSNY 365
>gi|361129326|gb|EHL01235.1| putative Splicing factor U2AF-associated protein 2 [Glarea
lozoyensis 74030]
Length = 353
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 169/342 (49%), Gaps = 71/342 (20%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEKL 241
DGT Y++D ++ W+P ++E ++++ + V + D + +++
Sbjct: 47 DGTEYEFDDAIKRWIPV-----------VDEALLEQQQKAYQVEGVDESEPVDALKRKR- 94
Query: 242 NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVT 301
+E VN D G R++ K +K +A PP K NT VYVTGLP DVT
Sbjct: 95 --KKEYVNGEDE-------GGREK--KAPKKTKAPLPP------KANTAVYVTGLPFDVT 137
Query: 302 VEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDG 360
+ E+ EVFS KCG+I E+ ++ PRIK+Y D + G+ KGDAL+ + K PSV +A LLD
Sbjct: 138 IPEVHEVFSRKCGVIAEEIDSGAPRIKLYTDSK-GIFKGDALIVFFKPPSVDMAIMLLDD 196
Query: 361 TPFRP---DGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRD----- 412
T FR +G M V A K Q D++ K K+ E++ RD
Sbjct: 197 TQFRMADGEGSGRMRVQAADMSYK-----RVQNDTEVTAKDKEKEKERQRTKARDKQKII 251
Query: 413 ------DAKLTI---PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
DA+L PA ++ +++ D++EEC K+G V +V +
Sbjct: 252 KKTQKLDARLADWNDPAAIL------------------DIKEDIREECSKLGEVTNVVLF 293
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+ +G+ VRF + + A+ C+ LMNGR F G+++ A +G
Sbjct: 294 DLEEEGIASVRFTNAEAAKACVILMNGRNFDGQKVEAYVSEG 335
>gi|156362084|ref|XP_001625611.1| predicted protein [Nematostella vectensis]
gi|156212453|gb|EDO33511.1| predicted protein [Nematostella vectensis]
Length = 313
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 93/254 (36%), Positives = 128/254 (50%), Gaps = 27/254 (10%)
Query: 278 PPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
P WF+ + N +VYV+GLP D+T EE VE+ SK GII EDP+T
Sbjct: 59 PKVGWFQPDPEKNPNVYVSGLPLDITDEEFVELMSKYGIIMEDPDTH------------- 105
Query: 336 MKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA-KQVDSKK 394
YLK SV LA LLD ++ K + V QA+F KG A K+ KK
Sbjct: 106 -----GRCCYLKMASVHLAIDLLDEAEYK---KSTLHVEQAQFNMKGNYNPALKKKKKKK 157
Query: 395 KKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKI 454
KKK K +EK+L W RD+ + ++++ MF P D L D+++EC K
Sbjct: 158 KKKQGKQQEKLLDWVDRDNRRPKNERVIVIKNMFDPKVFEKDPQLIITTRNDLRKECEKF 217
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN---HAA 511
G V V V + H +G+ V FK+ A C+E MNGRWF GR++ A DG+ N +
Sbjct: 218 GDVRKVIVFDRHSEGICSVAFKEHPPADACLERMNGRWFAGRRLAADRWDGVSNFNIQES 277
Query: 512 IRDLDAEASRLEQF 525
+DL + E+F
Sbjct: 278 DKDLKDRMANWEKF 291
>gi|357623952|gb|EHJ74902.1| hypothetical protein KGM_22644 [Danaus plexippus]
Length = 258
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 82/235 (34%), Positives = 133/235 (56%), Gaps = 19/235 (8%)
Query: 309 FSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGK 368
KCG+++ DP +K ++K+Y+DKE KGDAL TY+K SV LA +LLDG+ ++ +
Sbjct: 1 MQKCGLVERDPVNQKMKVKLYMDKEQNCFKGDALCTYIKIESVDLALKLLDGSDYKGN-- 58
Query: 369 IPMSVTQAKFEQKGERFIAKQVDSKKKKKLKK---VEEKMLGW------GGRDDAKLTIP 419
+ V +A+F+ KG+ A + KKKK+L+K +++K+ W G R +
Sbjct: 59 -KIKVERAQFQMKGDYNPALKPKKKKKKELEKLKKMQQKLFDWRPEKFIGERSKHE---- 113
Query: 420 ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRK 479
VI++ +F P++ D L + + D++EEC K G V V + + HP+GV + K+ +
Sbjct: 114 RIVIVKNLFHPSDFDNDVQLILDYQQDLREECSKCGEVRKVVIYDAHPEGVAQITMKEPE 173
Query: 480 DAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIR---DLDAEASRLEQFGAELEA 531
A I+L+NGRWFG RQI A DG + D++ ++ ++F E EA
Sbjct: 174 QADAVIQLINGRWFGKRQITAETYDGRTKYRIAETDADINKRINKWDKFLEEEEA 228
>gi|321260576|ref|XP_003195008.1| splicing factor u2af-associated protein 2 [Cryptococcus gattii
WM276]
gi|317461480|gb|ADV23221.1| splicing factor u2af-associated protein 2, putative [Cryptococcus
gattii WM276]
Length = 397
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 164/360 (45%), Gaps = 73/360 (20%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVP---QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
G+ ++ DD+GT ++W+ AW+P E +Q Y + P V+
Sbjct: 43 GKWQYEDDEGTEHEWNG--TAWIPIIDDELVKAQQAAYSV------------PGVD---- 84
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKP-------PDSWF 283
E ++A EE+ N KRK+ +K +N P P
Sbjct: 85 --------------ESTPSNAAITREERRNKKRKKGEKDYTSNSSNAPAAHVTSKPAPAP 130
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
+ T V+VT LP + TV+++ +VFSK G++ D E PRIK+Y D E G KG+A V
Sbjct: 131 SVPKKTGVWVTNLPPNTTVQKLADVFSKAGVLHIDDEGN-PRIKMYYDDE-GNFKGEAWV 188
Query: 344 TYLKEPSVALATQLLDGTPFRPD-GKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVE 402
Y KE SV LA LLD T G PM V A++ ++ R K++K ++
Sbjct: 189 VYFKEGSVDLAITLLDDTELELGAGYPPMRVKVAEYFKEKHRM---------SKRMKTLQ 239
Query: 403 EKMLGWGGRDDA--------------KLTIPATVILRFMFTPAEMRADENLRSELEADVQ 448
K + W D++ V+L+ MF ++ D L EL+ +V+
Sbjct: 240 SK-ITWRSDDESDDAAAPLGGAPAPTNNRFARVVVLKGMFVLEDLEKDPALLLELKEEVR 298
Query: 449 EECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVN 508
EE +G V SV + + GV+ ++FK+ AQ C+ MN R+F GR I+A GL N
Sbjct: 299 EEAETLGQVTSVILYDKEEDGVMTIKFKEPVSAQACVAKMNNRYFDGRVIYA----GLYN 354
>gi|330805047|ref|XP_003290499.1| hypothetical protein DICPUDRAFT_155008 [Dictyostelium purpureum]
gi|325079378|gb|EGC32981.1| hypothetical protein DICPUDRAFT_155008 [Dictyostelium purpureum]
Length = 365
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 135/252 (53%), Gaps = 21/252 (8%)
Query: 273 KEANKPPDSWFELKVN-THVYVTGLPDDVTV--EEMVEVFSKCGIIKEDPETKKPRIKIY 329
K++ K D EL+ N + +Y++GLP D +V EE+ + KCG +K++ E +P I +Y
Sbjct: 95 KKSEKKRDKNQELEENISELYISGLPIDGSVREEEINKYLKKCGFVKKN-EYGRPIINLY 153
Query: 330 VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKI---------PMSVTQAKFEQ 380
VD E + G+ALV++ ++ S+ +A D T P+ I +SV A
Sbjct: 154 VD-ENNLFTGNALVSFERKESIPIAILQYDDTEIVPNHPIKLRKATLEESLSVNNATQNS 212
Query: 381 KGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLR 440
K+ K + + E L +G D TVI++ +F+P+E D N
Sbjct: 213 NNNSNNKKKKQKKSGQDKRSEYENELKYGWADSE----SKTVIIKNLFSPSEAWEDPNFF 268
Query: 441 SELEADVQEE---CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+ L+AD+++E C K G + S+ + E +P GVV V+FK+ + A+ C+ LM+GR+FGGR+
Sbjct: 269 NSLQADLEDETHGCAKCGEISSIHIFEYNPDGVVSVKFKEFESAESCVALMDGRFFGGRK 328
Query: 498 IHASEDDGLVNH 509
+ A DG N+
Sbjct: 329 LSADFYDGFTNY 340
>gi|150864548|ref|XP_001383408.2| hypothetical protein PICST_57807 [Scheffersomyces stipitis CBS
6054]
gi|149385804|gb|ABN65379.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 383
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/238 (35%), Positives = 133/238 (55%), Gaps = 13/238 (5%)
Query: 275 ANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKET 334
+KP DS K + VYV+ LP D+T +E+ + FSK GII ED T +PRIK+Y D +
Sbjct: 112 TSKPLDSNSNEKPVSAVYVSKLPLDITKDELAQSFSKYGIISEDYNTSQPRIKMYYDNDK 171
Query: 335 GMKKGDALVTYLKEPSVALATQLLDGTPFRP--DGKIPMSVTQAKFEQKGERFIAKQ-VD 391
KG+A+V Y SV LA +++D T RP +G+ +SV A+F+ + ++ V
Sbjct: 172 F--KGEAVVFYHAVESVRLAIEMMDNTYIRPSSNGESRISVQPAQFDNTRSKSTEERPVL 229
Query: 392 SKKKKKL-----KKVEEKMLGWGG--RDDAKLTIPATVILRFMFTPAEMRADENLRSELE 444
S +K+KL + +++++ W + K V+++ MF E+++D L +L+
Sbjct: 230 SAEKRKLLLQKKESLKKRLTQWDDDVTSNEKEKEAKIVLVKQMFREEELKSDPMLELDLK 289
Query: 445 ADVQEECVKIGP-VDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
D+QEEC KIG D K+ GVV ++FK+ + CI NGR+F G ++ AS
Sbjct: 290 EDIQEECDKIGIGNDITKITVYDITGVVTIKFKNPISSSTCISNFNGRFFDGLKLQAS 347
>gi|58268534|ref|XP_571423.1| splicing factor u2af-associated protein 2 [Cryptococcus neoformans
var. neoformans JEC21]
gi|57227658|gb|AAW44116.1| splicing factor u2af-associated protein 2, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 489
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 169/378 (44%), Gaps = 84/378 (22%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVP---QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
G+ ++ DD+GT ++W+ AW+P E +Q Y + P V+
Sbjct: 110 GKWQYEDDEGTEHEWNGT--AWIPIIDDELVRAQQAAYSV------------PGVD---- 151
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKP--------PDSW 282
E ++A EE+ N KRK+ +K +N P P
Sbjct: 152 --------------ESTPSNAAIAREERRNKKRKKGEKDYTSNTSNAPAAATEASKPAPA 197
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
T V+VT LP + T++++ +VFSK G++ D E PRIK+Y D E G KG+A
Sbjct: 198 PSAPKKTGVWVTNLPPNTTIQKLADVFSKAGVLHIDDEGN-PRIKMYYDDE-GNFKGEAW 255
Query: 343 VTYLKEPSVALATQLLDGTPFRPD-GKIPMSVTQAKF---------------EQKGERFI 386
V Y KE SV LA LLD T G PM V A++ + ++ +
Sbjct: 256 VVYFKEGSVDLAITLLDDTELELGAGYPPMRVKVAEYFKDQEKGKDKEKKEKTEGEKKKL 315
Query: 387 AKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA----------------TVILRFMFTP 430
+ K K++K ++ K + W R D + PA V+L+ MF P
Sbjct: 316 TAEEKQKMSKRMKTLQSK-ITW--RSDDESDDPAAPLGGAPAPTNNRFARVVVLKGMFVP 372
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
E+ D L EL+ +V+EE +G V SV + + GV+ ++FK+ AQ C+ MN
Sbjct: 373 EELEKDPALLLELKEEVREEAETLGQVTSVILYDKEEDGVMTIKFKEPVSAQACVAKMNN 432
Query: 491 RWFGGRQIHASEDDGLVN 508
R+F GR I+A GL N
Sbjct: 433 RYFDGRVIYA----GLYN 446
>gi|17510375|ref|NP_490765.1| Protein Y65B4A.1 [Caenorhabditis elegans]
gi|373220065|emb|CCD71919.1| Protein Y65B4A.1 [Caenorhabditis elegans]
Length = 442
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 128/233 (54%), Gaps = 12/233 (5%)
Query: 280 DSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
+ W +L H VYV+ LP+D+T EE + SKCG+I+ D T KP+ K+Y + E G K
Sbjct: 169 EGWLDLGDKVHAVYVSNLPEDITDEEFQKFMSKCGVIQPDIRTNKPKCKLYRE-ENGKLK 227
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ---VDSKKK 395
GD Y+K+ SV LA +LDG +G+ + V +A+FE KG+ A++ + + +K
Sbjct: 228 GDGRCCYIKKESVELACNILDGANL--NGR-EVKVEEARFEMKGDFDPARKRRKLTAAQK 284
Query: 396 KKLKKVEEKMLGWGGRDDAKLTIP---ATVILRFMFTPAEMRADENLRSELEADVQEECV 452
K+ + + K+ W D + P TVI++ +FT M + L +L+ ++ + C
Sbjct: 285 KRYMEQQNKIFEWTP-DKPRNYRPKSDCTVIVKNLFTQEMMNKNAALMLDLKEEMTQSCQ 343
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
K G V V V NHP GVV V F +++ ++ ++GR GR++ A DG
Sbjct: 344 KYGIVKKVVVYANHPDGVVSVTFPTTEESDMAVKYLHGRVVDGRKLTAELWDG 396
>gi|134112848|ref|XP_774967.1| hypothetical protein CNBF1310 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257615|gb|EAL20320.1| hypothetical protein CNBF1310 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 402
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 169/378 (44%), Gaps = 84/378 (22%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVP---QEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDD 230
G+ ++ DD+GT ++W+ AW+P E +Q Y + P V+
Sbjct: 23 GKWQYEDDEGTEHEWNG--TAWIPIIDDELVRAQQAAYSV------------PGVD---- 64
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKP--------PDSW 282
E ++A EE+ N KRK+ +K +N P P
Sbjct: 65 --------------ESTPSNAAIAREERRNKKRKKGEKDYTSNTSNAPAAATEASKPAPA 110
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
T V+VT LP + T++++ +VFSK G++ D E PRIK+Y D E G KG+A
Sbjct: 111 PSAPKKTGVWVTNLPPNTTIQKLADVFSKAGVLHIDDEGN-PRIKMYYDDE-GNFKGEAW 168
Query: 343 VTYLKEPSVALATQLLDGTPFRPD-GKIPMSVTQAKF---------------EQKGERFI 386
V Y KE SV LA LLD T G PM V A++ + ++ +
Sbjct: 169 VVYFKEGSVDLAITLLDDTELELGAGYPPMRVKVAEYFKDQEKGKDKEKKEKTEGEKKKL 228
Query: 387 AKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT----------------VILRFMFTP 430
+ K K++K ++ K+ W R D + PA V+L+ MF P
Sbjct: 229 TAEEKQKMSKRMKTLQSKIT-W--RSDDESDDPAAPLGGAPAPTNNRFARVVVLKGMFVP 285
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
E+ D L EL+ +V+EE +G V SV + + GV+ ++FK+ AQ C+ MN
Sbjct: 286 EELEKDPALLLELKEEVREEAETLGQVTSVILYDKEEDGVMTIKFKEPVSAQACVAKMNN 345
Query: 491 RWFGGRQIHASEDDGLVN 508
R+F GR I+A GL N
Sbjct: 346 RYFDGRVIYA----GLYN 359
>gi|401882532|gb|EJT46786.1| hypothetical protein A1Q1_04464 [Trichosporon asahii var. asahii
CBS 2479]
gi|406700778|gb|EKD03942.1| hypothetical protein A1Q2_01766 [Trichosporon asahii var. asahii
CBS 8904]
Length = 407
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 160/337 (47%), Gaps = 59/337 (17%)
Query: 242 NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVT 301
N+ V + ++ ++++ G+++QP K + + P NT V+V+ LP D T
Sbjct: 61 NTPAAPVLAREDRIKKRKAGEKEQPSKTPKPLNQSGPK--------NTAVWVSNLPPDAT 112
Query: 302 VEEMVEVFSKCGI--IKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
E++ VFSK G+ I +D E PRIK+Y D ++G KG+ALV Y KE SV LA LLD
Sbjct: 113 PEQLAAVFSKAGVLLIGDDGE---PRIKLYYD-DSGKFKGEALVMYFKEGSVDLAITLLD 168
Query: 360 GTPF-------------------------RPDGKI---PMSVTQAKFEQKGERFIAKQVD 391
T RP+ + P + K + ++ ++K++
Sbjct: 169 DTELEFGAGCGNMRVRAAEYQHSKSNNDKRPENRPQQPPQPTEKKKLTAEEKQRMSKRMR 228
Query: 392 SKKKKKLKKVE---------------EKMLGWGGRDDAK-LTIPATVILRFMFTPAEMRA 435
+ ++ ++ + E +G R K L V+L+ MF P +++
Sbjct: 229 TLEECVFRRCQSLITLSKLAWHSDEDEDPIGSSARQPGKVLPFNRVVVLKGMFRPEDIKK 288
Query: 436 DENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
+ L EL+ DV+EE +G V SV + + P+GV+ ++FKD AQ C+ MNGR+F G
Sbjct: 289 EPELLLELKDDVREEAESLGVVTSVILYDKEPEGVITIKFKDSTAAQACLLKMNGRYFDG 348
Query: 496 RQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELEAD 532
R++ S +G+ N L R + G + E D
Sbjct: 349 RKVRRSP-EGVANILITATLFTGKERYRRSGGKFEED 384
>gi|71004164|ref|XP_756748.1| hypothetical protein UM00601.1 [Ustilago maydis 521]
gi|46096017|gb|EAK81250.1| hypothetical protein UM00601.1 [Ustilago maydis 521]
Length = 465
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 126/263 (47%), Gaps = 52/263 (19%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T +YVTGLP D T +E+ VF + G++ ED + +PR+K+Y D +T M KG+ALV Y K
Sbjct: 152 TSLYVTGLPLDATSDEIARVFQRYGVLLED-DQGRPRVKMYYDDKTRMFKGEALVVYFKP 210
Query: 349 PSVALATQLLDGTPFR-----PDGKIPMSVTQAKF-------------EQKG-------- 382
SV LA +LD T R G M V +A+F +Q G
Sbjct: 211 ESVDLAISMLDETSLRGAIGSSSGGQVMRVQRAEFHNAQAGDAKASGKQQAGADGVKPNS 270
Query: 383 --------ERFIAKQVDSKKKKKLKKVEEKMLG----------------WGGRDDAKLTI 418
R + +Q K K++ ++E K+ G A+
Sbjct: 271 TDASCNSHRRNLTEQDRKKIAKRVARLETKLSDWRDDSSSSSVSDSETMHTGSATARTGG 330
Query: 419 PATVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKD 477
TV+L MFT E+ +D L +L+ DV+EEC IG V +V + + P+G++ V+F
Sbjct: 331 ARTVVLTKMFTLFELESDPTLLLDLKEDVREECNSSIGGVTNVVLWDAEPEGIITVKFST 390
Query: 478 RKDAQKCIELMNGRWFGGRQIHA 500
A C+ +MNGR+F R+I A
Sbjct: 391 PDQANACVRMMNGRFFAQRRIDA 413
>gi|308485810|ref|XP_003105103.1| hypothetical protein CRE_20758 [Caenorhabditis remanei]
gi|308257048|gb|EFP01001.1| hypothetical protein CRE_20758 [Caenorhabditis remanei]
Length = 450
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 126/233 (54%), Gaps = 12/233 (5%)
Query: 280 DSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
+ W + H VYV+ LP D+T EE E SKCG+I+ D T KP+ K+Y ++E G K
Sbjct: 174 EGWVDFGDKVHAVYVSNLPLDITDEEFQEFMSKCGVIQPDIRTNKPKCKLYRNEE-GDLK 232
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ---VDSKKK 395
GD Y+K+ S+ LA +LDG+ + + V +A FE KG+ +K+ + + +K
Sbjct: 233 GDGRCCYIKKESIELACNILDGSLLKNK---EVKVEEAHFELKGDFDPSKKRRKLTAAQK 289
Query: 396 KKLKKVEEKMLGWGGRDDAKLTIP---ATVILRFMFTPAEMRADENLRSELEADVQEECV 452
K+ + + K+ W D + P TVI++ +FT M + L +L+ ++ + C
Sbjct: 290 KRYMEQQNKLFEWTP-DKPRNYRPKSDCTVIVKNLFTLEMMSKNAALMLDLKEEMTQSCQ 348
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
K G V V V +NHP GVV V F +++ ++ +NGR GR++ A DG
Sbjct: 349 KYGTVKKVVVYDNHPDGVVSVTFPTTEESDMAVKYLNGRVVDGRKLTAELWDG 401
>gi|328869826|gb|EGG18201.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
fasciculatum]
Length = 435
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/240 (30%), Positives = 128/240 (53%), Gaps = 19/240 (7%)
Query: 291 VYVTGLPDD---VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+ ++GLP + + V ++ + F K G I+E + P+I+ ++D E+G + G A+++Y +
Sbjct: 178 LIISGLPTNPSLIRVNDLFQAFRKAGFIQETADGD-PKIQFFID-ESGARTGQAVISYAR 235
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKK----LKKVEE 403
E S+ LA QLLD T P K M + QA EQ + + SKK K+ + K E
Sbjct: 236 EESIHLAIQLLDDTEIIP--KYKMKLAQATPEQV--KTTQAKAPSKKGKREDSRVVKKRE 291
Query: 404 KMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
GWG + ++++ +F P + ++ N EL+ D++ C + G + S+ +
Sbjct: 292 LNYGWGESESR------VIVIKNLFDPKDSWSNLNFYDELKEDLEMGCQRCGEIQSITIF 345
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLE 523
E +P GV ++FKD + A+KC+ LM GR+F R++ A DG ++ + + RL+
Sbjct: 346 ERNPDGVATIKFKDFEAAEKCVALMEGRYFAQRKLTADFYDGFTDYHVEETEEEKEQRLK 405
>gi|384493494|gb|EIE83985.1| hypothetical protein RO3G_08690 [Rhizopus delemar RA 99-880]
Length = 283
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 14/229 (6%)
Query: 239 EKLNSTEEKVNSADNVVE--EKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGL 296
E+L + V + D V E + K+K+ E+ E+ K P + T VY+TGL
Sbjct: 51 EQLMQAQASVYAIDGVEETVQPKEKKKKRSLSTYEQDESQKKPKH----ERITSVYITGL 106
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
P DVT +E+ FSKCG+I ED ET +P+IKIY D E + KGDALV+Y KE SV LA +
Sbjct: 107 PQDVTADELKTTFSKCGVIMEDLETGEPKIKIYKD-ENNVPKGDALVSYFKEESVPLAIE 165
Query: 357 LLDGTPFRP-DGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGW----GGR 411
LLD RP M+V++A F++K E+ K+ + K KK + ++ L W G+
Sbjct: 166 LLDEAELRPGKSATKMTVSKAVFKEKKEQNEKKRTNKMKAKKKMQQLQRKLDWVDDESGK 225
Query: 412 DDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV 460
K VIL+ M+T E+ D L EL+ DV++EC K+G V +V
Sbjct: 226 RQEKFA--KIVILKNMYTQEELDEDPTLLLELKEDVRDECEKLGEVTNV 272
>gi|405121186|gb|AFR95955.1| splicing factor u2af-associated protein 2 [Cryptococcus neoformans
var. grubii H99]
Length = 426
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 168/372 (45%), Gaps = 84/372 (22%)
Query: 174 GEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLAN 233
G+ ++ DD+GT ++W+ AW+P + S G++E T +N
Sbjct: 57 GKWQYEDDEGTEHEWNG--TAWIPIQAAYSVP---GVDEST----------------PSN 95
Query: 234 DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKP-------PDSWFELK 286
+ + EE+ N KRK+ +K +N P P
Sbjct: 96 AAIAR-----------------EERRNKKRKKGEKDYTSNTSNAPATADTSRPAPAPSAP 138
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
T V+VT LP + TV+++ +VFSK G++ D E PRIK+Y D E G KG+A V Y
Sbjct: 139 KKTGVWVTNLPPNTTVQKLADVFSKAGVLHIDDE-GNPRIKMYYDDE-GNFKGEAWVVYF 196
Query: 347 KEPSVALATQLLDGTPFRPD-GKIPMSVTQAKF----------------EQKGERFIAKQ 389
KE SV LA LLD T G PM V A++ E + +R A++
Sbjct: 197 KEGSVDLAITLLDDTELELGAGYPPMRVKVAEYFKDQDKGKDKEKKEKIEGEKKRLTAEE 256
Query: 390 VDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA----------------TVILRFMFTPAEM 433
K K++K ++ K + W R D + PA V+L+ MF E+
Sbjct: 257 -KQKMSKRMKTLQSK-ITW--RSDDESDDPAAPLGGAPAPTNNRFARVVVLKGMFVLEEL 312
Query: 434 RADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
D L EL+ +V+EE +G V SV + + GV+ ++FK+ AQ C+ MN R+F
Sbjct: 313 EKDPALLLELKEEVREEAATLGQVTSVILYDKEEDGVMTIKFKEPVSAQACVAKMNNRYF 372
Query: 494 GGRQIHASEDDG 505
GR I+A +G
Sbjct: 373 DGRVIYAGLYNG 384
>gi|294654473|ref|XP_456534.2| DEHA2A04884p [Debaryomyces hansenii CBS767]
gi|199428911|emb|CAG84489.2| DEHA2A04884p [Debaryomyces hansenii CBS767]
Length = 389
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 173/383 (45%), Gaps = 69/383 (18%)
Query: 179 TDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
D+ T Y ++ L W+P + E+ T EEE + LA + K
Sbjct: 43 NDNKSTEYYYNFILERWIPTAEIEDNGRKRSREDETIHDEEE-------ENKLAIKNLKK 95
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPD 298
+KL +E++N + K +G + + S FE NT VYV+ LP
Sbjct: 96 QKLKEMKEQMNKS------KADGNKNK---------------STFE---NTGVYVSQLPG 131
Query: 299 DVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLL 358
D+T +E++ FSK G+I ED +T +PRIKIY + + DAL+ Y + SV+LA ++L
Sbjct: 132 DITKDELMVTFSKYGLISEDYKTGEPRIKIYHNDGQRDDQRDALIIYHSKESVSLAIEML 191
Query: 359 DGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSK--KKKKL------------KKVEEK 404
D + RP P Q K + + F D+ KKK L + +++K
Sbjct: 192 DDSSIRP----PSDKDQTKIKVQPAEFKKSDSDTTNDKKKTLTYEEKKLLNKKKEMMKKK 247
Query: 405 MLGWGGRDDAKLTIPA----------------TVILRFMFTPAEMRADENLRSELEADVQ 448
+ W DD + TI A V++ MF E+++D L +L+ D+Q
Sbjct: 248 LSSW---DDDEGTIQAGDSDKTNNIKKRIWDKIVVIEKMFRIEELKSDPLLEMDLKEDIQ 304
Query: 449 EECVKIGP-VDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV 507
+EC K+ D K+ G++ V+F + + KCIE +GR++ G + A G
Sbjct: 305 DECNKLNIGNDVTKITIYDVSGIITVKFNNGDSSLKCIESFSGRYYDGLTLKAYLYQGEK 364
Query: 508 NHAAIRDLDAEASRLEQFGAELE 530
+D + E RL+ FG LE
Sbjct: 365 FQKTSQDKENEGERLDSFGNWLE 387
>gi|146421728|ref|XP_001486808.1| hypothetical protein PGUG_00185 [Meyerozyma guilliermondii ATCC
6260]
Length = 359
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 127/236 (53%), Gaps = 26/236 (11%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V+ LP DVT +E+ E+FSK G+I ED +T +PRIK+Y D+ G K +ALV Y + S
Sbjct: 114 VFVSKLPSDVTSDELFEIFSKYGVIAEDFKTNRPRIKLYYDQ--GQFKNEALVIYHNKES 171
Query: 351 VALATQLLDGTPFRPDGKI-PMSVTQAKF------EQKGERFIA---KQVDSKKKKKLKK 400
V LA +LD + R ++ P+ V A F E+ R + KQ+ KKK+ LK
Sbjct: 172 VYLAIDMLDNSKIRSSKELEPIKVEPAAFSEERDLEKAPSRVLTAEEKQLLHKKKEMLK- 230
Query: 401 VEEKMLGWGGRDDAKLTIPA----------TVILRFMFTPAEMRADENLRSELEADVQEE 450
+K+ W D + + A ++++ MF E++ D+ L S++ D++ E
Sbjct: 231 --QKVSSWDDEDPSAVAGRAEQIRQRIWKKSLVIENMFRVEELKNDKYLESDIIEDIKSE 288
Query: 451 CVKIGPVDSV-KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
C K G SV KV + VV+VRF + A +C +GR++ G +IHA E G
Sbjct: 289 CDKYGIGSSVTKVSFFDVERVVVVRFDLIEAASQCRSAFDGRYYDGLKIHAREYAG 344
>gi|443710311|gb|ELU04565.1| hypothetical protein CAPTEDRAFT_225820 [Capitella teleta]
Length = 492
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/240 (34%), Positives = 126/240 (52%), Gaps = 21/240 (8%)
Query: 282 WFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
WF + +T VYV+GLP D+T EE E KCG+I D KK ++K+Y DK+ G KG
Sbjct: 212 WFNANHEKSTSVYVSGLPTDITDEEFEEFMKKCGLILYDQVAKKLKVKLYKDKD-GKNKG 270
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE---RFIAKQVDSKKKK 396
D + Y+K SV LA ++LDG R + ++V +AKFE KG+ + +++ +K+KK
Sbjct: 271 DGICGYIKAESVDLALEILDGNDIRGN---RVTVERAKFELKGQYDPKKKRRKLTNKEKK 327
Query: 397 KLKKVEEKMLGWGGRDDAKLTIPA-------TVILRFMFTPAEMRADENLRSELEADVQE 449
++K+ +EK+ W R D +P VI + FTP + D + + ++
Sbjct: 328 RIKEKQEKLFAW--RPD---KLPGERPRSDRVVIAKNAFTPEQFAKDPTKINLVSQKMRG 382
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+C + G V VKV + H GVV V ++A I+ +NG GR I DG +
Sbjct: 383 KCNQFGDVRKVKVHDGHVDGVVSVTMGTPEEADLAIKGLNGCLLFGRVISVETWDGTTRY 442
>gi|190344417|gb|EDK36087.2| hypothetical protein PGUG_00185 [Meyerozyma guilliermondii ATCC
6260]
Length = 359
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 126/236 (53%), Gaps = 26/236 (11%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V+ LP DVT +E+ E+FSK G+I ED +T +PRIK+Y D+ G K +ALV Y + S
Sbjct: 114 VFVSKLPSDVTSDELFEIFSKYGVIAEDFKTNRPRIKLYYDQ--GQFKNEALVIYHNKES 171
Query: 351 VALATQLLDGTPFRPDGKI-PMSVTQAKF------EQKGERFIA---KQVDSKKKKKLKK 400
V LA +LD + R + P+ V A F E+ R + KQ+ KKK+ LK
Sbjct: 172 VYLAIDMLDNSKIRSSKESEPIKVEPAAFSEERDSEKAPSRVLTAEEKQLLHKKKEMLK- 230
Query: 401 VEEKMLGWGGRDDAKLTIPA----------TVILRFMFTPAEMRADENLRSELEADVQEE 450
+K+ W D + + A ++++ MF E++ D+ L S++ D++ E
Sbjct: 231 --QKVSSWDDEDPSAVAGRAEQIRQRIWKKSLVIENMFRVEELKNDKYLESDIIEDIKSE 288
Query: 451 CVKIGPVDSV-KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
C K G SV KV + VV+VRF + A +C +GR++ G +IHA E G
Sbjct: 289 CDKYGIGSSVTKVSFFDVERVVVVRFDSIEAASQCRSAFDGRYYDGLKIHAREYAG 344
>gi|367018740|ref|XP_003658655.1| hypothetical protein MYCTH_2294700 [Myceliophthora thermophila ATCC
42464]
gi|347005922|gb|AEO53410.1| hypothetical protein MYCTH_2294700 [Myceliophthora thermophila ATCC
42464]
Length = 428
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 75/189 (39%), Positives = 102/189 (53%), Gaps = 28/189 (14%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
DDGT Y++D GLR W+P D + Q GY + P DD G
Sbjct: 36 DDGTEYEFDEGLRRWIPIVDEALIEEQQKGY------------IMPNSAAQDDRQELAQG 83
Query: 238 KE-KLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGL 296
K+ KL+S + + ++ N N + +P K ++ PP + K NT VYVTGL
Sbjct: 84 KKRKLDSNDREDSNYSN------NNNKARPFKAARRQGNRGPP----QPKQNTAVYVTGL 133
Query: 297 PDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALAT 355
P D TVEE+ E+FS KCG+I E+ ++ +PRIK+Y D E G KGDAL+ + K SV +A
Sbjct: 134 PLDATVEEVAELFSRKCGVIAEEIDSGRPRIKMYTD-ENGNFKGDALIVFFKPQSVDMAI 192
Query: 356 QLLDGTPFR 364
LLD T FR
Sbjct: 193 MLLDDTDFR 201
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 7/116 (6%)
Query: 392 SKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-------VILRFMFTPAEMRADENLRSELE 444
+K KK +K+ K+ W DD A VILR MFT E+R D +++
Sbjct: 283 AKIIKKTQKLSAKLADWSDDDDPSALREAVNPKYQRVVILRNMFTLDELREDPAALLDIK 342
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
D++EEC K+GPV +V + + G+V V+F+ R+ A+ C+ LM+GR F GR + A
Sbjct: 343 EDIREECAKLGPVTNVVLYDEEEDGIVSVKFRTREAAEACLRLMHGRAFAGRIVEA 398
>gi|448116993|ref|XP_004203148.1| Piso0_000749 [Millerozyma farinosa CBS 7064]
gi|359384016|emb|CCE78720.1| Piso0_000749 [Millerozyma farinosa CBS 7064]
Length = 371
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 166/343 (48%), Gaps = 62/343 (18%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSS------QNDGYGIEEMTFLKEEEVFPTVNVTDDLAND 234
D T Y+++ L W+P+ D + + G GI+E KEE
Sbjct: 45 DKHTEYQYNFVLERWLPKVDNADGYYDNLNHSGAGIDEEEANKEEL-------------R 91
Query: 235 EVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVT 294
+V K+KL ++++N ++ E + K +P NT VYV+
Sbjct: 92 KVKKQKLIEVKDEMNKLNS---EMYASKGGRP---------------------NTGVYVS 127
Query: 295 GLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
LPDDVT +++VE FSK G+I ED +T + RIK+Y + E K +ALV Y + SV LA
Sbjct: 128 NLPDDVTKQDIVEAFSKYGVISEDFKTGEQRIKLYYENEKF--KNEALVIYHNKESVELA 185
Query: 355 TQLLDGTPFRPDGKIPMSVTQAKFEQKG------ERFIAKQVDSKKKKKLKKVEEKMLGW 408
Q+LD + R + V A+F+++ +R + + KK+ +++++++ W
Sbjct: 186 IQMLDDSLLRVGDTKRIRVQPAEFQKESNSSVQEKRQLTAEEKKLLKKRKEQLQKRISNW 245
Query: 409 -----GGRDDAKLTIP-----ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
G DD + I TV++ M +E + + +L D+QEEC KIG +
Sbjct: 246 DDEESGVLDDKAVQIKRKIWDKTVVIENMLHKSEYEENALVELDLTEDIQEECDKIGIGN 305
Query: 459 SV-KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
S+ K+ +V+V+F + + + KCIE N R+F G +++
Sbjct: 306 SITKIAFYDIDEIVIVKFSNPQHSLKCIEAFNNRYFDGLKLNV 348
>gi|453089327|gb|EMF17367.1| hypothetical protein SEPMUDRAFT_146408 [Mycosphaerella populorum
SO2202]
Length = 393
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 149/290 (51%), Gaps = 27/290 (9%)
Query: 240 KLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDD 299
K+ +++ + D + + K N +P+K+ K A+ P S + +T V+V+GLP D
Sbjct: 65 KVAGVDDEAPALDPMKKRKANQDDSEPNKKA-KASASGPATSSAPRR-STAVFVSGLPSD 122
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
V V+E+ F + GII E P+ + RIK+Y DK+ G KG+AL+ Y + SVA+A + D
Sbjct: 123 VDVDEVRACFQRYGIIAESPDDNEKRIKLYNDKD-GNFKGEALIIYFRPESVAMAINMQD 181
Query: 360 GTPFRPDGKIP---MSVTQAK----------------FEQKGERFIAKQVDSKKKKKLKK 400
G F D +P ++V +A E + + K K KKK ++
Sbjct: 182 GYEFPRDPDLPTGRITVAEADHSYKRHKEDSVKDDTVVEDRNAAYKGKPSKVKTKKKAEE 241
Query: 401 VEEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
+ ++ W D + + ++ V+L+++FT E+ DE LR ++ D++EE K G
Sbjct: 242 MNSRLADWSDDDVSTMKQTSSRADKVVVLKYVFTLQELADDEGLREDIMEDMREEAEKHG 301
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V ++ + + GVV +RF + A+ C + +GR + GR + A+ +G
Sbjct: 302 DVKNITIFDREEDGVVTIRFSNAMAARACADACDGRGYSGRPLAATISNG 351
>gi|212542531|ref|XP_002151420.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces marneffei ATCC 18224]
gi|210066327|gb|EEA20420.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces marneffei ATCC 18224]
Length = 275
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 132/272 (48%), Gaps = 66/272 (24%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG Y +D L+ WV D + Q Y +E V+ T+++ E+
Sbjct: 42 EDGQEYGYDTALKRWVQTIDDALLEQQRQAYKVE------------GVDETEEMTRSELR 89
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
K+ KRKQPD + +K+ + +VNT VY+T +P
Sbjct: 90 KK----------------------KRKQPDDENGQKQKRQ--------RVNTAVYITSIP 119
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T++E+ +VF KCG+I E+ ++ +PRIK+Y+D ++G KG+ALV Y + SV LA Q+
Sbjct: 120 LDATLDEIRDVFCKCGVIAEEIDSGRPRIKMYMD-DSGNFKGEALVVYFRPESVNLAIQM 178
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + KKK+ +K+ K+ W
Sbjct: 179 LDESNFRIGQSGPLG--PMKVQAADFSYKSQQDAPTNTNMRDKKKIMKRTQKLNSKLADW 236
Query: 409 GGRDDAKLTIPAT-------VILRFMFTPAEM 433
DD +P T V+L+ MFT E+
Sbjct: 237 D--DDEPSALPDTSSRFDKVVVLKHMFTLEEI 266
>gi|313237739|emb|CBY12877.1| unnamed protein product [Oikopleura dioica]
Length = 331
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 133/269 (49%), Gaps = 21/269 (7%)
Query: 259 HNGKRKQPDKQVE---KKEANKPPDSWFELKVN--THVYVTGLPDDVTVEEMVEVFSKCG 313
H K K P + E K + WFE + + VYV+ LP +T+++ V++ KCG
Sbjct: 38 HPKKTKVPKVEKEGIIKTSTKRKKVEWFEAEETQLSSVYVSNLPKTITIDKFVKLMQKCG 97
Query: 314 IIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSV 373
++KE +T+KP++K+Y D E G KGD L YL SV LA Q+LD + + V
Sbjct: 98 LLKECEKTEKPKVKLYTDNE-GKFKGDGLAHYLAPESVDLALQILDEQDVEGN---KIKV 153
Query: 374 TQAKFEQKGERFIAKQ---VDSKKKKKLKKVEEKMLGWG------GRDDAKLT---IPAT 421
AKFE KG+ K+ ++ ++K KK + K+LGWG G +D K
Sbjct: 154 ELAKFEMKGKFDKTKKKKGMNKREKIAAKKTKNKLLGWGGMGITTGSEDPKTKRARYEKV 213
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
V+ FT E+ D + +++ ++ C G V + + HP G+ V F +DA
Sbjct: 214 VVFSNCFTVDEVARDPTIILQVKDALRAACSGFGAPRKVNMFDGHPNGICSVAFNSAEDA 273
Query: 482 QKCIELMNGRWFGGRQIHASEDDGLVNHA 510
+ I+ ++ R GR + A DG+ ++
Sbjct: 274 DRAIDGLDKRLLRGRTLSAKRWDGVTDYT 302
>gi|169607607|ref|XP_001797223.1| hypothetical protein SNOG_06862 [Phaeosphaeria nodorum SN15]
gi|111064393|gb|EAT85513.1| hypothetical protein SNOG_06862 [Phaeosphaeria nodorum SN15]
Length = 314
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 141/271 (52%), Gaps = 24/271 (8%)
Query: 249 NSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEV 308
N+++ + E N R + K + + A+K P ++ N ++VT LP D +++ E
Sbjct: 19 NTSEGPIVESANDVRDKKRKAKQDEHASKKPKV---VQENRAIWVTNLPLDADKDDIEEA 75
Query: 309 FSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDG- 367
FS+ GII + + +K RIK+Y D E G G+AL+ Y K S+ALA +LDG P R D
Sbjct: 76 FSRYGIIDQGADGEK-RIKMYADDE-GNFNGNALIVYFKRASIALACTMLDGYPLRMDEP 133
Query: 368 -KIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEE--------KMLGWGGRDD--AKL 416
K ++VT+A K + ++Q+ SK +K KK E K+ W ++ A+
Sbjct: 134 EKGTITVTEADPSHKKNK-DSEQIVSKLTRKDKKASERNRADLNRKLAEWSDNEEEVAEA 192
Query: 417 TIP------ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
P I++ FT ++ D +++ D++EE K G V +V + + P+G+
Sbjct: 193 FAPRKNKWAKVCIVKHAFTLDDLEEDPAAYLDIKDDMREEAEKFGDVTNVTLFDKEPEGI 252
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ +RFKD A+KC + NGR F +++ +
Sbjct: 253 LTIRFKDFDAAEKCRDGWNGRSFAFQRLEVT 283
>gi|242016593|ref|XP_002428863.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513606|gb|EEB16125.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 267
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 81/231 (35%), Positives = 119/231 (51%), Gaps = 42/231 (18%)
Query: 178 FTDDDGTRYKWDRGLRAW--VPQEDTSSQNDG-----YGIE-----------EMTFLKEE 219
+ D DG +Y W L W + T +QN YG + T++ +
Sbjct: 30 YEDKDGVKYYWSSELNKWEKCTNDSTETQNSKSETPQYGFDGTHHTYTDPKDNTTYIWDG 89
Query: 220 E---VFPTVNVTDD--LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKE 274
E FP V DD LAN ++ +++T + + DNV +E+ + K+K P +
Sbjct: 90 EKNAWFPKV---DDEFLANYQLNYGFVDNTTKPEDKTDNVTKEEPSLKKKTPQEP----- 141
Query: 275 ANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDK 332
SWFE+ + T VYV+ LP D+T EE V + KCG++ +D T K +IK+Y DK
Sbjct: 142 ------SWFEVDDQHCTKVYVSNLPLDITEEEFVALMQKCGLVMKDINTNKWKIKLYTDK 195
Query: 333 ETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
+ KGDAL TY+K+ SV LA +LDG F+ + +SV +AKFE KGE
Sbjct: 196 TSNELKGDALCTYIKKESVDLALSVLDGYEFKGNK---ISVQRAKFEMKGE 243
>gi|281200292|gb|EFA74513.1| RNA-binding region RNP-1 domain-containing protein [Polysphondylium
pallidum PN500]
Length = 447
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 136/279 (48%), Gaps = 31/279 (11%)
Query: 254 VVEEKHNGKRKQPDKQVEKK-----EANKPPDSWFELKVNTHVYVTGLPDD---VTVEEM 305
+ +K+N RK +K+VE EA+K + F++ + V+ +P D +T+ ++
Sbjct: 129 ITGKKYNRDRKN-NKKVENGDDHDGEASKQEEEKFDI---VDLVVSNIPRDPRLLTINDL 184
Query: 306 VEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRP 365
+ F K G I+E+ + P IK ++ E G + G+A++++ ++ S+ LA QL D T P
Sbjct: 185 YQYFRKAGFIQEN-DHGTPLIKFFM-LEDGGRSGEAVISFARKESLPLAIQLYDETEIVP 242
Query: 366 DGKIPMSVTQAKFEQKGERFIAKQVDS-------KKKKKLKKVEEKMLGWGGRDDAKLTI 418
I +S+ A +Q R I DS R D K +
Sbjct: 243 GYTIKLSI--ASGDQIQSRSIVSSGDSVNKKKKKSNGSGGDNSGGGGKSGDKRSDLKKQM 300
Query: 419 --------PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
V+L+ +F P E + N EL+ D++ C + G + +V V E +P+GV
Sbjct: 301 DYGWEESESRVVVLKNLFDPVESWTNPNFYEELKEDIEMGCQRCGEIQTVTVFERNPEGV 360
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
+V+FKD + A+KC+ LM GR+F RQ+ A DG ++
Sbjct: 361 AIVKFKDYESAEKCVALMEGRFFAQRQLKAELYDGFTDY 399
>gi|185133850|ref|NP_001117650.1| tFZR1 [Oncorhynchus mykiss]
gi|2982698|dbj|BAA25269.1| tFZR1 [Oncorhynchus mykiss]
Length = 447
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 143/270 (52%), Gaps = 40/270 (14%)
Query: 178 FTD-DDGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNV---TDD 230
F D +DGT Y WD +AW P+ ED ++ YG F K+ + P TD
Sbjct: 200 FVDPEDGTVYDWDHDKKAWFPKITEDFLAAYQANYG-----FTKDGQHDPNAACAPDTDT 254
Query: 231 LANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDS--WFELKV- 287
A E GK+ TE++ ++ +QP ++K + D+ WF+++
Sbjct: 255 TAKPEEGKK----TEKRTDT-------------EQPKDGKKEKGEKRKADTAAWFDVETD 297
Query: 288 -NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT+VYV+GLP D+T EE VEV SKCGI+ DP +++ ++K+Y D + G +KGD L YL
Sbjct: 298 KNTNVYVSGLPPDITTEEFVEVMSKCGIVMRDPISEEYKVKLYRDGQ-GNQKGDGLCCYL 356
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER-FIAKQVDSKKKKKLKKVEEKM 405
K+ SVALA +L+D + R + V A+FE KG+ K+ SK +K K ++K
Sbjct: 357 KKESVALAERLIDESEIR---GYQLHVEAARFELKGQYDASKKKKKSKDYRKRMKAQQKQ 413
Query: 406 LGWG--GRDDAKLTIPATVILRFMFTPAEM 433
L W + +A+ +I++ MF P++
Sbjct: 414 LDWRPEKKGEARKRHERVLIIQNMFHPSDF 443
>gi|448119442|ref|XP_004203731.1| Piso0_000749 [Millerozyma farinosa CBS 7064]
gi|359384599|emb|CCE78134.1| Piso0_000749 [Millerozyma farinosa CBS 7064]
Length = 372
Score = 105 bits (261), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 73/231 (31%), Positives = 123/231 (53%), Gaps = 20/231 (8%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT VY++ LPDDVT +++VE F K G+I ED +T + RIK+Y + E K +ALV Y
Sbjct: 121 NTGVYISNLPDDVTKKDIVEAFGKYGVISEDFKTGEQRIKLYYENEKF--KNEALVIYHN 178
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE------RFIAKQVDSKKKKKLKKV 401
+ SV LA Q+LD + + V A+F+++ + R + + KK+ +++
Sbjct: 179 KESVELAIQMLDDSLLHVGDTKRIRVQPAEFQKENDSSVQEKRQLTAEEKKLLKKRKEQL 238
Query: 402 EEKMLGW------GGRDDAKLTIP-----ATVILRFMFTPAEMRADENLRSELEADVQEE 450
++++ W G DD + I TV++ M +E + + +L D+QEE
Sbjct: 239 QKRISNWDDEESGGVLDDKAVQIKRKIWDKTVVIENMLHKSEYEENALVELDLTEDIQEE 298
Query: 451 CVKIGPVDSV-KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
C KIG +S+ K+ +V+V+F + + KCIE N R+F G +++
Sbjct: 299 CDKIGIGNSITKIAFYDMSEIVIVKFSTPQHSLKCIEAFNNRYFDGLKLNV 349
>gi|398411354|ref|XP_003857017.1| hypothetical protein MYCGRDRAFT_67212 [Zymoseptoria tritici IPO323]
gi|339476902|gb|EGP91993.1| hypothetical protein MYCGRDRAFT_67212 [Zymoseptoria tritici IPO323]
Length = 310
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 124/259 (47%), Gaps = 23/259 (8%)
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
L + + VTGLP DV +E+ +VF + G+I E P I I + E G+ G+A++
Sbjct: 61 LDPDNAIVVTGLPLDVNADEIEKVFQRYGVIAETPNDSSKIIHIS-ENEDGLPTGNAVIV 119
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEK 404
+ VA + +L D F G + + K E F ++V K +LK E
Sbjct: 120 FNDLHGVAQSIELQDDAEFSRKGSRKTNKISVRAATK-EDFQREKVRIDKPVRLKSFE-- 176
Query: 405 MLGWGGRDDAKLTIPA---------TVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
D+ + IPA VI+ +FT A++ DE L ++ DV E+ IG
Sbjct: 177 -------DEDEEPIPAPRKSASWEKLVIISNIFTLAQLEEDEELADDIHRDVLEDAETIG 229
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA---SEDDGLVNHAAI 512
PV +V V + P+G+ ++RF+D + AQKC + +GR + GR + A +E
Sbjct: 230 PVKNVVVYDLEPRGICVIRFRDVESAQKCADAWDGRRYNGRAVRAGVPTEAVKFKRSLKT 289
Query: 513 RDLDAEASRLEQFGAELEA 531
R D E RL QF +LEA
Sbjct: 290 RKEDDEEERLAQFSKDLEA 308
>gi|242768287|ref|XP_002341538.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces stipitatus ATCC 10500]
gi|218724734|gb|EED24151.1| nuclear mRNA splicing factor-associated protein, putative
[Talaromyces stipitatus ATCC 10500]
Length = 284
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 128/270 (47%), Gaps = 62/270 (22%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG Y++D L+ WV D + Q Y +E + +EE EV
Sbjct: 41 EDGQEYEYDTALKRWVQTIDDALLEQQRQAYKVEGV----DEE-------------KEVT 83
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
+ +L KRKQPD + +K + +VNT VY+T +P
Sbjct: 84 RSQL-----------------RKKKRKQPDDENGQKPKRQ--------RVNTAVYITSIP 118
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D T++E+ +VF KCG+I E+ ++ +PRIK+Y+D E G KG+ALV Y + SV LA Q+
Sbjct: 119 LDATLDEIRDVFCKCGVIAEEIDSGRPRIKMYMD-EAGNFKGEALVVYFRPESVNLAIQM 177
Query: 358 LDGTPFR-----PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL----KKVEEKMLGW 408
LD + FR P G PM V A F K ++ + + + KKK+ +K+ K+ W
Sbjct: 178 LDESDFRIGQSGPSG--PMKVQAADFSFKSQQDVPTSTNMRDKKKIMKRTQKLNSKLADW 235
Query: 409 GGRD-----DAKLTIPATVILRFMFTPAEM 433
+ D V+L+ MFT E+
Sbjct: 236 DDDEPSAVLDTSSRFDKVVVLKHMFTLEEI 265
>gi|313240200|emb|CBY32549.1| unnamed protein product [Oikopleura dioica]
Length = 331
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 121/242 (50%), Gaps = 18/242 (7%)
Query: 282 WFELKVN--THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
WFE + + VYV+ LP +T+++ V++ KCG++K+ +T+KP++K+Y D E G KG
Sbjct: 64 WFEAEETQLSSVYVSNLPKTITIDKFVKLMQKCGLLKQCEKTEKPKVKLYTDNE-GKFKG 122
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL- 398
D L YL SV LA Q+LD + + V AKFE KG+ K+ KK+
Sbjct: 123 DGLAHYLAPESVDLALQILDEQDLEGN---KIKVELAKFEMKGKFDKTKKKKGMNKKEKI 179
Query: 399 --KKVEEKMLGWGGR---------DDAKLTIPATVILRFMFTPAEMRADENLRSELEADV 447
KK + K+LGWGG + V+ FT E+ D + +++ +
Sbjct: 180 AAKKTKNKLLGWGGMGITTGSEDPKTKRARYEKVVVFSNCFTVDEVARDPTIILQVKDAL 239
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV 507
+ C G V + + HP G+ V F +DA + I+ ++ R GR + A DG+
Sbjct: 240 RAACSGFGAPRKVNMFDGHPNGICSVAFNSAEDADRAIDGLDKRLLRGRTLSAKRWDGVT 299
Query: 508 NH 509
++
Sbjct: 300 DY 301
>gi|296004899|ref|XP_002808796.1| RNA binding protein, putative [Plasmodium falciparum 3D7]
gi|225632185|emb|CAX64069.1| RNA binding protein, putative [Plasmodium falciparum 3D7]
Length = 492
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 121/229 (52%), Gaps = 19/229 (8%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGLP+DV EE+ EVF K GIIK D E +P+IKIY D + KGDALVTY+
Sbjct: 205 NSSIYITGLPNDVVKEEIYEVFKKAGIIKIDTERNEPQIKIYYDDNNNI-KGDALVTYVY 263
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFI----AKQVDSKKKKKLKKVEE 403
SV +A + D FR + I V +A+F K ++ I + + KKK K K+E+
Sbjct: 264 TQSVDMAIKYFDNFLFRQNCII--HVEKAQFNNKKKQVIKISKEEILKKKKKIKAAKLEQ 321
Query: 404 KMLGWGGRDDAKLTIPATVILRFMFTPAE-MRADEN------LRSELEADVQEECVKIGP 456
L GG + T V+ R +F+ + M+ DE +++ LE +++ K P
Sbjct: 322 LRLQRGG-EVYTGTKKKIVVFRNVFSYEDAMKYDEGDSFYEFIKNMLEMEIK----KYVP 376
Query: 457 VDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V V HP G+V V+FK ++A+ + N G+++ DG
Sbjct: 377 VHKVYPIPKHPHGIVCVKFKGVEEAETVVSCFNDIELNGKKLEVYFYDG 425
>gi|195127253|ref|XP_002008083.1| GI13305 [Drosophila mojavensis]
gi|193919692|gb|EDW18559.1| GI13305 [Drosophila mojavensis]
Length = 265
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 113/207 (54%), Gaps = 16/207 (7%)
Query: 307 EVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPD 366
E+ KCG++ DP+T+K ++K+Y + + G KGD L Y+K SV LA ++LD R
Sbjct: 2 ELMGKCGLVMRDPQTQKYKLKLYTEAD-GQIKGDGLCDYIKVESVNLALEILDDYNLRGH 60
Query: 367 GKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEEKMLGW------GGRDDAKLTI 418
+ V +A+F+ +GE + + K K+KL+K++EK+ W G R +
Sbjct: 61 ---KIHVQRAQFQMRGEYNPALKPKRKKKDKEKLQKIKEKLFDWRPDKMRGERSKNE--- 114
Query: 419 PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDR 478
TVI++ +FTP + L E + ++EEC K G V V + + HP+G+ +
Sbjct: 115 -KTVIIKNLFTPELFEKEVELILEYQNSLREECSKCGMVRKVVIYDRHPEGIAQINMSTP 173
Query: 479 KDAQKCIELMNGRWFGGRQIHASEDDG 505
++A I++M GR+FG RQ+ A DG
Sbjct: 174 EEADMVIQMMQGRYFGQRQLSADHWDG 200
>gi|403221188|dbj|BAM39321.1| uncharacterized protein TOT_010001193 [Theileria orientalis strain
Shintoku]
Length = 255
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 109/222 (49%), Gaps = 27/222 (12%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
VYV+GLP D ++ E+ EVF K G+IK DP T P+IK+Y D E G K D VT++ + S
Sbjct: 6 VYVSGLPKDTSLAEVAEVFKKAGLIKIDPLTTLPKIKLYTD-ENGDLKSDGTVTFVNKES 64
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKG---ERFIAKQVDSKKKKKL---KKVEEK 404
V A + LD FR + I V AKFE + + + V S+ +KK K EE+
Sbjct: 65 VEFALRYLDNYHFRENCVI--HVEPAKFEPRSNQQNKPVPASVKSELRKKYLAAKYEEER 122
Query: 405 MLGWG-GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC 463
+ W DD T VI + MF+ + +E E K+ P+
Sbjct: 123 LKSWSDNLDDG--TGRRIVISKPMFS---------MEDAMEIKKYVEVEKVTPI------ 165
Query: 464 ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
HPQGVV ++FK+ DA+ I N R F GR + DG
Sbjct: 166 ARHPQGVVCIKFKNSLDAEVFISKFNNRLFDGRSLEVYFFDG 207
>gi|452847575|gb|EME49507.1| hypothetical protein DOTSEDRAFT_68316 [Dothistroma septosporum
NZE10]
Length = 403
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 134/278 (48%), Gaps = 32/278 (11%)
Query: 277 KPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGM 336
KPP + NT V+VTGLP DV EE+ + F K G+I E + + R+K+Y DK G
Sbjct: 120 KPP------RQNTAVFVTGLPQDVDHEEVRDHFKKFGMISESIDDNEKRVKLYNDK-GGN 172
Query: 337 KKGDALVTYLKEPSVALATQLLDGT--PFR----PDGKIPMSVTQAKFEQKGERFIA--- 387
KG+AL+ Y + SV LA + DG P R P I + + F+ + +A
Sbjct: 173 FKGEALIIYYRPESVKLAIDMADGAYLPRRDASAPTASISVVAADSSFKAHKDDTVAEER 232
Query: 388 -----KQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADE 437
K +K K+K ++ ++ W + + + ++ VI++ +FT + D+
Sbjct: 233 PKGKPKGSRAKAKQKADEMNSRLADWSDDEPSAMQQTSSRFDKVVIIKNVFTLKALEEDK 292
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+ ++ D++E G + ++ + + GVV +RF + A+ C + NGR + GR+
Sbjct: 293 DYYEDIMDDMREAGAH-GDIKNITIFDKEEDGVVTIRFSNAMAAKACADTFNGRGYDGRR 351
Query: 498 IHASEDDGL-----VNHAAIRDLDAEASRLEQFGAELE 530
+ AS GL A D + EA RLE + +E
Sbjct: 352 LEASIATGLEKFKKSRKAKTGDDEEEAKRLEDYSNFIE 389
>gi|190409254|gb|EDV12519.1| hypothetical protein SCRG_03412 [Saccharomyces cerevisiae RM11-1a]
Length = 285
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ KKL
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKKL 158
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 159 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 217
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V+ FK+ K A +C ++M GR+F G+++ A
Sbjct: 218 QVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDGQKLLA 260
>gi|323303306|gb|EGA57102.1| Cus2p [Saccharomyces cerevisiae FostersB]
Length = 285
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGXFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ KKL
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKKL 158
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 159 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 217
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V+ FK+ K A +C ++M GR+F G+++ A
Sbjct: 218 QVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDGQKLLA 260
>gi|151944261|gb|EDN62540.1| cold sensitive u2 snRNA suppressor [Saccharomyces cerevisiae
YJM789]
gi|323352832|gb|EGA85134.1| Cus2p [Saccharomyces cerevisiae VL3]
gi|365763615|gb|EHN05142.1| Cus2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 285
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ KKL
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKKL 158
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 159 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 217
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V+ FK+ K A +C ++M GR+F G+++ A
Sbjct: 218 QVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDGQKLLA 260
>gi|6324043|ref|NP_014113.1| Cus2p [Saccharomyces cerevisiae S288c]
gi|1730649|sp|P53830.1|CUS2_YEAST RecName: Full=Cold sensitive U2 snRNA suppressor 2
gi|1302367|emb|CAA96203.1| CUS2 [Saccharomyces cerevisiae]
gi|285814379|tpg|DAA10273.1| TPA: Cus2p [Saccharomyces cerevisiae S288c]
gi|392297067|gb|EIW08168.1| Cus2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 285
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ K+L
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKRL 158
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 159 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 217
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V+ FK+ K A +C ++M GR+F G+++ A
Sbjct: 218 QVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDGQKLLA 260
>gi|328716153|ref|XP_001951637.2| PREDICTED: HIV Tat-specific factor 1 homolog [Acyrthosiphon pisum]
Length = 315
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 113/207 (54%), Gaps = 30/207 (14%)
Query: 161 GRDDHDGVPASPEGEDEFTDD-DGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFL 216
G+ HDG+ + +TD DGT Y WDR AW P+ D + YG F
Sbjct: 130 GKYGHDGITQT------YTDPADGTVYIWDREKNAWFPKIDDDFMAHYQLSYG-----FN 178
Query: 217 KEEEVFPTVNVTDDLANDEVGKE-KLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEA 275
+ +VN TD L + +G + K N +E +N++D V+E + KRK P +
Sbjct: 179 STDSTTASVNSTDTLKTNTIGAQSKENQSESTLNASDVVLEPE---KRKLP-------QP 228
Query: 276 NKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKE 333
++P +WFE+ + NT VYV+ LP D+T +E +++ KCG+I +D ++ + +IK+Y ++
Sbjct: 229 SEP--NWFEIDEEHNTKVYVSNLPLDITEQEFIDLMQKCGLIMKDIDSGQMKIKLYTERG 286
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDG 360
T + KGDAL TY+K V + LL+
Sbjct: 287 TDILKGDALCTYIKVLHVFIIILLLNS 313
>gi|336262820|ref|XP_003346192.1| hypothetical protein SMAC_05729 [Sordaria macrospora k-hell]
gi|380093521|emb|CCC08484.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 438
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 103/192 (53%), Gaps = 25/192 (13%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGKEK 240
DDGT Y++D GL+ W+P I+E + + + T DD + GK +
Sbjct: 41 DDGTEYEFDEGLKRWLPI-----------IDEALIEEHQRAYITSLGDDDTETGQSGKGQ 89
Query: 241 LNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDV 300
+ K N ++ +N ++P K ++++A + P K NT VYVTGLP D
Sbjct: 90 --GKKRKNNDREDSASSNNNMNNQRPPKNQKREKAPREP------KQNTAVYVTGLPLDA 141
Query: 301 TVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
T +E+ E+FS KCG+I E+ ++ +PRIK+Y D G KGDAL+ + K SV +A LLD
Sbjct: 142 TADEVAELFSRKCGVIAEEIDSGRPRIKMYTDA-YGKFKGDALIVFFKPQSVDMAIMLLD 200
Query: 360 GTPFR----PDG 367
T FR PDG
Sbjct: 201 DTDFRFEPAPDG 212
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 64/119 (53%), Gaps = 14/119 (11%)
Query: 396 KKLKKVEEKMLGWGGRD---DAKLTIPA-----------TVILRFMFTPAEMRADENLRS 441
KK +K+ K+ W + A+L PA VILR MFT E+ D
Sbjct: 293 KKTQKLSAKLADWDDDEPSPSARLLGPAIKEAKGGKWDKVVILRHMFTLEELNEDPAALL 352
Query: 442 ELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
E++ D++EEC K+GPV +V + + +G+V V+F + A+ C+ LM+GR F GR + A
Sbjct: 353 EIKDDIREECAKLGPVTNVVLFDQEEEGIVSVKFATVEAAEACVHLMHGRAFDGRIVEA 411
>gi|259149083|emb|CAY82325.1| Cus2p [Saccharomyces cerevisiae EC1118]
Length = 300
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ K+L
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKRL 158
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 159 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 217
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V+ FK+ K A +C ++M GR+F G+++ A
Sbjct: 218 QVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDGQKLLA 260
>gi|207341950|gb|EDZ69866.1| YNL286Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|323331840|gb|EGA73252.1| Cus2p [Saccharomyces cerevisiae AWRI796]
Length = 246
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 5 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 62
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ KKL
Sbjct: 63 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKKL 119
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 120 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 178
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V+ FK+ K A +C ++M GR+F G+++ A
Sbjct: 179 QVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDGQKLLA 221
>gi|349580664|dbj|GAA25823.1| K7_Cus2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 285
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/245 (32%), Positives = 136/245 (55%), Gaps = 20/245 (8%)
Query: 268 KQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRI 326
KQ+++ K + NT +Y++GLP D T +E + E F K G+I+ + + + P
Sbjct: 24 KQLKESNLQKRELEYNNASKNTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLC 82
Query: 327 KIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERF 385
K+YV+ + G KGDAL+TY KE SV LA ++++ + F GK + V +A+F+ K G+
Sbjct: 83 KLYVN-DKGAFKGDALITYSKEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNM 138
Query: 386 IAKQVDSKK-------KKKLKKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRA 435
K+ D K+ K+LKK + + G DD L TVI +F +
Sbjct: 139 HGKENDLKEFNGPEPPIKRLKKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYT 198
Query: 436 DENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
++++ ++++ D+ E C +IG VDS+ V N + V+ FK+ K A +C ++M GR+F G
Sbjct: 199 NDDI-NDIQEDLLEGCEEIGQVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRYFDG 255
Query: 496 RQIHA 500
+++ A
Sbjct: 256 QKLLA 260
>gi|396463116|ref|XP_003836169.1| hypothetical protein LEMA_P055100.1 [Leptosphaeria maculans JN3]
gi|312212721|emb|CBX92804.1| hypothetical protein LEMA_P055100.1 [Leptosphaeria maculans JN3]
Length = 305
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 140/269 (52%), Gaps = 27/269 (10%)
Query: 253 NVVEEKHNGKRKQPD----KQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEV 308
++VE N + +P+ K E+ EA K P +++ N ++V+ LP D T +E+ E
Sbjct: 22 SLVESAPNNENSEPETKKRKTEEEPEAVKKP----KIQENRAIFVSNLPLDTTKDELEET 77
Query: 309 FSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPD-- 366
F K GI+ + E PRIK+Y D E G G+AL+ Y ++ S+ LA Q++D FR +
Sbjct: 78 FKKYGILDKSTEGL-PRIKMYEDDE-GKFNGEALIVYFRKDSIKLAIQMMDDYFFRIEDQ 135
Query: 367 --GKIPMSVTQAKFEQKGE-RFIAKQVDSKKKKKLKK----VEEKMLGWGGRDD--AKLT 417
G+I + +++ E IAK++ + KK ++ + K+ W ++ A+
Sbjct: 136 SAGRIRVKEADMSYKRNKEGDEIAKKLSRQDKKASERNRADLNRKLAEWSDNEEEVAEKY 195
Query: 418 IPA------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVV 471
P V+L++ FT E+ D E++ D++EE + G + +V + + G+
Sbjct: 196 APKKNKWAKVVVLKYTFTLEELAEDAANYLEIKDDIREEAERFGTITNVTLYDKEADGIC 255
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VRF++ +DA+K + NG+ F R++ A
Sbjct: 256 TVRFREFEDAEKFCKHANGKPFAYRKLEA 284
>gi|451848160|gb|EMD61466.1| hypothetical protein COCSADRAFT_231870 [Cochliobolus sativus
ND90Pr]
Length = 317
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 139/269 (51%), Gaps = 26/269 (9%)
Query: 252 DNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSK 311
D+ + G+ + K ++ EANK P ++ N V+++ LP D T +E+ + FS+
Sbjct: 23 DDGAADAQAGQNSKKRKNGKEPEANKKP----KVIENKAVWISNLPPDTTAKEIEDEFSR 78
Query: 312 CGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRP----DG 367
GII + + + PRIK+Y+D ETG G+A+V Y ++ ++ A ++D RP +G
Sbjct: 79 FGIIDKGADGQ-PRIKMYMDDETGKFTGNAMVVYFRKEAITNAVNMMDDYVLRPGDYSNG 137
Query: 368 KIPMSVTQAKFEQKGER---FIAKQVDSKKKKKLKK----VEEKMLGWGGRDD--AKLTI 418
I V AK E K ER IA ++ K +K ++ + K+ W ++ A+
Sbjct: 138 TI--RVEPAKIEHKKERDGDKIASKLTRKDRKASERNRAELNRKLNEWSDNEEEVAEAFA 195
Query: 419 PA------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
P VI++ FTPAE+ + E++ +++E + G V + + + P+G+V
Sbjct: 196 PKKNKWAKVVIIKHAFTPAELDEEPEAYLEIKEEMREAAEEYGEVTNCTLYDKEPEGIVT 255
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
VRF++ + A+K + GR + R++ S
Sbjct: 256 VRFREFEPAEKFMADYQGRGYQKRKLALS 284
>gi|326479238|gb|EGE03248.1| nuclear mRNA splicing factor-associated protein [Trichophyton
equinum CBS 127.97]
Length = 262
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 88/266 (33%), Positives = 121/266 (45%), Gaps = 52/266 (19%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
D+G Y +D L+ W+P D S Q Y I+ + DE
Sbjct: 37 DNGEEYTYDDALKRWIPSLDESLMEQQRQAYKIQGV--------------------DEEE 76
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
L + +EK KRK D+ ++ KP +VNT VYVT +P
Sbjct: 77 PANLKTLQEK------------KKKRKHNDESNAAQKPKKP-------RVNTAVYVTTIP 117
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D TV E+ +FSKCG+I E+ ++ KPRIK+Y D E G KGDALV Y + SV LA Q+
Sbjct: 118 LDATVSEISSLFSKCGVIAEEIDSGKPRIKMYTD-EQGTFKGDALVVYFRPESVNLAIQM 176
Query: 358 LDGTPFRPDGKIP---MSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKM---LGWGGR 411
LD T FR K M V A F K + + + + K K+ + +K+ L
Sbjct: 177 LDDTDFRFGEKGAEGNMRVQPADFSFKAVQETPAKANMRDKMKIIRKTQKLNNKLTDWDD 236
Query: 412 DDAKLTIPAT---VILRFMFTPAEMR 434
DD L I +L+ MFT E+R
Sbjct: 237 DDVGLDIQERQEGCVLKHMFTLQEVR 262
>gi|45269922|gb|AAS56342.1| YNL286W [Saccharomyces cerevisiae]
Length = 285
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ K+L
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKRL 158
Query: 399 KKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG 455
KK + + G DD L TVI +F + ++++ ++++ D+ E C +IG
Sbjct: 159 KKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYKSYTNDDI-NDIQEDLLEGCEEIG 217
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VDS+ V N + V++ K+ K A +C ++M GR+F G+++ A
Sbjct: 218 QVDSISVSPNKGEATVVL--KNNKVALQCCKIMTGRYFDGQKLLA 260
>gi|156095177|ref|XP_001613624.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802498|gb|EDL43897.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 527
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 112/208 (53%), Gaps = 12/208 (5%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGLP D+T EE+ VF K GIIK D ET +P+IKIY D E KGDALVTY+
Sbjct: 246 NSCIYITGLPSDITREEIHSVFKKAGIIKIDAETTEPKIKIYYD-ENNQVKGDALVTYVY 304
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE-RFIAKQ--VDSKKKKKLKKVEE- 403
SV +A + D R D I V +A+F +K E I+K+ + KKK K K E+
Sbjct: 305 TQSVDIAIKYFDNFYLRQDCMI--RVEKAQFNKKKEASNISKEEMLIKKKKIKAAKYEQL 362
Query: 404 KMLGWGGRDDAKLTIPATVILRFMFTPAE-MRADEN--LRSELEADVQEECVKIGPVDSV 460
++ WG D T VI R +F+ + ++ DE ++ V+ E K PV V
Sbjct: 363 RLQKWG--DGYTGTKKKIVIFRNVFSYEDALKHDEGDPFYDFIKDLVEMEVKKYAPVHKV 420
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELM 488
HP G+V V+FK ++A+ +
Sbjct: 421 YPIPKHPNGIVCVKFKGVEEAEMIVSCF 448
>gi|402911564|ref|XP_003918392.1| PREDICTED: HIV Tat-specific factor 1 homolog [Papio anubis]
Length = 645
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 94/176 (53%), Gaps = 7/176 (3%)
Query: 338 KGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKK 397
KGD L YLK SV LA +LLD R + V AKF+ KGE +K+ K K
Sbjct: 76 KGDGLCCYLKRESVELALKLLDEDEIR---GYKLHVEVAKFQLKGEYDASKKKKKCKDYK 132
Query: 398 LK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVK 453
K +++K L W A ++ VI++ MF P + D + +E+ D++ EC K
Sbjct: 133 KKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSK 192
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
G + + + + HP GV V F+D ++A CI+ ++GRWFGGRQI A DG ++
Sbjct: 193 FGQIRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDY 248
>gi|452987670|gb|EME87425.1| hypothetical protein MYCFIDRAFT_75283 [Pseudocercospora fijiensis
CIRAD86]
Length = 401
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 136/272 (50%), Gaps = 32/272 (11%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+T V+V+GLP DV EE+ EVFSK GII E E + R+K+Y D G KG+AL+ + +
Sbjct: 122 STAVFVSGLPLDVDTEEVREVFSKYGIIAESAEDNEKRVKLYNDA-NGNFKGEALIIFYR 180
Query: 348 EPSVALATQLLDGTPF-----RPDGKIPMSVTQAKFEQKGERFIAKQVDSKK----KKKL 398
SV A L DG + +P I + + +++ + +A + K K KL
Sbjct: 181 PESVRQAVMLADGMLWPRDFGQPTSTISVIEADSTYKKSNDDTVAPERGPSKARPSKAKL 240
Query: 399 KKVEEKM---LG-WGGRDDAKLTIPAT-------VILRFMFTPAEM---RADENLRSELE 444
K+ E+M LG W DD T+P T V+++ F E + DE++R ++
Sbjct: 241 KRKAEEMNQRLGEWS--DDDISTMPQTSSRYDKVVVVKNAFKLEEFENQKPDEDVRQDIY 298
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG-RQIHASED 503
D+ EE K G V +++ + +GVV +RF+ + A ++GR FG QI AS
Sbjct: 299 DDMFEEGGKYGTVKHIEIFDLEEEGVVTIRFQTAEAASAFARAIDGRRFGSPMQIQASIS 358
Query: 504 DG----LVNHAAIRDLDA-EASRLEQFGAELE 530
G N +A EA RLEQ+ ++E
Sbjct: 359 TGDERFKKNRKTAEQKEAEEARRLEQYSKDIE 390
>gi|451999201|gb|EMD91664.1| hypothetical protein COCHEDRAFT_1175834 [Cochliobolus
heterostrophus C5]
Length = 404
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 139/269 (51%), Gaps = 26/269 (9%)
Query: 252 DNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSK 311
D+ + G+ + K ++ EANK P ++ N ++++ LP D T +E+ + FS+
Sbjct: 110 DDGAADAQAGQNSKKRKNGKEPEANKKP----KVIENKAIWISNLPPDTTAKEIEDEFSR 165
Query: 312 CGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRP----DG 367
GII + + + PRIK+Y+D ETG G+A+V Y ++ ++ A ++D RP +G
Sbjct: 166 FGIIDKGADGQ-PRIKMYMDDETGKFTGNAMVVYFRKEAITNAVNMMDDYVLRPGDYSNG 224
Query: 368 KIPMSVTQAKFEQKGER---FIAKQVDSKKKKKLKK----VEEKMLGWGGRDD--AKLTI 418
I V AK E K ER IA ++ K +K ++ + K+ W ++ A+
Sbjct: 225 TI--RVEPAKIEHKKERDGDKIASKLTRKDRKASERNRAELNRKLNEWSDNEEEVAEAFA 282
Query: 419 PA------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVL 472
P VI++ FTPAE+ + E++ +++E + G V + + + P+G+V
Sbjct: 283 PKKNKWAKVVIIKHAFTPAELDEEPEAYLEIKEEMREAAEEYGEVTNCTLYDKEPEGIVT 342
Query: 473 VRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
VRF++ + A+K + GR + R++ S
Sbjct: 343 VRFREFEPAEKFMADYQGRGYQRRKLALS 371
>gi|255710505|ref|XP_002551536.1| KLTH0A01716p [Lachancea thermotolerans]
gi|238932913|emb|CAR21094.1| KLTH0A01716p [Lachancea thermotolerans CBS 6340]
Length = 296
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 135/274 (49%), Gaps = 49/274 (17%)
Query: 261 GKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPE 320
+RK+ E+KE K P LK N +Y++ LP +VT +E++E F+K G+I++D +
Sbjct: 21 ARRKRARDNTERKEDTKRP-----LK-NCAIYISHLPLEVTKDEVIEEFTKYGVIRKDLK 74
Query: 321 TKKPRIKIY--VDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF 378
+ +P+ K Y VD G +G AL+ Y++ SV +A L+DG F + + V +A F
Sbjct: 75 SSEPKCKFYYGVD---GSFEGAALIVYMRPESVRMAVDLMDGYSFMGN---KLKVEEATF 128
Query: 379 E-------------QKGERFIAKQVD-SKKKKKLKKVEEKMLGWGGRDDAKLT------- 417
+ Q+G R ++ + + KL + E ++ W D K T
Sbjct: 129 KKEPKDDKKSPNASQEGSRDVSAEPKLNNIHAKLNEQERELQDWDDTTDEKTTPREEGSE 188
Query: 418 ---IP--------ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENH 466
IP TV+L + A + + +E+ AD++E C IG V S + +
Sbjct: 189 VDSIPDESENASSRTVVLANVLDLYANLAPQQI-AEVAADLKEGCEAIGSVSSFEF--DE 245
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
G V +K + AQKC +LMNGR+F GR++ A
Sbjct: 246 VLGQAKVEYKSSEIAQKCCQLMNGRYFDGRKLVA 279
>gi|221052463|ref|XP_002257807.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193807638|emb|CAQ38143.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 525
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 112/223 (50%), Gaps = 8/223 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGLP D+T EE+ VF K GIIK D ET +P+IKIY D E KGDALVTY+
Sbjct: 240 NSCIYITGLPSDITREEIHNVFKKAGIIKIDAETTEPKIKIYYD-EKNQVKGDALVTYVY 298
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE--RFIAKQVDSKKKKKLKKVEEKM 405
SV +A + D R D I V +A+F +K E + +++ KKKK E++
Sbjct: 299 TQSVDIAIKYFDNFYLRQDCMI--RVEKAQFNKKKEASKVSKEEMLIKKKKIKAAKYEQL 356
Query: 406 LGWGGRDDAKLTIPATVILRFMFTPAE-MRADEN--LRSELEADVQEECVKIGPVDSVKV 462
D T VI R +F+ + ++ DE ++ ++ E K PV V
Sbjct: 357 RLQKWEDGYTGTKKKIVIFRNVFSYEDAVKHDEGDPFYDFIKDLIETEVKKYVPVHKVYP 416
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
HP G+V V+FK ++A+ + +++ DG
Sbjct: 417 IPKHPNGIVCVKFKGVEEAEMIVSYFKDMELNDKKLEVYFYDG 459
>gi|289742005|gb|ADD19750.1| transcription elongation factor tAt-SF1 [Glossina morsitans
morsitans]
Length = 418
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 173 EGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLA 232
+G+ +TD DG+ + WD AW P+ D F+ ++ N DL
Sbjct: 212 DGQRIYTDKDGSIFFWDTQKSAWFPKIDDE------------FMARYQMSYGFN---DLK 256
Query: 233 NDEVGKEKLNSTEE----KVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL--K 286
++ +E+ S +E K + E K +G+ + +KQ K++ + WF++
Sbjct: 257 ENDQNEEETRSAQEVAFTKSEQQTEMDEIKSSGESYEMNKQCSKRKQQQEAPKWFDIDPA 316
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT VYV+ LP D+T++E E+ KCG+I DP+++K ++K+Y + + G KGD L Y+
Sbjct: 317 HNTKVYVSNLPLDITLDEFAELMGKCGLIMRDPQSQKFKLKLYTEAD-GQIKGDGLCDYI 375
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
K SV LA ++LD R + V +AKF+ +GE
Sbjct: 376 KVESVDLALKILDEYVLRGHK---IRVQRAKFQMRGE 409
>gi|67469647|ref|XP_650802.1| RNA-binding protein [Entamoeba histolytica HM-1:IMSS]
gi|56467458|gb|EAL45416.1| RNA-binding protein, putative [Entamoeba histolytica HM-1:IMSS]
gi|407042178|gb|EKE41185.1| RNA-binding protein, putative [Entamoeba nuttalli P19]
gi|449709297|gb|EMD48584.1| RNA-binding protein, putative [Entamoeba histolytica KU27]
Length = 255
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/248 (29%), Positives = 121/248 (48%), Gaps = 25/248 (10%)
Query: 264 KQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKK 323
KQP K+ +K+ NK + +V + V+ +P T+E+ V+ F CGI K+ T+
Sbjct: 6 KQP-KRSKKRSHNKLLPKTVQKEVMNGIRVSNIPIKYTIEQFVDYFRHCGIAKKKTPTQY 64
Query: 324 PRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
+ ++ K + ++ +L E SV A + D + P I + E
Sbjct: 65 D--AYFSNENESNKTKEGVLYFLHEESVQKAIEYYDNSQIEPKCFIQI-----------E 111
Query: 384 RFIAKQVDSK-KKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSE 442
++ DS +KK + E W DD KL V+++ MF + + DE +E
Sbjct: 112 GIFSQPEDSNPNEKKYDQTRED--EW---DDNKL---VHVVIKNMFDLKKPK-DEQFFNE 162
Query: 443 LEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
L+ DV+EE K G V+ V V +P+G+V+++FKD A++C+ LMNGRWF Q+
Sbjct: 163 LKEDVEEEVKAKCGSVEKVTVFNTNPEGIVIIKFKDHSAAEQCVALMNGRWFDKHQLSCD 222
Query: 502 EDDGLVNH 509
DG N+
Sbjct: 223 YYDGYTNY 230
>gi|167389083|ref|XP_001738810.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897801|gb|EDR24873.1| hypothetical protein EDI_216920 [Entamoeba dispar SAW760]
Length = 255
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 118/249 (47%), Gaps = 23/249 (9%)
Query: 264 KQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKK 323
KQP K+ +K+ NK + +V + V+ +P T+E+ V+ F CGI K+ T+
Sbjct: 6 KQP-KRSKKRSHNKLLPKTVQKEVMNGIRVSNIPIQYTIEQFVDYFRHCGIAKKKTPTQY 64
Query: 324 PRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
+ ++ K + ++ +L E SV A + D + P I +
Sbjct: 65 D--AYFCNENENDKTKEGILYFLHEESVQKAIEYYDNSQIEPKCFIQIEGI--------- 113
Query: 384 RFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSEL 443
F + + +KK + E W DD KL V+++ MF E + D +EL
Sbjct: 114 -FSQPENSNPNEKKYDQTRED--EW---DDNKL---VHVVIKNMFDLKEPK-DLQFFNEL 163
Query: 444 EADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASE 502
+ DV+EE K G V+ V V +P+G+V+++FKD A++CI LMNGRWF Q+
Sbjct: 164 KEDVEEEVKAKCGSVEKVTVFNTNPEGIVIIKFKDHNAAEQCIALMNGRWFDKHQLSCDY 223
Query: 503 DDGLVNHAA 511
DG N+
Sbjct: 224 YDGYTNYKV 232
>gi|402078096|gb|EJT73445.1| splicing factor U2AF-associated protein 2 [Gaeumannomyces graminis
var. tritici R3-111a-1]
Length = 415
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 91/190 (47%), Gaps = 34/190 (17%)
Query: 182 DGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEV----- 236
DG Y+WD GL+ W+P D + L EE +V+D AND
Sbjct: 38 DGAEYEWDGGLKRWIPVLDDA-------------LIEEHQAAYGHVSD--ANDSPAGPPA 82
Query: 237 -GKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTG 295
G K + A N + G+R P + + NT VYVTG
Sbjct: 83 GGARKRKHDDTTAGGASN----SNGGRRGGPKRHAASAAPPA-------PRQNTAVYVTG 131
Query: 296 LPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALA 354
LP D T++E+ E+FS KCG+I E+ + +PRIK+Y D + G KGDAL+ + K SV +A
Sbjct: 132 LPPDATLDEIAELFSRKCGVIAEEIDGNRPRIKMYTDAD-GNFKGDALIVFFKPQSVDMA 190
Query: 355 TQLLDGTPFR 364
LLD T FR
Sbjct: 191 IMLLDDTDFR 200
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 55/85 (64%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
TVIL+ MFT E+ D +++ D+++EC K+G V ++ + + +G+V VRF+ ++
Sbjct: 312 TVILKHMFTLQELDDDPAALLDIKEDIRDECEKLGDVTNIVLFDQEAEGIVSVRFRQQQA 371
Query: 481 AQKCIELMNGRWFGGRQIHASEDDG 505
A+ CI+LM+GR F GR + AS G
Sbjct: 372 AETCIKLMHGRSFDGRTVEASYSTG 396
>gi|389631769|ref|XP_003713537.1| splicing factor U2AF-associated protein 2 [Magnaporthe oryzae
70-15]
gi|351645870|gb|EHA53730.1| splicing factor U2AF-associated protein 2 [Magnaporthe oryzae
70-15]
gi|440465561|gb|ELQ34880.1| splicing factor U2AF-associated protein 2 [Magnaporthe oryzae Y34]
gi|440478553|gb|ELQ59372.1| splicing factor U2AF-associated protein 2 [Magnaporthe oryzae P131]
Length = 410
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/188 (36%), Positives = 89/188 (47%), Gaps = 46/188 (24%)
Query: 181 DDGTRYKWDRGLRAWVPQEDTS---SQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVG 237
+DG Y+WD GL+ W+P D + S YG E+E P D
Sbjct: 48 EDGAEYEWDGGLKRWIPIVDDALIESHQAAYG----GGAAEDEEAPATTAAD-------- 95
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
++K + D+ GK K+P + NT VYVTGLP
Sbjct: 96 -----RRKKKRKAGDD-----DGGKAKRP-------------------RQNTAVYVTGLP 126
Query: 298 DDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQ 356
DVTVEE+ E+FS KCG+I E+ + PRIK+Y D G KGDALV + K SV +A
Sbjct: 127 LDVTVEEIAELFSRKCGVIAENIDDSSPRIKLYTDG-GGNFKGDALVVFFKPQSVDMAIM 185
Query: 357 LLDGTPFR 364
LLD T FR
Sbjct: 186 LLDDTDFR 193
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
TVILR MFT E+ D E++ D+++EC K+G V + + + G+V VRF+
Sbjct: 307 TVILRHMFTLKELDEDPAALLEIKEDIRDECEKLGEVTNCVLFDLEEDGIVSVRFRTPDA 366
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
A+ CI+LM+GR F GR + AS
Sbjct: 367 AEACIKLMHGRSFDGRTVQAS 387
>gi|367014075|ref|XP_003681537.1| hypothetical protein TDEL_0E00830 [Torulaspora delbrueckii]
gi|359749198|emb|CCE92326.1| hypothetical protein TDEL_0E00830 [Torulaspora delbrueckii]
Length = 290
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 116/228 (50%), Gaps = 29/228 (12%)
Query: 289 THVYVTGLPDDVTVE-EMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+ +Y++ LP VE E++ FSK G IK+D + + K+Y D + G KGDAL+ Y +
Sbjct: 42 SAIYISNLPKQANVENELIYEFSKFGKIKKDQDGN-VKFKVYKD-DDGEMKGDALIVYAR 99
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKFEQKGERFIAK-------QVDSKKKKKLK 399
SV +A Q++DG F DG KI + V A F+ + +R K Q S+ K LK
Sbjct: 100 HESVPIAIQMMDGYEF--DGAKIKVEV--ATFKNEKKRKYDKLTNDQESQSSSRATKLLK 155
Query: 400 K---VEEKMLGWGGRDDAKLTIPATVI------LRFMFTPAEMRADENLRSELEADVQEE 450
K VEE DD L T++ L + E+ + N E+ D+ +
Sbjct: 156 KNDTVEEVASHLSDSDDESLKRARTIVIANCIDLYYDLDGEELVEELN---EIRLDLLDG 212
Query: 451 CVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
C GPV+ +++ N QG V F+ +DAQ+C MN R+FGGR++
Sbjct: 213 CKATGPVERIQL--NARQGKATVIFEKERDAQECCRKMNKRFFGGREL 258
>gi|330938245|ref|XP_003305711.1| hypothetical protein PTT_18626 [Pyrenophora teres f. teres 0-1]
gi|311317137|gb|EFQ86178.1| hypothetical protein PTT_18626 [Pyrenophora teres f. teres 0-1]
Length = 304
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 142/280 (50%), Gaps = 32/280 (11%)
Query: 252 DNVVEEKHNGKRK------QPDKQV----EKKEANKPPDSWFELKV--NTHVYVTGLPDD 299
++ +EE G+R PD Q +K++A K P+ + K N +++T LP D
Sbjct: 5 EDFMEEMGQGRRSGSPTSDAPDNQSAVNSKKRKATKEPEVKKKAKTMENKAIWITNLPPD 64
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
T +E+ + FS+ GII + + + PRIK+Y D+ETG G+A++ Y K+ ++ A +++D
Sbjct: 65 TTFKELEDEFSRFGIIDKGADGQ-PRIKMYNDEETGKFTGNAMIVYFKKEAIVNAIKMMD 123
Query: 360 GTPFRP----DGKI---PMSVTQAKFEQKGERFIAKQVDSKKK---KKLKKVEEKMLGWG 409
RP +G I P ++ K E+ GE+ +K +K + +++ K+ W
Sbjct: 124 DYVLRPGDYSNGNIRVEPANIDHKK-EKDGEKIASKLTRKDRKASERNRQELNRKLNEWS 182
Query: 410 GRDD--AKLTIPA------TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
++ A P I++ +FT E+ D+ +++ D +E K G V +
Sbjct: 183 DNEEEVAAAFAPKKNKWAKVAIVKHVFTLKELEEDDEAILDIKEDFREAGEKYGEVTNCT 242
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + P+G++ VRF++ + A+ ++ G+ + R++ S
Sbjct: 243 LYDKEPEGIMTVRFREFESAENFMKDYQGKSYARRRLQIS 282
>gi|255732409|ref|XP_002551128.1| hypothetical protein CTRG_05426 [Candida tropicalis MYA-3404]
gi|240131414|gb|EER30974.1| hypothetical protein CTRG_05426 [Candida tropicalis MYA-3404]
Length = 296
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 109/213 (51%), Gaps = 31/213 (14%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N +Y+T LP +T++++ + F G I+ D E K P+IK+Y D E G KGDALV Y
Sbjct: 110 NHSIYITNLPSTLTIQDIEKSFGTFGKIQFDKEGK-PKIKMYRD-EKGNFKGDALVIYTL 167
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLG 407
S LA +++D + F +G+ + V A+F+ K +D KK ++
Sbjct: 168 SDSAYLAIEMMDNSLF--NGQ-TIRVEHARFDDKP-------LDKKKSEQ---------- 207
Query: 408 WGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
+ VI+ MF E+ +D+ L++++ D+ EECVKIG + +
Sbjct: 208 ---------SHFPVVIIENMFRNEELTSDKYLKTDIIEDINEECVKIGIPKVLDIQFESE 258
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
+G V V+F + A+ CI+ N R++ G Q++
Sbjct: 259 KGNVTVKFDTLEHAKICIQKFNNRYYDGLQLNV 291
>gi|354475309|ref|XP_003499872.1| PREDICTED: HIV Tat-specific factor 1 homolog [Cricetulus griseus]
Length = 714
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 114/256 (44%), Gaps = 57/256 (22%)
Query: 262 KRKQPDKQVEKK--EANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKE 317
+R+ P+ KK E K WF ++ NT+VYV+GLP D+T E
Sbjct: 74 QRETPETTDSKKRGEKRKAESGWFHVEEDRNTNVYVSGLPPDITKE-------------- 119
Query: 318 DPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
SV LA +LLD R + V AK
Sbjct: 120 --------------------------------SVELALKLLDEDEIR---GYKLHVEVAK 144
Query: 378 FEQKGERFIAKQVDSKKKKKLK-KVEEKMLGWGGRDDA---KLTIPATVILRFMFTPAEM 433
F+ KGE +K+ K K K +++K L W A ++ VI++ MF P +
Sbjct: 145 FQLKGEYDASKKKKKCKDYKKKLSLQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDF 204
Query: 434 RADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
D + +E+ D++ EC K G + + + + HP GV V F++ ++A CI+ ++GRWF
Sbjct: 205 EDDPLVLNEIREDLRVECAKFGQIKKLLLFDRHPDGVASVSFREPEEADYCIQTLDGRWF 264
Query: 494 GGRQIHASEDDGLVNH 509
GGRQI A DG ++
Sbjct: 265 GGRQITAEAWDGTTDY 280
>gi|320590038|gb|EFX02483.1| nuclear mRNA splicing factor-associated protein [Grosmannia
clavigera kw1407]
Length = 436
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 99/197 (50%), Gaps = 29/197 (14%)
Query: 181 DDGTRYKWDRGLRAWVPQED-------TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLA- 232
+DG +++D+ L+ W+P D ++Q YG + E+ P N
Sbjct: 39 EDGNEFEFDQALKRWIPHADDDENDMAVAAQLAAYGDRDTM---EDGRDPKKNGVARQGQ 95
Query: 233 ----NDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVN 288
+ G E +++ ++ N A + KR + + A KP + N
Sbjct: 96 KRKQDQAYGHEPSDTSGKRRNDASSSGGALGISKRARA-----AQTAPKP-------RQN 143
Query: 289 THVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
T VYVTGLP D TV+E+ E+FS KCG+I E+ ++ +PRIK+Y D + G KGDALV + K
Sbjct: 144 TAVYVTGLPSDATVDEVAELFSRKCGVIAEEIDSGRPRIKLYTDAD-GQFKGDALVVFFK 202
Query: 348 EPSVALATQLLDGTPFR 364
SV +A LLD T FR
Sbjct: 203 PQSVDMAIMLLDDTEFR 219
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 48/80 (60%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
V+L+ MFT E+ D +++ D++EEC K+G V +V + + GVV V+F+ A
Sbjct: 334 VVLKHMFTLEELEEDAAALLDIKEDIREECAKLGEVTNVVLFDQEAAGVVSVKFRTTAAA 393
Query: 482 QKCIELMNGRWFGGRQIHAS 501
C+ LM+GR F G+ + AS
Sbjct: 394 AACVRLMDGRAFAGQVVEAS 413
>gi|83032918|ref|XP_729249.1| splicing factor [Plasmodium yoelii yoelii 17XNL]
gi|23486504|gb|EAA20814.1| putative splicing factor [Plasmodium yoelii yoelii]
Length = 404
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 63/97 (64%), Gaps = 3/97 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VY++GLP DVT EE+ VF K GIIK D ET KP+IKIY D+ + KGDALVTY+
Sbjct: 244 NSSVYISGLPKDVTQEEINNVFKKAGIIKIDSETTKPKIKIYYDENNNV-KGDALVTYVY 302
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER 384
SV +A + D FR + I +V +A+F +K E
Sbjct: 303 TQSVDIAIKYFDKFHFRQNCVI--NVEKAQFNKKVEH 337
>gi|189189564|ref|XP_001931121.1| U2 snRNP-associated protein Uap2 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972727|gb|EDU40226.1| U2 snRNP-associated protein Uap2 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 304
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 134/260 (51%), Gaps = 26/260 (10%)
Query: 266 PDKQV----EKKEANKPPDSWFELKV--NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDP 319
PD Q +K++A P++ + KV N +++T LP D T +E+ + FS+ GII +
Sbjct: 25 PDNQSAVNSKKRKATMEPETKKKAKVMENKAIWITNLPPDTTFKELEDEFSRFGIIDKGA 84
Query: 320 ETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRP----DGKI---PMS 372
+ + RIK+Y D+ETG G+A++ Y K+ ++ A +++D RP +G I P +
Sbjct: 85 DGQ-SRIKMYNDEETGKFTGNAMIVYFKKEAIVNAIKMMDDYVLRPGDYSNGNIRVEPAN 143
Query: 373 VTQAKFEQKGERFIAKQVDSKKK---KKLKKVEEKMLGWGGRDD--AKLTIPA------T 421
+ K E+ G++ +K +K + +++ K+ W ++ A P
Sbjct: 144 IDHKK-EKDGDKIASKLTRKDRKASERNRQELNRKLNEWSDNEEEVAAAFAPKKNKWAKV 202
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
I++ +FT E+ D+ +++ D +E K G V + + + P+G++ VRF++ + A
Sbjct: 203 AIVKHVFTLKELEEDDEAILDIKEDFREAGEKYGEVTNCTLYDKEPEGIITVRFREFESA 262
Query: 482 QKCIELMNGRWFGGRQIHAS 501
+ ++ G+ + R++ S
Sbjct: 263 ENFMKDYQGKSYARRRLQIS 282
>gi|171687281|ref|XP_001908581.1| hypothetical protein [Podospora anserina S mat+]
gi|170943602|emb|CAP69254.1| unnamed protein product [Podospora anserina S mat+]
Length = 144
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 50/94 (53%), Positives = 63/94 (67%), Gaps = 6/94 (6%)
Query: 272 KKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFS-KCGIIKEDPETKKPRIKIYV 330
KK+ PP + + NT VYVTGLP D TVEE+ E+FS KCG+I E+ ++ +PRIK+Y
Sbjct: 25 KKKQRPPP----QPRQNTAVYVTGLPSDATVEEVAELFSRKCGVIAEEIDSGRPRIKMYT 80
Query: 331 DKETGMKKGDALVTYLKEPSVALATQLLDGTPFR 364
D E G KGDALV + K SV +A LLD T FR
Sbjct: 81 DGE-GKFKGDALVVFFKPQSVEMAIMLLDDTDFR 113
>gi|238882985|gb|EEQ46623.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 283
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 127/265 (47%), Gaps = 36/265 (13%)
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
KE+ +T ++ S D+ +E +RKQ D K+E N S + K N+ ++++ LP
Sbjct: 51 KEQWVNTSKRSLSEDDTNKEDIKRQRKQ-DMSKLKEELN----SLKQKKKNSSIFISNLP 105
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D E+ E FSK G I + +KPRIK+Y + E G KG+AL+ Y S LA ++
Sbjct: 106 VDSGFSEIEEAFSKYGKISVGKD-EKPRIKMYTN-EKGSFKGEALIIYSNPESALLAIEM 163
Query: 358 LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLT 417
+D T + + + V +AKF+ K E + KKK+ +
Sbjct: 164 MDNTQYNGNT---IRVEEAKFDNKKE-------NPKKKQD-------------------S 194
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
+ VI+R M E+ D +++ ++ D++EEC IG D + N VLV+F
Sbjct: 195 LSKAVIIRNMVRKEELANDIHIKQDIIDDIKEECKNIGISDIEDIVFNEENATVLVKFSK 254
Query: 478 RKDAQKCIELMNGRWFGGRQIHASE 502
++ CI+ + R++ G + E
Sbjct: 255 KESLLLCIKKFHNRYYDGLTLDVQE 279
>gi|325187011|emb|CCA21555.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 278
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 119/260 (45%), Gaps = 42/260 (16%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N +YVTGL V ++ + +K G +K RIK Y E G KGDALVT+L
Sbjct: 42 NRSIYVTGLTTYVACKQFEGLCAKIGKVK--------RIKFY-KCERGRLKGDALVTFLS 92
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ-KGERFIAKQVDSKKKKKLKKVEEKML 406
+ A + LD +P + ++ + A F Q KG + + S + L E +
Sbjct: 93 HSIMEKAIEKLDNFEIKPG--VVITASAADFAQRKGPKDVNSTEQSDTQTDLPAPEAPV- 149
Query: 407 GWGGRDDAKL----------------TIPATVILRFMFTPAEMRADENLR-SELEADVQE 449
++AKL T TVIL + P+ M+ D L +ELE D+
Sbjct: 150 --EANNEAKLDEKEQVKSIDPKADSNTPSQTVILLNAWDPSGMQDDITLYFNELEGDIHS 207
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
EC K G V+ V + + G + VRF + A+KC+++MN RWF GRQI A D
Sbjct: 208 ECSKFGKVEHVHIAAD---GSIQVRFSALECAKKCLQVMNKRWFDGRQIIAMFDPS---- 260
Query: 510 AAIRDLDAEASRLEQFGAEL 529
+ D E +LE F A +
Sbjct: 261 ---QPADEEDVKLEAFLASI 277
>gi|401838819|gb|EJT42261.1| CUS2-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 287
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 123/221 (55%), Gaps = 22/221 (9%)
Query: 291 VYVTGLPDD-VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
VY++GLP D +T +E+ + FSK G I+ + + ++P K+YV+ E G+ KGDAL+ Y KE
Sbjct: 49 VYISGLPTDRITKDELTKQFSKYGKIRINRD-EEPLCKLYVNDE-GVPKGDALIIYSKEE 106
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKK--------KKLKKV 401
SV LA ++D + F + + V +A+FE +G + K+ S K+ KK K+
Sbjct: 107 SVTLAVDMMDESIFLGN---KIRVEKAQFEDRGSENMDKKESSLKELDECEPPTKKPKQT 163
Query: 402 E----EKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPV 457
E ++ + D++ TV+ +F + +++ ++++ D+ C +IG V
Sbjct: 164 NFSRAEDVIDYND-DESLAKADRTVVFANVFNIYKEYTPDDI-TDIQDDLLGGCEEIGQV 221
Query: 458 DSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
D + V + + VL FK KDA +C ++M GR+F G+++
Sbjct: 222 DGLSVSPSKGEATVL--FKKSKDALRCCKIMTGRYFDGQKL 260
>gi|68475926|ref|XP_718036.1| potential spliceosome factor [Candida albicans SC5314]
gi|46439782|gb|EAK99096.1| potential spliceosome factor [Candida albicans SC5314]
Length = 283
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 126/265 (47%), Gaps = 36/265 (13%)
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
KE+ +T ++ S D+ +E +RKQ D K+E N S + K N+ ++++ LP
Sbjct: 51 KEQWVNTSKRSLSEDDTNKEDIKRQRKQ-DMSKLKEELN----SLKQKKKNSSIFISNLP 105
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D E+ E FSK G I + +KPRIK+Y + E G KG+AL+ Y S LA ++
Sbjct: 106 VDSGFSEIEEAFSKYGKISVGKD-EKPRIKMYTN-EKGSFKGEALIIYSNPESALLAIEM 163
Query: 358 LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLT 417
+D T + + + V +AKF+ K E + KKK+ +
Sbjct: 164 MDNTQYNGNT---IRVEEAKFDNKKE-------NPKKKQD-------------------S 194
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
+ VI+R M E+ D +++ ++ D++EEC IG D + N VLV+F
Sbjct: 195 LSKAVIIRNMVRKEELANDIHIKQDIIDDIKEECKNIGISDIEDIVFNEENATVLVKFSK 254
Query: 478 RKDAQKCIELMNGRWFGGRQIHASE 502
+ CI+ + R++ G + E
Sbjct: 255 EESLLLCIKKFHNRYYDGLTLDVQE 279
>gi|344300223|gb|EGW30563.1| hypothetical protein SPAPADRAFT_143429 [Spathaspora passalidarum
NRRL Y-27907]
Length = 304
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 108/217 (49%), Gaps = 25/217 (11%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
T +++ LP +T +E+ ++FSK G I D + PR+K+Y T G ALV Y
Sbjct: 108 TTTAAFISNLPVTITSQEIADLFSKYGSIALDKQ-GNPRVKLYTATNTHEFNGQALVIYN 166
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKML 406
K SV +A ++DGT + D I V A F +K ++ + +++S K+K
Sbjct: 167 KPESVTMAIDMMDGTTVK-DTTI--KVEPATFSEKPKKDYSNEIESIKQK---------- 213
Query: 407 GWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIG-PVDSVKVCEN 465
K PA VIL MF ++ DE L+ ++ D+ +EC K+ D + +
Sbjct: 214 -------LKSRHPAVVILTGMFRKSDY--DETLQQDIILDINDECAKLDINNDDIHSIQF 264
Query: 466 HPQG-VVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
P+ + ++F ++ A+ CI+ N RW+ G ++ AS
Sbjct: 265 MPETEEIHIKFTKQQLAEICIQCFNDRWYDGLKVAAS 301
>gi|167516256|ref|XP_001742469.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779093|gb|EDQ92707.1| predicted protein [Monosiga brevicollis MX1]
Length = 1058
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/318 (27%), Positives = 125/318 (39%), Gaps = 93/318 (29%)
Query: 287 VNTHVYVTGLP------------------------DDVTVEEMVEVFSKCGIIKEDPETK 322
VN HVYVTGLP D+T+EE K GII ED
Sbjct: 730 VNCHVYVTGLPLVGAPSTVPLGVRACSHVCCHARVKDITLEEFTAFMRKAGIINEDAH-G 788
Query: 323 KPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE--- 379
+P+IK+Y D E G KGD TYL+ SV LA QLLD T RP K+ + +A F+
Sbjct: 789 EPKIKLYTD-EHGEPKGDGKCTYLRVESVELALQLLDETEIRPGFKV--KIQRAVFQLRE 845
Query: 380 -------------------------------QKGERFIAKQVDS---------------- 392
QK R A V S
Sbjct: 846 GMTLGKKNDEDEVEEGSLEPAKKKAKKKSLGQKYARVAAVHVSSLLGDTDSTDLGEQDIC 905
Query: 393 KKKKKLKKVEEKM---LGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQE 449
L V + L W D + ++L+ MFT EM+ D + EL+ V
Sbjct: 906 ASFLTLYAVTAPVLRKLHWHETDTKRKRAVGVLVLKHMFTLEEMKEDASYIFELKDTVV- 964
Query: 450 ECVKIGPVDSVKVC--ENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLV 507
++ C +N+P GVV+VR+ + I +NGR+F G+++ A E DG
Sbjct: 965 ---------LMRRCGMQNNPDGVVMVRYFTDEPLGPAIATLNGRFFAGQKVVAEEWDGKT 1015
Query: 508 NHAAIRDLDAEASRLEQF 525
+ + + +R++Q+
Sbjct: 1016 KYKVEESEEEKEARIKQW 1033
>gi|302414782|ref|XP_003005223.1| splicing factor U2AF-associated protein [Verticillium albo-atrum
VaMs.102]
gi|261356292|gb|EEY18720.1| splicing factor U2AF-associated protein [Verticillium albo-atrum
VaMs.102]
Length = 210
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 80/143 (55%), Gaps = 9/143 (6%)
Query: 365 PDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT--- 421
P+G P+ Q + + + +R +Q K KK +K++ K+ W D A L +
Sbjct: 50 PEGAAPLPPKQQQQQSRADRDRDRQ---KIIKKTQKLDAKLADWSDDDTAALPTASASKW 106
Query: 422 ---VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDR 478
V+LR MFT AE+ D E++ DV+EEC K+G V +V + + P GVV V+F++
Sbjct: 107 DRLVVLRHMFTLAELEEDPAALLEIKEDVREECAKLGAVTNVVLFDEEPDGVVSVKFREP 166
Query: 479 KDAQKCIELMNGRWFGGRQIHAS 501
+ AQ CI +M+GR F GR I AS
Sbjct: 167 QAAQACIAMMDGRSFDGRVIEAS 189
>gi|68475791|ref|XP_718102.1| potential spliceosome factor [Candida albicans SC5314]
gi|46439857|gb|EAK99170.1| potential spliceosome factor [Candida albicans SC5314]
Length = 283
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/265 (28%), Positives = 126/265 (47%), Gaps = 36/265 (13%)
Query: 238 KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP 297
KE+ +T ++ S D+ +E +RKQ D K+E N S + K N+ ++++ LP
Sbjct: 51 KEQWVNTSKRSLSEDDTNKEDIKRQRKQ-DMSKLKEELN----SLKQKKKNSSIFISNLP 105
Query: 298 DDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQL 357
D E+ E FSK G I + +KPRIK+Y + E G KG+AL+ Y S LA ++
Sbjct: 106 VDSGFSEIEEAFSKYGKISVGKD-EKPRIKMYTN-EKGSFKGEALIIYSNPESALLAIEM 163
Query: 358 LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLT 417
+D + + + V +AKF+ K E + KKK+ +
Sbjct: 164 MDNAQYNGNT---IRVEEAKFDNKKE-------NPKKKQD-------------------S 194
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
+ VI+R M E+ D +++ ++ D++EEC IG D + N VLV+F
Sbjct: 195 LSKAVIIRNMVRKKELANDIHIKQDIIDDIKEECKNIGISDIEDIVFNEENATVLVKFSK 254
Query: 478 RKDAQKCIELMNGRWFGGRQIHASE 502
++ CI+ + R++ G + E
Sbjct: 255 KESLLLCIKKFHNRYYDGLTLDVQE 279
>gi|241956866|ref|XP_002421153.1| RNA binding protein, putative; U2 snRNA binding protein, putative
[Candida dubliniensis CD36]
gi|223644496|emb|CAX41312.1| RNA binding protein, putative [Candida dubliniensis CD36]
Length = 286
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/260 (27%), Positives = 124/260 (47%), Gaps = 35/260 (13%)
Query: 243 STEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTV 302
+T ++ S D++ +E +RKQ D K+E N S + K N+ ++++ LP D
Sbjct: 56 NTSKRSLSEDDLNKEDIKRQRKQ-DMSKLKEELN----SLKQKKKNSSIFISNLPIDSRF 110
Query: 303 EEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTP 362
E+ E F+K G I + +K RIK+Y + E G KG+AL+ Y S LA +++D T
Sbjct: 111 SEIEETFAKYGKISVGKD-EKSRIKMYTN-EKGSFKGEALIIYSNPESALLAIEMMDNTE 168
Query: 363 FRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATV 422
+ + + V +AKF+ ++ KK+ LKK K TV
Sbjct: 169 YNGN---TIRVEEAKFDNN---------NNSKKENLKKNSSK----------------TV 200
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
++R M E+ D +++ ++ D++EEC IG D + N + VRF ++
Sbjct: 201 VIRNMVRKEELANDIHIKQDIIDDIKEECKNIGVFDIEDIAFNEEDATISVRFSKQESLL 260
Query: 483 KCIELMNGRWFGGRQIHASE 502
CI+ + R++ G + E
Sbjct: 261 SCIKKFHNRYYDGLTLDVQE 280
>gi|170053610|ref|XP_001862754.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874063|gb|EDS37446.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 377
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 98/191 (51%), Gaps = 34/191 (17%)
Query: 172 PEGEDE-----------FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEE 220
PE EDE +TD DG Y WD +AW P+ D ++ I ++++
Sbjct: 195 PESEDERKYSYVDGVHHYTDKDGAIYFWDGEKKAWFPKID----DEFMAIYQLSY----- 245
Query: 221 VFPTVNVTDDLAN--DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKP 278
TD+ ++ D+ E+ T KV D E+ + + + +Q +K++A
Sbjct: 246 -----GFTDNTSSSADKKQPEERPPTPPKVEEQD----EEPSPEDQAGGRQGKKRKAPPE 296
Query: 279 PDSWFELKV--NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGM 336
P WFELK NT VYV+ +P DVT EE E+ +KCG++ +DP T K ++K+Y D +G
Sbjct: 297 PPKWFELKPEHNTKVYVSNMPPDVTDEEFAELMAKCGMVLKDPRTNKLKLKLYRDA-SGQ 355
Query: 337 KKGDALVTYLK 347
KGD L Y+K
Sbjct: 356 VKGDGLCHYIK 366
>gi|307199125|gb|EFN79835.1| HIV Tat-specific factor 1-like protein [Harpegnathos saltator]
Length = 239
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 112/247 (45%), Gaps = 47/247 (19%)
Query: 111 QSQSFSSAEQGVPSHVDDEFEKW--------QREVREAEIEAERLKNGSASDSVGGYMGR 162
Q+ ++ + D E KW + +E+ E E+ +N S+ + G G
Sbjct: 22 QTYIYTDKATNITYKFDQENNKWMIKKDNEKNSDNKESSGETEKKENASSEVNSSGIFGF 81
Query: 163 DDHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVF 222
++ P +DG+ Y WD+ AW P+ D E+
Sbjct: 82 ENDTHTYTDP--------NDGSMYIWDKEKNAWFPKVD------------------EDFM 115
Query: 223 PTVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSW 282
++ AN +L + + A + EEK K + K++A +PP SW
Sbjct: 116 AKYQMSYGFAN--TNAPQLKAEPQPPAPAQKIKEEKEQRKAEA------KRKAQEPP-SW 166
Query: 283 FEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGD 340
FE+ NT +YV+GLP D+T++E+ ++F+KCG+I D E + +IK+Y D G KGD
Sbjct: 167 FEVDEAHNTAIYVSGLPLDITMDELTQLFTKCGLIARD-EKGRDKIKLYKDA-AGEPKGD 224
Query: 341 ALVTYLK 347
AL TY+K
Sbjct: 225 ALCTYIK 231
>gi|307190188|gb|EFN74303.1| HIV Tat-specific factor 1-like protein [Camponotus floridanus]
Length = 260
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 416 LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
L VIL+ +FTP + D L E + D++ EC+K G V V + + HP+GV V F
Sbjct: 122 LKCERVVILKNLFTPEDFDRDVQLLLEYQQDIRSECLKCGDVRKVIIYDRHPEGVAQVTF 181
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA-SRLEQFGAELE 530
++ +AQ CI+L+NGRWF R+I A DG + I + DAE +RL + LE
Sbjct: 182 REPAEAQACIQLLNGRWFSQRKISAEIWDGKTKY-KITETDAEIEARLTNWDTYLE 236
>gi|365984609|ref|XP_003669137.1| hypothetical protein NDAI_0C02340 [Naumovozyma dairenensis CBS 421]
gi|343767905|emb|CCD23894.1| hypothetical protein NDAI_0C02340 [Naumovozyma dairenensis CBS 421]
Length = 387
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 123/247 (49%), Gaps = 41/247 (16%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T +Y++ LP D+T +E++ F+ GII++ E + R K+Y D + G KGDAL+ Y +
Sbjct: 131 TSIYISNLPKDITEDELIREFTHYGIIRKTSEG-EIRCKLYKDSD-GKVKGDALIVYARI 188
Query: 349 PSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAK-------QVDSKKKKKLKKV 401
SV LA ++D T R G I ++V A+F+ + K V K+ K+L+K
Sbjct: 189 ESVQLAIDMMDKTILR--GSI-INVQTAQFKSNKRKIDDKNNAEEGDHVPLKRGKQLQKY 245
Query: 402 EE---------------KMLGWGGRDDAKLTIPATVILRF--MFTPAEMRADENLRSELE 444
E + G +A TI +L +T E+ E++
Sbjct: 246 AEDGTIQDDSAKEVSDWHITKLEGHSEADRTIVLNNVLDVGAKYTEGEL-------DEIK 298
Query: 445 ADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA---S 501
D+ +EC + ++ ++ E++ G LV ++D A C L++GR+F GR+I A +
Sbjct: 299 EDIVDECSTVKGIERFELDESN--GRALVVYEDVPSALNCRSLLDGRFFDGREIIARTLT 356
Query: 502 EDDGLVN 508
E++GL+N
Sbjct: 357 EEEGLLN 363
>gi|322803089|gb|EFZ23177.1| hypothetical protein SINV_07893 [Solenopsis invicta]
Length = 240
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 67/115 (58%), Gaps = 2/115 (1%)
Query: 416 LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
L VI++ +FTP + + L E + D++ EC+K G V V +C+ HP+GV V F
Sbjct: 104 LKCERVVIIKNLFTPEDFDKEVQLLLEYQQDIRSECLKCGDVRKVVICDRHPEGVAQVTF 163
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA-SRLEQFGAEL 529
++ +AQ C++L+NGRWF R+I A DG + I + DAE +RL ++ L
Sbjct: 164 REPAEAQACVQLLNGRWFSQRKISAEIWDGKTKY-KITETDAEIEARLNKWDKYL 217
>gi|195441291|ref|XP_002068447.1| GK20417 [Drosophila willistoni]
gi|194164532|gb|EDW79433.1| GK20417 [Drosophila willistoni]
Length = 402
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 88/182 (48%), Gaps = 19/182 (10%)
Query: 173 EGEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMTFLKEEEVFPTVNVTD 229
+GE +TD DG Y WD AW P+ D + YG + T E+E
Sbjct: 224 DGERTYTDKDGAVYFWDSKKSAWFPKIDDDFMARYQMSYGFIDNTSAGEKEK-------- 275
Query: 230 DLANDEVG--KEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFEL-- 285
A+ E K K + A+ + + K++A +PP WFE+
Sbjct: 276 --ADKEAAELKRKQEELQRMTAEAEEAMAKDAAATAAGTAATTSKRKAPEPP-KWFEMDP 332
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
NT VYV+ LP D+T++E E+ KCG++ DP+T+K ++K+Y +K+ G KGD L Y
Sbjct: 333 SQNTKVYVSNLPLDITMDEFAELMGKCGMVMRDPQTQKFKLKLYAEKD-GQIKGDGLCDY 391
Query: 346 LK 347
+K
Sbjct: 392 IK 393
>gi|409044782|gb|EKM54263.1| hypothetical protein PHACADRAFT_257994, partial [Phanerochaete
carnosa HHB-10118-sp]
Length = 138
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 75/131 (57%), Gaps = 7/131 (5%)
Query: 272 KKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVD 331
K +KPP+ + NT VYVTGLP D +E++E FSKCG+I+ED + +P+IK+Y
Sbjct: 2 KSNGDKPPERRSK---NTAVYVTGLPSDADQDEIMERFSKCGLIEED-DKGEPKIKMYA- 56
Query: 332 KETGMKKGDALVTYLKEPSVALATQLLDGTPFR-PDGKIPMSVTQAKFEQKGER-FIAKQ 389
+E G G+ L+ Y KE SV LA +LD R + M V +A F K E +
Sbjct: 57 REDGSFSGEVLIVYFKEESVMLAESILDDAELRLGESSTRMRVQKADFSHKHESGSVEGS 116
Query: 390 VDSKKKKKLKK 400
VD++ +K + K
Sbjct: 117 VDARLRKTVDK 127
>gi|327284309|ref|XP_003226881.1| PREDICTED: RNA-binding protein EWS-like isoform 2 [Anolis
carolinensis]
Length = 612
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D+VT+EE+ + F +CG++K + T +P I IY+DK+TG KGDA V Y
Sbjct: 307 NSAVYVQGLTDNVTLEELTDFFKQCGVVKMNKRTGQPMITIYLDKDTGKPKGDATVCYDD 366
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + LDG F+
Sbjct: 367 PPTAKAAVEWLDGKDFQ 383
>gi|327284311|ref|XP_003226882.1| PREDICTED: RNA-binding protein EWS-like isoform 3 [Anolis
carolinensis]
Length = 667
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D+VT+EE+ + F +CG++K + T +P I IY+DK+TG KGDA V Y
Sbjct: 362 NSAVYVQGLTDNVTLEELTDFFKQCGVVKMNKRTGQPMITIYLDKDTGKPKGDATVCYDD 421
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + LDG F+
Sbjct: 422 PPTAKAAVEWLDGKDFQ 438
>gi|327284307|ref|XP_003226880.1| PREDICTED: RNA-binding protein EWS-like isoform 1 [Anolis
carolinensis]
Length = 672
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D+VT+EE+ + F +CG++K + T +P I IY+DK+TG KGDA V Y
Sbjct: 367 NSAVYVQGLTDNVTLEELTDFFKQCGVVKMNKRTGQPMITIYLDKDTGKPKGDATVCYDD 426
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + LDG F+
Sbjct: 427 PPTAKAAVEWLDGKDFQ 443
>gi|242220458|ref|XP_002475995.1| predicted protein [Postia placenta Mad-698-R]
gi|220724784|gb|EED78804.1| predicted protein [Postia placenta Mad-698-R]
Length = 192
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 80/151 (52%), Gaps = 20/151 (13%)
Query: 371 MSVTQAKFEQK-------GERFIAKQVDSKKK--KKLKKVEEKMLGWGGRDDAKLTI--- 418
M V +A+F K GE K V KKK K++ K+++K+ W D I
Sbjct: 1 MRVQRAEFGHKHEKEGGQGEMKPRKTVSDKKKATKRIGKMQKKLGEWDDEDGFGPAITEE 60
Query: 419 --------PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV 470
V+L+ MFT E+ D +L +L+ DV++EC +G V +V + + P+GV
Sbjct: 61 DKVPLVNKARVVVLKHMFTLQELEEDASLLLDLKEDVRDECSSLGEVTNVVLYDKEPEGV 120
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ V+F+D AQ C+ MNGR+F GR+I AS
Sbjct: 121 MTVKFRDLLGAQACVIKMNGRFFAGRRIEAS 151
>gi|444705508|gb|ELW46932.1| HIV Tat-specific factor 1 like protein [Tupaia chinensis]
Length = 623
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 73/141 (51%), Gaps = 4/141 (2%)
Query: 373 VTQAKFEQKGERFIAKQVDSKKKKKLKKVEE-KMLGWGGRDDA---KLTIPATVILRFMF 428
V E+ +R + D +K+ + +K E K L W A ++ VI++ MF
Sbjct: 92 VDSRTVEEPPQRIPPEPADPRKRGEKRKNESGKQLDWRPERRAGPSRMRHERVVIIKNMF 151
Query: 429 TPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELM 488
P + D + +E+ D++ EC K G + + + + HP GV V F+D +A CI+ +
Sbjct: 152 HPMDFEDDPLVLNEIREDLRVECSKFGQIKKLLLFDRHPDGVASVSFRDPDEADLCIQTL 211
Query: 489 NGRWFGGRQIHASEDDGLVNH 509
+GRWFGGRQI A DG ++
Sbjct: 212 DGRWFGGRQITAQAWDGTTDY 232
>gi|410075623|ref|XP_003955394.1| hypothetical protein KAFR_0A08250 [Kazachstania africana CBS 2517]
gi|372461976|emb|CCF56259.1| hypothetical protein KAFR_0A08250 [Kazachstania africana CBS 2517]
Length = 272
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 129/257 (50%), Gaps = 35/257 (13%)
Query: 262 KRKQPDKQVEKKEANKPPDSWFELKVNTHVYVTGLP-DDVTVEEMVEVFSKCGIIKEDPE 320
+R+Q KQ K +++P T +Y++ L + TV+ +++ FS G IK+D +
Sbjct: 22 RRRQASKQ---KRSSEP----------TGLYISNLSLESTTVDSLIDEFSTFGKIKKDHQ 68
Query: 321 TKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ 380
R K+Y+D E G KGDAL+ Y + SV LA L++ + V +A+F
Sbjct: 69 NN-YRCKLYLD-ENGRFKGDALIIYERAESVQLAIDLINDAEL---NGCKIKVERAEFNN 123
Query: 381 KGERFIAKQVDSKK--KKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAE-MRADE 437
I + DS+ K+++ +V ++ + R + T++L + E + +DE
Sbjct: 124 DKRERIHDKDDSEPPLKRRVIEVSQEKIDTSSRKER------TIVLSNILDIYEDIESDE 177
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+L+ D+ E C G V ++ V N + V+ FK +KDA +C + MN R+F GR+
Sbjct: 178 --LHDLKQDILEGCESFGEVLNITVDSNRGEAHVV--FKRQKDALQCCKKMNNRFFDGRK 233
Query: 498 IHA---SEDDGLVNHAA 511
+ A SE+D + + A
Sbjct: 234 LIAFMLSEEDDVASSGA 250
>gi|323346960|gb|EGA81238.1| Cus2p [Saccharomyces cerevisiae Lalvin QA23]
Length = 215
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 105/188 (55%), Gaps = 18/188 (9%)
Query: 324 PRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-G 382
P K+YV+ + G KGDAL+TY KE SV LA ++++ + F GK + V +A+F+ K G
Sbjct: 10 PLCKLYVN-DKGAFKGDALITYSKEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEG 65
Query: 383 ERFIAKQVDSKK-------KKKLKKVEEKMLG--WGGRDDAKLT-IPATVILRFMFTPAE 432
+ K+ D K+ K+LKK + + G DD L TVI +F +
Sbjct: 66 DNMHGKENDLKEFNGPEPPIKRLKKAKSEGEGEVIDYNDDESLAKADRTVIFANVFNIYK 125
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRW 492
++++ ++++ D+ E C +IG VDS+ V N + V+ FK+ K A +C ++M GR+
Sbjct: 126 SYTNDDI-NDIQEDLLEGCEEIGQVDSISVSPNKGEATVV--FKNNKVALQCCKIMTGRY 182
Query: 493 FGGRQIHA 500
F G+++ A
Sbjct: 183 FDGQKLLA 190
>gi|307190186|gb|EFN74301.1| HIV Tat-specific factor 1-like protein [Camponotus floridanus]
Length = 251
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/246 (28%), Positives = 110/246 (44%), Gaps = 45/246 (18%)
Query: 111 QSQSFSSAEQGVPSHVDDEFEKW-------QREVREAEIEAERLKNGSASDSVGGYMGRD 163
Q+ ++ + D E KW + +E+ E+ +N S G G +
Sbjct: 26 QTYIYTDKATNITYRFDQENTKWVVKEDNVSSDNKESSNRKEKKENSGPEVSSAGTFGFE 85
Query: 164 DHDGVPASPEGEDEFTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFP 223
+ P +DG+ Y WD+ AW P+ D D +M++ + P
Sbjct: 86 NDTHTYTDP--------NDGSVYFWDKEKSAWFPKVD----QDFMARYQMSYGFADTSAP 133
Query: 224 TVNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWF 283
N ++ + K K EEK + + K++A +PP +WF
Sbjct: 134 QSNPSEPNQPTSLQKAK---------------EEKEQRRTEA------KRKAQEPP-TWF 171
Query: 284 EL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDA 341
E+ NT +YV+GLP ++T+EE+ E+F+KCG+I D E K +IK+Y D G KGDA
Sbjct: 172 EVDEAHNTAIYVSGLPLNITMEELTELFTKCGLIARD-ERGKDKIKLYKDV-NGEPKGDA 229
Query: 342 LVTYLK 347
L TY+K
Sbjct: 230 LCTYIK 235
>gi|440299360|gb|ELP91928.1| hypothetical protein EIN_399680 [Entamoeba invadens IP1]
Length = 209
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 111/234 (47%), Gaps = 31/234 (13%)
Query: 300 VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLD 359
+T ++ F CGIIK+ + K+ KE + + +L E S A + D
Sbjct: 1 MTSSDLASYFRHCGIIKKISPDEFDITKV---KECE-NSNEWNLYFLHEESAQKAIEFYD 56
Query: 360 GTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIP 419
+ D KI +S + + E++ K+ D W DD+K+
Sbjct: 57 ESEITRDCKIGISGLFSVQDTGKEQYDQKREDE---------------W---DDSKI--- 95
Query: 420 ATVILRFMF---TPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
VI++ MF T ++L+++++++V++ C GPVD V V +PQGVV+++F
Sbjct: 96 VHVIIKNMFDLTTEKPQSFFDDLKNDIKSEVRKSC---GPVDRVIVFSTNPQGVVMLKFY 152
Query: 477 DRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
A+KCI+LMNGRWF RQ+ DG N+ + + R++++ +E
Sbjct: 153 TNDVAEKCIKLMNGRWFDQRQLTCEYYDGFTNYKIEESKEQKEKRIKEWSEYVE 206
>gi|291398631|ref|XP_002715944.1| PREDICTED: Ewing sarcoma breakpoint region 1-like isoform 1
[Oryctolagus cuniculus]
Length = 650
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VTV+++ + F +CG++K + T +P I +Y+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTVDDLADFFKQCGVVKMNKRTGQPMIHVYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|291398633|ref|XP_002715945.1| PREDICTED: Ewing sarcoma breakpoint region 1-like isoform 2
[Oryctolagus cuniculus]
Length = 594
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VTV+++ + F +CG++K + T +P I +Y+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDNVTVDDLADFFKQCGVVKMNKRTGQPMIHVYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|441620097|ref|XP_003258090.2| PREDICTED: RNA-binding protein EWS isoform 2 [Nomascus leucogenys]
Length = 692
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VTV+++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 396 NSAIYVQGLNDSVTVDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 455
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 456 PPTAKAAVEWFDGKDFQ 472
>gi|73994891|ref|XP_864944.1| PREDICTED: RNA-binding protein EWS isoform 9 [Canis lupus
familiaris]
Length = 600
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|12852168|dbj|BAB29301.1| unnamed protein product [Mus musculus]
Length = 333
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 58/90 (64%), Gaps = 1/90 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 37 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 96
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
P+ A + DG F+ K+ +S+ + K
Sbjct: 97 PPTAKAAVEWFDGKDFQ-GSKLKVSLARKK 125
>gi|148708571|gb|EDL40518.1| mCG14699, isoform CRA_b [Mus musculus]
Length = 626
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 330 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 389
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 390 PPTAKAAVEWFDGKDFQ 406
>gi|395833785|ref|XP_003789900.1| PREDICTED: RNA-binding protein EWS isoform 3 [Otolemur garnettii]
Length = 600
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|344256186|gb|EGW12290.1| RNA-binding protein EWS [Cricetulus griseus]
Length = 656
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|392347121|ref|XP_003749733.1| PREDICTED: RNA-binding protein EWS-like isoform 3 [Rattus
norvegicus]
Length = 600
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAVYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|354486390|ref|XP_003505364.1| PREDICTED: RNA-binding protein EWS [Cricetulus griseus]
Length = 655
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|355784882|gb|EHH65733.1| hypothetical protein EGM_02560, partial [Macaca fascicularis]
Length = 657
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 361 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 420
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 421 PPTAKAAVEWFDGKDFQ 437
>gi|488513|emb|CAA55815.1| EWS [Mus musculus]
Length = 655
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|355563559|gb|EHH20121.1| hypothetical protein EGK_02912, partial [Macaca mulatta]
Length = 657
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 361 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 420
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 421 PPTAKAAVEWFDGKDFQ 437
>gi|395517080|ref|XP_003762710.1| PREDICTED: RNA-binding protein EWS-like [Sarcophilus harrisii]
Length = 913
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL ++VTV+E+ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 617 NSAIYVQGLNENVTVDELADFFKQCGVVKMNKRTGQPMINIYLDKETGKPKGDATVSYDD 676
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 677 PPTAKAAVEWFDGKDFQ 693
>gi|356461003|ref|NP_001238977.1| RNA-binding protein EWS [Sus scrofa]
gi|351738726|gb|AEQ61463.1| EWS [Sus scrofa]
Length = 656
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|73994917|ref|XP_865186.1| PREDICTED: RNA-binding protein EWS isoform 21 [Canis lupus
familiaris]
Length = 656
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|443896497|dbj|GAC73841.1| transcription elongation factor TAT-SF1 [Pseudozyma antarctica
T-34]
Length = 448
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
T +YV+GLP D T +E+ VF + G++ ED + KPRIK+Y D +TGM +G+ALV Y K
Sbjct: 141 TSLYVSGLPLDATSDEIARVFCRYGVLLED-DQGKPRIKMYYDDKTGMFRGEALVVYFKP 199
Query: 349 PSVALATQLLDGTPFR 364
SV LA +LD T R
Sbjct: 200 ESVELAINMLDETNMR 215
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 59/106 (55%), Gaps = 18/106 (16%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MFT E+ D L +L+ DV+EEC KIG V +V + + P+G++ VRF
Sbjct: 333 VVLTKMFTLFELEQDPTLLLDLKDDVREECTDKIGGVTNVVLWDLEPEGIMTVRFSSHDQ 392
Query: 481 AQKCIELMNGRWFGGRQIHA-----------------SEDDGLVNH 509
AQKC+++M+GR+F R+I+A +DD L NH
Sbjct: 393 AQKCVQMMDGRFFAQRRINAFILNAKPRFRKSTTHPDDDDDNLANH 438
>gi|46399229|gb|AAH68226.1| Ewing sarcoma breakpoint region 1 [Mus musculus]
gi|74142191|dbj|BAE31862.1| unnamed protein product [Mus musculus]
gi|74219411|dbj|BAE29484.1| unnamed protein product [Mus musculus]
Length = 656
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|189303559|ref|NP_001020803.2| RNA-binding protein EWS [Rattus norvegicus]
gi|149047600|gb|EDM00270.1| rCG36110, isoform CRA_a [Rattus norvegicus]
gi|171847425|gb|AAI62004.1| Ewsr1 protein [Rattus norvegicus]
Length = 655
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAVYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|88853581|ref|NP_031994.2| RNA-binding protein EWS [Mus musculus]
gi|341940677|sp|Q61545.2|EWS_MOUSE RecName: Full=RNA-binding protein EWS
gi|12859677|dbj|BAB31732.1| unnamed protein product [Mus musculus]
gi|74196009|dbj|BAE30560.1| unnamed protein product [Mus musculus]
gi|148708572|gb|EDL40519.1| mCG14699, isoform CRA_c [Mus musculus]
Length = 655
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|16552153|dbj|BAB71252.1| unnamed protein product [Homo sapiens]
Length = 600
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|431920874|gb|ELK18645.1| RNA-binding protein EWS [Pteropus alecto]
Length = 656
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|392347117|ref|XP_003749731.1| PREDICTED: RNA-binding protein EWS-like isoform 1 [Rattus
norvegicus]
Length = 656
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAVYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|74151104|dbj|BAE27676.1| unnamed protein product [Mus musculus]
Length = 655
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|345313944|ref|XP_001516464.2| PREDICTED: RNA-binding protein EWS [Ornithorhynchus anatinus]
Length = 709
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
NT +YV GL ++VT+E++ + F +CG++K + T +P I IY+DKETG KGDA V Y
Sbjct: 414 NTAIYVQGLNENVTLEDLADFFKQCGVLKMNKRTGQPLINIYLDKETGKPKGDATVAYDD 473
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 474 SPTAKAAVEWFDGKDFQ 490
>gi|68534160|gb|AAH98822.1| Ewing sarcoma breakpoint region 1 [Rattus norvegicus]
Length = 317
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 57/90 (63%), Gaps = 1/90 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 21 NSAVYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 80
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
P+ A + DG F+ K+ +S+ + K
Sbjct: 81 PPTAKAAVEWFDGKDFQ-GSKLKVSLARKK 109
>gi|395833781|ref|XP_003789898.1| PREDICTED: RNA-binding protein EWS isoform 1 [Otolemur garnettii]
Length = 656
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|332025819|gb|EGI65976.1| HIV Tat-specific factor 1-like protein [Acromyrmex echinatior]
Length = 165
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 68/120 (56%), Gaps = 5/120 (4%)
Query: 416 LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
L VI++ +F P + + L E + D++ EC+K G V V + + HP+GV V F
Sbjct: 15 LKCERVVIIKNLFAPEDFDNEVQLLLEYQQDIRSECLKCGDVRKVIIYDRHPEGVAQVTF 74
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH----AAIRDLDAEASRLEQFGAELEA 531
++ +AQ C++L+NGRWF R+I A DG + AA+ ++ + R+ + AE+EA
Sbjct: 75 REPAEAQACVQLLNGRWFSQRKISAEIWDGKTKYNERDAAV-NVSNVSFRITETDAEIEA 133
>gi|345791017|ref|XP_003433443.1| PREDICTED: RNA-binding protein EWS [Canis lupus familiaris]
Length = 661
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 425 PPTAKAAVEWFDGKDFQ 441
>gi|392347119|ref|XP_003749732.1| PREDICTED: RNA-binding protein EWS-like isoform 2 [Rattus
norvegicus]
Length = 661
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAVYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 425 PPTAKAAVEWFDGKDFQ 441
>gi|307199123|gb|EFN79833.1| HIV Tat-specific factor 1-like protein [Harpegnathos saltator]
Length = 177
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 1/103 (0%)
Query: 416 LTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRF 475
L V+++ +F P + + L E + D++ EC+K G V V + + HP+GV V F
Sbjct: 41 LKCERVVVIKNLFAPEDFDKEVQLLLEYQQDIRSECLKCGDVRKVVIYDRHPEGVAQVTF 100
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAE 518
++ +AQ C++L+NGRWF R+I A DG + I + DAE
Sbjct: 101 REPAEAQACVQLLNGRWFSQRKISAEIWDGKTKY-KITETDAE 142
>gi|410977064|ref|XP_003994932.1| PREDICTED: RNA-binding protein EWS [Felis catus]
Length = 701
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 405 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 464
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 465 PPTAKAAVEWFDGKDFQ 481
>gi|402883897|ref|XP_003905432.1| PREDICTED: RNA-binding protein EWS [Papio anubis]
gi|403295160|ref|XP_003938521.1| PREDICTED: RNA-binding protein EWS [Saimiri boliviensis
boliviensis]
gi|67971838|dbj|BAE02261.1| unnamed protein product [Macaca fascicularis]
gi|380783979|gb|AFE63865.1| RNA-binding protein EWS isoform 3 [Macaca mulatta]
gi|384940572|gb|AFI33891.1| RNA-binding protein EWS isoform 3 [Macaca mulatta]
Length = 655
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|395833783|ref|XP_003789899.1| PREDICTED: RNA-binding protein EWS isoform 2 [Otolemur garnettii]
Length = 661
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 425 PPTAKAAVEWFDGKDFQ 441
>gi|388452546|ref|NP_001253170.1| RNA-binding protein EWS [Macaca mulatta]
gi|380783977|gb|AFE63864.1| RNA-binding protein EWS isoform 1 [Macaca mulatta]
Length = 661
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 425 PPTAKAAVEWFDGKDFQ 441
>gi|253970504|ref|NP_001156758.1| RNA-binding protein EWS isoform 4 [Homo sapiens]
gi|119580190|gb|EAW59786.1| Ewing sarcoma breakpoint region 1, isoform CRA_g [Homo sapiens]
Length = 600
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|338727546|ref|XP_001495353.3| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein EWS [Equus
caballus]
Length = 204
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 21 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 80
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 81 PPTAKAAVEWFDGKDFQ 97
>gi|395753179|ref|XP_003779556.1| PREDICTED: RNA-binding protein EWS isoform 3 [Pongo abelii]
Length = 600
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 364 PPTAKAAVEWFDGKDFQ 380
>gi|344247059|gb|EGW03163.1| HIV Tat-specific factor 1-like [Cricetulus griseus]
Length = 528
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F++ ++A
Sbjct: 7 VIIKNMFHPMDFEDDPLVLNEIREDLRVECAKFGQIKKLLLFDRHPDGVASVSFREPEEA 66
Query: 482 QKCIELMNGRWFGGRQIHASEDDGLVNH 509
CI+ ++GRWFGGRQI A DG ++
Sbjct: 67 DYCIQTLDGRWFGGRQITAEAWDGTTDY 94
>gi|410302980|gb|JAA30090.1| Ewing sarcoma breakpoint region 1 [Pan troglodytes]
Length = 599
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 303 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 362
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 363 PPTAKAAVEWFDGKDFQ 379
>gi|326930061|ref|XP_003211171.1| PREDICTED: RNA-binding protein EWS-like [Meleagris gallopavo]
Length = 701
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 58/90 (64%), Gaps = 1/90 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D+VT+E++ + F +CG++K + T +P I +Y+DKETG KGDA V+Y
Sbjct: 395 NSSVYVQGLNDNVTLEDLADFFKQCGVVKMNKRTGQPMIHLYIDKETGKPKGDATVSYDD 454
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
+ A + DG F+ K+ +S+T+ K
Sbjct: 455 PSTAKTAVEWFDGKDFQ-GSKLKVSLTRKK 483
>gi|159163838|pdb|2CPE|A Chain A, Solution Structure Of The Rna Recognition Motif Of Ewing
Sarcoma(Ews) Protein
Length = 113
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 15 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 74
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 75 PPTAKAAVEWFDGKDFQ 91
>gi|344295001|ref|XP_003419203.1| PREDICTED: RNA-binding protein EWS [Loxodonta africana]
Length = 659
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 363 NSAIYVQGLNDSVTLDDLTDFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 422
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 423 PPTAKAAVEWFDGKDFQ 439
>gi|159164131|pdb|2DIT|A Chain A, Solution Structure Of The Rrm_1 Domain Of Hiv Tat Specific
Factor 1 Variant
Length = 112
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 55/91 (60%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
VI++ MF P + D + +E+ D++ EC K G + + + + HP GV V F+D ++
Sbjct: 17 VVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRDPEE 76
Query: 481 AQKCIELMNGRWFGGRQIHASEDDGLVNHAA 511
A CI+ ++GRWFGGRQI A DG ++ +
Sbjct: 77 ADYCIQTLDGRWFGGRQITAQAWDGTTDYQS 107
>gi|4885225|ref|NP_005234.1| RNA-binding protein EWS isoform 2 [Homo sapiens]
gi|544261|sp|Q01844.1|EWS_HUMAN RecName: Full=RNA-binding protein EWS; AltName: Full=EWS oncogene;
AltName: Full=Ewing sarcoma breakpoint region 1 protein
gi|31280|emb|CAA47350.1| RNA binding protein [Homo sapiens]
gi|825654|emb|CAA51489.1| EWS [Homo sapiens]
gi|3164078|emb|CAA69177.1| RNA binding protein [Homo sapiens]
gi|13435963|gb|AAH04817.1| Ewing sarcoma breakpoint region 1 [Homo sapiens]
gi|119580189|gb|EAW59785.1| Ewing sarcoma breakpoint region 1, isoform CRA_f [Homo sapiens]
gi|381963|prf||1818357A EWS gene
Length = 656
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|297708547|ref|XP_002831024.1| PREDICTED: RNA-binding protein EWS isoform 1 [Pongo abelii]
Length = 656
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|343961051|dbj|BAK62115.1| RNA-binding protein EWS [Pan troglodytes]
Length = 656
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|30584431|gb|AAP36468.1| Homo sapiens Ewing sarcoma breakpoint region 1 [synthetic
construct]
gi|61371501|gb|AAX43678.1| Ewing sarcoma breakpoint region 1 [synthetic construct]
Length = 657
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|410055755|ref|XP_001173113.2| PREDICTED: RNA-binding protein EWS isoform 4 [Pan troglodytes]
gi|426393983|ref|XP_004063283.1| PREDICTED: RNA-binding protein EWS [Gorilla gorilla gorilla]
gi|410218350|gb|JAA06394.1| Ewing sarcoma breakpoint region 1 [Pan troglodytes]
gi|410263668|gb|JAA19800.1| Ewing sarcoma breakpoint region 1 [Pan troglodytes]
gi|410302978|gb|JAA30089.1| Ewing sarcoma breakpoint region 1 [Pan troglodytes]
gi|410352103|gb|JAA42655.1| Ewing sarcoma breakpoint region 1 [Pan troglodytes]
Length = 655
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|253970502|ref|NP_001156757.1| RNA-binding protein EWS isoform 3 [Homo sapiens]
gi|15029675|gb|AAH11048.1| Ewing sarcoma breakpoint region 1 [Homo sapiens]
gi|47678511|emb|CAG30376.1| EWSR1 [Homo sapiens]
gi|48734727|gb|AAH72442.1| Ewing sarcoma breakpoint region 1 [Homo sapiens]
gi|109451314|emb|CAK54518.1| EWSR1 [synthetic construct]
gi|109451892|emb|CAK54817.1| EWSR1 [synthetic construct]
gi|119580185|gb|EAW59781.1| Ewing sarcoma breakpoint region 1, isoform CRA_b [Homo sapiens]
gi|208967807|dbj|BAG72549.1| Ewing sarcoma breakpoint region 1 [synthetic construct]
Length = 655
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 419 PPTAKAAVEWFDGKDFQ 435
>gi|297708549|ref|XP_002831025.1| PREDICTED: RNA-binding protein EWS isoform 2 [Pongo abelii]
gi|397481620|ref|XP_003812038.1| PREDICTED: RNA-binding protein EWS [Pan paniscus]
Length = 661
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 425 PPTAKAAVEWFDGKDFQ 441
>gi|253970500|ref|NP_053733.2| RNA-binding protein EWS isoform 1 [Homo sapiens]
gi|16551674|dbj|BAB71145.1| unnamed protein product [Homo sapiens]
gi|119580191|gb|EAW59787.1| Ewing sarcoma breakpoint region 1, isoform CRA_h [Homo sapiens]
Length = 661
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 425 PPTAKAAVEWFDGKDFQ 441
>gi|119580187|gb|EAW59783.1| Ewing sarcoma breakpoint region 1, isoform CRA_d [Homo sapiens]
Length = 451
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 155 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 214
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 215 PPTAKAAVEWFDGKDFQ 231
>gi|334327485|ref|XP_003340905.1| PREDICTED: RNA-binding protein EWS [Monodelphis domestica]
Length = 603
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 49/77 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL + VT +E+ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 307 NSAIYVQGLNESVTADELADFFKQCGVVKMNKRTGQPMINIYLDKETGKPKGDATVSYDD 366
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 367 PPTAKAAVEWFDGKDFQ 383
>gi|348585263|ref|XP_003478391.1| PREDICTED: RNA-binding protein EWS-like isoform 3 [Cavia porcellus]
Length = 600
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 304 NSAIYVQGLNDNVTLDDLTDFFKQCGVVKINKRTGQPMIHIYLDKETGKPKGDATVSYED 363
Query: 348 EPSVALATQLLDGTPFR 364
P+ A DG F+
Sbjct: 364 PPTAKAAVDWFDGKDFQ 380
>gi|119580188|gb|EAW59784.1| Ewing sarcoma breakpoint region 1, isoform CRA_e [Homo sapiens]
Length = 583
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 287 NSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 346
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 347 PPTAKAAVEWFDGKDFQ 363
>gi|126324897|ref|XP_001364505.1| PREDICTED: RNA-binding protein EWS isoform 2 [Monodelphis
domestica]
Length = 659
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 49/77 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL + VT +E+ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 363 NSAIYVQGLNESVTADELADFFKQCGVVKMNKRTGQPMINIYLDKETGKPKGDATVSYDD 422
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 423 PPTAKAAVEWFDGKDFQ 439
>gi|12838145|dbj|BAB24100.1| unnamed protein product [Mus musculus]
Length = 195
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 88/175 (50%), Gaps = 31/175 (17%)
Query: 182 DGTRYKWDRGLRAWVPQ--ED-TSSQNDGYGIEEMTFLKEEEVFPTVNVTDDLANDEVGK 238
D T Y+WD +AW P+ ED ++ YG F + T NV D AN + +
Sbjct: 47 DDTPYEWDLDKKAWFPKITEDFIATYQANYG-----FSSDGASSSTANVQD--ANTKAVE 99
Query: 239 EKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGL 296
E V E + KRK ++ E WF ++ NT+VYV+GL
Sbjct: 100 E----------PPQKEVPETPDSKRKGEKRKAES--------GWFHVEEDRNTNVYVSGL 141
Query: 297 PDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSV 351
P D+TV+E +++ SK GII DP+T++ ++K+Y D + G KGD L YLK SV
Sbjct: 142 PPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDDQ-GNLKGDGLCCYLKVFSV 195
>gi|426247850|ref|XP_004017689.1| PREDICTED: RNA-binding protein EWS [Ovis aries]
Length = 717
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++V+ F +CG +K + T +P I IY+DKETG KGDA V+Y
Sbjct: 421 NSAIYVQGLSDSVTLDDLVDFFKQCGGVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 480
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 481 PPTAKTAVEWFDGKDFQ 497
>gi|126303391|ref|XP_001372987.1| PREDICTED: RNA-binding protein EWS-like isoform 1 [Monodelphis
domestica]
Length = 622
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 49/77 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL + VT +E+ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 352 NSAIYVQGLNESVTADELADFFKQCGVVKMNKRTGQPVINIYLDKETGKPKGDATVSYDD 411
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 412 PPTAKAAVEWFDGKDFQ 428
>gi|126324895|ref|XP_001364430.1| PREDICTED: RNA-binding protein EWS isoform 1 [Monodelphis
domestica]
Length = 664
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 49/77 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL + VT +E+ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 368 NSAIYVQGLNESVTADELADFFKQCGVVKMNKRTGQPMINIYLDKETGKPKGDATVSYDD 427
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 428 PPTAKAAVEWFDGKDFQ 444
>gi|348585259|ref|XP_003478389.1| PREDICTED: RNA-binding protein EWS-like isoform 1 [Cavia porcellus]
Length = 655
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDNVTLDDLTDFFKQCGVVKINKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
P+ A DG F+
Sbjct: 419 PPTAKAAVDWFDGKDFQ 435
>gi|86129438|ref|NP_001034356.1| RNA-binding protein EWS [Gallus gallus]
gi|53127284|emb|CAG31025.1| hypothetical protein RCJMB04_1k19 [Gallus gallus]
Length = 689
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 61/91 (67%), Gaps = 3/91 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ VYV GL D+VT+E++ + F +CG++K + T +P I +Y+DKETG KGDA V+Y
Sbjct: 361 NSSVYVQGLNDNVTLEDLADFFKQCGVVKMNKRTGQPMIHLYIDKETGKPKGDATVSY-D 419
Query: 348 EPSVA-LATQLLDGTPFRPDGKIPMSVTQAK 377
+PS A A + DG F+ K+ +S+T+ K
Sbjct: 420 DPSTAKTAVEWFDGKDFQ-GSKLKVSLTRKK 449
>gi|83318265|gb|AAI08816.1| Unknown (protein for MGC:132193) [Xenopus laevis]
Length = 671
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 46/73 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV GL D+VTVEE+V+ F CG +K + T +P + I+ DKETG KGD V++
Sbjct: 390 NNTVYVQGLNDNVTVEEIVDFFKHCGDVKINKRTGQPLVNIFTDKETGKPKGDGTVSFED 449
Query: 348 EPSVALATQLLDG 360
PS A +L DG
Sbjct: 450 PPSAKTAVELCDG 462
>gi|348585261|ref|XP_003478390.1| PREDICTED: RNA-binding protein EWS-like isoform 2 [Cavia porcellus]
Length = 661
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 365 NSAIYVQGLNDNVTLDDLTDFFKQCGVVKINKRTGQPMIHIYLDKETGKPKGDATVSYED 424
Query: 348 EPSVALATQLLDGTPFR 364
P+ A DG F+
Sbjct: 425 PPTAKAAVDWFDGKDFQ 441
>gi|301759593|ref|XP_002915634.1| PREDICTED: RNA-binding protein EWS-like [Ailuropoda melanoleuca]
Length = 656
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG+++ + T +P I IY+DKETG KGDA V+Y
Sbjct: 360 NSAIYVQGLNDSVTLDDLADFFKQCGVVRMNKRTGQPMIHIYLDKETGKPKGDATVSYED 419
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 420 PPTAKAAVEWFDGKDFQ 436
>gi|432105129|gb|ELK31498.1| RNA-binding protein EWS, partial [Myotis davidii]
Length = 432
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 48/73 (65%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 337 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 396
Query: 348 EPSVALATQLLDG 360
P+ A DG
Sbjct: 397 PPTAKAAVDWFDG 409
>gi|351696166|gb|EHA99084.1| RNA-binding protein EWS [Heterocephalus glaber]
Length = 641
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 49/73 (67%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 372 NSAIYVQGLNDNVTLDDLADFFKQCGVVKINKRTGQPMIHIYLDKETGKPKGDATVSYED 431
Query: 348 EPSVALATQLLDG 360
P+ A + DG
Sbjct: 432 PPTAKAAVEWFDG 444
>gi|281350152|gb|EFB25736.1| hypothetical protein PANDA_003651 [Ailuropoda melanoleuca]
Length = 520
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++ + F +CG+++ + T +P I IY+DKETG KGDA V+Y
Sbjct: 333 NSAIYVQGLNDSVTLDDLADFFKQCGVVRMNKRTGQPMIHIYLDKETGKPKGDATVSYED 392
Query: 348 EPSVALATQLLDGTPFR 364
P+ A + DG F+
Sbjct: 393 PPTAKAAVEWFDGKDFQ 409
>gi|148235537|ref|NP_001089902.1| uncharacterized protein LOC734969 [Xenopus laevis]
gi|83318255|gb|AAI08750.1| MGC132026 protein [Xenopus laevis]
Length = 667
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 46/73 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV GL +DVTVEE+V+ F CG +K + T +P + I+ DKETG KGD V++
Sbjct: 388 NNTVYVQGLNEDVTVEEIVDFFKHCGDVKINKRTGQPLVNIFTDKETGKPKGDGTVSFED 447
Query: 348 EPSVALATQLLDG 360
PS A +L DG
Sbjct: 448 PPSAKTAVELCDG 460
>gi|67677974|gb|AAH97641.1| LOC733270 protein [Xenopus laevis]
Length = 666
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 46/73 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV GL +DVTVEE+V+ F CG +K + T +P + I+ DKETG KGD V++
Sbjct: 384 NNTVYVQGLNEDVTVEEIVDFFKHCGDVKINKRTGQPLVNIFTDKETGKPKGDGTVSFED 443
Query: 348 EPSVALATQLLDG 360
PS A +L DG
Sbjct: 444 PPSAKTAVELCDG 456
>gi|366988311|ref|XP_003673922.1| hypothetical protein NCAS_0A09830 [Naumovozyma castellii CBS 4309]
gi|342299785|emb|CCC67541.1| hypothetical protein NCAS_0A09830 [Naumovozyma castellii CBS 4309]
Length = 283
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/238 (31%), Positives = 113/238 (47%), Gaps = 38/238 (15%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV+ + + +++ +FSK G I+ E + K+Y D E G KGDALV Y K S
Sbjct: 43 IYVSNISKHTSQLDLITLFSKYGKIRRTRE-EALNCKMYQD-EKGNFKGDALVVYEKPES 100
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKK----KKLKKVEEKML 406
V LA ++DGT F K + V +A+FE + VDS + KK+K E+ +
Sbjct: 101 VQLAIDMVDGTIF---NKSTIKVERAQFEDNKRKL--DDVDSAAEDPPNKKIKS-EDPVT 154
Query: 407 GWGGRDDAKLTIP-----------------ATVILRFMFTPAEMRADENLRSELEADVQE 449
G+D+ IP TVIL + + + E + ++ D+
Sbjct: 155 SAHGKDN----IPEETNKQDGEDGNVNEEEKTVILANVLGVHQKYSSEEI-DDITDDILG 209
Query: 450 ECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS--EDDG 505
C IG V S+K+ N + V+ F +DA KC MNGR+F GR++ A DDG
Sbjct: 210 GCSSIGTVQSLKIDINSDEAKVV--FATVEDAIKCKLQMNGRFFDGRELVAYMLSDDG 265
>gi|308800922|ref|XP_003075242.1| splicing factor u2af-associated p (ISS) [Ostreococcus tauri]
gi|116061796|emb|CAL52514.1| splicing factor u2af-associated p (ISS), partial [Ostreococcus
tauri]
Length = 131
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 54/104 (51%), Gaps = 1/104 (0%)
Query: 427 MFTPAEMRADENLRSELEADVQEECVK-IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI 485
MF EM AD LR ELE DV EE + GPV+SVK G + + FK + A C+
Sbjct: 2 MFKLEEMYADAGLRKELEEDVMEEARRTCGPVESVKTYTTSKDGTMTICFKSIEAADACV 61
Query: 486 ELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAEL 529
NGRWF GRQI AS DG + A+ RL+ F EL
Sbjct: 62 TAWNGRWFDGRQIAASIWDGKSKFVSEESEAAQKERLDNFVREL 105
>gi|449477515|ref|XP_004175068.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein EWS-like
[Taeniopygia guttata]
Length = 608
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+T VYV GL D+VT+E++ + F +CG++K + T +P I +Y+DKETG KGDA V+Y
Sbjct: 322 STAVYVQGLSDNVTLEDLADFFKQCGVVKVNKRTGQPMINLYIDKETGKPKGDATVSYDD 381
Query: 348 EPSVALATQLLDGTPFR 364
+ A + DG F+
Sbjct: 382 PSTAKTAVEWFDGKDFQ 398
>gi|339254770|ref|XP_003372608.1| conserved hypothetical protein [Trichinella spiralis]
gi|316966942|gb|EFV51453.1| conserved hypothetical protein [Trichinella spiralis]
Length = 395
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 133/310 (42%), Gaps = 65/310 (20%)
Query: 170 ASPEGEDE-----FTDDDGTRYKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEEEVFPT 224
A P E+E F D T Y+WD +AW P+ Y E F
Sbjct: 125 AGPSNENETTTVRFDASDNTAYEWDATKQAWFPKVTRKQYKYIYSFHEF--------FCY 176
Query: 225 VNVTDDLANDEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSWFE 284
+ + L + + + +++ N + K ++V + +N P + F
Sbjct: 177 LFMWMTLFWLIIWQHMVTLHYNRLSF--------RNRQLKCLRQKVRIQISNSPKCNCFC 228
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
L V ++++ SKCG+I DP + K ++K+Y +E G KGD L T
Sbjct: 229 L--------------VLCQDLM---SKCGLIARDPISNKLKLKLYKSEE-GDNKGDGLCT 270
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL------ 398
Y+K+ S+ LA Q+LDG R +S+ +A+FE KG SKK+K L
Sbjct: 271 YIKKESLELAKQILDGYQIRNH---VISIEKARFEMKGS-----YDPSKKRKSLTAKQKK 322
Query: 399 --KKVEEKMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEE 450
K+ ++K+ W G R + TVI++++FT E+ + + + + ++ E
Sbjct: 323 LLKEKQDKLFDWRPERLRGERPKSD----CTVIIKYLFTSEEISNNAAILLDRKELLKTE 378
Query: 451 CVKIGPVDSV 460
C K G V +
Sbjct: 379 CGKYGTVKKI 388
>gi|157954057|ref|NP_001103270.1| RNA-binding protein EWS [Bos taurus]
gi|157743191|gb|AAI53845.1| EWSR1 protein [Bos taurus]
gi|296478416|tpg|DAA20531.1| TPA: Ewing sarcoma breakpoint region 1 [Bos taurus]
Length = 655
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 50/77 (64%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++V+ F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 359 NSAIYVQGLNDSVTLDDLVDFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 418
Query: 348 EPSVALATQLLDGTPFR 364
+ A + DG F+
Sbjct: 419 PATAKAAVEWFDGKDFQ 435
>gi|157134312|ref|XP_001663237.1| hypothetical protein AaeL_AAEL003124 [Aedes aegypti]
gi|108881402|gb|EAT45627.1| AAEL003124-PA [Aedes aegypti]
Length = 383
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 87/188 (46%), Gaps = 28/188 (14%)
Query: 185 RYKWDRGLRAWVPQEDTSSQNDGYGIEE-----------MTFLKEEE--VFPTVN----- 226
Y+W W+P++ S + Y EE + F EE+ FP V+
Sbjct: 192 HYRWCEETNQWIPKQQISEEGSVYSNEEGVHHYRDKEGMVHFWDEEKKAWFPKVDDDFMA 251
Query: 227 ---VTDDLANDEV-GKEKL-NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDS 281
+ ND G E + ST E+ E + QP Q +K++ P
Sbjct: 252 RYQLNYGFGNDSASGNENVEGSTNEEEPPEVPEQEPLPEEQVTQP--QGKKRKGPPEPPK 309
Query: 282 WFELKV--NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
WFELK NT VYV+ LP D+T EE E+ SKCG++ +DP T K ++K+Y D G KG
Sbjct: 310 WFELKPEHNTKVYVSNLPLDITEEEFGELMSKCGMVLKDPRTNKLKLKLYRDS-NGALKG 368
Query: 340 DALVTYLK 347
D L Y+K
Sbjct: 369 DGLCHYIK 376
>gi|440912881|gb|ELR62407.1| RNA-binding protein EWS, partial [Bos grunniens mutus]
Length = 669
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 48/73 (65%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D VT++++V+ F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 368 NSAIYVQGLNDSVTLDDLVDFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 427
Query: 348 EPSVALATQLLDG 360
+ A + DG
Sbjct: 428 PATAKAAVEWFDG 440
>gi|312372544|gb|EFR20484.1| hypothetical protein AND_20007 [Anopheles darlingi]
Length = 387
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 88/187 (47%), Gaps = 30/187 (16%)
Query: 173 EGEDEFTDDDGTRYKWDRGLRAWVPQED---TSSQNDGYGIEEMT------FLKEEEVFP 223
+G + D DG Y WD +AW P+ D + YG + T E P
Sbjct: 212 DGVHTYKDKDGAVYFWDEERKAWFPKIDDDFMAMYQLNYGFIDNTSGTAKPAESAERSAP 271
Query: 224 TVNVTDDLAN-DEVGKEKLNSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDSW 282
T + + DE KEK +EE+ G + + +K++A P W
Sbjct: 272 AATATRTVGSRDEQSKEK-------------ELEEEGAGAQGK----SKKRKAPPEPPKW 314
Query: 283 FEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGD 340
F+L + NT VYV+ LP D++ EE V SKCG++ +DP+ K ++K+Y D + G KGD
Sbjct: 315 FDLAPEQNTKVYVSNLPTDISEEEFGAVMSKCGMVMKDPKVNKLKLKLYRDAD-GKPKGD 373
Query: 341 ALVTYLK 347
L Y+K
Sbjct: 374 GLCHYIK 380
>gi|254578534|ref|XP_002495253.1| ZYRO0B06996p [Zygosaccharomyces rouxii]
gi|238938143|emb|CAR26320.1| ZYRO0B06996p [Zygosaccharomyces rouxii]
Length = 264
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 107/209 (51%), Gaps = 18/209 (8%)
Query: 291 VYVTGLPDDVTVE-EMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
+Y++ L T E ++++ FSK G IK D + P+ K+Y D E G KGDAL+ Y +
Sbjct: 44 IYLSNLEKQNTTEDQLIDEFSKFGAIKRD-QAGVPKCKLYKDDE-GKVKGDALIVYARHE 101
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
SV +A ++++G ++ +G + V A+F+ K + + + K +K
Sbjct: 102 SVPIAIEMMNG--YKLNG-FEIKVEVAQFQDKKRKLEDSNDEERSYKSVK---------S 149
Query: 410 GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG 469
++ + P V++ + E ++ L +++ D+ + C++IG V + + N G
Sbjct: 150 HNNEENVPKPPVVVIGNILDLYEDYHEQEL-DDIKRDILDGCLEIGFVKKIDL--NSVTG 206
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
V F++ A+ C + MNGR+F GR++
Sbjct: 207 EAEVSFENESHARNCCKEMNGRFFDGRRL 235
>gi|157110352|ref|XP_001651063.1| hypothetical protein AaeL_AAEL005549 [Aedes aegypti]
gi|108878734|gb|EAT42959.1| AAEL005549-PA [Aedes aegypti]
Length = 383
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 87/188 (46%), Gaps = 28/188 (14%)
Query: 185 RYKWDRGLRAWVPQEDTSSQNDGYGIEE-----------MTFLKEEE--VFPTVN----- 226
Y+W W+P++ S + Y EE + F EE+ FP V+
Sbjct: 192 HYRWCEETNQWIPKQQISEEGSVYSNEEGVHHYRDKEGMVHFWDEEKKAWFPKVDDDFMA 251
Query: 227 ---VTDDLANDEV-GKEKL-NSTEEKVNSADNVVEEKHNGKRKQPDKQVEKKEANKPPDS 281
+ ND G E + ST E+ E + QP Q +K++ P
Sbjct: 252 RYQLNYGFGNDSASGNENVEGSTNEEEPPEVPEQEPLPEEQVTQP--QGKKRKGPPEPPK 309
Query: 282 WFELKV--NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
WFE+K NT VYV+ LP D+T EE E+ SKCG++ +DP T K ++K+Y D G KG
Sbjct: 310 WFEMKPEHNTKVYVSNLPLDITEEEFGELMSKCGMVLKDPRTNKLKLKLYRDS-NGALKG 368
Query: 340 DALVTYLK 347
D L Y+K
Sbjct: 369 DGLCHYIK 376
>gi|156841911|ref|XP_001644326.1| hypothetical protein Kpol_1066p35 [Vanderwaltozyma polyspora DSM
70294]
gi|156114966|gb|EDO16468.1| hypothetical protein Kpol_1066p35 [Vanderwaltozyma polyspora DSM
70294]
Length = 300
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 121/244 (49%), Gaps = 31/244 (12%)
Query: 289 THVYVTGLPD-DVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
T V+++ LP+ D +E+ SK G I++D + KIY DK G +G+AL+ Y +
Sbjct: 40 TSVFLSNLPNIDNLKQELTINASKFGTIRKDKDGN-TLCKIYYDK-AGNFEGNALIVYSR 97
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF------IAKQVDSKKKKKLKKV 401
S+ LA +++D + F + + V A FE K + +++ D K KKL+ V
Sbjct: 98 AESIPLAIEMMDDSMFHGN---QIKVELASFENKKRNYENMNKDKSEEHDVKPLKKLQDV 154
Query: 402 E---EKMLGWGGR-----DD---AKLTI---PATVILRFMFTPAEMRADENLRSELEADV 447
E++L + DD KL I T +L + + DE L E+ D+
Sbjct: 155 HNKSEQILEVNEQTSDTDDDWVSEKLVIENRKKTAVLSNIVDIYGEQNDEEL-DEITDDI 213
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS--EDDG 505
E C ++G ++ K+ + G +RF +++A CI MN R+F GR+I A +DD
Sbjct: 214 IEGCQQMGTIEDFKLLK--ELGKAEIRFSKKEEAHHCISKMNNRFFDGRKIVAYMLDDDQ 271
Query: 506 LVNH 509
L+++
Sbjct: 272 LIDN 275
>gi|444724764|gb|ELW65360.1| RNA-binding protein EWS [Tupaia chinensis]
Length = 290
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 55/90 (61%), Gaps = 1/90 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ + V GL D+VT++++ E F CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 136 NSAICVQGLNDNVTIDDLAEFFKPCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 195
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
P+ A + G F+ K+ +S+ Q K
Sbjct: 196 PPTTKAAMEWFVGKYFQ-GSKLKISLAQKK 224
>gi|444725966|gb|ELW66515.1| RNA-binding protein EWS [Tupaia chinensis]
Length = 508
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 47/71 (66%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV GL D+VT++++ + F +CG++K + T +P I IY+DKETG KGDA V+Y
Sbjct: 208 NSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYED 267
Query: 348 EPSVALATQLL 358
P+ A +
Sbjct: 268 PPTAKAAVEWF 278
>gi|71033411|ref|XP_766347.1| hypothetical protein [Theileria parva strain Muguga]
gi|68353304|gb|EAN34064.1| hypothetical protein TP01_0826 [Theileria parva]
Length = 255
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 94/200 (47%), Gaps = 25/200 (12%)
Query: 324 PRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
P+IK+Y D E G K D VT++ + S+ LA + LD FR + I V QAKFE K
Sbjct: 42 PKIKLYTD-ENGELKSDGTVTFVNKESIDLAIRYLDNYHFRDNCVI--HVEQAKFEPKPT 98
Query: 384 RFIAKQVDSKKKKKLKKV-------EEKMLGWG-GRDDAKLTIPATVILRFMFTPAEMRA 435
+ AK + K +L+K E ++ GW DD T VI + MF+ +
Sbjct: 99 Q--AKPIPPSIKSELRKKYLAAKYEERRLQGWSDTLDDG--TGRRIVISKPMFSMEDAMV 154
Query: 436 D----ENLRSELEADV------QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI 485
EN + D+ +E K V+ V HPQG+V ++FK+ DA+ I
Sbjct: 155 TFIILENFQKYETGDLFYEELKEEILSKYVEVEKVTPIPRHPQGIVCIKFKNSSDAEVFI 214
Query: 486 ELMNGRWFGGRQIHASEDDG 505
NGR F GR++ DG
Sbjct: 215 SKFNGRMFDGRELEVYFFDG 234
>gi|49671130|gb|AAH75120.1| hypothetical protein MGC76258 [Xenopus (Silurana) tropicalis]
Length = 673
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 48/73 (65%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
++ +YV GL D+VTVEE+V+ F CG +K + T +P + +++DKETG KGD V++
Sbjct: 391 SSTIYVQGLNDNVTVEEIVDFFKHCGDVKINKRTGEPLVNLFMDKETGKPKGDGTVSFED 450
Query: 348 EPSVALATQLLDG 360
PS A +L DG
Sbjct: 451 PPSAKTAIELCDG 463
>gi|45361465|ref|NP_989309.1| RNA-binding protein EWS [Xenopus (Silurana) tropicalis]
gi|39794445|gb|AAH63928.1| hypothetical protein MGC76258 [Xenopus (Silurana) tropicalis]
Length = 674
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 48/73 (65%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
++ +YV GL D+VTVEE+V+ F CG +K + T +P + +++DKETG KGD V++
Sbjct: 392 SSTIYVQGLNDNVTVEEIVDFFKHCGDVKINKRTGEPLVNLFMDKETGKPKGDGTVSFED 451
Query: 348 EPSVALATQLLDG 360
PS A +L DG
Sbjct: 452 PPSAKTAIELCDG 464
>gi|332025817|gb|EGI65974.1| HIV Tat-specific factor 1-like protein [Acromyrmex echinatior]
Length = 251
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 92/200 (46%), Gaps = 41/200 (20%)
Query: 186 YKWDRGLRAWVPQEDTSSQNDGYGIEEMTFLKEE--------EVFPTVNVTDDLAN---- 233
YK+D+ W+ +ED +DG +M KEE EVF N T +
Sbjct: 41 YKFDQENNKWIVKEDKKVSSDGKKSLQME-KKEESGPEAGAAEVFGFENDTHTYTDPNDG 99
Query: 234 ------------------DEVGKEKLN------STEEKVNSADNVVEEKHNGKRKQPDKQ 269
D + K ++N ST + K ++ ++
Sbjct: 100 SVYFWDKEKNAWFPKVDEDFMAKYQMNYGFTDTSTPQSKPPEPKQPPPTQKIKEEKEQRR 159
Query: 270 VEKKEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
VE K + P +WFE+ NT +YV+GLP D+T+EE+ ++F+KCG+I D E K +IK
Sbjct: 160 VEAKRKAQEPPTWFEVDEAHNTAIYVSGLPLDITIEELTQLFTKCGLIARD-EKGKDKIK 218
Query: 328 IYVDKETGMKKGDALVTYLK 347
+Y D G KGDAL TY+K
Sbjct: 219 LYKDG-NGEPKGDALCTYIK 237
>gi|33604057|gb|AAH56281.1| Ewsr1a protein [Danio rerio]
Length = 623
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 48/77 (62%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL ++ T+EE+ + F GII+ + T P + IY DK+TG KGDA ++Y +
Sbjct: 346 NSTIYITGLTENATLEEVADFFKHSGIIRINKRTGLPAVNIYTDKDTGKPKGDATLSYEE 405
Query: 348 EPSVALATQLLDGTPFR 364
PS A + DG F+
Sbjct: 406 PPSAKAAVEWFDGKDFQ 422
>gi|169646246|ref|NP_001108610.1| Ewing sarcoma breakpoint region 1a [Danio rerio]
Length = 626
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 48/77 (62%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL ++ T+EE+ + F GII+ + T P + IY DK+TG KGDA ++Y +
Sbjct: 345 NSTIYITGLTENATLEEVADFFKHSGIIRINKRTGLPAVNIYTDKDTGKPKGDATLSYEE 404
Query: 348 EPSVALATQLLDGTPFR 364
PS A + DG F+
Sbjct: 405 PPSAKAAVEWFDGKDFQ 421
>gi|46249721|gb|AAH68397.1| Ewsr1a protein [Danio rerio]
Length = 624
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 48/77 (62%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL ++ T+EE+ + F GII+ + T P + IY DK+TG KGDA ++Y +
Sbjct: 343 NSTIYITGLTENATLEEVADFFKHSGIIRINKRTGLPAVNIYTDKDTGKPKGDATLSYEE 402
Query: 348 EPSVALATQLLDGTPFR 364
PS A + DG F+
Sbjct: 403 PPSAKAAVEWFDGKDFQ 419
>gi|17532119|ref|NP_495483.1| Protein FUST-1 [Caenorhabditis elegans]
gi|351058431|emb|CCD65889.1| Protein FUST-1 [Caenorhabditis elegans]
Length = 448
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 51/88 (57%), Gaps = 3/88 (3%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ + +VFS CG I ++ + PRIKIY D+ TG KG+ ++T++ +
Sbjct: 203 VFVQGISTTANEAYIADVFSTCGDIAKN--DRGPRIKIYTDRNTGEPKGECMITFVDASA 260
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKF 378
A + +G PF P G PMS++ AKF
Sbjct: 261 AQQAITMYNGQPF-PGGSSPMSISLAKF 287
>gi|118100155|ref|XP_415770.2| PREDICTED: TATA-binding protein-associated factor 2N [Gallus
gallus]
Length = 472
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +DV+ +++ + F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 266 NNTIFVQGLGEDVSTDQVADYFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 325
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 326 PPSAKAAIDWFDGKEF 341
>gi|449479649|ref|XP_002195151.2| PREDICTED: TATA-binding protein-associated factor 2N-like
[Taeniopygia guttata]
Length = 471
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +DV+ +++ + F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 266 NNTIFVQGLGEDVSTDQVADYFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 325
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 326 PPSAKAAIDWFDGKEF 341
>gi|312372547|gb|EFR20487.1| hypothetical protein AND_20012 [Anopheles darlingi]
Length = 271
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 100/194 (51%), Gaps = 21/194 (10%)
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF--IAKQVDSKKKKKLKKVEEKMLG 407
SV LA +LD R + V +A+F+ +GE + ++ K+K++LKK++E +
Sbjct: 14 SVDLALSILDNYDVRGHK---IKVQRAEFQMRGEYNPKLKPKMRKKEKERLKKMQESLFD 70
Query: 408 W------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
W G R + VI++ +F PA + +L E + D++EEC K G V V
Sbjct: 71 WRPEKMRGERSKHE----RIVIIKNLFEPALFDREVHLLLEYQNDLREECGKCGSVRRVL 126
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIR---DLDAE 518
+ + HP+GV V D ++A ++LMNGR+FG R++ A+ DG + D+D
Sbjct: 127 LYDRHPEGVAQVTMGDPEEADLVVQLMNGRFFGQRKLTAAIWDGRTKYRIAETDADIDKR 186
Query: 519 ASRLEQFGAELEAD 532
E+F LEAD
Sbjct: 187 RGNWEEF---LEAD 197
>gi|326931302|ref|XP_003211771.1| PREDICTED: TATA-binding protein-associated factor 2N-like
[Meleagris gallopavo]
Length = 477
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +DV+ +++ + F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 271 NNTIFVQGLGEDVSTDQVADYFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 330
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 331 PPSAKAAIDWFDGKEF 346
>gi|449265940|gb|EMC77067.1| TATA-binding protein-associated factor 2N, partial [Columba livia]
Length = 436
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +DV+ +++ + F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 242 NNTIFVQGLGEDVSTDQVADYFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 301
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 302 PPSAKAAIDWFDGKEF 317
>gi|195348533|ref|XP_002040803.1| GM22141 [Drosophila sechellia]
gi|194122313|gb|EDW44356.1| GM22141 [Drosophila sechellia]
Length = 649
Score = 68.6 bits (166), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 102/199 (51%), Gaps = 20/199 (10%)
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEE 403
L+ SV LA ++LD R + V +A+F+ +GE + + K K+KL+K++E
Sbjct: 433 LQVESVNLALKILDEYNLRGHK---IRVQRAQFQMRGEYNPALKPKRKKKDKEKLQKMKE 489
Query: 404 KMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPV 457
K+ W G R + TVI++ +FTP + L E + +++EEC K G V
Sbjct: 490 KLFDWRPDKMRGERSKNE----KTVIIKNLFTPELFEKEVELILEYQNNLREECSKCGMV 545
Query: 458 DSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDA 517
V + + HP+GV + ++A I++M GR+FG RQ+ A DG + I +
Sbjct: 546 RKVVIYDRHPEGVAQINMASPEEADLVIQMMQGRFFGQRQLSAESWDGKTKY-KIDESAV 604
Query: 518 EA----SRLEQFGAELEAD 532
EA S+ ++F AE E D
Sbjct: 605 EAHERLSKWDEFLAEEETD 623
>gi|109511562|ref|XP_001060226.1| PREDICTED: RNA-binding protein FUS-like [Rattus norvegicus]
gi|293362785|ref|XP_576927.3| PREDICTED: RNA-binding protein FUS-like [Rattus norvegicus]
Length = 516
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNEKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 337 PPSAKTAIDWFDGKEF 352
>gi|341899812|gb|EGT55747.1| CBN-FUST-1 protein [Caenorhabditis brenneri]
Length = 475
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 51/88 (57%), Gaps = 3/88 (3%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ + +VFS CG I ++ + PRIKIY D+ TG KG+ ++T++
Sbjct: 231 IFVQGISTSANEAYIADVFSTCGDIAKN--ERGPRIKIYTDRNTGEPKGECMITFVDAND 288
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKF 378
A + +G+PF P G PM+++ AKF
Sbjct: 289 AQKAITMYNGSPF-PGGSQPMNISLAKF 315
>gi|432892495|ref|XP_004075809.1| PREDICTED: TATA-binding protein-associated factor 2N-like [Oryzias
latipes]
Length = 446
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 55/95 (57%), Gaps = 3/95 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +D TV+E+ + F + GIIK + +T +P I IY DK TG KG+A V++
Sbjct: 230 NNTIFVQGLGEDATVQEVGDYFKQIGIIKVNKKTGQPMINIYSDKATGRPKGEATVSFDD 289
Query: 348 EPSVALATQLLDGTPF--RPDGKIPMSVTQAKFEQ 380
PS A DG F +P K+ + +A+F Q
Sbjct: 290 PPSAKAAIDWFDGKEFNGKPI-KVSFATRRAEFTQ 323
>gi|147903153|ref|NP_001090836.1| fused in sarcoma [Xenopus (Silurana) tropicalis]
gi|89271358|emb|CAJ83479.1| fusion (involved in t(12;16) in malignant liposarcoma) [Xenopus
(Silurana) tropicalis]
gi|197246695|gb|AAI68544.1| fusion (involved in t(12;16) in malignant liposarcoma) [Xenopus
(Silurana) tropicalis]
Length = 539
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VTVE + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 297 NNTIFVQGLGENVTVESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 356
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 357 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 388
>gi|440791603|gb|ELR12841.1| splicing factor, CC1like subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 594
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
Query: 420 ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRK 479
A ++L+ MF PA+ + N +++ DV EEC K G V V + P G+V +RF+ +
Sbjct: 501 AFMLLKNMFDPAQ-ETEPNFHLDIQEDVTEECSKYGKVLQCHVVRDSPSGLVYLRFESSE 559
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
A K I+ +NGRWF G+ I A
Sbjct: 560 GAAKAIQALNGRWFAGKVISA 580
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 46/210 (21%), Positives = 85/210 (40%), Gaps = 40/210 (19%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+ + LP + E+ E F K G + + I++ D+ + KG + YL + S
Sbjct: 171 VFASNLPIRASESELFEFFGKAGKVHD--------IRLITDRNSRKSKGFGYIEYLDKSS 222
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
V LA L+GT + KG+ + + ++K +
Sbjct: 223 VPLALHQLNGT-----------------QCKGQTVLVQITQAEKNRAAAAAAAAAA---- 261
Query: 411 RDDAKLTIPATVILRFMFTPAEMRADENLRSELE--ADVQEECVKIGPVDSVKVCENHPQ 468
L+ P + + + T A+++LR+ E D+Q+ + I P +
Sbjct: 262 NAPPSLSAPTRLYVGNLHTDL---AEDDLRTVFEPFGDIQQINLHIDPE------TGRSK 312
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
G V++K +DA+K ++ NG GRQ+
Sbjct: 313 GFAFVQYKSPEDAKKALQHCNGMELAGRQL 342
>gi|195592132|ref|XP_002085790.1| GD12119 [Drosophila simulans]
gi|194197799|gb|EDX11375.1| GD12119 [Drosophila simulans]
Length = 605
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/168 (32%), Positives = 88/168 (52%), Gaps = 15/168 (8%)
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEE 403
L+ SV LA ++LD R + V +A+F+ +GE + + K K+KL+K++E
Sbjct: 440 LQVESVNLALKILDEYNLRGHK---IRVQRAQFQMRGEYNPALKPKRKKKDKEKLQKMKE 496
Query: 404 KMLGW------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPV 457
K+ W G R + TVI++ +FTP + L E + +++EEC K G V
Sbjct: 497 KLFDWRPDKMRGERSKNE----KTVIIKNLFTPELFEKEVELILEYQNNLREECSKCGMV 552
Query: 458 DSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
V + + HP+GV + ++A I++M GR+FG RQ+ A DG
Sbjct: 553 RKVVIYDRHPEGVAQINMASPEEADLVIQMMQGRFFGQRQLSAESWDG 600
>gi|147898544|ref|NP_001087044.1| MGC80893 protein [Xenopus laevis]
gi|50415028|gb|AAH77935.1| MGC80893 protein [Xenopus laevis]
Length = 482
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 46/73 (63%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ +DV+ E++ E F + GIIK + +T +P I +Y DKETG KG+A V++ PS
Sbjct: 275 IFVQGMGEDVSQEQISEFFKQIGIIKTNKKTGQPMINLYTDKETGKSKGEATVSFDDPPS 334
Query: 351 VALATQLLDGTPF 363
A + DG F
Sbjct: 335 AKAAIEWFDGKMF 347
>gi|348509964|ref|XP_003442516.1| PREDICTED: hypothetical protein LOC100697861 [Oreochromis
niloticus]
Length = 542
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 52/92 (56%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL DD TVE + + F + GIIK + +T P I +Y D+ETG KG+A V++
Sbjct: 299 NNTIFVQGLGDDYTVESVADFFKQIGIIKVNKKTGLPMINLYTDRETGKLKGEATVSFDD 358
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A + DG F + K+ + +A F
Sbjct: 359 PPSAKAAIEWFDGKDFNGNPIKVSFATRRADF 390
>gi|148685673|gb|EDL17620.1| fusion, derived from t(12;16) malignant liposarcoma (human),
isoform CRA_f [Mus musculus]
Length = 531
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 290 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 349
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 350 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 381
>gi|354497863|ref|XP_003511037.1| PREDICTED: RNA-binding protein FUS-like [Cricetulus griseus]
Length = 517
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 276 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 335
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 336 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 367
>gi|74145855|dbj|BAE24196.1| unnamed protein product [Mus musculus]
Length = 518
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 368
>gi|126334532|ref|XP_001364792.1| PREDICTED: RNA-binding protein FUS-like [Monodelphis domestica]
Length = 527
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 285 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 344
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 345 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 376
>gi|26252108|gb|AAH40827.1| Fusion, derived from t(12;16) malignant liposarcoma (human) [Mus
musculus]
gi|34849597|gb|AAH58247.1| Fusion, derived from t(12;16) malignant liposarcoma (human) [Mus
musculus]
Length = 517
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 276 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 335
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 336 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 367
>gi|410984740|ref|XP_003998684.1| PREDICTED: RNA-binding protein FUS [Felis catus]
Length = 516
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 274 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 333
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 334 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 365
>gi|58865844|ref|NP_001012137.1| RNA-binding protein FUS [Rattus norvegicus]
gi|56268930|gb|AAH87153.1| Fusion, derived from t(12;16) malignant liposarcoma (human) [Rattus
norvegicus]
gi|149067652|gb|EDM17204.1| rCG39872, isoform CRA_d [Rattus norvegicus]
Length = 518
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 368
>gi|20982845|ref|NP_631888.1| RNA-binding protein FUS [Mus musculus]
gi|8928084|sp|P56959.1|FUS_MOUSE RecName: Full=RNA-binding protein FUS; AltName: Full=Protein pigpen
gi|7920331|gb|AAF70602.1| pigpen protein [Mus musculus]
gi|30027148|gb|AAP06748.1| pigpen protein [Mus musculus]
gi|74211596|dbj|BAE26524.1| unnamed protein product [Mus musculus]
gi|148685668|gb|EDL17615.1| fusion, derived from t(12;16) malignant liposarcoma (human),
isoform CRA_a [Mus musculus]
Length = 518
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 368
>gi|149067649|gb|EDM17201.1| rCG39872, isoform CRA_a [Rattus norvegicus]
Length = 506
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 368
>gi|148685669|gb|EDL17616.1| fusion, derived from t(12;16) malignant liposarcoma (human),
isoform CRA_b [Mus musculus]
Length = 506
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 368
>gi|147900831|ref|NP_001087676.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa [Xenopus laevis]
gi|51703637|gb|AAH81071.1| MGC82028 protein [Xenopus laevis]
Length = 475
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 46/73 (63%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V GL +DV+ E++ + F + GIIK + +T KP I +Y DKETG KG+A V++ PS
Sbjct: 267 IFVQGLGEDVSEEQISDFFKQIGIIKINKKTGKPMINLYADKETGKSKGEATVSFDDPPS 326
Query: 351 VALATQLLDGTPF 363
A + DG F
Sbjct: 327 AKAAIEWFDGKTF 339
>gi|148224502|ref|NP_001086293.1| MGC84691 protein [Xenopus laevis]
gi|49522198|gb|AAH74437.1| MGC84691 protein [Xenopus laevis]
Length = 536
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VTVE + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 296 NNTIFVQGLGENVTVESVADYFKQIGIIKINKKTGQPMINLYTDRETGKLKGEATVSFDD 355
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKFE 379
PS A DG F + K+ + +A F
Sbjct: 356 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADFN 388
>gi|417401285|gb|JAA47533.1| Putative conserved zn-finger protein [Desmodus rotundus]
Length = 457
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 274 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 333
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 334 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 365
>gi|348523696|ref|XP_003449359.1| PREDICTED: hypothetical protein LOC100703628 [Oreochromis
niloticus]
Length = 436
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 55/95 (57%), Gaps = 3/95 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +D TV+E+ + F + GIIK + +T +P I IY DK TG KG+A V++
Sbjct: 217 NNTIFVQGLGEDATVQEVGDFFKQIGIIKVNKKTGQPMINIYSDKATGRPKGEATVSFDD 276
Query: 348 EPSVALATQLLDGTPF--RPDGKIPMSVTQAKFEQ 380
PS A DG F +P K+ + +A+F Q
Sbjct: 277 PPSAKAAIDWFDGKDFNGKPI-KVSFATRRAEFTQ 310
>gi|335284313|ref|XP_003124517.2| PREDICTED: RNA-binding protein FUS-like isoform 1 [Sus scrofa]
Length = 514
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 272 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 331
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 332 PPSAKAAIDWFDGKEF 347
>gi|351738724|gb|AEQ61462.1| FUS [Sus scrofa]
Length = 517
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 275 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 334
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 335 PPSAKAAIDWFDGKEF 350
>gi|27806887|ref|NP_776337.1| RNA-binding protein FUS [Bos taurus]
gi|1172095|gb|AAC13543.1| pigpen [Bos taurus]
gi|296473237|tpg|DAA15352.1| TPA: RNA-binding protein FUS [Bos taurus]
Length = 512
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 270 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 329
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 330 PPSAKAAIDWFDGKEF 345
>gi|335284315|ref|XP_003354570.1| PREDICTED: RNA-binding protein FUS-like isoform 2 [Sus scrofa]
Length = 518
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 276 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 335
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 336 PPSAKAAIDWFDGKEF 351
>gi|194386744|dbj|BAG61182.1| unnamed protein product [Homo sapiens]
Length = 429
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 213 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 272
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 273 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 304
>gi|426381968|ref|XP_004057601.1| PREDICTED: RNA-binding protein FUS [Gorilla gorilla gorilla]
Length = 524
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 282 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 341
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 342 PPSAKAAIDWFDGKEF 357
>gi|149067653|gb|EDM17205.1| rCG39872, isoform CRA_e [Rattus norvegicus]
Length = 445
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 204 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 263
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 264 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 295
>gi|148685672|gb|EDL17619.1| fusion, derived from t(12;16) malignant liposarcoma (human),
isoform CRA_e [Mus musculus]
Length = 445
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 204 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 263
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 264 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 295
>gi|283135201|ref|NP_001164105.1| RNA-binding protein FUS isoform 2 [Homo sapiens]
gi|3582783|gb|AAC35284.1| FUS/TLS protein [Homo sapiens]
gi|4210363|emb|CAA50559.1| FUS gycline rich protein [Homo sapiens]
gi|119572536|gb|EAW52151.1| fusion (involved in t(12;16) in malignant liposarcoma), isoform
CRA_b [Homo sapiens]
Length = 525
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 283 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 342
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 343 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 374
>gi|116241371|sp|Q28009.2|FUS_BOVIN RecName: Full=RNA-binding protein FUS; AltName: Full=Protein pigpen
gi|112362011|gb|AAI19966.1| Fusion (involved in t(12;16) in malignant liposarcoma) [Bos taurus]
Length = 513
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 271 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 330
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 331 PPSAKAAIDWFDGKEF 346
>gi|4826734|ref|NP_004951.1| RNA-binding protein FUS isoform 1 [Homo sapiens]
gi|544357|sp|P35637.1|FUS_HUMAN RecName: Full=RNA-binding protein FUS; AltName: Full=75 kDa
DNA-pairing protein; AltName: Full=Oncogene FUS;
AltName: Full=Oncogene TLS; AltName: Full=POMp75;
AltName: Full=Translocated in liposarcoma protein
gi|386157|gb|AAB27102.1| TLS [Homo sapiens]
gi|3582784|gb|AAC35285.1| FUS/TLS protein [Homo sapiens]
gi|12653267|gb|AAH00402.1| Fusion (involved in t(12;16) in malignant liposarcoma) [Homo
sapiens]
gi|12803287|gb|AAH02459.1| Fusion (involved in t(12;16) in malignant liposarcoma) [Homo
sapiens]
gi|30583101|gb|AAP35795.1| fusion, derived from t(12;16) malignant liposarcoma [Homo sapiens]
gi|48145611|emb|CAG33028.1| FUS [Homo sapiens]
gi|60655179|gb|AAX32153.1| fusion [synthetic construct]
gi|60655181|gb|AAX32154.1| fusion [synthetic construct]
gi|119572535|gb|EAW52150.1| fusion (involved in t(12;16) in malignant liposarcoma), isoform
CRA_a [Homo sapiens]
gi|123981604|gb|ABM82631.1| fusion (involved in t(12;16) in malignant liposarcoma) [synthetic
construct]
gi|123996413|gb|ABM85808.1| fusion (involved in t(12;16) in malignant liposarcoma) [synthetic
construct]
gi|261858878|dbj|BAI45961.1| RNA-binding protein FUS [synthetic construct]
Length = 526
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 284 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 343
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 344 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 375
>gi|19913512|gb|AAH26062.1| Fusion (involved in t(12;16) in malignant liposarcoma) [Homo
sapiens]
Length = 526
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 284 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 343
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 344 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 375
>gi|440913331|gb|ELR62795.1| RNA-binding protein FUS, partial [Bos grunniens mutus]
Length = 509
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 267 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 326
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 327 PPSAKAAIDWFDGKEF 342
>gi|171687279|ref|XP_001908580.1| hypothetical protein [Podospora anserina S mat+]
gi|170943601|emb|CAP69253.1| unnamed protein product [Podospora anserina S mat+]
Length = 154
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 51/80 (63%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
VILR MFT E+ D +++ D++EEC +G V++V + + +G+V+V+F R+
Sbjct: 51 VVILRHMFTLEELEEDPAALLDIKDDIREECETLGKVNNVILYDQEEEGIVMVKFGSREA 110
Query: 481 AQKCIELMNGRWFGGRQIHA 500
A+KC+ M+GR F GR + A
Sbjct: 111 AEKCLSKMHGRKFDGRTVEA 130
>gi|410353427|gb|JAA43317.1| fused in sarcoma [Pan troglodytes]
gi|410353433|gb|JAA43320.1| fused in sarcoma [Pan troglodytes]
Length = 524
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 282 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 341
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 342 PPSAKAAIDWFDGKEF 357
>gi|30584649|gb|AAP36577.1| Homo sapiens fusion, derived from t(12;16) malignant liposarcoma
[synthetic construct]
gi|61372059|gb|AAX43778.1| fusion [synthetic construct]
gi|61372064|gb|AAX43779.1| fusion [synthetic construct]
Length = 527
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 284 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 343
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 344 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 375
>gi|395514942|ref|XP_003761668.1| PREDICTED: uncharacterized protein LOC100915308 [Sarcophilus
harrisii]
Length = 536
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 294 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 353
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 354 PPSAKAAIDWFDGKEF 369
>gi|426254615|ref|XP_004020972.1| PREDICTED: RNA-binding protein FUS isoform 2 [Ovis aries]
Length = 515
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 273 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 332
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 333 PPSAKAAIDWFDGKEF 348
>gi|194218995|ref|XP_001500625.2| PREDICTED: RNA-binding protein FUS-like isoform 1 [Equus caballus]
Length = 520
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 278 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 337
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 338 PPSAKAAIDWFDGKEF 353
>gi|441597858|ref|XP_003280509.2| PREDICTED: uncharacterized protein LOC100594007 [Nomascus
leucogenys]
Length = 533
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 291 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 350
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 351 PPSAKAAIDWFDGKEF 366
>gi|338712778|ref|XP_003362768.1| PREDICTED: RNA-binding protein FUS-like isoform 2 [Equus caballus]
Length = 515
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 273 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 332
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 333 PPSAKAAIDWFDGKEF 348
>gi|448295|prf||1916411B TLS protein
Length = 260
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 18 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 77
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 78 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 109
>gi|119572537|gb|EAW52152.1| fusion (involved in t(12;16) in malignant liposarcoma), isoform
CRA_c [Homo sapiens]
Length = 321
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 79 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 138
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 139 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 170
>gi|432113328|gb|ELK35741.1| TATA-binding protein-associated factor 2N [Myotis davidii]
Length = 385
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 49/84 (58%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS F+ N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 28 DSEFDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 87
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 88 EATVSFDDPPSAKAAIDWFDGKEF 111
>gi|283135173|ref|NP_001164408.1| RNA-binding protein FUS isoform 3 [Homo sapiens]
Length = 522
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 280 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 339
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 340 PPSAKAAIDWFDGKEF 355
>gi|417411322|gb|JAA52101.1| Putative conserved zn-finger protein, partial [Desmodus rotundus]
Length = 512
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 270 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 329
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 330 PPSAKAAIDWFDGKEF 345
>gi|147902226|ref|NP_001080383.1| fused in sarcoma [Xenopus laevis]
gi|27881721|gb|AAH44319.1| Fus protein [Xenopus laevis]
Length = 536
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VTVE + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 297 NNTIFVQGLGENVTVESVADYFKQIGIIKINKKTGQPMINLYTDRETGKLKGEATVSFDD 356
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 357 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 388
>gi|431909852|gb|ELK12954.1| hypothetical protein PAL_GLEAN10007997 [Pteropus alecto]
Length = 480
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ + V GL D VT++++ + F +CG++K + T+ P + IY+DKET KGDA V++
Sbjct: 325 NSAICVQGLNDIVTLDDLADFFKQCGVVKMNKRTRHPMLHIYLDKETRKPKGDATVSFED 384
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKF 378
P+ A + G F+ K+ +S+ Q K
Sbjct: 385 PPTAKAAVEWFAGKDFQ-GSKLKVSLAQKKL 414
>gi|426254613|ref|XP_004020971.1| PREDICTED: RNA-binding protein FUS isoform 1 [Ovis aries]
Length = 511
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 269 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 328
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 329 PPSAKAAIDWFDGKEF 344
>gi|403276834|ref|XP_003930088.1| PREDICTED: uncharacterized protein LOC101037596 [Saimiri
boliviensis boliviensis]
Length = 531
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 289 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 348
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 349 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 380
>gi|158255308|dbj|BAF83625.1| unnamed protein product [Homo sapiens]
Length = 525
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 283 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 342
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 343 PPSAKAAIDWFDGKEF 358
>gi|29179510|gb|AAH49308.1| Fusion (involved in t(12;16) in malignant liposarcoma) [Danio
rerio]
Length = 410
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL DD TV+ + + F + GIIK + +T P I +Y D+ETG KG+A V++
Sbjct: 298 NNTIFVQGLGDDYTVDSVADYFKQIGIIKVNKKTGLPMINLYTDRETGKLKGEATVSFDD 357
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKFEQKGE 383
PS A DG F + K+ + +A+F + G
Sbjct: 358 PPSAKAAIDWFDGKDFNGNPIKVSFATRRAEFGRGGS 394
>gi|397471984|ref|XP_003807543.1| PREDICTED: RNA-binding protein FUS [Pan paniscus]
Length = 521
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 279 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 338
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 339 PPSAKAAIDWFDGKEF 354
>gi|343962169|dbj|BAK62672.1| RNA-binding protein FUS [Pan troglodytes]
gi|410353431|gb|JAA43319.1| fused in sarcoma [Pan troglodytes]
Length = 524
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 282 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 341
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 342 PPSAKAAIDWFDGKEF 357
>gi|291411009|ref|XP_002721797.1| PREDICTED: fused in sarcoma [Oryctolagus cuniculus]
Length = 594
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 349 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 408
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 409 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 440
>gi|327278232|ref|XP_003223866.1| PREDICTED: hypothetical protein LOC100556650 [Anolis carolinensis]
Length = 521
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 276 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 335
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 336 PPSAKAAIDWFDGKEF 351
>gi|62087384|dbj|BAD92139.1| fusion (involved in t(12;16) in malignant liposarcoma) isoform a
variant [Homo sapiens]
Length = 300
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 58 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 117
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 118 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 149
>gi|354498506|ref|XP_003511356.1| PREDICTED: TATA-binding protein-associated factor 2N-like
[Cricetulus griseus]
Length = 616
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 263 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 322
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 323 PPSAKAAIDWFDGKEF 338
>gi|392571432|gb|EIW64604.1| splicing factor CC1-like protein [Trametes versicolor FP-101664
SS1]
Length = 344
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 104/229 (45%), Gaps = 33/229 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 120 LYVGSLHFNLTESDIKQVFEPFGELE--------FVDLHRDPMTGRSKGYAFVQYKRAED 171
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A + ++G F G+ ++ +KG +Q +D L + L
Sbjct: 172 AKMALEQMEG--FELAGR---TLRVNTVHEKGSTRYTQQDTLDEAGGGNLNAASRQALMQ 226
Query: 409 G-GRDD---------AKLTIPAT-----VILRFMFTPAEMRADENLRSELEADVQEEC-V 452
R D A+ IP T V+L+ MF P E + + +L DV+ EC
Sbjct: 227 KLARTDSAPVKLEPVARPHIPQTMQSRSVLLKNMFNPEE-ETERDWDKDLADDVKSECAT 285
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K GPV ++KV E QG + V F+ A + IE +NGRWFGGRQI A+
Sbjct: 286 KYGPVQAIKV-EKETQGEIYVLFETVDAAGQAIEGLNGRWFGGRQISAA 333
>gi|392351445|ref|XP_003750930.1| PREDICTED: TATA-binding protein-associated factor 2N [Rattus
norvegicus]
Length = 550
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 231 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 290
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 291 PPSAKAAIDWFDGKEF 306
>gi|426238615|ref|XP_004023731.1| PREDICTED: LOW QUALITY PROTEIN: TATA-binding protein-associated
factor 2N-like [Ovis aries]
Length = 570
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 224 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 283
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 284 EATVSFDDPPSAKAAIDWFDGKEF 307
>gi|187469033|gb|AAI66769.1| Taf15 protein [Rattus norvegicus]
Length = 572
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 223 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 282
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 283 EATVSFDDPPSAKAAIDWFDGKEF 306
>gi|90075012|dbj|BAE87186.1| unnamed protein product [Macaca fascicularis]
Length = 533
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 291 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 350
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 351 PPSAKAAIDWFDGKEF 366
>gi|109128306|ref|XP_001112691.1| PREDICTED: hypothetical protein LOC713433 isoform 3 [Macaca
mulatta]
Length = 533
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 291 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 350
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 351 PPSAKAAIDWFDGKEF 366
>gi|390471545|ref|XP_002756139.2| PREDICTED: RNA-binding protein FUS [Callithrix jacchus]
Length = 526
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 284 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 343
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKFE 379
PS A DG F + K+ + +A F
Sbjct: 344 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADFN 376
>gi|355710153|gb|EHH31617.1| hypothetical protein EGK_12717, partial [Macaca mulatta]
Length = 528
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 287 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 346
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 347 PPSAKAAIDWFDGKEF 362
>gi|426258988|ref|XP_004023084.1| PREDICTED: TATA-binding protein-associated factor 2N-like, partial
[Ovis aries]
Length = 486
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 224 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 283
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 284 EATVSFDDPPSAKAAIDWFDGKEF 307
>gi|1040970|gb|AAA79948.1| fus-like protein, partial [Homo sapiens]
Length = 528
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 287 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 346
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 347 PPSAKAAIDWFDGKEF 362
>gi|338711012|ref|XP_001501405.3| PREDICTED: TATA-binding protein-associated factor 2N [Equus
caballus]
Length = 599
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 224 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 283
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 284 EATVSFDDPPSAKAAIDWFDGKEF 307
>gi|74151472|dbj|BAE38848.1| unnamed protein product [Mus musculus]
Length = 518
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 231 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 290
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 291 PPSAKAAIDWFDGKEF 306
>gi|397494299|ref|XP_003846265.1| PREDICTED: LOW QUALITY PROTEIN: TATA-binding protein-associated
factor 2N [Pan paniscus]
Length = 592
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 225 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 284
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 285 EATVSFDDPPSAKAAIDWFDGKEF 308
>gi|426348662|ref|XP_004041948.1| PREDICTED: TATA-binding protein-associated factor 2N isoform 1
[Gorilla gorilla gorilla]
Length = 593
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEF 308
>gi|30794412|ref|NP_081703.1| TATA-binding protein-associated factor 2N [Mus musculus]
gi|26342028|dbj|BAC34676.1| unnamed protein product [Mus musculus]
gi|74226883|dbj|BAE27086.1| unnamed protein product [Mus musculus]
gi|187950713|gb|AAI37592.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 557
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 223 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 282
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 283 EATVSFDDPPSAKAAIDWFDGKEF 306
>gi|410980556|ref|XP_003996643.1| PREDICTED: TATA-binding protein-associated factor 2N [Felis catus]
Length = 561
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 228 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 287
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 288 EATVSFDDPPSAKAAIDWFDGKEF 311
>gi|380816820|gb|AFE80284.1| TATA-binding protein-associated factor 2N isoform 1 [Macaca
mulatta]
Length = 564
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 225 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 284
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 285 EATVSFDDPPSAKAAIDWFDGKEF 308
>gi|348567467|ref|XP_003469520.1| PREDICTED: TATA-binding protein-associated factor 2N-like [Cavia
porcellus]
Length = 346
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 236 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 295
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 296 PPSAKAAIDWFDGKEFH 312
>gi|296201980|ref|XP_002748387.1| PREDICTED: TATA-binding protein-associated factor 2N [Callithrix
jacchus]
Length = 592
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEF 308
>gi|440902812|gb|ELR53553.1| TATA-binding protein-associated factor 2N, partial [Bos grunniens
mutus]
Length = 609
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 222 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 281
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 282 EATVSFDDPPSAKAAIDWFDGKEF 305
>gi|402899399|ref|XP_003912685.1| PREDICTED: TATA-binding protein-associated factor 2N [Papio anubis]
Length = 589
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 230 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 289
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 290 PPSAKAAIDWFDGKEF 305
>gi|380816818|gb|AFE80283.1| TATA-binding protein-associated factor 2N isoform 2 [Macaca
mulatta]
Length = 561
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 222 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 281
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 282 EATVSFDDPPSAKAAIDWFDGKEF 305
>gi|73966759|ref|XP_548255.2| PREDICTED: TATA-binding protein-associated factor 2N isoform 1
[Canis lupus familiaris]
Length = 571
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 224 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 283
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 284 EATVSFDDPPSAKAAIDWFDGKEF 307
>gi|21327701|ref|NP_631961.1| TATA-binding protein-associated factor 2N isoform 1 [Homo sapiens]
gi|8928305|sp|Q92804.1|RBP56_HUMAN RecName: Full=TATA-binding protein-associated factor 2N; AltName:
Full=68 kDa TATA-binding protein-associated factor;
Short=TAF(II)68; Short=TAFII68; AltName:
Full=RNA-binding protein 56
gi|1613775|gb|AAC50932.1| putative RNA binding protein RBP56 [Homo sapiens]
gi|3763906|dbj|BAA33811.1| RBP56/hTAFII68 [Homo sapiens]
gi|119600531|gb|EAW80125.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa, isoform CRA_b [Homo sapiens]
gi|127797770|gb|AAH46099.2| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa [Homo sapiens]
gi|189054353|dbj|BAG36873.1| unnamed protein product [Homo sapiens]
gi|307686323|dbj|BAJ21092.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa [synthetic construct]
Length = 592
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEF 308
>gi|426348664|ref|XP_004041949.1| PREDICTED: TATA-binding protein-associated factor 2N isoform 2
[Gorilla gorilla gorilla]
Length = 593
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEFH 309
>gi|380791919|gb|AFE67835.1| TATA-binding protein-associated factor 2N isoform 1, partial
[Macaca mulatta]
Length = 453
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEFH 309
>gi|351702187|gb|EHB05106.1| TATA-binding protein-associated factor 2N, partial [Heterocephalus
glaber]
Length = 566
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 222 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 281
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 282 EATVSFDDPPSAKAAIDWFDGKEF 305
>gi|431906874|gb|ELK10995.1| RNA-binding protein FUS [Pteropus alecto]
Length = 496
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 266 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 325
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 326 PPSAKAAIDWFDGKEF 341
>gi|355568429|gb|EHH24710.1| RNA-binding protein 56, partial [Macaca mulatta]
gi|355753929|gb|EHH57894.1| RNA-binding protein 56, partial [Macaca fascicularis]
Length = 590
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 223 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 282
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 283 EATVSFDDPPSAKAAIDWFDGKEF 306
>gi|74226636|dbj|BAE26972.1| unnamed protein product [Mus musculus]
Length = 518
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVAGYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 337 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 368
>gi|109114031|ref|XP_001114501.1| PREDICTED: TATA-binding protein-associated factor 2N-like isoform 2
[Macaca mulatta]
Length = 584
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEF 308
>gi|441677907|ref|XP_003281421.2| PREDICTED: TATA-binding protein-associated factor 2N [Nomascus
leucogenys]
Length = 590
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 232 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 291
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 292 PPSAKAAIDWFDGKEF 307
>gi|268562591|ref|XP_002646698.1| Hypothetical protein CBG13075 [Caenorhabditis briggsae]
Length = 457
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCG-IIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
V+V G+ + +VFS CG I K D + PRIKIY D+ +G KG+ ++T++
Sbjct: 204 VFVQGISTTANEAYIADVFSTCGDIAKND---RGPRIKIYTDRNSGEPKGECMITFVDSA 260
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKF 378
+ A + +G PF P G PM ++ AKF
Sbjct: 261 AAQQAITMYNGQPF-PGGSSPMIISLAKF 288
>gi|4507353|ref|NP_003478.1| TATA-binding protein-associated factor 2N isoform 2 [Homo sapiens]
gi|1628403|emb|CAA67398.1| hTAFII68 [Homo sapiens]
gi|3763907|dbj|BAA33812.1| RBP56/hTAFII68 [Homo sapiens]
gi|27501920|gb|AAO13485.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa [Homo sapiens]
gi|119600530|gb|EAW80124.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa, isoform CRA_a [Homo sapiens]
gi|189053782|dbj|BAG36034.1| unnamed protein product [Homo sapiens]
Length = 589
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 222 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 281
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 282 EATVSFDDPPSAKAAIDWFDGKEF 305
>gi|395748856|ref|XP_003778843.1| PREDICTED: LOW QUALITY PROTEIN: TATA-binding protein-associated
factor 2N [Pongo abelii]
Length = 592
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEF 308
>gi|351738722|gb|AEQ61461.1| Taf15 [Sus scrofa]
Length = 602
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 232 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 291
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 292 PPSAKAAIDWFDGKEF 307
>gi|297462338|ref|XP_871684.3| PREDICTED: TATA-binding protein-associated factor 2N isoform 2 [Bos
taurus]
gi|297486470|ref|XP_002695694.1| PREDICTED: TATA-binding protein-associated factor 2N [Bos taurus]
gi|296476972|tpg|DAA19087.1| TPA: TBP-associated factor 15-like [Bos taurus]
Length = 591
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 224 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 283
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 284 EATVSFDDPPSAKAAIDWFDGKEF 307
>gi|114668017|ref|XP_511417.2| PREDICTED: TATA-binding protein-associated factor 2N [Pan
troglodytes]
Length = 580
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEFH 309
>gi|345311960|ref|XP_001517600.2| PREDICTED: RNA-binding protein FUS-like [Ornithorhynchus anatinus]
Length = 550
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 309 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 368
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 369 PPSAKAAIDWFDGKEF 384
>gi|297291539|ref|XP_001083269.2| PREDICTED: TATA-binding protein-associated factor 2N-like isoform 5
[Macaca mulatta]
Length = 603
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEFH 309
>gi|119600532|gb|EAW80126.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa, isoform CRA_c [Homo sapiens]
Length = 603
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 233 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 292
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 293 PPSAKAAIDWFDGKEFH 309
>gi|119600533|gb|EAW80127.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa, isoform CRA_d [Homo sapiens]
Length = 498
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 230 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 289
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 290 PPSAKAAIDWFDGKEF 305
>gi|393247915|gb|EJD55422.1| splicing factor, CC1-like protein [Auricularia delicata TFB-10046
SS5]
Length = 581
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 108/244 (44%), Gaps = 34/244 (13%)
Query: 276 NKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
N PP +YV L ++T +++ +VF G + + ++ D TG
Sbjct: 341 NLPPGVSAPQGGAMQLYVGSLHFNLTEQDIKQVFEPFGELDF--------VDLHRDPGTG 392
Query: 336 MKKGDALVTYLKEPSVALATQLLDGTPFRPDGK-------------IPMSVTQAKFEQKG 382
KG A V Y + +A + +DG F G+ + + T E G
Sbjct: 393 RSKGYAFVQYKRAEDAKMALEQMDG--FELAGRTLRVNSVNEKGVAVRNTTTIDSLEDSG 450
Query: 383 ERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADE 437
+ +KL +++ + +A+ IP +V+L MF P E +
Sbjct: 451 GGNLNAASRQALMQKLARIDPPK---SSQPEARKHIPQNQSTRSVLLLNMFDPDE-ETEP 506
Query: 438 NLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGR 496
+ +L DV+ EC K GPV ++K+ E QG + V+F+ A+K ++ +NGRWFGGR
Sbjct: 507 DWDKDLADDVKGECASKYGPVTALKI-EKDSQGEIYVQFESVDSAKKAVDSLNGRWFGGR 565
Query: 497 QIHA 500
Q++A
Sbjct: 566 QVNA 569
>gi|395845947|ref|XP_003795678.1| PREDICTED: TATA-binding protein-associated factor 2N [Otolemur
garnettii]
Length = 584
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + G+IK + +T KP I +Y DK+TG KG
Sbjct: 224 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGVIKTNKKTGKPMINLYTDKDTGKPKG 283
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 284 EATVSFDDPPSAKAAIDWFDGKEF 307
>gi|301776494|ref|XP_002923669.1| PREDICTED: TATA-binding protein-associated factor 2N-like
[Ailuropoda melanoleuca]
Length = 571
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 223 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 282
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 283 EATVSFDDPPSAKAAIDWFDGKEF 306
>gi|281352697|gb|EFB28281.1| hypothetical protein PANDA_014202 [Ailuropoda melanoleuca]
Length = 342
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 45/73 (61%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 270 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 329
Query: 348 EPSVALATQLLDG 360
PS A DG
Sbjct: 330 PPSAKAAIDWFDG 342
>gi|344285304|ref|XP_003414402.1| PREDICTED: TATA-binding protein-associated factor 2N [Loxodonta
africana]
Length = 626
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 242 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 301
Query: 340 DALVTYLKEPSVALATQLLDGTPFR 364
+A V++ PS A DG F
Sbjct: 302 EATVSFDDPPSAKAAIDWFDGKEFH 326
>gi|291237133|ref|XP_002738491.1| PREDICTED: GL21338-like [Saccoglossus kowalevskii]
Length = 504
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 46/77 (59%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V+GL VT E++ E F + G+IK D T KP+I +Y++KETG KG+ +TY +
Sbjct: 269 VFVSGLLKTVTEEQLAEHFGQIGVIKNDRRTNKPKIWMYMNKETGEPKGECTITYDDASA 328
Query: 351 VALATQLLDGTPFRPDG 367
A + +G P +G
Sbjct: 329 AKAAIEWFNGKPLAGNG 345
>gi|281341937|gb|EFB17521.1| hypothetical protein PANDA_012845 [Ailuropoda melanoleuca]
Length = 505
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 208 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 267
Query: 340 DALVTYLKEPSVALATQLLDGTPFR 364
+A V++ PS A DG F
Sbjct: 268 EATVSFDDPPSAKAAIDWFDGKEFH 292
>gi|47217983|emb|CAG02266.1| unnamed protein product [Tetraodon nigroviridis]
Length = 254
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 44/76 (57%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL DD TVE + + F + GIIK + +T P I +Y D+ETG KG+A V++
Sbjct: 18 NNTIFVQGLGDDYTVETVADFFKQIGIIKVNKKTGLPMINLYTDRETGKLKGEATVSFDD 77
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 78 PPSAKAAIDWFDGKDF 93
>gi|321473917|gb|EFX84883.1| hypothetical protein DAPPUDRAFT_238350 [Daphnia pulex]
Length = 403
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 51/90 (56%), Gaps = 2/90 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V+GLP+ VT E+ F G+IK D +T KP+I ++ D++TG KG+A +TY +
Sbjct: 38 VFVSGLPESVTETEIETFFGSIGVIKIDKKTGKPKIWLFKDRDTGAPKGEATITYDDPSA 97
Query: 351 VALATQLLDGTPFR--PDGKIPMSVTQAKF 378
A Q G PF P K+ M+ + F
Sbjct: 98 ANAAIQWFHGQPFNNGPVLKVSMATRKNNF 127
>gi|313225840|emb|CBY07314.1| unnamed protein product [Oikopleura dioica]
Length = 335
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 2/91 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V+GLP+ V E + FS G I D +T+ RI ++++KET GD VTY
Sbjct: 161 VFVSGLPNTVRKENISMFFSTAGQIMIDKKTQNERIFLFLNKETKQPVGDCTVTYQDPSH 220
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQK 381
A A ++L+GT F +G P+SV QA EQK
Sbjct: 221 AAAAIEMLNGTDF--NGAGPISVQQATPEQK 249
>gi|313221201|emb|CBY32024.1| unnamed protein product [Oikopleura dioica]
Length = 335
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 2/91 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V+GLP+ V E + FS G I D +T+ RI ++++KET GD VTY
Sbjct: 161 VFVSGLPNTVRKENISMFFSTAGQIMIDKKTQNERIFLFLNKETKQPVGDCTVTYQDPSH 220
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQK 381
A A ++L+GT F +G P+SV QA EQK
Sbjct: 221 AAAAIEMLNGTDF--NGAGPISVQQATPEQK 249
>gi|48374057|ref|NP_001001531.1| RNA-binding protein FUS [Gallus gallus]
gi|47420845|gb|AAT27458.1| FUS/TLS [Gallus gallus]
Length = 504
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 252 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 311
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 312 PPSAKAAIDWFDGKEF 327
>gi|254553492|ref|NP_957377.2| RNA-binding protein FUS [Danio rerio]
Length = 541
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 44/76 (57%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL DD TV+ + + F + GIIK + +T P I +Y D+ETG KG+A V++
Sbjct: 298 NNTIFVQGLGDDYTVDSVADYFKQIGIIKVNKKTGLPMINLYTDRETGKLKGEATVSFDD 357
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 358 PPSAKAAIDWFDGKDF 373
>gi|392597434|gb|EIW86756.1| splicing factor CC1-like protein [Coniophora puteana RWD-64-598
SS2]
Length = 360
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 113/258 (43%), Gaps = 41/258 (15%)
Query: 269 QVEKKEANK--PPDSWFELKVNTH-------VYVTGLPDDVTVEEMVEVFSKCGIIKEDP 319
Q+ + E N+ P D L H +YV L ++T ++ +VF G ++
Sbjct: 106 QLTESERNRLHPGDGNLNLPPGVHAPHGAMQLYVGSLHFNLTEADIKQVFEPFGDLE--- 162
Query: 320 ETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVT----- 374
+ ++ D TG KG A V Y + +A + +DG F G+ T
Sbjct: 163 -----FVDLHRDSTTGRSKGYAFVQYKRPEDAKMALEQMDG--FELAGRTLRVNTVHEKG 215
Query: 375 ------QAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA-----TVI 423
Q E+ G + +KL ++E ++ +IP +V+
Sbjct: 216 TARYTQQDSLEETGGGNLNAASRQALMQKLARIETPT---PAEPVSRPSIPQAMQSRSVL 272
Query: 424 LRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
++ MF P E + + EL DV+ EC K G V+++KV E QG + V+F A+
Sbjct: 273 MKNMFDPEE-ETERDWDKELAEDVKGECQEKYGKVEAIKV-EKETQGEIYVKFATIDSAK 330
Query: 483 KCIELMNGRWFGGRQIHA 500
+ ++ +NGRWFGGRQI A
Sbjct: 331 EAVQALNGRWFGGRQISA 348
>gi|390136092|pdb|2LCW|A Chain A, Solution Structure Of FusTLS RRM DOMAIN
Length = 116
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 7 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 66
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKF 378
PS A DG F + K+ + +A F
Sbjct: 67 PPSAKAAIDWFDGKEFSGNPIKVSFATRRADF 98
>gi|149579399|ref|XP_001519015.1| PREDICTED: TATA-binding protein-associated factor 2N-like
[Ornithorhynchus anatinus]
Length = 536
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL DDV+ +++ + F + GIIK + +T K I +Y DK+TG KG
Sbjct: 223 DSESDNSDNNTIFVQGLGDDVSTDQVGDFFKQIGIIKTNKKTGKLMINLYTDKDTGKPKG 282
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 283 EATVSFDDPPSAKAAIDWFDGKQF 306
>gi|402908219|ref|XP_003916850.1| PREDICTED: uncharacterized protein LOC100999207 [Papio anubis]
Length = 686
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 444 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 503
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 504 PPSAKAAIDWFDGKEF 519
>gi|13874546|dbj|BAB46889.1| hypothetical protein [Macaca fascicularis]
Length = 397
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 48/84 (57%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
DS + N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG
Sbjct: 134 DSESDNSDNNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKG 193
Query: 340 DALVTYLKEPSVALATQLLDGTPF 363
+A V++ PS A DG F
Sbjct: 194 EATVSFDDPPSAKAAIDWFDGKEF 217
>gi|395536039|ref|XP_003770028.1| PREDICTED: TATA-binding protein-associated factor 2N [Sarcophilus
harrisii]
Length = 547
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ + F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 231 NNTIFVQGLGEGVSTDQVGDFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 290
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 291 PPSAKAAIDWFDGKEF 306
>gi|350590618|ref|XP_003131769.3| PREDICTED: TATA-binding protein-associated factor 2N-like, partial
[Sus scrofa]
Length = 506
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 136 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 195
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 196 PPSAKAAIDWFDGKEF 211
>gi|195108089|ref|XP_001998625.1| GI24076 [Drosophila mojavensis]
gi|193915219|gb|EDW14086.1| GI24076 [Drosophila mojavensis]
Length = 395
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 71/132 (53%), Gaps = 10/132 (7%)
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
F L+++T ++V G+ +VT +++E F K G IK D T KP+I +Y +K TG KG+A
Sbjct: 176 FVLRLDT-IFVLGMRLNVTKNDIIEFFGKMGRIKMDELTMKPKIFVYKNKLTGRSKGEAT 234
Query: 343 VTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ---KGERFIAKQ---VDSKKKK 396
+TY + A LDG F G++ +SV A KG RF Q ++ ++++
Sbjct: 235 ITYTSPYAAQAAIHYLDGAKFL--GQV-LSVVPAYLSTQPGKGVRFSYAQETGIEQQRRQ 291
Query: 397 KLKKVEEKMLGW 408
+ KK + W
Sbjct: 292 RQKKWKPAFDNW 303
>gi|148683762|gb|EDL15709.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 644
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 231 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 290
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 291 PPSAKAAIDWFDGKEFH 307
>gi|365758860|gb|EHN00685.1| Cus2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 172
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 67/105 (63%), Gaps = 6/105 (5%)
Query: 291 VYVTGLPDD-VTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
VY++GLP D +T +E+ + FSK G I+ + + ++P K+YV+ E G+ KGDAL+ Y KE
Sbjct: 47 VYISGLPTDRITKDELTKQFSKYGKIRINRD-EEPLCKLYVNDE-GVPKGDALIIYSKEE 104
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKK 394
SV LA +++D + F + + V +A+FE +G + K+ S K
Sbjct: 105 SVTLAVEMMDESIFLGN---KIRVEKAQFEDRGSENMGKKESSSK 146
>gi|121582458|ref|NP_001073442.1| TATA-binding protein-associated factor 2N [Danio rerio]
gi|118763905|gb|AAI28852.1| Zgc:158363 [Danio rerio]
Length = 434
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 53/92 (57%), Gaps = 3/92 (3%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V GL +DV +E+ + F + GIIK + +T KP I +Y DK TG KG+A V++ PS
Sbjct: 210 IFVQGLGEDVNAQEVGDYFKQIGIIKVNKKTGKPMINLYSDKATGRLKGEATVSFDDPPS 269
Query: 351 VALATQLLDGTPF--RPDGKIPMSVTQAKFEQ 380
A DG F RP K+ + +A+F Q
Sbjct: 270 AKAAIDWFDGKEFNGRPI-KVSFATRRAEFTQ 300
>gi|52345516|ref|NP_001004806.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 68kDa [Xenopus (Silurana) tropicalis]
gi|49257736|gb|AAH74568.1| MGC69517 protein [Xenopus (Silurana) tropicalis]
Length = 501
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 45/73 (61%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ +D T +++ + F + GIIK + +T KP I +Y DKETG KG+A V++ PS
Sbjct: 293 IFVQGMGEDATQDQISDYFKQIGIIKINKKTGKPMINLYTDKETGKSKGEATVSFDDPPS 352
Query: 351 VALATQLLDGTPF 363
A + DG F
Sbjct: 353 AKAAIEWFDGKMF 365
>gi|195170986|ref|XP_002026292.1| GL24582 [Drosophila persimilis]
gi|194111187|gb|EDW33230.1| GL24582 [Drosophila persimilis]
Length = 232
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 81/160 (50%), Gaps = 7/160 (4%)
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
SV LA ++LD R + V +A+F+ +GE A + KKK K K + K +
Sbjct: 29 SVHLALKILDDYDLRGH---KIRVQRAQFQMRGEYNPALKPKRKKKDKEKMQKMKEKLFD 85
Query: 410 GRDDA----KLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
R D + TVIL+ +FTP + L E + +++EEC K G V V + +
Sbjct: 86 WRPDKMRGERSKHEKTVILKNLFTPELFEKEVELILEYQNNLREECGKCGMVRKVVIYDR 145
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
HP+G+ + ++A I++M GR+FG RQ+ A DG
Sbjct: 146 HPEGIAQINMSSPEEADLVIQMMQGRFFGQRQLTADHWDG 185
>gi|431890896|gb|ELK01775.1| TATA-binding protein-associated factor 2N [Pteropus alecto]
Length = 729
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 258 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 317
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 318 PPSAKAAIDWFDGKEFH 334
>gi|308502470|ref|XP_003113419.1| hypothetical protein CRE_26445 [Caenorhabditis remanei]
gi|308263378|gb|EFP07331.1| hypothetical protein CRE_26445 [Caenorhabditis remanei]
Length = 557
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 3/88 (3%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ + +VFS CG I ++ + PRIKIY D+ TG KG+ ++T++ +
Sbjct: 311 VFVQGISTTANEAYIADVFSTCGDIAKN--ERGPRIKIYTDRNTGEPKGECMITFIDAAA 368
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKF 378
A + +G PF P G PM+++ AKF
Sbjct: 369 AQQAITMYNGQPF-PGGSSPMNISLAKF 395
>gi|85001331|ref|XP_955384.1| RNA splicing factor [Theileria annulata strain Ankara]
gi|65303530|emb|CAI75908.1| RNA splicing factor, putative [Theileria annulata]
Length = 643
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%)
Query: 419 PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDR 478
+ ++L M+T A+ + E+E DV+EEC K G V V V +P G V V+FK+
Sbjct: 547 SSNLVLSNMYTSADYADNHEFFDEIEEDVKEECGKYGTVVQVFVNRRNPDGKVYVKFKNN 606
Query: 479 KDAQKCIELMNGRWFGGRQIHAS 501
DAQ + + GR+F G I S
Sbjct: 607 DDAQSANKSLQGRYFAGNTIQVS 629
>gi|323307587|gb|EGA60856.1| Cus2p [Saccharomyces cerevisiae FostersO]
Length = 163
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 75/124 (60%), Gaps = 14/124 (11%)
Query: 288 NTHVYVTGLPDDVTVEE-MVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
NT +Y++GLP D T +E + E F K G+I+ + + + P K+YV+ + G KGDAL+TY
Sbjct: 44 NTSIYISGLPTDKTTKEGLTEQFCKYGMIRTNRDGE-PLCKLYVN-DKGAFKGDALITYS 101
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQK-GERFIAKQVDSKK-------KKKL 398
KE SV LA ++++ + F GK + V +A+F+ K G+ K+ D K+ KKL
Sbjct: 102 KEESVTLAIEMMNESIFL--GK-QIRVERAQFQNKEGDNMHGKENDLKEFNGPEPPIKKL 158
Query: 399 KKVE 402
KK E
Sbjct: 159 KKSE 162
>gi|307170581|gb|EFN62775.1| RNA-binding protein cabeza [Camponotus floridanus]
Length = 257
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+V F G+IK+D T KP+I +Y DK TG KG+A VTY + +
Sbjct: 8 IFVSGMDPSISEEEIVSHFGAIGVIKQDKRTGKPKIWMYKDKNTGKSKGEATVTYDDQNA 67
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF 385
A DG F+ I + + Q K +G R
Sbjct: 68 ARSAINWFDGKDFK-GCTIKVQIAQHKSSWQGNRM 101
>gi|402912957|ref|XP_003919001.1| PREDICTED: HIV Tat-specific factor 1 homolog, partial [Papio
anubis]
Length = 119
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 52/86 (60%), Gaps = 2/86 (2%)
Query: 254 VVEEKHNGKRKQPDKQVEKKEANKPPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSK 311
EE K +P +K E K WF ++ NT+VYV+GLP D+TV+E +++ SK
Sbjct: 34 TAEEPPQEKASEPTDPRKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSK 93
Query: 312 CGIIKEDPETKKPRIKIYVDKETGMK 337
GII DP+T++ ++K+Y D + +K
Sbjct: 94 FGIIMRDPQTEEFKVKLYKDNQGNLK 119
>gi|71026268|ref|XP_762815.1| splicing factor [Theileria parva strain Muguga]
gi|68349767|gb|EAN30532.1| splicing factor, putative [Theileria parva]
Length = 644
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%)
Query: 420 ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRK 479
+ ++L M+T A+ + E+E DV+EEC K G V V V + +P G V V+FK+
Sbjct: 549 SNLVLSNMYTSADYEDNREFFDEIEEDVKEECGKYGTVIQVFVNKRNPDGKVYVKFKNND 608
Query: 480 DAQKCIELMNGRWFGGRQIHAS 501
DAQ + + GR+F G I S
Sbjct: 609 DAQAANKSLQGRYFAGNTIQVS 630
>gi|403275313|ref|XP_003945352.1| PREDICTED: LOW QUALITY PROTEIN: TATA-binding protein-associated
factor 2N [Saimiri boliviensis boliviensis]
Length = 606
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 183 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 242
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 243 PPSAKAAIDWFDGKEFH 259
>gi|332138053|pdb|2LA6|A Chain A, Solution Nmr Structure Of Rrm Domain Of Rna-Binding
Protein Fus From Homo Sapiens, Northeast Structural
Genomics Consortium Target Hr6430a
Length = 99
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 52/89 (58%), Gaps = 3/89 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 13 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 72
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQA 376
PS A DG F + P+ V+ A
Sbjct: 73 PPSAKAAIDWFDGKEFSGN---PIKVSFA 98
>gi|344258800|gb|EGW14904.1| TATA-binding protein-associated factor 2N [Cricetulus griseus]
Length = 389
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 36 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 95
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 96 PPSAKAAIDWFDGKEF 111
>gi|336389603|gb|EGO30746.1| hypothetical protein SERLADRAFT_455043 [Serpula lacrymans var.
lacrymans S7.9]
Length = 583
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 103/227 (45%), Gaps = 32/227 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 360 LYVGSLHFNLTESDIKQVFEPFGELEF--------VDLHRDPMTGRSKGYAFVQYKRAED 411
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A + ++G F G+ ++ +KG A+Q +D L + L
Sbjct: 412 ARMALEQMEG--FELAGR---TLRVNTVHEKGTARYAQQDTLDEAGGGNLNAASRQALMQ 466
Query: 409 G---------GRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEECV-K 453
+ K TIP +V+L+ MF P E + + +L DV+ EC K
Sbjct: 467 KLARIEPIPKPPTNNKPTIPQAMQSRSVLLKNMFDPEE-ETERDWDKDLAEDVKGECEDK 525
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
G VD++KV E QG + V+F A+ I+ +NGRWFGGRQ+ A
Sbjct: 526 YGQVDAIKV-EQETQGKIYVKFNSIDSAKNAIQGLNGRWFGGRQVSA 571
>gi|336376609|gb|EGO04944.1| hypothetical protein SERLA73DRAFT_174031 [Serpula lacrymans var.
lacrymans S7.3]
Length = 583
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 103/227 (45%), Gaps = 32/227 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 360 LYVGSLHFNLTESDIKQVFEPFGELEF--------VDLHRDPMTGRSKGYAFVQYKRAED 411
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A + ++G F G+ ++ +KG A+Q +D L + L
Sbjct: 412 ARMALEQMEG--FELAGR---TLRVNTVHEKGTARYAQQDTLDEAGGGNLNAASRQALMQ 466
Query: 409 G---------GRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEECV-K 453
+ K TIP +V+L+ MF P E + + +L DV+ EC K
Sbjct: 467 KLARIEPIPKPPTNNKPTIPQAMQSRSVLLKNMFDPEE-ETERDWDKDLAEDVKGECEDK 525
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
G VD++KV E QG + V+F A+ I+ +NGRWFGGRQ+ A
Sbjct: 526 YGQVDAIKV-EQETQGEIYVKFNSIDSAKNAIQGLNGRWFGGRQVSA 571
>gi|194385898|dbj|BAG65324.1| unnamed protein product [Homo sapiens]
Length = 395
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 36 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 95
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 96 PPSAKAAIDWFDGKEF 111
>gi|157130748|ref|XP_001661993.1| hypothetical protein AaeL_AAEL011851 [Aedes aegypti]
gi|108871805|gb|EAT36030.1| AAEL011851-PA [Aedes aegypti]
Length = 298
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 86/164 (52%), Gaps = 15/164 (9%)
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEEKMLG 407
SV LA +LD R + V QA+F+ KGE + +V K+K+K++K++E +
Sbjct: 25 SVELALNILDNYDIRGHK---IRVQQAEFQLKGEYNPALKPKVKKKEKEKIRKMQEALFD 81
Query: 408 W------GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVK 461
W G R + VI++ +F P + +L E + D++EEC K G V
Sbjct: 82 WRPEKMRGERSKHE----RIVIIKNLFEPELFDREVHLLLEYQNDLREECNKCGICRRVV 137
Query: 462 VCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
+ + HP+GV + D ++A ++LMNGR+FG R++ A DG
Sbjct: 138 LFDRHPEGVAQITMSDPEEADLVVKLMNGRFFGKRKLTAEIWDG 181
>gi|351711364|gb|EHB14283.1| Tripartite motif-containing protein 72 [Heterocephalus glaber]
Length = 649
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 45/73 (61%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 270 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 329
Query: 348 EPSVALATQLLDG 360
PS A DG
Sbjct: 330 PPSAKAAIDWFDG 342
>gi|45709123|gb|AAH67661.1| Ewsr1b protein [Danio rerio]
Length = 575
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL ++ T+ EM E F G I+ + +P I IY DK++G KGDA ++Y +
Sbjct: 307 NSTIYITGLTENATLPEMAEFFKHTGAIRINRRLNQPAINIYTDKDSGKPKGDATLSY-E 365
Query: 348 EPSVA-LATQLLDGTPFR 364
EP+ A A + DG F+
Sbjct: 366 EPAFAKAAVEHFDGKEFQ 383
>gi|384498450|gb|EIE88941.1| hypothetical protein RO3G_13652 [Rhizopus delemar RA 99-880]
Length = 454
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 2/91 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
V+L MF P E D + S+LEAD++ EC K G V+ +KV + G V ++F A
Sbjct: 366 VMLNNMFNPNE-ETDPDWVSDLEADIKIECEKYGRVEHIKVNSDS-MGEVFLKFDRVGSA 423
Query: 482 QKCIELMNGRWFGGRQIHASEDDGLVNHAAI 512
+K I +NGRWFGG+QI A+ + +A I
Sbjct: 424 EKAISALNGRWFGGKQITAACISDAIYNANI 454
>gi|47086775|ref|NP_997795.1| Ewing sarcoma breakpoint region 1b [Danio rerio]
gi|27881957|gb|AAH44518.1| Ewing sarcoma breakpoint region 1b [Danio rerio]
Length = 578
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL ++ T+ EM E F G I+ + +P I IY DK++G KGDA ++Y +
Sbjct: 310 NSTIYITGLTENATLPEMAEFFKHTGAIRINRRLNQPAINIYTDKDSGKPKGDATLSY-E 368
Query: 348 EPSVA-LATQLLDGTPFR 364
EP+ A A + DG F+
Sbjct: 369 EPAFAKAAVEHFDGKEFQ 386
>gi|67678073|gb|AAH97019.1| Ewing sarcoma breakpoint region 1b [Danio rerio]
Length = 579
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL ++ T+ EM E F G I+ + +P I IY DK++G KGDA ++Y +
Sbjct: 311 NSTIYITGLTENATLPEMAEFFKHTGAIRINRRLNQPAINIYTDKDSGKPKGDATLSY-E 369
Query: 348 EPSVA-LATQLLDGTPFR 364
EP+ A A + DG F+
Sbjct: 370 EPAFAKAAVEHFDGKEFQ 387
>gi|47207611|emb|CAF91750.1| unnamed protein product [Tetraodon nigroviridis]
Length = 512
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 45/74 (60%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL +DVT++E+ + F + GIIK + +T P I +Y DK TG KG+A V++
Sbjct: 181 NNTIFVQGLAEDVTLQEVGDYFKQIGIIKVNKKTGLPMINLYSDKATGQPKGEATVSFDD 240
Query: 348 EPSVALATQLLDGT 361
PS A DG+
Sbjct: 241 PPSAKAAIDWFDGS 254
>gi|298708685|emb|CBJ26172.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 257
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 1/81 (1%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ-GVVLVRFKDRK 479
V+LR ++T E AD + +LEAD+ ECVK G V SV E G V++ FK
Sbjct: 103 VVVLRSVYTREESEADPHFFDDLEADMLMECVKFGTVISVATPEEPGYFGCVVITFKTSD 162
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
A C E M+GRWF RQI A
Sbjct: 163 GANACAEAMHGRWFDQRQIEA 183
>gi|395334381|gb|EJF66757.1| splicing factor CC1-like protein [Dichomitus squalens LYAD-421 SS1]
Length = 624
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 105/231 (45%), Gaps = 37/231 (16%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L +++ ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 400 LYVGSLHFNLSESDIKQVFEPFGELEF--------VDLHRDPVTGRSKGYAFVQYKRAED 451
Query: 351 VALATQLLDGTPFRPDGKIPMSVT-----QAKFEQKGE---------RFIAKQVDSKKKK 396
+A + +DG F G+ T A++ Q+ ++Q +K
Sbjct: 452 AKMALEQMDG--FELAGRTLRVNTVHEKGSARYTQQDSLDEAGGGNLNAASRQALMQKLA 509
Query: 397 KLKKVEEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADENLRSELEADVQEEC 451
++ KM A+ IP T V+++ MF P E + + +L DV+ EC
Sbjct: 510 RIDPTPAKMEPI-----ARPNIPQTMQSRSVLMKNMFNPEE-ETERDWDKDLAEDVKGEC 563
Query: 452 -VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K G V ++KV E QG + V+F+ A+ IE +NGRWFGGRQI A+
Sbjct: 564 ESKYGRVLAIKV-EKESQGEIYVKFETVDAAKNAIEGLNGRWFGGRQITAA 613
>gi|307194675|gb|EFN76934.1| RNA-binding protein cabeza [Harpegnathos saltator]
Length = 274
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F G+IK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 41 IFVSGMDPSISEEEICQHFGAIGVIKHDKRTGKPKVWLYKDKNTGKSKGEATVTYDDQNA 100
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF 385
A DG F+ + I + + Q K +G R
Sbjct: 101 ARSAISWFDGKEFK-NCTIKVQMAQHKSNWQGNRM 134
>gi|380016137|ref|XP_003692045.1| PREDICTED: uncharacterized protein LOC100866765 [Apis florea]
Length = 420
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 179 IFVSGMDPSISEEEICQHFGAIGIIKHDKRTGKPKVWMYKDKNTGKSKGEATVTYDDQNA 238
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKG 382
A DG F+ KI + + Q K +G
Sbjct: 239 ARSAIDWFDGKEFKGK-KIRVQIAQHKSNWQG 269
>gi|417410243|gb|JAA51598.1| Putative dosage compensation complex subunit mle, partial [Desmodus
rotundus]
Length = 381
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 20 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 79
Query: 348 EPSVALATQLLDGTPF 363
PS A DG F
Sbjct: 80 PPSAKAAIDWFDGKEF 95
>gi|328786898|ref|XP_624681.2| PREDICTED: hypothetical protein LOC552303 [Apis mellifera]
Length = 419
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 178 IFVSGMDPSISEEEICQHFGAIGIIKHDKRTGKPKVWMYKDKNTGKSKGEATVTYDDQNA 237
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKG 382
A DG F+ KI + + Q K +G
Sbjct: 238 ARSAIDWFDGKEFKGK-KIRVQIAQHKSNWQG 268
>gi|299755304|ref|XP_002912089.1| hypothetical protein CC1G_13622 [Coprinopsis cinerea okayama7#130]
gi|298411164|gb|EFI28595.1| hypothetical protein CC1G_13622 [Coprinopsis cinerea okayama7#130]
Length = 580
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 104/227 (45%), Gaps = 30/227 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 357 LYVGSLHFNLTESDIKQVFEPFGELE--------FVDLHKDPMTGRSKGYAFVQYKRAED 408
Query: 351 VALATQLLDG----------TPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKK 400
+A + ++G G + + T + + G A + +K L +
Sbjct: 409 ARMALEQMEGFELAGRTLRVNTVHEKGSVRYTQTDSLDDSGGANLNAASRQALMQK-LAR 467
Query: 401 VEEKMLGWGGRDDAKLTIPA-----TVILRFMFTPAEMRADENLRSELEADVQEEC-VKI 454
E+ ++ + K IP +V+L+ MF P E ++N +L DV+ EC K
Sbjct: 468 TEQPVVPA---EPVKPIIPQAMQSRSVLLKNMFNPEE-ETEQNWDKDLADDVKGECENKY 523
Query: 455 GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
G V ++KV E QG + V+F A+ ++ +NGRWFGGRQI A+
Sbjct: 524 GKVLAIKV-EKDSQGEIYVKFDTVDTAKSAVQGLNGRWFGGRQISAN 569
>gi|198424271|ref|XP_002130882.1| PREDICTED: similar to fusion (involved in t(12;16) in malignant
liposarcoma) [Ciona intestinalis]
Length = 314
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 41/69 (59%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ ++T E + E F + GIIK+D T++PRI +Y+DKE KG+ VTY PS
Sbjct: 137 IFVQGMGQNITKESIAEYFGQIGIIKKDKRTQEPRIYVYMDKEINQPKGECTVTYDDPPS 196
Query: 351 VALATQLLD 359
A +
Sbjct: 197 AQAAINWFN 205
>gi|302790311|ref|XP_002976923.1| hypothetical protein SELMODRAFT_151471 [Selaginella moellendorffii]
gi|302797811|ref|XP_002980666.1| hypothetical protein SELMODRAFT_178258 [Selaginella moellendorffii]
gi|300151672|gb|EFJ18317.1| hypothetical protein SELMODRAFT_178258 [Selaginella moellendorffii]
gi|300155401|gb|EFJ22033.1| hypothetical protein SELMODRAFT_151471 [Selaginella moellendorffii]
Length = 532
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 106/255 (41%), Gaps = 55/255 (21%)
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
+YV L ++T +++ +VF G+I+ +++ +D+ETG+ KG V Y K
Sbjct: 273 RLYVGNLHVNITEDQLRQVFEPFGVIE--------LVQLPLDQETGLCKGYGFVQYSKLE 324
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD------------SKKKKK 397
A Q L+G G+ + V+ +Q G+ Q D ++ +
Sbjct: 325 DARAAQQGLNGI-LELAGRA-IKVSAVTDQQTGQDIGTTQNDLDDDEGGGLALNARSRAL 382
Query: 398 LKKVEEKMLGWGG----RDDAKLTIPAT---------------------------VILRF 426
L + ++ G G + A + +P ++L+
Sbjct: 383 LMQKLDRSGGVGAPASLQPQAPMAVPPVYGGIGSALSMQNGSVLPEEPIGPPSECLLLKN 442
Query: 427 MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIE 486
MF PA D +++ DVQ+EC K GPV + V + + QG V VRF DA
Sbjct: 443 MFDPA-TETDPEFDIDIKNDVQDECSKFGPVKHISV-DKYSQGHVYVRFGTAIDALAAQL 500
Query: 487 LMNGRWFGGRQIHAS 501
+N RWF G+ I A+
Sbjct: 501 NLNKRWFAGKTIRAT 515
>gi|432873995|ref|XP_004072420.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein EWS-like
[Oryzias latipes]
Length = 612
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL + +EEM E F G I+ + +P I IY DKE+G KGDA ++Y +
Sbjct: 315 NSTIYITGLTEKANLEEMAEFFKHVGPIRMNRRLGQPAINIYTDKESGKPKGDATLSYEE 374
Query: 348 EPSVALATQLLDGTPFR 364
A + DG F+
Sbjct: 375 PICAKAAVEHFDGKEFQ 391
>gi|430811846|emb|CCJ30702.1| unnamed protein product [Pneumocystis jirovecii]
Length = 486
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 49/81 (60%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L+ MF P E D +R ELE DV+ EC K G V + V EN P G V ++F +
Sbjct: 398 VLLKNMFNPQEESGDNWIR-ELEDDVKAECENKYGKVLHIHVEENSP-GDVYIKFDNVVA 455
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
++ I+ +NGRWFGGR I AS
Sbjct: 456 GERAIQGLNGRWFGGRTISAS 476
>gi|426201409|gb|EKV51332.1| hypothetical protein AGABI2DRAFT_189584 [Agaricus bisporus var.
bisporus H97]
Length = 563
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 101/228 (44%), Gaps = 33/228 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 339 LYVGSLHFNLTESDIKQVFEVFGELE--------FVDLHRDAMTGRSKGYAFVQYKRAED 390
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A Q ++G F G+ ++ +KG +Q +D L + L
Sbjct: 391 ARMALQQMEG--FELAGR---TLRVNTVHEKGTTKYTQQDSLDESGGGNLNAASRQALMQ 445
Query: 409 G-GRDDAKLTIP--------------ATVILRFMFTPAEMRADENLRSELEADVQEEC-V 452
R D P +V+L+ MF P E +++ EL DV+ EC
Sbjct: 446 KLARTDQPAPRPEPVQRPNIPQAMQSRSVLLKNMFDPDE-ETEKDWDRELAQDVKGECES 504
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G V ++KV E QG + V+F AQK I+ +NGRWFGGRQ+ A
Sbjct: 505 KYGKVLAIKV-EKDSQGEIYVKFDSIDYAQKAIQGLNGRWFGGRQVSA 551
>gi|409083550|gb|EKM83907.1| hypothetical protein AGABI1DRAFT_110515 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 563
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 101/228 (44%), Gaps = 33/228 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 339 LYVGSLHFNLTESDIKQVFEVFGELE--------FVDLHRDAMTGRSKGYAFVQYKRAED 390
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A Q ++G F G+ ++ +KG +Q +D L + L
Sbjct: 391 ARMALQQMEG--FELAGR---TLRVNTVHEKGTTKYTQQDSLDESGGGNLNAASRQALMQ 445
Query: 409 G-GRDDAKLTIP--------------ATVILRFMFTPAEMRADENLRSELEADVQEEC-V 452
R D P +V+L+ MF P E +++ EL DV+ EC
Sbjct: 446 KLARTDQPAPRPEPVQRPNIPQAMQSRSVLLKNMFDPDE-ETEKDWDRELAQDVKGECES 504
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G V ++KV E QG + V+F AQK I+ +NGRWFGGRQ+ A
Sbjct: 505 KYGKVLAIKV-EKDSQGEIYVKFDSIDYAQKAIQGLNGRWFGGRQVSA 551
>gi|238601653|ref|XP_002395467.1| hypothetical protein MPER_04477 [Moniliophthora perniciosa FA553]
gi|215466258|gb|EEB96397.1| hypothetical protein MPER_04477 [Moniliophthora perniciosa FA553]
Length = 180
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 82/175 (46%), Gaps = 14/175 (8%)
Query: 334 TGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSK 393
TG KG A V Y + +A + ++G F G+ TQ + G +
Sbjct: 2 TGRSKGYAFVQYKRAEHAKMALEQMEG--FELAGRTLRVNTQDSLDDSGGGNLNAASRQA 59
Query: 394 KKKKLKKVEEKMLGWGGRDDAKLTIPA------TVILRFMFTPAEMRADENLRSELEADV 447
+KL ++E D IP +V+++ MF P E + + EL DV
Sbjct: 60 LMQKLARIEPPP---SLPDPVTPNIPTQVMQSRSVLMKNMFDPGE-ETERDWDKELADDV 115
Query: 448 QEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ EC K G V ++KV E QG + V+F + A+K ++ +NGRWFGG+Q+ A+
Sbjct: 116 KGECGEKYGKVTAIKV-EKETQGEIYVKFDSIESAKKAVQGLNGRWFGGKQVTAT 169
>gi|452820290|gb|EME27334.1| RNA-binding protein [Galdieria sulphuraria]
Length = 596
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 52/82 (63%), Gaps = 4/82 (4%)
Query: 419 PAT-VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFK 476
P+T ++LR MF PA+ D N E++ DV++EC+ K GP+ + V +N G+V V+F+
Sbjct: 493 PSTCLMLRNMFDPAQ-ETDPNFHLEVQEDVRDECISKFGPLRHIFVDKNS-AGLVYVQFE 550
Query: 477 DRKDAQKCIELMNGRWFGGRQI 498
DA K + ++GRWF G Q+
Sbjct: 551 TMSDAMKAKQGLHGRWFAGHQV 572
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 118/262 (45%), Gaps = 53/262 (20%)
Query: 276 NKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
N PD +L+ ++V L VT +++ FS G +++ I++ +DK +G
Sbjct: 176 NSFPDYERDLRT---IFVWQLAQKVTEKDVYNFFSAAGKVRD--------IRMIIDKRSG 224
Query: 336 MKKGDALV-TYLKE--PS-VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD 391
KG A V Y KE PS + LA Q L G P + + ++A E+ IA ++
Sbjct: 225 RHKGAAYVEFYYKEAIPSAMRLAGQQLCGYP------VAIKPSEA------EKNIAAEMA 272
Query: 392 SKKKKKLKKVEEKMLG-W-GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQE 449
+++ ++ L W GG D + + P T ++ + ++++LR+ E
Sbjct: 273 AREAAAAQQARLAELEEWSGGGDTSPNSNPLTFTKLYVGSIHFSISEDDLRTIFEP---- 328
Query: 450 ECVKIGPVDSVKVCEN----HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDG 505
G V S+++ ++ +G V++K+ +DA+K E +NG GR + G
Sbjct: 329 ----FGEVISLQLHKDPETGRSRGFGFVQYKNHEDAKKAFEQLNGLDLAGRPLKV----G 380
Query: 506 LVNHAAIRDLDAEASRLEQFGA 527
L AEA +L+ GA
Sbjct: 381 LAT--------AEAQKLQVLGA 394
>gi|322792857|gb|EFZ16690.1| hypothetical protein SINV_10530 [Solenopsis invicta]
Length = 443
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK+D T KP+I +Y DK TG KG+A VTY + +
Sbjct: 194 IFVSGMDPSISEEEICQHFGAIGIIKQDKRTGKPKIWMYKDKNTGKPKGEATVTYDDQNA 253
Query: 351 VALATQLLDGTPFR 364
A DG F+
Sbjct: 254 ARSAISWFDGKDFK 267
>gi|45184618|gb|AAS55561.1| CG14718 [Drosophila mauritiana]
Length = 202
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +M+ F K GIIK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 61 VFVLGMRLNVTKNDMILFFGKVGIIKMDESTNKPKIFVYKNKITGRSKGEATITYVSPFS 120
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 121 AQAAISCLSGAKF 133
>gi|414588678|tpg|DAA39249.1| TPA: hypothetical protein ZEAMMB73_120405, partial [Zea mays]
Length = 61
Score = 61.2 bits (147), Expect = 1e-06, Method: Composition-based stats.
Identities = 28/38 (73%), Positives = 31/38 (81%)
Query: 493 FGGRQIHASEDDGLVNHAAIRDLDAEASRLEQFGAELE 530
F G+QIHASEDDG VNH IRD DAE SRL++FG ELE
Sbjct: 21 FAGQQIHASEDDGSVNHTLIRDYDAEVSRLDRFGEELE 58
>gi|340721842|ref|XP_003399323.1| PREDICTED: hypothetical protein LOC100645958 [Bombus terrestris]
Length = 417
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 43/74 (58%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 178 IFVSGMDPSISEEEICQHFGAIGIIKHDKRTGKPKVWMYKDKNTGKSKGEATVTYDDQNA 237
Query: 351 VALATQLLDGTPFR 364
A DG F+
Sbjct: 238 ARSAIDWFDGKEFK 251
>gi|350408037|ref|XP_003488281.1| PREDICTED: hypothetical protein LOC100744767 [Bombus impatiens]
Length = 418
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 43/74 (58%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 179 IFVSGMDPSISEEEICQHFGAIGIIKNDKRTGKPKVWMYKDKNTGKSKGEATVTYDDQNA 238
Query: 351 VALATQLLDGTPFR 364
A DG F+
Sbjct: 239 ARSAIDWFDGKEFK 252
>gi|194744403|ref|XP_001954684.1| GF18392 [Drosophila ananassae]
gi|190627721|gb|EDV43245.1| GF18392 [Drosophila ananassae]
Length = 505
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 44/73 (60%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K G+IK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 282 VFVLGMRLNVTKNDIIMFFGKLGLIKMDESTSKPKIFVYKNKLTGRSKGEATITYVSPYS 341
Query: 351 VALATQLLDGTPF 363
A L+G+ F
Sbjct: 342 AQAAIACLNGSKF 354
>gi|170083917|ref|XP_001873182.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164650734|gb|EDR14974.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 448
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/229 (27%), Positives = 106/229 (46%), Gaps = 33/229 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 224 LYVGSLHFNLTESDIKQVFEPFGELE--------FVDLHRDPMTGRSKGYAFVQYKRSED 275
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A + ++G F G+ ++ +KG +Q +D L + L
Sbjct: 276 ARMALEQMEG--FELAGR---TLRVNTVHEKGTARYTQQDSLDEAGGGNLNAASRQALMQ 330
Query: 409 G-GRDDAKLT---------IPA-----TVILRFMFTPAEMRADENLRSELEADVQEEC-V 452
R +A T IP +V+L+ MF P E + + EL DV+ EC
Sbjct: 331 KLARTEAPPTFTEPVARPNIPQAMQSRSVLLKNMFDPEE-ETERDWDKELADDVKVECEN 389
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K G V+++KV E QG + ++F + A++ I+ +NGRWFGGRQ+ A+
Sbjct: 390 KYGKVEAIKV-ERETQGEIYLKFDSIESAKQAIQGLNGRWFGGRQVSAA 437
>gi|348528518|ref|XP_003451764.1| PREDICTED: RNA-binding protein EWS-like [Oreochromis niloticus]
Length = 606
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 43/77 (55%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +Y+TGL + +EEM E F G I+ + +P I IY DK++G KGDA ++Y +
Sbjct: 322 NSTIYITGLTEKANLEEMAEFFKHVGPIRINRRLGQPAINIYTDKDSGKPKGDATLSYEE 381
Query: 348 EPSVALATQLLDGTPFR 364
A + DG F+
Sbjct: 382 PICAKAAVEHFDGKEFQ 398
>gi|195571669|ref|XP_002103825.1| GD18777 [Drosophila simulans]
gi|194199752|gb|EDX13328.1| GD18777 [Drosophila simulans]
Length = 445
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K GIIK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 222 VFVLGMRLNVTKNDIILFFGKVGIIKMDESTNKPKIFVYKNKITGRSKGEATITYVSPFS 281
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 282 AQAAISCLSGVKF 294
>gi|383864419|ref|XP_003707676.1| PREDICTED: uncharacterized protein LOC100879389 [Megachile
rotundata]
Length = 417
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 43/74 (58%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 178 IFVSGMDPSISEEEICQHFGAIGIIKHDKRTGKPKVWMYKDKNTGKSKGEATVTYDDQNA 237
Query: 351 VALATQLLDGTPFR 364
A DG F+
Sbjct: 238 ARSAIDWFDGKEFK 251
>gi|195329764|ref|XP_002031580.1| GM23972 [Drosophila sechellia]
gi|194120523|gb|EDW42566.1| GM23972 [Drosophila sechellia]
Length = 436
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K GIIK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 213 VFVLGMRLNVTKNDIILFFGKVGIIKMDESTNKPKIFVYKNKITGRSKGEATITYVSPFS 272
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 273 AQAAISCLSGAKF 285
>gi|157786842|ref|NP_001099294.1| TATA-binding protein-associated factor 2N [Rattus norvegicus]
gi|149053669|gb|EDM05486.1| TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated
factor (predicted), isoform CRA_a [Rattus norvegicus]
Length = 394
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL + V+ +++ E F + GIIK + +T KP I +Y DK+TG KG+A V++
Sbjct: 173 NNTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDD 232
Query: 348 EPSVALATQLLDGTPFR 364
PS A DG F
Sbjct: 233 PPSAKAAIDWFDGKEFH 249
>gi|345490315|ref|XP_001606397.2| PREDICTED: hypothetical protein LOC100122794 [Nasonia vitripennis]
Length = 481
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 43/74 (58%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ +T EE+ + F G+IK D T KP+I +Y DK TG KG+A VTY + +
Sbjct: 194 IFVSGMDATITEEEICQHFGAIGLIKNDKRTGKPKIWMYKDKSTGKNKGEATVTYDDQNA 253
Query: 351 VALATQLLDGTPFR 364
A DG F+
Sbjct: 254 ARSAIDWFDGKEFK 267
>gi|195501824|ref|XP_002097959.1| GE24196 [Drosophila yakuba]
gi|194184060|gb|EDW97671.1| GE24196 [Drosophila yakuba]
Length = 459
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K G+IK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 234 VFVLGMRLNVTKNDIILFFGKLGVIKMDESTNKPKIFVYKNKMTGRSKGEATITYVSPFS 293
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 294 AQAAISCLSGAKF 306
>gi|908756|gb|AAA70425.1| unknown protein, partial [Drosophila melanogaster]
Length = 365
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 83 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 142
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 143 AQSAIEWFDGRXFNGNA-IKVSLAQ 166
>gi|332028574|gb|EGI68611.1| RNA-binding protein cabeza [Acromyrmex echinatior]
Length = 377
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 43/74 (58%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ ++ EE+ + F GIIK D T KP++ +Y DK TG KG+A VTY + +
Sbjct: 131 IFVSGMDPSISEEEICQHFGAIGIIKHDKRTGKPKVWMYKDKNTGKSKGEATVTYDDQNA 190
Query: 351 VALATQLLDGTPFR 364
A DG F+
Sbjct: 191 ARSAISWFDGKDFK 204
>gi|344231366|gb|EGV63248.1| hypothetical protein CANTEDRAFT_106814 [Candida tenuis ATCC 10573]
Length = 300
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 103/216 (47%), Gaps = 33/216 (15%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+T ++++ LP +TV+E+ E F+K G I D + PRIK+Y D++ K+ +AL+ Y
Sbjct: 106 STGIFISNLPQSITVDELNEEFAKYGTISLD-KGNSPRIKLYYDEKDKFKQ-EALIIYDN 163
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFE---QKGERFIAKQVDSKKKKKLKKVEEK 404
SV LA Q+++ + + ++V +AKFE K +R A ++ SK K+ +E
Sbjct: 164 ATSVDLAIQMMNQVKMKNN---ILNVEEAKFEPIEDKSQR--ADEIRSKFYSKVMVIEN- 217
Query: 405 MLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
MF E + + L ++E D++EEC K G D + V
Sbjct: 218 ----------------------MFRKQEYKENTKLAEDIEEDIREECEKSGIKDILNVTF 255
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
VV V+FK IE + R + G +++
Sbjct: 256 FPSDCVVTVKFKSSSSVDTIIESFDKRDYDGLKLNV 291
>gi|194901966|ref|XP_001980522.1| GG18377 [Drosophila erecta]
gi|190652225|gb|EDV49480.1| GG18377 [Drosophila erecta]
Length = 444
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K G+IK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 219 VFVLGMRLNVTKNDIILFFGKLGVIKMDESTNKPKIFVYKNKMTGRSKGEATITYVSPFS 278
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 279 AQAAISCLSGAKF 291
>gi|21356695|ref|NP_650107.1| CG14718 [Drosophila melanogaster]
gi|15291249|gb|AAK92893.1| GH13594p [Drosophila melanogaster]
gi|23171035|gb|AAF54686.3| CG14718 [Drosophila melanogaster]
gi|220945246|gb|ACL85166.1| CG14718-PA [synthetic construct]
gi|220955064|gb|ACL90075.1| CG14718-PA [synthetic construct]
Length = 446
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K G+IK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 223 VFVLGMRLNVTKNDIILFFGKVGVIKMDESTNKPKIFVYKNKITGRSKGEATITYVSPFS 282
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 283 AQAAISCLSGAKF 295
>gi|402217675|gb|EJT97754.1| splicing factor CC1-like protein [Dacryopinax sp. DJM-731 SS1]
Length = 640
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 104/243 (42%), Gaps = 30/243 (12%)
Query: 278 PPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMK 337
PP + +YV L ++T ++ +VF G ++ + ++ D TG
Sbjct: 398 PPGVTAPPPGSMQLYVGSLHFNLTESDVKQVFEPFGELEF--------VDLHRDPLTGRS 449
Query: 338 KGDALVTYLKEPSVALATQLLDG-----------TPFRPDGKIPMSVTQAKFEQKGERFI 386
KG A V Y + +A Q +DG T G I ++ G +
Sbjct: 450 KGFAFVQYKRSEDARMALQSMDGFDLAGRQLKVNTVHEKGGAIRYQSQSDSLDESGGGNL 509
Query: 387 AKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPA----TVILRFMFTPAEMRADENLRS- 441
+KL ++E A L A +V+LR MF E+ EN +
Sbjct: 510 NAASRQALMQKLARIEPPKPAISPM--ASLPKAAMQSRSVLLRNMFKEPELEEKENGPNW 567
Query: 442 --ELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
EL DV++EC K G VD +K+ E QG + ++FK + A K IE +NGR+FGG+ I
Sbjct: 568 AKELTDDVKQECEDKYGLVDFIKL-EPDSQGEMYLKFKSIEAASKAIEGLNGRYFGGQPI 626
Query: 499 HAS 501
A+
Sbjct: 627 QAT 629
>gi|347966310|ref|XP_321451.5| AGAP001645-PA [Anopheles gambiae str. PEST]
gi|333470117|gb|EAA43136.5| AGAP001645-PA [Anopheles gambiae str. PEST]
Length = 409
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ G+ + T EE+ E F G+IK D T+KP+I +Y DKETG KG+A VTY +
Sbjct: 131 IFIQGMTPETTEEEIAERFGSIGVIKRDKRTQKPKIWMYKDKETGNMKGEATVTYDDASA 190
Query: 351 VALATQLLDGTPF 363
A D F
Sbjct: 191 AQSAIGWFDNKEF 203
>gi|301632392|ref|XP_002945270.1| PREDICTED: TATA-binding protein-associated factor 2N-like, partial
[Xenopus (Silurana) tropicalis]
Length = 129
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 45/73 (61%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ +D T +++ + F + GIIK + +T KP I +Y DKETG KG+A V++ PS
Sbjct: 21 IFVQGMGEDATQDQISDYFKQIGIIKINKKTGKPMINLYTDKETGKSKGEATVSFDDPPS 80
Query: 351 VALATQLLDGTPF 363
A + DG F
Sbjct: 81 AKAAIEWFDGKMF 93
>gi|389581947|dbj|GAB64668.1| hypothetical protein PCYB_022380 [Plasmodium cynomolgi strain B]
Length = 408
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 93/192 (48%), Gaps = 12/192 (6%)
Query: 321 TKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ 380
T +P+IKIY D E KGDALVTY+ SV +A + D R D I V +A+F +
Sbjct: 156 TTEPKIKIYYD-ENNQVKGDALVTYVYTQSVDIAIKYFDNFYIRQDCMI--RVEKAQFNK 212
Query: 381 KGERF-IAKQ--VDSKKKKKLKKVEE-KMLGWGGRDDAKLTIPATVILRFMFTPAE-MRA 435
K E ++K+ + KKK K K E+ ++ WG D T VI R +F+ + ++
Sbjct: 213 KKEASKVSKEEMLIKKKKIKAAKYEQLRLQKWG--DGYTGTKKKIVIFRNVFSYEDAVKH 270
Query: 436 DEN--LRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
DE ++ V+ E K PV V HP G+V V+FK ++A+ +
Sbjct: 271 DEGDPFYDFIKDLVEMEVKKYAPVHKVYPIPKHPNGIVCVKFKGVEEAEMIVSCFKDMEL 330
Query: 494 GGRQIHASEDDG 505
+++ DG
Sbjct: 331 NDKKLEVYFYDG 342
>gi|312385255|gb|EFR29802.1| hypothetical protein AND_00971 [Anopheles darlingi]
Length = 306
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ G+ + T EE+ E F G+IK D T+KP+I +Y DKETG KG+A VTY +
Sbjct: 27 IFIQGMTPETTEEEIAERFGSIGVIKRDKRTQKPKIWMYKDKETGNMKGEATVTYDDASA 86
Query: 351 VALATQLLDGTPF 363
A D F
Sbjct: 87 AQSAIGWFDNKEF 99
>gi|302695543|ref|XP_003037450.1| hypothetical protein SCHCODRAFT_80935 [Schizophyllum commune H4-8]
gi|300111147|gb|EFJ02548.1| hypothetical protein SCHCODRAFT_80935 [Schizophyllum commune H4-8]
Length = 409
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 106/243 (43%), Gaps = 30/243 (12%)
Query: 276 NKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
N PP +YV L ++T ++ +VF G ++ + ++ D TG
Sbjct: 169 NLPPGVSAPHGGAMQLYVGSLHFNLTEADIKQVFEPFGELE--------FVDLHRDPTTG 220
Query: 336 MKKGDALVTYLKEPSVALATQLLDG-----------TPFRPDGKIPMSVTQAKFEQKGER 384
KG A V Y + +A + ++G T G + + T++ + G
Sbjct: 221 RSKGYAFVQYKRAEDARMAMEQMEGFELAGRQLKVNTVHDKGGVVRYAQTESLDDSGGGN 280
Query: 385 FIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADENL 439
A + +K + +L A+ IP T V+L+ MF P E D+
Sbjct: 281 LNAASRQALMQKLARTDSAPLLP---EPVARPNIPQTMESRSVLLKNMFDPEEESGDD-W 336
Query: 440 RSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
EL DV+ EC K G V ++KV E QG + V+F A+K ++ +NGRWFGG+Q+
Sbjct: 337 DKELADDVKGECESKYGKVSAIKV-EKETQGEIYVKFDAVDAARKAVQGLNGRWFGGKQV 395
Query: 499 HAS 501
A+
Sbjct: 396 SAA 398
>gi|442616561|ref|NP_001259601.1| cabeza, isoform D [Drosophila melanogaster]
gi|440216829|gb|AGB95443.1| cabeza, isoform D [Drosophila melanogaster]
Length = 355
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 78 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 137
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 138 AQSAIEWFDGRDFNGNA-IKVSLAQ 161
>gi|195134702|ref|XP_002011776.1| GI11213 [Drosophila mojavensis]
gi|193906899|gb|EDW05766.1| GI11213 [Drosophila mojavensis]
Length = 414
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 125 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGTSKGEATVTYDDVNA 184
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A Q DG F +I +S+ Q
Sbjct: 185 AQSAIQWFDGNDFN-GAQIKVSLAQ 208
>gi|45184622|gb|AAS55563.1| CG14718 [Drosophila simulans]
Length = 202
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K GIIK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 61 VFVLGMRLNVTKNDIILFFGKVGIIKMDESTNKPKIFVYKNKITGRSKGEATITYVSPFS 120
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 121 AQAAISCLSGVKF 133
>gi|45184620|gb|AAS55562.1| CG14718 [Drosophila simulans]
Length = 202
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V G+ +VT +++ F K GIIK D T KP+I +Y +K TG KG+A +TY+ S
Sbjct: 61 VFVLGMRLNVTKNDIILFFGKVGIIKMDESTNKPKIFVYKNKITGRSKGEATITYVSPFS 120
Query: 351 VALATQLLDGTPF 363
A L G F
Sbjct: 121 AQAAISCLSGVKF 133
>gi|195395432|ref|XP_002056340.1| GJ10896 [Drosophila virilis]
gi|194143049|gb|EDW59452.1| GJ10896 [Drosophila virilis]
Length = 248
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
F L+++T ++V G+ +VT +++ F K G IK D T KP+I +Y +K TG KG+A
Sbjct: 29 FVLRLDT-IFVLGMRLNVTKNDIIIFFGKLGRIKMDELTMKPKIFVYKNKLTGRSKGEAT 87
Query: 343 VTYLKEPSVALATQLLDGTPF 363
+TY + A LDG F
Sbjct: 88 ITYTSPYAAQAAIHYLDGAKF 108
>gi|357610730|gb|EHJ67121.1| hypothetical protein KGM_22524 [Danaus plexippus]
Length = 466
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 43/74 (58%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ G+ T +E+ + F GIIK D +T++P++ +Y DK TG KG+A VTY +
Sbjct: 214 IFIQGMNPSTTEDELCQHFGAIGIIKTDKKTQRPKVWMYKDKATGQPKGEATVTYEDSNA 273
Query: 351 VALATQLLDGTPFR 364
+ A Q DG F
Sbjct: 274 ASSAIQWFDGKDFN 287
>gi|195441289|ref|XP_002068446.1| GK20418 [Drosophila willistoni]
gi|194164531|gb|EDW79432.1| GK20418 [Drosophila willistoni]
Length = 173
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 48/85 (56%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
TVI++ +F P + L E + ++++EC K G V V + + HP+G+ V ++
Sbjct: 46 TVIIKNLFVPELFEREVELILEYQNNLRDECGKCGMVRKVVIYDRHPEGIAQVNMSSPEE 105
Query: 481 AQKCIELMNGRWFGGRQIHASEDDG 505
A I++M GR+FG RQ+ A DG
Sbjct: 106 ADVVIQMMQGRFFGQRQLTAEHWDG 130
>gi|307109776|gb|EFN58013.1| hypothetical protein CHLNCDRAFT_57088 [Chlorella variabilis]
Length = 546
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 2/79 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PAE AD N +E+ DV EC K GPV+ V V N +G V V+F A
Sbjct: 456 LLLKNMFDPAEETAD-NWDAEIAEDVGSECQKYGPVEHVFVDRNS-RGFVYVKFTAISSA 513
Query: 482 QKCIELMNGRWFGGRQIHA 500
++GRWF RQI A
Sbjct: 514 ANAQRALHGRWFAARQIAA 532
>gi|403222792|dbj|BAM40923.1| RNA splicing factor [Theileria orientalis strain Shintoku]
Length = 649
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%)
Query: 419 PATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDR 478
A ++L M+T + + + E+E DV+EEC K G V V + P G V V+F+
Sbjct: 554 AANIVLSNMYTSEDYADNNDFFDEIEDDVREECKKYGEVVKVYLNRRKPDGKVYVKFRSN 613
Query: 479 KDAQKCIELMNGRWFGGRQIH 499
DAQ + + GR+F G I
Sbjct: 614 TDAQTAHKSLQGRYFAGNTIQ 634
>gi|195447632|ref|XP_002071301.1| GK25719 [Drosophila willistoni]
gi|194167386|gb|EDW82287.1| GK25719 [Drosophila willistoni]
Length = 396
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +E+ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 121 IFVSGMDPSTTEQEIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 180
Query: 351 VALATQLLDGTPF 363
A + DG F
Sbjct: 181 AQSAIEWFDGREF 193
>gi|409051610|gb|EKM61086.1| hypothetical protein PHACADRAFT_247456 [Phanerochaete carnosa
HHB-10118-sp]
Length = 584
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 105/229 (45%), Gaps = 33/229 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 360 LYVGSLHFNLTESDIRQVFEPFGELE--------FVDLHRDPMTGRSKGYAFVQYKRGED 411
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE-RFIAKQ-VDSKKKKKLKKVEEKMLGW 408
+A + ++G F G+ ++ +KG R+ ++ +D L + L
Sbjct: 412 AKMALEQMEG--FELAGR---TLRVNTVHEKGNVRYTPQESLDDTGGGNLNAASRQALMQ 466
Query: 409 G-GRDD---------AKLTIP-----ATVILRFMFTPAEMRADENLRSELEADVQEECV- 452
R D K IP +V+L+ MF P E + + EL DV+ E
Sbjct: 467 KLARTDQPAARPQPIMKPNIPQSMQSKSVLLKNMFNPEE-ETERDWDKELADDVKNEVED 525
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K G V+ +KV E QG + V+F + A+K IE ++GRWFGGRQ+ A+
Sbjct: 526 KYGDVNFIKV-ERESQGEIYVKFDSIESAKKAIEGLHGRWFGGRQVSAA 573
>gi|154413935|ref|XP_001579996.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121914209|gb|EAY19010.1| hypothetical protein TVAG_246890 [Trichomonas vaginalis G3]
Length = 273
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 103/242 (42%), Gaps = 21/242 (8%)
Query: 269 QVEKKEANKPPDSWFELK-VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
Q +KKE N P E K + V LP+ T +++ E + G++ PET + I
Sbjct: 18 QNDKKEENNQPAQVVEHKNAQLSIVVKNLPEGTTSQQISEFCKRVGVLATHPETGEDLIL 77
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ + A VTY + A ++LDG F G+ + + ER +
Sbjct: 78 L------NPRAHKATVTYAYPEATNHAIEILDGEQFLVTGE----------KVQVERAVR 121
Query: 388 KQVDSKKKKKLKKVEEKMLGWGG----RDDAKLTIPATVILRFMFTPAEMRADENLRSEL 443
+ D K K K+ K + G D ++ +I++ +F E+ + L ++
Sbjct: 122 EPYDFSKWKGAMKLSRKFHSYFGGEEKLDPSEQKRVKIMIMKNVFDMDELIKEPELYGKI 181
Query: 444 EADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASED 503
D+ +C G V VK E +P+G+V+VRF + A K I ++ + R I
Sbjct: 182 IKDLTTQCEPFGKVTLVKPIEKNPEGIVIVRFDQAQSASKAIGDLDQCEYRNRIITVEPW 241
Query: 504 DG 505
DG
Sbjct: 242 DG 243
>gi|341895819|gb|EGT51754.1| hypothetical protein CAEBREN_22252 [Caenorhabditis brenneri]
Length = 650
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 69/146 (47%), Gaps = 17/146 (11%)
Query: 247 KVNSADNVVEEKHNGKRKQPDKQVEKKEANKP--------PDSWFELKVNTHVYVTGLPD 298
K N V +E + KR Q DK E E + P P+ W ELK V+V G+
Sbjct: 4 KANVMGLVEQETTSPKRPQLDKS-EGGEGHVPTVTEAALKPEKWVELK--DTVFVQGIST 60
Query: 299 DVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLL 358
+ +FS CG I ++ ET PRIKI DK G G+ +VT++ A ++
Sbjct: 61 SANTASIAALFSTCGDIAQN-ET-GPRIKINSDKNNG---GECMVTFVNANDAQKAIKMY 115
Query: 359 DGTPFRPDGKIPMSVTQAKFEQKGER 384
+G PF G PM V+ AKF E+
Sbjct: 116 NGRPFSA-GAEPMKVSLAKFHGDKEK 140
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 3/94 (3%)
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
+K++ + T + + + + +F C I + PR+KIY D++TG KG+ +VT
Sbjct: 504 VKIHLKISSTTIISRASADFLAFLFRTCQDIARNES--GPRVKIYTDRDTGEPKGECMVT 561
Query: 345 YLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF 378
+ A Q +G PF P G M V AKF
Sbjct: 562 FDNADDAQKAIQTFNGKPF-PGGSQTMKVVLAKF 594
>gi|303271495|ref|XP_003055109.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463083|gb|EEH60361.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 101
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 41/74 (55%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VY+ GLPD V ++ E+F G +K D + ++ IY DK+TG+ KGDA V+Y
Sbjct: 4 NDMVYIAGLPDTVRESDLSEIFGSIGQLKRDKKKNCDKVWIYRDKDTGLPKGDATVSYAD 63
Query: 348 EPSVALATQLLDGT 361
+ A DGT
Sbjct: 64 PHAAEAAVSWFDGT 77
>gi|241812001|ref|XP_002416459.1| RNA binding protein, putative [Ixodes scapularis]
gi|215510923|gb|EEC20376.1| RNA binding protein, putative [Ixodes scapularis]
Length = 347
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 1/87 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+ LP+DV+ + E F G+IK D +T K +I IY DK TG KG+A VTY P+
Sbjct: 129 IFVSNLPEDVSENHLAEHFGAIGLIKIDKKTGKSKIWIYKDKITGKGKGEATVTYDDPPT 188
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAK 377
+ A G F KI + + Q K
Sbjct: 189 ASSAITWFHGKEFM-GSKISVELAQRK 214
>gi|442616559|ref|NP_727946.2| cabeza, isoform C [Drosophila melanogaster]
gi|440216828|gb|AAF48578.3| cabeza, isoform C [Drosophila melanogaster]
Length = 384
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 107 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 166
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 167 AQSAIEWFDGRDFNGNA-IKVSLAQ 190
>gi|195174668|ref|XP_002028094.1| GL21338 [Drosophila persimilis]
gi|194115834|gb|EDW37877.1| GL21338 [Drosophila persimilis]
Length = 386
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ + T +++ F GIIK+D T KP+I +Y +K+TG+ KG+A VTY +
Sbjct: 103 IFVSGMDTNTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKDTGLSKGEATVTYDDTNA 162
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F+ I +S+ Q
Sbjct: 163 AQSAIEWFDGRVFK-GANIKVSLAQ 186
>gi|198469746|ref|XP_002134400.1| GA22827 [Drosophila pseudoobscura pseudoobscura]
gi|198147015|gb|EDY73027.1| GA22827 [Drosophila pseudoobscura pseudoobscura]
Length = 386
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ + T +++ F GIIK+D T KP+I +Y +K+TG+ KG+A VTY +
Sbjct: 103 IFVSGMDTNTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKDTGLSKGEATVTYDDTNA 162
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F+ I +S+ Q
Sbjct: 163 AQSAIEWFDGRVFK-GANIKVSLAQ 186
>gi|428672327|gb|EKX73241.1| RNA recognition motif domain containing protein [Babesia equi]
Length = 511
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 102/232 (43%), Gaps = 34/232 (14%)
Query: 291 VYVTGLPDDVTV---EEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
+YV GL ++ E+ ++FS G I + ++I+ D ETG KG A + + +
Sbjct: 275 IYVGGLIGALSALNEVELKQLFSPFGTIID--------VEIFRDPETGESKGYAFLKFRR 326
Query: 348 EPSVALATQLLDGTPFRPDGK--------IPMSVTQAKFEQKGERFIAKQVD------SK 393
A ++G F G+ + + ++++ G+ I + D S
Sbjct: 327 SSEAKEAMNTMNG--FDIGGQQIKVGYANLNTTDSKSRLSSLGDVDIERLDDDGGGLISG 384
Query: 394 KKKKLKKVEEKMLGWGGRDDAKLTI------PATVILRFMFTPAEMRADE-NLRSELEAD 446
K+ +E+ A + + +IL MFT + ADE N E+E D
Sbjct: 385 ATNKIALMEKLQRTTAAPISATFSSGKASGPTSNIILSNMFTANDPGADEPNFFVEIEED 444
Query: 447 VQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
V+EEC K G V +V + + G V V+F++ DA + +NGR+F G I
Sbjct: 445 VKEECEKYGKVVAVYLNKKTIDGKVWVKFQNSTDASTAYKGLNGRYFAGNTI 496
>gi|344294421|ref|XP_003418916.1| PREDICTED: RNA-binding protein FUS-like [Loxodonta africana]
Length = 519
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 43/76 (56%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T +P I +Y D+ETG KG+A V++
Sbjct: 277 NNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDD 336
Query: 348 EPSVALATQLLDGTPF 363
P G F
Sbjct: 337 PPXXXXXXXXXXGKEF 352
>gi|91090396|ref|XP_970338.1| PREDICTED: similar to cabeza CG3606-PB [Tribolium castaneum]
gi|270013385|gb|EFA09833.1| hypothetical protein TcasGA2_TC011980 [Tribolium castaneum]
Length = 357
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 50/79 (63%), Gaps = 1/79 (1%)
Query: 285 LKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVT 344
++V+T ++V+G+ ++T +E+ + F G+IK D +T+K +I +Y DK+TG+ KG+A VT
Sbjct: 106 VQVDT-IFVSGMNPNLTEDEIAQHFGSIGVIKMDKKTQKRKIWLYKDKQTGLSKGEATVT 164
Query: 345 YLKEPSVALATQLLDGTPF 363
Y + A DG F
Sbjct: 165 YDDSNAAQSAISWFDGKEF 183
>gi|300175926|emb|CBK21922.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 90/174 (51%), Gaps = 32/174 (18%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPR------IKIYVDKETGMKKG 339
K+NT +Y+ LP DVT EE+ +VF + GII+++ + P+ IK+Y + E G G
Sbjct: 57 KLNTWIYIQNLPKDVTEEELSDVFKRYGIIQQNLDG-SPKALFGCSIKLYKN-EKGELTG 114
Query: 340 DALVTYLKEPSVALATQLLDGTPFR----------PDGKIPMSVTQAKFEQK---GERFI 386
DA + YL + S+ LA ++ D P R K + V++A F++K ++ +
Sbjct: 115 DARLCYLCKESIQLAIKMQDDYPLRYGCRHREEVTCSDKENIRVSEAVFDKKDKEHDQAV 174
Query: 387 AKQVDSKKKKKLK----KVEEKMLGWGGRDDAKLTIPAT----VILRFMFTPAE 432
+ +K K+L+ + +++ L W DD K+ ++L+ MF+ AE
Sbjct: 175 NLGEEDEKTKRLRWMFMQQQKQQLSW---DDEKVDTSRVGLRIIVLQNMFSLAE 225
>gi|405959794|gb|EKC25786.1| Leucine-zipper-like transcriptional regulator 1 [Crassostrea gigas]
Length = 1621
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 38/55 (69%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
++V+GLP+D+ + +V+ F G+IK D T KP++ IY DK TG KG+A +TY
Sbjct: 1371 IFVSGLPEDIDEDGLVQHFGSIGVIKTDKRTGKPKVWIYKDKNTGRPKGEATITY 1425
>gi|242008664|ref|XP_002425122.1| RNA-binding protein cabeza, putative [Pediculus humanus corporis]
gi|212508796|gb|EEB12384.1| RNA-binding protein cabeza, putative [Pediculus humanus corporis]
Length = 424
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 44/74 (59%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+++G+P VT E+ + F G+IK+D +T KPRI IY DK TG KG+A VTY +
Sbjct: 184 VFISGMPPHVTELEIEQHFGAIGLIKKDKKTMKPRIWIYNDKVTGKSKGEATVTYDDPNA 243
Query: 351 VALATQLLDGTPFR 364
+ +G F+
Sbjct: 244 ARSSIDWFNGKDFK 257
>gi|118357832|ref|XP_001012164.1| splicing factor, CC1-like family protein [Tetrahymena thermophila]
gi|89293931|gb|EAR91919.1| splicing factor, CC1-like family protein [Tetrahymena thermophila
SB210]
Length = 588
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 44/90 (48%)
Query: 409 GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
G R L ++L MF ++ + D + ++L+ DV EEC K G V V V P
Sbjct: 481 GNRPPTSLLPTNYIVLVNMFAESDFKNDPSFFADLKEDVSEECKKYGDVVEVFVAIRSPC 540
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
G V V F D A C + M+GR F GR I
Sbjct: 541 GSVFVLFNDAVGAVTCAKNMDGRMFNGRPI 570
>gi|195041477|ref|XP_001991263.1| GH12560 [Drosophila grimshawi]
gi|193901021|gb|EDV99887.1| GH12560 [Drosophila grimshawi]
Length = 415
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 42/73 (57%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +K+TG KG+A VTY +
Sbjct: 125 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKDTGASKGEATVTYDDVNA 184
Query: 351 VALATQLLDGTPF 363
A Q DG F
Sbjct: 185 AQSAIQWFDGNDF 197
>gi|255080516|ref|XP_002503838.1| predicted protein [Micromonas sp. RCC299]
gi|226519105|gb|ACO65096.1| predicted protein [Micromonas sp. RCC299]
Length = 447
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 54/96 (56%), Gaps = 3/96 (3%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV GLPD V+ ++ F G +K+D + + +I +Y D++TG+ KGDA V+Y+
Sbjct: 7 NDTVYVAGLPDTVSERDIAAHFGSIGQLKQDKKRRCEKIWLYRDRDTGLPKGDATVSYMD 66
Query: 348 EPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
+ A + T F G+ +SV+ A+ + GE
Sbjct: 67 PHAAEAAVNWFNNTQFM--GRT-LSVSLAERKGGGE 99
>gi|195091488|ref|XP_001997534.1| GH11790 [Drosophila grimshawi]
gi|193905671|gb|EDW04538.1| GH11790 [Drosophila grimshawi]
Length = 179
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 61/113 (53%), Gaps = 2/113 (1%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
TVI++ +F P + L E + ++++EC K G V V + + H +G+ + ++
Sbjct: 46 TVIIKNLFVPELFENEVELILEYQNNLRDECGKCGMVRKVVIYDRHQEGIAQINMSTPEE 105
Query: 481 AQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEAS-RLEQFGAELEAD 532
A I++M GR+FG RQ+ A DG + I + AEAS RL ++ L A+
Sbjct: 106 ADVVIQMMQGRYFGQRQLSAEHWDGKTKY-KIDESAAEASERLNKWDEYLAAE 157
>gi|532788|gb|AAA86955.1| RNA binding protein [Drosophila melanogaster]
gi|567106|gb|AAC41563.1| RNA binding protein [Drosophila melanogaster]
Length = 404
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 121 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 180
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 181 AQSAIEWFDGRDFNGNA-IKVSLAQ 204
>gi|24642436|ref|NP_523365.2| cabeza, isoform B [Drosophila melanogaster]
gi|47117804|sp|Q27294.2|CAZ_DROME RecName: Full=RNA-binding protein cabeza; AltName: Full=P19;
AltName: Full=Sarcoma-associated RNA-binding fly homolog
gi|22832345|gb|AAN09389.1| cabeza, isoform B [Drosophila melanogaster]
gi|28557651|gb|AAO45231.1| LD22761p [Drosophila melanogaster]
gi|220944750|gb|ACL84918.1| caz-PB [synthetic construct]
gi|220954666|gb|ACL89876.1| caz-PB [synthetic construct]
Length = 399
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 122 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 181
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 182 AQSAIEWFDGRDFNGNA-IKVSLAQ 205
>gi|452823555|gb|EME30564.1| U2 snRNP auxiliary factor large subunit, putative isoform 1
[Galdieria sulphuraria]
Length = 522
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 94/224 (41%), Gaps = 45/224 (20%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V++ GLP +T +++ E+ S G + + D TG+ KG A Y K+PS
Sbjct: 325 VFIGGLPYHLTEDQIKEILSSYGPLNA--------FNLVKDPNTGLSKGYAFFQY-KDPS 375
Query: 351 VA-LATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
+ A + L+G G ++V +A +V + G
Sbjct: 376 IVEAAIKGLNGMTM---GDKTLTVRRAS----------------------QVSSGSVELG 410
Query: 410 GRDDAKLTIPATVI-LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC----- 463
+ P ++ LR M P E+ DE +E DV+EE K G V VK+
Sbjct: 411 QSFSPTVRYPTRILELRNMVEPEELVDDEEYEDIIE-DVREESSKYGEVTEVKIPRPSKT 469
Query: 464 -ENHPQGV--VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E +P G+ V V FK DA+K + GR FGG+ + A+ D
Sbjct: 470 DEANPPGLGKVFVSFKTVSDAEKAFAALTGRRFGGKSVIANYYD 513
>gi|268563044|ref|XP_002638738.1| Hypothetical protein CBG18538 [Caenorhabditis briggsae]
Length = 129
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 56/96 (58%), Gaps = 11/96 (11%)
Query: 310 SKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKI 369
SKCG+I+ D T KP+ K+Y D E G KGD Y+K+ SV LA +LDG+ GK
Sbjct: 3 SKCGVIQPDIRTNKPKCKLYRD-EHGELKGDGRCCYIKKESVELACNILDGSMLL--GK- 58
Query: 370 PMSVTQAKFEQKGERFIAKQVD-SKKKKKLKKVEEK 404
+ V +A FE KG+ D S+K+KKL ++K
Sbjct: 59 EVKVEEAHFEMKGD------FDPSRKRKKLTATQKK 88
>gi|195351386|ref|XP_002042215.1| GM13418 [Drosophila sechellia]
gi|194124058|gb|EDW46101.1| GM13418 [Drosophila sechellia]
Length = 400
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 119 IFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 178
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 179 AQSAIEWFDGRDFNGNA-IKVSLAQ 202
>gi|388583572|gb|EIM23873.1| splicing factor, CC1-like protein [Wallemia sebi CBS 633.66]
Length = 459
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 14/107 (13%)
Query: 395 KKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEEC-VK 453
++K KKV+E+M G V+L++MF +E + EL DV+ EC K
Sbjct: 350 REKEKKVQERMKTKG------------VLLKYMFKASE-ETEAGWEKELAEDVKTECENK 396
Query: 454 IGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
G V + V + +G ++V+F + A+ I +NGRWFGGRQ+ A
Sbjct: 397 YGKVQEIGVDKESEEGEIVVKFYTIESAEDAINGLNGRWFGGRQVKA 443
>gi|427787095|gb|JAA58999.1| Putative rna-binding protein fus [Rhipicephalus pulchellus]
Length = 597
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 1/87 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+ LP+DV ++ E F G+IK D +T K +I IY DK TG KG+A +TY P+
Sbjct: 345 IFVSNLPEDVGEIQLAEHFGAIGLIKIDKKTGKNKIWIYKDKITGKGKGEATITYDDPPT 404
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAK 377
A G F GKI + + Q K
Sbjct: 405 ANSAITWFHGKEFM-GGKINVELAQRK 430
>gi|403412344|emb|CCL99044.1| predicted protein [Fibroporia radiculosa]
Length = 599
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 102/229 (44%), Gaps = 33/229 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 375 LYVGSLHFNLTESDIKQVFEPFGELE--------FVDLHRDPMTGRSKGYAFVQYKRAED 426
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKML-- 406
+A + ++G F G+ ++ +KG A+Q +D L + L
Sbjct: 427 ARMALEQMEG--FELAGR---TLRVNTVHEKGTTKYAQQDSLDEAGGGNLNAASRQALMQ 481
Query: 407 GWGGRDDAKLTIPA-------------TVILRFMFTPAEMRADENLRSELEADVQEECV- 452
D + +P +V+L+ MF P E + + +L DV+ EC
Sbjct: 482 KLARTDQPAVKLPPVTKPNIPQSMQSRSVLLKNMFNPEE-ETERDWDKDLAEDVKGECED 540
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K G V+ +KV E QG + V+F + A+ I+ ++GRWFGG Q+ A+
Sbjct: 541 KYGKVEFIKV-ERESQGEIYVKFDSIESAKNAIQGLHGRWFGGNQVSAA 588
>gi|221126827|ref|XP_002154968.1| PREDICTED: uncharacterized protein LOC100197414 [Hydra
magnipapillata]
Length = 383
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 2/92 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V GLP TVEE+ F + GIIK D T++P+I +Y ++ G KG+ VTY +
Sbjct: 187 VFVAGLPPSATVEEIANFFGQIGIIKMDKRTREPKIWVYKHRD-GTGKGECTVTYEDAEA 245
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKG 382
A +G F + I +S+ Q K ++G
Sbjct: 246 SQAAVDWFNGKQFGSN-SIKVSIAQRKAPEEG 276
>gi|351700890|gb|EHB03809.1| RNA-binding protein FUS [Heterocephalus glaber]
Length = 383
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 53/94 (56%), Gaps = 1/94 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N ++V GL ++VT+E + + F + GIIK + +T++ I +Y D+ET KG+ V++ +
Sbjct: 242 NNTIFVQGLGENVTIESVADHFKQIGIIKTNKKTEQSMINLYTDRETAKLKGEGTVSFDE 301
Query: 348 EPSVALATQLLDGTPFRPDG-KIPMSVTQAKFEQ 380
PS A L+G F + K+ + A F Q
Sbjct: 302 PPSAKAAIDWLEGKEFSGNPIKVSFATRLADFNQ 335
>gi|452823554|gb|EME30563.1| U2 snRNP auxiliary factor large subunit, putative isoform 2
[Galdieria sulphuraria]
Length = 538
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 94/224 (41%), Gaps = 45/224 (20%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V++ GLP +T +++ E+ S G + + D TG+ KG A Y K+PS
Sbjct: 341 VFIGGLPYHLTEDQIKEILSSYGPLNA--------FNLVKDPNTGLSKGYAFFQY-KDPS 391
Query: 351 VA-LATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
+ A + L+G G ++V +A +V + G
Sbjct: 392 IVEAAIKGLNGMTM---GDKTLTVRRAS----------------------QVSSGSVELG 426
Query: 410 GRDDAKLTIPATVI-LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC----- 463
+ P ++ LR M P E+ DE +E DV+EE K G V VK+
Sbjct: 427 QSFSPTVRYPTRILELRNMVEPEELVDDEEYEDIIE-DVREESSKYGEVTEVKIPRPSKT 485
Query: 464 -ENHPQGV--VLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E +P G+ V V FK DA+K + GR FGG+ + A+ D
Sbjct: 486 DEANPPGLGKVFVSFKTVSDAEKAFAALTGRRFGGKSVIANYYD 529
>gi|407918973|gb|EKG12232.1| hypothetical protein MPH_10639 [Macrophomina phaseolina MS6]
Length = 602
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 10/95 (10%)
Query: 414 AKLTIPATV-------ILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCEN 465
A T+P +V +L+ MF PAE E+ ELE DV+EEC K G V + + N
Sbjct: 494 AARTLPVSVNQASRCVVLKNMFDPAEEEG-ESWMKELEDDVKEECQAKYGTVVHIALDPN 552
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
QG + V+F + + I+ +NGR+FGGRQI A
Sbjct: 553 T-QGDIYVKFDTVQGGENAIKGLNGRFFGGRQITA 586
>gi|164659450|ref|XP_001730849.1| hypothetical protein MGL_1848 [Malassezia globosa CBS 7966]
gi|159104747|gb|EDP43635.1| hypothetical protein MGL_1848 [Malassezia globosa CBS 7966]
Length = 549
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 1/82 (1%)
Query: 420 ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRK 479
A ++LR MF PAE + + +L DV+ EC + G V+SV V + G V V F
Sbjct: 465 AAILLRHMFDPAE-ETEPHWHVDLREDVRAECERHGTVESVFVDTSSRDGEVYVCFATTD 523
Query: 480 DAQKCIELMNGRWFGGRQIHAS 501
DAQ+ + GR+FGG+++ AS
Sbjct: 524 DAQRARASLQGRFFGGKRVEAS 545
>gi|449551106|gb|EMD42070.1| hypothetical protein CERSUDRAFT_90674 [Ceriporiopsis subvermispora
B]
Length = 623
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 100/228 (43%), Gaps = 33/228 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G ++ + ++ D TG KG A V Y +
Sbjct: 399 LYVGSLHFNLTESDIKQVFEPFGELEF--------VDLHRDPMTGRSKGYAFVQYKRSED 450
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKMLGW 408
+A + ++G F G+ ++ +KG +Q +D L + L
Sbjct: 451 ARMALEQMEG--FELAGR---TLRVNTVHEKGTIRYTQQDSLDEAGGGNLNAASRQALMQ 505
Query: 409 G-GRDDAKLTIPA--------------TVILRFMFTPAEMRADENLRSELEADVQEECV- 452
R D + P +V+L+ MF P E + + +L DV+ EC
Sbjct: 506 KLARTDQTVITPPPVVRPNIPQTMQSRSVLLKNMFNP-ENETERDWDKDLAEDVKYECED 564
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G V+ +KV E QG + V+F + A+ I+ +NGRWFGG Q+ A
Sbjct: 565 KYGKVEFIKV-EKDSQGEIYVKFDSVESAKNAIQGLNGRWFGGNQVSA 611
>gi|390604396|gb|EIN13787.1| splicing factor CC1-like protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 433
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 3/82 (3%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRK 479
+V+L+ MF PAE +++ +L DV+ EC K G V+ +KV + QG + V+F
Sbjct: 343 SVLLKNMFDPAE-ETEKDWDKDLADDVKVECENKYGMVNFIKV-DKESQGEIYVKFDTVD 400
Query: 480 DAQKCIELMNGRWFGGRQIHAS 501
A+K IE +NGR+FGGRQ+ A+
Sbjct: 401 SAKKAIEGLNGRYFGGRQVTAT 422
>gi|403175151|ref|XP_003889057.1| hypothetical protein PGTG_22242 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171476|gb|EHS64385.1| hypothetical protein PGTG_22242 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 705
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 110/261 (42%), Gaps = 52/261 (19%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIK-----EDPETKKPRIKIYV--- 330
P+S+ L YV L ++T +++ +VF G I+ D T K + +V
Sbjct: 446 PNSYARL------YVGSLNFNLTDDDIRQVFQPFGDIEYVDLHRDQITGKSKGYAFVQFK 499
Query: 331 ---DKETGMKK-------GDALVTYLK-EPSVALATQLLDGTP----FRPDGKIPMSVTQ 375
D + M+K G AL +K +P AL G P G +
Sbjct: 500 NMHDAKNAMEKMNGFQLAGRALRVEIKAQPPAALLNATAPGVANPVIVTPSGGNFTAPAP 559
Query: 376 AKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-------------- 421
+ FE++ E I ++ ++ +VE ++ R + +P T
Sbjct: 560 STFEERLEDPIGGNLN-----QISRVE--LMHKLARTEQPTNVPVTDMFRPNIPTATSRS 612
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L+ MF P E ++ EL DV+ EC K GPV ++ + + G + + F
Sbjct: 613 VLLKNMFNPEE-ETEQGWDIELRDDVKGECEEKYGPVLAIAIEKESTAGDIYITFDSVPS 671
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
AQK I +N RWFGGRQI A+
Sbjct: 672 AQKAITGLNNRWFGGRQITAA 692
>gi|443684509|gb|ELT88437.1| hypothetical protein CAPTEDRAFT_222177 [Capitella teleta]
Length = 323
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 45/79 (56%), Gaps = 2/79 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V GL + T EE+ E F GIIK+D T P+I +Y DK TG KG+A +TY + +
Sbjct: 84 VFVQGLDSETTQEELQEHFGSIGIIKKDKRTGLPKIWVYKDKMTGKPKGEATITYEDDQT 143
Query: 351 VALATQLLDGTPFRPDGKI 369
A +G F+ GK+
Sbjct: 144 ATSAIDWFNGKEFK--GKV 160
>gi|71003261|ref|XP_756311.1| hypothetical protein UM00164.1 [Ustilago maydis 521]
gi|46096316|gb|EAK81549.1| hypothetical protein UM00164.1 [Ustilago maydis 521]
Length = 640
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 50/81 (61%), Gaps = 3/81 (3%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRK 479
+++L+ MF PAE + N ++L DV+EEC K GPV S+ V E G + V F +
Sbjct: 561 SLLLKNMFDPAE-ETEPNWDTDLAEDVKEECQAKYGPVTSIHV-EKDSAGEIYVTFANLD 618
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
++K ++ +NGR+FGG+ I A
Sbjct: 619 ASRKALDGLNGRFFGGKPISA 639
>gi|225425084|ref|XP_002273586.1| PREDICTED: uncharacterized protein LOC100249627 [Vitis vinifera]
gi|297738257|emb|CBI27458.3| unnamed protein product [Vitis vinifera]
Length = 414
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + E F G+IK+D T +P+I +Y DK T KGDA VTY +
Sbjct: 12 NGSVYVCNLPHGTDETMLAEYFGTIGLIKKDKRTGRPKIWLYRDKVTNEPKGDATVTY-E 70
Query: 348 EPSVALAT 355
+P ALA
Sbjct: 71 DPHAALAA 78
>gi|194770186|ref|XP_001967178.1| GF19034 [Drosophila ananassae]
gi|190619298|gb|EDV34822.1| GF19034 [Drosophila ananassae]
Length = 374
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 41/73 (56%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ T +++ F GIIK+D T KP+I +Y +KETG KG+A VTY +
Sbjct: 107 IFVCGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 166
Query: 351 VALATQLLDGTPF 363
A + DG F
Sbjct: 167 AQSAIEWFDGRDF 179
>gi|403178034|ref|XP_003336468.2| hypothetical protein PGTG_18139 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375173310|gb|EFP92049.2| hypothetical protein PGTG_18139 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 705
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 110/261 (42%), Gaps = 52/261 (19%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIK-----EDPETKKPRIKIYV--- 330
P+S+ L YV L ++T +++ +VF G I+ D T K + +V
Sbjct: 446 PNSYARL------YVGSLNFNLTDDDIRQVFQPFGDIEYVDLHRDQITGKSKGYAFVQFK 499
Query: 331 ---DKETGMKK-------GDALVTYLK-EPSVALATQLLDGTP----FRPDGKIPMSVTQ 375
D + M+K G AL +K +P AL G P G +
Sbjct: 500 NMHDAKNAMEKMNGFQLAGRALRVEIKAQPPAALLNATAPGVANPVIVTPSGGNFTAPAP 559
Query: 376 AKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPAT-------------- 421
+ FE++ E I ++ ++ +VE ++ R + +P T
Sbjct: 560 STFEERLEDPIGGNLN-----QISRVE--LMHKLARTEQPTNVPVTDMFRPNIPTATSRS 612
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L+ MF P E ++ EL DV+ EC K GPV ++ + + G + + F
Sbjct: 613 VLLKNMFNPEE-ETEQGWDIELRDDVKGECEEKYGPVLAIAIEKESTAGDIYITFDSVPS 671
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
AQK I +N RWFGGRQI A+
Sbjct: 672 AQKAITGLNNRWFGGRQITAA 692
>gi|427795129|gb|JAA63016.1| Putative rna binding protein, partial [Rhipicephalus pulchellus]
Length = 298
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 1/87 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+ LP+DV ++ E F G+IK D +T K +I IY DK TG KG+A +TY P+
Sbjct: 46 IFVSNLPEDVGEIQLAEHFGAIGLIKIDKKTGKNKIWIYKDKITGKGKGEATITYDDPPT 105
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAK 377
A G F GKI + + Q K
Sbjct: 106 ANSAITWFHGKEFM-GGKINVELAQRK 131
>gi|378731264|gb|EHY57723.1| nucleolin [Exophiala dermatitidis NIH/UT8656]
Length = 562
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE EN ELE DV+ EC K G V + V N QG + ++F+
Sbjct: 467 VLLRNMFDPAEEEG-ENWVKELEDDVRAECEEKYGHVVHISVDPN-TQGDIYLKFERVSG 524
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 525 GENAIKGLNGRYFGGRQISA 544
>gi|363808018|ref|NP_001241952.1| uncharacterized protein LOC100780819 [Glycine max]
gi|255646988|gb|ACU23963.1| unknown [Glycine max]
Length = 365
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N YV LP + E F G+IK+D T +P+I +Y DKET KGDA VTY +
Sbjct: 12 NGSAYVCNLPYGTDDNMLAEYFGTIGLIKKDKRTGRPKIWLYRDKETNEPKGDATVTY-E 70
Query: 348 EPSVALAT 355
+P A+A
Sbjct: 71 DPHAAVAA 78
>gi|118357830|ref|XP_001012163.1| hypothetical protein TTHERM_00101140 [Tetrahymena thermophila]
gi|89293930|gb|EAR91918.1| hypothetical protein TTHERM_00101140 [Tetrahymena thermophila
SB210]
Length = 110
Score = 56.2 bits (134), Expect = 4e-05, Method: Composition-based stats.
Identities = 36/98 (36%), Positives = 48/98 (48%), Gaps = 3/98 (3%)
Query: 409 GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
G R L ++L MFT + + D + ++L+ DV EEC K G V V V P
Sbjct: 3 GNRPPTSLLPTNYIVLVNMFTETDFQNDPDFFTDLKEDVSEECKKYGDVVEVFVAIRSPC 62
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA---SED 503
G V V F D A C + ++GR F GR I + SED
Sbjct: 63 GSVFVLFNDATGAVNCAKNIDGRMFNGRPISSYCISED 100
>gi|428174818|gb|EKX43711.1| hypothetical protein GUITHDRAFT_163728 [Guillardia theta CCMP2712]
Length = 501
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
++L MF PA+ + + E++ DV+EEC K G V V HP G+V VRF
Sbjct: 409 MVLVNMFDPAK-ETEPDFHVEIQDDVKEECEAKFGRVIQVVADRKHPNGLVYVRFDSTDT 467
Query: 481 AQKCIELMNGRWFGGRQIHA 500
AQK + GR+F G+QI A
Sbjct: 468 AQKAQAGLQGRFFAGKQISA 487
>gi|237836525|ref|XP_002367560.1| RNA-binding protein, putative [Toxoplasma gondii ME49]
gi|211965224|gb|EEB00420.1| RNA-binding protein, putative [Toxoplasma gondii ME49]
gi|221484014|gb|EEE22318.1| hypothetical protein TGGT1_018190 [Toxoplasma gondii GT1]
gi|221505287|gb|EEE30941.1| RNA-binding protein, putative [Toxoplasma gondii VEG]
Length = 532
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 6/83 (7%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
V+L M TP+E + EL+ +V+EEC K G + V+V + V F D A
Sbjct: 445 VLLSNMVTPSE------VDGELKDEVREECSKFGSIKRVEVHTLKETVRIFVEFSDLSGA 498
Query: 482 QKCIELMNGRWFGGRQIHASEDD 504
++ I ++GRWFGGRQI A+ D
Sbjct: 499 REAIPSLHGRWFGGRQIIANTYD 521
>gi|402593184|gb|EJW87111.1| Zn-finger in Ran binding protein [Wuchereria bancrofti]
Length = 343
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 53/96 (55%), Gaps = 1/96 (1%)
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
+++V V++ G+P + + +VFS G I + T +PRIKIY D+ET KG+ +
Sbjct: 42 QIEVRETVFIQGIPVTANEQFIADVFSTQGDIARNERTGEPRIKIYTDRETNQPKGECTI 101
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
T++ + + +G F P + MS++ AK++
Sbjct: 102 TFIDAKTAERVINVYNGQCF-PGSEQCMSLSFAKYK 136
>gi|393218616|gb|EJD04104.1| splicing factor, CC1-like protein [Fomitiporia mediterranea MF3/22]
Length = 464
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 99/229 (43%), Gaps = 33/229 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T ++ +VF G + + ++ D TG KG A + Y +
Sbjct: 240 LYVGSLHFNLTESDIKQVFEPFGELD--------FVDLHKDSATGRSKGYAFIHYKRAED 291
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQ--VDSKKKKKLKKVEEKML-- 406
+A + ++G F G+ ++ +KG+ I+ Q +D L + L
Sbjct: 292 AKMALEQMEG--FELAGR---TLRVNTVHEKGQTRISTQDSLDESGGGNLNAASRQALMQ 346
Query: 407 GWGGRDDAKLT-------------IPATVILRFMFTPAEMRADENLRSELEADVQEEC-V 452
D A +T +V++R MF P E + +L DV+ EC
Sbjct: 347 KLARIDSAPVTQQPIMKPTVAQPMTSKSVLMRNMFDPEE-ETEPAWDKDLAEDVKTECQA 405
Query: 453 KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K G V +KV E +G + V+F A+ I +NGRWFGG+QI A+
Sbjct: 406 KYGRVQHIKV-EKDSEGEIYVQFDTVDAAKAAINGLNGRWFGGKQISAT 453
>gi|167518462|ref|XP_001743571.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777533|gb|EDQ91149.1| predicted protein [Monosiga brevicollis MX1]
Length = 652
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 45/79 (56%), Gaps = 2/79 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L MF PA+ + N ++ +V EEC K GPV + V + QG + ++F D + A
Sbjct: 550 LVLHHMFDPAK-ETEANWDKDMAEEVLEECTKSGPVFHI-VVDKTSQGDIYLKFSDTRSA 607
Query: 482 QKCIELMNGRWFGGRQIHA 500
+MNGR+F GRQI A
Sbjct: 608 AHASSVMNGRFFDGRQITA 626
>gi|312091924|ref|XP_003147155.1| hypothetical protein LOAG_11589 [Loa loa]
Length = 338
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 53/96 (55%), Gaps = 1/96 (1%)
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
+++V V++ G+P + + +VFS G I + T +PRIKIY D+ET KG+ +
Sbjct: 37 QIEVRETVFIQGIPVTANEQFIADVFSTQGDIARNDRTGEPRIKIYTDRETNQPKGECTI 96
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
T++ + + +G F P + MS++ AK++
Sbjct: 97 TFVDAKTAERVINVYNGQCF-PGSEQCMSLSFAKYK 131
>gi|224109048|ref|XP_002315063.1| predicted protein [Populus trichocarpa]
gi|222864103|gb|EEF01234.1| predicted protein [Populus trichocarpa]
Length = 387
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N +YV+ LP+ + E F G++K+D T +P+I +Y DK T KGDA VTY +
Sbjct: 12 NGTIYVSNLPEGTDDSMLAEHFGTIGLLKKDKRTGRPKIWLYRDKMTNEPKGDATVTY-E 70
Query: 348 EPSVALAT 355
+P ALA
Sbjct: 71 DPHAALAA 78
>gi|195567072|ref|XP_002107096.1| GD17272 [Drosophila simulans]
gi|194204495|gb|EDX18071.1| GD17272 [Drosophila simulans]
Length = 406
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK + T KP+I +Y +KETG KG+A VTY +
Sbjct: 119 IFVSGMDPSTTEQDIETHFGAIGIIKANKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 178
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 179 AQSAIEWFDGRDFNGNA-IKVSLAQ 202
>gi|156363387|ref|XP_001626026.1| predicted protein [Nematostella vectensis]
gi|156212886|gb|EDO33926.1| predicted protein [Nematostella vectensis]
Length = 468
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 1/79 (1%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
+L MF P + R D ++ DV EEC K GP+ + V +N PQG+V V+ A
Sbjct: 365 MLTNMFDPTKER-DAGWDLDIRDDVLEECNKFGPIVHIHVDKNSPQGIVYVKCATPDIAI 423
Query: 483 KCIELMNGRWFGGRQIHAS 501
+ ++GRWF G+QI A+
Sbjct: 424 SASKSLHGRWFAGKQIIAA 442
>gi|195037871|ref|XP_001990384.1| GH19314 [Drosophila grimshawi]
gi|193894580|gb|EDV93446.1| GH19314 [Drosophila grimshawi]
Length = 332
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 70/135 (51%), Gaps = 10/135 (7%)
Query: 280 DSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
D + L+++T ++V G+ ++T ++V F K G IK D T KP+I +Y +K TG KG
Sbjct: 111 DEIYALRLDT-IFVLGMRQNLTKNDIVVFFGKIGRIKMDELTMKPKIFVYKNKLTGRSKG 169
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQ---KGERFIAKQ---VDSK 393
A +TY + A Q L+G+ F G + +SV A K RF Q ++ +
Sbjct: 170 QATITYTNAYAAQAAIQYLNGSKFL--GHV-LSVVPAYLSTRPGKSVRFCYAQETGIEQQ 226
Query: 394 KKKKLKKVEEKMLGW 408
++++ KK + W
Sbjct: 227 RRQRQKKWKPAYDNW 241
>gi|343426615|emb|CBQ70144.1| related to pre-mRNA splicing factor U2AF large chain [Sporisorium
reilianum SRZ2]
Length = 710
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 96/250 (38%), Gaps = 47/250 (18%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
PDS F+ ++V GLP +T ++++E+ G ++ + D TG K
Sbjct: 444 PDSPFK------IFVGGLPTYLTDDQVIELLQAFGELRA--------FNLVKDTGTGASK 489
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL 398
G A Y+ LA Q L+G D + + E+K + A +S +
Sbjct: 490 GFAFCEYVDTALTDLACQGLNGMELG-DRNLVVQRASVGSEKKAQALAATGANS-GALGI 547
Query: 399 KKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
V + G D + +++ M TP E++ DE +E D+++EC K G V
Sbjct: 548 AAVPSSVQQSAGEDGDAGEPTSCMVMLNMVTPEELQDDEEYADIVE-DIRDECTKYGAVT 606
Query: 459 SVKV------------------------------CENHPQGVVLVRFKDRKDAQKCIELM 488
V+V E G V VR+ + + + + +
Sbjct: 607 DVRVPRPAKESKGAAAHQWKRSQDESGAEGDKPDAEREGVGRVYVRYGETEHCAQALRAI 666
Query: 489 NGRWFGGRQI 498
GR FGGR +
Sbjct: 667 AGRQFGGRTV 676
>gi|443921112|gb|ELU40879.1| splicing factor, CC1-like family protein [Rhizoctonia solani AG-1
IA]
Length = 399
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 106/248 (42%), Gaps = 41/248 (16%)
Query: 278 PPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMK 337
PP + +YV L ++T ++ +VF +P + + ++ D TG
Sbjct: 159 PPGVSHPHAGSMQLYVGSLHFNLTESDIRQVF--------EPFGELDFVDLHRDPATGKS 210
Query: 338 KGDALVTYLKEPSVALATQLLDGTPF------------RPDGKIPMSVT-QAKFEQKGER 384
KG + Y + +A + ++G + G + +S Q E G
Sbjct: 211 KGYCFIQYKRPEDARMALEQMEGFELAGRQLRVNTVHDKGQGTVRISTAPQDSLEDTGGV 270
Query: 385 FIAKQVDSKKKKKLKKVEE-----KMLGWGGRDDAKLTIPAT-----VILRFMFTPAEMR 434
+ +KL + E+ ML K IP T V+LR MF P E
Sbjct: 271 LNNSTSRHQLMQKLARTEQPSKNNTML-------MKSNIPQTLSSRCVLLRNMFDPDE-E 322
Query: 435 ADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWF 493
+ + +L DV+ EC K G V +KV E +G + ++F+ + A+K I+ +NGRWF
Sbjct: 323 TERDWDKDLADDVRGECEEKYGKVLDLKV-EKESEGEIYIKFESVESAEKAIKGLNGRWF 381
Query: 494 GGRQIHAS 501
GG+Q+ AS
Sbjct: 382 GGKQVTAS 389
>gi|325188176|emb|CCA22716.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 401
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 61/102 (59%), Gaps = 14/102 (13%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCE--NHPQGV-VLVRFKD 477
++L+ M TP E+ D++L SE V+EEC K GPV + + + +HP+ + + V F++
Sbjct: 310 ILLQNMVTPDEV--DDHLGSE----VKEECSQKYGPVRNCIIYKVVSHPEAIRIFVEFEN 363
Query: 478 RKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEA 519
+DA + + +NGR+FGGR++ A+ N + R LD A
Sbjct: 364 VQDADRAVAGLNGRFFGGRKVLATN----YNESKFRRLDFSA 401
>gi|297852730|ref|XP_002894246.1| hypothetical protein ARALYDRAFT_891960 [Arabidopsis lyrata subsp.
lyrata]
gi|297340088|gb|EFH70505.1| hypothetical protein ARALYDRAFT_891960 [Arabidopsis lyrata subsp.
lyrata]
Length = 314
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
N VYV+ LP + E F G++K D T P++ +Y DKET KGDA VTY
Sbjct: 11 TNGSVYVSNLPLGTDENMLAEYFGTIGLLKRDKRTGTPKVWLYRDKETDEPKGDATVTY- 69
Query: 347 KEPSVALA 354
++P ALA
Sbjct: 70 EDPHAALA 77
>gi|194893921|ref|XP_001977968.1| GG17948 [Drosophila erecta]
gi|190649617|gb|EDV46895.1| GG17948 [Drosophila erecta]
Length = 406
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 1/85 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ T +++ F GIIK + T KP+I +Y +KETG KG+A VTY +
Sbjct: 119 IFVSGMDPSTTEQDIETHFGAIGIIKANKRTMKPKIWLYKNKETGASKGEATVTYDDTNA 178
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQ 375
A + DG F + I +S+ Q
Sbjct: 179 AQSAIEWFDGRDFNGNA-IKVSLAQ 202
>gi|170593403|ref|XP_001901454.1| Zn-finger in Ran binding protein and others containing protein
[Brugia malayi]
gi|158591521|gb|EDP30134.1| Zn-finger in Ran binding protein and others containing protein
[Brugia malayi]
Length = 578
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 53/96 (55%), Gaps = 1/96 (1%)
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
+++V V++ G+P + + +VFS G I + T +PRIKIY D+ET KG+ +
Sbjct: 278 QIEVRETVFIQGIPVTANEQFIADVFSTQGDIARNERTGEPRIKIYTDRETNQPKGECTI 337
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
T++ + + +G F P + MS++ AK++
Sbjct: 338 TFVDAKTAERVINVYNGQCF-PGSEQCMSLSFAKYK 372
>gi|444726953|gb|ELW67463.1| RNA-binding protein EWS [Tupaia chinensis]
Length = 107
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 44/67 (65%)
Query: 278 PPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMK 337
PP E ++ +YV GL D+VT++++ + F +CG++K + T++P I + +D+ETG
Sbjct: 30 PPIDLDEDSDHSAIYVQGLNDNVTLDDLADFFKQCGVVKMNKRTEQPMIHVSLDEETGKP 89
Query: 338 KGDALVT 344
KG A V+
Sbjct: 90 KGHATVS 96
>gi|296411655|ref|XP_002835545.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295629331|emb|CAZ79702.1| unnamed protein product [Tuber melanosporum]
Length = 533
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L+ MF PAE D ++ +LE DV+ EC K G V + + N QG + ++F+ +
Sbjct: 436 VVLKNMFDPAEETGDAWIK-DLEDDVKTECENKYGHVVHIALDPNS-QGEIYIKFEKVQG 493
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+K I+ +NGR+FGGR+I AS
Sbjct: 494 GEKAIQGLNGRFFGGRRISAS 514
>gi|440632886|gb|ELR02805.1| hypothetical protein GMDG_05742 [Geomyces destructans 20631-21]
Length = 575
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 19/123 (15%)
Query: 396 KKLKKVEEKMLGWGGRDDAK-LTIPAT---------------VILRFMFTPAEMRADENL 439
+KL + +E GRDD + + P T V+LR MF PAE EN
Sbjct: 438 RKLARTDEPAPATNGRDDRREVAKPRTEVKPPPINVNTASRCVVLRNMFDPAE-ETGENW 496
Query: 440 RSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
ELE DV+ E K G V + + N QG + ++F + + I+ +NGR+FGGR I
Sbjct: 497 EKELEDDVRAEAEEKYGHVVHISLDPNS-QGDIYLKFDRVQGGENAIQGLNGRYFGGRMI 555
Query: 499 HAS 501
A+
Sbjct: 556 SAT 558
>gi|449494688|ref|XP_004159619.1| PREDICTED: uncharacterized LOC101219150 [Cucumis sativus]
Length = 396
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N +Y+ LP + E F G++K+D T +P+I +Y DK T KGDA VTY +
Sbjct: 12 NGSIYICNLPYGTDENMLAEYFGTIGVLKKDKRTGRPKIWLYRDKSTNEPKGDATVTY-E 70
Query: 348 EPSVALAT 355
+P ALA
Sbjct: 71 DPHAALAA 78
>gi|115471625|ref|NP_001059411.1| Os07g0296200 [Oryza sativa Japonica Group]
gi|34394336|dbj|BAC84891.1| RRM-containing RNA-binding protein-like [Oryza sativa Japonica
Group]
gi|113610947|dbj|BAF21325.1| Os07g0296200 [Oryza sativa Japonica Group]
gi|215693885|dbj|BAG89084.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222636864|gb|EEE66996.1| hypothetical protein OsJ_23906 [Oryza sativa Japonica Group]
Length = 394
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + E F G++K+D T +P+I IY DK T KGDA VTY
Sbjct: 11 NGSVYVCNLPPGTDETMLAEYFGTIGLLKKDKRTGRPKIWIYRDKVTNEPKGDATVTYED 70
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 71 PHAASAAVEWFNNKDF 86
>gi|328848768|gb|EGF97966.1| hypothetical protein MELLADRAFT_84111 [Melampsora larici-populina
98AG31]
Length = 695
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRK 479
+V+L+ MF P E + EL DV+ EC K GPV ++ + + G + + F
Sbjct: 602 SVLLKNMFNPEE-ETERGWDMELRDDVKGECEEKYGPVLAIAIEKESMGGEIYITFDSVA 660
Query: 480 DAQKCIELMNGRWFGGRQIHAS 501
AQK I +N RWFGGRQI A+
Sbjct: 661 SAQKAIAGLNNRWFGGRQITAA 682
>gi|302781272|ref|XP_002972410.1| hypothetical protein SELMODRAFT_97426 [Selaginella moellendorffii]
gi|302804981|ref|XP_002984242.1| hypothetical protein SELMODRAFT_156345 [Selaginella moellendorffii]
gi|300148091|gb|EFJ14752.1| hypothetical protein SELMODRAFT_156345 [Selaginella moellendorffii]
gi|300159877|gb|EFJ26496.1| hypothetical protein SELMODRAFT_97426 [Selaginella moellendorffii]
Length = 370
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
+ + VYV LP + E F + G+IK+D +P+I IY DK + KGDA VTY
Sbjct: 9 QATSSVYVCNLPAGTDEALLAEHFGQIGVIKKDKRHGRPKIWIYRDKASNEPKGDATVTY 68
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
+ + A + D F + I +S+ ++K
Sbjct: 69 EDPHAASAAVEWFDNKEFHGN-LIRVSIAESK 99
>gi|429243189|ref|NP_594422.2| RNA-binding protein Rsd1 (predicted) [Schizosaccharomyces pombe
972h-]
gi|380865416|sp|O13845.2|RSD1_SCHPO RecName: Full=RNA-binding protein rsd1
gi|347834186|emb|CAB10118.2| RNA-binding protein Rsd1 (predicted) [Schizosaccharomyces pombe]
Length = 603
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
+L MF PAE N ELE DV+EEC K G V + V N G + V+F++
Sbjct: 508 ALLENMFNPAE-ETSPNWVQELEQDVKEECDEKYGKVVHIAVVPNE-LGQIFVKFENADF 565
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
A+K I ++ RWFGGR I AS
Sbjct: 566 AEKAITGLHQRWFGGRTIKAS 586
>gi|452820320|gb|EME27364.1| splicing factor U2AF 35 kDa subunit [Galdieria sulphuraria]
Length = 285
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 47/77 (61%), Gaps = 7/77 (9%)
Query: 430 PAEMRADENLRSELE---ADVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQK 483
P EM ++E+++ + DV +E K G ++ + VCEN H G V ++FKD AQ+
Sbjct: 69 PPEM-SEEDIKYHFDDFYEDVYDELSKYGEIEEMHVCENMSEHLTGNVYIKFKDEDAAQR 127
Query: 484 CIELMNGRWFGGRQIHA 500
++ +NGR++ GR +HA
Sbjct: 128 ALQAVNGRYYAGRMVHA 144
>gi|242050110|ref|XP_002462799.1| hypothetical protein SORBIDRAFT_02g032170 [Sorghum bicolor]
gi|241926176|gb|EER99320.1| hypothetical protein SORBIDRAFT_02g032170 [Sorghum bicolor]
Length = 392
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 39/76 (51%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + + E F G++K+D T +P+I IY DK T KGDA VTY
Sbjct: 11 NGCVYVCNLPPGTDEDMLAEYFGTIGLLKKDKRTGRPKIWIYRDKVTNEPKGDATVTYED 70
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 71 PHAASAAVEWFNNKDF 86
>gi|393908436|gb|EJD75057.1| Zn-finger in Ran binding protein [Loa loa]
Length = 564
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 53/96 (55%), Gaps = 1/96 (1%)
Query: 284 ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALV 343
+++V V++ G+P + + +VFS G I + T +PRIKIY D+ET KG+ +
Sbjct: 263 QIEVRETVFIQGIPVTANEQFIADVFSTQGDIARNDRTGEPRIKIYTDRETNQPKGECTI 322
Query: 344 TYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFE 379
T++ + + +G F P + MS++ AK++
Sbjct: 323 TFVDAKTAERVINVYNGQCF-PGSEQCMSLSFAKYK 357
>gi|255559159|ref|XP_002520601.1| RNA binding protein, putative [Ricinus communis]
gi|223540200|gb|EEF41774.1| RNA binding protein, putative [Ricinus communis]
Length = 483
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 3/117 (2%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
N VYV LP + + E F G++K+D T +P+I +Y DK T KGDA VTY
Sbjct: 12 ANGSVYVCNLPQGTDEDMLAEYFGTIGLLKKDKRTGRPKIWLYRDKLTNEPKGDATVTYE 71
Query: 347 KEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEE 403
+ A + + F G + + V A+ + K E +VD +EE
Sbjct: 72 DPHAAQAAIEWFNNKDFH--GNL-IGVFMAESKNKDEHAYNSEVDPNAVGDFGGLEE 125
>gi|148674761|gb|EDL06708.1| mCG51545 [Mus musculus]
Length = 364
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V GL +++TV+ + + F GI K + +T++ I +Y D+ETG KG+A V++ PS
Sbjct: 267 IFVQGLGENLTVKSVADYFKHIGI-KTNKKTRQSMINLYTDRETGKLKGEATVSFDDPPS 325
Query: 351 VALATQLLDGTPF 363
AT DG F
Sbjct: 326 AKAATDWFDGKKF 338
>gi|430811054|emb|CCJ31450.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 484
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 49/94 (52%), Gaps = 15/94 (15%)
Query: 422 VILRFMFTPAEM-------------RADENLRSELEADVQEEC-VKIGPVDSVKVCENHP 467
V+L+ MF P E + +N ELE DV+ EC K G V + V EN P
Sbjct: 383 VLLKNMFNPQEYVFVLVFKFLIFFRESGDNWIRELEDDVKAECENKYGKVLHIHVEENSP 442
Query: 468 QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
G V ++F + ++ I+ +NGRWFGGR I AS
Sbjct: 443 -GDVYIKFDNVVAGERAIQGLNGRWFGGRTISAS 475
>gi|215820612|ref|NP_001135965.1| RNA binding motif protein 39 [Acyrthosiphon pisum]
Length = 501
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 114/296 (38%), Gaps = 54/296 (18%)
Query: 269 QVEKKEANKPPDSWFELKVNTH-----VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKK 323
Q + E N+ +S + TH +YV L ++T E + +F G +
Sbjct: 213 QPTQAEKNRMANSMPNMVQRTHYGPMKLYVGSLHYNITEEMLRGIFEPFGHVDN------ 266
Query: 324 PRIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE 383
I++ +D ETG KG +TY A + L+G F G+ PM V
Sbjct: 267 --IQLMMDTETGRSKGYGFLTYRNAEDAKKALEHLNG--FEIAGR-PMKVGHVTENHSVY 321
Query: 384 RFIAKQVDSKKKK--------KLKKVEEKMLGWG---------------------GRDDA 414
A +VD + +L+ + + G G
Sbjct: 322 DKTAFEVDELDRAGYDLGATGRLQLMYKLAEGTGFPIPQAAANALQVASGVQAAPAAPTV 381
Query: 415 KLTIPATV---ILRFMFTPAEMRADENLR--SELEADVQEECVKIGPVDSVKVCENHPQG 469
++T P +L MF P + D N +E+ DV EEC K G V V V + PQG
Sbjct: 382 QVTPPIATQCFLLANMFDPNKEDVDSNTTWETEIRDDVIEECNKHGGVLHVYVDKASPQG 441
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRLEQF 525
V V+ + A + ++GRWFGGR I A+ V +L EAS L +
Sbjct: 442 NVYVKCTTIETALASVAALHGRWFGGRVITAA----YVPVTNYHNLFPEASSLTKL 493
>gi|170056366|ref|XP_001863997.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876094|gb|EDS39477.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 325
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 95/199 (47%), Gaps = 42/199 (21%)
Query: 346 LKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER--FIAKQVDSKKKKKLKKVEE 403
LK SV LA +LLD R + V +A+F+ KGE + ++ K+K+K+KK++E
Sbjct: 19 LKIESVDLALELLDNYDVRGHK---IRVQRAEFQLKGEYNPSLKPKIKKKEKEKMKKMQE 75
Query: 404 KMLG-----------WGGRDDA-KLTIPAT-------------------------VILRF 426
+L +G DDA K I A+ VI++
Sbjct: 76 AVLRNLTPPKRSCALYGADDDASKWQILASNEVASRLFDWRPEKMRGERSKHERIVIVKN 135
Query: 427 MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIE 486
+F P + +L E + D++EEC K G V V + HP+GV V D ++A ++
Sbjct: 136 LFEPELFDREVHLLLEYQNDLREECNKCGTCRRVLVFDRHPEGVAQVTMSDPEEADLVVK 195
Query: 487 LMNGRWFGGRQIHASEDDG 505
L++GR+FG R++ A DG
Sbjct: 196 LLHGRFFGKRKLAAEIWDG 214
>gi|401406015|ref|XP_003882457.1| GA11385, related [Neospora caninum Liverpool]
gi|325116872|emb|CBZ52425.1| GA11385, related [Neospora caninum Liverpool]
Length = 564
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 6/83 (7%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
V+L M TP+E + EL+ +V++EC K G + V+V + V F D A
Sbjct: 477 VLLTNMVTPSE------VDGELKDEVRDECSKFGGIKRVEVHTLKDTVRIFVEFSDLSGA 530
Query: 482 QKCIELMNGRWFGGRQIHASEDD 504
++ I ++GRWFGGRQI A+ D
Sbjct: 531 REAIPSLHGRWFGGRQIIANTYD 553
>gi|390179235|ref|XP_003736839.1| GA13199 [Drosophila pseudoobscura pseudoobscura]
gi|388859764|gb|EIM52912.1| GA13199 [Drosophila pseudoobscura pseudoobscura]
Length = 408
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 40/73 (54%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ +T +++ F K G IK D T KP+I +Y DK TG KG+A +TY+ S
Sbjct: 176 IFVLGMRPSITKNDIIVCFGKIGHIKVDEGTDKPKIFVYKDKLTGRSKGEATITYVSPLS 235
Query: 351 VALATQLLDGTPF 363
A + G F
Sbjct: 236 AHAAITAMSGRRF 248
>gi|356494997|ref|XP_003516367.1| PREDICTED: TATA-binding protein-associated factor 2N-like [Glycine
max]
Length = 224
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 34/58 (58%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
N VYV LP +VE F G+IK+D T +P+I +Y DKET KGDA VTY
Sbjct: 12 NGSVYVCNLPYGTNDIMLVEYFGTIGLIKKDKRTGRPKIWLYRDKETNEPKGDATVTY 69
>gi|195157032|ref|XP_002019400.1| GL12385 [Drosophila persimilis]
gi|194115991|gb|EDW38034.1| GL12385 [Drosophila persimilis]
Length = 410
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 40/73 (54%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ +T +++ F K G IK D T KP+I +Y DK TG KG+A +TY+ S
Sbjct: 178 IFVLGMRPSITKNDIIVCFGKIGHIKVDEGTDKPKIFVYKDKLTGRSKGEATITYVSPLS 237
Query: 351 VALATQLLDGTPF 363
A + G F
Sbjct: 238 AHAAITAMSGRRF 250
>gi|18403012|ref|NP_564565.1| TBP-associated factor 15 [Arabidopsis thaliana]
gi|13605793|gb|AAK32882.1|AF367295_1 At1g50300/F14I3_23 [Arabidopsis thaliana]
gi|20147137|gb|AAM10285.1| At1g50300/F14I3_23 [Arabidopsis thaliana]
gi|39545914|gb|AAR28020.1| TAF15 [Arabidopsis thaliana]
gi|332194413|gb|AEE32534.1| TBP-associated factor 15 [Arabidopsis thaliana]
Length = 372
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
N VYV+ LP + + F G++K D T P++ +Y DKET KGDA VTY
Sbjct: 6 TNGSVYVSNLPLGTDENMLADYFGTIGLLKRDKRTGTPKVWLYRDKETDEPKGDATVTY- 64
Query: 347 KEPSVALA 354
++P ALA
Sbjct: 65 EDPHAALA 72
>gi|402590758|gb|EJW84688.1| hypothetical protein WUBG_04401 [Wuchereria bancrofti]
Length = 477
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 93/228 (40%), Gaps = 36/228 (15%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP + E++ E+ S G +K + ++ TG+ KG A YL
Sbjct: 273 IFIGGLPSYLNAEQVKELLSSFGQLKA--------FNLVTEQSTGVSKGYAFAEYLDPSL 324
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD-SKKKKKLKKVEEKMLGWG 409
A L+G Q G++ + Q+ + + + + +
Sbjct: 325 TDQAIAGLNGM------------------QLGDKNLVVQLSCANARNNVAQNTFPQIQVA 366
Query: 410 GRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
G D + P T +L M T E++ DE LE D++EEC K G V S+++ + P
Sbjct: 367 GIDLSHGAGPPTEVLCLMNMVTEDELKDDEEYEDILE-DIREECAKYGIVKSLEIPRSVP 425
Query: 468 ------QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
G V V F +++ QK + GR F R + S D + H
Sbjct: 426 GVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYH 473
>gi|444708508|gb|ELW49571.1| RNA-binding protein EWS [Tupaia chinensis]
Length = 368
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 37/58 (63%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
N+ VYV GL D VT++++ + + G++K + T +P I IY KETG K DA V+Y
Sbjct: 186 NSAVYVQGLNDSVTLDDLADFCKQGGVVKMNKRTGQPMIHIYPGKETGKPKSDATVSY 243
>gi|387193898|gb|AFJ68729.1| rna binding motif protein 39b, partial [Nannochloropsis gaditana
CCMP526]
Length = 456
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 52/86 (60%), Gaps = 4/86 (4%)
Query: 416 LTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
+ IP+ ++L+ MF PAE + D+ +++ DV++E K G V V+V P G V +
Sbjct: 351 VGIPSRALLLKNMFDPAEEK-DDGWELDIQDDVEDEGRKFGEVRHVRVDVKGPGGHVYML 409
Query: 475 FKDRKD--AQKCIELMNGRWFGGRQI 498
FK+ ++ AQK + +NGRWF GR I
Sbjct: 410 FKEGEEEGAQKSAQALNGRWFAGRMI 435
>gi|226295167|gb|EEH50587.1| RNA splicing factor Pad-1 [Paracoccidioides brasiliensis Pb18]
Length = 600
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE D ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 505 VLLRNMFDPAEEEGDGWIK-ELEDDVRAECEEKYGHVVHIALDPNT-QGDIYLKFDRVQG 562
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 563 GENAIKGLNGRFFGGRQISA 582
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 61/219 (27%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V L + +E++ F K G +KE +I D+ +G KG V + E S
Sbjct: 207 VFVQQLAARLRTKELIAFFEKVGPVKE--------AQIVKDRVSGRSKGVGYVEFKDEES 258
Query: 351 VALATQL----LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKML 406
V LA QL L G P IA+ +++K ++ + E +
Sbjct: 259 VPLAIQLTGQKLLGIPI----------------------IAQLTEAEKNRQARNPEANVS 296
Query: 407 GWGGRDDAKLTIPATVI----LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
G + +IP + + F T E+D+Q+ G +D V++
Sbjct: 297 G------NQNSIPFHRLYVGNIHFSIT--------------ESDLQKVFEPFGELDFVQL 336
Query: 463 CE---NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
+ +G V+F+D A++ +E MNG GR I
Sbjct: 337 QKEEGGRSRGYGFVQFRDPNQAREALEKMNGFDLAGRPI 375
>gi|443898020|dbj|GAC75358.1| splicing factor U2AF, large subunit [Pseudozyma antarctica T-34]
Length = 699
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 99/250 (39%), Gaps = 51/250 (20%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
PDS F+ ++V GLP +T ++++E+ G ++ + D T K
Sbjct: 434 PDSPFK------IFVGGLPTYLTDDQVIELLQAFGELRS--------FNLVKDPATNASK 479
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKF-EQKGERFIAKQVDSKKKKK 397
G A Y+ LA Q L+G G + V +A +K + IA +
Sbjct: 480 GFAFCEYVDTALTDLACQGLNGMEL---GDRNLVVQRASVGSEKKAQAIAAYGANVGALG 536
Query: 398 LKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPV 457
+ ++ G GG DA V+L M TP E++ DE +E D+++EC K G V
Sbjct: 537 VPSSVQQFAGAGG--DAGEPTSCMVMLN-MVTPEELQDDEEYADIVE-DIRDECTKFGTV 592
Query: 458 DSVKV-----------------------------CENHPQGVVLVRFKDRKDAQKCIELM 488
+ V+V E G V VR+ + + ++ +
Sbjct: 593 NDVRVPRPAKESKGAAAHQWKRSQNDEAADAGKPSEREGVGRVYVRYAETDQCAQALKSI 652
Query: 489 NGRWFGGRQI 498
GR FGGR +
Sbjct: 653 AGRQFGGRTV 662
>gi|295664082|ref|XP_002792593.1| RNA-binding protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278707|gb|EEH34273.1| RNA-binding protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 596
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE D ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 501 VLLRNMFDPAEEEGDGWIK-ELEDDVRAECEEKYGHVVHIALDPN-TQGDIYLKFDRVQG 558
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 559 GENAIKGLNGRFFGGRQISA 578
>gi|225677677|gb|EEH15961.1| RNA-binding protein rsd1 [Paracoccidioides brasiliensis Pb03]
Length = 600
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE D ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 505 VLLRNMFDPAEEEGDGWIK-ELEDDVRAECEEKYGHVVHIALDPNT-QGDIYLKFDRVQG 562
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 563 GENAIKGLNGRFFGGRQISA 582
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 61/219 (27%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V L + +E++ F K G +KE +I D+ +G KG V + E S
Sbjct: 207 VFVQQLAARLRTKELIAFFEKVGPVKE--------AQIVKDRVSGRSKGVGYVEFKDEES 258
Query: 351 VALATQL----LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKML 406
V LA QL L G P IA+ +++K ++ + E +
Sbjct: 259 VPLAIQLTGQKLLGIPI----------------------IAQLTEAEKNRQARNPEANVS 296
Query: 407 GWGGRDDAKLTIPATVI----LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
G + +IP + + F T E+D+Q+ G +D V++
Sbjct: 297 G------NQNSIPFHRLYVGNIHFSIT--------------ESDLQKVFEPFGELDFVQL 336
Query: 463 CE---NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
+ +G V+F+D A++ +E MNG GR I
Sbjct: 337 QKEEGGRSRGYGFVQFRDPNQAREALEKMNGFDLAGRPI 375
>gi|255941952|ref|XP_002561745.1| Pc16g14480 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211586368|emb|CAP94118.1| Pc16g14480 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 562
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE EN ELE DV+ EC K G V + + + QG + ++F +
Sbjct: 467 VLLRNMFDPAE-ETGENWVKELEEDVRSECEKKYGHVVHISL-DASSQGDIYLKFDRVQG 524
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGG+QI A
Sbjct: 525 GENAIKGLNGRYFGGKQITA 544
>gi|388856534|emb|CCF49840.1| related to pre-mRNA splicing factor U2AF large chain [Ustilago
hordei]
Length = 718
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 92/241 (38%), Gaps = 46/241 (19%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V GLP +T ++++E+ G ++ + D G KG A Y+
Sbjct: 454 IFVGGLPTYLTDDQVIELLQAFGELRA--------FNLVKDTANGASKGFAFCEYVDTAL 505
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKK--KKKLKKVEEKMLGW 408
LA Q L+G D + + E+K + A ++ + ++ G
Sbjct: 506 TDLACQGLNGMELG-DRNLVVQRASVGSEKKAQAIAATGANAGALGDAGMPSSVQQFAGE 564
Query: 409 GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
GG DA V+L M TP E++ DE +E D++EEC K G V V+V
Sbjct: 565 GG--DAGEPRSCMVMLN-MVTPEELQDDEEYADIVE-DIREECTKYGTVTDVRVPRPAKE 620
Query: 463 -------------------------CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
E G V VR+ + D + + + GR FGGR
Sbjct: 621 SKGAAAHQWKRTQDESAASGGEKPATEREGVGRVYVRYAETGDCAQALRAIAGRQFGGRT 680
Query: 498 I 498
+
Sbjct: 681 V 681
>gi|303281406|ref|XP_003059995.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458650|gb|EEH55947.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 206
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 47/94 (50%), Gaps = 14/94 (14%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV-------VLV 473
++LR M P E+ D LE +V EEC K G V V + E G + V
Sbjct: 102 VLLLRNMIGPGEVDED------LEDEVAEECEKHGAVHRVMIFEVTESGYDPREAVRIFV 155
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHASE-DDGL 506
F +DA KC M+GR+FGGR + AS D+GL
Sbjct: 156 EFTKPEDAAKCANEMDGRFFGGRTVAASHYDEGL 189
>gi|195403125|ref|XP_002060145.1| GJ18507 [Drosophila virilis]
gi|194140989|gb|EDW57415.1| GJ18507 [Drosophila virilis]
Length = 386
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIK--EDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
++V+G+ T +++ F GIIK +D T KP+I +Y +KETG KG+A VTY
Sbjct: 121 IFVSGMDPSTTEQDIETHFGAIGIIKASKDKRTMKPKIWLYKNKETGTSKGEATVTYDDV 180
Query: 349 PSVALATQLLDGTPF 363
+ A Q DG F
Sbjct: 181 NAAQSAIQWFDGNDF 195
>gi|393909510|gb|EJD75480.1| hypothetical protein LOAG_17389 [Loa loa]
Length = 502
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 93/228 (40%), Gaps = 36/228 (15%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP + E++ E+ S G +K + ++ TG+ KG A YL
Sbjct: 298 IFIGGLPSYLNAEQVKELLSSFGQLKA--------FNLVTEQSTGVSKGYAFAEYLDPSL 349
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD-SKKKKKLKKVEEKMLGWG 409
A L+G Q G++ + Q+ + + + + +
Sbjct: 350 TDQAIAGLNGM------------------QLGDKNLVVQLSCANARNNVAQNTFPQIQVA 391
Query: 410 GRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
G D + P T +L M T E++ DE LE D++EEC K G V S+++ + P
Sbjct: 392 GIDLSHGAGPPTEVLCLMNMVTEDELKDDEEYEDILE-DIREECAKYGIVKSLEIPRSVP 450
Query: 468 ------QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
G V V F +++ QK + GR F R + S D + H
Sbjct: 451 GVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYH 498
>gi|170575889|ref|XP_001893425.1| U2 auxiliary factor 65 kDa subunit [Brugia malayi]
gi|158600599|gb|EDP37742.1| U2 auxiliary factor 65 kDa subunit, putative [Brugia malayi]
Length = 502
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 93/228 (40%), Gaps = 36/228 (15%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP + E++ E+ S G +K + ++ TG+ KG A YL
Sbjct: 298 IFIGGLPSYLNAEQVKELLSSFGQLKA--------FNLVTEQSTGVSKGYAFAEYLDPSL 349
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD-SKKKKKLKKVEEKMLGWG 409
A L+G Q G++ + Q+ + + + + +
Sbjct: 350 TDQAIAGLNGM------------------QLGDKNLVVQLSCANARNNVAQNTFPQIQVA 391
Query: 410 GRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
G D + P T +L M T E++ DE LE D++EEC K G V S+++ + P
Sbjct: 392 GIDLSHGAGPPTEVLCLMNMVTEDELKDDEEYEDILE-DIREECAKYGIVKSLEIPRSVP 450
Query: 468 ------QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
G V V F +++ QK + GR F R + S D + H
Sbjct: 451 GVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYH 498
>gi|359475014|ref|XP_002279887.2| PREDICTED: RNA-binding protein rsd1-like [Vitis vinifera]
Length = 609
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 16/115 (13%)
Query: 401 VEEKMLGWGGRDDAKLTIPATVI--------------LRFMFTPAEMRADENLRSELEAD 446
V+ + G G A L +PA + L+ MF P ++ + + +++ D
Sbjct: 482 VQAPVPGLAGLPGAGLQVPAVTVPSIDTIGVPSECLMLKNMFDP-KLETEPDFDLDIKED 540
Query: 447 VQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
VQ+EC K G V + V +N G V +RF++ + A ++GRWF G+ I A+
Sbjct: 541 VQDECSKFGTVKHIYVDKNS-AGFVFLRFENTQAAISAQRALHGRWFAGKMITAT 594
>gi|425781538|gb|EKV19498.1| Ribosome biogenesis (Nop4), putative [Penicillium digitatum PHI26]
gi|425782769|gb|EKV20659.1| Ribosome biogenesis (Nop4), putative [Penicillium digitatum Pd1]
Length = 722
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 94/210 (44%), Gaps = 30/210 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V LP T E + E FS+ IIK + DKET + KG VT+
Sbjct: 38 LFVRSLPTSATTESLAEHFSQSYIIKH--------AVVVSDKETKVSKGFGFVTFADVED 89
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
A + +G+ F DGKI +++ + I + V + ++ KK +E+ G G
Sbjct: 90 AESALKEFNGSKF--DGKIIRVDYAESRKREIDEKIGRSVPTAASRESKKQKEEERGQG- 146
Query: 411 RDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ-- 468
+P +I+R + P ++ E+L + G V V + + + +
Sbjct: 147 -------LPPKLIVRNL--PWSVKEPEDLNVLFRS--------FGKVKFVTLPKRNGKLS 189
Query: 469 GVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
G V + RK+A+K ++++NG+ GRQI
Sbjct: 190 GFGFVTMRGRKNAEKALQMINGKEIDGRQI 219
>gi|5734788|gb|AAD50053.1|AC007980_18 Similar to RNA-binding proteins [Arabidopsis thaliana]
Length = 211
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 287 VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYL 346
N VYV+ LP + + F G++K D T P++ +Y DKET KGDA VTY
Sbjct: 6 TNGSVYVSNLPLGTDENMLADYFGTIGLLKRDKRTGTPKVWLYRDKETDEPKGDATVTY- 64
Query: 347 KEPSVALA 354
++P ALA
Sbjct: 65 EDPHAALA 72
>gi|401411525|ref|XP_003885210.1| hypothetical protein NCLIV_056060 [Neospora caninum Liverpool]
gi|325119629|emb|CBZ55182.1| hypothetical protein NCLIV_056060 [Neospora caninum Liverpool]
Length = 642
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTP--AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
T V+L MF P +++ D + +L DV++EC K G V+ V + E + G V +R
Sbjct: 544 TGSCNVVLHNMFAPKDVDLKEDPHFFLDLGDDVRDECKKFGSVEKVWIDERNVDGNVWIR 603
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
F A+ +NGR+F G+ I A
Sbjct: 604 FAHPDQARAAFGALNGRYFAGKPISA 629
>gi|357121679|ref|XP_003562545.1| PREDICTED: uncharacterized protein LOC100833358 [Brachypodium
distachyon]
Length = 389
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 37/76 (48%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N +YV LP + E F G++K+D T P+I IY DK T KGDA VTY
Sbjct: 11 NGSIYVCNLPPGTDETMLAEYFGTIGLLKKDKRTGHPKIWIYRDKVTNEPKGDATVTYED 70
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 71 PHAASAAVEWFNNKDF 86
>gi|218199460|gb|EEC81887.1| hypothetical protein OsI_25698 [Oryza sativa Indica Group]
Length = 433
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + E F G++K+D T +P+I IY DK T KGDA VTY
Sbjct: 11 NGSVYVCNLPPGTDETMLAEYFGTIGLLKKDKRTGRPKIWIYRDKVTNEPKGDATVTYED 70
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 71 PHAASAAVEWFNNKDF 86
>gi|323507689|emb|CBQ67560.1| related to RNA-binding region containing protein 2 [Sporisorium
reilianum SRZ2]
Length = 659
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 49/81 (60%), Gaps = 3/81 (3%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRK 479
+++L+ MF PAE + + ++L DV+EEC K G V S+ V E G + V F D
Sbjct: 568 SLLLKNMFNPAE-ETEPDWDTDLAEDVKEECQAKYGRVTSIHV-EKESAGEIYVTFADLD 625
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
++K ++ +NGR+FGG+ I A
Sbjct: 626 ASRKALDGLNGRFFGGKPISA 646
>gi|212531765|ref|XP_002146039.1| RNA splicing factor (Pad-1), putative [Talaromyces marneffei ATCC
18224]
gi|210071403|gb|EEA25492.1| RNA splicing factor (Pad-1), putative [Talaromyces marneffei ATCC
18224]
Length = 562
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 467 VLLRNMFDPAEEEGEAWIK-ELEDDVRAECEEKYGHVVHISLDPNS-QGDIYLKFDRVQG 524
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 525 GENAIKGLNGRFFGGRQISA 544
>gi|115397599|ref|XP_001214391.1| hypothetical protein ATEG_05213 [Aspergillus terreus NIH2624]
gi|114192582|gb|EAU34282.1| hypothetical protein ATEG_05213 [Aspergillus terreus NIH2624]
Length = 568
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 473 VLLRNMFDPAEEEGESWIK-ELEDDVRAECEDKYGHVVHIALDPNS-QGDIYLKFDRVQG 530
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 531 GENAIKGLNGRFFGGRQITA 550
>gi|70996384|ref|XP_752947.1| RNA splicing factor (Pad-1) [Aspergillus fumigatus Af293]
gi|66850582|gb|EAL90909.1| RNA splicing factor (Pad-1), putative [Aspergillus fumigatus Af293]
gi|159131701|gb|EDP56814.1| RNA splicing factor (Pad-1), putative [Aspergillus fumigatus A1163]
Length = 566
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 471 VLLRNMFDPAEEEGEAWIK-ELEDDVRAECEDKYGHVVHIALDPNS-QGDIYLKFDRVQG 528
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 529 GENAIKGLNGRFFGGRQITA 548
>gi|297744588|emb|CBI37850.3| unnamed protein product [Vitis vinifera]
Length = 427
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 57/115 (49%), Gaps = 16/115 (13%)
Query: 401 VEEKMLGWGGRDDAKLTIPAT--------------VILRFMFTPAEMRADENLRSELEAD 446
V+ + G G A L +PA ++L+ MF P ++ + + +++ D
Sbjct: 300 VQAPVPGLAGLPGAGLQVPAVTVPSIDTIGVPSECLMLKNMFDP-KLETEPDFDLDIKED 358
Query: 447 VQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
VQ+EC K G V + V +N G V +RF++ + A ++GRWF G+ I A+
Sbjct: 359 VQDECSKFGTVKHIYVDKNS-AGFVFLRFENTQAAISAQRALHGRWFAGKMITAT 412
>gi|261191422|ref|XP_002622119.1| RNA splicing factor [Ajellomyces dermatitidis SLH14081]
gi|239589885|gb|EEQ72528.1| RNA splicing factor [Ajellomyces dermatitidis SLH14081]
gi|239612708|gb|EEQ89695.1| RNA splicing factor [Ajellomyces dermatitidis ER-3]
gi|327351781|gb|EGE80638.1| RNA splicing factor [Ajellomyces dermatitidis ATCC 18188]
Length = 583
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 488 VLLRNMFDPAEEEG-ESWVKELEDDVRAECEDKYGHVVHIALDPN-TQGDIYLKFDRVQG 545
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 546 GENAIKGLNGRFFGGRQISA 565
>gi|410915937|ref|XP_003971443.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 2
[Takifugu rubripes]
Length = 598
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 111/244 (45%), Gaps = 34/244 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYTGSQPPVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D +G+ +G A VT+ + + A +L + RP +I + ++ A F+
Sbjct: 193 LMMDPLSGLNRGYAFVTFCSKEAAQEAVKLCNNHEIRPGKQIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILR---FMF-------TPAEMRA-- 435
SK K+++ VEE G D L + + F F T A+++
Sbjct: 249 SIPKSKTKEQI--VEEFSKVTEGLSDVILYLQPQDKSKNRGFCFLEYEDHKTAAQVKVLF 306
Query: 436 DENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFG 494
NL + + ++ E+ + G ++ VK +++ + F++R A K +E MNG+
Sbjct: 307 VRNLANSVTEEILEKSFSEYGNLERVKKLKDY----AFIHFEERDGAVKALEEMNGKELE 362
Query: 495 GRQI 498
G I
Sbjct: 363 GEPI 366
>gi|405120919|gb|AFR95689.1| RNA splicing factor Pad-1 [Cryptococcus neoformans var. grubii H99]
Length = 588
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 113/243 (46%), Gaps = 34/243 (13%)
Query: 279 PDSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMK 337
PD+ + + H ++V+ L +T +++ +VF G I+ + +++D +G++
Sbjct: 347 PDAHKDAAIPYHRLFVSNLAFSLTADDVRQVFEPFGEIEF--------VDLHMDL-SGLR 397
Query: 338 KGDALVTYLKEPSVALATQLLDGTPFRPDGK-IPMSVTQAKFEQKGERFI------AKQV 390
KG A V + S +A + G F G+ I + Q + + I ++
Sbjct: 398 KGTAYVQFKDVKSAQMALDAMAG--FDLAGRLIKVQTIQERGTYQTPDLIEDSGNYGTRL 455
Query: 391 DSKKKK----KLKKVEEKM---LGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADEN 438
D+ +++ KL + E + L +D++ +PA +++ MF P E + N
Sbjct: 456 DANQRQQLMFKLARTEPNVNLSLSAPRINDSQSKVPAMDPTPRIVVHNMFNPEE-ETERN 514
Query: 439 LRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+L DV+ E K G V +KV E G V + F D A K ++ +NGR+FGGRQ
Sbjct: 515 WDLDLAEDVKGEVESKYGRVKRIKV-EKMSAGEVYIEFIDTDSAIKAVKGLNGRFFGGRQ 573
Query: 498 IHA 500
+ A
Sbjct: 574 LQA 576
>gi|119494703|ref|XP_001264168.1| RNA splicing factor (Pad-1), putative [Neosartorya fischeri NRRL
181]
gi|119412330|gb|EAW22271.1| RNA splicing factor (Pad-1), putative [Neosartorya fischeri NRRL
181]
Length = 566
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 471 VLLRNMFDPAEEEGEAWIK-ELEDDVRAECEDKYGHVVHIALDPNS-QGDIYLKFDRVQG 528
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 529 GENAIKGLNGRFFGGRQITA 548
>gi|325090859|gb|EGC44169.1| RNA splicing factor [Ajellomyces capsulatus H88]
Length = 585
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 490 VLLRNMFDPAEEEGESWIK-ELEDDVRAECEDKYGHVVHIALDPN-TQGDIYLKFDRVQG 547
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 548 GENAIKGLNGRFFGGRQISA 567
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 61/219 (27%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V L + +E++ F K G +KE +I D+ +G KG V + E S
Sbjct: 192 VFVQQLAARLRTKELIAFFEKVGPVKE--------AQIVKDRVSGRSKGVGYVEFKNEES 243
Query: 351 VALATQL----LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKML 406
V LA QL L G P IA+ +++K ++ + E +
Sbjct: 244 VPLAIQLTGQKLLGIPI----------------------IAQLTEAEKNRQARNPEANVS 281
Query: 407 GWGGRDDAKLTIPATVI----LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
G + +IP + + F T E+D+Q+ G +D V++
Sbjct: 282 G------NQNSIPFHRLYVGNIHFSIT--------------ESDLQKVFEPFGELDFVQL 321
Query: 463 CE---NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
+ +G V+F+D A++ +E MNG GR I
Sbjct: 322 QKEEGGRSRGYGFVQFRDPNQAREALEKMNGFDLAGRPI 360
>gi|225561416|gb|EEH09696.1| RNA splicing factor Pad-1 [Ajellomyces capsulatus G186AR]
Length = 584
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 489 VLLRNMFDPAEEEGESWIK-ELEDDVRAECEDKYGHVVHIALDPN-TQGDIYLKFDRVQG 546
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 547 GENAIKGLNGRFFGGRQISA 566
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 61/219 (27%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V L + +E++ F K G +KE +I D+ +G KG V + E S
Sbjct: 191 VFVQQLAARLRTKELIAFFEKVGPVKE--------AQIVKDRVSGRSKGVGYVEFKNEES 242
Query: 351 VALATQL----LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKML 406
V LA QL L G P IA+ +++K ++ + E +
Sbjct: 243 VPLAIQLTGQKLLGIPI----------------------IAQLTEAEKNRQARNPEANVS 280
Query: 407 GWGGRDDAKLTIPATVI----LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
G + +IP + + F T E+D+Q+ G +D V++
Sbjct: 281 G------NQNSIPFHRLYVGNIHFSIT--------------ESDLQKVFEPFGELDFVQL 320
Query: 463 CE---NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
+ +G V+F+D A++ +E MNG GR I
Sbjct: 321 QKEEGGRSRGYGFVQFRDPNQAREALEKMNGFDLAGRPI 359
>gi|154282581|ref|XP_001542086.1| hypothetical protein HCAG_02257 [Ajellomyces capsulatus NAm1]
gi|150410266|gb|EDN05654.1| hypothetical protein HCAG_02257 [Ajellomyces capsulatus NAm1]
Length = 585
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 490 VLLRNMFDPAEEEGESWIK-ELEDDVRAECEDKYGHVVHIALDPN-TQGDIYLKFDRVQG 547
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 548 GENAIKGLNGRFFGGRQISA 567
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 61/219 (27%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V L + +E++ F K G +KE +I D+ +G KG V + E S
Sbjct: 192 VFVQQLAARLRTKELIAFFEKVGPVKE--------AQIVKDRVSGRSKGVGYVEFKNEES 243
Query: 351 VALATQL----LDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKML 406
V LA QL L G P IA+ +++K ++ + E +
Sbjct: 244 VPLAIQLTGQKLLGIPI----------------------IAQLTEAEKNRQARNPEANVS 281
Query: 407 GWGGRDDAKLTIPATVI----LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV 462
G + +IP + + F T E+D+Q+ G +D V++
Sbjct: 282 G------NQNSIPFHRLYVGNIHFSIT--------------ESDLQKVFEPFGELDFVQL 321
Query: 463 CE---NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
+ +G V+F+D A++ +E MNG GR I
Sbjct: 322 QKEEGGRSRGYGFVQFRDPNQAREALEKMNGFDLAGRPI 360
>gi|331242737|ref|XP_003334014.1| RNA-binding protein rsd1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 88
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 427 MFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI 485
MF P E ++ EL DV+ EC K GPV ++ + + G + + F AQK I
Sbjct: 1 MFNPEE-ETEQGWDIELRDDVKGECEEKYGPVLAIAIEKESTAGDIYITFDSVPSAQKAI 59
Query: 486 ELMNGRWFGGRQIHAS 501
+N RWFGGRQI A+
Sbjct: 60 TGLNNRWFGGRQITAA 75
>gi|195420125|ref|XP_002060758.1| GK16102 [Drosophila willistoni]
gi|194156843|gb|EDW71744.1| GK16102 [Drosophila willistoni]
Length = 194
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V G+ VT ++ FS G+I D T KPR+ +Y +K+ G KG++ +TY+
Sbjct: 20 IFVLGMRLSVTKNYILMFFSSVGMIAVD-HTSKPRMFVYKNKQNGRSKGESTITYISPFM 78
Query: 351 VALATQLLDGTPFR 364
+A + L+G+ F+
Sbjct: 79 AEMAIRCLNGSKFK 92
>gi|240274521|gb|EER38037.1| RNA splicing factor [Ajellomyces capsulatus H143]
Length = 537
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 442 VLLRNMFDPAEEEGESWIK-ELEDDVRAECEDKYGHVVHIALDPNT-QGDIYLKFDRVQG 499
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 500 GENAIKGLNGRFFGGRQISA 519
>gi|195380577|ref|XP_002049047.1| GJ20975 [Drosophila virilis]
gi|194143844|gb|EDW60240.1| GJ20975 [Drosophila virilis]
Length = 428
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 95/223 (42%), Gaps = 37/223 (16%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+Y+ GLP + E++ E+ G ++ + D TG KG A YL +PS
Sbjct: 224 IYIGGLPTCLNEEQVKELLVTFGQLRG--------FNLVKDTITGQSKGFAFCEYL-DPS 274
Query: 351 VA-LATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
+ A L+G Q G+R + Q + + + +L
Sbjct: 275 ITEQAIAGLNGM------------------QLGDRKLVVQRSIAGVRNMVASQLPVLQVP 316
Query: 410 GRDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE--- 464
G T AT +L M P+E+ D + ++ D+++EC K G V S+K+
Sbjct: 317 GFPIDVSTCKATEVLCLLNMVLPSEL-LDNDEYEDIRTDIKQECAKYGKVKSLKIPRPIG 375
Query: 465 NHPQ---GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
+ PQ G V VRF+ +D +K + ++GR F GR + S D
Sbjct: 376 DPPQTGCGKVFVRFESIEDCKKALNALSGRKFSGRIVMTSFYD 418
>gi|258578315|ref|XP_002543339.1| hypothetical protein UREG_02855 [Uncinocarpus reesii 1704]
gi|237903605|gb|EEP78006.1| hypothetical protein UREG_02855 [Uncinocarpus reesii 1704]
Length = 582
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E D ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 487 VLLRNMFDPTEEEGDSWVK-ELEDDVRAECEEKYGHVVHIALDPNT-QGDIYLKFDRVQG 544
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 545 GENAIKGLNGRFFGGRQITA 564
>gi|170040164|ref|XP_001847879.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863738|gb|EDS27121.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 278
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
++V G+ + + E+ + F GIIK+D T KP+I IY DKE+G KG+A VTY
Sbjct: 8 IFVQGMNPETSETEIADYFGAIGIIKKDKRTMKPKIWIYKDKESGRPKGEATVTY 62
>gi|325188850|emb|CCA23378.1| Mitochondrial Carrier (MC) Family putative [Albugo laibachii Nc14]
Length = 886
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 7/80 (8%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETK------KPRIKIYVDKETGMKKGDALVT 344
+Y++ LPDDVT++E+ E+F GI+ + + + +IK+Y D E G KGDA++T
Sbjct: 169 IYISNLPDDVTIDELQEIFGAIGIVAREKQKRGFKDQWPFKIKMYTD-EAGKFKGDAVLT 227
Query: 345 YLKEPSVALATQLLDGTPFR 364
Y + A +G+ R
Sbjct: 228 YEDANAARTAPSFFNGSEIR 247
>gi|242047468|ref|XP_002461480.1| hypothetical protein SORBIDRAFT_02g003310 [Sorghum bicolor]
gi|241924857|gb|EER98001.1| hypothetical protein SORBIDRAFT_02g003310 [Sorghum bicolor]
Length = 338
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 79/215 (36%), Gaps = 45/215 (20%)
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
++V LP T EE+ +VF++ G + + +I DK T +G A VT
Sbjct: 111 RLFVGNLPYTYTSEELAQVFAEAGRVDD--------AQIIYDKVTNRSRGFAFVTMATAE 162
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
A A Q+ DG G V + + GER ++ K+ LGWG
Sbjct: 163 EAAKAIQMFDGALL---GGRTARVNYPEVPRGGERRTVTMAGRRRDDGTYKIYAGNLGWG 219
Query: 410 GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE---NH 466
R D LR+ E + G +D+ + E
Sbjct: 220 VRAD------------------------TLRNVFEG-------RAGLLDARVIFERETGR 248
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+G V F+ +DAQ +E ++G GR + S
Sbjct: 249 SRGFGFVSFRTAEDAQAALEALDGVELEGRPLRLS 283
>gi|218201876|gb|EEC84303.1| hypothetical protein OsI_30792 [Oryza sativa Indica Group]
Length = 471
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + + F G++K+D T +P+I IY DK T KGDA VTY
Sbjct: 68 NGSVYVCNLPPGTDETMLADYFGTIGLLKKDKRTGRPKIWIYRDKVTNEPKGDATVTYED 127
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 128 PHAASAAVEWFNNKDF 143
>gi|82595543|ref|XP_725893.1| splicing factor [Plasmodium yoelii yoelii 17XNL]
gi|23481068|gb|EAA17458.1| putative splicing factor [Plasmodium yoelii yoelii]
Length = 698
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 48/86 (55%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTP--AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
I ++L MF+P + + +D + S++ DV+EEC K G + + + + G + ++
Sbjct: 600 NITPNIVLCNMFSPNDSSIGSDPDFFSDIIEDVREECSKYGNIIKIWLNSKNIDGKIYIK 659
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
+ ++ K + +NGR+FGG I+A
Sbjct: 660 YTKNDESLKAFQCLNGRYFGGSLINA 685
>gi|119186105|ref|XP_001243659.1| hypothetical protein CIMG_03100 [Coccidioides immitis RS]
gi|392870366|gb|EAS32162.2| CC1-like family splicing factor [Coccidioides immitis RS]
Length = 595
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E D ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 500 VLLRNMFDPTEEEGDSWVK-ELEDDVRAECEEKYGHVVHIALDPN-AQGDIYLKFDRVQG 557
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 558 GENAIKGLNGRFFGGRQITA 577
>gi|303318024|ref|XP_003069014.1| splicing factor, CC1-like family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240108695|gb|EER26869.1| splicing factor, CC1-like family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320036831|gb|EFW18769.1| RNA splicing factor [Coccidioides posadasii str. Silveira]
Length = 593
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E D ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 498 VLLRNMFDPTEEEGDSWVK-ELEDDVRAECEEKYGHVVHIALDPN-AQGDIYLKFDRVQG 555
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 556 GENAIKGLNGRFFGGRQITA 575
>gi|68073969|ref|XP_678899.1| splicing factor [Plasmodium berghei strain ANKA]
gi|56499509|emb|CAH94054.1| splicing factor, putative [Plasmodium berghei]
Length = 535
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTP--AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
+ + ++L MF+P + + +D + S++ DV+EEC K G + + + + G + ++
Sbjct: 437 NVTSNIVLCNMFSPNDSNIGSDPDFFSDIIEDVREECSKYGNIIKIWLNTKNIDGKIYIK 496
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
+ ++ K + +NGR+FGG I+A
Sbjct: 497 YTKNDESLKAFQCLNGRYFGGSLINA 522
>gi|242036887|ref|XP_002465838.1| hypothetical protein SORBIDRAFT_01g046740 [Sorghum bicolor]
gi|241919692|gb|EER92836.1| hypothetical protein SORBIDRAFT_01g046740 [Sorghum bicolor]
Length = 541
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + ++ DVQEEC K G + + V +N G V +RF A
Sbjct: 451 LLLKNMFDPA-LETDPDFDLDIRDDVQEECSKFGQLKHIFVDKNT-AGFVYLRFDSITAA 508
Query: 482 QKCIELMNGRWFGGRQIHAS 501
+ ++GRWF G+ I A+
Sbjct: 509 MSAQKALHGRWFAGKMITAT 528
>gi|298708815|emb|CBJ30774.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 604
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++++ MF PA DE +++ +++EEC K G V V P G+V V F A
Sbjct: 498 LLVKNMFDPAT-ETDEGWELDIKEEMEEECSKHGAVMHSYVESRQPGGLVYVMFSTTGAA 556
Query: 482 QKCIELMNGRWFGGRQI 498
E++NGRWF GR +
Sbjct: 557 VASAEMLNGRWFAGRMV 573
>gi|270001539|gb|EEZ97986.1| hypothetical protein TcasGA2_TC000381 [Tribolium castaneum]
Length = 591
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 18/101 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG----------V 470
VILR M P ++ DE L+ E +QEEC K G V+ V + N Q
Sbjct: 494 VVILRNMVGPEDV--DETLQEE----IQEECSKFGTVERV-IIYNEKQSEEDDASDVVVK 546
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHAS-EDDGLVNHA 510
+ V F + +A+K + +NGR+FGGR + AS D L +H+
Sbjct: 547 IFVEFTETPEAEKARDALNGRYFGGRMVQASLYDQTLFDHS 587
>gi|388852481|emb|CCF53883.1| related to RNA-binding region containing protein 2 [Ustilago
hordei]
Length = 678
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRK 479
+++L+ MF PAE ++ +L DV++EC K G V S+ V E G V V F +
Sbjct: 587 SLLLKNMFNPAE-ETEQGWDKDLAEDVKDECHAKYGAVTSIHV-EKDSAGEVYVTFANLD 644
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
++K +E +NGR+FGG+ I A
Sbjct: 645 GSRKALEGLNGRFFGGKPISA 665
>gi|356575464|ref|XP_003555861.1| PREDICTED: RNA-binding protein 39-like [Glycine max]
Length = 600
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 13/106 (12%)
Query: 406 LGWGGRDDAKLTIPA---------TVILRFMFTPA-EMRADENLRSELEADVQEECVKIG 455
L GG TIP+ ++L+ MF P E+ D +L +++ DV+ EC K+G
Sbjct: 483 LAGGGLQIPTATIPSIDTIGVASECLMLKNMFDPKDEIEPDFDL--DIKEDVEAECSKLG 540
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + V + G V +RF+D + A + ++GRWF G+ I AS
Sbjct: 541 TLKHIYV-DKKSAGFVYLRFEDTQSAISAQQALHGRWFAGKMITAS 585
>gi|297836410|ref|XP_002886087.1| hypothetical protein ARALYDRAFT_480573 [Arabidopsis lyrata subsp.
lyrata]
gi|297331927|gb|EFH62346.1| hypothetical protein ARALYDRAFT_480573 [Arabidopsis lyrata subsp.
lyrata]
Length = 557
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + +++ ++E DV+EEC K G ++ + V +N G V +RF++ + A
Sbjct: 463 LLLKNMFDPS-TQTEDDFDKDIEEDVKEECSKFGNLNHIYVDKNS-VGFVYLRFENAQAA 520
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 521 IGAQRALHGRWFAGKMITAT 540
>gi|221058957|ref|XP_002260124.1| splicing factor [Plasmodium knowlesi strain H]
gi|193810197|emb|CAQ41391.1| splicing factor, putative [Plasmodium knowlesi strain H]
Length = 970
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTPAE--MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
I ++L MF+P + + +D + S++ DV+EEC K G + + + + G + ++
Sbjct: 872 NITPNLVLCNMFSPNDENIGSDPDFFSDIIEDVKEECSKYGSITKIWMDTKNIDGKIYIK 931
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
+ + +A K + +NGR+FGG I+A
Sbjct: 932 YTKQDEAFKAFQFLNGRYFGGSLINA 957
>gi|389585114|dbj|GAB67845.1| splicing factor [Plasmodium cynomolgi strain B]
Length = 877
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTPAE--MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
I ++L MF+P + + +D + +++ DV+EEC K G V + + + G + ++
Sbjct: 779 NITPNLVLCNMFSPNDESIGSDPDFFTDIIEDVKEECSKYGSVTKIWLDTKNIDGKIYIK 838
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
+ + ++ K + +NGR+FGG I+A
Sbjct: 839 YANNDESLKAFQFLNGRYFGGSLINA 864
>gi|328717697|ref|XP_001948049.2| PREDICTED: hypothetical protein LOC100168603 isoform 1
[Acyrthosiphon pisum]
gi|328717699|ref|XP_003246281.1| PREDICTED: hypothetical protein LOC100168603 isoform 2
[Acyrthosiphon pisum]
Length = 484
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 40/73 (54%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V+G+ + ++ + F GIIK D +T KP+I +Y DK +G KG+A VTY S
Sbjct: 168 IFVSGMSTTTSEADIEQHFGSIGIIKTDKKTGKPKIWMYNDKASGRPKGEATVTYDDAHS 227
Query: 351 VALATQLLDGTPF 363
A +G F
Sbjct: 228 ANSAISWFNGKSF 240
>gi|384252120|gb|EIE25597.1| splicing factor, CC1-like protein [Coccomyxa subellipsoidea C-169]
Length = 497
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 418 IPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
IP ++L+ MF P E + + E+ DV EEC K GPV V +N +G V ++F
Sbjct: 402 IPTQCLLLKNMFDPKE-ETEPDWDQEIATDVTEECSKYGPVSHTHVDKNS-KGFVYLKFV 459
Query: 477 DRKDAQKCIELMNGRWFGGRQIHA 500
+ + + ++GRWF GRQ+ A
Sbjct: 460 TVEGSAAAQKALHGRWFAGRQVVA 483
>gi|414590139|tpg|DAA40710.1| TPA: hypothetical protein ZEAMMB73_970290 [Zea mays]
Length = 490
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 37/76 (48%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + E F G++K+D +P+I IY DK + KGDA VTY
Sbjct: 57 NGSVYVCNLPPGTDENMLAEYFGTIGLLKKDKRNGRPKIWIYRDKVSNEPKGDATVTYED 116
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 117 PHAASAAVEWFNNKDF 132
>gi|224098862|ref|XP_002311296.1| predicted protein [Populus trichocarpa]
gi|222851116|gb|EEE88663.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N +Y++ LP+ + F G++K+D T +P+I +Y DK T KGDA VTY +
Sbjct: 12 NGTIYISNLPEGTDDIMLAGYFGTIGLLKKDKRTGRPKIWLYRDKMTNEPKGDATVTY-E 70
Query: 348 EPSVALAT 355
+P ALA
Sbjct: 71 DPHAALAA 78
>gi|340508480|gb|EGR34174.1| splicing factor, putative [Ichthyophthirius multifiliis]
Length = 582
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 45/94 (47%), Gaps = 6/94 (6%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
V+L MF E D ++L+ DV EEC K G V V V P G V V F + A
Sbjct: 488 VVLVNMFLEGEGNGDSAFFADLKEDVAEECKKFGDVIQVIVAVKSPCGAVFVIFNNEISA 547
Query: 482 QKCIELMNGRWFGGRQIHA---SED---DGLVNH 509
C + M+ R F RQI+ SE+ D L NH
Sbjct: 548 VNCAKNMDMRLFNQRQINCYCISEETLQDELQNH 581
>gi|195449081|ref|XP_002071929.1| GK21292 [Drosophila willistoni]
gi|194168014|gb|EDW82915.1| GK21292 [Drosophila willistoni]
Length = 189
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
FEL+ T ++V G+ VT ++ FS G+I D KPRI +Y +K+ G KG+A
Sbjct: 67 FELEPET-IFVLGMRLSVTKNDIRMFFSSVGMIAVD-HASKPRIFVYKNKQNGRSKGEAT 124
Query: 343 VTYLKEPSVALATQLLDGTPFR 364
+TY+ +A + L+G+ F+
Sbjct: 125 ITYISPFMAEMAIRCLNGSKFK 146
>gi|340377156|ref|XP_003387096.1| PREDICTED: splicing factor U2AF 35 kDa subunit-like [Amphimedon
queenslandica]
Length = 227
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 38/63 (60%), Gaps = 4/63 (6%)
Query: 442 ELEADVQEECV-KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
E DV EC K GP++ + VC+N H G V ++F+ +DA+K ++ +N RWF G
Sbjct: 79 EFFEDVFIECEEKYGPIEKMNVCDNLGDHLVGNVYIKFRFEEDAEKAVQSLNNRWFNGHP 138
Query: 498 IHA 500
IHA
Sbjct: 139 IHA 141
>gi|303271187|ref|XP_003054955.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462929|gb|EEH60207.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 811
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 2/82 (2%)
Query: 418 IPATVI-LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
+P+T + L+ MF PA D ++ SE+ DV+ EC G V V +N QG V V+F
Sbjct: 713 VPSTCVNLKNMFDPAHASRDASVVSEVHEDVRLECGNHGVVTHSAVDKNS-QGFVYVKFD 771
Query: 477 DRKDAQKCIELMNGRWFGGRQI 498
+ A + +N RWF GR +
Sbjct: 772 TVQAATEARRTLNARWFAGRLV 793
>gi|222624209|gb|EEE58341.1| hypothetical protein OsJ_09449 [Oryza sativa Japonica Group]
Length = 565
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + +++ DVQEEC K G V + V +N G V +RF A
Sbjct: 475 LLLKNMFDPS-VETDPDFDLDIKDDVQEECSKFGQVKHIFVDKNT-SGFVYLRFDSITAA 532
Query: 482 QKCIELMNGRWFGGRQIHAS 501
+ ++GRWF G+ I A+
Sbjct: 533 MSAQKALHGRWFAGKMITAT 552
>gi|218192112|gb|EEC74539.1| hypothetical protein OsI_10059 [Oryza sativa Indica Group]
Length = 567
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + +++ DVQEEC K G V + V +N G V +RF A
Sbjct: 477 LLLKNMFDPS-VETDPDFDLDIKDDVQEECSKFGQVKHIFVDKNT-SGFVYLRFDSITAA 534
Query: 482 QKCIELMNGRWFGGRQIHAS 501
+ ++GRWF G+ I A+
Sbjct: 535 MSAQKALHGRWFAGKMITAT 554
>gi|189234753|ref|XP_974490.2| PREDICTED: similar to STE20-like kinase (yeast) [Tribolium
castaneum]
Length = 1819
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 18/103 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG----------V 470
VILR M P ++ DE L+ E +QEEC K G V+ V + N Q
Sbjct: 451 VVILRNMVGPEDV--DETLQEE----IQEECSKFGTVERV-IIYNEKQSEEDDASDVVVK 503
Query: 471 VLVRFKDRKDAQKCIELMNGRWFGGRQIHAS-EDDGLVNHAAI 512
+ V F + +A+K + +NGR+FGGR + AS D L +H +
Sbjct: 504 IFVEFTETPEAEKARDALNGRYFGGRMVQASLYDQTLFDHTKM 546
>gi|50725051|dbj|BAD33184.1| RNA-binding protein-like [Oryza sativa Japonica Group]
gi|222641273|gb|EEE69405.1| hypothetical protein OsJ_28765 [Oryza sativa Japonica Group]
Length = 414
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + + F G++K+D T +P+I IY DK T KGDA VTY
Sbjct: 11 NGSVYVCNLPPGTDETMLADYFGTIGLLKKDKRTGRPKIWIYRDKVTNEPKGDATVTYED 70
Query: 348 EPSVALATQLLDGTPF 363
+ + A + + F
Sbjct: 71 PHAASAAVEWFNNKDF 86
>gi|302664299|ref|XP_003023781.1| hypothetical protein TRV_02070 [Trichophyton verrucosum HKI 0517]
gi|291187796|gb|EFE43163.1| hypothetical protein TRV_02070 [Trichophyton verrucosum HKI 0517]
Length = 592
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 497 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPNT-QGDIYLKFDRVQG 554
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 555 GENAIKGLNGRFFGGRQISA 574
>gi|242039571|ref|XP_002467180.1| hypothetical protein SORBIDRAFT_01g021020 [Sorghum bicolor]
gi|241921034|gb|EER94178.1| hypothetical protein SORBIDRAFT_01g021020 [Sorghum bicolor]
Length = 535
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 444 LLLKNMFDPA-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKNT-AGFVYLQFDSVAAA 501
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K + ++GRWF G+ I A+
Sbjct: 502 GKAQQALHGRWFAGKMITAT 521
>gi|194704666|gb|ACF86417.1| unknown [Zea mays]
gi|414871304|tpg|DAA49861.1| TPA: hypothetical protein ZEAMMB73_656561 [Zea mays]
Length = 537
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 446 LLLKNMFDPA-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKNT-AGFVYLQFDSVTAA 503
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K ++GRWF G+ I A+
Sbjct: 504 AKAQNALHGRWFAGKMITAT 523
>gi|198424504|ref|XP_002131946.1| PREDICTED: similar to poly-U binding splicing factor 60KDa [Ciona
intestinalis]
Length = 491
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 8/72 (11%)
Query: 437 ENLRSELEADVQEECVKIGPVDSVKV-------CENHPQGV-VLVRFKDRKDAQKCIELM 488
E + +LE++V EEC K G V V + E+ P V + V F D +K +E +
Sbjct: 405 EEIDDDLESEVTEECGKFGSVSRVVIYQEKQSEAEDAPVTVKIYVEFTDSVFCKKAVESL 464
Query: 489 NGRWFGGRQIHA 500
NGRWFGGR+I A
Sbjct: 465 NGRWFGGRKIEA 476
>gi|299115956|emb|CBN75961.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 158
Score = 51.6 bits (122), Expect = 0.001, Method: Composition-based stats.
Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 7/92 (7%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPR------IKIYVDKETGMKKGDA 341
N VYV+GLP +T ++VE FS GII + + + ++IY D TG KGD
Sbjct: 58 NNRVYVSGLPTTITETDLVEKFSTIGIIARIRQKRGYKDQWPFSVRIYKDDNTGKPKGDC 117
Query: 342 LVTYLKEPSVALATQLLDGTPFRPDGKIPMSV 373
++ Y + S A + DG F+ GKI +++
Sbjct: 118 IIKYEEPNSAHAALRWFDGIDFK-GGKIGVAM 148
>gi|296820488|ref|XP_002849949.1| RNA splicing factor Pad-1 [Arthroderma otae CBS 113480]
gi|238837503|gb|EEQ27165.1| RNA splicing factor Pad-1 [Arthroderma otae CBS 113480]
Length = 595
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 500 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPN-TQGDIYLKFDRVQG 557
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 558 GENAIKGLNGRFFGGRQISA 577
>gi|327293227|ref|XP_003231310.1| RNA splicing factor [Trichophyton rubrum CBS 118892]
gi|326466426|gb|EGD91879.1| RNA splicing factor [Trichophyton rubrum CBS 118892]
Length = 592
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 497 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPNT-QGDIYLKFDRVQG 554
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 555 GENAIKGLNGRFFGGRQISA 574
>gi|326485177|gb|EGE09187.1| RNA splicing factor [Trichophyton equinum CBS 127.97]
Length = 560
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 465 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPNT-QGDIYLKFDRVQG 522
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 523 GENAIKGLNGRFFGGRQISA 542
>gi|302499164|ref|XP_003011578.1| hypothetical protein ARB_02131 [Arthroderma benhamiae CBS 112371]
gi|291175130|gb|EFE30938.1| hypothetical protein ARB_02131 [Arthroderma benhamiae CBS 112371]
Length = 594
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 499 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPNT-QGDIYLKFDRVQG 556
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 557 GENAIKGLNGRFFGGRQISA 576
>gi|326471726|gb|EGD95735.1| RNA splicing factor [Trichophyton tonsurans CBS 112818]
Length = 592
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 497 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPNT-QGDIYLKFDRVQG 554
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 555 GENAIKGLNGRFFGGRQISA 574
>gi|357146380|ref|XP_003573971.1| PREDICTED: RNA-binding protein 39-like [Brachypodium distachyon]
Length = 597
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + ++ DVQEEC K G V + V + H G V + F A
Sbjct: 503 LLLKNMFDPA-VETDPDFDLDIRDDVQEECSKFGVVKHIFV-DKHTAGFVYLHFDSSTAA 560
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 561 TSAQRSLHGRWFAGKMITAT 580
>gi|315043566|ref|XP_003171159.1| RNA-binding protein rsd1 [Arthroderma gypseum CBS 118893]
gi|311344948|gb|EFR04151.1| RNA-binding protein rsd1 [Arthroderma gypseum CBS 118893]
Length = 589
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PA+ E+ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 494 VLLRNMFDPAQEEG-ESWVKELEDDVRAECEEKYGHVVHISLDPNT-QGDIYLKFDRVQG 551
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 552 GENAIKGLNGRFFGGRQISA 571
>gi|226497258|ref|NP_001140274.1| uncharacterized protein LOC100272318 [Zea mays]
gi|194698792|gb|ACF83480.1| unknown [Zea mays]
gi|414883568|tpg|DAA59582.1| TPA: ribonucleoprotein [Zea mays]
Length = 341
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 80/215 (37%), Gaps = 45/215 (20%)
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
++V LP T EE+ +VFS+ G + + +I DK T +G A VT
Sbjct: 113 RLFVGNLPYTYTSEELAQVFSEAGRVDD--------AQIIYDKVTNRSRGFAFVTMATAE 164
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
A A Q+ DG G V + + GER ++ K+ LGWG
Sbjct: 165 EAAKAIQMFDGALL---GGRTARVNYPEVPRGGERRTVTMSGRRRDDGTYKIYAGNLGWG 221
Query: 410 GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE---NH 466
+RAD LR+ E + G +D+ + E
Sbjct: 222 -----------------------VRAD-TLRNVFEG-------RAGLLDARVIFERETGR 250
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+G V F +DAQ +E ++G GR + S
Sbjct: 251 SRGFGFVSFSTAEDAQAALESLDGVELEGRSLRLS 285
>gi|119569012|gb|EAW48627.1| synaptotagmin binding, cytoplasmic RNA interacting protein, isoform
CRA_e [Homo sapiens]
Length = 588
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 106/243 (43%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF----MFTPAEMRAD 436
SK K+++ KV E + + + DD K L + A +
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQARLLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|242773880|ref|XP_002478329.1| RNA splicing factor (Pad-1), putative [Talaromyces stipitatus ATCC
10500]
gi|218721948|gb|EED21366.1| RNA splicing factor (Pad-1), putative [Talaromyces stipitatus ATCC
10500]
Length = 562
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 467 VLLRNMFDPTEEEGESWIK-ELEDDVRAECEEKYGHVVHISLDPNS-QGDIYLKFDRVQG 524
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 525 GENAIKGLNGRFFGGRQISA 544
>gi|260800441|ref|XP_002595142.1| hypothetical protein BRAFLDRAFT_118606 [Branchiostoma floridae]
gi|229280384|gb|EEN51153.1| hypothetical protein BRAFLDRAFT_118606 [Branchiostoma floridae]
Length = 517
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 265 QPDKQVEKKEANKPPDSWFELKVNT-HVYVTGLPDDVTVEEM----VEVFSKCGIIKEDP 319
+PD V + N P + +++V ++V LP + EE+ F GIIK D
Sbjct: 378 EPDAGVSVLKVNDVPATDGDMQVQQDTIFVQNLPTSMREEELAGHLTSHFGSIGIIKMDK 437
Query: 320 ETKKPRIKIYVDKETGMKKGDALVTY 345
+T KP+I IY DK TG KG+A VTY
Sbjct: 438 KTHKPKIWIYKDKMTGSPKGEATVTY 463
>gi|108706237|gb|ABF94032.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed
[Oryza sativa Japonica Group]
Length = 566
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + +++ DVQEEC K G V + V +N G V +RF A
Sbjct: 476 LLLKNMFDPS-VETDPDFDLDIKDDVQEECSKFGQVKHIFVDKNT-SGFVYLRFDSITAA 533
Query: 482 QKCIELMNGRWFGGRQIHAS 501
+ ++GRWF G+ I A+
Sbjct: 534 MSAQKALHGRWFAGKMITAT 553
>gi|226509668|ref|NP_001142242.1| uncharacterized protein LOC100274411 [Zea mays]
gi|194690874|gb|ACF79521.1| unknown [Zea mays]
gi|194707772|gb|ACF87970.1| unknown [Zea mays]
gi|414871307|tpg|DAA49864.1| TPA: hypothetical protein ZEAMMB73_656561 [Zea mays]
gi|414871308|tpg|DAA49865.1| TPA: hypothetical protein ZEAMMB73_656561 [Zea mays]
Length = 318
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 227 LLLKNMFDPA-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKNT-AGFVYLQFDSVTAA 284
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K ++GRWF G+ I A+
Sbjct: 285 AKAQNALHGRWFAGKMITAT 304
>gi|58267808|ref|XP_571060.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134112313|ref|XP_775132.1| hypothetical protein CNBE4060 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257784|gb|EAL20485.1| hypothetical protein CNBE4060 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227294|gb|AAW43753.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 615
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 110/246 (44%), Gaps = 40/246 (16%)
Query: 279 PDSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKET--- 334
PD+ + + H ++V+ L +T +++ +VF G I+ +VD T
Sbjct: 374 PDAHKDAAIPYHRLFVSNLAFSLTADDVRQVFEPFGEIE------------FVDLHTDLS 421
Query: 335 GMKKGDALVTYLKEPSVALATQLLDGTPFRPDGK-IPMSVTQAKFEQKGERFI------A 387
G++KG A V + S +A + G F G+ I + Q + + I
Sbjct: 422 GLRKGTAYVQFKDVKSAQMALDAMAG--FDLAGRLIKVQTIQERGTYQTPDLIEDSGNYG 479
Query: 388 KQVDSKKKK----KLKKVEEKM---LGWGGRDDAKLTIPAT-----VILRFMFTPAEMRA 435
++D+ +++ KL + E + L + ++ IPA +++ MF P E
Sbjct: 480 TRLDANQRQQLMFKLARTEPNVNLSLSAPKINGSQSKIPAMDPTPRIVVHNMFNPEE-ET 538
Query: 436 DENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFG 494
+ N +L DV+ E K G V +KV E G V + F D A K ++ +NGR+FG
Sbjct: 539 ERNWDLDLAEDVKGEVESKYGRVKRIKV-EKMSAGEVYIEFIDTDSAIKAVKGLNGRFFG 597
Query: 495 GRQIHA 500
GRQ+ A
Sbjct: 598 GRQLQA 603
>gi|395857936|ref|XP_003801336.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 4
[Otolemur garnettii]
Length = 588
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSIYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|15809588|gb|AAK59704.1| hnRNP Q2 [Homo sapiens]
Length = 588
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 110/243 (45%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K + L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RSFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|389745686|gb|EIM86867.1| hypothetical protein STEHIDRAFT_57258 [Stereum hirsutum FP-91666
SS1]
Length = 417
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 93/231 (40%), Gaps = 48/231 (20%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V GLP + E+++E+ G +K + + G KG A Y+
Sbjct: 175 IFVGGLPSYLNEEQVMELLKSFGDLKA--------FNLVRENGNGPSKGFAFFEYVDPEV 226
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKK------------KL 398
+A Q L G + G+R++ Q S K +
Sbjct: 227 TDVAIQSLSGM------------------ELGDRYLVVQRASVGAKAGQPGMPNLPYDQF 268
Query: 399 KKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVD 458
++ +L G D + ++L M TP ++ D++ ++L D++EE G VD
Sbjct: 269 PEIPRPILPAGASDLSSANSRILLMLN-MVTPEDL-IDDSEYADLLEDIREEVANYGDVD 326
Query: 459 SVKVCENHPQ--------GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
V++ Q G V VR+KD + A K ++ + GR F GR I A+
Sbjct: 327 DVRIPRPDAQRADEAAGVGRVYVRYKDAEGAAKGMQALAGRSFAGRSIIAT 377
>gi|345326438|ref|XP_003431044.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q
[Ornithorhynchus anatinus]
Length = 588
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGSIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQIVEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|338710583|ref|XP_003362387.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q [Equus
caballus]
Length = 588
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|440797518|gb|ELR18604.1| RNA recognition motif domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 696
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 7/80 (8%)
Query: 432 EMRADENLRSELEADVQEECVKIGPVDSVKVCENH----PQGVVL---VRFKDRKDAQKC 484
+M E++ ELE ++ +E K G V+ V + + P V++ + F+ AQK
Sbjct: 605 DMAGPEDVDDELEGEITDEATKYGIVERVVIYQERQSEKPGDVIIKIFILFQSADQAQKA 664
Query: 485 IELMNGRWFGGRQIHASEDD 504
+ +NGRWFGGRQI A+ D
Sbjct: 665 LTSLNGRWFGGRQIKAAFYD 684
>gi|198437801|ref|XP_002129262.1| PREDICTED: similar to RNA binding motif protein 17 [Ciona
intestinalis]
Length = 378
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 47/94 (50%), Gaps = 16/94 (17%)
Query: 416 LTIPATVI-LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV---- 470
L P VI LR M P E+ D LEA+ +EEC K G V+ + E H GV
Sbjct: 276 LKNPTKVICLRNMVGPGEVDDD------LEAETKEECSKYGSVNKCLIFEMH--GVTEDE 327
Query: 471 ---VLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + F + A K + MNGR+FGGR + AS
Sbjct: 328 AIRIFIEFDRMESAIKSLVDMNGRFFGGRTVKAS 361
>gi|328770116|gb|EGF80158.1| hypothetical protein BATDEDRAFT_11758 [Batrachochytrium
dendrobatidis JAM81]
Length = 389
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 104/255 (40%), Gaps = 49/255 (19%)
Query: 289 THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKE 348
+ +YV L +T +++ F G I+ I I++DKETG KG A V Y+K+
Sbjct: 136 SRIYVGSLDYSLTEQDVANAFQGFGPIE--------FINIHMDKETGHSKGYAFVQYVKD 187
Query: 349 P------------SVALATQLLDGTPFRPDGKIPM-SVTQAKFEQKGERFIAKQVDSKKK 395
V L ++ P + + S A F Q++S +
Sbjct: 188 AWFLNIFILKLQIRVNLISEKSQSAPVTNASNVGIASENIATFNMDDGEMEGFQMNSLSR 247
Query: 396 K----KLKKVEEKMLGWGGRDDAKL-----------TIPATVI---------LRFMFTPA 431
KL + E + A + T PA V+ L+ MF P
Sbjct: 248 AELMAKLARTEPTLSKTSASVAAIIFTNTYFVFDSSTKPAVVVPTMETRCVLLKNMFNP- 306
Query: 432 EMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGR 491
E + N E+E DV+EEC K G + + V + G V ++F +Q ++ ++GR
Sbjct: 307 ENEVESNWVQEIEEDVKEECTKFGKILHISV-DPESHGHVYLKFDSLVSSQAAVKALHGR 365
Query: 492 WFGGRQIHASEDDGL 506
+F G ++ S+D G+
Sbjct: 366 FFAGARV--SKDGGV 378
>gi|195611722|gb|ACG27691.1| ribonucleoprotein [Zea mays]
Length = 341
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 78/215 (36%), Gaps = 45/215 (20%)
Query: 290 HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
++V LP T EE+ +VFS+ G + + +I DK T +G A VT
Sbjct: 113 RLFVGNLPYTYTSEELAQVFSEAGRVDD--------AQIIYDKVTNRSRGFAFVTMATAE 164
Query: 350 SVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
A A Q+ DG G V + + GER ++ K+ LGWG
Sbjct: 165 EAAKAIQMFDGALL---GGRTARVNYPEVPRGGERRTVTMSGRRRDDGTYKIYAGNLGWG 221
Query: 410 GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE---NH 466
R D LR+ E + G +D+ + E
Sbjct: 222 VRAD------------------------TLRNVFEG-------RAGLLDARVIFERETGR 250
Query: 467 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+G V F +DAQ +E ++G GR + S
Sbjct: 251 SRGFGFVSFSTAEDAQAALESLDGVELEGRPLRLS 285
>gi|334313305|ref|XP_001378882.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100029004
[Monodelphis domestica]
Length = 1546
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N+ +YV L D +T + + + F K GI+K + T +P I I +DK+TG KGDA+V+Y
Sbjct: 235 NSAIYVQRLNDSMTADRLAD-FLKXGIVKINKRTGQPMIDIDLDKKTGKPKGDAIVSYDD 293
Query: 348 EPSVALATQLLDGTPFR 364
P+ + G F+
Sbjct: 294 PPTAKAVIEWFGGKDFQ 310
>gi|228008295|ref|NP_001153147.1| heterogeneous nuclear ribonucleoprotein Q isoform 4 [Homo sapiens]
gi|297678606|ref|XP_002817155.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 1
[Pongo abelii]
gi|301775587|ref|XP_002923214.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 4
[Ailuropoda melanoleuca]
gi|332218421|ref|XP_003258354.1| PREDICTED: uncharacterized protein LOC100579672 [Nomascus
leucogenys]
gi|332824482|ref|XP_518621.3| PREDICTED: heterogeneous nuclear ribonucleoprotein Q [Pan
troglodytes]
gi|348578332|ref|XP_003474937.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 4
[Cavia porcellus]
gi|426234738|ref|XP_004011349.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 6
[Ovis aries]
gi|296484069|tpg|DAA26184.1| TPA: synaptotagmin binding, cytoplasmic RNA interacting protein
isoform 1 [Bos taurus]
Length = 588
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|260828249|ref|XP_002609076.1| hypothetical protein BRAFLDRAFT_60256 [Branchiostoma floridae]
gi|229294430|gb|EEN65086.1| hypothetical protein BRAFLDRAFT_60256 [Branchiostoma floridae]
Length = 495
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VVLVRFKDRKDAQK 483
M E+L ELE +V +EC K G V+ V + + QG + V F + +A++
Sbjct: 405 MVGIEDLDDELEGEVTDECGKYGQVNRVIIYQER-QGEEENADVIVKIFVEFSESFEAER 463
Query: 484 CIELMNGRWFGGRQIHASEDDGL 506
++ +NGRWFGGR++ A D L
Sbjct: 464 GVQALNGRWFGGRKVSAESYDQL 486
>gi|334324402|ref|XP_003340516.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q [Monodelphis
domestica]
Length = 588
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|195996811|ref|XP_002108274.1| hypothetical protein TRIADDRAFT_37071 [Trichoplax adhaerens]
gi|190589050|gb|EDV29072.1| hypothetical protein TRIADDRAFT_37071 [Trichoplax adhaerens]
Length = 351
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 1/78 (1%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
++ MF PA+ D++ ++ D+ EEC K G V V V + P+G+V V+ + A
Sbjct: 248 LISNMFDPAK-ETDQDWDLDIRDDIIEECNKHGNVYHVYVDKTSPKGIVYVKCQTIDVAA 306
Query: 483 KCIELMNGRWFGGRQIHA 500
+ ++ +NGRWF G I A
Sbjct: 307 RAVKSLNGRWFAGNMITA 324
>gi|1710361|gb|AAB38280.1| splicing factor U2AF65 [Caenorhabditis briggsae]
Length = 488
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 99/249 (39%), Gaps = 38/249 (15%)
Query: 265 QPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKP 324
QP + A P S N +++ GLP+ +T +++ E+ G +K
Sbjct: 259 QPSQNTFDMNARMPVSSIVVDSAN-KIFIGGLPNYLTEDQVKELLCSFGPLKA------- 310
Query: 325 RIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER 384
+ VD + G KG A YL A L+G Q G++
Sbjct: 311 -FSLNVDSQ-GNSKGYAFAEYLDPTLTDQAIAGLNGM------------------QLGDK 350
Query: 385 FIAKQVDSKKKKKLK-KVEEKMLGWGGRDDAKLTIPATVILRFM--FTPAEMRADENLRS 441
+ Q+ + + + G D ++ AT IL M T E+R+DE+
Sbjct: 351 QLVVQLACANQTRHNTHLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELRSDEDYEE 410
Query: 442 ELEADVQEECVKIGPVDSVKV---CENHP---QGVVLVRFKDRKDAQKCIELMNGRWFGG 495
LE DV+EEC K G V S+++ ++HP G V V F D Q+ + GR F
Sbjct: 411 ILE-DVREECSKYGIVRSLEIPRPYDDHPVPGVGKVFVEFATTSDCQRAQAALTGRKFAN 469
Query: 496 RQIHASEDD 504
R + S D
Sbjct: 470 RTVVTSYYD 478
>gi|268575804|ref|XP_002642882.1| C. briggsae CBR-UAF-1 protein [Caenorhabditis briggsae]
gi|60415989|sp|P90727.2|U2AF2_CAEBR RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
Full=U2 snRNP auxiliary factor large subunit
Length = 488
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 99/249 (39%), Gaps = 38/249 (15%)
Query: 265 QPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKP 324
QP + A P S N +++ GLP+ +T +++ E+ G +K
Sbjct: 259 QPSQNTFDMNARMPVSSIVVDSAN-KIFIGGLPNYLTEDQVKELLCSFGPLKA------- 310
Query: 325 RIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER 384
+ VD + G KG A YL A L+G Q G++
Sbjct: 311 -FSLNVDSQ-GNSKGYAFAEYLDPTLTDQAIAGLNGM------------------QLGDK 350
Query: 385 FIAKQVDSKKKKKLK-KVEEKMLGWGGRDDAKLTIPATVILRFM--FTPAEMRADENLRS 441
+ Q+ + + + G D ++ AT IL M T E+R+DE+
Sbjct: 351 QLVVQLACANQTRHNTHLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELRSDEDYEE 410
Query: 442 ELEADVQEECVKIGPVDSVKV---CENHP---QGVVLVRFKDRKDAQKCIELMNGRWFGG 495
LE DV+EEC K G V S+++ ++HP G V V F D Q+ + GR F
Sbjct: 411 ILE-DVREECSKYGIVRSLEIPRPYDDHPVPGVGKVFVEFATTSDCQRAQAALTGRKFAN 469
Query: 496 RQIHASEDD 504
R + S D
Sbjct: 470 RTVVTSYYD 478
>gi|443896055|dbj|GAC73399.1| transcriptional coactivator CAPER [Pseudozyma antarctica T-34]
Length = 662
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRK 479
+++L+ MF PAE + + +L DV++EC K GPV + V E G + + F D
Sbjct: 571 SLLLKNMFDPAE-ETEADWDKDLAEDVKDECQAKYGPVTRIHV-EKESAGEIYLTFADLD 628
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
++K ++ +NGR+FGG+ I A
Sbjct: 629 ASRKALDGLNGRFFGGKPISA 649
>gi|344264675|ref|XP_003404417.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 3
[Loxodonta africana]
Length = 585
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 138 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 189
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 190 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 245
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 246 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 304
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 305 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 360
Query: 496 RQI 498
I
Sbjct: 361 ENI 363
>gi|255949670|ref|XP_002565602.1| Pc22g16890 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592619|emb|CAP98977.1| Pc22g16890 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 724
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 96/211 (45%), Gaps = 32/211 (15%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
++V LP T E + E FS+ IIK + DKET KG VT+
Sbjct: 38 LFVRSLPTSATTESLAEYFSQSYIIKH--------AVVVCDKETKASKGFGFVTFADVED 89
Query: 351 VALATQLLDGTPFRPDGK-IPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
A + L+G+ F DGK I + +++ + E+ + + V + ++ KK +E+ G G
Sbjct: 90 AESALKELNGSKF--DGKVIRVDYAESRKREIDEK-VGRSVPTAASRESKKQKEEERGQG 146
Query: 410 GRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ- 468
+P +I+R + P ++ E+L + G V + + + +
Sbjct: 147 --------LPPKLIVRNL--PWSIKEPEDLNVLFRS--------FGKVKFATLPKRNGKL 188
Query: 469 -GVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
G + + RK+A++ ++++NG+ GRQ+
Sbjct: 189 SGFGFITMRGRKNAERALQMINGKEIDGRQL 219
>gi|413934153|gb|AFW68704.1| hypothetical protein ZEAMMB73_109361 [Zea mays]
gi|413934154|gb|AFW68705.1| hypothetical protein ZEAMMB73_109361 [Zea mays]
Length = 536
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 445 LLLKNMFDPS-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKN-TAGFVYLQFDSVTAA 502
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K + ++GRWF G+ I A+
Sbjct: 503 GKAQQALHGRWFAGKMITAT 522
>gi|353238006|emb|CCA69965.1| related to splicing factor HCC1 [Piriformospora indica DSM 11827]
Length = 543
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 102/250 (40%), Gaps = 34/250 (13%)
Query: 276 NKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
N PP +YV L +T EE+ +VF G ++ + ++ D TG
Sbjct: 292 NLPPGVTATSHGPMQLYVGSLHFQLTEEEIKQVFEPFGELE--------FVDLHRDPATG 343
Query: 336 MKKGDALVTYLKEPSVALATQLLDGTPFR---------------PDGKIPMSV-----TQ 375
KG + Y + +A + +DG G + MS+ T
Sbjct: 344 RSKGYCFIQYRRPEDAKMALEQMDGFELAGRQLRVNTVHEKGVAAPGGVGMSLRAMGNTA 403
Query: 376 AKFEQKGERF--IAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEM 433
++ G +++Q +K + + + + G + + K ++L+ F PAE
Sbjct: 404 ESLDEGGGNLNAVSRQALMQKLARTNEPQIPLPVSGPKINIKSQETRCILLKNAFDPAE- 462
Query: 434 RADENLRSELEADVQEECVKI--GPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGR 491
D + L+ DV EC G V+ + V E +G + ++ A++ + ++GR
Sbjct: 463 ETDPDWDKNLQEDVVAECESKFQGRVEKI-VVEKDSKGEIYIQCDSIDMAKRAVANLDGR 521
Query: 492 WFGGRQIHAS 501
WFGGRQI AS
Sbjct: 522 WFGGRQISAS 531
>gi|297609178|ref|NP_001062815.2| Os09g0299500 [Oryza sativa Japonica Group]
gi|255678748|dbj|BAF24729.2| Os09g0299500 [Oryza sativa Japonica Group]
Length = 194
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLK 347
N VYV LP + + F G++K+D T +P+I IY DK T KGDA VTY +
Sbjct: 11 NGSVYVCNLPPGTDETMLADYFGTIGLLKKDKRTGRPKIWIYRDKVTNEPKGDATVTY-E 69
Query: 348 EPSVALA 354
+P A A
Sbjct: 70 DPHAASA 76
>gi|356536386|ref|XP_003536719.1| PREDICTED: RNA-binding protein 39-like [Glycine max]
Length = 597
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 406 LGWGGRDDAKLTIPA---------TVILRFMFTPA-EMRADENLRSELEADVQEECVKIG 455
L GG TIP+ ++L+ MF P E D +L +++ DV+ EC K G
Sbjct: 480 LAGGGLQIPTATIPSIDPIGVPSECLMLKNMFDPKDETEPDFDL--DIKEDVEAECSKFG 537
Query: 456 PVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + V + G V +RF+D + A + ++GRWF G+ I AS
Sbjct: 538 ALKHIYV-DKKSAGFVYLRFEDTQSAISAQQALHGRWFAGKMITAS 582
>gi|121700823|ref|XP_001268676.1| RNA splicing factor (Pad-1), putative [Aspergillus clavatus NRRL 1]
gi|119396819|gb|EAW07250.1| RNA splicing factor (Pad-1), putative [Aspergillus clavatus NRRL 1]
Length = 566
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P+E + ++ +LE DV+ EC K G V + + N QG + ++F +
Sbjct: 471 VLLRNMFDPSEEEGEAWIK-DLEDDVRAECEEKYGHVVHIALDPNS-QGDIYLKFDRVQG 528
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 529 GENAIKGLNGRFFGGRQITA 548
>gi|425770690|gb|EKV09156.1| RNA splicing factor (Pad-1), putative [Penicillium digitatum Pd1]
gi|425772037|gb|EKV10463.1| RNA splicing factor (Pad-1), putative [Penicillium digitatum PHI26]
Length = 556
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF PAE + ++ ELE DV+ EC K G V + + + QG + ++F +
Sbjct: 461 VLLRNMFDPAEETGEAWIK-ELEEDVRSECEEKYGHVVHISL-DVSSQGDIYLKFDRVQG 518
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGG+QI A
Sbjct: 519 GENAIKGLNGRYFGGKQITA 538
>gi|145231104|ref|XP_001389816.1| RNA-binding protein rsd1 [Aspergillus niger CBS 513.88]
gi|134055944|emb|CAK37421.1| unnamed protein product [Aspergillus niger]
gi|350638781|gb|EHA27137.1| hypothetical protein ASPNIDRAFT_46267 [Aspergillus niger ATCC 1015]
Length = 570
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 475 VMLRNMFDPNEEEGEAWIK-ELEDDVRAECEDKYGHVVHIALDPNS-QGDIYLKFDRVQG 532
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 533 GENAIKGLNGRFFGGRQITA 552
>gi|358370208|dbj|GAA86820.1| RNA splicing factor (Pad-1) [Aspergillus kawachii IFO 4308]
Length = 571
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 476 VMLRNMFDPNEEEGEAWIK-ELEDDVRAECEDKYGHVVHIALDPNS-QGDIYLKFDRVQG 533
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGGRQI A
Sbjct: 534 GENAIKGLNGRFFGGRQITA 553
>gi|219120589|ref|XP_002181030.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407746|gb|EEC47682.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 687
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 6/94 (6%)
Query: 410 GRDDAKL-----TIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE 464
G DAK+ T V++R MF + D +++ + ++EC K G + SV V
Sbjct: 581 GSPDAKIVGNADTPTKNVLIRNMFD-KDKETDPGWEKDIKEEFEDECSKYGTITSVTVMH 639
Query: 465 NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
P G + F AQKC + GRWF RQ+
Sbjct: 640 MEPGGKIYASFDSVIGAQKCASSLAGRWFDKRQL 673
>gi|195434196|ref|XP_002065089.1| GK15272 [Drosophila willistoni]
gi|194161174|gb|EDW76075.1| GK15272 [Drosophila willistoni]
Length = 612
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 91/241 (37%), Gaps = 43/241 (17%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T + + +F +P K I++ +D ETG KG +TY
Sbjct: 355 LYVGSLHFNITEDMLRGIF--------EPFGKIDVIQLIMDTETGRSKGYGFITYHNADD 406
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQA--KFEQKGERFIAKQVDSKKKKKLKKVEEKMLGW 408
A + L+G F G+ PM V + + ++D + L L +
Sbjct: 407 AKKALEQLNG--FELAGR-PMKVGNVTERLDMNTTSLDTDEMD-RTGIDLGATGRLQLMF 462
Query: 409 GGRDDAKLTIP--------------------------AT--VILRFMFTPAEMRADENLR 440
+ A L +P AT IL MF PA +
Sbjct: 463 KLAEGAGLAVPQAAANALLATAPQPAPVQQQQQTPSIATQCFILSNMFDPA-TETNTTWD 521
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
SE+ DV EEC K G V + V G V V+ A + ++GRWF GR I A
Sbjct: 522 SEIRDDVLEECAKHGGVLHIHVDTASSTGTVYVKCPSTTTAVLAVNALHGRWFAGRVITA 581
Query: 501 S 501
+
Sbjct: 582 A 582
>gi|412990165|emb|CCO19483.1| predicted protein [Bathycoccus prasinos]
Length = 495
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 96/226 (42%), Gaps = 26/226 (11%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+V GLP +T ++ E+ G I R + D++TG KG V Y P+
Sbjct: 273 VFVGGLPYFLTEAQIRELLEAFGPIT--------RFDLVRDRDTGGSKGYGFVVYRDGPA 324
Query: 351 VA-LATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
+ +A Q L G G ++V +A ER +Q + ++++ + G
Sbjct: 325 ITDIAIQGLHGMQM---GDKQLTVRRAN--ATLERMQQEQRAALQQQQQQHQLLGNGGAA 379
Query: 410 GRDDAKLTIPAT----------VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDS 459
+ + PA ++L+ M E D + D QEEC K G V
Sbjct: 380 AQFLPQSAAPAAPEVDENATECLVLKNMGIKDEELNDPEEYEIIVEDTQEECEKFGKVLG 439
Query: 460 VKVCE--NHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASED 503
+K+ + + GVV VRF+ + A+K + +NGR F G + A D
Sbjct: 440 MKIPKPPSKSAGVVFVRFETAESARKARKSLNGRKFAGNIVSAQYD 485
>gi|324503285|gb|ADY41429.1| Splicing factor U2AF 65 kDa subunit [Ascaris suum]
Length = 522
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 88/227 (38%), Gaps = 34/227 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP + E++ E+ S G +K + D TG+ KG A YL
Sbjct: 318 IFIGGLPSYLNAEQVKELLSSFGQLKA--------FNLVTDVSTGVSKGYAFAEYLDPSL 369
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVD-SKKKKKLKKVEEKMLGWG 409
A L+G Q G++ + Q+ + + + +
Sbjct: 370 TDQAIAGLNGM------------------QLGDKNLVVQLSCANARAAMSTTAFPQIQVA 411
Query: 410 GRDDAKLTIPATVILRFM-FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP- 467
G D + P T +L M E ++ ++ D++EEC K G V S++V + P
Sbjct: 412 GIDLSHGAGPPTEVLCLMNMVTEEELKEDEEYEDILEDIREECAKYGFVKSIEVPRSIPG 471
Query: 468 -----QGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGLVNH 509
G V V F +++ QK + GR F R + S D + H
Sbjct: 472 VDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDLYH 518
>gi|293336330|ref|NP_001169267.1| uncharacterized protein LOC100383130 [Zea mays]
gi|223975957|gb|ACN32166.1| unknown [Zea mays]
gi|413934151|gb|AFW68702.1| hypothetical protein ZEAMMB73_109361 [Zea mays]
Length = 318
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 227 LLLKNMFDPS-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKNT-AGFVYLQFDSVTAA 284
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K + ++GRWF G+ I A+
Sbjct: 285 GKAQQALHGRWFAGKMITAT 304
>gi|224034337|gb|ACN36244.1| unknown [Zea mays]
Length = 410
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 319 LLLKNMFDPS-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKN-TAGFVYLQFDSVTAA 376
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K + ++GRWF G+ I A+
Sbjct: 377 GKAQQALHGRWFAGKMITAT 396
>gi|413934152|gb|AFW68703.1| hypothetical protein ZEAMMB73_109361 [Zea mays]
Length = 300
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQ+EC K G V + V +N G V ++F A
Sbjct: 209 LLLKNMFDPS-LETDPDFDLDIRDDVQDECSKFGAVKHIFVDKNT-AGFVYLQFDSVTAA 266
Query: 482 QKCIELMNGRWFGGRQIHAS 501
K + ++GRWF G+ I A+
Sbjct: 267 GKAQQALHGRWFAGKMITAT 286
>gi|449676978|ref|XP_002157894.2| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like [Hydra
magnipapillata]
Length = 623
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 80/193 (41%), Gaps = 32/193 (16%)
Query: 216 LKEEEVFPTVNVTDDLANDEVGKEKLNSTEEKVNSA---DNVVEEKHNGKRKQPDKQVEK 272
+K ++ PT +V + EV + K EEKV + G+RK
Sbjct: 91 MKNKDSIPTPSVGAGVTKSEVAEIKQGPHEEKVKQLLDRTGYTLDITTGQRKY------- 143
Query: 273 KEANKPPDSW-------FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPR 325
PP W E K + +++ LP D+ +E+V +F G I +
Sbjct: 144 ---GGPPPHWEGPVPGTGEKKFCSQIFIGKLPRDMYEDELVPMFEPHGTIYD-------- 192
Query: 326 IKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF 385
++I VD +G+ KG A TY + + LA + +DG R + + ++QA F
Sbjct: 193 LRIMVDPFSGLNKGFAFCTYTTKEAATLAVKEMDGKQVRDGKTLGVCLSQA----NNRLF 248
Query: 386 IAKQVDSKKKKKL 398
+ SK K+++
Sbjct: 249 VGSIPKSKTKQQI 261
>gi|380478594|emb|CCF43506.1| CC1-like family splicing factor [Colletotrichum higginsianum]
Length = 565
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF P E DE ++ ELE DV++E K G V + V N +G + ++F +
Sbjct: 470 VVLHNMFDPEEEEGDEWVK-ELEDDVRQEAETKYGRVVHISVDPNS-KGDIYLKFDKVQG 527
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I +NGR+FGGR I AS
Sbjct: 528 GENAIRGLNGRYFGGRMIDAS 548
>gi|294903660|ref|XP_002777532.1| hypothetical protein Pmar_PMAR005675 [Perkinsus marinus ATCC 50983]
gi|239885279|gb|EER09348.1| hypothetical protein Pmar_PMAR005675 [Perkinsus marinus ATCC 50983]
Length = 92
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 31/42 (73%)
Query: 288 NTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIY 329
N ++Y++GLP+D+T+EE+ +F G IK D ET + +IKIY
Sbjct: 36 NPNIYISGLPNDITLEELYNLFKLAGAIKTDMETGRYKIKIY 77
>gi|324507842|gb|ADY43315.1| RNA-binding protein 39 [Ascaris suum]
Length = 618
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 416 LTIPATV----ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVV 471
L IP +L MF PA+ EN E+ DV EEC K G V + V P G V
Sbjct: 496 LAIPPIATQCFLLSNMFDPAQ-ETGENWADEVRDDVIEECAKNGGVVHIFVDRESPSGNV 554
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
V+ A K + ++GRWF G+ I A+
Sbjct: 555 YVKCPSVAAAYKSVNSLHGRWFSGKVITAN 584
>gi|326438129|gb|EGD83699.1| hypothetical protein PTSG_04304 [Salpingoeca sp. ATCC 50818]
Length = 541
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
Query: 424 LRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQK 483
L MF PA D + ++ +V E K GPV + V + PQG V ++FKD + +
Sbjct: 433 LSNMFDPAA-ETDPDWHEDVRDEVLEASTKYGPVFHIFVDKTAPQGQVFLKFKDSVASAQ 491
Query: 484 CIELMNGRWFGGRQIHA 500
+ MNGR F GR + A
Sbjct: 492 AYQAMNGRLFDGRTVKA 508
>gi|326504734|dbj|BAK06658.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 592
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQEEC K G V + V + H G V + F A
Sbjct: 497 LLLKNMFDPS-VETDPDFDLDIRDDVQEECSKFGVVKHIFV-DKHTAGFVYLHFDSATAA 554
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 555 ASAQRSLHGRWFAGKMITAT 574
>gi|326523595|dbj|BAJ92968.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 592
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQEEC K G V + V + H G V + F A
Sbjct: 497 LLLKNMFDPS-VETDPDFDLDIRDDVQEECSKFGVVKHIFV-DKHTAGFVYLHFDSATAA 554
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 555 ASAQRSLHGRWFAGKMITAT 574
>gi|326494786|dbj|BAJ94512.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 592
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ + D + ++ DVQEEC K G V + V + H G V + F A
Sbjct: 497 LLLKNMFDPS-VETDPDFDLDIRDDVQEECSKFGVVKHIFV-DKHTAGFVYLHFDSATAA 554
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 555 ASAQRSLHGRWFAGKMITAT 574
>gi|66810518|ref|XP_638966.1| hypothetical protein DDB_G0283687 [Dictyostelium discoideum AX4]
gi|60467589|gb|EAL65610.1| hypothetical protein DDB_G0283687 [Dictyostelium discoideum AX4]
Length = 727
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 45/83 (54%), Gaps = 2/83 (2%)
Query: 418 IPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKD 477
+ +ILR MF P E +EN ++ ++ EC G + + + +N +G + +R+ +
Sbjct: 633 VSTCLILRNMFDP-ETETEENWDQDILSETHAECSTFGKIMHIYLDKN-TKGCIYIRYDN 690
Query: 478 RKDAQKCIELMNGRWFGGRQIHA 500
+ K I+ ++GRWF +QI A
Sbjct: 691 NESPLKAIQKLHGRWFSKKQIIA 713
>gi|18398260|ref|NP_565399.1| splicing factor, CC1-like protein [Arabidopsis thaliana]
gi|20197280|gb|AAC64224.2| putative splicing factor [Arabidopsis thaliana]
gi|133778824|gb|ABO38752.1| At2g16940 [Arabidopsis thaliana]
gi|330251464|gb|AEC06558.1| splicing factor, CC1-like protein [Arabidopsis thaliana]
Length = 561
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 52/87 (59%), Gaps = 3/87 (3%)
Query: 416 LTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
+ +P+ ++L+ MF P+ +++ +++ DV+EEC K G ++ + V +N G V +R
Sbjct: 460 IGVPSECLLLKNMFDPS-TETEDDFDEDIKEDVKEECSKFGKLNHIFVDKNS-VGFVYLR 517
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHAS 501
F++ + A ++GRWF G+ I A+
Sbjct: 518 FENAQAAIGAQRALHGRWFAGKMITAT 544
>gi|21553746|gb|AAM62839.1| putative splicing factor [Arabidopsis thaliana]
Length = 560
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 52/87 (59%), Gaps = 3/87 (3%)
Query: 416 LTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
+ +P+ ++L+ MF P+ +++ +++ DV+EEC K G ++ + V +N G V +R
Sbjct: 459 IGVPSECLLLKNMFDPS-TETEDDFDEDIKEDVKEECSKFGKLNHIFVDKNS-VGFVYLR 516
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHAS 501
F++ + A ++GRWF G+ I A+
Sbjct: 517 FENAQAAIGAQRALHGRWFAGKMITAT 543
>gi|390343846|ref|XP_785870.2| PREDICTED: uncharacterized protein LOC580739 [Strongylocentrotus
purpuratus]
Length = 594
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 291 VYVTG-LPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEP 349
++++G LP+ V +++ E F + G++K D T K RI IY +K+T G+A VTY
Sbjct: 332 IFISGMLPNSVDEDKIKEHFGQIGVVKLDKRTNKDRIWIYKNKQTHEPTGEATVTYEDPH 391
Query: 350 SVALATQLLDGTPF 363
+ A +G PF
Sbjct: 392 TATSAIDWFNGKPF 405
>gi|340374709|ref|XP_003385880.1| PREDICTED: hypothetical protein LOC100638210 [Amphimedon
queenslandica]
Length = 443
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 1/69 (1%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V+VTG+ D T E + +VFS G IKED T IK+Y DK+ KG+ +VTY+
Sbjct: 217 VFVTGIALDATFENIRDVFSSVGNIKEDHRTGHFMIKMYEDKQRKF-KGECMVTYVSTEG 275
Query: 351 VALATQLLD 359
A + L+
Sbjct: 276 ADGAIKFLN 284
>gi|338710585|ref|XP_003362388.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q [Equus
caballus]
Length = 527
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|169767236|ref|XP_001818089.1| RNA-binding protein rsd1 [Aspergillus oryzae RIB40]
gi|238484073|ref|XP_002373275.1| RNA splicing factor (Pad-1), putative [Aspergillus flavus NRRL3357]
gi|83765944|dbj|BAE56087.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220701325|gb|EED57663.1| RNA splicing factor (Pad-1), putative [Aspergillus flavus NRRL3357]
gi|391870727|gb|EIT79903.1| transcriptional coactivator CAPER [Aspergillus oryzae 3.042]
Length = 568
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+LR MF P E + ++ ELE DV+ EC K G V + + N QG + ++F +
Sbjct: 473 VMLRNMFDPGEEEGESWIK-ELEDDVRAECEEKYGHVVHIALDPNS-QGDIYLKFDRVQG 530
Query: 481 AQKCIELMNGRWFGGRQIHA 500
+ I+ +NGR+FGG+QI A
Sbjct: 531 GENAIKGLNGRFFGGKQITA 550
>gi|324505367|gb|ADY42309.1| Splicing factor U2AF 35 kDa subunit [Ascaris suum]
Length = 275
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K GP+D + VC+N H G V V+F++ +DA KC++ ++ RWF G I+A
Sbjct: 101 KFGPIDEMNVCDNIGEHMIGNVYVKFENEEDADKCVKGLDDRWFNGCPIYA 151
>gi|449454219|ref|XP_004144853.1| PREDICTED: RNA-binding protein 39-like [Cucumis sativus]
gi|449506986|ref|XP_004162902.1| PREDICTED: RNA-binding protein 39-like [Cucumis sativus]
Length = 562
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 401 VEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV 460
V +LG G T ++L+ MF P ++ ++ DV+EEC + G + +
Sbjct: 456 VTANLLGVG-------TPSECLLLKNMFDPT-AESEPTFDLDIRDDVEEECSRFGKLKHI 507
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
V N G V +RF+ + A + +NGRWF G+ I A+
Sbjct: 508 YVDRNS-AGFVYLRFEKSESAMEAQRALNGRWFAGKMIGAT 547
>gi|345326440|ref|XP_001510154.2| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 2
[Ornithorhynchus anatinus]
Length = 527
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGSIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQIVEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|384493179|gb|EIE83670.1| hypothetical protein RO3G_08375 [Rhizopus delemar RA 99-880]
Length = 475
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 13/91 (14%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCE-------NHPQGVVLV 473
T++L M P E+ D+ L+ E EEC K G V+ + E H + V
Sbjct: 372 TILLTNMVGPGEV--DDMLQEE----TAEECSKYGKVERCLIFEVPRGQVPEHKAVRIFV 425
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
+F D + A++ I+ +NGR+FGGR + AS D
Sbjct: 426 KFSDVESAKRAIQDLNGRFFGGRSVSASYYD 456
>gi|289724641|gb|ADD18299.1| poly-U binding splicing factor [Glossina morsitans morsitans]
Length = 611
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 49/101 (48%), Gaps = 20/101 (19%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG------------ 469
+ILR M P ++ DE L+ E +QEEC K G V V + N Q
Sbjct: 513 IILRNMVGPEDV--DETLQEE----IQEECTKFGTVSRV-IIFNEKQTENEDDDEAEIIV 565
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS-EDDGLVNH 509
+ V F +A + E +NGR+FGGR++HA D L +H
Sbjct: 566 KIFVEFSSSAEAVRGKEALNGRFFGGRRVHAELYDQSLFDH 606
>gi|50555327|ref|XP_505072.1| YALI0F06292p [Yarrowia lipolytica]
gi|49650942|emb|CAG77879.1| YALI0F06292p [Yarrowia lipolytica CLIB122]
Length = 203
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 35/56 (62%), Gaps = 3/56 (5%)
Query: 446 DVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
D+ E K+G ++ + VCEN H G +RF++++DAQ+ +L N RW+ GR +
Sbjct: 85 DIYMEAAKLGRLEEMIVCENGNDHLTGNTYLRFRNQEDAQRACDLFNTRWYAGRPV 140
>gi|320163233|gb|EFW40132.1| RNA binding domain-containing protein 39 [Capsaspora owczarzaki
ATCC 30864]
Length = 600
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 99/238 (41%), Gaps = 38/238 (15%)
Query: 286 KVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTY 345
K T VYV L +T ++ +FS CG + I + ++++ G KG A V +
Sbjct: 352 KQQTRVYVGSLDFALTEADVKSLFSPCGEV----------ISVTLNRDNGKSKGYAFVQF 401
Query: 346 LKEPSVALATQL----LDGTPFR------PDGKI---PMSVTQAKFEQKGERFIAKQVDS 392
+ LA +L + G P + P+G+ P + G +
Sbjct: 402 ADAGAAKLAMELNGVEVAGRPLKVNFATDPEGRFLNAPNAGPPMGMPPMGLPPMGMPPMG 461
Query: 393 KKKKKLKKVEEKMLGWGGRD---DAKLTIPAT----VILRFMFTPAEMRADENLRSELEA 445
L + +G D A L+ A ++L+ MF PA+ R D + ++
Sbjct: 462 LPPMGLPPMGLPPMGLPPMDANPTADLSNVAIETQFLLLKNMFDPAQER-DPEWQLDIRD 520
Query: 446 DVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIEL---MNGRWFGGRQIHA 500
V +ECVK G V + V N P G V ++F + CI + M+GRWF G+ I A
Sbjct: 521 TVLDECVKHGSVTHIFVDANSP-GFVFIKFSS---VEACIAVQRTMHGRWFAGKLITA 574
>gi|405969065|gb|EKC34075.1| Splicing factor U2AF 26 kDa subunit [Crassostrea gigas]
Length = 269
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 427 MFTPAEMRADEN----LRSELEADVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRK 479
+FT E + D+ E +V E K G ++ + VC+N H G V V+F+ +
Sbjct: 85 VFTELEDKMDDTELQQFYDEFFEEVYVELEKYGEIEEMNVCDNLGDHLVGNVYVKFRYEE 144
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
DA+K + +N RWF GR +HA
Sbjct: 145 DAEKAVNELNNRWFNGRPVHA 165
>gi|62857615|ref|NP_001016788.1| RNA binding motif protein 39 [Xenopus (Silurana) tropicalis]
gi|62201342|gb|AAH93451.1| RNA binding motif protein 39 [Xenopus (Silurana) tropicalis]
gi|89273890|emb|CAJ83908.1| RNA-binding region (RNP1, RRM) containing 2 [Xenopus (Silurana)
tropicalis]
Length = 542
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 35/61 (57%)
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
SE++ DV EEC K G V + V +N PQG V V+ A + ++GRWF G+ I A
Sbjct: 456 SEIKEDVIEECNKHGGVVHIYVDKNSPQGNVYVKCPTIASAIAAVNALHGRWFAGKMITA 515
Query: 501 S 501
+
Sbjct: 516 A 516
>gi|395857932|ref|XP_003801334.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 2
[Otolemur garnettii]
Length = 527
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSIYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|334184263|ref|NP_001189538.1| splicing factor, CC1-like protein [Arabidopsis thaliana]
gi|330251466|gb|AEC06560.1| splicing factor, CC1-like protein [Arabidopsis thaliana]
Length = 610
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 52/87 (59%), Gaps = 3/87 (3%)
Query: 416 LTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
+ +P+ ++L+ MF P+ +++ +++ DV+EEC K G ++ + V +N G V +R
Sbjct: 509 IGVPSECLLLKNMFDPS-TETEDDFDEDIKEDVKEECSKFGKLNHIFVDKN-SVGFVYLR 566
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHAS 501
F++ + A ++GRWF G+ I A+
Sbjct: 567 FENAQAAIGAQRALHGRWFAGKMITAT 593
>gi|160332383|sp|Q6IQE0.2|PU60B_DANRE RecName: Full=Poly(U)-binding-splicing factor PUF60-B
Length = 516
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 409 GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
GR A + ++LR M P E++ +LE +V EEC K G V+ V + + Q
Sbjct: 408 AGRQAAIQSKSTVMVLRNMVGP------EDIDDDLEGEVMEECGKYGAVNRVIIYQER-Q 460
Query: 469 G---------VVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
G + V F D + K I+ +N RWF GR++ A
Sbjct: 461 GEEDDAEIIVKIFVEFSDAGEMNKAIQALNNRWFAGRKVVA 501
>gi|426353903|ref|XP_004044414.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q [Gorilla
gorilla gorilla]
Length = 527
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|348503003|ref|XP_003439056.1| PREDICTED: RNA-binding protein 39-like [Oreochromis niloticus]
Length = 498
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 103/256 (40%), Gaps = 57/256 (22%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T E + +F G I+ I++ +D ETG KG +++
Sbjct: 229 LYVGSLHFNITEEMLRGIFEPFGKIEG--------IQLMMDSETGRSKGYGFISFADAEC 280
Query: 351 VALATQLLDGTPFRPDGKIPMSV----------TQAKFEQKGE--------------RFI 386
A + L+G F G+ PM V T + F E + +
Sbjct: 281 AKKALEQLNG--FELAGR-PMKVGHVTERSDSSTASSFLDNDELERTGIDLGTTGRLQLM 337
Query: 387 AKQVDSKKKKKLKKVEEKM-----LGWGG----------RDDAKLTIP----ATVILRF- 426
A+ + K ++ + + +GG L +P AT L+
Sbjct: 338 ARLAEGTGLKIPPAAQQALQMTGSIPFGGIGAPAAVPTPAPSQALNLPSQPLATHCLQLS 397
Query: 427 -MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCI 485
+F P + D + +E++ DV EEC K G + + V +N PQG V V+ A +
Sbjct: 398 NLFNP-QAENDPSWAAEIQDDVIEECNKHGGIVHIYVDKNSPQGNVYVKCPSIPAAMATV 456
Query: 486 ELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 457 NALHGRWFAGKMITAA 472
>gi|193613005|ref|XP_001950928.1| PREDICTED: splicing factor 45-like [Acyrthosiphon pisum]
gi|193629687|ref|XP_001945492.1| PREDICTED: splicing factor 45-like [Acyrthosiphon pisum]
Length = 404
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCE----NHPQGV-VLVR 474
V+LR M P E+ D LE +V++EC K G V+ V + E +H + V + V
Sbjct: 310 VVLLRNMVGPGEVDQD------LEPEVKDECNTKYGDVNKVVIYEVPNVDHEEAVRIFVE 363
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 364 FKRIESAIKAVVDLNGRFFGGRQVKA 389
>gi|344264677|ref|XP_003404418.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 4
[Loxodonta africana]
Length = 524
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 138 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 189
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 190 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 245
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 246 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 304
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 305 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 360
Query: 496 RQI 498
I
Sbjct: 361 ENI 363
>gi|389644626|ref|XP_003719945.1| RNA splicing factor Pad-1 [Magnaporthe oryzae 70-15]
gi|351639714|gb|EHA47578.1| RNA splicing factor Pad-1 [Magnaporthe oryzae 70-15]
Length = 564
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE D+ ++ ELE DV++E K G V + + N QG + ++F +
Sbjct: 469 VVLHNMFDPAEEEGDDWVK-ELEEDVRQEAEEKYGHVVHISLDPNS-QGDIYLKFDRVQG 526
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I A+
Sbjct: 527 GENAIKGLNGRYFGGRMITAA 547
>gi|228008398|ref|NP_001153146.1| heterogeneous nuclear ribonucleoprotein Q isoform 3 [Homo sapiens]
gi|291396566|ref|XP_002714607.1| PREDICTED: synaptotagmin binding, cytoplasmic RNA interacting
protein isoform 3 [Oryctolagus cuniculus]
gi|296198706|ref|XP_002746829.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 3
[Callithrix jacchus]
gi|301775583|ref|XP_002923212.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 2
[Ailuropoda melanoleuca]
gi|332218417|ref|XP_003258352.1| PREDICTED: uncharacterized protein LOC100579672 [Nomascus
leucogenys]
gi|348578334|ref|XP_003474938.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 5
[Cavia porcellus]
gi|397504634|ref|XP_003822889.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 2 [Pan
paniscus]
gi|402867570|ref|XP_003897916.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 2
[Papio anubis]
gi|426234736|ref|XP_004011348.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q isoform 5
[Ovis aries]
gi|21619168|gb|AAH32643.1| SYNCRIP protein [Homo sapiens]
gi|119569013|gb|EAW48628.1| synaptotagmin binding, cytoplasmic RNA interacting protein, isoform
CRA_f [Homo sapiens]
gi|296484070|tpg|DAA26185.1| TPA: synaptotagmin binding, cytoplasmic RNA interacting protein
isoform 2 [Bos taurus]
gi|325464513|gb|ADZ16027.1| synaptotagmin binding, cytoplasmic RNA interacting protein
[synthetic construct]
Length = 527
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 109/243 (44%), Gaps = 32/243 (13%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKL----KKVEEKM---LGWGGRDDAKLTIPATVILRF--MFTPAEMRA--D 436
SK K+++ KV E + + + DD K L + T A+++
Sbjct: 249 SIPKSKTKEQILEEFSKVTEGLTDVILYHQPDDKKKN-RGFCFLEYEDHKTAAQVKVLFV 307
Query: 437 ENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGG 495
NL + + ++ E+ + G ++ VK +++ + F +R A K +E MNG+ G
Sbjct: 308 RNLANTVTEEILEKAFSQFGKLERVKKLKDY----AFIHFDERDGAVKAMEEMNGKDLEG 363
Query: 496 RQI 498
I
Sbjct: 364 ENI 366
>gi|75338884|sp|Q9ZR40.1|U2A2B_NICPL RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=NpU2AF65b; AltName: Full=U2 auxiliary factor 65 kDa
subunit B; AltName: Full=U2 small nuclear
ribonucleoprotein auxiliary factor large subunit B;
Short=U2 snRNP auxiliary factor large subunit B
gi|3850821|emb|CAA77135.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
plumbaginifolia]
Length = 573
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 100/255 (39%), Gaps = 50/255 (19%)
Query: 265 QPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKP 324
QP+ + P S L+ ++V GLP T ++ E+ G ++
Sbjct: 335 QPNPNLNLAAVGLSPGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRG------- 387
Query: 325 RIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSV-------TQAK 377
+ D+ETG KG A Y +A L+G G ++V TQ K
Sbjct: 388 -FDLVKDRETGNSKGYAFCVYQDVSVTDIACAALNGIKM---GDKTLTVRRANQGTTQPK 443
Query: 378 FEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFM-----FTPAE 432
EQ+ A+Q + ++ L+ PAT+ + + + E
Sbjct: 444 PEQESVLLHAQQQIALQRLMLQ-------------------PATLATKVLSLTEVISADE 484
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG-------VVLVRFKDRKDAQKCI 485
+ DE+ + LE D++ EC K G + +V + P G V + + D + K
Sbjct: 485 LNDDEDYQDILE-DMRTECGKFGSLVNVVIPRPSPNGEPTPGVGKVFLEYADVDSSSKAR 543
Query: 486 ELMNGRWFGGRQIHA 500
+ +NGR FGG Q+ A
Sbjct: 544 QSLNGRKFGGNQVVA 558
>gi|402586021|gb|EJW79960.1| U2(RNU2) small nuclear RNA auxiliary factor 1 [Wuchereria
bancrofti]
Length = 215
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K GP+D + VC+N H G V V+F++ +DA KC++ + RWF G ++A
Sbjct: 100 KFGPIDEMNVCDNIGEHMIGNVYVKFENEEDADKCVKGLENRWFNGSPVYA 150
>gi|324508128|gb|ADY43435.1| RNA-binding protein 39 [Ascaris suum]
Length = 535
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 416 LTIPATV----ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVV 471
L IP +L MF PA+ EN E+ DV EEC K G V + V P G V
Sbjct: 413 LAIPPIATQCFLLSNMFDPAQ-ETGENWADEVRDDVIEECAKNGGVVHIFVDRESPSGNV 471
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
V+ A K + ++GRWF G+ I A+
Sbjct: 472 YVKCPSVAAAYKSVNSLHGRWFSGKVITAN 501
>gi|357113992|ref|XP_003558785.1| PREDICTED: RNA-binding protein 39-like [Brachypodium distachyon]
Length = 551
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + +++ DV+EEC K G + + V +N G V +RF A
Sbjct: 461 LLLKNMFDPA-VETDPDFDLDIKDDVREECSKFGQIRHIFVDKNT-AGFVYLRFDSITAA 518
Query: 482 QKCIELMNGRWFGGRQIHAS 501
+ + GRWF G+ I A+
Sbjct: 519 MGAQKALQGRWFAGKMITAT 538
>gi|4530579|gb|AAD22102.1| Pad-1 [Neurospora crassa]
Length = 575
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE EN ELE DV+EE K G V + + N G + ++F +
Sbjct: 475 VVLHNMFDPAEEEG-ENWIKELEDDVREEAEAKYGHVVHISLDPNS-AGDIYLKFDKVQG 532
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I A+
Sbjct: 533 GENAIKGLNGRYFGGRMITAA 553
>gi|399218373|emb|CCF75260.1| unnamed protein product [Babesia microti strain RI]
Length = 485
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 2/92 (2%)
Query: 412 DDAKLTIPATVILRFMF--TPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG 469
+ A + + ++L MF T ++ D ++L DV+ EC K G V V + ++ P G
Sbjct: 379 NSATVGVTENIVLSNMFSATDPQIMEDPEFFTDLVEDVKSECKKYGNVLQVYINKSVPDG 438
Query: 470 VVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
+V V+F + A + +N R+FGG I A+
Sbjct: 439 MVWVKFATVEQAVAAFQSLNDRFFGGNSISAA 470
>gi|440470687|gb|ELQ39749.1| RNA-binding protein rsd1 [Magnaporthe oryzae Y34]
Length = 566
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE D+ ++ ELE DV++E K G V + + N QG + ++F +
Sbjct: 469 VVLHNMFDPAEEEGDDWVK-ELEEDVRQEAEEKYGHVVHISLDPNS-QGDIYLKFDRVQG 526
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I A+
Sbjct: 527 GENAIKGLNGRYFGGRMITAA 547
>gi|50344898|ref|NP_001002121.1| poly(U)-binding-splicing factor PUF60-B [Danio rerio]
gi|47940403|gb|AAH71467.1| Poly-U binding splicing factor b [Danio rerio]
Length = 502
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 409 GGRDDAKLTIPATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQ 468
GR A + ++LR M P E++ +LE +V EEC K G V+ V + + Q
Sbjct: 394 AGRQAAIQSKSTVMVLRNMVGP------EDIDDDLEGEVMEECGKYGAVNRVIIYQER-Q 446
Query: 469 G---------VVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
G + V F D + K I+ +N RWF GR++ A
Sbjct: 447 GEEDDAEIIVKIFVEFSDAGEMNKAIQALNNRWFAGRKVVA 487
>gi|209155056|gb|ACI33760.1| RNA-binding protein 39 [Salmo salar]
Length = 535
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 34/60 (56%)
Query: 442 ELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
E++ DV EEC K G + + V +N PQG V V+ A + ++GRWF G+ I A+
Sbjct: 450 EIQDDVMEECNKHGGIVHIYVDKNSPQGNVYVKCPTIPTAMAAVNALHGRWFAGKMITAA 509
>gi|443697246|gb|ELT97781.1| hypothetical protein CAPTEDRAFT_161525 [Capitella teleta]
Length = 346
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Query: 442 ELEADVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
E DV E +G + KVC N H +G V ++++ DA++C+ NGRW+ GRQ+
Sbjct: 34 EFYEDVTPEFRALGRLVQFKVCCNYEPHLRGNVYIQYESESDAERCLSAFNGRWYAGRQL 93
>gi|18376336|emb|CAD21082.1| RNA splicing factor Pad-1 [Neurospora crassa]
Length = 571
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE EN ELE DV+EE K G V + + N G + ++F +
Sbjct: 476 VVLHNMFDPAEEEG-ENWIKELEDDVREEAEAKYGHVVHISLDPNS-AGDIYLKFDKVQG 533
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I A+
Sbjct: 534 GENAIKGLNGRYFGGRMITAA 554
>gi|440791774|gb|ELR13012.1| RNA recognition motif domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 321
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 24/110 (21%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIK--------------------EDPETKKP---RIK 327
VYV+GLP + TV+ + E+F GIIK + + K+P +I
Sbjct: 31 VYVSGLPPETTVDNLGELFGSIGIIKCPSRPFFLMRQSIHNYCVADQIDKKKRPNEKKIY 90
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAK 377
+Y DK TG KGDA VTY + A + T F KI + + Q K
Sbjct: 91 VYTDKATGKPKGDATVTYDDPEAAKAAINWFNNTEFM-GSKIKVEIAQRK 139
>gi|312088442|ref|XP_003145864.1| hypothetical protein LOAG_10291 [Loa loa]
gi|307758974|gb|EFO18208.1| hypothetical protein LOAG_10291 [Loa loa]
Length = 252
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K GP+D + VC+N H G V V+F++ +DA KC++ + RWF G ++A
Sbjct: 100 KFGPIDEMNVCDNIGEHMIGNVYVKFENEEDADKCVKGLENRWFNGSPVYA 150
>gi|170596198|ref|XP_001902678.1| splicing factor U2AF35 [Brugia malayi]
gi|158589514|gb|EDP28475.1| splicing factor U2AF35, putative [Brugia malayi]
Length = 247
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K GP+D + VC+N H G V V+F++ +DA KC++ + RWF G ++A
Sbjct: 100 KFGPIDEMNVCDNIGEHMIGNVYVKFENEEDADKCVKGLENRWFNGSPVYA 150
>gi|426221982|ref|XP_004005184.1| PREDICTED: heterogeneous nuclear ribonucleoprotein R isoform 3
[Ovis aries]
Length = 598
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 53/237 (22%), Positives = 99/237 (41%), Gaps = 34/237 (14%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + + T V+V +P D+ +E+V +F K G I + ++
Sbjct: 144 QRKYGGPPPDSVYSGVQPGIGTEVFVGKIPRDLYEDELVPLFEKAGPIWD--------LR 195
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D +G +G A +T+ + + A +L D RP + + ++ A R
Sbjct: 196 LMMDPLSGQNRGYAFITFCGKEAAQEAVKLCDSYEIRPGKHLGVCISVAN-----NRLFV 250
Query: 388 KQVDSKKKK-----KLKKVEEKMLG---WGGRDDAKLTIPATVILRFMFTPAEMRADENL 439
+ K K + KV E ++ + DD K L + + +
Sbjct: 251 GSIPKNKTKENILEEFSKVTEGLVDVILYHQPDDKKKN-RGFCFLEYEDHKSAAQVKVLF 309
Query: 440 RSELEADVQEECV-----KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGR 491
L V EE + + G ++ VK +++ V F+DR A K ++ MNG+
Sbjct: 310 VRNLATTVTEEILEKSFSEFGKLERVKKLKDY----AFVHFEDRGAAVKAMDEMNGK 362
>gi|336468898|gb|EGO57061.1| hypothetical protein NEUTE1DRAFT_66048 [Neurospora tetrasperma FGSC
2508]
gi|350288804|gb|EGZ70029.1| RNA splicing factor Pad-1 [Neurospora tetrasperma FGSC 2509]
Length = 571
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE EN ELE DV+EE K G V + + N G + ++F +
Sbjct: 476 VVLHNMFDPAEEEG-ENWIKELEDDVREEAEAKYGHVVHISLDPNS-AGDIYLKFDKVQG 533
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I A+
Sbjct: 534 GENAIKGLNGRYFGGRMITAA 554
>gi|255550678|ref|XP_002516388.1| splicing factor, putative [Ricinus communis]
gi|223544486|gb|EEF46005.1| splicing factor, putative [Ricinus communis]
Length = 609
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 51/88 (57%), Gaps = 5/88 (5%)
Query: 416 LTIPA-TVILRFMFTPA-EMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLV 473
+ +P+ ++L+ MF P+ E D +L +++ DVQ EC K G + + V +N G V +
Sbjct: 510 IGVPSECLLLKNMFDPSIETEPDFDL--DIKEDVQLECSKFGNLQHIYVDKNS-AGFVYL 566
Query: 474 RFKDRKDAQKCIELMNGRWFGGRQIHAS 501
RF++ + A ++GRWF G+ I A+
Sbjct: 567 RFENTQSAISAQRALHGRWFAGKMITAT 594
>gi|85076094|ref|XP_955878.1| hypothetical protein NCU03491 [Neurospora crassa OR74A]
gi|28916904|gb|EAA26642.1| hypothetical protein NCU03491 [Neurospora crassa OR74A]
Length = 576
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE EN ELE DV+EE K G V + + N G + ++F +
Sbjct: 476 VVLHNMFDPAEEEG-ENWIKELEDDVREEAEAKYGHVVHISLDPNS-AGDIYLKFDKVQG 533
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I A+
Sbjct: 534 GENAIKGLNGRYFGGRMITAA 554
>gi|221487552|gb|EEE25784.1| splicing factor protein, putative [Toxoplasma gondii GT1]
Length = 633
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTPAE--MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
T V+L MF + ++ D + +L DV++EC K G V+ V + E + G V +R
Sbjct: 535 TGSCNVVLHNMFAAKDVNLKEDPHFFLDLGDDVRDECKKFGSVEKVWIDERNVDGNVWIR 594
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
F A+ +NGR+F G+ I A
Sbjct: 595 FAHPDQARAAFGALNGRYFAGKPISA 620
>gi|221111756|ref|XP_002159647.1| PREDICTED: RNA-binding protein 39-like [Hydra magnipapillata]
Length = 528
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/233 (23%), Positives = 96/233 (41%), Gaps = 36/233 (15%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T + +F G ++ +++ D ET KG V + + +
Sbjct: 282 LYVGSLHYNITEPMLRAIFEPFGTVES--------VQLQYDSETNRSKGFGFVNFREAGA 333
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGE-RFI--------AKQVDSKKKKKLKKV 401
A + ++G F G+ PM V G F+ +++++ + L +
Sbjct: 334 AKRAMEQMNG--FELAGR-PMKVNTVSERTDGSMSFLDDEETEKGGIEMNAQSRASLMQK 390
Query: 402 EEKMLGWGGRDDAKLTIPATV-------------ILRFMFTP-AEMRADENLRSELEADV 447
+ G G + IPA + IL +F P E +D L ++ DV
Sbjct: 391 LAQTHGSGLQVPTAPIIPAMLPTPMMNVAGSTCLILSNLFDPRKETESDWEL--DIRNDV 448
Query: 448 QEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
EE K+G V + + + +G V ++ AQK ++ NGRWF GR I A
Sbjct: 449 LEEVTKMGIVVHISIDKISAEGNVYIKTLIPDTAQKILQTFNGRWFAGRTIRA 501
>gi|237830355|ref|XP_002364475.1| splicing factor protein, putative [Toxoplasma gondii ME49]
gi|211962139|gb|EEA97334.1| splicing factor protein, putative [Toxoplasma gondii ME49]
gi|221507345|gb|EEE32949.1| splicing factor protein, putative [Toxoplasma gondii VEG]
Length = 633
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTPAE--MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
T V+L MF + ++ D + +L DV++EC K G V+ V + E + G V +R
Sbjct: 535 TGSCNVVLHNMFAAKDVNLKEDPHFFLDLGDDVRDECKKFGSVEKVWIDERNVDGNVWIR 594
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
F A+ +NGR+F G+ I A
Sbjct: 595 FAHPDQARAAFGALNGRYFAGKPISA 620
>gi|147900472|ref|NP_001086350.1| MGC81970 protein [Xenopus laevis]
gi|49522239|gb|AAH75146.1| MGC81970 protein [Xenopus laevis]
Length = 512
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 441 SELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
SE++ DV EEC K G + V +N PQG V V+ A + ++GRWF G+ I A
Sbjct: 426 SEIKEDVMEECNKHGGAIHIYVDKNSPQGNVYVKCSTITSAIAAVNALHGRWFAGKMITA 485
Query: 501 S 501
+
Sbjct: 486 A 486
>gi|357623461|gb|EHJ74600.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Danaus
plexippus]
Length = 350
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 89/222 (40%), Gaps = 32/222 (14%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ G ++ + D TG+ KG A Y+
Sbjct: 144 IFIGGLPNYLNEDQVKELLMSFGQLRA--------FNLVKDSSTGLSKGYAFAEYV---D 192
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
+++ Q + G G + V +A K + L+ + G G
Sbjct: 193 ISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTL---AMTGAAPVTLQVAGLTLAGAG- 248
Query: 411 RDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
PAT +L M TP E+R +E LE D++EEC K G V S+++
Sbjct: 249 --------PATEVLCLLNMVTPDELRDEEEYEDILE-DIKEECNKYGCVRSIEIPRPIEG 299
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E G V V F D QK + + GR F R + S D
Sbjct: 300 VEVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYFD 341
>gi|170071297|ref|XP_001869868.1| splicing factor [Culex quinquefasciatus]
gi|167867202|gb|EDS30585.1| splicing factor [Culex quinquefasciatus]
Length = 524
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 49/100 (49%), Gaps = 2/100 (2%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
+L MF PA + + E+E DV EEC K G V V V + P G V V+ A
Sbjct: 420 LLSNMFDPA-TETNPSWDVEIEDDVIEECNKHGGVLHVYVDKQSPAGNVYVKCPSIATAV 478
Query: 483 KCIELMNGRWFGGRQIHASEDDGLVNHAAIRDLDAEASRL 522
+ ++GRWF GR I A+ LVN+ + A AS L
Sbjct: 479 LAVNALHGRWFAGRVIAAAYVP-LVNYHTLFQESATASAL 517
>gi|170058744|ref|XP_001865056.1| splicing factor [Culex quinquefasciatus]
gi|167877732|gb|EDS41115.1| splicing factor [Culex quinquefasciatus]
Length = 546
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 102/272 (37%), Gaps = 63/272 (23%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV L ++T + + +F G I I++ +D +TG KG +T+
Sbjct: 291 LYVGSLHFNITEDMLRGIFEPFGKIDN--------IQLIMDSDTGRSKGYGFITFHNADD 342
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKM----L 406
A + L+G F G+ PM V + +++D L E L
Sbjct: 343 AKKALEQLNG--FELAGR-PMKVGN----------VTERLDVTTHASLDTDEMDRSGIDL 389
Query: 407 GWGGR--------DDAKLTIP----------------------------ATVILRFMFTP 430
G GR + A L +P +L MF P
Sbjct: 390 GATGRLQLMFKLAEGAGLAVPRAAADALLATAPQPAPNQPVQDSPAIATQCFLLSNMFDP 449
Query: 431 AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNG 490
A + + E+E DV EEC K G V V V + P G V V+ A + ++G
Sbjct: 450 A-TETNPSWDVEIEDDVIEECNKHGGVLHVYVDKQSPAGNVYVKCPSIATAVLAVNALHG 508
Query: 491 RWFGGRQIHASEDDGLVNHAAIRDLDAEASRL 522
RWF GR I A+ LVN+ + A AS L
Sbjct: 509 RWFAGRVIAAAYVP-LVNYHTLFQESATASAL 539
>gi|255082273|ref|XP_002508355.1| predicted protein [Micromonas sp. RCC299]
gi|226523631|gb|ACO69613.1| predicted protein [Micromonas sp. RCC299]
Length = 518
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 418 IPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
IP ++L+ MF PAE + ++ DV++EC K GPV + V + +G V ++F
Sbjct: 423 IPTQCLLLKNMFDPAE-ETEPEWWIDIGEDVKDECSKHGPVSHIHV-DKESRGFVYLKFG 480
Query: 477 DRKDAQKCIELMNGRWFGGRQIHA 500
+ A + ++GRWF G+ I A
Sbjct: 481 STEGASAARQALHGRWFAGKMIAA 504
>gi|412993528|emb|CCO14039.1| predicted protein [Bathycoccus prasinos]
Length = 641
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 418 IPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFK 476
IP V+L+ +F PAE DE ++E DV+ E K G V + +P G ++F
Sbjct: 545 IPTKCVLLKNLFDPAE-ETDEEWWLDIEEDVKGEVSKYGECVHAHVDKENPYGFCYLKFD 603
Query: 477 DRKDAQKCIELMNGRWFGGRQI 498
D + A+K +N RWF GR I
Sbjct: 604 DVEAAKKAQFGLNNRWFAGRSI 625
>gi|429329604|gb|AFZ81363.1| RNA recognition motif domain-containing protein [Babesia equi]
Length = 494
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 442 ELEADVQEECVKIGPVDSVKVCENHPQGV-VLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
EL +V+EEC K G V SV + + + + V F +DA+ ++ +N RWF GRQI
Sbjct: 420 ELPNEVKEECSKYGTVTSVYLHFSQNDSISIFVVFNTHEDAENAVKALNSRWFNGRQIEC 479
Query: 501 SEDDGLVNHAAIRDL 515
D + DL
Sbjct: 480 RSYDSSAYFSGNYDL 494
>gi|195433330|ref|XP_002064668.1| GK23713 [Drosophila willistoni]
gi|194160753|gb|EDW75654.1| GK23713 [Drosophila willistoni]
Length = 443
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCE---NHPQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 350 VVLLRNMVGPGDV--DE----ELEPEVKDECNTKYGEVNSVIIHEAFGTTPEDAVKIFVE 403
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 404 FKRIESAIKAVVDLNGRFFGGRQVRA 429
>gi|125532099|gb|EAY78664.1| hypothetical protein OsI_33765 [Oryza sativa Indica Group]
Length = 549
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + +++ DVQ+EC K G V+ + V +N G V + F A
Sbjct: 459 LLLKNMFDPA-VETDPDFDLDIKDDVQDECSKFGAVNHIFVDKNT-AGFVYLHFDSVAAA 516
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 517 TAAQRALHGRWFAGKMITAT 536
>gi|76157322|gb|AAX28279.2| SJCHGC07737 protein [Schistosoma japonicum]
Length = 276
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 267 DKQVEKKEANKPPDSWFEL--KVNTHVYVTGLPDDVTVEEMVEVFSKCGII 315
D + K++ P +W+E+ NTHVYV+GLP +T +E + SKCG+I
Sbjct: 224 DPNLRKRKQTTPLPAWYEIDESKNTHVYVSGLPPTITDDEFSALMSKCGVI 274
>gi|71028054|ref|XP_763670.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large subunit
[Theileria parva strain Muguga]
gi|68350624|gb|EAN31387.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
subunit, putative [Theileria parva]
Length = 380
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 101/236 (42%), Gaps = 31/236 (13%)
Query: 283 FELKVN---THVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
F L N T V+V +P DVT ++M E+ K G +K K P TG+ KG
Sbjct: 143 FNLGTNSDDTKVFVQNIPLDVTEDQMKELLEKHGKLKLANLLKDP--------ATGVSKG 194
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLK 399
+ S LA L+G+ GK +SV A F K +D K
Sbjct: 195 YGFFEFEDARSSKLAVLHLNGSVL---GKNVLSVKHAAFGYFASG--GKPIDCKASNLPN 249
Query: 400 KVEEK-----MLGWGGRDDAKL-TIPATVI-LRFMFTPAEMRADENLRSELEADVQEECV 452
+ + +LG ++ ++ + P+ VI L M ++ +D N +E+ V+EE
Sbjct: 250 SITQSILSNPLLGLQLQNGRRIGSNPSKVIQLLNMVFHEDLISDYNY-NEIVRLVKEEAQ 308
Query: 453 KIGPVDSVKVCE-----NHPQGV--VLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
K GP+ V + +GV V +R++D A+K + NGR F +I S
Sbjct: 309 KYGPLQEVVIPRPDKDLTFKEGVGKVFIRYEDLLSARKAQYMFNGRVFDKNRIVCS 364
>gi|390341852|ref|XP_792919.3| PREDICTED: splicing factor U2AF 50 kDa subunit-like
[Strongylocentrotus purpuratus]
Length = 386
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 90/223 (40%), Gaps = 29/223 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ S G +K + D T + KG A Y++
Sbjct: 175 IFIGGLPNYLNDDQVKELLSSFGPLKA--------FNLVKDSATSLSKGYAFCEYVETNL 226
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGER-FIAKQVDSKKKKKLKKVEEKMLGWG 409
L + D ++ Q GE+ I ++ K + + ++ +
Sbjct: 227 TDLGWETTDK-----------AIAGLNGMQLGEKKLIVQRASVGAKNAMNQGQQVQINIP 275
Query: 410 GRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHP 467
G T P T IL M TP E++ DE +E DV+EEC K G V S+++ P
Sbjct: 276 GLSLPGTTGPNTEILCLMNMVTPEELKDDEEYDDIVE-DVKEECQKYGQVRSLEIPRPIP 334
Query: 468 Q------GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
G + V F DAQ + GR F R + S D
Sbjct: 335 GLDVPGCGKIYVEFMTVMDAQAAQRALAGRKFANRTVVTSFYD 377
>gi|334184261|ref|NP_001189537.1| splicing factor, CC1-like protein [Arabidopsis thaliana]
gi|330251465|gb|AEC06559.1| splicing factor, CC1-like protein [Arabidopsis thaliana]
Length = 599
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 52/87 (59%), Gaps = 3/87 (3%)
Query: 416 LTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
+ +P+ ++L+ MF P+ +++ +++ DV+EEC K G ++ + V +N G V +R
Sbjct: 498 IGVPSECLLLKNMFDPS-TETEDDFDEDIKEDVKEECSKFGKLNHIFVDKNS-VGFVYLR 555
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHAS 501
F++ + A ++GRWF G+ I A+
Sbjct: 556 FENAQAAIGAQRALHGRWFAGKMITAT 582
>gi|302843264|ref|XP_002953174.1| hypothetical protein VOLCADRAFT_105799 [Volvox carteri f.
nagariensis]
gi|300261561|gb|EFJ45773.1| hypothetical protein VOLCADRAFT_105799 [Volvox carteri f.
nagariensis]
Length = 556
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 73/160 (45%), Gaps = 27/160 (16%)
Query: 351 VALATQLLD--GTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGW 408
VAL T+L G P P +P VT A +A V S L +++ +LG
Sbjct: 370 VALMTKLAGSAGLPAPPTSVLPAGVTNA---------VAHAVLSVDPVLL--MQQGVLG- 417
Query: 409 GGRDDAKLTIPATVILRFMFTPAEMRADEN--------LRSELEADVQEECVKIGPVDSV 460
+ + P ++L+ MF A D L +E+E DV+EEC + G + V
Sbjct: 418 ---PSSPIATP-CLLLKNMFDAAASAGDAAAQNAAASVLAAEVEVDVREECSRFGELLHV 473
Query: 461 KVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V + +G V +RF + A+ ++GRW+ GRQI A
Sbjct: 474 WV-DAKSKGFVYLRFATTQSAEAAHRALHGRWYSGRQILA 512
>gi|71996475|ref|NP_001022967.1| Protein UAF-1, isoform a [Caenorhabditis elegans]
gi|6226906|sp|P90978.2|U2AF2_CAEEL RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
Full=U2 snRNP auxiliary factor large subunit
gi|3334906|gb|AAC26982.1| splicing factor U2AF65 [Caenorhabditis elegans]
gi|351018334|emb|CCD62278.1| Protein UAF-1, isoform a [Caenorhabditis elegans]
Length = 496
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 95/235 (40%), Gaps = 37/235 (15%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
P S + +++ GLP+ +T +++ E+ G +K + VD + G K
Sbjct: 281 PVSTIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKA--------FSLNVDSQ-GNSK 331
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL 398
G A YL A L+G Q G++ + Q+ +++
Sbjct: 332 GYAFAEYLDPTLTDQAIAGLNGM------------------QLGDKQLVVQLACANQQRH 373
Query: 399 K-KVEEKMLGWGGRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIG 455
+ G D ++ AT IL M T E++AD+ LE DV++EC K G
Sbjct: 374 NTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELKADDEYEEILE-DVRDECSKYG 432
Query: 456 PVDSVKV---CENHPQ---GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
V S+++ E+HP G V V F D Q+ + GR F R + S D
Sbjct: 433 IVRSLEIPRPYEDHPVPGVGKVFVEFASTSDCQRAQAALTGRKFANRTVVTSYYD 487
>gi|71996485|ref|NP_001022969.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
gi|351018336|emb|CCD62280.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
Length = 474
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 95/235 (40%), Gaps = 37/235 (15%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
P S + +++ GLP+ +T +++ E+ G +K + VD + G K
Sbjct: 259 PVSTIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKA--------FSLNVDSQ-GNSK 309
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL 398
G A YL A L+G Q G++ + Q+ +++
Sbjct: 310 GYAFAEYLDPTLTDQAIAGLNGM------------------QLGDKQLVVQLACANQQRH 351
Query: 399 K-KVEEKMLGWGGRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIG 455
+ G D ++ AT IL M T E++AD+ LE DV++EC K G
Sbjct: 352 NTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELKADDEYEEILE-DVRDECSKYG 410
Query: 456 PVDSVKV---CENHPQ---GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
V S+++ E+HP G V V F D Q+ + GR F R + S D
Sbjct: 411 IVRSLEIPRPYEDHPVPGVGKVFVEFASTSDCQRAQAALTGRKFANRTVVTSYYD 465
>gi|406866575|gb|EKD19615.1| RNA splicing factor [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 587
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 60/123 (48%), Gaps = 19/123 (15%)
Query: 396 KKLKKVEEKMLGWGGRDDAKLTI-PAT---------------VILRFMFTPAEMRADENL 439
+KL + +E GRD+ + + P T V+L+ MF PAE D +
Sbjct: 450 RKLARTDEPAPATKGRDERREVVKPKTEAKALPVNVNMASRCVVLKNMFDPAEEEGD-SW 508
Query: 440 RSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQI 498
ELE DV+ E K G V + + N QG + ++F + + I+ +NGR+FGGR I
Sbjct: 509 EKELEDDVRAEAEEKYGHVVHISLDPNS-QGDIYLKFDRVQGGENAIKGLNGRYFGGRMI 567
Query: 499 HAS 501
A+
Sbjct: 568 SAT 570
>gi|341882558|gb|EGT38493.1| hypothetical protein CAEBREN_09163 [Caenorhabditis brenneri]
Length = 757
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 9/94 (9%)
Query: 420 ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVV--LVRFKD 477
+ ++LR M TP +++ LE +++EEC K G V V + GVV V++ D
Sbjct: 666 SVIVLRNMVTP------DDIDEYLEGEIREECGKFGTVLDVVIANFASSGVVKIFVKYAD 719
Query: 478 RKDAQKCIELMNGRWFGGRQIHA-SEDDGLVNHA 510
+ ++GR+FGG + A + D L +HA
Sbjct: 720 SMQVDRAKAALDGRFFGGNSVKAEAYDQILFDHA 753
>gi|91083973|ref|XP_975149.1| PREDICTED: similar to splicing factor 45 [Tribolium castaneum]
gi|270006716|gb|EFA03164.1| hypothetical protein TcasGA2_TC013083 [Tribolium castaneum]
Length = 387
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 12/87 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKV----CENHPQGV-VLVR 474
V+L+ M P E+ D LE +V++EC K G V SV + EN + V + V
Sbjct: 293 VVLLKNMVGPGEVDDD------LEPEVKDECNTKYGEVTSVIIHETQAENPEEAVRIFVE 346
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHAS 501
F+ + A K + +NGR+FGGRQ+ AS
Sbjct: 347 FRRIESAIKAVVDLNGRFFGGRQVKAS 373
>gi|310798506|gb|EFQ33399.1| CC1-like family splicing factor [Glomerella graminicola M1.001]
Length = 565
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF P E D+ ++ ELE DV++E K G V + V N +G + ++F +
Sbjct: 470 VVLHNMFDPEEEEGDDWVK-ELEDDVRQEAETKYGRVVHISVDPNS-KGDIYLKFDKVQG 527
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I +NGR+FGGR I AS
Sbjct: 528 GENAIRGLNGRYFGGRMIDAS 548
>gi|198473430|ref|XP_002133257.1| GA29081 [Drosophila pseudoobscura pseudoobscura]
gi|198139453|gb|EDY70659.1| GA29081 [Drosophila pseudoobscura pseudoobscura]
Length = 418
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCE---NHPQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 325 VVLLRNMVGPGDV--DE----ELEPEVKDECNTKYGEVNSVIIHEAFGTSPEDAVKIFVE 378
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 379 FKRIESAIKAVVDLNGRFFGGRQVRA 404
>gi|195164546|ref|XP_002023107.1| GL21138 [Drosophila persimilis]
gi|194105192|gb|EDW27235.1| GL21138 [Drosophila persimilis]
Length = 418
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCE---NHPQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 325 VVLLRNMVGPGDV--DE----ELEPEVKDECNTKYGEVNSVIIHEAFGTSPEDAVKIFVE 378
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 379 FKRIESAIKAVVDLNGRFFGGRQVRA 404
>gi|386764548|ref|NP_001245708.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
[Drosophila melanogaster]
gi|383293438|gb|AFH07421.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
[Drosophila melanogaster]
Length = 427
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 90/222 (40%), Gaps = 31/222 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ G ++ + D TG+ KG A Y+
Sbjct: 220 IFIGGLPNYLNDDQVKELLLSFGKLRA--------FNLVKDAATGLSKGYAFCEYV---D 268
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
+++ Q + G G + V +A K + A S ML G
Sbjct: 269 LSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSV-----------MLQVPG 317
Query: 411 RDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
+ + P T +L M TP E+R +E LE D++EEC K G V SV++
Sbjct: 318 LSNVVTSGPPTEVLCLLNMVTPDELRDEEEYEDILE-DIKEECTKYGVVRSVEIPRPIEG 376
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E G V V F D QK + + GR F R + S D
Sbjct: 377 VEVPGCGKVFVEFNSVLDCQKAQQALTGRKFSDRVVVTSYFD 418
>gi|301775589|ref|XP_002923215.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 5
[Ailuropoda melanoleuca]
Length = 626
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 70/150 (46%), Gaps = 16/150 (10%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 141 QRKYGGPPPDSVYSGQQPSVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 192
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D TG+ +G A VT+ + + A +L + R I + ++ A F+
Sbjct: 193 LMMDPLTGLNRGYAFVTFCTKEAAQEAVKLYNNHEIRSGKHIGVCISVA----NNRLFVG 248
Query: 388 KQVDSKKKKKLKKVEEKMLG-WGGRDDAKL 416
SK K+++ + K+ G W G D L
Sbjct: 249 SIPKSKTKEQILEEFSKVTGLWEGLTDVIL 278
>gi|393238548|gb|EJD46084.1| hypothetical protein AURDEDRAFT_184426 [Auricularia delicata
TFB-10046 SS5]
Length = 626
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 8/95 (8%)
Query: 409 GGRDDAKLTIPA-TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSV--KVCEN 465
G D A+ P+ V+L M P ++ DE L D+ EEC K G V+ V +C +
Sbjct: 519 GKEDIARFGKPSRIVVLTEMCDPDDVEDDE-----LREDIGEECAKFGAVERVLPHLCAD 573
Query: 466 HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V V+F A KC+ ++GR+FGG+ + A
Sbjct: 574 GQSVRVFVQFAGEPAAWKCVRDLDGRFFGGKTVRA 608
>gi|71996490|ref|NP_001022970.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
gi|351018337|emb|CCD62281.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
Length = 471
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 95/235 (40%), Gaps = 37/235 (15%)
Query: 279 PDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKK 338
P S + +++ GLP+ +T +++ E+ G +K + VD + G K
Sbjct: 256 PVSTIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKA--------FSLNVDSQ-GNSK 306
Query: 339 GDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKL 398
G A YL A L+G Q G++ + Q+ +++
Sbjct: 307 GYAFAEYLDPTLTDQAIAGLNGM------------------QLGDKQLVVQLACANQQRH 348
Query: 399 K-KVEEKMLGWGGRDDAKLTIPATVILRFM--FTPAEMRADENLRSELEADVQEECVKIG 455
+ G D ++ AT IL M T E++AD+ LE DV++EC K G
Sbjct: 349 NTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELKADDEYEEILE-DVRDECSKYG 407
Query: 456 PVDSVKV---CENHPQ---GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
V S+++ E+HP G V V F D Q+ + GR F R + S D
Sbjct: 408 IVRSLEIPRPYEDHPVPGVGKVFVEFASTSDCQRAQAALTGRKFANRTVVTSYYD 462
>gi|257196183|ref|NP_082640.2| poly(U)-binding-splicing factor PUF60 isoform a [Mus musculus]
gi|158563798|sp|Q3UEB3.2|PUF60_MOUSE RecName: Full=Poly(U)-binding-splicing factor PUF60; AltName:
Full=60 kDa poly(U)-binding-splicing factor
gi|158563997|sp|Q9WV25.2|PUF60_RAT RecName: Full=Poly(U)-binding-splicing factor PUF60; AltName:
Full=60 kDa poly(U)-binding-splicing factor; AltName:
Full=RNA-binding protein Siah-BP; AltName:
Full=Siah-binding protein 1
gi|148697573|gb|EDL29520.1| RIKEN cDNA 2410104I19, isoform CRA_a [Mus musculus]
gi|149066142|gb|EDM16015.1| siah binding protein 1; FBP interacting repressor; pyrimidine tract
binding splicing factor; Ro ribonucleoprotein-binding
protein 1, isoform CRA_a [Rattus norvegicus]
Length = 564
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 468 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 520
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 521 FVEFSMASETHKAIQALNGRWFGGRKVVA 549
>gi|195116165|ref|XP_002002626.1| GI11727 [Drosophila mojavensis]
gi|193913201|gb|EDW12068.1| GI11727 [Drosophila mojavensis]
Length = 438
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENH---PQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 345 VVLLRNMVGPGDV--DE----ELEPEVKDECNTKYGEVNSVIIHEAFGTVPEDAVKIFVE 398
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 399 FKRIESAIKAVVDLNGRFFGGRQVRA 424
>gi|195051308|ref|XP_001993070.1| GH13625 [Drosophila grimshawi]
gi|193900129|gb|EDV98995.1| GH13625 [Drosophila grimshawi]
Length = 451
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENH---PQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 358 VVLLRNMVGPGDV--DE----ELEPEVKDECNTKYGEVNSVIIHEAFGTVPEDAVKIFVE 411
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 412 FKRIESAIKAVVDLNGRFFGGRQVRA 437
>gi|194767538|ref|XP_001965872.1| GF15900 [Drosophila ananassae]
gi|190619348|gb|EDV34872.1| GF15900 [Drosophila ananassae]
Length = 401
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCE---NHPQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 308 VVLLRNMVGPGDV--DE----ELEPEVKDECHTKYGEVNSVIIHEAFGTAPEDAVKIFVE 361
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 362 FKRIESAIKAVVDLNGRFFGGRQVRA 387
>gi|76677895|ref|NP_598452.2| poly(U)-binding-splicing factor PUF60 isoform b [Mus musculus]
gi|74195824|dbj|BAE30474.1| unnamed protein product [Mus musculus]
gi|74221976|dbj|BAE40622.1| unnamed protein product [Mus musculus]
gi|148697574|gb|EDL29521.1| RIKEN cDNA 2410104I19, isoform CRA_b [Mus musculus]
gi|149066143|gb|EDM16016.1| siah binding protein 1; FBP interacting repressor; pyrimidine tract
binding splicing factor; Ro ribonucleoprotein-binding
protein 1, isoform CRA_b [Rattus norvegicus]
Length = 547
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 451 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 503
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 504 FVEFSMASETHKAIQALNGRWFGGRKVVA 532
>gi|148697575|gb|EDL29522.1| RIKEN cDNA 2410104I19, isoform CRA_c [Mus musculus]
Length = 546
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 450 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 502
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 503 FVEFSMASETHKAIQALNGRWFGGRKVVA 531
>gi|74203983|dbj|BAE28998.1| unnamed protein product [Mus musculus]
Length = 564
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 468 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 520
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 521 FVEFSMASETHKAIQALNGRWFGGRKVVA 549
>gi|5524727|gb|AAD44358.1|AF165892_1 RNA-binding protein SiahBP [Rattus norvegicus]
Length = 565
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 469 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 521
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 522 FVEFSMASETHKAIQALNGRWFGGRKVVA 550
>gi|224080644|ref|XP_002306193.1| predicted protein [Populus trichocarpa]
gi|222849157|gb|EEE86704.1| predicted protein [Populus trichocarpa]
Length = 348
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 4/81 (4%)
Query: 422 VILRFMFTP-AEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKD 480
+ L+ MF P E D +L +++ DVQEEC + G V + V +N G V +RF++ +
Sbjct: 256 LFLKNMFDPKTETEPDFDL--DIKEDVQEECSRFGNVKHIYVDKNS-AGFVYMRFENMQA 312
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
A ++GRWF G+ I A+
Sbjct: 313 AINAQHALHGRWFAGKLITAT 333
>gi|300798352|ref|NP_001178809.1| poly(U)-binding-splicing factor PUF60 [Rattus norvegicus]
Length = 563
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 467 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 519
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 520 FVEFSMASETHKAIQALNGRWFGGRKVVA 548
>gi|357470349|ref|XP_003605459.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355506514|gb|AES87656.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 634
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 97/250 (38%), Gaps = 40/250 (16%)
Query: 265 QPDKQVEKKEANKPPDSWFELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKP 324
QP+ + P S L+ ++V GLP T ++ E+ G ++
Sbjct: 396 QPNPNLNLGAVGLTPGSAGGLEGPDRIFVGGLPYYFTETQIRELLETFGPLRG------- 448
Query: 325 RIKIYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSV-------TQAK 377
+ D+ETG KG A Y +A L+G G ++V TQ K
Sbjct: 449 -FDLVKDRETGNSKGYAFCVYADLAVTDIACAALNGIKM---GDKTLTVRRANQGTTQPK 504
Query: 378 FEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGGRDDAKLTIPATVILRFMFTPAEMRADE 437
EQ+ A+Q + +K + L V L P E++ DE
Sbjct: 505 PEQESILMHAQQQIALQKLIFQPA--------------LVATKVVCLTNAVAPEELKEDE 550
Query: 438 NLRSELEADVQEECVKIGPVDSVKVCENHPQG-------VVLVRFKDRKDAQKCIELMNG 490
+ E+ D+++EC K G + +V + P G V + + D + A K +NG
Sbjct: 551 DFE-EIIDDMRQECSKFGSLVNVVIPRPQPDGDLSGGVGKVFLEYVDIEGATKARTGLNG 609
Query: 491 RWFGGRQIHA 500
R FGG ++ A
Sbjct: 610 RKFGGNEVIA 619
>gi|148697576|gb|EDL29523.1| RIKEN cDNA 2410104I19, isoform CRA_d [Mus musculus]
Length = 553
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 457 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 509
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 510 FVEFSMASETHKAIQALNGRWFGGRKVVA 538
>gi|195385190|ref|XP_002051289.1| GJ13092 [Drosophila virilis]
gi|194147746|gb|EDW63444.1| GJ13092 [Drosophila virilis]
Length = 432
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEEC-VKIGPVDSVKVCENH---PQGVV--LVR 474
V+LR M P ++ DE ELE +V++EC K G V+SV + E P+ V V
Sbjct: 339 VVLLRNMVGPGDV--DE----ELEPEVKDECNTKYGEVNSVIIHEAFGTVPEDAVKIFVE 392
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
FK + A K + +NGR+FGGRQ+ A
Sbjct: 393 FKRIESAIKAVVDLNGRFFGGRQVRA 418
>gi|118785284|ref|XP_314526.3| AGAP010553-PA [Anopheles gambiae str. PEST]
gi|116128011|gb|EAA09944.4| AGAP010553-PA [Anopheles gambiae str. PEST]
Length = 643
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 50/103 (48%), Gaps = 20/103 (19%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVC-ENHPQG---------- 469
VILR M P ++ DE L+ E +Q+EC K G VD V + E +G
Sbjct: 542 VVILRNMVGPEDV--DETLQEE----IQDECSKYGAVDRVIIYKERQSEGNFAEDDNTDM 595
Query: 470 --VVLVRFKDRKDAQKCIELMNGRWFGGRQIHA-SEDDGLVNH 509
+ V F +A + E +NGR+FGGR + A S D L +H
Sbjct: 596 IVKIFVEFSQATEADRARESLNGRYFGGRLVKAESYDQALFDH 638
>gi|226470094|emb|CAX70328.1| Splicing factor U2AF 35 kDa subunit [Schistosoma japonicum]
gi|226486850|emb|CAX74502.1| Splicing factor U2AF 35 kDa subunit [Schistosoma japonicum]
gi|226486852|emb|CAX74503.1| Splicing factor U2AF 35 kDa subunit [Schistosoma japonicum]
gi|226489605|emb|CAX74953.1| Splicing factor U2AF 35 kDa subunit [Schistosoma japonicum]
gi|226489607|emb|CAX74954.1| Splicing factor U2AF 35 kDa subunit [Schistosoma japonicum]
Length = 221
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G ++ + VC+N H G V ++F+ +DA+K ++ +N RWFGGR +HA
Sbjct: 92 KYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVQDLNERWFGGRPVHA 142
>gi|256088283|ref|XP_002580274.1| U2 snRNP auxiliary factor small subunit [Schistosoma mansoni]
gi|108861853|gb|ABG21828.1| U2 snRNP auxiliary factor small subunit-like protein [Schistosoma
mansoni]
gi|360043806|emb|CCD81352.1| putative u2 snRNP auxiliary factor, small subunit [Schistosoma
mansoni]
Length = 221
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G ++ + VC+N H G V ++F+ +DA+K ++ +N RWFGGR +HA
Sbjct: 92 KYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVQDLNERWFGGRPVHA 142
>gi|20466772|gb|AAM20703.1| putative splicing factor [Arabidopsis thaliana]
Length = 420
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 48/80 (60%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF P+ +++ +++ DV+EEC K G ++ + V +N G V +RF++ + A
Sbjct: 326 LLLKNMFDPST-ETEDDFDEDIKEDVKEECSKFGKLNHIFVDKNS-VGFVYLRFENAQAA 383
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 384 IGAQRALHGRWFAGKMITAT 403
>gi|28059803|gb|AAO30095.1| splicing factor-like protein [Arabidopsis thaliana]
Length = 527
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + N E+ DV +EC K GPV+ + V +N G V +RF+ + A
Sbjct: 437 LLLKNMFDPA-TETEPNFDLEIRDDVADECSKYGPVNHIYVDKNS-AGFVYLRFQSVEAA 494
Query: 482 QKCIELMNGRWFGGRQIHAS 501
M+ RWF + I A+
Sbjct: 495 AAAQRAMHMRWFAQKMISAT 514
>gi|17136764|ref|NP_476891.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
[Drosophila melanogaster]
gi|386764552|ref|NP_001245710.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
[Drosophila melanogaster]
gi|195351420|ref|XP_002042232.1| GM13406 [Drosophila sechellia]
gi|195479195|ref|XP_002100800.1| GE15975 [Drosophila yakuba]
gi|195555160|ref|XP_002077042.1| GD24494 [Drosophila simulans]
gi|4033485|sp|Q24562.1|U2AF2_DROME RecName: Full=Splicing factor U2AF 50 kDa subunit; AltName: Full=U2
auxiliary factor 50 kDa subunit; AltName: Full=U2 snRNP
auxiliary factor large subunit
gi|349761|gb|AAA03548.1| RNA binding protein [Drosophila melanogaster]
gi|7293214|gb|AAF48596.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
[Drosophila melanogaster]
gi|17861976|gb|AAL39465.1| LD03714p [Drosophila melanogaster]
gi|194124075|gb|EDW46118.1| GM13406 [Drosophila sechellia]
gi|194188324|gb|EDX01908.1| GE15975 [Drosophila yakuba]
gi|194203060|gb|EDX16636.1| GD24494 [Drosophila simulans]
gi|220943258|gb|ACL84172.1| U2af50-PA [synthetic construct]
gi|220953438|gb|ACL89262.1| U2af50-PA [synthetic construct]
gi|383293440|gb|AFH07423.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
[Drosophila melanogaster]
Length = 416
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 90/222 (40%), Gaps = 31/222 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ G ++ + D TG+ KG A Y+
Sbjct: 209 IFIGGLPNYLNDDQVKELLLSFGKLRA--------FNLVKDAATGLSKGYAFCEYV---D 257
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
+++ Q + G G + V +A K + A S ML G
Sbjct: 258 LSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSV-----------MLQVPG 306
Query: 411 RDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
+ + P T +L M TP E+R +E LE D++EEC K G V SV++
Sbjct: 307 LSNVVTSGPPTEVLCLLNMVTPDELRDEEEYEDILE-DIKEECTKYGVVRSVEIPRPIEG 365
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E G V V F D QK + + GR F R + S D
Sbjct: 366 VEVPGCGKVFVEFNSVLDCQKAQQALTGRKFSDRVVVTSYFD 407
>gi|313230331|emb|CBY08035.1| unnamed protein product [Oikopleura dioica]
Length = 286
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V++ GLP+DVT E ++ F + G IK + R+ ++++++T K G A VTY K
Sbjct: 99 VFIEGLPNDVTKEALLSRFKQAGNIKVTKGVE--RVFLFLERKTKRKTGCATVTYFKSED 156
Query: 351 VALATQLLDGTPF 363
+ A +L D + F
Sbjct: 157 ASCAIELFDRSEF 169
>gi|195421794|ref|XP_002060891.1| GK20208 [Drosophila willistoni]
gi|194156976|gb|EDW71877.1| GK20208 [Drosophila willistoni]
Length = 327
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
Query: 283 FELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDAL 342
FEL+ T ++V G+ VT +++ + G+I KPRI +Y +K+ G KG+A
Sbjct: 168 FELEPET-IFVLGMRLSVTKNDILMFCTSVGMIDH---ASKPRIFVYKNKQNGRSKGEAT 223
Query: 343 VTYLKEPSVALATQLLDGTPF 363
+TY+ +A + L+G+ F
Sbjct: 224 ITYISPFMAEMAIRCLNGSKF 244
>gi|194893848|ref|XP_001977952.1| GG19328 [Drosophila erecta]
gi|190649601|gb|EDV46879.1| GG19328 [Drosophila erecta]
Length = 416
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 90/222 (40%), Gaps = 31/222 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ G ++ + D TG+ KG A Y+
Sbjct: 209 IFIGGLPNYLNDDQVKELLLSFGKLRA--------FNLVKDAATGLSKGYAFCEYV---D 257
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
+++ Q + G G + V +A K + A S ML G
Sbjct: 258 LSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSV-----------MLQVPG 306
Query: 411 RDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
+ + P T +L M TP E+R +E LE D++EEC K G V SV++
Sbjct: 307 LSNVVTSGPPTEVLCLLNMVTPDELRDEEEYEDILE-DIKEECTKYGVVRSVEIPRPIEG 365
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E G V V F D QK + + GR F R + S D
Sbjct: 366 VEVPGCGKVFVEFNSVLDCQKAQQALTGRKFSDRVVVTSYFD 407
>gi|194758325|ref|XP_001961412.1| GF14957 [Drosophila ananassae]
gi|190615109|gb|EDV30633.1| GF14957 [Drosophila ananassae]
Length = 594
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 62/252 (24%), Positives = 95/252 (37%), Gaps = 46/252 (18%)
Query: 283 FELKVNT---HVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKG 339
F+ K +T +YV L ++T + + +F G I I++ +D ETG KG
Sbjct: 326 FQPKSHTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDA--------IQLIMDTETGRSKG 377
Query: 340 DALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQA--KFEQKGERFIAKQVDSKKKKK 397
+TY A + L+G F G++ M V + + ++D +
Sbjct: 378 YGFITYHNADDAKKALEQLNG--FELAGRL-MKVGNVTERLDMNTSSLDTDEMD-RTGID 433
Query: 398 LKKVEEKMLGWGGRDDAKLTIP--------------------------AT--VILRFMFT 429
L L + + A L +P AT IL MF
Sbjct: 434 LGATGRLQLMFKLAEGAGLAVPQAAANALLATAPQPAPVQQQQQAPSIATQCFILSNMFD 493
Query: 430 PAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMN 489
P + +E+ DV EEC K G V + V P G V V+ A + ++
Sbjct: 494 P-RTETNPTWDAEIRDDVLEECAKHGGVLHIHVDTVSPTGTVYVKCPSTTTAVLAVNALH 552
Query: 490 GRWFGGRQIHAS 501
GRWF GR I A+
Sbjct: 553 GRWFAGRVITAA 564
>gi|18416114|ref|NP_568220.1| RNA-binding protein 39 [Arabidopsis thaliana]
gi|15451046|gb|AAK96794.1| splicing factor-like protein [Arabidopsis thaliana]
gi|332004077|gb|AED91460.1| RNA-binding protein 39 [Arabidopsis thaliana]
Length = 527
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + N E+ DV +EC K GPV+ + V +N G V +RF+ + A
Sbjct: 437 LLLKNMFDPA-TETEPNFDLEIRDDVADECSKYGPVNHIYVDKNS-AGFVYLRFQSVEAA 494
Query: 482 QKCIELMNGRWFGGRQIHAS 501
M+ RWF + I A+
Sbjct: 495 AAAQRAMHMRWFAQKMISAT 514
>gi|76156406|gb|AAX27612.2| SJCHGC07381 protein [Schistosoma japonicum]
Length = 219
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G ++ + VC+N H G V ++F+ +DA+K ++ +N RWFGGR +HA
Sbjct: 90 KYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVQDLNERWFGGRPVHA 140
>gi|357618954|gb|EHJ71738.1| putative U2 snrnp auxiliary factor, small subunit [Danaus
plexippus]
Length = 227
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 446 DVQEECV-KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
DV EC K G ++ + VC+N H G V ++F+ +DA+K + +N RWFGGR ++A
Sbjct: 87 DVFVECEDKYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVNDLNNRWFGGRPVYA 145
>gi|156099298|ref|XP_001615651.1| splicing factor [Plasmodium vivax Sal-1]
gi|148804525|gb|EDL45924.1| splicing factor, putative [Plasmodium vivax]
Length = 1016
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
Query: 417 TIPATVILRFMFTPAE--MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVR 474
I ++L MF+P + + +D + +++ DV+EEC K G + + + + G + ++
Sbjct: 918 NITPNLVLCNMFSPNDESIGSDPDFFTDIIEDVKEECSKYGSIAKIWLDSKNIDGKIYIK 977
Query: 475 FKDRKDAQKCIELMNGRWFGGRQIHA 500
+ ++ K + +NGR+FGG I A
Sbjct: 978 YATPDESLKAFQFLNGRYFGGSLISA 1003
>gi|34100331|gb|AAQ57271.1| RME8 [Homo sapiens]
Length = 427
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 34/65 (52%)
Query: 12 LSGATNYETAGEEGWYILDENQQHVGPYAISELCEHFLNGYLLETTLVWSQGRSEWQPLS 71
+ A + + E+ WY + +++ GPY E+ E + G L T W+QG W+PL
Sbjct: 291 IEAAPDMKRESEKEWYFGNADKERSGPYGFHEMQELWTKGMLNAKTRCWAQGMDGWRPLQ 350
Query: 72 SIPQF 76
SIPQ
Sbjct: 351 SIPQL 355
>gi|257196186|ref|NP_001158072.1| poly(U)-binding-splicing factor PUF60 isoform c [Mus musculus]
gi|14714891|gb|AAH10601.1| Puf60 protein [Mus musculus]
Length = 499
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 16/89 (17%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQG---------VV 471
++LR M P ++ D LE +V EEC K G V+ V + + QG +
Sbjct: 403 VMVLRNMVDPKDIDDD------LEGEVTEECGKFGAVNRVIIYQEK-QGEEEDAEIIVKI 455
Query: 472 LVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
V F + K I+ +NGRWFGGR++ A
Sbjct: 456 FVEFSMASETHKAIQALNGRWFGGRKVVA 484
>gi|227206234|dbj|BAH57172.1| AT5G09880 [Arabidopsis thaliana]
Length = 505
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + N E+ DV +EC K GPV+ + V +N G V +RF+ + A
Sbjct: 415 LLLKNMFDPA-TETEPNFDLEIRDDVADECSKYGPVNHIYVDKNS-AGFVYLRFQSVEAA 472
Query: 482 QKCIELMNGRWFGGRQIHAS 501
M+ RWF + I A+
Sbjct: 473 AAAQRAMHMRWFAQKMISAT 492
>gi|242014352|ref|XP_002427855.1| Splicing factor U2Af 38 kDa subunit, putative [Pediculus humanus
corporis]
gi|212512324|gb|EEB15117.1| Splicing factor U2Af 38 kDa subunit, putative [Pediculus humanus
corporis]
Length = 253
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 446 DVQEECV-KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
DV EC K G ++ + VC+N H G V ++F+ +DA+K + +N RWFGGR ++A
Sbjct: 86 DVFVECEDKYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVNDLNNRWFGGRPVYA 144
>gi|343425477|emb|CBQ69012.1| conserved hypothetcial protein [Sporisorium reilianum SRZ2]
Length = 637
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
Query: 433 MRADENLRSELEADVQEECVKIGPVDSVKV--CENHPQGVVLVRFKDRKDAQKCIELMNG 490
M D + + +L DV +EC K G V V V CE + V V+F + K + ++G
Sbjct: 544 MPPDAHAQHDLIDDVAQECAKFGIVQRVFVHRCEGEGEVRVFVQFTGEAGSWKAVRNLHG 603
Query: 491 RWFGGRQIHA 500
RWF GRQ+ A
Sbjct: 604 RWFEGRQLQA 613
>gi|340522099|gb|EGR52332.1| predicted protein [Trichoderma reesei QM6a]
Length = 569
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRFKDRKD 480
V+L MF PAE D+ ++ ELE +V++E K G V + V N +G + ++F +
Sbjct: 474 VVLHNMFDPAEEEGDDWVK-ELEDEVRQEAEEKYGHVVHISVDPNS-KGDIYLKFDKVQG 531
Query: 481 AQKCIELMNGRWFGGRQIHAS 501
+ I+ +NGR+FGGR I AS
Sbjct: 532 GENAIKGLNGRYFGGRMIDAS 552
>gi|195042782|ref|XP_001991497.1| GH12033 [Drosophila grimshawi]
gi|195134983|ref|XP_002011915.1| GI14308 [Drosophila mojavensis]
gi|193901255|gb|EDW00122.1| GH12033 [Drosophila grimshawi]
gi|193909169|gb|EDW08036.1| GI14308 [Drosophila mojavensis]
Length = 416
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 89/222 (40%), Gaps = 31/222 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + E++ E+ G ++ + D TG+ KG A Y+
Sbjct: 209 IFIGGLPNYLNDEQVKELLLSFGKLRA--------FNLVKDAATGLSKGYAFCEYV---D 257
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
+++ Q + G G + V +A K + A S ML G
Sbjct: 258 LSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTSQSV-----------MLQVPG 306
Query: 411 RDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
+ P T +L M TP E+R +E LE D++EEC K G V SV++
Sbjct: 307 LSTVVTSGPPTEVLCLLNMVTPDELRDEEEYEDILE-DIKEECTKYGVVRSVEIPRPIEG 365
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E G V V F D QK + + GR F R + S D
Sbjct: 366 VEVPGCGKVFVEFNSVLDCQKAQQALTGRKFSDRVVVTSYFD 407
>gi|15988329|pdb|1JMT|A Chain A, X-Ray Structure Of A Core U2af65U2AF35 HETERODIMER
Length = 104
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G V+ + VC+N H G V V+F+ +DA+K + +N RWF G+ IHA
Sbjct: 50 KYGEVEEMNVCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHA 100
>gi|386764550|ref|NP_001245709.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
[Drosophila melanogaster]
gi|383293439|gb|AFH07422.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
[Drosophila melanogaster]
Length = 360
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 90/222 (40%), Gaps = 31/222 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ G ++ + D TG+ KG A Y+
Sbjct: 153 IFIGGLPNYLNDDQVKELLLSFGKLRA--------FNLVKDAATGLSKGYAFCEYV---D 201
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWGG 410
+++ Q + G G + V +A K + A S ML G
Sbjct: 202 LSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSV-----------MLQVPG 250
Query: 411 RDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV------ 462
+ + P T +L M TP E+R +E LE D++EEC K G V SV++
Sbjct: 251 LSNVVTSGPPTEVLCLLNMVTPDELRDEEEYEDILE-DIKEECTKYGVVRSVEIPRPIEG 309
Query: 463 CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
E G V V F D QK + + GR F R + S D
Sbjct: 310 VEVPGCGKVFVEFNSVLDCQKAQQALTGRKFSDRVVVTSYFD 351
>gi|321259451|ref|XP_003194446.1| hypothetical protein CGB_E5340C [Cryptococcus gattii WM276]
gi|317460917|gb|ADV22659.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 615
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 110/241 (45%), Gaps = 34/241 (14%)
Query: 279 PDSWFELKVNTH-VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMK 337
PD+ + + H ++V+ L +T +++ +VF G I+ + +++D +G++
Sbjct: 374 PDAHKDAAIPYHRLFVSNLAFSLTADDVRQVFEPFGEIEF--------VDLHMDP-SGLR 424
Query: 338 KGDALVTYLKEPSVALATQLLDGTPFRPDGK-IPMSVTQAKFEQKGERFI------AKQV 390
KG A V + S +A + G F G+ I + Q + + I ++
Sbjct: 425 KGTAYVQFKDVKSAQMALDAMAG--FDLAGRLIKVQTIQERGTYQTPDLIEDSGNYGTRL 482
Query: 391 DSKKKK----KLKKVEEKM---LGWGGRDDAKLTIPAT-----VILRFMFTPAEMRADEN 438
D+ +++ KL + E + L ++ +PA +++ MF P E + N
Sbjct: 483 DANQRQQLMFKLARTEPSVNLSLSAPKISGSQSKVPAMDPTPRIVVHNMFNPEE-ETERN 541
Query: 439 LRSELEADVQEEC-VKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQ 497
+L DV+ E K G V +KV E G V + F D A K ++ +NGR+FGGRQ
Sbjct: 542 WDLDLAEDVKGEVESKYGKVKRIKV-EKMSAGEVYIEFIDTDSAIKAVKGLNGRFFGGRQ 600
Query: 498 I 498
+
Sbjct: 601 L 601
>gi|118788821|ref|XP_317010.3| AGAP008433-PA [Anopheles gambiae str. PEST]
gi|116122929|gb|EAA12873.4| AGAP008433-PA [Anopheles gambiae str. PEST]
Length = 526
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
+L MF PA + N E++ DV EEC K G V V V + P G V V+ A
Sbjct: 422 LLSNMFDPA-TETNPNWDLEIQDDVIEECNKHGGVQHVYVDKQSPSGNVYVKCPSIATAV 480
Query: 483 KCIELMNGRWFGGRQIHAS 501
+ ++GRWF GR I A+
Sbjct: 481 LAVNALHGRWFAGRVIGAA 499
>gi|307172750|gb|EFN64036.1| Splicing factor U2af 38 kDa subunit [Camponotus floridanus]
gi|307174220|gb|EFN64865.1| Splicing factor U2af 38 kDa subunit [Camponotus floridanus]
Length = 240
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 446 DVQEECV-KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
DV EC K G ++ + VC+N H G V ++F+ +DA+K + +N RWFGGR ++A
Sbjct: 86 DVFIECEDKYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVNDLNNRWFGGRPVYA 144
>gi|124513200|ref|XP_001349956.1| splicing factor, putative [Plasmodium falciparum 3D7]
gi|23615373|emb|CAD52364.1| splicing factor, putative [Plasmodium falciparum 3D7]
Length = 864
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/81 (27%), Positives = 48/81 (59%), Gaps = 2/81 (2%)
Query: 422 VILRFMFTPAE--MRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRK 479
++L MF+ + + +D + +++ DV+EEC K G V ++ + + G + +++ +
Sbjct: 769 LVLSNMFSSNDENIGSDPDFFNDILEDVKEECSKYGKVVNIWLDTKNIDGKIYIKYSNND 828
Query: 480 DAQKCIELMNGRWFGGRQIHA 500
++ K + +NGR+FGG I+A
Sbjct: 829 ESLKSFQFLNGRYFGGSLINA 849
>gi|78708720|gb|ABB47695.1| splicing factor, CC1-like family protein, expressed [Oryza sativa
Japonica Group]
gi|215765085|dbj|BAG86782.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 339
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + +++ DVQ+EC K G V+ + V +N G V + F A
Sbjct: 249 LLLKNMFDPA-VETDPDFDLDIKDDVQDECSKFGAVNHIFVDKNT-AGFVYLHFDSVAAA 306
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 307 TAAQRALHGRWFAGKMITAT 326
>gi|84996941|ref|XP_953192.1| RNA binding protein [Theileria annulata strain Ankara]
gi|65304188|emb|CAI76567.1| RNA binding protein, putative [Theileria annulata]
Length = 479
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 6/83 (7%)
Query: 435 ADENLRSELEADVQEECVKIGPVDSVKV--CENHPQGVVLVRFKDRKDAQKCIELMNGRW 492
DENL++E V++EC K G V SV + N+ V V F +DA + +N RW
Sbjct: 401 VDENLQNE----VKDECNKYGTVTSVYLHFSPNNDSLSVFVVFNTPQDADSAVRALNTRW 456
Query: 493 FGGRQIHASEDDGLVNHAAIRDL 515
F GRQI D + DL
Sbjct: 457 FNGRQIMCKTYDASAYFSGNYDL 479
>gi|226469292|emb|CAX70125.1| Splicing factor U2AF 35 kDa subunit [Schistosoma japonicum]
Length = 198
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 453 KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
K G ++ + VC+N H G V ++F+ +DA+K ++ +N RWFGGR +HA
Sbjct: 69 KYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVQDLNERWFGGRPVHA 119
>gi|410915935|ref|XP_003971442.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like isoform 1
[Takifugu rubripes]
Length = 637
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 64/131 (48%), Gaps = 15/131 (11%)
Query: 271 EKKEANKPPDSWF---ELKVNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIK 327
++K PPDS + + V T ++V +P D+ +E+V +F K G I + ++
Sbjct: 145 QRKYGGPPPDSVYTGSQPPVGTEIFVGKIPRDLFEDELVPLFEKAGPIWD--------LR 196
Query: 328 IYVDKETGMKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIA 387
+ +D +G+ +G A VT+ + + A +L + RP +I + ++ A F+
Sbjct: 197 LMMDPLSGLNRGYAFVTFCSKEAAQEAVKLCNNHEIRPGKQIGVCISVA----NNRLFVG 252
Query: 388 KQVDSKKKKKL 398
SK K+++
Sbjct: 253 SIPKSKTKEQI 263
>gi|400598297|gb|EJP66014.1| CC1-like family splicing factor [Beauveria bassiana ARSEF 2860]
Length = 556
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 417 TIPATVILRFMFTPAEMRADENLRSELEADVQEECV-KIGPVDSVKVCENHPQGVVLVRF 475
T V+L MF P E D+ ++ ELE DV++E K G V + V N +G + ++F
Sbjct: 456 TASRCVVLHNMFNPDEEEGDDWVK-ELEDDVRQEAEDKYGRVVHISVDPNS-KGDIYLKF 513
Query: 476 KDRKDAQKCIELMNGRWFGGRQIHAS 501
+ + + I +NGR+FGGR I AS
Sbjct: 514 EKVQGGENAIRGLNGRYFGGRMIDAS 539
>gi|389610875|dbj|BAM19048.1| U2 small nuclear riboprotein auxiliary factor 50 [Papilio polytes]
Length = 422
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 31/224 (13%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+++ GLP+ + +++ E+ G ++ + D TG+ KG A Y+
Sbjct: 211 IFIGGLPNYLNEDQVKELLMSFGQLRA--------FNLVKDSSTGLSKGYAFAEYV---D 259
Query: 351 VALATQLLDGTPFRPDGKIPMSVTQAKFEQKGERF--IAKQVDSKKKKKLKKVEEKMLGW 408
+++ Q + G G + V +A K + + L+ + G
Sbjct: 260 ISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLGVYIQSMTGAAPVTLQVAGLTLAGA 319
Query: 409 GGRDDAKLTIPATVILRF--MFTPAEMRADENLRSELEADVQEECVKIGPVDSVKV---- 462
G PAT +L M TP E+R +E LE D++EEC K G V S+++
Sbjct: 320 G---------PATEVLCLLNMVTPDELRDEEEYEDILE-DIKEECNKYGCVRSIEIPRPL 369
Query: 463 --CENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
+ G V V F D QK + + GR F R + S D
Sbjct: 370 EGVDVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYYD 413
>gi|383864352|ref|XP_003707643.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Megachile
rotundata]
Length = 530
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
+L MF P + + N E+ DV EEC K G V V V + PQG V V+ A
Sbjct: 423 MLSNMFDP-QNETNPNWAKEIRDDVIEECNKHGGVLHVYVDQASPQGNVYVKCPSIATAV 481
Query: 483 KCIELMNGRWFGGRQIHAS 501
+ ++GRWF GR I A+
Sbjct: 482 AAVNSLHGRWFAGRVITAA 500
>gi|291245052|ref|XP_002742405.1| PREDICTED: MGC81970 protein-like isoform 2 [Saccoglossus
kowalevskii]
Length = 444
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 442 ELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHAS 501
E++ DV EEC K G + V + +P GVV V+ + AQ + ++GRWF G+ I A+
Sbjct: 357 EIQDDVIEECNKHGGCVHIYVDKTNPLGVVYVKCATVQIAQMAVNALHGRWFAGKMITAA 416
Query: 502 EDDGLVNHAAI 512
L N+ A+
Sbjct: 417 YVP-LANYHAL 426
>gi|195489053|ref|XP_002092574.1| GE11595 [Drosophila yakuba]
gi|194178675|gb|EDW92286.1| GE11595 [Drosophila yakuba]
Length = 437
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 97/225 (43%), Gaps = 41/225 (18%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
+YV GLP + +++ ++ G +K + + D T + KG A Y+ +PS
Sbjct: 233 IYVGGLPTCLDQDQIKDLLQSFGELKG--------LNLVKDINTSLSKGFAFFEYI-DPS 283
Query: 351 VA-LATQLLDGTPFRPDGKIPMSVTQAKFEQKGERFIAKQVDSKKKKKLKKVEEKMLGWG 409
V A L G Q G+R + Q K V++ ++
Sbjct: 284 VTDHAIAGLHGM------------------QLGDRRLVVQRSIPGGKNGLSVQQPIVQVP 325
Query: 410 GRDDAKLTIPAT----VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCEN 465
G + L P + + L M P E+ +E ++ +D+++EC K G V S+K+
Sbjct: 326 G--ISTLLDPGSPTEILCLLNMVLPEELLDNEEF-EDIRSDIKQECAKYGDVRSIKIPRP 382
Query: 466 HPQ------GVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDD 504
Q G V V+F+ +D+QK ++ ++GR F GR + S D
Sbjct: 383 VGQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSFYD 427
>gi|451850761|gb|EMD64062.1| hypothetical protein COCSADRAFT_199489 [Cochliobolus sativus
ND90Pr]
Length = 211
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 446 DVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
D+ E + G V+ + VC+N H G V VRFK +DAQK + +N RW+ GR I+
Sbjct: 78 DIWCELCQYGLVEELVVCDNNNDHLIGNVYVRFKYEEDAQKACDALNSRWYAGRPIYC 135
>gi|330940524|ref|XP_003305953.1| hypothetical protein PTT_18947 [Pyrenophora teres f. teres 0-1]
gi|311316782|gb|EFQ85946.1| hypothetical protein PTT_18947 [Pyrenophora teres f. teres 0-1]
Length = 214
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 446 DVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
D+ E + G V+ + VC+N H G V VRFK +DAQK + +N RW+ GR I+
Sbjct: 78 DIWCELCQYGLVEELVVCDNNNDHLIGNVYVRFKYEEDAQKACDALNSRWYAGRPIYC 135
>gi|313224119|emb|CBY43585.1| unnamed protein product [Oikopleura dioica]
Length = 317
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 291 VYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETGMKKGDALVTYLKEPS 350
V++ GLP+DVT E ++ F + G IK + R+ ++++++T K G A VTY K
Sbjct: 130 VFIEGLPNDVTKEALLSRFKQAGNIKVTKGVE--RVFLFLERKTKRKTGCATVTYFKSED 187
Query: 351 VALATQLLDGTPF 363
+ A +L D + F
Sbjct: 188 ASCAIELFDRSEF 200
>gi|428182175|gb|EKX51036.1| hypothetical protein GUITHDRAFT_134574 [Guillardia theta CCMP2712]
Length = 458
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 10/85 (11%)
Query: 421 TVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGV----VLVRFK 476
V+L+ M P E++ LE ++ EEC K G V+ V + QG V V F
Sbjct: 366 VVLLKNMVEP------EDVDPLLEQEIAEECSKFGKVNKVLIVTMVEQGSRLVKVFVEFG 419
Query: 477 DRKDAQKCIELMNGRWFGGRQIHAS 501
D++ A K + ++ RWFGG+ ++AS
Sbjct: 420 DQEAATKAVARLDKRWFGGKIVNAS 444
>gi|307180960|gb|EFN68748.1| RNA-binding protein 39 [Camponotus floridanus]
Length = 529
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
+L MF P + + N E+ DV EEC K G V V V + PQG V V+ A
Sbjct: 422 MLSNMFDP-QNETNPNWAKEIRDDVIEECNKHGGVLHVYVDQASPQGNVYVKCPSIATAV 480
Query: 483 KCIELMNGRWFGGRQIHAS 501
+ ++GRWF GR I A+
Sbjct: 481 AAVNSLHGRWFAGRVITAA 499
>gi|78708721|gb|ABB47696.1| splicing factor, CC1-like family protein, expressed [Oryza sativa
Japonica Group]
gi|215695222|dbj|BAG90413.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 320
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + +++ DVQ+EC K G V+ + V +N G V + F A
Sbjct: 230 LLLKNMFDPA-VETDPDFDLDIKDDVQDECSKFGAVNHIFVDKNT-AGFVYLHFDSVAAA 287
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 288 TAAQRALHGRWFAGKMITAT 307
>gi|222612890|gb|EEE51022.1| hypothetical protein OsJ_31660 [Oryza sativa Japonica Group]
Length = 548
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 2/80 (2%)
Query: 422 VILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDA 481
++L+ MF PA + D + +++ DVQ+EC K G V+ + V +N G V + F A
Sbjct: 458 LLLKNMFDPA-VETDPDFDLDIKDDVQDECSKFGAVNHIFVDKNT-AGFVYLHFDSVAAA 515
Query: 482 QKCIELMNGRWFGGRQIHAS 501
++GRWF G+ I A+
Sbjct: 516 TAAQRALHGRWFAGKMITAT 535
>gi|193683845|ref|XP_001950620.1| PREDICTED: heterogeneous nuclear ribonucleoprotein Q-like
[Acyrthosiphon pisum]
Length = 533
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 10/99 (10%)
Query: 278 PPDSWFELK--VNTHVYVTGLPDDVTVEEMVEVFSKCGIIKEDPETKKPRIKIYVDKETG 335
PP W + K V+ LP+ V +E++ +F KCG I + +++ +D +G
Sbjct: 146 PPPDWTDPKPTAGCEVFCGKLPNTVFEDELIPLFEKCGKIYD--------LRLMMDPLSG 197
Query: 336 MKKGDALVTYLKEPSVALATQLLDGTPFRPDGKIPMSVT 374
+G A VTY + AT LDG +P I ++++
Sbjct: 198 TNRGYAFVTYTTKEEAERATVELDGYEIKPGKNIKVNIS 236
>gi|156543322|ref|XP_001606160.1| PREDICTED: splicing factor U2af 38 kDa subunit-like [Nasonia
vitripennis]
Length = 242
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 446 DVQEECV-KIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
DV EC K G ++ + VC+N H G V ++F+ +DA+K + +N RWFGGR ++A
Sbjct: 86 DVFVECEDKYGEIEEMNVCDNLGDHLVGNVYIKFRREEDAEKAVNDLNNRWFGGRPVYA 144
>gi|189209401|ref|XP_001941033.1| splicing factor U2af 38 kDa subunit [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977126|gb|EDU43752.1| splicing factor U2af 38 kDa subunit [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 214
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 446 DVQEECVKIGPVDSVKVCEN---HPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHA 500
D+ E + G V+ + VC+N H G V VRFK +DAQK + +N RW+ GR I+
Sbjct: 78 DIWCELCQYGLVEELVVCDNNNDHLIGNVYVRFKYEEDAQKACDALNSRWYAGRPIYC 135
>gi|341895702|gb|EGT51637.1| CBN-RNP-6 protein [Caenorhabditis brenneri]
Length = 757
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 9/94 (9%)
Query: 420 ATVILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVV--LVRFKD 477
+ ++LR M TP +++ LE +++EEC K G V V + GVV V++ D
Sbjct: 666 SVIVLRNMVTP------DDIDEYLEGEIREECGKFGTVLDVVIANFASSGVVKIFVKYAD 719
Query: 478 RKDAQKCIELMNGRWFGGRQIHA-SEDDGLVNHA 510
+ ++GR+FGG + A + D L +HA
Sbjct: 720 SMQVDRAKAALDGRFFGGNIVKAEAYDQILFDHA 753
>gi|350410158|ref|XP_003488966.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Bombus impatiens]
Length = 532
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 423 ILRFMFTPAEMRADENLRSELEADVQEECVKIGPVDSVKVCENHPQGVVLVRFKDRKDAQ 482
+L MF P + + N E+ DV EEC K G V V V + PQG V V+ A
Sbjct: 425 MLSNMFDP-QNETNPNWAKEIRDDVIEECNKHGGVLHVYVDQASPQGNVYVKCPSIATAV 483
Query: 483 KCIELMNGRWFGGRQIHAS 501
+ ++GRWF GR I A+
Sbjct: 484 AAVNSLHGRWFAGRVITAA 502
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.131 0.378
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,936,582,644
Number of Sequences: 23463169
Number of extensions: 412781812
Number of successful extensions: 1111646
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1173
Number of HSP's successfully gapped in prelim test: 3584
Number of HSP's that attempted gapping in prelim test: 1100312
Number of HSP's gapped (non-prelim): 10838
length of query: 532
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 385
effective length of database: 8,910,109,524
effective search space: 3430392166740
effective search space used: 3430392166740
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 79 (35.0 bits)